Query         psy6233
Match_columns 271
No_of_seqs    159 out of 301
Neff          4.8 
Searched_HMMs 46136
Date          Fri Aug 16 21:11:16 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy6233.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6233hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG3878|consensus              100.0  4E-101  8E-106  719.2  19.3  271    1-271   101-469 (469)
  2 PF13897 GOLD_2:  Golgi-dynamic 100.0 9.8E-56 2.1E-60  369.0  13.9  132  135-270     1-136 (136)
  3 KOG3287|consensus               97.8 0.00021 4.5E-09   65.1  10.5   87  129-267    34-126 (236)
  4 KOG1692|consensus               95.3   0.093   2E-06   47.3   8.2   33  238-270    81-113 (201)
  5 KOG1693|consensus               93.8    0.48   1E-05   43.0   9.1   30  238-267    85-114 (209)
  6 PF01105 EMP24_GP25L:  emp24/gp  84.0    0.32   7E-06   40.3   0.0   32  238-269    66-98  (183)
  7 KOG0163|consensus               80.8     9.5 0.00021   41.3   9.2   30   57-86    965-994 (1259)
  8 KOG1029|consensus               78.1     4.5 9.7E-05   43.6   5.8    8   73-80    418-425 (1118)
  9 smart00110 C1Q Complement comp  74.2     2.7 5.8E-05   35.3   2.5   24  229-252    35-58  (135)
 10 KOG0817|consensus               67.6     3.6 7.7E-05   35.4   1.9   22    1-22     68-89  (142)
 11 PF00386 C1q:  C1q domain;  Int  65.9     7.5 0.00016   31.2   3.4   24  229-252    29-52  (127)
 12 PTZ00266 NIMA-related protein   58.5      22 0.00048   39.5   6.3   10   18-27    423-432 (1021)
 13 PF09394 Inhibitor_I42:  Chagas  53.7      93   0.002   23.4  10.4   34  133-167     1-34  (92)
 14 PF15236 CCDC66:  Coiled-coil d  51.6      79  0.0017   27.8   7.4   27   63-89    102-129 (157)
 15 KOG1029|consensus               50.6      53  0.0012   35.9   7.3   18   26-43    338-355 (1118)
 16 PF01117 Aerolysin:  Aerolysin   41.6      33  0.0007   33.8   3.9   44  131-174   185-229 (360)
 17 KOG2891|consensus               41.0      65  0.0014   31.3   5.7   17   53-69    391-407 (445)
 18 KOG2412|consensus               39.8      64  0.0014   33.7   5.8   12    9-20    200-211 (591)
 19 KOG0163|consensus               39.3 1.3E+02  0.0029   33.0   8.1    8  245-252  1170-1177(1259)
 20 cd05840 SPBC215_ISWI_like The   34.3      22 0.00048   28.1   1.3   23  242-264    38-62  (93)
 21 KOG3006|consensus               33.8      27 0.00058   29.7   1.7   20  243-263    76-95  (132)
 22 TIGR03769 P_ac_wall_RPT actino  33.2      31 0.00068   23.5   1.7   16  238-253     6-21  (41)
 23 PF06868 DUF1257:  Protein of u  32.7      61  0.0013   26.3   3.6   30  131-170    15-44  (105)
 24 CHL00193 ycf35 Ycf35; Provisio  31.2      83  0.0018   26.5   4.3   29  131-169    33-61  (128)
 25 COG3081 Nucleoid-associated pr  29.3 1.1E+02  0.0024   29.5   5.2   69   71-163   246-314 (335)
 26 TIGR02568 LcrE type III secret  29.1 1.8E+02   0.004   26.5   6.5   36   68-104    62-99  (240)
 27 PLN03086 PRLI-interacting fact  27.8 1.6E+02  0.0035   30.8   6.5   20  138-157   171-190 (567)
 28 PF13540 RCC1_2:  Regulator of   27.7      86  0.0019   19.4   2.9   24  132-157     1-24  (30)
 29 PF07946 DUF1682:  Protein of u  27.6      77  0.0017   30.2   3.9   11    7-17    237-247 (321)
 30 PF00855 PWWP:  PWWP domain;  I  26.8      50  0.0011   24.5   2.0   25  241-265    31-57  (86)
 31 smart00293 PWWP domain with co  26.6      55  0.0012   23.7   2.1   22  243-264    37-60  (63)
 32 KOG3054|consensus               25.8 1.5E+02  0.0033   28.2   5.4   16   54-69    139-154 (299)
 33 cd05838 WHSC1_related The PWWP  25.1      44 0.00095   26.4   1.5   23  242-264    36-60  (95)
 34 KOG1144|consensus               25.1 1.3E+02  0.0028   33.2   5.2    7   74-80    290-296 (1064)
 35 cd05162 PWWP The PWWP domain,   24.3      58  0.0013   24.6   2.0   22  243-264    36-59  (87)
 36 PRK09510 tolA cell envelope in  24.0 2.4E+02  0.0052   28.2   6.7   18  137-154   324-341 (387)
 37 PHA03051 Hypothetical protein;  23.9      38 0.00081   26.4   0.9   14   15-28     33-46  (88)
 38 PF09805 Nop25:  Nucleolar prot  23.6 1.8E+02  0.0038   24.6   5.0    9   22-30     21-30  (137)
 39 PF02337 Gag_p10:  Retroviral G  23.6      70  0.0015   25.6   2.4   18    5-22     28-45  (90)
 40 KOG2357|consensus               23.4 2.2E+02  0.0047   28.9   6.2   17   14-30    365-381 (440)
 41 PLN03086 PRLI-interacting fact  22.4 1.9E+02   0.004   30.4   5.8   13   26-38      8-20  (567)
 42 PF03531 SSrecog:  Structure-sp  21.6 1.3E+02  0.0029   27.6   4.1   36  235-270   139-174 (222)
 43 KOG2701|consensus               21.2 2.4E+02  0.0052   29.9   6.2   27    4-30    501-527 (608)
 44 KOG2880|consensus               21.2 2.1E+02  0.0045   28.7   5.5   34    9-43     60-93  (424)

No 1  
>KOG3878|consensus
Probab=100.00  E-value=3.7e-101  Score=719.20  Aligned_cols=271  Identities=58%  Similarity=0.991  Sum_probs=244.7

Q ss_pred             CChHHHHHHHHHHHhhhCCCchhHHHHHHhhHHHHHHH--HHHHHHHHHHhhhhhhcccccchhhhHHHHHHHHHHHHHh
Q psy6233           1 MSSKDAMSNFIDLLDLKCILFKPYIDAHVAQKEEENRK--AREQIEREEKVKSEQVNGNEVSQETSIEEKKRRLIQDALN   78 (271)
Q Consensus         1 ~s~~~am~~fi~ll~~~cp~f~~~v~a~~~~~ee~~~~--~~~~~~~~~~~~~~~~~~~~~~~e~~~~e~~~~qi~~aln   78 (271)
                      ||+++||.+||+|||+|||+|+|||++||++|+|..+.  .+..+++.++.++++..+++.++|+++.|+||+|||+|||
T Consensus       101 ~sre~AM~~FV~Lldr~C~~F~~yia~~k~~kde~lkE~e~r~~ee~~e~~~lQe~~qr~l~ee~~~~E~Qr~Qiq~ALN  180 (469)
T KOG3878|consen  101 ISREQAMEGFVDLLDRMCSAFRPYIAAVKQDKDETLKEKELRLMEEKKEARELQENAQRELLEEGYKEELQRRQIQDALN  180 (469)
T ss_pred             ccHHHHHHHHHHHHHhcchhhhhHHHHhhhhhhhHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhhHHHHHHHHHHHHh
Confidence            79999999999999999999999999999999998763  4444555555556666678888999999999999999999


Q ss_pred             hhhHHHHHHHHHhcCCCChhhhhhhh-h----------------------------------------------------
Q psy6233          79 QQTYAQFRAYAEDQFPGNPEQQAVLV-Q----------------------------------------------------  105 (271)
Q Consensus        79 ~qt~~qf~~ya~~q~P~~~eqq~~li-~----------------------------------------------------  105 (271)
                      .|||+||++||++||||||+||++|| |                                                    
T Consensus       181 ~QT~~QF~~YA~~QfPGnpEQQ~vLIrQLQeqHYqQYMqQly~~~~aQ~q~~~Q~~~~v~~s~s~lPn~~e~n~~~p~~~  260 (469)
T KOG3878|consen  181 KQTYQQFKLYAEKQFPGNPEQQAVLIRQLQEQHYQQYMQQLYLQNQAQNQNGHQEAEHVPGSSSDLPNAMEGNKLGPVET  260 (469)
T ss_pred             HHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccCcccCCCCchhhhcccCchhH
Confidence            99999999999999999999999999 1                                                    


Q ss_pred             ------------------------------------ccCCCcccccchhhhHHhhhccCCceeEEEccCceEEEEEEeec
Q psy6233         106 ------------------------------------IGPANMWTLTNIKSFKEHIRKEGGDAIIKIGHGETVTVRVPTHE  149 (271)
Q Consensus       106 ------------------------------------~vP~s~wt~~~i~~fk~~i~~~~~~~~i~V~~Get~tV~V~v~~  149 (271)
                                                          +.|+||||||+|.+||.+|..++++++|+||||+++|||||||.
T Consensus       261 ~shq~qa~~~dds~e~~le~v~~eeagE~~p~s~p~I~pas~WtRpdI~~FK~~i~~d~~~gvItvGhGetVTVRVPThe  340 (469)
T KOG3878|consen  261 QSHQQQAEQQDDSVEQTLEGVGQEEAGEEEPDSYPMICPASIWTRPDIEQFKTEISADDGDGVITVGHGETVTVRVPTHE  340 (469)
T ss_pred             HHHHhhhhccCCchhhhccccchhhhcccCCcccceecchhhcCcccHHHHHHHhccCCCCCeEEecCCceEEEeccccC
Confidence                                                89999999999999999999998889999999999999999999


Q ss_pred             CCCEEEEEEEeeCccEEEEEEEEecCCCCCcceeccccCccccccC-----cCCcCCCCCCCCCccc--cccCCCCceee
Q psy6233         150 GGSCLYWEFCTDHYDLGFGIYFEWNKSPTNQVSVHVSETDDEMISD-----KDTESGRTGSQPRKTT--KIVNRSPFSII  222 (271)
Q Consensus       150 ~Gs~L~WeF~Td~yDIGFGV~fe~~~~~s~~vsv~v~eS~d~~~~~-----~~~e~g~~~~~~~~~~--~~~~~~~~~eI  222 (271)
                      .|+||||+|+||+|||||||||+|+++.+|+||||||+|+||+|..     |+++.|..+.+.++.|  ..++++.+++|
T Consensus       341 nGsclFWEFATD~YDIGFGvYFEWt~~~~n~VsVHVSeSddded~~~~~e~E~~e~G~~~~E~gA~~n~~~anKp~~deI  420 (469)
T KOG3878|consen  341 NGSCLFWEFATDSYDIGFGVYFEWTKPVTNEVSVHVSESDDDEDCVYLSETEDLESGSLSQERGAVNNPTAANKPPIDEI  420 (469)
T ss_pred             CCceEEEEeccccccccceEEEEeecCCCceeEEEecccccchhhhhhhhhhhhhcCCchhhhhhhcCCCCCCCCCcccc
Confidence            9999999999999999999999999999999999999999988644     6666663211122211  24789999999


Q ss_pred             eeCeeeccccceecceEEeccceEEEEEEecCccceeeceeEEEEEEeC
Q psy6233         223 SPIMRRDCHQEVYAGSHCYPGEGVYLLKFDNSYSLWRSKTLYYRVYYTR  271 (271)
Q Consensus       223 vP~~R~~sh~~~~~Gs~~~~~~GvY~L~FDNSYS~~RsK~L~Y~V~~t~  271 (271)
                      ||+||++||.++|+|||.|||.|+|+|+||||||+||||+|||||||||
T Consensus       421 vPvYRRdCheEVYaGSH~YPGrGvYLLKFDNSYSlWRsKtlYYRVYY~r  469 (469)
T KOG3878|consen  421 VPVYRRDCHEEVYAGSHSYPGRGVYLLKFDNSYSLWRSKTLYYRVYYER  469 (469)
T ss_pred             cchhhhhhhHHhhcccccCCCCceEEEEecchhhhhcccceEEEEEecC
Confidence            9999999999999999999999999999999999999999999999997


No 2  
>PF13897 GOLD_2:  Golgi-dynamics membrane-trafficking
Probab=100.00  E-value=9.8e-56  Score=369.04  Aligned_cols=132  Identities=60%  Similarity=1.067  Sum_probs=118.0

Q ss_pred             EccCceEEEEEEeecCCCEEEEEEEeeCccEEEEEEEEecCCCCCcceeccccCccccccCcCCc----CCCCCCCCCcc
Q psy6233         135 IGHGETVTVRVPTHEGGSCLYWEFCTDHYDLGFGIYFEWNKSPTNQVSVHVSETDDEMISDKDTE----SGRTGSQPRKT  210 (271)
Q Consensus       135 V~~Get~tV~V~v~~~Gs~L~WeF~Td~yDIGFGV~fe~~~~~s~~vsv~v~eS~d~~~~~~~~e----~g~~~~~~~~~  210 (271)
                      |+||+++|||||+|++|++|+|||+||+|||||||||+|+++++++|||||++|+|+||++++.+    .|.  .+.  |
T Consensus         1 v~~Ge~~tvrVpt~~~G~~l~WeFaTd~yDIgFG~~few~~~~s~~vsv~vses~de~~~~~~~~~~~~~~~--ve~--g   76 (136)
T PF13897_consen    1 VGRGETVTVRVPTHPEGKCLFWEFATDSYDIGFGVYFEWTPPTSNQVSVHVSESSDEEDEEEEEEEDSESGD--VEK--G   76 (136)
T ss_pred             CCcCcEEEEEcccCCCCCEEEEEEeeCCCCceEEEEEEecCCCCCceeEecccCccccccccccccccCccc--hhc--c
Confidence            78999999999999999999999999999999999999999999999999999999886664333    121  111  1


Q ss_pred             ccccCCCCceeeeeCeeeccccceecceEEeccceEEEEEEecCccceeeceeEEEEEEe
Q psy6233         211 TKIVNRSPFSIISPIMRRDCHQEVYAGSHCYPGEGVYLLKFDNSYSLWRSKTLYYRVYYT  270 (271)
Q Consensus       211 ~~~~~~~~~~eIvP~~R~~sh~~~~~Gs~~~~~~GvY~L~FDNSYS~~RsK~L~Y~V~~t  270 (271)
                      .....++.+.+|||++|++||++|++|+|.|++||+|+|+|||||||||||+|+|+|+||
T Consensus        77 s~~~~~~~~~eviPv~R~dsH~~~~~Gs~~c~~~GvYvLkFDNSYS~~rsK~l~Y~V~yt  136 (136)
T PF13897_consen   77 SERSSRPEMDEVIPVYRRDSHLEVEAGSHTCPGPGVYVLKFDNSYSWFRSKKLYYRVYYT  136 (136)
T ss_pred             ccccCCCCeeEEeEeeeeecCcceeceEEECCCCeEEEEEeeCcceeEEeeEEEEEEEeC
Confidence            134567889999999999999999999999999999999999999999999999999998


No 3  
>KOG3287|consensus
Probab=97.79  E-value=0.00021  Score=65.12  Aligned_cols=87  Identities=26%  Similarity=0.439  Sum_probs=64.7

Q ss_pred             CceeEEEccCceEEEEEEeecCCCEEEEEEE-eeC---ccEEEEEEEEecCCCCCcceeccccCccccccCcCCcCCCCC
Q psy6233         129 GDAIIKIGHGETVTVRVPTHEGGSCLYWEFC-TDH---YDLGFGIYFEWNKSPTNQVSVHVSETDDEMISDKDTESGRTG  204 (271)
Q Consensus       129 ~~~~i~V~~Get~tV~V~v~~~Gs~L~WeF~-Td~---yDIGFGV~fe~~~~~s~~vsv~v~eS~d~~~~~~~~e~g~~~  204 (271)
                      ++-.+.|.+|++--.-=|++ .|..|.-||. +++   -||+|-+.-    |    .                   |   
T Consensus        34 ~dftv~ipAGk~eCf~Q~v~-~~~tle~eyQVi~G~GDl~i~Ftl~~----P----~-------------------G---   82 (236)
T KOG3287|consen   34 YDFTVMIPAGKTECFYQPVP-QGATLEVEYQVIDGAGDLDIDFTLLN----P----A-------------------G---   82 (236)
T ss_pred             cceEEEecCCCceeeeeecc-CCeEEEEEEEEEecCCccceeeEEeC----C----C-------------------c---
Confidence            55688899999888777776 6688888887 454   477777761    1    0                   1   


Q ss_pred             CCCCccccccCCCCceeeeeCeeeccccceecceEEe--ccceEEEEEEecCccceeeceeEEEE
Q psy6233         205 SQPRKTTKIVNRSPFSIISPIMRRDCHQEVYAGSHCY--PGEGVYLLKFDNSYSLWRSKTLYYRV  267 (271)
Q Consensus       205 ~~~~~~~~~~~~~~~~eIvP~~R~~sh~~~~~Gs~~~--~~~GvY~L~FDNSYS~~RsK~L~Y~V  267 (271)
                                     ..+++-+|-      ..|.|..  .++|-|.|.||||||.+.+|.|++-+
T Consensus        83 ---------------~~lv~~q~k------~dg~ht~e~~e~GdY~~CfDNsFS~fs~K~Vffel  126 (236)
T KOG3287|consen   83 ---------------EVLVSDQRK------VDGVHTVEVTETGDYQVCFDNSFSTFSRKLVFFEL  126 (236)
T ss_pred             ---------------cEEeecccc------cCceeEeeccCCcceEEEEcCccccccceEEEEEE
Confidence                           255555543      2466664  58999999999999999999999987


No 4  
>KOG1692|consensus
Probab=95.32  E-value=0.093  Score=47.31  Aligned_cols=33  Identities=24%  Similarity=0.414  Sum_probs=30.2

Q ss_pred             eEEeccceEEEEEEecCccceeeceeEEEEEEe
Q psy6233         238 SHCYPGEGVYLLKFDNSYSLWRSKTLYYRVYYT  270 (271)
Q Consensus       238 s~~~~~~GvY~L~FDNSYS~~RsK~L~Y~V~~t  270 (271)
                      ++.+|.+|+|.+.|+|.+|=+.+|+|-+.|.+.
T Consensus        81 tF~a~~~G~Y~fCF~N~~s~mtpk~V~F~ihvg  113 (201)
T KOG1692|consen   81 TFTAPKKGTYTFCFSNKMSTMTPKTVMFTIHVG  113 (201)
T ss_pred             EEEecCCceEEEEecCCCCCCCceEEEEEEEEe
Confidence            456899999999999999999999999999864


No 5  
>KOG1693|consensus
Probab=93.79  E-value=0.48  Score=43.05  Aligned_cols=30  Identities=27%  Similarity=0.479  Sum_probs=26.0

Q ss_pred             eEEeccceEEEEEEecCccceeeceeEEEE
Q psy6233         238 SHCYPGEGVYLLKFDNSYSLWRSKTLYYRV  267 (271)
Q Consensus       238 s~~~~~~GvY~L~FDNSYS~~RsK~L~Y~V  267 (271)
                      .++..+.|+|.+.|+|.||-+..|.++...
T Consensus        85 ~f~ae~~G~Y~fCFsN~fstf~~Kiv~~~~  114 (209)
T KOG1693|consen   85 LFKAEGKGEYTFCFSNEFSTFSHKIVYMDF  114 (209)
T ss_pred             EEEEecceEEEEEecCccccccceEeeehh
Confidence            467788999999999999999999887654


No 6  
>PF01105 EMP24_GP25L:  emp24/gp25L/p24 family/GOLD;  InterPro: IPR009038  The GOLD (for Golgi dynamics) domain is a protein module found in several eukaryotic Golgi and lipid-traffic proteins. It is typically between 90 and 150 amino acids long. Most of the size difference observed in the GOLD-domain superfamily is traceable to a single large low-complexity insert that is seen in some versions of the domain. With the exception of the p24 proteins, which have a simple architecture with the GOLD domain as their only globular domain, all other GOLD-domain proteins contain additional conserved globular domains. In these proteins, the GOLD domain co-occurs with lipid-, sterol- or fatty acid-binding domains such as PH, CRAL-TRIO, FYVE oxysterol binding- and acyl CoA-binding domains, suggesting that these proteins may interact with membranes. The GOLD domain can also be found associated with a RUN domain, which may have a role in the interaction of various proteins with cytoskeletal filaments. The GOLD domain is predicted to mediate diverse protein-protein interactions []. A secondary structure prediction for the GOLD domain reveals that it is likely to adopt a compact all-beta-fold structure with six to seven strands. Most of the sequence conservation is centred on the hydrophobic cores that support these predicted strands. The predicted secondary-structure elements and the size of the conserved core of the domain suggests that it may form a beta- sandwich fold with the strands arranged in two beta sheets stacked on each other [].  Some proteins known to contain a GOLD domain are listed below:   Eukaryotic proteins of the p24 family.  Animal Sec14-like proteins. They are involved in secretion.  Human Golgi resident protein GCP60. It interacts with the Golgi integral membrane protein Giantin. Yeast oxysterol-binding protein homologue 3 (OSH3).  ; GO: 0006810 transport, 0016021 integral to membrane; PDB: 1P23_A 1M23_A.
Probab=83.96  E-value=0.32  Score=40.27  Aligned_cols=32  Identities=28%  Similarity=0.373  Sum_probs=0.0

Q ss_pred             eEEeccceEEEEEEecCccceee-ceeEEEEEE
Q psy6233         238 SHCYPGEGVYLLKFDNSYSLWRS-KTLYYRVYY  269 (271)
Q Consensus       238 s~~~~~~GvY~L~FDNSYS~~Rs-K~L~Y~V~~  269 (271)
                      ++..+++|.|.+.|+|+.+-+.. |++.+.+..
T Consensus        66 ~f~~~~~G~y~iCf~n~~~~~~~~~~v~~~~~~   98 (183)
T PF01105_consen   66 SFTAKESGEYQICFDNSSSSFSPSKRVSFDIDV   98 (183)
T ss_dssp             ---------------------------------
T ss_pred             EEEeccCCCEEEEEEcCCCCccccEEEEEEEEE
Confidence            35567999999999999888877 888877653


No 7  
>KOG0163|consensus
Probab=80.82  E-value=9.5  Score=41.30  Aligned_cols=30  Identities=13%  Similarity=0.178  Sum_probs=16.3

Q ss_pred             cccchhhhHHHHHHHHHHHHHhhhhHHHHH
Q psy6233          57 NEVSQETSIEEKKRRLIQDALNQQTYAQFR   86 (271)
Q Consensus        57 ~~~~~e~~~~e~~~~qi~~aln~qt~~qf~   86 (271)
                      +.+|++++++|.-.++|..++-.|...+=.
T Consensus       965 K~eEeqr~~qee~e~~l~~e~q~qla~e~e  994 (1259)
T KOG0163|consen  965 KAEEEQRKAQEEEERRLALELQEQLAKEAE  994 (1259)
T ss_pred             HHHHHHHHhhhhHHHHHHHHHHHHHHHHHH
Confidence            444455566555556666666655544433


No 8  
>KOG1029|consensus
Probab=78.05  E-value=4.5  Score=43.63  Aligned_cols=8  Identities=50%  Similarity=0.596  Sum_probs=4.0

Q ss_pred             HHHHHhhh
Q psy6233          73 IQDALNQQ   80 (271)
Q Consensus        73 i~~aln~q   80 (271)
                      +++-|||-
T Consensus       418 ~qem~~Qk  425 (1118)
T KOG1029|consen  418 RQEMLNQK  425 (1118)
T ss_pred             HHHHHhhh
Confidence            44555544


No 9  
>smart00110 C1Q Complement component C1q domain. Globular domain found in many collagens and eponymously in complement C1q. When part of full length proteins these domains form a 'bouquet' due to the multimerization of heterotrimers. The C1q fold is similar to that of tumour necrosis factor.
Probab=74.22  E-value=2.7  Score=35.27  Aligned_cols=24  Identities=25%  Similarity=0.524  Sum_probs=21.6

Q ss_pred             ccccceecceEEeccceEEEEEEe
Q psy6233         229 DCHQEVYAGSHCYPGEGVYLLKFD  252 (271)
Q Consensus       229 ~sh~~~~~Gs~~~~~~GvY~L~FD  252 (271)
                      ..|..+..|.++||-||+|.+.|.
T Consensus        35 g~~yd~~TG~Ftcpv~GvY~F~f~   58 (135)
T smart00110       35 QGHYDPRTGKFTCPVPGVYYFSYH   58 (135)
T ss_pred             CCCccCCCCEEECeeceEEEEEEE
Confidence            456788899999999999999997


No 10 
>KOG0817|consensus
Probab=67.59  E-value=3.6  Score=35.45  Aligned_cols=22  Identities=18%  Similarity=0.361  Sum_probs=19.6

Q ss_pred             CChHHHHHHHHHHHhhhCCCch
Q psy6233           1 MSSKDAMSNFIDLLDLKCILFK   22 (271)
Q Consensus         1 ~s~~~am~~fi~ll~~~cp~f~   22 (271)
                      ||+++||..||+++..++|-..
T Consensus        68 ~s~~eA~~~Yv~~~~~l~~~~~   89 (142)
T KOG0817|consen   68 MSKEEAMEAYVEKVEELIPKYG   89 (142)
T ss_pred             CCHHHHHHHHHHHHHHHHHHhh
Confidence            7999999999999999998543


No 11 
>PF00386 C1q:  C1q domain;  InterPro: IPR001073 This entry represents the C-terminal domain of C1q. C1q is a subunit of the C1 enzyme complex that activates the serum complement system. C1q comprises 6 A, 6 B and 6 C chains. These share the same topology, each possessing a small, globular N-terminal domain, a collagen-like Gly/Pro-rich central region, and a conserved C-terminal region, the C1q domain []. The C1q protein is produced in collagen-producing cells and shows sequence and structural similarity to collagens VIII and X [, ]. This domain is also found in multimerin and EMILIN proteins.; PDB: 1O91_C 2JG8_D 2JG9_A 2WNV_A 2WNU_A 1PK6_A 4DOU_A 1C3H_C 1C28_C 2OII_A ....
Probab=65.93  E-value=7.5  Score=31.25  Aligned_cols=24  Identities=25%  Similarity=0.394  Sum_probs=18.0

Q ss_pred             ccccceecceEEeccceEEEEEEe
Q psy6233         229 DCHQEVYAGSHCYPGEGVYLLKFD  252 (271)
Q Consensus       229 ~sh~~~~~Gs~~~~~~GvY~L~FD  252 (271)
                      ..+..+..|.++||.+|+|.+.|.
T Consensus        29 g~~yn~~tG~Ftap~~G~Y~F~~~   52 (127)
T PF00386_consen   29 GNAYNPSTGIFTAPVPGVYFFSFT   52 (127)
T ss_dssp             TS-EETTTTEEE-SS-EEEEEEEE
T ss_pred             CCccEeecCEEecCCCCEEEEEEE
Confidence            446778899999999999999874


No 12 
>PTZ00266 NIMA-related protein kinase; Provisional
Probab=58.48  E-value=22  Score=39.46  Aligned_cols=10  Identities=10%  Similarity=-0.004  Sum_probs=4.5

Q ss_pred             CCCchhHHHH
Q psy6233          18 CILFKPYIDA   27 (271)
Q Consensus        18 cp~f~~~v~a   27 (271)
                      |+.+..-++.
T Consensus       423 ~g~~g~r~ek  432 (1021)
T PTZ00266        423 NGHYGGRVDK  432 (1021)
T ss_pred             cCccccccch
Confidence            4444444433


No 13 
>PF09394 Inhibitor_I42:  Chagasin family peptidase inhibitor I42;  InterPro: IPR018990 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.   Chagasin reversible inhibitor of papain-like cysteine proteases []. Chagasin has a beta-barrel structure, which is a unique variant of the immunoglobulin fold with homology to human CD8alpha [, ].; PDB: 2NQD_A 2NNR_A 2H7W_B 3E1Z_A 3CBK_B 3CBJ_B 2OUL_B 2FO8_A 2WGN_B 2C34_A ....
Probab=53.74  E-value=93  Score=23.35  Aligned_cols=34  Identities=21%  Similarity=0.502  Sum_probs=26.8

Q ss_pred             EEEccCceEEEEEEeecCCCEEEEEEEeeCccEEE
Q psy6233         133 IKIGHGETVTVRVPTHEGGSCLYWEFCTDHYDLGF  167 (271)
Q Consensus       133 i~V~~Get~tV~V~v~~~Gs~L~WeF~Td~yDIGF  167 (271)
                      |+|..|+++.|.++-. +++-..|.+..+..-+.+
T Consensus         1 I~v~~g~~~~I~L~~n-pstGY~W~~~~~~~~l~l   34 (92)
T PF09394_consen    1 ITVKVGDTFEIELPEN-PSTGYSWSLSSDSDGLQL   34 (92)
T ss_dssp             -EEETTSEEEEEEEEB-CCGTBEEEECTSTTTEEE
T ss_pred             CeecCCCEEEEEECCC-CCCCeEEEEecCCCeEEE
Confidence            6799999999999976 777899999985544554


No 14 
>PF15236 CCDC66:  Coiled-coil domain-containing protein 66
Probab=51.57  E-value=79  Score=27.78  Aligned_cols=27  Identities=15%  Similarity=0.224  Sum_probs=11.4

Q ss_pred             hhHHHHHHHHHHH-HHhhhhHHHHHHHH
Q psy6233          63 TSIEEKKRRLIQD-ALNQQTYAQFRAYA   89 (271)
Q Consensus        63 ~~~~e~~~~qi~~-aln~qt~~qf~~ya   89 (271)
                      ++..|..++..++ +..+.|..-|...+
T Consensus       102 ~~E~E~~~~~~KEe~~~~k~~~l~e~~q  129 (157)
T PF15236_consen  102 QFEEEQRKQREKEEEQTRKTQELYEAMQ  129 (157)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3444444433332 44444444454433


No 15 
>KOG1029|consensus
Probab=50.57  E-value=53  Score=35.89  Aligned_cols=18  Identities=28%  Similarity=0.449  Sum_probs=7.4

Q ss_pred             HHHHhhHHHHHHHHHHHH
Q psy6233          26 DAHVAQKEEENRKAREQI   43 (271)
Q Consensus        26 ~a~~~~~ee~~~~~~~~~   43 (271)
                      +..++++|+.+++.++++
T Consensus       338 eqqqreree~eqkEreE~  355 (1118)
T KOG1029|consen  338 EQQQREREEVEQKEREEE  355 (1118)
T ss_pred             HHHHHHHHHHHHHHHHHH
Confidence            333444444444433333


No 16 
>PF01117 Aerolysin:  Aerolysin toxin;  InterPro: IPR005830  This family represents the pore forming lobe of aerolysin, and the related toxins haemolysin and the leukocidin S subunit.  Aerolysin [] is a cytolytic toxin exported by Aeromonas hydrophila, a Gram-negative bacterium associated with diarrhoeal diseases and deep wound infections []. The mature toxin binds to eukaryotic cells and aggregates to form holes (approximately 3 nm in diameter) leading to the destruction of the membrane permeability barrier and osmotic lysis. The structure of proaerolysin has been determined to 2.8A resolution and shows the protoxin to adopt a novel fold []. Images of an aerolysin oligomer derived from electron microscopy have helped to construct a model of the protein and to outline a mechanism by which it might insert into lipid bilayers to form ion channels [].; GO: 0009405 pathogenesis, 0005576 extracellular region; PDB: 3G4N_B 3C0N_B 3C0O_B 1PRE_A 1Z52_B 3C0M_A 3G4O_A.
Probab=41.56  E-value=33  Score=33.82  Aligned_cols=44  Identities=18%  Similarity=0.335  Sum_probs=36.4

Q ss_pred             eeEEEccCceEEEEEEeecCCCEEEEEEEee-CccEEEEEEEEec
Q psy6233         131 AIIKIGHGETVTVRVPTHEGGSCLYWEFCTD-HYDLGFGIYFEWN  174 (271)
Q Consensus       131 ~~i~V~~Get~tV~V~v~~~Gs~L~WeF~Td-~yDIGFGV~fe~~  174 (271)
                      ..++|+++..++|+|.+...-.-+-+.|.++ +|||.|-=|+.|.
T Consensus       185 ~~vtVPP~s~~~V~i~~~rs~i~iPYt~~~~i~Ydv~f~GFlRwg  229 (360)
T PF01117_consen  185 YRVTVPPHSKVPVRIELYRSTIDIPYTAKADISYDVEFTGFLRWG  229 (360)
T ss_dssp             E--EB-TTEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEB-TT
T ss_pred             EeeccCCCceEEEEEEEEeeEEEeeEEEEEEEEEEEEEeeeEeec
Confidence            4689999999999999988788899999987 9999999999986


No 17 
>KOG2891|consensus
Probab=41.03  E-value=65  Score=31.33  Aligned_cols=17  Identities=12%  Similarity=0.151  Sum_probs=10.0

Q ss_pred             hhcccccchhhhHHHHH
Q psy6233          53 QVNGNEVSQETSIEEKK   69 (271)
Q Consensus        53 ~~~~~~~~~e~~~~e~~   69 (271)
                      ++++++.++++|..|.|
T Consensus       391 eerrkqkeeeklk~e~q  407 (445)
T KOG2891|consen  391 EERRKQKEEEKLKAEEQ  407 (445)
T ss_pred             HHHHhhhHHHHHHHHHH
Confidence            34455556677766555


No 18 
>KOG2412|consensus
Probab=39.81  E-value=64  Score=33.67  Aligned_cols=12  Identities=8%  Similarity=-0.197  Sum_probs=5.2

Q ss_pred             HHHHHHhhhCCC
Q psy6233           9 NFIDLLDLKCIL   20 (271)
Q Consensus         9 ~fi~ll~~~cp~   20 (271)
                      .||.-+.+.|.+
T Consensus       200 ~~i~~~~q~~eq  211 (591)
T KOG2412|consen  200 QAIQREKQRKEQ  211 (591)
T ss_pred             HHHHHHHHHHHH
Confidence            344444444443


No 19 
>KOG0163|consensus
Probab=39.33  E-value=1.3e+02  Score=33.03  Aligned_cols=8  Identities=38%  Similarity=0.974  Sum_probs=3.3

Q ss_pred             eEEEEEEe
Q psy6233         245 GVYLLKFD  252 (271)
Q Consensus       245 GvY~L~FD  252 (271)
                      |.+...||
T Consensus      1170 gmWyaHFd 1177 (1259)
T KOG0163|consen 1170 GMWYAHFD 1177 (1259)
T ss_pred             ceEEEecC
Confidence            44444444


No 20 
>cd05840 SPBC215_ISWI_like The PWWP domain is a component of the S. pombe hypothetical protein SPBC215, as well as ISWI complex protein 4.  The ISWI (imitation switch) proteins are ATPases responsible for chromatin remodeling in eukaryotes, and SPBC215 is proposed to also bind chromatin.   The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding,  proteins that function as transcription factors regulating a variety of developmental processes.
Probab=34.33  E-value=22  Score=28.09  Aligned_cols=23  Identities=26%  Similarity=0.381  Sum_probs=19.0

Q ss_pred             ccceEEEEEE--ecCccceeeceeE
Q psy6233         242 PGEGVYLLKF--DNSYSLWRSKTLY  264 (271)
Q Consensus       242 ~~~GvY~L~F--DNSYS~~RsK~L~  264 (271)
                      .++|.|.+.|  |++|+|...+.|.
T Consensus        38 ~~~~~~~V~FFg~~~~~Wv~~~~l~   62 (93)
T cd05840          38 KNKRTYPVMFFPDGDYYWVPNKDLK   62 (93)
T ss_pred             CCCCeEEEEEeCCCcEEEEChhhcc
Confidence            4688899998  8999999887763


No 21 
>KOG3006|consensus
Probab=33.78  E-value=27  Score=29.73  Aligned_cols=20  Identities=25%  Similarity=0.481  Sum_probs=16.5

Q ss_pred             cceEEEEEEecCccceeecee
Q psy6233         243 GEGVYLLKFDNSYSLWRSKTL  263 (271)
Q Consensus       243 ~~GvY~L~FDNSYS~~RsK~L  263 (271)
                      .||+|.|.||-. ++|+++-|
T Consensus        76 ~~GtYr~~~dT~-~Y~~a~gv   95 (132)
T KOG3006|consen   76 IPGTYRLVFDTE-PYYKALGV   95 (132)
T ss_pred             ccceEEEEEecc-cccccCCc
Confidence            489999999976 77888764


No 22 
>TIGR03769 P_ac_wall_RPT actinobacterial surface-anchored protein domain. This model describes a repeat domain that one to three times in Actinobacterial proteins, some of which have LPXTG-type sortase recognition motifs for covalent attachment to the Gram-positive cell wall. Where it occurs with duplication in an LPXTG-anchored protein, it tends to be adjacent to the substrate-binding protein of the gene trio of an ABC transporter system, where that substrate-binding protein has a single copy of this same domain. This arrangement suggests a substrate-binding relay system, with the LPXTG protein acting as a substrate receptor.
Probab=33.22  E-value=31  Score=23.48  Aligned_cols=16  Identities=19%  Similarity=0.451  Sum_probs=12.5

Q ss_pred             eEEeccceEEEEEEec
Q psy6233         238 SHCYPGEGVYLLKFDN  253 (271)
Q Consensus       238 s~~~~~~GvY~L~FDN  253 (271)
                      ......||+|.|.|.=
T Consensus         6 nW~FT~PG~Y~l~~~a   21 (41)
T TIGR03769         6 NWVFTKPGTYTLTVQA   21 (41)
T ss_pred             ceeeCCCeEEEEEEEE
Confidence            3456789999999953


No 23 
>PF06868 DUF1257:  Protein of unknown function (DUF1257);  InterPro: IPR009666 This family represents Ycf35, which is encoded in algal chloroplast and in cyanobacteria. The function of these proteins are unknown. As the family is exclusively found in phototrophic organisms it may play a role in photosynthesis.
Probab=32.72  E-value=61  Score=26.27  Aligned_cols=30  Identities=20%  Similarity=0.344  Sum_probs=20.6

Q ss_pred             eeEEEccCceEEEEEEeecCCCEEEEEEEeeCccEEEEEE
Q psy6233         131 AIIKIGHGETVTVRVPTHEGGSCLYWEFCTDHYDLGFGIY  170 (271)
Q Consensus       131 ~~i~V~~Get~tV~V~v~~~Gs~L~WeF~Td~yDIGFGV~  170 (271)
                      ..|.-.+|.++.+.+-+..++          +|||||-.-
T Consensus        15 ~~vrgy~gq~~~aelvi~~~~----------~~DIGF~wn   44 (105)
T PF06868_consen   15 QPVRGYRGQTQKAELVIKQNN----------GYDIGFRWN   44 (105)
T ss_pred             eeccccCCCeEEEEEEEEcCC----------CccEEEEEC
Confidence            455566788888877776443          788887543


No 24 
>CHL00193 ycf35 Ycf35; Provisional
Probab=31.20  E-value=83  Score=26.52  Aligned_cols=29  Identities=28%  Similarity=0.475  Sum_probs=21.2

Q ss_pred             eeEEEccCceEEEEEEeecCCCEEEEEEEeeCccEEEEE
Q psy6233         131 AIIKIGHGETVTVRVPTHEGGSCLYWEFCTDHYDLGFGI  169 (271)
Q Consensus       131 ~~i~V~~Get~tV~V~v~~~Gs~L~WeF~Td~yDIGFGV  169 (271)
                      ..+...+|.++.+.+-+..+          .+|||||..
T Consensus        33 ~~vrgy~gq~~~a~lvi~~~----------~~~diGf~~   61 (128)
T CHL00193         33 QVIKGYNGQTHNADLVIKQS----------NNYDIGFVW   61 (128)
T ss_pred             ceeeccCCCeEEEEEEEEcC----------CCCceeEee
Confidence            46777888888877777655          377887765


No 25 
>COG3081 Nucleoid-associated protein [General function prediction only]
Probab=29.31  E-value=1.1e+02  Score=29.52  Aligned_cols=69  Identities=25%  Similarity=0.366  Sum_probs=47.1

Q ss_pred             HHHHHHHhhhhHHHHHHHHHhcCCCChhhhhhhhhccCCCcccccchhhhHHhhhccCCceeEEEccCceEEEEEEeecC
Q psy6233          71 RLIQDALNQQTYAQFRAYAEDQFPGNPEQQAVLVQIGPANMWTLTNIKSFKEHIRKEGGDAIIKIGHGETVTVRVPTHEG  150 (271)
Q Consensus        71 ~qi~~aln~qt~~qf~~ya~~q~P~~~eqq~~li~~vP~s~wt~~~i~~fk~~i~~~~~~~~i~V~~Get~tV~V~v~~~  150 (271)
                      .+-|+-||.+.+..|-  |++-|-=-+        .+||---|-.....|              +|+|+-++|.+..+-=
T Consensus       246 S~~lp~~~e~~F~~ft--~eq~YgLee--------s~PadrsTl~qltkf--------------~Gsg~GltisFdA~LL  301 (335)
T COG3081         246 SKELPGLNEVAFYDFT--AEKGYGLEE--------SFPADRSTLRQLTKF--------------AGSGGGLTISFDAMLL  301 (335)
T ss_pred             HhhCCCcCcchHHHHH--HhcccCCCc--------cCCccHHHHHHHHHh--------------cCCCCceEEeehhhhh
Confidence            4567778887665553  444554333        367654443333222              7899999999999989


Q ss_pred             CCEEEEEEEeeCc
Q psy6233         151 GSCLYWEFCTDHY  163 (271)
Q Consensus       151 Gs~L~WeF~Td~y  163 (271)
                      |+.++|.=+||.-
T Consensus       302 g~rvfwD~~tDtL  314 (335)
T COG3081         302 GERVFWDPATDTL  314 (335)
T ss_pred             CCeeeecccccee
Confidence            9999999999864


No 26 
>TIGR02568 LcrE type III secretion regulator YopN/LcrE/InvE/MxiC. This protein is found in type III secretion operons and, in Yersinia is localized to the cell surface and is involved in the Low-Calicium Response (LCR), possibly by sensing the calcium concentration. In Salmonella, the gene is known as InvE and is believed to perform an essential role in the secretion process and interacts with the proteins SipBCD and SicA.//Altered name to reflect regulatory role. Added GO and role IDs. Negative regulation of type III secretion in Y pestis is mediated in part by a multiprotein complex that has been proposed to act as a physical impediment to type III secretion by blocking the entrance to the secretion apparatus prior to contact with mammalian cells. This complex is composed of YopN, its heterodimeric secretion chaperone SycN-YscB, and TyeA. PubMed: 15701523
Probab=29.05  E-value=1.8e+02  Score=26.52  Aligned_cols=36  Identities=28%  Similarity=0.441  Sum_probs=28.7

Q ss_pred             HHHHHHHHHHhh--hhHHHHHHHHHhcCCCChhhhhhhh
Q psy6233          68 KKRRLIQDALNQ--QTYAQFRAYAEDQFPGNPEQQAVLV  104 (271)
Q Consensus        68 ~~~~qi~~aln~--qt~~qf~~ya~~q~P~~~eqq~~li  104 (271)
                      ++...++..|+.  ++..|+..|+.+.||+ |.+|-.++
T Consensus        62 ~k~~~l~~~l~~~~~~~~~ll~~l~~~f~D-~s~~~laL   99 (240)
T TIGR02568        62 QKLKELRAELSGQAAGLEQLLALARGAFPD-PSDQALAL   99 (240)
T ss_pred             HHHHHHHHHHcccCCCHHHHHHHHHhhCCC-hHHHHHHH
Confidence            455677777786  8999999999999995 77676666


No 27 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=27.81  E-value=1.6e+02  Score=30.83  Aligned_cols=20  Identities=20%  Similarity=0.018  Sum_probs=13.1

Q ss_pred             CceEEEEEEeecCCCEEEEE
Q psy6233         138 GETVTVRVPTHEGGSCLYWE  157 (271)
Q Consensus       138 Get~tV~V~v~~~Gs~L~We  157 (271)
                      |..++|..-.-+.|+.+.-.
T Consensus       171 ~~~v~v~~~~Lpkgt~vklq  190 (567)
T PLN03086        171 VPLVEVRYIWLPKGTYAKLQ  190 (567)
T ss_pred             CCeEEEEEeecCCCCEEEEe
Confidence            55677776666777776544


No 28 
>PF13540 RCC1_2:  Regulator of chromosome condensation (RCC1) repeat; PDB: 3QI0_D 1JTD_B 3QHY_B.
Probab=27.71  E-value=86  Score=19.42  Aligned_cols=24  Identities=21%  Similarity=0.509  Sum_probs=18.8

Q ss_pred             eEEEccCceEEEEEEeecCCCEEEEE
Q psy6233         132 IIKIGHGETVTVRVPTHEGGSCLYWE  157 (271)
Q Consensus       132 ~i~V~~Get~tV~V~v~~~Gs~L~We  157 (271)
                      ++.|..|..|++-+.  ..|.+..|-
T Consensus         1 V~~ia~G~~ht~al~--~~g~v~~wG   24 (30)
T PF13540_consen    1 VVQIACGGYHTCALT--SDGEVYCWG   24 (30)
T ss_dssp             EEEEEEESSEEEEEE---TTEEEEEE
T ss_pred             CEEEEecCCEEEEEE--cCCCEEEEc
Confidence            467889999998774  579999996


No 29 
>PF07946 DUF1682:  Protein of unknown function (DUF1682);  InterPro: IPR012879 The members of this family are all hypothetical eukaryotic proteins of unknown function. One member (Q920S6 from SWISSPROT) is described as being an adipocyte-specific protein, but no evidence of this was found. 
Probab=27.58  E-value=77  Score=30.19  Aligned_cols=11  Identities=18%  Similarity=0.169  Sum_probs=4.8

Q ss_pred             HHHHHHHHhhh
Q psy6233           7 MSNFIDLLDLK   17 (271)
Q Consensus         7 m~~fi~ll~~~   17 (271)
                      |.-++.|.|.+
T Consensus       237 ~~~v~~l~D~~  247 (321)
T PF07946_consen  237 LKLVFYLIDKL  247 (321)
T ss_pred             HHHHHHHHHHh
Confidence            33444444444


No 30 
>PF00855 PWWP:  PWWP domain;  InterPro: IPR000313 Upon characterisation of WHSC1, a gene mapping to the Wolf-Hirschhornsyndrome critical region and at its C terminus similar to the Drosophila melanogaster ASH1/trithorax group proteins, a novel protein domain designated PWWP domain was identified []. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. It is present in proteins of nuclear origin and plays a role in cell growth and differentiation. Due to its position, the composition of amino acids close to the PWWP motif and the pattern of other domains present it has been suggested that the domain is involved in protein-protein interactions [].; PDB: 3LYI_B 2L89_A 2NLU_A 1RI0_A 1KHC_A 3QKJ_C 2DAQ_A 1N27_A 3PFS_B 3QJ6_A ....
Probab=26.80  E-value=50  Score=24.48  Aligned_cols=25  Identities=24%  Similarity=0.448  Sum_probs=19.1

Q ss_pred             eccceEEEEEE--ecCccceeeceeEE
Q psy6233         241 YPGEGVYLLKF--DNSYSLWRSKTLYY  265 (271)
Q Consensus       241 ~~~~GvY~L~F--DNSYS~~RsK~L~Y  265 (271)
                      -..++.|++.|  ||+|+|...+.|..
T Consensus        31 ~~~~~~~~V~Ffg~~~~~wv~~~~i~~   57 (86)
T PF00855_consen   31 KRKDGHVLVRFFGDNDYAWVKPSNIKP   57 (86)
T ss_dssp             SSSSTEEEEEETTTTEEEEEEGGGEEE
T ss_pred             CCCCCEEEEEecCCCCEEEECHHHhhC
Confidence            45577778887  67899999988763


No 31 
>smart00293 PWWP domain with conserved PWWP motif. conservation of Pro-Trp-Trp-Pro residues
Probab=26.57  E-value=55  Score=23.65  Aligned_cols=22  Identities=27%  Similarity=0.407  Sum_probs=17.3

Q ss_pred             cceEEEEEE--ecCccceeeceeE
Q psy6233         243 GEGVYLLKF--DNSYSLWRSKTLY  264 (271)
Q Consensus       243 ~~GvY~L~F--DNSYS~~RsK~L~  264 (271)
                      .++.|.+.|  ||+|+|...+.|.
T Consensus        37 ~~~~~~V~Ffg~~~~awv~~~~l~   60 (63)
T smart00293       37 FENLYPVLFFGDKDTAWISSSKLF   60 (63)
T ss_pred             CCCEEEEEEeCCCCEEEECcccee
Confidence            467788887  7789999888774


No 32 
>KOG3054|consensus
Probab=25.76  E-value=1.5e+02  Score=28.21  Aligned_cols=16  Identities=19%  Similarity=0.158  Sum_probs=7.6

Q ss_pred             hcccccchhhhHHHHH
Q psy6233          54 VNGNEVSQETSIEEKK   69 (271)
Q Consensus        54 ~~~~~~~~e~~~~e~~   69 (271)
                      +..++.+++|+++|+.
T Consensus       139 E~erKkdEeR~~eEae  154 (299)
T KOG3054|consen  139 EAERKKDEERLAEEAE  154 (299)
T ss_pred             HHHHhhhHHHHHHHHH
Confidence            3334444555555443


No 33 
>cd05838 WHSC1_related The PWWP domain was first identified in the WHSC1 (Wolf-Hirschhorn syndrome candidate 1) protein, a protein implicated in Wolf-Hirschhorn syndrome (WHS).  When translocated, WHSC1 plays a role in lymphoid multiple myeloma (MM) disease, also known as plasmacytoma. WHCS1 proteins typically contain two copies of the PWWP domain.  The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=25.15  E-value=44  Score=26.38  Aligned_cols=23  Identities=13%  Similarity=0.294  Sum_probs=18.0

Q ss_pred             ccceEEEEEE--ecCccceeeceeE
Q psy6233         242 PGEGVYLLKF--DNSYSLWRSKTLY  264 (271)
Q Consensus       242 ~~~GvY~L~F--DNSYS~~RsK~L~  264 (271)
                      ..+|.|.+.|  ||+|.|+..+.|.
T Consensus        36 ~~~~~~~V~Ffgs~~y~Wv~~~~l~   60 (95)
T cd05838          36 HCIGEFCVMFFGTHDYYWVHRGRVF   60 (95)
T ss_pred             CCCCeEEEEEeCCCCEEEecccccc
Confidence            3457888888  7779999988774


No 34 
>KOG1144|consensus
Probab=25.09  E-value=1.3e+02  Score=33.16  Aligned_cols=7  Identities=29%  Similarity=0.430  Sum_probs=2.8

Q ss_pred             HHHHhhh
Q psy6233          74 QDALNQQ   80 (271)
Q Consensus        74 ~~aln~q   80 (271)
                      +.||+.-
T Consensus       290 ~~a~aea  296 (1064)
T KOG1144|consen  290 EAALAEA  296 (1064)
T ss_pred             HHHHHHH
Confidence            3444433


No 35 
>cd05162 PWWP The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids.  The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation.  Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.  The function of the PWWP domain is still not known precisely; however, based on the fact that other regions of PWWP-domain proteins are responsible for nuclear localization and DNA-binding, is likely that the PWWP domain acts as a site for protein-protein binding interactions, influencing chromatin remodeling and thereby regulating transcriptional processes.  Some PWWP-domain proteins have been linked to cancer or other diseases; some are known to function as growth factors.
Probab=24.32  E-value=58  Score=24.63  Aligned_cols=22  Identities=27%  Similarity=0.454  Sum_probs=16.5

Q ss_pred             cceEEEEEE--ecCccceeeceeE
Q psy6233         243 GEGVYLLKF--DNSYSLWRSKTLY  264 (271)
Q Consensus       243 ~~GvY~L~F--DNSYS~~RsK~L~  264 (271)
                      .++.|.|.|  +|+|+|...+.|.
T Consensus        36 ~~~~~~V~Ffg~~~~~wv~~~~l~   59 (87)
T cd05162          36 KEGKVLVLFFGDKTFAWVGAERLK   59 (87)
T ss_pred             CCCEEEEEEeCCCcEEEeCcccee
Confidence            356677777  6779999988874


No 36 
>PRK09510 tolA cell envelope integrity inner membrane protein TolA; Provisional
Probab=24.05  E-value=2.4e+02  Score=28.16  Aligned_cols=18  Identities=33%  Similarity=0.540  Sum_probs=10.8

Q ss_pred             cCceEEEEEEeecCCCEE
Q psy6233         137 HGETVTVRVPTHEGGSCL  154 (271)
Q Consensus       137 ~Get~tV~V~v~~~Gs~L  154 (271)
                      .|.+.+|+|.+.+.|.++
T Consensus       324 ~gktc~VrI~LapDG~V~  341 (387)
T PRK09510        324 AGKTCTLRIKLAPDGTLL  341 (387)
T ss_pred             cCceEEEEEEEcCCCcEE
Confidence            355666666666666544


No 37 
>PHA03051 Hypothetical protein; Provisional
Probab=23.90  E-value=38  Score=26.44  Aligned_cols=14  Identities=36%  Similarity=0.705  Sum_probs=10.4

Q ss_pred             hhhCCCchhHHHHH
Q psy6233          15 DLKCILFKPYIDAH   28 (271)
Q Consensus        15 ~~~cp~f~~~v~a~   28 (271)
                      |+.||||+..+..-
T Consensus        33 nsrcplfksliska   46 (88)
T PHA03051         33 NSRCPLFKSLISKA   46 (88)
T ss_pred             CccChhHHHHHHHH
Confidence            57899998876543


No 38 
>PF09805 Nop25:  Nucleolar protein 12 (25kDa);  InterPro: IPR019186 Nop12 is a novel nucleolar protein required for pre-large subunit rRNA processing and in yeast normal rates of cell growth at low temperatures []. 
Probab=23.63  E-value=1.8e+02  Score=24.61  Aligned_cols=9  Identities=22%  Similarity=0.527  Sum_probs=4.5

Q ss_pred             hhHHH-HHHh
Q psy6233          22 KPYID-AHVA   30 (271)
Q Consensus        22 ~~~v~-a~~~   30 (271)
                      +.||+ =||+
T Consensus        21 ~eyLTGFhKR   30 (137)
T PF09805_consen   21 REYLTGFHKR   30 (137)
T ss_pred             HHHHHHHHHH
Confidence            34665 3444


No 39 
>PF02337 Gag_p10:  Retroviral GAG p10 protein;  InterPro: IPR003322 Retroviral matrix proteins (or major core proteins) are components of envelope-associated capsids, which line the inner surface of virus envelopes and are associated with viral membranes []. Matrix proteins are produced as part of Gag precursor polyproteins. During viral maturation, the Gag polyprotein is cleaved into major structural proteins by the viral protease, yielding the matrix (MA), capsid (CA), nucleocapsid (NC), and some smaller peptides. Gag-derived proteins govern the entire assembly and release of the virus particles, with matrix proteins playing key roles in Gag stability, capsid assembly, transport and budding. Although matrix proteins from different retroviruses appear to perform similar functions and can have similar structural folds, their primary sequences can be very different. This entry represents matrix proteins from beta-retroviruses such as Mason-Pfizer monkey virus (MPMV) (Simian Mason-Pfizer virus) and Mouse mammary tumor virus (MMTV) [, ]. This entry also identifies matrix proteins from several eukaryotic endogenous retroviruses, which arise when one or more copies of the retroviral genome becomes integrated into the host genome [].; GO: 0005198 structural molecule activity, 0019028 viral capsid; PDB: 2F77_X 2F76_X.
Probab=23.55  E-value=70  Score=25.57  Aligned_cols=18  Identities=28%  Similarity=0.660  Sum_probs=14.3

Q ss_pred             HHHHHHHHHHhhhCCCch
Q psy6233           5 DAMSNFIDLLDLKCILFK   22 (271)
Q Consensus         5 ~am~~fi~ll~~~cp~f~   22 (271)
                      .-...|+.-++++||-|.
T Consensus        28 ~~L~~f~~~i~~~~PWF~   45 (90)
T PF02337_consen   28 KDLINFLSFIDKVCPWFP   45 (90)
T ss_dssp             HHHHHHHHHHHHHTT-SS
T ss_pred             HHHHHHHHHHHHhCCCCC
Confidence            345789999999999995


No 40 
>KOG2357|consensus
Probab=23.36  E-value=2.2e+02  Score=28.92  Aligned_cols=17  Identities=0%  Similarity=-0.293  Sum_probs=9.0

Q ss_pred             HhhhCCCchhHHHHHHh
Q psy6233          14 LDLKCILFKPYIDAHVA   30 (271)
Q Consensus        14 l~~~cp~f~~~v~a~~~   30 (271)
                      +.+.|+.++-.-++.++
T Consensus       365 k~kt~~~RQ~~~e~~~K  381 (440)
T KOG2357|consen  365 KAKTDKNRQRVEEEFLK  381 (440)
T ss_pred             HhhhhhhHHHHHHHHHH
Confidence            34555555555555554


No 41 
>PLN03086 PRLI-interacting factor K; Provisional
Probab=22.37  E-value=1.9e+02  Score=30.40  Aligned_cols=13  Identities=23%  Similarity=0.399  Sum_probs=6.7

Q ss_pred             HHHHhhHHHHHHH
Q psy6233          26 DAHVAQKEEENRK   38 (271)
Q Consensus        26 ~a~~~~~ee~~~~   38 (271)
                      .+.|.++|.++|+
T Consensus         8 ~~~~~~~~~~~~~   20 (567)
T PLN03086          8 AREKLEREQRERK   20 (567)
T ss_pred             HHHHHHHHHHHHH
Confidence            4445555555554


No 42 
>PF03531 SSrecog:  Structure-specific recognition protein (SSRP1);  InterPro: IPR000969 Human structure-specific recognition protein, SSRP1, [] binds specifically to DNA modified with the anti-cancer drug cisplatin. An 81kDa protein is predicted, containing several highly-charged domains and a stretch of 75 residues that share 47% identity with a portion of the high mobility group (HMG) protein HMG1. This HMG box probably constitutes the structure recognition element for cisplatin-modified DNA, the probable recognition motif being the local duplex unwinding and bending that occurs on formation of intra-strand cross-links []. SSRP1 is the human homologue of a recently identified mouse protein that binds to recombination signal sequences []. These sequences have been postulated to form stem-loop structures, further implicating local bends and unwinding in DNA as a recognition target for HMG-box proteins. A Drosophila melanogaster cDNA encoding an HMG-box-containing protein has also been isolated [, ]. This protein shares 50% sequence identity with human SSRP1. In vitro binding studies using Drosophila SSRP showed that the protein binds to single-stranded DNA and RNA, with highest affinity for nucleotides G and U. Comparison of the predicted amino acid sequences among SSRP family members reveals 48% identity, with structural conservation in the C terminus of the HMG box, as well as domains of highly charged residues. The most highly conserved regions lie in the poorly understood N terminus, suggesting that this portion of the protein is critical for its function []. This entry contains Pob3 Q04636 from SWISSPROT, which is a subunit of the heterodimeric yeast FACT complex (Spt16p (IPR013953 from INTERPRO)-Pob3p) []. The FACT complex facilitates RNA Polymerase II transcription elongation through nucleosomes by destabilising and then reassembling nucleosome structure [, ].; GO: 0003677 DNA binding, 0005634 nucleus; PDB: 3F5R_A 2GCL_B 2GCJ_D.
Probab=21.57  E-value=1.3e+02  Score=27.64  Aligned_cols=36  Identities=19%  Similarity=0.454  Sum_probs=29.9

Q ss_pred             ecceEEeccceEEEEEEecCccceeeceeEEEEEEe
Q psy6233         235 YAGSHCYPGEGVYLLKFDNSYSLWRSKTLYYRVYYT  270 (271)
Q Consensus       235 ~~Gs~~~~~~GvY~L~FDNSYS~~RsK~L~Y~V~~t  270 (271)
                      ..+-......|.|-+.+=-+|=.+|.|+.-|+|.|+
T Consensus       139 f~~i~~ltPRGRydi~~y~~~lrl~GktyDykI~y~  174 (222)
T PF03531_consen  139 FEDILCLTPRGRYDIEMYPTFLRLHGKTYDYKIQYS  174 (222)
T ss_dssp             EEEEEEEETTEEEEEEE-SSEEEEEESSBEEEEEGG
T ss_pred             cccccccccccccccccccccccccccccccccccc
Confidence            344455677899999999999999999999999986


No 43 
>KOG2701|consensus
Probab=21.19  E-value=2.4e+02  Score=29.86  Aligned_cols=27  Identities=19%  Similarity=0.171  Sum_probs=22.9

Q ss_pred             HHHHHHHHHHHhhhCCCchhHHHHHHh
Q psy6233           4 KDAMSNFIDLLDLKCILFKPYIDAHVA   30 (271)
Q Consensus         4 ~~am~~fi~ll~~~cp~f~~~v~a~~~   30 (271)
                      .+=|..||.+||..|..|..|+-.-+-
T Consensus       501 K~Y~~keiSylndi~t~lsd~~~ka~~  527 (608)
T KOG2701|consen  501 KDYREKEISYLNDICTQLSDWMWKAKL  527 (608)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhccchh
Confidence            467999999999999999999875443


No 44 
>KOG2880|consensus
Probab=21.16  E-value=2.1e+02  Score=28.72  Aligned_cols=34  Identities=24%  Similarity=0.314  Sum_probs=23.7

Q ss_pred             HHHHHHhhhCCCchhHHHHHHhhHHHHHHHHHHHH
Q psy6233           9 NFIDLLDLKCILFKPYIDAHVAQKEEENRKAREQI   43 (271)
Q Consensus         9 ~fi~ll~~~cp~f~~~v~a~~~~~ee~~~~~~~~~   43 (271)
                      .||.|.-..||--.+|= |.|.+++.=-++.++.+
T Consensus        60 ry~tLfiEkipkHrDy~-s~k~ek~d~~~klk~~~   93 (424)
T KOG2880|consen   60 RYITLFIEKIPKHRDYR-SVKPEKEDIRKKLKEEA   93 (424)
T ss_pred             HHHHHHHHhcccCcchh-hhchhHHHHHHHHHHHh
Confidence            46666667777778887 77777777666665554


Done!