Query         psy4031
Match_columns 115
No_of_seqs    113 out of 202
Neff          5.1 
Searched_HMMs 46136
Date          Fri Aug 16 23:38:03 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy4031.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4031hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF04597 Ribophorin_I:  Ribopho 100.0 3.7E-46 8.1E-51  312.5   7.8  111    4-114   289-418 (432)
  2 KOG2291|consensus              100.0 1.3E-44 2.7E-49  309.6   6.0  111    4-114   315-443 (602)
  3 PRK13986 urease subunit alpha;  65.4      26 0.00057   28.1   6.2   57   33-89     76-137 (225)
  4 PF05986 ADAM_spacer1:  ADAM-TS  62.1     6.2 0.00013   27.6   1.9   14   41-54     19-32  (114)
  5 PRK13192 bifunctional urease s  59.7      39 0.00085   26.8   6.2   57   33-89     75-141 (208)
  6 cd06495 p23_NUDCD3_like p23-li  57.5      13 0.00028   25.9   2.9   23   32-54     10-35  (102)
  7 cd06493 p23_NUDCD1_like p23_NU  48.5      20 0.00043   23.3   2.6   22   33-54      5-28  (85)
  8 PF14903 WG_beta_rep:  WG conta  46.4      31 0.00068   18.1   2.8   24   69-92      2-25  (35)
  9 PF09972 DUF2207:  Predicted me  40.9      77  0.0017   26.0   5.5   62   32-102   128-190 (511)
 10 PHA02090 hypothetical protein   40.6      14  0.0003   24.7   0.8   13   12-24     48-61  (79)
 11 PF08968 DUF1885:  Domain of un  39.5      20 0.00043   26.5   1.6   15   36-50     82-96  (130)
 12 cd06467 p23_NUDC_like p23_like  37.4      46 0.00099   21.0   2.9   21   34-54      6-28  (85)
 13 cd06492 p23_mNUDC_like p23-lik  34.7      54  0.0012   21.7   3.1   23   32-54      4-30  (87)
 14 KOG2566|consensus               33.7      80  0.0017   27.9   4.6   51   33-88    437-488 (518)
 15 PF10976 DUF2790:  Protein of u  30.9      68  0.0015   21.5   3.0   31   48-79     36-66  (78)
 16 TIGR03000 plancto_dom_1 Planct  30.7      37 0.00081   22.8   1.7   14   37-50      2-15  (75)
 17 PF13860 FlgD_ig:  FlgD Ig-like  29.6      80  0.0017   20.3   3.2   17   37-53     13-29  (81)
 18 PF12503 CMV_1a_C:  Cucumber mo  27.5      32  0.0007   23.7   1.0   14   12-25     38-55  (85)
 19 PF07617 DUF1579:  Protein of u  26.3   2E+02  0.0044   21.4   5.2   35   15-50     64-98  (159)
 20 KOG3179|consensus               25.0      28  0.0006   28.1   0.3   35   42-78    159-193 (245)
 21 PF04525 Tub_2:  Tubby C 2;  In  24.9      71  0.0015   23.6   2.5   42   43-87     33-74  (187)
 22 cd01777 SNX27_RA Ubiquitin dom  24.9      55  0.0012   22.6   1.8   14   37-50      1-14  (87)
 23 COG4824 Phage-related holin (L  24.5      18  0.0004   26.8  -0.7   10   10-19     19-28  (133)
 24 PF13716 CRAL_TRIO_2:  Divergen  23.2      54  0.0012   22.7   1.6   16   72-87     10-25  (149)
 25 KOG3264|consensus               23.1      55  0.0012   26.1   1.7   17   38-54    171-187 (221)
 26 PF08487 VIT:  Vault protein in  23.1 1.2E+02  0.0026   20.9   3.3   37   63-102     7-43  (118)
 27 COG4856 Uncharacterized protei  22.2      65  0.0014   27.9   2.0   17   36-52    288-304 (403)
 28 PF10515 APP_amyloid:  beta-amy  22.0      41 0.00088   21.2   0.6   11   20-30     41-51  (52)
 29 PF08089 Toxin_20:  Huwentoxin-  22.0      25 0.00054   20.8  -0.3   10   11-20     26-35  (39)
 30 PF13349 DUF4097:  Domain of un  21.7 1.4E+02  0.0029   20.9   3.4   21   36-56    118-140 (166)
 31 cd06494 p23_NUDCD2_like p23-li  21.5 1.1E+02  0.0025   20.6   2.8   22   32-53     11-32  (93)
 32 PF08813 Phage_tail_3:  Phage t  21.5 1.1E+02  0.0023   22.9   2.9   35   16-50     90-128 (165)
 33 PF14879 DUF4489:  Domain of un  20.6 2.2E+02  0.0048   21.2   4.4   53   41-103    12-67  (141)
 34 PRK11615 hypothetical protein;  20.4 1.8E+02   0.004   22.7   4.0   42   39-85     49-91  (185)

No 1  
>PF04597 Ribophorin_I:  Ribophorin I;  InterPro: IPR007676 Ribophorin I is an essential subunit of oligosaccharyltransferase (OST), which is also known as dolichyl-diphosphooligosaccharide--protein glycosyltransferase, (2.4.1.119 from EC). OST catalyses the transfer of an oligosaccharide from dolichol pyrophosphate to selected asparagine residues of nascent polypeptides as they are translocated into the lumen of the rough endoplasmic reticulum. Ribophorin I and OST48 are thought to be responsible for OST catalytic activity []. Both yeast and mammalian proteins are glycosylated but the sites are not conserved. Glycosylation may contribute towards general solubility but is unlikely to be involved in a specific biochemical function []. Most family members are predicted to have a transmembrane helix at the C terminus of this region.; GO: 0004579 dolichyl-diphosphooligosaccharide-protein glycotransferase activity, 0006486 protein glycosylation, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=100.00  E-value=3.7e-46  Score=312.50  Aligned_cols=111  Identities=32%  Similarity=0.517  Sum_probs=108.7

Q ss_pred             ccccceeccCcccceEeeecCChhhhcc------------------ccceeeEEEEEEcCCCeeeeeEeCCcceeecccc
Q psy4031           4 ARVTWARSRYWHALYIPGHRRSPYDYAA------------------RAQPRNAKVNIILPEHSLVRQIHTPYPVSSQSDG   65 (115)
Q Consensus         4 ~~~~~plfGGWky~ftiGyn~pl~~~l~------------------d~~~d~~~l~ViLPEGA~~i~v~~P~~v~~~~~~   65 (115)
                      .||||||||||||+|++|||+|+++||+                  |++||+++++|+|||||+||+|.+|+++++.+++
T Consensus       289 l~PRfPLfGGWk~~FtiGyn~p~~~~l~~~~~~~y~L~vp~~~~~~d~~~d~~~l~i~LPEGA~~i~v~~P~~~~~~~~~  368 (432)
T PF04597_consen  289 LKPRFPLFGGWKYNFTIGYNLPLSNFLRKSGDGRYVLKVPFLPGIKDIVYDNVELRIILPEGAKNIKVSSPFPVDSVSVS  368 (432)
T ss_pred             EEcCCcccCCcceeEEEEccCChHHhEEECCCCcEEEEEECcCCcCceEEEEEEEEEECCCCceeeeEeCCccceeeecc
Confidence            6899999999999999999999999999                  9999999999999999999999999999999999


Q ss_pred             ceEEeccCCCCcEEEEEeccCcccCc-ceEEEEEEeCchhhcccceeeee
Q psy4031          66 THSTYLDTAGRPMITLRAHNLVENHI-EDIEIEYEFSKSYLGKEPLLRAF  114 (115)
Q Consensus        66 ~~~TYLDt~GRp~v~l~~~Nlvd~h~-~~i~V~Y~~~~~~~l~kPl~i~~  114 (115)
                      .++||||+.|||+|+++++||+|+|+ ++|+|+|+|+..+||+|||+|++
T Consensus       369 ~~~tyLDt~GR~vv~l~~~nlvd~~~~~~~~v~Y~~~~~~~~~kPl~i~~  418 (432)
T PF04597_consen  369 THKTYLDTTGRPVVVLEKKNLVDEHNDQDFQVTYTYPSSAMLRKPLLIAG  418 (432)
T ss_pred             ceeeeeeccCceEEEEEeccCCHhHCCeeEEEEEEcCHHHHHHHHHHHHH
Confidence            99999999999999999999999988 99999999999999999999875


No 2  
>KOG2291|consensus
Probab=100.00  E-value=1.3e-44  Score=309.64  Aligned_cols=111  Identities=31%  Similarity=0.477  Sum_probs=108.6

Q ss_pred             ccccceeccCcccceEeeecCChhhhcc------------------ccceeeEEEEEEcCCCeeeeeEeCCcceeecccc
Q psy4031           4 ARVTWARSRYWHALYIPGHRRSPYDYAA------------------RAQPRNAKVNIILPEHSLVRQIHTPYPVSSQSDG   65 (115)
Q Consensus         4 ~~~~~plfGGWky~ftiGyn~pl~~~l~------------------d~~~d~~~l~ViLPEGA~~i~v~~P~~v~~~~~~   65 (115)
                      .||||||||||||+|+||||+|+++||+                  |++||+++++|+|||||+||++.+||++.+.+.+
T Consensus       315 irPRfPlFGGWkt~ftiGy~lP~~eyl~~~g~ry~L~~~~~~~~~d~~V~dkl~ikvvLPEGak~i~i~tP~~is~~p~e  394 (602)
T KOG2291|consen  315 IRPRFPLFGGWKTNFTIGYNLPLEEYLFSKGRRYALKIILIDHIFDDTVYDKLTIKVVLPEGAKDIEIDTPYEISRSPIE  394 (602)
T ss_pred             eccCCccccCceeeEEEecCCcHHHHhhccCceeEEccccccCCCccceeeeEEEEEEccCCCcccccccceeeccCchh
Confidence            5999999999999999999999999999                  6799999999999999999999999999999999


Q ss_pred             ceEEeccCCCCcEEEEEeccCcccCcceEEEEEEeCchhhcccceeeee
Q psy4031          66 THSTYLDTAGRPMITLRAHNLVENHIEDIEIEYEFSKSYLGKEPLLRAF  114 (115)
Q Consensus        66 ~~~TYLDt~GRp~v~l~~~Nlvd~h~~~i~V~Y~~~~~~~l~kPl~i~~  114 (115)
                      +++|||||.|||+++++|+|+++.|+++|+|+|+|++..||+|||+|++
T Consensus       395 ~~~syLDt~GR~Vvv~ek~Nvv~~h~~~i~v~Y~f~~~sml~ePL~i~a  443 (602)
T KOG2291|consen  395 LKYSYLDTNGRPVVVLEKNNVVPDHNQDIVVHYTFSKSSMLQEPLLIIA  443 (602)
T ss_pred             hhhhhhhccCcEEEEEEccccCCCCCccEEEEEEechhHhhhccHHHHH
Confidence            9999999999999999999999999999999999999999999999975


No 3  
>PRK13986 urease subunit alpha; Provisional
Probab=65.40  E-value=26  Score=28.09  Aligned_cols=57  Identities=14%  Similarity=0.304  Sum_probs=41.3

Q ss_pred             cceeeEEEEEEcCCCeeeeeEeCCcceeeccccceEEecc-----CCCCcEEEEEeccCccc
Q psy4031          33 AQPRNAKVNIILPEHSLVRQIHTPYPVSSQSDGTHSTYLD-----TAGRPMITLRAHNLVEN   89 (115)
Q Consensus        33 ~~~d~~~l~ViLPEGA~~i~v~~P~~v~~~~~~~~~TYLD-----t~GRp~v~l~~~Nlvd~   89 (115)
                      -.++++.+.-.+|.|.+-|.|+-|+.-+..-.--.+-+-|     ..||+.++++-.|.-|.
T Consensus        76 ~ml~~vqVEatFpDGTkLVtvh~PI~~~~~~~PGe~~~~~~~I~lN~gr~~~~l~V~NtGDR  137 (225)
T PRK13986         76 SMIHEVGIEAMFPDGTKLVTVHTPIEANGKLVPGELFLKDEDITINAGKKAVSVKVKNVGDR  137 (225)
T ss_pred             HhcceeeEEEEcCCCCEEEEeCCCcCCCCCCCCceEecCCCCeecCCCCcEEEEEEEeCCCC
Confidence            3578899999999999999999999522111111222222     46999999999998775


No 4  
>PF05986 ADAM_spacer1:  ADAM-TS Spacer 1;  InterPro: IPR010294 This domain represents the Spacer-1 domain from the ADAM-TS family of metalloproteinases []. A cellular disintegrin and metalloproteinase (ADAM) is a family of genes with structural homology to the snake venom metalloproteinases and disintegrins []. There is variation amongst members of the family, however, all have a similar domain organisation comprising a preproregion, a reprolysin-type catalytic domain, a disintegrin-like domain, a thrombospondin type-1 (TS) module, a cysteine-rich domain, a spacer domain without cysteine residues, and a COOH-terminal TS module [, ]. They are involved in embryogenesis and have been implicated in some cancers and inflammatory diseases [].; GO: 0004222 metalloendopeptidase activity, 0031012 extracellular matrix
Probab=62.10  E-value=6.2  Score=27.56  Aligned_cols=14  Identities=14%  Similarity=0.211  Sum_probs=12.1

Q ss_pred             EEEcCCCeeeeeEe
Q psy4031          41 NIILPEHSLVRQIH   54 (115)
Q Consensus        41 ~ViLPEGA~~i~v~   54 (115)
                      -+.+|.||++|+|.
T Consensus        19 v~~IP~GA~nI~I~   32 (114)
T PF05986_consen   19 VVTIPAGARNIRIT   32 (114)
T ss_pred             EEECCCCceEEEEE
Confidence            47899999999884


No 5  
>PRK13192 bifunctional urease subunit gamma/beta; Reviewed
Probab=59.69  E-value=39  Score=26.81  Aligned_cols=57  Identities=21%  Similarity=0.282  Sum_probs=42.5

Q ss_pred             cceeeEEEEEEcCCCeeeeeEeCCcceeecccc-----ceEEecc-----CCCCcEEEEEeccCccc
Q psy4031          33 AQPRNAKVNIILPEHSLVRQIHTPYPVSSQSDG-----THSTYLD-----TAGRPMITLRAHNLVEN   89 (115)
Q Consensus        33 ~~~d~~~l~ViLPEGA~~i~v~~P~~v~~~~~~-----~~~TYLD-----t~GRp~v~l~~~Nlvd~   89 (115)
                      -.++++.+.-.+|.|-+-|.|+-|+.-+..+.+     -..-.-|     ..||+.++++-.|.-|.
T Consensus        75 ~ml~~vqVEatFpDGTkLVtvh~PI~~~~~~~~~al~PGei~~~~~~I~lN~gr~~~~l~V~NtGDR  141 (208)
T PRK13192         75 DMVPEVQVEATFPDGTKLVTVHDPIRPAEGDLADALYPGEILPGDGEIELNAGRPAVTLDVTNTGDR  141 (208)
T ss_pred             hhcceeeEEEEcCCCCEEEEecCCcCCCCCchhhccCCCEEEcCCCCeeeCCCCCEEEEEEEeCCCC
Confidence            357889999999999999999999975433321     1222222     47999999999998775


No 6  
>cd06495 p23_NUDCD3_like p23-like NUD (nuclear distribution) C-like domain found in human NUDC domain-containing protein 3 (NUDCD3) and similar proteins.   Little is known about the function of the proteins in this subgroup.
Probab=57.53  E-value=13  Score=25.85  Aligned_cols=23  Identities=9%  Similarity=0.209  Sum_probs=19.2

Q ss_pred             ccceeeEEEEEEcCCC---eeeeeEe
Q psy4031          32 RAQPRNAKVNIILPEH---SLVRQIH   54 (115)
Q Consensus        32 d~~~d~~~l~ViLPEG---A~~i~v~   54 (115)
                      .+..++++++|=||+|   |+++++.
T Consensus        10 tQTl~eV~V~i~lp~~~~~~kdv~v~   35 (102)
T cd06495          10 SQDYTDVEVRVPVPKDVVKGRQVSVD   35 (102)
T ss_pred             EeECCeEEEEEECCCCCccceEEEEE
Confidence            7788999999999998   5676653


No 7  
>cd06493 p23_NUDCD1_like p23_NUDCD1: p23-like NUD (nuclear distribution) C-like domain found in human NUD (nuclear distribution) C domain-containing protein 1, NUDCD1 (also known as CML66), and similar proteins. NUDCD1/CML66 is a broadly immunogenic tumor associated antigen, which is highly expressed in a variety of solid tumors and in leukemias. In normal tissues high expression of NUDCD1/CML66 is limited to testis and heart.
Probab=48.47  E-value=20  Score=23.33  Aligned_cols=22  Identities=23%  Similarity=0.301  Sum_probs=17.4

Q ss_pred             cceeeEEEEEEcCCCe--eeeeEe
Q psy4031          33 AQPRNAKVNIILPEHS--LVRQIH   54 (115)
Q Consensus        33 ~~~d~~~l~ViLPEGA--~~i~v~   54 (115)
                      +.-+++++.|.+|+|+  +++++.
T Consensus         5 Qt~~~V~v~i~~p~~~~~~dv~v~   28 (85)
T cd06493           5 QTEEDLTLTIRLPEDTTKEDIRIK   28 (85)
T ss_pred             EeCCEEEEEEECCCCCChhhEEEE
Confidence            4568999999999996  666653


No 8  
>PF14903 WG_beta_rep:  WG containing repeat
Probab=46.40  E-value=31  Score=18.13  Aligned_cols=24  Identities=17%  Similarity=0.282  Sum_probs=17.8

Q ss_pred             EeccCCCCcEEEEEeccCcccCcc
Q psy4031          69 TYLDTAGRPMITLRAHNLVENHIE   92 (115)
Q Consensus        69 TYLDt~GRp~v~l~~~Nlvd~h~~   92 (115)
                      .++|+.|+.++..+.+.+-.-.+.
T Consensus         2 G~id~~G~~vi~~~yd~i~~~~~g   25 (35)
T PF14903_consen    2 GYIDKNGKIVIPPKYDEIYPFSNG   25 (35)
T ss_pred             EEEeCCCCEEEEccccCccccCCC
Confidence            578899999988888776554443


No 9  
>PF09972 DUF2207:  Predicted membrane protein (DUF2207);  InterPro: IPR018702 This domain has no known function.
Probab=40.95  E-value=77  Score=26.05  Aligned_cols=62  Identities=23%  Similarity=0.270  Sum_probs=36.2

Q ss_pred             ccceeeEEEEEEcCCCeeeeeEeCC-cceeeccccceEEeccCCCCcEEEEEeccCcccCcceEEEEEEeCc
Q psy4031          32 RAQPRNAKVNIILPEHSLVRQIHTP-YPVSSQSDGTHSTYLDTAGRPMITLRAHNLVENHIEDIEIEYEFSK  102 (115)
Q Consensus        32 d~~~d~~~l~ViLPEGA~~i~v~~P-~~v~~~~~~~~~TYLDt~GRp~v~l~~~Nlvd~h~~~i~V~Y~~~~  102 (115)
                      ++-+++++++|.||++...+++..= -+..     ...+ ++ ...-.++++.+|+-  .++.+.|...||+
T Consensus       128 ~~~i~~v~v~i~~P~~~~~~~~~~~~g~~~-----~~~~-~~-~~~~~v~~~~~~l~--~~~~~~v~~~fP~  190 (511)
T PF09972_consen  128 DVPIENVTVTITLPKPVDNSKAWGHPGPYG-----GTVE-ID-DDDGTVTFTTDNLP--PNEGVEVRVSFPK  190 (511)
T ss_pred             CCccceEEEEEECCCCCcceEEEEeccCCC-----ccce-ee-ecCCEEEEEEeccC--CCCeEEEEEEccc
Confidence            7889999999999987766655321 0111     1111 22 23345667777743  4455666666676


No 10 
>PHA02090 hypothetical protein
Probab=40.57  E-value=14  Score=24.68  Aligned_cols=13  Identities=15%  Similarity=0.043  Sum_probs=10.3

Q ss_pred             cCcccceE-eeecC
Q psy4031          12 RYWHALYI-PGHRR   24 (115)
Q Consensus        12 GGWky~ft-iGyn~   24 (115)
                      ||||++-+ +||+.
T Consensus        48 g~~ktna~flgy~i   61 (79)
T PHA02090         48 GGWKTNAEFLGYAI   61 (79)
T ss_pred             CCccccceeeeeee
Confidence            89999976 57764


No 11 
>PF08968 DUF1885:  Domain of unknown function (DUF1885);  InterPro: IPR015062 This family consists of hypothetical proteins produced by bacteria of the Bacillus genus. ; PDB: 1T6A_A.
Probab=39.53  E-value=20  Score=26.48  Aligned_cols=15  Identities=13%  Similarity=0.463  Sum_probs=8.8

Q ss_pred             eeEEEEEEcCCCeee
Q psy4031          36 RNAKVNIILPEHSLV   50 (115)
Q Consensus        36 d~~~l~ViLPEGA~~   50 (115)
                      +...+.|+||+||+.
T Consensus        82 e~~~IQv~LP~~ATh   96 (130)
T PF08968_consen   82 EQSYIQVVLPDGATH   96 (130)
T ss_dssp             TEEEEEEE--TT--H
T ss_pred             cceEEEEECCCCCcc
Confidence            467899999999963


No 12 
>cd06467 p23_NUDC_like p23_like domain of NUD (nuclear distribution) C and similar proteins. Aspergillus nidulas (An) NUDC is needed for nuclear movement. AnNUDC is localized at the hyphal cortex, and binds NUDF at spindle pole bodies (SPBs) and in the cytoplasm at different stages in the cell cycle. At the SPBs it is part of the dynein molecular motor/NUDF complex that regulates microtubule dynamics.  Mammalian(m) NUDC associates both with the dynein complex and also with an anti-inflammatory enzyme, platelet activating factor acetylhydrolase I, PAF-AH(I) complex, through binding mNUDF, the regulatory beta subunit of PAF-AH(I).  mNUDC is important for cell proliferation both in normal and tumor tissues.  Its expression is elevated in various cell types undergoing mitosis or stimulated to proliferate, with high expression levels observed in leukemic cells and tumors.  For a leukemic cell line, human NUDC was shown to activate the thrombopoietin (TPO) receptor (Mpl) by binding to its ext
Probab=37.39  E-value=46  Score=20.99  Aligned_cols=21  Identities=24%  Similarity=0.273  Sum_probs=16.1

Q ss_pred             ceeeEEEEEEcCCCe--eeeeEe
Q psy4031          34 QPRNAKVNIILPEHS--LVRQIH   54 (115)
Q Consensus        34 ~~d~~~l~ViLPEGA--~~i~v~   54 (115)
                      .-+++.+.|-+|+|.  +++++.
T Consensus         6 t~~~V~i~i~~~~~~~~~dv~v~   28 (85)
T cd06467           6 TLDEVTVTIPLPEGTKSKDVKVE   28 (85)
T ss_pred             eCCEEEEEEECCCCCcceeEEEE
Confidence            457899999999984  666653


No 13 
>cd06492 p23_mNUDC_like p23-like NUD (nuclear distribution) C-like domain of mammalian(m) NUDC and similar proteins. Mammalian(m) NUDC associates both with the dynein complex and also with an anti-inflammatory enzyme, platelet activating factor acetylhydrolase I, PAF-AH(I) complex, through binding mNUDF, the regulatory beta subunit of PAF-AH(I).  mNUDC is important for cell proliferation both in normal and tumor tissues.  Its expression is elevated in various cell types undergoing mitosis or stimulated to proliferate, with high expression levels observed in leukemic cells and tumors. For a leukemic cell line, human NUDC was shown to activate the thrombopoietin (TPO) receptor (Mpl) by binding to its extracellular domain, and promoting cell proliferation and differentiation.
Probab=34.71  E-value=54  Score=21.72  Aligned_cols=23  Identities=0%  Similarity=-0.039  Sum_probs=17.5

Q ss_pred             ccceeeEEEEEEcCCC----eeeeeEe
Q psy4031          32 RAQPRNAKVNIILPEH----SLVRQIH   54 (115)
Q Consensus        32 d~~~d~~~l~ViLPEG----A~~i~v~   54 (115)
                      .+..++++++|-||+|    |+++++.
T Consensus         4 ~QT~~ev~v~v~l~~~~~~~~kdv~v~   30 (87)
T cd06492           4 TQTLSEVELKVPFKVSFRLKGKDVVVD   30 (87)
T ss_pred             EeecCEEEEEEECCCCCCccceEEEEE
Confidence            3567899999999887    4677664


No 14 
>KOG2566|consensus
Probab=33.72  E-value=80  Score=27.93  Aligned_cols=51  Identities=14%  Similarity=0.156  Sum_probs=34.0

Q ss_pred             cceeeEEEEEEcCCCeeeeeEeCCcceeeccccce-EEeccCCCCcEEEEEeccCcc
Q psy4031          33 AQPRNAKVNIILPEHSLVRQIHTPYPVSSQSDGTH-STYLDTAGRPMITLRAHNLVE   88 (115)
Q Consensus        33 ~~~d~~~l~ViLPEGA~~i~v~~P~~v~~~~~~~~-~TYLDt~GRp~v~l~~~Nlvd   88 (115)
                      |.|-=...+=.||+||..|....-.     +.+.+ -++|--.|+.+|++--+|-.+
T Consensus       437 mfya~~hFSkFl~pGs~Rv~~~i~~-----~~~ve~~aflnpdGskvvVllnk~s~~  488 (518)
T KOG2566|consen  437 MFYALGHFSKFLPPGSVRVGHSINQ-----NLDVEATAFLNPDGSKVVVLLNKNSLD  488 (518)
T ss_pred             HHHHHHHHhhcCCCCceEeeeeecc-----ccccceeEEEcCCCcEEEEEeccCCCC
Confidence            4444455677899999988764322     33333 478889999988777666544


No 15 
>PF10976 DUF2790:  Protein of unknown function (DUF2790);  InterPro: IPR021245  This family of proteins with unknown function appear to be restricted to Pseudomonadaceae. 
Probab=30.89  E-value=68  Score=21.48  Aligned_cols=31  Identities=19%  Similarity=0.167  Sum_probs=19.9

Q ss_pred             eeeeeEeCCcceeeccccceEEeccCCCCcEE
Q psy4031          48 SLVRQIHTPYPVSSQSDGTHSTYLDTAGRPMI   79 (115)
Q Consensus        48 A~~i~v~~P~~v~~~~~~~~~TYLDt~GRp~v   79 (115)
                      |+.|.+..+-+ .-.-+..+-+|+|..|+.-.
T Consensus        36 akVis~t~~~~-~C~Vvpa~MtY~DS~G~~h~   66 (78)
T PF10976_consen   36 AKVISVTPPPN-VCGVVPARMTYEDSQGELHT   66 (78)
T ss_pred             heeEEecCCCC-CCcEEccEEEEECCCCCEEE
Confidence            35566654432 23445567799999999863


No 16 
>TIGR03000 plancto_dom_1 Planctomycetes uncharacterized domain TIGR03000. Domains described by this model are found, so far, only in the Planctomycetes (Pirellula sp. strain 1 and Gemmata obscuriglobus), in up to six proteins per genome, and may be duplicated within a protein. The function is unknown.
Probab=30.75  E-value=37  Score=22.79  Aligned_cols=14  Identities=21%  Similarity=0.503  Sum_probs=11.5

Q ss_pred             eEEEEEEcCCCeee
Q psy4031          37 NAKVNIILPEHSLV   50 (115)
Q Consensus        37 ~~~l~ViLPEGA~~   50 (115)
                      .+++.|.||++|+.
T Consensus         2 ~a~itv~vPadAkl   15 (75)
T TIGR03000         2 AATITVTLPADAKL   15 (75)
T ss_pred             ceEEEEEeCCCCEE
Confidence            46899999999863


No 17 
>PF13860 FlgD_ig:  FlgD Ig-like domain; PDB: 3C12_A 3OSV_A.
Probab=29.60  E-value=80  Score=20.33  Aligned_cols=17  Identities=18%  Similarity=0.309  Sum_probs=12.4

Q ss_pred             eEEEEEEcCCCeeeeeE
Q psy4031          37 NAKVNIILPEHSLVRQI   53 (115)
Q Consensus        37 ~~~l~ViLPEGA~~i~v   53 (115)
                      .+.+++.||++|.++++
T Consensus        13 ~~~~~~~l~~~a~~v~v   29 (81)
T PF13860_consen   13 KGSIEYTLPEDADNVTV   29 (81)
T ss_dssp             EEEEEEEECSSCEEEEE
T ss_pred             EEEEEEeCCCcccEEEE
Confidence            45677778888887755


No 18 
>PF12503 CMV_1a_C:  Cucumber mosaic virus 1a protein C terminal ;  InterPro: IPR022184  This domain family is found in viruses, and is approximately 90 amino acids in length. The family is found in association with PF01443 from PFAM, PF01660 from PFAM. There is a conserved GLG sequence motif. 1a protein is the major virulence factor of the (Cucumber mosaic virus. The Ns strain of CMV causes necrotic lesions to Nicotiana spp. while other strains cause systemic mosaic. The determinant of the pathogenesis of these different strains is the specific amino acid residue at the 461 residue of the 1a protein. ; GO: 0008168 methyltransferase activity, 0016817 hydrolase activity, acting on acid anhydrides
Probab=27.51  E-value=32  Score=23.67  Aligned_cols=14  Identities=29%  Similarity=0.496  Sum_probs=11.2

Q ss_pred             cCc----ccceEeeecCC
Q psy4031          12 RYW----HALYIPGHRRS   25 (115)
Q Consensus        12 GGW----ky~ftiGyn~p   25 (115)
                      |+|    ++.|++|||..
T Consensus        38 GrWm~P~~~~Y~VGyNe~   55 (85)
T PF12503_consen   38 GRWMYPEGYEYMVGYNES   55 (85)
T ss_pred             CceecCCCeEEEeeecCC
Confidence            677    67899999963


No 19 
>PF07617 DUF1579:  Protein of unknown function (DUF1579);  InterPro: IPR011473 This is a family of paralogous hypothetical proteins identified in Rhodopirellula baltica that also has members in Gloeobacter violaceus, Rhizobium meliloti and Agrobacterium tumefaciens, amongst others.
Probab=26.29  E-value=2e+02  Score=21.38  Aligned_cols=35  Identities=14%  Similarity=0.059  Sum_probs=30.6

Q ss_pred             ccceEeeecCChhhhccccceeeEEEEEEcCCCeee
Q psy4031          15 HALYIPGHRRSPYDYAARAQPRNAKVNIILPEHSLV   50 (115)
Q Consensus        15 ky~ftiGyn~pl~~~l~d~~~d~~~l~ViLPEGA~~   50 (115)
                      ...-++||+.....|+ ...+|+.-..+-..||.-+
T Consensus        64 ~~~~~lGYD~~~~~yv-gtWidSM~t~~~~~~G~~d   98 (159)
T PF07617_consen   64 EGIGTLGYDPAKKKYV-GTWIDSMGTGIWVYEGTLD   98 (159)
T ss_pred             EEEEEEEECCccCeEE-EEEeccCCCceeeEccccC
Confidence            3457899999999999 8999999999999999844


No 20 
>KOG3179|consensus
Probab=24.96  E-value=28  Score=28.11  Aligned_cols=35  Identities=20%  Similarity=0.166  Sum_probs=23.4

Q ss_pred             EEcCCCeeeeeEeCCcceeeccccceEEeccCCCCcE
Q psy4031          42 IILPEHSLVRQIHTPYPVSSQSDGTHSTYLDTAGRPM   78 (115)
Q Consensus        42 ViLPEGA~~i~v~~P~~v~~~~~~~~~TYLDt~GRp~   78 (115)
                      +-|||||+.+-.+.-..++-  .+.....|...|||-
T Consensus       159 le~PE~a~llasSe~ceve~--fs~~~~~l~fQGHPE  193 (245)
T KOG3179|consen  159 LELPEGAELLASSEKCEVEM--FSIEDHLLCFQGHPE  193 (245)
T ss_pred             ecCCchhhhhccccccceEE--EEecceEEEecCCch
Confidence            35899999997766655532  223345677789985


No 21 
>PF04525 Tub_2:  Tubby C 2;  InterPro: IPR007612 This is a family of plant and bacterial uncharacterised proteins.; PDB: 1ZXU_A 2Q4M_A.
Probab=24.90  E-value=71  Score=23.58  Aligned_cols=42  Identities=21%  Similarity=0.287  Sum_probs=22.5

Q ss_pred             EcCCCeeeeeEeCCcceeeccccceEEeccCCCCcEEEEEeccCc
Q psy4031          43 ILPEHSLVRQIHTPYPVSSQSDGTHSTYLDTAGRPMITLRAHNLV   87 (115)
Q Consensus        43 iLPEGA~~i~v~~P~~v~~~~~~~~~TYLDt~GRp~v~l~~~Nlv   87 (115)
                      .=..|...-+|..   -.-.+.....+.+|..|+|.++|+++.+.
T Consensus        33 ~D~~G~~vf~V~g---~~~~s~~~~~~l~D~~G~~L~~i~~k~~~   74 (187)
T PF04525_consen   33 YDENGNVVFRVDG---GKFFSIGKKRTLMDASGNPLFTIRRKLFS   74 (187)
T ss_dssp             EETTS-EEEEEE-----SCTTBTTEEEEE-TTS-EEEEEE-----
T ss_pred             EcCCCCEEEEEEE---ecccCCCCEEEEECCCCCEEEEEEeeecc
Confidence            3445555555544   01146678889999999999999997443


No 22 
>cd01777 SNX27_RA Ubiquitin domain of SNX27 (sorting nexin protein 27). SNX27_RA   SNX27 (sorting nexin protein 27) belongs to a large family of endosome-localized proteins related to sorting nexin1 which is implicated in regulating membrane traffic.  The domain architecture of SNX27 includes an amino-terminal PDZ domain, a PX (PhoX homologous) domain, and a carboxy-terminal RA (RAS-associated) domain.
Probab=24.89  E-value=55  Score=22.57  Aligned_cols=14  Identities=29%  Similarity=0.641  Sum_probs=11.7

Q ss_pred             eEEEEEEcCCCeee
Q psy4031          37 NAKVNIILPEHSLV   50 (115)
Q Consensus        37 ~~~l~ViLPEGA~~   50 (115)
                      +++++|.||-|++.
T Consensus         1 ~V~L~V~Lpdg~~i   14 (87)
T cd01777           1 DVELRIALPDKATV   14 (87)
T ss_pred             CeEEEEEccCCCEE
Confidence            47899999999854


No 23 
>COG4824 Phage-related holin (Lysis protein) [General function prediction only]
Probab=24.53  E-value=18  Score=26.79  Aligned_cols=10  Identities=10%  Similarity=-0.250  Sum_probs=8.1

Q ss_pred             eccCcccceE
Q psy4031          10 RSRYWHALYI   19 (115)
Q Consensus        10 lfGGWky~ft   19 (115)
                      +||||+|.+.
T Consensus        19 lFG~~~~l~~   28 (133)
T COG4824          19 LFGGVDYLLL   28 (133)
T ss_pred             HhcchHHHHH
Confidence            7999998763


No 24 
>PF13716 CRAL_TRIO_2:  Divergent CRAL/TRIO domain; PDB: 2D4Q_B 3PG7_B 2E2X_B 3P7Z_B 3PEG_A.
Probab=23.23  E-value=54  Score=22.75  Aligned_cols=16  Identities=38%  Similarity=0.675  Sum_probs=12.5

Q ss_pred             cCCCCcEEEEEeccCc
Q psy4031          72 DTAGRPMITLRAHNLV   87 (115)
Q Consensus        72 Dt~GRp~v~l~~~Nlv   87 (115)
                      |..|||++++-.+++.
T Consensus        10 d~~g~pV~~~~~~~~~   25 (149)
T PF13716_consen   10 DREGRPVVVFIASRLP   25 (149)
T ss_dssp             BTTS-EEEEEEGGG-C
T ss_pred             CCCcCEEEEEECCcCc
Confidence            7789999999999994


No 25 
>KOG3264|consensus
Probab=23.10  E-value=55  Score=26.06  Aligned_cols=17  Identities=6%  Similarity=0.249  Sum_probs=14.0

Q ss_pred             EEEEEEcCCCeeeeeEe
Q psy4031          38 AKVNIILPEHSLVRQIH   54 (115)
Q Consensus        38 ~~l~ViLPEGA~~i~v~   54 (115)
                      ++++|++|+|++|-.+.
T Consensus       171 velsV~aPa~~E~~~va  187 (221)
T KOG3264|consen  171 VELSVVAPAGQENYEVA  187 (221)
T ss_pred             EEEEEecCCccchhhHH
Confidence            78999999999976543


No 26 
>PF08487 VIT:  Vault protein inter-alpha-trypsin domain;  InterPro: IPR013694 Inter-alpha-trypsin inhibitors (ITIs) consist of one light chain and a variable set of heavy chains. ITIs play a role in extracellular matrix (ECM) stabilisation and tumour metastasis as well as in plasma protease inhibition []. The vault protein inter-alpha-trypsin (VIT) domain described here is found to the N terminus of a von Willebrand factor type A domain (IPR002035 from INTERPRO) in ITI heavy chains (ITIHs) and their precursors. 
Probab=23.07  E-value=1.2e+02  Score=20.86  Aligned_cols=37  Identities=27%  Similarity=0.267  Sum_probs=27.5

Q ss_pred             cccceEEeccCCCCcEEEEEeccCcccCcceEEEEEEeCc
Q psy4031          63 SDGTHSTYLDTAGRPMITLRAHNLVENHIEDIEIEYEFSK  102 (115)
Q Consensus        63 ~~~~~~TYLDt~GRp~v~l~~~Nlvd~h~~~i~V~Y~~~~  102 (115)
                      +...+-+..|...|++++..+.|-.   .....+.|.|+-
T Consensus         7 s~~v~~~I~~~~a~t~v~q~f~N~~---~~~~E~~y~fpL   43 (118)
T PF08487_consen    7 SVHVKVTIIDRFARTTVTQTFENPS---SEPLEAVYSFPL   43 (118)
T ss_pred             EEEEEEEEEccEEEEEEEEEEECCC---CCcEEEEEEeEC
Confidence            3445567788899999999999854   445677888764


No 27 
>COG4856 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.17  E-value=65  Score=27.95  Aligned_cols=17  Identities=18%  Similarity=0.200  Sum_probs=13.7

Q ss_pred             eeEEEEEEcCCCeeeee
Q psy4031          36 RNAKVNIILPEHSLVRQ   52 (115)
Q Consensus        36 d~~~l~ViLPEGA~~i~   52 (115)
                      +..++++.||||++-++
T Consensus       288 t~~tv~lpvPegv~sv~  304 (403)
T COG4856         288 TTKTVKLPVPEGVKSVS  304 (403)
T ss_pred             ceEEEEeeCCCcceecC
Confidence            35789999999997653


No 28 
>PF10515 APP_amyloid:  beta-amyloid precursor protein C-terminus;  InterPro: IPR019543  This is the amyloid, C-terminal, protein of the beta-Amyloid precursor protein (APP) which is a conserved and ubiquitous transmembrane glycoprotein strongly implicated in the pathogenesis of Alzheimer's disease but whose normal biological function is unknown. The C-terminal 100 residues are released and aggregate into amyloid deposits which are strongly implicated in the pathology of Alzheimer's disease plaque-formation. The domain is associated with IPR008154 from INTERPRO, further towards the N terminus. ; PDB: 2ROZ_A 3DXD_D 1X11_D 2LP1_A 2LOH_A 3MXC_L 3MXY_L 3DXC_B 3DXE_D 3L81_B ....
Probab=22.02  E-value=41  Score=21.22  Aligned_cols=11  Identities=27%  Similarity=0.561  Sum_probs=9.4

Q ss_pred             eeecCChhhhc
Q psy4031          20 PGHRRSPYDYA   30 (115)
Q Consensus        20 iGyn~pl~~~l   30 (115)
                      -||.+|++.|+
T Consensus        41 nGYENPTYkyf   51 (52)
T PF10515_consen   41 NGYENPTYKYF   51 (52)
T ss_dssp             TEEESCTCHHC
T ss_pred             cCCcCCceecc
Confidence            39999999886


No 29 
>PF08089 Toxin_20:  Huwentoxin-II family;  InterPro: IPR012625 This family consists of the huwentoxin-II (HWTX-II) family of toxins secreted by spiders. These toxins are found in venom that secreted from the bird spider Selenocosmia huwena Wang. The HWTX-II adopts a novel scaffold different from the ICK motif that is found in other huwentoxins. HWTX-II consists of 37 amino acids residues including six cysteines involved in three disulphide bridges [].; GO: 0005576 extracellular region; PDB: 2KGH_A 1I25_A.
Probab=21.98  E-value=25  Score=20.83  Aligned_cols=10  Identities=10%  Similarity=0.275  Sum_probs=6.9

Q ss_pred             ccCcccceEe
Q psy4031          11 SRYWHALYIP   20 (115)
Q Consensus        11 fGGWky~fti   20 (115)
                      -||||..|.+
T Consensus        26 ~GGwkcK~n~   35 (39)
T PF08089_consen   26 KGGWKCKFNM   35 (39)
T ss_dssp             -TTEEEETTE
T ss_pred             cCCceEEEEE
Confidence            4899987653


No 30 
>PF13349 DUF4097:  Domain of unknown function (DUF4097)
Probab=21.73  E-value=1.4e+02  Score=20.87  Aligned_cols=21  Identities=29%  Similarity=0.381  Sum_probs=16.4

Q ss_pred             eeEEEEEEcCCCe--eeeeEeCC
Q psy4031          36 RNAKVNIILPEHS--LVRQIHTP   56 (115)
Q Consensus        36 d~~~l~ViLPEGA--~~i~v~~P   56 (115)
                      ....+.|.||++.  .++++.+-
T Consensus       118 ~~~~i~I~lP~~~~l~~i~i~~~  140 (166)
T PF13349_consen  118 NKSKITIYLPKDYKLDKIDIKTS  140 (166)
T ss_pred             CCcEEEEEECCCCceeEEEEEec
Confidence            4678999999997  57877664


No 31 
>cd06494 p23_NUDCD2_like p23-like NUD (nuclear distribution) C-like found in human NUDC domain-containing protein 2 (NUDCD2) and similar proteins.  Little is known about the function of the proteins in this subgroup.
Probab=21.54  E-value=1.1e+02  Score=20.59  Aligned_cols=22  Identities=5%  Similarity=0.167  Sum_probs=17.4

Q ss_pred             ccceeeEEEEEEcCCCeeeeeE
Q psy4031          32 RAQPRNAKVNIILPEHSLVRQI   53 (115)
Q Consensus        32 d~~~d~~~l~ViLPEGA~~i~v   53 (115)
                      .+..++++++|=||+|++-=++
T Consensus        11 ~QT~~eV~v~i~lp~~~~~kdv   32 (93)
T cd06494          11 YQTMDEVFIEVNVPPGTRAKDV   32 (93)
T ss_pred             EeEcCEEEEEEECCCCCceeeE
Confidence            6778999999999998744433


No 32 
>PF08813 Phage_tail_3:  Phage tail protein;  InterPro: IPR014918 This entry is represented by Bacteriophage T1, Orf41. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. Proteins of this entry include phage tail proteins. They probably include bacterial Ig-like domains related to IPR003343 from INTERPRO. Which also includes a number of phage tail invasin proteins. 
Probab=21.53  E-value=1.1e+02  Score=22.87  Aligned_cols=35  Identities=11%  Similarity=0.128  Sum_probs=24.4

Q ss_pred             cceEeeec--CChhhhcc--ccceeeEEEEEEcCCCeee
Q psy4031          16 ALYIPGHR--RSPYDYAA--RAQPRNAKVNIILPEHSLV   50 (115)
Q Consensus        16 y~ftiGyn--~pl~~~l~--d~~~d~~~l~ViLPEGA~~   50 (115)
                      .+|++.|.  +|-+.-|+  +-.=+-..+|+.||.|++.
T Consensus        90 lt~t~a~dp~~~~y~aL~~a~~~~~~~a~r~~~p~G~~~  128 (165)
T PF08813_consen   90 LTFTFAYDPDDPGYKALRAADDSKEVRAFRVTLPNGSTI  128 (165)
T ss_pred             EEEEEEeCCCCHHHHHHHHHhhcCCeEEEEEEcCCCCEE
Confidence            46888888  55555555  3333445689999999977


No 33 
>PF14879 DUF4489:  Domain of unknown function (DUF4489)
Probab=20.65  E-value=2.2e+02  Score=21.23  Aligned_cols=53  Identities=26%  Similarity=0.367  Sum_probs=35.2

Q ss_pred             EEEcCCCeeeeeEeCCcceeeccccceEEeccCC--CCcEEEEEe-ccCcccCcceEEEEEEeCch
Q psy4031          41 NIILPEHSLVRQIHTPYPVSSQSDGTHSTYLDTA--GRPMITLRA-HNLVENHIEDIEIEYEFSKS  103 (115)
Q Consensus        41 ~ViLPEGA~~i~v~~P~~v~~~~~~~~~TYLDt~--GRp~v~l~~-~Nlvd~h~~~i~V~Y~~~~~  103 (115)
                      .+.||+++.   ..+||++.+..       |||.  -.|.|.|++ .|++-.-.....+.++.-+.
T Consensus        12 ~~~lP~~t~---~~~~~~vAslt-------lDTs~l~~p~ikLeFtsnI~~~~~~~~tLnFQifK~   67 (141)
T PF14879_consen   12 SVPLPVITT---AGTTFTVASLT-------LDTSCLCNPCIKLEFTSNIVTPAAVSGTLNFQIFKQ   67 (141)
T ss_pred             ceEcccccc---CCceEEEEEEE-------EccccCcCcEEEEEEEEeEEeeeeeEEEEEEEEeeh
Confidence            368999888   77888876544       6764  578899998 56655444445555554443


No 34 
>PRK11615 hypothetical protein; Provisional
Probab=20.36  E-value=1.8e+02  Score=22.68  Aligned_cols=42  Identities=14%  Similarity=0.288  Sum_probs=28.4

Q ss_pred             EEEEEcCCCeeeeeEeCCcceeeccccceEEeccCCCCcEE-EEEecc
Q psy4031          39 KVNIILPEHSLVRQIHTPYPVSSQSDGTHSTYLDTAGRPMI-TLRAHN   85 (115)
Q Consensus        39 ~l~ViLPEGA~~i~v~~P~~v~~~~~~~~~TYLDt~GRp~v-~l~~~N   85 (115)
                      .+.+.||.|-.+..-  +  . -..--..+=|.|.+||.+| ++..+|
T Consensus        49 Kl~FtLPag~sdqsg--k--~-Gtq~nn~~vYad~tg~kavIVi~gD~   91 (185)
T PRK11615         49 KLSFTLPADMSDQSG--K--L-GTQANNMHVYADATGQKAVIVILGDD   91 (185)
T ss_pred             EEEEEcCCccccccc--c--c-cccccceEEEEcCCCCEEEEEEeCCC
Confidence            578899999887632  1  1 1233456789998888886 776554


Done!