Query         psy5004
Match_columns 173
No_of_seqs    55 out of 57
Neff          3.3 
Searched_HMMs 46136
Date          Fri Aug 16 23:22:54 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy5004.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5004hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd00117 LU Ly-6 antigen / uPA   97.5 0.00015 3.2E-09   50.4   4.0   47   20-74      1-52  (79)
  2 smart00134 LU Ly-6 antigen / u  97.1 0.00049 1.1E-08   48.0   3.4   45   20-72      1-49  (79)
  3 PF00087 Toxin_1:  Snake toxin;  96.4  0.0035 7.6E-08   43.4   3.2   44   20-74      1-45  (63)
  4 cd00206 snake_toxin Snake toxi  96.0  0.0079 1.7E-07   41.7   3.2   46   20-75      1-47  (64)
  5 PF00021 UPAR_LY6:  u-PAR/Ly-6   95.5   0.018 3.8E-07   37.2   3.3   43   22-72      1-48  (77)
  6 PF06211 BAMBI:  BMP and activi  94.9   0.031 6.8E-07   43.8   3.4   67   12-97     18-85  (107)
  7 PF01064 Activin_recp:  Activin  94.7   0.031 6.8E-07   38.8   2.9   78   20-135     1-80  (83)
  8 PF05444 DUF753:  Protein of un  94.5   0.057 1.2E-06   42.5   4.1   51   18-75     81-133 (152)
  9 PF05444 DUF753:  Protein of un  93.5    0.11 2.4E-06   40.8   4.0   49   23-75      1-50  (152)
 10 PF02988 PLA2_inh:  Phospholipa  91.9    0.27 5.9E-06   36.7   4.0   45   22-73      3-51  (83)
 11 PF06579 Ly-6_related:  Caenorh  89.0    0.61 1.3E-05   37.2   4.0   85   41-133    38-129 (129)
 12 TIGR02052 MerP mercuric transp  56.7       9 0.00019   24.8   1.9   19    1-19      1-20  (92)
 13 KOG2052|consensus               53.5      34 0.00073   33.3   5.8   44   19-74     34-78  (513)
 14 PRK11443 lipoprotein; Provisio  47.9      13 0.00028   29.3   1.8   28    1-38      1-32  (124)
 15 PF11912 DUF3430:  Protein of u  45.3      16 0.00035   29.2   2.0   14    1-14      1-14  (212)
 16 PF11137 DUF2909:  Protein of u  35.5      22 0.00047   25.4   1.3   20    1-20      1-20  (63)
 17 PF08194 DIM:  DIM protein;  In  30.0      38 0.00083   22.0   1.6    7    1-7       1-7   (36)
 18 PF11777 DUF3316:  Protein of u  28.4      38 0.00083   25.6   1.6   19    1-19      1-19  (114)
 19 PRK09810 entericidin A; Provis  26.3      59  0.0013   21.6   2.0   14    1-14      2-15  (41)
 20 PF07172 GRP:  Glycine rich pro  25.1      58  0.0013   24.6   2.1    8    2-9       4-11  (95)
 21 PF11153 DUF2931:  Protein of u  24.2      58  0.0013   26.8   2.1   46    1-57      1-47  (216)
 22 PF13956 Ibs_toxin:  Toxin Ibs,  24.0      42 0.00091   19.2   0.8   11    1-11      2-12  (19)
 23 PF00879 Defensin_propep:  Defe  21.6      71  0.0015   22.3   1.8   20    1-20      1-22  (52)
 24 PF04202 Mfp-3:  Foot protein 3  20.5      65  0.0014   23.8   1.5   17    3-19      8-24  (71)
 25 PF08138 Sex_peptide:  Sex pept  20.1      34 0.00075   24.3   0.0   15    1-15      1-15  (56)

No 1  
>cd00117 LU Ly-6 antigen / uPA receptor -like domain; occurs singly in GPI-linked cell-surface glycoproteins (Ly-6 family,CD59, thymocyte B cell antigen, Sgp-2) or as three-fold repeated domain in urokinase-type plasminogen activator receptor. Topology of these domains is similar to that of snake venom neurotoxins.
Probab=97.49  E-value=0.00015  Score=50.40  Aligned_cols=47  Identities=30%  Similarity=0.732  Sum_probs=36.9

Q ss_pred             eeeecccCC-CCCCCCCCCCCccccccCCCCCCceEEEEEecC----cceeEeechhhhh
Q psy5004          20 LLCYQCVST-QPGCGTPFKWLWYPSISCPEEDDICVKIIERKG----AEEIITRDCLSTV   74 (173)
Q Consensus        20 L~CY~C~S~-~PgC~dpfn~~~~~~etCp~~~d~CvKVier~g----~~~~itRdClS~~   74 (173)
                      |+||.|.+. ...|..+        ++||+.++.|+++.....    ....++|+|+++.
T Consensus         1 L~C~~C~~~~~~~C~~~--------~~C~~~~~~C~~~~~~~~~~~~~~~~~~rgC~~~C   52 (79)
T cd00117           1 LECYSCTGVSTSSCSTE--------TNCPSPDDQCLTAVATVIEESVRLSLVVRGCASDC   52 (79)
T ss_pred             CccCcCCCCCCCCCCCC--------CccCCCCCEeeEEEEEEEeeccccceEECcccCCC
Confidence            789999883 4468876        599999999999877742    3466889999954


No 2  
>smart00134 LU Ly-6 antigen / uPA receptor -like domain. Three-fold repeated domain in urokinase-type plasminogen activator receptor; occurs singly in other GPI-linked cell-surface glycoproteins (Ly-6 family, CD59, thymocyte B cell antigen, Sgp-2). Topology of these domains is similar to that of snake venom neurotoxins.
Probab=97.14  E-value=0.00049  Score=48.03  Aligned_cols=45  Identities=22%  Similarity=0.751  Sum_probs=34.2

Q ss_pred             eeeecccCC-CCCCCCCCCCCccccccCCCCCCceEEEEEecC---cceeEeechhh
Q psy5004          20 LLCYQCVST-QPGCGTPFKWLWYPSISCPEEDDICVKIIERKG---AEEIITRDCLS   72 (173)
Q Consensus        20 L~CY~C~S~-~PgC~dpfn~~~~~~etCp~~~d~CvKVier~g---~~~~itRdClS   72 (173)
                      |.||.|.+. ...|..+        ++||+.++.|.+++.+..   ....++|+|.+
T Consensus         1 L~C~~C~~~~~~~C~~~--------~~C~~~~~~C~~~~~~~~~~~~~~~~~rgC~~   49 (79)
T smart00134        1 LQCYSCTGNPDSSCSSE--------EECRSPDDVCLTVVAEVISGSVGSVVYKGCAT   49 (79)
T ss_pred             CcceecCCcCCCCCCCc--------ceeCCCCCEeEEEEEEEEecCCCceEEcCccC
Confidence            689999875 3468776        579999999999875432   24568899993


No 3  
>PF00087 Toxin_1:  Snake toxin;  InterPro: IPR003571 Snake toxins belong to a family of proteins [] which groups short and long neurotoxins, cytotoxins and short toxins, as well as a other miscellaneous venom peptides. Most of these toxins act by binding to the nicotinic acetylcholine receptors in the postsynaptic membrane of skeletal muscles and prevent the binding of acetylcholine, thereby blocking the excitation of muscles. Snake toxins are proteins that consist of sixty to seventy five amino acids. Among the invariant residues are eight cysteines all involved in disulphide bonds. The structure is small, disulphide-rich, nearly all beta sheet.; GO: 0005576 extracellular region; PDB: 3NEQ_B 2H5F_A 3PLC_A 1NOR_A 1UG4_A 1FFJ_A 1CCQ_A 1CB9_A 1VB0_A 1ONJ_A ....
Probab=96.42  E-value=0.0035  Score=43.41  Aligned_cols=44  Identities=34%  Similarity=0.704  Sum_probs=34.4

Q ss_pred             eeeecccCCCCCCCCCCCCCccccccCCCCCCceE-EEEEecCcceeEeechhhhh
Q psy5004          20 LLCYQCVSTQPGCGTPFKWLWYPSISCPEEDDICV-KIIERKGAEEIITRDCLSTV   74 (173)
Q Consensus        20 L~CY~C~S~~PgC~dpfn~~~~~~etCp~~~d~Cv-KVier~g~~~~itRdClS~~   74 (173)
                      |.||++.+ .  |..        .++||+.+++|. |+....+...+|.|+|.++.
T Consensus         1 ~~C~~~~~-~--~~~--------~~tC~~ge~~Cy~k~~~~~~~~~~~~rGCa~tC   45 (63)
T PF00087_consen    1 RTCYNCPS-S--CFT--------TKTCPEGENICYKKTWMDHPRGMRVERGCAATC   45 (63)
T ss_dssp             -EEEESST-S--STE--------EEEESTTSSEEEEEEEEETSTEEEEEEEEESSS
T ss_pred             CCccCCCC-C--CCc--------ceECCCccCeeEEEEEecCCCCceEecchhccC
Confidence            57888877 4  444        489999999998 66666657889999999876


No 4  
>cd00206 snake_toxin Snake toxin domain, present in short and long neurotoxins, cytotoxins and short toxins, and in other miscellaneous venom peptides. The toxin acts by binding to the nicotinic acetylcholine receptors in the postsynaptic membrane of skeletal muscles and preventing the binding of acetylcholine, thereby blocking the excitation of muscles. This domain contains 60-75 amino acids that are fixed by 4-5 disulfide bridges and is nearly all beta sheet; it exists as either monomers or dimers.
Probab=95.98  E-value=0.0079  Score=41.67  Aligned_cols=46  Identities=33%  Similarity=0.579  Sum_probs=33.9

Q ss_pred             eeeecccCCCCCCCCCCCCCccccccCCCCCCceEEEEEec-CcceeEeechhhhhh
Q psy5004          20 LLCYQCVSTQPGCGTPFKWLWYPSISCPEEDDICVKIIERK-GAEEIITRDCLSTVK   75 (173)
Q Consensus        20 L~CY~C~S~~PgC~dpfn~~~~~~etCp~~~d~CvKVier~-g~~~~itRdClS~~~   75 (173)
                      |.||+|.+..  |..        .++||+.+++|.|..... +....|.|+|.++.-
T Consensus         1 ~~C~~~~~~~--~~~--------~~tC~~ge~~Cyk~~~~~~~~g~~i~rGCa~tCP   47 (64)
T cd00206           1 LTCYNCPSPF--PFT--------TETCPDGENICYKKSWKDTPRGVRIERGCAATCP   47 (64)
T ss_pred             CCccCCCCCC--CCc--------ceeCCcccCccceeEEEcCCCCCEEEccccCcCc
Confidence            5688887663  333        489999999999886554 445578899998763


No 5  
>PF00021 UPAR_LY6:  u-PAR/Ly-6 domain omitted due to poor similarity.;  InterPro: IPR001526 CD59 (also called 1F-5Ag, H19, HRF20, MACIF, MIRL, P-18 or protectin) inhibits formation of membrane attack complex (MAC), thus protecting cells from complement mediated lysis. It has a signalling role, as a GPI-anchored molecule, in T cell activation and appears to have some role in cell adhesion through CD2 (controversial). CD59 associates with C9, inhibiting incorporation into C5b-8 preventing terminal steps in polymerisation of the (MAC) in plasma membranes. Genetic defects in GPI-anchor attachment that cause a reduction or loss of both CD59 and CD55 on erythrocytes produce the symptoms of the disease paroxysmal nocturnal haemoglobinuria (PNH).  A variety of GPI-linked cell-surface glycoproteins are composed of one or more copies of a conserved domain of about 100 amino-acid residues [], []. Among these proteins, U-PAR contains three tandem copies of the domain, while all the others are made up of a single domain. As shown in the following schematic, this conserved domain contains 10 cysteine residues involved in five disulphide bonds - in U-PAR, the first copy of the domain lacks the fourth disulphide bond.  +------+ +------------------------+ +---+ | | | | | | xCxxCxxxxxxCxxxxxCxxxxxCxxxxxxxxxxxxxxxxxxCxxxxCxxxxxxxxxxxxxxCCxxxCxxxxxxxx | | | | +---------------------+ +--------------+ 'C': conserved cysteine involved in a disulphide bond.  CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/).; PDB: 2I9B_H 1YWH_E 3BT2_U 2FD6_U 3U74_U 3BT1_U 3U73_U 2L03_A 3LAQ_V 2J8B_A ....
Probab=95.50  E-value=0.018  Score=37.24  Aligned_cols=43  Identities=23%  Similarity=0.794  Sum_probs=33.8

Q ss_pred             eecccCCC-CCCCCCCCCCccccccCCCCCCceEEEEEecC----cceeEeechhh
Q psy5004          22 CYQCVSTQ-PGCGTPFKWLWYPSISCPEEDDICVKIIERKG----AEEIITRDCLS   72 (173)
Q Consensus        22 CY~C~S~~-PgC~dpfn~~~~~~etCp~~~d~CvKVier~g----~~~~itRdClS   72 (173)
                      ||.|.+.. ..|..+        ++||.+++.|+.+..+..    ....+.|+|..
T Consensus         1 C~~C~~~~~~~C~~~--------~~C~~~~~~C~~~~~~~~~g~~~~~~~~kgC~~   48 (77)
T PF00021_consen    1 CYSCYSPSSSSCSGE--------VTCPGSEDVCYTATVTVTSGGNSVTLVSKGCAT   48 (77)
T ss_dssp             EEEEECSTTTTSCTE--------EEECTTTSEEEEEEEEEETTTCCEEEEEEEEEE
T ss_pred             CCcccCCCCCCCCCC--------CCcCCCCCeeEEEEEEeeecccCceEEEeeccc
Confidence            89999863 369885        699999999987776543    36788899994


No 6  
>PF06211 BAMBI:  BMP and activin membrane-bound inhibitor (BAMBI) N-terminal domain;  InterPro: IPR009345 This family consists of several eukaryotic BMP and activin membrane-bound inhibitor (BAMBI) proteins. Members of the transforming growth factor-beta (TGF-beta) superfamily, including TGF-beta, bone morphogenetic proteins (BMPs), activins and nodals, are vital for regulating growth and differentiation. BAMBI is related to TGF-beta-family type I receptors but lacks an intracellular kinase domain. BAMBI is co-expressed with the ventralising morphogen BMP4 during Xenopus embryogenesis and requires BMP signalling for its expression. The protein stably associates with TGF-beta-family receptors and inhibits BMP and activin as well as TGF-beta signalling [].
Probab=94.86  E-value=0.031  Score=43.79  Aligned_cols=67  Identities=18%  Similarity=0.582  Sum_probs=47.2

Q ss_pred             HHHhccCceeeecccCCCCCCCCCCCCCccccccCCCCCCce-EEEEEecCcceeEeechhhhhhhhhhcCCcccccccc
Q psy5004          12 SFVSRCSALLCYQCVSTQPGCGTPFKWLWYPSISCPEEDDIC-VKIIERKGAEEIITRDCLSTVKAFRTDIPADRYEGCR   90 (173)
Q Consensus        12 ~~i~~geAL~CY~C~S~~PgC~dpfn~~~~~~etCp~~~d~C-vKVier~g~~~~itRdClS~~~~~r~dipad~yegcr   90 (173)
                      .++.+ .+++|| |+..+  |..+       +..|-.+-..| -|+..+...++..+++|+-++.        ...+.|.
T Consensus        18 ~lltk-GEIRCY-Cn~P~--CV~T-------GYMCkSel~~CfSklldp~n~nspl~HGCld~~~--------~~~~~C~   78 (107)
T PF06211_consen   18 VLLTK-GEIRCY-CNAPH--CVAT-------GYMCKSELNACFSKLLDPQNTNSPLTHGCLDSLA--------STADICK   78 (107)
T ss_pred             HHHhc-CceEEe-cCCcc--eecc-------cceeccccccceeeccCcccCCCccccccccccc--------CCCCccc
Confidence            34444 459999 87655  9998       67999888899 4565566677889999998773        3456666


Q ss_pred             cCcCCCc
Q psy5004          91 AAAKDPK   97 (173)
Q Consensus        91 ~~a~d~~   97 (173)
                      +....-+
T Consensus        79 ~~~~~~~   85 (107)
T PF06211_consen   79 SEQAQNH   85 (107)
T ss_pred             Ccccccc
Confidence            6544333


No 7  
>PF01064 Activin_recp:  Activin types I and II receptor domain;  InterPro: IPR000472 Transforming growth factor-beta (TGF-beta) forms a family with other growth factors described in PDOC00223 from PROSITEDOC. The receptors for most of the members of this growth factor family are related. These proteins are receptor-type kinases of Ser/Thr type PDOC00100 from PROSITEDOC), which have a single transmembrane domain and a specific hydrophilic Cys-rich ligand-binding domain [, , ]. The C-terminal part of the extracellular domain is conserved. Some of the receptors of this family contain subclass-specific N-terminal extensions of this homology domain. The type I receptors also possess 7 extracellular residues preceding the cysteine box.; GO: 0004675 transmembrane receptor protein serine/threonine kinase activity, 0005024 transforming growth factor beta-activated receptor activity, 0016020 membrane; PDB: 3EVS_C 1BTE_A 1LX5_B 2GOO_C 2HLR_A 4FAO_W 2H64_C 2H62_D 1S4Y_C 1NYU_C ....
Probab=94.73  E-value=0.031  Score=38.81  Aligned_cols=78  Identities=26%  Similarity=0.534  Sum_probs=56.0

Q ss_pred             eeeecccCCCCCCCCCCCCCccccccCCCCCCceEEEEEecCc--ceeEeechhhhhhhhhhcCCcccccccccCcCCCc
Q psy5004          20 LLCYQCVSTQPGCGTPFKWLWYPSISCPEEDDICVKIIERKGA--EEIITRDCLSTVKAFRTDIPADRYEGCRAAAKDPK   97 (173)
Q Consensus        20 L~CY~C~S~~PgC~dpfn~~~~~~etCp~~~d~CvKVier~g~--~~~itRdClS~~~~~r~dipad~yegcr~~a~d~~   97 (173)
                      |+|| |.+.+  |.++       ..+|.. ...|.+.++....  .++++|+|++    ...+.+.+. ..|...++.  
T Consensus         1 l~C~-C~~~~--C~~~-------n~~C~t-~~~Cf~~~~~~~~~~~~~~~~GC~~----~~~~~~~~~-~~C~~~~~~--   62 (83)
T PF01064_consen    1 LKCY-CDSDD--CNET-------NQTCET-GGFCFTSWEQDNGGGVEIVKKGCWS----NEEDCPLCR-NQCRSSSSP--   62 (83)
T ss_dssp             EEEE-EETTC--HCTT-------TEEEEE-SCEEEEEEEEETTESEEEEEEEEEE----STGGGTTTS-SSEEECSSS--
T ss_pred             CEeE-eCccc--CCCC-------CCEeCC-CCEEEEEEEEeeccceeEEEcCccC----CccCcceec-ceeeccCCC--
Confidence            7899 98877  8865       368876 8899888888766  6899999998    445666666 888884443  


Q ss_pred             chhhhccCCCCccccCceeeccceEEEeeccccccccc
Q psy5004          98 LANYVNHSIPELDVQRPSELKTGYTEEIITRDCLSTVK  135 (173)
Q Consensus        98 l~~Yv~~sike~D~kr~~~~~t~~~~~~~~~~C~s~~~  135 (173)
                                         ....+| |.-+..||+.++
T Consensus        63 -------------------~~~~~C-C~~~D~CN~~~~   80 (83)
T PF01064_consen   63 -------------------SSVVFC-CCSGDFCNSNLS   80 (83)
T ss_dssp             -------------------TCEEEE-EESSTTGGGSEE
T ss_pred             -------------------CCeEEE-CCCCCccCCCCC
Confidence                               112345 445778888765


No 8  
>PF05444 DUF753:  Protein of unknown function (DUF753);  InterPro: IPR008472 This entry contains sequences which are repeated in several uncharacterised proteins from Drosophila melanogaster.
Probab=94.46  E-value=0.057  Score=42.46  Aligned_cols=51  Identities=31%  Similarity=0.727  Sum_probs=36.6

Q ss_pred             CceeeecccCCCCCCCCCCCCCccccccCCC--CCCceEEEEEecCcceeEeechhhhhh
Q psy5004          18 SALLCYQCVSTQPGCGTPFKWLWYPSISCPE--EDDICVKIIERKGAEEIITRDCLSTVK   75 (173)
Q Consensus        18 eAL~CY~C~S~~PgC~dpfn~~~~~~etCp~--~~d~CvKVier~g~~~~itRdClS~~~   75 (173)
                      ..|+||+|. ....|..+-   ....+.|+.  ..+.|...+.   ....+.|+|+|+|.
T Consensus        81 ~~~~C~qC~-~~~~C~~~~---~~~~~~C~~~~~~d~Cyt~~~---~~~~~~RGC~s~l~  133 (152)
T PF05444_consen   81 DRLSCYQCT-DDSDCVSSD---SSTAKPCPNYSEDDQCYTRVD---DGGVVERGCLSDLE  133 (152)
T ss_pred             cCCEeeeCC-CCCCCCCCC---CcCcCcCcCCCCCCcCEEEEe---eCCEEEeCCCCccc
Confidence            459999998 666791111   123689986  4899988776   34557899999884


No 9  
>PF05444 DUF753:  Protein of unknown function (DUF753);  InterPro: IPR008472 This entry contains sequences which are repeated in several uncharacterised proteins from Drosophila melanogaster.
Probab=93.47  E-value=0.11  Score=40.81  Aligned_cols=49  Identities=33%  Similarity=0.675  Sum_probs=33.8

Q ss_pred             ecccCC-CCCCCCCCCCCccccccCCCCCCceEEEEEecCcceeEeechhhhhh
Q psy5004          23 YQCVST-QPGCGTPFKWLWYPSISCPEEDDICVKIIERKGAEEIITRDCLSTVK   75 (173)
Q Consensus        23 Y~C~S~-~PgC~dpfn~~~~~~etCp~~~d~CvKVier~g~~~~itRdClS~~~   75 (173)
                      |+|.|. .+.|..+- +.....+.|++..+.|+.++...+.   +.|+|+++|.
T Consensus         1 ~~C~s~~~~~C~~~~-~~~~~~~~C~~~~~~C~t~~~~~~~---~~RGC~~~~~   50 (152)
T PF05444_consen    1 YSCDSCEDPNCTDPD-PSTSATEECPQQDDQCYTRFDDDGT---TIRGCLSDLP   50 (152)
T ss_pred             CCccCCcCCCCCCCC-CCceecccCCCCCCCCEEEEeCCCc---eEcCchhhcc
Confidence            455554 67893322 1233368999998889999976544   6799999884


No 10 
>PF02988 PLA2_inh:  Phospholipase A2 inhibitor;  InterPro: IPR004126 Proteins in this entry inhibit basic phospholipase A2 isozymes in snake's venom [, ].; GO: 0004859 phospholipase inhibitor activity, 0005576 extracellular region
Probab=91.87  E-value=0.27  Score=36.72  Aligned_cols=45  Identities=22%  Similarity=0.640  Sum_probs=33.7

Q ss_pred             eecccCCCCCCCCCCCCCccccccCCCCCCceEEEEEec---Ccce-eEeechhhh
Q psy5004          22 CYQCVSTQPGCGTPFKWLWYPSISCPEEDDICVKIIERK---GAEE-IITRDCLST   73 (173)
Q Consensus        22 CY~C~S~~PgC~dpfn~~~~~~etCp~~~d~CvKVier~---g~~~-~itRdClS~   73 (173)
                      |.-|.+.++.|.-.       .+||..++|.||+++...   |... ++.+.|+++
T Consensus         3 CEvC~~~G~~C~G~-------~~tC~~~eDtC~~~~~E~~~~~~s~~~~~K~C~~S   51 (83)
T PF02988_consen    3 CEVCHGSGKDCSGK-------MKTCEDGEDTCVTVVTEVSSAGVSFRTTHKGCFSS   51 (83)
T ss_pred             cceecCcCCCCCCC-------eeEcCCCCCEEEEEEEEeccCCeEEEEEEecccCc
Confidence            77788888889887       489999999998886543   2222 335888874


No 11 
>PF06579 Ly-6_related:  Caenorhabditis elegans ly-6-related protein;  InterPro: IPR010558 This family consists of several Caenorhabditis elegans specific ly-6-related HOT and ODR proteins. These proteins are involved in the olfactory system. Odr-2 mutants are known to be defective in the ability to chemotax to odorants that are recognised by the two AWC olfactory neurons. Odr-2 encodes a membrane-associated protein related to the Ly-6 superfamily of GPI-linked signalling proteins [].
Probab=89.02  E-value=0.61  Score=37.17  Aligned_cols=85  Identities=20%  Similarity=0.373  Sum_probs=47.8

Q ss_pred             cccccCCCCCCceEEEEEe---cCc--ceeEeechhhhh--hhhhhcCCcccccccccCcCCCcchhhhccCCCCccccC
Q psy5004          41 YPSISCPEEDDICVKIIER---KGA--EEIITRDCLSTV--KAFRTDIPADRYEGCRAAAKDPKLANYVNHSIPELDVQR  113 (173)
Q Consensus        41 ~~~etCp~~~d~CvKVier---~g~--~~~itRdClS~~--~~~r~dipad~yegcr~~a~d~~l~~Yv~~sike~D~kr  113 (173)
                      ..++.|   .++||.++|.   .|+  ...+.|||++++  .+|...+=...++. +...++.++.+-+....+..   .
T Consensus        38 i~~~~C---~s~Cvtl~e~~~~~gg~~~~~yIRGC~~~I~~~G~n~t~t~~~~~~-~~~C~~~~~s~Lf~~~~~~~---~  110 (129)
T PF06579_consen   38 IPTVPC---SSICVTLFEPLDIFGGTRGFGYIRGCLDSILRHGFNDTITLSFLDR-RDICRRLRLSQLFPPGRRRE---H  110 (129)
T ss_pred             cceeec---CchheeeEeccccccccccccEEEcCHHHHHHhCCCCchHHHHhcC-CCeeeeccHHHhcCcccccc---c
Confidence            345888   7899999999   233  345569999999  45544322233333 33445666655443332211   0


Q ss_pred             ceeeccceEEEeeccccccc
Q psy5004         114 PSELKTGYTEEIITRDCLST  133 (173)
Q Consensus       114 ~~~~~t~~~~~~~~~~C~s~  133 (173)
                      .--...+.|.|.- ..||++
T Consensus       111 ~~~~~v~vCsC~g-d~CN~s  129 (129)
T PF06579_consen  111 PSDSTVQVCSCYG-DRCNGS  129 (129)
T ss_pred             CCCCeEEEEEECC-CccCCC
Confidence            1112346788864 458764


No 12 
>TIGR02052 MerP mercuric transport protein periplasmic component. This model represents the periplasmic mercury (II) binding protein of the bacterial mercury detoxification system which passes mercuric ion to the MerT transporter for subsequent reduction to Hg(0) by the mercuric reductase MerA. MerP contains a distinctive GMTCXXC motif associated with metal binding. MerP is related to a larger family of metal binding proteins (pfam00403).
Probab=56.66  E-value=9  Score=24.84  Aligned_cols=19  Identities=37%  Similarity=0.310  Sum_probs=12.2

Q ss_pred             ChHHHHHHHHHHHHhc-cCc
Q psy5004           1 MKLLVLTLSLISFVSR-CSA   19 (173)
Q Consensus         1 MK~~l~~LLLl~~i~~-geA   19 (173)
                      ||.++.+|+||+.... .+|
T Consensus         1 ~~~~~~~~~~~~~~~~~~~~   20 (92)
T TIGR02052         1 MKKLATLLALFVLTSLPAWA   20 (92)
T ss_pred             ChhHHHHHHHHHHhcchhhh
Confidence            8888877776644333 555


No 13 
>KOG2052|consensus
Probab=53.50  E-value=34  Score=33.30  Aligned_cols=44  Identities=27%  Similarity=0.579  Sum_probs=32.2

Q ss_pred             ceeeecccCCCCCCCCCCCCCccccccCCCCCCceEEEEE-ecCcceeEeechhhhh
Q psy5004          19 ALLCYQCVSTQPGCGTPFKWLWYPSISCPEEDDICVKIIE-RKGAEEIITRDCLSTV   74 (173)
Q Consensus        19 AL~CY~C~S~~PgC~dpfn~~~~~~etCp~~~d~CvKVie-r~g~~~~itRdClS~~   74 (173)
                      +++|| |.+-.+  .++        .||.- +-.|-..++ ..++.+...|+|+..-
T Consensus        34 ~~~C~-C~~~~~--~n~--------~tC~t-~g~C~~s~~~~~~g~~~~~~gC~~~~   78 (513)
T KOG2052|consen   34 LLKCY-CSSCLC--SNA--------NTCET-DGACFVSVEENDDGKEQHHRGCMTLE   78 (513)
T ss_pred             eEEEE-ecCCCc--CCC--------Ceeee-cCeEEEEEEecCCCceEEEecccccc
Confidence            68999 766663  332        27766 788966665 8888889999999843


No 14 
>PRK11443 lipoprotein; Provisional
Probab=47.87  E-value=13  Score=29.35  Aligned_cols=28  Identities=18%  Similarity=0.385  Sum_probs=18.4

Q ss_pred             ChHHHHHHHHHHHHhccCceeeecccCC---CC-CCCCCCCC
Q psy5004           1 MKLLVLTLSLISFVSRCSALLCYQCVST---QP-GCGTPFKW   38 (173)
Q Consensus         1 MK~~l~~LLLl~~i~~geAL~CY~C~S~---~P-gC~dpfn~   38 (173)
                      ||.++++++++++..         |.|.   .| .|.. -+|
T Consensus         1 Mk~~~~~~~~~lLsg---------Ca~~~~~~~~~~~~-~dW   32 (124)
T PRK11443          1 MKKFIAPLLALLLSG---------CQIDPYTHAPTLTS-TDW   32 (124)
T ss_pred             ChHHHHHHHHHHHHh---------ccCCCCCChhhhcc-cCH
Confidence            998777777665544         8773   33 4655 578


No 15 
>PF11912 DUF3430:  Protein of unknown function (DUF3430);  InterPro: IPR021837  This family of proteins are functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 209 to 265 amino acids in length. 
Probab=45.33  E-value=16  Score=29.18  Aligned_cols=14  Identities=43%  Similarity=0.641  Sum_probs=8.5

Q ss_pred             ChHHHHHHHHHHHH
Q psy5004           1 MKLLVLTLSLISFV   14 (173)
Q Consensus         1 MK~~l~~LLLl~~i   14 (173)
                      ||.++.+|||++++
T Consensus         1 MKll~~lilli~~~   14 (212)
T PF11912_consen    1 MKLLISLILLILLI   14 (212)
T ss_pred             CcHHHHHHHHHHHH
Confidence            99965555554433


No 16 
>PF11137 DUF2909:  Protein of unknown function (DUF2909);  InterPro: IPR021313  This is a family of proteins conserved in Proteobacteria of unknown function. 
Probab=35.49  E-value=22  Score=25.37  Aligned_cols=20  Identities=45%  Similarity=0.502  Sum_probs=15.8

Q ss_pred             ChHHHHHHHHHHHHhccCce
Q psy5004           1 MKLLVLTLSLISFVSRCSAL   20 (173)
Q Consensus         1 MK~~l~~LLLl~~i~~geAL   20 (173)
                      ||.++++++++..++-+.|+
T Consensus         1 ~Ki~iv~lll~ii~sL~saL   20 (63)
T PF11137_consen    1 MKILIVLLLLAIIASLFSAL   20 (63)
T ss_pred             ChHHHHHHHHHHHHHHHHHH
Confidence            78988888888777766665


No 17 
>PF08194 DIM:  DIM protein;  InterPro: IPR013172 Drosophila immune-induced molecules (DIMs) are short proteins induced during the immune response of Drosophila []. This entry includes DIMs 1 to 4 and DIM23.
Probab=30.04  E-value=38  Score=22.05  Aligned_cols=7  Identities=57%  Similarity=0.648  Sum_probs=4.3

Q ss_pred             ChHHHHH
Q psy5004           1 MKLLVLT    7 (173)
Q Consensus         1 MK~~l~~    7 (173)
                      ||.+.++
T Consensus         1 Mk~l~~a    7 (36)
T PF08194_consen    1 MKCLSLA    7 (36)
T ss_pred             CceeHHH
Confidence            7776553


No 18 
>PF11777 DUF3316:  Protein of unknown function (DUF3316);  InterPro: IPR016879 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=28.36  E-value=38  Score=25.59  Aligned_cols=19  Identities=37%  Similarity=0.401  Sum_probs=14.1

Q ss_pred             ChHHHHHHHHHHHHhccCc
Q psy5004           1 MKLLVLTLSLISFVSRCSA   19 (173)
Q Consensus         1 MK~~l~~LLLl~~i~~geA   19 (173)
                      ||.++++++++++.....|
T Consensus         1 MKk~~ll~~~ll~s~~a~A   19 (114)
T PF11777_consen    1 MKKIILLASLLLLSSSAFA   19 (114)
T ss_pred             CchHHHHHHHHHHHHHHhh
Confidence            8988888877666665555


No 19 
>PRK09810 entericidin A; Provisional
Probab=26.34  E-value=59  Score=21.60  Aligned_cols=14  Identities=36%  Similarity=0.420  Sum_probs=9.3

Q ss_pred             ChHHHHHHHHHHHH
Q psy5004           1 MKLLVLTLSLISFV   14 (173)
Q Consensus         1 MK~~l~~LLLl~~i   14 (173)
                      ||.++++++++++.
T Consensus         2 Mkk~~~l~~~~~~~   15 (41)
T PRK09810          2 MKRLIVLVLLASTL   15 (41)
T ss_pred             hHHHHHHHHHHHHH
Confidence            77777777665553


No 20 
>PF07172 GRP:  Glycine rich protein family;  InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=25.13  E-value=58  Score=24.60  Aligned_cols=8  Identities=25%  Similarity=0.085  Sum_probs=3.5

Q ss_pred             hHHHHHHH
Q psy5004           2 KLLVLTLS    9 (173)
Q Consensus         2 K~~l~~LL    9 (173)
                      |++|++.|
T Consensus         4 K~~llL~l   11 (95)
T PF07172_consen    4 KAFLLLGL   11 (95)
T ss_pred             hHHHHHHH
Confidence            44444433


No 21 
>PF11153 DUF2931:  Protein of unknown function (DUF2931);  InterPro: IPR021326  Some members in this family of proteins are annotated as lipoproteins however this cannot be confirmed. Currently, there is no known function. 
Probab=24.21  E-value=58  Score=26.78  Aligned_cols=46  Identities=22%  Similarity=0.425  Sum_probs=26.2

Q ss_pred             ChHHHHHHHHHHHHhccCceeeecccCCCCC-CCCCCCCCccccccCCCCCCceEEEE
Q psy5004           1 MKLLVLTLSLISFVSRCSALLCYQCVSTQPG-CGTPFKWLWYPSISCPEEDDICVKII   57 (173)
Q Consensus         1 MK~~l~~LLLl~~i~~geAL~CY~C~S~~Pg-C~dpfn~~~~~~etCp~~~d~CvKVi   57 (173)
                      ||.+++++|++++..         |.+..+. =...+.|  ..+..-|....+=|.-.
T Consensus         1 mk~i~~l~l~lll~~---------C~~~~~~~~~~~~~W--~~~~~~P~~ypv~V~~~   47 (216)
T PF11153_consen    1 MKKILLLLLLLLLTG---------CSTNPNEPLQPYFEW--RFGVAAPKHYPVWVTYA   47 (216)
T ss_pred             ChHHHHHHHHHHHHh---------hcCCCccCCCCCCcc--EEEEecCCCCEEEEEEE
Confidence            899998886665555         7665331 1155666  33345576544444433


No 22 
>PF13956 Ibs_toxin:  Toxin Ibs, type I toxin-antitoxin system
Probab=24.04  E-value=42  Score=19.23  Aligned_cols=11  Identities=36%  Similarity=0.773  Sum_probs=5.6

Q ss_pred             ChHHHHHHHHH
Q psy5004           1 MKLLVLTLSLI   11 (173)
Q Consensus         1 MK~~l~~LLLl   11 (173)
                      ||.++.+++|+
T Consensus         2 Mk~vIIlvvLL   12 (19)
T PF13956_consen    2 MKLVIILVVLL   12 (19)
T ss_pred             ceehHHHHHHH
Confidence            56555544443


No 23 
>PF00879 Defensin_propep:  Defensin propeptide The pattern for this Prosite entry doesn't match the propeptide.;  InterPro: IPR002366 Defensins are 2-6 kDa, cationic, microbicidal peptides active against many Gram-negative and Gram-positive bacteria, fungi, and enveloped viruses [], containing three pairs of intramolecular disulphide bonds []. On the basis of their size and pattern of disulphide bonding, mammalian defensins are classified into alpha, beta and theta categories. Alpha-defensins, which have been identified in humans, monkeys and several rodent species, are particularly abundant in neutrophils, certain macrophage populations and Paneth cells of the small intestine. Every mammalian species explored thus far has beta-defensins. In cows, as many as 13 beta-defensins exist in neutrophils. However, in other species, beta-defensins are more often produced by epithelial cells lining various organs (e.g. the epidermis, bronchial tree and genitourinary tract). Theta-defensins are cyclic and have so far only been identified in primate phagocytes.   Defensins are produced constitutively and/or in response to microbial products or proinflammatory cytokines. Some defensins are also called corticostatins (CS) because they inhibit corticotropin-stimulated corticosteroid production. The mechanism(s) by which microorganisms are killed and/or inactivated by defensins is not understood completely. However, it is generally believed that killing is a consequence of disruption of the microbial membrane. The polar topology of defensins, with spatially separated charged and hydrophobic regions, allows them to insert themselves into the phospholipid membranes so that their hydrophobic regions are buried within the lipid membrane interior and their charged (mostly cationic) regions interact with anionic phospholipid head groups and water. Subsequently, some defensins can aggregate to form `channel-like' pores; others might bind to and cover the microbial membrane in a `carpet-like' manner. The net outcome is the disruption of membrane integrity and function, which ultimately leads to the lysis of microorganisms. Some defensins are synthesized as propeptides which may be relevant to this process - in neutrophils only the mature peptides have been identified but in Paneth cells, the propeptide is stored in vesicles [] and appears to be cleaved by trypsin on activation.  ; GO: 0006952 defense response
Probab=21.56  E-value=71  Score=22.26  Aligned_cols=20  Identities=25%  Similarity=0.453  Sum_probs=12.7

Q ss_pred             ChHHH--HHHHHHHHHhccCce
Q psy5004           1 MKLLV--LTLSLISFVSRCSAL   20 (173)
Q Consensus         1 MK~~l--~~LLLl~~i~~geAL   20 (173)
                      ||+++  .+|||+++..|.+.+
T Consensus         1 MRTL~LLaAlLLlAlqaQAepl   22 (52)
T PF00879_consen    1 MRTLALLAALLLLALQAQAEPL   22 (52)
T ss_pred             CcHHHHHHHHHHHHHHHhcccc
Confidence            77654  456666676766655


No 24 
>PF04202 Mfp-3:  Foot protein 3;  InterPro: IPR007328 Mytilus foot protein-3 (Mfp-3) is a highly polymorphic protein family located in the byssal adhesive plaques of blue mussels.
Probab=20.55  E-value=65  Score=23.84  Aligned_cols=17  Identities=24%  Similarity=0.313  Sum_probs=10.2

Q ss_pred             HHHHHHHHHHHHhccCc
Q psy5004           3 LLVLTLSLISFVSRCSA   19 (173)
Q Consensus         3 ~~l~~LLLl~~i~~geA   19 (173)
                      ++|+++|+.+|..|++|
T Consensus         8 VLlaLvLIg~fAVqSda   24 (71)
T PF04202_consen    8 VLLALVLIGSFAVQSDA   24 (71)
T ss_pred             HHHHHHHHhhheeeecC
Confidence            34444455577777777


No 25 
>PF08138 Sex_peptide:  Sex peptide (SP) family;  InterPro: IPR012608 This family consists of Sex Peptides (SP) that are found in Drosophila. On mating, Drosophila females decreases her remating rate and increases her egg-laying rate due, in part, to the transfer of SP from the male to the female. SP are found in seminal fluids transferred from the male to the female during mating. The male seminal fluid proteins are referred to as accessory gland proteins (Acps). The SP is one of the most interesting Acps and plays an important role in reproduction [].; GO: 0005179 hormone activity, 0046008 regulation of female receptivity, post-mating, 0005576 extracellular region; PDB: 2LAQ_A.
Probab=20.14  E-value=34  Score=24.28  Aligned_cols=15  Identities=27%  Similarity=0.419  Sum_probs=0.0

Q ss_pred             ChHHHHHHHHHHHHh
Q psy5004           1 MKLLVLTLSLISFVS   15 (173)
Q Consensus         1 MK~~l~~LLLl~~i~   15 (173)
                      ||+.+++|||++++.
T Consensus         1 Mk~p~~llllvlllG   15 (56)
T PF08138_consen    1 MKTPIFLLLLVLLLG   15 (56)
T ss_dssp             ---------------
T ss_pred             CcchHHHHHHHHHHH
Confidence            888887777766544


Done!