Query psy5004
Match_columns 173
No_of_seqs 55 out of 57
Neff 3.3
Searched_HMMs 46136
Date Fri Aug 16 23:22:54 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5004.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5004hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd00117 LU Ly-6 antigen / uPA 97.5 0.00015 3.2E-09 50.4 4.0 47 20-74 1-52 (79)
2 smart00134 LU Ly-6 antigen / u 97.1 0.00049 1.1E-08 48.0 3.4 45 20-72 1-49 (79)
3 PF00087 Toxin_1: Snake toxin; 96.4 0.0035 7.6E-08 43.4 3.2 44 20-74 1-45 (63)
4 cd00206 snake_toxin Snake toxi 96.0 0.0079 1.7E-07 41.7 3.2 46 20-75 1-47 (64)
5 PF00021 UPAR_LY6: u-PAR/Ly-6 95.5 0.018 3.8E-07 37.2 3.3 43 22-72 1-48 (77)
6 PF06211 BAMBI: BMP and activi 94.9 0.031 6.8E-07 43.8 3.4 67 12-97 18-85 (107)
7 PF01064 Activin_recp: Activin 94.7 0.031 6.8E-07 38.8 2.9 78 20-135 1-80 (83)
8 PF05444 DUF753: Protein of un 94.5 0.057 1.2E-06 42.5 4.1 51 18-75 81-133 (152)
9 PF05444 DUF753: Protein of un 93.5 0.11 2.4E-06 40.8 4.0 49 23-75 1-50 (152)
10 PF02988 PLA2_inh: Phospholipa 91.9 0.27 5.9E-06 36.7 4.0 45 22-73 3-51 (83)
11 PF06579 Ly-6_related: Caenorh 89.0 0.61 1.3E-05 37.2 4.0 85 41-133 38-129 (129)
12 TIGR02052 MerP mercuric transp 56.7 9 0.00019 24.8 1.9 19 1-19 1-20 (92)
13 KOG2052|consensus 53.5 34 0.00073 33.3 5.8 44 19-74 34-78 (513)
14 PRK11443 lipoprotein; Provisio 47.9 13 0.00028 29.3 1.8 28 1-38 1-32 (124)
15 PF11912 DUF3430: Protein of u 45.3 16 0.00035 29.2 2.0 14 1-14 1-14 (212)
16 PF11137 DUF2909: Protein of u 35.5 22 0.00047 25.4 1.3 20 1-20 1-20 (63)
17 PF08194 DIM: DIM protein; In 30.0 38 0.00083 22.0 1.6 7 1-7 1-7 (36)
18 PF11777 DUF3316: Protein of u 28.4 38 0.00083 25.6 1.6 19 1-19 1-19 (114)
19 PRK09810 entericidin A; Provis 26.3 59 0.0013 21.6 2.0 14 1-14 2-15 (41)
20 PF07172 GRP: Glycine rich pro 25.1 58 0.0013 24.6 2.1 8 2-9 4-11 (95)
21 PF11153 DUF2931: Protein of u 24.2 58 0.0013 26.8 2.1 46 1-57 1-47 (216)
22 PF13956 Ibs_toxin: Toxin Ibs, 24.0 42 0.00091 19.2 0.8 11 1-11 2-12 (19)
23 PF00879 Defensin_propep: Defe 21.6 71 0.0015 22.3 1.8 20 1-20 1-22 (52)
24 PF04202 Mfp-3: Foot protein 3 20.5 65 0.0014 23.8 1.5 17 3-19 8-24 (71)
25 PF08138 Sex_peptide: Sex pept 20.1 34 0.00075 24.3 0.0 15 1-15 1-15 (56)
No 1
>cd00117 LU Ly-6 antigen / uPA receptor -like domain; occurs singly in GPI-linked cell-surface glycoproteins (Ly-6 family,CD59, thymocyte B cell antigen, Sgp-2) or as three-fold repeated domain in urokinase-type plasminogen activator receptor. Topology of these domains is similar to that of snake venom neurotoxins.
Probab=97.49 E-value=0.00015 Score=50.40 Aligned_cols=47 Identities=30% Similarity=0.732 Sum_probs=36.9
Q ss_pred eeeecccCC-CCCCCCCCCCCccccccCCCCCCceEEEEEecC----cceeEeechhhhh
Q psy5004 20 LLCYQCVST-QPGCGTPFKWLWYPSISCPEEDDICVKIIERKG----AEEIITRDCLSTV 74 (173)
Q Consensus 20 L~CY~C~S~-~PgC~dpfn~~~~~~etCp~~~d~CvKVier~g----~~~~itRdClS~~ 74 (173)
|+||.|.+. ...|..+ ++||+.++.|+++..... ....++|+|+++.
T Consensus 1 L~C~~C~~~~~~~C~~~--------~~C~~~~~~C~~~~~~~~~~~~~~~~~~rgC~~~C 52 (79)
T cd00117 1 LECYSCTGVSTSSCSTE--------TNCPSPDDQCLTAVATVIEESVRLSLVVRGCASDC 52 (79)
T ss_pred CccCcCCCCCCCCCCCC--------CccCCCCCEeeEEEEEEEeeccccceEECcccCCC
Confidence 789999883 4468876 599999999999877742 3466889999954
No 2
>smart00134 LU Ly-6 antigen / uPA receptor -like domain. Three-fold repeated domain in urokinase-type plasminogen activator receptor; occurs singly in other GPI-linked cell-surface glycoproteins (Ly-6 family, CD59, thymocyte B cell antigen, Sgp-2). Topology of these domains is similar to that of snake venom neurotoxins.
Probab=97.14 E-value=0.00049 Score=48.03 Aligned_cols=45 Identities=22% Similarity=0.751 Sum_probs=34.2
Q ss_pred eeeecccCC-CCCCCCCCCCCccccccCCCCCCceEEEEEecC---cceeEeechhh
Q psy5004 20 LLCYQCVST-QPGCGTPFKWLWYPSISCPEEDDICVKIIERKG---AEEIITRDCLS 72 (173)
Q Consensus 20 L~CY~C~S~-~PgC~dpfn~~~~~~etCp~~~d~CvKVier~g---~~~~itRdClS 72 (173)
|.||.|.+. ...|..+ ++||+.++.|.+++.+.. ....++|+|.+
T Consensus 1 L~C~~C~~~~~~~C~~~--------~~C~~~~~~C~~~~~~~~~~~~~~~~~rgC~~ 49 (79)
T smart00134 1 LQCYSCTGNPDSSCSSE--------EECRSPDDVCLTVVAEVISGSVGSVVYKGCAT 49 (79)
T ss_pred CcceecCCcCCCCCCCc--------ceeCCCCCEeEEEEEEEEecCCCceEEcCccC
Confidence 689999875 3468776 579999999999875432 24568899993
No 3
>PF00087 Toxin_1: Snake toxin; InterPro: IPR003571 Snake toxins belong to a family of proteins [] which groups short and long neurotoxins, cytotoxins and short toxins, as well as a other miscellaneous venom peptides. Most of these toxins act by binding to the nicotinic acetylcholine receptors in the postsynaptic membrane of skeletal muscles and prevent the binding of acetylcholine, thereby blocking the excitation of muscles. Snake toxins are proteins that consist of sixty to seventy five amino acids. Among the invariant residues are eight cysteines all involved in disulphide bonds. The structure is small, disulphide-rich, nearly all beta sheet.; GO: 0005576 extracellular region; PDB: 3NEQ_B 2H5F_A 3PLC_A 1NOR_A 1UG4_A 1FFJ_A 1CCQ_A 1CB9_A 1VB0_A 1ONJ_A ....
Probab=96.42 E-value=0.0035 Score=43.41 Aligned_cols=44 Identities=34% Similarity=0.704 Sum_probs=34.4
Q ss_pred eeeecccCCCCCCCCCCCCCccccccCCCCCCceE-EEEEecCcceeEeechhhhh
Q psy5004 20 LLCYQCVSTQPGCGTPFKWLWYPSISCPEEDDICV-KIIERKGAEEIITRDCLSTV 74 (173)
Q Consensus 20 L~CY~C~S~~PgC~dpfn~~~~~~etCp~~~d~Cv-KVier~g~~~~itRdClS~~ 74 (173)
|.||++.+ . |.. .++||+.+++|. |+....+...+|.|+|.++.
T Consensus 1 ~~C~~~~~-~--~~~--------~~tC~~ge~~Cy~k~~~~~~~~~~~~rGCa~tC 45 (63)
T PF00087_consen 1 RTCYNCPS-S--CFT--------TKTCPEGENICYKKTWMDHPRGMRVERGCAATC 45 (63)
T ss_dssp -EEEESST-S--STE--------EEEESTTSSEEEEEEEEETSTEEEEEEEEESSS
T ss_pred CCccCCCC-C--CCc--------ceECCCccCeeEEEEEecCCCCceEecchhccC
Confidence 57888877 4 444 489999999998 66666657889999999876
No 4
>cd00206 snake_toxin Snake toxin domain, present in short and long neurotoxins, cytotoxins and short toxins, and in other miscellaneous venom peptides. The toxin acts by binding to the nicotinic acetylcholine receptors in the postsynaptic membrane of skeletal muscles and preventing the binding of acetylcholine, thereby blocking the excitation of muscles. This domain contains 60-75 amino acids that are fixed by 4-5 disulfide bridges and is nearly all beta sheet; it exists as either monomers or dimers.
Probab=95.98 E-value=0.0079 Score=41.67 Aligned_cols=46 Identities=33% Similarity=0.579 Sum_probs=33.9
Q ss_pred eeeecccCCCCCCCCCCCCCccccccCCCCCCceEEEEEec-CcceeEeechhhhhh
Q psy5004 20 LLCYQCVSTQPGCGTPFKWLWYPSISCPEEDDICVKIIERK-GAEEIITRDCLSTVK 75 (173)
Q Consensus 20 L~CY~C~S~~PgC~dpfn~~~~~~etCp~~~d~CvKVier~-g~~~~itRdClS~~~ 75 (173)
|.||+|.+.. |.. .++||+.+++|.|..... +....|.|+|.++.-
T Consensus 1 ~~C~~~~~~~--~~~--------~~tC~~ge~~Cyk~~~~~~~~g~~i~rGCa~tCP 47 (64)
T cd00206 1 LTCYNCPSPF--PFT--------TETCPDGENICYKKSWKDTPRGVRIERGCAATCP 47 (64)
T ss_pred CCccCCCCCC--CCc--------ceeCCcccCccceeEEEcCCCCCEEEccccCcCc
Confidence 5688887663 333 489999999999886554 445578899998763
No 5
>PF00021 UPAR_LY6: u-PAR/Ly-6 domain omitted due to poor similarity.; InterPro: IPR001526 CD59 (also called 1F-5Ag, H19, HRF20, MACIF, MIRL, P-18 or protectin) inhibits formation of membrane attack complex (MAC), thus protecting cells from complement mediated lysis. It has a signalling role, as a GPI-anchored molecule, in T cell activation and appears to have some role in cell adhesion through CD2 (controversial). CD59 associates with C9, inhibiting incorporation into C5b-8 preventing terminal steps in polymerisation of the (MAC) in plasma membranes. Genetic defects in GPI-anchor attachment that cause a reduction or loss of both CD59 and CD55 on erythrocytes produce the symptoms of the disease paroxysmal nocturnal haemoglobinuria (PNH). A variety of GPI-linked cell-surface glycoproteins are composed of one or more copies of a conserved domain of about 100 amino-acid residues [], []. Among these proteins, U-PAR contains three tandem copies of the domain, while all the others are made up of a single domain. As shown in the following schematic, this conserved domain contains 10 cysteine residues involved in five disulphide bonds - in U-PAR, the first copy of the domain lacks the fourth disulphide bond. +------+ +------------------------+ +---+ | | | | | | xCxxCxxxxxxCxxxxxCxxxxxCxxxxxxxxxxxxxxxxxxCxxxxCxxxxxxxxxxxxxxCCxxxCxxxxxxxx | | | | +---------------------+ +--------------+ 'C': conserved cysteine involved in a disulphide bond. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/).; PDB: 2I9B_H 1YWH_E 3BT2_U 2FD6_U 3U74_U 3BT1_U 3U73_U 2L03_A 3LAQ_V 2J8B_A ....
Probab=95.50 E-value=0.018 Score=37.24 Aligned_cols=43 Identities=23% Similarity=0.794 Sum_probs=33.8
Q ss_pred eecccCCC-CCCCCCCCCCccccccCCCCCCceEEEEEecC----cceeEeechhh
Q psy5004 22 CYQCVSTQ-PGCGTPFKWLWYPSISCPEEDDICVKIIERKG----AEEIITRDCLS 72 (173)
Q Consensus 22 CY~C~S~~-PgC~dpfn~~~~~~etCp~~~d~CvKVier~g----~~~~itRdClS 72 (173)
||.|.+.. ..|..+ ++||.+++.|+.+..+.. ....+.|+|..
T Consensus 1 C~~C~~~~~~~C~~~--------~~C~~~~~~C~~~~~~~~~g~~~~~~~~kgC~~ 48 (77)
T PF00021_consen 1 CYSCYSPSSSSCSGE--------VTCPGSEDVCYTATVTVTSGGNSVTLVSKGCAT 48 (77)
T ss_dssp EEEEECSTTTTSCTE--------EEECTTTSEEEEEEEEEETTTCCEEEEEEEEEE
T ss_pred CCcccCCCCCCCCCC--------CCcCCCCCeeEEEEEEeeecccCceEEEeeccc
Confidence 89999863 369885 699999999987776543 36788899994
No 6
>PF06211 BAMBI: BMP and activin membrane-bound inhibitor (BAMBI) N-terminal domain; InterPro: IPR009345 This family consists of several eukaryotic BMP and activin membrane-bound inhibitor (BAMBI) proteins. Members of the transforming growth factor-beta (TGF-beta) superfamily, including TGF-beta, bone morphogenetic proteins (BMPs), activins and nodals, are vital for regulating growth and differentiation. BAMBI is related to TGF-beta-family type I receptors but lacks an intracellular kinase domain. BAMBI is co-expressed with the ventralising morphogen BMP4 during Xenopus embryogenesis and requires BMP signalling for its expression. The protein stably associates with TGF-beta-family receptors and inhibits BMP and activin as well as TGF-beta signalling [].
Probab=94.86 E-value=0.031 Score=43.79 Aligned_cols=67 Identities=18% Similarity=0.582 Sum_probs=47.2
Q ss_pred HHHhccCceeeecccCCCCCCCCCCCCCccccccCCCCCCce-EEEEEecCcceeEeechhhhhhhhhhcCCcccccccc
Q psy5004 12 SFVSRCSALLCYQCVSTQPGCGTPFKWLWYPSISCPEEDDIC-VKIIERKGAEEIITRDCLSTVKAFRTDIPADRYEGCR 90 (173)
Q Consensus 12 ~~i~~geAL~CY~C~S~~PgC~dpfn~~~~~~etCp~~~d~C-vKVier~g~~~~itRdClS~~~~~r~dipad~yegcr 90 (173)
.++.+ .+++|| |+..+ |..+ +..|-.+-..| -|+..+...++..+++|+-++. ...+.|.
T Consensus 18 ~lltk-GEIRCY-Cn~P~--CV~T-------GYMCkSel~~CfSklldp~n~nspl~HGCld~~~--------~~~~~C~ 78 (107)
T PF06211_consen 18 VLLTK-GEIRCY-CNAPH--CVAT-------GYMCKSELNACFSKLLDPQNTNSPLTHGCLDSLA--------STADICK 78 (107)
T ss_pred HHHhc-CceEEe-cCCcc--eecc-------cceeccccccceeeccCcccCCCccccccccccc--------CCCCccc
Confidence 34444 459999 87655 9998 67999888899 4565566677889999998773 3456666
Q ss_pred cCcCCCc
Q psy5004 91 AAAKDPK 97 (173)
Q Consensus 91 ~~a~d~~ 97 (173)
+....-+
T Consensus 79 ~~~~~~~ 85 (107)
T PF06211_consen 79 SEQAQNH 85 (107)
T ss_pred Ccccccc
Confidence 6544333
No 7
>PF01064 Activin_recp: Activin types I and II receptor domain; InterPro: IPR000472 Transforming growth factor-beta (TGF-beta) forms a family with other growth factors described in PDOC00223 from PROSITEDOC. The receptors for most of the members of this growth factor family are related. These proteins are receptor-type kinases of Ser/Thr type PDOC00100 from PROSITEDOC), which have a single transmembrane domain and a specific hydrophilic Cys-rich ligand-binding domain [, , ]. The C-terminal part of the extracellular domain is conserved. Some of the receptors of this family contain subclass-specific N-terminal extensions of this homology domain. The type I receptors also possess 7 extracellular residues preceding the cysteine box.; GO: 0004675 transmembrane receptor protein serine/threonine kinase activity, 0005024 transforming growth factor beta-activated receptor activity, 0016020 membrane; PDB: 3EVS_C 1BTE_A 1LX5_B 2GOO_C 2HLR_A 4FAO_W 2H64_C 2H62_D 1S4Y_C 1NYU_C ....
Probab=94.73 E-value=0.031 Score=38.81 Aligned_cols=78 Identities=26% Similarity=0.534 Sum_probs=56.0
Q ss_pred eeeecccCCCCCCCCCCCCCccccccCCCCCCceEEEEEecCc--ceeEeechhhhhhhhhhcCCcccccccccCcCCCc
Q psy5004 20 LLCYQCVSTQPGCGTPFKWLWYPSISCPEEDDICVKIIERKGA--EEIITRDCLSTVKAFRTDIPADRYEGCRAAAKDPK 97 (173)
Q Consensus 20 L~CY~C~S~~PgC~dpfn~~~~~~etCp~~~d~CvKVier~g~--~~~itRdClS~~~~~r~dipad~yegcr~~a~d~~ 97 (173)
|+|| |.+.+ |.++ ..+|.. ...|.+.++.... .++++|+|++ ...+.+.+. ..|...++.
T Consensus 1 l~C~-C~~~~--C~~~-------n~~C~t-~~~Cf~~~~~~~~~~~~~~~~GC~~----~~~~~~~~~-~~C~~~~~~-- 62 (83)
T PF01064_consen 1 LKCY-CDSDD--CNET-------NQTCET-GGFCFTSWEQDNGGGVEIVKKGCWS----NEEDCPLCR-NQCRSSSSP-- 62 (83)
T ss_dssp EEEE-EETTC--HCTT-------TEEEEE-SCEEEEEEEEETTESEEEEEEEEEE----STGGGTTTS-SSEEECSSS--
T ss_pred CEeE-eCccc--CCCC-------CCEeCC-CCEEEEEEEEeeccceeEEEcCccC----CccCcceec-ceeeccCCC--
Confidence 7899 98877 8865 368876 8899888888766 6899999998 445666666 888884443
Q ss_pred chhhhccCCCCccccCceeeccceEEEeeccccccccc
Q psy5004 98 LANYVNHSIPELDVQRPSELKTGYTEEIITRDCLSTVK 135 (173)
Q Consensus 98 l~~Yv~~sike~D~kr~~~~~t~~~~~~~~~~C~s~~~ 135 (173)
....+| |.-+..||+.++
T Consensus 63 -------------------~~~~~C-C~~~D~CN~~~~ 80 (83)
T PF01064_consen 63 -------------------SSVVFC-CCSGDFCNSNLS 80 (83)
T ss_dssp -------------------TCEEEE-EESSTTGGGSEE
T ss_pred -------------------CCeEEE-CCCCCccCCCCC
Confidence 112345 445778888765
No 8
>PF05444 DUF753: Protein of unknown function (DUF753); InterPro: IPR008472 This entry contains sequences which are repeated in several uncharacterised proteins from Drosophila melanogaster.
Probab=94.46 E-value=0.057 Score=42.46 Aligned_cols=51 Identities=31% Similarity=0.727 Sum_probs=36.6
Q ss_pred CceeeecccCCCCCCCCCCCCCccccccCCC--CCCceEEEEEecCcceeEeechhhhhh
Q psy5004 18 SALLCYQCVSTQPGCGTPFKWLWYPSISCPE--EDDICVKIIERKGAEEIITRDCLSTVK 75 (173)
Q Consensus 18 eAL~CY~C~S~~PgC~dpfn~~~~~~etCp~--~~d~CvKVier~g~~~~itRdClS~~~ 75 (173)
..|+||+|. ....|..+- ....+.|+. ..+.|...+. ....+.|+|+|+|.
T Consensus 81 ~~~~C~qC~-~~~~C~~~~---~~~~~~C~~~~~~d~Cyt~~~---~~~~~~RGC~s~l~ 133 (152)
T PF05444_consen 81 DRLSCYQCT-DDSDCVSSD---SSTAKPCPNYSEDDQCYTRVD---DGGVVERGCLSDLE 133 (152)
T ss_pred cCCEeeeCC-CCCCCCCCC---CcCcCcCcCCCCCCcCEEEEe---eCCEEEeCCCCccc
Confidence 459999998 666791111 123689986 4899988776 34557899999884
No 9
>PF05444 DUF753: Protein of unknown function (DUF753); InterPro: IPR008472 This entry contains sequences which are repeated in several uncharacterised proteins from Drosophila melanogaster.
Probab=93.47 E-value=0.11 Score=40.81 Aligned_cols=49 Identities=33% Similarity=0.675 Sum_probs=33.8
Q ss_pred ecccCC-CCCCCCCCCCCccccccCCCCCCceEEEEEecCcceeEeechhhhhh
Q psy5004 23 YQCVST-QPGCGTPFKWLWYPSISCPEEDDICVKIIERKGAEEIITRDCLSTVK 75 (173)
Q Consensus 23 Y~C~S~-~PgC~dpfn~~~~~~etCp~~~d~CvKVier~g~~~~itRdClS~~~ 75 (173)
|+|.|. .+.|..+- +.....+.|++..+.|+.++...+. +.|+|+++|.
T Consensus 1 ~~C~s~~~~~C~~~~-~~~~~~~~C~~~~~~C~t~~~~~~~---~~RGC~~~~~ 50 (152)
T PF05444_consen 1 YSCDSCEDPNCTDPD-PSTSATEECPQQDDQCYTRFDDDGT---TIRGCLSDLP 50 (152)
T ss_pred CCccCCcCCCCCCCC-CCceecccCCCCCCCCEEEEeCCCc---eEcCchhhcc
Confidence 455554 67893322 1233368999998889999976544 6799999884
No 10
>PF02988 PLA2_inh: Phospholipase A2 inhibitor; InterPro: IPR004126 Proteins in this entry inhibit basic phospholipase A2 isozymes in snake's venom [, ].; GO: 0004859 phospholipase inhibitor activity, 0005576 extracellular region
Probab=91.87 E-value=0.27 Score=36.72 Aligned_cols=45 Identities=22% Similarity=0.640 Sum_probs=33.7
Q ss_pred eecccCCCCCCCCCCCCCccccccCCCCCCceEEEEEec---Ccce-eEeechhhh
Q psy5004 22 CYQCVSTQPGCGTPFKWLWYPSISCPEEDDICVKIIERK---GAEE-IITRDCLST 73 (173)
Q Consensus 22 CY~C~S~~PgC~dpfn~~~~~~etCp~~~d~CvKVier~---g~~~-~itRdClS~ 73 (173)
|.-|.+.++.|.-. .+||..++|.||+++... |... ++.+.|+++
T Consensus 3 CEvC~~~G~~C~G~-------~~tC~~~eDtC~~~~~E~~~~~~s~~~~~K~C~~S 51 (83)
T PF02988_consen 3 CEVCHGSGKDCSGK-------MKTCEDGEDTCVTVVTEVSSAGVSFRTTHKGCFSS 51 (83)
T ss_pred cceecCcCCCCCCC-------eeEcCCCCCEEEEEEEEeccCCeEEEEEEecccCc
Confidence 77788888889887 489999999998886543 2222 335888874
No 11
>PF06579 Ly-6_related: Caenorhabditis elegans ly-6-related protein; InterPro: IPR010558 This family consists of several Caenorhabditis elegans specific ly-6-related HOT and ODR proteins. These proteins are involved in the olfactory system. Odr-2 mutants are known to be defective in the ability to chemotax to odorants that are recognised by the two AWC olfactory neurons. Odr-2 encodes a membrane-associated protein related to the Ly-6 superfamily of GPI-linked signalling proteins [].
Probab=89.02 E-value=0.61 Score=37.17 Aligned_cols=85 Identities=20% Similarity=0.373 Sum_probs=47.8
Q ss_pred cccccCCCCCCceEEEEEe---cCc--ceeEeechhhhh--hhhhhcCCcccccccccCcCCCcchhhhccCCCCccccC
Q psy5004 41 YPSISCPEEDDICVKIIER---KGA--EEIITRDCLSTV--KAFRTDIPADRYEGCRAAAKDPKLANYVNHSIPELDVQR 113 (173)
Q Consensus 41 ~~~etCp~~~d~CvKVier---~g~--~~~itRdClS~~--~~~r~dipad~yegcr~~a~d~~l~~Yv~~sike~D~kr 113 (173)
..++.| .++||.++|. .|+ ...+.|||++++ .+|...+=...++. +...++.++.+-+....+.. .
T Consensus 38 i~~~~C---~s~Cvtl~e~~~~~gg~~~~~yIRGC~~~I~~~G~n~t~t~~~~~~-~~~C~~~~~s~Lf~~~~~~~---~ 110 (129)
T PF06579_consen 38 IPTVPC---SSICVTLFEPLDIFGGTRGFGYIRGCLDSILRHGFNDTITLSFLDR-RDICRRLRLSQLFPPGRRRE---H 110 (129)
T ss_pred cceeec---CchheeeEeccccccccccccEEEcCHHHHHHhCCCCchHHHHhcC-CCeeeeccHHHhcCcccccc---c
Confidence 345888 7899999999 233 345569999999 45544322233333 33445666655443332211 0
Q ss_pred ceeeccceEEEeeccccccc
Q psy5004 114 PSELKTGYTEEIITRDCLST 133 (173)
Q Consensus 114 ~~~~~t~~~~~~~~~~C~s~ 133 (173)
.--...+.|.|.- ..||++
T Consensus 111 ~~~~~v~vCsC~g-d~CN~s 129 (129)
T PF06579_consen 111 PSDSTVQVCSCYG-DRCNGS 129 (129)
T ss_pred CCCCeEEEEEECC-CccCCC
Confidence 1112346788864 458764
No 12
>TIGR02052 MerP mercuric transport protein periplasmic component. This model represents the periplasmic mercury (II) binding protein of the bacterial mercury detoxification system which passes mercuric ion to the MerT transporter for subsequent reduction to Hg(0) by the mercuric reductase MerA. MerP contains a distinctive GMTCXXC motif associated with metal binding. MerP is related to a larger family of metal binding proteins (pfam00403).
Probab=56.66 E-value=9 Score=24.84 Aligned_cols=19 Identities=37% Similarity=0.310 Sum_probs=12.2
Q ss_pred ChHHHHHHHHHHHHhc-cCc
Q psy5004 1 MKLLVLTLSLISFVSR-CSA 19 (173)
Q Consensus 1 MK~~l~~LLLl~~i~~-geA 19 (173)
||.++.+|+||+.... .+|
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~ 20 (92)
T TIGR02052 1 MKKLATLLALFVLTSLPAWA 20 (92)
T ss_pred ChhHHHHHHHHHHhcchhhh
Confidence 8888877776644333 555
No 13
>KOG2052|consensus
Probab=53.50 E-value=34 Score=33.30 Aligned_cols=44 Identities=27% Similarity=0.579 Sum_probs=32.2
Q ss_pred ceeeecccCCCCCCCCCCCCCccccccCCCCCCceEEEEE-ecCcceeEeechhhhh
Q psy5004 19 ALLCYQCVSTQPGCGTPFKWLWYPSISCPEEDDICVKIIE-RKGAEEIITRDCLSTV 74 (173)
Q Consensus 19 AL~CY~C~S~~PgC~dpfn~~~~~~etCp~~~d~CvKVie-r~g~~~~itRdClS~~ 74 (173)
+++|| |.+-.+ .++ .||.- +-.|-..++ ..++.+...|+|+..-
T Consensus 34 ~~~C~-C~~~~~--~n~--------~tC~t-~g~C~~s~~~~~~g~~~~~~gC~~~~ 78 (513)
T KOG2052|consen 34 LLKCY-CSSCLC--SNA--------NTCET-DGACFVSVEENDDGKEQHHRGCMTLE 78 (513)
T ss_pred eEEEE-ecCCCc--CCC--------Ceeee-cCeEEEEEEecCCCceEEEecccccc
Confidence 68999 766663 332 27766 788966665 8888889999999843
No 14
>PRK11443 lipoprotein; Provisional
Probab=47.87 E-value=13 Score=29.35 Aligned_cols=28 Identities=18% Similarity=0.385 Sum_probs=18.4
Q ss_pred ChHHHHHHHHHHHHhccCceeeecccCC---CC-CCCCCCCC
Q psy5004 1 MKLLVLTLSLISFVSRCSALLCYQCVST---QP-GCGTPFKW 38 (173)
Q Consensus 1 MK~~l~~LLLl~~i~~geAL~CY~C~S~---~P-gC~dpfn~ 38 (173)
||.++++++++++.. |.|. .| .|.. -+|
T Consensus 1 Mk~~~~~~~~~lLsg---------Ca~~~~~~~~~~~~-~dW 32 (124)
T PRK11443 1 MKKFIAPLLALLLSG---------CQIDPYTHAPTLTS-TDW 32 (124)
T ss_pred ChHHHHHHHHHHHHh---------ccCCCCCChhhhcc-cCH
Confidence 998777777665544 8773 33 4655 578
No 15
>PF11912 DUF3430: Protein of unknown function (DUF3430); InterPro: IPR021837 This family of proteins are functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 209 to 265 amino acids in length.
Probab=45.33 E-value=16 Score=29.18 Aligned_cols=14 Identities=43% Similarity=0.641 Sum_probs=8.5
Q ss_pred ChHHHHHHHHHHHH
Q psy5004 1 MKLLVLTLSLISFV 14 (173)
Q Consensus 1 MK~~l~~LLLl~~i 14 (173)
||.++.+|||++++
T Consensus 1 MKll~~lilli~~~ 14 (212)
T PF11912_consen 1 MKLLISLILLILLI 14 (212)
T ss_pred CcHHHHHHHHHHHH
Confidence 99965555554433
No 16
>PF11137 DUF2909: Protein of unknown function (DUF2909); InterPro: IPR021313 This is a family of proteins conserved in Proteobacteria of unknown function.
Probab=35.49 E-value=22 Score=25.37 Aligned_cols=20 Identities=45% Similarity=0.502 Sum_probs=15.8
Q ss_pred ChHHHHHHHHHHHHhccCce
Q psy5004 1 MKLLVLTLSLISFVSRCSAL 20 (173)
Q Consensus 1 MK~~l~~LLLl~~i~~geAL 20 (173)
||.++++++++..++-+.|+
T Consensus 1 ~Ki~iv~lll~ii~sL~saL 20 (63)
T PF11137_consen 1 MKILIVLLLLAIIASLFSAL 20 (63)
T ss_pred ChHHHHHHHHHHHHHHHHHH
Confidence 78988888888777766665
No 17
>PF08194 DIM: DIM protein; InterPro: IPR013172 Drosophila immune-induced molecules (DIMs) are short proteins induced during the immune response of Drosophila []. This entry includes DIMs 1 to 4 and DIM23.
Probab=30.04 E-value=38 Score=22.05 Aligned_cols=7 Identities=57% Similarity=0.648 Sum_probs=4.3
Q ss_pred ChHHHHH
Q psy5004 1 MKLLVLT 7 (173)
Q Consensus 1 MK~~l~~ 7 (173)
||.+.++
T Consensus 1 Mk~l~~a 7 (36)
T PF08194_consen 1 MKCLSLA 7 (36)
T ss_pred CceeHHH
Confidence 7776553
No 18
>PF11777 DUF3316: Protein of unknown function (DUF3316); InterPro: IPR016879 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=28.36 E-value=38 Score=25.59 Aligned_cols=19 Identities=37% Similarity=0.401 Sum_probs=14.1
Q ss_pred ChHHHHHHHHHHHHhccCc
Q psy5004 1 MKLLVLTLSLISFVSRCSA 19 (173)
Q Consensus 1 MK~~l~~LLLl~~i~~geA 19 (173)
||.++++++++++.....|
T Consensus 1 MKk~~ll~~~ll~s~~a~A 19 (114)
T PF11777_consen 1 MKKIILLASLLLLSSSAFA 19 (114)
T ss_pred CchHHHHHHHHHHHHHHhh
Confidence 8988888877666665555
No 19
>PRK09810 entericidin A; Provisional
Probab=26.34 E-value=59 Score=21.60 Aligned_cols=14 Identities=36% Similarity=0.420 Sum_probs=9.3
Q ss_pred ChHHHHHHHHHHHH
Q psy5004 1 MKLLVLTLSLISFV 14 (173)
Q Consensus 1 MK~~l~~LLLl~~i 14 (173)
||.++++++++++.
T Consensus 2 Mkk~~~l~~~~~~~ 15 (41)
T PRK09810 2 MKRLIVLVLLASTL 15 (41)
T ss_pred hHHHHHHHHHHHHH
Confidence 77777777665553
No 20
>PF07172 GRP: Glycine rich protein family; InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=25.13 E-value=58 Score=24.60 Aligned_cols=8 Identities=25% Similarity=0.085 Sum_probs=3.5
Q ss_pred hHHHHHHH
Q psy5004 2 KLLVLTLS 9 (173)
Q Consensus 2 K~~l~~LL 9 (173)
|++|++.|
T Consensus 4 K~~llL~l 11 (95)
T PF07172_consen 4 KAFLLLGL 11 (95)
T ss_pred hHHHHHHH
Confidence 44444433
No 21
>PF11153 DUF2931: Protein of unknown function (DUF2931); InterPro: IPR021326 Some members in this family of proteins are annotated as lipoproteins however this cannot be confirmed. Currently, there is no known function.
Probab=24.21 E-value=58 Score=26.78 Aligned_cols=46 Identities=22% Similarity=0.425 Sum_probs=26.2
Q ss_pred ChHHHHHHHHHHHHhccCceeeecccCCCCC-CCCCCCCCccccccCCCCCCceEEEE
Q psy5004 1 MKLLVLTLSLISFVSRCSALLCYQCVSTQPG-CGTPFKWLWYPSISCPEEDDICVKII 57 (173)
Q Consensus 1 MK~~l~~LLLl~~i~~geAL~CY~C~S~~Pg-C~dpfn~~~~~~etCp~~~d~CvKVi 57 (173)
||.+++++|++++.. |.+..+. =...+.| ..+..-|....+=|.-.
T Consensus 1 mk~i~~l~l~lll~~---------C~~~~~~~~~~~~~W--~~~~~~P~~ypv~V~~~ 47 (216)
T PF11153_consen 1 MKKILLLLLLLLLTG---------CSTNPNEPLQPYFEW--RFGVAAPKHYPVWVTYA 47 (216)
T ss_pred ChHHHHHHHHHHHHh---------hcCCCccCCCCCCcc--EEEEecCCCCEEEEEEE
Confidence 899998886665555 7665331 1155666 33345576544444433
No 22
>PF13956 Ibs_toxin: Toxin Ibs, type I toxin-antitoxin system
Probab=24.04 E-value=42 Score=19.23 Aligned_cols=11 Identities=36% Similarity=0.773 Sum_probs=5.6
Q ss_pred ChHHHHHHHHH
Q psy5004 1 MKLLVLTLSLI 11 (173)
Q Consensus 1 MK~~l~~LLLl 11 (173)
||.++.+++|+
T Consensus 2 Mk~vIIlvvLL 12 (19)
T PF13956_consen 2 MKLVIILVVLL 12 (19)
T ss_pred ceehHHHHHHH
Confidence 56555544443
No 23
>PF00879 Defensin_propep: Defensin propeptide The pattern for this Prosite entry doesn't match the propeptide.; InterPro: IPR002366 Defensins are 2-6 kDa, cationic, microbicidal peptides active against many Gram-negative and Gram-positive bacteria, fungi, and enveloped viruses [], containing three pairs of intramolecular disulphide bonds []. On the basis of their size and pattern of disulphide bonding, mammalian defensins are classified into alpha, beta and theta categories. Alpha-defensins, which have been identified in humans, monkeys and several rodent species, are particularly abundant in neutrophils, certain macrophage populations and Paneth cells of the small intestine. Every mammalian species explored thus far has beta-defensins. In cows, as many as 13 beta-defensins exist in neutrophils. However, in other species, beta-defensins are more often produced by epithelial cells lining various organs (e.g. the epidermis, bronchial tree and genitourinary tract). Theta-defensins are cyclic and have so far only been identified in primate phagocytes. Defensins are produced constitutively and/or in response to microbial products or proinflammatory cytokines. Some defensins are also called corticostatins (CS) because they inhibit corticotropin-stimulated corticosteroid production. The mechanism(s) by which microorganisms are killed and/or inactivated by defensins is not understood completely. However, it is generally believed that killing is a consequence of disruption of the microbial membrane. The polar topology of defensins, with spatially separated charged and hydrophobic regions, allows them to insert themselves into the phospholipid membranes so that their hydrophobic regions are buried within the lipid membrane interior and their charged (mostly cationic) regions interact with anionic phospholipid head groups and water. Subsequently, some defensins can aggregate to form `channel-like' pores; others might bind to and cover the microbial membrane in a `carpet-like' manner. The net outcome is the disruption of membrane integrity and function, which ultimately leads to the lysis of microorganisms. Some defensins are synthesized as propeptides which may be relevant to this process - in neutrophils only the mature peptides have been identified but in Paneth cells, the propeptide is stored in vesicles [] and appears to be cleaved by trypsin on activation. ; GO: 0006952 defense response
Probab=21.56 E-value=71 Score=22.26 Aligned_cols=20 Identities=25% Similarity=0.453 Sum_probs=12.7
Q ss_pred ChHHH--HHHHHHHHHhccCce
Q psy5004 1 MKLLV--LTLSLISFVSRCSAL 20 (173)
Q Consensus 1 MK~~l--~~LLLl~~i~~geAL 20 (173)
||+++ .+|||+++..|.+.+
T Consensus 1 MRTL~LLaAlLLlAlqaQAepl 22 (52)
T PF00879_consen 1 MRTLALLAALLLLALQAQAEPL 22 (52)
T ss_pred CcHHHHHHHHHHHHHHHhcccc
Confidence 77654 456666676766655
No 24
>PF04202 Mfp-3: Foot protein 3; InterPro: IPR007328 Mytilus foot protein-3 (Mfp-3) is a highly polymorphic protein family located in the byssal adhesive plaques of blue mussels.
Probab=20.55 E-value=65 Score=23.84 Aligned_cols=17 Identities=24% Similarity=0.313 Sum_probs=10.2
Q ss_pred HHHHHHHHHHHHhccCc
Q psy5004 3 LLVLTLSLISFVSRCSA 19 (173)
Q Consensus 3 ~~l~~LLLl~~i~~geA 19 (173)
++|+++|+.+|..|++|
T Consensus 8 VLlaLvLIg~fAVqSda 24 (71)
T PF04202_consen 8 VLLALVLIGSFAVQSDA 24 (71)
T ss_pred HHHHHHHHhhheeeecC
Confidence 34444455577777777
No 25
>PF08138 Sex_peptide: Sex peptide (SP) family; InterPro: IPR012608 This family consists of Sex Peptides (SP) that are found in Drosophila. On mating, Drosophila females decreases her remating rate and increases her egg-laying rate due, in part, to the transfer of SP from the male to the female. SP are found in seminal fluids transferred from the male to the female during mating. The male seminal fluid proteins are referred to as accessory gland proteins (Acps). The SP is one of the most interesting Acps and plays an important role in reproduction [].; GO: 0005179 hormone activity, 0046008 regulation of female receptivity, post-mating, 0005576 extracellular region; PDB: 2LAQ_A.
Probab=20.14 E-value=34 Score=24.28 Aligned_cols=15 Identities=27% Similarity=0.419 Sum_probs=0.0
Q ss_pred ChHHHHHHHHHHHHh
Q psy5004 1 MKLLVLTLSLISFVS 15 (173)
Q Consensus 1 MK~~l~~LLLl~~i~ 15 (173)
||+.+++|||++++.
T Consensus 1 Mk~p~~llllvlllG 15 (56)
T PF08138_consen 1 MKTPIFLLLLVLLLG 15 (56)
T ss_dssp ---------------
T ss_pred CcchHHHHHHHHHHH
Confidence 888887777766544
Done!