Query         psy9452
Match_columns 1789
No_of_seqs    98 out of 100
Neff          1.2 
Searched_HMMs 46136
Date          Fri Aug 16 19:42:41 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy9452.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9452hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 smart00214 VWC von Willebrand   98.8 3.4E-09 7.4E-14   89.2   4.5   56  147-210     1-59  (59)
  2 PF00093 VWC:  von Willebrand f  98.8 3.7E-09   8E-14   88.7   2.7   55  147-210     1-57  (57)
  3 smart00215 VWC_out von Willebr  97.2 0.00043 9.4E-09   62.0   4.1   38  147-187     1-38  (67)
  4 PF12714 TILa:  TILa domain      96.3  0.0035 7.7E-08   53.9   3.0   47  146-197     1-49  (56)
  5 PF15430 SVWC:  Single domain v  95.1   0.045 9.7E-07   45.8   5.3   60  147-210     1-65  (65)
  6 PF14828 Amnionless:  Amnionles  93.5   0.035 7.6E-07   63.1   1.8   54  153-213   175-235 (437)
  7 KOG1544|consensus               71.4     2.7 5.8E-05   48.2   2.4   48  138-189   104-151 (470)
  8 PF05825 PSP94:  Beta-microsemi  56.4     7.3 0.00016   37.5   2.0   52  146-199    17-69  (94)
  9 PF05375 Pacifastin_I:  Pacifas  54.3     7.2 0.00016   32.6   1.4   21  166-187    12-33  (39)
 10 PF08097 Toxin_26:  Conotoxin T  25.1      28 0.00061   23.6   0.3    8  203-210     1-10  (11)
 11 PF12786 GBV-C_env:  GB virus C  22.5      88  0.0019   36.7   3.7   65  141-209    87-155 (413)

No 1  
>smart00214 VWC von Willebrand factor (vWF) type C domain.
Probab=98.83  E-value=3.4e-09  Score=89.17  Aligned_cols=56  Identities=34%  Similarity=1.045  Sum_probs=50.6

Q ss_pred             cccCCEEccCCCCCCCCCCCccceEEeCCc-ceeeeccCCCCCCCCCCCCCCC--CCCCCCCCCCcC
Q psy9452         147 CIIDGQLHSNNSRIETKNPCHVQCLCVNGA-PSCESLECPPPPPTHRNCMPIH--PGPDLCCPAYAC  210 (1789)
Q Consensus       147 CLVDGQVYANGSDVPKpEPCEV~CYCINGi-VVCDEVECelPPeNlKDCTPV~--IPPGECCPTYIC  210 (1789)
                      |+++|++|.+| ..|.+++|. .|+|.+|. |.|..+.|..+    ..|....  ..+|+|||  +|
T Consensus         1 C~~~g~~y~~G-~~W~~~~C~-~C~C~~g~~v~C~~~~Cp~~----~~C~~~~~~~~~g~CCp--~C   59 (59)
T smart00214        1 CVHNGEVYNDG-ETWKPDPCQ-ICTCLDGETVLCDPVECPPP----PDCPNPERVKPPGECCP--RC   59 (59)
T ss_pred             CccCCEEeCCC-CEECCCCCe-ECCcCCCCEEEeeeecCCCC----CCCCCCcccCCCCCcCC--CC
Confidence            89999999999 999999999 99999999 99999999873    3788776  68999999  65


No 2  
>PF00093 VWC:  von Willebrand factor type C domain;  InterPro: IPR001007 The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. The domain is named after the von Willebrand factor (VWF) type C repeat which is found in multidomain protein/multifunctional proteins involved in maintaining homeostasis [, ]. For the von Willebrand factor the duplicated VWFC domain is thought to participate in oligomerization, but not in the initial dimerization step []. The presence of this region in a number of other complex-forming proteins points to the possible involvment of the VWFC domain in complex formation.; GO: 0005515 protein binding; PDB: 1U5M_A.
Probab=98.76  E-value=3.7e-09  Score=88.67  Aligned_cols=55  Identities=33%  Similarity=0.957  Sum_probs=33.8

Q ss_pred             cccCCEEccCCCCCCCCCCCccceEEeCCcceeeeccCCCCCCCCCCCCCC--CCCCCCCCCCCcC
Q psy9452         147 CIIDGQLHSNNSRIETKNPCHVQCLCVNGAPSCESLECPPPPPTHRNCMPI--HPGPDLCCPAYAC  210 (1789)
Q Consensus       147 CLVDGQVYANGSDVPKpEPCEV~CYCINGiVVCDEVECelPPeNlKDCTPV--~IPPGECCPTYIC  210 (1789)
                      |.++|++|.+| ..|.+++|. .|+|.+|.|.|..+.|+.     ..|...  ...+|+|||  .|
T Consensus         1 C~~~g~~y~~g-~~w~~~~C~-~C~C~~G~v~C~~~~Cp~-----~~C~~~~~~~~~g~CCp--~C   57 (57)
T PF00093_consen    1 CSFNGRVYQNG-ESWHPDPCT-TCTCQDGEVQCSRIQCPP-----LDCPNPELVKPPGECCP--VC   57 (57)
T ss_dssp             EESSS-EE-SS--EE-S-SSE-EEEEETTEEEEEE---S------SS-S-------SS-S----S-
T ss_pred             CeeCCEEeCCC-CEECCCCCc-EeEecCCEEEEcCCCCcC-----CCCCCCceecCCCCcCC--CC
Confidence            88999999999 999999999 999999999999999997     278877  788999999  66


No 3  
>smart00215 VWC_out von Willebrand factor (vWF) type C domain.
Probab=97.16  E-value=0.00043  Score=61.99  Aligned_cols=38  Identities=29%  Similarity=0.728  Sum_probs=35.1

Q ss_pred             cccCCEEccCCCCCCCCCCCccceEEeCCcceeeeccCCCC
Q psy9452         147 CIIDGQLHSNNSRIETKNPCHVQCLCVNGAPSCESLECPPP  187 (1789)
Q Consensus       147 CLVDGQVYANGSDVPKpEPCEV~CYCINGiVVCDEVECelP  187 (1789)
                      |+++|++|.+| ++|. +-|. .|+|++|.+.|+.+.|+..
T Consensus         1 C~~~G~~y~~g-~~w~-d~Cn-~CtC~~G~v~Ct~~~C~~~   38 (67)
T smart00215        1 CYNNGSYYPPG-AKWD-DDCN-RCTCLNGRVSCTKVWCGPK   38 (67)
T ss_pred             CeECCEEcCCC-Cccc-cCCC-CCEecCCCEEecCCcCCCc
Confidence            89999999999 7775 7899 9999999999999999984


No 4  
>PF12714 TILa:  TILa domain
Probab=96.25  E-value=0.0035  Score=53.87  Aligned_cols=47  Identities=28%  Similarity=0.697  Sum_probs=39.7

Q ss_pred             Ccc-cCCEEccCCCCCCCCCCCccceEE-eCCcceeeeccCCCCCCCCCCCCCC
Q psy9452         146 DCI-IDGQLHSNNSRIETKNPCHVQCLC-VNGAPSCESLECPPPPPTHRNCMPI  197 (1789)
Q Consensus       146 DCL-VDGQVYANGSDVPKpEPCEV~CYC-INGiVVCDEVECelPPeNlKDCTPV  197 (1789)
                      +|+ ++|+.|..|+.+|..+ |..+|+| .+|.|.|....|..    ..-|...
T Consensus         1 GC~d~~G~yy~~Ge~~~~~~-C~~~C~C~~~g~v~C~~~~C~~----~e~C~~~   49 (56)
T PF12714_consen    1 GCTDYNGRYYPPGESWWTDD-CTQRCTCQPNGQVQCQPSSCPP----GEVCQIQ   49 (56)
T ss_pred             CCcCcCCEEECCCCEEeCCC-CCEeEEEcCCCeEEEeCCCCCC----CCEeEeC
Confidence            599 9999999997666666 9999999 88999999999998    4456554


No 5  
>PF15430 SVWC:  Single domain von Willebrand factor type C
Probab=95.13  E-value=0.045  Score=45.80  Aligned_cols=60  Identities=27%  Similarity=0.727  Sum_probs=45.2

Q ss_pred             cccCCEEccCCCCCCCCCCCccceEEe--CCcceeeeccCCCCCCCCCCCCCCC---CCCCCCCCCCcC
Q psy9452         147 CIIDGQLHSNNSRIETKNPCHVQCLCV--NGAPSCESLECPPPPPTHRNCMPIH---PGPDLCCPAYAC  210 (1789)
Q Consensus       147 CLVDGQVYANGSDVPKpEPCEV~CYCI--NGiVVCDEVECelPPeNlKDCTPV~---IPPGECCPTYIC  210 (1789)
                      |+++|..|.+|..+...+||. .=+|.  .|.+.  ...|..+. ...+|....   .+.=.|||.|+|
T Consensus         1 C~y~~~~~~~g~~~~~~~pC~-~~~C~~~~~~v~--v~~C~~~~-~~~~C~~~~~~~~~yP~CCp~~~C   65 (65)
T PF15430_consen    1 CYYNGRTIPSGESYNPEEPCE-RWTCDASDGYVT--VEGCPPPA-PPPGCELVPDPTGPYPDCCPKYVC   65 (65)
T ss_pred             CEECCEEcCCCcEecCCCCCc-eEEEECCCCEEE--EEeCCCCC-CCCCCEEccCCCCCCCCCCCCccC
Confidence            889999999998787899999 88899  45543  67777743 245887665   223369999998


No 6  
>PF14828 Amnionless:  Amnionless
Probab=93.52  E-value=0.035  Score=63.12  Aligned_cols=54  Identities=30%  Similarity=0.675  Sum_probs=40.2

Q ss_pred             EccCCCCCCCCCCCc--cceEEeCCc---ceeeec--cCCCCCCCCCCCCCCCCCCCCCCCCCcCCCC
Q psy9452         153 LHSNNSRIETKNPCH--VQCLCVNGA---PSCESL--ECPPPPPTHRNCMPIHPGPDLCCPAYACDGP  213 (1789)
Q Consensus       153 VYANGSDVPKpEPCE--V~CYCINGi---VVCDEV--ECelPPeNlKDCTPV~IPPGECCPTYICPdd  213 (1789)
                      .|..++.......|.  ..|.|.+..   ..|+-+  .|+.+     .|.....|.|+|||  +|..-
T Consensus       175 F~g~~~~~v~~~~C~d~~gC~C~n~~~l~~ICs~v~~~C~~~-----~C~~pl~P~GhCC~--iCGa~  235 (437)
T PF14828_consen  175 FHGPGSLRVTPYRCSDPSGCPCGNDEVLEWICSNVLQRCPKP-----HCRSPLRPEGHCCP--ICGAI  235 (437)
T ss_pred             ecCCceEEEecccccccccCccCcccchhhhhHHhhCcCCCC-----ccCCCCCCCCCchh--hcceE
Confidence            333343445666773  379998766   579888  99984     89999999999999  99543


No 7  
>KOG1544|consensus
Probab=71.40  E-value=2.7  Score=48.20  Aligned_cols=48  Identities=21%  Similarity=0.501  Sum_probs=38.6

Q ss_pred             CCCCCCCCCcccCCEEccCCCCCCCCCCCccceEEeCCcceeeeccCCCCCC
Q psy9452         138 VNESPANGDCIIDGQLHSNNSRIETKNPCHVQCLCVNGAPSCESLECPPPPP  189 (1789)
Q Consensus       138 PsGIPGEGDCLVDGQVYANGSDVPKpEPCEV~CYCINGiVVCDEVECelPPe  189 (1789)
                      |+-.|- .+|+++|+.|.+|.-+|  +-|. .|.|.++.-.|+.--|...|.
T Consensus       104 p~~Qp~-~gc~~gg~~y~~G~t~~--~NCn-~CTC~n~qWKCdq~~CLv~Pd  151 (470)
T KOG1544|consen  104 PPFQPI-QGCMHGGRIYPVGGTYW--DNCN-RCTCQNRQWKCDQEPCLVDPD  151 (470)
T ss_pred             CCCCCh-hhcccCceecccCCeee--cccc-ceeecCCceecCCceeecCHH
Confidence            333343 46999999999995444  4699 999999999999999998653


No 8  
>PF05825 PSP94:  Beta-microseminoprotein (PSP-94);  InterPro: IPR008735 This family consists of the mammalian specific protein beta-microseminoprotein. Prostatic secretory protein of 94 amino acids (PSP94), also called beta-microseminoprotein, is a small, nonglycosylated protein, rich in cysteine residues. It was first isolated as a major protein from Homo sapiens seminal plasma []. The exact function of this protein is unknown.; PDB: 2IZ4_A 1XHH_A 3IX0_C 2IZ3_A.
Probab=56.44  E-value=7.3  Score=37.55  Aligned_cols=52  Identities=27%  Similarity=0.504  Sum_probs=37.7

Q ss_pred             Ccc-cCCEEccCCCCCCCCCCCccceEEeCCcceeeeccCCCCCCCCCCCCCCCC
Q psy9452         146 DCI-IDGQLHSNNSRIETKNPCHVQCLCVNGAPSCESLECPPPPPTHRNCMPIHP  199 (1789)
Q Consensus       146 DCL-VDGQVYANGSDVPKpEPCEV~CYCINGiVVCDEVECelPPeNlKDCTPV~I  199 (1789)
                      .|. .+|+.|.-| ..|+.+-|. .|+|....+.|-.....+-.-+..+|..++.
T Consensus        17 ~C~D~dG~~h~l~-S~W~T~~C~-~C~C~~~Gi~CC~~~~~Pv~yd~~~C~~if~   69 (94)
T PF05825_consen   17 ECKDWDGKKHPLN-STWKTDDCE-ECSCSEDGISCCSTYAIPVDYDKEKCEVIFN   69 (94)
T ss_dssp             CEEETTTEEEETT-EEEEETTSE-EEEEESSEEEEEESS-EEECS-TTTEEEEEE
T ss_pred             ccEeCCCcEEeCC-CeECCCCCc-EEEecCCceEeccCccccCcCCcccceEEec
Confidence            587 899999999 999999999 9999999999944443210011346776653


No 9  
>PF05375 Pacifastin_I:  Pacifastin inhibitor (LCMII);  InterPro: IPR008037 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  This family of serine protease inhibitors belong to MEROPS inhibitor family I19, clan IW. They inhibit chymotrpsin, a peptidase belong to the S1 family (IPR001254 from INTERPRO) []. They were first isolated from Locusta migratoria migratoria(migratory locust). These were HI, LMCI-1 (PMP-D2) and LMCI-2 (PMP-C) [, , ]; five additional members SGPI-1 to 5 were identified in Schistocerca gregaria (desert locust) [, ], and a heterodimeric serine protease inhibitor (pacifastin) was isolated from the hemolymph of Pacifastacus leniusculus (Signal crayfish) [].  Pacifastin is a 155kDa composed of two covalently linked subunits, which are separately encoded. The heavy chain of pacifastin (105 kDa) is related to transferrins, containing three transferrin lobes, two of which seem to be active for iron binding []. A number of the members of the transferrin family are also serine peptidases belong to MEROPS peptidase family S60 (IPR001156 from INTERPRO). The light chain of pacifastin (44 kDa) is the proteinase inhibitory subunit, and has nine cysteine-rich inhibitory domains that are homologous to each other. The locust inhibitors share a conserved array of six cysteine residues with the pacifastin light chain. The structure of members of this family reveal that they are comprised of a triple-stranded antiparallel beta-sheet connected by three disulphide bridges []. The biological function(s) of the locust inhibitors is (are) not fully understood. LMCI-1 and LMCI-2 were shown to inhibit the endogenous proteolytic activating cascade of prophenoloxidase []. Expression analysis shows that the genes encoding the SGPI precursors are differentially expressed in a time-, stage- and hormone-dependent manner.; GO: 0030414 peptidase inhibitor activity; PDB: 2VU8_I 1WO9_A 1GL1_K 1PMC_A 2XTT_A 2F91_B 1KJ0_A 1GL0_I 1KGM_A 3TVJ_I.
Probab=54.30  E-value=7.2  Score=32.60  Aligned_cols=21  Identities=38%  Similarity=1.160  Sum_probs=17.6

Q ss_pred             CccceEEe-CCcceeeeccCCCC
Q psy9452         166 CHVQCLCV-NGAPSCESLECPPP  187 (1789)
Q Consensus       166 CEV~CYCI-NGiVVCDEVECelP  187 (1789)
                      |. .|+|. +|...|..+.|...
T Consensus        12 CN-~C~C~~~G~~~CT~~~C~~~   33 (39)
T PF05375_consen   12 CN-TCTCSSDGKWACTRKACPPE   33 (39)
T ss_dssp             TE-EEEEBTTSSEEEE-SSSS--
T ss_pred             CC-CCCCCCCCcCccCcccCcCC
Confidence            99 99999 99999999999873


No 10 
>PF08097 Toxin_26:  Conotoxin T-superfamily;  InterPro: IPR012631 This family consists of the T-superfamily of conotoxins. Eight different T-superfamily peptides from five Conus species were identified. These peptides share a consensus signal sequence, and a conserved arrangement of cysteine residues. T-superfamily peptides were found expressed in venom ducts of all major feeding types of Conus, suggesting that the T-superfamily is a large and diverse group of peptides, widely distributed in the 500 different Conus species [].; GO: 0005576 extracellular region
Probab=25.10  E-value=28  Score=23.62  Aligned_cols=8  Identities=63%  Similarity=2.153  Sum_probs=6.0

Q ss_pred             CCCC--CCcC
Q psy9452         203 LCCP--AYAC  210 (1789)
Q Consensus       203 ECCP--TYIC  210 (1789)
                      .|||  +|-|
T Consensus         1 fccpvirycc   10 (11)
T PF08097_consen    1 FCCPVIRYCC   10 (11)
T ss_pred             CCcchhheec
Confidence            3899  7766


No 11 
>PF12786 GBV-C_env:  GB virus C genotype envelope;  InterPro: IPR024347 This domain represents an envelope protein found in polyproteins from the ssRNA GB virus, mainly genotype C.
Probab=22.47  E-value=88  Score=36.72  Aligned_cols=65  Identities=18%  Similarity=0.223  Sum_probs=44.3

Q ss_pred             CCCCCCcccCCEEccCCCCCCC-CCCCccceEEeCCcceeeeccCCCCCC---CCCCCCCCCCCCCCCCCCCc
Q psy9452         141 SPANGDCIIDGQLHSNNSRIET-KNPCHVQCLCVNGAPSCESLECPPPPP---THRNCMPIHPGPDLCCPAYA  209 (1789)
Q Consensus       141 IPGEGDCLVDGQVYANGSDVPK-pEPCEV~CYCINGiVVCDEVECelPPe---NlKDCTPV~IPPGECCPTYI  209 (1789)
                      +=..+.|.++|.+...+..+|- .+-=. .|.|..|...=....|.---=   .+.  -.....| -|||.|+
T Consensus        87 ~~d~C~C~~~Gsr~p~~~~~~~rgN~T~-vC~Cpfg~~~Wlp~lCsgl~W~dg~~~--g~~~~~P-~~CP~~V  155 (413)
T PF12786_consen   87 TFDSCVCRYNGSRVPPGEKPWDRGNGTQ-VCDCPFGPMVWLPALCSGLAWGDGIVH--GTQNDLP-LSCPRYV  155 (413)
T ss_pred             cccceeecCCCCcCCCCCCccccCCeeE-EEeCCCCceEeehhhheecccCCCccc--CCcccCC-CCCchhe
Confidence            3345679999999999988875 45545 999999998777777765211   011  2233334 7999877


Done!