Query psy9452
Match_columns 1789
No_of_seqs 98 out of 100
Neff 1.2
Searched_HMMs 46136
Date Fri Aug 16 19:42:41 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9452.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9452hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 smart00214 VWC von Willebrand 98.8 3.4E-09 7.4E-14 89.2 4.5 56 147-210 1-59 (59)
2 PF00093 VWC: von Willebrand f 98.8 3.7E-09 8E-14 88.7 2.7 55 147-210 1-57 (57)
3 smart00215 VWC_out von Willebr 97.2 0.00043 9.4E-09 62.0 4.1 38 147-187 1-38 (67)
4 PF12714 TILa: TILa domain 96.3 0.0035 7.7E-08 53.9 3.0 47 146-197 1-49 (56)
5 PF15430 SVWC: Single domain v 95.1 0.045 9.7E-07 45.8 5.3 60 147-210 1-65 (65)
6 PF14828 Amnionless: Amnionles 93.5 0.035 7.6E-07 63.1 1.8 54 153-213 175-235 (437)
7 KOG1544|consensus 71.4 2.7 5.8E-05 48.2 2.4 48 138-189 104-151 (470)
8 PF05825 PSP94: Beta-microsemi 56.4 7.3 0.00016 37.5 2.0 52 146-199 17-69 (94)
9 PF05375 Pacifastin_I: Pacifas 54.3 7.2 0.00016 32.6 1.4 21 166-187 12-33 (39)
10 PF08097 Toxin_26: Conotoxin T 25.1 28 0.00061 23.6 0.3 8 203-210 1-10 (11)
11 PF12786 GBV-C_env: GB virus C 22.5 88 0.0019 36.7 3.7 65 141-209 87-155 (413)
No 1
>smart00214 VWC von Willebrand factor (vWF) type C domain.
Probab=98.83 E-value=3.4e-09 Score=89.17 Aligned_cols=56 Identities=34% Similarity=1.045 Sum_probs=50.6
Q ss_pred cccCCEEccCCCCCCCCCCCccceEEeCCc-ceeeeccCCCCCCCCCCCCCCC--CCCCCCCCCCcC
Q psy9452 147 CIIDGQLHSNNSRIETKNPCHVQCLCVNGA-PSCESLECPPPPPTHRNCMPIH--PGPDLCCPAYAC 210 (1789)
Q Consensus 147 CLVDGQVYANGSDVPKpEPCEV~CYCINGi-VVCDEVECelPPeNlKDCTPV~--IPPGECCPTYIC 210 (1789)
|+++|++|.+| ..|.+++|. .|+|.+|. |.|..+.|..+ ..|.... ..+|+||| +|
T Consensus 1 C~~~g~~y~~G-~~W~~~~C~-~C~C~~g~~v~C~~~~Cp~~----~~C~~~~~~~~~g~CCp--~C 59 (59)
T smart00214 1 CVHNGEVYNDG-ETWKPDPCQ-ICTCLDGETVLCDPVECPPP----PDCPNPERVKPPGECCP--RC 59 (59)
T ss_pred CccCCEEeCCC-CEECCCCCe-ECCcCCCCEEEeeeecCCCC----CCCCCCcccCCCCCcCC--CC
Confidence 89999999999 999999999 99999999 99999999873 3788776 68999999 65
No 2
>PF00093 VWC: von Willebrand factor type C domain; InterPro: IPR001007 The vWF domain is found in various plasma proteins: complement factors B, C2, CR3 and CR4; the integrins (I-domains); collagen types VI, VII, XII and XIV; and other extracellular proteins [, , ]. Although the majority of VWA-containing proteins are extracellular, the most ancient ones present in all eukaryotes are all intracellular proteins involved in functions such as transcription, DNA repair, ribosomal and membrane transport and the proteasome. A common feature appears to be involvement in multiprotein complexes. Proteins that incorporate vWF domains participate in numerous biological events (e.g. cell adhesion, migration, homing, pattern formation, and signal transduction), involving interaction with a large array of ligands []. A number of human diseases arise from mutations in VWA domains. Secondary structure prediction from 75 aligned vWF sequences has revealed a largely alternating sequence of alpha-helices and beta-strands []. The domain is named after the von Willebrand factor (VWF) type C repeat which is found in multidomain protein/multifunctional proteins involved in maintaining homeostasis [, ]. For the von Willebrand factor the duplicated VWFC domain is thought to participate in oligomerization, but not in the initial dimerization step []. The presence of this region in a number of other complex-forming proteins points to the possible involvment of the VWFC domain in complex formation.; GO: 0005515 protein binding; PDB: 1U5M_A.
Probab=98.76 E-value=3.7e-09 Score=88.67 Aligned_cols=55 Identities=33% Similarity=0.957 Sum_probs=33.8
Q ss_pred cccCCEEccCCCCCCCCCCCccceEEeCCcceeeeccCCCCCCCCCCCCCC--CCCCCCCCCCCcC
Q psy9452 147 CIIDGQLHSNNSRIETKNPCHVQCLCVNGAPSCESLECPPPPPTHRNCMPI--HPGPDLCCPAYAC 210 (1789)
Q Consensus 147 CLVDGQVYANGSDVPKpEPCEV~CYCINGiVVCDEVECelPPeNlKDCTPV--~IPPGECCPTYIC 210 (1789)
|.++|++|.+| ..|.+++|. .|+|.+|.|.|..+.|+. ..|... ...+|+||| .|
T Consensus 1 C~~~g~~y~~g-~~w~~~~C~-~C~C~~G~v~C~~~~Cp~-----~~C~~~~~~~~~g~CCp--~C 57 (57)
T PF00093_consen 1 CSFNGRVYQNG-ESWHPDPCT-TCTCQDGEVQCSRIQCPP-----LDCPNPELVKPPGECCP--VC 57 (57)
T ss_dssp EESSS-EE-SS--EE-S-SSE-EEEEETTEEEEEE---S------SS-S-------SS-S----S-
T ss_pred CeeCCEEeCCC-CEECCCCCc-EeEecCCEEEEcCCCCcC-----CCCCCCceecCCCCcCC--CC
Confidence 88999999999 999999999 999999999999999997 278877 788999999 66
No 3
>smart00215 VWC_out von Willebrand factor (vWF) type C domain.
Probab=97.16 E-value=0.00043 Score=61.99 Aligned_cols=38 Identities=29% Similarity=0.728 Sum_probs=35.1
Q ss_pred cccCCEEccCCCCCCCCCCCccceEEeCCcceeeeccCCCC
Q psy9452 147 CIIDGQLHSNNSRIETKNPCHVQCLCVNGAPSCESLECPPP 187 (1789)
Q Consensus 147 CLVDGQVYANGSDVPKpEPCEV~CYCINGiVVCDEVECelP 187 (1789)
|+++|++|.+| ++|. +-|. .|+|++|.+.|+.+.|+..
T Consensus 1 C~~~G~~y~~g-~~w~-d~Cn-~CtC~~G~v~Ct~~~C~~~ 38 (67)
T smart00215 1 CYNNGSYYPPG-AKWD-DDCN-RCTCLNGRVSCTKVWCGPK 38 (67)
T ss_pred CeECCEEcCCC-Cccc-cCCC-CCEecCCCEEecCCcCCCc
Confidence 89999999999 7775 7899 9999999999999999984
No 4
>PF12714 TILa: TILa domain
Probab=96.25 E-value=0.0035 Score=53.87 Aligned_cols=47 Identities=28% Similarity=0.697 Sum_probs=39.7
Q ss_pred Ccc-cCCEEccCCCCCCCCCCCccceEE-eCCcceeeeccCCCCCCCCCCCCCC
Q psy9452 146 DCI-IDGQLHSNNSRIETKNPCHVQCLC-VNGAPSCESLECPPPPPTHRNCMPI 197 (1789)
Q Consensus 146 DCL-VDGQVYANGSDVPKpEPCEV~CYC-INGiVVCDEVECelPPeNlKDCTPV 197 (1789)
+|+ ++|+.|..|+.+|..+ |..+|+| .+|.|.|....|.. ..-|...
T Consensus 1 GC~d~~G~yy~~Ge~~~~~~-C~~~C~C~~~g~v~C~~~~C~~----~e~C~~~ 49 (56)
T PF12714_consen 1 GCTDYNGRYYPPGESWWTDD-CTQRCTCQPNGQVQCQPSSCPP----GEVCQIQ 49 (56)
T ss_pred CCcCcCCEEECCCCEEeCCC-CCEeEEEcCCCeEEEeCCCCCC----CCEeEeC
Confidence 599 9999999997666666 9999999 88999999999998 4456554
No 5
>PF15430 SVWC: Single domain von Willebrand factor type C
Probab=95.13 E-value=0.045 Score=45.80 Aligned_cols=60 Identities=27% Similarity=0.727 Sum_probs=45.2
Q ss_pred cccCCEEccCCCCCCCCCCCccceEEe--CCcceeeeccCCCCCCCCCCCCCCC---CCCCCCCCCCcC
Q psy9452 147 CIIDGQLHSNNSRIETKNPCHVQCLCV--NGAPSCESLECPPPPPTHRNCMPIH---PGPDLCCPAYAC 210 (1789)
Q Consensus 147 CLVDGQVYANGSDVPKpEPCEV~CYCI--NGiVVCDEVECelPPeNlKDCTPV~---IPPGECCPTYIC 210 (1789)
|+++|..|.+|..+...+||. .=+|. .|.+. ...|..+. ...+|.... .+.=.|||.|+|
T Consensus 1 C~y~~~~~~~g~~~~~~~pC~-~~~C~~~~~~v~--v~~C~~~~-~~~~C~~~~~~~~~yP~CCp~~~C 65 (65)
T PF15430_consen 1 CYYNGRTIPSGESYNPEEPCE-RWTCDASDGYVT--VEGCPPPA-PPPGCELVPDPTGPYPDCCPKYVC 65 (65)
T ss_pred CEECCEEcCCCcEecCCCCCc-eEEEECCCCEEE--EEeCCCCC-CCCCCEEccCCCCCCCCCCCCccC
Confidence 889999999998787899999 88899 45543 67777743 245887665 223369999998
No 6
>PF14828 Amnionless: Amnionless
Probab=93.52 E-value=0.035 Score=63.12 Aligned_cols=54 Identities=30% Similarity=0.675 Sum_probs=40.2
Q ss_pred EccCCCCCCCCCCCc--cceEEeCCc---ceeeec--cCCCCCCCCCCCCCCCCCCCCCCCCCcCCCC
Q psy9452 153 LHSNNSRIETKNPCH--VQCLCVNGA---PSCESL--ECPPPPPTHRNCMPIHPGPDLCCPAYACDGP 213 (1789)
Q Consensus 153 VYANGSDVPKpEPCE--V~CYCINGi---VVCDEV--ECelPPeNlKDCTPV~IPPGECCPTYICPdd 213 (1789)
.|..++.......|. ..|.|.+.. ..|+-+ .|+.+ .|.....|.|+||| +|..-
T Consensus 175 F~g~~~~~v~~~~C~d~~gC~C~n~~~l~~ICs~v~~~C~~~-----~C~~pl~P~GhCC~--iCGa~ 235 (437)
T PF14828_consen 175 FHGPGSLRVTPYRCSDPSGCPCGNDEVLEWICSNVLQRCPKP-----HCRSPLRPEGHCCP--ICGAI 235 (437)
T ss_pred ecCCceEEEecccccccccCccCcccchhhhhHHhhCcCCCC-----ccCCCCCCCCCchh--hcceE
Confidence 333343445666773 379998766 579888 99984 89999999999999 99543
No 7
>KOG1544|consensus
Probab=71.40 E-value=2.7 Score=48.20 Aligned_cols=48 Identities=21% Similarity=0.501 Sum_probs=38.6
Q ss_pred CCCCCCCCCcccCCEEccCCCCCCCCCCCccceEEeCCcceeeeccCCCCCC
Q psy9452 138 VNESPANGDCIIDGQLHSNNSRIETKNPCHVQCLCVNGAPSCESLECPPPPP 189 (1789)
Q Consensus 138 PsGIPGEGDCLVDGQVYANGSDVPKpEPCEV~CYCINGiVVCDEVECelPPe 189 (1789)
|+-.|- .+|+++|+.|.+|.-+| +-|. .|.|.++.-.|+.--|...|.
T Consensus 104 p~~Qp~-~gc~~gg~~y~~G~t~~--~NCn-~CTC~n~qWKCdq~~CLv~Pd 151 (470)
T KOG1544|consen 104 PPFQPI-QGCMHGGRIYPVGGTYW--DNCN-RCTCQNRQWKCDQEPCLVDPD 151 (470)
T ss_pred CCCCCh-hhcccCceecccCCeee--cccc-ceeecCCceecCCceeecCHH
Confidence 333343 46999999999995444 4699 999999999999999998653
No 8
>PF05825 PSP94: Beta-microseminoprotein (PSP-94); InterPro: IPR008735 This family consists of the mammalian specific protein beta-microseminoprotein. Prostatic secretory protein of 94 amino acids (PSP94), also called beta-microseminoprotein, is a small, nonglycosylated protein, rich in cysteine residues. It was first isolated as a major protein from Homo sapiens seminal plasma []. The exact function of this protein is unknown.; PDB: 2IZ4_A 1XHH_A 3IX0_C 2IZ3_A.
Probab=56.44 E-value=7.3 Score=37.55 Aligned_cols=52 Identities=27% Similarity=0.504 Sum_probs=37.7
Q ss_pred Ccc-cCCEEccCCCCCCCCCCCccceEEeCCcceeeeccCCCCCCCCCCCCCCCC
Q psy9452 146 DCI-IDGQLHSNNSRIETKNPCHVQCLCVNGAPSCESLECPPPPPTHRNCMPIHP 199 (1789)
Q Consensus 146 DCL-VDGQVYANGSDVPKpEPCEV~CYCINGiVVCDEVECelPPeNlKDCTPV~I 199 (1789)
.|. .+|+.|.-| ..|+.+-|. .|+|....+.|-.....+-.-+..+|..++.
T Consensus 17 ~C~D~dG~~h~l~-S~W~T~~C~-~C~C~~~Gi~CC~~~~~Pv~yd~~~C~~if~ 69 (94)
T PF05825_consen 17 ECKDWDGKKHPLN-STWKTDDCE-ECSCSEDGISCCSTYAIPVDYDKEKCEVIFN 69 (94)
T ss_dssp CEEETTTEEEETT-EEEEETTSE-EEEEESSEEEEEESS-EEECS-TTTEEEEEE
T ss_pred ccEeCCCcEEeCC-CeECCCCCc-EEEecCCceEeccCccccCcCCcccceEEec
Confidence 587 899999999 999999999 9999999999944443210011346776653
No 9
>PF05375 Pacifastin_I: Pacifastin inhibitor (LCMII); InterPro: IPR008037 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This family of serine protease inhibitors belong to MEROPS inhibitor family I19, clan IW. They inhibit chymotrpsin, a peptidase belong to the S1 family (IPR001254 from INTERPRO) []. They were first isolated from Locusta migratoria migratoria(migratory locust). These were HI, LMCI-1 (PMP-D2) and LMCI-2 (PMP-C) [, , ]; five additional members SGPI-1 to 5 were identified in Schistocerca gregaria (desert locust) [, ], and a heterodimeric serine protease inhibitor (pacifastin) was isolated from the hemolymph of Pacifastacus leniusculus (Signal crayfish) []. Pacifastin is a 155kDa composed of two covalently linked subunits, which are separately encoded. The heavy chain of pacifastin (105 kDa) is related to transferrins, containing three transferrin lobes, two of which seem to be active for iron binding []. A number of the members of the transferrin family are also serine peptidases belong to MEROPS peptidase family S60 (IPR001156 from INTERPRO). The light chain of pacifastin (44 kDa) is the proteinase inhibitory subunit, and has nine cysteine-rich inhibitory domains that are homologous to each other. The locust inhibitors share a conserved array of six cysteine residues with the pacifastin light chain. The structure of members of this family reveal that they are comprised of a triple-stranded antiparallel beta-sheet connected by three disulphide bridges []. The biological function(s) of the locust inhibitors is (are) not fully understood. LMCI-1 and LMCI-2 were shown to inhibit the endogenous proteolytic activating cascade of prophenoloxidase []. Expression analysis shows that the genes encoding the SGPI precursors are differentially expressed in a time-, stage- and hormone-dependent manner.; GO: 0030414 peptidase inhibitor activity; PDB: 2VU8_I 1WO9_A 1GL1_K 1PMC_A 2XTT_A 2F91_B 1KJ0_A 1GL0_I 1KGM_A 3TVJ_I.
Probab=54.30 E-value=7.2 Score=32.60 Aligned_cols=21 Identities=38% Similarity=1.160 Sum_probs=17.6
Q ss_pred CccceEEe-CCcceeeeccCCCC
Q psy9452 166 CHVQCLCV-NGAPSCESLECPPP 187 (1789)
Q Consensus 166 CEV~CYCI-NGiVVCDEVECelP 187 (1789)
|. .|+|. +|...|..+.|...
T Consensus 12 CN-~C~C~~~G~~~CT~~~C~~~ 33 (39)
T PF05375_consen 12 CN-TCTCSSDGKWACTRKACPPE 33 (39)
T ss_dssp TE-EEEEBTTSSEEEE-SSSS--
T ss_pred CC-CCCCCCCCcCccCcccCcCC
Confidence 99 99999 99999999999873
No 10
>PF08097 Toxin_26: Conotoxin T-superfamily; InterPro: IPR012631 This family consists of the T-superfamily of conotoxins. Eight different T-superfamily peptides from five Conus species were identified. These peptides share a consensus signal sequence, and a conserved arrangement of cysteine residues. T-superfamily peptides were found expressed in venom ducts of all major feeding types of Conus, suggesting that the T-superfamily is a large and diverse group of peptides, widely distributed in the 500 different Conus species [].; GO: 0005576 extracellular region
Probab=25.10 E-value=28 Score=23.62 Aligned_cols=8 Identities=63% Similarity=2.153 Sum_probs=6.0
Q ss_pred CCCC--CCcC
Q psy9452 203 LCCP--AYAC 210 (1789)
Q Consensus 203 ECCP--TYIC 210 (1789)
.||| +|-|
T Consensus 1 fccpvirycc 10 (11)
T PF08097_consen 1 FCCPVIRYCC 10 (11)
T ss_pred CCcchhheec
Confidence 3899 7766
No 11
>PF12786 GBV-C_env: GB virus C genotype envelope; InterPro: IPR024347 This domain represents an envelope protein found in polyproteins from the ssRNA GB virus, mainly genotype C.
Probab=22.47 E-value=88 Score=36.72 Aligned_cols=65 Identities=18% Similarity=0.223 Sum_probs=44.3
Q ss_pred CCCCCCcccCCEEccCCCCCCC-CCCCccceEEeCCcceeeeccCCCCCC---CCCCCCCCCCCCCCCCCCCc
Q psy9452 141 SPANGDCIIDGQLHSNNSRIET-KNPCHVQCLCVNGAPSCESLECPPPPP---THRNCMPIHPGPDLCCPAYA 209 (1789)
Q Consensus 141 IPGEGDCLVDGQVYANGSDVPK-pEPCEV~CYCINGiVVCDEVECelPPe---NlKDCTPV~IPPGECCPTYI 209 (1789)
+=..+.|.++|.+...+..+|- .+-=. .|.|..|...=....|.---= .+. -.....| -|||.|+
T Consensus 87 ~~d~C~C~~~Gsr~p~~~~~~~rgN~T~-vC~Cpfg~~~Wlp~lCsgl~W~dg~~~--g~~~~~P-~~CP~~V 155 (413)
T PF12786_consen 87 TFDSCVCRYNGSRVPPGEKPWDRGNGTQ-VCDCPFGPMVWLPALCSGLAWGDGIVH--GTQNDLP-LSCPRYV 155 (413)
T ss_pred cccceeecCCCCcCCCCCCccccCCeeE-EEeCCCCceEeehhhheecccCCCccc--CCcccCC-CCCchhe
Confidence 3345679999999999988875 45545 999999998777777765211 011 2233334 7999877
Done!