Query 033708
Match_columns 113
No_of_seqs 59 out of 61
Neff 3.6
Searched_HMMs 46136
Date Fri Mar 29 05:22:38 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/033708.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/033708hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF04749 PLAC8: PLAC8 family; 65.6 4.6 9.9E-05 26.8 1.7 16 69-84 68-83 (106)
2 TIGR01571 A_thal_Cys_rich unch 59.9 6.5 0.00014 27.4 1.7 16 70-85 65-80 (104)
3 PF13980 UPF0370: Uncharacteri 53.5 22 0.00048 24.0 3.4 13 9-21 11-23 (63)
4 COG4856 Uncharacterized protei 52.7 11 0.00024 33.3 2.3 21 1-21 5-25 (403)
5 PF10731 Anophelin: Thrombin i 40.1 45 0.00097 22.7 3.2 26 7-32 8-38 (65)
6 PF05835 Synaphin: Synaphin pr 36.8 18 0.0004 27.3 1.1 15 72-86 65-79 (139)
7 PF07849 DUF1641: Protein of u 35.2 39 0.00085 20.3 2.2 22 1-23 20-41 (42)
8 PRK10244 anti-RssB factor; Pro 33.5 41 0.00088 24.0 2.3 20 6-25 25-44 (88)
9 PRK10588 hypothetical protein; 33.5 34 0.00075 24.6 2.0 16 6-21 16-31 (97)
10 TIGR02112 cyd_oper_ybgE cyd op 30.5 41 0.00089 24.0 2.0 16 6-21 12-27 (93)
11 PF07172 GRP: Glycine rich pro 29.3 57 0.0012 22.9 2.5 14 6-19 5-18 (95)
12 PF11299 DUF3100: Protein of u 28.4 23 0.00051 29.3 0.5 12 75-86 130-141 (241)
13 PF01841 Transglut_core: Trans 26.0 71 0.0015 20.7 2.4 19 8-26 55-73 (113)
14 PF09600 Cyd_oper_YbgE: Cyd op 26.0 67 0.0015 22.2 2.4 15 7-21 2-16 (82)
15 PRK13664 hypothetical protein; 22.0 1.8E+02 0.0038 19.7 3.7 17 5-21 8-24 (62)
16 PF06233 Usg: Usg-like family; 21.4 26 0.00057 24.8 -0.4 19 74-92 1-19 (82)
17 KOG3315 Transport protein part 20.8 1.1E+02 0.0023 24.7 2.9 43 3-48 82-130 (191)
18 PF04202 Mfp-3: Foot protein 3 20.7 1.3E+02 0.0027 20.8 2.8 12 10-21 6-17 (71)
19 PF03216 Rhabdo_ncap_2: Rhabdo 20.5 77 0.0017 27.6 2.2 18 4-21 56-73 (357)
No 1
>PF04749 PLAC8: PLAC8 family; InterPro: IPR006461 This group of sequences are described by a region of about 170 amino acids. These proteins have highly divergent N-terminal regions rich in low complexity sequence. PSI-BLAST reveals no clear similarity to any characterised protein. It is common in plants but found also in Homo sapiens (Human), Dictyostelium, and Leishmania; at least 12 distinct members are found in Arabidopsis. Most members of this family contain more than 10 per cent Cys, but no Cys residue is invariant across the family.
Probab=65.58 E-value=4.6 Score=26.83 Aligned_cols=16 Identities=31% Similarity=0.650 Sum_probs=13.1
Q ss_pred chhhhHHHHhhCCCCC
Q 033708 69 DAWSTRMREKYGLDTS 84 (113)
Q Consensus 69 DAWS~RmREKyGLdts 84 (113)
-.++..||||||+.-+
T Consensus 68 ~~~R~~iR~ry~I~g~ 83 (106)
T PF04749_consen 68 CSLRQQIRERYGIQGS 83 (106)
T ss_pred hhHHHHHHHHhCCCCC
Confidence 4568999999999764
No 2
>TIGR01571 A_thal_Cys_rich uncharacterized Cys-rich domain. This model describes an uncharacterized domain of about 100 residues. It is common in plants but found also in Homo sapiens, Dictyostelium, and Leishmania; at least 12 distinct members are found in Arabidopsis. Most members of this family contain more than 10 per cent Cys, but no Cys residue is invariant across the family.
Probab=59.89 E-value=6.5 Score=27.42 Aligned_cols=16 Identities=31% Similarity=0.619 Sum_probs=13.1
Q ss_pred hhhhHHHHhhCCCCCC
Q 033708 70 AWSTRMREKYGLDTSE 85 (113)
Q Consensus 70 AWS~RmREKyGLdts~ 85 (113)
..+.+||+|||+.-|.
T Consensus 65 ~~R~~~R~ry~i~gs~ 80 (104)
T TIGR01571 65 FIRIKLREKYGIQGAP 80 (104)
T ss_pred HHHHHHHHHhCCCCCC
Confidence 4579999999998754
No 3
>PF13980 UPF0370: Uncharacterised protein family (UPF0370)
Probab=53.54 E-value=22 Score=24.04 Aligned_cols=13 Identities=23% Similarity=0.588 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHh
Q 033708 9 ALIFLLAIIVRAA 21 (113)
Q Consensus 9 aLSilLAmiLRAl 21 (113)
+|.||++||+-++
T Consensus 11 iLl~lvG~i~n~i 23 (63)
T PF13980_consen 11 ILLILVGMIINGI 23 (63)
T ss_pred HHHHHHHHHHHHH
Confidence 8999999997654
No 4
>COG4856 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=52.74 E-value=11 Score=33.27 Aligned_cols=21 Identities=29% Similarity=0.738 Sum_probs=18.3
Q ss_pred CCchhHHHHHHHHHHHHHHHh
Q 033708 1 MNEPWNLQALIFLLAIIVRAA 21 (113)
Q Consensus 1 ~~~~~~~QaLSilLAmiLRAl 21 (113)
.|.||.+-.++||||++|=-+
T Consensus 5 lns~W~irIiaff~A~~Lfl~ 25 (403)
T COG4856 5 LNSPWLIRIIAFFFAILLFLY 25 (403)
T ss_pred hcCcHhHHHHHHHHHHHhhee
Confidence 378999999999999998654
No 5
>PF10731 Anophelin: Thrombin inhibitor from mosquito; InterPro: IPR018932 Members of this family are all inhibitors of thrombin, the peptidase that is at the end of the blood coagulation cascade and which creates the clot by cleaving fibrinogen. The interaction between thrombin and fibrinogen involves two different areas of contact - via the thrombin active site and via a second substrate-binding site known as an exosite. The inhibitor acts by blocking the exosite, rather than by interacting with the active site. The inhibitors are from mosquitoes that feed on human blood and which, by inhibiting thrombin, prevent the blood from clotting and keep it flowing.
Probab=40.14 E-value=45 Score=22.69 Aligned_cols=26 Identities=35% Similarity=0.544 Sum_probs=13.4
Q ss_pred HHHHHHHHHHHHHHh-----cCCCCCCCccc
Q 033708 7 LQALIFLLAIIVRAA-----NRPVEYDSDDE 32 (113)
Q Consensus 7 ~QaLSilLAmiLRAl-----~~~~~YDSDDe 32 (113)
+-.|-+.|+-|+.+. |..+.||.||+
T Consensus 8 ialLC~aLva~vQ~APQYa~GeeP~YDEdd~ 38 (65)
T PF10731_consen 8 IALLCVALVAIVQSAPQYAPGEEPSYDEDDD 38 (65)
T ss_pred HHHHHHHHHHHHhcCcccCCCCCCCcCcccC
Confidence 333444444466655 22234988774
No 6
>PF05835 Synaphin: Synaphin protein; InterPro: IPR008849 This family consists of several eukaryotic synaphin 1 and 2 proteins. Synaphin/complexin is a cytosolic protein that preferentially binds to syntaxin within the SNARE complex. Synaphin promotes SNAREs to form precomplexes that oligomerise into higher order structures. A peptide from the central, syntaxin binding domain of synaphin competitively inhibits these two proteins from interacting and prevents SNARE complexes from oligomerising. It is thought that oligomerisation of SNARE complexes into a higher order structure creates a SNARE scaffold for efficient, regulated fusion of synaptic vesicles []. Synaphin promotes neuronal exocytosis by promoting interaction between the complementary syntaxin and synaptobrevin transmembrane regions that reside in opposing membranes prior to fusion [].; GO: 0019905 syntaxin binding, 0006836 neurotransmitter transport; PDB: 3RL0_m 3RK3_E 1L4A_E 1KIL_E.
Probab=36.85 E-value=18 Score=27.34 Aligned_cols=15 Identities=40% Similarity=0.718 Sum_probs=8.1
Q ss_pred hhHHHHhhCCCCCCc
Q 033708 72 STRMREKYGLDTSEF 86 (113)
Q Consensus 72 S~RmREKyGLdts~f 86 (113)
+..||+||||.-++.
T Consensus 65 Rq~IRdKY~l~k~e~ 79 (139)
T PF05835_consen 65 RQHIRDKYGLKKKEE 79 (139)
T ss_dssp HHHHHHHHT------
T ss_pred HHHHHhhcccccccc
Confidence 578999999998774
No 7
>PF07849 DUF1641: Protein of unknown function (DUF1641); InterPro: IPR012440 Archaeal and bacterial hypothetical proteins are found in this family, with the region in question being approximately 40 residues long.
Probab=35.21 E-value=39 Score=20.34 Aligned_cols=22 Identities=32% Similarity=0.502 Sum_probs=18.0
Q ss_pred CCchhHHHHHHHHHHHHHHHhcC
Q 033708 1 MNEPWNLQALIFLLAIIVRAANR 23 (113)
Q Consensus 1 ~~~~~~~QaLSilLAmiLRAl~~ 23 (113)
|+.|.|-.+|.|+|++ ||++++
T Consensus 20 l~DpdvqrgL~~ll~~-lk~lGk 41 (42)
T PF07849_consen 20 LRDPDVQRGLGFLLAF-LKALGK 41 (42)
T ss_pred HcCHHHHHHHHHHHHH-HHHHcC
Confidence 5789999999999985 677765
No 8
>PRK10244 anti-RssB factor; Provisional
Probab=33.54 E-value=41 Score=24.04 Aligned_cols=20 Identities=20% Similarity=0.390 Sum_probs=17.4
Q ss_pred HHHHHHHHHHHHHHHhcCCC
Q 033708 6 NLQALIFLLAIIVRAANRPV 25 (113)
Q Consensus 6 ~~QaLSilLAmiLRAl~~~~ 25 (113)
-+||+++|++-+||.|+.+.
T Consensus 25 QVEAlellitAlL~~~~~~~ 44 (88)
T PRK10244 25 QVEALELLVTAMLRTMGKNG 44 (88)
T ss_pred HHHHHHHHHHHHHHHhChhh
Confidence 37899999999999997764
No 9
>PRK10588 hypothetical protein; Provisional
Probab=33.54 E-value=34 Score=24.59 Aligned_cols=16 Identities=25% Similarity=0.542 Sum_probs=13.9
Q ss_pred HHHHHHHHHHHHHHHh
Q 033708 6 NLQALIFLLAIIVRAA 21 (113)
Q Consensus 6 ~~QaLSilLAmiLRAl 21 (113)
.+++||++||+.+=++
T Consensus 16 plRaLSliLAl~la~~ 31 (97)
T PRK10588 16 PLRALSLVMALLLAGC 31 (97)
T ss_pred hHHHHHHHHHHHHHHH
Confidence 5789999999998775
No 10
>TIGR02112 cyd_oper_ybgE cyd operon protein YbgE. This model describes a small protein of unknown function, about 100 amino acids in length, essentially always found in an operon with CydAB, subunits of the cytochrome d terminal oxidase. It appears to be an integral membrane protein. It is found so far only in the Proteobacteria.
Probab=30.46 E-value=41 Score=23.99 Aligned_cols=16 Identities=38% Similarity=0.611 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHHHHh
Q 033708 6 NLQALIFLLAIIVRAA 21 (113)
Q Consensus 6 ~~QaLSilLAmiLRAl 21 (113)
.+++||++||+.+=++
T Consensus 12 ~lraLSlilAl~la~~ 27 (93)
T TIGR02112 12 LLRALSFILAFLLAGC 27 (93)
T ss_pred hHHHHHHHHHHHHHHH
Confidence 4689999999988776
No 11
>PF07172 GRP: Glycine rich protein family; InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=29.27 E-value=57 Score=22.94 Aligned_cols=14 Identities=29% Similarity=0.297 Sum_probs=7.9
Q ss_pred HHHHHHHHHHHHHH
Q 033708 6 NLQALIFLLAIIVR 19 (113)
Q Consensus 6 ~~QaLSilLAmiLR 19 (113)
++=.|.||||++|=
T Consensus 5 ~~llL~l~LA~lLl 18 (95)
T PF07172_consen 5 AFLLLGLLLAALLL 18 (95)
T ss_pred HHHHHHHHHHHHHH
Confidence 34456666666653
No 12
>PF11299 DUF3100: Protein of unknown function (DUF3100); InterPro: IPR021450 Some members in this family of proteins are annotated as membrane proteins however this cannot be confirmed. Currently no function is known.
Probab=28.38 E-value=23 Score=29.26 Aligned_cols=12 Identities=58% Similarity=0.902 Sum_probs=10.3
Q ss_pred HHHhhCCCCCCc
Q 033708 75 MREKYGLDTSEF 86 (113)
Q Consensus 75 mREKyGLdts~f 86 (113)
+-||||+|+.|+
T Consensus 130 I~ekYGldSpEg 141 (241)
T PF11299_consen 130 ISEKYGLDSPEG 141 (241)
T ss_pred eehhcCCCCccc
Confidence 679999999874
No 13
>PF01841 Transglut_core: Transglutaminase-like superfamily; InterPro: IPR002931 This domain is found in many proteins known to have transglutaminase activity, i.e. which cross-link proteins through an acyl-transfer reaction between the gamma-carboxamide group of peptide-bound glutamine and the epsilon-amino group of peptide-bound lysine, resulting in a epsilon-(gamma-glutamyl)lysine isopeptide bond. Tranglutaminases have been found in a diverse range of species, from bacteria through to mammals. The enzymes require calcium binding and their activity leads to post-translational modification of proteins through acyl-transfer reactions, involving peptidyl glutamine residues as acyl donors and a variety of primary amines as acyl acceptors, with the generation of proteinase resistant isopeptide bonds []. Sequence conservation in this superfamily primarily involves three motifs that centre around conserved cysteine, histidine, and aspartate residues that form the catalytic triad in the structurally characterised transglutaminase, the human blood clotting factor XIIIa' []. On the basis of the experimentally demonstrated activity of the Methanobacterium phage psiM2 pseudomurein endoisopeptidase [], it is proposed that many, if not all, microbial homologs of the transglutaminases are proteases and that the eukaryotic transglutaminases have evolved from an ancestral protease []. A subunit of plasma Factor XIII revealed that each Factor XIIIA subunit is composed of four domains (termed N-terminal beta-sandwich, core domain (containing the catalytic and the regulatory sites), and C-terminal beta-barrels 1 and 2) and that two monomers assemble into the native dimer through the surfaces in domains 1 and 2, in opposite orientation. This organisation in four domains is highly conserved during evolution among transglutaminase isoforms [].; PDB: 2F4M_A 2F4O_A 1GGY_B 1FIE_B 1GGU_B 1GGT_B 1F13_A 1QRK_B 1EVU_A 1EX0_B ....
Probab=26.05 E-value=71 Score=20.69 Aligned_cols=19 Identities=21% Similarity=0.347 Sum_probs=14.6
Q ss_pred HHHHHHHHHHHHHhcCCCC
Q 033708 8 QALIFLLAIIVRAANRPVE 26 (113)
Q Consensus 8 QaLSilLAmiLRAl~~~~~ 26 (113)
..++.||+.+||++|=++.
T Consensus 55 ~~~a~l~~allr~~Gipar 73 (113)
T PF01841_consen 55 EDYASLFVALLRALGIPAR 73 (113)
T ss_dssp HHHHHHHHHHHHHHT--EE
T ss_pred HHHHHHHHHHHhhCCCceE
Confidence 4678999999999988774
No 14
>PF09600 Cyd_oper_YbgE: Cyd operon protein YbgE (Cyd_oper_YbgE); InterPro: IPR011846 This entry describes a small protein of unknown function, about 100 amino acids in length, essentially always found in an operon with CydAB, subunits of the cytochrome d terminal oxidase. It appears to be an integral membrane protein. It is found so far only in the Proteobacteria [].
Probab=26.02 E-value=67 Score=22.24 Aligned_cols=15 Identities=40% Similarity=0.609 Sum_probs=13.1
Q ss_pred HHHHHHHHHHHHHHh
Q 033708 7 LQALIFLLAIIVRAA 21 (113)
Q Consensus 7 ~QaLSilLAmiLRAl 21 (113)
+.+||++||+++=++
T Consensus 2 lR~LSlilAl~la~~ 16 (82)
T PF09600_consen 2 LRALSLILALALAAC 16 (82)
T ss_pred hHHHHHHHHHHHHHH
Confidence 478999999999877
No 15
>PRK13664 hypothetical protein; Provisional
Probab=22.00 E-value=1.8e+02 Score=19.65 Aligned_cols=17 Identities=24% Similarity=0.776 Sum_probs=12.3
Q ss_pred hHHHHHHHHHHHHHHHh
Q 033708 5 WNLQALIFLLAIIVRAA 21 (113)
Q Consensus 5 ~~~QaLSilLAmiLRAl 21 (113)
|-+-+|.||++||+-++
T Consensus 8 WWilill~lvG~i~N~i 24 (62)
T PRK13664 8 WWILVLVFLVGVLLNVI 24 (62)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 44557889999987554
No 16
>PF06233 Usg: Usg-like family; InterPro: IPR009354 This is a family of bacterial proteins, referred to as Usg. Usg is found in the same operon as trpF, trpB, and trpA and is expressed in a coupled transcription-translation system [].
Probab=21.37 E-value=26 Score=24.78 Aligned_cols=19 Identities=37% Similarity=0.627 Sum_probs=15.0
Q ss_pred HHHHhhCCCCCCcccCCCc
Q 033708 74 RMREKYGLDTSEFTYNPSE 92 (113)
Q Consensus 74 RmREKyGLdts~ftynp~~ 92 (113)
||-+-|||.|.|+-|-=.|
T Consensus 1 rql~GYgLTTAeilYrmPD 19 (82)
T PF06233_consen 1 RQLKGYGLTTAEILYRMPD 19 (82)
T ss_pred CccccccccceeeeeeCCC
Confidence 5778899999999985433
No 17
>KOG3315 consensus Transport protein particle (TRAPP) complex subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=20.81 E-value=1.1e+02 Score=24.71 Aligned_cols=43 Identities=21% Similarity=0.298 Sum_probs=32.0
Q ss_pred chhHHHHHHHHHHHHHHHh-cCCCC-----CCCcccccccccccccCCCCCC
Q 033708 3 EPWNLQALIFLLAIIVRAA-NRPVE-----YDSDDELIAPRQQIRQPLINRP 48 (113)
Q Consensus 3 ~~~~~QaLSilLAmiLRAl-~~~~~-----YDSDDey~~~r~~~R~PLLn~q 48 (113)
|--|+..|.|+=.-+.+.+ |..++ =|.|++|.- ..+.||+|+=
T Consensus 82 EtkilgiL~fi~~tvwk~Lfgk~ad~LEka~d~~~tYmi---id~~pl~n~f 130 (191)
T KOG3315|consen 82 ETKILGILQFIHSTVWKYLFGKEADKLEKANDDDRTYMI---IDKEPLVNTF 130 (191)
T ss_pred HHHHHHHHHHHHHHHHHHHhcchHHHHHhcccccceEEE---EcCcchhhhc
Confidence 3457888899889999998 45553 566777743 6899999953
No 18
>PF04202 Mfp-3: Foot protein 3; InterPro: IPR007328 Mytilus foot protein-3 (Mfp-3) is a highly polymorphic protein family located in the byssal adhesive plaques of blue mussels.
Probab=20.74 E-value=1.3e+02 Score=20.84 Aligned_cols=12 Identities=25% Similarity=0.539 Sum_probs=8.7
Q ss_pred HHHHHHHHHHHh
Q 033708 10 LIFLLAIIVRAA 21 (113)
Q Consensus 10 LSilLAmiLRAl 21 (113)
+++|||+||=++
T Consensus 6 i~VLlaLvLIg~ 17 (71)
T PF04202_consen 6 IAVLLALVLIGS 17 (71)
T ss_pred HHHHHHHHHHhh
Confidence 467788888766
No 19
>PF03216 Rhabdo_ncap_2: Rhabdovirus nucleoprotein; InterPro: IPR004902 This is a family of Rhabdovirus nucleocapsid proteins. These proteins undergo phosphorylation.; GO: 0019013 viral nucleocapsid
Probab=20.51 E-value=77 Score=27.56 Aligned_cols=18 Identities=39% Similarity=0.499 Sum_probs=14.9
Q ss_pred hhHHHHHHHHHHHHHHHh
Q 033708 4 PWNLQALIFLLAIIVRAA 21 (113)
Q Consensus 4 ~~~~QaLSilLAmiLRAl 21 (113)
|--.||||+|||+|+---
T Consensus 56 pqT~~ALsvLlafV~~g~ 73 (357)
T PF03216_consen 56 PQTNQALSVLLAFVTQGT 73 (357)
T ss_pred chHHHHHHHHHHHHHcCC
Confidence 566899999999998643
Done!