Query psy15675
Match_columns 202
No_of_seqs 110 out of 243
Neff 4.8
Searched_HMMs 46136
Date Fri Aug 16 18:41:04 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy15675.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15675hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3195|consensus 100.0 3E-67 6.6E-72 442.3 15.2 178 13-192 5-187 (213)
2 PF05832 DUF846: Eukaryotic pr 100.0 2.8E-60 6.2E-65 384.7 15.1 142 34-177 1-142 (142)
3 PF02229 PC4: Transcriptional 63.6 7.9 0.00017 26.6 2.7 25 83-108 12-36 (56)
4 PF07127 Nodulin_late: Late no 49.3 17 0.00036 24.7 2.4 30 152-181 5-34 (54)
5 PF04971 Lysis_S: Lysis protei 38.0 23 0.00051 25.9 1.8 24 161-186 43-66 (68)
6 KOG1693|consensus 36.8 21 0.00045 31.3 1.6 35 110-144 156-190 (209)
7 PF08043 Xin: Xin repeat; Int 24.6 62 0.0013 17.5 1.5 11 105-115 4-14 (16)
8 PF13127 DUF3955: Protein of u 23.1 67 0.0014 22.8 2.0 26 74-108 10-35 (63)
9 PF10674 Ycf54: Protein of unk 22.6 27 0.00058 27.0 -0.1 11 78-88 36-46 (93)
10 PF09685 Tic20: Tic20-like pro 21.7 90 0.002 22.8 2.6 28 157-184 74-101 (109)
11 PF07330 DUF1467: Protein of u 21.3 2.2E+02 0.0047 21.5 4.5 34 117-150 40-73 (85)
12 PF09951 DUF2185: Protein of u 20.8 67 0.0015 24.3 1.7 34 83-116 3-37 (89)
No 1
>KOG3195|consensus
Probab=100.00 E-value=3e-67 Score=442.32 Aligned_cols=178 Identities=50% Similarity=0.956 Sum_probs=167.1
Q ss_pred cccccCCCCCCCCccc----cc-cCCCCCChHHHHHHHHHHHHHHHHHhhcccccCCchhHHHHHHHHHHhhhhhheecc
Q psy15675 13 ENTLVDDDTTPFGEEE----LS-RRGTLVHPYVTFFHLFFRSMAILSYMFTGLFSSSGFIASFVFIVLLLSMDFWTVKNI 87 (202)
Q Consensus 13 ~~~~~~~d~~~~~~~~----~~-~~~~~~HP~~~~fhl~fK~~ai~~Yl~~~~f~~~~fv~~Fvi~ilLla~DFW~vKNI 87 (202)
..++.+||+++++.|| +. ..+|++||++++||++||.+||++|++|.+|+++ |+.+||++++|+|+|||+||||
T Consensus 5 ~~~~~~~~v~~~d~~~~~~~~~r~~qk~~hP~~~fFHL~Fr~aAi~~Yil~~~f~nS-Fi~~fv~ivlLLs~DFW~VKNv 83 (213)
T KOG3195|consen 5 DSNLDVDNVSKFDAEEEITDPPRNFQKSSHPVALFFHLFFRGAAIVFYILCELFSNS-FITQFVVIVLLLSCDFWTVKNV 83 (213)
T ss_pred ccCcchhccccCChhhhhhchhhhhhhccCchHHHHHHHhhhHHHHHHHHHHHHHhh-hHHHHHHHHHHHhhhhheeecc
Confidence 4578888999999766 22 3378999999999999999999999999999999 9999999999999999999999
Q ss_pred ccceeeeeeeeeeeCCCCceEEEEEeccCCCCCCCCccchhhhhhhhHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHh
Q psy15675 88 SGRLMVGLRWWNYVDDQGKSHWVFESRKGDLQNRVNGTEARIFWTSLIVCPVMWALFLLTALFGFNFKWMLLVWIALTLN 167 (202)
Q Consensus 88 tGR~LVGLRWWnevde~G~s~W~FES~~~~~~~~~n~~ds~~FW~~L~~~p~~W~i~~i~~l~~l~~~wl~l~ii~l~L~ 167 (202)
|||+||||||||++|+||+|||+|||+|++.+ .+|++|||+||++||++|++|+++++.+++++++.||+++++|++|+
T Consensus 84 tGRlLVGLRWWN~id~dG~ShW~FES~k~~s~-~~n~~dsriFWlgL~~~pv~W~if~v~al~~fk~~wL~lv~vg~~l~ 162 (213)
T KOG3195|consen 84 TGRLLVGLRWWNEIDVDGESHWVFESAKQSSP-NVNAIDSRIFWLGLYLCPVIWIIFAVFALFRFKFKWLILVVVGIALN 162 (213)
T ss_pred cccEEEeeeeeeecCCCCceeEEEeccccccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 99999999999999999999999999997554 49999999999999999999999999999999999999999999999
Q ss_pred hhhhheeeeeCCCCccchhhhhhhh
Q psy15675 168 FANLYGYMKCKMGGNSDKSLMSTVS 192 (202)
Q Consensus 168 ~~Nl~gy~kC~~~~k~~~~~~a~~~ 192 (202)
++|+|||.||+++++++.++.+++.
T Consensus 163 ~aN~~Gy~rC~~~a~~~~~q~~~ds 187 (213)
T KOG3195|consen 163 SANLYGYSRCDKDAKKKFQQVASDS 187 (213)
T ss_pred hhhhheeeecCCccchhHHHHHHHH
Confidence 9999999999999999999988765
No 2
>PF05832 DUF846: Eukaryotic protein of unknown function (DUF846); InterPro: IPR008564 This family consists of a number of conserved eukaryotic proteins of unknown function.; GO: 0016021 integral to membrane
Probab=100.00 E-value=2.8e-60 Score=384.72 Aligned_cols=142 Identities=58% Similarity=1.104 Sum_probs=137.8
Q ss_pred CCCChHHHHHHHHHHHHHHHHHhhcccccCCchhHHHHHHHHHHhhhhhheeccccceeeeeeeeeeeCCCCceEEEEEe
Q psy15675 34 TLVHPYVTFFHLFFRSMAILSYMFTGLFSSSGFIASFVFIVLLLSMDFWTVKNISGRLMVGLRWWNYVDDQGKSHWVFES 113 (202)
Q Consensus 34 ~~~HP~~~~fhl~fK~~ai~~Yl~~~~f~~~~fv~~Fvi~ilLla~DFW~vKNItGR~LVGLRWWnevde~G~s~W~FES 113 (202)
|++||++|++|++||++|+++|++|++|++| ++.+|+++++|+|+|||+|||||||+|||||||||+||||+++|+|||
T Consensus 1 ~s~hP~~~~fhl~fK~~ai~~y~~~~~f~~~-fi~~fv~~illla~DFw~vKNitGR~LVGLRWwn~~~~~g~s~w~FEs 79 (142)
T PF05832_consen 1 KSSHPVACFFHLFFKIAAILVYLFGGLFSDS-FIIIFVITILLLAIDFWTVKNITGRLLVGLRWWNEVDEDGESHWVFES 79 (142)
T ss_pred CCCchHHHHHHHHHHHHHHHHHHHHHHhhcc-hHHHHHHHHHHHHHHHHHHhhhccceEEeeeeEEEeCCCCCeEEEEEE
Confidence 6899999999999999999999999999888 999999999999999999999999999999999999999999999999
Q ss_pred ccCCCCCCCCccchhhhhhhhHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHhhhhhheeeee
Q psy15675 114 RKGDLQNRVNGTEARIFWTSLIVCPVMWALFLLTALFGFNFKWMLLVWIALTLNFANLYGYMKC 177 (202)
Q Consensus 114 ~~~~~~~~~n~~ds~~FW~~L~~~p~~W~i~~i~~l~~l~~~wl~l~ii~l~L~~~Nl~gy~kC 177 (202)
+++ +.+++|+.|||+||++||++|++|+++++.+++++++.|++++++|++++++|+|||+||
T Consensus 80 ~~~-~~~~~n~~ds~~FW~~l~~~~~~W~~~~i~~l~~l~~~~l~l~~i~~~l~~~Nl~gf~kC 142 (142)
T PF05832_consen 80 RDQ-PRFVPNPIDSRIFWLTLYATPLIWIILLIIALISLQFFWLLLVIIALILNGANLYGFYKC 142 (142)
T ss_pred cCC-ccCcCCHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC
Confidence 985 345679999999999999999999999999999999999999999999999999999999
No 3
>PF02229 PC4: Transcriptional Coactivator p15 (PC4); InterPro: IPR003173 p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain []. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15 domain. Activity is controlled by protein kinases that target the regulatory domain.; GO: 0003677 DNA binding, 0003713 transcription coactivator activity, 0006355 regulation of transcription, DNA-dependent; PDB: 3PM7_B 2LTD_A 2LTT_B 3OBH_B 2L3A_B 2PHE_B 1PCF_B 2C62_B.
Probab=63.56 E-value=7.9 Score=26.63 Aligned_cols=25 Identities=24% Similarity=0.502 Sum_probs=20.1
Q ss_pred heeccccceeeeeeeeeeeCCCCceE
Q psy15675 83 TVKNISGRLMVGLRWWNYVDDQGKSH 108 (202)
Q Consensus 83 ~vKNItGR~LVGLRWWnevde~G~s~ 108 (202)
++++--|+.+|.+|-|.+ ++||+..
T Consensus 12 ~v~~fkG~~~vdIRe~y~-~~~g~~~ 36 (56)
T PF02229_consen 12 SVSEFKGKPYVDIREWYE-KKDGEWK 36 (56)
T ss_dssp EEEEETTSEEEEEEEEET-TSSS-EE
T ss_pred EEEEeCCeEEEEEEeeEE-cCCCcCc
Confidence 467788999999999999 7788644
No 4
>PF07127 Nodulin_late: Late nodulin protein; InterPro: IPR009810 This family consists of several plant specific late nodulin sequences which are homologous to the Pisum sativum (Garden pea) ENOD3 protein. ENOD3 is expressed in the late stages of root nodule formation and contains two pairs of cysteine residues toward the proteins C terminus which may be involved in metal-binding [].; GO: 0046872 metal ion binding, 0009878 nodule morphogenesis
Probab=49.27 E-value=17 Score=24.69 Aligned_cols=30 Identities=20% Similarity=0.568 Sum_probs=26.3
Q ss_pred hhhHHHHHHHHHHHHhhhhhheeeeeCCCC
Q psy15675 152 FNFKWMLLVWIALTLNFANLYGYMKCKMGG 181 (202)
Q Consensus 152 l~~~wl~l~ii~l~L~~~Nl~gy~kC~~~~ 181 (202)
+++.|.++..+.+.+..+|...+..|..|+
T Consensus 5 lKFvY~mIiflslflv~~~~~~~~~C~~d~ 34 (54)
T PF07127_consen 5 LKFVYAMIIFLSLFLVVTNVDAIIPCKTDS 34 (54)
T ss_pred hhhHHHHHHHHHHHHhhcccCCCcccCccc
Confidence 567888999999999999999999998775
No 5
>PF04971 Lysis_S: Lysis protein S ; InterPro: IPR007054 The lysis S protein is a cytotoxic protein forming holes in membranes causing cell lysis. The action of Lysis S is independent of the proportion of acidic phospholipids in the membrane [].
Probab=38.00 E-value=23 Score=25.92 Aligned_cols=24 Identities=25% Similarity=0.374 Sum_probs=16.6
Q ss_pred HHHHHHhhhhhheeeeeCCCCccchh
Q psy15675 161 WIALTLNFANLYGYMKCKMGGNSDKS 186 (202)
Q Consensus 161 ii~l~L~~~Nl~gy~kC~~~~k~~~~ 186 (202)
++++.--.+|+ |.||+++.++.++
T Consensus 43 ~~~~lt~ltN~--YFK~k~drr~~a~ 66 (68)
T PF04971_consen 43 FFGLLTYLTNL--YFKIKEDRRKAAR 66 (68)
T ss_pred HHHHHHHHhHh--hhhhhHhhhHhhc
Confidence 34555556777 6899999887654
No 6
>KOG1693|consensus
Probab=36.83 E-value=21 Score=31.31 Aligned_cols=35 Identities=20% Similarity=0.341 Sum_probs=26.7
Q ss_pred EEEeccCCCCCCCCccchhhhhhhhHHHHHHHHHH
Q psy15675 110 VFESRKGDLQNRVNGTEARIFWTSLIVCPVMWALF 144 (202)
Q Consensus 110 ~FES~~~~~~~~~n~~ds~~FW~~L~~~p~~W~i~ 144 (202)
+|.+|+..+..+++..++|+.|++++...++=++-
T Consensus 156 y~R~RE~rn~~tv~st~~Rv~~~Sl~e~~~vv~iS 190 (209)
T KOG1693|consen 156 YYRLREARNRSTVESTNSRVTWWSLLEIIAVVVIS 190 (209)
T ss_pred HHHHHHhcCccchhcccchhhHHHHHHHHHHHHHH
Confidence 35566666677899999999999999877665443
No 7
>PF08043 Xin: Xin repeat; InterPro: IPR012510 The repeat has the consensus sequence GDV(K/Q/R)(T/S/G)X(R/K/T) WLFETXPLD. This repeat motif is typically found in the N terminus of the proteins, with a copy number between 2 and 28 repeats. Direct evidence for binding to and stabilising F-actin has been found in the human protein (Q702N9 from SWISSPROT) []. The homologues in mouse and chicken localise in the adherens junction complex of the intercalated disc in cardiac muscle and in the myotendon junction of skeletal muscle. mXin may co-localise with Vinculin which is known to attach the actin to the cytoplasmic membrane []. It has been shown that the amino-terminus of human xin (CMYA1) binds the EVH1 domain of Mena/VASP/EVL, and the carboxy-terminus binds the, for the filamin family unique, domain 20 of filamin C []. This confirms the proposed role of xin repeat containing proteins as F-actin-binding adapter proteins.; GO: 0003779 actin binding, 0030036 actin cytoskeleton organization, 0030054 cell junction
Probab=24.57 E-value=62 Score=17.48 Aligned_cols=11 Identities=27% Similarity=0.912 Sum_probs=7.9
Q ss_pred CceEEEEEecc
Q psy15675 105 GKSHWVFESRK 115 (202)
Q Consensus 105 G~s~W~FES~~ 115 (202)
.+..|.||+.+
T Consensus 4 ~~~~wlFEtqp 14 (16)
T PF08043_consen 4 KTTRWLFETQP 14 (16)
T ss_pred ceeEEEeeccc
Confidence 34689999753
No 8
>PF13127 DUF3955: Protein of unknown function (DUF3955)
Probab=23.12 E-value=67 Score=22.82 Aligned_cols=26 Identities=42% Similarity=0.652 Sum_probs=20.5
Q ss_pred HHHHhhhhhheeccccceeeeeeeeeeeCCCCceE
Q psy15675 74 VLLLSMDFWTVKNISGRLMVGLRWWNYVDDQGKSH 108 (202)
Q Consensus 74 ilLla~DFW~vKNItGR~LVGLRWWnevde~G~s~ 108 (202)
.+++.+=.+..+|..| .+|||||--|
T Consensus 10 ~~llg~~~l~i~~~~~---------syVd~~G~L~ 35 (63)
T PF13127_consen 10 LLLLGVVCLFIFNIIG---------SYVDEDGVLH 35 (63)
T ss_pred HHHHHHHHHHHHhccc---------ceECCCCeEe
Confidence 3456777899999987 7899999654
No 9
>PF10674 Ycf54: Protein of unknown function (DUF2488); InterPro: IPR019616 This entry represents proteins annotated as Ycf54. It is found encoded in the chloroplast genomes of algae, it is also found in plants and in the cyanobacteria. The function is unknown. As the family is exclusively found in phototrophic organisms it may play a role in photosynthesis.; PDB: 3JSR_A 3HZE_D.
Probab=22.60 E-value=27 Score=27.05 Aligned_cols=11 Identities=45% Similarity=1.008 Sum_probs=7.1
Q ss_pred hhhhhheeccc
Q psy15675 78 SMDFWTVKNIS 88 (202)
Q Consensus 78 a~DFW~vKNIt 88 (202)
.+|||.|+|..
T Consensus 36 ~iDFWlv~~Pa 46 (93)
T PF10674_consen 36 EIDFWLVKEPA 46 (93)
T ss_dssp ---EEEEESBG
T ss_pred CccEEEecChh
Confidence 57999999984
No 10
>PF09685 Tic20: Tic20-like protein; InterPro: IPR019109 This entry represents a group of uncharacterised conserved proteins including a chloroplast protein import component called Tic20. Chloroplast function requires the import of nuclear encoded proteins from the cytoplasm across the chloroplast double membrane. This is accomplished by two protein complexes, the Toc complex located at the outer membrane and the Tic complex located at the inner membrane. The Toc complex recognises specific proteins by a cleavable N-terminal sequence and is primarily responsible for translocation through the outer membrane, while the Tic complex translocates the protein through the inner membrane. Tic20 is a core member of the Tic complex and is deeply embedded in the inner envelope membrane. It is thought to function as a protein conducting component of the Tic complex [].
Probab=21.65 E-value=90 Score=22.84 Aligned_cols=28 Identities=21% Similarity=0.272 Sum_probs=17.2
Q ss_pred HHHHHHHHHHhhhhhheeeeeCCCCccc
Q psy15675 157 MLLVWIALTLNFANLYGYMKCKMGGNSD 184 (202)
Q Consensus 157 l~l~ii~l~L~~~Nl~gy~kC~~~~k~~ 184 (202)
++..++.+..-...++|-.|+.++...+
T Consensus 74 ~~~~~~~l~~~v~~I~~~~~a~~g~~~~ 101 (109)
T PF09685_consen 74 LLLIVLWLLSLVLSIIGAIKANKGEPYR 101 (109)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCeee
Confidence 3334444455666777888887776544
No 11
>PF07330 DUF1467: Protein of unknown function (DUF1467); InterPro: IPR009935 This family consists of several bacterial proteins of around 90 residues in length. The function of this family is unknown.
Probab=21.34 E-value=2.2e+02 Score=21.49 Aligned_cols=34 Identities=21% Similarity=0.244 Sum_probs=27.6
Q ss_pred CCCCCCCccchhhhhhhhHHHHHHHHHHHHHHHh
Q psy15675 117 DLQNRVNGTEARIFWTSLIVCPVMWALFLLTALF 150 (202)
Q Consensus 117 ~~~~~~n~~ds~~FW~~L~~~p~~W~i~~i~~l~ 150 (202)
++..+.|+.=.|-++.+.+++.++|.+.......
T Consensus 40 ~~sAP~~~~l~rk~~~TTiiaavi~~~~~~~~~~ 73 (85)
T PF07330_consen 40 DPSAPANPRLKRKALITTIIAAVIFAIIYLIIVS 73 (85)
T ss_pred CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3457888889999999999999999988765543
No 12
>PF09951 DUF2185: Protein of unknown function (DUF2185); InterPro: IPR018689 This domain has no known function.
Probab=20.82 E-value=67 Score=24.30 Aligned_cols=34 Identities=26% Similarity=0.237 Sum_probs=25.0
Q ss_pred heeccc-cceeeeeeeeeeeCCCCceEEEEEeccC
Q psy15675 83 TVKNIS-GRLMVGLRWWNYVDDQGKSHWVFESRKG 116 (202)
Q Consensus 83 ~vKNIt-GR~LVGLRWWnevde~G~s~W~FES~~~ 116 (202)
++|||. ++.-||--+--+...+..|-|+|-|-.+
T Consensus 3 vS~~i~~~~~~v~~~yRE~p~~~~DSGWrffsg~E 37 (89)
T PF09951_consen 3 VSKNILEEGEPVGYMYREEPEFPNDSGWRFFSGDE 37 (89)
T ss_pred EehHHhhCCCceEEEEecCCCCCCCCceEEEecCC
Confidence 478887 7777776665555556678899999765
Done!