Query psy5943
Match_columns 133
No_of_seqs 127 out of 429
Neff 5.3
Searched_HMMs 46136
Date Fri Aug 16 22:26:53 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5943.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5943hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1789|consensus 99.9 1.7E-25 3.7E-30 207.7 1.4 122 1-127 917-1039(2235)
2 PF14237 DUF4339: Domain of un 99.8 9.5E-20 2.1E-24 112.5 5.3 45 44-88 1-45 (45)
3 PF02213 GYF: GYF domain; Int 98.1 3.9E-06 8.5E-11 53.9 3.8 51 43-93 1-53 (57)
4 cd00072 GYF GYF domain: contai 98.0 8.1E-06 1.8E-10 52.8 4.1 49 43-91 2-52 (57)
5 smart00444 GYF Contains conser 97.5 0.00031 6.7E-09 45.3 5.0 50 44-93 2-52 (56)
6 KOG1862|consensus 90.5 0.38 8.2E-06 44.3 4.4 59 40-98 201-263 (673)
7 TIGR02675 tape_meas_nterm tape 47.8 18 0.00039 24.1 2.3 20 57-76 56-75 (75)
8 PF09851 SHOCT: Short C-termin 40.7 13 0.00028 20.9 0.6 18 59-76 5-22 (31)
9 COG3693 XynA Beta-1,4-xylanase 39.5 88 0.0019 27.2 5.7 71 42-130 66-141 (345)
10 PF10077 DUF2314: Uncharacteri 34.7 32 0.00069 25.4 2.1 26 41-66 92-117 (133)
11 PF13162 DUF3997: Protein of u 31.5 78 0.0017 23.2 3.7 29 44-72 74-104 (115)
12 COG4336 Uncharacterized conser 28.6 54 0.0012 27.2 2.6 56 59-114 14-74 (265)
No 1
>KOG1789|consensus
Probab=99.90 E-value=1.7e-25 Score=207.69 Aligned_cols=122 Identities=39% Similarity=0.669 Sum_probs=113.8
Q ss_pred CCCceeeeeehhhhhhccCccccccccceeecCCCCCCCCCCcceEEecCC-eeeCCCCHHHHHHHHHcCccCCCCeeec
Q psy5943 1 MVGINVLVDLLTLCHLHVSRATHVVQSNVLEAGSGPGLEDEEKEWYYGTSE-QSKGPVTFHQIKHLWSTGELNPKTKVWA 79 (133)
Q Consensus 1 ~~g~~~lvd~~~lah~~~~~~~~~~~~n~i~~~~~~~~~~~~~~Wyy~~~g-~q~GP~s~~eL~~l~~~G~I~~~TlVW~ 79 (133)
++|+.++||+.+++|+|++|+.+|.|||+|+++.+ |+ .+++|||.+.| ++.||++++-++.+|.+..|...|.||.
T Consensus 917 ~~~~~l~vdl~~~~h~~~~r~~i~~qsn~i~asa~-~~--~~~ew~y~dk~~~~vgp~~~~~~~sl~s~k~i~~~s~~~a 993 (2235)
T KOG1789|consen 917 SNILPLLVDLCVLAHLHVQRAKVQNQTNVIEASAE-QM--AEEEWYYHDKDAKQVGPLSFEKMKSLYTEKTIFEKSQIWA 993 (2235)
T ss_pred ecchHHHHHHHHHHHHhhhccCCccchhHHHhhhh-hc--CchhheeecCCccccCchhHHHHHHHhcccchhHHHHHHH
Confidence 47899999999999999999999999999999998 43 67899999875 8899999999999999999999999999
Q ss_pred CCcccceecCCcchHHHHHhhcCCCCCCCcchHHHHHHHHHHHHHHhh
Q psy5943 80 HGMESWKSLHQVPQLKWTLVAKNSGGVMNETELGCLILSMLTKVNLEN 127 (133)
Q Consensus 80 eGm~~W~p~~~v~eL~~~l~~~~pPP~l~~~~~~~~~L~~L~~~~~~~ 127 (133)
-||++|+.+++|+||+|..... -|.|++++++.+|||||+.||++.
T Consensus 994 ~gm~~w~~l~~i~~~rw~v~~~--ipv~~~s~~~~~~l~~L~~Mc~~f 1039 (2235)
T KOG1789|consen 994 AGMDKWMSLAAVPQFRWTVCQQ--IPVMNFTDLSVLCLDTLLQMCEFF 1039 (2235)
T ss_pred hhhhHHHhhhhhhhhhhhhhhc--ccccCHHHHHHHHHHHHHHHHhhC
Confidence 9999999999999999998433 489999999999999999999975
No 2
>PF14237 DUF4339: Domain of unknown function (DUF4339)
Probab=99.80 E-value=9.5e-20 Score=112.50 Aligned_cols=45 Identities=44% Similarity=1.076 Sum_probs=44.0
Q ss_pred ceEEecCCeeeCCCCHHHHHHHHHcCccCCCCeeecCCcccceec
Q psy5943 44 EWYYGTSEQSKGPVTFHQIKHLWSTGELNPKTKVWAHGMESWKSL 88 (133)
Q Consensus 44 ~Wyy~~~g~q~GP~s~~eL~~l~~~G~I~~~TlVW~eGm~~W~p~ 88 (133)
+|||.++|++.||||++||++++++|+|+++|+||++||++|+|+
T Consensus 1 ~Wy~~~~g~~~GP~s~~el~~l~~~g~i~~~tlvw~~g~~~W~pl 45 (45)
T PF14237_consen 1 EWYYARNGQQQGPFSLEELRQLISSGEIDPDTLVWKEGMSDWKPL 45 (45)
T ss_pred CEEEeCCCeEECCcCHHHHHHHHHcCCCCCCCeEeCCChhhceEC
Confidence 699999999999999999999999999999999999999999986
No 3
>PF02213 GYF: GYF domain; InterPro: IPR003169 The glycine-tyrosine-phenylalanine (GYF) domain is an approximately 60-amino-acid domain which contains a conserved GP[YF]xxxx[MV]xxWxxx[GN]YF motif. It was identified in the human intracellular protein termed CD2 binding protein 2 (CD2BP2), which binds to a site containing two tandem PPPGHR segments within the cytoplasmic region of CD2. Binding experiments and mutational analyses have demonstrated the critical importance of the GYF tripeptide in ligand binding. A GYF domain is also found in several other eukaryotic proteins of unknown function. It has been proposed that the GYF domain found in these proteins could also be involved in proline-rich sequence recognition. Resolution of the structure of the CD2BP2 GYF domain by NMR spectroscopy revealed a compact domain with a beta-beta-alpha-beta-beta topology, where the single alpha-helix is tilted away from the twisted, anti-parallel beta-sheet. The conserved residues of the GYF domain create a contiguous patch of predominantly hydrophobic nature which forms an integral part of the ligand-binding site. There is limited homology within the C-terminal 20-30 amino acids of various GYF domains, supporting the idea that this part of the domain is structurally but not functionally important.; GO: 0005515 protein binding; PDB: 1SYX_F 1L2Z_A 1GYF_A 1WH2_A 3FMA_C 3K3V_A.
Probab=98.10 E-value=3.9e-06 Score=53.89 Aligned_cols=51 Identities=18% Similarity=0.415 Sum_probs=39.9
Q ss_pred cceEEecC-CeeeCCCCHHHHHHHHHcCccCCCCeeecCCcccc-eecCCcch
Q psy5943 43 KEWYYGTS-EQSKGPVTFHQIKHLWSTGELNPKTKVWAHGMESW-KSLHQVPQ 93 (133)
Q Consensus 43 ~~Wyy~~~-g~q~GP~s~~eL~~l~~~G~I~~~TlVW~eGm~~W-~p~~~v~e 93 (133)
+.|+|.+. |+.+|||+.++|+..+++|-.+.+..|++.+-+++ .+...+.+
T Consensus 1 ~~W~Y~d~~g~~qGPf~~~~M~~W~~~gyF~~~l~vr~~~~~~~~~~~~~~~~ 53 (57)
T PF02213_consen 1 KMWYYKDPDGNIQGPFSSEQMQAWYKQGYFPDDLQVRRVDDTQFIDPFGSIDR 53 (57)
T ss_dssp -EEEEESTTS-EEEEEEHHHHHHHHHTTSSTTT-EEEETTSTTT--SSCECCG
T ss_pred CEeEEECCCCCcCCCcCHHHHHHHHHCCCCCCCcEEEEecCCCCcccchhhhh
Confidence 47999976 69999999999999999999999999999976555 55555443
No 4
>cd00072 GYF GYF domain: contains conserved Gly-Tyr-Phe residues; Proline-binding domain in CD2-binding and other proteins. Involved in signaling lymphocyte activity. Also present in other unrelated proteins (mainly unknown) derived from diverse eukaryotic species.
Probab=98.02 E-value=8.1e-06 Score=52.85 Aligned_cols=49 Identities=16% Similarity=0.392 Sum_probs=43.9
Q ss_pred cceEEecC-CeeeCCCCHHHHHHHHHcCccCCCCeeecC-CcccceecCCc
Q psy5943 43 KEWYYGTS-EQSKGPVTFHQIKHLWSTGELNPKTKVWAH-GMESWKSLHQV 91 (133)
Q Consensus 43 ~~Wyy~~~-g~q~GP~s~~eL~~l~~~G~I~~~TlVW~e-Gm~~W~p~~~v 91 (133)
..|+|.+. |+.+|||+.++|+..+++|-.+.+-.|++. ....|.++.++
T Consensus 2 ~~W~Y~d~~g~vqGPF~~~~M~~W~~~gyF~~~l~vr~~~~~~~f~~l~~~ 52 (57)
T cd00072 2 VQWFYKDPQGEIQGPFSASQMLQWYQAGYFPDGLQVRRLDNGGEFYTLGDI 52 (57)
T ss_pred cEEEEECCCCCCcCCcCHHHHHHHHHCCCCCCCeEEEECCCCCCcEEHHHH
Confidence 57999975 788999999999999999999999999999 55789988775
No 5
>smart00444 GYF Contains conserved Gly-Tyr-Phe residues. Proline-binding domain in CD2-binding protein.
Probab=97.46 E-value=0.00031 Score=45.27 Aligned_cols=50 Identities=16% Similarity=0.377 Sum_probs=44.0
Q ss_pred ceEEecC-CeeeCCCCHHHHHHHHHcCccCCCCeeecCCcccceecCCcch
Q psy5943 44 EWYYGTS-EQSKGPVTFHQIKHLWSTGELNPKTKVWAHGMESWKSLHQVPQ 93 (133)
Q Consensus 44 ~Wyy~~~-g~q~GP~s~~eL~~l~~~G~I~~~TlVW~eGm~~W~p~~~v~e 93 (133)
.|+|.+. |+.+||||-++|+..+++|-.+.+-.|++.+-++..++..+..
T Consensus 2 ~W~Y~d~~~~iqGPf~~~~M~~W~~~gyF~~~l~vr~~~~~~~~~l~~~~~ 52 (56)
T smart00444 2 LWLYKDPDGEIQGPFTASQMSQWYQAGYFPDSLQIKRLNEPPYDTLGDLDR 52 (56)
T ss_pred EEEEECCCCCEeCCcCHHHHHHHHHCCCCCCCeEEEEcCCCCCCcchhhhh
Confidence 6999975 6889999999999999999999999999999887777766543
No 6
>KOG1862|consensus
Probab=90.49 E-value=0.38 Score=44.25 Aligned_cols=59 Identities=25% Similarity=0.384 Sum_probs=51.3
Q ss_pred CCCcceEEecC-CeeeCCCCHHHHHHHHHcCccCCCCeeecCCccc---ceecCCcchHHHHH
Q psy5943 40 DEEKEWYYGTS-EQSKGPVTFHQIKHLWSTGELNPKTKVWAHGMES---WKSLHQVPQLKWTL 98 (133)
Q Consensus 40 ~~~~~Wyy~~~-g~q~GP~s~~eL~~l~~~G~I~~~TlVW~eGm~~---W~p~~~v~eL~~~l 98 (133)
+.+..|||.+. |+.+|||+..++...+..|....+..||...=.. ...++.+.++....
T Consensus 201 ~~d~~~~Y~DP~g~iqGPf~~~~v~~W~~~GyF~~~l~vr~~e~~~~~~f~tl~~~~~~l~~~ 263 (673)
T KOG1862|consen 201 DEELSWLYKDPQGQIQGPFSASDVLQWYEAGYFPDDLQVRLGENPERSIFQTLGEVMQLLKTR 263 (673)
T ss_pred CcceeEEeeCCCCcccCCchHHHHHHHHhcCccCCCceeeeccCCccccceehhhhhhhcccc
Confidence 67889999986 7999999999999999999999998888888777 88888887776655
No 7
>TIGR02675 tape_meas_nterm tape measure domain. Proteins containing this domain are strictly bacterial, including bacteriophage and prophage regions of bacterial genomes. Most members are 800 to 1800 amino acids long, making them among the longest predicted proteins of their respective phage genomes, where they are encoded in tail protein regions. This roughly 80-residue domain described here usually begins between residue 100 and 250. Many members are known or predicted to act as phage tail tape measure proteins, a minor tail component that regulates tail length.
Probab=47.84 E-value=18 Score=24.15 Aligned_cols=20 Identities=25% Similarity=0.408 Sum_probs=17.8
Q ss_pred CCHHHHHHHHHcCccCCCCe
Q psy5943 57 VTFHQIKHLWSTGELNPKTK 76 (133)
Q Consensus 57 ~s~~eL~~l~~~G~I~~~Tl 76 (133)
+|..+|+++..+|+|+.+.+
T Consensus 56 ~t~~~l~~~~~~Gkit~~~~ 75 (75)
T TIGR02675 56 VTRGELRKMLSDGKLTADVI 75 (75)
T ss_pred CCHHHHHHHHHCCCCccccC
Confidence 88999999999999998753
No 8
>PF09851 SHOCT: Short C-terminal domain; InterPro: IPR018649 This family of hypothetical prokaryotic proteins has no known function.
Probab=40.71 E-value=13 Score=20.90 Aligned_cols=18 Identities=22% Similarity=0.582 Sum_probs=14.6
Q ss_pred HHHHHHHHHcCccCCCCe
Q psy5943 59 FHQIKHLWSTGELNPKTK 76 (133)
Q Consensus 59 ~~eL~~l~~~G~I~~~Tl 76 (133)
+..|++++.+|.|+.+.|
T Consensus 5 L~~L~~l~~~G~IseeEy 22 (31)
T PF09851_consen 5 LEKLKELYDKGEISEEEY 22 (31)
T ss_pred HHHHHHHHHcCCCCHHHH
Confidence 578899999999987654
No 9
>COG3693 XynA Beta-1,4-xylanase [Carbohydrate transport and metabolism]
Probab=39.54 E-value=88 Score=27.21 Aligned_cols=71 Identities=17% Similarity=0.259 Sum_probs=45.9
Q ss_pred CcceEEecCCeeeCCCC---HHHHHHHHHc--CccCCCCeeecCCcccceecCCcchHHHHHhhcCCCCCCCcchHHHHH
Q psy5943 42 EKEWYYGTSEQSKGPVT---FHQIKHLWST--GELNPKTKVWAHGMESWKSLHQVPQLKWTLVAKNSGGVMNETELGCLI 116 (133)
Q Consensus 42 ~~~Wyy~~~g~q~GP~s---~~eL~~l~~~--G~I~~~TlVW~eGm~~W~p~~~v~eL~~~l~~~~pPP~l~~~~~~~~~ 116 (133)
+-.|.+... +.|=|. .+.+.++.++ -.+...||||..-.++|.+..+. ++..+...+
T Consensus 66 emKwe~i~p--~~G~f~Fe~AD~ia~FAr~h~m~lhGHtLvW~~q~P~W~~~~e~----------------~~~~~~~~~ 127 (345)
T COG3693 66 EMKWEAIEP--ERGRFNFEAADAIANFARKHNMPLHGHTLVWHSQVPDWLFGDEL----------------SKEALAKMV 127 (345)
T ss_pred ccccccccC--CCCccCccchHHHHHHHHHcCCeeccceeeecccCCchhhcccc----------------ChHHHHHHH
Confidence 345777654 455553 3445555544 45688999999999999998772 223333344
Q ss_pred HHHHHHHHHhhhhh
Q psy5943 117 LSMLTKVNLENKNT 130 (133)
Q Consensus 117 L~~L~~~~~~~~~~ 130 (133)
=+=+..+|++|++.
T Consensus 128 e~hI~tV~~rYkg~ 141 (345)
T COG3693 128 EEHIKTVVGRYKGS 141 (345)
T ss_pred HHHHHHHHHhccCc
Confidence 45577788888863
No 10
>PF10077 DUF2314: Uncharacterized protein conserved in bacteria (DUF2314); InterPro: IPR018756 This domain of unknown function is found in various bacterial hypothetical proteins, as well as putative ankyrin repeat proteins.
Probab=34.75 E-value=32 Score=25.44 Aligned_cols=26 Identities=12% Similarity=0.286 Sum_probs=22.0
Q ss_pred CCcceEEecCCeeeCCCCHHHHHHHH
Q psy5943 41 EEKEWYYGTSEQSKGPVTFHQIKHLW 66 (133)
Q Consensus 41 ~~~~Wyy~~~g~q~GP~s~~eL~~l~ 66 (133)
.-..|-|..+|...|++|+..|+.-.
T Consensus 92 ~IsDWm~~~~g~~~G~~Ti~~~~~~m 117 (133)
T PF10077_consen 92 DISDWMIYEDGRTYGGYTIRAMRSRM 117 (133)
T ss_pred HeeEeEEEECCceEccHHHHHHHhhC
Confidence 45679999999999999999998543
No 11
>PF13162 DUF3997: Protein of unknown function (DUF3997)
Probab=31.48 E-value=78 Score=23.22 Aligned_cols=29 Identities=14% Similarity=0.275 Sum_probs=22.1
Q ss_pred ceEEec--CCeeeCCCCHHHHHHHHHcCccC
Q psy5943 44 EWYYGT--SEQSKGPVTFHQIKHLWSTGELN 72 (133)
Q Consensus 44 ~Wyy~~--~g~q~GP~s~~eL~~l~~~G~I~ 72 (133)
.||+.+ ++...|||+.++.+..-++-.|+
T Consensus 74 ~Y~IId~~~~~v~GP~~k~~F~~k~k~l~I~ 104 (115)
T PF13162_consen 74 EYWIIDKKNDEVYGPFSKEQFQEKRKELNIS 104 (115)
T ss_pred eEEEEEcCCCcEECCCCHHHHHHHHHhcCCC
Confidence 366554 35889999999999887776665
No 12
>COG4336 Uncharacterized conserved protein [Function unknown]
Probab=28.58 E-value=54 Score=27.24 Aligned_cols=56 Identities=16% Similarity=0.184 Sum_probs=36.6
Q ss_pred HHHHHHHHHcCccCCCCeeecCCcccceecCCcchHHHHHh-----hcCCCCCCCcchHHH
Q psy5943 59 FHQIKHLWSTGELNPKTKVWAHGMESWKSLHQVPQLKWTLV-----AKNSGGVMNETELGC 114 (133)
Q Consensus 59 ~~eL~~l~~~G~I~~~TlVW~eGm~~W~p~~~v~eL~~~l~-----~~~pPP~l~~~~~~~ 114 (133)
-.|.|+++++|.++.-|-=|.+|+.+=.-+.-=.+++.-|. +..|=|+|-.++.+.
T Consensus 14 p~~aR~liR~g~~~~pTsG~~~g~~QANlvvlp~d~a~dFllfcqrNpkpCPlLdvte~Gs 74 (265)
T COG4336 14 PVEARQLIRNGLLTGPTSGWSEGYAQANLVVLPKDWADDFLLFCQRNPKPCPLLDVTEPGS 74 (265)
T ss_pred CHHHHHHHhcCccccCCcccccccccceEEEecHHHHHHHHHHHhcCCCCCCcccccCCCC
Confidence 35889999999999999999999754332222223333221 445667777665543
Done!