Query 537021.9.peg.582_1 Match_columns 37 No_of_seqs 1 out of 3 Neff 1.0 Searched_HMMs 39220 Date Tue May 24 21:34:52 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_582.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam12109 CXCR4_N CXCR4 Chemok 48.4 5.4 0.00014 21.2 0.2 11 13-23 23-33 (33) 2 pfam04318 DUF468 Protein of un 38.6 7.9 0.0002 20.5 -0.2 17 18-34 21-37 (84) 3 pfam10876 DUF2669 Protein of u 20.4 24 0.0006 18.2 -0.4 10 17-26 88-97 (133) 4 pfam11869 DUF3389 Protein of u 17.1 60 0.0015 16.3 1.0 11 24-34 2-12 (75) 5 pfam07818 HCNGP HCNGP-like pro 14.9 75 0.0019 15.8 1.1 27 3-29 12-38 (96) 6 pfam11922 DUF3440 Domain of un 14.8 51 0.0013 16.6 0.2 18 14-31 131-148 (157) 7 cd03730 SOCS_ASB14 SOCS (suppr 14.4 64 0.0016 16.1 0.6 17 2-18 32-48 (57) 8 pfam00660 SRP1_TIP1 Seripauper 12.2 58 0.0015 16.3 -0.1 25 6-30 12-36 (104) 9 KOG0489 consensus 10.9 78 0.002 15.7 0.2 17 12-28 171-187 (261) 10 cd03731 SOCS_ASB15 SOCS (suppr 9.6 1.1E+02 0.0029 15.0 0.6 17 2-18 32-48 (56) No 1 >pfam12109 CXCR4_N CXCR4 Chemokine receptor N terminal. CXCR4 and its ligand stromal cell-derived factor-1 (a.k.a. CXCL12) are essential for proper fetal development. CXCR4 is also the major coreceptor for T-tropic strains of human immunodeficiency virus 1 (HIV-1), and SDF-1 inhibits HIV-1 infection. Additionally, SDF-1 and CXCR4 mediate cancer cell migration and metastasis. The N terminal domain of most chemokine receptors is the ligand binding domain and so the N terminal domain of CXCR4 is the binding site for SDF-1. Probab=48.37 E-value=5.4 Score=21.23 Aligned_cols=11 Identities=45% Similarity=0.833 Sum_probs=9.3 Q ss_pred HHCCCCCCHHH Q ss_conf 51004434223 Q 537021.9.peg.5 13 LFQEKNAHFNQ 23 (37) Q Consensus 13 lfqeknahfnq 23 (37) -|++.|||||+ T Consensus 23 Cf~~eNa~FNr 33 (33) T pfam12109 23 CFRHENADFNR 33 (33) T ss_pred CCCCCCCCCCC T ss_conf 10113454368 No 2 >pfam04318 DUF468 Protein of unknown function (DUF468). These conserved ORFs probably are probably not translated into protein [Personal communication, Val Wood]. Probab=38.63 E-value=7.9 Score=20.47 Aligned_cols=17 Identities=59% Similarity=0.819 Sum_probs=13.5 Q ss_pred CCCHHHEEECCCCEEEE Q ss_conf 43422303014442652 Q 537021.9.peg.5 18 NAHFNQYISFSQDKIVI 34 (37) Q Consensus 18 nahfnqyisfsqdkivi 34 (37) -.|||-||||+..|-.| T Consensus 21 Y~Hfniyi~f~~pKycI 37 (84) T pfam04318 21 YPHFNIYISFSGPKYCI 37 (84) T ss_pred CCCEEEEEECCCCHHHH T ss_conf 86267999716951654 No 3 >pfam10876 DUF2669 Protein of unknown function (DUF2669). This family of proteins has no known function. Probab=20.45 E-value=24 Score=18.20 Aligned_cols=10 Identities=50% Similarity=0.763 Sum_probs=7.6 Q ss_pred CCCCHHHEEE Q ss_conf 4434223030 Q 537021.9.peg.5 17 KNAHFNQYIS 26 (37) Q Consensus 17 knahfnqyis 26 (37) -|.|||+|-+ T Consensus 88 Fn~HFNtyR~ 97 (133) T pfam10876 88 FNQHFNTYRG 97 (133) T ss_pred HHHHHHHHHH T ss_conf 8889888888 No 4 >pfam11869 DUF3389 Protein of unknown function (DUF3389). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are about 80 amino acids in length. Probab=17.12 E-value=60 Score=16.27 Aligned_cols=11 Identities=55% Similarity=0.754 Sum_probs=8.5 Q ss_pred EEECCCCEEEE Q ss_conf 03014442652 Q 537021.9.peg.5 24 YISFSQDKIVI 34 (37) Q Consensus 24 yisfsqdkivi 34 (37) -|+|||.||.. T Consensus 2 vI~Fs~GKiI~ 12 (75) T pfam11869 2 VIDFSQGKIIA 12 (75) T ss_pred EEEECCCEEEE T ss_conf 69844984998 No 5 >pfam07818 HCNGP HCNGP-like protein. This family comprises sequences bearing significant similarity to the mouse transcriptional regulator protein HCNGP. This protein is localized to the nucleus and is thought to be involved in the regulation of beta-2-microglobulin genes. Probab=14.94 E-value=75 Score=15.81 Aligned_cols=27 Identities=33% Similarity=0.567 Sum_probs=21.1 Q ss_pred CHHHHHHHHHHHCCCCCCHHHEEECCC Q ss_conf 089999887751004434223030144 Q 537021.9.peg.5 3 RLHQIIKHFILFQEKNAHFNQYISFSQ 29 (37) Q Consensus 3 rlhqiikhfilfqeknahfnqyisfsq 29 (37) .|.+-|.+|.-...++.|||+-|.-|. T Consensus 12 ~L~~Ki~~fl~lK~~g~~fN~~l~~s~ 38 (96) T pfam07818 12 ALNAKIEHFLKLKRQGIHFNQKLQNSS 38 (96) T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 999999999999983998889998642 No 6 >pfam11922 DUF3440 Domain of unknown function (DUF3440). This presumed domain is functionally uncharacterized. This domain is found in bacteria. This domain is typically between 53 to 190 amino acids in length. This domain is found associated with pfam01507. This domain has a conserved KND sequence motif. Probab=14.85 E-value=51 Score=16.62 Aligned_cols=18 Identities=33% Similarity=0.586 Sum_probs=14.7 Q ss_pred HCCCCCCHHHEEECCCCE Q ss_conf 100443422303014442 Q 537021.9.peg.5 14 FQEKNAHFNQYISFSQDK 31 (37) Q Consensus 14 fqeknahfnqyisfsqdk 31 (37) --.||.|.-.|.|||+.| T Consensus 131 ~ilKND~~cr~l~Fs~tK 148 (157) T pfam11922 131 CILKNDHWCRYMGFGPTK 148 (157) T ss_pred HHHHCCHHHHHCCCCCCH T ss_conf 999110575114678884 No 7 >cd03730 SOCS_ASB14 SOCS (suppressors of cytokine signaling) box of ASB14-like proteins. ASB family members have a C-terminal SOCS box and an N-terminal ankyrin-related sequence. The general function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions. Probab=14.44 E-value=64 Score=16.12 Aligned_cols=17 Identities=35% Similarity=0.645 Sum_probs=12.5 Q ss_pred CCHHHHHHHHHHHCCCC Q ss_conf 40899998877510044 Q 537021.9.peg.5 2 LRLHQIIKHFILFQEKN 18 (37) Q Consensus 2 lrlhqiikhfilfqekn 18 (37) |-+-..+|++|||.|-. T Consensus 32 lplP~~lk~YilyKEYD 48 (57) T cd03730 32 LPLPNRLKAYILYKEYD 48 (57) T ss_pred CCCCHHHHHHHHHHHHH T ss_conf 79978999999998864 No 8 >pfam00660 SRP1_TIP1 Seripauperin and TIP1 family. Probab=12.24 E-value=58 Score=16.32 Aligned_cols=25 Identities=20% Similarity=0.534 Sum_probs=19.1 Q ss_pred HHHHHHHHHCCCCCCHHHEEECCCC Q ss_conf 9998877510044342230301444 Q 537021.9.peg.5 6 QIIKHFILFQEKNAHFNQYISFSQD 30 (37) Q Consensus 6 qiikhfilfqeknahfnqyisfsqd 30 (37) |++.--.....-++|..+|+||.++ T Consensus 12 ~l~EL~v~l~Dvkshl~~Yls~~~~ 36 (104) T pfam00660 12 QLAELNVILSDVKAHLSDYLSLQAN 36 (104) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 8999999999999729999999835 No 9 >KOG0489 consensus Probab=10.90 E-value=78 Score=15.72 Aligned_cols=17 Identities=47% Similarity=0.708 Sum_probs=12.4 Q ss_pred HHHCCCCCCHHHEEECC Q ss_conf 75100443422303014 Q 537021.9.peg.5 12 ILFQEKNAHFNQYISFS 28 (37) Q Consensus 12 ilfqeknahfnqyisfs 28 (37) +|--||.-|||.|++-. T Consensus 171 llELEkEFhfN~YLtR~ 187 (261) T KOG0489 171 LLELEKEFHFNKYLTRS 187 (261) T ss_pred HHHHHHHHHCCCCCCHH T ss_conf 76644432011232316 No 10 >cd03731 SOCS_ASB15 SOCS (suppressors of cytokine signaling) box of ASB15-like proteins. ASB family members have a C-terminal SOCS box and an N-terminal ankyrin-related sequence. Human ASB15 is expressed predominantly in skeletal muscle and participates in the regulation of protein turnover and muscle cell development by stimulating protein synthesis and regulating differentiation of muscle cells. The general function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions. Probab=9.57 E-value=1.1e+02 Score=14.97 Aligned_cols=17 Identities=35% Similarity=0.686 Sum_probs=11.9 Q ss_pred CCHHHHHHHHHHHCCCC Q ss_conf 40899998877510044 Q 537021.9.peg.5 2 LRLHQIIKHFILFQEKN 18 (37) Q Consensus 2 lrlhqiikhfilfqekn 18 (37) |-+-..+|.+|||.|-. T Consensus 32 lPlPp~Lk~YilykEYD 48 (56) T cd03731 32 LPLPPALKRYILYKEYD 48 (56) T ss_pred CCCCHHHHHHHHHHHHH T ss_conf 78988999888875532 Done!