Query 537021.9.peg.218_1 Match_columns 39 No_of_seqs 1 out of 3 Neff 1.0 Searched_HMMs 39220 Date Wed May 25 16:19:49 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_218.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 KOG2481 consensus 15.9 68 0.0017 15.6 1.0 14 21-34 382-395 (570) 2 PRK10202 ebgC cryptic beta-D-g 15.2 1.2E+02 0.0031 14.4 2.7 31 5-35 39-69 (149) 3 TIGR00996 Mtu_fam_mce virulenc 12.5 1.5E+02 0.0037 14.0 3.0 36 2-37 78-113 (304) 4 KOG4175 consensus 11.0 88 0.0023 15.0 0.5 18 7-24 170-189 (268) 5 TIGR02928 TIGR02928 orc1/cdc6 9.5 1.2E+02 0.0031 14.4 0.7 12 8-19 44-55 (383) 6 pfam08783 DWNN DWNN domain. DW 8.5 1.6E+02 0.0042 13.7 1.1 13 19-31 61-73 (74) 7 PRK08201 hypothetical protein; 7.6 2.3E+02 0.0059 13.0 1.8 14 2-15 102-115 (455) 8 pfam05409 Peptidase_C30 Corona 7.3 2E+02 0.0051 13.3 1.1 17 5-21 106-122 (293) 9 PRK08554 peptidase; Reviewed 7.2 2.3E+02 0.0058 13.0 1.3 13 3-15 87-99 (438) 10 COG5222 Uncharacterized conser 7.1 1.9E+02 0.0049 13.4 0.9 14 20-33 66-79 (427) No 1 >KOG2481 consensus Probab=15.89 E-value=68 Score=15.63 Aligned_cols=14 Identities=57% Similarity=0.726 Sum_probs=11.3 Q ss_pred CCCCEEEEEEEEEE Q ss_conf 12100122368999 Q 537021.9.peg.2 21 ARSTSVINRRYVQI 34 (39) Q Consensus 21 arstsvinrryvqi 34 (39) .-.++||+|-|||- T Consensus 382 ~~~~~v~gR~YvQP 395 (570) T KOG2481 382 GQQTSVIGRTYVQP 395 (570) T ss_pred CCCCEEEEEEEECC T ss_conf 76415631244132 No 2 >PRK10202 ebgC cryptic beta-D-galactosidase subunit beta; Reviewed Probab=15.18 E-value=1.2e+02 Score=14.38 Aligned_cols=31 Identities=26% Similarity=0.359 Sum_probs=22.9 Q ss_pred EEEECCEEEECCCCCCCCCCEEEEEEEEEEE Q ss_conf 9973103531135421121001223689999 Q 537021.9.peg.2 5 MTIEDSFLYGKMGSTAARSTSVINRRYVQIS 35 (39) Q Consensus 5 mtiedsflygkmgstaarstsvinrryvqis 35 (39) -.|-||+.|---..++....-.-||||..+- T Consensus 39 ysIGDsl~y~~e~~~~~~~~fe~hRrY~DVh 69 (149) T PRK10202 39 HSIGDSLTYRVETDSATDALFTGHRRYFEVH 69 (149) T ss_pred EECCCEEEEEEECCCCCEEEEEEEEEEEEEE T ss_conf 8727747999803763022799770089999 No 3 >TIGR00996 Mtu_fam_mce virulence factor Mce family protein; InterPro: IPR005693 Mycobacterial species are usually slender, curved rods with a unique cell wall of complex waxes and glycolipids. They are resistant to acids, alkalis and dehydration, and are very slow to grow in vitro. The human pathogenic Mycobacteria (Mycobacterium tuberculosis and Mycobacterium leprae) are becoming resistant to conventional treatments and, together with HIV-related diseases, are fast posing a global health threat. An essential requirement, particularly of M. tuberculosis, is to gain entrance to, and to resist, the hostile intra-cellular environment of epithelial cells . The genome of M. tuberculosis contains four mammalian cell entry (mce) operons , which are widely distributed in both pathogenic and non-pathogenic mycobacteria suggesting that the presence of these putative virulence genes is not an indicator for the pathogenicity of the bacilli. At the 5' end of the transcriptional unit are two genes that have evolved from a tandem duplication, and whose products resemble YrbE, a conserved hypothetical protein found in Escherichia. coli, Haemophilus influenzae and Porphyra purpurea. All of the YrbE proteins, including the eight from M. tuberculosis, are probable integral membrane proteins with six TM alpha helices. The next six genes in each operon, the mce genes, are related, their products ranging in size from 275 to 564 amino acid residues. The corresponding protein sequences contain a number of highly conserved motifs that define a 24- member family with a common organization. Twenty of these proteins have a strongly hydrophobic segment at the NH2-terminal end that could span the lipid bilayer whereas the remaining four, all of which correspond to the seventh gene in their respective operons, mce1E to mce4E, are probably lipoprotein precursors. In all 24 cases the COOH-terminal domain of the mce proteins is predicted to be exposed on the external face of the cytoplasmic membrane . The ability to gain entry and resist the antimicrobial intracellular environment of mammalian cells is an essential virulence property of M. tuberculosis. This property is conferred by Mce1A, the third gene of operon 1, which when expressed in Escherichia coli conferred the ability to invade HeLa cells. The recombinant protein when used to coat latex spheres also promoted their uptake into HeLa cells. N-terminus deletion constructs of Mce1A identified a domain located between amino acid positions 106 and 163 that was needed for this cell uptake activity. Mce1A contains hydrophobic stretches at the N-terminus predictive of a signal sequence, and colloidal gold immunoelectron microscopy indicated that the corresponding native protein is expressed on the surface of M. tuberculosis. Recombinant Mce2A, which had the highest level of identity (67%) to Mce1A, was unable to promote the association of microspheres with HeLa cells and an mce-deletion mutant in Mycobacterium bovis greatly impaired the ability of the microbe to infect epithelial cells in vitro. Although the exact function of Mce1A is still unknown, it appears to serve as an effector molecule expressed on the surface of M. tuberculosis that is capable of eliciting plasma membrane perturbations in non-phagocytic mammalian cells . The distribution of the mce operons in both pathogenic and non-pathogenic mycobacteria suggests that the presence of these putative virulence genes is not an indicator for the pathogenicity of the bacilli - it may be that pathogenicity is determined by their expression , . The members of this family represent all 24 genes associated with the four mammalian cell entry operons of M. tuberculosis and their homologs in other Actinomycetales.; GO: 0009405 pathogenesis. Probab=12.48 E-value=1.5e+02 Score=13.97 Aligned_cols=36 Identities=25% Similarity=0.423 Sum_probs=27.1 Q ss_pred EEEEEEECCEEEECCCCCCCCCCEEEEEEEEEEEEE Q ss_conf 399997310353113542112100122368999974 Q 537021.9.peg.2 2 TFTMTIEDSFLYGKMGSTAARSTSVINRRYVQISMK 37 (39) Q Consensus 2 tftmtiedsflygkmgstaarstsvinrryvqismk 37 (39) .-+|.+..+..-+.--.-+-++++.+-.|||++..- T Consensus 78 ~v~~~~~~~~~iP~~~~A~I~~~~l~G~~~veL~pp 113 (304) T TIGR00996 78 RVTFSLDRSVTIPANATAAIRSTTLLGSRYVELTPP 113 (304) T ss_pred EEEEEECCCEECCCCCEEEEECCCEEEEEEEEEECC T ss_conf 999970677323772078873033034235776338 No 4 >KOG4175 consensus Probab=11.03 E-value=88 Score=15.05 Aligned_cols=18 Identities=44% Similarity=0.864 Sum_probs=14.6 Q ss_pred EECCEEE--ECCCCCCCCCC Q ss_conf 7310353--11354211210 Q 537021.9.peg.2 7 IEDSFLY--GKMGSTAARST 24 (39) Q Consensus 7 iedsfly--gkmgstaarst 24 (39) +.|||.| ..||+|..|++ T Consensus 170 ~adsFiYvVSrmG~TG~~~s 189 (268) T KOG4175 170 AADSFIYVVSRMGVTGTRES 189 (268) T ss_pred HHCCEEEEEEECCCCCCHHH T ss_conf 63255999871456661777 No 5 >TIGR02928 TIGR02928 orc1/cdc6 family replication initiation protein; InterPro: IPR014277 This set of DNA binding proteins shows homology to the origin recognition complex subunit 1/cell division control protein 6 family in eukaryotes. The proteins in this entry are found exclusively in the archaea. Several members may be found in a genome and interact with each other.. Probab=9.48 E-value=1.2e+02 Score=14.35 Aligned_cols=12 Identities=42% Similarity=0.955 Sum_probs=8.8 Q ss_pred ECCEEEECCCCC Q ss_conf 310353113542 Q 537021.9.peg.2 8 EDSFLYGKMGST 19 (39) Q Consensus 8 edsflygkmgst 19 (39) ..-|+|||-|+- T Consensus 44 ~Ni~iYGkTGtG 55 (383) T TIGR02928 44 SNIFIYGKTGTG 55 (383) T ss_pred CCEEEECCCCCC T ss_conf 725887888987 No 6 >pfam08783 DWNN DWNN domain. DWNN is a ubiquitin like domain found at the N terminus of the RBBP6 family of splicing-associated proteins. The DWNN domain is independently expressed in higher vertebrates so it may function as a novel ubiquitin-like modifier of other proteins. Probab=8.48 E-value=1.6e+02 Score=13.73 Aligned_cols=13 Identities=46% Similarity=0.549 Sum_probs=9.6 Q ss_pred CCCCCCEEEEEEE Q ss_conf 2112100122368 Q 537021.9.peg.2 19 TAARSTSVINRRY 31 (39) Q Consensus 19 taarstsvinrry 31 (39) --.|++|||-||- T Consensus 61 lIprnssVivrRv 73 (74) T pfam08783 61 LIPRNSSVIVRRI 73 (74) T ss_pred EEECCCEEEEEEC T ss_conf 7856888999968 No 7 >PRK08201 hypothetical protein; Provisional Probab=7.60 E-value=2.3e+02 Score=12.99 Aligned_cols=14 Identities=43% Similarity=0.674 Sum_probs=10.2 Q ss_pred EEEEEEECCEEEEC Q ss_conf 39999731035311 Q 537021.9.peg.2 2 TFTMTIEDSFLYGK 15 (39) Q Consensus 2 tftmtiedsflygk 15 (39) -|..+++|.+|||. T Consensus 102 PF~~~i~dG~lyGR 115 (455) T PRK08201 102 PFEPTIRDGKLYAR 115 (455) T ss_pred CEEEEEECCEEEEE T ss_conf 83258558889985 No 8 >pfam05409 Peptidase_C30 Coronavirus endopeptidase C30. Corresponds to Merops family C30. These peptidases are involved in viral polyprotein processing in replication. Probab=7.31 E-value=2e+02 Score=13.28 Aligned_cols=17 Identities=41% Similarity=0.505 Sum_probs=13.7 Q ss_pred EEEECCEEEECCCCCCC Q ss_conf 99731035311354211 Q 537021.9.peg.2 5 MTIEDSFLYGKMGSTAA 21 (39) Q Consensus 5 mtiedsflygkmgstaa 21 (39) -||..|||-|.-||..- T Consensus 106 ~TIkgSFl~GaCGSvGy 122 (293) T pfam05409 106 GTIRGSFIAGACGSVGY 122 (293) T ss_pred CEEEEEEECCCCCCCCE T ss_conf 80788887465788836 No 9 >PRK08554 peptidase; Reviewed Probab=7.22 E-value=2.3e+02 Score=13.00 Aligned_cols=13 Identities=31% Similarity=0.769 Sum_probs=8.2 Q ss_pred EEEEEECCEEEEC Q ss_conf 9999731035311 Q 537021.9.peg.2 3 FTMTIEDSFLYGK 15 (39) Q Consensus 3 ftmtiedsflygk 15 (39) |..+|+|.+|||. T Consensus 87 F~~~i~dg~lyGR 99 (438) T PRK08554 87 FKLTVKGDRAYGR 99 (438) T ss_pred CCEEEECCEEECC T ss_conf 6379989999815 No 10 >COG5222 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only] Probab=7.08 E-value=1.9e+02 Score=13.39 Aligned_cols=14 Identities=50% Similarity=0.432 Sum_probs=10.0 Q ss_pred CCCCCEEEEEEEEE Q ss_conf 11210012236899 Q 537021.9.peg.2 20 AARSTSVINRRYVQ 33 (39) Q Consensus 20 aarstsvinrryvq 33 (39) ..||||||-+|--. T Consensus 66 iprstsVIV~R~Pa 79 (427) T COG5222 66 IPRSTSVIVSRIPA 79 (427) T ss_pred EECCCEEEEEECHH T ss_conf 73365278870402 Done!