Query 537021.9.peg.852_1 Match_columns 38 No_of_seqs 1 out of 3 Neff 1.0 Searched_HMMs 39220 Date Wed May 25 02:13:00 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_852.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam06415 iPGM_N BPG-independe 12.6 1.1E+02 0.0029 15.1 1.4 13 2-14 65-77 (223) 2 TIGR02147 Fsuc_second hypothet 12.2 80 0.002 15.8 0.6 14 18-31 75-88 (281) 3 TIGR01579 MiaB-like-C MiaB-lik 10.5 1E+02 0.0026 15.3 0.7 12 2-13 399-410 (492) 4 pfam06730 DUF1208 Protein of u 10.1 1.5E+02 0.0039 14.6 1.4 22 12-33 68-89 (219) 5 KOG3710 consensus 10.0 1.3E+02 0.0033 14.9 1.0 10 1-10 210-219 (280) 6 pfam06926 Rep_Org_C Putative r 9.9 1.1E+02 0.0027 15.3 0.6 22 17-38 4-25 (95) 7 cd07598 BAR_FAM92 The Bin/Amph 9.8 1.6E+02 0.004 14.5 1.4 22 12-33 61-82 (211) 8 KOG3264 consensus 8.9 1.3E+02 0.0034 14.8 0.7 25 13-38 189-213 (221) 9 pfam07184 CTV_P33 Citrus trist 8.5 52 0.0013 16.6 -1.5 32 7-38 44-75 (303) 10 TIGR01675 plant-AP plant acid 8.4 1.8E+02 0.0047 14.2 1.3 15 2-16 149-164 (245) No 1 >pfam06415 iPGM_N BPG-independent PGAM N-terminus (iPGM_N). This family represents the N-terminal region of the 2,3-bisphosphoglycerate-independent phosphoglycerate mutase (or phosphoglyceromutase or BPG-independent PGAM) protein (EC:5.4.2.1). The family is found in conjunction with pfam01676 (located in the C-terminal region of the protein). Probab=12.61 E-value=1.1e+02 Score=15.13 Aligned_cols=13 Identities=38% Similarity=0.710 Sum_probs=10.9 Q ss_pred EEEECCCCCCCCH Q ss_conf 7886186566541 Q 537021.9.peg.8 2 VFLFSDRRDTLPL 14 (38) Q Consensus 2 vflfsdrrdtlpl 14 (38) +-+|.|-|||.|- T Consensus 65 lH~f~DGRDt~P~ 77 (223) T pfam06415 65 IHAFTDGRDVAPK 77 (223) T ss_pred EEEEECCCCCCCH T ss_conf 9986068888836 No 2 >TIGR02147 Fsuc_second hypothetical protein, TIGR02147; InterPro: IPR011873 This family consists of the 40 members of a paralogous protein family in the rumen anaerobe Fibrobacter succinogenes S85. Member proteins are about 270 residues long and appear to lack signal sequences and transmembrane helices. The only perfectly conserved residue is a glycine in an otherwise poorly conserved region, suggesting members are not enzymes. The family is not characterised.. Probab=12.25 E-value=80 Score=15.80 Aligned_cols=14 Identities=43% Similarity=0.733 Sum_probs=11.8 Q ss_pred HHHHHHHCCCCHHH Q ss_conf 64455742221788 Q 537021.9.peg.8 18 AEYFFLLVNFDQRQ 31 (38) Q Consensus 18 aeyffllvnfdqrq 31 (38) +.||-.+|+|+|-+ T Consensus 75 ~~YF~~MV~F~~Ak 88 (281) T TIGR02147 75 AAYFEAMVAFNQAK 88 (281) T ss_pred HHHHHHHHHHCCCC T ss_conf 89999974111578 No 3 >TIGR01579 MiaB-like-C MiaB-like tRNA modifying enzyme; InterPro: IPR006467 This clade of sequences is closely related to MiaB, a modifier of isopentenylated adenosine-37 of certain eukaryotic and bacterial tRNAs (see IPR006463 from INTERPRO). Sequence alignments suggest that these sequences perform the same chemical transformation as MiaB, perhaps on a different (or differently modified) tRNA base substrate. This clade represents a subfamily that spans low GC Gram-positive bacteria, alpha and epsilon proteobacteria, Campylobacter, Porphyromonas, Aquifex, Thermotoga, Chlamydia, Treponema and Fusobacterium , , .. Probab=10.55 E-value=1e+02 Score=15.30 Aligned_cols=12 Identities=33% Similarity=0.534 Sum_probs=9.4 Q ss_pred EEEECCCCCCCC Q ss_conf 788618656654 Q 537021.9.peg.8 2 VFLFSDRRDTLP 13 (38) Q Consensus 2 vflfsdrrdtlp 13 (38) +|-||+|.+|-. T Consensus 399 iFpyS~R~~T~A 410 (492) T TIGR01579 399 IFPYSARPGTPA 410 (492) T ss_pred EECCCCCCCCCC T ss_conf 542688432812 No 4 >pfam06730 DUF1208 Protein of unknown function (DUF1208). This family consists of several eukaryotic sequences of around 270 residues in length. Members of this family are found in mouse, human and Drosophila melanogaster. The function of this family is unknown. Probab=10.10 E-value=1.5e+02 Score=14.56 Aligned_cols=22 Identities=36% Similarity=0.637 Sum_probs=17.7 Q ss_pred CCHHHHHHHHHHHCCCCHHHHH Q ss_conf 5410076445574222178887 Q 537021.9.peg.8 12 LPLRNFAEYFFLLVNFDQRQIE 33 (38) Q Consensus 12 lplrnfaeyffllvnfdqrqie 33 (38) +.|+||||.|-.+-.+-|-+++ T Consensus 68 ~gL~~fae~~a~vqDYr~a~V~ 89 (219) T pfam06730 68 LGLKSFADEFAKLQDYRQAEVE 89 (219) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 6799999999998878999999 No 5 >KOG3710 consensus Probab=9.99 E-value=1.3e+02 Score=14.85 Aligned_cols=10 Identities=50% Similarity=1.188 Sum_probs=8.4 Q ss_pred CEEEECCCCC Q ss_conf 9788618656 Q 537021.9.peg.8 1 MVFLFSDRRD 10 (38) Q Consensus 1 mvflfsdrrd 10 (38) .+|+|||||. T Consensus 210 LlffwSdrrn 219 (280) T KOG3710 210 LLFFWSDRRN 219 (280) T ss_pred EEEEEECCCC T ss_conf 8999856888 No 6 >pfam06926 Rep_Org_C Putative replisome organiser protein C-terminus. This family represents the C-terminus (approximately 100 residues) of a putative replisome organiser protein in Lactococcus bacteriophages. Probab=9.94 E-value=1.1e+02 Score=15.26 Aligned_cols=22 Identities=32% Similarity=0.644 Sum_probs=17.9 Q ss_pred HHHHHHHHCCCCHHHHHHHHCC Q ss_conf 7644557422217888775259 Q 537021.9.peg.8 17 FAEYFFLLVNFDQRQIEKRLMV 38 (38) Q Consensus 17 faeyffllvnfdqrqiekrlmv 38 (38) +.+||-+..||....|.||-|+ T Consensus 4 lS~f~d~F~nfs~KN~skR~ma 25 (95) T pfam06926 4 LSEFLDSFINFSSKNISKRAMA 25 (95) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999998855206899999 No 7 >cd07598 BAR_FAM92 The Bin/Amphiphysin/Rvs (BAR) domain of Family with sequence similarity 92 (FAM92). BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions including organelle biogenesis, membrane trafficking or remodeling, and cell division and migration. This group is composed of proteins from the family with sequence similarity 92 (FAM92), which were originally identified by the presence of the unknown domain DUF1208. This domain shows similarity to the BAR domains of sorting nexins. Mammals contain at least two member types, FAM92A and FAM92B, which may exist in many variants. The Xenopus homolog of FAM92A1, xVAP019, is essential for embryo survival and cell differentiation. FAM92A1 may be involved in regulating cell proliferation and apoptosis. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions. Probab=9.81 E-value=1.6e+02 Score=14.48 Aligned_cols=22 Identities=36% Similarity=0.581 Sum_probs=17.1 Q ss_pred CCHHHHHHHHHHHCCCCHHHHH Q ss_conf 5410076445574222178887 Q 537021.9.peg.8 12 LPLRNFAEYFFLLVNFDQRQIE 33 (38) Q Consensus 12 lplrnfaeyffllvnfdqrqie 33 (38) ..|+||||.+-.+-.+-|-+++ T Consensus 61 ~gL~~fae~~a~vqDYr~aeV~ 82 (211) T cd07598 61 QGLKNFAECLAALQDYRQAEVE 82 (211) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 8999999999998878999999 No 8 >KOG3264 consensus Probab=8.87 E-value=1.3e+02 Score=14.81 Aligned_cols=25 Identities=44% Similarity=0.648 Sum_probs=17.7 Q ss_pred CHHHHHHHHHHHCCCCHHHHHHHHCC Q ss_conf 41007644557422217888775259 Q 537021.9.peg.8 13 PLRNFAEYFFLLVNFDQRQIEKRLMV 38 (38) Q Consensus 13 plrnfaeyffllvnfdqrqiekrlmv 38 (38) .+|+|||..--||+...-.+ ||+|+ T Consensus 189 ~mr~FaeqL~PLVhleKidy-~r~m~ 213 (221) T KOG3264 189 AMRSFAEQLKPLVHLEKIDY-KRLMV 213 (221) T ss_pred HHHHHHHHHHHHHCCCCCCH-HHCCC T ss_conf 99999998624442235786-65466 No 9 >pfam07184 CTV_P33 Citrus tristeza virus P33 protein. This family consists of several Citrus tristeza virus (CTV) P33 proteins. The function of P33 is unclear although it is known that the protein is not needed for virion formation. Probab=8.45 E-value=52 Score=16.63 Aligned_cols=32 Identities=38% Similarity=0.537 Sum_probs=0.0 Q ss_pred CCCCCCCHHHHHHHHHHHCCCCHHHHHHHHCC Q ss_conf 86566541007644557422217888775259 Q 537021.9.peg.8 7 DRRDTLPLRNFAEYFFLLVNFDQRQIEKRLMV 38 (38) Q Consensus 7 drrdtlplrnfaeyffllvnfdqrqiekrlmv 38 (38) ||..|.--.-|.-|||||-|-.-..+....|| T Consensus 44 drdptvladyfslyffllnnd~igplaasi~v 75 (303) T pfam07184 44 DRDPTVLADYFSLYFFLLNNDNIGPLAASIMV 75 (303) T ss_pred CCCCHHHHHHHHHEEEEECCCCCCCCEEEEEE T ss_conf 77842357634202566438875651246786 No 10 >TIGR01675 plant-AP plant acid phosphatase; InterPro: IPR010028 This entry represents a family of acid phosphatase , from plants which are closely related to the class B non-specific acid phosphatase OlpA (IPR006423 from INTERPRO, which is believed to be a 5'-nucleotide phosphatase) and somewhat more distantly to another class B phosphatase, AphA (IPR010025 from INTERPRO). Together these three clades define a subfamily of Acid phosphatase (Class B), which corresponds to the IIIB subfamily of the haloacid dehalogenase (HAD) superfamily of aspartate nucleophile hydrolases. It has been reported that the best substrates were purine 5'-nucleoside phosphates . This is in concordance with the assignment of the Haemophilus influenzae hel protein (from IPR006423 from INTERPRO) as a 5'-nucleotidase, however there is presently no other evidence to support this specific function for this family of plant phosphatases. Many genes from this family have been annotated as vegetative storage proteins (VSPs) due to their close homology with these earlier-characterised gene products which are highly expressed in leaves. There are significant differences however, including expression levels and distribution . The most important difference is the lack in authentic VSPs of the nucleophilic aspartate residue, which is instead replaced by serine, glycine or asparagine. Thus these proteins can not be expected to be active phosphatases. This issue was confused by the publication in 1992 of an article claiming activity for the Glycine max VSP . In 1994 this assertion was refuted by the separation of the activity from the VSP. This entry explicitly excludes the VSPs which lack the nucleophilic aspartate. The possibility exists, however, that some members of this family may, while containing all of the conserved HAD-superfamily catalytic residues, lack activity and have a function related to the function of the VSPs rather than the acid phosphatases.; GO: 0003993 acid phosphatase activity. Probab=8.38 E-value=1.8e+02 Score=14.18 Aligned_cols=15 Identities=53% Similarity=0.846 Sum_probs=0.0 Q ss_pred EEEECCCCCCCC-HHH Q ss_conf 788618656654-100 Q 537021.9.peg.8 2 VFLFSDRRDTLP-LRN 16 (38) Q Consensus 2 vflfsdrrdtlp-lrn 16 (38) |||.|+|.+|-- ||| T Consensus 149 ~flls~R~etP~tlR~ 164 (245) T TIGR01675 149 VFLLSGRKETPETLRS 164 (245) T ss_pred EEEECCCCCCCHHHHH T ss_conf 8897178721046788 Done!