Query 537021.9.peg.254_1 Match_columns 38 No_of_seqs 1 out of 3 Neff 1.0 Searched_HMMs 39220 Date Wed May 25 16:14:11 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_254.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam05541 Spheroidin Entomopox 25.1 52 0.0013 16.8 2.1 19 8-31 141-159 (943) 2 COG3564 Uncharacterized protei 20.9 81 0.0021 15.8 2.4 25 6-34 89-113 (116) 3 PRK09441 cytoplasmic alpha-amy 20.7 32 0.00083 17.7 0.3 11 1-11 360-370 (482) 4 COG3660 Predicted nucleoside-d 17.2 47 0.0012 16.9 0.5 17 14-30 191-207 (329) 5 pfam08413 Metallophos_C Calcin 15.8 88 0.0022 15.7 1.6 16 19-36 1-16 (38) 6 TIGR02485 CobZ_N-term precorri 11.3 95 0.0024 15.5 0.7 11 27-37 260-270 (467) 7 PRK02259 aspartoacylase; Provi 10.3 52 0.0013 16.7 -0.9 25 10-34 236-260 (263) 8 pfam11305 DUF3107 Protein of u 10.1 2E+02 0.005 14.0 2.6 20 17-36 44-63 (74) 9 KOG0546 consensus 10.0 85 0.0022 15.7 0.1 21 6-26 92-112 (372) 10 TIGR01271 CFTR_protein cystic 10.0 97 0.0025 15.4 0.4 7 3-9 625-631 (1534) No 1 >pfam05541 Spheroidin Entomopoxvirus spheroidin protein. Entomopoxviruses (EPVs) are large (300-400 nm) oval-shaped viruses replicating in the cytoplasm of their insect host cells. At the end of their replicative cycle EPVs virions are occluded in a highly expressed protein called spheroidin. This protein forms large (5-20 mm long) oval-shaped occlusion bodies (OBs) called spherules. The infectious cycle of EPVs begins with the ingestion by the insect host of the spherules, their dissolution by the alkaline reducing conditions of the midgut fluid and the release of virions in the midgut lumen. The infective particles first replicate in midgut epithelial cells, then pass the gut barrier to colonize the internal tissues, mainly the fat body cells. Whilst spheroidin has been demonstrated to be non-essential for viral replication, it plays an essential role in the natural biological cycle of the virus in protecting virions from adverse environmental conditions (e.g. UV degradation) and thus Probab=25.13 E-value=52 Score=16.76 Aligned_cols=19 Identities=42% Similarity=0.534 Sum_probs=15.0 Q ss_pred EEEECCCCCCCCCEEEEEECCEEE Q ss_conf 076348873123346677335278 Q 537021.9.peg.2 8 GYFHNENGYEKKGIKFLISKSRIR 31 (38) Q Consensus 8 gyfhnengyekkgikflisksrir 31 (38) -|||+.||+ +|||+++.|- T Consensus 141 i~f~ddng~-----ygLit~~nI~ 159 (943) T pfam05541 141 IYFVDDNGV-----YGLITKENIV 159 (943) T ss_pred EEEECCCCC-----EEEEECCCEE T ss_conf 999927981-----7887511002 No 2 >COG3564 Uncharacterized protein conserved in bacteria [Function unknown] Probab=20.88 E-value=81 Score=15.83 Aligned_cols=25 Identities=48% Similarity=0.850 Sum_probs=18.4 Q ss_pred EEEEEECCCCCCCCCEEEEEECCEEEEEE Q ss_conf 70076348873123346677335278986 Q 537021.9.peg.2 6 YGGYFHNENGYEKKGIKFLISKSRIRCVQ 34 (38) Q Consensus 6 yggyfhnengyekkgikflisksrircvq 34 (38) -||-|.-+||-|| .| .+.||+-.|| T Consensus 89 RGGmFSLdng~E~---RF-LtRsrv~~~~ 113 (116) T COG3564 89 RGGMFSLDNGREK---RF-LTRSRVCSVQ 113 (116) T ss_pred CCCEEECCCCCCE---EE-EEEEEEEECC T ss_conf 8864572578513---56-6511266314 No 3 >PRK09441 cytoplasmic alpha-amylase; Reviewed Probab=20.75 E-value=32 Score=17.74 Aligned_cols=11 Identities=55% Similarity=1.509 Sum_probs=8.8 Q ss_pred CCEEEEEEEEE Q ss_conf 96057700763 Q 537021.9.peg.2 1 VPCYFYGGYFH 11 (38) Q Consensus 1 vpcyfyggyfh 11 (38) +||-|||-|+- T Consensus 360 ~PcvfYgDyyg 370 (482) T PRK09441 360 YPCVFYGDYYG 370 (482) T ss_pred EEEEEEHHHCC T ss_conf 42477213206 No 4 >COG3660 Predicted nucleoside-diphosphate-sugar epimerase [Cell envelope biogenesis, outer membrane] Probab=17.18 E-value=47 Score=16.94 Aligned_cols=17 Identities=47% Similarity=0.448 Sum_probs=14.3 Q ss_pred CCCCCCCEEEEEECCEE Q ss_conf 87312334667733527 Q 537021.9.peg.2 14 NGYEKKGIKFLISKSRI 30 (38) Q Consensus 14 ngyekkgikflisksri 30 (38) .+||+.|+.|+||-||- T Consensus 191 k~l~~~g~~~lisfSRR 207 (329) T COG3660 191 KILENQGGSFLISFSRR 207 (329) T ss_pred HHHHHCCCEEEEEEECC T ss_conf 99874785189996068 No 5 >pfam08413 Metallophos_C Calcineurin-like phosphoesterase C-terminal. This domain is found to the C-terminus of Calcineurin-like phosphoesterase domains (pfam00149) in cAMP phosphodiesterases and the homologous Icc proteins. Probab=15.78 E-value=88 Score=15.66 Aligned_cols=16 Identities=44% Similarity=0.933 Sum_probs=11.4 Q ss_pred CCEEEEEECCEEEEEEEC Q ss_conf 334667733527898632 Q 537021.9.peg.2 19 KGIKFLISKSRIRCVQFG 36 (38) Q Consensus 19 kgikflisksrircvqfg 36 (38) .||.++-+-| .|+||- T Consensus 1 ~gv~~l~tPS--TC~QF~ 16 (38) T pfam08413 1 NGVRLLATPS--TCVQFK 16 (38) T ss_pred CCCEEEECCC--CEEEEC T ss_conf 9977984897--536545 No 6 >TIGR02485 CobZ_N-term precorrin 3B synthase CobZ; InterPro: IPR012831 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CobZ, which is essential for cobalamin biosynthesis and is complemented by precorrin 3B synthase CobG (1.3.99.1 from EC). The enzyme has been shown to contain flavin, haem and Fe-S cluster cofactors, and is believed to require dioxygen as a substrate. In Rhodobacter capsulatus, CobZ is found as a N-terminal domain, whereas in other species it exists as a separate protein. The C-terminal portion is homologous to the 2-component signal transduction system protein CitB (IPR012830 from INTERPRO).. Probab=11.27 E-value=95 Score=15.48 Aligned_cols=11 Identities=36% Similarity=0.715 Sum_probs=8.6 Q ss_pred CCEEEEEEECC Q ss_conf 35278986326 Q 537021.9.peg.2 27 KSRIRCVQFGI 37 (38) Q Consensus 27 ksrircvqfgi 37 (38) -|||.|+|.|| T Consensus 260 vtRid~~~lG~ 270 (467) T TIGR02485 260 VTRIDSVSLGV 270 (467) T ss_pred EEEECEEECCE T ss_conf 74412121131 No 7 >PRK02259 aspartoacylase; Provisional Probab=10.32 E-value=52 Score=16.73 Aligned_cols=25 Identities=36% Similarity=0.579 Sum_probs=0.0 Q ss_pred EECCCCCCCCCEEEEEECCEEEEEE Q ss_conf 6348873123346677335278986 Q 537021.9.peg.2 10 FHNENGYEKKGIKFLISKSRIRCVQ 34 (38) Q Consensus 10 fhnengyekkgikflisksrircvq 34 (38) |-||-.|-.|+|-|..++.+.-.+. T Consensus 236 FINEAAYyEK~iA~~lt~K~~i~~~ 260 (263) T PRK02259 236 FINEAAYYEKGIAMSLTKKEVINVK 260 (263) T ss_pred EECHHHHHHHHHHHHHHEEEEEECC T ss_conf 6238886652256543200796069 No 8 >pfam11305 DUF3107 Protein of unknown function (DUF3107). Some members in this family of proteins are annotated as ATP-binding proteins however this cannot be confirmed. Currently no function is known. Probab=10.07 E-value=2e+02 Score=13.96 Aligned_cols=20 Identities=25% Similarity=0.591 Sum_probs=0.0 Q ss_pred CCCCEEEEEECCEEEEEEEC Q ss_conf 12334667733527898632 Q 537021.9.peg.2 17 EKKGIKFLISKSRIRCVQFG 36 (38) Q Consensus 17 ekkgikflisksrircvqfg 36 (38) .+||-++++.-++|--|++| T Consensus 44 D~kGr~vlVpa~~iaYVEiG 63 (74) T pfam11305 44 DDKGRRVLVPAAALAYVEIG 63 (74) T ss_pred ECCCCEEEEECCCEEEEEEC T ss_conf 38998999985528999976 No 9 >KOG0546 consensus Probab=9.95 E-value=85 Score=15.71 Aligned_cols=21 Identities=43% Similarity=0.551 Sum_probs=0.0 Q ss_pred EEEEEECCCCCCCCCEEEEEE Q ss_conf 700763488731233466773 Q 537021.9.peg.2 6 YGGYFHNENGYEKKGIKFLIS 26 (38) Q Consensus 6 yggyfhnengyekkgikflis 26 (38) ||++|..||=--|.--.||+| T Consensus 92 YG~~FdDEnF~lKHdrpflLS 112 (372) T KOG0546 92 YGEKFDDENFELKHDRPFLLS 112 (372) T ss_pred CCCCCCCCCCEECCCCCHHHH T ss_conf 056304653021468601101 No 10 >TIGR01271 CFTR_protein cystic fibrosis transmembrane conductor regulator (CFTR); InterPro: IPR005291 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). These proteins are integral membrane proteins and they are involved in the transport of chloride ions. Many of these proteins are the cystis fibrosis transmembrane conductor regulators (CFTR) in eukaryotes. The principal role of this protein is chloride ion conductance. The protein is predicted to consist of 12 transmembrane domains. Mutations or lesions in the genetic loci have been linked to the aetiology of asthma, bronchiectasis, chronic obstructive pulmonary disease etc. Disease-causing mutations have been studied by 36Cl efflux assays in vitro cell cultures and electrophysiology, all of which point to the impairment of chloride channel stability and not the biosynthetic processing per se.; GO: 0005254 chloride channel activity, 0006811 ion transport, 0016020 membrane. Probab=9.95 E-value=97 Score=15.44 Aligned_cols=7 Identities=71% Similarity=2.124 Sum_probs=0.0 Q ss_pred EEEEEEE Q ss_conf 0577007 Q 537021.9.peg.2 3 CYFYGGY 9 (38) Q Consensus 3 cyfyggy 9 (38) |||||.+ T Consensus 625 cYFYG~F 631 (1534) T TIGR01271 625 CYFYGTF 631 (1534) T ss_pred EEEECCH T ss_conf 1242474 Done!