Query gi|254780408|ref|YP_003064821.1| hypothetical protein CLIBASIA_01465 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 75 No_of_seqs 115 out of 255 Neff 3.2 Searched_HMMs 39220 Date Sun May 29 15:45:35 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780408.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG5508 Uncharacterized conser 99.9 2.8E-23 7.2E-28 142.5 5.0 59 17-75 25-84 (84) 2 pfam07896 DUF1674 Protein of u 99.7 4.8E-19 1.2E-23 120.4 2.6 41 34-75 1-41 (41) 3 KOG3245 consensus 99.6 1.4E-16 3.7E-21 107.5 0.6 57 19-75 48-106 (106) 4 PRK11750 gltB glutamate syntha 67.4 7.2 0.00018 20.3 3.5 42 20-69 866-907 (1483) 5 TIGR00408 proS_fam_I prolyl-tR 29.5 15 0.00038 18.6 -0.0 13 61-73 359-371 (533) 6 TIGR01533 lipo_e_P4 5'-nucleot 24.8 23 0.00058 17.7 0.2 10 57-67 261-270 (295) 7 cd00862 ProRS_anticodon_zinc P 20.3 22 0.00057 17.7 -0.6 25 48-72 44-69 (202) 8 cd04822 PA_M28_1_3 PA_M28_1_3: 16.2 46 0.0012 16.1 0.3 14 56-69 68-81 (151) 9 COG1223 Predicted ATPase (AAA+ 15.3 41 0.0011 16.3 -0.2 12 59-70 142-153 (368) 10 pfam11239 DUF3040 Protein of u 14.8 1.2E+02 0.0029 14.0 3.0 41 23-63 2-42 (82) No 1 >COG5508 Uncharacterized conserved small protein [Function unknown] Probab=99.88 E-value=2.8e-23 Score=142.51 Aligned_cols=59 Identities=51% Similarity=0.861 Sum_probs=52.5 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHH-HHHHHCCCHHCCCCCCCCCCCCCCHHCCCEECCC Q ss_conf 5575445899999999999997412-2122106510188889986433610007865149 Q gi|254780408|r 17 IKTSHDPLSSIAKRALEEAKQRKSA-NNKDAKLPIEIGGRKGLDPTRFGDWEKNGISIDF 75 (75) Q Consensus 17 ~~~~~~~ls~aA~rAL~Ea~~rr~~-~~~~~~~pkE~gG~~G~ePtRyGDWE~kG~~~DF 75 (75) ...+..+|+||++++|+||+.|+.+ ..+...+|+||||++|+|||||||||+||+|+|| T Consensus 25 ~~~~~~~ltpAa~ralkea~arraa~ee~~~s~pkEiGGrgGlePtRyGDWe~KGr~iDF 84 (84) T COG5508 25 DARPSFDLTPAAQRALKEAEARRAASEEKNLSLPKEIGGRGGLEPTRYGDWEHKGRVIDF 84 (84) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHCCCCCCCCCCCCCCCCCCEECCC T ss_conf 543255788378999999999887776641365065578788886544643006634579 No 2 >pfam07896 DUF1674 Protein of unknown function (DUF1674). The members of this family are sequences derived from hypothetical eukaryotic and bacterial proteins. The region in question is approximately 60 residues long. Probab=99.74 E-value=4.8e-19 Score=120.41 Aligned_cols=41 Identities=54% Similarity=0.953 Sum_probs=35.0 Q ss_pred HHHHHHHHHHHHHCCCHHCCCCCCCCCCCCCCHHCCCEECCC Q ss_conf 999974122122106510188889986433610007865149 Q gi|254780408|r 34 EAKQRKSANNKDAKLPIEIGGRKGLDPTRFGDWEKNGISIDF 75 (75) Q Consensus 34 Ea~~rr~~~~~~~~~pkE~gG~~G~ePtRyGDWE~kG~~~DF 75 (75) ||++|+++.. ...+|+|+|||+|||||||||||+||||||| T Consensus 1 eae~r~~~~~-~~~~P~E~GGp~g~ePtRyGDWE~kG~~~DF 41 (41) T pfam07896 1 EAEERRPAAR-KAVNPGEIGGPKGPEPTRYGDWERKGRVTDF 41 (41) T ss_pred CHHHHHHHHH-CCCCCCCCCCCCCCCCCCCCCCCCCCEECCC T ss_conf 9788888754-3458844588999997544650107775669 No 3 >KOG3245 consensus Probab=99.59 E-value=1.4e-16 Score=107.49 Aligned_cols=57 Identities=35% Similarity=0.520 Sum_probs=40.2 Q ss_pred CCCCCCCHH-HHHHHH-HHHHHHHHHHHHHCCCHHCCCCCCCCCCCCCCHHCCCEECCC Q ss_conf 754458999-999999-999974122122106510188889986433610007865149 Q gi|254780408|r 19 TSHDPLSSI-AKRALE-EAKQRKSANNKDAKLPIEIGGRKGLDPTRFGDWEKNGISIDF 75 (75) Q Consensus 19 ~~~~~ls~a-A~rAL~-Ea~~rr~~~~~~~~~pkE~gG~~G~ePtRyGDWE~kG~~~DF 75 (75) .+...|++. |..... |.+..+......+....||||+.|||||||||||+||+|||| T Consensus 48 ~p~~~L~e~~a~~~~~~Ekepl~~~p~~vNp~TgEiGGprGPEPTRYGDWErkgr~~~F 106 (106) T KOG3245 48 LPLPMLPEGDAPHPSHLEKEPLKPWPNDVNPYTGEIGGPRGPEPTRYGDWERKGRCSDF 106 (106) T ss_pred CCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHCCCCCCC T ss_conf 57666664446684665205677886657855332378789997654506542522559 No 4 >PRK11750 gltB glutamate synthase subunit alpha; Provisional Probab=67.36 E-value=7.2 Score=20.30 Aligned_cols=42 Identities=33% Similarity=0.395 Sum_probs=32.4 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHCCCCCCCCCCCCCCHHCC Q ss_conf 54458999999999999974122122106510188889986433610007 Q gi|254780408|r 20 SHDPLSSIAKRALEEAKQRKSANNKDAKLPIEIGGRKGLDPTRFGDWEKN 69 (75) Q Consensus 20 ~~~~ls~aA~rAL~Ea~~rr~~~~~~~~~pkE~gG~~G~ePtRyGDWE~k 69 (75) +.-.+|+.|.++|+.|-.|--. +-..|-+|-+|.|||||..- T Consensus 866 S~GsiS~eAHetlA~AmNriG~--------~SnsGEGGEd~~R~g~~~~S 907 (1483) T PRK11750 866 SIGALSPEAHEALAIAMNRLGG--------RSNSGEGGEDPARYGTEKNS 907 (1483) T ss_pred CCCCCCHHHHHHHHHHHHHHCC--------CCCCCCCCCCHHHHCCCCCC T ss_conf 6666798999999999998488--------32588888788770898888 No 5 >TIGR00408 proS_fam_I prolyl-tRNA synthetase; InterPro: IPR004499 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Prolyl-tRNA synthetase is a class II tRNA synthetase and is recognized by IPR002314 from INTERPRO, which recognises tRNA synthetases for Gly, His, Ser, and Pro. The prolyl-tRNA synthetases are divided into two widely divergent families. This family includes the archaeal enzyme, the Pro-specific domain of a human multifunctional tRNA ligase, and the enzyme from the spirochete Borrelia burgdorferi (Lyme desease spirochete). The other family, IPR004500 from INTERPRO, includes enzymes from Escherichia coli, Bacillus subtilis, Synechocystis sp. (strain PCC6803), and one of the two prolyl-tRNA synthetases of Saccharomyces cerevisiae (Baker's yeast).; GO: 0000166 nucleotide binding, 0004827 proline-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006433 prolyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=29.47 E-value=15 Score=18.62 Aligned_cols=13 Identities=38% Similarity=1.050 Sum_probs=10.9 Q ss_pred CCCCCHHCCCEEC Q ss_conf 4336100078651 Q gi|254780408|r 61 TRFGDWEKNGISI 73 (75) Q Consensus 61 tRyGDWE~kG~~~ 73 (75) -+|.+||.|||=+ T Consensus 359 ~kf~~~E~kGiPl 371 (533) T TIGR00408 359 KKFSKWELKGIPL 371 (533) T ss_pred CHHHCCCEEECCE T ss_conf 1320453330426 No 6 >TIGR01533 lipo_e_P4 5'-nucleotidase, lipoprotein e(P4) family; InterPro: IPR006423 This group of sequences represents a set of bacterial lipoproteins belonging to a larger acid phosphatase family, which in turn belongs to the haloacid dehalogenase (HAD) superfamily of aspartate-dependent hydrolases. Members are found on the outer membrane of Gram-negative bacteria and the cytoplasmic membrane of Gram-positive bacteria. Most members have classic lipoprotein signal sequences. A critical role of this 5'-nucleotidase in Haemophilus influenzae is the degradation of external riboside in order to allow transport into the cell. An earlier suggested role in hemin transport is no longer current. This enzyme may also have other physiologically significant roles.. Probab=24.82 E-value=23 Score=17.69 Aligned_cols=10 Identities=40% Similarity=1.046 Sum_probs=6.8 Q ss_pred CCCCCCCCCHH Q ss_conf 99864336100 Q gi|254780408|r 57 GLDPTRFGDWE 67 (75) Q Consensus 57 G~ePtRyGDWE 67 (75) =|.| -||+|| T Consensus 261 LPNp-~YG~WE 270 (295) T TIGR01533 261 LPNP-MYGSWE 270 (295) T ss_pred CCCC-CCCCHH T ss_conf 6788-856514 No 7 >cd00862 ProRS_anticodon_zinc ProRS Prolyl-anticodon binding domain, long version found predominantly in eukaryotes and archaea. ProRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only, and an additional C-terminal zinc-binding domain specific to this subfamily of aaRSs. Probab=20.32 E-value=22 Score=17.73 Aligned_cols=25 Identities=28% Similarity=0.797 Sum_probs=17.3 Q ss_pred CCHHCCCCCCCCC-CCCCCHHCCCEE Q ss_conf 6510188889986-433610007865 Q gi|254780408|r 48 LPIEIGGRKGLDP-TRFGDWEKNGIS 72 (75) Q Consensus 48 ~pkE~gG~~G~eP-tRyGDWE~kG~~ 72 (75) ..-.+.-+...-| -+|-+||.||+- T Consensus 44 iRv~iD~rd~~~pG~K~~~wE~kGVP 69 (202) T cd00862 44 IRVHVDDRDNYTPGWKFNDWELKGVP 69 (202) T ss_pred CEEEEECCCCCCHHHHHHHHHHHCCC T ss_conf 88999778786888888999980897 No 8 >cd04822 PA_M28_1_3 PA_M28_1_3: Protease-associated (PA) domain, peptidase family M28, subfamily-1, subgroup 3. A subgroup of PA-domain containing proteins belonging to the peptidase family M28. Family M28 contains aminopeptidases and carboxypeptidases, and has co-catalytic zinc ions. The PA domain is an insert domain in a diverse fraction of proteases. The significance of the PA domain to many of the proteins in which it is inserted is undetermined. It may be a protein-protein interaction domain. At peptidase active sites, the PA domain may participate in substrate binding and/or promoting conformational changes, which influence the stability and accessibility of the site to substrate. Proteins into which the PA domain is inserted include the following members of the peptidase family M28: i) prostate-specific membrane antigen (PSMA), ii) yeast aminopeptidase Y, and ii) human TfR (transferrin receptor)1 and human TfR2. The proteins listed above belong to other subgroups; relatively litt Probab=16.23 E-value=46 Score=16.08 Aligned_cols=14 Identities=21% Similarity=0.216 Sum_probs=8.2 Q ss_pred CCCCCCCCCCHHCC Q ss_conf 89986433610007 Q gi|254780408|r 56 KGLDPTRFGDWEKN 69 (75) Q Consensus 56 ~G~ePtRyGDWE~k 69 (75) .|...||||.|--| T Consensus 68 ~G~~mt~ygrw~yK 81 (151) T cd04822 68 NGPGLTRHAGLRYK 81 (151) T ss_pred CCCCCCEECCHHHH T ss_conf 87654232118889 No 9 >COG1223 Predicted ATPase (AAA+ superfamily) [General function prediction only] Probab=15.34 E-value=41 Score=16.33 Aligned_cols=12 Identities=50% Similarity=1.318 Sum_probs=7.5 Q ss_pred CCCCCCCHHCCC Q ss_conf 864336100078 Q gi|254780408|r 59 DPTRFGDWEKNG 70 (75) Q Consensus 59 ePtRyGDWE~kG 70 (75) +|-|||||--|. T Consensus 142 nPe~Fg~WAPkn 153 (368) T COG1223 142 NPERFGDWAPKN 153 (368) T ss_pred CHHHHCCCCCCE T ss_conf 968763457541 No 10 >pfam11239 DUF3040 Protein of unknown function (DUF3040). Some members in this family of proteins with unknown function are annotated as membrane proteins however this cannot be confirmed. Probab=14.82 E-value=1.2e+02 Score=14.01 Aligned_cols=41 Identities=24% Similarity=0.315 Sum_probs=0.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHCCCHHCCCCCCCCCCCC Q ss_conf 58999999999999974122122106510188889986433 Q gi|254780408|r 23 PLSSIAKRALEEAKQRKSANNKDAKLPIEIGGRKGLDPTRF 63 (75) Q Consensus 23 ~ls~aA~rAL~Ea~~rr~~~~~~~~~pkE~gG~~G~ePtRy 63 (75) |||+-.||.|+|-|+.=.+....-...-.-+++..+-+.|. T Consensus 2 pLSe~Eqr~L~eiE~~L~~eDP~fa~~~~~~~~~~~~~rr~ 42 (82) T pfam11239 2 PLSEHEQRRLEEIERALYAEDPKFAAAVRSGRPRRPSRRRI 42 (82) T ss_pred CCCHHHHHHHHHHHHHHHHCCHHHHHHHCCCCCCCCCCHHH T ss_conf 99889999999999999843979999751578888761269 Done!