Query gi|254780981|ref|YP_003065394.1| hypothetical protein CLIBASIA_04410 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 122 No_of_seqs 1 out of 3 Neff 1.0 Searched_HMMs 39220 Date Mon May 30 02:50:59 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780981.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam09657 Cas_Csx8 CRISPR-asso 43.4 16 0.00041 17.9 2.0 69 30-102 67-144 (441) 2 pfam11239 DUF3040 Protein of u 42.3 14 0.00034 18.3 1.4 26 49-83 3-28 (82) 3 COG0321 LipB Lipoate-protein l 37.6 11 0.00029 18.7 0.4 16 14-29 170-185 (221) 4 PRK00008 lipB lipoyltransferas 31.1 17 0.00042 17.8 0.4 14 15-28 163-176 (212) 5 TIGR02670 cas_csx8 CRISPR-asso 25.0 34 0.00087 16.1 1.1 37 55-94 76-112 (493) 6 smart00017 OSTEO Osteopontin. 21.8 26 0.00066 16.8 -0.0 31 53-83 234-267 (287) 7 pfam09825 BPL_N Biotin-protein 20.5 30 0.00076 16.4 0.1 30 77-107 330-359 (364) 8 TIGR00777 ahpD alkylhydroperox 19.4 36 0.00092 16.0 0.3 13 11-23 122-134 (178) 9 PRK10954 periplasmic protein d 17.9 81 0.0021 14.1 1.9 21 1-21 1-21 (207) 10 TIGR01478 STEVOR variant surfa 15.7 34 0.00088 16.1 -0.5 10 13-22 269-278 (315) No 1 >pfam09657 Cas_Csx8 CRISPR-associated protein Csx8 (Cas_Csx8). Clusters of short DNA repeats with nonhomologous spacers, which are found at regular intervals in the genomes of phylogenetically distinct prokaryotic species, comprise a family with recognisable features. This family is known as CRISPR (short for Clustered, Regularly Interspaced Short Palindromic Repeats). A number of protein families appear only in association with these repeats and are designated Cas (CRISPR-Associated) proteins. This entry describes proteins of unknown function which are encoded in the midst of a cas gene operon. Probab=43.39 E-value=16 Score=17.87 Aligned_cols=69 Identities=29% Similarity=0.398 Sum_probs=32.9 Q ss_pred CCCCHHHHHHHCCHHHHHHHHHHHHHHHHHCCC----HHHHHHHH----H-HHHCCCHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 777113454310102477887899988651018----67786566----5-43001733333433576665433210246 Q gi|254780981|r 30 SPLPHIALESLSAENEKKELSEHEKKVIESQEN----PKKQFSEH----E-KKETDDPKSARKENIVMKKTFSQKSKKYT 100 (122) Q Consensus 30 splphialeslsaenekkelsehekkviesqen----pkkqfseh----e-kketddpksarkenivmkktfsqkskkyt 100 (122) ..|||.++|.+-. .+|+|+.+.|.|...-+ -||-|+.- + +.|.-+--. .-.....+.||..|.--|. T Consensus 67 d~l~Hk~lEn~L~---~~efs~E~ik~I~ekm~aNtilKKvF~KikfdgeNkeEiL~lI~-eNR~~iIkETFRnkk~lY~ 142 (441) T pfam09657 67 DTLHHKKVENILK---HKEFTEEIIKSINEKLSANTVMKKVFGKIKFDGTNKDEILDLID-KNRHSLIKETFRRKKDLYD 142 (441) T ss_pred CCCHHHHHHHHHC---CCCCCHHHHHHHHHHHHHCHHHHHHHHHCCCCCCCHHHHHHHHH-HHHHHHHHHHHHCCHHHHH T ss_conf 2015799998853---64258999999998730146999997404667889999999998-6289999999850114777 Q ss_pred HH Q ss_conf 14 Q gi|254780981|r 101 PY 102 (122) Q Consensus 101 py 102 (122) -| T Consensus 143 Ny 144 (441) T pfam09657 143 NY 144 (441) T ss_pred HC T ss_conf 40 No 2 >pfam11239 DUF3040 Protein of unknown function (DUF3040). Some members in this family of proteins with unknown function are annotated as membrane proteins however this cannot be confirmed. Probab=42.34 E-value=14 Score=18.30 Aligned_cols=26 Identities=42% Similarity=0.547 Sum_probs=15.9 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHCCCHHHHHH Q ss_conf 87899988651018677865665430017333334 Q gi|254780981|r 49 LSEHEKKVIESQENPKKQFSEHEKKETDDPKSARK 83 (122) Q Consensus 49 lsehekkviesqenpkkqfsehekketddpksark 83 (122) |||||+++.+--|. + -..|||+-|+. T Consensus 3 LSe~Eqr~L~eiE~---~------L~~eDP~fa~~ 28 (82) T pfam11239 3 LSEHEQRRLEEIER---A------LYAEDPKFAAA 28 (82) T ss_pred CCHHHHHHHHHHHH---H------HHHCCHHHHHH T ss_conf 98899999999999---9------98439799997 No 3 >COG0321 LipB Lipoate-protein ligase B [Coenzyme metabolism] Probab=37.59 E-value=11 Score=18.73 Aligned_cols=16 Identities=31% Similarity=0.532 Sum_probs=11.6 Q ss_pred HCCCCCCCCCCCCCCC Q ss_conf 4056834455001333 Q gi|254780981|r 14 SSAINPCGIEEDNLKS 29 (122) Q Consensus 14 ssainpcgieednlks 29 (122) -+.|||||+..-..-| T Consensus 170 F~~I~PCG~~~~~~ts 185 (221) T COG0321 170 FNRIVPCGYAGMEVTS 185 (221) T ss_pred CCCEECCCCCCCCEEE T ss_conf 0624036668972547 No 4 >PRK00008 lipB lipoyltransferase; Reviewed Probab=31.13 E-value=17 Score=17.82 Aligned_cols=14 Identities=50% Similarity=0.871 Sum_probs=10.4 Q ss_pred CCCCCCCCCCCCCC Q ss_conf 05683445500133 Q gi|254780981|r 15 SAINPCGIEEDNLK 28 (122) Q Consensus 15 sainpcgieednlk 28 (122) +.|+|||+...... T Consensus 163 ~~I~pCG~~~~~vT 176 (212) T PRK00008 163 SRINPCGIAGLGVT 176 (212) T ss_pred CCEECCCCCCCCCC T ss_conf 66874788998724 No 5 >TIGR02670 cas_csx8 CRISPR-associated protein Csx8 (provisional); InterPro: IPR013487 Clusters of short DNA repeats with nonhomologous spacers, which are found at regular intervals in the genomes of phylogenetically distinct prokaryotic species, comprise a family with recognizable features . This family is known as CRISPR (short for Clustered, Regularly Interspaced Short Palindromic Repeats). A number of protein families appear only in association with these repeats and are designated Cas (CRISPR-Associated) proteins. This entry describes proteins of unknown function which are encoded in the midst of a cas gene operon. . Probab=25.02 E-value=34 Score=16.12 Aligned_cols=37 Identities=32% Similarity=0.451 Sum_probs=19.3 Q ss_pred HHHHHCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH Q ss_conf 8865101867786566543001733333433576665433 Q gi|254780981|r 55 KVIESQENPKKQFSEHEKKETDDPKSARKENIVMKKTFSQ 94 (122) Q Consensus 55 kviesqenpkkqfsehekketddpksarkenivmkktfsq 94 (122) |++|-|-.-.+.|+|.+-+--.|--+ .|-|||++|+. T Consensus 76 k~vENiL~~t~E~tee~ik~I~ekl~---ANt~mKk~FgK 112 (493) T TIGR02670 76 KVVENILKKTEELTEEEIKLINEKLK---ANTVMKKVFGK 112 (493) T ss_pred HHHHHHHCCCCCCCHHHHHHHHHHHH---HHHHHHHHCCC T ss_conf 88875521554345577887752346---43687866265 No 6 >smart00017 OSTEO Osteopontin. Osteopontin is an acidic phosphorylated glycoprotein of about 40 Kd which is abundant in the mineral matrix of bones and which binds tightly to hydroxyapatite [1,2,3]. It is suggested that osteopontin might function as a cell attachment factor and could play a key role in the adhesion of osteoclasts to the mineral matrix of bone Probab=21.78 E-value=26 Score=16.78 Aligned_cols=31 Identities=45% Similarity=0.605 Sum_probs=24.9 Q ss_pred HHHHHHHCCCHHH---HHHHHHHHHCCCHHHHHH Q ss_conf 9988651018677---865665430017333334 Q gi|254780981|r 53 EKKVIESQENPKK---QFSEHEKKETDDPKSARK 83 (122) Q Consensus 53 ekkviesqenpkk---qfsehekketddpksark 83 (122) .-.+|+||||.|- .|.-||.+-.-||||-.. T Consensus 234 ~s~~iDSQesskvSsqef~S~e~~l~~d~kS~Ed 267 (287) T smart00017 234 HSDVIDSQESSKVSSREFHSHEDKLDLDPKSEED 267 (287) T ss_pred CCCCCCCHHHCCHHHHHHHHCCCCCCCCCCCCCC T ss_conf 0321000121100045544103444678522011 No 7 >pfam09825 BPL_N Biotin-protein ligase, N terminal. The function of this structural domain is unknown. It is found to the N terminus of the biotin protein ligase catalytic domain. Probab=20.48 E-value=30 Score=16.43 Aligned_cols=30 Identities=37% Similarity=0.566 Sum_probs=17.1 Q ss_pred CHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 7333334335766654332102461455442 Q gi|254780981|r 77 DPKSARKENIVMKKTFSQKSKKYTPYFDHYM 107 (122) Q Consensus 77 dpksarkenivmkktfsqkskkytpyfdhym 107 (122) |+.++-|.-|+- ..=..-+.|.||||||-. T Consensus 330 d~~~~~K~Iv~~-~~~~~P~~k~TP~Fn~~~ 359 (364) T pfam09825 330 DPEAVPKTILFP-HDEDVPSSKLTPYFDHEL 359 (364) T ss_pred CHHHCCCEEEEC-CCCCCCCCCCCCCCCHHH T ss_conf 864678179947-988889976788776899 No 8 >TIGR00777 ahpD alkylhydroperoxidase, AhpD family; InterPro: IPR004674 Members of this family are alkylhydroperoxidases, which catalyze the reduction of peroxides to their corresponding alcohols via oxidation of cysteine residues. In these alkylhydroperoxidases, the cysteines are located in a conserved -CXXC- motif located towards the COOH terminus. In Mycobacterium tuberculosis, two non-homologous alkylhydroperoxidases, AhpD and AhpC, are found in the same operon.; GO: 0004601 peroxidase activity. Probab=19.39 E-value=36 Score=15.99 Aligned_cols=13 Identities=54% Similarity=0.746 Sum_probs=9.9 Q ss_pred HHHHCCCCCCCCC Q ss_conf 9974056834455 Q gi|254780981|r 11 LFVSSAINPCGIE 23 (122) Q Consensus 11 lfvssainpcgie 23 (122) -|--||||-||.- T Consensus 122 sfAVSAINGCg~C 134 (178) T TIGR00777 122 SFAVSAINGCGAC 134 (178) T ss_pred HHHHHHHCCCCCH T ss_conf 3543222001300 No 9 >PRK10954 periplasmic protein disulfide isomerase I; Provisional Probab=17.89 E-value=81 Score=14.07 Aligned_cols=21 Identities=19% Similarity=0.265 Sum_probs=16.4 Q ss_pred CCHHHHHHHHHHHHCCCCCCC Q ss_conf 906899999999740568344 Q gi|254780981|r 1 MKKYFTILTMLFVSSAINPCG 21 (122) Q Consensus 1 mkkyftiltmlfvssainpcg 21 (122) |||.|.++.+|+.|-+.+... T Consensus 1 mkk~~~~~~~llls~~a~Aa~ 21 (207) T PRK10954 1 MKKIWLALAGMVLAFSASAAQ 21 (207) T ss_pred CCHHHHHHHHHHHHHHHHHHH T ss_conf 915999999999997722503 No 10 >TIGR01478 STEVOR variant surface antigen, stevor family; InterPro: IPR006374 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see . This entry represents the stevor (short for "subtelomeric variable open reading frame") family of predicted variant surface antigens. This is the third largest variant surface antigen family in P. falciparum, with 33 genes identified in the genome of this parasite . The function of these proteins is not known, but they are expressed during several stages of the parasite life-cycle and may play a variety of roles in parasite survival and infection .. Probab=15.75 E-value=34 Score=16.09 Aligned_cols=10 Identities=40% Similarity=0.670 Sum_probs=0.0 Q ss_pred HHCCCCCCCC Q ss_conf 7405683445 Q gi|254780981|r 13 VSSAINPCGI 22 (122) Q Consensus 13 vssainpcgi 22 (122) ++|.++|||| T Consensus 269 ~~Stf~PyGI 278 (315) T TIGR01478 269 ATSTFLPYGI 278 (315) T ss_pred HHHHHHHHHH T ss_conf 8878613589 Done!