Query gi|254780966|ref|YP_003065379.1| hypothetical protein CLIBASIA_04335 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 53 No_of_seqs 1 out of 3 Neff 1.0 Searched_HMMs 39220 Date Mon May 30 01:51:57 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780966.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam09695 YtfJ_HI0045 Bacteria 20.7 30 0.00077 17.5 0.2 26 7-36 52-77 (174) 2 COG4386 Mu-like prophage tail 19.5 41 0.001 16.8 0.6 20 30-49 431-450 (487) 3 PRK00149 dnaA chromosomal repl 18.2 1E+02 0.0026 14.8 2.5 26 9-34 215-240 (447) 4 PRK12422 chromosomal replicati 16.4 1.1E+02 0.0029 14.6 2.5 26 9-34 209-234 (455) 5 TIGR01590 yir-bir-cir_Pla yir/ 13.6 73 0.0019 15.5 0.7 16 2-17 155-171 (223) 6 pfam06854 Phage_Gp15 Bacteriop 12.8 83 0.0021 15.2 0.8 31 4-42 124-155 (182) 7 pfam03889 DUF331 Domain of unk 12.3 91 0.0023 15.0 0.9 17 4-20 15-31 (39) 8 pfam05254 UPF0203 Uncharacteri 11.8 1.4E+02 0.0036 14.1 1.7 26 10-36 25-50 (68) 9 pfam03236 consensus 11.6 1.4E+02 0.0034 14.2 1.6 22 9-36 40-61 (101) 10 pfam01797 Transposase_17 Trans 10.7 84 0.0021 15.2 0.3 15 7-21 64-78 (121) No 1 >pfam09695 YtfJ_HI0045 Bacterial protein of unknown function (YtfJ_HI0045). These are sequences from gamma proteobacteria that are related to the E. coli protein, YtfJ. Probab=20.71 E-value=30 Score=17.53 Aligned_cols=26 Identities=42% Similarity=0.626 Sum_probs=21.2 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHCC Q ss_conf 532256550642046889999888997526 Q gi|254780966|r 7 KSRTMHYLKGKSSRKECFLAIFNPLIECLD 36 (53) Q Consensus 7 ksrtmhylkgkssrkecflaifnpliecld 36 (53) |-|.++|+-|.+|-||- .+||||.+. T Consensus 52 KVrviq~iAGr~sakel----N~~l~~Aik 77 (174) T pfam09695 52 KVRVIQHIAGRSSAKEL----NAPLIEAIK 77 (174) T ss_pred CEEEEEEECCCCCHHHH----HHHHHHHHH T ss_conf 68999983368866777----389999999 No 2 >COG4386 Mu-like prophage tail sheath protein gpL [General function prediction only] Probab=19.53 E-value=41 Score=16.84 Aligned_cols=20 Identities=35% Similarity=0.464 Sum_probs=16.1 Q ss_pred HHHHHCCHHHHHCCHHCCCC Q ss_conf 89975269998500305865 Q gi|254780966|r 30 PLIECLDFKEIVEDKENFPS 49 (53) Q Consensus 30 pliecldfkeivedkenfps 49 (53) .+.+||+|-+||||-|.|-. T Consensus 431 ~~~~~le~~givE~~e~f~~ 450 (487) T COG4386 431 DVLEELEELGIVENAETFKK 450 (487) T ss_pred HHHHHHHHHHHHHHHHHHCC T ss_conf 99999998627775876446 No 3 >PRK00149 dnaA chromosomal replication initiation protein; Reviewed Probab=18.19 E-value=1e+02 Score=14.80 Aligned_cols=26 Identities=35% Similarity=0.595 Sum_probs=22.3 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 22565506420468899998889975 Q gi|254780966|r 9 RTMHYLKGKSSRKECFLAIFNPLIEC 34 (53) Q Consensus 9 rtmhylkgkssrkecflaifnpliec 34 (53) -..|.+.||..-.|-|..+||-|++- T Consensus 215 DDiqfl~gk~~tqeeff~~fn~l~~~ 240 (447) T PRK00149 215 DDIQFLAGKEKTQEEFFHTFNALHEN 240 (447) T ss_pred CHHHHHHCCHHHHHHHHHHHHHHHHC T ss_conf 14888605577999999999999984 No 4 >PRK12422 chromosomal replication initiation protein; Provisional Probab=16.37 E-value=1.1e+02 Score=14.56 Aligned_cols=26 Identities=27% Similarity=0.450 Sum_probs=22.0 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 22565506420468899998889975 Q gi|254780966|r 9 RTMHYLKGKSSRKECFLAIFNPLIEC 34 (53) Q Consensus 9 rtmhylkgkssrkecflaifnpliec 34 (53) --.|.+.||.+-.|-|..+||-|.+- T Consensus 209 DDIQfl~gK~~tqeEff~tfN~L~~~ 234 (455) T PRK12422 209 EDIEVFSGKGATQEEFFHTFNSLHTE 234 (455) T ss_pred EHHHHHHCCHHHHHHHHHHHHHHHHC T ss_conf 14788728488999999999999985 No 5 >TIGR01590 yir-bir-cir_Pla yir/bir/cir-family of variant antigens; InterPro: IPR006477 This group of sequences identifies a large paralogous family of variant antigens from several Plasmodium species (P. yoelii, P. berghei and P. chabaudi). It is not believed that there are any orthologs of this family in P. falciparum.. Probab=13.57 E-value=73 Score=15.55 Aligned_cols=16 Identities=38% Similarity=0.451 Sum_probs=9.7 Q ss_pred HHHHHHHH-HHHHHHCC Q ss_conf 45676532-25655064 Q gi|254780966|r 2 FYDLFKSR-TMHYLKGK 17 (53) Q Consensus 2 fydlfksr-tmhylkgk 17 (53) |||+||+- -||---.+ T Consensus 155 FYd~FKlLC~m~~e~~~ 171 (223) T TIGR01590 155 FYDAFKLLCNMYTEINE 171 (223) T ss_pred HHHHHHHHHHHHHHHHC T ss_conf 78898888877302100 No 6 >pfam06854 Phage_Gp15 Bacteriophage Gp15 protein. This family consists of bacteriophage Gp15 proteins and related bacterial sequences. The function of this family is unknown Probab=12.75 E-value=83 Score=15.24 Aligned_cols=31 Identities=32% Similarity=0.619 Sum_probs=20.4 Q ss_pred HHHHHH-HHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHC Q ss_conf 676532-256550642046889999888997526999850 Q gi|254780966|r 4 DLFKSR-TMHYLKGKSSRKECFLAIFNPLIECLDFKEIVE 42 (53) Q Consensus 4 dlfksr-tmhylkgkssrkecflaifnpliecldfkeive 42 (53) ||++.+ -||+-| |.|.||-|-|---++.|.. T Consensus 124 DL~~~~g~LHWwk--------F~ALf~~L~edt~i~kii~ 155 (182) T pfam06854 124 DLLKEQGKLHWYE--------FKALLNGLSDDTKMKRIIS 155 (182) T ss_pred CHHHHHCCCCHHH--------HHHHHHCCCCCCHHHHHHH T ss_conf 8778615260999--------9999943898678999998 No 7 >pfam03889 DUF331 Domain of unknown function. Members of this family are uncharacterized proteins from a number of bacterial species. The proteins range in size from 50-70 residues. Probab=12.33 E-value=91 Score=15.04 Aligned_cols=17 Identities=41% Similarity=0.511 Sum_probs=14.4 Q ss_pred HHHHHHHHHHHHCCCCH Q ss_conf 67653225655064204 Q gi|254780966|r 4 DLFKSRTMHYLKGKSSR 20 (53) Q Consensus 4 dlfksrtmhylkgkssr 20 (53) .||++|.-.--|||.|- T Consensus 15 ~LFr~rvek~kKGKGSy 31 (39) T pfam03889 15 PLFRQRVEKAKKGKGSY 31 (39) T ss_pred HHHHHHHHHCCCCCCCH T ss_conf 78898887444677541 No 8 >pfam05254 UPF0203 Uncharacterized protein family (UPF0203). This family of proteins is functionally uncharacterized. Probab=11.84 E-value=1.4e+02 Score=14.08 Aligned_cols=26 Identities=31% Similarity=0.764 Sum_probs=20.4 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHCC Q ss_conf 256550642046889999888997526 Q gi|254780966|r 10 TMHYLKGKSSRKECFLAIFNPLIECLD 36 (53) Q Consensus 10 tmhylkgkssrkecflaifnpliecld 36 (53) +-+||||.++..+|- .+|.--.+|+. T Consensus 25 sEkFLKG~~~~~~C~-~~f~~Y~~Cv~ 50 (68) T pfam05254 25 SEKFLKGDSTENECT-ELFEKYQTCVQ 50 (68) T ss_pred HHHHHCCCCCCCCHH-HHHHHHHHHHH T ss_conf 987306886657179-99999999999 No 9 >pfam03236 consensus Probab=11.60 E-value=1.4e+02 Score=14.15 Aligned_cols=22 Identities=50% Similarity=0.909 Sum_probs=14.7 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHCC Q ss_conf 2256550642046889999888997526 Q gi|254780966|r 9 RTMHYLKGKSSRKECFLAIFNPLIECLD 36 (53) Q Consensus 9 rtmhylkgkssrkecflaifnpliecld 36 (53) |..-|+.| |+|||-|.+- +|-+ T Consensus 40 rr~Ay~sG----K~Cfl~i~~~--~C~~ 61 (101) T pfam03236 40 RREAYTSG----KECFLEIAKE--ECDE 61 (101) T ss_pred HHHHHHHH----HHHHHHHHHH--HCCH T ss_conf 99998511----7999999997--7799 No 10 >pfam01797 Transposase_17 Transposase IS200 like. Transposases are needed for efficient transposition of the insertion sequence or transposon DNA. This family includes transposases for IS200 from E. coli. Probab=10.75 E-value=84 Score=15.22 Aligned_cols=15 Identities=47% Similarity=0.685 Sum_probs=12.2 Q ss_pred HHHHHHHHHCCCCHH Q ss_conf 532256550642046 Q gi|254780966|r 7 KSRTMHYLKGKSSRK 21 (53) Q Consensus 7 ksrtmhylkgkssrk 21 (53) -|+-|..+||.|||. T Consensus 64 is~~v~~lKg~Ssr~ 78 (121) T pfam01797 64 ISKLVQRLKGRSSRR 78 (121) T ss_pred HHHHHHHHHHHHHHH T ss_conf 999999998899999 Done!