Query gi|255764516|ref|YP_003084344.1| hypothetical protein CLIBASIA_05532 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 341 No_of_seqs 2 out of 4 Neff 1.1 Searched_HMMs 39220 Date Mon May 30 06:30:16 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 255764516.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01188 drrA daunorubicin re 51.3 16 0.0004 17.3 2.9 39 136-174 156-195 (343) 2 PTZ00054 60S ribosomal protein 44.6 23 0.00059 16.2 2.9 44 208-256 89-132 (139) 3 KOG4336 consensus 31.0 27 0.00068 15.8 1.5 46 280-325 255-304 (323) 4 COG0743 Dxr 1-deoxy-D-xylulose 30.7 6.9 0.00018 19.5 -1.6 16 220-235 229-244 (385) 5 PRK01269 thiamine biosynthesis 30.4 11 0.00028 18.3 -0.6 50 236-285 243-293 (483) 6 COG0093 RplN Ribosomal protein 29.4 39 0.00099 14.8 2.1 42 208-254 71-113 (122) 7 COG0301 ThiI Thiamine biosynth 28.6 11 0.00029 18.2 -0.8 49 236-284 242-291 (383) 8 TIGR02308 RNA_lig_T4_1 RNA lig 26.8 16 0.00041 17.2 -0.3 67 10-76 212-285 (399) 9 cd04451 S1_IF1 S1_IF1: Transla 26.8 37 0.00095 14.9 1.6 45 41-85 4-48 (64) 10 PRK12442 translation initiatio 26.5 40 0.001 14.7 1.7 45 41-85 10-54 (87) 11 pfam11311 DUF3114 Protein of u 25.7 33 0.00084 15.2 1.2 37 111-147 192-229 (252) 12 PHA00202 DNA replication initi 25.3 46 0.0012 14.3 1.9 83 95-185 50-132 (379) 13 pfam09158 MotCF Bacteriophage 25.0 24 0.0006 16.2 0.3 33 70-102 48-85 (103) 14 PRK00276 infA translation init 24.6 43 0.0011 14.5 1.6 45 41-85 10-54 (72) 15 TIGR03354 VI_FHA type VI secre 24.3 37 0.00095 14.9 1.2 70 17-95 28-103 (396) 16 pfam06517 Orthopox_A43R Orthop 23.7 54 0.0014 13.9 1.9 33 151-183 130-162 (204) 17 TIGR00243 Dxr 1-deoxy-D-xylulo 22.5 15 0.00039 17.4 -1.0 44 13-56 237-281 (406) 18 CHL00010 infA translation init 21.1 58 0.0015 13.7 1.7 45 41-85 10-54 (78) 19 TIGR00538 hemN oxygen-independ 20.5 45 0.0011 14.4 1.0 48 86-146 172-222 (462) 20 PRK10162 acetyl esterase; Prov 20.4 63 0.0016 13.5 3.9 97 135-249 9-111 (319) 21 TIGR03673 rpl14p_arch 50S ribo 20.1 64 0.0016 13.4 2.5 65 187-256 48-124 (131) No 1 >TIGR01188 drrA daunorubicin resistance ABC transporter, ATP-binding protein; InterPro: IPR005894 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family contains the daunorubicin resistance ABC transporter, ATP binding subunit in bacteria and archaea. This model is restricted in its scope to preferentially recognize the ATP binding subunit associated with effux of the drug, daunorubicin. In other words it functions as an ATP dependent antiporter. . Probab=51.34 E-value=16 Score=17.31 Aligned_cols=39 Identities=26% Similarity=0.457 Sum_probs=28.9 Q ss_pred HCCCCHHHHHHHHHHHHHCCCC-EEEEEECCCEECCCCCC Q ss_conf 1288767889999987421564-47999624220244465 Q gi|255764516|r 136 VEGLDPQTRKYLQDIQTQLTSD-TLTLQLDDTRVDDTRGY 174 (341) Q Consensus 136 vegldpqtrkylqdiqtqltsd-tltlqlddtrvddtrgy 174 (341) --|||||||.-+-|+-..|-+. -.|.-|--.-.++.--+ T Consensus 156 T~GLDP~tR~~iWd~i~~lk~~~g~TilLTThYmeEAd~L 195 (343) T TIGR01188 156 TTGLDPRTRRAIWDYIEALKKEEGVTILLTTHYMEEADKL 195 (343) T ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHH T ss_conf 7688886699999999998740796999743786999823 No 2 >PTZ00054 60S ribosomal protein L23; Provisional Probab=44.64 E-value=23 Score=16.19 Aligned_cols=44 Identities=23% Similarity=0.332 Sum_probs=35.3 Q ss_pred CCCCCCCCEEEEECCCEEEEEEEEEEECCCCHHHHHHHHHHCHHHHHHH Q ss_conf 6667500013443154058801578712640378899875067777652 Q gi|255764516|r 208 NKSGSLENRIKFYDDKDVYINGQCFVKGTDTSIFDEIARQLTPRFLGLL 256 (341) Q Consensus 208 nksgslenrikfyddkdvyingqcfvkgtdtsifdeiarqltprflgll 256 (341) -+.|| +|+|-|.--|.||.+.-.+|| .||-.+||.+.-+|.-.. T Consensus 89 R~DGs---~I~FddNA~Viin~~g~pkGT--rIfGPVarE~~e~~~kI~ 132 (139) T PTZ00054 89 RREGY---FIYFEDNAGVIVNPKGEMKGS--AITGPVAKECAELWPKIS 132 (139) T ss_pred CCCCC---EEEECCCEEEEECCCCCEEEE--EEECCHHHHHHHHHHHHH T ss_conf 18999---999778769999999988141--683714599985501876 No 3 >KOG4336 consensus Probab=30.96 E-value=27 Score=15.82 Aligned_cols=46 Identities=22% Similarity=0.125 Sum_probs=31.6 Q ss_pred CEEEEEEECCCCCCEEEE----ECCCCEEEEEEECCCCCEEEEEECCCCC Q ss_conf 625777504777761454----0344201455311121023453113673 Q gi|255764516|r 280 DKIAYWAYPSENSSGYIS----ASATQEHTMAVSAENARKRWRIMGKTDS 325 (341) Q Consensus 280 dkiaywaypsenssgyis----asatqehtmavsaenarkrwrimgktds 325 (341) ++-.--.++|+++|+-|- -|-|...|..-.+++.+|.|-+|.+.+. T Consensus 255 ~~pe~~~~~~~~ts~~iee~~rks~~~~~~~~~~~~~l~~~~~~~~~~~~ 304 (323) T KOG4336 255 EKPESRHQSSRTTSKTIEEQDRKSMTRAMTCLPGVQKLKKPKIRMAKEER 304 (323) T ss_pred HCCCCCCCCCCCCHHHHHHCCCCHHHHCCCCCCCCCCCCCCCCCHHHHHH T ss_conf 17654344566410346743331223203468411258999863447776 No 4 >COG0743 Dxr 1-deoxy-D-xylulose 5-phosphate reductoisomerase [Lipid metabolism] Probab=30.67 E-value=6.9 Score=19.54 Aligned_cols=16 Identities=25% Similarity=0.538 Sum_probs=9.2 Q ss_pred ECCCEEEEEEEEEEEC Q ss_conf 3154058801578712 Q gi|255764516|r 220 YDDKDVYINGQCFVKG 235 (341) Q Consensus 220 yddkdvyingqcfvkg 235 (341) |+.-+|-|.-|..|.+ T Consensus 229 ~~~IeVvIHPQSiIHs 244 (385) T COG0743 229 YEQIEVVIHPQSIIHS 244 (385) T ss_pred HHHEEEEECCCCHHEE T ss_conf 8990689743210102 No 5 >PRK01269 thiamine biosynthesis protein ThiI; Provisional Probab=30.43 E-value=11 Score=18.27 Aligned_cols=50 Identities=22% Similarity=0.395 Sum_probs=41.4 Q ss_pred CCCHHHHHHHHHHCHHHHHHHCCCEEEECCC-CCCCCCCCCCEECCEEEEE Q ss_conf 6403788998750677776522420010133-2000111231005625777 Q gi|255764516|r 236 TDTSIFDEIARQLTPRFLGLLQGRTMVRSAN-LREKASIGDIITGDKIAYW 285 (341) Q Consensus 236 tdtsifdeiarqltprflgllqgrtmvrsan-lrekasigdiitgdkiayw 285 (341) .-+.+..||.....+++.+.+.-|-|.|.|+ +.++..+.-|+||+-+.-- T Consensus 243 ~f~~v~~eI~~~v~~~~~~vv~KR~M~R~A~~iA~~~g~~ALVTGESLGQV 293 (483) T PRK01269 243 DFEPVVGEILEKVDNGQMGVVLKRMMLRAASKVAERYGIQALVTGEALGQV 293 (483) T ss_pred CCHHHHHHHHHHCCHHHEEHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHH T ss_conf 679999999873876344399999999999999998599889855503456 No 6 >COG0093 RplN Ribosomal protein L14 [Translation, ribosomal structure and biogenesis] Probab=29.42 E-value=39 Score=14.78 Aligned_cols=42 Identities=43% Similarity=0.617 Sum_probs=32.5 Q ss_pred CCCCCCCCEEEEECCCEEEEEEEEEEECCCCHHHHHHHHHHCHH-HHH Q ss_conf 66675000134431540588015787126403788998750677-776 Q gi|255764516|r 208 NKSGSLENRIKFYDDKDVYINGQCFVKGTDTSIFDEIARQLTPR-FLG 254 (341) Q Consensus 208 nksgslenrikfyddkdvyingqcfvkgtdtsifdeiarqltpr-flg 254 (341) -+.|| .|+|-|.--|.||-.--.+|| -||..+||.|.-+ |.. T Consensus 71 R~DGs---~i~FddNA~Viin~~g~PrGt--rI~GPVaRElr~~~~~k 113 (122) T COG0093 71 RPDGS---YIKFDDNAAVIINPDGEPRGT--RIFGPVARELRERGFMK 113 (122) T ss_pred CCCCC---EEEECCCEEEEECCCCCCCCC--EEECCHHHHHHHCCCCE T ss_conf 69988---898678559998999985533--58564238787557843 No 7 >COG0301 ThiI Thiamine biosynthesis ATP pyrophosphatase [Coenzyme metabolism] Probab=28.58 E-value=11 Score=18.21 Aligned_cols=49 Identities=27% Similarity=0.363 Sum_probs=42.9 Q ss_pred CCCHHHHHHHHHHCHHHHHHHCCCEEEECCC-CCCCCCCCCCEECCEEEE Q ss_conf 6403788998750677776522420010133-200011123100562577 Q gi|255764516|r 236 TDTSIFDEIARQLTPRFLGLLQGRTMVRSAN-LREKASIGDIITGDKIAY 284 (341) Q Consensus 236 tdtsifdeiarqltprflgllqgrtmvrsan-lrekasigdiitgdkiay 284 (341) .-|.+..+|.+...++++.++--|.|.|-|. +.|+...--|+||+-+.- T Consensus 242 ~f~~v~~~i~~~~~~~y~~v~~rR~M~riA~~iae~~g~~aIvtGEsLGQ 291 (383) T COG0301 242 PFTEVQEEILEKVPESYRCVLLKRMMYRIAEKLAEEFGAKAIVTGESLGQ 291 (383) T ss_pred CHHHHHHHHHHHCCCCCEEHHHHHHHHHHHHHHHHHHCCEEEEECCCCHH T ss_conf 62999999986367530204789999999999999839839983686104 No 8 >TIGR02308 RNA_lig_T4_1 RNA ligase, T4 RnlA family; InterPro: IPR012648 Members of this family are phage proteins with ATP-dependent RNA ligase activity. Host defence to phage may include cleavage and inactivation of specific tRNA molecules; members of this family act to reverse this RNA damage. The enzyme is adenylated, transiently, on a Lys residue in a motif KXDGSL.. Probab=26.79 E-value=16 Score=17.21 Aligned_cols=67 Identities=28% Similarity=0.399 Sum_probs=54.3 Q ss_pred CCEEEEE--EEEEEEEEEECCCH--HHEEEECCCCCC---CEEEECCCCCCCCCCCCCCEEEECCCCCCCCCCE Q ss_conf 4257998--60377876631880--131234247888---3100040123222433310021032233465515 Q gi|255764516|r 10 KDVSYRL--FGSYFVIPWTVKDP--SRIHAEVKYPDG---NMEELSPERDFKVDVDESSLILSSKRWINNNNAL 76 (341) Q Consensus 10 K~VsY~~--~GS~FVIPW~vKDP--SRIHAEV~Y~DG---~~eELs~~~DFkvDVD~~~~iLsskrwiNN~~~l 76 (341) ||-+.|- +|.|.|=-|.|.-| +.+-+||+-..| -|--..=-|-||+--|=-.+|-..|-=+||++.| T Consensus 212 ~Da~lr~rmvGvY~~~~~e~~s~~f~~w~~ev~~~~~IEG~va~m~DGq~FK~KsdWY~alH~tK~s~~n~~~L 285 (399) T TIGR02308 212 KDATLRPRMVGVYLVEAYEVDSPRFKEWIDEVRAAEGIEGVVAVMKDGQMFKLKSDWYVALHRTKDSINNPEKL 285 (399) T ss_pred CCCCCCCCEEEEEECHHHCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEEHHHHHHHCCCCCCCCCHHHH T ss_conf 46010220024321431147876689999998862087159998217733102201111110202311101788 No 9 >cd04451 S1_IF1 S1_IF1: Translation Initiation Factor IF1, S1-like RNA-binding domain. IF1 contains an S1-like RNA-binding domain, which is found in a wide variety of RNA-associated proteins. Translation initiation includes a number of interrelated steps preceding the formation of the first peptide bond. In Escherichia coli, the initiation mechanism requires, in addition to mRNA, fMet-tRNA, and ribosomal subunits, the presence of three additional proteins (initiation factors IF1, IF2, and IF3) and at least one GTP molecule. The three initiation factors influence both the kinetics and the stability of ternary complex formation. IF1 is the smallest of the three factors. IF1 enhances the rate of 70S ribosome subunit association and dissociation and the interaction of 30S ribosomal subunit with IF2 and IF3. It stimulates 30S complex formation. In addition, by binding to the A-site of the 30S ribosomal subunit, IF1 may contribute to the fidelity of the selection of the initiation site of th Probab=26.75 E-value=37 Score=14.91 Aligned_cols=45 Identities=22% Similarity=0.395 Sum_probs=29.0 Q ss_pred CCCEEEECCCCCCCCCCCCCCEEEECCCCCCCCCCEEEEECCHHH Q ss_conf 883100040123222433310021032233465515787267012 Q gi|255764516|r 41 DGNMEELSPERDFKVDVDESSLILSSKRWINNNNALRIFEGEKQT 85 (341) Q Consensus 41 DG~~eELs~~~DFkvDVD~~~~iLsskrwiNN~~~lRIFEGEKQT 85 (341) +|-+.|.-|..-|+|..|.-..++..--=--..+-|||..|.+-. T Consensus 4 ~G~V~e~lpn~~F~V~Leng~~v~a~~sGKmr~~~Iril~GD~V~ 48 (64) T cd04451 4 EGVVTEALPNAMFRVELENGHEVLAHISGKMRMNYIRILPGDRVK 48 (64) T ss_pred EEEEEEECCCCEEEEEECCCCEEEEEECCHHEEEEEEECCCCEEE T ss_conf 899999879988999978998999997233041269845899899 No 10 >PRK12442 translation initiation factor IF-1; Reviewed Probab=26.50 E-value=40 Score=14.72 Aligned_cols=45 Identities=20% Similarity=0.416 Sum_probs=25.1 Q ss_pred CCCEEEECCCCCCCCCCCCCCEEEECCCCCCCCCCEEEEECCHHH Q ss_conf 883100040123222433310021032233465515787267012 Q gi|255764516|r 41 DGNMEELSPERDFKVDVDESSLILSSKRWINNNNALRIFEGEKQT 85 (341) Q Consensus 41 DG~~eELs~~~DFkvDVD~~~~iLsskrwiNN~~~lRIFEGEKQT 85 (341) +|-+.|+-|..-|+|..|.-..++..--=---.+-|||..|.+-+ T Consensus 10 eG~V~e~lpn~~FrV~LeNg~~Vla~~sGKmR~~~IrIl~GD~V~ 54 (87) T PRK12442 10 DGIVDEVLPDSRYRVTLENGVEVGAYASGRMRKHRIRILAGDRVT 54 (87) T ss_pred EEEEEEECCCCEEEEEECCCCEEEEEEECCEEEEEEEECCCCEEE T ss_conf 789999879987999977998999998550133027776899899 No 11 >pfam11311 DUF3114 Protein of unknown function (DUF3114). Some members in this family of proteins with unknown function are annotated as cytosolic proteins. This cannot be confirmed. Probab=25.65 E-value=33 Score=15.24 Aligned_cols=37 Identities=24% Similarity=0.342 Sum_probs=17.8 Q ss_pred HHHHHHHHCCHHHHH-HHHHHCCHHHHCCCCHHHHHHH Q ss_conf 323331100115653-0555313243128876788999 Q gi|255764516|r 111 DGIVNDLAIQTEDVG-RKLEQIDLSKVEGLDPQTRKYL 147 (341) Q Consensus 111 dgivndlaiqtedvg-rkleqidlskvegldpqtrkyl 147 (341) .||||.-...-..-+ +.-.+.|..-+.-.||+-|+-. T Consensus 192 nGivNGaSFNYan~n~~rH~~LDv~Pi~~~DP~FR~~~ 229 (252) T pfam11311 192 NGIVNGASFNYANRNNPRHWLLDVDPPKLFDPQFRRRA 229 (252) T ss_pred CCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCHHHHHHH T ss_conf 56703654444556874187746687320387899999 No 12 >PHA00202 DNA replication initiation protein Probab=25.29 E-value=46 Score=14.30 Aligned_cols=83 Identities=20% Similarity=0.262 Sum_probs=65.3 Q ss_pred EHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCHHHHCCCCHHHHHHHHHHHHHCCCCEEEEEECCCEECCCCCC Q ss_conf 10002333564432253233311001156530555313243128876788999998742156447999624220244465 Q gi|255764516|r 95 KKVNQVNVLTQKMNTIDGIVNDLAIQTEDVGRKLEQIDLSKVEGLDPQTRKYLQDIQTQLTSDTLTLQLDDTRVDDTRGY 174 (341) Q Consensus 95 kkvnqvnvltqkmntidgivndlaiqtedvgrkleqidlskvegldpqtrkylqdiqtqltsdtltlqlddtrvddtrgy 174 (341) ..++|-+...+|.-+.+-+-+.++-..|+.-+.+|+.+ -+-+.||+|--.-.++..|-|.|...|--.-.|| T Consensus 50 ~~~~~~~~r~~kfp~~p~~~~~~~~~~eeie~~~e~~~--------~~~~~~l~~~lkvF~~~VLGl~ls~~R~kG~~~Y 121 (379) T PHA00202 50 SGLLQPHCRAKKFPAPPVFNPTVAQTSDEIDAYNKAFD--------ICYRNYLEDCLRIFTNQVLGLSLSPMRGRGLHGY 121 (379) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC T ss_conf 55567444555677677545423489999999999999--------9999999999999999973655468456775354 Q ss_pred CCEEEEECCCC Q ss_conf 43045523457 Q gi|255764516|r 175 DSSIRFKDKDG 185 (341) Q Consensus 175 dssirfkdkdg 185 (341) ..|...-+.+| T Consensus 122 ~~S~~L~s~~G 132 (379) T PHA00202 122 NDSMVITSMTG 132 (379) T ss_pred CCCEEEECCCC T ss_conf 35337761788 No 13 >pfam09158 MotCF Bacteriophage T4 MotA, C-terminal. Members of this family adopt a compact alpha/beta structure comprising three alpha-helices and six beta-strands in the order: alpha1-beta1-beta2-beta3-beta4-alpha2-beta5-beta6-alpha3. The beta-strands form a single anti-parallel beta-sheet and the three alpha-helices pack side-by-side onto one surface of the beta-sheet. In this architecture, the domain's hydrophobic core is at the sheet-helix interface, and the second surface of the beta-sheet is completely exposed. The domain is a DNA-binding motif, with a consensus sequence containing nine base pairs (5'-TTTGCTTTA-3'), that appears to bind to various mot boxes, allowing access to the minor groove towards the 5'-end of this sequence and the major groove towards the 3'-end. Probab=25.01 E-value=24 Score=16.16 Aligned_cols=33 Identities=36% Similarity=0.526 Sum_probs=24.0 Q ss_pred CCCCCCEEEE-----ECCHHHHHHCCHHHHEHHHHHHH Q ss_conf 3465515787-----26701233224121110002333 Q gi|255764516|r 70 INNNNALRIF-----EGEKQTFKDFNIEVQKKVNQVNV 102 (341) Q Consensus 70 iNN~~~lRIF-----EGEKQTFKd~Nievqkkvnqvnv 102 (341) |+|+..+||| |-+.|.|++.+.++.-|.+.-|. T Consensus 48 i~n~G~~RIfgYkm~E~~~~~Ft~lg~~~kIk~ggknt 85 (103) T pfam09158 48 INNNGNMRIFGYKMMEHHIQKFTDIGMSCKIAKGGKNT 85 (103) T ss_pred EECCCCEEEEEEECCHHHHHHHHHCCCEEEEEECCCCE T ss_conf 81588189987522398999886428407983579617 No 14 >PRK00276 infA translation initiation factor IF-1; Validated Probab=24.57 E-value=43 Score=14.51 Aligned_cols=45 Identities=27% Similarity=0.437 Sum_probs=28.2 Q ss_pred CCCEEEECCCCCCCCCCCCCCEEEECCCCCCCCCCEEEEECCHHH Q ss_conf 883100040123222433310021032233465515787267012 Q gi|255764516|r 41 DGNMEELSPERDFKVDVDESSLILSSKRWINNNNALRIFEGEKQT 85 (341) Q Consensus 41 DG~~eELs~~~DFkvDVD~~~~iLsskrwiNN~~~lRIFEGEKQT 85 (341) +|.+.|.-|..-|+|..|.-..++..-.=--..+-|||..|.+-+ T Consensus 10 ~G~V~e~lpn~~F~V~Leng~~v~a~~sGKmR~~~Iril~GD~V~ 54 (72) T PRK00276 10 EGTVLETLPNAMFRVELENGHEVLAHISGKMRKNYIRILPGDKVT 54 (72) T ss_pred EEEEEEECCCCEEEEEECCCCEEEEEECHHEEEEEEEECCCCEEE T ss_conf 999999859988999978999999997413110169975899899 No 15 >TIGR03354 VI_FHA type VI secretion system FHA domain protein. Members of this protein family are FHA (forkhead-associated) domain-containing proteins that are part of type VI secretion loci in a considerable number of bacteria, most of which are known pathogens. Species include Pseudomonas aeruginosa PAO1, Aeromonas hydrophila, Yersinia pestis, Burkholderia mallei, etc. Probab=24.34 E-value=37 Score=14.89 Aligned_cols=70 Identities=27% Similarity=0.463 Sum_probs=43.4 Q ss_pred EEEEEEEEEECCCH----HHEEEECCCCCCCEEEE--CCCCCCCCCCCCCCEEEECCCCCCCCCCEEEEECCHHHHHHCC Q ss_conf 60377876631880----13123424788831000--4012322243331002103223346551578726701233224 Q gi|255764516|r 17 FGSYFVIPWTVKDP----SRIHAEVKYPDGNMEEL--SPERDFKVDVDESSLILSSKRWINNNNALRIFEGEKQTFKDFN 90 (341) Q Consensus 17 ~GS~FVIPW~vKDP----SRIHAEV~Y~DG~~eEL--s~~~DFkvDVD~~~~iLsskrwiNN~~~lRIFEGEKQTFKd~N 90 (341) +|+-.---|.+.|| ||.||+|-+.||...-- |-..-| |.++ ..=+...+-.|+-.|..-+.-+|- T Consensus 28 IGRs~~~dW~LpD~~r~IS~~Hc~I~~~dg~y~l~D~StNgt~---vN~~------~~pLg~g~~~~L~~GD~i~IG~Y~ 98 (396) T TIGR03354 28 IGRSEDCDWVLPDPERHVSGRHARIRYRDGAYLLTDLSTNGVF---LNGS------GSPLGRGNPVRLEQGDRLRLGDYE 98 (396) T ss_pred CCCCCCCCEEEECCCCCCCCCCEEEEEECCEEEEEECCCCCEE---ECCC------CCCCCCCCCEECCCCCEEEECCEE T ss_conf 4268889757508998625221699996997999988878657---8788------776789996306999999886789 Q ss_pred HHHHE Q ss_conf 12111 Q gi|255764516|r 91 IEVQK 95 (341) Q Consensus 91 ievqk 95 (341) |.|.- T Consensus 99 i~V~l 103 (396) T TIGR03354 99 IRVSL 103 (396) T ss_pred EEEEE T ss_conf 99995 No 16 >pfam06517 Orthopox_A43R Orthopoxvirus A43R protein. This family consists of several Orthopoxvirus A43R proteins. The function of this family is unknown. Probab=23.72 E-value=54 Score=13.90 Aligned_cols=33 Identities=42% Similarity=0.463 Sum_probs=22.6 Q ss_pred HHHCCCCEEEEEECCCEECCCCCCCCEEEEECC Q ss_conf 742156447999624220244465430455234 Q gi|255764516|r 151 QTQLTSDTLTLQLDDTRVDDTRGYDSSIRFKDK 183 (341) Q Consensus 151 qtqltsdtltlqlddtrvddtrgydssirfkdk 183 (341) .|-|||...|..|...|.----||||-|-+||. T Consensus 130 stiltskh~tysl~rsrc~iiigydsiiwykdn 162 (204) T pfam06517 130 STILTSKHKTYSLHRSRCIIIIGYDSIIWYKDN 162 (204) T ss_pred CEEEECCCEEEEEECCCEEEEEECCEEEEEECC T ss_conf 000202642677421316999804507996055 No 17 >TIGR00243 Dxr 1-deoxy-D-xylulose 5-phosphate reductoisomerase; InterPro: IPR003821 1-deoxy-D-xylulose 5-phosphate reductoisomerase synthesizes 2-C-methyl-D-erythritol 4-phosphate from 1-deoxy-D-xylulose 5-phosphate in a single step by intramolecular rearrangement and reduction and is responsible for terpenoid biosynthesis in some organisms . In Arabidopsis thaliana 1-deoxy-D-xylulose 5-phosphate reductoisomerase is the first committed enzyme of the non-mevalonate pathway for isoprenoid biosynthesis.; GO: 0003824 catalytic activity, 0008299 isoprenoid biosynthetic process. Probab=22.47 E-value=15 Score=17.35 Aligned_cols=44 Identities=25% Similarity=0.407 Sum_probs=32.2 Q ss_pred EEEEEEEEEE-EEEECCCHHHEEEECCCCCCCEEEECCCCCCCCC Q ss_conf 7998603778-7663188013123424788831000401232224 Q gi|255764516|r 13 SYRLFGSYFV-IPWTVKDPSRIHAEVKYPDGNMEELSPERDFKVD 56 (341) Q Consensus 13 sY~~~GS~FV-IPW~vKDPSRIHAEV~Y~DG~~eELs~~~DFkvD 56 (341) +|||||--+- |---|.--|=||+=|.|.|+-|=----+-|.++- T Consensus 237 A~WLF~~s~~~idv~ih~~SIiHSlVe~~D~SViAqLG~PDMr~P 281 (406) T TIGR00243 237 AHWLFGASYEQIDVVIHPQSIIHSLVEFQDGSVIAQLGSPDMRLP 281 (406) T ss_pred HHHHHCCCHHHCEEEECCCCEEEEEEEEECCCCEECCCCCCCCCH T ss_conf 988607886474164667662676678857600011288622003 No 18 >CHL00010 infA translation initiation factor 1 Probab=21.09 E-value=58 Score=13.67 Aligned_cols=45 Identities=27% Similarity=0.448 Sum_probs=26.1 Q ss_pred CCCEEEECCCCCCCCCCCCCCEEEECCCCCCCCCCEEEEECCHHH Q ss_conf 883100040123222433310021032233465515787267012 Q gi|255764516|r 41 DGNMEELSPERDFKVDVDESSLILSSKRWINNNNALRIFEGEKQT 85 (341) Q Consensus 41 DG~~eELs~~~DFkvDVD~~~~iLsskrwiNN~~~lRIFEGEKQT 85 (341) +|-+.|+-|..-|+|..|.-..++..------.+-|||..|.+-+ T Consensus 10 eG~V~e~Lpn~~F~V~Leng~~Vla~~sGKmR~~~Iril~GD~V~ 54 (78) T CHL00010 10 EGLVTESLPNGMFRVRLDNGDLVLGYISGKIRRNSIRILPGDRVK 54 (78) T ss_pred EEEEEEECCCCEEEEEECCCCEEEEEECCHHEECEEEECCCCEEE T ss_conf 899999869988999978999999999805200318974899899 No 19 >TIGR00538 hemN oxygen-independent coproporphyrinogen III oxidase; InterPro: IPR004558 This family represents HemN, the oxygen-independent coproporphyrinogen III oxidase that replaces HemF function under anaerobic conditions. HemN catalyses the anaerobic transformation of coproporhyrinogen-III into protoporphyrinogen-IX during porphyrin biosynthesis. Several species, including Escherichia coli, Helicobacter pylori, and Aquifex aeolicus, have both a member of this family and a member of another, closely related family for which there is no evidence of coproporphyrinogen III oxidase activity, IPR004559 from INTERPRO. Members of this family have a perfectly conserved motif PYRT[SC]YP in a region N-terminal to the region of homology with the related uncharacterised protein.; GO: 0004109 coproporphyrinogen oxidase activity, 0006779 porphyrin biosynthetic process, 0005737 cytoplasm. Probab=20.48 E-value=45 Score=14.39 Aligned_cols=48 Identities=33% Similarity=0.521 Sum_probs=29.6 Q ss_pred HHHCCHHHHEHHHHH---HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCHHHHCCCCHHHHHH Q ss_conf 332241211100023---3356443225323331100115653055531324312887678899 Q gi|255764516|r 86 FKDFNIEVQKKVNQV---NVLTQKMNTIDGIVNDLAIQTEDVGRKLEQIDLSKVEGLDPQTRKY 146 (341) Q Consensus 86 FKd~Nievqkkvnqv---nvltqkmntidgivndlaiqtedvgrkleqidlskvegldpqtrky 146 (341) .-|||-+||+-||.+ ..+-++|+.+-- .|= +-|.+--+=||--||..- T Consensus 172 vQDfd~~VQ~avnR~QP~e~i~~~~~~~R~-----------~Gf--~SiN~DLIYGLP~Qt~es 222 (462) T TIGR00538 172 VQDFDKEVQQAVNRIQPEEMIFELMEKARE-----------AGF--ESINLDLIYGLPKQTKES 222 (462) T ss_pred EECCCHHHHHHHCCCCCHHHHHHHHHHHHH-----------CCC--CEEEEEEECCCCCCCHHH T ss_conf 210785554443134868999999999986-----------698--278742013888878678 No 20 >PRK10162 acetyl esterase; Provisional Probab=20.37 E-value=63 Score=13.47 Aligned_cols=97 Identities=19% Similarity=0.285 Sum_probs=52.8 Q ss_pred HHCCCCHHHHHHHHHHHHHCCCCEEEEEECCCEECCCCCCCCEEEEECCCCCCCCEEEEEEECCEEEEEEEECCCCCCCC Q ss_conf 31288767889999987421564479996242202444654304552345754540345541220001465226667500 Q gi|255764516|r 135 KVEGLDPQTRKYLQDIQTQLTSDTLTLQLDDTRVDDTRGYDSSIRFKDKDGALGGSITRVVKGDITGLSIATKNKSGSLE 214 (341) Q Consensus 135 kvegldpqtrkylqdiqtqltsdtltlqlddtrvddtrgydssirfkdkdgalggsitrvvkgditglsiatknksgsle 214 (341) -.+-|+||-+..++..+.....+..++-.+.- -.+|+..-+|-.....-...-...+-|. .|.+. T Consensus 9 v~~~l~pe~~a~~~~~~~~~~p~~~~~~~~~~----R~~y~~~~~~~~~~~p~~~~~~~~i~~~-----------~g~i~ 73 (319) T PRK10162 9 VLDLISAEMKTVVNTLQPDLPPWPATGTIAEQ----RQYYTLERRFWNAGAPEMATRAYMVPTK-----------YGQVE 73 (319) T ss_pred CCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHH----HHHHHHHHHHCCCCCCCCCEEEEEEECC-----------CCCEE T ss_conf 61228999999999875316998455999999----9999998764189999741256566579-----------99813 Q ss_pred CEEEEECC------CEEEEEEEEEEECCCCHHHHHHHHHHC Q ss_conf 01344315------405880157871264037889987506 Q gi|255764516|r 215 NRIKFYDD------KDVYINGQCFVKGTDTSIFDEIARQLT 249 (341) Q Consensus 215 nrikfydd------kdvyingqcfvkgtdtsifdeiarqlt 249 (341) -|| |.- -=+|+.|-+||-|.-.+ .|.++|+|. T Consensus 74 ~Ri--Y~p~~~~~~vily~HGGGfv~Gs~~t-hd~~~r~lA 111 (319) T PRK10162 74 TRL--FCPQPDSPATLFYLHGGGFILGNLDT-HDRIMRLLA 111 (319) T ss_pred EEE--ECCCCCCCEEEEEECCCEEECCCHHH-HHHHHHHHH T ss_conf 899--64489998189998698100688789-899999999 No 21 >TIGR03673 rpl14p_arch 50S ribosomal protein L14P. Part of the 50S ribosomal subunit. Forms a cluster with proteins L3 and L24e, part of which may contact the 16S rRNA in 2 intersubunit bridges. Probab=20.07 E-value=64 Score=13.43 Aligned_cols=65 Identities=23% Similarity=0.389 Sum_probs=42.6 Q ss_pred CCCEEEEEE-------ECCEE-EEEEEEC----CCCCCCCCEEEEECCCEEEEEEEEEEECCCCHHHHHHHHHHCHHHHH Q ss_conf 454034554-------12200-0146522----66675000134431540588015787126403788998750677776 Q gi|255764516|r 187 LGGSITRVV-------KGDIT-GLSIATK----NKSGSLENRIKFYDDKDVYINGQCFVKGTDTSIFDEIARQLTPRFLG 254 (341) Q Consensus 187 lggsitrvv-------kgdit-glsiatk----nksgslenrikfyddkdvyingqcfvkgtdtsifdeiarqltprflg 254 (341) +|--|.-+| ||++. ++-+-|| .+.|+ .|+|-|.--|.+|-+.-.+|| -||-.++|.|.-+|.- T Consensus 48 vGD~IvvsVkk~~p~~k~~v~~AVIVRtkk~~rR~DGs---~i~FddNa~VLin~~g~P~GT--RI~GPV~rEl~~k~~K 122 (131) T TIGR03673 48 VGDMVVVSVKKGTPEMRKQVFKAVVVRQRKEYRRPDGT---RVKFEDNAVVIVTPDGEPKGT--EIKGPVAREAAERWPK 122 (131) T ss_pred CCEEEEEEEECCCCCCCCCEEEEEEEEECCCEECCCCC---EEEECCCEEEEECCCCCEEEE--EEECCCHHHHHHHHHH T ss_conf 17399999940788755887887999975766908999---998189769999899999061--7826404998851258 Q ss_pred HH Q ss_conf 52 Q gi|255764516|r 255 LL 256 (341) Q Consensus 255 ll 256 (341) ++ T Consensus 123 I~ 124 (131) T TIGR03673 123 IA 124 (131) T ss_pred HH T ss_conf 87 Done!