Query gi|254780145|ref|YP_003064558.1| 50S ribosomal protein L10 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 172 No_of_seqs 125 out of 1666 Neff 6.9 Searched_HMMs 39220 Date Sun May 22 16:43:09 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780145.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK00099 rplJ 50S ribosomal pr 100.0 1.8E-43 0 308.1 17.9 161 1-171 3-164 (164) 2 cd05797 Ribosomal_L10 Ribosoma 100.0 8.7E-41 2.8E-45 290.3 16.8 156 2-157 1-157 (157) 3 COG0244 RplJ Ribosomal protein 100.0 2.5E-36 6.4E-41 260.8 16.0 170 1-170 3-174 (175) 4 cd00379 Ribosomal_L10_P0 Ribos 100.0 1.4E-35 3.6E-40 255.9 15.3 153 4-156 1-154 (155) 5 pfam00466 Ribosomal_L10 Riboso 99.9 4.3E-21 1.1E-25 160.1 11.8 98 1-98 1-100 (100) 6 KOG4241 consensus 99.8 6.8E-19 1.7E-23 145.5 11.8 161 9-170 78-238 (245) 7 PRK04019 rplP0 acidic ribosoma 99.5 1.3E-12 3.2E-17 104.1 11.1 83 3-86 5-90 (328) 8 cd05795 Ribosomal_P0_L10e Ribo 99.4 5.9E-12 1.5E-16 99.7 10.0 90 4-94 1-96 (175) 9 PTZ00135 ribosomal phosphoprot 99.3 1.3E-11 3.3E-16 97.4 10.0 90 3-93 7-102 (312) 10 PTZ00240 60S ribosomal protein 99.2 2.3E-10 5.8E-15 89.2 10.6 82 3-85 5-100 (323) 11 cd05796 Ribosomal_P0_like Ribo 99.2 1.5E-10 3.8E-15 90.3 9.4 83 4-88 1-90 (163) 12 KOG0816 consensus 98.0 7.8E-05 2E-09 52.5 9.5 113 2-116 19-148 (223) 13 KOG0815 consensus 97.9 5.7E-05 1.5E-09 53.4 7.8 84 3-87 7-95 (245) 14 pfam11909 NdhN NADH-quinone ox 64.9 2.9 7.3E-05 22.4 1.0 25 106-130 107-131 (154) 15 cd01744 GATase1_CPSase Small c 64.5 11 0.00029 18.4 4.0 78 23-115 1-82 (178) 16 PRK00758 GMP synthase subunit 56.3 12 0.0003 18.3 2.9 68 37-116 13-81 (184) 17 PRK05670 anthranilate synthase 52.7 18 0.00046 17.1 3.4 54 22-88 1-55 (192) 18 PRK06774 para-aminobenzoate sy 50.4 20 0.00052 16.7 3.4 80 23-115 2-85 (191) 19 TIGR01588 citE citrate lyase, 46.6 14 0.00035 17.9 2.0 70 45-132 166-240 (288) 20 pfam10678 DUF2492 Protein of u 46.4 22 0.00055 16.6 3.0 48 6-53 22-72 (78) 21 TIGR03675 arCOG00543 arCOG0178 39.6 23 0.00059 16.4 2.3 23 7-29 389-411 (630) 22 CHL00101 trpG anthranilate syn 39.3 30 0.00076 15.6 3.6 79 22-115 1-85 (190) 23 PRK07765 para-aminobenzoate sy 39.1 29 0.00074 15.7 2.7 45 37-88 14-58 (221) 24 PRK08857 para-aminobenzoate sy 38.3 27 0.0007 15.9 2.5 53 23-88 2-55 (192) 25 PRK12838 carbamoyl phosphate s 36.7 33 0.00084 15.4 3.3 42 76-117 208-252 (356) 26 PRK08007 para-aminobenzoate sy 34.9 33 0.00085 15.3 2.5 79 23-114 2-84 (187) 27 PRK07649 para-aminobenzoate/an 34.6 35 0.0009 15.2 2.7 80 23-115 2-85 (195) 28 TIGR02902 spore_lonB ATP-depen 34.4 22 0.00057 16.5 1.5 61 51-111 58-126 (532) 29 pfam11576 DUF3236 Protein of u 33.6 37 0.00094 15.0 5.1 54 3-56 18-71 (154) 30 PRK05637 anthranilate synthase 32.7 38 0.00097 15.0 3.6 55 21-89 3-57 (208) 31 cd01492 Aos1_SUMO Ubiquitin ac 31.0 40 0.001 14.8 3.4 62 71-154 105-168 (197) 32 pfam10126 Nit_Regul_Hom Unchar 30.0 30 0.00076 15.6 1.6 25 18-42 26-50 (110) 33 pfam08800 VirE_N VirE N-termin 28.0 45 0.0012 14.4 4.0 32 15-46 25-56 (136) 34 PRK05414 urocanate hydratase; 27.3 47 0.0012 14.4 4.5 60 28-87 276-371 (558) 35 COG1782 Predicted metal-depend 26.5 48 0.0012 14.3 2.4 25 7-31 8-32 (637) 36 PRK09522 bifunctional anthrani 25.5 50 0.0013 14.1 3.1 11 45-55 23-33 (531) 37 KOG3218 consensus 22.6 57 0.0014 13.8 3.6 55 75-129 67-122 (208) 38 pfam01175 Urocanase Urocanase. 22.6 57 0.0015 13.8 4.5 60 28-87 265-360 (545) 39 PRK11479 hypothetical protein; 21.2 61 0.0015 13.6 3.8 33 12-44 114-146 (273) 40 TIGR02739 TraF type-F conjugat 21.1 41 0.001 14.7 0.9 49 67-116 150-204 (270) No 1 >PRK00099 rplJ 50S ribosomal protein L10; Reviewed Probab=100.00 E-value=1.8e-43 Score=308.07 Aligned_cols=161 Identities=38% Similarity=0.655 Sum_probs=155.6 Q ss_pred CCHHHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHCCCCCCCCCCCCCEEE Q ss_conf 97489999999999999508979999827998899999999996569589986157999997313742120015784378 Q gi|254780145|r 1 MNRQGKSVEISELSKIFSSSGSIVVAHYKGISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDTSIRGISDLFVGQSLI 80 (172) Q Consensus 1 m~r~~K~~~V~~l~~~l~~s~~~~v~~y~gltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~~~~l~~~l~G~~a~ 80 (172) |+|++|+++|+++.+.|+++++++++||+||||+++++||++||+.|++++|+||||+++|+++++|+++.++|+|||++ T Consensus 3 m~r~~K~~~V~~l~~~l~~s~~~~~~~y~Gltv~~~~~LR~~l~~~g~~~~VvKNtL~k~Al~~t~~~~l~~~l~G~tav 82 (164) T PRK00099 3 MNREEKKEIVAELAEKFKEAQSAVVADYRGLTVAQMTELRKELREAGVELKVVKNTLARRALEGTGFEGLDDLLKGPTAI 82 (164) T ss_pred CCHHHHHHHHHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHCCCCCCCCHHCCCCEEE T ss_conf 88899999999999999869969999768996699999999999769679996038999999546875541014787599 Q ss_pred EEE-CCCCHHHHHHHHHHHHCCHHHHHHHHCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 762-2342157887888641000122321024110798999998469998999999999999899999999977687999 Q gi|254780145|r 81 VYS-DSPVIAPKISVSFSNDNNEFRVLGGVVEKGVLNQDSIKQIASLPDLEGIRAGIISAIQSNATRLVRLLGTPQTQVV 159 (172) Q Consensus 81 ~f~-~d~~~~aK~~~~f~k~~~~l~i~gg~~eg~~l~~~~i~~lakLPs~~el~a~lv~~l~~p~~~l~~~L~~~~~~l~ 159 (172) +|+ +||+.++|++.+|.|++++++++||++||+++|.+++++||+|||++|++++++|+|++| ++++|+ T Consensus 83 ~f~~eD~~~~aK~~~~f~K~~~~~~ikgg~~eg~~l~~~~i~~lakLPskeel~a~l~g~l~aP----------~~~~l~ 152 (164) T PRK00099 83 AFSNEDPVAAAKVLKDFAKDNKKLEIKGGAFEGKVLSAEEVKALASLPSREELLAKLLGVLQAP----------PASKLA 152 (164) T ss_pred EEECCCCCHHHHHHHHHHHHCCCCEEEEEEECCEECCHHHHHHHHCCCCHHHHHHHHHHHHHCC----------CHHHHH T ss_conf 9826886278999999986458827899887784538999999974999999999999998371----------178999 Q ss_pred HHHHHHHHHHHC Q ss_conf 999999997626 Q gi|254780145|r 160 RAISAFVDKNQQ 171 (172) Q Consensus 160 ~~L~~~~~k~ee 171 (172) ++|+++.+|+|+ T Consensus 153 ~~L~ai~~kk~~ 164 (164) T PRK00099 153 RALKALAEKKEA 164 (164) T ss_pred HHHHHHHHHHCC T ss_conf 999999986339 No 2 >cd05797 Ribosomal_L10 Ribosomal protein L10 family, L10 subfamily; composed of bacterial 50S ribosomal protein and eukaryotic mitochondrial 39S ribosomal protein, L10. L10 occupies the L7/L12 stalk of the ribosome. The N-terminal domain (NTD) of L10 interacts with L11 protein and forms the base of the L7/L12 stalk, while the extended C-terminal helix binds to two or three dimers of the NTD of L7/L12 (L7 and L12 are identical except for an acetylated N-terminus). The L7/L12 stalk is known to contain the binding site for elongation factors G and Tu (EF-G and EF-Tu, respectively); however, there is disagreement as to whether or not L10 is involved in forming the binding site. The stalk is believed to be associated with GTPase activities in protein synthesis. In a neuroblastoma cell line, L10 has been shown to interact with the SH3 domain of Src and to activate the binding of the Nck1 adaptor protein with skeletal proteins such as the Wiskott-Aldrich Syndrome Protein (WASP) and the WASP-in Probab=100.00 E-value=8.7e-41 Score=290.31 Aligned_cols=156 Identities=39% Similarity=0.659 Sum_probs=151.7 Q ss_pred CHHHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHCCCCCCCCCCCCCEEEE Q ss_conf 74899999999999995089799998279988999999999965695899861579999973137421200157843787 Q gi|254780145|r 2 NRQGKSVEISELSKIFSSSGSIVVAHYKGISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDTSIRGISDLFVGQSLIV 81 (172) Q Consensus 2 ~r~~K~~~V~~l~~~l~~s~~~~v~~y~gltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~~~~l~~~l~G~~a~~ 81 (172) +|++|+++|+++++.|++++++++++|+||+|+++++||++||++|++++|+||||+|+|+++++|+.+.++|+|||+++ T Consensus 1 ~r~~K~~~v~~l~~~~~~s~~i~~~~y~gl~v~~~~~lR~~l~~~~~~~kVvKNtL~~~Al~~t~~~~l~~~l~G~~a~~ 80 (157) T cd05797 1 NREKKEEIVAELKEKLKEAKSVVVADYRGLTVAQLTELRKELREAGVKLKVVKNTLAKRALEGTGFEDLDDLLKGPTAIA 80 (157) T ss_pred CHHHHHHHHHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHCCCCCHHHHCCCCEEEE T ss_conf 97889999999999998699799997689968999999999997698699961499999986158531255366776999 Q ss_pred EE-CCCCHHHHHHHHHHHHCCHHHHHHHHCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 62-23421578878886410001223210241107989999984699989999999999998999999999776879 Q gi|254780145|r 82 YS-DSPVIAPKISVSFSNDNNEFRVLGGVVEKGVLNQDSIKQIASLPDLEGIRAGIISAIQSNATRLVRLLGTPQTQ 157 (172) Q Consensus 82 f~-~d~~~~aK~~~~f~k~~~~l~i~gg~~eg~~l~~~~i~~lakLPs~~el~a~lv~~l~~p~~~l~~~L~~~~~~ 157 (172) |+ +||+.++|++.+|.|++++++++||++||+++|.+++++||+|||+++++++++|+|++|+++++++|++++++ T Consensus 81 f~~ed~~~~aK~i~~f~K~~~~~~~~gg~~eg~~l~~~~i~~lakLPs~eel~a~l~~~l~~p~~~l~~~L~~~~~~ 157 (157) T cd05797 81 FSEEDPVAAAKVLKDFAKENKKLEIKGGVVEGKVLDAEEVKALAKLPSREELLAQLLGLLQAPATKLVRVLNAPASK 157 (157) T ss_pred EECCCCCHHHHHHHHHHHHCCCCCEEEEEECCEECCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 93688538899999998752552012333257132899999997599999999999999998999999997366689 No 3 >COG0244 RplJ Ribosomal protein L10 [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=2.5e-36 Score=260.83 Aligned_cols=170 Identities=33% Similarity=0.549 Sum_probs=165.1 Q ss_pred CCHHHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHCCCCCCCCCCCCCEEE Q ss_conf 97489999999999999508979999827998899999999996569589986157999997313742120015784378 Q gi|254780145|r 1 MNRQGKSVEISELSKIFSSSGSIVVAHYKGISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDTSIRGISDLFVGQSLI 80 (172) Q Consensus 1 m~r~~K~~~V~~l~~~l~~s~~~~v~~y~gltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~~~~l~~~l~G~~a~ 80 (172) |.|++|+++|+++++.|+++++++++||+|+++.++++||+++|+.|++++|+||||+++|+++++++.+.++|+||+++ T Consensus 3 ~~~e~K~~~v~el~e~~~~s~~~~i~dy~Gl~~~ql~~lR~~lr~~g~~lkV~KNtL~~rAl~~~~~e~l~~~l~Gp~ai 82 (175) T COG0244 3 LAREWKKELVAELKELIKESPSVVIVDYRGLTVAQLTELRKKLREAGAKLKVVKNTLLRRALEEAGLEGLDDLLKGPTAI 82 (175) T ss_pred CCHHHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHHCCEEEEEEHHHHHHHHHHHCCHHHHHHHCCCCEEE T ss_conf 40777899999999999629879999836972899999999998579189997538999999864715678756487699 Q ss_pred EEE-CCCCHHHHHHHHHHHHC-CHHHHHHHHCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 762-23421578878886410-0012232102411079899999846999899999999999989999999997768799 Q gi|254780145|r 81 VYS-DSPVIAPKISVSFSNDN-NEFRVLGGVVEKGVLNQDSIKQIASLPDLEGIRAGIISAIQSNATRLVRLLGTPQTQV 158 (172) Q Consensus 81 ~f~-~d~~~~aK~~~~f~k~~-~~l~i~gg~~eg~~l~~~~i~~lakLPs~~el~a~lv~~l~~p~~~l~~~L~~~~~~l 158 (172) +|+ +||..++|++.+|.|.+ +.++++||++||..++..+++++++|||+++++++++|.+++|.+++++.|++++.++ T Consensus 83 ~fs~~dp~~~~K~~~~f~k~~~~~~~~~~~~~eg~~l~~~~v~~~aklp~~~el~~~l~g~~~ap~~~~~~~L~a~~~~~ 162 (175) T COG0244 83 AFSNEDPVAAAKLLKDFAKEAGDKAPIKGGVPEGKVLGAAEVIALAKLPSKEELVVMLLGVLQAPATKLLRALKAVPDKV 162 (175) T ss_pred EEECCCHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHCCCCCHHHHHHHHHCCHHHHHHHHHHHHCCCHHHH T ss_conf 99578879999999998654154014889983586168889876505684888999985221524789999970446776 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999762 Q gi|254780145|r 159 VRAISAFVDKNQ 170 (172) Q Consensus 159 ~~~L~~~~~k~e 170 (172) .+.+.++.++++ T Consensus 163 ~~~~~a~~~~g~ 174 (175) T COG0244 163 GLKLLAAYEKGV 174 (175) T ss_pred HHHHHHHHCCCC T ss_conf 446665303578 No 4 >cd00379 Ribosomal_L10_P0 Ribosomal protein L10 family; composed of the large subunit ribosomal protein called L10 in bacteria, P0 in eukaryotes, and L10e in archaea, as well as uncharacterized P0-like eukaryotic proteins. In all three kingdoms, L10 forms a tight complex with multiple copies of the small acidic protein L12(e). This complex forms a stalk structure on the large subunit of the ribosome. The N-terminal domain (NTD) of L10 interacts with L11 protein and forms the base of the L7/L12 stalk, while the extended C-terminal helix binds to two or three dimers of the NTD of L7/L12 (L7 and L12 are identical except for an acetylated N-terminus). The L7/L12 stalk is known to contain the binding site for elongation factors G and Tu (EF-G and EF-Tu, respectively); however, there is disagreement as to whether or not L10 is involved in forming the binding site. The stalk is believed to be associated with GTPase activities in protein synthesis. In a neuroblastoma cell line, L10 has been sho Probab=100.00 E-value=1.4e-35 Score=255.89 Aligned_cols=153 Identities=31% Similarity=0.550 Sum_probs=148.2 Q ss_pred HHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHCCCCCCCCCCCCCEEEEEE Q ss_conf 89999999999999508979999827998899999999996569589986157999997313742120015784378762 Q gi|254780145|r 4 QGKSVEISELSKIFSSSGSIVVAHYKGISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDTSIRGISDLFVGQSLIVYS 83 (172) Q Consensus 4 ~~K~~~V~~l~~~l~~s~~~~v~~y~gltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~~~~l~~~l~G~~a~~f~ 83 (172) ++|+++|+++++.|++++++++++|+|++++++++||++||+.|++++|+||||+++|+++++++++.++|+||++++|+ T Consensus 1 e~K~~~v~~l~~~~~~~~~v~v~~~~gl~~~~~~~lR~~l~~~~~~~~V~KNtL~k~Al~~~~~~~l~~~l~G~~a~~f~ 80 (155) T cd00379 1 EKKEELVEELKELLKKYKSVVVVDYRGLTVAQLTELRKELRESGAKLKVGKNTLMRRALKGTGFEELKPLLKGPTALAFT 80 (155) T ss_pred CCHHHHHHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECHHHHHHHHHCCCCCCCCCCCCCEEEEEE T ss_conf 94489999999999869989999779997899999999988579179998558899998605865443225477589982 Q ss_pred C-CCCHHHHHHHHHHHHCCHHHHHHHHCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2-342157887888641000122321024110798999998469998999999999999899999999977687 Q gi|254780145|r 84 D-SPVIAPKISVSFSNDNNEFRVLGGVVEKGVLNQDSIKQIASLPDLEGIRAGIISAIQSNATRLVRLLGTPQT 156 (172) Q Consensus 84 ~-d~~~~aK~~~~f~k~~~~l~i~gg~~eg~~l~~~~i~~lakLPs~~el~a~lv~~l~~p~~~l~~~L~~~~~ 156 (172) + ||..++|++.+|.|+++++.++||++||++++++++..+|+|||++++++++++++++|+++++++|++++. T Consensus 81 ~~d~~~~aK~~~~f~k~~~~~~~k~G~~~~~~~~~~~~~~la~lPs~~~l~~~~v~~l~~p~~~l~~~L~~~~~ 154 (155) T cd00379 81 NEDPVEVAKVLKDFAKENKKLFAKGGVVAGKVLDPAGVTALAKLPSREELLAMLIGLLKAPIAKLARLLNALGI 154 (155) T ss_pred CCCCHHHHHHHHHHHHHCCCCEEEEEEECCEEECHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 78825899999999975678215788776845279999999849999999999999998899999999716899 No 5 >pfam00466 Ribosomal_L10 Ribosomal protein L10. Probab=99.86 E-value=4.3e-21 Score=160.08 Aligned_cols=98 Identities=37% Similarity=0.671 Sum_probs=93.3 Q ss_pred CCHHHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHCCC-CCCCCCCCCCEE Q ss_conf 9748999999999999950897999982799889999999999656958998615799999731374-212001578437 Q gi|254780145|r 1 MNRQGKSVEISELSKIFSSSGSIVVAHYKGISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDTSI-RGISDLFVGQSL 79 (172) Q Consensus 1 m~r~~K~~~V~~l~~~l~~s~~~~v~~y~gltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~~-~~l~~~l~G~~a 79 (172) |+|++|+++|+++.+.|++++.+++++|+||+++++++||++||+.|++++|+||||+++|++++++ +.+.++|+||+| T Consensus 1 ~~~~~K~~~v~~l~~~~~~~~~~~v~~~~gl~~~~~~~lR~~l~~~~~~~~V~KNtL~k~Al~~~~~~~~l~~~l~G~~a 80 (100) T pfam00466 1 MTREKKEEIVEELKELLKEYKSVVVVDYRGLTVAQLTELRKKLREAGAELKVGKNTLMRRALEGTGEEEGLSELLKGPTA 80 (100) T ss_pred CCHHHHHHHHHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHCCCCCCHHHHCCCCEE T ss_conf 98789999999999999749839999549997899999999999779879997389999999808974215775669879 Q ss_pred EEEE-CCCCHHHHHHHHHHH Q ss_conf 8762-234215788788864 Q gi|254780145|r 80 IVYS-DSPVIAPKISVSFSN 98 (172) Q Consensus 80 ~~f~-~d~~~~aK~~~~f~k 98 (172) ++|+ +||..++|++.+|.| T Consensus 81 iif~~~d~~~~~K~~~~f~k 100 (100) T pfam00466 81 LIFTNEDPVAVAKVLKKFAK 100 (100) T ss_pred EEEECCCHHHHHHHHHHHCC T ss_conf 99968885799999997559 No 6 >KOG4241 consensus Probab=99.80 E-value=6.8e-19 Score=145.52 Aligned_cols=161 Identities=19% Similarity=0.289 Sum_probs=150.8 Q ss_pred HHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCH Q ss_conf 99999999950897999982799889999999999656958998615799999731374212001578437876223421 Q gi|254780145|r 9 EISELSKIFSSSGSIVVAHYKGISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDTSIRGISDLFVGQSLIVYSDSPVI 88 (172) Q Consensus 9 ~V~~l~~~l~~s~~~~v~~y~gltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~~~~l~~~l~G~~a~~f~~d~~~ 88 (172) ...|+..+++++.++.||++..+++.++.-.|.+||++++.|+.|.|.+++.++++|+|+.+.+.|.|..+++|+.|+. T Consensus 78 ~~re~~~v~~~~R~~Avcq~~~v~a~d~~~~r~QLrk~ni~~ksygnkIlk~~~~~t~y~~l~plfvgnh~ill~~d~~- 156 (245) T KOG4241 78 NLREDWMVREEFRVMAVCQFLPVPARDLWFARNQLRKKNIEFKSYGNKILKKIFDKTPYSSLNPLFVGNHAILLAKDIS- 156 (245) T ss_pred HHHHHHHHHHHHHHEEEEECCCCCHHHHHHHHHHHHHCCCHHHHCHHHHHHHHHHCCCHHHHHHHEECCCEEEECCCHH- T ss_conf 9999999885310202232366767889999999976060010011889999983383344221114461379728768- Q ss_pred HHHHHHHHHHHCCHHHHHHHHCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 57887888641000122321024110798999998469998999999999999899999999977687999999999997 Q gi|254780145|r 89 APKISVSFSNDNNEFRVLGGVVEKGVLNQDSIKQIASLPDLEGIRAGIISAIQSNATRLVRLLGTPQTQVVRAISAFVDK 168 (172) Q Consensus 89 ~aK~~~~f~k~~~~l~i~gg~~eg~~l~~~~i~~lakLPs~~el~a~lv~~l~~p~~~l~~~L~~~~~~l~~~L~~~~~k 168 (172) ..|.+.....+.+.+.+.||+++..+++..++.++|+|||++.++|+++..+....+++...+.+.++.+..+|+++.+- T Consensus 157 kik~~lri~r~vP~~~llgG~I~dtvlsRnqlv~yaklpsL~~vqa~lvQ~l~qq~~~l~~ql~~~qr~~~svld~h~k~ 236 (245) T KOG4241 157 KIKSILRITRKVPLVMLLGGRIVDTVLSRNQLVEYAKLPSLEDVQAHLVQLLGQQVAQLTIQLDTVQRHLPSVLDSHIKT 236 (245) T ss_pred HHHHHHHHHHHCCHHEECCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHCCCCC T ss_conf 99999999862640013166438989877878888637980889999999742428999999999986110676502788 Q ss_pred HH Q ss_conf 62 Q gi|254780145|r 169 NQ 170 (172) Q Consensus 169 ~e 170 (172) .. T Consensus 237 ~~ 238 (245) T KOG4241 237 NQ 238 (245) T ss_pred CC T ss_conf 83 No 7 >PRK04019 rplP0 acidic ribosomal protein P0; Validated Probab=99.45 E-value=1.3e-12 Score=104.09 Aligned_cols=83 Identities=23% Similarity=0.486 Sum_probs=77.1 Q ss_pred HHHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHCC---CCCCCCCCCCCEE Q ss_conf 4899999999999995089799998279988999999999965695899861579999973137---4212001578437 Q gi|254780145|r 3 RQGKSVEISELSKIFSSSGSIVVAHYKGISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDTS---IRGISDLFVGQSL 79 (172) Q Consensus 3 r~~K~~~V~~l~~~l~~s~~~~v~~y~gltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~---~~~l~~~l~G~~a 79 (172) .++|.+.|+++.+.++.++.+++++..|+++.|++++|+.||.. +.++|.||||+++|+++++ ++.|.+++.|+++ T Consensus 5 ~~wK~~~v~eL~~ll~~y~~v~iv~i~~vps~Qlq~iR~~LRg~-a~i~m~KntLi~~Al~~~~~~~~~~L~~~l~g~v~ 83 (328) T PRK04019 5 PEWKKEEVEELKELIKSYKVVAIVDITGIPARQLQEIRRKLRGK-AELKVSKNTLIERALEEAGGEELEKLEEYLEGQVA 83 (328) T ss_pred CHHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHCC-CEEEEEHHHHHHHHHHHCCCCHHHHHHHHCCCCEE T ss_conf 47899999999999865987999980489679999999995588-47999657999999997476138999986158869 Q ss_pred EEEECCC Q ss_conf 8762234 Q gi|254780145|r 80 IVYSDSP 86 (172) Q Consensus 80 ~~f~~d~ 86 (172) ++|++++ T Consensus 84 lifTn~n 90 (328) T PRK04019 84 LIFTNMN 90 (328) T ss_pred EEEECCC T ss_conf 9994899 No 8 >cd05795 Ribosomal_P0_L10e Ribosomal protein L10 family, P0 and L10e subfamily; composed of eukaryotic 60S ribosomal protein P0 and the archaeal P0 homolog, L10e. P0 or L10e forms a tight complex with multiple copies of the small acidic protein L12(e). This complex forms a stalk structure on the large subunit of the ribosome. The stalk is known to contain the binding site for elongation factors G and Tu (EF-G and EF-Tu, respectively); however, there is disagreement as to whether or not L10 is involved in forming the binding site. The stalk is believed to be associated with GTPase activities in protein synthesis. In a neuroblastoma cell line, L10 has been shown to interact with the SH3 domain of Src and to activate the binding of the Nck1 adaptor protein with skeletal proteins such as the Wiskott-Aldrich Syndrome Protein (WASP) and the WASP-interacting protein (WIP). These eukaryotic and archaeal P0 sequences have an additional C-terminal domain homologous with acidic proteins P1 and P2. Probab=99.37 E-value=5.9e-12 Score=99.66 Aligned_cols=90 Identities=16% Similarity=0.337 Sum_probs=78.3 Q ss_pred HHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHCC-----CCCCCCCCCCCE Q ss_conf 899999999999995089799998279988999999999965695899861579999973137-----421200157843 Q gi|254780145|r 4 QGKSVEISELSKIFSSSGSIVVAHYKGISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDTS-----IRGISDLFVGQS 78 (172) Q Consensus 4 ~~K~~~V~~l~~~l~~s~~~~v~~y~gltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~-----~~~l~~~l~G~~ 78 (172) ++|+++|+++++.+++++.++++++.|+.++++.++|+.+|.. +.+.|.||++|++|+++++ ++.+.++++|++ T Consensus 1 ewK~~~v~~l~~~l~~y~~v~iv~~~n~~s~qlq~iR~~lr~~-~~~~~gKn~lm~~Al~~~~e~~~~l~~l~~~l~G~~ 79 (175) T cd05795 1 EWKKEYVEKLTELLKSYPKVLIVDADNVGSKQLQKIRRSLRGK-AEILMGKNTLIRRALRNLGDENPELEKLLPYLKGNV 79 (175) T ss_pred CHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHCC-EEEEEEHHHHHHHHHHHCCCCCHHHHHHHHHHCCCE T ss_conf 9379999999999865998999994688719999999982278-189984458999999862223242999998625886 Q ss_pred EEEEECCCCH-HHHHHH Q ss_conf 7876223421-578878 Q gi|254780145|r 79 LIVYSDSPVI-APKISV 94 (172) Q Consensus 79 a~~f~~d~~~-~aK~~~ 94 (172) +++|++++.. ..+.+. T Consensus 80 glifTn~~~~ev~~~~~ 96 (175) T cd05795 80 GFIFTNGDPFEIRKILE 96 (175) T ss_pred EEEEECCCHHHHHHHHH T ss_conf 99992899899999999 No 9 >PTZ00135 ribosomal phosphoprotein P0; Provisional Probab=99.33 E-value=1.3e-11 Score=97.39 Aligned_cols=90 Identities=14% Similarity=0.285 Sum_probs=78.8 Q ss_pred HHHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHC-----CCCCCCCCCCCC Q ss_conf 489999999999999508979999827998899999999996569589986157999997313-----742120015784 Q gi|254780145|r 3 RQGKSVEISELSKIFSSSGSIVVAHYKGISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDT-----SIRGISDLFVGQ 77 (172) Q Consensus 3 r~~K~~~V~~l~~~l~~s~~~~v~~y~gltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~-----~~~~l~~~l~G~ 77 (172) .++|.+.++++.+.|++++.+++++..|+.+.|++++|+.||.. +.+.|.||||+|+|+++. .++.|.+++.|+ T Consensus 7 ~~~K~~y~~kl~~Ll~~y~~v~IV~i~nV~S~Qmq~IR~~LRg~-a~ilmgKNTlir~Al~~~~~~~~~le~L~~~v~gn 85 (312) T PTZ00135 7 QEKKKLYFERLTSLIKTYSKILIVSVDNVGSRQMASVRHSLRGK-ATILMGKNTRIRTALKKNFPASPQIEKLLPCVKLN 85 (312) T ss_pred HHHHHHHHHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHHHCCC-CEEEEEHHHHHHHHHHHHHCCCCCHHHHHHHHCCC T ss_conf 89999999999999875997999972788739999999983598-18998157899999997530370189987640476 Q ss_pred EEEEEEC-CCCHHHHHH Q ss_conf 3787622-342157887 Q gi|254780145|r 78 SLIVYSD-SPVIAPKIS 93 (172) Q Consensus 78 ~a~~f~~-d~~~~aK~~ 93 (172) ++++|++ ||...-+.+ T Consensus 86 vg~IFTn~d~fel~~~l 102 (312) T PTZ00135 86 MGFVFCKADPMEVRNII 102 (312) T ss_pred EEEEEECCCHHHHHHHH T ss_conf 57999589989999999 No 10 >PTZ00240 60S ribosomal protein P0; Provisional Probab=99.20 E-value=2.3e-10 Score=89.16 Aligned_cols=82 Identities=12% Similarity=0.275 Sum_probs=71.6 Q ss_pred HHHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHCC-------CCCC----- Q ss_conf 4899999999999995089799998279988999999999965695899861579999973137-------4212----- Q gi|254780145|r 3 RQGKSVEISELSKIFSSSGSIVVAHYKGISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDTS-------IRGI----- 70 (172) Q Consensus 3 r~~K~~~V~~l~~~l~~s~~~~v~~y~gltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~-------~~~l----- 70 (172) -++|.+.++++.+.|++++.+++++..|+.+.|++++|+.||.. +.++|.||||+|+|+++.. ++.+ T Consensus 5 a~wK~~~v~kL~~Ll~~Y~~v~IV~~dnV~S~Qlq~iR~~LRg~-a~i~mgKNTlirkal~~~~~~~~~~~~~kl~~~~~ 83 (323) T PTZ00240 5 TTAKREYEERLVDCLTKYSCVLFVGMDNVRSQQVHDVRRALRGK-AEFVMGKKTLQAKIVEKRAQAKKASAEAKLFNDQC 83 (323) T ss_pred CHHHHHHHHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHHHCCC-EEEEEEHHHHHHHHHHHHHHHCCCCHHHHHHHHCC T ss_conf 07899999999998852997999972788759999999982297-59998467899999998553047615666422001 Q ss_pred --CCCCCCCEEEEEECC Q ss_conf --001578437876223 Q gi|254780145|r 71 --SDLFVGQSLIVYSDS 85 (172) Q Consensus 71 --~~~l~G~~a~~f~~d 85 (172) .++++|+.+++|+++ T Consensus 84 ~~~~ll~Gnvg~IFTn~ 100 (323) T PTZ00240 84 EEKNLLSGNTGLIFTNN 100 (323) T ss_pred CCHHHCCCCEEEEEECC T ss_conf 20100148658999689 No 11 >cd05796 Ribosomal_P0_like Ribosomal protein L10 family, P0-like protein subfamily; composed of uncharacterized eukaryotic proteins with similarity to the 60S ribosomal protein P0, including the Saccharomyces cerevisiae protein called mRNA turnover protein 4 (MRT4). MRT4 may be involved in mRNA decay. P0 forms a tight complex with multiple copies of the small acidic protein L12(e). This complex forms a stalk structure on the large subunit of the ribosome. It occupies the L7/L12 stalk of the ribosome. The stalk is known to contain the binding site for elongation factors EF-G and EF-Tu; however, there is disagreement as to whether or not P0 is involved in forming the binding site. The stalk is believed to be associated with GTPase activities in protein synthesis. In a neuroblastoma cell line, P0 has been shown to interact with the SH3 domain of Src and to activate the binding of the Nck1 adaptor protein with skeletal proteins such as the Wiskott-Aldrich Syndrome Protein (WASP) and the WAS Probab=99.19 E-value=1.5e-10 Score=90.35 Aligned_cols=83 Identities=17% Similarity=0.322 Sum_probs=73.9 Q ss_pred HHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHC-------CCCCCCCCCCC Q ss_conf 89999999999999508979999827998899999999996569589986157999997313-------74212001578 Q gi|254780145|r 4 QGKSVEISELSKIFSSSGSIVVAHYKGISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDT-------SIRGISDLFVG 76 (172) Q Consensus 4 ~~K~~~V~~l~~~l~~s~~~~v~~y~gltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~-------~~~~l~~~l~G 76 (172) ++|.++|+++++.+++++.++++++.++.++++.++|+.||. .++.+.||+||++|+..+ +++.+.++++| T Consensus 1 E~K~~~v~~l~e~l~~y~~v~iv~~~nv~s~qlq~iR~~lr~--~~i~~gKnkl~~~AL~~~~~~e~~~~l~~l~~~l~G 78 (163) T cd05796 1 ELKQKLVENIREAVDKYKYIYVFSVDNMRNNKLKDIRQEWKD--SRFFFGKNKVMQVALGRTPEDEYKPNLHKLSKYLKG 78 (163) T ss_pred CCHHHHHHHHHHHHHHCCEEEEEEECCCCCHHHHHHHHHHCC--CEEEEEHHHHHHHHHHHCHHHHCCCCHHHHHHHHCC T ss_conf 937999999999997699899999458866999999998167--819984579999997216443125569999998559 Q ss_pred CEEEEEECCCCH Q ss_conf 437876223421 Q gi|254780145|r 77 QSLIVYSDSPVI 88 (172) Q Consensus 77 ~~a~~f~~d~~~ 88 (172) +++++|++++.. T Consensus 79 ~~gliFTn~~~~ 90 (163) T cd05796 79 QVGLLFTNEPPE 90 (163) T ss_pred CEEEEEECCCHH T ss_conf 868999789999 No 12 >KOG0816 consensus Probab=97.99 E-value=7.8e-05 Score=52.55 Aligned_cols=113 Identities=12% Similarity=0.271 Sum_probs=88.2 Q ss_pred CHHHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHCC-------CCCCCCCC Q ss_conf 74899999999999995089799998279988999999999965695899861579999973137-------42120015 Q gi|254780145|r 2 NRQGKSVEISELSKIFSSSGSIVVAHYKGISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDTS-------IRGISDLF 74 (172) Q Consensus 2 ~r~~K~~~V~~l~~~l~~s~~~~v~~y~gltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~-------~~~l~~~l 74 (172) .++.|+.+|+++++..+..+.+++++..+|.-.-+-++|...+ |.++-.-||.+|-+|+..+. ...+...+ T Consensus 19 g~e~K~~i~~~ir~~vd~Y~~v~Vf~~~nMRn~~lk~iR~~~k--~sr~f~GknKVm~vaLG~~~~dE~~~~l~klskll 96 (223) T KOG0816 19 GKEKKEAIVEEIREAVDKYPYVFVFEVPNMRNTTLKEIRQDFK--NSRFFFGKNKVMQVALGRSREDEYKENLYKLSKLL 96 (223) T ss_pred HHHHHHHHHHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHHHH--CCCEECCHHHHHHHHHCCCHHHHHHHHHHHHHHHH T ss_conf 0778987899999998628748999728864078999999852--26231141278899866780477888788776530 Q ss_pred CCCEEEEEECCCCHH----HH--HHHHHHHHCCH----HHHHHHHCCCCCCC Q ss_conf 784378762234215----78--87888641000----12232102411079 Q gi|254780145|r 75 VGQSLIVYSDSPVIA----PK--ISVSFSNDNNE----FRVLGGVVEKGVLN 116 (172) Q Consensus 75 ~G~~a~~f~~d~~~~----aK--~~~~f~k~~~~----l~i~gg~~eg~~l~ 116 (172) +|.+.+.|++.+..- ++ .-.||+..+.. +.|-.|.++..-.+ T Consensus 97 ~G~~GLlFTd~~keeV~e~f~sy~~~DyaR~g~vA~~Tv~ip~Gpl~~f~hs 148 (223) T KOG0816 97 KGSVGLLFTDMSKEEVIEWFRSYVEEDYARAGDVAPETVTIPEGPLEQFAHS 148 (223) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCHHHCCCC T ss_conf 6756788517988999999998777765115776751374077442321344 No 13 >KOG0815 consensus Probab=97.93 E-value=5.7e-05 Score=53.45 Aligned_cols=84 Identities=20% Similarity=0.370 Sum_probs=75.0 Q ss_pred HHHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHC-----CCCCCCCCCCCC Q ss_conf 489999999999999508979999827998899999999996569589986157999997313-----742120015784 Q gi|254780145|r 3 RQGKSVEISELSKIFSSSGSIVVAHYKGISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDT-----SIRGISDLFVGQ 77 (172) Q Consensus 3 r~~K~~~V~~l~~~l~~s~~~~v~~y~gltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~-----~~~~l~~~l~G~ 77 (172) +++|...-..+.+.+++....+++.-.+....|+...|+.||.. +.+.+-|||++|+|..+. .++.+.|.+.|+ T Consensus 7 ~e~k~~y~~K~~~L~~~y~~~~~v~~dnv~s~ql~~ir~~lrg~-a~vlmgkntm~rrair~~~~~~~~~~~llp~~~g~ 85 (245) T KOG0815 7 AEKKAVYFTKLRQLFEEYPKCFVVGADNVGSTQLQNIRKQLRGD-AVVLMGKNTMMRRAIRGHLENNPALEKLLPVVKGN 85 (245) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCC-EEEEECHHHHHHHHHHHCCCCCHHHHHHCCCEEEC T ss_conf 67788888878999875584699951300027799999875675-36641665899999874068767777313205403 Q ss_pred EEEEEECCCC Q ss_conf 3787622342 Q gi|254780145|r 78 SLIVYSDSPV 87 (172) Q Consensus 78 ~a~~f~~d~~ 87 (172) +.++|..++. T Consensus 86 vgfvftk~~L 95 (245) T KOG0815 86 VGFVFTKGDL 95 (245) T ss_pred EEEEEEECCH T ss_conf 3689980637 No 14 >pfam11909 NdhN NADH-quinone oxidoreductase cyanobacterial subunit N. The proton-pumping NADH:ubiquinone oxidoreductase catalyzes the electron transfer from NADH to ubiquinone linked with proton translocation across the membrane. It is the largest, most complex and least understood of the respiratory chain enzymes and is referred to as Complex I. The subunit composition of the enzyme varies between groups of organisms. Complex I originating from mammalian mitochondria contains 45 different proteins, whereas in bacteria, the corresponding complex NDH-1 consists of 14 different polypeptides. Homologues of these 14 proteins are found among subunits of the mitochondrial complex I, and therefore bacterial NDH-1 might be considered a model proton-pumping NADH dehydrogenase with a minimal set of subunits. Escherichia coli NDH-1 readily disintegrates into 3 subcomplexes: a water-soluble NADH dehydrogenase fragment (NuoE, -F, and -G),the connecting fragment (NuoB, -C, -D, and -I), and the membra Probab=64.92 E-value=2.9 Score=22.37 Aligned_cols=25 Identities=24% Similarity=0.388 Sum_probs=21.5 Q ss_pred HHHHCCCCCCCHHHHHHHHCCCCHH Q ss_conf 3210241107989999984699989 Q gi|254780145|r 106 LGGVVEKGVLNQDSIKQIASLPDLE 130 (172) Q Consensus 106 ~gg~~eg~~l~~~~i~~lakLPs~~ 130 (172) .-=++||+++|.+|+..|++||.++ T Consensus 107 vLWliEG~vLS~~El~~L~~L~~~e 131 (154) T pfam11909 107 VLWLLEGQVLSKSELEYLCDLPQRE 131 (154) T ss_pred EEEEEECEECCHHHHHHHHHCCCCC T ss_conf 9999834044799999998402237 No 15 >cd01744 GATase1_CPSase Small chain of the glutamine-dependent form of carbamoyl phosphate synthase, CPSase II. This group of sequences represents the small chain of the glutamine-dependent form of carbamoyl phosphate synthase, CPSase II. CPSase II catalyzes the production of carbomyl phosphate (CP) from bicarbonate, glutamine and two molecules of MgATP. The reaction is believed to proceed by a series of four biochemical reactions involving a minimum of three discrete highly reactive intermediates. The synthesis of CP is critical for the initiation of two separate biosynthetic pathways. In one CP is coupled to aspartate, its carbon and nitrogen nuclei ultimately incorporated into the aromatic moieties of pyrimidine nucleotides. In the second pathway CP is condensed with ornithine at the start of the urea cycle and is utilized for the detoxification of ammonia and biosynthesis of arginine. CPSases may be encoded by one or by several genes, depending on the species. The E.coli enzyme is Probab=64.48 E-value=11 Score=18.42 Aligned_cols=78 Identities=15% Similarity=0.201 Sum_probs=42.6 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCHHH--HHHHHHHHH- Q ss_conf 99998279988999999999965695899861579999973137421200157843787622342157--887888641- Q gi|254780145|r 23 IVVAHYKGISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDTSIRGISDLFVGQSLIVYSDSPVIAP--KISVSFSND- 99 (172) Q Consensus 23 ~~v~~y~gltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~~~~l~~~l~G~~a~~f~~d~~~~a--K~~~~f~k~- 99 (172) ++++||- .|.+-+ +.+|+.|++..|++|+.--.-++. ..|.++++|..|..+. ....+..++ T Consensus 1 i~ilDfG----sqy~i~-r~lrelg~~~~v~~~~~~~~~i~~----------~~p~giIlS~GPg~p~~~~~~~~~~~~~ 65 (178) T cd01744 1 VVVIDFG----VKHNIL-RELLKRGCEVTVVPYNTDAEEILK----------LDPDGIFLSNGPGDPALLDEAIKTVRKL 65 (178) T ss_pred CEEEECC----CHHHHH-HHHHHCCCEEEEEECCCCHHHHHH----------CCCCEEEECCCCCCHHHHCCHHHHHHHH T ss_conf 9999887----499999-999987997999939899999984----------1999799899999957734449999999 Q ss_pred -CCHHHHHHHHCCCCCC Q ss_conf -0001223210241107 Q gi|254780145|r 100 -NNEFRVLGGVVEKGVL 115 (172) Q Consensus 100 -~~~l~i~gg~~eg~~l 115 (172) ..++.|.|-|+.-+.+ T Consensus 66 ~~~~iPILGIClG~Q~i 82 (178) T cd01744 66 LGKKIPIFGICLGHQLL 82 (178) T ss_pred HHCCCCEEEECHHHHHH T ss_conf 74699889981217889 No 16 >PRK00758 GMP synthase subunit A; Validated Probab=56.30 E-value=12 Score=18.34 Aligned_cols=68 Identities=15% Similarity=0.227 Sum_probs=40.3 Q ss_pred HHHHHHHHHCCCEEEEECHHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCHH-HHHHHHHHHHCCHHHHHHHHCCCCCC Q ss_conf 99999996569589986157999997313742120015784378762234215-78878886410001223210241107 Q gi|254780145|r 37 KDLRKKMREAGGGVKVAKNRLVKIAIRDTSIRGISDLFVGQSLIVYSDSPVIA-PKISVSFSNDNNEFRVLGGVVEKGVL 115 (172) Q Consensus 37 ~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~~~~l~~~l~G~~a~~f~~d~~~~-aK~~~~f~k~~~~l~i~gg~~eg~~l 115 (172) ..+-+.||+.|+.+.|++|+.. .+ +....|.++++|+.|... +....++.++. +..+.|-++.-+++ T Consensus 13 ~ni~~~l~~~g~~v~v~~~~~~--------~~---~i~~~~dgiilS~GP~~~~~~~~~~~i~~~-~iPiLGIClG~Q~l 80 (184) T PRK00758 13 HLIHRTLRYLGVDAKIIPNTTP--------LE---EIKAEPDGIILSGGPEIERAGNCEEYLLEL-DVPILGICLGHQLI 80 (184) T ss_pred HHHHHHHHHCCCEEEEEECCCC--------HH---HHHHCCCEEEECCCCCCCCCCCHHHHHHHC-CCCEEEEEHHHHHH T ss_conf 9999999978992899959898--------99---996389989988999833322069999865-99889971776665 Q ss_pred C Q ss_conf 9 Q gi|254780145|r 116 N 116 (172) Q Consensus 116 ~ 116 (172) . T Consensus 81 ~ 81 (184) T PRK00758 81 A 81 (184) T ss_pred H T ss_conf 8 No 17 >PRK05670 anthranilate synthase component II; Provisional Probab=52.74 E-value=18 Score=17.11 Aligned_cols=54 Identities=22% Similarity=0.289 Sum_probs=31.0 Q ss_pred EEEEEEECC-CCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCH Q ss_conf 799998279-9889999999999656958998615799999731374212001578437876223421 Q gi|254780145|r 22 SIVVAHYKG-ISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDTSIRGISDLFVGQSLIVYSDSPVI 88 (172) Q Consensus 22 ~~~v~~y~g-ltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~~~~l~~~l~G~~a~~f~~d~~~ 88 (172) +++++|+.+ .| ..+-+.+|+.|+...|++|.-. ..+.+. -..|.++++|..|.. T Consensus 1 miLiiD~~dsfT----~nI~~~lr~~g~~v~V~~~d~~-------~~~~i~--~~~pdgiiLS~GPg~ 55 (192) T PRK05670 1 MILLIDNYDSFT----YNLVQYLGELGAEVVVYRNDEI-------TLEEIE--ALAPDAIVLSPGPGT 55 (192) T ss_pred CEEEEECCCCHH----HHHHHHHHHCCCEEEEEECCCC-------CHHHHH--HCCCCEEEECCCCCC T ss_conf 999996898689----9999999868996999989989-------999998--509898999999999 No 18 >PRK06774 para-aminobenzoate synthase component II; Provisional Probab=50.37 E-value=20 Score=16.74 Aligned_cols=80 Identities=20% Similarity=0.316 Sum_probs=39.7 Q ss_pred EEEEEEC-CCCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCHH--HHHHHHHHHH Q ss_conf 9999827-998899999999996569589986157999997313742120015784378762234215--7887888641 Q gi|254780145|r 23 IVVAHYK-GISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDTSIRGISDLFVGQSLIVYSDSPVIA--PKISVSFSND 99 (172) Q Consensus 23 ~~v~~y~-gltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~~~~l~~~l~G~~a~~f~~d~~~~--aK~~~~f~k~ 99 (172) ++++|+. -.| ..+-+.||+.|+++.|++|. +...+.+.. ..|.+++.|..|..+ +....+..+. T Consensus 2 ILiiDn~DsFT----~nl~~~l~~~g~~v~V~~~d-------~~~~~~i~~--~~p~~ivLSpGPG~P~~~~~~~~~i~~ 68 (191) T PRK06774 2 LLLIDNYDSFT----YNLYQYFCELGTEVMVKRND-------ELQLTDIEQ--LAPSHVVISPGPCTPNEAGISLAVIRH 68 (191) T ss_pred EEEEECCCCHH----HHHHHHHHHCCCEEEEECCC-------CCCHHHHHH--CCCCEEEECCCCCCHHHHCCHHHHHHH T ss_conf 99996999769----99999998779959998699-------889999985--399969999999896790730689997 Q ss_pred C-CHHHHHHHHCCCCCC Q ss_conf 0-001223210241107 Q gi|254780145|r 100 N-NEFRVLGGVVEKGVL 115 (172) Q Consensus 100 ~-~~l~i~gg~~eg~~l 115 (172) . .++.|.|-|+.-+++ T Consensus 69 ~~~~iPILGIClG~Q~i 85 (191) T PRK06774 69 FADKLPILGVCLGHQAL 85 (191) T ss_pred HCCCCCEEEEHHHHHHH T ss_conf 44699788616879998 No 19 >TIGR01588 citE citrate lyase, beta subunit; InterPro: IPR006475 This group of sequences represent the beta subunit of the holoenzyme citrate lyase (4.1.3.6 from EC) composed of alpha (2.8.3.10 from EC), beta, and acyl carrier protein subunits in a stoichiometric relationship of 6:6:6. Citrate lyase is an enzyme which converts citrate to oxaloacetate. In bacteria, this reaction is involved in citrate fermentation. The beta subunit catalyzes the reaction (3S)-citryl-CoA = acetyl-CoA + oxaloacetate. The group contains an experimentally characterised member from Leuconostoc mesenteroides . The group contains sequences from a wide range of Gram-positive bacteria. For Gram-negative bacteria, it appears that only the sequences from gamma proteobacteria are present.; GO: 0008816 citryl-CoA lyase activity, 0006084 acetyl-CoA metabolic process, 0005737 cytoplasm, 0009346 citrate lyase complex. Probab=46.61 E-value=14 Score=17.85 Aligned_cols=70 Identities=20% Similarity=0.319 Sum_probs=43.4 Q ss_pred HCCCEEEEECHHHHHHHHHHCCCCCCCCCCCCCEEEEEEC--CCCHHHHHHHHHHHHCCHHHHHHHHCCCC-CCCHHHHH Q ss_conf 5695899861579999973137421200157843787622--34215788788864100012232102411-07989999 Q gi|254780145|r 45 EAGGGVKVAKNRLVKIAIRDTSIRGISDLFVGQSLIVYSD--SPVIAPKISVSFSNDNNEFRVLGGVVEKG-VLNQDSIK 121 (172) Q Consensus 45 ~~~~~~~VvKNtL~k~A~~~~~~~~l~~~l~G~~a~~f~~--d~~~~aK~~~~f~k~~~~l~i~gg~~eg~-~l~~~~i~ 121 (172) ++|-++.-.++-++ .|.+.+|+..++ .+||+ |.- -|.++-.. +|-==|||| ++++.||+ T Consensus 166 p~G~ELfyAR~~i~-lAAR~AgI~AFD--------tVysdvNnee-------GF~~E~~L--ik~LGFDGKSLinPrQi~ 227 (288) T TIGR01588 166 PDGTELFYAREAIL-LAARAAGIAAFD--------TVYSDVNNEE-------GFLKEAQL--IKQLGFDGKSLINPRQIE 227 (288) T ss_pred CCCHHHHHHHHHHH-HHHHHCCCEEEE--------EEEECCCCHH-------HHHHHHHH--HHHCCCCCCCCCCHHHHH T ss_conf 76328999999999-998652614544--------5540476623-------57999999--986389873125626788 Q ss_pred HHHCC--CCHHHH Q ss_conf 98469--998999 Q gi|254780145|r 122 QIASL--PDLEGI 132 (172) Q Consensus 122 ~lakL--Ps~~el 132 (172) =+-+. ||..|. T Consensus 228 llh~vY~PT~KEi 240 (288) T TIGR01588 228 LLHKVYAPTEKEI 240 (288) T ss_pred HHHHHCCCCHHHH T ss_conf 7653148857788 No 20 >pfam10678 DUF2492 Protein of unknown function (DUF2492). This is a bacterial family of uncharacterized proteins. Probab=46.39 E-value=22 Score=16.58 Aligned_cols=48 Identities=13% Similarity=0.308 Sum_probs=35.3 Q ss_pred HHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHH---HHHHHHCCCEEEEE Q ss_conf 9999999999995089799998279988999999---99996569589986 Q gi|254780145|r 6 KSVEISELSKIFSSSGSIVVAHYKGISVAQIKDL---RKKMREAGGGVKVA 53 (172) Q Consensus 6 K~~~V~~l~~~l~~s~~~~v~~y~gltv~~~~~l---R~~lr~~~~~~~Vv 53 (172) |.++++.+.+.|....-+.-+.-.||++.++-+| |.++.+.+..+.+. T Consensus 22 ~~~L~~~i~~~FG~~arFhTCSae~mta~eLi~FL~~kgKfi~~~~g~t~~ 72 (78) T pfam10678 22 REELKEAIAQEFGEEARFHTCSAEGLTADELIEFLLKKGKFIESEGGFTTN 72 (78) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHCCCEEECCCCEEEC T ss_conf 999999999985866567665614598999999999769876528977863 No 21 >TIGR03675 arCOG00543 arCOG01782 universal archaeal KH-domain/beta-lactamase-domain protein. This family of proteins is universal in the archaea and consistsof an N-terminal type-1 KH-domain (pfam00013) a central beta-lactamase-domain (pfam00753) with a C-terminal motif associated with RNA metabolism (pfam07521). KH-domains are associated with RNA-binding, so taken together, this protein is a likely metal-dependent RNAase. This family was defined in as arCOG01782. Probab=39.60 E-value=23 Score=16.37 Aligned_cols=23 Identities=9% Similarity=0.290 Sum_probs=11.4 Q ss_pred HHHHHHHHHHHHHCCEEEEEEEC Q ss_conf 99999999999508979999827 Q gi|254780145|r 7 SVEISELSKIFSSSGSIVVAHYK 29 (172) Q Consensus 7 ~~~V~~l~~~l~~s~~~~v~~y~ 29 (172) ..+.+.+.+.+++-..+++=-|. T Consensus 389 ~~l~~~I~et~~rGGkVLIPaFA 411 (630) T TIGR03675 389 KELIKVVNETIKRGGKVLIPVFA 411 (630) T ss_pred HHHHHHHHHHHHCCCEEEEEEEC T ss_conf 99999999999639829998633 No 22 >CHL00101 trpG anthranilate synthase component 2 Probab=39.25 E-value=30 Score=15.63 Aligned_cols=79 Identities=15% Similarity=0.179 Sum_probs=41.7 Q ss_pred EEEEEEECCCCHHHHHH-HHHHHHHCCCEEEEECHHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCHH-----HHHHHH Q ss_conf 79999827998899999-999996569589986157999997313742120015784378762234215-----788788 Q gi|254780145|r 22 SIVVAHYKGISVAQIKD-LRKKMREAGGGVKVAKNRLVKIAIRDTSIRGISDLFVGQSLIVYSDSPVIA-----PKISVS 95 (172) Q Consensus 22 ~~~v~~y~gltv~~~~~-lR~~lr~~~~~~~VvKNtL~k~A~~~~~~~~l~~~l~G~~a~~f~~d~~~~-----aK~~~~ 95 (172) +++++|+-. +++. +-+.+|+.|+.+.|++|.- ...+.+.. ..|.++++|..|..+ .+.+.+ T Consensus 1 mILiiD~~d----sft~~i~r~lrelg~~~~v~~~d~-------~~~~~i~~--~~p~gIILS~GPg~p~~~~~~~~~~~ 67 (190) T CHL00101 1 MILIIDNYD----SFTYNLVQSLGELNSDLLVCRNDE-------IDLSKIKN--LNPRHIIISPGPGHPRDSGISLDVIS 67 (190) T ss_pred CEEEEECCC----HHHHHHHHHHHHCCCEEEEECCCC-------CCHHHHHH--CCCCEEEECCCCCCHHHCCCCHHHHH T ss_conf 999997888----089999999986899899986998-------99999970--79798999799999578576168999 Q ss_pred HHHHCCHHHHHHHHCCCCCC Q ss_conf 86410001223210241107 Q gi|254780145|r 96 FSNDNNEFRVLGGVVEKGVL 115 (172) Q Consensus 96 f~k~~~~l~i~gg~~eg~~l 115 (172) ... ....|.|-|+.-+++ T Consensus 68 ~~~--~~iPILGIClG~Q~i 85 (190) T CHL00101 68 SYA--PTIPILGVCLGHQSI 85 (190) T ss_pred HHH--CCCCEEEECHHHHHH T ss_conf 971--499878973988999 No 23 >PRK07765 para-aminobenzoate synthase component II; Provisional Probab=39.06 E-value=29 Score=15.74 Aligned_cols=45 Identities=18% Similarity=0.188 Sum_probs=28.0 Q ss_pred HHHHHHHHHCCCEEEEECHHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCH Q ss_conf 9999999656958998615799999731374212001578437876223421 Q gi|254780145|r 37 KDLRKKMREAGGGVKVAKNRLVKIAIRDTSIRGISDLFVGQSLIVYSDSPVI 88 (172) Q Consensus 37 ~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~~~~l~~~l~G~~a~~f~~d~~~ 88 (172) ..+-+.||+.|+...|++|.=... ..+......+.++++|..|.. T Consensus 14 ~ni~~~lr~lg~~v~V~~~d~~~~-------~~~~~~~~~~dgIILSpGPg~ 58 (221) T PRK07765 14 FNLVQYLGQLGVEAEVWRNDDVRL-------ADEAAVAAGFDGVLLSPGPGT 58 (221) T ss_pred HHHHHHHHHCCCEEEEEECCCCCH-------HHHHHHHCCCCEEEECCCCCC T ss_conf 999999987799189998998998-------899876348998999699998 No 24 >PRK08857 para-aminobenzoate synthase component II; Provisional Probab=38.28 E-value=27 Score=15.90 Aligned_cols=53 Identities=26% Similarity=0.383 Sum_probs=30.3 Q ss_pred EEEEEEC-CCCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCH Q ss_conf 9999827-99889999999999656958998615799999731374212001578437876223421 Q gi|254780145|r 23 IVVAHYK-GISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDTSIRGISDLFVGQSLIVYSDSPVI 88 (172) Q Consensus 23 ~~v~~y~-gltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~~~~l~~~l~G~~a~~f~~d~~~ 88 (172) ++++|+. -.| ..+-+.||+.|++++|++|.-+ ..+.+. -..|.++++|..|.. T Consensus 2 iLiiDn~DsfT----~ni~~~l~~lg~~v~vv~~d~~-------~~~~i~--~~~p~~IilS~GPg~ 55 (192) T PRK08857 2 LLMIDNYDSFT----YNLYQYFCELGAEVKVVRNDEI-------DIAGIE--ALNPSHLVISPGPCT 55 (192) T ss_pred EEEEECCCCHH----HHHHHHHHHCCCEEEEEECCCC-------CHHHHH--HCCCCEEEECCCCCC T ss_conf 99997999779----9999999877992899969989-------999998--429795999899999 No 25 >PRK12838 carbamoyl phosphate synthase small subunit; Reviewed Probab=36.72 E-value=33 Score=15.37 Aligned_cols=42 Identities=17% Similarity=0.202 Sum_probs=21.1 Q ss_pred CCEEEEEECCCCHH--HHHHHHHHHH-CCHHHHHHHHCCCCCCCH Q ss_conf 84378762234215--7887888641-000122321024110798 Q gi|254780145|r 76 GQSLIVYSDSPVIA--PKISVSFSND-NNEFRVLGGVVEKGVLNQ 117 (172) Q Consensus 76 G~~a~~f~~d~~~~--aK~~~~f~k~-~~~l~i~gg~~eg~~l~~ 117 (172) .|-++++|+.|..+ .....+..++ ..++.+-|-|+..+++.- T Consensus 208 ~pDGiflSNGPGDP~~~~~~i~~vr~l~~~~PifGICLGHQllal 252 (356) T PRK12838 208 NPDGIVLSNGPGDPKELQPYLPTIKDLASSYPILGICLGHQLIAL 252 (356) T ss_pred CCCEEEECCCCCCHHHHHHHHHHHHHHHCCCCEEEECHHHHHHHH T ss_conf 974899438998968878899999999749888997488999999 No 26 >PRK08007 para-aminobenzoate synthase component II; Provisional Probab=34.89 E-value=33 Score=15.32 Aligned_cols=79 Identities=20% Similarity=0.286 Sum_probs=38.8 Q ss_pred EEEEEECC-CCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCHH--HHHHHHHHHH Q ss_conf 99998279-98899999999996569589986157999997313742120015784378762234215--7887888641 Q gi|254780145|r 23 IVVAHYKG-ISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDTSIRGISDLFVGQSLIVYSDSPVIA--PKISVSFSND 99 (172) Q Consensus 23 ~~v~~y~g-ltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~~~~l~~~l~G~~a~~f~~d~~~~--aK~~~~f~k~ 99 (172) ++++|+.. .| ..+-+.||+.|+++.|++|.-. ..+.+.. -.|.+++.|..|..+ +....+..+. T Consensus 2 ILiIDn~DsFT----~ni~~~l~~~g~~v~V~~~d~~-------~~~~i~~--~~p~~IvlSpGPg~P~~~~~~~~~i~~ 68 (187) T PRK08007 2 ILLIDNYDSFT----WNLYQYFCELGADVLVKRNDAL-------TLADIDA--LKPQKIVISPGPCTPDEAGISLDVIRH 68 (187) T ss_pred EEEEECCCCHH----HHHHHHHHHCCCEEEEEECCCC-------CHHHHHH--CCCCEEEECCCCCCHHHCCCCHHHHHH T ss_conf 99996898189----9999999877997999938989-------9999984--298979999999995890840468997 Q ss_pred C-CHHHHHHHHCCCCC Q ss_conf 0-00122321024110 Q gi|254780145|r 100 N-NEFRVLGGVVEKGV 114 (172) Q Consensus 100 ~-~~l~i~gg~~eg~~ 114 (172) . .++.+.|-|+.-++ T Consensus 69 ~~~~iPiLGIClG~Q~ 84 (187) T PRK08007 69 YAGRLPILGVCLGHQA 84 (187) T ss_pred HCCCCCEEEECHHHHH T ss_conf 5389988998799999 No 27 >PRK07649 para-aminobenzoate/anthranilate synthase glutamine amidotransferase component II; Validated Probab=34.64 E-value=35 Score=15.16 Aligned_cols=80 Identities=18% Similarity=0.273 Sum_probs=39.8 Q ss_pred EEEEEECC-CCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCHH--HHHHHHHHHH Q ss_conf 99998279-98899999999996569589986157999997313742120015784378762234215--7887888641 Q gi|254780145|r 23 IVVAHYKG-ISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDTSIRGISDLFVGQSLIVYSDSPVIA--PKISVSFSND 99 (172) Q Consensus 23 ~~v~~y~g-ltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~~~~l~~~l~G~~a~~f~~d~~~~--aK~~~~f~k~ 99 (172) ++++|+.. .| ..+.+.||+.|.++.|++|.-. ..+.+.+ ..|.+++.|..|..+ +....+..+. T Consensus 2 ILlIDn~DsFT----~ni~~~l~~~g~~~~v~~~d~~-------~~~~i~~--~~pd~IiLSpGPg~p~~~~~~~~~~~~ 68 (195) T PRK07649 2 ILMIDNYDSFT----YNLVQYLGELGQELVVKRNDEV-------TISDIEN--MKPDFLMISPGPCSPNEAGISMEVIRY 68 (195) T ss_pred EEEEECCCCHH----HHHHHHHHHCCCEEEEECCCCC-------CHHHHHH--CCCCEEEECCCCCCHHHCCCCHHHHHH T ss_conf 99995898079----9999999978997999889968-------9999984--198989988999995784761467997 Q ss_pred C-CHHHHHHHHCCCCCC Q ss_conf 0-001223210241107 Q gi|254780145|r 100 N-NEFRVLGGVVEKGVL 115 (172) Q Consensus 100 ~-~~l~i~gg~~eg~~l 115 (172) . .++.|.|-++.-+.+ T Consensus 69 ~~~~iPILGIClG~Q~i 85 (195) T PRK07649 69 FAGKIPIFGVCLGHQSI 85 (195) T ss_pred HCCCCCEEEECHHHHHH T ss_conf 52899784302999999 No 28 >TIGR02902 spore_lonB ATP-dependent protease LonB; InterPro: IPR014251 This entry represents LonB, a paralog of the ATP-dependent protease La (LonA, IPR004815 from INTERPRO). LonB proteins are unassigned peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SJ) and are found strictly, and almost universally, in endospore-forming bacteria. This protease was shown, in Bacillus subtilis, to be expressed specifically in the forespore during sporulation, under control of sigmaF . The lonB gene, despite being located immediately upstream of lonA, was shown to be monocistronic. LonB appears to be involved in the post-translation control of sigmaH, but lonB mutation did not produce an obvious sporulation defect under the conditions tested . Note that additional paralogs of LonA and LonB occur in the Clostridium lineage and these are excluded from this entry. . Probab=34.43 E-value=22 Score=16.49 Aligned_cols=61 Identities=18% Similarity=0.341 Sum_probs=42.5 Q ss_pred EEECHHHHHHHHHHCCCCCCCCCCCCCE---EEEEECCC---CHHHHHHHHHHHHCCH--HHHHHHHCC Q ss_conf 9861579999973137421200157843---78762234---2157887888641000--122321024 Q gi|254780145|r 51 KVAKNRLVKIAIRDTSIRGISDLFVGQS---LIVYSDSP---VIAPKISVSFSNDNNE--FRVLGGVVE 111 (172) Q Consensus 51 ~VvKNtL~k~A~~~~~~~~l~~~l~G~~---a~~f~~d~---~~~aK~~~~f~k~~~~--l~i~gg~~e 111 (172) ++=+++.-.+-=.+-|++.|.-.|=||| .++|+.=- .+||+.+.+++|+++. |+-.+.++| T Consensus 58 k~RP~SF~EIiGQe~GI~ALKAALCGPNPQHVIiYGPPGVGKTAAARLVLeeAKk~~~SPFke~A~FVE 126 (532) T TIGR02902 58 KTRPKSFDEIIGQEEGIKALKAALCGPNPQHVIIYGPPGVGKTAAARLVLEEAKKNPASPFKEEAAFVE 126 (532) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCCCEEEE T ss_conf 667776332567355689999860686896389878869617899999999865087537898866898 No 29 >pfam11576 DUF3236 Protein of unknown function (DUF3236). This family of proteins with unknown function appears to be restricted to Methanobacteria. Probab=33.56 E-value=37 Score=15.04 Aligned_cols=54 Identities=17% Similarity=0.224 Sum_probs=42.2 Q ss_pred HHHHHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECHH Q ss_conf 489999999999999508979999827998899999999996569589986157 Q gi|254780145|r 3 RQGKSVEISELSKIFSSSGSIVVAHYKGISVAQIKDLRKKMREAGGGVKVAKNR 56 (172) Q Consensus 3 r~~K~~~V~~l~~~l~~s~~~~v~~y~gltv~~~~~lR~~lr~~~~~~~VvKNt 56 (172) +-.|.+.|+.+++.+.+++-++|...++-...-.++.-.++-...+++--+..+ T Consensus 18 ~GD~~EEve~Iq~~I~sAk~ivV~t~N~kKf~vin~il~~~~~~~~~ml~i~Tn 71 (154) T pfam11576 18 RGDKEEEVEEIQEYIKSAKRIVVPTNNEKKFKVVNDVLRRFGLPEAEMLQIPTN 71 (154) T ss_pred CCCHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHCCCCCCEEEECCCC T ss_conf 465399999999998515649985598345689999999737975104664686 No 30 >PRK05637 anthranilate synthase component II; Provisional Probab=32.71 E-value=38 Score=14.96 Aligned_cols=55 Identities=18% Similarity=0.194 Sum_probs=33.2 Q ss_pred CEEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHCCCCCCCCCCCCCEEEEEECCCCHH Q ss_conf 979999827998899999999996569589986157999997313742120015784378762234215 Q gi|254780145|r 21 GSIVVAHYKGISVAQIKDLRKKMREAGGGVKVAKNRLVKIAIRDTSIRGISDLFVGQSLIVYSDSPVIA 89 (172) Q Consensus 21 ~~~~v~~y~gltv~~~~~lR~~lr~~~~~~~VvKNtL~k~A~~~~~~~~l~~~l~G~~a~~f~~d~~~~ 89 (172) +.+++=+|.-.| ..|..-++..|....|++|+.-- +.+.. ..|.+++.|..|..+ T Consensus 3 ~ILlIDnyDSFT----~Nl~~~l~~~g~~v~V~rn~~~~--------~~~~~--~~pd~ivlSPGPg~P 57 (208) T PRK05637 3 HVVLIDNHDSFV----YNLVDAFAVAGYKCTVFRNTVPV--------ETILA--ANPDLICLSPGPGYP 57 (208) T ss_pred EEEEEECCCCHH----HHHHHHHHHCCCEEEEEECCCCH--------HHHHH--CCCCEEEECCCCCCH T ss_conf 799994889759----99999998679907999699999--------99985--199959999999995 No 31 >cd01492 Aos1_SUMO Ubiquitin activating enzyme (E1) subunit Aos1. Aos1 is part of the heterodimeric activating enzyme (E1), specific for the SUMO family of ubiquitin-like proteins (Ubls). E1 enzymes are part of a conjugation cascade to attach Ub or Ubls, covalently to substrate proteins consisting of activating (E1), conjugating (E2), and/or ligating (E3) enzymes. E1 activates ubiquitin by C-terminal adenylation, and subsequently forms a highly reactive thioester bond between its catalytic cysteine and Ubls C-terminus. The E1 also associates with E2 and promotes ubiquitin transfer to the E2's catalytic cysteine. Post-translational modification by SUMO family of ubiquitin-like proteins (Ublps) is involved in cell division, nuclear transport, the stress response and signal transduction. Aos1 contains part of the adenylation domain. Probab=31.00 E-value=40 Score=14.77 Aligned_cols=62 Identities=10% Similarity=-0.000 Sum_probs=41.7 Q ss_pred CCCCCCCEEEEEECCCCHHHHHHHHHHHHCCHHHHHHHH--CCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 001578437876223421578878886410001223210--241107989999984699989999999999998999999 Q gi|254780145|r 71 SDLFVGQSLIVYSDSPVIAPKISVSFSNDNNEFRVLGGV--VEKGVLNQDSIKQIASLPDLEGIRAGIISAIQSNATRLV 148 (172) Q Consensus 71 ~~~l~G~~a~~f~~d~~~~aK~~~~f~k~~~~l~i~gg~--~eg~~l~~~~i~~lakLPs~~el~a~lv~~l~~p~~~l~ 148 (172) .+++.+-..++-+.|+...-..+.++..++...-+.|++ ++|.++. . +-+|..+++ T Consensus 105 ~~~i~~~D~Vvd~~dn~~~r~~iN~~c~~~~iplI~g~~~g~~G~vf~--------d--------------v~~pv~gvi 162 (197) T cd01492 105 EEFFSQFDVVVATELSRAELVKINELCRKLGVKFYATGVHGLFGFVFA--------D--------------LLAPVAAVV 162 (197) T ss_pred HHHHHCCCEEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCEEEEEC--------C--------------CCCCHHHHH T ss_conf 999828999999999999999999999981997899981375523676--------8--------------773366899 Q ss_pred HHHHHH Q ss_conf 999776 Q gi|254780145|r 149 RLLGTP 154 (172) Q Consensus 149 ~~L~~~ 154 (172) ++|++. T Consensus 163 Gsl~A~ 168 (197) T cd01492 163 GGILAQ 168 (197) T ss_pred HHHHHH T ss_conf 999999 No 32 >pfam10126 Nit_Regul_Hom Uncharacterized protein, homolog of nitrogen regulatory protein PII. This domain, found in various hypothetical archaeal proteins, has no known function. Probab=30.00 E-value=30 Score=15.64 Aligned_cols=25 Identities=20% Similarity=0.292 Sum_probs=20.1 Q ss_pred HHCCEEEEEEECCCCHHHHHHHHHH Q ss_conf 5089799998279988999999999 Q gi|254780145|r 18 SSSGSIVVAHYKGISVAQIKDLRKK 42 (172) Q Consensus 18 ~~s~~~~v~~y~gltv~~~~~lR~~ 42 (172) .+-..+++.+|+|++..+|..|+.+ T Consensus 26 ~gItGFyl~eYkGmSP~~wkgf~~~ 50 (110) T pfam10126 26 VGITGFYLYEYKGMSPDDWKGFSLR 50 (110) T ss_pred CCCCEEEEEEECCCCHHHHCCCCHH T ss_conf 5853688355558886774475131 No 33 >pfam08800 VirE_N VirE N-terminal domain. This presumed domain is found at the N-terminus of VirE proteins. Probab=28.03 E-value=45 Score=14.44 Aligned_cols=32 Identities=9% Similarity=0.208 Sum_probs=27.8 Q ss_pred HHHHHCCEEEEEEECCCCHHHHHHHHHHHHHC Q ss_conf 99950897999982799889999999999656 Q gi|254780145|r 15 KIFSSSGSIVVAHYKGISVAQIKDLRKKMREA 46 (172) Q Consensus 15 ~~l~~s~~~~v~~y~gltv~~~~~lR~~lr~~ 46 (172) ..+...+.++++|+.+++..++..+|..+... T Consensus 25 ~~l~~~sgli~lDiD~l~~ee~~~~r~~l~~d 56 (136) T pfam08800 25 KNMKAYNGLVCLDVDHLAKEELERVKQKAAAL 56 (136) T ss_pred HHHHHCCCEEEEEECCCCHHHHHHHHHHHHCC T ss_conf 45763688089982899989999999998469 No 34 >PRK05414 urocanate hydratase; Provisional Probab=27.28 E-value=47 Score=14.35 Aligned_cols=60 Identities=20% Similarity=0.322 Sum_probs=39.4 Q ss_pred ECCCCHHHHHHHHH----------------------HHHHCCCEEEEECHHHHHHHHHH----C-CCCCCCCCC------ Q ss_conf 27998899999999----------------------99656958998615799999731----3-742120015------ Q gi|254780145|r 28 YKGISVAQIKDLRK----------------------KMREAGGGVKVAKNRLVKIAIRD----T-SIRGISDLF------ 74 (172) Q Consensus 28 y~gltv~~~~~lR~----------------------~lr~~~~~~~VvKNtL~k~A~~~----~-~~~~l~~~l------ 74 (172) -.|+|..|.++||. ++.+.|+.+--|-|++...|.+. + .|+...|.+ T Consensus 276 P~G~t~EEa~~lr~~dP~~f~~~v~~Sm~rHv~Aml~l~~~G~~~FDYGNn~r~~A~~aG~~~aF~~PgfVp~yirPlFc 355 (558) T PRK05414 276 PQGWTLEEARELRKEDPEKYVKAAKASMARHVEAMLAFQARGAYTFDYGNNIRQMAFDAGVKNAFDFPGFVPAYIRPLFC 355 (558) T ss_pred CCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHH T ss_conf 89999999999988798999999999999999999999987986886666899999975832367787617886103553 Q ss_pred --CCCEEEE-EECCCC Q ss_conf --7843787-622342 Q gi|254780145|r 75 --VGQSLIV-YSDSPV 87 (172) Q Consensus 75 --~G~~a~~-f~~d~~ 87 (172) .||==++ .|.||. T Consensus 356 ~G~GPFRWValSGdpe 371 (558) T PRK05414 356 EGIGPFRWVALSGDPE 371 (558) T ss_pred CCCCCCEEEECCCCHH T ss_conf 3778732355379878 No 35 >COG1782 Predicted metal-dependent RNase, consists of a metallo-beta-lactamase domain and an RNA-binding KH domain [General function prediction only] Probab=26.53 E-value=48 Score=14.27 Aligned_cols=25 Identities=16% Similarity=0.242 Sum_probs=10.5 Q ss_pred HHHHHHHHHHHHHCCEEEEEEECCC Q ss_conf 9999999999950897999982799 Q gi|254780145|r 7 SVEISELSKIFSSSGSIVVAHYKGI 31 (172) Q Consensus 7 ~~~V~~l~~~l~~s~~~~v~~y~gl 31 (172) .+.-+.+.+.+...-.+.-++|.|- T Consensus 8 ~ei~~~i~~~~p~~~~It~vefEGP 32 (637) T COG1782 8 EEIRNKINEILPSDVKITDVEFEGP 32 (637) T ss_pred HHHHHHHHHHCCCCCEEEEEEECCC T ss_conf 9999999975877660788871386 No 36 >PRK09522 bifunctional anthranilate synthase/anthranilate phosphoribosyltransferase; Provisional Probab=25.47 E-value=50 Score=14.14 Aligned_cols=11 Identities=27% Similarity=0.444 Sum_probs=4.1 Q ss_pred HCCCEEEEECH Q ss_conf 56958998615 Q gi|254780145|r 45 EAGGGVKVAKN 55 (172) Q Consensus 45 ~~~~~~~VvKN 55 (172) ..|....|++| T Consensus 23 ~~g~~v~V~rn 33 (531) T PRK09522 23 SNGHNVVIYRN 33 (531) T ss_pred HCCCCEEEEEC T ss_conf 66995699969 No 37 >KOG3218 consensus Probab=22.65 E-value=57 Score=13.79 Aligned_cols=55 Identities=7% Similarity=0.241 Sum_probs=37.6 Q ss_pred CCCEEEEEECCCCHHHHHHHHHHHHCCHHHHHHHH-CCCCCCCHHHHHHHHCCCCH Q ss_conf 78437876223421578878886410001223210-24110798999998469998 Q gi|254780145|r 75 VGQSLIVYSDSPVIAPKISVSFSNDNNEFRVLGGV-VEKGVLNQDSIKQIASLPDL 129 (172) Q Consensus 75 ~G~~a~~f~~d~~~~aK~~~~f~k~~~~l~i~gg~-~eg~~l~~~~i~~lakLPs~ 129 (172) .+...+.|.++|....|.+..+.-....-.+.+|+ +-..-+++.-...++.+|++ T Consensus 67 ~~ki~V~F~~~~kvgvk~~k~~~~~~~~~ni~~~IlV~q~~mt~~A~k~i~~~~p~ 122 (208) T KOG3218 67 TDKIYVFFPEEPKVGVKTMKTYVIQMQSENIFRAILVVQNGMTPSALKALSDFTPK 122 (208) T ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHHHCCCC T ss_conf 67289994899842578999999998745723799996187996999888724976 No 38 >pfam01175 Urocanase Urocanase. Probab=22.59 E-value=57 Score=13.79 Aligned_cols=60 Identities=22% Similarity=0.290 Sum_probs=39.2 Q ss_pred ECCCCHHHHHHHHH----------------------HHHHCCCEEEEECHHHHHHHHHH----C-CCCCCCCCC------ Q ss_conf 27998899999999----------------------99656958998615799999731----3-742120015------ Q gi|254780145|r 28 YKGISVAQIKDLRK----------------------KMREAGGGVKVAKNRLVKIAIRD----T-SIRGISDLF------ 74 (172) Q Consensus 28 y~gltv~~~~~lR~----------------------~lr~~~~~~~VvKNtL~k~A~~~----~-~~~~l~~~l------ 74 (172) -.|+|..|.++||. ++.+.|+.+--|-|++...|.+. + .|+...|.+ T Consensus 265 P~G~s~eEa~~lr~~dP~~f~~~v~~Sm~rHV~Aml~l~~~G~~~FDYGNn~r~~A~~aG~~~aF~~PgfVp~yirPlFc 344 (545) T pfam01175 265 PAGYTLEEAAELRKSDPEEFVKLARESMARHVEAMLEFQARGAEVFDYGNNIRQEAFDAGVENAFDFPGFVPAYIRPLFC 344 (545) T ss_pred CCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCHHHHHHHHHCCCHHHCCCCCCHHHHHCHHHH T ss_conf 89999999999987699999999999999999999999987986886666899999975825332687507887122654 Q ss_pred --CCCEEEE-EECCCC Q ss_conf --7843787-622342 Q gi|254780145|r 75 --VGQSLIV-YSDSPV 87 (172) Q Consensus 75 --~G~~a~~-f~~d~~ 87 (172) .||==++ .|.||. T Consensus 345 ~G~GPFRWvalSGdpe 360 (545) T pfam01175 345 EGKGPFRWVALSGDPE 360 (545) T ss_pred CCCCCCEEEECCCCHH T ss_conf 2778732455379878 No 39 >PRK11479 hypothetical protein; Provisional Probab=21.17 E-value=61 Score=13.60 Aligned_cols=33 Identities=15% Similarity=0.178 Sum_probs=28.5 Q ss_pred HHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHH Q ss_conf 999999508979999827998899999999996 Q gi|254780145|r 12 ELSKIFSSSGSIVVAHYKGISVAQIKDLRKKMR 44 (172) Q Consensus 12 ~l~~~l~~s~~~~v~~y~gltv~~~~~lR~~lr 44 (172) .+.+.++.++.++++.+-|||..+.+.||.=-. T Consensus 114 ~L~~ai~~sn~~valR~p~LTp~~a~klr~fa~ 146 (273) T PRK11479 114 SLKKAIKHSDKLFVLRVPDLTPQQATKITAFAN 146 (273) T ss_pred EHHHHHHHHCCEEEEECCCCCHHHHHHHHHHHH T ss_conf 889976200658999759999799999999997 No 40 >TIGR02739 TraF type-F conjugative transfer system pilin assembly protein TraF; InterPro: IPR014110 This entry includes TraF; a protein that is part of a large group of proteins involved in conjugative transfer of plasmid DNA, specifically in the F-type system. TraF has been predicted to contain a thioredoxin fold and has been shown to be localized to the periplasm . Unlike the related protein TrbB (IPR014109 from INTERPRO), TraF does not contain a conserved pair of cysteines and has been shown not to function as a thiol disulphide isomerase by complementation of an Escherichia coli DsbA defect . The protein is believed to be involved in pilin assembly . Even more closely related than TrbB is a clade of genes (IPR014111 from INTERPRO), which do contain the CXXC motif, but it is unclear whether these are involved in type-F conjugation systems per se.. Probab=21.13 E-value=41 Score=14.75 Aligned_cols=49 Identities=10% Similarity=0.194 Sum_probs=34.0 Q ss_pred CCCCCCCCCCC-EEEEE---ECCCCH--HHHHHHHHHHHCCHHHHHHHHCCCCCCC Q ss_conf 42120015784-37876---223421--5788788864100012232102411079 Q gi|254780145|r 67 IRGISDLFVGQ-SLIVY---SDSPVI--APKISVSFSNDNNEFRVLGGVVEKGVLN 116 (172) Q Consensus 67 ~~~l~~~l~G~-~a~~f---~~d~~~--~aK~~~~f~k~~~~l~i~gg~~eg~~l~ 116 (172) -+.....+.+. -+++| |.+|+. -|-++.+|+|++. +.+.+-.+||++.+ T Consensus 150 ~~~aI~~l~~~G~Gl~FFYrG~sp~s~~~a~~i~~F~~~y~-~~vi~vS~DG~~~p 204 (270) T TIGR02739 150 KEKAIEQLSQSGYGLFFFYRGKSPISQKLAPVIQAFAKEYG-ISVIPVSVDGTLIP 204 (270) T ss_pred HHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHHCC-CEEEEEECCCCCCC T ss_conf 99999999762762478865998456789999998885419-66756612432054 Done!