Query gi|254780815|ref|YP_003065228.1| hypothetical protein CLIBASIA_03530 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 232 No_of_seqs 133 out of 925 Neff 7.9 Searched_HMMs 39220 Date Sun May 29 17:49:11 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780815.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG4395 Uncharacterized protei 100.0 9.3E-43 0 291.7 18.0 229 4-232 42-281 (281) 2 pfam04280 Tim44 Tim44-like dom 100.0 1.5E-39 3.8E-44 271.3 17.9 146 85-230 1-146 (146) 3 KOG2580 consensus 99.9 4.4E-24 1.1E-28 173.0 12.2 147 81-227 298-449 (459) 4 TIGR00984 3a0801s03tim44 mitoc 99.9 2.2E-22 5.6E-27 162.2 9.3 148 80-227 229-385 (385) 5 pfam07961 MBA1 MBA1-like prote 97.6 0.0031 7.9E-08 40.5 12.2 125 98-228 67-198 (235) 6 KOG4599 consensus 97.2 4.3E-06 1.1E-10 58.6 -6.0 140 85-225 150-310 (379) 7 pfam11027 DUF2615 Protein of u 74.4 5.3 0.00014 19.9 4.1 16 9-24 55-70 (103) 8 pfam10108 Exon_PolB Predicted 69.9 5.2 0.00013 20.0 3.2 54 88-150 124-177 (211) 9 pfam07466 DUF1517 Protein of u 66.9 7.3 0.00019 19.1 3.4 120 6-148 54-178 (280) 10 PRK10533 putative lipoprotein; 66.0 9.4 0.00024 18.4 3.8 64 107-173 45-114 (171) 11 pfam12273 RCR Chitin synthesis 63.0 8.3 0.00021 18.7 3.1 24 12-36 7-30 (124) 12 PRK00068 hypothetical protein; 61.5 12 0.00032 17.6 9.3 10 220-229 499-508 (978) 13 cd05782 DNA_polB_like1_exo A s 61.1 10 0.00026 18.1 3.3 28 112-139 177-204 (208) 14 TIGR02957 SigX4 RNA polymerase 59.1 12 0.00032 17.6 3.4 30 107-136 168-197 (287) 15 PRK04598 tatA twin arginine tr 57.0 11 0.00027 18.0 2.8 26 6-31 1-29 (82) 16 pfam07446 GumN GumN protein. T 56.1 15 0.00039 17.0 4.7 30 106-135 173-202 (259) 17 PRK00442 tatA twin arginine tr 55.4 11 0.00028 17.9 2.7 25 6-30 1-28 (90) 18 PRK03554 tatA twin arginine tr 54.8 12 0.00031 17.7 2.8 26 6-31 1-29 (89) 19 PRK01833 tatA twin arginine tr 50.6 16 0.0004 17.0 2.8 25 6-30 1-28 (75) 20 COG3086 RseC Positive regulato 49.3 18 0.00045 16.6 2.9 28 11-38 108-135 (150) 21 pfam05223 MecA_N NTF2-like N-t 48.6 20 0.00051 16.2 9.7 28 107-134 3-30 (118) 22 pfam09252 Feld-I_B Allergen Fe 48.5 20 0.00052 16.2 3.4 31 106-138 31-61 (67) 23 pfam08332 CaMKII_AD Calcium/ca 46.8 21 0.00055 16.1 7.3 25 109-133 7-31 (128) 24 PRK01614 tatE twin arginine tr 44.7 23 0.00059 15.9 3.2 27 6-33 1-30 (85) 25 PRK09635 sigI RNA polymerase s 44.1 24 0.0006 15.8 4.2 48 86-135 157-204 (290) 26 pfam07366 SnoaL SnoaL-like pol 42.5 17 0.00042 16.8 1.9 25 111-135 4-28 (126) 27 PRK09636 RNA polymerase sigma 41.2 26 0.00067 15.5 3.9 44 89-134 153-196 (289) 28 TIGR02439 catechol_proteo cate 40.5 20 0.00052 16.2 2.1 27 176-212 128-154 (288) 29 PRK01371 sec-independent trans 40.4 20 0.0005 16.3 2.0 36 102-137 42-80 (135) 30 TIGR00448 rpoE DNA-directed RN 39.6 28 0.00071 15.4 3.9 84 124-217 20-103 (184) 31 PRK00182 tatB sec-independent 39.3 21 0.00053 16.2 2.0 21 2-24 1-21 (165) 32 PRK03100 sec-independent trans 38.1 23 0.00059 15.9 2.1 21 2-24 1-21 (131) 33 PRK08241 RNA polymerase factor 38.1 29 0.00074 15.2 2.8 31 106-136 218-248 (341) 34 COG1826 TatA Sec-independent p 34.7 31 0.00078 15.1 2.3 17 8-24 4-20 (94) 35 TIGR02465 chlorocat_1_2 chloro 33.8 31 0.0008 15.0 2.2 87 108-212 30-129 (258) 36 PRK03625 tatE twin arginine tr 32.8 35 0.00091 14.7 3.0 25 6-30 1-28 (67) 37 PRK06531 yajC preprotein trans 31.9 28 0.00071 15.3 1.7 10 6-15 1-10 (120) 38 PRK00191 tatA twin arginine tr 31.8 37 0.00094 14.6 2.6 19 6-24 1-19 (107) 39 TIGR03116 cas_csf3 CRISPR-asso 30.2 30 0.00075 15.2 1.6 21 112-132 136-158 (214) 40 PRK04561 tatA twin arginine tr 29.7 40 0.001 14.4 2.3 25 6-30 1-28 (75) 41 PRK02958 tatA twin arginine tr 28.7 41 0.0011 14.2 2.3 25 6-30 1-28 (75) 42 PRK05886 yajC preprotein trans 27.9 43 0.0011 14.2 2.6 23 6-28 1-23 (108) 43 pfam05961 Chordopox_A13L Chord 27.6 43 0.0011 14.1 4.0 16 20-35 12-27 (68) 44 TIGR03053 PS_I_psaM photosyste 26.7 45 0.0011 14.0 3.3 24 8-31 4-27 (29) 45 pfam02807 ATP-gua_PtransN ATP: 26.4 43 0.0011 14.2 1.8 19 123-141 7-25 (76) 46 KOG1585 consensus 25.8 47 0.0012 13.9 3.3 46 95-144 219-264 (308) 47 pfam07932 DAP_C D-aminopeptida 25.7 47 0.0012 13.9 2.6 24 206-229 47-73 (97) 48 PRK04098 sec-independent trans 25.6 47 0.0012 13.9 2.1 18 5-24 3-20 (153) 49 pfam07465 PsaM Photosystem I p 24.9 49 0.0012 13.8 2.9 23 7-29 3-25 (29) 50 pfam05261 Tra_M TraM protein. 24.7 49 0.0012 13.8 2.6 13 93-105 56-68 (127) 51 COG0753 KatE Catalase [Inorgan 24.5 49 0.0013 13.8 2.8 21 207-227 116-136 (496) 52 TIGR02096 TIGR02096 conserved 24.4 44 0.0011 14.1 1.6 25 112-136 5-30 (145) 53 PRK05996 motB flagellar motor 22.9 53 0.0013 13.6 2.2 16 15-30 38-53 (431) 54 TIGR02902 spore_lonB ATP-depen 22.4 54 0.0014 13.5 2.2 21 9-29 4-24 (532) 55 pfam11444 DUF2895 Protein of u 22.2 55 0.0014 13.5 2.0 37 121-157 87-124 (199) 56 PRK13428 F0F1 ATP synthase sub 22.0 55 0.0014 13.5 3.0 21 6-27 1-21 (445) 57 pfam08769 Spo0A_C Sporulation 21.8 56 0.0014 13.4 2.3 23 112-134 63-85 (106) 58 PRK12799 motB flagellar motor 21.8 56 0.0014 13.4 2.3 16 15-30 35-50 (424) 59 pfam11714 Inhibitor_I53 Thromb 21.5 36 0.00091 14.7 0.7 16 11-26 4-19 (78) 60 pfam09214 Prd1-P2 Bacteriophag 21.1 46 0.0012 14.0 1.2 16 209-224 189-204 (560) 61 TIGR02875 spore_0_A sporulatio 21.1 26 0.00067 15.5 -0.1 20 114-133 223-242 (270) 62 PRK01770 sec-independent trans 21.0 54 0.0014 13.5 1.5 17 8-24 4-20 (171) 63 TIGR01165 cbiN cobalt transpor 20.4 60 0.0015 13.3 4.6 22 13-34 9-30 (94) 64 pfam10694 DUF2500 Protein of u 20.1 60 0.0015 13.2 2.3 18 11-28 6-23 (107) No 1 >COG4395 Uncharacterized protein conserved in bacteria [Function unknown] Probab=100.00 E-value=9.3e-43 Score=291.65 Aligned_cols=229 Identities=23% Similarity=0.365 Sum_probs=188.2 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC-------CCCCCCCCCC---CCCCCC Q ss_conf 126702699999999999999999864154445444654333434444444444-------5444566665---443333 Q gi|254780815|r 4 IKMDSGDFLILLFAFITFFVFLQLRGVLGKKTGNEKPFSGFFSGKYVFSKRDER-------GIITLGKGKK---KDNLDS 73 (232) Q Consensus 4 ~~m~~~~i~Iii~A~IA~fl~~RLr~vLG~rtg~e~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~---~~~~~~ 73 (232) +||.+..+-.++++++++|+++.|+.++|.+++.+.++...+-++......... .........+ ...... T Consensus 42 ~gl~~~~~~~~f~gia~~~~~l~l~~lig~~~~~~~r~f~~~r~q~~a~~~~~~~~~~d~~v~~~~~~~~q~~~~~~~~~ 121 (281) T COG4395 42 MGLSGMFFGGLFFGIAAFLIFLLLITLIGFVIMLEMRFFDPYRPQRPAYDAAQASVAPDNYVTLPRAPEAQPASAADIPA 121 (281) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCHHHCCC T ss_conf 76899999989999999999999999998888875044455445665300257766765410367654568875222334 Q ss_pred CCC-CCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 333-3455420245777766126767989999999999999999984276789997318999999998777644349820 Q gi|254780815|r 74 INE-LFPIGTRLNKVMRDIVSVYTDFDPKDFLNEARNSYEAIVDSFFEGKVHAIEKLVDSRVYQDFNDSLSTRKSNEKNV 152 (232) Q Consensus 74 ~~~-~~~~~~~~~~~~~~i~~~~~~Fd~~~Fl~gAk~af~~I~~A~~~gD~~~L~~lls~~~~~~f~~~i~~r~~~~~~~ 152 (232) ... ..+......++++.+++.+|.|++.+||.+||.+|+|||+||..+|.++|++|+||++|+.|.+++.+|+..|.++ T Consensus 122 ~~~~~g~a~~~~~ag~~~v~~~~~~f~p~~fl~~a~~a~~~Iq~a~~~~D~~tL~~L~tpev~~~~~~e~~e~~~~G~~~ 201 (281) T COG4395 122 AGDAIGGAGGPLAAGARAVHNADPSFDPARFLNGARAAYEMIQQAYGAGDRKTLRELLTPEVMEYLEAEIAERESKGETN 201 (281) T ss_pred CCCCCCCCCCCCCCCHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHCCHHHHHHHHHHHHHHHHCCCCC T ss_conf 43334678853000136664078656626777789999999999860100999987558999999999976554227545 Q ss_pred EEEEEEHHHHHHHHHCCCCCEEEEEEEEEEEEEEEEECCCCCEECCCCCCCEEEEEEEEEEECCCCCCCCEEEEEEECCC Q ss_conf 23662000124542113464569999998887776676899680576578489999999996189899866895763279 Q gi|254780815|r 153 KSSLVGIDDMKIINASIEENTVYITIRIVGQFISASYDKDNLLISSDPEIFGKVIDIWTFVRNIPPSNPNWVLISTKLGE 232 (232) Q Consensus 153 ~~~~v~i~~~~i~~~~~~~~~~~itV~F~~~~i~~~~d~~g~vveG~~~~~~~~~e~WtF~R~~~~~~p~W~L~~iq~~~ 232 (232) +.++|++..++|..++.+++.+++||++.++.|+.++|+.|+|||||++.+.++.|+|||+|+.++++|||+|++||++| T Consensus 202 ~ssfv~~~~~di~~a~~~~~~~~atv~~~~~~i~~~~dr~G~vVdGd~~~~~e~~ElWTFtR~~~s~~p~W~LaaIq~~~ 281 (281) T COG4395 202 QSSFVTILQADIARADVEGDEDYATVAIRYQGIDVTRDRSGKVVDGDPDKPEEFAELWTFTRDTGSRDPNWKLAAIQQAE 281 (281) T ss_pred CCEECCHHHHHHHHCCCCCCCEEEEEEEEEEEEEEECCCCCCEECCCCCCCHHHHHHEEEECCCCCCCCCCEEEEEECCC T ss_conf 42000220544333134488037999988655310002567521388775213445204541488999880687642259 No 2 >pfam04280 Tim44 Tim44-like domain. Tim44 is an essential component of the machinery that mediates the translocation of nuclear-encoded proteins across the mitochondrial inner membrane. Tim44 is thought to bind phospholipids of the mitochondrial inner membrane both by electrostatic interactions and by penetrating the polar head group region. This family includes the C-terminal region of Tim44 that has been shown to form a stable proteolytic fragment in yeast. This region is also found in a set of smaller bacterial proteins. The molecular function of the bacterial members of this family is unknown but transport seems likely. The crystal structure of the C terminal of Tim44 has revealed a large hydrophobic pocket which might play an important role in interacting with the acyl chains of lipid molecules in the mitochondrial membrane. Probab=100.00 E-value=1.5e-39 Score=271.26 Aligned_cols=146 Identities=29% Similarity=0.591 Sum_probs=143.0 Q ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCHHHHHHHHHHHHHHHHCCCCCEEEEEEHHHHHH Q ss_conf 45777766126767989999999999999999984276789997318999999998777644349820236620001245 Q gi|254780815|r 85 NKVMRDIVSVYTDFDPKDFLNEARNSYEAIVDSFFEGKVHAIEKLVDSRVYQDFNDSLSTRKSNEKNVKSSLVGIDDMKI 164 (232) Q Consensus 85 ~~~~~~i~~~~~~Fd~~~Fl~gAk~af~~I~~A~~~gD~~~L~~lls~~~~~~f~~~i~~r~~~~~~~~~~~v~i~~~~i 164 (232) +.+++.+++.||+||+++||+|||.+|++||+||++||++.|++||||+||+.|+.+|++|...|.+++.+++++.+++| T Consensus 1 ~~~~~~i~~~dp~F~~~~Fl~~Ak~~f~~I~~A~~~~D~~~L~~~~s~~~~~~~~~~i~~r~~~g~~~~~~~v~i~~~~i 80 (146) T pfam04280 1 ASALAAIKAADPSFDVEEFLEGAREAYEMIQEAWANGDLETLRELLTPEVYEEFAAEIAEREAQGLTNDATIVGIRDAEL 80 (146) T ss_pred CHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEEEE T ss_conf 96899999878598989999999999999999998599889987648999999999999999769965408998745798 Q ss_pred HHHCCCCCEEEEEEEEEEEEEEEEECCCCCEECCCCCCCEEEEEEEEEEECCCCCCCCEEEEEEEC Q ss_conf 421134645699999988877766768996805765784899999999961898998668957632 Q gi|254780815|r 165 INASIEENTVYITIRIVGQFISASYDKDNLLISSDPEIFGKVIDIWTFVRNIPPSNPNWVLISTKL 230 (232) Q Consensus 165 ~~~~~~~~~~~itV~F~~~~i~~~~d~~g~vveG~~~~~~~~~e~WtF~R~~~~~~p~W~L~~iq~ 230 (232) .+++..++.+++||+|.++|++|++|++|+||+|+++++.+++|+|||+|++++++|+|+|++||+ T Consensus 81 ~~~~~~~~~~~itV~f~~~qi~~~~d~~G~iieG~~~~~~~~~e~W~F~R~~~~~~~~W~L~~i~q 146 (146) T pfam04280 81 LEAKMEGNTAVVTVRFVTQQISVTDDASGEVVEGDPDEPVEVTEVWTFERDLGSPDPNWRLVGIQQ 146 (146) T ss_pred EEEEECCCCEEEEEEEEEEEEEEEECCCCCEEECCCCCCEEEEEEEEEEECCCCCCCCEEEEEECC T ss_conf 665973990799999999999999969997972898872798899999976899797989998529 No 3 >KOG2580 consensus Probab=99.91 E-value=4.4e-24 Score=172.99 Aligned_cols=147 Identities=20% Similarity=0.411 Sum_probs=138.3 Q ss_pred CCHHHHHHHHHHHCCCCCCHHHHHHHHHHHH-HHHHHHHHCCCHHHHHHHHCHHHHHHHHHHHHHHHHCCCCCEEEEEEH Q ss_conf 4202457777661267679899999999999-999999842767899973189999999987776443498202366200 Q gi|254780815|r 81 GTRLNKVMRDIVSVYTDFDPKDFLNEARNSY-EAIVDSFFEGKVHAIEKLVDSRVYQDFNDSLSTRKSNEKNVKSSLVGI 159 (232) Q Consensus 81 ~~~~~~~~~~i~~~~~~Fd~~~Fl~gAk~af-~~I~~A~~~gD~~~L~~lls~~~~~~f~~~i~~r~~~~~~~~~~~v~i 159 (232) ..+..+.++.++.+||+||.++||+..+... ..|++||.+||++.|+.||+++.|+.+...+++..+.|...+..+++| T Consensus 298 ktE~Sev~tei~~iDPsF~~~~Flr~~ee~IiPnVLeAyvkGD~evLK~wcsea~~~~~aa~~keykk~gv~~d~kILdI 377 (459) T KOG2580 298 KTEMSEVLTEIKKIDPSFDKEDFLRECEEYIIPNVLEAYVKGDLEVLKKWCSEAPFSQLAAPIKEYKKHGVYFDSKILDI 377 (459) T ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCEEECCEEEEE T ss_conf 35788999999834888881899999998624799999985349999999856079888778999985570420325531 Q ss_pred HHHHHHHHCCCCCEEEEEEEEEEEEEEEEECCCCCEECCCCCCCEEEEEEEEEEECCCCCCC----CEEEEE Q ss_conf 01245421134645699999988877766768996805765784899999999961898998----668957 Q gi|254780815|r 160 DDMKIINASIEENTVYITIRIVGQFISASYDKDNLLISSDPEIFGKVIDIWTFVRNIPPSNP----NWVLIS 227 (232) Q Consensus 160 ~~~~i~~~~~~~~~~~itV~F~~~~i~~~~d~~g~vveG~~~~~~~~~e~WtF~R~~~~~~p----~W~L~~ 227 (232) +.++|...++.++.+.+.|.|.+++|+|+||++|+|+||+|+++..++.+|.|+|+...=|| .|+|.+ T Consensus 378 ~~Vdia~~KmM~d~PVlIitFqaQeI~~vRd~~GevveGd~d~i~~v~y~wvl~rd~~El~~d~~~~WRLlE 449 (459) T KOG2580 378 RGVDIASGKMMEDGPVLIITFQAQEIMCVRDAKGEVVEGDPDKILRVYYAWVLCRDQDELNPDEYAAWRLLE 449 (459) T ss_pred CCCHHHHHHHHCCCCEEEEEEEEEEEEEEECCCCCEECCCCCCEEEEEEEEEEECCHHHCCCCHHHHHHHHH T ss_conf 341267766644687799997547999998688765138987515677653135368661961224689999 No 4 >TIGR00984 3a0801s03tim44 mitochondrial import inner membrane, translocase subunit; InterPro: IPR005682 The mitochondrial protein translocase (MPT) family, which brings nuclearly encoded preproteins into mitochondria, is very complex with 19 currently identified protein constituents. These proteins include several chaperone proteins, four proteins of the outer membrane translocase (Tom) import receptor, five proteins of the Tom channel complex, five proteins of the inner membrane translocase (Tim) and three "motor" proteins.The inner membrane translocase is formed of a complex with a number of proteins, including the Tim17, Tim23 and Tim44 subunits. Tim17 and Tim23 are thought to form the translocation channel of the inner membrane.; GO: 0015450 P-P-bond-hydrolysis-driven protein transmembrane transporter activity, 0006886 intracellular protein transport, 0005744 mitochondrial inner membrane presequence translocase complex. Probab=99.88 E-value=2.2e-22 Score=162.18 Aligned_cols=148 Identities=17% Similarity=0.349 Sum_probs=137.4 Q ss_pred CCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHH-HHHHHHHHCCCHHHHHHHHCHHHHHHHHHHHHHHHHCCCCCE-EEEE Q ss_conf 54202457777661267679899999999999-999999842767899973189999999987776443498202-3662 Q gi|254780815|r 80 IGTRLNKVMRDIVSVYTDFDPKDFLNEARNSY-EAIVDSFFEGKVHAIEKLVDSRVYQDFNDSLSTRKSNEKNVK-SSLV 157 (232) Q Consensus 80 ~~~~~~~~~~~i~~~~~~Fd~~~Fl~gAk~af-~~I~~A~~~gD~~~L~~lls~~~~~~f~~~i~~r~~~~~~~~-~~~v 157 (232) ..++....++.++.+||+||.++||+-.+.-- +.|++||-+||++.|+.||++..|+.+...+++..+.|.... .+++ T Consensus 229 s~tE~S~Vl~~~~~iDp~F~~~~Fl~~l~~~IvP~ilEAyv~Gd~~vLk~W~~Ea~y~~l~~~~k~y~~~~~~~~dsrIL 308 (385) T TIGR00984 229 SETEVSEVLTEIKKIDPTFDKEHFLRYLREYIVPEILEAYVKGDLEVLKKWCSEAPYSVLATTVKEYKKLGVVAKDSRIL 308 (385) T ss_pred HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCEEEEEECCHHHHHHHCCCCCHHHHHHHHHHHHHCCCEECCCEEE T ss_conf 55347899999852389988357898852215863021001215688620204576035668999998759355264586 Q ss_pred EHHHHHHHHHCCCCCE--EEEEEEEEEEEEEEEECCC-CCEECCCCCCCEEEEEEEEEEECCCCCC--C--CEEEEE Q ss_conf 0001245421134645--6999999888777667689-9680576578489999999996189899--8--668957 Q gi|254780815|r 158 GIDDMKIINASIEENT--VYITIRIVGQFISASYDKD-NLLISSDPEIFGKVIDIWTFVRNIPPSN--P--NWVLIS 227 (232) Q Consensus 158 ~i~~~~i~~~~~~~~~--~~itV~F~~~~i~~~~d~~-g~vveG~~~~~~~~~e~WtF~R~~~~~~--p--~W~L~~ 227 (232) .|+.++|...++.... +.+.|-|.++.|++++|+. |+|++|||++|+...++|.|+|+...=| | .|+|.+ T Consensus 309 DI~~Vd~~~~kll~~~DIPVLiv~F~~q~v~~~k~~~~Gev~~Gd~d~~~~~~Y~~vltRd~eeld~~e~~aWklld 385 (385) T TIGR00984 309 DIRGVDIASGKLLEQGDIPVLIVTFRAQEVNVIKNAKSGEVVAGDPDNIQRISYAWVLTRDEEELDDDETEAWKLLD 385 (385) T ss_pred ECCCHHHHHHHHCCCCCCCEEEEEEEEEEEEEEEECCCCCEECCCCCCCCEEEEEEEEECCHHHCCCCHHHCCCCCC T ss_conf 11526799876517898227999878999999972787510148976001143466883564332787330340369 No 5 >pfam07961 MBA1 MBA1-like protein. Mba1 is an inner membrane protein that is part of the mitochondrial protein export machinery. It binds to the large subunit of mitochondrial ribosomes and cooperates with the C-terminal ribosome-binding domain of Oxa1, which is a central component of the insertion machinery of the inner membrane. In the absence of both Mba1 and the C-terminus of Oxa1, mitochondrial translation products fail to be properly inserted into the inner membrane and serve as substrates of the matrix chaperone Hsp70. It is proposed that Mba1 functions as a ribosome receptor that cooperates with Oxa1 in the positioning of the ribosome exit site to the insertion machinery of the inner membrane. Probab=97.58 E-value=0.0031 Score=40.47 Aligned_cols=125 Identities=14% Similarity=0.173 Sum_probs=93.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCHHHHHHHHHHHHHHHHCCCCCEEEEEEHH-HHHHHHHCC------C Q ss_conf 798999999999999999998427678999731899999999877764434982023662000-124542113------4 Q gi|254780815|r 98 FDPKDFLNEARNSYEAIVDSFFEGKVHAIEKLVDSRVYQDFNDSLSTRKSNEKNVKSSLVGID-DMKIINASI------E 170 (232) Q Consensus 98 Fd~~~Fl~gAk~af~~I~~A~~~gD~~~L~~lls~~~~~~f~~~i~~r~~~~~~~~~~~v~i~-~~~i~~~~~------~ 170 (232) ++-....+.|-.-|..+-+||++++++.|.+.|+-.+.+.+.+...... .+.+.++..+.+. ..+|+.... . T Consensus 67 ~~F~~WKn~AIe~yv~vNkaFA~~~l~~l~~~~~~~v~~aL~~Ra~~~P-~~~kl~W~L~k~~~~PKlvs~~~~p~~~~~ 145 (235) T pfam07961 67 PSFLLWKNKAIETYVQVNKAFAARDLEQLEPLVSLWVQEALEARAKQLP-DNVKLDWELIKFNSVPKLVSFQPLMIPGRP 145 (235) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-CCCEEEEEEEEECCCCCEEEEEEECCCCCC T ss_conf 7177899899999999999999865887636665999999999986099-986587776310579717985320269997 Q ss_pred CCEEEEEEEEEEEEEEEEECCCCCEECCCCCCCEEEEEEEEEEECCCCCCCCEEEEEE Q ss_conf 6456999999888777667689968057657848999999999618989986689576 Q gi|254780815|r 171 ENTVYITIRIVGQFISASYDKDNLLISSDPEIFGKVIDIWTFVRNIPPSNPNWVLIST 228 (232) Q Consensus 171 ~~~~~itV~F~~~~i~~~~d~~g~vveG~~~~~~~~~e~WtF~R~~~~~~p~W~L~~i 228 (232) ....++.++|.+.|.-...|+....++ ..-..+.|+-.|.-+..++ .|+|+|. T Consensus 146 ~e~vQ~V~r~~TkQrli~~~k~s~~~e---~~dr~v~dyvv~~~d~~t~--e~lL~Gs 198 (235) T pfam07961 146 LELVQLVYRFDTKQRLIKVDKGSSKVE---KLDRDVVDYIAFLCDATTN--DMLLVGS 198 (235) T ss_pred CEEEEEEEEECCEEEEEEECCCCCCCC---CCCCCHHHEEEEEECCCCC--CEEEEEE T ss_conf 248999999513047898416887532---4454311318999548877--2899974 No 6 >KOG4599 consensus Probab=97.22 E-value=4.3e-06 Score=58.63 Aligned_cols=140 Identities=17% Similarity=0.223 Sum_probs=105.5 Q ss_pred HHHHHHHHHCC--CCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCH-----------HHHHHHHHHHHHHHHCCCC Q ss_conf 45777766126--7679899999999999999999842767899973189-----------9999999877764434982 Q gi|254780815|r 85 NKVMRDIVSVY--TDFDPKDFLNEARNSYEAIVDSFFEGKVHAIEKLVDS-----------RVYQDFNDSLSTRKSNEKN 151 (232) Q Consensus 85 ~~~~~~i~~~~--~~Fd~~~Fl~gAk~af~~I~~A~~~gD~~~L~~lls~-----------~~~~~f~~~i~~r~~~~~~ 151 (232) .-+.+.++..+ ..|+.++|=..|+..|..+.-+-+..|...+..|+++ .+...|-..+.+..+. .+ T Consensus 150 q~sir~i~~k~~~~~F~ik~f~~kakDifIqaH~~l~~~de~kays~l~e~~fvhl~~~~~t~~~~flp~m~~k~K~-~~ 228 (379) T KOG4599 150 QMSIRRIRDKDEIENFEIKDFGAKAKDIFIQAHLCLNNSDEMKAYSFLTESEFVHLKCPSITNLLHFLPVMQEKVKK-GT 228 (379) T ss_pred HHHHHHHCCCCCCCCEECCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCHHHHCCCCCHHHCC-CC T ss_conf 76334321477300200022114767789999998753648878877512221223478764356651445433246-72 Q ss_pred CEEEEEEHHH-HHHH-----HHC--CCCCEEEEEEEEEEEEEEEEECCCCCEECCCCCCCEEEEEEEEEEECCCCCCCCE Q ss_conf 0236620001-2454-----211--3464569999998887776676899680576578489999999996189899866 Q gi|254780815|r 152 VKSSLVGIDD-MKII-----NAS--IEENTVYITIRIVGQFISASYDKDNLLISSDPEIFGKVIDIWTFVRNIPPSNPNW 223 (232) Q Consensus 152 ~~~~~v~i~~-~~i~-----~~~--~~~~~~~itV~F~~~~i~~~~d~~g~vveG~~~~~~~~~e~WtF~R~~~~~~p~W 223 (232) ..+.++.... ..++ +.- ..+..++||||++++|...++|+-|...-|+++..+.|.|+-.|+.++.+....| T Consensus 229 vR~~~vs~leP~~vv~~rc~d~~~~s~n~~aqitvRkh~~q~Lavydrfg~lm~g~E~i~KDv~eyvvfe~hi~~~~g~w 308 (379) T KOG4599 229 VRWSFVSVLEPSRVVYVRCDDDNDKSGNFIAQITVRKHTRQCLAVYDRFGRLMFGSEDIKKDVLEYVVFENHIQNAYGRW 308 (379) T ss_pred EEEEEEEEECCCEEEEEEECCCCCCCCCHHHEEHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 46888764055205898724776420041302013788899999999986886067533311568877777541045431 Q ss_pred EE Q ss_conf 89 Q gi|254780815|r 224 VL 225 (232) Q Consensus 224 ~L 225 (232) +| T Consensus 309 r~ 310 (379) T KOG4599 309 RL 310 (379) T ss_pred HH T ss_conf 00 No 7 >pfam11027 DUF2615 Protein of unknown function (DUF2615). This small. approximately 100 residue, family is conserved from worms to humans. It is cysteine-rich with a characteristic FDxCEC sequence motif. The function is not known. Probab=74.36 E-value=5.3 Score=19.93 Aligned_cols=16 Identities=13% Similarity=0.548 Sum_probs=8.2 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 2699999999999999 Q gi|254780815|r 9 GDFLILLFAFITFFVF 24 (232) Q Consensus 9 ~~i~Iii~A~IA~fl~ 24 (232) .++-++++.++|++++ T Consensus 55 ~~~i~m~w~~~a~~ly 70 (103) T pfam11027 55 GFMLMMAWMLLAMALY 70 (103) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 8999999999999999 No 8 >pfam10108 Exon_PolB Predicted 3'-5' exonuclease related to the exonuclease domain of PolB. This domain is found in various prokaryotic 3'-5' exonucleases and hypothetical proteins. Probab=69.86 E-value=5.2 Score=20.01 Aligned_cols=54 Identities=9% Similarity=0.202 Sum_probs=34.9 Q ss_pred HHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCHHHHHHHHHHHHHHHHCCC Q ss_conf 777661267679899999999999999999842767899973189999999987776443498 Q gi|254780815|r 88 MRDIVSVYTDFDPKDFLNEARNSYEAIVDSFFEGKVHAIEKLVDSRVYQDFNDSLSTRKSNEK 150 (232) Q Consensus 88 ~~~i~~~~~~Fd~~~Fl~gAk~af~~I~~A~~~gD~~~L~~lls~~~~~~f~~~i~~r~~~~~ 150 (232) +..+-..|.-.+..+ .++.++|.+|+++.|+.||-.+|++.+--=++-+--.|. T Consensus 124 ~a~l~G~PGK~~~~G---------~~V~~~~~~G~l~~I~~YCe~DvlnTylv~LR~~ll~G~ 177 (211) T pfam10108 124 VAALLGLPGKMGVDG---------SQVWELYQAGKIEEIRDYCETDVLNTYLVYLRFQLLRGR 177 (211) T ss_pred HHHHCCCCCCCCCCH---------HHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 999819998787764---------589999985988899999887689999999999998088 No 9 >pfam07466 DUF1517 Protein of unknown function (DUF1517). This family consists of several hypothetical glycine rich plant and bacterial proteins of around 300 residues in length. The function of this family is unknown. Probab=66.90 E-value=7.3 Score=19.05 Aligned_cols=120 Identities=14% Similarity=0.110 Sum_probs=54.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 67026999999999999999998641544454446543334344444444445444566665443333333345542024 Q gi|254780815|r 6 MDSGDFLILLFAFITFFVFLQLRGVLGKKTGNEKPFSGFFSGKYVFSKRDERGIITLGKGKKKDNLDSINELFPIGTRLN 85 (232) Q Consensus 6 m~~~~i~Iii~A~IA~fl~~RLr~vLG~rtg~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (232) .+.+.|-+|||.+||+|++.-||+..+.- +.... .... ........-....-..+..+. T Consensus 54 Gggglf~~lil~aia~~lv~~~r~~~~~~-~~~~~---------~~~~-----------~~~vsv~~lQVgLLa~a~~lQ 112 (280) T pfam07466 54 GGGGLFGLLILMAIAGVLVNAFRSAGGGG-GGLSS---------AGTS-----------NGKVTVVQLQVGLLASARELQ 112 (280) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHCCCCC-CCCCC---------CCCC-----------CCCEEEEEEEHHHHHCHHHHH T ss_conf 85459999999999999999976035777-76543---------3357-----------996058864224453459999 Q ss_pred HHHHHHHHCCCCCCH---HHHHHHHHHHHHHHHHHHHCCCHHHHHH--HHCHHHHHHHHHHHHHHHHC Q ss_conf 577776612676798---9999999999999999984276789997--31899999999877764434 Q gi|254780815|r 86 KVMRDIVSVYTDFDP---KDFLNEARNSYEAIVDSFFEGKVHAIEK--LVDSRVYQDFNDSLSTRKSN 148 (232) Q Consensus 86 ~~~~~i~~~~~~Fd~---~~Fl~gAk~af~~I~~A~~~gD~~~L~~--lls~~~~~~f~~~i~~r~~~ 148 (232) ..++.+-+.-..=+. ..-|+.+-.+.-+--+.|.-++.+.=+. -.-+.-|+.+ .|+||.+= T Consensus 113 ~dL~rlA~~aDTst~~GL~~vLqEt~LaLLR~pe~wvy~~~~~~~~~~~~AE~~Fn~l--SleERSKf 178 (280) T pfam07466 113 RDLNRLAESADTSTPEGLARVLQETTLALLRHPEYWVYANAEADQVPIDSAESQFNRL--SLEERSKF 178 (280) T ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHH--HHHHHHHH T ss_conf 9999999850899778999999999999764854168874014766878999999998--89887520 No 10 >PRK10533 putative lipoprotein; Provisional Probab=65.99 E-value=9.4 Score=18.36 Aligned_cols=64 Identities=14% Similarity=0.163 Sum_probs=37.6 Q ss_pred HHHHHHH-HHHHHHCCCHHHHHHHHCHHHHHHHHHHHHHHHHCCCCCEEEEE-----EHHHHHHHHHCCCCCE Q ss_conf 9999999-99998427678999731899999999877764434982023662-----0001245421134645 Q gi|254780815|r 107 ARNSYEA-IVDSFFEGKVHAIEKLVDSRVYQDFNDSLSTRKSNEKNVKSSLV-----GIDDMKIINASIEENT 173 (232) Q Consensus 107 Ak~af~~-I~~A~~~gD~~~L~~lls~~~~~~f~~~i~~r~~~~~~~~~~~v-----~i~~~~i~~~~~~~~~ 173 (232) |.+.|.. ||. ...+++.||+|+|..+|+.+..+-++-.+. ......+. .-.++++..+....+. T Consensus 45 AqqFYD~riq~--~~~~~a~lRPylS~~L~~~L~~a~~~~~~~-~~~~~D~FSs~~~~~tsa~VaSaStIPN~ 114 (171) T PRK10533 45 AQQFYDYRIQH--RSNDIAALRPYLSDKLATLLSDASRDNSHR-QLLSGDPFSSRTTLPDSAHVASASTIPNR 114 (171) T ss_pred HHHHHHHHHCC--CCCCCHHHCCHHHHHHHHHHHHHHHCCCCC-CCCCCCCCCCCCCCCCCCEEEECCCCCCC T ss_conf 99988873022--556525425245789999999987544567-76346634332348861001203427897 No 11 >pfam12273 RCR Chitin synthesis regulation, resistance to Congo red. RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5. Probab=63.00 E-value=8.3 Score=18.68 Aligned_cols=24 Identities=17% Similarity=0.476 Sum_probs=12.2 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9999999999999999864154445 Q gi|254780815|r 12 LILLFAFITFFVFLQLRGVLGKKTG 36 (232) Q Consensus 12 ~Iii~A~IA~fl~~RLr~vLG~rtg 36 (232) .||+.++|.+|+|.++. .+.+|.| T Consensus 7 v~i~~~~i~~f~~~~~n-~RRrr~G 30 (124) T pfam12273 7 IFIIALLILFFLTARIN-RRRRRRG 30 (124) T ss_pred HHHHHHHHHHHHHHHHH-HHHHHCC T ss_conf 99999999999999873-9998758 No 12 >PRK00068 hypothetical protein; Validated Probab=61.47 E-value=12 Score=17.57 Aligned_cols=10 Identities=10% Similarity=0.497 Sum_probs=5.2 Q ss_pred CCCEEEEEEE Q ss_conf 9866895763 Q gi|254780815|r 220 NPNWVLISTK 229 (232) Q Consensus 220 ~p~W~L~~iq 229 (232) ++++.+++++ T Consensus 499 t~~Y~IV~t~ 508 (978) T PRK00068 499 TNDYAIVNGK 508 (978) T ss_pred CCCEEEECCC T ss_conf 9984897898 No 13 >cd05782 DNA_polB_like1_exo A subfamily of the 3'-5' exonuclease domain of family-B DNA polymerases. This subfamily is composed of uncharacterized bacterial family-B DNA polymerases. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are involved in metal binding and catalysis. The exonuclease domain of family-B DNA polymerases has a fundamental role in proofreading activity. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. Family-B DNA polymerases are predominantly involved in DNA replication and DNA repair. Probab=61.11 E-value=10 Score=18.12 Aligned_cols=28 Identities=18% Similarity=0.415 Sum_probs=24.3 Q ss_pred HHHHHHHHCCCHHHHHHHHCHHHHHHHH Q ss_conf 9999998427678999731899999999 Q gi|254780815|r 112 EAIVDSFFEGKVHAIEKLVDSRVYQDFN 139 (232) Q Consensus 112 ~~I~~A~~~gD~~~L~~lls~~~~~~f~ 139 (232) .++.+.|.+|+++.|+.||-.++.+... T Consensus 177 s~V~~~~~~g~l~~I~~YCe~DVl~T~~ 204 (208) T cd05782 177 SQVWELYAEGKLDEIAEYCETDVLNTYL 204 (208) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHHHH T ss_conf 9999999819988999999999999999 No 14 >TIGR02957 SigX4 RNA polymerase sigma-70 factor, TIGR02957 family; InterPro: IPR014303 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , . This group of sigma factors appear by homology, tree building and bidirectional best hits, to represent a conserved family. This entry is found in a limited number of bacterial lineages and ncludes apparent paralogous expansion in Streptomyces coelicolor A3(2), and multiple copies in Mycobacterium smegmatis str. MC2 155, Streptomyces avermitilis MA-4680 and Nocardia farcinica IFM 10152.; GO: 0003677 DNA binding, 0003700 transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, 0006355 regulation of transcription DNA-dependent. Probab=59.07 E-value=12 Score=17.57 Aligned_cols=30 Identities=20% Similarity=0.359 Sum_probs=25.6 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHCHHHHH Q ss_conf 999999999998427678999731899999 Q gi|254780815|r 107 ARNSYEAIVDSFFEGKVHAIEKLVDSRVYQ 136 (232) Q Consensus 107 Ak~af~~I~~A~~~gD~~~L~~lls~~~~~ 136 (232) .++.-+..+.|...||++.|+.+|++++.- T Consensus 168 ~~~l~~~F~~A~~~GD~~~L~~~L~~Dv~L 197 (287) T TIGR02957 168 SRELLERFVEALQTGDLEGLLELLAEDVVL 197 (287) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHCEE T ss_conf 999999999998318978999862530177 No 15 >PRK04598 tatA twin arginine translocase protein A; Provisional Probab=57.00 E-value=11 Score=18.01 Aligned_cols=26 Identities=31% Similarity=0.575 Sum_probs=17.4 Q ss_pred CCH-HHHHHHHHHHHHHHHH--HHHHHHC Q ss_conf 670-2699999999999999--9998641 Q gi|254780815|r 6 MDS-GDFLILLFAFITFFVF--LQLRGVL 31 (232) Q Consensus 6 m~~-~~i~Iii~A~IA~fl~--~RLr~vL 31 (232) ||+ +.-.|||+++|.+.|| -||+++. T Consensus 1 Mggisiw~llii~~iv~llFGtkkL~~lg 29 (82) T PRK04598 1 MGGISIWQLLIIAVIVVLLFGTKKLRGIG 29 (82) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 98655999999999999993631677888 No 16 >pfam07446 GumN GumN protein. This family contains the bacterial protein GumN (approximately 330 residues long). Note that many members of this family are hypothetical proteins. Probab=56.12 E-value=15 Score=17.00 Aligned_cols=30 Identities=20% Similarity=0.398 Sum_probs=23.7 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHCHHHH Q ss_conf 999999999999842767899973189999 Q gi|254780815|r 106 EARNSYEAIVDSFFEGKVHAIEKLVDSRVY 135 (232) Q Consensus 106 gAk~af~~I~~A~~~gD~~~L~~lls~~~~ 135 (232) ..+..+..+.+||..||++.|..+....+. T Consensus 173 ~~~~~~~~l~~aw~~gd~~~l~~~~~~~~~ 202 (259) T pfam07446 173 KGDDLLVTLADAWAEGDVEALAELMREDMA 202 (259) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 417999999999990899999999877543 No 17 >PRK00442 tatA twin arginine translocase protein A; Provisional Probab=55.43 E-value=11 Score=17.90 Aligned_cols=25 Identities=20% Similarity=0.404 Sum_probs=14.6 Q ss_pred CCHHHHH-HHHHHHHHHHHH--HHHHHH Q ss_conf 6702699-999999999999--999864 Q gi|254780815|r 6 MDSGDFL-ILLFAFITFFVF--LQLRGV 30 (232) Q Consensus 6 m~~~~i~-Iii~A~IA~fl~--~RLr~v 30 (232) ||+..+. +||+++|.+.|| -||+++ T Consensus 1 MG~~s~~hwlivl~iv~llFGtkkl~~l 28 (90) T PRK00442 1 MGIFDWKHWIVILVVVVLVFGTKKLKNL 28 (90) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 9600699999999999999464035467 No 18 >PRK03554 tatA twin arginine translocase protein A; Provisional Probab=54.75 E-value=12 Score=17.67 Aligned_cols=26 Identities=23% Similarity=0.423 Sum_probs=17.1 Q ss_pred CCH-HHHHHHHHHHHHHHHH--HHHHHHC Q ss_conf 670-2699999999999999--9998641 Q gi|254780815|r 6 MDS-GDFLILLFAFITFFVF--LQLRGVL 31 (232) Q Consensus 6 m~~-~~i~Iii~A~IA~fl~--~RLr~vL 31 (232) ||+ +.-.+||+++|.+.|| -|||++. T Consensus 1 Mggisiw~llIi~~iv~llFGtkKL~~lg 29 (89) T PRK03554 1 MGGISIWQLLIIAVIVVLLFGTKKLGSIG 29 (89) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 98655999999999999994642577888 No 19 >PRK01833 tatA twin arginine translocase protein A; Provisional Probab=50.61 E-value=16 Score=16.95 Aligned_cols=25 Identities=24% Similarity=0.466 Sum_probs=16.6 Q ss_pred CCH-HHHHHHHHHHHHHHHH--HHHHHH Q ss_conf 670-2699999999999999--999864 Q gi|254780815|r 6 MDS-GDFLILLFAFITFFVF--LQLRGV 30 (232) Q Consensus 6 m~~-~~i~Iii~A~IA~fl~--~RLr~v 30 (232) |++ +.-+|+|+++|.+.|| -|||++ T Consensus 1 MggisiwqllIi~~iv~llFGtkKL~~l 28 (75) T PRK01833 1 MGGISIWQLLIIVAIIVLLFGTKKLRTL 28 (75) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 9864499999999999999464167688 No 20 >COG3086 RseC Positive regulator of sigma E activity [Signal transduction mechanisms] Probab=49.27 E-value=18 Score=16.59 Aligned_cols=28 Identities=21% Similarity=0.191 Sum_probs=13.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9999999999999999986415444544 Q gi|254780815|r 11 FLILLFAFITFFVFLQLRGVLGKKTGNE 38 (232) Q Consensus 11 i~Iii~A~IA~fl~~RLr~vLG~rtg~e 38 (232) +.+++..+.++.+.+|....+++|+..| T Consensus 108 ~~~~lg~~l~fl~~r~ysRkl~~~~~~Q 135 (150) T COG3086 108 FGAFLGLALGFLLARRYSRKLAKRTEWQ 135 (150) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 9999999999999999998864244578 No 21 >pfam05223 MecA_N NTF2-like N-terminal transpeptidase domain. The structure of this domain from MecA is known, and is found to be similar to that found in NTF2 pfam02136. This domain seems unlikely to have an enzymatic function, and its role remains unknown. Probab=48.58 E-value=20 Score=16.24 Aligned_cols=28 Identities=14% Similarity=0.317 Sum_probs=24.0 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHCHHH Q ss_conf 9999999999984276789997318999 Q gi|254780815|r 107 ARNSYEAIVDSFFEGKVHAIEKLVDSRV 134 (232) Q Consensus 107 Ak~af~~I~~A~~~gD~~~L~~lls~~~ 134 (232) +..++...+.||.+||.+..-.+++..- T Consensus 3 p~~~~~~f~~aw~~~dy~~m~~~~~~~~ 30 (118) T pfam05223 3 PEKAAETFLDAWAKGDFKTAASLTSEAS 30 (118) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHCCHH T ss_conf 7899999999987499999998647064 No 22 >pfam09252 Feld-I_B Allergen Fel d I-B chain. Members of this family of cat allergens adopt a helical structure consisting of eight alpha helices, in a Uteroglobin-like fold. They are one of the most important causes of allergic asthma worldwide. Probab=48.53 E-value=20 Score=16.23 Aligned_cols=31 Identities=29% Similarity=0.529 Sum_probs=23.8 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHCHHHHHHH Q ss_conf 999999999999842767899973189999999 Q gi|254780815|r 106 EARNSYEAIVDSFFEGKVHAIEKLVDSRVYQDF 138 (232) Q Consensus 106 gAk~af~~I~~A~~~gD~~~L~~lls~~~~~~f 138 (232) +-|.||+.||..|.++-+++ .++.+.++.++ T Consensus 31 ~Ek~AfeKIQdCy~E~gl~~--k~ld~~im~si 61 (67) T pfam09252 31 KERVAFEKIQDCYNEEGLKS--KLLDPQVMETI 61 (67) T ss_pred HHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHH T ss_conf 89999999999997424777--65348899887 No 23 >pfam08332 CaMKII_AD Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers. Probab=46.79 E-value=21 Score=16.06 Aligned_cols=25 Identities=20% Similarity=0.399 Sum_probs=23.2 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHCHH Q ss_conf 9999999998427678999731899 Q gi|254780815|r 109 NSYEAIVDSFFEGKVHAIEKLVDSR 133 (232) Q Consensus 109 ~af~~I~~A~~~gD~~~L~~lls~~ 133 (232) +.|+...+|...||.+....+|.|+ T Consensus 7 ~l~~~~~~ai~tGD~~~y~~l~~~d 31 (128) T pfam08332 7 KVTETLIEAIATGDFETYTKLCDPD 31 (128) T ss_pred HHHHHHHHHHHHCCHHHHHHHHCCC T ss_conf 9999999998729999988775787 No 24 >PRK01614 tatE twin arginine translocase protein A; Validated Probab=44.74 E-value=23 Score=15.87 Aligned_cols=27 Identities=22% Similarity=0.470 Sum_probs=17.6 Q ss_pred CCH-HHHHHHHHHHHHHHHH--HHHHHHCCC Q ss_conf 670-2699999999999999--999864154 Q gi|254780815|r 6 MDS-GDFLILLFAFITFFVF--LQLRGVLGK 33 (232) Q Consensus 6 m~~-~~i~Iii~A~IA~fl~--~RLr~vLG~ 33 (232) |++ ..-.+||+++|.+.|| -|||++ |. T Consensus 1 Mggisiw~llii~~IvvllFGtkkLr~l-G~ 30 (85) T PRK01614 1 MEGLSITKLLVVGILIVLLFGTSKLRTL-GA 30 (85) T ss_pred CCCCCHHHHHHHHHHHHHHHCCHHHHHH-HH T ss_conf 9854499999999999999452377788-78 No 25 >PRK09635 sigI RNA polymerase sigma factor SigI; Provisional Probab=44.07 E-value=24 Score=15.80 Aligned_cols=48 Identities=15% Similarity=0.208 Sum_probs=34.2 Q ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCHHHH Q ss_conf 57777661267679899999999999999999842767899973189999 Q gi|254780815|r 86 KVMRDIVSVYTDFDPKDFLNEARNSYEAIVDSFFEGKVHAIEKLVDSRVY 135 (232) Q Consensus 86 ~~~~~i~~~~~~Fd~~~Fl~gAk~af~~I~~A~~~gD~~~L~~lls~~~~ 135 (232) .+-+.+.+..|.|+... ......-+....|+..||++.|..+|++++. T Consensus 157 RAR~~l~~~~p~~~~~~--~~~~~lv~~F~~A~~~gD~~~L~~LL~~dv~ 204 (290) T PRK09635 157 RARRKINESRIAASVEP--AQHRVVTRAFIEACSNGDLDTLLEVLDPGVA 204 (290) T ss_pred HHHHHHHCCCCCCCCCH--HHHHHHHHHHHHHHHCCCHHHHHHHCCCCCC T ss_conf 99998732389899997--9999999999999982899999986385241 No 26 >pfam07366 SnoaL SnoaL-like polyketide cyclase. This family includes SnoaL a polyketide cyclase involved in nogalamycin biosynthesis. This family was formerly known as DUF1486. Probab=42.49 E-value=17 Score=16.77 Aligned_cols=25 Identities=16% Similarity=0.299 Sum_probs=21.0 Q ss_pred HHHHHHHHHCCCHHHHHHHHCHHHH Q ss_conf 9999999842767899973189999 Q gi|254780815|r 111 YEAIVDSFFEGKVHAIEKLVDSRVY 135 (232) Q Consensus 111 f~~I~~A~~~gD~~~L~~lls~~~~ 135 (232) .+...++|+++|.+.+..|++|++. T Consensus 4 ~~~y~~~wn~~d~~~~~~~~a~d~~ 28 (126) T pfam07366 4 LRFYDEVWNTGDLDALDEFVAEDVV 28 (126) T ss_pred HHHHHHHHHCCCHHHHHHHHCCCEE T ss_conf 9999999975999999987287858 No 27 >PRK09636 RNA polymerase sigma factor SigJ; Provisional Probab=41.18 E-value=26 Score=15.52 Aligned_cols=44 Identities=20% Similarity=0.330 Sum_probs=30.0 Q ss_pred HHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCHHH Q ss_conf 7766126767989999999999999999984276789997318999 Q gi|254780815|r 89 RDIVSVYTDFDPKDFLNEARNSYEAIVDSFFEGKVHAIEKLVDSRV 134 (232) Q Consensus 89 ~~i~~~~~~Fd~~~Fl~gAk~af~~I~~A~~~gD~~~L~~lls~~~ 134 (232) +.+....|.|+... ..-...-.....|+..||++.|..||++++ T Consensus 153 ~~l~~~~~~~~~~~--~~~~~lv~~f~~A~~~gD~~~L~~LLa~Dv 196 (289) T PRK09636 153 KHVRAARPRFTVSD--EEGAELVEAFFAALRSGDMDALVALLAPDV 196 (289) T ss_pred HHHHCCCCCCCCCH--HHHHHHHHHHHHHHHCCCHHHHHHHHHCCE T ss_conf 98734489999983--999999999999998089999999851562 No 28 >TIGR02439 catechol_proteo catechol 1,2-dioxygenase; InterPro: IPR012801 Members of this family known so far are catechol 1,2-dioxygenases of the proteobacteria. They are distinct from catechol 1,2-dioxygenases and chlorocatechol 1,2-dioxygenases of the actinobacteria, which are quite similar to each other and resolved by separate entries. This enzyme catalyses intradiol cleavage in which catechol + O2 becomes cis,cis-muconate. Catechol is an intermediate in the catabolism of many different aromatic compounds, as is the alternative intermediate protocatechuate. In Acinetobacter lwoffii, two isozymes are present with abilities, differing somewhat, to act on catechol analogues 3-methylcatechol, 4-methylcatechol, 4-methoxycatechol, and 4-chlorocatechol.; GO: 0005506 iron ion binding, 0018576 catechol 12-dioxygenase activity, 0019614 catechol catabolic process. Probab=40.50 E-value=20 Score=16.21 Aligned_cols=27 Identities=22% Similarity=0.428 Sum_probs=19.9 Q ss_pred EEEEEEEEEEEEEECCCCCEECCCCCCCEEEEEEEEE Q ss_conf 9999988877766768996805765784899999999 Q gi|254780815|r 176 ITIRIVGQFISASYDKDNLLISSDPEIFGKVIDIWTF 212 (232) Q Consensus 176 itV~F~~~~i~~~~d~~g~vveG~~~~~~~~~e~WtF 212 (232) -|.-.+++ ++|.+|++|+|- ..|+||= T Consensus 128 ~tl~l~G~----V~d~~G~Pi~gA------~VE~WHA 154 (288) T TIGR02439 128 ETLVLHGT----VTDTDGKPIAGA------KVEVWHA 154 (288) T ss_pred CEEEEEEE----EECCCCCCCCCC------EEEEEEC T ss_conf 16898658----878988754787------6657751 No 29 >PRK01371 sec-independent translocase; Provisional Probab=40.45 E-value=20 Score=16.32 Aligned_cols=36 Identities=17% Similarity=0.349 Sum_probs=18.9 Q ss_pred HHHHHHHHHH-HHHHHHHHCCCHHHH--HHHHCHHHHHH Q ss_conf 9999999999-999999842767899--97318999999 Q gi|254780815|r 102 DFLNEARNSY-EAIVDSFFEGKVHAI--EKLVDSRVYQD 137 (232) Q Consensus 102 ~Fl~gAk~af-~~I~~A~~~gD~~~L--~~lls~~~~~~ 137 (232) .|..+||.-. +.+-.-|.+-|+..| |.|+++-+++. T Consensus 42 ~~a~~ak~~l~~eLGpef~dl~~~~l~Pk~~i~k~l~d~ 80 (135) T PRK01371 42 ELANNARNDLRSELGPEFADLDLRDLHPKTFVRKHLLEA 80 (135) T ss_pred HHHHHHHHHHHHHHCCCHHHCCHHHCCHHHHHHHHHHCC T ss_conf 999999999999858641015453278899999987324 No 30 >TIGR00448 rpoE DNA-directed RNA polymerase; InterPro: IPR004519 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. This family seems to be confined to the archea and eukaryotic taxa and are quite dissimilar to Escherichia coli RpoE.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006350 transcription, 0005634 nucleus. Probab=39.63 E-value=28 Score=15.37 Aligned_cols=84 Identities=10% Similarity=0.168 Sum_probs=60.4 Q ss_pred HHHHHHHCHHHHHHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHHCCCCCEEEEEEEEEEEEEEEEECCCCCEECCCCCCC Q ss_conf 89997318999999998777644349820236620001245421134645699999988877766768996805765784 Q gi|254780815|r 124 HAIEKLVDSRVYQDFNDSLSTRKSNEKNVKSSLVGIDDMKIINASIEENTVYITIRIVGQFISASYDKDNLLISSDPEIF 203 (232) Q Consensus 124 ~~L~~lls~~~~~~f~~~i~~r~~~~~~~~~~~v~i~~~~i~~~~~~~~~~~itV~F~~~~i~~~~d~~g~vveG~~~~~ 203 (232) ++|++-++.-|.+.|...|+. +.=.-+++..|.+...=.+...++.+|+-|-|.+- |-+-.-++||+| .+ T Consensus 20 ~D~~~~i~~~L~e~f~grld~----~~G~~iti~d~~~igeG~~~~GDGs~Y~~V~F~A~---~FkP~l~EiVeG---ev 89 (184) T TIGR00448 20 EDLEEVITKLLNEKFEGRLDK----NVGLVITIKDIEDIGEGKVIEGDGSAYHNVVFRAL---VFKPELGEIVEG---EV 89 (184) T ss_pred CCHHHHHHHHHHHHCCCCCCC----CCCEEEEEEEEEECCCCEEECCCCCCCCCEEEEEE---EECCCCEEEEEE---EE T ss_conf 668999999865420464134----56448988874231543587079843101034688---960352167865---89 Q ss_pred EEEEEEEEEEECCC Q ss_conf 89999999996189 Q gi|254780815|r 204 GKVIDIWTFVRNIP 217 (232) Q Consensus 204 ~~~~e~WtF~R~~~ 217 (232) .++.+.=.|+|=++ T Consensus 90 ~~~~efG~fV~LlG 103 (184) T TIGR00448 90 IEIVEFGAFVSLLG 103 (184) T ss_pred EEEEECCCEEEEEC T ss_conf 98985274267522 No 31 >PRK00182 tatB sec-independent translocase; Provisional Probab=39.31 E-value=21 Score=16.15 Aligned_cols=21 Identities=24% Similarity=0.327 Sum_probs=13.8 Q ss_pred EECCCCHHHHHHHHHHHHHHHHH Q ss_conf 22126702699999999999999 Q gi|254780815|r 2 VFIKMDSGDFLILLFAFITFFVF 24 (232) Q Consensus 2 ~~~~m~~~~i~Iii~A~IA~fl~ 24 (232) ||-++|+ .+|+++++||.+++ T Consensus 1 MF~dIG~--~EllvI~VvaLvVl 21 (165) T PRK00182 1 MFSSVGW--GEILLLLIVGLIVI 21 (165) T ss_pred CCCCCCH--HHHHHHHHHHHHHC T ss_conf 9664479--99999999999625 No 32 >PRK03100 sec-independent translocase; Provisional Probab=38.14 E-value=23 Score=15.88 Aligned_cols=21 Identities=10% Similarity=0.251 Sum_probs=14.0 Q ss_pred EECCCCHHHHHHHHHHHHHHHHH Q ss_conf 22126702699999999999999 Q gi|254780815|r 2 VFIKMDSGDFLILLFAFITFFVF 24 (232) Q Consensus 2 ~~~~m~~~~i~Iii~A~IA~fl~ 24 (232) ||-.+|+ .+|+++++||.+++ T Consensus 1 MF~dIG~--~Ellvi~vvaLvvi 21 (131) T PRK03100 1 MFANIGW--GEMLVLVVAGLVIL 21 (131) T ss_pred CCCCCCH--HHHHHHHHHHHHHC T ss_conf 9543469--99999999999834 No 33 >PRK08241 RNA polymerase factor sigma-70; Validated Probab=38.10 E-value=29 Score=15.22 Aligned_cols=31 Identities=26% Similarity=0.214 Sum_probs=25.5 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHCHHHHH Q ss_conf 9999999999998427678999731899999 Q gi|254780815|r 106 EARNSYEAIVDSFFEGKVHAIEKLVDSRVYQ 136 (232) Q Consensus 106 gAk~af~~I~~A~~~gD~~~L~~lls~~~~~ 136 (232) ..+......+.||..||++.|..+|++++-- T Consensus 218 ~~r~ll~rf~~A~~~~D~~al~~LL~~Da~~ 248 (341) T PRK08241 218 EQRALLERYVAAFEAYDVDALVALLTEDATW 248 (341) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHCCCEEE T ss_conf 9999999999999806999999871578069 No 34 >COG1826 TatA Sec-independent protein secretion pathway components [Intracellular trafficking and secretion] Probab=34.74 E-value=31 Score=15.09 Aligned_cols=17 Identities=24% Similarity=0.425 Sum_probs=12.6 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 02699999999999999 Q gi|254780815|r 8 SGDFLILLFAFITFFVF 24 (232) Q Consensus 8 ~~~i~Iii~A~IA~fl~ 24 (232) .+..+|||+++|+..|| T Consensus 4 ig~~elliIlvV~lllf 20 (94) T COG1826 4 IGWSELLIILVVALLVF 20 (94) T ss_pred CCHHHHHHHHHHHHHHC T ss_conf 88999999999999862 No 35 >TIGR02465 chlorocat_1_2 chlorocatechol 1,2-dioxygenase; InterPro: IPR012817 Members of this protein family are chlorocatechol 1,2-dioxygenase. This protein is closely related to catechol 1,2-dioxygenase, IPR012801 from INTERPRO, 1.13.11.1 from EC. Note: a number of annotated database entries for members of this family have EC numbers that refers to IPR012801 from INTERPRO; whereas members of this family acts in pathways for the biodegradation of chlorinated aromatic compounds.; GO: 0005506 iron ion binding, 0018575 chlorocatechol 12-dioxygenase activity, 0006725 aromatic compound metabolic process. Probab=33.81 E-value=31 Score=15.02 Aligned_cols=87 Identities=18% Similarity=0.321 Sum_probs=42.9 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHCHHHHHHHHHHHHHHH--HCCCCCEEEEEE---HHHHHHHH-------HCC-CCCEE Q ss_conf 999999999984276789997318999999998777644--349820236620---00124542-------113-46456 Q gi|254780815|r 108 RNSYEAIVDSFFEGKVHAIEKLVDSRVYQDFNDSLSTRK--SNEKNVKSSLVG---IDDMKIIN-------ASI-EENTV 174 (232) Q Consensus 108 k~af~~I~~A~~~gD~~~L~~lls~~~~~~f~~~i~~r~--~~~~~~~~~~v~---i~~~~i~~-------~~~-~~~~~ 174 (232) |..+..|.+--..|.+ .||-+-.|++--.+++.++ .++.+ ..+-+ ..+|=+++ ... +.... T Consensus 30 r~g~~Yl~~v~eagE~----pLl~DvFfnsTi~~~~~~~Gi~~gS~--~aiqGPYF~~~AP~v~GkP~~L~~yd~D~~~P 103 (258) T TIGR02465 30 RTGVQYLMKVAEAGET----PLLLDVFFNSTIVEIENRKGISRGST--SAIQGPYFLEDAPLVEGKPGTLKTYDTDDHKP 103 (258) T ss_pred HHHHHHHHHHHHCCCH----HHHHHHHHCCEEEEEECCCCCCCCCC--HHCCCCCCCCCCCEECCCCCCCCCCCCCCCCC T ss_conf 9999998864302522----67776631020354312576254760--02058831168860037856366200578898 Q ss_pred EEEEEEEEEEEEEEECCCCCEECCCCCCCEEEEEEEEE Q ss_conf 99999988877766768996805765784899999999 Q gi|254780815|r 175 YITIRIVGQFISASYDKDNLLISSDPEIFGKVIDIWTF 212 (232) Q Consensus 175 ~itV~F~~~~i~~~~d~~g~vveG~~~~~~~~~e~WtF 212 (232) .+.|++. +++-+|++|+| .+-|+||= T Consensus 104 --~l~~~G~----V~~~~G~~v~g------AvIDVWHS 129 (258) T TIGR02465 104 --RLLIRGT----VRDLSGEPVAG------AVIDVWHS 129 (258) T ss_pred --EEEEEEE----EECCCCCEECC------CEEEEECC T ss_conf --0688777----85157856437------47875405 No 36 >PRK03625 tatE twin arginine translocase protein E; Validated Probab=32.77 E-value=35 Score=14.68 Aligned_cols=25 Identities=24% Similarity=0.377 Sum_probs=16.0 Q ss_pred CCH-HHHHHHHHHHHHHHHH--HHHHHH Q ss_conf 670-2699999999999999--999864 Q gi|254780815|r 6 MDS-GDFLILLFAFITFFVF--LQLRGV 30 (232) Q Consensus 6 m~~-~~i~Iii~A~IA~fl~--~RLr~v 30 (232) |++ ..-.+||+++|.+.|| -|||++ T Consensus 1 MggiSiw~llIv~~IVvllFGtkkLr~l 28 (67) T PRK03625 1 MGEISITKLLVVAALVVLLFGTKKLRTL 28 (67) T ss_pred CCCCCHHHHHHHHHHHHHHHCCHHHHHH T ss_conf 9855599999999999999354277788 No 37 >PRK06531 yajC preprotein translocase subunit YajC; Validated Probab=31.88 E-value=28 Score=15.34 Aligned_cols=10 Identities=10% Similarity=0.338 Sum_probs=4.8 Q ss_pred CCHHHHHHHH Q ss_conf 6702699999 Q gi|254780815|r 6 MDSGDFLILL 15 (232) Q Consensus 6 m~~~~i~Iii 15 (232) ||+.+|.+++ T Consensus 1 M~~~~iim~v 10 (120) T PRK06531 1 MGFPTIIMFV 10 (120) T ss_pred CCCHHHHHHH T ss_conf 9822899999 No 38 >PRK00191 tatA twin arginine translocase protein A; Provisional Probab=31.82 E-value=37 Score=14.58 Aligned_cols=19 Identities=21% Similarity=0.475 Sum_probs=12.9 Q ss_pred CCHHHHHHHHHHHHHHHHH Q ss_conf 6702699999999999999 Q gi|254780815|r 6 MDSGDFLILLFAFITFFVF 24 (232) Q Consensus 6 m~~~~i~Iii~A~IA~fl~ 24 (232) |+-+--.|+|+++|++.|| T Consensus 1 MsLGpweiiiI~vvvllLF 19 (107) T PRK00191 1 MSLGPWEIGIIVLLIIVLF 19 (107) T ss_pred CCCCHHHHHHHHHHHHHHH T ss_conf 9863799999999999997 No 39 >TIGR03116 cas_csf3 CRISPR-associated protein, Csf3 family. Members of this family show up near CRISPR repeats in Acidithiobacillus ferrooxidans ATCC 23270, Azoarcus sp. EbN1, and Rhodoferax ferrireducens DSM 15236. In the latter two species, the CRISPR/cas locus is found on a plasmid. This family is one of several characteristic of a type of CRISPR-associated (cas) gene cluster we designate Aferr after A. ferrooxidans, where it is both chromosomal and the only type of cas gene cluster found. The gene is designated csf3 (CRISPR/cas Subtype as in A. ferrooxidans protein 3), as it lies third closest to the repeats. Probab=30.23 E-value=30 Score=15.19 Aligned_cols=21 Identities=24% Similarity=0.363 Sum_probs=17.9 Q ss_pred HHH--HHHHHCCCHHHHHHHHCH Q ss_conf 999--999842767899973189 Q gi|254780815|r 112 EAI--VDSFFEGKVHAIEKLVDS 132 (232) Q Consensus 112 ~~I--~~A~~~gD~~~L~~lls~ 132 (232) +.+ +.||+-||.+.|+.||+. T Consensus 136 e~~~el~awcIGd~e~le~LL~~ 158 (214) T TIGR03116 136 ENAQELAAWCIGDEEALEFLLSQ 158 (214) T ss_pred HHHHHHHHHHCCCHHHHHHHHHH T ss_conf 65765465410779999999988 No 40 >PRK04561 tatA twin arginine translocase protein A; Provisional Probab=29.69 E-value=40 Score=14.35 Aligned_cols=25 Identities=28% Similarity=0.356 Sum_probs=14.5 Q ss_pred CCH-HHHHHHHHHHHHHHHH--HHHHHH Q ss_conf 670-2699999999999999--999864 Q gi|254780815|r 6 MDS-GDFLILLFAFITFFVF--LQLRGV 30 (232) Q Consensus 6 m~~-~~i~Iii~A~IA~fl~--~RLr~v 30 (232) ||+ ..-.+||+++|.+.|| -|||++ T Consensus 1 MG~~Siwh~lIv~~ivvllFGtkkL~~l 28 (75) T PRK04561 1 MGSFSIWHWLVVLVIVLLVFGTKRLTSG 28 (75) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCCHHHH T ss_conf 9855389999999999998365303147 No 41 >PRK02958 tatA twin arginine translocase protein A; Provisional Probab=28.72 E-value=41 Score=14.25 Aligned_cols=25 Identities=32% Similarity=0.424 Sum_probs=14.4 Q ss_pred CCHH-HHHHHHHHHHHHHHH--HHHHHH Q ss_conf 6702-699999999999999--999864 Q gi|254780815|r 6 MDSG-DFLILLFAFITFFVF--LQLRGV 30 (232) Q Consensus 6 m~~~-~i~Iii~A~IA~fl~--~RLr~v 30 (232) ||+- .-.+||+++|.+.|| -||+++ T Consensus 1 MG~~Siwh~livl~iv~llFG~kkL~~l 28 (75) T PRK02958 1 MGSLSIWHWLIVLLIVALVFGTKKLRNI 28 (75) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 9854389999999999998275315468 No 42 >PRK05886 yajC preprotein translocase subunit YajC; Validated Probab=27.93 E-value=43 Score=14.16 Aligned_cols=23 Identities=17% Similarity=0.356 Sum_probs=10.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 67026999999999999999998 Q gi|254780815|r 6 MDSGDFLILLFAFITFFVFLQLR 28 (232) Q Consensus 6 m~~~~i~Iii~A~IA~fl~~RLr 28 (232) |.+-..-+-+++++++|.||-.| T Consensus 1 m~sl~~flPlili~~vf~f~~iR 23 (108) T PRK05886 1 MESLVLFLPFLLIMGGFMYFASR 23 (108) T ss_pred CCHHHHHHHHHHHHHHHHHHHCC T ss_conf 90089999999999999999316 No 43 >pfam05961 Chordopox_A13L Chordopoxvirus A13L protein. This family consists of A13L proteins from the Chordopoxviruses. A13L or p8 is one of the three most abundant membrane proteins of the intracellular mature Vaccinia virus. Probab=27.60 E-value=43 Score=14.12 Aligned_cols=16 Identities=31% Similarity=0.370 Sum_probs=6.8 Q ss_pred HHHHHHHHHHHCCCCC Q ss_conf 9999999986415444 Q gi|254780815|r 20 TFFVFLQLRGVLGKKT 35 (232) Q Consensus 20 A~fl~~RLr~vLG~rt 35 (232) ++.+.+.+|++--+++ T Consensus 12 Vavi~lIvYgiYnkk~ 27 (68) T pfam05961 12 VAIIGLIVYGIYNKKT 27 (68) T ss_pred HHHHHHHHHHHHHCCC T ss_conf 9999999998851345 No 44 >TIGR03053 PS_I_psaM photosystem I reaction center subunit XII. Members of this protein family are PsaM, which is subunit XII of the photosystem I reaction center. This protein is found in both the Cyanobacteria and the chloroplasts of plants, but is absent from non-oxygenic photosynthetic bacteria such as Rhodobacter sphaeroides. Species that contain photosystem I also contain photosystem II, which splits water and releases molecular oxygen. The seed alignment for this model includes sequences from pfam07465 and additional sequences, as from Prochlorococcus. Probab=26.67 E-value=45 Score=14.02 Aligned_cols=24 Identities=33% Similarity=0.376 Sum_probs=18.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 026999999999999999998641 Q gi|254780815|r 8 SGDFLILLFAFITFFVFLQLRGVL 31 (232) Q Consensus 8 ~~~i~Iii~A~IA~fl~~RLr~vL 31 (232) ++.+-.++.|++++++-+||=.-| T Consensus 4 ~Qi~iaL~iAli~~ilAlrLg~~L 27 (29) T TIGR03053 4 SQIFIALVIALIAGILALRLGKEL 27 (29) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 798999999999999999984886 No 45 >pfam02807 ATP-gua_PtransN ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold. Probab=26.38 E-value=43 Score=14.17 Aligned_cols=19 Identities=21% Similarity=0.454 Sum_probs=15.0 Q ss_pred HHHHHHHHCHHHHHHHHHH Q ss_conf 7899973189999999987 Q gi|254780815|r 123 VHAIEKLVDSRVYQDFNDS 141 (232) Q Consensus 123 ~~~L~~lls~~~~~~f~~~ 141 (232) ..-++.+||+++|+.++.. T Consensus 7 ~sll~K~LT~ev~~~lk~k 25 (76) T pfam02807 7 KSLLKKHLTPEVYEKLKDK 25 (76) T ss_pred CHHHHHHCCHHHHHHHHCC T ss_conf 0188886899999998477 No 46 >KOG1585 consensus Probab=25.85 E-value=47 Score=13.92 Aligned_cols=46 Identities=17% Similarity=0.345 Sum_probs=33.9 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCHHHHHHHHHHHHH Q ss_conf 67679899999999999999999842767899973189999999987776 Q gi|254780815|r 95 YTDFDPKDFLNEARNSYEAIVDSFFEGKVHAIEKLVDSRVYQDFNDSLST 144 (232) Q Consensus 95 ~~~Fd~~~Fl~gAk~af~~I~~A~~~gD~~~L~~lls~~~~~~f~~~i~~ 144 (232) -|+|...+=- ..-+.++++|+.||.+.+++.|+...+..+...... T Consensus 219 ip~f~~sed~----r~lenLL~ayd~gD~E~~~kvl~sp~~r~MDneya~ 264 (308) T KOG1585 219 IPAFLKSEDS----RSLENLLTAYDEGDIEEIKKVLSSPTVRNMDNEYAH 264 (308) T ss_pred CCCCCCHHHH----HHHHHHHHHHCCCCHHHHHHHHCCHHHHHHHHHHHH T ss_conf 7664674778----999999987533788999999727176530289999 No 47 >pfam07932 DAP_C D-aminopeptidase, domain C. D-aminopeptidase is a dimeric enzyme with each monomer being composed of three domains. Domain C is organized to form a beta barrel made up of eight antiparallel beta strands. It is connected to domain B by a short linker sequence, and interacts extensively with the domain A, the catalytic domain. The gamma loop of domain C forms part of the wall of the catalytic pocket; domain C is in fact thought to confer substrate and inhibitor specificity to the enzyme. Probab=25.74 E-value=47 Score=13.91 Aligned_cols=24 Identities=29% Similarity=0.796 Sum_probs=18.0 Q ss_pred EEEEEEEE--ECCCCCCC-CEEEEEEE Q ss_conf 99999999--61898998-66895763 Q gi|254780815|r 206 VIDIWTFV--RNIPPSNP-NWVLISTK 229 (232) Q Consensus 206 ~~e~WtF~--R~~~~~~p-~W~L~~iq 229 (232) =.|+|.+. |.+..+.| +|.|+=.. T Consensus 47 g~DvW~l~~~R~mDApaPGdWTlvf~R 73 (97) T pfam07932 47 GPDVWLLACRRSMDAPAPGNWTLVFQR 73 (97) T ss_pred CCCEEEEECCCCCCCCCCCCEEEEEEE T ss_conf 894699864554799889865999996 No 48 >PRK04098 sec-independent translocase; Provisional Probab=25.62 E-value=47 Score=13.90 Aligned_cols=18 Identities=17% Similarity=0.390 Sum_probs=12.4 Q ss_pred CCCHHHHHHHHHHHHHHHHH Q ss_conf 26702699999999999999 Q gi|254780815|r 5 KMDSGDFLILLFAFITFFVF 24 (232) Q Consensus 5 ~m~~~~i~Iii~A~IA~fl~ 24 (232) +||| .+|+++++||.+++ T Consensus 3 ~iGf--~EllvI~vvaLiVl 20 (153) T PRK04098 3 GMGF--FEIIVILVVAIIFL 20 (153) T ss_pred CCCH--HHHHHHHHHHHHHC T ss_conf 7558--99999999999734 No 49 >pfam07465 PsaM Photosystem I protein M (PsaM). This family consists of several plant and cyanobacterial photosystem I protein M (PsaM) sequences. PsaM forms part of the photosystem I complex and its binding is stabilized by PsaI. Probab=24.92 E-value=49 Score=13.81 Aligned_cols=23 Identities=35% Similarity=0.479 Sum_probs=17.7 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 70269999999999999999986 Q gi|254780815|r 7 DSGDFLILLFAFITFFVFLQLRG 29 (232) Q Consensus 7 ~~~~i~Iii~A~IA~fl~~RLr~ 29 (232) .++.+-.++.|.+++++-+||=. T Consensus 3 d~QifiaL~~Ali~~iLA~rLg~ 25 (29) T pfam07465 3 DSQIFIALIIALIPAILALRLGK 25 (29) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 77989999999999999999848 No 50 >pfam05261 Tra_M TraM protein. The TraM protein is an essential part of the DNA transfer machinery of the conjugative resistance plasmid R1 (IncFII). On the basis of mutational analyses, it was shown that the essential transfer protein TraM has at least two functions. First, a functional TraM protein was found to be required for normal levels of transfer gene expression. Second, experimental evidence was obtained that TraM stimulates efficient site-specific single-stranded DNA cleavage at the oriT, in vivo. Furthermore, a specific interaction of the cytoplasmic TraM protein with the membrane protein TraD was demonstrated, suggesting that the TraM protein creates a physical link between the relaxosomal nucleoprotein complex and the membrane-bound DNA transfer apparatus. Probab=24.71 E-value=49 Score=13.79 Aligned_cols=13 Identities=15% Similarity=0.312 Sum_probs=9.3 Q ss_pred HCCCCCCHHHHHH Q ss_conf 1267679899999 Q gi|254780815|r 93 SVYTDFDPKDFLN 105 (232) Q Consensus 93 ~~~~~Fd~~~Fl~ 105 (232) ..+.+|+..+|-+ T Consensus 56 ~KesgFnq~efnK 68 (127) T pfam05261 56 RKESGFNQMEFNK 68 (127) T ss_pred HHCCCCCHHHHHH T ss_conf 7506756899999 No 51 >COG0753 KatE Catalase [Inorganic ion transport and metabolism] Probab=24.47 E-value=49 Score=13.76 Aligned_cols=21 Identities=24% Similarity=0.453 Sum_probs=7.4 Q ss_pred EEEEEEEECCCCCCCCEEEEE Q ss_conf 999999961898998668957 Q gi|254780815|r 207 IDIWTFVRNIPPSNPNWVLIS 227 (232) Q Consensus 207 ~e~WtF~R~~~~~~p~W~L~~ 227 (232) +|+=-|.=..=..+.||-|+| T Consensus 116 RD~RGFA~KfYTeeGNwDlVG 136 (496) T COG0753 116 RDVRGFALKFYTEEGNWDLVG 136 (496) T ss_pred CCCCCCEEEEEECCCCCEEEC T ss_conf 254540688851366510650 No 52 >TIGR02096 TIGR02096 conserved hypothetical protein; InterPro: IPR011721 This entry represents proteins, which are about 135 amino acids in length and largely restricted to the Proteobacteria. This family and a delta5-3-ketosteroid isomerase from Pseudomonas testosteroni appear homologous, especially toward their respective N-termini. Members, therefore, probably are enzymes.. Probab=24.42 E-value=44 Score=14.07 Aligned_cols=25 Identities=20% Similarity=0.391 Sum_probs=20.4 Q ss_pred HHHHHH-HHCCCHHHHHHHHCHHHHH Q ss_conf 999999-8427678999731899999 Q gi|254780815|r 112 EAIVDS-FFEGKVHAIEKLVDSRVYQ 136 (232) Q Consensus 112 ~~I~~A-~~~gD~~~L~~lls~~~~~ 136 (232) ..-++| |++||.+.+-.|++++++= T Consensus 5 ~~~~~as~n~g~~~~~~aL~aeD~~~ 30 (145) T TIGR02096 5 QQWIEASFNSGDMDAVLALLAEDVLY 30 (145) T ss_pred HHHHHHHCCCCCHHHHHHHHHHHHHC T ss_conf 88998514899889999985001100 No 53 >PRK05996 motB flagellar motor protein MotB; Validated Probab=22.95 E-value=53 Score=13.58 Aligned_cols=16 Identities=25% Similarity=0.538 Sum_probs=12.3 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9999999999999864 Q gi|254780815|r 15 LFAFITFFVFLQLRGV 30 (232) Q Consensus 15 i~A~IA~fl~~RLr~v 30 (232) +-||+||||++.|-+. T Consensus 38 mTAMMAFFLvMWLlna 53 (431) T PRK05996 38 MTAMMAFFLVMWLINA 53 (431) T ss_pred HHHHHHHHHHHHHHCC T ss_conf 9999999999998059 No 54 >TIGR02902 spore_lonB ATP-dependent protease LonB; InterPro: IPR014251 This entry represents LonB, a paralog of the ATP-dependent protease La (LonA, IPR004815 from INTERPRO). LonB proteins are unassigned peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SJ) and are found strictly, and almost universally, in endospore-forming bacteria. This protease was shown, in Bacillus subtilis, to be expressed specifically in the forespore during sporulation, under control of sigmaF . The lonB gene, despite being located immediately upstream of lonA, was shown to be monocistronic. LonB appears to be involved in the post-translation control of sigmaH, but lonB mutation did not produce an obvious sporulation defect under the conditions tested . Note that additional paralogs of LonA and LonB occur in the Clostridium lineage and these are excluded from this entry. . Probab=22.42 E-value=54 Score=13.51 Aligned_cols=21 Identities=14% Similarity=0.324 Sum_probs=17.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 269999999999999999986 Q gi|254780815|r 9 GDFLILLFAFITFFVFLQLRG 29 (232) Q Consensus 9 ~~i~Iii~A~IA~fl~~RLr~ 29 (232) ..+++|+..+|..|+|+.||+ T Consensus 4 ~~~q~if~iiIG~YFfn~LK~ 24 (532) T TIGR02902 4 AIVQLIFAIIIGLYFFNLLKN 24 (532) T ss_pred EEHHHHHHHHHHHHHHHHHHC T ss_conf 642589999999999975440 No 55 >pfam11444 DUF2895 Protein of unknown function (DUF2895). This is a bacterial family of uncharacterized proteins. Probab=22.24 E-value=55 Score=13.49 Aligned_cols=37 Identities=8% Similarity=0.069 Sum_probs=28.1 Q ss_pred CCHHHHHHHHCHHHHHHHHHHHHHHHHCCC-CCEEEEE Q ss_conf 767899973189999999987776443498-2023662 Q gi|254780815|r 121 GKVHAIEKLVDSRVYQDFNDSLSTRKSNEK-NVKSSLV 157 (232) Q Consensus 121 gD~~~L~~lls~~~~~~f~~~i~~r~~~~~-~~~~~~v 157 (232) .++..|+.|+||.....+.+....|...|+ +.....+ T Consensus 87 ~ni~~l~~YlTP~c~~~L~~d~~~r~~~geLr~R~R~v 124 (199) T pfam11444 87 ANIERLSPYLTPSCQAFLQKDYELRLLNGELRDRVRGV 124 (199) T ss_pred HHHHHHCCCCCHHHHHHHHHHHHHHHHCCHHHHHHEEE T ss_conf 99986114459789999999999998621477541002 No 56 >PRK13428 F0F1 ATP synthase subunit delta; Provisional Probab=22.01 E-value=55 Score=13.46 Aligned_cols=21 Identities=24% Similarity=0.311 Sum_probs=13.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHH Q ss_conf 6702699999999999999999 Q gi|254780815|r 6 MDSGDFLILLFAFITFFVFLQL 27 (232) Q Consensus 6 m~~~~i~Iii~A~IA~fl~~RL 27 (232) |+--.-+||-||+| +|+||++ T Consensus 1 m~~fIgqLI~Faii-~f~~~Kf 21 (445) T PRK13428 1 MSTFIGQLIGFAVI-VFLVVRF 21 (445) T ss_pred CCHHHHHHHHHHHH-HHHHHHH T ss_conf 91399999999999-9999999 No 57 >pfam08769 Spo0A_C Sporulation initiation factor Spo0A C terminal. The response regulator Spo0A is comprised of a phophoacceptor domain and a transcription activation domain. This domain corresponds to the transcription activation domain and forms an alpha helical structure comprising of 6 alpha helices. The structure contains a helix-turn-helix and binds DNA. Probab=21.83 E-value=56 Score=13.43 Aligned_cols=23 Identities=26% Similarity=0.442 Sum_probs=18.7 Q ss_pred HHHHHHHHCCCHHHHHHHHCHHH Q ss_conf 99999984276789997318999 Q gi|254780815|r 112 EAIVDSFFEGKVHAIEKLVDSRV 134 (232) Q Consensus 112 ~~I~~A~~~gD~~~L~~lls~~~ 134 (232) -.|.-||.+||.+.|..+..-.+ T Consensus 63 ~aIe~aW~~g~~e~l~~~fg~~~ 85 (106) T pfam08769 63 HAIEVAWTRGNVDTLDRLFGYTV 85 (106) T ss_pred HHHHHHHHCCCHHHHHHHHCCCC T ss_conf 99999998599999999959736 No 58 >PRK12799 motB flagellar motor protein MotB; Reviewed Probab=21.81 E-value=56 Score=13.43 Aligned_cols=16 Identities=31% Similarity=0.613 Sum_probs=11.6 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9999999999999864 Q gi|254780815|r 15 LFAFITFFVFLQLRGV 30 (232) Q Consensus 15 i~A~IA~fl~~RLr~v 30 (232) +.||+||||++.|-++ T Consensus 35 vTAMMAFFLVLWLL~s 50 (424) T PRK12799 35 MTAMMAFFLVMWLLAV 50 (424) T ss_pred HHHHHHHHHHHHHHCC T ss_conf 9999999999998548 No 59 >pfam11714 Inhibitor_I53 Thrombin inhibitor Madanin. Members of this family are the peptidase inhibitor madanin proteins. These proteins were isolated from tick saliva. Probab=21.48 E-value=36 Score=14.67 Aligned_cols=16 Identities=31% Similarity=0.538 Sum_probs=12.3 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9999999999999999 Q gi|254780815|r 11 FLILLFAFITFFVFLQ 26 (232) Q Consensus 11 i~Iii~A~IA~fl~~R 26 (232) |-|||||++|-.++.- T Consensus 4 faililavvasavvma 19 (78) T pfam11714 4 FAILILAVVASAVVMA 19 (78) T ss_pred EEHHHHHHHHHHHHHC T ss_conf 3214899988788620 No 60 >pfam09214 Prd1-P2 Bacteriophage Prd1, adsorption protein P2. Members of this family form a set of bacteriophage adsorption proteins, composed mainly of beta-strands whose complicated topology forms an elongated seahorse-shaped molecule with a distinct head, containing a pseudo-beta propeller structure with approximate 6-fold symmetry, and tail. They are required for the attachment of the phage to the host conjugative DNA transfer complex. This is a poorly understood large transmembrane complex of unknown architecture, with at least 11 different proteins. Probab=21.12 E-value=46 Score=13.97 Aligned_cols=16 Identities=31% Similarity=0.808 Sum_probs=13.2 Q ss_pred EEEEEECCCCCCCCEE Q ss_conf 9999961898998668 Q gi|254780815|r 209 IWTFVRNIPPSNPNWV 224 (232) Q Consensus 209 ~WtF~R~~~~~~p~W~ 224 (232) .+.|.|++..+||||- T Consensus 189 afifdkpvtepnpnwp 204 (560) T pfam09214 189 AFIFDKPVTEPNPNWP 204 (560) T ss_pred EEEECCCCCCCCCCCC T ss_conf 8980577788998989 No 61 >TIGR02875 spore_0_A sporulation transcription factor Spo0A; InterPro: IPR012052 Members of this group are response regulators/transcription factors that contain CheY-like receiver (phosphoacceptor) domain and a unique DNA-binding domain. Spo0A controls the entry of Bacillus subtilis into the developmental process of sporulation . Activation of the Spo0A transcription factor by phosphorylation serves as a developmental checkpoint and to integrate several physiological signals that control entry into the sporulation pathway. The signals are generated by conditions of nutrient deprivation, high cell density, the Krebs cycle, DNA replication, DNA damage, and some aspect of the chromosome partitioning machinery . Activated Spo0A has multiple functions . It represses promoters (such as the abrB promoter) by binding to sites (0A boxes) downstream from the transcription start site. Spo0A also activates transcription from promoters used by two forms of RNA polymerase that differ by containing either the major sigma factor, sigma A (e.g. the spoIIE and spoIIG promoters) or the alternate sigma factor, sigma H (e.g. the spoIIA promoter). At promoters activated by Spo0A, the 0A boxes lie upstream of the transcription-initiation site.; GO: 0003677 DNA binding, 0005509 calcium ion binding, 0000160 two-component signal transduction system (phosphorelay), 0042173 regulation of sporulation, 0045449 regulation of transcription, 0050906 detection of stimulus during sensory perception. Probab=21.09 E-value=26 Score=15.49 Aligned_cols=20 Identities=25% Similarity=0.358 Sum_probs=13.6 Q ss_pred HHHHHHCCCHHHHHHHHCHH Q ss_conf 99998427678999731899 Q gi|254780815|r 114 IVDSFFEGKVHAIEKLVDSR 133 (232) Q Consensus 114 I~~A~~~gD~~~L~~lls~~ 133 (232) |--||++|++|.|++|-.-. T Consensus 223 IEvAWsRG~~E~i~klFGYT 242 (270) T TIGR02875 223 IEVAWSRGNIELINKLFGYT 242 (270) T ss_pred HHHCCCCCCCCCCHHHHCCC T ss_conf 34202588621015551566 No 62 >PRK01770 sec-independent translocase; Provisional Probab=20.99 E-value=54 Score=13.53 Aligned_cols=17 Identities=29% Similarity=0.272 Sum_probs=11.6 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 02699999999999999 Q gi|254780815|r 8 SGDFLILLFAFITFFVF 24 (232) Q Consensus 8 ~~~i~Iii~A~IA~fl~ 24 (232) .+..+|+|+++||.+++ T Consensus 4 iG~~ElllI~vvaLvVl 20 (171) T PRK01770 4 IGFSELLLVFVIGLVVL 20 (171) T ss_pred CCHHHHHHHHHHHHHEE T ss_conf 56899999999998405 No 63 >TIGR01165 cbiN cobalt transport protein; InterPro: IPR003705 The cobalt transport protein CbiN is part of the active cobalt transport system involved in uptake of cobalt in to the cell involved with cobalamin biosynthesis (vitamin B12). It has been suggested that CbiN may function as the periplasmic binding protein component of the active cobalt transport system .; GO: 0015087 cobalt ion transmembrane transporter activity, 0006824 cobalt ion transport, 0009236 cobalamin biosynthetic process, 0016020 membrane. Probab=20.43 E-value=60 Score=13.25 Aligned_cols=22 Identities=36% Similarity=0.360 Sum_probs=12.6 Q ss_pred HHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9999999999999998641544 Q gi|254780815|r 13 ILLFAFITFFVFLQLRGVLGKK 34 (232) Q Consensus 13 Iii~A~IA~fl~~RLr~vLG~r 34 (232) -+||+++|+.++.=|--..|+. T Consensus 9 a~LL~~va~L~~lPL~i~~G~~ 30 (94) T TIGR01165 9 ALLLAAVAALVVLPLLIYAGKG 30 (94) T ss_pred HHHHHHHHHHHHHHHHHHCCCC T ss_conf 9999999999999998711675 No 64 >pfam10694 DUF2500 Protein of unknown function (DUF2500). The members of this family are largely confined to the Gammaproteobacteria. The function is not known. Probab=20.13 E-value=60 Score=13.21 Aligned_cols=18 Identities=22% Similarity=0.473 Sum_probs=8.0 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999998 Q gi|254780815|r 11 FLILLFAFITFFVFLQLR 28 (232) Q Consensus 11 i~Iii~A~IA~fl~~RLr 28 (232) +.+++++++|.|.|++.+ T Consensus 6 iv~~iii~~~~~~~~~~~ 23 (107) T pfam10694 6 LVIIAIIGLAIFVFVRQI 23 (107) T ss_pred HHHHHHHHHHHHHHHHHH T ss_conf 389999999999999998 Done!