Query gi|254781178|ref|YP_003065591.1| cell division protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 304 No_of_seqs 193 out of 1682 Neff 6.5 Searched_HMMs 39220 Date Mon May 30 06:16:13 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781178.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG1589 FtsQ Cell division sep 100.0 6E-41 1.4E-45 280.7 25.1 246 46-292 17-267 (269) 2 PRK10775 cell division protein 100.0 3.9E-37 9.9E-42 256.4 21.4 220 66-290 36-265 (274) 3 PRK05529 cell division protein 100.0 4E-27 1E-31 192.7 26.9 231 33-279 17-254 (255) 4 pfam03799 FtsQ Cell division p 99.6 2.4E-15 6.2E-20 117.6 11.0 112 159-272 1-115 (115) 5 pfam08478 POTRA_1 POTRA domain 99.6 6.1E-15 1.6E-19 115.1 9.8 67 90-156 1-67 (67) 6 COG4775 Outer membrane protein 98.2 5.4E-05 1.4E-09 51.7 11.6 84 82-166 18-101 (766) 7 PRK11067 outer membrane protei 98.0 0.00014 3.6E-09 49.0 10.4 81 85-166 21-101 (801) 8 TIGR03303 OM_YaeT outer membra 97.6 0.00053 1.3E-08 45.4 9.0 78 88-166 1-78 (741) 9 pfam06898 YqfD Putative stage 96.9 0.037 9.3E-07 33.7 12.6 147 30-195 54-228 (383) 10 PRK11067 outer membrane protei 95.9 0.02 5.2E-07 35.3 5.5 89 86-177 345-439 (801) 11 TIGR03303 OM_YaeT outer membra 95.4 0.098 2.5E-06 30.9 7.3 74 89-162 70-153 (741) 12 pfam08479 POTRA_2 POTRA domain 94.1 0.18 4.7E-06 29.2 6.0 68 88-155 1-75 (76) 13 pfam07244 Surf_Ag_VNR Surface 93.9 0.26 6.6E-06 28.3 6.5 69 88-156 1-78 (78) 14 COG4775 Outer membrane protein 92.8 0.4 1E-05 27.1 6.0 67 89-155 93-166 (766) 15 COG2831 FhaC Hemolysin activat 89.1 0.93 2.4E-05 24.7 5.0 73 86-158 71-150 (554) 16 TIGR02876 spore_yqfD sporulati 86.4 2.6 6.7E-05 21.8 11.5 117 42-177 75-202 (406) 17 COG0729 Outer membrane protein 82.9 3.7 9.4E-05 20.9 5.4 78 84-161 196-280 (594) 18 PRK12585 putative monovalent c 79.6 1 2.6E-05 24.4 1.6 100 11-117 7-126 (197) 19 TIGR01734 D-ala-DACP-lig D-ala 65.4 11 0.00027 18.0 5.8 101 153-259 319-433 (513) 20 TIGR00998 8a0101 efflux pump m 60.1 11 0.00028 17.9 3.3 77 50-129 2-82 (379) 21 KOG2602 consensus 59.7 13 0.00034 17.4 4.5 74 89-162 37-120 (457) 22 PRK12586 putative monovalent c 57.5 13 0.00034 17.3 3.4 57 14-76 13-70 (145) 23 COG2362 DppA D-aminopeptidase 54.2 16 0.00042 16.8 5.7 60 84-145 117-178 (274) 24 KOG2333 consensus 51.0 13 0.00034 17.4 2.5 40 116-160 367-408 (614) 25 pfam04951 Peptidase_M55 D-amin 45.3 23 0.00057 15.9 6.3 59 86-145 119-179 (265) 26 cd04405 RhoGAP_BRCC3-like RhoG 43.4 24 0.00061 15.7 3.0 23 134-156 70-93 (235) 27 PHA02047 phage lambda Rz1-like 34.5 31 0.00079 15.0 2.3 87 48-136 1-100 (109) 28 PRK11644 sensory histidine kin 34.1 34 0.00086 14.8 6.4 23 24-46 187-216 (497) 29 PRK05182 DNA-directed RNA poly 33.5 9.6 0.00025 18.3 -0.5 33 66-100 29-61 (306) 30 PRK12671 putative monovalent c 33.4 35 0.00088 14.7 3.6 58 14-76 16-73 (120) 31 TIGR02159 PA_CoA_Oxy4 phenylac 33.3 35 0.00088 14.7 2.7 23 123-145 46-68 (152) 32 PRK13361 molybdenum cofactor b 32.7 35 0.0009 14.7 5.4 12 28-39 44-55 (329) 33 pfam10081 DUF2319 Predicted me 32.0 36 0.00093 14.6 6.2 53 14-66 113-166 (534) 34 TIGR02283 MltB_2 lytic murein 29.9 12 0.00031 17.6 -0.4 165 31-211 75-295 (313) 35 pfam01883 DUF59 Domain of unkn 29.3 40 0.001 14.3 3.0 26 117-142 50-75 (76) 36 KOG3090 consensus 27.9 34 0.00086 14.8 1.6 23 48-70 13-35 (290) 37 KOG3922 consensus 26.6 45 0.0011 14.0 3.6 21 105-125 96-116 (361) 38 TIGR02027 rpoA DNA-directed RN 25.2 21 0.00052 16.2 0.0 125 66-199 12-153 (324) 39 TIGR02350 prok_dnaK chaperone 24.7 37 0.00093 14.6 1.3 131 156-294 84-267 (598) 40 pfam06350 HSL_N Hormone-sensit 23.5 51 0.0013 13.6 3.4 75 32-131 175-254 (315) 41 pfam10309 DUF2414 Protein of u 23.2 52 0.0013 13.6 2.2 49 89-148 4-52 (62) 42 TIGR02941 Sigma_B RNA polymera 23.0 53 0.0013 13.6 2.1 51 72-122 116-166 (256) 43 KOG0290 consensus 22.0 43 0.0011 14.1 1.2 41 244-284 240-286 (364) 44 COG5102 SFT2 Membrane protein 22.0 55 0.0014 13.4 4.2 49 23-72 71-119 (201) 45 PRK03598 hypothetical protein; 21.8 32 0.00081 14.9 0.5 71 50-131 3-84 (331) 46 KOG4634 consensus 21.1 57 0.0015 13.3 2.7 26 35-60 3-28 (105) 47 smart00662 RPOLD RNA polymeras 20.8 58 0.0015 13.3 3.4 33 81-113 21-54 (224) 48 PRK00164 moaA molybdenum cofac 20.5 59 0.0015 13.2 5.0 16 26-41 48-63 (334) 49 cd03529 Rieske_NirD Assimilato 20.5 59 0.0015 13.2 3.7 40 135-174 1-41 (103) 50 KOG1665 consensus 20.3 51 0.0013 13.7 1.2 32 164-195 50-82 (302) 51 PRK07394 hypothetical protein; 20.1 60 0.0015 13.2 1.9 53 76-135 101-159 (342) No 1 >COG1589 FtsQ Cell division septal protein [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=6e-41 Score=280.70 Aligned_cols=246 Identities=30% Similarity=0.504 Sum_probs=219.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCC-EEEEEEEEECCCCCCHHHHHHHHCCCCCCCEEEECHH Q ss_conf 871357799999999999999887315608999999742108-1254899977654898899998088889843887599 Q gi|254781178|r 46 KVLPSYCGVILAIFFFAIVGIYGASIGGHTRKVIDIVDSFIG-FSIEKVRIIGNVETPEADIIHCLDLNTSTSLIFFDAI 124 (304) Q Consensus 46 ~~~p~~~g~~~~~~~~~~~~~~g~~~gg~~~~~~~~~~~~~g-F~I~~I~V~Gn~~ls~~eIl~~l~i~~~~sl~~ld~~ 124 (304) ...|.+.+.....++....+.|++..+++...+........| |+|++|.|+||.++++++|++.+++..++++|.+|.+ T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~v~v~Gn~~~s~~~I~~~~~l~~~~~~~~ld~~ 96 (269) T COG1589 17 AKIPTLNKTRKRGVIRLLIGLYLVLLLLVLVVLWVLILLSLPYFPIRKVSVSGNNQVSEEDILKALGLDGGTSFLTLDLN 96 (269) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHCCCCCCCEEEECHH T ss_conf 12433047888889999999999999999998874334332677520899967843699999998575689854887699 Q ss_pred HHHHHHHCCCCEEEEEEEEECCCEEEEEEEEEEEEEEEEECCEEEEEECCCC-CCCCCCCCCCCCCCCCCC-CCHHHHHH Q ss_conf 9999987197622799998289819999993037999995792799705641-032100011245431137-76678899 Q gi|254781178|r 125 KIQKQLLALPWIAHAEIRRLYPDTMEIRLTERHPYAIWQNNSALYLIDNNGY-VITAFNHVRFAYLPILIG-ENIYKAVR 202 (304) Q Consensus 125 ~i~~~L~~~pwIk~a~V~r~~P~tL~I~I~Er~PvA~~~~~g~~~~Id~~G~-Vi~~~~~~~~~~lPli~G-~~~~~~~~ 202 (304) +++++|+++|||++|+|+|+||||+.|.|+||+|+|+|+.+++++++|++|. ...+........+|.+.| ++.+..+. T Consensus 97 ~~~~~i~~~PwVk~a~V~r~~Pntv~I~v~Er~piA~w~~~~~~~~i~~~G~~~~~~~~~~~~~~lp~~~g~~~~~~~v~ 176 (269) T COG1589 97 AIRENIEKLPWVKSAEVRRQFPNTLEIEVVEREPIAYWQRNDDYHLIDENGNLFIVPSDRPKLPVLPLLVGPEGADPKVK 176 (269) T ss_pred HHHHHHHHCCCEEEEEEEEECCCCEEEEEEEEEEEEEEEECCCCCEECCCCCEEECCCCCCCCCCCEEECCCCCCCHHHH T ss_conf 99999862976579999983899479999985037999707986433278858762566666652133438630037799 Q ss_pred HH-HHHHHHHHHCCCEEEEEEECCCE-EEEEECCCEEEEECCCCHHHHHHHHHHHHHHHCHHCCCEEEEEEECCCEEEEE Q ss_conf 99-87664013103303899825880-58997498289808511799999999987630100088338961129836999 Q gi|254781178|r 203 SF-EVLSNIAGITKFVKAYNWIAERR-WDLHLHNGIIIKLPEEKFDVAIAKILELQNKYQILDRDISVIDMRLPDRLSVR 280 (304) Q Consensus 203 ~~-~~L~~~~~l~~~i~~~~~v~~rr-WdL~l~nGi~IkLp~~~~~~~L~rL~~l~~~~k~~~r~I~~IDLR~pdri~Vr 280 (304) .+ .....++.....+......++++ |+|.++||+.+++|..+...+++++.++++.....++ |.+||||+|++++|+ T Consensus 177 ~~~~~~~~l~~~~~~~~~~~~~~~r~~~~l~l~~g~~v~l~~~~~~~~l~~~~~l~~~l~~~~~-i~~iDlr~~~~~~V~ 255 (269) T COG1589 177 ALLEQLSALLALLGEVIKAIRKTDRRSWDLTLNDGLVVKLGRSDLAKRLARFVKLYKQLSQPSR-ISYIDLRYPDGFTVR 255 (269) T ss_pred HHHHHHHHHHHHHCCEEEEEECCCCCEEEEECCCCCEEEECHHHHHHHHHHHHHHHHHHCCCCC-CEEEEEECCCCCEEE T ss_conf 8887789888752101464103774207999189838996466798888889999986354345-259998549864686 Q ss_pred ECCCCHHHHHHH Q ss_conf 768421558999 Q gi|254781178|r 281 LTTGSFIDRRDI 292 (304) Q Consensus 281 l~~~a~~~r~~~ 292 (304) +++.+.+.+++. T Consensus 256 ~~~~~~~~~~~~ 267 (269) T COG1589 256 LAPEPEEEKEEE 267 (269) T ss_pred ECCCCCHHHHCC T ss_conf 057652112112 No 2 >PRK10775 cell division protein FtsQ; Provisional Probab=100.00 E-value=3.9e-37 Score=256.43 Aligned_cols=220 Identities=23% Similarity=0.363 Sum_probs=180.1 Q ss_pred HHHHHCCCCHHHHHHHHHHHCCEEEEEEEEECC-CCCCHHHHHH-HHCCCCCCCEEEECHHHHHHHHHCCCCEEEEEEEE Q ss_conf 988731560899999974210812548999776-5489889999-80888898438875999999987197622799998 Q gi|254781178|r 66 IYGASIGGHTRKVIDIVDSFIGFSIEKVRIIGN-VETPEADIIH-CLDLNTSTSLIFFDAIKIQKQLLALPWIAHAEIRR 143 (304) Q Consensus 66 ~~g~~~gg~~~~~~~~~~~~~gF~I~~I~V~Gn-~~ls~~eIl~-~l~i~~~~sl~~ld~~~i~~~L~~~pwIk~a~V~r 143 (304) ..+++-||.. +.+=+..+--|.|++|.|.|. .+++.+||.+ .+......|+|++|++++++.++++|||++|+|+| T Consensus 36 i~~~~~~~~~--~~~wm~d~~~lPis~i~v~G~~~~~~~~dir~ai~~~~~~G~Fft~Dl~~iq~~le~lPWV~~asVRR 113 (274) T PRK10775 36 LGTVLVSGWV--VLGWMEDAQRLPLSKLVLTGERHYTRNDDIRQAILALGAPGTFMTQDVNIIQQQIERLPWIKQVSVRK 113 (274) T ss_pred HHHHHHHHHH--HHHHHCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHCCCCCCEEEECHHHHHHHHHHCCCEEEEEEEE T ss_conf 9999998999--99886075438853899903004179999999998447899757506999999998499933788998 Q ss_pred ECCCEEEEEEEEEEEEEEEEECCEEEEEECCCCCCCC-CCCCCCCCCCCCCCCCHHH--HHHHHHHHHHHHH-HCCCEEE Q ss_conf 2898199999930379999957927997056410321-0001124543113776678--8999987664013-1033038 Q gi|254781178|r 144 LYPDTMEIRLTERHPYAIWQNNSALYLIDNNGYVITA-FNHVRFAYLPILIGENIYK--AVRSFEVLSNIAG-ITKFVKA 219 (304) Q Consensus 144 ~~P~tL~I~I~Er~PvA~~~~~g~~~~Id~~G~Vi~~-~~~~~~~~lPli~G~~~~~--~~~~~~~L~~~~~-l~~~i~~ 219 (304) .||++|.|.|+|++|+|+| |+..++|.+|.+|.. .......+||.+.|++... ....|..+..... ..-++.+ T Consensus 114 ~WPd~L~V~v~E~~PvArW---n~~~lln~~G~vF~~p~~~~~~~~LP~L~GP~gs~~~Vl~~Y~~l~~~L~~~gl~I~~ 190 (274) T PRK10775 114 QWPDELKIHLVEYVPIARW---NDQHMVDAEGNTFSVPSDRTGKQVLPMLYGPEGSENEVLQGYREMGQVLAKDKFTLKE 190 (274) T ss_pred ECCCCEEEEEEEECCEEEE---CCCCCCCCCCEEEECCCCCCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 4899609999985348998---9987276898098667000455688557899987999999999999999877987689 Q ss_pred EEEECCCEEEEEECCCEEEEECCCCHHHHHHHHHHHHHHH----CHHCCCEEEEEEECCCEEEEEECCCCHHHHH Q ss_conf 9982588058997498289808511799999999987630----1000883389611298369997684215589 Q gi|254781178|r 220 YNWIAERRWDLHLHNGIIIKLPEEKFDVAIAKILELQNKY----QILDRDISVIDMRLPDRLSVRLTTGSFIDRR 290 (304) Q Consensus 220 ~~~v~~rrWdL~l~nGi~IkLp~~~~~~~L~rL~~l~~~~----k~~~r~I~~IDLR~pdri~Vrl~~~a~~~r~ 290 (304) +.....+.|.+.++||+.+.||+++..++|+||..++... +...+.|+|||||||++++|+|.|.++++.. T Consensus 191 L~l~~R~aW~l~L~nGi~L~LGR~~~~~RL~RFi~vyP~L~~~~~~~~k~I~~VDLRY~nG~AV~W~p~~~~~~~ 265 (274) T PRK10775 191 AAMTARRSWQLTLNNDIKLNLGRGDTMKRLARFVELYPVLQQQAQTDGKRISYVDLRYDSGAAVGWAPLPPEESN 265 (274) T ss_pred EEECCCCEEEEEECCCEEEEECCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCEEEECCCCCCCCC T ss_conf 997578618999669819997870589999999999999997620344734699766899708971158987654 No 3 >PRK05529 cell division protein FtsQ; Provisional Probab=99.97 E-value=4e-27 Score=192.65 Aligned_cols=231 Identities=18% Similarity=0.218 Sum_probs=182.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHCC Q ss_conf 99999999999988713577999999999999998873156089999997421081254899977654898899998088 Q gi|254781178|r 33 EMRNFLNFCVFLEKVLPSYCGVILAIFFFAIVGIYGASIGGHTRKVIDIVDSFIGFSIEKVRIIGNVETPEADIIHCLDL 112 (304) Q Consensus 33 ~~~~~~~~~~~~~~~~p~~~g~~~~~~~~~~~~~~g~~~gg~~~~~~~~~~~~~gF~I~~I~V~Gn~~ls~~eIl~~l~i 112 (304) |-|++-|-+-|..+.--|+++-+++++-+.+..++.. .+...+=|++++|+|+||.++++++|.++++. T Consensus 17 ~~~~~~~~~~~~~~~r~r~~~~~~~~g~v~~l~~~v~-----------~~~fSpll~v~~I~V~G~~~~~~~~V~~a~~~ 85 (255) T PRK05529 17 ERKDYERVRRFTTRIRRRFILLACAVGAVLTLLLFVM-----------LSAYSPLLALRSIEVAGNMRVKPQDIVAALRD 85 (255) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------HHHHCCCEEEEEEEEECCCCCCHHHHHHHHCC T ss_conf 8888999999999888888989999999999999999-----------85417546988999989834799999998658 Q ss_pred CCCCCEEEECHHHHHHHHHCCCCEEEEEEEEECCCEEEEEEEEEEEEEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCC Q ss_conf 88984388759999999871976227999982898199999930379999957927997056410321000112454311 Q gi|254781178|r 113 NTSTSLIFFDAIKIQKQLLALPWIAHAEIRRLYPDTMEIRLTERHPYAIWQNNSALYLIDNNGYVITAFNHVRFAYLPIL 192 (304) Q Consensus 113 ~~~~sl~~ld~~~i~~~L~~~pwIk~a~V~r~~P~tL~I~I~Er~PvA~~~~~g~~~~Id~~G~Vi~~~~~~~~~~lPli 192 (304) ..|++|+.+|.++++++|+++|||++|+|+|.||++|.|+|+||+|+|.++.++++.++|.+|.+|.... .....+|.+ T Consensus 86 ~~G~pL~~vd~~~v~~~l~~lp~V~sa~V~r~~P~~l~I~V~ER~Pva~v~~~~g~~lVD~~GV~~~~~~-~~P~glP~i 164 (255) T PRK05529 86 QFGKPLPLVDPETVRKKLAAFPLIRSYSVESKPPGTIVVRVVERVPLAFIQRGDGFHVVDAAGVSIKVTQ-ARPLGMPLI 164 (255) T ss_pred CCCCEEEEECHHHHHHHHHCCCCEEEEEEEECCCCCEEEEEEEEEEEEEEEECCCEEEECCCCCEEEECC-CCCCCCCEE T ss_conf 7898513886999999874499855899996397926999995052799982895299989997985058-899897569 Q ss_pred CCCCH----HHHHHHHHHHHHHHH-HCCCEEEEEEECCCEEEEEECC-CEEEEECC-CCHHHHHHHHHHHHHHHCHHCCC Q ss_conf 37766----788999987664013-1033038998258805899749-82898085-11799999999987630100088 Q gi|254781178|r 193 IGENI----YKAVRSFEVLSNIAG-ITKFVKAYNWIAERRWDLHLHN-GIIIKLPE-EKFDVAIAKILELQNKYQILDRD 265 (304) Q Consensus 193 ~G~~~----~~~~~~~~~L~~~~~-l~~~i~~~~~v~~rrWdL~l~n-Gi~IkLp~-~~~~~~L~rL~~l~~~~k~~~r~ 265 (304) .+... +........+...|+ +..++.++.-.+.....+.|++ |.+|.+|. ++...|.+-|..+.+. ..+ T Consensus 165 ~v~~~~~~~~~~~aA~~Vl~aLP~~lr~qV~~V~A~s~d~V~l~L~~~GrtV~WGsae~~~~Ka~vL~aLL~~----~g~ 240 (255) T PRK05529 165 DVPKDGNQEQLFAAAAWVLAALPADLKKQVDLISAETPDDVNLTLRNSGKTVFWGSFENSKLKAKVLEALLKI----PGH 240 (255) T ss_pred EECCCCCCCHHHHHHHHHHHHCCHHHHHHEEEEEECCCCCEEEEEECCCEEEECCCCCHHHHHHHHHHHHHCC----CCC T ss_conf 8158988678899999999969999984205999547661899997798899828822677799999999708----788 Q ss_pred EEEEEEECCCEEEE Q ss_conf 33896112983699 Q gi|254781178|r 266 ISVIDMRLPDRLSV 279 (304) Q Consensus 266 I~~IDLR~pdri~V 279 (304) ..++|.+.|++|+| T Consensus 241 ~~~~DVSaP~~PvV 254 (255) T PRK05529 241 YSSYDVSAPEAPAV 254 (255) T ss_pred CCEEECCCCCCCCC T ss_conf 84887689999878 No 4 >pfam03799 FtsQ Cell division protein FtsQ. FtsQ is one of several cell division proteins. FtsQ interacts with other Fts proteins, reviewed in. The precise function of FtsQ is unknown. Probab=99.64 E-value=2.4e-15 Score=117.62 Aligned_cols=112 Identities=32% Similarity=0.528 Sum_probs=96.0 Q ss_pred EEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCHHHH--HHHHHHHHHH-HHHCCCEEEEEEECCCEEEEEECCC Q ss_conf 999995792799705641032100011245431137766788--9999876640-1310330389982588058997498 Q gi|254781178|r 159 YAIWQNNSALYLIDNNGYVITAFNHVRFAYLPILIGENIYKA--VRSFEVLSNI-AGITKFVKAYNWIAERRWDLHLHNG 235 (304) Q Consensus 159 vA~~~~~g~~~~Id~~G~Vi~~~~~~~~~~lPli~G~~~~~~--~~~~~~L~~~-~~l~~~i~~~~~v~~rrWdL~l~nG 235 (304) +|+|+.++++++||.+|.+|.+.. ....++|++.|.+.... ...+..+... +++..++.++.+.+.++|+++++|| T Consensus 1 vA~w~~~~~~~lvd~~G~vf~~~~-~~~~~LP~l~G~~~~~~~~~~~~~~l~~~~~~~~~~I~~i~~~~~~~w~l~l~~G 79 (115) T pfam03799 1 VAYWQDGGGYYLIDEEGEVFEASV-ARPDDLPLLVGPGASEKAVLALLKELNALLPPLGLRVSELVLTPRRRWDLVLDDG 79 (115) T ss_pred CEEEECCCCCEEECCCCEEECCCC-CCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEEECCC T ss_conf 969723998779999980984684-4313785898899768999999999999868603605799997898799998899 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHCHHCCCEEEEEEE Q ss_conf 2898085117999999999876301000883389611 Q gi|254781178|r 236 IIIKLPEEKFDVAIAKILELQNKYQILDRDISVIDMR 272 (304) Q Consensus 236 i~IkLp~~~~~~~L~rL~~l~~~~k~~~r~I~~IDLR 272 (304) +.|+||++++.+++++|.++.+.. ..++++++|||| T Consensus 80 ~~V~lg~~~~~~kl~r~~~l~~~~-~~~~~i~~iDlr 115 (115) T pfam03799 80 IEVRLGREDLAKRLARFPALYPQK-LLGRDIAYIDLR 115 (115) T ss_pred CEEEECCCCHHHHHHHHHHHHHHH-HHCCCCEEEECC T ss_conf 289957869999999999985734-631996188695 No 5 >pfam08478 POTRA_1 POTRA domain, FtsQ-type. FtsQ/DivIB bacterial division proteins (pfam03799) contain an N-terminal POTRA domain (for polypeptide-transport-associated domain). This is found in different types of proteins, usually associated with a transmembrane beta-barrel. FtsQ/DivIB may have chaperone-like roles, which has also been postulated for the POTRA domain in other contexts. Probab=99.60 E-value=6.1e-15 Score=115.05 Aligned_cols=67 Identities=37% Similarity=0.658 Sum_probs=65.8 Q ss_pred EEEEEEECCCCCCHHHHHHHHCCCCCCCEEEECHHHHHHHHHCCCCEEEEEEEEECCCEEEEEEEEE Q ss_conf 5489997765489889999808888984388759999999871976227999982898199999930 Q gi|254781178|r 90 IEKVRIIGNVETPEADIIHCLDLNTSTSLIFFDAIKIQKQLLALPWIAHAEIRRLYPDTMEIRLTER 156 (304) Q Consensus 90 I~~I~V~Gn~~ls~~eIl~~l~i~~~~sl~~ld~~~i~~~L~~~pwIk~a~V~r~~P~tL~I~I~Er 156 (304) |++|.|+||.++++++|++.+++..|+|+|.+|.++++++|+++|||++|+|+|.||++|.|+|+|| T Consensus 1 i~~I~V~G~~~~~~~~i~~~~~~~~~~~~~~i~~~~i~~~l~~~p~I~~a~V~r~~P~~l~I~I~Er 67 (67) T pfam08478 1 IRQVEVSGNKHVSAEEIRKALGIQKGTSFFSVDLNAIEDRLEKLPWIKSASVRRQWPNTLEIRVVER 67 (67) T ss_pred CCEEEEECCCCCCHHHHHHHHCCCCCCCEEEECHHHHHHHHHCCCCEEEEEEEEECCCEEEEEEEEC T ss_conf 9299999983479999999767658992799989999998747887269999997799899999969 No 6 >COG4775 Outer membrane protein/protective antigen OMA87 [Cell envelope biogenesis, outer membrane] Probab=98.18 E-value=5.4e-05 Score=51.69 Aligned_cols=84 Identities=14% Similarity=0.137 Sum_probs=78.3 Q ss_pred HHHHCCEEEEEEEEECCCCCCHHHHHHHHCCCCCCCEEEECHHHHHHHHHCCCCEEEEEEEEECCCEEEEEEEEEEEEEE Q ss_conf 74210812548999776548988999980888898438875999999987197622799998289819999993037999 Q gi|254781178|r 82 VDSFIGFSIEKVRIIGNVETPEADIIHCLDLNTSTSLIFFDAIKIQKQLLALPWIAHAEIRRLYPDTMEIRLTERHPYAI 161 (304) Q Consensus 82 ~~~~~gF~I~~I~V~Gn~~ls~~eIl~~l~i~~~~sl~~ld~~~i~~~L~~~pwIk~a~V~r~~P~tL~I~I~Er~PvA~ 161 (304) ..++.+|-|++|.|+|++++++..++..+.+..|..+=.-+.+++-++|-+....+++.+.+.- ++|.+.|+|+-.+.- T Consensus 18 ~~~a~~~vv~~I~v~G~~~~~~~~~~~~l~~~~G~~~s~~~i~~~ik~L~~tg~F~~v~~~~~~-~~Lvv~V~e~p~I~~ 96 (766) T COG4775 18 AVAAEAFVVKDIRVEGLQRVSSGTALSYLPVRVGDTVSDEDIDEAIKALYATGYFEDVRVEREG-GTLVVTVKERPVINS 96 (766) T ss_pred HHHCCCEEEEEEEEECCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEECC-CEEEEEEECCCCEEE T ss_conf 1103362787899965542451236641573557513677889999999716897613788709-879999962772878 Q ss_pred EEECC Q ss_conf 99579 Q gi|254781178|r 162 WQNNS 166 (304) Q Consensus 162 ~~~~g 166 (304) +...| T Consensus 97 v~~~G 101 (766) T COG4775 97 VVFSG 101 (766) T ss_pred EEEEC T ss_conf 99868 No 7 >PRK11067 outer membrane protein assembly factor YaeT; Provisional Probab=97.96 E-value=0.00014 Score=49.01 Aligned_cols=81 Identities=15% Similarity=0.152 Sum_probs=72.5 Q ss_pred HCCEEEEEEEEECCCCCCHHHHHHHHCCCCCCCEEEECHHHHHHHHHCCCCEEEEEEEEECCCEEEEEEEEEEEEEEEEE Q ss_conf 10812548999776548988999980888898438875999999987197622799998289819999993037999995 Q gi|254781178|r 85 FIGFSIEKVRIIGNVETPEADIIHCLDLNTSTSLIFFDAIKIQKQLLALPWIAHAEIRRLYPDTMEIRLTERHPYAIWQN 164 (304) Q Consensus 85 ~~gF~I~~I~V~Gn~~ls~~eIl~~l~i~~~~sl~~ld~~~i~~~L~~~pwIk~a~V~r~~P~tL~I~I~Er~PvA~~~~ 164 (304) +-+|.|++|+|+||++++++.|+..+++..|+.+=.-+.++.-++|-+..+-++|+|.+. -++|.|.|+|+-.++.+.. T Consensus 21 a~~~~I~~I~v~G~~ri~~~ti~~~l~~~~G~~~~~~~i~~~ik~L~~tg~F~dV~v~~~-g~~L~~~V~E~P~I~~i~~ 99 (801) T PRK11067 21 AEGFVVKDIHFEGLQRVAVGAALLSMPVRVGDTVNDEDISNTIRALFATGNFEDVRVLRD-GDTLLVQVKERPTIASITF 99 (801) T ss_pred CCCEEEEEEEEECCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEEE-CCEEEEEEEECCEEEEEEE T ss_conf 587299799998973769889996469899992798999999999985899524899971-9999999995773899999 Q ss_pred CC Q ss_conf 79 Q gi|254781178|r 165 NS 166 (304) Q Consensus 165 ~g 166 (304) .| T Consensus 100 ~G 101 (801) T PRK11067 100 SG 101 (801) T ss_pred EC T ss_conf 88 No 8 >TIGR03303 OM_YaeT outer membrane protein assembly complex, YaeT protein. Members of this protein family are the YaeT protein of the YaeT/YfiO complex for assembling proteins into the outer membrane of Gram-negative bacteria. This protein is similar in sequence and function to a non-essential paralog, YtfM, that is also in the Omp85 family. Members of this family typically have five tandem copies of the surface antigen variable number repeat (pfam07244), followed by an outer membrane beta-barrel domain (pfam01103), while the YtfM family typically has a single pfam07244 repeat. Probab=97.65 E-value=0.00053 Score=45.40 Aligned_cols=78 Identities=15% Similarity=0.207 Sum_probs=66.2 Q ss_pred EEEEEEEEECCCCCCHHHHHHHHCCCCCCCEEEECHHHHHHHHHCCCCEEEEEEEEECCCEEEEEEEEEEEEEEEEECC Q ss_conf 1254899977654898899998088889843887599999998719762279999828981999999303799999579 Q gi|254781178|r 88 FSIEKVRIIGNVETPEADIIHCLDLNTSTSLIFFDAIKIQKQLLALPWIAHAEIRRLYPDTMEIRLTERHPYAIWQNNS 166 (304) Q Consensus 88 F~I~~I~V~Gn~~ls~~eIl~~l~i~~~~sl~~ld~~~i~~~L~~~pwIk~a~V~r~~P~tL~I~I~Er~PvA~~~~~g 166 (304) |-|++|+|.||+.++++.|+..+++..|+.+=.-+.++.-++|-+..+.++++|... -+.|.|.|+|+..+.-+...| T Consensus 1 ~~I~~I~v~G~~~i~~~~i~~~~~~~~G~~~~~~~i~~~ik~L~~tg~F~dv~v~~~-~~~l~i~v~E~p~i~~i~~~G 78 (741) T TIGR03303 1 FVVKDIRVEGLQRVSAGTVLNYLPVKVGDTISDEKIDEAIKALYATGYFEDVKIERE-GGVLVIKVKERPIINSIVFSG 78 (741) T ss_pred CEEEEEEEECCEECCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEE-CCEEEEEEEECCEEEEEEEEC T ss_conf 989899999971659899986369999984599999999999996799005899972-999999999676399999999 No 9 >pfam06898 YqfD Putative stage IV sporulation protein YqfD. This family consists of several putative bacterial stage IV sporulation (SpoIV) proteins. YqfD of Bacillus subtilis is known to be essential for efficient sporulation although its exact function is unknown. Probab=96.93 E-value=0.037 Score=33.67 Aligned_cols=147 Identities=18% Similarity=0.223 Sum_probs=96.0 Q ss_pred HHHHHHHHHHHH------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCEEEEEEEEEC Q ss_conf 599999999999------------99988713577999999999999998873156089999997421081254899977 Q gi|254781178|r 30 GLEEMRNFLNFC------------VFLEKVLPSYCGVILAIFFFAIVGIYGASIGGHTRKVIDIVDSFIGFSIEKVRIIG 97 (304) Q Consensus 30 ~~~~~~~~~~~~------------~~~~~~~p~~~g~~~~~~~~~~~~~~g~~~gg~~~~~~~~~~~~~gF~I~~I~V~G 97 (304) +....|+..|=. -|+=.++-+|.|.++-++++++... .++ --|=.|.|+| T Consensus 54 d~~~lr~iarkt~~rv~i~~r~GlpF~~~r~~kR~~~~~G~~~f~~~l~-~lS-----------------~fVW~I~i~G 115 (383) T pfam06898 54 DYKKLRQLVKKTKCKVKFIERKGLPFLFFRLRKRRGIVLGILLFVILLY-FLS-----------------NFIWSIDIKT 115 (383) T ss_pred HHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHH-HHH-----------------HEEEEEEEEC T ss_conf 9899999999819689999504717899998887999999999999999-784-----------------0699999986 Q ss_pred CCCCCHHHHHH---HHCCCCCCCEEEECHHHHHHHHH-CCCCEEEEEEEEECCCEEEEEEEEEE-EEEEEEEC-CEEEEE Q ss_conf 65489889999---80888898438875999999987-19762279999828981999999303-79999957-927997 Q gi|254781178|r 98 NVETPEADIIH---CLDLNTSTSLIFFDAIKIQKQLL-ALPWIAHAEIRRLYPDTMEIRLTERH-PYAIWQNN-SALYLI 171 (304) Q Consensus 98 n~~ls~~eIl~---~l~i~~~~sl~~ld~~~i~~~L~-~~pwIk~a~V~r~~P~tL~I~I~Er~-PvA~~~~~-g~~~~I 171 (304) |...++++|++ ..|+..|..-+.+|.+.+++.|. ++|-|.=|+|+.. -.++.|+|.|+. |-.....+ -..-+- T Consensus 116 ~~~~t~~~i~~~L~~~gi~~G~~k~~id~~~ie~~l~~~~~~i~Wv~v~i~-Gt~l~v~v~Ek~~p~~~~~~~~p~~iVA 194 (383) T pfam06898 116 AKNIAEHEIRQQLNEIGVKSGVKKFSLDVYKLERELTDNNNNIMWIGVRVR-GTTLKVKVVEKQEPPEELENGEPRNIVA 194 (383) T ss_pred CCCCCHHHHHHHHHHCCCCCCEECCCCCHHHHHHHHHHHCCCEEEEEEEEE-EEEEEEEEEECCCCCCCCCCCCCCCEEE T ss_conf 998998999999998699778072778999999999864898299999998-3699999996578975444589866589 Q ss_pred ECCCC----------CCCCCCCCCCCCCCCCCCC Q ss_conf 05641----------0321000112454311377 Q gi|254781178|r 172 DNNGY----------VITAFNHVRFAYLPILIGE 195 (304) Q Consensus 172 d~~G~----------Vi~~~~~~~~~~lPli~G~ 195 (304) ..+|. ......+.-...-.++.|. T Consensus 195 ~k~GvI~~i~v~~G~p~Vk~GD~VkkGqiLVsG~ 228 (383) T pfam06898 195 KKDGIITRMYVTKGTAVVKVGDVVKKGDILVSGQ 228 (383) T ss_pred CCCCEEEEEEECCCEEEECCCCEECCCCEEEEEE T ss_conf 9997999999647717765899877898999635 No 10 >PRK11067 outer membrane protein assembly factor YaeT; Provisional Probab=95.91 E-value=0.02 Score=35.29 Aligned_cols=89 Identities=13% Similarity=0.109 Sum_probs=57.6 Q ss_pred CCEEEEEEEEECCCCCCHHHHHHHHCCCCCCCEEEECHHHHHHHHHCCCCEEEEEEEEEC----CCE--EEEEEEEEEEE Q ss_conf 081254899977654898899998088889843887599999998719762279999828----981--99999930379 Q gi|254781178|r 86 IGFSIEKVRIIGNVETPEADIIHCLDLNTSTSLIFFDAIKIQKQLLALPWIAHAEIRRLY----PDT--MEIRLTERHPY 159 (304) Q Consensus 86 ~gF~I~~I~V~Gn~~ls~~eIl~~l~i~~~~sl~~ld~~~i~~~L~~~pwIk~a~V~r~~----P~t--L~I~I~Er~Pv 159 (304) .-+.|++|.|+||..+.+.-|+..+.+..|..+=.-.+++-+++|..+...++|.+.-.- |+. |.++|+| +|. T Consensus 345 ~~~yV~~I~I~GN~~T~D~VIRRel~~~eGd~~~~~~i~~S~~rL~~lg~F~~V~~~~~~~~~~~~~vdl~~~V~E-~~t 423 (801) T PRK11067 345 NRFYVRKIRFEGNDTSKDSVLRREMRQMEGAWLGSDLVEQGKERLNRLGFFETVDTDTQRVPGSPDQVDVVYKVKE-RNT 423 (801) T ss_pred CEEEEEEEEEECCCCCHHHHHHHEECCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEEEEE-CCC T ss_conf 6279998674277061347510100036765338889999999998558742034334358999873899999898-678 Q ss_pred EEEEECCEEEEEECCCCC Q ss_conf 999957927997056410 Q gi|254781178|r 160 AIWQNNSALYLIDNNGYV 177 (304) Q Consensus 160 A~~~~~g~~~~Id~~G~V 177 (304) +-...+.++. ..+|.. T Consensus 424 G~~~~g~Gys--s~~g~~ 439 (801) T PRK11067 424 GSFNFGIGYG--TESGVS 439 (801) T ss_pred CEEEEEEEEE--CCCCCE T ss_conf 8799998886--278828 No 11 >TIGR03303 OM_YaeT outer membrane protein assembly complex, YaeT protein. Members of this protein family are the YaeT protein of the YaeT/YfiO complex for assembling proteins into the outer membrane of Gram-negative bacteria. This protein is similar in sequence and function to a non-essential paralog, YtfM, that is also in the Omp85 family. Members of this family typically have five tandem copies of the surface antigen variable number repeat (pfam07244), followed by an outer membrane beta-barrel domain (pfam01103), while the YtfM family typically has a single pfam07244 repeat. Probab=95.37 E-value=0.098 Score=30.94 Aligned_cols=74 Identities=19% Similarity=0.154 Sum_probs=44.9 Q ss_pred EEEEEEEECCCCCCHHHHHHHH-CCCCCCCEEEECHHHHHHHHH----CCCCE-EEEEEE--EECCCE--EEEEEEEEEE Q ss_conf 2548999776548988999980-888898438875999999987----19762-279999--828981--9999993037 Q gi|254781178|r 89 SIEKVRIIGNVETPEADIIHCL-DLNTSTSLIFFDAIKIQKQLL----ALPWI-AHAEIR--RLYPDT--MEIRLTERHP 158 (304) Q Consensus 89 ~I~~I~V~Gn~~ls~~eIl~~l-~i~~~~sl~~ld~~~i~~~L~----~~pwI-k~a~V~--r~~P~t--L~I~I~Er~P 158 (304) .|.+|.++||+.++++++.+.+ ++..+..+=.-..++..++|. +..+- .+|+++ ..-++. +.++|.|-++ T Consensus 70 ~i~~i~~~G~~~~~~~~l~~~l~~lk~g~~~~~~~~~~~~~~I~~~Y~~~Gy~~~~V~~~~~~~~~~~v~v~f~I~Eg~~ 149 (741) T TIGR03303 70 IINSIVFSGNKEIKKEQLKKALVGIKKGEIFDRAKLEKDEKALKEFYRSRGKYNAEVEAKVTPLPRNRVDLEFNIKEGKK 149 (741) T ss_pred EEEEEEEECCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCEEEEEEEEECCCE T ss_conf 39999999985589899998874557897039999999999999999966952049999999889987999999953885 Q ss_pred EEEE Q ss_conf 9999 Q gi|254781178|r 159 YAIW 162 (304) Q Consensus 159 vA~~ 162 (304) +-+. T Consensus 150 ~~I~ 153 (741) T TIGR03303 150 AKIK 153 (741) T ss_pred EEEE T ss_conf 8999 No 12 >pfam08479 POTRA_2 POTRA domain, ShlB-type. The POTRA domain (for polypeptide-transport-associated domain) is found towards the N-terminus of ShlB family proteins (pfam03865). ShlB is important in the secretion and activation of the haemolysin ShlA. It has been postulated that the POTRA domain has a chaperone-like function over ShlA; it may fold back into the C-terminal beta-barrel channel. Probab=94.08 E-value=0.18 Score=29.22 Aligned_cols=68 Identities=19% Similarity=0.266 Sum_probs=54.9 Q ss_pred EEEEEEEEECCCCCCHHHHHHHHCCCCCCCEEEECHHHHHHHHH----CCCCEE-EEEEEEE--CCCEEEEEEEE Q ss_conf 12548999776548988999980888898438875999999987----197622-7999982--89819999993 Q gi|254781178|r 88 FSIEKVRIIGNVETPEADIIHCLDLNTSTSLIFFDAIKIQKQLL----ALPWIA-HAEIRRL--YPDTMEIRLTE 155 (304) Q Consensus 88 F~I~~I~V~Gn~~ls~~eIl~~l~i~~~~sl~~ld~~~i~~~L~----~~pwIk-~a~V~r~--~P~tL~I~I~E 155 (304) |.|++|.++||..++++++.+...-..|+.+-.=++.++.+.+. +..++- .|.+-.+ -.++|.|.|.| T Consensus 1 F~v~~i~~~G~~~~~~~~l~~~~~~~~g~~l~~~~L~~~~~~i~~~Y~~~GY~~srv~lP~Q~i~~G~l~i~VvE 75 (76) T pfam08479 1 FPIKGIELEGNTLLPPADLAALLQPYLGRCLGLADLNQLAKALTNAYIERGYVTSRVYLPPQDLSDGVLRLTVLE 75 (76) T ss_pred CCEEEEEEECCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCEEEEEEEC T ss_conf 907899998982579999999878750983699999999999999999889879999817850679889999968 No 13 >pfam07244 Surf_Ag_VNR Surface antigen variable number repeat. This family is found primarily in bacterial surface antigens, normally as variable number repeats at the N-terminus. The C-terminus of these proteins is normally represented by pfam01103. The alignment centres on a -GY- or -GF- motif. Some members of this family are found in the mitochondria. It is predicted to have a mixed alpha/beta secondary structure. Probab=93.87 E-value=0.26 Score=28.27 Aligned_cols=69 Identities=19% Similarity=0.269 Sum_probs=46.4 Q ss_pred EEEEEEEEECCCC-CCHHHHHHHHCCCCCCCEEEECHHHHHHHHH----CCCC-EEEEEEEEEC-CC--EEEEEEEEE Q ss_conf 1254899977654-8988999980888898438875999999987----1976-2279999828-98--199999930 Q gi|254781178|r 88 FSIEKVRIIGNVE-TPEADIIHCLDLNTSTSLIFFDAIKIQKQLL----ALPW-IAHAEIRRLY-PD--TMEIRLTER 156 (304) Q Consensus 88 F~I~~I~V~Gn~~-ls~~eIl~~l~i~~~~sl~~ld~~~i~~~L~----~~pw-Ik~a~V~r~~-P~--tL~I~I~Er 156 (304) +.|.+|.++||.. +++.++.+.+.+..|..+=.-..++..++|. +..+ -.++++.... |+ .|.+.|.|. T Consensus 1 ~~i~~I~~~Gn~~~~~~~~l~~~~~~~~G~~~~~~~i~~~~~~l~~~y~~~Gy~~~~V~~~~~~~~~~v~l~~~V~Eg 78 (78) T pfam07244 1 YKIKDINIEGNLKRTKDEVLRRELKLKEGDVFNREKLEEDIERLYDYYGALGYAFADVKVEPEPDPKTVDLTFNVDEG 78 (78) T ss_pred CEEEEEEEEECCCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEECCCCEEEEEEEEECC T ss_conf 979799999488668999999986358876316999997999999999867976544331771389989999998269 No 14 >COG4775 Outer membrane protein/protective antigen OMA87 [Cell envelope biogenesis, outer membrane] Probab=92.83 E-value=0.4 Score=27.09 Aligned_cols=67 Identities=12% Similarity=0.128 Sum_probs=44.9 Q ss_pred EEEEEEEECCCCCCHHHHHHHHCCCCCCCEEEECHHHHHHHHHCC----CCEE-EEEEEE-ECC-CEEEEEEEE Q ss_conf 254899977654898899998088889843887599999998719----7622-799998-289-819999993 Q gi|254781178|r 89 SIEKVRIIGNVETPEADIIHCLDLNTSTSLIFFDAIKIQKQLLAL----PWIA-HAEIRR-LYP-DTMEIRLTE 155 (304) Q Consensus 89 ~I~~I~V~Gn~~ls~~eIl~~l~i~~~~sl~~ld~~~i~~~L~~~----pwIk-~a~V~r-~~P-~tL~I~I~E 155 (304) .|.+|+++||..+.++++.+.+.+..+..+=.-.+++..++|++. +... +++.+. ..| +.+.|.+.- T Consensus 93 ~I~~v~~~Gn~~i~~~~L~~~l~~~~g~~~~~~~l~~~~~~l~e~Y~~~G~~~~~v~~~~~~~~~~~v~v~f~i 166 (766) T COG4775 93 VINSVVFSGNKSIKDDALKKNLKIKPGEVLNRAKLEADVEALKEAYLSRGYYSVKVTYKVTPLPRNRVDVVFVI 166 (766) T ss_pred CEEEEEEECCCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCEECCEEEEEEEECCCCEEEEEEEE T ss_conf 28789986871377589887536477873457899989999999887468304346798787799707999993 No 15 >COG2831 FhaC Hemolysin activation/secretion protein [Intracellular trafficking and secretion] Probab=89.12 E-value=0.93 Score=24.71 Aligned_cols=73 Identities=15% Similarity=0.198 Sum_probs=46.3 Q ss_pred CCEEEEEEEEECCCCCCHHHHHHHHCCCCCCCEEEECHHHHHHHHH----CCCCEEEEEEEE---ECCCEEEEEEEEEEE Q ss_conf 0812548999776548988999980888898438875999999987----197622799998---289819999993037 Q gi|254781178|r 86 IGFSIEKVRIIGNVETPEADIIHCLDLNTSTSLIFFDAIKIQKQLL----ALPWIAHAEIRR---LYPDTMEIRLTERHP 158 (304) Q Consensus 86 ~gF~I~~I~V~Gn~~ls~~eIl~~l~i~~~~sl~~ld~~~i~~~L~----~~pwIk~a~V~r---~~P~tL~I~I~Er~P 158 (304) .-|.|++|+++|+..+...++.+.+.-..|+.+-.-|+..+..+|. ...||-+.-+-. .--++|.|+|.|=+- T Consensus 71 ~~f~I~~i~l~g~~~~~~~~l~~~~~~~~g~~l~~~~l~~l~~~lt~~y~~~GYvtsra~lp~Q~i~~G~l~l~VveG~i 150 (554) T COG2831 71 PCFPINRIELEGNTLLDALELQAILAPYLGRCLGLADLNQLASALTNLYIDRGYVTTRALLPPQDLKSGTLRLQVVEGRI 150 (554) T ss_pred CEEEEEEEEEECCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCEEEEEEEEEEE T ss_conf 50788789981775578589998667751886678999999999999998669478899726874557879999952185 No 16 >TIGR02876 spore_yqfD sporulation protein YqfD; InterPro: IPR010690 This family consists of several putative bacterial stage IV sporulation (SpoIV) proteins. YqfD of Bacillus subtilis (P54469 from SWISSPROT) is known to be essential for efficient sporulation although its exact function is unknown .. Probab=86.37 E-value=2.6 Score=21.85 Aligned_cols=117 Identities=21% Similarity=0.331 Sum_probs=86.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCEEEEEEE--EECCCCCCHHHHHH---HHCCCCCC Q ss_conf 99988713577999999999999998873156089999997421081254899--97765489889999---80888898 Q gi|254781178|r 42 VFLEKVLPSYCGVILAIFFFAIVGIYGASIGGHTRKVIDIVDSFIGFSIEKVR--IIGNVETPEADIIH---CLDLNTST 116 (304) Q Consensus 42 ~~~~~~~p~~~g~~~~~~~~~~~~~~g~~~gg~~~~~~~~~~~~~gF~I~~I~--V~Gn~~ls~~eIl~---~l~i~~~~ 116 (304) -|+-+++=|+.|.++=++++.+. .|- +|+ -|=+|. |.|...+++.+|.+ ..|+..|. T Consensus 75 pF~~~r~~kr~~~~~G~L~f~~~-~~~--lS~---------------fiW~IdvrI~g~~~~~~~~i~~~L~~~G~k~Gv 136 (406) T TIGR02876 75 PFLIKRLRKRPGILIGILLFLAI-VYL--LST---------------FIWKIDVRITGVKGETEYEIRKELKEMGIKPGV 136 (406) T ss_pred HHHHHHHHHCCCHHHHHHHHHHH-HHH--HCC---------------CEEEEEEEEECCCCCCHHHHHHHHHHCCCCCCC T ss_conf 47888988550076689999999-998--617---------------558999997279997789999999855875683 Q ss_pred CEEEEC-HHHHHHHHH-CCCCEEEEEEEEECCCEEEEEEEEEEEE--EEEEECC--EEEEEECCCCC Q ss_conf 438875-999999987-1976227999982898199999930379--9999579--27997056410 Q gi|254781178|r 117 SLIFFD-AIKIQKQLL-ALPWIAHAEIRRLYPDTMEIRLTERHPY--AIWQNNS--ALYLIDNNGYV 177 (304) Q Consensus 117 sl~~ld-~~~i~~~L~-~~pwIk~a~V~r~~P~tL~I~I~Er~Pv--A~~~~~g--~~~~Id~~G~V 177 (304) +-+.+| ++.++++|. .+|=|-=|.|+. .-.+|.|.++||+-. --...+. ..-+--.+|.| T Consensus 137 ~k~~~Ds~~~~~~~L~~~~~ei~Wv~v~~-~Gt~l~~kvvEk~~~ki~~~~~~~ep~niVAkk~G~i 202 (406) T TIGR02876 137 KKFSIDSVYKLERKLLDRVPEIMWVGVRV-RGTKLVIKVVEKQEPKIKEVKKKAEPRNIVAKKDGVI 202 (406) T ss_pred CCCCCCCHHHHHHHHHHCCCCEEEEEEEE-ECCEEEEEEEECCCCCCCCCCCCCCCCCEEEECCEEE T ss_conf 20368898899999984599489999998-5309999998213332445687677723266334088 No 17 >COG0729 Outer membrane protein [Cell envelope biogenesis, outer membrane] Probab=82.88 E-value=3.7 Score=20.92 Aligned_cols=78 Identities=12% Similarity=0.143 Sum_probs=65.0 Q ss_pred HHCCEEEEEEEEECCCCCCHHHHHHHHCCCCCCCEEEECHHHHHHHHHCCCCEEEEEEEEECC----C---EEEEEEEEE Q ss_conf 210812548999776548988999980888898438875999999987197622799998289----8---199999930 Q gi|254781178|r 84 SFIGFSIEKVRIIGNVETPEADIIHCLDLNTSTSLIFFDAIKIQKQLLALPWIAHAEIRRLYP----D---TMEIRLTER 156 (304) Q Consensus 84 ~~~gF~I~~I~V~Gn~~ls~~eIl~~l~i~~~~sl~~ld~~~i~~~L~~~pwIk~a~V~r~~P----~---tL~I~I~Er 156 (304) +..-...-.|.|+|.+.+.++.+........|.....=+..+++++|.+.+|-.++.|.-..- + -+.|.+.|| T Consensus 196 sG~~y~~g~v~~~g~~r~~~~~~~~~~~~~~G~~Y~~~~l~~~~~rl~~~~~F~sv~v~~~~~~~~~~~~~pv~v~~~~r 275 (594) T COG0729 196 SGRRYRFGPVTVEGSQRIDEEYLQNLVPFKYGLPYDPEDLAELNQRLRQTGYFSSVVVQPADKQAGPDGLLPVKVSVSER 275 (594) T ss_pred CCCEEEECCEEECCCEECCHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCEECEEEEECCC T ss_conf 54169767679716603587787630366678867877999999987615850348981275555767603539885246 Q ss_pred EEEEE Q ss_conf 37999 Q gi|254781178|r 157 HPYAI 161 (304) Q Consensus 157 ~PvA~ 161 (304) +|-.+ T Consensus 276 k~~~~ 280 (594) T COG0729 276 KENTF 280 (594) T ss_pred CCCEE T ss_conf 55337 No 18 >PRK12585 putative monovalent cation/H+ antiporter subunit G; Reviewed Probab=79.58 E-value=1 Score=24.45 Aligned_cols=100 Identities=16% Similarity=0.138 Sum_probs=64.9 Q ss_pred HHHHEEEEECHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHH------------ Q ss_conf 0110021000156888865599999999999-999887135779999999999999988731560899------------ Q gi|254781178|r 11 IDRRLCLVIGMSLSLCCVLGLEEMRNFLNFC-VFLEKVLPSYCGVILAIFFFAIVGIYGASIGGHTRK------------ 77 (304) Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~p~~~g~~~~~~~~~~~~~~g~~~gg~~~~------------ 77 (304) |---+|+++|.-+++...+|+ +||- +|..-|-|--+++.-+..++.|+-+|....+|+++. T Consensus 7 IiiaillliGa~F~ligAIGl------lRlPDvytRlHAaTKa~TLGV~liLig~~iYF~~~~~~fs~rlLL~IlFiFLT 80 (197) T PRK12585 7 IIISIMILIGGLLSILAAIGV------IRLPDVYTRTHAAGISNTFGVSLLLFATVGYFFHSGEGFNARVLLAVLFIFLT 80 (197) T ss_pred HHHHHHHHHHHHHHHHHHHHH------HCCCHHHHHCCCCCCCHHHHHHHHHHHHHHHEEECCCCCHHHHHHHHHHHHHC T ss_conf 999999999999999998774------23661887743664201688999999887740333775319999999999981 Q ss_pred --HHH----HHHHHCCEEEEEEEEECC-CCCCHHHHHHHHCCCCCCC Q ss_conf --999----974210812548999776-5489889999808888984 Q gi|254781178|r 78 --VID----IVDSFIGFSIEKVRIIGN-VETPEADIIHCLDLNTSTS 117 (304) Q Consensus 78 --~~~----~~~~~~gF~I~~I~V~Gn-~~ls~~eIl~~l~i~~~~s 117 (304) |.. -++...|-.+ .|.|... ..+.+++|.+...+-..+. T Consensus 81 aPVaaHlI~RAAYr~GVPL-a~~~~~~~~~~~kddi~~~~~~~~~q~ 126 (197) T PRK12585 81 TPVASHLINRAAYDTGVPL-AIRIRDQLRSVKKDDIKKKKSLIIRQE 126 (197) T ss_pred CHHHHHHHHHHHHHCCCCE-EEHHHHHHHHHHHHHHHHHHHHHCCHH T ss_conf 1589999999997538962-210356664520300013423201098 No 19 >TIGR01734 D-ala-DACP-lig D-alanine-activating enzyme; InterPro: IPR010072 This entry represents the enzyme (also called D-alanine-D-alanyl carrier protein ligase) which activates D-alanine as an adenylate via the reaction D-ala + ATP to D-ala-AMP + PPi, and further catalyses the condensation of the amino acid adenylate with the D-alanyl carrier protein (D-ala-ACP). The D-alanine is then further transferred to teichoic acid in the biosynthesis of lipoteichoic acid (LTA) and wall teichoic acid (WTA) in Gram-positive bacteria, both polysacchatides .; GO: 0016208 AMP binding, 0047473 D-alanine-poly(phosphoribitol) ligase activity, 0019350 teichoic acid biosynthetic process. Probab=65.40 E-value=11 Score=18.01 Aligned_cols=101 Identities=17% Similarity=0.265 Sum_probs=61.6 Q ss_pred EEEEEEEEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCC--------CHHHHHHHH---HHHHHHH-HHCCCEEEE Q ss_conf 9930379999957927997056410321000112454311377--------667889999---8766401-310330389 Q gi|254781178|r 153 LTERHPYAIWQNNSALYLIDNNGYVITAFNHVRFAYLPILIGE--------NIYKAVRSF---EVLSNIA-GITKFVKAY 220 (304) Q Consensus 153 I~Er~PvA~~~~~g~~~~Id~~G~Vi~~~~~~~~~~lPli~G~--------~~~~~~~~~---~~L~~~~-~l~~~i~~~ 220 (304) =.+|-|+++.+.|-..+++|++|..+..-..+. .++.|+ +.++-++.| +-...|. .-...+..- T Consensus 319 ~~~~LPiG~~KPD~~~~I~DeeG~~Lp~Ge~GE----ivi~G~SVSkGYL~nPeKT~~aFF~~~g~~aYrTGD~g~i~d~ 394 (513) T TIGR01734 319 QYPRLPIGFAKPDMNLFIVDEEGEKLPEGEKGE----IVIVGPSVSKGYLNNPEKTAEAFFEVDGQRAYRTGDAGTITDG 394 (513) T ss_pred HCCCCCEEEECCCCCEEEEECCCCCCCCCCCCE----EEEECCCCCCCCCCCCHHHHHHHCCCCCCCCEECCCEEEEECC T ss_conf 168974236659985699715787587897312----8997470102357682115898314689410001753588518 Q ss_pred -EEECCCEEEEEEC-CCEEEEECCCCHHHHHHHHHHHHHHH Q ss_conf -9825880589974-98289808511799999999987630 Q gi|254781178|r 221 -NWIAERRWDLHLH-NGIIIKLPEEKFDVAIAKILELQNKY 259 (304) Q Consensus 221 -~~v~~rrWdL~l~-nGi~IkLp~~~~~~~L~rL~~l~~~~ 259 (304) -+.-.+|.|...+ ||-+|-| +|++..|++..-+.... T Consensus 395 G~lfyqGR~DFQiKlnGYRiEL--Edie~nl~~~~yi~~Av 433 (513) T TIGR01734 395 GMLFYQGRLDFQIKLNGYRIEL--EDIEANLRQSSYIKSAV 433 (513) T ss_pred CEEEECCCCEEEEECCCCEECH--HHHHHHHHHHHHHHHCE T ss_conf 7889717500554015645137--88977656645324102 No 20 >TIGR00998 8a0101 efflux pump membrane protein; InterPro: IPR005694 emrA and emrB confer resistance to carbonylcyanide m-chlorophenylhydrazone, nalidixic acid, and a number of other toxic compounds. EmrB encodes a highly hydrophobic 56.2-kDa peptide, with 14 potential alpha-helices to span the inner membrane. EmrA encodes a putative 42.7-kDa peptide containing a single hydrophobic domain and a large C-terminal hydrophilic domain.; GO: 0015238 drug transporter activity, 0006855 multidrug transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=60.11 E-value=11 Score=17.86 Aligned_cols=77 Identities=13% Similarity=0.103 Sum_probs=43.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCHHHHH---HHHHHHCCEEEEEEEEECCC-CCCHHHHHHHHCCCCCCCEEEECHHH Q ss_conf 577999999999999998873156089999---99742108125489997765-48988999980888898438875999 Q gi|254781178|r 50 SYCGVILAIFFFAIVGIYGASIGGHTRKVI---DIVDSFIGFSIEKVRIIGNV-ETPEADIIHCLDLNTSTSLIFFDAIK 125 (304) Q Consensus 50 ~~~g~~~~~~~~~~~~~~g~~~gg~~~~~~---~~~~~~~gF~I~~I~V~Gn~-~ls~~eIl~~l~i~~~~sl~~ld~~~ 125 (304) .+.+..++++|+...++||++=-=|..++. ||-.++--..|.-..|+|+. .+..++=-. +..|.=|+.||... T Consensus 2 k~~~~l~vv~Li~~a~a~~~y~~~~~~~~~~T~dAYV~a~~~~v~sF~VsG~v~~v~~~d~d~---Vk~GD~Lv~LD~t~ 78 (379) T TIGR00998 2 KKFLLLLVVLLIVVALAYAIYWFLVLRDYESTDDAYVKANQLQVSSFQVSGSVIEVNVDDTDY---VKQGDVLVRLDPTD 78 (379) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCCCEEEEEEEEEEEEEEEEECCCCC---CEECCEEEEECCCH T ss_conf 306799999999999999999998874011211462127730232465732489997437884---20311566767716 Q ss_pred HHHH Q ss_conf 9999 Q gi|254781178|r 126 IQKQ 129 (304) Q Consensus 126 i~~~ 129 (304) .+-. T Consensus 79 a~~~ 82 (379) T TIGR00998 79 AELA 82 (379) T ss_pred HHHH T ss_conf 8999 No 21 >KOG2602 consensus Probab=59.74 E-value=13 Score=17.36 Aligned_cols=74 Identities=18% Similarity=0.164 Sum_probs=55.7 Q ss_pred EEEEEEEECCCCCCHHHHHHHHCCCC-CCCEEE--ECHHHHHHHHHCCCCEEEEEEEEEC-------CCEEEEEEEEEEE Q ss_conf 25489997765489889999808888-984388--7599999998719762279999828-------9819999993037 Q gi|254781178|r 89 SIEKVRIIGNVETPEADIIHCLDLNT-STSLIF--FDAIKIQKQLLALPWIAHAEIRRLY-------PDTMEIRLTERHP 158 (304) Q Consensus 89 ~I~~I~V~Gn~~ls~~eIl~~l~i~~-~~sl~~--ld~~~i~~~L~~~pwIk~a~V~r~~-------P~tL~I~I~Er~P 158 (304) +|.+|.|+|+..+.++-|.+....-. .+++.. ...+.+..+|.+..--++|++.-.- |+++.|.+.-.+| T Consensus 37 ~v~~V~v~Gl~~Tkdd~I~k~v~~~~ka~nl~el~~~s~~a~~nL~~l~iF~~v~~~iD~~~g~~~~p~~~~Vt~~v~E~ 116 (457) T KOG2602 37 RVDHVIVSGLERTKDDFIMKEVDLVFKAKNLQELLLASHEAASNLRALGIFDSVNILIDTKEGSDALPGGLVVTFLVTEP 116 (457) T ss_pred EEEEEEECCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCEEEEEEEEEC T ss_conf 13589851310153278999866666300599999878999988988452100116602455787789851689996525 Q ss_pred EEEE Q ss_conf 9999 Q gi|254781178|r 159 YAIW 162 (304) Q Consensus 159 vA~~ 162 (304) --+. T Consensus 117 kr~t 120 (457) T KOG2602 117 KRLT 120 (457) T ss_pred CHHE T ss_conf 2010 No 22 >PRK12586 putative monovalent cation/H+ antiporter subunit G; Reviewed Probab=57.52 E-value=13 Score=17.32 Aligned_cols=57 Identities=19% Similarity=0.320 Sum_probs=44.2 Q ss_pred HEEEEECHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH Q ss_conf 0021000156888865599999999999-99988713577999999999999998873156089 Q gi|254781178|r 14 RLCLVIGMSLSLCCVLGLEEMRNFLNFC-VFLEKVLPSYCGVILAIFFFAIVGIYGASIGGHTR 76 (304) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~p~~~g~~~~~~~~~~~~~~g~~~gg~~~ 76 (304) -.|+++|--+.+-..+|+ +||- +|..-|.|--.++.-+..++.|+++|.....|+.. T Consensus 13 ~ill~~G~~f~l~~aiGl------~R~pD~y~R~HA~tK~~TLG~~lil~g~~l~f~~~~~~~s 70 (145) T PRK12586 13 AIMILLGSIIALISAIGI------VKFQDVFLRSHAATKSSTLSVLLTLIGVLIYFIVNTGFFS 70 (145) T ss_pred HHHHHHHHHHHHHHHHHH------HHCCCHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCCCHH T ss_conf 999999999999998898------7368689884102122589999999999999972022278 No 23 >COG2362 DppA D-aminopeptidase [Amino acid transport and metabolism] Probab=54.20 E-value=16 Score=16.78 Aligned_cols=60 Identities=22% Similarity=0.390 Sum_probs=43.8 Q ss_pred HHCCEEEEEEEEECCCCCCHHHHHHHHCCCCCCCEEEEC--HHHHHHHHHCCCCEEEEEEEEEC Q ss_conf 210812548999776548988999980888898438875--99999998719762279999828 Q gi|254781178|r 84 SFIGFSIEKVRIIGNVETPEADIIHCLDLNTSTSLIFFD--AIKIQKQLLALPWIAHAEIRRLY 145 (304) Q Consensus 84 ~~~gF~I~~I~V~Gn~~ls~~eIl~~l~i~~~~sl~~ld--~~~i~~~L~~~pwIk~a~V~r~~ 145 (304) .-.|| |+++.|.| ..+++--+.+.+.-.+|-+...+. --..++..+-.|||..|.+++-. T Consensus 117 t~~G~-v~~v~Ing-~e~gE~gLNa~laG~ygVPV~~~sGDd~~~~E~~~l~P~i~tv~~K~~~ 178 (274) T COG2362 117 TISGF-VRRVRING-VEVGEYGLNAYLAGEYGVPVALVSGDDVAAREARELTPWIETVAVKEAS 178 (274) T ss_pred EEECE-EEEEEECC-EEHHHHHHHHHHHHCCCCCEEEEECCHHHHHHHHHHCCCEEEEEECCCC T ss_conf 02021-47999998-9757878889974125886799607588765556518870588730245 No 24 >KOG2333 consensus Probab=51.01 E-value=13 Score=17.36 Aligned_cols=40 Identities=28% Similarity=0.326 Sum_probs=18.7 Q ss_pred CCEEEECHHHHHHHHHCCCCEEEEEEEEECCCEEEEE--EEEEEEEE Q ss_conf 8438875999999987197622799998289819999--99303799 Q gi|254781178|r 116 TSLIFFDAIKIQKQLLALPWIAHAEIRRLYPDTMEIR--LTERHPYA 160 (304) Q Consensus 116 ~sl~~ld~~~i~~~L~~~pwIk~a~V~r~~P~tL~I~--I~Er~PvA 160 (304) -|-+.=.+..+...+..... |....|=|+.|. ++|..|+| T Consensus 367 GsALl~rp~rl~~~l~~m~~-----vs~~iPiTVKiRTG~keg~~~a 408 (614) T KOG2333 367 GSALLNRPARLIRILRAMNA-----VSGDIPITVKIRTGTKEGHPVA 408 (614) T ss_pred CCHHHCCCHHHHHHHHHHHH-----HCCCCCEEEEEECCCCCCCHHH T ss_conf 50542381899999999987-----6057874899840443676138 No 25 >pfam04951 Peptidase_M55 D-aminopeptidase. Bacillus subtilis DppA is a binuclear zinc-dependent, D-specific aminopeptidase. The structure reveals that DppA is a new example of a 'self-compartmentalising protease', a family of proteolytic complexes. Proteasomes are the most extensively studied representatives of this family. The DppA enzyme is composed of identical 30 kDa subunits organized in a decamer with 52 point-group symmetry. A 20 A wide channel runs through the complex, giving access to a central chamber holding the active sites. The structure shows DppA to be a prototype of a new family of metalloaminopeptidases characterized by the SXDXEG key sequence. The only known substrates are D-ala-D-ala and D-ala-gly-gly. Probab=45.35 E-value=23 Score=15.91 Aligned_cols=59 Identities=14% Similarity=0.159 Sum_probs=42.9 Q ss_pred CCEEEEEEEEECCCCCCHHHHHHHHCCCCCCCEEEEC--HHHHHHHHHCCCCEEEEEEEEEC Q ss_conf 0812548999776548988999980888898438875--99999998719762279999828 Q gi|254781178|r 86 IGFSIEKVRIIGNVETPEADIIHCLDLNTSTSLIFFD--AIKIQKQLLALPWIAHAEIRRLY 145 (304) Q Consensus 86 ~gF~I~~I~V~Gn~~ls~~eIl~~l~i~~~~sl~~ld--~~~i~~~L~~~pwIk~a~V~r~~ 145 (304) .|+.+.+|.|.| ..+++--|.+.+.-..|-+..++. ....++--+-.||+..+.|++-. T Consensus 119 s~~~~~~v~iNG-~~~gE~~lNa~~Ag~~gVPV~lVsGD~~~~~e~~~~~P~~~tv~vK~~~ 179 (265) T pfam04951 119 SGSVFRRVWING-EEVGEAGLNAAVAGEYGVPVVLVSGDDVLAEEAKPLFPDAETVVVKEAI 179 (265) T ss_pred CCCCEEEEEECC-EECCHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCCCEEEEEEEEE T ss_conf 467569999999-9936888999998751997799836599999999878995799972100 No 26 >cd04405 RhoGAP_BRCC3-like RhoGAP_BRCC3-like: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of BRCC3-like proteins. This subgroup also contains two groups of closely related proteins, BRCC3 and DEPDC7, which both contain a C-terminal RhoGAP-like domain and an N-terminal DEP (Disheveled, Egl-10, and Pleckstrin) domain. The function(s) of BRCC3 and DEPDC7 are unknown. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=43.41 E-value=24 Score=15.72 Aligned_cols=23 Identities=17% Similarity=0.517 Sum_probs=10.6 Q ss_pred CCEEEE-EEEEECCCEEEEEEEEE Q ss_conf 762279-99982898199999930 Q gi|254781178|r 134 PWIAHA-EIRRLYPDTMEIRLTER 156 (304) Q Consensus 134 pwIk~a-~V~r~~P~tL~I~I~Er 156 (304) +|+.+| ..=--||+.+.+++..- T Consensus 70 ~Wl~sAm~CL~~~Pd~~~~d~s~~ 93 (235) T cd04405 70 HWLLSAMDCLANWPDQLVVDVSRP 93 (235) T ss_pred HHHHHHHHHHHHCCCCCEECCCCC T ss_conf 999999999986884230136766 No 27 >PHA02047 phage lambda Rz1-like protein Probab=34.48 E-value=31 Score=15.02 Aligned_cols=87 Identities=14% Similarity=0.117 Sum_probs=48.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH----HH---HHCCEEEEE-----EEEECCCCCCHHHHHHHHCCCCC Q ss_conf 1357799999999999999887315608999999----74---210812548-----99977654898899998088889 Q gi|254781178|r 48 LPSYCGVILAIFFFAIVGIYGASIGGHTRKVIDI----VD---SFIGFSIEK-----VRIIGNVETPEADIIHCLDLNTS 115 (304) Q Consensus 48 ~p~~~g~~~~~~~~~~~~~~g~~~gg~~~~~~~~----~~---~~~gF~I~~-----I~V~Gn~~ls~~eIl~~l~i~~~ 115 (304) ||+-...+++++...+.|.||++.+-.+-..++- ++ .++.-++-+ ..|.-....++.|+..+++-..+ T Consensus 1 M~~Ti~~IL~L~V~~~~~~Y~~~~s~~~~~~A~N~~~~~~a~~EA~E~R~~~lQ~~VL~V~~R~~t~r~ev~~AL~~n~~ 80 (109) T PHA02047 1 MPRTIVAILVLAVVALGASYGFVQSYRALGIAQNEIKRQTARAEALEVRYATLQRHVLEVAARTNTQRQEVDRALDQNRP 80 (109) T ss_pred CCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 97205789999999961204389999987366889999888889999999999988888886400369999989862987 Q ss_pred CCEEEE-CHHHHHHHHHCCCCE Q ss_conf 843887-599999998719762 Q gi|254781178|r 116 TSLIFF-DAIKIQKQLLALPWI 136 (304) Q Consensus 116 ~sl~~l-d~~~i~~~L~~~pwI 136 (304) .-.- -+.++.+.|-+.|-. T Consensus 81 --WaD~PVPpAVvdsLCkr~~~ 100 (109) T PHA02047 81 --WADRPVPAAVVDSLCNRPGA 100 (109) T ss_pred --CCCCCCCHHHHHHHHCCCCC T ss_conf --45589988899998569886 No 28 >PRK11644 sensory histidine kinase UhpB; Provisional Probab=34.09 E-value=34 Score=14.79 Aligned_cols=23 Identities=26% Similarity=0.597 Sum_probs=16.3 Q ss_pred HHHHHHHH-------HHHHHHHHHHHHHHH Q ss_conf 88886559-------999999999999988 Q gi|254781178|r 24 SLCCVLGL-------EEMRNFLNFCVFLEK 46 (304) Q Consensus 24 ~~~~~~~~-------~~~~~~~~~~~~~~~ 46 (304) .|+|.+++ +||+||..||+|+-. T Consensus 187 ~llf~l~i~~q~~l~~el~~~~~~~l~iPi 216 (497) T PRK11644 187 LLLFVISIWLQLGLPDELSRFTPFCLAIPI 216 (497) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 999999999996172889999999999989 No 29 >PRK05182 DNA-directed RNA polymerase subunit alpha; Provisional Probab=33.49 E-value=9.6 Score=18.25 Aligned_cols=33 Identities=27% Similarity=0.565 Sum_probs=24.6 Q ss_pred HHHHHCCCCHHHHHHHHHHHCCEEEEEEEEECCCC Q ss_conf 98873156089999997421081254899977654 Q gi|254781178|r 66 IYGASIGGHTRKVIDIVDSFIGFSIEKVRIIGNVE 100 (304) Q Consensus 66 ~~g~~~gg~~~~~~~~~~~~~gF~I~~I~V~Gn~~ 100 (304) +||.-+|-..+.+. +.+..|++|..|+|.|..| T Consensus 29 G~G~TiGNaLRRvL--Lssi~G~aIt~vkI~Gv~H 61 (306) T PRK05182 29 GFGTTLGNALRRVL--LSSLPGAAVTSVKIDGVLH 61 (306) T ss_pred CCHHHHHHHHHHHH--HHCCCCCEEEEEEECCCCC T ss_conf 74315789999999--8569851899999889643 No 30 >PRK12671 putative monovalent cation/H+ antiporter subunit G; Reviewed Probab=33.40 E-value=35 Score=14.72 Aligned_cols=58 Identities=17% Similarity=0.181 Sum_probs=45.8 Q ss_pred HEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH Q ss_conf 002100015688886559999999999999988713577999999999999998873156089 Q gi|254781178|r 14 RLCLVIGMSLSLCCVLGLEEMRNFLNFCVFLEKVLPSYCGVILAIFFFAIVGIYGASIGGHTR 76 (304) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~~~~~~~~~~~~~g~~~gg~~~ 76 (304) -.|+++|--+.+---+|+-.|.. +|..-|-|--.++.-+..++.++.+|.-..+|+.. T Consensus 16 ~~ll~~G~~f~liga~GllRlpD-----~y~RlHA~tka~TlG~g~iL~as~l~f~~~~~~~~ 73 (120) T PRK12671 16 AFFLVLGAGLTLIGTIGLVRFKS-----FYERLHAPTLGTSWGAGGILIASILYFSILQSRPV 73 (120) T ss_pred HHHHHHHHHHHHHHHHHHHHCCC-----HHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHH T ss_conf 99999999999999888872761-----99880000576889999999999999998324428 No 31 >TIGR02159 PA_CoA_Oxy4 phenylacetate-CoA oxygenase, PaaJ subunit; InterPro: IPR011883 Phenylacetate-CoA oxygenase is comprised of a five gene complex responsible for the hydroxylation of phenylacetate-CoA (PA-CoA) as the second catabolic step in phenylacetic acid (PA) degradation , . Although the exact function of this enzyme has not been determined, it has been shown to be required for phenylacetic acid degradation and has been proposed to function in a multicomponent oxygenase acting on phenylacetate-CoA . . Probab=33.33 E-value=35 Score=14.72 Aligned_cols=23 Identities=26% Similarity=0.512 Sum_probs=19.9 Q ss_pred HHHHHHHHHCCCCEEEEEEEEEC Q ss_conf 99999998719762279999828 Q gi|254781178|r 123 AIKIQKQLLALPWIAHAEIRRLY 145 (304) Q Consensus 123 ~~~i~~~L~~~pwIk~a~V~r~~ 145 (304) .++|++++..++|+..++|+-.+ T Consensus 46 ~~~i~~A~~~~GW~~~VeV~~~L 68 (152) T TIGR02159 46 RQDIRDALRALGWVEEVEVKTSL 68 (152) T ss_pred HHHHHHHHHHCCCCCCEEEEEEE T ss_conf 99999999747897511588771 No 32 >PRK13361 molybdenum cofactor biosynthesis protein A; Provisional Probab=32.70 E-value=35 Score=14.65 Aligned_cols=12 Identities=42% Similarity=0.490 Sum_probs=6.4 Q ss_pred HHHHHHHHHHHH Q ss_conf 655999999999 Q gi|254781178|r 28 VLGLEEMRNFLN 39 (304) Q Consensus 28 ~~~~~~~~~~~~ 39 (304) +|..||+-++.+ T Consensus 44 ~Ls~eEi~~l~~ 55 (329) T PRK13361 44 VLTLEELAWLAQ 55 (329) T ss_pred CCCHHHHHHHHH T ss_conf 689999999999 No 33 >pfam10081 DUF2319 Predicted membrane protein (DUF2319). Members of this family of hypothetical bacterial proteins have no known function. Probab=31.97 E-value=36 Score=14.57 Aligned_cols=53 Identities=13% Similarity=0.179 Sum_probs=35.6 Q ss_pred HEEEEECHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 0021000156888865599999999999-9998871357799999999999999 Q gi|254781178|r 14 RLCLVIGMSLSLCCVLGLEEMRNFLNFC-VFLEKVLPSYCGVILAIFFFAIVGI 66 (304) Q Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~p~~~g~~~~~~~~~~~~~ 66 (304) -++.+++.-...+|+.--+-++...||. -.+.+++|++...++.++..+.... T Consensus 113 ~~~~~~a~~~~~~ll~~gr~~~~~~r~l~~~l~R~~P~~va~~v~~~~~~~l~~ 166 (534) T pfam10081 113 LRGVLVALLVFGALVALGRLLRRLIRFLARRLQRVLPRRVANVVGVLVVVLLLW 166 (534) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999999999998611327689999999999999999 No 34 >TIGR02283 MltB_2 lytic murein transglycosylase; InterPro: IPR011970 Proteins of this entries are closely related to the MltB family lytic murein transglycosylases described by IPR011757 from INTERPRO and are likewise all proteobacterial, although that family and this one form clearly distinct clades. Several species have one member of each family. Many proteins of this entry(unlike the MltB family) contain an additional C-terminal domain, a putative peptidoglycan binding domain (IPR002477 from INTERPRO), not included in region described by this model. Many sequences appear to contain N-terminal lipoprotein attachment sites, as does Esherichia coli MltB in IPR011757 from INTERPRO.. Probab=29.93 E-value=12 Score=17.62 Aligned_cols=165 Identities=15% Similarity=0.194 Sum_probs=104.0 Q ss_pred HHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHCCC--CHHHHHHHHHHHCCEEEEEEEEEC-C-CCCCHH Q ss_conf 99999999999999887--13577999999999999998873156--089999997421081254899977-6-548988 Q gi|254781178|r 31 LEEMRNFLNFCVFLEKV--LPSYCGVILAIFFFAIVGIYGASIGG--HTRKVIDIVDSFIGFSIEKVRIIG-N-VETPEA 104 (304) Q Consensus 31 ~~~~~~~~~~~~~~~~~--~p~~~g~~~~~~~~~~~~~~g~~~gg--~~~~~~~~~~~~~gF~I~~I~V~G-n-~~ls~~ 104 (304) -.-+++.-.+..=+|++ +|++ ..++||+.=-.||-+.|- |+.+++.++++.| =.| . ..+=+. T Consensus 75 ~~~~~~~~~~L~~iE~~yGVp~~----~l~AiWGmEs~fG~~~g~~~~M~dv~~sLATLA--------y~~~RR~~~F~~ 142 (313) T TIGR02283 75 RALLQRYAALLARIEKRYGVPAE----ILLAIWGMESDFGAYQGKDDVMFDVIRSLATLA--------YDGARRKDFFRT 142 (313) T ss_pred HHHHHHHHHHHHHHHHHCCCCCC----HHEEHHHHHHHHHHHCCCCCCHHHHHHHHHHHH--------HCCCCCHHHHHH T ss_conf 99999999999998752388810----410001134131031167510133899999986--------246655567668 Q ss_pred HHHHHHCC--------------------------------------CCCCCEEEECHHH-------HHHHH---HCCCCE Q ss_conf 99998088--------------------------------------8898438875999-------99998---719762 Q gi|254781178|r 105 DIIHCLDL--------------------------------------NTSTSLIFFDAIK-------IQKQL---LALPWI 136 (304) Q Consensus 105 eIl~~l~i--------------------------------------~~~~sl~~ld~~~-------i~~~L---~~~pwI 136 (304) |++++|.+ +....||.=.++. +.... ...||. T Consensus 143 EL~aALkI~q~Gd~~~~~~~GSWAGA~GQTQFlPssy~~yaVD~DGDGr~Diw~s~pDaLAStAnyL~~~ggW~~g~~Wg 222 (313) T TIGR02283 143 ELIAALKILQRGDLTRAALKGSWAGAMGQTQFLPSSYLKYAVDFDGDGRRDIWNSVPDALASTANYLVRGGGWKRGRPWG 222 (313) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 99999997605778846501356765379875736555410234698844578886227899999998416865788841 Q ss_pred EEEEEEEECCCEEEEEEEEEEEEEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCHHHHH--HHHHHHHHHH Q ss_conf 27999982898199999930379999957927997056410321000112454311377667889--9998766401 Q gi|254781178|r 137 AHAEIRRLYPDTMEIRLTERHPYAIWQNNSALYLIDNNGYVITAFNHVRFAYLPILIGENIYKAV--RSFEVLSNIA 211 (304) Q Consensus 137 k~a~V~r~~P~tL~I~I~Er~PvA~~~~~g~~~~Id~~G~Vi~~~~~~~~~~lPli~G~~~~~~~--~~~~~L~~~~ 211 (304) .+|.+-..+|-.+.- -+.++|++-|+.-| +.-.+|.-++.........|.+..|..-..+. +.|.++..|- T Consensus 223 ~EV~lP~~~d~~~~g-~~~~~~~~~W~~~G---v~~~~G~~~p~~~~~~~A~L~~P~G~~GPAFL~~~NF~~~~~wN 295 (313) T TIGR02283 223 YEVRLPEGFDYELAG-SQQKKPIAEWQALG---VTRADGRPLPAAEAAAEASLLLPAGRKGPAFLVTPNFRVIKEWN 295 (313) T ss_pred EEEEECCCCCCEECC-CCCCCCHHHHHHCC---CCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCHHHHHHHHHH T ss_conf 578716898700078-85547779998568---21689863775000187256546789885001305799999975 No 35 >pfam01883 DUF59 Domain of unknown function DUF59. This family includes prokaryotic proteins of unknown function. The family also includes PhaH from Pseudomonas putida. PhaH forms a complex with PhaF, PhaG, and PhaI, which hydroxylates phenylacetic acid to 2-hydroxyphenylacetic acid. So members of this family may all be components of ring hydroxylating complexes. Probab=29.27 E-value=40 Score=14.29 Aligned_cols=26 Identities=27% Similarity=0.448 Sum_probs=15.0 Q ss_pred CEEEECHHHHHHHHHCCCCEEEEEEE Q ss_conf 43887599999998719762279999 Q gi|254781178|r 117 SLIFFDAIKIQKQLLALPWIAHAEIR 142 (304) Q Consensus 117 sl~~ld~~~i~~~L~~~pwIk~a~V~ 142 (304) ++...=.+++++.+++.|||++++|. T Consensus 50 P~~~~i~~~i~~~l~~v~gv~~V~V~ 75 (76) T pfam01883 50 PLADLIALDVREALLELPGVEDVEVE 75 (76) T ss_pred CCHHHHHHHHHHHHHCCCCCEEEEEE T ss_conf 83789999999999839994078997 No 36 >KOG3090 consensus Probab=27.95 E-value=34 Score=14.78 Aligned_cols=23 Identities=26% Similarity=0.475 Sum_probs=19.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 13577999999999999998873 Q gi|254781178|r 48 LPSYCGVILAIFFFAIVGIYGAS 70 (304) Q Consensus 48 ~p~~~g~~~~~~~~~~~~~~g~~ 70 (304) =|+.+|+.+-+++.+|.++||+. T Consensus 13 g~~~~~~~l~~~~~~G~~~y~v~ 35 (290) T KOG3090 13 GPRGAGAGLKLLLIGGLGAYGVT 35 (290) T ss_pred CCCHHHHHHHHHHHHCHHHHEEC T ss_conf 96227788889987035554001 No 37 >KOG3922 consensus Probab=26.64 E-value=45 Score=13.99 Aligned_cols=21 Identities=14% Similarity=0.183 Sum_probs=9.3 Q ss_pred HHHHHHCCCCCCCEEEECHHH Q ss_conf 999980888898438875999 Q gi|254781178|r 105 DIIHCLDLNTSTSLIFFDAIK 125 (304) Q Consensus 105 eIl~~l~i~~~~sl~~ld~~~ 125 (304) .|...+.-..+-+++.+|+.+ T Consensus 96 niaydL~ekn~F~vlh~nvtk 116 (361) T KOG3922 96 NIAYDLSEKNGFHVLHINVTK 116 (361) T ss_pred HHHHHHHHCCCCEEEEEECCC T ss_conf 999998740483499854124 No 38 >TIGR02027 rpoA DNA-directed RNA polymerase, alpha subunit; InterPro: IPR011773 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. This family consists of the bacterial (and chloroplast) DNA-directed RNA polymerase alpha subunit, encoded by the rpoA gene. The RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. The amino terminal domain is involved in dimerizing and assembling the other RNA polymerase subunits into a transcriptionally active enzyme. The carboxy-terminal domain contains determinants for interaction with DNA and with transcriptional activator proteins , .; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription DNA-dependent. Probab=25.25 E-value=21 Score=16.16 Aligned_cols=125 Identities=12% Similarity=0.152 Sum_probs=68.4 Q ss_pred HHHHHCCCCHHHHHHHHHHHCCEEEEEEEEECCC----CCCHHHHHHHHCCCCCCCEEEECHHHHH------HHHHCC-C Q ss_conf 9887315608999999742108125489997765----4898899998088889843887599999------998719-7 Q gi|254781178|r 66 IYGASIGGHTRKVIDIVDSFIGFSIEKVRIIGNV----ETPEADIIHCLDLNTSTSLIFFDAIKIQ------KQLLAL-P 134 (304) Q Consensus 66 ~~g~~~gg~~~~~~~~~~~~~gF~I~~I~V~Gn~----~ls~~eIl~~l~i~~~~sl~~ld~~~i~------~~L~~~-p 134 (304) +||.-+|=..+.+. +.|.+|++|+.|+|.|++ +.-.-|=-..-|+.....=+-+|+++++ -+.... + T Consensus 12 GfG~TlGNALRRvL--LSSipG~Ait~~~I~G~~~D~~~~V~HEF~~i~GV~EDV~eIiLNlK~l~~k~~~l~~~~~~~~ 89 (324) T TIGR02027 12 GFGITLGNALRRVL--LSSIPGAAITAIKIDGFDKDVAKVVLHEFSTIEGVKEDVTEIILNLKKLRLKIKELVKSEGDEE 89 (324) T ss_pred CCCEECHHHHHHHH--HHCCCCCEEEEEEEECCCCCEEEEEEECCCCCCCEEHHHHHHHHHCCCEEEEEEHHHCCCCCCC T ss_conf 83213105789998--6337454047889815776603478202033787020278998750403343200010358865 Q ss_pred C-EE---EEEEEEECCCEEEEEEEEEEEEEEEEECCEEEEEECCCCCCCCCCC-CC-CCCCCCCCCCCHHH Q ss_conf 6-22---7999982898199999930379999957927997056410321000-11-24543113776678 Q gi|254781178|r 135 W-IA---HAEIRRLYPDTMEIRLTERHPYAIWQNNSALYLIDNNGYVITAFNH-VR-FAYLPILIGENIYK 199 (304) Q Consensus 135 w-Ik---~a~V~r~~P~tL~I~I~Er~PvA~~~~~g~~~~Id~~G~Vi~~~~~-~~-~~~lPli~G~~~~~ 199 (304) - +. .++++..-|+.+.-.=.+ -..++.-+++.+=.+..-..+ .. .-.+.+-.|.|... T Consensus 90 ~~~~~WP~~~~~~~Gp~~~~A~Di~-------~~~~~~EvvNpdl~Iatl~~~Na~l~~e~~v~~GRGYVp 153 (324) T TIGR02027 90 RQVTKWPIMTLSKKGPGVVTAGDIK-------AQPGGVEVVNPDLVIATLTEDNAKLEIELRVERGRGYVP 153 (324) T ss_pred CCEEECCEEEEEEECCCEEEEEEEE-------CCCCCEEEECCCCEEEEECCCCEEEEEEEEEEECCCEEE T ss_conf 5213256059998657314522001-------379964687888526674079727999999973561220 No 39 >TIGR02350 prok_dnaK chaperone protein DnaK; InterPro: IPR012725 Molecular chaperones are a diverse family of proteins that function to protect proteins in the intracellular milieu from irreversible aggregation during synthesis and in times of cellular stress. The bacterial molecular chaperone DnaK is an enzyme that couples cycles of ATP binding, hydrolysis, and ADP release by an N-terminal ATP-hydrolysing domain to cycles of sequestration and release of unfolded proteins by a C-terminal substrate binding domain. In prokaryotes, the grpE protein is a co-chaperone for DnaK, and acts as a nucleotide exchange factor, stimulating the rate of ADP release 5000-fold . DnaK is itself a weak ATPase; ATP hydrolysis by DnaK is stimulated by its interaction with another co-chaperone, DnaJ. Thus the co-chaperones DnaJ and GrpE are capable of tightly regulating the nucleotide-bound and substrate-bound state of DnaK in ways that are necessary for the normal housekeeping functions and stress-related functions of the DnaK molecular chaperone cycle. Members of this family are the chaperone DnaK, of the DnaK-DnaJ-GrpE chaperone system. All members of the seed alignment were taken from completely sequenced bacterial or archaeal genomes and (except for the Mycoplasma sequence) found clustered with other genes of this systems. This entry excludes DnaK homologues that are not DnaK itself, such as the heat shock cognate protein HscA (IPR010236 from INTERPRO). However, it is not designed to distinguish among DnaK paralogs in eukaryotes. Note that a number of DnaK genes have shadow ORFs in the same reverse (relative to dnaK) reading frame, a few of which have been assigned glutamate dehydrogenase activity. The significance of this observation is unclear; the lengths of such shadow ORFs are highly variable as if the presumptive protein product is not conserved.; GO: 0005524 ATP binding, 0051082 unfolded protein binding, 0006457 protein folding. Probab=24.75 E-value=37 Score=14.56 Aligned_cols=131 Identities=15% Similarity=0.195 Sum_probs=70.9 Q ss_pred EEEEEEEEECCEEEEEECCCCCCCCCCCCC---------------------CCCCCCCCCCCHHHHH-HHH------HHH Q ss_conf 037999995792799705641032100011---------------------2454311377667889-999------876 Q gi|254781178|r 156 RHPYAIWQNNSALYLIDNNGYVITAFNHVR---------------------FAYLPILIGENIYKAV-RSF------EVL 207 (304) Q Consensus 156 r~PvA~~~~~g~~~~Id~~G~Vi~~~~~~~---------------------~~~lPli~G~~~~~~~-~~~------~~L 207 (304) +.||.+...++...-+.-+|+-+.|..-.. .-..|=.+ .++++.+ ++- +.+ T Consensus 84 ~~py~vv~~~~~~~~~~~~~K~y~P~eISA~iL~klk~~AE~yLGe~v~~AVITVPAYF-nDaQR~ATKdAG~IAGL~V~ 162 (598) T TIGR02350 84 RVPYKVVEGDNGDVRVEVRGKEYTPQEISAMILQKLKKDAEAYLGEKVTEAVITVPAYF-NDAQRQATKDAGKIAGLEVL 162 (598) T ss_pred HCCEEEEECCCCEEEEEECCCEECHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCC-CHHHHHHHHHHHHCCCCEEE T ss_conf 19847885488648999618610717799999999999999862853022577711567-71356889863100583137 Q ss_pred ----------HHHH--HHCCCEEEEEEECCCEEEE---EECCCE-EE-------EECCCCHHHHHHHHHH--HHHHHCHH Q ss_conf ----------6401--3103303899825880589---974982-89-------8085117999999999--87630100 Q gi|254781178|r 208 ----------SNIA--GITKFVKAYNWIAERRWDL---HLHNGI-II-------KLPEEKFDVAIAKILE--LQNKYQIL 262 (304) Q Consensus 208 ----------~~~~--~l~~~i~~~~~v~~rrWdL---~l~nGi-~I-------kLp~~~~~~~L~rL~~--l~~~~k~~ 262 (304) .+|. ...++...++=.+++-+|+ .+.+|+ .| +||-+|+++++-.+.. |.++ T Consensus 163 RIINEPTAAALAYGldk~~~~~i~VfDLGGGTFDvSILe~gdG~feV~sT~GDt~LGGdDFD~~Iidwl~~~fk~~---- 238 (598) T TIGR02350 163 RIINEPTAAALAYGLDKKKDEKILVFDLGGGTFDVSILEIGDGVFEVKSTAGDTHLGGDDFDERIIDWLADEFKKE---- 238 (598) T ss_pred EEECCHHHHHHHHCCCCCCCCEEEEEECCCCCEEEEEEEECCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHH---- T ss_conf 7417713898860557667867999975888324688875686899985168887875205799999999998876---- Q ss_pred CCCEEEEEEECCCEEEEEECCCCHHHHHHHHH Q ss_conf 08833896112983699976842155899988 Q gi|254781178|r 263 DRDISVIDMRLPDRLSVRLTTGSFIDRRDIVD 294 (304) Q Consensus 263 ~r~I~~IDLR~pdri~Vrl~~~a~~~r~~~~~ 294 (304) -.-|||+-+....-||.++|-+++.+... T Consensus 239 ---~~GiDL~~D~~ALQRL~eAAEkAKieLS~ 267 (598) T TIGR02350 239 ---EQGIDLSKDKMALQRLKEAAEKAKIELSS 267 (598) T ss_pred ---HCCCCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf ---33888112478988889999886585256 No 40 >pfam06350 HSL_N Hormone-sensitive lipase (HSL) N-terminus. This family consists of several mammalian hormone-sensitive lipase (HSL) proteins (EC:3.1.1.-). Hormone-sensitive lipase, a key enzyme in fatty acid mobilisation, overall energy homeostasis, and possibly steroidogenesis, is acutely controlled through reversible phosphorylation by catecholamines and insulin. Probab=23.47 E-value=51 Score=13.62 Aligned_cols=75 Identities=19% Similarity=0.138 Sum_probs=37.1 Q ss_pred HHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCEEEEEEEEECCCCCCHHHH Q ss_conf 9999999999-----99988713577999999999999998873156089999997421081254899977654898899 Q gi|254781178|r 32 EEMRNFLNFC-----VFLEKVLPSYCGVILAIFFFAIVGIYGASIGGHTRKVIDIVDSFIGFSIEKVRIIGNVETPEADI 106 (304) Q Consensus 32 ~~~~~~~~~~-----~~~~~~~p~~~g~~~~~~~~~~~~~~g~~~gg~~~~~~~~~~~~~gF~I~~I~V~Gn~~ls~~eI 106 (304) +.||++|+|+ .|=|.|--..-|.+.++.-+..+|-|-+..--+.+.+++ | T Consensus 175 ~Sir~~Lk~I~i~MAsfsE~Y~~~~~~i~~a~sSl~~sgKY~ldPElRarriv~-------------------------i 229 (315) T pfam06350 175 PSIRPFLQTISIGMVSFGENYKSQEGGISKAASSLFTSGKYAIDPELRGREFER-------------------------I 229 (315) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHH-------------------------H T ss_conf 889999999999999987888855980999999998447312599999999999-------------------------8 Q ss_pred HHHHCCCCCCCEEEECHHHHHHHHH Q ss_conf 9980888898438875999999987 Q gi|254781178|r 107 IHCLDLNTSTSLIFFDAIKIQKQLL 131 (304) Q Consensus 107 l~~l~i~~~~sl~~ld~~~i~~~L~ 131 (304) -+.++++--++.|.|.-.++-..+- T Consensus 230 tqnadi~FcKaFW~L~E~el~~~~p 254 (315) T pfam06350 230 SQNLDVDFWKAFWNITESEMLSSLP 254 (315) T ss_pred HCCCCHHHHHHHHCCCHHHHHHHHH T ss_conf 5005859999987513558988608 No 41 >pfam10309 DUF2414 Protein of unknown function (DUF2414). This is a family of proteins conserved from fungi to mammals. One mouse member is referred to as ELG protein but this is not a homologue of human ELG protein. The function is not known. Probab=23.24 E-value=52 Score=13.59 Aligned_cols=49 Identities=18% Similarity=0.230 Sum_probs=33.1 Q ss_pred EEEEEEEECCCCCCHHHHHHHHCCCCCCCEEEECHHHHHHHHHCCCCEEEEEEEEECCCE Q ss_conf 254899977654898899998088889843887599999998719762279999828981 Q gi|254781178|r 89 SIEKVRIIGNVETPEADIIHCLDLNTSTSLIFFDAIKIQKQLLALPWIAHAEIRRLYPDT 148 (304) Q Consensus 89 ~I~~I~V~Gn~~ls~~eIl~~l~i~~~~sl~~ld~~~i~~~L~~~pwIk~a~V~r~~P~t 148 (304) +-+.|.|.|-.++|.++|..=..-..+..-. ...-||.+-++.-.|++. T Consensus 4 rpeavhirGVD~lsT~Dik~y~~~y~~~~~p-----------~~IEWIdDtS~Nvvf~~~ 52 (62) T pfam10309 4 RPEAVHIRGVDELSTDDIKAYVKEYFDEESP-----------FRIEWIDDTSANIVFKSE 52 (62) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHCCCCCC-----------CEEEEECCCCEEEEECCH T ss_conf 4231799727879879999999997565897-----------048986598379997787 No 42 >TIGR02941 Sigma_B RNA polymerase sigma-B factor; InterPro: IPR014288 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , . This entry represents the sigma factor, sigmaB. It is restricted to certain species within the order Bacillales including Staphylococcus aureus , Listeria monocytogenes and Bacillus cereus (strain ATCC 14579 / DSM 31) .. Probab=22.99 E-value=53 Score=13.56 Aligned_cols=51 Identities=16% Similarity=0.273 Sum_probs=25.2 Q ss_pred CCCHHHHHHHHHHHCCEEEEEEEEECCCCCCHHHHHHHHCCCCCCCEEEEC Q ss_conf 560899999974210812548999776548988999980888898438875 Q gi|254781178|r 72 GGHTRKVIDIVDSFIGFSIEKVRIIGNVETPEADIIHCLDLNTSTSLIFFD 122 (304) Q Consensus 72 gg~~~~~~~~~~~~~gF~I~~I~V~Gn~~ls~~eIl~~l~i~~~~sl~~ld 122 (304) ||....+++.+|..--=+=+=++|--.-.+|++||+.+..-...-+.+.+| T Consensus 116 GpkIkk~~dELT~~lqrSP~i~EIA~~lg~SEEEVLE~Me~g~sY~alSvD 166 (256) T TIGR02941 116 GPKIKKAVDELTDELQRSPKIAEIADRLGVSEEEVLEIMEMGQSYKALSVD 166 (256) T ss_pred CCCCHHHHHHHHHHHCCCCCHHHHHHHCCCCHHHHHHHHHCCCCCCEEECC T ss_conf 772014556644442148706546412177488898776147774333225 No 43 >KOG0290 consensus Probab=22.03 E-value=43 Score=14.10 Aligned_cols=41 Identities=12% Similarity=0.259 Sum_probs=25.1 Q ss_pred CHHHHHHHHHH------HHHHHCHHCCCEEEEEEECCCEEEEEECCC Q ss_conf 17999999999------876301000883389611298369997684 Q gi|254781178|r 244 KFDVAIAKILE------LQNKYQILDRDISVIDMRLPDRLSVRLTTG 284 (304) Q Consensus 244 ~~~~~L~rL~~------l~~~~k~~~r~I~~IDLR~pdri~Vrl~~~ 284 (304) .....|.||.- +...--.-+.++..+|+|.|-.++.+|.-. T Consensus 240 ~~~~pLlRL~wnkqDpnymATf~~dS~~V~iLDiR~P~tpva~L~~H 286 (364) T KOG0290 240 SPSTPLLRLSWNKQDPNYMATFAMDSNKVVILDIRVPCTPVARLRNH 286 (364) T ss_pred CCCCCCEEECCCCCCCHHHHHHHCCCCEEEEEEECCCCCHHEHHHCC T ss_conf 88873034046867816886624378607999702887530443247 No 44 >COG5102 SFT2 Membrane protein involved in ER to Golgi transport [Intracellular trafficking and secretion] Probab=21.96 E-value=55 Score=13.43 Aligned_cols=49 Identities=22% Similarity=0.381 Sum_probs=32.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 68888655999999999999998871357799999999999999887315 Q gi|254781178|r 23 LSLCCVLGLEEMRNFLNFCVFLEKVLPSYCGVILAIFFFAIVGIYGASIG 72 (304) Q Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~g~~~~~~~~~~~~~~g~~~g 72 (304) .|. |.||-...--|+-||-=+-+--||..+..-++.-+..+-.+|+..| T Consensus 71 F~~-ClLGa~ac~a~~~fmfpVl~lkPrkFiLlwTmgslLfvl~Fg~l~G 119 (201) T COG5102 71 FSA-CLLGAGACSAFLYFMFPVLRLKPRKFILLWTMGSLLFVLMFGFLLG 119 (201) T ss_pred HHH-HHHHHHHHHHHHHHHHHHHHCCCCCEEEEHHHHHHHHHHHHHHHHH T ss_conf 999-9855689999999988998417653153114888999999999983 No 45 >PRK03598 hypothetical protein; Provisional Probab=21.78 E-value=32 Score=14.94 Aligned_cols=71 Identities=13% Similarity=-0.003 Sum_probs=35.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCHHHHHH-----------HHHHHCCEEEEEEEEECCCCCCHHHHHHHHCCCCCCCE Q ss_conf 5779999999999999988731560899999-----------97421081254899977654898899998088889843 Q gi|254781178|r 50 SYCGVILAIFFFAIVGIYGASIGGHTRKVID-----------IVDSFIGFSIEKVRIIGNVETPEADIIHCLDLNTSTSL 118 (304) Q Consensus 50 ~~~g~~~~~~~~~~~~~~g~~~gg~~~~~~~-----------~~~~~~gF~I~~I~V~Gn~~ls~~eIl~~l~i~~~~sl 118 (304) |++...++++.+.+.+++|.+.-.+...... -+++..+=.|++|.|.-+.+ +..|+-| T Consensus 3 k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ve~~~V~vs~~v~G~V~~v~V~eGd~-----------Vk~Gq~L 71 (331) T PRK03598 3 KPVVIGLAVVALVAVGAGGWWWYQSRQDNGLTLYGNVDIRTVNLSFRVGGRLASLAVDEGDA-----------VKAGQVL 71 (331) T ss_pred CHHHHHHHHHHHHHHHHHHHHHEEECCCCCEEEEEEEEEEEEEECCCCCEEEEEEECCCCCE-----------ECCCCEE T ss_conf 33899999999999999976521405788603579999999999132775999998689798-----------7799889 Q ss_pred EEECHHHHHHHHH Q ss_conf 8875999999987 Q gi|254781178|r 119 IFFDAIKIQKQLL 131 (304) Q Consensus 119 ~~ld~~~i~~~L~ 131 (304) +.||...++.++. T Consensus 72 a~LD~~~~~~~l~ 84 (331) T PRK03598 72 GELDHAPYENALM 84 (331) T ss_pred EEECCHHHHHHHH T ss_conf 9988089999999 No 46 >KOG4634 consensus Probab=21.07 E-value=57 Score=13.31 Aligned_cols=26 Identities=19% Similarity=0.338 Sum_probs=19.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999998871357799999999 Q gi|254781178|r 35 RNFLNFCVFLEKVLPSYCGVILAIFF 60 (304) Q Consensus 35 ~~~~~~~~~~~~~~p~~~g~~~~~~~ 60 (304) .|++||+--+..-.-+|+|.+++++- T Consensus 3 qrlfr~s~vlrs~vs~~~gv~a~a~n 28 (105) T KOG4634 3 QRLFRFSSVLRSAVSVHLGVTATAFN 28 (105) T ss_pred HHHHHHHHHHHHHHHHHHCHHHHHHH T ss_conf 88987999999999874010266777 No 47 >smart00662 RPOLD RNA polymerases D. DNA-directed RNA polymerase subunit D and bacterial alpha chain Probab=20.83 E-value=58 Score=13.28 Aligned_cols=33 Identities=24% Similarity=0.476 Sum_probs=26.9 Q ss_pred HHHHHCCEEEEEEEEECCCCCCHHHHHH-HHCCC Q ss_conf 9742108125489997765489889999-80888 Q gi|254781178|r 81 IVDSFIGFSIEKVRIIGNVETPEADIIH-CLDLN 113 (304) Q Consensus 81 ~~~~~~gF~I~~I~V~Gn~~ls~~eIl~-~l~i~ 113 (304) .+++.+|+.|+.|.|.||...-.+|+++ .+|+. T Consensus 21 lLseip~~AI~~V~I~~n~s~l~eevl~~rL~Li 54 (224) T smart00662 21 LLSSVPGMAVTEVEIEGNTSGVQDEVLAHRLGLK 54 (224) T ss_pred HHHCCCCEEEEEEEEEECCCCCHHHHHHHHHCCC T ss_conf 9865954057999998089886467999984683 No 48 >PRK00164 moaA molybdenum cofactor biosynthesis protein A; Reviewed Probab=20.47 E-value=59 Score=13.23 Aligned_cols=16 Identities=25% Similarity=0.326 Sum_probs=9.8 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 8865599999999999 Q gi|254781178|r 26 CCVLGLEEMRNFLNFC 41 (304) Q Consensus 26 ~~~~~~~~~~~~~~~~ 41 (304) -..|..||+.+..+.+ T Consensus 48 ~~~Ls~eEi~~i~~~~ 63 (334) T PRK00164 48 EELLSLEEIERLVRAF 63 (334) T ss_pred HHCCCHHHHHHHHHHH T ss_conf 4229999999999999 No 49 >cd03529 Rieske_NirD Assimilatory nitrite reductase (NirD) family, Rieske domain; Assimilatory nitrate and nitrite reductases convert nitrate through nitrite to ammonium. Members include bacterial and fungal proteins. The bacterial NirD contains a single Rieske domain while fungal proteins have a C-terminal Rieske domain in addition to several other domains. The fungal NirD is involved in nutrient acquisition, functioning at the soil/fungus interface to control nutrient exchange between the fungus and the host plant. The Rieske domain is a [2Fe-2S] cluster binding domain involved in electron transfer. The Rieske [2Fe-2S] cluster is liganded to two histidine and two cysteine residues present in conserved sequences called Rieske motifs. In this family, only a few members contain these residues. Other members may have lost the ability to bind the Rieske [2Fe-2S] cluster. Probab=20.46 E-value=59 Score=13.23 Aligned_cols=40 Identities=20% Similarity=0.346 Sum_probs=28.8 Q ss_pred CEEEEEEEEECCCEEEEEEEEEEEEEEEEE-CCEEEEEECC Q ss_conf 622799998289819999993037999995-7927997056 Q gi|254781178|r 135 WIAHAEIRRLYPDTMEIRLTERHPYAIWQN-NSALYLIDNN 174 (304) Q Consensus 135 wIk~a~V~r~~P~tL~I~I~Er~PvA~~~~-~g~~~~Id~~ 174 (304) |++=+++.-..|+.-..-...-+++|+++. +++++.+++. T Consensus 1 W~~V~~~~dl~~g~~~~v~v~g~~iav~~~~~g~~~A~~~~ 41 (103) T cd03529 1 WQTVCALDDLPPGSGVAALVGDTQIAIFRLPGREVYAVQNM 41 (103) T ss_pred CEEECCHHHCCCCCEEEEEECCEEEEEEEECCCEEEEEECC T ss_conf 92927388889998799998997999999189929999575 No 50 >KOG1665 consensus Probab=20.29 E-value=51 Score=13.66 Aligned_cols=32 Identities=16% Similarity=0.106 Sum_probs=17.7 Q ss_pred ECCEEEEEECCCCCCCCCCCCC-CCCCCCCCCC Q ss_conf 5792799705641032100011-2454311377 Q gi|254781178|r 164 NNSALYLIDNNGYVITAFNHVR-FAYLPILIGE 195 (304) Q Consensus 164 ~~g~~~~Id~~G~Vi~~~~~~~-~~~lPli~G~ 195 (304) ...+.++||++++.|+|.-... .-.+|...+. T Consensus 50 d~kGa~lIDRsp~yFepIlNyLr~Gq~~~~s~i 82 (302) T KOG1665 50 DKKGAVLIDRSPKYFEPILNYLRDGQIPSLSDI 82 (302) T ss_pred CCCCEEEECCCCHHHHHHHHHHHCCCEEECCCC T ss_conf 457408981684326999989853824335996 No 51 >PRK07394 hypothetical protein; Provisional Probab=20.06 E-value=60 Score=13.18 Aligned_cols=53 Identities=9% Similarity=0.120 Sum_probs=36.5 Q ss_pred HHHHHHHHHHCCEEEEEEEEECCCCC------CHHHHHHHHCCCCCCCEEEECHHHHHHHHHCCCC Q ss_conf 99999974210812548999776548------9889999808888984388759999999871976 Q gi|254781178|r 76 RKVIDIVDSFIGFSIEKVRIIGNVET------PEADIIHCLDLNTSTSLIFFDAIKIQKQLLALPW 135 (304) Q Consensus 76 ~~~~~~~~~~~gF~I~~I~V~Gn~~l------s~~eIl~~l~i~~~~sl~~ld~~~i~~~L~~~pw 135 (304) +..+..+++++|.+|- -.||... +..|+++++|++.. .++++++++.+++..| T Consensus 101 ST~aa~v~Aa~G~~Va---KHGnrsvssk~G~~saDvLe~lGv~i~----~l~~e~~~~~l~~~g~ 159 (342) T PRK07394 101 YPLTALLLAAAGQPVV---LHGGDRMPTKYGVPLIELWQGLGVDLT----GLSLEQVQEGFEQTGL 159 (342) T ss_pred HHHHHHHHHHCCCEEE---EECCCCCCCCCCCCHHHHHHHCCCCCC----CCCHHHHHHHHHHCCE T ss_conf 8999999996899599---888998876567427999997599867----7999999999997094 Done!