Query gi|254780532|ref|YP_003064945.1| aminodeoxychorismate lyase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 325 No_of_seqs 217 out of 1862 Neff 8.0 Searched_HMMs 39220 Date Sun May 29 22:46:19 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780532.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK10270 hypothetical protein; 100.0 0 0 719.1 31.9 320 1-324 1-338 (340) 2 COG1559 Aminodeoxychorismate l 100.0 0 0 674.4 30.1 314 4-321 11-341 (342) 3 pfam02618 YceG YceG-like famil 100.0 0 0 668.9 25.0 276 36-316 1-287 (287) 4 TIGR00247 TIGR00247 conserved 100.0 0 0 638.6 24.0 323 1-323 7-369 (373) 5 cd08010 yceG_like proteins sim 100.0 0 0 597.0 18.6 231 78-313 1-245 (245) 6 TIGR02869 spore_SleB spore cor 96.6 0.0023 5.9E-08 41.6 3.8 170 112-304 12-232 (232) 7 pfam02618 YceG YceG-like famil 95.7 0.065 1.7E-06 32.3 7.3 67 35-101 70-142 (287) 8 cd08010 yceG_like proteins sim 95.5 0.04 1E-06 33.7 5.7 68 34-101 27-103 (245) 9 PRK10270 hypothetical protein; 94.5 0.035 8.8E-07 34.1 3.1 61 105-165 39-104 (340) 10 COG1559 Aminodeoxychorismate l 94.0 0.036 9.3E-07 34.0 2.3 66 36-101 116-190 (342) 11 TIGR00247 TIGR00247 conserved 92.6 0.14 3.6E-06 30.3 3.4 63 103-165 47-114 (373) 12 cd03715 RT_ZFREV_like RT_ZFREV 83.2 1.8 4.5E-05 23.2 3.9 72 252-323 112-187 (210) 13 pfam07486 Hydrolase_2 Cell Wal 69.9 8.9 0.00023 18.7 4.6 17 205-221 6-22 (107) 14 PRK03598 hypothetical protein; 69.7 6 0.00015 19.8 3.5 21 1-21 1-22 (331) 15 cd01647 RT_LTR RT_LTR: Reverse 63.6 9.2 0.00023 18.6 3.4 109 206-323 43-154 (177) 16 cd01645 RT_Rtv RT_Rtv: Reverse 62.6 12 0.0003 18.0 3.8 67 254-323 118-191 (213) 17 PRK13481 glycosyltransferase; 58.6 11 0.00027 18.3 3.0 22 183-204 209-230 (269) 18 COG3944 Capsular polysaccharid 57.8 15 0.00038 17.3 5.3 109 1-125 15-139 (226) 19 pfam10566 Glyco_hydro_97 Glyco 56.7 15 0.00039 17.2 3.6 43 1-43 1-44 (643) 20 PRK13759 arylsulfatase; Provis 52.6 12 0.00031 17.8 2.5 15 46-60 80-94 (474) 21 pfam09949 DUF2183 Uncharacteri 51.2 5.8 0.00015 19.9 0.7 18 254-271 7-31 (100) 22 PRK10783 mltD membrane-bound l 50.5 19 0.00049 16.6 6.2 13 212-224 242-254 (449) 23 cd02204 PurL_repeat2 PurL subu 50.3 20 0.0005 16.5 3.8 16 255-270 98-113 (264) 24 TIGR03417 chol_sulfatase choli 47.5 15 0.00038 17.3 2.3 14 46-59 81-94 (500) 25 COG3773 SleB Cell wall hydroly 47.0 3.2 8.2E-05 21.5 -1.2 120 175-308 104-246 (249) 26 TIGR00434 cysH phosophoadenyly 43.6 7.4 0.00019 19.2 0.3 35 113-150 26-60 (226) 27 TIGR02881 spore_V_K stage V sp 43.5 25 0.00063 15.9 3.8 95 92-203 127-223 (261) 28 PRK11198 hypothetical protein; 41.9 26 0.00067 15.7 5.5 23 144-166 94-116 (147) 29 PRK09510 tolA cell envelope in 40.5 28 0.0007 15.6 4.6 51 4-55 14-70 (402) 30 pfam04337 DUF480 Protein of un 39.8 13 0.00034 17.6 1.1 25 186-210 2-26 (148) 31 PRK10833 putative assembly pro 37.6 31 0.00078 15.3 2.7 20 1-20 1-20 (617) 32 COG0046 PurL Phosphoribosylfor 37.2 31 0.00079 15.3 4.4 71 254-324 546-637 (743) 33 KOG4254 consensus 36.9 31 0.0008 15.2 4.1 116 207-325 263-399 (561) 34 smart00540 LEM in nuclear memb 35.9 15 0.00038 17.3 0.8 19 260-278 15-33 (44) 35 PRK13133 consensus 35.3 33 0.00085 15.1 3.1 60 257-316 142-209 (267) 36 COG5294 Uncharacterized protei 35.2 28 0.00072 15.5 2.1 20 1-20 1-20 (113) 37 TIGR01063 gyrA DNA gyrase, A s 34.7 25 0.00065 15.8 1.9 237 36-306 298-592 (864) 38 PRK11239 hypothetical protein; 34.4 18 0.00046 16.7 1.1 70 38-109 48-118 (215) 39 PRK06770 hypothetical protein; 33.7 35 0.0009 14.9 2.7 48 79-127 89-138 (185) 40 PRK11627 hypothetical protein; 33.6 30 0.00077 15.3 2.1 18 1-18 2-19 (192) 41 PRK02090 phosphoadenosine phos 33.1 36 0.00092 14.8 2.7 12 140-151 75-86 (243) 42 pfam10787 YfmQ Uncharacterized 32.7 37 0.00093 14.8 2.6 36 282-317 86-124 (149) 43 pfam01686 Adeno_Penton_B Adeno 32.7 28 0.00071 15.6 1.8 46 106-151 128-196 (454) 44 pfam10828 DUF2570 Protein of u 31.2 39 0.00099 14.6 3.7 24 1-24 1-24 (110) 45 TIGR03028 EpsE polysaccharide 30.3 40 0.001 14.5 3.8 74 42-116 43-130 (239) 46 pfam03020 LEM LEM domain. The 30.0 21 0.00054 16.3 0.8 17 259-275 14-30 (43) 47 pfam06809 NPDC1 Neural prolife 29.3 25 0.00064 15.8 1.1 31 190-225 205-235 (345) 48 TIGR02707 butyr_kinase butyrat 29.1 26 0.00067 15.7 1.2 52 47-103 29-99 (353) 49 KOG3805 consensus 29.0 42 0.0011 14.4 3.6 45 261-305 251-308 (361) 50 COG5398 Heme oxygenase [Inorga 28.6 7.6 0.00019 19.2 -1.7 39 195-233 49-87 (238) 51 TIGR00998 8a0101 efflux pump m 27.5 45 0.0011 14.2 3.1 21 182-202 202-222 (379) 52 KOG0480 consensus 26.6 47 0.0012 14.1 4.1 17 273-289 483-499 (764) 53 TIGR03525 GldK gliding motilit 26.2 44 0.0011 14.3 1.9 17 1-17 1-17 (449) 54 pfam11498 Activator_LAG-3 Tran 26.1 37 0.00093 14.8 1.5 22 104-125 246-267 (476) 55 PRK13116 consensus 25.7 48 0.0012 14.0 2.5 57 260-316 143-204 (278) 56 PRK06256 biotin synthase; Vali 25.5 15 0.00038 17.3 -0.6 48 220-267 196-245 (325) 57 pfam03867 FTZ Fushi tarazu (FT 25.4 24 0.00061 16.0 0.4 16 269-284 224-239 (264) 58 PHA02291 hypothetical protein 25.4 49 0.0012 14.0 3.6 37 2-38 4-40 (131) 59 pfam05819 NolX NolX protein. T 25.2 34 0.00086 15.0 1.2 110 192-320 398-531 (545) 60 cd01454 vWA_norD_type norD typ 25.2 49 0.0013 14.0 6.8 90 227-319 38-141 (174) 61 TIGR02764 spore_ybaN_pdaB poly 25.0 50 0.0013 13.9 2.4 91 36-127 10-124 (198) 62 KOG0015 consensus 24.9 50 0.0013 13.9 2.3 23 254-276 227-249 (338) 63 KOG3032 consensus 24.9 23 0.00057 16.1 0.2 15 189-203 179-193 (264) 64 pfam10531 SLBB SLBB domain. 24.5 51 0.0013 13.9 2.4 22 79-100 10-31 (52) 65 PRK13124 consensus 24.0 52 0.0013 13.8 3.3 53 263-315 137-192 (257) 66 PRK13697 cytochrome c6; Provis 23.5 53 0.0014 13.8 2.8 17 1-17 1-17 (111) 67 PRK13123 consensus 23.3 53 0.0014 13.7 2.9 60 256-316 133-198 (256) 68 pfam05170 AsmA AsmA family. Th 23.3 54 0.0014 13.7 2.4 18 1-18 1-18 (537) 69 PRK13118 consensus 23.2 54 0.0014 13.7 3.6 55 261-316 144-203 (269) 70 PRK13139 consensus 22.9 54 0.0014 13.7 2.9 58 258-316 139-201 (254) 71 cd00118 LysM Lysin domain, fou 22.6 55 0.0014 13.7 3.6 40 37-84 2-43 (46) 72 cd01775 CYR1_RA CYR1 is a fung 22.5 18 0.00046 16.8 -0.7 48 34-81 13-60 (97) 73 PRK13114 consensus 22.2 56 0.0014 13.6 3.0 55 261-316 140-199 (266) 74 cd01777 SNX27_RA SNX27_RA SN 22.0 57 0.0014 13.6 2.2 54 30-83 8-64 (87) 75 TIGR03174 cas_Csc3 CRISPR-asso 22.0 57 0.0014 13.6 2.5 19 256-274 486-504 (953) 76 pfam07219 HemY_N HemY protein 21.7 58 0.0015 13.5 3.5 22 1-22 1-22 (134) 77 TIGR02057 PAPS_reductase phosp 21.6 21 0.00054 16.3 -0.5 13 140-152 65-77 (239) 78 pfam12234 Rav1p_C RAVE protein 21.2 59 0.0015 13.5 4.2 48 179-226 280-335 (629) 79 PRK00031 lolA outer-membrane l 21.1 59 0.0015 13.5 2.4 16 1-16 1-16 (201) 80 PRK13112 consensus 20.7 60 0.0015 13.4 2.5 57 259-316 143-204 (279) 81 COG4814 Uncharacterized protei 20.6 35 0.00089 14.9 0.5 55 162-222 197-251 (288) 82 PRK13137 consensus 20.4 61 0.0016 13.4 2.1 58 257-315 146-209 (266) 83 KOG2500 consensus 20.2 62 0.0016 13.3 4.1 42 76-121 28-69 (253) No 1 >PRK10270 hypothetical protein; Provisional Probab=100.00 E-value=0 Score=719.14 Aligned_cols=320 Identities=35% Similarity=0.568 Sum_probs=287.4 Q ss_pred CHHHHHHHHHHHHHH---HHHHHHHHH-HHCCCCCCCCCEEEEECCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCC Q ss_conf 957999999999999---999999998-7246777778789998888999999999998798069899999999538667 Q gi|254780532|r 1 MLKFLIPLITIFLLA---IGVHIHVIR-VYNATGPLQNDTIFLVRNNMSLKEISKNLFNGGVIVNPYIFRYVTQFYFGSR 76 (325) Q Consensus 1 m~k~li~li~i~~l~---~~~~~~~~~-~~~~~~~~~~~~~i~I~~G~s~~~I~~~L~~~gvI~s~~~F~~~~~~~~~~~ 76 (325) |+|+++.++++++++ +++.+|... ....+++..++++|+|++|+|..+|++.|+++|||+|+.+|++++++.+.+. T Consensus 1 Mkk~~~~~~~l~~~l~i~~g~~~~~~~~~~~~~~~~~~~vvv~I~~G~s~~~Ia~~L~~~gvI~s~~~F~~~~~~~~~~~ 80 (340) T PRK10270 1 MKKVLLIILLLLVVLGIAAGVGVWKVRHLADSKLLIKEETIFTLKPGTGRLALGEQLYADKIINRPRVFQWLLRIEPDLS 80 (340) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEECCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCC T ss_conf 90289999999999999999999874100258999998889998999999999999998868378999999999679757 Q ss_pred CCCCCEEEECCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCC------------C--CCCCCCC Q ss_conf 6257204406787589999986102433321011110348999998620731113443------------6--6881000 Q gi|254780532|r 77 GLKTGEYEIEKGSSMSQIAEKIMYGKVLMHSISFPEGFTVKQMARRLKDNPLLVGELP------------L--ELPLEGT 142 (325) Q Consensus 77 ~ikaG~Y~i~~~~s~~~il~~L~~G~~~~~~iTI~EG~t~~ei~~~L~~~~~l~~~~~------------~--~~~lEG~ 142 (325) +||||+|.|+++||..++++.|.+|++..++||||||+|+.|+++++++.+.+..... . ...+||| T Consensus 81 ~ikaG~Y~l~~~ms~~~il~~L~~g~~~~~~vTipEG~t~~ei~~~l~~~~~i~~~~~~~~~~~~~~~~~~~~~~~lEGy 160 (340) T PRK10270 81 HFKAGTYRFTPQMTVREMLKLLESGKEAQFPLRLVEGMRLSDYLKQLREAPYIKHTLSDDKYATVAQALELENPEWIEGW 160 (340) T ss_pred CCCCEEEEECCCCCHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHCHHHHHHCCCCCHHHHHHHCCCCCCCCCCEE T ss_conf 61446999689999999999997199622689954898699999998757556653133208999986288998764557 Q ss_pred CCHHHEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCC Q ss_conf 03011210146897899999999999987556554531066765999999887777652672578899999999740587 Q gi|254780532|r 143 LCPSTYNFPLGTHRSEILNQAMLKQKQVVDEVWEIRDVDHPIKSKEDLVILASIVEKETSRADERAHVASVFINRFSKSI 222 (325) Q Consensus 143 l~PdTY~~~~~~s~~~il~~m~~~~~~~~~~~~~~~~~~~~~~s~~evlilASIVEkEa~~~~e~~~IA~V~~NRL~~~M 222 (325) ||||||+|.+++++.+++++|+++|.+.+++.|..++.+++..+++|++||||||||||+.++|||+|||||+|||++|| T Consensus 161 LfPdTY~~~~~~~~~~il~~~~~~~~~~~~~~~~~~~~~l~~~~~~e~ltlASIVEkEa~~~~er~~IA~Vf~NRL~~gM 240 (340) T PRK10270 161 FWPDTWMYTANTTDVALLKRAHKKMVKAVDSAWEGRADGLPYKDKNQLVTMASIIEKETAVASERDQVASVFINRLRIGM 240 (340) T ss_pred ECCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCC T ss_conf 63865884699998999999999999999999986411578678999999999999974274212789999999862399 Q ss_pred EEEEEEEEEEECCCCCCCCCCCCCCCHHHCCCCCCHHCCCCCCCCCCCCCCCHHHHHHHCCCCCCCEEEEEECCCCCEEE Q ss_conf 02442221010155667754200000010156830110238889998664327989996087889879999838987665 Q gi|254780532|r 223 RLQSDSTVIYGILEGDYDLTNRKISRSDFSIKTPYNSYLMNGLPPTAISNPGRLSLEAVAKPLHTEDLYFVGDGKGGHFF 302 (325) Q Consensus 223 ~Lq~D~Tv~Y~l~~~~~~~~~~~~~~~dl~~~spYNTY~~~GLPP~PI~~pg~~ai~A~l~p~~~dylyfv~~~~g~h~F 302 (325) +||+||||+||+++. ..++++++||+++||||||+++|||||||||||.+||+||+||++|||||||+|++|+|+| T Consensus 241 ~LqsD~Tv~Y~l~~~----~~~~~~~~dl~~~sPYNTY~~~GLPPgPI~~Pg~~ai~Aal~P~~~dylYFVa~~~g~h~F 316 (340) T PRK10270 241 RLQTDPTVIYGMGER----YNGKLSRADLETPTAYNTYTITGLPPGAIATPGADSLKAAAHPAKTPYLYFVADGKGGHTF 316 (340) T ss_pred CCCCCCHHHCCCCCC----CCCCCCHHHHCCCCCHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCEEEEEECCCCCEEC T ss_conf 888781221004787----6775566664179821210116989987678799999998688888748999839998572 Q ss_pred CCCHHHHHHHHHHHHHHHHHCC Q ss_conf 2999999999999999987288 Q gi|254780532|r 303 STNFKDHTINVQKWRKMSLESK 324 (325) Q Consensus 303 s~t~~eH~~nv~~y~~~~~~~k 324 (325) |+||+||++||++|+|+.++++ T Consensus 317 s~T~~eH~~~v~~Y~~~~~~~~ 338 (340) T PRK10270 317 NTNLASHNRSVQDYLKVLKEKN 338 (340) T ss_pred CCCHHHHHHHHHHHHHHHHHHH T ss_conf 7999999999999999998751 No 2 >COG1559 Aminodeoxychorismate lyase [Coenzyme transport and metabolism] Probab=100.00 E-value=0 Score=674.41 Aligned_cols=314 Identities=41% Similarity=0.664 Sum_probs=279.6 Q ss_pred HHHHHHHHHHHHHH-HHHHHHHHH-CCCCCC-CCCEEEEECCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCC Q ss_conf 99999999999999-999999872-467777-787899988889999999999987980698999999995386676257 Q gi|254780532|r 4 FLIPLITIFLLAIG-VHIHVIRVY-NATGPL-QNDTIFLVRNNMSLKEISKNLFNGGVIVNPYIFRYVTQFYFGSRGLKT 80 (325) Q Consensus 4 ~li~li~i~~l~~~-~~~~~~~~~-~~~~~~-~~~~~i~I~~G~s~~~I~~~L~~~gvI~s~~~F~~~~~~~~~~~~ika 80 (325) +++.++++++++++ +.++....+ ...++. .+++.|+|++|++..+|++.|+++++|++..+|.++++.++....||+ T Consensus 11 ~~~~~i~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~v~I~~Gs~~~~i~~~Le~~~vI~~~~~f~~~~~~~~~~~~lka 90 (342) T COG1559 11 TLLVLVLVLLLILGGVGYIGILYFFALSPPLGKNKKTVTIPPGSSVSEIGSELEKQKVIKNARVFRLYLKYNPAAGSLKA 90 (342) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCC T ss_conf 99999999999999999999998713688666786599978999778999999866752567999999985576577677 Q ss_pred CEEEECCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCC------------C--CCCCCCCCCCHH Q ss_conf 20440678758999998610243332101111034899999862073111344------------3--668810000301 Q gi|254780532|r 81 GEYEIEKGSSMSQIAEKIMYGKVLMHSISFPEGFTVKQMARRLKDNPLLVGEL------------P--LELPLEGTLCPS 146 (325) Q Consensus 81 G~Y~i~~~~s~~~il~~L~~G~~~~~~iTI~EG~t~~ei~~~L~~~~~l~~~~------------~--~~~~lEG~l~Pd 146 (325) |+|.|.++||..+++..|.+|+....++|||||+|+.|++++|.+.+.+.... + ...++||||||| T Consensus 91 G~Y~l~~~ms~~~il~~L~~Gk~~~~~~~ipEG~t~~~i~~~l~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~EG~l~Pd 170 (342) T COG1559 91 GEYRLRPGMSAADILKLLESGKSAQFKVTIPEGYTLKQIAKKLAEAPVLDGVLDGLAIAEIAARLPKLDKERLEGYLFPD 170 (342) T ss_pred CEEECCCCCCHHHHHHHHHCCCCCCCEEECCCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEECCC T ss_conf 11442789999999999865765554276689862999999983022654311345667899846887678975165376 Q ss_pred HEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 12101468978999999999999875565545310667659999998877776526725788999999997405870244 Q gi|254780532|r 147 TYNFPLGTHRSEILNQAMLKQKQVVDEVWEIRDVDHPIKSKEDLVILASIVEKETSRADERAHVASVFINRFSKSIRLQS 226 (325) Q Consensus 147 TY~~~~~~s~~~il~~m~~~~~~~~~~~~~~~~~~~~~~s~~evlilASIVEkEa~~~~e~~~IA~V~~NRL~~~M~Lq~ 226 (325) ||.|+++++.++|+++|+.+|.+.+++.|..++.++|+++++|+|||||||||||++++|||+|||||+|||++||+||+ T Consensus 171 TY~~~~g~~~~~Ii~~mi~~~~~~l~~~~~~~~~~lp~~t~~e~ltlASIVEKEa~~~~Erp~IAsVFyNRLk~gM~LQs 250 (342) T COG1559 171 TYEFTRGTSAEDIIKRMIKAMDKKLDEAWEKRADDLPGKTPYELLTLASIVEKEAAVDEERPKIASVFYNRLKKGMRLQS 250 (342) T ss_pred EEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCC T ss_conf 36408999899999999999999887754202556776889999999999997513644220457896877600885787 Q ss_pred EEEEEEECCCCCCCCCCCCCCCHHHCCCCCCHHCCCCCCCCCCCCCCCHHHHHHHCCCCCCCEEEEEECCCCCEEECCCH Q ss_conf 22210101556677542000000101568301102388899986643279899960878898799998389876652999 Q gi|254780532|r 227 DSTVIYGILEGDYDLTNRKISRSDFSIKTPYNSYLMNGLPPTAISNPGRLSLEAVAKPLHTEDLYFVGDGKGGHFFSTNF 306 (325) Q Consensus 227 D~Tv~Y~l~~~~~~~~~~~~~~~dl~~~spYNTY~~~GLPP~PI~~pg~~ai~A~l~p~~~dylyfv~~~~g~h~Fs~t~ 306 (325) ||||+||+++ ...++++++||++|||||||+++|||||||||||.+||+||+||++|||||||++++|+|.||+|| T Consensus 251 DpTViYglg~----~~~~~~~~~Dl~~~TPYNTY~~~GLPp~PIanPg~~aL~Aa~~P~~tdyLYFVa~~~gGh~Fs~tl 326 (342) T COG1559 251 DPTVIYGLGK----SHEGAISKKDLEIDTPYNTYKIKGLPPTPIANPGLAALRAALNPAKTDYLYFVADGDGGHVFSKTL 326 (342) T ss_pred CCEEEECCCC----CCCCCCCHHHHCCCCCCHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCEEEEEECCCCEEECCCH T ss_conf 8701120567----546764688727899852010279899877781699999985888788169999799986835899 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999999999999987 Q gi|254780532|r 307 KDHTINVQKWRKMSL 321 (325) Q Consensus 307 ~eH~~nv~~y~~~~~ 321 (325) +||++||++|+++.+ T Consensus 327 ~eHn~~v~~y~~~~~ 341 (342) T COG1559 327 EEHNANVEKYRKSLR 341 (342) T ss_pred HHHHHHHHHHHHHCC T ss_conf 999999999986405 No 3 >pfam02618 YceG YceG-like family. This family of proteins is found in bacteria. Proteins in this family are typically between 332 and 389 amino acids in length. This family was previously incorrectly annotated and names as aminodeoxychorismate lyase. The structure of YceG was solved by X-ray crystallography. Probab=100.00 E-value=0 Score=668.89 Aligned_cols=276 Identities=42% Similarity=0.700 Sum_probs=262.7 Q ss_pred EEEEECCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 89998888999999999998798069899999999538667625720440678758999998610243332101111034 Q gi|254780532|r 36 TIFLVRNNMSLKEISKNLFNGGVIVNPYIFRYVTQFYFGSRGLKTGEYEIEKGSSMSQIAEKIMYGKVLMHSISFPEGFT 115 (325) Q Consensus 36 ~~i~I~~G~s~~~I~~~L~~~gvI~s~~~F~~~~~~~~~~~~ikaG~Y~i~~~~s~~~il~~L~~G~~~~~~iTI~EG~t 115 (325) |+|+|++|+|+++|++.|+++|||+|+.+|++++++.+.+++||||+|.|+++||..++++.|.+|++..++||||||+| T Consensus 1 V~v~I~~G~s~~~I~~~L~~~gvI~s~~~F~~~~~~~~~~~~ik~G~Y~l~~~~s~~~il~~L~~g~~~~~~vTI~EG~t 80 (287) T pfam02618 1 VVVTIPPGSSLSQIADILEEAGVIKSALAFRLYARLKGKDGNLKAGEYELKPGMSPAEILEKLVSGKVVQYKVTIPEGLT 80 (287) T ss_pred CEEEECCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCEEEEEEEECCCCCHHHHHHHHHCCCEEEEEEEECCCCC T ss_conf 98998999999999999998859178999999998779868540179897999999999998635975789999869860 Q ss_pred HHHHHHHHHCCCCCCCCCC-----------CCCCCCCCCCHHHEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 8999998620731113443-----------66881000030112101468978999999999999875565545310667 Q gi|254780532|r 116 VKQMARRLKDNPLLVGELP-----------LELPLEGTLCPSTYNFPLGTHRSEILNQAMLKQKQVVDEVWEIRDVDHPI 184 (325) Q Consensus 116 ~~ei~~~L~~~~~l~~~~~-----------~~~~lEG~l~PdTY~~~~~~s~~~il~~m~~~~~~~~~~~~~~~~~~~~~ 184 (325) +.|+++.|++...+..+.+ ...++|||||||||+|++++++++++++|+++|.++|++.|..++.+.++ T Consensus 81 ~~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~lEG~l~PdTY~~~~~~s~~~il~~m~~~~~~~~~~~~~~~~~~~~~ 160 (287) T pfam02618 81 LEQIAERLAKAGGLKKEEFLAFLEELGGELPSLPLEGYLFPDTYEFDKGTSAEEILKRMVKRFEKVLAEAWQARAKGLPL 160 (287) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCEEEECCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 99999999856599989999869987512688983448728768878999999999999999999998988765441367 Q ss_pred CCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCEEEEEEEEEEECCCCCCCCCCCCCCCHHHCCCCCCHHCCCCC Q ss_conf 65999999887777652672578899999999740587024422210101556677542000000101568301102388 Q gi|254780532|r 185 KSKEDLVILASIVEKETSRADERAHVASVFINRFSKSIRLQSDSTVIYGILEGDYDLTNRKISRSDFSIKTPYNSYLMNG 264 (325) Q Consensus 185 ~s~~evlilASIVEkEa~~~~e~~~IA~V~~NRL~~~M~Lq~D~Tv~Y~l~~~~~~~~~~~~~~~dl~~~spYNTY~~~G 264 (325) .|++|+||||||||+||+.++||++|||||+|||++||+|||||||+||+++. .++++.+||++|||||||+++| T Consensus 161 ~s~~evltLASIVEkEa~~~~e~~~IA~V~~NRL~~gm~LQ~D~Tv~Y~~~~~-----~~~~~~~dl~~~spYNTY~~~G 235 (287) T pfam02618 161 KTPYEALILASIVEKETGVAEERPKIAGVFYNRLKKGMRLQSDPTVIYGLGEG-----DGRLTRKDLRTDSPYNTYKIKG 235 (287) T ss_pred CCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCCCEEECCCCC-----CCCCCHHHHCCCCCHHHCCCCC T ss_conf 78789767878888873274036789999998644799777584203537788-----8866589863688213214169 Q ss_pred CCCCCCCCCCHHHHHHHCCCCCCCEEEEEECCCCCEEECCCHHHHHHHHHHH Q ss_conf 8999866432798999608788987999983898766529999999999999 Q gi|254780532|r 265 LPPTAISNPGRLSLEAVAKPLHTEDLYFVGDGKGGHFFSTNFKDHTINVQKW 316 (325) Q Consensus 265 LPP~PI~~pg~~ai~A~l~p~~~dylyfv~~~~g~h~Fs~t~~eH~~nv~~y 316 (325) |||||||+||.+||+||+||+++||||||++++|+|+||+||+||++||++| T Consensus 236 LPPgPI~~P~~~ai~Aal~p~~~~ylyfv~~~~g~~~Fs~t~~eH~~~v~~y 287 (287) T pfam02618 236 LPPGPIANPGKAAIEAALNPAKTDYLYFVADGDGTHYFSKTLAEHNRAVRKY 287 (287) T ss_pred CCCCCCCCCCHHHHHHHHCCCCCCEEEEEECCCCCEEECCCHHHHHHHHHHC T ss_conf 9998768879999999868888874899985998478078999999999739 No 4 >TIGR00247 TIGR00247 conserved hypothetical protein TIGR00247; InterPro: IPR003770 This family contains several aminodeoxychorismate lyase proteins. Aminodeoxychorismate lyase is a pyridoxal 5'-phosphate-dependent enzyme that converts 4-aminodeoxychorismate to pyruvate and p-aminobenzoate, a precursor of folic acid in bacteria . . Probab=100.00 E-value=0 Score=638.55 Aligned_cols=323 Identities=34% Similarity=0.554 Sum_probs=283.6 Q ss_pred CHHHH----HHHH---HHHHHHHHHHHHHHHHHCCCC-CCCCCEEEEECCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHC Q ss_conf 95799----9999---999999999999998724677-777878999888899999999999879806989999999953 Q gi|254780532|r 1 MLKFL----IPLI---TIFLLAIGVHIHVIRVYNATG-PLQNDTIFLVRNNMSLKEISKNLFNGGVIVNPYIFRYVTQFY 72 (325) Q Consensus 1 m~k~l----i~li---~i~~l~~~~~~~~~~~~~~~~-~~~~~~~i~I~~G~s~~~I~~~L~~~gvI~s~~~F~~~~~~~ 72 (325) |+|+| |.+. ++|++++|+.+|....+.... ....++.|+|++|+|...||..|.++.||++..+|.+++++. T Consensus 7 ~Kk~Lnkkri~i~~~~vll~~~gs~~~y~~~~L~~~~~~~~~~~~~~i~~G~~~~~lA~~L~~~k~I~~~~~l~y~~~~~ 86 (373) T TIGR00247 7 MKKVLNKKRIIILLLLVLLLIIGSILLYKISLLEKSELAVKLEVEFNIEKGTGVSKLAKELKKEKLIKSEKLLQYLLKIK 86 (373) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHC T ss_conf 45566566899999999999998899887775200122115446886178977899999987620112531665566552 Q ss_pred CCCCCCCCCEEEECCCCCHHHHHHHHHHCCC-CCCCCCCCCCCCHHHHHHHHHCCC-----CCCCCC------------- Q ss_conf 8667625720440678758999998610243-332101111034899999862073-----111344------------- Q gi|254780532|r 73 FGSRGLKTGEYEIEKGSSMSQIAEKIMYGKV-LMHSISFPEGFTVKQMARRLKDNP-----LLVGEL------------- 133 (325) Q Consensus 73 ~~~~~ikaG~Y~i~~~~s~~~il~~L~~G~~-~~~~iTI~EG~t~~ei~~~L~~~~-----~l~~~~------------- 133 (325) |.+.++|||.|.|.+.|+..+|+..|.+|.. .++.||++||.++.+++..|.+.+ .+.+++ T Consensus 87 g~~~~fkaG~Y~l~~~~~~~~~~~~L~sG~~~~~f~v~~~EG~~~~~~~~~L~~~~kyskk~v~~~l~D~~f~~~l~~~~ 166 (373) T TIGR00247 87 GELKKFKAGTYLLNKDMTVDEILKLLESGKENVQFDVTVVEGATLKDIAKKLKEAPKYSKKQVIDTLKDELFLEQLKKEK 166 (373) T ss_pred CCCCCEECCEEEECCCCCHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHCC T ss_conf 76554003213456887689999996357765203457627702789999974045001456422035267999986306 Q ss_pred -----------CCCCCCCCCCCHHHEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHC Q ss_conf -----------366881000030112101468978999999999999875565545310667659999998877776526 Q gi|254780532|r 134 -----------PLELPLEGTLCPSTYNFPLGTHRSEILNQAMLKQKQVVDEVWEIRDVDHPIKSKEDLVILASIVEKETS 202 (325) Q Consensus 134 -----------~~~~~lEG~l~PdTY~~~~~~s~~~il~~m~~~~~~~~~~~~~~~~~~~~~~s~~evlilASIVEkEa~ 202 (325) ..++|+|||||||||.|..+++.-+.+.||..++.+-+.+.|..+..+.+..++|+.||||||||+|++ T Consensus 167 ~~~v~~~~~~~n~~~PlEG~LfPdTY~f~~~~t~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~ASivE~E~~ 246 (373) T TIGR00247 167 KSTVTQDLELKNSKHPLEGWLFPDTYKFTKKDTDLELLKRAFVKMVKALLKAWKSRLKDLPVKELYKLLIMASIVEKETT 246 (373) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 23577777541788761301177665788896149999999999887666654334305863320310011224545665 Q ss_pred CHHHHHHHHHHHHHHHCCCCEEEEEEEEEEECCCCCCCCCCCCCCCHHHCCCCCCHHCCCCCCCCCCCCCCCHHHHHHHC Q ss_conf 72578899999999740587024422210101556677542000000101568301102388899986643279899960 Q gi|254780532|r 203 RADERAHVASVFINRFSKSIRLQSDSTVIYGILEGDYDLTNRKISRSDFSIKTPYNSYLMNGLPPTAISNPGRLSLEAVA 282 (325) Q Consensus 203 ~~~e~~~IA~V~~NRL~~~M~Lq~D~Tv~Y~l~~~~~~~~~~~~~~~dl~~~spYNTY~~~GLPP~PI~~pg~~ai~A~l 282 (325) ...||++|||||+|||+.||+||+||||+||+|+.......+++..+||++|||||||+++|||||||++||.+|++||+ T Consensus 247 v~~er~k~aSVFyNRLk~kM~LQtDpTv~Y~~ge~~~r~~~k~iyfsDLEi~sPYNTY~~~GlpPtPIa~aG~~SL~Aa~ 326 (373) T TIGR00247 247 VKSERDKVASVFYNRLKKKMKLQTDPTVIYGMGEEYNRSLSKRIYFSDLEIDSPYNTYINTGLPPTPIANAGISSLQAAA 326 (373) T ss_pred HHCCCCCEEHHHHHHHHCCCCCCCCCEEEEECCCCCCCCCEEEEEEEECCCCCCCCCEECCCCCCCCCCHHHHHHHHHHH T ss_conf 53156520030013453378888861023303111588100256642157688866522178888612211078999950 Q ss_pred CCCCCCEEEEEECCC--CCEEECCCHHHHHHHHHHHHHHHHHC Q ss_conf 878898799998389--87665299999999999999998728 Q gi|254780532|r 283 KPLHTEDLYFVGDGK--GGHFFSTNFKDHTINVQKWRKMSLES 323 (325) Q Consensus 283 ~p~~~dylyfv~~~~--g~h~Fs~t~~eH~~nv~~y~~~~~~~ 323 (325) +|++|||||||||++ |+|.|+++|.||++.+++|.+|.+-. T Consensus 327 ~P~kT~YLYFva~~~klG~h~F~~~l~~Hnka~~~Yl~~~~~~ 369 (373) T TIGR00247 327 HPKKTDYLYFVAKGSKLGGHKFTKNLKEHNKAVQDYLKSFKTE 369 (373) T ss_pred CCCCCCEEEEEEECCCCCCCEECCCHHHHHHHHHHHHHHHHHC T ss_conf 8667873689984650155134321556657899988653000 No 5 >cd08010 yceG_like proteins similar to Escherichia coli yceG. The gene product of Escherichia coli yceG has been erroneously annotated as an aminodeoxychorismate lyase. Its overexpression has been reported to cause abnormal biofilm architecture, and it has been reported to be part of a putative five-gene operon. It might function as a periplasmic solute-binding protein. The family also includes Streptomyces caeruleus NovB, an uncharacterized member of the novobiocin biosynthetic gene cluster. Probab=100.00 E-value=0 Score=597.00 Aligned_cols=231 Identities=42% Similarity=0.744 Sum_probs=215.2 Q ss_pred CCCCEEEECCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCC--------------CCCCCCCCCC Q ss_conf 25720440678758999998610243332101111034899999862073111344--------------3668810000 Q gi|254780532|r 78 LKTGEYEIEKGSSMSQIAEKIMYGKVLMHSISFPEGFTVKQMARRLKDNPLLVGEL--------------PLELPLEGTL 143 (325) Q Consensus 78 ikaG~Y~i~~~~s~~~il~~L~~G~~~~~~iTI~EG~t~~ei~~~L~~~~~l~~~~--------------~~~~~lEG~l 143 (325) ||||+|.|+++||..+++..|.+|++..++||||||+|+.||+++|++...+..+. ....++|||| T Consensus 1 ikaG~Y~l~~~~s~~~il~~L~~g~~~~~~vTIpEG~t~~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lEG~L 80 (245) T cd08010 1 LKAGEYELSPGMSLAEILERLTSGKSAQVKVTIPEGYTLKQIAKALSKAGGLSDAEFLAALKDPAFILLGDPKYPLEGYL 80 (245) T ss_pred CCCEEEEECCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHCCCCCHHHHHHHHHCHHHHHCCCCCCCCCEEE T ss_conf 95668886999899999999973881579999889883999999998566999999999851655553578888842366 Q ss_pred CHHHEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCE Q ss_conf 30112101468978999999999999875565545310667659999998877776526725788999999997405870 Q gi|254780532|r 144 CPSTYNFPLGTHRSEILNQAMLKQKQVVDEVWEIRDVDHPIKSKEDLVILASIVEKETSRADERAHVASVFINRFSKSIR 223 (325) Q Consensus 144 ~PdTY~~~~~~s~~~il~~m~~~~~~~~~~~~~~~~~~~~~~s~~evlilASIVEkEa~~~~e~~~IA~V~~NRL~~~M~ 223 (325) |||||+|.+++++++|+++|+++|.++|++.|..++ +.+++|++|+|||||||||||+.++||++|||||+|||++||+ T Consensus 81 fPdTY~~~~~~s~~~il~~m~~~~~~~~~~~~~~~~-~~~~~s~~evltLASIVEkEa~~~~e~~~IA~Vf~NRL~~~M~ 159 (245) T cd08010 81 FPDTYEFPKGATPEDLLKRMVKRFDKVLEEAWAARA-GLPGLTPYEVLTLASIVEKEAGLPEERPLIAGVFYNRLKKGMR 159 (245) T ss_pred CCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHH-CCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCC T ss_conf 076699537899999999999999999999988653-0268999999999999998723731588999999997025996 Q ss_pred EEEEEEEEEECCCCCCCCCCCCCCCHHHCCCCCCHHCCCCCCCCCCCCCCCHHHHHHHCCCCCCCEEEEEECCCCCEEEC Q ss_conf 24422210101556677542000000101568301102388899986643279899960878898799998389876652 Q gi|254780532|r 224 LQSDSTVIYGILEGDYDLTNRKISRSDFSIKTPYNSYLMNGLPPTAISNPGRLSLEAVAKPLHTEDLYFVGDGKGGHFFS 303 (325) Q Consensus 224 Lq~D~Tv~Y~l~~~~~~~~~~~~~~~dl~~~spYNTY~~~GLPP~PI~~pg~~ai~A~l~p~~~dylyfv~~~~g~h~Fs 303 (325) ||+||||+||+++.. .+.++++||++|||||||+++|||||||||||.+||+|||||+++||||||++++|+|+|| T Consensus 160 LQ~D~Tv~Y~l~~~~----~~~~~~~dl~~~sPYNTY~~~GLPPgPI~~Pg~~ai~Aal~P~~~~ylYFva~~~G~h~Fs 235 (245) T cd08010 160 LQSDPTVIYGLGEHK----KGTITKKDLEIDSPYNTYKIKGLPPGPIANPGLAALEAALNPAKTDYLYFVADGTGTHYFS 235 (245) T ss_pred CCCCCHHHHHCCCCC----CCCCCHHHHCCCCCHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCEEEEEECCCCCEEEC T ss_conf 672806775126787----7767699861579412125169999876787999999986888888489998499827847 Q ss_pred CCHHHHHHHH Q ss_conf 9999999999 Q gi|254780532|r 304 TNFKDHTINV 313 (325) Q Consensus 304 ~t~~eH~~nv 313 (325) +||+||++|| T Consensus 236 ~t~~eH~~nV 245 (245) T cd08010 236 KTYEEHNKNV 245 (245) T ss_pred CCHHHHHHHC T ss_conf 8999998629 No 6 >TIGR02869 spore_SleB spore cortex-lytic enzyme; InterPro: IPR014224 The entry represents the spore cortex-lytic enzyme SleB from Bacillus subtilis and other Gram-positive, endospore-forming bacterial species. SleB is stored in an inactive form in the spore and activated during germination.. Probab=96.63 E-value=0.0023 Score=41.63 Aligned_cols=170 Identities=22% Similarity=0.287 Sum_probs=94.4 Q ss_pred CCCCHHHHHHHHHCCCCCCCCCC-----------------CCCCCCCCCCHHHEEEC--CCCCHHHHHHHHHHHHHHHHH Q ss_conf 10348999998620731113443-----------------66881000030112101--468978999999999999875 Q gi|254780532|r 112 EGFTVKQMARRLKDNPLLVGELP-----------------LELPLEGTLCPSTYNFP--LGTHRSEILNQAMLKQKQVVD 172 (325) Q Consensus 112 EG~t~~ei~~~L~~~~~l~~~~~-----------------~~~~lEG~l~PdTY~~~--~~~s~~~il~~m~~~~~~~~~ 172 (325) .|..+.||-++|..-++-.++.. ..++..|..=|+|-.=- ...=.++.++...++=+.+.. T Consensus 12 ~G~~V~~~Q~rLk~wGYY~G~VDG~FG~~Ty~AVr~FQ~knGL~VDGivG~~T~~aLv~A~~Y~~~~v~~~~~~GN~fth 91 (232) T TIGR02869 12 TGSDVIEVQRRLKAWGYYNGKVDGVFGVKTYKAVRKFQSKNGLTVDGIVGPKTKAALVKASKYDKEFVKEQINKGNDFTH 91 (232) T ss_pred CCHHHHHHHHHHHHCCCCCCCCCEEECCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCEEE T ss_conf 62178999999987486114301054736999999988870885564207578999987644678999999853776011 Q ss_pred H--HH---HHHHHCC-----CCCCHHHHHHHHHHHHHHHCCHHHHHHHH--HHHHHHHCCCCEEEEEEEEEEECCCCCCC Q ss_conf 5--65---5453106-----67659999998877776526725788999--99999740587024422210101556677 Q gi|254780532|r 173 E--VW---EIRDVDH-----PIKSKEDLVILASIVEKETSRADERAHVA--SVFINRFSKSIRLQSDSTVIYGILEGDYD 240 (325) Q Consensus 173 ~--~~---~~~~~~~-----~~~s~~evlilASIVEkEa~~~~e~~~IA--~V~~NRL~~~M~Lq~D~Tv~Y~l~~~~~~ 240 (325) - .+ ...+... ...+..++--||-+|+.||.-+---.+|| .|++||+++.--=..=+=|+|-=+-+. - T Consensus 92 yGG~~~~~~~~~~~~G~~vp~~~~~~D~~LLAr~v~gEArGEPY~GqVAVaAViLNRv~dp~FP~Ti~GVIyqp~AF~-a 170 (232) T TIGR02869 92 YGGIATTQQSSAASSGSNVPSGISNQDIDLLARLVNGEARGEPYEGQVAVAAVILNRVRDPRFPNTIAGVIYQPGAFT-A 170 (232) T ss_pred CCCCCCCCCHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHEEEEECCCCCCCCCCCCCCEECCCCCC-C T ss_conf 077156652023203422456764445899999988750687874602100030000378887887765223576656-1 Q ss_pred CCCCCCCCHHHCCCCCCHHCCCCCCCCCCCCCCCHHHHHHHCCCCC-----CCEEEEEE--CCC-------------CCE Q ss_conf 5420000001015683011023888999866432798999608788-----98799998--389-------------876 Q gi|254780532|r 241 LTNRKISRSDFSIKTPYNSYLMNGLPPTAISNPGRLSLEAVAKPLH-----TEDLYFVG--DGK-------------GGH 300 (325) Q Consensus 241 ~~~~~~~~~dl~~~spYNTY~~~GLPP~PI~~pg~~ai~A~l~p~~-----~dylyfv~--~~~-------------g~h 300 (325) ..++++ || ..|.++|++||+..-. +..|||.- ..+ |+| T Consensus 171 v~dGqi----------~~------------~~p~~~s~kAa~DA~nGWDPsgGAlYyfNPa~ats~WIw~Rp~i~~IG~H 228 (232) T TIGR02869 171 VADGQI----------WQ------------LTPTEESIKAALDALNGWDPSGGALYYFNPATATSSWIWSRPIIKRIGKH 228 (232) T ss_pred CCCCEE----------EE------------CCCCHHHHHHHHHHHCCCCCCCCCEEEECCCCCCCCEEEECHHHHCCCCE T ss_conf 237723----------42------------68897899999988427788868468777876768605602221101852 Q ss_pred EECC Q ss_conf 6529 Q gi|254780532|r 301 FFST 304 (325) Q Consensus 301 ~Fs~ 304 (325) .|+| T Consensus 229 ~Fck 232 (232) T TIGR02869 229 VFCK 232 (232) T ss_pred ECCC T ss_conf 0379 No 7 >pfam02618 YceG YceG-like family. This family of proteins is found in bacteria. Proteins in this family are typically between 332 and 389 amino acids in length. This family was previously incorrectly annotated and names as aminodeoxychorismate lyase. The structure of YceG was solved by X-ray crystallography. Probab=95.68 E-value=0.065 Score=32.35 Aligned_cols=67 Identities=16% Similarity=0.274 Sum_probs=50.7 Q ss_pred CEEEEECCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCC------CCCCCCCCEEEECCCCCHHHHHHHHHHC Q ss_conf 789998888999999999998798069899999999538------6676257204406787589999986102 Q gi|254780532|r 35 DTIFLVRNNMSLKEISKNLFNGGVIVNPYIFRYVTQFYF------GSRGLKTGEYEIEKGSSMSQIAEKIMYG 101 (325) Q Consensus 35 ~~~i~I~~G~s~~~I~~~L~~~gvI~s~~~F~~~~~~~~------~~~~ikaG~Y~i~~~~s~~~il~~L~~G 101 (325) .+.|+|++|.+..+|+..|.+++.+....+......... ...-+-|++|.+..+.+..++++.+... T Consensus 70 ~~~vTI~EG~t~~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~lEG~l~PdTY~~~~~~s~~~il~~m~~~ 142 (287) T pfam02618 70 QYKVTIPEGLTLEQIAERLAKAGGLKKEEFLAFLEELGGELPSLPLEGYLFPDTYEFDKGTSAEEILKRMVKR 142 (287) T ss_pred EEEEEECCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCEEEECCEEEECCCCCHHHHHHHHHHH T ss_conf 8999986986099999999856599989999869987512688983448728768878999999999999999 No 8 >cd08010 yceG_like proteins similar to Escherichia coli yceG. The gene product of Escherichia coli yceG has been erroneously annotated as an aminodeoxychorismate lyase. Its overexpression has been reported to cause abnormal biofilm architecture, and it has been reported to be part of a putative five-gene operon. It might function as a periplasmic solute-binding protein. The family also includes Streptomyces caeruleus NovB, an uncharacterized member of the novobiocin biosynthetic gene cluster. Probab=95.49 E-value=0.04 Score=33.69 Aligned_cols=68 Identities=16% Similarity=0.236 Sum_probs=50.2 Q ss_pred CCEEEEECCCCCHHHHHHHHHHCCCCCCHHHHHHHHH---H-C-----CCCCCCCCCEEEECCCCCHHHHHHHHHHC Q ss_conf 8789998888999999999998798069899999999---5-3-----86676257204406787589999986102 Q gi|254780532|r 34 NDTIFLVRNNMSLKEISKNLFNGGVIVNPYIFRYVTQ---F-Y-----FGSRGLKTGEYEIEKGSSMSQIAEKIMYG 101 (325) Q Consensus 34 ~~~~i~I~~G~s~~~I~~~L~~~gvI~s~~~F~~~~~---~-~-----~~~~~ikaG~Y~i~~~~s~~~il~~L~~G 101 (325) ..+.|+|++|.+..+|++.|.+++.+....+...+.. . . ....-+-|++|.+..+++..+|+..+... T Consensus 27 ~~~~vTIpEG~t~~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lEG~LfPdTY~~~~~~s~~~il~~m~~~ 103 (245) T cd08010 27 AQVKVTIPEGYTLKQIAKALSKAGGLSDAEFLAALKDPAFILLGDPKYPLEGYLFPDTYEFPKGATPEDLLKRMVKR 103 (245) T ss_pred EEEEEEECCCCCHHHHHHHHHHCCCCCHHHHHHHHHCHHHHHCCCCCCCCCEEECCCEEEECCCCCHHHHHHHHHHH T ss_conf 57999988988399999999856699999999985165555357888884236607669953789999999999999 No 9 >PRK10270 hypothetical protein; Provisional Probab=94.53 E-value=0.035 Score=34.11 Aligned_cols=61 Identities=13% Similarity=0.120 Sum_probs=39.4 Q ss_pred CCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCC-----CCCCCCCCHHHEEECCCCCHHHHHHHHHH Q ss_conf 32101111034899999862073111344366-----88100003011210146897899999999 Q gi|254780532|r 105 MHSISFPEGFTVKQMARRLKDNPLLVGELPLE-----LPLEGTLCPSTYNFPLGTHRSEILNQAML 165 (325) Q Consensus 105 ~~~iTI~EG~t~~ei~~~L~~~~~l~~~~~~~-----~~lEG~l~PdTY~~~~~~s~~~il~~m~~ 165 (325) ...|+|++|.+..+|++.|.+.+.+.+...-. ....+-|-|++|.+..+++..++++.+.. T Consensus 39 ~vvv~I~~G~s~~~Ia~~L~~~gvI~s~~~F~~~~~~~~~~~~ikaG~Y~l~~~ms~~~il~~L~~ 104 (340) T PRK10270 39 ETIFTLKPGTGRLALGEQLYADKIINRPRVFQWLLRIEPDLSHFKAGTYRFTPQMTVREMLKLLES 104 (340) T ss_pred CEEEEECCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCEEEEECCCCCHHHHHHHHHC T ss_conf 889998999999999999998868378999999999679757614469996899999999999971 No 10 >COG1559 Aminodeoxychorismate lyase [Coenzyme transport and metabolism] Probab=93.97 E-value=0.036 Score=33.97 Aligned_cols=66 Identities=18% Similarity=0.162 Sum_probs=46.6 Q ss_pred EEEEECCCCCHHHHHHHHHHCCCCCCHHHHHHHH---HHCC------CCCCCCCCEEEECCCCCHHHHHHHHHHC Q ss_conf 8999888899999999999879806989999999---9538------6676257204406787589999986102 Q gi|254780532|r 36 TIFLVRNNMSLKEISKNLFNGGVIVNPYIFRYVT---QFYF------GSRGLKTGEYEIEKGSSMSQIAEKIMYG 101 (325) Q Consensus 36 ~~i~I~~G~s~~~I~~~L~~~gvI~s~~~F~~~~---~~~~------~~~~ikaG~Y~i~~~~s~~~il~~L~~G 101 (325) ..+.|++|.++.+|++.|.++.++.....-.... ...+ .+.-+-|++|.+.++.+..+|++.+..- T Consensus 116 ~~~~ipEG~t~~~i~~~l~~~~~l~~~~~~~~~~~~~~~~p~~~~~~~EG~l~PdTY~~~~g~~~~~Ii~~mi~~ 190 (342) T COG1559 116 FKVTIPEGYTLKQIAKKLAEAPVLDGVLDGLAIAEIAARLPKLDKERLEGYLFPDTYEFTRGTSAEDIIKRMIKA 190 (342) T ss_pred CEEECCCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEECCCEEEECCCCCHHHHHHHHHHH T ss_conf 427668986299999998302265431134566789984688767897516537636408999899999999999 No 11 >TIGR00247 TIGR00247 conserved hypothetical protein TIGR00247; InterPro: IPR003770 This family contains several aminodeoxychorismate lyase proteins. Aminodeoxychorismate lyase is a pyridoxal 5'-phosphate-dependent enzyme that converts 4-aminodeoxychorismate to pyruvate and p-aminobenzoate, a precursor of folic acid in bacteria . . Probab=92.56 E-value=0.14 Score=30.25 Aligned_cols=63 Identities=22% Similarity=0.204 Sum_probs=49.5 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCC-----CCCCCCCHHHEEECCCCCHHHHHHHHHH Q ss_conf 33321011110348999998620731113443668-----8100003011210146897899999999 Q gi|254780532|r 103 VLMHSISFPEGFTVKQMARRLKDNPLLVGELPLEL-----PLEGTLCPSTYNFPLGTHRSEILNQAML 165 (325) Q Consensus 103 ~~~~~iTI~EG~t~~ei~~~L~~~~~l~~~~~~~~-----~lEG~l~PdTY~~~~~~s~~~il~~m~~ 165 (325) .....+||+.||+...||..|.++..+..+.-..+ +-.+-|-.+||.+..+.+..++++.+.+ T Consensus 47 ~~~~~~~i~~G~~~~~lA~~L~~~k~I~~~~~l~y~~~~~g~~~~fkaG~Y~l~~~~~~~~~~~~L~s 114 (373) T TIGR00247 47 KLEVEFNIEKGTGVSKLAKELKKEKLIKSEKLLQYLLKIKGELKKFKAGTYLLNKDMTVDEILKLLES 114 (373) T ss_pred CCEEEEECCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCEECCEEEECCCCCHHHHHHHHHC T ss_conf 54468861789778999999876201125316655665527655400321345688768999999635 No 12 >cd03715 RT_ZFREV_like RT_ZFREV_like: A subfamily of reverse transcriptases (RTs) found in sequences similar to the intact endogenous retrovirus ZFERV from zebrafish and to Moloney murine leukemia virus RT. An RT gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. RTs occur in a variety of mobile elements, including retrotransposons, retroviruses, group II introns, bacterial msDNAs, hepadnaviruses, and caulimoviruses. These elements can be divided into two major groups. One group contains retroviruses and DNA viruses whose propagation involves an RNA intermediate. They are grouped together with transposable elements containing long terminal repeats (LTRs). The other group, also called poly(A)-type retrotransposons, contain fungal mitochondrial introns and transposable elements that lack LTRs. Phylogenetic analysis suggests that ZFERV belongs to a distinct group of retroviruses. Probab=83.24 E-value=1.8 Score=23.20 Aligned_cols=72 Identities=13% Similarity=0.027 Sum_probs=51.6 Q ss_pred CCCCCCHHCCCCCCCCCCCCCCC--HHHHHHHCCCCCCCEE--EEEECCCCCEEECCCHHHHHHHHHHHHHHHHHC Q ss_conf 15683011023888999866432--7989996087889879--999838987665299999999999999998728 Q gi|254780532|r 252 SIKTPYNSYLMNGLPPTAISNPG--RLSLEAVAKPLHTEDL--YFVGDGKGGHFFSTNFKDHTINVQKWRKMSLES 323 (325) Q Consensus 252 ~~~spYNTY~~~GLPP~PI~~pg--~~ai~A~l~p~~~dyl--yfv~~~~g~h~Fs~t~~eH~~nv~~y~~~~~~~ 323 (325) ...+|.-+|.++-||-|-..+|. ..++..+|.+.....+ ..+.=-|.-..||+|.+||.+++++-.+..++. T Consensus 112 aF~~~~g~y~~~vlP~Gl~~spa~fq~~m~~~l~~~~~~~~~~~v~~Y~DDIli~s~t~eeh~~~l~~vl~~L~~~ 187 (210) T cd03715 112 AFEWEGQQYTFTRLPQGFKNSPTLFHEALARDLAPFPLEHEGTILLQYVDDLLLAADSEEDCLKGTDALLTHLGEL 187 (210) T ss_pred EEEECCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECEEEEECCCHHHHHHHHHHHHHHHHHC T ss_conf 6896798879998188675958999999999998778648983999981638996499999999999999999997 No 13 >pfam07486 Hydrolase_2 Cell Wall Hydrolase. These enzymes have been implicated in cell wall hydrolysis, most extensively in Bacillus subtilis. For instance SCLE (spore cortex-lytic enzyme) is expressed during sporulation as an inactive form and then deposited on the cell outer cortex. During germination the the enzyme is activated and hydrolyses the cortex. A similar role is carried out by the partially redundant cell wall hydrolase cwlJ. It is not clear whether these enzymes are amidases or peptidases. Probab=69.87 E-value=8.9 Score=18.73 Aligned_cols=17 Identities=29% Similarity=0.505 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHHHHCCC Q ss_conf 57889999999974058 Q gi|254780532|r 205 DERAHVASVFINRFSKS 221 (325) Q Consensus 205 ~e~~~IA~V~~NRL~~~ 221 (325) +-|--||.|++||++.+ T Consensus 6 ~G~~AVa~VilNRv~~~ 22 (107) T pfam07486 6 EGQVAVAAVILNRVRSP 22 (107) T ss_pred HHHHHHHHHHHHCCCCC T ss_conf 89999999999722688 No 14 >PRK03598 hypothetical protein; Provisional Probab=69.74 E-value=6 Score=19.83 Aligned_cols=21 Identities=19% Similarity=0.194 Sum_probs=11.2 Q ss_pred CHHHHHHHHHHHHHH-HHHHHH Q ss_conf 957999999999999-999999 Q gi|254780532|r 1 MLKFLIPLITIFLLA-IGVHIH 21 (325) Q Consensus 1 m~k~li~li~i~~l~-~~~~~~ 21 (325) |+|.++..+++.+++ ++++.| T Consensus 1 mkk~~~~~~~~~~~~~~~~~~~ 22 (331) T PRK03598 1 MKKPVVIGLAVVALVAVGAGGW 22 (331) T ss_pred CCCHHHHHHHHHHHHHHHHHHH T ss_conf 9833899999999999999976 No 15 >cd01647 RT_LTR RT_LTR: Reverse transcriptases (RTs) from retrotransposons and retroviruses which have long terminal repeats (LTRs) in their DNA copies but not in their RNA template. RT catalyzes DNA replication from an RNA template, and is responsible for the replication of retroelements. An RT gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. RTs are present in a variety of mobile elements, including retrotransposons, retroviruses, group II introns, bacterial msDNAs, hepadnaviruses, and Caulimoviruses. Probab=63.55 E-value=9.2 Score=18.64 Aligned_cols=109 Identities=12% Similarity=0.101 Sum_probs=63.4 Q ss_pred HHHHHHHHHHHHHCC-CCEEEEEEEEEEECCCCCCCCCCCCCCCHHHCCCCCCHHCCCCCCCCCCCCCCCH--HHHHHHC Q ss_conf 788999999997405-8702442221010155667754200000010156830110238889998664327--9899960 Q gi|254780532|r 206 ERAHVASVFINRFSK-SIRLQSDSTVIYGILEGDYDLTNRKISRSDFSIKTPYNSYLMNGLPPTAISNPGR--LSLEAVA 282 (325) Q Consensus 206 e~~~IA~V~~NRL~~-~M~Lq~D~Tv~Y~l~~~~~~~~~~~~~~~dl~~~spYNTY~~~GLPP~PI~~pg~--~ai~A~l 282 (325) .+|.+..++. ++.. ...-..|-.--|- . .....-.+.-+....|.-+|...-||=|=.++|.. ..|+.++ T Consensus 43 ~~p~~~~~~~-~~~~~~~~~~lDl~~~f~-q-----i~l~~~s~~~~af~t~~g~y~~~~lp~Gl~~sp~~fq~~~~~~l 115 (177) T cd01647 43 PLPTIDELLE-ELAGAKVFSKLDLRSGYH-Q-----IPLAEESRPKTAFRTPFGLYEYTRMPFGLKNAPATFQRLMNKIL 115 (177) T ss_pred CCCCHHHHHH-HHCCCEEEEEEECCCCHH-C-----CCCCHHHCCCEEEEECCCCEEEEECCCCCCCCHHHHHHHHHHHH T ss_conf 8984788876-644975999988545341-2-----75882201315578468357997346656585899999999999 Q ss_pred CCCCCCEEEEEECCCCCEEECCCHHHHHHHHHHHHHHHHHC Q ss_conf 87889879999838987665299999999999999998728 Q gi|254780532|r 283 KPLHTEDLYFVGDGKGGHFFSTNFKDHTINVQKWRKMSLES 323 (325) Q Consensus 283 ~p~~~dylyfv~~~~g~h~Fs~t~~eH~~nv~~y~~~~~~~ 323 (325) .+.-..+. ++=-|.-..+|+|.+||.+++++--+..++. T Consensus 116 ~~~~~~~~--~~Y~DDili~s~~~eeh~~~l~~vl~~l~~~ 154 (177) T cd01647 116 GDLLGDFV--EVYLDDILVYSKTEEEHLEHLREVLERLREA 154 (177) T ss_pred HHHHHHHC--CCCCCEEEEECCCHHHHHHHHHHHHHHHHHC T ss_conf 98763320--2455428995699999999999999999996 No 16 >cd01645 RT_Rtv RT_Rtv: Reverse transcriptases (RTs) from retroviruses (Rtvs). RTs catalyze the conversion of single-stranded RNA into double-stranded viral DNA for integration into host chromosomes. Proteins in this subfamily contain long terminal repeats (LTRs) and are multifunctional enzymes with RNA-directed DNA polymerase, DNA directed DNA polymerase, and ribonuclease hybrid (RNase H) activities. The viral RNA genome enters the cytoplasm as part of a nucleoprotein complex, and the process of reverse transcription generates in the cytoplasm forming a linear DNA duplex via an intricate series of steps. This duplex DNA is colinear with its RNA template, but contains terminal duplications known as LTRs that are not present in viral RNA. It has been proposed that two specialized template switches, known as strand-transfer reactions or "jumps", are required to generate the LTRs. Probab=62.62 E-value=12 Score=17.99 Aligned_cols=67 Identities=15% Similarity=0.101 Sum_probs=50.0 Q ss_pred CCCCHHCCCCCCCCCCCCCCCH--HHHHHHCCCCCCC-----EEEEEECCCCCEEECCCHHHHHHHHHHHHHHHHHC Q ss_conf 6830110238889998664327--9899960878898-----79999838987665299999999999999998728 Q gi|254780532|r 254 KTPYNSYLMNGLPPTAISNPGR--LSLEAVAKPLHTE-----DLYFVGDGKGGHFFSTNFKDHTINVQKWRKMSLES 323 (325) Q Consensus 254 ~spYNTY~~~GLPP~PI~~pg~--~ai~A~l~p~~~d-----ylyfv~~~~g~h~Fs~t~~eH~~nv~~y~~~~~~~ 323 (325) .+|+-+|...=||=|-.++|.. .++.-+|.+.... .+-|+ |.=..||+|.+||.+.+++-.+..++. T Consensus 118 ~t~~g~y~~~vlPfGl~naPa~Fqr~m~~~L~~~~~~~~~~~~~~Y~---DDIlI~s~t~eeh~~~l~~vl~~L~~~ 191 (213) T cd01645 118 KGPAKRYQWKVLPQGMKNSPTICQSFVAQALEPFRKQYPDIVIYHYM---DDILIASDLEGQLREIYEELRQTLLRW 191 (213) T ss_pred CCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEE---EEEEEECCCHHHHHHHHHHHHHHHHHC T ss_conf 78870589985176345868999999999999988638864899984---248995199999999999999999997 No 17 >PRK13481 glycosyltransferase; Provisional Probab=58.64 E-value=11 Score=18.25 Aligned_cols=22 Identities=23% Similarity=0.067 Sum_probs=14.2 Q ss_pred CCCCHHHHHHHHHHHHHHHCCH Q ss_conf 6765999999887777652672 Q gi|254780532|r 183 PIKSKEDLVILASIVEKETSRA 204 (325) Q Consensus 183 ~~~s~~evlilASIVEkEa~~~ 204 (325) .-.|..|.-+||+|++.=+..+ T Consensus 209 s~Ltl~EaA~LAgl~~aPs~Yn 230 (269) T PRK13481 209 SHITVLQSAILASKVNAPSVYN 230 (269) T ss_pred HHCCHHHHHHHHCCCCCCCCCC T ss_conf 5499999999842177866679 No 18 >COG3944 Capsular polysaccharide biosynthesis protein [Cell envelope biogenesis, outer membrane] Probab=57.76 E-value=15 Score=17.30 Aligned_cols=109 Identities=19% Similarity=0.251 Sum_probs=48.8 Q ss_pred CHHHHH-HHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEE--ECCCCCHHHHHHHHHH--------CCCCCCHHHHHHHH Q ss_conf 957999-999999999999999998724677777878999--8888999999999998--------79806989999999 Q gi|254780532|r 1 MLKFLI-PLITIFLLAIGVHIHVIRVYNATGPLQNDTIFL--VRNNMSLKEISKNLFN--------GGVIVNPYIFRYVT 69 (325) Q Consensus 1 m~k~li-~li~i~~l~~~~~~~~~~~~~~~~~~~~~~~i~--I~~G~s~~~I~~~L~~--------~gvI~s~~~F~~~~ 69 (325) |+++.+ .++.++++++++++.+ +...+. -+..+.+. ...|++.+.-.+..+. ..+|+|+....-.. T Consensus 15 ~K~~~lIlivti~~~~~~~~~sf-fvi~P~--YqssTqilV~~~~~~~~~~~~qdvq~s~qlvnTY~~IikS~~vldeVv 91 (226) T COG3944 15 MKRKKLILIVTILFVAISAIVSF-FVIKPT--YQSSTQILVNQSDGDSTNLQFQDVQSSLQLVNTYAEIIKSPAVLDEVV 91 (226) T ss_pred HHHHHHHHHHHHHHHHHHHHHHE-EECCCC--CCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHH T ss_conf 98789999999999999988821-444744--434169999626899800468889988899999999982778899999 Q ss_pred HHCCCCCCCCCCEEEECCCCCHHHHHHHHHHC---CCCCCCCCCCCCC--CHHHHHHHHHC Q ss_conf 95386676257204406787589999986102---4333210111103--48999998620 Q gi|254780532|r 70 QFYFGSRGLKTGEYEIEKGSSMSQIAEKIMYG---KVLMHSISFPEGF--TVKQMARRLKD 125 (325) Q Consensus 70 ~~~~~~~~ikaG~Y~i~~~~s~~~il~~L~~G---~~~~~~iTI~EG~--t~~ei~~~L~~ 125 (325) .- +.-++|..++-..+.-- +.....+|...|. ...+||+.+++ T Consensus 92 ~e-------------L~l~~T~~qL~~~i~vtse~~tqiInItv~~~~p~~Aa~IAN~~~~ 139 (226) T COG3944 92 SE-------------LGLELTASQLSSKISVTSETDTQIINITVNDGSPEEAAEIANSISE 139 (226) T ss_pred HH-------------HCCCCCHHHHCCCEEECCCCCCEEEEEECCCCCHHHHHHHHHHHHH T ss_conf 98-------------5664588887062775488885689987379986799999999999 No 19 >pfam10566 Glyco_hydro_97 Glycoside hydrolase 97. This is the 97th family of glycosidases, in this case bacterial. The central part of the GH97 family protein sequences represents a typical and complete (beta/alpha)8-barrel or catalytic TIM-barrel type domain. The N- and C-terminal parts of the sequences, mainly consisting of beta-strands, most probably form two additional non-catalytic domains with as yet unknown functions. The non-catalytic domains of glycosidases from the alpha-galactosidase and alpha-glucosidase superfamilies are also predominantly composed of beta-strands, and at least some of these domains are involved in oligomerisation and carbohydrate binding. In all known glycosidases with the (beta-alpha)8-barrel fold, the amino acid residues at the active site are located on the C-termini of the beta-strands. Probab=56.68 E-value=15 Score=17.19 Aligned_cols=43 Identities=23% Similarity=0.329 Sum_probs=21.5 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC-CEEEEECCC Q ss_conf 9579999999999999999999987246777778-789998888 Q gi|254780532|r 1 MLKFLIPLITIFLLAIGVHIHVIRVYNATGPLQN-DTIFLVRNN 43 (325) Q Consensus 1 m~k~li~li~i~~l~~~~~~~~~~~~~~~~~~~~-~~~i~I~~G 43 (325) |||+++.++++++++........+.+.-.+|... .+.|.+..| T Consensus 1 mkk~~~~~~~~~~~~~~~~~~~~k~~~v~SPDG~l~v~v~~~~g 44 (643) T pfam10566 1 MKKLTIILLAFLLLIGNLSLAARKNYQLTSPNGKLKVELTLEGN 44 (643) T ss_pred CCCHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCEEEEEEECCC T ss_conf 95025789999999976567541623899989988999998899 No 20 >PRK13759 arylsulfatase; Provisional Probab=52.64 E-value=12 Score=17.84 Aligned_cols=15 Identities=13% Similarity=0.242 Sum_probs=9.7 Q ss_pred HHHHHHHHHHCCCCC Q ss_conf 999999999879806 Q gi|254780532|r 46 LKEISKNLFNGGVIV 60 (325) Q Consensus 46 ~~~I~~~L~~~gvI~ 60 (325) ...+++.|.++|.-. T Consensus 80 ~~tl~~~Lk~~GY~T 94 (474) T PRK13759 80 KNEMPQMFRDAGYYT 94 (474) T ss_pred CCHHHHHHHHCCCEE T ss_conf 444899999769848 No 21 >pfam09949 DUF2183 Uncharacterized conserved protein (DUF2183). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=51.17 E-value=5.8 Score=19.93 Aligned_cols=18 Identities=39% Similarity=0.870 Sum_probs=11.7 Q ss_pred CCCCHHCC-------CCCCCCCCCC Q ss_conf 68301102-------3888999866 Q gi|254780532|r 254 KTPYNSYL-------MNGLPPTAIS 271 (325) Q Consensus 254 ~spYNTY~-------~~GLPP~PI~ 271 (325) .||||.|- ..|+|+||+- T Consensus 7 ~SPwnly~~l~~Fl~~~~~P~G~l~ 31 (100) T pfam09949 7 NSPWNLYPFLRRFLERNGFPPGPLL 31 (100) T ss_pred CCHHHHHHHHHHHHHHCCCCCCCEE T ss_conf 8878879999999987699998550 No 22 >PRK10783 mltD membrane-bound lytic murein transglycosylase D; Provisional Probab=50.53 E-value=19 Score=16.56 Aligned_cols=13 Identities=23% Similarity=0.353 Sum_probs=5.1 Q ss_pred HHHHHHHCCCCEE Q ss_conf 9999974058702 Q gi|254780532|r 212 SVFINRFSKSIRL 224 (325) Q Consensus 212 ~V~~NRL~~~M~L 224 (325) -|+-|-=+-|..| T Consensus 242 ~Ii~np~~ygi~l 254 (449) T PRK10783 242 DILKNSKRYGVRL 254 (449) T ss_pred HHHHCHHHCCCCC T ss_conf 9986999749777 No 23 >cd02204 PurL_repeat2 PurL subunit of the formylglycinamide ribonucleotide amidotransferase (FGAR-AT), second repeat. FGAR-AT catalyzes the ATP-dependent conversion of formylglycinamide ribonucleotide (FGAR) and glutamine to formylglycinamidine ribonucleotide (FGAM), ADP, phosphate, and glutamate in the fourth step of the purine biosynthetic pathway. In eukaryotes and Gram-negative bacteria, FGAR-AT is encoded by the purL gene as a multidomain protein with a molecular mass of about 140 kDa. In Gram-positive bacteria and archaea FGAR-AT is a complex of three proteins: PurS, PurL, and PurQ. PurL itself contains two tandem N- and C-terminal domains (four domains altogether). The N-terminal domains bind ATP and are related to the ATP-binding domains of HypE, ThiL, SelD and PurM. Probab=50.29 E-value=20 Score=16.54 Aligned_cols=16 Identities=31% Similarity=0.490 Sum_probs=7.4 Q ss_pred CCCHHCCCCCCCCCCC Q ss_conf 8301102388899986 Q gi|254780532|r 255 TPYNSYLMNGLPPTAI 270 (325) Q Consensus 255 spYNTY~~~GLPP~PI 270 (325) |-||++.....||||+ T Consensus 98 Sl~n~~~~~~ippTlv 113 (264) T cd02204 98 SLYNETEGVAIPPTLV 113 (264) T ss_pred EEECCCCCCCCCCCEE T ss_conf 6514667866788528 No 24 >TIGR03417 chol_sulfatase choline-sulfatase. Probab=47.53 E-value=15 Score=17.31 Aligned_cols=14 Identities=14% Similarity=0.154 Sum_probs=9.5 Q ss_pred HHHHHHHHHHCCCC Q ss_conf 99999999987980 Q gi|254780532|r 46 LKEISKNLFNGGVI 59 (325) Q Consensus 46 ~~~I~~~L~~~gvI 59 (325) ...++..|.++|.- T Consensus 81 ~~t~a~~L~~aGY~ 94 (500) T TIGR03417 81 IPTYAHYLRRAGYR 94 (500) T ss_pred CCCHHHHHHHCCCE T ss_conf 67299999977992 No 25 >COG3773 SleB Cell wall hydrolyses involved in spore germination [Cell envelope biogenesis, outer membrane] Probab=47.00 E-value=3.2 Score=21.53 Aligned_cols=120 Identities=21% Similarity=0.323 Sum_probs=76.6 Q ss_pred HHHHHHCCCCCCHHHHHHHHHHHHHHHCCHH--HHHHHHHHHHHHHCCCCEEEEEEEEEEECCCCCCCCCCCCCCCHHHC Q ss_conf 5545310667659999998877776526725--78899999999740587024422210101556677542000000101 Q gi|254780532|r 175 WEIRDVDHPIKSKEDLVILASIVEKETSRAD--ERAHVASVFINRFSKSIRLQSDSTVIYGILEGDYDLTNRKISRSDFS 252 (325) Q Consensus 175 ~~~~~~~~~~~s~~evlilASIVEkEa~~~~--e~~~IA~V~~NRL~~~M~Lq~D~Tv~Y~l~~~~~~~~~~~~~~~dl~ 252 (325) |..+.......+..|+.-||-.|..|+.-.. .+-.||.|++||++..--=-+..-|+|..... +..+. T Consensus 104 ~~~~~~~~~~~t~~e~~lLA~~v~aEA~gE~~~g~vavaqvvlNR~~~p~~p~~i~gvi~q~~~~----~~~qF------ 173 (249) T COG3773 104 YEKRPLAPEVPTSAEVDLLARLVYAEARGEPYAGQVAVAQVVLNRVRNPAFPKAIAGVIYQPKNR----TPCQF------ 173 (249) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHCCCCC----CCCCC------ T ss_conf 01268886656727788887540243447665432143234203445787742566776343447----76662------ Q ss_pred CCCC-CHHCCCCCCCCCCCCCCCHHHHHHHC------CCCCCCEEEEEECCC--------------CCEEECCCHHH Q ss_conf 5683-01102388899986643279899960------878898799998389--------------87665299999 Q gi|254780532|r 253 IKTP-YNSYLMNGLPPTAISNPGRLSLEAVA------KPLHTEDLYFVGDGK--------------GGHFFSTNFKD 308 (325) Q Consensus 253 ~~sp-YNTY~~~GLPP~PI~~pg~~ai~A~l------~p~~~dylyfv~~~~--------------g~h~Fs~t~~e 308 (325) || =++|++.+ +-|-.-+...++.++. .|+..-..||=.+.. |.|.|..|+.| T Consensus 174 --s~v~dg~~~~~--~~~~a~~~a~~~a~~~~a~~~~~P~~~s~~~~~p~~~~~~w~~t~~~v~~iG~hiF~~~~~~ 246 (249) T COG3773 174 --SPVQDGYIYRG--VRPQAWPNAQAVAAAMVALQGWDPEVGAAYFYNPDTVSPSWGYTQKNVDRIGAHIFYQTRPE 246 (249) T ss_pred --CCCCCCEEEEC--CCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCCCHHHCCCHHHHHHHHEEEEECCCC T ss_conf --61005557621--34101213789999998631468887525751578878103046077887630457405788 No 26 >TIGR00434 cysH phosophoadenylyl-sulfate reductase; InterPro: IPR004511 Phosphoadenosine phosphosulphate reductase (1.8.4.8 from EC), involved in the assimilation of inorganic sulphate, is designated cysH in bacteria and MET16 in Saccharomyces cerevisiae. Synonyms include phosphoadenosine phosphosulphate reductase and PAPS reductase. In a reaction requiring reduced thioredoxin and NADPH, it converts 3'-phosphoadenylylsulphate (PAPS) to sulphite and adenosine 3',5' diphosphate (PAP). A related family of plant enzymes, scoring below the trusted cut-off, differs in having a thioredoxin-like C-terminal domain, not requiring thioredoxin, and in having a preference for 5'-adenylylsulphate (APS) over PAPS.; GO: 0004604 phosphoadenylyl-sulfate reductase (thioredoxin) activity, 0019379 sulfate assimilation phosphoadenylyl sulfate reduction by phosphoadenylyl-sulfate reductase (thioredoxin). Probab=43.63 E-value=7.4 Score=19.23 Aligned_cols=35 Identities=17% Similarity=0.302 Sum_probs=17.3 Q ss_pred CCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHEEE Q ss_conf 03489999986207311134436688100003011210 Q gi|254780532|r 113 GFTVKQMARRLKDNPLLVGELPLELPLEGTLCPSTYNF 150 (325) Q Consensus 113 G~t~~ei~~~L~~~~~l~~~~~~~~~lEG~l~PdTY~~ 150 (325) |.=+-++..+|+++-.+.--.+++ =||+||+||.| T Consensus 26 g~V~ldL~~ki~~~~~~i~lDfiD---TG~hF~eTY~~ 60 (226) T TIGR00434 26 GAVLLDLVSKIKKDIPVIFLDFID---TGYHFPETYEL 60 (226) T ss_pred HHHHHHHHHHHCCCCCEEEECCHH---HCCCCHHHHHH T ss_conf 999999988517885146530001---12672778999 No 27 >TIGR02881 spore_V_K stage V sporulation protein K; InterPro: IPR014232 Proteins in this entry include the stage V sporulation protein K (SpoVK), a close homologue of the Rubisco expression protein CbbX (IPR000470 from INTERPRO), and are members of an ATPase family associated with various cellular activities. These proteins are strictly limited to bacterial endospore-forming species, but are not found universally among members of this group; they are missing from the Clostridium species.. Probab=43.54 E-value=25 Score=15.88 Aligned_cols=95 Identities=17% Similarity=0.288 Sum_probs=55.3 Q ss_pred HHHHHHHHHC--CCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHEEECCCCCHHHHHHHHHHHHHH Q ss_conf 9999986102--43332101111034899999862073111344366881000030112101468978999999999999 Q gi|254780532|r 92 SQIAEKIMYG--KVLMHSISFPEGFTVKQMARRLKDNPLLVGELPLELPLEGTLCPSTYNFPLGTHRSEILNQAMLKQKQ 169 (325) Q Consensus 92 ~~il~~L~~G--~~~~~~iTI~EG~t~~ei~~~L~~~~~l~~~~~~~~~lEG~l~PdTY~~~~~~s~~~il~~m~~~~~~ 169 (325) +|.++.|.++ +....-|.|.-|+. .|+.--|+.||.|.+..|+... ||| |.. +..-+|+++|+++.+- T Consensus 127 KEAIDtLVK~mEd~~~~lvlILAGY~-~EM~yFL~~NPGL~SRFPi~i~-----FPd-Y~~---eeL~~Ia~~m~~~ReY 196 (261) T TIGR02881 127 KEAIDTLVKAMEDQRNELVLILAGYS-DEMDYFLSLNPGLRSRFPISID-----FPD-YTV---EELMEIAERMVKEREY 196 (261) T ss_pred CHHHHHHHHHHHHCCCCEEEEEECCH-HHHHHHHHCCCCCCCCCCCEEE-----CCC-CCH---HHHHHHHHHHHHHHHH T ss_conf 20888999987615698689970876-8999986207797776650541-----889-988---8999999999864642 Q ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 8755655453106676599999988777765267 Q gi|254780532|r 170 VVDEVWEIRDVDHPIKSKEDLVILASIVEKETSR 203 (325) Q Consensus 170 ~~~~~~~~~~~~~~~~s~~evlilASIVEkEa~~ 203 (325) .+.+.-... + .+-+....+.-+++.+| T Consensus 197 ~Lt~~A~~~-----l--r~~l~~~~~~~~~~~sN 223 (261) T TIGR02881 197 KLTEEAKWK-----L--REHLAKVDQLSSREFSN 223 (261) T ss_pred CCCHHHHHH-----H--HHHHHHHHHHHHCCCCC T ss_conf 257889999-----9--99974124442100576 No 28 >PRK11198 hypothetical protein; Provisional Probab=41.93 E-value=26 Score=15.72 Aligned_cols=23 Identities=17% Similarity=0.130 Sum_probs=14.7 Q ss_pred CHHHEEECCCCCHHHHHHHHHHH Q ss_conf 30112101468978999999999 Q gi|254780532|r 144 CPSTYNFPLGTHRSEILNQAMLK 166 (325) Q Consensus 144 ~PdTY~~~~~~s~~~il~~m~~~ 166 (325) -|..|.+..++|...|-++.+.. T Consensus 94 e~~~YtVq~GDTLw~IA~k~YGd 116 (147) T PRK11198 94 ESQFYTVKSGDTLSAIAKKVYGN 116 (147) T ss_pred CCCEEEECCCCCHHHHHHHHCCC T ss_conf 98578979899799999998398 No 29 >PRK09510 tolA cell envelope integrity inner membrane protein TolA; Provisional Probab=40.45 E-value=28 Score=15.58 Aligned_cols=51 Identities=12% Similarity=0.107 Sum_probs=24.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCC----CCCCCCCEE--EEECCCCCHHHHHHHHHH Q ss_conf 9999999999999999999987246----777778789--998888999999999998 Q gi|254780532|r 4 FLIPLITIFLLAIGVHIHVIRVYNA----TGPLQNDTI--FLVRNNMSLKEISKNLFN 55 (325) Q Consensus 4 ~li~li~i~~l~~~~~~~~~~~~~~----~~~~~~~~~--i~I~~G~s~~~I~~~L~~ 55 (325) -+|+-+++-++++++.+|.....+. .+.. .+++ |.|.+|.-..+-.+.-.. T Consensus 14 aiiiSv~LHiiLi~lLIwgS~~~~~e~~ggG~g-G~~I~AVmVDpgaV~qqy~r~qqQ 70 (402) T PRK09510 14 AIIISAVLHIILFALLIWSSFDENVEASGGGGG-GSVIDAVMVDPGAVVEQYNRQQQQ 70 (402) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC-CCEEEEEEECCHHHHHHHHHHHHH T ss_conf 999999999999999998001113214689998-773237887808999999999987 No 30 >pfam04337 DUF480 Protein of unknown function, DUF480. This family consists of several proteins of uncharacterized function. Probab=39.76 E-value=13 Score=17.60 Aligned_cols=25 Identities=24% Similarity=0.332 Sum_probs=15.9 Q ss_pred CHHHHHHHHHHHHHHHCCHHHHHHH Q ss_conf 5999999887777652672578899 Q gi|254780532|r 186 SKEDLVILASIVEKETSRADERAHV 210 (325) Q Consensus 186 s~~evlilASIVEkEa~~~~e~~~I 210 (325) |+.|+=+|.++||||...||-+|+- T Consensus 2 s~~E~RVlG~LiEKe~TTPd~YPLt 26 (148) T pfam04337 2 SALEARVLGCLLEKEVTTPDQYPLS 26 (148) T ss_pred CHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 7888879988531123587767640 No 31 >PRK10833 putative assembly protein; Provisional Probab=37.55 E-value=31 Score=15.29 Aligned_cols=20 Identities=20% Similarity=0.418 Sum_probs=11.9 Q ss_pred CHHHHHHHHHHHHHHHHHHH Q ss_conf 95799999999999999999 Q gi|254780532|r 1 MLKFLIPLITIFLLAIGVHI 20 (325) Q Consensus 1 m~k~li~li~i~~l~~~~~~ 20 (325) |+|++..++++++++++... T Consensus 1 MKrll~~l~illvvvv~gl~ 20 (617) T PRK10833 1 MRRFLTTLMILLVVLVAGLS 20 (617) T ss_pred CCHHHHHHHHHHHHHHHHHH T ss_conf 94189999999999999999 No 32 >COG0046 PurL Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain [Nucleotide transport and metabolism] Probab=37.25 E-value=31 Score=15.26 Aligned_cols=71 Identities=23% Similarity=0.323 Sum_probs=44.9 Q ss_pred CCCCHHCCCCCCCCCCCC-CCCHHH-HHHHCCC---CCCCEEEEEECCC---CC------------EEE-CCCHHHHHHH Q ss_conf 683011023888999866-432798-9996087---8898799998389---87------------665-2999999999 Q gi|254780532|r 254 KTPYNSYLMNGLPPTAIS-NPGRLS-LEAVAKP---LHTEDLYFVGDGK---GG------------HFF-STNFKDHTIN 312 (325) Q Consensus 254 ~spYNTY~~~GLPP~PI~-~pg~~a-i~A~l~p---~~~dylyfv~~~~---g~------------h~F-s~t~~eH~~n 312 (325) +|-||++.-.-.||||+- ..|.-. +.-++.| ..++.+|-+-.+. |. ..+ .-++++-.++ T Consensus 546 vSlynet~~~~i~PTpvi~~vg~v~dv~k~~~~~~~~~g~~i~llG~~~~~lggS~~~~~~~~~~~g~~p~~dl~~ek~~ 625 (743) T COG0046 546 VSLYNETNGQPIPPTPVIGAVGLVEDVRKVLTPELKKAGLLLYLLGETKDELGGSELAQVYHGLGDGPPPVVDLAEEKKF 625 (743) T ss_pred EEEEECCCCCCCCCCCEEEEEEEEECHHHHCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHH T ss_conf 30020017964699656999998623234237554668868999089756665549999985113799997798999999 Q ss_pred HHHHHHHHHHCC Q ss_conf 999999987288 Q gi|254780532|r 313 VQKWRKMSLESK 324 (325) Q Consensus 313 v~~y~~~~~~~k 324 (325) .+.-|++.++.+ T Consensus 626 ~~~i~~~i~~~~ 637 (743) T COG0046 626 FDAIRALIADGK 637 (743) T ss_pred HHHHHHHHHCCC T ss_conf 999999975696 No 33 >KOG4254 consensus Probab=36.87 E-value=31 Score=15.22 Aligned_cols=116 Identities=17% Similarity=0.232 Sum_probs=63.3 Q ss_pred HHHHHHHHHHHHCC-CCEEEEEEEEEEECCCCCC----CCCC-CCC-CCHHHCCCCCCHHCCCCCCCCCCCCCCCHHHHH Q ss_conf 88999999997405-8702442221010155667----7542-000-000101568301102388899986643279899 Q gi|254780532|r 207 RAHVASVFINRFSK-SIRLQSDSTVIYGILEGDY----DLTN-RKI-SRSDFSIKTPYNSYLMNGLPPTAISNPGRLSLE 279 (325) Q Consensus 207 ~~~IA~V~~NRL~~-~M~Lq~D~Tv~Y~l~~~~~----~~~~-~~~-~~~dl~~~spYNTY~~~GLPP~PI~~pg~~ai~ 279 (325) |.-||--+.+-+++ |-.++.-+||.=.+-+..- .+.+ ..+ .+.-+..-+|||||.+ =||+.-.+-++ .|+ T Consensus 263 ~Gavs~aia~~~~~~GaeI~tka~Vq~Illd~gka~GV~L~dG~ev~sk~VvSNAt~~~Tf~k-Llp~e~LPeef--~i~ 339 (561) T KOG4254 263 MGAVSFAIAEGAKRAGAEIFTKATVQSILLDSGKAVGVRLADGTEVRSKIVVSNATPWDTFEK-LLPGEALPEEF--VIQ 339 (561) T ss_pred HHHHHHHHHHHHHHCCCEEEEHHHHHHEECCCCEEEEEEECCCCEEEEEEEECCCCHHHHHHH-HCCCCCCCCHH--HHH T ss_conf 068999999999860614210211446031388278789558858771156517846777887-47975578445--455 Q ss_pred HH--CCCCCCCEE-EE-EECCC----C------CEEECCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 96--087889879-99-98389----8------766529999999999999999872886 Q gi|254780532|r 280 AV--AKPLHTEDL-YF-VGDGK----G------GHFFSTNFKDHTINVQKWRKMSLESKP 325 (325) Q Consensus 280 A~--l~p~~~dyl-yf-v~~~~----g------~h~Fs~t~~eH~~nv~~y~~~~~~~k~ 325 (325) -. .+|--.|-+ =| +..+. . -|.=+++-+.|.+.|..+++-..+++| T Consensus 340 q~d~~spv~k~~~psFl~~~~~~~~plph~~~~i~~~~ed~~~~H~~v~D~~~gl~s~~p 399 (561) T KOG4254 340 QLDTVSPVTKDKLPSFLCLPNTKSLPLPHHGYTIHYNAEDTQAHHRAVEDPRNGLASHRP 399 (561) T ss_pred HCCCCCCCCCCCCCCEEECCCCCCCCCCCCCEEEEECCCHHHHHHHHHHCHHHCCCCCCC T ss_conf 326646651002740043378888877755426884572588899997171215665687 No 34 >smart00540 LEM in nuclear membrane-associated proteins. LEM, domain in nuclear membrane-associated proteins, including lamino-associated polypeptide 2 and emerin. Probab=35.90 E-value=15 Score=17.31 Aligned_cols=19 Identities=32% Similarity=0.487 Sum_probs=12.7 Q ss_pred CCCCCCCCCCCCCCCHHHH Q ss_conf 0238889998664327989 Q gi|254780532|r 260 YLMNGLPPTAISNPGRLSL 278 (325) Q Consensus 260 Y~~~GLPP~PI~~pg~~ai 278 (325) -+-.|+|||||-.-.+.-. T Consensus 15 L~~~G~~~gPIt~sTR~vy 33 (44) T smart00540 15 LKQYGLPPGPITDTTRKLY 33 (44) T ss_pred HHHCCCCCCCCCCCHHHHH T ss_conf 9983999999683309999 No 35 >PRK13133 consensus Probab=35.26 E-value=33 Score=15.06 Aligned_cols=60 Identities=13% Similarity=0.041 Sum_probs=33.3 Q ss_pred CHHCCCCCCCCCCCCCCC-HHHHHHHCCCCCCCEEEEEECC--CCCE-----EECCCHHHHHHHHHHH Q ss_conf 011023888999866432-7989996087889879999838--9876-----6529999999999999 Q gi|254780532|r 257 YNSYLMNGLPPTAISNPG-RLSLEAVAKPLHTEDLYFVGDG--KGGH-----FFSTNFKDHTINVQKW 316 (325) Q Consensus 257 YNTY~~~GLPP~PI~~pg-~~ai~A~l~p~~~dylyfv~~~--~g~h-----~Fs~t~~eH~~nv~~y 316 (325) .+.-+..||++-++.+|. .+.=...+.-..++|+|+|+.. +|.- .+.....++.+.++++ T Consensus 142 ~~~~~~~gl~~I~lvaPtt~~eRi~~i~~~s~GFiY~vs~~GvTG~~~~~~~~~~~~~~~~i~~ik~~ 209 (267) T PRK13133 142 LERAKNFGLTVVFLISPVTPPERIEFIDSLSTDFSYCLAVNATTGTAKLSDAGTEAAVDEYLKRVRQH 209 (267) T ss_pred HHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHC T ss_conf 99998469860244289999999999984278957999800134677555542678999999999971 No 36 >COG5294 Uncharacterized protein conserved in bacteria [Function unknown] Probab=35.19 E-value=28 Score=15.52 Aligned_cols=20 Identities=35% Similarity=0.660 Sum_probs=12.8 Q ss_pred CHHHHHHHHHHHHHHHHHHH Q ss_conf 95799999999999999999 Q gi|254780532|r 1 MLKFLIPLITIFLLAIGVHI 20 (325) Q Consensus 1 m~k~li~li~i~~l~~~~~~ 20 (325) |+|.++.++.+.++++++.+ T Consensus 1 MKkil~~ilall~~ii~a~~ 20 (113) T COG5294 1 MKKILIGILALLLIIIGALF 20 (113) T ss_pred CCCHHHHHHHHHHHHHHHHE T ss_conf 90439999999999986621 No 37 >TIGR01063 gyrA DNA gyrase, A subunit; InterPro: IPR005743 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis , . DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA . Type II topoisomerases are ATP-dependent enzymes, and can be subdivided according to their structure and reaction mechanisms: type IIA (topoisomerase II or gyrase, and topoisomerase IV) and type IIB (topoisomerase VI). These enzymes are responsible for relaxing supercoiled DNA as well as for introducing both negative and positive supercoils . Topoisomerase II (called gyrase in bacteria) primarily introduces negative supercoils into DNA. In bacteria, topoisomerase II consists of two polypeptide subunits, gyrA and gyrB, which form a heterotetramer: (BA)2. In most eukaryotes, topoisomerase II consists of a single polypeptide, where the N- and C-terminal regions correspond to gyrB and gyrA, respectively. This entry represents the A subunit (gyrA) as found predominantly in bacteria, but does not include the topoisomerase II enzymes composed of a single polypeptide, as are found in most eukaryotes. GyrA has two functional domains: an N-terminal that forms the covalent DNA-protein bridge that is responsible for the breaking- and rejoining function, and a C-terminal that can bind DNA non-specifically . More information about this protein can be found at Protein of the Month: DNA Topoisomerase .; GO: 0003677 DNA binding, 0003918 DNA topoisomerase (ATP-hydrolyzing) activity, 0005524 ATP binding, 0006265 DNA topological change, 0006268 DNA unwinding during replication, 0005694 chromosome. Probab=34.71 E-value=25 Score=15.82 Aligned_cols=237 Identities=11% Similarity=0.173 Sum_probs=126.6 Q ss_pred EEEEECCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCCEEEECCCCCHHHHHHHHHHCC--CCCC------- Q ss_conf 8999888899999999999879806989999999953866762572044067875899999861024--3332------- Q gi|254780532|r 36 TIFLVRNNMSLKEISKNLFNGGVIVNPYIFRYVTQFYFGSRGLKTGEYEIEKGSSMSQIAEKIMYGK--VLMH------- 106 (325) Q Consensus 36 ~~i~I~~G~s~~~I~~~L~~~gvI~s~~~F~~~~~~~~~~~~ikaG~Y~i~~~~s~~~il~~L~~G~--~~~~------- 106 (325) ++++|++|+...-|.++|++.==+...|--..+|-.+| .|+-|++.+++..+..-+ ++++ T Consensus 298 iVIE~KRD~~a~VvLN~LYk~T~lQ~~Fg~NmLALv~G-----------~Pk~lnLk~~l~~f~~HR~~Vi~RRT~feLr 366 (864) T TIGR01063 298 IVIELKRDAVAEVVLNNLYKQTQLQVSFGINMLALVDG-----------LPKVLNLKELLEAFVEHRKDVITRRTIFELR 366 (864) T ss_pred EEEEEECCCCCCHHHHHHHCCCCCCEECCEEEEEEECC-----------CCCEECHHHHHHHHHHCCEEEEEEEHHCCHH T ss_conf 99997428983015421211264500148123211078-----------9750178899999984231335521000102 Q ss_pred ----CCCCCCCC-----CHHHHHHHHHCCCC--------CCC-----CCC--CCCCCCCCCC-HHHEEECCCCCHHHHHH Q ss_conf ----10111103-----48999998620731--------113-----443--6688100003-01121014689789999 Q gi|254780532|r 107 ----SISFPEGF-----TVKQMARRLKDNPL--------LVG-----ELP--LELPLEGTLC-PSTYNFPLGTHRSEILN 161 (325) Q Consensus 107 ----~iTI~EG~-----t~~ei~~~L~~~~~--------l~~-----~~~--~~~~lEG~l~-PdTY~~~~~~s~~~il~ 161 (325) +.-|.||+ ++.+|...+..... ... +.. ++..+|--|- -++|.|+. .=++.||+ T Consensus 367 KA~eRaHiL~GL~~AL~niDevI~lIr~S~~~~~Ak~~L~e~~W~l~di~~ll~~~~~~~l~~~g~F~l~E-~QA~AILd 445 (864) T TIGR01063 367 KAEERAHILEGLLIALDNIDEVIALIRASQNTEEAKTRLVERQWELLDILPLLKLVLEVELGERGTFSLSE-IQAQAILD 445 (864) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCH-HHHHHHHH T ss_conf 26557899999999997212899997427898899988421686078999999887675328888654488-99999999 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCC--------------------- Q ss_conf 99999999875565545310667659999998877776526725788999999997405--------------------- Q gi|254780532|r 162 QAMLKQKQVVDEVWEIRDVDHPIKSKEDLVILASIVEKETSRADERAHVASVFINRFSK--------------------- 220 (325) Q Consensus 162 ~m~~~~~~~~~~~~~~~~~~~~~~s~~evlilASIVEkEa~~~~e~~~IA~V~~NRL~~--------------------- 220 (325) +-+.+. .-+-.++ +.+.|+-| ++.|-+-| ..=.+..++-+|+-.=|.+ T Consensus 446 MrL~rL----TgLE~~K-----l~~E~~~L-~~~I~~l~-~iL~~~~r~~~iireEL~~i~~~Fgd~RRT~I~~~~~~~~ 514 (864) T TIGR01063 446 MRLQRL----TGLEREK-----LEEEYKEL-LELIADLE-DILASEERVLEIIREELEEIKEQFGDPRRTEIVAEESEDI 514 (864) T ss_pred HHHHHH----HHHHHHH-----HHHHHHHH-HHHHHHHH-HHCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCC T ss_conf 887677----8888999-----99999999-99999999-7503776899999999999999738853212220243224 Q ss_pred --CCEEEEEEEEEEECCCCCCCCCCCCCCCHHHCCCCCCHHCCCCCCCCCCCCCCC-HHHHHHHCCCCCCCEEEEEECCC Q ss_conf --870244222101015566775420000001015683011023888999866432-79899960878898799998389 Q gi|254780532|r 221 --SIRLQSDSTVIYGILEGDYDLTNRKISRSDFSIKTPYNSYLMNGLPPTAISNPG-RLSLEAVAKPLHTEDLYFVGDGK 297 (325) Q Consensus 221 --~M~Lq~D~Tv~Y~l~~~~~~~~~~~~~~~dl~~~spYNTY~~~GLPP~PI~~pg-~~ai~A~l~p~~~dylyfv~~~~ 297 (325) ....|-+.=|+= |.+.+ +-+++.. |-|+.=+.-|==-.-...-- .+.|+.++-...|||+.|+++. T Consensus 515 D~EDLI~~E~vVvt-~s~~G---YvKR~p~------~~Y~~Q~RGGkG~~g~~~~~G~D~i~~~lvasTHD~~Lfftn~- 583 (864) T TIGR01063 515 DIEDLIARENVVVT-LSHKG---YVKRVPV------SAYRSQKRGGKGVSGADLKDGDDFIEQLLVASTHDYLLFFTNR- 583 (864) T ss_pred CCHHHHCCCCEEEE-EECCC---EEEECCH------HHHHCCCCCCCCCCCCCCCCCCCEEEEEEEEECCCEEEEEECC- T ss_conf 50344147878999-71686---4786030------1332576666464325312368505677883077607888188- Q ss_pred CCEEECCCH Q ss_conf 876652999 Q gi|254780532|r 298 GGHFFSTNF 306 (325) Q Consensus 298 g~h~Fs~t~ 306 (325) |+-|-.|-| T Consensus 584 G~vY~lK~Y 592 (864) T TIGR01063 584 GKVYRLKVY 592 (864) T ss_pred CCEEEEEEE T ss_conf 826864544 No 38 >PRK11239 hypothetical protein; Provisional Probab=34.43 E-value=18 Score=16.74 Aligned_cols=70 Identities=13% Similarity=0.134 Sum_probs=35.2 Q ss_pred EEECCCCCHHHHHHHHHHCCCCCCHHHHH-HHHHHCCCCCCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCCC Q ss_conf 99888899999999999879806989999-9999538667625720440678758999998610243332101 Q gi|254780532|r 38 FLVRNNMSLKEISKNLFNGGVIVNPYIFR-YVTQFYFGSRGLKTGEYEIEKGSSMSQIAEKIMYGKVLMHSIS 109 (325) Q Consensus 38 i~I~~G~s~~~I~~~L~~~gvI~s~~~F~-~~~~~~~~~~~ikaG~Y~i~~~~s~~~il~~L~~G~~~~~~iT 109 (325) ....+ +.+....+.|.+++++.....|- -..++...-.+-..|.+.+++. ...=+-..|..|++...-+. T Consensus 48 m~l~e-~eV~~ald~L~~k~lv~~~~~~gsRv~ky~Hr~~~~ef~~l~l~~~-~~All~~LlLRGpQT~GELR 118 (215) T PRK11239 48 MNLSE-SEVQEQLDNLVKRHYLRTVSGFGNRVTKYEQRFCNSEFGDLKLSAA-EVALITTLLLRGAQTPGELR 118 (215) T ss_pred CCCCH-HHHHHHHHHHHHCCCEEEECCCCCCCHHHEEECCCCCCCCCCCCHH-HHHHHHHHHHCCCCCHHHHH T ss_conf 46899-9999999999978886765578875455402131001233688889-99999999972888747888 No 39 >PRK06770 hypothetical protein; Provisional Probab=33.69 E-value=35 Score=14.90 Aligned_cols=48 Identities=13% Similarity=0.242 Sum_probs=19.9 Q ss_pred CCCEEEECCCCCHHHHHHHHHHCCCC--CCCCCCCCCCCHHHHHHHHHCCC Q ss_conf 57204406787589999986102433--32101111034899999862073 Q gi|254780532|r 79 KTGEYEIEKGSSMSQIAEKIMYGKVL--MHSISFPEGFTVKQMARRLKDNP 127 (325) Q Consensus 79 kaG~Y~i~~~~s~~~il~~L~~G~~~--~~~iTI~EG~t~~ei~~~L~~~~ 127 (325) |-|.+.+.+ -....+...+.+.+.. ..-+-|.|.|...|..+...++. T Consensus 89 KwG~i~mt~-e~ie~l~~ii~~sn~~~k~~~l~Il~RW~~gDFS~~vedHN 138 (185) T PRK06770 89 KWGFIEMTQ-ENIEKLKDIINSSNFVQKEELLAILERWEKGDFSKIVEDHN 138 (185) T ss_pred CCCCEECCH-HHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 325454579-99999999986235464999999999885167247889988 No 40 >PRK11627 hypothetical protein; Provisional Probab=33.59 E-value=30 Score=15.34 Aligned_cols=18 Identities=39% Similarity=0.700 Sum_probs=11.2 Q ss_pred CHHHHHHHHHHHHHHHHH Q ss_conf 957999999999999999 Q gi|254780532|r 1 MLKFLIPLITIFLLAIGV 18 (325) Q Consensus 1 m~k~li~li~i~~l~~~~ 18 (325) |||+++.+++++++++|. T Consensus 2 mkk~~~~l~a~~lL~gCa 19 (192) T PRK11627 2 LKKILFPLVALFMLAGCA 19 (192) T ss_pred HHHHHHHHHHHHHHHHHC T ss_conf 287999999999998606 No 41 >PRK02090 phosphoadenosine phosphosulfate reductase; Provisional Probab=33.06 E-value=36 Score=14.83 Aligned_cols=12 Identities=50% Similarity=0.850 Sum_probs=9.5 Q ss_pred CCCCCHHHEEEC Q ss_conf 000030112101 Q gi|254780532|r 140 EGTLCPSTYNFP 151 (325) Q Consensus 140 EG~l~PdTY~~~ 151 (325) -|++||+||.|- T Consensus 75 TG~~f~ETy~~~ 86 (243) T PRK02090 75 TGYLFPETYRFI 86 (243) T ss_pred CCCCCHHHHHHH T ss_conf 798888999999 No 42 >pfam10787 YfmQ Uncharacterized protein from bacillus cereus group. This family is conserved in the Bacillus cereus group. Several members are called YfmQ but the function is not known. Probab=32.71 E-value=37 Score=14.80 Aligned_cols=36 Identities=22% Similarity=0.437 Sum_probs=13.1 Q ss_pred CCCCCCCEEEEEECCCCCE---EECCCHHHHHHHHHHHH Q ss_conf 0878898799998389876---65299999999999999 Q gi|254780532|r 282 AKPLHTEDLYFVGDGKGGH---FFSTNFKDHTINVQKWR 317 (325) Q Consensus 282 l~p~~~dylyfv~~~~g~h---~Fs~t~~eH~~nv~~y~ 317 (325) |+|+.+.-=.+...++|+| .|.=.|++|-.-|+.|+ T Consensus 86 l~Pq~~gtPlvI~tKkGK~dv~f~iY~YddHVDVVkqyK 124 (149) T pfam10787 86 LHPENGGTPLVINTKKGKKDVTFFIYRYDDHVDVVKQYK 124 (149) T ss_pred CCCCCCCCCEEEEECCCCCEEEEEEEEECCHHHHHHHHH T ss_conf 365458987899912685107999984143889999865 No 43 >pfam01686 Adeno_Penton_B Adenovirus penton base protein. This family consists of various adenovirus penton base proteins, from both the Mastadenoviradae having mammalian hosts and the Aviadenoviradae having avian hosts. The penton base is a major structural protein forming part of the penton which consists of a base and a fibre, the pentons hold a morphologically prominent position at the vertex capsomer in the adenovirus particle. In mammalian adenovirus there is only one tail on each base where as in avian adenovirus there are two. Probab=32.67 E-value=28 Score=15.55 Aligned_cols=46 Identities=22% Similarity=0.553 Sum_probs=27.0 Q ss_pred CCCCCCCCC-CHHHHHHHHHC------------CCCCCCCCCCC----------CCCCCCCCHHHEEEC Q ss_conf 210111103-48999998620------------73111344366----------881000030112101 Q gi|254780532|r 106 HSISFPEGF-TVKQMARRLKD------------NPLLVGELPLE----------LPLEGTLCPSTYNFP 151 (325) Q Consensus 106 ~~iTI~EG~-t~~ei~~~L~~------------~~~l~~~~~~~----------~~lEG~l~PdTY~~~ 151 (325) +.+|||||- +..+++..|++ ++....++..+ .|.-|+.-|+.|.+. T Consensus 128 ~~ltiPEGNys~~~~IDLlNnaIve~YL~~GRQngV~esDIGVKFDTRnf~Lg~DPvT~LV~PG~Yt~k 196 (454) T pfam01686 128 VDLTIPEGNYSLTELIDLLNNAIVENYLAVGRQNGVLESDIGVKFDTRNFRLGFDPVTGLVTPGVYTYK 196 (454) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEHHCCEEEECCCCCCCCCCCCCCCCCCCCCCC T ss_conf 998666887418789998889999999975101485201253477324334567875362157600033 No 44 >pfam10828 DUF2570 Protein of unknown function (DUF2570). This is a family of proteins with unknown function. Probab=31.24 E-value=39 Score=14.64 Aligned_cols=24 Identities=17% Similarity=0.302 Sum_probs=16.3 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 957999999999999999999998 Q gi|254780532|r 1 MLKFLIPLITIFLLAIGVHIHVIR 24 (325) Q Consensus 1 m~k~li~li~i~~l~~~~~~~~~~ 24 (325) |.+++...++++++.++++.|+-+ T Consensus 1 m~~~i~~~l~~lilgL~gwlw~QS 24 (110) T pfam10828 1 MSKYIKATLAALILGLGGWSWYQS 24 (110) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 906899999999999999999998 No 45 >TIGR03028 EpsE polysaccharide export protein EpsE. Sequences in this family of proteins are members of a polysaccharide export protein family (pfam02563) which includes the wza protein from E.coli. This family of proteins are homologous to the EpsE protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer. Probab=30.26 E-value=40 Score=14.54 Aligned_cols=74 Identities=16% Similarity=0.241 Sum_probs=45.2 Q ss_pred CCCCHHHHHHHHH----HCCCCCCHHHHHHHHHHCCCCCC-----CCCCEEEECCCCCHHHHHHHHHHCCC-----CCCC Q ss_conf 8899999999999----87980698999999995386676-----25720440678758999998610243-----3321 Q gi|254780532|r 42 NNMSLKEISKNLF----NGGVIVNPYIFRYVTQFYFGSRG-----LKTGEYEIEKGSSMSQIAEKIMYGKV-----LMHS 107 (325) Q Consensus 42 ~G~s~~~I~~~L~----~~gvI~s~~~F~~~~~~~~~~~~-----ikaG~Y~i~~~~s~~~il~~L~~G~~-----~~~~ 107 (325) .|.|..++.+.|. +.++++++....-........-. -+||.|.+.+.+++.+++.. ..|-. .... T Consensus 43 ~G~T~~e~~~~I~~~l~~~~~~~~p~V~V~v~~~~~~~V~V~G~V~~PG~y~l~~~~tl~~al~~-AGG~~~~~~~~~~~ 121 (239) T TIGR03028 43 GGETPAAAERKIASRLSKGGFVKQPQVTINVLQYRGQQVSVLGQVNRPGRYPLETAGRVSDVLAL-AGGVTPDGADVITL 121 (239) T ss_pred CCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCEEEEEECCCCCCEEEECCCCCHHHHHHHH-CCCCCCCCCCEEEE T ss_conf 58999999999999985446534852110022125358999813346507884887629999997-27999766623799 Q ss_pred CCCCCCCCH Q ss_conf 011110348 Q gi|254780532|r 108 ISFPEGFTV 116 (325) Q Consensus 108 iTI~EG~t~ 116 (325) ....+|... T Consensus 122 ~~~r~g~~~ 130 (239) T TIGR03028 122 VREREGKIF 130 (239) T ss_pred EEEECCEEE T ss_conf 998799589 No 46 >pfam03020 LEM LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin, and Man1. Probab=30.00 E-value=21 Score=16.31 Aligned_cols=17 Identities=24% Similarity=0.421 Sum_probs=11.3 Q ss_pred HCCCCCCCCCCCCCCCH Q ss_conf 10238889998664327 Q gi|254780532|r 259 SYLMNGLPPTAISNPGR 275 (325) Q Consensus 259 TY~~~GLPP~PI~~pg~ 275 (325) .-+-.|.|||||-.-.+ T Consensus 14 ~L~~~G~~~GPI~~sTR 30 (43) T pfam03020 14 ELRKYGISPGPITGTTR 30 (43) T ss_pred HHHHCCCCCCCCCCCHH T ss_conf 99983999999663208 No 47 >pfam06809 NPDC1 Neural proliferation differentiation control-1 protein (NPDC1). This family consists of several neural proliferation differentiation control-1 (NPDC1) proteins. NPDC1 plays a role in the control of neural cell proliferation and differentiation. It has been suggested that NPDC1 may be involved in the development of several secretion glands. This family also contains the C-terminal region of the C. elegans protein CAB-1 which is known to interact with AEX-3. Probab=29.34 E-value=25 Score=15.83 Aligned_cols=31 Identities=29% Similarity=0.402 Sum_probs=22.6 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCEEE Q ss_conf 999887777652672578899999999740587024 Q gi|254780532|r 190 LVILASIVEKETSRADERAHVASVFINRFSKSIRLQ 225 (325) Q Consensus 190 vlilASIVEkEa~~~~e~~~IA~V~~NRL~~~M~Lq 225 (325) +||++..|-.-++ -.||+|+|=||.+.|+|- T Consensus 205 ~lIvvc~vaGaaA-----LivAavCWcRLQrEiRLa 235 (345) T pfam06809 205 VLILAFCLAGAAA-----LAVAALCWCRLQREIRLT 235 (345) T ss_pred HHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHH T ss_conf 2889999884899-----999999985565332466 No 48 >TIGR02707 butyr_kinase butyrate kinase; InterPro: IPR011245 This group represents bacterial butyrate kinase, an enzyme that facilitates the formation of butyryl-CoA by phosphorylating butyrate in the presence of ATP to form butyryl phosphate . The final steps in butyrate synthesis by anaerobic bacteria can occur via butyrate kinase and phosphotransbutyrylase or via butyryl-CoA:acetate CoA-transferase, the latter providing the dominant route for butyrate formation in human colonic bacteria .; GO: 0005524 ATP binding, 0047761 butyrate kinase activity, 0016310 phosphorylation, 0005737 cytoplasm. Probab=29.10 E-value=26 Score=15.72 Aligned_cols=52 Identities=15% Similarity=0.201 Sum_probs=28.6 Q ss_pred HHHHHHHHHCCCCCCHHHHHHHHHH-------------------CCCCCCCCCCEEEECCCCCHHHHHHHHHHCCC Q ss_conf 9999999987980698999999995-------------------38667625720440678758999998610243 Q gi|254780532|r 47 KEISKNLFNGGVIVNPYIFRYVTQF-------------------YFGSRGLKTGEYEIEKGSSMSQIAEKIMYGKV 103 (325) Q Consensus 47 ~~I~~~L~~~gvI~s~~~F~~~~~~-------------------~~~~~~ikaG~Y~i~~~~s~~~il~~L~~G~~ 103 (325) +.=.++|.+=.-|-+.+-||.-+-+ =|.-+-|.-|+|.++.. +++.|.++.. T Consensus 29 rH~~eEL~~f~~v~dQfeFR~~~i~~~L~e~Gi~~~~l~AvVgRGGLLkPi~GGTY~Vn~~-----MleDLk~~~~ 99 (353) T TIGR02707 29 RHSVEELAKFKNVIDQFEFRKQVILEVLEEKGINISKLDAVVGRGGLLKPIEGGTYLVNEK-----MLEDLKEGKR 99 (353) T ss_pred CCCHHHHHCCCCEEECHHHHHHHHHHHHHHCCCCHHHEEEEEECCCCCCCCCCCEEEECHH-----HHHHHHHCCC T ss_conf 2587873056640211268999999998740887124315797277023338960477655-----6888851058 No 49 >KOG3805 consensus Probab=29.00 E-value=42 Score=14.40 Aligned_cols=45 Identities=20% Similarity=0.274 Sum_probs=36.9 Q ss_pred CCCCCCCCCCCCCCHHH------------HHHHCCCC-CCCEEEEEECCCCCEEECCC Q ss_conf 23888999866432798------------99960878-89879999838987665299 Q gi|254780532|r 261 LMNGLPPTAISNPGRLS------------LEAVAKPL-HTEDLYFVGDGKGGHFFSTN 305 (325) Q Consensus 261 ~~~GLPP~PI~~pg~~a------------i~A~l~p~-~~dylyfv~~~~g~h~Fs~t 305 (325) .-.|+||+|=..+|..+ .+-..+|. -++++-||-..+|...|... T Consensus 251 q~~~~~P~~~~~nG~tg~~kk~IhLwQFLkELL~sP~~~~~~IrWVDkdKGiFkiess 308 (361) T KOG3805 251 QLDGLEPQPTLLNGKTGKGKKKIHLWQFLKELLYSPEQMGSCIRWVDKDKGIFKIESS 308 (361) T ss_pred CCCCCCCCCCCCCCCCCCCCCEEEHHHHHHHHHHCHHHHCHHEEEECCCCCEEEEECH T ss_conf 2235788765467877777640549999999985956634110732167756885007 No 50 >COG5398 Heme oxygenase [Inorganic ion transport and metabolism] Probab=28.62 E-value=7.6 Score=19.15 Aligned_cols=39 Identities=21% Similarity=0.392 Sum_probs=17.8 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHHCCCCEEEEEEEEEEE Q ss_conf 777765267257889999999974058702442221010 Q gi|254780532|r 195 SIVEKETSRADERAHVASVFINRFSKSIRLQSDSTVIYG 233 (325) Q Consensus 195 SIVEkEa~~~~e~~~IA~V~~NRL~~~M~Lq~D~Tv~Y~ 233 (325) |=.|+++..-.|-|.+|+++.-+|+..-.||-|++..|+ T Consensus 49 saleaa~~~~~d~~~l~~i~fp~lnr~~tle~dl~~yyg 87 (238) T COG5398 49 SALEAATQIHKDNPILSSIYFPELNRKATLEKDLLYYYG 87 (238) T ss_pred HHHHHHHHHHCCCCHHHHCCCHHHHHHHHHHCCHHHHHC T ss_conf 999999998223712442036103268776348787732 No 51 >TIGR00998 8a0101 efflux pump membrane protein; InterPro: IPR005694 emrA and emrB confer resistance to carbonylcyanide m-chlorophenylhydrazone, nalidixic acid, and a number of other toxic compounds. EmrB encodes a highly hydrophobic 56.2-kDa peptide, with 14 potential alpha-helices to span the inner membrane. EmrA encodes a putative 42.7-kDa peptide containing a single hydrophobic domain and a large C-terminal hydrophilic domain.; GO: 0015238 drug transporter activity, 0006855 multidrug transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=27.53 E-value=45 Score=14.24 Aligned_cols=21 Identities=5% Similarity=0.118 Sum_probs=10.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHC Q ss_conf 667659999998877776526 Q gi|254780532|r 182 HPIKSKEDLVILASIVEKETS 202 (325) Q Consensus 182 ~~~~s~~evlilASIVEkEa~ 202 (325) .+-.+-.+.|+.-+=++++-. T Consensus 202 ~G~~~An~Al~~~~~L~~qpa 222 (379) T TIGR00998 202 KGQLNANQALVRGTPLKKQPA 222 (379) T ss_pred HHHHHHHHHHHCCCCHHHHHH T ss_conf 999999874112442043341 No 52 >KOG0480 consensus Probab=26.55 E-value=47 Score=14.13 Aligned_cols=17 Identities=29% Similarity=0.303 Sum_probs=10.3 Q ss_pred CCHHHHHHHCCCCCCCE Q ss_conf 32798999608788987 Q gi|254780532|r 273 PGRLSLEAVAKPLHTED 289 (325) Q Consensus 273 pg~~ai~A~l~p~~~dy 289 (325) |-+.||.||.||-.+-| T Consensus 483 nARtSIlAAANPv~GhY 499 (764) T KOG0480 483 NARTSILAAANPVGGHY 499 (764) T ss_pred CCHHHHHHHCCCCCCCC T ss_conf 22355555327767745 No 53 >TIGR03525 GldK gliding motility-associated lipoprotein GldK. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. There is a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=26.21 E-value=44 Score=14.31 Aligned_cols=17 Identities=24% Similarity=0.266 Sum_probs=10.7 Q ss_pred CHHHHHHHHHHHHHHHH Q ss_conf 95799999999999999 Q gi|254780532|r 1 MLKFLIPLITIFLLAIG 17 (325) Q Consensus 1 m~k~li~li~i~~l~~~ 17 (325) |+|++++++++++++.| T Consensus 1 Mkk~~~~~~~~~~~~SC 17 (449) T TIGR03525 1 MKKYLVFAALVVLVYSC 17 (449) T ss_pred CCHHHHHHHHHHHHHHC T ss_conf 91248999999987210 No 54 >pfam11498 Activator_LAG-3 Transcriptional activator LAG-3. The C.elegans Notch pathway, involved in the control of growth, differentiation and patterning in animal development, relies on either of the receptors GLP-1 or LIN-12. Both these receptors promote signalling by the recruitment of LAG-3 to target promoters, where it then acts as a transcriptional activator. LAG-3 works as a ternary complex together with the DNA binding protein, LAG-1. Probab=26.11 E-value=37 Score=14.79 Aligned_cols=22 Identities=18% Similarity=0.291 Sum_probs=13.7 Q ss_pred CCCCCCCCCCCCHHHHHHHHHC Q ss_conf 3321011110348999998620 Q gi|254780532|r 104 LMHSISFPEGFTVKQMARRLKD 125 (325) Q Consensus 104 ~~~~iTI~EG~t~~ei~~~L~~ 125 (325) ....+.+|||.|...++-.+++ T Consensus 246 dennl~vpegewfdklal~vae 267 (476) T pfam11498 246 DENNLAVPEGEWFDKLALAVAE 267 (476) T ss_pred CCCCCCCCCCHHHHHHHHHHHH T ss_conf 7555578761478999999998 No 55 >PRK13116 consensus Probab=25.69 E-value=48 Score=14.03 Aligned_cols=57 Identities=12% Similarity=0.140 Sum_probs=29.6 Q ss_pred CCCCCCCCCCCCCCC-HHHHHHHCCCCCCCEEEEEECC--CCC-EE-ECCCHHHHHHHHHHH Q ss_conf 023888999866432-7989996087889879999838--987-66-529999999999999 Q gi|254780532|r 260 YLMNGLPPTAISNPG-RLSLEAVAKPLHTEDLYFVGDG--KGG-HF-FSTNFKDHTINVQKW 316 (325) Q Consensus 260 Y~~~GLPP~PI~~pg-~~ai~A~l~p~~~dylyfv~~~--~g~-h~-Fs~t~~eH~~nv~~y 316 (325) -...||.+-+++.|. .+.....+.-..+.|+|+|+.. +|. .. ....+.++.+.++++ T Consensus 143 ~~~~~i~~I~l~~ptt~~~ri~~I~~~s~GFiY~VS~~GvTG~~~~~~~~~l~~~i~~ik~~ 204 (278) T PRK13116 143 AAAAGIDPIYIAPANASEKTLEGVSAASKGYIYAISRDGVTGTERESSTDGLSAVVDNIKKF 204 (278) T ss_pred HHHCCCCEEEEECCCCCHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHC T ss_conf 98657666999379995999999997189739998635222688666678999999999845 No 56 >PRK06256 biotin synthase; Validated Probab=25.47 E-value=15 Score=17.27 Aligned_cols=48 Identities=21% Similarity=0.219 Sum_probs=32.1 Q ss_pred CCCEEEEEEEEEEECCCCCCCCCCCCCCCHHHCCCC-CCHHCC-CCCCCC Q ss_conf 587024422210101556677542000000101568-301102-388899 Q gi|254780532|r 220 KSIRLQSDSTVIYGILEGDYDLTNRKISRSDFSIKT-PYNSYL-MNGLPP 267 (325) Q Consensus 220 ~~M~Lq~D~Tv~Y~l~~~~~~~~~~~~~~~dl~~~s-pYNTY~-~~GLPP 267 (325) +.+-++..+++++|+|+...+...--...++++.+| |-|-.. ++|-|= T Consensus 196 ~~aGi~vcsG~i~GlGEt~edrve~l~~Lr~l~~~sipin~l~P~~gTpl 245 (325) T PRK06256 196 KAAGIEPCSGGIIGMGETLEDRAEHAFFLKELDADSIPINFLNPIKGTPL 245 (325) T ss_pred HHCCCCCCCCEEECCCCCHHHHHHHHHHHHCCCCCEEECCCCEECCCCCC T ss_conf 98599646643766899989999999999719998895467010699866 No 57 >pfam03867 FTZ Fushi tarazu (FTZ), N-terminal region. This region contains the important motif (LXXLL) necessary for the interaction of FTZ with the nuclear receptor FTZ-F1. FTZ is thought to represents a category of LXXLL motif-dependent co-activators for nuclear receptors. Probab=25.45 E-value=24 Score=15.97 Aligned_cols=16 Identities=31% Similarity=0.405 Sum_probs=7.1 Q ss_pred CCCCCCHHHHHHHCCC Q ss_conf 8664327989996087 Q gi|254780532|r 269 AISNPGRLSLEAVAKP 284 (325) Q Consensus 269 PI~~pg~~ai~A~l~p 284 (325) |=-+||+.+-.||-+- T Consensus 224 PPQSP~eks~s~vs~e 239 (264) T pfam03867 224 PPQSPGEKSATAVSNE 239 (264) T ss_pred CCCCCCCCCCCHHHHH T ss_conf 9998210122002222 No 58 >PHA02291 hypothetical protein Probab=25.43 E-value=49 Score=14.00 Aligned_cols=37 Identities=11% Similarity=0.164 Sum_probs=18.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEE Q ss_conf 5799999999999999999999872467777787899 Q gi|254780532|r 2 LKFLIPLITIFLLAIGVHIHVIRVYNATGPLQNDTIF 38 (325) Q Consensus 2 ~k~li~li~i~~l~~~~~~~~~~~~~~~~~~~~~~~i 38 (325) +++++.++++.+++.+++.+.+..+.......+++.+ T Consensus 4 K~~iFYiL~~~VL~~si~sY~~sS~~Y~~~A~~~~~~ 40 (131) T PHA02291 4 KASIFYILVVIVLAFSISSYYISSFMYHDKAKNEVST 40 (131) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHEEEEECCCCCCCEE T ss_conf 0103789999999999988742105651124663168 No 59 >pfam05819 NolX NolX protein. This family consists of Rhizobium NolX and Xanthomonas HrpF proteins. The interaction between the plant pathogen Xanthomonas campestris pv. vesicatoria and its host plants is controlled by hrp genes (hypersensitive reaction and pathogenicity), which encode a type III protein secretion system. Among type III-secreted proteins are avirulence proteins, effectors involved in the induction of plant defence reactions. HrpF is dispensable for protein secretion but required for AvrBs3 recognition in planta, is thought to function as a translocator of effector proteins into the host cell. NolX, a soybean cultivar specificity protein, is secreted by a type III secretion system (TTSS) and shows homology to HrpF of the plant pathogen Xanthomonas campestris pv. vesicatoria. It is not known whether NolX functions at the bacterium-plant interface or acts inside the host cell. NolX is expressed in planta only during the early stages of nodule development. Probab=25.19 E-value=34 Score=15.01 Aligned_cols=110 Identities=23% Similarity=0.278 Sum_probs=45.3 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCEEEEEEEEEEECCC--------------CCCCCCCCCCCCHHHCCCCCC Q ss_conf 988777765267257889999999974058702442221010155--------------667754200000010156830 Q gi|254780532|r 192 ILASIVEKETSRADERAHVASVFINRFSKSIRLQSDSTVIYGILE--------------GDYDLTNRKISRSDFSIKTPY 257 (325) Q Consensus 192 ilASIVEkEa~~~~e~~~IA~V~~NRL~~~M~Lq~D~Tv~Y~l~~--------------~~~~~~~~~~~~~dl~~~spY 257 (325) -||||++-|...++=|.- |. .|=+|| +.|||-. +....-.+++.++|.+. =| T Consensus 398 kLasms~D~~ldp~Vr~A-A~----------QLL~DP-lLfGLlNNaitGYKt~h~FFgGGhtVDSG~Is~~DF~~--F~ 463 (545) T pfam05819 398 KLASMSQDKSLDPAVREA-AK----------QLLSDP-LLFGLLNNAITGYKTHHGFFGGGHTVDSGNISKKDFGR--FY 463 (545) T ss_pred HHHHHHCCCCCCHHHHHH-HH----------HHHHCH-HHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH--HH T ss_conf 887641474359899999-99----------985041-88887612433443688633788745667555677999--99 Q ss_pred HH--CCCCCCCCCCCCCCC-HHHHHHHCCCCCCCEEEEEEC-------CCCCEEECCCHHHHHHHHHHHHHHH Q ss_conf 11--023888999866432-798999608788987999983-------8987665299999999999999998 Q gi|254780532|r 258 NS--YLMNGLPPTAISNPG-RLSLEAVAKPLHTEDLYFVGD-------GKGGHFFSTNFKDHTINVQKWRKMS 320 (325) Q Consensus 258 NT--Y~~~GLPP~PI~~pg-~~ai~A~l~p~~~dylyfv~~-------~~g~h~Fs~t~~eH~~nv~~y~~~~ 320 (325) +. -.++-..+-+---+. .++-.||.. -+-=.+| +++.-.|.+...|.++.-.+-..|. T Consensus 464 ~~mt~aNktv~~pktH~a~s~a~q~AvaD-----M~mG~~DqPdiK~~Kk~gG~~~~~l~~~Lk~~skvlD~~ 531 (545) T pfam05819 464 SNMTAANKTVQQPKTHAAESEAQQKAVAD-----MLMGKADQPDIKSPKKNGGAFKKGLDEFLKWYSKVLDWA 531 (545) T ss_pred HHHHHHCCCCCCCCCCCCCCHHHHHHHHH-----HHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 75321012468876677788788988887-----642767774323555575249999999988737899999 No 60 >cd01454 vWA_norD_type norD type: Denitrifying bacteria contain both membrane bound and periplasmic nitrate reductases. Denitrification plays a major role in completing the nitrogen cycle by converting nitrate or nitrite to nitrogen gas. The pathway for microbial denitrification has been established as NO3- ------ NO2- ------ NO ------- N2O --------- N2. This reaction generally occurs under oxygen limiting conditions. Genetic and biochemical studies have shown that the first srep of the biochemical pathway is catalyzed by periplasmic nitrate reductases. This family is widely present in proteobacteria and firmicutes. This version of the domain is also present in some archaeal members. The function of the vWA domain in this sub-group is not known. Members of this subgroup have a conserved MIDAS motif. Probab=25.18 E-value=49 Score=13.97 Aligned_cols=90 Identities=14% Similarity=0.165 Sum_probs=53.5 Q ss_pred EEEEEEECCCCCCCCCCCCCC-CHHHCCCCCCHH---CCCCCCCCCCCCCCCHHHHHHHCC-----CCCCCEEEEEECCC Q ss_conf 222101015566775420000-001015683011---023888999866432798999608-----78898799998389 Q gi|254780532|r 227 DSTVIYGILEGDYDLTNRKIS-RSDFSIKTPYNS---YLMNGLPPTAISNPGRLSLEAVAK-----PLHTEDLYFVGDGK 297 (325) Q Consensus 227 D~Tv~Y~l~~~~~~~~~~~~~-~~dl~~~spYNT---Y~~~GLPP~PI~~pg~~ai~A~l~-----p~~~dylyfv~~~~ 297 (325) |++.+|+........ .++. ..-...+.+|++ -...+|.|+. ++|=-.||..|.. |++...|..++|+. T Consensus 38 ~~~~v~gF~s~~~~r--~~~~~~~~k~f~e~~~~~~~~~i~~l~~~g-~Tr~G~Air~a~~~L~~~~~~rkiliviSDG~ 114 (174) T cd01454 38 VPHAILGFTTDAGGR--ERVRWIKIKDFDESLHERARKRLAALSPGG-NTRDGAAIRHAAERLLARPEKRKILLVISDGE 114 (174) T ss_pred CCEEEEECCCCCCCC--CCEEEEECCCCCCCHHHHHHHHHHCCCCCC-CCCCHHHHHHHHHHHHHCCCCCEEEEEEECCC T ss_conf 956999515788984--434789323667421145688885118789-89617999999999863976667999983899 Q ss_pred ---CCEEECC--CHHHHHHHHHHHHHH Q ss_conf ---8766529--999999999999999 Q gi|254780532|r 298 ---GGHFFST--NFKDHTINVQKWRKM 319 (325) Q Consensus 298 ---g~h~Fs~--t~~eH~~nv~~y~~~ 319 (325) ..+.-.. -.++|.+.|+++|+. T Consensus 115 P~D~~~~~~~~~~~~D~~~av~e~~~~ 141 (174) T cd01454 115 PNDLDYYEGNVFATEDALRAVIEARKL 141 (174) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHC T ss_conf 766777887553899999999999987 No 61 >TIGR02764 spore_ybaN_pdaB polysaccharide deacetylase family sporulation protein PdaB; InterPro: IPR014132 This entry describes the YbaN protein family, also called PdaB and SpoVIE, of Gram-positive bacteria. Although ybaN null mutants have only a mild sporulation defect, ybaN/ytrI double mutants show drastically reduced sporulation efficiencies. This synthetic defect suggests the role of this sigmaE-controlled gene in sporulation had been masked by functional redundancy. Members of this family are homologous to a characterised polysaccharide deacetylase; the exact function this protein family is unknown.. Probab=24.95 E-value=50 Score=13.94 Aligned_cols=91 Identities=12% Similarity=0.061 Sum_probs=67.1 Q ss_pred EEEEECCC-CCHHHHHHHHHHCCCCCCHHHHHHHHHHCCC---CCC---CCCCEEEEC---------CCCCHHHHHHHHH Q ss_conf 89998888-9999999999987980698999999995386---676---257204406---------7875899999861 Q gi|254780532|r 36 TIFLVRNN-MSLKEISKNLFNGGVIVNPYIFRYVTQFYFG---SRG---LKTGEYEIE---------KGSSMSQIAEKIM 99 (325) Q Consensus 36 ~~i~I~~G-~s~~~I~~~L~~~gvI~s~~~F~~~~~~~~~---~~~---ikaG~Y~i~---------~~~s~~~il~~L~ 99 (325) .+|.|..| +....|.+.|.+.+ |+++.+|....+...+ -+. |....|+|- .+++..+|.+.|. T Consensus 10 lTFDisWG~~~~~~IL~~L~~~~-vk~ATFFlsG~Wae~hPelvk~lveI~~~GHEIGsHgY~h~~~~~l~~E~ikkd~~ 88 (198) T TIGR02764 10 LTFDISWGNQYTEPILDTLKEED-VKNATFFLSGSWAERHPELVKELVEIVKDGHEIGSHGYRHKNYTTLEDEKIKKDLL 88 (198) T ss_pred EEECCCCCCCCCCHHHHHHHHCC-CCCCCCEEEHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 84158987443006766676558-85333024315761277899999999844843033542124212587648999999 Q ss_pred HCCCCCC--------CCCCCCCCCHHHHHHHHHCCC Q ss_conf 0243332--------101111034899999862073 Q gi|254780532|r 100 YGKVLMH--------SISFPEGFTVKQMARRLKDNP 127 (325) Q Consensus 100 ~G~~~~~--------~iTI~EG~t~~ei~~~L~~~~ 127 (325) .++.... -|..|.|.--..+.+..++.+ T Consensus 89 ~a~~~i~~~~g~~p~LlRpP~G~fn~~~~~~ae~~G 124 (198) T TIGR02764 89 RAQEIIEKLTGKKPTLLRPPNGAFNKAVLKLAESLG 124 (198) T ss_pred HHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHCC T ss_conf 988888651055501656889860468999999739 No 62 >KOG0015 consensus Probab=24.87 E-value=50 Score=13.93 Aligned_cols=23 Identities=22% Similarity=0.218 Sum_probs=12.3 Q ss_pred CCCCHHCCCCCCCCCCCCCCCHH Q ss_conf 68301102388899986643279 Q gi|254780532|r 254 KTPYNSYLMNGLPPTAISNPGRL 276 (325) Q Consensus 254 ~spYNTY~~~GLPP~PI~~pg~~ 276 (325) .++|-+=-+-+||+.|-..++.+ T Consensus 227 ~~~~qss~tm~~~ts~a~~~~~a 249 (338) T KOG0015 227 TSQYQSSSTMNLPTSPASREVGA 249 (338) T ss_pred CCCCCCCCCCCCCCCCCCCCCCC T ss_conf 54544677536898721145564 No 63 >KOG3032 consensus Probab=24.86 E-value=23 Score=16.14 Aligned_cols=15 Identities=47% Similarity=0.477 Sum_probs=9.0 Q ss_pred HHHHHHHHHHHHHCC Q ss_conf 999988777765267 Q gi|254780532|r 189 DLVILASIVEKETSR 203 (325) Q Consensus 189 evlilASIVEkEa~~ 203 (325) +..++-||||.|-.. T Consensus 179 ~~tesd~iveEeeed 193 (264) T KOG3032 179 DLTESDSIVEEEEED 193 (264) T ss_pred HHHHHHHHHHHHHHH T ss_conf 777888888888888 No 64 >pfam10531 SLBB SLBB domain. Probab=24.49 E-value=51 Score=13.89 Aligned_cols=22 Identities=27% Similarity=0.554 Sum_probs=16.5 Q ss_pred CCCEEEECCCCCHHHHHHHHHH Q ss_conf 5720440678758999998610 Q gi|254780532|r 79 KTGEYEIEKGSSMSQIAEKIMY 100 (325) Q Consensus 79 kaG~Y~i~~~~s~~~il~~L~~ 100 (325) +||.|+++.++++.+++...-- T Consensus 10 ~Pg~~~~~~G~~~~~~l~~aGG 31 (52) T pfam10531 10 RPGNYEVPIGTTLSDLIEQAGG 31 (52) T ss_pred CCEEEEECCCCCHHHHHHHHCC T ss_conf 9707998697999999998579 No 65 >PRK13124 consensus Probab=23.99 E-value=52 Score=13.83 Aligned_cols=53 Identities=13% Similarity=0.102 Sum_probs=21.1 Q ss_pred CCCCCCCCCCCCHHHHHHHCCCCCCCEEEEEECC--CCC-EEECCCHHHHHHHHHH Q ss_conf 8889998664327989996087889879999838--987-6652999999999999 Q gi|254780532|r 263 NGLPPTAISNPGRLSLEAVAKPLHTEDLYFVGDG--KGG-HFFSTNFKDHTINVQK 315 (325) Q Consensus 263 ~GLPP~PI~~pg~~ai~A~l~p~~~dylyfv~~~--~g~-h~Fs~t~~eH~~nv~~ 315 (325) .||.+=|..+|..+.-...+.-..+.|+|.|+.. +|. ..+.....+..+.+++ T Consensus 137 ~gl~~I~lvaPTs~~Ri~~i~~~s~gFiY~vs~~GvTG~~~~~~~~~~~~i~~ik~ 192 (257) T PRK13124 137 YGIYLIPLVAPTSKERIKKIAEQAEGFVYCVSSLGVTGVREEIETDLEEFIRTVKQ 192 (257) T ss_pred CCCCEEEEECCCCHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 68735788479967999999854898389962466678765560889999999986 No 66 >PRK13697 cytochrome c6; Provisional Probab=23.54 E-value=53 Score=13.77 Aligned_cols=17 Identities=29% Similarity=0.231 Sum_probs=10.5 Q ss_pred CHHHHHHHHHHHHHHHH Q ss_conf 95799999999999999 Q gi|254780532|r 1 MLKFLIPLITIFLLAIG 17 (325) Q Consensus 1 m~k~li~li~i~~l~~~ 17 (325) |+|++..+++.+++++. T Consensus 1 Mk~l~~~~l~~~~~~~~ 17 (111) T PRK13697 1 MKKILKLVLLTLLLLTF 17 (111) T ss_pred CHHHHHHHHHHHHHHHH T ss_conf 97899999999999999 No 67 >PRK13123 consensus Probab=23.33 E-value=53 Score=13.74 Aligned_cols=60 Identities=8% Similarity=0.032 Sum_probs=36.0 Q ss_pred CCHH-CCCCCCCCCCCCCCCH--HHHHHHCCCCCCCEEEEEEC--CCCC-EEECCCHHHHHHHHHHH Q ss_conf 3011-0238889998664327--98999608788987999983--8987-66529999999999999 Q gi|254780532|r 256 PYNS-YLMNGLPPTAISNPGR--LSLEAVAKPLHTEDLYFVGD--GKGG-HFFSTNFKDHTINVQKW 316 (325) Q Consensus 256 pYNT-Y~~~GLPP~PI~~pg~--~ai~A~l~p~~~dylyfv~~--~~g~-h~Fs~t~~eH~~nv~~y 316 (325) +|.- -+..||.+=|+.+|.. +=|+.... ..++|+|+|+. .+|. ..++....+..+.++++ T Consensus 133 ~~~~~~~~~gi~~I~liaPtt~~~Ri~~i~~-~a~GFiY~Vs~~GvTG~~~~~~~~~~~~i~~ik~~ 198 (256) T PRK13123 133 FIAPLLRDTDIALIPLVSLTSPIERQKEIIK-EAEGFIYAVAVNGVTGKRGNYRDDLDSHLEKLKSI 198 (256) T ss_pred HHHHHHHHCCCCEEEECCCCCCHHHHHHHHH-CCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHC T ss_conf 9999999769977864089993889999986-07884899744556676533338899999999856 No 68 >pfam05170 AsmA AsmA family. The AsmA gene, whose product is involved in the assembly of outer membrane proteins in Escherichia coli. AsmA mutations were isolated as extragenic suppressors of an OmpF assembly mutant. AsmA may have a role in LPS biogenesis. Probab=23.26 E-value=54 Score=13.74 Aligned_cols=18 Identities=28% Similarity=0.338 Sum_probs=9.9 Q ss_pred CHHHHHHHHHHHHHHHHH Q ss_conf 957999999999999999 Q gi|254780532|r 1 MLKFLIPLITIFLLAIGV 18 (325) Q Consensus 1 m~k~li~li~i~~l~~~~ 18 (325) |+|++..+++++++++++ T Consensus 1 Mk~~lk~l~~i~~~lv~~ 18 (537) T pfam05170 1 MKKALKILLIILIVLLLL 18 (537) T ss_pred CHHHHHHHHHHHHHHHHH T ss_conf 945899999999999999 No 69 >PRK13118 consensus Probab=23.17 E-value=54 Score=13.73 Aligned_cols=55 Identities=16% Similarity=0.261 Sum_probs=30.5 Q ss_pred CCCCCCCCCCCCCCH--HHHHHHCCCCCCCEEEEEECC--CCC-EEECCCHHHHHHHHHHH Q ss_conf 238889998664327--989996087889879999838--987-66529999999999999 Q gi|254780532|r 261 LMNGLPPTAISNPGR--LSLEAVAKPLHTEDLYFVGDG--KGG-HFFSTNFKDHTINVQKW 316 (325) Q Consensus 261 ~~~GLPP~PI~~pg~--~ai~A~l~p~~~dylyfv~~~--~g~-h~Fs~t~~eH~~nv~~y 316 (325) +..||.+-|+.+|.. +-|+.. .-..++|+|+|+.. +|. ..++....++.+.++++ T Consensus 144 ~~~gl~~I~lvaPtt~~~Ri~~i-~~~a~gFiY~vs~~GvTG~~~~~~~~~~~~i~~ik~~ 203 (269) T PRK13118 144 QAHGLDFIRLTSPTTSDERLPRV-LEHASGYLYYVSLAGVTGAAALDTEHVEEAVARLRRH 203 (269) T ss_pred HHCCCCHHEEECCCCCHHHHHHH-HHCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 97598464036989878999999-8437883899854566787766719899999999962 No 70 >PRK13139 consensus Probab=22.93 E-value=54 Score=13.70 Aligned_cols=58 Identities=14% Similarity=0.179 Sum_probs=38.8 Q ss_pred HHCCCCCCCCCCCCCCCH--HHHHHHCCCCCCCEEEEEECC--CCC-EEECCCHHHHHHHHHHH Q ss_conf 110238889998664327--989996087889879999838--987-66529999999999999 Q gi|254780532|r 258 NSYLMNGLPPTAISNPGR--LSLEAVAKPLHTEDLYFVGDG--KGG-HFFSTNFKDHTINVQKW 316 (325) Q Consensus 258 NTY~~~GLPP~PI~~pg~--~ai~A~l~p~~~dylyfv~~~--~g~-h~Fs~t~~eH~~nv~~y 316 (325) +.-+..||.+-|+.+|.. +=|+. +.-..+.|+|+|+.. +|. ..+....+++.+.++++ T Consensus 139 ~~~~~~gl~~I~lvaPtt~~~Ri~~-i~~~a~gFiY~vs~~GvTG~~~~~~~~~~~~i~~ik~~ 201 (254) T PRK13139 139 AQCRAKGMAPIGIYAPTSTDERMGK-IAAAADGFIYCVARRGVTGSKTSFDEHVGAFLHRCRAA 201 (254) T ss_pred HHHHHCCCCEEEEECCCCCHHHHHH-HHHCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHC T ss_conf 9998469757999458999899999-98516986999966666798866458899999999855 No 71 >cd00118 LysM Lysin domain, found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. Probab=22.62 E-value=55 Score=13.66 Aligned_cols=40 Identities=13% Similarity=0.162 Sum_probs=24.6 Q ss_pred EEEECCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCC--CCCCCCCEEE Q ss_conf 99988889999999999987980698999999995386--6762572044 Q gi|254780532|r 37 IFLVRNNMSLKEISKNLFNGGVIVNPYIFRYVTQFYFG--SRGLKTGEYE 84 (325) Q Consensus 37 ~i~I~~G~s~~~I~~~L~~~gvI~s~~~F~~~~~~~~~--~~~ikaG~Y~ 84 (325) +.+|.+|+|+..|+++-. + + ...+.++++. ...|++|.-. T Consensus 2 ~y~V~~GDTl~~Ia~~~~---v--~---~~~l~~~N~~~~~~~l~~Gq~l 43 (46) T cd00118 2 TYTVKKGDTLSSIAQRYG---I--S---VEELLKLNGLSDPDNLQVGQKL 43 (46) T ss_pred EEEECCCCCHHHHHHHHC---C--C---HHHHHHHCCCCCCCCCCCCCEE T ss_conf 479898999999999989---6--7---9999986699997716769999 No 72 >cd01775 CYR1_RA CYR1 is a fungal adenylate cyclase with at least four domains, an N-terminal RA (Ras association) domain, a middle leucine-rich repeat domain, a catalytic domain. The N-terminal RA domain of CYR1 post-translationally modifies a small GTPase called Ras. The Ras-CYR1 pathway has been implicated in the transduction of a glucose-triggered signal to an intracellular environment where a protein phosphorylation cascade is initiated by cyclic AMP. Probab=22.49 E-value=18 Score=16.78 Aligned_cols=48 Identities=15% Similarity=-0.036 Sum_probs=32.8 Q ss_pred CCEEEEECCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCC Q ss_conf 878999888899999999999879806989999999953866762572 Q gi|254780532|r 34 NDTIFLVRNNMSLKEISKNLFNGGVIVNPYIFRYVTQFYFGSRGLKTG 81 (325) Q Consensus 34 ~~~~i~I~~G~s~~~I~~~L~~~gvI~s~~~F~~~~~~~~~~~~ikaG 81 (325) .-+++..+.++++.++...|.++-.+.+..-+.++.+..+..+-+.++ T Consensus 13 Tf~tls~~l~tTv~eli~~l~rK~~l~~~~ny~l~l~~~~l~RvL~p~ 60 (97) T cd01775 13 TFTTLSCPLNTTVSELIPQLAKKFYLPSGGNYQLSLKKHDLSRVLRPT 60 (97) T ss_pred CEEEEECCCCCCHHHHHHHHHHHHCCCCCCCEEEEEEECCEEEECCCC T ss_conf 489997387687999999998873488899728999979923665886 No 73 >PRK13114 consensus Probab=22.18 E-value=56 Score=13.60 Aligned_cols=55 Identities=18% Similarity=0.242 Sum_probs=32.6 Q ss_pred CCCCCCCCCCCCCCHH--HHHHHCCCCCCCEEEEEECC--CCC-EEECCCHHHHHHHHHHH Q ss_conf 2388899986643279--89996087889879999838--987-66529999999999999 Q gi|254780532|r 261 LMNGLPPTAISNPGRL--SLEAVAKPLHTEDLYFVGDG--KGG-HFFSTNFKDHTINVQKW 316 (325) Q Consensus 261 ~~~GLPP~PI~~pg~~--ai~A~l~p~~~dylyfv~~~--~g~-h~Fs~t~~eH~~nv~~y 316 (325) +..||.+=|+.+|... -|+.... ..++|+|+|+.. +|. ..+..+..++.+.++++ T Consensus 140 ~~~gi~~I~liaPtt~~~Ri~~i~~-~a~gFiY~vs~~GvTG~~~~~~~~~~~~i~~ik~~ 199 (266) T PRK13114 140 RAAGIDPIRLATPTTDAARLPAVLE-GASGFLYYVSVAGITGMQQAAQASIEAAVARIKAA 199 (266) T ss_pred HHCCCCEEEEECCCCCHHHHHHHHH-HCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 9749972677569997999999997-38995899844556677656658899999999970 No 74 >cd01777 SNX27_RA SNX27_RA SNX27 (sorting nexin protein 27) belongs to a large family of endosome-localized proteins related to sorting nexin1 which is implicated in regulating membrane traffic. The domain architecture of SNX27 includes an amino-terminal PDZ domain, a PX (PhoX homologous) domain, and a carboxy-terminal RA (RAS-associated) domain. Probab=22.01 E-value=57 Score=13.58 Aligned_cols=54 Identities=15% Similarity=0.047 Sum_probs=32.3 Q ss_pred CCCCCCEEEEECCCCCHHHHHHHHHHCCCCCCHHHHHHHH---HHCCCCCCCCCCEE Q ss_conf 7777878999888899999999999879806989999999---95386676257204 Q gi|254780532|r 30 GPLQNDTIFLVRNNMSLKEISKNLFNGGVIVNPYIFRYVT---QFYFGSRGLKTGEY 83 (325) Q Consensus 30 ~~~~~~~~i~I~~G~s~~~I~~~L~~~gvI~s~~~F~~~~---~~~~~~~~ikaG~Y 83 (325) -|..+.++|.|.+..+..++.+.+.++-=+.+...-.++. .-.+..+++++-+| T Consensus 8 LPD~~~vtV~vrk~~t~~~Vy~a~~~k~gm~~~ta~yF~LFEi~e~~FeRKL~p~E~ 64 (87) T cd01777 8 LPDKATVTVRVRKNATTDQVYQALVAKAGMDSYTQNYFALFEVINHSFVRKLAPNEF 64 (87) T ss_pred CCCCCEEEEEEECCCCHHHHHHHHHHHHCCCHHHHHHEEEEEEECCCEEEECCCCCC T ss_conf 699988999998357799999999999688876721323566643772671378668 No 75 >TIGR03174 cas_Csc3 CRISPR-associated protein Csc3. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) is a widespread family of prokaryotic direct repeats with spacers of unique sequence between consecutive repeats. This protein family is a CRISPR-associated (Cas) family strictly associated with the Cyano subtype of CRISPR/Cas locus, found in several species of Cyanobacteria and several archaeal species. This family is designated Csc3 for CRISPR/Cas Subtype Cyano protein 3, as it is often the third gene upstream of the core cas genes, cas3-cas4-cas1-cas2. Probab=21.97 E-value=57 Score=13.57 Aligned_cols=19 Identities=21% Similarity=0.272 Sum_probs=10.3 Q ss_pred CCHHCCCCCCCCCCCCCCC Q ss_conf 3011023888999866432 Q gi|254780532|r 256 PYNSYLMNGLPPTAISNPG 274 (325) Q Consensus 256 pYNTY~~~GLPP~PI~~pg 274 (325) -|++|+.+|=-+.|||+-| T Consensus 486 ~y~~~~k~~r~kq~iCSLs 504 (953) T TIGR03174 486 NYNAYAKTKRGKQPICSLS 504 (953) T ss_pred HHHHHHHCCCCCCCEECCC T ss_conf 8888862577777421066 No 76 >pfam07219 HemY_N HemY protein N-terminus. This family represents the N-terminus (approximately 150 residues) of bacterial HemY porphyrin biosynthesis proteins. This is a membrane protein involved in a late step of protoheme IX synthesis. Probab=21.69 E-value=58 Score=13.54 Aligned_cols=22 Identities=27% Similarity=0.372 Sum_probs=13.1 Q ss_pred CHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9579999999999999999999 Q gi|254780532|r 1 MLKFLIPLITIFLLAIGVHIHV 22 (325) Q Consensus 1 m~k~li~li~i~~l~~~~~~~~ 22 (325) |+|+++.+++++++..++..|. T Consensus 1 M~R~l~~~~~~l~~~~~~~~~l 22 (134) T pfam07219 1 MLRVLLFLFLVLLLGLFGGAWL 22 (134) T ss_pred CHHHHHHHHHHHHHHHHHHHHH T ss_conf 9789999999999999999999 No 77 >TIGR02057 PAPS_reductase phosphoadenosine phosphosulfate reductase; InterPro: IPR011800 Requiring thioredoxin as an electron donor, phosphoadenosine phosphosulphate reductase catalyzes the reduction of 3'-phosphoadenylylsulphate (PAPS) to sulphite and phospho-adenosine-phosphate (PAP). Found in enterobacteria, cyanobacteria, and yeast, PAPS reductase is related to a group of plant (IPR004508 from INTERPRO) and bacterial (IPR011798 from INTERPRO) enzymes preferring 5'-adenylylsulphate (APS) over PAPS as a substrate for reduction to sulphite.; GO: 0004604 phosphoadenylyl-sulfate reductase (thioredoxin) activity, 0019344 cysteine biosynthetic process. Probab=21.57 E-value=21 Score=16.33 Aligned_cols=13 Identities=23% Similarity=0.110 Sum_probs=8.9 Q ss_pred CCCCCHHHEEECC Q ss_conf 0000301121014 Q gi|254780532|r 140 EGTLCPSTYNFPL 152 (325) Q Consensus 140 EG~l~PdTY~~~~ 152 (325) =||+||+||.|.. T Consensus 65 Tly~F~~Tl~l~~ 77 (239) T TIGR02057 65 TLYLFPQTLELVD 77 (239) T ss_pred CCCCCHHHHHHHH T ss_conf 1136278999999 No 78 >pfam12234 Rav1p_C RAVE protein 1 C terminal. This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits. Probab=21.24 E-value=59 Score=13.48 Aligned_cols=48 Identities=17% Similarity=0.119 Sum_probs=30.1 Q ss_pred HHCCCCCCHHHHHHHHHHHHHH--------HCCHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 3106676599999988777765--------26725788999999997405870244 Q gi|254780532|r 179 DVDHPIKSKEDLVILASIVEKE--------TSRADERAHVASVFINRFSKSIRLQS 226 (325) Q Consensus 179 ~~~~~~~s~~evlilASIVEkE--------a~~~~e~~~IA~V~~NRL~~~M~Lq~ 226 (325) ...+|..|.+|-++|++|||.= +-.+.-..-.-++-..-|++....|+ T Consensus 280 ~~~LP~LTr~eQ~~L~~iie~v~~v~~~~~slD~nG~RflL~~rl~~l~r~~~~~~ 335 (629) T pfam12234 280 KIALPYLTRHQQITLMDIIECVAEVEKNRRSVDDNGLRFLLGYRQHFSHKSTAEQS 335 (629) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCHHHC T ss_conf 27877988888999999999999872454344700428999998888750603312 No 79 >PRK00031 lolA outer-membrane lipoprotein carrier protein; Reviewed Probab=21.13 E-value=59 Score=13.47 Aligned_cols=16 Identities=31% Similarity=0.314 Sum_probs=10.7 Q ss_pred CHHHHHHHHHHHHHHH Q ss_conf 9579999999999999 Q gi|254780532|r 1 MLKFLIPLITIFLLAI 16 (325) Q Consensus 1 m~k~li~li~i~~l~~ 16 (325) |+|++++++++++++. T Consensus 1 mk~~~~~~~~~~~l~~ 16 (201) T PRK00031 1 MKKLLIAALLAAALVA 16 (201) T ss_pred CCHHHHHHHHHHHHHH T ss_conf 9159999999999877 No 80 >PRK13112 consensus Probab=20.65 E-value=60 Score=13.40 Aligned_cols=57 Identities=12% Similarity=0.174 Sum_probs=35.2 Q ss_pred HCCCCCCCCCCCCCCCHH--HHHHHCCCCCCCEEEEEECC--CCC-EEECCCHHHHHHHHHHH Q ss_conf 102388899986643279--89996087889879999838--987-66529999999999999 Q gi|254780532|r 259 SYLMNGLPPTAISNPGRL--SLEAVAKPLHTEDLYFVGDG--KGG-HFFSTNFKDHTINVQKW 316 (325) Q Consensus 259 TY~~~GLPP~PI~~pg~~--ai~A~l~p~~~dylyfv~~~--~g~-h~Fs~t~~eH~~nv~~y 316 (325) .-+..||.+=|+++|... -|+.... ..++|+|+|+.. +|. ..+.....++.+.++++ T Consensus 143 ~~~~~~i~~I~lvaPtt~~eRi~~i~~-~s~GFiY~Vs~~GvTG~~~~~~~~~~~~i~~ik~~ 204 (279) T PRK13112 143 PAMKAGINFIRLATPTTDDKRLPKVLA-NTSGFVYYVSMTGITGSALADTSAVGEAVARIKRH 204 (279) T ss_pred HHHHCCCCEEEEECCCCCHHHHHHHHH-CCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 998578346998258998999999985-27880899835666676645648899999999971 No 81 >COG4814 Uncharacterized protein with an alpha/beta hydrolase fold [General function prediction only] Probab=20.58 E-value=35 Score=14.92 Aligned_cols=55 Identities=7% Similarity=0.035 Sum_probs=34.2 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCC Q ss_conf 9999999987556554531066765999999887777652672578899999999740587 Q gi|254780532|r 162 QAMLKQKQVVDEVWEIRDVDHPIKSKEDLVILASIVEKETSRADERAHVASVFINRFSKSI 222 (325) Q Consensus 162 ~m~~~~~~~~~~~~~~~~~~~~~~s~~evlilASIVEkEa~~~~e~~~IA~V~~NRL~~~M 222 (325) ++...+.+++... ....+ .--|+|.||-=.+-|...+-.-|..++.-..||-.+- T Consensus 197 ~~~t~y~~y~~~n----~k~v~--~~~evl~IaGDl~dg~~tDG~Vp~assls~~~lf~~~ 251 (288) T COG4814 197 LIKTPYYDYIAKN----YKKVS--PNTEVLLIAGDLDDGKQTDGAVPWASSLSIYHLFKKN 251 (288) T ss_pred CCCCHHHHHHHHC----CEECC--CCCEEEEEECCCCCCCCCCCCEECHHHHHHHHHHCCC T ss_conf 3376789999843----63579--9847999741346688678721123767899986658 No 82 >PRK13137 consensus Probab=20.43 E-value=61 Score=13.37 Aligned_cols=58 Identities=19% Similarity=0.095 Sum_probs=34.6 Q ss_pred CHHCCCCCCCCCCCCCCCH--HHHHHHCCCCCCCEEEEEEC-C-CCCE-EEC-CCHHHHHHHHHH Q ss_conf 0110238889998664327--98999608788987999983-8-9876-652-999999999999 Q gi|254780532|r 257 YNSYLMNGLPPTAISNPGR--LSLEAVAKPLHTEDLYFVGD-G-KGGH-FFS-TNFKDHTINVQK 315 (325) Q Consensus 257 YNTY~~~GLPP~PI~~pg~--~ai~A~l~p~~~dylyfv~~-~-~g~h-~Fs-~t~~eH~~nv~~ 315 (325) ...-+..||.+-|+.+|.. +=|+. +.-..++|+|+|+. + +|.. .+. ...+++.+.+++ T Consensus 146 ~~~~~~~gi~~I~lvaPtT~~eRi~~-i~~~a~GFiY~Vs~~GvTG~r~~~~~~~l~~~i~~ik~ 209 (266) T PRK13137 146 ADLAAEIGLAVTFLIAPTSTPERVKL-VARACTGFLYAVSVTGVTGMREGLALGEVPDMLRLARQ 209 (266) T ss_pred HHHHHHCCCCEEEEECCCCCHHHHHH-HHHHCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 99998759978999379999999999-99608882899744676677766787999999999986 No 83 >KOG2500 consensus Probab=20.21 E-value=62 Score=13.34 Aligned_cols=42 Identities=19% Similarity=0.321 Sum_probs=21.7 Q ss_pred CCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHH Q ss_conf 7625720440678758999998610243332101111034899999 Q gi|254780532|r 76 RGLKTGEYEIEKGSSMSQIAEKIMYGKVLMHSISFPEGFTVKQMAR 121 (325) Q Consensus 76 ~~ikaG~Y~i~~~~s~~~il~~L~~G~~~~~~iTI~EG~t~~ei~~ 121 (325) ..-+||+..++. .++.--+.....|+....++ |-.|.-++.. T Consensus 28 ~gyra~eW~l~~-~~WtGrlrvvakg~~~~ikL---eD~tsg~LfA 69 (253) T KOG2500 28 RGYRAGEWNLDK-PAWTGRLRVVAKGERCEIKL---EDKTSGELFA 69 (253) T ss_pred CCCCCCCCCCCC-CCCCCEEEEEECCCEEEEEE---CCCCCHHHHH T ss_conf 771343122578-54134168987173799995---1487331554 Done!