Query gi|254780536|ref|YP_003064949.1| acyl-carrier-protein S-malonyltransferase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 314 No_of_seqs 136 out of 6775 Neff 7.8 Searched_HMMs 39220 Date Sun May 29 23:03:34 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780536.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR00128 fabD malonyl CoA-acy 100.0 0 0 603.4 17.0 292 1-293 1-294 (295) 2 TIGR03131 malonate_mdcH malona 100.0 0 0 553.0 21.6 295 3-310 1-295 (295) 3 COG0331 FabD (acyl-carrier-pro 100.0 0 0 533.4 22.5 308 1-310 2-309 (310) 4 pfam00698 Acyl_transf_1 Acyl t 100.0 0 0 522.8 21.5 298 4-312 1-309 (319) 5 smart00827 PKS_AT Acyl transfe 100.0 0 0 517.9 20.3 279 7-295 1-287 (298) 6 KOG2926 consensus 100.0 0 0 421.8 16.1 292 3-295 64-364 (386) 7 TIGR02813 omega_3_PfaA polyket 100.0 0 0 408.4 16.4 305 2-311 590-916 (2773) 8 COG3321 Polyketide synthase mo 100.0 0 0 374.5 14.8 284 2-294 526-813 (1061) 9 KOG1202 consensus 100.0 5.5E-44 0 286.1 17.5 295 3-312 501-809 (2376) 10 TIGR02816 pfaB_fam PfaB family 99.9 9.1E-24 2.3E-28 163.7 13.8 287 3-312 183-520 (538) 11 cd07225 Pat_PNPLA6_PNPLA7 Pata 90.5 0.33 8.3E-06 27.2 3.5 47 66-117 27-74 (306) 12 PRK10279 hypothetical protein; 90.3 0.19 4.8E-06 28.6 2.2 38 67-108 18-56 (300) 13 cd07227 Pat_Fungal_NTE1 Fungal 87.5 0.54 1.4E-05 25.9 2.9 41 67-112 23-64 (269) 14 cd07210 Pat_hypo_W_succinogene 85.3 0.79 2E-05 24.9 2.8 46 66-116 12-58 (221) 15 TIGR00755 ksgA dimethyladenosi 84.6 2.1 5.3E-05 22.3 4.7 13 91-103 36-48 (277) 16 COG1752 RssA Predicted esteras 81.8 1.4 3.6E-05 23.4 2.9 39 66-108 23-62 (306) 17 cd07207 Pat_ExoU_VipD_like Exo 81.7 1.2 3.1E-05 23.7 2.6 45 67-115 12-56 (194) 18 PRK11126 acyl-CoA thioester hy 79.3 0.9 2.3E-05 24.5 1.2 29 72-104 56-84 (242) 19 TIGR01286 nifK nitrogenase mol 78.0 3 7.6E-05 21.4 3.6 55 69-138 94-149 (526) 20 cd07204 Pat_PNPLA_like Patatin 76.9 2.8 7E-05 21.6 3.2 54 65-118 10-63 (243) 21 COG4188 Predicted dienelactone 76.9 2.1 5.4E-05 22.3 2.6 25 84-108 157-181 (365) 22 cd07209 Pat_hypo_Ecoli_Z1214_l 75.5 1.7 4.5E-05 22.8 1.8 36 66-105 10-46 (215) 23 cd07198 Patatin Patatin-like p 75.4 3.8 9.7E-05 20.7 3.5 40 67-111 11-51 (172) 24 PRK10673 hypothetical protein; 74.6 1.7 4.3E-05 22.9 1.6 30 72-105 71-100 (255) 25 TIGR03056 bchO_mg_che_rel puta 73.5 1.8 4.6E-05 22.7 1.5 30 72-105 85-114 (278) 26 PRK00274 ksgA dimethyladenosin 71.5 6.6 0.00017 19.3 4.0 12 90-101 45-56 (267) 27 pfam07082 DUF1350 Protein of u 69.8 5.5 0.00014 19.8 3.3 74 22-99 28-103 (250) 28 cd07206 Pat_TGL3-4-5_SDP1 Tria 69.2 6 0.00015 19.6 3.4 45 67-115 82-134 (298) 29 COG3208 GrsT Predicted thioest 67.8 2.4 6.2E-05 21.9 1.2 22 85-114 73-94 (244) 30 TIGR03611 RutD pyrimidine util 67.7 3.1 7.8E-05 21.3 1.6 29 72-104 70-98 (257) 31 cd07228 Pat_NTE_like_bacteria 67.5 5.5 0.00014 19.8 2.9 36 67-106 13-49 (175) 32 cd04436 DEP_fRgd2 DEP (Disheve 66.7 8.9 0.00023 18.5 3.8 59 210-270 2-62 (84) 33 cd07205 Pat_PNPLA6_PNPLA7_NTE1 66.1 6.4 0.00016 19.4 3.0 46 67-117 13-59 (175) 34 COG0030 KsgA Dimethyladenosine 64.8 11 0.00028 18.0 4.1 11 91-101 37-47 (259) 35 cd07224 Pat_like Patatin-like 64.7 11 0.00027 18.1 3.9 54 65-120 10-63 (233) 36 cd07213 Pat17_PNPLA8_PNPLA9_li 61.0 10 0.00026 18.1 3.3 48 72-119 20-67 (288) 37 cd07231 Pat_SDP1-like Sugar-De 60.7 10 0.00026 18.1 3.2 33 68-104 82-114 (323) 38 pfam04672 DUF574 Protein of un 60.3 13 0.00034 17.5 3.8 17 211-227 220-236 (268) 39 TIGR03343 biphenyl_bphD 2-hydr 59.8 4.8 0.00012 20.1 1.4 29 72-104 91-119 (282) 40 PRK13777 transcriptional regul 59.2 13 0.00033 17.6 3.5 20 91-113 59-78 (190) 41 PRK00870 haloalkane dehalogena 58.4 5.3 0.00013 19.9 1.4 29 72-104 105-133 (302) 42 pfam09968 DUF2202 Uncharacteri 58.3 7.4 0.00019 19.0 2.2 126 22-190 9-143 (162) 43 pfam03246 Pneumo_ncap Pneumovi 57.9 7.7 0.0002 18.9 2.2 24 97-120 64-87 (391) 44 PRK03592 haloalkane dehalogena 57.4 6.2 0.00016 19.5 1.7 29 72-104 83-111 (294) 45 pfam01734 Patatin Patatin-like 57.4 13 0.00033 17.5 3.3 45 67-115 11-55 (189) 46 pfam00561 Abhydrolase_1 alpha/ 56.9 6.2 0.00016 19.5 1.6 31 71-105 30-60 (225) 47 cd07208 Pat_hypo_Ecoli_yjju_li 55.9 7.7 0.0002 18.9 1.9 38 67-107 11-49 (266) 48 PRK03204 haloalkane dehalogena 54.8 6.7 0.00017 19.3 1.5 29 72-104 91-119 (286) 49 PRK03609 umuC DNA polymerase V 54.8 16 0.00042 16.9 7.1 15 259-273 373-388 (422) 50 pfam00551 Formyl_trans_N Formy 52.4 9.2 0.00024 18.4 1.9 18 1-18 1-18 (181) 51 pfam00975 Thioesterase Thioest 50.1 7.7 0.0002 18.9 1.2 18 86-103 65-82 (224) 52 PRK10439 enterobactin/ferric e 49.3 18 0.00045 16.7 2.9 13 269-281 331-343 (398) 53 TIGR02821 fghA_ester_D S-formy 49.0 11 0.00028 18.0 1.8 29 76-104 129-157 (279) 54 pfam00326 Peptidase_S9 Prolyl 48.5 21 0.00052 16.3 4.4 31 73-106 52-84 (212) 55 TIGR02082 metH methionine synt 48.3 7.6 0.00019 18.9 0.9 256 25-314 137-444 (1265) 56 PRK05294 carB carbamoyl phosph 46.9 4.9 0.00013 20.1 -0.2 23 74-96 697-719 (1063) 57 pfam09752 DUF2048 Uncharacteri 43.8 24 0.00062 15.9 4.7 39 65-107 156-194 (337) 58 COG4370 Uncharacterized protei 43.0 22 0.00055 16.2 2.6 11 1-11 7-17 (412) 59 PRK10566 esterase; Provisional 42.4 15 0.00038 17.2 1.7 31 72-103 94-124 (249) 60 TIGR01821 5aminolev_synth 5-am 42.4 26 0.00065 15.7 4.9 23 166-188 351-375 (427) 61 PRK10349 carboxylesterase BioH 42.3 13 0.00032 17.6 1.3 19 86-104 74-92 (256) 62 COG1054 Predicted sulfurtransf 40.5 27 0.0007 15.6 2.8 15 171-185 50-64 (308) 63 cd00381 IMPDH IMPDH: The catal 39.6 28 0.00072 15.5 5.0 24 166-190 134-157 (325) 64 TIGR02653 Lon_rel_chp conserve 39.1 18 0.00045 16.7 1.6 47 235-282 174-229 (677) 65 cd07220 Pat_PNPLA2 Patatin-lik 39.0 29 0.00073 15.4 2.7 54 65-118 15-68 (249) 66 TIGR01009 rpsC_bact ribosomal 38.9 27 0.0007 15.6 2.6 79 158-269 68-146 (217) 67 PRK08349 hypothetical protein; 37.5 29 0.00074 15.4 2.5 84 88-187 108-194 (198) 68 KOG2968 consensus 37.0 25 0.00064 15.8 2.1 34 67-104 852-886 (1158) 69 cd02430 PTH2 Peptidyl-tRNA hyd 35.8 29 0.00073 15.5 2.3 61 91-155 11-72 (115) 70 COG1565 Uncharacterized conser 35.0 34 0.00086 15.0 3.7 23 252-274 279-301 (370) 71 pfam06057 VirJ Bacterial virul 34.8 34 0.00086 15.0 3.5 75 2-97 3-79 (192) 72 COG4012 Uncharacterized protei 34.7 24 0.0006 15.9 1.7 67 202-282 255-321 (342) 73 TIGR02472 sucr_P_syn_N sucrose 34.6 26 0.00066 15.7 1.9 93 167-259 287-416 (445) 74 pfam03447 NAD_binding_3 Homose 34.2 35 0.00088 14.9 3.5 42 260-301 47-89 (116) 75 pfam06821 DUF1234 Alpha/Beta h 34.2 11 0.00029 17.9 -0.0 18 86-103 55-72 (171) 76 KOG1454 consensus 34.1 16 0.00041 16.9 0.8 21 86-106 128-148 (326) 77 PRK00107 gidB glucose-inhibite 33.9 35 0.00089 14.9 3.7 17 169-185 139-155 (216) 78 TIGR01050 rpsS_bact ribosomal 33.6 14 0.00035 17.3 0.4 36 62-100 41-76 (92) 79 TIGR02707 butyr_kinase butyrat 32.7 23 0.00059 16.0 1.4 33 97-141 221-253 (353) 80 PRK08609 hypothetical protein; 32.3 37 0.00095 14.8 5.4 68 200-281 434-503 (570) 81 cd07217 Pat17_PNPLA8_PNPLA9_li 32.3 27 0.00069 15.6 1.7 38 86-123 41-78 (344) 82 COG2819 Predicted hydrolase of 31.5 25 0.00063 15.8 1.3 22 78-99 129-150 (264) 83 CHL00050 rps19 ribosomal prote 31.4 15 0.00039 17.1 0.3 13 86-99 63-75 (92) 84 COG1506 DAP2 Dipeptidyl aminop 30.6 40 0.001 14.6 4.3 143 71-219 459-611 (620) 85 cd07221 Pat_PNPLA3 Patatin-lik 30.3 40 0.001 14.6 6.1 54 65-118 11-64 (252) 86 COG3146 Uncharacterized protei 30.0 8.9 0.00023 18.5 -1.1 24 257-281 308-331 (387) 87 TIGR02201 heptsyl_trn_III lipo 29.9 41 0.001 14.5 3.3 43 257-299 203-245 (347) 88 pfam01981 PTH2 Peptidyl-tRNA h 29.8 41 0.001 14.5 2.3 62 91-155 12-73 (116) 89 KOG2377 consensus 29.6 27 0.00069 15.6 1.3 48 86-137 344-391 (657) 90 pfam00282 Pyridoxal_deC Pyrido 29.5 41 0.0011 14.5 2.8 72 143-222 178-250 (370) 91 TIGR01048 lysA diaminopimelate 29.0 34 0.00087 15.0 1.7 32 68-106 64-95 (431) 92 COG0596 MhpC Predicted hydrola 28.2 29 0.00073 15.5 1.2 29 72-104 78-106 (282) 93 pfam00756 Esterase Putative es 28.2 18 0.00045 16.7 0.1 16 89-104 114-129 (243) 94 pfam12000 DUF3495 Domain of un 28.2 32 0.00083 15.1 1.5 22 72-97 56-78 (172) 95 cd06450 DOPA_deC_like DOPA dec 27.8 44 0.0011 14.3 2.6 75 141-222 131-205 (345) 96 PRK01810 DNA polymerase IV; Va 27.6 45 0.0011 14.3 6.8 26 74-105 132-157 (410) 97 pfam05057 DUF676 Putative seri 27.6 13 0.00032 17.6 -0.7 22 86-108 76-97 (212) 98 pfam10230 DUF2305 Uncharacteri 27.5 16 0.00042 16.9 -0.1 16 87-102 80-95 (260) 99 KOG0820 consensus 27.4 45 0.0012 14.2 3.7 22 75-99 50-73 (315) 100 PRK13942 protein-L-isoaspartat 27.2 46 0.0012 14.2 6.7 37 60-103 59-95 (214) 101 KOG3101 consensus 27.1 23 0.00057 16.1 0.5 16 89-104 144-159 (283) 102 KOG2382 consensus 26.7 42 0.0011 14.4 1.9 12 86-97 123-134 (315) 103 cd04174 Rnd1_Rho6 Rnd1/Rho6 su 26.5 47 0.0012 14.1 5.2 61 220-280 110-170 (232) 104 TIGR01025 rpsS_arch ribosomal 26.3 19 0.00049 16.5 0.1 15 84-99 97-111 (136) 105 cd02407 PTH2_family Peptidyl-t 25.7 45 0.0012 14.2 1.9 17 91-107 11-27 (115) 106 PRK08775 homoserine O-acetyltr 25.5 40 0.001 14.6 1.6 29 71-102 126-154 (343) 107 KOG2564 consensus 25.0 39 0.001 14.6 1.5 21 86-106 146-166 (343) 108 pfam11288 DUF3089 Protein of u 25.0 36 0.00092 14.8 1.3 15 84-98 93-107 (201) 109 KOG1579 consensus 25.0 50 0.0013 14.0 8.3 166 89-274 115-294 (317) 110 TIGR02688 TIGR02688 conserved 24.9 47 0.0012 14.1 1.8 52 251-310 389-452 (470) 111 KOG3847 consensus 24.6 28 0.00071 15.5 0.6 24 86-109 118-141 (399) 112 pfam01764 Lipase_3 Lipase (cla 24.4 22 0.00056 16.1 0.1 13 89-101 66-78 (141) 113 pfam11015 DUF2853 Protein of u 24.4 51 0.0013 13.9 3.0 41 140-186 14-54 (102) 114 PRK09240 thiH thiamine biosynt 24.4 51 0.0013 13.9 3.2 13 296-308 336-348 (371) 115 TIGR02812 fadR_gamma fatty aci 24.2 30 0.00076 15.3 0.7 53 48-121 63-115 (275) 116 cd07218 Pat_iPLA2 Calcium-inde 24.1 52 0.0013 13.9 6.0 52 65-118 11-62 (245) 117 KOG4409 consensus 24.0 29 0.00075 15.4 0.6 66 246-311 275-359 (365) 118 pfam01135 PCMT Protein-L-isoas 24.0 52 0.0013 13.9 5.1 37 60-103 56-92 (205) 119 COG0357 GidB Predicted S-adeno 23.6 53 0.0014 13.8 2.1 50 169-222 138-187 (215) 120 COG0627 Predicted esterase [Ge 23.3 24 0.00062 15.9 0.1 17 88-104 154-170 (316) 121 PRK00357 rpsS 30S ribosomal pr 23.2 26 0.00067 15.7 0.3 14 86-100 63-76 (92) 122 TIGR02418 acolac_catab acetola 22.8 39 0.00098 14.7 1.1 108 196-311 388-527 (553) 123 KOG3975 consensus 22.8 16 0.00042 16.9 -0.8 24 4-27 32-58 (301) 124 COG0185 RpsS Ribosomal protein 22.3 27 0.00069 15.6 0.2 16 84-100 61-76 (93) 125 pfam03096 Ndr Ndr family. This 22.0 44 0.0011 14.3 1.2 30 71-104 88-117 (285) 126 cd00741 Lipase Lipase. Lipase 21.6 26 0.00066 15.7 -0.0 12 89-100 31-42 (153) 127 COG4757 Predicted alpha/beta h 21.5 51 0.0013 13.9 1.5 12 86-97 105-116 (281) 128 TIGR02858 spore_III_AA stage I 21.1 9.5 0.00024 18.3 -2.3 24 119-143 185-208 (282) 129 cd07222 Pat_PNPLA4 Patatin-lik 20.9 60 0.0015 13.5 2.9 50 65-114 10-60 (246) 130 TIGR01967 DEAH_box_HrpA ATP-de 20.8 34 0.00087 15.0 0.4 166 141-311 968-1163(1320) 131 PRK13944 protein-L-isoaspartat 20.7 61 0.0016 13.5 5.1 38 59-103 54-91 (205) 132 PRK00175 metX homoserine O-ace 20.6 57 0.0015 13.6 1.6 38 62-102 123-162 (379) 133 PRK06765 homoserine O-acetyltr 20.4 62 0.0016 13.4 1.8 37 62-101 134-172 (387) 134 PRK13768 GTPase; Provisional 20.1 63 0.0016 13.4 3.8 119 107-235 51-173 (253) No 1 >TIGR00128 fabD malonyl CoA-acyl carrier protein transacylase; InterPro: IPR004410 Malonyl CoA-acyl carrier protein transacylase 2.3.1.39 from EC is involved in fatty acid biosynthesis and transfers the malonyl moeity from coenzyme A to acyl-carrier protein.; GO: 0004314 [acyl-carrier-protein] S-malonyltransferase activity, 0006633 fatty acid biosynthetic process. Probab=100.00 E-value=0 Score=603.44 Aligned_cols=292 Identities=43% Similarity=0.668 Sum_probs=282.2 Q ss_pred CEEEEECCCCCCCCCCHHHHHHHHCHHHHHHHHHHHHHCCCCHHHHHCCCCHHHHHCHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 91999808733373115899998599999999999988398978985389867830467650345666654789999718 Q gi|254780536|r 1 MSTVLTFPGQGSQVIGMGRDLYDSFPEARLVFEEVDHTLNQNLSDLMWNGSQEELTATCNAQPALTAVSMAFIRVMEKNG 80 (314) Q Consensus 1 mk~~f~FpGQGsq~~gM~~~L~~~~p~~r~~~~~~~~~l~~~l~~~~~~~~~~~l~~~~~~Qp~i~~~~~a~~~~l~~~g 80 (314) ||++|||||||||..|||++||+++|..++.|+++|++|++|+...+++++.+.|+.|+++||||+++++++|++|++.+ T Consensus 1 ~~~A~vFPGQGSQ~~GMG~~L~~~~p~akelf~~A~~~Lg~~~~~l~~e~~~~~L~~T~~TQPALy~~s~~~y~~~~~~~ 80 (295) T TIGR00128 1 MKIAFVFPGQGSQTVGMGKDLYESYPIAKELFDQADEILGYDLKKLCQEGDAEELNKTQYTQPALYVVSAILYRKLKEKG 80 (295) T ss_pred CCEEEECCCCCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHC T ss_conf 92477517855553201033674064899999999887547799884268366675154013399999999999986415 Q ss_pred CCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHC-CCCCE Q ss_conf 87643544246654420122253178650233333456886531001111222221235035531122344211-35420 Q gi|254780536|r 81 LCVKRDISYVAGHSLGEYTALCAAKAFSLSDTIRLVRARGKSMQEAVPPGLGSMVAIIGLDDCVVDSICAQASR-VGICE 159 (314) Q Consensus 81 i~~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~R~~~m~~~~~~~~g~M~aV~~~~~~~~~~~~~~~~~-~~~~~ 159 (314) - -..+||+++|||||||+||+|||+|||+||++||..||++|+++++.+.|+|+||+|++.+.+..+|++..+ ...|. T Consensus 81 ~-G~~~P~f~AGHSLGEYsAL~aAGalDF~~al~LV~~RGe~M~~a~p~g~G~MaAv~Gld~e~l~~~~~~~~~eG~~V~ 159 (295) T TIGR00128 81 N-GGLKPDFVAGHSLGEYSALVAAGALDFETALKLVKKRGELMQEAVPEGGGAMAAVLGLDEEQLAQALEEATEEGEVVD 159 (295) T ss_pred C-CCCCCCEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCHHHHHHHHHHHHHCCCEEE T ss_conf 8-984653462463478999998535148889999998899888515788874167654788899999987754796678 Q ss_pred ECCCCCCCEEEECCCCCHHHHHHHHHH-HHHHHHCCCCCCCCCCCCHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 012232221353033112899988888-8653100355554323420102037888874222320287765420457871 Q gi|254780536|r 160 IANDNGGGQVVISGLQDAVKCAADTCL-NKGAKRAVFLPVSAPFHSSLMTPVSKVMKWMLDSVTKQDPVVPILPNFCASP 238 (314) Q Consensus 160 ia~~Ns~~q~visG~~~~l~~~~~~~~-~~~~~~~~~L~v~~afHs~lm~~~~~~~~~~l~~i~~~~p~ipi~S~~~g~~ 238 (314) +||.|||.|+||||.+++|+++...++ ..+++|+.+|+||+||||++|+|+.++|+..|+...|++|.||||||++.++ T Consensus 160 ~AN~N~P~QvVIsG~k~~vE~a~~~~k~~~gakr~~PL~VSgafHs~~M~~a~~~f~~~L~~~~~~~~~~pViSNV~~~~ 239 (295) T TIGR00128 160 LANFNSPGQVVISGTKDEVEAAAALLKTEAGAKRALPLEVSGAFHSRLMKPAAEKFAETLEACQFNDATVPVISNVDAKP 239 (295) T ss_pred EEECCCCCCEEEECCHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCH T ss_conf 87637896179835678999999972544387075204577055776655689999999987426877510563787750 Q ss_pred CCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHCCC Q ss_conf 0010023322222044850199999999977998999938987999999987389 Q gi|254780536|r 239 VSSIDEISRLLVEQVTGRVRWRETIQWFANHGVKSVYEVGSGKVLTGLAKRIDKS 293 (314) Q Consensus 239 ~~~~~~~~~~l~~ql~~pV~f~~~i~~l~~~g~~~fiEiGP~~~Lt~l~~~~~~~ 293 (314) +++.+.+++.|++||.+||||+++|+.+.+.|++.|+|||||.+|+||+|||.++ T Consensus 240 ~~~g~~~k~~L~~Ql~~pVrw~~s~~~l~~~G~~~f~EvGPG~VL~GL~kRi~~~ 294 (295) T TIGR00128 240 YTNGDRIKEKLIEQLYSPVRWTDSVEKLAARGVTEFAEVGPGKVLTGLIKRIKKD 294 (295) T ss_pred HCCHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCC T ss_conf 0144789999999701781147889998636630467546886112551202567 No 2 >TIGR03131 malonate_mdcH malonate decarboxylase, epsilon subunit. Members of this protein family are the epsilon subunit of malonate decarboxylase. This subunit has malonyl-CoA/dephospho-CoA acyltransferase activity. Malonate decarboxylase may be a soluble enzyme, or linked to membrane subunits and active as a sodium pump. The epsilon subunit is closely related to the malonyl CoA-acyl carrier protein (ACP) transacylase family described by TIGR00128, but acts on an ACP subunit of malonate decarboxylase that has an unusual coenzyme A derivative as its prothetic group. Probab=100.00 E-value=0 Score=553.02 Aligned_cols=295 Identities=33% Similarity=0.421 Sum_probs=277.5 Q ss_pred EEEECCCCCCCCCCHHHHHHHHCHHHHHHHHHHHHHCCCCHHHHHCCCCHHHHHCHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 99980873337311589999859999999999998839897898538986783046765034566665478999971887 Q gi|254780536|r 3 TVLTFPGQGSQVIGMGRDLYDSFPEARLVFEEVDHTLNQNLSDLMWNGSQEELTATCNAQPALTAVSMAFIRVMEKNGLC 82 (314) Q Consensus 3 ~~f~FpGQGsq~~gM~~~L~~~~p~~r~~~~~~~~~l~~~l~~~~~~~~~~~l~~~~~~Qp~i~~~~~a~~~~l~~~gi~ 82 (314) +||+||||||||+||+++|+. +|.+++.++++++.++.++.++ .+++.+++|.++||+||+++++++++|+++|+ T Consensus 1 iafvFpGQGsQ~~gM~~~l~~-~~~~~~~~~~~~~~l~~~~~~~---~~~~~l~~t~~~Qpai~a~~~al~~~l~~~g~- 75 (295) T TIGR03131 1 IALLFPGQGSQRAGMLAELPD-HPAVAAVLAEASDVLGIDPREL---DDAEALASTRSAQLCILAAGVAAWRALLALLP- 75 (295) T ss_pred EEEEECCCHHHHHHHHHHHHH-CCHHHHHHHHHHHHHCCCHHHC---CCHHHHCCCCCHHHHHHHHHHHHHHHHHHCCC- T ss_conf 099968447889999999860-9099999999998868696663---75344326531519999999999999996499- Q ss_pred CCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEECC Q ss_conf 64354424665442012225317865023333345688653100111122222123503553112234421135420012 Q gi|254780536|r 83 VKRDISYVAGHSLGEYTALCAAKAFSLSDTIRLVRARGKSMQEAVPPGLGSMVAIIGLDDCVVDSICAQASRVGICEIAN 162 (314) Q Consensus 83 ~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~R~~~m~~~~~~~~g~M~aV~~~~~~~~~~~~~~~~~~~~~~ia~ 162 (314) +|++++|||+|||+|+++||++|++|++++++.||++|+++++++ ++|+||.+.+.+.+++++... .++||| T Consensus 76 ---~P~~v~GHSlGE~aA~~~AG~ls~edal~lv~~R~~lm~~~~~~~-~gm~av~~~~~~~~~~~~~~~----~v~iA~ 147 (295) T TIGR03131 76 ---RPSAVAGYSVGEYAAAVVAGVLTFDDALRLVALRAALMDQAVPGG-YGMLAVLGLDLAAVEALIAKH----GVYLAI 147 (295) T ss_pred ---CCCEEEECCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCC-CCEEEEECCCHHHHHHHHHHC----CEEEEE T ss_conf ---888897677758999998399989999999999999999747899-835875458698888655304----737857 Q ss_pred CCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC Q ss_conf 23222135303311289998888886531003555543234201020378888742223202877654204578710010 Q gi|254780536|r 163 DNGGGQVVISGLQDAVKCAADTCLNKGAKRAVFLPVSAPFHSSLMTPVSKVMKWMLDSVTKQDPVVPILPNFCASPVSSI 242 (314) Q Consensus 163 ~Ns~~q~visG~~~~l~~~~~~~~~~~~~~~~~L~v~~afHs~lm~~~~~~~~~~l~~i~~~~p~ipi~S~~~g~~~~~~ 242 (314) +|+|+|+||||++++++++.+.+++.|+.++++|+|++|||||+|+|+.++|++.++++++++|++|+|||+||+++.+. T Consensus 148 ~Nsp~q~VisG~~~~i~~~~~~l~~~g~~~~~~L~v~~afHS~~m~~~~~~~~~~l~~~~~~~p~ipviS~~tg~~~~~~ 227 (295) T TIGR03131 148 INAPDQVVIAGSRAALRAVAELARAAGASRAKRLAVRVPSHTPLLAKAAEQFAEALAEIPLAAPRLPYLSGIDARLVRDA 227 (295) T ss_pred ECCCCCEEEECCHHHHHHHHHHHHHCCCCCEEECCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCCCCCC T ss_conf 53898669848999999999999977996220078887566099999999999998667777997169989876255684 Q ss_pred HHHHHHHHHHCCCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHCCCCCEEECCCHHHHHHHHH Q ss_conf 02332222204485019999999997799899993898799999998738981552399899999998 Q gi|254780536|r 243 DEISRLLVEQVTGRVRWRETIQWFANHGVKSVYEVGSGKVLTGLAKRIDKSLSAVSISKVEDIDLALR 310 (314) Q Consensus 243 ~~~~~~l~~ql~~pV~f~~~i~~l~~~g~~~fiEiGP~~~Lt~l~~~~~~~~~~~~~~~~~~~~~~l~ 310 (314) +.++++|++|+++||+|.++|+++.+.|+++|||+|||++|++|+|+++++.++.++++.++++..+. T Consensus 228 ~~~~~~l~~ql~~pV~f~~~i~~l~~~g~~~fiEiGP~~~Ls~lv~~il~~~~~~~~~~~~~l~~~~~ 295 (295) T TIGR03131 228 AQIRDDLARQIATPVDWHDCMQAAYERGARLVIELGPGDVLTKLANEAFPELPARSADDFRSLDGLLA 295 (295) T ss_pred CCCHHHHHHHHHCCCCHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHCCCCCEEECCCHHHHHHHHC T ss_conf 34299999871156829999999995799999997986888999999859997685499989998749 No 3 >COG0331 FabD (acyl-carrier-protein) S-malonyltransferase [Lipid metabolism] Probab=100.00 E-value=0 Score=533.40 Aligned_cols=308 Identities=49% Similarity=0.755 Sum_probs=297.6 Q ss_pred CEEEEECCCCCCCCCCHHHHHHHHCHHHHHHHHHHHHHCCCCHHHHHCCCCHHHHHCHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 91999808733373115899998599999999999988398978985389867830467650345666654789999718 Q gi|254780536|r 1 MSTVLTFPGQGSQVIGMGRDLYDSFPEARLVFEEVDHTLNQNLSDLMWNGSQEELTATCNAQPALTAVSMAFIRVMEKNG 80 (314) Q Consensus 1 mk~~f~FpGQGsq~~gM~~~L~~~~p~~r~~~~~~~~~l~~~l~~~~~~~~~~~l~~~~~~Qp~i~~~~~a~~~~l~~~g 80 (314) ||++|+||||||||.|||++||+++|.++++++++++.+++|+.++++++|++.|+.|.++||+|++++++.++.|++.| T Consensus 2 ~~~A~~FpGQGsQ~~gMg~dl~~~~~~a~~~~~~a~~~l~~~l~~i~~~~p~~~L~~T~~tQPal~~~s~a~~~~l~~~~ 81 (310) T COG0331 2 SKTAFVFPGQGSQSLGMGKDLYENSPEAKETFDEADEALGFDLWALVFEGPEEELNLTQNTQPALLLVSLAAYRVLAEQG 81 (310) T ss_pred CCCEEEECCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHHCCCCHHHHCCCCHHHHHHHHHHHHHHHHHHHHC T ss_conf 96169868832899876899986369999999999888503099984589778745661131899999999999999845 Q ss_pred CCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEE Q ss_conf 87643544246654420122253178650233333456886531001111222221235035531122344211354200 Q gi|254780536|r 81 LCVKRDISYVAGHSLGEYTALCAAKAFSLSDTIRLVRARGKSMQEAVPPGLGSMVAIIGLDDCVVDSICAQASRVGICEI 160 (314) Q Consensus 81 i~~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~R~~~m~~~~~~~~g~M~aV~~~~~~~~~~~~~~~~~~~~~~i 160 (314) . ..+|++++|||+|||+|++++|+++|+|+++|+..||.+|+++++.+.|+|.+|+|.+.+.++++|++......++| T Consensus 82 ~--~~~p~~~aGHSlGEysAla~ag~~~~ed~~~Lv~~RG~~M~~a~p~~~~~Maav~gl~~e~v~~~~~~~~~~~~v~i 159 (310) T COG0331 82 L--GVKPDFVAGHSLGEYSALAAAGVLSFEDALKLVRKRGKLMQEAVPRGEGGMAAVLGLDDEQVEKACEEAAQGTVVEI 159 (310) T ss_pred C--CCCCCEEECCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHCCCCHHHHHHHHHHHCCCCEEEE T ss_conf 8--88875540243868999998050159999999999999999870689744888709899999999997366775888 Q ss_pred CCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 12232221353033112899988888865310035555432342010203788887422232028776542045787100 Q gi|254780536|r 161 ANDNGGGQVVISGLQDAVKCAADTCLNKGAKRAVFLPVSAPFHSSLMTPVSKVMKWMLDSVTKQDPVVPILPNFCASPVS 240 (314) Q Consensus 161 a~~Ns~~q~visG~~~~l~~~~~~~~~~~~~~~~~L~v~~afHs~lm~~~~~~~~~~l~~i~~~~p~ipi~S~~~g~~~~ 240 (314) +|+|+|+|+||||.+++++++...+++.+.+++++|+|++|||||+|+|+.++|+..+....+++|.+|++||+|+.++. T Consensus 160 aN~N~~~QiVIsG~~~ale~~~~~~~~~g~kr~i~l~vs~pfHs~~m~pa~~~~~~~l~~~~~~~~~ipvi~n~~~~~~~ 239 (310) T COG0331 160 ANYNSPGQIVISGTKEALEKAAEILKEAGAKRAIPLPVSGPFHSPLMKPAADELAEALEKVRFSDPLVPVISNVDAKPVL 239 (310) T ss_pred EEECCCCCEEEECCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCCCC T ss_conf 64078872899777999999999999855654026688831120566789999999997237777545055226432103 Q ss_pred CCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHCCCCCEEECCCHHHHHHHHH Q ss_conf 1002332222204485019999999997799899993898799999998738981552399899999998 Q gi|254780536|r 241 SIDEISRLLVEQVTGRVRWRETIQWFANHGVKSVYEVGSGKVLTGLAKRIDKSLSAVSISKVEDIDLALR 310 (314) Q Consensus 241 ~~~~~~~~l~~ql~~pV~f~~~i~~l~~~g~~~fiEiGP~~~Lt~l~~~~~~~~~~~~~~~~~~~~~~l~ 310 (314) +.+.+++.+++|+++||+|.+++..+.+.|++.|+|+|||++|++|+|+++++....++.+.++++.++. T Consensus 240 ~~~~i~~~L~~q~~~pVrW~etv~~l~~~gv~~~~EiGpg~vL~gL~kri~~~~~~~~~~~~~~l~~~~~ 309 (310) T COG0331 240 DGEEIRELLAKQLTSPVRWTETVETLKADGVTRFVEIGPGKVLTGLAKRILKGLGVRAVGSLEDLNAALE 309 (310) T ss_pred CHHHHHHHHHHHHCCCEEHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHCCCCCCEECCCHHHHHHHHH T ss_conf 8999999999996387037999999986596499996772887999886327877341378988998861 No 4 >pfam00698 Acyl_transf_1 Acyl transferase domain. Probab=100.00 E-value=0 Score=522.79 Aligned_cols=298 Identities=25% Similarity=0.323 Sum_probs=266.9 Q ss_pred EEECCCCCCCCCCHHHHHHHHCHHHHHHHHHHHHHC----CCCHHHHHCCCCHHHHHCHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 998087333731158999985999999999999883----9897898538986783046765034566665478999971 Q gi|254780536|r 4 VLTFPGQGSQVIGMGRDLYDSFPEARLVFEEVDHTL----NQNLSDLMWNGSQEELTATCNAQPALTAVSMAFIRVMEKN 79 (314) Q Consensus 4 ~f~FpGQGsq~~gM~~~L~~~~p~~r~~~~~~~~~l----~~~l~~~~~~~~~~~l~~~~~~Qp~i~~~~~a~~~~l~~~ 79 (314) +|+||||||||+|||++||+++|.||+.+++|++++ ++++.+++++++.+.+++|.++||+||+++++++++|+++ T Consensus 1 VFlFpGQGsQ~~gMg~~L~~~~p~fr~~~~~~~~~l~~~~g~~l~~~l~~~~~~~l~~t~~~Qpal~~~~~al~~~l~~~ 80 (319) T pfam00698 1 VFVFSGQGSQWAGMGMDLLKTSPVFAAAIARCDEAFKPQYGFSVLDVLRNNPEGLLERVDVVQPALFAMQIALAALLRSY 80 (319) T ss_pred CEEECCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 68979615648999999996499999999999999766329999999746985555304788799999998652003322 Q ss_pred CCCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCE Q ss_conf 88764354424665442012225317865023333345688653100111122222123503553112234421135420 Q gi|254780536|r 80 GLCVKRDISYVAGHSLGEYTALCAAKAFSLSDTIRLVRARGKSMQEAVPPGLGSMVAIIGLDDCVVDSICAQASRVGICE 159 (314) Q Consensus 80 gi~~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~R~~~m~~~~~~~~g~M~aV~~~~~~~~~~~~~~~~~~~~~~ 159 (314) |+ +|++++|||+|||+|+++||+++++|++++++.|+++|++... +|+|++|.. +.+.+++++. ..+.++ T Consensus 81 Gi----~P~~v~GHSlGE~aA~~~aG~~~~~da~~lv~~R~~l~~~~~~--~g~M~av~~-~~~~v~~~~~---~~~~v~ 150 (319) T pfam00698 81 GV----TPAAVVGHSLGEIAAAVVAGALSLEEAALVVALRSRLMRQLAG--PGGMAAVGL-PAEEVEQLAR---WPDRVV 150 (319) T ss_pred CC----CCCEEEECCCCHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCC--CCCEEHHCC-CHHHHHHHHC---CCCCEE T ss_conf 98----7666866370489999872898889999999999999985589--765304316-6557999845---489779 Q ss_pred ECCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 01223222135303311289998888886531003555543234201020378888742223202877654204578710 Q gi|254780536|r 160 IANDNGGGQVVISGLQDAVKCAADTCLNKGAKRAVFLPVSAPFHSSLMTPVSKVMKWMLDSVTKQDPVVPILPNFCASPV 239 (314) Q Consensus 160 ia~~Ns~~q~visG~~~~l~~~~~~~~~~~~~~~~~L~v~~afHs~lm~~~~~~~~~~l~~i~~~~p~ipi~S~~~g~~~ 239 (314) |||+|+|+|+||||++++++++.+.++++|+ ++++|+|++|||||+|+++.++|++.+.++++++|++|+|||+||+++ T Consensus 151 iA~~Nsp~q~visG~~~~i~~~~~~l~~~g~-~~~~L~v~~afHs~~m~~a~~~~~~~l~~~~~~~p~~p~~S~~tg~~~ 229 (319) T pfam00698 151 VAIVNSPRSVVVAGPQEALDEFVERVEARGV-RARVIAVDYASHSPQVEAIGDALALALADIAPRTPRVPFYSTTSGDPS 229 (319) T ss_pred EEEECCCCCEEECCCHHHHHHHHHHHHHCCC-CEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCCC T ss_conf 9999888855632889999999999985797-079926898766299999999999998369989998659986778646 Q ss_pred CCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHCCCC-----CEEE-C-CCHHHHHHHHHHH Q ss_conf 0100233222220448501999999999779989999389879999999873898-----1552-3-9989999999874 Q gi|254780536|r 240 SSIDEISRLLVEQVTGRVRWRETIQWFANHGVKSVYEVGSGKVLTGLAKRIDKSL-----SAVS-I-SKVEDIDLALRSI 312 (314) Q Consensus 240 ~~~~~~~~~l~~ql~~pV~f~~~i~~l~~~g~~~fiEiGP~~~Lt~l~~~~~~~~-----~~~~-~-~~~~~~~~~l~~~ 312 (314) .+.+.+.+||++|+++||+|.++|++++++|+++|||+|||++|++++|++++.. ...+ + .+.++...++..+ T Consensus 230 ~~~~~~~~~~~~ql~~pV~f~~~i~~l~~~g~~~fiEiGP~~~L~~~i~~~l~~~~~~~~~~~~tl~~~~~~~~~~~~~l 309 (319) T pfam00698 230 DQRELDAEYWYRNLRNPVRFAEAVLAAAEQGYLVFVEVSPHPVLLAAVEETLKSADGKDATLVGTLIRDQGDLVTFLYAL 309 (319) T ss_pred CCHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHH T ss_conf 87666199999742076119999999996799999991988889999999998558997079736889998799999999 No 5 >smart00827 PKS_AT Acyl transferase domain in polyketide synthase (PKS) enzymes. Probab=100.00 E-value=0 Score=517.90 Aligned_cols=279 Identities=35% Similarity=0.473 Sum_probs=256.1 Q ss_pred CCCCCCCCCCHHHHHHHHCHHHHHHHHHHHHHC----CCCHHHHHCCCC--HHHHHCHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 087333731158999985999999999999883----989789853898--67830467650345666654789999718 Q gi|254780536|r 7 FPGQGSQVIGMGRDLYDSFPEARLVFEEVDHTL----NQNLSDLMWNGS--QEELTATCNAQPALTAVSMAFIRVMEKNG 80 (314) Q Consensus 7 FpGQGsq~~gM~~~L~~~~p~~r~~~~~~~~~l----~~~l~~~~~~~~--~~~l~~~~~~Qp~i~~~~~a~~~~l~~~g 80 (314) ||||||||+|||++||+++|.||+.++++++++ ++++.+.++.++ .+.+++|.++||+||+++++++++|+++| T Consensus 1 FpGQGsQ~~gMg~~L~~~~p~~r~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~t~~~Qpai~~~~~al~~~l~~~G 80 (298) T smart00827 1 FTGQGSQWPGMGRELYETEPVFRAALDECDAALQPLLGWSLLDVLFGEDGAASLLRRTEVAQPALFAVQVALARLWRSWG 80 (298) T ss_pred CCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99805759999999997299999999999999876049998999836997444532225786999999999999999859 Q ss_pred CCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEE Q ss_conf 87643544246654420122253178650233333456886531001111222221235035531122344211354200 Q gi|254780536|r 81 LCVKRDISYVAGHSLGEYTALCAAKAFSLSDTIRLVRARGKSMQEAVPPGLGSMVAIIGLDDCVVDSICAQASRVGICEI 160 (314) Q Consensus 81 i~~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~R~~~m~~~~~~~~g~M~aV~~~~~~~~~~~~~~~~~~~~~~i 160 (314) + +||+++|||+|||+||++||++|++|++++++.||++|+++.. +|+|+|| +.+.+++++++... .+.++| T Consensus 81 i----~Pd~v~GhSlGE~aAl~aaG~l~~edal~lv~~R~~~m~~~~~--~G~M~aV-~~~~~~v~~~~~~~--~~~v~i 151 (298) T smart00827 81 V----RPDAVVGHSLGEIAAAYVAGVLSLEDAARLVAARGRLMQALPG--GGAMLAV-GLSEEEVEELLAGY--GGRVSV 151 (298) T ss_pred C----CCCEEEECCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCC--CCCEEEE-CCCHHHHHHHHHHC--CCEEEE T ss_conf 9----8777865579899999984999899999999999999997167--8824665-68999999998725--890799 Q ss_pred CCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 12232221353033112899988888865310035555432342010203788887422232028776542045787100 Q gi|254780536|r 161 ANDNGGGQVVISGLQDAVKCAADTCLNKGAKRAVFLPVSAPFHSSLMTPVSKVMKWMLDSVTKQDPVVPILPNFCASPVS 240 (314) Q Consensus 161 a~~Ns~~q~visG~~~~l~~~~~~~~~~~~~~~~~L~v~~afHs~lm~~~~~~~~~~l~~i~~~~p~ipi~S~~~g~~~~ 240 (314) ||+|+|+|+||||+.++++++.+.++++|. ++++|+|++|||||+|+++.++|++.++++++++|++|+|||+||+++. T Consensus 152 A~~Nsp~q~VisG~~~~i~~~~~~l~~~g~-~~~~L~v~~afHS~lm~~a~~~~~~~l~~~~~~~p~~pv~S~~tg~~~~ 230 (298) T smart00827 152 AAVNGPSSVVLSGDEDAVDELAAALEARGI-RARRLKVDHAFHSPHMDPILDEFREALAGITPRPPRIPFVSTVTGELID 230 (298) T ss_pred EEECCCCCEEEECCHHHHHHHHHHHHHCCC-CEEECCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCEEECCCCEECC T ss_conf 985289828983789999999999985798-2698179885558859999999999983699889989756077871568 Q ss_pred CCHHH-HHHHHHHCCCCCCHHHHHHHHHH-CCCCEEEEECCCHHHHHHHHHHCCCCC Q ss_conf 10023-32222204485019999999997-799899993898799999998738981 Q gi|254780536|r 241 SIDEI-SRLLVEQVTGRVRWRETIQWFAN-HGVKSVYEVGSGKVLTGLAKRIDKSLS 295 (314) Q Consensus 241 ~~~~~-~~~l~~ql~~pV~f~~~i~~l~~-~g~~~fiEiGP~~~Lt~l~~~~~~~~~ 295 (314) +.+.. .+||++|+++||+|.++|+++.+ +|+++|||+|||++|++|+|+|+++.. T Consensus 231 ~~~~~~~~yl~~ql~~pV~f~~~i~~l~~~~g~~~fiEiGP~~~Lt~l~~~il~~~~ 287 (298) T smart00827 231 GAELDDAEYWVRNLREPVRFADAVRALLAEQGVTVFLEVGPHPVLTGPIKQTLPAAG 287 (298) T ss_pred CHHHCCHHHHHHHHHCCCCHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHCC T ss_conf 434368999998720756299999999971799889986985899999999861259 No 6 >KOG2926 consensus Probab=100.00 E-value=0 Score=421.80 Aligned_cols=292 Identities=35% Similarity=0.555 Sum_probs=270.0 Q ss_pred EEEECCCCCCCCCCHHHHHHHHCHHHHHHHHHHHHHCCCCHHHHHCCCCHHHHHCHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 99980873337311589999859999999999998839897898538986783046765034566665478999971887 Q gi|254780536|r 3 TVLTFPGQGSQVIGMGRDLYDSFPEARLVFEEVDHTLNQNLSDLMWNGSQEELTATCNAQPALTAVSMAFIRVMEKNGLC 82 (314) Q Consensus 3 ~~f~FpGQGsq~~gM~~~L~~~~p~~r~~~~~~~~~l~~~l~~~~~~~~~~~l~~~~~~Qp~i~~~~~a~~~~l~~~gi~ 82 (314) .+++|||||+||+|||+.+++ +|..|+.+++|++++|+|+.+++.++|.+.+++|.++||+|++.+++..+.++..+.. T Consensus 64 s~iLFPGQG~q~vgm~q~~l~-~p~a~~~~~~A~~vl~YdLlki~~~gP~e~ldrT~~~QpAI~~~SlAa~E~l~~~~p~ 142 (386) T KOG2926 64 SVILFPGQGAQSVGMGQYLLQ-NPAARRLFAEASNVLGYDLLKICVNGPKEKLDRTVIAQPAIDVSSLAALEQLRLLGPS 142 (386) T ss_pred EEEEECCCCHHHHHHHHHHHH-CCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHCCCCCCCCEECCCHHHHHHCCCCCCC T ss_conf 389707987046666689874-8620679999988766799999860862112202002531000238889745435843 Q ss_pred CCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHH------CCC Q ss_conf 64354424665442012225317865023333345688653100111122222123503553112234421------135 Q gi|254780536|r 83 VKRDISYVAGHSLGEYTALCAAKAFSLSDTIRLVRARGKSMQEAVPPGLGSMVAIIGLDDCVVDSICAQAS------RVG 156 (314) Q Consensus 83 ~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~R~~~m~~~~~~~~g~M~aV~~~~~~~~~~~~~~~~------~~~ 156 (314) .....++.+|||+|||+|++++|+++|++++|+|..|+..|+++.+--+++|..+++.+...+++.|.... +.. T Consensus 143 ~ie~~~~~aGfSlGEy~alvfa~aLsFs~alKlVk~Ra~AMs~a~~~~~~~m~~~~~~p~sk~~~~~~~a~~~~~~qe~~ 222 (386) T KOG2926 143 IIENLVVTAGFSLGEYAALVFAGALSFSSALKLVKARAEAMSEASELVASGMVMILGLPTSKVQKACASANQLSASQEYP 222 (386) T ss_pred HHHEEEEECCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCHHHHHHHHHHHHHHHHHCCCC T ss_conf 44303200065588899999715456477899999999999999987415663000574767899886467776540487 Q ss_pred CCEECCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 42001223222135303311289998888886531003555543234201020378888742223202877654204578 Q gi|254780536|r 157 ICEIANDNGGGQVVISGLQDAVKCAADTCLNKGAKRAVFLPVSAPFHSSLMTPVSKVMKWMLDSVTKQDPVVPILPNFCA 236 (314) Q Consensus 157 ~~~ia~~Ns~~q~visG~~~~l~~~~~~~~~~~~~~~~~L~v~~afHs~lm~~~~~~~~~~l~~i~~~~p~ipi~S~~~g 236 (314) .+.|||||+|+|+||||..++|+.+.+..++.++.|.++|+|++||||++|+|+.+++.++|..+.++.|.+|+|||+|| T Consensus 223 ~~~VANyl~~~~~VvsG~~~~Le~lee~~~sf~~~r~~~LaVsgAFHTr~MepAvepl~~Al~~vei~~p~~pViSNvdg 302 (386) T KOG2926 223 VCEVANYLSPGQRVVSGLVKALESLEENAKSFKIRRMKRLAVSGAFHTRLMEPAVEPLTKALKAVEIKNPVIPVISNVDG 302 (386) T ss_pred EEEEECCCCCCCEEEECCHHHHHHHHHHHHHHHHHHHEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCC T ss_conf 35450357787279507688899999998864044200232032120346566578999999988723887631654778 Q ss_pred CCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHC---CCCEEEEECCCHHHHHHHHHHCCCCC Q ss_conf 710010023322222044850199999999977---99899993898799999998738981 Q gi|254780536|r 237 SPVSSIDEISRLLVEQVTGRVRWRETIQWFANH---GVKSVYEVGSGKVLTGLAKRIDKSLS 295 (314) Q Consensus 237 ~~~~~~~~~~~~l~~ql~~pV~f~~~i~~l~~~---g~~~fiEiGP~~~Lt~l~~~~~~~~~ 295 (314) +++.+++++++.+++|+.+||+|.+|++++.+. |.+.++|+|||++|.+++|+..+... T Consensus 303 ~~~~~~~hi~~~l~kQ~~rPV~we~~~~ti~sk~~~g~~~sye~GPG~~l~~ilk~~~~~a~ 364 (386) T KOG2926 303 KPYRDPGHILKQLAKQIVRPVQWEQTLKTIYSKQGVGFPRSYEVGPGRVLVAILKRNNPQAD 364 (386) T ss_pred CCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCEEEECCCHHHHHHHHHHCCHHH T ss_conf 60578678999999974474119999999984587788841765784779999998272044 No 7 >TIGR02813 omega_3_PfaA polyketide-type polyunsaturated fatty acid synthase PfaA; InterPro: IPR004432 Members of this entry represent PfaA, which is involved in omega-3 polyunsaturated fatty acid biosynthesis, for example PfaA from the eicosapentaenoic acid biosynthesis operon in Photobacterium profundum strain SS9. PfaA is found together with PfaB, PfaC, and PfaD, and the functions of the individual polypeptides have not yet been described. More distant homologues of PfaA, also included in this entry, appear to be involved in polyketide-like biosynthetic mechanisms of polyunsaturated fatty acid biosynthesis; an alternative to the more familiar iterated mechanism of chain extension and desaturation, and like PfaA the more distant homologues are likewise found near genes for homologues of PfaB, PfaC, and/or PfaD.. Probab=100.00 E-value=0 Score=408.37 Aligned_cols=305 Identities=32% Similarity=0.455 Sum_probs=262.6 Q ss_pred EEEEECCCCCCCCCCHHHHHHHHCHHHHHHHHHHHHHCCC----CHHHHHCCCC----------HHHHHCHHHHHHHHHH Q ss_conf 1999808733373115899998599999999999988398----9789853898----------6783046765034566 Q gi|254780536|r 2 STVLTFPGQGSQVIGMGRDLYDSFPEARLVFEEVDHTLNQ----NLSDLMWNGS----------QEELTATCNAQPALTA 67 (314) Q Consensus 2 k~~f~FpGQGsq~~gM~~~L~~~~p~~r~~~~~~~~~l~~----~l~~~~~~~~----------~~~l~~~~~~Qp~i~~ 67 (314) |++=||.||||||.+||+||--+||..|+.+...|..++. .|...+|+-| ++.|..|.++|++|-+ T Consensus 590 Kv~ALFaGQGsQYlNMG~ELAc~fP~~~~~~~~~D~~F~~~~~~~Ls~~~yPIP~F~~~s~~AQ~~~L~~T~~AQsAIG~ 669 (2773) T TIGR02813 590 KVAALFAGQGSQYLNMGRELACNFPELRQALADADKVFTQAGKGALSSVLYPIPVFNDESRKAQEEALTNTAYAQSAIGV 669 (2773) T ss_pred EEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHH T ss_conf 07887327734677754887414768999999765565204665423221456665426789999999855521204548 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCC-CCCCCCCCCCCHHHHHH Q ss_conf 66547899997188764354424665442012225317865023333345688653100111-12222212350355311 Q gi|254780536|r 68 VSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALCAAKAFSLSDTIRLVRARGKSMQEAVPP-GLGSMVAIIGLDDCVVD 146 (314) Q Consensus 68 ~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~R~~~m~~~~~~-~~g~M~aV~~~~~~~~~ 146 (314) ++++.|.++...|. ++|+++|||+||++|||+||++|-+|=++|.+.||+.|...... +.|.|.+|+--.....+ T Consensus 670 lS~g~y~~~T~AGF----~aD~~AGHSFGELsALcAaGViS~~DY~~LA~~RG~AMA~~~~e~~~G~~~a~~l~~~~~~~ 745 (2773) T TIGR02813 670 LSMGQYKLFTQAGF----KADMVAGHSFGELSALCAAGVISDDDYIKLAFSRGQAMAASTAEADIGVMYAVILAVKKSAE 745 (2773) T ss_pred HHHHHHHHHHHCCC----CCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCHH T ss_conf 88999999984587----42323456446799998637655378999999850642488553012114545454203445 Q ss_pred HHHHHHHCCCCCEECCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHCCHHHHHHHHHHHHHHHCCC Q ss_conf 22344211354200122322213530331128999888888653100355554323420102037888874222320287 Q gi|254780536|r 147 SICAQASRVGICEIANDNGGGQVVISGLQDAVKCAADTCLNKGAKRAVFLPVSAPFHSSLMTPVSKVMKWMLDSVTKQDP 226 (314) Q Consensus 147 ~~~~~~~~~~~~~ia~~Ns~~q~visG~~~~l~~~~~~~~~~~~~~~~~L~v~~afHs~lm~~~~~~~~~~l~~i~~~~p 226 (314) .+.........+.|||||||+|.||+|.........+.++.+|- ++++|||++||||||+..+.++|.+.+..-.|+.| T Consensus 746 ~~~~~i~~~~~v~iANYNs~~QlViAG~s~~~~~A~~~l~~~Gf-K~ipLPVSgAFHTPLVaHAQKPFs~AID~A~F~~P 824 (2773) T TIGR02813 746 VIKNCIKDFEGVSIANYNSPTQLVIAGVSTAIQLAAKALKEKGF-KAIPLPVSGAFHTPLVAHAQKPFSAAIDKAKFNTP 824 (2773) T ss_pred HHHHHHHHHCCCEEEEECCCCCEEEECCCHHHHHHHHHHHCCCC-EEEECCCCCHHCCCCHHHCCCCHHHHHHCCCCCCC T ss_conf 78888886337436630588756985641789999999852884-78745766100034013205735787501447898 Q ss_pred CCCCCCCCCCCCCC-CCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHCCC----CCEEECC- Q ss_conf 76542045787100-10023322222044850199999999977998999938987999999987389----8155239- Q gi|254780536|r 227 VVPILPNFCASPVS-SIDEISRLLVEQVTGRVRWRETIQWFANHGVKSVYEVGSGKVLTGLAKRIDKS----LSAVSIS- 300 (314) Q Consensus 227 ~ipi~S~~~g~~~~-~~~~~~~~l~~ql~~pV~f~~~i~~l~~~g~~~fiEiGP~~~Lt~l~~~~~~~----~~~~~~~- 300 (314) .+|+|||-||+.-. +..+|+..|-+||.+.|+|.+.++++|.+|.|+|||+||.++|++|+.++++. +-++++| T Consensus 825 ~~plYsNgTg~~H~~~a~~IK~alK~H~L~SV~F~~~l~~~Y~~GaRVFVEFGPKNiL~KLvE~~L~~~~~~l~~IsiN~ 904 (2773) T TIGR02813 825 LVPLYSNGTGKLHKNDAKAIKKALKDHMLQSVHFSEQLEAMYAAGARVFVEFGPKNILQKLVENTLKDKEDELCAISINP 904 (2773) T ss_pred CEEEECCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHHCCEEEEEECC T ss_conf 50210053113032128999999987678742256789999847996999547435678999999853015878998668 Q ss_pred CH-HHHHHHHHH Q ss_conf 98-999999987 Q gi|254780536|r 301 KV-EDIDLALRS 311 (314) Q Consensus 301 ~~-~~~~~~l~~ 311 (314) |+ .|.|.-||. T Consensus 905 NPK~~sD~QlR~ 916 (2773) T TIGR02813 905 NPKGDSDSQLRQ 916 (2773) T ss_pred CCCCCHHHHHHH T ss_conf 898535689999 No 8 >COG3321 Polyketide synthase modules and related proteins [Secondary metabolites biosynthesis, transport, and catabolism] Probab=100.00 E-value=0 Score=374.47 Aligned_cols=284 Identities=32% Similarity=0.420 Sum_probs=253.8 Q ss_pred EEEEECCCCCCCCCCHHHHHHHHCHHHHHHHHHHHHH----CCCCHHHHHCCCCHHHHHCHHHHHHHHHHHHHHHHHHHH Q ss_conf 1999808733373115899998599999999999988----398978985389867830467650345666654789999 Q gi|254780536|r 2 STVLTFPGQGSQVIGMGRDLYDSFPEARLVFEEVDHT----LNQNLSDLMWNGSQEELTATCNAQPALTAVSMAFIRVME 77 (314) Q Consensus 2 k~~f~FpGQGsq~~gM~~~L~~~~p~~r~~~~~~~~~----l~~~l~~~~~~~~~~~l~~~~~~Qp~i~~~~~a~~~~l~ 77 (314) |++|+|||||+||.+||++||+.+|.|+++++.|+.. +++++.+.++.+....+..+.++||++|+++++++++|+ T Consensus 526 ~~vfvf~gqgsq~~~mg~el~~~~p~f~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~l~~~~~~Qp~lfai~~ala~l~~ 605 (1061) T COG3321 526 KTVFVFPGQGSQWAGMGRELYALEPVFASAFDALEALLHRLLGFSLPEVIFAPDYPFLESIDFAQPALFAVSVALAALWR 605 (1061) T ss_pred CEEEEECCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHCCCCCCCEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 12898670778999988997544299999997778899987288630132278875556720002057889999999887 Q ss_pred HCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 71887643544246654420122253178650233333456886531001111222221235035531122344211354 Q gi|254780536|r 78 KNGLCVKRDISYVAGHSLGEYTALCAAKAFSLSDTIRLVRARGKSMQEAVPPGLGSMVAIIGLDDCVVDSICAQASRVGI 157 (314) Q Consensus 78 ~~gi~~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~R~~~m~~~~~~~~g~M~aV~~~~~~~~~~~~~~~~~~~~ 157 (314) ++|+ .|++++|||+||++|++++|+++++|+++++..|+++|+..... |+|++|.....+.+.+++.... .. T Consensus 606 s~gv----~p~~viGhS~gE~aaA~~aGv~s~~d~~~~v~~Rg~lm~~~~~~--G~m~~v~~~~~~~~~~~~~~~~--~~ 677 (1061) T COG3321 606 SWGV----IPGAVIGHSLGELAAAVAAGVLSLEDALRVVATRGRLMQQLAGE--GAMLAVELSLLAEVQELLALGR--PQ 677 (1061) T ss_pred HCCC----CCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHCHHHCCCCCC--CCHHHHHCCCCHHHHHHHHCCC--CC T ss_conf 4587----66300575689999999726587266678888624541157877--4447551773112578763245--64 Q ss_pred CEECCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 20012232221353033112899988888865310035555432342010203788887422232028776542045787 Q gi|254780536|r 158 CEIANDNGGGQVVISGLQDAVKCAADTCLNKGAKRAVFLPVSAPFHSSLMTPVSKVMKWMLDSVTKQDPVVPILPNFCAS 237 (314) Q Consensus 158 ~~ia~~Ns~~q~visG~~~~l~~~~~~~~~~~~~~~~~L~v~~afHs~lm~~~~~~~~~~l~~i~~~~p~ipi~S~~~g~ 237 (314) +.|+++|+|+++||+|+.+.+.++...+...+. ++++++++++|||++|+++.+.|...+.++.++.|++|+||++++. T Consensus 678 v~ia~~n~P~~~vi~g~~~~i~~l~~~~~~~~~-~~~~~~v~~a~hs~~m~~~~~~~~~~la~i~~~~p~~p~~S~~~~~ 756 (1061) T COG3321 678 VPLAAVNSPQQVVIAGDPEAIAALIARLQAQGV-RARRLAVSHAFHSPLMDPILDEFAAALADLAPRPPQIPLISNVTGD 756 (1061) T ss_pred CEEEEECCCCEEEECCCHHHHHHHHHHHHCCCC-CCEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCC T ss_conf 307996487048947988899999999732686-5324354102654778889999999987415588776044311036 Q ss_pred CCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHCCCC Q ss_conf 100100233222220448501999999999779989999389879999999873898 Q gi|254780536|r 238 PVSSIDEISRLLVEQVTGRVRWRETIQWFANHGVKSVYEVGSGKVLTGLAKRIDKSL 294 (314) Q Consensus 238 ~~~~~~~~~~~l~~ql~~pV~f~~~i~~l~~~g~~~fiEiGP~~~Lt~l~~~~~~~~ 294 (314) +...+-...+||.+++++||+|.+.++.+.+.|.++|+|+|||++|+.+++++.... T Consensus 757 ~~~~~~~d~~yw~~~~r~~v~f~~~i~~~~~~~~~~f~E~~p~p~l~~~~~~~~~~~ 813 (1061) T COG3321 757 LAGEPGGDAQYWVQHLRQPVRFADAIAAALADGARTFIEVGPGPVLTESIKQTLRDA 813 (1061) T ss_pred CCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHH T ss_conf 566665578899998772321899999987534765899468930578999865645 No 9 >KOG1202 consensus Probab=100.00 E-value=5.5e-44 Score=286.12 Aligned_cols=295 Identities=23% Similarity=0.281 Sum_probs=245.5 Q ss_pred EEEECCCCCCCCCCHHHHHHHHCHHHHHHHHHHHHHC---CCCHHHHHCCCCHHHHHCHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 9998087333731158999985999999999999883---9897898538986783046765034566665478999971 Q gi|254780536|r 3 TVLTFPGQGSQVIGMGRDLYDSFPEARLVFEEVDHTL---NQNLSDLMWNGSQEELTATCNAQPALTAVSMAFIRVMEKN 79 (314) Q Consensus 3 ~~f~FpGQGsq~~gM~~~L~~~~p~~r~~~~~~~~~l---~~~l~~~~~~~~~~~l~~~~~~Qp~i~~~~~a~~~~l~~~ 79 (314) +=|+|+|.||||+||+++|++ .+.||..+.++++.+ |.|+.+++...+++..+++.++...|.++|+|+.++|... T Consensus 501 iwfiysGMGsQW~~Ma~~LMk-l~~F~dsi~~~ae~l~~~gldv~~vL~~s~~~tfdn~l~sfvsitAiQiaLtDlLs~l 579 (2376) T KOG1202 501 IWFIYSGMGSQWAGMAKDLMK-LERFRDSIQRSAEVLKPFGLDVIDVLTRSDESTFDNILNSFVSITAIQIALTDLLSCL 579 (2376) T ss_pred EEEEEECCCCHHHHHHHHHHH-HHHHHHHHHHHHHHHCCCCCCHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 699971775322677888760-3888999998886533227506554107986788899999999999999999999861 Q ss_pred CCCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCE Q ss_conf 88764354424665442012225317865023333345688653100111122222123503553112234421135420 Q gi|254780536|r 80 GLCVKRDISYVAGHSLGEYTALCAAKAFSLSDTIRLVRARGKSMQEAVPPGLGSMVAIIGLDDCVVDSICAQASRVGICE 159 (314) Q Consensus 80 gi~~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~R~~~m~~~~~~~~g~M~aV~~~~~~~~~~~~~~~~~~~~~~ 159 (314) || +||-++|||+||..|.|+-|+++.|+++...+.||+.+-+. ...+|.|.|| |+..+.+.+-+ .+.++ T Consensus 580 gi----~PDGIvGHS~GElgc~YaDGclt~EqtvlaAYwRG~sild~-~l~kGaMAAV-GLsWEq~~~r~-----P~~~~ 648 (2376) T KOG1202 580 GI----RPDGIVGHSLGELGCGYADGCLTQEQTVLAAYWRGQSILDT-HLPKGAMAAV-GLSWEQCKSRC-----PPDVV 648 (2376) T ss_pred CC----CCCCCCCCCCCHHCCCCCCCCCCHHHHHHHHHHCCCEECCC-CCCCCCHHHH-CCCHHHHHCCC-----CCCCC T ss_conf 88----88752234532002100266668899999999757620003-4777413342-47799974419-----98432 Q ss_pred ECCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCC-CCCCCHHCCHHHHHHHHHHHHHHHC--CCCCCCCCCCC- Q ss_conf 0122322213530331128999888888653100355554-3234201020378888742223202--87765420457- Q gi|254780536|r 160 IANDNGGGQVVISGLQDAVKCAADTCLNKGAKRAVFLPVS-APFHSSLMTPVSKVMKWMLDSVTKQ--DPVVPILPNFC- 235 (314) Q Consensus 160 ia~~Ns~~q~visG~~~~l~~~~~~~~~~~~~~~~~L~v~-~afHs~lm~~~~~~~~~~l~~i~~~--~p~ipi~S~~~- 235 (314) .+|.||.+.++|||+.+.++++.++++++|++ ++...++ +|||||+|+.+.+++++.++.+... +-...++|+.- T Consensus 649 paCHNs~D~~TiSGp~a~v~~~v~qL~~egvF-ak~V~t~G~aFHS~~m~a~~p~l~~~l~k~i~epK~rsarWlSTSip 727 (2376) T KOG1202 649 PACHNSKDNVTISGPQASVFAFVEQLRAEGVF-AKEVRTGGYAFHSPYMEAAAPPLRQSLEKVIPEPKPRSARWLSTSIP 727 (2376) T ss_pred CCCCCCCCCEEECCCHHHHHHHHHHHHHCCEE-EEEECCCCCCCCCHHHHHHCHHHHHHHHHHCCCCCCCCCCHHHCCCC T ss_conf 31037887337638727899999986445706-66743787312678887327699999998568998754332114687 Q ss_pred -CCCCCCCH--HHHHHHHHHCCCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHC-CCCCEEECCC---HHHHHHH Q ss_conf -87100100--233222220448501999999999779989999389879999999873-8981552399---8999999 Q gi|254780536|r 236 -ASPVSSID--EISRLLVEQVTGRVRWRETIQWFANHGVKSVYEVGSGKVLTGLAKRID-KSLSAVSISK---VEDIDLA 308 (314) Q Consensus 236 -g~~~~~~~--~~~~~l~~ql~~pV~f~~~i~~l~~~g~~~fiEiGP~~~Lt~l~~~~~-~~~~~~~~~~---~~~~~~~ 308 (314) .+|-++-. -..+|.++++++||.|.++++.+-++. +.|||.||..|....|+.+ +.+..+++-+ .++++-| T Consensus 728 Ea~W~s~la~tsSA~Y~vnNl~SPVLF~eAlq~vP~nA--v~vEiAPH~LlqAiLkRsL~p~~t~v~Lmkr~h~~NlEff 805 (2376) T KOG1202 728 EAQWHSSLARTSSAEYHVNNLVSPVLFHEALQHVPENA--VVVEIAPHGLLQAILKRSLKPSCTNVSLMKRGHRNNLEFF 805 (2376) T ss_pred HHHHCCHHHHHCCHHHHHHCCCCHHHHHHHHHHCCCCC--EEEEECCHHHHHHHHHHHCCCCCCEEHHHCCCCHHHHHHH T ss_conf 56504756543024666633333788999997576540--6998562689999998523886435223217620039999 Q ss_pred HHHH Q ss_conf 9874 Q gi|254780536|r 309 LRSI 312 (314) Q Consensus 309 l~~~ 312 (314) |..+ T Consensus 806 L~~l 809 (2376) T KOG1202 806 LAGL 809 (2376) T ss_pred HHHH T ss_conf 9877 No 10 >TIGR02816 pfaB_fam PfaB family protein; InterPro: IPR014181 This entry represents a group of proteins based on a Pfam B protein family. The protein PfaB family is part of a four-gene locus, which is similar to polyketide biosynthesis systems, responsible for omega-3 polyunsaturated fatty acid biosynthesis in several high pressure and/or cold-adapted bacteria. The fairly permissive trusted cut off set for this HMM allows detection of homologues encoded near homologues to other proteins of the locus: PfaA, PfaC, and/or PfaD. The likely role in every case is either polyunsaturated fatty acid or polyketide biosynthesis.. Probab=99.91 E-value=9.1e-24 Score=163.73 Aligned_cols=287 Identities=15% Similarity=0.231 Sum_probs=191.4 Q ss_pred EEEECCCCCCCCCCHHHHHHHHCHHHHHHHHHHHHHCCCCHHHHHCCC------CHHHHHCHHHHHHHH--HHHHHHHHH Q ss_conf 999808733373115899998599999999999988398978985389------867830467650345--666654789 Q gi|254780536|r 3 TVLTFPGQGSQVIGMGRDLYDSFPEARLVFEEVDHTLNQNLSDLMWNG------SQEELTATCNAQPAL--TAVSMAFIR 74 (314) Q Consensus 3 ~~f~FpGQGsq~~gM~~~L~~~~p~~r~~~~~~~~~l~~~l~~~~~~~------~~~~l~~~~~~Qp~i--~~~~~a~~~ 74 (314) ++|||||-|.-|+||++||.+-||..-.-+|+ .-||.+.+... |+. -.+......+| +-.+|-+.. T Consensus 183 ~~FVYPGVGTVY~~Ml~~lHQYFPALyA~LER-----EG~LK~MLQAE~~Y~~~Pkh-aA~M~LG~LAIAGVGSSYLLTQ 256 (538) T TIGR02816 183 LAFVYPGVGTVYADMLRDLHQYFPALYAKLER-----EGDLKEMLQAEDIYGLDPKH-AAEMSLGDLAIAGVGSSYLLTQ 256 (538) T ss_pred CEEECCCCCCCHHHHHHHHHHHHHHHHHHHCC-----CCCHHHHHHHHHHCCCCCHH-HHHCCHHHHHHHHCCHHHHHHH T ss_conf 03771885430167887676532688875236-----76568885176530677204-5421322355541026889998 Q ss_pred HHHHC-CCCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHH--------------H--HHHHHHHHHHHHHCCCCCC--CC- Q ss_conf 99971-887643544246654420122253178650233--------------3--3345688653100111122--22- Q gi|254780536|r 75 VMEKN-GLCVKRDISYVAGHSLGEYTALCAAKAFSLSDT--------------I--RLVRARGKSMQEAVPPGLG--SM- 134 (314) Q Consensus 75 ~l~~~-gi~~~~~p~~v~GhSlGE~aAl~~aG~ls~eda--------------~--~lv~~R~~~m~~~~~~~~g--~M- 134 (314) +|.+. .| +|+|.+|||.||-+.+..=|+|.=.-| + ||.+. +++|.-... .. T Consensus 257 LL~~EF~~----KP~FALGYS~GEASMWASLGVW~NPHALI~~TQT~P~FT~AISGKLTAV-----R~AWQL~DtA~EI~ 327 (538) T TIGR02816 257 LLTDEFAV----KPDFALGYSKGEASMWASLGVWKNPHALIEKTQTDPLFTSAISGKLTAV-----REAWQLDDTAEEIT 327 (538) T ss_pred HHHHHHCC----CCCCCCCCCCCHHHHHHHCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHH-----HHHHCCCCCHHHHC T ss_conf 75554146----8761235552125777541433473466530168852021110034666-----54403222142302 Q ss_pred C--CCCCCHHHHHHHHHHHHHCCCCCEECCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCC-HHCCHHH Q ss_conf 2--123503553112234421135420012232221353033112899988888865310035555432342-0102037 Q gi|254780536|r 135 V--AIIGLDDCVVDSICAQASRVGICEIANDNGGGQVVISGLQDAVKCAADTCLNKGAKRAVFLPVSAPFHS-SLMTPVS 211 (314) Q Consensus 135 ~--aV~~~~~~~~~~~~~~~~~~~~~~ia~~Ns~~q~visG~~~~l~~~~~~~~~~~~~~~~~L~v~~afHs-~lm~~~~ 211 (314) + =|...+...+|+++++. +..|+|.+-+ +.|||+|.+..++++.+.+-++|+- +-| --|.|| |-|..-. T Consensus 328 WNSFVVR~E~A~IEALL~~y---P~AYLAIIQG-DTCV~AGCE~~C~ALL~~LGKRGIA-ANR---VTAMHT~PAl~~H~ 399 (538) T TIGR02816 328 WNSFVVRSEAATIEALLKDY---PRAYLAIIQG-DTCVLAGCEITCKALLKKLGKRGIA-ANR---VTAMHTTPALSEHS 399 (538) T ss_pred CCCEEEECCCCCHHHHHHHC---CHHHHHHEEC-CEEEEECCCHHHHHHHHHHCCCCCC-CCC---CHHHCCCHHHHHHH T ss_conf 26417862642055532211---0342001006-7226527647789999970766421-102---01212760356323 Q ss_pred HHHHHHHHHHHHC-CCCCCCCCCCCCCCC-------CCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCEEEEECCCHHH Q ss_conf 8888742223202-877654204578710-------01002332222204485019999999997799899993898799 Q gi|254780536|r 212 KVMKWMLDSVTKQ-DPVVPILPNFCASPV-------SSIDEISRLLVEQVTGRVRWRETIQWFANHGVKSVYEVGSGKVL 283 (314) Q Consensus 212 ~~~~~~l~~i~~~-~p~ipi~S~~~g~~~-------~~~~~~~~~l~~ql~~pV~f~~~i~~l~~~g~~~fiEiGP~~~L 283 (314) .-..-+++-+.-. |..|.+||.-+--+- -+..-+...+++..|.+.+|+..|+.+.+.|.+.|||||.++.- T Consensus 400 ~V~~FY~QPL~~ELP~~I~FISAA~L~~~~~~~~~~~~SQ~~A~SIADTFC~TLDFTAL~~~A~~QGA~LFVEIGADRQ~ 479 (538) T TIGR02816 400 NVTDFYLQPLKDELPSDIKFISAADLLAKNTVVEVALDSQVVANSIADTFCTTLDFTALVRTAREQGAKLFVEIGADRQT 479 (538) T ss_pred HHHHHHCCCHHHHCCCCCCEECHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCC T ss_conf 57776315202326754310012014500023530100467887666553003458999999974487588886066543 Q ss_pred HHHHHHHCCCCC----------EEEC--CCHHHHHHHHHHH Q ss_conf 999998738981----------5523--9989999999874 Q gi|254780536|r 284 TGLAKRIDKSLS----------AVSI--SKVEDIDLALRSI 312 (314) Q Consensus 284 t~l~~~~~~~~~----------~~~~--~~~~~~~~~l~~~ 312 (314) +.|+.||.|.=. +++. +-.+|+-..|++| T Consensus 480 ~TLIDKI~~~D~~~~~~hqPCCTV~~NAKG~~DI~~~~K~~ 520 (538) T TIGR02816 480 STLIDKIVKTDNVSSVSHQPCCTVAANAKGSEDIVSLLKAL 520 (538) T ss_pred CHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 30212100247843001576445445677850089999999 No 11 >cd07225 Pat_PNPLA6_PNPLA7 Patatin-like phospholipase domain containing protein 6 and protein 7. Patatin-like phospholipase domain containing protein 6 (PNPLA6) and protein 7 (PNPLA7) are 60% identical to each other. PNPLA6 is commonly known as Neuropathy Target Esterase (NTE). NTE has at least two functional domains: the N-terminal domain putatively regulatory domain and the C-terminal catalytic domain which shows esterase activity. NTE shows phospholipase activity for lysophosphatidylcholine (LPC) and phosphatidylcholine (PC). Exposure of NTE to organophosphates leads to organophosphate-induced delayed neurotoxicity (OPIDN). OPIDN is a progressive neurological condition that is characterized by weakness, paralysis, pain, and paresthesia. PNPLA7 is an insulin-regulated phospholipase that is homologous to Neuropathy Target Esterase (NTE or PNPLA6) and is also known as NTE-related esterase (NRE). Human NRE is predominantly expressed in prostate, white adipose, and pancreatic tissue. NRE Probab=90.49 E-value=0.33 Score=27.20 Aligned_cols=47 Identities=26% Similarity=0.503 Sum_probs=35.9 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHH-HHHHCCCCCHHHHHHHHH Q ss_conf 6666547899997188764354424665442012-225317865023333345 Q gi|254780536|r 66 TAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYT-ALCAAKAFSLSDTIRLVR 117 (314) Q Consensus 66 ~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~a-Al~~aG~ls~eda~~lv~ 117 (314) -+.+++..+.|.+.|| +||++.|-|.|-+. |++|+|. +.++..+.+. T Consensus 27 G~AHiGVLkaLeE~GI----~id~I~GtS~GAivgalyA~g~-~~~~~~~~~~ 74 (306) T cd07225 27 GCAHIGVIKALEEAGI----PVDMVGGTSIGAFIGALYAEER-NISRMKQRAR 74 (306) T ss_pred HHHHHHHHHHHHHCCC----CCCEEEECHHHHHHHHHHHCCC-CHHHHHHHHH T ss_conf 9999999999998699----9319983039899999998799-9999999999 No 12 >PRK10279 hypothetical protein; Provisional Probab=90.35 E-value=0.19 Score=28.65 Aligned_cols=38 Identities=32% Similarity=0.476 Sum_probs=32.0 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHH-HHHHCCCCC Q ss_conf 666547899997188764354424665442012-225317865 Q gi|254780536|r 67 AVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYT-ALCAAKAFS 108 (314) Q Consensus 67 ~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~a-Al~~aG~ls 108 (314) ..+++..+.|++.|| +||+++|-|+|-+. |+|++|.++ T Consensus 18 ~AHiGvl~aLee~gI----~id~I~GtS~GAivga~yA~g~~~ 56 (300) T PRK10279 18 WSHIGVINALKKVGI----EIDIVAGCSIGSLVGAAYACDRLS 56 (300) T ss_pred HHHHHHHHHHHHCCC----CCCEEEEECHHHHHHHHHHCCCCH T ss_conf 999999999998699----977898407999999999879968 No 13 >cd07227 Pat_Fungal_NTE1 Fungal patatin-like phospholipase domain containing protein 6. These are fungal Neuropathy Target Esterase (NTE), commonly referred to as NTE1. Patatin-like phospholipase. NTE has at least two functional domains: the N-terminal domain putatively regulatory domain and the C-terminal catalytic domain which shows esterase activity. NTE shows phospholipase activity for lysophosphatidylcholine (LPC) and phosphatidylcholine (PC). Exposure of NTE to organophosphates leads to organophosphate-induced delayed neurotoxicity (OPIDN). OPIDN is a progressive neurological condition that is characterized by weakness, paralysis, pain, and paresthesia. This family includes NTE1 from fungi. Probab=87.55 E-value=0.54 Score=25.88 Aligned_cols=41 Identities=24% Similarity=0.514 Sum_probs=32.5 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHH-HHHHCCCCCHHHH Q ss_conf 666547899997188764354424665442012-2253178650233 Q gi|254780536|r 67 AVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYT-ALCAAKAFSLSDT 112 (314) Q Consensus 67 ~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~a-Al~~aG~ls~eda 112 (314) +.+++..+.|++.|| ++|++.|-|.|-+. |++++|. +..+. T Consensus 23 ~AHiGvLkaLeE~gI----~iD~I~GtS~GAivGalyA~~~-~~~~~ 64 (269) T cd07227 23 ISHIGILQALEEAGI----PIDAIGGTSIGSFVGGLYAREA-DLVPI 64 (269) T ss_pred HHHHHHHHHHHHCCC----CCCEEEEECHHHHHHHHHHCCC-CHHHH T ss_conf 999999999998699----8308966069999899998299-88999 No 14 >cd07210 Pat_hypo_W_succinogenes_WS1459_like Hypothetical patatin similar to WS1459 of Wolinella succinogenes. Patatin-like phospholipase. This family predominantly consists of bacterial patatin glycoproteins. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein, but it also has the enzymatic activity of a lipid acyl hydrolase, catalyzing the cleavage of fatty acids from membrane lipids. Members of this family have also been found in vertebrates. Probab=85.25 E-value=0.79 Score=24.89 Aligned_cols=46 Identities=22% Similarity=0.309 Sum_probs=35.1 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHH-HHCCCCCHHHHHHHH Q ss_conf 666654789999718876435442466544201222-531786502333334 Q gi|254780536|r 66 TAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTAL-CAAKAFSLSDTIRLV 116 (314) Q Consensus 66 ~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl-~~aG~ls~eda~~lv 116 (314) .+.+++..+.|.+.|+ +|+.+.|-|.|-+.|+ +++| .+.++-.++. T Consensus 12 G~ahiGvlkaL~e~gi----~~d~i~GtSaGAi~ga~~a~G-~~~~e~~~~~ 58 (221) T cd07210 12 FYAHLGFLAALLEMGL----EPSAISGTSAGALVGGLFASG-ISPDEMAELL 58 (221) T ss_pred HHHHHHHHHHHHHCCC----CCCEEEEECHHHHHHHHHHCC-CCHHHHHHHH T ss_conf 9999999999997799----988899988999999999859-9999999999 No 15 >TIGR00755 ksgA dimethyladenosine transferase; InterPro: IPR011530 This family of proteins represent the rRNA adenine dimethylases (e.g. KsgA). The bacterial enzyme KsgA catalyzes the transfer of a total of four methyl groups from S-adenosyl-l-methionine (S-AdoMet) to two adjacent adenosine bases in 16S rRNA. This enzyme and the resulting modified adenosine bases appear to be conserved in all species of eubacteria, eukaryotes, and archaea, and in eukaryotic organelles. Bacterial resistance to the aminoglycoside antibiotic kasugamycin involves inactivation of KsgA and resulting loss of the dimethylations, with modest consequences to the overall fitness of the organism. In contrast, the yeast ortholog, Dim1, is essential. In yeast, and presumably in other eukaryotes, the enzyme performs a vital role in pre-rRNA processing in addition to its methylating activity. The best conserved region in these enzymes is located in the N-terminal section and corresponds to a region that is probably involved in S-adenosyl methionine (SAM) binding. The crystal structure of KsgA from Escherichia coli has been solved to a resolution of 2.1A. It bears a strong similarity to the crystal structure of ErmC' from Bacillus stearothermophilus and a lesser similarity to the yeast mitochondrial transcription factor, sc-mtTFB .; GO: 0016433 rRNA (adenine) methyltransferase activity, 0006364 rRNA processing. Probab=84.64 E-value=2.1 Score=22.31 Aligned_cols=13 Identities=31% Similarity=0.340 Sum_probs=7.2 Q ss_pred CCCCCCCHHHHHH Q ss_conf 6654420122253 Q gi|254780536|r 91 AGHSLGEYTALCA 103 (314) Q Consensus 91 ~GhSlGE~aAl~~ 103 (314) +|--+|-+|...+ T Consensus 36 IGPG~G~LT~~Ll 48 (277) T TIGR00755 36 IGPGLGALTEPLL 48 (277) T ss_pred ECCCCHHHHHHHH T ss_conf 7388207899999 No 16 >COG1752 RssA Predicted esterase of the alpha-beta hydrolase superfamily [General function prediction only] Probab=81.82 E-value=1.4 Score=23.38 Aligned_cols=39 Identities=31% Similarity=0.451 Sum_probs=31.0 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHH-HHHCCCCC Q ss_conf 66665478999971887643544246654420122-25317865 Q gi|254780536|r 66 TAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTA-LCAAKAFS 108 (314) Q Consensus 66 ~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aA-l~~aG~ls 108 (314) -+.+++..+.|.+.|+ +|+.+.|-|.|.+.| ++++|... T Consensus 23 G~~hiGVl~aL~e~gi----~~~~iaGtS~GAiva~l~A~g~~~ 62 (306) T COG1752 23 GAAHIGVLKALEEAGI----PIDVIAGTSAGAIVAALYAAGMDE 62 (306) T ss_pred HHHHHHHHHHHHHCCC----CCCEEEEECHHHHHHHHHHCCCCC T ss_conf 9989999999997699----988798738889999999816981 No 17 >cd07207 Pat_ExoU_VipD_like ExoU and VipD-like proteins; homologus to patatin, cPLA2, and iPLA2. ExoU, a 74-kDa enzyme, is a potent virulence factor of Pseudomonas aeruginosa. One of the pathogenic mechanisms of P. aeruginosa is to induce cytotoxicity by the injection of effector proteins (e.g. ExoU) using the type III secretion (T3S) system. ExoU is homologus to patatin and also has the conserved catalytic residues of mammalian calcium-independent (iPLA2) and cytosolic (cPLA2) PLA2. In vitro, ExoU cytotoxity is blocked by the inhibitor of cytosolic and Ca2-independent phospholipase A2 (cPLA2 and iPLA2) enzymes, suggesting that phospholipase A2 inhibitors may represent a novel mode of treatment for acute P. aeruginosa infections. ExoU requires eukaryotic superoxide dismutase as a cofactor and cleaves phosphatidylcholine and phosphatidylethanolamine in vitro. VipD, a 69-kDa cytosolic protein, belongs to the members of Legionella pneumophila family and is homologus to ExoU from Pseudomona Probab=81.74 E-value=1.2 Score=23.72 Aligned_cols=45 Identities=27% Similarity=0.335 Sum_probs=33.9 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHH Q ss_conf 6665478999971887643544246654420122253178650233333 Q gi|254780536|r 67 AVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALCAAKAFSLSDTIRL 115 (314) Q Consensus 67 ~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~l 115 (314) +.+++.++.|.+.|+ +|+.+.|-|.|=+.|+..|--.+.++..++ T Consensus 12 ~~~~Gvl~aL~e~g~----~~~~i~GtSaGAi~a~~la~g~~~~~~~~~ 56 (194) T cd07207 12 IAYIGALKALEEAGI----LKKRVAGTSAGAITAALLALGYSAADIKDI 56 (194) T ss_pred HHHHHHHHHHHHCCC----CCCEEEEECHHHHHHHHHHCCCCHHHHHHH T ss_conf 999999999997799----966798877999999999829999999999 No 18 >PRK11126 acyl-CoA thioester hydrolase YfbB; Provisional Probab=79.25 E-value=0.9 Score=24.54 Aligned_cols=29 Identities=21% Similarity=0.232 Sum_probs=22.6 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHC Q ss_conf 789999718876435442466544201222531 Q gi|254780536|r 72 FIRVMEKNGLCVKRDISYVAGHSLGEYTALCAA 104 (314) Q Consensus 72 ~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~a 104 (314) +.+++...|+ ++-.++|||+|=..|+..| T Consensus 56 l~~~l~~l~~----~~~~LvGhSmGG~vAl~~A 84 (242) T PRK11126 56 LSQTLVSYNI----LPFWLVGYSLGGRIAMYYA 84 (242) T ss_pred HHHHHHHCCC----CCEEEEEECHHHHHHHHHH T ss_conf 9999998599----9759998677999999999 No 19 >TIGR01286 nifK nitrogenase molybdenum-iron protein beta chain; InterPro: IPR005976 The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene , . Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes . It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I .; GO: 0016163 nitrogenase activity, 0009399 nitrogen fixation, 0016612 molybdenum-iron nitrogenase complex. Probab=78.01 E-value=3 Score=21.39 Aligned_cols=55 Identities=15% Similarity=0.143 Sum_probs=36.1 Q ss_pred HHHHHHH-HHHCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 6547899-997188764354424665442012225317865023333345688653100111122222123 Q gi|254780536|r 69 SMAFIRV-MEKNGLCVKRDISYVAGHSLGEYTALCAAKAFSLSDTIRLVRARGKSMQEAVPPGLGSMVAII 138 (314) Q Consensus 69 ~~a~~~~-l~~~gi~~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~R~~~m~~~~~~~~g~M~aV~ 138 (314) .+|+|+. |.++-- +|.-.+=-||=| -|+|..|.=.+.||+. ++..-.+-.|+||. T Consensus 94 CVAYFRshl~RHFk----EPvSAvS~SMTE-dAAVFGGl~NmvdGL~----------N~~alYkPkmIAVS 149 (526) T TIGR01286 94 CVAYFRSHLARHFK----EPVSAVSSSMTE-DAAVFGGLKNMVDGLQ----------NIYALYKPKMIAVS 149 (526) T ss_pred HHHHHHHHCCCCCC----CCCCCCCCCCCH-HHHHHCCHHHHHHHHH----------HHHHHCCCCEEEEC T ss_conf 27877621113475----888766776413-5787425677999997----------40432589646630 No 20 >cd07204 Pat_PNPLA_like Patatin-like phospholipase domain containing protein family. Members of this family share a patain domain, initially discovered in potato tubers. PNPLA protein members show non-specific hydrolase activity with a variety of substrates such as triacylglycerol, phospholipids, and retinylesters. It contains the lipase consensus sequence (Gly-X-Ser-X-Gly). Nomenclature of PNPLA family could be misleading as some of the mammalian members of this family show hydrolase, but no phospholipase activity. Probab=76.93 E-value=2.8 Score=21.59 Aligned_cols=54 Identities=13% Similarity=0.188 Sum_probs=38.8 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHH Q ss_conf 566665478999971887643544246654420122253178650233333456 Q gi|254780536|r 65 LTAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALCAAKAFSLSDTIRLVRA 118 (314) Q Consensus 65 i~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~ 118 (314) +....++.++.|+++|.....++..+.|-|-|-++|++.+-.+++++..+.+.. T Consensus 10 lg~YHlGV~~~L~e~g~~l~~~~~~iaGaSAGAl~aa~~~~g~~~~~~~~~~~~ 63 (243) T cd07204 10 LGIYHVGVASALREHAPRLLQNARRIAGASAGAIVAAVVLCGVSMEEACSFILK 63 (243) T ss_pred HHHHHHHHHHHHHHHCHHHHCCCCEEEEEHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 899999999999983714424677699985889999999928999999999999 No 21 >COG4188 Predicted dienelactone hydrolase [General function prediction only] Probab=76.86 E-value=2.1 Score=22.28 Aligned_cols=25 Identities=40% Similarity=0.360 Sum_probs=20.1 Q ss_pred CCCCCCCCCCCCCCHHHHHHCCCCC Q ss_conf 4354424665442012225317865 Q gi|254780536|r 84 KRDISYVAGHSLGEYTALCAAKAFS 108 (314) Q Consensus 84 ~~~p~~v~GhSlGE~aAl~~aG~ls 108 (314) +..+-.++|||+|=|+++..+|+=. T Consensus 157 d~~~Vgv~GhS~GG~T~m~laGA~~ 181 (365) T COG4188 157 DPQRVGVLGHSFGGYTAMELAGAEL 181 (365) T ss_pred CCCCEEEEECCCCCHHHHHHCCCCC T ss_conf 7554378722666589998503530 No 22 >cd07209 Pat_hypo_Ecoli_Z1214_like Hypothetical patatin similar to Z1214 protein of Escherichia coli. Patatin-like phospholipase similar to Z1214 protein of Escherichia coli. This family predominantly consists of bacterial patatin glycoproteins and some representatives from eukaryotes and archaea. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein, but it also has the enzymatic activity of a lipid acyl hydrolase, catalyzing the cleavage of fatty acids from membrane lipids. Members of this family have also been found in vertebrates. Probab=75.49 E-value=1.7 Score=22.79 Aligned_cols=36 Identities=17% Similarity=0.375 Sum_probs=29.0 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHH-HCC Q ss_conf 6666547899997188764354424665442012225-317 Q gi|254780536|r 66 TAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALC-AAK 105 (314) Q Consensus 66 ~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~-~aG 105 (314) -+.+++.++.|.+.|+ ++|.++|-|.|-+.|+. ++| T Consensus 10 Ga~~~Gvl~aL~e~gi----~~d~i~GtS~GAi~aa~~a~~ 46 (215) T cd07209 10 GAYQAGVLKALAEAGI----EPDIISGTSIGAINGALIAGG 46 (215) T ss_pred HHHHHHHHHHHHHCCC----CCCEEEEECHHHHHHHHHHCC T ss_conf 9999999999997699----988899988999999999849 No 23 >cd07198 Patatin Patatin-like phospholipase. Patatin is a storage protein of the potato tuber that shows Phospholipase A2 activity (PLA2; EC 3.1.1.4). Patatin catalyzes the nonspecific hydrolysis of phospholipids, glycolipids, sulfolipids, and mono- and diacylglycerols, thereby showing lipid acyl hydrolase activity. The active site includes an oxyanion hole with a conserved GGxR motif; it is found in almost all the members of this family. The catalytic dyad is formed by a serine and an aspartate. Patatin belongs to the alpha-beta hydrolase family which is identified by a characteristic nucleophile elbow with a consensus sequence of Sm-X-Nu-Sm (Sm = small residue, X = any residue and Nu = nucleophile). Members of this family have been found also in vertebrates. This family includes PNPLA (1-9), TGL (3-5), ExoU-like, and SDP1-like subfamilies. There are some additional hypothetical proteins included in this family. Probab=75.36 E-value=3.8 Score=20.74 Aligned_cols=40 Identities=23% Similarity=0.322 Sum_probs=30.3 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHH-HCCCCCHHH Q ss_conf 666547899997188764354424665442012225-317865023 Q gi|254780536|r 67 AVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALC-AAKAFSLSD 111 (314) Q Consensus 67 ~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~-~aG~ls~ed 111 (314) +.+++..+.|.+.|+ +++.+.|-|.|-+.|+. ++|. +.++ T Consensus 11 ~~~~Gvl~~L~e~gi----~~d~i~GtSaGAi~a~~~a~g~-~~~~ 51 (172) T cd07198 11 IYHVGVAKALRERGP----LIDIIAGTSAGAIVAALLASGR-DLEE 51 (172) T ss_pred HHHHHHHHHHHHCCC----CCCEEEEECHHHHHHHHHHHCC-CHHH T ss_conf 999999999997699----9878987450288999998099-9999 No 24 >PRK10673 hypothetical protein; Provisional Probab=74.55 E-value=1.7 Score=22.88 Aligned_cols=30 Identities=17% Similarity=0.377 Sum_probs=22.5 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHCC Q ss_conf 7899997188764354424665442012225317 Q gi|254780536|r 72 FIRVMEKNGLCVKRDISYVAGHSLGEYTALCAAK 105 (314) Q Consensus 72 ~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~aG 105 (314) +..++...|+ +...++|||+|=..|+..|. T Consensus 71 l~~lld~lgi----~~~~lvGhSmGG~va~~~A~ 100 (255) T PRK10673 71 LLDTLDALQI----DKATFIGHSMGGKAVMALTA 100 (255) T ss_pred HHHHHHHHCC----CCEEEEEECHHHHHHHHHHH T ss_conf 9999998399----96478526604899999998 No 25 >TIGR03056 bchO_mg_che_rel putative magnesium chelatase accessory protein. Members of this family belong to the alpha/beta fold family hydrolases (pfam00561). Members are found in bacterial genomes if and only if they encoded for anoxygenic photosynthetic systems similar to that of Rhodobacter capsulatus and other alpha-Proteobacteria. Members often are encoded in the same operon as subunits of the protoporphyrin IX magnesium chelatase, and were once designated BchO. No literature supports a role as an actual subunit of magnesium chelatase, but an accessory role is possible, as suggested by placement by its probable hydrolase activity. Probab=73.49 E-value=1.8 Score=22.71 Aligned_cols=30 Identities=33% Similarity=0.327 Sum_probs=23.1 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHCC Q ss_conf 7899997188764354424665442012225317 Q gi|254780536|r 72 FIRVMEKNGLCVKRDISYVAGHSLGEYTALCAAK 105 (314) Q Consensus 72 ~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~aG 105 (314) +..++.+.|+ ++..++|||+|=..|+..|. T Consensus 85 l~~ll~~l~i----~~~~lvGhS~Gg~vA~~~A~ 114 (278) T TIGR03056 85 LSALCAAEGL----SPDGVIGHSAGAAIALRLAL 114 (278) T ss_pred HHHHHHHCCC----CCEEEEECCCHHHHHHHHHH T ss_conf 9998986299----86599945417999999999 No 26 >PRK00274 ksgA dimethyladenosine transferase; Reviewed Probab=71.52 E-value=6.6 Score=19.29 Aligned_cols=12 Identities=33% Similarity=0.390 Sum_probs=6.3 Q ss_pred CCCCCCCCHHHH Q ss_conf 466544201222 Q gi|254780536|r 90 VAGHSLGEYTAL 101 (314) Q Consensus 90 v~GhSlGE~aAl 101 (314) =+|-..|-+|.. T Consensus 45 EIGpG~G~LT~~ 56 (267) T PRK00274 45 EIGPGLGALTEP 56 (267) T ss_pred EECCCCCHHHHH T ss_conf 963888889999 No 27 >pfam07082 DUF1350 Protein of unknown function (DUF1350). This family consists of several hypothetical proteins from both cyanobacteria and plants. Members of this family are typically around 250 residues in length. The function of this family is unknown but the species distribution indicates that the family may be involved in photosynthesis. Probab=69.84 E-value=5.5 Score=19.79 Aligned_cols=74 Identities=18% Similarity=0.178 Sum_probs=37.6 Q ss_pred HHHCH--HHHHHHHHHHHHCCCCHHHHHCCCCHHHHHCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHH Q ss_conf 98599--9999999999883989789853898678304676503456666547899997188764354424665442012 Q gi|254780536|r 22 YDSFP--EARLVFEEVDHTLNQNLSDLMWNGSQEELTATCNAQPALTAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYT 99 (314) Q Consensus 22 ~~~~p--~~r~~~~~~~~~l~~~l~~~~~~~~~~~l~~~~~~Qp~i~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~a 99 (314) ..+.| .||..++...+ -|+-+...-+.. .++....+.-+...+.-++..+.++.|..+..-|.+-+|||+|=.- T Consensus 28 ~ga~P~vtYr~lLe~L~~-~g~~ViAtpy~~---~fDH~~iA~~v~~~F~~~~~~L~~~~g~~~~~LPv~gvGHSlGckL 103 (250) T pfam07082 28 VATAPQLTYRWLLEHLGE-AGYVVIATPFVN---TFDHGAIALSVLNKFEYALERLVHRGGYPPAYLPIYGLGHSMGCKL 103 (250) T ss_pred HCCCCHHHHHHHHHHHHH-CCCEEEEECCCC---CCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEECCCCCCCCHH T ss_conf 426828899999999987-896899923678---9877999999999999999999973499844475231343323165 No 28 >cd07206 Pat_TGL3-4-5_SDP1 Triacylglycerol lipase 3, 4, and 5 and Sugar-Dependent 1 lipase. Triacylglycerol lipases are involved in triacylglycerol mobilization and degradation; they are found in lipid particles. TGL4 is 30% homologus to TGL3, whereas TGL5 is 26% homologus to TGL3. Sugar-Dependent 1 (SDP1) lipase has a patatin-like acyl-hydrolase domain that initiates the breakdown of storage oil in germinating Arabidopsis seeds. This family includes subfamilies of proteins: TGL3, TGL4, TGL5, and SDP1. Probab=69.18 E-value=6 Score=19.55 Aligned_cols=45 Identities=11% Similarity=0.214 Sum_probs=33.1 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHC--------CCCCHHHHHHH Q ss_conf 66654789999718876435442466544201222531--------78650233333 Q gi|254780536|r 67 AVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALCAA--------KAFSLSDTIRL 115 (314) Q Consensus 67 ~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~a--------G~ls~eda~~l 115 (314) ...++..+.|-+.|+. |..+.|-|.|-+.|+.++ |-++|+||.+. T Consensus 82 ~~H~GVvkaL~e~~LL----PrVIsGsS~GaivAa~~~t~tdeELlgDlTF~EAy~r 134 (298) T cd07206 82 LFHLGVVKALWEQDLL----PRVISGSSAGAIVAALLGTHTDEELIGDLTFQEAYER 134 (298) T ss_pred HHHHHHHHHHHHCCCC----CCEECCCCHHHHHHHHHHCCCHHHHCCCCCHHHHHHH T ss_conf 5789999999974888----7531057557888989851671887077879999998 No 29 >COG3208 GrsT Predicted thioesterase involved in non-ribosomal peptide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism] Probab=67.84 E-value=2.4 Score=21.91 Aligned_cols=22 Identities=23% Similarity=0.300 Sum_probs=15.1 Q ss_pred CCCCCCCCCCCCCHHHHHHCCCCCHHHHHH Q ss_conf 354424665442012225317865023333 Q gi|254780536|r 85 RDISYVAGHSLGEYTALCAAKAFSLSDTIR 114 (314) Q Consensus 85 ~~p~~v~GhSlGE~aAl~~aG~ls~eda~~ 114 (314) .+|=++.||||| |++.||-+.+ T Consensus 73 d~P~alfGHSmG--------a~lAfEvArr 94 (244) T COG3208 73 DAPFALFGHSMG--------AMLAFEVARR 94 (244) T ss_pred CCCEEECCCCHH--------HHHHHHHHHH T ss_conf 997642202504--------7999999999 No 30 >TIGR03611 RutD pyrimidine utilization protein D. This protein is observed in operons extremely similar to that characterized in E. coli K-12 responsible for the import and catabolism of pyrimidines, primarily uracil. This protein is a member of the hydrolase, alpha/beta fold family defined by pfam00067. Probab=67.74 E-value=3.1 Score=21.31 Aligned_cols=29 Identities=24% Similarity=0.454 Sum_probs=21.5 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHC Q ss_conf 789999718876435442466544201222531 Q gi|254780536|r 72 FIRVMEKNGLCVKRDISYVAGHSLGEYTALCAA 104 (314) Q Consensus 72 ~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~a 104 (314) +..++.+.|+ +.-.++|||+|=+.|+..| T Consensus 70 l~~~l~~l~~----~~~~lvG~S~Gg~va~~~a 98 (257) T TIGR03611 70 VLQLLDALGI----ERFHFVGHALGGLIGLQLA 98 (257) T ss_pred HHHHHHHCCC----CCCEEEEECCCHHHHHHHH T ss_conf 9999997699----8726998477289999999 No 31 >cd07228 Pat_NTE_like_bacteria Bacterial patatin-like phospholipase domain containing protein 6. Bacterial patatin-like phospholipase domain containing protein 6. PNPLA6 is commonly known as Neuropathy Target Esterase (NTE). NTE has at least two functional domains: the N-terminal domain putatively regulatory domain and the C-terminal catalytic domain which shows esterase activity. NTE shows phospholipase activity for lysophosphatidylcholine (LPC) and phosphatidylcholine (PC). Exposure of NTE to organophosphates leads to organophosphate-induced delayed neurotoxicity (OPIDN). OPIDN is a progressive neurological condition that is characterized by weakness, paralysis, pain, and paresthesia. This group includes YCHK and rssA from Escherichia coli as well as Ylbk from Bacillus amyloliquefaciens. Probab=67.54 E-value=5.5 Score=19.77 Aligned_cols=36 Identities=31% Similarity=0.573 Sum_probs=28.3 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHH-HHHCCC Q ss_conf 6665478999971887643544246654420122-253178 Q gi|254780536|r 67 AVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTA-LCAAKA 106 (314) Q Consensus 67 ~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aA-l~~aG~ 106 (314) +.+.+.++.|.+.|+ +|+.+.|-|.|-+.| ++++|. T Consensus 13 ~~~~Gvl~~L~e~gi----~~d~i~GtSaGAi~aa~~a~g~ 49 (175) T cd07228 13 WAHIGVLRALEEEGI----EIDIIAGSSIGALVGALYAAGH 49 (175) T ss_pred HHHHHHHHHHHHCCC----CCCEEEEECHHHHHHHHHHCCC T ss_conf 999999999998699----8678986448999999998189 No 32 >cd04436 DEP_fRgd2 DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in fungal RhoGAP (GTPase-activator protein) Rgd2-like proteins. Rgd2-like proteins share a common domain architecture, containing, beside the RhoGAP domain, a DEP and a FCH (Fes/CIP4 homology) domain. Yeast Rgd2 is a GAP protein for Cdc42 and Rho5. Probab=66.75 E-value=8.9 Score=18.53 Aligned_cols=59 Identities=29% Similarity=0.367 Sum_probs=40.8 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCC-CCHHHHH-HHHHHCC Q ss_conf 37888874222320287765420457871001002332222204485-0199999-9999779 Q gi|254780536|r 210 VSKVMKWMLDSVTKQDPVVPILPNFCASPVSSIDEISRLLVEQVTGR-VRWRETI-QWFANHG 270 (314) Q Consensus 210 ~~~~~~~~l~~i~~~~p~ipi~S~~~g~~~~~~~~~~~~l~~ql~~p-V~f~~~i-~~l~~~g 270 (314) .++-+...|++++..+-++|++.++ +-..+..++.+++-++|-.+ +...+.+ +.+.+.| T Consensus 2 ~k~lL~~mL~~ipl~~~KvPilgty--~ntstGs~Iv~wl~~nm~~~~l~~AE~fGQDLv~~G 62 (84) T cd04436 2 LKELLAAMLKEIPLADYKVPILGTY--QNTSSGSEIVSWLQENMPEKDLDAAEAFGQDLLNQG 62 (84) T ss_pred HHHHHHHHHHHCCCCCEEEEEEECC--CCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCC T ss_conf 7999999997387766245565112--577778999999998479876799999989999675 No 33 >cd07205 Pat_PNPLA6_PNPLA7_NTE1_like Patatin-like phospholipase domain containing protein 6, protein 7, and fungal NTE1. Patatin-like phospholipase domain containing protein 6 (PNPLA6) and protein 7 (PNPLA7) are included in this family. PNPLA6 is commonly known as Neuropathy Target Esterase (NTE). NTE has at least two functional domains: the N-terminal domain putatively regulatory domain and the C-terminal catalytic domain which shows esterase activity. NTE shows phospholipase activity for lysophosphatidylcholine (LPC) and phosphatidylcholine (PC). Exposure of NTE to organophosphates leads to organophosphate-induced delayed neurotoxicity (OPIDN). OPIDN is a progressive neurological condition that is characterized by weakness, paralysis, pain, and paresthesia. PNPLA7 is an insulin-regulated phospholipase that is homologus to Neuropathy Target Esterase (NTE or PNPLA6) and is also known as NTE-related esterase (NRE). Human NRE is predominantly expressed in prostate, white adipose, and panc Probab=66.13 E-value=6.4 Score=19.39 Aligned_cols=46 Identities=22% Similarity=0.378 Sum_probs=32.5 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHH-HCCCCCHHHHHHHHH Q ss_conf 666547899997188764354424665442012225-317865023333345 Q gi|254780536|r 67 AVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALC-AAKAFSLSDTIRLVR 117 (314) Q Consensus 67 ~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~-~aG~ls~eda~~lv~ 117 (314) +.+.+.++.|.+.++ ++|.+.|-|-|-+.|+. ++| .+.++..+... T Consensus 13 ~~~~Gvl~~L~e~~~----~~d~i~GtSaGal~a~~la~g-~~~~~~~~~~~ 59 (175) T cd07205 13 LAHIGVLKALEEAGI----PIDIVSGTSAGAIVGALYAAG-YSPEEIEERAK 59 (175) T ss_pred HHHHHHHHHHHHCCC----CCCEEEEECHHHHHHHHHHCC-CCHHHHHHHHH T ss_conf 999999999997799----855899863788999999859-99999999999 No 34 >COG0030 KsgA Dimethyladenosine transferase (rRNA methylation) [Translation, ribosomal structure and biogenesis] Probab=64.81 E-value=11 Score=17.98 Aligned_cols=11 Identities=36% Similarity=0.483 Sum_probs=6.1 Q ss_pred CCCCCCCHHHH Q ss_conf 66544201222 Q gi|254780536|r 91 AGHSLGEYTAL 101 (314) Q Consensus 91 ~GhSlGE~aAl 101 (314) +|-..|-+|-. T Consensus 37 IGpG~GaLT~~ 47 (259) T COG0030 37 IGPGLGALTEP 47 (259) T ss_pred ECCCCCHHHHH T ss_conf 78987788999 No 35 >cd07224 Pat_like Patatin-like phospholipase. Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein, but it also has the enzymatic activity of lipid acyl hydrolase, catalysing the cleavage of fatty acids from membrane lipids. Members of this family have been found also in vertebrates. Probab=64.67 E-value=11 Score=18.06 Aligned_cols=54 Identities=19% Similarity=0.350 Sum_probs=37.6 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 56666547899997188764354424665442012225317865023333345688 Q gi|254780536|r 65 LTAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALCAAKAFSLSDTIRLVRARG 120 (314) Q Consensus 65 i~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~R~ 120 (314) +....++.+..|+++|+.. +..-+.|-|-|-++|++.+--++.++..+.+..-+ T Consensus 10 lg~YHlGV~~~L~e~g~~~--~~~~~aGaSAGAl~aa~~~~g~~~~~~~~~~~~~~ 63 (233) T cd07224 10 LFPYHLGVLSLLIEAGVIN--ETTPLAGASAGSLAAACSASGLSPEEALEATEELA 63 (233) T ss_pred HHHHHHHHHHHHHHCCCCC--CCCEEEEECHHHHHHHHHHCCCCHHHHHHHHHHHH T ss_conf 7979999999999717323--35647886588999999974999999999999999 No 36 >cd07213 Pat17_PNPLA8_PNPLA9_like1 Patatin-like phospholipase. Patatin is a storage protein of the potato tuber that shows Phospholipase A2 activity (PLA2; EC 3.1.1.4). Patatin catalyzes the nonspecific hydrolysis of phospholipids, glycolipids, sulfolipids, and mono- and diacylglycerols, thereby showing lipid acyl hydrolase activity. The active site includes an oxyanion hole with a conserved GGxR motif; it is found in almost all the members of this family. The catalytic dyad is formed by a serine and an aspartate. Patatin belongs to the alpha-beta hydrolase family which is identified by a characteristic nucleophile elbow with a consensus sequence of Sm-X-Nu-Sm (Sm = small residue, X = any residue and Nu = nucleophile). Members of this family have been found also in vertebrates. This family includes subfamily of PNPLA8 (iPLA2-gamma) and PNPLA9 (iPLA2-beta) like phospholipases from human as well as the Pat17 isozyme from Solanum cardiophyllum. Probab=61.04 E-value=10 Score=18.12 Aligned_cols=48 Identities=21% Similarity=0.270 Sum_probs=32.8 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 789999718876435442466544201222531786502333334568 Q gi|254780536|r 72 FIRVMEKNGLCVKRDISYVAGHSLGEYTALCAAKAFSLSDTIRLVRAR 119 (314) Q Consensus 72 ~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~R 119 (314) +.+.+.+.+-.+....|+++|-|.|-+.|+..+--++.++..++-... T Consensus 20 iL~~le~~~~~~~~~fDliaGTStGgiiA~~la~g~~~~~i~~~y~~~ 67 (288) T cd07213 20 LLKRLAEEFPSFLDQIDLFAGTSAGSLIALGLALGYSPRQVLKLYEEV 67 (288) T ss_pred HHHHHHHHCCCCHHHCCEEEECCHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 999999865980344377897389899999995699999999999985 No 37 >cd07231 Pat_SDP1-like Sugar-Dependent 1 like lipase. Sugar-Dependent 1 (SDP1) lipase has a patatin-like acyl-hydrolase domain that initiates the breakdown of storage oil in germinating Arabidopsis seeds. This acyl-hydrolase domain is homologus to yeast triacylglycerol lipase 3 and human adipose triglyceride lipase. This family includes SDP1 from Arabidopsis thaliana. Probab=60.74 E-value=10 Score=18.14 Aligned_cols=33 Identities=21% Similarity=0.331 Sum_probs=25.5 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHC Q ss_conf 6654789999718876435442466544201222531 Q gi|254780536|r 68 VSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALCAA 104 (314) Q Consensus 68 ~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~a 104 (314) ..++..+.|-+.|+. |..+.|-|.|-+.|+.++ T Consensus 82 ~H~GVvkaL~e~~LL----PrVIsGsS~GaivAal~~ 114 (323) T cd07231 82 FHVGVVRTLVEHQLL----PRVIAGSSVGSIVCAIIA 114 (323) T ss_pred HHHHHHHHHHHCCCC----CCEECCCCHHHHHHHHHH T ss_conf 789999999974898----753116758889999986 No 38 >pfam04672 DUF574 Protein of unknown function (DUF574). Family of uncharacterized proteins. Probab=60.28 E-value=13 Score=17.48 Aligned_cols=17 Identities=12% Similarity=0.216 Sum_probs=8.8 Q ss_pred HHHHHHHHHHHHHCCCC Q ss_conf 78888742223202877 Q gi|254780536|r 211 SKVMKWMLDSVTKQDPV 227 (314) Q Consensus 211 ~~~~~~~l~~i~~~~p~ 227 (314) .++...++.+...-+|- T Consensus 220 ~~ei~~~f~g~elvePG 236 (268) T pfam04672 220 RAEVARFFDGLELVEPG 236 (268) T ss_pred HHHHHHHCCCCCCCCCC T ss_conf 99999975998408993 No 39 >TIGR03343 biphenyl_bphD 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase. Members of this family are 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase, or HOPD hydrolase, the BphD protein of biphenyl degradation. BphD acts on the product of ring meta-cleavage by BphC. Many species carrying bphC and bphD are capable of degrading polychlorinated biphenyls as well as biphenyl itself. Probab=59.77 E-value=4.8 Score=20.14 Aligned_cols=29 Identities=28% Similarity=0.350 Sum_probs=21.1 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHC Q ss_conf 789999718876435442466544201222531 Q gi|254780536|r 72 FIRVMEKNGLCVKRDISYVAGHSLGEYTALCAA 104 (314) Q Consensus 72 ~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~a 104 (314) +..++...|+ +...++|||+|=..|+..| T Consensus 91 ~~~lld~L~i----~~~~lvG~S~GG~iA~~~A 119 (282) T TIGR03343 91 VKGLMDALDI----EKAHLVGNSMGGATALNFA 119 (282) T ss_pred HHHHHHHCCC----CCEEEEEECCHHHHHHHHH T ss_conf 9999997499----8379997373799999999 No 40 >PRK13777 transcriptional regulator Hpr; Provisional Probab=59.18 E-value=13 Score=17.55 Aligned_cols=20 Identities=20% Similarity=0.366 Sum_probs=8.3 Q ss_pred CCCCCCCHHHHHHCCCCCHHHHH Q ss_conf 66544201222531786502333 Q gi|254780536|r 91 AGHSLGEYTALCAAKAFSLSDTI 113 (314) Q Consensus 91 ~GhSlGE~aAl~~aG~ls~eda~ 113 (314) -|-|.-|++-. |++-..+|+ T Consensus 59 ~gAsISdiAkf---GVMHVSTaF 78 (190) T PRK13777 59 KGASISEIAKF---GVMHVSTAF 78 (190) T ss_pred CCCCHHHHHHH---CCEEEEEHH T ss_conf 68839999870---724630032 No 41 >PRK00870 haloalkane dehalogenase; Provisional Probab=58.42 E-value=5.3 Score=19.90 Aligned_cols=29 Identities=17% Similarity=0.076 Sum_probs=21.7 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHC Q ss_conf 789999718876435442466544201222531 Q gi|254780536|r 72 FIRVMEKNGLCVKRDISYVAGHSLGEYTALCAA 104 (314) Q Consensus 72 ~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~a 104 (314) +..++.+.|+ ++..++|||+|=..|+..| T Consensus 105 l~~lld~Lgl----~~~~LVGhs~GG~vA~~~A 133 (302) T PRK00870 105 MRSWFEQLDL----TGVTLVCQDWGGLIGLRLA 133 (302) T ss_pred HHHHHHHHCC----CCEEEEEEECCCHHHHHHH T ss_conf 9999998499----8579999731608999999 No 42 >pfam09968 DUF2202 Uncharacterized protein domain (DUF2202). This domain, found in various hypothetical archaeal proteins, has no known function. Probab=58.31 E-value=7.4 Score=19.00 Aligned_cols=126 Identities=17% Similarity=0.207 Sum_probs=66.7 Q ss_pred HHHCHHHHHHHHHHHHHCCCCHHHHHCCCCHHHHHCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCH--- Q ss_conf 98599999999999988398978985389867830467650345666654789999718876435442466544201--- Q gi|254780536|r 22 YDSFPEARLVFEEVDHTLNQNLSDLMWNGSQEELTATCNAQPALTAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEY--- 98 (314) Q Consensus 22 ~~~~p~~r~~~~~~~~~l~~~l~~~~~~~~~~~l~~~~~~Qp~i~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~--- 98 (314) .+.....|+..-..-+..+..+..-+.. .|. ...-+...++.+.++. .| +.|-..|++ T Consensus 9 rEEEKLARDVYl~Ly~~~~~~iF~NIa~--SEq------------~HmdaV~~Ll~~Y~l~---dP--~~~~~~G~F~np 69 (162) T pfam09968 9 REEEKLARDVYLTLYNRWGLPVFLNIAR--SEQ------------THMDAVKSLLDKYGLE---DP--ILSETVGVFTNP 69 (162) T ss_pred HHHHHHHHHHHHHHHHHCCCHHHHHHHH--HHH------------HHHHHHHHHHHHCCCC---CC--CCCCCCCCCCCH T ss_conf 9999999999999999828388889999--899------------9999999999983999---99--877867852887 Q ss_pred ------HHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEECCCCCCCEEEEC Q ss_conf ------22253178650233333456886531001111222221235035531122344211354200122322213530 Q gi|254780536|r 99 ------TALCAAKAFSLSDTIRLVRARGKSMQEAVPPGLGSMVAIIGLDDCVVDSICAQASRVGICEIANDNGGGQVVIS 172 (314) Q Consensus 99 ------aAl~~aG~ls~eda~~lv~~R~~~m~~~~~~~~g~M~aV~~~~~~~~~~~~~~~~~~~~~~ia~~Ns~~q~vis 172 (314) ..|+.-|.-|..||+++ |.+++ -.+...+++.+....+... +..|+ .... T Consensus 70 elQ~LYn~L~~~G~~S~~dAL~V----Ga~IE--------------elDI~DL~~~i~~t~n~di--~~Vy~----nL~~ 125 (162) T pfam09968 70 DLQELYNQLVEKGSKSLEDALKV----GATIE--------------DLDIYDLEEAIAETDNQDI--KFVYE----NLMK 125 (162) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH----CCCHH--------------HCCHHHHHHHHHHCCHHHH--HHHHH----HHHH T ss_conf 79999999999733528999983----82022--------------2569999999985080879--99999----9998 Q ss_pred CCCCHHHHHHHHHHHHHH Q ss_conf 331128999888888653 Q gi|254780536|r 173 GLQDAVKCAADTCLNKGA 190 (314) Q Consensus 173 G~~~~l~~~~~~~~~~~~ 190 (314) |.+..+.++...+++.|. T Consensus 126 GS~NHLraF~r~l~~~G~ 143 (162) T pfam09968 126 GSRNHLRSFVRALENLGG 143 (162) T ss_pred HHHHHHHHHHHHHHHCCC T ss_conf 708889999999997798 No 43 >pfam03246 Pneumo_ncap Pneumovirus nucleocapsid protein. Probab=57.91 E-value=7.7 Score=18.88 Aligned_cols=24 Identities=33% Similarity=0.452 Sum_probs=10.5 Q ss_pred CHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 012225317865023333345688 Q gi|254780536|r 97 EYTALCAAKAFSLSDTIRLVRARG 120 (314) Q Consensus 97 E~aAl~~aG~ls~eda~~lv~~R~ 120 (314) |+.-.|++-.+.-|+..++.+.-| T Consensus 64 eIGmqY~~s~LG~e~t~qILr~sG 87 (391) T pfam03246 64 EIGMQYAMSALGREDTIKILRDAG 87 (391) T ss_pred HHHHHHHHHHHCHHHHHHHHHHCC T ss_conf 989999999856087899998468 No 44 >PRK03592 haloalkane dehalogenase; Provisional Probab=57.42 E-value=6.2 Score=19.47 Aligned_cols=29 Identities=14% Similarity=0.028 Sum_probs=21.7 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHC Q ss_conf 789999718876435442466544201222531 Q gi|254780536|r 72 FIRVMEKNGLCVKRDISYVAGHSLGEYTALCAA 104 (314) Q Consensus 72 ~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~a 104 (314) +..++.+.|+ ++..++|||+|=..|+..| T Consensus 83 l~~ll~~Lg~----~~~~lvGhS~Gg~va~~~A 111 (294) T PRK03592 83 LDAFFDALDL----RNVVLVLHDWGSALGFDWA 111 (294) T ss_pred HHHHHHHCCC----CCEEEEEECCCCHHHHHHH T ss_conf 9999997699----9879998688849999999 No 45 >pfam01734 Patatin Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of lipid acyl hydrolase, catalysing the cleavage of fatty acids from membrane lipids. Members of this family have been found also in vertebrates. Probab=57.42 E-value=13 Score=17.53 Aligned_cols=45 Identities=18% Similarity=0.227 Sum_probs=31.1 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHH Q ss_conf 6665478999971887643544246654420122253178650233333 Q gi|254780536|r 67 AVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALCAAKAFSLSDTIRL 115 (314) Q Consensus 67 ~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~l 115 (314) +.+.+..+.|.+.++ ++|.+.|-|.|=+.|+..+-..+.++..+. T Consensus 11 ~~~~G~l~~l~~~~~----~~d~i~GtS~Gal~a~~~~~~~~~~~~~~~ 55 (189) T pfam01734 11 AYHLGVLKALGEAGL----RFDVISGTSAGAINAALLALGGDPEEIEDE 55 (189) T ss_pred HHHHHHHHHHHHCCC----CCCEEEEECHHHHHHHHHHCCCCHHHHHHH T ss_conf 999999999997699----988898867989999999819999999999 No 46 >pfam00561 Abhydrolase_1 alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes. Probab=56.92 E-value=6.2 Score=19.48 Aligned_cols=31 Identities=32% Similarity=0.469 Sum_probs=21.1 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHCC Q ss_conf 47899997188764354424665442012225317 Q gi|254780536|r 71 AFIRVMEKNGLCVKRDISYVAGHSLGEYTALCAAK 105 (314) Q Consensus 71 a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~aG 105 (314) .+.+++.+.|+ ++-.++|||+|=..|+.++. T Consensus 30 di~~l~~~l~i----~~~~liGhS~Gg~va~~~a~ 60 (225) T pfam00561 30 DLEALLQALGL----DKVNLVGHSMGGLIALAYAA 60 (225) T ss_pred HHHHHHHHCCC----CCEEEEEECCCCHHHHHHHH T ss_conf 99999997699----97799997288399999999 No 47 >cd07208 Pat_hypo_Ecoli_yjju_like Hypothetical patatin similar to yjju protein of Escherichia coli. Patatin-like phospholipase similar to yjju protein of Escherichia coli. This family predominantly consists of bacterial patatin glycoproteins, and some representatives from eukaryotes and archaea. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein, but it also has the enzymatic activity of a lipid acyl hydrolase, catalyzing the cleavage of fatty acids from membrane lipids. Members of this family have also been found in vertebrates. Probab=55.87 E-value=7.7 Score=18.89 Aligned_cols=38 Identities=18% Similarity=0.208 Sum_probs=28.6 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHH-HHHCCCC Q ss_conf 6665478999971887643544246654420122-2531786 Q gi|254780536|r 67 AVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTA-LCAAKAF 107 (314) Q Consensus 67 ~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aA-l~~aG~l 107 (314) +.+.+..+.|.+.|+. ..+.++|-|.|-+.| .+++|.. T Consensus 11 ~~~~Gvl~aL~e~~~~---~fd~i~GtSaGAi~a~~~~~g~~ 49 (266) T cd07208 11 AYTAGVLDAFLEAGIR---PFDLVIGVSAGALNAASYLSGQR 49 (266) T ss_pred HHHHHHHHHHHHCCCC---CCCEEEEECHHHHHHHHHHHCCC T ss_conf 9999999999977999---88899995799999999996898 No 48 >PRK03204 haloalkane dehalogenase; Provisional Probab=54.83 E-value=6.7 Score=19.28 Aligned_cols=29 Identities=21% Similarity=0.287 Sum_probs=20.8 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHC Q ss_conf 789999718876435442466544201222531 Q gi|254780536|r 72 FIRVMEKNGLCVKRDISYVAGHSLGEYTALCAA 104 (314) Q Consensus 72 ~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~a 104 (314) +..++.+.|+ +...++|||+|=..|+..| T Consensus 91 l~~lld~Lgl----~~~~lvGhs~GG~va~~~A 119 (286) T PRK03204 91 IGEFVDHLGL----DRYLSMGQDWGGPISMAVA 119 (286) T ss_pred HHHHHHHHCC----CCEEEEEECCCHHHHHHHH T ss_conf 9999997489----9749999737789999999 No 49 >PRK03609 umuC DNA polymerase V subunit UmuC; Reviewed Probab=54.75 E-value=16 Score=16.91 Aligned_cols=15 Identities=7% Similarity=0.142 Sum_probs=7.1 Q ss_pred HHHHHHHHHHC-CCCE Q ss_conf 99999999977-9989 Q gi|254780536|r 259 WRETIQWFANH-GVKS 273 (314) Q Consensus 259 f~~~i~~l~~~-g~~~ 273 (314) -.++|+.+.+. |-+. T Consensus 373 L~~~iD~In~rfG~~t 388 (422) T PRK03609 373 LMKVLDTLNAKEGKGT 388 (422) T ss_pred HHHHHHHHHHHHCCCE T ss_conf 9999999998538995 No 50 >pfam00551 Formyl_trans_N Formyl transferase. This family includes the following members. Glycinamide ribonucleotide transformylase catalyses the third step in de novo purine biosynthesis, the transfer of a formyl group to 5'-phosphoribosylglycinamide. Formyltetrahydrofolate deformylase produces formate from formyl- tetrahydrofolate. Methionyl-tRNA formyltransferase transfers a formyl group onto the amino terminus of the acyl moiety of the methionyl aminoacyl-tRNA. Inclusion of the following members is supported by PSI-blast. HOXX_BRAJA (P31907) contains a related domain of unknown function. PRTH_PORGI (P46071) contains a related domain of unknown function. Y09P_MYCTU (Q50721) contains a related domain of unknown function. Probab=52.39 E-value=9.2 Score=18.42 Aligned_cols=18 Identities=22% Similarity=0.385 Sum_probs=13.5 Q ss_pred CEEEEECCCCCCCCCCHH Q ss_conf 919998087333731158 Q gi|254780536|r 1 MSTVLTFPGQGSQVIGMG 18 (314) Q Consensus 1 mk~~f~FpGQGsq~~gM~ 18 (314) ||++++++|+||-..... T Consensus 1 mkiavl~SG~Gsnl~~Il 18 (181) T pfam00551 1 MKIAVLISGTGSNLQALL 18 (181) T ss_pred CEEEEEECCCCHHHHHHH T ss_conf 989999907966599999 No 51 >pfam00975 Thioesterase Thioesterase domain. Peptide synthetases are involved in the non-ribosomal synthesis of peptide antibiotics. Next to the operons encoding these enzymes, in almost all cases, are genes that encode proteins that have similarity to the type II fatty acid thioesterases of vertebrates. There are also modules within the peptide synthetases that also share this similarity. With respect to antibiotic production, thioesterases are required for the addition of the last amino acid to the peptide antibiotic, thereby forming a cyclic antibiotic. Thioesterases (non-integrated) have molecular masses of 25-29 kDa. Probab=50.13 E-value=7.7 Score=18.90 Aligned_cols=18 Identities=28% Similarity=0.307 Sum_probs=12.7 Q ss_pred CCCCCCCCCCCCHHHHHH Q ss_conf 544246654420122253 Q gi|254780536|r 86 DISYVAGHSLGEYTALCA 103 (314) Q Consensus 86 ~p~~v~GhSlGE~aAl~~ 103 (314) .|-.+.|||+|=+.|.-+ T Consensus 65 gPy~L~GhS~Gg~lA~ev 82 (224) T pfam00975 65 GPYALFGHSMGGLLAFEV 82 (224) T ss_pred CCEEEEEECCCHHHHHHH T ss_conf 997999644456999999 No 52 >PRK10439 enterobactin/ferric enterobactin esterase; Provisional Probab=49.31 E-value=18 Score=16.72 Aligned_cols=13 Identities=31% Similarity=0.585 Sum_probs=7.7 Q ss_pred CCCCEEEEECCCH Q ss_conf 7998999938987 Q gi|254780536|r 269 HGVKSVYEVGSGK 281 (314) Q Consensus 269 ~g~~~fiEiGP~~ 281 (314) .+.+.|+|+|-.. T Consensus 331 ~~lri~l~~G~~E 343 (398) T PRK10439 331 RGLRIVLEAGVRE 343 (398) T ss_pred CCEEEEEECCCCC T ss_conf 8808999666885 No 53 >TIGR02821 fghA_ester_D S-formylglutathione hydrolase; InterPro: IPR014186 This entry describes proteins from bacteria, yeast, and human, with a conserved critical role in formaldehyde detoxification as S-formylglutathione hydrolase (3.1.2.12 from EC). Eukaryotic members, particularly the human protein, are better known as esterase D (3.1.1.1 from EC), an enzyme with broad specificity, although S-formylglutathione hydrolase has now been demonstrated as well.. Probab=49.01 E-value=11 Score=17.97 Aligned_cols=29 Identities=24% Similarity=0.320 Sum_probs=19.5 Q ss_pred HHHCCCCCCCCCCCCCCCCCCCHHHHHHC Q ss_conf 99718876435442466544201222531 Q gi|254780536|r 76 MEKNGLCVKRDISYVAGHSLGEYTALCAA 104 (314) Q Consensus 76 l~~~gi~~~~~p~~v~GhSlGE~aAl~~a 104 (314) |.....+++..-..+.|||||=--||++| T Consensus 129 l~~~~fPld~~r~GI~GHSMGGhGAL~~A 157 (279) T TIGR02821 129 LVAAQFPLDGERQGITGHSMGGHGALVIA 157 (279) T ss_pred HHHHHCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 99985892201160135555467999998 No 54 >pfam00326 Peptidase_S9 Prolyl oligopeptidase family. Probab=48.54 E-value=21 Score=16.31 Aligned_cols=31 Identities=23% Similarity=0.255 Sum_probs=20.3 Q ss_pred HHHHHHCCCCCCCCCC--CCCCCCCCCHHHHHHCCC Q ss_conf 8999971887643544--246654420122253178 Q gi|254780536|r 73 IRVMEKNGLCVKRDIS--YVAGHSLGEYTALCAAKA 106 (314) Q Consensus 73 ~~~l~~~gi~~~~~p~--~v~GhSlGE~aAl~~aG~ 106 (314) .+.|++++.. +|+ .+.|||.|=+.|+.++.. T Consensus 52 v~~l~~~~~i---D~~RI~v~G~S~GG~la~~~~~~ 84 (212) T pfam00326 52 AEYLIAQGYV---DPDRLAIWGGSYGGYLTGAALNQ 84 (212) T ss_pred HHHHHHCCCC---CHHHEEEEEECCCHHHHHHHHCC T ss_conf 9999986998---85574897676220988998604 No 55 >TIGR02082 metH methionine synthase; InterPro: IPR011822 Vitamin B12 dependent methionine synthase 2.1.1.13 from EC (5-methyltetrahydrofolate--homocysteine S-methyltransferase) catalyses the conversion of 5-methyltetrahydrofolate and L-homocysteine to tetrahydrofolate and L-methionine as the final step in de novo methionine biosynthesis. The enzyme requires methylcobalamin as a cofactor. In humans, defects in this enzyme are the cause of autosomal recessive inherited methylcobalamin deficiency (CBLG), which causes mental retardation, macrocytic anemia and homocystinuria. Mild deficiencies in activity may result in mild hyperhomocysteinemia, and mutations in the enzyme may be involved in tumorigenesis. Vitamin B12 dependent methionine synthase is found in prokaryotes and eukaryotes, but in prokaryotes the cofactor is cobalamin. This enzyme is a large protein composed of four structurally and functionally distinct modules: the first two modules bind homocysteine and tetrahydrofolate , the third module binds the B12 cofactor , and the C-terminal module (activation domain) binds S-adenosylmethionine. The activation domain is essential for the reductive activation of the enzyme. During the catalytic cycle, the highly reactive cob(I)alamin intermediate can be oxidised to produce an inactive cob(II)alamin enzyme; the enzyme is then reactivated via reductive methylation by the activation domain . The activation domain adopts an unusual alpha/beta fold.Recent studies suggest that this enzyme exists as an ensemble of conformations with equilibria dependent on the oxidation and methylation state of the cobalamin and on the concentrations of substrates or products .; GO: 0008705 methionine synthase activity, 0031419 cobalamin binding, 0050897 cobalt ion binding, 0009086 methionine biosynthetic process. Probab=48.31 E-value=7.6 Score=18.94 Aligned_cols=256 Identities=16% Similarity=0.205 Sum_probs=112.1 Q ss_pred CHHHHHH-HHHHHHHCCCCHHHHHCCCCH----HHHHCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHH Q ss_conf 9999999-999998839897898538986----78304676503456666547899997188764354424665442012 Q gi|254780536|r 25 FPEARLV-FEEVDHTLNQNLSDLMWNGSQ----EELTATCNAQPALTAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYT 99 (314) Q Consensus 25 ~p~~r~~-~~~~~~~l~~~l~~~~~~~~~----~~l~~~~~~Qp~i~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~a 99 (314) .|.||.+ ||...+++......++-+|-+ |.+-||-++.-|+|+++-. +.+.|.. .|-++ T Consensus 137 ~P~FRnvtyD~L~~AY~e~~~gL~dGGvDllLiET~fDtLnaKAAlfA~~~~----F~~~g~~---lPi~i--------- 200 (1265) T TIGR02082 137 RPEFRNVTYDELVDAYKEQIKGLLDGGVDLLLIETVFDTLNAKAALFAAETV----FEEKGRE---LPIMI--------- 200 (1265) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHH----HHHCCCC---CCEEE--------- T ss_conf 7788757879899999999998773260354774799999999999999999----9972897---87688--------- Q ss_pred HHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCH----HHHHHHHHHHHHCCCCCEECCC-CC--CCEEEEC Q ss_conf 225317865023333345688653100111122222123503----5531122344211354200122-32--2213530 Q gi|254780536|r 100 ALCAAKAFSLSDTIRLVRARGKSMQEAVPPGLGSMVAIIGLD----DCVVDSICAQASRVGICEIAND-NG--GGQVVIS 172 (314) Q Consensus 100 Al~~aG~ls~eda~~lv~~R~~~m~~~~~~~~g~M~aV~~~~----~~~~~~~~~~~~~~~~~~ia~~-Ns--~~q~vis 172 (314) ||++.-..|-.|+..=...-..+..+ .... .+|++ .+++.+.++.+++...++|+|+ |+ |+. ++ T Consensus 201 ----SGti~d~sGr~LsGqt~~Af~~sl~~--~~~~-~iGLNCAlGpdel~~h~~~Ls~~~~~~Vs~~PNAGLPn~--~G 271 (1265) T TIGR02082 201 ----SGTIVDTSGRTLSGQTLEAFLASLEH--AEIL-IIGLNCALGPDELRPHVKELSEHAEAYVSVHPNAGLPNA--FG 271 (1265) T ss_pred ----EEEEECCCCCCCCCCCHHHHHHHHCC--CCCC-EECCCCCCCHHHHHHHHHHHHHHCCEEEEEECCCCCCCC--CC T ss_conf ----60262276872026208999998546--8730-241231147778999999987008813888337888100--47 Q ss_pred C--C----CCHHHHHHHHHHHHHHHHCCCCC--CCCCCCCHHCCHHHHHHHHHHHHHHHC-CCC--CC---CCCCCCCCC Q ss_conf 3--3----11289998888886531003555--543234201020378888742223202-877--65---420457871 Q gi|254780536|r 173 G--L----QDAVKCAADTCLNKGAKRAVFLP--VSAPFHSSLMTPVSKVMKWMLDSVTKQ-DPV--VP---ILPNFCASP 238 (314) Q Consensus 173 G--~----~~~l~~~~~~~~~~~~~~~~~L~--v~~afHs~lm~~~~~~~~~~l~~i~~~-~p~--ip---i~S~~~g~~ 238 (314) | + ++.+....+.+-..+....+ =- -+.|=|=.-|+. +.+++..+ .|. +| ++|.++.-. T Consensus 272 ~~AeY~l~P~~~A~~l~~fa~eg~lniV-GGCCGTTP~HIraiA~-------aVk~i~Pr~~~~ta~~~~~~~Sgle~~~ 343 (1265) T TIGR02082 272 KQAEYDLTPDELAKALKDFAEEGGLNIV-GGCCGTTPDHIRAIAE-------AVKDIKPRKRPETALYEPSRLSGLEAIT 343 (1265) T ss_pred CCCCCCCCHHHHHHHHHHHHHHCCEEEE-ECCCCCCHHHHHHHHH-------HHHCCCCCCCCCCCCCHHHHHHCCCCCC T ss_conf 7768755578999999999986390377-4268996789999999-------9816876888765563246763031346 Q ss_pred CCCCH---HHHHH-------HHHHCCCCCCHHHHHHHHHH---CCCCEEEEECCCHH-------HHHHHHHHC--CCCCE Q ss_conf 00100---23322-------22204485019999999997---79989999389879-------999999873--89815 Q gi|254780536|r 239 VSSID---EISRL-------LVEQVTGRVRWRETIQWFAN---HGVKSVYEVGSGKV-------LTGLAKRID--KSLSA 296 (314) Q Consensus 239 ~~~~~---~~~~~-------l~~ql~~pV~f~~~i~~l~~---~g~~~fiEiGP~~~-------Lt~l~~~~~--~~~~~ 296 (314) +.... .+.+. =.+.+..-=+|.+++.-+.. +|. .+|+|-=..+ ...|.+.+- |++.. T Consensus 344 ~~~~~~FvniGERtNv~GS~KF~~li~aedY~~~L~iAk~QV~~GA-QilDiN~D~~~lDg~~~M~~f~~llA~ePdia~ 422 (1265) T TIGR02082 344 IAQESNFVNIGERTNVAGSKKFRKLIIAEDYDEALDIAKQQVENGA-QILDINVDDGMLDGVAAMKRFLNLLASEPDIAK 422 (1265) T ss_pred CCCCCCEEEEEEEECCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCC-EEEEEEECCCCCCCHHHHHHHHHHHCCCCCCCC T ss_conf 3663566887430078768679998512384688789888711697-798987065536737899999996268986021 Q ss_pred EEC----CCHHHHHHHHHHHCC Q ss_conf 523----998999999987439 Q gi|254780536|r 297 VSI----SKVEDIDLALRSIVG 314 (314) Q Consensus 297 ~~~----~~~~~~~~~l~~~~g 314 (314) +.+ ++.+=++..|+.+-| T Consensus 423 vPlMlDSs~~~vleaGLk~~qG 444 (1265) T TIGR02082 423 VPLMLDSSEWEVLEAGLKCIQG 444 (1265) T ss_pred CCCCCCCCHHHHHHHHHHHCCC T ss_conf 3600057437999762430388 No 56 >PRK05294 carB carbamoyl phosphate synthase large subunit; Reviewed Probab=46.91 E-value=4.9 Score=20.06 Aligned_cols=23 Identities=30% Similarity=0.251 Sum_probs=12.9 Q ss_pred HHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 99997188764354424665442 Q gi|254780536|r 74 RVMEKNGLCVKRDISYVAGHSLG 96 (314) Q Consensus 74 ~~l~~~gi~~~~~p~~v~GhSlG 96 (314) ..-++.|.++-.||++|+|=.-. T Consensus 697 ~~a~~iGyPvlvRPSyVLGG~~M 719 (1063) T PRK05294 697 EIAEEIGYPVLVRPSYVLGGRAM 719 (1063) T ss_pred HHHHHCCCCEEECCCCCCCCCCE T ss_conf 99986498668434310257714 No 57 >pfam09752 DUF2048 Uncharacterized conserved protein (DUF2048). The proteins in this family are conserved from plants to vertebrates. The function is unknown. Probab=43.78 E-value=24 Score=15.86 Aligned_cols=39 Identities=28% Similarity=0.286 Sum_probs=25.8 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHCCCC Q ss_conf 5666654789999718876435442466544201222531786 Q gi|254780536|r 65 LTAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALCAAKAF 107 (314) Q Consensus 65 i~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~aG~l 107 (314) ++.=..++++.+++.|. .|-.+.|.|+|-.-|..++-.+ T Consensus 156 ~i~E~~~Ll~w~~~~g~----g~~g~tG~SmGG~mAalaas~~ 194 (337) T pfam09752 156 TILESRALLHWLEREGY----GPLGVTGLSMGGHMAALAASNW 194 (337) T ss_pred HHHHHHHHHHHHHHCCC----CCCEEEEECCCHHHHHHHHHCC T ss_conf 99999999999997288----8613664233127777664058 No 58 >COG4370 Uncharacterized protein conserved in bacteria [Function unknown] Probab=42.97 E-value=22 Score=16.19 Aligned_cols=11 Identities=27% Similarity=0.374 Sum_probs=7.6 Q ss_pred CEEEEECCCCC Q ss_conf 91999808733 Q gi|254780536|r 1 MSTVLTFPGQG 11 (314) Q Consensus 1 mk~~f~FpGQG 11 (314) ||+.|+=-|.| T Consensus 7 ~kiLllSNGHg 17 (412) T COG4370 7 PKILLLSNGHG 17 (412) T ss_pred CCEEEEECCCH T ss_conf 10478606852 No 59 >PRK10566 esterase; Provisional Probab=42.43 E-value=15 Score=17.17 Aligned_cols=31 Identities=26% Similarity=0.322 Sum_probs=21.7 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 78999971887643544246654420122253 Q gi|254780536|r 72 FIRVMEKNGLCVKRDISYVAGHSLGEYTALCA 103 (314) Q Consensus 72 ~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~ 103 (314) +.+.+.+.|+.. ++--++.|+|+|=|+|+.+ T Consensus 94 ~v~~l~~~g~vD-~~RigI~G~S~GG~tal~a 124 (249) T PRK10566 94 LRAAIQQENWLL-DDRLAVGGASMGGMTALGI 124 (249) T ss_pred HHHHHHHCCCCC-CCEEEEEECCHHHHHHHHH T ss_conf 999999848877-5336998238899999999 No 60 >TIGR01821 5aminolev_synth 5-aminolevulinic acid synthase; InterPro: IPR010961 Tetrapyrroles are large macrocyclic compounds derived from a common biosynthetic pathway . The end-product, uroporphyrinogen III, is used to synthesise a number of important molecules, including vitamin B12, haem, sirohaem, chlorophyll, coenzyme F430 and phytochromobilin . The first stage in tetrapyrrole synthesis is the synthesis of 5-aminoaevulinic acid ALA via two possible routes: (1) condensation of succinyl CoA and glycine (C4 pathway) using ALA synthase (2.3.1.37 from EC), or (2) decarboxylation of glutamate (C5 pathway) via three different enzymes, glutamyl-tRNA synthetase (6.1.1.17 from EC) to charge a tRNA with glutamate, glutamyl-tRNA reductase (1.2.1.70 from EC) to reduce glutamyl-tRNA to glutamate-1-semialdehyde (GSA), and GSA aminotransferase (5.4.3.8 from EC) to catalyse a transamination reaction to produce ALA. The second stage is to convert ALA to uroporphyrinogen III, the first macrocyclic tetrapyrrolic structure in the pathway. This is achieved by the action of three enzymes in one common pathway: porphobilinogen (PBG) synthase (or ALA dehydratase, 4.2.1.24 from EC) to condense two ALA molecules to generate porphobilinogen; hydroxymethylbilane synthase (or PBG deaminase, 2.5.1.61 from EC) to polymerise four PBG molecules into preuroporphyrinogen (tetrapyrrole structure); and uroporphyrinogen III synthase (4.2.1.75 from EC) to link two pyrrole units together (rings A and D) to yield uroporphyrinogen III. Uroporphyrinogen III is the first branch point of the pathway. To synthesise cobalamin (vitamin B12), sirohaem, and coenzyme F430, uroporphyrinogen III needs to be converted into precorrin-2 by the action of uroporphyrinogen III methyltransferase (2.1.1.107 from EC). To synthesise haem and chlorophyll, uroporphyrinogen III needs to be decarboxylated into coproporphyrinogen III by the action of uroporphyrinogen III decarboxylase (4.1.1.37 from EC) . This entry represents 5-aminoaevulinic acid (ALA) synthase (2.3.1.37 from EC), which catalyses the first stage of tetrapyrrole biosynthesis by the C4 pathway, namely the condensation of succinyl CoA and glycine. ALA synthase is a pyridoxal-phosphate-dependent enzyme. During catalysis, glycine initially binds to the enzyme cofactor, and after condensation with succinyl CoA, CoA, carbon dioxide and 5-aminolevulinic acid are produced .; GO: 0003870 5-aminolevulinate synthase activity, 0030170 pyridoxal phosphate binding, 0033014 tetrapyrrole biosynthetic process. Probab=42.39 E-value=26 Score=15.73 Aligned_cols=23 Identities=35% Similarity=0.362 Sum_probs=9.4 Q ss_pred CCEEE--ECCCCCHHHHHHHHHHHH Q ss_conf 22135--303311289998888886 Q gi|254780536|r 166 GGQVV--ISGLQDAVKCAADTCLNK 188 (314) Q Consensus 166 ~~q~v--isG~~~~l~~~~~~~~~~ 188 (314) |+|+| +-|+....+++.+.|.++ T Consensus 351 pSHIvPv~iGdA~~~k~~SD~LL~~ 375 (427) T TIGR01821 351 PSHIVPVIIGDAAKCKKVSDLLLNK 375 (427) T ss_pred CCEEEEEEECCHHHHHHHHHHHHHH T ss_conf 7738876766868898998988871 No 61 >PRK10349 carboxylesterase BioH; Provisional Probab=42.30 E-value=13 Score=17.59 Aligned_cols=19 Identities=37% Similarity=0.291 Sum_probs=13.6 Q ss_pred CCCCCCCCCCCCHHHHHHC Q ss_conf 5442466544201222531 Q gi|254780536|r 86 DISYVAGHSLGEYTALCAA 104 (314) Q Consensus 86 ~p~~v~GhSlGE~aAl~~a 104 (314) +...++|||+|=..|+..| T Consensus 74 ~~~~lvG~SmGG~iA~~~A 92 (256) T PRK10349 74 DKAIWLGWSLGGLVASQIA 92 (256) T ss_pred CCEEEEEECHHHHHHHHHH T ss_conf 6737999896189999999 No 62 >COG1054 Predicted sulfurtransferase [General function prediction only] Probab=40.52 E-value=27 Score=15.56 Aligned_cols=15 Identities=20% Similarity=0.253 Sum_probs=4.8 Q ss_pred ECCCCCHHHHHHHHH Q ss_conf 303311289998888 Q gi|254780536|r 171 ISGLQDAVKCAADTC 185 (314) Q Consensus 171 isG~~~~l~~~~~~~ 185 (314) |||+.++++.+.+.+ T Consensus 50 vsG~~e~~~~~~~~l 64 (308) T COG1054 50 VSGSAEAIEAYMAWL 64 (308) T ss_pred EECCHHHHHHHHHHH T ss_conf 756799999999999 No 63 >cd00381 IMPDH IMPDH: The catalytic domain of the inosine monophosphate dehydrogenase. IMPDH catalyzes the NAD-dependent oxidation of inosine 5'-monophosphate (IMP) to xanthosine 5' monophosphate (XMP). It is a rate-limiting step in the de novo synthesis of the guanine nucleotides. There is often a CBS domain inserted in the middle of this domain, which is proposed to play a regulatory role. IMPDH is a key enzyme in the regulation of cell proliferation and differentiation. It has been identified as an attractive target for developing chemotherapeutic agents. Probab=39.65 E-value=28 Score=15.47 Aligned_cols=24 Identities=29% Similarity=0.271 Sum_probs=14.6 Q ss_pred CCEEEECCCCCHHHHHHHHHHHHHH Q ss_conf 2213530331128999888888653 Q gi|254780536|r 166 GGQVVISGLQDAVKCAADTCLNKGA 190 (314) Q Consensus 166 ~~q~visG~~~~l~~~~~~~~~~~~ 190 (314) |+..||+|....-+.+.... +.|+ T Consensus 134 p~~~IiaGNV~T~e~a~~L~-~~Ga 157 (325) T cd00381 134 PNVDVIAGNVVTAEAARDLI-DAGA 157 (325) T ss_pred CCCCEEECCCCCHHHHHHHH-HCCC T ss_conf 99756864566899999998-6699 No 64 >TIGR02653 Lon_rel_chp conserved hypothetical protein; InterPro: IPR013473 This entry describes a protein family of unknown function, about 690 residues in length, in which some members show C-terminal sequence similarity to IPR008269 from INTERPRO, which is the Lon protease C-terminal proteolytic domain, from MEROPS family S16. However, the annotated catalytic sites of Escherichia coli Lon protease are not conserved in members of this family. Members have a motif GP[RK][GS]TGKS, similar to the ATP-binding P-loop motif GxxGxGK[ST].. Probab=39.11 E-value=18 Score=16.73 Aligned_cols=47 Identities=15% Similarity=0.242 Sum_probs=29.2 Q ss_pred CCCCCCCCHHHHHHHHHHC-CCCCCHHHHHHHH--------HHCCCCEEEEECCCHH Q ss_conf 7871001002332222204-4850199999999--------9779989999389879 Q gi|254780536|r 235 CASPVSSIDEISRLLVEQV-TGRVRWRETIQWF--------ANHGVKSVYEVGSGKV 282 (314) Q Consensus 235 ~g~~~~~~~~~~~~l~~ql-~~pV~f~~~i~~l--------~~~g~~~fiEiGP~~~ 282 (314) +++.--+.+++.|.|.+.+ .+|-++++.++.. .|+.. ..+|+||.+| T Consensus 174 ~~R~~F~~deW~DvL~RSvGMEP~~le~r~Kwh~l~RliPlvEnNY-NlcELGPrgT 229 (677) T TIGR02653 174 EERRQFTKDEWIDVLLRSVGMEPTNLERRTKWHLLTRLIPLVENNY-NLCELGPRGT 229 (677) T ss_pred HHHHCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCC-CEEEECCCCC T ss_conf 7310378301034243125888325788999999975432300575-5476478898 No 65 >cd07220 Pat_PNPLA2 Patatin-like phospholipase domain containing protein 2. PNPLA2 plays a key role in hydrolysis of stored triacylglecerols and is also known as adipose triglyceride lipase (ATGL). Members of this family share a patain domain, initially discovered in potato tubers. ATGL is expressed in white and brown adipose tissue in high mRNA levels. Mutations in PNPLA2 encoding adipose triglyceride lipase (ATGL) leads to neutral lipid storage disease (NLSD) which is characterized by the accumulation of triglycerides in multiple tissues. ATGL mutations are also commonly associated with severe forms of skeletal- and cardio-myopathy. This family includes patatin-like proteins: TTS-2.2 (transport-secretion protein 2.2), PNPLA2 (Patatin-like phospholipase domain-containing protein 2), and iPLA2-zeta (Calcium-independent phospholipase A2) from Homo sapiens. Probab=39.02 E-value=29 Score=15.44 Aligned_cols=54 Identities=15% Similarity=0.121 Sum_probs=33.8 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHH Q ss_conf 566665478999971887643544246654420122253178650233333456 Q gi|254780536|r 65 LTAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALCAAKAFSLSDTIRLVRA 118 (314) Q Consensus 65 i~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~ 118 (314) +....++.+..|++++-....+..-+.|-|-|-++|++.+-.+++++....+.. T Consensus 15 Lg~YHlGVa~~L~e~~~~l~~~~~~~~GaSAGAL~Aa~l~~~~~~~~~~~~~~~ 68 (249) T cd07220 15 LGVYHVGVASCLLEHAPFLVANARKIYGASAGALTATALVTGVCLGECGASVIR 68 (249) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 140499999999986658763476666654878999999837889999999999 No 66 >TIGR01009 rpsC_bact ribosomal protein S3; InterPro: IPR005704 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . Ribosomal protein S3 is one of the proteins from the small ribosomal subunit. This family describes the bacterial type of ribosomal protein S3 and also and many chloroplast forms. Chloroplast and mitochondrial forms have large, variable inserts between conserved N-terminal and C-terminal domains. ; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015935 small ribosomal subunit. Probab=38.88 E-value=27 Score=15.56 Aligned_cols=79 Identities=10% Similarity=0.125 Sum_probs=47.7 Q ss_pred CEECCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 20012232221353033112899988888865310035555432342010203788887422232028776542045787 Q gi|254780536|r 158 CEIANDNGGGQVVISGLQDAVKCAADTCLNKGAKRAVFLPVSAPFHSSLMTPVSKVMKWMLDSVTKQDPVVPILPNFCAS 237 (314) Q Consensus 158 ~~ia~~Ns~~q~visG~~~~l~~~~~~~~~~~~~~~~~L~v~~afHs~lm~~~~~~~~~~l~~i~~~~p~ipi~S~~~g~ 237 (314) +.|-.+-+.=-+||+=.-.++|++.+.+.+.-.. .+.+.++. -+++ + T Consensus 68 v~V~IhtArPg~vIGkkG~~iE~l~~~l~k~~~~-~k~v~i~I-------------------------~EVk-------~ 114 (217) T TIGR01009 68 VRVTIHTARPGIVIGKKGSEIEKLKKALQKLTGS-VKEVQINI-------------------------KEVK-------R 114 (217) T ss_pred EEEEEEECCCCEEECCCCCHHHHHHHHHHHHHCC-EEEEEEEE-------------------------EECC-------C T ss_conf 8999982788658857884589999999988478-35899989-------------------------8748-------8 Q ss_pred CCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHC Q ss_conf 10010023322222044850199999999977 Q gi|254780536|r 238 PVSSIDEISRLLVEQVTGRVRWRETIQWFANH 269 (314) Q Consensus 238 ~~~~~~~~~~~l~~ql~~pV~f~~~i~~l~~~ 269 (314) |..|+.-+.+++++||-+.|=|..+|+.+.+. T Consensus 115 p~ldA~LvA~~IA~QlE~RVSFRrAmK~A~~~ 146 (217) T TIGR01009 115 PELDAQLVAQNIARQLENRVSFRRAMKKAIQS 146 (217) T ss_pred CCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 57387889989999986202046899999999 No 67 >PRK08349 hypothetical protein; Validated Probab=37.47 E-value=29 Score=15.39 Aligned_cols=84 Identities=19% Similarity=0.144 Sum_probs=39.3 Q ss_pred CCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCC---CEECCCC Q ss_conf 4246654420122253178650233333456886531001111222221235035531122344211354---2001223 Q gi|254780536|r 88 SYVAGHSLGEYTALCAAKAFSLSDTIRLVRARGKSMQEAVPPGLGSMVAIIGLDDCVVDSICAQASRVGI---CEIANDN 164 (314) Q Consensus 88 ~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~R~~~m~~~~~~~~g~M~aV~~~~~~~~~~~~~~~~~~~~---~~ia~~N 164 (314) ..+.|.|+|..|.-........+.+..+--.| + ++|.|.+++-++..+.+.... ..-.|-= T Consensus 108 aivTGEsLGQVASQTl~NL~~i~~~~~~pVlR-----------P-----Lig~DK~EII~~Ar~IGTye~S~~~~~cC~~ 171 (198) T PRK08349 108 AIVTGDSLGQVASQTLDNLMVISTATDLPILR-----------P-----LIGLDKEEIVRIAKEIGTFEISIEPEPPCPF 171 (198) T ss_pred EEEECCHHHHHHHHHHHHHHHHHHHHCCCCCC-----------C-----CCCCCHHHHHHHHHHHCCHHHHCCCCCCCEE T ss_conf 89845216788899998899998750676647-----------7-----6679989999999981955343289977441 Q ss_pred CCCEEEECCCCCHHHHHHHHHHH Q ss_conf 22213530331128999888888 Q gi|254780536|r 165 GGGQVVISGLQDAVKCAADTCLN 187 (314) Q Consensus 165 s~~q~visG~~~~l~~~~~~~~~ 187 (314) .|.+=+.-+..+.++++.+++.+ T Consensus 172 ~pk~P~t~~~~~~i~~~ee~l~~ 194 (198) T PRK08349 172 VPKFPVVRAGLGEFEKILEELKD 194 (198) T ss_pred ECCCCCCCCCHHHHHHHHHHHHH T ss_conf 08988268899999999997664 No 68 >KOG2968 consensus Probab=37.04 E-value=25 Score=15.80 Aligned_cols=34 Identities=29% Similarity=0.626 Sum_probs=22.3 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHH-HHHHC Q ss_conf 666547899997188764354424665442012-22531 Q gi|254780536|r 67 AVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYT-ALCAA 104 (314) Q Consensus 67 ~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~a-Al~~a 104 (314) +.+++..+.|++.|| ..|.+-|-|.|-+. ||||. T Consensus 852 ~ahiGvl~ALeE~GI----PvD~VGGTSIGafiGaLYA~ 886 (1158) T KOG2968 852 AAHIGVLQALEEAGI----PVDMVGGTSIGAFIGALYAE 886 (1158) T ss_pred HHHHHHHHHHHHCCC----CEEEECCCCHHHHHHHHHHC T ss_conf 568889999997399----74545662577752124206 No 69 >cd02430 PTH2 Peptidyl-tRNA hydrolase, type 2 (PTH2). Peptidyl-tRNA hydrolase (PTH) activity releases tRNA from the premature translation termination product peptidyl-tRNA, therefore allowing the tRNA and peptide to be reused in protein synthesis. PTH2 is present in archaea and eukaryotes. Probab=35.80 E-value=29 Score=15.45 Aligned_cols=61 Identities=13% Similarity=0.116 Sum_probs=27.6 Q ss_pred CCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCC-CCCCCCCCHHHHHHHHHHHHHCC Q ss_conf 665442012225317865023333345688653100111122-22212350355311223442113 Q gi|254780536|r 91 AGHSLGEYTALCAAKAFSLSDTIRLVRARGKSMQEAVPPGLG-SMVAIIGLDDCVVDSICAQASRV 155 (314) Q Consensus 91 ~GhSlGE~aAl~~aG~ls~eda~~lv~~R~~~m~~~~~~~~g-~M~aV~~~~~~~~~~~~~~~~~~ 155 (314) ++.|.|.++|-|+-.++.. ......+..-.-+.+..+ | .-+.+...+++++.++.....+. T Consensus 11 L~Ms~GKiaAQ~~HAav~~---~~~a~~~~~~~~~~W~~~-G~~KIvlkv~~e~~l~~l~~~a~~~ 72 (115) T cd02430 11 LKMGKGKIAAQCAHAALGA---YKKAMKSNPELLRAWERE-GQKKIVLKVNSEEELLELKKKAKSL 72 (115) T ss_pred CCCCCHHHHHHHHHHHHHH---HHHHHHHCHHHHHHHHHC-CCCEEEEECCCHHHHHHHHHHHHHC T ss_conf 8988005999998999999---999986498999999967-9956999779999999999999987 No 70 >COG1565 Uncharacterized conserved protein [Function unknown] Probab=34.99 E-value=34 Score=15.02 Aligned_cols=23 Identities=13% Similarity=0.222 Sum_probs=18.1 Q ss_pred HCCCCCCHHHHHHHHHHCCCCEE Q ss_conf 04485019999999997799899 Q gi|254780536|r 252 QVTGRVRWRETIQWFANHGVKSV 274 (314) Q Consensus 252 ql~~pV~f~~~i~~l~~~g~~~f 274 (314) -++.-|+|+...+...+.|.+.. T Consensus 279 DLTahVdFt~L~~~~~~~g~~~~ 301 (370) T COG1565 279 DLTAHVDFTALAKAAKAAGLEVL 301 (370) T ss_pred CEEEEECHHHHHHHHHHCCCCCC T ss_conf 04564058999999998398720 No 71 >pfam06057 VirJ Bacterial virulence protein (VirJ). This family consists of several bacterial VirJ virulence proteins. VirJ is thought to be involved in the type IV secretion system. It is thought that the substrate proteins localized to the periplasm may associate with the pilus in a manner that is mediated by VirJ, and suggest a two-step process for type IV secretion in Agrobacterium. Probab=34.83 E-value=34 Score=15.01 Aligned_cols=75 Identities=9% Similarity=0.176 Sum_probs=36.2 Q ss_pred EEEEECCCCCCCCCCHHHHHHHHCHHHHHHHHHHHHHCCCCHHHHHCCC--CHHHHHCHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 1999808733373115899998599999999999988398978985389--86783046765034566665478999971 Q gi|254780536|r 2 STVLTFPGQGSQVIGMGRDLYDSFPEARLVFEEVDHTLNQNLSDLMWNG--SQEELTATCNAQPALTAVSMAFIRVMEKN 79 (314) Q Consensus 2 k~~f~FpGQGsq~~gM~~~L~~~~p~~r~~~~~~~~~l~~~l~~~~~~~--~~~~l~~~~~~Qp~i~~~~~a~~~~l~~~ 79 (314) ++|++++|-|. |.++-+++-+. .-++-=.+.|.|-...+|.. |++.=.+. .-.+-.-.++| T Consensus 3 tlav~~SGDGG-W~~lD~~va~~------L~~~GvpVvGvdSLrYFW~~ktP~q~a~Dl----------~~ii~~Y~~~w 65 (192) T pfam06057 3 TVAVFYSGDGG-WRDLDKEVGSA------LQKQGVPVVGVDSLRYFWSERTPEEVADDL----------DRIIDTYRKRW 65 (192) T ss_pred EEEEEEECCCC-HHHHHHHHHHH------HHHCCCCEEEECHHHHHHCCCCHHHHHHHH----------HHHHHHHHHHH T ss_conf 79999946887-35551999999------997798365532688875659989999999----------99999999985 Q ss_pred CCCCCCCCCCCCCCCCCC Q ss_conf 887643544246654420 Q gi|254780536|r 80 GLCVKRDISYVAGHSLGE 97 (314) Q Consensus 80 gi~~~~~p~~v~GhSlGE 97 (314) +. +--.++|+|+|- T Consensus 66 ~~----~~v~LiGYSFGA 79 (192) T pfam06057 66 KV----KNVVLIGYSFGA 79 (192) T ss_pred CC----CEEEEEEECCCC T ss_conf 89----659999617872 No 72 >COG4012 Uncharacterized protein conserved in archaea [Function unknown] Probab=34.74 E-value=24 Score=15.95 Aligned_cols=67 Identities=13% Similarity=0.087 Sum_probs=33.6 Q ss_pred CCCHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCEEEEECCCH Q ss_conf 34201020378888742223202877654204578710010023322222044850199999999977998999938987 Q gi|254780536|r 202 FHSSLMTPVSKVMKWMLDSVTKQDPVVPILPNFCASPVSSIDEISRLLVEQVTGRVRWRETIQWFANHGVKSVYEVGSGK 281 (314) Q Consensus 202 fHs~lm~~~~~~~~~~l~~i~~~~p~ipi~S~~~g~~~~~~~~~~~~l~~ql~~pV~f~~~i~~l~~~g~~~fiEiGP~~ 281 (314) .||+.|+| ++++..+....-.... ....+.|.. --.+..++|+|.+.-.......-+-++|+||.+ T Consensus 255 HHT~~Lsp--ekled~I~rf~~GeL~-------neeV~~DgG-----HGch~~evvd~eev~~iv~tgpk~p~levg~Dp 320 (342) T COG4012 255 HHTIRLSP--EKLEDQIIRFVEGELE-------NEEVYRDGG-----HGCHNVEVVDWEEVEDIVSTGPKKPELEVGRDP 320 (342) T ss_pred CCCCCCCH--HHHHHHHHHHHHCCCC-------CCEEECCCC-----CCEEEECCCCCHHHHHHCCCCCCCCHHHHCCCC T ss_conf 25654799--9999999999735566-------531431788-----742132255704232110578887065517993 Q ss_pred H Q ss_conf 9 Q gi|254780536|r 282 V 282 (314) Q Consensus 282 ~ 282 (314) - T Consensus 321 e 321 (342) T COG4012 321 E 321 (342) T ss_pred H T ss_conf 0 No 73 >TIGR02472 sucr_P_syn_N sucrose-phosphate synthase, putative, glycosyltransferase domain; InterPro: IPR012822 This family consists of the N-terminal regions, or in some cases the entirety, of bacterial proteins closely related to plant sucrose-phosphate synthases (SPS). The C-terminal domain (IPR012821 from INTERPRO), found with most members of this family, resembles both bona fide plant sucrose-phosphate phosphatases (SPP) and the SPP-like domain of plant sucrose phosphate synthase (SPS). At least two members of this family lack the SPP-like domain, which may have binding or regulatory rather than enzymatic activity by analogy to plant SPS. This enzyme produces sucrose 6-phosphate and UDP from UDP-glucose and D-fructose 6-phosphate, and may be encoded near the gene for fructokinase.. Probab=34.56 E-value=26 Score=15.72 Aligned_cols=93 Identities=15% Similarity=0.168 Sum_probs=43.4 Q ss_pred CEEEECCCCCHHHHHHHHHHHHH--HHHCC---CC--CCCCCCCC-HHCCHHHHHHHHHHHHHHH--------------- Q ss_conf 21353033112899988888865--31003---55--55432342-0102037888874222320--------------- Q gi|254780536|r 167 GQVVISGLQDAVKCAADTCLNKG--AKRAV---FL--PVSAPFHS-SLMTPVSKVMKWMLDSVTK--------------- 223 (314) Q Consensus 167 ~q~visG~~~~l~~~~~~~~~~~--~~~~~---~L--~v~~afHs-~lm~~~~~~~~~~l~~i~~--------------- 223 (314) |=++|.|.++++.++.+..++-= .+..+ -| +|.+|=|. +-==|..=++++.=+++=. T Consensus 287 NLVlvlG~RdD~r~me~~qR~Vl~~vl~~iD~YDLYGkvAyPK~H~~~dvP~lYRLAA~~rGiFVNPALTEPFGLTLlEA 366 (445) T TIGR02472 287 NLVLVLGSRDDIRKMESSQREVLTEVLLLIDRYDLYGKVAYPKHHRPDDVPELYRLAARRRGIFVNPALTEPFGLTLLEA 366 (445) T ss_pred CEEEEECCCCCHHHCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCEEECCCCCCCCHHHHHHH T ss_conf 80887527788531215789999999876300024564026888881123267899986598676272125301689999 Q ss_pred CCCCCCCCCCCCCCCCC--------------CCHHHHHHHHHHCCCCCCH Q ss_conf 28776542045787100--------------1002332222204485019 Q gi|254780536|r 224 QDPVVPILPNFCASPVS--------------SIDEISRLLVEQVTGRVRW 259 (314) Q Consensus 224 ~~p~ipi~S~~~g~~~~--------------~~~~~~~~l~~ql~~pV~f 259 (314) +.--+|++.+-||-|-. |.+++..-|-+-+.++=+| T Consensus 367 AAcGLPivAT~DGGP~dI~~~C~NGLLvd~ld~e~i~~AL~~alsd~~QW 416 (445) T TIGR02472 367 AACGLPIVATEDGGPRDIIANCRNGLLVDVLDLEAIASALEQALSDSSQW 416 (445) T ss_pred HHCCCCEEECCCCCCHHHHHHCCCCCEECCCCHHHHHHHHHHHCCCHHHH T ss_conf 97699721078648668884288875005789899999999733890667 No 74 >pfam03447 NAD_binding_3 Homoserine dehydrogenase, NAD binding domain. This domain adopts a Rossmann NAD binding fold. The C-terminal domain of homoserine dehydrogenase contributes a single helix to this structural domain, which is not included in the Pfam model. Probab=34.18 E-value=35 Score=14.94 Aligned_cols=42 Identities=12% Similarity=0.026 Sum_probs=26.9 Q ss_pred HHHHHHHHH-CCCCEEEEECCCHHHHHHHHHHCCCCCEEECCC Q ss_conf 999999997-799899993898799999998738981552399 Q gi|254780536|r 260 RETIQWFAN-HGVKSVYEVGSGKVLTGLAKRIDKSLSAVSISK 301 (314) Q Consensus 260 ~~~i~~l~~-~g~~~fiEiGP~~~Lt~l~~~~~~~~~~~~~~~ 301 (314) ...++.+.+ ..++++||+||......++++.++...++-..| T Consensus 47 ~~d~~~ll~~~~iDvVVE~~g~~~~~~~~~~aL~~GkhVVTaN 89 (116) T pfam03447 47 TLDLDDLVADPRPDVVVECASSEAVAEYVLKALKAGKHVVTAS 89 (116) T ss_pred ECCHHHHHCCCCCCEEEECCCCHHHHHHHHHHHHCCCEEEEEC T ss_conf 7799999618899899988994899999999998799899907 No 75 >pfam06821 DUF1234 Alpha/Beta hydrolase family of unknown function (DUF1234). The Crystal Structure Of The Yden Gene Product from B. Subtilis has been solved. The structure shows an alpha-beta hydrolase fold suggesting an enzymatic function for these proteins. Probab=34.15 E-value=11 Score=17.88 Aligned_cols=18 Identities=22% Similarity=0.154 Sum_probs=14.3 Q ss_pred CCCCCCCCCCCCHHHHHH Q ss_conf 544246654420122253 Q gi|254780536|r 86 DISYVAGHSLGEYTALCA 103 (314) Q Consensus 86 ~p~~v~GhSlGE~aAl~~ 103 (314) +|.+++|||+|=++++-. T Consensus 55 ~~~ilVaHSLGc~~~l~~ 72 (171) T pfam06821 55 GPVVLVAHSLGCLAVAHW 72 (171) T ss_pred CCEEEEEECHHHHHHHHH T ss_conf 985999735789999999 No 76 >KOG1454 consensus Probab=34.10 E-value=16 Score=16.94 Aligned_cols=21 Identities=43% Similarity=0.458 Sum_probs=16.3 Q ss_pred CCCCCCCCCCCCHHHHHHCCC Q ss_conf 544246654420122253178 Q gi|254780536|r 86 DISYVAGHSLGEYTALCAAKA 106 (314) Q Consensus 86 ~p~~v~GhSlGE~aAl~~aG~ 106 (314) +|..++|||+|=+.|+.+|-. T Consensus 128 ~~~~lvghS~Gg~va~~~Aa~ 148 (326) T KOG1454 128 EPVSLVGHSLGGIVALKAAAY 148 (326) T ss_pred CCEEEEEECHHHHHHHHHHHH T ss_conf 723898567088999999986 No 77 >PRK00107 gidB glucose-inhibited division protein B; Reviewed Probab=33.93 E-value=35 Score=14.92 Aligned_cols=17 Identities=24% Similarity=0.315 Sum_probs=8.0 Q ss_pred EEECCCCCHHHHHHHHH Q ss_conf 35303311289998888 Q gi|254780536|r 169 VVISGLQDAVKCAADTC 185 (314) Q Consensus 169 ~visG~~~~l~~~~~~~ 185 (314) +|+|---..+..+.+.+ T Consensus 139 ~itsRAva~l~~l~~~~ 155 (216) T PRK00107 139 VVTSRAVASLSDLVELC 155 (216) T ss_pred EEEEEHHHCHHHHHHHH T ss_conf 89860540699999999 No 78 >TIGR01050 rpsS_bact ribosomal protein S19; InterPro: IPR005732 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . The small ribosomal subunit protein S19 contains 88-144 amino acid residues. In Escherichia coli, S19 is known to form a complex with S13 that binds strongly to 16S ribosomal RNA. Experimental evidence has revealed that S19 is moderately exposed on the ribosomal surface.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015935 small ribosomal subunit. Probab=33.55 E-value=14 Score=17.34 Aligned_cols=36 Identities=19% Similarity=0.344 Sum_probs=22.0 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 034566665478999971887643544246654420122 Q gi|254780536|r 62 QPALTAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTA 100 (314) Q Consensus 62 Qp~i~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aA 100 (314) -|-+....++.++=-..-.|... +-++||=|||+|- T Consensus 41 ~P~mvG~t~~vhNG~~~ipvyi~---e~mVGhKLGEFap 76 (92) T TIGR01050 41 IPEMVGHTIAVHNGKKFIPVYIT---EEMVGHKLGEFAP 76 (92) T ss_pred CCCCCCCEEEEECCCEEEEEEEE---CCCCCCCCCCCCC T ss_conf 56312306787039667426860---3312420167554 No 79 >TIGR02707 butyr_kinase butyrate kinase; InterPro: IPR011245 This group represents bacterial butyrate kinase, an enzyme that facilitates the formation of butyryl-CoA by phosphorylating butyrate in the presence of ATP to form butyryl phosphate . The final steps in butyrate synthesis by anaerobic bacteria can occur via butyrate kinase and phosphotransbutyrylase or via butyryl-CoA:acetate CoA-transferase, the latter providing the dominant route for butyrate formation in human colonic bacteria .; GO: 0005524 ATP binding, 0047761 butyrate kinase activity, 0016310 phosphorylation, 0005737 cytoplasm. Probab=32.72 E-value=23 Score=16.01 Aligned_cols=33 Identities=21% Similarity=0.520 Sum_probs=17.9 Q ss_pred CHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCH Q ss_conf 012225317865023333345688653100111122222123503 Q gi|254780536|r 97 EYTALCAAKAFSLSDTIRLVRARGKSMQEAVPPGLGSMVAIIGLD 141 (314) Q Consensus 97 E~aAl~~aG~ls~eda~~lv~~R~~~m~~~~~~~~g~M~aV~~~~ 141 (314) ++.-+|.||-+|+++--|.+. |.|+|+|=+|.+ T Consensus 221 dlv~lCySGkyT~~E~kKkI~------------G~GGl~ayLGTn 253 (353) T TIGR02707 221 DLVDLCYSGKYTKEEMKKKIV------------GKGGLVAYLGTN 253 (353) T ss_pred HHHHHHHCCCHHHHHHHHHHC------------CCCCCEEECCCC T ss_conf 899987276103889999852------------674533254558 No 80 >PRK08609 hypothetical protein; Provisional Probab=32.29 E-value=37 Score=14.75 Aligned_cols=68 Identities=19% Similarity=0.280 Sum_probs=38.8 Q ss_pred CCCCCHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCEEEEE-- Q ss_conf 323420102037888874222320287765420457871001002332222204485019999999997799899993-- Q gi|254780536|r 200 APFHSSLMTPVSKVMKWMLDSVTKQDPVVPILPNFCASPVSSIDEISRLLVEQVTGRVRWRETIQWFANHGVKSVYEV-- 277 (314) Q Consensus 200 ~afHs~lm~~~~~~~~~~l~~i~~~~p~ipi~S~~~g~~~~~~~~~~~~l~~ql~~pV~f~~~i~~l~~~g~~~fiEi-- 277 (314) +..||.+-.+-.+.-+..++.+ ..|.+.++..-||+.+...+. -||+|.+.++.+.+.|+ ++|| T Consensus 434 asvHs~~~~~~~~~t~r~~~Ai--~np~v~ii~HptgR~l~~r~~----------~~~d~~~v~~~aa~~~~--~lEINa 499 (570) T PRK08609 434 AAIHSSFSQSEEEIMKRLETAC--RNPYVRLIAHPTGRLIGRRDG----------YDVNIDQLIELAKETNT--VLELNA 499 (570) T ss_pred EEECCCCCCCHHHHHHHHHHHH--HCCCCCEEECCCCCCCCCCCC----------CCCCHHHHHHHHHHHCC--EEEEEC T ss_conf 9813445798899999999997--189987884887542477788----------87799999999998597--999809 Q ss_pred CCCH Q ss_conf 8987 Q gi|254780536|r 278 GSGK 281 (314) Q Consensus 278 GP~~ 281 (314) .|.. T Consensus 500 ~p~r 503 (570) T PRK08609 500 NPNR 503 (570) T ss_pred CCCC T ss_conf 9876 No 81 >cd07217 Pat17_PNPLA8_PNPLA9_like4 Patatin-like phospholipase. Patatin is a storage protein of the potato tuber that shows Phospholipase A2 activity (PLA2; EC 3.1.1.4). Patatin catalyzes the nonspecific hydrolysis of phospholipids, glycolipids, sulfolipids, and mono- and diacylglycerols, thereby showing lipid acyl hydrolase activity. The active site includes an oxyanion hole with a conserved GGxR motif; it is found in almost all the members of this family. The catalytic dyad is formed by a serine and an aspartate. Patatin belongs to the alpha-beta hydrolase family which is identified by a characteristic nucleophile elbow with a consensus sequence of Sm-X-Nu-Sm (Sm = small residue, X = any residue and Nu = nucleophile). Members of this family have been found also in vertebrates. This family includes subfamily of PNPLA8 (iPLA2-gamma) and PNPLA9 (iPLA2-beta) like phospholipases from human as well as the Pat17 isozyme from Solanum cardiophyllum. Probab=32.27 E-value=27 Score=15.60 Aligned_cols=38 Identities=29% Similarity=0.493 Sum_probs=30.9 Q ss_pred CCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHH Q ss_conf 54424665442012225317865023333345688653 Q gi|254780536|r 86 DISYVAGHSLGEYTALCAAKAFSLSDTIRLVRARGKSM 123 (314) Q Consensus 86 ~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~R~~~m 123 (314) .-|+++|-|.|-+-|+..+--++.+|.+.+-..+|.-+ T Consensus 41 ~FDlI~GTSTGgIiA~~l~~g~~~~~i~~~Y~~~G~~I 78 (344) T cd07217 41 YFDFVGGTSTGSIIAACIALGMSVTDLLSFYTLNGVNM 78 (344) T ss_pred HEEEEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHH T ss_conf 12146438689999999977999999999999853775 No 82 >COG2819 Predicted hydrolase of the alpha/beta superfamily [General function prediction only] Probab=31.47 E-value=25 Score=15.84 Aligned_cols=22 Identities=23% Similarity=0.344 Sum_probs=14.2 Q ss_pred HCCCCCCCCCCCCCCCCCCCHH Q ss_conf 7188764354424665442012 Q gi|254780536|r 78 KNGLCVKRDISYVAGHSLGEYT 99 (314) Q Consensus 78 ~~gi~~~~~p~~v~GhSlGE~a 99 (314) +....++.++..+.|||+|=+- T Consensus 129 e~~y~~~~~~~~i~GhSlGGLf 150 (264) T COG2819 129 EARYRTNSERTAIIGHSLGGLF 150 (264) T ss_pred HCCCCCCCCCCEEEEECCHHHH T ss_conf 6006368655156520410388 No 83 >CHL00050 rps19 ribosomal protein S19 Probab=31.37 E-value=15 Score=17.12 Aligned_cols=13 Identities=38% Similarity=0.787 Sum_probs=9.7 Q ss_pred CCCCCCCCCCCCHH Q ss_conf 54424665442012 Q gi|254780536|r 86 DISYVAGHSLGEYT 99 (314) Q Consensus 86 ~p~~v~GhSlGE~a 99 (314) .++ ++||-|||++ T Consensus 63 ~~~-MvGhklGEF~ 75 (92) T CHL00050 63 TDQ-MVGHKLGEFA 75 (92) T ss_pred CCC-CCCCEEECCC T ss_conf 737-1371201110 No 84 >COG1506 DAP2 Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism] Probab=30.59 E-value=40 Score=14.58 Aligned_cols=143 Identities=20% Similarity=0.077 Sum_probs=60.6 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHCCCCC-HHHHHHHHH--HHHHHHHHHCCCCCCCCCCCCCCHHHHHHH Q ss_conf 47899997188764354424665442012225317865-023333345--688653100111122222123503553112 Q gi|254780536|r 71 AFIRVMEKNGLCVKRDISYVAGHSLGEYTALCAAKAFS-LSDTIRLVR--ARGKSMQEAVPPGLGSMVAIIGLDDCVVDS 147 (314) Q Consensus 71 a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~aG~ls-~eda~~lv~--~R~~~m~~~~~~~~g~M~aV~~~~~~~~~~ 147 (314) +.++.|.+.++....+. ++.|||-|=|.++++++..+ |.-++-... ..-..+........+.+-...+.+...-+. T Consensus 459 ~~~~~l~~~~~~d~~ri-~i~G~SyGGymtl~~~~~~~~f~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 537 (620) T COG1506 459 AAVDALVKLPLVDPERI-GITGGSYGGYMTLLAATKTPRFKAAVAVAGGVDWLLYFGESTEGLRFDPEENGGGPPEDREK 537 (620) T ss_pred HHHHHHHHCCCCCCCEE-EEEEECHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHCCCHHHCCCHHHHCCCCCCCHHH T ss_conf 99999997788661025-99820889999999986588625999915641035664036714314878764687544588 Q ss_pred HHHHHHCCCCCEECCCCCCCEEEECCCCC------HHHHHHHHHHHHHHHHCCCCCCCCCCCCHHC-CHHHHHHHHHHH Q ss_conf 23442113542001223222135303311------2899988888865310035555432342010-203788887422 Q gi|254780536|r 148 ICAQASRVGICEIANDNGGGQVVISGLQD------AVKCAADTCLNKGAKRAVFLPVSAPFHSSLM-TPVSKVMKWMLD 219 (314) Q Consensus 148 ~~~~~~~~~~~~ia~~Ns~~q~visG~~~------~l~~~~~~~~~~~~~~~~~L~v~~afHs~lm-~~~~~~~~~~l~ 219 (314) +. .+.+..++-++.+|- .+|-|..+ +-+++...++..|. .+...-.+.--|...- +.....++++++ T Consensus 538 ~~---~~sp~~~~~~i~~P~-LliHG~~D~~v~~~q~~~~~~aL~~~g~-~~~~~~~p~e~H~~~~~~~~~~~~~~~~~ 611 (620) T COG1506 538 YE---DRSPIFYADNIKTPL-LLIHGEEDDRVPIEQAEQLVDALKRKGK-PVELVVFPDEGHGFSRPENRVKVLKEILD 611 (620) T ss_pred HH---HHCCHHHHHCCCCCE-EEECCCCCCCCCHHHHHHHHHHHHHCCC-EEEEEEECCCCCCCCCHHHHHHHHHHHHH T ss_conf 87---629563540257987-9983278878897999999999997797-58999957999887846889999999999 No 85 >cd07221 Pat_PNPLA3 Patatin-like phospholipase domain containing protein 3. PNPLA3 is a triacylglycerol lipase that mediates triacylglycerol hydrolysis in adipocytes and is an indicator of the nutritional state. PNPLA3 is also known as adiponutrin (ADPN) or iPLA2-epsilon. Human adiponutrins are bound to the cell membrane of adipocytes and show transacylase, TG hydrolase, and PLA2 activity. This family includes patatin-like proteins: ADPN (adiponutrin) from mammals, PNPLA3 (Patatin-like phospholipase domain-containing protein 3), and iPLA2-epsilon (Calcium-independent phospholipase A2) from Homo sapiens. Probab=30.30 E-value=40 Score=14.55 Aligned_cols=54 Identities=15% Similarity=0.241 Sum_probs=34.9 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHH Q ss_conf 566665478999971887643544246654420122253178650233333456 Q gi|254780536|r 65 LTAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALCAAKAFSLSDTIRLVRA 118 (314) Q Consensus 65 i~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~ 118 (314) +....++.+..|+++|-..-.+...+.|-|-|-++|++.+-.+++++..+.+.. T Consensus 11 Lg~YHiGVa~~L~e~~p~ll~~~~~~aGaSAGAL~aa~l~~g~~~~~~~~~~~~ 64 (252) T cd07221 11 LGFYHVGVTRCLSERAPHLLRDARMFFGASAGALHCVTFLSGLPLDQILQILMD 64 (252) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 675899999999996258763466576524889999999809999999999999 No 86 >COG3146 Uncharacterized protein conserved in bacteria [Function unknown] Probab=30.04 E-value=8.9 Score=18.51 Aligned_cols=24 Identities=21% Similarity=0.493 Sum_probs=17.2 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCH Q ss_conf 0199999999977998999938987 Q gi|254780536|r 257 VRWRETIQWFANHGVKSVYEVGSGK 281 (314) Q Consensus 257 V~f~~~i~~l~~~g~~~fiEiGP~~ 281 (314) |-+.+.|+...+.|... +|-|.+. T Consensus 308 ~CYyQ~Id~aI~~gl~~-feaGAqG 331 (387) T COG3146 308 VCYYQAIDFAIAEGLQR-FEAGAQG 331 (387) T ss_pred HHHHHHHHHHHHHCCCE-ECCCCCC T ss_conf 87753789999808834-4688775 No 87 >TIGR02201 heptsyl_trn_III lipopolysaccharide heptosyltransferase III, putative; InterPro: IPR011916 This family consists of examples of the putative ADP-heptose:LPS heptosyltransferase III, an enzyme of LPS inner core region biosynthesis. LPS, composed of lipid A, a core region, and O antigen, is found in the outer membrane of Gram-negative bacteria. This enzyme may be less widely distributed than heptosyltransferases I and II.. Probab=29.95 E-value=41 Score=14.51 Aligned_cols=43 Identities=19% Similarity=0.210 Sum_probs=35.2 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHCCCCCEEEC Q ss_conf 0199999999977998999938987999999987389815523 Q gi|254780536|r 257 VRWRETIQWFANHGVKSVYEVGSGKVLTGLAKRIDKSLSAVSI 299 (314) Q Consensus 257 V~f~~~i~~l~~~g~~~fiEiGP~~~Lt~l~~~~~~~~~~~~~ 299 (314) =+|+..++++..+|..+++--||..-=..++..|..+++.-.+ T Consensus 203 drfsali~AL~a~Gy~vVLTsgPD~~e~~mV~~I~~g~~~~rv 245 (347) T TIGR02201 203 DRFSALIDALAAEGYEVVLTSGPDKDELKMVDEIAAGCRSDRV 245 (347) T ss_pred HHHHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHCCCCCCC T ss_conf 3589999988518956988289898679999999711689870 No 88 >pfam01981 PTH2 Peptidyl-tRNA hydrolase PTH2. Peptidyl-tRNA hydrolases are enzymes that release tRNAs from peptidyl-tRNA during translation. Probab=29.79 E-value=41 Score=14.50 Aligned_cols=62 Identities=11% Similarity=0.046 Sum_probs=28.1 Q ss_pred CCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCC Q ss_conf 66544201222531786502333334568865310011112222212350355311223442113 Q gi|254780536|r 91 AGHSLGEYTALCAAKAFSLSDTIRLVRARGKSMQEAVPPGLGSMVAIIGLDDCVVDSICAQASRV 155 (314) Q Consensus 91 ~GhSlGE~aAl~~aG~ls~eda~~lv~~R~~~m~~~~~~~~g~M~aV~~~~~~~~~~~~~~~~~~ 155 (314) ++.|.|.++|-|+=.++.. +.....+..-.-+.+....-.-+.+...+++++.++.....+. T Consensus 12 L~Ms~GKiaAQ~~HAav~~---~~~~~~~~~~~~~~W~~~G~~KIvlkv~de~~L~~l~~~a~~~ 73 (116) T pfam01981 12 LKMGKGKIAAQCAHAAVGA---YKKALKNNPELLKAWEREGQKKIVLKVNSEEELLELKEKAKSL 73 (116) T ss_pred CCCCCCHHHHHHHHHHHHH---HHHHHHHCHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHC T ss_conf 8989545999987999999---9999987989999999779967999729999999999999984 No 89 >KOG2377 consensus Probab=29.63 E-value=27 Score=15.58 Aligned_cols=48 Identities=23% Similarity=0.302 Sum_probs=37.0 Q ss_pred CCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 5442466544201222531786502333334568865310011112222212 Q gi|254780536|r 86 DISYVAGHSLGEYTALCAAKAFSLSDTIRLVRARGKSMQEAVPPGLGSMVAI 137 (314) Q Consensus 86 ~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~R~~~m~~~~~~~~g~M~aV 137 (314) .||.++--++| |+++.. .+.+--+.++..|+++|..+.....|.|+++ T Consensus 344 qPdiiiDa~~g---~~w~l~-~~~epL~~ll~dR~rl~~fllQr~~Gk~~~l 391 (657) T KOG2377 344 QPDIIIDASQG---YLWNLQ-VKLEPLVNLLPDRGRLMDFLLQRKEGKMVIL 391 (657) T ss_pred ECCEEEECCCC---HHHHHH-CCHHHHHHHHHHHHHHHHHHHHHHCCHHHHH T ss_conf 06588861014---124553-1267899987776889999987611400325 No 90 >pfam00282 Pyridoxal_deC Pyridoxal-dependent decarboxylase conserved domain. Probab=29.54 E-value=41 Score=14.47 Aligned_cols=72 Identities=15% Similarity=0.143 Sum_probs=41.5 Q ss_pred HHHHHHHHHHHCCCCCEECCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCH-HCCHHHHHHHHHHHHH Q ss_conf 531122344211354200122322213530331128999888888653100355554323420-1020378888742223 Q gi|254780536|r 143 CVVDSICAQASRVGICEIANDNGGGQVVISGLQDAVKCAADTCLNKGAKRAVFLPVSAPFHSS-LMTPVSKVMKWMLDSV 221 (314) Q Consensus 143 ~~~~~~~~~~~~~~~~~ia~~Ns~~q~visG~~~~l~~~~~~~~~~~~~~~~~L~v~~afHs~-lm~~~~~~~~~~l~~i 221 (314) +.+++.+++....+.. +.++.+.-=++..|..+.++++.+.+++.+. -|+|++||=-. ++. ++++..+.++ T Consensus 178 ~~L~~~i~~~~~~g~~-p~~VvataGTT~tGaiD~l~~i~~i~~~~~l----WlHVDaA~GG~~~l~---~~~r~~~~gi 249 (370) T pfam00282 178 MDLEKAIEEDKENGLI-PFFVCATLGTTGSGAFDPLQELGDICNKYDL----WLHVDAAYAGSAFIC---PEFRHWLFGI 249 (370) T ss_pred HHHHHHHHHHHHCCCC-CEEEEEECCCCCCCCCCCHHHHHHHHHHCCC----EEEEEHHHHHHHHHC---HHHHHHHCCC T ss_conf 9999999999867997-4499986377177134789999999998398----088423333478829---8557775698 Q ss_pred H Q ss_conf 2 Q gi|254780536|r 222 T 222 (314) Q Consensus 222 ~ 222 (314) . T Consensus 250 e 250 (370) T pfam00282 250 E 250 (370) T ss_pred C T ss_conf 7 No 91 >TIGR01048 lysA diaminopimelate decarboxylase; InterPro: IPR002986 Pyridoxal-dependent decarboxylases that act on ornithine-, lysine-, arginine- and related substrates can be classified into different families on the basis of sequence similarity , . One of these families includes ornithine decarboxylase (ODC), which catalyses the transformation of ornithine into putrescine; prokaryotic diaminopimelic acid decarboxylase (DAPDC), which catalyses the conversion of diaminopimelic acid into lysine, the final step of lysine biosynthesis; Pseudomonas syringae pv. tabaci protein, tabA, which is probably involved in tabtoxin biosynthesis and is similar to DAPDC; and bacterial and plant biosynthetic arginine decarboxylase (ADC), which catalyses the transformation of arginine into agmatine, the first step in putrescine synthesis from arginine. Although these proteins, which are known collectively as group IV decarboxylases , probably share a common evolutionary origin, their levels of sequence similarity are low, being confined to a few short conserved regions. These conserved motifs suggest a common structural arrangement for positioning of substrate and the cofactor pyridoxal 5'-phosphate among bacterial DAP decarboxylases, eukaryotic ornithine decarboxylases and arginine decarboxylases . The structure of bacterial diaminopimelate decarboxylase has been determined . ; GO: 0008836 diaminopimelate decarboxylase activity, 0009089 lysine biosynthetic process via diaminopimelate. Probab=29.04 E-value=34 Score=14.98 Aligned_cols=32 Identities=25% Similarity=0.236 Sum_probs=26.4 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHCCC Q ss_conf 665478999971887643544246654420122253178 Q gi|254780536|r 68 VSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALCAAKA 106 (314) Q Consensus 68 ~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~aG~ 106 (314) .++++.+++.+.|.. -|. -|.||+-.+.+||. T Consensus 64 ~Nl~~~~~la~~G~G----~D~---VS~GEL~~A~~AG~ 95 (431) T TIGR01048 64 SNLALLRLLAELGSG----FDV---VSGGELYRALAAGF 95 (431) T ss_pred CCHHHHHHHHHCCCC----EEE---EEHHHHHHHHHHCC T ss_conf 768899999963884----688---70169999998088 No 92 >COG0596 MhpC Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only] Probab=28.23 E-value=29 Score=15.45 Aligned_cols=29 Identities=31% Similarity=0.450 Sum_probs=18.6 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHC Q ss_conf 789999718876435442466544201222531 Q gi|254780536|r 72 FIRVMEKNGLCVKRDISYVAGHSLGEYTALCAA 104 (314) Q Consensus 72 ~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~a 104 (314) +...+...|+. +-.++|||+|.+.|+.++ T Consensus 78 ~~~~~~~~~~~----~~~l~G~S~Gg~~~~~~~ 106 (282) T COG0596 78 LAALLDALGLE----KVVLVGHSMGGAVALALA 106 (282) T ss_pred HHHHHHHHCCC----CEEEEEECHHHHHHHHHH T ss_conf 99999982888----669999886699999999 No 93 >pfam00756 Esterase Putative esterase. This family contains Esterase D. However it is not clear if all members of the family have the same function. This family is related to the pfam00135 family. Probab=28.23 E-value=18 Score=16.73 Aligned_cols=16 Identities=44% Similarity=0.555 Sum_probs=13.8 Q ss_pred CCCCCCCCCHHHHHHC Q ss_conf 2466544201222531 Q gi|254780536|r 89 YVAGHSLGEYTALCAA 104 (314) Q Consensus 89 ~v~GhSlGE~aAl~~a 104 (314) +++|||+|=+.|++.+ T Consensus 114 ai~G~SmGG~~Al~~a 129 (243) T pfam00756 114 ALAGQSMGGLGALYLA 129 (243) T ss_pred EEEECCCHHHHHHHHH T ss_conf 7985287699999999 No 94 >pfam12000 DUF3495 Domain of unknown function (DUF3495). This presumed domain is functionally uncharacterized. This domain is found in bacteria. This domain is about 170 amino acids in length. This domain is found associated with pfam00534. This domain has a single completely conserved residue G that may be functionally important. Probab=28.21 E-value=32 Score=15.11 Aligned_cols=22 Identities=23% Similarity=0.430 Sum_probs=16.3 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCC-CCC Q ss_conf 78999971887643544246654-420 Q gi|254780536|r 72 FIRVMEKNGLCVKRDISYVAGHS-LGE 97 (314) Q Consensus 72 ~~~~l~~~gi~~~~~p~~v~GhS-lGE 97 (314) .+..|++.|. .||.++||| -|| T Consensus 56 aa~~L~~~Gf----~PDvIi~H~GWGe 78 (172) T pfam12000 56 AARQLRAQGF----RPDVIVAHPGWGE 78 (172) T ss_pred HHHHHHHCCC----CCCEEEECCCCCC T ss_conf 9999997499----9998987587640 No 95 >cd06450 DOPA_deC_like DOPA decarboxylase family. This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD correspond to DOPA/tyrosine decarboxylase (DDC), histidine decarboxylase (HDC), and glutamate decarboxylase (GDC). DDC is active as a dimer and catalyzes the decarboxylation of tyrosine. GDC catalyzes the decarboxylation of glutamate and HDC catalyzes the decarboxylation of histidine. Probab=27.81 E-value=44 Score=14.29 Aligned_cols=75 Identities=15% Similarity=0.124 Sum_probs=41.6 Q ss_pred HHHHHHHHHHHHHCCCCCEECCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHCCHHHHHHHHHHHH Q ss_conf 35531122344211354200122322213530331128999888888653100355554323420102037888874222 Q gi|254780536|r 141 DDCVVDSICAQASRVGICEIANDNGGGQVVISGLQDAVKCAADTCLNKGAKRAVFLPVSAPFHSSLMTPVSKVMKWMLDS 220 (314) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~ia~~Ns~~q~visG~~~~l~~~~~~~~~~~~~~~~~L~v~~afHs~lm~~~~~~~~~~l~~ 220 (314) +.+.+++.+++....+..-++ +.+.-=++.+|..++++++.+.+++.+. .|+|++||=-..+- .++++..+.+ T Consensus 131 d~~~L~~~i~~~~~~g~~p~~-vvataGTT~~GaiD~l~~i~~i~~~~~i----wlHVDaA~GG~~~~--~~~~~~~~~g 203 (345) T cd06450 131 DPEALEAAIDEDKAEGLNPIM-VVATAGTTDTGAIDPLEEIADLAEKYDL----WLHVDAAYGGFLLP--FPEPRHLDFG 203 (345) T ss_pred CHHHHHHHHHHHHHCCCCCEE-EEECCCCCCCCCCCCHHHHHHHHHHCCC----CEEEECHHCCHHHC--CCCCCHHHCC T ss_conf 899999999999866997389-9955787676256789999999998499----47751000120110--6220076579 Q ss_pred HH Q ss_conf 32 Q gi|254780536|r 221 VT 222 (314) Q Consensus 221 i~ 222 (314) ++ T Consensus 204 ie 205 (345) T cd06450 204 IE 205 (345) T ss_pred CC T ss_conf 99 No 96 >PRK01810 DNA polymerase IV; Validated Probab=27.65 E-value=45 Score=14.27 Aligned_cols=26 Identities=19% Similarity=0.212 Sum_probs=10.7 Q ss_pred HHHHHCCCCCCCCCCCCCCCCCCCHHHHHHCC Q ss_conf 99997188764354424665442012225317 Q gi|254780536|r 74 RVMEKNGLCVKRDISYVAGHSLGEYTALCAAK 105 (314) Q Consensus 74 ~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~aG 105 (314) +++++.|++ . -+|-|---+-|=.|++ T Consensus 132 ~I~~~~gl~----~--SiGIa~nK~lAKiAs~ 157 (410) T PRK01810 132 RLLTELQLP----C--SIGIAPNKFLAKMASD 157 (410) T ss_pred HHHHHHCCC----E--EEEECCCHHHHHHHHH T ss_conf 999986997----4--8844651999999886 No 97 >pfam05057 DUF676 Putative serine esterase (DUF676). This family of proteins are probably serine esterase type enzymes with an alpha/beta hydrolase fold. Probab=27.58 E-value=13 Score=17.60 Aligned_cols=22 Identities=41% Similarity=0.449 Sum_probs=16.6 Q ss_pred CCCCCCCCCCCCHHHHHHCCCCC Q ss_conf 54424665442012225317865 Q gi|254780536|r 86 DISYVAGHSLGEYTALCAAKAFS 108 (314) Q Consensus 86 ~p~~v~GhSlGE~aAl~~aG~ls 108 (314) +.+ ++|||||=+.+-+|.|.+. T Consensus 76 kIS-fIGhSLGGLi~RyAl~~l~ 97 (212) T pfam05057 76 KIS-FVGHSLGGLIARSAIGYLY 97 (212) T ss_pred EEE-EEEECHHHHHHHHHHHHHH T ss_conf 599-9964424799999999887 No 98 >pfam10230 DUF2305 Uncharacterized conserved protein (DUF2305). This family of proteins is conserved from plants to humans. The function is unknown. Probab=27.53 E-value=16 Score=16.89 Aligned_cols=16 Identities=44% Similarity=0.644 Sum_probs=12.7 Q ss_pred CCCCCCCCCCCHHHHH Q ss_conf 4424665442012225 Q gi|254780536|r 87 ISYVAGHSLGEYTALC 102 (314) Q Consensus 87 p~~v~GhSlGE~aAl~ 102 (314) +-.++|||.|-|-|+= T Consensus 80 k~iLiGHSIGayi~le 95 (260) T pfam10230 80 KLILIGHSIGAYIALE 95 (260) T ss_pred CEEEEECCHHHHHHHH T ss_conf 5799833468999999 No 99 >KOG0820 consensus Probab=27.36 E-value=45 Score=14.24 Aligned_cols=22 Identities=32% Similarity=0.436 Sum_probs=11.1 Q ss_pred HHHHCCCCCCCCCCCC--CCCCCCCHH Q ss_conf 9997188764354424--665442012 Q gi|254780536|r 75 VMEKNGLCVKRDISYV--AGHSLGEYT 99 (314) Q Consensus 75 ~l~~~gi~~~~~p~~v--~GhSlGE~a 99 (314) ...+..+. .+|.+ +|-..|-++ T Consensus 50 I~~ka~~k---~tD~VLEvGPGTGnLT 73 (315) T KOG0820 50 IVEKADLK---PTDVVLEVGPGTGNLT 73 (315) T ss_pred HHHCCCCC---CCCEEEEECCCCCHHH T ss_conf 98604789---9877999579877899 No 100 >PRK13942 protein-L-isoaspartate O-methyltransferase; Provisional Probab=27.19 E-value=46 Score=14.22 Aligned_cols=37 Identities=19% Similarity=0.146 Sum_probs=20.0 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 65034566665478999971887643544246654420122253 Q gi|254780536|r 60 NAQPALTAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALCA 103 (314) Q Consensus 60 ~~Qp~i~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~ 103 (314) .+||.+.+. ++....+.+..+. .-+|--.|=.+|+.+ T Consensus 59 is~P~~~a~------ml~~L~l~~~~~V-LeIGtGsGY~tAlla 95 (214) T PRK13942 59 ISAIHMVAI------MCELLDLDEGQKV-LEIGTGSGYHAAVVA 95 (214) T ss_pred ECHHHHHHH------HHHHHCCCCCCEE-EEECCCCCHHHHHHH T ss_conf 874999999------9997279999979-996799529999999 No 101 >KOG3101 consensus Probab=27.12 E-value=23 Score=16.07 Aligned_cols=16 Identities=38% Similarity=0.414 Sum_probs=13.3 Q ss_pred CCCCCCCCCHHHHHHC Q ss_conf 2466544201222531 Q gi|254780536|r 89 YVAGHSLGEYTALCAA 104 (314) Q Consensus 89 ~v~GhSlGE~aAl~~a 104 (314) .+.|||||-..|++++ T Consensus 144 ~IfGHSMGGhGAl~~~ 159 (283) T KOG3101 144 GIFGHSMGGHGALTIY 159 (283) T ss_pred CEECCCCCCCCEEEEE T ss_conf 0540356777627999 No 102 >KOG2382 consensus Probab=26.71 E-value=42 Score=14.41 Aligned_cols=12 Identities=42% Similarity=0.667 Sum_probs=9.7 Q ss_pred CCCCCCCCCCCC Q ss_conf 544246654420 Q gi|254780536|r 86 DISYVAGHSLGE 97 (314) Q Consensus 86 ~p~~v~GhSlGE 97 (314) .|..++|||+|= T Consensus 123 ~~~~l~GHsmGG 134 (315) T KOG2382 123 DPVVLLGHSMGG 134 (315) T ss_pred CCCEECCCCCCH T ss_conf 783240357636 No 103 >cd04174 Rnd1_Rho6 Rnd1/Rho6 subfamily. Rnd1/Rho6 is a member of the novel Rho subfamily Rnd, together with Rnd2/Rho7 and Rnd3/RhoE/Rho8. Rnd1/Rho6 binds GTP but does not hydrolyze it to GDP, indicating that it is constitutively active. In rat, Rnd1/Rho6 is highly expressed in the cerebral cortex and hippocampus during synapse formation, and plays a role in spine formation. Rnd1/Rho6 is also expressed in the liver and in endothelial cells, and is upregulated in uterine myometrial cells during pregnancy. Like Rnd3/RhoE/Rho8, Rnd1/Rho6 is believed to function as an antagonist to RhoA. Most Rho proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Rho proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation. Probab=26.45 E-value=47 Score=14.14 Aligned_cols=61 Identities=13% Similarity=0.080 Sum_probs=39.1 Q ss_pred HHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCEEEEECCC Q ss_conf 2320287765420457871001002332222204485019999999997799899993898 Q gi|254780536|r 220 SVTKQDPVVPILPNFCASPVSSIDEISRLLVEQVTGRVRWRETIQWFANHGVKSVYEVGSG 280 (314) Q Consensus 220 ~i~~~~p~ipi~S~~~g~~~~~~~~~~~~l~~ql~~pV~f~~~i~~l~~~g~~~fiEiGP~ 280 (314) ++.-..|.+|++--=+-.-+++.......|.++-..||.+.++...+.+.|...|+|.+.- T Consensus 110 Ei~~~~p~~piiLVGnK~DLr~d~~~l~~L~~~~~~pVt~eeg~~~Ak~iga~~Y~E~SA~ 170 (232) T cd04174 110 EIMDYCPSTRILLIGCKTDLRTDLSTLMELSNQKQAPISYEQGCALAKQLGAEVYLECSAF 170 (232) T ss_pred HHHHHCCCCCEEEEEECHHCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCEEEECCCC T ss_conf 9998689997899987602154757788997568887579999999997499789987568 No 104 >TIGR01025 rpsS_arch ribosomal protein S19; InterPro: IPR005713 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . This family represents eukaryotic ribosomal protein S15 and its archaeal equivalent. It excludes bacterial and organellar ribosomal protein S19. The nomenclature for the archaeal members is unresolved and given variously as S19 (after the more distant bacterial homologs) or S15.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015935 small ribosomal subunit. Probab=26.33 E-value=19 Score=16.50 Aligned_cols=15 Identities=33% Similarity=0.629 Sum_probs=8.8 Q ss_pred CCCCCCCCCCCCCCHH Q ss_conf 4354424665442012 Q gi|254780536|r 84 KRDISYVAGHSLGEYT 99 (314) Q Consensus 84 ~~~p~~v~GhSlGE~a 99 (314) ..+|. ++||-|||++ T Consensus 97 ~i~PE-MIGhylgEF~ 111 (136) T TIGR01025 97 EIKPE-MIGHYLGEFA 111 (136) T ss_pred EEEEE-EECCCHHHCC T ss_conf 74200-3122000000 No 105 >cd02407 PTH2_family Peptidyl-tRNA hydrolase, type 2 (PTH2)_like . Peptidyl-tRNA hydrolase activity releases tRNA from the premature translation termination product peptidyl-tRNA. Two structurally different enzymes have been reported to encode such activity, Pth present in bacteria and eukaryotes and Pth2 present in archaea and eukaryotes. Probab=25.75 E-value=45 Score=14.23 Aligned_cols=17 Identities=29% Similarity=0.399 Sum_probs=12.5 Q ss_pred CCCCCCCHHHHHHCCCC Q ss_conf 66544201222531786 Q gi|254780536|r 91 AGHSLGEYTALCAAKAF 107 (314) Q Consensus 91 ~GhSlGE~aAl~~aG~l 107 (314) ++.|.|.++|-++=.++ T Consensus 11 L~Ms~GKiaAQ~~HAav 27 (115) T cd02407 11 LKMGKGKIAAQCAHAAL 27 (115) T ss_pred CCCCHHHHHHHHHHHHH T ss_conf 99884089999999999 No 106 >PRK08775 homoserine O-acetyltransferase; Provisional Probab=25.54 E-value=40 Score=14.56 Aligned_cols=29 Identities=21% Similarity=0.311 Sum_probs=21.3 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHH Q ss_conf 47899997188764354424665442012225 Q gi|254780536|r 71 AFIRVMEKNGLCVKRDISYVAGHSLGEYTALC 102 (314) Q Consensus 71 a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~ 102 (314) +...++...||. +-.+|+|.|+|=.-||. T Consensus 126 aq~~Lld~LGI~---~l~AVIGGSmGGMqALe 154 (343) T PRK08775 126 AIAALLDALGIR---RLHGFVGYSYGALVGLQ 154 (343) T ss_pred HHHHHHHHCCCC---EEEEEECCCHHHHHHHH T ss_conf 999999862988---67423157889999999 No 107 >KOG2564 consensus Probab=25.04 E-value=39 Score=14.60 Aligned_cols=21 Identities=29% Similarity=0.382 Sum_probs=14.3 Q ss_pred CCCCCCCCCCCCHHHHHHCCC Q ss_conf 544246654420122253178 Q gi|254780536|r 86 DISYVAGHSLGEYTALCAAKA 106 (314) Q Consensus 86 ~p~~v~GhSlGE~aAl~~aG~ 106 (314) .+-+++|||+|--.|.+++-. T Consensus 146 ~~iilVGHSmGGaIav~~a~~ 166 (343) T KOG2564 146 PQIILVGHSMGGAIAVHTAAS 166 (343) T ss_pred CCEEEEECCCCCHHHHHHHHH T ss_conf 866998426662332306565 No 108 >pfam11288 DUF3089 Protein of unknown function (DUF3089). This family of proteins has no known function. Probab=24.99 E-value=36 Score=14.82 Aligned_cols=15 Identities=33% Similarity=0.346 Sum_probs=11.7 Q ss_pred CCCCCCCCCCCCCCH Q ss_conf 435442466544201 Q gi|254780536|r 84 KRDISYVAGHSLGEY 98 (314) Q Consensus 84 ~~~p~~v~GhSlGE~ 98 (314) +.+|=.++|||+|-. T Consensus 93 ~grPfILagHSQGs~ 107 (201) T pfam11288 93 GGRPFILAGHSQGAL 107 (201) T ss_pred CCCCEEEEEECHHHH T ss_conf 999679998356899 No 109 >KOG1579 consensus Probab=24.97 E-value=50 Score=13.97 Aligned_cols=166 Identities=16% Similarity=0.154 Sum_probs=78.0 Q ss_pred CCCCCCCCCHHHHHHCCC---------CCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC-CHHHHHHHHHHHHHC--CC Q ss_conf 246654420122253178---------650233333456886531001111222221235-035531122344211--35 Q gi|254780536|r 89 YVAGHSLGEYTALCAAKA---------FSLSDTIRLVRARGKSMQEAVPPGLGSMVAIIG-LDDCVVDSICAQASR--VG 156 (314) Q Consensus 89 ~v~GhSlGE~aAl~~aG~---------ls~eda~~lv~~R~~~m~~~~~~~~g~M~aV~~-~~~~~~~~~~~~~~~--~~ 156 (314) +++| |.|.+.|.++-|+ .++++..+.-..|.+...+.-. ..++.-. ++..+.+.++....+ .. T Consensus 115 ~v~g-siGp~~A~l~~g~eytg~Y~~~~~~~el~~~~k~qle~~~~~gv----D~L~fETip~~~EA~a~l~~l~~~~~~ 189 (317) T KOG1579 115 YVAG-SIGPYGATLADGSEYTGIYGDNVEFEELYDFFKQQLEVFLEAGV----DLLAFETIPNVAEAKAALELLQELGPS 189 (317) T ss_pred EEEE-ECCCCCCEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCC----CEEEEEECCCHHHHHHHHHHHHHCCCC T ss_conf 3565-12663045337733115655665899999999999999986798----789986148788899999999962888 Q ss_pred -CCEECCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf -4200122322213530331128999888888653100355554323420102037888874222320287765420457 Q gi|254780536|r 157 -ICEIANDNGGGQVVISGLQDAVKCAADTCLNKGAKRAVFLPVSAPFHSSLMTPVSKVMKWMLDSVTKQDPVVPILPNFC 235 (314) Q Consensus 157 -~~~ia~~Ns~~q~visG~~~~l~~~~~~~~~~~~~~~~~L~v~~afHs~lm~~~~~~~~~~l~~i~~~~p~ipi~S~~~ 235 (314) .++|+.-=-+.-...+|+. ++.+.... +.+. +..-+.++. +|-+...+...++. ...+++|++.--+ T Consensus 190 ~p~~is~t~~d~g~l~~G~t--~e~~~~~~-~~~~-~~~~IGvNC-~~~~~~~~~~~~L~-------~~~~~~~llvYPN 257 (317) T KOG1579 190 KPFWISFTIKDEGRLRSGET--GEEAAQLL-KDGI-NLLGIGVNC-VSPNFVEPLLKELM-------AKLTKIPLLVYPN 257 (317) T ss_pred CCEEEEEEECCCCCCCCCCC--HHHHHHHH-CCCC-CEEEEEECC-CCCHHCCHHHHHHH-------HCCCCCCEEEECC T ss_conf 86799999668771267873--99999986-0687-438998335-78112008999986-------3027980899469 Q ss_pred -CCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCEE Q ss_conf -871001002332222204485019999999997799899 Q gi|254780536|r 236 -ASPVSSIDEISRLLVEQVTGRVRWRETIQWFANHGVKSV 274 (314) Q Consensus 236 -g~~~~~~~~~~~~l~~ql~~pV~f~~~i~~l~~~g~~~f 274 (314) |..+... ...|...-.....|..-++.+.+.|++.+ T Consensus 258 sGe~yd~~---~g~~~~~~~~~~~~~~~~~~~~~lGv~iI 294 (317) T KOG1579 258 SGEVYDNE---KGGWIPTPFGLEPWQTYVKKAIDLGVRII 294 (317) T ss_pred CCCCCCCC---CCCCCCCCCCCCHHHHHHHHHHHCCCCEE T ss_conf 88877665---68505898554418999999996055132 No 110 >TIGR02688 TIGR02688 conserved hypothetical protein TIGR02688; InterPro: IPR014061 Members of this entry are uncharacterised proteins sporadically distributed in bacteria and archaea, about 470 amino acids in length. Several members of this family appear in public databases with annotation as ATP-dependent protease La, despite the lack of similarity to families IPR004815 from INTERPRO (ATP-dependent protease La) or IPR003111 from INTERPRO (ATP-dependent protease La (LON) domain). The proteins in this entry are encoded by genes repeatedly found downstream of another gene that encodes an uncharacterised protein of about 880 amino acids in length (see IPR014060 from INTERPRO).. Probab=24.91 E-value=47 Score=14.13 Aligned_cols=52 Identities=12% Similarity=0.192 Sum_probs=26.2 Q ss_pred HHCCCCCCHHHHHHHHHHCCCCEEEEECCC-H----------HHHHHHHHHCCCCCEEECCCHHHH-HHHHH Q ss_conf 204485019999999997799899993898-7----------999999987389815523998999-99998 Q gi|254780536|r 251 EQVTGRVRWRETIQWFANHGVKSVYEVGSG-K----------VLTGLAKRIDKSLSAVSISKVEDI-DLALR 310 (314) Q Consensus 251 ~ql~~pV~f~~~i~~l~~~g~~~fiEiGP~-~----------~Lt~l~~~~~~~~~~~~~~~~~~~-~~~l~ 310 (314) +.|++--.+++.+ +.++++|++ . +.|||+|=..|+..-.+--+.+++ ...++ T Consensus 389 ~~lR~~~~~~~~~--------~r~~~L~~~r~~~RD~~avk~~~SGl~KlLFPh~~Dssq~tk~e~~~~~~~ 452 (470) T TIGR02688 389 RELREDHEYADLV--------DRHFSLSENRLNARDVKAVKKLFSGLLKLLFPHGTDSSQITKEELRKEILE 452 (470) T ss_pred HHHCCCCCHHHHH--------HHHHCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHH T ss_conf 9841637866786--------676406755200153678999987776641689875454567899999999 No 111 >KOG3847 consensus Probab=24.62 E-value=28 Score=15.50 Aligned_cols=24 Identities=33% Similarity=0.253 Sum_probs=12.0 Q ss_pred CCCCCCCCCCCCHHHHHHCCCCCH Q ss_conf 544246654420122253178650 Q gi|254780536|r 86 DISYVAGHSLGEYTALCAAKAFSL 109 (314) Q Consensus 86 ~p~~v~GhSlGE~aAl~~aG~ls~ 109 (314) -|-++.-|.||-.--+|.+=+.++ T Consensus 118 ~PvvvFSHGLggsRt~YSa~c~~L 141 (399) T KOG3847 118 YPVVVFSHGLGGSRTLYSAYCTSL 141 (399) T ss_pred CCEEEEECCCCCCHHHHHHHHHHH T ss_conf 658999346663034398876647 No 112 >pfam01764 Lipase_3 Lipase (class 3). Probab=24.42 E-value=22 Score=16.12 Aligned_cols=13 Identities=54% Similarity=0.623 Sum_probs=8.9 Q ss_pred CCCCCCCCCHHHH Q ss_conf 2466544201222 Q gi|254780536|r 89 YVAGHSLGEYTAL 101 (314) Q Consensus 89 ~v~GhSlGE~aAl 101 (314) .+.|||||=-.|. T Consensus 66 ~itGHSLGGa~A~ 78 (141) T pfam01764 66 VVTGHSLGGALAS 78 (141) T ss_pred EEEECCHHHHHHH T ss_conf 9980565789999 No 113 >pfam11015 DUF2853 Protein of unknown function (DUF2853). This bacterial family of proteins has no known function. Probab=24.41 E-value=51 Score=13.91 Aligned_cols=41 Identities=10% Similarity=0.083 Sum_probs=27.3 Q ss_pred CHHHHHHHHHHHHHCCCCCEECCCCCCCEEEECCCCCHHHHHHHHHH Q ss_conf 03553112234421135420012232221353033112899988888 Q gi|254780536|r 140 LDDCVVDSICAQASRVGICEIANDNGGGQVVISGLQDAVKCAADTCL 186 (314) Q Consensus 140 ~~~~~~~~~~~~~~~~~~~~ia~~Ns~~q~visG~~~~l~~~~~~~~ 186 (314) ++.+.++.+... +-+|.+|....+|-+-.++.|+.+.+.+. T Consensus 14 pD~dll~kv~~~------lGpsiyn~Das~Vs~Sd~~ELe~Vk~nfl 54 (102) T pfam11015 14 PDEDLLEKVTKG------LGPSIYNRDSSTVSASDPKELERVKENFL 54 (102) T ss_pred CCHHHHHHHHHH------CCCCEECCCCCHHCCCCHHHHHHHHHHHH T ss_conf 889999999960------57200457501320588899999998089 No 114 >PRK09240 thiH thiamine biosynthesis protein ThiH; Reviewed Probab=24.35 E-value=51 Score=13.90 Aligned_cols=13 Identities=15% Similarity=0.204 Sum_probs=4.4 Q ss_pred EEECCCHHHHHHH Q ss_conf 5523998999999 Q gi|254780536|r 296 AVSISKVEDIDLA 308 (314) Q Consensus 296 ~~~~~~~~~~~~~ 308 (314) -..+++..+.+.. T Consensus 336 QF~i~D~Rs~~Ev 348 (371) T PRK09240 336 QFEISDDRSVEEV 348 (371) T ss_pred CCCCCCCCCHHHH T ss_conf 6537999899999 No 115 >TIGR02812 fadR_gamma fatty acid metabolism transcriptional regulator FadR; InterPro: IPR014178 Members of this entry are transcriptional regulators of fatty acid metabolism (FadR), including both biosynthesis and beta-oxidation reactions, which are found exclusively in a subset of Gammaproteobacteria with strictly one copy per genome. Proteins have an N-terminal DNA-binding domain and a less well conserved C-terminal long chain acyl-CoA-binding domain. FadR when expressed in Escherichia coli shows differences in regulatory response and fatty acid binding profiles. All members of this entry are expected to have the same function.; GO: 0000062 acyl-CoA binding, 0003677 DNA binding, 0003700 transcription factor activity, 0019217 regulation of fatty acid metabolic process. Probab=24.24 E-value=30 Score=15.35 Aligned_cols=53 Identities=11% Similarity=0.192 Sum_probs=25.7 Q ss_pred CCCCHHHHHCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHH Q ss_conf 38986783046765034566665478999971887643544246654420122253178650233333456886 Q gi|254780536|r 48 WNGSQEELTATCNAQPALTAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALCAAKAFSLSDTIRLVRARGK 121 (314) Q Consensus 48 ~~~~~~~l~~~~~~Qp~i~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~R~~ 121 (314) +.|...+.|+-+- +.++-..+.|.... +-++=+ .+...+|..+=+..++.|.. T Consensus 63 QHGKPTKVNnfWE------TSGLNILeTL~~LD-----------~~~~P~----l~dnLLSARTnIS~IYir~A 115 (275) T TIGR02812 63 QHGKPTKVNNFWE------TSGLNILETLIRLD-----------GESVPS----LIDNLLSARTNISAIYIRAA 115 (275) T ss_pred CCCCCCCCCCHHH------CCCCCHHHHHHHHC-----------CCCCCH----HHHHHHHHHHCCCHHHHHHH T ss_conf 5889986276243------02735789998618-----------777707----99999977641377889998 No 116 >cd07218 Pat_iPLA2 Calcium-independent phospholipase A2; Classified as Group IVA-1 PLA2. Calcium-independent phospholipase A2; otherwise known as Group IVA-1 PLA2. It contains the lipase consensus sequence (Gly-X-Ser-X-Gly);mutagenesis experiments confirm the role of this serine as a nucleophile. Some members of this group show triacylglycerol lipase activity (EC 3:1:1:3). Members include iPLA-1, iPLA-2, and iPLA-3 from Aedes aegypti and show acylglycerol transacylase/lipase activity. Also includes putative iPLA2-eta from Pediculus humanus corporis which shows patatin-like phospholipase activity. Probab=24.14 E-value=52 Score=13.87 Aligned_cols=52 Identities=15% Similarity=0.199 Sum_probs=34.2 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHH Q ss_conf 566665478999971887643544246654420122253178650233333456 Q gi|254780536|r 65 LTAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALCAAKAFSLSDTIRLVRA 118 (314) Q Consensus 65 i~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~aG~ls~eda~~lv~~ 118 (314) +....++.+..|+++|.... .+.+.|-|-|-++|++.+=.+++++..+.+.. T Consensus 11 Lg~YhlGVa~~L~e~~~~l~--~~~~~GaSAGaL~A~~l~~~~~~~~~~~~~~~ 62 (245) T cd07218 11 LGIYHVGVAVCLKKYAPHLL--LNKISGASAGALAACCLLCDLPLGEMTSDFLR 62 (245) T ss_pred HHHHHHHHHHHHHHHHHHHH--CCEEEEEHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 68799999999998637865--67578882999999999839999999999999 No 117 >KOG4409 consensus Probab=24.01 E-value=29 Score=15.38 Aligned_cols=66 Identities=14% Similarity=0.133 Sum_probs=32.0 Q ss_pred HHHHHHHCCCCCCHHH--HHHHHHHC----------CCCEEEEECCCHHHHHHH-HH------HCCCCCEEECCCHHHHH Q ss_conf 3222220448501999--99999977----------998999938987999999-98------73898155239989999 Q gi|254780536|r 246 SRLLVEQVTGRVRWRE--TIQWFANH----------GVKSVYEVGSGKVLTGLA-KR------IDKSLSAVSISKVEDID 306 (314) Q Consensus 246 ~~~l~~ql~~pV~f~~--~i~~l~~~----------g~~~fiEiGP~~~Lt~l~-~~------~~~~~~~~~~~~~~~~~ 306 (314) .++...++.+|.-|.+ -++.+..- |-+.+++--.|--.++.. ++ +....-++-+++++... T Consensus 275 gE~~fk~l~~~~g~Ar~Pm~~r~~~l~~~~pv~fiyG~~dWmD~~~g~~~~~~~~~~~~~~~~v~~aGHhvylDnp~~Fn 354 (365) T KOG4409 275 GETAFKNLFEPGGWARRPMIQRLRELKKDVPVTFIYGDRDWMDKNAGLEVTKSLMKEYVEIIIVPGAGHHVYLDNPEFFN 354 (365) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCCEEECCCHHHHH T ss_conf 78999998752622340699999862668977999448410023357989977620555079956997336547989999 Q ss_pred HHHHH Q ss_conf 99987 Q gi|254780536|r 307 LALRS 311 (314) Q Consensus 307 ~~l~~ 311 (314) ..+.. T Consensus 355 ~~v~~ 359 (365) T KOG4409 355 QIVLE 359 (365) T ss_pred HHHHH T ss_conf 99999 No 118 >pfam01135 PCMT Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT). Probab=23.97 E-value=52 Score=13.86 Aligned_cols=37 Identities=24% Similarity=0.216 Sum_probs=20.3 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 65034566665478999971887643544246654420122253 Q gi|254780536|r 60 NAQPALTAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALCA 103 (314) Q Consensus 60 ~~Qp~i~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~ 103 (314) .+||.+.+. ++....+.+..+. .-+|-..|=.+|+.+ T Consensus 56 is~P~~~a~------ml~~L~l~~g~~V-LeIGtGsGY~tAlLa 92 (205) T pfam01135 56 ISAPHMHAM------MLELLELKPGMRV-LEIGSGSGYLTACFA 92 (205) T ss_pred ECHHHHHHH------HHHHCCCCCCCEE-EEECCCCCHHHHHHH T ss_conf 863899999------9997078999989-996699659999999 No 119 >COG0357 GidB Predicted S-adenosylmethionine-dependent methyltransferase involved in bacterial cell division [Cell envelope biogenesis, outer membrane] Probab=23.59 E-value=53 Score=13.81 Aligned_cols=50 Identities=10% Similarity=-0.074 Sum_probs=19.9 Q ss_pred EEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHCCHHHHHHHHHHHHHH Q ss_conf 353033112899988888865310035555432342010203788887422232 Q gi|254780536|r 169 VVISGLQDAVKCAADTCLNKGAKRAVFLPVSAPFHSSLMTPVSKVMKWMLDSVT 222 (314) Q Consensus 169 ~visG~~~~l~~~~~~~~~~~~~~~~~L~v~~afHs~lm~~~~~~~~~~l~~i~ 222 (314) +|.|---..+..+.+.+...... -..-.+++-..++...++.+....... T Consensus 138 ~vtsRAva~L~~l~e~~~pllk~----~g~~~~~k~~~~~~e~~e~~~a~~~~~ 187 (215) T COG0357 138 VVTSRAVASLNVLLELCLPLLKV----GGGFLAYKGLAGKDELPEAEKAILPLG 187 (215) T ss_pred EEEEEEECCHHHHHHHHHHHCCC----CCCCHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 89854202568899999984346----883202658765556899999987626 No 120 >COG0627 Predicted esterase [General function prediction only] Probab=23.25 E-value=24 Score=15.86 Aligned_cols=17 Identities=53% Similarity=0.745 Sum_probs=14.3 Q ss_pred CCCCCCCCCCHHHHHHC Q ss_conf 42466544201222531 Q gi|254780536|r 88 SYVAGHSLGEYTALCAA 104 (314) Q Consensus 88 ~~v~GhSlGE~aAl~~a 104 (314) .+++|||+|=+.|+..| T Consensus 154 ~aI~G~SMGG~GAl~lA 170 (316) T COG0627 154 RAIAGHSMGGYGALKLA 170 (316) T ss_pred CEEEEEECCCHHHHHHH T ss_conf 31489851437777776 No 121 >PRK00357 rpsS 30S ribosomal protein S19; Reviewed Probab=23.17 E-value=26 Score=15.68 Aligned_cols=14 Identities=36% Similarity=0.781 Sum_probs=9.7 Q ss_pred CCCCCCCCCCCCHHH Q ss_conf 544246654420122 Q gi|254780536|r 86 DISYVAGHSLGEYTA 100 (314) Q Consensus 86 ~p~~v~GhSlGE~aA 100 (314) .++ ++||-+||++- T Consensus 63 ~~~-MvGhklGEFa~ 76 (92) T PRK00357 63 TEN-MVGHKLGEFAP 76 (92) T ss_pred CCC-CCCCEEECCCC T ss_conf 736-13712111003 No 122 >TIGR02418 acolac_catab acetolactate synthase, catabolic; InterPro: IPR012782 Acetolactate synthase is a thiamin pyrophosphate-dependent enzyme that combines two molecules of pyruvate to yield 2-acetolactate with the release of CO2. It exists in two distinct forms that have different properties, though they are clearly related , . The biosynthetic form, found in plants, fungi and bacteria, is involved in the biosynthesis of branched-chain amino acids and contains non-catalytic FAD. The catabolic form is found only in some bacteria, participates in butanediol fermentation, and does not contain FAD. This entry represents the catabolic form of acetolactate synthase, generally found in species that posses acetolactate decarboxylase, which is often encoded in the same operon. The enzyme is thought to be a homotetramer, though this has not conclusively been shown. Each monomer consists of three distinct domains, designated, starting from the N-terminus, as alpha, beta and gamma . The core of the alpha and gamma domains is a six-stranded parallel beta sheet surrounded by alpha helices, while the the beta domain forms a double Rossman fold. The thiamin pyrophosphate cofactor has an unusual conformation that has not been seen in other enzymes using this cofactor, including the biosynthetic form of acetolactate synthase. The active site is located in a funnel at the dimer interface, bordered by residues from all three domains.; GO: 0000287 magnesium ion binding, 0003984 acetolactate synthase activity, 0030976 thiamin pyrophosphate binding, 0050660 FAD binding. Probab=22.85 E-value=39 Score=14.65 Aligned_cols=108 Identities=14% Similarity=0.168 Sum_probs=49.1 Q ss_pred CCCCCCCCCHHCCHHHHHHHHH--H-----HH----HHH------CCCCCCCCCCC-CCCCCCCCH----------HHHH Q ss_conf 5554323420102037888874--2-----22----320------28776542045-787100100----------2332 Q gi|254780536|r 196 LPVSAPFHSSLMTPVSKVMKWM--L-----DS----VTK------QDPVVPILPNF-CASPVSSID----------EISR 247 (314) Q Consensus 196 L~v~~afHs~lm~~~~~~~~~~--l-----~~----i~~------~~p~ipi~S~~-~g~~~~~~~----------~~~~ 247 (314) +.|+..+|.=+|+..-..|+.+ | +. +++ =.|..-++|=. ||-.+.+.- .|.. T Consensus 388 vt~D~GShyIWmARyf~Sy~AR~lLiSNGmQTlGVALPWAI~AaLlrP~~KVVSVSGDGGFLfSsmELETAVRLk~nIvH 467 (553) T TIGR02418 388 VTVDIGSHYIWMARYFKSYRARHLLISNGMQTLGVALPWAIGAALLRPNTKVVSVSGDGGFLFSSMELETAVRLKLNIVH 467 (553) T ss_pred EEEECCHHHHHHHHHHCCCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHCCEEE T ss_conf 98704305888887622567540243357503111218999999738897589987283120025788999984089489 Q ss_pred -HHHH-HCCCCCCHHHHHHHH-HHCCCCEEEEECCCHHHHHHHHHH-CCCCCEEECCCHHHHHHHHHH Q ss_conf -2222-044850199999999-977998999938987999999987-389815523998999999987 Q gi|254780536|r 248 -LLVE-QVTGRVRWRETIQWF-ANHGVKSVYEVGSGKVLTGLAKRI-DKSLSAVSISKVEDIDLALRS 311 (314) Q Consensus 248 -~l~~-ql~~pV~f~~~i~~l-~~~g~~~fiEiGP~~~Lt~l~~~~-~~~~~~~~~~~~~~~~~~l~~ 311 (314) .|.+ .-++-|.|.+.+++= ...| ||+||=. ++|=. .=+.+-+.|++.|+++..|+. T Consensus 468 IiWnDn~~YnMV~fQE~~KY~GR~sG----VdFGpvD----fvkYAESFGA~GlrV~~~d~L~~tLr~ 527 (553) T TIGR02418 468 IIWNDNNKYNMVEFQEEMKYKGRSSG----VDFGPVD----FVKYAESFGAKGLRVESPDELEPTLRQ 527 (553) T ss_pred EEEECCCCCCHHHHHHHHCCCCCCCC----EEECCCC----CEEECCCCCCCCCCCCCHHHHHHHHHH T ss_conf 99704899766789776323552013----1006605----021010248432432897899999874 No 123 >KOG3975 consensus Probab=22.80 E-value=16 Score=16.89 Aligned_cols=24 Identities=17% Similarity=0.266 Sum_probs=15.3 Q ss_pred EEECCCCC---CCCCCHHHHHHHHCHH Q ss_conf 99808733---3731158999985999 Q gi|254780536|r 4 VLTFPGQG---SQVIGMGRDLYDSFPE 27 (314) Q Consensus 4 ~f~FpGQG---sq~~gM~~~L~~~~p~ 27 (314) ++.-||.- ..|..|++.|+.+.+. T Consensus 32 i~~IpGNPG~~gFY~~F~~~L~~~l~~ 58 (301) T KOG3975 32 IVWIPGNPGLLGFYTEFARHLHLNLID 58 (301) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHCCC T ss_conf 999669998325789999999986443 No 124 >COG0185 RpsS Ribosomal protein S19 [Translation, ribosomal structure and biogenesis] Probab=22.26 E-value=27 Score=15.61 Aligned_cols=16 Identities=31% Similarity=0.644 Sum_probs=10.9 Q ss_pred CCCCCCCCCCCCCCHHH Q ss_conf 43544246654420122 Q gi|254780536|r 84 KRDISYVAGHSLGEYTA 100 (314) Q Consensus 84 ~~~p~~v~GhSlGE~aA 100 (314) ...|+ ++||=|||++- T Consensus 61 ~I~~e-MVGHkLGEFap 76 (93) T COG0185 61 EITEE-MVGHKLGEFAP 76 (93) T ss_pred EECHH-HCCEECCCCCC T ss_conf 82642-33614033155 No 125 >pfam03096 Ndr Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known to be involved in cellular differentiation events. The Ndr1 group was the first to be discovered. Their expression is repressed by the proto-oncogenes N-myc and c-myc, and in line with this observation, Ndr1 protein expression is down-regulated in neoplastic cells, and is reactivated when differentiation is induced by chemicals such as retinoic acid. Ndr2 and Ndr3 expression is not under the control of N-myc or c-myc. Ndr1 expression is also activated by several chemicals: tunicamycin and homocysteine induce Ndr1 in human umbilical endothelial cells; nickel induces Ndr1 in several cell types. Members of this family are found in wide variety of multicellular eukaryotes, including an Ndr1 type protein in Helianthus annuus (su Probab=22.04 E-value=44 Score=14.31 Aligned_cols=30 Identities=20% Similarity=0.138 Sum_probs=22.3 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHC Q ss_conf 4789999718876435442466544201222531 Q gi|254780536|r 71 AFIRVMEKNGLCVKRDISYVAGHSLGEYTALCAA 104 (314) Q Consensus 71 a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~a 104 (314) -+..+|...|+ +-..++|||+|-+.++-.| T Consensus 88 ~l~~vld~l~i----~~~~~~G~s~G~~i~~~fA 117 (285) T pfam03096 88 MLPVVLDHFRL----KSVIGMGVGAGAYILARFA 117 (285) T ss_pred HHHHHHHHCCC----CEEEEEEECHHHHHHHHHH T ss_conf 99999997198----7699984367889999999 No 126 >cd00741 Lipase Lipase. Lipases are esterases that can hydrolyze long-chain acyl-triglycerides into di- and monoglycerides, glycerol, and free fatty acids at a water/lipid interface. A typical feature of lipases is "interfacial activation", the process of becoming active at the lipid/water interface, although several examples of lipases have been identified that do not undergo interfacial activation . The active site of a lipase contains a catalytic triad consisting of Ser - His - Asp/Glu, but unlike most serine proteases, the active site is buried inside the structure. A "lid" or "flap" covers the active site, making it inaccessible to solvent and substrates. The lid opens during the process of interfacial activation, allowing the lipid substrate access to the active site. Probab=21.57 E-value=26 Score=15.72 Aligned_cols=12 Identities=58% Similarity=0.703 Sum_probs=8.7 Q ss_pred CCCCCCCCCHHH Q ss_conf 246654420122 Q gi|254780536|r 89 YVAGHSLGEYTA 100 (314) Q Consensus 89 ~v~GhSlGE~aA 100 (314) .+.|||||==.| T Consensus 31 ~vtGHSLGgalA 42 (153) T cd00741 31 HVTGHSLGGALA 42 (153) T ss_pred EEEEECHHHHHH T ss_conf 999636278999 No 127 >COG4757 Predicted alpha/beta hydrolase [General function prediction only] Probab=21.50 E-value=51 Score=13.92 Aligned_cols=12 Identities=42% Similarity=0.731 Sum_probs=10.5 Q ss_pred CCCCCCCCCCCC Q ss_conf 544246654420 Q gi|254780536|r 86 DISYVAGHSLGE 97 (314) Q Consensus 86 ~p~~v~GhSlGE 97 (314) .|-+++|||+|- T Consensus 105 ~P~y~vgHS~GG 116 (281) T COG4757 105 HPLYFVGHSFGG 116 (281) T ss_pred CCEEEEECCCCC T ss_conf 963886214452 No 128 >TIGR02858 spore_III_AA stage III sporulation protein AA; InterPro: IPR014217 Proteins in this entry include the stage III sporulation protein AA that is encoded by one of several genes in the spoIIIA locus. This protein is only found in species that are capable of endospore formation.. Probab=21.10 E-value=9.5 Score=18.34 Aligned_cols=24 Identities=21% Similarity=0.350 Sum_probs=11.9 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCHHH Q ss_conf 8865310011112222212350355 Q gi|254780536|r 119 RGKSMQEAVPPGLGSMVAIIGLDDC 143 (314) Q Consensus 119 R~~~m~~~~~~~~g~M~aV~~~~~~ 143 (314) |...| +.||+.+|-||+|..++++ T Consensus 185 RtDVL-D~CPKAEGmMM~iRSMSP~ 208 (282) T TIGR02858 185 RTDVL-DGCPKAEGMMMLIRSMSPD 208 (282) T ss_pred CEEEC-CCCCHHHHHHHHHHCCCCC T ss_conf 06751-7885378999999706985 No 129 >cd07222 Pat_PNPLA4 Patatin-like phospholipase domain containing protein 4. PNPLA4, also known as GS2 (gene sequence-2), shows both lipase and transacylation activities. GS2 lipase is expressed in various tissues, predominantly in muscle and adipocytes tissue. It is also expressed in keratinocytes and shows retinyl ester hydrolase, acylglycerol, TG hydrolase, and PLA2 activity. This family includes patatin-like proteins: GS2 from mammals, PNPLA4 (Patatin-like phospholipase domain-containing protein 4), and iPLA2-eta (Calcium-independent phospholipase A2) from Homo sapiens. Probab=20.94 E-value=60 Score=13.48 Aligned_cols=50 Identities=14% Similarity=0.185 Sum_probs=31.5 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHC-CCCCHHHHHH Q ss_conf 5666654789999718876435442466544201222531-7865023333 Q gi|254780536|r 65 LTAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALCAA-KAFSLSDTIR 114 (314) Q Consensus 65 i~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~a-G~ls~eda~~ 114 (314) +....++.+..|+++|..+..++..+.|-|-|-++|++.+ +.-.+++..+ T Consensus 10 lg~YHlGV~~~L~e~g~~l~~~~~~~~GaSAGAlvaa~l~~~~~~l~~~~~ 60 (246) T cd07222 10 LGIYHLGAAKALLRHGKKLLKRVKRFAGASAGSLVAAVLLTAPEKIEECKE 60 (246) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHCCHHHHHHHH T ss_conf 899999999999986066513567677777999999999819457999999 No 130 >TIGR01967 DEAH_box_HrpA ATP-dependent helicase HrpA; InterPro: IPR010222 This entry represents HrpA, one of two related but uncharacterised DEAH-box ATP-dependent helicases in many Proteobacteria and a few high-GC Gram-positive bacteria. HrpA is about 1300 amino acids long, while its paralog HrpB, also uncharacterised, is about 800 amino acids long. Related characterised eukaryotic proteins are RNA helicases associated with pre-mRNA processing.; GO: 0005524 ATP binding, 0008026 ATP-dependent helicase activity. Probab=20.80 E-value=34 Score=14.99 Aligned_cols=166 Identities=10% Similarity=0.043 Sum_probs=75.7 Q ss_pred HHHHHHHHHHHHHCCCCCEECCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCC--C----CCCCCHHCCHHHHHH Q ss_conf 3553112234421135420012232221353033112899988888865310035555--4----323420102037888 Q gi|254780536|r 141 DDCVVDSICAQASRVGICEIANDNGGGQVVISGLQDAVKCAADTCLNKGAKRAVFLPV--S----APFHSSLMTPVSKVM 214 (314) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~ia~~Ns~~q~visG~~~~l~~~~~~~~~~~~~~~~~L~v--~----~afHs~lm~~~~~~~ 214 (314) +.+..++-..+.+.+-.+.+..+...+.-+| +.-.+|..|.+.+...-......++- . .+.|.--+.++-+.+ T Consensus 968 ~~edw~es~~~~P~HL~mnf~~VDd~grK~l-~~GRdL~~Lk~~~a~~~~~~~~~va~ts~~~E~~~l~~~~Fg~lp~~~ 1046 (1320) T TIGR01967 968 NPEDWDESVEQLPDHLRMNFRVVDDKGRKVL-AEGRDLAELKERLAGKVRAALSAVADTSEGIEKSGLTEWSFGELPEEI 1046 (1320) T ss_pred CHHHHHCCCCCCCHHCEEEEEEECCCCCEEE-CCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCHHHH T ss_conf 8545512501262100120469817887540-157885899999999999999863420133465403440146112789 Q ss_pred HHHH-HHHHHCC-CC---------CCCCCCCCCCCCCCCHHHHHHHHHHCCC-CCCHHHH-HHHHHHCCCCEEEEECCCH Q ss_conf 8742-2232028-77---------6542045787100100233222220448-5019999-9999977998999938987 Q gi|254780536|r 215 KWML-DSVTKQD-PV---------VPILPNFCASPVSSIDEISRLLVEQVTG-RVRWRET-IQWFANHGVKSVYEVGSGK 281 (314) Q Consensus 215 ~~~l-~~i~~~~-p~---------ipi~S~~~g~~~~~~~~~~~~l~~ql~~-pV~f~~~-i~~l~~~g~~~fiEiGP~~ 281 (314) .... .+..... |- |-+++|--..........++.|..|+-+ ||+|-+. +=.-.+.+. .|-+.|=.. T Consensus 1047 ~~~~e~n~~vta~PALvd~~~sV~ik~F~t~~~A~~~h~~gLrRLLLl~i~~nP~K~L~~~LP~~~~l~~-~~~~~G~~e 1125 (1320) T TIGR01967 1047 EQKVEGNYKVTAYPALVDDKGSVAIKLFDTQAEAEAAHRAGLRRLLLLQIPSNPVKMLQGKLPNKAKLAL-YYNPLGRVE 1125 (1320) T ss_pred HHHHHCCCEEEEEEEEEECCCEEEEEECCCHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHCCCHHHHHHH-HHCCCCCHH T ss_conf 9887379606898867765862288526888999999999999999863100037887516856677875-540035047 Q ss_pred HHHHHHHHHC---------CCCC--EEECCCHHHHHHHHHH Q ss_conf 9999999873---------8981--5523998999999987 Q gi|254780536|r 282 VLTGLAKRID---------KSLS--AVSISKVEDIDLALRS 311 (314) Q Consensus 282 ~Lt~l~~~~~---------~~~~--~~~~~~~~~~~~~l~~ 311 (314) .||++++ .+.. --.+-+.++++..++. T Consensus 1126 ---~Lv~D~~~Aa~d~l~~e~~g~lPe~VR~~a~F~al~~~ 1163 (1320) T TIGR01967 1126 ---VLVDDILLAAVDKLIFENGGVLPEQVRTEAEFAALLEK 1163 (1320) T ss_pred ---HHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHH T ss_conf ---89999999878999872688698778898999999974 No 131 >PRK13944 protein-L-isoaspartate O-methyltransferase; Provisional Probab=20.69 E-value=61 Score=13.45 Aligned_cols=38 Identities=21% Similarity=0.204 Sum_probs=21.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 765034566665478999971887643544246654420122253 Q gi|254780536|r 59 CNAQPALTAVSMAFIRVMEKNGLCVKRDISYVAGHSLGEYTALCA 103 (314) Q Consensus 59 ~~~Qp~i~~~~~a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~~ 103 (314) ..+||.+.+. ++....+.+..+ -.-+|-..|=.+|+.+ T Consensus 54 tis~P~~~a~------ml~~L~~~~~~~-VLeIGtGsGY~tAlla 91 (205) T PRK13944 54 TISAPHMVAM------MCELIEPRPGMK-ILEVGTGSGYQAAVCA 91 (205) T ss_pred EECHHHHHHH------HHHHHCCCCCCE-EEEECCCCCHHHHHHH T ss_conf 9977999999------999706899998-9997898519999999 No 132 >PRK00175 metX homoserine O-acetyltransferase; Provisional Probab=20.63 E-value=57 Score=13.62 Aligned_cols=38 Identities=29% Similarity=0.425 Sum_probs=24.7 Q ss_pred HHHHHHHHH--HHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHH Q ss_conf 034566665--47899997188764354424665442012225 Q gi|254780536|r 62 QPALTAVSM--AFIRVMEKNGLCVKRDISYVAGHSLGEYTALC 102 (314) Q Consensus 62 Qp~i~~~~~--a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl~ 102 (314) .|.+..-.. +..++|.+.||. +..+|+|-|+|=.-|+- T Consensus 123 FP~iti~D~V~aq~~Ll~~LGI~---~l~aViG~SmGG~qAle 162 (379) T PRK00175 123 FPVITIRDWVRAQARLLDALGIT---RLAAVVGGSMGGMQALQ 162 (379) T ss_pred CCCCCHHHHHHHHHHHHHHCCCC---EEEEEECCCHHHHHHHH T ss_conf 99452999999999999973989---79889755808999999 No 133 >PRK06765 homoserine O-acetyltransferase; Provisional Probab=20.39 E-value=62 Score=13.41 Aligned_cols=37 Identities=24% Similarity=0.394 Sum_probs=23.7 Q ss_pred HHHHHHHHH--HHHHHHHHCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 034566665--4789999718876435442466544201222 Q gi|254780536|r 62 QPALTAVSM--AFIRVMEKNGLCVKRDISYVAGHSLGEYTAL 101 (314) Q Consensus 62 Qp~i~~~~~--a~~~~l~~~gi~~~~~p~~v~GhSlGE~aAl 101 (314) .|.+..-.. +...+|.+.||. +..+|+|-|+|=.-|+ T Consensus 134 FP~vTi~D~V~aq~~LLd~LGI~---kl~aViGgSmGGmqAl 172 (387) T PRK06765 134 FPVVTILDFVRVQKELLEDLGIA---RLHAVMGPSMGGMQAQ 172 (387) T ss_pred CCCEEHHHHHHHHHHHHHHHCCC---EEEECCCCCHHHHHHH T ss_conf 99403288999999999983988---5721146581889999 No 134 >PRK13768 GTPase; Provisional Probab=20.12 E-value=63 Score=13.38 Aligned_cols=119 Identities=13% Similarity=0.188 Sum_probs=56.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEECCCCCCCEEEECCCCCHHHHHHHHHH Q ss_conf 65023333345688653100111122222123503553112234421135420012232221353033112899988888 Q gi|254780536|r 107 FSLSDTIRLVRARGKSMQEAVPPGLGSMVAIIGLDDCVVDSICAQASRVGICEIANDNGGGQVVISGLQDAVKCAADTCL 186 (314) Q Consensus 107 ls~eda~~lv~~R~~~m~~~~~~~~g~M~aV~~~~~~~~~~~~~~~~~~~~~~ia~~Ns~~q~visG~~~~l~~~~~~~~ 186 (314) ++..|.+.+ -..|++.--...|+++.-...-.+.++.+..+...... ..-.+.+|+|+=+--..+....+.+.+. T Consensus 51 iDIRd~i~~----~dVM~~~~LGPNGali~~~e~l~~~~d~l~~~i~~~~~-dY~i~D~PGQiElft~~~~~~~i~~~L~ 125 (253) T PRK13768 51 IDVREYVSA----REIMRKYGLGPNGALIASVDLLLTKAEEIKEEIERLDA-DYVLVDTPGQMELFAFRESGRYLVEKLS 125 (253) T ss_pred CCHHHHCCH----HHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCC-CEEEEECCCCEEEEECCHHHHHHHHHHH T ss_conf 637861789----99998819896468999999999989999999851588-7599826874432223407999999986 Q ss_pred HHHHHHCCCCCCCCCCCCHHCCHHHHHHHHHH----HHHHHCCCCCCCCCCCC Q ss_conf 86531003555543234201020378888742----22320287765420457 Q gi|254780536|r 187 NKGAKRAVFLPVSAPFHSSLMTPVSKVMKWML----DSVTKQDPVVPILPNFC 235 (314) Q Consensus 187 ~~~~~~~~~L~v~~afHs~lm~~~~~~~~~~l----~~i~~~~p~ipi~S~~~ 235 (314) +....-...| .-++.+.....-+...| .-+.+.-|.++++|-.| T Consensus 126 ~~~~~~~v~l-----~D~~~~~~~~~fiS~~L~a~s~m~~l~lP~inVlsK~D 173 (253) T PRK13768 126 SGSKSLSVYL-----IDAVLAKDPSDFVSLLLLALSVQLRLGLPQIPVLNKID 173 (253) T ss_pred CCCCEEEEEE-----ECHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEEEHHH T ss_conf 3686289998-----45056378879999999999999973999799867686 Done!