Query gi|254780770|ref|YP_003065183.1| (3R)-hydroxymyristoyl-ACP dehydratase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 161 No_of_seqs 121 out of 2072 Neff 6.9 Searched_HMMs 39220 Date Sun May 29 18:45:37 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780770.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01750 fabZ beta-hydroxyacy 100.0 0 0 374.9 16.1 139 11-149 1-142 (142) 2 PRK13188 bifunctional UDP-3-O- 100.0 0 0 332.7 18.5 147 8-154 318-465 (465) 3 PRK00006 fabZ (3R)-hydroxymyri 100.0 0 0 325.2 19.2 143 10-152 6-149 (149) 4 cd01288 FabZ FabZ is a 17kD be 100.0 1.4E-45 0 293.0 18.0 131 20-150 1-131 (131) 5 COG0764 FabA 3-hydroxymyristoy 100.0 9.9E-42 0 270.9 16.7 143 9-152 2-146 (147) 6 pfam07977 FabA FabA-like domai 100.0 3.1E-41 1.4E-45 267.9 15.3 124 20-144 1-133 (133) 7 cd00493 FabA_FabZ FabA/Z, beta 100.0 1.4E-38 3.7E-43 251.9 17.4 128 21-148 1-130 (131) 8 cd01287 FabA FabA, beta-hydrox 100.0 3.2E-28 8.2E-33 189.7 16.2 130 20-150 3-148 (150) 9 PRK05174 3-hydroxydecanoyl-(ac 99.9 1.2E-25 3.1E-30 174.2 15.4 131 19-153 28-169 (172) 10 cd01289 FabA_like Domain of un 99.9 5.8E-22 1.5E-26 152.2 15.0 133 15-151 2-138 (138) 11 TIGR01749 fabA beta-hydroxyacy 99.7 4.1E-17 1.1E-21 123.0 7.6 132 18-153 24-166 (169) 12 COG4706 Predicted 3-hydroxylac 99.5 6.1E-14 1.6E-18 104.0 6.8 122 15-141 11-138 (161) 13 PRK04424 fatty acid biosynthes 98.9 7.6E-08 1.9E-12 67.4 13.5 114 18-152 66-184 (185) 14 cd03451 FkbR2 FkbR2 is a Strep 98.4 7.6E-06 1.9E-10 55.4 10.7 86 60-152 52-145 (146) 15 cd03452 MaoC_C MaoC_C The C-t 98.4 6.5E-06 1.7E-10 55.8 10.3 113 33-152 3-141 (142) 16 cd03446 MaoC_like MoaC_like 98.3 9.2E-06 2.3E-10 54.9 10.6 85 60-149 49-140 (140) 17 cd03440 hot_dog The hotdog fol 98.2 3.4E-05 8.7E-10 51.5 11.2 85 60-147 14-99 (100) 18 cd03454 YdeM YdeM is a Bacillu 98.2 2.2E-05 5.5E-10 52.7 9.6 59 91-149 73-139 (140) 19 PRK08190 bifunctional enoyl-Co 98.1 8.6E-05 2.2E-09 49.1 11.1 117 33-157 12-149 (465) 20 cd03449 R_hydratase (R)-hydrat 98.0 6.1E-05 1.6E-09 50.0 9.2 81 60-148 44-127 (128) 21 PRK10694 acyl-CoA thioester hy 97.8 0.00058 1.5E-08 44.1 11.1 87 65-156 31-126 (133) 22 cd03447 FAS_MaoC FAS_MaoC, the 97.8 0.00035 8.8E-09 45.4 9.7 90 50-147 31-123 (126) 23 cd03441 R_hydratase_like (R)-h 97.8 0.0004 1E-08 45.0 9.8 90 51-147 32-126 (127) 24 cd03442 BFIT_BACH Brown fat-in 97.5 0.0038 9.6E-08 39.2 11.3 87 64-155 26-118 (123) 25 cd03453 SAV4209_like SAV4209_l 97.4 0.003 7.6E-08 39.8 10.7 87 50-146 33-125 (127) 26 PRK11563 bifunctional aldehyde 97.4 0.0016 4.1E-08 41.4 9.2 117 24-149 531-674 (676) 27 COG2050 PaaI HGG motif-contain 97.3 0.014 3.5E-07 35.8 13.0 120 22-153 16-140 (141) 28 cd03448 HDE_HSD HDE_HSD The R 97.3 0.004 1E-07 39.0 9.9 80 56-146 40-119 (122) 29 COG2030 MaoC Acyl dehydratase 97.3 0.0051 1.3E-07 38.4 10.0 60 92-151 92-156 (159) 30 pfam01575 MaoC_dehydratas MaoC 97.2 0.0018 4.6E-08 41.1 6.9 63 56-126 46-108 (123) 31 COG1607 Acyl-CoA hydrolase [Li 96.9 0.029 7.5E-07 33.9 11.3 87 63-154 31-123 (157) 32 COG0824 FcbC Predicted thioest 96.9 0.017 4.2E-07 35.4 10.0 66 92-157 55-120 (137) 33 cd03455 SAV4209 SAV4209 is a S 96.9 0.017 4.2E-07 35.3 9.8 90 48-147 30-122 (123) 34 KOG3328 consensus 96.8 0.023 5.7E-07 34.6 10.0 111 28-152 21-142 (148) 35 PRK10800 acyl-CoA thioester hy 96.8 0.011 2.8E-07 36.4 8.3 63 95-157 55-118 (130) 36 PRK13693 (3R)-hydroxyacyl-ACP 96.7 0.054 1.4E-06 32.3 11.5 86 47-141 40-132 (142) 37 TIGR02286 PaaD phenylacetic ac 96.7 0.023 5.8E-07 34.5 9.2 99 31-142 7-107 (115) 38 cd03443 PaaI_thioesterase PaaI 96.6 0.06 1.5E-06 32.0 13.5 108 31-149 5-113 (113) 39 cd00586 4HBT 4-hydroxybenzoyl- 96.6 0.038 9.7E-07 33.2 9.7 61 91-151 49-110 (110) 40 PRK13692 (3R)-hydroxyacyl-ACP 96.3 0.09 2.3E-06 31.0 11.2 60 94-153 84-148 (159) 41 pfam03061 4HBT Thioesterase su 96.2 0.057 1.4E-06 32.1 8.9 45 95-139 32-76 (79) 42 COG5496 Predicted thioesterase 96.1 0.12 3E-06 30.2 10.9 85 69-156 36-120 (130) 43 cd03450 NodN NodN (nodulation 95.5 0.22 5.6E-06 28.6 10.1 86 60-149 55-147 (149) 44 PRK13691 (3R)-hydroxyacyl-ACP 95.4 0.23 5.9E-06 28.5 10.1 58 95-152 85-147 (166) 45 smart00826 PKS_DH PKS_DH. 95.1 0.27 7E-06 28.0 10.5 99 41-149 16-123 (167) 46 PRK11688 hypothetical protein; 94.7 0.37 9.4E-06 27.3 12.6 129 15-149 12-153 (154) 47 TIGR02799 thio_ybgC tol-pal sy 94.6 0.38 9.6E-06 27.2 8.9 65 90-154 48-114 (127) 48 PRK10526 acyl-CoA thioesterase 94.5 0.4 1E-05 27.0 9.7 49 102-150 63-111 (286) 49 TIGR02278 PaaN-DH phenylacetic 94.3 0.11 2.7E-06 30.5 5.2 104 33-145 547-684 (690) 50 PRK10254 hypothetical protein; 94.1 0.49 1.2E-05 26.5 12.8 109 28-150 24-136 (137) 51 cd03445 Thioesterase_II_repeat 93.0 0.75 1.9E-05 25.4 10.7 77 66-148 17-93 (94) 52 COG1946 TesB Acyl-CoA thioeste 92.7 0.37 9.4E-06 27.3 5.7 53 93-145 224-278 (289) 53 cd00556 Thioesterase_II Thioes 92.2 0.96 2.4E-05 24.8 8.0 50 99-148 48-98 (99) 54 PRK10293 hypothetical protein; 92.0 1 2.6E-05 24.6 12.8 108 28-150 24-136 (136) 55 TIGR00051 TIGR00051 conserved 90.5 1.4 3.7E-05 23.7 7.3 65 92-156 50-116 (121) 56 KOG3016 consensus 90.3 1.5 3.9E-05 23.6 9.3 84 57-150 34-117 (294) 57 COG4109 Predicted transcriptio 90.0 0.63 1.6E-05 25.9 4.7 51 100-150 381-431 (432) 58 KOG1206 consensus 88.8 0.052 1.3E-06 32.4 -1.6 82 20-116 147-236 (272) 59 pfam02551 Acyl_CoA_thio Acyl-C 86.5 2.7 6.9E-05 22.1 7.6 114 28-143 7-128 (133) 60 pfam03756 AfsA A-factor biosyn 81.6 3.9 1E-04 21.1 5.2 43 40-84 21-63 (74) 61 KOG2763 consensus 80.5 4.8 0.00012 20.6 7.3 72 63-139 217-290 (357) 62 smart00675 DM11 Domains in hyp 78.7 5.5 0.00014 20.2 5.3 72 50-131 9-80 (164) 63 PRK07531 bifunctional 3-hydrox 78.1 5.7 0.00015 20.1 5.3 68 92-159 388-456 (489) 64 cd03444 Thioesterase_II_repeat 76.8 6.2 0.00016 19.9 8.6 53 92-144 45-99 (104) 65 cd03706 mtEFTU_III Domain III 66.4 11 0.00028 18.4 4.5 42 105-146 50-91 (93) 66 TIGR00979 fumC_II fumarate hyd 65.8 3.8 9.7E-05 21.2 1.8 37 39-82 301-340 (459) 67 cd03707 EFTU_III Domain III of 64.5 11 0.00028 18.4 4.0 40 105-144 50-89 (90) 68 PRK01777 hypothetical protein; 59.0 11 0.00027 18.5 3.1 52 65-119 25-77 (95) 69 pfam10989 DUF2808 Protein of u 55.9 17 0.00043 17.3 3.8 40 100-139 88-129 (142) 70 pfam03658 UPF0125 Uncharacteri 54.0 15 0.00038 17.6 3.1 52 65-119 22-74 (83) 71 pfam10862 DUF2662 Protein of u 45.7 24 0.00062 16.3 4.6 123 20-148 4-156 (157) 72 TIGR02480 fliN flagellar motor 45.2 25 0.00063 16.3 3.4 16 131-146 45-60 (77) 73 COG0114 FumC Fumarase [Energy 42.0 18 0.00045 17.2 2.0 37 40-83 303-342 (462) 74 PRK09376 rho transcription ter 38.8 31 0.0008 15.7 4.8 112 2-118 24-151 (416) 75 pfam05179 CDC73 RNA pol II acc 38.0 22 0.00055 16.7 1.9 55 24-86 175-231 (273) 76 TIGR00369 unchar_dom_1 unchara 36.7 34 0.00086 15.5 5.7 54 94-147 66-120 (122) 77 TIGR02903 spore_lon_C ATP-depe 35.8 8.7 0.00022 19.0 -0.4 41 25-68 268-313 (616) 78 pfam01643 Acyl-ACP_TE Acyl-ACP 32.6 39 0.001 15.1 8.9 45 101-145 197-243 (247) 79 PRK12273 aspA aspartate ammoni 31.1 32 0.00083 15.6 1.9 24 59-82 318-344 (471) 80 PRK13325 bifunctional biotin-- 29.3 45 0.0011 14.7 3.1 21 56-76 469-490 (592) 81 cd01533 4RHOD_Repeat_2 Member 28.3 47 0.0012 14.6 3.5 45 9-69 10-54 (109) 82 KOG1317 consensus 27.2 33 0.00084 15.6 1.4 36 40-82 328-366 (487) 83 COG5105 MIH1 Mitotic inducer, 25.0 41 0.001 15.0 1.5 20 19-38 332-351 (427) 84 TIGR02303 HpaG-C-term 4-hydrox 22.9 49 0.0013 14.5 1.6 16 105-120 224-239 (249) 85 PRK12425 fumarate hydratase; P 22.4 49 0.0012 14.5 1.5 33 41-80 303-338 (464) 86 pfam08865 DUF1830 Domain of un 22.1 61 0.0016 13.9 3.7 27 92-118 12-38 (68) 87 cd01519 RHOD_HSP67B2 Member of 20.7 65 0.0017 13.8 2.4 41 13-69 3-43 (106) No 1 >TIGR01750 fabZ beta-hydroxyacyl-(acyl-carrier-protein) dehydratase FabZ; InterPro: IPR010084 Fatty acids biosynthesis occurs by two distinct pathways: in fungi, mammals and mycobacteria, type I or associative fatty-acid biosynthesis (type I FAS) is accomplished by multifunctional proteins in which distinct domains catalyse specific reactions; in plants and most bacteria, type II or dissociative fatty-acid biosynthesis (type II FAS) is accomplished by distinct enzymes . FabZ is the primary dehydratase involved in fatty-acid elongation in type II FAS. Both FabZ and FabA catalyse the dehydration of beta-hydroxyacyl acyl carrier protein (ACP) to trans 2-enoyl ACP. However, FabZ and FabA display subtle differences in substrate specificities, whereby FabA is most effective on acyl ACPs of 9-11 carbon atoms in length, while FabZ is less specific. Unlike FabA, FabZ does not function as an isomerase and cannot initiate unsaturated fatty acid biosynthesis. However, only FabZ can act during the elongation of unsaturated fatty acid chains. ; GO: 0016836 hydro-lyase activity, 0006633 fatty acid biosynthetic process, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=374.85 Aligned_cols=139 Identities=48% Similarity=0.821 Sum_probs=132.9 Q ss_pred HHHHHHHHHCCCCCCEEEEEEEEEEEC-CCEEEEEEEECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCC-- Q ss_conf 418989984799975054789999846-97499999826888313134799950225669999863463454432997-- Q gi|254780770|r 11 KDIVELMRFLPHRYPFLLVDKVVNIQR-DESAIGIKNVTFNEPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIHNGF-- 87 (161) Q Consensus 11 ~~~~eI~~~lPhr~Pfl~ID~i~~~~~-g~~~~g~k~v~~~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~-- 87 (161) +||++|+++|||||||||||||+++++ +++++|+||||.|||||+||||+.|||||||++|||||+||+++..+.+. T Consensus 1 ~~I~~I~~~LPHRYPfLLVDRi~e~~~~~~~ivA~KNVTINEPfF~GHFP~~PiMPGVLI~EAlAQaaGvl~~~~L~~~~ 80 (142) T TIGR01750 1 LDIQEIMELLPHRYPFLLVDRILELEPEGKRIVAIKNVTINEPFFQGHFPEKPIMPGVLIVEALAQAAGVLAILSLGGKK 80 (142) T ss_pred CCHHHHHHHCCCCCCCEEEEEEEEEECCCCEEEEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 97589898446888710132688872499789997116774898889885029597078788889888877420579877 Q ss_pred CCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEEE Q ss_conf 76302677523641360022128889999999972077799999999999999989999998 Q gi|254780770|r 88 DQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVENTVVAEAEICAMV 149 (161) Q Consensus 88 ~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~~~~va~a~l~~~i 149 (161) .+++..||++|+++|||+||.|||+|.+++|+.+.+++++++.|+|+||||+||+|||+||+ T Consensus 81 ~~g~~~~F~gid~akFr~pV~PGDqL~l~~e~~~~~~~i~k~~~~A~VDG~vaAeaei~~a~ 142 (142) T TIGR01750 81 GKGKLVYFAGIDKAKFRKPVVPGDQLILEVEFLKKRRKIGKFKGEAEVDGKVAAEAEITFAI 142 (142) T ss_pred CCCEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEEC T ss_conf 88628999886273035664678224787885534124015788998899899778888619 No 2 >PRK13188 bifunctional UDP-3-O-[3-hydroxymyristoyl] N-acetylglucosamine deacetylase/(3R)-hydroxymyristoyl-[acyl-carrier-protein] dehydratase; Reviewed Probab=100.00 E-value=0 Score=332.66 Aligned_cols=147 Identities=46% Similarity=0.753 Sum_probs=137.8 Q ss_pred CCHHHHHHHHHHCCCCCCEEEEEEEEEEECCCEEEEEEEECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 28641898998479997505478999984697499999826888313134799950225669999863463454432997 Q gi|254780770|r 8 LDAKDIVELMRFLPHRYPFLLVDKVVNIQRDESAIGIKNVTFNEPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIHNGF 87 (161) Q Consensus 8 l~~~~~~eI~~~lPhr~Pfl~ID~i~~~~~g~~~~g~k~v~~~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~ 87 (161) =..+|++||+++|||||||||||+|++++||++++|+|+||.|||||+||||++|+|||||++|+|||+|+++++.+.+. T Consensus 318 ~~~~d~~~i~~~lphR~Pfl~iD~v~~~~~~~~~~a~Knvt~nE~ff~GHFP~~pvmPGvl~iEamaQ~~~~l~l~~~~~ 397 (465) T PRK13188 318 EPIMDINRIMELLPHRYPFLLVDKIIEFKLDEKIVGIKNVTMNEPFFQGHFPGNPVMPGVLQIEAMAQTGGILVLNSVDN 397 (465) T ss_pred CCCCCHHHHHHHCCCCCCEEEEEEEEEECCCCEEEEEEECCCCCCHHCCCCCCCCCCCHHHHHHHHHHHHHHHHEECCCC T ss_conf 86068999997588989768999988506997899999357788221357899998971899999999988873204777 Q ss_pred CCCEEEEEEECCCCEEEEEEEECCEEEEEEEEE-EEECCEEEEEEEEEECCEEEEEEEEEEEEECCCC Q ss_conf 763026775236413600221288899999999-7207779999999999999998999999844520 Q gi|254780770|r 88 DQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKV-RNRVDLWKFQCCAKVENTVVAEAEICAMVMHEKK 154 (161) Q Consensus 88 ~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~-~~~~~~~~~~~~~~v~~~~va~a~l~~~i~~~~~ 154 (161) .++...||++++++||+++|.|||+|.+++++. ..++++++++|+++||+++||+|+++++++++++ T Consensus 398 ~~~~~~y~~~i~~~kFr~~V~PGD~l~~~~~~l~~~k~~i~k~~g~a~V~~~~v~eae~~a~iv~~~~ 465 (465) T PRK13188 398 PENYSTYFMKIDKVKFRRPVVPGDTLIIEAELLTPIRRGICQMDGKAYVGGELVCEAELMAQIVKKKE 465 (465) T ss_pred CCCEEEEEEECCCEEECCCCCCCCEEEEEEEEEECCCCCEEEEEEEEEECCEEEEEEEEEEEEEECCC T ss_conf 78807999960311886886999899999999000479779999999999999999999999983589 No 3 >PRK00006 fabZ (3R)-hydroxymyristoyl-ACP dehydratase; Reviewed Probab=100.00 E-value=0 Score=325.19 Aligned_cols=143 Identities=50% Similarity=0.830 Sum_probs=135.2 Q ss_pred HHHHHHHHHHCCCCCCEEEEEEEEEEECCCEEEEEEEECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCC-C Q ss_conf 641898998479997505478999984697499999826888313134799950225669999863463454432997-7 Q gi|254780770|r 10 AKDIVELMRFLPHRYPFLLVDKVVNIQRDESAIGIKNVTFNEPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIHNGF-D 88 (161) Q Consensus 10 ~~~~~eI~~~lPhr~Pfl~ID~i~~~~~g~~~~g~k~v~~~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~-~ 88 (161) -+|++||+++||||+||||||+|++++||++++|+|+|+.|||||+||||++|+|||||++|||||+|++++....+. . T Consensus 6 ~l~~~~I~~~lPhR~PfL~iD~i~~~~~g~~~~~~k~vs~nE~~f~GHFP~~PimPGvLliEamaQ~~~~l~~~~~~~~~ 85 (149) T PRK00006 6 MLDIEEILELLPHRYPFLLVDRVLELEPGKSIVAIKNVTINEPFFQGHFPGRPVMPGVLIVEAMAQAGGVLAFKSLGKDA 85 (149) T ss_pred EECHHHHHHHCCCCCCEEEEEEEEEECCCCEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 98899999868999986999999997599889999964389771577588999286489999999999998833347677 Q ss_pred CCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEEEECC Q ss_conf 6302677523641360022128889999999972077799999999999999989999998445 Q gi|254780770|r 89 QYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVENTVVAEAEICAMVMHE 152 (161) Q Consensus 89 ~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~~~~va~a~l~~~i~~~ 152 (161) +++.++|++++++||+++|.|||+|++++++.+.++++++++|+++|||+++|+|+++++++|+ T Consensus 86 ~~~~~~l~~i~~vkF~~~V~PGd~l~i~~~l~~~~~~~~~~~~~a~V~g~~v~~ae~~~~i~dk 149 (149) T PRK00006 86 KGKLVYFAGIDKARFRRPVVPGDQLILEVEFLKKRRGIGKFKGVAKVDGKLVAEAELMFAIVDK 149 (149) T ss_pred CCCEEEEEEEEEEEECCCCCCCCEEEEEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEEEECC T ss_conf 9847999961378984787999999999999996589999999999999999999999999769 No 4 >cd01288 FabZ FabZ is a 17kD beta-hydroxyacyl-acyl carrier protein (ACP) dehydratase that primarily catalyzes the dehydration of beta-hydroxyacyl-ACP to trans-2-acyl-ACP, the third step in the elongation phase of the bacterial/ plastid, type II, fatty-acid biosynthesis pathway. Probab=100.00 E-value=1.4e-45 Score=292.99 Aligned_cols=131 Identities=50% Similarity=0.785 Sum_probs=125.9 Q ss_pred CCCCCCEEEEEEEEEEECCCEEEEEEEECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECC Q ss_conf 79997505478999984697499999826888313134799950225669999863463454432997763026775236 Q gi|254780770|r 20 LPHRYPFLLVDKVVNIQRDESAIGIKNVTFNEPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSID 99 (161) Q Consensus 20 lPhr~Pfl~ID~i~~~~~g~~~~g~k~v~~~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~ 99 (161) ||||+||||||||++++||++++|+|+|+.|||||+||||++|+|||||++|+|||+|+++........+++..+|++++ T Consensus 1 lPhR~PfL~iD~I~~~~~g~~~~~~k~vs~ne~ff~gHfp~~PvmPgvl~iEamaQ~~~~l~~~~~~~~~~~~~~l~~i~ 80 (131) T cd01288 1 LPHRYPFLLVDRVLELEPGKSIVAIKNVTINEPFFQGHFPGNPIMPGVLIIEALAQAAGILGLKSLEDFEGKLVYFAGID 80 (131) T ss_pred CCCCCCEEEEEEEEEECCCCEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECC T ss_conf 96899848899999985899899999756998613575899984774899999999999997115677898089999814 Q ss_pred CCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEEEE Q ss_conf 413600221288899999999720777999999999999999899999984 Q gi|254780770|r 100 KARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVENTVVAEAEICAMVM 150 (161) Q Consensus 100 ~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~~~~va~a~l~~~i~ 150 (161) ++||+++|.|||+|++++++.+.++++++++|+++|||+++|+|++++++. T Consensus 81 ~~kF~~~V~PGd~l~i~~~~~~~~~~~~~~~~~~~v~~~~v~~ae~~~~i~ 131 (131) T cd01288 81 KARFRKPVVPGDQLILEVELLKLRRGIGKFKGKAYVDGKLVAEAELMFAIA 131 (131) T ss_pred EEEECCCCCCCCEEEEEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEEEC T ss_conf 888947889999999999999976999999999999999999999993449 No 5 >COG0764 FabA 3-hydroxymyristoyl/3-hydroxydecanoyl-(acyl carrier protein) dehydratases [Lipid metabolism] Probab=100.00 E-value=9.9e-42 Score=270.87 Aligned_cols=143 Identities=48% Similarity=0.798 Sum_probs=131.6 Q ss_pred CHHHHHHHHHHCCCCCCEEEEEEEEEE-ECCCEEEEEEEECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 864189899847999750547899998-4697499999826888313134799950225669999863463454432997 Q gi|254780770|r 9 DAKDIVELMRFLPHRYPFLLVDKVVNI-QRDESAIGIKNVTFNEPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIHNGF 87 (161) Q Consensus 9 ~~~~~~eI~~~lPhr~Pfl~ID~i~~~-~~g~~~~g~k~v~~~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~ 87 (161) ...++.+|.+.+|||+||||||||+++ ++|..++|+|+|+.|||||.||||++|+|||||++|+|||++++++.+.... T Consensus 2 ~~~~~~~i~~~lphryPfLmvDrv~~~~~~g~~i~a~k~Vt~nepfF~gHFP~~PimPGVLileamaQ~~g~~~~~~~~~ 81 (147) T COG0764 2 LAIDIGEILELLPHRYPFLLVDRVLEIDEEGKRIVAIKNVTINEPFFTGHFPGDPIMPGVLILEAMAQAAGFLLGWLLGN 81 (147) T ss_pred CCCCHHHHHHHCCCCCCHHHEEEEEEECCCCCEEEEEECCCCCCCEECCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 74488999634531378120001555406995899997558888871795999997653699999999999998334367 Q ss_pred CCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEE-CCEEEEEEEEEECCEEEEEEEEEEEEECC Q ss_conf 763026775236413600221288899999999720-77799999999999999989999998445 Q gi|254780770|r 88 DQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNR-VDLWKFQCCAKVENTVVAEAEICAMVMHE 152 (161) Q Consensus 88 ~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~-~~~~~~~~~~~v~~~~va~a~l~~~i~~~ 152 (161) . ++.+|+++++++||+++|.|||++.+++++.+.+ ++++++.+++.|||+++|+|++.++..+. T Consensus 82 ~-~~~~~~~gid~~kF~~~V~PGd~l~l~~~~~~~~~~~~~~~~~~a~Vdg~~v~~a~~~~~~~~~ 146 (147) T COG0764 82 K-GKLGYFLGIDNAKFKRPVLPGDQLELEVKLLKSRRLGIGKAKGVATVDGKVVAEAELLFAGVEK 146 (147) T ss_pred C-CCEEEEEEECCEEECCCCCCCCEEEEEEEEEEECCCCEEEEEEEEEECCEEEEEEEEEEEEEEC T ss_conf 8-6179999851333537659998799999998723224489878999899999988779998506 No 6 >pfam07977 FabA FabA-like domain. This enzyme domain has a HotDog fold. Probab=100.00 E-value=3.1e-41 Score=267.90 Aligned_cols=124 Identities=45% Similarity=0.614 Sum_probs=112.6 Q ss_pred CCCCCCEEEEEEEEEEEC-----CCEEEEEEEECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEE Q ss_conf 799975054789999846-----974999998268883131347999502256699998634634544329977630267 Q gi|254780770|r 20 LPHRYPFLLVDKVVNIQR-----DESAIGIKNVTFNEPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPY 94 (161) Q Consensus 20 lPhr~Pfl~ID~i~~~~~-----g~~~~g~k~v~~~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~ 94 (161) ||||+||||||||++++| |.+++|+|+|+.|||||+||||++|+|||||++|||||++++++... +..+++..+ T Consensus 1 lPhR~PflliDrv~~~~~~g~~~g~~i~a~k~v~~~e~~f~gHFp~~PvmPGvL~iEa~aQ~~~~~~~~~-~~~~~~~~~ 79 (133) T pfam07977 1 LPHRYPFLMIDRVTEIDPEGGKFKGYIVAEKNVTPNEWFFQGHFPGDPVMPGVLGLEAMAQLGGFYLLWL-GGGEGKGRA 79 (133) T ss_pred CCCCCCEEEEEEEEEECCCCCCCCCEEEEEEECCCCCHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHECC-CCCCCCCCE T ss_conf 9689880369999999269985673999999448997221325779997883779999999999984505-678885101 Q ss_pred EEECCCCEEEEEEEECC-EEEEEEEEEEE---ECCEEEEEEEEEECCEEEEEEE Q ss_conf 75236413600221288-89999999972---0777999999999999999899 Q gi|254780770|r 95 LMSIDKARFRKPVFPGD-RLEYHVNKVRN---RVDLWKFQCCAKVENTVVAEAE 144 (161) Q Consensus 95 l~~i~~~kF~~~V~PGd-~L~i~~~i~~~---~~~~~~~~~~~~v~~~~va~a~ 144 (161) +++++++||+++|+||| +|++++++.+. ++++++++|+++|||+++|+|+ T Consensus 80 ~~~i~~~kFr~~V~PGD~~l~~ev~i~~i~~~~~~~~~~~g~a~Vdg~~v~eae 133 (133) T pfam07977 80 RFGIDEVKFRGQVTPGDKQLRYEVEIKRIGPRRGVIGIADGRALVDGKLVYEAE 133 (133) T ss_pred ECCCEEEEEEEEECCCCEEEEEEEEEEEEECCCCCEEEEEEEEEECCEEEEEEC T ss_conf 045227899247659994999999999986168749999999999999999849 No 7 >cd00493 FabA_FabZ FabA/Z, beta-hydroxyacyl-acyl carrier protein (ACP)-dehydratases: One of several distinct enzyme types of the dissociative, type II, fatty acid synthase system (found in bacteria and plants) required to complete successive cycles of fatty acid elongation. The third step of the elongation cycle, the dehydration of beta-hydroxyacyl-ACP to trans-2-acyl-ACP, is catalyzed by FabA or FabZ. FabA is bifunctional and catalyzes an additional isomerization reaction of trans-2-acyl-ACP to cis-3-acyl-ACP, an essential reaction to unsaturated fatty acid synthesis. FabZ is the primary dehydratase that participates in the elongation cycles of saturated as well as unsaturated fatty acid biosynthesis, whereas FabA is more active in the dehydration of beta-hydroxydecanoyl-ACP. The FabA structure is homodimeric with two independent active sites located at the dimer interface. Probab=100.00 E-value=1.4e-38 Score=251.88 Aligned_cols=128 Identities=48% Similarity=0.730 Sum_probs=119.8 Q ss_pred CCCCCEEEEEEEEEEECCCEEEEEEEECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCC--CCCEEEEEEEC Q ss_conf 9997505478999984697499999826888313134799950225669999863463454432997--76302677523 Q gi|254780770|r 21 PHRYPFLLVDKVVNIQRDESAIGIKNVTFNEPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIHNGF--DQYAPPYLMSI 98 (161) Q Consensus 21 Phr~Pfl~ID~i~~~~~g~~~~g~k~v~~~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~--~~~~~~~l~~i 98 (161) |||+||+|||+|++++||++++|+|+|+.|||||+||||++|+|||||++|+|||+++++..+.... ..++.++|+++ T Consensus 1 Phr~p~lliD~v~~~~~~~~~~~~k~v~~~~~ff~gHfp~~pv~Pg~l~iE~~aQ~~~~~~~~~~~~~~~~~~~~~l~~~ 80 (131) T cd00493 1 PHRYPMLLVDRVLEIDPGGRIVAEKNVTPNEPFFQGHFPGDPVMPGVLGIEAMAQAAAALAGLLGLGKGNPPRLGYLAGV 80 (131) T ss_pred CCCCCEEEEEEEEEECCCCEEEEEEEECCCCCEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEE T ss_conf 98997688999999869998999999279977477205799966603799999999999974113344778835899971 Q ss_pred CCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEE Q ss_conf 64136002212888999999997207779999999999999998999999 Q gi|254780770|r 99 DKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVENTVVAEAEICAM 148 (161) Q Consensus 99 ~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~~~~va~a~l~~~ 148 (161) +++||+++|.|||+|++++++.+.++++++++|++++||+++|++++++. T Consensus 81 ~~~kf~~~v~Pgd~l~i~v~~~~~~~~~~~~~~~~~v~~~~v~~~~l~~~ 130 (131) T cd00493 81 RKVKFRGPVLPGDTLTLEVELLKVRRGLGKFDGRAYVDGKLVAEAELMAA 130 (131) T ss_pred EEEEEECEECCCCEEEEEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEE T ss_conf 38899340479989999999999889999999999999999999999970 No 8 >cd01287 FabA FabA, beta-hydroxydecanoyl-acyl carrier protein (ACP)-dehydratase: Bacterial protein of the type II, fatty acid synthase system that binds ACP and catalyzes both dehydration and isomerization reactions, apparently in the same active site. The FabA structure is a homodimer with two independent active sites located at the dimer interface. Each active site is tunnel-shaped and completely inaccessible to solvent. No metal ions or cofactors are required for ligand binding or catalysis. Probab=99.96 E-value=3.2e-28 Score=189.73 Aligned_cols=130 Identities=23% Similarity=0.252 Sum_probs=106.3 Q ss_pred CCCCCCEEEEEEEEEEECCC------EEEEEEEECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCC---CC-C Q ss_conf 79997505478999984697------49999982688831313479995022566999986346345443299---77-6 Q gi|254780770|r 20 LPHRYPFLLVDKVVNIQRDE------SAIGIKNVTFNEPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIHNG---FD-Q 89 (161) Q Consensus 20 lPhr~Pfl~ID~i~~~~~g~------~~~g~k~v~~~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~---~~-~ 89 (161) ||- +||||||||++++|+. +++|+|+|++|+|||++|||++|+|||+|++|+|+|+.++++.+... .. . T Consensus 3 Lp~-~~~lm~Drv~~i~~~GG~~~~G~i~ae~~i~~~~wff~~HF~~dpvmPG~L~~ea~~Q~l~~yl~~~G~~~~~~~~ 81 (150) T cd01287 3 LPG-GQLLMLDRVTEIDPGGGTFGLGYLRAEKDIDPDDWFFPCHFHGDPVMPGSLGLEAMIQLLQFYLIWLGLGTGVDNP 81 (150) T ss_pred CCC-CCEEEEEEEEEEECCCCCCCCEEEEEEEECCCCCCEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 789-9776569999991799854540999999158785711478998991455369999999999999867765445687 Q ss_pred CEEEEEEECCCCEEEEEEEECC-EEEEEEEEEEEE----CCEEEEEEEEEECCEEEEEEE-EEEEEE Q ss_conf 3026775236413600221288-899999999720----777999999999999999899-999984 Q gi|254780770|r 90 YAPPYLMSIDKARFRKPVFPGD-RLEYHVNKVRNR----VDLWKFQCCAKVENTVVAEAE-ICAMVM 150 (161) Q Consensus 90 ~~~~~l~~i~~~kF~~~V~PGd-~L~i~~~i~~~~----~~~~~~~~~~~v~~~~va~a~-l~~~i~ 150 (161) ...+...+..++||+++|+|++ ++++++++++.. ...+.+++.+++||+.|.++. +.+-+. T Consensus 82 ~f~~~~~~~~~~kfRGQV~P~~k~v~yev~I~~v~~~~~~~~~iADg~l~vDGk~Iy~~~~l~v~l~ 148 (150) T cd01287 82 RFQGAPGGPGEWKYRGQITPHNKKVTYEVHIKEVGRDGPRPYIIADASLWVDGLRIYEAKDIAVRLV 148 (150) T ss_pred CCEEECCCEEEEEEEEEECCCCCEEEEEEEEEEEEECCCCEEEEEEEEEEECCEEEEEECCCEEEEE T ss_conf 7420338502799978866799389999999999804995899999999989989999808899987 No 9 >PRK05174 3-hydroxydecanoyl-(acyl carrier protein) dehydratase; Validated Probab=99.94 E-value=1.2e-25 Score=174.23 Aligned_cols=131 Identities=21% Similarity=0.321 Sum_probs=104.7 Q ss_pred HCCCCCCEEEEEEEEEEECCC------EEEEEEEECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEE Q ss_conf 479997505478999984697------49999982688831313479995022566999986346345443299776302 Q gi|254780770|r 19 FLPHRYPFLLVDKVVNIQRDE------SAIGIKNVTFNEPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAP 92 (161) Q Consensus 19 ~lPhr~Pfl~ID~i~~~~~g~------~~~g~k~v~~~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~ 92 (161) -||- +||||+|||++++|.- +++|.+.|++|+|||++||+++|||||+|.+|+|.|+.++++.+......++ T Consensus 28 ~Lp~-~~mlM~DRI~~i~~~GG~~g~G~i~ae~dI~Pd~WFF~cHF~~DPVMPGsLGlea~~Qll~f~l~w~g~~g~gr- 105 (172) T PRK05174 28 QLPA-PPMLMMDRITEISETGGEFGKGYIVAELDINPDLWFFGCHFIGDPVMPGCLGLDAMWQLVGFYLGWLGGPGKGR- 105 (172) T ss_pred CCCC-CCEEEEEEEEEECCCCCCCCCCEEEEEEEECHHHCEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCE- T ss_conf 8999-76074886716816898345731899985081332472567989837762246789999988987626988633- Q ss_pred EEEEECCCCEEEEEEEECC-EEEEEEEEEEEECC---EEEEEEEEEECCEEEEEEE-EEEEEECCC Q ss_conf 6775236413600221288-89999999972077---7999999999999999899-999984452 Q gi|254780770|r 93 PYLMSIDKARFRKPVFPGD-RLEYHVNKVRNRVD---LWKFQCCAKVENTVVAEAE-ICAMVMHEK 153 (161) Q Consensus 93 ~~l~~i~~~kF~~~V~PGd-~L~i~~~i~~~~~~---~~~~~~~~~v~~~~va~a~-l~~~i~~~~ 153 (161) ..++.++||+++|+|.+ ++++++.+++...+ ++..++.+++||+.|.++. +..-+.++. T Consensus 106 --alg~~evKfrGQV~P~~k~v~y~i~ik~v~~~~~~~~iADg~l~vDg~~IY~a~dl~VgLf~~~ 169 (172) T PRK05174 106 --ALGVGEVKFTGQVLPTAKKVTYEIDIKRVIRRKLVMGIADGRVLVDGEEIYTAKDLKVGLFKDT 169 (172) T ss_pred --ECCCCCEEEEEEECCCCCEEEEEEEEEEEECCCCEEEEEEEEEEECCEEEEEEECCEEEEECCC T ss_conf --2226830880437338977999999999860882699998999989989999948799975167 No 10 >cd01289 FabA_like Domain of unknown function, appears to be related to a diverse group of beta-hydroxydecanoyl ACP dehydratases (FabA) and beta-hydroxyacyl ACP dehydratases (FabZ). This group appears to lack the conserved active site histidine of FabA and FabZ. Probab=99.89 E-value=5.8e-22 Score=152.15 Aligned_cols=133 Identities=20% Similarity=0.226 Sum_probs=115.8 Q ss_pred HHHHHCCCCCCEEEEEEEEEEECCCEEEEEEEECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHC--CCCCCCEE Q ss_conf 8998479997505478999984697499999826888313134799950225669999863463454432--99776302 Q gi|254780770|r 15 ELMRFLPHRYPFLLVDKVVNIQRDESAIGIKNVTFNEPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIH--NGFDQYAP 92 (161) Q Consensus 15 eI~~~lPhr~Pfl~ID~i~~~~~g~~~~g~k~v~~~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~--~~~~~~~~ 92 (161) -|.++||||+||+|||+++++++ .++++..+++.+.+||. +.+..+|++..||.|||+.+++.++. ......+. T Consensus 2 ~I~~llPH~~pM~LlD~v~~~~~-~~i~~~~~i~~~~~f~~---~~~g~lpa~~~iE~mAQa~Aa~~g~~a~~~g~~~~~ 77 (138) T cd01289 2 WIAALIPHDGPMCLLDRVISWDD-DSIHCRATVHPDPLFPL---RAHGRLPAWVGIEYMAQAIAAHGGLLARQQGNPPRP 77 (138) T ss_pred CHHHHCCCCCCEEEEEEEEEECC-CEEEEEEEECCCCCCCC---CCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 56660999998587777647869-87999999478877735---568952257999999999999998888865999885 Q ss_pred EEEEECCCCEEEEEEEE-CCEEEEEE-EEEEEECCEEEEEEEEEECCEEEEEEEEEEEEEC Q ss_conf 67752364136002212-88899999-9997207779999999999999998999999844 Q gi|254780770|r 93 PYLMSIDKARFRKPVFP-GDRLEYHV-NKVRNRVDLWKFQCCAKVENTVVAEAEICAMVMH 151 (161) Q Consensus 93 ~~l~~i~~~kF~~~V~P-Gd~L~i~~-~i~~~~~~~~~~~~~~~v~~~~va~a~l~~~i~~ 151 (161) +||.++|+++.+.+.+| |+++.+.+ +..+..++++.|+|++..+++.+++|.++.+.+. T Consensus 78 GfLlg~Rk~~~~~~~f~~g~~l~i~~~~~l~~~~g~~vf~c~i~~~~~~la~g~l~V~~P~ 138 (138) T cd01289 78 GFLLGSRKYEAHVDRFDLGSTLLIVVAELLQGDSGLGVFECTIEDQGGVLASGRLNVYQPA 138 (138) T ss_pred EEEEEEEEEEEECCEECCCCCCEEHHHHHEECCCCEEEEEEEEEECCEEEEEEEEEEECCC T ss_conf 7999977889843421899840613354356179759999999989999999999987489 No 11 >TIGR01749 fabA beta-hydroxyacyl-(acyl-carrier-protein) dehydratase FabA; InterPro: IPR010083 This enzyme, FabA, shows overlapping substrate specificity with FabZ with regard to chain length in fatty acid biosynthesis. It is commonly designated 3-hydroxydecanoyl-[acyl-carrier-protein] dehydratase as if it were specific for that chain length, but its specificity is broader; it is active even in the initiation of fatty acid biosynthesis. This enzyme can also isomerise trans-2-decenoyl-ACP to cis-3-decenoyl-ACP to bypass reduction by FabI and instead allow biosynthesis of unsaturated fatty acids. FabA cannot elongate unsaturated fatty acids.; GO: 0008693 3-hydroxydecanoyl-[acyl-carrier-protein] dehydratase activity, 0006633 fatty acid biosynthetic process, 0005737 cytoplasm. Probab=99.70 E-value=4.1e-17 Score=123.04 Aligned_cols=132 Identities=24% Similarity=0.352 Sum_probs=104.1 Q ss_pred HHCCCCCCEEEEEEEEEEEC--CC----EEEEEEEECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCE Q ss_conf 84799975054789999846--97----4999998268883131347999502256699998634634544329977630 Q gi|254780770|r 18 RFLPHRYPFLLVDKVVNIQR--DE----SAIGIKNVTFNEPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYA 91 (161) Q Consensus 18 ~~lPhr~Pfl~ID~i~~~~~--g~----~~~g~k~v~~~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~ 91 (161) .-||. +||||+|||+++.. |. .+.+..+|++|.|||.+||.++|+|||+|.+++|.|+.++++.+.....+++ T Consensus 24 ~~lP~-~~mlm~~r~~~~~~~GG~~~kG~~~ae~d~~Pd~Wff~Chf~G~PvmPGClGld~mWql~Gf~l~W~G~~G~G~ 102 (169) T TIGR01749 24 AQLPA-PPMLMIDRIVEISETGGKYGKGYVEAELDIRPDLWFFDCHFIGDPVMPGCLGLDAMWQLVGFFLGWLGEPGRGR 102 (169) T ss_pred CCCCC-CCHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHEEECCCCCCCE T ss_conf 77886-30345555555531488435645887431277731342143158737620003567764321102117897620 Q ss_pred EEEEEECCCCEEEEEEEEC-CEEEEEEEEEEEE---CCEEEEEEEEEECCEEEEEE-EEEEEEECCC Q ss_conf 2677523641360022128-8899999999720---77799999999999999989-9999984452 Q gi|254780770|r 92 PPYLMSIDKARFRKPVFPG-DRLEYHVNKVRNR---VDLWKFQCCAKVENTVVAEA-EICAMVMHEK 153 (161) Q Consensus 92 ~~~l~~i~~~kF~~~V~PG-d~L~i~~~i~~~~---~~~~~~~~~~~v~~~~va~a-~l~~~i~~~~ 153 (161) . .++..+||.+.|.|. ..+++.+++.+.- -=++..++.+.+||+.+.++ .+..-+..+. T Consensus 103 a---l~~Gevkf~G~v~P~~~~~~y~i~~~r~~~~~lv~~~~d~~~~~dG~~iy~~~~l~vGl~~~~ 166 (169) T TIGR01749 103 A---LGVGEVKFTGQVLPTAKKVTYKIDLKRVINRKLVLGIADGELKVDGRLIYEASDLRVGLFTET 166 (169) T ss_pred E---EECCEEEEEEEECCCEEEEEEEEEEEEEEEEEEEEEECCCEEEECCEEEEEECCCEEEEECCC T ss_conf 5---411204775553442157888652212210001243227457745212332045156541322 No 12 >COG4706 Predicted 3-hydroxylacyl-(acyl carrier protein) dehydratase [Lipid metabolism] Probab=99.49 E-value=6.1e-14 Score=103.99 Aligned_cols=122 Identities=23% Similarity=0.350 Sum_probs=100.4 Q ss_pred HHHHHCCCCCCEEEEEEEEEEECCCEEEEEEEECCC-CHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHC-CCCCCC-- Q ss_conf 899847999750547899998469749999982688-8313134799950225669999863463454432-997763-- Q gi|254780770|r 15 ELMRFLPHRYPFLLVDKVVNIQRDESAIGIKNVTFN-EPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIH-NGFDQY-- 90 (161) Q Consensus 15 eI~~~lPhr~Pfl~ID~i~~~~~g~~~~g~k~v~~~-e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~-~~~~~~-- 90 (161) .|..+|||+.||+++|+|+.++ +.++.+..+|+++ .||+.-| -.||++..||-|||+-++..++. ....++ T Consensus 11 ~iaa~LPHsg~MlLLd~Vvtwd-Dd~~rc~atvsp~~a~~l~~d----g~Lpa~~gIElmAQAv~vh~g~l~~rq~~ps~ 85 (161) T COG4706 11 SIAAYLPHSGPMLLLDDVVTWD-DDSARCRATVSPSGAPFLDPD----GNLPAWFGIELMAQAVGVHSGWLRHRQGKPSI 85 (161) T ss_pred CHHHHCCCCCCEEEEEEEEEEC-CCEEEEEEEECCCCCCCCCCC----CCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 9888488878567533242334-883788767678888763767----77545556999999999989888750279765 Q ss_pred EEEEEEECCCCEEEEEEEE-CCEEEEEE-EEEEEECCEEEEEEEEEECCEEEE Q ss_conf 0267752364136002212-88899999-999720777999999999999999 Q gi|254780770|r 91 APPYLMSIDKARFRKPVFP-GDRLEYHV-NKVRNRVDLWKFQCCAKVENTVVA 141 (161) Q Consensus 91 ~~~~l~~i~~~kF~~~V~P-Gd~L~i~~-~i~~~~~~~~~~~~~~~v~~~~va 141 (161) +.++|.++++..-+.+.+| |+.|.+.+ +..+..++++.|+|+++.|++... T Consensus 86 r~GfLlg~Rkleaha~~l~~~q~ll~t~~e~iqddgg~g~f~csir~d~~~~~ 138 (161) T COG4706 86 RLGFLLGARKLEAHAGILPAGQTLLITVKELIQDDGGFGSFECSIRNDGEATG 138 (161) T ss_pred CCEEEEEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCEEEEEEECCCCHHHC T ss_conf 41246300011221354577653579999985157884179998716862100 No 13 >PRK04424 fatty acid biosynthesis transcriptional regulator; Provisional Probab=98.91 E-value=7.6e-08 Score=67.41 Aligned_cols=114 Identities=23% Similarity=0.274 Sum_probs=90.5 Q ss_pred HHCCCCCCEEEEEEEEEEECCCEEEEEEEECCCCHHH-----CCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEE Q ss_conf 8479997505478999984697499999826888313-----13479995022566999986346345443299776302 Q gi|254780770|r 18 RFLPHRYPFLLVDKVVNIQRDESAIGIKNVTFNEPHF-----MGHFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAP 92 (161) Q Consensus 18 ~~lPhr~Pfl~ID~i~~~~~g~~~~g~k~v~~~e~ff-----~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~ 92 (161) +-||+.. +|-+++++++|+++..+..++.+.-|= .||+ -.||+....+..- +.. T Consensus 66 rsl~~~E---viGelidle~~~~aiSil~it~emvf~~t~iargh~-------------lfAQAnSLAvAvi-----~~~ 124 (185) T PRK04424 66 KSLPLEE---VVGELIDLELGRSAISILEITEEMVFSKTQIARGHH-------------LFAQANSLAVAVI-----DAE 124 (185) T ss_pred CCCCHHH---EEEEEEEECCCCEEEEEEECCHHHEECCCCEEEHHE-------------EHHHHHHHHHHHC-----CCC T ss_conf 4677134---155626632796088889856667433387321110-------------0777763688613-----995 Q ss_pred EEEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEEEECC Q ss_conf 677523641360022128889999999972077799999999999999989999998445 Q gi|254780770|r 93 PYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVENTVVAEAEICAMVMHE 152 (161) Q Consensus 93 ~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~~~~va~a~l~~~i~~~ 152 (161) ..|.++.++||++||..||++...+++...+++.+.+.++++|+++.|.+|+|..+-.++ T Consensus 125 ~~lt~~A~VrF~rPV~~Gd~vvAkA~V~~~~gnk~~V~V~s~V~~e~VF~G~F~~~~~~~ 184 (185) T PRK04424 125 LALTGKANIRFKRPVKLGERVVAKAEVVRVTDNKYIVEVKSYVKDELVFKGKFIMYASSE 184 (185) T ss_pred EEEEEEEEEEECCCCCCCCEEEEEEEEEEECCCEEEEEEEEEECCEEEEEEEEEEEEECC T ss_conf 499998789993105669899999999997299999999999899999999999999178 No 14 >cd03451 FkbR2 FkbR2 is a Streptomyces hygroscopicus protein with a hot dog fold that belongs to a conserved family of proteins found in prokaryotes and archaea but not in eukaryotes. FkbR2 has sequence similarity to (R)-specific enoyl-CoA hydratase, the peroxisomal Hydratase-Dehydrogenase-Epimerase (HDE) protein, and the fatty acid synthase beta subunit. The function of FkbR2 is unknown. Probab=98.37 E-value=7.6e-06 Score=55.39 Aligned_cols=86 Identities=20% Similarity=0.277 Sum_probs=58.3 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEE-------CCEEEEEEE Q ss_conf 9950225669999863463454432997763026775236413600221288899999999720-------777999999 Q gi|254780770|r 60 GRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNR-------VDLWKFQCC 132 (161) Q Consensus 60 ~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~-------~~~~~~~~~ 132 (161) +.++.+|.+.+ ++ ...+.............+.++++|.+||+|||+|++++++...+ .++..++++ T Consensus 52 g~~ia~G~~~~-s~------~~~l~~~~~~~~~~a~lg~~~~rf~~PV~~GDtl~~~~eV~~~r~~~sr~~~Giv~~~~~ 124 (146) T cd03451 52 GRRLVNSLFTL-SL------ALGLSVNDTSLTAVANLGYDEVRFPAPVFHGDTLYAESEVLSKRESKSRPDAGIVTVRTV 124 (146) T ss_pred CCCCCCHHHHH-HH------HHHEEECCCCCCEEEEEEEEEEEEECCCCCCCEEEEEEEEEEEEECCCCCCCEEEEEEEE T ss_conf 87321124555-54------441353258857046578403599377769999999999999874588998579999999 Q ss_pred EEE-CCEEEEEEEEEEEEECC Q ss_conf 999-99999989999998445 Q gi|254780770|r 133 AKV-ENTVVAEAEICAMVMHE 152 (161) Q Consensus 133 ~~v-~~~~va~a~l~~~i~~~ 152 (161) +.. +|++|.+++.+.++..+ T Consensus 125 ~~NQ~ge~V~~~~~~~lv~rr 145 (146) T cd03451 125 GYNQDGEPVLSFERTALVPKR 145 (146) T ss_pred EECCCCCEEEEEEEEEEEECC T ss_conf 990799899993999899867 No 15 >cd03452 MaoC_C MaoC_C The C-terminal hot dog fold of the MaoC (monoamine oxidase C) dehydratase regulatory protein. Orthologs of MaoC include PaaZ [Escherichia coli] and PaaN [Pseudomonas putida], which are putative ring-opening enzymes involved in phenylacetic acid degradation. The C-terminal domain of MaoC has sequence similarity to (R)-specific enoyl-CoA hydratase,Hydratase-Dehydrogenase-Epimerase (HDE) protein, and the fatty acid synthase beta subunit. MaoC also has an N-terminal PutA domain like that found in the E. coli PutA proline dehydrogenase and other members of the aldehyde dehydrogenase family. Probab=98.36 E-value=6.5e-06 Score=55.80 Aligned_cols=113 Identities=19% Similarity=0.293 Sum_probs=67.9 Q ss_pred EEEECCCEE-EEEEEECCCCH------------------HHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 998469749-99998268883------------------13134799950225669999863463454432997763026 Q gi|254780770|r 33 VNIQRDESA-IGIKNVTFNEP------------------HFMGHFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPP 93 (161) Q Consensus 33 ~~~~~g~~~-~g~k~v~~~e~------------------ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~ 93 (161) .++++|+.. +..++|+..+- |-+...-+..+.+|.+.+- .+.+.+. ....+... T Consensus 3 ED~~vG~~~~~~~rTiTe~di~~Fa~~tgD~~p~H~D~e~A~~s~fg~~ia~G~~~~~---~~~gl~~----~~~~~~~~ 75 (142) T cd03452 3 EQLRPGDSLLTHRRTVTEADIVNFACLTGDHFYAHMDEIAAKASFFGKRVAHGYFVLS---AAAGLFV----DPAPGPVL 75 (142) T ss_pred CCCCCCCEEEECCEEECHHHHHHHHHHHCCCCHHHCCHHHHHHCCCCCCCCCHHHHHH---HHHHHEE----ECCCCCEE T ss_conf 5479988997789888799999999856885730248999962889981414689999---9875485----17887457 Q ss_pred EEEECCCCEEEEEEEECCEEEEEEEEEEEE------CCEEEEEEEEEE-CCEEEEEEEEEEEEECC Q ss_conf 775236413600221288899999999720------777999999999-99999989999998445 Q gi|254780770|r 94 YLMSIDKARFRKPVFPGDRLEYHVNKVRNR------VDLWKFQCCAKV-ENTVVAEAEICAMVMHE 152 (161) Q Consensus 94 ~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~------~~~~~~~~~~~v-~~~~va~a~l~~~i~~~ 152 (161) ...+++++||.+||+|||+|..++++.+.+ .++..+++++.. +|++|++.+...++..+ T Consensus 76 a~~g~~~lrf~~PV~~GDTi~~~~~V~e~~~~~~~~~Giv~~~~~~~NQ~ge~V~~~~~~~lv~rr 141 (142) T cd03452 76 ANYGLENLRFLEPVYPGDTIQVRLTCKRKIPRDGQDYGVVRWDAEVTNQNGELVASYDILTLVAKK 141 (142) T ss_pred EECCCCEEEECCCCCCCCEEEEEEEEEEEEECCCCCCEEEEEEEEEECCCCCEEEEEEEEEEEECC T ss_conf 723442569708989999999999999985167899679999999997999999993997799837 No 16 >cd03446 MaoC_like MoaC_like Similar to the MaoC (monoamine oxidase C) dehydratase regulatory protein but without the N-terminal PutA domain. This protein family has a hot-dog fold similar to that of (R)-specific enoyl-CoA hydratase, the peroxisomal Hydratase-Dehydrogenase-Epimerase (HDE) protein, and the fatty acid synthase beta subunit. Probab=98.34 E-value=9.2e-06 Score=54.91 Aligned_cols=85 Identities=24% Similarity=0.314 Sum_probs=56.3 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEEC------CEEEEEEEE Q ss_conf 99502256699998634634544329977630267752364136002212888999999997207------779999999 Q gi|254780770|r 60 GRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRV------DLWKFQCCA 133 (161) Q Consensus 60 ~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~------~~~~~~~~~ 133 (161) +.++.+|.+.+=.+ .++. ..............+.+++||.+||+|||+|.+++++...+. ++..++.++ T Consensus 49 ~~~ia~G~~~~s~~---~~l~--~~~~~~~~~~~a~~g~~~~rf~~PV~~GDtl~~~~~V~~~~~~~~~~~Giv~~~~~~ 123 (140) T cd03446 49 GERIAHGLLTLSIA---TGLL--QRLGVFERTVVAFYGIDNLRFLNPVFIGDTIRAEAEVVEKEEKDGEDAGVVTRRIEV 123 (140) T ss_pred CCCCCCHHHHHHHH---HHHH--HCCCCCCCCEEEECCCEEEEECCCCCCCCEEEEEEEEEEEEECCCCCCEEEEEEEEE T ss_conf 88233447899999---8876--316777753365123104587488889999999999999541579982799999999 Q ss_pred EE-CCEEEEEEEEEEEE Q ss_conf 99-99999989999998 Q gi|254780770|r 134 KV-ENTVVAEAEICAMV 149 (161) Q Consensus 134 ~v-~~~~va~a~l~~~i 149 (161) +. +|++|++++-+.++ T Consensus 124 ~NQ~ge~V~~~~~~~lv 140 (140) T cd03446 124 VNQRGEVVQSGEMSLLV 140 (140) T ss_pred ECCCCCEEEEEEEEEEC T ss_conf 94899999997988779 No 17 >cd03440 hot_dog The hotdog fold was initially identified in the E. coli FabA (beta-hydroxydecanoyl-acyl carrier protein (ACP)-dehydratase) structure and subsequently in 4HBT (4-hydroxybenzoyl-CoA thioesterase) from Pseudomonas. A number of other seemingly unrelated proteins also share the hotdog fold. These proteins have related, but distinct, catalytic activities that include metabolic roles such as thioester hydrolysis in fatty acid metabolism, and degradation of phenylacetic acid and the environmental pollutant 4-chlorobenzoate. This superfamily also includes the PaaI-like protein FapR, a non-catalytic bacterial homolog involved in transcriptional regulation of fatty acid biosynthesis. Probab=98.21 E-value=3.4e-05 Score=51.47 Aligned_cols=85 Identities=29% Similarity=0.353 Sum_probs=70.3 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEEC-CE Q ss_conf 99502256699998634634544329977630267752364136002212888999999997207779999999999-99 Q gi|254780770|r 60 GRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVE-NT 138 (161) Q Consensus 60 ~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~-~~ 138 (161) .+++++|+..++.+.|+.......... .+....+.... ++|++|+.|||++.+++++...+.......+.+..+ ++ T Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~-~~f~~~~~~g~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 90 (100) T cd03440 14 GGGIVHGGLLLALADEAAGAAAARLGG--RGLGAVTLSLD-VRFLRPVRPGDTLTVEAEVVRVGRSSVTVEVEVRNEDGK 90 (100) T ss_pred CCCCEEHHHHHHHHHHHHHHHHHHHCC--CCCEEEEEEEE-EEEEECCCCCCEEEEEEEEEECCCEEEEEEEEEEECCCC T ss_conf 898590799999999999999998568--99359999999-998005879909999999998897199999999989897 Q ss_pred EEEEEEEEE Q ss_conf 999899999 Q gi|254780770|r 139 VVAEAEICA 147 (161) Q Consensus 139 ~va~a~l~~ 147 (161) +++++.... T Consensus 91 ~~~~~~~~~ 99 (100) T cd03440 91 LVATATATF 99 (100) T ss_pred EEEEEEEEE T ss_conf 999999998 No 18 >cd03454 YdeM YdeM is a Bacillus subtilis protein that belongs to a family of prokaryotic proteins of unkown function. YdeM has sequence similarity to the hot-dog fold of (R)-specific enoyl-CoA hydratase. Other enzymes with this fold include the peroxisomal Hydratase-Dehydrogenase-Epimerase (HDE) protein, and the fatty acid synthase beta subunit. Probab=98.18 E-value=2.2e-05 Score=52.67 Aligned_cols=59 Identities=27% Similarity=0.336 Sum_probs=47.2 Q ss_pred EEEEEEECCCCEEEEEEEECCEEEEEEEEEEEE-------CCEEEEEEEEEE-CCEEEEEEEEEEEE Q ss_conf 026775236413600221288899999999720-------777999999999-99999989999998 Q gi|254780770|r 91 APPYLMSIDKARFRKPVFPGDRLEYHVNKVRNR-------VDLWKFQCCAKV-ENTVVAEAEICAMV 149 (161) Q Consensus 91 ~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~-------~~~~~~~~~~~v-~~~~va~a~l~~~i 149 (161) ......+.+++||++||+|||+|..++++...+ .++..+++++.+ +|++|++++.+.++ T Consensus 73 ~~~a~~g~~~~rf~~PV~~GDTi~~~~eV~~~~~~~sr~~~GiV~~~~~~~NQ~ge~V~~~~~~~lv 139 (140) T cd03454 73 ASGGSPGIDELRWPRPVRPGDTLSVEVEVLDKRPSRSRPDRGIVTLRSETLNQRGEVVLTFEATVLV 139 (140) T ss_pred CEEEECCCCEEEECCCCCCCCEEEEEEEEEEEEECCCCCCCEEEEEEEEEEECCCCEEEEEEEEEEE T ss_conf 3012425212588378768999999999999364577898369999999992899999998981469 No 19 >PRK08190 bifunctional enoyl-CoA hydratase/phosphate acetyltransferase; Validated Probab=98.09 E-value=8.6e-05 Score=49.07 Aligned_cols=117 Identities=21% Similarity=0.265 Sum_probs=79.6 Q ss_pred EEEECCCEEEEEEEECCCCHH-HC---C-----CCC---------CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEE Q ss_conf 998469749999982688831-31---3-----479---------99502256699998634634544329977630267 Q gi|254780770|r 33 VNIQRDESAIGIKNVTFNEPH-FM---G-----HFP---------GRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPY 94 (161) Q Consensus 33 ~~~~~g~~~~g~k~v~~~e~f-f~---g-----HFp---------~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~ 94 (161) .++..|.++.-.+.++.++-. |. | |.. +.++-+|.+.- +.+-..+.....+....| T Consensus 12 del~VG~sAs~~rtiT~~Di~~FA~~SGD~NP~Hld~e~A~~t~F~~~IaHGm~~a------sLiSavLGt~LPGpGTiY 85 (465) T PRK08190 12 DELAIGDSASLVRTLTPDDIELFAAMSGDVNPAHLDAAYAASDGFHHVVAHGMWGG------ALISAVLGTRLPGPGTIY 85 (465) T ss_pred HHCCCCCEEEEEEEECHHHHHHHHHHHCCCCCCCCCHHHHHCCCCCCCEEHHHHHH------HHHHHHHHCCCCCCCEEE T ss_conf 88268874899997079999999986289997778966861278887143077789------999999746589997587 Q ss_pred EEECCCCEEEEEEEECCEEEEEEEEEEEE--CCEEEEEEEEE-ECCEEEEEEEEEEEEECCCCCCC Q ss_conf 75236413600221288899999999720--77799999999-99999998999999844520023 Q gi|254780770|r 95 LMSIDKARFRKPVFPGDRLEYHVNKVRNR--VDLWKFQCCAK-VENTVVAEAEICAMVMHEKKEKN 157 (161) Q Consensus 95 l~~i~~~kF~~~V~PGd~L~i~~~i~~~~--~~~~~~~~~~~-v~~~~va~a~l~~~i~~~~~e~~ 157 (161) +. .+.+|++||++||+++..+++.+.+ .+...++|++. .+|+.|++|+-..+-+.++-.+. T Consensus 86 l~--Q~L~F~~PV~iGDtvtatVtV~ek~~~k~~V~L~~~~~nq~G~~Vi~G~A~V~APt~kv~~~ 149 (465) T PRK08190 86 LS--QSLRFRRPVRIGDTITVTVTVREKRPEKRIVLLDCRCTNQDGEVVISGTAEVIAPTEKVRRP 149 (465) T ss_pred EE--EEEEECCCCCCCCEEEEEEEEEEECCCCCEEEEEEEEECCCCCEEEEEEEEEECCCCCCCCC T ss_conf 65--14676678889998999999999717889899997999069979998579998885445676 No 20 >cd03449 R_hydratase (R)-hydratase [(R)-specific enoyl-CoA hydratase] catalyzes the hydration of trans-2-enoyl CoA to (R)-3-hydroxyacyl-CoA as part of the PHA (polyhydroxyalkanoate) biosynthetic pathway. (R)-hydratase contains a hot-dog fold similar to those of thioesterase II, and beta-hydroxydecanoyl-ACP dehydratase, MaoC dehydratase, Hydratase-Dehydrogenase-Epimerase protein (HDE), and the fatty acid synthase beta subunit. The active site lies within a substrate-binding tunnel formed by the (R)-hydratase homodimer. A subset of the bacterial (R)-hydratases contain a C-terminal phosphotransacetylase (PTA) domain. Probab=98.01 E-value=6.1e-05 Score=49.96 Aligned_cols=81 Identities=20% Similarity=0.241 Sum_probs=52.4 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEEC--CEEEEEEEEEE-C Q ss_conf 99502256699998634634544329977630267752364136002212888999999997207--77999999999-9 Q gi|254780770|r 60 GRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRV--DLWKFQCCAKV-E 136 (161) Q Consensus 60 ~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~--~~~~~~~~~~v-~ 136 (161) +.|+.||.|..=.++++ +.. ..+. .+ . +..-.+++|.+||+|||+|++++++...+. +...+++++.. + T Consensus 44 ~~~i~~G~l~~s~~~~~---~~~-~~~g-~g-~--~~~~~~~rF~~PV~~gDtl~~~~~V~~~~~~~~~v~~~~~~~nq~ 115 (128) T cd03449 44 GGRIAHGMLTASLISAV---LGT-LLPG-PG-T--IYLSQSLRFLRPVFIGDTVTATVTVTEKREDKKRVTLETVCTNQN 115 (128) T ss_pred CCCEECCHHHHHHHHHH---HHH-CCCC-CC-E--EEEEEEEEECCCCCCCCEEEEEEEEEEEECCCCEEEEEEEEEECC T ss_conf 86033127789999999---740-5689-86-1--874347999356799999999999999978999999999999689 Q ss_pred CEEEEEEEEEEE Q ss_conf 999998999999 Q gi|254780770|r 137 NTVVAEAEICAM 148 (161) Q Consensus 137 ~~~va~a~l~~~ 148 (161) |++|.+|+...+ T Consensus 116 g~~V~~G~a~v~ 127 (128) T cd03449 116 GEVVIEGEAVVL 127 (128) T ss_pred CCEEEEEEEEEE T ss_conf 999999899994 No 21 >PRK10694 acyl-CoA thioester hydrolase; Provisional Probab=97.79 E-value=0.00058 Score=44.10 Aligned_cols=87 Identities=13% Similarity=0.092 Sum_probs=64.6 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEEC-------- Q ss_conf 256699998634634544329977630267752364136002212888999999997207779999999999-------- Q gi|254780770|r 65 PGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVE-------- 136 (161) Q Consensus 65 PGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~-------- 136 (161) =|. ++.-|-.+|++.+.... +...+.++++++.|++||..||.+.+++++....+.......++++. T Consensus 31 GG~-lm~~mD~aa~i~A~r~~----~~~~VTasvd~v~F~~Pv~vGdiv~~~a~V~~~G~tSm~V~v~v~~~~~~~~~~g 105 (133) T PRK10694 31 GGW-LMSQMDIGGAILAKEIA----HGRVVTVRVEGMTFLRPVAVGDVVCCYARCVKKGTTSISINIEVWVKKVASEPIG 105 (133) T ss_pred HHH-HHHHHHHHHHHHHHHHH----CCCEEEEEECEEEECCCCCCCCEEEEEEEEEEECCCCCEEEEEEEEEECCCCCCC T ss_conf 999-99999999999999973----9966999976189867630798999999999981663189999999864677777 Q ss_pred -CEEEEEEEEEEEEECCCCCC Q ss_conf -99999899999984452002 Q gi|254780770|r 137 -NTVVAEAEICAMVMHEKKEK 156 (161) Q Consensus 137 -~~~va~a~l~~~i~~~~~e~ 156 (161) ..++++|.|+++-+|++-+. T Consensus 106 ~~~~~~~a~ftfValDe~g~P 126 (133) T PRK10694 106 QRYKATEALFKYVAVDPEGKP 126 (133) T ss_pred EEEEEEEEEEEEEEECCCCCE T ss_conf 089999999999999999697 No 22 >cd03447 FAS_MaoC FAS_MaoC, the MaoC-like hot dog fold of the fatty acid synthase, beta subunit. Other enzymes with this fold include MaoC dehydratase, Hydratase-Dehydrogenase-Epimerase protein (HDE), and 17-beta-hydroxysteriod dehydrogenase (HSD). Probab=97.77 E-value=0.00035 Score=45.44 Aligned_cols=90 Identities=16% Similarity=0.119 Sum_probs=53.3 Q ss_pred CCHHHC-CCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEE Q ss_conf 883131-3479995022566999986346345443299776302677523641360022128889999999972077799 Q gi|254780770|r 50 NEPHFM-GHFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWK 128 (161) Q Consensus 50 ~e~ff~-gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~ 128 (161) |+.|-+ .-|| .||.-|.+. =++ ++.++..+..+.... ...--+++|.+||+|||+|+++++.+....+... T Consensus 31 D~~~A~~~gf~-~~I~HGm~~-~a~--~~~~l~~~~~~g~~~----~~~~~~~rF~~pV~~GD~lt~~v~v~~~~~~~~~ 102 (126) T cd03447 31 SRVFASYAGLP-GTITHGMYT-SAA--VRALVETWAADNDRS----RVRSFTASFVGMVLPNDELEVRLEHVGMVDGRKV 102 (126) T ss_pred CHHHHHHCCCC-CCCHHHHHH-HHH--HHHHHHHHCCCCCCE----EEEEEEEEEEECCCCCCEEEEEEEEEEEECCCEE T ss_conf 99899874999-966247679-999--999999743799963----8999987981032599999999999999789179 Q ss_pred EEEEEE--ECCEEEEEEEEEE Q ss_conf 999999--9999999899999 Q gi|254780770|r 129 FQCCAK--VENTVVAEAEICA 147 (161) Q Consensus 129 ~~~~~~--v~~~~va~a~l~~ 147 (161) +..++. .+|++|.+|+-.. T Consensus 103 v~~~~~~~~~G~~Vl~G~A~V 123 (126) T cd03447 103 IKVEARNEETGELVLRGEAEV 123 (126) T ss_pred EEEEEEECCCCEEEEEEEEEE T ss_conf 999999976990999999999 No 23 >cd03441 R_hydratase_like (R)-hydratase [(R)-specific enoyl-CoA hydratase]. Catalyzes the hydration of trans-2-enoyl CoA to (R)-3-hydroxyacyl-CoA as part of the PHA (polyhydroxyalkanoate) biosynthetic pathway. The structure of the monomer includes a five-strand antiparallel beta-sheet wrapped around a central alpha helix, referred to as a hot dog fold. The active site lies within a substrate-binding tunnel formed by the homodimer. Other enzymes with this fold include MaoC dehydratase, Hydratase-Dehydrogenase-Epimerase protein (HDE), and the fatty acid synthase beta subunit. Probab=97.76 E-value=0.0004 Score=45.04 Aligned_cols=90 Identities=22% Similarity=0.227 Sum_probs=55.6 Q ss_pred CHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEEC----CE Q ss_conf 83131347999502256699998634634544329977630267752364136002212888999999997207----77 Q gi|254780770|r 51 EPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRV----DL 126 (161) Q Consensus 51 e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~----~~ 126 (161) +.|-+..--+.++.||.+..-.+.++. ..+... ......+..+++|++||+|||+|++.+++...+. +. T Consensus 32 ~~~a~~~~~~~~ia~G~~~~~~~~~~~---~~~~~~----~~~~~~~~~~~~f~~pv~~GDtl~~~~~v~~~~~~~~~~~ 104 (127) T cd03441 32 PEYAKAAGFGGRIAHGMLTLSLASGLL---VQWLPG----TDGANLGSQSVRFLAPVFPGDTLRVEVEVLGKRPSKGRGV 104 (127) T ss_pred HHHHHHCCCCCCEECCHHHHHHHHHHH---HHHCCC----CCCEEEEEEEEEEECCCCCCCEEEEEEEEEEEEECCCCCE T ss_conf 888835789862315544999999999---887066----5513565648865046799999999999999997599709 Q ss_pred EEEEEEEEE-CCEEEEEEEEEE Q ss_conf 999999999-999999899999 Q gi|254780770|r 127 WKFQCCAKV-ENTVVAEAEICA 147 (161) Q Consensus 127 ~~~~~~~~v-~~~~va~a~l~~ 147 (161) ..++..+.. +++++..++.+. T Consensus 105 v~~~~~~~nq~~~~v~~g~~~v 126 (127) T cd03441 105 VTVRTEARNQGGEVVLSGEATV 126 (127) T ss_pred EEEEEEEEECCCCEEEEEEEEE T ss_conf 9999999979999999999994 No 24 >cd03442 BFIT_BACH Brown fat-inducible thioesterase (BFIT). Brain acyl-CoA hydrolase (BACH). These enzymes deacylate long-chain fatty acids by hydrolyzing acyl-CoA thioesters to free fatty acids and CoA-SH. Eukaryotic members of this family are expressed in brain, testis, and brown adipose tissues. The archeal and eukaryotic members of this family have two tandem copies of the conserved hot dog fold, while most bacterial members have only one copy. Probab=97.46 E-value=0.0038 Score=39.21 Aligned_cols=87 Identities=25% Similarity=0.290 Sum_probs=64.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEEC------C Q ss_conf 2256699998634634544329977630267752364136002212888999999997207779999999999------9 Q gi|254780770|r 64 MPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVE------N 137 (161) Q Consensus 64 mPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~------~ 137 (161) .=|. +++-|-.+|++.+..... .....++++++.|++|+..||.+.+++++....+.......+++.. . T Consensus 26 ~GG~-ll~~~d~~a~~~a~~~~~----~~~vt~~vd~i~F~~pv~~Gd~v~~~~~v~~~g~sSmev~v~v~~~~~~~~~~ 100 (123) T cd03442 26 FGGW-LLEWMDELAGIAAYRHAG----GRVVTASVDRIDFLKPVRVGDVVELSARVVYTGRTSMEVGVEVEAEDPLTGER 100 (123) T ss_pred EHHH-HHHHHHHHHHHHHHHHCC----CCEEEEEECCEEECCCCCCCCEEEEEEEEEEECCEEEEEEEEEEEEECCCCCE T ss_conf 6899-999999999999999709----94699998868991235479799999999971561799999999992899969 Q ss_pred EEEEEEEEEEEEECCCCC Q ss_conf 999989999998445200 Q gi|254780770|r 138 TVVAEAEICAMVMHEKKE 155 (161) Q Consensus 138 ~~va~a~l~~~i~~~~~e 155 (161) +.++++.++++-.+++.+ T Consensus 101 ~~~~~~~ftfVa~d~~gr 118 (123) T cd03442 101 RLVTSAYFTFVALDEDGK 118 (123) T ss_pred EEEEEEEEEEEEECCCCC T ss_conf 999999999999999949 No 25 >cd03453 SAV4209_like SAV4209_like. Similar in sequence to the Streptomyces avermitilis SAV4209 protein, with a hot dog fold that is similar to those of (R)-specific enoyl-CoA hydratase, the peroxisomal Hydratase-Dehydrogenase-Epimerase (HDE) protein, and the fatty acid synthase beta subunit. Probab=97.45 E-value=0.003 Score=39.82 Aligned_cols=87 Identities=20% Similarity=0.122 Sum_probs=53.8 Q ss_pred CCHHHC-CCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEE----EC Q ss_conf 883131-3479995022566999986346345443299776302677523641360022128889999999972----07 Q gi|254780770|r 50 NEPHFM-GHFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRN----RV 124 (161) Q Consensus 50 ~e~ff~-gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~----~~ 124 (161) |+.|-+ ..||+ ++.+|.+..-.+.++ +..+. +. .+ .+.+ -++||++||+|||+|++..+++.. .. T Consensus 33 D~~~A~~~g~~~-~iahG~~~~a~~~~~---~~~~~-g~-~~---~~~~-~~~rF~~pV~~Gdtl~~~~~V~~k~~~~~~ 102 (127) T cd03453 33 DEDFAKKVGLPG-VIAHGMLTMGLLGRL---VTDWV-GD-PG---RVVS-FGVRFTKPVPVPDTLTCTGIVVEKTVADGE 102 (127) T ss_pred CHHHHHHCCCCC-CEECHHHHHHHHHHH---HHHHC-CC-CE---EEEE-EECEEECCCCCCCEEEEEEEEEEEEECCCC T ss_conf 888995659998-508088889999998---76315-99-74---9999-205494860189999999999998982899 Q ss_pred CEEEEEEEEEE-CCEEEEEEEEE Q ss_conf 77999999999-99999989999 Q gi|254780770|r 125 DLWKFQCCAKV-ENTVVAEAEIC 146 (161) Q Consensus 125 ~~~~~~~~~~v-~~~~va~a~l~ 146 (161) +...++..+.. +|+.+.+|+-. T Consensus 103 ~~v~~~v~v~nq~Ge~vv~G~A~ 125 (127) T cd03453 103 DALTVTVDATDQAGGKKVLGRAI 125 (127) T ss_pred CEEEEEEEEEECCCCEEEEEEEE T ss_conf 78999999999999999999999 No 26 >PRK11563 bifunctional aldehyde dehydrogenase/enoyl-CoA hydratase; Provisional Probab=97.44 E-value=0.0016 Score=41.44 Aligned_cols=117 Identities=21% Similarity=0.301 Sum_probs=70.9 Q ss_pred CCEEEEEEEEEEECCCEEE-EEEEECCCCH---------HHCCCCC-----CCCCCCHH-----HHHHHHHHHHHHHHHH Q ss_conf 7505478999984697499-9998268883---------1313479-----99502256-----6999986346345443 Q gi|254780770|r 24 YPFLLVDKVVNIQRDESAI-GIKNVTFNEP---------HFMGHFP-----GRPVMPGV-----LILEGMAQTAGAICAI 83 (161) Q Consensus 24 ~Pfl~ID~i~~~~~g~~~~-g~k~v~~~e~---------ff~gHFp-----~~PvmPGv-----LliE~maQ~~~~l~~~ 83 (161) .||. -.+.++..|++.. ...+|+.++- .|-.|+- .+|++|+- |++- + ++ ++ T Consensus 531 hpfr--~~f~~l~ig~~~~t~~r~vt~~di~~fa~~sgd~fyah~d~~aa~~~~~f~~rvahgy~~~s-~--aa----gl 601 (676) T PRK11563 531 HPFR--KYFEELRIGDSLLTARRTVTEADIVNFACLTGDTFYAHMDEIAAKANPFFGGRVAHGYLVLS-A--AA----GL 601 (676) T ss_pred CCCC--CCHHHCCCCCEEECCCEEECHHHHHHHHHCCCCCCCCCCCHHHHHCCCCCCCCCCCHHHHHH-H--HH----CC T ss_conf 8874--52411565656620780772999998750666540300488787228988763332220686-7--50----36 Q ss_pred CCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEE------CCEEEEEEEEEE-CCEEEEEEEEEEEE Q ss_conf 2997763026775236413600221288899999999720------777999999999-99999989999998 Q gi|254780770|r 84 HNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNR------VDLWKFQCCAKV-ENTVVAEAEICAMV 149 (161) Q Consensus 84 ~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~------~~~~~~~~~~~v-~~~~va~a~l~~~i 149 (161) .-+...+.+..--++++.||..||.|||+|..+.++++.. .+....+++++. +|++||+.++.-++ T Consensus 602 fv~p~~gpvlan~g~~~lrf~~pv~~gd~i~v~lt~k~k~~r~~~~~g~v~w~~~v~nq~~~~va~y~~lt~v 674 (676) T PRK11563 602 FVDPAPGPVLANYGLDNLRFIEPVKPGDTIQVRLTCKRKTPRNQKSYGEVRWDVEVTNQHGEPVATYDILTLV 674 (676) T ss_pred CCCCCCCCEEECCCCCEEEECCCCCCCCEEEEEEEEEEECCCCCCCCCEEEEEEEEECCCCCEEEEEEEEEEE T ss_conf 4259998665012743058615668997799999999803677898736999999985899878998654002 No 27 >COG2050 PaaI HGG motif-containing thioesterase, possibly involved in aromatic compounds catabolism [Secondary metabolites biosynthesis, transport, and catabolism] Probab=97.33 E-value=0.014 Score=35.84 Aligned_cols=120 Identities=20% Similarity=0.191 Sum_probs=73.0 Q ss_pred CCCCEE-EE-EEEEEEECCCEEEEEEEECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEC- Q ss_conf 997505-47-899998469749999982688831313479995022566999986346345443299776302677523- Q gi|254780770|r 22 HRYPFL-LV-DKVVNIQRDESAIGIKNVTFNEPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSI- 98 (161) Q Consensus 22 hr~Pfl-~I-D~i~~~~~g~~~~g~k~v~~~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i- 98 (161) -+.||. ++ =+++++++|. +.....+..+. .+ |..-+-=|++ =++|-+++.++......... .. .++ T Consensus 16 ~~~~~~~~lg~~~~~~~~g~-~~~~l~~~~~~--~~---~~G~~HGG~i--~alaD~a~~~a~~~~~~~~~--~~-~ti~ 84 (141) T COG2050 16 ERSPFLKTLGIEIEEIEEGE-AEATLPVDPEL--LN---PGGILHGGVI--AALADSAAGLAANSLLGVVA--LA-VTLE 84 (141) T ss_pred CCCCHHHHHCCEEEEECCCE-EEEEEECCHHH--CC---CCCCEEHHHH--HHHHHHHHHHHHHHCCCCCC--EE-EEEE T ss_conf 14733454185999963882-89999868888--18---8881539999--99999999999720267652--05-7899 Q ss_pred CCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEE--CCEEEEEEEEEEEEECCC Q ss_conf 6413600221288899999999720777999999999--999999899999984452 Q gi|254780770|r 99 DKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKV--ENTVVAEAEICAMVMHEK 153 (161) Q Consensus 99 ~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v--~~~~va~a~l~~~i~~~~ 153 (161) -|+.|.+|+.+|+ ++.++++.+..+.....+.+++. +++++|++..++++.+.. T Consensus 85 l~i~flr~~~~g~-v~a~a~v~~~G~~~~v~~i~v~~~~~~~lva~~~~t~~v~~~~ 140 (141) T COG2050 85 LNINFLRPVKEGD-VTAEARVLHLGRRVAVVEIEVKNDEGGRLVAKGTGTYAVLRKR 140 (141) T ss_pred EEEEEECCCCCCC-EEEEEEEEEECCEEEEEEEEEEECCCCEEEEEEEEEEEEECCC T ss_conf 9999853777896-7999999982757999999999788967999999999983688 No 28 >cd03448 HDE_HSD HDE_HSD The R-hydratase-like hot dog fold of the 17-beta-hydroxysteriod dehydrogenase (HSD), and Hydratase-Dehydrogenase-Epimerase (HDE) proteins. Other enzymes with this fold include MaoC dehydratase, and the fatty acid synthase beta subunit. Probab=97.29 E-value=0.004 Score=39.03 Aligned_cols=80 Identities=29% Similarity=0.414 Sum_probs=50.2 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEE Q ss_conf 34799950225669999863463454432997763026775236413600221288899999999720777999999999 Q gi|254780770|r 56 GHFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKV 135 (161) Q Consensus 56 gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v 135 (161) .-|| .||+-|.+- ++-++..+.....+.+ ...+.+ -++||.+||+|||+|.+++.. .++...+.+++.. T Consensus 40 agf~-~~I~HGm~t---~g~~~~av~~~~~~g~---~~~~~~-~~~rF~~pV~pGd~l~v~iw~---~~~~v~~~~~~~~ 108 (122) T cd03448 40 AGFP-RPILHGLCT---YGFAARAVLEAFADGD---PARFKA-IKVRFSSPVFPGETLRTEMWK---EGNRVIFQTKVVE 108 (122) T ss_pred CCCC-CCEEHHHHH---HHHHHHHHHHHHCCCC---CEEEEE-EEEEECCCCCCCCEEEEEEEE---ECCEEEEEEEEEE T ss_conf 3999-854136679---9999999998675899---603899-998982653699989999999---7999999999964 Q ss_pred CCEEEEEEEEE Q ss_conf 99999989999 Q gi|254780770|r 136 ENTVVAEAEIC 146 (161) Q Consensus 136 ~~~~va~a~l~ 146 (161) ++++|-++... T Consensus 109 ~~~vVi~~g~a 119 (122) T cd03448 109 RDVVVLSNGAA 119 (122) T ss_pred CCCEEEECCEE T ss_conf 89099989779 No 29 >COG2030 MaoC Acyl dehydratase [Lipid metabolism] Probab=97.25 E-value=0.0051 Score=38.44 Aligned_cols=60 Identities=30% Similarity=0.402 Sum_probs=45.2 Q ss_pred EEEEEECCCCEEEEEEEECCEEEEEEEEEEEEC----CEEEEEEEEEE-CCEEEEEEEEEEEEEC Q ss_conf 267752364136002212888999999997207----77999999999-9999998999999844 Q gi|254780770|r 92 PPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRV----DLWKFQCCAKV-ENTVVAEAEICAMVMH 151 (161) Q Consensus 92 ~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~----~~~~~~~~~~v-~~~~va~a~l~~~i~~ 151 (161) .+.-.+.+++||.+||+|||+|..++++...+. ++....+++.. .++.++..+-..++.. T Consensus 92 ~~~~~g~~~vRF~~PV~~Gdtl~~~~~v~~~~~~~~~G~v~~~~~~~~~~g~~v~~~~~~~~~~~ 156 (159) T COG2030 92 VGANLGGDEVRFVKPVFPGDTLRARVEVLDKRPSKSRGLVTLRLETVNQEGELVLTLEATVLVLR 156 (159) T ss_pred CEEEEEEEEEEECCCCCCCCEEEEEEEEEEECCCCCCCEEEEEEEEEEECCCEEEEEEEEEEEEE T ss_conf 04431577779707877999999999996401267763799999999405958101468888870 No 30 >pfam01575 MaoC_dehydratas MaoC like domain. The maoC gene is part of a operon with maoA which is involved in the synthesis of monoamine oxidase. The MaoC protein is found to share similarity with a wide variety of enzymes; estradiol 17 beta-dehydrogenase 4, peroxisomal hydratase-dehydrogenase-epimerase, fatty acid synthase beta subunit. Several bacterial proteins that are composed solely of this domain have (R)-specific enoyl-CoA hydratase activity. This domain is also present in the NodN nodulation protein N. Probab=97.17 E-value=0.0018 Score=41.15 Aligned_cols=63 Identities=30% Similarity=0.365 Sum_probs=40.2 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEECCE Q ss_conf 34799950225669999863463454432997763026775236413600221288899999999720777 Q gi|254780770|r 56 GHFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDL 126 (161) Q Consensus 56 gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~ 126 (161) ..|+ .++.+|.+.. +.+.+++..... .+ .+...+..++||.+||+|||+|+++++++..+.+. T Consensus 46 ~~f~-~~iahG~~t~---s~~~~~~~~~~~---~~-~~~~~~~~~~rF~~PV~~GDtl~~~~ev~~~~~~k 108 (123) T pfam01575 46 AGFG-GPIAHGMLTL---AIARGLVEEQGG---DN-VVARYGGWSVRFTGPVFPGDTLRTEVEVVGKRDGR 108 (123) T ss_pred CCCC-CEECCCHHHH---HHHHHHHHHHCC---CC-CEEEEEEEEEEECCCCCCCCEEEEEEEEEEEECCC T ss_conf 7899-8021517799---999999888727---77-41555521778778878899999999999998687 No 31 >COG1607 Acyl-CoA hydrolase [Lipid metabolism] Probab=96.93 E-value=0.029 Score=33.88 Aligned_cols=87 Identities=21% Similarity=0.175 Sum_probs=64.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEEC------ Q ss_conf 02256699998634634544329977630267752364136002212888999999997207779999999999------ Q gi|254780770|r 63 VMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVE------ 136 (161) Q Consensus 63 vmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~------ 136 (161) ++=|.| +.-|-.+|++.+.... +...+.++++.+.|.+||.-||.+.+++.+....+......++++.. T Consensus 31 ifGG~l-m~~mD~~a~i~A~~~a----~~~vVTasvd~v~F~~Pv~vGd~v~~~a~v~~~GrTSm~V~Vev~~~~~~~~~ 105 (157) T COG1607 31 IFGGWL-LSWMDLAAAIAASRHA----GGRVVTASVDSVDFKKPVRVGDIVCLYARVVYTGRTSMEVGVEVWAEDIRSGE 105 (157) T ss_pred CCCHHH-HHHHHHHHHHHHHHHH----CCEEEEEEECEEEECCCCCCCCEEEEEEEEEECCCCCEEEEEEEEEECCCCCC T ss_conf 414799-9999999999999984----99079999530887262304968999999954476428999999983266775 Q ss_pred CEEEEEEEEEEEEECCCC Q ss_conf 999998999999844520 Q gi|254780770|r 137 NTVVAEAEICAMVMHEKK 154 (161) Q Consensus 137 ~~~va~a~l~~~i~~~~~ 154 (161) -..++++.|+++=++++. T Consensus 106 ~~~~t~~~ft~VAvd~~g 123 (157) T COG1607 106 RRLATSAYFTFVAVDEDG 123 (157) T ss_pred CEEEEEEEEEEEEECCCC T ss_conf 037445999999888999 No 32 >COG0824 FcbC Predicted thioesterase [General function prediction only] Probab=96.92 E-value=0.017 Score=35.36 Aligned_cols=66 Identities=14% Similarity=0.114 Sum_probs=56.5 Q ss_pred EEEEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEEEECCCCCCC Q ss_conf 267752364136002212888999999997207779999999999999998999999844520023 Q gi|254780770|r 92 PPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVENTVVAEAEICAMVMHEKKEKN 157 (161) Q Consensus 92 ~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~~~~va~a~l~~~i~~~~~e~~ 157 (161) ..+...--+++|++|+..||.+++++.+.+.++....+.-+++.+++++|+|+.+.+-++.+..+. T Consensus 55 ~~~~v~~~~i~y~~p~~~~d~l~v~~~v~~~~~~s~~~~~~i~~~~~l~a~~~~~~V~v~~~~~kp 120 (137) T COG0824 55 IAFVVVEAEIDYLRPARLGDVLTVRTRVEELGGKSLTLGYEIVNEDELLATGETTLVCVDLKTGKP 120 (137) T ss_pred EEEEEEEEEEEECCCCCCCCEEEEEEEEEEECCEEEEEEEEEEECCEEEEEEEEEEEEEECCCCCC T ss_conf 479999999899756648989999999997066399999999949999999999999998899956 No 33 >cd03455 SAV4209 SAV4209 is a Streptomyces avermitilis protein with a hot dog fold that is similar to those of (R)-specific enoyl-CoA hydratase, the peroxisomal Hydratase-Dehydrogenase-Epimerase (HDE) protein, and the fatty acid synthase beta subunit. The alpha- and gamma-proteobacterial members of this CD have, in addition to a hot dog fold, an N-terminal extension. Probab=96.88 E-value=0.017 Score=35.34 Aligned_cols=90 Identities=16% Similarity=0.161 Sum_probs=52.9 Q ss_pred CCCCHHHCC-CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEECC- Q ss_conf 688831313-479995022566999986346345443299776302677523641360022128889999999972077- Q gi|254780770|r 48 TFNEPHFMG-HFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVD- 125 (161) Q Consensus 48 ~~~e~ff~g-HFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~- 125 (161) ..|+.|-+. .|| .++..|.+..-.+.++. ..+. +. .. .+.+ -++||++||+|||+|++..+++..+.+ T Consensus 30 H~D~~~A~~~G~~-~~i~hG~l~~~~~~~~l---~~w~-g~-~~---~l~~-~~~rf~~pv~~Gd~lt~~g~V~~~~~~~ 99 (123) T cd03455 30 HHDRDYARAVGYP-DLYVNGPTLAGLVIRYV---TDWA-GP-DA---RVKS-FAFRLGAPLYAGDTLRFGGRVTAKRDDE 99 (123) T ss_pred CCCHHHHHHCCCC-CCEEEHHHHHHHHHHHH---HHHC-CC-CC---EEEE-EEEEEECCEECCCEEEEEEEEEEECCCC T ss_conf 2199999863999-72773789999999999---9764-99-84---5999-9999767511899899999999976886 Q ss_pred EEEEEEEEEE-CCEEEEEEEEEE Q ss_conf 7999999999-999999899999 Q gi|254780770|r 126 LWKFQCCAKV-ENTVVAEAEICA 147 (161) Q Consensus 126 ~~~~~~~~~v-~~~~va~a~l~~ 147 (161) ...++..+.. +|+.+++|+-+. T Consensus 100 ~v~l~~~~~nq~G~~v~~G~A~V 122 (123) T cd03455 100 VVTVELWARNSEGDHVMAGTATV 122 (123) T ss_pred EEEEEEEEECCCCCEEEEEEEEE T ss_conf 69999999989999999999997 No 34 >KOG3328 consensus Probab=96.81 E-value=0.023 Score=34.55 Aligned_cols=111 Identities=14% Similarity=0.090 Sum_probs=75.6 Q ss_pred EEE------EEEEEECCCEEEEEEEECCCCHHHCCCCCCCCCCCH---HHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEC Q ss_conf 478------999984697499999826888313134799950225---66999986346345443299776302677523 Q gi|254780770|r 28 LVD------KVVNIQRDESAIGIKNVTFNEPHFMGHFPGRPVMPG---VLILEGMAQTAGAICAIHNGFDQYAPPYLMSI 98 (161) Q Consensus 28 ~ID------~i~~~~~g~~~~g~k~v~~~e~ff~gHFp~~PvmPG---vLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i 98 (161) ..| |+++.+|| +++|...|+.+.-=. +-.+-| ..++|.++-++. ....++..+..+- T Consensus 21 ~Fd~~~~~i~~~~~~~G-rv~ce~kV~~~~~N~------~k~LHGG~tAtLvD~i~s~~~---~~~~~~~~gvsvd---- 86 (148) T KOG3328 21 GFDRVLNNIRIVSAEPG-RVSCELKVTPDHLNR------FKTLHGGATATLVDLITSAAL---LMTSGFKPGVSVD---- 86 (148) T ss_pred CHHHHCCCEEEEECCCC-EEEEEEEECHHHCCC------CCCCCCCCHHHHHHHHHHHHH---HHCCCCCCCEEEE---- T ss_conf 43442374388532686-389999958799576------465346611018988766788---8526888814999---- Q ss_pred CCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEE--CCEEEEEEEEEEEEECC Q ss_conf 6413600221288899999999720777999999999--99999989999998445 Q gi|254780770|r 99 DKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKV--ENTVVAEAEICAMVMHE 152 (161) Q Consensus 99 ~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v--~~~~va~a~l~~~i~~~ 152 (161) =++.|..++.+||.|.+++++.+...+++..+|+.+. ++++++++.-+.++-.. T Consensus 87 LsvsyL~~AklGe~l~i~a~~vr~Gk~la~t~v~l~~K~t~kiia~grhtk~~~~~ 142 (148) T KOG3328 87 LSVSYLSSAKLGEELEIEATVVRVGKTLAFTDVELRRKSTGKIIAKGRHTKYFRPA 142 (148) T ss_pred EEHHHCCCCCCCCEEEEEEEEEECCCEEEEEEEEEEECCCCEEEEECCEEEEEECC T ss_conf 88555164679976999989840586378999999985788199851148996237 No 35 >PRK10800 acyl-CoA thioester hydrolase YbgC; Provisional Probab=96.80 E-value=0.011 Score=36.40 Aligned_cols=63 Identities=22% Similarity=0.180 Sum_probs=52.1 Q ss_pred EEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEE-CCEEEEEEEEEEEEECCCCCCC Q ss_conf 75236413600221288899999999720777999999999-9999998999999844520023 Q gi|254780770|r 95 LMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKV-ENTVVAEAEICAMVMHEKKEKN 157 (161) Q Consensus 95 l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v-~~~~va~a~l~~~i~~~~~e~~ 157 (161) ...--+++|++|+.-||.|++++.+.+.++....+.-++.. ||+++|+|+...+.++.++.|. T Consensus 55 vV~~~~i~Y~~pa~~~D~l~I~~~i~~~~~~s~~~~~~i~~~d~~l~a~a~~~~V~vd~~~~kp 118 (130) T PRK10800 55 VVRKMTVEYYAPARLDDMLEVQTEITSMRGTSLVFTQRIVNAENTLLNEAEVLIVCVDPLKMKP 118 (130) T ss_pred EEEEEEEEECCCCCCCCEEEEEEEEEECCCEEEEEEEEEECCCCEEEEEEEEEEEEEECCCCEE T ss_conf 9999986944734379899999999966977999999999289979999999999999999918 No 36 >PRK13693 (3R)-hydroxyacyl-ACP dehydratase subunit HadB; Provisional Probab=96.69 E-value=0.054 Score=32.29 Aligned_cols=86 Identities=13% Similarity=0.056 Sum_probs=51.8 Q ss_pred ECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEE-----ECCEEEEEEEEEE Q ss_conf 26888313134799950225669999863463454432997763026775236413600221-----2888999999997 Q gi|254780770|r 47 VTFNEPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVF-----PGDRLEYHVNKVR 121 (161) Q Consensus 47 v~~~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~-----PGd~L~i~~~i~~ 121 (161) +..|+.|-+.---..++..|.|. |+.++.++..+..+ .+.+ . --++||.+||. ||+++++..++.+ T Consensus 40 iH~D~~~Ak~~g~~~viahGmlt---~g~~~~~vt~w~g~--pg~v---~-~~~vrF~~pv~vp~~~~G~~v~~tg~V~~ 110 (142) T PRK13693 40 IHWDDEIAKVVGLDTAIAHGMLT---MGLGGGYVTSWVGD--PGAV---T-EYNVRFTAVVPVPNDGKGAELVFNGRVKS 110 (142) T ss_pred CCCCHHHHHHCCCCCCCHHHHHH---HHHHHHHHHHHCCC--CCEE---E-EEEEEECCEEEECCCCCCCEEEEEEEEEE T ss_conf 41399999864899824458889---99999999874599--8558---9-99989753388169898759999999999 Q ss_pred E--ECCEEEEEEEEEECCEEEE Q ss_conf 2--0777999999999999999 Q gi|254780770|r 122 N--RVDLWKFQCCAKVENTVVA 141 (161) Q Consensus 122 ~--~~~~~~~~~~~~v~~~~va 141 (161) . .++.+.+.+++.++++.+. T Consensus 111 ~d~~~~~v~l~l~~~~gg~~Vl 132 (142) T PRK13693 111 VDPESKSVTIALTATTGGKKIF 132 (142) T ss_pred ECCCCCEEEEEEEEEECCEEEE T ss_conf 7177888999999998999999 No 37 >TIGR02286 PaaD phenylacetic acid degradation protein PaaD; InterPro: IPR011973 This members of this family belong to the thioesterase superfamily IPR006683 from INTERPRO, and are nearly always found adjacent to other genes of the phenylacetic acid degradation pathway . Their function is currently unknown, but a role as thioesterases is suggested.. Probab=96.68 E-value=0.023 Score=34.51 Aligned_cols=99 Identities=20% Similarity=0.192 Sum_probs=70.0 Q ss_pred EEEEEECCCEEEEEEEECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEEC Q ss_conf 99998469749999982688831313479995022566999986346345443299776302677523641360022128 Q gi|254780770|r 31 KVVNIQRDESAIGIKNVTFNEPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPG 110 (161) Q Consensus 31 ~i~~~~~g~~~~g~k~v~~~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PG 110 (161) +|+++.|| .+....+|+. .-++||=--. |= +|=++|-++=+++. +...+ .. .++--...|.+|..+| T Consensus 7 ~i~~~~pg-~A~v~mtVr~--dmlNgHgtaH----GG-flF~LAD~AFA~aC--Ns~~~-~a--VA~~~~idflrP~~~G 73 (115) T TIGR02286 7 KILELGPG-EARVAMTVRE--DMLNGHGTAH----GG-FLFALADSAFAYAC--NSYGD-AA--VAAQCTIDFLRPARAG 73 (115) T ss_pred EEEECCCC-CEEEEEEEEE--ECCCCCCCCH----HH-HHHHHHHHHHHEEE--CCCCC-CE--EHHHCCEEEECCCCCC T ss_conf 89980899-4588878764--0004322011----55-88888776631011--48751-10--0100300010468999 Q ss_pred CEEEEEEEEEEEECCEEEEEEEEEE--CCEEEEE Q ss_conf 8899999999720777999999999--9999998 Q gi|254780770|r 111 DRLEYHVNKVRNRVDLWKFQCCAKV--ENTVVAE 142 (161) Q Consensus 111 d~L~i~~~i~~~~~~~~~~~~~~~v--~~~~va~ 142 (161) |+|+=+++.++..+..+.|+.++.. .+|+||. T Consensus 74 ~~L~A~A~ev~~~grtG~YDv~v~~G~~g~~vA~ 107 (115) T TIGR02286 74 EVLVAEAVEVSRSGRTGVYDVEVRNGQEGELVAL 107 (115) T ss_pred CEEEEEEEEEECCCCEEEEEEEEEECCCCEEEEE T ss_conf 8789987884148911058889984699769998 No 38 >cd03443 PaaI_thioesterase PaaI_thioesterase is a tetrameric acyl-CoA thioesterase with a hot dog fold and one of several proteins responsible for phenylacetic acid (PA) degradation in bacteria. Although orthologs of PaaI exist in archaea and eukaryotes, their function has not been determined. Sequence similarity between PaaI, E. coli medium chain acyl-CoA thioesterase II, and human thioesterase III suggests they all belong to the same thioesterase superfamily. The conserved fold present in these thioesterases is referred to as an asymmetric hot dog fold, similar to those of 4-hydroxybenzoyl-CoA thioesterase (4HBT) and the beta-hydroxydecanoyl-ACP dehydratases (FabA/FabZ). Probab=96.61 E-value=0.06 Score=31.98 Aligned_cols=108 Identities=19% Similarity=0.121 Sum_probs=64.2 Q ss_pred EEEEEECCCEEEEEEEECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEEC Q ss_conf 99998469749999982688831313479995022566999986346345443299776302677523641360022128 Q gi|254780770|r 31 KVVNIQRDESAIGIKNVTFNEPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPG 110 (161) Q Consensus 31 ~i~~~~~g~~~~g~k~v~~~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PG 110 (161) ++.++++| .+.....++. .+. ++ ..-+-=|++. +++..++.+....... .. ......-=+++|++|+.+ T Consensus 5 ~~~~~~~g-~~~~~~~~~~--~~~-n~--~g~~HGG~~~--~l~D~~~g~~~~~~~~-~~-~~~vt~~l~i~fl~p~~~- 73 (113) T cd03443 5 RVVEVGPG-RVVLRLPVRP--RHL-NP--GGIVHGGAIA--TLADTAGGLAALSALP-PG-ALAVTVDLNVNYLRPARG- 73 (113) T ss_pred EEEEEECC-EEEEEEECCH--HHC-CC--CCCEEHHHHH--HHHHHHHHHHHHHCCC-CC-CCEEEEEEEEEEECCCCC- T ss_conf 99999799-9999999898--995-99--9948078877--8887889999985156-65-020762157999757899- Q ss_pred CEEEEEEEEEEEECCEEEEEEEEEEC-CEEEEEEEEEEEE Q ss_conf 88999999997207779999999999-9999989999998 Q gi|254780770|r 111 DRLEYHVNKVRNRVDLWKFQCCAKVE-NTVVAEAEICAMV 149 (161) Q Consensus 111 d~L~i~~~i~~~~~~~~~~~~~~~v~-~~~va~a~l~~~i 149 (161) +.++.++++.+..+..+.++++++.+ |+++|+++-++++ T Consensus 74 ~~l~~~a~v~~~g~~~~~~~~~i~~~~g~~~A~~~~~~~~ 113 (113) T cd03443 74 GDLTARARVVKLGRRLAVVEVEVTDEDGKLVATARGTFAV 113 (113) T ss_pred CEEEEEEEEEEECCCEEEEEEEEEECCCCEEEEEEEEEEC T ss_conf 7699999999958889999999995999999999999989 No 39 >cd00586 4HBT 4-hydroxybenzoyl-CoA thioesterase (4HBT). Catalyzes the final step in the 4-chlorobenzoate degradation pathway in which 4-chlorobenzoate is converted to 4-hydroxybenzoate in certain soil-dwelling bacteria. 4HBT forms a homotetramer with four active sites. There is no evidence to suggest that 4HBT is related to the type I thioesterases functioning in primary or secondary metabolic pathways. Each subunit of the 4HBT tetramer adopts a so-called hot-dog fold similar to those of beta-hydroxydecanoyl-ACP dehydratase, (R)-specific enoyl-CoA hydratase, and type II, thioesterase (TEII). Probab=96.57 E-value=0.038 Score=33.19 Aligned_cols=61 Identities=18% Similarity=0.205 Sum_probs=50.6 Q ss_pred EEEEEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEEC-CEEEEEEEEEEEEEC Q ss_conf 0267752364136002212888999999997207779999999999-999998999999844 Q gi|254780770|r 91 APPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVE-NTVVAEAEICAMVMH 151 (161) Q Consensus 91 ~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~-~~~va~a~l~~~i~~ 151 (161) .......--.++|++|+..||++++++.+.+.......+..++..+ |+++|+|+...+.++ T Consensus 49 ~~~~~v~~~~~~y~~~~~~~d~i~i~~~i~~~~~~~~~~~~~~~~~~g~~~~~~~~~~v~~d 110 (110) T cd00586 49 GLGLVVVELEIDYLRPLRLGDRLTVETRVLRLGRKSFTFEQEIFREDGELLATAETVLVCVD 110 (110) T ss_pred CCCEEEEEEEEEECCCCCCCCEEEEEEEEEEECCEEEEEEEEEEECCCEEEEEEEEEEEEEC T ss_conf 95189787588985577899699999999980878999999999699859999999999989 No 40 >PRK13692 (3R)-hydroxyacyl-ACP dehydratase subunit HadA; Provisional Probab=96.34 E-value=0.09 Score=30.96 Aligned_cols=60 Identities=18% Similarity=0.138 Sum_probs=44.3 Q ss_pred EEEECCCCEEEEEEEECCEEEEEEEEEEEE--CC--EEEEEEEEEE-CCEEEEEEEEEEEEECCC Q ss_conf 775236413600221288899999999720--77--7999999999-999999899999984452 Q gi|254780770|r 94 YLMSIDKARFRKPVFPGDRLEYHVNKVRNR--VD--LWKFQCCAKV-ENTVVAEAEICAMVMHEK 153 (161) Q Consensus 94 ~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~--~~--~~~~~~~~~v-~~~~va~a~l~~~i~~~~ 153 (161) .+-+=.+..|++|+.+||+|+..+++...+ .+ +...+.++.. +|+.|+++.-+++.-..+ T Consensus 84 vvHgeQ~f~yhrPi~AGD~Lt~~~~I~~ir~~~G~d~v~~~~~vt~~~Ge~V~t~~sTLv~R~~~ 148 (159) T PRK13692 84 IVQVDQVLKFEKPIVAGDKLYCDVYVDSVREAHGTQIIVTKNIVTNEAGDVVQETYTTLAGRAGE 148 (159) T ss_pred EEECCCEEEEECCCCCCCEEEEEEEEEEEEECCCCEEEEEEEEEECCCCCEEEEEEEEEEEECCC T ss_conf 88647248998982489899999999986533991699999999989999999999999997677 No 41 >pfam03061 4HBT Thioesterase superfamily. This family contains a wide variety of enzymes, principally thioesterases. This family includes 4HBT (EC 3.1.2.23) which catalyses the final step in the biosynthesis of 4-hydroxybenzoate from 4-chlorobenzoate in the soil dwelling microbe Pseudomonas CBS-3. This family includes various cytosolic long-chain acyl-CoA thioester hydrolases. Long-chain acyl-CoA hydrolases hydrolyse palmitoyl-CoA to CoA and palmitate, they also catalyse the hydrolysis of other long chain fatty acyl-CoA thioesters. Probab=96.20 E-value=0.057 Score=32.14 Aligned_cols=45 Identities=18% Similarity=0.094 Sum_probs=39.0 Q ss_pred EEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEECCEE Q ss_conf 752364136002212888999999997207779999999999999 Q gi|254780770|r 95 LMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVENTV 139 (161) Q Consensus 95 l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~~~~ 139 (161) ....-++.|++|+.+||.+++++++.+..+....++++++..+.. T Consensus 32 ~t~~~~i~f~~p~~~gd~l~~~~~v~~~g~~~~~~~~~i~~~~~~ 76 (79) T pfam03061 32 VVVELNIDFLRPARLGDVLTVEARVVRLGRTSAVVEVEVRDEDGR 76 (79) T ss_pred EEEEEEEEEEEECCCCCEEEEEEEEEECCCCEEEEEEEEEECCCC T ss_conf 799999899996057988999999998687399999999958996 No 42 >COG5496 Predicted thioesterase [General function prediction only] Probab=96.10 E-value=0.12 Score=30.21 Aligned_cols=85 Identities=11% Similarity=-0.005 Sum_probs=59.5 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEE Q ss_conf 99998634634544329977630267752364136002212888999999997207779999999999999998999999 Q gi|254780770|r 69 ILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVENTVVAEAEICAM 148 (161) Q Consensus 69 liE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~~~~va~a~l~~~ 148 (161) |+-.|--+|--++.-+.+.....+ +.--+++-..++.||.++++.+++.+..+.-..|+.++..++..+.+|..+-. T Consensus 36 mi~~~E~a~~el~~~~Ld~g~ttV---G~ev~vrHla~~~~G~~V~i~~~l~~v~Gr~v~f~i~a~~~~~~Ig~g~h~R~ 112 (130) T COG5496 36 MIGFMENASYELLQPYLDNGETTV---GTEVLVRHLAATPPGLTVTIGARLEKVEGRKVKFRIIAMEGGDKIGEGTHTRV 112 (130) T ss_pred HHHHHHHHHHHHHHHHCCCCCCEE---EEEEEEEECCCCCCCCEEEEEEEEEEEECCEEEEEEEEEECCCEEEEEEEEEE T ss_conf 999999999999875276776305---58988644147899975999999999856388999999308858714579999 Q ss_pred EECCCCCC Q ss_conf 84452002 Q gi|254780770|r 149 VMHEKKEK 156 (161) Q Consensus 149 i~~~~~e~ 156 (161) +++.++=+ T Consensus 113 iv~~~kf~ 120 (130) T COG5496 113 IVPREKFR 120 (130) T ss_pred EECHHHHH T ss_conf 85699998 No 43 >cd03450 NodN NodN (nodulation factor N) contains a single hot dog fold similar to those of the peroxisomal Hydratase-Dehydrogenase-Epimerase (HDE) protein, and the fatty acid synthase beta subunit. Rhizobium and related species form nodules on the roots of their legume hosts, a symbiotic process that requires production of Nod factors, which are signal molecules involved in root hair deformation and meristematic cell division. The nodulation gene products, including NodN, are involved in producing the Nod factors, however the role played by NodN is unclear. Probab=95.45 E-value=0.22 Score=28.63 Aligned_cols=86 Identities=17% Similarity=0.191 Sum_probs=53.4 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEE---ECC--EEEEEEEEE Q ss_conf 995022566999986346345443299776302677523641360022128889999999972---077--799999999 Q gi|254780770|r 60 GRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRN---RVD--LWKFQCCAK 134 (161) Q Consensus 60 ~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~---~~~--~~~~~~~~~ 134 (161) +.+|-+|-|.+-.+..+.. . .......+.+.=-+.+++||..||..|++++.++++.+. +.+ ....++++. T Consensus 55 g~tIAHGfltLSl~~~~~~---~-~~~~~~~~~~vNYG~dkvRF~~PV~vGs~vR~~~~l~~v~~~~~g~~~~~~~~tvE 130 (149) T cd03450 55 GGTIAHGFLTLSLLPALTP---Q-LFRVEGVKMGVNYGLDKVRFPAPVPVGSRVRGRFTLLSVEELKGGGVQVTLEVTVE 130 (149) T ss_pred CCCCCCHHHHHHHHHHHHH---H-CCCCCCCEEEEECCCCCEEECCCCCCCCEEEEEEEEEEEEECCCCEEEEEEEEEEE T ss_conf 8734650669999998875---2-66767843333003352670687567999999999999899189849999999999 Q ss_pred EC--CEEEEEEEEEEEE Q ss_conf 99--9999989999998 Q gi|254780770|r 135 VE--NTVVAEAEICAMV 149 (161) Q Consensus 135 v~--~~~va~a~l~~~i 149 (161) +. ++.+|-|++.... T Consensus 131 ieg~~kPa~vAe~l~~~ 147 (149) T cd03450 131 IEGEDKPACVAEWISRL 147 (149) T ss_pred ECCCCCCEEEEEEEEEH T ss_conf 95898878999885501 No 44 >PRK13691 (3R)-hydroxyacyl-ACP dehydratase subunit HadC; Provisional Probab=95.39 E-value=0.23 Score=28.49 Aligned_cols=58 Identities=21% Similarity=0.191 Sum_probs=41.9 Q ss_pred EEECCCCEEEEEEEECCEEEEEEEEEEEE--CC--EEEEEEEEE-ECCEEEEEEEEEEEEECC Q ss_conf 75236413600221288899999999720--77--799999999-999999989999998445 Q gi|254780770|r 95 LMSIDKARFRKPVFPGDRLEYHVNKVRNR--VD--LWKFQCCAK-VENTVVAEAEICAMVMHE 152 (161) Q Consensus 95 l~~i~~~kF~~~V~PGd~L~i~~~i~~~~--~~--~~~~~~~~~-v~~~~va~a~l~~~i~~~ 152 (161) +-+=.+++|+||+.+||+|++.+++.+.+ .| +...+..+. .+|+.|+++.-+++--.. T Consensus 85 VHgdQ~F~yhRPI~AGD~L~~~~~v~svr~~~G~diittr~~vtd~~Ge~V~t~~tTLv~R~g 147 (166) T PRK13691 85 VQVDQRFVFHKPVLAGDKLWARMDIHSVDERFGADIVVTRNVCTNDDGEVVMEAYTTLMGQQG 147 (166) T ss_pred EECCCEEEEECCCCCCCEEEEEEEEEEEEECCCCEEEEEEEEEECCCCCEEEEEEEEEEEECC T ss_conf 551306999887016999999999998310089318999889988999999999999998637 No 45 >smart00826 PKS_DH PKS_DH. Probab=95.15 E-value=0.27 Score=28.04 Aligned_cols=99 Identities=19% Similarity=0.311 Sum_probs=56.5 Q ss_pred EEEEEEECCCC-HHHCCC-CCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEE-EEECCE-EEEE Q ss_conf 99999826888-313134-7999502256699998634634544329977630267752364136002-212888-9999 Q gi|254780770|r 41 AIGIKNVTFNE-PHFMGH-FPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKP-VFPGDR-LEYH 116 (161) Q Consensus 41 ~~g~k~v~~~e-~ff~gH-Fp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~-V~PGd~-L~i~ 116 (161) ..-...++.++ ||+++| ..|.+++||+-.+|+...++..+. .. .. ..++++.|.+| ++|++. .++. T Consensus 16 ~~w~~~ls~~~~p~L~DH~v~G~~v~Pga~ylemal~Aa~~~~---~~----~~---~~l~dv~f~~pL~l~~~~~~~v~ 85 (167) T smart00826 16 VVFTGRLSLRTHPWLADHRVGGTVVLPGAAYVEMALAAADEVG---CG----AA---VRLEELTLEAPLVLPEDGAVEVQ 85 (167) T ss_pred EEEEEEECCCCCCCCCCCEECCEEECCHHHHHHHHHHHHHHHC---CC----CC---EEEEEEEEECCCCCCCCCCEEEE T ss_conf 7999997788896002778588775246999999999999865---99----86---69998698158574899878999 Q ss_pred EEEEEEE-CCEEEEEEEEEECC----EEEEEEEEEEEE Q ss_conf 9999720-77799999999999----999989999998 Q gi|254780770|r 117 VNKVRNR-VDLWKFQCCAKVEN----TVVAEAEICAMV 149 (161) Q Consensus 117 ~~i~~~~-~~~~~~~~~~~v~~----~~va~a~l~~~i 149 (161) +.+.... ++...|+.....++ .+.|+|.+...- T Consensus 86 ~~l~~~~~~~~~~f~I~S~~~~~~~w~~h~~G~v~~~~ 123 (167) T smart00826 86 VVVGAPDESGRRTFEVYSRPDGDGPWTRHATGTLRPAA 123 (167) T ss_pred EEEECCCCCCEEEEEEEEECCCCCCCEEEEEEEEEECC T ss_conf 99933689987999999867999981899999998467 No 46 >PRK11688 hypothetical protein; Provisional Probab=94.66 E-value=0.37 Score=27.26 Aligned_cols=129 Identities=16% Similarity=0.145 Sum_probs=70.6 Q ss_pred HHHHHCCCCCCEE--EEEEEEEEECCCEEEEEEEECCCCHHHCCCCCCCCCCC-HHH--HHHHHHHHHHHHHHHCCCCC- Q ss_conf 8998479997505--47899998469749999982688831313479995022-566--99998634634544329977- Q gi|254780770|r 15 ELMRFLPHRYPFL--LVDKVVNIQRDESAIGIKNVTFNEPHFMGHFPGRPVMP-GVL--ILEGMAQTAGAICAIHNGFD- 88 (161) Q Consensus 15 eI~~~lPhr~Pfl--~ID~i~~~~~g~~~~g~k~v~~~e~ff~gHFp~~PvmP-GvL--liE~maQ~~~~l~~~~~~~~- 88 (161) -+.++.-++=||. +==++.++++| +++....++ +++. |++. .+.+= |++ ++++.+=.+++......... T Consensus 12 ~~~~~F~~~iPf~~~LGl~v~~~~~g-~~~l~lp~r--~eli-gn~~-~g~lHGGVi~allD~a~g~Aa~~~~~~~~~~~ 86 (154) T PRK11688 12 LVGEIFVYHMPFNRLLGLKLERYEPE-FVELRFKMQ--PELV-GNIA-QSILHGGVIASVLDVAGGLVAVGGILARHEDI 86 (154) T ss_pred HHHHHHHHHCCHHHHCCCEEEEEECC-EEEEEEECC--HHHC-CCCC-CCEEEHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 99999887399889729899998599-799998688--8983-8987-77786788999999988999997403204775 Q ss_pred --CCEEEEE---EECC-CCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEE-CCEEEEEEEEEEEE Q ss_conf --6302677---5236-413600221288899999999720777999999999-99999989999998 Q gi|254780770|r 89 --QYAPPYL---MSID-KARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKV-ENTVVAEAEICAMV 149 (161) Q Consensus 89 --~~~~~~l---~~i~-~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v-~~~~va~a~l~~~i 149 (161) +.....+ +.++ ++.|.+|.. |+.+..++++.+..+.++.++++++. +++++|.|.=++++ T Consensus 87 ~~~~~~~~~~~~aTidl~v~yLrpa~-g~~l~a~a~v~r~Gr~va~~~~~~~~~~g~lvA~a~~tf~v 153 (154) T PRK11688 87 SEEELQQRLSRLGTIDLRVDYLRPGR-GERFTATASVLRAGNKVAVARMELHNEQGTHIASGTATYMV 153 (154) T ss_pred CHHHHHHCCCCCCEEEEEEEEECCCC-CCEEEEEEEEEECCCCEEEEEEEEECCCCCEEEEEEEEEEE T ss_conf 44565420355304786631104689-98089999999717979999999998999889999999963 No 47 >TIGR02799 thio_ybgC tol-pal system-associated acyl-CoA thioesterase; InterPro: IPR014166 The tol-pal system consists of five critical genes. Inner membrane proteins TolQ and TolR convert proton motive force to energy that is transduced through TolA to an outer membrane complex of TolB and Pal. The system is known to be required to maintain outer membrane integrity. In a system with several homologous parts, ExbB and ExbD transduces energy through TonB to a variety of outer membrane proteins, many of which are siderophore receptors. The tol-pal system therefore may also be involved in transport. This family consists of a protein nearly always found in operons with the genes of the tol-pal system. The significance of this thioesterase to the tol-pal system is unclear, but either of two observations may be relevant. First, Pal, or peptidoglycan-associated lipoprotein, has a conserved N-terminal cleavage and acylation that makes it a lipoprotein. Second, the tol-pal system is implicated not only in the import of certain organics but also in the maintenance of outer membrane integrity (by an unknown mechanism) .. Probab=94.62 E-value=0.38 Score=27.20 Aligned_cols=65 Identities=14% Similarity=0.120 Sum_probs=54.9 Q ss_pred CEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEE--ECCEEEEEEEEEEEEECCCC Q ss_conf 302677523641360022128889999999972077799999999--99999998999999844520 Q gi|254780770|r 90 YAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAK--VENTVVAEAEICAMVMHEKK 154 (161) Q Consensus 90 ~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~--v~~~~va~a~l~~~i~~~~~ 154 (161) ..+.|...--.+.|++|+.=.|.|++++++.+.++--..|.=+++ .|+++.|+|+...+-++..+ T Consensus 48 ~g~~FVVr~~~~dy~~PArlDD~L~V~t~~~~~~~Asl~f~Q~v~~~Rg~~~L~~A~V~~aCv~~~~ 114 (127) T TIGR02799 48 TGLVFVVRSMELDYLKPARLDDLLTVTTRVVELKGASLVFAQEVRNKRGDTLLVEATVEVACVDASD 114 (127) T ss_pred CCCEEEEEEEEEEECCCCCCCCEEEEEEEEEEECCEEEEEEEEEEECCCCEEEEEEEEEEEEEECCC T ss_conf 9949999898751058735486158999998644747999999996789888888879999972689 No 48 >PRK10526 acyl-CoA thioesterase II; Provisional Probab=94.51 E-value=0.4 Score=27.04 Aligned_cols=49 Identities=12% Similarity=0.067 Sum_probs=23.4 Q ss_pred EEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEEEE Q ss_conf 3600221288899999999720777999999999999999899999984 Q gi|254780770|r 102 RFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVENTVVAEAEICAMVM 150 (161) Q Consensus 102 kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~~~~va~a~l~~~i~ 150 (161) -|.++.-|+.-++++|+.++..++++.=..+++-+|+.+..+..+|... T Consensus 63 YFlr~G~~~~Pi~y~Ve~lrdGrSF~tR~V~A~Q~g~~if~~~~Sf~~~ 111 (286) T PRK10526 63 YFLRPGDSKKPIIYDVETLRDGNSFSARRVAAIQNGKPIFYMTASFQAP 111 (286) T ss_pred ECCCCCCCCCCEEEEEEEEECCCCEEEEEEEEEECCCEEEEEEEEEECC T ss_conf 5168889999989997887459864702689998895589999875126 No 49 >TIGR02278 PaaN-DH phenylacetic acid degradation protein paaN; InterPro: IPR011966 This enzyme is proposed to act in the ring-opening step of phenylacetic acid degradation which follows ligation of the acid with coenzyme A (by PaaF) and hydroxylation by a multicomponent non-heme iron hydroxylase complex (PaaGHIJK). Gene symbols have been standardised in . This enzyme is related to aldehyde dehydrogenases.. Probab=94.32 E-value=0.11 Score=30.50 Aligned_cols=104 Identities=20% Similarity=0.309 Sum_probs=63.6 Q ss_pred EEEECCCEEE-EEEEECCCCHH-H----CCCCC---------CCCCCCH-----HHHHHHHHHHHHHHHHHCCCCCCCEE Q ss_conf 9984697499-99982688831-3----13479---------9950225-----66999986346345443299776302 Q gi|254780770|r 33 VNIQRDESAI-GIKNVTFNEPH-F----MGHFP---------GRPVMPG-----VLILEGMAQTAGAICAIHNGFDQYAP 92 (161) Q Consensus 33 ~~~~~g~~~~-g~k~v~~~e~f-f----~gHFp---------~~PvmPG-----vLliE~maQ~~~~l~~~~~~~~~~~~ 92 (161) .+++.|+++. ..+.||.+|-. | -+||. .+|++|+ -|++ +=+||.++ +..-| T Consensus 547 ~~L~~G~~l~t~~R~vt~~dI~~FA~~sGD~FYAH~De~AA~~~P~Fg~RVaHGYlvl---S~AAGLFV----dp~pG-- 617 (690) T TIGR02278 547 EELEIGDSLATDRRTVTLADIVEFAELSGDTFYAHMDEIAAKESPFFGKRVAHGYLVL---SAAAGLFV----DPAPG-- 617 (690) T ss_pred HHCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCCCHHHHHHH---HHHHCCCC----CCCCC-- T ss_conf 1357875113788742789999998605884200168999843867587202688899---99860532----68898-- Q ss_pred EEEE--ECCCCEEEEEEEECCEEEEEEEEEEEE------CC-----EEEEEEEEEE-CCEEEEEEEE Q ss_conf 6775--236413600221288899999999720------77-----7999999999-9999998999 Q gi|254780770|r 93 PYLM--SIDKARFRKPVFPGDRLEYHVNKVRNR------VD-----LWKFQCCAKV-ENTVVAEAEI 145 (161) Q Consensus 93 ~~l~--~i~~~kF~~~V~PGd~L~i~~~i~~~~------~~-----~~~~~~~~~v-~~~~va~a~l 145 (161) ++|+ +.++.||..||.|||+|.++-++++.. .+ .+.-.+++++ +++.||.=.+ T Consensus 618 PvLANYGLe~LRFl~PV~~dDsi~V~LT~Krk~p~~~~~~gekPtG~V~W~~ev~NQ~~~pVA~Y~v 684 (690) T TIGR02278 618 PVLANYGLENLRFLEPVKPDDSIQVRLTVKRKTPRKEKEYGEKPTGVVEWDVEVVNQNGEPVASYDV 684 (690) T ss_pred CEEEECCCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCEEEEEEEEECCCCCEEEEEEH T ss_conf 5473136776630356799884589984220377778666778874478878997469835776525 No 50 >PRK10254 hypothetical protein; Provisional Probab=94.12 E-value=0.49 Score=26.53 Aligned_cols=109 Identities=13% Similarity=0.058 Sum_probs=73.4 Q ss_pred EEEEEEEEECCCEEEEEEEECCCCHHHCCCCCCCCCCC-H--HHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEE Q ss_conf 47899998469749999982688831313479995022-5--66999986346345443299776302677523641360 Q gi|254780770|r 28 LVDKVVNIQRDESAIGIKNVTFNEPHFMGHFPGRPVMP-G--VLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFR 104 (161) Q Consensus 28 ~ID~i~~~~~g~~~~g~k~v~~~e~ff~gHFp~~PvmP-G--vLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~ 104 (161) |==+++++.++ ++++...|+...- =| +-++= | +.|.|+++=.++.+.. . ++.. ..+---|+.|. T Consensus 24 LGI~~t~~~~~-~v~a~mpV~~~~~-----QP-~G~LHGGa~~aLAEt~~s~aa~~~~---~--~g~~-~vgiein~nhl 90 (137) T PRK10254 24 LGIVYTRLGDD-VLEAEMPVDTRTH-----QP-FGLLHGGASAALAETLGSMAGFMMT---R--DGQC-VVGTELNATHH 90 (137) T ss_pred CCEEEEEECCC-EEEEEEEECCCCC-----CC-CCEEECHHHHHHHHHHHHHHHHEEC---C--CCCE-EEEEEEEEEEE T ss_conf 69099995399-6999998570015-----86-4375001355899999877500232---8--9946-99999720175 Q ss_pred EEEEECCEEEEEEEEEEEECCEEEEEEEEEEC-CEEEEEEEEEEEEE Q ss_conf 02212888999999997207779999999999-99999899999984 Q gi|254780770|r 105 KPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVE-NTVVAEAEICAMVM 150 (161) Q Consensus 105 ~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~-~~~va~a~l~~~i~ 150 (161) ++|.-|. ++-+++.....+.....+.+++.+ |+++|.+.++.++. T Consensus 91 r~v~~G~-v~a~A~~ih~GrttqvW~i~i~de~grl~~~~rlT~~Vl 136 (137) T PRK10254 91 RPVSEGK-VRGVCQPLHLGRQNQSWEIVVFDEQGRRCCTCRLGTAVL 136 (137) T ss_pred ECCCCCE-EEEEEEEEECCCCEEEEEEEEECCCCCEEEEEEEEEEEE T ss_conf 1045877-999999985365359999999988999998668887662 No 51 >cd03445 Thioesterase_II_repeat2 Thioesterase II (TEII) is thought to regenerate misprimed nonribosomal peptide synthetases (NRPSs) as well as modular polyketide synthases (PKSs) by hydrolyzing acetyl groups bound to the peptidyl carrier protein (PCP) and acyl carrier protein (ACP) domains, respectively. TEII has two tandem asymmetric hot dog folds that are structurally similar to one found in PaaI thioesterase, 4-hydroxybenzoyl-CoA thioesterase (4HBT) and beta-hydroxydecanoyl-ACP dehydratase and thus, the TEII monomer is equivalent to the homodimeric form of the latter three enzymes. Human TEII is expressed in T cells and has been shown to bind the product of the HIV-1 Nef gene. Probab=93.02 E-value=0.75 Score=25.40 Aligned_cols=77 Identities=19% Similarity=0.111 Sum_probs=54.2 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEECCEEEEEEEE Q ss_conf 56699998634634544329977630267752364136002212888999999997207779999999999999998999 Q gi|254780770|r 66 GVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVENTVVAEAEI 145 (161) Q Consensus 66 GvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~~~~va~a~l 145 (161) ++..-..+||+..+... .-+. +.. ...-.+-|.++..|+.-++++++..+..+++..-..++.-+|+++..+.. T Consensus 17 ~~fGG~~~Aqa~~AA~~-tv~~--~~~---~~S~h~~F~~~g~~~~pi~~~V~~lr~Grs~~tr~V~a~Q~g~~~~~~~~ 90 (94) T cd03445 17 GVFGGQVLAQALVAAAR-TVPD--DRV---PHSLHSYFLRPGDPDQPIEYEVERLRDGRSFATRRVRAVQNGKVIFTATA 90 (94) T ss_pred CCHHHHHHHHHHHHHHH-HCCC--CCC---CEEEEEEEECCCCCCCCEEEEEEEEECCCCEEEEEEEEEECCEEEEEEEE T ss_conf 24427999999999995-3899--876---35899998048889999999999986798679899999989999999999 Q ss_pred EEE Q ss_conf 999 Q gi|254780770|r 146 CAM 148 (161) Q Consensus 146 ~~~ 148 (161) +|. T Consensus 91 sF~ 93 (94) T cd03445 91 SFQ 93 (94) T ss_pred EEE T ss_conf 970 No 52 >COG1946 TesB Acyl-CoA thioesterase [Lipid metabolism] Probab=92.68 E-value=0.37 Score=27.26 Aligned_cols=53 Identities=15% Similarity=0.077 Sum_probs=36.4 Q ss_pred EEEEECCC-CEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEE-ECCEEEEEEEE Q ss_conf 67752364-1360022128889999999972077799999999-99999998999 Q gi|254780770|r 93 PYLMSIDK-ARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAK-VENTVVAEAEI 145 (161) Q Consensus 93 ~~l~~i~~-~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~-v~~~~va~a~l 145 (161) ...+++++ +.|++|+.-+|.|.+.++....+++.+.++|+.+ .+|+++|...= T Consensus 224 ~~~aSLDHs~wFhrp~~~ddWlLy~~~sp~A~~~rgl~~G~lf~r~G~LiA~~~Q 278 (289) T COG1946 224 IQVASLDHSMWFHRPFRLDDWLLYAQESPSASGGRGLVRGQLFDRDGQLIASVVQ 278 (289) T ss_pred CEEEECCCEEEEECCCCCCCEEEEEEECCCCCCCCCEEEEEEECCCCCEEEEEEE T ss_conf 2476530258982456689779997218864577542125798678878999733 No 53 >cd00556 Thioesterase_II Thioesterase II (TEII) is thought to regenerate misprimed nonribosomal peptide synthetases (NRPSs) as well as modular polyketide synthases (PKSs) by hydrolyzing acetyl groups bound to the peptidyl carrier protein (PCP) and acyl carrier protein (ACP) domains, respectively. TEII has two tandem asymmetric hot dog folds that are structurally similar to one found in PaaI thioesterase, 4-hydroxybenzoyl-CoA thioesterase (4HBT) and beta-hydroxydecanoyl-ACP dehydratase and thus, the TEII monomer is equivalent to the homodimeric form of the latter three enzymes. Human TEII is expressed in T cells and has been shown to bind the product of the HIV-1 Nef gene. Probab=92.22 E-value=0.96 Score=24.77 Aligned_cols=50 Identities=20% Similarity=0.167 Sum_probs=44.1 Q ss_pred CCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEEC-CEEEEEEEEEEE Q ss_conf 64136002212888999999997207779999999999-999998999999 Q gi|254780770|r 99 DKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVE-NTVVAEAEICAM 148 (161) Q Consensus 99 ~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~-~~~va~a~l~~~ 148 (161) -.+.|+++..+++.+.++++..+..++.+...++++-+ |+++++....+. T Consensus 48 ~~~~F~~~~~~~~w~~~~v~~~~~g~~~~~~~~~~~q~~G~lvas~~q~~~ 98 (99) T cd00556 48 HHIYFHRPGDADEWLLYEVESLRDGRSRALRRGRAYQRDGKLVASATQSFL 98 (99) T ss_pred EEEEECCCCCCCCCEEEEEECCCCCCCCEEEEEEEECCCCCEEEEEEEEEC T ss_conf 457865899999889999981402079518999999489979999981354 No 54 >PRK10293 hypothetical protein; Provisional Probab=91.96 E-value=1 Score=24.58 Aligned_cols=108 Identities=12% Similarity=0.127 Sum_probs=72.3 Q ss_pred EEEEEEEEECCCEEEEEEEECCCCHHHCCCCCCCCCCCH---HHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEC-CCCEE Q ss_conf 478999984697499999826888313134799950225---66999986346345443299776302677523-64136 Q gi|254780770|r 28 LVDKVVNIQRDESAIGIKNVTFNEPHFMGHFPGRPVMPG---VLILEGMAQTAGAICAIHNGFDQYAPPYLMSI-DKARF 103 (161) Q Consensus 28 ~ID~i~~~~~g~~~~g~k~v~~~e~ff~gHFp~~PvmPG---vLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i-~~~kF 103 (161) |==+++++.++ ++++...|+... ..-+-++=| +.+.|+.+=.|+.+. .+.+ . . ..++ -|+.| T Consensus 24 LGI~~~e~~~d-~v~a~mpV~~~~------~QP~G~LHGGa~~aLAEt~gS~aa~~~---~~~~--~-~-~vGiein~nh 89 (136) T PRK10293 24 LDIRFEHIGDD-TLEATMPVDSRT------KQPFGLLHGGASVVLAESIGSVAGYLC---TEGE--Q-K-VVGLEINANH 89 (136) T ss_pred CCEEEEEECCC-EEEEEEEECHHH------CCCCCEECCCEEEHHHHHHHHHHHHHC---CCCC--C-E-EEEEEEECCE T ss_conf 59099996499-799999957221------497335100110036888776677622---5888--4-7-9999987223 Q ss_pred EEEEEECCEEEEEEEEEEEECCEEEEEEEEEE-CCEEEEEEEEEEEEE Q ss_conf 00221288899999999720777999999999-999999899999984 Q gi|254780770|r 104 RKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKV-ENTVVAEAEICAMVM 150 (161) Q Consensus 104 ~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v-~~~~va~a~l~~~i~ 150 (161) .++|.-|. ++-.++.....+..-..+.+++. +|+++|.+.++.++. T Consensus 90 lrsv~~G~-v~a~A~p~h~GrttqvW~i~I~de~g~l~a~~R~T~~il 136 (136) T PRK10293 90 VRSAREGR-VRGVCKPLHLGSRHQVWQIEIFDEKGRLCCSSRLTTAIL 136 (136) T ss_pred EEECCCCE-EEEEEEEEECCCCEEEEEEEEECCCCCEEEEEEEEEEEC T ss_conf 10024767-999999976355459999999989999999999999969 No 55 >TIGR00051 TIGR00051 conserved hypothetical protein; InterPro: IPR006684 4-hydroxybenzoyl-CoA thioesterase (3.1.2.23 from EC) is an enzyme from Pseudomonas CBS-3, a soil-dwelling microbe that survives on 4-chlorobenzoate as its sole carbon source. This enzyme catalyzes the last of the three steps in the pathway from 4-chlorobenzoate to hydroxybenzoate, the cleavage of the thioester bond of 4-hydroxybenzoyl-CoA to form hydroxybenzoate.4-hydroxybenzoyl-CoA + H_2O = 4-hydroxybenzoate + CoA 4-hydroxybenzoyl-CoA thioesterase is a protein of 141 amino-acid residues that assemble as an homotetramer. An aspartate in the N-terminal domain is thought to participate in the catalytic mechanism. This enzyme is evolutionary related to a number of uncharacterised proteins.; GO: 0016787 hydrolase activity. Probab=90.51 E-value=1.4 Score=23.71 Aligned_cols=65 Identities=17% Similarity=0.050 Sum_probs=51.0 Q ss_pred EEEEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEE--EECCCCCC Q ss_conf 267752364136002212888999999997207779999999999999998999999--84452002 Q gi|254780770|r 92 PPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVENTVVAEAEICAM--VMHEKKEK 156 (161) Q Consensus 92 ~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~~~~va~a~l~~~--i~~~~~e~ 156 (161) +.+..---++.|++|+.=||.|+|+.++.+.++-...++-.++.+++..++++.... -+++++-| T Consensus 50 ~~~vV~~~~~~Y~~Pa~ldd~L~i~t~~~~l~g~~~~f~q~i~~~~~~l~~~~~~~~fvC~d~~~~K 116 (121) T TIGR00051 50 VAFVVVKINIEYKKPARLDDVLEIRTQIEELNGFSVVFSQEIFREDEALLKAATKIHFVCVDLKKFK 116 (121) T ss_pred EEEEEEEEEEEEECCCCCCCEEEEEEEEEECCCEEEEEEEEEECCCHHEECCCEEEEEEEEECCCCC T ss_conf 0489985564210365348578998888750564789999998061110014553799997524375 No 56 >KOG3016 consensus Probab=90.26 E-value=1.5 Score=23.58 Aligned_cols=84 Identities=13% Similarity=0.108 Sum_probs=56.5 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEEC Q ss_conf 47999502256699998634634544329977630267752364136002212888999999997207779999999999 Q gi|254780770|r 57 HFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVE 136 (161) Q Consensus 57 HFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~ 136 (161) -+|.++++=|.+.-.+++-+ ..-...++.-.. =..-|-+..-|...+.+.++.++..++++.-.++|.-+ T Consensus 34 ~~~~~~~fGG~i~sQaLaAA---~~TV~e~f~p~S-------lH~YFI~~gd~~~pI~Y~V~rirdGr~F~~R~V~AvQ~ 103 (294) T KOG3016 34 EIPSNHAYGGQIASQALAAA---SKTVEEMFIPHS-------LHCYFILVGDPNIPIIYDVKRIRDGRNFATRSVDAVQK 103 (294) T ss_pred CCCCCCCCCCEEHHHHHHHH---HHCCCCCCCCCE-------EEEEEEECCCCCCCEEEEEEEECCCCEEEEEEEEEEEC T ss_conf 47887530414459999998---702663525224-------55343215887885698836612786058999887887 Q ss_pred CEEEEEEEEEEEEE Q ss_conf 99999899999984 Q gi|254780770|r 137 NTVVAEAEICAMVM 150 (161) Q Consensus 137 ~~~va~a~l~~~i~ 150 (161) ||++..+.++|-.. T Consensus 104 ~k~If~~qiSF~~~ 117 (294) T KOG3016 104 GKTIFTLQISFQQS 117 (294) T ss_pred CEEEEEEEEEECCC T ss_conf 82899999997246 No 57 >COG4109 Predicted transcriptional regulator containing CBS domains [Transcription] Probab=90.04 E-value=0.63 Score=25.88 Aligned_cols=51 Identities=20% Similarity=0.142 Sum_probs=45.3 Q ss_pred CCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEECCEEEEEEEEEEEEE Q ss_conf 413600221288899999999720777999999999999999899999984 Q gi|254780770|r 100 KARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVENTVVAEAEICAMVM 150 (161) Q Consensus 100 ~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~~~~va~a~l~~~i~ 150 (161) +.-|.+||.-.+.|++..++....+.+++.+++.+.++.++++|-+++-+. T Consensus 381 ~iyflk~vqid~~l~I~prIl~~gR~~a~idvei~~~~~ivaKAiv~~ql~ 431 (432) T COG4109 381 TIYFLKPVQIDSVLEIYPRILEEGRKFAKIDVEIYHDGQIVAKAIVTVQLN 431 (432) T ss_pred EEEEECCEECCCEEEEEEEEECCCCCCCEEEEEEEECCCHHHHHEEEEECC T ss_conf 330652230244899841020015421146799960761301100244405 No 58 >KOG1206 consensus Probab=88.85 E-value=0.052 Score=32.37 Aligned_cols=82 Identities=26% Similarity=0.395 Sum_probs=43.6 Q ss_pred CCCCCCEEEEEEEEEEECCCEEEEEEEECCCCHHH--------CCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCE Q ss_conf 79997505478999984697499999826888313--------1347999502256699998634634544329977630 Q gi|254780770|r 20 LPHRYPFLLVDKVVNIQRDESAIGIKNVTFNEPHF--------MGHFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYA 91 (161) Q Consensus 20 lPhr~Pfl~ID~i~~~~~g~~~~g~k~v~~~e~ff--------~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~ 91 (161) .|||.|-.-+.+.++-| ..+.+.++.|-..+ ..-|+ .|++-|.+..-..+-. +.. ..+ T Consensus 147 ~p~r~pd~~v~~~ts~D----qaAlyrlsgD~NPLHiDPe~A~~agFe-tpilHGlc~lg~~~ri---v~a-~~~----- 212 (272) T KOG1206 147 VPHRDPDAVVERFTSED----QAALYRLSGDHNPLHIDPESALEAGFE-TPILHGLCTLGFSARI---VGA-QFP----- 212 (272) T ss_pred CCCCCCCHHEEECCHHH----HHHHHHHCCCCCCCCCCHHHHHHCCCC-CCHHHHHHHHHHHHHH---HHH-HCC----- T ss_conf 88768503045405464----999998538988533587788754897-7124667775434889---987-567----- Q ss_pred EEEEEECCCCEEEEEEEECCEEEEE Q ss_conf 2677523641360022128889999 Q gi|254780770|r 92 PPYLMSIDKARFRKPVFPGDRLEYH 116 (161) Q Consensus 92 ~~~l~~i~~~kF~~~V~PGd~L~i~ 116 (161) +.+-...++||.+||.|||+|... T Consensus 213 -~a~y~~~kvrF~spV~pGdtll~~ 236 (272) T KOG1206 213 -PAVYKAQKVRFSSPVGPGDTLLVL 236 (272) T ss_pred -CHHHHEEEEEECCCCCCCHHHHHH T ss_conf -022000255413787786008999 No 59 >pfam02551 Acyl_CoA_thio Acyl-CoA thioesterase. This family represents the thioesterase II domain. Two copies of this domain are found in a number of acyl-CoA thioesterases. Probab=86.45 E-value=2.7 Score=22.06 Aligned_cols=114 Identities=15% Similarity=0.144 Sum_probs=64.9 Q ss_pred EEEEEEEEECCCE---EEEEEEECCCCHHH--CCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCC- Q ss_conf 4789999846974---99999826888313--13479995022566999986346345443299776302677523641- Q gi|254780770|r 28 LVDKVVNIQRDES---AIGIKNVTFNEPHF--MGHFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKA- 101 (161) Q Consensus 28 ~ID~i~~~~~g~~---~~g~k~v~~~e~ff--~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~- 101 (161) +.||-+++.|.+. ..+. ......-|+ .+..|++|.+--.++.=+ --..-+................++++.+ T Consensus 7 ~~erpieiRp~~~~~~~~~~-~~~~~~~W~R~~~~lpdD~~lh~a~LAY~-SD~~ll~ta~~~~~~~~~~~~~ASLDHa~ 84 (133) T pfam02551 7 RGEYPVAVRPGELRRTFGGQ-VVALQQSWVAALGTVPDDPLLHSCALAYL-SDLTLLLTALYGNLFLYDGIFVASLDHSV 84 (133) T ss_pred CCCCCEEEEECCCCCCCCCC-CCCCCCEEEEECCCCCCCHHHHHHHHHHH-HHHHHHHHHCCCCCCCCCCCEEEECHHHE T ss_conf 56688178665887888988-89701288988998998999999999999-88779876407777577874575413100 Q ss_pred EEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEE--CCEEEEEE Q ss_conf 3600221288899999999720777999999999--99999989 Q gi|254780770|r 102 RFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKV--ENTVVAEA 143 (161) Q Consensus 102 kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v--~~~~va~a 143 (161) =|++++.+.|.|.++.+--.-.++-+...|+++. +|++||+. T Consensus 85 WFH~p~~~ddWlLy~~~sp~a~~~Rgl~~G~~f~tr~G~Lvas~ 128 (133) T pfam02551 85 YFHRPGDLNKWILYDVESPSASGGRGLRQGRNFSTQSGKLIASV 128 (133) T ss_pred ECCCCCCCCCEEEEEEECCCCCCCCCEEEEEEEECCCCCEEEEE T ss_conf 21389889956999888965278876687678866998699998 No 60 >pfam03756 AfsA A-factor biosynthesis repeat. The AfsA family are key enzymes in A-factor biosynthesis, which is essential for streptomycin production and resistance. Probab=81.62 E-value=3.9 Score=21.10 Aligned_cols=43 Identities=28% Similarity=0.389 Sum_probs=32.7 Q ss_pred EEEEEEEECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHC Q ss_conf 499999826888313134799950225669999863463454432 Q gi|254780770|r 40 SAIGIKNVTFNEPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIH 84 (161) Q Consensus 40 ~~~g~k~v~~~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~ 84 (161) .......+..+.+||-.| +++ -.||.|++|++-|++.++.... T Consensus 21 ~~~~~~~~p~~H~~~~~h-~~~-h~~~mll~Ea~RQ~~~~~~H~~ 63 (74) T pfam03756 21 RFRWTLRVPRDHPFLFDH-PSD-HVPGMLLLEAARQAAIALAHAA 63 (74) T ss_pred EEEEEEECCCCCCCCCCC-CCC-CCCHHHHHHHHHHHHHHHHHHH T ss_conf 479999838988640589-887-7471569999999999987762 No 61 >KOG2763 consensus Probab=80.54 E-value=4.8 Score=20.58 Aligned_cols=72 Identities=19% Similarity=0.250 Sum_probs=48.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEECC--EEEEEEEEEECCEE Q ss_conf 022566999986346345443299776302677523641360022128889999999972077--79999999999999 Q gi|254780770|r 63 VMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVD--LWKFQCCAKVENTV 139 (161) Q Consensus 63 vmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~--~~~~~~~~~v~~~~ 139 (161) ++=|.| ..-+.-+|.+-+.+.. +..+++.+++.+.|.+||.-|+.|++.+.+..-..+ ....++.+.++.-. T Consensus 217 iFGGfl-Mrka~ElA~~~A~~f~----~~~p~~rsVD~i~F~~pVdvG~~L~f~s~V~yT~~k~~~vqv~~~~~v~s~~ 290 (357) T KOG2763 217 IFGGFL-MRKALELAEITAKLFC----KGRPATRSVDDIEFQKPVDVGCVLTFSSFVTYTDNKSIYVQVKAVASVDSSQ 290 (357) T ss_pred EEHHHH-HHHHHHHHHHHHHHHC----CCCCEEEEECHHHCCCCCEEEEEEEEEEEEEEECCCCEEEEEEEECCCCCCC T ss_conf 536899-9999999999999973----8996378740433268613520999730788716886258988732205642 No 62 >smart00675 DM11 Domains in hypothetical proteins in Drosophila including 2 in CG15241 and CG9329. Probab=78.65 E-value=5.5 Score=20.22 Aligned_cols=72 Identities=14% Similarity=0.165 Sum_probs=45.5 Q ss_pred CCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEE Q ss_conf 88313134799950225669999863463454432997763026775236413600221288899999999720777999 Q gi|254780770|r 50 NEPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKF 129 (161) Q Consensus 50 ~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~ 129 (161) |+.+|. -.++.| ||++-++.+.-++..-. ... ....++. .+++..=.|.|+|++.+++.+-++.+|.++= T Consensus 9 d~~ifs-~C~~~p--~g~~~i~~lfD~S~l~~--~~~---~d~v~vS--Gn~T~~WdiqP~DrI~~~~~v~~~dRG~WqP 78 (164) T smart00675 9 DPDLYS-PCTDAP--PGNIGLREAFDISNLVV--DMD---PDGLHIS--GNITVIWDVQPTDRISARVSVMHFERGTWQP 78 (164) T ss_pred CCCCCC-CCCCCC--CCCCCHHHHCCCCCEEE--EEC---CCEEEEE--CEEEEEEECCCCCEEEEEEEEEEECCCEEEE T ss_conf 835116-488998--88428899001331189--864---8759994--5098999528999899999999973873873 Q ss_pred EE Q ss_conf 99 Q gi|254780770|r 130 QC 131 (161) Q Consensus 130 ~~ 131 (161) +. T Consensus 79 Tv 80 (164) T smart00675 79 TV 80 (164) T ss_pred EE T ss_conf 46 No 63 >PRK07531 bifunctional 3-hydroxyacyl-CoA dehydrogenase/thioesterase; Validated Probab=78.07 E-value=5.7 Score=20.11 Aligned_cols=68 Identities=13% Similarity=0.075 Sum_probs=51.6 Q ss_pred EEEEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEE-CCEEEEEEEEEEEEECCCCCCCCC Q ss_conf 26775236413600221288899999999720777999999999-999999899999984452002344 Q gi|254780770|r 92 PPYLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKV-ENTVVAEAEICAMVMHEKKEKNES 159 (161) Q Consensus 92 ~~~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v-~~~~va~a~l~~~i~~~~~e~~~~ 159 (161) ..++..-.+++|.+-+..|+.++++..+......-..+-.+.+. ||+++|++|..+.=++-+..+.-| T Consensus 388 ~S~fTvE~Hiryl~Ev~~G~~i~v~t~ll~~d~Krlhl~h~m~~~dg~l~AT~E~mllHVDl~~rrs~p 456 (489) T PRK07531 388 HSYYTVETHIRHLGEAKAGQALHVETQLLGGDEKRLHLFHTLYDGDGTLIATGEHMLLHVDLKTGKAVP 456 (489) T ss_pred CCEEEEEEEEHHHHHCCCCCEEEEEEEEEECCCEEEEEEEEEECCCCCEEHHHHHHHEEEECCCCCCCC T ss_conf 977988522216665069984999999982476169999999848984868750344067379987899 No 64 >cd03444 Thioesterase_II_repeat1 Thioesterase II (TEII) is thought to regenerate misprimed nonribosomal peptide synthetases (NRPSs) as well as modular polyketide synthases (PKSs) by hydrolyzing acetyl groups bound to the peptidyl carrier protein (PCP) and acyl carrier protein (ACP) domains, respectively. TEII has two tandem asymmetric hot dog folds that are structurally similar to one found in PaaI thioesterase, 4-hydroxybenzoyl-CoA thioesterase (4HBT) and beta-hydroxydecanoyl-ACP dehydratase and thus, the TEII monomer is equivalent to the homodimeric form of the latter three enzymes. Human TEII is expressed in T cells and has been shown to bind the product of the HIV-1 Nef gene. Probab=76.78 E-value=6.2 Score=19.89 Aligned_cols=53 Identities=19% Similarity=0.152 Sum_probs=43.9 Q ss_pred EEEEEECCC-CEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEE-ECCEEEEEEE Q ss_conf 267752364-1360022128889999999972077799999999-9999999899 Q gi|254780770|r 92 PPYLMSIDK-ARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAK-VENTVVAEAE 144 (161) Q Consensus 92 ~~~l~~i~~-~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~-v~~~~va~a~ 144 (161) ....++.+. +.|++++...|-+.++.+.....++.+...|+++ -+|++||+.. T Consensus 45 ~~~~aSLDHsiwFH~~~~~~~W~l~~~~s~~a~~gRgl~~g~i~~~~G~LvAs~~ 99 (104) T cd03444 45 ASASASLDHAIWFHRPFRADDWLLYEQRSPRAGNGRGLVEGRIFTRDGELVASVA 99 (104) T ss_pred CCEEEECCEEEEEECCCCCCCEEEEEEEEHHCCCCCEEEEEEEECCCCCEEEEEE T ss_conf 6057861245998589877851999999712049803899999999999999997 No 65 >cd03706 mtEFTU_III Domain III of mitochondrial EF-TU (mtEF-TU). mtEF-TU is highly conserved and is 55-60% identical to bacterial EF-TU. The overall structure is similar to that observed in the Escherichia coli and Thermus aquaticus EF-TU. However, compared with that observed in prokaryotic EF-TU the nucleotide-binding domain (domain I) of EF-TUmt is in a different orientation relative to the rest of the structure. Furthermore, domain III is followed by a short 11-amino acid extension that forms one helical turn. This extension seems to be specific to the mitochondrial factors and has not been observed in any of the prokaryotic factors. Probab=66.40 E-value=11 Score=18.45 Aligned_cols=42 Identities=14% Similarity=0.087 Sum_probs=27.5 Q ss_pred EEEEECCEEEEEEEEEEEECCEEEEEEEEEECCEEEEEEEEE Q ss_conf 022128889999999972077799999999999999989999 Q gi|254780770|r 105 KPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVENTVVAEAEIC 146 (161) Q Consensus 105 ~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~~~~va~a~l~ 146 (161) ..|.|||.+.+.+++.+...=-.-.+-.++-+++.|+.+.++ T Consensus 50 ~mvmPGd~~~~~~~L~~p~~le~g~rFtiREgg~TVg~GvVt 91 (93) T cd03706 50 EMVMPGEDTKVTLILRRPMVLEKGQRFTLRDGNRTIGTGLVT 91 (93) T ss_pred EEECCCCCEEEEEEECCCEECCCCCEEEEEECCEEEEEEEEE T ss_conf 586578828999997760611599818893599989989995 No 66 >TIGR00979 fumC_II fumarate hydratase, class II; InterPro: IPR005677 Fumarase C catalyzes the stereospecific interconversion of fumarate to L-malate as part of the metabolic citric acid or Kreb's cycle. The recent three-dimensional structure of fumarase C from Escherichia coli has identified a binding site for anions which is generated by side chains from three of the four subunits within the tetramer. These same side chains are found in the three most highly conserved regions within the class II fumarate hydratase family. Putative fumarases from several species (Mycobacterium tuberculosis, Streptomyces coelicolor, Pseudomonas aeruginosa) branch deeply, although within the same branch of a phylogenetic tree rooted by aspartate ammonia-lyase sequences.; GO: 0004333 fumarate hydratase activity, 0006106 fumarate metabolic process, 0045239 tricarboxylic acid cycle enzyme complex. Probab=65.84 E-value=3.8 Score=21.17 Aligned_cols=37 Identities=24% Similarity=0.423 Sum_probs=31.5 Q ss_pred CEEEEEEEECCCCHHHCCCCCCCCCCCHHH---HHHHHHHHHHHHHH Q ss_conf 749999982688831313479995022566---99998634634544 Q gi|254780770|r 39 ESAIGIKNVTFNEPHFMGHFPGRPVMPGVL---ILEGMAQTAGAICA 82 (161) Q Consensus 39 ~~~~g~k~v~~~e~ff~gHFp~~PvmPGvL---liE~maQ~~~~l~~ 82 (161) .+..|+-.+..|| ||.-||||=+ ++|+|.+.|.-.++ T Consensus 301 RCGlGEl~~PENE-------PGSSIMPGKVNPTQ~EalTMVC~QVmG 340 (459) T TIGR00979 301 RCGLGELFLPENE-------PGSSIMPGKVNPTQCEALTMVCVQVMG 340 (459) T ss_pred CCCCCCCCCCCCC-------CCCCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 6420025677787-------578769789871043588998887730 No 67 >cd03707 EFTU_III Domain III of elongation factor (EF) Tu. Ef-Tu consists of three structural domains, designated I, II and III. Domain III adopts a beta barrel structure. Domain III is involved in binding to both charged tRNA and binding to elongation factor Ts (EF-Ts). EF-Ts is the guanine-nucleotide-exchange factor for EF-Tu. EF-Tu and EF-G participate in the elongation phase during protein biosynthesis on the ribosome. Their functional cycles depend on GTP binding and its hydrolysis. The EF-Tu complexed with GTP and aminoacyl-tRNA delivers tRNA to the ribosome, whereas EF-G stimulates translocation, a process in which tRNA and mRNA movements occur in the ribosome. Crystallographic studies revealed structural similarities ("molecular mimicry") between tertiary structures of EF-G and the EF-Tu-aminoacyl-tRNA ternary complex. Domains III, IV, and V of EF-G mimic the tRNA structure in the EF-Tu ternary complex; domains III, IV and V can be related to the acceptor stem, anticodon helix Probab=64.49 E-value=11 Score=18.40 Aligned_cols=40 Identities=15% Similarity=0.184 Sum_probs=26.5 Q ss_pred EEEEECCEEEEEEEEEEEECCEEEEEEEEEECCEEEEEEE Q ss_conf 0221288899999999720777999999999999999899 Q gi|254780770|r 105 KPVFPGDRLEYHVNKVRNRVDLWKFQCCAKVENTVVAEAE 144 (161) Q Consensus 105 ~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v~~~~va~a~ 144 (161) ..|.|||.+.+.+++.+...=-.-..-.++-++++|+.+. T Consensus 50 ~mvmPGd~~~~~v~L~~p~~le~g~rFtiREGg~TVg~Gi 89 (90) T cd03707 50 EMVMPGDNVKMTVELIHPIALEKGLRFAIREGGRTVGAGV 89 (90) T ss_pred EEECCCCEEEEEEEECCCCCCCCCCEEEEEECCEEEEEEE T ss_conf 5973577289999987873147899699978999898778 No 68 >PRK01777 hypothetical protein; Validated Probab=58.96 E-value=11 Score=18.53 Aligned_cols=52 Identities=12% Similarity=0.126 Sum_probs=34.1 Q ss_pred CHHHHHHHHHHHHHHHHH-HCCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEE Q ss_conf 256699998634634544-3299776302677523641360022128889999999 Q gi|254780770|r 65 PGVLILEGMAQTAGAICA-IHNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNK 119 (161) Q Consensus 65 PGvLliE~maQ~~~~l~~-~~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i 119 (161) +|+-..|++.|++ ++-. -..+....+.+.|+. .++.-..+.+||+++|+=-+ T Consensus 25 ~GtTv~~Ai~~Sg-i~~~fpeidl~~~~vGIfgk--~~~ld~~L~~GDRVEIYRPL 77 (95) T PRK01777 25 EGATVEEAIRASG-LLELRTDIDLAKNKVGIFSR--PVKLTDVLRDGDRVEIYRPL 77 (95) T ss_pred CCCCHHHHHHHCC-CHHHCCCCCCCCCCEEEECC--CCCCCCCCCCCCEEEEECCC T ss_conf 9993999999829-65448765535474507884--83887728999999985365 No 69 >pfam10989 DUF2808 Protein of unknown function (DUF2808). This family of proteins with unknown function appears to be restricted to Cyanobacteria. Probab=55.91 E-value=17 Score=17.32 Aligned_cols=40 Identities=18% Similarity=0.283 Sum_probs=31.3 Q ss_pred CCEEEEEEEECCEEEEEEEE-EEE-ECCEEEEEEEEEECCEE Q ss_conf 41360022128889999999-972-07779999999999999 Q gi|254780770|r 100 KARFRKPVFPGDRLEYHVNK-VRN-RVDLWKFQCCAKVENTV 139 (161) Q Consensus 100 ~~kF~~~V~PGd~L~i~~~i-~~~-~~~~~~~~~~~~v~~~~ 139 (161) .+.|..||-||+++++..+- ..- ..|++.|.+.++--|+. T Consensus 88 ~I~f~~PV~pg~tvtv~l~~v~NP~~~G~Y~F~~~a~p~G~~ 129 (142) T pfam10989 88 EIFFAEPVPPGTTVTVVLKNVRNPRSGGTYQFNATAQPPGDN 129 (142) T ss_pred EEEECCCCCCCCEEEEEEECCCCCCCCCEEEEEEEEECCCCC T ss_conf 998189989899899998431397988559998998879998 No 70 >pfam03658 UPF0125 Uncharacterized protein family (UPF0125). Probab=54.05 E-value=15 Score=17.63 Aligned_cols=52 Identities=15% Similarity=0.224 Sum_probs=33.3 Q ss_pred CHHHHHHHHHHHHHHHHHH-CCCCCCCEEEEEEECCCCEEEEEEEECCEEEEEEEE Q ss_conf 2566999986346345443-299776302677523641360022128889999999 Q gi|254780770|r 65 PGVLILEGMAQTAGAICAI-HNGFDQYAPPYLMSIDKARFRKPVFPGDRLEYHVNK 119 (161) Q Consensus 65 PGvLliE~maQ~~~~l~~~-~~~~~~~~~~~l~~i~~~kF~~~V~PGd~L~i~~~i 119 (161) .|+-..|++.|++ ++... .......+.+.++. .++......+||+++|+=-+ T Consensus 22 ~GtTv~~Ai~~Sg-i~~~fPeidl~~~~vGIfgk--~v~l~~~L~~GDRVEIYRPL 74 (83) T pfam03658 22 EGTTVREAILQSG-LLRQFPDIDLSKNKVGIFGK--PVKLDDPLKDGDRIEIYRPL 74 (83) T ss_pred CCCCHHHHHHHCC-CHHHCCCCCCCCCCEEEECC--CCCCCCCCCCCCEEEEECCC T ss_conf 9894999999759-56569665556463646884--73887707999999995366 No 71 >pfam10862 DUF2662 Protein of unknown function (DUF2662). This bacterial family of proteins has no known function. Probab=45.73 E-value=24 Score=16.34 Aligned_cols=123 Identities=22% Similarity=0.192 Sum_probs=54.7 Q ss_pred CCCCCCEEEEEEE-EEEEC-----CCEEEEEEEECCCCHHH---CCCCCCCCCCCHHHHHHHHHHHHHHHHHHC------ Q ss_conf 7999750547899-99846-----97499999826888313---134799950225669999863463454432------ Q gi|254780770|r 20 LPHRYPFLLVDKV-VNIQR-----DESAIGIKNVTFNEPHF---MGHFPGRPVMPGVLILEGMAQTAGAICAIH------ 84 (161) Q Consensus 20 lPhr~Pfl~ID~i-~~~~~-----g~~~~g~k~v~~~e~ff---~gHFp~~PvmPGvLliE~maQ~~~~l~~~~------ 84 (161) -|+|.+-..+-.. +..++ |.++.+.=+++..|.-+ .|||-. |=..=|.-|++=.+++.. T Consensus 4 ~PY~~~CrYL~sA~vt~~~~~~a~~~~~~a~g~F~IpEScYIddTGHFNa------VEfniCfNQlaY~~~A~~v~~~li 77 (157) T pfam10862 4 EPYSKGCRYLLDAVVTESDQGRARGDSVLAYGNFTIPESCYIDDTGHFNA------VELNICYNQLAYSALAPCVKNGLI 77 (157) T ss_pred CCCCCCCCCHHEEEEECCCCCCCCCCCEEEEEEECCCCCEEECCCCCCCE------EEEEHHHHHHHHHHHHHHHHHCCC T ss_conf 54466670122016301798765776368988773787168847887425------652222788999999999875344 Q ss_pred CCCCC--------CE-EEEEEECCCCEEEEEEEECC---EEEE-EEEEEEEECCEEEEEEEE--EECCEEEEEEEEEEE Q ss_conf 99776--------30-26775236413600221288---8999-999997207779999999--999999998999999 Q gi|254780770|r 85 NGFDQ--------YA-PPYLMSIDKARFRKPVFPGD---RLEY-HVNKVRNRVDLWKFQCCA--KVENTVVAEAEICAM 148 (161) Q Consensus 85 ~~~~~--------~~-~~~l~~i~~~kF~~~V~PGd---~L~i-~~~i~~~~~~~~~~~~~~--~v~~~~va~a~l~~~ 148 (161) ..+.. .. .-.|..--+-+|++|+.|-+ ++++ .+............++.+ +-++.=-|.||+.++ T Consensus 78 ~~~~~Wsiddf~~~QL~dilI~~~~S~Frk~in~r~FsG~~~~~~~~~~~~~~~~l~l~t~~~f~D~~gG~s~GEv~lA 156 (157) T pfam10862 78 PGLRGWSIDDYFRHQLSDILIVKFSSRFRKPINPAKFSGRLEVRSLTVRERTWRYLFLSTYIEFWDDNGGAASGEVELA 156 (157) T ss_pred CCCCCCCHHHHHHHHCCCEEEEHHHHHHHCCCCHHHCEEEEEEEEEEEECCCCCEEEEEEEEEEEECCCCCCCCEEEEC T ss_conf 3224667889998653633341124444245774452678999998882355547999867899828998405638851 No 72 >TIGR02480 fliN flagellar motor switch protein FliN; InterPro: IPR012826 Proteins that consist largely of this domain can be designated flagellar motor switch proteins: FliN. Longer proteins in which this region is a C-terminal domain typically are designated FliY. More distantly related sequences, outside of this family, are associated with type III secretion systems (TTSS) and include the surface presentation of antigens protein SpaO required or invasion of host cells by Salmonella enterica.; GO: 0001539 ciliary or flagellar motility, 0006935 chemotaxis, 0016020 membrane, 0044461 flagellin-based flagellum part. Probab=45.18 E-value=25 Score=16.29 Aligned_cols=16 Identities=19% Similarity=0.247 Sum_probs=13.6 Q ss_pred EEEEECCEEEEEEEEE Q ss_conf 9999999999989999 Q gi|254780770|r 131 CCAKVENTVVAEAEIC 146 (161) Q Consensus 131 ~~~~v~~~~va~a~l~ 146 (161) -.++|||+++|.||.. T Consensus 45 ldI~VNG~~ia~GEvv 60 (77) T TIGR02480 45 LDILVNGRLIARGEVV 60 (77) T ss_pred EEEEECCCEEEEEEEE T ss_conf 1598889278889899 No 73 >COG0114 FumC Fumarase [Energy production and conversion] Probab=42.05 E-value=18 Score=17.18 Aligned_cols=37 Identities=27% Similarity=0.451 Sum_probs=27.8 Q ss_pred EEEEEEEECCCCHHHCCCCCCCCCCCHH---HHHHHHHHHHHHHHHH Q ss_conf 4999998268883131347999502256---6999986346345443 Q gi|254780770|r 40 SAIGIKNVTFNEPHFMGHFPGRPVMPGV---LILEGMAQTAGAICAI 83 (161) Q Consensus 40 ~~~g~k~v~~~e~ff~gHFp~~PvmPGv---LliE~maQ~~~~l~~~ 83 (161) ...|...+..+| ||..+|||- .++|+|.|.++-.++. T Consensus 303 ~GLgEi~lPene-------PGSSIMPGKVNPtq~EA~tmv~~QV~Gn 342 (462) T COG0114 303 CGLGEIELPENE-------PGSSIMPGKVNPTQCEALTMVAAQVIGN 342 (462) T ss_pred CCCCCCCCCCCC-------CCCCCCCCCCCCHHHHHHHHHHHHHHCC T ss_conf 775421057789-------9766789998942799999999999755 No 74 >PRK09376 rho transcription termination factor Rho; Provisional Probab=38.78 E-value=31 Score=15.69 Aligned_cols=112 Identities=14% Similarity=0.189 Sum_probs=63.3 Q ss_pred CCCHHHCCHHHH-HHHHHHCCCCCCEEEEEEEEEEECCCEEEEEEEECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 741211286418-9899847999750547899998469749999982688831313479995022566999986346345 Q gi|254780770|r 2 ISDLACLDAKDI-VELMRFLPHRYPFLLVDKVVNIQRDESAIGIKNVTFNEPHFMGHFPGRPVMPGVLILEGMAQTAGAI 80 (161) Q Consensus 2 ~~~~~~l~~~~~-~eI~~~lPhr~Pfl~ID~i~~~~~g~~~~g~k~v~~~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l 80 (161) +.+...|.+.++ .+|.+..-...-.++.+.++++-|+.+ |+ --+.+..|..+ |+++-+|.+.+=-.=...+-.+ T Consensus 24 I~~~~~~~K~eLI~~Ilk~~a~~~~~i~~~GvLEi~~dGy--GF-LR~~~~nylp~--~dDiYVS~sqIrrf~LR~GD~V 98 (416) T PRK09376 24 IENASRLRKQELIFAILKAQAEKGEDIFGEGVLEILPDGF--GF-LRSPDANYLPG--PDDIYVSPSQIRRFNLRTGDTV 98 (416) T ss_pred CCCCCCCCHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCC--EE-EECCCCCCCCC--CCCEEECHHHHHHHCCCCCCEE T ss_conf 9876778999999999999985289587679999648986--57-05787688899--8883368999998099888889 Q ss_pred HHHCCCC-CCCEEEEEEECCCC------------EE--EEEEEECCEEEEEEE Q ss_conf 4432997-76302677523641------------36--002212888999999 Q gi|254780770|r 81 CAIHNGF-DQYAPPYLMSIDKA------------RF--RKPVFPGDRLEYHVN 118 (161) Q Consensus 81 ~~~~~~~-~~~~~~~l~~i~~~------------kF--~~~V~PGd~L~i~~~ 118 (161) .+..... .+.+-..|..|+.+ .| .-|++|.++|.++.. T Consensus 99 ~G~vR~pke~Ery~aLl~V~~VNg~~pe~~~~r~~F~~LTPi~P~erl~LE~~ 151 (416) T PRK09376 99 EGKIRPPKEGERYFALLKVETVNGEDPEKARNRILFENLTPLYPNERLKLETG 151 (416) T ss_pred EEEEECCCCCCCCCCEEEEEECCCCCHHHHCCCCCCCCCCCCCCHHHHCCCCC T ss_conf 99886898888774337763228939899657354014877797223114568 No 75 >pfam05179 CDC73 RNA pol II accessory factor, Cdc73 family. Probab=38.00 E-value=22 Score=16.66 Aligned_cols=55 Identities=16% Similarity=0.179 Sum_probs=37.4 Q ss_pred CCEEEEEEEEEEECCCE--EEEEEEECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCC Q ss_conf 75054789999846974--9999982688831313479995022566999986346345443299 Q gi|254780770|r 24 YPFLLVDKVVNIQRDES--AIGIKNVTFNEPHFMGHFPGRPVMPGVLILEGMAQTAGAICAIHNG 86 (161) Q Consensus 24 ~Pfl~ID~i~~~~~g~~--~~g~k~v~~~e~ff~gHFp~~PvmPGvLliE~maQ~~~~l~~~~~~ 86 (161) -+|..||....+.|..| ++|+ -++..+|-|+|.--.+| .|.+.+.+|+++.+... T Consensus 175 v~y~VVDn~~~~~pe~W~RVVAV-F~~G~~WQFK~wp~~~P-------~~lF~~v~Gf~l~~~~~ 231 (273) T pfam05179 175 VRYRVVDNPEKLKPDDWDRVVAV-FVTGPAWQFKGWPWNSP-------VDIFHKIKGFHLHFDGD 231 (273) T ss_pred EEEEEECCHHHCCCCCCCEEEEE-EECCCHHHHCCCCCCCH-------HHHHHHCEEEEEEECCC T ss_conf 57999528342791445368999-96284252168998998-------99975423899982698 No 76 >TIGR00369 unchar_dom_1 uncharacterized domain 1; InterPro: IPR003736 This domain is found in the PAAI protein from Escherichia coli that may be involved in phenylacetic acid degradation and in a few others that may be transcription regulators . Most proteins containing this domain consist almost entirely of a single copy of this domain. A protein from C. elegans consists of two tandem copies of the domain. The domain is also found as the N-terminal region of an apparent initiation factor eIF-2B alpha subunit of Aquifex aeolicus.. Probab=36.66 E-value=34 Score=15.49 Aligned_cols=54 Identities=9% Similarity=-0.063 Sum_probs=43.8 Q ss_pred EEEECCCCEEEEEEEECCEEEEEEEEEEEECCEEEEEEEEEE-CCEEEEEEEEEE Q ss_conf 775236413600221288899999999720777999999999-999999899999 Q gi|254780770|r 94 YLMSIDKARFRKPVFPGDRLEYHVNKVRNRVDLWKFQCCAKV-ENTVVAEAEICA 147 (161) Q Consensus 94 ~l~~i~~~kF~~~V~PGd~L~i~~~i~~~~~~~~~~~~~~~v-~~~~va~a~l~~ 147 (161) +...--++.|.+|+.-|++++-..+.....+....+..++.- .++++|.+..+. T Consensus 66 ~~~~~~~~~~~rp~~~g~~~~a~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 120 (122) T TIGR00369 66 VVGLELNVNYLRPAREGGKLRAIAEPVHLGRQTGVAEIEIVDEQGRLCALSRGTL 120 (122) T ss_pred EEEEEEEEECCCCCCCCCEEEEEEEEEEECCCEEEEEEEEECCCCCEEEEECEEE T ss_conf 1100000000222246971688778863066102688898876674786411022 No 77 >TIGR02903 spore_lon_C ATP-dependent protease, Lon family; InterPro: IPR014252 This entry shows some relation to the widely distributed ATP-dependent protease La, also called Lon or LonA (IPR004815 from INTERPRO), but is more closely related to LonB (IPR014251 from INTERPRO), a LonA paralog found only in endospore-forming bacteria. Proteins in this entry are unassigned peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SJ). They are restricted to a subset of endospore-forming species, and probably participate in the program of endospore formation. We propose the designation LonC.. Probab=35.82 E-value=8.7 Score=19.04 Aligned_cols=41 Identities=17% Similarity=0.378 Sum_probs=24.2 Q ss_pred CEEEEEEEEEEEC---CCE--EEEEEEECCCCHHHCCCCCCCCCCCHHH Q ss_conf 5054789999846---974--9999982688831313479995022566 Q gi|254780770|r 25 PFLLVDKVVNIQR---DES--AIGIKNVTFNEPHFMGHFPGRPVMPGVL 68 (161) Q Consensus 25 Pfl~ID~i~~~~~---g~~--~~g~k~v~~~e~ff~gHFp~~PvmPGvL 68 (161) -.||||+|=++|+ .+- +--=|.|..+..||+ |+||+.|--+ T Consensus 268 GvLFIDEIGELD~lLQnKLLKVLEDKrV~F~SsYYD---pdD~NvPkYI 313 (616) T TIGR02903 268 GVLFIDEIGELDPLLQNKLLKVLEDKRVEFSSSYYD---PDDENVPKYI 313 (616) T ss_pred CEEEEECHHHHHHHHHHHHHHHHCCCEEEEEECCCC---CCCCCCCHHH T ss_conf 567650211222787632444322643665321248---7537865588 No 78 >pfam01643 Acyl-ACP_TE Acyl-ACP thioesterase. This family consists of various acyl-acyl carrier protein (ACP) thioesterases (TE) these terminate fatty acyl group extension via hydrolysing an acyl group on a fatty acid. Probab=32.63 E-value=39 Score=15.09 Aligned_cols=45 Identities=20% Similarity=0.163 Sum_probs=19.2 Q ss_pred CEEEEEEEECCEEEEEEEEE--EEECCEEEEEEEEEECCEEEEEEEE Q ss_conf 13600221288899999999--7207779999999999999998999 Q gi|254780770|r 101 ARFRKPVFPGDRLEYHVNKV--RNRVDLWKFQCCAKVENTVVAEAEI 145 (161) Q Consensus 101 ~kF~~~V~PGd~L~i~~~i~--~~~~~~~~~~~~~~v~~~~va~a~l 145 (161) ++|++.+..||.+...+++. ....++...+.-...+|+.+|.++. T Consensus 197 i~y~~E~~~Gd~i~s~~~~~~~~~~~~~~~~H~i~~~~g~e~~~~~t 243 (247) T pfam01643 197 LKYRKEVQYGDVVESLTEVEPAESEDGLKCLHLIRLEDGEEIARART 243 (247) T ss_pred EEEEECCCCCCEEEEEEEECCCCCCCCEEEEEEEECCCCCEEEEEEE T ss_conf 99944167999899999865687888539999998599928999999 No 79 >PRK12273 aspA aspartate ammonia-lyase; Provisional Probab=31.10 E-value=32 Score=15.59 Aligned_cols=24 Identities=29% Similarity=0.450 Sum_probs=18.6 Q ss_pred CCCCCCCH---HHHHHHHHHHHHHHHH Q ss_conf 99950225---6699998634634544 Q gi|254780770|r 59 PGRPVMPG---VLILEGMAQTAGAICA 82 (161) Q Consensus 59 p~~PvmPG---vLliE~maQ~~~~l~~ 82 (161) ||..+||| ..+.|++.|.|.-..+ T Consensus 318 pGSSIMPGKVNPv~~E~~~mv~~qViG 344 (471) T PRK12273 318 AGSSIMPGKVNPVIPEVVNQVCFQVIG 344 (471) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHC T ss_conf 987778999896789999999999974 No 80 >PRK13325 bifunctional biotin--[acetyl-CoA-carboxylase] ligase/pantothenate kinase; Reviewed Probab=29.25 E-value=45 Score=14.75 Aligned_cols=21 Identities=43% Similarity=0.835 Sum_probs=11.5 Q ss_pred CCCCCCCCCCH-HHHHHHHHHH Q ss_conf 34799950225-6699998634 Q gi|254780770|r 56 GHFPGRPVMPG-VLILEGMAQT 76 (161) Q Consensus 56 gHFp~~PvmPG-vLliE~maQ~ 76 (161) |||-|--|||| -|+.|+|+|- T Consensus 469 ~~yLGG~ImPG~~lM~e~la~~ 490 (592) T PRK13325 469 GHYLGGTIMPGFHLMKESLAVR 490 (592) T ss_pred CCEECCCCCCCHHHHHHHHHHH T ss_conf 8600530156568999999742 No 81 >cd01533 4RHOD_Repeat_2 Member of the Rhodanese Homology Domain superfamily, repeat 2. This CD includes putative rhodanese-related sulfurtransferases which contain 4 copies of the Rhodanese Homology Domain. This CD aligns the 2nd repeat which does contain the putative catalytic Cys residue. Probab=28.26 E-value=47 Score=14.64 Aligned_cols=45 Identities=18% Similarity=0.202 Sum_probs=29.6 Q ss_pred CHHHHHHHHHHCCCCCCEEEEEEEEEEECCCEEEEEEEECCCCHHHCCCCCCCCCCCHHHH Q ss_conf 8641898998479997505478999984697499999826888313134799950225669 Q gi|254780770|r 9 DAKDIVELMRFLPHRYPFLLVDKVVNIQRDESAIGIKNVTFNEPHFMGHFPGRPVMPGVLI 69 (161) Q Consensus 9 ~~~~~~eI~~~lPhr~Pfl~ID~i~~~~~g~~~~g~k~v~~~e~ff~gHFp~~PvmPGvLl 69 (161) +....+|+++++-...++++|| |+..+.|=.||.|+.=.+|...+ T Consensus 10 P~Is~~elk~~l~~g~~~~llD----------------VRe~~E~~~~hIPGA~~iP~~~l 54 (109) T cd01533 10 PSVSADELAALQARGAPLVVLD----------------GRRFDEYRKMTIPGSVSCPGAEL 54 (109) T ss_pred CCCCHHHHHHHHHCCCCEEEEE----------------CCCHHHHHHCCCCCCEECCHHHH T ss_conf 9578999999997799859996----------------98988886075899888772378 No 82 >KOG1317 consensus Probab=27.25 E-value=33 Score=15.56 Aligned_cols=36 Identities=28% Similarity=0.488 Sum_probs=26.5 Q ss_pred EEEEEEEECCCCHHHCCCCCCCCCCCHHH---HHHHHHHHHHHHHH Q ss_conf 49999982688831313479995022566---99998634634544 Q gi|254780770|r 40 SAIGIKNVTFNEPHFMGHFPGRPVMPGVL---ILEGMAQTAGAICA 82 (161) Q Consensus 40 ~~~g~k~v~~~e~ff~gHFp~~PvmPGvL---liE~maQ~~~~l~~ 82 (161) ...|......|| ||..+|||-. ++|+|.+.++-..+ T Consensus 328 cGlgEL~LPENE-------PGSSIMPGKVNPTQcEamTmvcaQVMG 366 (487) T KOG1317 328 CGLGELMLPENE-------PGSSIMPGKVNPTQCEAMTMVCAQVMG 366 (487) T ss_pred CCCCCCCCCCCC-------CCCCCCCCCCCCHHHHHHHHHHHHHHC T ss_conf 776531367899-------986668888791788999999999845 No 83 >COG5105 MIH1 Mitotic inducer, protein phosphatase [Cell division and chromosome partitioning] Probab=25.00 E-value=41 Score=14.99 Aligned_cols=20 Identities=25% Similarity=0.416 Sum_probs=11.7 Q ss_pred HCCCCCCEEEEEEEEEEECC Q ss_conf 47999750547899998469 Q gi|254780770|r 19 FLPHRYPFLLVDKVVNIQRD 38 (161) Q Consensus 19 ~lPhr~Pfl~ID~i~~~~~g 38 (161) .-|||||+|+.-.|.-++.| T Consensus 332 ~N~dhYP~L~yPevyIl~GG 351 (427) T COG5105 332 KNPDHYPLLTYPEVYILEGG 351 (427) T ss_pred CCCCCCCCCCCCEEEEECCC T ss_conf 28655861005627986172 No 84 >TIGR02303 HpaG-C-term 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, C-terminal subunit; InterPro: IPR012684 This entry represents one of two subunits/domains of the bifunctional isomerase/decarboxylase involved in 4-hydroxyphenylacetate degradation . In Escherichia coli and some other species, this enzyme is encoded by a single polypeptide containing both this domain and the closely related N-terminal domain (IPR012686 from INTERPRO). In other species such as Pasteurella multocida, these domains are found as two separate proteins (usually as tandem genes). Together, these domains carry out the decarboxylation of 5-oxopent-3-ene-1,2,5-tricarboxylic acid (OPET) to 2-hydroxy-2,4-diene-1,7-dioate (HHDD) and the subsequent isomerization to 2-oxohept-3-ene-1,7-dioate (OHED).; GO: 0008704 5-carboxymethyl-2-hydroxymuconate delta-isomerase activity, 0018800 5-oxopent-3-ene-125-tricarboxylate decarboxylase activity. Probab=22.88 E-value=49 Score=14.50 Aligned_cols=16 Identities=31% Similarity=0.528 Sum_probs=8.7 Q ss_pred EEEEECCEEEEEEEEE Q ss_conf 0221288899999999 Q gi|254780770|r 105 KPVFPGDRLEYHVNKV 120 (161) Q Consensus 105 ~~V~PGd~L~i~~~i~ 120 (161) ..|.|||.+++++|-+ T Consensus 224 ~dv~pGD~v~~eiEGv 239 (249) T TIGR02303 224 SDVKPGDVVRLEIEGV 239 (249) T ss_pred CCCCCCCEEEEEECCC T ss_conf 2235587899977002 No 85 >PRK12425 fumarate hydratase; Provisional Probab=22.40 E-value=49 Score=14.53 Aligned_cols=33 Identities=27% Similarity=0.458 Sum_probs=20.9 Q ss_pred EEEEEEECCCCHHHCCCCCCCCCCCH---HHHHHHHHHHHHHH Q ss_conf 99999826888313134799950225---66999986346345 Q gi|254780770|r 41 AIGIKNVTFNEPHFMGHFPGRPVMPG---VLILEGMAQTAGAI 80 (161) Q Consensus 41 ~~g~k~v~~~e~ff~gHFp~~PvmPG---vLliE~maQ~~~~l 80 (161) ..|.-.+...+ ||..+||| ..+.|++.|+|.-. T Consensus 303 GlgEi~LP~~q-------pGSSIMPGKVNPvi~E~v~qv~~qV 338 (464) T PRK12425 303 GLAEVRLPANE-------PGSSIMPGKVNPTQCEALSMLACQV 338 (464) T ss_pred CCCCCCCCCCC-------CCCCCCCCCCCCHHHHHHHHCCEEE T ss_conf 76512489999-------9877789987988888865274154 No 86 >pfam08865 DUF1830 Domain of unknown function (DUF1830). This family of short proteins is functionally uncharacterized. Probab=22.09 E-value=61 Score=13.94 Aligned_cols=27 Identities=26% Similarity=0.454 Sum_probs=19.5 Q ss_pred EEEEEECCCCEEEEEEEECCEEEEEEE Q ss_conf 267752364136002212888999999 Q gi|254780770|r 92 PPYLMSIDKARFRKPVFPGDRLEYHVN 118 (161) Q Consensus 92 ~~~l~~i~~~kF~~~V~PGd~L~i~~~ 118 (161) +.-..++.+.-|-|.|.||+.+.+++- T Consensus 12 I~Ri~~~~~~y~ERVVFP~e~l~FeAp 38 (68) T pfam08865 12 VLRCIGIPNFYLERVVFPGERLLFEAP 38 (68) T ss_pred EEEEECCCCCEEEEEECCCCEEEEECC T ss_conf 999958898279998768868999859 No 87 >cd01519 RHOD_HSP67B2 Member of the Rhodanese Homology Domain superfamily. This CD includes the heat shock protein 67B2 of Drosophila melanogaster and other similar proteins, many of which are uncharacterized. Probab=20.74 E-value=65 Score=13.77 Aligned_cols=41 Identities=22% Similarity=0.214 Sum_probs=25.1 Q ss_pred HHHHHHHCCCCCCEEEEEEEEEEECCCEEEEEEEECCCCHHHCCCCCCCCCCCHHHH Q ss_conf 898998479997505478999984697499999826888313134799950225669 Q gi|254780770|r 13 IVELMRFLPHRYPFLLVDKVVNIQRDESAIGIKNVTFNEPHFMGHFPGRPVMPGVLI 69 (161) Q Consensus 13 ~~eI~~~lPhr~Pfl~ID~i~~~~~g~~~~g~k~v~~~e~ff~gHFp~~PvmPGvLl 69 (161) .+|++++|-..+-+++|| |+..+.|-.||.|+.=..|-.-+ T Consensus 3 ~~e~k~ll~~~p~~vLID----------------vR~~~E~~~ghIpgainip~~~~ 43 (106) T cd01519 3 FEEVKNLPNPHPNKVLID----------------VREPEELKTGKIPGAINIPLSSL 43 (106) T ss_pred HHHHHHHHHCCCCEEEEE----------------CCCHHHHHCCCCCCCEECCHHHH T ss_conf 899999986098959998----------------97889997597888874577665 Done!