Query gi|254780663|ref|YP_003065076.1| carbonate dehydratase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 207 No_of_seqs 125 out of 2127 Neff 7.8 Searched_HMMs 39220 Date Sun May 29 20:39:43 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780663.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK10437 carbonic anhydrase; P 100.0 0 0 389.6 20.6 194 1-205 1-201 (220) 2 cd00884 beta_CA_cladeB Carboni 100.0 0 0 377.1 20.1 184 11-195 1-190 (190) 3 COG0288 CynT Carbonic anhydras 100.0 0 0 354.5 20.3 196 2-205 1-205 (207) 4 cd00883 beta_CA_cladeA Carboni 100.0 0 0 354.8 19.0 174 12-195 1-182 (182) 5 cd03378 beta_CA_cladeC Carboni 100.0 0 0 321.5 18.4 145 4-195 4-154 (154) 6 pfam00484 Pro_CA Carbonic anhy 100.0 3.2E-41 1.4E-45 287.7 16.0 146 36-192 1-146 (146) 7 cd00382 beta_CA Carbonic anhyd 100.0 6E-41 1.4E-45 285.9 13.0 119 32-195 1-119 (119) 8 cd03379 beta_CA_cladeD Carboni 100.0 8.9E-39 2.3E-43 271.9 10.9 142 32-195 1-142 (142) 9 KOG1578 consensus 100.0 4.3E-35 1.1E-39 247.9 10.2 193 4-198 62-260 (276) 10 pfam10575 consensus 99.9 7.4E-25 1.9E-29 181.5 7.8 154 32-198 1-158 (159) 11 KOG1578 consensus 98.7 1E-09 2.6E-14 83.2 -2.8 195 8-203 3-257 (276) 12 pfam10070 DUF2309 Uncharacteri 89.1 0.86 2.2E-05 25.3 4.8 38 161-198 534-577 (783) 13 TIGR01184 ntrCD nitrate ABC tr 86.9 2.2 5.6E-05 22.6 5.8 16 92-107 9-24 (230) 14 PRK03604 moaC bifunctional mol 73.9 1.3 3.4E-05 24.0 0.9 75 30-107 142-219 (301) 15 pfam01707 Peptidase_C9 Peptida 71.6 3.8 9.7E-05 21.1 2.8 35 161-201 138-173 (202) 16 COG3002 Uncharacterized protei 70.0 7.3 0.00019 19.2 3.9 20 93-112 561-592 (880) 17 COG1254 AcyP Acylphosphatases 70.0 3.6 9.1E-05 21.3 2.4 19 177-195 29-47 (92) 18 cd04169 RF3 RF3 subfamily. Pe 59.5 4.6 0.00012 20.5 1.3 27 80-106 83-109 (267) 19 TIGR00073 hypB hydrogenase acc 56.0 10 0.00025 18.4 2.5 44 32-75 61-137 (225) 20 PRK10463 hydrogenase nickel in 52.6 18 0.00045 16.7 5.0 45 31-75 130-203 (290) 21 TIGR00385 dsbE periplasmic pro 47.6 17 0.00042 16.9 2.6 36 154-189 69-106 (175) 22 pfam00708 Acylphosphatase Acyl 46.8 22 0.00056 16.1 3.6 19 177-195 28-46 (90) 23 COG1116 TauB ABC-type nitrate/ 46.1 9.9 0.00025 18.4 1.2 16 92-107 27-42 (248) 24 cd03548 Rieske_RO_Alpha_OMO_CA 45.8 12 0.00031 17.8 1.6 18 178-195 76-93 (136) 25 cd03478 Rieske_AIFL_N AIFL (ap 45.4 8.2 0.00021 18.9 0.7 27 178-204 59-87 (95) 26 pfam00355 Rieske Rieske [2Fe-2 45.0 11 0.00028 18.1 1.3 21 178-198 66-86 (99) 27 KOG2564 consensus 43.3 14 0.00037 17.3 1.7 34 83-116 134-167 (343) 28 TIGR02528 EutP ethanolamine ut 42.3 12 0.00031 17.8 1.2 87 21-113 50-140 (144) 29 cd03467 Rieske Rieske domain; 41.6 9.9 0.00025 18.4 0.7 18 178-195 61-78 (98) 30 cd03528 Rieske_RO_ferredoxin R 41.4 9.4 0.00024 18.5 0.5 18 178-195 60-77 (98) 31 TIGR00088 trmD tRNA (guanine-N 41.1 13 0.00034 17.6 1.2 12 93-104 105-116 (247) 32 TIGR01347 sucB 2-oxoglutarate 40.6 12 0.00031 17.8 1.0 40 155-195 265-312 (435) 33 cd03529 Rieske_NirD Assimilato 39.1 9.4 0.00024 18.5 0.2 16 178-193 66-81 (103) 34 TIGR00055 uppS undecaprenyl di 38.6 1 2.7E-05 24.7 -4.7 54 8-67 8-61 (237) 35 cd03474 Rieske_T4moC Toluene-4 38.5 12 0.00029 17.9 0.6 16 178-193 61-76 (108) 36 TIGR01208 rmlA_long glucose-1- 38.1 27 0.00068 15.6 2.4 35 155-189 181-227 (361) 37 COG2146 {NirD} Ferredoxin subu 37.6 11 0.00029 18.0 0.4 18 178-195 66-83 (106) 38 PRK09965 3-phenylpropionate di 36.8 9.7 0.00025 18.4 -0.0 27 178-204 60-88 (104) 39 KOG4780 consensus 36.7 18 0.00046 16.7 1.4 32 40-71 390-421 (467) 40 pfam07646 Kelch_2 Kelch motif. 36.5 28 0.00071 15.5 2.3 34 174-207 10-46 (48) 41 cd03530 Rieske_NirD_small_Baci 34.1 12 0.00031 17.8 0.1 16 178-193 61-76 (98) 42 pfam07250 Glyoxal_oxid_N Glyox 34.1 34 0.00087 14.9 3.4 17 178-194 169-185 (243) 43 KOG0922 consensus 33.5 21 0.00053 16.3 1.2 25 176-200 328-354 (674) 44 pfam01074 Glyco_hydro_38 Glyco 32.8 36 0.00092 14.7 5.4 15 3-17 94-108 (269) 45 TIGR02090 LEU1_arch isopropylm 31.5 26 0.00066 15.7 1.5 27 90-116 306-333 (371) 46 TIGR01717 AMP-nucleosdse AMP n 31.2 38 0.00097 14.6 3.9 127 54-192 266-423 (486) 47 TIGR01324 cysta_beta_ly_B cyst 30.9 32 0.00082 15.0 1.9 32 75-106 176-213 (389) 48 pfam02100 ODC_AZ Ornithine dec 30.4 39 0.001 14.5 2.8 29 78-106 118-146 (187) 49 pfam06581 DUF1135 Protein of u 28.6 23 0.00058 16.1 0.8 33 75-107 145-188 (239) 50 cd01890 LepA LepA subfamily. 28.5 28 0.00071 15.4 1.2 18 90-107 89-106 (179) 51 cd01886 EF-G Elongation factor 27.8 25 0.00063 15.8 0.8 15 184-198 164-178 (270) 52 COG2885 OmpA Outer membrane pr 27.2 45 0.0011 14.1 2.9 29 79-107 99-127 (190) 53 KOG1420 consensus 27.0 32 0.00082 15.1 1.3 23 91-113 354-376 (1103) 54 PRK13224 consensus 26.5 46 0.0012 14.0 6.5 90 22-118 92-181 (216) 55 cd03297 ABC_ModC_molybdenum_tr 26.3 38 0.00097 14.6 1.6 21 143-163 157-177 (214) 56 cd03472 Rieske_RO_Alpha_BPDO_l 26.2 23 0.00059 16.0 0.5 16 179-195 72-87 (128) 57 cd01883 EF1_alpha Eukaryotic e 26.1 28 0.00071 15.5 0.8 14 87-100 130-143 (219) 58 cd03301 ABC_MalK_N The N-termi 25.3 39 0.001 14.5 1.5 12 95-106 27-38 (213) 59 cd00982 gltB_C gltb_C. This do 25.0 32 0.00081 15.1 1.0 41 55-105 132-172 (251) 60 cd04166 CysN_ATPS CysN_ATPS su 24.6 33 0.00085 15.0 1.0 45 48-100 92-136 (208) 61 PRK08292 AMP nucleosidase; Pro 24.6 50 0.0013 13.8 4.6 131 56-198 272-432 (487) 62 cd00953 KDG_aldolase KDG (2-ke 24.4 50 0.0013 13.8 3.3 68 23-105 113-184 (279) 63 cd03295 ABC_OpuCA_Osmoprotecti 24.1 45 0.0011 14.1 1.6 57 142-203 160-225 (242) 64 cd03290 ABCC_SUR1_N The SUR do 23.6 43 0.0011 14.2 1.4 14 93-106 26-39 (218) 65 cd03229 ABC_Class3 This class 23.4 46 0.0012 14.0 1.6 16 92-107 24-39 (178) 66 cd03300 ABC_PotA_N PotA is an 23.4 46 0.0012 14.0 1.6 21 143-163 156-176 (232) 67 PRK00741 prfC peptide chain re 23.3 47 0.0012 14.0 1.6 28 24-51 123-151 (526) 68 PRK13632 cbiO cobalt transport 23.2 48 0.0012 13.9 1.6 20 144-163 170-189 (273) 69 PRK13491 chemoreceptor glutami 22.9 41 0.001 14.4 1.2 32 39-70 2-34 (199) 70 PRK11545 gntK gluconate kinase 22.9 39 0.00099 14.5 1.1 15 93-107 7-21 (177) 71 cd03479 Rieske_RO_Alpha_PhDO_l 22.5 47 0.0012 14.0 1.5 13 180-193 85-97 (144) 72 KOG0926 consensus 22.3 42 0.0011 14.3 1.2 18 92-109 269-286 (1172) 73 TIGR01892 AcOrn-deacetyl acety 22.1 37 0.00094 14.7 0.9 13 93-105 68-80 (386) 74 cd04168 TetM_like Tet(M)-like 22.1 35 0.00089 14.8 0.7 15 90-104 113-127 (237) 75 pfam08184 Cuticle_2 Cuticle pr 22.1 37 0.00095 14.6 0.9 13 181-193 7-19 (59) 76 cd03296 ABC_CysA_sulfate_impor 22.1 52 0.0013 13.7 1.6 15 186-200 209-223 (239) 77 cd03294 ABC_Pro_Gly_Bertaine T 21.9 53 0.0014 13.6 1.6 17 187-203 234-250 (269) 78 COG0295 Cdd Cytidine deaminase 21.7 56 0.0014 13.5 3.5 66 51-120 31-103 (134) 79 TIGR03415 ABC_choXWV_ATP choli 21.5 54 0.0014 13.6 1.6 14 148-161 195-208 (382) 80 PRK12411 cytidine deaminase; P 21.4 57 0.0015 13.4 3.6 60 55-116 33-97 (132) 81 PRK11131 ATP-dependent RNA hel 21.1 47 0.0012 14.0 1.2 33 168-200 341-375 (1295) 82 cd00886 MogA_MoaB MogA_MoaB fa 21.0 59 0.0015 13.4 6.1 63 35-106 2-74 (152) 83 pfam01875 Memo Memo-like prote 20.8 59 0.0015 13.3 6.2 10 52-61 90-99 (274) 84 cd04170 EF-G_bact Elongation f 20.7 40 0.001 14.4 0.8 21 24-44 108-128 (268) 85 pfam06807 Clp1 Pre-mRNA cleava 20.6 60 0.0015 13.3 2.1 23 179-201 145-171 (187) 86 COG4525 TauB ABC-type taurine 20.5 51 0.0013 13.8 1.3 22 88-109 25-46 (259) 87 PRK13634 cbiO cobalt transport 20.5 60 0.0015 13.3 1.6 21 143-163 158-178 (276) 88 TIGR03167 tRNA_sel_U_synt tRNA 20.0 51 0.0013 13.7 1.2 66 33-110 74-143 (311) No 1 >PRK10437 carbonic anhydrase; Provisional Probab=100.00 E-value=0 Score=389.56 Aligned_cols=194 Identities=31% Similarity=0.480 Sum_probs=173.0 Q ss_pred CCHHHHHHHHHHHHHHHCCC--CHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHCCCCCCEEEEECCCCCCCCCCCCCCC Q ss_conf 97469999999999970687--9889998623899759999626789988885178866248886242234577777530 Q gi|254780663|r 1 MTSFPNTLLERHREFIQDQY--DKKLFQELANQQKPKIMIISCCDSRVAPETIFNAKPGELFVVRNVANIVPPYEPDGQH 78 (207) Q Consensus 1 M~~~~~~Ll~~N~~f~~~~~--~~~~~~~l~~~q~P~~~vitC~DsRv~~~~i~~~~~Gd~fv~RnaGn~v~~~~~~~~~ 78 (207) |..+ ++|++||++|++..+ +|++|++|++||+|+++|||||||||+|+.|+++.|||+||+||+||+|++.+. T Consensus 1 M~~i-~~Ll~gn~~w~~~~~~~~p~~F~~La~GQ~P~~L~IgCsDSRV~p~~I~~~~PGelFV~RNIaNlV~~~d~---- 75 (220) T PRK10437 1 MKDI-DTLISNNALWSKMLVEEDPGFFEKLAQAQKPRFLWIGCSDSRVPAERLTGLEPGELFVHRNVANLVIHTDL---- 75 (220) T ss_pred CCHH-HHHHHHHHHHHHHHHHHCHHHHHHHHCCCCCCEEEEEECCCCCCHHHHCCCCCCCEEEEEECCCCCCCCCC---- T ss_conf 9519-99999899999998764969999862789998499973377889999738998667899633454788884---- Q ss_pred HHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHH----HCCCHHHHHHHHHH Q ss_conf 25788877672205766899981585389887530000134633101468988889999864----01632468799999 Q gi|254780663|r 79 HATSAAIEFAVQGLNVEHIVVMGHGRCGGIQAVLDSNNSSTSPGDFIGKWMDIVRPIAQKIV----ANNPTEKQTILEQL 154 (207) Q Consensus 79 ~~~~~sle~av~~l~v~~iiV~GHt~CGav~aa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~l~~~ 154 (207) +++++|||||.+|+|++|||||||+||||+|+++.. ..+++..|+..+++...... ...+.++.+.++++ T Consensus 76 -~~~aaIEYAV~~L~V~~IIVcGHs~CGgv~A~l~~~-----~~~~v~~Wl~~i~~~~~~~~~~~~~~~~~~~~~~l~e~ 149 (220) T PRK10437 76 -NCLSVVQYAVDVLEVEHIIICGHYGCGGVQAAVENP-----ELGLINNWLLHIRDIWFKHSSLLGEMPQERRLDTLCEL 149 (220) T ss_pred -HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHCCC-----CCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf -089999999997498879995587640578886398-----65547889899999999989875459989999999999 Q ss_pred HHHHHHHHHHCCHHHHHHHHCC-CCEEEEEEEECCCCEEEEEECCCCEEEEC Q ss_conf 9999999973097889999779-96799999988987699984688825833 Q gi|254780663|r 155 SIRNSLKNIRNFPFVNKLEKEH-MLQIHGAWFDISSGKLWILDPTSNEFTCD 205 (207) Q Consensus 155 nV~~~v~~L~~~p~i~~~v~~g-~l~I~G~~yDi~tG~v~~l~~~~~~F~~~ 205 (207) ||+.|++||.++|+|+++++.| +|.||||+||++||+|+.||.+.+.||-. T Consensus 150 NV~~Qv~nL~~~p~V~~a~~~G~~l~iHGw~ydi~tG~l~~L~~~~~~~e~~ 201 (220) T PRK10437 150 NVMEQVYNLGHSTIMQSAWKRGQKVTIHGWAYGIHDGLLRDLDVTATNRETL 201 (220) T ss_pred HHHHHHHHHHCCHHHHHHHHCCCCCEEEEEEEECCCCCEEECCCCCCCHHHH T ss_conf 9999999982088999999769975798899987988678668877876679 No 2 >cd00884 beta_CA_cladeB Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. CAs are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three evolutionarily distinct families of CAs (the alpha-, beta-, and gamma-CAs) which show no significant sequence identity or structural similarity. Within the beta-CA family there are four evolutionarily distinct clades (A through D). The beta-CAs are multimeric enzymes (forming dimers,tetramers,hexamers and octamers) which are present in higher plants, algae, fungi, archaea and prokaryotes. Probab=100.00 E-value=0 Score=377.14 Aligned_cols=184 Identities=45% Similarity=0.821 Sum_probs=166.7 Q ss_pred HHHHHHHCCC--CHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 9999970687--98899986238997599996267899888851788662488862422345777775302578887767 Q gi|254780663|r 11 RHREFIQDQY--DKKLFQELANQQKPKIMIISCCDSRVAPETIFNAKPGELFVVRNVANIVPPYEPDGQHHATSAAIEFA 88 (207) Q Consensus 11 ~N~~f~~~~~--~~~~~~~l~~~q~P~~~vitC~DsRv~~~~i~~~~~Gd~fv~RnaGn~v~~~~~~~~~~~~~~sle~a 88 (207) ||++|+++.+ +|++|++|++||+|+++|||||||||+||.|||++|||+||+|||||+|||++.+..+.++++||||| T Consensus 1 G~~~f~~~~~~~~~~~f~~La~gQ~P~a~vi~CsDSRV~pe~if~~~~GdlFVvRnagNiVpp~~~~~~~~~~~aslEyA 80 (190) T cd00884 1 GFRRFRKEYFPEERELFEKLAKGQSPKALFIACSDSRVVPALITQTQPGELFVVRNVGNLVPPYEPDGGFHGTSAAIEYA 80 (190) T ss_pred CHHHHHHHHHHHCHHHHHHHHCCCCCCEEEEEEECCCCCHHHHHCCCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 97789877631478999998378999679998653589999982899976799841223358776776630378999999 Q ss_pred HHCCCCCEEEEEECCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHC----CCHHHHHHHHHHHHHHHHHHHH Q ss_conf 220576689998158538988753000013463310146898888999986401----6324687999999999999973 Q gi|254780663|r 89 VQGLNVEHIVVMGHGRCGGIQAVLDSNNSSTSPGDFIGKWMDIVRPIAQKIVAN----NPTEKQTILEQLSIRNSLKNIR 164 (207) Q Consensus 89 v~~l~v~~iiV~GHt~CGav~aa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~l~~~nV~~~v~~L~ 164 (207) +.+|+|++|||||||+||||+|+++.... ....+.+..|+..++++....... ++.+..+.++++||+.|+++|+ T Consensus 81 v~~L~v~~IvV~GHs~CGav~Aa~~~~~~-~~~~~~i~~wl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nV~~qv~~L~ 159 (190) T cd00884 81 VAVLKVEHIVVCGHSDCGGIRALLSPEDL-LDKLPFIGKWLRIAEPAKEVVLAELSHADFDDQLRALEKENVLLSLENLL 159 (190) T ss_pred HHHCCCCEEEEECCCCCHHHHHHHCCCCC-CCCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 99639867999458776699988613456-77764589999988989999986336899899999999999999999997 Q ss_pred CCHHHHHHHHCCCCEEEEEEEECCCCEEEEE Q ss_conf 0978899997799679999998898769998 Q gi|254780663|r 165 NFPFVNKLEKEHMLQIHGAWFDISSGKLWIL 195 (207) Q Consensus 165 ~~p~i~~~v~~g~l~I~G~~yDi~tG~v~~l 195 (207) ++|+|++++++|+|.|||||||++||+|++| T Consensus 160 ~~p~i~~~~~~g~l~I~G~~Ydi~tG~v~~f 190 (190) T cd00884 160 TYPFVRERLEAGTLSLHGWYYDIETGELYAY 190 (190) T ss_pred CCHHHHHHHHCCCCEEEEEEEECCCEEEEEC T ss_conf 2999999998799589999998797169849 No 3 >COG0288 CynT Carbonic anhydrase [Inorganic ion transport and metabolism] Probab=100.00 E-value=0 Score=354.53 Aligned_cols=196 Identities=32% Similarity=0.557 Sum_probs=171.0 Q ss_pred CHHHHHHHHHHHHHHHCCC--CHHHHHHHH-CCCCCEEEEEECCCCCCCHHHHHCCCCCCEEEEECCCCCCCCCCCCCCC Q ss_conf 7469999999999970687--988999862-3899759999626789988885178866248886242234577777530 Q gi|254780663|r 2 TSFPNTLLERHREFIQDQY--DKKLFQELA-NQQKPKIMIISCCDSRVAPETIFNAKPGELFVVRNVANIVPPYEPDGQH 78 (207) Q Consensus 2 ~~~~~~Ll~~N~~f~~~~~--~~~~~~~l~-~~q~P~~~vitC~DsRv~~~~i~~~~~Gd~fv~RnaGn~v~~~~~~~~~ 78 (207) |+.++.|++||++|+++.+ .+.+|..++ ++|+|+++||||||||++||.+||++|||+||+|||||+|+|++ T Consensus 1 ~~~~~~ll~gn~~f~~~~~~~~~~~~~~l~~~~Q~P~~lii~C~DSRv~~e~i~~~~pGdlfV~RNaGniV~~~~----- 75 (207) T COG0288 1 MSALKDLLAGNQRFAEGKFPEQSALFRKLADKGQSPKALIITCSDSRVPPELITGLGPGDLFVIRNAGNIVTHPD----- 75 (207) T ss_pred CCHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCCCCEEEEEECCCCCCHHHHHCCCCCCEEEEEECCCCCCCCC----- T ss_conf 907999999999998533432168888874058998579998705788889983899865789963053258886----- Q ss_pred HHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCH-----HHHHHHHH Q ss_conf 2578887767220576689998158538988753000013463310146898888999986401632-----46879999 Q gi|254780663|r 79 HATSAAIEFAVQGLNVEHIVVMGHGRCGGIQAVLDSNNSSTSPGDFIGKWMDIVRPIAQKIVANNPT-----EKQTILEQ 153 (207) Q Consensus 79 ~~~~~sle~av~~l~v~~iiV~GHt~CGav~aa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~l~~ 153 (207) .++++|||||+.+|||++|||||||+||||+++++....... .+..|+..+++........... ++.+.+++ T Consensus 76 ~~~l~sleyAv~~L~v~~IiV~GH~~CGav~aa~~~~~~~~~---~i~~wl~~i~~~~~~~~~~~~~~~~~~~~~~~~~e 152 (207) T COG0288 76 GSVLRSLEYAVYVLGVKEIIVCGHTDCGAVKAALDDQLEGLK---PIPGWLLHIEDLAYAVSNLLGELPGEEDRSDELVE 152 (207) T ss_pred CCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHCCCCCCCCCC---CCCHHHHHHHHHHHHHHCCHHHCCCHHHHHHHHHH T ss_conf 314678999999749988999668984787702534101444---44115667788888764022212326666566788 Q ss_pred HHHHHHHHHHHCCHHHHHHHHCCC-CEEEEEEEECCCCEEEEEECCCCEEEEC Q ss_conf 999999999730978899997799-6799999988987699984688825833 Q gi|254780663|r 154 LSIRNSLKNIRNFPFVNKLEKEHM-LQIHGAWFDISSGKLWILDPTSNEFTCD 205 (207) Q Consensus 154 ~nV~~~v~~L~~~p~i~~~v~~g~-l~I~G~~yDi~tG~v~~l~~~~~~F~~~ 205 (207) .||++|+++|+++|.++.++..++ |.||||+||++||+++.++.....|+.+ T Consensus 153 ~nV~~qv~~L~~~p~v~~~~~~~~~l~vhG~~y~i~~G~l~~~~~~~~~~~~~ 205 (207) T COG0288 153 DNVREQVANLRTHPIVQSALVRGQKVAVHGWVYDIETGRLYVVDVATIDFEAL 205 (207) T ss_pred HHHHHHHHHHHCCCCHHHHHHCCCEEEEEEEEEECCCCEEEEEECCCCCCCCC T ss_conf 87999999986297045555338668998779856875589973355666667 No 4 >cd00883 beta_CA_cladeA Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. CAs are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three evolutionarily distinct families of CAs (the alpha-, beta-, and gamma-CAs) which show no significant sequence identity or structural similarity. Within the beta-CA family there are four evolutionarily distinct clades (A through D). The beta-CAs are multimeric enzymes (forming dimers,tetramers,hexamers and octamers) which are present in higher plants, algae, fungi, archaea and prokaryotes. Probab=100.00 E-value=0 Score=354.78 Aligned_cols=174 Identities=33% Similarity=0.606 Sum_probs=149.2 Q ss_pred HHHHHHCCC--CHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 999970687--988999862389975999962678998888517886624888624223457777753025788877672 Q gi|254780663|r 12 HREFIQDQY--DKKLFQELANQQKPKIMIISCCDSRVAPETIFNAKPGELFVVRNVANIVPPYEPDGQHHATSAAIEFAV 89 (207) Q Consensus 12 N~~f~~~~~--~~~~~~~l~~~q~P~~~vitC~DsRv~~~~i~~~~~Gd~fv~RnaGn~v~~~~~~~~~~~~~~sle~av 89 (207) |++|+++.+ +|++|++|++||+|+++||||||||++||.+||++|||+||+||+||+|+|.| .+++++||||+ T Consensus 1 n~~~~~~~~~~~~~~~~~La~gQ~P~a~vi~CsDSRv~pe~ifd~~~GdlFvvRn~gNiv~~~d-----~~~~aslEyAv 75 (182) T cd00883 1 NRAWAEEKKAKDPDFFPRLAKGQTPEYLWIGCSDSRVPENTILGLLPGEVFVHRNIANLVSPTD-----LNCLSVLQYAV 75 (182) T ss_pred CHHHHHHHHHHCHHHHHHHHCCCCCCEEEEEECCCCCCHHHHCCCCCCCEEEEEECCCCCCCCC-----CHHHHHHHHHH T ss_conf 9768877876498999998578999669998647898999980999976799973245067777-----40899999998 Q ss_pred HCCCCCEEEEEECCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHH----HHHHC-CCHHHHHHHHHHHHHHHHHHHH Q ss_conf 205766899981585389887530000134633101468988889999----86401-6324687999999999999973 Q gi|254780663|r 90 QGLNVEHIVVMGHGRCGGIQAVLDSNNSSTSPGDFIGKWMDIVRPIAQ----KIVAN-NPTEKQTILEQLSIRNSLKNIR 164 (207) Q Consensus 90 ~~l~v~~iiV~GHt~CGav~aa~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~-~~~~~~~~l~~~nV~~~v~~L~ 164 (207) .+|||++|||||||+||||+|+++.... +.+..|+..+++... .+... ++.+..+.++++||+.|+++|+ T Consensus 76 ~~L~v~~IVVlGHs~CGav~Aa~~~~~~-----~~~~~~l~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~NV~~qv~~L~ 150 (182) T cd00883 76 DVLKVKHIIVCGHYGCGGVKAALTGKRL-----GLLDNWLRPIRDVYRLHAAELDALEDEEERVDRLVELNVVEQVKNLC 150 (182) T ss_pred HCCCCCEEEEECCCCCHHHHHHHCCCCC-----CCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 7078877999558885789998638776-----50678999999999988987625599899999999999999999996 Q ss_pred CCHHHHHHHHCC-CCEEEEEEEECCCCEEEEE Q ss_conf 097889999779-9679999998898769998 Q gi|254780663|r 165 NFPFVNKLEKEH-MLQIHGAWFDISSGKLWIL 195 (207) Q Consensus 165 ~~p~i~~~v~~g-~l~I~G~~yDi~tG~v~~l 195 (207) ++|+|++++.++ +|.|||||||++||+|++| T Consensus 151 ~~p~i~~~~~~~~~l~I~G~~Ydi~tG~v~~L 182 (182) T cd00883 151 KTPIVQDAWKRGQELEVHGWVYDLGDGLLRDL 182 (182) T ss_pred CCHHHHHHHHCCCCEEEEEEEEECCCCEEEEC T ss_conf 09899999986994199999998698268739 No 5 >cd03378 beta_CA_cladeC Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. CAs are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three evolutionarily distinct families of CAs (the alpha-, beta-, and gamma-CAs) which show no significant sequence identity or structural similarity. Within the beta-CA family there are four evolutionarily distinct clades (A through D). The beta-CAs are multimeric enzymes (forming dimers,tetramers,hexamers and octamers) which are present in higher plants, algae, fungi, archaea and prokaryotes. Probab=100.00 E-value=0 Score=321.54 Aligned_cols=145 Identities=37% Similarity=0.622 Sum_probs=133.2 Q ss_pred HHHHHHHHHHHHHHCCC-----CHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHCCCCCCEEEEECCCCCCCCCCCCCCC Q ss_conf 69999999999970687-----9889998623899759999626789988885178866248886242234577777530 Q gi|254780663|r 4 FPNTLLERHREFIQDQY-----DKKLFQELANQQKPKIMIISCCDSRVAPETIFNAKPGELFVVRNVANIVPPYEPDGQH 78 (207) Q Consensus 4 ~~~~Ll~~N~~f~~~~~-----~~~~~~~l~~~q~P~~~vitC~DsRv~~~~i~~~~~Gd~fv~RnaGn~v~~~~~~~~~ 78 (207) -+++|++||+||++++. .++.+..|++||+|+++||||||||++||.+||++|||+|++||+||+|+ T Consensus 4 al~~L~~GN~RF~~g~~~~~~~~~~~~~~la~GQ~P~a~vi~CsDSRv~pe~if~~~~Gdlfv~Rn~gn~v~-------- 75 (154) T cd03378 4 ALERLKEGNKRFVSGKPLHPDQDLARRRELAKGQKPFAVILSCSDSRVPPEIIFDQGLGDLFVVRVAGNIVD-------- 75 (154) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHCCCCCCEEEEECCCCCCC-------- T ss_conf 999999999999779977888899999987379998279998607899999985899855799842223489-------- Q ss_pred HHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 25788877672205766899981585389887530000134633101468988889999864016324687999999999 Q gi|254780663|r 79 HATSAAIEFAVQGLNVEHIVVMGHGRCGGIQAVLDSNNSSTSPGDFIGKWMDIVRPIAQKIVANNPTEKQTILEQLSIRN 158 (207) Q Consensus 79 ~~~~~sle~av~~l~v~~iiV~GHt~CGav~aa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~nV~~ 158 (207) .++++|||||+.+|++++|||||||+||||+|+ .++.||.. T Consensus 76 ~~~~~sleyav~~L~v~~ivV~GH~~CGav~Aa---------------------------------------~~~~nV~~ 116 (154) T cd03378 76 DDVLGSLEYAVEVLGVPLVVVLGHESCGAVAAA---------------------------------------AVRANVKA 116 (154) T ss_pred HHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHH---------------------------------------HHHHHHHH T ss_conf 888888999999719988999789874489999---------------------------------------99999999 Q ss_pred HHHHHHC-CHHHHHHHHCCCCEEEEEEEECCCCEEEEE Q ss_conf 9999730-978899997799679999998898769998 Q gi|254780663|r 159 SLKNIRN-FPFVNKLEKEHMLQIHGAWFDISSGKLWIL 195 (207) Q Consensus 159 ~v~~L~~-~p~i~~~v~~g~l~I~G~~yDi~tG~v~~l 195 (207) |+++|++ +|++++++++|+|+||||+||++||+|+++ T Consensus 117 ~~~~L~~~s~~l~~~v~~G~l~i~Ga~Ydi~tG~V~~l 154 (154) T cd03378 117 TVAKLRSRSPIIAELVAAGKLKIVGAYYDLDTGKVEFL 154 (154) T ss_pred HHHHHHHHCHHHHHHHHCCCEEEEEEEEECCCEEEEEC T ss_conf 99999750999999998898099999998387499879 No 6 >pfam00484 Pro_CA Carbonic anhydrase. This family includes carbonic anhydrases as well as a family of non-functional homologues related to YbcF. Probab=100.00 E-value=3.2e-41 Score=287.72 Aligned_cols=146 Identities=39% Similarity=0.733 Sum_probs=130.8 Q ss_pred EEEEECCCCCCCHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHH Q ss_conf 99996267899888851788662488862422345777775302578887767220576689998158538988753000 Q gi|254780663|r 36 IMIISCCDSRVAPETIFNAKPGELFVVRNVANIVPPYEPDGQHHATSAAIEFAVQGLNVEHIVVMGHGRCGGIQAVLDSN 115 (207) Q Consensus 36 ~~vitC~DsRv~~~~i~~~~~Gd~fv~RnaGn~v~~~~~~~~~~~~~~sle~av~~l~v~~iiV~GHt~CGav~aa~~~~ 115 (207) ++||||||||++|+.+||++|||+|++||+||+|++. +.++++||+||+.+|++++|+|||||+||||+++++.. T Consensus 1 alvi~C~DsRv~~~~i~~~~~Gd~fviRn~Gn~v~~~-----~~~~~~sleyAv~~l~v~~IvV~GHt~CGav~a~~~~~ 75 (146) T pfam00484 1 ALVIGCSDSRVPPELIFGLGPGDLFVIRNAGNIVPPP-----DGDVLASLEYAVEVLGVKEIVVLGHTDCGAVKAALDDV 75 (146) T ss_pred CEEEEECCCCCCHHHHHCCCCCCEEEEECCCCCCCCC-----CCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHCC T ss_conf 9799871679898998299997579996156612667-----75634566889985698779997067747999866057 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCEEEEEEEECCCCEE Q ss_conf 01346331014689888899998640163246879999999999999730978899997799679999998898769 Q gi|254780663|r 116 NSSTSPGDFIGKWMDIVRPIAQKIVANNPTEKQTILEQLSIRNSLKNIRNFPFVNKLEKEHMLQIHGAWFDISSGKL 192 (207) Q Consensus 116 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~nV~~~v~~L~~~p~i~~~v~~g~l~I~G~~yDi~tG~v 192 (207) +.+.++.|+..++++...... ...+..+.++++||++|+++|++||+|++++++++|+||||+||++||+| T Consensus 76 -----~~~~l~~~l~~~~~~~~~~~~-~~~~~~~~~~~~nV~~qv~~L~~~p~i~~~~~~~~l~V~G~~ydi~tG~v 146 (146) T pfam00484 76 -----ELGLIDEWLRHIRPAVEEVKS-EFEDELDALVEENVREQVENLRTSPLVPEAVAKGELKVHGAVYDIETGKL 146 (146) T ss_pred -----CCHHHHHHHHHHHHHHHHHHC-CHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCEEEEEEEECCCCCC T ss_conf -----544799999974889998651-30658999999999999999980988899998699689999996888509 No 7 >cd00382 beta_CA Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. CAs are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three evolutionarily distinct families of CAs (the alpha-, beta-, and gamma-CAs) which show no significant sequence identity or structural similarity. Within the beta-CA family there are four evolutionarily distinct clades (A through D). The beta-CAs are multimeric enzymes (forming dimers,tetramers,hexamers and octamers) which are present in higher plants, algae, fungi, archaea and prokaryotes. Probab=100.00 E-value=6e-41 Score=285.92 Aligned_cols=119 Identities=47% Similarity=0.859 Sum_probs=112.4 Q ss_pred CCCEEEEEECCCCCCCHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHH Q ss_conf 99759999626789988885178866248886242234577777530257888776722057668999815853898875 Q gi|254780663|r 32 QKPKIMIISCCDSRVAPETIFNAKPGELFVVRNVANIVPPYEPDGQHHATSAAIEFAVQGLNVEHIVVMGHGRCGGIQAV 111 (207) Q Consensus 32 q~P~~~vitC~DsRv~~~~i~~~~~Gd~fv~RnaGn~v~~~~~~~~~~~~~~sle~av~~l~v~~iiV~GHt~CGav~aa 111 (207) |+|+++||||||||++|+.+||++|||+|++||+||+|+|. +.++++|||||+.+|++++|+|||||+|||++| T Consensus 1 Q~P~~~vi~C~DSRv~p~~i~~~~~Gd~fv~Rn~Gn~v~~~-----d~~~~~sleyav~~L~v~~IvV~GHt~CG~v~a- 74 (119) T cd00382 1 QKPKALIIGCSDSRVPPELIFGLGPGDLFVVRNAGNLVPPY-----DLDVLASLEYAVEVLGVKHIIVCGHTDCGAVKA- 74 (119) T ss_pred CCCCEEEEEEECCCCCHHHHCCCCCCCEEEEECCCCCCCCC-----CCEEEEEHHHHHHHCCCCEEEEECCCCCHHHHH- T ss_conf 99848999886789898998099997589997045400678-----731675312688855998899968987518889- Q ss_pred HHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCEEEEEEEECCCCE Q ss_conf 30000134633101468988889999864016324687999999999999973097889999779967999999889876 Q gi|254780663|r 112 LDSNNSSTSPGDFIGKWMDIVRPIAQKIVANNPTEKQTILEQLSIRNSLKNIRNFPFVNKLEKEHMLQIHGAWFDISSGK 191 (207) Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~nV~~~v~~L~~~p~i~~~v~~g~l~I~G~~yDi~tG~ 191 (207) ++++||+.|+++|+++|+|++++++++++||||+||++||+ T Consensus 75 ---------------------------------------~~~~nV~~qv~~L~~~p~i~~~~~~~~l~v~G~~ydi~tG~ 115 (119) T cd00382 75 ---------------------------------------LVEENVREQVENLRSHPLIQEAVAPGELKVHGWVYDIETGK 115 (119) T ss_pred ---------------------------------------HHHHHHHHHHHHHHCCHHHHHHHHCCCCEEEEEEEECCCCE T ss_conf ---------------------------------------99999999999998098899998789958999999879977 Q ss_pred EEEE Q ss_conf 9998 Q gi|254780663|r 192 LWIL 195 (207) Q Consensus 192 v~~l 195 (207) |+++ T Consensus 116 v~~l 119 (119) T cd00382 116 LEVL 119 (119) T ss_pred EEEC T ss_conf 9869 No 8 >cd03379 beta_CA_cladeD Carbonic anhydrases (CA) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism in which the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide is followed by the regeneration of an active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. CAs are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three evolutionarily distinct families of CAs (the alpha-, beta-, and gamma-CAs) which show no significant sequence identity or structural similarity. Within the beta-CA family there are four evolutionarily distinct clades (A through D). The beta-CAs are multimeric enzymes (forming dimers,tetramers,hexamers and octamers) which are present in higher plants, algae, fungi, archaea and prokaryotes. Probab=100.00 E-value=8.9e-39 Score=271.85 Aligned_cols=142 Identities=23% Similarity=0.325 Sum_probs=110.5 Q ss_pred CCCEEEEEECCCCCCCHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHH Q ss_conf 99759999626789988885178866248886242234577777530257888776722057668999815853898875 Q gi|254780663|r 32 QKPKIMIISCCDSRVAPETIFNAKPGELFVVRNVANIVPPYEPDGQHHATSAAIEFAVQGLNVEHIVVMGHGRCGGIQAV 111 (207) Q Consensus 32 q~P~~~vitC~DsRv~~~~i~~~~~Gd~fv~RnaGn~v~~~~~~~~~~~~~~sle~av~~l~v~~iiV~GHt~CGav~aa 111 (207) ++.++++|||||||++|+.+||++|||+||+|||||+|++ ++++||+||++.||+++|+|||||+|||++++ T Consensus 1 p~~k~~vitC~DSRv~p~~i~~~~pGdlfv~RNaGn~V~~--------~~~~si~~av~~L~v~~IvV~GHt~CGa~~~~ 72 (142) T cd03379 1 PARKLAIVTCMDARLDPEKALGLKLGDAKVIRNAGGRVTD--------DAIRSLVVSVYLLGTREIIVIHHTDCGMLTFT 72 (142) T ss_pred CCCEEEEEEECCCCCCHHHHCCCCCCCEEEEEECCCCCCH--------HHHHHHHHHHHHCCCCEEEEECCCCCCCEECC T ss_conf 9850899996254879899709998747999513342488--------89977999998429978999766675321237 Q ss_pred HHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCEEEEEEEECCCCE Q ss_conf 30000134633101468988889999864016324687999999999999973097889999779967999999889876 Q gi|254780663|r 112 LDSNNSSTSPGDFIGKWMDIVRPIAQKIVANNPTEKQTILEQLSIRNSLKNIRNFPFVNKLEKEHMLQIHGAWFDISSGK 191 (207) Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~nV~~~v~~L~~~p~i~~~v~~g~l~I~G~~yDi~tG~ 191 (207) .+........ .+ .+..................++||+.|+++|++||++++ +++||||+||++||+ T Consensus 73 ~~~~~~~~~~-----~~----~~~~~~~~~~~~~~~~~~~~~~nV~~~v~~L~~~P~v~~-----~v~V~G~~Ydi~tG~ 138 (142) T cd03379 73 DEELKEKMKE-----RG----IAEAYGGIDKEFWFLGFDDLEESVREDVERIRNHPLIPD-----DVPVHGYVYDVKTGK 138 (142) T ss_pred HHHHHHHHHH-----CC----HHHHHCCCCCHHHHCCCCCHHHHHHHHHHHHHHCCCCCC-----CCEEEEEEEECCCCE T ss_conf 7999888886-----22----144532346255540344499999999999985917568-----988999999889987 Q ss_pred EEEE Q ss_conf 9998 Q gi|254780663|r 192 LWIL 195 (207) Q Consensus 192 v~~l 195 (207) |..+ T Consensus 139 v~~V 142 (142) T cd03379 139 LTEV 142 (142) T ss_pred EEEC T ss_conf 9869 No 9 >KOG1578 consensus Probab=100.00 E-value=4.3e-35 Score=247.94 Aligned_cols=193 Identities=29% Similarity=0.493 Sum_probs=166.8 Q ss_pred HHHHHHHHHHHHHHCCC--CHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCHHH Q ss_conf 69999999999970687--9889998623899759999626789988885178866248886242234577777530257 Q gi|254780663|r 4 FPNTLLERHREFIQDQY--DKKLFQELANQQKPKIMIISCCDSRVAPETIFNAKPGELFVVRNVANIVPPYEPDGQHHAT 81 (207) Q Consensus 4 ~~~~Ll~~N~~f~~~~~--~~~~~~~l~~~q~P~~~vitC~DsRv~~~~i~~~~~Gd~fv~RnaGn~v~~~~~~~~~~~~ 81 (207) ..++++++|-.|.++.+ +|++|..++++|+|+.+||+|+|||+.|+.|++.+|||.|++||++|+|||.+... ...+ T Consensus 62 ~~~~i~~~Fv~~~~~~~~~~p~~f~~~~~~qsp~~l~i~csdsRv~~shIL~~~pge~f~irniaNlv~p~~~~~-~~~~ 140 (276) T KOG1578 62 TLEDIGDMFVVRNSGNYIPNPTLFGALAKSQSPEPLALECSDSRVCISHILVCGPGECFAIRNIANLVPPPDKSK-PTNV 140 (276) T ss_pred HHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCEEEEECCCHHHHHHHHHHCCCCCCCCC-CCCC T ss_conf 698998667300134677884666777503798505897203357776278734704768987875268600168-5333 Q ss_pred HHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCC----CHHHHHHHHHHHHH Q ss_conf 88877672205766899981585389887530000134633101468988889999864016----32468799999999 Q gi|254780663|r 82 SAAIEFAVQGLNVEHIVVMGHGRCGGIQAVLDSNNSSTSPGDFIGKWMDIVRPIAQKIVANN----PTEKQTILEQLSIR 157 (207) Q Consensus 82 ~~sle~av~~l~v~~iiV~GHt~CGav~aa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~l~~~nV~ 157 (207) .+++||+|.+|+|++|+||||+.||||++++....+.. ..+++..|.-.-.+......++. -.++....+.+.+. T Consensus 141 ~AalE~aV~~lkvenIiv~ghs~cgGik~~m~~~~~~~-~~~f~~~wv~id~~~kl~~e~~~s~i~~~~Q~~n~~~~a~~ 219 (276) T KOG1578 141 GAALEYAVTTLKVENIIVIGHSLCGGIKGLMSFSLEAP-SRSFIENWVYIDPEAKLAVEDKLSQINFLQQCENCESEAFL 219 (276) T ss_pred CCHHHHHHHHHCCCEEEEECCCCCCCHHHCCCCCCCCC-CHHHHHHHEEECHHHHHHHHHHHHHCHHHHHHHHHHHHHHH T ss_conf 41688899870554389963446786021044534574-20010202254738888788677532199999888999998 Q ss_pred HHHHHHHCCHHHHHHHHCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 99999730978899997799679999998898769998468 Q gi|254780663|r 158 NSLKNIRNFPFVNKLEKEHMLQIHGAWFDISSGKLWILDPT 198 (207) Q Consensus 158 ~~v~~L~~~p~i~~~v~~g~l~I~G~~yDi~tG~v~~l~~~ 198 (207) .++.+|.+||+++.++..|.+++||++||+..|..+++.-+ T Consensus 220 ~s~~~l~sy~~vr~~v~k~~l~~~G~~Y~fskg~~~~~~ld 260 (276) T KOG1578 220 VSLARLLSYPFVREAVVKGFLQVHGGYYNFSKGTKEFWELD 260 (276) T ss_pred HHHHHHHCCHHHHHHHHHCCEEEEEEEEEECCCCEEEEEEC T ss_conf 89887705719998876232144111677326844578730 No 10 >pfam10575 consensus Probab=99.91 E-value=7.4e-25 Score=181.48 Aligned_cols=154 Identities=17% Similarity=0.245 Sum_probs=110.2 Q ss_pred CCCEEEEEECCCCCCCHH--HHHCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEECCCHHHHH Q ss_conf 997599996267899888--851788662488862422345777775302578887767220576689998158538988 Q gi|254780663|r 32 QKPKIMIISCCDSRVAPE--TIFNAKPGELFVVRNVANIVPPYEPDGQHHATSAAIEFAVQGLNVEHIVVMGHGRCGGIQ 109 (207) Q Consensus 32 q~P~~~vitC~DsRv~~~--~i~~~~~Gd~fv~RnaGn~v~~~~~~~~~~~~~~sle~av~~l~v~~iiV~GHt~CGav~ 109 (207) |..+++++||||.|+.+- ..+|.++||+.++|||||.|.+ ...++++||..|++.|++++|+|+||+|||+-. T Consensus 1 ~dKKlaIlTCMDtRL~elLp~AlgiknGDakiIKnAG~~I~h-----PfGsvmRSllvAIYel~veEI~VIGH~dcg~~~ 75 (159) T pfam10575 1 KNKKVLLLTDIEHGIEPIIQQVTNIQPENMLTIHSYDSVIVH-----PYGDIMRSIIIAIYQENVEEIFVVGIEDKETDV 75 (159) T ss_pred CCCEEEEEECCCCCHHHHHHHHHCCCCCCEEEEECCCCEECC-----CCCCHHHHHHHHHHHCCCCEEEEEEEECCCCCC T ss_conf 995699997442355989999868998677899707954306-----761499999999997498189998262567565 Q ss_pred HHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHC--CCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCEEEEEEEEC Q ss_conf 753000013463310146898888999986401--632468799999999999997309788999977996799999988 Q gi|254780663|r 110 AVLDSNNSSTSPGDFIGKWMDIVRPIAQKIVAN--NPTEKQTILEQLSIRNSLKNIRNFPFVNKLEKEHMLQIHGAWFDI 187 (207) Q Consensus 110 aa~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~l~~~nV~~~v~~L~~~p~i~~~v~~g~l~I~G~~yDi 187 (207) ................+.......+ +.... ..+...+.-+++||+++|+-+|+||+|++ .++|||++.|. T Consensus 76 ~~~~~~~~~~~~r~~~~~~~~~~~~---~~~~~dl~~WL~gf~~veesV~~sV~~Ir~HPLiP~-----dV~VhGliid~ 147 (159) T pfam10575 76 VNLQIQRDSIKDNKELDYLFQNCMP---EFSSGSLDEWLSGKENVSENIEKSIDMIRHHPLVPS-----DVKVHGFMIDK 147 (159) T ss_pred CCHHHHHHHHHHHCCHHHHHHHCCH---HHCCCCHHHHHCCCCCHHHHHHHHHHHHHCCCCCCC-----CCCEEEEEECC T ss_conf 6889999999983335888885125---332777899727788877889999999974999999-----98356899869 Q ss_pred CCCEEEEEECC Q ss_conf 98769998468 Q gi|254780663|r 188 SSGKLWILDPT 198 (207) Q Consensus 188 ~tG~v~~l~~~ 198 (207) .||++..++.. T Consensus 148 ~TGkl~~ve~~ 158 (159) T pfam10575 148 TGGKETVVEVS 158 (159) T ss_pred CCCCEEEEECC T ss_conf 99947898448 No 11 >KOG1578 consensus Probab=98.66 E-value=1e-09 Score=83.16 Aligned_cols=195 Identities=21% Similarity=0.298 Sum_probs=129.5 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHCCCCCEEEEEECCCCCCCHHHH----------------HCCCCCCEEEEECCCCCCCC Q ss_conf 99999999706879889998623899759999626789988885----------------17886624888624223457 Q gi|254780663|r 8 LLERHREFIQDQYDKKLFQELANQQKPKIMIISCCDSRVAPETI----------------FNAKPGELFVVRNVANIVPP 71 (207) Q Consensus 8 Ll~~N~~f~~~~~~~~~~~~l~~~q~P~~~vitC~DsRv~~~~i----------------~~~~~Gd~fv~RnaGn~v~~ 71 (207) |+.|-.+|+..... +.-.++.+-++|.+...+|||||+-|... +....||+|++||.||.+|. T Consensus 3 i~~~~~~~~~t~~~-~~~~~~~~mp~~~~~~~~~~dsrml~~r~~~~~~~~~~~~~~~~~~~~~i~~~Fv~~~~~~~~~~ 81 (276) T KOG1578 3 ILRGVIRFRNTTRK-DLVEEIRDMPSPTAVMFTCMDSRMLPTRYNLVAAAKIKKLTAEFDTLEDIGDMFVVRNSGNYIPN 81 (276) T ss_pred CCCCCCHHHHHHHH-HHHHHHHHCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCC T ss_conf 11254010011277-75799885897889999987762144124556543011233113569899866730013467788 Q ss_pred CCCCCCC----HHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHCCC--CCC---CCCHHHHHHHHHHHH------ Q ss_conf 7777530----2578887767220576689998158538988753000013--463---310146898888999------ Q gi|254780663|r 72 YEPDGQH----HATSAAIEFAVQGLNVEHIVVMGHGRCGGIQAVLDSNNSS--TSP---GDFIGKWMDIVRPIA------ 136 (207) Q Consensus 72 ~~~~~~~----~~~~~sle~av~~l~v~~iiV~GHt~CGav~aa~~~~~~~--~~~---~~~~~~~~~~~~~~~------ 136 (207) ....... +--...|+.|.......||++|||++|-++.......... ..+ ...++.|+-.++-.. T Consensus 82 p~~f~~~~~~qsp~~l~i~csdsRv~~shIL~~~pge~f~irniaNlv~p~~~~~~~~~~AalE~aV~~lkvenIiv~gh 161 (276) T KOG1578 82 PTLFGALAKSQSPEPLALECSDSRVCISHILVCGPGECFAIRNIANLVPPPDKSKPTNVGAALEYAVTTLKVENIIVIGH 161 (276) T ss_pred HHHHHHHHCCCCCCCEEEEECCCCCCCCEEEEECCCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHCCCEEEEECC T ss_conf 46667775037985058972033577762787347047689878752686001685333416888998705543899634 Q ss_pred -----------HHHHH-----------CCC---HHHHHHHHHHHHHHHHHHHHCCHHHH--HHHHCCCCEEEE--EEEEC Q ss_conf -----------98640-----------163---24687999999999999973097889--999779967999--99988 Q gi|254780663|r 137 -----------QKIVA-----------NNP---TEKQTILEQLSIRNSLKNIRNFPFVN--KLEKEHMLQIHG--AWFDI 187 (207) Q Consensus 137 -----------~~~~~-----------~~~---~~~~~~l~~~nV~~~v~~L~~~p~i~--~~v~~g~l~I~G--~~yDi 187 (207) ..... .++ ....+.+.+.|..+|..|...+.+.. ..+.....-+++ -|.+. T Consensus 162 s~cgGik~~m~~~~~~~~~~f~~~wv~id~~~kl~~e~~~s~i~~~~Q~~n~~~~a~~~s~~~l~sy~~vr~~v~k~~l~ 241 (276) T KOG1578 162 SLCGGIKGLMSFSLEAPSRSFIENWVYIDPEAKLAVEDKLSQINFLQQCENCESEAFLVSLARLLSYPFVREAVVKGFLQ 241 (276) T ss_pred CCCCCHHHCCCCCCCCCCHHHHHHHEEECHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCEE T ss_conf 46786021044534574200102022547388887886775321999998889999988988770571999887623214 Q ss_pred CCCEEEEEECCCCEEE Q ss_conf 9876999846888258 Q gi|254780663|r 188 SSGKLWILDPTSNEFT 203 (207) Q Consensus 188 ~tG~v~~l~~~~~~F~ 203 (207) -.|..+.|...+.+|= T Consensus 242 ~~G~~Y~fskg~~~~~ 257 (276) T KOG1578 242 VHGGYYNFSKGTKEFW 257 (276) T ss_pred EEEEEEEECCCCEEEE T ss_conf 4111677326844578 No 12 >pfam10070 DUF2309 Uncharacterized protein conserved in bacteria (DUF2309). Members of this family of hypothetical bacterial proteins have no known function. Probab=89.13 E-value=0.86 Score=25.26 Aligned_cols=38 Identities=21% Similarity=0.169 Sum_probs=20.6 Q ss_pred HHHHCCHHHHHHHHCCCCEE------EEEEEECCCCEEEEEECC Q ss_conf 99730978899997799679------999998898769998468 Q gi|254780663|r 161 KNIRNFPFVNKLEKEHMLQI------HGAWFDISSGKLWILDPT 198 (207) Q Consensus 161 ~~L~~~p~i~~~v~~g~l~I------~G~~yDi~tG~v~~l~~~ 198 (207) ..|.+.|-|++.+++..+.| +++.+|-.|-+|.+||.+ T Consensus 534 A~~lN~p~VR~~L~~~GI~IP~dT~F~aa~H~TttD~v~~~d~d 577 (783) T pfam10070 534 AALLNDPEVRAGLAERGIDIPDDTVFVAALHDTTTDEVTLYDLD 577 (783) T ss_pred HHHHCCHHHHHHHHHCCCCCCCCCEEEEEECCCCCCEEEEECCC T ss_conf 99857999999999769978998878753316764337875588 No 13 >TIGR01184 ntrCD nitrate ABC transporter, ATP-binding proteins C and D; InterPro: IPR005890 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This model describes the ATP binding subunits of nitrate transport in bacteria and archaea. This protein belongs to the ATP-binding cassette (ABC) superfamily. It is thought that the two subunits encoded by ntrC and ntrD form the binding surface for interaction with ATP. This model is restricted in identifying ATP binding subunit associated with the nitrate transport. Nitrate assimilation is aided by other proteins derived from the operon which among others include products of ntrA - a regulatory protein; ntrB - a hydropbobic transmembrane permease and narB - a reductase. ; GO: 0015112 nitrate transmembrane transporter activity, 0015706 nitrate transport, 0016020 membrane. Probab=86.95 E-value=2.2 Score=22.62 Aligned_cols=16 Identities=38% Similarity=0.729 Sum_probs=12.8 Q ss_pred CCCCEEEEEECCCHHH Q ss_conf 5766899981585389 Q gi|254780663|r 92 LNVEHIVVMGHGRCGG 107 (207) Q Consensus 92 l~v~~iiV~GHt~CGa 107 (207) -.-+-|-++|||+||= T Consensus 9 ~~GEFisliGHSGCGK 24 (230) T TIGR01184 9 RQGEFISLIGHSGCGK 24 (230) T ss_pred ECCCEEEEECCCCCCH T ss_conf 2673699851278617 No 14 >PRK03604 moaC bifunctional molybdenum cofactor biosynthesis protein C/molybdopterin-binding protein; Provisional Probab=73.95 E-value=1.3 Score=24.00 Aligned_cols=75 Identities=17% Similarity=0.123 Sum_probs=41.3 Q ss_pred CCCCCEEEEEECCCCCCCHH--HHHCCCCCCEEEEECCCCCCCCCCCCCCC-HHHHHHHHHHHHCCCCCEEEEEECCCHH Q ss_conf 38997599996267899888--85178866248886242234577777530-2578887767220576689998158538 Q gi|254780663|r 30 NQQKPKIMIISCCDSRVAPE--TIFNAKPGELFVVRNVANIVPPYEPDGQH-HATSAAIEFAVQGLNVEHIVVMGHGRCG 106 (207) Q Consensus 30 ~~q~P~~~vitC~DsRv~~~--~i~~~~~Gd~fv~RnaGn~v~~~~~~~~~-~~~~~sle~av~~l~v~~iiV~GHt~CG 106 (207) ..+-|+++||||||+-..-+ -.-|--. .-.++++|..|..+.....+ ......|.-++ .-++..|+-.|=|+++ T Consensus 142 ~~~~~~aAViTvSD~~s~G~reD~SG~~l--~e~L~~~G~~V~~~~vVPDd~~~I~~~l~~~~-~~~~dlIlTtGGTG~s 218 (301) T PRK03604 142 ARYGPSAAVLVLSDSIAAGTKEDRSGKLL--VEGLEEAGCEVSHYQIIPDEPEEIAAAVQAWI-AEGVELIITTGGTGLG 218 (301) T ss_pred CCCCCCEEEEEECCCCCCCCCCCCHHHHH--HHHHHHCCCCCCEEEEECCCHHHHHHHHHHHH-HCCCCEEEEECCCCCC T ss_conf 67898669999627313797768456999--99999869810302896898899999999999-7799799970897789 Q ss_pred H Q ss_conf 9 Q gi|254780663|r 107 G 107 (207) Q Consensus 107 a 107 (207) . T Consensus 219 ~ 219 (301) T PRK03604 219 P 219 (301) T ss_pred C T ss_conf 9 No 15 >pfam01707 Peptidase_C9 Peptidase family C9. Probab=71.60 E-value=3.8 Score=21.07 Aligned_cols=35 Identities=20% Similarity=0.437 Sum_probs=28.1 Q ss_pred HHH-HCCHHHHHHHHCCCCEEEEEEEECCCCEEEEEECCCCE Q ss_conf 997-30978899997799679999998898769998468882 Q gi|254780663|r 161 KNI-RNFPFVNKLEKEHMLQIHGAWFDISSGKLWILDPTSNE 201 (207) Q Consensus 161 ~~L-~~~p~i~~~v~~g~l~I~G~~yDi~tG~v~~l~~~~~~ 201 (207) ..| ++||.++.+++.| -.+|+.||++..+++..+- T Consensus 138 ~~l~rryP~~~~a~~tG------~q~d~~t~~~~~~~p~~N~ 173 (202) T pfam01707 138 RQLERRYPFLLGAGKTG------RQIDVTTGRVQVPSARINL 173 (202) T ss_pred HHHHHHCCHHHHHHCCC------CEEECCCCCCCCCCCCCCE T ss_conf 99988673544322158------3886256842258865433 No 16 >COG3002 Uncharacterized protein conserved in bacteria [Function unknown] Probab=70.03 E-value=7.3 Score=19.25 Aligned_cols=20 Identities=30% Similarity=0.640 Sum_probs=12.8 Q ss_pred CCCEEEEEECC------------CHHHHHHHH Q ss_conf 76689998158------------538988753 Q gi|254780663|r 93 NVEHIVVMGHG------------RCGGIQAVL 112 (207) Q Consensus 93 ~v~~iiV~GHt------------~CGav~aa~ 112 (207) -.+.|++|||- ||||+..+. T Consensus 561 fAp~Vvl~GHgSqS~NNPy~aaLdCGACgGaS 592 (880) T COG3002 561 FAPLVVLVGHGSQSQNNPYRAALDCGACGGAS 592 (880) T ss_pred HCCEEEEECCCCCCCCCCHHHHCCCCCCCCCC T ss_conf 17659995356556798024305655546744 No 17 >COG1254 AcyP Acylphosphatases [Energy production and conversion] Probab=69.97 E-value=3.6 Score=21.26 Aligned_cols=19 Identities=21% Similarity=0.298 Sum_probs=17.4 Q ss_pred CCEEEEEEEECCCCEEEEE Q ss_conf 9679999998898769998 Q gi|254780663|r 177 MLQIHGAWFDISSGKLWIL 195 (207) Q Consensus 177 ~l~I~G~~yDi~tG~v~~l 195 (207) +|.|+||+++..+|.|+.+ T Consensus 29 ~lgl~G~V~N~~DGsVeiv 47 (92) T COG1254 29 RLGLTGWVKNLDDGSVEIV 47 (92) T ss_pred HCCCEEEEEECCCCEEEEE T ss_conf 8798799997899829999 No 18 >cd04169 RF3 RF3 subfamily. Peptide chain release factor 3 (RF3) is a protein involved in the termination step of translation in bacteria. Termination occurs when class I release factors (RF1 or RF2) recognize the stop codon at the A-site of the ribosome and activate the release of the nascent polypeptide. The class II release factor RF3 then initiates the release of the class I RF from the ribosome. RF3 binds to the RF/ribosome complex in the inactive (GDP-bound) state. GDP/GTP exchange occurs, followed by the release of the class I RF. Subsequent hydrolysis of GTP to GDP triggers the release of RF3 from the ribosome. RF3 also enhances the efficiency of class I RFs at less preferred stop codons and at stop codons in weak contexts. Probab=59.51 E-value=4.6 Score=20.54 Aligned_cols=27 Identities=7% Similarity=-0.002 Sum_probs=12.9 Q ss_pred HHHHHHHHHHHCCCCCEEEEEECCCHH Q ss_conf 578887767220576689998158538 Q gi|254780663|r 80 ATSAAIEFAVQGLNVEHIVVMGHGRCG 106 (207) Q Consensus 80 ~~~~sle~av~~l~v~~iiV~GHt~CG 106 (207) +-.+-.+-|+.+...=.+||.+.++.. T Consensus 83 DF~~e~~~al~v~D~AviVv~a~~GVe 109 (267) T cd04169 83 DFSEDTYRTLTAVDSAVMVIDAAKGVE 109 (267) T ss_pred HHHHHHHHHHHHHHCEEEEEECCCCCC T ss_conf 899999999988645479952566653 No 19 >TIGR00073 hypB hydrogenase accessory protein HypB; InterPro: IPR004392 The hydrogenase accessory protein HypB is a GTP hydrolase for assembly of nickel metallocentre of hydrogenase. A similar protein, ureG, is an accessory protein for urease, which also uses nickel.; GO: 0016151 nickel ion binding, 0006461 protein complex assembly. Probab=55.97 E-value=10 Score=18.36 Aligned_cols=44 Identities=18% Similarity=0.256 Sum_probs=29.2 Q ss_pred CCCEEEEEECCCCCCCHHH-----------------------HH---------CCCCC-CEEEEECCCCCCCCCCCC Q ss_conf 9975999962678998888-----------------------51---------78866-248886242234577777 Q gi|254780663|r 32 QKPKIMIISCCDSRVAPET-----------------------IF---------NAKPG-ELFVVRNVANIVPPYEPD 75 (207) Q Consensus 32 q~P~~~vitC~DsRv~~~~-----------------------i~---------~~~~G-d~fv~RnaGn~v~~~~~~ 75 (207) ..|.+++.|=+=|..+.+. |- +..-- |+..|=|+||+|.|.+.+ T Consensus 61 ~~K~Avi~GD~~t~~DA~RlR~~G~~a~~~nTGk~CHLdA~mv~G~~~~L~~~~ld~~~DlL~IENVGNLvCP~~Fd 137 (225) T TIGR00073 61 EVKIAVIEGDVQTKNDADRLRKYGVPAIQINTGKECHLDAHMVAGAIHALKDLPLDDISDLLLIENVGNLVCPADFD 137 (225) T ss_pred CCCEEEEECCHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCEECCCCC T ss_conf 89789997553225569999864986886368864440166786587554216888714626886447610067311 No 20 >PRK10463 hydrogenase nickel incorporation protein HypB; Provisional Probab=52.65 E-value=18 Score=16.73 Aligned_cols=45 Identities=20% Similarity=0.238 Sum_probs=31.2 Q ss_pred CCCCEEEEEECCCCCCCHHHH-----------------------------HCCCCCCEEEEECCCCCCCCCCCC Q ss_conf 899759999626789988885-----------------------------178866248886242234577777 Q gi|254780663|r 31 QQKPKIMIISCCDSRVAPETI-----------------------------FNAKPGELFVVRNVANIVPPYEPD 75 (207) Q Consensus 31 ~q~P~~~vitC~DsRv~~~~i-----------------------------~~~~~Gd~fv~RnaGn~v~~~~~~ 75 (207) +..|.+++-+-..+..+.+.| |+....|+..|-|+||+|.|...+ T Consensus 130 ~~~~~aVIeGD~~T~~DA~RI~~~Gv~avQInTG~~CHLDA~MV~~al~~l~l~~~dllfIENVGNLVCPA~FD 203 (290) T PRK10463 130 DSVPCAVIEGDQQTVNDAARIRATGTPAIQVNTGKGCHLDAQMIADAAPRLPLDDNGILFIENVGNLVCPASFD 203 (290) T ss_pred CCCCEEEEEECCCCHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCC T ss_conf 36757999604235667999997699589954799767599999999984898779899981278843551203 No 21 >TIGR00385 dsbE periplasmic protein thiol:disulfide oxidoreductases, DsbE subfamily; InterPro: IPR004799 Periplasmic protein thiol:disulphide oxidoreductase is involved in the biogenesis of c-type cytochromes as well as in disulphide bond formation in some periplasmic proteins. This group defines the DsbE subfamily.; GO: 0015036 disulfide oxidoreductase activity, 0017004 cytochrome complex assembly, 0030288 outer membrane-bounded periplasmic space. Probab=47.59 E-value=17 Score=16.91 Aligned_cols=36 Identities=33% Similarity=0.423 Sum_probs=25.3 Q ss_pred HHHHHHHHHH-H-CCHHHHHHHHCCCCEEEEEEEECCC Q ss_conf 9999999997-3-0978899997799679999998898 Q gi|254780663|r 154 LSIRNSLKNI-R-NFPFVNKLEKEHMLQIHGAWFDISS 189 (207) Q Consensus 154 ~nV~~~v~~L-~-~~p~i~~~v~~g~l~I~G~~yDi~t 189 (207) .||-.|=--- | +||.+.+.-++|+|+|+|.=|-=++ T Consensus 69 lNVWAsWCp~CraEHp~L~eLak~Grl~ivG~dYKD~~ 106 (175) T TIGR00385 69 LNVWASWCPPCRAEHPYLNELAKEGRLPIVGVDYKDQS 106 (175) T ss_pred EEEEECCCCCCHHCCHHHHHHHHCCCCEEEEEECCCCC T ss_conf 98840368883100057899975397307886347884 No 22 >pfam00708 Acylphosphatase Acylphosphatase. Probab=46.84 E-value=22 Score=16.15 Aligned_cols=19 Identities=21% Similarity=0.333 Sum_probs=15.6 Q ss_pred CCEEEEEEEECCCCEEEEE Q ss_conf 9679999998898769998 Q gi|254780663|r 177 MLQIHGAWFDISSGKLWIL 195 (207) Q Consensus 177 ~l~I~G~~yDi~tG~v~~l 195 (207) ++.|.||+-+..+|.|+.+ T Consensus 28 ~lgl~G~V~N~~dG~Vei~ 46 (90) T pfam00708 28 ELGLKGYVRNLPDGTVEIV 46 (90) T ss_pred HCCCEEEEEECCCCEEEEE T ss_conf 8598799999999909999 No 23 >COG1116 TauB ABC-type nitrate/sulfonate/bicarbonate transport system, ATPase component [Inorganic ion transport and metabolism] Probab=46.10 E-value=9.9 Score=18.36 Aligned_cols=16 Identities=25% Similarity=0.609 Sum_probs=11.5 Q ss_pred CCCCEEEEEECCCHHH Q ss_conf 5766899981585389 Q gi|254780663|r 92 LNVEHIVVMGHGRCGG 107 (207) Q Consensus 92 l~v~~iiV~GHt~CGa 107 (207) -.-+-+.++|||+||= T Consensus 27 ~~GEfvsilGpSGcGK 42 (248) T COG1116 27 EKGEFVAILGPSGCGK 42 (248) T ss_pred CCCCEEEEECCCCCCH T ss_conf 7997999989997889 No 24 >cd03548 Rieske_RO_Alpha_OMO_CARDO Rieske non-heme iron oxygenase (RO) family, 2-Oxoquinoline 8-monooxygenase (OMO) and Carbazole 1,9a-dioxygenase (CARDO) subfamily, N-terminal Rieske domain of the oxygenase alpha subunit; ROs comprise a large class of aromatic ring-hydroxylating dioxygenases that enable microorganisms to tolerate and utilize aromatic compounds for growth. The oxygenase alpha subunit contains an N-terminal Rieske domain with an [2Fe-2S] cluster and a C-terminal catalytic domain with a mononuclear Fe(II) binding site. The Rieske [2Fe-2S] cluster accepts electrons from a reductase or ferredoxin component and transfers them to the mononuclear iron for catalysis. OMO catalyzes the NADH-dependent oxidation of the N-heterocyclic aromatic compound 2-oxoquinoline to 8-hydroxy-2-oxoquinoline, the second step in the bacterial degradation of quinoline. OMO consists of a reductase component (OMR) and an oxygenase component (OMO) that together function to shuttle electrons from the Probab=45.77 E-value=12 Score=17.81 Aligned_cols=18 Identities=28% Similarity=0.420 Sum_probs=11.8 Q ss_pred CEEEEEEEECCCCEEEEE Q ss_conf 679999998898769998 Q gi|254780663|r 178 LQIHGAWFDISSGKLWIL 195 (207) Q Consensus 178 l~I~G~~yDi~tG~v~~l 195 (207) -.-|||-||+++|++... T Consensus 76 CpyHGW~fd~~~G~~~~i 93 (136) T cd03548 76 CWYHGWTYRLDDGKLVTI 93 (136) T ss_pred CCCCCCEEECCCCEEEEC T ss_conf 489887897999779665 No 25 >cd03478 Rieske_AIFL_N AIFL (apoptosis-inducing factor like) family, N-terminal Rieske domain; members of this family show similarity to human AIFL, containing an N-terminal Rieske domain and a C-terminal pyridine nucleotide-disulfide oxidoreductase domain (Pyr_redox). The Rieske domain is a [2Fe-2S] cluster binding domain involved in electron transfer. AIFL shares 35% homology with human AIF (apoptosis-inducing factor), mainly in the Pyr_redox domain. AIFL is predominantly localized to the mitochondria. AIFL induces apoptosis in a caspase-dependent manner. Probab=45.45 E-value=8.2 Score=18.91 Aligned_cols=27 Identities=19% Similarity=0.304 Sum_probs=18.3 Q ss_pred CEEEEEEEECCCCEEEEEEC--CCCEEEE Q ss_conf 67999999889876999846--8882583 Q gi|254780663|r 178 LQIHGAWFDISSGKLWILDP--TSNEFTC 204 (207) Q Consensus 178 l~I~G~~yDi~tG~v~~l~~--~~~~F~~ 204 (207) -+.|||.||++||+...-.. .-+.|.. T Consensus 59 Cp~Hg~~Fdl~tG~~~~~P~~~~l~~y~v 87 (95) T cd03478 59 CPWHGACFNLRTGDIEDAPALDSLPCYEV 87 (95) T ss_pred CCCCCCEEECCCCCEECCCCCCCCCEEEE T ss_conf 08999889899877967887689758989 No 26 >pfam00355 Rieske Rieske [2Fe-2S] domain. The rieske domain has a [2Fe-2S] centre. Two conserved cysteines that one Fe ion while the other Fe ion is coordinated by two conserved histidines. Probab=45.05 E-value=11 Score=18.10 Aligned_cols=21 Identities=38% Similarity=0.401 Sum_probs=16.1 Q ss_pred CEEEEEEEECCCCEEEEEECC Q ss_conf 679999998898769998468 Q gi|254780663|r 178 LQIHGAWFDISSGKLWILDPT 198 (207) Q Consensus 178 l~I~G~~yDi~tG~v~~l~~~ 198 (207) -+-|||.||++||++..-... T Consensus 66 Cp~Hg~~f~~~~G~~~~~P~~ 86 (99) T pfam00355 66 CPYHGWRFDLSTGKLVGGPAP 86 (99) T ss_pred CCCCCCEEECCCCCEECCCCC T ss_conf 388499986999789169998 No 27 >KOG2564 consensus Probab=43.32 E-value=14 Score=17.32 Aligned_cols=34 Identities=21% Similarity=0.285 Sum_probs=22.4 Q ss_pred HHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHC Q ss_conf 8877672205766899981585389887530000 Q gi|254780663|r 83 AAIEFAVQGLNVEHIVVMGHGRCGGIQAVLDSNN 116 (207) Q Consensus 83 ~sle~av~~l~v~~iiV~GHt~CGav~aa~~~~~ 116 (207) +++.-..+--..+-|+++|||==|+|.+-..... T Consensus 134 ~~~i~~~fge~~~~iilVGHSmGGaIav~~a~~k 167 (343) T KOG2564 134 GAVIKELFGELPPQIILVGHSMGGAIAVHTAASK 167 (343) T ss_pred HHHHHHHHCCCCCCEEEEECCCCCHHHHHHHHHH T ss_conf 9999998356898669984266623323065652 No 28 >TIGR02528 EutP ethanolamine utilization protein, EutP; InterPro: IPR012381 Members of this family function in ethanolamine and propanediol degradation pathways. Both pathways require coenzyme B12 (adenosylcobalamin, AdoCbl). Bacteria that harbour these pathways can use ethanolamine as a source of carbon and nitrogen, or propanediol as a sole carbon and energy source, respectively. The exact roles of the EutP and PduV proteins in these respective pathways are not yet determined. Members of this family contain P-loop consensus motifs in the N-terminal part, and are distantly related to various GTPases and ATPases, including ATPase components of transport systems. Propanediol degradation is thought to be important for the natural Salmonella populations, since propanediol is produced by the fermentation of the common plant sugars rhamnose and fucose , . More than 10f the Salmonella enterica genome is devoted to the utilisation of propanediol and cobalamin biosynthesis. In vivo expression technology has indicated that propanediol utilisation (pdu) genes may be important for growth in host tissues, and competitive index studies with mice have shown that pdu mutations confer a virulence defect , . The pdu operon is contiguous and co-regulated with the cobalamin (B12) biosynthesis cob operon, indicating that propanediol catabolism may be the primary reason for de novo B12 synthesis in Salmonella , , . Please see IPR003207 from INTERPRO, IPR003208 from INTERPRO, IPR009204 from INTERPRO, IPR009191 from INTERPRO, IPR009192 from INTERPRO for more details on the propanediol utilisation pathway and the pdu operon.; GO: 0005524 ATP binding, 0006576 biogenic amine metabolic process. Probab=42.33 E-value=12 Score=17.76 Aligned_cols=87 Identities=17% Similarity=0.274 Sum_probs=44.5 Q ss_pred CHHHHHHH---HCCCCCEEEEEECCCCCCCHHHHHCCCCCCEEEE-ECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCE Q ss_conf 98899986---2389975999962678998888517886624888-6242234577777530257888776722057668 Q gi|254780663|r 21 DKKLFQEL---ANQQKPKIMIISCCDSRVAPETIFNAKPGELFVV-RNVANIVPPYEPDGQHHATSAAIEFAVQGLNVEH 96 (207) Q Consensus 21 ~~~~~~~l---~~~q~P~~~vitC~DsRv~~~~i~~~~~Gd~fv~-RnaGn~v~~~~~~~~~~~~~~sle~av~~l~v~~ 96 (207) ++.+|..| +..-.=.++|.+|-|. +..|--+.-++|+= |.+| +|+--|+...+... .-.+--...-|.++ T Consensus 50 nr~~Y~AL~vtaaDAd~i~lV~~a~~~----~~~f~PgF~~~f~kK~~IG-~vTK~DLA~~d~~i-~r~~~~L~~AG~~~ 123 (144) T TIGR02528 50 NRRYYSALIVTAADADVIALVQSATDE----ESRFSPGFASIFVKKEVIG-IVTKIDLAEADKDI-ERAKRLLETAGAEK 123 (144) T ss_pred CCCHHHHHHHHHHCCEEEEEEECCCCC----CCCCCCCCCCCCCCCCEEE-EEEECCCCCCHHHH-HHHHHHHHHCCCCC T ss_conf 752378888887210236677357764----2237850002367886347-88403788773479-99999987236543 Q ss_pred EEEEECCCHHHHHHHHH Q ss_conf 99981585389887530 Q gi|254780663|r 97 IVVMGHGRCGGIQAVLD 113 (207) Q Consensus 97 iiV~GHt~CGav~aa~~ 113 (207) |-.++--|=-|+.+..+ T Consensus 124 iF~~~~~d~~G~~~l~~ 140 (144) T TIGR02528 124 IFEISSVDEKGIEELVD 140 (144) T ss_pred CEEECCCCCHHHHHHHH T ss_conf 31650778045899999 No 29 >cd03467 Rieske Rieske domain; a [2Fe-2S] cluster binding domain commonly found in Rieske non-heme iron oxygenase (RO) systems such as naphthalene and biphenyl dioxygenases, as well as in plant/cyanobacterial chloroplast b6f and mitochondrial cytochrome bc(1) complexes. The Rieske domain can be divided into two subdomains, with an incomplete six-stranded, antiparallel beta-barrel at one end, and an iron-sulfur cluster binding subdomain at the other. The Rieske iron-sulfur center contains a [2Fe-2S] cluster, which is involved in electron transfer, and is liganded to two histidine and two cysteine residues present in conserved sequences called Rieske motifs. In RO systems, the N-terminal Rieske domain of the alpha subunit acts as an electron shuttle that accepts electrons from a reductase or ferredoxin component and transfers them to the mononuclear iron in the alpha subunit C-terminal domain to be used for catalysis. Probab=41.62 E-value=9.9 Score=18.38 Aligned_cols=18 Identities=28% Similarity=0.481 Sum_probs=14.7 Q ss_pred CEEEEEEEECCCCEEEEE Q ss_conf 679999998898769998 Q gi|254780663|r 178 LQIHGAWFDISSGKLWIL 195 (207) Q Consensus 178 l~I~G~~yDi~tG~v~~l 195 (207) -+-|||.||++||++..- T Consensus 61 CP~H~~~Fdl~tG~~~~~ 78 (98) T cd03467 61 CPCHGSRFDLRTGEVVSG 78 (98) T ss_pred CCCCCCEEECCCCCEECC T ss_conf 688898897899729678 No 30 >cd03528 Rieske_RO_ferredoxin Rieske non-heme iron oxygenase (RO) family, Rieske ferredoxin component; composed of the Rieske ferredoxin component of some three-component RO systems including biphenyl dioxygenase (BPDO) and carbazole 1,9a-dioxygenase (CARDO). The RO family comprise a large class of aromatic ring-hydroxylating dioxygenases found predominantly in microorganisms. These enzymes enable microorganisms to tolerate and even exclusively utilize aromatic compounds for growth. ROs consist of two or three components: reductase, oxygenase, and ferredoxin (in some cases) components. The ferredoxin component contains either a plant-type or Rieske-type [2Fe-2S] cluster. The Rieske ferredoxin component in this family carries an electron from the RO reductase component to the terminal RO oxygenase component. BPDO degrades biphenyls and polychlorinated biphenyls. BPDO ferredoxin (BphF) has structural features consistent with a minimal and perhaps archetypical Rieske protein in that the in Probab=41.43 E-value=9.4 Score=18.51 Aligned_cols=18 Identities=39% Similarity=0.713 Sum_probs=14.4 Q ss_pred CEEEEEEEECCCCEEEEE Q ss_conf 679999998898769998 Q gi|254780663|r 178 LQIHGAWFDISSGKLWIL 195 (207) Q Consensus 178 l~I~G~~yDi~tG~v~~l 195 (207) -+.|||.||++||++.-- T Consensus 60 Cp~H~~~Fdl~tG~~~~~ 77 (98) T cd03528 60 CPLHGGRFDLRTGKALSL 77 (98) T ss_pred CCCCCCEEECCCCCEECC T ss_conf 987899898998768078 No 31 >TIGR00088 trmD tRNA (guanine-N1)-methyltransferase; InterPro: IPR002649 In transfer RNA many different modified nucleosides are found, especially in the anticodon region. tRNA (guanine-N1-)-methyltransferase 2.1.1.31 from EC is one of several nucleases operating together with the tRNA-modifying enzymes before the formation of the mature tRNA. It catalyses the reaction:S-adenosyl-L-methionine + tRNA -> S-adenosyl-L-homocysteine + tRNA containing N1-methylguanine methylating guanosine(G) to N1-methylguanine (1-methylguanosine (m1G)) at position 37 of tRNAs that read CUN (leucine), CCN(proline), and CGG (arginine) codons. The presence of m1G improves the cellular growth rate and the polypeptide steptime and also prevents the tRNA from shifting the reading frame . The mechanism of the trmD3-induced frameshift involving mutant tRNA(Pro) and tRNA(Leu) species has been investigated . It has been suggested that the conformation of the anticodon loop may be a major determining element for the formation of m1G37 in vivo . ; GO: 0003723 RNA binding, 0008175 tRNA methyltransferase activity, 0008033 tRNA processing. Probab=41.07 E-value=13 Score=17.55 Aligned_cols=12 Identities=8% Similarity=0.227 Sum_probs=9.3 Q ss_pred CCCEEEEEECCC Q ss_conf 766899981585 Q gi|254780663|r 93 NVEHIVVMGHGR 104 (207) Q Consensus 93 ~v~~iiV~GHt~ 104 (207) .=++|+||||+. T Consensus 105 ~~~lvLiCGrYE 116 (247) T TIGR00088 105 KEHLVLICGRYE 116 (247) T ss_pred CCCEEEEECCCC T ss_conf 284789956503 No 32 >TIGR01347 sucB 2-oxoglutarate dehydrogenase, E2 component, dihydrolipoamide succinyltransferase; InterPro: IPR006255 These sequences describe the TCA cycle 2-oxoglutarate system E2 component, dihydrolipoamide succinyltransferase. It is closely related to the pyruvate dehydrogenase E2 component, dihydrolipoamide acetyltransferase. Members include the mitochondrial and Gram-negative bacterial forms. Mycobacterial candidates are highly derived, differ in having and extra copy of the lipoyl-binding domain at the N-terminus and are generally excluded from this group of sequences as are all examples of dihydrolipoamide acetyltransferase. ; GO: 0004149 dihydrolipoyllysine-residue succinyltransferase activity, 0006099 tricarboxylic acid cycle, 0045252 oxoglutarate dehydrogenase complex. Probab=40.58 E-value=12 Score=17.76 Aligned_cols=40 Identities=20% Similarity=0.363 Sum_probs=33.3 Q ss_pred HHHHHHHHHHCCHHHHHHHHC--CCCEEEEEEEECC------CCEEEEE Q ss_conf 999999997309788999977--9967999999889------8769998 Q gi|254780663|r 155 SIRNSLKNIRNFPFVNKLEKE--HMLQIHGAWFDIS------SGKLWIL 195 (207) Q Consensus 155 nV~~~v~~L~~~p~i~~~v~~--g~l~I~G~~yDi~------tG~v~~l 195 (207) -|...+..|+++|.|...+++ .++--|+ ||||. .|.|.-+ T Consensus 265 FvKA~~~ALK~~P~VNA~IDGtt~divy~~-Y~DIsvAVsTd~GLVvPV 312 (435) T TIGR01347 265 FVKAVVEALKRFPEVNAEIDGTTDDIVYKD-YYDISVAVSTDRGLVVPV 312 (435) T ss_pred HHHHHHHHHHHCCCCCCEECCCCCEEEEEC-CEEEEEEEECCCCCEEEE T ss_conf 789999998724423312157433278754-557999987599833422 No 33 >cd03529 Rieske_NirD Assimilatory nitrite reductase (NirD) family, Rieske domain; Assimilatory nitrate and nitrite reductases convert nitrate through nitrite to ammonium. Members include bacterial and fungal proteins. The bacterial NirD contains a single Rieske domain while fungal proteins have a C-terminal Rieske domain in addition to several other domains. The fungal NirD is involved in nutrient acquisition, functioning at the soil/fungus interface to control nutrient exchange between the fungus and the host plant. The Rieske domain is a [2Fe-2S] cluster binding domain involved in electron transfer. The Rieske [2Fe-2S] cluster is liganded to two histidine and two cysteine residues present in conserved sequences called Rieske motifs. In this family, only a few members contain these residues. Other members may have lost the ability to bind the Rieske [2Fe-2S] cluster. Probab=39.07 E-value=9.4 Score=18.51 Aligned_cols=16 Identities=13% Similarity=0.318 Sum_probs=13.1 Q ss_pred CEEEEEEEECCCCEEE Q ss_conf 6799999988987699 Q gi|254780663|r 178 LQIHGAWFDISSGKLW 193 (207) Q Consensus 178 l~I~G~~yDi~tG~v~ 193 (207) -+.|||.||++||+.. T Consensus 66 CP~H~~~Fdl~tG~~~ 81 (103) T cd03529 66 SPLYKQHFSLKTGRCL 81 (103) T ss_pred CCCCCCEEECCCCCCC T ss_conf 6767988989987636 No 34 >TIGR00055 uppS undecaprenyl diphosphate synthase; InterPro: IPR001441 Synonym(s): Di-trans-poly-cis-undecaprenyl-diphosphate synthase, Undecaprenyl pyrophosphate synthetase, Undecaprenyl pyrophosphate synthase, UPP synthetase Di-trans-poly-cis-decaprenylcistransferase (2.5.1.31 from EC) (UPP synthetase) generates undecaprenyl pyrophosphate (UPP) from isopentenyl pyrophosphate (IPP) . This bacterial enzyme is also found in archaebacteria and in a number of uncharacterised proteins including some from yeasts.; GO: 0016740 transferase activity, 0008152 metabolic process. Probab=38.60 E-value=1 Score=24.71 Aligned_cols=54 Identities=13% Similarity=0.260 Sum_probs=40.0 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHCCCCCCEEEEECCCC Q ss_conf 999999997068798899986238997599996267899888851788662488862422 Q gi|254780663|r 8 LLERHREFIQDQYDKKLFQELANQQKPKIMIISCCDSRVAPETIFNAKPGELFVVRNVAN 67 (207) Q Consensus 8 Ll~~N~~f~~~~~~~~~~~~l~~~q~P~~~vitC~DsRv~~~~i~~~~~Gd~fv~RnaGn 67 (207) +++||.||++.+-.+..+.+-+...+=+-++--|.|..|..-+++ +|..=|+.= T Consensus 8 IMDGNgRwAk~~~k~r~~GH~~G~~~l~~~l~~c~~~Gv~~lTlY------AFS~ENw~R 61 (237) T TIGR00055 8 IMDGNGRWAKKKGKDRKYGHEAGVKSLRRILRWCADLGVECLTLY------AFSTENWKR 61 (237) T ss_pred EECCCHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEE------EEECCCCCC T ss_conf 717616999853883445568778999999999986693057533------210002579 No 35 >cd03474 Rieske_T4moC Toluene-4-monooxygenase effector protein complex (T4mo), Rieske ferredoxin subunit; The Rieske domain is a [2Fe-2S] cluster binding domain involved in electron transfer. T4mo is a four-protein complex that catalyzes the NADH- and O2-dependent hydroxylation of toluene to form p-cresol. T4mo consists of an NADH oxidoreductase (T4moF), a diiron hydroxylase (T4moH), a catalytic effector protein (T4moD), and a Rieske ferredoxin (T4moC). T4moC contains a Rieske domain and functions as an obligate electron carrier between T4moF and T4moH. Rieske ferredoxins are found as subunits of membrane oxidase complexes, cis-dihydrodiol-forming aromatic dioxygenases, bacterial assimilatory nitrite reductases, and arsenite oxidase. Rieske ferredoxins are also found as soluble electron carriers in bacterial dioxygenase and monooxygenase complexes. Probab=38.53 E-value=12 Score=17.94 Aligned_cols=16 Identities=25% Similarity=0.212 Sum_probs=13.2 Q ss_pred CEEEEEEEECCCCEEE Q ss_conf 6799999988987699 Q gi|254780663|r 178 LQIHGAWFDISSGKLW 193 (207) Q Consensus 178 l~I~G~~yDi~tG~v~ 193 (207) -+.|||.||++||+.. T Consensus 61 Cp~H~~~Fdl~tG~~~ 76 (108) T cd03474 61 CRAHLWQFDADTGEGL 76 (108) T ss_pred CCCCCCEEECCCCCCC T ss_conf 4884888999998799 No 36 >TIGR01208 rmlA_long glucose-1-phosphate thymidylyltransferase; InterPro: IPR005908 Synonym: dTDP-D-glucose synthase This group of proteins comprises a tightly conserved but broadly distributed subfamily of known and putative bacterial glucose-1-phosphate thymidylyltransferases (2.7.7.24 from EC). It is well characterised in several species as the first of four enzymes involved in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.dTTP + alpha-D-glucose 1-phosphate = diphosphate + dTDP-glucose The family of known and putative glucose-1-phosphate thymidyltransferases shows a deep split into a short form (see IPR005907 from INTERPRO) and a long form described by this model. The homotetrameric short form is found in numerous bacterial species that incorporate dTDP-L-rhamnose, which it helps synthesize, into the cell wall. It is subject to feedback inhibition. The long form, in contrast, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced. . Probab=38.07 E-value=27 Score=15.56 Aligned_cols=35 Identities=23% Similarity=0.343 Sum_probs=23.2 Q ss_pred HHHHHHHHHHCC--------HHHHHHHHCCC----CEEEEEEEECCC Q ss_conf 999999997309--------78899997799----679999998898 Q gi|254780663|r 155 SIRNSLKNIRNF--------PFVNKLEKEHM----LQIHGAWFDISS 189 (207) Q Consensus 155 nV~~~v~~L~~~--------p~i~~~v~~g~----l~I~G~~yDi~t 189 (207) -|.+.+++|+=| ..|+.+++.|. .+|.|||.|..+ T Consensus 181 ~if~~~~~~kPSwRGElEITD~IQ~lIe~Gy~V~~~~v~GWWkDTGk 227 (361) T TIGR01208 181 LIFEAIKNIKPSWRGELEITDAIQYLIEKGYKVGGSKVKGWWKDTGK 227 (361) T ss_pred HHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEEEEECCCCC T ss_conf 98889852888776621577577565315837777999788701598 No 37 >COG2146 {NirD} Ferredoxin subunits of nitrite reductase and ring-hydroxylating dioxygenases [Inorganic ion transport and metabolism / General function prediction only] Probab=37.64 E-value=11 Score=17.99 Aligned_cols=18 Identities=33% Similarity=0.573 Sum_probs=13.9 Q ss_pred CEEEEEEEECCCCEEEEE Q ss_conf 679999998898769998 Q gi|254780663|r 178 LQIHGAWFDISSGKLWIL 195 (207) Q Consensus 178 l~I~G~~yDi~tG~v~~l 195 (207) -+.|+|.||+.||+...- T Consensus 66 Cp~H~a~Fdl~tG~~~~~ 83 (106) T COG2146 66 CPLHGARFDLRTGECLEP 83 (106) T ss_pred CCCCCCEEECCCCCEECC T ss_conf 986788998798877368 No 38 >PRK09965 3-phenylpropionate dioxygenase ferredoxin subunit; Provisional Probab=36.77 E-value=9.7 Score=18.43 Aligned_cols=27 Identities=22% Similarity=0.352 Sum_probs=17.6 Q ss_pred CEEEEEEEECCCCEEEEEEC--CCCEEEE Q ss_conf 67999999889876999846--8882583 Q gi|254780663|r 178 LQIHGAWFDISSGKLWILDP--TSNEFTC 204 (207) Q Consensus 178 l~I~G~~yDi~tG~v~~l~~--~~~~F~~ 204 (207) -+.|||.||+.||+..-... .-+.|.+ T Consensus 60 CP~H~~~Fdl~tG~~~~~p~~~~L~~y~V 88 (104) T PRK09965 60 CPLHAASFCLRTGKALCLPATDPLRTYPV 88 (104) T ss_pred CCCCCCEEECCCCCCCCCCCCCCCCEEEE T ss_conf 88789999879877427886588667979 No 39 >KOG4780 consensus Probab=36.70 E-value=18 Score=16.68 Aligned_cols=32 Identities=38% Similarity=0.211 Sum_probs=16.3 Q ss_pred ECCCCCCCHHHHHCCCCCCEEEEECCCCCCCC Q ss_conf 62678998888517886624888624223457 Q gi|254780663|r 40 SCCDSRVAPETIFNAKPGELFVVRNVANIVPP 71 (207) Q Consensus 40 tC~DsRv~~~~i~~~~~Gd~fv~RnaGn~v~~ 71 (207) .-.+||.+...+.+-+.||+--+||++|++.| T Consensus 390 s~~~s~~~~w~~~~~~~~e~~~~r~~~~a~~p 421 (467) T KOG4780 390 SKKLSRSVTWADQNDGRGELCEVRNNDNAAGP 421 (467) T ss_pred CCCCCCCCHHHHCCCCCCCHHHEECCCCEECC T ss_conf 56412361365416652102213046731003 No 40 >pfam07646 Kelch_2 Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Drosophila ring canal kelch protein is related to galactose oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415. Probab=36.53 E-value=28 Score=15.47 Aligned_cols=34 Identities=35% Similarity=0.481 Sum_probs=27.1 Q ss_pred HCCCCEEEEEE---EECCCCEEEEEECCCCEEEECCC Q ss_conf 77996799999---98898769998468882583359 Q gi|254780663|r 174 KEHMLQIHGAW---FDISSGKLWILDPTSNEFTCDTR 207 (207) Q Consensus 174 ~~g~l~I~G~~---yDi~tG~v~~l~~~~~~F~~~~~ 207 (207) -.+++-+.|.. .+.....+..++++++.|+..++ T Consensus 10 ~~~ki~v~GG~~~~~~~~~~~~~~~d~~~~~W~~~~~ 46 (48) T pfam07646 10 VGGKLYVVGGSTGLGDLSSSDLWVLDPETNVWTELPA 46 (48) T ss_pred ECCEEEEEECCCCCCCCCCCCEEEECCCCCCEECCCC T ss_conf 8788899626665356553643787676460040657 No 41 >cd03530 Rieske_NirD_small_Bacillus Small subunit of nitrite reductase (NirD) family, Rieske domain; composed of proteins similar to the Bacillus subtilis small subunit of assimilatory nitrite reductase containing a Rieske domain. The Rieske domain is a [2Fe-2S] cluster binding domain involved in electron transfer. Assimilatory nitrate and nitrite reductases convert nitrate through nitrite to ammonium. Probab=34.10 E-value=12 Score=17.79 Aligned_cols=16 Identities=19% Similarity=0.322 Sum_probs=13.4 Q ss_pred CEEEEEEEECCCCEEE Q ss_conf 6799999988987699 Q gi|254780663|r 178 LQIHGAWFDISSGKLW 193 (207) Q Consensus 178 l~I~G~~yDi~tG~v~ 193 (207) -+.|||.||++||+.. T Consensus 61 CP~H~~~Fdl~tG~~~ 76 (98) T cd03530 61 CPLHNWVIDLETGEAQ 76 (98) T ss_pred CCCCCCEEECCCCCCC T ss_conf 5898999999997488 No 42 >pfam07250 Glyoxal_oxid_N Glyoxal oxidase N-terminus. This family represents the N-terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium. Probab=34.09 E-value=34 Score=14.87 Aligned_cols=17 Identities=24% Similarity=0.344 Sum_probs=8.2 Q ss_pred CEEEEEEEECCCCEEEE Q ss_conf 67999999889876999 Q gi|254780663|r 178 LQIHGAWFDISSGKLWI 194 (207) Q Consensus 178 l~I~G~~yDi~tG~v~~ 194 (207) -.++-++|-+.+|.|.+ T Consensus 169 ~NLYPfv~LlpdG~LFI 185 (243) T pfam07250 169 NNLYPFVFLLPDGNLFI 185 (243) T ss_pred CCCCEEEEECCCCCEEE T ss_conf 76550799838999999 No 43 >KOG0922 consensus Probab=33.54 E-value=21 Score=16.30 Aligned_cols=25 Identities=24% Similarity=0.403 Sum_probs=12.2 Q ss_pred CCCEEEEEEEECCCCEEEE--EECCCC Q ss_conf 9967999999889876999--846888 Q gi|254780663|r 176 HMLQIHGAWFDISSGKLWI--LDPTSN 200 (207) Q Consensus 176 g~l~I~G~~yDi~tG~v~~--l~~~~~ 200 (207) -.+.|-|-.|=+++|.+.. +++.++ T Consensus 328 TSlTI~GI~YVVDsG~vK~~~y~p~~g 354 (674) T KOG0922 328 TSLTIDGIRYVVDSGFVKQKKYNPRTG 354 (674) T ss_pred EEEEECCEEEEECCCCEEEEEECCCCC T ss_conf 017735628997387367776356467 No 44 >pfam01074 Glyco_hydro_38 Glycosyl hydrolases family 38 N-terminal domain. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Probab=32.75 E-value=36 Score=14.73 Aligned_cols=15 Identities=33% Similarity=0.330 Sum_probs=6.9 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 469999999999970 Q gi|254780663|r 3 SFPNTLLERHREFIQ 17 (207) Q Consensus 3 ~~~~~Ll~~N~~f~~ 17 (207) ++.+.|+.|.+-|.+ T Consensus 94 slirql~~G~~~~~~ 108 (269) T pfam01074 94 SLIRQLLYGQRFFKE 108 (269) T ss_pred HHHHHHHHHHHHHHH T ss_conf 999999999999999 No 45 >TIGR02090 LEU1_arch isopropylmalate/citramalate/homocitrate synthases; InterPro: IPR011830 Methanogenic archaea contain three closely related homologs of the 2-isopropylmalate synthases (LeuA) represented by IPR005671 from INTERPRO. Two of these in Methanococcus janaschii (MJ1392 - CimA ; MJ0503 - AksA ) have been characterised as catalyzing alternative reactions leaving the third (MJ1195) as the presumptive LeuA enzyme. CimA is citramalate (2-methylmalate) synthase, which condenses acetyl-CoA with pyruvate. This enzyme is believed to be involved in the biosynthesis of isoleucine in methanogens and possibly other species lacking threonine dehydratase. AksA is a homocitrate synthase which also produces (homo)2-citrate and (homo)3-citrate in the biosynthesis of Coenzyme B which is restricted solely to methanogenic archaea. Methanogens, then should and apparently do contain all three of these enzymes. Unfortunately, phylogenetic trees do not resolve into three unambiguous clades, making assignment of function to particular genes problematic. Other archaea, which lack a threonine dehydratase (mainly Euryarchaeota), should contain both CimA and LeuA genes. This is true for archaeoglobus fulgidis, but not for the Pyrococci which have none in this clade, but one in IPR005671 from INTERPRO and one in IPR005675 from INTERPRO which may fulfil these roles. Proteins from other species, which have only one hit to this entry and lack threonine dehydratase, are very likely to be LeuA enzymes.; GO: 0046912 transferase activity transferring acyl groups acyl groups converted into alkyl on transfer, 0019752 carboxylic acid metabolic process. Probab=31.51 E-value=26 Score=15.67 Aligned_cols=27 Identities=15% Similarity=0.273 Sum_probs=19.1 Q ss_pred HCCCCCEEEEEE-CCCHHHHHHHHHHHC Q ss_conf 205766899981-585389887530000 Q gi|254780663|r 90 QGLNVEHIVVMG-HGRCGGIQAVLDSNN 116 (207) Q Consensus 90 ~~l~v~~iiV~G-Ht~CGav~aa~~~~~ 116 (207) ...|-+--|||| ||+|.|+++.++.+- T Consensus 306 E~VG~kR~I~LGKHsG~~a~~~kL~e~G 333 (371) T TIGR02090 306 ELVGNKRRIILGKHSGRHAVKAKLKELG 333 (371) T ss_pred CCCCCCCCEEECCHHHHHHHHHHHHHCC T ss_conf 4325511035225577999999998739 No 46 >TIGR01717 AMP-nucleosdse AMP nucleosidase; InterPro: IPR011271 This entry represents the AMP nucleosidase from proteobacteria but also including sequences from Corynebacterium, a Gram-positive organism. The Escherichia coli protein is the most well studied .. Probab=31.17 E-value=38 Score=14.57 Aligned_cols=127 Identities=16% Similarity=0.223 Sum_probs=63.4 Q ss_pred CCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHH--------------HCC-- Q ss_conf 8866248886242234577777530257888776722057668999815853898875300--------------001-- Q gi|254780663|r 54 AKPGELFVVRNVANIVPPYEPDGQHHATSAAIEFAVQGLNVEHIVVMGHGRCGGIQAVLDS--------------NNS-- 117 (207) Q Consensus 54 ~~~Gd~fv~RnaGn~v~~~~~~~~~~~~~~sle~av~~l~v~~iiV~GHt~CGav~aa~~~--------------~~~-- 117 (207) ..-||=.++=|+| |.||.- =.|-=-+.+|.-.-.+-+|| |||+...+.. ..+ T Consensus 266 ~~~G~GITlvNIG--VGPSNA--------KtITDHLAVLRP~~WlMiGH--CaGLr~sq~iGDyvLAhAY~R~DHvLd~~ 333 (486) T TIGR01717 266 TADGDGITLVNIG--VGPSNA--------KTITDHLAVLRPHAWLMIGH--CAGLRESQRIGDYVLAHAYLREDHVLDTR 333 (486) T ss_pred HCCCCCEEEEECC--CCCCCH--------HHHHHHHHHCCCCCEEEECC--CCCHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 5069932899607--488621--------23541022118874287222--30412323431255520000235653765 Q ss_pred --CCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHH-------------HHHHHHHHHHHHHCCHHHHHHHHCCCCEEEE Q ss_conf --3463310146898888999986401632468799-------------9999999999973097889999779967999 Q gi|254780663|r 118 --STSPGDFIGKWMDIVRPIAQKIVANNPTEKQTIL-------------EQLSIRNSLKNIRNFPFVNKLEKEHMLQIHG 182 (207) Q Consensus 118 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-------------~~~nV~~~v~~L~~~p~i~~~v~~g~l~I~G 182 (207) ...|.+.+..+...+..+..++.+...++....+ +=..-.....+|..+-.|+==++++.+.-=| T Consensus 334 lP~~~PIPAlaEvQ~AL~~a~~ev~G~~g~elk~~lRTGTV~t~DdRNWELr~~~~~~~~l~~SrA~AvDMESaTiAa~G 413 (486) T TIGR01717 334 LPPDIPIPALAEVQKALEEAVAEVTGEEGEELKRRLRTGTVLTTDDRNWELRTAENEVKRLNLSRAIAVDMESATIAANG 413 (486) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCEEEECCCCCCCCCHHHHHHHHHHCCEEEEECCHHHHHHCC T ss_conf 76987987617899999986754068875455523104656531077600027124456763082455421247787588 Q ss_pred EEEECCCCEE Q ss_conf 9998898769 Q gi|254780663|r 183 AWFDISSGKL 192 (207) Q Consensus 183 ~~yDi~tG~v 192 (207) +-|-+==|.| T Consensus 414 YRfRvPYGTL 423 (486) T TIGR01717 414 YRFRVPYGTL 423 (486) T ss_pred CEEECCCCCC T ss_conf 5343165520 No 47 >TIGR01324 cysta_beta_ly_B cystathionine beta-lyase; InterPro: IPR006233 Cystathionine beta-lyase (alternate name: beta-cystathionase) is one of several pyridoxal-dependent enzymes of cysteine, methionine, and homocysteine metabolism. This enzyme is involved in the biosynthesis of Met from Cys .; GO: 0004121 cystathionine beta-lyase activity, 0006520 amino acid metabolic process, 0005737 cytoplasm. Probab=30.87 E-value=32 Score=15.04 Aligned_cols=32 Identities=16% Similarity=0.062 Sum_probs=21.1 Q ss_pred CCCCHHHHHHHHHHHHCCCCCEE------EEEECCCHH Q ss_conf 75302578887767220576689------998158538 Q gi|254780663|r 75 DGQHHATSAAIEFAVQGLNVEHI------VVMGHGRCG 106 (207) Q Consensus 75 ~~~~~~~~~sle~av~~l~v~~i------iV~GHt~CG 106 (207) ..-|+++.+.|.|---.-+|..= .+.|||||= T Consensus 176 ~miDNTWaagllFkpL~HgvdiSiqA~TKY~~GHSD~~ 213 (389) T TIGR01324 176 VMIDNTWAAGLLFKPLEHGVDISIQAATKYLAGHSDVL 213 (389) T ss_pred EEEECCCCCHHHCCCCCCCCCEEEEECCCCCCCCCHHH T ss_conf 99636302001023001786368750335575500245 No 48 >pfam02100 ODC_AZ Ornithine decarboxylase antizyme. This family consists of ornithine decarboxylase antizyme proteins. The polyamine biosynthetic enzyme ornithine decarboxylase (ODC) is degraded by the 26 S proteasome via a ubiquitin-independent pathway. Its degradation is greatly accelerated by association with the polyamine-induced regulatory protein antizyme 1 (AZ1). Probab=30.38 E-value=39 Score=14.48 Aligned_cols=29 Identities=24% Similarity=0.301 Sum_probs=24.6 Q ss_pred CHHHHHHHHHHHHCCCCCEEEEEECCCHH Q ss_conf 02578887767220576689998158538 Q gi|254780663|r 78 HHATSAAIEFAVQGLNVEHIVVMGHGRCG 106 (207) Q Consensus 78 ~~~~~~sle~av~~l~v~~iiV~GHt~CG 106 (207) -.+.++-||||-..|++.+|+||=+-+=. T Consensus 118 K~~~~aLle~Aee~l~c~~vv~c~~k~~~ 146 (187) T pfam02100 118 KEGLVALLEFAEEKLECSHVVICFDKNRE 146 (187) T ss_pred HHHHHHHHHHHHHCCCCCEEEEEEECCCC T ss_conf 89999999974111588789999847863 No 49 >pfam06581 DUF1135 Protein of unknown function (DUF1135). This family consists of several hypothetical mammalian proteins of unknown function. Probab=28.62 E-value=23 Score=16.05 Aligned_cols=33 Identities=24% Similarity=0.240 Sum_probs=19.6 Q ss_pred CCCCHHHHHHHHHHHH------CC-----CCCEEEEEECCCHHH Q ss_conf 7530257888776722------05-----766899981585389 Q gi|254780663|r 75 DGQHHATSAAIEFAVQ------GL-----NVEHIVVMGHGRCGG 107 (207) Q Consensus 75 ~~~~~~~~~sle~av~------~l-----~v~~iiV~GHt~CGa 107 (207) +....++++.|--++. .+ --..+++-||-|||. T Consensus 145 sL~t~~ClR~Lfr~lf~aD~fsel~~~plmgTiVm~qgHRdCG~ 188 (239) T pfam06581 145 GLNTAACLRRLFRAIFLADPFSELQTPPLMGTIVMVQGHRDCGE 188 (239) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCCC T ss_conf 76588999999999998624555326762250433312312472 No 50 >cd01890 LepA LepA subfamily. LepA belongs to the GTPase family of and exhibits significant homology to the translation factors EF-G and EF-Tu, indicating its possible involvement in translation and association with the ribosome. LepA is ubiquitous in bacteria and eukaryota (e.g. yeast GUF1p), but is missing from archaea. This pattern of phyletic distribution suggests that LepA evolved through a duplication of the EF-G gene in bacteria, followed by early transfer into the eukaryotic lineage, most likely from the promitochondrial endosymbiont. Yeast GUF1p is not essential and mutant cells did not reveal any marked phenotype. Probab=28.50 E-value=28 Score=15.44 Aligned_cols=18 Identities=6% Similarity=-0.045 Sum_probs=9.9 Q ss_pred HCCCCCEEEEEECCCHHH Q ss_conf 205766899981585389 Q gi|254780663|r 90 QGLNVEHIVVMGHGRCGG 107 (207) Q Consensus 90 ~~l~v~~iiV~GHt~CGa 107 (207) .....=.+||-..++... T Consensus 89 ~~~D~allVVda~~Gv~~ 106 (179) T cd01890 89 AACEGALLLVDATQGVEA 106 (179) T ss_pred HHHCEEEEEEECCCCCCH T ss_conf 754427899864778737 No 51 >cd01886 EF-G Elongation factor G (EF-G) subfamily. Translocation is mediated by EF-G (also called translocase). The structure of EF-G closely resembles that of the complex between EF-Tu and tRNA. This is an example of molecular mimicry; a protein domain evolved so that it mimics the shape of a tRNA molecule. EF-G in the GTP form binds to the ribosome, primarily through the interaction of its EF-Tu-like domain with the 50S subunit. The binding of EF-G to the ribosome in this manner stimulates the GTPase activity of EF-G. On GTP hydrolysis, EF-G undergoes a conformational change that forces its arm deeper into the A site on the 30S subunit. To accommodate this domain, the peptidyl-tRNA in the A site moves to the P site, carrying the mRNA and the deacylated tRNA with it. The ribosome may be prepared for these rearrangements by the initial binding of EF-G as well. The dissociation of EF-G leaves the ribosome ready to accept the next aminoacyl-tRNA into the A site. This group conta Probab=27.77 E-value=25 Score=15.79 Aligned_cols=15 Identities=20% Similarity=0.065 Sum_probs=7.4 Q ss_pred EEECCCCEEEEEECC Q ss_conf 998898769998468 Q gi|254780663|r 184 WFDISSGKLWILDPT 198 (207) Q Consensus 184 ~yDi~tG~v~~l~~~ 198 (207) ++|+-+.+.+.|+.. T Consensus 164 vvDll~~ka~~y~~~ 178 (270) T cd01886 164 VVDLIEMKALYWDGE 178 (270) T ss_pred EEECCCCEEEEECCC T ss_conf 997787879984178 No 52 >COG2885 OmpA Outer membrane protein and related peptidoglycan-associated (lipo)proteins [Cell envelope biogenesis, outer membrane] Probab=27.21 E-value=45 Score=14.13 Aligned_cols=29 Identities=24% Similarity=0.311 Sum_probs=19.6 Q ss_pred HHHHHHHHHHHHCCCCCEEEEEECCCHHH Q ss_conf 25788877672205766899981585389 Q gi|254780663|r 79 HATSAAIEFAVQGLNVEHIVVMGHGRCGG 107 (207) Q Consensus 79 ~~~~~sle~av~~l~v~~iiV~GHt~CGa 107 (207) ...+..+.=.+......+|+|-||||--| T Consensus 99 ~~~L~~~a~~L~~~p~~~i~V~GHTD~~G 127 (190) T COG2885 99 QATLDELAKYLKKNPITRILVEGHTDSTG 127 (190) T ss_pred HHHHHHHHHHHHHCCCCEEEEEECCCCCC T ss_conf 99999999999878997799975578879 No 53 >KOG1420 consensus Probab=26.99 E-value=32 Score=15.06 Aligned_cols=23 Identities=30% Similarity=0.395 Sum_probs=16.2 Q ss_pred CCCCCEEEEEECCCHHHHHHHHH Q ss_conf 05766899981585389887530 Q gi|254780663|r 91 GLNVEHIVVMGHGRCGGIQAVLD 113 (207) Q Consensus 91 ~l~v~~iiV~GHt~CGav~aa~~ 113 (207) .-|-+|||||||----.+...+. T Consensus 354 ehgkkhivvcghityesvshflk 376 (1103) T KOG1420 354 EHGKKHIVVCGHITYESVSHFLK 376 (1103) T ss_pred HCCCEEEEEECCEEHHHHHHHHH T ss_conf 03870499933322898999999 No 54 >PRK13224 consensus Probab=26.49 E-value=46 Score=14.05 Aligned_cols=90 Identities=16% Similarity=0.115 Sum_probs=59.6 Q ss_pred HHHHHHHHCCCCCEEEEEECCCCCCCHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEE Q ss_conf 88999862389975999962678998888517886624888624223457777753025788877672205766899981 Q gi|254780663|r 22 KKLFQELANQQKPKIMIISCCDSRVAPETIFNAKPGELFVVRNVANIVPPYEPDGQHHATSAAIEFAVQGLNVEHIVVMG 101 (207) Q Consensus 22 ~~~~~~l~~~q~P~~~vitC~DsRv~~~~i~~~~~Gd~fv~RnaGn~v~~~~~~~~~~~~~~sle~av~~l~v~~iiV~G 101 (207) .+..+.|.....| ..++|-+........+-..+..+.|..-.-|.-++..+... ..+.+|...|+++..+++| T Consensus 92 ~e~L~~L~~~g~~-laivTn~~~~~~~~~L~~~gl~~~Fd~iv~~d~v~~~KP~P------e~~l~a~~~l~~~~~l~VG 164 (216) T PRK13224 92 EAALRHLKAEGWR-LGLCTNKPVGPTRAILAHFGLAELFDVVIGGDSLPQRKPDP------APLRAAFAALGDEPVLYVG 164 (216) T ss_pred HHHHHHHHHCCCC-EEEEECCCHHHHHHHHHHCCCHHHCCEEECCCCCCCCCCCH------HHHHHHHHHCCCCCEEEEC T ss_conf 9999999977996-68760893477888998739443247465446689999999------9999999737999989975 Q ss_pred CCCHHHHHHHHHHHCCC Q ss_conf 58538988753000013 Q gi|254780663|r 102 HGRCGGIQAVLDSNNSS 118 (207) Q Consensus 102 Ht~CGav~aa~~~~~~~ 118 (207) -|.-+...|-..+.... T Consensus 165 Ds~~D~~aA~~AGi~~i 181 (216) T PRK13224 165 DSEVDAETAENAGVPFL 181 (216) T ss_pred CCHHHHHHHHHHCCCEE T ss_conf 98878999999499889 No 55 >cd03297 ABC_ModC_molybdenum_transporter ModC is an ABC-type transporter and the ATPase component of a molybdate transport system that also includes the periplasmic binding protein ModA and the membrane protein ModB. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=26.29 E-value=38 Score=14.57 Aligned_cols=21 Identities=24% Similarity=0.249 Sum_probs=9.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHH Q ss_conf 632468799999999999997 Q gi|254780663|r 143 NPTEKQTILEQLSIRNSLKNI 163 (207) Q Consensus 143 ~~~~~~~~l~~~nV~~~v~~L 163 (207) .|....+...+..++..+..+ T Consensus 157 EP~saLD~~~~~~i~~~l~~l 177 (214) T cd03297 157 EPFSALDRALRLQLLPELKQI 177 (214) T ss_pred CCCCCCCHHHHHHHHHHHHHH T ss_conf 887666999999999999999 No 56 >cd03472 Rieske_RO_Alpha_BPDO_like Rieske non-heme iron oxygenase (RO) family, Biphenyl dioxygenase (BPDO)-like subfamily, N-terminal Rieske domain of the oxygenase alpha subunit; composed of the oxygenase alpha subunits of BPDO and similar proteins including cumene dioxygenase (CumDO), nitrobenzene dioxygenase (NBDO), alkylbenzene dioxygenase (AkbDO) and dibenzofuran 4,4a-dioxygenase (DFDO). ROs comprise a large class of aromatic ring-hydroxylating dioxygenases that enable microorganisms to tolerate and utilize aromatic compounds for growth. The oxygenase alpha subunit contains an N-terminal Rieske domain with an [2Fe-2S] cluster and a C-terminal catalytic domain with a mononuclear Fe(II) binding site. The Rieske [2Fe-2S] cluster accepts electrons from a reductase or ferredoxin component and transfers them to the mononuclear iron for catalysis. BPDO degrades biphenyls and polychlorinated biphenyls (PCB's) while CumDO degrades cumene (isopropylbenzene), an aromatic hydrocarbon that is i Probab=26.19 E-value=23 Score=15.96 Aligned_cols=16 Identities=31% Similarity=0.476 Sum_probs=7.4 Q ss_pred EEEEEEEECCCCEEEEE Q ss_conf 79999998898769998 Q gi|254780663|r 179 QIHGAWFDISSGKLWIL 195 (207) Q Consensus 179 ~I~G~~yDi~tG~v~~l 195 (207) +-|||.||+ +|++... T Consensus 72 pYHgW~Y~l-dG~L~~~ 87 (128) T cd03472 72 TYHGWAYDT-AGNLVNV 87 (128) T ss_pred CCCCCEECC-CCCEEEC T ss_conf 988889889-9988668 No 57 >cd01883 EF1_alpha Eukaryotic elongation factor 1 (EF1) alpha subfamily. EF1 is responsible for the GTP-dependent binding of aminoacyl-tRNAs to the ribosomes. EF1 is composed of four subunits: the alpha chain which binds GTP and aminoacyl-tRNAs, the gamma chain that probably plays a role in anchoring the complex to other cellular components and the beta and delta (or beta') chains. This subfamily is the alpha subunit, and represents the counterpart of bacterial EF-Tu for the archaea (aEF1-alpha) and eukaryotes (eEF1-alpha). eEF1-alpha interacts with the actin of the eukaryotic cytoskeleton and may thereby play a role in cellular transformation and apoptosis. EF-Tu can have no such role in bacteria. In humans, the isoform eEF1A2 is overexpressed in 2/3 of breast cancers and has been identified as a putative oncogene. This subfamily also includes Hbs1, a G protein known to be important for efficient growth and protein synthesis under conditions of limiting translation initiation in Probab=26.14 E-value=28 Score=15.48 Aligned_cols=14 Identities=21% Similarity=0.479 Sum_probs=8.5 Q ss_pred HHHHCCCCCEEEEE Q ss_conf 67220576689998 Q gi|254780663|r 87 FAVQGLNVEHIVVM 100 (207) Q Consensus 87 ~av~~l~v~~iiV~ 100 (207) +-+..||+++|||+ T Consensus 130 ~l~~~lGik~iIVa 143 (219) T cd01883 130 LLARTLGVKQLIVA 143 (219) T ss_pred HHHHHCCCCEEEEE T ss_conf 99998499748999 No 58 >cd03301 ABC_MalK_N The N-terminal ATPase domain of the maltose transporter, MalK. ATP binding cassette (ABC) proteins function from bacteria to human, mediating the translocation of substances into and out of cells or organelles. ABC transporters contain two transmembrane-spanning domains (TMDs) or subunits and two nucleotide binding domains (NBDs) or subunits that couple transport to the hydrolysis of ATP. In the maltose transport system, the periplasmic maltose binding protein (MBP) stimulates the ATPase activity of the membrane-associated transporter, which consists of two transmembrane subunits, MalF and MalG, and two copies of the ATP binding subunit, MalK, and becomes tightly bound to the transporter in the catalytic transition state, ensuring that maltose is passed to the transporter as ATP is hydrolyzed. Probab=25.35 E-value=39 Score=14.48 Aligned_cols=12 Identities=42% Similarity=1.043 Sum_probs=4.9 Q ss_pred CEEEEEECCCHH Q ss_conf 689998158538 Q gi|254780663|r 95 EHIVVMGHGRCG 106 (207) Q Consensus 95 ~~iiV~GHt~CG 106 (207) +.+.|+|+|+|| T Consensus 27 e~~~i~GpSG~G 38 (213) T cd03301 27 EFVVLLGPSGCG 38 (213) T ss_pred CEEEEECCCCCH T ss_conf 899999999880 No 59 >cd00982 gltB_C gltb_C. This domain is found at the C-terminus of the large subunit (gltB) of glutamate synthase (GltS). GltS encodes a complex iron-sulfur flavoprotein that catalyzes the synthesis of L-glutamate from L-glutamine and 2-oxoglutarate. It requires the transfer of ammonia and electrons among three distinct active centers that carry out L-Gln hydrolysis, conversion of 2-oxoglutarate into L-Glu, and electron uptake from a donor. These catalytic sites appear to occur in other domains within the protein, and not the domain in this CD. This particular domain has no known function, but it likely has a structural role as it interacts with the amidotransferase and FMN-binding domains of gltS. Probab=25.00 E-value=32 Score=15.10 Aligned_cols=41 Identities=22% Similarity=0.315 Sum_probs=27.4 Q ss_pred CCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEECCCH Q ss_conf 866248886242234577777530257888776722057668999815853 Q gi|254780663|r 55 KPGELFVVRNVANIVPPYEPDGQHHATSAAIEFAVQGLNVEHIVVMGHGRC 105 (207) Q Consensus 55 ~~Gd~fv~RnaGn~v~~~~~~~~~~~~~~sle~av~~l~v~~iiV~GHt~C 105 (207) ..||=|.|||-|..+. +-+.=..+|...--=.|||+|-++. T Consensus 132 ~aGERF~VRNSGA~aV----------VEG~GdHgCEYMTGG~vvvLG~~G~ 172 (251) T cd00982 132 RAGERFAVRNSGATAV----------VEGVGDHGCEYMTGGTVVVLGKTGR 172 (251) T ss_pred CCCCEEEEECCCCEEE----------EECCCCCCEEECCCCEEEEECCCCC T ss_conf 5442246723698899----------9825776311414977999337787 No 60 >cd04166 CysN_ATPS CysN_ATPS subfamily. CysN, together with protein CysD, form the ATP sulfurylase (ATPS) complex in some bacteria and lower eukaryotes. ATPS catalyzes the production of ATP sulfurylase (APS) and pyrophosphate (PPi) from ATP and sulfate. CysD, which catalyzes ATP hydrolysis, is a member of the ATP pyrophosphatase (ATP PPase) family. CysN hydrolysis of GTP is required for CysD hydrolysis of ATP; however, CysN hydrolysis of GTP is not dependent on CysD hydrolysis of ATP. CysN is an example of lateral gene transfer followed by acquisition of new function. In many organisms, an ATPS exists which is not GTP-dependent and shares no sequence or structural similarity to CysN. Probab=24.61 E-value=33 Score=14.96 Aligned_cols=45 Identities=20% Similarity=0.268 Sum_probs=19.7 Q ss_pred HHHHHCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEE Q ss_conf 88851788662488862422345777775302578887767220576689998 Q gi|254780663|r 48 PETIFNAKPGELFVVRNVANIVPPYEPDGQHHATSAAIEFAVQGLNVEHIVVM 100 (207) Q Consensus 48 ~~~i~~~~~Gd~fv~RnaGn~v~~~~~~~~~~~~~~sle~av~~l~v~~iiV~ 100 (207) ++++.++...|.-++=..++- +....+-.. .+.+..||+++|||+ T Consensus 92 ~nmi~gas~aD~ailVVda~~-------G~~~QT~eh-~~~~~~lgi~~iIv~ 136 (208) T cd04166 92 RNMVTGASTADLAILLVDARK-------GVLEQTRRH-SYILSLLGIRHVVVA 136 (208) T ss_pred HHHHHHHHHCCEEEEEEECCC-------CCCHHHHHH-HHHHHHCCCCEEEEE T ss_conf 999999863774799997588-------872789999-999997499839999 No 61 >PRK08292 AMP nucleosidase; Provisional Probab=24.58 E-value=50 Score=13.82 Aligned_cols=131 Identities=15% Similarity=0.147 Sum_probs=66.0 Q ss_pred CCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHH---------HCCCCCCC---- Q ss_conf 66248886242234577777530257888776722057668999815853898875300---------00134633---- Q gi|254780663|r 56 PGELFVVRNVANIVPPYEPDGQHHATSAAIEFAVQGLNVEHIVVMGHGRCGGIQAVLDS---------NNSSTSPG---- 122 (207) Q Consensus 56 ~Gd~fv~RnaGn~v~~~~~~~~~~~~~~sle~av~~l~v~~iiV~GHt~CGav~aa~~~---------~~~~~~~~---- 122 (207) -|+=.++=|+|= .|+ -...|.=-+.+|.-+-++.+|| |||.+..... ..+..... T Consensus 272 d~~GiTlvNiGv--Gps--------NAktitDhlavlrP~awlmlGh--CgGLr~~q~iGdyvLa~ay~R~d~vld~~lP 339 (487) T PRK08292 272 DGQGITLVNIGV--GPS--------NAKTITDHLAVLRPHAWLMIGH--CGGLRNSQRIGDYVLAHAYLREDHVLDAVLP 339 (487) T ss_pred CCCCEEEEEECC--CCC--------HHHHHHHHHHHCCCCEEEEEEC--CCCCCCCCCCCCEEECHHHHCCCCCCCCCCC T ss_conf 999538998366--884--------0778988887508765999841--3576664613000100143304675445478 Q ss_pred -----CCHHHHHHHHHHHHHHHHHCCCHHHHHH------------HHHHHHHHHHHHHHCCHHHHHHHHCCCCEEEEEEE Q ss_conf -----1014689888899998640163246879------------99999999999973097889999779967999999 Q gi|254780663|r 123 -----DFIGKWMDIVRPIAQKIVANNPTEKQTI------------LEQLSIRNSLKNIRNFPFVNKLEKEHMLQIHGAWF 185 (207) Q Consensus 123 -----~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------l~~~nV~~~v~~L~~~p~i~~~v~~g~l~I~G~~y 185 (207) +.+..+...+..+..........+.... .=|..-....+.++.+-.|+--+++..+...|+.| T Consensus 340 ~~vPiP~~aEvQ~al~~av~~v~~~~g~~~K~~mrTGtV~ttd~RnWEl~~~~~~~~~~~sraiaiDMEsaTiaa~G~r~ 419 (487) T PRK08292 340 PWVPIPAIAEVQVALEDAVAEVTGLPGEELKRRMRTGTVVTTDDRNWELRYSASALRFNQSRAVALDMESATIAANGYRF 419 (487) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCEEEEEECCCCCCCCCCHHHHHHHHCCEEEEEHHHHHHHHCCCCC T ss_conf 77887862999999999999885355376554102216996278655125417899975321025630077786534300 Q ss_pred ECCCCEEEEEECC Q ss_conf 8898769998468 Q gi|254780663|r 186 DISSGKLWILDPT 198 (207) Q Consensus 186 Di~tG~v~~l~~~ 198 (207) .+-.|.|-.++.. T Consensus 420 rvpyGtLL~VSDk 432 (487) T PRK08292 420 RVPYGTLLCVSDK 432 (487) T ss_pred CCCCCEEEEECCC T ss_conf 3666436885277 No 62 >cd00953 KDG_aldolase KDG (2-keto-3-deoxygluconate) aldolases found in archaea. This subfamily of enzymes is adapted for high thermostability and shows specificity for non-phosphorylated substrates. The enzyme catalyses the reversible aldol cleavage of 2-keto-3-dexoygluconate to pyruvate and glyceraldehyde, the third step of a modified non-phosphorylated Entner-Doudoroff pathway of glucose oxidation. KDG aldolase shows no significant sequence similarity to microbial 2-keto-3-deoxyphosphogluconate (KDPG) aldolases, and the enzyme shows no activity with glyceraldehyde 3-phosphate as substrate. The enzyme is a tetramer and a member of the DHDPS family of Schiff-base-dependent class I aldolases. Probab=24.45 E-value=50 Score=13.80 Aligned_cols=68 Identities=15% Similarity=0.259 Sum_probs=28.9 Q ss_pred HHHHHHHCCCCCEEEEEE--CCCCCCCHHHHHCC--CCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEE Q ss_conf 899986238997599996--26789988885178--86624888624223457777753025788877672205766899 Q gi|254780663|r 23 KLFQELANQQKPKIMIIS--CCDSRVAPETIFNA--KPGELFVVRNVANIVPPYEPDGQHHATSAAIEFAVQGLNVEHIV 98 (207) Q Consensus 23 ~~~~~l~~~q~P~~~vit--C~DsRv~~~~i~~~--~~Gd~fv~RnaGn~v~~~~~~~~~~~~~~sle~av~~l~v~~ii 98 (207) ++|..++. .-|.++.-- .+-.-++++.+-.+ .++.+.-++...+ + .....++ ..++-+..+ T Consensus 113 ~yf~~va~-~lPi~lYn~P~~tg~~l~~~~~~~L~~~~~~v~giKds~~----------d--~~~~~~~--~~~~~~~~v 177 (279) T cd00953 113 KYFTDISS-PYPTFIYNYPKATGYDINARMAKEIKKAGGDIIGVKDTNE----------D--ISHMLEY--KRLVPDFKV 177 (279) T ss_pred HHHHHHHH-HCCEEEEECCCCCCCCCCHHHHHHHHHCCCCEEEEECCCC----------C--HHHHHHH--HHHCCCCEE T ss_conf 99999985-0987699677535888899999999817998899973876----------9--9999999--984899478 Q ss_pred EEECCCH Q ss_conf 9815853 Q gi|254780663|r 99 VMGHGRC 105 (207) Q Consensus 99 V~GHt~C 105 (207) .+||... T Consensus 178 ~~G~d~~ 184 (279) T cd00953 178 YSGPDSL 184 (279) T ss_pred ECCCHHH T ss_conf 5695799 No 63 >cd03295 ABC_OpuCA_Osmoprotection OpuCA is a the ATP binding component of a bacterial solute transporter that serves a protective role to cells growing in a hyperosmolar environment. ABC (ATP-binding cassette) transporter nucleotide-binding domain; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition, to the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=24.13 E-value=45 Score=14.12 Aligned_cols=57 Identities=5% Similarity=0.152 Sum_probs=26.7 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHC-C--------HHHHHHHHCCCCEEEEEEEECCCCEEEEEECCCCEEE Q ss_conf 163246879999999999999730-9--------7889999779967999999889876999846888258 Q gi|254780663|r 142 NNPTEKQTILEQLSIRNSLKNIRN-F--------PFVNKLEKEHMLQIHGAWFDISSGKLWILDPTSNEFT 203 (207) Q Consensus 142 ~~~~~~~~~l~~~nV~~~v~~L~~-~--------p~i~~~v~~g~l~I~G~~yDi~tG~v~~l~~~~~~F~ 203 (207) +.|....+...+..+.+.+..|.. . --+.++.. +.--++=+..|++.......+-|. T Consensus 160 DEP~saLD~~~~~~i~~~l~~l~~~~~~T~i~vTHd~~ea~~-----~aDri~vm~~G~iv~~g~~~ei~~ 225 (242) T cd03295 160 DEPFGALDPITRDQLQEEFKRLQQELGKTIVFVTHDIDEAFR-----LADRIAIMKNGEIVQVGTPDEILR 225 (242) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHCCEEEEECCCHHHHHH-----HCCEEEEEECCEEEEECCHHHHHH T ss_conf 187654698999999999999999759999999989999999-----699899998999999878899985 No 64 >cd03290 ABCC_SUR1_N The SUR domain 1. The sulfonylurea receptor SUR is an ATP transporter of the ABCC/MRP family with tandem ATPase binding domains. Unlike other ABC proteins, it has no intrinsic transport function, neither active nor passive, but associates with the potassium channel proteins Kir6.1 or Kir6.2 to form the ATP-sensitive potassium (K(ATP)) channel. Within the channel complex, SUR serves as a regulatory subunit that fine-tunes the gating of Kir6.x in response to alterations in cellular metabolism. It constitutes a major pharmaceutical target as it binds numerous drugs, K(ATP) channel openers and blockers, capable of up- or down-regulating channel activity. Probab=23.60 E-value=43 Score=14.22 Aligned_cols=14 Identities=21% Similarity=0.579 Sum_probs=6.4 Q ss_pred CCCEEEEEECCCHH Q ss_conf 76689998158538 Q gi|254780663|r 93 NVEHIVVMGHGRCG 106 (207) Q Consensus 93 ~v~~iiV~GHt~CG 106 (207) .-+.+.|+|.|+|| T Consensus 26 ~Ge~~~IvG~sGsG 39 (218) T cd03290 26 TGQLTMIVGQVGCG 39 (218) T ss_pred CCCEEEEECCCCCC T ss_conf 99999999999980 No 65 >cd03229 ABC_Class3 This class is comprised of all BPD (Binding Protein Dependent) systems that are largely represented in archaea and eubacteria and are primarily involved in scavenging solutes from the environment. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=23.43 E-value=46 Score=14.05 Aligned_cols=16 Identities=19% Similarity=0.279 Sum_probs=11.4 Q ss_pred CCCCEEEEEECCCHHH Q ss_conf 5766899981585389 Q gi|254780663|r 92 LNVEHIVVMGHGRCGG 107 (207) Q Consensus 92 l~v~~iiV~GHt~CGa 107 (207) -.-+-+.++|+|+||= T Consensus 24 ~~Ge~~~ivGpSG~GK 39 (178) T cd03229 24 EAGEIVALLGPSGSGK 39 (178) T ss_pred CCCCEEEEECCCCCHH T ss_conf 8998999999999839 No 66 >cd03300 ABC_PotA_N PotA is an ABC-type transporter and the ATPase component of the spermidine/putrescine-preferential uptake system consisting of PotA, -B, -C, and -D. PotA has two domains with the N-terminal domain containing the ATPase activity and the residues required for homodimerization with PotA and heterdimerization with PotB. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=23.41 E-value=46 Score=14.04 Aligned_cols=21 Identities=14% Similarity=0.219 Sum_probs=9.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHH Q ss_conf 632468799999999999997 Q gi|254780663|r 143 NPTEKQTILEQLSIRNSLKNI 163 (207) Q Consensus 143 ~~~~~~~~l~~~nV~~~v~~L 163 (207) .|....+...+..++.-+..| T Consensus 156 EP~s~LD~~~~~~i~~~l~~l 176 (232) T cd03300 156 EPLGALDLKLRKDMQLELKRL 176 (232) T ss_pred CCCCCCCHHHHHHHHHHHHHH T ss_conf 887646999999999999999 No 67 >PRK00741 prfC peptide chain release factor 3; Provisional Probab=23.33 E-value=47 Score=14.01 Aligned_cols=28 Identities=18% Similarity=0.240 Sum_probs=16.9 Q ss_pred HHHHHHCCCCCEEEEEECCCCC-CCHHHH Q ss_conf 9998623899759999626789-988885 Q gi|254780663|r 24 LFQELANQQKPKIMIISCCDSR-VAPETI 51 (207) Q Consensus 24 ~~~~l~~~q~P~~~vitC~DsR-v~~~~i 51 (207) .++......-|.+++|..||-- ..+..+ T Consensus 123 ~w~~~~~~~iP~i~FINKmDR~~ad~~~~ 151 (526) T PRK00741 123 LMEVCRLRDTPIFTFINKLDRDGREPLEL 151 (526) T ss_pred HHHHHHHCCCCEEEEEECCCCCCCCHHHH T ss_conf 99998863998899996567678987898 No 68 >PRK13632 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=23.18 E-value=48 Score=13.93 Aligned_cols=20 Identities=10% Similarity=0.101 Sum_probs=8.4 Q ss_pred CHHHHHHHHHHHHHHHHHHH Q ss_conf 32468799999999999997 Q gi|254780663|r 144 PTEKQTILEQLSIRNSLKNI 163 (207) Q Consensus 144 ~~~~~~~l~~~nV~~~v~~L 163 (207) |....+...+..+.+.+..| T Consensus 170 PTs~LD~~~~~~l~~~l~~l 189 (273) T PRK13632 170 STSMLDPKGKREIKKIMVDL 189 (273) T ss_pred CCCCCCHHHHHHHHHHHHHH T ss_conf 75569989999999999999 No 69 >PRK13491 chemoreceptor glutamine deamidase CheD; Provisional Probab=22.91 E-value=41 Score=14.37 Aligned_cols=32 Identities=19% Similarity=0.290 Sum_probs=23.7 Q ss_pred EECCCCCCCHH-HHHCCCCCCEEEEECCCCCCC Q ss_conf 96267899888-851788662488862422345 Q gi|254780663|r 39 ISCCDSRVAPE-TIFNAKPGELFVVRNVANIVP 70 (207) Q Consensus 39 itC~DsRv~~~-~i~~~~~Gd~fv~RnaGn~v~ 70 (207) +-|-||+-... .|....+||.+|-....-++. T Consensus 2 ~~~~~~~~~~~~kivhV~~Ge~~Vs~~p~~~~t 34 (199) T PRK13491 2 TRCDDSPSASQISITHVTQGSCVASDSPNEVYA 34 (199) T ss_pred CCCCCCCCCCCCEEEEEECCEEEEECCCCEEEE T ss_conf 655789876775589982532799779986899 No 70 >PRK11545 gntK gluconate kinase 1; Provisional Probab=22.91 E-value=39 Score=14.51 Aligned_cols=15 Identities=27% Similarity=0.375 Sum_probs=12.1 Q ss_pred CCCEEEEEECCCHHH Q ss_conf 766899981585389 Q gi|254780663|r 93 NVEHIVVMGHGRCGG 107 (207) Q Consensus 93 ~v~~iiV~GHt~CGa 107 (207) ....|||||=++||= T Consensus 7 ~~~iiVVMGVsGsGK 21 (177) T PRK11545 7 DHHIYVLMGVSGSGK 21 (177) T ss_pred CCEEEEEECCCCCCH T ss_conf 875999984798999 No 71 >cd03479 Rieske_RO_Alpha_PhDO_like Rieske non-heme iron oxygenase (RO) family, Phthalate 4,5-dioxygenase (PhDO)-like subfamily, N-terminal Rieske domain of the oxygenase alpha subunit; composed of the oxygenase alpha subunits of PhDO and similar proteins including 3-chlorobenzoate 3,4-dioxygenase (CBDO), phenoxybenzoate dioxygenase (POB-dioxygenase) and 3-nitrobenzoate oxygenase (MnbA). ROs comprise a large class of aromatic ring-hydroxylating dioxygenases that enable microorganisms to tolerate and utilize aromatic compounds for growth. The oxygenase alpha subunit contains an N-terminal Rieske domain with an [2Fe-2S] cluster and a C-terminal catalytic domain with a mononuclear Fe(II) binding site. The Rieske [2Fe-2S] cluster accepts electrons from a reductase or ferredoxin component and transfers them to the mononuclear iron for catalysis. PhDO and CBDO are two-component RO systems, containing oxygenase and reductase components. PhDO catalyzes the dihydroxylation of phthalate to form th Probab=22.52 E-value=47 Score=13.98 Aligned_cols=13 Identities=38% Similarity=0.667 Sum_probs=5.2 Q ss_pred EEEEEEECCCCEEE Q ss_conf 99999988987699 Q gi|254780663|r 180 IHGAWFDISSGKLW 193 (207) Q Consensus 180 I~G~~yDi~tG~v~ 193 (207) =|||-||. +|++. T Consensus 85 YHGW~yd~-dG~l~ 97 (144) T cd03479 85 YHGWKFDV-DGQCL 97 (144) T ss_pred CCCCEECC-CCCEE T ss_conf 12318859-99878 No 72 >KOG0926 consensus Probab=22.27 E-value=42 Score=14.29 Aligned_cols=18 Identities=11% Similarity=0.215 Sum_probs=13.8 Q ss_pred CCCCEEEEEECCCHHHHH Q ss_conf 576689998158538988 Q gi|254780663|r 92 LNVEHIVVMGHGRCGGIQ 109 (207) Q Consensus 92 l~v~~iiV~GHt~CGav~ 109 (207) ..-+.+||||-|+||-.+ T Consensus 269 n~n~vvIIcGeTGsGKTT 286 (1172) T KOG0926 269 NENPVVIICGETGSGKTT 286 (1172) T ss_pred HCCCEEEEECCCCCCCCC T ss_conf 228749995488888644 No 73 >TIGR01892 AcOrn-deacetyl acetylornithine deacetylase (ArgE); InterPro: IPR010169 This entry represents a clade of acetylornithine deacetylases (argE) from proteobacteria, which catalyse the final step in arginine biosynthesis. The enzyme is closely related to and may share a common origin with dapE (succinyl-diaminopimelate desuccinylase), cpg2 (carboxypeptidase G2) and ACY1 (aminoacylase I), all of which are members of the metal-dependent aminoacylase family, ArgE/dapE/ACY1/CPG2/yscS .; GO: 0008270 zinc ion binding, 0008777 acetylornithine deacetylase activity, 0050897 cobalt ion binding, 0006526 arginine biosynthetic process, 0005737 cytoplasm. Probab=22.13 E-value=37 Score=14.66 Aligned_cols=13 Identities=23% Similarity=0.414 Sum_probs=8.1 Q ss_pred CCCEEEEEECCCH Q ss_conf 7668999815853 Q gi|254780663|r 93 NVEHIVVMGHGRC 105 (207) Q Consensus 93 ~v~~iiV~GHt~C 105 (207) |-.=+++-||||| T Consensus 68 G~~G~~L~GH~D~ 80 (386) T TIGR01892 68 GAGGLVLSGHTDV 80 (386) T ss_pred CCCCEEEECCEEE T ss_conf 7632688767012 No 74 >cd04168 TetM_like Tet(M)-like subfamily. Tet(M), Tet(O), Tet(W), and OtrA are tetracycline resistance genes found in Gram-positive and Gram-negative bacteria. Tetracyclines inhibit protein synthesis by preventing aminoacyl-tRNA from binding to the ribosomal acceptor site. This subfamily contains tetracycline resistance proteins that function through ribosomal protection and are typically found on mobile genetic elements, such as transposons or plasmids, and are often conjugative. Ribosomal protection proteins are homologous to the elongation factors EF-Tu and EF-G. EF-G and Tet(M) compete for binding on the ribosomes. Tet(M) has a higher affinity than EF-G, suggesting these two proteins may have overlapping binding sites and that Tet(M) must be released before EF-G can bind. Tet(M) and Tet(O) have been shown to have ribosome-dependent GTPase activity. These proteins are part of the GTP translation factor family, which includes EF-G, EF-Tu, EF2, LepA, and SelB. Probab=22.13 E-value=35 Score=14.82 Aligned_cols=15 Identities=20% Similarity=0.346 Sum_probs=7.8 Q ss_pred HCCCCCEEEEEECCC Q ss_conf 205766899981585 Q gi|254780663|r 90 QGLNVEHIVVMGHGR 104 (207) Q Consensus 90 ~~l~v~~iiV~GHt~ 104 (207) ...+++.++++-+=| T Consensus 113 ~~~~~P~iifiNKmD 127 (237) T cd04168 113 RKLNIPTIIFVNKID 127 (237) T ss_pred HHCCCCEEEEEECCC T ss_conf 985998599862445 No 75 >pfam08184 Cuticle_2 Cuticle protein 7 isoform family. This family consists of cuticle protein 7 isoforms that are isolated from the carapace cuticle of a juvenile horseshoe crab, Limulus polyphemus. There are 3 isoforms of cuticle protein 7. The 3 isoforms are N-terminally blocked but could be deblocked by treatment with pyroglutaminase, showing that the N-terminal residue is a pyroglutamine residue. Probab=22.10 E-value=37 Score=14.63 Aligned_cols=13 Identities=31% Similarity=0.621 Sum_probs=10.4 Q ss_pred EEEEEECCCCEEE Q ss_conf 9999988987699 Q gi|254780663|r 181 HGAWFDISSGKLW 193 (207) Q Consensus 181 ~G~~yDi~tG~v~ 193 (207) -|..|||+||.|. T Consensus 7 ngytydietgqvs 19 (59) T pfam08184 7 NGYTYDIETGQVS 19 (59) T ss_pred CCCEEEECCCEEC T ss_conf 7847770146054 No 76 >cd03296 ABC_CysA_sulfate_importer Part of the ABC transporter complex cysAWTP involved in sulfate import. Responsible for energy coupling to the transport system. The complex is composed of two ATP-binding proteins (cysA), two transmembrane proteins (cysT and cysW), and a solute-binding protein (cysP). ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=22.07 E-value=52 Score=13.69 Aligned_cols=15 Identities=7% Similarity=0.224 Sum_probs=7.1 Q ss_pred ECCCCEEEEEECCCC Q ss_conf 889876999846888 Q gi|254780663|r 186 DISSGKLWILDPTSN 200 (207) Q Consensus 186 Di~tG~v~~l~~~~~ 200 (207) =++.|++.......+ T Consensus 209 vm~~G~iv~~G~p~e 223 (239) T cd03296 209 VMNKGRIEQVGTPDE 223 (239) T ss_pred EEECCEEEEECCHHH T ss_conf 998999999848899 No 77 >cd03294 ABC_Pro_Gly_Bertaine This family comprises the glycine betaine/L-proline ATP binding subunit in bacteria and its equivalents in archaea. This transport system belong to the larger ATP-Binding Cassette (ABC) transporter superfamily. The characteristic feature of these transporters is the obligatory coupling of ATP hydrolysis to substrate translocation. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=21.93 E-value=53 Score=13.62 Aligned_cols=17 Identities=18% Similarity=0.333 Sum_probs=8.4 Q ss_pred CCCCEEEEEECCCCEEE Q ss_conf 89876999846888258 Q gi|254780663|r 187 ISSGKLWILDPTSNEFT 203 (207) Q Consensus 187 i~tG~v~~l~~~~~~F~ 203 (207) +..|++.......+-|+ T Consensus 234 m~~G~Ivq~G~p~ei~~ 250 (269) T cd03294 234 MKDGRLVQVGTPEEILT 250 (269) T ss_pred EECCEEEEECCHHHHHH T ss_conf 98999999858999984 No 78 >COG0295 Cdd Cytidine deaminase [Nucleotide transport and metabolism] Probab=21.73 E-value=56 Score=13.46 Aligned_cols=66 Identities=21% Similarity=0.296 Sum_probs=40.3 Q ss_pred HHCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHC--CCCCEEEEEEC-----CCHHHHHHHHHHHCCCCC Q ss_conf 51788662488862422345777775302578887767220--57668999815-----853898875300001346 Q gi|254780663|r 51 IFNAKPGELFVVRNVANIVPPYEPDGQHHATSAAIEFAVQG--LNVEHIVVMGH-----GRCGGIQAVLDSNNSSTS 120 (207) Q Consensus 51 i~~~~~Gd~fv~RnaGn~v~~~~~~~~~~~~~~sle~av~~--l~v~~iiV~GH-----t~CGav~aa~~~~~~~~~ 120 (207) +.-.+-|.+|.==|+-|.--+.....+ -.+|--|+.. -.+..|+|.++ |-||++...+........ T Consensus 31 a~~t~~G~i~tG~NiEnasy~~t~CAE----rsAI~~ais~G~~~~~~v~v~~~~~~~~sPCG~CRQ~i~Ef~~~d~ 103 (134) T COG0295 31 ALRTKDGRIYTGANVENASYGLTVCAE----RSAIFKAISEGKRKFDAVVVVADTGKPVSPCGACRQVLAEFCGDDT 103 (134) T ss_pred EEEECCCCEEEEEEEECCCCCCHHHHH----HHHHHHHHHCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHCCCCC T ss_conf 999279979998866025455216389----9999999973997179999974899975985889999998668774 No 79 >TIGR03415 ABC_choXWV_ATP choline ABC transporter, ATP-binding protein. Members of this protein family are the ATP-binding subunit of a three-protein transporter. This family belongs, more broadly, to the family of proline and glycine-betaine transporters, but members have been identified by direct characterization and by bioinformatic means as choline transporters. Many species have several closely-related members of this family, probably with variable abilities to act additionally on related quaternary amines. Probab=21.47 E-value=54 Score=13.61 Aligned_cols=14 Identities=14% Similarity=0.147 Sum_probs=4.9 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 87999999999999 Q gi|254780663|r 148 QTILEQLSIRNSLK 161 (207) Q Consensus 148 ~~~l~~~nV~~~v~ 161 (207) .+...+..+...+. T Consensus 195 LD~~~r~~l~~~l~ 208 (382) T TIGR03415 195 LDPLIRTQLQDELL 208 (382) T ss_pred CCHHHHHHHHHHHH T ss_conf 59999999999999 No 80 >PRK12411 cytidine deaminase; Provisional Probab=21.43 E-value=57 Score=13.42 Aligned_cols=60 Identities=17% Similarity=0.155 Sum_probs=40.3 Q ss_pred CCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEEC-----CCHHHHHHHHHHHC Q ss_conf 866248886242234577777530257888776722057668999815-----85389887530000 Q gi|254780663|r 55 KPGELFVVRNVANIVPPYEPDGQHHATSAAIEFAVQGLNVEHIVVMGH-----GRCGGIQAVLDSNN 116 (207) Q Consensus 55 ~~Gd~fv~RnaGn~v~~~~~~~~~~~~~~sle~av~~l~v~~iiV~GH-----t~CGav~aa~~~~~ 116 (207) .-|.+|.==|+-|..-+.....+..+...++-- ..-+++.|+|.+. +-||++...+.... T Consensus 33 ~~G~i~~G~NvEnasy~~t~CAEr~Ai~~avs~--g~~~~~~I~v~~~~~~~~~PCG~CRQvl~Ef~ 97 (132) T PRK12411 33 QDGKVYRGCNVENASYGLCNCAERTALFKAVSE--GDKEFVAIAIVADTKRPVPPCGACRQVMVELC 97 (132) T ss_pred CCCCEEEEECCCCCCCCCCEEHHHHHHHHHHHC--CCCCEEEEEEECCCCCCCCCCHHHHHHHHHHC T ss_conf 899999987415677887513377799999864--88768999998689986797388899999966 No 81 >PRK11131 ATP-dependent RNA helicase HrpA; Provisional Probab=21.06 E-value=47 Score=13.97 Aligned_cols=33 Identities=15% Similarity=0.151 Sum_probs=20.2 Q ss_pred HHHHHHHCCCCEEEEEEEECCCCEEE--EEECCCC Q ss_conf 88999977996799999988987699--9846888 Q gi|254780663|r 168 FVNKLEKEHMLQIHGAWFDISSGKLW--ILDPTSN 200 (207) Q Consensus 168 ~i~~~v~~g~l~I~G~~yDi~tG~v~--~l~~~~~ 200 (207) ++..-+++-.+.|=|..|=|++|... .|++.++ T Consensus 341 VlaTNIAETSlTIp~I~yVIDsG~~k~~~y~~~~~ 375 (1295) T PRK11131 341 VLATNVAETSLTVPGIKYVIDPGTARISRYSYRTK 375 (1295) T ss_pred EEECCCCCCCCEECCEEEEEECCCEEEEEECCCCC T ss_conf 98405200010005648999378401004757778 No 82 >cd00886 MogA_MoaB MogA_MoaB family. Members of this family are involved in biosynthesis of the molybdenum cofactor (MoCF) an essential cofactor of a diverse group of redox enzymes. MoCF biosynthesis is an evolutionarily conserved pathway present in eubacteria, archaea, and eukaryotes. MoCF contains a tricyclic pyranopterin, termed molybdopterin (MPT). MogA, together with MoeA, is responsible for the metal incorporation into MPT, the third step in MoCF biosynthesis. The plant homolog Cnx1 is a MoeA-MogA fusion protein. The mammalian homolog gephyrin is a MogA-MoeA fusion protein, that plays a critical role in postsynaptic anchoring of inhibitory glycine receptors and major GABAa receptor subtypes. In contrast, MoaB shows high similarity to MogA, but little is known about its physiological role. All well studied members of this family form highly stable trimers. Probab=20.96 E-value=59 Score=13.36 Aligned_cols=63 Identities=17% Similarity=0.230 Sum_probs=39.2 Q ss_pred EEEEEECCCCCCCH----------HHHHCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEECCC Q ss_conf 59999626789988----------88517886624888624223457777753025788877672205766899981585 Q gi|254780663|r 35 KIMIISCCDSRVAP----------ETIFNAKPGELFVVRNVANIVPPYEPDGQHHATSAAIEFAVQGLNVEHIVVMGHGR 104 (207) Q Consensus 35 ~~~vitC~DsRv~~----------~~i~~~~~Gd~fv~RnaGn~v~~~~~~~~~~~~~~sle~av~~l~v~~iiV~GHt~ 104 (207) ++.|||+||++..- ...+...-.+....+ +||. ........|..++...++..|+..|=|+ T Consensus 2 ~~~Vitvsd~~~~G~~~D~sG~~l~~~l~~~G~~v~~~~----ivpD-----d~~~I~~~l~~~~~~~~~DlIittGGTG 72 (152) T cd00886 2 RAAVLTVSDTRSAGEAEDRSGPALVELLEEAGHEVVAYE----IVPD-----DKDEIREALIEWADEDGVDLILTTGGTG 72 (152) T ss_pred EEEEEEECCCCCCCCCCCCHHHHHHHHHHHCCCEEEEEE----EECC-----CHHHHHHHHHHHHHCCCCCEEEECCCCC T ss_conf 899999888776799877569999999987499788999----9599-----9899999999998536988899668555 Q ss_pred HH Q ss_conf 38 Q gi|254780663|r 105 CG 106 (207) Q Consensus 105 CG 106 (207) ++ T Consensus 73 ~~ 74 (152) T cd00886 73 LA 74 (152) T ss_pred CC T ss_conf 89 No 83 >pfam01875 Memo Memo-like protein. This family contains members from all branches of life. The molecular function of this protein is unknown, but Memo (mediator of ErbB2-driven cell motility) a human protein is included in this family. It has been suggested that Memo controls cell migration by relaying extracellular chemotactic signals to the microtubule cytoskeleton. Probab=20.84 E-value=59 Score=13.35 Aligned_cols=10 Identities=20% Similarity=0.448 Sum_probs=4.3 Q ss_pred HCCCCCCEEE Q ss_conf 1788662488 Q gi|254780663|r 52 FNAKPGELFV 61 (207) Q Consensus 52 ~~~~~Gd~fv 61 (207) |.+..|++-| T Consensus 90 ~~TPlG~v~v 99 (274) T pfam01875 90 WETPLGDVKV 99 (274) T ss_pred EECCCCCCCC T ss_conf 6898867656 No 84 >cd04170 EF-G_bact Elongation factor G (EF-G) subfamily. Translocation is mediated by EF-G (also called translocase). The structure of EF-G closely resembles that of the complex between EF-Tu and tRNA. This is an example of molecular mimicry; a protein domain evolved so that it mimics the shape of a tRNA molecule. EF-G in the GTP form binds to the ribosome, primarily through the interaction of its EF-Tu-like domain with the 50S subunit. The binding of EF-G to the ribosome in this manner stimulates the GTPase activity of EF-G. On GTP hydrolysis, EF-G undergoes a conformational change that forces its arm deeper into the A site on the 30S subunit. To accommodate this domain, the peptidyl-tRNA in the A site moves to the P site, carrying the mRNA and the deacylated tRNA with it. The ribosome may be prepared for these rearrangements by the initial binding of EF-G as well. The dissociation of EF-G leaves the ribosome ready to accept the next aminoacyl-tRNA into the A site. This group Probab=20.71 E-value=40 Score=14.44 Aligned_cols=21 Identities=24% Similarity=0.398 Sum_probs=11.6 Q ss_pred HHHHHHCCCCCEEEEEECCCC Q ss_conf 999862389975999962678 Q gi|254780663|r 24 LFQELANQQKPKIMIISCCDS 44 (207) Q Consensus 24 ~~~~l~~~q~P~~~vitC~Ds 44 (207) .++.+.+.+-|.+++|..||- T Consensus 108 ~w~~~~~~~iP~i~fINKmDr 128 (268) T cd04170 108 LWEFADEAGIPRIIFINKMDR 128 (268) T ss_pred HHHHHHHCCCCEEEEEECCCC T ss_conf 999999859998999978787 No 85 >pfam06807 Clp1 Pre-mRNA cleavage complex II protein Clp1. This family consists of several pre-mRNA cleavage complex II Clp1 (or HeaB) proteins. Six different protein factors are required in vitro for 3' end formation of mammalian pre-mRNAs by endonucleolytic cleavage and polyadenylation. Clp1 is a subunit of cleavage complex IIA, which is required for cleavage, but not for polyadenylation of pre-mRNA. Probab=20.59 E-value=60 Score=13.31 Aligned_cols=23 Identities=22% Similarity=0.394 Sum_probs=14.1 Q ss_pred EEEEEE----EECCCCEEEEEECCCCE Q ss_conf 799999----98898769998468882 Q gi|254780663|r 179 QIHGAW----FDISSGKLWILDPTSNE 201 (207) Q Consensus 179 ~I~G~~----yDi~tG~v~~l~~~~~~ 201 (207) .+.|+. .|.+.+.++.|.|..+. T Consensus 145 ~v~Gf~~V~~VD~~k~~l~iLsP~p~~ 171 (187) T pfam06807 145 NVAGFGVVTGVDVEKRVITVLTPVPRE 171 (187) T ss_pred CCEEEEEEEEEECCCCEEEEECCCCCC T ss_conf 641599999972477679998789998 No 86 >COG4525 TauB ABC-type taurine transport system, ATPase component [Inorganic ion transport and metabolism] Probab=20.48 E-value=51 Score=13.78 Aligned_cols=22 Identities=27% Similarity=0.463 Sum_probs=15.7 Q ss_pred HHHCCCCCEEEEEECCCHHHHH Q ss_conf 7220576689998158538988 Q gi|254780663|r 88 AVQGLNVEHIVVMGHGRCGGIQ 109 (207) Q Consensus 88 av~~l~v~~iiV~GHt~CGav~ 109 (207) .+..-+-+.++|+|-|+||--+ T Consensus 25 sL~ia~ge~vv~lGpSGcGKTT 46 (259) T COG4525 25 SLTIASGELVVVLGPSGCGKTT 46 (259) T ss_pred CEEECCCCEEEEECCCCCCHHH T ss_conf 5023589789997688865788 No 87 >PRK13634 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=20.45 E-value=60 Score=13.30 Aligned_cols=21 Identities=14% Similarity=0.124 Sum_probs=9.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHH Q ss_conf 632468799999999999997 Q gi|254780663|r 143 NPTEKQTILEQLSIRNSLKNI 163 (207) Q Consensus 143 ~~~~~~~~l~~~nV~~~v~~L 163 (207) .|....+...+..+..-++.| T Consensus 158 EPTs~LD~~~~~~i~~ll~~L 178 (276) T PRK13634 158 EPTAGLDPKGRKEIMEMFYKL 178 (276) T ss_pred CCCCCCCHHHHHHHHHHHHHH T ss_conf 985427999999999999999 No 88 >TIGR03167 tRNA_sel_U_synt tRNA 2-selenouridine synthase. The Escherichia coli YbbB protein was shown to encode a selenophosphate-dependent tRNA 2-selenouridine synthase, essential for modification of some tRNAs to replace a sulfur atom with selenium. This enzyme works with SelD, the selenium donor protein, which also acts in selenocysteine incorporation. Although the members of this protein family show a fairly deep split, sequences from both sides of the split are supported by co-occurence with, and often proximity to, the selD gene. Probab=20.00 E-value=51 Score=13.74 Aligned_cols=66 Identities=15% Similarity=0.097 Sum_probs=32.9 Q ss_pred CCEEEEEECC----CCCCCHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEEECCCHHHH Q ss_conf 9759999626----789988885178866248886242234577777530257888776722057668999815853898 Q gi|254780663|r 33 KPKIMIISCC----DSRVAPETIFNAKPGELFVVRNVANIVPPYEPDGQHHATSAAIEFAVQGLNVEHIVVMGHGRCGGI 108 (207) Q Consensus 33 ~P~~~vitC~----DsRv~~~~i~~~~~Gd~fv~RnaGn~v~~~~~~~~~~~~~~sle~av~~l~v~~iiV~GHt~CGav 108 (207) .|. ++|=|. =|+.....+-.++ .-+.|--|+- +. -..-++..++= ...+-+.+|+.|.|+||=. T Consensus 74 ~~~-~~iyCwRGG~RS~s~~~~L~~~G---~~v~~l~GGY----Ka--yR~~v~~~l~~--~~~~~~~~vl~G~TG~GKT 141 (311) T TIGR03167 74 PPQ-PLLYCWRGGMRSGSLAWLLAQIG---FRVPRLEGGY----KA--YRRFVIDQLEE--LPQPFPLIVLGGMTGSGKT 141 (311) T ss_pred CCC-EEEEECCCCHHHHHHHHHHHHCC---CCCCCCCCCH----HH--HHHHHHHHHHH--CCCCCCEEEEECCCCCCHH T ss_conf 997-79997476187899999999859---9843157838----99--99999999971--4546876998788887789 Q ss_pred HH Q ss_conf 87 Q gi|254780663|r 109 QA 110 (207) Q Consensus 109 ~a 110 (207) .- T Consensus 142 ~i 143 (311) T TIGR03167 142 EL 143 (311) T ss_pred HH T ss_conf 99 Done!