Query gi|254780965|ref|YP_003065378.1| hypothetical protein CLIBASIA_04330 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 229 No_of_seqs 4 out of 6 Neff 2.0 Searched_HMMs 39220 Date Mon May 30 02:59:16 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780965.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 CHL00014 ndhI NADH dehydrogena 59.2 5.9 0.00015 20.7 1.8 37 75-116 12-48 (173) 2 MTH00145 CYTB cytochrome b; Pr 52.9 18 0.00046 17.7 3.4 51 8-78 42-92 (379) 3 TIGR01350 lipoamide_DH dihydro 52.2 13 0.00034 18.5 2.7 91 63-153 321-468 (481) 4 TIGR02653 Lon_rel_chp conserve 51.7 20 0.0005 17.5 3.5 54 61-132 435-491 (677) 5 MTH00033 CYTB cytochrome b; Pr 51.5 21 0.00054 17.2 3.8 53 8-80 38-90 (383) 6 pfam07423 DUF1510 Protein of u 50.7 11 0.00027 19.1 2.0 25 3-27 16-40 (214) 7 MTH00022 CYTB cytochrome b; Va 49.5 22 0.00056 17.2 3.4 53 8-80 40-92 (379) 8 pfam10717 ODV-E18 Occlusion-de 49.3 18 0.00045 17.8 2.9 25 7-31 28-52 (85) 9 CHL00070 petB cytochrome b6 48.9 19 0.00049 17.5 3.0 54 7-80 43-96 (215) 10 PRK00574 gltX glutamyl-tRNA sy 48.5 11 0.00027 19.1 1.7 65 33-101 68-139 (489) 11 PRK10260 hypothetical protein; 46.5 21 0.00054 17.3 3.0 36 184-219 204-245 (306) 12 MTH00086 CYTB cytochrome b; Pr 46.0 26 0.00066 16.7 3.8 52 8-79 30-81 (355) 13 PRK03735 cytochrome b6; Provis 43.2 26 0.00067 16.7 3.0 55 7-81 44-98 (216) 14 COG1860 Uncharacterized protei 43.1 22 0.00057 17.1 2.7 47 174-226 15-61 (147) 15 MTH00190 CYTB cytochrome b; Pr 43.1 29 0.00073 16.5 3.2 52 8-79 32-83 (370) 16 MTH00191 CYTB cytochrome b; Pr 42.2 30 0.00076 16.3 4.0 52 8-79 42-93 (380) 17 cd00284 Cytochrome_b_N Cytochr 41.2 29 0.00075 16.4 3.0 55 7-81 32-86 (200) 18 TIGR01530 nadN NAD nucleotidas 40.8 23 0.00058 17.1 2.4 54 132-186 433-488 (550) 19 pfam05094 LEF-9 Late expressio 39.6 18 0.00047 17.7 1.8 67 124-191 187-260 (487) 20 KOG0584 consensus 39.2 3 7.7E-05 22.5 -2.3 83 64-154 191-279 (632) 21 TIGR01879 hydantase amidase, h 38.8 7.9 0.0002 19.9 -0.2 44 166-212 12-60 (406) 22 MTH00156 CYTB cytochrome b; Pr 38.4 34 0.00087 16.0 3.4 53 8-80 31-83 (356) 23 MTH00053 CYTB cytochrome b; Pr 38.2 34 0.00087 16.0 3.2 52 8-79 42-93 (381) 24 MTH00016 CYTB cytochrome b; Va 36.8 36 0.00092 15.8 3.2 53 8-80 42-94 (378) 25 KOG0136 consensus 36.6 20 0.00051 17.4 1.6 48 142-191 165-227 (670) 26 PRK09783 copper/silver efflux 34.3 40 0.001 15.6 3.4 36 1-37 1-47 (407) 27 pfam11857 DUF3377 Domain of un 33.8 40 0.001 15.5 3.4 16 20-36 52-67 (74) 28 MTH00034 CYTB cytochrome b; Va 32.5 42 0.0011 15.4 3.3 53 8-80 42-94 (379) 29 MTH00046 CYTB cytochrome b; Va 32.5 42 0.0011 15.4 3.7 53 8-80 32-84 (355) 30 TIGR00403 ndhI NADH-plastoquin 32.1 19 0.00049 17.5 0.9 42 73-119 11-52 (187) 31 PRK13664 hypothetical protein; 29.0 44 0.0011 15.3 2.3 16 7-22 10-25 (62) 32 PRK01844 hypothetical protein; 28.4 45 0.0011 15.3 2.2 59 5-86 7-65 (72) 33 COG3672 Predicted transglutami 28.2 50 0.0013 14.9 2.7 49 172-222 92-155 (191) 34 COG3285 Predicted eukaryotic-t 28.2 50 0.0013 14.9 2.9 34 76-109 188-221 (299) 35 COG0708 XthA Exonuclease III [ 25.8 55 0.0014 14.7 2.5 81 35-123 139-222 (261) 36 PRK10190 hypothetical protein; 25.3 57 0.0014 14.6 3.6 36 184-219 201-242 (310) 37 MTH00119 CYTB cytochrome b; Pr 24.5 59 0.0015 14.5 3.7 52 8-79 42-93 (380) 38 cd05493 Bromo_ALL-1 Bromodomai 24.2 38 0.00096 15.7 1.2 66 22-87 35-110 (131) 39 TIGR00770 Dcu transporter, ana 23.7 35 0.00089 15.9 1.0 16 141-156 195-210 (447) 40 KOG4606 consensus 23.5 36 0.00093 15.8 1.0 33 2-34 30-62 (126) 41 PRK09814 hypothetical protein; 23.2 25 0.00064 16.8 0.2 12 156-167 191-202 (337) 42 TIGR02335 hydr_PhnA phosphonoa 22.7 64 0.0016 14.3 2.5 91 92-196 38-146 (408) 43 pfam02751 TFIIA_gamma_C Transc 22.6 24 0.00061 17.0 -0.0 21 38-58 5-25 (51) 44 TIGR01318 gltD_gamma_fam gluta 22.4 45 0.0011 15.3 1.3 12 63-74 306-317 (480) 45 pfam02288 Dehydratase_MU Dehyd 22.1 39 0.00098 15.6 0.9 35 22-58 94-129 (175) 46 PRK10523 lipoprotein involved 22.0 66 0.0017 14.2 2.1 18 6-23 5-22 (234) 47 COG0158 Fbp Fructose-1,6-bisph 21.8 66 0.0017 14.2 2.2 111 49-184 57-180 (326) 48 TIGR00759 aceE 2-oxo-acid dehy 21.5 46 0.0012 15.2 1.2 74 73-163 748-835 (905) 49 MTH00074 CYTB cytochrome b; Pr 21.4 67 0.0017 14.1 4.0 53 8-80 42-94 (380) 50 COG0178 UvrA Excinuclease ATPa 21.2 19 0.00049 17.5 -0.8 71 129-205 677-751 (935) 51 MTH00131 CYTB cytochrome b; Pr 20.8 69 0.0018 14.1 3.8 52 8-79 41-92 (380) 52 COG3889 Predicted solute bindi 20.5 64 0.0016 14.3 1.8 23 1-23 1-24 (872) 53 pfam10856 DUF2678 Protein of u 20.1 65 0.0017 14.2 1.8 17 9-25 35-51 (114) No 1 >CHL00014 ndhI NADH dehydrogenase subunit I Probab=59.24 E-value=5.9 Score=20.70 Aligned_cols=37 Identities=30% Similarity=0.601 Sum_probs=29.1 Q ss_pred CHHHHHHHHHHHCCCCHHHHHHHHHCCCCCEEEEEEEEECCC Q ss_conf 889987999851057413777586417702589997553388 Q gi|254780965|r 75 TQEALRAAQYLEKYLEPGYVSSYRKEALIGIVEVQYPYRANE 116 (229) Q Consensus 75 t~eAl~aa~~L~~~~e~gYV~sf~E~g~vdvatV~YPyRANe 116 (229) +++|++|++++-. |...+|..|-+-- +|++||+-.-+ T Consensus 12 ~~~~~~a~~~i~q----Gm~vT~~hmfrkp-vTiqYP~ek~~ 48 (173) T CHL00014 12 GQQTVRAARYIGQ----GFMITLSHANRLP-VTIQYPYEKLI 48 (173) T ss_pred HHHHHHHHHHHHH----HHHHHHHHHCCCC-CCCCCCCCCCC T ss_conf 9999999999983----1727789877997-41139888888 No 2 >MTH00145 CYTB cytochrome b; Provisional Probab=52.90 E-value=18 Score=17.72 Aligned_cols=51 Identities=14% Similarity=0.290 Sum_probs=35.9 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHH Q ss_conf 78899999999887403578742167652100012745676313002110133578999999733888899 Q gi|254780965|r 8 IILLVLVGVILAFYFQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEA 78 (229) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eA 78 (229) .++.++-|+.|++++.. |.+..|..-..-.||-|+. .+.|.||+.|||-== T Consensus 42 l~iQiitGi~Lam~Y~p-------------------~~~~AF~Sv~~i~rdv~~G-WliR~~HanGAS~fF 92 (379) T MTH00145 42 LVIQILTGLFLSMHYTA-------------------HVDLAFSSVIHIMRDVNYG-WLLRYLHANGASFFF 92 (379) T ss_pred HHHHHHHHHHHHHHHCC-------------------CCHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHHH T ss_conf 99999999999977479-------------------9065899899999646304-999987753568999 No 3 >TIGR01350 lipoamide_DH dihydrolipoamide dehydrogenase; InterPro: IPR006258 These sequences represent dihydrolipoamide dehydrogenase, a flavoprotein that acts in a number of ways. It is the E3 component of dehydrogenase complexes for pyruvate, 2-oxoglutarate, 2-oxoisovalerate, and acetoin. It can also serve as the L protein of the glycine cleavage system. This family includes a few members known to have distinct functions (ferric leghemoglobin reductase and NADH:ferredoxin oxidoreductase) but that may be predicted by homology to act as dihydrolipoamide dehydrogenase as well. The motif GGXCXXXGCXP near the N-terminus contains a redox-active disulphide. ; GO: 0004148 dihydrolipoyl dehydrogenase activity, 0050660 FAD binding, 0006118 electron transport. Probab=52.20 E-value=13 Score=18.53 Aligned_cols=91 Identities=20% Similarity=0.288 Sum_probs=62.1 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHCCCC------H-------------------HHHHHHHHCCCCCEEEEEEEEECCC- Q ss_conf 8999999733888899879998510574------1-------------------3777586417702589997553388- Q gi|254780965|r 63 NCIITSIKKTGGTQEALRAAQYLEKYLE------P-------------------GYVSSYRKEALIGIVEVQYPYRANE- 116 (229) Q Consensus 63 ~Cl~~~m~~tGat~eAl~aa~~L~~~~e------~-------------------gYV~sf~E~g~vdvatV~YPyRANe- 116 (229) |+....|=+-=|+.|++.||+.+-+-.+ + ||-+.=-+..++++....|||+||= T Consensus 321 Dv~gg~mLAH~A~~eG~vAAe~iaG~~~~~~~~~~~d~~~vP~~iYT~PEvAsVGlTE~qAk~~G~~~k~gkfpF~AnGk 400 (481) T TIGR01350 321 DVIGGPMLAHVASHEGIVAAENIAGKEPSKKKKAEIDYKAVPSCIYTSPEVASVGLTEEQAKEEGIDVKIGKFPFAANGK 400 (481) T ss_pred ECCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCCEEEECCCHHHHHHCCCCEEEEEECCCCCCH T ss_conf 34787026899998789999998187611000012553578877846873663038879998559858999701010421 Q ss_pred -------CCEE-EEE------ECCCCCEECHHHHH-----------------HHHHHCCCHHHHHHHH Q ss_conf -------8603-897------52776211643512-----------------7676303843799996 Q gi|254780965|r 117 -------NSGT-LLI------PTVGSHIIDINDSS-----------------VHQLYDSSPIAKDFVL 153 (229) Q Consensus 117 -------N~gT-~Lv------pt~gssiIdv~d~~-----------------la~~~~aDPty~~fik 153 (229) ..|= -+| -=+|-|||=++++- |.+++-+-||++|-++ T Consensus 401 A~a~~eC~~GF~K~i~D~~~~~iLG~hiiGp~AtelI~E~~la~~~e~T~~el~~tIHpHPT~SEa~~ 468 (481) T TIGR01350 401 ALALGECTDGFVKIIADKKTGEILGAHIIGPHATELISEAALAIELELTVEELAKTIHPHPTLSEAIK 468 (481) T ss_pred HHCCCCCCCCEEEEEEECCCCEEEEEEEECCCHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCHHHHHH T ss_conf 20005678761899996477818889988424432688999987748988999985278988889999 No 4 >TIGR02653 Lon_rel_chp conserved hypothetical protein; InterPro: IPR013473 This entry describes a protein family of unknown function, about 690 residues in length, in which some members show C-terminal sequence similarity to IPR008269 from INTERPRO, which is the Lon protease C-terminal proteolytic domain, from MEROPS family S16. However, the annotated catalytic sites of Escherichia coli Lon protease are not conserved in members of this family. Members have a motif GP[RK][GS]TGKS, similar to the ATP-binding P-loop motif GxxGxGK[ST].. Probab=51.66 E-value=20 Score=17.46 Aligned_cols=54 Identities=20% Similarity=0.353 Sum_probs=37.4 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHCCCCCEEEEEEEEECCCCCEEEEE--E-CCCCCEEC Q ss_conf 578999999733888899879998510574137775864177025899975533888603897--5-27762116 Q gi|254780965|r 61 FDNCIITSIKKTGGTQEALRAAQYLEKYLEPGYVSSYRKEALIGIVEVQYPYRANENSGTLLI--P-TVGSHIID 132 (229) Q Consensus 61 ~~~Cl~~~m~~tGat~eAl~aa~~L~~~~e~gYV~sf~E~g~vdvatV~YPyRANeN~gT~Lv--p-t~gssiId 132 (229) ...||--+|..- =|+-|||-| +|++----|.|+|+-||+---.-| | .+|+.+|+ T Consensus 435 ~~~~l~yalegR------RRVKEQLKK------------~Gg~Effdv~FSYiDNe~leE~fv~VPE~GG~~lIp 491 (677) T TIGR02653 435 VREILDYALEGR------RRVKEQLKK------------LGGLEFFDVNFSYIDNESLEEFFVSVPEEGGSKLIP 491 (677) T ss_pred HHHHHHHHHCCC------CHHHHHHHH------------HCCCEEEECEEEEEECCCCCEEEEECCCCCCCCCCC T ss_conf 999988762356------203666643------------067306622046720787312277457004432265 No 5 >MTH00033 CYTB cytochrome b; Provisional Probab=51.52 E-value=21 Score=17.25 Aligned_cols=53 Identities=19% Similarity=0.289 Sum_probs=36.7 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHHHH Q ss_conf 7889999999988740357874216765210001274567631300211013357899999973388889987 Q gi|254780965|r 8 IILLVLVGVILAFYFQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEALR 80 (229) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl~ 80 (229) .+..++-|+.|+++++-+ .+..|..-..-.||-|+. .+.|.||+.|||.==+- T Consensus 38 l~iQiitGi~Lam~Y~p~-------------------~~~AF~SV~~ImrdV~~G-wliR~~HanGAS~fF~~ 90 (383) T MTH00033 38 LGIQILTGVLLAMHYRSD-------------------VSLAFSSVAHIVRDVNYG-WILRYVHANGASLFFIC 90 (383) T ss_pred HHHHHHHHHHHHHHHCCC-------------------HHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHHHHH T ss_conf 999999999999885799-------------------177999999998057408-99998885146799999 No 6 >pfam07423 DUF1510 Protein of unknown function (DUF1510). This family consists of several hypothetical bacterial proteins of around 200 residues in length. The function of this family is unknown. Probab=50.71 E-value=11 Score=19.11 Aligned_cols=25 Identities=36% Similarity=0.390 Sum_probs=19.0 Q ss_pred EEEHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 0103478899999999887403578 Q gi|254780965|r 3 YKIAIIILLVLVGVILAFYFQQSST 27 (229) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~ 27 (229) ..|||+|.++||.++....|--+++ T Consensus 16 LNiaI~iV~llIiiva~~lf~~~~~ 40 (214) T pfam07423 16 LNIAIGIVVVLIIIVAYQLFFPSSP 40 (214) T ss_pred HHHHHHHHHHHHHHHHHHEEECCCC T ss_conf 5799999999999976640235887 No 7 >MTH00022 CYTB cytochrome b; Validated Probab=49.53 E-value=22 Score=17.18 Aligned_cols=53 Identities=15% Similarity=0.263 Sum_probs=37.3 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHHHH Q ss_conf 7889999999988740357874216765210001274567631300211013357899999973388889987 Q gi|254780965|r 8 IILLVLVGVILAFYFQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEALR 80 (229) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl~ 80 (229) .++.++-|+.|++++..+ .+..|..-..-.||-|+. .+.|.||+.|||-=-+. T Consensus 40 l~iQiiTGi~Lam~Y~p~-------------------~~~AF~SV~~ImrdV~~G-WliR~~HanGAS~fF~~ 92 (379) T MTH00022 40 LVLQIITGCFLSMHYCAD-------------------VSLAFASVGHIMRDVNYG-FLLRYLHANGASLFFLC 92 (379) T ss_pred HHHHHHHHHHHHHHHCCC-------------------CHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHHHHH T ss_conf 999999999999772799-------------------045999999998257407-99998876347899999 No 8 >pfam10717 ODV-E18 Occlusion-derived virus envelope protein ODV-E18. This family of occlusion-derived viral envelope proteins are detected in viral-induced intranuclear microvesicles and are not detected in the plasma membrane, cytoplasmic membranes, or the nuclear envelope. The ODV-E18 protein is encoded by baculovirus late genes with transcription initiating from a TAAG motif. It exists as a dimer in the ODV envelope and contains a hydrophobic domain which is putatively acting as a target or retention signal for intranuclear microvesicles. Probab=49.33 E-value=18 Score=17.77 Aligned_cols=25 Identities=44% Similarity=0.529 Sum_probs=18.1 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCE Q ss_conf 4788999999998874035787421 Q gi|254780965|r 7 IIILLVLVGVILAFYFQQSSTPQNH 31 (229) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~ 31 (229) ..||.|||=+||-.-+-|||.+-|. T Consensus 28 MTILivLVIIiLLImlfqsSS~~ns 52 (85) T pfam10717 28 MTILIVLVIIILLILLFQSSSNGNS 52 (85) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 9999999999999999603589987 No 9 >CHL00070 petB cytochrome b6 Probab=48.92 E-value=19 Score=17.54 Aligned_cols=54 Identities=20% Similarity=0.365 Sum_probs=36.8 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHHHH Q ss_conf 47889999999988740357874216765210001274567631300211013357899999973388889987 Q gi|254780965|r 7 IIILLVLVGVILAFYFQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEALR 80 (229) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl~ 80 (229) ..++.++-|+.|++|++.+.. .-++.-..-++|-+| --+.|.+|..||+-=-+. T Consensus 43 ~f~iqiiTGi~L~~~Y~p~~~-------------------~Af~Sv~~I~~~v~~-GwliR~~H~~gAs~~~~~ 96 (215) T CHL00070 43 CFLVQVATGFAMTFYYRPTVT-------------------EAFASVQYIMTEVNF-GWLIRSVHRWSASMMVLM 96 (215) T ss_pred HHHHHHHHHHHHHHHHCCCCH-------------------HHHHHHHHHHHHCCC-HHHHHHHHHHHHHHHHHH T ss_conf 999999999999977389907-------------------899999999951623-999999999888999999 No 10 >PRK00574 gltX glutamyl-tRNA synthetase; Reviewed Probab=48.50 E-value=11 Score=19.12 Aligned_cols=65 Identities=17% Similarity=0.161 Sum_probs=44.0 Q ss_pred EEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCC-----CHHHHHH--HHHHHCCCCHHHHHHHHHCC Q ss_conf 765210001274567631300211013357899999973388-----8899879--99851057413777586417 Q gi|254780965|r 33 LFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGG-----TQEALRA--AQYLEKYLEPGYVSSYRKEA 101 (229) Q Consensus 33 ~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGa-----t~eAl~a--a~~L~~~~e~gYV~sf~E~g 101 (229) |-|.|.....|+-..|+|-+|..-..+ ....-+.++-| |+|-|.+ .++...+..+.|--.++.+. T Consensus 68 l~~DE~p~~~g~~gPY~QS~R~~~Y~~----~a~~Ll~~G~AY~Cfct~eel~~~r~~~~~~g~~~~Y~~~cr~l~ 139 (489) T PRK00574 68 IDWDEGPDKGGEYGPYRQSERLDIYQK----YADQLLEEGKAYYCFCTEEELEAEREKQEARGETPKYDGKCLGLT 139 (489) T ss_pred CCCCCCCCCCCCCCCCCCCCCHHHHHH----HHHHHHHCCCCEECCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCC T ss_conf 888778777888888351268999999----999999749974525779999998999975699987872323389 No 11 >PRK10260 hypothetical protein; Provisional Probab=46.55 E-value=21 Score=17.28 Aligned_cols=36 Identities=36% Similarity=0.494 Sum_probs=20.1 Q ss_pred CCCCCCCH-----HHHHHHEEEEECCCCCEEEEEE-EEEEEC Q ss_conf 27753314-----4103110021078775100135-666404 Q gi|254780965|r 184 LRSCHGCE-----DIGFMDIAYKFTTKGAFIGRKV-FGIRND 219 (229) Q Consensus 184 LRdCHaC~-----d~G~a~iAyDFd~dGnFIG~kV-~~V~~~ 219 (229) +|.-|+|- |+-++-=.-.-+++-.+|-|-| ++++.+ T Consensus 204 ~rvS~GCIRl~peDI~~Lf~~V~vGT~V~iInqPvK~~~~~d 245 (306) T PRK10260 204 LRVSHGCVRLRNEDIKFLFEKVPVGTRVQFIDEPVKATTEPD 245 (306) T ss_pred CCCCCCEECCCHHHHHHHHHCCCCCCEEEEEECCEEEEECCC T ss_conf 201687512375719999950999985999715779887269 No 12 >MTH00086 CYTB cytochrome b; Provisional Probab=46.02 E-value=26 Score=16.72 Aligned_cols=52 Identities=21% Similarity=0.446 Sum_probs=34.9 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHHH Q ss_conf 788999999998874035787421676521000127456763130021101335789999997338888998 Q gi|254780965|r 8 IILLVLVGVILAFYFQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEAL 79 (229) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl 79 (229) .++.++-|+.|++|++.++ +.-|..-..-.||.|+. .+.|.||+.|||-=-+ T Consensus 30 l~iQiiTGifLam~Y~p~~-------------------~~AF~Sv~~ImrdV~~G-WliR~~HanGAS~fF~ 81 (355) T MTH00086 30 LVFQILTGTFLTFYYTADS-------------------EMAFSSVQYIMYEVNFG-WLFRIFHFNGASLFFI 81 (355) T ss_pred HHHHHHHHHHHHHHHCCCC-------------------HHHHHHHHHHHCCCCHH-HHHHHHHHHHHHHHHH T ss_conf 9999999999997847990-------------------77999999998057228-9999876312479999 No 13 >PRK03735 cytochrome b6; Provisional Probab=43.18 E-value=26 Score=16.70 Aligned_cols=55 Identities=13% Similarity=0.275 Sum_probs=37.3 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHHHHH Q ss_conf 478899999999887403578742167652100012745676313002110133578999999733888899879 Q gi|254780965|r 7 IIILLVLVGVILAFYFQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEALRA 81 (229) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl~a 81 (229) ..++.++-|+.|++|++-+. +.-++.-..-.+|-++ --+.|.+|..||+.=-+.. T Consensus 44 ~f~iqiiTGi~L~~~Y~p~~-------------------~~Af~Sv~~I~~ev~~-GwliR~iH~~gAs~~~~~~ 98 (216) T PRK03735 44 CFLIQFATGFAMTFYYKPTV-------------------KNAWESVYYLMNEVAF-GWLIRSMHRWSASMMVLMM 98 (216) T ss_pred HHHHHHHHHHHHHHHHCCCH-------------------HHHHHHHHHHHHCCCC-HHHHHHHHHHHHHHHHHHH T ss_conf 99999999999995408987-------------------8999999999823611-9999999998899999999 No 14 >COG1860 Uncharacterized protein conserved in archaea [Function unknown] Probab=43.14 E-value=22 Score=17.12 Aligned_cols=47 Identities=23% Similarity=0.511 Sum_probs=34.6 Q ss_pred CEEEEEEEEECCCCCCCHHHHHHHEEEEECCCCCEEEEEEEEEEECCCCCCCH Q ss_conf 80789999622775331441031100210787751001356664047768501 Q gi|254780965|r 174 GLIELIFSYPLRSCHGCEDIGFMDIAYKFTTKGAFIGRKVFGIRNDNAKYPMH 226 (229) Q Consensus 174 Ggqrl~F~yPLRdCHaC~d~G~a~iAyDFd~dGnFIG~kV~~V~~~~~~~p~~ 226 (229) -|.+|+|.=|+-.|-+|.-- +.-+...+-+-| ||+.||.+.-+-|+| T Consensus 15 ~G~eFif~Gp~eeCk~Cr~k---~~C~nLe~GrrY---kVv~VR~~~~~C~ih 61 (147) T COG1860 15 EGLEFIFLGPAEECKDCRLK---NVCLNLEEGRRY---KVVEVRSQVQPCPIH 61 (147) T ss_pred CCCEEEEECCCCCCCCCCCH---HEEECCCCCCEE---EEEEEHHCCCCCCEE T ss_conf 77369995684004478513---103446788378---998840056645254 No 15 >MTH00190 CYTB cytochrome b; Provisional Probab=43.10 E-value=29 Score=16.46 Aligned_cols=52 Identities=21% Similarity=0.241 Sum_probs=34.4 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHHH Q ss_conf 788999999998874035787421676521000127456763130021101335789999997338888998 Q gi|254780965|r 8 IILLVLVGVILAFYFQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEAL 79 (229) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl 79 (229) ....++-|+.|++++.-+. +.-|..-..-.||.|+. .+.|.||+.|||-==+ T Consensus 32 l~iQiitGi~Lam~Y~p~~-------------------~~AF~Sv~~I~rdV~~G-wliR~~H~~GAS~fF~ 83 (370) T MTH00190 32 LVLQILSGVFLSMHYVPSG-------------------DLAFMSVDHIMRDVNGG-WVLRLVHVWGSSFFFL 83 (370) T ss_pred HHHHHHHHHHHHHHHCCCC-------------------HHHHHHHHHHHCCCCCH-HHHHHHHHHHHHHHHH T ss_conf 9999999999998857990-------------------64999999997357437-9999887646899999 No 16 >MTH00191 CYTB cytochrome b; Provisional Probab=42.16 E-value=30 Score=16.35 Aligned_cols=52 Identities=17% Similarity=0.313 Sum_probs=35.3 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHHH Q ss_conf 788999999998874035787421676521000127456763130021101335789999997338888998 Q gi|254780965|r 8 IILLVLVGVILAFYFQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEAL 79 (229) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl 79 (229) .++.++-|+.|++++.-+. +..|..-..-.||-|+. .+.|.||..|||-==+ T Consensus 42 l~iQiitGi~L~m~Y~p~~-------------------~~AF~Sv~~ImrdV~~G-wllR~~H~~GAS~fF~ 93 (380) T MTH00191 42 LIIQIVTGLFLAMHYTADI-------------------SLAFSSVAHICRDVNYG-WLIRNLHANGASFFFI 93 (380) T ss_pred HHHHHHHHHHHHHHHCCCC-------------------HHHHHHHHHHHCCCCCH-HHHHHHHHCCCHHHHH T ss_conf 9999999999998856992-------------------45999999998127426-9999988515249999 No 17 >cd00284 Cytochrome_b_N Cytochrome b (N-terminus)/b6/petB: Cytochrome b is a subunit of cytochrome bc1, an 11-subunit mitochondrial respiratory enzyme. Cytochrome b spans the mitochondrial membrane with 8 transmembrane helices (A-H) in eukaryotes. In plants and cyanobacteria, cytochrome b6 is analogous to eukaryote cytochrome b, containing two chains: helices A-D are encoded by the petB gene and helices E-H are encoded by the petD gene in these organisms. Cytochrome b/b6 contains two bound hemes and two ubiquinol/ubiquinone binding sites. The C-terminal portion of cytochrome b is described in a separate CD. Probab=41.21 E-value=29 Score=16.37 Aligned_cols=55 Identities=24% Similarity=0.353 Sum_probs=35.3 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHHHHH Q ss_conf 478899999999887403578742167652100012745676313002110133578999999733888899879 Q gi|254780965|r 7 IIILLVLVGVILAFYFQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEALRA 81 (229) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl~a 81 (229) ..++.++-|+.|++|+..+.. +|+ ..-..-.+|-++ -++.|.||.-||+.=-+.. T Consensus 32 ~~~iQiiTGi~L~~yY~P~~~------------------~Af-~Sv~~I~~ev~~-GwllR~~H~~gAs~~~~~~ 86 (200) T cd00284 32 CLVIQILTGVFLAMHYTPDVT------------------LAF-SSVQYIMRDVNF-GWLIRSLHANGASMFFLML 86 (200) T ss_pred HHHHHHHHHHHHHHHHCCCHH------------------HHH-HHHHHHHCCCCC-HHHHHHHHHHHHHHHHHHH T ss_conf 999999999999989379866------------------899-999999754733-9999986460699999999 No 18 >TIGR01530 nadN NAD nucleotidase; InterPro: IPR006420 This family describes NadN of Haemophilus influenzae and a small number of close homologs in pathogenic, Gram-negative bacteria. NadN is a periplasmic enzyme that cleaves NAD (nicotinamide adenine dinucleotide) to NMN (nicotinamide mononucleotide) and AMP. The NMN must be converted by a 5-nucleotidase to nicotinamide riboside for import. NadN belongs a large family of 5-nucleotidases and has NMN 5-nucleotidase activity for NMN, AMP, etc.; GO: 0008253 5'-nucleotidase activity. Probab=40.79 E-value=23 Score=17.10 Aligned_cols=54 Identities=20% Similarity=0.316 Sum_probs=38.4 Q ss_pred CHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCC--CCCCCCCCCCEEEEEEEEECCC Q ss_conf 643512767630384379999608884112676--3123468778078999962277 Q gi|254780965|r 132 DINDSSVHQLYDSSPIAKDFVLRNPGVFPYSAG--HFVKSSHKDGLIELIFSYPLRS 186 (229) Q Consensus 132 dv~d~~la~~~~aDPty~~fikanP~i~P~gag--~sav~T~e~Ggqrl~F~yPLRd 186 (229) +++-+.|-|++.+ -.-=+|.----++|||||| .+|.++|..-|-|++-.=+++- T Consensus 433 ~~~G~~~KQ~~ED-AlQFAL~DGSTGAFPYGAG~RYEA~~~P~~~GKR~~~~EV~nK 488 (550) T TIGR01530 433 DVEGALLKQVLED-ALQFALVDGSTGAFPYGAGVRYEAREEPDEEGKRLVKLEVLNK 488 (550) T ss_pred EECCCHHHHHHHH-HHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECC T ss_conf 8434034678998-8887512777676667775020146883111886999976316 No 19 >pfam05094 LEF-9 Late expression factor 9 (LEF-9). Late expression factor 9 (LEF-9) is one of the primary components of RNA polymerase produced by baculoviruses. LEF-9 is homologous to the largest beta-subunit of prokaryotic DNA-directed RNA polymerase. Probab=39.59 E-value=18 Score=17.66 Aligned_cols=67 Identities=13% Similarity=0.155 Sum_probs=46.1 Q ss_pred ECCCCCEECHHHHHHHHH-HCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCE-----EEE-EEEEECCCCCCCH Q ss_conf 527762116435127676-3038437999960888411267631234687780-----789-9996227753314 Q gi|254780965|r 124 PTVGSHIIDINDSSVHQL-YDSSPIAKDFVLRNPGVFPYSAGHFVKSSHKDGL-----IEL-IFSYPLRSCHGCE 191 (229) Q Consensus 124 pt~gssiIdv~d~~la~~-~~aDPty~~fikanP~i~P~gag~sav~T~e~Gg-----qrl-~F~yPLRdCHaC~ 191 (229) -+++-.||||.. ++..- =|+-..+...-+|||+++--+.-.|-|.-+++.+ +++ .|.=.=|||+|=- T Consensus 187 ~Ng~NeIipPk~-Si~RYYGR~v~nvrawtTRHPNISQLSTq~s~V~~~~~~~~wNvkvGLGtF~GANrDCDGDK 260 (487) T pfam05094 187 LNGVNEIVPPKS-SIQRYYGRDVDNVRAWTTRHPNISQLSTQVSDVRENDNYTDWNVKVGLGTFPGANTDCDGDK 260 (487) T ss_pred CCCCCCCCCCHH-HHHHHHCCCCCCEEEECCCCCCHHHHHHHHHEEECCCCCCCCCEEEEEEECCCCCCCCCCCC T ss_conf 167763259644-55676087667535404789987874134403434677776635786520257888889983 No 20 >KOG0584 consensus Probab=39.22 E-value=3 Score=22.53 Aligned_cols=83 Identities=25% Similarity=0.364 Sum_probs=59.9 Q ss_pred HHHHHHHHC-----CCCHHHHHHHHHHHCCCCHHHHHHHHHCCCCCEEEEEEEEECCCCCEE-EEEECCCCCEECHHHHH Q ss_conf 999999733-----888899879998510574137775864177025899975533888603-89752776211643512 Q gi|254780965|r 64 CIITSIKKT-----GGTQEALRAAQYLEKYLEPGYVSSYRKEALIGIVEVQYPYRANENSGT-LLIPTVGSHIIDINDSS 137 (229) Q Consensus 64 Cl~~~m~~t-----Gat~eAl~aa~~L~~~~e~gYV~sf~E~g~vdvatV~YPyRANeN~gT-~Lvpt~gssiIdv~d~~ 137 (229) -|++.|++- =||||-.+---|=|.|-|.-=|-||- |-.+-.||-+|||.--+|.-. |--+++| |.|+..+ T Consensus 191 GLAtl~r~s~aksvIGTPEFMAPEmYEE~YnE~VDVYaFG-MCmLEMvT~eYPYsEC~n~AQIYKKV~SG---iKP~sl~ 266 (632) T KOG0584 191 GLATLLRKSHAKSVIGTPEFMAPEMYEENYNELVDVYAFG-MCMLEMVTSEYPYSECTNPAQIYKKVTSG---IKPAALS 266 (632) T ss_pred HHHHHHHCCCCCEECCCCCCCCHHHHHHHCCHHHHHHHHH-HHHHHHHHCCCCHHHHCCHHHHHHHHHCC---CCHHHHH T ss_conf 6899850255341226835347677765212044445556-79999885047824307999999998758---9877863 Q ss_pred HHHHHCCCHHHHHHHHH Q ss_conf 76763038437999960 Q gi|254780965|r 138 VHQLYDSSPIAKDFVLR 154 (229) Q Consensus 138 la~~~~aDPty~~fika 154 (229) -- .||-+++||-. T Consensus 267 kV----~dPevr~fIek 279 (632) T KOG0584 267 KV----KDPEVREFIEK 279 (632) T ss_pred CC----CCHHHHHHHHH T ss_conf 36----88889999999 No 21 >TIGR01879 hydantase amidase, hydantoinase/carbamoylase family; InterPro: IPR010158 Enzymes in this subfamily hydrolyse the amide bonds of compounds containing carbamoyl groups or hydantoin rings . These enzymes are members of the broader family of amidases.; GO: 0016813 hydrolase activity acting on carbon-nitrogen (but not peptide) bonds in linear amidines, 0008152 metabolic process. Probab=38.82 E-value=7.9 Score=19.93 Aligned_cols=44 Identities=25% Similarity=0.336 Sum_probs=17.3 Q ss_pred CCCCCCCCCEEEEEEEEECCC-----CCCCHHHHHHHEEEEECCCCCEEEEE Q ss_conf 123468778078999962277-----53314410311002107877510013 Q gi|254780965|r 166 FVKSSHKDGLIELIFSYPLRS-----CHGCEDIGFMDIAYKFTTKGAFIGRK 212 (229) Q Consensus 166 sav~T~e~Ggqrl~F~yPLRd-----CHaC~d~G~a~iAyDFd~dGnFIG~k 212 (229) |.+.+|+.|-+||.|+-==|. -.-|..+| +.-.||.=||.|||+ T Consensus 12 evG~~~~~G~~RL~~S~E~r~~~~~~~~~mr~~G---l~v~~D~~GNL~gr~ 60 (406) T TIGR01879 12 EVGADPEGGVTRLALSEEDREAKDLVKKRMRAAG---LEVEIDEVGNLIGRK 60 (406) T ss_pred CCCCCCCCCCEECCCCHHHHHHHHHHHHHHHHCC---CEEEECCCCCCEECC T ss_conf 0365786542222478778999999999998659---817652522302106 No 22 >MTH00156 CYTB cytochrome b; Provisional Probab=38.40 E-value=34 Score=15.98 Aligned_cols=53 Identities=15% Similarity=0.283 Sum_probs=38.1 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHHHH Q ss_conf 7889999999988740357874216765210001274567631300211013357899999973388889987 Q gi|254780965|r 8 IILLVLVGVILAFYFQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEALR 80 (229) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl~ 80 (229) .++.++-|+.|++|++-+ .+..|..-..-.||-|+. -+.|.||+.|||-==+. T Consensus 31 l~iQiiTGi~Lam~Y~p~-------------------~~~AF~Sv~~I~~dv~~G-wliR~~H~~gAs~ff~~ 83 (356) T MTH00156 31 LMIQILTGLFLAMHYTAD-------------------ISLAFSSVIHICRDVNYG-WLLRTLHANGASFFFIC 83 (356) T ss_pred HHHHHHHHHHHHHHCCCC-------------------HHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHHHHH T ss_conf 999999999999770899-------------------177999799999326217-99999998566999999 No 23 >MTH00053 CYTB cytochrome b; Provisional Probab=38.24 E-value=34 Score=15.97 Aligned_cols=52 Identities=19% Similarity=0.342 Sum_probs=35.5 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHHH Q ss_conf 788999999998874035787421676521000127456763130021101335789999997338888998 Q gi|254780965|r 8 IILLVLVGVILAFYFQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEAL 79 (229) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl 79 (229) ..+.++-|+.|++++..+ .+.-|..-..-.||.|+. .+.|.||+.|||-=-+ T Consensus 42 l~iQi~TGi~Lam~Y~p~-------------------~~~AF~Sv~~ImrdV~~G-WliR~~Ha~GAS~fF~ 93 (381) T MTH00053 42 LIIQIITGIFLAMHYCAD-------------------VNLAFSSVAHITRDVNYG-FILRYLHANGASMFFL 93 (381) T ss_pred HHHHHHHHHHHHHHHCCC-------------------CHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHHHH T ss_conf 999999999999774799-------------------056999999999658417-9999878746899999 No 24 >MTH00016 CYTB cytochrome b; Validated Probab=36.76 E-value=36 Score=15.82 Aligned_cols=53 Identities=13% Similarity=0.320 Sum_probs=36.1 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHHHH Q ss_conf 7889999999988740357874216765210001274567631300211013357899999973388889987 Q gi|254780965|r 8 IILLVLVGVILAFYFQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEALR 80 (229) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl~ 80 (229) .++.++-|+.|++++.- |.+..|..-.--.||-|+. .+.|.||+.|||-=-+- T Consensus 42 l~iQiitGi~Lam~Y~p-------------------~~~~AF~Sv~~Imrdv~~G-wliR~~HangAS~fF~~ 94 (378) T MTH00016 42 LVIQILTGLFLSMHYTP-------------------HIDLAFSSVAHISRDVNYG-WLLRNLHANGASWFFIC 94 (378) T ss_pred HHHHHHHHHHHHHHHCC-------------------CHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHHHHH T ss_conf 99999999999987479-------------------9166999999999557408-99998776578999999 No 25 >KOG0136 consensus Probab=36.61 E-value=20 Score=17.44 Aligned_cols=48 Identities=25% Similarity=0.374 Sum_probs=34.3 Q ss_pred HCCCHHHHHHHHHCCCC-----CCCCCCCCCCCCCCCCEEEE----------EEEEECCCCCCCH Q ss_conf 30384379999608884-----11267631234687780789----------9996227753314 Q gi|254780965|r 142 YDSSPIAKDFVLRNPGV-----FPYSAGHFVKSSHKDGLIEL----------IFSYPLRSCHGCE 191 (229) Q Consensus 142 ~~aDPty~~fikanP~i-----~P~gag~sav~T~e~Ggqrl----------~F~yPLRdCHaC~ 191 (229) -.-||.-++|+.-+|.+ ||=|.|+|+ +|....-++ .|--||||-+.-. T Consensus 165 AtyD~~T~eFVl~TPt~ta~KWWPGgLG~ss--nhAvV~AqL~~~gk~~G~h~FiV~lRd~~th~ 227 (670) T KOG0136 165 ATYDPKTQEFVLNTPTLTATKWWPGGLGKSS--NHAVVVAQLITKGKCYGPHPFIVQLRDEDTHK 227 (670) T ss_pred EEECCCCCEEEECCCCCCEECCCCCCCCCCC--CHHEEEEEEEECCCCCCCCEEEEECCCCCCCC T ss_conf 2235776648845897430024687665533--20201245556461114540589723754467 No 26 >PRK09783 copper/silver efflux system membrane fusion protein CusB; Provisional Probab=34.33 E-value=40 Score=15.58 Aligned_cols=36 Identities=39% Similarity=0.606 Sum_probs=24.4 Q ss_pred CCEEEHHHHHHHHHHHH---HHHHHHC-C-------CCCCCEEEEEEE Q ss_conf 95010347889999999---9887403-5-------787421676521 Q gi|254780965|r 1 MKYKIAIIILLVLVGVI---LAFYFQQ-S-------STPQNHLLFFSE 37 (229) Q Consensus 1 ~~~~~~~~~~~~~~~~~---~~~~~~~-~-------~~~~~~~~~~se 37 (229) || +|+|||+.++.|-+ ..||+-+ + ++.....|+|.- T Consensus 1 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~er~~lyw~~ 47 (407) T PRK09783 1 MK-KIALIIGSMIAGGIISAAGFTWVAKAEPPAEKTSTAERKVLFWYD 47 (407) T ss_pred CC-EEHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEECC T ss_conf 95-010459999998877777565563278876677877764069868 No 27 >pfam11857 DUF3377 Domain of unknown function (DUF3377). This domain is functionally uncharacterized. This domain is found in eukaryotes. This presumed domain is about 70 amino acids in length. Probab=33.75 E-value=40 Score=15.52 Aligned_cols=16 Identities=38% Similarity=0.810 Sum_probs=11.8 Q ss_pred HHHHCCCCCCCEEEEEE Q ss_conf 87403578742167652 Q gi|254780965|r 20 FYFQQSSTPQNHLLFFS 36 (229) Q Consensus 20 ~~~~~~~~~~~~~~~~s 36 (229) |.|+...||+ |+|++- T Consensus 52 ~qfkrkgtpr-~llYck 67 (74) T pfam11857 52 VQFKRKGTPR-HLLYCK 67 (74) T ss_pred HHHEECCCCC-EEEEEE T ss_conf 2100059973-788873 No 28 >MTH00034 CYTB cytochrome b; Validated Probab=32.51 E-value=42 Score=15.40 Aligned_cols=53 Identities=17% Similarity=0.308 Sum_probs=35.1 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHHHH Q ss_conf 7889999999988740357874216765210001274567631300211013357899999973388889987 Q gi|254780965|r 8 IILLVLVGVILAFYFQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEALR 80 (229) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl~ 80 (229) ..+.++-|+.|++++.-+. +..|..-..-.||.|+. ++.|.||..||+-==+. T Consensus 42 l~iQiisGi~L~m~Y~p~~-------------------~~Af~Sv~~i~~dv~~G-wliR~~H~~gas~ff~~ 94 (379) T MTH00034 42 LIIQIITGLFLAMHYTADI-------------------SLAFSSVSHICRDVNYG-WLLRNIHANGASLFFIC 94 (379) T ss_pred HHHHHHHHHHHHHHHCCCC-------------------HHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHHHHH T ss_conf 9999999999997727990-------------------55999999998047514-99998886357999999 No 29 >MTH00046 CYTB cytochrome b; Validated Probab=32.47 E-value=42 Score=15.39 Aligned_cols=53 Identities=23% Similarity=0.319 Sum_probs=35.3 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHHHH Q ss_conf 7889999999988740357874216765210001274567631300211013357899999973388889987 Q gi|254780965|r 8 IILLVLVGVILAFYFQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEALR 80 (229) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl~ 80 (229) .++-++.|+.||+++. | |.+.-|..-..-.||-|+. -+.|.||+.|||-==+. T Consensus 32 l~iQiitGi~Lam~Y~----~---------------~~~~AF~Sv~~ImrdV~~G-Wl~R~~HanGAS~fF~~ 84 (355) T MTH00046 32 MVIQVLTGVLLSLLYV----A---------------DSNCSFFCVMNLSNDSFFT-WCVRYWHIWGVSVLFIL 84 (355) T ss_pred HHHHHHHHHHHHHHHC----C---------------CCHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHHHHH T ss_conf 9999999999997856----9---------------9067999999998457407-99998774155699999 No 30 >TIGR00403 ndhI NADH-plastoquinone oxidoreductase, I subunit; InterPro: IPR004497 NADH-plastoquinone oxidoreductase subunit I (1.6.5.3 from EC) catalyses the conversion of plastoquinone and NADH to plastoquinol and NAD(+). The enzyme binds two 4FE-4S clusters at iron-sulphur centres which are similar to those of the bacterial-type 4FE-4S ferredoxins.; GO: 0008137 NADH dehydrogenase (ubiquinone) activity, 0006118 electron transport. Probab=32.12 E-value=19 Score=17.54 Aligned_cols=42 Identities=26% Similarity=0.462 Sum_probs=33.3 Q ss_pred CCCHHHHHHHHHHHCCCCHHHHHHHHHCCCCCEEEEEEEEECCCCCE Q ss_conf 88889987999851057413777586417702589997553388860 Q gi|254780965|r 73 GGTQEALRAAQYLEKYLEPGYVSSYRKEALIGIVEVQYPYRANENSG 119 (229) Q Consensus 73 Gat~eAl~aa~~L~~~~e~gYV~sf~E~g~vdvatV~YPyRANeN~g 119 (229) +-.+++|+||+|... |.+.++..+.+.- +|++|||..--.+- T Consensus 11 ~y~~~~~~aa~yiGq----~~~~tl~h~~r~P-~t~~yPyek~i~se 52 (187) T TIGR00403 11 DYAQQAIQAARYIGQ----GLLVTLDHMNRLP-ITIQYPYEKLIPSE 52 (187) T ss_pred HHHHHHHHHHHHHCC----EEEEEEECCCCCC-EEEECCCHHCCCCC T ss_conf 778988988775122----0256642124775-26756700134431 No 31 >PRK13664 hypothetical protein; Provisional Probab=29.02 E-value=44 Score=15.30 Aligned_cols=16 Identities=44% Similarity=0.746 Sum_probs=12.1 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 4788999999998874 Q gi|254780965|r 7 IIILLVLVGVILAFYF 22 (229) Q Consensus 7 ~~~~~~~~~~~~~~~~ 22 (229) |.||++|||+++.-+- T Consensus 10 ilvli~lvG~l~n~iK 25 (62) T PRK13664 10 ILVLVFLVGVLLNVIK 25 (62) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 9999999999999899 No 32 >PRK01844 hypothetical protein; Provisional Probab=28.41 E-value=45 Score=15.25 Aligned_cols=59 Identities=15% Similarity=0.326 Sum_probs=37.4 Q ss_pred EHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 03478899999999887403578742167652100012745676313002110133578999999733888899879998 Q gi|254780965|r 5 IAIIILLVLVGVILAFYFQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEALRAAQY 84 (229) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl~aa~~ 84 (229) |.|+|+..++|+++.||+-... .+.|..- | --.+ .+-|+.-|.|-|-.|-.-..+|+ T Consensus 7 Ili~i~~li~G~~~GffiaRk~------------------m~~ylk~-N-PPIn---E~mir~Mm~QMGqKPSekki~Q~ 63 (72) T PRK01844 7 ILVGVVALVAGVALGFFIARKY------------------MMNYLQK-N-PPIN---EQMLKMMMMQMGQKPSQKKINQM 63 (72) T ss_pred HHHHHHHHHHHHHHHHHHHHHH------------------HHHHHHH-C-CCCC---HHHHHHHHHHHCCCCCHHHHHHH T ss_conf 9999999999999999999999------------------9998866-8-5999---99999999986899209999999 Q ss_pred HH Q ss_conf 51 Q gi|254780965|r 85 LE 86 (229) Q Consensus 85 L~ 86 (229) .. T Consensus 64 m~ 65 (72) T PRK01844 64 MS 65 (72) T ss_pred HH T ss_conf 99 No 33 >COG3672 Predicted transglutaminase-like cysteine proteinase [General function prediction only] Probab=28.22 E-value=50 Score=14.94 Aligned_cols=49 Identities=22% Similarity=0.265 Sum_probs=36.2 Q ss_pred CCCEEEEEEEEECCCCCCCHH---------------HHHHHEEEEECCCCCEEEEEEEEEEECCCC Q ss_conf 778078999962277533144---------------103110021078775100135666404776 Q gi|254780965|r 172 KDGLIELIFSYPLRSCHGCED---------------IGFMDIAYKFTTKGAFIGRKVFGIRNDNAK 222 (229) Q Consensus 172 e~Ggqrl~F~yPLRdCHaC~d---------------~G~a~iAyDFd~dGnFIG~kV~~V~~~~~~ 222 (229) |.=|++=+..||..+--+|+| .+.++|++--.++|-.| .|+-||.++-- T Consensus 92 e~~G~ed~Wa~P~~~~GDCEDyal~KRr~L~~~G~s~~~Llitvvr~~~gegH--aVLtVrTd~GD 155 (191) T COG3672 92 EIYGKEDYWAYPVTGAGDCEDYALEKRRRLMEAGVSSSALLITVVRKPDGEGH--AVLTVRTDRGD 155 (191) T ss_pred HHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCHHHEEEEEEECCCCCEE--EEEEEECCCCC T ss_conf 54175234568888866679999999999998599812478999976799613--89999868898 No 34 >COG3285 Predicted eukaryotic-type DNA primase [DNA replication, recombination, and repair] Probab=28.19 E-value=50 Score=14.94 Aligned_cols=34 Identities=24% Similarity=0.125 Sum_probs=22.2 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHCCCCCEEEEE Q ss_conf 8998799985105741377758641770258999 Q gi|254780965|r 76 QEALRAAQYLEKYLEPGYVSSYRKEALIGIVEVQ 109 (229) Q Consensus 76 ~eAl~aa~~L~~~~e~gYV~sf~E~g~vdvatV~ 109 (229) +=|-+.+++|+...-+.|.+++.++-+.|-+-+- T Consensus 188 ~fa~~va~~la~~~Pd~~tt~~~K~~R~grIFlD 221 (299) T COG3285 188 QFAKAVAQQLARAAPDLFTTTMGKENRGGRIFLD 221 (299) T ss_pred HHHHHHHHHHHHHCHHHHHHHHHHHHCCCCEEEE T ss_conf 9999999999876818788886476158708877 No 35 >COG0708 XthA Exonuclease III [DNA replication, recombination, and repair] Probab=25.83 E-value=55 Score=14.68 Aligned_cols=81 Identities=20% Similarity=0.105 Sum_probs=47.5 Q ss_pred EEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHCCCCCEEEEEEEEEC Q ss_conf 52100012745676313002110133578999999733888899879998510574137775864177025899975533 Q gi|254780965|r 35 FSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEALRAAQYLEKYLEPGYVSSYRKEALIGIVEVQYPYRA 114 (229) Q Consensus 35 ~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl~aa~~L~~~~e~gYV~sf~E~g~vdvatV~YPyRA 114 (229) ..+++|.-||.|.+.+-.-.+.-++. ..-....|-++|.-++-..| ++.||+-+||..---+=.=-=++||+ T Consensus 139 ~~~~~vl~GD~NIap~~iDv~~~~~~-----~~n~~~~~f~~eeR~~~~~l---l~~G~~D~~R~~~p~~~~YTwW~YR~ 210 (261) T COG0708 139 KGKPVVLCGDFNIAPEEIDVANPKKR-----WLNEGNSGFLPEERAWFRRL---LNAGFVDTFRLFHPEPEKYTWWDYRA 210 (261) T ss_pred CCCCEEEECCCCCCCCHHCCCCCHHH-----HHCCCCCCCCHHHHHHHHHH---HHCCHHHHHHHHCCCCCCCCCCCCCC T ss_conf 59987996663117853212572555-----40168889898999999999---87530645676489999635125621 Q ss_pred C---CCCEEEEE Q ss_conf 8---88603897 Q gi|254780965|r 115 N---ENSGTLLI 123 (229) Q Consensus 115 N---eN~gT~Lv 123 (229) | +|.|.++= T Consensus 211 ~~~~~n~G~RID 222 (261) T COG0708 211 NAARRNRGWRID 222 (261) T ss_pred CHHHHCCCEEEE T ss_conf 305663860688 No 36 >PRK10190 hypothetical protein; Provisional Probab=25.25 E-value=57 Score=14.61 Aligned_cols=36 Identities=28% Similarity=0.409 Sum_probs=19.5 Q ss_pred CCCCCCCHHHHHHHEE-----EEECCCCCEEEEEE-EEEEEC Q ss_conf 2775331441031100-----21078775100135-666404 Q gi|254780965|r 184 LRSCHGCEDIGFMDIA-----YKFTTKGAFIGRKV-FGIRND 219 (229) Q Consensus 184 LRdCHaC~d~G~a~iA-----yDFd~dGnFIG~kV-~~V~~~ 219 (229) +|--|+|--+---+|. -.-+++-.+|-+-| ++++.+ T Consensus 201 ~rvS~GCIRl~peDI~~Lf~~V~vGT~V~iInqPvK~~~~~d 242 (310) T PRK10190 201 LRVSQGCIRLRNDDIKYLFDNVPVGTRVQIIDQPVKYTTEPD 242 (310) T ss_pred CHHCCCCCCCCHHHHHHHHHCCCCCCEEEEEECCEEEEECCC T ss_conf 110687514487869999842999886999705779888079 No 37 >MTH00119 CYTB cytochrome b; Provisional Probab=24.51 E-value=59 Score=14.52 Aligned_cols=52 Identities=17% Similarity=0.302 Sum_probs=35.9 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHHH Q ss_conf 788999999998874035787421676521000127456763130021101335789999997338888998 Q gi|254780965|r 8 IILLVLVGVILAFYFQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEAL 79 (229) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl 79 (229) .++.++-|+.|++++.-+ .+..|..-..-.||-|+. -+.|.||..|||-==+ T Consensus 42 l~iQiitGi~Lam~Y~p~-------------------~~~AF~SV~~i~rdv~~G-WliR~~H~~gaS~fF~ 93 (380) T MTH00119 42 LITQILTGLFLAMHYTAD-------------------TSLAFSSVAHICRDVQYG-WLIRNLHANGASFFFI 93 (380) T ss_pred HHHHHHHHHHHHHHCCCC-------------------CHHHHHHHHHHHHHHHCH-HHHHHHHHCCCHHHHH T ss_conf 999999999999770899-------------------068999999998750047-9999878520159999 No 38 >cd05493 Bromo_ALL-1 Bromodomain, ALL-1 like proteins. ALL-1 is a vertebrate homologue of Drosophila trithorax and is often affected in chromosomal rearrangements that are linked to acute leukemias, such as acute lymphocytic leukemia (ALL). Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine. Probab=24.24 E-value=38 Score=15.71 Aligned_cols=66 Identities=14% Similarity=0.170 Sum_probs=46.4 Q ss_pred HHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCH----------HHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHC Q ss_conf 403578742167652100012745676313002----------110133578999999733888899879998510 Q gi|254780965|r 22 FQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRA----------HEFDENFDNCIITSIKKTGGTQEALRAAQYLEK 87 (229) Q Consensus 22 ~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a----------~~~d~~~~~Cl~~~m~~tGat~eAl~aa~~L~~ 87 (229) +.|+.+|++--.-.+++++-.+.+.++.+.-++ ..|-+++---|.++|+..|+-+|...|+.|.-. T Consensus 35 ~~~~~~~~~~~~~~~s~~~~~~~~p~dl~~V~kkl~~G~Y~Sv~~F~~DiV~Iiq~~~~~e~~~pe~~kans~~K~ 110 (131) T cd05493 35 RKSPVTPEVLQGLLSSEVKQEELPPLDLEAVGKKLEAGFYTSVLDFSDDIVKIIQAALNSEGGQPEIKKANSMAKS 110 (131) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH T ss_conf 2267889765000002345766896224899988626522049988999999999998730588734450279999 No 39 >TIGR00770 Dcu transporter, anaerobic C4-dicarboxylate uptake (Dcu) family; InterPro: IPR004668 These proteins are members of the C4-Dicarboxylate Uptake (Dcu) family. Most proteins in this family are predicted to have 12 GES predicted transmembrane regions; however the one member whose membrane topology has been experimentally determined has 10 transmembrane regions, with both the N- and C-termini localized to the periplasm . The DcuA and DcuB proteins are involved in the transport of aspartate, malate, fumarate and succinate in many species , , , and are thought to function as antiporters with any two of these substrates. Since DcuA is encoded in an operon with the gene for aspartase, and DcuB is encoded in an operon with the gene for fumarase, their physiological functions may be to catalyze aspartate:fumarate and fumarate:malate exchange during the anaerobic utilization of aspartate and fumarate, respectively . The Escherichia coli DcuA and DcuB proteins have very different expression patterns . DcuA is constitutively expressed; DcuB is strongly induced anaerobically by FNR and C4-dicarboxylates, while it is repressed by nitrate and subject to CRP-mediated catabolite repression.; GO: 0015556 C4-dicarboxylate transmembrane transporter activity, 0015740 C4-dicarboxylate transport, 0016021 integral to membrane. Probab=23.73 E-value=35 Score=15.93 Aligned_cols=16 Identities=13% Similarity=0.067 Sum_probs=12.7 Q ss_pred HHCCCHHHHHHHHHCC Q ss_conf 6303843799996088 Q gi|254780965|r 141 LYDSSPIAKDFVLRNP 156 (229) Q Consensus 141 ~~~aDPty~~fikanP 156 (229) -+|+||.||++++.-- T Consensus 195 ~L~~DP~Yq~Rl~~gk 210 (447) T TIGR00770 195 DLDDDPEYQKRLSEGK 210 (447) T ss_pred CCCCCHHHHHHHCCCC T ss_conf 8777867999851898 No 40 >KOG4606 consensus Probab=23.46 E-value=36 Score=15.80 Aligned_cols=33 Identities=30% Similarity=0.575 Sum_probs=24.8 Q ss_pred CEEEHHHHHHHHHHHHHHHHHHCCCCCCCEEEE Q ss_conf 501034788999999998874035787421676 Q gi|254780965|r 2 KYKIAIIILLVLVGVILAFYFQQSSTPQNHLLF 34 (229) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 34 (229) .+.|+++|.-|++|+|-|.||-....-|.--+| T Consensus 30 RWRi~lvi~svc~gaigawywl~dpet~~vp~~ 62 (126) T KOG4606 30 RWRIALVIFSVCTGAIGAWYWLIDPETQKVPFF 62 (126) T ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEHH T ss_conf 024679999999986325542027320564138 No 41 >PRK09814 hypothetical protein; Provisional Probab=23.22 E-value=25 Score=16.82 Aligned_cols=12 Identities=8% Similarity=0.022 Sum_probs=5.4 Q ss_pred CCCCCCCCCCCC Q ss_conf 884112676312 Q gi|254780965|r 156 PGVFPYSAGHFV 167 (229) Q Consensus 156 P~i~P~gag~sa 167 (229) +.+.=||++++. T Consensus 191 ~~l~lYG~~~~~ 202 (337) T PRK09814 191 IKLNVFGPGPSS 202 (337) T ss_pred CCEEEECCCCCC T ss_conf 877997888545 No 42 >TIGR02335 hydr_PhnA phosphonoacetate hydrolase; InterPro: IPR012710 This family consists of examples of phosphonoacetate hydrolase, an enzyme specific for the cleavage of the C-P bond in phosphonoacetate. Phosphonates are organic compounds with a direct C-P bond that is far less labile than the C-O-P bonds of phosphate attachment sites. Phosphonates may be degraded for phosphorus and energy by broad spectrum C-P lyase encoded by a large operon or by specific enzymes for some of the more common phosphonates in nature. This family represents an enzyme from the latter category. It may be found encoded near genes for phosphonate transport and for other specific phosphonatases .. Probab=22.65 E-value=64 Score=14.30 Aligned_cols=91 Identities=16% Similarity=0.215 Sum_probs=55.3 Q ss_pred HHHHHHHHCCCCCEEEEEEEEECCCCCEEEEEECCCC-----------------CEECHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 3777586417702589997553388860389752776-----------------21164351276763038437999960 Q gi|254780965|r 92 GYVSSYRKEALIGIVEVQYPYRANENSGTLLIPTVGS-----------------HIIDINDSSVHQLYDSSPIAKDFVLR 154 (229) Q Consensus 92 gYV~sf~E~g~vdvatV~YPyRANeN~gT~Lvpt~gs-----------------siIdv~d~~la~~~~aDPty~~fika 154 (229) -..-.+.+.|-+.++--+-|=..|+||+. |+|++| .||=++| |+.+|++-+-.+.-|| T Consensus 38 P~ia~l~~~G~v~tadcvvPsfTNpNN~S--iVTGaPPavHGI~GNyylD~dTgeEim~tD---A~~LR~~TiLg~MskA 112 (408) T TIGR02335 38 PAIAELTEAGYVATADCVVPSFTNPNNVS--IVTGAPPAVHGIGGNYYLDRDTGEEIMMTD---AKLLRADTILGEMSKA 112 (408) T ss_pred CHHHHHHCCCCEEEECCCCCCCCCCCCEE--EECCCCCCEECCCCCCCCCCCCCCCHHHHH---HHHHCCCCHHHHHHHC T ss_conf 15888750780355400127766888706--861888750045645222301141036667---7662212368998646 Q ss_pred CCCCCCCCCCCCCCCCCCCCEEEEEEEEECCC-CCCCHHHHHH Q ss_conf 88841126763123468778078999962277-5331441031 Q gi|254780965|r 155 NPGVFPYSAGHFVKSSHKDGLIELIFSYPLRS-CHGCEDIGFM 196 (229) Q Consensus 155 nP~i~P~gag~sav~T~e~Ggqrl~F~yPLRd-CHaC~d~G~a 196 (229) -- .-|+=|-||-+.. +.-|-|++ |---|-.-|. T Consensus 113 Gv--------~tAvvTAKDKLrk-vLgh~l~GIcFSsEkad~~ 146 (408) T TIGR02335 113 GV--------LTAVVTAKDKLRK-VLGHELKGICFSSEKADQV 146 (408) T ss_pred CC--------CEEEEECCHHHHH-HHHHCCCEEEECCCCCCCC T ss_conf 97--------3798850068877-7420266147426785624 No 43 >pfam02751 TFIIA_gamma_C Transcription initiation factor IIA, gamma subunit. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIA (TFIIA) is a multimeric protein which facilitates the binding of TFIID to the TATA box. The C-terminal domain of the gamma subunit is a 12 stranded beta-barrel. Probab=22.61 E-value=24 Score=16.96 Aligned_cols=21 Identities=33% Similarity=0.566 Sum_probs=17.2 Q ss_pred EEEECCCHHHHHCCCCHHHCC Q ss_conf 000127456763130021101 Q gi|254780965|r 38 KAVWKGDSETYFQCKRAHEFD 58 (229) Q Consensus 38 ~~vw~gd~~~~~q~~~a~~~d 58 (229) +.-.||.++|||-|.|--||= T Consensus 5 k~tfKG~L~TYRfCDnVWTFi 25 (51) T pfam02751 5 KLTFKGHLDTYRFCDNVWTFI 25 (51) T ss_pred EEEEEEEECCCEECCCEEEEE T ss_conf 899846311001015377899 No 44 >TIGR01318 gltD_gamma_fam glutamate synthase, small subunit; InterPro: IPR006006 One pathway for the assimilation of ammonia and glutamate biosynthesis involves glutamate synthase (1.4.1.13 from EC) which transfers the amide group of glutamine to 2-oxoglutarate to yield two molecules of glutamate.2 L-glutamate + NADP^+ = L-glutamine + 2-oxoglutarate + NADPH + H^+. This model describes both glutatate synthase small subunit and closely related paralogs of unknown function from a number of gamma and alpha subdivision Proteobacteria, including Escherichia coli. ; GO: 0016639 oxidoreductase activity acting on the CH-NH2 group of donors NAD or NADP as acceptor, 0006537 glutamate biosynthetic process. Probab=22.36 E-value=45 Score=15.26 Aligned_cols=12 Identities=25% Similarity=0.581 Sum_probs=9.4 Q ss_pred HHHHHHHHHCCC Q ss_conf 899999973388 Q gi|254780965|r 63 NCIITSIKKTGG 74 (229) Q Consensus 63 ~Cl~~~m~~tGa 74 (229) ||+|+|+|++-. T Consensus 306 DCvRTaiR~GA~ 317 (480) T TIGR01318 306 DCVRTAIRLGAK 317 (480) T ss_pred HHHHHHHHHCCC T ss_conf 788999981776 No 45 >pfam02288 Dehydratase_MU Dehydratase medium subunit. This family contains the medium subunit of the trimeric diol dehydratases and glycerol dehydratases. These enzymes are produced by some enterobacteria in response to growth substances. Probab=22.08 E-value=39 Score=15.65 Aligned_cols=35 Identities=26% Similarity=0.396 Sum_probs=27.1 Q ss_pred HHCCCCCCCEEEEEEEEEEECCCHHHHHC-CCCHHHCC Q ss_conf 40357874216765210001274567631-30021101 Q gi|254780965|r 22 FQQSSTPQNHLLFFSEKAVWKGDSETYFQ-CKRAHEFD 58 (229) Q Consensus 22 ~~~~~~~~~~~~~~se~~vw~gd~~~~~q-~~~a~~~d 58 (229) .|..-.|-|.|-.||.+-+- ++++||+ |+||+..- T Consensus 94 HQkDl~Pl~NLELFpqAPll--tletyR~iGkNAA~YA 129 (175) T pfam02288 94 HQRDLPPLSNLELFPQAPLL--TLETYRAIGKNAAKYA 129 (175) T ss_pred ECCCCCCCCCCCCCCCCCCC--CHHHHHHHHHHHHHHH T ss_conf 73777876432116788636--7899999988799985 No 46 >PRK10523 lipoprotein involved with copper homeostasis and adhesion; Provisional Probab=21.97 E-value=66 Score=14.21 Aligned_cols=18 Identities=28% Similarity=0.338 Sum_probs=9.4 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 347889999999988740 Q gi|254780965|r 6 AIIILLVLVGVILAFYFQ 23 (229) Q Consensus 6 ~~~~~~~~~~~~~~~~~~ 23 (229) +|+-++.-+|++-.+=-| T Consensus 5 ~~~~~~~a~g~~~l~GC~ 22 (234) T PRK10523 5 AILTALAAVGLFALMGCN 22 (234) T ss_pred HHHHHHHHHHHHHHHCCC T ss_conf 999999999999876346 No 47 >COG0158 Fbp Fructose-1,6-bisphosphatase [Carbohydrate transport and metabolism] Probab=21.77 E-value=66 Score=14.19 Aligned_cols=111 Identities=24% Similarity=0.347 Sum_probs=66.2 Q ss_pred HCCCCHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHCCCCCEEEEEEEEECCCCCEEEEE---EC Q ss_conf 313002110133578999999733888899879998510574137775864177025899975533888603897---52 Q gi|254780965|r 49 FQCKRAHEFDENFDNCIITSIKKTGGTQEALRAAQYLEKYLEPGYVSSYRKEALIGIVEVQYPYRANENSGTLLI---PT 125 (229) Q Consensus 49 ~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl~aa~~L~~~~e~gYV~sf~E~g~vdvatV~YPyRANeN~gT~Lv---pt 125 (229) .||+---..|---||-+.+.+++.|.- +-+. --+++-|+.-++|+|-|+| |- T Consensus 57 ~qGd~QkKlDv~an~~~~~~l~a~~~V------a~~a-------------------SEE~d~~v~~~~~~g~Y~V~~DPL 111 (326) T COG0158 57 VQGDTQKKLDVFANEILIEALKARGNV------AGIA-------------------SEEEDEPVTFPENNGSYAVAYDPL 111 (326) T ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCHH------HEEE-------------------CCCCCCCEECCCCCCEEEEEECCC T ss_conf 686065656777569999998703201------0100-------------------301147563178871389996788 Q ss_pred CCCCEECHHHHH--HHHHH---CCCHHHHHHHH-HCCCC----CCCCCCCCCCCCCCCCEEEEEEEEEC Q ss_conf 776211643512--76763---03843799996-08884----11267631234687780789999622 Q gi|254780965|r 126 VGSHIIDINDSS--VHQLY---DSSPIAKDFVL-RNPGV----FPYSAGHFVKSSHKDGLIELIFSYPL 184 (229) Q Consensus 126 ~gssiIdv~d~~--la~~~---~aDPty~~fik-anP~i----~P~gag~sav~T~e~Ggqrl~F~yPL 184 (229) -|||-||+|-+- |--|| .++|+-++|+. .+-++ .-||+.-.-|-|..+|-..|.+.--| T Consensus 112 DGSSNiDvNvsvGTIFsIy~~~~~~~t~~dflqpG~~qvAAGYv~YG~st~LV~t~g~gV~~ftld~~~ 180 (326) T COG0158 112 DGSSNIDVNVSVGTIFSIYRRPGSPGTEEDFLQPGNKQVAAGYVVYGPSTMLVYTLGEGVHGFTLDPSL 180 (326) T ss_pred CCCCCCCCCCCEEEEEEEEECCCCCCCHHHHCCCCHHHHHEEEEEECCCEEEEEEECCCEEEEEECCCC T ss_conf 886533568511578998755899764555417524221157788768248999927846789975888 No 48 >TIGR00759 aceE 2-oxo-acid dehydrogenase E1 component, homodimeric type; InterPro: IPR004660 Most members of this family are pyruvate dehydrogenase complex, E1 component. It includes a counterexample from Pseudomonas putida, MdeB, that is active as an E1 component of an alpha-ketoglutarate dehydrogenase complex rather than a pyruvate dehydrogenase complex. The second pyruvate dehydrogenase complex E1 protein from Alcaligenes eutrophus, PdhE, complements an aceE mutant of Escherichia coli but is not part of a pyruvate dehydrogenase complex operon, is more similar to the Pseudomonas putida MdeB than to E. coli AceE, and may have also have a different primary specificity. ; GO: 0016491 oxidoreductase activity. Probab=21.49 E-value=46 Score=15.20 Aligned_cols=74 Identities=28% Similarity=0.319 Sum_probs=46.7 Q ss_pred CCCHHHHHHHHHHHC-CCCH---HHHHHHHHCCCCCEEEEEEEEECCCCCEEEEEEC---CCCCEECHHHHHHHHHHC-- Q ss_conf 888899879998510-5741---3777586417702589997553388860389752---776211643512767630-- Q gi|254780965|r 73 GGTQEALRAAQYLEK-YLEP---GYVSSYRKEALIGIVEVQYPYRANENSGTLLIPT---VGSHIIDINDSSVHQLYD-- 143 (229) Q Consensus 73 Gat~eAl~aa~~L~~-~~e~---gYV~sf~E~g~vdvatV~YPyRANeN~gT~Lvpt---~gssiIdv~d~~la~~~~-- 143 (229) -+-.++++|+|+|.+ ||=. ==|+||.|+.+=+..+--. -+|=|+ .+|+ ||+.+. T Consensus 748 ~i~r~v~~Aa~lLa~DyGV~sDvwSvtSf~ELaRdg~dv~R~---------nlLHP~e~~~vsy--------VA~~L~~~ 810 (905) T TIGR00759 748 AIMREVIEAAELLAADYGVASDVWSVTSFTELARDGHDVERW---------NLLHPTETPRVSY--------VAQVLNEA 810 (905) T ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHH---------HCCCCCCCCCCCH--------HHHHHHCC T ss_conf 689999999999875178314552058756765678999886---------4258788887648--------98974127 Q ss_pred CCH--H--HHHHHHHCCCC-CCCCC Q ss_conf 384--3--79999608884-11267 Q gi|254780965|r 144 SSP--I--AKDFVLRNPGV-FPYSA 163 (229) Q Consensus 144 aDP--t--y~~fikanP~i-~P~ga 163 (229) +-| . .+|.+|+-|+- -||=| T Consensus 811 ~aPGv~iA~tDYvra~a~qir~~VP 835 (905) T TIGR00759 811 DAPGVVIASTDYVRAFAEQIRPYVP 835 (905) T ss_pred CCCCEEEEECCHHHHCHHHHHHCCC T ss_conf 9980699852212303576631278 No 49 >MTH00074 CYTB cytochrome b; Provisional Probab=21.43 E-value=67 Score=14.15 Aligned_cols=53 Identities=19% Similarity=0.318 Sum_probs=35.1 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHHHH Q ss_conf 7889999999988740357874216765210001274567631300211013357899999973388889987 Q gi|254780965|r 8 IILLVLVGVILAFYFQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEALR 80 (229) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl~ 80 (229) .+..++-|+.|++++..+ .+..|..-..-.||-|+. -+.|.||+.|||-==+. T Consensus 42 l~iQiitGi~L~m~Y~p~-------------------~~~AF~SV~~I~rdV~~G-WliR~~H~nGAS~fF~~ 94 (380) T MTH00074 42 LIAQILTGLFLAMHYTAD-------------------TSSAFSSVAHICRDVNYG-WLMRNIHANGASFFFIC 94 (380) T ss_pred HHHHHHHHHHHHHHHCCC-------------------HHHHHHHHHHHHCCCHHH-HHHHHHHHHHHHHHHHH T ss_conf 999999999999883898-------------------378999999998033008-99999897000799999 No 50 >COG0178 UvrA Excinuclease ATPase subunit [DNA replication, recombination, and repair] Probab=21.15 E-value=19 Score=17.53 Aligned_cols=71 Identities=21% Similarity=0.398 Sum_probs=38.2 Q ss_pred CEECHHHHHHHHHHCCCH-HHHHHHHHCCCC-CCCCCCCCCCCCCCCCEEEEEEEEECCC--CCCCHHHHHHHEEEEECC Q ss_conf 211643512767630384-379999608884-1126763123468778078999962277--533144103110021078 Q gi|254780965|r 129 HIIDINDSSVHQLYDSSP-IAKDFVLRNPGV-FPYSAGHFVKSSHKDGLIELIFSYPLRS--CHGCEDIGFMDIAYKFTT 204 (229) Q Consensus 129 siIdv~d~~la~~~~aDP-ty~~fikanP~i-~P~gag~sav~T~e~Ggqrl~F~yPLRd--CHaC~d~G~a~iAyDFd~ 204 (229) .+|+|+-+|+..+-|+.| ||-..-. .+ --|+.-++|++- |-.-=-|+|-..+ |.+|--.|...|...|.+ T Consensus 677 kvI~IdQsPIGRTpRSNPATYtg~Fd---~IR~lFA~tpeAK~r---Gyk~grFSFNvkGGRCe~C~GdG~ikIeM~FLp 750 (935) T COG0178 677 KVIDIDQSPIGRTPRSNPATYTGVFD---DIRELFAGTPEAKAR---GYKPGRFSFNVKGGRCEACQGDGVIKIEMHFLP 750 (935) T ss_pred EEEEECCCCCCCCCCCCCCCHHCCHH---HHHHHHHHCHHHHHC---CCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCC T ss_conf 07995388878998998630112437---899998209398973---998654432589967754637835999853688 Q ss_pred C Q ss_conf 7 Q gi|254780965|r 205 K 205 (229) Q Consensus 205 d 205 (229) | T Consensus 751 d 751 (935) T COG0178 751 D 751 (935) T ss_pred C T ss_conf 8 No 51 >MTH00131 CYTB cytochrome b; Provisional Probab=20.81 E-value=69 Score=14.07 Aligned_cols=52 Identities=21% Similarity=0.326 Sum_probs=35.9 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHHHHCCCCHHHCCCCHHHHHHHHHHHCCCCHHHH Q ss_conf 788999999998874035787421676521000127456763130021101335789999997338888998 Q gi|254780965|r 8 IILLVLVGVILAFYFQQSSTPQNHLLFFSEKAVWKGDSETYFQCKRAHEFDENFDNCIITSIKKTGGTQEAL 79 (229) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~se~~vw~gd~~~~~q~~~a~~~d~~~~~Cl~~~m~~tGat~eAl 79 (229) .++.++-|+.|++|+..++ +..|..-..-.||.|+. -+.|.||..|||-==+ T Consensus 41 l~iQiitGi~L~~~Y~p~~-------------------~~AF~Sv~~im~dv~~G-WliR~~H~~gaS~ff~ 92 (380) T MTH00131 41 LITQILTGLFLAMHYTSDI-------------------STAFSSVAHICRDVNYG-WLIRNMHANGASFFFI 92 (380) T ss_pred HHHHHHHHHHHHHHHCCCH-------------------HHHHHHHHHHHHHCCCH-HHHHHHHHCCHHHHHH T ss_conf 9999999999997717982-------------------68999999998434247-9999868501259999 No 52 >COG3889 Predicted solute binding protein [General function prediction only] Probab=20.53 E-value=64 Score=14.27 Aligned_cols=23 Identities=39% Similarity=0.622 Sum_probs=18.2 Q ss_pred CCEEEH-HHHHHHHHHHHHHHHHH Q ss_conf 950103-47889999999988740 Q gi|254780965|r 1 MKYKIA-IIILLVLVGVILAFYFQ 23 (229) Q Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~ 23 (229) ||-|.+ |+++||+||..+.=++. T Consensus 1 Mk~k~~~ILaiLvlv~llvtPl~~ 24 (872) T COG3889 1 MKKKLGKILAILVLVGLLVTPLTP 24 (872) T ss_pred CCCHHHHHHHHHHHHHHHHCCCCC T ss_conf 960478899999999998713665 No 53 >pfam10856 DUF2678 Protein of unknown function (DUF2678). This family of proteins has no known function. Probab=20.07 E-value=65 Score=14.24 Aligned_cols=17 Identities=47% Similarity=0.677 Sum_probs=14.7 Q ss_pred HHHHHHHHHHHHHHHCC Q ss_conf 88999999998874035 Q gi|254780965|r 9 ILLVLVGVILAFYFQQS 25 (229) Q Consensus 9 ~~~~~~~~~~~~~~~~~ 25 (229) .++|+|.+|.+|.|.|. T Consensus 35 ~llvlvTvisafvfp~~ 51 (114) T pfam10856 35 SLLVLVTVVSSFVFPTP 51 (114) T ss_pred HHHHHHHHHHHHHCCCC T ss_conf 99999999998615789 Done!