Query gi|254780347|ref|YP_003064760.1| riboflavin synthase subunit alpha [Candidatus Liberibacter asiaticus str. psy62] Match_columns 204 No_of_seqs 131 out of 2487 Neff 5.8 Searched_HMMs 39220 Date Sun May 29 15:09:45 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780347.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK09289 riboflavin synthase s 100.0 0 0 514.3 19.4 198 1-204 1-198 (198) 2 PRK13020 riboflavin synthase s 100.0 0 0 505.2 18.8 197 1-202 1-202 (207) 3 COG0307 RibC Riboflavin syntha 100.0 0 0 494.7 17.2 195 1-203 1-197 (204) 4 TIGR00187 ribE riboflavin synt 100.0 0 0 484.5 12.5 196 1-201 1-216 (225) 5 KOG3310 consensus 100.0 0 0 364.6 12.3 195 1-200 1-199 (210) 6 pfam00677 Lum_binding Lumazine 99.9 1.3E-27 3.3E-32 182.1 6.7 85 104-188 1-86 (86) 7 pfam00677 Lum_binding Lumazine 99.9 3.4E-23 8.7E-28 156.2 9.9 83 3-90 1-85 (86) 8 PRK13020 riboflavin synthase s 99.8 1.2E-20 3E-25 141.4 6.4 85 103-187 2-86 (207) 9 PRK09289 riboflavin synthase s 99.8 2.4E-20 6E-25 139.6 6.5 86 102-188 1-86 (198) 10 TIGR00187 ribE riboflavin synt 99.8 1.4E-20 3.5E-25 141.0 4.6 88 102-189 1-95 (225) 11 COG0307 RibC Riboflavin syntha 99.8 8.5E-19 2.2E-23 130.5 6.6 84 3-92 97-183 (204) 12 KOG3310 consensus 99.5 1.4E-14 3.6E-19 105.8 4.7 87 102-189 1-88 (210) 13 pfam07944 DUF1680 Putative gly 78.0 5.2 0.00013 20.6 4.8 60 119-194 429-488 (511) 14 pfam06597 Clostridium_P47 Clos 76.2 6.3 0.00016 20.1 4.8 77 14-102 50-134 (413) 15 PRK00054 dihydroorotate dehydr 59.2 15 0.00039 17.8 5.5 175 7-201 4-201 (248) 16 pfam11948 DUF3465 Protein of u 48.6 23 0.00059 16.8 4.4 13 6-18 39-51 (131) 17 pfam06534 RGM_C Repulsive guid 47.5 7.2 0.00018 19.8 0.7 60 90-149 34-95 (213) 18 KOG3380 consensus 44.6 12 0.00032 18.4 1.5 17 186-202 102-118 (152) 19 cd01247 PH_GPBP Goodpasture an 41.9 29 0.00075 16.2 3.4 27 139-165 40-66 (91) 20 KOG2415 consensus 37.0 19 0.00049 17.3 1.5 52 95-146 114-167 (621) 21 pfam00970 FAD_binding_6 Oxidor 29.8 46 0.0012 15.0 5.4 78 11-91 3-93 (99) 22 KOG1653 consensus 27.6 43 0.0011 15.2 2.0 97 84-191 41-138 (175) 23 pfam04699 P16-Arc ARP2/3 compl 26.4 33 0.00085 15.9 1.3 16 186-201 103-118 (153) 24 TIGR00739 yajC preprotein tran 25.0 57 0.0014 14.5 3.8 38 31-71 34-72 (86) 25 COG1661 Predicted DNA-binding 24.0 59 0.0015 14.4 3.3 82 39-123 38-125 (141) 26 TIGR00038 efp translation elon 23.9 41 0.001 15.4 1.4 120 32-166 7-134 (194) 27 PRK10476 multidrug resistance 23.0 62 0.0016 14.3 3.3 73 24-111 211-287 (348) 28 TIGR01343 hacA_fam homoaconita 22.6 29 0.00073 16.3 0.4 34 142-175 182-215 (432) 29 cd06219 DHOD_e_trans_like1 FAD 20.4 70 0.0018 14.0 6.8 181 11-200 2-201 (248) No 1 >PRK09289 riboflavin synthase subunit alpha; Provisional Probab=100.00 E-value=0 Score=514.31 Aligned_cols=198 Identities=44% Similarity=0.696 Sum_probs=191.9 Q ss_pred CCCEEEEEEEEEEEEEECCCCEEEEEEECCCCCCCCCCCEEEECCEEEEEEEECCCCCCCCEEEEEEEHHHHHHCCCCCC Q ss_conf 94533711899999987699359999817551126677246533315423650455566867999986565532013555 Q gi|254780347|r 1 MFTGIVTDIGKIIAMTPIAKGMRLRVMTSYNTSKMKLGCSIAHAGICLTVVRLSEEHSVDNWYEVEVWAETNRITNIASW 80 (204) Q Consensus 1 MFtGIV~~~G~V~~i~~~~~~~~l~i~~~~~~~~l~~G~SIavnGvCLTV~~i~~~~~~~~~f~~~~~~ETl~~T~l~~l 80 (204) |||||||++|+|.++++.+++.++.|+++..++++++|||||||||||||++++ +++|+|++++||+++|||+.| T Consensus 1 MFTGIIe~~G~i~~i~~~~~~~~l~I~~~~~~~~~~iG~SIavnGvCLTV~~~~-----~~~~~fdv~~ETl~~T~l~~l 75 (198) T PRK09289 1 MFTGIVEEVGTVVSIEEKGGGRRLTIEAPKLLSDLKLGDSIAVNGVCLTVTEID-----GDGFTVDVSPETLRRTNLGDL 75 (198) T ss_pred CCCEECCEEEEEEEEEECCCCEEEEEECCCCCCCCCCCCEEEECCEEEEEEECC-----CCEEEEEEEHHHHHHCCCCCC T ss_conf 988543548999999976997899998086435676788799888998889868-----988999989788333137777 Q ss_pred CCCCCCCCCCCCHHHHCCCCCEEEECCCCHHHHHHHCCCCCEEEEEEECCCCCCCEECCCCEEEEEEEEEEEEEECCCEE Q ss_conf 65663133543200000476424540100110111002331379875134211210023571998234578774168549 Q gi|254780347|r 81 GIGTFINLERSVKLGDRLDGHLVSGHIDGTVEILFLDFIGDSMYCRLSLPHNLEQFIAVKGSVCLNGVSLTVNLVGKNFF 160 (204) Q Consensus 81 ~~G~~VNLE~a~~~g~~lgGH~v~GhVd~~~~i~~i~~~~~~~~~~i~~~~~~~~~l~~KgsI~idGvSLTV~~~~~~~f 160 (204) ++|++|||||||++|||+|||+|||||||+|+|.++++.++++.++|++|+.+++|+++|||||+|||||||++++ ++| T Consensus 76 ~~G~~VNLErsl~~gd~lgGH~VsGHVD~~~~i~~i~~~~~~~~~~i~~p~~~~~yiv~KGsIaidGvSLTV~~v~-~~F 154 (198) T PRK09289 76 KVGDRVNLERALRLGDRLGGHIVSGHVDGTGEIVSIEKSGNSVEFRIKAPAELAKYIVEKGSIAVDGVSLTVNEVD-GEF 154 (198) T ss_pred CCCCEEEECCCCCCCCEECCCEEEEEECEEEEEEEEEECCCCEEEEEECCHHHHCCCCCCCEEEEEEEEEEEEEEC-CEE T ss_conf 7796671055755886206713238713489999999848829999992979963777776699961699978437-889 Q ss_pred EEEEEHHHHCCCCHHHCCCCCEEEEEHHHHHHHHHHHHCCCCCC Q ss_conf 99984443301330024899674697135889999986023359 Q gi|254780347|r 161 DVLLIRHTIEETTWKMHKVGDLINIEVDSMMRYIARLYALNTSQ 204 (204) Q Consensus 161 ~v~lIp~Tl~~T~l~~~k~Gd~VNiE~D~l~kyv~r~~~~~~~~ 204 (204) +|+||||||++|||+++++||+||||+|++|||++|+++.+.++ T Consensus 155 ~v~lIP~Tl~~T~l~~~~~Gd~VNiE~D~lakyv~r~l~~~~~~ 198 (198) T PRK09289 155 SVNLIPHTLEVTTLGELKVGDRVNLEIDLLAKYVERLLAARKNA 198 (198) T ss_pred EEEEEHHHHHHCCCCCCCCCCEEEEEEEHHHHHHHHHHHHCCCC T ss_conf 99995455654724448689988995862999999986443079 No 2 >PRK13020 riboflavin synthase subunit alpha; Provisional Probab=100.00 E-value=0 Score=505.18 Aligned_cols=197 Identities=34% Similarity=0.559 Sum_probs=190.4 Q ss_pred CCCEEEEEEEEEEEEEECCCCEEEEEEECCC-CCCCCCCCEEEECCEEEEEEEECCCCCCCCEEEEEEEHHHHHHCCCCC Q ss_conf 9453371189999998769935999981755-112667724653331542365045556686799998656553201355 Q gi|254780347|r 1 MFTGIVTDIGKIIAMTPIAKGMRLRVMTSYN-TSKMKLGCSIAHAGICLTVVRLSEEHSVDNWYEVEVWAETNRITNIAS 79 (204) Q Consensus 1 MFtGIV~~~G~V~~i~~~~~~~~l~i~~~~~-~~~l~~G~SIavnGvCLTV~~i~~~~~~~~~f~~~~~~ETl~~T~l~~ 79 (204) |||||||++|+|.++++.+++.++.|.++.. ++++++|||||||||||||++++ ++.|+|++++||+++|||+. T Consensus 1 MFTGIIe~~G~I~~i~~~~~~~~l~I~~~~~~~~~l~iGdSIavnGvCLTV~~~~-----~~~~~fdv~~ETl~~T~l~~ 75 (207) T PRK13020 1 MFTGIVQAVAEVVAIHKKDGLNTLEIAFPPDLLEGLEIGASVAVNGVCLTVTKIE-----GDRVFFDVMEETLRLTNLAN 75 (207) T ss_pred CCCEECCEEEEEEEEEECCCCEEEEEECCCHHHCCCCCCCEEEECCEEEEEEECC-----CCEEEEEEEHHHHHHCCHHH T ss_conf 9885516389999999769968999986822413577898799888898889748-----98599998789631252313 Q ss_pred CCCCCCCCCCCCCHHHHCCCCCEEEECCCCHHHHHHHCCCCCEEEEEEECCCCCCCEECCCCEEEEEEEEEEEEEECCCE Q ss_conf 56566313354320000047642454010011011100233137987513421121002357199823457877416854 Q gi|254780347|r 80 WGIGTFINLERSVKLGDRLDGHLVSGHIDGTVEILFLDFIGDSMYCRLSLPHNLEQFIAVKGSVCLNGVSLTVNLVGKNF 159 (204) Q Consensus 80 l~~G~~VNLE~a~~~g~~lgGH~v~GhVd~~~~i~~i~~~~~~~~~~i~~~~~~~~~l~~KgsI~idGvSLTV~~~~~~~ 159 (204) |++|++|||||||++|||+|||+|||||||+|+|.++++.++++.++|++|+.+++|+++|||||+|||||||+++.+++ T Consensus 76 ~~~G~~VNLEral~~gdrlgGH~V~GHVD~~~~i~~i~~~~~~~~~~i~~p~~~~~~iv~KGSIaidGVSLTV~~v~~~~ 155 (207) T PRK13020 76 LRVGDRVNIERSAKFGAEIGGHILSGHVDTTAEVVEISETEENYDIRFRVPPEWMKYIFAKGFIGVNGCSLTVGDVEEGW 155 (207) T ss_pred CCCCCEEEECCCCCCCCEECCCEEEEECCEEEEEEEEEECCCCEEEEEECCHHHHCCCCCCCEEEEEEEEEEEEEECCCE T ss_conf 23598760044756876517816789800599999999848938999955979953667687799924899998773998 Q ss_pred EEEEEEHHHHCCCCHHHCCCCCEEEEEHH----HHHHHHHHHHCCCC Q ss_conf 99998444330133002489967469713----58899999860233 Q gi|254780347|r 160 FDVLLIRHTIEETTWKMHKVGDLINIEVD----SMMRYIARLYALNT 202 (204) Q Consensus 160 f~v~lIp~Tl~~T~l~~~k~Gd~VNiE~D----~l~kyv~r~~~~~~ 202 (204) |+|+||||||++|||+++++||+||||+| +++||+||+|+.+- T Consensus 156 f~V~lIPhTl~~T~l~~~k~Gd~VNiE~D~~~~~i~k~ver~l~~~~ 202 (207) T PRK13020 156 FEVHLIPETLRATNLGAKKVGDKVNIEIDSQTQVIVDTVERVLAARL 202 (207) T ss_pred EEEEEEEHHHHHCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHH T ss_conf 99999511354384434868998998385883489999999987543 No 3 >COG0307 RibC Riboflavin synthase alpha chain [Coenzyme metabolism] Probab=100.00 E-value=0 Score=494.66 Aligned_cols=195 Identities=45% Similarity=0.721 Sum_probs=184.1 Q ss_pred CCCEEEEEEEEEEEEEECCCCEEEEEEE-CCCCCCCCCCCEEEECCEEEEEEEECCCCCCCCEEEEEEEHHHHHHCCCCC Q ss_conf 9453371189999998769935999981-755112667724653331542365045556686799998656553201355 Q gi|254780347|r 1 MFTGIVTDIGKIIAMTPIAKGMRLRVMT-SYNTSKMKLGCSIAHAGICLTVVRLSEEHSVDNWYEVEVWAETNRITNIAS 79 (204) Q Consensus 1 MFtGIV~~~G~V~~i~~~~~~~~l~i~~-~~~~~~l~~G~SIavnGvCLTV~~i~~~~~~~~~f~~~~~~ETl~~T~l~~ 79 (204) |||||||++|+|.... ++.++.+.. +..+.++++|||||||||||||+++++ ++|+||+|||||++|||+. T Consensus 1 MFTGIIe~iG~V~~~~---~~~~~~i~~~~~~~~d~~lGdSIAvnGvCLTVt~~~~-----~~f~~dv~~ETl~~TnL~~ 72 (204) T COG0307 1 MFTGIVEEIGKVKKVE---NGITLTIESAKLILEDVKLGDSIAVNGVCLTVTEFNE-----DGFSVDVMPETLRRTNLGD 72 (204) T ss_pred CCCEEEEEEEEEEECC---CCEEEEEEECCCCCCCCCCCCEEEECCEEEEEEEECC-----CCEEEEECHHHHHHCCHHH T ss_conf 9653310369998504---8569999504545566756774788989899999889-----9699995389964330101 Q ss_pred CCCCCCCCCCCCCHHHHCCCCCEEEECCCCHHHHHHHCCCCCEEEEEEE-CCCCCCCEECCCCEEEEEEEEEEEEEECCC Q ss_conf 5656631335432000004764245401001101110023313798751-342112100235719982345787741685 Q gi|254780347|r 80 WGIGTFINLERSVKLGDRLDGHLVSGHIDGTVEILFLDFIGDSMYCRLS-LPHNLEQFIAVKGSVCLNGVSLTVNLVGKN 158 (204) Q Consensus 80 l~~G~~VNLE~a~~~g~~lgGH~v~GhVd~~~~i~~i~~~~~~~~~~i~-~~~~~~~~l~~KgsI~idGvSLTV~~~~~~ 158 (204) |+.|++|||||||++|+|+|||+|||||||+|+|.++++.++++.++|. .|+.+++|+++|||||+||+||||+++.++ T Consensus 73 ~~~G~~VNLERAl~~~~r~GGH~VsGHVDg~g~I~~i~~~~na~~~~~~~~~~~l~kyiv~KGsIavDGiSLTV~~v~~~ 152 (204) T COG0307 73 LKVGDKVNLERALKLGDRLGGHLVSGHVDGTGEIVKIEKDGNAVRFWFKAPPEELAKYIVEKGSIAVDGISLTVNEVEDD 152 (204) T ss_pred CCCCCEEEEEHHCCCCCCCCCEEEEEEEEEEEEEEEEEECCCEEEEEEECCCHHHHHHHCCCCEEEEEEEEEEEEEECCC T ss_conf 66688885104133798353178876770089999998748709999975998998552046449983189999977287 Q ss_pred EEEEEEEHHHHCCCCHHHCCCCCEEEEEHHHHHHHHHHHHCCCCC Q ss_conf 499998444330133002489967469713588999998602335 Q gi|254780347|r 159 FFDVLLIRHTIEETTWKMHKVGDLINIEVDSMMRYIARLYALNTS 203 (204) Q Consensus 159 ~f~v~lIp~Tl~~T~l~~~k~Gd~VNiE~D~l~kyv~r~~~~~~~ 203 (204) +|+|+|||||+++|+|+.+++||+||||+|+++||+||+++.... T Consensus 153 ~F~V~lIPhT~~~T~l~~~~~Gd~VNiE~D~l~kyver~l~~~~~ 197 (204) T COG0307 153 TFSVSLIPHTLERTTLGEKKVGDRVNIEIDVLAKYVERLLAAGKA 197 (204) T ss_pred EEEEEEEECCHHHCCHHHCCCCCEEEEEECHHHHHHHHHHHHCCC T ss_conf 899999736043300300668998899681799999999863213 No 4 >TIGR00187 ribE riboflavin synthase, alpha subunit; InterPro: IPR001783 The following proteins have been shown , to be structurally and evolutionary related: Riboflavin synthase alpha chain (2.5.1.9 from EC) (RS-alpha) (gene ribC in Escherichia coli, ribB in Bacillus subtilis and Photobacterium leiognathi, RIB5 in yeast. This enzyme synthesises riboflavin from two moles of 6,7- dimethyl-8-(1'-D-ribityl)lumazine (Lum), a pteridine-derivative. Photobacterium phosphoreum lumazine protein (LumP) (gene luxL). LumP is a protein that modulates the colour of the bioluminescence emission of bacterial luciferase. In the presence of LumP, light emission is shifted to higher energy values (shorter wavelength). LumP binds non-covalently to 6,7-dimethyl-8-(1'-D-ribityl)lumazine. Vibrio fischeri yellow fluorescent protein (YFP) (gene luxY). Like LumP, YFP modulates light emission but towards a longer wavelength. YFP binds non-covalently to FMN. These proteins seem to have evolved from the duplication of a domain of about 100 residues. In its C-terminal section, this domain contains a conserved motif [KR]-V-N-[LI]-E which has been proposed to be the binding site for lumazine (Lum) and some of its derivatives. RS-alpha which binds two molecules of Lum has two perfect copies of this motif, while LumP which binds one molecule of Lum, has a Glu instead of Lys/Arg in the first position of the second copy of the motif. Similarly, YFP, which binds to one molecule of FMN, also seems to have a potentially dysfunctional binding site by substitution of Gly for Glu in the last position of the first copy of the motif. . Probab=100.00 E-value=0 Score=484.46 Aligned_cols=196 Identities=36% Similarity=0.590 Sum_probs=186.8 Q ss_pred CCCEEEEEEEEEEEEEECCCC------EEEEEEEC-CCCCCCCCCCEEEECCEEEEEEEECCCCCCCCEEEEEEEHHHHH Q ss_conf 945337118999999876993------59999817-55112667724653331542365045556686799998656553 Q gi|254780347|r 1 MFTGIVTDIGKIIAMTPIAKG------MRLRVMTS-YNTSKMKLGCSIAHAGICLTVVRLSEEHSVDNWYEVEVWAETNR 73 (204) Q Consensus 1 MFtGIV~~~G~V~~i~~~~~~------~~l~i~~~-~~~~~l~~G~SIavnGvCLTV~~i~~~~~~~~~f~~~~~~ETl~ 73 (204) |||||||.+|+|.++.+.... ..+.+..+ ..+.|+++|||||+|||||||++++. +.|.||+||||++ T Consensus 1 MFTGIi~~~g~~~s~~~~~~~esGGqG~~~~~~~~~~~L~D~~lGdSIA~NGvCLTV~e~~~-----~~f~vd~s~ET~~ 75 (225) T TIGR00187 1 MFTGIIQGTGKVKSIKEKRLFESGGQGISLTINLAQKILEDLHLGDSIAVNGVCLTVTEINK-----DHFSVDLSPETLK 75 (225) T ss_pred CCCEEECCCEEEEECCCCCCCCCCCCCEEECCCCCHHHHCCCCCCCEEEECCEEEEEEEECC-----CEEEEEECHHHHH T ss_conf 96302023023320013322012785047634872345402520454123674332221128-----8689983256733 Q ss_pred HCCCCCCCCC-CCCCCCCCCHHHHCCCCCEEEECCCCHHHHHHHCCCCCEEEEEEEC-CCC-------CCCEECCCCEEE Q ss_conf 2013555656-6313354320000047642454010011011100233137987513-421-------121002357199 Q gi|254780347|r 74 ITNIASWGIG-TFINLERSVKLGDRLDGHLVSGHIDGTVEILFLDFIGDSMYCRLSL-PHN-------LEQFIAVKGSVC 144 (204) Q Consensus 74 ~T~l~~l~~G-~~VNLE~a~~~g~~lgGH~v~GhVd~~~~i~~i~~~~~~~~~~i~~-~~~-------~~~~l~~KgsI~ 144 (204) +|||++|+.| +.|||||||++++|+|||||||||||+|+|.++++.+|++.++|++ ++. ++||+++||||| T Consensus 76 ~t~L~~~k~gr~~VNiErA~~~~~~~GGHfvsGHvd~~a~i~~i~~~~N~~~~~f~~~~~~qkqWqey~~ky~v~KGsIa 155 (225) T TIGR00187 76 RTNLGDLKVGRTKVNIERALKADGRIGGHFVSGHVDGVAEIEKIEESENAVQFWFKLRDSEQKQWQEYLLKYIVEKGSIA 155 (225) T ss_pred HHHHHHHHHCCCEECEEHHHHCCCEECCEEEEEEEEEEEEEEEHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHCCCCEE T ss_conf 54233355236720110674169741666751116665522300301582899984058134678887687642477579 Q ss_pred EEEEEEEEEEECCCEEE----EEEEHHHHCCCCHHHCCCCCEEEEEHHHHHHHHHHHHCCC Q ss_conf 82345787741685499----9984443301330024899674697135889999986023 Q gi|254780347|r 145 LNGVSLTVNLVGKNFFD----VLLIRHTIEETTWKMHKVGDLINIEVDSMMRYIARLYALN 201 (204) Q Consensus 145 idGvSLTV~~~~~~~f~----v~lIp~Tl~~T~l~~~k~Gd~VNiE~D~l~kyv~r~~~~~ 201 (204) ||||||||.++.+++|. |+|||||+++|+|+.+|+||+||||+||++||++|.+..- T Consensus 156 vDG~SLTi~~v~e~~f~Gafyv~lIP~T~~~T~~~~~k~G~~VNIE~D~~gk~~~~~l~~~ 216 (225) T TIGR00187 156 VDGISLTIGKVTEDSFCGAFYVSLIPHTLEETILGLKKIGDRVNIEIDMLGKYVARFLEKT 216 (225) T ss_pred ECCEEEEEEEECCCCEEEEEEEEECCCCHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHH T ss_conf 9667876577413603315899863101222210111768776542085789999999987 No 5 >KOG3310 consensus Probab=100.00 E-value=0 Score=364.65 Aligned_cols=195 Identities=34% Similarity=0.547 Sum_probs=185.1 Q ss_pred CCCEEEEEEEEEEEEEE-CCCCEEEEEEECCCCCCCCCCCEEEECCEEEEEEEECCCCCCCCEEEEEEEHHHHHHCCCCC Q ss_conf 94533711899999987-69935999981755112667724653331542365045556686799998656553201355 Q gi|254780347|r 1 MFTGIVTDIGKIIAMTP-IAKGMRLRVMTSYNTSKMKLGCSIAHAGICLTVVRLSEEHSVDNWYEVEVWAETNRITNIAS 79 (204) Q Consensus 1 MFtGIV~~~G~V~~i~~-~~~~~~l~i~~~~~~~~l~~G~SIavnGvCLTV~~i~~~~~~~~~f~~~~~~ETl~~T~l~~ 79 (204) ||||||+..|.|+.... .++|..+.|.++..+.|.+.|||||+||+||||++|+.. .|.+-+.|||+++||+++ T Consensus 1 mftgive~mG~Vkd~~~~dd~Gf~~kI~A~~iL~Dch~GDSIAVNGtCLTVTeFn~~-----~FtVGiaPEtlr~tnl~~ 75 (210) T KOG3310 1 MFTGIVECMGEVKDLGMADDGGFDLKIGARVILEDCHLGDSIAVNGTCLTVTEFNAE-----EFTVGIAPETLRKTNLEE 75 (210) T ss_pred CCCCHHHHHEEHHHCCCCCCCCEEEEECCCEEEEECCCCCEEEECCEEEEEEEECCC-----CEEEECCHHHHHHCCHHH T ss_conf 941057221132304765468788896687355312357727883479988860542-----058725879963256988 Q ss_pred CCCCCCCCCCCCCHHHHCCCCCEEEECCCCHHHHHHHCCCCCEEEEEEEC-CCCCCCEECCCCEEEEEEEEEEEEEEC-- Q ss_conf 56566313354320000047642454010011011100233137987513-421121002357199823457877416-- Q gi|254780347|r 80 WGIGTFINLERSVKLGDRLDGHLVSGHIDGTVEILFLDFIGDSMYCRLSL-PHNLEQFIAVKGSVCLNGVSLTVNLVG-- 156 (204) Q Consensus 80 l~~G~~VNLE~a~~~g~~lgGH~v~GhVd~~~~i~~i~~~~~~~~~~i~~-~~~~~~~l~~KgsI~idGvSLTV~~~~-- 156 (204) |+.|++||||||+..-.|+|||+||||||++|.|.+.+..+++.++.|++ ++.+.||+++||+||+||-||||.++. T Consensus 76 ~k~G~pVNLERAv~~~~RmGGH~VQGHVDtva~Ivs~~~eG~si~f~f~~rD~~~lKYIV~KGfiavDGTSLTi~~Vd~~ 155 (210) T KOG3310 76 LKKGSPVNLERAVQPVSRMGGHVVQGHVDTVAVIVSMEVEGDSIWFKFKLRDKGLLKYIVPKGFIAVDGTSLTIVDVDDE 155 (210) T ss_pred HHCCCCCCHHHHCCCCCCCCCEEEEEEECCEEEEEEECCCCCEEEEEEEECCCCCEEEEECCCEEEECCCEEEEEEECCC T ss_conf 73599625656406122005547876603237899861369779999885682505788115479972736899997688 Q ss_pred CCEEEEEEEHHHHCCCCHHHCCCCCEEEEEHHHHHHHHHHHHCC Q ss_conf 85499998444330133002489967469713588999998602 Q gi|254780347|r 157 KNFFDVLLIRHTIEETTWKMHKVGDLINIEVDSMMRYIARLYAL 200 (204) Q Consensus 157 ~~~f~v~lIp~Tl~~T~l~~~k~Gd~VNiE~D~l~kyv~r~~~~ 200 (204) +.+|.+..|.||.++-.+..+|+||+||+|.|+++||+||++.. T Consensus 156 ~s~F~imMI~yTQ~nVimp~KkiGd~VN~EVD~~GKy~Ekl~~~ 199 (210) T KOG3310 156 ESCFNIMMIAYTQQNVIMPTKKIGDKVNLEVDIMGKYVEKLLTS 199 (210) T ss_pred CCEEEEEEEEECCCCEEEECHHCCCEEEEEEEHHHHHHHHHHHH T ss_conf 87589998410234278750142864668982676799999998 No 6 >pfam00677 Lum_binding Lumazine binding domain. This domain binds to derivatives of lumazine in some proteins. Some proteins have lost the residues involved in binding lumazine. Probab=99.94 E-value=1.3e-27 Score=182.06 Aligned_cols=85 Identities=35% Similarity=0.480 Sum_probs=82.2 Q ss_pred EECCCCHHHHHHHCCCCCEEEEEEECCCC-CCCEECCCCEEEEEEEEEEEEEECCCEEEEEEEHHHHCCCCHHHCCCCCE Q ss_conf 54010011011100233137987513421-12100235719982345787741685499998444330133002489967 Q gi|254780347|r 104 SGHIDGTVEILFLDFIGDSMYCRLSLPHN-LEQFIAVKGSVCLNGVSLTVNLVGKNFFDVLLIRHTIEETTWKMHKVGDL 182 (204) Q Consensus 104 ~GhVd~~~~i~~i~~~~~~~~~~i~~~~~-~~~~l~~KgsI~idGvSLTV~~~~~~~f~v~lIp~Tl~~T~l~~~k~Gd~ 182 (204) +||||++|+|.++++.++++.+++++|+. +.+|++.|||||+|||||||+++++++|++++||||+++|+|+++++||+ T Consensus 1 TGiV~~~g~V~~i~~~~~~~~~~i~~~~~~~~~~~~~~~SIavnGvcLTV~~~~~~~~~~~ii~eTl~~T~l~~~k~G~~ 80 (86) T pfam00677 1 TGIVEGVGEIAEIIENGNARSIWIKLPDKMLTKVLIGKDSIAVNGISLTVTEVTGNQFTFDLIPETLKRTNLGSLKVGDK 80 (86) T ss_pred CCCEEEEEEEEEEEECCCCEEEEEEECCHHHHHCCCCCCEEEECCEEEEEEEECCCEEEEEEEHHHHHHCCCCCCCCCCE T ss_conf 96882899999999979969999993817854202368879985589999998799999999899984383534877999 Q ss_pred EEEEHH Q ss_conf 469713 Q gi|254780347|r 183 INIEVD 188 (204) Q Consensus 183 VNiE~D 188 (204) ||||+| T Consensus 81 VNlE~D 86 (86) T pfam00677 81 VNLEID 86 (86) T ss_pred EEEEEC T ss_conf 996589 No 7 >pfam00677 Lum_binding Lumazine binding domain. This domain binds to derivatives of lumazine in some proteins. Some proteins have lost the residues involved in binding lumazine. Probab=99.89 E-value=3.4e-23 Score=156.22 Aligned_cols=83 Identities=30% Similarity=0.388 Sum_probs=75.4 Q ss_pred CEEEEEEEEEEEEEECCCCEEEEEEECCC-CCC-CCCCCEEEECCEEEEEEEECCCCCCCCEEEEEEEHHHHHHCCCCCC Q ss_conf 53371189999998769935999981755-112-6677246533315423650455566867999986565532013555 Q gi|254780347|r 3 TGIVTDIGKIIAMTPIAKGMRLRVMTSYN-TSK-MKLGCSIAHAGICLTVVRLSEEHSVDNWYEVEVWAETNRITNIASW 80 (204) Q Consensus 3 tGIV~~~G~V~~i~~~~~~~~l~i~~~~~-~~~-l~~G~SIavnGvCLTV~~i~~~~~~~~~f~~~~~~ETl~~T~l~~l 80 (204) |||||++|+|.++++.++++++.+..|.. +.+ +..|+|||+|||||||++++ +++|.|+++|||+++|||+.| T Consensus 1 TGiV~~~g~V~~i~~~~~~~~~~i~~~~~~~~~~~~~~~SIavnGvcLTV~~~~-----~~~~~~~ii~eTl~~T~l~~~ 75 (86) T pfam00677 1 TGIVEGVGEIAEIIENGNARSIWIKLPDKMLTKVLIGKDSIAVNGISLTVTEVT-----GNQFTFDLIPETLKRTNLGSL 75 (86) T ss_pred CCCEEEEEEEEEEEECCCCEEEEEEECCHHHHHCCCCCCEEEECCEEEEEEEEC-----CCEEEEEEEHHHHHHCCCCCC T ss_conf 968828999999999799699999938178542023688799855899999987-----999999998999843835348 Q ss_pred CCCCCCCCCC Q ss_conf 6566313354 Q gi|254780347|r 81 GIGTFINLER 90 (204) Q Consensus 81 ~~G~~VNLE~ 90 (204) ++|++||||- T Consensus 76 k~G~~VNlE~ 85 (86) T pfam00677 76 KVGDKVNLEI 85 (86) T ss_pred CCCCEEEEEE T ss_conf 7799999658 No 8 >PRK13020 riboflavin synthase subunit alpha; Provisional Probab=99.82 E-value=1.2e-20 Score=141.36 Aligned_cols=85 Identities=24% Similarity=0.318 Sum_probs=44.4 Q ss_pred EEECCCCHHHHHHHCCCCCEEEEEEECCCCCCCEECCCCEEEEEEEEEEEEEECCCEEEEEEEHHHHCCCCHHHCCCCCE Q ss_conf 45401001101110023313798751342112100235719982345787741685499998444330133002489967 Q gi|254780347|r 103 VSGHIDGTVEILFLDFIGDSMYCRLSLPHNLEQFIAVKGSVCLNGVSLTVNLVGKNFFDVLLIRHTIEETTWKMHKVGDL 182 (204) Q Consensus 103 v~GhVd~~~~i~~i~~~~~~~~~~i~~~~~~~~~l~~KgsI~idGvSLTV~~~~~~~f~v~lIp~Tl~~T~l~~~k~Gd~ 182 (204) .+|.|.++|+|.++++.++.+.+.|.+|..+.+.+....|||+|||||||.++.++.|.+.++|||+++|+|+.|++|++ T Consensus 2 FTGIIe~~G~I~~i~~~~~~~~l~I~~~~~~~~~l~iGdSIavnGvCLTV~~~~~~~~~fdv~~ETl~~T~l~~~~~G~~ 81 (207) T PRK13020 2 FTGIVQAVAEVVAIHKKDGLNTLEIAFPPDLLEGLEIGASVAVNGVCLTVTKIEGDRVFFDVMEETLRLTNLANLRVGDR 81 (207) T ss_pred CCEECCEEEEEEEEEECCCCEEEEEECCCHHHCCCCCCCEEEECCEEEEEEECCCCEEEEEEEHHHHHHCCHHHCCCCCE T ss_conf 88551638999999976996899998682241357789879988889888974898599998789631252313235987 Q ss_pred EEEEH Q ss_conf 46971 Q gi|254780347|r 183 INIEV 187 (204) Q Consensus 183 VNiE~ 187 (204) ||||- T Consensus 82 VNLEr 86 (207) T PRK13020 82 VNIER 86 (207) T ss_pred EEECC T ss_conf 60044 No 9 >PRK09289 riboflavin synthase subunit alpha; Provisional Probab=99.81 E-value=2.4e-20 Score=139.61 Aligned_cols=86 Identities=24% Similarity=0.339 Sum_probs=64.5 Q ss_pred EEEECCCCHHHHHHHCCCCCEEEEEEECCCCCCCEECCCCEEEEEEEEEEEEEECCCEEEEEEEHHHHCCCCHHHCCCCC Q ss_conf 24540100110111002331379875134211210023571998234578774168549999844433013300248996 Q gi|254780347|r 102 LVSGHIDGTVEILFLDFIGDSMYCRLSLPHNLEQFIAVKGSVCLNGVSLTVNLVGKNFFDVLLIRHTIEETTWKMHKVGD 181 (204) Q Consensus 102 ~v~GhVd~~~~i~~i~~~~~~~~~~i~~~~~~~~~l~~KgsI~idGvSLTV~~~~~~~f~v~lIp~Tl~~T~l~~~k~Gd 181 (204) +.+|.|.++|+|.++++.++...++|++|..+ +.+....|||+|||||||.++.++.|+++++|||+++|||+.|++|+ T Consensus 1 MFTGIIe~~G~i~~i~~~~~~~~l~I~~~~~~-~~~~iG~SIavnGvCLTV~~~~~~~~~fdv~~ETl~~T~l~~l~~G~ 79 (198) T PRK09289 1 MFTGIVEEVGTVVSIEEKGGGRRLTIEAPKLL-SDLKLGDSIAVNGVCLTVTEIDGDGFTVDVSPETLRRTNLGDLKVGD 79 (198) T ss_pred CCCEECCEEEEEEEEEECCCCEEEEEECCCCC-CCCCCCCEEEECCEEEEEEECCCCEEEEEEEHHHHHHCCCCCCCCCC T ss_conf 98854354899999997699789999808643-56767887998889988898689889999897883331377777796 Q ss_pred EEEEEHH Q ss_conf 7469713 Q gi|254780347|r 182 LINIEVD 188 (204) Q Consensus 182 ~VNiE~D 188 (204) +||||-- T Consensus 80 ~VNLErs 86 (198) T PRK09289 80 RVNLERA 86 (198) T ss_pred EEEECCC T ss_conf 6710557 No 10 >TIGR00187 ribE riboflavin synthase, alpha subunit; InterPro: IPR001783 The following proteins have been shown , to be structurally and evolutionary related: Riboflavin synthase alpha chain (2.5.1.9 from EC) (RS-alpha) (gene ribC in Escherichia coli, ribB in Bacillus subtilis and Photobacterium leiognathi, RIB5 in yeast. This enzyme synthesises riboflavin from two moles of 6,7- dimethyl-8-(1'-D-ribityl)lumazine (Lum), a pteridine-derivative. Photobacterium phosphoreum lumazine protein (LumP) (gene luxL). LumP is a protein that modulates the colour of the bioluminescence emission of bacterial luciferase. In the presence of LumP, light emission is shifted to higher energy values (shorter wavelength). LumP binds non-covalently to 6,7-dimethyl-8-(1'-D-ribityl)lumazine. Vibrio fischeri yellow fluorescent protein (YFP) (gene luxY). Like LumP, YFP modulates light emission but towards a longer wavelength. YFP binds non-covalently to FMN. These proteins seem to have evolved from the duplication of a domain of about 100 residues. In its C-terminal section, this domain contains a conserved motif [KR]-V-N-[LI]-E which has been proposed to be the binding site for lumazine (Lum) and some of its derivatives. RS-alpha which binds two molecules of Lum has two perfect copies of this motif, while LumP which binds one molecule of Lum, has a Glu instead of Lys/Arg in the first position of the second copy of the motif. Similarly, YFP, which binds to one molecule of FMN, also seems to have a potentially dysfunctional binding site by substitution of Gly for Glu in the last position of the first copy of the motif. . Probab=99.81 E-value=1.4e-20 Score=140.97 Aligned_cols=88 Identities=27% Similarity=0.335 Sum_probs=83.4 Q ss_pred EEEECCCCHHHHHHHCCCCCE------EEEEEECCCCCCCEECCCCEEEEEEEEEEEEEECCCEEEEEEEHHHHCCCCHH Q ss_conf 245401001101110023313------79875134211210023571998234578774168549999844433013300 Q gi|254780347|r 102 LVSGHIDGTVEILFLDFIGDS------MYCRLSLPHNLEQFIAVKGSVCLNGVSLTVNLVGKNFFDVLLIRHTIEETTWK 175 (204) Q Consensus 102 ~v~GhVd~~~~i~~i~~~~~~------~~~~i~~~~~~~~~l~~KgsI~idGvSLTV~~~~~~~f~v~lIp~Tl~~T~l~ 175 (204) +.||.|+++|+|.++++..+. ....++.+.....-+.-.+|||+|||||||.+++++.|+|.|.|||+++|||+ T Consensus 1 MFTGIi~~~g~~~s~~~~~~~esGGqG~~~~~~~~~~~L~D~~lGdSIA~NGvCLTV~e~~~~~f~vd~s~ET~~~t~L~ 80 (225) T TIGR00187 1 MFTGIIQGTGKVKSIKEKRLFESGGQGISLTINLAQKILEDLHLGDSIAVNGVCLTVTEINKDHFSVDLSPETLKRTNLG 80 (225) T ss_pred CCCEEECCCEEEEECCCCCCCCCCCCCEEECCCCCHHHHCCCCCCCEEEECCEEEEEEEECCCEEEEEECHHHHHHHHHH T ss_conf 96302023023320013322012785047634872345402520454123674332221128868998325673354233 Q ss_pred HCCCC-CEEEEEHHH Q ss_conf 24899-674697135 Q gi|254780347|r 176 MHKVG-DLINIEVDS 189 (204) Q Consensus 176 ~~k~G-d~VNiE~D~ 189 (204) .+|.| ++||||.=+ T Consensus 81 ~~k~gr~~VNiErA~ 95 (225) T TIGR00187 81 DLKVGRTKVNIERAL 95 (225) T ss_pred HHHHCCCEECEEHHH T ss_conf 355236720110674 No 11 >COG0307 RibC Riboflavin synthase alpha chain [Coenzyme metabolism] Probab=99.76 E-value=8.5e-19 Score=130.51 Aligned_cols=84 Identities=27% Similarity=0.391 Sum_probs=52.9 Q ss_pred CEEEEEEEEEEEEEECCCCEEEEEE-ECCCCCC--CCCCCEEEECCEEEEEEEECCCCCCCCEEEEEEEHHHHHHCCCCC Q ss_conf 5337118999999876993599998-1755112--667724653331542365045556686799998656553201355 Q gi|254780347|r 3 TGIVTDIGKIIAMTPIAKGMRLRVM-TSYNTSK--MKLGCSIAHAGICLTVVRLSEEHSVDNWYEVEVWAETNRITNIAS 79 (204) Q Consensus 3 tGIV~~~G~V~~i~~~~~~~~l~i~-~~~~~~~--l~~G~SIavnGvCLTV~~i~~~~~~~~~f~~~~~~ETl~~T~l~~ 79 (204) +|.|.++|+|.++++.++.+++.+. .|..+.+ +.+| |||+||++|||.++. +++|++.++|||+++|+|+. T Consensus 97 sGHVDg~g~I~~i~~~~na~~~~~~~~~~~l~kyiv~KG-sIavDGiSLTV~~v~-----~~~F~V~lIPhT~~~T~l~~ 170 (204) T COG0307 97 SGHVDGTGEIVKIEKDGNAVRFWFKAPPEELAKYIVEKG-SIAVDGISLTVNEVE-----DDTFSVSLIPHTLERTTLGE 170 (204) T ss_pred EEEEEEEEEEEEEEECCCEEEEEEECCCHHHHHHHCCCC-EEEEEEEEEEEEEEC-----CCEEEEEEEECCHHHCCHHH T ss_conf 767700899999987487099999759989985520464-499831899999772-----87899999736043300300 Q ss_pred CCCCCCCCCCCCC Q ss_conf 5656631335432 Q gi|254780347|r 80 WGIGTFINLERSV 92 (204) Q Consensus 80 l~~G~~VNLE~a~ 92 (204) +++||+||+|--+ T Consensus 171 ~~~Gd~VNiE~D~ 183 (204) T COG0307 171 KKVGDRVNIEIDV 183 (204) T ss_pred CCCCCEEEEEECH T ss_conf 6689988996817 No 12 >KOG3310 consensus Probab=99.51 E-value=1.4e-14 Score=105.81 Aligned_cols=87 Identities=22% Similarity=0.275 Sum_probs=74.7 Q ss_pred EEEECCCCHHHHHHHCCCC-CEEEEEEECCCCCCCEECCCCEEEEEEEEEEEEEECCCEEEEEEEHHHHCCCCHHHCCCC Q ss_conf 2454010011011100233-137987513421121002357199823457877416854999984443301330024899 Q gi|254780347|r 102 LVSGHIDGTVEILFLDFIG-DSMYCRLSLPHNLEQFIAVKGSVCLNGVSLTVNLVGKNFFDVLLIRHTIEETTWKMHKVG 180 (204) Q Consensus 102 ~v~GhVd~~~~i~~i~~~~-~~~~~~i~~~~~~~~~l~~KgsI~idGvSLTV~~~~~~~f~v~lIp~Tl~~T~l~~~k~G 180 (204) +.+|.|+|.|+++.....+ ++..+.|.++.-+..- -...|||+||+||||.+.+...|.|.+-|||+++|||+.|+.| T Consensus 1 mftgive~mG~Vkd~~~~dd~Gf~~kI~A~~iL~Dc-h~GDSIAVNGtCLTVTeFn~~~FtVGiaPEtlr~tnl~~~k~G 79 (210) T KOG3310 1 MFTGIVECMGEVKDLGMADDGGFDLKIGARVILEDC-HLGDSIAVNGTCLTVTEFNAEEFTVGIAPETLRKTNLEELKKG 79 (210) T ss_pred CCCCHHHHHEEHHHCCCCCCCCEEEEECCCEEEEEC-CCCCEEEECCEEEEEEEECCCCEEEECCHHHHHHCCHHHHHCC T ss_conf 941057221132304765468788896687355312-3577278834799888605420587258799632569887359 Q ss_pred CEEEEEHHH Q ss_conf 674697135 Q gi|254780347|r 181 DLINIEVDS 189 (204) Q Consensus 181 d~VNiE~D~ 189 (204) +.||+|--. T Consensus 80 ~pVNLERAv 88 (210) T KOG3310 80 SPVNLERAV 88 (210) T ss_pred CCCCHHHHC T ss_conf 962565640 No 13 >pfam07944 DUF1680 Putative glycosyl hydrolase of unknown function (DUF1680). The members of this family are sequences derived from hypothetical bacterial and eukaryotic proteins of unknown function. One members of this family is annotated as a possible arabinosidase, but no references were found to back this. These proteins are related to a large family of glycosyl hydrolases. Probab=78.00 E-value=5.2 Score=20.61 Aligned_cols=60 Identities=15% Similarity=0.249 Sum_probs=42.1 Q ss_pred CCCEEEEEEECCCCCCCEECCCCEEEEEEEEEEEEEECCCEEEEEEEHHHHCCCCHHHCCCCCEEEEEHHHHHHHH Q ss_conf 3313798751342112100235719982345787741685499998444330133002489967469713588999 Q gi|254780347|r 119 IGDSMYCRLSLPHNLEQFIAVKGSVCLNGVSLTVNLVGKNFFDVLLIRHTIEETTWKMHKVGDLINIEVDSMMRYI 194 (204) Q Consensus 119 ~~~~~~~~i~~~~~~~~~l~~KgsI~idGvSLTV~~~~~~~f~v~lIp~Tl~~T~l~~~k~Gd~VNiE~D~l~kyv 194 (204) .+..+.+.+.+|..- .+-.|.|||-........+..+.+. ..|+.||.|.|+.+|-.|.+ T Consensus 429 ~~~~~~l~lRiP~Wa-----~~~~v~vNG~~~~~~~~~~gY~~i~-----------R~W~~GD~i~l~lpm~~r~~ 488 (511) T pfam07944 429 KPAEFTLYLRIPGWA-----AGATLTVNGKPVVVQPKSDGYLSIT-----------REWKKGDRVELTLPMPVRLE 488 (511) T ss_pred CCCEEEEEEECCCCC-----CCCEEEECCEECCCCCCCCCEEEEE-----------EEECCCCEEEEEECCEEEEE T ss_conf 984489999876778-----8987999998765665689569998-----------53579898999935835998 No 14 >pfam06597 Clostridium_P47 Clostridium P-47 protein. This family consists of several P-47 proteins from various Clostridium species as well as two related sequences from Pseudomonas putida. The function of this family is unknown. Probab=76.19 E-value=6.3 Score=20.13 Aligned_cols=77 Identities=17% Similarity=0.159 Sum_probs=45.8 Q ss_pred EEEECCCCEEEEEEECCCCCC-CCCCCEEEECCEEEEEEEECCCCCCCCEEEEEEEHH-----HHHHCCCCCCC--CCCC Q ss_conf 998769935999981755112-667724653331542365045556686799998656-----55320135556--5663 Q gi|254780347|r 14 AMTPIAKGMRLRVMTSYNTSK-MKLGCSIAHAGICLTVVRLSEEHSVDNWYEVEVWAE-----TNRITNIASWG--IGTF 85 (204) Q Consensus 14 ~i~~~~~~~~l~i~~~~~~~~-l~~G~SIavnGvCLTV~~i~~~~~~~~~f~~~~~~E-----Tl~~T~l~~l~--~G~~ 85 (204) +|..-+++.++++..|-.--. .-.|..+.++|+|+++- ..+++.|. +....++..++ .++. T Consensus 50 qIinGGsg~~lrl~~PIk~G~f~~k~~~~~L~Gv~p~le-----------v~L~f~p~~~p~~~~~~~~~kkvk~~~~~~ 118 (413) T pfam06597 50 EIINGGSGNLLRIKIPIREGYFKVKNTTISLSGVNPVLE-----------IKLDLLPQGNGQNQTSNPNIKKLKFTQGSE 118 (413) T ss_pred EEEECCCCCEEEEEEEEECCEEEECCEEEECCCCCEEEE-----------EEEEEECCCCCCCCCCCCCEEEEEEEECCC T ss_conf 996078876379999720267997546887147767899-----------999860689985455785407876641279 Q ss_pred CCCCCCCHHHHCCCCCE Q ss_conf 13354320000047642 Q gi|254780347|r 86 INLERSVKLGDRLDGHL 102 (204) Q Consensus 86 VNLE~a~~~g~~lgGH~ 102 (204) =|.-++.-+.| .+||+ T Consensus 119 s~~~~~Viv~D-~~g~l 134 (413) T pfam06597 119 SGDLIKVILSD-PSGKL 134 (413) T ss_pred CCCCCEEEEEC-CCCCC T ss_conf 99873699988-88887 No 15 >PRK00054 dihydroorotate dehydrogenase electron transfer subunit; Reviewed Probab=59.16 E-value=15 Score=17.84 Aligned_cols=175 Identities=13% Similarity=0.119 Sum_probs=87.3 Q ss_pred EEEEEEEEEEEC-CCCEEEEEEECCCCCCCCCCCEEEE-----C---CEEEEEEEECCCCCCCCEEEEEEEHHHHHHCCC Q ss_conf 118999999876-9935999981755112667724653-----3---315423650455566867999986565532013 Q gi|254780347|r 7 TDIGKIIAMTPI-AKGMRLRVMTSYNTSKMKLGCSIAH-----A---GICLTVVRLSEEHSVDNWYEVEVWAETNRITNI 77 (204) Q Consensus 7 ~~~G~V~~i~~~-~~~~~l~i~~~~~~~~l~~G~SIav-----n---GvCLTV~~i~~~~~~~~~f~~~~~~ETl~~T~l 77 (204) +..++|.++++. ++.+.++++.+..+ ++|.=+.+ + -..+++.+.+.+. ..|.+...-+.- ..| T Consensus 4 p~~~~Iv~~~~~~~~i~~l~l~~~~~~---~PGQFv~l~~~~~~~~l~rP~Si~~~~~~~---~~i~v~~vG~gT--~~l 75 (248) T PRK00054 4 PENMKIVENKEIAPNIYTLVLDGDFVF---KPGQFVMVWVWDDPGLLERPISISDIDKNE---NTILYRAVGEGT--KKL 75 (248) T ss_pred CEEEEEEEEEEECCCEEEEEECCCCCC---CCCCEEEEEECCCCCCCCEEEEEEECCCCE---EEEEEEEECHHH--HHH T ss_conf 704799999995799799998466877---999779999788898443641999638985---899999988799--999 Q ss_pred CCCCCCCCCCCCCCCHHHH---CCCCCEEE-ECCCCHHHHHHHC----CCCCEEEEEEECC--CC--CCCEECCCCEEEE Q ss_conf 5556566313354320000---04764245-4010011011100----2331379875134--21--1210023571998 Q gi|254780347|r 78 ASWGIGTFINLERSVKLGD---RLDGHLVS-GHIDGTVEILFLD----FIGDSMYCRLSLP--HN--LEQFIAVKGSVCL 145 (204) Q Consensus 78 ~~l~~G~~VNLE~a~~~g~---~lgGH~v~-GhVd~~~~i~~i~----~~~~~~~~~i~~~--~~--~~~~l~~KgsI~i 145 (204) .+++.||.+++--|+=-+= .-+++.+- |-=-|+|.+..+- ..++...+-+... +. +...+-.-+ T Consensus 76 ~~l~~Gd~l~v~GPlGngf~~~~~~~~~llVaGGiGiAPl~~l~~~l~~~~~~v~~i~G~r~~~~l~~~~el~~~~---- 151 (248) T PRK00054 76 SKLKEGDELDIRGPLGNGFDLEKIGGKVLLVGGGIGVAPLYELAKQLKAKGVEVTTVLGARTKDEVIFEDEFEKYG---- 151 (248) T ss_pred HCCCCCCEEEEECCCCCCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHCCHHHHHHHHC---- T ss_conf 6188999897452568872367789619999676745768999999997299779999758879811799998654---- Q ss_pred EEEEEEEEE--ECCCEEEEEEEHHHHCCCCHHHCCCCCEEEEEHHHHHHHHHHHHCCC Q ss_conf 234578774--16854999984443301330024899674697135889999986023 Q gi|254780347|r 146 NGVSLTVNL--VGKNFFDVLLIRHTIEETTWKMHKVGDLINIEVDSMMRYIARLYALN 201 (204) Q Consensus 146 dGvSLTV~~--~~~~~f~v~lIp~Tl~~T~l~~~k~Gd~VNiE~D~l~kyv~r~~~~~ 201 (204) | +-+|..+ .+...|-..++++- ........--=+.+|.|.+.+++..+ T Consensus 152 ~-~~i~tddGs~G~~G~vt~~l~~~-------~~~~d~v~~cGP~~Mmk~v~~~~~~~ 201 (248) T PRK00054 152 D-VYVATDDGSYGFKGFVTDVLDEL-------DLDYDAIYSCGPEIMMKKVVEILKEK 201 (248) T ss_pred C-EEEEECCCCCCCCCEECHHHHHC-------CCCCCEEEEECCHHHHHHHHHHHHHC T ss_conf 7-67980678676651710312322-------56868999969989999999998755 No 16 >pfam11948 DUF3465 Protein of unknown function (DUF3465). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 131 to 151 amino acids in length. This protein has a conserved HWTH sequence motif. Probab=48.61 E-value=23 Score=16.83 Aligned_cols=13 Identities=23% Similarity=0.526 Sum_probs=8.4 Q ss_pred EEEEEEEEEEEEC Q ss_conf 7118999999876 Q gi|254780347|r 6 VTDIGKIIAMTPI 18 (204) Q Consensus 6 V~~~G~V~~i~~~ 18 (204) |++.|+|.++.+. T Consensus 39 v~g~G~V~~iLpD 51 (131) T pfam11948 39 VSGSGTVVRILPD 51 (131) T ss_pred EECCEEEEEECCC T ss_conf 9621689997888 No 17 >pfam06534 RGM_C Repulsive guidance molecule (RGM) C-terminus. This family consists of several mammalian and one bird sequence from Gallus gallus (Chicken). This family represents the C-terminal region of several sequences but in others it represents the full protein. All of the mammalian proteins are hypothetical and have no known function but a member from chicken is annotated as being a repulsive guidance molecule (RGM). RGM is a GPI-linked axon guidance molecule of the retinotectal system. RGM is repulsive for a subset of axons, those from the temporal half of the retina. Temporal retinal axons invade the anterior optic tectum in a superficial layer, and encounter RGM expressed in a gradient with increasing concentration along the anterior-posterior axis. Temporal axons are able to receive posterior-dependent information by sensing gradients or concentrations of guidance cues. Thus, RGM is likely to provide positional information for temporal axons invading the optic tectum in the s Probab=47.51 E-value=7.2 Score=19.77 Aligned_cols=60 Identities=15% Similarity=0.338 Sum_probs=37.3 Q ss_pred CCCHHHHCCCCCEEEECCCCHHHHHHHCCCCCEEEEEEECCCCCCCE-ECCCC-EEEEEEEE Q ss_conf 43200000476424540100110111002331379875134211210-02357-19982345 Q gi|254780347|r 90 RSVKLGDRLDGHLVSGHIDGTVEILFLDFIGDSMYCRLSLPHNLEQF-IAVKG-SVCLNGVS 149 (204) Q Consensus 90 ~a~~~g~~lgGH~v~GhVd~~~~i~~i~~~~~~~~~~i~~~~~~~~~-l~~Kg-sI~idGvS 149 (204) ++++...+.+|+.|-=|..-+|+-+-+++.+....+-|.+|+.+..- --.++ -.|++|+- T Consensus 34 ~sl~i~e~~~g~~VEI~a~yIgTtIiVRQvG~YLTfaiRmPeev~~~~e~~qdLqLC~~GCP 95 (213) T pfam06534 34 NSLRIVEKVPGRHVEIHARYIGTTIIVRQVGRYLTFAVRMPEEVANAYEESQDLQLCLHGCP 95 (213) T ss_pred CEEEEEEECCCCEEEEEEEECCCEEEEEEECCEEEEEEECHHHHHHHCCCCCCEEHHHCCCC T ss_conf 34899960799789999987073899997347258999776999853176788437646798 No 18 >KOG3380 consensus Probab=44.63 E-value=12 Score=18.39 Aligned_cols=17 Identities=29% Similarity=0.598 Sum_probs=14.3 Q ss_pred EHHHHHHHHHHHHCCCC Q ss_conf 71358899999860233 Q gi|254780347|r 186 EVDSMMRYIARLYALNT 202 (204) Q Consensus 186 E~D~l~kyv~r~~~~~~ 202 (204) |+|+|.|||.|=|+... T Consensus 102 ~~DiLmKYiYkGm~~p~ 118 (152) T KOG3380 102 EIDILMKYIYKGMEIPS 118 (152) T ss_pred HHHHHHHHHHHHHCCCC T ss_conf 78899999998702766 No 19 >cd01247 PH_GPBP Goodpasture antigen binding protein (GPBP) Pleckstrin homology (PH) domain. The GPBP protein is a kinase that phosphorylates an N-terminal region of the alpha 3 chain of type IV collagen , which is commonly known as the goodpasture antigen. It has has an N-terminal PH domain and a C-terminal START domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinsases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes. Probab=41.93 E-value=29 Score=16.21 Aligned_cols=27 Identities=22% Similarity=0.389 Sum_probs=17.2 Q ss_pred CCCEEEEEEEEEEEEEECCCEEEEEEE Q ss_conf 357199823457877416854999984 Q gi|254780347|r 139 VKGSVCLNGVSLTVNLVGKNFFDVLLI 165 (204) Q Consensus 139 ~KgsI~idGvSLTV~~~~~~~f~v~lI 165 (204) .+|||.+.+.++.+.+-++..|.|..- T Consensus 40 ~RGsI~L~~a~i~~~~~d~~~F~I~~~ 66 (91) T cd01247 40 CRGSIFLKKAIIAAHEFDENRFDISVN 66 (91) T ss_pred CCEEEEEEEEEEEECCCCCCEEEEEEC T ss_conf 846999105699946888979999949 No 20 >KOG2415 consensus Probab=36.95 E-value=19 Score=17.27 Aligned_cols=52 Identities=15% Similarity=0.112 Sum_probs=32.4 Q ss_pred HHCCCCCEEEECCCCHHHHHHHCCCCCE--EEEEEECCCCCCCEECCCCEEEEE Q ss_conf 0004764245401001101110023313--798751342112100235719982 Q gi|254780347|r 95 GDRLDGHLVSGHIDGTVEILFLDFIGDS--MYCRLSLPHNLEQFIAVKGSVCLN 146 (204) Q Consensus 95 g~~lgGH~v~GhVd~~~~i~~i~~~~~~--~~~~i~~~~~~~~~l~~KgsI~id 146 (204) ..++|||.++|-|-..+-..+.-..... --+...+.++-.++|..|+-|.+- T Consensus 114 aa~~GghtlSGaviep~aldEL~P~wke~~apl~t~vT~d~~~fLt~~~~i~vP 167 (621) T KOG2415 114 AAEVGGHTLSGAVIEPGALDELLPDWKEDGAPLNTPVTSDKFKFLTGKGRISVP 167 (621) T ss_pred CCCCCCCEECCEEECCCHHHHHCCCHHHCCCCCCCCCCCCCEEEECCCCEEECC T ss_conf 231477132130506543666584412138744542220304454367403068 No 21 >pfam00970 FAD_binding_6 Oxidoreductase FAD-binding domain. Probab=29.85 E-value=46 Score=15.05 Aligned_cols=78 Identities=13% Similarity=0.107 Sum_probs=29.8 Q ss_pred EEEEEEECCC-CEEEEEEECCC--CCCCCCCCEEE----ECCEEEE----EEEECCCCCCCCEEEEEEE--HHHHHHCCC Q ss_conf 9999987699-35999981755--11266772465----3331542----3650455566867999986--565532013 Q gi|254780347|r 11 KIIAMTPIAK-GMRLRVMTSYN--TSKMKLGCSIA----HAGICLT----VVRLSEEHSVDNWYEVEVW--AETNRITNI 77 (204) Q Consensus 11 ~V~~i~~~~~-~~~l~i~~~~~--~~~l~~G~SIa----vnGvCLT----V~~i~~~~~~~~~f~~~~~--~ETl~~T~l 77 (204) +|.++++... ..+++++.+.. ......|.-|. .+|.+++ +.+... ..+.+++.+- +.=.--+-| T Consensus 3 kv~~~~~lt~dv~~~~f~~~~~~~~~~f~pGQ~v~l~~~~~g~~~~R~YS~~s~p~---~~~~~e~~Vk~~~~G~~S~~l 79 (99) T pfam00970 3 KLVEKESLSHDTRRFRFALPSPDQVLGLPPGQHIFLRLPIDGKLVVRAYSPASSPD---DVGELELLVKVYPGGKMSQYL 79 (99) T ss_pred EEEEEEECCCCEEEEEEECCCCCCCCCCCCCEEEEEEEEECCEEEEEEEECCCCCC---CCCEEEEEEEEECCCCHHHHH T ss_conf 99999985899599999979986434889960999997659989987722156657---898599999997079667787 Q ss_pred CCCCCCCCCCCCCC Q ss_conf 55565663133543 Q gi|254780347|r 78 ASWGIGTFINLERS 91 (204) Q Consensus 78 ~~l~~G~~VNLE~a 91 (204) ..+++||.|.++.| T Consensus 80 ~~l~~Gd~v~i~gP 93 (99) T pfam00970 80 DSLKVGDTVEVKGP 93 (99) T ss_pred HCCCCCCEEEEECC T ss_conf 52899999999843 No 22 >KOG1653 consensus Probab=27.62 E-value=43 Score=15.24 Aligned_cols=97 Identities=13% Similarity=0.092 Sum_probs=56.5 Q ss_pred CCCCCCCCCHHHHCCCCCEEEECCCCHHHHHHHCCCCCEEE-EEEECCCCCCCEECCCCEEEEEEEEEEEEEECCCEEEE Q ss_conf 63133543200000476424540100110111002331379-87513421121002357199823457877416854999 Q gi|254780347|r 84 TFINLERSVKLGDRLDGHLVSGHIDGTVEILFLDFIGDSMY-CRLSLPHNLEQFIAVKGSVCLNGVSLTVNLVGKNFFDV 162 (204) Q Consensus 84 ~~VNLE~a~~~g~~lgGH~v~GhVd~~~~i~~i~~~~~~~~-~~i~~~~~~~~~l~~KgsI~idGvSLTV~~~~~~~f~v 162 (204) ...|=|++.++---+---+.-|.|- ...+.++-+++.... |.+--...+...+.. +|- ..-.-+|-.| T Consensus 41 ~~~~~~~p~k~~~~vnkv~lvG~VG-qdPl~k~~rngrpVtiFsv~T~~~~k~r~~q------~g~----~~~~tqWHRV 109 (175) T KOG1653 41 ETTTGEDPRKLERGVNKVILVGRVG-QDPLQKILRNGRPVTIFSVGTGGMFKQRLYQ------AGD----QPQPTQWHRV 109 (175) T ss_pred CCCCCCCHHHHHCCCCEEEEECCCC-CCHHHHHHCCCCEEEEEEEECCCCCCCCCCC------CCC----CCCCCEEEEE T ss_conf 5547333355543411699971335-2368885237970899995158520145544------677----6785405799 Q ss_pred EEEHHHHCCCCHHHCCCCCEEEEEHHHHH Q ss_conf 98444330133002489967469713588 Q gi|254780347|r 163 LLIRHTIEETTWKMHKVGDLINIEVDSMM 191 (204) Q Consensus 163 ~lIp~Tl~~T~l~~~k~Gd~VNiE~D~l~ 191 (204) ++-++-|+.+-++++|+|+.+-+|-++-. T Consensus 110 sVf~~~L~d~~~k~lkKGsriyveG~iey 138 (175) T KOG1653 110 SVFNEVLADYALKYLKKGSRIYVEGKIEY 138 (175) T ss_pred EEECCHHHHHHHHHHCCCCEEEEEEEEEE T ss_conf 86180688999997447977999656776 No 23 >pfam04699 P16-Arc ARP2/3 complex 16 kDa subunit (p16-Arc). The Arp2/3 protein complex has been implicated in the control of actin polymerisation. The human complex consists of seven subunits which include the actin related proteins Arp2 and Arp3, and five others referred to as p41-Arc, p34-Arc, p21-Arc, p20-Arc, and p16-Arc. The precise function of p16-Arc is currently unknown. Its structure consists of a single domain containing a bundle of seven alpha helices. Probab=26.42 E-value=33 Score=15.89 Aligned_cols=16 Identities=25% Similarity=0.426 Sum_probs=11.3 Q ss_pred EHHHHHHHHHHHHCCC Q ss_conf 7135889999986023 Q gi|254780347|r 186 EVDSMMRYIARLYALN 201 (204) Q Consensus 186 E~D~l~kyv~r~~~~~ 201 (204) +.|+|.||++|=|+.. T Consensus 103 ~~D~LMKYiYkGm~~p 118 (153) T pfam04699 103 QQDVLMKYLYKGMATP 118 (153) T ss_pred HHHHHHHHHHHHHCCC T ss_conf 7506899998752277 No 24 >TIGR00739 yajC preprotein translocase, YajC subunit; InterPro: IPR003849 This entry describes proteins of unknown function.. Probab=24.99 E-value=57 Score=14.53 Aligned_cols=38 Identities=26% Similarity=0.288 Sum_probs=28.3 Q ss_pred CCCCCCCCCEEEECC-EEEEEEEECCCCCCCCEEEEEEEHHH Q ss_conf 511266772465333-15423650455566867999986565 Q gi|254780347|r 31 NTSKMKLGCSIAHAG-ICLTVVRLSEEHSVDNWYEVEVWAET 71 (204) Q Consensus 31 ~~~~l~~G~SIavnG-vCLTV~~i~~~~~~~~~f~~~~~~ET 71 (204) ..+++++||+|-.+| ..=||.++.++ .+.+...+.+.| T Consensus 34 l~~~L~KGd~V~T~gGi~G~V~~i~e~---~~~i~i~~~~~t 72 (86) T TIGR00739 34 LIESLKKGDKVLTIGGIIGTVTKIAEN---TNNIVIELNDNT 72 (86) T ss_pred HHHCCCCCCEEEECCCEEEEEEEEECC---CCEEEEEECCCC T ss_conf 885279977899838838999885238---867899987994 No 25 >COG1661 Predicted DNA-binding protein with PD1-like DNA-binding motif [General function prediction only] Probab=24.01 E-value=59 Score=14.42 Aligned_cols=82 Identities=17% Similarity=0.334 Sum_probs=48.7 Q ss_pred CEEEECCEEEEE--EEECCCCCCCCEEEEEEEHHHHHH-CCCCCCCCCC-CCCCCCCC--HHHHCCCCCEEEECCCCHHH Q ss_conf 246533315423--650455566867999986565532-0135556566-31335432--00000476424540100110 Q gi|254780347|r 39 CSIAHAGICLTV--VRLSEEHSVDNWYEVEVWAETNRI-TNIASWGIGT-FINLERSV--KLGDRLDGHLVSGHIDGTVE 112 (204) Q Consensus 39 ~SIavnGvCLTV--~~i~~~~~~~~~f~~~~~~ETl~~-T~l~~l~~G~-~VNLE~a~--~~g~~lgGH~v~GhVd~~~~ 112 (204) +++.+=|+|-++ .-++.+.. ++-+. -..|++.. +-.|++..++ +++|--++ .-|.-+|||++.|-|.-+++ T Consensus 38 a~v~~iGal~~~~l~~~~~~~~--~y~~~-~~~e~~EvlSL~G~i~~~~p~~HlHa~l~~~~G~~~GGHL~~~~V~~t~E 114 (141) T COG1661 38 AVVTAIGALRDAKLRYFDPEEK--EYETI-PVNEPLEVLSLLGNIALDDPFVHLHAALGDENGITLGGHLLEGEVFPTAE 114 (141) T ss_pred EEEEEEEEEEEEEEEEECCCCC--CEEEE-ECCCCEEEEEECCEEECCCCCEEEEEEEECCCCCEEEEEECCCEEEEEEE T ss_conf 9998761220238997257777--16888-52783889982245414897479999972589948831214548867899 Q ss_pred HHHHCCCCCEE Q ss_conf 11100233137 Q gi|254780347|r 113 ILFLDFIGDSM 123 (204) Q Consensus 113 i~~i~~~~~~~ 123 (204) |.-.+-.+... T Consensus 115 v~I~el~~~~~ 125 (141) T COG1661 115 VFIRELPGELF 125 (141) T ss_pred EEEEECCCCCE T ss_conf 99998146210 No 26 >TIGR00038 efp translation elongation factor P; InterPro: IPR011768 Members of this family possess translation elongation factor activity. They have been shown to stimulate efficient translation and peptide-bond synthesis on native or reconstituted 70S ribosomes in vitro, possibly indirectly by altering the affinity of the ribosome for aminoacyl-tRNA, thus increasing their reactivity as acceptors for peptidyl transferase (by similarity) . This family does not include members of IPR011897 from INTERPRO, an EFP-like protein of certain gammaproteobacteria.; GO: 0003746 translation elongation factor activity, 0006414 translational elongation, 0005737 cytoplasm. Probab=23.91 E-value=41 Score=15.37 Aligned_cols=120 Identities=18% Similarity=0.202 Sum_probs=70.4 Q ss_pred CCCCCCCCEEEE-CCEEEEEEEECCCCCCCCEEEEEEEHHHHHHCCCCCCCCCCCCCCCCCCHHHHCCCCCEEEECCCCH Q ss_conf 112667724653-3315423650455566867999986565532013555656631335432000004764245401001 Q gi|254780347|r 32 TSKMKLGCSIAH-AGICLTVVRLSEEHSVDNWYEVEVWAETNRITNIASWGIGTFINLERSVKLGDRLDGHLVSGHIDGT 110 (204) Q Consensus 32 ~~~l~~G~SIav-nGvCLTV~~i~~~~~~~~~f~~~~~~ETl~~T~l~~l~~G~~VNLE~a~~~g~~lgGH~v~GhVd~~ 110 (204) .++++.|-||.. ||.-+-|++... .=--.-..+-||.|++|+.|..|. .-.|.||.+..-- |+. T Consensus 7 s~d~r~Gl~i~~~~G~~~~VvE~~f--------vKpGkG~AfVR~Klk~L~tG~v~~--~tFk~Gd~~e~~~----i~~- 71 (194) T TIGR00038 7 SNDFRKGLKILLIDGEPYRVVENEF--------VKPGKGQAFVRTKLKNLRTGAVVE--KTFKAGDKVEQAN----IEE- 71 (194) T ss_pred EECCCCCCEEEECCCEEEEEEECCC--------CCCCCCCCEEEEEEEECCCCCEEE--EECCCCCCCCCCE----EEE- T ss_conf 2045688538722983899981245--------787566616676653135884899--9417668635416----764- Q ss_pred HHHHHHCCCCCEEEE-------EEECCCCCCCEECCCCEEEEEEEEEEEEEECCCEEEEEEEH Q ss_conf 101110023313798-------75134211210023571998234578774168549999844 Q gi|254780347|r 111 VEILFLDFIGDSMYC-------RLSLPHNLEQFIAVKGSVCLNGVSLTVNLVGKNFFDVLLIR 166 (204) Q Consensus 111 ~~i~~i~~~~~~~~~-------~i~~~~~~~~~l~~KgsI~idGvSLTV~~~~~~~f~v~lIp 166 (204) -...-.=+.++.|.| .++++..-...|=.+.-=-.|...-+|.-.++..+.|.|.+ T Consensus 72 ~~mqylYkDgd~~vFMD~etyeQ~~~~~~~sanl~d~~kfl~e~~~v~v~~~~~~~l~v~lP~ 134 (194) T TIGR00038 72 KTMQYLYKDGDQYVFMDLETYEQISLDAKQSANLGDEAKFLLEEMEVSVVLYNDEILGVELPN 134 (194) T ss_pred EEEEEEEECCCEEEECCCCCHHHEECCCCCCCCCHHHHCCCCCCCEEEEEEECCEEEECCCCC T ss_conf 334477706976886678865553116111545201100235663389997689484130787 No 27 >PRK10476 multidrug resistance protein MdtN; Provisional Probab=23.05 E-value=62 Score=14.31 Aligned_cols=73 Identities=15% Similarity=0.201 Sum_probs=46.0 Q ss_pred EEEEECCC--C--CCCCCCCEEEECCEEEEEEEECCCCCCCCEEEEEEEHHHHHHCCCCCCCCCCCCCCCCCCHHHHCCC Q ss_conf 99981755--1--1266772465333154236504555668679999865655320135556566313354320000047 Q gi|254780347|r 24 LRVMTSYN--T--SKMKLGCSIAHAGICLTVVRLSEEHSVDNWYEVEVWAETNRITNIASWGIGTFINLERSVKLGDRLD 99 (204) Q Consensus 24 l~i~~~~~--~--~~l~~G~SIavnGvCLTV~~i~~~~~~~~~f~~~~~~ETl~~T~l~~l~~G~~VNLE~a~~~g~~lg 99 (204) ..|.+|.. . ..++.|+-+.-.---+|++..+ .-|...+ .+|| .++..++|+.|.+.- |-.. T Consensus 211 t~I~AP~dG~V~~~~v~~G~~V~pG~pl~~lv~~~-----~~wV~An-~~Et----~l~~v~~Gq~v~I~~-----Da~p 275 (348) T PRK10476 211 TTVRAPFDGRVVGLKVSVGEFAAPMQPIFTLIDTD-----HWYAIAN-FRET----DLKNIRVGDCATVYS-----MIDR 275 (348) T ss_pred CEEECCCCEEEEEEECCCCCEECCCCEEEEEEECC-----CEEEEEE-ECHH----HHCCCCCCCEEEEEE-----ECCC T ss_conf 89977898499988669998778998489999689-----5799999-3277----825279989899999-----7689 Q ss_pred CCEEEECCCCHH Q ss_conf 642454010011 Q gi|254780347|r 100 GHLVSGHIDGTV 111 (204) Q Consensus 100 GH~v~GhVd~~~ 111 (204) |+...|||+.++ T Consensus 276 ~~~f~G~V~sI~ 287 (348) T PRK10476 276 GRPIEGKVDSIG 287 (348) T ss_pred CCEEEEEEEEEC T ss_conf 978999999988 No 28 >TIGR01343 hacA_fam homoaconitate hydratase family protein; InterPro: IPR006251 3-isopropylmalate dehydratase (or isopropylmalate isomerase; 4.2.1.33 from EC) catalyses the stereo-specific isomerisation of 2-isopropylmalate and 3-isopropylmalate, via the formation of 2-isopropylmaleate. This enzyme performs the second step in the biosynthesis of leucine, and is present in most prokaryotes and many fungal species. The prokaryotic enzyme is a heterodimer composed of a large (LeuC) and small (LeuD) subunit, while the fungal form is a monomeric enzyme. Both forms of isopropylmalate are related and are part of the larger aconitase family . Aconitases are mostly monomeric proteins which share four domains in common and contain a single, labile [4Fe-4S] cluster. Three structural domains (1, 2 and 3) are tightly packed around the iron-sulphur cluster, while a fourth domain (4) forms a deep active-site cleft. The prokaryotic enzyme is encoded by two adjacent genes, leuC and leuD, corresponding to aconitase domains 1-3 and 4 respectively , . LeuC does not bind an iron-sulphur cluster. It is thought that some prokaryotic isopropylamalate dehydrogenases can also function as homoaconitase 4.2.1.36 from EC, converting cis-homoaconitate to homoisocitric acid in lysine biosynthesis . Homoaconitase has been identified in higher fungi (mitochondria) and several archaea and one thermophilic species of bacteria, Thermus thermophilus . This entry represents the large subunit of 3-isopropylmalate dehydratase (LeuC), as well as homoaconitase, certain aconitases and uncharacterised proteins, and is a subgroup of those represented in IPR012095. Homoaconitase, aconitase and 3-isopropylmalate dehydratase have similar overall structures and domain organisation . All are dehydratases that bind a [4Fe-4S]-cluster. The majority of the members of this family have been designated as 3-isopropylmalate dehydratase large subunit (LeuC) in microbial genome annotation, but the only characterised member is T. thermophilus homoaconitase, an enzyme of a non-aspartate pathway of lysine biosynthesis. Information about related proteins can be found at Protein of the Month: Aconitase .; GO: 0016836 hydro-lyase activity, 0051539 4 iron 4 sulfur cluster binding, 0008652 amino acid biosynthetic process. Probab=22.58 E-value=29 Score=16.28 Aligned_cols=34 Identities=3% Similarity=0.050 Sum_probs=15.0 Q ss_pred EEEEEEEEEEEEEECCCEEEEEEEHHHHCCCCHH Q ss_conf 1998234578774168549999844433013300 Q gi|254780347|r 142 SVCLNGVSLTVNLVGKNFFDVLLIRHTIEETTWK 175 (204) Q Consensus 142 sI~idGvSLTV~~~~~~~f~v~lIp~Tl~~T~l~ 175 (204) -|..||.---..++.....+=.=+.+=+..||+. T Consensus 182 ~iG~DGAtY~a~Ef~Ge~~~~~~~~~R~TlaNMA 215 (432) T TIGR01343 182 EIGVDGATYKAMEFHGETVKNMDMEERLTLANMA 215 (432) T ss_pred HHCCCCCCEEEEEEECCEEEECCHHHHHHHHHHH T ss_conf 5447875206899817324526643466798899 No 29 >cd06219 DHOD_e_trans_like1 FAD/NAD binding domain in the electron transfer subunit of dihydroorotate dehydrogenase-like proteins. Dihydroorotate dehydrogenases (DHODs) catalyze the only redox reaction in pyrimidine de novo biosynthesis. They catalyze the oxidation of (S)-dihydroorotate to orotate coupled with the reduction of NAD+. In L. lactis, DHOD B (encoded by pyrDa) is co-expressed with pyrK and both gene products are required for full activity, as well as NAD binding. NAD(P) binding domain of ferredoxin reductase-like proteins catalyze electron transfer between an NAD(P)-binding domain of the alpha/beta class and a discrete (usually N-terminal) domain which vary in orientation with respect to the NAD(P) binding domain. The N-terminal domain may contain a flavin prosthetic group, as in flavoenzymes, or use flavin as a substrate. Ferredoxin is reduced in the final stage of photosystem I. The flavoprotein Ferredoxin-NADP+ reductase transfers electrons from reduced ferredoxin to FAD, Probab=20.35 E-value=70 Score=13.98 Aligned_cols=181 Identities=19% Similarity=0.179 Sum_probs=84.1 Q ss_pred EEEEEEECC-CCEEEEEEECCCCCCCCCCCEEE--ECC----EEEEEEEECCCCCCCCEEEEEEEHHHHHHCCCCCCCCC Q ss_conf 999998769-93599998175511266772465--333----15423650455566867999986565532013555656 Q gi|254780347|r 11 KIIAMTPIA-KGMRLRVMTSYNTSKMKLGCSIA--HAG----ICLTVVRLSEEHSVDNWYEVEVWAETNRITNIASWGIG 83 (204) Q Consensus 11 ~V~~i~~~~-~~~~l~i~~~~~~~~l~~G~SIa--vnG----vCLTV~~i~~~~~~~~~f~~~~~~ETl~~T~l~~l~~G 83 (204) +|.+.++.. +-+.+++++|....+.++|.=+. +++ .-+++...+.+ ++.+++-+-.---...-|..++.| T Consensus 2 kI~~~~~i~~~~~~l~l~ap~~a~~~~PGQFvml~~~~~~er~P~Si~~~d~~---~g~i~~~~~~vG~gT~~L~~l~~G 78 (248) T cd06219 2 KILEKEELAPNVKLFEIEAPLIAKKAKPGQFVIVRADEKGERIPLTIADWDPE---KGTITIVVQVVGKSTRELATLEEG 78 (248) T ss_pred EEEEEEEECCCEEEEEEECCCHHHHCCCCCEEEEEECCCCCEEEEEEEEECCC---CCEEEEEEEEECCHHHHHHCCCCC T ss_conf 68789992598899999870847508998179999799996442898887699---999999999639059899618899 Q ss_pred CCC-CCCCCCHHHHCC--CCCE--EEECCCCHHHHH----HHCCCCCEEEEEEECCCCCCCEECCCC-EEEEEEEEEEEE Q ss_conf 631-335432000004--7642--454010011011----100233137987513421121002357-199823457877 Q gi|254780347|r 84 TFI-NLERSVKLGDRL--DGHL--VSGHIDGTVEIL----FLDFIGDSMYCRLSLPHNLEQFIAVKG-SVCLNGVSLTVN 153 (204) Q Consensus 84 ~~V-NLE~a~~~g~~l--gGH~--v~GhVd~~~~i~----~i~~~~~~~~~~i~~~~~~~~~l~~Kg-sI~idGvSLTV~ 153 (204) |.+ ++.-|+=-+=.+ .+|. |.|=+ |.|.+. .+.+.++...+-+-....-.-+..+.- ..+ +-+.+|-. T Consensus 79 d~l~~v~GPlG~~f~~~~~~~~llVaGGi-G~APl~~la~~l~~~g~~v~~i~G~r~~~~~~~~~~~~~~~-~~~~~~td 156 (248) T cd06219 79 DKIHDVVGPLGKPSEIENYGTVVFVGGGV-GIAPIYPIAKALKEAGNRVITIIGARTKDLVILEDEFRAVS-DELIITTD 156 (248) T ss_pred CEEEEEECCCCCCCCCCCCCEEEEECCCE-ECCCHHHHHHHHHHCCCEEEEEECCCCHHHCCCHHHHHHHC-CEEEEECC T ss_conf 98837874688874507896199984735-22535999999998799589995799979851098987408-76999857 Q ss_pred E--ECCCEEEEEEEHHHHCCCCHHHCCCCCEEEEEHHHHHHHHHHHHCC Q ss_conf 4--1685499998444330133002489967469713588999998602 Q gi|254780347|r 154 L--VGKNFFDVLLIRHTIEETTWKMHKVGDLINIEVDSMMRYIARLYAL 200 (204) Q Consensus 154 ~--~~~~~f~v~lIp~Tl~~T~l~~~k~Gd~VNiE~D~l~kyv~r~~~~ 200 (204) + .....|-..+.++.+.. ..+..-..--=++.|.|.+.+++.. T Consensus 157 dgs~g~~G~vt~~l~~~l~~----~~~~d~i~~cGP~~Mm~~v~~~~~~ 201 (248) T cd06219 157 DGSYGEKGFVTDPLKELIES----GEKVDLVIAIGPPIMMKAVSELTRP 201 (248) T ss_pred CCCCCCCCCHHHHHHHHHHC----CCCCCEEEEECCHHHHHHHHHHHHH T ss_conf 88888514010689998745----6786489984999999999999986 Done!