Query gi|254781210|ref|YP_003065623.1| hypothetical protein CLIBASIA_05585 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 343 No_of_seqs 163 out of 290 Neff 7.0 Searched_HMMs 39220 Date Mon May 30 06:22:40 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781210.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PHA00201 major capsid protein 100.0 0 0 417.5 23.6 309 8-327 24-341 (343) 2 PHA02514 minor capsid protein; 100.0 0 0 411.3 24.0 317 8-336 25-351 (397) 3 PHA02004 capsid protein 99.9 8.4E-23 2.1E-27 147.8 20.5 294 11-327 22-326 (330) 4 pfam11651 P22_CoatProtein P22 99.7 8.9E-15 2.3E-19 102.4 17.9 210 1-232 1-221 (413) 5 pfam05065 Phage_capsid Phage c 99.3 5.8E-09 1.5E-13 69.6 20.4 259 13-325 7-277 (278) 6 PHA01511 coat protein 96.8 0.021 5.4E-07 32.5 9.5 120 96-231 93-219 (430) 7 pfam04454 Linocin_M18 Encapsul 95.6 0.14 3.6E-06 27.8 8.8 142 72-230 33-177 (218) 8 TIGR01343 hacA_fam homoaconita 62.3 4.8 0.00012 19.2 1.8 31 159-191 314-350 (432) 9 pfam09950 DUF2184 Uncharacteri 60.9 12 0.00029 17.1 11.7 220 55-324 4-244 (247) 10 TIGR02086 IPMI_arch 3-isopropy 54.8 9.4 0.00024 17.6 2.2 45 159-219 317-362 (431) 11 COG4397 Mu-like prophage major 51.0 8.9 0.00023 17.7 1.6 85 79-164 77-161 (308) 12 TIGR00977 LeuA_rel 2-isopropyl 47.6 19 0.00049 15.8 5.2 97 30-127 286-396 (543) 13 pfam12217 End_beta_propel Cata 41.0 21 0.00052 15.6 2.2 32 300-331 388-419 (449) 14 pfam06816 NOD NOTCH protein. N 36.5 29 0.00074 14.8 3.9 40 12-51 17-56 (56) 15 TIGR01522 ATPase-IIA2_Ca calci 29.6 8 0.0002 18.0 -1.4 64 7-82 94-161 (856) 16 TIGR01263 4HPPD 4-hydroxypheny 28.0 41 0.001 13.9 2.1 34 156-195 258-293 (379) 17 KOG1546 consensus 26.5 43 0.0011 13.8 3.5 55 30-92 128-183 (362) No 1 >PHA00201 major capsid protein Probab=100.00 E-value=0 Score=417.51 Aligned_cols=309 Identities=15% Similarity=0.115 Sum_probs=251.7 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCEECCCCCCCEEEEECCCCEEEEEECCCCCCCCC-CCCCCCEEEEEECCCC-CH Q ss_conf 279998867789999987112430100230312674068713462355332589874324-5687626999823443-11 Q gi|254781210|r 8 ATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIY-NATDQDRRWVGHSQFG-WA 85 (343) Q Consensus 8 ~~aFv~q~~~~v~~a~qqk~s~L~~~V~~~t~~~Gks~~f~~~G~~~A~~~t~r~~~t~~-~~~~~~~~~v~~~~~~-~a 85 (343) .+.|++.|+.||..+||+. ++++++|++|++.+|||++||++|+.+|.||+|+..+++. .+++|.+++++++|+. ++ T Consensus 24 ~AlfLK~fsgEvltaF~~~-~i~~~~~~~Rti~~GKSaqFp~tG~~~a~YhtpG~~i~~~~~~~~~~Ek~I~iDdlLis~ 102 (343) T PHA00201 24 LALFLKVFGGEVLTAFART-SVTMNRHMVRSIQSGKSAQFPVLGRTKAAYLAPGENLDDKRKDIKHTEKVITIDGLLTAD 102 (343) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHCCCEEEEECCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECHHHHHH T ss_conf 7789998408999999886-413044368743798245301430440362379887678988876541699964044467 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC-CCCCCEEEECCCCC------CCCCC Q ss_conf 221268888741006899999999998776889999986201455666665445-66664023023577------64222 Q gi|254781210|r 86 ERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEF-FSKENILSAVEGDD------FFKTF 158 (343) Q Consensus 86 ~~IddiDe~k~~~d~rs~y~~~~g~ALar~~D~~I~~a~~g~a~~~~~g~~~~~-~~~~~~~~~~~~~~------~~~lt 158 (343) .||+++||+++|||+|++|++++|+||++++|+.|++.+.+.++.....+.... .+.+..+......+ ..... T Consensus 103 ~fi~~~de~~~hydvR~~ys~~~G~aLa~~~D~~i~r~ia~~a~~a~~~~~~~~g~~~~~~i~~g~~~~~~~~~~~g~al 182 (343) T PHA00201 103 VLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAGLCNLPDASNENIAGLGKATVLEVGAKADLTDPVKLGKAI 182 (343) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEECCCCCCCCCHHHHHHH T ss_conf 88883999985015779999999999999999999999998751456666667788887389724655545402448999 Q ss_pred HHHHHHHHHHHHHCCCCCCCCEEEEECCHHHHHHHHCCCCEEEHHCCCCCCCCCCCEEEEEEEEEEEECCCCCCCCCCCC Q ss_conf 89999999986023889754607997699999875067101000001343113670122743589970777876666544 Q gi|254781210|r 159 IGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAG 238 (343) Q Consensus 159 ~akl~~a~~~ld~~~Vp~~g~~~~~v~~p~~~~~Ll~~t~~~s~Df~~~~al~~G~I~~~~G~~~i~s~~lp~~~~~~~~ 238 (343) ++.|++|+++|||++||.++ |+||++|++||.|++.....+.||.+.....+|.|.+++||++++++|||..+.+... T Consensus 183 v~aif~Aa~~lDEk~VP~~g--R~~Vl~P~qYY~Li~a~~~n~~n~~g~~~~~~G~i~~VaGi~iv~~~~l~~~~ag~~~ 260 (343) T PHA00201 183 IAQLTIARAKLTKNYVPAAD--RVFYTTPDNYSAILAALMPNAANYAALIDPETGSIRNVMGFEVVEVPHLTAGGAGDTR 260 (343) T ss_pred HHHHHHHHHHHHHCCCCCCC--CEEEECHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCCCCCCCCC T ss_conf 99999999986424899778--3899888999999975401345533444757771102301688743761346677677 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCEEEEECCCEEEEEECCCCCCHHEEEHEECCCCEEECC Q ss_conf 43333345655667776666666654556551379998454103445135315655416532033000100026356541 Q gi|254781210|r 239 TKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEP 318 (343) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~hksAvg~a~~~di~~ei~~~~~k~~a~~i~~~~s~Ga~~l~~ 318 (343) .. ...+.++..+....+++.+++++.++++|||+|+|+++++|+++|.+|+. +||+++|+++|+|||+.||| T Consensus 261 ~~-------a~tg~~~~~~~~~~gd~~~~~snt~GLifhk~AvGtVKLmDl~vEsd~~v-~yQGdLmVaKyAMGhG~LRP 332 (343) T PHA00201 261 ED-------APTGQKHAFPATASGDVKVALDNVVGLFQHRSAVGTVKLKDLALERARRA-NFQADQIIAKYAMGHGGLRP 332 (343) T ss_pred CC-------CCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCEEEEEEECCCCEEEEEE-EEEHHHHHHHHHCCCCCCCH T ss_conf 67-------77787776665557753244452479997311258998504310012565-64248999998725787897 Q ss_pred CCEEEEEEC Q ss_conf 525899954 Q gi|254781210|r 319 DKILGIEIS 327 (343) Q Consensus 319 ~~vv~I~~~ 327 (343) |++++|... T Consensus 333 eaA~alv~~ 341 (343) T PHA00201 333 EAAGALVFK 341 (343) T ss_pred HHHHHEECC T ss_conf 774320045 No 2 >PHA02514 minor capsid protein; Reviewed Probab=100.00 E-value=0 Score=411.29 Aligned_cols=317 Identities=14% Similarity=0.122 Sum_probs=257.5 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCEECCCCCCCEEEEECCCCEEEEEECCCCCCCCC-CCCCCCEEEEEECCCCCH- Q ss_conf 279998867789999987112430100230312674068713462355332589874324-568762699982344311- Q gi|254781210|r 8 ATANIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIY-NATDQDRRWVGHSQFGWA- 85 (343) Q Consensus 8 ~~aFv~q~~~~v~~a~qqk~s~L~~~V~~~t~~~Gks~~f~~~G~~~A~~~t~r~~~t~~-~~~~~~~~~v~~~~~~~a- 85 (343) -+-|++.|+.||..+||+. ++++++|++|++.+|||++||++|+.+|.||+|+++++.. .+++|.+++++++|+..+ T Consensus 25 ~ALfLK~FsGEvltaF~~~-~i~~~~~~~Rti~~GKSaqFp~tGr~~A~YHtpG~~i~~~~~~i~~~Ek~I~iDdlLis~ 103 (397) T PHA02514 25 LALFLKVFGGEVLTAFART-SVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPGENLDDKRKDIKHTEKVITIDGLLTAD 103 (397) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHCCEEEEECCCCCEEECEEEEEEEEEECCCCCCCCCCCCCCCCCCEEEEECCEEEHH T ss_conf 5568776018889999887-533023157752688334210530331020479875668888876550599974222136 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC-CCCCCCCEEEECCCCCC-------CCC Q ss_conf 2212688887410068999999999987768899999862014556666654-45666640230235776-------422 Q gi|254781210|r 86 ERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAET-EFFSKENILSAVEGDDF-------FKT 157 (343) Q Consensus 86 ~~IddiDe~k~~~d~rs~y~~~~g~ALar~~D~~I~~a~~g~a~~~~~g~~~-~~~~~~~~~~~~~~~~~-------~~l 157 (343) .||++|||+++|||+|++|++++|+||++++|+.|++.+.+.++.....+.. ..+.....+....+.++ ... T Consensus 104 ~fi~~ide~~~hydvRs~ys~e~G~aLA~~~Dr~Ifr~ia~~a~~~~~~~~~i~g~g~~~~~~~~~~~~~~~d~~~~g~a 183 (397) T PHA02514 104 VLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNENIEGLGTATVIETTQPKTTLTDQVALGKE 183 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEECCCCCCCCCHHHHHHH T ss_conf 77860899985456889999999999999988999999998742566555443688864267403566543456677999 Q ss_pred CHHHHHHHHHHHHHCCCCCCCCEEEEECCHHHHHHHHCCCCEEEHHCCCCCCCCCCCEEEEEEEEEEEECCCCCCCCCCC Q ss_conf 28999999998602388975460799769999987506710100000134311367012274358997077787666654 Q gi|254781210|r 158 FIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPA 237 (343) Q Consensus 158 t~akl~~a~~~ld~~~Vp~~g~~~~~v~~p~~~~~Ll~~t~~~s~Df~~~~al~~G~I~~~~G~~~i~s~~lp~~~~~~~ 237 (343) .++.|++|++.|||++||.++ |+||++|.+||.|++.......+|.+.....+|.|.+++||++++++|+|..+.+.. T Consensus 184 LI~AfydAAa~LDEk~VP~~g--RvavL~PrQYYaLi~av~~N~~n~~g~~~~~~G~I~~I~G~~i~~ip~l~~~gag~~ 261 (397) T PHA02514 184 IIAALTKARAALTKNYVPAAD--RVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTT 261 (397) T ss_pred HHHHHHHHHHHHHHCCCCCCC--CEEEECHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEEEEEEEEECCCCCCCCCCCC T ss_conf 999999999984215899788--189978899999998650344230013577787030565699852575045677766 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCEEEEECCCEEEEEECCCCCCHHEEEHEECCCCEEEC Q ss_conf 44333334565566777666666665455655137999845410344513531565541653203300010002635654 Q gi|254781210|r 238 GTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIE 317 (343) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~hksAvg~a~~~di~~ei~~~~~k~~a~~i~~~~s~Ga~~l~ 317 (343) .... .+.....+......+..+++|.+||||||+|+|+++++|+++|++|+.. +|++||+++++|||+.|| T Consensus 262 ~~~~--------tg~~~~~~~t~~~~~~~a~~N~~GLIfhksAaGtVKlidpqVE~t~~V~-yQGdLIVaKyAMGhG~LR 332 (397) T PHA02514 262 REGT--------TGQKHAFPATKSGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN-FQADQIIAKYAMGHGGLR 332 (397) T ss_pred CCCC--------CCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCEEEEECCCEEEEEEEEE-EHHHHHHHHHHCCCCCCC T ss_conf 6666--------6777778887676432112434689971020068985043024554333-104689999863578789 Q ss_pred CCCEEEEEECHHHHCCCEE Q ss_conf 1525899954133225300 Q gi|254781210|r 318 PDKILGIEISKDSLKGVPV 336 (343) Q Consensus 318 ~~~vv~I~~~e~~~~~~~~ 336 (343) ||.++++.+..+.|.|+.- T Consensus 333 PeAA~alv~~~~~~~~~~~ 351 (397) T PHA02514 333 PEAAGAVVFQKGVMLGVTS 351 (397) T ss_pred HHHHHHEEHHCCCEECCCC T ss_conf 6774221101450203333 No 3 >PHA02004 capsid protein Probab=99.92 E-value=8.4e-23 Score=147.76 Aligned_cols=294 Identities=12% Similarity=0.106 Sum_probs=224.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCEECCCCCCCEEEEECCCCEEEEEECCCCCCCCCCCCCCCEEEEEECCCCCHH-HHH Q ss_conf 9988677899999871124301002303126740687134623553325898743245687626999823443112-212 Q gi|254781210|r 11 NIYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAE-RID 89 (343) Q Consensus 11 Fv~q~~~~v~~a~qqk~s~L~~~V~~~t~~~Gks~~f~~~G~~~A~~~t~r~~~t~~~~~~~~~~~v~~~~~~~a~-~Id 89 (343) -|++|..+|..-++.+ |+++++...|+.+|-.++....+|..+...++||.+-.+.. +..++...+.+...+++ .|| T Consensus 22 HiEey~g~Vd~~~~~~-s~f~~~~~vR~v~GTNtvr~drlG~sel~gr~~Ges~d~T~-~~~DK~~L~VDT~lyaR~~vd 99 (330) T PHA02004 22 HLEEHLGIVDKHFAYT-SKFAPLMNIRDLRGSNVVRLDRLGNVEAKGRRAGEELERSR-VVNDKWNLTVDTLLYLRHQFD 99 (330) T ss_pred EHHHHCCHHHHHHHHH-HHHHCCCEEEEECCCCEEEECCCCCEEECCCCCCCCCCCCC-CCCCCEEEEEEEEEEECCCCC T ss_conf 2788547055432377-77622440332036515661000637760315888877774-433616899864665215443 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHCCCCCCCCCCCCCCCCCCCEEE----ECCCCCCCCCCHHHHHH Q ss_conf 68888741006899999999998776889999-98620145566666544566664023----02357764222899999 Q gi|254781210|r 90 PFATLDSGINPLLPYASLATAAMHRKQDEAIL-KGMLGVNKKGKIGAETEFFSKENILS----AVEGDDFFKTFIGQLIT 164 (343) Q Consensus 90 diDe~k~~~d~rs~y~~~~g~ALar~~D~~I~-~a~~g~a~~~~~g~~~~~~~~~~~~~----~~~~~~~~~lt~akl~~ 164 (343) -+|+.+..+|.|.+.+++.|-+|+|..|+..+ +++.++.-....+ -..++..+.... ......++.-....+++ T Consensus 100 ~~dDwq~~~D~r~ela~n~g~~lak~~DQa~liQlik~~~~~apa~-l~~~f~~G~~~~~~~~g~~~e~~a~~l~~a~~~ 178 (330) T PHA02004 100 HQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVD-LEDAFSPGVLEKLDLTGLTAKQAADKIVRMHRR 178 (330) T ss_pred HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC-CCCCCCCCEEEEEEEECCCCCCCHHHHHHHHHH T ss_conf 0320458603778987887999999888999999997514468011-677777871567887312355330689999999 Q ss_pred HHHHHHHCCCCC--CCCEEEEECCHHHHHHHHCCCCEEEHHCCCC---CCCCCCCEEEEEEEEEEEECCCCCCCCCCCCC Q ss_conf 999860238897--5460799769999987506710100000134---31136701227435899707778766665444 Q gi|254781210|r 165 AKSIFRKRYIDV--DSEQVYVLIPSDVWASLFALERATSKDYINT---AALQAGKIEAFAGVWFINMEKVPGNDLFPAGT 239 (343) Q Consensus 165 a~~~ld~~~Vp~--~g~~~~~v~~p~~~~~Ll~~t~~~s~Df~~~---~al~~G~I~~~~G~~~i~s~~lp~~~~~~~~~ 239 (343) +.+.|-+.+||+ .. +-+.+++|..|+.||.-.++++.+|+.. ..+..+.+....|+++|+||++|..+.. ++ T Consensus 179 ~ve~~~~rdv~d~v~~-e~v~l~~P~~Fs~LlehdrLvd~ey~~s~~~~~~~~~rv~~~ngvpVvesnrfPt~ait--~H 255 (330) T PHA02004 179 VVETFIDRDLGDAVYS-EGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSRVAILNGVKVLETPRFATKAIA--AH 255 (330) T ss_pred HHHHHHHCCCCCCCCC-CCEEEECHHHHHHHHHCCHHCCEEEEECCCCCCCCCEEEEEECCEEEEECCCCCCCCCC--CC T ss_conf 9999986014653332-55187378999999844301144545236765312328999827068846888865413--56 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCEEEEECCCEEEEEECCCCCCHHEEEHEECCCCEEECCC Q ss_conf 33333456556677766666666545565513799984541034451353156554165320330001000263565415 Q gi|254781210|r 240 KFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPGKWHAPQITLTSSFGATRIEPD 319 (343) Q Consensus 240 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~hksAvg~a~~~di~~ei~~~~~k~~a~~i~~~~s~Ga~~l~~~ 319 (343) .+ .+..+.+.+.+..+...+|..+++..+.-..++.++.|++. ...|.|-.-+++|.+-=||+ T Consensus 256 ~L----------------g~~~Nvt~~e~~~~~a~f~~s~aLl~~e~~pvt~diw~dkk-e~~~~iDtf~s~~v~~rrpd 318 (330) T PHA02004 256 PL----------------GRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNE-KFSWVLDTFQMYNIGARRPD 318 (330) T ss_pred CC----------------CCCCCCCHHHHCCEEEEEECCCEEEEEEEEECCCCCCCCCC-HHEEEEEHHHHCCCCCCCCC T ss_conf 33----------------45568655664032789855741578875523410012431-10356432655387778856 Q ss_pred CEEEEEEC Q ss_conf 25899954 Q gi|254781210|r 320 KILGIEIS 327 (343) Q Consensus 320 ~vv~I~~~ 327 (343) .+--++.+ T Consensus 319 ~a~~~~~~ 326 (330) T PHA02004 319 TAGAIELK 326 (330) T ss_pred CEEEEEEE T ss_conf 21047861 No 4 >pfam11651 P22_CoatProtein P22 coat protein - gene protein 5. This family of proteins represents gene product 5 from bacteriophage P22. This protein is involved in the formation of the pro-capsid shells in the bacteriophage. In total, there are 415 molecules of the coat protein which are arranged in an icosahedral shell. Probab=99.68 E-value=8.9e-15 Score=102.42 Aligned_cols=210 Identities=16% Similarity=0.031 Sum_probs=150.9 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCEE-CC-----CCCCCEEEEECCCCEEEEEECCC-CCCCCCCCCCCC Q ss_conf 987653227999886778999998711243010023-03-----12674068713462355332589-874324568762 Q gi|254781210|r 1 MATKEQLATANIYEFKKHVELALQETKSKLRPTVTE-QA-----TEGEASALVEVFKPTEAHEIVGD-MPDTIYNATDQD 73 (343) Q Consensus 1 ma~~~~i~~aFv~q~~~~v~~a~qqk~s~L~~~V~~-~t-----~~~Gks~~f~~~G~~~A~~~t~r-~~~t~~~~~~~~ 73 (343) ||++ +.+ ++.+..|....++.. .++..+|.. +. ...|.+..|.++..-.+..-... ...+++..++.. T Consensus 1 MaN~--~lt--~~iia~eal~~l~~~-lv~~~~V~r~y~~~f~~~k~Gdtv~i~~P~~~~~~~~~~~d~s~~~qd~~E~~ 75 (413) T pfam11651 1 MANN--LLT--IQVIAREALEGFDNN-LVMAKNVHRQYSPEFNGAQIGDTVTIRRPPDFEVRSGAGADLSGNAQGLTEGK 75 (413) T ss_pred CCCC--CCC--HHHHHHHHHHHHHHH-HHHHHHHCCCCCCCCCCCCCCCEEEEECCCCEECCCCCCCCCCCCCCCCCCCE T ss_conf 9731--032--889999999988874-25665414244643244465872798468730302345533235647621131 Q ss_pred EEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HCCCCCC-CCCCCCCCCCCCCEEEECCC Q ss_conf 699982344311221268888741006899999999998776889999986-2014556-66665445666640230235 Q gi|254781210|r 74 RRWVGHSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGM-LGVNKKG-KIGAETEFFSKENILSAVEG 151 (343) Q Consensus 74 ~~~v~~~~~~~a~~IddiDe~k~~~d~rs~y~~~~g~ALar~~D~~I~~a~-~g~a~~~-~~g~~~~~~~~~~~~~~~~~ 151 (343) ........+..+.-+...++...+-|.-..|.+-+..+|+.++|..++.-+ ...++.- ..++ . T Consensus 76 v~~~v~~~~~V~~e~ts~E~tL~l~dfs~ril~PAm~~LA~~Idsdl~~~~~~~~~n~vg~~~~---------------t 140 (413) T pfam11651 76 VPVVVDKQKNVPFTLTAKEKTLRLEDFSERILRPAMQRLANKVDADLLNSMALMGSNVVGKTDA---------------T 140 (413) T ss_pred EEEEECCCCCCCEEEECHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCCC---------------C T ss_conf 4478524125307971156534547899898899999999999999999998646626602577---------------7 Q ss_pred CCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEECCHHHHHHHHCCCCEEEHHCC-CCCCCCCCCE-EEEEEEEEEEECCC Q ss_conf 776422289999999986023889754607997699999875067101000001-3431136701-22743589970777 Q gi|254781210|r 152 DDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYI-NTAALQAGKI-EAFAGVWFINMEKV 229 (343) Q Consensus 152 ~~~~~lt~akl~~a~~~ld~~~Vp~~g~~~~~v~~p~~~~~Ll~~t~~~s~Df~-~~~al~~G~I-~~~~G~~~i~s~~l 229 (343) .+........+..+.++|++..||.+ +|+++++|+.+..|.+.......+.. ..+++++|.| ++++||+|++++++ T Consensus 141 ~~t~~~~~~dva~a~~~L~~~~vP~~--~R~~~~~p~~~~~la~~~~gl~~~~~~~~~Ayr~g~i~g~~aGfd~y~~~~~ 218 (413) T pfam11651 141 PGTTGTDWNDVADAASRLDDIGVPSD--GRRAFLSPDAYAKLAGSLAGLLIAGGEIRTAYRKGQIQGRVAGFDFYMSQNV 218 (413) T ss_pred CCCCCCCHHHHHHHHHHHHHCCCCCC--CCEEEECHHHHHHHHHHHHCCCCCCCCHHHHHHHCCCCCCCCCHHHHHHCCC T ss_conf 88575437789999988866379866--8459868899999877652644445535899973733432232456753788 Q ss_pred CCC Q ss_conf 876 Q gi|254781210|r 230 PGN 232 (343) Q Consensus 230 p~~ 232 (343) |.- T Consensus 219 ~~~ 221 (413) T pfam11651 219 LRH 221 (413) T ss_pred CCC T ss_conf 650 No 5 >pfam05065 Phage_capsid Phage capsid family. Family of bacteriophage hypothetical proteins and capsid proteins. Probab=99.26 E-value=5.8e-09 Score=69.55 Aligned_cols=259 Identities=18% Similarity=0.130 Sum_probs=146.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCEECCCCCCCE--EEEECCCC-EEEEEECCCCCCCCCCCCCCCEEEEEECCCCCHHHHH Q ss_conf 88677899999871124301002303126740--68713462-3553325898743245687626999823443112212 Q gi|254781210|r 13 YEFKKHVELALQETKSKLRPTVTEQATEGEAS--ALVEVFKP-TEAHEIVGDMPDTIYNATDQDRRWVGHSQFGWAERID 89 (343) Q Consensus 13 ~q~~~~v~~a~qqk~s~L~~~V~~~t~~~Gks--~~f~~~G~-~~A~~~t~r~~~t~~~~~~~~~~~v~~~~~~~a~~Id 89 (343) +++..++....++ .+.|++++++.++..+.+ ..+|+... ..+....++ +..+..+.+ +...+.+.-+..+.++. T Consensus 7 ~~~~~~ii~~~~~-~~~l~~~~~~~~~~~~~~~~~~i~~~~~~~~a~~v~E~-~~~~~~~~~-~f~~i~l~~~k~~~~~~ 83 (278) T pfam05065 7 EEISTEIIELLRE-KSPLRQLATVVPVTTGESGSLTIPKRTGGATAGWVGEG-AAAPETDTP-TFGEVTLPVKKLAALVP 83 (278) T ss_pred HHHHHHHHHHHHH-HHHHHHHCEEEECCCCCCCCEEEEEECCCCCEEEECCC-CCCCCCCCC-CEEEEEEEEEEEEEEEE T ss_conf 9999999999986-53586624799815898786688877376416885468-867766777-33588871089999961 Q ss_pred HHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC--C-EEEECCCCCCCCCCHHHHHH Q ss_conf 688--887410068999999999987768899999862014556666654456666--4-02302357764222899999 Q gi|254781210|r 90 PFA--TLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKE--N-ILSAVEGDDFFKTFIGQLIT 164 (343) Q Consensus 90 diD--e~k~~~d~rs~y~~~~g~ALar~~D~~I~~a~~g~a~~~~~g~~~~~~~~~--~-~~~~~~~~~~~~lt~akl~~ 164 (343) -=. ...+..|..+...+.++++++++.|..++.. .|.+. ...| .+... . .............+.+.|.. T Consensus 84 iS~ell~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G-~G~~~-~p~G----il~~~~~~~~~~~~~~~~~~~~~~~~i~~ 157 (278) T pfam05065 84 ISNELLDDSAIDLEAYVADELAEAFARKEDAAFLNG-DGTGT-KPKG----ILTSTGATAVGAAAGVKTAAGNTADDLID 157 (278) T ss_pred ECHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHEEC-CCCCC-CCCC----EECCCCCCEEECCCCCCCCCCCCHHHHHH T ss_conf 139987414677999999999999999997785007-89777-8665----11346531231144435545425999999 Q ss_pred HHH-HHHHCCCCCCCCEEEEECCHHHHHHHHCCCCEEEHHCCCCCCCCCCCEEEEEEEEEEEECCCCCCCCCCCCCCCCC Q ss_conf 999-8602388975460799769999987506710100000134311367012274358997077787666654443333 Q gi|254781210|r 165 AKS-IFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEKVPGNDLFPAGTKFPG 243 (343) Q Consensus 165 a~~-~ld~~~Vp~~g~~~~~v~~p~~~~~Ll~~t~~~s~Df~~~~al~~G~I~~~~G~~~i~s~~lp~~~~~~~~~~~~~ 243 (343) +.. .++.++-+. ...+.+|..|..|..... .+..|.-...+ .+.-++++|++++.++++|... T Consensus 158 ~~~~~l~~~~~~~----~~~v~n~~~~~~l~~lkd-~~G~~l~~~~~-~~~~~~l~G~pv~~~~~~~~~~---------- 221 (278) T pfam05065 158 LIYASLDAAYRAN----AVWVMNPSTLAALRKLKD-ANGRYLWQPSL-TAGTPTLLGRPVVVTEDMPDGP---------- 221 (278) T ss_pred HHHHHCCCCCCCC----CEEEECHHHHHHHHHHCC-CCCCEEECCCC-CCCCCCEEEEEEEEECCCCCCC---------- T ss_conf 9860224013537----699988999999985225-69977843566-6788707101578748998671---------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCEEEEECCCEEEEEECCC---CCCHHEEEHEECCCCEEECCCC Q ss_conf 3456556677766666666545565513799984541034451353156554165---3203300010002635654152 Q gi|254781210|r 244 LIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYCKSAVVFTQRKAIDVQHSKDPG---KWHAPQITLTSSFGATRIEPDK 320 (343) Q Consensus 244 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~hksAvg~a~~~di~~ei~~~~~---k~~a~~i~~~~s~Ga~~l~~~~ 320 (343) .++-.-+.+.++...++.+++..++. ....+.+.+.+-+|...++|++ T Consensus 222 -----------------------------~~~Gdfs~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~A 272 (278) T pfam05065 222 -----------------------------ILFGDFSAYIIVDREGVTVERLRDPYTAFEKNQVGFRATERVDGAVVDPEA 272 (278) T ss_pred -----------------------------EEEEEHHHCEEEEECCCEEEEECCCHHHCCCCEEEEEEEEEECCCCCCHHH T ss_conf -----------------------------999812888999947848999515512214884999999998883837637 Q ss_pred EEEEE Q ss_conf 58999 Q gi|254781210|r 321 ILGIE 325 (343) Q Consensus 321 vv~I~ 325 (343) ++.++ T Consensus 273 ~~~l~ 277 (278) T pfam05065 273 FKKLK 277 (278) T ss_pred EEEEE T ss_conf 45864 No 6 >PHA01511 coat protein Probab=96.76 E-value=0.021 Score=32.50 Aligned_cols=120 Identities=13% Similarity=0.068 Sum_probs=78.4 Q ss_pred HHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHCC Q ss_conf 410068--999999999987768899999862014556666654456666402302357764222899999999860238 Q gi|254781210|r 96 SGINPL--LPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTFIGQLITAKSIFRKRY 173 (343) Q Consensus 96 ~~~d~r--s~y~~~~g~ALar~~D~~I~~a~~g~a~~~~~g~~~~~~~~~~~~~~~~~~~~~~lt~akl~~a~~~ld~~~ 173 (343) .+-|++ ..|-+.++..|+..+|..|+....--+.-... .....+ +.+.-..+.+-.+-..|-... T Consensus 93 eLRd~~~~~~~gqAA~qkLAs~V~sal~~~aal~gs~v~~------------~~~~~g-t~~ad~wd~vAd~d~~m~~~~ 159 (430) T PHA01511 93 DLRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVIT------------SPDAIG-TNTADAWNFVADAEELMFSRE 159 (430) T ss_pred HHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEE------------ECCCCC-CCCCCHHHHHHHHHHHHHHCC T ss_conf 4218687777889999888765658999999865205785------------023357-887430456777777776414 Q ss_pred CCCCCCEEEEECCHHHHHHHHCCCCEEEHHCCCC---CCCCCCCEE-EEEEE-EEEEECCCCC Q ss_conf 8975460799769999987506710100000134---311367012-27435-8997077787 Q gi|254781210|r 174 IDVDSEQVYVLIPSDVWASLFALERATSKDYINT---AALQAGKIE-AFAGV-WFINMEKVPG 231 (343) Q Consensus 174 Vp~~g~~~~~v~~p~~~~~Ll~~t~~~s~Df~~~---~al~~G~I~-~~~G~-~~i~s~~lp~ 231 (343) +|.++.|+++ ..|..|+++-. ++.-+++.+. .+++.+.|. .++|| +++.|.++|. T Consensus 160 l~~~~gr~~F-~np~dy~~~a~--nL~gR~~~~~~~~~AYe~a~I~~~VAgFd~~~~s~k~p~ 219 (430) T PHA01511 160 LNRDMGTSYF-FNPQDYKKAGY--DLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPV 219 (430) T ss_pred CCCCCCCEEE-ECHHHHHHHHH--HCCCCHHCCCCCHHHHHHCCCCCCCCCHHHHHCCCCCEE T ss_conf 7888660234-17577778765--404740105777888752332100035788750776403 No 7 >pfam04454 Linocin_M18 Encapsulating protein for peroxidase. The Linocin_M18 is found in eubacteria and archaea. These proteins, referred to as encapsulins, form nanocompartments within the bacterium which contain ferritin-like proteins or peroxidases, enzymes involved in oxidative-stress response. These enzymes are targeted to the interior of encapsulins via unique C-terminal extensions. Probab=95.62 E-value=0.14 Score=27.81 Aligned_cols=142 Identities=14% Similarity=0.128 Sum_probs=77.9 Q ss_pred CCEEEEEECCCCCHHHH--HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 62699982344311221--2688887-41006899999999998776889999986201455666665445666640230 Q gi|254781210|r 72 QDRRWVGHSQFGWAERI--DPFATLD-SGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSA 148 (343) Q Consensus 72 ~~~~~v~~~~~~~a~~I--ddiDe~k-~~~d~rs~y~~~~g~ALar~~D~~I~~a~~g~a~~~~~g~~~~~~~~~~~~~~ 148 (343) +.|+..-+-.....+.+ -++|..+ ...|+--.-...++..+|++.|+.|+.....+.-.+ --.. .....+.. T Consensus 33 ~~R~~~Pl~el~~~F~l~~~dld~~~rg~~~~D~s~~~~AA~~~A~~ED~~If~G~~~~gi~G---L~~~--~~~~~~~l 107 (218) T pfam04454 33 GLRQVLPLPELRVPFTLSRRDLDDVERGAKDLDLSPAAEAARKLALAEDRLIFNGYAEAGIKG---LLNA--EGNKKIKL 107 (218) T ss_pred EEEEEEEEEEEEEEEEEEHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCC---CCCC--CCCCCEEC T ss_conf 116677757999720652999899982699998409999999999988989865860208640---0047--99871244 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEECCHHHHHHHHCCCCEEEHHCCCCCCCCCCCEEEEEEEEEEEECC Q ss_conf 23577642228999999998602388975460799769999987506710100000134311367012274358997077 Q gi|254781210|r 149 VEGDDFFKTFIGQLITAKSIFRKRYIDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFAGVWFINMEK 228 (343) Q Consensus 149 ~~~~~~~~lt~akl~~a~~~ld~~~Vp~~g~~~~~v~~p~~~~~Ll~~t~~~s~Df~~~~al~~G~I~~~~G~~~i~s~~ 228 (343) . -...+-....+.+|..+|+++.++ | ++.++++|+.|..|.....=+ +-+.--.|.++.+-++|.++. T Consensus 108 -s-w~~~~~~~~~v~~A~~~L~~aG~~--G-PyaLvlsp~~y~~L~r~~~~t-------G~~~~~~i~~lv~g~Ii~sP~ 175 (218) T pfam04454 108 -S-DDDPGDAPDDVAEALSKLREAGVE--G-PYALVLSPDLYTKLFRVYDHT-------GYPEIEHIKELVDGGIIWAPA 175 (218) T ss_pred -C-CCCHHHHHHHHHHHHHHHHHCCCC--C-CEEEEECHHHHHHHHCCCCCC-------CCCHHHHHHHHHCCCEEECCC T ss_conf -8-554556999999999999972789--9-859998999999986315889-------962999999986798498666 Q ss_pred CC Q ss_conf 78 Q gi|254781210|r 229 VP 230 (343) Q Consensus 229 lp 230 (343) +. T Consensus 176 i~ 177 (218) T pfam04454 176 ID 177 (218) T ss_pred CC T ss_conf 68 No 8 >TIGR01343 hacA_fam homoaconitate hydratase family protein; InterPro: IPR006251 3-isopropylmalate dehydratase (or isopropylmalate isomerase; 4.2.1.33 from EC) catalyses the stereo-specific isomerisation of 2-isopropylmalate and 3-isopropylmalate, via the formation of 2-isopropylmaleate. This enzyme performs the second step in the biosynthesis of leucine, and is present in most prokaryotes and many fungal species. The prokaryotic enzyme is a heterodimer composed of a large (LeuC) and small (LeuD) subunit, while the fungal form is a monomeric enzyme. Both forms of isopropylmalate are related and are part of the larger aconitase family . Aconitases are mostly monomeric proteins which share four domains in common and contain a single, labile [4Fe-4S] cluster. Three structural domains (1, 2 and 3) are tightly packed around the iron-sulphur cluster, while a fourth domain (4) forms a deep active-site cleft. The prokaryotic enzyme is encoded by two adjacent genes, leuC and leuD, corresponding to aconitase domains 1-3 and 4 respectively , . LeuC does not bind an iron-sulphur cluster. It is thought that some prokaryotic isopropylamalate dehydrogenases can also function as homoaconitase 4.2.1.36 from EC, converting cis-homoaconitate to homoisocitric acid in lysine biosynthesis . Homoaconitase has been identified in higher fungi (mitochondria) and several archaea and one thermophilic species of bacteria, Thermus thermophilus . This entry represents the large subunit of 3-isopropylmalate dehydratase (LeuC), as well as homoaconitase, certain aconitases and uncharacterised proteins, and is a subgroup of those represented in IPR012095. Homoaconitase, aconitase and 3-isopropylmalate dehydratase have similar overall structures and domain organisation . All are dehydratases that bind a [4Fe-4S]-cluster. The majority of the members of this family have been designated as 3-isopropylmalate dehydratase large subunit (LeuC) in microbial genome annotation, but the only characterised member is T. thermophilus homoaconitase, an enzyme of a non-aspartate pathway of lysine biosynthesis. Information about related proteins can be found at Protein of the Month: Aconitase .; GO: 0016836 hydro-lyase activity, 0051539 4 iron 4 sulfur cluster binding, 0008652 amino acid biosynthetic process. Probab=62.30 E-value=4.8 Score=19.18 Aligned_cols=31 Identities=23% Similarity=0.350 Sum_probs=24.8 Q ss_pred HHHHHHHHHHHH-----HCC-CCCCCCEEEEECCHHHHH Q ss_conf 899999999860-----238-897546079976999998 Q gi|254781210|r 159 IGQLITAKSIFR-----KRY-IDVDSEQVYVLIPSDVWA 191 (343) Q Consensus 159 ~akl~~a~~~ld-----~~~-Vp~~g~~~~~v~~p~~~~ 191 (343) +.-|+.|++.|. -.. |.++- |++|+|++..- T Consensus 314 l~DLr~AAkiLkaalkqGr~GV~~~V--rliViPAS~~v 350 (432) T TIGR01343 314 LEDLRVAAKILKAALKQGRKGVAPDV--RLIVIPASKAV 350 (432) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCC--EEEEECCCHHH T ss_conf 67899999999999753675527884--39998464689 No 9 >pfam09950 DUF2184 Uncharacterized protein conserved in bacteria (DUF2184). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=60.91 E-value=12 Score=17.05 Aligned_cols=220 Identities=15% Similarity=0.114 Sum_probs=102.4 Q ss_pred EEEECCCCCCCCCCCCCCCEEEEEECCCCCHH--HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 53325898743245687626999823443112--2126888874-10068999999999987768899999862014556 Q gi|254781210|r 55 AHEIVGDMPDTIYNATDQDRRWVGHSQFGWAE--RIDPFATLDS-GINPLLPYASLATAAMHRKQDEAILKGMLGVNKKG 131 (343) Q Consensus 55 A~~~t~r~~~t~~~~~~~~~~~v~~~~~~~a~--~IddiDe~k~-~~d~rs~y~~~~g~ALar~~D~~I~~a~~g~a~~~ 131 (343) |.-...+..|.|..+..-++....+..|..+. -|.++...|. ..++-...+..+-.|...++|+.++ .|.+..+ T Consensus 4 a~~i~~~a~dlP~vdv~~~~~~~~v~~~g~~~~yti~El~~Aq~~G~~L~~~ka~aar~a~~~~~~~i~~---~Gd~~~g 80 (247) T pfam09950 4 AKWIAGYADDLPLVDVDITKTTSPVFLWGIGYGYTIQELEAAQALGRPLDTQKAEAARLAAEEKIDEIVY---VGDAAKG 80 (247) T ss_pred CEEECCCCCCCCCCCCCCEEEEEEEEEEEEEEEEEHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHEEE---ECCCCCC T ss_conf 1142477565653012440556767887637666799999999829994589999999999975423289---5672007 Q ss_pred CCCCCCCCCCCCCEEEECC------CCCCCCCCHHHHHHHHHHHHHC----CCCCCCCEEEEECCHHHHHHHHCCCCEEE Q ss_conf 6666544566664023023------5776422289999999986023----88975460799769999987506710100 Q gi|254781210|r 132 KIGAETEFFSKENILSAVE------GDDFFKTFIGQLITAKSIFRKR----YIDVDSEQVYVLIPSDVWASLFALERATS 201 (343) Q Consensus 132 ~~g~~~~~~~~~~~~~~~~------~~~~~~lt~akl~~a~~~ld~~----~Vp~~g~~~~~v~~p~~~~~Ll~~t~~~s 201 (343) -.|.- ..+.+.... ......-.++.|.++....-.. ..| + -+.+||.+|. +|+.+.+.+ T Consensus 81 ~~GLl-----N~p~V~~~~~~~~~Wa~~T~deil~dln~~~~~i~~~s~~~~~p----~-tlllp~~~~~-~l~~~~~s~ 149 (247) T pfam09950 81 ITGLL-----NHPGVTVVNAVATGWATKTPDEILADVNEALTKIWTATGGVEQP----D-TLLLPPSQFL-YLSSTVVSP 149 (247) T ss_pred CEEEE-----CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCC----C-EEEECHHHHH-HHHCEEECC T ss_conf 66553-----58997212256776244899999999999999998860983679----8-6774999999-873540068 Q ss_pred HHCCC-CCCCCCCCEE---EEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEE Q ss_conf 00013-4311367012---2743589970777876666544433333456556677766666666545565513799984 Q gi|254781210|r 202 KDYIN-TAALQAGKIE---AFAGVWFINMEKVPGNDLFPAGTKFPGLIDGKVEYPNGKPTVKSSAKFEDTKIKYVLPIYC 277 (343) Q Consensus 202 ~Df~~-~~al~~G~I~---~~~G~~~i~s~~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h 277 (343) ..-.+ .+-|.+..+- +--+.+|...+.|... +.+..-+-++|. T Consensus 150 ~~~~Tvl~~~~~nn~~~~~~g~~l~I~~v~~L~~a---------------------------------g~~g~dr~v~Y~ 196 (247) T pfam09950 150 ATNITILEFLKENNIYTAQTGRPLTIRGVKELEGA---------------------------------GTGGTDRMVAYT 196 (247) T ss_pred CCCCCHHHHHHHCCCCCCCCCCCEEEEECHHHHHC---------------------------------CCCCCCEEEEEE T ss_conf 89968989987577000057898169971665525---------------------------------888973799997 Q ss_pred CCCCEEEEEC--CCE-EEEEECCCCCCHHEEEHEECCCCEEE-CCCCEEEE Q ss_conf 5410344513--531-56554165320330001000263565-41525899 Q gi|254781210|r 278 KSAVVFTQRK--AID-VQHSKDPGKWHAPQITLTSSFGATRI-EPDKILGI 324 (343) Q Consensus 278 ksAvg~a~~~--di~-~ei~~~~~k~~a~~i~~~~s~Ga~~l-~~~~vv~I 324 (343) ||.=-+-... +++ ...+++ ...+.|-+.+-.|.+.+ +|+.++.+ T Consensus 197 ~d~~~v~~~vP~p~~~lp~q~~---~l~~~vp~~~r~gGv~~~~P~a~~y~ 244 (247) T pfam09950 197 KDPDYVKFPLPMPLTFLPAQPR---GLRFKVPGISRLGGVEVRYPDAIVYV 244 (247) T ss_pred CCCCEEEEECCCCHHHCCCEEC---CCEEEEEEEEEEEEEEEEECCEEEEC T ss_conf 4801277625766000676426---84799755888524999702448951 No 10 >TIGR02086 IPMI_arch 3-isopropylmalate dehydratase, large subunit; InterPro: IPR011826 3-isopropylmalate dehydratase (or isopropylmalate isomerase; 4.2.1.33 from EC) catalyses the stereo-specific isomerisation of 2-isopropylmalate and 3-isopropylmalate, via the formation of 2-isopropylmaleate. This enzyme performs the second step in the biosynthesis of leucine, and is present in most prokaryotes and many fungal species. The prokaryotic enzyme is a heterodimer composed of a large (LeuC) and small (LeuD) subunit, while the fungal form is a monomeric enzyme. Both forms of isopropylmalate are related and are part of the larger aconitase family . Aconitases are mostly monomeric proteins which share four domains in common and contain a single, labile [4Fe-4S] cluster. Three structural domains (1, 2 and 3) are tightly packed around the iron-sulphur cluster, while a fourth domain (4) forms a deep active-site cleft. The prokaryotic enzyme is encoded by two adjacent genes, leuC and leuD, corresponding to aconitase domains 1-3 and 4 respectively , . LeuC does not bind an iron-sulphur cluster. It is thought that some prokaryotic isopropylamalate dehydrogenases can also function as homoaconitase 4.2.1.36 from EC, converting cis-homoaconitate to homoisocitric acid in lysine biosynthesis . Homoaconitase has been identified in higher fungi (mitochondria) and several archaea and one thermophilic species of bacteria, Thermus thermophilus . This entry represents the large subunit of 3-isopropylmalate dehydratase (LeuC), as well as homoaconitase enzymes, from prokaryotes. Homoaconitase, aconitase and 3-isopropylmalate dehydratase have similar overall structures and domain organisation . All are dehydratases that bind a [4Fe-4S]-cluster. Information about related proteins can be found at Protein of the Month: Aconitase .; GO: 0003861 3-isopropylmalate dehydratase activity, 0051539 4 iron 4 sulfur cluster binding, 0009098 leucine biosynthetic process. Probab=54.84 E-value=9.4 Score=17.55 Aligned_cols=45 Identities=24% Similarity=0.392 Sum_probs=34.0 Q ss_pred HHHHHHHHHHHHHCC-CCCCCCEEEEECCHHHHHHHHCCCCEEEHHCCCCCCCCCCCEEEEE Q ss_conf 899999999860238-8975460799769999987506710100000134311367012274 Q gi|254781210|r 159 IGQLITAKSIFRKRY-IDVDSEQVYVLIPSDVWASLFALERATSKDYINTAALQAGKIEAFA 219 (343) Q Consensus 159 ~akl~~a~~~ld~~~-Vp~~g~~~~~v~~p~~~~~Ll~~t~~~s~Df~~~~al~~G~I~~~~ 219 (343) +..|+.|++.|+... +-++- |++|+|++ +.-++ +++..|.|.+|. T Consensus 317 ~~Dl~~AaeiL~g~~GvH~dV--R~iV~PAS-~~V~~-------------~A~e~G~i~~lv 362 (431) T TIGR02086 317 LEDLRIAAEILKGRRGVHPDV--RLIVVPAS-RKVYE-------------KALEEGIIETLV 362 (431) T ss_pred HHHHHHHHHHHCCCCCCCCCE--EEEEECHH-HHHHH-------------HHHHHHHHHHHH T ss_conf 899999999841378857630--38982014-78899-------------998741788887 No 11 >COG4397 Mu-like prophage major head subunit gpT [General function prediction only] Probab=50.96 E-value=8.9 Score=17.68 Aligned_cols=85 Identities=11% Similarity=0.076 Sum_probs=46.8 Q ss_pred ECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCC Q ss_conf 23443112212688887410068999999999987768899999862014556666654456666402302357764222 Q gi|254781210|r 79 HSQFGWAERIDPFATLDSGINPLLPYASLATAAMHRKQDEAILKGMLGVNKKGKIGAETEFFSKENILSAVEGDDFFKTF 158 (343) Q Consensus 79 ~~~~~~a~~IddiDe~k~~~d~rs~y~~~~g~ALar~~D~~I~~a~~g~a~~~~~g~~~~~~~~~~~~~~~~~~~~~~lt 158 (343) .+.|+.+.-|..-|----+.-.-|+..+++|.|-+-+-|+++++++..+-.+ .--.+...|.+...+.....+++...+ T Consensus 77 NK~fe~~V~i~R~DiEDDNvGiYsPl~~EmGRAaav~pDELVFaaL~~g~~t-aCYDGQnFFDTdHPVy~kVDGtgqas~ 155 (308) T COG4397 77 NKTFEGTVAISRDDIEDDNVGIYSPLFQEMGRAAAVQPDELVFAALRDGIST-ACYDGQNFFDTDHPVYPKVDGTGQASM 155 (308) T ss_pred ECCCCCCEEECCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHH-HHCCCCCCCCCCCCCCCCCCCCCCHHH T ss_conf 0100462356036555566420208899976764069338999998743566-532762124578876567578773103 Q ss_pred HHHHHH Q ss_conf 899999 Q gi|254781210|r 159 IGQLIT 164 (343) Q Consensus 159 ~akl~~ 164 (343) +-.++. T Consensus 156 VSNif~ 161 (308) T COG4397 156 VSNIFV 161 (308) T ss_pred HHHHEE T ss_conf 323100 No 12 >TIGR00977 LeuA_rel 2-isopropylmalate synthase/homocitrate synthase family protein; InterPro: IPR005675 This model represents largely uncharacterised proteins related to 2-isopropylmalate synthases and homocitrate synthases but phylogenetically distinct. ; GO: 0046912 transferase activity transferring acyl groups acyl groups converted into alkyl on transfer, 0019752 carboxylic acid metabolic process. Probab=47.64 E-value=19 Score=15.79 Aligned_cols=97 Identities=12% Similarity=0.074 Sum_probs=71.3 Q ss_pred HHHCCEECCCCCCCEEEEECCC-CEEEEEECCCCCCCCCCCCCCCEEEEEECCC-CCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 3010023031267406871346-2355332589874324568762699982344-3112212688887410068999999 Q gi|254781210|r 30 LRPTVTEQATEGEASALVEVFK-PTEAHEIVGDMPDTIYNATDQDRRWVGHSQF-GWAERIDPFATLDSGINPLLPYASL 107 (343) Q Consensus 30 L~~~V~~~t~~~Gks~~f~~~G-~~~A~~~t~r~~~t~~~~~~~~~~~v~~~~~-~~a~~IddiDe~k~~~d~rs~y~~~ 107 (343) =++..+.++.+ |.++|-|+-| ..+|-.+-|++=.=..-+.-..+|.++.+++ .-|-.+.++.+.=..+|+.++-.+. T Consensus 286 n~~~~~~~PyV-G~~AFAHKGGvHvSAv~r~p~TYEHI~P~LVGN~R~I~vSe~aG~SNvl~K~~~~G~~~D~~~P~~r~ 364 (543) T TIGR00977 286 NLPLDENMPYV-GESAFAHKGGVHVSAVKRNPKTYEHIDPELVGNKRKIVVSELAGKSNVLEKLKEFGIEIDEKSPKVRK 364 (543) T ss_pred CCCCCCCCCCC-CCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCEEEEEEECCCCCHHHHHHHHHHCHHCCCCCCHHHH T ss_conf 48853467823-66322025865776550588742345864317713799823557314899998704010578806899 Q ss_pred HHHHHHHHHHH------------HHHHHHHCC Q ss_conf 99998776889------------999986201 Q gi|254781210|r 108 ATAAMHRKQDE------------AILKGMLGV 127 (343) Q Consensus 108 ~g~ALar~~D~------------~I~~a~~g~ 127 (343) +-..|..=..+ ++++-+.|. T Consensus 365 il~~~KeLE~~GY~FEaAEAS~ELL~r~~~G~ 396 (543) T TIGR00977 365 ILKKIKELEKQGYHFEAAEASFELLVRQALGK 396 (543) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCC T ss_conf 99999988752671126788899999987066 No 13 >pfam12217 End_beta_propel Catalytic beta propeller domain of bacteriophage endosialidase. This domain family is found in bacteria and viruses, and is typically between 443 and 460 amino acids in length. This domain is the highly conserved beta propeller of bacteriophage endosialidase which represents the catalytically active part of the enzymes. This core domain forms stable SDS-resistant trimers. There is a nested beta barrel domain in this domain (pfam12195). The endosialidase protein complexes to form a homotrimeric molecule. Probab=41.02 E-value=21 Score=15.63 Aligned_cols=32 Identities=9% Similarity=0.123 Sum_probs=18.2 Q ss_pred CCHHEEEHEECCCCEEECCCCEEEEEECHHHH Q ss_conf 20330001000263565415258999541332 Q gi|254781210|r 300 WHAPQITLTSSFGATRIEPDKILGIEISKDSL 331 (343) Q Consensus 300 ~~a~~i~~~~s~Ga~~l~~~~vv~I~~~e~~~ 331 (343) +++.-+.+..-.|.+.+.+.-..-|=-.||-. T Consensus 388 YqG~ivNsavGVGSv~vKD~~lyyiFGgED~~ 419 (449) T pfam12217 388 YQGGIVNSAVGVGSVCVKDNWLYYIFGGEDFF 419 (449) T ss_pred HCCCCCCCCCCCCEEEEECCEEEEEECCCCCC T ss_conf 23663102344212898678899995685447 No 14 >pfam06816 NOD NOTCH protein. NOTCH signalling plays a fundamental role during a great number of developmental processes in multicellular animals. NOD and NODP represent a region present in many NOTCH proteins and NOTCH homologs in multiple species such as NOTCH2 and NOTCH3, LIN12, SC1 and TAN1. Role of NOD domain remains to be elucidated. Probab=36.45 E-value=29 Score=14.77 Aligned_cols=40 Identities=15% Similarity=0.139 Sum_probs=33.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCEECCCCCCCEEEEECCC Q ss_conf 9886778999998711243010023031267406871346 Q gi|254781210|r 12 IYEFKKHVELALQETKSKLRPTVTEQATEGEASALVEVFK 51 (343) Q Consensus 12 v~q~~~~v~~a~qqk~s~L~~~V~~~t~~~Gks~~f~~~G 51 (343) .+||..+.....++.+..||..|+.+....|+--.||.-| T Consensus 17 p~ef~~~~~~FLr~ls~~Lrt~v~ik~d~~G~~mI~pw~g 56 (56) T pfam06816 17 PEELRNNSVQFLRELSHLLRTNVRIKKDAEGQPMIFPWYG 56 (56) T ss_pred HHHHHHHHHHHHHHHHHHHHHEEEEEECCCCCEEEEECCC T ss_conf 9999987989999999876306999886899887760689 No 15 >TIGR01522 ATPase-IIA2_Ca calcium-transporting P-type ATPase, PMR1-type; InterPro: IPR006413 This family describes the P-type ATPase responsible for translocating calcium ions across the golgi membrane of fungi and animals , , and is of particular importance in the sarcoplasmic reticulum of skeletal and cardiac muscle in vertebrates . The calcium P-type ATPases have been characterised as Type IIA based on a phylogenetic analysis which distinguishes this group from the Type IIB PMCA calcium pump represented by IPR006408 from INTERPRO. ; GO: 0005388 calcium-transporting ATPase activity, 0006816 calcium ion transport, 0016021 integral to membrane. Probab=29.58 E-value=8 Score=17.95 Aligned_cols=64 Identities=13% Similarity=0.094 Sum_probs=31.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHCCEECC--CCCCCEEEEECCCCEEEEEECCCCCCCC--CCCCCCCEEEEEECCC Q ss_conf 22799988677899999871124301002303--1267406871346235533258987432--4568762699982344 Q gi|254781210|r 7 LATANIYEFKKHVELALQETKSKLRPTVTEQA--TEGEASALVEVFKPTEAHEIVGDMPDTI--YNATDQDRRWVGHSQF 82 (343) Q Consensus 7 i~~aFv~q~~~~v~~a~qqk~s~L~~~V~~~t--~~~Gks~~f~~~G~~~A~~~t~r~~~t~--~~~~~~~~~~v~~~~~ 82 (343) +|-+|||||+||=-+.+= -++|-.+- +..||..+ .-|+.-+||--+.- =.-.|.+-|++---|+ T Consensus 94 vTVgFVQEYRSEkSLeaL------nkLVP~ech~~R~G~~~h------vlAs~LVPGDlV~l~vGDRvPADlRiveAv~L 161 (856) T TIGR01522 94 VTVGFVQEYRSEKSLEAL------NKLVPPECHLIRAGKEEH------VLASTLVPGDLVILSVGDRVPADLRIVEAVDL 161 (856) T ss_pred EEEEEEEECCHHHHHHHH------CCCCCCCCCEEECCCCCE------EEEECCCCCCEEEEEECCCCCCCCEEEEEEEE T ss_conf 764016401325799985------036871111473588430------57510789867998627856314025655640 No 16 >TIGR01263 4HPPD 4-hydroxyphenylpyruvate dioxygenase; InterPro: IPR005956 4-hydroxyphenylpyruvate dioxygenase (1.13.11.27 from EC) oxidizes 4-hydroxyphenylpyruvate, a tyrosine and phenylalanine catabolite, to homogentisate. Homogentisate can undergo a further non-enzymatic oxidation and polymerization into brown pigments that protect some bacterial species from light. A similar process occurs spontaneously in blood and is hemolytic . In some bacterial species, this enzyme has been studied as a hemolysin.; GO: 0003868 4-hydroxyphenylpyruvate dioxygenase activity, 0009072 aromatic amino acid family metabolic process. Probab=27.97 E-value=41 Score=13.95 Aligned_cols=34 Identities=9% Similarity=0.195 Sum_probs=26.2 Q ss_pred CCCHHHHHHHHHHHHHC--CCCCCCCEEEEECCHHHHHHHHC Q ss_conf 22289999999986023--88975460799769999987506 Q gi|254781210|r 156 KTFIGQLITAKSIFRKR--YIDVDSEQVYVLIPSDVWASLFA 195 (343) Q Consensus 156 ~lt~akl~~a~~~ld~~--~Vp~~g~~~~~v~~p~~~~~Ll~ 195 (343) +|+..-|+.+.+.|++. .|. ++=.+|..||++|. T Consensus 258 Al~t~DI~~tV~~Lr~rClGv~------FL~~~P~~YYd~l~ 293 (379) T TIGR01263 258 ALNTDDIVRTVRALRARCLGVE------FLDSTPDTYYDNLE 293 (379) T ss_pred EECCHHHHHHHHHHHHHHCCCE------ECCCCCHHHHHHHH T ss_conf 8150579999999776414412------04889716789998 No 17 >KOG1546 consensus Probab=26.51 E-value=43 Score=13.80 Aligned_cols=55 Identities=20% Similarity=0.057 Sum_probs=33.1 Q ss_pred HHHCCEECCCCCCCEEEEECCCCEEEEEECCCCCCCCCCCC-CCCEEEEEECCCCCHHHHHHHH Q ss_conf 30100230312674068713462355332589874324568-7626999823443112212688 Q gi|254781210|r 30 LRPTVTEQATEGEASALVEVFKPTEAHEIVGDMPDTIYNAT-DQDRRWVGHSQFGWAERIDPFA 92 (343) Q Consensus 30 L~~~V~~~t~~~Gks~~f~~~G~~~A~~~t~r~~~t~~~~~-~~~~~~v~~~~~~~a~~IddiD 92 (343) |+.+|. ....|.+.+|+.-|.++ |..++.-.+. -.++.++-.++-..+..|||.+ T Consensus 128 l~wLV~--~aq~gD~LvfHYSGHGt------r~~~~~gDe~dG~DE~I~P~D~~t~G~iIdDe~ 183 (362) T KOG1546 128 LRWLVE--SAQPGDSLVFHYSGHGT------RQPDTNGDEVDGYDETIVPCDHNTQGPIIDDEI 183 (362) T ss_pred HHHHHH--CCCCCCEEEEEECCCCC------CCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHH T ss_conf 999971--68889779998547777------679989877778753030355645665545388 Done!