Query gi|254781059|ref|YP_003065472.1| cysteine desulfurase activator complex subunit SufB [Candidatus Liberibacter asiaticus str. psy62] Match_columns 489 No_of_seqs 225 out of 1870 Neff 7.2 Searched_HMMs 39220 Date Mon May 30 06:21:34 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781059.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01980 sufB FeS assembly pr 100.0 0 0 1252.8 39.3 458 17-480 1-469 (469) 2 PRK11814 cysteine desulfurase 100.0 0 0 1154.9 49.5 484 1-489 1-484 (484) 3 CHL00085 ycf24 putative ABC tr 100.0 0 0 1140.6 50.1 478 7-489 8-485 (485) 4 COG0719 SufB Cysteine desulfur 100.0 0 0 753.6 38.9 406 40-486 1-411 (412) 5 PRK10948 cysteine desulfurase 100.0 0 0 660.2 39.2 399 47-488 20-423 (424) 6 TIGR01981 sufD FeS assembly pr 100.0 0 0 589.7 34.6 323 145-475 52-395 (397) 7 pfam01458 UPF0051 Uncharacteri 100.0 0 0 468.8 30.4 229 223-460 2-230 (230) 8 pfam01458 UPF0051 Uncharacteri 98.7 4.7E-05 1.2E-09 56.8 22.2 177 199-389 9-191 (230) 9 COG0719 SufB Cysteine desulfur 98.2 0.0011 2.7E-08 46.8 21.7 172 200-387 166-345 (412) 10 TIGR01981 sufD FeS assembly pr 98.1 0.0015 3.8E-08 45.7 20.8 180 194-389 149-340 (397) 11 PRK10948 cysteine desulfurase 97.9 0.0039 9.9E-08 42.5 22.0 178 230-420 138-322 (424) 12 CHL00085 ycf24 putative ABC tr 97.8 0.0017 4.3E-08 45.2 13.6 211 229-468 256-476 (485) 13 PRK11814 cysteine desulfurase 97.3 0.013 3.3E-07 38.6 12.6 148 230-386 256-413 (484) 14 TIGR01980 sufB FeS assembly pr 95.7 0.2 5.2E-06 29.8 11.5 289 57-390 98-418 (469) 15 pfam00073 Rhv picornavirus cap 75.6 5.8 0.00015 19.0 4.6 53 192-252 95-149 (173) 16 PRK12440 acetate kinase; Revie 64.8 8.5 0.00022 17.7 3.5 31 428-458 238-268 (397) 17 cd01089 PA2G4-like Related to 64.1 9.8 0.00025 17.3 3.9 82 155-241 142-225 (228) 18 PRK09490 metH B12-dependent me 58.0 12 0.00031 16.6 3.3 48 418-468 1058-1105(1229) 19 PRK07668 hypothetical protein; 58.0 12 0.00032 16.5 4.2 33 23-64 9-41 (248) 20 pfam12247 MKT1_N Temperature d 56.0 6.1 0.00015 18.8 1.4 47 431-480 8-56 (90) 21 TIGR00501 met_pdase_II methion 52.3 8.4 0.00021 17.8 1.6 57 172-242 267-325 (327) 22 PRK12318 methionine aminopepti 51.6 15 0.00039 15.8 4.2 87 155-241 180-285 (291) 23 COG2137 OraA Uncharacterized p 48.2 17 0.00044 15.5 3.8 42 30-71 93-134 (174) 24 KOG2738 consensus 47.3 13 0.00032 16.5 1.9 28 174-201 136-166 (369) 25 PTZ00286 6-phospho-1-fructokin 45.8 19 0.00047 15.2 2.6 58 184-244 272-330 (486) 26 PTZ00134 40S ribosomal protein 44.7 19 0.0005 15.1 3.1 26 35-60 60-91 (154) 27 PRK06646 DNA polymerase III su 43.9 20 0.00051 15.0 5.2 51 161-216 13-75 (154) 28 PRK05667 dnaG DNA primase; Val 43.4 20 0.00052 14.9 2.8 57 154-210 274-345 (600) 29 TIGR02386 rpoC_TIGR DNA-direct 42.9 11 0.00027 17.0 1.0 40 333-374 1053-1094(1552) 30 KOG3870 consensus 42.8 9.3 0.00024 17.5 0.7 24 315-338 408-431 (434) 31 PRK07058 acetate kinase; Provi 42.4 21 0.00053 14.8 3.5 30 429-458 239-269 (393) 32 TIGR02656 cyanin_plasto plasto 41.9 21 0.00054 14.8 2.9 34 197-234 8-41 (102) 33 COG0829 UreH Urease accessory 41.5 22 0.00055 14.7 10.4 84 191-276 23-110 (269) 34 TIGR00500 met_pdase_I methioni 41.3 22 0.00055 14.7 3.0 31 30-65 28-60 (265) 35 TIGR01135 glmS glucosamine--fr 40.2 23 0.00057 14.6 3.4 63 135-202 97-162 (628) 36 pfam04256 DUF434 Protein of un 40.0 23 0.00058 14.6 2.7 26 436-461 7-32 (58) 37 TIGR00118 acolac_lg acetolacta 32.3 16 0.00042 15.6 0.6 19 156-174 223-242 (593) 38 cd01088 MetAP2 Methionine Amin 31.2 31 0.00078 13.6 2.6 13 156-168 123-135 (291) 39 PRK09458 pspB phage shock prot 31.1 31 0.00079 13.6 3.8 44 31-75 31-74 (75) 40 pfam03069 FmdA_AmdA Acetamidas 29.4 33 0.00084 13.4 2.8 23 437-459 284-306 (331) 41 TIGR01478 STEVOR variant surfa 28.6 23 0.0006 14.5 0.8 14 155-168 186-199 (315) 42 cd06460 M32_Taq Peptidase fami 27.6 33 0.00085 13.3 1.5 45 171-219 217-263 (396) 43 KOG1184 consensus 27.6 35 0.0009 13.2 1.9 52 227-281 432-490 (561) 44 smart00775 LNS2 LNS2 domain. T 27.5 30 0.00076 13.7 1.2 27 42-69 17-43 (157) 45 KOG2874 consensus 26.7 33 0.00084 13.4 1.3 44 78-121 36-79 (356) 46 TIGR02916 PEP_his_kin putative 26.1 21 0.00054 14.8 0.2 250 18-283 307-638 (696) 47 pfam06667 PspB Phage shock pro 25.7 38 0.00096 12.9 3.8 44 31-75 31-74 (75) 48 COG4340 Uncharacterized protei 25.4 38 0.00098 12.9 4.2 129 159-318 86-220 (226) 49 COG4101 Predicted mannose-6-ph 25.0 33 0.00083 13.4 1.0 15 194-208 90-104 (142) 50 KOG1151 consensus 24.9 39 0.00099 12.8 2.1 19 233-251 536-554 (775) 51 pfam10884 DUF2683 Protein of u 24.8 39 0.001 12.8 3.9 33 449-481 28-61 (80) 52 TIGR01756 LDH_protist lactate 24.5 40 0.001 12.8 4.5 44 29-72 178-222 (314) 53 PRK03902 manganese transport t 24.2 40 0.001 12.7 3.1 47 112-176 84-130 (142) 54 pfam02074 Peptidase_M32 Carbox 23.6 41 0.001 12.7 1.9 19 430-448 420-438 (494) 55 TIGR02461 osmo_MPG_phos mannos 23.5 41 0.0011 12.7 2.9 13 195-207 63-75 (248) 56 TIGR01173 glmU UDP-N-acetylglu 23.0 42 0.0011 12.6 9.1 25 191-215 249-273 (461) 57 COG2130 Putative NADP-dependen 22.6 21 0.00054 14.8 -0.3 83 391-481 218-301 (340) 58 pfam03701 UPF0181 Uncharacteri 22.6 43 0.0011 12.5 2.8 25 434-458 15-39 (52) 59 pfam04472 DUF552 Protein of un 22.5 43 0.0011 12.5 1.9 15 197-211 59-73 (73) 60 TIGR01818 ntrC nitrogen regula 22.5 29 0.00075 13.8 0.4 18 197-214 91-108 (471) 61 PRK11480 tauA taurine transpor 22.5 43 0.0011 12.5 4.1 13 444-456 255-267 (320) 62 PRK12379 propionate/acetate ki 21.3 45 0.0012 12.3 3.0 28 430-457 240-268 (400) 63 pfam09145 Ubiq-assoc Ubiquitin 21.0 46 0.0012 12.3 2.6 30 434-463 5-34 (43) 64 TIGR03629 arch_S13P archaeal r 21.0 41 0.001 12.7 0.9 11 36-46 52-62 (144) 65 KOG2321 consensus 20.8 46 0.0012 12.3 1.1 31 42-72 12-52 (703) 66 COG4668 MtlA Mannitol/fructose 20.7 47 0.0012 12.3 6.1 68 150-219 26-95 (142) 67 TIGR02418 acolac_catab acetola 20.7 47 0.0012 12.3 1.5 17 198-214 154-170 (553) 68 PHA01625 hypothetical protein 20.6 47 0.0012 12.2 1.3 10 263-272 230-239 (252) 69 COG5616 Predicted integral mem 20.5 47 0.0012 12.2 1.5 17 30-46 42-58 (152) 70 PRK10918 phosphate transporter 20.1 48 0.0012 12.2 2.7 49 171-232 124-174 (346) No 1 >TIGR01980 sufB FeS assembly protein SufB; InterPro: IPR010231 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] . FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins , . It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly . The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA . SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA , acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins . Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen . This entry represents SufB, which is part of the SUF system and forms a complex with SufBCD.; GO: 0005515 protein binding, 0016226 iron-sulfur cluster assembly. Probab=100.00 E-value=0 Score=1252.75 Aligned_cols=458 Identities=51% Similarity=0.851 Sum_probs=448.1 Q ss_pred CCCCCCCCCCCC-EECCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCC--- Q ss_conf 221257413531-0204888899999999863299889999999999999968877866587665682000310056--- Q gi|254781059|r 17 DQYKYGFETNIQ-VDKAPKGLNEDIIRFISAKKNEPDWMLEWRLLAYRRWLTMKDPSWARIRYPAIDFDDLYYYAAP--- 92 (489) Q Consensus 17 ~~y~~~~~~~~~-~~~~~~gl~~~~i~~iS~~k~EP~Wl~e~R~~Al~~f~~l~~P~W~~~~l~~i~~~~i~~~~~p--- 92 (489) .+|+|||+++++ .+.++||||+++|++||+.|+||+|||++||+||+.|.+||+|+|...|++.||++++.||++| T Consensus 1 ~~~~yGf~~~~~~~~~~~kGLt~~VV~~IS~~K~EP~WMLd~RLrale~f~k~p~P~Wg~~dl~~iD~~~~vyY~~PdCn 80 (469) T TIGR01980 1 TEYKYGFHDEDKYAYETEKGLTEEVVREISEKKGEPDWMLDFRLRALELFLKMPMPTWGRPDLSGIDIEDIVYYSKPDCN 80 (469) T ss_pred CCCCCCCCCCCCHHHHCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHCCCCCCCCCH T ss_conf 97667751222224440789888999999730588868999999999988428889888888886786453434377703 Q ss_pred ---C-CCC--CCCCCCCCCHHHHHHHHHCCCCHHHHHHHCCCCCCCCEEEEEECCEEEECCCHHHHHHCCEEEEEHHHHH Q ss_conf ---5-333--4332101587888877651765111111023444431000121350220200567874790885589987 Q gi|254781059|r 93 ---K-SIV--GPQSLDDVDPELLRTYEKLGIPIREQEILAGVQTSKVAVDAVFDSVSVVTTFKEELKKSGVIFMPISEAI 166 (489) Q Consensus 93 ---~-~~~--~~~sld~v~~ei~~tf~kLgip~~e~~~l~g~~~~~v~v~~~~ds~~v~~~l~~~L~~~GVi~~~l~eA~ 166 (489) + ... ...|||+||++|++||||||||++|+++|+| ++|+|||||++||++++|+|+++|||||||++|| T Consensus 81 efkKq~~~~r~~~sWdevP~eIk~TFEkLGIPeaE~k~LaG-----vav~AQYdSEviY~~~Ke~L~~~GVIF~d~d~Al 155 (469) T TIGR01980 81 EFKKQSKKTRKATSWDEVPDEIKDTFEKLGIPEAERKALAG-----VAVGAQYDSEVIYHNIKEDLEEKGVIFCDMDTAL 155 (469) T ss_pred HHHHHHHHHCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHCC-----CEEEEEEEEEEEECCCHHHHHHCCEEECCHHHHH T ss_conf 33101234215678234678999876444861799987425-----3577886204462220467562987863859986 Q ss_pred HHHHHHHHHHCCCE-ECCCCCHHHHHHHHHHCCCEEEEECCCCCCCCCEEEEEEECCCCCEECCEEEEEEECCCEEEEEE Q ss_conf 77789886202211-24565314668876423843887347741342127999965888400221489972772499996 Q gi|254781059|r 167 CEHPDLVKKYLSTV-VPVGDNFYAALNSAVFTDGSFVYIPKGVRCPMELSTYFRINEKGTGQFERTLIIADEGSYVSYLE 245 (489) Q Consensus 167 ~~~p~lv~~~~~~~-~~~~~~~f~aLn~A~~~~G~fi~Vp~gv~v~~PI~~~~~~~~~~~~~f~r~lIi~ee~S~v~~iE 245 (489) ++||||||+|||++ |||.||||+|||+|+|+||+|||||+||+|+.|||+|||+|+++++||+||||||||||+|+||| T Consensus 156 ~~YPdLVKeYF~~v~VPp~DnKFAALn~AVWSGGsFvYVPkGV~v~~PLqtYFRIN~~~~GQFErTLiIadeGa~VhYiE 235 (469) T TIGR01980 156 KEYPDLVKEYFGKVCVPPSDNKFAALNGAVWSGGSFVYVPKGVRVDMPLQTYFRINSENTGQFERTLIIADEGASVHYIE 235 (469) T ss_pred HHCHHHHHHHCCCCCCCCCCCHHHHHHHHEEECCEEEEECCCCEECCCCCCCEECCCCCCCCCEEEEEEEEECCEEEEEE T ss_conf 51725678662700258763414664021102785798478986666300000028477886325889995177488870 Q ss_pred CCCCCCCCCCCCCCEEEEEEECCCCEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCCEEEEEEEEEECCCEEEEEEE Q ss_conf 15676544444433058998669958999996304755311448746401467863046303899887514634788538 Q gi|254781059|r 246 GCTAPQRDENQLHAAVVELVALDNAEIKYSTVQNWYPGDSQGKGGIYNFVTKRGDCRGMRSKISWTQLETGSAITWKYPS 325 (489) Q Consensus 246 ~~~s~~~~~~~~~~~v~Ei~vg~nA~v~~~~iQn~~~~~~~~~~~~~~~~t~r~~~~~~~a~~~~~~~~lG~~~~r~~~~ 325 (489) ||+||.+..++||+++||++|++||+++|++||||..++ ...+++|||+||||.|.|++++|+|+++++||++||.||+ T Consensus 236 GCsAP~Y~~~sLHAa~VE~~v~~~A~~rYtTvQNWs~N~-~~KGGvyNlvTKRA~veG~ng~meWv~Gs~GSK~Tm~YPs 314 (469) T TIGR01980 236 GCSAPVYSTNSLHAAVVELIVKEDARVRYTTVQNWSKND-KNKGGVYNLVTKRALVEGENGTMEWVDGSIGSKITMKYPS 314 (469) T ss_pred CCCCCCCCCCCCEEEEEEEEECCCCEEEEEEEEECCCCC-CCCCCEEECCCCEEEEECCCCEEEEEECCHHHHHHHHCCE T ss_conf 232665688871335899998378758984687303788-9985448668723789626877999602400024412552 Q ss_pred EEEECCCCEEEEEEEEEECCCEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEEEEEECCCCCCCCCEEEEEEEEEC Q ss_conf 99827872799998776259868997218998367661101000010589468994014733886587430221058845 Q gi|254781059|r 326 CLLRGEQSCGEFYSIAISNGFQQIDSGTKMLHLAKNTTSRIVSKGISAGFSNNTYRGKVSVHRGAAHSRNFTQCDSLLIG 405 (489) Q Consensus 326 ~~L~G~~a~~~~~gi~l~~~~Q~~D~~~~v~H~~p~t~S~~~~K~i~~d~s~~~f~G~i~v~~~A~~t~~~~~~~~LlLs 405 (489) ++|+|++|.+++.||+||++.|++|++++|+|+||+|+|.|++|+|++|+|+++|||+|+|.++|++++++.+||+|||+ T Consensus 315 ~iL~G~~A~~e~~sIa~A~~gQ~~DTGaK~~H~AP~TSStI~sKSIS~ggG~~~YRGLVk~~~~A~~~k~~v~CD~Ll~D 394 (469) T TIGR01980 315 SILKGEGAKTEFLSIALAGKGQHLDTGAKMIHLAPNTSSTIISKSISKGGGKSSYRGLVKIGKGAKGAKSHVQCDSLLLD 394 (469) T ss_pred EEECCCCCCCCCEEEEEECCCCCEECCCEEEEECCCCEEEEEEEEEECCCCEEEEEEEEEECCCCCCCCCEEEEEEEEEC T ss_conf 47828862114227787247630000753687089962488987887588744302578865765577111256655657 Q ss_pred CCCEEEEEEEEEEECCCCEEEEEEEECCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 985486223689834875688753202479999999983788989999999999899998858968999999999 Q gi|254781059|r 406 NNCGAHTIPYIESKNSSCNLEHEATTSKISDEHLFYCLQRGIPEEAAIALIVNGFVREVLQKLPMEFAVEAQKLI 480 (489) Q Consensus 406 ~~a~~~s~P~LeI~~ddv~~~H~Atvg~id~e~LFYL~sRGi~~~~A~~liv~gF~~~v~~~ip~e~~~e~~~li 480 (489) ++|.+||+|.+||++|+|..+||||||+|.||||||||||||+|+||++|||.||++||+++||||||+|||||| T Consensus 395 d~S~SdT~P~~ei~n~~v~~~HEAtvsKIsEeqlfYL~sRGl~EedA~~MIV~GF~~pI~keLPmEYAVElnrLi 469 (469) T TIGR01980 395 DESASDTIPYIEIFNDTVSVEHEATVSKISEEQLFYLMSRGLSEEDARAMIVRGFVEPITKELPMEYAVELNRLI 469 (469) T ss_pred CCCCCCCCCEEEEECCCCCEEEEEEECCCCHHHHHHHHHCCCCHHHHHHHHHCCCHHHHHHHCCHHHHHHHCCCC T ss_conf 842213243055526833247855520012456889773788889999997515802565523357889840359 No 2 >PRK11814 cysteine desulfurase activator complex subunit SufB; Provisional Probab=100.00 E-value=0 Score=1154.90 Aligned_cols=484 Identities=75% Similarity=1.225 Sum_probs=475.6 Q ss_pred CCCCHHHHHHHHHHCCCCCCCCCCCCCCEECCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 97624789998860232212574135310204888899999999863299889999999999999968877866587665 Q gi|254781059|r 1 MVAVRETVDKVRKIDVDQYKYGFETNIQVDKAPKGLNEDIIRFISAKKNEPDWMLEWRLLAYRRWLTMKDPSWARIRYPA 80 (489) Q Consensus 1 m~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~gl~~~~i~~iS~~k~EP~Wl~e~R~~Al~~f~~l~~P~W~~~~l~~ 80 (489) |....++.+.++++..++|||||+++++++++||||||++|++||++|+||+||+++|++||+.|+++|+|+|..++++. T Consensus 1 ~~~~~~~~~~~~~~~~~~Ykygf~~~~~~~~~~kGL~e~~V~~IS~~k~EP~Wml~~RLkA~~~~~~~~~P~W~~~~~~~ 80 (484) T PRK11814 1 MSANTETTDDVKELVVQDYKYGFVTDIETDELPKGLNEDVIRLISAKKNEPEWMLEWRLKAYRHWLTMEEPHWAKLHYPP 80 (484) T ss_pred CCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCC T ss_conf 97306678899875234564778775110127899999999998604088789999999999998608999998889987 Q ss_pred CCCCCEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHCCCCCCCCEEEEEECCEEEECCCHHHHHHCCEEEE Q ss_conf 68200031005653334332101587888877651765111111023444431000121350220200567874790885 Q gi|254781059|r 81 IDFDDLYYYAAPKSIVGPQSLDDVDPELLRTYEKLGIPIREQEILAGVQTSKVAVDAVFDSVSVVTTFKEELKKSGVIFM 160 (489) Q Consensus 81 i~~~~i~~~~~p~~~~~~~sld~v~~ei~~tf~kLgip~~e~~~l~g~~~~~v~v~~~~ds~~v~~~l~~~L~~~GVi~~ 160 (489) |||+++.||++|+.....+|||++||++++||+|||||+.|++.|+| +++++||||++||++++++|.++||||| T Consensus 81 id~~~i~yY~~P~~~~~~~s~devp~ei~~tf~kLGIp~~e~k~Lag-----v~vda~~dS~~V~~~~~~~l~~~GVIf~ 155 (484) T PRK11814 81 IDYQDISYYSAPKCKSKPKSLDEVDPELLETFEKLGIPLREQKRLAG-----VAVDAVFDSVSVATTFKEKLAEAGVIFC 155 (484) T ss_pred CCHHHEEEEECCCCCCCCCCHHHCCHHHHHHHHHHCCCHHHHHHHCC-----CEEEEEECCCHHHHHHHHHHHHCCEEEE T ss_conf 77146488624455566455334589999999883886778775236-----5243220520123556899986885996 Q ss_pred EHHHHHHHHHHHHHHHCCCEECCCCCHHHHHHHHHHCCCEEEEECCCCCCCCCEEEEEEECCCCCEECCEEEEEEECCCE Q ss_conf 58998777789886202211245653146688764238438873477413421279999658884002214899727724 Q gi|254781059|r 161 PISEAICEHPDLVKKYLSTVVPVGDNFYAALNSAVFTDGSFVYIPKGVRCPMELSTYFRINEKGTGQFERTLIIADEGSY 240 (489) Q Consensus 161 ~l~eA~~~~p~lv~~~~~~~~~~~~~~f~aLn~A~~~~G~fi~Vp~gv~v~~PI~~~~~~~~~~~~~f~r~lIi~ee~S~ 240 (489) ||++|+++|||||++|||+++|++||+|+|||+|+|++|+||||||||+++.||+.||++++.+.+||+|||||+|+||+ T Consensus 156 ~~~~A~~~hpeLVkkylg~~v~~~d~kfaALn~A~~s~G~FvYVPkgV~v~~pL~~~f~i~~~~~~qf~htLIIaEegS~ 235 (484) T PRK11814 156 SISEAIQEHPELVKKYLGSVVPVNDNFFAALNSAVFSDGSFVYIPKGVRCPMELSTYFRINAANTGQFERTLIIADEGSY 235 (484) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCEEEEECCCEEECCEEEEEEEECCCCCCEEEEEEEEECCCCE T ss_conf 69999999899999998465687510889988998639769996698267536799999857765400157999627857 Q ss_pred EEEEECCCCCCCCCCCCCCEEEEEEECCCCEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCCEEEEEEEEEECCCEE Q ss_conf 99996156765444444330589986699589999963047553114487464014678630463038998875146347 Q gi|254781059|r 241 VSYLEGCTAPQRDENQLHAAVVELVALDNAEIKYSTVQNWYPGDSQGKGGIYNFVTKRGDCRGMRSKISWTQLETGSAIT 320 (489) Q Consensus 241 v~~iE~~~s~~~~~~~~~~~v~Ei~vg~nA~v~~~~iQn~~~~~~~~~~~~~~~~t~r~~~~~~~a~~~~~~~~lG~~~~ 320 (489) ++|||+|++|.+...++|++|+|+++++||+|+|++||||+.++..+.+++|||+|+|+.|.+++|+++|+++++|++++ T Consensus 236 v~yiEgctap~~~~~~lh~~vvEi~v~~~A~v~y~tiQNW~~gd~~~~~~v~n~vtkR~~~~~~~a~i~W~~~~~Gs~it 315 (484) T PRK11814 236 VSYLEGCTAPMRDENQLHAAVVELVALDDAEIKYSTVQNWYPGDENGKGGIYNFVTKRGLCRGENSKISWTQVETGSAIT 315 (484) T ss_pred EEEEEEEECCCCCCCCEEEEEEEEEECCCCEEEEEEEECCCCCCCCCCCCEEEEEEEHHHHCCCCCEEEEEEEECCCCEE T ss_conf 99996550466654541148899997689778899986045545246674489888401221678558888863387002 Q ss_pred EEEEEEEEECCCCEEEEEEEEEECCCEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEEEEEECCCCCCCCCEEEEE Q ss_conf 88538998278727999987762598689972189983676611010000105894689940147338865874302210 Q gi|254781059|r 321 WKYPSCLLRGEQSCGEFYSIAISNGFQQIDSGTKMLHLAKNTTSRIVSKGISAGFSNNTYRGKVSVHRGAAHSRNFTQCD 400 (489) Q Consensus 321 r~~~~~~L~G~~a~~~~~gi~l~~~~Q~~D~~~~v~H~~p~t~S~~~~K~i~~d~s~~~f~G~i~v~~~A~~t~~~~~~~ 400 (489) |.+|+++|.|+||.++++|+++++++||+|++++|+|.||||+|++++|||++|+|+++|||+|+|.++|++|+++|+|| T Consensus 316 ~~yps~~L~G~gs~~~~~sv~~a~~~Q~~D~gtk~~H~ap~t~S~ii~K~I~~d~s~~~frG~v~i~~~A~~s~~~~~c~ 395 (484) T PRK11814 316 WKYPSCILRGDNSVGEFYSVALTNGHQQADTGTKMIHIGKNTKSTIISKGISAGHSQNTYRGLVKIMPKATNARNFTQCD 395 (484) T ss_pred ECCCCEEEECCCCEEEEEEEEECCCCEEEECCEEEEEECCCCEEEEEHHHHCCCCCEEEEEEEEEECCCCCCCHHHHHHH T ss_conf 12561589748978999889972698998133079997577601354001220688799802577557744670343031 Q ss_pred EEEECCCCEEEEEEEEEEECCCCEEEEEEEECCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 58845985486223689834875688753202479999999983788989999999999899998858968999999999 Q gi|254781059|r 401 SLLIGNNCGAHTIPYIESKNSSCNLEHEATTSKISDEHLFYCLQRGIPEEAAIALIVNGFVREVLQKLPMEFAVEAQKLI 480 (489) Q Consensus 401 ~LlLs~~a~~~s~P~LeI~~ddv~~~H~Atvg~id~e~LFYL~sRGi~~~~A~~liv~gF~~~v~~~ip~e~~~e~~~li 480 (489) +||||++|+++|+|.|+|++|||+|+||||||+||+|||||||||||++++|++|||.|||+||+++||+||++|+++|| T Consensus 396 ~LlL~~~a~~dt~P~leI~~~dv~~~HeAtvg~I~eeqlfYLmSRGl~e~eA~~lIV~GF~~~v~~~LP~E~a~E~~~li 475 (484) T PRK11814 396 SLLIGDQCGAHTFPYIEVKNNSAQVEHEATTSKISEDQLFYCRQRGISEEDAVSMIVNGFCKDVFQELPMEFAVEAQKLL 475 (484) T ss_pred HEECCCCCCCCCCCEEEEECCCCEEEEEEECCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 15017886620065389963882798523445679899999986799999999999999789998768889999999998 Q ss_pred HHHHCCCCC Q ss_conf 987013579 Q gi|254781059|r 481 DISLEGSVG 489 (489) Q Consensus 481 ~~~~e~~ig 489 (489) +++|||+|| T Consensus 476 ~~~~egsvg 484 (484) T PRK11814 476 AISLEGSVG 484 (484) T ss_pred HHHHCCCCC T ss_conf 876345879 No 3 >CHL00085 ycf24 putative ABC transporter Probab=100.00 E-value=0 Score=1140.61 Aligned_cols=478 Identities=65% Similarity=1.091 Sum_probs=469.5 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCEECCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCE Q ss_conf 89998860232212574135310204888899999999863299889999999999999968877866587665682000 Q gi|254781059|r 7 TVDKVRKIDVDQYKYGFETNIQVDKAPKGLNEDIIRFISAKKNEPDWMLEWRLLAYRRWLTMKDPSWARIRYPAIDFDDL 86 (489) Q Consensus 7 ~~~~~~~~~~~~y~~~~~~~~~~~~~~~gl~~~~i~~iS~~k~EP~Wl~e~R~~Al~~f~~l~~P~W~~~~l~~i~~~~i 86 (489) ....+.++...+|||||+++++++++||||||++|++||++|+||+||+++|++||+.|+++|+|+|.+++++.|||+++ T Consensus 8 ~~~~~~~~~~~~Ykygf~~~i~~~~~~~GL~e~~V~~IS~~k~EP~Wml~~RLkAl~~~~~~~~P~W~~~~~~~id~~~i 87 (485) T CHL00085 8 KNSNLDNLVNQPYKYGFSTSIETERFPKGLNEDIVKLISKKKNEPKFLLIFRLKAYKKWKKMKQPDWAFLKYPEIDYQDI 87 (485) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHE T ss_conf 27789998478777888776030137999999999999844386799999999999997449999998899997884105 Q ss_pred EEECCCCCCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHCCCCCCCCEEEEEECCEEEECCCHHHHHHCCEEEEEHHHHH Q ss_conf 31005653334332101587888877651765111111023444431000121350220200567874790885589987 Q gi|254781059|r 87 YYYAAPKSIVGPQSLDDVDPELLRTYEKLGIPIREQEILAGVQTSKVAVDAVFDSVSVVTTFKEELKKSGVIFMPISEAI 166 (489) Q Consensus 87 ~~~~~p~~~~~~~sld~v~~ei~~tf~kLgip~~e~~~l~g~~~~~v~v~~~~ds~~v~~~l~~~L~~~GVi~~~l~eA~ 166 (489) .||++|+......|||++|++++++|+|||||+.|+++|+| +++|+||||++||++++++|.++|||||||++|+ T Consensus 88 ~yy~~p~~~~~~~s~devp~eik~tfekLGip~~e~k~l~g-----v~vda~~dS~~V~~~~~~~L~~~GVIf~~~~~A~ 162 (485) T CHL00085 88 IYYSAPKLKKKLNSLDEVDPELLSTFEKLGIPLNEQKRLSN-----VAVDAVFDSVSIGTTFKEELIKAGVIFCSISEAI 162 (485) T ss_pred EEEECCCCCCCCCCHHHCCHHHHHHHHHCCCCHHHHHHHHH-----HHHHEEECCEEEEECHHHHHHHCCEEEECHHHHH T ss_conf 77752564456679366799999999871883566666420-----0120020505532012789997995995689999 Q ss_pred HHHHHHHHHHCCCEECCCCCHHHHHHHHHHCCCEEEEECCCCCCCCCEEEEEEECCCCCEECCEEEEEEECCCEEEEEEC Q ss_conf 77789886202211245653146688764238438873477413421279999658884002214899727724999961 Q gi|254781059|r 167 CEHPDLVKKYLSTVVPVGDNFYAALNSAVFTDGSFVYIPKGVRCPMELSTYFRINEKGTGQFERTLIIADEGSYVSYLEG 246 (489) Q Consensus 167 ~~~p~lv~~~~~~~~~~~~~~f~aLn~A~~~~G~fi~Vp~gv~v~~PI~~~~~~~~~~~~~f~r~lIi~ee~S~v~~iE~ 246 (489) ++|||||++|||+++|+++|+|+|||+|+|++|+|||||+||+|+.||++||++++++.+||+|||||+|+||+++|||+ T Consensus 163 ~~~pelv~ky~g~~v~~~d~kfaALn~A~~sgG~FvYVPkgV~v~~Pl~~~f~i~~~~~~qf~htLIIaeegS~v~~iEg 242 (485) T CHL00085 163 NKYPELIKKYLGSVVPIGDNYFAALNSAVFSDGSFCYIPKDTKCPLELSTYFRINNEESGQFERTLIIADENSYVSYLEG 242 (485) T ss_pred HHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCEEEEECCCCCCCCCEEEEEEECCCCCCCCCEEEEEECCCCEEEEEEE T ss_conf 87089999986154388744689999998459769984077111552079999757756777447999657977999841 Q ss_pred CCCCCCCCCCCCCEEEEEEECCCCEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCCEEEEEEEEEECCCEEEEEEEE Q ss_conf 56765444444330589986699589999963047553114487464014678630463038998875146347885389 Q gi|254781059|r 247 CTAPQRDENQLHAAVVELVALDNAEIKYSTVQNWYPGDSQGKGGIYNFVTKRGDCRGMRSKISWTQLETGSAITWKYPSC 326 (489) Q Consensus 247 ~~s~~~~~~~~~~~v~Ei~vg~nA~v~~~~iQn~~~~~~~~~~~~~~~~t~r~~~~~~~a~~~~~~~~lG~~~~r~~~~~ 326 (489) |++|.+...++|++|+|++|++||+|+|++||||+.++..+.+++|||+|+|+.|.++||+++|+++++|++++|.+|++ T Consensus 243 ctap~~~~~~lh~~vvEv~v~~~A~v~yttiqNW~~g~~~~~~~v~n~vtkR~~~~~~~a~i~W~~~~~Gs~~t~~yps~ 322 (485) T CHL00085 243 CTAPQFDTNQLHAAVVELIALENAEIKYSTVQNWYAGDQNGEGGIYNFVTKRGLCAGKNSKISWTQVETGSAITWKYPSC 322 (485) T ss_pred EECCCCCCCCEEEEEEEEEECCCCEEEEEEEECCCCCCCCCCCCEEEEEEEEEEECCCCCEEEEEEECCCCCEEEECCCE T ss_conf 31477765640258999997679688898876255545246676588876212330579789988851577203304431 Q ss_pred EEECCCCEEEEEEEEEECCCEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEEEEEECCCCCCCCCEEEEEEEEECC Q ss_conf 98278727999987762598689972189983676611010000105894689940147338865874302210588459 Q gi|254781059|r 327 LLRGEQSCGEFYSIAISNGFQQIDSGTKMLHLAKNTTSRIVSKGISAGFSNNTYRGKVSVHRGAAHSRNFTQCDSLLIGN 406 (489) Q Consensus 327 ~L~G~~a~~~~~gi~l~~~~Q~~D~~~~v~H~~p~t~S~~~~K~i~~d~s~~~f~G~i~v~~~A~~t~~~~~~~~LlLs~ 406 (489) +|.|+||.++++|+++++++|++|++++|+|.||||+|++++|||+.|+|+++|||+|+|.++|++|+++|+||+||||+ T Consensus 323 ~L~G~gs~~~~~si~~a~~~Q~~D~gtk~~H~ap~T~S~iisK~I~~~~s~~~frG~v~i~~~A~~s~~~~~c~~LlL~~ 402 (485) T CHL00085 323 ILIGNNSQGEFYSVALTNNYQQADTGTKMIHIGKNTKSRIISKGISAGKSKNTYRGLVKISKKAINSRNYSQCDSLLIGK 402 (485) T ss_pred EEECCCCEEEEEEEEECCCCEEECCCEEEEEEECCCEEEEEEEHHHCCCCEEEECCCCCCCCCCCCCCCCEEECEEEECC T ss_conf 89738978999888851577998456069998068544675301123897677455031278878884201765287768 Q ss_pred CCEEEEEEEEEEECCCCEEEEEEEECCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCC Q ss_conf 85486223689834875688753202479999999983788989999999999899998858968999999999987013 Q gi|254781059|r 407 NCGAHTIPYIESKNSSCNLEHEATTSKISDEHLFYCLQRGIPEEAAIALIVNGFVREVLQKLPMEFAVEAQKLIDISLEG 486 (489) Q Consensus 407 ~a~~~s~P~LeI~~ddv~~~H~Atvg~id~e~LFYL~sRGi~~~~A~~liv~gF~~~v~~~ip~e~~~e~~~li~~~~e~ 486 (489) +|+++|+|+|+|++|||+|+|||||||||+|||||||||||++++|++|||+|||+||+++||+||++|+++||+++||| T Consensus 403 ~a~~dt~P~l~i~~~dv~~~HeAtvgkI~eeqlfYLmSRGl~e~eA~~lIV~GF~~~v~~~LP~E~a~E~~~li~~~leg 482 (485) T CHL00085 403 NSQANTFPYIQVQNPSAKIEHEASTSKIGEEQIFYFLQRGINLEEAISLMISGFCQDVFNKLPMEFALEADRLLSLKLEG 482 (485) T ss_pred CCCCCCCCEEEEECCCCEEEECEECCCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCC T ss_conf 86535674489834883078530435679999999986699999999999998479888658989999999998764266 Q ss_pred CCC Q ss_conf 579 Q gi|254781059|r 487 SVG 489 (489) Q Consensus 487 ~ig 489 (489) +|| T Consensus 483 svg 485 (485) T CHL00085 483 SVG 485 (485) T ss_pred CCC T ss_conf 879 No 4 >COG0719 SufB Cysteine desulfurase activator SufB [Posttranslational modification, protein turnover, chaperones] Probab=100.00 E-value=0 Score=753.57 Aligned_cols=406 Identities=46% Similarity=0.763 Sum_probs=381.9 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCC-CCCCCCCCCCCCC--CEEEECCCCCCCCCCCCCCCCHHHHHHHHHCC Q ss_conf 99999863299889999999999999968877-8665876656820--00310056533343321015878888776517 Q gi|254781059|r 40 IIRFISAKKNEPDWMLEWRLLAYRRWLTMKDP-SWARIRYPAIDFD--DLYYYAAPKSIVGPQSLDDVDPELLRTYEKLG 116 (489) Q Consensus 40 ~i~~iS~~k~EP~Wl~e~R~~Al~~f~~l~~P-~W~~~~l~~i~~~--~i~~~~~p~~~~~~~sld~v~~ei~~tf~kLg 116 (489) +|+.||++++||+||+++|++++..|..+++| .|.. +...|.++ +..+|..|.... |+++| T Consensus 1 ~i~~~s~~~~ep~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~ 64 (412) T COG0719 1 MIELISARKDEPEWMLALRLKALRASPDLEMPEVPTR-RDELWELDFLDRLYYLAPGEAS---------------FKRLG 64 (412) T ss_pred CCCHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCC-CCCCCCCCHHHHHHCCCCCCCC---------------HHHCC T ss_conf 9011322304845778999986764322346889864-2323576867764301666555---------------10037 Q ss_pred CCHHHHHHHCCCCCCCCEEEEEECCEEEECCCHHHHHHCCEEEEEHHHHHHHHHHHHHHHCCC-EECCCCCHHHHHHHHH Q ss_conf 651111110234444310001213502202005678747908855899877778988620221-1245653146688764 Q gi|254781059|r 117 IPIREQEILAGVQTSKVAVDAVFDSVSVVTTFKEELKKSGVIFMPISEAICEHPDLVKKYLST-VVPVGDNFYAALNSAV 195 (489) Q Consensus 117 ip~~e~~~l~g~~~~~v~v~~~~ds~~v~~~l~~~L~~~GVi~~~l~eA~~~~p~lv~~~~~~-~~~~~~~~f~aLn~A~ 195 (489) +|..++ +.+++++..++..+..++.++||+|+++ +++||+++++||++ ++++.+++|+|||+|+ T Consensus 65 ~~~~e~------------~~~~~~~~~~~~~~~~~~~~~gv~~~~~---~~~~~~~~~~~~~~~~v~~~~~~~~aL~~a~ 129 (412) T COG0719 65 IPDAEE------------VLAQCDSVVVYHAVSGELAPEGVIVEDI---LKEHPDLVKKYFGKGVVPDDDDKFAALNAAL 129 (412) T ss_pred CCCHHH------------HCEEECCEEEEEECCCCCCCCCEEEECC---HHHHHHHHHHHCCCCCCCCCCCCEEEEEEEE T ss_conf 774122------------0016422665441154435674088513---3341257887466567677664043555568 Q ss_pred HCCCEEEEECCCCCCCCCEEEEEEECCCCCE-ECCEEEEEEECCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCCEEEE Q ss_conf 2384388734774134212799996588840-022148997277249999615676544444433058998669958999 Q gi|254781059|r 196 FTDGSFVYIPKGVRCPMELSTYFRINEKGTG-QFERTLIIADEGSYVSYLEGCTAPQRDENQLHAAVVELVALDNAEIKY 274 (489) Q Consensus 196 ~~~G~fi~Vp~gv~v~~PI~~~~~~~~~~~~-~f~r~lIi~ee~S~v~~iE~~~s~~~~~~~~~~~v~Ei~vg~nA~v~~ 274 (489) |++|+|||||+|++++.||+++|+++.++.+ ||+|++||+|+||+++|||+|++| ....++|++++|+++++||+|+| T Consensus 130 ~~~g~fiyVp~g~~~~~Pi~~~~~~~~~~~~~q~~~~lIivee~a~v~~ie~~~~p-~~~~~~h~~vvei~v~~nA~v~~ 208 (412) T COG0719 130 FSDGTFIYVPKGVEVPTPIQLYFIINGENTGVQFERTLIIVEEGAEITVIEGCSAP-VGSGSLHNGVVEIFVGENASLTY 208 (412) T ss_pred ECCCEEEEECCCCEECEEEEEEEEEECCCCCEEEEEEEEEECCCCEEEEEEEEECC-CCCCCEEEEEEEEEECCCCEEEE T ss_conf 72748999899942233479999860576522575589998899889999501146-78875045689999848979999 Q ss_pred EEEEECCCCCCCCCCCEEEEEEEEEEEECCCEEEEEEEEEECCCEEEEEEEEEEECCCCEEEEEEEEEECCCEEEEEEEE Q ss_conf 99630475531144874640146786304630389988751463478853899827872799998776259868997218 Q gi|254781059|r 275 STVQNWYPGDSQGKGGIYNFVTKRGDCRGMRSKISWTQLETGSAITWKYPSCLLRGEQSCGEFYSIAISNGFQQIDSGTK 354 (489) Q Consensus 275 ~~iQn~~~~~~~~~~~~~~~~t~r~~~~~~~a~~~~~~~~lG~~~~r~~~~~~L~G~~a~~~~~gi~l~~~~Q~~D~~~~ 354 (489) ++||||++ +++++++|+.+.+++|+++|+++++|++++|.++++.|.|++|.++++|++++.++|++|++++ T Consensus 209 ~~vqn~~~--------~~~~~~~~~~~~~~~a~~~~~~~~~G~~~t~~~~~~~l~Ge~a~~e~~sv~~~~~~q~~D~~~~ 280 (412) T COG0719 209 TTVQNWSP--------VYNHLTKRAAVEEEDATVRWTQVTLGSKVTRKYPSVRLKGEGADGELLSVAFAGGGQHMDVGTK 280 (412) T ss_pred EEEECCCC--------CEEEEEEEEEEECCCCEEEEEEECCCCEEEEEEEEEEEECCCCEEEEEEEEECCCCCEECCCCE T ss_conf 99773675--------0576788999833784899999636870599974279975995799999997479808811428 Q ss_pred EEEECCCCCCEEEEEEEECCCCEEEEEEEEEECCCCCCCCCEEEEEEEEECCCCEEEEEEEEEEECCCCEEEEEEEECCC Q ss_conf 99836766110100001058946899401473388658743022105884598548622368983487568875320247 Q gi|254781059|r 355 MLHLAKNTTSRIVSKGISAGFSNNTYRGKVSVHRGAAHSRNFTQCDSLLIGNNCGAHTIPYIESKNSSCNLEHEATTSKI 434 (489) Q Consensus 355 v~H~~p~t~S~~~~K~i~~d~s~~~f~G~i~v~~~A~~t~~~~~~~~LlLs~~a~~~s~P~LeI~~ddv~~~H~Atvg~i 434 (489) |.|.||||+|++++|||+.|+|+++|||+|.|.++|++|+|+|+|++||||++|+++|+|.|+|++|||+|+||||||+| T Consensus 281 v~h~g~~t~s~i~~kgI~~d~a~~vfrG~i~i~~~A~~s~a~~~c~~LlLs~~a~~dt~P~LeI~~ddV~~~H~Atvg~i 360 (412) T COG0719 281 MIHNGPNTRSNILSKGIVKDGARTVFRGLIDIRKGATGSDAYQECRTLLLSDDARADTVPILEIETDDVKAGHEATVGRI 360 (412) T ss_pred EEEECCCCEEEEEEEEEEECCCEEEEEEEEEEECCCCCCCCEEEECEEEECCCCCEECCEEEEEECCCEEEEEEEEECCC T ss_conf 99956995668999889927868999808998368877601347157998699756516079997697689830241256 Q ss_pred CHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCC Q ss_conf 9999999983788989999999999899998858968999999999987013 Q gi|254781059|r 435 SDEHLFYCLQRGIPEEAAIALIVNGFVREVLQKLPMEFAVEAQKLIDISLEG 486 (489) Q Consensus 435 d~e~LFYL~sRGi~~~~A~~liv~gF~~~v~~~ip~e~~~e~~~li~~~~e~ 486 (489) |+|||||||||||+|++|++|||+||++||+++||+ +++|.++++..++++ T Consensus 361 deeqLFYLmSRGl~e~eA~~lIV~GF~~~vi~~lp~-~~v~~~~~~~~~~~~ 411 (412) T COG0719 361 DEEQLFYLMSRGLSEEEAKRLIVRGFLEEVIEELPE-EAVEEQLLIFIELEG 411 (412) T ss_pred CHHHEEEHHHCCCCHHHHHHHHHHHHHHHHHHCCCH-HHHHHHHHHHHHHHC T ss_conf 868853077569998999999999977777650887-789988998777406 No 5 >PRK10948 cysteine desulfurase activator complex subunit SufD; Provisional Probab=100.00 E-value=0 Score=660.23 Aligned_cols=399 Identities=19% Similarity=0.220 Sum_probs=326.7 Q ss_pred HCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHC Q ss_conf 32998899999999999999688778665876656820003100565333433210158788887765176511111102 Q gi|254781059|r 47 KKNEPDWMLEWRLLAYRRWLTMKDPSWARIRYPAIDFDDLYYYAAPKSIVGPQSLDDVDPELLRTYEKLGIPIREQEILA 126 (489) Q Consensus 47 ~k~EP~Wl~e~R~~Al~~f~~l~~P~W~~~~l~~i~~~~i~~~~~p~~~~~~~sld~v~~ei~~tf~kLgip~~e~~~l~ 126 (489) ++++|+-- ..|++||+.|.++|+|+ .+++.|+|+++............. ..++++..+ .+.++ T Consensus 20 ~~~~~~~~-~~r~~~~~~F~~lGlPt---rK~EdWKYTdL~~l~~~~f~~~~~--~~~~~~~~~---~~~~~-------- 82 (424) T PRK10948 20 EAKGIERS-PQARQHWQQVLRLGLPT---RKHEDWKYTPLEGLLNSQFIVSIA--AEISPAQRD---ALALT-------- 82 (424) T ss_pred HHCCCCCH-HHHHHHHHHHHHHCCCC---CCCCCCCCCCHHHHHHCCCCCCCC--CCCCHHHHH---HHCCC-------- T ss_conf 60167686-99999999999858999---887676679889996400677765--556843542---20367-------- Q ss_pred CCCCCCCEEEEEECCEEEECCCHHHHHHCCEEEEEHHHHHHHHHHHHHHHCCCEECCCCCHHHHHHHHHHCCCEEEEECC Q ss_conf 34444310001213502202005678747908855899877778988620221124565314668876423843887347 Q gi|254781059|r 127 GVQTSKVAVDAVFDSVSVVTTFKEELKKSGVIFMPISEAICEHPDLVKKYLSTVVPVGDNFYAALNSAVFTDGSFVYIPK 206 (489) Q Consensus 127 g~~~~~v~v~~~~ds~~v~~~l~~~L~~~GVi~~~l~eA~~~~p~lv~~~~~~~~~~~~~~f~aLn~A~~~~G~fi~Vp~ 206 (489) ......|+|++.|.+.... +....+. ..+.+.....++ +...+.|++||+|++++|+||+||+ T Consensus 83 ~d~~rIVFVNG~f~~~LSd------~~~~~~~-~~i~~~~~~~~~----------~~~~d~f~~LN~Al~~dG~~I~V~~ 145 (424) T PRK10948 83 IDAVRLVFVDGRFSPALSD------STIGPYQ-VSINDDRQGLPD----------PIQPEVFLHLTESLAQSVTHIRLPR 145 (424) T ss_pred CCCEEEEEECCEECCCCCC------CCCCCCE-EEECCCHHHCCC----------CCCHHHHHHHHHHHHCCCEEEEECC T ss_conf 8825999999998455675------5568505-785001021523----------1232389999999855977999879 Q ss_pred CCCCCCCEEEEEEECCCCC----EECCEEEEEEECCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCCEEEEEEEEECCC Q ss_conf 7413421279999658884----002214899727724999961567654444443305899866995899999630475 Q gi|254781059|r 207 GVRCPMELSTYFRINEKGT----GQFERTLIIADEGSYVSYLEGCTAPQRDENQLHAAVVELVALDNAEIKYSTVQNWYP 282 (489) Q Consensus 207 gv~v~~PI~~~~~~~~~~~----~~f~r~lIi~ee~S~v~~iE~~~s~~~~~~~~~~~v~Ei~vg~nA~v~~~~iQn~~~ 282 (489) |+++++||++++..++... ...||++|++|+||++++||.+.+.. +...|+|.++||++++||+|+|+++|++.. T Consensus 146 n~~v~kPI~il~i~s~~~~~~~~~~~~R~lI~l~enS~~tIIE~~~~~~-~~~~f~N~vteI~l~~nA~L~h~klq~~~~ 224 (424) T PRK10948 146 GQRPAKPLYLLHITQGVDGDELNTAHYRHHLDLAEGAEATVIEHFVSLN-EARHFTGARLTMNVGDNAHLNHIKLAFENP 224 (424) T ss_pred CCCCCCCEEEEEEECCCCCCCCCCCCCEEEEEECCCCEEEEEEEEECCC-CCCCEEEEEEEEEECCCCEEEEEEEECCCC T ss_conf 9705789899998158863200034404899988998899999861688-767066038999987996799999731674 Q ss_pred CCCCCCCCEEEEEEEEEEEECCCEEEEEEEEEECCCEEEEEEEEEEECCCCEEEEEEEEEECCCEEEEEEEEEEEECCCC Q ss_conf 53114487464014678630463038998875146347885389982787279999877625986899721899836766 Q gi|254781059|r 283 GDSQGKGGIYNFVTKRGDCRGMRSKISWTQLETGSAITWKYPSCLLRGEQSCGEFYSIAISNGFQQIDSGTKMLHLAKNT 362 (489) Q Consensus 283 ~~~~~~~~~~~~~t~r~~~~~~~a~~~~~~~~lG~~~~r~~~~~~L~G~~a~~~~~gi~l~~~~Q~~D~~~~v~H~~p~t 362 (489) +.+++.+.+. .++++|+++++++++||+++|+++.+.|.|++|++.++|+++++++||+|++|.|+|.+||| T Consensus 225 -------~s~h~~~~~v-~~~~dS~~~~~~~slGg~l~R~~~~~~L~Ge~a~~~l~Gl~l~~~~qh~D~~t~i~H~~p~c 296 (424) T PRK10948 225 -------SSYHFAHNDL-LLGRDARAFSHSFLLGAGVLRHNTSSQLNGENSTLSLNSLAMPVKNEVCDTRTWLEHNKGYC 296 (424) T ss_pred -------CCEEEEEEEE-EECCCCEEEEEEEEECCEEEEEEEEEEEECCCCEEEEEEEEECCCCEEEEEEEEEEECCCCC T ss_conf -------2378852479-97179579999999678558988899994799689998779635865762027887279983 Q ss_pred CCEEEEEEEECCCCEEEEEEEEEECCCCCCCCCEEEEEEEEECCCCEEEEEEEEEEECCCCEEEEEEEECCCCHHHHHHH Q ss_conf 11010000105894689940147338865874302210588459854862236898348756887532024799999999 Q gi|254781059|r 363 TSRIVSKGISAGFSNNTYRGKVSVHRGAAHSRNFTQCDSLLIGNNCGAHTIPYIESKNSSCNLEHEATTSKISDEHLFYC 442 (489) Q Consensus 363 ~S~~~~K~i~~d~s~~~f~G~i~v~~~A~~t~~~~~~~~LlLs~~a~~~s~P~LeI~~ddv~~~H~Atvg~id~e~LFYL 442 (489) +|+|++|||+.|+|++||+|+|.|+++||||+|+|.|+|||||++|+++|+|+|||+||||+|+||||||+||+|+|||| T Consensus 297 ~S~ql~K~Il~d~s~gVF~GkI~V~~~Aqktda~q~n~nllLs~~a~~~skP~LeI~aDDVkcsHGaTvg~ld~~~lFYL 376 (424) T PRK10948 297 NSRQLHKTIVSDKGRAVFNGLIKVAQHAIKTDGQMTNNNLLLGKLAEVDTKPQLEIYADDVKCSHGATVGRIDDEQMFYL 376 (424) T ss_pred EEEEEEEHCCCCCCCEEEEEEEEECCCCCCCHHHHHHCCEECCCCCEEEECCEEEEECCCEEECCCCCCCCCCHHHHHHH T ss_conf 11022100066787189997899907856476655213063378856953640799868613714443478898888889 Q ss_pred HHCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHH-HHHHHHHHHCCCC Q ss_conf 837889899999999998999988589689999-9999998701357 Q gi|254781059|r 443 LQRGIPEEAAIALIVNGFVREVLQKLPMEFAVE-AQKLIDISLEGSV 488 (489) Q Consensus 443 ~sRGi~~~~A~~liv~gF~~~v~~~ip~e~~~e-~~~li~~~~e~~i 488 (489) |||||++++|+.||+.||+.||+++||.|-.++ +.++|+.+|.|.| T Consensus 377 ~sRGi~~~~A~~ll~~aF~~ev~~~i~~~~lr~~l~~~i~~~l~~~~ 423 (424) T PRK10948 377 RSRGINQQDAQQMIIYAFAAELTEAIHNEALKQQVLARIGQRLPGGI 423 (424) T ss_pred HHCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCC T ss_conf 86699999999999999999999858978999999999998751477 No 6 >TIGR01981 sufD FeS assembly protein SufD; InterPro: IPR011542 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] . FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins , . It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly . The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA . SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA , acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins . Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen . This entry represents SufD proteins that form part of the SufBCD complex in the SUF system. No specific functions have been assigned to these proteins.; GO: 0005515 protein binding, 0016226 iron-sulfur cluster assembly. Probab=100.00 E-value=0 Score=589.75 Aligned_cols=323 Identities=28% Similarity=0.422 Sum_probs=298.5 Q ss_pred ECCCHHHHHHCCEEEEEHHHHHHHHHHHHHHHCCC-EECCCC------------CHHHHHHHHHHCCCEEEEECCCCCCC Q ss_conf 02005678747908855899877778988620221-124565------------31466887642384388734774134 Q gi|254781059|r 145 VTTFKEELKKSGVIFMPISEAICEHPDLVKKYLST-VVPVGD------------NFYAALNSAVFTDGSFVYIPKGVRCP 211 (489) Q Consensus 145 ~~~l~~~L~~~GVi~~~l~eA~~~~p~lv~~~~~~-~~~~~~------------~~f~aLn~A~~~~G~fi~Vp~gv~v~ 211 (489) ..+++..+...|+++..|+++...+|++.++++++ ++.+.+ ++|++||.|++++|++||||+|++++ T Consensus 52 ~~~~~~~~~~~G~~~~~l~~~~~~~~~~~~~~~~~~~~~~~e~~~~yL~~~g~d~~~~~~~~~~~~~g~~~~~~~~~~~~ 131 (397) T TIGR01981 52 VSNLVRLVFVNGVFVPDLSDADPLLPELLEDLLEARAVLSDEIQSKYLQFEGADERFTALNAALFNDGTVLYVPKGVEVE 131 (397) T ss_pred CCCCCCHHHCCCEEECHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCCHHHHHHHHHHHCCCEEEEEECCCCEEE T ss_conf 15663001108732000222034664321001377256737999850674583377566556313515999965894773 Q ss_pred CCEEEEEEEC-CCC---CEECCEEEEEEECCCEEEEEECCCCCC-CCCCC-CCCEEEEEEECCCCEEEEEEEEECCCCCC Q ss_conf 2127999965-888---400221489972772499996156765-44444-43305899866995899999630475531 Q gi|254781059|r 212 MELSTYFRIN-EKG---TGQFERTLIIADEGSYVSYLEGCTAPQ-RDENQ-LHAAVVELVALDNAEIKYSTVQNWYPGDS 285 (489) Q Consensus 212 ~PI~~~~~~~-~~~---~~~f~r~lIi~ee~S~v~~iE~~~s~~-~~~~~-~~~~v~Ei~vg~nA~v~~~~iQn~~~~~~ 285 (489) .||++.|+.+ .++ ...++|++|++|++|++++||++.+.. ..... +.+.|+||++++||+|+|+.||.| T Consensus 132 ~Pi~~~~~~~~~~~e~~~~~~~~~~i~~~~~~~~~v~e~~~~~~~~~~~~Y~~n~v~~i~~~~~A~v~~~~~~~~----- 206 (397) T TIGR01981 132 EPIEIKFIGVAGEGEKKAFVHPRLLIEVEKGSKITVIERFEGISLNESKAYFTNSVVEITVGANASVSFVRVQDF----- 206 (397) T ss_pred CCEEEEEEEECCCCCEEEEEECCEEEEECCCCEEEEEEEEECCCCCCCCEEEEECEEEEEECCCCEEEEEEEECC----- T ss_conf 348999984067675013653453899868980799998325788864314650408999744878999996158----- Q ss_pred CCCCCEEEEE-EEEEEEECCCEEEEEEEEEECCCEEEEEEEEEEECCCCEEEEEEEEEECCCEEEEEEEEEEEECCCCCC Q ss_conf 1448746401-467863046303899887514634788538998278727999987762598689972189983676611 Q gi|254781059|r 286 QGKGGIYNFV-TKRGDCRGMRSKISWTQLETGSAITWKYPSCLLRGEQSCGEFYSIAISNGFQQIDSGTKMLHLAKNTTS 364 (489) Q Consensus 286 ~~~~~~~~~~-t~r~~~~~~~a~~~~~~~~lG~~~~r~~~~~~L~G~~a~~~~~gi~l~~~~Q~~D~~~~v~H~~p~t~S 364 (489) +.++++|. ..|+.. ++||++.|..+.+|++++|.+..+.|.|++|.++++|++++.++|++|+++.|.|.+|+|+| T Consensus 207 --~~~~~~~~W~~~~~~-~~~a~~~~~~~~~Gg~~~~~~~~~~l~G~~s~~~~~gl~~~~~~~~~D~~~~v~h~~~~~~S 283 (397) T TIGR01981 207 --SKNSTHFSWNHRAKI-GKDARLRLAVVNLGGKLSRHNLDVDLEGEGSKLEVKGLAIGAGRQHVDVRTLVEHNGPDTKS 283 (397) T ss_pred --CCCEEEEEEEEEEEE-ECCCEEEEEEEECCCCEEEEEEEEEEECCCEEEEEEEEEEECCCCEEEEEEEEEECCCCCEE T ss_conf --981699865577765-12865899999927857999987888246407999998751798157777568871666243 Q ss_pred EEEEEEEECCCCEEEEEEEEEECCCCCCCCCEEEEEEEEECCCCEEEEEEEEEEECCCCEEE-EEEEECCCCHHHHHHHH Q ss_conf 01000010589468994014733886587430221058845985486223689834875688-75320247999999998 Q gi|254781059|r 365 RIVSKGISAGFSNNTYRGKVSVHRGAAHSRNFTQCDSLLIGNNCGAHTIPYIESKNSSCNLE-HEATTSKISDEHLFYCL 443 (489) Q Consensus 365 ~~~~K~i~~d~s~~~f~G~i~v~~~A~~t~~~~~~~~LlLs~~a~~~s~P~LeI~~ddv~~~-H~Atvg~id~e~LFYL~ 443 (489) +|++|||+.|+|+.||+|+++|.++|++|+|+|.||+||||++|+++|+|.|||+||||+|+ ||||||+||+|+||||| T Consensus 284 ~~~~~g~~~d~a~~vF~G~~~i~~~A~~tda~~~~~~LlLs~~A~~d~~P~LeI~~ddV~~~gHgAtVg~iD~e~LFYLr 363 (397) T TIGR01981 284 NILHRGILDDRAHGVFRGIIRIPKGAKGTDAEQLNRNLLLSDKARADTKPQLEIDADDVKCSGHGATVGQIDDEQLFYLR 363 (397) T ss_pred EEEEEEEECCCCEEEEEEEEEECCCCCCCCEEEEEEEEEECCCCEEEECEEEEEECCCEEECCEECCCCCCCHHHHHHHH T ss_conf 28997799268429999878846557486336563157547876287121679975824321101100335744657676 Q ss_pred HCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHH Q ss_conf 37889899999999998999988589689999 Q gi|254781059|r 444 QRGIPEEAAIALIVNGFVREVLQKLPMEFAVE 475 (489) Q Consensus 444 sRGi~~~~A~~liv~gF~~~v~~~ip~e~~~e 475 (489) |||||+++|++|||+||+.+++++||.|-..+ T Consensus 364 SRGi~~~~A~~L~i~gF~~e~i~~i~~~~l~~ 395 (397) T TIGR01981 364 SRGIDEEEARRLLIRGFLGEIIEEIPDESLKE 395 (397) T ss_pred HCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 37889899999999988643332138815652 No 7 >pfam01458 UPF0051 Uncharacterized protein family (UPF0051). Probab=100.00 E-value=0 Score=468.75 Aligned_cols=229 Identities=33% Similarity=0.561 Sum_probs=217.1 Q ss_pred CCCEECCEEEEEEECCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCCEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEE Q ss_conf 88400221489972772499996156765444444330589986699589999963047553114487464014678630 Q gi|254781059|r 223 KGTGQFERTLIIADEGSYVSYLEGCTAPQRDENQLHAAVVELVALDNAEIKYSTVQNWYPGDSQGKGGIYNFVTKRGDCR 302 (489) Q Consensus 223 ~~~~~f~r~lIi~ee~S~v~~iE~~~s~~~~~~~~~~~v~Ei~vg~nA~v~~~~iQn~~~~~~~~~~~~~~~~t~r~~~~ 302 (489) .+..++||++|++|+||+++++|+|.++. ...++++.++|+++++||+|+|+.+|+|.. +.+++.+.+..+ T Consensus 2 ~~~~~~~r~~I~v~~~s~~~iie~~~~~~-~~~~~~n~v~ei~l~~nA~l~~~~iq~~~~-------~~~~~~~~~~~~- 72 (230) T pfam01458 2 SGGAQFPRNLIIVEEGAEVTIIEGCASPG-AGDGLHNGVTEIIVGEGARLKYTKIQNEGK-------GSIHFATTRVRL- 72 (230) T ss_pred CCCCEECCEEEEECCCCEEEEEEEECCCC-CCCEEEEEEEEEEECCCCEEEEEEEECCCC-------CEEEEEEEEEEE- T ss_conf 97538123899989999999999961799-876388899999988998999999972899-------729995689999- Q ss_pred CCCEEEEEEEEEECCCEEEEEEEEEEECCCCEEEEEEEEEECCCEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEE Q ss_conf 46303899887514634788538998278727999987762598689972189983676611010000105894689940 Q gi|254781059|r 303 GMRSKISWTQLETGSAITWKYPSCLLRGEQSCGEFYSIAISNGFQQIDSGTKMLHLAKNTTSRIVSKGISAGFSNNTYRG 382 (489) Q Consensus 303 ~~~a~~~~~~~~lG~~~~r~~~~~~L~G~~a~~~~~gi~l~~~~Q~~D~~~~v~H~~p~t~S~~~~K~i~~d~s~~~f~G 382 (489) +++|.++|+.+++|++.+|.++.+.|.|++|.++++|++++.++|++|+++.++|.+|+|+|+|++|+|+.|+|+++|+| T Consensus 73 ~~~s~~~~~~~~~G~~~~r~~~~~~l~G~~a~~~~~~~~l~~~~q~~D~~~~i~H~~~~t~S~~~~k~vl~d~s~~vf~G 152 (230) T pfam01458 73 EEDATLESTSVSLGGKLSRNDPDVDLNGEGAEAELNGVYLAKGDQHVDTHTKVRHNGPNTRSRILYKGVLKDRSRGVFRG 152 (230) T ss_pred CCCCEEEEEEEEECCCEEEEEEEEEEECCCCEEEEEEEEECCCCEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEEEEE T ss_conf 99989999999978812999317999679989999999963697999502699998898099610513656872699962 Q ss_pred EEEECCCCCCCCCEEEEEEEEECCCCEEEEEEEEEEECCCCEEEEEEEECCCCHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 147338865874302210588459854862236898348756887532024799999999837889899999999998 Q gi|254781059|r 383 KVSVHRGAAHSRNFTQCDSLLIGNNCGAHTIPYIESKNSSCNLEHEATTSKISDEHLFYCLQRGIPEEAAIALIVNGF 460 (489) Q Consensus 383 ~i~v~~~A~~t~~~~~~~~LlLs~~a~~~s~P~LeI~~ddv~~~H~Atvg~id~e~LFYL~sRGi~~~~A~~liv~gF 460 (489) +|+|.++|++|+|+|+|++||||++|+++|+|.|||++|||+|+||||||+||+||||||||||||+++|++|||+|| T Consensus 153 ~i~V~~~A~~t~~~q~~~~llLs~~a~~~s~P~Lei~~ddv~~~HgAtvg~id~e~lfYL~sRGi~~~~A~~lli~gf 230 (230) T pfam01458 153 KIKVEKGAQKTDGHQECRNLLLSDKARVDTKPELEIYADDVKCSHGATVGQLDEEQLFYLMSRGLSEEEAKRLLIRGF 230 (230) T ss_pred EEEECCCCCEECCEEEEEEEEECCCCEEEECCEEEEECCCEEEEEEEEECCCCHHHHHHHHHCCCCHHHHHHHHHHCC T ss_conf 146727774300056787797058830885854999759849998487206798999999876989999999887509 No 8 >pfam01458 UPF0051 Uncharacterized protein family (UPF0051). Probab=98.70 E-value=4.7e-05 Score=56.77 Aligned_cols=177 Identities=11% Similarity=0.073 Sum_probs=120.5 Q ss_pred CEEEEECCCCCCCCCEEEEEEECCCCCEE-CCEEEEEEECCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCCEEEEEEE Q ss_conf 43887347741342127999965888400-22148997277249999615676544444433058998669958999996 Q gi|254781059|r 199 GSFVYIPKGVRCPMELSTYFRINEKGTGQ-FERTLIIADEGSYVSYLEGCTAPQRDENQLHAAVVELVALDNAEIKYSTV 277 (489) Q Consensus 199 G~fi~Vp~gv~v~~PI~~~~~~~~~~~~~-f~r~lIi~ee~S~v~~iE~~~s~~~~~~~~~~~v~Ei~vg~nA~v~~~~i 277 (489) =..|.|.+|.++.. .-.|.....+.+. -+.+-|.+++||+++++.-..- .....+...+.+.++++|+++.+.+ T Consensus 9 r~~I~v~~~s~~~i--ie~~~~~~~~~~~~n~v~ei~l~~nA~l~~~~iq~~---~~~~~~~~~~~~~~~~~s~~~~~~~ 83 (230) T pfam01458 9 RNLIIVEEGAEVTI--IEGCASPGAGDGLHNGVTEIIVGEGARLKYTKIQNE---GKGSIHFATTRVRLEEDATLESTSV 83 (230) T ss_pred CEEEEECCCCEEEE--EEEECCCCCCCEEEEEEEEEEECCCCEEEEEEEECC---CCCEEEEEEEEEEECCCCEEEEEEE T ss_conf 38999899999999--999617998763888999999889989999999728---9972999568999999989999999 Q ss_pred EECCCCCCCCCCCEEEEEEEEEEEECCCEEEEEEEEEECCCEEE--EEEEEEEECCCCEEEEEEEEEECCC--EEEEEEE Q ss_conf 30475531144874640146786304630389988751463478--8538998278727999987762598--6899721 Q gi|254781059|r 278 QNWYPGDSQGKGGIYNFVTKRGDCRGMRSKISWTQLETGSAITW--KYPSCLLRGEQSCGEFYSIAISNGF--QQIDSGT 353 (489) Q Consensus 278 Qn~~~~~~~~~~~~~~~~t~r~~~~~~~a~~~~~~~~lG~~~~r--~~~~~~L~G~~a~~~~~gi~l~~~~--Q~~D~~~ 353 (489) .- +. . .....-+....|++|..+...+.++..-.+ ....+.+.|+++.++...-.+..++ ..+.=.. T Consensus 84 ~~-G~------~--~~r~~~~~~l~G~~a~~~~~~~~l~~~~q~~D~~~~i~H~~~~t~S~~~~k~vl~d~s~~vf~G~i 154 (230) T pfam01458 84 SL-GG------K--LSRNDPDVDLNGEGAEAELNGVYLAKGDQHVDTHTKVRHNGPNTRSRILYKGVLKDRSRGVFRGKI 154 (230) T ss_pred EE-CC------C--EEEEEEEEEEECCCCEEEEEEEEECCCCEEEEEEEEEEECCCCEEEEEEEEEEECCCCEEEEEEEE T ss_conf 97-88------1--299931799967998999999996369799950269999889809961051365687269996214 Q ss_pred EEEEECCCCCCEEEEEEEEC-CCCEEEEEEEEEECCC Q ss_conf 89983676611010000105-8946899401473388 Q gi|254781059|r 354 KMLHLAKNTTSRIVSKGISA-GFSNNTYRGKVSVHRG 389 (489) Q Consensus 354 ~v~H~~p~t~S~~~~K~i~~-d~s~~~f~G~i~v~~~ 389 (489) .|.+.+++|.+.|..++++. ++++..-.=.+.|..+ T Consensus 155 ~V~~~A~~t~~~q~~~~llLs~~a~~~s~P~Lei~~d 191 (230) T pfam01458 155 KVEKGAQKTDGHQECRNLLLSDKARVDTKPELEIYAD 191 (230) T ss_pred EECCCCCEECCEEEEEEEEECCCCEEEECCEEEEECC T ss_conf 6727774300056787797058830885854999759 No 9 >COG0719 SufB Cysteine desulfurase activator SufB [Posttranslational modification, protein turnover, chaperones] Probab=98.19 E-value=0.0011 Score=46.76 Aligned_cols=172 Identities=9% Similarity=0.018 Sum_probs=112.4 Q ss_pred EEEEECCCCCCCCCEEEEEEEC---CCCCEECCEEEEEEECCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCCEEEEEE Q ss_conf 3887347741342127999965---8884002214899727724999961567654444443305899866995899999 Q gi|254781059|r 200 SFVYIPKGVRCPMELSTYFRIN---EKGTGQFERTLIIADEGSYVSYLEGCTAPQRDENQLHAAVVELVALDNAEIKYST 276 (489) Q Consensus 200 ~fi~Vp~gv~v~~PI~~~~~~~---~~~~~~f~r~lIi~ee~S~v~~iE~~~s~~~~~~~~~~~v~Ei~vg~nA~v~~~~ 276 (489) ..|-|.+|..+.. +-..+ ..+.....-+-|++++||+++|..-..- .....|....-...+++|+++.+. T Consensus 166 ~lIivee~a~v~~----ie~~~~p~~~~~~h~~vvei~v~~nA~v~~~~vqn~---~~~~~~~~~~~~~~~~~a~~~~~~ 238 (412) T COG0719 166 TLIIVEEGAEITV----IEGCSAPVGSGSLHNGVVEIFVGENASLTYTTVQNW---SPVYNHLTKRAAVEEEDATVRWTQ 238 (412) T ss_pred EEEEECCCCEEEE----EEEEECCCCCCCEEEEEEEEEECCCCEEEEEEEECC---CCCEEEEEEEEEEECCCCEEEEEE T ss_conf 8999889988999----950114678875045689999848979999997736---750576788999833784899999 Q ss_pred EEECCCCCCCCCCCEEEEEEEEEEEECCCEEEEEEEEEECCCE--EEEEEEEEEECCCCEEEEEEEEEECCCEEEEE--E Q ss_conf 6304755311448746401467863046303899887514634--78853899827872799998776259868997--2 Q gi|254781059|r 277 VQNWYPGDSQGKGGIYNFVTKRGDCRGMRSKISWTQLETGSAI--TWKYPSCLLRGEQSCGEFYSIAISNGFQQIDS--G 352 (489) Q Consensus 277 iQn~~~~~~~~~~~~~~~~t~r~~~~~~~a~~~~~~~~lG~~~--~r~~~~~~L~G~~a~~~~~gi~l~~~~Q~~D~--~ 352 (489) +.. +++....+.. ....|++|..+...+.++.+- .-....+.+.|+++.+++.+-.+..++-+.-. . T Consensus 239 ~~~-------G~~~t~~~~~--~~l~Ge~a~~e~~sv~~~~~~q~~D~~~~v~h~g~~t~s~i~~kgI~~d~a~~vfrG~ 309 (412) T COG0719 239 VTL-------GSKVTRKYPS--VRLKGEGADGELLSVAFAGGGQHMDVGTKMIHNGPNTRSNILSKGIVKDGARTVFRGL 309 (412) T ss_pred ECC-------CCEEEEEEEE--EEEECCCCEEEEEEEEECCCCCEECCCCEEEEECCCCEEEEEEEEEEECCCEEEEEEE T ss_conf 636-------8705999742--7997599579999999747980881142899956995668999889927868999808 Q ss_pred EEEEEECCCCCCEEEEEEEECCC-CEEEEEEEEEEC Q ss_conf 18998367661101000010589-468994014733 Q gi|254781059|r 353 TKMLHLAKNTTSRIVSKGISAGF-SNNTYRGKVSVH 387 (489) Q Consensus 353 ~~v~H~~p~t~S~~~~K~i~~d~-s~~~f~G~i~v~ 387 (489) ..+.+.|++|.+++..+.++.+. |++.-.=.+.|. T Consensus 310 i~i~~~A~~s~a~~~c~~LlLs~~a~~dt~P~LeI~ 345 (412) T COG0719 310 IDIRKGATGSDAYQECRTLLLSDDARADTVPILEIE 345 (412) T ss_pred EEEECCCCCCCCEEEECEEEECCCCCEECCEEEEEE T ss_conf 998368877601347157998699756516079997 No 10 >TIGR01981 sufD FeS assembly protein SufD; InterPro: IPR011542 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] . FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins , . It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly . The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA . SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA , acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins . Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen . This entry represents SufD proteins that form part of the SufBCD complex in the SUF system. No specific functions have been assigned to these proteins.; GO: 0005515 protein binding, 0016226 iron-sulfur cluster assembly. Probab=98.11 E-value=0.0015 Score=45.67 Aligned_cols=180 Identities=13% Similarity=0.077 Sum_probs=124.7 Q ss_pred HHHCCCEEEEECCCCCCCCCEEEEEEE--CC-CCC-E-EC-CEEEEEEECCCEEEEEECCCCCCCCCCCCCCE-EEEEEE Q ss_conf 642384388734774134212799996--58-884-0-02-21489972772499996156765444444330-589986 Q gi|254781059|r 194 AVFTDGSFVYIPKGVRCPMELSTYFRI--NE-KGT-G-QF-ERTLIIADEGSYVSYLEGCTAPQRDENQLHAA-VVELVA 266 (489) Q Consensus 194 A~~~~G~fi~Vp~gv~v~~PI~~~~~~--~~-~~~-~-~f-~r~lIi~ee~S~v~~iE~~~s~~~~~~~~~~~-v~Ei~v 266 (489) .++..=..|.|++|.++.. +.+. .+ .+. . .+ ..+-|.+++||+++++---. .....+|-. .+.+.+ T Consensus 149 ~~~~~~~~i~~~~~~~~~v----~e~~~~~~~~~~~~Y~~n~v~~i~~~~~A~v~~~~~~~---~~~~~~~~~W~~~~~~ 221 (397) T TIGR01981 149 AFVHPRLLIEVEKGSKITV----IERFEGISLNESKAYFTNSVVEITVGANASVSFVRVQD---FSKNSTHFSWNHRAKI 221 (397) T ss_pred EEEECCEEEEECCCCEEEE----EEEEECCCCCCCCEEEEECEEEEEECCCCEEEEEEEEC---CCCCEEEEEEEEEEEE T ss_conf 3653453899868980799----99832578886431465040899974487899999615---8981699865577765 Q ss_pred CCCCEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCCEEEEEEEEEE--CCCEEEEEEEEEEECCCCEEEEE--EEEE Q ss_conf 6995899999630475531144874640146786304630389988751--46347885389982787279999--8776 Q gi|254781059|r 267 LDNAEIKYSTVQNWYPGDSQGKGGIYNFVTKRGDCRGMRSKISWTQLET--GSAITWKYPSCLLRGEQSCGEFY--SIAI 342 (489) Q Consensus 267 g~nA~v~~~~iQn~~~~~~~~~~~~~~~~t~r~~~~~~~a~~~~~~~~l--G~~~~r~~~~~~L~G~~a~~~~~--gi~l 342 (489) ++||+++++.++.- . .-...++.. ...|++|+.+...+.+ |....=....+...|+.+.++.. |+.. T Consensus 222 ~~~a~~~~~~~~~G---g---~~~~~~~~~---~l~G~~s~~~~~gl~~~~~~~~~D~~~~v~h~~~~~~S~~~~~g~~~ 292 (397) T TIGR01981 222 GKDARLRLAVVNLG---G---KLSRHNLDV---DLEGEGSKLEVKGLAIGAGRQHVDVRTLVEHNGPDTKSNILHRGILD 292 (397) T ss_pred ECCCEEEEEEEECC---C---CEEEEEEEE---EEECCCEEEEEEEEEEECCCCEEEEEEEEEECCCCCEEEEEEEEEEC T ss_conf 12865899999927---8---579999878---88246407999998751798157777568871666243289977992 Q ss_pred ECCCEEEEEEEEEEEECCCCCCEEEEEEEEC-CCCEEEEEEEEEECCC Q ss_conf 2598689972189983676611010000105-8946899401473388 Q gi|254781059|r 343 SNGFQQIDSGTKMLHLAKNTTSRIVSKGISA-GFSNNTYRGKVSVHRG 389 (489) Q Consensus 343 ~~~~Q~~D~~~~v~H~~p~t~S~~~~K~i~~-d~s~~~f~G~i~v~~~ 389 (489) -++.-.+.=...|...|.+|.+.+..|.++. ++|+..-.=..-|..+ T Consensus 293 d~a~~vF~G~~~i~~~A~~tda~~~~~~LlLs~~A~~d~~P~LeI~~d 340 (397) T TIGR01981 293 DRAHGVFRGIIRIPKGAKGTDAEQLNRNLLLSDKARADTKPQLEIDAD 340 (397) T ss_pred CCCEEEEEEEEEECCCCCCCCEEEEEEEEEECCCCEEEECEEEEEECC T ss_conf 684299998788465574863365631575478762871216799758 No 11 >PRK10948 cysteine desulfurase activator complex subunit SufD; Provisional Probab=97.87 E-value=0.0039 Score=42.55 Aligned_cols=178 Identities=11% Similarity=-0.008 Sum_probs=104.5 Q ss_pred EEEEEEECCCEE----EEEECCCCCCCCC-CCCCCEEEEEEECCCCEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECC Q ss_conf 148997277249----9996156765444-44433058998669958999996304755311448746401467863046 Q gi|254781059|r 230 RTLIIADEGSYV----SYLEGCTAPQRDE-NQLHAAVVELVALDNAEIKYSTVQNWYPGDSQGKGGIYNFVTKRGDCRGM 304 (489) Q Consensus 230 r~lIi~ee~S~v----~~iE~~~s~~~~~-~~~~~~v~Ei~vg~nA~v~~~~iQn~~~~~~~~~~~~~~~~t~r~~~~~~ 304 (489) -..|.+.+|..+ .++ .+++..... ......-.-|.+++||+++.+--.- +.+ ......|-++. +..++ T Consensus 138 G~~I~V~~n~~v~kPI~il-~i~s~~~~~~~~~~~~R~lI~l~enS~~tIIE~~~-~~~---~~~~f~N~vte--I~l~~ 210 (424) T PRK10948 138 VTHIRLPRGQRPAKPLYLL-HITQGVDGDELNTAHYRHHLDLAEGAEATVIEHFV-SLN---EARHFTGARLT--MNVGD 210 (424) T ss_pred CEEEEECCCCCCCCCEEEE-EEECCCCCCCCCCCCCEEEEEECCCCEEEEEEEEE-CCC---CCCCEEEEEEE--EEECC T ss_conf 7799987997057898999-98158863200034404899988998899999861-688---76706603899--99879 Q ss_pred CEEEEEEEEEECCCEEEE--EEEEEEECCCCEEEEEEEEEECCCEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEE Q ss_conf 303899887514634788--538998278727999987762598689972189983676611010000105894689940 Q gi|254781059|r 305 RSKISWTQLETGSAITWK--YPSCLLRGEQSCGEFYSIAISNGFQQIDSGTKMLHLAKNTTSRIVSKGISAGFSNNTYRG 382 (489) Q Consensus 305 ~a~~~~~~~~lG~~~~r~--~~~~~L~G~~a~~~~~gi~l~~~~Q~~D~~~~v~H~~p~t~S~~~~K~i~~d~s~~~f~G 382 (489) +|+++++.+..-+.-+.+ ...+ -.+++|..+...+.++ ....-+...+.+.|+++.+.+. |+..-++.....- T Consensus 211 nA~L~h~klq~~~~~s~h~~~~~v-~~~~dS~~~~~~~slG--g~l~R~~~~~~L~Ge~a~~~l~--Gl~l~~~~qh~D~ 285 (424) T PRK10948 211 NAHLNHIKLAFENPSSYHFAHNDL-LLGRDARAFSHSFLLG--AGVLRHNTSSQLNGENSTLSLN--SLAMPVKNEVCDT 285 (424) T ss_pred CCEEEEEEEECCCCCCEEEEEEEE-EECCCCEEEEEEEEEC--CEEEEEEEEEEEECCCCEEEEE--EEEECCCCEEEEE T ss_conf 967999997316742378852479-9717957999999967--8558988899994799689998--7796358657620 Q ss_pred EEEECCCCCCCCCEEEEEEEEECCCCEEEEEEEEEEEC Q ss_conf 14733886587430221058845985486223689834 Q gi|254781059|r 383 KVSVHRGAAHSRNFTQCDSLLIGNNCGAHTIPYIESKN 420 (489) Q Consensus 383 ~i~v~~~A~~t~~~~~~~~LlLs~~a~~~s~P~LeI~~ 420 (489) ...|...+..|.++|.-+++ |+++|++.-.=.+.|.. T Consensus 286 ~t~i~H~~p~c~S~ql~K~I-l~d~s~gVF~GkI~V~~ 322 (424) T PRK10948 286 RTWLEHNKGYCNSRQLHKTI-VSDKGRAVFNGLIKVAQ 322 (424) T ss_pred EEEEEECCCCCEEEEEEEHC-CCCCCCEEEEEEEEECC T ss_conf 27887279983110221000-66787189997899907 No 12 >CHL00085 ycf24 putative ABC transporter Probab=97.80 E-value=0.0017 Score=45.21 Aligned_cols=211 Identities=16% Similarity=0.103 Sum_probs=122.2 Q ss_pred CEEEEEEECCCEEEEE--ECCCCC-CCCCCCCCCEEEE--EEECCCCEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEEC Q ss_conf 2148997277249999--615676-5444444330589--9866995899999630475531144874640146786304 Q gi|254781059|r 229 ERTLIIADEGSYVSYL--EGCTAP-QRDENQLHAAVVE--LVALDNAEIKYSTVQNWYPGDSQGKGGIYNFVTKRGDCRG 303 (489) Q Consensus 229 ~r~lIi~ee~S~v~~i--E~~~s~-~~~~~~~~~~v~E--i~vg~nA~v~~~~iQn~~~~~~~~~~~~~~~~t~r~~~~~ 303 (489) .=+-|++++||+++|. +.|... .....+..+-++. +.++++|++..+.+.. +++.+..+.+ ....| T Consensus 256 ~vvEv~v~~~A~v~yttiqNW~~g~~~~~~~v~n~vtkR~~~~~~~a~i~W~~~~~-------Gs~~t~~yps--~~L~G 326 (485) T CHL00085 256 AVVELIALENAEIKYSTVQNWYAGDQNGEGGIYNFVTKRGLCAGKNSKISWTQVET-------GSAITWKYPS--CILIG 326 (485) T ss_pred EEEEEEECCCCEEEEEEEECCCCCCCCCCCCEEEEEEEEEEECCCCCEEEEEEECC-------CCCEEEECCC--EEEEC T ss_conf 89999976796888988762555452466765888762123305797899888515-------7720330443--18973 Q ss_pred CCEEEEEEEEEECCCEEEEE--EEEEEECCCCEEEEEEEEEECCCEEEEEEE--EEEEECCCCCCEEEEEEEEC-CCCEE Q ss_conf 63038998875146347885--389982787279999877625986899721--89983676611010000105-89468 Q gi|254781059|r 304 MRSKISWTQLETGSAITWKY--PSCLLRGEQSCGEFYSIAISNGFQQIDSGT--KMLHLAKNTTSRIVSKGISA-GFSNN 378 (489) Q Consensus 304 ~~a~~~~~~~~lG~~~~r~~--~~~~L~G~~a~~~~~gi~l~~~~Q~~D~~~--~v~H~~p~t~S~~~~K~i~~-d~s~~ 378 (489) ++|.-++..+.+++.-...+ ..+.+.|+++.+++.+-.+..++-+.-.+. +|.+.|.+|++.+-.++++. ++|++ T Consensus 327 ~gs~~~~~si~~a~~~Q~~D~gtk~~H~ap~T~S~iisK~I~~~~s~~~frG~v~i~~~A~~s~~~~~c~~LlL~~~a~~ 406 (485) T CHL00085 327 NNSQGEFYSVALTNNYQQADTGTKMIHIGKNTKSRIISKGISAGKSKNTYRGLVKISKKAINSRNYSQCDSLLIGKNSQA 406 (485) T ss_pred CCCEEEEEEEEECCCCEEECCCEEEEEEECCCEEEEEEEHHHCCCCEEEECCCCCCCCCCCCCCCCEEECEEEECCCCCC T ss_conf 89789998888515779984560699980685446753011238976774550312788788842017652877688653 Q ss_pred EEEEEEEECCCCCCCCCEEEEEEEEECCCCEEEEEEEEEEECCCCEEEEEEEECCCCHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 99401473388658743022105884598548622368983487568875320247999999998378898999999999 Q gi|254781059|r 379 TYRGKVSVHRGAAHSRNFTQCDSLLIGNNCGAHTIPYIESKNSSCNLEHEATTSKISDEHLFYCLQRGIPEEAAIALIVN 458 (489) Q Consensus 379 ~f~G~i~v~~~A~~t~~~~~~~~LlLs~~a~~~s~P~LeI~~ddv~~~H~Atvg~id~e~LFYL~sRGi~~~~A~~liv~ 458 (489) --.=.+.|..+ .++. +-.|.+-- |.++..- - --.=-|++++=--|.-||.=++..+.|=+ T Consensus 407 dt~P~l~i~~~--dv~~---------~HeAtvgk-----I~eeqlf--Y-LmSRGl~e~eA~~lIV~GF~~~v~~~LP~- 466 (485) T CHL00085 407 NTFPYIQVQNP--SAKI---------EHEASTSK-----IGEEQIF--Y-FLQRGINLEEAISLMISGFCQDVFNKLPM- 466 (485) T ss_pred CCCCEEEEECC--CCEE---------EECEECCC-----CCHHHHH--H-HHHCCCCHHHHHHHHHHHHHHHHHHHCCH- T ss_conf 56744898348--8307---------85304356-----7999999--9-98669999999999999847988865898- Q ss_pred HHHHHHHHHC Q ss_conf 9899998858 Q gi|254781059|r 459 GFVREVLQKL 468 (489) Q Consensus 459 gF~~~v~~~i 468 (489) =|..|+-.-| T Consensus 467 E~a~E~~~li 476 (485) T CHL00085 467 EFALEADRLL 476 (485) T ss_pred HHHHHHHHHH T ss_conf 9999999998 No 13 >PRK11814 cysteine desulfurase activator complex subunit SufB; Provisional Probab=97.30 E-value=0.013 Score=38.63 Aligned_cols=148 Identities=15% Similarity=0.141 Sum_probs=86.9 Q ss_pred EEEEEEECCCEEEEE--ECCCCCC-CCCCCCCCEEEE--EEECCCCEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECC Q ss_conf 148997277249999--6156765-444444330589--98669958999996304755311448746401467863046 Q gi|254781059|r 230 RTLIIADEGSYVSYL--EGCTAPQ-RDENQLHAAVVE--LVALDNAEIKYSTVQNWYPGDSQGKGGIYNFVTKRGDCRGM 304 (489) Q Consensus 230 r~lIi~ee~S~v~~i--E~~~s~~-~~~~~~~~~v~E--i~vg~nA~v~~~~iQn~~~~~~~~~~~~~~~~t~r~~~~~~ 304 (489) =+-|++++||+++|. +.+.+.. ....+..|-++. +..+++|++..+.+. .++.-...+. .....|+ T Consensus 256 vvEi~v~~~A~v~y~tiQNW~~gd~~~~~~v~n~vtkR~~~~~~~a~i~W~~~~-------~Gs~it~~yp--s~~L~G~ 326 (484) T PRK11814 256 VVELVALDDAEIKYSTVQNWYPGDENGKGGIYNFVTKRGLCRGENSKISWTQVE-------TGSAITWKYP--SCILRGD 326 (484) T ss_pred EEEEEECCCCEEEEEEEECCCCCCCCCCCCEEEEEEEHHHHCCCCCEEEEEEEE-------CCCCEEECCC--CEEEECC T ss_conf 899997689778899986045545246674489888401221678558888863-------3870021256--1589748 Q ss_pred CEEEEEEEEEECCCEEEEE--EEEEEECCCCEEEEEEEEEECCCEEEEEEE--EEEEECCCCCCEEEEEEEECC-CCEEE Q ss_conf 3038998875146347885--389982787279999877625986899721--899836766110100001058-94689 Q gi|254781059|r 305 RSKISWTQLETGSAITWKY--PSCLLRGEQSCGEFYSIAISNGFQQIDSGT--KMLHLAKNTTSRIVSKGISAG-FSNNT 379 (489) Q Consensus 305 ~a~~~~~~~~lG~~~~r~~--~~~~L~G~~a~~~~~gi~l~~~~Q~~D~~~--~v~H~~p~t~S~~~~K~i~~d-~s~~~ 379 (489) +|.-++..+.++..-...+ ..+...|+++.+++.+-.+..+.-+.-.+. +|.+.|.+|++.+..++++.+ +|++- T Consensus 327 gs~~~~~sv~~a~~~Q~~D~gtk~~H~ap~t~S~ii~K~I~~d~s~~~frG~v~i~~~A~~s~~~~~c~~LlL~~~a~~d 406 (484) T PRK11814 327 NSVGEFYSVALTNGHQQADTGTKMIHIGKNTKSTIISKGISAGHSQNTYRGLVKIMPKATNARNFTQCDSLLIGDQCGAH 406 (484) T ss_pred CCEEEEEEEEECCCCEEEECCEEEEEECCCCEEEEEHHHHCCCCCEEEEEEEEEECCCCCCCHHHHHHHHEECCCCCCCC T ss_conf 97899988997269899813307999757760135400122068879980257755774467034303115017886620 Q ss_pred EEEEEEE Q ss_conf 9401473 Q gi|254781059|r 380 YRGKVSV 386 (489) Q Consensus 380 f~G~i~v 386 (489) -.=.+.| T Consensus 407 t~P~leI 413 (484) T PRK11814 407 TFPYIEV 413 (484) T ss_pred CCCEEEE T ss_conf 0653899 No 14 >TIGR01980 sufB FeS assembly protein SufB; InterPro: IPR010231 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] . FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins , . It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly . The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA . SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA , acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins . Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen . This entry represents SufB, which is part of the SUF system and forms a complex with SufBCD.; GO: 0005515 protein binding, 0016226 iron-sulfur cluster assembly. Probab=95.70 E-value=0.2 Score=29.78 Aligned_cols=289 Identities=15% Similarity=0.202 Sum_probs=167.9 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCC--CC--CCC-EEEECCCCCCCC----CCC----CCCCCHHHHHHHHHCCCCHHHHH Q ss_conf 999999999968877866587665--68--200-031005653334----332----10158788887765176511111 Q gi|254781059|r 57 WRLLAYRRWLTMKDPSWARIRYPA--ID--FDD-LYYYAAPKSIVG----PQS----LDDVDPELLRTYEKLGIPIREQE 123 (489) Q Consensus 57 ~R~~Al~~f~~l~~P~W~~~~l~~--i~--~~~-i~~~~~p~~~~~----~~s----ld~v~~ei~~tf~kLgip~~e~~ 123 (489) .=.+=-+.|++||+|.-.+..|.+ .. |+. +.|.+-.+.... ..+ +.+-|.=+++.|-+.-+|..+.+ T Consensus 98 vP~eIk~TFEkLGIPeaE~k~LaGvav~AQYdSEviY~~~Ke~L~~~GVIF~d~d~Al~~YPdLVKeYF~~v~VPp~DnK 177 (469) T TIGR01980 98 VPDEIKDTFEKLGIPEAERKALAGVAVGAQYDSEVIYHNIKEDLEEKGVIFCDMDTALKEYPDLVKEYFGKVCVPPSDNK 177 (469) T ss_pred CCHHHHHHHHHCCCCHHHHHHHCCCEEEEEEEEEEEECCCHHHHHHCCEEECCHHHHHHHCHHHHHHHCCCCCCCCCCCH T ss_conf 67899987644486179998742535778862044622204675629878638599865172567866270025876341 Q ss_pred HHCCCCCCCCEEEEEE-CCEEEECCCHHHHHHCCEEE-EEHHHHHHHHHHHHHHHCCCEECCCCCHHHHHHH-HHHCCCE Q ss_conf 1023444431000121-35022020056787479088-5589987777898862022112456531466887-6423843 Q gi|254781059|r 124 ILAGVQTSKVAVDAVF-DSVSVVTTFKEELKKSGVIF-MPISEAICEHPDLVKKYLSTVVPVGDNFYAALNS-AVFTDGS 200 (489) Q Consensus 124 ~l~g~~~~~v~v~~~~-ds~~v~~~l~~~L~~~GVi~-~~l~eA~~~~p~lv~~~~~~~~~~~~~~f~aLn~-A~~~~G~ 200 (489) |-|- +. |+. ++.-||- | +||-+ .||. -|| ++..++..-|- || -....|+ T Consensus 178 FAAL----n~---AVWSGGsFvYV--P-----kGV~v~~PLq-----------tYF-RIN~~~~GQFE--rTLiIadeGa 229 (469) T TIGR01980 178 FAAL----NG---AVWSGGSFVYV--P-----KGVRVDMPLQ-----------TYF-RINSENTGQFE--RTLIIADEGA 229 (469) T ss_pred HHHH----HH---HEEECCEEEEE--C-----CCCEECCCCC-----------CCE-ECCCCCCCCCE--EEEEEEEECC T ss_conf 4664----02---11027857984--7-----8986666300-----------000-02847788632--5889995177 Q ss_pred EEEECCCCCCCCCEEEEEEECCCCCEECCEEEEEEECCCEEEE--EECCCCCCCCCCCCCCEEEE-EEEC-CCCEEEEEE Q ss_conf 8873477413421279999658884002214899727724999--96156765444444330589-9866-995899999 Q gi|254781059|r 201 FVYIPKGVRCPMELSTYFRINEKGTGQFERTLIIADEGSYVSY--LEGCTAPQRDENQLHAAVVE-LVAL-DNAEIKYST 276 (489) Q Consensus 201 fi~Vp~gv~v~~PI~~~~~~~~~~~~~f~r~lIi~ee~S~v~~--iE~~~s~~~~~~~~~~~v~E-i~vg-~nA~v~~~~ 276 (489) .||==+| |..|+.. .++=.+ .-+-|+|.+||++.| |+.|+....+.-+..|=||. -+|+ +|+++.-+. T Consensus 230 ~VhYiEG--CsAP~Y~---~~sLHA---a~VE~~v~~~A~~rYtTvQNWs~N~~~KGGvyNlvTKRA~veG~ng~meWv~ 301 (469) T TIGR01980 230 SVHYIEG--CSAPVYS---TNSLHA---AVVELIVKEDARVRYTTVQNWSKNDKNKGGVYNLVTKRALVEGENGTMEWVD 301 (469) T ss_pred EEEEEEC--CCCCCCC---CCCCEE---EEEEEEECCCCEEEEEEEEECCCCCCCCCCEEECCCCEEEEECCCCEEEEEE T ss_conf 4888702--3266568---887133---5899998378758984687303788998544866872378962687799960 Q ss_pred EEECCCCCCCCCCCEEEEEEEEEEEECCCEEEEEEEEEECCCEEEEEEE--EEEECCCCEEEEEEEEEECCCEEEEEEE- Q ss_conf 6304755311448746401467863046303899887514634788538--9982787279999877625986899721- Q gi|254781059|r 277 VQNWYPGDSQGKGGIYNFVTKRGDCRGMRSKISWTQLETGSAITWKYPS--CLLRGEQSCGEFYSIAISNGFQQIDSGT- 353 (489) Q Consensus 277 iQn~~~~~~~~~~~~~~~~t~r~~~~~~~a~~~~~~~~lG~~~~r~~~~--~~L~G~~a~~~~~gi~l~~~~Q~~D~~~- 353 (489) |+ .+|+-++-|-. .+..|++|+=+.++++|-+.-...++- ++..+++.++.+.+=+++.++-....+- T Consensus 302 ------Gs-~GSK~Tm~YPs--~iL~G~~A~~e~~sIa~A~~gQ~~DTGaK~~H~AP~TSStI~sKSIS~ggG~~~YRGL 372 (469) T TIGR01980 302 ------GS-IGSKITMKYPS--SILKGEGAKTEFLSIALAGKGQHLDTGAKMIHLAPNTSSTIISKSISKGGGKSSYRGL 372 (469) T ss_pred ------CC-HHHHHHHHCCE--EEECCCCCCCCCEEEEEECCCCCEECCCEEEEECCCCEEEEEEEEEECCCCEEEEEEE T ss_conf ------24-00024412552--4782886211422778724763000075368708996248898788758874430257 Q ss_pred -EEEEECCCCCCEEEEEEEECCC-CEE-------EEEEEEEECCCC Q ss_conf -8998367661101000010589-468-------994014733886 Q gi|254781059|r 354 -KMLHLAKNTTSRIVSKGISAGF-SNN-------TYRGKVSVHRGA 390 (489) Q Consensus 354 -~v~H~~p~t~S~~~~K~i~~d~-s~~-------~f~G~i~v~~~A 390 (489) +|.-.|.+++|++..=+++.|+ |.+ |++-.+.|...| T Consensus 373 Vk~~~~A~~~k~~v~CD~Ll~Dd~S~SdT~P~~ei~n~~v~~~HEA 418 (469) T TIGR01980 373 VKIGKGAKGAKSHVQCDSLLLDDESASDTIPYIEIFNDTVSVEHEA 418 (469) T ss_pred EEECCCCCCCCCEEEEEEEEECCCCCCCCCCEEEEECCCCCEEEEE T ss_conf 8865765577111256655657842213243055526833247855 No 15 >pfam00073 Rhv picornavirus capsid protein. CAUTION: This alignment is very weak. It can not be generated by clustalw. If a representative set is used for a seed, many so-called members are not recognized. The family should probably be split up into sub-families. Capsid proteins of picornaviruses. Picornaviruses are non-enveloped plus-strand ssRNA animal viruses with icosahedral capsids. They include rhinovirus (common cold) and poliovirus. Common structure is an 8-stranded beta sandwich. Variations (one or two extra strands) occur. Probab=75.60 E-value=5.8 Score=19.01 Aligned_cols=53 Identities=13% Similarity=0.110 Sum_probs=37.9 Q ss_pred HHHHHCCC--EEEEECCCCCCCCCEEEEEEECCCCCEECCEEEEEEECCCEEEEEECCCCCCC Q ss_conf 87642384--38873477413421279999658884002214899727724999961567654 Q gi|254781059|r 192 NSAVFTDG--SFVYIPKGVRCPMELSTYFRINEKGTGQFERTLIIADEGSYVSYLEGCTAPQR 252 (489) Q Consensus 192 n~A~~~~G--~fi~Vp~gv~v~~PI~~~~~~~~~~~~~f~r~lIi~ee~S~v~~iE~~~s~~~ 252 (489) |...|..| .+.|||+|...+... .....+||.++-++.++.+++.-=|.++.. T Consensus 95 ~~s~~~~g~llv~~vP~g~~~p~~~--------~qa~~~ph~~~~~~~n~~~~~~vPyis~~~ 149 (173) T pfam00073 95 TGSQFHQGKLLVAYVPPGAPKPTSR--------WQATLNPHQFWNLGTNSSARLSVPYVSIAP 149 (173) T ss_pred ECCCCCCCEEEEEEECCCCCCCCCH--------HHHHCCCEEEEECCCCCEEEEEECCCCCCC T ss_conf 5880246559999906888899898--------996108668894798985899976161657 No 16 >PRK12440 acetate kinase; Reviewed Probab=64.84 E-value=8.5 Score=17.73 Aligned_cols=31 Identities=13% Similarity=0.253 Sum_probs=26.9 Q ss_pred EEEECCCCHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 5320247999999998378898999999999 Q gi|254781059|r 428 EATTSKISDEHLFYCLQRGIPEEAAIALIVN 458 (489) Q Consensus 428 ~Atvg~id~e~LFYL~sRGi~~~~A~~liv~ 458 (489) |.-.|.||+--++|||.+|++.++...+|-+ T Consensus 238 gTRsGdiDPgvv~~l~~~g~s~~e~~~~Lnk 268 (397) T PRK12440 238 GTRCGDLDPGIIEYLLKKGWSQEQVFNSLNK 268 (397) T ss_pred CCCCCCCCHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 7889886847999999849999999999875 No 17 >cd01089 PA2G4-like Related to aminopepdidase M, this family contains proliferation-associated protein 2G4. Family members have been implicated in cell cycle control. Probab=64.06 E-value=9.8 Score=17.28 Aligned_cols=82 Identities=20% Similarity=0.199 Sum_probs=35.0 Q ss_pred CCEEEEEHHHHHHHHHHHHHHHCCCEEC-CCCCHHH-HHHHHHHCCCEEEEECCCCCCCCCEEEEEEECCCCCEECCEEE Q ss_conf 7908855899877778988620221124-5653146-6887642384388734774134212799996588840022148 Q gi|254781059|r 155 SGVIFMPISEAICEHPDLVKKYLSTVVP-VGDNFYA-ALNSAVFTDGSFVYIPKGVRCPMELSTYFRINEKGTGQFERTL 232 (489) Q Consensus 155 ~GVi~~~l~eA~~~~p~lv~~~~~~~~~-~~~~~f~-aLn~A~~~~G~fi~Vp~gv~v~~PI~~~~~~~~~~~~~f~r~l 232 (489) -|+-+.++..|++++ +++|=...+. ..-|-.- -+|..-......--+..+.. .|+..++--...-.+||+||+ T Consensus 142 pG~~~~dI~~ai~~~---~~~~g~~~v~~~~gH~~~~~~~~~~~~~~~~e~~~~~~~--~~~~~~~~~dg~~~A~fEhTv 216 (228) T cd01089 142 PGNQNSDITEAIQKV---IVDYGCTPVEGVLSHQLKRVVSSGEGKAKLVECVKHGLL--FPYPVLYEKEGEVVAQFKLTV 216 (228) T ss_pred CCCCHHHHHHHHHHH---HHHCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEE--EEEEEEECCCCCEEEEEEEEE T ss_conf 998968999999999---998499677452056999817779998885413546718--888789857998887833799 Q ss_pred EEEECCCEE Q ss_conf 997277249 Q gi|254781059|r 233 IIADEGSYV 241 (489) Q Consensus 233 Ii~ee~S~v 241 (489) +|.+.|.++ T Consensus 217 lVT~~G~eI 225 (228) T cd01089 217 LLTPNGVTV 225 (228) T ss_pred EECCCCCEE T ss_conf 997998877 No 18 >PRK09490 metH B12-dependent methionine synthase; Provisional Probab=58.04 E-value=12 Score=16.56 Aligned_cols=48 Identities=13% Similarity=0.010 Sum_probs=30.7 Q ss_pred EECCCCEEEEEEEECCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 834875688753202479999999983788989999999999899998858 Q gi|254781059|r 418 SKNSSCNLEHEATTSKISDEHLFYCLQRGIPEEAAIALIVNGFVREVLQKL 468 (489) Q Consensus 418 I~~ddv~~~H~Atvg~id~e~LFYL~sRGi~~~~A~~liv~gF~~~v~~~i 468 (489) ....|+-+.+..|+|.=-++-.=-+..+| ++=..++++|.....-+.+ T Consensus 1058 sg~~DyiG~FaVTaG~g~~e~a~~f~~~~---DdY~~im~kaLadrLAEA~ 1105 (1229) T PRK09490 1058 SGKADYIGGFAVTAGLGEDELADRFEAAH---DDYNAIMVKALADRLAEAF 1105 (1229) T ss_pred CCCCCEEEEEEEECCCCHHHHHHHHHHCC---CHHHHHHHHHHHHHHHHHH T ss_conf 89886577788716805899999999728---8099999999999999999 No 19 >PRK07668 hypothetical protein; Validated Probab=57.96 E-value=12 Score=16.54 Aligned_cols=33 Identities=21% Similarity=0.233 Sum_probs=19.2 Q ss_pred CCCCCCEECCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 413531020488889999999986329988999999999999 Q gi|254781059|r 23 FETNIQVDKAPKGLNEDIIRFISAKKNEPDWMLEWRLLAYRR 64 (489) Q Consensus 23 ~~~~~~~~~~~~gl~~~~i~~iS~~k~EP~Wl~e~R~~Al~~ 64 (489) |-.+...|-.-||.+|+-|++| +.|.+....+. T Consensus 9 FL~~~r~yL~~~Gkke~dI~~~---------~eele~HL~e~ 41 (248) T PRK07668 9 FLDDTRVYLIAKGIKEEDIESF---------LEDAELHLIEG 41 (248) T ss_pred HHHHHHHHHHHCCCCHHHHHHH---------HHHHHHHHHHH T ss_conf 9999999999918978899999---------99999999999 No 20 >pfam12247 MKT1_N Temperature dependent protein affecting M2 dsRNA replication. This domain family is found in eukaryotes, and is typically between 231 and 255 amino acids in length. There is a single completely conserved residue P that may be functionally important. MKT1 is required for maintenance of K2 toxin above 30 degrees C in strains with the L-A-HN variant of the L-A double-stranded RNA virus of Saccharomyces cerevisiae. MKT1 is a 93 kDa protein with serine-rich regions and the retroviral protease signature, DTG. This family is the N terminal region of MKT1. Probab=56.02 E-value=6.1 Score=18.84 Aligned_cols=47 Identities=19% Similarity=0.378 Sum_probs=30.3 Q ss_pred ECCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCHHH--HHHHHHHH Q ss_conf 024799999999837889899999999998999988589689--99999999 Q gi|254781059|r 431 TSKISDEHLFYCLQRGIPEEAAIALIVNGFVREVLQKLPMEF--AVEAQKLI 480 (489) Q Consensus 431 vg~id~e~LFYL~sRGi~~~~A~~liv~gF~~~v~~~ip~e~--~~e~~~li 480 (489) +|+-=+++|||+||+||=-.+-..-|+.|.+ ++.-|++- ..|-++++ T Consensus 8 iG~rLP~ely~y~s~GLl~~~ll~~lt~G~~---~~~~PL~~~~S~eYr~lv 56 (90) T pfam12247 8 IGQRLPPELYYYLSAGLLSPRLLTALTQGEL---IDEPPLDGGDSDEYRKLV 56 (90) T ss_pred HCCCCCHHHHHHHHCCCCCHHHHHHHHEEEE---ECCCCCCCCCCHHHHHHH T ss_conf 5267979999999856667767737510147---323897887779999999 No 21 >TIGR00501 met_pdase_II methionine aminopeptidase, type II; InterPro: IPR002468 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M24 (clan MG), subfamily M24A. Methionine aminopeptidase (3.4.11.18 from EC) (MAP) is responsible for the removal of the amino-terminal (initiator) methionine from nascent eukaryotic cytosolic and cytoplasmic prokaryotic proteins if the penultimate amino acid is small and uncharged. All MAP studied to date are monomeric proteins that require cobalt ions for activity. Two subfamilies of MAP enzymes are known to exist , . While being evolutionary related, they only share a limited amount of sequence similarity mostly clustered around the residues shown, in the Escherichia coli MAP , to be involved in cobalt-binding. The first family consists of enzymes from prokaryotes as well as eukaryotic MAP-1, while the second group (IPR002467 from INTERPRO) is made up of archaeal MAP and eukaryotic MAP-2 and includes proteins which do not seem to be MAP, but that are clearly evolutionary related such as mouse proliferation-associated protein 1 and fission yeast curved DNA-binding protein. ; GO: 0004239 methionyl aminopeptidase activity, 0006508 proteolysis. Probab=52.29 E-value=8.4 Score=17.80 Aligned_cols=57 Identities=21% Similarity=0.374 Sum_probs=28.9 Q ss_pred HHHHHCCCEECCCCCH--HHHHHHHHHCCCEEEEECCCCCCCCCEEEEEEECCCCCEECCEEEEEEECCCEEE Q ss_conf 8862022112456531--4668876423843887347741342127999965888400221489972772499 Q gi|254781059|r 172 LVKKYLSTVVPVGDNF--YAALNSAVFTDGSFVYIPKGVRCPMELSTYFRINEKGTGQFERTLIIADEGSYVS 242 (489) Q Consensus 172 lv~~~~~~~~~~~~~~--f~aLn~A~~~~G~fi~Vp~gv~v~~PI~~~~~~~~~~~~~f~r~lIi~ee~S~v~ 242 (489) .-++||.+ ..+.| ..+||. +...|++ =+.|+ +--.++.-.+||+||+|+-+.|.+++ T Consensus 267 FA~RwL~~---~~~~Kr~~~~l~~-L~r~~~~--------~~YPv--L~e~~G~~vaQ~EHTilv~~~g~~v~ 325 (327) T TIGR00501 267 FAKRWLEK---LEDEKRLRLALNK-LIREGVI--------YDYPV--LKEKSGGLVAQWEHTILVEEDGVEVT 325 (327) T ss_pred HHHHHHHH---CCCHHHHHHHHHH-HHHHCCC--------CCCCC--CCCCCCCEEEEEEEEEEEECCCEEEE T ss_conf 47889752---3330268999999-8873333--------47887--52478874523245889964834685 No 22 >PRK12318 methionine aminopeptidase; Provisional Probab=51.61 E-value=15 Score=15.83 Aligned_cols=87 Identities=16% Similarity=0.272 Sum_probs=41.8 Q ss_pred CCEEEEEHHHHHHHHHH-----HHHHHCCCE--ECCCCCHHHH----HHHHHHCCCEEEEECCCCCCCCCEEE------E Q ss_conf 79088558998777789-----886202211--2456531466----88764238438873477413421279------9 Q gi|254781059|r 155 SGVIFMPISEAICEHPD-----LVKKYLSTV--VPVGDNFYAA----LNSAVFTDGSFVYIPKGVRCPMELST------Y 217 (489) Q Consensus 155 ~GVi~~~l~eA~~~~p~-----lv~~~~~~~--~~~~~~~f~a----Ln~A~~~~G~fi~Vp~gv~v~~PI~~------~ 217 (489) -|+-+.++..|++++-+ .++.+.|-- ....+.++.. -+...+..|.++-|-+++-+...=.+ + T Consensus 180 pG~~~~dI~~ai~~~~~~~G~~~~~~~~GHGIG~~vHE~P~I~~~~~~~~~~L~pGMV~TIEPgI~~g~~~~~~~~~d~W 259 (291) T PRK12318 180 PNLPLYEIGEVIENCADTYGFSVVDQFVGHGVGIKFHENPYVPHHRNRSKIPLAPGMIFTIEPMINVGKKEGVIDPINHW 259 (291) T ss_pred CCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCEEEECCCCCCCCCCCEECCCCCC T ss_conf 99647799999999999849911768002872731034886564569998783799989879921247887477378985 Q ss_pred EEEC--CCCCEECCEEEEEEECCCEE Q ss_conf 9965--88840022148997277249 Q gi|254781059|r 218 FRIN--EKGTGQFERTLIIADEGSYV 241 (489) Q Consensus 218 ~~~~--~~~~~~f~r~lIi~ee~S~v 241 (489) -..+ ..-+++|+|+++|.+.|.++ T Consensus 260 t~~t~d~~~~a~~EhtvlIT~~G~ei 285 (291) T PRK12318 260 EARTCDNQPSAQWEHTILITETGYEI 285 (291) T ss_pred EEEECCCCEEEEECEEEEEECCCCEE T ss_conf 79914998475615589991992678 No 23 >COG2137 OraA Uncharacterized protein conserved in bacteria [General function prediction only] Probab=48.24 E-value=17 Score=15.46 Aligned_cols=42 Identities=19% Similarity=0.250 Sum_probs=29.3 Q ss_pred ECCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCC Q ss_conf 204888899999999863299889999999999999968877 Q gi|254781059|r 30 DKAPKGLNEDIIRFISAKKNEPDWMLEWRLLAYRRWLTMKDP 71 (489) Q Consensus 30 ~~~~~gl~~~~i~~iS~~k~EP~Wl~e~R~~Al~~f~~l~~P 71 (489) .-..+|++.++|.++-....+.+|...-+..+.++|.+...| T Consensus 93 eL~qkGi~~~~Ie~aL~~~~~~~~~~~a~~~~~kk~~~~~~~ 134 (174) T COG2137 93 ELKQKGIDDEIIEEALELIDEEDEQERARKVLRKKFKRENKP 134 (174) T ss_pred HHHHCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCC T ss_conf 999849989999999851265889999999999983766667 No 24 >KOG2738 consensus Probab=47.27 E-value=13 Score=16.50 Aligned_cols=28 Identities=25% Similarity=0.458 Sum_probs=13.5 Q ss_pred HHHCCCEECC---CCCHHHHHHHHHHCCCEE Q ss_conf 6202211245---653146688764238438 Q gi|254781059|r 174 KKYLSTVVPV---GDNFYAALNSAVFTDGSF 201 (489) Q Consensus 174 ~~~~~~~~~~---~~~~f~aLn~A~~~~G~f 201 (489) -+|-++++.+ +|.-=-+.|.|....|.| T Consensus 136 Ld~Aa~~v~PgvTTdEiD~~VH~a~Ierg~Y 166 (369) T KOG2738 136 LDYAATLVRPGVTTDEIDRAVHNAIIERGAY 166 (369) T ss_pred HHHHHHHCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 9987502579763889999999999864885 No 25 >PTZ00286 6-phospho-1-fructokinase; Provisional Probab=45.80 E-value=19 Score=15.23 Aligned_cols=58 Identities=24% Similarity=0.314 Sum_probs=31.3 Q ss_pred CCCHHHHHHHHHHCC-CEEEEECCCCCCCCCEEEEEEECCCCCEECCEEEEEEECCCEEEEE Q ss_conf 653146688764238-4388734774134212799996588840022148997277249999 Q gi|254781059|r 184 GDNFYAALNSAVFTD-GSFVYIPKGVRCPMELSTYFRINEKGTGQFERTLIIADEGSYVSYL 244 (489) Q Consensus 184 ~~~~f~aLn~A~~~~-G~fi~Vp~gv~v~~PI~~~~~~~~~~~~~f~r~lIi~ee~S~v~~i 244 (489) .+.-|.|+++++.++ --|+-||+ ++..+.-++..-......-.|-+|++-|||--.+. T Consensus 272 R~SGfIA~~A~LAs~~vd~cLIPE---~pf~~~Gll~~le~Rl~~~ghaVIVVAEGAGQd~~ 330 (486) T PTZ00286 272 RESGFIAAQTAVASAQANICLIPE---NPLPKETVMRLIERRFQQSRNCVIIVAEGFGQDWE 330 (486) T ss_pred CCCCHHHHHHHHCCCCCCEEEECC---CCCCCCHHHHHHHHHHHHCCCEEEEEECCCCCCCC T ss_conf 551289988775138777899248---77780059999999998479879999346774333 No 26 >PTZ00134 40S ribosomal protein S18; Provisional Probab=44.67 E-value=19 Score=15.08 Aligned_cols=26 Identities=31% Similarity=0.456 Sum_probs=18.5 Q ss_pred CCCHHHHHHHHHHCCC------CHHHHHHHHH Q ss_conf 8899999999863299------8899999999 Q gi|254781059|r 35 GLNEDIIRFISAKKNE------PDWMLEWRLL 60 (489) Q Consensus 35 gl~~~~i~~iS~~k~E------P~Wl~e~R~~ 60 (489) -|+++-|+.|....++ |.||+..|.. T Consensus 60 ~Lteeei~~i~~~I~~~~~~~iP~w~~Nr~kD 91 (154) T PTZ00134 60 ELTQEEINKIVAIISSPAQFKIPDWFLNRQKD 91 (154) T ss_pred CCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHH T ss_conf 49999999999997272101564887655777 No 27 >PRK06646 DNA polymerase III subunit chi; Provisional Probab=43.92 E-value=20 Score=15.00 Aligned_cols=51 Identities=20% Similarity=0.324 Sum_probs=34.0 Q ss_pred EHHHHHHHHHHHHHHHCCC-----EECCCCCHHHHHHHHHHC--CCEEEEECCCCCCC-----CCEEE Q ss_conf 5899877778988620221-----124565314668876423--84388734774134-----21279 Q gi|254781059|r 161 PISEAICEHPDLVKKYLST-----VVPVGDNFYAALNSAVFT--DGSFVYIPKGVRCP-----MELST 216 (489) Q Consensus 161 ~l~eA~~~~p~lv~~~~~~-----~~~~~~~~f~aLn~A~~~--~G~fi~Vp~gv~v~-----~PI~~ 216 (489) ++++|+ |.|++|-+.+ +...+.....+||..+|+ +++| +|-|...+ .||-+ T Consensus 13 ~Le~aL---p~LLeKsl~~Gwr~vV~~~~~er~~~Ld~~LWTy~~~SF--LPHG~~~~~~~~~QPV~L 75 (154) T PRK06646 13 LLLKSI---LLLIEKCYYSDLKSVILTTDADQQEMLNKNLWTYSRKQF--IPHGSKLDPQPEKQPIYI 75 (154) T ss_pred CHHHHH---HHHHHHHHHCCCEEEEEECCHHHHHHHHHHHCCCCCCCC--CCCCCCCCCCCCCCCEEE T ss_conf 099999---999999997598399994899999999864222688876--677666799846598897 No 28 >PRK05667 dnaG DNA primase; Validated Probab=43.41 E-value=20 Score=14.95 Aligned_cols=57 Identities=28% Similarity=0.566 Sum_probs=28.7 Q ss_pred HCCE--EEEEHHHHHH-HHHHHHHHHCCCEE-CCCCCH---HHHHH---HH--HHCCCE---EEEECCCCCC Q ss_conf 4790--8855899877-77898862022112-456531---46688---76--423843---8873477413 Q gi|254781059|r 154 KSGV--IFMPISEAIC-EHPDLVKKYLSTVV-PVGDNF---YAALN---SA--VFTDGS---FVYIPKGVRC 210 (489) Q Consensus 154 ~~GV--i~~~l~eA~~-~~p~lv~~~~~~~~-~~~~~~---f~aLn---~A--~~~~G~---fi~Vp~gv~v 210 (489) ..|+ .+.+|-+|+. +|-.++++|-.+++ ..+.|. =|++- .+ +...|. |+.+|.|.-- T Consensus 274 q~Gi~naVA~lGTAlT~~h~~~L~r~~~~vil~fDgD~AG~~Aa~ra~~~~~~~~~~g~~v~v~~lp~g~DP 345 (600) T PRK05667 274 QAGITNAVASLGTALTEEHLQLLRRYTDEVILCFDGDRAGRKAALRALELALPLLKDGRQLRFLFLPDGKDP 345 (600) T ss_pred HCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCC T ss_conf 849867753343416399999997368938999688588899999999987888638977999618999893 No 29 >TIGR02386 rpoC_TIGR DNA-directed RNA polymerase, beta' subunit; InterPro: IPR012754 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. This entry represents the beta-prime subunit, RpoC, found in most bacteria. It excludes some, mainly cyanobacterial, species where RpoC is replaced by two homologous proteins that include an additional domain. One arm of the "claw" is predominantly formed by this subunit, the other being predominantly formed by the beta subunit. The active site of the enzyme is defined by three invariant aspartate residues within the beta-prime subunit .; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006350 transcription. Probab=42.85 E-value=11 Score=17.01 Aligned_cols=40 Identities=18% Similarity=0.106 Sum_probs=17.6 Q ss_pred CEEEEEEEEEECCCEEEEEEEEEEEECCCCCC--EEEEEEEECC Q ss_conf 27999987762598689972189983676611--0100001058 Q gi|254781059|r 333 SCGEFYSIAISNGFQQIDSGTKMLHLAKNTTS--RIVSKGISAG 374 (489) Q Consensus 333 a~~~~~gi~l~~~~Q~~D~~~~v~H~~p~t~S--~~~~K~i~~d 374 (489) |..-+-.=+++..+.++=|+|+ |.|-=+.. ....+.+..+ T Consensus 1053 AVGvIAAQSIGEPGTQLTMRTF--H~GGvA~~~~~~~~~~~~~~ 1094 (1552) T TIGR02386 1053 AVGVIAAQSIGEPGTQLTMRTF--HTGGVAGASSEEESIKVKNG 1094 (1552) T ss_pred CCEEEEEEECCCCCCCCCCCHH--HCCCCCCCCCCCCHHECCCC T ss_conf 5237886406846771202011--01210123321000010036 No 30 >KOG3870 consensus Probab=42.81 E-value=9.3 Score=17.47 Aligned_cols=24 Identities=17% Similarity=0.354 Sum_probs=13.8 Q ss_pred ECCCEEEEEEEEEEECCCCEEEEE Q ss_conf 146347885389982787279999 Q gi|254781059|r 315 TGSAITWKYPSCLLRGEQSCGEFY 338 (489) Q Consensus 315 lG~~~~r~~~~~~L~G~~a~~~~~ 338 (489) ++.++...+|+=-..|+.|.+++- T Consensus 408 ~~e~L~~~~p~Wm~tG~yaViqf~ 431 (434) T KOG3870 408 LGEKLSKENPNWMTTGKYAVIQFC 431 (434) T ss_pred HHHHHHCCCCCEEEECCEEEEEEC T ss_conf 799973028871784467999936 No 31 >PRK07058 acetate kinase; Provisional Probab=42.39 E-value=21 Score=14.84 Aligned_cols=30 Identities=13% Similarity=0.258 Sum_probs=25.2 Q ss_pred EEECCCCHHHHHHHH-HCCCCHHHHHHHHHH Q ss_conf 320247999999998-378898999999999 Q gi|254781059|r 429 ATTSKISDEHLFYCL-QRGIPEEAAIALIVN 458 (489) Q Consensus 429 Atvg~id~e~LFYL~-sRGi~~~~A~~liv~ 458 (489) .-.|.||+-.++||| +.|++-++...||-+ T Consensus 239 TRsGdiDPgiv~~l~~~~g~s~~ei~~~Lnk 269 (393) T PRK07058 239 TRCGALDPGVVLHLLQQEGMSADEVEDLLYH 269 (393) T ss_pred CCCCCCCHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 7688758489999998739999999999875 No 32 >TIGR02656 cyanin_plasto plastocyanin; InterPro: IPR002387 Blue or 'type-1' copper proteins are small proteins which bind a single copper atom and which are characterised by an intense electronic absorption band near 600 nm , . The most well known members of this class of proteins are the plant chloroplastic plastocyanins, and the distantly related bacterial azurins, which exchange electrons with cytochrome c551. Plastocyanin participates in electron transfer between the cytochrome b6f complex and photosystem I. Many cyanobacteria and eukaryotic algae can synthesise both plastocyanin and its functional analog cytochrome c6, depending on bioavailabilities of copper and iron, respectively . Plastocyanin participates in electron transfer between P700 and the cytochrome b/f complex in photosystem I. ; GO: 0005507 copper ion binding, 0009055 electron carrier activity, 0006118 electron transport. Probab=41.88 E-value=21 Score=14.78 Aligned_cols=34 Identities=18% Similarity=0.358 Sum_probs=23.1 Q ss_pred CCCEEEEECCCCCCCCCEEEEEEECCCCCEECCEEEEE Q ss_conf 38438873477413421279999658884002214899 Q gi|254781059|r 197 TDGSFVYIPKGVRCPMELSTYFRINEKGTGQFERTLII 234 (489) Q Consensus 197 ~~G~fi~Vp~gv~v~~PI~~~~~~~~~~~~~f~r~lIi 234 (489) .+|.++|+|+.+.|..==.+-| .|+. .+|||+|. T Consensus 8 D~G~L~F~P~~~~i~aGDtV~f-~NNK---~~PHNvVF 41 (102) T TIGR02656 8 DKGALVFEPAKISIAAGDTVKF-VNNK---GGPHNVVF 41 (102) T ss_pred CCCCCEEECCCEEECCCCEEEE-EECC---CCCCCEEE T ss_conf 8876313458104688981788-4378---89976785 No 33 >COG0829 UreH Urease accessory protein UreH [Posttranslational modification, protein turnover, chaperones] Probab=41.48 E-value=22 Score=14.74 Aligned_cols=84 Identities=11% Similarity=-0.066 Sum_probs=43.7 Q ss_pred HHHHHHCCCEEEEECCCCCCCCCEEEEEEECCCCCEEC--CEEEEEEECCCEEEEEE-CCCCCCCCC-CCCCCEEEEEEE Q ss_conf 88764238438873477413421279999658884002--21489972772499996-156765444-444330589986 Q gi|254781059|r 191 LNSAVFTDGSFVYIPKGVRCPMELSTYFRINEKGTGQF--ERTLIIADEGSYVSYLE-GCTAPQRDE-NQLHAAVVELVA 266 (489) Q Consensus 191 Ln~A~~~~G~fi~Vp~gv~v~~PI~~~~~~~~~~~~~f--~r~lIi~ee~S~v~~iE-~~~s~~~~~-~~~~~~v~Ei~v 266 (489) |.+.+..+.+=|--|.-.. +.+.+.+...++++...= -..=|.+++|+++.+.- +|+= .+.. .+..-.-++|.| T Consensus 23 l~~~~~~~plkV~~p~y~~-~~~~~~~li~~~GGv~gGD~l~~di~lg~~a~~~iTTQsatK-vYrs~~g~A~Q~~~i~v 100 (269) T COG0829 23 LATLFQTGPLKVQRPFYPD-EGLCEAMLIHPSGGVVGGDRLEIDIELGDNAKALITTQSATK-VYRSPGGQATQTTNITV 100 (269) T ss_pred EHHHHCCCCEEEECCCCCC-CCCEEEEEEECCCCCCCCCEEEEEEEECCCCEEEEECCCCCE-EEECCCCCEEEEEEEEE T ss_conf 0333135774887367679-887199999379973246568898998899659997423132-57479996079999997 Q ss_pred CCCCEEEEEE Q ss_conf 6995899999 Q gi|254781059|r 267 LDNAEIKYST 276 (489) Q Consensus 267 g~nA~v~~~~ 276 (489) ++||.|.|.. T Consensus 101 g~nA~LewlP 110 (269) T COG0829 101 GENARLEWLP 110 (269) T ss_pred CCCCEEEECC T ss_conf 7997887648 No 34 >TIGR00500 met_pdase_I methionine aminopeptidase, type I; InterPro: IPR002467 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M24 (clan MG), subfamily M24A. Methionine aminopeptidase (3.4.11.18 from EC) (MAP) is responsible for the removal of the amino-terminal (initiator) methionine from nascent eukaryotic cytosolic and cytoplasmic prokaryotic proteins if the penultimate amino acid is small and uncharged. All MAP studied to date are monomeric proteins that require cobalt ions for activity. Two subfamilies of MAP enzymes are known to exist , . While being evolutionary related, they only share a limited amount of sequence similarity mostly clustered around the residues shown, in the Escherichia coli MAP , to be involved in cobalt-binding. The first family consists of enzymes from prokaryotes as well as eukaryotic MAP-1, while the second group (IPR002468 from INTERPRO) is made up of archaeal MAP and eukaryotic MAP-2. ; GO: 0004239 methionyl aminopeptidase activity, 0006508 proteolysis. Probab=41.35 E-value=22 Score=14.73 Aligned_cols=31 Identities=13% Similarity=0.232 Sum_probs=21.0 Q ss_pred ECCCCCCCHHHHHHHHHHCCCCHHHHHHH--HHHHHHH Q ss_conf 20488889999999986329988999999--9999999 Q gi|254781059|r 30 DKAPKGLNEDIIRFISAKKNEPDWMLEWR--LLAYRRW 65 (489) Q Consensus 30 ~~~~~gl~~~~i~~iS~~k~EP~Wl~e~R--~~Al~~f 65 (489) .....|.|-..++.|.. +|+.+.| .-|+.-| T Consensus 28 ~~v~PGvST~eLD~i~~-----~~i~~~~hA~pAf~GY 60 (265) T TIGR00500 28 REVKPGVSTKELDRIAK-----DFIEKRGHAKPAFLGY 60 (265) T ss_pred HHCCCCCCHHHHHHHHH-----HHHHHCCCCCCCCCCC T ss_conf 63689887899999999-----9998479723535678 No 35 >TIGR01135 glmS glucosamine--fructose-6-phosphate aminotransferase, isomerizing; InterPro: IPR005855 Glucosamine:fructose-6-phosphate aminotransferase (2.6.1.16 from EC) catalyses the formation of glucosamine 6-phosphate and is the first and rate-limiting enzyme of the hexosamine biosynthetic pathway. The final product of the hexosamine pathway, UDP-N-acetyl glucosamine, is an active precursor of numerous macromolecules containing amino sugars. This family of sequences belong to the MEROPS peptidase family C44 (clan PB(C)), and are classified as non-peptidase homologs.; GO: 0004360 glutamine-fructose-6-phosphate transaminase (isomerizing) activity, 0016051 carbohydrate biosynthetic process, 0005737 cytoplasm. Probab=40.24 E-value=23 Score=14.61 Aligned_cols=63 Identities=19% Similarity=0.415 Sum_probs=43.2 Q ss_pred EEEEECCEE-EECCCHHHHHHCCEEEEEHHHH-HHHHHHHHHHHCCCEECCCCCHHHHHHHHHHC-CCEEE Q ss_conf 001213502-2020056787479088558998-77778988620221124565314668876423-84388 Q gi|254781059|r 135 VDAVFDSVS-VVTTFKEELKKSGVIFMPISEA-ICEHPDLVKKYLSTVVPVGDNFYAALNSAVFT-DGSFV 202 (489) Q Consensus 135 v~~~~ds~~-v~~~l~~~L~~~GVi~~~l~eA-~~~~p~lv~~~~~~~~~~~~~~f~aLn~A~~~-~G~fi 202 (489) +-+|.|... -|.+++++|+++|+.|.|=-|- + =|=||+.++.+ .+.+.+.|.-.|+-. .|+|= T Consensus 97 iaVVHNGIIENy~~Lr~eL~~~G~~F~S~TDTEV--IaHLie~~~~~---~~~~L~~A~~~~~~~L~G~YA 162 (628) T TIGR01135 97 IAVVHNGIIENYAELREELEARGHVFSSDTDTEV--IAHLIEEELRQ---EGGDLLEAVQKALKQLRGAYA 162 (628) T ss_pred EEEEECCEECCCHHHHHHHHHCCCEEEECCCCCH--HHHHHHHHHCC---CCCHHHHHHHHHHHHHCCCEE T ss_conf 7999778270548899899848928971367404--88898898417---884189999999876266257 No 36 >pfam04256 DUF434 Protein of unknown function (DUF434). Probab=39.97 E-value=23 Score=14.58 Aligned_cols=26 Identities=35% Similarity=0.544 Sum_probs=21.8 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 99999998378898999999999989 Q gi|254781059|r 436 DEHLFYCLQRGIPEEAAIALIVNGFV 461 (489) Q Consensus 436 ~e~LFYL~sRGi~~~~A~~liv~gF~ 461 (489) -+.|+||.+||-+.+.|..++-+-|- T Consensus 7 ~~Dl~yLLnRGY~~k~al~~Vgnhy~ 32 (58) T pfam04256 7 YEDLKYLLNRGYSKKSALELVGNHYQ 32 (58) T ss_pred HHHHHHHHCCCCCCCCHHHHHHHHHC T ss_conf 99999999199984008999898732 No 37 >TIGR00118 acolac_lg acetolactate synthase, large subunit, biosynthetic type; InterPro: IPR012846 Two groups of proteins form acetolactate from two molecules of pyruvate. The type of acetolactate synthase described in this entry also catalyzes the formation of acetohydroxybutyrate from pyruvate and 2-oxobutyrate, an early step in the branched chain amino acid biosynthesis; it is therefore also termed acetohydroxyacid synthase. In bacteria, this catalytic chain is associated with a smaller regulatory chain in an alpha2/beta2 heterotetramer. Acetolactate synthase is a thiamine pyrophosphate enzyme. In this type, FAD and Mg++ are also found. Several isozymes of this enzyme are found in E. coli K12, one of which contains a frameshift in the large subunit gene and is not expressed.; GO: 0000287 magnesium ion binding, 0003984 acetolactate synthase activity, 0030976 thiamin pyrophosphate binding, 0050660 FAD binding, 0009082 branched chain family amino acid biosynthetic process. Probab=32.33 E-value=16 Score=15.63 Aligned_cols=19 Identities=32% Similarity=0.413 Sum_probs=10.2 Q ss_pred CEE-EEEHHHHHHHHHHHHH Q ss_conf 908-8558998777789886 Q gi|254781059|r 156 GVI-FMPISEAICEHPDLVK 174 (489) Q Consensus 156 GVi-~~~l~eA~~~~p~lv~ 174 (489) ||| ...-++-|+++.|.++ T Consensus 223 Gvin~a~as~~L~elae~~~ 242 (593) T TIGR00118 223 GVINIAGASEELKELAERLQ 242 (593) T ss_pred HHHHCCCCHHHHHHHHHHCC T ss_conf 35420554079999985407 No 38 >cd01088 MetAP2 Methionine Aminopeptidase 2. E.C. 3.4.11.18. Also known as methionyl aminopeptidase and peptidase M. Catalyzes release of N-terminal amino acids, preferentially methionine, from peptides and arylamides. Probab=31.24 E-value=31 Score=13.61 Aligned_cols=13 Identities=46% Similarity=0.363 Sum_probs=6.0 Q ss_pred CEEEEEHHHHHHH Q ss_conf 9088558998777 Q gi|254781059|r 156 GVIFMPISEAICE 168 (489) Q Consensus 156 GVi~~~l~eA~~~ 168 (489) |+-+.++..|+++ T Consensus 123 G~~~~dIg~~Ie~ 135 (291) T cd01088 123 DVRLGEIGEAIEE 135 (291) T ss_pred CCCHHHHHHHHHH T ss_conf 9898999999999 No 39 >PRK09458 pspB phage shock protein B; Provisional Probab=31.07 E-value=31 Score=13.59 Aligned_cols=44 Identities=16% Similarity=0.370 Sum_probs=31.1 Q ss_pred CCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 048888999999998632998899999999999999688778665 Q gi|254781059|r 31 KAPKGLNEDIIRFISAKKNEPDWMLEWRLLAYRRWLTMKDPSWAR 75 (489) Q Consensus 31 ~~~~gl~~~~i~~iS~~k~EP~Wl~e~R~~Al~~f~~l~~P~W~~ 75 (489) +..+|||++--..+-..-..-+=| +-|.++++.-..-.-|+|++ T Consensus 31 k~~~~Ls~~d~~~L~~L~~~A~~m-~eRI~tLE~ILDae~P~WR~ 74 (75) T PRK09458 31 QVSQGLSQEEQQRLAQLTEKAEKM-RERIQTLEAILDAEHPNWRN 74 (75) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHH-HHHHHHHHHHHCCCCCCCCC T ss_conf 246799999999999999999999-99999999997266988566 No 40 >pfam03069 FmdA_AmdA Acetamidase/Formamidase family. This family includes amidohydrolases of formamide EC:3.5.1.49 and acetamide. Methylophilus methylotrophus formamidase forms a homotrimer suggesting all the members of this family also do. Probab=29.36 E-value=33 Score=13.39 Aligned_cols=23 Identities=17% Similarity=0.111 Sum_probs=20.1 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 99999983788989999999999 Q gi|254781059|r 437 EHLFYCLQRGIPEEAAIALIVNG 459 (489) Q Consensus 437 e~LFYL~sRGi~~~~A~~liv~g 459 (489) +.+-||..+|+++++|-.|+-.+ T Consensus 284 ~mI~~L~~~Gltr~eAY~L~SvA 306 (331) T pfam03069 284 NTIEYLEKFGYAREQAYLGLSVA 306 (331) T ss_pred HHHHHHHHCCCCHHHHHHHHHHC T ss_conf 99999998199999999825202 No 41 >TIGR01478 STEVOR variant surface antigen, stevor family; InterPro: IPR006374 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see . This entry represents the stevor (short for "subtelomeric variable open reading frame") family of predicted variant surface antigens. This is the third largest variant surface antigen family in P. falciparum, with 33 genes identified in the genome of this parasite . The function of these proteins is not known, but they are expressed during several stages of the parasite life-cycle and may play a variety of roles in parasite survival and infection .. Probab=28.58 E-value=23 Score=14.48 Aligned_cols=14 Identities=29% Similarity=0.285 Sum_probs=9.9 Q ss_pred CCEEEEEHHHHHHH Q ss_conf 79088558998777 Q gi|254781059|r 155 SGVIFMPISEAICE 168 (489) Q Consensus 155 ~GVi~~~l~eA~~~ 168 (489) .||-+|++..++-. T Consensus 186 ~Gvg~C~l~S~ll~ 199 (315) T TIGR01478 186 AGVGTCALSSALLG 199 (315) T ss_pred CHHHHHHHHHHHHH T ss_conf 03689889999998 No 42 >cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o Probab=27.56 E-value=33 Score=13.35 Aligned_cols=45 Identities=22% Similarity=0.346 Sum_probs=25.6 Q ss_pred HHHHHHCCCEEC--CCCCHHHHHHHHHHCCCEEEEECCCCCCCCCEEEEEE Q ss_conf 988620221124--5653146688764238438873477413421279999 Q gi|254781059|r 171 DLVKKYLSTVVP--VGDNFYAALNSAVFTDGSFVYIPKGVRCPMELSTYFR 219 (489) Q Consensus 171 ~lv~~~~~~~~~--~~~~~f~aLn~A~~~~G~fi~Vp~gv~v~~PI~~~~~ 219 (489) ..++++|+...+ ..++.+.++|. -...||.|-.. .|..|+|+..| T Consensus 217 p~l~~~f~~~~~~~s~e~~y~~~n~---V~pslIRveAD-EvTY~lHImlR 263 (396) T cd06460 217 PKLKKTFPEQLKDVSLENFYRAVNR---VQPSLIRVEAD-EVTYNLHIMLR 263 (396) T ss_pred HHHHHHHCCCCCCCCHHHHHHHHHH---CCCCCCCCCCH-HHHHHHHHHHH T ss_conf 9999984702457899999999975---37751112005-43047899999 No 43 >KOG1184 consensus Probab=27.56 E-value=35 Score=13.17 Aligned_cols=52 Identities=21% Similarity=0.336 Sum_probs=23.4 Q ss_pred ECCEEEEEEECCC-EEEEEECCCCCCCCC----CCCCC-E-EEEEEECCCCEEEEEEEEECC Q ss_conf 0221489972772-499996156765444----44433-0-589986699589999963047 Q gi|254781059|r 227 QFERTLIIADEGS-YVSYLEGCTAPQRDE----NQLHA-A-VVELVALDNAEIKYSTVQNWY 281 (489) Q Consensus 227 ~f~r~lIi~ee~S-~v~~iE~~~s~~~~~----~~~~~-~-v~Ei~vg~nA~v~~~~iQn~~ 281 (489) +-+|+++++|+|| ++|+.|.-+....+- .-++| + -.|..++.++ |..||+|. T Consensus 432 ~e~rvilfiGDGs~qlTvQeiStmir~gl~~~if~~NN~GYTIE~~IH~~~---Yn~I~~W~ 490 (561) T KOG1184 432 PEKRVILFIGDGSFQLTVQEISTMIRWGLKPIIFLINNGGYTIEVEIHDGP---YNDIQNWD 490 (561) T ss_pred CCCEEEEEECCCHHEEEHHHHHHHHHCCCCCEEEEEECCCEEEEEEECCCC---CCCCCCCH T ss_conf 775499996384031228999999864997189999279718987624787---66453311 No 44 >smart00775 LNS2 LNS2 domain. This domain is found in Saccharomyces cerevisiae protein SMP2, proteins with an N-terminal lipin domain and phosphatidylinositol transfer proteins. SMP2 is involved in plasmid maintenance and respiration. Lipin proteins are involved in adipose tissue development and insulin resistance. Probab=27.52 E-value=30 Score=13.73 Aligned_cols=27 Identities=15% Similarity=0.105 Sum_probs=14.5 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHCC Q ss_conf 9998632998899999999999999688 Q gi|254781059|r 42 RFISAKKNEPDWMLEWRLLAYRRWLTMK 69 (489) Q Consensus 42 ~~iS~~k~EP~Wl~e~R~~Al~~f~~l~ 69 (489) ..+....+ .+|...-=.+-+..+.+.| T Consensus 17 G~v~~~~G-~Dwth~Gva~l~~~i~~nG 43 (157) T smart00775 17 GHVVPIIG-KDWTHPGVAKLYRDIQNNG 43 (157) T ss_pred HHHHHHCC-CCCCCCCHHHHHHHHHHCC T ss_conf 75886308-5521210999999999789 No 45 >KOG2874 consensus Probab=26.69 E-value=33 Score=13.36 Aligned_cols=44 Identities=16% Similarity=0.212 Sum_probs=18.7 Q ss_pred CCCCCCCCEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCCHHH Q ss_conf 66568200031005653334332101587888877651765111 Q gi|254781059|r 78 YPAIDFDDLYYYAAPKSIVGPQSLDDVDPELLRTYEKLGIPIRE 121 (489) Q Consensus 78 l~~i~~~~i~~~~~p~~~~~~~sld~v~~ei~~tf~kLgip~~e 121 (489) ...|+...+..-..|......+|...+.|.....|-+---|..+ T Consensus 36 ~d~wki~~Fs~edn~~g~~EeSSfatlFPKYREkYlke~wp~v~ 79 (356) T KOG2874 36 IDMWKIEKFSKEDNPHGFLEESSFATLFPKYREKYLKECWPLVE 79 (356) T ss_pred CHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 00200455684458642234210776657889999999889999 No 46 >TIGR02916 PEP_his_kin putative PEP-CTERM system histidine kinase; InterPro: IPR014265 Proteins in this entry are putative periplasmic sensor signal transduction histidine kinases. They all contain a GAF domain that is present in phytochromes and cGMP-specific phosphodiesterases, and which has been experimentally proven to be involved in protein:protein interactions. They also contain a C-terminal histidine kinase domain, which is composed of a dimerisation sub-domain and an ATP/ADP-binding phosphotransfer, or catalytic, sub-domain. The proteins in this entry are found strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria, where members of IPR014264 from INTERPRO and IPR014266 from INTERPRO also occur.. Probab=26.08 E-value=21 Score=14.81 Aligned_cols=250 Identities=19% Similarity=0.185 Sum_probs=116.3 Q ss_pred CCCCCCCCC----CC---EECCCCCCCHHHHHHHHHHCCCCH---HHHHHHHHHHHHHHHCCCCCCCC------------ Q ss_conf 212574135----31---020488889999999986329988---99999999999999688778665------------ Q gi|254781059|r 18 QYKYGFETN----IQ---VDKAPKGLNEDIIRFISAKKNEPD---WMLEWRLLAYRRWLTMKDPSWAR------------ 75 (489) Q Consensus 18 ~y~~~~~~~----~~---~~~~~~gl~~~~i~~iS~~k~EP~---Wl~e~R~~Al~~f~~l~~P~W~~------------ 75 (489) +|||||-++ ++ ....+.+|-+.+|+.+|..=+=|. ||++-..+.|..--+..+|.-.- T Consensus 307 ~ykYDYR~EWL~Ft~~L~~~~~~~~L~~~v~~~La~~vesp~G~LWlk~~~~~~Y~~~~~~N~~~~~~l~~~~~sa~~qF 386 (696) T TIGR02916 307 RYKYDYREEWLRFTQALSEAESSDDLGERVIKALAQLVESPGGALWLKSVNDGLYRPAARWNFPLAQALFEPSDSAFCQF 386 (696) T ss_pred CCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEECCCCHHHHHHHHHCCCHHHHHHHCCCCHHHHH T ss_conf 88751355888999742157997578999999998630268855776337402679999713202100120026614545 Q ss_pred CCCCCCCC--CCEEEECCCCCCCCCCCCCCCCHHHHHHH--HHCCCCHHHHHHHCCCCC--------------------- Q ss_conf 87665682--00031005653334332101587888877--651765111111023444--------------------- Q gi|254781059|r 76 IRYPAIDF--DDLYYYAAPKSIVGPQSLDDVDPELLRTY--EKLGIPIREQEILAGVQT--------------------- 130 (489) Q Consensus 76 ~~l~~i~~--~~i~~~~~p~~~~~~~sld~v~~ei~~tf--~kLgip~~e~~~l~g~~~--------------------- 130 (489) -.-+.|=. +... ..|..-. .-. +|..+ ..+ -.+-||+.-.+.|.|... T Consensus 387 l~~~~Wi~~L~e~~--~~p~~Y~-~l~---~p~wl-~e~~~~~l~vPL~~~~~L~GFvvL~~p~~~~e~YnWEv~DLLkt 459 (696) T TIGR02916 387 LQESGWIIDLEEAK--SEPDHYS-GLV---LPEWL-REIPNAWLIVPLILGEELLGFVVLARPRTAVEFYNWEVRDLLKT 459 (696) T ss_pred HHHCCCEEEECCCC--CCCCCCC-CCC---CCHHH-HHCCCCEEEEEEECCCEEEEEEEECCCCCCCEEECCCHHHHHHH T ss_conf 52158847312457--6478899-962---44165-31689469975633881799988638754401303210247899 Q ss_pred --------------CC-CEEEEEEC-----CEEEECCCHHHHHHCCEEEEEH--HHHH--HHHHHHHHHHCCCEECCCCC Q ss_conf --------------43-10001213-----5022020056787479088558--9987--77789886202211245653 Q gi|254781059|r 131 --------------SK-VAVDAVFD-----SVSVVTTFKEELKKSGVIFMPI--SEAI--CEHPDLVKKYLSTVVPVGDN 186 (489) Q Consensus 131 --------------~~-v~v~~~~d-----s~~v~~~l~~~L~~~GVi~~~l--~eA~--~~~p~lv~~~~~~~~~~~~~ 186 (489) .. .+.-=||| |.-|.|++|- |-.| .|| .-|- +..|+..++-|+++-. ..+ T Consensus 460 ~~rQAASyLa~~~a~e~L~~arqF~~FnR~SaFvVHDLKN-LvaQ----LSL~l~NA~~Hk~NPeFq~DmL~Tv~~-~v~ 533 (696) T TIGR02916 460 AGRQAASYLAQMEASEALAEARQFEAFNRMSAFVVHDLKN-LVAQ----LSLLLRNAERHKDNPEFQDDMLETVES-AVN 533 (696) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEHHHH-HHHH----HHHHHHHHHHCCCCHHHHHHHHHHHHH-HHH T ss_conf 9999999999999999999888887518803421102589-9999----999999887417896689999998999-999 Q ss_pred HHHHHHHHHHCCCEEEEECCCCCCCCCEEEEEEECCCCCEECCEEEEEEECCC-EEEEEE---------CCCCCCCCCCC Q ss_conf 14668876423843887347741342127999965888400221489972772-499996---------15676544444 Q gi|254781059|r 187 FYAALNSAVFTDGSFVYIPKGVRCPMELSTYFRINEKGTGQFERTLIIADEGS-YVSYLE---------GCTAPQRDENQ 256 (489) Q Consensus 187 ~f~aLn~A~~~~G~fi~Vp~gv~v~~PI~~~~~~~~~~~~~f~r~lIi~ee~S-~v~~iE---------~~~s~~~~~~~ 256 (489) ++-.|=+=+=++|....-.+.+.+..=|+ ++-+....|-||.-|-+++-+ ++++-- ++---.--+.. T Consensus 534 RM~~ll~qLr~~~~p~~~~~~~~l~~L~~---~~~~~k~~q~p~~e~~~~~~~~rl~~~a~~erl~rV~gHL~QNAlEAT 610 (696) T TIGR02916 534 RMKKLLAQLREKGLPEEEKKTVDLVDLLR---RVIASKRAQQPRPEVSIEDTDFRLSVRADRERLERVLGHLVQNALEAT 610 (696) T ss_pred HHHHHHHHHHCCCCCCHHHHEEHHHHHHH---HHHHHHHHCCCCEEEEECCCCEEEEEEECHHHHHHHHHHHHHHHHHCC T ss_conf 99999999730689740564103899999---999998631894489971754178887528889999999999888604 Q ss_pred CCCEEEEEEECCCC-EEEEEEEEECCCC Q ss_conf 43305899866995-8999996304755 Q gi|254781059|r 257 LHAAVVELVALDNA-EIKYSTVQNWYPG 283 (489) Q Consensus 257 ~~~~v~Ei~vg~nA-~v~~~~iQn~~~~ 283 (489) -..|-|+|-+++.- .-=++.|.+-+.| T Consensus 611 ~~~G~V~~~~~~~~~~~a~i~i~D~G~G 638 (696) T TIGR02916 611 PEEGRVKIRVERECGGAAVIEIEDSGCG 638 (696) T ss_pred CCCCEEEEEEEECCCCEEEEEEEECCCC T ss_conf 9996289998741888227999865789 No 47 >pfam06667 PspB Phage shock protein B. This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one. Probab=25.71 E-value=38 Score=12.94 Aligned_cols=44 Identities=20% Similarity=0.420 Sum_probs=30.1 Q ss_pred CCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 048888999999998632998899999999999999688778665 Q gi|254781059|r 31 KAPKGLNEDIIRFISAKKNEPDWMLEWRLLAYRRWLTMKDPSWAR 75 (489) Q Consensus 31 ~~~~gl~~~~i~~iS~~k~EP~Wl~e~R~~Al~~f~~l~~P~W~~ 75 (489) +..+||+++--+.+-..-..-+=| +-|.++++.-..-.-|+|++ T Consensus 31 k~~~~Ls~~d~~~L~~L~~~A~rm-~~RI~tLE~ILDae~P~WR~ 74 (75) T pfam06667 31 KVSQGLSEEDEQLLEELLETAEKL-QERIQTLERILDAESPNWRP 74 (75) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHH-HHHHHHHHHHHCCCCCCCCC T ss_conf 467789999999999999999999-99999999997366989778 No 48 >COG4340 Uncharacterized protein conserved in bacteria [Function unknown] Probab=25.39 E-value=38 Score=12.90 Aligned_cols=129 Identities=22% Similarity=0.207 Sum_probs=70.9 Q ss_pred EEEHHHHHHHHHHHHHHHCCCEECCCCCHHHHHHHHHHCCCEEEEECCCCCCCCCEEEEEEECCC--CCEECCEEEEEEE Q ss_conf 85589987777898862022112456531466887642384388734774134212799996588--8400221489972 Q gi|254781059|r 159 FMPISEAICEHPDLVKKYLSTVVPVGDNFYAALNSAVFTDGSFVYIPKGVRCPMELSTYFRINEK--GTGQFERTLIIAD 236 (489) Q Consensus 159 ~~~l~eA~~~~p~lv~~~~~~~~~~~~~~f~aLn~A~~~~G~fi~Vp~gv~v~~PI~~~~~~~~~--~~~~f~r~lIi~e 236 (489) |.++.+|+.++|- +-.|-.||. .|.-.|++.+++. | -|++... +.+. .+- T Consensus 86 Fepl~da~~~~~~----------------~r~l~aaf~---~~~~L~p~~EIe~--H-Q~Ri~a~~de~gl------paP 137 (226) T COG4340 86 FEPLTDAIADHPV----------------TRGLIAAFE---LFDPLSPTSEIEM--H-QFRIEARTDEQGL------PAP 137 (226) T ss_pred EEEHHHHHHCCCH----------------HHHHHHHHH---HCCCCCCCCEEEE--E-EEEEEEECCCCCC------CCC T ss_conf 5463556646810----------------888999997---5078998544366--7-8998861377689------797 Q ss_pred CCCEE---EEEECCCCCCCCCCCCCCEEEEEEECCCCEEEEEEEEECCCCCCCCCCCEEEEEEE-EEEEECCCEEEEEEE Q ss_conf 77249---99961567654444443305899866995899999630475531144874640146-786304630389988 Q gi|254781059|r 237 EGSYV---SYLEGCTAPQRDENQLHAAVVELVALDNAEIKYSTVQNWYPGDSQGKGGIYNFVTK-RGDCRGMRSKISWTQ 312 (489) Q Consensus 237 e~S~v---~~iE~~~s~~~~~~~~~~~v~Ei~vg~nA~v~~~~iQn~~~~~~~~~~~~~~~~t~-r~~~~~~~a~~~~~~ 312 (489) ||+.- -+|--. .-+.++...+.+.+|+-+++..-+.++-.-++..-...+.++|++|. +.....+.+.+.|+. T Consensus 138 EG~HqDG~D~I~I~---~vDR~NI~gGet~lY~~~~~~p~f~kvl~pGe~~~l~Dh~~~H~~tpi~p~t~~q~g~mdvfv 214 (226) T COG4340 138 EGAHQDGVDWIIIM---LVDRQNIDGGETDLYAPDGASPGFFKVLAPGEAVFLDDHRVLHGVTPIVPSTSRQRGAMDVFV 214 (226) T ss_pred CCCCCCCCCEEEEE---EEEECCCCCCEEEEECCCCCCCCEEEECCCCCEEEECCCHHCCCCCCEECCCHHHCCCEEEEE T ss_conf 53244686679999---863013447437887358888633775168747886130200065540114254255226999 Q ss_pred EEECCC Q ss_conf 751463 Q gi|254781059|r 313 LETGSA 318 (489) Q Consensus 313 ~~lG~~ 318 (489) .+.|.+ T Consensus 215 lt~~~~ 220 (226) T COG4340 215 LTVGRK 220 (226) T ss_pred EECCCC T ss_conf 964876 No 49 >COG4101 Predicted mannose-6-phosphate isomerase [Carbohydrate transport and metabolism] Probab=25.01 E-value=33 Score=13.41 Aligned_cols=15 Identities=53% Similarity=0.928 Sum_probs=11.6 Q ss_pred HHHCCCEEEEECCCC Q ss_conf 642384388734774 Q gi|254781059|r 194 AVFTDGSFVYIPKGV 208 (489) Q Consensus 194 A~~~~G~fi~Vp~gv 208 (489) |...-|-|+|||+|| T Consensus 90 a~~~pGDf~YiPpgV 104 (142) T COG4101 90 AEVGPGDFFYIPPGV 104 (142) T ss_pred EEECCCCEEECCCCC T ss_conf 774488769838999 No 50 >KOG1151 consensus Probab=24.90 E-value=39 Score=12.84 Aligned_cols=19 Identities=26% Similarity=0.455 Sum_probs=13.6 Q ss_pred EEEECCCEEEEEECCCCCC Q ss_conf 9972772499996156765 Q gi|254781059|r 233 IIADEGSYVSYLEGCTAPQ 251 (489) Q Consensus 233 Ii~ee~S~v~~iE~~~s~~ 251 (489) .-++..|-+|++|-|.+.. T Consensus 536 fslDtdsFCTVLEYceGND 554 (775) T KOG1151 536 FSLDTDSFCTVLEYCEGND 554 (775) T ss_pred EEECCCCCEEEEEECCCCC T ss_conf 4522665334345347885 No 51 >pfam10884 DUF2683 Protein of unknown function (DUF2683). This family of proteins with unknown function appears to be restricted to Methanosarcinaceae. Probab=24.76 E-value=39 Score=12.82 Aligned_cols=33 Identities=30% Similarity=0.513 Sum_probs=16.6 Q ss_pred HHHHHHHHHHHHHHHHHH-HCCHHHHHHHHHHHH Q ss_conf 899999999998999988-589689999999999 Q gi|254781059|r 449 EEAAIALIVNGFVREVLQ-KLPMEFAVEAQKLID 481 (489) Q Consensus 449 ~~~A~~liv~gF~~~v~~-~ip~e~~~e~~~li~ 481 (489) +..|..|++.-|.+.+++ ++.-||..++++..+ T Consensus 28 KSaAId~~i~qyee~~lEpelrPefiek~~~i~k 61 (80) T pfam10884 28 KSAAIDKIIEQYEEEILEPELKPEFIEKMQNIMK 61 (80) T ss_pred HHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHC T ss_conf 7899999999999872184459999999999863 No 52 >TIGR01756 LDH_protist lactate dehydrogenase; InterPro: IPR011272 This entry represents a family of protist (Trichomonad) lactate dehydrogenases which have apparently evolved from a recent protist malate dehydrogenase ancestor . Lactate dehydrogenase converts the hydroxyl at C-2 of lactate to a carbonyl in the product, pyruvate. The preference of this enzyme for NAD or NADP has not been determined. A critical residue in malate dehydrogenase, arginine-91 (Trichomonas vaginalis numbering) has been mutated to a leucine, eliminating the positive charge which complemented the carboxylate in malate which is absent in lactate. Several other more subtle changes are proposed to make the active site smaller to accommodate the less bulky lactate molecule .. Probab=24.51 E-value=40 Score=12.79 Aligned_cols=44 Identities=16% Similarity=0.101 Sum_probs=39.5 Q ss_pred EECCC-CCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 02048-888999999998632998899999999999999688778 Q gi|254781059|r 29 VDKAP-KGLNEDIIRFISAKKNEPDWMLEWRLLAYRRWLTMKDPS 72 (489) Q Consensus 29 ~~~~~-~gl~~~~i~~iS~~k~EP~Wl~e~R~~Al~~f~~l~~P~ 72 (489) .+.+. .|-.+.+++.+++..-+|+++..+-.+||+..+..|+-+ T Consensus 178 ~~~~~e~G~~~~~~d~l~~~~~~~~F~~~Ia~RAW~ilE~RG~TS 222 (314) T TIGR01756 178 AEFTKEDGKKQKVIDELSKDYAEDDFFEVIAQRAWKILERRGFTS 222 (314) T ss_pred EEEECCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCC T ss_conf 999816883137888776522561589999899999984058835 No 53 >PRK03902 manganese transport transcriptional regulator; Provisional Probab=24.23 E-value=40 Score=12.75 Aligned_cols=47 Identities=9% Similarity=0.039 Sum_probs=20.6 Q ss_pred HHHCCCCHHHHHHHCCCCCCCCEEEEEECCEEEECCCHHHHHHCCEEEEEHHHHHHHHHHHHHHH Q ss_conf 76517651111110234444310001213502202005678747908855899877778988620 Q gi|254781059|r 112 YEKLGIPIREQEILAGVQTSKVAVDAVFDSVSVVTTFKEELKKSGVIFMPISEAICEHPDLVKKY 176 (489) Q Consensus 112 f~kLgip~~e~~~l~g~~~~~v~v~~~~ds~~v~~~l~~~L~~~GVi~~~l~eA~~~~p~lv~~~ 176 (489) +..||+|..+....| ...-|.++++.-++ +..+-...+++||+++.+ T Consensus 84 L~~LGv~~~~a~~eA---------------c~lEH~iS~~~~~~---l~~~l~~~~~~P~~~~~f 130 (142) T PRK03902 84 LRIIGVDESKIYNDV---------------EGIEHHLSWNAIDR---IGDLVQYFEEDPDRLETL 130 (142) T ss_pred HHHHCCCHHHHHHHH---------------HHHHCCCCHHHHHH---HHHHHCCHHHCCHHHHHH T ss_conf 999199876999999---------------88141698999999---999973654296999999 No 54 >pfam02074 Peptidase_M32 Carboxypeptidase Taq (M32) metallopeptidase. Probab=23.60 E-value=41 Score=12.67 Aligned_cols=19 Identities=21% Similarity=0.579 Sum_probs=9.7 Q ss_pred EECCCCHHHHHHHHHCCCC Q ss_conf 2024799999999837889 Q gi|254781059|r 430 TTSKISDEHLFYCLQRGIP 448 (489) Q Consensus 430 tvg~id~e~LFYL~sRGi~ 448 (489) |.|.+=.-|||.-|-+-++ T Consensus 420 ~LG~~~AaQl~~~~~~~~p 438 (494) T pfam02074 420 SLGTIYAAQLYYKIKEDLP 438 (494) T ss_pred HHHHHHHHHHHHHHHHHCC T ss_conf 9999999999999998676 No 55 >TIGR02461 osmo_MPG_phos mannosyl-3-phosphoglycerate phosphatase; InterPro: IPR012813 Members of this family are mannosyl-3-phosphoglycerate phosphatase (3.1.3.70 from EC). It acts sequentially after mannosyl-3-phosphoglycerate synthase (2.4.1.217 from EC) in a two-step pathway of biosynthesis of the compatible solute mannosylglycerate, a typical osmolyte of thermophiles.; GO: 0050531 mannosyl-3-phosphoglycerate phosphatase activity, 0051479 mannosylglycerate biosynthetic process, 0005737 cytoplasm. Probab=23.50 E-value=41 Score=12.65 Aligned_cols=13 Identities=38% Similarity=1.071 Sum_probs=9.5 Q ss_pred HHCCCEEEEECCC Q ss_conf 4238438873477 Q gi|254781059|r 195 VFTDGSFVYIPKG 207 (489) Q Consensus 195 ~~~~G~fi~Vp~g 207 (489) +...|.=||+|+| T Consensus 63 I~ENGgaIyiPrg 75 (248) T TIGR02461 63 IVENGGAIYIPRG 75 (248) T ss_pred EEECCCEEECCCC T ss_conf 8722733204777 No 56 >TIGR01173 glmU UDP-N-acetylglucosamine pyrophosphorylase; InterPro: IPR005882 N-Acetylglucosamine-1-PO(4) uridyltransferase (GlmU, 2.7.7.23 from EC) is a trimeric bifunctional enzyme that catalyzes the last two sequential reactions in the de novo biosynthetic pathway for UDP-GlcNAc. The X-ray crystal structure of Escherichia coli GlmU in complex with UDP-GlcNAc and CoA has been determined to 2.1 A resolution and reveals a two-domain architecture that is responsible for these two reactions . The C-terminal domain is responsible for the CoA-dependent acetylation of Glc-1-PO(4) to GlcNAc-1-PO(4) and displays the longest left-handed parallel beta-helix observed to date. The acetyltransferase active site defined by the binding site for CoA makes use of residues from all three subunits and is positioned beneath an open cavity large enough to accommodate the Glc-1-PO(4) acetyl acceptor. The N-terminal domain catalyzes uridyl transfer from UTP to GlcNAc-1-PO(4) to form the final products UDP-GlcNAc and pyrophosphate. This domain is composed of a central seven-stranded beta-sheet surrounded by six alpha-helices in a Rossmann fold-like topology. ; GO: 0003977 UDP-N-acetylglucosamine diphosphorylase activity, 0009103 lipopolysaccharide biosynthetic process. Probab=23.03 E-value=42 Score=12.59 Aligned_cols=25 Identities=8% Similarity=0.117 Sum_probs=14.6 Q ss_pred HHHHHHCCCEEEEECCCCCCCCCEE Q ss_conf 8876423843887347741342127 Q gi|254781059|r 191 LNSAVFTDGSFVYIPKGVRCPMELS 215 (489) Q Consensus 191 Ln~A~~~~G~fi~Vp~gv~v~~PI~ 215 (489) ++..++..|+.+.=|..+-++.-++ T Consensus 249 ~~~~l~~~GVt~~dP~~~~i~~~v~ 273 (461) T TIGR01173 249 IAKKLLLQGVTLIDPARTDIRGTVE 273 (461) T ss_pred HHHHHHHCCCEEECCCEEEECCCEE T ss_conf 9998985892998683378735479 No 57 >COG2130 Putative NADP-dependent oxidoreductases [General function prediction only] Probab=22.58 E-value=21 Score=14.82 Aligned_cols=83 Identities=12% Similarity=-0.044 Sum_probs=46.8 Q ss_pred CCCCCEEEEEEEEECCCCEEEEEEEEEEECCCCEEEEEEEECCCCHHHHHHHHHCCCCHHHHHHHHHHHHHH-HHHHHCC Q ss_conf 587430221058845985486223689834875688753202479999999983788989999999999899-9988589 Q gi|254781059|r 391 AHSRNFTQCDSLLIGNNCGAHTIPYIESKNSSCNLEHEATTSKISDEHLFYCLQRGIPEEAAIALIVNGFVR-EVLQKLP 469 (489) Q Consensus 391 ~~t~~~~~~~~LlLs~~a~~~s~P~LeI~~ddv~~~H~Atvg~id~e~LFYL~sRGi~~~~A~~liv~gF~~-~v~~~ip 469 (489) ++.|-|.+|-. .+--=-..|.|..++-=+-|+|=|.-..-+. =.+-|.++.--++++.|+||+- +-+..-| T Consensus 218 ~GIDvyfeNVG----g~v~DAv~~~ln~~aRi~~CG~IS~YN~~~~----~~gp~~l~~l~~kr~~v~Gfiv~~~~~~~~ 289 (340) T COG2130 218 KGIDVYFENVG----GEVLDAVLPLLNLFARIPVCGAISQYNAPEL----PPGPRRLPLLMAKRLRVQGFIVASDYDQRF 289 (340) T ss_pred CCEEEEEECCC----CHHHHHHHHHHCCCCCEEEEEEHHHCCCCCC----CCCCCHHHHHHHHHHEEEEEEECHHHHHHH T ss_conf 87179997489----6589999986230452246100253389889----987401668876432467899641355646 Q ss_pred HHHHHHHHHHHH Q ss_conf 689999999999 Q gi|254781059|r 470 MEFAVEAQKLID 481 (489) Q Consensus 470 ~e~~~e~~~li~ 481 (489) -|+.+|+-+++. T Consensus 290 ~e~~~~l~~wv~ 301 (340) T COG2130 290 PEALRELGGWVK 301 (340) T ss_pred HHHHHHHHHHHH T ss_conf 999999999987 No 58 >pfam03701 UPF0181 Uncharacterized protein family (UPF0181). This family contains small proteins of about 50 amino acids of unknown function. The family includes YoaH. Probab=22.57 E-value=43 Score=12.53 Aligned_cols=25 Identities=20% Similarity=0.324 Sum_probs=19.3 Q ss_pred CCHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 7999999998378898999999999 Q gi|254781059|r 434 ISDEHLFYCLQRGIPEEAAIALIVN 458 (489) Q Consensus 434 id~e~LFYL~sRGi~~~~A~~liv~ 458 (489) -.-|.|--||+.|+|-.+|..++-. T Consensus 15 ~AvErIq~LMaqGmSsgeAI~~VA~ 39 (52) T pfam03701 15 EAVERIQELMAQGMSSGEAIAIVAQ 39 (52) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 9999999999804558899999999 No 59 >pfam04472 DUF552 Protein of unknown function (DUF552). Family of uncharacterized proteins. Probab=22.54 E-value=43 Score=12.52 Aligned_cols=15 Identities=27% Similarity=0.499 Sum_probs=12.0 Q ss_pred CCCEEEEECCCCCCC Q ss_conf 384388734774134 Q gi|254781059|r 197 TDGSFVYIPKGVRCP 211 (489) Q Consensus 197 ~~G~fi~Vp~gv~v~ 211 (489) .+..|+.+|+|+++. T Consensus 59 ~~~v~l~~P~~v~I~ 73 (73) T pfam04472 59 GDKIFLLTPSNVKIS 73 (73) T ss_pred CCCEEEEECCCCEEC T ss_conf 198999979983939 No 60 >TIGR01818 ntrC nitrogen regulation protein NR(I); InterPro: IPR010114 This entry represents Nitrogen regulatory protein C (NtrC), which is a bacterial enhancer-binding protein that activates the transcription of genes encoding enzymes required for nitrogen metabolism. It is phosphorylated by NtrB and interacts with sigma-54. One of the best studied examples is its activation of the gene glnA, which encodes the enzyme glutamine synthetase.. NtrC is composed of three domains , . The 124 residue N-terminal domain is homologous to receiver domains of other response regulator proteins in two-component signal transduction systems , . The 240 residue central domain of NtrC is homologous to a domain found in all activators of the sigma-54 RNA polymerase holoenzyme , . The C-terminal domain has been indicated to contain the determinants necessary for both DNA-binding and dimerization of full-length NtrC.; GO: 0000156 two-component response regulator activity, 0003677 DNA binding, 0005524 ATP binding, 0000160 two-component signal transduction system (phosphorelay), 0009399 nitrogen fixation, 0050906 detection of stimulus during sensory perception. Probab=22.53 E-value=29 Score=13.76 Aligned_cols=18 Identities=28% Similarity=0.551 Sum_probs=8.1 Q ss_pred CCCEEEEECCCCCCCCCE Q ss_conf 384388734774134212 Q gi|254781059|r 197 TDGSFVYIPKGVRCPMEL 214 (489) Q Consensus 197 ~~G~fi~Vp~gv~v~~PI 214 (489) ..|+|=|.||---++.-| T Consensus 91 ~~GAfEYLpKPFD~de~v 108 (471) T TIGR01818 91 QRGAFEYLPKPFDLDEAV 108 (471) T ss_pred HCCCHHCCCCCCCHHHHH T ss_conf 358302176987668899 No 61 >PRK11480 tauA taurine transporter substrate binding subunit; Provisional Probab=22.46 E-value=43 Score=12.51 Aligned_cols=13 Identities=31% Similarity=0.593 Sum_probs=5.8 Q ss_pred HCCCCHHHHHHHH Q ss_conf 3788989999999 Q gi|254781059|r 444 QRGIPEEAAIALI 456 (489) Q Consensus 444 sRGi~~~~A~~li 456 (489) --|++.+++..++ T Consensus 255 ~~G~~~~~~~~~l 267 (320) T PRK11480 255 LSGVPEGDVPGLV 267 (320) T ss_pred HHCCCHHHHHHHH T ss_conf 9598988999997 No 62 >PRK12379 propionate/acetate kinase; Provisional Probab=21.30 E-value=45 Score=12.35 Aligned_cols=28 Identities=4% Similarity=0.064 Sum_probs=19.5 Q ss_pred EECCCCHHHHHHHHHC-CCCHHHHHHHHH Q ss_conf 2024799999999837-889899999999 Q gi|254781059|r 430 TTSKISDEHLFYCLQR-GIPEEAAIALIV 457 (489) Q Consensus 430 tvg~id~e~LFYL~sR-Gi~~~~A~~liv 457 (489) -.|.||+-.++|||.. |++.++...+|- T Consensus 240 RsG~lDPgv~~~l~~~~g~s~~e~~~~Ln 268 (400) T PRK12379 240 RSGDVDFGAMAWVAKQTGQTLGDLERVVN 268 (400) T ss_pred CCCCCCHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 67787818999999973999999999985 No 63 >pfam09145 Ubiq-assoc Ubiquitin-associated. Ubiquitin associated domains contain approximately 40 residues and bind ubiquitin noncovalently. They adopt a secondary structure consisting of three alpha-helices, and have been identified in various modular proteins involved in protein trafficking, clathrin assembly/disassembly, DNA repair, proteasomal degradation, and cell cycle regulation. Probab=21.00 E-value=46 Score=12.30 Aligned_cols=30 Identities=20% Similarity=0.235 Sum_probs=23.6 Q ss_pred CCHHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 799999999837889899999999998999 Q gi|254781059|r 434 ISDEHLFYCLQRGIPEEAAIALIVNGFVRE 463 (489) Q Consensus 434 id~e~LFYL~sRGi~~~~A~~liv~gF~~~ 463 (489) +-+.+|--|||-|++-+.|..---+|-.-| T Consensus 5 vkDmEiAkLMSLGl~id~A~~yYe~gi~ye 34 (43) T pfam09145 5 VKDMELAKLMSLGLSIDKANDYYERGITYE 34 (43) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHCCCHHH T ss_conf 888999999981278226688998366199 No 64 >TIGR03629 arch_S13P archaeal ribosomal protein S13P. This model describes exclusively the archaeal ribosomal protein S13P. It excludes the homologous eukaryotic 40S ribosomal protein S18 and bacterial 30S ribosomal protein S13. Probab=20.96 E-value=41 Score=12.71 Aligned_cols=11 Identities=18% Similarity=0.395 Sum_probs=5.4 Q ss_pred CCHHHHHHHHH Q ss_conf 89999999986 Q gi|254781059|r 36 LNEDIIRFISA 46 (489) Q Consensus 36 l~~~~i~~iS~ 46 (489) |+++-++.|.. T Consensus 52 Lt~~ei~~i~~ 62 (144) T TIGR03629 52 LDDEEIEKLEE 62 (144) T ss_pred CCHHHHHHHHH T ss_conf 99999999999 No 65 >KOG2321 consensus Probab=20.79 E-value=46 Score=12.27 Aligned_cols=31 Identities=26% Similarity=0.404 Sum_probs=22.1 Q ss_pred HHHHHHCCCCHHHHHHHHHHHH----------HHHHCCCCC Q ss_conf 9998632998899999999999----------999688778 Q gi|254781059|r 42 RFISAKKNEPDWMLEWRLLAYR----------RWLTMKDPS 72 (489) Q Consensus 42 ~~iS~~k~EP~Wl~e~R~~Al~----------~f~~l~~P~ 72 (489) -.+|.-|.-|+||.+.|++.+. ..+.++||+ T Consensus 12 YnvS~~kslP~Wls~r~kR~lkkd~~~~~rieLiQdfe~p~ 52 (703) T KOG2321 12 YNVSAGKSLPDWLSDRRKRQLKKDVDYRQRIELIQDFEMPT 52 (703) T ss_pred EEEECCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCC T ss_conf 98525777605555678888742667888889987438864 No 66 >COG4668 MtlA Mannitol/fructose-specific phosphotransferase system, IIA domain [Carbohydrate transport and metabolism] Probab=20.73 E-value=47 Score=12.27 Aligned_cols=68 Identities=25% Similarity=0.295 Sum_probs=44.6 Q ss_pred HHHHHCCEEEEEHHHHHHHHHHHHHHHCCC--EECCCCCHHHHHHHHHHCCCEEEEECCCCCCCCCEEEEEE Q ss_conf 678747908855899877778988620221--1245653146688764238438873477413421279999 Q gi|254781059|r 150 EELKKSGVIFMPISEAICEHPDLVKKYLST--VVPVGDNFYAALNSAVFTDGSFVYIPKGVRCPMELSTYFR 219 (489) Q Consensus 150 ~~L~~~GVi~~~l~eA~~~~p~lv~~~~~~--~~~~~~~~f~aLn~A~~~~G~fi~Vp~gv~v~~PI~~~~~ 219 (489) +.|.+.|-+--+.-+++.+....+--|||. ++|.+. -.+-...+-++=+++-.|.|+....+-+.++. T Consensus 26 ~~Lv~~G~V~~~Yl~~m~eRE~~~STyLGngIAIPHgt--~e~k~~V~kTgv~i~q~p~GV~wg~g~~a~~v 95 (142) T COG4668 26 EALVEGGYVEPGYLDAMLEREKQVSTYLGNGIAIPHGT--GEAKDLVLKTGVVIVQFPEGVDWGDGQIAYLV 95 (142) T ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCEECCCCC--HHHHHHHHHCCEEEEECCCCCCCCCCCEEEEE T ss_conf 99987798788999999999888777514744048897--58999999718499977876215999669999 No 67 >TIGR02418 acolac_catab acetolactate synthase, catabolic; InterPro: IPR012782 Acetolactate synthase is a thiamin pyrophosphate-dependent enzyme that combines two molecules of pyruvate to yield 2-acetolactate with the release of CO2. It exists in two distinct forms that have different properties, though they are clearly related , . The biosynthetic form, found in plants, fungi and bacteria, is involved in the biosynthesis of branched-chain amino acids and contains non-catalytic FAD. The catabolic form is found only in some bacteria, participates in butanediol fermentation, and does not contain FAD. This entry represents the catabolic form of acetolactate synthase, generally found in species that posses acetolactate decarboxylase, which is often encoded in the same operon. The enzyme is thought to be a homotetramer, though this has not conclusively been shown. Each monomer consists of three distinct domains, designated, starting from the N-terminus, as alpha, beta and gamma . The core of the alpha and gamma domains is a six-stranded parallel beta sheet surrounded by alpha helices, while the the beta domain forms a double Rossman fold. The thiamin pyrophosphate cofactor has an unusual conformation that has not been seen in other enzymes using this cofactor, including the biosynthetic form of acetolactate synthase. The active site is located in a funnel at the dimer interface, bordered by residues from all three domains.; GO: 0000287 magnesium ion binding, 0003984 acetolactate synthase activity, 0030976 thiamin pyrophosphate binding, 0050660 FAD binding. Probab=20.68 E-value=47 Score=12.26 Aligned_cols=17 Identities=24% Similarity=0.407 Sum_probs=6.9 Q ss_pred CCEEEEECCCCCCCCCE Q ss_conf 84388734774134212 Q gi|254781059|r 198 DGSFVYIPKGVRCPMEL 214 (489) Q Consensus 198 ~G~fi~Vp~gv~v~~PI 214 (489) |-+||-+|-.|.=++|. T Consensus 154 GAaFvS~PQDV~~~kp~ 170 (553) T TIGR02418 154 GAAFVSLPQDVVDSKPV 170 (553) T ss_pred CCCEEECCCCCCCCCCC T ss_conf 66255057321488755 No 68 >PHA01625 hypothetical protein Probab=20.61 E-value=47 Score=12.25 Aligned_cols=10 Identities=10% Similarity=0.245 Sum_probs=4.1 Q ss_pred EEEECCCCEE Q ss_conf 9986699589 Q gi|254781059|r 263 ELVALDNAEI 272 (489) Q Consensus 263 Ei~vg~nA~v 272 (489) |+.+..|..| T Consensus 230 e~~~~~~~Wl 239 (252) T PHA01625 230 EFFVYRDSWL 239 (252) T ss_pred EEEECCCCEE T ss_conf 6664488079 No 69 >COG5616 Predicted integral membrane protein [Function unknown] Probab=20.52 E-value=47 Score=12.23 Aligned_cols=17 Identities=41% Similarity=0.514 Sum_probs=6.9 Q ss_pred ECCCCCCCHHHHHHHHH Q ss_conf 20488889999999986 Q gi|254781059|r 30 DKAPKGLNEDIIRFISA 46 (489) Q Consensus 30 ~~~~~gl~~~~i~~iS~ 46 (489) +.+..||+|++|..+|+ T Consensus 42 ~~faDGlTEdiIt~Lsr 58 (152) T COG5616 42 EYFADGLTEDIITDLSR 58 (152) T ss_pred HHHCCCCHHHHHHHHHH T ss_conf 56505530889987665 No 70 >PRK10918 phosphate transporter subunit; Provisional Probab=20.07 E-value=48 Score=12.17 Aligned_cols=49 Identities=27% Similarity=0.332 Sum_probs=34.7 Q ss_pred HHHHHHCCCEECCCCCHHHHHHHHHHCCCEEEEECCCCCC-CCCEEEEEEECCCCC-EECCEEE Q ss_conf 9886202211245653146688764238438873477413-421279999658884-0022148 Q gi|254781059|r 171 DLVKKYLSTVVPVGDNFYAALNSAVFTDGSFVYIPKGVRC-PMELSTYFRINEKGT-GQFERTL 232 (489) Q Consensus 171 ~lv~~~~~~~~~~~~~~f~aLn~A~~~~G~fi~Vp~gv~v-~~PI~~~~~~~~~~~-~~f~r~l 232 (489) -|.+=|.|++..++|...++||- |+.+ +.||..++|.++.++ ..|.+-| T Consensus 124 ~la~IF~G~It~WnDp~I~~lNP-------------g~~LP~~~I~vv~RsD~SGTT~~FT~yL 174 (346) T PRK10918 124 TLGDIYLGKIKKWDDEAIAKLNP-------------GLKLPSQNIAVVRRADGSGTSFVFTSYL 174 (346) T ss_pred HHHHHHHCCCCCCCCHHHHHCCC-------------CCCCCCCCEEEEEECCCCCCHHHHHHHH T ss_conf 99999848567678765663088-------------9989998639997079860299999999 Done!