Query gi|254780258|ref|YP_003064671.1| 50S ribosomal protein L2 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 278 No_of_seqs 160 out of 1455 Neff 4.8 Searched_HMMs 39220 Date Tue May 24 07:26:10 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780258.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01171 rplB_bact ribosomal 100.0 0 0 1018.2 23.5 272 3-275 1-279 (279) 2 PRK09374 rplB 50S ribosomal pr 100.0 0 0 872.0 28.3 275 1-276 1-276 (276) 3 CHL00052 rpl2 ribosomal protei 100.0 0 0 848.9 27.0 271 1-275 1-274 (274) 4 PTZ00031 ribosomal protein L2; 100.0 0 0 828.4 25.3 254 23-277 68-321 (330) 5 COG0090 RplB Ribosomal protein 100.0 0 0 788.5 24.0 274 1-275 1-275 (275) 6 PRK09612 rpl2p 50S ribosomal p 100.0 0 0 573.5 19.8 214 36-250 2-221 (240) 7 PTZ00180 60S ribosomal protein 100.0 0 0 571.3 18.4 213 39-254 8-232 (260) 8 KOG0438 consensus 100.0 0 0 562.6 13.7 273 1-274 23-306 (312) 9 pfam03947 Ribosomal_L2_C Ribos 100.0 0 0 417.4 12.9 129 125-253 1-130 (130) 10 KOG2309 consensus 100.0 3.5E-38 8.9E-43 271.3 10.6 198 52-255 19-225 (248) 11 pfam00181 Ribosomal_L2 Ribosom 100.0 6.4E-32 1.6E-36 230.1 10.0 77 42-118 1-77 (77) 12 KOG0438 consensus 96.4 0.015 3.7E-07 36.9 6.6 108 37-163 13-126 (312) 13 COG0272 Lig NAD-dependent DNA 84.9 1.3 3.3E-05 24.1 3.8 103 64-168 309-438 (667) 14 cd03699 lepA_II lepA_II: This 81.6 3 7.6E-05 21.7 4.5 26 75-101 2-27 (86) 15 smart00532 LIGANc Ligase N fam 81.4 1.6 4E-05 23.6 3.0 105 64-169 304-433 (441) 16 pfam02699 YajC Preprotein tran 71.4 6.4 0.00016 19.5 3.9 40 147-188 38-77 (83) 17 TIGR02303 HpaG-C-term 4-hydrox 67.7 3.1 7.8E-05 21.6 1.6 65 79-147 165-248 (249) 18 KOG1004 consensus 67.3 2.4 6.1E-05 22.4 1.0 124 81-212 40-188 (230) 19 pfam06434 Aconitase_2_N Aconit 62.9 12 0.0003 17.8 3.9 83 90-181 100-189 (204) 20 KOG0462 consensus 60.4 4.5 0.00011 20.6 1.4 11 145-155 352-362 (650) 21 PRK05585 yajC preprotein trans 60.1 13 0.00034 17.5 4.1 38 152-189 57-94 (107) 22 KOG4309 consensus 59.7 13 0.00034 17.4 3.7 39 42-86 105-149 (217) 23 PRK07956 ligA NAD-dependent DN 59.1 3.8 9.8E-05 21.0 0.8 105 64-169 309-439 (668) 24 TIGR02176 pyruv_ox_red pyruvat 51.0 4.8 0.00012 20.4 0.3 21 208-228 197-218 (1194) 25 pfam09907 DUF2136 Uncharacteri 47.1 13 0.00034 17.5 2.0 39 47-85 36-74 (76) 26 cd04088 EFG_mtEFG_II EFG_mtEFG 44.1 24 0.0006 15.8 4.8 26 75-101 2-27 (83) 27 TIGR02092 glgD glucose-1-phosp 43.2 11 0.00028 18.0 1.1 37 21-81 16-52 (383) 28 PRK05433 GTP-binding protein L 41.8 25 0.00063 15.7 2.7 84 28-120 192-281 (601) 29 pfam04189 Gcd10p Gcd10p family 40.1 27 0.00069 15.4 2.7 17 184-200 185-201 (300) 30 smart00536 AXH domain in Ataxi 37.9 24 0.00061 15.8 2.2 28 93-123 7-34 (116) 31 TIGR02252 DREG-2 REG-2-like, H 36.8 11 0.00029 18.0 0.3 19 191-209 136-154 (224) 32 PRK09238 bifunctional aconitat 35.7 32 0.00081 15.0 3.4 84 90-182 267-357 (841) 33 PRK13922 rod shape-determining 34.8 33 0.00084 14.9 4.8 36 100-142 198-233 (264) 34 KOG3283 consensus 34.4 33 0.00085 14.8 3.2 108 47-154 41-176 (200) 35 PRK06763 F0F1 ATP synthase sub 33.8 34 0.00087 14.8 6.6 120 73-205 41-173 (213) 36 PRK05886 yajC preprotein trans 33.3 35 0.00089 14.7 5.4 37 152-188 42-78 (108) 37 pfam11946 DUF3463 Domain of un 32.8 20 0.0005 16.4 1.0 13 243-255 55-68 (141) 38 KOG3129 consensus 32.8 26 0.00066 15.5 1.6 30 106-138 153-182 (231) 39 TIGR02091 glgC glucose-1-phosp 30.9 19 0.00048 16.5 0.7 10 62-71 32-41 (421) 40 cd05790 S1_Rrp40 S1_Rrp40: Rrp 30.9 30 0.00076 15.2 1.7 12 103-114 2-13 (86) 41 pfam09038 53-BP1_Tudor Tumour 30.3 39 0.00099 14.4 6.4 89 70-169 14-107 (118) 42 KOG2040 consensus 26.1 39 0.001 14.4 1.6 19 142-165 597-615 (1001) 43 pfam02736 Myosin_N Myosin N-te 25.7 47 0.0012 13.9 4.4 27 162-188 15-41 (42) 44 PRK13885 conjugal transfer pro 24.9 46 0.0012 13.9 1.8 12 83-94 85-96 (299) 45 TIGR00575 dnlj DNA ligase, NAD 24.4 49 0.0013 13.7 4.0 103 65-169 317-446 (706) 46 COG1430 Uncharacterized conser 23.7 51 0.0013 13.6 3.4 56 58-119 57-120 (126) 47 COG0448 GlgC ADP-glucose pyrop 23.3 42 0.0011 14.1 1.4 126 61-195 36-196 (393) 48 KOG3556 consensus 23.1 47 0.0012 13.9 1.5 60 195-257 243-311 (724) 49 TIGR01393 lepA GTP-binding pro 22.9 53 0.0013 13.5 4.7 27 74-101 192-218 (598) 50 cd01773 Faf1_like1_UBX Faf1_li 22.5 48 0.0012 13.8 1.5 24 172-195 7-30 (82) 51 COG0298 HypC Hydrogenase matur 21.8 55 0.0014 13.4 4.5 43 72-118 3-47 (82) 52 KOG2051 consensus 21.3 43 0.0011 14.1 1.1 26 229-256 1012-1037(1128) 53 KOG3429 consensus 21.3 57 0.0014 13.3 1.8 50 128-187 29-78 (172) 54 pfam04472 DUF552 Protein of un 21.2 19 0.00049 16.4 -0.7 36 60-112 36-72 (73) 55 PRK13839 conjugal transfer pro 21.0 57 0.0015 13.3 1.8 12 169-180 206-217 (274) 56 cd03691 BipA_TypA_II BipA_TypA 20.7 58 0.0015 13.2 4.4 25 75-100 2-26 (86) 57 smart00287 SH3b Bacterial SH3 20.6 59 0.0015 13.2 5.4 43 145-187 13-58 (63) 58 TIGR02386 rpoC_TIGR DNA-direct 20.6 39 0.001 14.4 0.7 39 147-185 1328-1372(1552) No 1 >TIGR01171 rplB_bact ribosomal protein L2; InterPro: IPR005880 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . The protein L2 is found in all ribosomes and is one of the best conserved proteins of this mega-dalton complex. L2 is elongated, exposing one end of the protein to the surface of the intersubunit interface of the 50 S subunit and is essential for the association of the ribosomal subunits and might participate in the binding and translocation of the tRNAs . This entry represents bacterial, chloroplast and mitochondrial forms.; GO: 0003723 RNA binding, 0003735 structural constituent of ribosome, 0016740 transferase activity, 0006412 translation, 0015934 large ribosomal subunit. Probab=100.00 E-value=0 Score=1018.21 Aligned_cols=272 Identities=57% Similarity=0.995 Sum_probs=266.4 Q ss_pred CEECCCCCCCCCCEEECCCCCCCCC--CCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCEEEEEEEEE-CCCCCCEEEE Q ss_conf 5011588841051087045764168--872221115543677588984878864787510268798863-3777604788 Q gi|254780258|r 3 LKNFNPDTSGRRQLVMVDRNSLHSG--KPIKSLTRGLCSKGGRNNTGRVTMRFRGGGHKNRYRLVDFKR-GEYGLEGVVK 79 (278) Q Consensus 3 ~k~~kP~Tp~~R~~~~~~~~~l~k~--~p~K~L~~~~~~~~GRNn~GrITvrhRGGGhKr~yR~IDf~r-~~~~~~~~V~ 79 (278) ||+|||+|||+|+++.+||+|||++ +|+|+|+..+++++||||+|+||+|||||||||+||+||||| ++.+|+|+|+ T Consensus 1 ik~ykP~TpG~R~~~~~~f~ei~k~~~~P~KsL~~~~~~~~GRNn~GrIT~RHrGGGHKr~YR~IDFKR~~K~~I~AkV~ 80 (279) T TIGR01171 1 IKKYKPYTPGRRGAVVSDFEEITKGRSKPEKSLLEKLHKTGGRNNRGRITSRHRGGGHKRLYRIIDFKRRDKDGIPAKVA 80 (279) T ss_pred CCEECCCCCCCCEEEEEECCCCEECCCCCCCCHHCCCCCCCCCCCCCEEEEEEECCCCCCCCCEEEEEECCCCCCCEEEE T ss_conf 91120478885313353056521047886311001676467848887188887158844124314321036578735999 Q ss_pred EEEECCCCEEEEEEEECCCCCEEEEECCCCCCCCCEEEEC-CCCCCCCCCCCHHHHHCCCCCEEEEEEEECCCCEEEEEE Q ss_conf 7641588525889841477634675164566104578836-667675757411044357463776233303688258863 Q gi|254780258|r 80 RLEYDPNRTAFIALISYSNGSLAYILAPQRLSVGDKVISS-DSAVDVKPGNAMPLRFIPVGSIVHNVEMKPGKGGQISRS 158 (278) Q Consensus 80 ~IeYDPnRsa~IALv~~~~g~~~YIlAp~gl~vGd~I~s~-~~~~~i~~Gn~lpL~~IP~Gt~I~NIE~~pG~Ggkl~Rs 158 (278) +||||||||||||||+|.|||++|||||+||++||+|+|+ .++ ||++||||||.+||+||.||||||.||+||||||| T Consensus 81 ~IEYDPNRsA~IALl~Y~DGEKRYILaP~Gl~vGd~v~SG~~~a-~IK~GNaLPL~~IP~Gt~VHNiEl~PGkGGQlaRS 159 (279) T TIGR01171 81 AIEYDPNRSARIALLHYADGEKRYILAPKGLKVGDTVISGEPEA-PIKPGNALPLKNIPVGTTVHNIELKPGKGGQLARS 159 (279) T ss_pred EEEECCCCCEEEEEEECCCCCEEEEECCCCCCCCCEEEECCCCC-CCCCCCCCCCCCCCCCEEEEEEEEEECCCCHHHEE T ss_conf 97227876602224432787676643278777078898678788-80332227756476240688898830796032211 Q ss_pred CCCEEEEEEECC-CEEEEEECCCCEEEECCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCC Q ss_conf 465189967508-8899994897589985678099985125533230013134410057799701233268747888888 Q gi|254780258|r 159 AGSYARLVERDR-SRALLRLSSGEMRFVHSSCMASIGAVSNQDHSNVNHAKAGRARWLGMRSHVRGVAMNPVDHPHGGGE 237 (278) Q Consensus 159 AGt~a~Ii~k~~-~~~~ikLPSGe~r~i~~~c~AtIG~vsN~~~~~~~~gKAG~~Rwlg~RP~VRGvAMNpvDHPhGGGe 237 (278) ||||||||+||+ +||+|||||||+|+|+.+|+||||+|||+||.|+.|||||++||||+||+||||||||||||||||| T Consensus 160 AG~~aqi~aKe~~~Yv~lrLpSGE~R~v~~~C~ATiG~VGN~d~~n~~~GKAGR~RWlG~RP~VRG~AMNPVDHPHGGGE 239 (279) T TIGR01171 160 AGTSAQILAKEGTKYVTLRLPSGEVRMVLKECRATIGEVGNEDHNNIVLGKAGRSRWLGIRPTVRGVAMNPVDHPHGGGE 239 (279) T ss_pred CCCEEEEEEECCCCEEEEEECCCCEEEECCCCEEEECCCCHHHHHHHHCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCC T ss_conf 14078989763875499982687245422332274122333344422205211331677479511237088877878840 Q ss_pred CCCCCCC-CCCCCCCEECCCCCCCCCCC-CCEEEEEECCC Q ss_conf 8888888-78878873437725488888-86149962346 Q gi|254780258|r 238 GKTSGGR-NPCSPWGKLTKGKRTRSNKS-TDVFIARSRHK 275 (278) Q Consensus 238 Gkt~~gr-~~~spwG~~~kg~kTr~~k~-~~~~i~~~r~~ 275 (278) ||++.|| +||||||+||||+|||++|+ ||+|||++|++ T Consensus 240 Gr~~~Gr~~PvtPWG~ptkG~KTR~~kk~Sd~fIvrrR~~ 279 (279) T TIGR01171 240 GRTPGGRRHPVTPWGKPTKGYKTRKKKKYSDKFIVRRRKK 279 (279) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCC T ss_conf 6785578888788861006641334778776000562689 No 2 >PRK09374 rplB 50S ribosomal protein L2; Validated Probab=100.00 E-value=0 Score=871.96 Aligned_cols=275 Identities=60% Similarity=1.043 Sum_probs=270.7 Q ss_pred CCCEECCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCEEEEEEEEECCCCCCEEEEE Q ss_conf 97501158884105108704576416887222111554367758898487886478751026879886337776047887 Q gi|254780258|r 1 MALKNFNPDTSGRRQLVMVDRNSLHSGKPIKSLTRGLCSKGGRNNTGRVTMRFRGGGHKNRYRLVDFKRGEYGLEGVVKR 80 (278) Q Consensus 1 M~~k~~kP~Tp~~R~~~~~~~~~l~k~~p~K~L~~~~~~~~GRNn~GrITvrhRGGGhKr~yR~IDf~r~~~~~~~~V~~ 80 (278) |+||+|||+|||+||++++++++||+++|+|+||+++++++||||+|+||+|||||||||+||+|||+|++.+++++|++ T Consensus 1 M~~k~~kP~Tp~~R~~~~~~~~~l~k~~P~k~L~~~~~k~~GRNn~GrIT~rhrGGGhKr~yR~IDf~R~k~~i~~~V~~ 80 (276) T PRK09374 1 MAIKKYKPTTPGRRHMVVVDFSELTKGKPEKSLLEPLKKSGGRNNNGRITVRHRGGGHKRHYRIIDFKRNKDGIPAKVER 80 (276) T ss_pred CCCCCCCCCCCCCCCEEECCCCCEECCCCCHHHCCCCCCCCCCCCCCCEEEEEECCCCCEEEEEEEEEECCCCCCEEEEE T ss_conf 98421079898668657216586176984331235566667758986684775079863468874566336998669999 Q ss_pred EEECCCCEEEEEEEECCCCCEEEEECCCCCCCCCEEEECCCCCCCCCCCCHHHHHCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 64158852588984147763467516456610457883666767575741104435746377623330368825886346 Q gi|254780258|r 81 LEYDPNRTAFIALISYSNGSLAYILAPQRLSVGDKVISSDSAVDVKPGNAMPLRFIPVGSIVHNVEMKPGKGGQISRSAG 160 (278) Q Consensus 81 IeYDPnRsa~IALv~~~~g~~~YIlAp~gl~vGd~I~s~~~~~~i~~Gn~lpL~~IP~Gt~I~NIE~~pG~Ggkl~RsAG 160 (278) |||||||||+||||+|+||+++||||||||++||.|+++.++ ++++||+|||.+||+||+|||||+.||+||||||||| T Consensus 81 IeYDPnRsa~IALv~y~dG~k~YIlAp~gl~vGd~I~sg~~~-~~~~GN~lpL~~IP~Gt~IhNIE~~pG~GgklaRSAG 159 (276) T PRK09374 81 IEYDPNRSANIALLHYADGEKRYILAPKGLKVGDTVVSGPDA-DIKPGNALPLRNIPVGTTVHNIELKPGKGGQLARSAG 159 (276) T ss_pred EEECCCCCCEEEEEECCCCCEEEEECHHHCCCCCEEEECCCC-CCCCCCCCCHHHCCCCCEEEEEEECCCCCEEEEEECC T ss_conf 986799883389998589988999744440079999879998-8677664585038898888777713899706986089 Q ss_pred CEEEEEEECCCEEEEEECCCCEEEECCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCC Q ss_conf 51899675088899994897589985678099985125533230013134410057799701233268747888888888 Q gi|254780258|r 161 SYARLVERDRSRALLRLSSGEMRFVHSSCMASIGAVSNQDHSNVNHAKAGRARWLGMRSHVRGVAMNPVDHPHGGGEGKT 240 (278) Q Consensus 161 t~a~Ii~k~~~~~~ikLPSGe~r~i~~~c~AtIG~vsN~~~~~~~~gKAG~~Rwlg~RP~VRGvAMNpvDHPhGGGeGkt 240 (278) |||||++||+++|+|||||||+|+|+++|+||||+|||.+|.+.++||||++||||+||+||||||||||||||||||+| T Consensus 160 t~a~ii~k~~~~~~vkLPSGe~r~i~~~C~ATIG~VsN~~~~~~~lgKAGr~Rwlg~RP~VRGvAMNpvDHPhGGGeg~~ 239 (276) T PRK09374 160 TSAQLVAKEGKYATLRLPSGEVRKVLAECRATIGEVGNEEHSNISLGKAGRSRWLGIRPTVRGVAMNPVDHPHGGGEGRT 239 (276) T ss_pred CEEEEEEECCCEEEEECCCCCEEEECCCCEEEEEEECCCCCCCCCCCCCCHHHHCCCCCCCCEEECCCCCCCCCCCCCCC T ss_conf 83899996099999986898805788657599866218430336335567556178898543021288758988888888 Q ss_pred CCCCCCCCCCCEECCCCCCCC-CCCCCEEEEEECCCC Q ss_conf 888878878873437725488-888861499623467 Q gi|254780258|r 241 SGGRNPCSPWGKLTKGKRTRS-NKSTDVFIARSRHKN 276 (278) Q Consensus 241 ~~gr~~~spwG~~~kg~kTr~-~k~~~~~i~~~r~~~ 276 (278) ++|++|+||||++|||+|||+ +|+||+|||++|+++ T Consensus 240 ~gg~~~~spwG~~~kG~kTR~~~k~s~~~I~~~r~~~ 276 (276) T PRK09374 240 SGGRHPVTPWGKPTKGYKTRKKNKRSDKFIVRRRKKK 276 (276) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCC T ss_conf 9887984887642788627788887675587715579 No 3 >CHL00052 rpl2 ribosomal protein L2 Probab=100.00 E-value=0 Score=848.87 Aligned_cols=271 Identities=47% Similarity=0.852 Sum_probs=263.4 Q ss_pred CCCEECCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCEEEEEECCCCCCEEEEEEEEECCCCCCEEEE Q ss_conf 97501158884105108704576416887222111554367-75889848788647875102687988633777604788 Q gi|254780258|r 1 MALKNFNPDTSGRRQLVMVDRNSLHSGKPIKSLTRGLCSKG-GRNNTGRVTMRFRGGGHKNRYRLVDFKRGEYGLEGVVK 79 (278) Q Consensus 1 M~~k~~kP~Tp~~R~~~~~~~~~l~k~~p~K~L~~~~~~~~-GRNn~GrITvrhRGGGhKr~yR~IDf~r~~~~~~~~V~ 79 (278) |+||+|||+|||+||++++|+ ++++|+|+||+++++++ ||||+|+||+|||||||||+||+|||+|++.+++++|+ T Consensus 1 M~ik~~kP~Tp~~R~~~~~d~---~k~~p~k~L~~~~~~~~~GRNn~GrIT~RhrGGGhKr~yR~IDf~R~k~~i~~~V~ 77 (274) T CHL00052 1 MAIRLYKTYTPGTRNRSVSSF---VKSNPRKNLIYGQHRCGKGRNNRGIITARHRGGGHKRLYRKIDFRRNKKDIYGRIV 77 (274) T ss_pred CCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCEEEEEEEECCCCCCCCEEEE T ss_conf 984201798985686364464---46976601033335788888998678278507986437898761336689658999 Q ss_pred EEEECCCCEEEEEEEECCCCCEEEEECCCCCCCCCEEEECCCCCCCCCCCCHHHHHCCCCCEEEEEEEECCCCEEEEEEC Q ss_conf 76415885258898414776346751645661045788366676757574110443574637762333036882588634 Q gi|254780258|r 80 RLEYDPNRTAFIALISYSNGSLAYILAPQRLSVGDKVISSDSAVDVKPGNAMPLRFIPVGSIVHNVEMKPGKGGQISRSA 159 (278) Q Consensus 80 ~IeYDPnRsa~IALv~~~~g~~~YIlAp~gl~vGd~I~s~~~~~~i~~Gn~lpL~~IP~Gt~I~NIE~~pG~Ggkl~RsA 159 (278) +|||||||||+||||+|+||+++|||||+||++||.|+++.++ ++++||+|||.+||+||+|||||+.||+|||||||| T Consensus 78 ~IEYDPnRsa~IALi~y~dg~k~YIlAp~glkvGd~I~sg~~~-~ik~GN~lpL~~IP~Gt~IhNIE~~pG~Ggk~~RSA 156 (274) T CHL00052 78 TIEYDPNRNAYICLIHYGDGEKRYILHPRGLIIGDTIVSGTEA-PIKIGNALPLTDIPLGTAIHNIEITPGKGGQLARAA 156 (274) T ss_pred EEEECCCCCCEEEEEECCCCCEEEEEECCCCCCCCEEEECCCC-CCCCCCCHHHHHCCCCCEEEEEEECCCCCCEEEECC T ss_conf 9884899996799998279988899803886579999869999-877746540754899887886651289986498749 Q ss_pred CCEEEEEEECCCEEEEEECCCCEEEECCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCC Q ss_conf 65189967508889999489758998567809998512553323001313441005779970123326874788888888 Q gi|254780258|r 160 GSYARLVERDRSRALLRLSSGEMRFVHSSCMASIGAVSNQDHSNVNHAKAGRARWLGMRSHVRGVAMNPVDHPHGGGEGK 239 (278) Q Consensus 160 Gt~a~Ii~k~~~~~~ikLPSGe~r~i~~~c~AtIG~vsN~~~~~~~~gKAG~~Rwlg~RP~VRGvAMNpvDHPhGGGeGk 239 (278) ||||||++||++||+|||||||+|+|+.+|+||||+|||.+|.++++||||++||||+||+||||||||||||||||||| T Consensus 157 Gt~A~ii~k~~~~~~ikLPSGe~R~i~~~C~ATIG~VsN~~~~~~~lgKAGr~Rwlg~RP~VRGvAMNpvDHPhGGGeGk 236 (274) T CHL00052 157 GAVAKLIAKEGKSVTLKLPSGEVRLISKNCSATIGQVGNVGVNNKSLGKAGSKRWLGKRPVVRGVVMNPVDHPHGGGEGR 236 (274) T ss_pred CCCEEEEEECCCEEEEECCCCCEEEECCCCEEEEEECCCHHHCCCEECCCCHHHHCCCCCCCCEECCCCCCCCCCCCCCC T ss_conf 98389999559989998389985898976959973532802315720357744517779843301048766888888777 Q ss_pred CCCCC-CCCCCCCEECCCCCCCC-CCCCCEEEEEECCC Q ss_conf 88888-78878873437725488-88886149962346 Q gi|254780258|r 240 TSGGR-NPCSPWGKLTKGKRTRS-NKSTDVFIARSRHK 275 (278) Q Consensus 240 t~~gr-~~~spwG~~~kg~kTr~-~k~~~~~i~~~r~~ 275 (278) |++|+ +||||||++|||+|||+ +|.||+|||++|++ T Consensus 237 ~~~G~~~p~spwG~~~kG~kTR~~~k~s~~~i~~~rkk 274 (274) T CHL00052 237 APIGRKKPVTPWGKPALGRRTRKRKKYSDNLILRRRKK 274 (274) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCC T ss_conf 98989897388854678862688888777568651479 No 4 >PTZ00031 ribosomal protein L2; Provisional Probab=100.00 E-value=0 Score=828.45 Aligned_cols=254 Identities=49% Similarity=0.804 Sum_probs=250.1 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEEEECCCCCEE Q ss_conf 64168872221115543677588984878864787510268798863377760478876415885258898414776346 Q gi|254780258|r 23 SLHSGKPIKSLTRGLCSKGGRNNTGRVTMRFRGGGHKNRYRLVDFKRGEYGLEGVVKRLEYDPNRTAFIALISYSNGSLA 102 (278) Q Consensus 23 ~l~k~~p~K~L~~~~~~~~GRNn~GrITvrhRGGGhKr~yR~IDf~r~~~~~~~~V~~IeYDPnRsa~IALv~~~~g~~~ 102 (278) .|||++|+|+||+++.+++||||+|+||+|||||||||+||+|||+|++.+++++|++|||||||||+||||+|+||+++ T Consensus 68 ~LwKGrP~k~Lt~~~~k~GGRNN~GrITvRHRGGGHKR~YR~IDFkR~k~~i~a~V~~IEYDPNRSA~IALv~y~DGek~ 147 (330) T PTZ00031 68 VLYKCRVVEQLSIRRVKFGGRNNTGRITTRHRGGGHVQRLRFIDFKRQRKDIYSTVLRIEYDPTRSAHVALIQYEDGVLS 147 (330) T ss_pred CCCCCCCCHHHCCCCCCCCCCCCCCCEEEEECCCCCCCEEEEEECCCCCCCCCEEEEEEEECCCCCEEEEEEECCCCCEE T ss_conf 32779875221455787788889825604725886562467851035779997799998789999806899984799688 Q ss_pred EEECCCCCCCCCEEEECCCCCCCCCCCCHHHHHCCCCCEEEEEEEECCCCEEEEEECCCEEEEEEECCCEEEEEECCCCE Q ss_conf 75164566104578836667675757411044357463776233303688258863465189967508889999489758 Q gi|254780258|r 103 YILAPQRLSVGDKVISSDSAVDVKPGNAMPLRFIPVGSIVHNVEMKPGKGGQISRSAGSYARLVERDRSRALLRLSSGEM 182 (278) Q Consensus 103 YIlAp~gl~vGd~I~s~~~~~~i~~Gn~lpL~~IP~Gt~I~NIE~~pG~Ggkl~RsAGt~a~Ii~k~~~~~~ikLPSGe~ 182 (278) |||||++|++||.|+|+.++ ++++||+|||.+||+||+|||||+.||+||||||||||||||++||++||+|||||||+ T Consensus 148 YILAP~glkvGD~I~Sg~~a-~ik~GN~lPL~~IP~GT~IHNIEl~PGkGgqlaRSAGt~Aqli~Kd~~ya~ikLpSGE~ 226 (330) T PTZ00031 148 YILCPAGVRPGDKLLASKYA-PINPGNSLPLRHIPVNSIVHNVEMRPGAGGQIARAGGTYATIVEKDEMFATLRLKSTEL 226 (330) T ss_pred EEECCCCCCCCCEEEECCCC-CCCCCCCCCCCCCCCCCEEEEEEECCCCCCEEEECCCCEEEEEECCCCEEEEECCCCCE T ss_conf 99774898789989859999-87773556726589988777777228997189976887279980579979997687346 Q ss_pred EEECCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCCCCC Q ss_conf 99856780999851255332300131344100577997012332687478888888888888788788734377254888 Q gi|254780258|r 183 RFVHSSCMASIGAVSNQDHSNVNHAKAGRARWLGMRSHVRGVAMNPVDHPHGGGEGKTSGGRNPCSPWGKLTKGKRTRSN 262 (278) Q Consensus 183 r~i~~~c~AtIG~vsN~~~~~~~~gKAG~~Rwlg~RP~VRGvAMNpvDHPhGGGeGkt~~gr~~~spwG~~~kg~kTr~~ 262 (278) |+|+++|+||||+|||.+|+++++||||++||||+||+||||||||||||||||||||++|||||||||++|||+|||++ T Consensus 227 R~i~~~C~ATIG~VsN~~h~~~~lGKAGrsRWlG~RP~VRGvAMNPVDHPHGGGEGkt~ggr~~vsPWG~~tkG~kTR~~ 306 (330) T PTZ00031 227 RKFPLDCWATIGQVSNIEHNMRILKKAGTRRNLGWRPVVRGIAMNPNKHPHGGGNSKSGTKRPKCSLWGICRDGYKTRSK 306 (330) T ss_pred EEECCCCEEEEEEEECHHHHHHHHCCCHHHCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 89878661888864287874321153232345884987641051898889899888898988897998754788747899 Q ss_pred CCCCEEEEEECCCCC Q ss_conf 888614996234678 Q gi|254780258|r 263 KSTDVFIARSRHKNK 277 (278) Q Consensus 263 k~~~~~i~~~r~~~~ 277 (278) |++|+|||+||..++ T Consensus 307 kk~~~~Iv~Rr~~gr 321 (330) T PTZ00031 307 KKPLGFIIRRKLCGR 321 (330) T ss_pred CCCCCEEEEEECCCC T ss_conf 998843897603585 No 5 >COG0090 RplB Ribosomal protein L2 [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=788.51 Aligned_cols=274 Identities=58% Similarity=0.956 Sum_probs=270.0 Q ss_pred CCCEECCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCEEEEEEEEECCCCCCEEEEE Q ss_conf 97501158884105108704576416887222111554367758898487886478751026879886337776047887 Q gi|254780258|r 1 MALKNFNPDTSGRRQLVMVDRNSLHSGKPIKSLTRGLCSKGGRNNTGRVTMRFRGGGHKNRYRLVDFKRGEYGLEGVVKR 80 (278) Q Consensus 1 M~~k~~kP~Tp~~R~~~~~~~~~l~k~~p~K~L~~~~~~~~GRNn~GrITvrhRGGGhKr~yR~IDf~r~~~~~~~~V~~ 80 (278) |++|+|||+|||+|+++.+||++||+++|+|+|+..+.+++||||+|+||+||+|||||++||+|||+|++.+++++|++ T Consensus 1 m~ik~~kp~t~g~r~~~~~~~~~~~~~~p~k~l~~~~~~~~gR~n~G~iT~R~~ggghK~~yr~idfkr~k~~i~g~V~~ 80 (275) T COG0090 1 MAIKKYKPTTPGRRHMVVVDFAELTKGKPEKSLMGKLIKSQGRNNRGRITVRHRGGGHKRRYRLIDFKRNKDGIPGKVED 80 (275) T ss_pred CCCCEECCCCCCCCEEEEECCHHHHCCCCCHHHHHCCCCCCCCCCCCCCEEECCCCCCCCCEECCCCCCCCCCCCEEEEE T ss_conf 97312036998731267760101105985245311020025788898704885788844415645645466786679999 Q ss_pred EEECCCCEEEEEEEECCCCCEEEEECCCCCCCCCEEEECCCCCCCCCCCCHHHHHCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 64158852588984147763467516456610457883666767575741104435746377623330368825886346 Q gi|254780258|r 81 LEYDPNRTAFIALISYSNGSLAYILAPQRLSVGDKVISSDSAVDVKPGNAMPLRFIPVGSIVHNVEMKPGKGGQISRSAG 160 (278) Q Consensus 81 IeYDPnRsa~IALv~~~~g~~~YIlAp~gl~vGd~I~s~~~~~~i~~Gn~lpL~~IP~Gt~I~NIE~~pG~Ggkl~RsAG 160 (278) |||||||||+||||.|+|||++|||||+||++||+|+++.++ +|++||+|||.+||+||.|||||+.||+||||||||| T Consensus 81 IeyDP~RsA~IAlv~y~dGek~yilAp~Gl~vGd~I~~G~~a-~ik~GN~lpL~~IP~Gt~VhNVE~~pG~GGq~aRSaG 159 (275) T COG0090 81 IEYDPNRSAPIALVVYEDGEKRYILAPEGLKVGDVIESGKDA-DIKPGNALPLGNIPEGTIVHNVELKPGDGGQLARSAG 159 (275) T ss_pred EEECCCCCCCEEEEEECCCCEEEEECCCCCCCCCEEEECCCC-CCCCCCEEEECCCCCCCEEEEEEECCCCCCEEEEECC T ss_conf 888999986258998148878899744765458889957888-8677645241357787568635603588842887078 Q ss_pred CEEEEEEECCCEEEEEECCCCEEEECCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCC Q ss_conf 51899675088899994897589985678099985125533230013134410057799701233268747888888888 Q gi|254780258|r 161 SYARLVERDRSRALLRLSSGEMRFVHSSCMASIGAVSNQDHSNVNHAKAGRARWLGMRSHVRGVAMNPVDHPHGGGEGKT 240 (278) Q Consensus 161 t~a~Ii~k~~~~~~ikLPSGe~r~i~~~c~AtIG~vsN~~~~~~~~gKAG~~Rwlg~RP~VRGvAMNpvDHPhGGGeGkt 240 (278) |||||+++|++|++|+|||||+|.|+++|+||||+|+|.+|.+++|+|||++||+|+||+||||||||||||||||||++ T Consensus 160 tyA~vv~~~~~y~~vrLpSGe~r~v~~~CrATIGvV~n~~~~~~~lgKAGr~r~~g~rPtVRGvAMNpvDHPHGGGeg~~ 239 (275) T COG0090 160 TYAQVVGKEGNYVIVRLPSGEMRKVLSECRATIGVVANGGHILKPLGKAGRARHKGKRPTVRGVAMNPVDHPHGGGEGQH 239 (275) T ss_pred CEEEEEECCCCEEEEECCCCCEEEECCCCCEEEEEECCCCCCCCEECCCCHHCCCCCCCCCCEEECCCCCCCCCCCCCCC T ss_conf 55899972588799988888857545010189988617863125000021310874178355552388768878878888 Q ss_pred CCCCCCCCCCCEECCCCCCCC-CCCCCEEEEEECCC Q ss_conf 888878878873437725488-88886149962346 Q gi|254780258|r 241 SGGRNPCSPWGKLTKGKRTRS-NKSTDVFIARSRHK 275 (278) Q Consensus 241 ~~gr~~~spwG~~~kg~kTr~-~k~~~~~i~~~r~~ 275 (278) ++|++|+||||++++|++||. +|+|++||+++|++ T Consensus 240 ~ggk~p~~pwg~~~~Gkktr~~~krt~~~ivrrr~~ 275 (275) T COG0090 240 PGGKPPTVPWGKPTPGKKTRIAAKRTGKFIVRRRKK 275 (275) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCC T ss_conf 999899788988898641333334447633312579 No 6 >PRK09612 rpl2p 50S ribosomal protein L2P; Validated Probab=100.00 E-value=0 Score=573.48 Aligned_cols=214 Identities=36% Similarity=0.563 Sum_probs=199.8 Q ss_pred CCCCCCCCCCCCCEEEEEECCCCCCEEEEEEEEECCCC-CCEEEEEEEECCCCEEEEEEEECCCCCEEEEECCCCCCCCC Q ss_conf 55436775889848788647875102687988633777-60478876415885258898414776346751645661045 Q gi|254780258|r 36 GLCSKGGRNNTGRVTMRFRGGGHKNRYRLVDFKRGEYG-LEGVVKRLEYDPNRTAFIALISYSNGSLAYILAPQRLSVGD 114 (278) Q Consensus 36 ~~~~~~GRNn~GrITvrhRGGGhKr~yR~IDf~r~~~~-~~~~V~~IeYDPnRsa~IALv~~~~g~~~YIlAp~gl~vGd 114 (278) ++.-.++||+.|.||.||++++||+.||.+||.+++.+ ++++|++|||||||||+||||.|+||+++|||||++|++|| T Consensus 2 GKri~~qR~g~G~~t~R~~~~~~k~~~r~~~~~~~r~~~i~g~V~~IeyDPnRsA~IAlv~y~dg~~~YIiAp~gl~vGd 81 (240) T PRK09612 2 GKRIIVQRRGRGTPTYRSPSHRRKGPVKYPPFELEKDGTLRGKVIDILHDPGRNAPVAKVKFENGEEFLILAPEGLYVGQ 81 (240) T ss_pred CCCCCCCCCCCCCCEEECCCCCCCCCEECCCCCCCCCCCEEEEEEEEEECCCCCCEEEEEEECCCCEEEEECCCCCCCCC T ss_conf 97544324799987488789983230055761244479677999999989899861899993799999998858887899 Q ss_pred EEEECCCCCCCCCCCCHHHHHCCCCCEEEEEEEECCCCEEEEEECCCEEEEEEECCCEEEEEECCCCEEEECCCCEEEEE Q ss_conf 78836667675757411044357463776233303688258863465189967508889999489758998567809998 Q gi|254780258|r 115 KVISSDSAVDVKPGNAMPLRFIPVGSIVHNVEMKPGKGGQISRSAGSYARLVERDRSRALLRLSSGEMRFVHSSCMASIG 194 (278) Q Consensus 115 ~I~s~~~~~~i~~Gn~lpL~~IP~Gt~I~NIE~~pG~Ggkl~RsAGt~a~Ii~k~~~~~~ikLPSGe~r~i~~~c~AtIG 194 (278) .|+++.++ ++++||+|||.+||+||+|||||+.||+||||||||||||||++||+++++|||||||+|+|+++|+|||| T Consensus 82 ~I~~g~~a-~i~~GN~lpL~~IP~Gt~IhNIE~~pG~Ggkl~RSAG~~a~ii~k~~~~~~vkLpSGe~r~v~~~C~ATIG 160 (240) T PRK09612 82 EIEIGPSA-EIKPGNTLPLGEIPEGTPVCNIESRPGDGGKFVRSSGTYALVVGHEGDKVIVQLPSGKIKELNPRCRATIG 160 (240) T ss_pred EEEECCCC-CCCCCCCCCHHHCCCCCEEEEECCCCCCCEEEEEECCCEEEEEEECCCEEEEECCCCCEEEECCCCEEEEE T ss_conf 99968988-76667603861389999998232648986179860797699999539979998489875888766839999 Q ss_pred EECCCCCCCCCCCCCCCCCCCCCC-----CCCCEEEECCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 512553323001313441005779-----97012332687478888888888888788788 Q gi|254780258|r 195 AVSNQDHSNVNHAKAGRARWLGMR-----SHVRGVAMNPVDHPHGGGEGKTSGGRNPCSPW 250 (278) Q Consensus 195 ~vsN~~~~~~~~gKAG~~Rwlg~R-----P~VRGvAMNpvDHPhGGGeGkt~~gr~~~spw 250 (278) +|||.+|.+++++|||++||+.++ |+|||||||||||||||||+++.+.-.+||-+ T Consensus 161 ~Vsn~~~~~~~l~KAG~~~~~~r~kg~r~P~VRGvAMNpvDHPHGGG~~~~~g~pstv~r~ 221 (240) T PRK09612 161 VVAGGGRIEKPFLKAGKKYHKMKAKAKKWPRVRGVAMNAVDHPHGGGNHQHPGRPSTVSRN 221 (240) T ss_pred EECCCCHHHHHHHHHHHHHHHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 8008751120655535677654402788981640001777699789899999998755588 No 7 >PTZ00180 60S ribosomal protein L2; Provisional Probab=100.00 E-value=0 Score=571.31 Aligned_cols=213 Identities=33% Similarity=0.527 Sum_probs=202.0 Q ss_pred CCCCCCCCCCEEEEEECCCCCCEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEEEECCCCC-----EEEEECCCCCCCC Q ss_conf 36775889848788647875102687988633777604788764158852588984147763-----4675164566104 Q gi|254780258|r 39 SKGGRNNTGRVTMRFRGGGHKNRYRLVDFKRGEYGLEGVVKRLEYDPNRTAFIALISYSNGS-----LAYILAPQRLSVG 113 (278) Q Consensus 39 ~~~GRNn~GrITvrhRGGGhKr~yR~IDf~r~~~~~~~~V~~IeYDPnRsa~IALv~~~~g~-----~~YIlAp~gl~vG 113 (278) ..-||+++.|.+++++.|..| ||.|||+|++.+++++|++|||||||||+||||.|+||+ ++|||||+||++| T Consensus 8 QRrG~gs~fr~~s~~r~G~ak--yR~iDf~r~~~~i~g~V~~IeyDPnRsA~IAlv~y~dG~~~~~~k~yIlApeGl~vG 85 (260) T PTZ00180 8 CRKGNGSVYQVHGHKRLGPAK--LRILDYAERHGYMRGVVKSIEHEAGRGAALARVEFRHPYKFRRVKELMVAPEGMFTG 85 (260) T ss_pred CCCCCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCEEEEEEEECCCCCCEEEEEEECCCCCCCCCEEEEEEECCCCCC T ss_conf 027999882176555377531--265464333699778999989898998608899943777556626899803898899 Q ss_pred CEEEECCCCCCCCCCCCHHHHHCCCCCEEEEEEEECCCCEEEEEECCCEEEEEEEC--CCEEEEEECCCCEEEECCCCEE Q ss_conf 57883666767575741104435746377623330368825886346518996750--8889999489758998567809 Q gi|254780258|r 114 DKVISSDSAVDVKPGNAMPLRFIPVGSIVHNVEMKPGKGGQISRSAGSYARLVERD--RSRALLRLSSGEMRFVHSSCMA 191 (278) Q Consensus 114 d~I~s~~~~~~i~~Gn~lpL~~IP~Gt~I~NIE~~pG~Ggkl~RsAGt~a~Ii~k~--~~~~~ikLPSGe~r~i~~~c~A 191 (278) |.|+++.++ ++++||+|||.+||+||+|||||+.||+||||||||||||||++|+ .++++|||||||+|+|+++|+| T Consensus 86 d~I~sG~~a-~i~~GN~lpL~~IP~Gt~IhNIE~~PG~GgklaRsAGt~A~vv~k~~e~~~v~lkLPSGe~R~v~~~CrA 164 (260) T PTZ00180 86 QSVFCGQKA-PLAIGNVLPLGQITEGCIVCNVEAKPGDRGTLARASGDYCIIISHNHETGRTRLKLPSGQKKSVPSTSRA 164 (260) T ss_pred CEEEECCCC-CCCCCCCCCHHHCCCCCEEEEEEECCCCCCEEEEECCCEEEEEEECCCCCEEEEECCCCCEEECCCCCCE T ss_conf 989869866-7777563546338998778867303899877996067608999716877579997788054722213877 Q ss_pred EEEEECCCCCCCCCCCCCCCC--CCCCCC---CCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCEEC Q ss_conf 998512553323001313441--005779---970123326874788888888888887887887343 Q gi|254780258|r 192 SIGAVSNQDHSNVNHAKAGRA--RWLGMR---SHVRGVAMNPVDHPHGGGEGKTSGGRNPCSPWGKLT 254 (278) Q Consensus 192 tIG~vsN~~~~~~~~gKAG~~--Rwlg~R---P~VRGvAMNpvDHPhGGGeGkt~~gr~~~spwG~~~ 254 (278) |||+|||.+|.+++|+|||++ ||+|+| |+||||||||||||||||++++.+...+||+|.-|- T Consensus 165 TIG~Vsn~~~~~~~l~KAG~~~~r~~g~r~~~P~VRGvAMNpVDHPHGGG~hq~~g~pstvsr~apPg 232 (260) T PTZ00180 165 MIGIISGGGRIEKPVLKAGNSFYRFRGKRNCWPKVRGVARNPVEHPHGGGNHQHIGHPSTVSRHSPPG 232 (260) T ss_pred EEEEEECCCHHEHHHCCCCCCCCHHCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 98752077411333261164430001448899825721228777997998988899987769988898 No 8 >KOG0438 consensus Probab=100.00 E-value=0 Score=562.64 Aligned_cols=273 Identities=41% Similarity=0.737 Sum_probs=257.0 Q ss_pred CCCEECCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCEEEEEECCCCCCEEEEEEEEECCCCCCE--- Q ss_conf 97501158884105108704576416887222111554367-75889848788647875102687988633777604--- Q gi|254780258|r 1 MALKNFNPDTSGRRQLVMVDRNSLHSGKPIKSLTRGLCSKG-GRNNTGRVTMRFRGGGHKNRYRLVDFKRGEYGLEG--- 76 (278) Q Consensus 1 M~~k~~kP~Tp~~R~~~~~~~~~l~k~~p~K~L~~~~~~~~-GRNn~GrITvrhRGGGhKr~yR~IDf~r~~~~~~~--- 76 (278) |.+..|+|.||+.| .-+....+|++..|.+.|+..++..+ |||++|+||+||+|||||++||+|||.|......+ T Consensus 23 a~i~~~~~~t~~~r-~~l~~~~~l~~~~p~~~~~~~~~~~~~GRd~tGriv~~h~GGGhKq~yr~idF~R~~p~~~~~~~ 101 (312) T KOG0438 23 ARITVLKPGTPSLR-NGLLQQPDLKKSTPSRPLVESLKINGLGRDETGRIVVRHIGGGHKQRYRMIDFARPRPIEQGTTT 101 (312) T ss_pred CEEEECCCCCCCCC-CCCCCCCHHHHCCCCCCEEEEEEECCCCCCCCCCEEEEEECCCEEEEEEEEEECCCCCCCCCCCC T ss_conf 23210146886656-76434541643488763343478558887664636898724866126667665157875566543 Q ss_pred --EEEEEEECCCCEEEEEEEECCCCCEEEEECCCCCCCCCEEEECCCCC----CCCCCCCHHHHHCCCCCEEEEEEEECC Q ss_conf --78876415885258898414776346751645661045788366676----757574110443574637762333036 Q gi|254780258|r 77 --VVKRLEYDPNRTAFIALISYSNGSLAYILAPQRLSVGDKVISSDSAV----DVKPGNAMPLRFIPVGSIVHNVEMKPG 150 (278) Q Consensus 77 --~V~~IeYDPnRsa~IALv~~~~g~~~YIlAp~gl~vGd~I~s~~~~~----~i~~Gn~lpL~~IP~Gt~I~NIE~~pG 150 (278) .|.+|||||||||+|||+.+.+++++||||+|||+.||.|.++.+.. ..+.||++||.+||+||+|||+|..|+ T Consensus 102 ~e~v~~i~yDP~Rs~~iaLv~~~~~~~~~Ila~egm~aGD~i~ss~~i~~~av~~~~Gna~Pl~~lp~GT~ih~~e~~p~ 181 (312) T KOG0438 102 EERVIEIEYDPGRSAKIALVAGGTGELRYILATEGLKAGDTILSSRKIPFMAVKGKEGNALPLGDLPVGTLIHNVEITPG 181 (312) T ss_pred CCEEEEEEECCCCCCCEEEEECCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCEEEEECCCCCCC T ss_conf 21379999789864317888326777059998468878883123564443212366777213044211125300135888 Q ss_pred CCEEEEEECCCEEEEEEECCCEEEEEECCCCEEEECCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCC Q ss_conf 88258863465189967508889999489758998567809998512553323001313441005779970123326874 Q gi|254780258|r 151 KGGQISRSAGSYARLVERDRSRALLRLSSGEMRFVHSSCMASIGAVSNQDHSNVNHAKAGRARWLGMRSHVRGVAMNPVD 230 (278) Q Consensus 151 ~Ggkl~RsAGt~a~Ii~k~~~~~~ikLPSGe~r~i~~~c~AtIG~vsN~~~~~~~~gKAG~~Rwlg~RP~VRGvAMNpvD 230 (278) .++||||||||+|+|++|++.+++|+|||+|.+.++.+|+||||+|||.+|+.+.+||||++||||+||+||||+||++| T Consensus 182 ~~~~f~raAGt~a~ilak~~~~aiv~Lps~r~~~~~~tC~ATvGrvsni~~~~r~~GkAgr~rwlG~Rp~vrg~~~s~~~ 261 (312) T KOG0438 182 RSAQFARAAGTSATILAKAGKFAIVQLPSKRERSVLRTCVATVGRVSNIDHNHRILGKAGRSRWLGKRPQVRGVLMSGLD 261 (312) T ss_pred CCHHHHHHCCCHHHHHHCCCCCEEEECCCCCHHHHHHHHHEEECCCCCCCCCCCEECCCCHHHHCCCCCCCCCCCCCCCC T ss_conf 30353454284353544179721587655300455432102432314632012111210022221647643222125766 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEECCC-CCCCCCCCCCEEEEEECC Q ss_conf 78888888888888788788734377-254888888614996234 Q gi|254780258|r 231 HPHGGGEGKTSGGRNPCSPWGKLTKG-KRTRSNKSTDVFIARSRH 274 (278) Q Consensus 231 HPhGGGeGkt~~gr~~~spwG~~~kg-~kTr~~k~~~~~i~~~r~ 274 (278) ||||||+|+..+-.+|++|||+++|+ ++||+++.++++-+..|. T Consensus 262 H~kgg~~gr~i~~~~P~~~~~~~ak~~~~~r~~~~a~k~~~~~~~ 306 (312) T KOG0438 262 HPKGGGKGRKIGRKKPVTPWGKPAKGLRPTRRKKGANKVLRALRP 306 (312) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 777787643346789977100122357876621133334556653 No 9 >pfam03947 Ribosomal_L2_C Ribosomal Proteins L2, C-terminal domain. Probab=100.00 E-value=0 Score=417.40 Aligned_cols=129 Identities=62% Similarity=1.100 Sum_probs=126.9 Q ss_pred CCCCCCHHHHHCCCCCEEEEEEEECCCCEEEEEECCCEEEEEEECCCEEEEEECCCCEEEECCCCEEEEEEECCCCCCCC Q ss_conf 57574110443574637762333036882588634651899675088899994897589985678099985125533230 Q gi|254780258|r 125 VKPGNAMPLRFIPVGSIVHNVEMKPGKGGQISRSAGSYARLVERDRSRALLRLSSGEMRFVHSSCMASIGAVSNQDHSNV 204 (278) Q Consensus 125 i~~Gn~lpL~~IP~Gt~I~NIE~~pG~Ggkl~RsAGt~a~Ii~k~~~~~~ikLPSGe~r~i~~~c~AtIG~vsN~~~~~~ 204 (278) +++||++||.+||+||+|||||+.||+|+||||||||||+|++||++|++|||||||+|+|+++|+||||+|||.+|.++ T Consensus 1 ik~Gn~~pL~~iP~Gt~I~NIE~~pg~ggk~~RsAGt~A~ii~k~~~~~~vkLPSGe~r~i~~~c~AtiG~vsn~~~~~~ 80 (130) T pfam03947 1 IKIGNALPLKNIPVGTVIHNIELIPGKGGQLARSAGTYAQILAKEGKYVTLKLPSGEIRLVSSNCRATIGVVSNIDHNNK 80 (130) T ss_pred CCCCCCCCHHHCCCCCEEEEEECCCCCCCEEEEECCCEEEEEECCCCEEEEECCCCCEEEECCCCEEEEEEECCCCCCCC T ss_conf 98546431846899986997766689963488838987999890499999985899868889768599958138511047 Q ss_pred CCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCC-CCCCCCCEE Q ss_conf 0131344100577997012332687478888888888888-788788734 Q gi|254780258|r 205 NHAKAGRARWLGMRSHVRGVAMNPVDHPHGGGEGKTSGGR-NPCSPWGKL 253 (278) Q Consensus 205 ~~gKAG~~Rwlg~RP~VRGvAMNpvDHPhGGGeGkt~~gr-~~~spwG~~ 253 (278) +++|||++||||+||+||||||||||||||||||++++|+ +|+||||+| T Consensus 81 ~~gKAG~~r~~g~rP~VRGvaMNpvDHPhGGG~g~~~~g~~~~~s~wg~p 130 (130) T pfam03947 81 VLGKAGRNRWLGKRPTVRGVAMNPVDHPHGGGEGRTSIGRPPPVSPWGKP 130 (130) T ss_pred CCCCHHHHHHCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 21553567658879976666138766897998888978797987899998 No 10 >KOG2309 consensus Probab=100.00 E-value=3.5e-38 Score=271.29 Aligned_cols=198 Identities=35% Similarity=0.583 Sum_probs=168.0 Q ss_pred EEECCCCCCEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEEEECCC--CCEEEEECCCCCCCCCEEEECCCCCCCCCCC Q ss_conf 86478751026879886337776047887641588525889841477--6346751645661045788366676757574 Q gi|254780258|r 52 RFRGGGHKNRYRLVDFKRGEYGLEGVVKRLEYDPNRTAFIALISYSN--GSLAYILAPQRLSVGDKVISSDSAVDVKPGN 129 (278) Q Consensus 52 rhRGGGhKr~yR~IDf~r~~~~~~~~V~~IeYDPnRsa~IALv~~~~--g~~~YIlAp~gl~vGd~I~s~~~~~~i~~Gn 129 (278) +||-+-.| |..||--....+.+.|.+|.+||.|-|++|.+.|.| ...-.++|.|+|..|+.+.++.++ ++.+|| T Consensus 19 k~r~~~~~---r~~d~~~~~~~~~g~v~~iih~~~rgapla~v~frd~~~~~~~F~a~eg~~tgq~~~~g~ka-~~~ign 94 (248) T KOG2309 19 KHRKGAAK---RTLDYAERHGYIKGVVKDIIHDPGRGAPLAKVVFRDYKKDKELFIAAEGMYTGQFVYCGKKA-QLNIGN 94 (248) T ss_pred CCCCCCCC---CHHHHHHCCCCEEEEEEEEECCCCCCCCCCEEEECCCCCEEEEEECCCCCEECCEECCCCCC-CCCCCC T ss_conf 00257532---02435431301203577775157788640203414656105898646662214255377443-401053 Q ss_pred CHHHHHCCCCCEEEEEEEECCCCEEEEEECCCEEEEEEE--CCCEEEEEECCCCEEEECCCCEEEEEEECCCCCCCCCCC Q ss_conf 110443574637762333036882588634651899675--088899994897589985678099985125533230013 Q gi|254780258|r 130 AMPLRFIPVGSIVHNVEMKPGKGGQISRSAGSYARLVER--DRSRALLRLSSGEMRFVHSSCMASIGAVSNQDHSNVNHA 207 (278) Q Consensus 130 ~lpL~~IP~Gt~I~NIE~~pG~Ggkl~RsAGt~a~Ii~k--~~~~~~ikLPSGe~r~i~~~c~AtIG~vsN~~~~~~~~g 207 (278) ++|+.++|+|+.|+|+|..+++.+.|+|++|.||.+|++ |.+.+.|+||||..+.+++.|+|+||+|.+.....+++. T Consensus 95 v~~~~s~peg~~v~~ve~~~gdrg~lar~sGnya~vIaHn~dt~kTrIkLPsgaKKvV~S~~RamIG~vAggG~~dKp~l 174 (248) T KOG2309 95 VLPVGSMPEGTIVCNVEEKPGDRGALARASGNYAIVIAHNPDTKKTRIKLPSGAKKVVQSACRAMIGVVAGGGRTDKPLL 174 (248) T ss_pred EEECCCCCCCEEEEEEECCCCHHHHHHHHCCCEEEEEECCCCCCCEEEECCCCCCCEECCCCCEEEEEECCCCCCCCHHH T ss_conf 11226666624899740267635667864476258984476545247846777522000015167898617862255355 Q ss_pred CCCCCCCCC--CC---CCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCEECC Q ss_conf 134410057--79---9701233268747888888888888878878873437 Q gi|254780258|r 208 KAGRARWLG--MR---SHVRGVAMNPVDHPHGGGEGKTSGGRNPCSPWGKLTK 255 (278) Q Consensus 208 KAG~~Rwlg--~R---P~VRGvAMNpvDHPhGGGeGkt~~gr~~~spwG~~~k 255 (278) |||+..++. +| |+||||||||||||||||.-- -.|.++ |-|--.+- T Consensus 175 Kag~a~~K~~~Krn~wPrvRGVAMnPVeHphGGgnhq-hig~~s-tv~r~~~~ 225 (248) T KOG2309 175 KAGRAYHKYKAKRNCWPRVRGVAMNPVEHPHGGGNHQ-HIGKPS-TVRRDASA 225 (248) T ss_pred HHHHHHHHHHHHCCCCHHHCCEECCCCCCCCCCCCCC-CCCCCC-CCCCCCCC T ss_conf 4216787766631776112122226543787887644-357765-11266765 No 11 >pfam00181 Ribosomal_L2 Ribosomal Proteins L2, RNA binding domain. Probab=99.97 E-value=6.4e-32 Score=230.09 Aligned_cols=77 Identities=58% Similarity=1.003 Sum_probs=75.3 Q ss_pred CCCCCCCEEEEEECCCCCCEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEEEECCCCCEEEEECCCCCCCCCEEEE Q ss_conf 75889848788647875102687988633777604788764158852588984147763467516456610457883 Q gi|254780258|r 42 GRNNTGRVTMRFRGGGHKNRYRLVDFKRGEYGLEGVVKRLEYDPNRTAFIALISYSNGSLAYILAPQRLSVGDKVIS 118 (278) Q Consensus 42 GRNn~GrITvrhRGGGhKr~yR~IDf~r~~~~~~~~V~~IeYDPnRsa~IALv~~~~g~~~YIlAp~gl~vGd~I~s 118 (278) ||||+|+||+||||||||++||+|||+|++.+++++|++|||||||||+||||+|+||+++|||||+||++||.|+| T Consensus 1 GRnn~Grit~r~rGGg~K~~yR~IDf~r~~~~~~~~V~~IeyDPnRsa~IAlv~~~dg~~~YIlAp~gl~vGd~I~s 77 (77) T pfam00181 1 GRNNPGRITVRHRGGGHKRLYRLIDFKRNKGNIKGKVIDIEYDPNRSAPIALVKYEDGEKRYILAPEGLKVGDTIES 77 (77) T ss_pred CCCCCCCEEEEECCCCCCEEEEEEEEEECCCCCCEEEEEEEECCCCCEEEEEEEECCCCEEEEECCCCCCCCCEEEC T ss_conf 98887559587528874478998874786799887999999879966568999957997999985499979999859 No 12 >KOG0438 consensus Probab=96.37 E-value=0.015 Score=36.92 Aligned_cols=108 Identities=19% Similarity=0.289 Sum_probs=76.3 Q ss_pred CCCCCCCCCCCCEEEEEECCCCCC--EEEEEEEEECCCCCCEEEEEEEECC-CCEEEEEEEECC-C-CC-EEEEECCCCC Q ss_conf 543677588984878864787510--2687988633777604788764158-852588984147-7-63-4675164566 Q gi|254780258|r 37 LCSKGGRNNTGRVTMRFRGGGHKN--RYRLVDFKRGEYGLEGVVKRLEYDP-NRTAFIALISYS-N-GS-LAYILAPQRL 110 (278) Q Consensus 37 ~~~~~GRNn~GrITvrhRGGGhKr--~yR~IDf~r~~~~~~~~V~~IeYDP-nRsa~IALv~~~-~-g~-~~YIlAp~gl 110 (278) .-++.+|+..++|++.|++++-++ +.+.+|.++..+. ..+|+.++||+ +|+.-++++... . |. ++|-+- T Consensus 13 ~~~~~~~~~~a~i~~~~~~t~~~r~~l~~~~~l~~~~p~-~~~~~~~~~~~~GRd~tGriv~~h~GGGhKq~yr~i---- 87 (312) T KOG0438 13 TLKSAGRTVSARITVLKPGTPSLRNGLLQQPDLKKSTPS-RPLVESLKINGLGRDETGRIVVRHIGGGHKQRYRMI---- 87 (312) T ss_pred CCCCCCCCCCCEEEECCCCCCCCCCCCCCCCHHHHCCCC-CCEEEEEEECCCCCCCCCCEEEEEECCCEEEEEEEE---- T ss_conf 024577764323210146886656764345416434887-633434785588876646368987248661266676---- Q ss_pred CCCCEEEECCCCCCCCCCCCHHHHHCCCCCEEEEEEEECCCCEEEEEECCCEE Q ss_conf 10457883666767575741104435746377623330368825886346518 Q gi|254780258|r 111 SVGDKVISSDSAVDVKPGNAMPLRFIPVGSIVHNVEMKPGKGGQISRSAGSYA 163 (278) Q Consensus 111 ~vGd~I~s~~~~~~i~~Gn~lpL~~IP~Gt~I~NIE~~pG~Ggkl~RsAGt~a 163 (278) |+. ... ++.-|.. .-.-|..||..|+.-+++|-.|+... T Consensus 88 ---dF~---R~~-p~~~~~~-------~~e~v~~i~yDP~Rs~~iaLv~~~~~ 126 (312) T KOG0438 88 ---DFA---RPR-PIEQGTT-------TEERVIEIEYDPGRSAKIALVAGGTG 126 (312) T ss_pred ---EEC---CCC-CCCCCCC-------CCCEEEEEEECCCCCCCEEEEECCCC T ss_conf ---651---578-7556654-------32137999978986431788832677 No 13 >COG0272 Lig NAD-dependent DNA ligase (contains BRCT domain type II) [DNA replication, recombination, and repair] Probab=84.92 E-value=1.3 Score=24.11 Aligned_cols=103 Identities=22% Similarity=0.335 Sum_probs=71.2 Q ss_pred EEEEEECCCCCCEEEEEEEECCCCEEEEEEEECC-----CCC---------EEEEECCCCCCCCCEEEECCCC--CC--- Q ss_conf 7988633777604788764158852588984147-----763---------4675164566104578836667--67--- Q gi|254780258|r 64 LVDFKRGEYGLEGVVKRLEYDPNRTAFIALISYS-----NGS---------LAYILAPQRLSVGDKVISSDSA--VD--- 124 (278) Q Consensus 64 ~IDf~r~~~~~~~~V~~IeYDPnRsa~IALv~~~-----~g~---------~~YIlAp~gl~vGd~I~s~~~~--~~--- 124 (278) .|-||+...-...++.+|+..=+||.-|.-|... +|. ..|| +-.++.+||.|.....+ +| T Consensus 309 AiAyKFpa~e~~T~l~dI~~qVGRTG~iTPvA~L~PV~laG~~VsrATLHN~d~I-~rkdIrIGDtV~V~kAGdVIP~V~ 387 (667) T COG0272 309 AIAYKFPAEEAVTKLLDIEVQVGRTGAITPVARLEPVELAGVTVSRATLHNVDEI-KRKDIRIGDTVVVRKAGDVIPQVV 387 (667) T ss_pred EEEECCCCHHEEEEEEEEEEECCCCEEEEEEEEEEEEEECCEEEEEEECCCHHHH-HHCCCCCCCEEEEEECCCCCCCEE T ss_conf 2355377322146899999952685364133797769987989997623888889-863878799999973688777323 Q ss_pred -----CCCCCCHHHHHCCCCCEEEEEEEECCCCEEEEEECC---CEEEEEEE Q ss_conf -----575741104435746377623330368825886346---51899675 Q gi|254780258|r 125 -----VKPGNAMPLRFIPVGSIVHNVEMKPGKGGQISRSAG---SYARLVER 168 (278) Q Consensus 125 -----i~~Gn~lpL~~IP~Gt~I~NIE~~pG~Ggkl~RsAG---t~a~Ii~k 168 (278) ..+++..+.. +|.-+.+|+=++....+....|.-+ |.||.+.+ T Consensus 388 ~Vv~e~R~~~~~~~~-~P~~CP~C~s~l~r~~~e~~~rC~n~~~C~aq~~e~ 438 (667) T COG0272 388 GVVLEKRPGNEKPIP-FPTHCPVCGSELVREEGEVVIRCTNGLNCPAQLKER 438 (667) T ss_pred EEECCCCCCCCCCCC-CCCCCCCCCCEEEECCCCEEEECCCCCCCHHHHHHH T ss_conf 554344779998888-999798899976742675568668987885888623 No 14 >cd03699 lepA_II lepA_II: This subfamily represents the domain II of LepA, a GTP-binding protein localized in the cytoplasmic membrane. The N-terminal domain of LepA shares regions of homology to translation factors. In terms of interaction with the ribosome, EF-G, EF-Tu and IF2 have all been demonstrated to interact at overlapping sites on the ribosome. Chemical protection studies demonstrate that they all include the universally conserved alpha-sarcin loop as part of their binding site. These data indicate that LepA may bind to this location on the ribosome as well. LepA has never been observed in archaea, and eukaryl LepA is organellar. LepA is therefore a true bacterial GTPase, found only in the bacterial lineage. Probab=81.62 E-value=3 Score=21.74 Aligned_cols=26 Identities=35% Similarity=0.538 Sum_probs=20.7 Q ss_pred CEEEEEEEECCCCEEEEEEEECCCCCE Q ss_conf 047887641588525889841477634 Q gi|254780258|r 75 EGVVKRLEYDPNRTAFIALISYSNGSL 101 (278) Q Consensus 75 ~~~V~~IeYDPnRsa~IALv~~~~g~~ 101 (278) .|.|-+.-|||.|-. |++|+-.||+. T Consensus 2 rALifDS~yD~y~Gv-v~~vRV~~G~l 27 (86) T cd03699 2 RALIFDSWYDPYRGV-IALVRVFDGTL 27 (86) T ss_pred EEEEEEEEECCCCCE-EEEEEEECCEE T ss_conf 089998672278878-99999979999 No 15 >smart00532 LIGANc Ligase N family. Probab=81.36 E-value=1.6 Score=23.56 Aligned_cols=105 Identities=18% Similarity=0.212 Sum_probs=53.4 Q ss_pred EEEEEECCCCCCEEEEEEEECCCCEEEEEEEEC-----CCCCE-EE-------EECCCCCCCCCEEEECCC--CCCC--- Q ss_conf 798863377760478876415885258898414-----77634-67-------516456610457883666--7675--- Q gi|254780258|r 64 LVDFKRGEYGLEGVVKRLEYDPNRTAFIALISY-----SNGSL-AY-------ILAPQRLSVGDKVISSDS--AVDV--- 125 (278) Q Consensus 64 ~IDf~r~~~~~~~~V~~IeYDPnRsa~IALv~~-----~~g~~-~Y-------IlAp~gl~vGd~I~s~~~--~~~i--- 125 (278) .|-||+.......+|.+|+....||..|.-|.. .+|.. ++ .+.-.++.+||.|..... .+|- T Consensus 304 AiA~Kf~~e~~~T~l~~I~~qVGRtG~itPva~lePV~l~G~~Vs~aTLhN~~~i~~~~i~iGd~V~i~raGdVIPkI~~ 383 (441) T smart00532 304 AIAYKFPAEEAETKLLDIIVQVGRTGKITPVAELEPVFLAGSTVSRATLHNEDEIEEKDIRIGDTVVVRKAGDVIPKVVG 383 (441) T ss_pred EEEECCCCCEEEEEEEEEEEECCCCEEEEEEEEEEEEEECCEEEEEECCCCHHHHHHCCCCCCCEEEEEECCCCCCCEEE T ss_conf 48976886657899999999737862876899997688678699970568989999759999999999977888874314 Q ss_pred -----CCCCCHHHHHCCCCCEEEEEEEECCCC--EEEEEECCCEEEEEEEC Q ss_conf -----757411044357463776233303688--25886346518996750 Q gi|254780258|r 126 -----KPGNAMPLRFIPVGSIVHNVEMKPGKG--GQISRSAGSYARLVERD 169 (278) Q Consensus 126 -----~~Gn~lpL~~IP~Gt~I~NIE~~pG~G--gkl~RsAGt~a~Ii~k~ 169 (278) ..++..+ -.+|.-+..|+-++..-.+ ..+|-...|.||++.+- T Consensus 384 vv~~~r~~~~~~-~~~P~~CP~C~s~l~~~~~~~~~~C~n~~C~aq~~~~i 433 (441) T smart00532 384 VVKEKRPGDERE-IEMPTHCPSCGSELVREEGEVDIRCPNPLCPAQLIERI 433 (441) T ss_pred CCCCCCCCCCCC-CCCCCCCCCCCCEEEECCCCEEEECCCCCCHHHHHHHE T ss_conf 110238998856-55899898997983843898789889998989997011 No 16 >pfam02699 YajC Preprotein translocase subunit. See. Probab=71.41 E-value=6.4 Score=19.54 Aligned_cols=40 Identities=23% Similarity=0.380 Sum_probs=31.6 Q ss_pred EECCCCEEEEEECCCEEEEEEECCCEEEEEECCCCEEEECCC Q ss_conf 303688258863465189967508889999489758998567 Q gi|254780258|r 147 MKPGKGGQISRSAGSYARLVERDRSRALLRLSSGEMRFVHSS 188 (278) Q Consensus 147 ~~pG~Ggkl~RsAGt~a~Ii~k~~~~~~ikLPSGe~r~i~~~ 188 (278) +.||+ +.+-++|-+++|.+-+++.++|.+..|-.-.+... T Consensus 38 L~~Gd--~VvT~gGi~G~V~~i~~~~v~lei~~gv~i~v~r~ 77 (83) T pfam02699 38 LKKGD--EVVTIGGIHGKIVKVDDDTVVLEIADGVKVKVDKS 77 (83) T ss_pred CCCCC--EEEECCCEEEEEEEEECCEEEEEECCCCEEEEEHH T ss_conf 78999--99989971999999979999999889939999969 No 17 >TIGR02303 HpaG-C-term 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, C-terminal subunit; InterPro: IPR012684 This entry represents one of two subunits/domains of the bifunctional isomerase/decarboxylase involved in 4-hydroxyphenylacetate degradation . In Escherichia coli and some other species, this enzyme is encoded by a single polypeptide containing both this domain and the closely related N-terminal domain (IPR012686 from INTERPRO). In other species such as Pasteurella multocida, these domains are found as two separate proteins (usually as tandem genes). Together, these domains carry out the decarboxylation of 5-oxopent-3-ene-1,2,5-tricarboxylic acid (OPET) to 2-hydroxy-2,4-diene-1,7-dioate (HHDD) and the subsequent isomerization to 2-oxohept-3-ene-1,7-dioate (OHED).; GO: 0008704 5-carboxymethyl-2-hydroxymuconate delta-isomerase activity, 0018800 5-oxopent-3-ene-125-tricarboxylate decarboxylase activity. Probab=67.65 E-value=3.1 Score=21.65 Aligned_cols=65 Identities=31% Similarity=0.510 Sum_probs=49.6 Q ss_pred EEEEECCCCEEEEEEEECCCCCEE-----------------EEECCCCCCCCCEEEECC-C-CCCCCCCCCHHHHHCCCC Q ss_conf 876415885258898414776346-----------------751645661045788366-6-767575741104435746 Q gi|254780258|r 79 KRLEYDPNRTAFIALISYSNGSLA-----------------YILAPQRLSVGDKVISSD-S-AVDVKPGNAMPLRFIPVG 139 (278) Q Consensus 79 ~~IeYDPnRsa~IALv~~~~g~~~-----------------YIlAp~gl~vGd~I~s~~-~-~~~i~~Gn~lpL~~IP~G 139 (278) ++| -||+ .++|=-|-|||++ ||-.--.|+.||.|.+|- + ..++++|+++-+.==-+| T Consensus 165 ~dv-eDp~---nL~LrtyVNGEL~Q~G~T~dMiF~va~LiEylS~~mTL~PgD~ilTGTPKG~~dv~pGD~v~~eiEGvG 240 (249) T TIGR02303 165 EDV-EDPM---NLALRTYVNGELTQEGNTSDMIFSVAYLIEYLSEFMTLEPGDVILTGTPKGLSDVKPGDVVRLEIEGVG 240 (249) T ss_pred CCC-CCCC---CCEEEEEEEEEEEECCCCCHHHHCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCCCEEEEEECCCC T ss_conf 236-7855---671336771378646874211204678999863102478564887058887522355878999770026 Q ss_pred CEEEEEEE Q ss_conf 37762333 Q gi|254780258|r 140 SIVHNVEM 147 (278) Q Consensus 140 t~I~NIE~ 147 (278) .+.+-|+. T Consensus 241 ~L~N~i~~ 248 (249) T TIGR02303 241 KLENPIVS 248 (249) T ss_pred CEECCCCC T ss_conf 21031105 No 18 >KOG1004 consensus Probab=67.25 E-value=2.4 Score=22.36 Aligned_cols=124 Identities=16% Similarity=0.179 Sum_probs=68.1 Q ss_pred EEECCCCEEEEEEEECC-CCCEEEEECCCCCCCCCEEEECCCCCCCCCCCCHH---------------HHHCCCCCEEEE Q ss_conf 64158852588984147-76346751645661045788366676757574110---------------443574637762 Q gi|254780258|r 81 LEYDPNRTAFIALISYS-NGSLAYILAPQRLSVGDKVISSDSAVDVKPGNAMP---------------LRFIPVGSIVHN 144 (278) Q Consensus 81 IeYDPnRsa~IALv~~~-~g~~~YIlAp~gl~vGd~I~s~~~~~~i~~Gn~lp---------------L~~IP~Gt~I~N 144 (278) -+.+++++ ..-+.|. .-+++||.+++..-+|.+|-.+.+.--+.+|++-+ --++.+|.+||+ T Consensus 40 ~~~~~gk~--~~~v~~vds~~kRYiP~~~D~VIGiV~~~~gd~ykVDigg~~~a~L~~laFe~AtkrNrPnl~vGdliya 117 (230) T KOG1004 40 RHKEPGKG--GGGVYWVDSQQKRYIPVKGDHVIGIVTSKSGDIYKVDIGGSEPASLSYLAFEGATKRNRPNLQVGDLIYA 117 (230) T ss_pred EECCCCCC--CCEEEEEECCCCEECCCCCCEEEEEEEECCCCEEEEECCCCCEEEEEECCCCCCCCCCCCCCCCCCEEEE T ss_conf 44068866--5306888604504337889989999985046517996389866533100236854357886431657899 Q ss_pred EEEECCC---------CEEEEEECCCEEEEEEECCCEEEEEECCCCEEEECCCCEEEEEEECCCCCCCCCCCCCCCC Q ss_conf 3330368---------8258863465189967508889999489758998567809998512553323001313441 Q gi|254780258|r 145 VEMKPGK---------GGQISRSAGSYARLVERDRSRALLRLSSGEMRFVHSSCMASIGAVSNQDHSNVNHAKAGRA 212 (278) Q Consensus 145 IE~~pG~---------Ggkl~RsAGt~a~Ii~k~~~~~~ikLPSGe~r~i~~~c~AtIG~vsN~~~~~~~~gKAG~~ 212 (278) =-...++ ... -||||- ..-++..++++.-+..|++...-.-++-.|+-....++.+|--|+- T Consensus 118 kv~~a~~~~Epel~Cids~-graaGf-----G~LkdG~if~vs~~~~R~Ll~p~~~iLq~vGk~~~FEia~GlNGri 188 (230) T KOG1004 118 KVVDANKDMEPELTCIDST-GRAAGF-----GVLKDGMIFKVSLGLCRKLLLPDCPILQTVGKKYPFEIAFGLNGRI 188 (230) T ss_pred EEEECCCCCCCCEEEECCC-CCCCCC-----CCCCCCEEEEECHHHHHHHHCCCCCHHHHHHCCCCEEEEEECCCEE T ss_conf 9870677766406887355-754575-----3426866986058886788759971999862145369998527559 No 19 >pfam06434 Aconitase_2_N Aconitate hydratase 2 N-terminus. This family represents the N-terminal region of several bacterial Aconitate hydratase 2 proteins and is found in conjunction with pfam00330. Probab=62.88 E-value=12 Score=17.78 Aligned_cols=83 Identities=22% Similarity=0.316 Sum_probs=55.0 Q ss_pred EEEEEECCC----CCEEEEECCCCCCCCCEEEECCCCCCCCCCCCHHHHHCCCCCEEEEEEEECCCCEEEEEECCCEEEE Q ss_conf 889841477----6346751645661045788366676757574110443574637762333036882588634651899 Q gi|254780258|r 90 FIALISYSN----GSLAYILAPQRLSVGDKVISSDSAVDVKPGNAMPLRFIPVGSIVHNVEMKPGKGGQISRSAGSYARL 165 (278) Q Consensus 90 ~IALv~~~~----g~~~YIlAp~gl~vGd~I~s~~~~~~i~~Gn~lpL~~IP~Gt~I~NIE~~pG~Ggkl~RsAGt~a~I 165 (278) .||-|+|.. |-.-...-...|..||.|......-.+.. -.|..+...++.|..=--.+||.|.++.| T Consensus 100 kIAPIFfnT~edsGalPie~dv~~l~tGdvi~I~p~~g~i~~---------~~g~v~~~f~l~p~~l~DevRAGGri~LI 170 (204) T pfam06434 100 KIAPIFFNTAEDSGALPIEADVSSLNTGDVITIYPYEGKITK---------EDGEVISTFELKPNTLLDEVRAGGRIPLI 170 (204) T ss_pred EECCEEEEECCCCCCEEEEEECCCCCCCCEEEEECCCCEEEC---------CCCCEEEEEECCCCCHHHHHHCCCCEEEE T ss_conf 343268841033687246864031667988999657886976---------99978988762774278887548837999 Q ss_pred EEEC---CCEEEEEECCCC Q ss_conf 6750---888999948975 Q gi|254780258|r 166 VERD---RSRALLRLSSGE 181 (278) Q Consensus 166 i~k~---~~~~~ikLPSGe 181 (278) +.+. ...-.|.|+.-. T Consensus 171 iGr~Lt~kAr~~Lgl~~s~ 189 (204) T pfam06434 171 IGRGLTDKAREALGLEPSD 189 (204) T ss_pred ECHHHHHHHHHHHCCCHHH T ss_conf 7345659999981999176 No 20 >KOG0462 consensus Probab=60.42 E-value=4.5 Score=20.58 Aligned_cols=11 Identities=27% Similarity=0.395 Sum_probs=4.5 Q ss_pred EEEECCCCEEE Q ss_conf 33303688258 Q gi|254780258|r 145 VEMKPGKGGQI 155 (278) Q Consensus 145 IE~~pG~Ggkl 155 (278) +-+.|-+|..+ T Consensus 352 vg~fP~dgsd~ 362 (650) T KOG0462 352 VGLFPLDGSDY 362 (650) T ss_pred ECCCCCCCCHH T ss_conf 53666762055 No 21 >PRK05585 yajC preprotein translocase subunit YajC; Validated Probab=60.14 E-value=13 Score=17.47 Aligned_cols=38 Identities=11% Similarity=0.257 Sum_probs=30.7 Q ss_pred CEEEEEECCCEEEEEEECCCEEEEEECCCCEEEECCCC Q ss_conf 82588634651899675088899994897589985678 Q gi|254780258|r 152 GGQISRSAGSYARLVERDRSRALLRLSSGEMRFVHSSC 189 (278) Q Consensus 152 Ggkl~RsAGt~a~Ii~k~~~~~~ikLPSGe~r~i~~~c 189 (278) |-+.+-++|-+++|..-++++++|++..|-...+.... T Consensus 57 Gd~VvT~gGi~G~I~~v~d~~v~leia~gv~i~~~r~a 94 (107) T PRK05585 57 GDEVVTNGGIIGKVTKVSEDFVIIELNDDTEIKIQKSA 94 (107) T ss_pred CCEEEECCCCEEEEEEEECCEEEEEECCCCEEEEEHHH T ss_conf 99999899858999999799899998899589999589 No 22 >KOG4309 consensus Probab=59.66 E-value=13 Score=17.44 Aligned_cols=39 Identities=23% Similarity=0.437 Sum_probs=26.4 Q ss_pred CCCCCCCEEEEEECCCCCCEEEEEEEEEC------CCCCCEEEEEEEECCC Q ss_conf 75889848788647875102687988633------7776047887641588 Q gi|254780258|r 42 GRNNTGRVTMRFRGGGHKNRYRLVDFKRG------EYGLEGVVKRLEYDPN 86 (278) Q Consensus 42 GRNn~GrITvrhRGGGhKr~yR~IDf~r~------~~~~~~~V~~IeYDPn 86 (278) --.++++|-+|. .+|-.-||--. -....|+++.|||||. T Consensus 105 ~s~ka~KIE~rG------~ry~Y~Df~IkvGtvTmg~tvKGi~vEIEY~pc 149 (217) T KOG4309 105 QSAKASKIETRG------TRYQYCDFLIKVGTVTMGPTVKGISVEIEYGPC 149 (217) T ss_pred EECCCCCEEECC------CEEEECCEEEEECCEEECCEEEEEEEEEEECCE T ss_conf 315554046614------334431159997515632522158999960877 No 23 >PRK07956 ligA NAD-dependent DNA ligase LigA; Validated Probab=59.11 E-value=3.8 Score=21.02 Aligned_cols=105 Identities=17% Similarity=0.238 Sum_probs=60.3 Q ss_pred EEEEEECCCCCCEEEEEEEECCCCEEEEEEEECC-----CCCE-E-------EEECCCCCCCCCEEEECCC--CCC---- Q ss_conf 7988633777604788764158852588984147-----7634-6-------7516456610457883666--767---- Q gi|254780258|r 64 LVDFKRGEYGLEGVVKRLEYDPNRTAFIALISYS-----NGSL-A-------YILAPQRLSVGDKVISSDS--AVD---- 124 (278) Q Consensus 64 ~IDf~r~~~~~~~~V~~IeYDPnRsa~IALv~~~-----~g~~-~-------YIlAp~gl~vGd~I~s~~~--~~~---- 124 (278) .|-||+.......+|.+|++...||..|.-|... +|.. + =.+.-.++.+||.|..... -+| T Consensus 309 AiA~Kf~~e~~~T~l~~I~~qVGRTG~iTPvA~l~PV~l~G~~VsrATLHN~~~I~~~~i~iGd~V~v~raGdVIP~I~~ 388 (668) T PRK07956 309 AIAYKFPAEEATTKLLDIEVQVGRTGAVTPVARLEPVEVAGVTVSRATLHNADEIERKDIRIGDTVVVRRAGDVIPEVVG 388 (668) T ss_pred EEEECCCCCEEEEEEEEEEEEECCCCEEEEEEEEEEEEECCEEEEECCCCCHHHHHHCCCCCCCEEEEEECCCCCCCEEE T ss_conf 47874773679999777799734763562699998787156067523647889999707899998999988888720046 Q ss_pred ----CCCCCCHHHHHCCCCCEEEEEEEECCCCEE--EEEEC-CCEEEEEEEC Q ss_conf ----575741104435746377623330368825--88634-6518996750 Q gi|254780258|r 125 ----VKPGNAMPLRFIPVGSIVHNVEMKPGKGGQ--ISRSA-GSYARLVERD 169 (278) Q Consensus 125 ----i~~Gn~lpL~~IP~Gt~I~NIE~~pG~Ggk--l~RsA-Gt~a~Ii~k~ 169 (278) ..+++..| -.+|.-+.+|+-++.--.+.. +|-.. .|.||++.+- T Consensus 389 v~~~~r~~~~~~-~~~P~~CP~C~s~l~~~~~~~~~~C~N~~~Cpaq~~~~i 439 (668) T PRK07956 389 VVLEKRPGDERE-IVMPTHCPVCGSELVREEGEAVLRCTNGLSCPAQLKERL 439 (668) T ss_pred ECCCCCCCCCCC-CCCCCCCCCCCCEEEEECCCEEEECCCCCCCHHHHHHHH T ss_conf 413018998875-436778767798579936963787789989989999988 No 24 >TIGR02176 pyruv_ox_red pyruvate:ferredoxin (flavodoxin) oxidoreductase; InterPro: IPR011895 The oxidative decarboxylation of pyruvate to acetyl-CoA, a central step in energy metabolism, can occur by two different mechanisms . In mitochondria and aerobic bacteria this reaction is catalysed by the multienzyme complex pyruvate dehydrogenase using NAD as electron acceptor. In anaerobic organisms, however, this reaction is reversibly catalysed by a single enzyme using either ferrodoxin or flavodoxin as the electron acceptor. Pyruvate:ferrodoxin/flavodoxin reductases (PFORs) in this entry occur in both obligately and facultatively anaerobic bacteria and also some eukaryotic microorganisms. These proteins are single-chain enzymes containing a thiamin pyrophosphate cofactor for the cleavage of carbon-carbon bonds next to a carbonyl group, and iron-sulphur clusters for electron transfer. The Desulfovibrio africanus enzyme is currently the only PFOR whose three dimensional structure is known , . It is a homodimer where each subunit contains one thiamin pyrophosphate cofactor and two ferrodoxin-like 4Fe-S clusters and an atypical 4Fe-S cluster. Each monomer is composed of seven domains - domains I, II and VI make intersubunit contacts, while domains III, IV and V are located at the suface of the dimer, and domain VII forms a long arm extending over the other subunit. The cofactor is bound at the interface of domains I and VI and is proximal to the atypical 4Fe-S bound by domain VI, while the ferrodoxin-like 4Fe-S clusters are bound by domain V. Comparison of this enzyme with the multi-chain PFORs shows a correspondance between the domains in this enzyme and the subunits of the multi-chain enzymes.; GO: 0005506 iron ion binding, 0016903 oxidoreductase activity acting on the aldehyde or oxo group of donors, 0006118 electron transport. Probab=51.04 E-value=4.8 Score=20.35 Aligned_cols=21 Identities=52% Similarity=0.588 Sum_probs=9.3 Q ss_pred CCCCCCCCCC-CCCCCEEEECC Q ss_conf 1344100577-99701233268 Q gi|254780258|r 208 KAGRARWLGM-RSHVRGVAMNP 228 (278) Q Consensus 208 KAG~~Rwlg~-RP~VRGvAMNp 228 (278) +|=|.|-|+= .|.|||.|+|| T Consensus 197 ~~fR~r~lnPe~P~vRGTaQNP 218 (1194) T TIGR02176 197 KAFRKRSLNPEHPEVRGTAQNP 218 (1194) T ss_pred HHHHHHCCCCCCCEEECCCCCC T ss_conf 9997416688877540346587 No 25 >pfam09907 DUF2136 Uncharacterized protein conserved in bacteria (DUF2136). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=47.10 E-value=13 Score=17.46 Aligned_cols=39 Identities=23% Similarity=0.322 Sum_probs=31.5 Q ss_pred CCEEEEEECCCCCCEEEEEEEEECCCCCCEEEEEEEECC Q ss_conf 848788647875102687988633777604788764158 Q gi|254780258|r 47 GRVTMRFRGGGHKNRYRLVDFKRGEYGLEGVVKRLEYDP 85 (278) Q Consensus 47 GrITvrhRGGGhKr~yR~IDf~r~~~~~~~~V~~IeYDP 85 (278) +.-+|+..||+.-|+.-.|||.++..-+.-+...-|||- T Consensus 36 ~~rvVFnI~GN~yRLI~~i~y~~~~v~Ir~igTHaEYDk 74 (76) T pfam09907 36 DNRVVFNIGGNKYRLIAAIDFRRGKVYIKFIGTHAEYDK 74 (76) T ss_pred CCEEEEECCCCEEEEEEEEEECCCEEEEEEEECHHHHCC T ss_conf 877999868984999999990797899999374688444 No 26 >cd04088 EFG_mtEFG_II EFG_mtEFG_II: this subfamily represents the domain II of elongation factor G (EF-G) in bacteria and, the C-terminus of mitochondrial Elongation factor G1 (mtEFG1) and G2 (mtEFG2)_like proteins found in eukaryotes. During the process of peptide synthesis and tRNA site changes, the ribosome is moved along the mRNA a distance equal to one codon with the addition of each amino acid. In bacteria this translocation step is catalyzed by EF-G_GTP, which is hydrolyzed to provide the required energy. Thus, this action releases the uncharged tRNA from the P site and transfers the newly formed peptidyl-tRNA from the A site to the P site. Eukaryotic cells harbor 2 protein synthesis systems: one localized in the cytoplasm, the other in the mitochondria. Most factors regulating mitochondrial protein synthesis are encoded by nuclear genes, translated in the cytoplasm, and then transported to the mitochondria. The eukaryotic system of elongation factor (EF) components is more compl Probab=44.08 E-value=24 Score=15.81 Aligned_cols=26 Identities=19% Similarity=0.261 Sum_probs=18.5 Q ss_pred CEEEEEEEECCCCEEEEEEEECCCCCE Q ss_conf 047887641588525889841477634 Q gi|254780258|r 75 EGVVKRLEYDPNRTAFIALISYSNGSL 101 (278) Q Consensus 75 ~~~V~~IeYDPnRsa~IALv~~~~g~~ 101 (278) .|.|-.++|||.+ ..||+++-.+|.. T Consensus 2 ~a~VFK~~~d~~~-Gri~~~RV~~G~l 27 (83) T cd04088 2 VALVFKTIHDPFV-GKLSFVRVYSGTL 27 (83) T ss_pred EEEEEEEEECCCC-CEEEEEEEECCEE T ss_conf 0999976655988-8699999958998 No 27 >TIGR02092 glgD glucose-1-phosphate adenylyltransferase, GlgD subunit; InterPro: IPR011832 This family is GlgD, an apparent regulatory protein that appears in an alpha2/beta2 heterotetramer with GlgC (glucose-1-phosphate adenylyltransferase, IPR011831 from INTERPRO) in a subset of bacteria that use GlgC for glycogen biosynthesis.; GO: 0005978 glycogen biosynthetic process. Probab=43.21 E-value=11 Score=17.98 Aligned_cols=37 Identities=32% Similarity=0.593 Sum_probs=23.2 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCEEEEEEEEECCCCCCEEEEEE Q ss_conf 5764168872221115543677588984878864787510268798863377760478876 Q gi|254780258|r 21 RNSLHSGKPIKSLTRGLCSKGGRNNTGRVTMRFRGGGHKNRYRLVDFKRGEYGLEGVVKRL 81 (278) Q Consensus 21 ~~~l~k~~p~K~L~~~~~~~~GRNn~GrITvrhRGGGhKr~yR~IDf~r~~~~~~~~V~~I 81 (278) -.+||+-+|+-+|= =+|| ||+|||--...- .|=|.+| T Consensus 16 l~~LT~~Rp~A~~P-----fgGr------------------YRlIDF~LSn~~-NAgI~~v 52 (383) T TIGR02092 16 LSPLTKVRPVASLP-----FGGR------------------YRLIDFPLSNMV-NAGIRNV 52 (383) T ss_pred CCCHHHHCCCEEEC-----CCCE------------------EEEEECCHHCCC-CCCEEEE T ss_conf 13145508703323-----5864------------------778830100231-0360022 No 28 >PRK05433 GTP-binding protein LepA; Provisional Probab=41.80 E-value=25 Score=15.72 Aligned_cols=84 Identities=24% Similarity=0.336 Sum_probs=34.8 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEE-EEEECCCCCCEEEEEEEEECCCCCCEEEEEE-EECCCCEEEEEEEECCCCCEEEEE Q ss_conf 87222111554367758898487-8864787510268798863377760478876-415885258898414776346751 Q gi|254780258|r 28 KPIKSLTRGLCSKGGRNNTGRVT-MRFRGGGHKNRYRLVDFKRGEYGLEGVVKRL-EYDPNRTAFIALISYSNGSLAYIL 105 (278) Q Consensus 28 ~p~K~L~~~~~~~~GRNn~GrIT-vrhRGGGhKr~yR~IDf~r~~~~~~~~V~~I-eYDPnRsa~IALv~~~~g~~~YIl 105 (278) .|++.|.+--. =-+-.|.|+ +|=-.|--|.--.+.-+...+ ...|.+| .+.|.+..-=. ..-|+--||+ T Consensus 192 ~PL~ALIFDS~---yD~YrGvV~~vRV~~G~lk~Gd~I~~~~t~~---~~~v~evGi~~p~~~~~~~---L~aGeVGyii 262 (601) T PRK05433 192 APLKALIFDSW---YDPYRGVVVLVRVVDGTLKKGDKIKMMSTGK---EYEVDEVGVFTPKMVPVDE---LSAGEVGYII 262 (601) T ss_pred CCHHHHHHHCC---CCCCCCEEEEEEEECCEECCCCEEEEECCCC---EEEEEEEECCCCCCCCCCE---ECCCCEEEEE T ss_conf 34312012303---0467880799994088772585256412697---1672024256898527450---1378447998 Q ss_pred C----CCCCCCCCEEEECC Q ss_conf 6----45661045788366 Q gi|254780258|r 106 A----PQRLSVGDKVISSD 120 (278) Q Consensus 106 A----p~gl~vGd~I~s~~ 120 (278) | +....+||++.... T Consensus 263 agiK~~~d~~vGDTit~~~ 281 (601) T PRK05433 263 AGIKDVRDARVGDTITLAK 281 (601) T ss_pred ECCCCHHHCCCCCEEECCC T ss_conf 2454443233476055477 No 29 >pfam04189 Gcd10p Gcd10p family. eIF-3 is a multi-subunit complex that stimulates translation initiation in vitro at several different steps. This family corresponds to the gamma subunit if eIF3. The Yeast protein Gcd10p has also been shown to be part of a complex with the methyltransferase Gcd14p that is involved in modifying tRNA. Probab=40.10 E-value=27 Score=15.42 Aligned_cols=17 Identities=12% Similarity=0.030 Sum_probs=7.7 Q ss_pred EECCCCEEEEEEECCCC Q ss_conf 98567809998512553 Q gi|254780258|r 184 FVHSSCMASIGAVSNQD 200 (278) Q Consensus 184 ~i~~~c~AtIG~vsN~~ 200 (278) .+..++.|.|...+|.. T Consensus 185 ~lR~d~l~~il~~anV~ 201 (300) T pfam04189 185 DLRFDTLGLILTLANVR 201 (300) T ss_pred CCCHHHHHHHHHHCCCC T ss_conf 37899999999741779 No 30 >smart00536 AXH domain in Ataxins and HMG containing proteins. unknown function Probab=37.91 E-value=24 Score=15.77 Aligned_cols=28 Identities=21% Similarity=0.285 Sum_probs=17.4 Q ss_pred EEECCCCCEEEEECCCCCCCCCEEEECCCCC Q ss_conf 8414776346751645661045788366676 Q gi|254780258|r 93 LISYSNGSLAYILAPQRLSVGDKVISSDSAV 123 (278) Q Consensus 93 Lv~~~~g~~~YIlAp~gl~vGd~I~s~~~~~ 123 (278) +|...+|+...| |.++++|.|.+..... T Consensus 7 ~I~la~G~~K~v---EDi~teDfv~s~~~~~ 34 (116) T smart00536 7 RLCLANGSNKKV---EDLKTEDFIRSAECSN 34 (116) T ss_pred EEEECCCCCCEE---EEEECCCEEECCCCCC T ss_conf 999747983077---8700375674244487 No 31 >TIGR02252 DREG-2 REG-2-like, HAD superfamily (subfamily IA) hydrolase; InterPro: IPR011949 This family of proteins includes uncharacterised sequences from eukaryotes, cyanobacteria and Leptospira as well as the DREG-2 protein from Drosophila melanogaster which has been identified as a rhythmically (diurnally) regulated gene . This family is a member of the Haloacid Dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases. The superfamily is defined by the presence of three short catalytic motifs . The subfamilies are defined based on the location and the observed or predicted fold of a so-called, capping domain, , or the absence of such a domain. This family is a member of subfamily 1A in which the cap domain consists of a predicted alpha helical bundle found in between the first and second catalytic motifs. A distinctive feature of this family is a conserved tandem pair of tryptophan residues in the cap domain. The most divergent sequences included within the scope of this entry are from plants and have "FW" at this position instead. Most likely, these sequences, like the vast majority of HAD sequences, represent phosphatase enzymes.. Probab=36.83 E-value=11 Score=17.96 Aligned_cols=19 Identities=26% Similarity=0.288 Sum_probs=14.1 Q ss_pred EEEEEECCCCCCCCCCCCC Q ss_conf 9998512553323001313 Q gi|254780258|r 191 ASIGAVSNQDHSNVNHAKA 209 (278) Q Consensus 191 AtIG~vsN~~~~~~~~gKA 209 (278) ..+|+|||.|+.+..+..| T Consensus 136 ~~lgviSNFD~RL~~~L~~ 154 (224) T TIGR02252 136 VKLGVISNFDSRLRGVLEA 154 (224) T ss_pred EEEEEEECCHHHHHHHHHH T ss_conf 5899874651337899976 No 32 >PRK09238 bifunctional aconitate hydratase 2/2-methylisocitrate dehydratase; Validated Probab=35.69 E-value=32 Score=14.97 Aligned_cols=84 Identities=20% Similarity=0.304 Sum_probs=58.0 Q ss_pred EEEEEECCC----CCEEEEECCCCCCCCCEEEECCCCCCCCCCCCHHHHHCCCCCEEEEEEEECCCCEEEEEECCCEEEE Q ss_conf 889841477----6346751645661045788366676757574110443574637762333036882588634651899 Q gi|254780258|r 90 FIALISYSN----GSLAYILAPQRLSVGDKVISSDSAVDVKPGNAMPLRFIPVGSIVHNVEMKPGKGGQISRSAGSYARL 165 (278) Q Consensus 90 ~IALv~~~~----g~~~YIlAp~gl~vGd~I~s~~~~~~i~~Gn~lpL~~IP~Gt~I~NIE~~pG~Ggkl~RsAGt~a~I 165 (278) .||-|+|.. |-.-...-...|..||+|......-.|.. -.|..|...++.|..---.+||.|.++.| T Consensus 267 kIAPIFfnT~eDsGaLPIe~dV~~l~tGdvI~I~p~~gki~~---------~~ge~i~~f~l~~~~l~DevrAGGri~lI 337 (841) T PRK09238 267 KIAPIFFNTMEDSGALPIEADVSKLEMGDVIDIYPYEGKIRK---------ETGEVIATFELKTDVLLDEVRAGGRIPLI 337 (841) T ss_pred EECCEEEEECCCCCCEEEEEECCCCCCCCEEEEECCCCEEEC---------CCCCEEEEECCCCCCHHHHHHCCCCEEEE T ss_conf 123357730134687025752133567867999547873866---------88868877405881078898548846899 Q ss_pred EEEC---CCEEEEEECCCCE Q ss_conf 6750---8889999489758 Q gi|254780258|r 166 VERD---RSRALLRLSSGEM 182 (278) Q Consensus 166 i~k~---~~~~~ikLPSGe~ 182 (278) +.+. ...-.|-|+.-.+ T Consensus 338 iGr~Lt~kar~~Lgl~~s~~ 357 (841) T PRK09238 338 IGRGLTTKAREALGLPPSDV 357 (841) T ss_pred ECHHHHHHHHHHCCCCHHHH T ss_conf 63455599999809991763 No 33 >PRK13922 rod shape-determining protein MreC; Provisional Probab=34.84 E-value=33 Score=14.88 Aligned_cols=36 Identities=28% Similarity=0.447 Sum_probs=24.9 Q ss_pred CEEEEECCCCCCCCCEEEECCCCCCCCCCCCHHHHHCCCCCEE Q ss_conf 3467516456610457883666767575741104435746377 Q gi|254780258|r 100 SLAYILAPQRLSVGDKVISSDSAVDVKPGNAMPLRFIPVGSIV 142 (278) Q Consensus 100 ~~~YIlAp~gl~vGd~I~s~~~~~~i~~Gn~lpL~~IP~Gt~I 142 (278) ...||.....+++||.|.+++.. .+ ....||+|++. T Consensus 198 ~~~~i~~~~~i~~GD~vvTSG~~-g~------fP~Gi~VG~V~ 233 (264) T PRK13922 198 ELEFLPRSADIKVGDLVVTSGLG-GV------FPAGLPVGRVT 233 (264) T ss_pred EEECCCCCCCCCCCCEEEECCCC-CC------CCCCCEEEEEE T ss_conf 67227544444689999997999-83------89998899999 No 34 >KOG3283 consensus Probab=34.38 E-value=33 Score=14.83 Aligned_cols=108 Identities=19% Similarity=0.375 Sum_probs=56.9 Q ss_pred CCEE-EEEECCCCCCEEEEEEEEECCCCC-----CEEEEEEEECCCCEEEE--------EEEECCCCC-EEEEECCCCCC Q ss_conf 8487-886478751026879886337776-----04788764158852588--------984147763-46751645661 Q gi|254780258|r 47 GRVT-MRFRGGGHKNRYRLVDFKRGEYGL-----EGVVKRLEYDPNRTAFI--------ALISYSNGS-LAYILAPQRLS 111 (278) Q Consensus 47 GrIT-vrhRGGGhKr~yR~IDf~r~~~~~-----~~~V~~IeYDPnRsa~I--------ALv~~~~g~-~~YIlAp~gl~ 111 (278) -+|| ||-|||..|-+---+|.-...++. ...|.++-|+|--+-.+ |+|..+--. ..|..+-.++- T Consensus 41 ~~~~~VRvRGGN~KyrALRlD~Gnfsw~Se~~trktri~dvvY~~snneLVRT~TL~k~aiV~ida~Pfrqwyeshy~~~ 120 (200) T KOG3283 41 KRITPVRVRGGNKKYRALRLDMGNFSWGSEGTTRKTRILDVVYHPSNNELVRTNTLTKSAIVQIDAAPFRQWYESHYGLP 120 (200) T ss_pred CEEEEEEECCCCHHHHEEEECCCCCCCCCCCEEEEEEEEEEEECCCCCCEEEECCHHHCEEEEECCCHHHHHHHHHHCCC T ss_conf 13567786177520210254157630022330000045666754886602242132111178852516788888740430 Q ss_pred CCCE---EEECCCCCCC----------CCCCCHHHHHCCCCCEEEEEEEECCCCEE Q ss_conf 0457---8836667675----------75741104435746377623330368825 Q gi|254780258|r 112 VGDK---VISSDSAVDV----------KPGNAMPLRFIPVGSIVHNVEMKPGKGGQ 154 (278) Q Consensus 112 vGd~---I~s~~~~~~i----------~~Gn~lpL~~IP~Gt~I~NIE~~pG~Ggk 154 (278) .|.. +..-...-.+ .-=+.+.-..+..|-+.-||...||+-|+ T Consensus 121 lg~kk~~~~~~k~sk~~~rk~~~r~~~a~vds~l~eqF~~GrL~A~isSrpGQ~Gr 176 (200) T KOG3283 121 LGRKKNKILNKKKSKHVQRKYAERQKNAKVDSSLEEQFAAGRLYACISSRPGQCGR 176 (200) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCEEEEEECCCCCCCC T ss_conf 02245755562136889998877640364027799887538079998178875554 No 35 >PRK06763 F0F1 ATP synthase subunit alpha; Validated Probab=33.75 E-value=34 Score=14.77 Aligned_cols=120 Identities=18% Similarity=0.164 Sum_probs=67.2 Q ss_pred CCCEEEEEEEECCCCEEEEEEEECCCCCEEEEECCCCCCCCCEEEECCCCCCCCCCCCHHHHHCCCCCEEEEEEEECCCC Q ss_conf 76047887641588525889841477634675164566104578836667675757411044357463776233303688 Q gi|254780258|r 73 GLEGVVKRLEYDPNRTAFIALISYSNGSLAYILAPQRLSVGDKVISSDSAVDVKPGNAMPLRFIPVGSIVHNVEMKPGKG 152 (278) Q Consensus 73 ~~~~~V~~IeYDPnRsa~IALv~~~~g~~~YIlAp~gl~vGd~I~s~~~~~~i~~Gn~lpL~~IP~Gt~I~NIE~~pG~G 152 (278) -++|.|+.++-+ +-+|--=.|.+-..-|+=-+.+.++||.|..... ..+++-+-..-..||...-.- T Consensus 41 tieGrVvEv~~~---~i~iesK~y~~~v~v~~~~~~nvKvGD~VKatg~----------m~rnf~~~M~A~sVEki~~~~ 107 (213) T PRK06763 41 TIEGRVVEVDNG---VIVIKSKQYEEPVSVYIDSLSNVKVGDEVKATGS----------MMRNFTEYMVATAVENTTNKL 107 (213) T ss_pred HCEEEEEEEECC---EEEEEECCCCCCEEEEECCCCCCCCCCEEEECHH----------HHHHHHHHHHHHHHHHCCCCC T ss_conf 001569998688---8999703568851699647888653748986178----------898779987553341005785 Q ss_pred EEEEEECCCEEEEEEEC-------------CCEEEEEECCCCEEEECCCCEEEEEEECCCCCCCCC Q ss_conf 25886346518996750-------------888999948975899856780999851255332300 Q gi|254780258|r 153 GQISRSAGSYARLVERD-------------RSRALLRLSSGEMRFVHSSCMASIGAVSNQDHSNVN 205 (278) Q Consensus 153 gkl~RsAGt~a~Ii~k~-------------~~~~~ikLPSGe~r~i~~~c~AtIG~vsN~~~~~~~ 205 (278) |-...--|++.-++... .+||+|+-|+-+=.......+-|-|+.=|.+-+-+. T Consensus 108 G~~~~~~g~p~yviG~I~kv~~~~~~~~~~~~yVvV~YP~~~Gk~~~V~V~LT~GQkFnvGDkVKV 173 (213) T PRK06763 108 GMHMKEDGSPDYVIGEVSKVYTMEDDGDGATKYVVVEYPSLNGKKLIIDVFLTKGQVFHVGDKVKV 173 (213) T ss_pred CEEECCCCCCCEEEEEEEEECCCCCCCCCCEEEEEEECCCCCCCEEEEEEEECCCCEEECCCEEEE T ss_conf 324211589765889999960432378984039999868977984689999736978705988999 No 36 >PRK05886 yajC preprotein translocase subunit YajC; Validated Probab=33.28 E-value=35 Score=14.72 Aligned_cols=37 Identities=16% Similarity=0.221 Sum_probs=31.9 Q ss_pred CEEEEEECCCEEEEEEECCCEEEEEECCCCEEEECCC Q ss_conf 8258863465189967508889999489758998567 Q gi|254780258|r 152 GGQISRSAGSYARLVERDRSRALLRLSSGEMRFVHSS 188 (278) Q Consensus 152 Ggkl~RsAGt~a~Ii~k~~~~~~ikLPSGe~r~i~~~ 188 (278) |-+.+-++|-|++|..-+++.+.|.+..|-.-.+... T Consensus 42 Gd~VvT~~Gl~GtV~~v~dd~v~lEiApGv~v~~~k~ 78 (108) T PRK05886 42 GDRVHTTSGLQATIVGITDDTVDLEIAPGVVTTWMKL 78 (108) T ss_pred CCEEEECCCCEEEEEEECCCEEEEEECCCCEEEEEHH T ss_conf 9989978982899999718879999769958999837 No 37 >pfam11946 DUF3463 Domain of unknown function (DUF3463). This presumed domain is functionally uncharacterized. This domain is found in bacteria and archaea. This domain is about 140 amino acids in length. This domain is found associated with pfam04055. This domain has two conserved sequence motifs: CTPWG and PCYL. Probab=32.80 E-value=20 Score=16.36 Aligned_cols=13 Identities=38% Similarity=1.163 Sum_probs=9.6 Q ss_pred CC-CCCCCCCEECC Q ss_conf 88-78878873437 Q gi|254780258|r 243 GR-NPCSPWGKLTK 255 (278) Q Consensus 243 gr-~~~spwG~~~k 255 (278) .+ -.|||||-||- T Consensus 55 ~~~y~CTPWg~Pt~ 68 (141) T pfam11946 55 NQTYQCTPWGNPTR 68 (141) T ss_pred CCCCCCCCCCCCCC T ss_conf 87556566778664 No 38 >KOG3129 consensus Probab=32.77 E-value=26 Score=15.55 Aligned_cols=30 Identities=33% Similarity=0.577 Sum_probs=23.5 Q ss_pred CCCCCCCCCEEEECCCCCCCCCCCCHHHHHCCC Q ss_conf 645661045788366676757574110443574 Q gi|254780258|r 106 APQRLSVGDKVISSDSAVDVKPGNAMPLRFIPV 138 (278) Q Consensus 106 Ap~gl~vGd~I~s~~~~~~i~~Gn~lpL~~IP~ 138 (278) +-.|+.+||.|+...+ +.-||.++|.+|-. T Consensus 153 ~~aGl~~gD~il~fGn---V~sgn~~~lq~i~~ 182 (231) T KOG3129 153 DEAGLCVGDEILKFGN---VHSGNFLPLQNIAA 182 (231) T ss_pred HHHCCCCCCEEEEECC---CCCCCCHHHHHHHH T ss_conf 5407543765788533---24655225889899 No 39 >TIGR02091 glgC glucose-1-phosphate adenylyltransferase; InterPro: IPR011831 This enzyme, glucose-1-phosphate adenylyltransferase, is also called ADP-glucose pyrophosphorylase. The plant form is an alpha2, beta2 heterodimer, allosterically regulated in plants. Both subunits are homologous and included in this entry. In bacteria, both homomeric forms of GlgC and more active heterodimers of GlgC and GlgD have been described. This entry describes the GlgC subunit only. This enzyme appears in variants of glycogen synthesis pathways that use ADP-glucose, rather than UDP-glucose as in animals.; GO: 0008878 glucose-1-phosphate adenylyltransferase activity. Probab=30.93 E-value=19 Score=16.49 Aligned_cols=10 Identities=40% Similarity=0.959 Sum_probs=7.0 Q ss_pred EEEEEEEECC Q ss_conf 6879886337 Q gi|254780258|r 62 YRLVDFKRGE 71 (278) Q Consensus 62 yR~IDf~r~~ 71 (278) ||+|||-=.. T Consensus 32 YRiIDF~LSN 41 (421) T TIGR02091 32 YRIIDFALSN 41 (421) T ss_pred EEEECCCHHH T ss_conf 1452100135 No 40 >cd05790 S1_Rrp40 S1_Rrp40: Rrp40 S1-like RNA-binding domain. S1-like RNA-binding domains are found in a wide variety of RNA-associated proteins. Rrp4 protein is a subunit of the exosome complex. The exosome plays a central role in 3' to 5' RNA processing and degradation in eukarytes and archaea. Its functions include the removal of incorrectly processed RNA and the maintenance of proper levels of mRNA, rRNA and a number of small RNA species. In Saccharomyces cerevisiae, the exosome includes nine core components, six of which are homologous to bacterial RNase PH. These form a hexameric ring structure. The other three subunits (RrP4, Rrp40, and Csl4) contain an S1 RNA binding domain and are part of the "S1 pore structure". Probab=30.88 E-value=30 Score=15.15 Aligned_cols=12 Identities=25% Similarity=0.337 Sum_probs=4.8 Q ss_pred EEECCCCCCCCC Q ss_conf 751645661045 Q gi|254780258|r 103 YILAPQRLSVGD 114 (278) Q Consensus 103 YIlAp~gl~vGd 114 (278) ||.++..+-+|. T Consensus 2 YiP~~gD~VIGi 13 (86) T cd05790 2 YVPAKGDHVIGI 13 (86) T ss_pred CCCCCCCEEEEE T ss_conf 628899989999 No 41 >pfam09038 53-BP1_Tudor Tumour suppressor p53-binding protein-1 Tudor. Members of this family consist of ten beta-strands and a carboxy-terminal alpha-helix. The amino-terminal five beta-strands and the C-terminal five beta-strands adopt folds that are identical to each other. This domain is essential for the recruitment of proteins to double stranded breaks in DNA, which is mediated by interaction with methylated Lys 79 of histone H3. Probab=30.30 E-value=39 Score=14.40 Aligned_cols=89 Identities=15% Similarity=0.138 Sum_probs=52.2 Q ss_pred CCCCCCEEEEEEEECCCCEEEEEEEECCCCCEEE-----EECCCCCCCCCEEEECCCCCCCCCCCCHHHHHCCCCCEEEE Q ss_conf 3777604788764158852588984147763467-----51645661045788366676757574110443574637762 Q gi|254780258|r 70 GEYGLEGVVKRLEYDPNRTAFIALISYSNGSLAY-----ILAPQRLSVGDKVISSDSAVDVKPGNAMPLRFIPVGSIVHN 144 (278) Q Consensus 70 ~~~~~~~~V~~IeYDPnRsa~IALv~~~~g~~~Y-----IlAp~gl~vGd~I~s~~~~~~i~~Gn~lpL~~IP~Gt~I~N 144 (278) +.+-.||.|.. | +...--+|.|+||.++- ||+.+.+-+|+.|.+..+-.-..+|-+.- .+---+..-|. T Consensus 14 d~~~YpG~I~k---d--~~n~r~~i~FDDg~e~~v~~k~ii~~d~l~~~~~VlA~~Ed~yf~~Gvi~G-h~~~~~e~~Y~ 87 (118) T pfam09038 14 NGYFYSGKITK---D--AGEGKYKLLFDDGYECDVLGKDILLCDPIPLDTEVTALSEDEYFSAGVVKG-HKKESQELYYS 87 (118) T ss_pred CCCCCCCEEEE---C--CCCCEEEEEECCCCEEEEECCCEEEECCCCCCCEEEEEECCCCCCCCEEEE-CCCCCCEEEEE T ss_conf 78471646864---2--678748999658974565047179974424773445430224335446981-13788627899 Q ss_pred EEEECCCCEEEEEECCCEEEEEEEC Q ss_conf 3330368825886346518996750 Q gi|254780258|r 145 VEMKPGKGGQISRSAGSYARLVERD 169 (278) Q Consensus 145 IE~~pG~Ggkl~RsAGt~a~Ii~k~ 169 (278) || .-|+.-.+-|+ +.|++.| T Consensus 88 VE-~DG~t~~y~R~----~VILs~e 107 (118) T pfam09038 88 IE-KDGQRKWYKRM----AVILSLE 107 (118) T ss_pred EE-ECCCEEEECCC----CEEECHH T ss_conf 99-56965684352----1883699 No 42 >KOG2040 consensus Probab=26.11 E-value=39 Score=14.36 Aligned_cols=19 Identities=32% Similarity=0.719 Sum_probs=9.2 Q ss_pred EEEEEEECCCCEEEEEECCCEEEE Q ss_conf 762333036882588634651899 Q gi|254780258|r 142 VHNVEMKPGKGGQISRSAGSYARL 165 (278) Q Consensus 142 I~NIE~~pG~Ggkl~RsAGt~a~I 165 (278) .-++.++|+.|+ +|-||-+ T Consensus 597 ~D~~s~QPNsGA-----~GEYaGL 615 (1001) T KOG2040 597 FDSFSLQPNSGA-----QGEYAGL 615 (1001) T ss_pred CCCEEECCCCCC-----CCCHHHH T ss_conf 661342689875-----5311229 No 43 >pfam02736 Myosin_N Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown. Probab=25.73 E-value=47 Score=13.88 Aligned_cols=27 Identities=11% Similarity=0.249 Sum_probs=22.3 Q ss_pred EEEEEEECCCEEEEEECCCCEEEECCC Q ss_conf 189967508889999489758998567 Q gi|254780258|r 162 YARLVERDRSRALLRLSSGEMRFVHSS 188 (278) Q Consensus 162 ~a~Ii~k~~~~~~ikLPSGe~r~i~~~ 188 (278) -|.|.+.+++.++|++..|+.+.+..+ T Consensus 15 ~~~I~~~~Gd~vtV~t~~Gk~vtVk~d 41 (42) T pfam02736 15 KGEIKSREGDKVTVKTEDGKTVTVKKD 41 (42) T ss_pred EEEEEEEECCEEEEEECCCCEEEECCC T ss_conf 999999209999999799989986289 No 44 >PRK13885 conjugal transfer protein TrbG; Provisional Probab=24.94 E-value=46 Score=13.94 Aligned_cols=12 Identities=17% Similarity=-0.011 Sum_probs=6.3 Q ss_pred ECCCCEEEEEEE Q ss_conf 158852588984 Q gi|254780258|r 83 YDPNRTAFIALI 94 (278) Q Consensus 83 YDPnRsa~IALv 94 (278) --|.+-|-|||- T Consensus 85 CAPlqv~DIaLq 96 (299) T PRK13885 85 CAVLQVCDVALQ 96 (299) T ss_pred ECCCEEEEEECC T ss_conf 437637788628 No 45 >TIGR00575 dnlj DNA ligase, NAD-dependent; InterPro: IPR001679 DNA ligase (polydeoxyribonucleotide synthase) is the enzyme that joins two DNA fragments by catalyzing the formation of an internucleotide ester bond between phosphate and deoxyribose. It is active during DNA replication, DNA repair and DNA recombination. There are two forms of DNA ligase: one requires ATP (6.5.1.1 from EC), the other NAD (6.5.1.2 from EC). This family is predominantly composed of NAD-dependent bacterial DNA ligases. They are proteins of about 75 to 85 Kd whose sequence is well conserved , . They also show similarity to yicF, an Escherichia coli hypothetical protein of 63 Kd.; GO: 0003911 DNA ligase (NAD+) activity, 0006260 DNA replication, 0006281 DNA repair. Probab=24.37 E-value=49 Score=13.72 Aligned_cols=103 Identities=16% Similarity=0.182 Sum_probs=71.1 Q ss_pred EEEEECCCCCCEEEEEEEECCCCEEEEEEEECCC-----CC---------EEEEECCCCCCCCCEEEECCCC--CC---- Q ss_conf 9886337776047887641588525889841477-----63---------4675164566104578836667--67---- Q gi|254780258|r 65 VDFKRGEYGLEGVVKRLEYDPNRTAFIALISYSN-----GS---------LAYILAPQRLSVGDKVISSDSA--VD---- 124 (278) Q Consensus 65 IDf~r~~~~~~~~V~~IeYDPnRsa~IALv~~~~-----g~---------~~YIlAp~gl~vGd~I~s~~~~--~~---- 124 (278) |=||+..--...++.+|+--=+||..|--|...+ |. ..| |--.++.|||.|+....+ +| T Consensus 317 iAYKFpa~e~~T~L~dv~~qVGRTG~iTPvA~LePV~vaG~~Vs~ATLHN~D~-I~~kdiriGD~Vvv~kAGdVIP~vv~ 395 (706) T TIGR00575 317 IAYKFPAEEAQTKLLDVVVQVGRTGAITPVAKLEPVFVAGTTVSRATLHNEDE-IEEKDIRIGDTVVVRKAGDVIPKVVE 395 (706) T ss_pred HCCCCCCCCCCEEEEEEEEEECCEEEECCEEEECCEEECCEEEEEEEECCHHH-HHCCCCCCCCEEEEEECCCCCHHHHH T ss_conf 02366743010089878998465677600367565587207862344016002-42027665888999844862535541 Q ss_pred ----CCCCCCHHHHHCCCCCEEEEEEEEC--CCCEEEEEE-CCCEEEEEEEC Q ss_conf ----5757411044357463776233303--688258863-46518996750 Q gi|254780258|r 125 ----VKPGNAMPLRFIPVGSIVHNVEMKP--GKGGQISRS-AGSYARLVERD 169 (278) Q Consensus 125 ----i~~Gn~lpL~~IP~Gt~I~NIE~~p--G~Ggkl~Rs-AGt~a~Ii~k~ 169 (278) ..+|+.-| ..+|.-+.+|+=++.. +.-..+|=. -.|.||.+..- T Consensus 396 v~~~~R~~~~~~-~~~P~~CP~C~s~lv~~~~e~~~rC~Ng~~Cpa~~~e~i 446 (706) T TIGR00575 396 VLLEKRTGEERP-IKFPTHCPSCGSPLVRIEEEVDIRCTNGLNCPAQLVERI 446 (706) T ss_pred CCCCCCCCCCCE-EECCCCCCCCCCEEECCCCCEEEECCCCCCCHHHHHHHE T ss_conf 023578888870-342871888883311157872165388766536765110 No 46 >COG1430 Uncharacterized conserved protein [Function unknown] Probab=23.70 E-value=51 Score=13.63 Aligned_cols=56 Identities=27% Similarity=0.232 Sum_probs=33.6 Q ss_pred CCCEEEEEEEEECCCCCCEEEEEEEE-CCCCEEEEEEEECCCCCEEEEE-------CCCCCCCCCEEEEC Q ss_conf 51026879886337776047887641-5885258898414776346751-------64566104578836 Q gi|254780258|r 58 HKNRYRLVDFKRGEYGLEGVVKRLEY-DPNRTAFIALISYSNGSLAYIL-------APQRLSVGDKVISS 119 (278) Q Consensus 58 hKr~yR~IDf~r~~~~~~~~V~~IeY-DPnRsa~IALv~~~~g~~~YIl-------Ap~gl~vGd~I~s~ 119 (278) ||.-+..+|.-+-.. .+.|.+|.+ -|.+...+.- .+..+|+| +..++++||.+..- T Consensus 57 Mknt~lpLDiiFid~--dg~i~~i~~~~P~~~~~~~~----~~~~~yvLEl~~G~~~~~~i~vGd~v~~~ 120 (126) T COG1430 57 MKNTMLPLDIIFIDS--DGRVVDIVELVPWSTYPCKS----YGPVRYVLELPAGWAARLGIKVGDRVEFR 120 (126) T ss_pred EECCCCCEEEEEECC--CCCEEEEEECCCCCCCCCCC----CCCCCEEEEECCCCHHHCCCCCCCEEEEC T ss_conf 526774248999958--99799989656665578898----88825899952783232188648898862 No 47 >COG0448 GlgC ADP-glucose pyrophosphorylase [Carbohydrate transport and metabolism] Probab=23.29 E-value=42 Score=14.15 Aligned_cols=126 Identities=17% Similarity=0.285 Sum_probs=52.5 Q ss_pred EEEEEEEEECCC---CCCEEEEEEEECCCCEEEEEEEECC--------C--CCEEEEECCCCCCCCCEEEECCCCCCCCC Q ss_conf 268798863377---7604788764158852588984147--------7--63467516456610457883666767575 Q gi|254780258|r 61 RYRLVDFKRGEY---GLEGVVKRLEYDPNRTAFIALISYS--------N--GSLAYILAPQRLSVGDKVISSDSAVDVKP 127 (278) Q Consensus 61 ~yR~IDf~r~~~---~~~~~V~~IeYDPnRsa~IALv~~~--------~--g~~~YIlAp~gl~vGd~I~s~~~~~~i~~ 127 (278) +||+|||--... ++.-+-+--.|.|- +|..+. + ++.-+||.|+--.-++.-..+. +..+- T Consensus 36 kYRiIDF~LSN~vNSGi~~I~VltQy~~~-----SL~~Hi~~G~~w~l~~~~~~v~ilp~~~~~~~~~wy~Gt-adai~- 108 (393) T COG0448 36 KYRIIDFALSNCVNSGIRRIGVLTQYKSH-----SLNDHIGRGWPWDLDRKNGGVFILPAQQREGGERWYEGT-ADAIY- 108 (393) T ss_pred EEEEEEEECCCCCCCCCCEEEEEECCCHH-----HHHHHHHCCCCCCCCCCCCCEEEECCHHCCCCCCCEECC-HHHHH- T ss_conf 46898677000010388769999435536-----799986289865665555847995711125787312044-79999- Q ss_pred CCCHH-------HHHCCCCCEEEEEEEECCCCEEEEEECCCEEEEEEEC------CCEEEEEEC-CCCEEEE-------- Q ss_conf 74110-------4435746377623330368825886346518996750------888999948-9758998-------- Q gi|254780258|r 128 GNAMP-------LRFIPVGSIVHNVEMKPGKGGQISRSAGSYARLVERD------RSRALLRLS-SGEMRFV-------- 185 (278) Q Consensus 128 Gn~lp-------L~~IP~Gt~I~NIE~~pG~Ggkl~RsAGt~a~Ii~k~------~~~~~ikLP-SGe~r~i-------- 185 (278) =|.+. .--|--|..|||+.... =-++-..+|+.++|..++ ..+-+++.- ++.+..| T Consensus 109 Qnl~~i~~~~~eyvlIlsgDhIYkmDy~~--ml~~H~~~gadiTv~~~~Vp~~eas~fGim~~D~~~~i~~F~eKp~~~~ 186 (393) T COG0448 109 QNLLIIRRSDPEYVLILSGDHIYKMDYSD--MLDFHIESGADVTVAVKEVPREEASRFGVMNVDENGRIIEFVEKPADGP 186 (393) T ss_pred HHHHHHHHCCCCEEEEECCCEEEECCHHH--HHHHHHHCCCCEEEEEEECCHHHHHHCCCEEECCCCCEEEEEECCCCCC T ss_conf 76899974398889995287789448999--9999998199879999987967600248158879997887352667688 Q ss_pred CCCCEEEEEE Q ss_conf 5678099985 Q gi|254780258|r 186 HSSCMASIGA 195 (278) Q Consensus 186 ~~~c~AtIG~ 195 (278) ..+..|..|+ T Consensus 187 ~~~~laSMgi 196 (393) T COG0448 187 PSNSLASMGI 196 (393) T ss_pred CCCCEEEEEE T ss_conf 6653135566 No 48 >KOG3556 consensus Probab=23.05 E-value=47 Score=13.87 Aligned_cols=60 Identities=23% Similarity=0.346 Sum_probs=30.8 Q ss_pred EECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCC------CCCCCCCCCCCCCCCC---CCCCCCEECCCC Q ss_conf 51255332300131344100577997012332687------4788888888888887---887887343772 Q gi|254780258|r 195 AVSNQDHSNVNHAKAGRARWLGMRSHVRGVAMNPV------DHPHGGGEGKTSGGRN---PCSPWGKLTKGK 257 (278) Q Consensus 195 ~vsN~~~~~~~~gKAG~~Rwlg~RP~VRGvAMNpv------DHPhGGGeGkt~~gr~---~~spwG~~~kg~ 257 (278) .|+|.+-..... |.-||.|.+|-.-||--|-- --|-+--.|..+|-|- --+-|-..-.|. T Consensus 243 eV~np~~~~~~y---gvVrWIG~pP~~~~Vl~gve~edet~~~~~~~~dgs~~g~Rlftc~d~~a~Fv~~~s 311 (724) T KOG3556 243 EVENPETMRRVY---GVVRWIGEPPEASGVLYGVEFEDETSQWPSSNQDGSSSGDRLFTCFDTNANFVMSGS 311 (724) T ss_pred EECCCCCCCCCC---EEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCEEEEECCCCCEEEECCC T ss_conf 861776443344---156762579762571684455475456765466676467467664056402567578 No 49 >TIGR01393 lepA GTP-binding protein LepA; InterPro: IPR006297 LepA (GUF1 in Saccaromyces) is a GTP-binding membrane protein related to EF-G and EF-Tu. Two types of phylogenetic tree, rooted by other GTP-binding proteins, suggest that eukaryotic homologs (including GUF1 of yeast) originated within the bacterial LepA family. The function of the proteins in this family are unknown. ; GO: 0005525 GTP binding. Probab=22.86 E-value=53 Score=13.53 Aligned_cols=27 Identities=22% Similarity=0.460 Sum_probs=21.4 Q ss_pred CCEEEEEEEECCCCEEEEEEEECCCCCE Q ss_conf 6047887641588525889841477634 Q gi|254780258|r 74 LEGVVKRLEYDPNRTAFIALISYSNGSL 101 (278) Q Consensus 74 ~~~~V~~IeYDPnRsa~IALv~~~~g~~ 101 (278) ..|-|-+=-||++|-. |++|+-.+|+. T Consensus 192 LkALIFDS~YD~YrGV-v~~vRv~~G~i 218 (598) T TIGR01393 192 LKALIFDSHYDNYRGV-VVLVRVFEGTI 218 (598) T ss_pred CEEEEEEEEECCCCEE-EEEEEEEECEE T ss_conf 3227884354386508-99999952686 No 50 >cd01773 Faf1_like1_UBX Faf1_like1 is a protein of unknown function with a domain architecture that includes the UAS (ubiquitin-associated) and UBX (ubiquitin-like) domains. This protein is related to other UBA/UBX-containing proteins like Faf1, p47, and SAKS1 and may serve as an adaptor molecule that shuttles proteins to the proteasome for degradation. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins. Probab=22.49 E-value=48 Score=13.82 Aligned_cols=24 Identities=25% Similarity=0.275 Sum_probs=13.8 Q ss_pred EEEEEECCCCEEEECCCCEEEEEE Q ss_conf 899994897589985678099985 Q gi|254780258|r 172 RALLRLSSGEMRFVHSSCMASIGA 195 (278) Q Consensus 172 ~~~ikLPSGe~r~i~~~c~AtIG~ 195 (278) .+.+|+|.||.+...-.+-++|=. T Consensus 7 kl~iR~P~Ger~~~~f~as~~lq~ 30 (82) T cd01773 7 RLMLRYPDGKREQIALPEQAKLLA 30 (82) T ss_pred EEEEECCCCCEEEEEEECCCHHHH T ss_conf 799998999878999635762999 No 51 >COG0298 HypC Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones] Probab=21.77 E-value=55 Score=13.39 Aligned_cols=43 Identities=19% Similarity=0.379 Sum_probs=28.4 Q ss_pred CCCCEEEEEEEECCCCEEEEEEEECCC--CCEEEEECCCCCCCCCEEEE Q ss_conf 776047887641588525889841477--63467516456610457883 Q gi|254780258|r 72 YGLEGVVKRLEYDPNRTAFIALISYSN--GSLAYILAPQRLSVGDKVIS 118 (278) Q Consensus 72 ~~~~~~V~~IeYDPnRsa~IALv~~~~--g~~~YIlAp~gl~vGd~I~s 118 (278) ..+|+.|..|.. |+. +|+|.+.. -+-+.-|-++..++||.|.. T Consensus 3 laiPgqI~~I~~--~~~--~A~Vd~gGvkreV~l~Lv~~~v~~GdyVLV 47 (82) T COG0298 3 LAIPGQIVEIDD--NNH--LAIVDVGGVKREVNLDLVGEEVKVGDYVLV 47 (82) T ss_pred CCCCCEEEEEEC--CCC--EEEEEECCEEEEEEEEEECCCCCCCCEEEE T ss_conf 266727899807--885--589986567689875330475334778999 No 52 >KOG2051 consensus Probab=21.31 E-value=43 Score=14.11 Aligned_cols=26 Identities=31% Similarity=0.700 Sum_probs=15.5 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEECCC Q ss_conf 7478888888888888788788734377 Q gi|254780258|r 229 VDHPHGGGEGKTSGGRNPCSPWGKLTKG 256 (278) Q Consensus 229 vDHPhGGGeGkt~~gr~~~spwG~~~kg 256 (278) -|||.|||||++..+-+ .|.-..+.| T Consensus 1012 k~~p~gg~~g~ae~~~~--~~f~lmvr~ 1037 (1128) T KOG2051 1012 KDEPTGGEEGEAEEASQ--VPFTLMVRG 1037 (1128) T ss_pred CCCCCCCCCCCCCCCCC--CEEEEEECC T ss_conf 68887644454455640--026732300 No 53 >KOG3429 consensus Probab=21.28 E-value=57 Score=13.32 Aligned_cols=50 Identities=20% Similarity=0.322 Sum_probs=32.4 Q ss_pred CCCHHHHHCCCCCEEEEEEEECCCCEEEEEECCCEEEEEEECCCEEEEEECCCCEEEECC Q ss_conf 741104435746377623330368825886346518996750888999948975899856 Q gi|254780258|r 128 GNAMPLRFIPVGSIVHNVEMKPGKGGQISRSAGSYARLVERDRSRALLRLSSGEMRFVHS 187 (278) Q Consensus 128 Gn~lpL~~IP~Gt~I~NIE~~pG~Ggkl~RsAGt~a~Ii~k~~~~~~ikLPSGe~r~i~~ 187 (278) |...+-+.||.- ++|+ .|+||+|--+|=+.|....|.|+|+=.+--||+. T Consensus 29 g~~~~~g~ipld----~~~i------~y~RSSGPGGQNVNKvNTKv~vrf~vs~a~Wipe 78 (172) T KOG3429 29 GAIHFKGKIPLD----QLEI------SYSRSSGPGGQNVNKVNTKVEVRFKVSNAEWIPE 78 (172) T ss_pred CCCCCCCCCCHH----HEEE------EEEECCCCCCCCCCCCCCEEEEEEECCHHHHCCH T ss_conf 546558878631----6478------8740579987652021443899975013656119 No 54 >pfam04472 DUF552 Protein of unknown function (DUF552). Family of uncharacterized proteins. Probab=21.21 E-value=19 Score=16.40 Aligned_cols=36 Identities=31% Similarity=0.734 Sum_probs=20.7 Q ss_pred CEEEEEEEEECC-CCCCEEEEEEEECCCCEEEEEEEECCCCCEEEEECCCCCCC Q ss_conf 026879886337-77604788764158852588984147763467516456610 Q gi|254780258|r 60 NRYRLVDFKRGE-YGLEGVVKRLEYDPNRTAFIALISYSNGSLAYILAPQRLSV 112 (278) Q Consensus 60 r~yR~IDf~r~~-~~~~~~V~~IeYDPnRsa~IALv~~~~g~~~YIlAp~gl~v 112 (278) ...|+|||-..- +...|.+..| ++.-|+++|.++.+ T Consensus 36 ~a~RiiDfl~G~~~al~G~i~~v-----------------~~~v~l~~P~~v~I 72 (73) T pfam04472 36 QARRILDFLAGAVYALGGDIQRV-----------------GDKIFLLTPSNVKI 72 (73) T ss_pred HHHHHHHHHHHHHHHHCCEEEEE-----------------CCCEEEEECCCCEE T ss_conf 97889999720262127899996-----------------19899997998393 No 55 >PRK13839 conjugal transfer protein TrbG; Provisional Probab=20.97 E-value=57 Score=13.28 Aligned_cols=12 Identities=8% Similarity=0.393 Sum_probs=6.7 Q ss_pred CCCEEEEEECCC Q ss_conf 088899994897 Q gi|254780258|r 169 DRSRALLRLSSG 180 (278) Q Consensus 169 ~~~~~~ikLPSG 180 (278) |+..+.|.||++ T Consensus 206 DG~KTyIqfP~~ 217 (274) T PRK13839 206 DGQKTYIQFPRS 217 (274) T ss_pred CCCEEEEECCCC T ss_conf 993799999987 No 56 >cd03691 BipA_TypA_II BipA_TypA_II: domain II of BipA (also called TypA) having homology to domain II of the elongation factors (EFs) EF-G and EF-Tu. BipA is a highly conserved protein with global regulatory properties in Escherichia coli. BipA is phosphorylated on a tyrosine residue under some cellular conditions. Mutants show altered regulation of some pathways. BipA functions as a translation factor that is required specifically for the expression of the transcriptional modulator Fis. BipA binds to ribosomes at a site that coincides with that of EF-G and has a GTPase activity that is sensitive to high GDP:GTP ratios and, is stimulated by 70S ribosomes programmed with mRNA and aminoacylated tRNAs. The growth rate-dependent induction of BipA allows the efficient expression of Fis, thereby modulating a range of downstream processes, including DNA metabolism and type III secretion. Probab=20.72 E-value=58 Score=13.25 Aligned_cols=25 Identities=28% Similarity=0.336 Sum_probs=14.9 Q ss_pred CEEEEEEEECCCCEEEEEEEECCCCC Q ss_conf 04788764158852588984147763 Q gi|254780258|r 75 EGVVKRLEYDPNRTAFIALISYSNGS 100 (278) Q Consensus 75 ~~~V~~IeYDPnRsa~IALv~~~~g~ 100 (278) +..|..++|||+- ..||.++-..|. T Consensus 2 ~mlv~~~~~d~~~-Gri~~~RV~sG~ 26 (86) T cd03691 2 QMLVTTLDYDDYV-GRIAIGRIFRGT 26 (86) T ss_pred CEEEEEEECCCCC-CEEEEEEEECCC T ss_conf 6899981018877-559999984695 No 57 >smart00287 SH3b Bacterial SH3 domain homologues. Probab=20.56 E-value=59 Score=13.23 Aligned_cols=43 Identities=23% Similarity=0.358 Sum_probs=29.2 Q ss_pred EEEECCCCEEEE--EECCCEEEEEEECC-CEEEEEECCCCEEEECC Q ss_conf 333036882588--63465189967508-88999948975899856 Q gi|254780258|r 145 VEMKPGKGGQIS--RSAGSYARLVERDR-SRALLRLSSGEMRFVHS 187 (278) Q Consensus 145 IE~~pG~Ggkl~--RsAGt~a~Ii~k~~-~~~~ikLPSGe~r~i~~ 187 (278) +-.-|+--+..+ =.+|+...++..+. +.+.|++++|+.-+|+. T Consensus 13 vR~gP~t~~~ii~~l~~G~~v~vl~~~~~~W~~i~~~~G~~Gwv~~ 58 (63) T smart00287 13 VRSGPGTSSPIIGTLKKGDKVKVLGVDGQDWAKITYGSGQRGYVPG 58 (63) T ss_pred ECCCCCCCCCEEEEECCCCEEEEEECCCCCEEEEECCCCCEEEEEE T ss_conf 6169999863479986999999953269979999918998999992 No 58 >TIGR02386 rpoC_TIGR DNA-directed RNA polymerase, beta' subunit; InterPro: IPR012754 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. This entry represents the beta-prime subunit, RpoC, found in most bacteria. It excludes some, mainly cyanobacterial, species where RpoC is replaced by two homologous proteins that include an additional domain. One arm of the "claw" is predominantly formed by this subunit, the other being predominantly formed by the beta subunit. The active site of the enzyme is defined by three invariant aspartate residues within the beta-prime subunit .; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006350 transcription. Probab=20.55 E-value=39 Score=14.36 Aligned_cols=39 Identities=13% Similarity=0.193 Sum_probs=20.1 Q ss_pred EECCC--CEEEEEECCCEEEEE-EECCCE-EEEEECCC--CEEEE Q ss_conf 30368--825886346518996-750888-99994897--58998 Q gi|254780258|r 147 MKPGK--GGQISRSAGSYARLV-ERDRSR-ALLRLSSG--EMRFV 185 (278) Q Consensus 147 ~~pG~--Ggkl~RsAGt~a~Ii-~k~~~~-~~ikLPSG--e~r~i 185 (278) -+|-+ -+.+|..-|+-..+- ..-+.+ +.|+=-.. |.... T Consensus 1328 r~PK~~e~A~Iae~dG~v~~~~~~~k~~r~~~i~~e~~~~e~~~Y 1372 (1552) T TIGR02386 1328 RTPKDFEAAVIAEVDGVVEIIEDDVKNKRVVVIKDENDDGEEKEY 1372 (1552) T ss_pred CCCCCCCCCCEEHCCCEEEEECCCCCCEEEEEEECCCCCCCEEEE T ss_conf 688611211000102178851144234089998316888534787 Done!