Query gi|254780874|ref|YP_003065287.1| peptide chain release factor 1 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 357 No_of_seqs 251 out of 3008 Neff 6.5 Searched_HMMs 39220 Date Mon May 30 00:15:50 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780874.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR00019 prfA peptide chain r 100.0 0 0 1124.0 30.3 353 5-357 6-372 (373) 2 TIGR00020 prfB peptide chain r 100.0 0 0 1055.2 27.7 332 8-347 26-372 (373) 3 PRK00591 prfA peptide chain re 100.0 0 0 1034.3 35.9 356 1-357 1-360 (360) 4 COG0216 PrfA Protein chain rel 100.0 0 0 989.4 34.2 356 1-357 1-362 (363) 5 PRK07245 consensus 100.0 0 0 941.2 32.7 330 12-349 1-336 (337) 6 PRK00578 prfB peptide chain re 100.0 0 0 928.1 34.4 338 7-350 23-366 (367) 7 PRK05589 peptide chain release 100.0 0 0 900.3 31.6 318 23-347 1-324 (325) 8 PRK06746 peptide chain release 100.0 0 0 885.7 30.2 319 23-347 1-325 (326) 9 PRK07342 peptide chain release 100.0 0 0 883.0 30.5 309 35-349 1-312 (322) 10 PRK08787 peptide chain release 100.0 0 0 858.9 30.7 308 42-355 2-311 (313) 11 KOG2726 consensus 100.0 0 0 773.4 29.6 332 18-357 52-386 (386) 12 COG1186 PrfB Protein chain rel 100.0 0 0 623.9 17.7 233 112-347 1-238 (239) 13 TIGR03072 release_prfH putativ 100.0 0 0 568.6 19.6 196 112-312 1-198 (200) 14 PRK08179 prfH peptide chain re 100.0 0 0 551.8 20.7 199 112-315 2-202 (204) 15 pfam00472 RF-1 RF-1 domain. Th 100.0 7.3E-38 1.9E-42 293.8 10.3 113 208-320 1-114 (114) 16 pfam03462 PCRF PCRF domain. Th 100.0 8.3E-36 2.1E-40 278.9 9.9 112 68-179 4-115 (115) 17 PRK09256 hypothetical protein; 99.7 6.3E-17 1.6E-21 142.8 5.7 67 214-280 7-98 (138) 18 KOG3429 consensus 98.9 1.7E-09 4.3E-14 89.2 5.9 72 211-282 31-128 (172) 19 PRK07245 consensus 95.4 0.13 3.3E-06 32.3 8.0 65 272-336 263-331 (337) 20 PRK11147 ABC transporter ATPas 93.1 0.38 9.7E-06 28.9 6.3 24 209-233 334-357 (632) 21 PRK10636 putative ABC transpor 89.2 2.1 5.3E-05 23.6 8.5 23 210-233 328-350 (638) 22 TIGR03545 conserved hypothetic 89.2 2.1 5.3E-05 23.6 7.7 20 122-141 287-306 (554) 23 PRK12595 bifunctional 3-deoxy- 74.2 7.5 0.00019 19.6 6.3 113 4-123 1-117 (360) 24 KOG0971 consensus 70.3 9.1 0.00023 19.0 10.6 42 121-162 786-827 (1243) 25 COG0249 MutS Mismatch repair A 69.8 9.3 0.00024 18.9 8.8 158 3-175 306-473 (843) 26 PRK13520 L-tyrosine decarboxyl 65.9 8.8 0.00022 19.1 3.7 115 120-257 87-238 (375) 27 pfam02128 Peptidase_M36 Fungal 65.6 5.8 0.00015 20.4 2.7 42 304-345 249-296 (368) 28 pfam11593 Med3 Mediator comple 62.6 13 0.00032 17.9 6.2 82 13-99 7-90 (381) 29 pfam09418 DUF2009 Protein of u 62.0 13 0.00033 17.9 5.6 73 101-193 229-317 (458) 30 KOG1697 consensus 62.0 6.6 0.00017 20.0 2.4 102 98-213 70-173 (275) 31 PRK05399 DNA mismatch repair p 60.6 14 0.00035 17.7 7.2 46 4-49 308-353 (848) 32 pfam08399 VWA_N VWA N-terminal 59.3 4.1 0.00011 21.4 1.0 19 233-254 55-73 (123) 33 TIGR02168 SMC_prok_B chromosom 55.9 16 0.00041 17.2 11.7 36 249-284 661-700 (1191) 34 TIGR02194 GlrX_NrdH Glutaredox 55.3 7.2 0.00018 19.7 1.7 41 134-174 13-61 (72) 35 pfam05557 MAD Mitotic checkpoi 53.8 17 0.00044 16.9 9.0 33 129-164 445-477 (722) 36 PRK08883 ribulose-phosphate 3- 53.4 18 0.00045 16.9 4.2 59 112-175 133-201 (220) 37 COG1256 FlgK Flagellar hook-as 52.4 18 0.00047 16.8 7.1 19 9-27 87-105 (552) 38 PRK00474 rps9p 30S ribosomal p 52.1 9.3 0.00024 18.9 1.9 32 192-227 5-36 (132) 39 PRK05581 ribulose-phosphate 3- 51.7 19 0.00048 16.7 4.3 59 112-175 136-204 (220) 40 pfam10458 Val_tRNA-synt_C Valy 51.5 19 0.00048 16.7 7.2 53 5-57 9-63 (66) 41 PRK06737 acetolactate synthase 50.7 11 0.00028 18.4 2.0 33 140-172 23-55 (76) 42 pfam02815 MIR MIR domain. The 50.6 19 0.0005 16.6 4.0 33 225-257 121-153 (189) 43 cd01989 STK_N The N-terminal d 49.2 15 0.00039 17.4 2.6 75 127-207 61-143 (146) 44 pfam05616 Neisseria_TspB Neiss 49.0 15 0.00037 17.5 2.4 37 107-143 199-239 (502) 45 pfam05192 MutS_III MutS domain 49.0 21 0.00052 16.4 7.8 114 3-122 50-173 (306) 46 KOG3274 consensus 48.6 12 0.00032 18.0 2.1 112 132-253 75-189 (210) 47 pfam00380 Ribosomal_S9 Ribosom 47.4 13 0.00033 17.9 2.0 26 195-222 1-26 (121) 48 PRK13182 racA polar chromosome 46.0 23 0.00058 16.1 6.8 11 58-68 103-113 (178) 49 PRK10329 glutaredoxin-like pro 45.9 13 0.00034 17.8 1.8 42 133-174 14-62 (81) 50 COG1196 Smc Chromosome segrega 45.8 23 0.00058 16.1 9.3 24 170-194 555-578 (1163) 51 TIGR03627 arch_S9P archaeal ri 45.8 12 0.00031 18.0 1.7 32 192-227 3-34 (130) 52 PRK08745 ribulose-phosphate 3- 45.1 23 0.0006 16.0 4.4 59 112-175 137-205 (223) 53 pfam07050 consensus 45.0 23 0.0006 16.0 10.6 63 3-65 1-63 (145) 54 PRK05431 seryl-tRNA synthetase 44.6 24 0.00061 16.0 11.1 103 3-111 1-112 (422) 55 pfam00834 Ribul_P_3_epim Ribul 44.0 24 0.00062 15.9 4.3 58 112-174 132-199 (201) 56 pfam04983 RNA_pol_Rpb1_3 RNA p 43.0 23 0.00059 16.1 2.7 32 120-151 126-157 (158) 57 pfam08014 DUF1704 Domain of un 41.8 26 0.00067 15.7 6.0 121 121-287 106-237 (348) 58 COG0103 RpsI Ribosomal protein 41.1 16 0.0004 17.3 1.6 19 191-209 6-24 (130) 59 PRK13902 alaS lanyl-tRNA synth 40.6 27 0.00069 15.5 9.2 23 220-242 682-707 (898) 60 KOG3358 consensus 40.4 17 0.00043 17.1 1.7 18 239-256 145-162 (211) 61 PRK08178 acetolactate synthase 40.0 21 0.00053 16.4 2.1 50 111-172 12-61 (96) 62 pfam08317 Spc7 Spc7 kinetochor 39.7 28 0.00072 15.4 8.4 37 111-151 263-299 (322) 63 COG4402 Uncharacterized protei 39.6 13 0.00033 17.8 1.0 13 246-258 393-405 (457) 64 KOG0804 consensus 38.8 29 0.00074 15.4 9.0 22 81-102 428-449 (493) 65 cd04878 ACT_AHAS N-terminal AC 38.7 21 0.00053 16.4 1.9 40 131-171 13-52 (72) 66 KOG3409 consensus 38.6 28 0.00072 15.4 2.6 70 168-239 91-168 (193) 67 pfam00509 Hemagglutinin Hemagg 38.5 29 0.00075 15.3 9.7 17 6-22 370-386 (550) 68 KOG1760 consensus 37.8 30 0.00076 15.2 8.4 70 28-97 15-104 (131) 69 KOG1290 consensus 36.4 13 0.00033 17.8 0.6 10 336-345 526-535 (590) 70 TIGR00651 pta phosphate acetyl 36.1 2.9 7.3E-05 22.6 -2.8 119 198-343 161-298 (322) 71 TIGR00106 TIGR00106 conserved 36.1 25 0.00065 15.7 2.0 32 319-350 35-67 (103) 72 pfam02298 Cu_bind_like Plastoc 35.5 15 0.00037 17.5 0.7 34 136-192 9-42 (84) 73 PRK11895 ilvH acetolactate syn 35.4 26 0.00067 15.7 2.0 61 131-192 15-80 (161) 74 KOG0081 consensus 34.9 32 0.00083 15.0 2.4 45 178-234 16-60 (219) 75 PHA00159 endonuclease I 34.6 21 0.00054 16.3 1.4 15 135-149 108-122 (149) 76 PRK08566 DNA-directed RNA poly 34.1 34 0.00087 14.8 2.6 31 171-206 299-329 (881) 77 pfam05367 Phage_endo_I Phage e 33.7 22 0.00057 16.2 1.4 14 135-148 108-121 (149) 78 pfam10275 Peptidase_C65 Peptid 33.5 35 0.00088 14.8 5.5 118 7-146 31-153 (237) 79 PRK11546 zraP zinc resistance 33.4 35 0.00089 14.8 5.8 48 53-100 64-111 (139) 80 CHL00100 ilvH acetohydroxyacid 33.3 35 0.00089 14.8 2.9 65 131-196 15-84 (172) 81 TIGR02898 spore_YhcN_YlaJ spor 33.3 20 0.00052 16.5 1.2 17 157-173 84-100 (185) 82 COG3378 Phage associated DNA p 33.2 35 0.00089 14.7 6.7 12 338-349 441-452 (517) 83 PRK00132 rpsI 30S ribosomal pr 33.2 30 0.00076 15.2 2.0 30 191-222 6-35 (130) 84 pfam12112 DUF3579 Protein of u 33.1 22 0.00057 16.1 1.4 64 113-178 10-78 (93) 85 COG3978 Acetolactate synthase 32.6 35 0.00088 14.8 2.3 52 127-178 11-62 (86) 86 KOG2150 consensus 31.3 38 0.00096 14.5 9.1 48 8-55 16-67 (575) 87 TIGR02153 gatD_arch glutamyl-t 31.2 16 0.0004 17.3 0.3 111 120-248 149-291 (413) 88 pfam04800 ETC_C1_NDUFA4 ETC co 31.2 21 0.00054 16.3 1.0 18 137-154 62-79 (101) 89 pfam01403 Sema Sema domain. Th 30.1 39 0.001 14.4 5.5 77 168-249 182-288 (434) 90 KOG4044 consensus 30.0 39 0.001 14.4 2.2 126 132-275 20-155 (201) 91 pfam11103 DUF2887 Protein of u 29.5 40 0.001 14.3 2.6 121 62-201 16-137 (200) 92 KOG0581 consensus 29.5 40 0.001 14.3 2.6 54 196-256 55-108 (364) 93 smart00787 Spc7 Spc7 kinetocho 29.2 41 0.001 14.3 8.1 35 112-150 260-294 (312) 94 cd06145 REX1_like This subfami 29.0 24 0.0006 16.0 0.9 13 184-196 15-27 (150) 95 pfam01447 Peptidase_M4 Thermol 28.7 41 0.0011 14.2 2.1 28 119-146 66-93 (150) 96 COG0440 IlvH Acetolactate synt 28.3 42 0.0011 14.2 3.3 110 137-255 22-147 (163) 97 PRK08091 ribulose-phosphate 3- 27.6 43 0.0011 14.1 4.3 59 112-175 152-220 (235) 98 pfam00606 Glycoprotein_B Herpe 27.6 8.7 0.00022 19.1 -1.5 45 298-343 588-632 (711) 99 CHL00079 rps9 ribosomal protei 27.4 38 0.00096 14.5 1.7 31 190-222 8-38 (134) 100 pfam11926 DUF3444 Domain of un 27.4 26 0.00066 15.7 0.9 54 108-170 114-176 (216) 101 TIGR00593 pola DNA polymerase 27.2 16 0.00041 17.2 -0.2 34 14-47 178-220 (1005) 102 PRK06945 flgK flagellar hook-a 26.3 45 0.0012 13.9 6.6 19 8-26 84-102 (649) 103 KOG0895 consensus 26.2 41 0.001 14.3 1.7 16 265-280 998-1013(1101) 104 PRK09722 allulose-6-phosphate 26.1 46 0.0012 13.9 3.8 38 112-149 133-170 (227) 105 PRK11152 ilvM acetolactate syn 25.9 40 0.001 14.3 1.6 42 137-178 21-62 (76) 106 TIGR02090 LEU1_arch isopropylm 25.4 20 0.00051 16.5 0.0 14 167-183 276-289 (371) 107 COG0011 Uncharacterized conser 25.4 45 0.0011 14.0 1.8 24 323-346 37-60 (100) 108 PRK07300 consensus 24.7 23 0.00057 16.1 0.2 42 311-354 546-589 (880) 109 TIGR00263 trpB tryptophan synt 24.5 49 0.0012 13.7 3.0 32 8-40 21-52 (412) 110 PRK08434 consensus 24.1 25 0.00064 15.8 0.3 52 14-65 170-232 (887) 111 KOG3389 consensus 23.5 34 0.00087 14.8 0.9 18 136-153 137-154 (178) 112 KOG0544 consensus 22.9 36 0.00093 14.6 0.9 34 142-180 57-90 (108) 113 cd06143 PAN2_exo PAN2 is the c 22.6 34 0.00086 14.9 0.7 26 183-208 31-57 (174) 114 COG2126 RPL37A Ribosomal prote 22.5 24 0.00061 15.9 -0.0 21 301-321 40-60 (61) 115 PTZ00086 40S ribosomal protein 22.5 53 0.0014 13.4 1.9 29 192-222 12-40 (147) 116 KOG2148 consensus 22.1 54 0.0014 13.4 10.9 14 121-134 306-319 (867) 117 KOG2019 consensus 22.0 39 0.00099 14.4 0.9 14 193-206 624-637 (998) 118 pfam07962 Swi3 Replication For 21.8 55 0.0014 13.4 3.0 27 116-143 23-49 (84) 119 COG2960 Uncharacterized protei 21.7 48 0.0012 13.8 1.4 83 1-98 8-90 (103) 120 TIGR02660 nifV_homocitr homoci 21.7 26 0.00066 15.7 -0.0 12 158-169 210-221 (369) 121 TIGR02375 pseudoazurin pseudoa 21.6 46 0.0012 13.9 1.3 34 142-175 34-73 (123) 122 TIGR01149 mtrG tetrahydrometha 21.1 56 0.0014 13.3 2.6 34 2-37 9-43 (72) 123 PRK00252 alaS alanyl-tRNA synt 21.1 57 0.0014 13.2 9.4 35 219-253 639-681 (864) 124 PRK11522 putrescine--2-oxoglut 21.1 57 0.0014 13.2 5.9 87 120-212 157-252 (468) 125 TIGR01875 cas_MJ0381 CRISPR-as 21.0 45 0.0012 13.9 1.1 107 129-242 50-196 (300) 126 cd04911 ACT_AKiii-YclM-BS_1 AC 20.9 35 0.0009 14.7 0.5 23 243-265 29-53 (76) 127 TIGR02566 cas_Csy3 CRISPR-asso 20.5 58 0.0015 13.2 3.4 137 97-255 81-240 (370) 128 TIGR01461 greB transcription e 20.3 58 0.0015 13.1 2.0 60 110-193 87-146 (157) 129 PRK13562 acetolactate synthase 20.2 59 0.0015 13.1 1.8 37 131-168 15-51 (84) 130 KOG4657 consensus 20.2 59 0.0015 13.1 7.7 97 9-114 21-123 (246) 131 COG1162 Predicted GTPases [Gen 20.0 50 0.0013 13.6 1.2 25 228-255 196-221 (301) No 1 >TIGR00019 prfA peptide chain release factor 1; InterPro: IPR004373 This family describes peptide chain release factor 1 (PrfA, RF-1), and excludes the related peptide chain release factor 2 (PrfB, RF-2). RF-1 helps recognise and terminate translation at UAA and UAG stop codons. The mitochondrial release factors are prfA-like, although not included above the trusted cut-off for this model. RF-1 does not have a translational frameshift.; GO: 0016149 translation release factor activity codon specific, 0006415 translational termination, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=1123.96 Aligned_cols=353 Identities=55% Similarity=0.907 Sum_probs=344.8 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCCCHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHH Q ss_conf 689899999999999998538531579---99999999879999999999999999899999997203785356654332 Q gi|254780874|r 5 SHKQICDLKNRFAEIELRMSESPSVDA---YIKLTEEYAAISPIISKISIYDQKKQEEQDLHTVIGDHNSDSEIRDLAQI 81 (357) Q Consensus 5 ~~~kL~~l~~rl~ELE~~Lsdp~~w~d---~~klsKe~s~L~~iv~~~~~~~~~~~ei~el~eLl~e~e~D~El~elAee 81 (357) ..+||+.|..++++|+..|+||+...| +.+++|+++.|.+++.+|..|.+..+++.++++|+++..+|+||++||++ T Consensus 6 l~~kL~~~~~~~~eL~~~L~dp~v~~d~~k~~~~~ke~~~l~~i~~~~~~~~~~~~~~~~A~~~l~E~~~D~El~elak~ 85 (373) T TIGR00019 6 LLEKLESLEERYEELEELLSDPEVISDQDKLRKLSKEYSQLEEIVDKYEEYKKAQEDLKEAKELLEEKGDDKELRELAKE 85 (373) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 99999999999999998506863113679999998862065689999999999999999999999742588658999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCC Q ss_conf 22332000577888999864000113554037886226885217889999999999998764984001321135447721 Q gi|254780874|r 82 EVLAIEKEIRELENEINYLLLPKDTDDDKSCILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYK 161 (357) Q Consensus 82 El~~l~~~l~~le~eL~~~LLpsde~D~~nailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik 161 (357) |+..++.++++||..|+++|||+||+|.+||||||||||||+|||+||+||||||.||||.|||||+|++.+.++.|||| T Consensus 86 El~~l~~~~~~LE~~Lk~LLlPkDPND~kNv~lEiRaGtGGdEAalFagDLfrMY~rYAE~kgWkV~i~s~~~~~~GGyK 165 (373) T TIGR00019 86 ELEELEKEIEELEEQLKILLLPKDPNDEKNVILEIRAGTGGDEAALFAGDLFRMYSRYAESKGWKVEILSANETELGGYK 165 (373) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCE T ss_conf 99999988888999999726886976695348885079971479999987999988887437988999741157778703 Q ss_pred EEEEEE--EC-------CCHHHHHHCCCCCEEEEEECCCCCCCCCCEEEEEEEEEECCCCCC-CCCCCCEEEEEEECCCC Q ss_conf 113777--50-------311121000246502676125456785110168888530334332-22454347999852888 Q gi|254780874|r 162 EIVATI--SG-------RGVFSRMKFESGVHRVQRVPATEASGRVHTSAATVAVLPEAAEID-VDIPLEDIRIDTMRASG 231 (357) Q Consensus 162 ~v~~~I--~G-------~~aYg~Lk~E~GvHRvqRip~~es~gR~hTS~a~V~V~P~~~~~~-~~i~~~dl~i~~~rssG 231 (357) +|+|.| +| ++||+.||||||||||||||.|||+||+|||||||+|||++||++ |+|||+||||||||||| T Consensus 166 E~I~~ish~GasdGdav~gVys~LKFEsGvHRVQRVP~TES~GRiHTSaaTVAvlPE~eE~E~v~InP~DLrIDtfRsSG 245 (373) T TIGR00019 166 EVIAEISHKGASDGDAVDGVYSRLKFESGVHRVQRVPVTESQGRIHTSAATVAVLPEVEEVEEVKINPKDLRIDTFRSSG 245 (373) T ss_pred EEEEEEECCCCCCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCCEEECHHHEEECCCCCCCCCCCCCHHHCEEEEEECCC T ss_conf 25766530578886303315654001058730310023137692510011101057779889755576056484765278 Q ss_pred CCCCCCCCCCCEEEEEECCCCEEE-EECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEECC Q ss_conf 752000632121588964984488-8436541358999999999999999999999999874465224674238221044 Q gi|254780874|r 232 SGGQHVNTTDSAVRITHIPTGIMV-TSSEKSQHQNRLRAMKVLRARLYDVERKRMANERSANRKLQIGSGDRSERIRTYN 310 (357) Q Consensus 232 ~GGQ~vN~t~savri~H~ptgi~v-~~~eRsQ~~Nk~~A~~~L~~~l~~~~~~~~~~~~~~~r~~~~g~~~rs~~iRtY~ 310 (357) |||||||||||||||||+|||||| ||+|||||+||++||+||+||||+.+.+++.++.++.||+||||||||||||||| T Consensus 246 aGGQ~VNtTdSAVRITHLPTGIVV~CQdErSQ~kNK~kAmk~L~ArL~~~~~~~~~~~~a~~RksqVGSGDRSErIRTYN 325 (373) T TIGR00019 246 AGGQHVNTTDSAVRITHLPTGIVVSCQDERSQHKNKDKAMKILRARLYEKEQEKQQAEQASDRKSQVGSGDRSERIRTYN 325 (373) T ss_pred CCCCCCCCCCCCEEEEECCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHCCCCCCC T ss_conf 98888665200002002688417885478520567999999999999999999999998750466156654000355678 Q ss_pred CCCCCEEEEECCCEECCHHHHHCCCHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 89981124321644178778728866999999999999999874059 Q gi|254780874|r 311 FSQGRITDHRINLTLYKLEYVLQGYIDDIINPLLTAHQAKMIGSASE 357 (357) Q Consensus 311 ~~~~rv~DhR~~~~~~~~~~~l~G~l~~~i~~~~~~~~~~~l~~~~e 357 (357) ||||||||||||+|+|+|+.||+|+||++|+||....++++|+++.+ T Consensus 326 fPQNRvTDHRInlTly~L~~~l~G~LD~~i~aL~~~~q~~~L~~~~~ 372 (373) T TIGR00019 326 FPQNRVTDHRINLTLYKLDEVLEGDLDEIIEALIAEDQAQQLAELSE 372 (373) T ss_pred CCCCCCCHHHHCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHC T ss_conf 98885304344010430456630672157789999999999998612 No 2 >TIGR00020 prfB peptide chain release factor 2; InterPro: IPR004374 In many but not all taxa, there is a conserved real translational frameshift at a TGA codon. RF-2 helps terminate translation at TGA codons and can therefore regulate its own production by readthrough when RF-2 is insufficient. There is a superfamily IPR000352 from INTERPRO of RF-1, RF-2, mitochondrial, RF-H, etc proteins.; GO: 0016149 translation release factor activity codon specific, 0006415 translational termination, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=1055.17 Aligned_cols=332 Identities=39% Similarity=0.585 Sum_probs=302.0 Q ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH------HHHHH Q ss_conf 89999999999999853853157999999---9998799999999999999998999999972037853------56654 Q gi|254780874|r 8 QICDLKNRFAEIELRMSESPSVDAYIKLT---EEYAAISPIISKISIYDQKKQEEQDLHTVIGDHNSDS------EIRDL 78 (357) Q Consensus 8 kL~~l~~rl~ELE~~Lsdp~~w~d~~kls---Ke~s~L~~iv~~~~~~~~~~~ei~el~eLl~e~e~D~------El~el 78 (357) +.+.|+.|+++|++++++|+||+|+.++. ++.+.|+..++.++.+.+..+++.+|.+|+.+ +.|. +...+ T Consensus 26 d~e~~~~rle~le~e~~dP~fW~D~~rAq~v~~e~~~l~~~l~~~~~l~~~~~dl~~L~Ela~e-e~d~aaasGm~~~~e 104 (373) T TIGR00020 26 DPEKLKARLEELEKEMEDPNFWNDQERAQKVIKERSSLEEELDTLEELKNSLDDLSELLELAKE-EDDEAAASGMETFAE 104 (373) T ss_pred CHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHH T ss_conf 8668999999998775078666125899999999999998632799987553257889987437-603577789999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCC Q ss_conf 33222332000577888999864000113554037886226885217889999999999998764984001321135447 Q gi|254780874|r 79 AQIEVLAIEKEIRELENEINYLLLPKDTDDDKSCILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDG 158 (357) Q Consensus 79 AeeEl~~l~~~l~~le~eL~~~LLpsde~D~~nailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~a 158 (357) +.+|++.|+..+.++| +.++||||+|.+||||+||||||||||||||+||||||.||||+|||++|+||+++||+| T Consensus 105 l~~El~~Le~~~~~lE----~~~~LSgE~D~~nA~ltI~~GAGGTEa~DWa~ML~RMY~RWAE~rgykvE~lD~~~geeA 180 (373) T TIGR00020 105 LEEELKALEKELEELE----LRTLLSGEYDANNAILTIQSGAGGTEAQDWASMLYRMYLRWAERRGYKVEILDYSEGEEA 180 (373) T ss_pred HHHHHHHHHHHHHHHH----HHHHCCCCCCHHCCCEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCC T ss_conf 9999999999999999----999706757721063352387798516669999987535764105751356312678613 Q ss_pred CCCEEEEEEECCCHHHHHHCCCCCEEEEEECCCCCCCCCCEEEEEEEEEECC-CC-CCCCCCCCEEEEEEECCCCCCCCC Q ss_conf 7211137775031112100024650267612545678511016888853033-43-322245434799985288875200 Q gi|254780874|r 159 GYKEIVATISGRGVFSRMKFESGVHRVQRVPATEASGRVHTSAATVAVLPEA-AE-IDVDIPLEDIRIDTMRASGSGGQH 236 (357) Q Consensus 159 Gik~v~~~I~G~~aYg~Lk~E~GvHRvqRip~~es~gR~hTS~a~V~V~P~~-~~-~~~~i~~~dl~i~~~rssG~GGQ~ 236 (357) |||||||.|+|+|||||||+|.|||||||||||||++|||||||+|.|+|++ |+ ++|+|+|.|||||||||||||||| T Consensus 181 GIKSVT~~ikG~yAYG~Lk~E~GvHRLVRISPFDa~~rRHTSFasv~V~P~~Dd~tI~IeI~~~D~riD~YRaSGAGGQh 260 (373) T TIGR00020 181 GIKSVTILIKGEYAYGYLKSEQGVHRLVRISPFDANARRHTSFASVEVMPEVDDDTIDIEIKPEDVRIDTYRASGAGGQH 260 (373) T ss_pred CEEEEEEEEECCCCCCCCCCCCCEEEEEEECCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCEEEEEEECCCCCCCC T ss_conf 40158999852653330226874178775066668695332268888624035744511068875468666546898974 Q ss_pred CCCCCCEEEEEECCCCEEE-EECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHCCCCCCCCCEEECCCCC Q ss_conf 0632121588964984488-84365413589999999999999999999999998744--65224674238221044899 Q gi|254780874|r 237 VNTTDSAVRITHIPTGIMV-TSSEKSQHQNRLRAMKVLRARLYDVERKRMANERSANR--KLQIGSGDRSERIRTYNFSQ 313 (357) Q Consensus 237 vN~t~savri~H~ptgi~v-~~~eRsQ~~Nk~~A~~~L~~~l~~~~~~~~~~~~~~~r--~~~~g~~~rs~~iRtY~~~~ 313 (357) ||||||||||||+|||||| ||+||||||||..||++||||||+.+++|+.++..+.+ |+.|||| +|||+||||| T Consensus 261 VNkTdSAVRITHiPTGiVv~CQ~eRSQ~kNK~~Amk~LKaKLYe~~~~k~~~~~~~~~~ek~eIgWG---~QIRSYVl~P 337 (373) T TIGR00020 261 VNKTDSAVRITHIPTGIVVQCQNERSQHKNKDSAMKVLKAKLYELEMEKEQAEKDAKEGEKSEIGWG---NQIRSYVLHP 337 (373) T ss_pred CCCCCCCEEEEEECCCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC---CCCCEEEECC T ss_conf 2144653265443587368512730468889999999999999988899988887630246777864---5100004076 Q ss_pred CC-EEEEECCCEECCHHHHHCCCHHHHHHHHHHHH Q ss_conf 81-12432164417877872886699999999999 Q gi|254780874|r 314 GR-ITDHRINLTLYKLEYVLQGYIDDIINPLLTAH 347 (357) Q Consensus 314 ~r-v~DhR~~~~~~~~~~~l~G~l~~~i~~~~~~~ 347 (357) |. |||||||++++|+++||||+||.||+++|+|. T Consensus 338 Y~~vKD~Rt~~E~gNv~AVlDGdID~FiE~~Lkwk 372 (373) T TIGR00020 338 YQLVKDLRTNYETGNVEAVLDGDIDEFIEAYLKWK 372 (373) T ss_pred HHHHHHCCCCCCCCCCCEEECCCHHHHHHHHHHHC T ss_conf 13332201672357822575476547898887505 No 3 >PRK00591 prfA peptide chain release factor 1; Validated Probab=100.00 E-value=0 Score=1034.26 Aligned_cols=356 Identities=57% Similarity=0.928 Sum_probs=346.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCCCCHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 9678689899999999999998538531579---9999999987999999999999999989999999720378535665 Q gi|254780874|r 1 MVFLSHKQICDLKNRFAEIELRMSESPSVDA---YIKLTEEYAAISPIISKISIYDQKKQEEQDLHTVIGDHNSDSEIRD 77 (357) Q Consensus 1 m~mi~~~kL~~l~~rl~ELE~~Lsdp~~w~d---~~klsKe~s~L~~iv~~~~~~~~~~~ei~el~eLl~e~e~D~El~e 77 (357) |...+.+||+.++.|+.+|+.+|++|+||+| +++++|+++.|+++++.++.|.+...++.++.+|+.+ ++|++|.+ T Consensus 1 m~~~m~~kLe~l~~rl~eLe~~ls~p~fW~D~~~a~kl~kE~~~L~~~v~~~~~l~~~~~dl~el~eLl~e-e~D~el~e 79 (360) T PRK00591 1 MKPSMLDKLEALEERYEELEALLSDPEVISDQKRFRKLSKEYAELEPIVEAYREYKQAEEDLEDAKELLKE-ESDPEMRE 79 (360) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHCCCHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC-CCCHHHHH T ss_conf 96799999999999999999987298521299999999999999999999999999999989999999862-79978999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCC Q ss_conf 43322233200057788899986400011355403788622688521788999999999999876498400132113544 Q gi|254780874|r 78 LAQIEVLAIEKEIRELENEINYLLLPKDTDDDKSCILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDD 157 (357) Q Consensus 78 lAeeEl~~l~~~l~~le~eL~~~LLpsde~D~~nailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~ 157 (357) ++++|+..+..++++++.+|+.+|||+||+|.+|||||||||||||||||||+||||||+|||+++||+++++++++|++ T Consensus 80 ~a~~El~~l~~~l~~lE~eL~~lLlp~d~~D~~nailEIrAGAGGtEA~dwA~mL~RMY~RwAE~~g~k~evid~s~ge~ 159 (360) T PRK00591 80 MAKEELKELEERLEELEEELKILLLPKDPNDDKNVILEIRAGTGGDEAALFAGDLFRMYSRYAERQGWKVEILSASEGEL 159 (360) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCC T ss_conf 99999999999999999999998379997756785899965888748999999999999999986799715602577655 Q ss_pred CCCCEEEEEEECCCHHHHHHCCCCCEEEEEECCCCCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEEEECCCCCCCCCC Q ss_conf 77211137775031112100024650267612545678511016888853033433222454347999852888752000 Q gi|254780874|r 158 GGYKEIVATISGRGVFSRMKFESGVHRVQRVPATEASGRVHTSAATVAVLPEAAEIDVDIPLEDIRIDTMRASGSGGQHV 237 (357) Q Consensus 158 aGik~v~~~I~G~~aYg~Lk~E~GvHRvqRip~~es~gR~hTS~a~V~V~P~~~~~~~~i~~~dl~i~~~rssG~GGQ~v 237 (357) ||||+|||.|+|+|||||||+|+||||||||||||++||||||||||+|+|++++++|+|+|+||||||||||||||||| T Consensus 160 aGiKsat~~I~G~~AYg~Lk~EsGVHRvvRip~~es~gR~HTSfasV~V~P~~~~~~i~i~~~dl~idt~RssGaGGQhV 239 (360) T PRK00591 160 GGYKEIIASISGDGVYSKLKFESGVHRVQRVPATESQGRIHTSAATVAVLPEAEEVEVEINPKDLRIDTFRSSGAGGQHV 239 (360) T ss_pred CCHHEEEEEEECHHHHHHHHHCCCCCCEECCCCCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCEEEEEEECCCCCCCCC T ss_conf 65021688871706887763237754123069856678566400478970676431025685402588753579998876 Q ss_pred CCCCCEEEEEECCCCEEE-EECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEECCCCCCCE Q ss_conf 632121588964984488-8436541358999999999999999999999999874465224674238221044899811 Q gi|254780874|r 238 NTTDSAVRITHIPTGIMV-TSSEKSQHQNRLRAMKVLRARLYDVERKRMANERSANRKLQIGSGDRSERIRTYNFSQGRI 316 (357) Q Consensus 238 N~t~savri~H~ptgi~v-~~~eRsQ~~Nk~~A~~~L~~~l~~~~~~~~~~~~~~~r~~~~g~~~rs~~iRtY~~~~~rv 316 (357) |||||||||||+||||+| ||+|||||+||++||++||+|||+++.+++.++.+..||+|||||+||+|||||||||+|| T Consensus 240 Nkt~saVritH~ptgi~v~cq~ersQ~~Nk~~A~~~L~~~l~~~~~~~~~~~~~~~r~~~i~~g~rs~~iRtY~~p~~~v 319 (360) T PRK00591 240 NTTDSAVRITHLPTGIVVECQDERSQHKNKAKAMKVLRARLYDAERQKAQAEEAATRKSQVGSGDRSERIRTYNFPQGRV 319 (360) T ss_pred CCCCCEEEEEECCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC T ss_conf 76343268986167559985460227889999999999999999999999999998886513777000750771898865 Q ss_pred EEEECCCEECCHHHHHCCCHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 24321644178778728866999999999999999874059 Q gi|254780874|r 317 TDHRINLTLYKLEYVLQGYIDDIINPLLTAHQAKMIGSASE 357 (357) Q Consensus 317 ~DhR~~~~~~~~~~~l~G~l~~~i~~~~~~~~~~~l~~~~e 357 (357) ||||||++++|+++||||+||+||++|+.++++++|+++++ T Consensus 320 ~DhR~~~~~~~~~~vldG~ld~~I~a~~~~~~~~~l~~~~~ 360 (360) T PRK00591 320 TDHRINLTLYKLDEVMEGDLDELIDALIAEDQAEKLAALGE 360 (360) T ss_pred CCCCCCCEECCHHHHHCCCHHHHHHHHHHHHHHHHHHHCCC T ss_conf 00135986468577748876999999999999999986149 No 4 >COG0216 PrfA Protein chain release factor A [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=989.35 Aligned_cols=356 Identities=57% Similarity=0.930 Sum_probs=345.5 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCCC---CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 967868989999999999999853853---15799999999987999999999999999989999999720378535665 Q gi|254780874|r 1 MVFLSHKQICDLKNRFAEIELRMSESP---SVDAYIKLTEEYAAISPIISKISIYDQKKQEEQDLHTVIGDHNSDSEIRD 77 (357) Q Consensus 1 m~mi~~~kL~~l~~rl~ELE~~Lsdp~---~w~d~~klsKe~s~L~~iv~~~~~~~~~~~ei~el~eLl~e~e~D~El~e 77 (357) |.+.+..+|+.+..|+.+|+..|++|+ ..+.+.+++|++++|.+++..|..|.+...++.++++|+.++ .|+||++ T Consensus 1 ~~~~~~~kl~~~~~r~~el~~~L~~p~v~~d~~~~~~lske~a~l~~iv~~~~~~~~~~~~l~~a~~~l~~~-~D~em~e 79 (363) T COG0216 1 MKPSLLEKLESLLERYEELEALLSDPEVISDPDEYRKLSKEYAELEPIVEKYREYKKAQEDLEDAKEMLAEE-KDPEMRE 79 (363) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CCHHHHH T ss_conf 971589999999999999999853963336999999999999888899999999999999999999998356-8878999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCC Q ss_conf 43322233200057788899986400011355403788622688521788999999999999876498400132113544 Q gi|254780874|r 78 LAQIEVLAIEKEIRELENEINYLLLPKDTDDDKSCILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDD 157 (357) Q Consensus 78 lAeeEl~~l~~~l~~le~eL~~~LLpsde~D~~nailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~ 157 (357) ||++|+..++.+++.++.+|+.+|||+||+|++||||||||||||+||++||++|||||.|||+.+||+++|++.++++. T Consensus 80 ma~~Ei~~~~~~~~~le~~L~~lLlPkDpnd~knvilEIRagtGGdEAalFagDLfrMY~rYAe~kgWk~ei~s~se~~~ 159 (363) T COG0216 80 MAEEEIKELEAKIEELEEELKILLLPKDPNDDKNIILEIRAGTGGDEAALFAGDLFRMYSRYAESKGWKVEILSASESEL 159 (363) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCC T ss_conf 99999999999899999999986289998877776899706888159999999999999999975798799941586667 Q ss_pred CCCCEEEEEEECCCHHHHHHCCCCCEEEEEECCCCCCCCCCEEEEEEEEEECCCCC-CCCCCCCEEEEEEECCCCCCCCC Q ss_conf 77211137775031112100024650267612545678511016888853033433-22245434799985288875200 Q gi|254780874|r 158 GGYKEIVATISGRGVFSRMKFESGVHRVQRVPATEASGRVHTSAATVAVLPEAAEI-DVDIPLEDIRIDTMRASGSGGQH 236 (357) Q Consensus 158 aGik~v~~~I~G~~aYg~Lk~E~GvHRvqRip~~es~gR~hTS~a~V~V~P~~~~~-~~~i~~~dl~i~~~rssG~GGQ~ 236 (357) ||||++++.|+|.+||+.||||+|||||||||.|||+||+|||||||+|||+++++ +|+|||+|||||||||||||||| T Consensus 160 GG~kEii~~I~G~gvys~LKfEsGvHRVQRVP~TEsqGRIHTStaTVaVlPE~ee~~ei~I~~~DlrIDt~RsSGaGGQh 239 (363) T COG0216 160 GGYKEIIASISGKGVYSRLKFESGVHRVQRVPATESQGRIHTSAATVAVLPEVEEVEEIEINPKDLRIDTFRSSGAGGQH 239 (363) T ss_pred CCCEEEEEEEECCCHHHHHHHCCCCCCEECCCCCCCCCCEEECCEEEEECCCCCCCCCCCCCHHHCEEEEEECCCCCCCC T ss_conf 88048999984365455410315765221066655788314000168862787753346447788347655437898887 Q ss_pred CCCCCCEEEEEECCCCEEE-EECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEECCCCCCC Q ss_conf 0632121588964984488-843654135899999999999999999999999987446522467423822104489981 Q gi|254780874|r 237 VNTTDSAVRITHIPTGIMV-TSSEKSQHQNRLRAMKVLRARLYDVERKRMANERSANRKLQIGSGDRSERIRTYNFSQGR 315 (357) Q Consensus 237 vN~t~savri~H~ptgi~v-~~~eRsQ~~Nk~~A~~~L~~~l~~~~~~~~~~~~~~~r~~~~g~~~rs~~iRtY~~~~~r 315 (357) ||||||||||||+|||||| ||+|||||+||++||++|+||||+.+.+++.++.++.||+||||||||+|||||||||+| T Consensus 240 VNtTdSAVRiTHlPTGIvV~cQderSQ~kNk~kAmkvL~ARl~~~~~~~~~~~~~~~RksqVGSGDRSErIRTYNfPQnR 319 (363) T COG0216 240 VNTTDSAVRITHLPTGIVVECQDERSQHKNKAKAMKVLRARLYDAERQKAQAEEASERKSQVGSGDRSERIRTYNFPQNR 319 (363) T ss_pred CCCCCHHHEEEECCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCCCCCC T ss_conf 66513110143147736998446245665199999999999999999999999998888733777425421035788776 Q ss_pred EEEEECCCEECCHHHHHCC-CHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 1243216441787787288-66999999999999999874059 Q gi|254780874|r 316 ITDHRINLTLYKLEYVLQG-YIDDIINPLLTAHQAKMIGSASE 357 (357) Q Consensus 316 v~DhR~~~~~~~~~~~l~G-~l~~~i~~~~~~~~~~~l~~~~e 357 (357) |||||||+|+++|+.||+| +||++|+++..++++++|+++++ T Consensus 320 VTDHRI~lTl~kLd~vm~gG~LDeii~aLi~~~q~~~L~~l~~ 362 (363) T COG0216 320 VTDHRINLTLYKLDEVMEGGKLDEIIDALIAEDQAEQLAELGE 362 (363) T ss_pred CCCHHCCCCCCCHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 5101036610458888646857899999999999999987304 No 5 >PRK07245 consensus Probab=100.00 E-value=0 Score=941.16 Aligned_cols=330 Identities=31% Similarity=0.514 Sum_probs=298.8 Q ss_pred HHHHHHHHHHHHCCCCCHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH Q ss_conf 999999999985385315799---99999998799999999999999998999999972037853566543322233200 Q gi|254780874|r 12 LKNRFAEIELRMSESPSVDAY---IKLTEEYAAISPIISKISIYDQKKQEEQDLHTVIGDHNSDSEIRDLAQIEVLAIEK 88 (357) Q Consensus 12 l~~rl~ELE~~Lsdp~~w~d~---~klsKe~s~L~~iv~~~~~~~~~~~ei~el~eLl~e~e~D~El~elAeeEl~~l~~ 88 (357) |+.++.+|+.+|++|+||+|+ ++++|+++.|+..++.|+.|.+...+++.+.+++. +|+++.++++.++ . T Consensus 1 l~~~l~eLe~~~s~Pd~w~D~~ka~kl~ke~~~L~~~v~~~~~l~~~~~d~e~l~el~~---ede~~~~El~~~~----~ 73 (337) T PRK07245 1 MEEEIALLENHMTEPDFWNDNIAAQKTSQELNELKGKYDTFHNMQELSDETELLLEMLD---EDDSLKEELEENL----M 73 (337) T ss_pred CHHHHHHHHHHHCCCHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---CCHHHHHHHHHHH----H T ss_conf 95899999998629703309999999999999999999999999999999999998660---7998999999999----9 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEE Q ss_conf 05778889998640001135540378862268852178899999999999987649840013211354477211137775 Q gi|254780874|r 89 EIRELENEINYLLLPKDTDDDKSCILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIVATIS 168 (357) Q Consensus 89 ~l~~le~eL~~~LLpsde~D~~nailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~ 168 (357) +++++..++++.+||+||+|.+||||||||||||+||||||+||||||+|||+++||++++++.++|++||||+|+|.|+ T Consensus 74 ~l~~~~~~le~~lll~~~~D~~naileIrAGaGG~EA~dwA~mL~RMY~R~aE~~gwk~evl~~~~ge~~GiK~vt~~I~ 153 (337) T PRK07245 74 QLDKIMGAYEMTLLLSEPYDHNNAILEIHPGSGGTEAQDWGDLLLRMYTRFGNANGFKVEVLDYQAGDEAGIKSVTLSFE 153 (337) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCCCCEEEEEEE T ss_conf 99999999999866699876788379997898967999999999999999999769940121068865456005799996 Q ss_pred CCCHHHHHHCCCCCEEEEEECCCCCCCCCCEEEEEEEEEECCCC-CCCCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEE Q ss_conf 03111210002465026761254567851101688885303343-32224543479998528887520006321215889 Q gi|254780874|r 169 GRGVFSRMKFESGVHRVQRVPATEASGRVHTSAATVAVLPEAAE-IDVDIPLEDIRIDTMRASGSGGQHVNTTDSAVRIT 247 (357) Q Consensus 169 G~~aYg~Lk~E~GvHRvqRip~~es~gR~hTS~a~V~V~P~~~~-~~~~i~~~dl~i~~~rssG~GGQ~vN~t~savri~ 247 (357) |+|||||||+|+||||||||||||++||||||||+|+|+|+++| ++|+|+|+||||||||||||||||||||+|||||| T Consensus 154 G~~aYg~Lk~EsGVHRv~Risp~es~gRrhTS~asV~VlP~~dd~i~i~i~~~Dlridt~RssGaGGQ~VNkt~SAVRit 233 (337) T PRK07245 154 GPNAYGLLKSEMGVHRLVRISPFDSAKRRHTSFASVEVMPELDDTIEVEVRDDDIKMDTFRSGGAGGQNVNKVSTGVRLT 233 (337) T ss_pred CCCCCCCEEECCCCEEEEECCCCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCEEEEEEEE T ss_conf 47877703313752024767998888866656789999535576545678875638998757999988867652268999 Q ss_pred ECCCCEEE-EECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEECCCC-CCCEEEEECCCEE Q ss_conf 64984488-843654135899999999999999999999999987446522467423822104489-9811243216441 Q gi|254780874|r 248 HIPTGIMV-TSSEKSQHQNRLRAMKVLRARLYDVERKRMANERSANRKLQIGSGDRSERIRTYNFS-QGRITDHRINLTL 325 (357) Q Consensus 248 H~ptgi~v-~~~eRsQ~~Nk~~A~~~L~~~l~~~~~~~~~~~~~~~r~~~~g~~~rs~~iRtY~~~-~~rv~DhR~~~~~ 325 (357) |+||||+| ||+|||||+||++||++||||||+.+++++.++..+.|... +...||+|||||||+ ++||||||||+++ T Consensus 234 H~PTGi~v~cq~eRSQ~~Nk~~A~~~L~akL~~~~~~~~~~e~~~~~~~~-~~~~~g~~IRtY~~~p~~~V~DhRt~~~~ 312 (337) T PRK07245 234 HIPTGIVVSSTVDRTQYGNRDRAMKMLQAKLYQLEQEKKAQEVDALKGDK-KEITWGSQIRSYVFTPYTMVKDHRTNFEL 312 (337) T ss_pred EECCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-CCCCCCCCCCCEECCCCCCCCCCCCCCEE T ss_conf 72586699965743689999999999999999999999998999864152-12665576615167898837420669744 Q ss_pred CCHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 787787288669999999999999 Q gi|254780874|r 326 YKLEYVLQGYIDDIINPLLTAHQA 349 (357) Q Consensus 326 ~~~~~~l~G~l~~~i~~~~~~~~~ 349 (357) +|+++||||+||+||+||++|+.. T Consensus 313 ~~~~~vldG~ld~fi~a~l~~~~~ 336 (337) T PRK07245 313 AQVDKVMDGEINGFIDAYLKWRIE 336 (337) T ss_pred CCHHHHHCCCHHHHHHHHHHHHHC T ss_conf 784787587759999999975404 No 6 >PRK00578 prfB peptide chain release factor 2; Validated Probab=100.00 E-value=0 Score=928.12 Aligned_cols=338 Identities=36% Similarity=0.535 Sum_probs=318.8 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 9899999999999998538531579---9999999987999999999999999989999999720378535665433222 Q gi|254780874|r 7 KQICDLKNRFAEIELRMSESPSVDA---YIKLTEEYAAISPIISKISIYDQKKQEEQDLHTVIGDHNSDSEIRDLAQIEV 83 (357) Q Consensus 7 ~kL~~l~~rl~ELE~~Lsdp~~w~d---~~klsKe~s~L~~iv~~~~~~~~~~~ei~el~eLl~e~e~D~El~elAeeEl 83 (357) ..++.++.|+++|+.+|++|+||+| +++++|+++.|+++++.|+.|.+...++.++.+|+.+ ++|++|.++++.|+ T Consensus 23 ldle~l~~rl~ELe~~lsdP~~W~D~~~a~kl~kE~~~L~~~v~~~~~l~~~~~dl~~l~eL~~e-e~D~el~eea~~el 101 (367) T PRK00578 23 LDVDALKERLEELEAEAEDPDFWNDQERAQKVGKELSSLKAKLDTLENLRQRLDDLEELLELAEE-EDDEETLAEAEAEL 101 (367) T ss_pred CCHHHHHHHHHHHHHHHCCCHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC-CCCHHHHHHHHHHH T ss_conf 79999999999999986397054499999999999999999999999999999999999999864-99989999999999 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEE Q ss_conf 33200057788899986400011355403788622688521788999999999999876498400132113544772111 Q gi|254780874|r 84 LAIEKEIRELENEINYLLLPKDTDDDKSCILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKEI 163 (357) Q Consensus 84 ~~l~~~l~~le~eL~~~LLpsde~D~~nailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v 163 (357) ..+...++++| +.+||+||+|.+||||||||||||+||||||+||||||+|||+++||++++++.++|+.||||+| T Consensus 102 ~~l~~~l~~lE----l~~Llsd~~D~~naileI~aGaGG~EA~~~a~~L~rMY~r~ae~~g~k~~v~~~~~~~~~Gik~~ 177 (367) T PRK00578 102 KALEKKLAALE----LERLLSGEYDANNAILTIHAGAGGTEAQDWASMLLRMYLRWAERHGFKVEVLDYSEGEEAGIKSA 177 (367) T ss_pred HHHHHHHHHHH----HHHCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCCEE T ss_conf 99999999999----98604898867884799984899607889999999999999987588631410377655660058 Q ss_pred EEEEECCCHHHHHHCCCCCEEEEEECCCCCCCCCCEEEEEEEEEECCCC-CCCCCCCCEEEEEEECCCCCCCCCCCCCCC Q ss_conf 3777503111210002465026761254567851101688885303343-322245434799985288875200063212 Q gi|254780874|r 164 VATISGRGVFSRMKFESGVHRVQRVPATEASGRVHTSAATVAVLPEAAE-IDVDIPLEDIRIDTMRASGSGGQHVNTTDS 242 (357) Q Consensus 164 ~~~I~G~~aYg~Lk~E~GvHRvqRip~~es~gR~hTS~a~V~V~P~~~~-~~~~i~~~dl~i~~~rssG~GGQ~vN~t~s 242 (357) +|.|+|+||||+||+|+||||||||||||++||||||||+|+|+|++++ ++|+|+|+||+|||||||||||||||||+| T Consensus 178 ~~~i~G~~ayg~Lk~E~GvHRv~Ri~~~es~gr~hTS~a~V~V~P~~~~~~~i~i~~~dl~id~~RssGaGGQ~VNkt~s 257 (367) T PRK00578 178 TLKIKGPYAYGYLKSEKGVHRLVRISPFDSAGRRHTSFASVEVYPEVDDDIEIEINPKDLRIDTYRSSGAGGQHVNKTDS 257 (367) T ss_pred EEEEECCCHHHHHCCCCCCEEEEECCCCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCC T ss_conf 99983465343322257621356469857888657775699996267764357788213358986468998777575233 Q ss_pred EEEEEECCCCEEE-EECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEECCC-CCCCEEEEE Q ss_conf 1588964984488-84365413589999999999999999999999998744652246742382210448-998112432 Q gi|254780874|r 243 AVRITHIPTGIMV-TSSEKSQHQNRLRAMKVLRARLYDVERKRMANERSANRKLQIGSGDRSERIRTYNF-SQGRITDHR 320 (357) Q Consensus 243 avri~H~ptgi~v-~~~eRsQ~~Nk~~A~~~L~~~l~~~~~~~~~~~~~~~r~~~~g~~~rs~~iRtY~~-~~~rv~DhR 320 (357) ||||||+|||||| ||+|||||+||++||++||+|||+++.+++.++....|+++++. .|++||||||| ||+|||||| T Consensus 258 AVRitH~PTGivv~cq~eRSQ~~Nk~~A~~~L~akL~~~~~~~~~~~~~~~r~~~~~~-~~g~~IRtY~f~p~~rV~DhR 336 (367) T PRK00578 258 AVRITHIPTGIVVQCQNERSQHQNKASAMKMLKAKLYELELEKRAAEKDALRGEKKDI-GWGSQIRSYVLHPYQMVKDLR 336 (367) T ss_pred EEEEEEECCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-CCCCCCCCCCCCCCCCCCCCC T ss_conf 3799972586499955602189899999999999999999999999999976441357-653656442478987474005 Q ss_pred CCCEECCHHHHHCCCHHHHHHHHHHHHHHH Q ss_conf 164417877872886699999999999999 Q gi|254780874|r 321 INLTLYKLEYVLQGYIDDIINPLLTAHQAK 350 (357) Q Consensus 321 ~~~~~~~~~~~l~G~l~~~i~~~~~~~~~~ 350 (357) ||++++|+++||+|+||+||++|++|+..+ T Consensus 337 ~~~~~~~l~~vl~G~ld~~I~a~L~~~~~~ 366 (367) T PRK00578 337 TGYETGNTQAVLDGDLDGFIEAYLRWRLSS 366 (367) T ss_pred CCCEECCHHHHHCCCHHHHHHHHHHHHHHC T ss_conf 697357757874888599999999877447 No 7 >PRK05589 peptide chain release factor 2; Provisional Probab=100.00 E-value=0 Score=900.30 Aligned_cols=318 Identities=33% Similarity=0.514 Sum_probs=292.0 Q ss_pred HCCCCCHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 538531579---99999999879999999999999999899999997203785356654332223320005778889998 Q gi|254780874|r 23 MSESPSVDA---YIKLTEEYAAISPIISKISIYDQKKQEEQDLHTVIGDHNSDSEIRDLAQIEVLAIEKEIRELENEINY 99 (357) Q Consensus 23 Lsdp~~w~d---~~klsKe~s~L~~iv~~~~~~~~~~~ei~el~eLl~e~e~D~El~elAeeEl~~l~~~l~~le~eL~~ 99 (357) |++|+||+| +++++|+++.|+++++.|+.+.+...++.++.+|+.+ +|+++..++..++..+...+++ ++. T Consensus 1 m~dPd~W~D~~~a~~l~kE~~~l~~~v~~~~~l~~~~~dl~~l~el~ee--ed~~~~~e~~~e~~~l~~~l~~----le~ 74 (325) T PRK05589 1 MQEPNFWNDIKKAQEITSEEKYLKDKIDKYNSLRNRIEDIEVLKEMMEE--EDEESAKEIISEVKNIKEEIDN----FKI 74 (325) T ss_pred CCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC--CCHHHHHHHHHHHHHHHHHHHH----HHH T ss_conf 9893032099999999999999999999999999999999999998738--9999999999999999999999----999 Q ss_pred HHCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEECCCHHHHHHCC Q ss_conf 64000113554037886226885217889999999999998764984001321135447721113777503111210002 Q gi|254780874|r 100 LLLPKDTDDDKSCILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIVATISGRGVFSRMKFE 179 (357) Q Consensus 100 ~LLpsde~D~~nailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~G~~aYg~Lk~E 179 (357) .+||+||+|.+||||||||||||+||||||+||||||+|||+++||+++++++++|++||||+|++.|+|++|||+||+| T Consensus 75 ~~Ll~~~~D~~naileIrAGaGG~EA~dwA~~L~RMY~r~Ae~~gwk~evid~~~ge~gGiK~v~~~I~G~~ayg~Lk~E 154 (325) T PRK05589 75 ETLLSGEYDRNNAILTLHSGVGGTDAQDWTEMLLRMYTRWAEKKGYKVEIIDLLEGDEAGIKSVTLKITGEFAYGYLKAE 154 (325) T ss_pred HHCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCCCCCEEEEEECCHHHHHHHCC T ss_conf 84579988789858999658897799999999999999999875995010431765457810038998551256554215 Q ss_pred CCCEEEEEECCCCCCCCCCEEEEEEEEEECC-CCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEEECCCCEEE-EE Q ss_conf 4650267612545678511016888853033-433222454347999852888752000632121588964984488-84 Q gi|254780874|r 180 SGVHRVQRVPATEASGRVHTSAATVAVLPEA-AEIDVDIPLEDIRIDTMRASGSGGQHVNTTDSAVRITHIPTGIMV-TS 257 (357) Q Consensus 180 ~GvHRvqRip~~es~gR~hTS~a~V~V~P~~-~~~~~~i~~~dl~i~~~rssG~GGQ~vN~t~savri~H~ptgi~v-~~ 257 (357) +||||||||||||++||||||||+|+|+|++ ++++|+|+|+||+|||||||||||||||||+|||||||+||||+| || T Consensus 155 ~GvHRvqRip~~es~gR~hTS~a~V~VlP~~~~~~~i~i~~~dl~id~~rssG~GGQ~VNkt~sAVRitH~PTGi~v~cq 234 (325) T PRK05589 155 KGIHRLVRISPFNANGKRQTSFASVEVLPELTDDQDIEIRSEDLKIDTYRAGGAGGQHVNKTESAVRITHIPTGIVVQCQ 234 (325) T ss_pred CCCCEEEECCCCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCEEEEEEEECCCEEEEEC T ss_conf 77511464488777885487757889945766654477773564289887899987877761204899970586499965 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEECCC-CCCCEEEEECCCEECCHHHHHCCCH Q ss_conf 365413589999999999999999999999998744652246742382210448-9981124321644178778728866 Q gi|254780874|r 258 SEKSQHQNRLRAMKVLRARLYDVERKRMANERSANRKLQIGSGDRSERIRTYNF-SQGRITDHRINLTLYKLEYVLQGYI 336 (357) Q Consensus 258 ~eRsQ~~Nk~~A~~~L~~~l~~~~~~~~~~~~~~~r~~~~g~~~rs~~iRtY~~-~~~rv~DhR~~~~~~~~~~~l~G~l 336 (357) +|||||+||++||++||+|||+++.+++.++.... +.++++++||+||||||| |++||||||||++++|+++||+|+| T Consensus 235 ~eRSQ~~Nk~~A~~~L~arL~~~~~~~~~~~~~~~-~~~~~~~~~g~~iRtY~~~p~~~V~DhR~g~~~~~l~~vl~Gdl 313 (325) T PRK05589 235 NERSQHSNKDTAMKMLKSKLVELKERAHKEKIEDL-TGELKDMGWGSQIRSYVFHPYNLVKDHRTGVETSNVDSVMDGDI 313 (325) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCHHHHHCCCH T ss_conf 85648999999999999999999999999999998-60645677424631027789983741166970277477658773 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999999 Q gi|254780874|r 337 DDIINPLLTAH 347 (357) Q Consensus 337 ~~~i~~~~~~~ 347 (357) |+||+||++++ T Consensus 314 d~fI~a~l~~~ 324 (325) T PRK05589 314 DIFITQYLKGN 324 (325) T ss_pred HHHHHHHHHHC T ss_conf 99999999722 No 8 >PRK06746 peptide chain release factor 2; Provisional Probab=100.00 E-value=0 Score=885.69 Aligned_cols=319 Identities=36% Similarity=0.539 Sum_probs=299.9 Q ss_pred HCCCCCHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5385315799---9999999879999999999999999899999997203785356654332223320005778889998 Q gi|254780874|r 23 MSESPSVDAY---IKLTEEYAAISPIISKISIYDQKKQEEQDLHTVIGDHNSDSEIRDLAQIEVLAIEKEIRELENEINY 99 (357) Q Consensus 23 Lsdp~~w~d~---~klsKe~s~L~~iv~~~~~~~~~~~ei~el~eLl~e~e~D~El~elAeeEl~~l~~~l~~le~eL~~ 99 (357) |++|+||+|+ +++.|+++.|+++++.++.|.+..+++.++.+|+.+ +.|+++.++++.++..+...+++ +++ T Consensus 1 M~~p~fWdD~~~A~~i~kE~s~l~~~v~~~~~l~~~~~dl~~l~eL~~e-e~D~el~ee~~~el~~l~~~i~~----lE~ 75 (326) T PRK06746 1 MMGAGFWDDQQGAQAVINEANALKDMVGKFRQLDETFENLEITHELLKE-EYDEDLHEELESEVKGLIQEMNE----YEL 75 (326) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC-CCCHHHHHHHHHHHHHHHHHHHH----HHH T ss_conf 9897778699999999999999999999999999999979999999854-89989999999999999999999----999 Q ss_pred HHCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEECCCHHHHHHCC Q ss_conf 64000113554037886226885217889999999999998764984001321135447721113777503111210002 Q gi|254780874|r 100 LLLPKDTDDDKSCILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIVATISGRGVFSRMKFE 179 (357) Q Consensus 100 ~LLpsde~D~~nailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~G~~aYg~Lk~E 179 (357) .+||+||+|.+||||||||||||+||||||+||||||.|||+++||++++++.++|++||||+|+|.|+|++|||+||+| T Consensus 76 ~~lls~~~D~~naileI~aGaGG~EA~~~a~~L~rMY~r~ae~~g~~~~~~~~~~~~~~Gik~~~~~i~G~~ayg~Lk~E 155 (326) T PRK06746 76 QLLLSDPYDKNNAILELHPGAGGTESQDWGSMLLRMYTRWAEKRGFKVETVDYLPGDEAGIKSVTLLIKGHNAYGYLKAE 155 (326) T ss_pred HHCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCEEEEEEECCCCCCCCCCC T ss_conf 85449987678848999789897799999999999999999975985799720577778862467999604542010125 Q ss_pred CCCEEEEEECCCCCCCCCCEEEEEEEEEECC-CCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEEECCCCEEEE-E Q ss_conf 4650267612545678511016888853033-4332224543479998528887520006321215889649844888-4 Q gi|254780874|r 180 SGVHRVQRVPATEASGRVHTSAATVAVLPEA-AEIDVDIPLEDIRIDTMRASGSGGQHVNTTDSAVRITHIPTGIMVT-S 257 (357) Q Consensus 180 ~GvHRvqRip~~es~gR~hTS~a~V~V~P~~-~~~~~~i~~~dl~i~~~rssG~GGQ~vN~t~savri~H~ptgi~v~-~ 257 (357) +||||||||||||++||||||||+|+|+|++ ++++++|+|+||+|||||||||||||||||+|||||||+||||+|+ | T Consensus 156 ~GvHRv~Ris~~~s~gr~hTS~a~V~V~P~~~~~~~i~i~~~dl~id~~rssG~GGQ~VN~t~SAVRitH~PTGi~v~cq 235 (326) T PRK06746 156 KGVHRLVRISPFDSSGRRHTSFVSCEVVPEFNDEVEIEVRTEDLKIDTYRASGAGGQHVNTTDSAVRITHTPTNTVVTCQ 235 (326) T ss_pred CCCEEEEECCCCCCCCCEEEEEEEEEEECCCCCCCCCCCCHHHHCEEECCCCCCCCCCCCCCCCEEEEEEECCCEEEEEC T ss_conf 76213464588788886687778999972667543356886771432011799987876650304999981576699956 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEECCC-CCCCEEEEECCCEECCHHHHHCCCH Q ss_conf 365413589999999999999999999999998744652246742382210448-9981124321644178778728866 Q gi|254780874|r 258 SEKSQHQNRLRAMKVLRARLYDVERKRMANERSANRKLQIGSGDRSERIRTYNF-SQGRITDHRINLTLYKLEYVLQGYI 336 (357) Q Consensus 258 ~eRsQ~~Nk~~A~~~L~~~l~~~~~~~~~~~~~~~r~~~~g~~~rs~~iRtY~~-~~~rv~DhR~~~~~~~~~~~l~G~l 336 (357) +|||||+||++||++||+|||+.+++++.++..+.|+.+++. .|++||||||| |++||||||||++++|+++||+|+| T Consensus 236 ~eRSQ~~Nk~~Al~~L~arL~~~~~e~~~~~~~~~r~~~~~~-~~g~~IRtY~~~p~~~V~DhR~~~~~~~~~~vl~G~l 314 (326) T PRK06746 236 SERSQIKNREHAMKMLKAKLYQKKLEEQQAELDEIRGEQKEI-GWGSQIRSYVFHPYSLVKDHRTNTEVGNVQAVMDGEI 314 (326) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCEECCHHHHHCCCH T ss_conf 855779999999999999999999999999999987453578-5525631526789883662055975678478658873 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999999 Q gi|254780874|r 337 DDIINPLLTAH 347 (357) Q Consensus 337 ~~~i~~~~~~~ 347 (357) |+||++|++|+ T Consensus 315 d~fI~a~l~~r 325 (326) T PRK06746 315 DPFIDAYLRSR 325 (326) T ss_pred HHHHHHHHHCC T ss_conf 99999999755 No 9 >PRK07342 peptide chain release factor 2; Provisional Probab=100.00 E-value=0 Score=883.04 Aligned_cols=309 Identities=35% Similarity=0.531 Sum_probs=288.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEE Q ss_conf 99999879999999999999999899999997203785356654332223320005778889998640001135540378 Q gi|254780874|r 35 LTEEYAAISPIISKISIYDQKKQEEQDLHTVIGDHNSDSEIRDLAQIEVLAIEKEIRELENEINYLLLPKDTDDDKSCIL 114 (357) Q Consensus 35 lsKe~s~L~~iv~~~~~~~~~~~ei~el~eLl~e~e~D~El~elAeeEl~~l~~~l~~le~eL~~~LLpsde~D~~nail 114 (357) +.||++.|+++++.|+.|.+...++.++.+|+.+ ++|+++.++++.++..+...+++++ +.+||+||+|.+|||| T Consensus 1 l~kE~~~L~~~v~~~~~l~~~l~d~~el~el~~e-e~D~e~~~e~~~~l~~l~~~l~~le----l~~Ll~~~~D~~~ail 75 (322) T PRK07342 1 LMRERQQLEDSINGINHLEQTLNDNIELIAMGEE-EGDKSIVEDAEKTIRDLKDEIDRRQ----IDALLSGEADANDTYL 75 (322) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC-CCCHHHHHHHHHHHHHHHHHHHHHH----HHHCCCCCCCCCCEEE T ss_conf 9431899999999999999998759999999756-9998999999999999999999999----9854699888888589 Q ss_pred EEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEECCCHHHHHHCCCCCEEEEEECCCCCC Q ss_conf 86226885217889999999999998764984001321135447721113777503111210002465026761254567 Q gi|254780874|r 115 EIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIVATISGRGVFSRMKFESGVHRVQRVPATEAS 194 (357) Q Consensus 115 EI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~G~~aYg~Lk~E~GvHRvqRip~~es~ 194 (357) ||||||||+||||||+||||||+|||+++||++++++.++|++||||+|+|.|+|++|||+||+|+||||||||||||++ T Consensus 76 eIraGaGG~EA~~fA~~L~rMY~r~ae~~g~k~evid~~~g~~~Gik~~~~~i~G~~ayg~Lk~E~GvHRv~Ri~~~es~ 155 (322) T PRK07342 76 EVHAGAGGTESQDWASMLLRMYTRWAERQGRKVEVLEVHDGEEAGIKSATILVKGHNAYGWLKTESGVHRLVRISPYDSN 155 (322) T ss_pred EEECCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCEEEEEEECCCHHHHHHHCCCCCEEEECCCCCCC T ss_conf 99688895799999999999999999877987899631676567710599999647677764204685305865985888 Q ss_pred CCCCEEEEEEEEEECCCC-CCCCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEEECCCCEEE-EECCCCCHHHHHHHHHH Q ss_conf 851101688885303343-3222454347999852888752000632121588964984488-84365413589999999 Q gi|254780874|r 195 GRVHTSAATVAVLPEAAE-IDVDIPLEDIRIDTMRASGSGGQHVNTTDSAVRITHIPTGIMV-TSSEKSQHQNRLRAMKV 272 (357) Q Consensus 195 gR~hTS~a~V~V~P~~~~-~~~~i~~~dl~i~~~rssG~GGQ~vN~t~savri~H~ptgi~v-~~~eRsQ~~Nk~~A~~~ 272 (357) ||||||||+|+|+|++++ ++|+|+|+||+|||||||||||||||||+|||||||+||||+| ||+|||||+||++||++ T Consensus 156 gr~hTS~asV~VlP~~~d~~~i~i~~~Dlridt~RssGaGGQ~VNkt~SAVRitH~PTGivV~cq~eRSQ~~Nk~~Am~~ 235 (322) T PRK07342 156 ARRHTSFASIWVYPVIDDNIEVDVNESDVRIDTYRSSGAGGQHVNTTDSAVRITHIPTGIVVQCQQERSQHKNRAKAWSM 235 (322) T ss_pred CCEEEEEEEEEEEECCCCCCCCCCCHHHCCEEEEECCCCCCCCCCCCCCEEEEEEECCCEEEEECCCCCHHHHHHHHHHH T ss_conf 86687667999982446643356983361278756799998777762424899970686699966854389999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEECCCCC-CCEEEEECCCEECCHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 99999999999999999874465224674238221044899-811243216441787787288669999999999999 Q gi|254780874|r 273 LRARLYDVERKRMANERSANRKLQIGSGDRSERIRTYNFSQ-GRITDHRINLTLYKLEYVLQGYIDDIINPLLTAHQA 349 (357) Q Consensus 273 L~~~l~~~~~~~~~~~~~~~r~~~~g~~~rs~~iRtY~~~~-~rv~DhR~~~~~~~~~~~l~G~l~~~i~~~~~~~~~ 349 (357) ||||||+.+++++.++....+..+... .||+|||||||+| .||||||||++++|+++||+|+||+||+||+.|+.. T Consensus 236 LrarL~~~e~~k~~~~~~~~~~~~~~~-~~g~qIRtY~f~P~~~V~DhRt~~~~~~~~~vldG~ld~fI~a~L~~r~~ 312 (322) T PRK07342 236 LRARLYEEELKKREEATNAAEASKTDI-GWGHQIRSYVLQPYQLVKDLRTGVESTNPQDVLDGDLDEFMEAALAHRIS 312 (322) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCEECCHHHHHCCCHHHHHHHHHHHHHH T ss_conf 999999999999999999987431036-53155226154798867603669787784786588739999999998862 No 10 >PRK08787 peptide chain release factor 2; Provisional Probab=100.00 E-value=0 Score=858.90 Aligned_cols=308 Identities=33% Similarity=0.480 Sum_probs=280.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCC Q ss_conf 99999999999999998999999972037853566543322233200057788899986400011355403788622688 Q gi|254780874|r 42 ISPIISKISIYDQKKQEEQDLHTVIGDHNSDSEIRDLAQIEVLAIEKEIRELENEINYLLLPKDTDDDKSCILEIRAGTG 121 (357) Q Consensus 42 L~~iv~~~~~~~~~~~ei~el~eLl~e~e~D~El~elAeeEl~~l~~~l~~le~eL~~~LLpsde~D~~nailEI~aGaG 121 (357) |+..+..++.+.+...++.++.+|+.+ +.|+++.+++..++..+...+++++ +.+||+||+|.+||||||||||| T Consensus 2 Lek~v~~~~~l~~~l~D~~el~ela~~-E~d~~~~~e~~~~l~~l~~~~~~le----~~~ll~~~~D~~~aileI~aGaG 76 (313) T PRK08787 2 LEKTVIGIADVLSGLADAGELLDLAES-EQDEDTALAVIADLDKYQAHVEKLE----FQRMFSGQMDGANAFVDIQAGAG 76 (313) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-CCCHHHHHHHHHHHHHHHHHHHHHH----HHHCCCCCCCCCCEEEEEECCCC T ss_conf 588999999999762669999999885-8998999999999999999999999----98635997756784899916899 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEECCCHHHHHHCCCCCEEEEEECCCCCCCCCCEEE Q ss_conf 52178899999999999987649840013211354477211137775031112100024650267612545678511016 Q gi|254780874|r 122 GSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIVATISGRGVFSRMKFESGVHRVQRVPATEASGRVHTSA 201 (357) Q Consensus 122 G~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~G~~aYg~Lk~E~GvHRvqRip~~es~gR~hTS~ 201 (357) |+||||||+||||||+|||+++||++++++.++|++||||+|++.|+|+|||||||+|+||||||||||||++||||||| T Consensus 77 G~EA~dwA~~L~rMY~r~ae~~g~k~ev~~~~~~~~~Giks~t~~i~G~~ayg~Lk~E~GvHRl~Ris~~~s~gr~hTS~ 156 (313) T PRK08787 77 GTEAQDWAEILLRMYLRWAESRGWKTELMEVSGGEVAGIKSATVRIEGEYAYGWLKTEIGVHRLVRKSPFDSDNRRHTSF 156 (313) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCEEEEEEEECCCCCCCCCCCCCCEEEEECCCCCCCCEEEEEE T ss_conf 58999999999999999998769837998447766578035689994477566321458732048469957787278642 Q ss_pred EEEEEEECCCC-CCCCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEEECCCCEEE-EECCCCCHHHHHHHHHHHHHHHHH Q ss_conf 88885303343-3222454347999852888752000632121588964984488-843654135899999999999999 Q gi|254780874|r 202 ATVAVLPEAAE-IDVDIPLEDIRIDTMRASGSGGQHVNTTDSAVRITHIPTGIMV-TSSEKSQHQNRLRAMKVLRARLYD 279 (357) Q Consensus 202 a~V~V~P~~~~-~~~~i~~~dl~i~~~rssG~GGQ~vN~t~savri~H~ptgi~v-~~~eRsQ~~Nk~~A~~~L~~~l~~ 279 (357) |+|+|+|++++ ++++|+|+||+|||||||||||||||||+|||||||+||||+| ||+|||||+||++||++||+|||+ T Consensus 157 asV~V~P~~~~~~~i~i~~~Dlrid~~RssGaGGQ~VNkt~SAVRitH~PTGivV~cq~eRSQ~~Nk~~A~~~LkarL~~ 236 (313) T PRK08787 157 TSVFVSPEVDDNIEIDINPADLRTDVYRSSGAGGQHVNKTESAVRITHIPTNTVVACQTGRSQHQNRDNAMKMLAAKLYE 236 (313) T ss_pred EEEEEECCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCEEEEEEEEEECCCEEEEECCCCCHHHHHHHHHHHHHHHHHH T ss_conf 37999566465445757756658998757999989878860058999842877999668553899999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCEEECCCCCCCEEEEECCCEECCHHHHHCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999874465224674238221044899811243216441787787288669999999999999998740 Q gi|254780874|r 280 VERKRMANERSANRKLQIGSGDRSERIRTYNFSQGRITDHRINLTLYKLEYVLQGYIDDIINPLLTAHQAKMIGSA 355 (357) Q Consensus 280 ~~~~~~~~~~~~~r~~~~g~~~rs~~iRtY~~~~~rv~DhR~~~~~~~~~~~l~G~l~~~i~~~~~~~~~~~l~~~ 355 (357) ++++++.++....|..+. ...|++||||||||||||||||||++++|+++||+|+||+||+|||+|..+...+++ T Consensus 237 ~e~~k~~~e~~~~~~~~~-~~~~g~qIRtY~~~~~rVkDhRtg~~~~~~~~vldG~ld~fI~a~L~~~~~~~~~~~ 311 (313) T PRK08787 237 LEVQKRNAEKDALEATKS-DIGWGSQIRNYVLDQSRIKDLRTGIERSDTQKVLDGDLDEFVEASLKAGLAAGSKRL 311 (313) T ss_pred HHHHHHHHHHHHHHHHHC-CCCCCCCCCCCCCCCCCCCCCCCCCEECCHHHHHCCCHHHHHHHHHHHHHHHHHHHC T ss_conf 999999999999875022-476426735731799835520669776784676487739999999986266556421 No 11 >KOG2726 consensus Probab=100.00 E-value=0 Score=773.40 Aligned_cols=332 Identities=45% Similarity=0.696 Sum_probs=309.7 Q ss_pred HHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99998538531579999999998799999999999999998999999972037853566543322233200057788899 Q gi|254780874|r 18 EIELRMSESPSVDAYIKLTEEYAAISPIISKISIYDQKKQEEQDLHTVIGDHNSDSEIRDLAQIEVLAIEKEIRELENEI 97 (357) Q Consensus 18 ELE~~Lsdp~~w~d~~klsKe~s~L~~iv~~~~~~~~~~~ei~el~eLl~e~e~D~El~elAeeEl~~l~~~l~~le~eL 97 (357) ..+..+++.++|+++.++...+..|.. ...+..++.+++.|..+ .+|++|.++|.+|+..+..+++.+..+| T Consensus 52 ~~~~~~~~~~~~~~~~~l~~~~~~l~~-------~~~~~~~~~~lk~l~~~-~e~e~~~~~a~~E~~~~~~~i~~~~~~l 123 (386) T KOG2726 52 VQEAASNDSDLWDDPAELDEVLNALSD-------RMKLVRELKSLKSLIKE-GEDEDMDELAEEEAEEISKEIERSLHEL 123 (386) T ss_pred HHHHHHCHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHH-CCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 578762402455434889999999999-------99999976509999862-0428899999999999999999999999 Q ss_pred HHHHCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEECCCHHHHHH Q ss_conf 98640001135540378862268852178899999999999987649840013211354477211137775031112100 Q gi|254780874|r 98 NYLLLPKDTDDDKSCILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIVATISGRGVFSRMK 177 (357) Q Consensus 98 ~~~LLpsde~D~~nailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~G~~aYg~Lk 177 (357) +..|||++|+|.++|++||+|||||+||++||.||++||.+||++++|++++++..+++.||||+|++.|+|.+|||||+ T Consensus 124 ~~~lLp~~~~D~~~~iiev~aGaGG~Ea~ift~el~~MY~~~a~~~~w~~~~l~~~~~~~~Gi~~At~~i~G~~ayg~l~ 203 (386) T KOG2726 124 ELSLLPSDPYDAEACIIEVRAGAGGQEAQIFTMELVDMYQKYAERLGWKARVLEKAPGESGGIKSATLEIEGESAYGYLK 203 (386) T ss_pred HHHHCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEHHCCCCCCCCCEEEEEEECCCCHHHEEE T ss_conf 99726877465237589982788857999999999999999998606640451117753345001366763500122154 Q ss_pred CCCCCEEEEEECCCCCCCCCCEEEEEEEEEECC--CCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEEECCCCEEE Q ss_conf 024650267612545678511016888853033--433222454347999852888752000632121588964984488 Q gi|254780874|r 178 FESGVHRVQRVPATEASGRVHTSAATVAVLPEA--AEIDVDIPLEDIRIDTMRASGSGGQHVNTTDSAVRITHIPTGIMV 255 (357) Q Consensus 178 ~E~GvHRvqRip~~es~gR~hTS~a~V~V~P~~--~~~~~~i~~~dl~i~~~rssG~GGQ~vN~t~savri~H~ptgi~v 255 (357) ||+|||||||+|+|+++||+|||||+|+|+|++ +++++.|+++||+|+|||||||||||||||||||||+|+||||+| T Consensus 204 ~E~GvHRv~r~p~~e~~gr~htstasV~ViP~~~~~~~~~~~~~~dl~i~~~R~~G~GGQhvNktdsaVrl~HiPTGIvv 283 (386) T KOG2726 204 FEAGVHRVQRVPSTETSGRRHTSTASVAVIPQPGRDEVDVEIDEKDLRIETFRASGPGGQHVNKTDSAVRLTHIPTGIVV 283 (386) T ss_pred CCCCCCCEEECCCCCCCCCCCCCCCEEEEECCCCCCCCCEECCCHHEEEEECCCCCCCCCCCCCCCCCEEEEEECCCEEE T ss_conf 15764444666876655651136543798546788765611472033677323678875655433440589860674699 Q ss_pred -EECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEECCCCCCCEEEEECCCEECCHHHHHCC Q ss_conf -8436541358999999999999999999999999874465224674238221044899811243216441787787288 Q gi|254780874|r 256 -TSSEKSQHQNRLRAMKVLRARLYDVERKRMANERSANRKLQIGSGDRSERIRTYNFSQGRITDHRINLTLYKLEYVLQG 334 (357) Q Consensus 256 -~~~eRsQ~~Nk~~A~~~L~~~l~~~~~~~~~~~~~~~r~~~~g~~~rs~~iRtY~~~~~rv~DhR~~~~~~~~~~~l~G 334 (357) ||+|||||+||++||++|++||+++...+...+....|+.|+||++||+|||||||+|+||||||++++.+++.+||+| T Consensus 284 ~cq~eRSq~~Nr~~A~~~L~akL~~~~~~~~~~~~~~~r~~qv~s~~rsekiRTy~~~q~rv~D~r~~~~~~d~~~~l~G 363 (386) T KOG2726 284 ECQEERSQHKNRALALKRLRAKLAVIYREEKSEEEKKKRKAQVGSLKRSEKIRTYNFKQDRVTDHRIGLESHDLESFLDG 363 (386) T ss_pred EEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHCEEECCCCCCCHHHHHHCCCCCCHHHHHHC T ss_conf 96137768865999999999999999874121776520087625547011210022576502333103320255888742 Q ss_pred CHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 66999999999999999874059 Q gi|254780874|r 335 YIDDIINPLLTAHQAKMIGSASE 357 (357) Q Consensus 335 ~l~~~i~~~~~~~~~~~l~~~~e 357 (357) +||+||++++.++..+.++++.+ T Consensus 364 ~Ld~li~~~~~~~~~~~~~e~~~ 386 (386) T KOG2726 364 NLDELIEALLSLRREEDLAELLE 386 (386) T ss_pred CHHHHHHHHHHHHHHHHHHHHHC T ss_conf 48999999998766777777419 No 12 >COG1186 PrfB Protein chain release factor B [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=623.86 Aligned_cols=233 Identities=43% Similarity=0.682 Sum_probs=222.3 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEECCCHHHHHHCCCCCEEEEEECCC Q ss_conf 37886226885217889999999999998764984001321135447721113777503111210002465026761254 Q gi|254780874|r 112 CILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIVATISGRGVFSRMKFESGVHRVQRVPAT 191 (357) Q Consensus 112 ailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~G~~aYg~Lk~E~GvHRvqRip~~ 191 (357) |+|+|+||||||||||||.||||||+|||+++||++|++++++||++||||++|.|+|++||||||.|.||||+||+||| T Consensus 1 ~~l~i~~g~gg~e~~dw~~~l~rmy~r~a~~~g~~~e~l~~~~g~~~g~ks~~~~~~g~~a~g~~~~e~g~hrlvr~Spf 80 (239) T COG1186 1 AYLTIHAGAGGTEAQDWASMLLRMYTRWAERKGFKVEVLDTSDGEEAGIKSATLKIKGENAYGYLKTETGVHRLVRISPF 80 (239) T ss_pred CEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCEEEEEEEECHHHHHHHHHHCCEEEEEEECCC T ss_conf 97997279885588999999999999999875985899860477304510379999646888888763254677850687 Q ss_pred CCCCCCCEEEEEEEEEECCC-CCCCCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEEECCCCEEEEE-CCCCCHHHHHHH Q ss_conf 56785110168888530334-3322245434799985288875200063212158896498448884-365413589999 Q gi|254780874|r 192 EASGRVHTSAATVAVLPEAA-EIDVDIPLEDIRIDTMRASGSGGQHVNTTDSAVRITHIPTGIMVTS-SEKSQHQNRLRA 269 (357) Q Consensus 192 es~gR~hTS~a~V~V~P~~~-~~~~~i~~~dl~i~~~rssG~GGQ~vN~t~savri~H~ptgi~v~~-~eRsQ~~Nk~~A 269 (357) ++++||||||++|.|+|+++ .++++|+++||+|||||||||||||||||+|||||||+||||+|+| ++||||+||+.| T Consensus 81 ~~~~~R~tsf~~v~v~p~~~~~i~i~I~~~dl~idt~RASGaGGQhVNKt~SAVrlth~ptgivv~cq~eRSq~~n~~~a 160 (239) T COG1186 81 DSNGRRHTSFASVEVFPELDISIEIEIPDDDLRIDTYRASGAGGQHVNKTDSAVRLTHLPTGIVVLCQNERSQHLNKALA 160 (239) T ss_pred CCCCCCCCCEEEEEECCCCCCCCCEECCCCCEEEEEEECCCCCCCCCCCCCCCEEEEECCCCCEECCHHHHHHHHHHHHH T ss_conf 75664155333310047777653100485216999887579998865552442899986777775188899999879999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--HHHCCCCCCCCCEEECCCCCCC-EEEEECCCEECCHHHHHCCCHHHHHHHHHHH Q ss_conf 999999999999999999998744--6522467423822104489981-1243216441787787288669999999999 Q gi|254780874|r 270 MKVLRARLYDVERKRMANERSANR--KLQIGSGDRSERIRTYNFSQGR-ITDHRINLTLYKLEYVLQGYIDDIINPLLTA 346 (357) Q Consensus 270 ~~~L~~~l~~~~~~~~~~~~~~~r--~~~~g~~~rs~~iRtY~~~~~r-v~DhR~~~~~~~~~~~l~G~l~~~i~~~~~~ 346 (357) |.+|+++||..+.+++.++....+ +..++|| .|||+|+|+|++ |+|||++++..|.+.||||+++.||++++.+ T Consensus 161 ~~~l~~kL~~~~~~~Rsqe~n~~~a~~k~i~wg---~qirsyv~~p~~~vKd~Rt~~E~~~~~~v~dg~~~~~~~~~l~~ 237 (239) T COG1186 161 RKMLKGKLYILAQEKRSQEKNRERALKKLIGWG---NQIRSYVLDPYQPTKDLRTGVERRNKSKVLDGDKDGFIKAYLKW 237 (239) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHCCCCCCCCCCCCCCCHHEECCHHHHHHHHHHHHHHHHHHC T ss_conf 998777999998898999998999876677887---73212678864346653210012357876432477887755403 Q ss_pred H Q ss_conf 9 Q gi|254780874|r 347 H 347 (357) Q Consensus 347 ~ 347 (357) . T Consensus 238 ~ 238 (239) T COG1186 238 D 238 (239) T ss_pred C T ss_conf 5 No 13 >TIGR03072 release_prfH putative peptide chain release factor H. Members of this protein family are bacterial proteins homologous to peptide chain release factors 1 (RF-1, product of the prfA gene), and 2 (RF-2, product of the prfB gene). The member from Escherichia coli K-12, designated prfH, appears to be a pseudogene. This class I release factor is always found as the downstream gene of a two-gene operon. Probab=100.00 E-value=0 Score=568.62 Aligned_cols=196 Identities=29% Similarity=0.386 Sum_probs=184.3 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCC-CCCCEEEEEEECCCHHHHHHCCCCCEEEEEECC Q ss_conf 3788622688521788999999999999876498400132113544-772111377750311121000246502676125 Q gi|254780874|r 112 CILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDD-GGYKEIVATISGRGVFSRMKFESGVHRVQRVPA 190 (357) Q Consensus 112 ailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~-aGik~v~~~I~G~~aYg~Lk~E~GvHRvqRip~ 190 (357) +||||+||+||+||++||+||||||.|||+++||++++++.++|+. ||||+|++.|+|++||++|++|.|+||+||.|| T Consensus 1 vileIraG~GG~Ea~lfa~~L~rMY~r~a~~~gwk~evi~~~~~~~~Gg~ksv~~~i~G~~~~~~~~~~~G~~~~v~~sp 80 (200) T TIGR03072 1 ILLQLSSAQGPAECCLAVAKALERLTREAAARGVRVEVLEQEPGEVPGTLRSALVSLDGEAAAALADRWEGTLLWICPSP 80 (200) T ss_pred CEEEEECCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCEEEEEEEEEEECCEEEEEEECCCEEEEEECC T ss_conf 98999389986999999999999999999985998999857777779950899999995043078887756179996679 Q ss_pred CCCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEEECCCCEEE-EECCCCCHHHHHHH Q ss_conf 45678511016888853033433222454347999852888752000632121588964984488-84365413589999 Q gi|254780874|r 191 TEASGRVHTSAATVAVLPEAAEIDVDIPLEDIRIDTMRASGSGGQHVNTTDSAVRITHIPTGIMV-TSSEKSQHQNRLRA 269 (357) Q Consensus 191 ~es~gR~hTS~a~V~V~P~~~~~~~~i~~~dl~i~~~rssG~GGQ~vN~t~savri~H~ptgi~v-~~~eRsQ~~Nk~~A 269 (357) |...++|||||++|.++|+.. +|+|+||+|||||||||||||||||||||||||+||||+| ||+|||||+||++| T Consensus 81 ~~~~~~r~t~~v~v~~~~~~~----ei~~~Dlridt~RssGaGGQhVNkT~SAVRitH~PTGi~V~~q~eRSQ~~Nk~~A 156 (200) T TIGR03072 81 YRPHHRRKNWFIGVQRFSASE----EATEDEIRFETLRSSGPGGQHVNKTESAVRATHLASGISVKVQSERSQHANKRLA 156 (200) T ss_pred CCCCCCEEEEEEEEEECCCCC----CCCCHHEEEEEEECCCCCCCCCCCCCCEEEEEECCCCEEEEECCCCCHHHHHHHH T ss_conf 767786799999977516557----8881234899987368998850775634999974884799975855479999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEECCCC Q ss_conf 9999999999999999999987446522467423822104489 Q gi|254780874|r 270 MKVLRARLYDVERKRMANERSANRKLQIGSGDRSERIRTYNFS 312 (357) Q Consensus 270 ~~~L~~~l~~~~~~~~~~~~~~~r~~~~g~~~rs~~iRtY~~~ 312 (357) |++|++|||+.+.+++.++ .+.|++|+|+++||+||||||-+ T Consensus 157 l~~L~arL~~~~~~~~~~~-~~~r~~~~~~~~Rs~kIRTYng~ 198 (200) T TIGR03072 157 TLLLAVRLADLQQEQAAAL-RAERRTAHHQIERGNPVRVFKGE 198 (200) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHCCCCCCCCCCCEEECCC T ss_conf 9999999999999999999-99998667987256872512798 No 14 >PRK08179 prfH peptide chain release factor-like protein; Reviewed Probab=100.00 E-value=0 Score=551.85 Aligned_cols=199 Identities=25% Similarity=0.320 Sum_probs=184.6 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCC-CCEEEEEEECCCHHHHHHCCCCCEEEEEECC Q ss_conf 378862268852178899999999999987649840013211354477-2111377750311121000246502676125 Q gi|254780874|r 112 CILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGG-YKEIVATISGRGVFSRMKFESGVHRVQRVPA 190 (357) Q Consensus 112 ailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aG-ik~v~~~I~G~~aYg~Lk~E~GvHRvqRip~ 190 (357) +||||+||+||+||++||+||||||.|||+++||++++++.++|+.+| ||+|++.|+|++||++|++|.|+|+.|+.|| T Consensus 2 vileI~aG~GG~Ea~~fa~~L~rmy~r~a~~~g~~~eil~~~~~~~~g~~ks~~~~i~G~~a~~~l~~~~G~v~~V~~sp 81 (204) T PRK08179 2 ILLQLSSAQGPAECCLAVAKALERLTKEAARQKVRVEVLETETGRYPDTLRSALVSLDGENAWALAESWCGTIQWICPSP 81 (204) T ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCEEEEEEEEEECCHHHHHEEECCEEEEEEECC T ss_conf 89999089896999999999999999999885997999847887779835999999982465330000554379897679 Q ss_pred CCCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEEECCCCEEE-EECCCCCHHHHHHH Q ss_conf 45678511016888853033433222454347999852888752000632121588964984488-84365413589999 Q gi|254780874|r 191 TEASGRVHTSAATVAVLPEAAEIDVDIPLEDIRIDTMRASGSGGQHVNTTDSAVRITHIPTGIMV-TSSEKSQHQNRLRA 269 (357) Q Consensus 191 ~es~gR~hTS~a~V~V~P~~~~~~~~i~~~dl~i~~~rssG~GGQ~vN~t~savri~H~ptgi~v-~~~eRsQ~~Nk~~A 269 (357) |....+|||||++|.++|.. ++++++||+|||||||||||||||||||||||||+||||+| ||+|||||+||++| T Consensus 82 ~~~~~~R~~~~v~v~~~~~~----~~~~~~Dlridt~RssG~GGQhVNkT~SAVRitH~PTGi~V~~q~eRSQ~~Nk~~A 157 (204) T PRK08179 82 YRPHHGRKNWFVGIGRFSAD----EQEQSDEIRYETLRSSGPGGQHVNKTDSAVRATHLASGISVKVQSERSQHANKRLA 157 (204) T ss_pred CCCCCCEEEEEEEEEEECCC----CCCCCCCEEEEEEECCCCCCCCCCCCCCEEEEEECCCCEEEEECCCCCHHHHHHHH T ss_conf 88877559999998760776----67786334899997479988750465524999975887799971633589999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEECCCCCCC Q ss_conf 9999999999999999999987446522467423822104489981 Q gi|254780874|r 270 MKVLRARLYDVERKRMANERSANRKLQIGSGDRSERIRTYNFSQGR 315 (357) Q Consensus 270 ~~~L~~~l~~~~~~~~~~~~~~~r~~~~g~~~rs~~iRtY~~~~~r 315 (357) |++|++||++.+.+++.++. +.|+.++|+++||++||||.++... T Consensus 158 ~~~L~arL~~~~~~~~~~~~-~~r~~~~~~~~Rg~~IRtY~g~~f~ 202 (204) T PRK08179 158 RLLIAWKLEQQQQEQSAALK-SQRRMFHHQIERGNPRRVFTGMAFT 202 (204) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHCCCCCCCCCCCEEECCCCCC T ss_conf 99999999999999999999-9998666862157880676189888 No 15 >pfam00472 RF-1 RF-1 domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly conserved motif GGQ, where the glutamine is thought to coordinate the water that mediates the hydrolysis. Probab=100.00 E-value=7.3e-38 Score=293.80 Aligned_cols=113 Identities=54% Similarity=0.924 Sum_probs=107.9 Q ss_pred ECCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEEECCCCEEE-EECCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 033433222454347999852888752000632121588964984488-8436541358999999999999999999999 Q gi|254780874|r 208 PEAAEIDVDIPLEDIRIDTMRASGSGGQHVNTTDSAVRITHIPTGIMV-TSSEKSQHQNRLRAMKVLRARLYDVERKRMA 286 (357) Q Consensus 208 P~~~~~~~~i~~~dl~i~~~rssG~GGQ~vN~t~savri~H~ptgi~v-~~~eRsQ~~Nk~~A~~~L~~~l~~~~~~~~~ 286 (357) |..+.++|+|+++||+|+|||||||||||||||+|||||+|+||||+| ||++|||++||+.||++|+++|++.+.++.. T Consensus 1 ~~~~~~~i~i~~~dl~~~~~RssGpGGQ~VNKt~saV~l~h~ptgi~v~~q~~RSq~~Nr~~A~~~L~~~l~~~~~~k~~ 80 (114) T pfam00472 1 PLDEEVEIEIPPSDLRIDTFRSSGPGGQNVNKTESAVRLTHLPTGIVVKCQEERSQHKNREKALERLKAKLYEAELQKKR 80 (114) T ss_pred CCCCCCCEECCHHHEEEEEEECCCCCCCCCCHHHEEEEEEECCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98885111418799489988878986565472041589998476489997077888999999999999999999888999 Q ss_pred HHHHHHHHHHCCCCCCCCCEEECCCCCCCEEEEE Q ss_conf 9998744652246742382210448998112432 Q gi|254780874|r 287 NERSANRKLQIGSGDRSERIRTYNFSQGRITDHR 320 (357) Q Consensus 287 ~~~~~~r~~~~g~~~rs~~iRtY~~~~~rv~DhR 320 (357) .+....|+++++.++|++||||||||+++||||| T Consensus 81 ~~~~~~r~~~~~~~~~~~kiRtY~~~~~~Vkd~R 114 (114) T pfam00472 81 EETKPTRKSQVRRGDRKEKIRTYNFPQGRVKDHR 114 (114) T ss_pred HHHHHHHHHHCCCCCCCCCCCCEECCCCCCCCCC T ss_conf 9987998600366855466001337877767697 No 16 >pfam03462 PCRF PCRF domain. This domain is found in peptide chain release factors. Probab=100.00 E-value=8.3e-36 Score=278.94 Aligned_cols=112 Identities=53% Similarity=0.881 Sum_probs=109.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 03785356654332223320005778889998640001135540378862268852178899999999999987649840 Q gi|254780874|r 68 DHNSDSEIRDLAQIEVLAIEKEIRELENEINYLLLPKDTDDDKSCILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKV 147 (357) Q Consensus 68 e~e~D~El~elAeeEl~~l~~~l~~le~eL~~~LLpsde~D~~nailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~ 147 (357) ..++|++|.++|+.|+..+..+++.++.++..+|+|++|+|.+||||||||||||+||||||+||||||+|||+++||++ T Consensus 4 ~~e~D~e~~~~a~~e~~~l~~~l~~le~~l~~~llp~~~~D~~~~ilEI~aGaGG~EA~~fa~~L~rMY~r~ae~~gw~~ 83 (115) T pfam03462 4 EEEDDPELRELAEEELEELEKELEELEKELLNLLLPLDPYDDKNAILEIRAGAGGTEAQLFAEDLLRMYQRYAERKGWKV 83 (115) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCEE T ss_conf 10799899999999999999999999999999707999776678689993688968999999999999999999779988 Q ss_pred CHHHHCCCCCCCCCEEEEEEECCCHHHHHHCC Q ss_conf 01321135447721113777503111210002 Q gi|254780874|r 148 EVLSSSDNDDGGYKEIVATISGRGVFSRMKFE 179 (357) Q Consensus 148 eii~~~~ge~aGik~v~~~I~G~~aYg~Lk~E 179 (357) +|+++++|+.||||+|+|.|+|++|||+||+| T Consensus 84 evl~~~~~e~gG~k~a~~~I~G~~aYg~Lk~E 115 (115) T pfam03462 84 EVLDYSEGEEGGIKEATLEIEGEGAYGYLKFE 115 (115) T ss_pred EEEECCCCCCCCEEEEEEEEECCCHHHHCCCC T ss_conf 99755887777547999999863858753388 No 17 >PRK09256 hypothetical protein; Provisional Probab=99.67 E-value=6.3e-17 Score=142.77 Aligned_cols=67 Identities=33% Similarity=0.492 Sum_probs=60.4 Q ss_pred CCCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEE------ECCCC------------------EEEEEC-CCCCHHHHHH Q ss_conf 2224543479998528887520006321215889------64984------------------488843-6541358999 Q gi|254780874|r 214 DVDIPLEDIRIDTMRASGSGGQHVNTTDSAVRIT------HIPTG------------------IMVTSS-EKSQHQNRLR 268 (357) Q Consensus 214 ~~~i~~~dl~i~~~rssG~GGQ~vN~t~savri~------H~ptg------------------i~v~~~-eRsQ~~Nk~~ 268 (357) .|.|+++||.+.+.|||||||||||||+|+|.|. |+|++ |+|.|+ .|||++|++. T Consensus 7 ~i~Ip~~el~~~f~RSsGPGGQnVNKv~Tkv~Lr~~~~~s~lp~~~k~rl~~~~~~ri~~~g~lvI~s~~~RsQ~~Nr~~ 86 (138) T PRK09256 7 RLVIPENELEWRFIRASGPGGQNVNKVSTAVELRFDIAASSLPEFYKERLLALAGHRITKDGVIVIKAQEFRSQERNRED 86 (138) T ss_pred CEEECHHHCEEEEEECCCCCCCCCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHH T ss_conf 73787699077777757998764230000278896330146999999999998626517797399997452699998999 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999999 Q gi|254780874|r 269 AMKVLRARLYDV 280 (357) Q Consensus 269 A~~~L~~~l~~~ 280 (357) |+..|...|.+. T Consensus 87 a~~rL~~li~~a 98 (138) T PRK09256 87 ALERLAALIRAA 98 (138) T ss_pred HHHHHHHHHHHC T ss_conf 999999999980 No 18 >KOG3429 consensus Probab=98.94 E-value=1.7e-09 Score=89.20 Aligned_cols=72 Identities=31% Similarity=0.523 Sum_probs=58.2 Q ss_pred CCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEC-------C------------------CCEEEEEC-CCCCHH Q ss_conf 433222454347999852888752000632121588964-------9------------------84488843-654135 Q gi|254780874|r 211 AEIDVDIPLEDIRIDTMRASGSGGQHVNTTDSAVRITHI-------P------------------TGIMVTSS-EKSQHQ 264 (357) Q Consensus 211 ~~~~~~i~~~dl~i~~~rssG~GGQ~vN~t~savri~H~-------p------------------tgi~v~~~-eRsQ~~ 264 (357) ..+.-.|+.+-+.|.+.||||||||||||+.|-|-++-. | .-|++..+ .||||. T Consensus 31 ~~~~g~ipld~~~i~y~RSSGPGGQNVNKvNTKv~vrf~vs~a~Wipe~~R~~~~~~~~~rink~gelvI~Sd~TRsq~~ 110 (172) T KOG3429 31 IHFKGKIPLDQLEISYSRSSGPGGQNVNKVNTKVEVRFKVSNAEWIPEFLRNKLLTTEKNRINKDGELVIYSDKTRSQHK 110 (172) T ss_pred CCCCCCCCHHHEEEEEEECCCCCCCCCCCCCCEEEEEEECCHHHHCCHHHHHHHHHHHHHHHCCCCCEEEECCHHHHHHC T ss_conf 65588786316478874057998765202144389997501365611999999999988764257648994150477662 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 899999999999999999 Q gi|254780874|r 265 NRLRAMKVLRARLYDVER 282 (357) Q Consensus 265 Nk~~A~~~L~~~l~~~~~ 282 (357) |.+.||..|++-+++.+. T Consensus 111 NiaDcleKlr~~I~~~~~ 128 (172) T KOG3429 111 NIADCLEKLRDIIRAAEQ 128 (172) T ss_pred CHHHHHHHHHHHHHHHHC T ss_conf 599999999999999860 No 19 >PRK07245 consensus Probab=95.40 E-value=0.13 Score=32.32 Aligned_cols=65 Identities=11% Similarity=0.169 Sum_probs=41.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEECCCCCCC---EEEEEC-CCEECCHHHHHCCCH Q ss_conf 99999999999999999987446522467423822104489981---124321-644178778728866 Q gi|254780874|r 272 VLRARLYDVERKRMANERSANRKLQIGSGDRSERIRTYNFSQGR---ITDHRI-NLTLYKLEYVLQGYI 336 (357) Q Consensus 272 ~L~~~l~~~~~~~~~~~~~~~r~~~~g~~~rs~~iRtY~~~~~r---v~DhR~-~~~~~~~~~~l~G~l 336 (357) .|-..-.+...++....+.......||+--|+.-.-.|+.-.+- ++-+.+ ++--++|+.|++.-| T Consensus 263 kL~~~~~~~~~~e~~~~~~~~~~~~~g~~IRtY~~~p~~~V~DhRt~~~~~~~~~vldG~ld~fi~a~l 331 (337) T PRK07245 263 KLYQLEQEKKAQEVDALKGDKKEITWGSQIRSYVFTPYTMVKDHRTNFELAQVDKVMDGEINGFIDAYL 331 (337) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEECCCCCCCCCCCCCCEECCHHHHHCCCHHHHHHHHH T ss_conf 999999999998999864152126655766151678988374206697447847875877599999999 No 20 >PRK11147 ABC transporter ATPase component; Reviewed Probab=93.15 E-value=0.38 Score=28.92 Aligned_cols=24 Identities=17% Similarity=0.387 Sum_probs=16.9 Q ss_pred CCCCCCCCCCCCEEEEEEECCCCCC Q ss_conf 3343322245434799985288875 Q gi|254780874|r 209 EAAEIDVDIPLEDIRIDTMRASGSG 233 (357) Q Consensus 209 ~~~~~~~~i~~~dl~i~~~rssG~G 233 (357) .++++++.|...| +|-..=.+|+| T Consensus 334 ~l~~vsl~i~~Ge-~ialvG~NGsG 357 (632) T PRK11147 334 LVKDFSAQVQRGD-KIALIGPNGCG 357 (632) T ss_pred EEEEECCCCCCCC-EEEEECCCCCC T ss_conf 7765333357887-79998898842 No 21 >PRK10636 putative ABC transporter ATP-binding protein; Provisional Probab=89.22 E-value=2.1 Score=23.62 Aligned_cols=23 Identities=22% Similarity=0.429 Sum_probs=14.5 Q ss_pred CCCCCCCCCCCEEEEEEECCCCCC Q ss_conf 343322245434799985288875 Q gi|254780874|r 210 AAEIDVDIPLEDIRIDTMRASGSG 233 (357) Q Consensus 210 ~~~~~~~i~~~dl~i~~~rssG~G 233 (357) ++++++.|.+.| +|-..=..|+| T Consensus 328 l~~vsl~i~~Ge-riaIvG~NGsG 350 (638) T PRK10636 328 LDSIKLNLVPGS-RIGLLGRNGAG 350 (638) T ss_pred CCCCCCEECCCC-EEEEECCCCCC T ss_conf 137750563784-79997478713 No 22 >TIGR03545 conserved hypothetical protein TIGR03545. This model represents a relatively rare but broadly distributed uncharacterized protein family, distributed in 1-2 percent of bacterial genomes, all of which have outer membranes. In many of these genomes, it is part of a two-gene pair. Probab=89.20 E-value=2.1 Score=23.61 Aligned_cols=20 Identities=15% Similarity=0.438 Sum_probs=16.1 Q ss_pred CCHHHHHHHHHHHHHHHHHH Q ss_conf 52178899999999999987 Q gi|254780874|r 122 GSEAALFVGDLFRMYERYAA 141 (357) Q Consensus 122 G~EA~dwA~~L~rMY~r~ae 141 (357) |++++.|++.++.-|.+..- T Consensus 287 g~~i~~~~~~~~~~y~~~~p 306 (554) T TIGR03545 287 GEEIRRYLQTAYDYYDQAEP 306 (554) T ss_pred HHHHHHHHHHHHHHHHHHHH T ss_conf 69999999999999998766 No 23 >PRK12595 bifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase; Reviewed Probab=74.20 E-value=7.5 Score=19.59 Aligned_cols=113 Identities=15% Similarity=0.279 Sum_probs=64.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC---CCHHHHHHHH Q ss_conf 8689899999999999998538531579999999998799999999999999998999999972037---8535665433 Q gi|254780874|r 4 LSHKQICDLKNRFAEIELRMSESPSVDAYIKLTEEYAAISPIISKISIYDQKKQEEQDLHTVIGDHN---SDSEIRDLAQ 80 (357) Q Consensus 4 i~~~kL~~l~~rl~ELE~~Lsdp~~w~d~~klsKe~s~L~~iv~~~~~~~~~~~ei~el~eLl~e~e---~D~El~elAe 80 (357) +++++|+.|+.++++|+.++.+ ..+.-.++.++..+++.... ...| .-..+.+-|..+..... +|..++.+.. T Consensus 1 mm~~~L~~LR~~ID~ID~~iL~--LLNeRa~la~eIG~~K~~~g-~~~y-~P~RE~eiL~rl~~~N~GPl~d~~i~~IFr 76 (360) T PRK12595 1 MMNNELDQLRSEVDELNLQILE--LLNKRGELVQEIGEEKEKQG-TKRY-DPVREREMLNDIAENNEGPFEDSTIQHIFK 76 (360) T ss_pred CCHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHCC-CCCC-CCHHHHHHHHHHHHHCCCCCCHHHHHHHHH T ss_conf 9204899999999899999999--99999999999998887569-9760-858899999999972899998899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECC-CCCC Q ss_conf 222332000577888999864000113554037886226-8852 Q gi|254780874|r 81 IEVLAIEKEIRELENEINYLLLPKDTDDDKSCILEIRAG-TGGS 123 (357) Q Consensus 81 eEl~~l~~~l~~le~eL~~~LLpsde~D~~nailEI~aG-aGG~ 123 (357) + +.. ..++-.+..-+..||-|-+...-|.+++|..- -||. T Consensus 77 E-IF~--ASL~Lqe~~~~k~llvsr~~~~~~t~v~v~~~~iG~~ 117 (360) T PRK12595 77 E-IFK--ASLELQEDDNRKALLVSRKKKPEDTIVDVKGEIIGDG 117 (360) T ss_pred H-HHH--HHHHHHHHCCCCEEEEECCCCCCCCEEEECCEEECCC T ss_conf 9-999--9987654056533787457688887798799997799 No 24 >KOG0971 consensus Probab=70.30 E-value=9.1 Score=18.98 Aligned_cols=42 Identities=21% Similarity=0.176 Sum_probs=15.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCE Q ss_conf 852178899999999999987649840013211354477211 Q gi|254780874|r 121 GGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKE 162 (357) Q Consensus 121 GG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~ 162 (357) +|.||.|-+-+|--.-..+|..+.+--.+--.-||+.-|++. T Consensus 786 e~~eatds~~llq~Lne~~~a~rq~~kki~RrLpgd~~g~~~ 827 (1243) T KOG0971 786 EGQEATDSALLLQDLNESCSAIRQFCKKIRRRLPGDAPGIPA 827 (1243) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHH T ss_conf 304465799999999988999999999999848998765476 No 25 >COG0249 MutS Mismatch repair ATPase (MutS family) [DNA replication, recombination, and repair] Probab=69.83 E-value=9.3 Score=18.91 Aligned_cols=158 Identities=10% Similarity=0.174 Sum_probs=81.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 78689899999999999998538531579999999998799999999----------99999999899999997203785 Q gi|254780874|r 3 FLSHKQICDLKNRFAEIELRMSESPSVDAYIKLTEEYAAISPIISKI----------SIYDQKKQEEQDLHTVIGDHNSD 72 (357) Q Consensus 3 mi~~~kL~~l~~rl~ELE~~Lsdp~~w~d~~klsKe~s~L~~iv~~~----------~~~~~~~~ei~el~eLl~e~e~D 72 (357) +-|-...++|..|++-++....++.+..+....-+...++..++..+ -.+.+....+.++..++...... T Consensus 306 ~~PL~~~~~I~~Rld~Ve~l~~~~~l~~~L~~~L~~v~DleRl~~Rl~~~~~~~rDl~~l~~~l~~~~~i~~~l~~~~~~ 385 (843) T COG0249 306 LQPLLDKEEIEERLDAVEELKDNPELREKLREMLKKVPDLERLLSRLSLGRASPRDLLRLRDSLEKIPEIFKLLSSLKSE 385 (843) T ss_pred HCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCH T ss_conf 48035899999999999999865689999999985076799999999737998656999999999889999998525430 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHH Q ss_conf 35665433222332000577888999864000113554037886226885217889999999999998764984001321 Q gi|254780874|r 73 SEIRDLAQIEVLAIEKEIRELENEINYLLLPKDTDDDKSCILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSS 152 (357) Q Consensus 73 ~El~elAeeEl~~l~~~l~~le~eL~~~LLpsde~D~~nailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~ 152 (357) ..+ ..+...+..+.. +.++...+...+.+.-|.. ++. |.=+.-|-.+|..+=..+-..+.|-.++... T Consensus 386 ~~l-~~~~~~i~~~~~-~~e~~~ll~~~i~~~~~~~-------~~~---~ii~~g~~~eLd~lr~~~~~~~~~i~~le~~ 453 (843) T COG0249 386 SDL-LLLLEDIESLDY-LAELLELLETAINEDPPLA-------VRD---GIIKEGYNIELDELRDLLNNAKEWIAKLELE 453 (843) T ss_pred HHH-HHHHHHHHCCCC-HHHHHHHHHHHHHHCCHHH-------CCH---HHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 235-677644431340-7999999998721166233-------015---5775110399999999998889999999998 Q ss_pred CCCCCCCCCEEEEEEECCCHHHH Q ss_conf 13544772111377750311121 Q gi|254780874|r 153 SDNDDGGYKEIVATISGRGVFSR 175 (357) Q Consensus 153 ~~ge~aGik~v~~~I~G~~aYg~ 175 (357) +-+.-||++ ++|.-..+||| T Consensus 454 -~r~~~gi~s--lki~~n~v~Gy 473 (843) T COG0249 454 -ERERTGIKS--LKIKYNKVYGY 473 (843) T ss_pred -HHHHCCCCH--HHHHHHCCCEE T ss_conf -888649721--21122034605 No 26 >PRK13520 L-tyrosine decarboxylase; Provisional Probab=65.91 E-value=8.8 Score=19.09 Aligned_cols=115 Identities=21% Similarity=0.222 Sum_probs=59.7 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCE--EEEEEECCCHHHHHHCCCCCEEEEEECCCCCCCCC Q ss_conf 8852178899999999999987649840013211354477211--13777503111210002465026761254567851 Q gi|254780874|r 120 TGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKE--IVATISGRGVFSRMKFESGVHRVQRVPATEASGRV 197 (357) Q Consensus 120 aGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~--v~~~I~G~~aYg~Lk~E~GvHRvqRip~~es~gR~ 197 (357) +|||||.+.|--.+| .+ +|++. +.+..+..+++...-.--|+ ++++||.. .++|+ T Consensus 87 sGGTes~l~Al~aaR---~~------------------~~~~~p~iv~~~~aH~s~~Kaa~~~gi-~~~~vp~d-~~~~~ 143 (375) T PRK13520 87 SGGTEANIQAVRAAR---NL------------------ADAEKPNIVVPESAHFSFDKAADMLGL-ELRRAPLD-EDYRV 143 (375) T ss_pred CHHHHHHHHHHHHHH---HH------------------CCCCCCEEEEECCCHHHHHHHHHHCCC-EEEEEEEC-CCCCC T ss_conf 618999999999997---62------------------689998799967316999999997495-56994038-88869 Q ss_pred C----------EEEEEEEEEECC-----CCCC-C-C-CCCCEEEEEEECCCC--------CCCC---CCCCCCCEEEEEE Q ss_conf 1----------016888853033-----4332-2-2-454347999852888--------7520---0063212158896 Q gi|254780874|r 198 H----------TSAATVAVLPEA-----AEID-V-D-IPLEDIRIDTMRASG--------SGGQ---HVNTTDSAVRITH 248 (357) Q Consensus 198 h----------TS~a~V~V~P~~-----~~~~-~-~-i~~~dl~i~~~rssG--------~GGQ---~vN~t~savri~H 248 (357) . ..++-|+..|.. |+++ + + ....++-+-+=-+.| +.|. .+.-++|=.-=.| T Consensus 144 d~~~l~~~i~~~ti~vvatagtt~~G~iDpi~~i~~i~~~~~i~lHVDaa~GGf~~pf~~~~~~~df~l~~vdSIs~d~H 223 (375) T PRK13520 144 DVKSVEDLIDDNTIGIVGIAGTTELGQVDPIPELSKIALENDIFLHVDAAFGGFVIPFLDDPYDFDFSLPGVSSITIDPH 223 (375) T ss_pred CHHHHHHHHHHCCEEEEEECCCCCCEEECCHHHHHHHHHHCCCEEEEECCCCEEEEECCCCCCCCCCCCCCEEEEEECCC T ss_conf 89999998864957999925887882235999999999975974998625520221125766400047788026424611 Q ss_pred C------CCCEEEEE Q ss_conf 4------98448884 Q gi|254780874|r 249 I------PTGIMVTS 257 (357) Q Consensus 249 ~------ptgi~v~~ 257 (357) + |+|+++.- T Consensus 224 K~g~~p~g~g~vl~r 238 (375) T PRK13520 224 KMGLAPIPAGGILFR 238 (375) T ss_pred CCCCCCCCEEEEEEC T ss_conf 136515771599987 No 27 >pfam02128 Peptidase_M36 Fungalysin metallopeptidase (M36). Probab=65.63 E-value=5.8 Score=20.37 Aligned_cols=42 Identities=14% Similarity=0.126 Sum_probs=21.9 Q ss_pred CCEEECCC------CCCCEEEEECCCEECCHHHHHCCCHHHHHHHHHH Q ss_conf 82210448------9981124321644178778728866999999999 Q gi|254780874|r 304 ERIRTYNF------SQGRITDHRINLTLYKLEYVLQGYIDDIINPLLT 345 (357) Q Consensus 304 ~~iRtY~~------~~~rv~DhR~~~~~~~~~~~l~G~l~~~i~~~~~ 345 (357) --||.|-| .|.+-+|.-..-++|++-.|-.--|-++..+|.. T Consensus 249 ~GIR~~pYStd~~~NP~tY~~i~~~~~vH~~GeVWA~mLwe~~~aLi~ 296 (368) T pfam02128 249 AGIRPYPYSTDLTVNPLTYADIDGLSEVHGVGTVWATMLWEVLWNLID 296 (368) T ss_pred CCEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 733047655776759977310777763365317999999999999999 No 28 >pfam11593 Med3 Mediator complex subunit 3 fungal. Mediator is a large complex of up to 33 proteins that is conserved from plants to fungi to humans - the number and representation of individual subunits varying with species. It is arranged into four different sections, a core, a head, a tail and a kinase-activity part, and the number of subunits within each of these is what varies with species. Overall, Mediator regulates the transcriptional activity of RNA polymerase II but it would appear that each of the four different sections has a slightly different function. Mediator subunit Hrs1/Med3 is a physical target for Cyc8-Tup1, a yeast transcriptional co-repressor. Probab=62.55 E-value=13 Score=17.95 Aligned_cols=82 Identities=16% Similarity=0.210 Sum_probs=52.5 Q ss_pred HHHHHHHHHHHCCCCCHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 9999999998538531579--99999999879999999999999999899999997203785356654332223320005 Q gi|254780874|r 13 KNRFAEIELRMSESPSVDA--YIKLTEEYAAISPIISKISIYDQKKQEEQDLHTVIGDHNSDSEIRDLAQIEVLAIEKEI 90 (357) Q Consensus 13 ~~rl~ELE~~Lsdp~~w~d--~~klsKe~s~L~~iv~~~~~~~~~~~ei~el~eLl~e~e~D~El~elAeeEl~~l~~~l 90 (357) .-+|++|+..|...+.-.| ..++.+-...+-|+--.|++|..+...|++... ...+ |-.-++..-+.+|.+.+ T Consensus 7 ~~kLdeL~~~l~~~d~~kd~V~~~I~ea~d~ILPLRLqFNdfi~~mS~Ie~~~~----~t~q-EKfllIRsKlleL~~~i 81 (381) T pfam11593 7 NLTLDELQELLAETEGTKDEVADKIKEAKDAILPLRLQFNEFIRIISHIEEMGN----KTSQ-EKFLLIRSKLLELYNNI 81 (381) T ss_pred CCCHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCC----CCHH-HHHHHHHHHHHHHHHHH T ss_conf 651999999986266608999999998653322089889999999998763134----6748-88999999999999999 Q ss_pred HHHHHHHHH Q ss_conf 778889998 Q gi|254780874|r 91 RELENEINY 99 (357) Q Consensus 91 ~~le~eL~~ 99 (357) .+|..+++. T Consensus 82 q~lS~df~~ 90 (381) T pfam11593 82 QKLSKDFQQ 90 (381) T ss_pred HHHHHHHHH T ss_conf 999999987 No 29 >pfam09418 DUF2009 Protein of unknown function (DUF2009). This is a eukaryotic family of proteins with unknown function. Probab=62.04 E-value=13 Score=17.88 Aligned_cols=73 Identities=21% Similarity=0.261 Sum_probs=51.3 Q ss_pred HCCCCCCCCCCE-EEEEECCCCCCH---------------HHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEE Q ss_conf 400011355403-788622688521---------------7889999999999998764984001321135447721113 Q gi|254780874|r 101 LLPKDTDDDKSC-ILEIRAGTGGSE---------------AALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIV 164 (357) Q Consensus 101 LLpsde~D~~na-ilEI~aGaGG~E---------------A~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~ 164 (357) .-+-+|.+..+. =|.|++|.||.- =.+|-.++-.||.=|+-. +.|. T Consensus 229 ~~~f~P~~~~~~~sL~I~~G~~GaRLsH~H~rQy~yV~QSL~LW~~i~~~mf~LW~~a-----------e~Dl------- 290 (458) T pfam09418 229 EEFFSPDSATRGFSLAIDYGRGGARLTHNHRRQYHYVLQSLSLWSNIMREMFSLWYLA-----------DADL------- 290 (458) T ss_pred HHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCC-----------CHHH------- T ss_conf 9748987789997620004898653577534113467838999999999999988524-----------0755------- Q ss_pred EEEECCCHHHHHHCCCCCEEEEEECCCCC Q ss_conf 77750311121000246502676125456 Q gi|254780874|r 165 ATISGRGVFSRMKFESGVHRVQRVPATEA 193 (357) Q Consensus 165 ~~I~G~~aYg~Lk~E~GvHRvqRip~~es 193 (357) .+|.+-|..--.=-|.||||.-|..-+ T Consensus 291 --l~~~~~Y~L~~TGQGl~Rvq~~P~v~~ 317 (458) T pfam09418 291 --LNPNEPYELRDTGQGLNRVQKCPNVYK 317 (458) T ss_pred --CCCCCCEEEECCCCCHHHHHCCCHHHH T ss_conf --399996065017742887504867999 No 30 >KOG1697 consensus Probab=61.97 E-value=6.6 Score=19.98 Aligned_cols=102 Identities=24% Similarity=0.146 Sum_probs=47.8 Q ss_pred HHHHCCCCCCCCCCEEEEEECCCCCCHHHHH--HHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEECCCHHHH Q ss_conf 9864000113554037886226885217889--99999999999876498400132113544772111377750311121 Q gi|254780874|r 98 NYLLLPKDTDDDKSCILEIRAGTGGSEAALF--VGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIVATISGRGVFSR 175 (357) Q Consensus 98 ~~~LLpsde~D~~nailEI~aGaGG~EA~dw--A~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~G~~aYg~ 175 (357) ...+.|++-+|.+.-.+- ++++.|-.-| ..|++-|-..-+..-.-.++-+-.++.... +..--..|-. T Consensus 70 i~~l~psg~~d~~arp~m---~~~~~et~~~~ll~di~~~~~~~~~~~~~s~~~l~s~p~~~~-------e~~f~~~~~k 139 (275) T KOG1697 70 ISYLFPSGLFDPNARPVM---AFQFDETKYYQLLHDIGVLTNKLAKLYSISIEHLLSSPYSAI-------EEKFLKRFRK 139 (275) T ss_pred HHHHCCCCCCCCCCCCCC---CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHH-------HHHHHHHHHH T ss_conf 887463123573346420---136785048999999876654566531067887761714577-------7999999873 Q ss_pred HHCCCCCEEEEEECCCCCCCCCCEEEEEEEEEECCCCC Q ss_conf 00024650267612545678511016888853033433 Q gi|254780874|r 176 MKFESGVHRVQRVPATEASGRVHTSAATVAVLPEAAEI 213 (357) Q Consensus 176 Lk~E~GvHRvqRip~~es~gR~hTS~a~V~V~P~~~~~ 213 (357) .--|+|-- -|| ....|||.||.|+|.|.|--..+ T Consensus 140 ~~~~~g~~-~~~---~~~~g~rK~a~A~V~v~~GtGk~ 173 (275) T KOG1697 140 PLDESGKP-EVR---IIAVGRRKCARATVKVQPGTGKF 173 (275) T ss_pred CCCCCCCC-CEE---EEECCCEECCEEEEEEECCCEEE T ss_conf 02546887-514---66125400011379995684048 No 31 >PRK05399 DNA mismatch repair protein; Provisional Probab=60.56 E-value=14 Score=17.71 Aligned_cols=46 Identities=11% Similarity=0.157 Sum_probs=33.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8689899999999999998538531579999999998799999999 Q gi|254780874|r 4 LSHKQICDLKNRFAEIELRMSESPSVDAYIKLTEEYAAISPIISKI 49 (357) Q Consensus 4 i~~~kL~~l~~rl~ELE~~Lsdp~~w~d~~klsKe~s~L~~iv~~~ 49 (357) -|-..++.|..|++-++..+.++...++.+..-+...+++.+...+ T Consensus 308 ~PL~d~~~I~~RldaVe~l~~~~~~~~~lr~~Lk~i~DlERllsRi 353 (848) T PRK05399 308 RPLRDREAINARLDAVEELLDNPLLREDLRELLKGVYDLERILSRV 353 (848) T ss_pred HHCCCHHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 3562778999999999999749799999999985489889999999 No 32 >pfam08399 VWA_N VWA N-terminal. This domain is found at the N-terminus of proteins containing von Willebrand factor type A (VWA, pfam00092) and Cache (pfam02743) domains. It has been found in vertebrates, Drosophila and C. elegans but has not yet been identified in other eukaryotes. It is probably involved in the function of some voltage-dependent calcium channel subunits. Probab=59.33 E-value=4.1 Score=21.44 Aligned_cols=19 Identities=37% Similarity=0.536 Sum_probs=15.5 Q ss_pred CCCCCCCCCCEEEEEECCCCEE Q ss_conf 5200063212158896498448 Q gi|254780874|r 233 GGQHVNTTDSAVRITHIPTGIM 254 (357) Q Consensus 233 GGQ~vN~t~savri~H~ptgi~ 254 (357) -+..||++-||| |+||+|- T Consensus 55 ~~~~Vn~~~S~V---hiPt~Vy 73 (123) T pfam08399 55 NNLPVNTSVSSV---QLPTNVY 73 (123) T ss_pred CCCCCCCEEEEE---ECCCCCC T ss_conf 786110634127---8576403 No 33 >TIGR02168 SMC_prok_B chromosome segregation protein SMC; InterPro: IPR011890 The SMC (structural maintenance of chromosomes) family of proteins, exist in virtually all organisms including both bacteria and archaea. The SMC proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms and form three types of heterodimer (SMC1SMC3, SMC2SMC4, SMC5SMC6), which are core components of large multiprotein complexes. The best known complexes are cohesin, which is responsible for sister-chromatid cohesion, and condensin, which is required for full chromosome condensation in mitosis. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and share a five-domain structure, with globular N- and C-terminal (IPR003395 from INTERPRO) domains separated by a long (circa 100 nm or 900 residues) coiled coil segment in the centre of which is a globular ''hinge'' domain, characterised by a set of four highly conserved glycine residues that are typical of flexible regions in a protein. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif (XXXXD, where X is any hydrophobic residue), and a LSGG motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases . All SMC proteins appear to form dimers, either forming homodimers with themselves, as in the case of prokaryotic SMC proteins, or heterodimers between different but related SMC proteins. The dimers are arranged in an antiparallel alignment. This orientation brings the N- and C-terminal globular domains (from either different or identical protamers) together, which unites an ATP binding site (Walker A motif) within the N-terminal domain with a Walker B motif (DA box) within the C-terminal domain, to form a potentially functional ATPase. Protein interaction and microscopy data suggest that SMC dimers form a ring-like structure which might embrace DNA molecules. Non-SMC subunits associate with the SMC amino- and carboxy-terminal domains. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. SMCs share not only sequence similarity but also structural similarity with ABC proteins. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair and epigenetic silencing of gene expression . The smc gene is often associated with scpB (IPR005234 from INTERPRO) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle , , .; GO: 0005515 protein binding, 0005524 ATP binding, 0005694 chromosome. Probab=55.93 E-value=16 Score=17.17 Aligned_cols=36 Identities=25% Similarity=0.380 Sum_probs=25.8 Q ss_pred CCCCEEE-EE-C-CCCCHHHHHH-HHHHHHHHHHHHHHHH Q ss_conf 4984488-84-3-6541358999-9999999999999999 Q gi|254780874|r 249 IPTGIMV-TS-S-EKSQHQNRLR-AMKVLRARLYDVERKR 284 (357) Q Consensus 249 ~ptgi~v-~~-~-eRsQ~~Nk~~-A~~~L~~~l~~~~~~~ 284 (357) -|-|++. .. . ..+...++.. .++-|+.++-.++.+- T Consensus 661 ~~~G~~~gg~~~~~~~~~l~r~~~ei~~l~~~~~~~~~~~ 700 (1191) T TIGR02168 661 RPGGVITGGSAKKTNSSILERRRIEIEELEEEIEELEEKI 700 (1191) T ss_pred EECEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 4033676465332012346888999999999999999999 No 34 >TIGR02194 GlrX_NrdH Glutaredoxin-like protein NrdH; InterPro: IPR011909 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. NrdH-redoxin is a representative of a class of small redox proteins that contain a conserved CXXC motif and are characterised by a glutaredoxin-like amino acid sequence and thioredoxin-like activity profile. Unlike other the glutaredoxins to which it is most closely related, NrdH aparrently does not interact with glutathione/glutathione reductase, but rather with thioredoxin reductase to catalyze the reduction of ribonucleotide reductase .; GO: 0006118 electron transport, 0045454 cell redox homeostasis. Probab=55.32 E-value=7.2 Score=19.69 Aligned_cols=41 Identities=12% Similarity=0.340 Sum_probs=34.6 Q ss_pred HHHHHHHHHCCCCCCHHHHCCCCCC-------CCCEEEEEEE-CCCHHH Q ss_conf 9999998764984001321135447-------7211137775-031112 Q gi|254780874|r 134 RMYERYAALRKWKVEVLSSSDNDDG-------GYKEIVATIS-GRGVFS 174 (357) Q Consensus 134 rMY~r~ae~~gwk~eii~~~~ge~a-------Gik~v~~~I~-G~~aYg 174 (357) .|=.||-+.+|..||.++++.-+++ |++.|=+-.. |..++. T Consensus 13 kmTKk~L~~~~i~Fe~inid~~pea~d~vk~lGF~~~PVi~~~g~~~~S 61 (72) T TIGR02194 13 KMTKKALEEHGIAFEEINIDEQPEAVDYVKALGFRQVPVIVAEGDESWS 61 (72) T ss_pred CHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCEECCEEEECCCEEEE T ss_conf 1577667637996157635888437899987486307626767970021 No 35 >pfam05557 MAD Mitotic checkpoint protein. This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated. Probab=53.80 E-value=17 Score=16.94 Aligned_cols=33 Identities=21% Similarity=0.180 Sum_probs=19.6 Q ss_pred HHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEE Q ss_conf 999999999998764984001321135447721113 Q gi|254780874|r 129 VGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIV 164 (357) Q Consensus 129 A~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~ 164 (357) +++|.-+|..|-...-..++.+ ..+.+|+|... T Consensus 445 ~e~~~q~~~~~~~~~e~~i~~l---e~~l~~~K~~~ 477 (722) T pfam05557 445 AEDLVQKVQSHLAKMENQLSEL---EEDVGQQKDRN 477 (722) T ss_pred HHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHH T ss_conf 8899999998899999999999---99988899998 No 36 >PRK08883 ribulose-phosphate 3-epimerase; Provisional Probab=53.36 E-value=18 Score=16.89 Aligned_cols=59 Identities=19% Similarity=0.324 Sum_probs=40.8 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCE----------EEEEEECCCHHHH Q ss_conf 378862268852178899999999999987649840013211354477211----------1377750311121 Q gi|254780874|r 112 CILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKE----------IVATISGRGVFSR 175 (357) Q Consensus 112 ailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~----------v~~~I~G~~aYg~ 175 (357) .+|.+.||-||..=..-+-+-.+...++...+|+.+++. . .||++. |...|.|.+.|+. T Consensus 133 LvMtV~PGf~GQ~f~~~~l~Ki~~l~~~~~~~~~~~~I~--V---DGGI~~~ti~~l~~aGad~~V~GS~iF~~ 201 (220) T PRK08883 133 LLMSVNPGFGGQSFIPHTLDKLRAVRKMIDASGRDIRLE--I---DGGVKVDNIREIAEAGADMFVAGSAIFGQ 201 (220) T ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEE--E---ECCCCHHHHHHHHHCCCCEEEECHHHHCC T ss_conf 998745898875455779999999999887449980799--9---89878999999998799999968267489 No 37 >COG1256 FlgK Flagellar hook-associated protein [Cell motility and secretion] Probab=52.41 E-value=18 Score=16.79 Aligned_cols=19 Identities=16% Similarity=0.277 Sum_probs=10.5 Q ss_pred HHHHHHHHHHHHHHHCCCC Q ss_conf 9999999999999853853 Q gi|254780874|r 9 ICDLKNRFAEIELRMSESP 27 (357) Q Consensus 9 L~~l~~rl~ELE~~Lsdp~ 27 (357) .....+-+.+|+..+++|+ T Consensus 87 ~~t~~~~L~~le~ll~~~~ 105 (552) T COG1256 87 LDTRASQLSQLESLLSEPS 105 (552) T ss_pred HHHHHHHHHHHHHHHCCCC T ss_conf 9999999999999847976 No 38 >PRK00474 rps9p 30S ribosomal protein S9P; Reviewed Probab=52.12 E-value=9.3 Score=18.91 Aligned_cols=32 Identities=19% Similarity=0.138 Sum_probs=23.0 Q ss_pred CCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEEEE Q ss_conf 567851101688885303343322245434799985 Q gi|254780874|r 192 EASGRVHTSAATVAVLPEAAEIDVDIPLEDIRIDTM 227 (357) Q Consensus 192 es~gR~hTS~a~V~V~P~~~~~~~~i~~~dl~i~~~ 227 (357) -+.|||.||.|.|.+.|= .-.|.||..++ +.| T Consensus 5 ~atGrRKtaiArv~l~~G--~G~i~INgk~~--~~y 36 (132) T PRK00474 5 ITSGKRKTAIARATIKEG--KGRVRINGVPL--ELY 36 (132) T ss_pred EEECCCCCEEEEEEEECC--CCEEEECCEEH--HHH T ss_conf 876787667999999828--73598898758--996 No 39 >PRK05581 ribulose-phosphate 3-epimerase; Validated Probab=51.68 E-value=19 Score=16.71 Aligned_cols=59 Identities=20% Similarity=0.303 Sum_probs=39.9 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCE----------EEEEEECCCHHHH Q ss_conf 378862268852178899999999999987649840013211354477211----------1377750311121 Q gi|254780874|r 112 CILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKE----------IVATISGRGVFSR 175 (357) Q Consensus 112 ailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~----------v~~~I~G~~aYg~ 175 (357) -+|.+.||.||.--...+-.-.+-..+|...+|+++.+. =.||++. |...|.|.+.|+. T Consensus 136 lvMtV~PGf~GQ~f~~~~l~ki~~l~~~~~~~~~~~~I~-----VDGGIn~~~i~~l~~~Gad~~V~GS~iF~~ 204 (220) T PRK05581 136 LLMSVNPGFGGQKFIPEVLEKIREVRKLIDERGLDILIE-----VDGGVNAENIKECAEAGADVFVAGSAVFGA 204 (220) T ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEE-----EECCCCHHHHHHHHHCCCCEEEECHHHHCC T ss_conf 899865887876455669999999999998459975599-----978989899999997799999979488579 No 40 >pfam10458 Val_tRNA-synt_C Valyl tRNA synthetase tRNA binding arm. This domain is found at the C-terminus of Valyl tRNA synthetases. Probab=51.48 E-value=19 Score=16.69 Aligned_cols=53 Identities=13% Similarity=0.219 Sum_probs=35.9 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCCCHHHH-H-HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 6898999999999999985385315799-9-999999879999999999999999 Q gi|254780874|r 5 SHKQICDLKNRFAEIELRMSESPSVDAY-I-KLTEEYAAISPIISKISIYDQKKQ 57 (357) Q Consensus 5 ~~~kL~~l~~rl~ELE~~Lsdp~~w~d~-~-klsKe~s~L~~iv~~~~~~~~~~~ 57 (357) ..+.++.+...+.-++.+|++++|.+++ . -+.++...+......+..+...+. T Consensus 9 L~Kel~kl~~~i~~~~~kL~N~~Fv~kAP~~VV~~er~kl~~~~~~~~~l~~~L~ 63 (66) T pfam10458 9 LEKELAKLQKEIERLQKKLSNPGFVAKAPAEVVEEEKAKLAEYEEQLEKLKERLS 63 (66) T ss_pred HHHHHHHHHHHHHHHHHHHCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999877876777709899999999999999999999999998 No 41 >PRK06737 acetolactate synthase 1 regulatory subunit; Validated Probab=50.70 E-value=11 Score=18.43 Aligned_cols=33 Identities=21% Similarity=0.223 Sum_probs=29.7 Q ss_pred HHHCCCCCCHHHHCCCCCCCCCEEEEEEECCCH Q ss_conf 876498400132113544772111377750311 Q gi|254780874|r 140 AALRKWKVEVLSSSDNDDGGYKEIVATISGRGV 172 (357) Q Consensus 140 ae~~gwk~eii~~~~ge~aGik~v~~~I~G~~a 172 (357) +.++||.++-+...+++..|+...|+.+.|+.. T Consensus 23 FsrRGyNI~SL~Vg~te~~~~SRmTivv~gd~~ 55 (76) T PRK06737 23 FARRGYYISSLNLNERDTSGVSEMKLTAVCTEN 55 (76) T ss_pred HHCCCCCEEEEEECCCCCCCCEEEEEEEECCCH T ss_conf 621665711256746679982079999977813 No 42 >pfam02815 MIR MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a domain that may have a ligand transferase function. Probab=50.60 E-value=19 Score=16.60 Aligned_cols=33 Identities=30% Similarity=0.458 Sum_probs=25.7 Q ss_pred EEECCCCCCCCCCCCCCCEEEEEECCCCEEEEE Q ss_conf 985288875200063212158896498448884 Q gi|254780874|r 225 DTMRASGSGGQHVNTTDSAVRITHIPTGIMVTS 257 (357) Q Consensus 225 ~~~rssG~GGQ~vN~t~savri~H~ptgi~v~~ 257 (357) +.--..|.|+.-|-+.+|-+||.|..||.-+.. T Consensus 121 ~~~~~~~~~~~~ik~~~s~frL~H~~tgc~L~s 153 (189) T pfam02815 121 ELKSTTGMGQDRIKPGDSKVRLRHACTGCWLFS 153 (189) T ss_pred EECCCCCCCCCEEEEEEEEEEEEECCCCEEEEE T ss_conf 633677777783898115999999996889986 No 43 >cd01989 STK_N The N-terminal domain of Eukaryotic Serine Threonine kinases. The Serine Threonine kinases are enzymes that belong to a very extensive family of proteins which share a conserved catalytic core common with both serine/threonine and tyrosine protein kinases. The N-terminal domain is homologous to the USP family which has a ATP binding fold. The N-terminal domain is predicted to be involved in ATP binding. Probab=49.19 E-value=15 Score=17.36 Aligned_cols=75 Identities=15% Similarity=0.101 Sum_probs=41.3 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCC-C------CCEEEEEEECCCHHHHHHCCCCCEEEEEECCCCCCCCCCE Q ss_conf 89999999999998764984001321135447-7------2111377750311121000246502676125456785110 Q gi|254780874|r 127 LFVGDLFRMYERYAALRKWKVEVLSSSDNDDG-G------YKEIVATISGRGVFSRMKFESGVHRVQRVPATEASGRVHT 199 (357) Q Consensus 127 dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~a-G------ik~v~~~I~G~~aYg~Lk~E~GvHRvqRip~~es~gR~hT 199 (357) .=+..||.-|..+|..++-+++.+-...++.+ + -..+...|=|-...|.++ |.-+-|..-+.--+|. T Consensus 61 ~~~~~ll~~~~~~C~~~~V~~e~~v~e~~d~~~~I~e~v~~~~i~~LVmGs~~~~~~~------r~~~gs~vs~~v~~~a 134 (146) T cd01989 61 KEAKELLLPYRCFCSRKGVQCEDVVLEDDDVAKAIVEYVADHGITKLVMGASSDNHFS------MKFKKSDVASSVLKEA 134 (146) T ss_pred HHHHHHHHHHHHHHHHCCCEEEEEEEECCCHHHHHHHHHHHCCCCEEEECCCCCCCEE------EECCCCCHHHHHHHCC T ss_conf 9999999999999987597289999834888899999999759989998627998514------5121786778998379 Q ss_pred -EEEEEEEE Q ss_conf -16888853 Q gi|254780874|r 200 -SAATVAVL 207 (357) Q Consensus 200 -S~a~V~V~ 207 (357) +||.|.|. T Consensus 135 p~~C~V~vV 143 (146) T cd01989 135 PDFCTVYVV 143 (146) T ss_pred CCCCEEEEE T ss_conf 999859999 No 44 >pfam05616 Neisseria_TspB Neisseria meningitidis TspB protein. This family consists of several Neisseria meningitidis TspB virulence factor proteins. Probab=48.96 E-value=15 Score=17.51 Aligned_cols=37 Identities=16% Similarity=0.244 Sum_probs=18.3 Q ss_pred CCCCCEEEEEECC----CCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 3554037886226----885217889999999999998764 Q gi|254780874|r 107 DDDKSCILEIRAG----TGGSEAALFVGDLFRMYERYAALR 143 (357) Q Consensus 107 ~D~~nailEI~aG----aGG~EA~dwA~~L~rMY~r~ae~~ 143 (357) +--..|..-.+.| .-|+++-.|+.--|.|-.+|.|-. T Consensus 199 y~~~~c~f~wNGg~C~V~~g~d~r~~isfsl~rn~kykEem 239 (502) T pfam05616 199 YNFSRCYFNWNGGNCNVGKGDDARSFISFSLIRNPKYKEEM 239 (502) T ss_pred CCCCCCEEECCCCCCEECCCCCCCCEEEEEEEECCCHHHHC T ss_conf 14467657505875433258762214789987076405440 No 45 >pfam05192 MutS_III MutS domain III. This domain is found in proteins of the MutS family (DNA mismatch repair proteins) and is found associated with pfam00488, pfam05188, pfam01624 and pfam05190. The MutS family of proteins is named after the Salmonella typhimurium MutS protein involved in mismatch repair; other members of the family included the eukaryotic MSH 1,2,3, 4,5 and 6 proteins. These have various roles in DNA repair and recombination. Human MSH has been implicated in non-polyposis colorectal carcinoma (HNPCC) and is a mismatch binding protein. The aligned region corresponds with domain III, which is central to the structure of Thermus aquaticus MutS as characterized in. Probab=48.95 E-value=21 Score=16.42 Aligned_cols=114 Identities=8% Similarity=0.092 Sum_probs=53.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHH-------HHH---HHHHHHHHHHHHHHHCCCCC Q ss_conf 78689899999999999998538531579999999998799999999-------999---99999899999997203785 Q gi|254780874|r 3 FLSHKQICDLKNRFAEIELRMSESPSVDAYIKLTEEYAAISPIISKI-------SIY---DQKKQEEQDLHTVIGDHNSD 72 (357) Q Consensus 3 mi~~~kL~~l~~rl~ELE~~Lsdp~~w~d~~klsKe~s~L~~iv~~~-------~~~---~~~~~ei~el~eLl~e~e~D 72 (357) +-|--.++.|..|++-++..+..++......+.-+...++..+...+ ..| .+....+..+..++..... T Consensus 50 ~~Pl~d~~~I~~R~d~Ve~l~~~~~~~~~l~~~Lk~i~Dlerl~~ri~~~~~~~~~~~~l~~~l~~i~~i~~~l~~~~~- 128 (306) T pfam05192 50 LQPLRDREEINARLDAVEELLEDPELRQTLRELLKRVPDLERLLSRLALGRANPRDLLRLRDSLERLPTIKELLEELDS- 128 (306) T ss_pred HCCCCCHHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCC- T ss_conf 5816699999999999999985989999999998648978999999982897889999999999999999999985788- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCC Q ss_conf 35665433222332000577888999864000113554037886226885 Q gi|254780874|r 73 SEIRDLAQIEVLAIEKEIRELENEINYLLLPKDTDDDKSCILEIRAGTGG 122 (357) Q Consensus 73 ~El~elAeeEl~~l~~~l~~le~eL~~~LLpsde~D~~nailEI~aGaGG 122 (357) ....... ......+..+.+.+.....+..+....++. -|++|.-. T Consensus 129 ~~~~~~~----~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~-~i~~g~~~ 173 (306) T pfam05192 129 PLLLKEL----RKSIDDLSELLDLLETAINEDPPLLIREGG-VIKSGYDA 173 (306) T ss_pred HHHHHHH----HHCCCCHHHHHHHHHHHHHCCHHHHCCCCC-EECCCCCH T ss_conf 4678888----744234899999999998246176556788-54899887 No 46 >KOG3274 consensus Probab=48.60 E-value=12 Score=17.99 Aligned_cols=112 Identities=19% Similarity=0.269 Sum_probs=69.3 Q ss_pred HHHHHHHHHHHCCCCCCHHHHCCCCCCCC-CEEEEEEECCCHHHHHHCCCCCEEEEEECCCCCCCCCCEEEEEEEEEECC Q ss_conf 99999999876498400132113544772-11137775031112100024650267612545678511016888853033 Q gi|254780874|r 132 LFRMYERYAALRKWKVEVLSSSDNDDGGY-KEIVATISGRGVFSRMKFESGVHRVQRVPATEASGRVHTSAATVAVLPEA 210 (357) Q Consensus 132 L~rMY~r~ae~~gwk~eii~~~~ge~aGi-k~v~~~I~G~~aYg~Lk~E~GvHRvqRip~~es~gR~hTS~a~V~V~P~~ 210 (357) =+|=|..=+.-+.=.+..+..- |..-+ =+|++.++-+.++++|-+|-|||- +||-=++..|..-|+|---.|.|+- T Consensus 75 Gl~eYaltsAl~DsRF~PIsr~--ELp~L~CsvslL~nFE~i~d~lDWevG~HG-IrieF~~e~g~krsATyLPeVa~EQ 151 (210) T KOG3274 75 GLREYALTSALKDSRFPPISRE--ELPSLQCSVSLLTNFEDIFDYLDWEVGVHG-IRIEFTNETGTKRSATYLPEVAAEQ 151 (210) T ss_pred HHHHHHHHHHHHCCCCCCCCHH--HCCCEEEEEEEECCCHHCCCCCCEEECCCE-EEEEEECCCCCEEEEEECCCCHHHC T ss_conf 3788888887620468987732--417438777753364120353220100332-8999973789584212356321121 Q ss_pred C--CCCCCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEEECCCCE Q ss_conf 4--332224543479998528887520006321215889649844 Q gi|254780874|r 211 A--EIDVDIPLEDIRIDTMRASGSGGQHVNTTDSAVRITHIPTGI 253 (357) Q Consensus 211 ~--~~~~~i~~~dl~i~~~rssG~GGQ~vN~t~savri~H~ptgi 253 (357) . .++ -| ++ -.|-+|=+|-=-+-..+++++|---+-- T Consensus 152 gWd~~e-Ti-ds-----LirKaGY~g~It~~~r~~I~ltRY~S~k 189 (210) T KOG3274 152 GWDQIE-TI-DS-----LIRKAGYKGPITEELRKSIKLTRYRSEK 189 (210) T ss_pred CCCHHH-HH-HH-----HHHHCCCCCCCCHHHHHHEEEEEEECEE T ss_conf 874888-89-99-----9984477775678987120356741223 No 47 >pfam00380 Ribosomal_S9 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes. Probab=47.39 E-value=13 Score=17.88 Aligned_cols=26 Identities=27% Similarity=0.377 Sum_probs=16.9 Q ss_pred CCCCEEEEEEEEEECCCCCCCCCCCCEE Q ss_conf 8511016888853033433222454347 Q gi|254780874|r 195 GRVHTSAATVAVLPEAAEIDVDIPLEDI 222 (357) Q Consensus 195 gR~hTS~a~V~V~P~~~~~~~~i~~~dl 222 (357) |||.||.|.|.+.|=.- .|.||..++ T Consensus 1 GrRKtaiArv~l~~G~G--~i~INg~~~ 26 (121) T pfam00380 1 GRRKTAVARVWLKPGSG--KITINGKPL 26 (121) T ss_pred CCCCCEEEEEEEEECCC--EEEECCCCH T ss_conf 99845899999980776--799989328 No 48 >PRK13182 racA polar chromosome segregation protein; Reviewed Probab=46.03 E-value=23 Score=16.11 Aligned_cols=11 Identities=9% Similarity=0.205 Sum_probs=4.0 Q ss_pred HHHHHHHHHHC Q ss_conf 89999999720 Q gi|254780874|r 58 EEQDLHTVIGD 68 (357) Q Consensus 58 ei~el~eLl~e 68 (357) .+.+++.-+.. T Consensus 103 Rl~~LE~qL~q 113 (178) T PRK13182 103 RLDELEEQLQQ 113 (178) T ss_pred HHHHHHHHHHH T ss_conf 99999999986 No 49 >PRK10329 glutaredoxin-like protein; Provisional Probab=45.91 E-value=13 Score=17.80 Aligned_cols=42 Identities=17% Similarity=0.345 Sum_probs=34.2 Q ss_pred HHHHHHHHHHCCCCCCHHHHCCCCCC-------CCCEEEEEEECCCHHH Q ss_conf 99999998764984001321135447-------7211137775031112 Q gi|254780874|r 133 FRMYERYAALRKWKVEVLSSSDNDDG-------GYKEIVATISGRGVFS 174 (357) Q Consensus 133 ~rMY~r~ae~~gwk~eii~~~~ge~a-------Gik~v~~~I~G~~aYg 174 (357) ..|=.||-+++|..|+.++.+...++ |++++-+-+.|+..+. T Consensus 14 C~aTKr~L~~~gI~y~~vdi~~dpea~~~vk~~G~~q~PVV~~~~~~ws 62 (81) T PRK10329 14 CHATKRAMESRGFDFEMINVDRVPEAADTLRAQGFRQLPVVIAGDLSWS 62 (81) T ss_pred HHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCCCCEEEECCCEEC T ss_conf 7989999998799429985899999999999769855998996995443 No 50 >COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning] Probab=45.83 E-value=23 Score=16.09 Aligned_cols=24 Identities=13% Similarity=0.138 Sum_probs=11.1 Q ss_pred CCHHHHHHCCCCCEEEEEECCCCCC Q ss_conf 3111210002465026761254567 Q gi|254780874|r 170 RGVFSRMKFESGVHRVQRVPATEAS 194 (357) Q Consensus 170 ~~aYg~Lk~E~GvHRvqRip~~es~ 194 (357) ..+..+||. .+.+|+-=+|...-. T Consensus 555 ~~~i~~lk~-~~~gr~tflpl~~i~ 578 (1163) T COG1196 555 KKAIEFLKE-NKAGRATFLPLDRIK 578 (1163) T ss_pred HHHHHHHHH-CCCCCCEECCCHHCC T ss_conf 999999985-489974311412103 No 51 >TIGR03627 arch_S9P archaeal ribosomal protein S9P. This model describes exclusively the archaeal ribosomal protein S9P. Homologous eukaryotic and bacterial ribosomal proteins are excluded from this model. Probab=45.75 E-value=12 Score=18.03 Aligned_cols=32 Identities=19% Similarity=0.123 Sum_probs=22.3 Q ss_pred CCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEEEE Q ss_conf 567851101688885303343322245434799985 Q gi|254780874|r 192 EASGRVHTSAATVAVLPEAAEIDVDIPLEDIRIDTM 227 (357) Q Consensus 192 es~gR~hTS~a~V~V~P~~~~~~~~i~~~dl~i~~~ 227 (357) -+.|||.||.|.|.+.|=. -.|.||..++ +.| T Consensus 3 ~~tGrRKtaiA~v~l~~G~--G~i~VNg~~~--~~~ 34 (130) T TIGR03627 3 ITSGKRKTAIARATIREGK--GRVRINGVPV--ELY 34 (130) T ss_pred EEECCCCCEEEEEEEECCC--EEEEECCEEH--HHH T ss_conf 5567680779999999171--7999999879--994 No 52 >PRK08745 ribulose-phosphate 3-epimerase; Provisional Probab=45.10 E-value=23 Score=16.02 Aligned_cols=59 Identities=19% Similarity=0.282 Sum_probs=38.7 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCE----------EEEEEECCCHHHH Q ss_conf 378862268852178899999999999987649840013211354477211----------1377750311121 Q gi|254780874|r 112 CILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKE----------IVATISGRGVFSR 175 (357) Q Consensus 112 ailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~----------v~~~I~G~~aYg~ 175 (357) -+|.+.||-||..=..-+-.-.+.-.+|.+.+|.++++-- .||++. |...|.|.+.|+. T Consensus 137 liMtV~PGf~GQ~f~~~~l~KI~~l~~~~~~~~~~~~I~V-----DGGI~~~ti~~l~~aGad~~V~GSaiF~~ 205 (223) T PRK08745 137 LVMSVNPGFGGQAFIPSALDKLRAIRKKIDALGKPIRLEI-----DGGVKADNIGAIAAAGADTFVAGSAIFNA 205 (223) T ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEE-----ECCCCHHHHHHHHHCCCCEEEECHHHHCC T ss_conf 9987569988754568899999999999986499945999-----78879899999998699999974177579 No 53 >pfam07050 consensus Probab=44.99 E-value=23 Score=16.00 Aligned_cols=63 Identities=11% Similarity=0.187 Sum_probs=37.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 786898999999999999985385315799999999987999999999999999989999999 Q gi|254780874|r 3 FLSHKQICDLKNRFAEIELRMSESPSVDAYIKLTEEYAAISPIISKISIYDQKKQEEQDLHTV 65 (357) Q Consensus 3 mi~~~kL~~l~~rl~ELE~~Lsdp~~w~d~~klsKe~s~L~~iv~~~~~~~~~~~ei~el~eL 65 (357) |+.-+.+-+|.....+|-..+...+-...|.++.+.+..=..+-..+..|..+..+.++.+-. T Consensus 1 m~~t~e~~~Il~~a~eLa~mI~~se~~~~Y~~ak~~~~~d~~~q~li~~f~~~ke~ye~~q~f 63 (145) T pfam07050 1 MIITEELLDILDQAEQLADMILQSEEVENYRKAKQALENDREAQQLIQEFLKLKEQYEEVQRF 63 (145) T ss_pred CCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 935189999999999999999818999999999999984899999999999999999999987 No 54 >PRK05431 seryl-tRNA synthetase; Provisional Probab=44.64 E-value=24 Score=15.97 Aligned_cols=103 Identities=18% Similarity=0.277 Sum_probs=43.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHCCCCC---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHCCCCCHHHH-- Q ss_conf 78689899999999999998538531---57999999999879999999999999999899999-9972037853566-- Q gi|254780874|r 3 FLSHKQICDLKNRFAEIELRMSESPS---VDAYIKLTEEYAAISPIISKISIYDQKKQEEQDLH-TVIGDHNSDSEIR-- 76 (357) Q Consensus 3 mi~~~kL~~l~~rl~ELE~~Lsdp~~---w~d~~klsKe~s~L~~iv~~~~~~~~~~~ei~el~-eLl~e~e~D~El~-- 76 (357) |+..+. |+...+.+...+..-++ .+...++-++.+++ ...++.+....+.+..-. .+....++..++. T Consensus 1 MLDik~---IReNpe~v~~~lkkR~~~~~id~il~lD~~rr~l---~~e~e~Lr~erN~iSK~Ig~~kk~g~~~~~l~~~ 74 (422) T PRK05431 1 MLDIKL---IRENPDAVKEKLAKRGDPLDVDELLELDEERREL---QTETEELQAERNALSKEIGQAKAKGEDAEELIAE 74 (422) T ss_pred CCCHHH---HHHCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 988789---8749999999998659845199999999999999---9999999999999999999987489779999999 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHH--HHCCCCCCCCCC Q ss_conf -54332223320005778889998--640001135540 Q gi|254780874|r 77 -DLAQIEVLAIEKEIRELENEINY--LLLPKDTDDDKS 111 (357) Q Consensus 77 -elAeeEl~~l~~~l~~le~eL~~--~LLpsde~D~~n 111 (357) ....+++..+++++.++++++.. +.+|+-++++-+ T Consensus 75 ~~~lk~~i~~le~~~~~~~~~l~~~ll~IPNi~~~~VP 112 (422) T PRK05431 75 VKELKEEIKALEAELKELEAELEELLLSIPNLPHDSVP 112 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 99999999999999999999999999848998886578 No 55 >pfam00834 Ribul_P_3_epim Ribulose-phosphate 3 epimerase family. This enzyme catalyses the conversion of D-ribulose 5-phosphate into D-xylulose 5-phosphate. Probab=43.96 E-value=24 Score=15.89 Aligned_cols=58 Identities=19% Similarity=0.233 Sum_probs=41.1 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCE----------EEEEEECCCHHH Q ss_conf 378862268852178899999999999987649840013211354477211----------137775031112 Q gi|254780874|r 112 CILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKE----------IVATISGRGVFS 174 (357) Q Consensus 112 ailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~----------v~~~I~G~~aYg 174 (357) -+|.+.||-||.-=..-+-.-.+-..+|...+|++++|. =.||++. |...|.|.+.|+ T Consensus 132 LvMtV~PGf~GQ~f~~~~l~KI~~lr~~~~~~~~~~~I~-----vDGGIn~~ti~~l~~~Gad~~V~GSaiF~ 199 (201) T pfam00834 132 LLMSVNPGFGGQSFIPSVLPKIRKVRKMIDEGGLDTLIE-----VDGGVNLDNIPQIAEAGADVLVAGSAVFG 199 (201) T ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEE-----EECCCCHHHHHHHHHCCCCEEEECCEECC T ss_conf 998866898876456779999999999998269980799-----98988899999999879999997800245 No 56 >pfam04983 RNA_pol_Rpb1_3 RNA polymerase Rpb1, domain 3. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This domain, domain 3, represents the pore domain. The 3' end of RNA is positioned close to this domain. The pore delimited by this domain is thought to act as a channel through which nucleotides enter the active site and/or where the 3' end of the RNA may be extruded during back-tracking. Probab=42.98 E-value=23 Score=16.07 Aligned_cols=32 Identities=19% Similarity=0.285 Sum_probs=25.9 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHH Q ss_conf 88521788999999999999876498400132 Q gi|254780874|r 120 TGGSEAALFVGDLFRMYERYAALRKWKVEVLS 151 (357) Q Consensus 120 aGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~ 151 (357) -|..+|..|...+-+|=.+|+...||++-+-| T Consensus 126 ~G~~~~~~~ld~i~~l~~~~l~~~GfSvgi~D 157 (158) T pfam04983 126 YGPEETVKFLDRLQKLGFRYLTKSGFSIGIDD 157 (158) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHCCEEEEECC T ss_conf 29899999999999999999997786795205 No 57 >pfam08014 DUF1704 Domain of unknown function (DUF1704). This family contains many hypothetical proteins. Probab=41.82 E-value=26 Score=15.67 Aligned_cols=121 Identities=20% Similarity=0.170 Sum_probs=71.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHCCCCCCHH--HH-CCCCCCCCCEEEEEEECC------CHHHHHHCCCCCEEEEEECCC Q ss_conf 852178899999999999987649840013--21-135447721113777503------111210002465026761254 Q gi|254780874|r 121 GGSEAALFVGDLFRMYERYAALRKWKVEVL--SS-SDNDDGGYKEIVATISGR------GVFSRMKFESGVHRVQRVPAT 191 (357) Q Consensus 121 GG~EA~dwA~~L~rMY~r~ae~~gwk~eii--~~-~~ge~aGik~v~~~I~G~------~aYg~Lk~E~GvHRvqRip~~ 191 (357) --..|...+..+-.+-..|.+..++++.|. +- +.+-.+|=+ ++.|+-. .+=.++.+|.|||= .| T Consensus 106 ~~~~a~~~~~~~~~~~~~yf~~~~~~~~V~vsd~i~A~Amv~~~--~l~I~~~a~Fs~~dl~~L~~HE~gVHv-----~T 178 (348) T pfam08014 106 KRLSADEAVEYLQERADAYFEGDGFKCKVKLSDDIVADAMVSGD--TLKINKDAMFSERELDALIHHEGGVHL-----LT 178 (348) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHCCCC--EEEECCCCEECHHHHHHHHHHHHHHHE-----EE T ss_conf 67699999999999999983236871799978861178752798--799779984769999999876455300-----01 Q ss_pred CCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEEECCCCEEE-EE-CCCCCHHHHHHH Q ss_conf 5678511016888853033433222454347999852888752000632121588964984488-84-365413589999 Q gi|254780874|r 192 EASGRVHTSAATVAVLPEAAEIDVDIPLEDIRIDTMRASGSGGQHVNTTDSAVRITHIPTGIMV-TS-SEKSQHQNRLRA 269 (357) Q Consensus 192 es~gR~hTS~a~V~V~P~~~~~~~~i~~~dl~i~~~rssG~GGQ~vN~t~savri~H~ptgi~v-~~-~eRsQ~~Nk~~A 269 (357) --|||.|. +.+ ++.+-|| .|+.--|+.| |- ---|=--+| T Consensus 179 tlNG~~Qp--l~~----------------------l~~G~p~------------~t~TQEGLAVlsE~lsg~~t~~R--- 219 (348) T pfam08014 179 TLNGRAQP--LKV----------------------LSLGLPG------------NTETQEGLAVLSEYLTGSLTPER--- 219 (348) T ss_pred CCCCCCCC--CHH----------------------HCCCCCC------------CCCCCHHHHHHHHHHHCCCCHHH--- T ss_conf 01443487--166----------------------3667999------------87431099999999846887899--- Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999 Q gi|254780874|r 270 MKVLRARLYDVERKRMAN 287 (357) Q Consensus 270 ~~~L~~~l~~~~~~~~~~ 287 (357) |+.|-.|+.++.....-+ T Consensus 220 lr~La~RV~Av~~~~~Ga 237 (348) T pfam08014 220 LRTLALRVIAVDMMLKGA 237 (348) T ss_pred HHHHHHHHHHHHHHHHCC T ss_conf 999999999999998279 No 58 >COG0103 RpsI Ribosomal protein S9 [Translation, ribosomal structure and biogenesis] Probab=41.15 E-value=16 Score=17.27 Aligned_cols=19 Identities=21% Similarity=0.447 Sum_probs=16.0 Q ss_pred CCCCCCCCEEEEEEEEEEC Q ss_conf 4567851101688885303 Q gi|254780874|r 191 TEASGRVHTSAATVAVLPE 209 (357) Q Consensus 191 ~es~gR~hTS~a~V~V~P~ 209 (357) +-+.|||.||.|.|.+.|- T Consensus 6 ~~~tGkRK~AvArv~l~~g 24 (130) T COG0103 6 VYTTGKRKSAVARVRLVPG 24 (130) T ss_pred EEEECCCCCEEEEEEEECC T ss_conf 4640554326899999768 No 59 >PRK13902 alaS lanyl-tRNA synthetase; Provisional Probab=40.62 E-value=27 Score=15.54 Aligned_cols=23 Identities=30% Similarity=0.616 Sum_probs=13.9 Q ss_pred CEEEEEEE---CCCCCCCCCCCCCCC Q ss_conf 34799985---288875200063212 Q gi|254780874|r 220 EDIRIDTM---RASGSGGQHVNTTDS 242 (357) Q Consensus 220 ~dl~i~~~---rssG~GGQ~vN~t~s 242 (357) +.+||-.+ -..-.||-||+.|-- T Consensus 682 d~VRVV~ig~~s~ElCGGTHV~nTge 707 (898) T PRK13902 682 KEIRVVEIEGWDVEACGGTHVKNTGE 707 (898) T ss_pred CCEEEEEECCCEEECCCCCCCCCCCC T ss_conf 73589865780365237987787031 No 60 >KOG3358 consensus Probab=40.38 E-value=17 Score=17.08 Aligned_cols=18 Identities=33% Similarity=0.637 Sum_probs=13.3 Q ss_pred CCCCEEEEEECCCCEEEE Q ss_conf 321215889649844888 Q gi|254780874|r 239 TTDSAVRITHIPTGIMVT 256 (357) Q Consensus 239 ~t~savri~H~ptgi~v~ 256 (357) +-+-+||+.|+-||.-.+ T Consensus 145 ~r~~~vrl~Hi~T~~yLs 162 (211) T KOG3358 145 KRDARVRLQHIDTSVYLS 162 (211) T ss_pred CCCCEEEEEEECCCEEEE T ss_conf 002027878703313687 No 61 >PRK08178 acetolactate synthase 1 regulatory subunit; Reviewed Probab=40.05 E-value=21 Score=16.39 Aligned_cols=50 Identities=16% Similarity=0.135 Sum_probs=39.4 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEECCCH Q ss_conf 03788622688521788999999999999876498400132113544772111377750311 Q gi|254780874|r 111 SCILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIVATISGRGV 172 (357) Q Consensus 111 nailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~G~~a 172 (357) ..+++=+||+-.-=|.+|+ ++||.++-+...+++..|+...|+.+.|+.. T Consensus 12 SvlV~N~pGVL~RV~gLFs------------rRgyNIeSL~v~~te~~~~SRiTivv~~d~~ 61 (96) T PRK08178 12 ELTVRNHPGVMTHVCGLFA------------RRAFNVEGILCLPIQDSDKSRIWLLVNDDQR 61 (96) T ss_pred EEEEECCCCHHHHHHHHHH------------CCCCCCCEEEEEECCCCCCEEEEEEECCCCC T ss_conf 9999577878999988875------------0566812078851389981089999889844 No 62 >pfam08317 Spc7 Spc7 kinetochore protein. This domain is found in cell division proteins which are required for kinetochore-spindle association. Probab=39.67 E-value=28 Score=15.44 Aligned_cols=37 Identities=30% Similarity=0.324 Sum_probs=25.4 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHH Q ss_conf 03788622688521788999999999999876498400132 Q gi|254780874|r 111 SCILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLS 151 (357) Q Consensus 111 nailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~ 151 (357) .-+++.+-|-.-.| +..|.+=|..+-...||++.-++ T Consensus 263 e~~~e~~r~~t~~E----i~~Lk~~~~~Lq~l~gw~~~~~~ 299 (322) T pfam08317 263 EKILEECRGWSAKE----ISKLKAKVSLLQKLTGWKILSLS 299 (322) T ss_pred HHHHHHCCCCCHHH----HHHHHHHHHHHHHHHCCEEEEEC T ss_conf 99998516999999----99999999999998688067422 No 63 >COG4402 Uncharacterized protein conserved in bacteria [Function unknown] Probab=39.57 E-value=13 Score=17.84 Aligned_cols=13 Identities=15% Similarity=0.128 Sum_probs=8.7 Q ss_pred EEECCCCEEEEEC Q ss_conf 8964984488843 Q gi|254780874|r 246 ITHIPTGIMVTSS 258 (357) Q Consensus 246 i~H~ptgi~v~~~ 258 (357) |.|--||+.+|+- T Consensus 393 L~~p~~gea~c~A 405 (457) T COG4402 393 LLVPVSGEAVCAA 405 (457) T ss_pred EEEEECEEEEECC T ss_conf 9986110677235 No 64 >KOG0804 consensus Probab=38.85 E-value=29 Score=15.36 Aligned_cols=22 Identities=14% Similarity=0.345 Sum_probs=9.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHC Q ss_conf 2223320005778889998640 Q gi|254780874|r 81 IEVLAIEKEIRELENEINYLLL 102 (357) Q Consensus 81 eEl~~l~~~l~~le~eL~~~LL 102 (357) +.+.....++..|++.|.-++. T Consensus 428 ~~~~s~d~~I~dLqEQlrDlmf 449 (493) T KOG0804 428 EALGSKDEKITDLQEQLRDLMF 449 (493) T ss_pred HHHHHHHHHHHHHHHHHHHHHE T ss_conf 9987788888889999976860 No 65 >cd04878 ACT_AHAS N-terminal ACT domain of the Escherichia coli IlvH-like regulatory subunit of acetohydroxyacid synthase (AHAS). ACT_AHAS: N-terminal ACT domain of the Escherichia coli IlvH-like regulatory subunit of acetohydroxyacid synthase (AHAS). AHAS catalyses the first common step in the biosynthesis of the three branched-chain amino acids. The first step involves the condensation of either pyruvate or 2-ketobutyrate with the two-carbon hydroxyethyl fragment derived from another pyruvate molecule, covalently bound to the coenzyme thiamine diphosphate. Bacterial AHASs generally consist of regulatory and catalytic subunits. The effector (valine) binding sites are proposed to be located in two symmetrically related positions in the interface between a pair of N-terminal ACT domains with the C-terminal domain of IlvH contacting the catalytic dimer. Plants Arabidopsis and Oryza have tandem IlvH subunits; both the first and second ACT domain sequences are present in this CD. Members of Probab=38.65 E-value=21 Score=16.36 Aligned_cols=40 Identities=25% Similarity=0.349 Sum_probs=32.5 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEECCC Q ss_conf 99999999987649840013211354477211137775031 Q gi|254780874|r 131 DLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIVATISGRG 171 (357) Q Consensus 131 ~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~G~~ 171 (357) .|.|+=.- ..++||.++-+...+.+..|+...++-+.|+. T Consensus 13 vL~Rit~l-F~rRg~NI~Sl~v~~te~~~~sR~tivv~~~~ 52 (72) T cd04878 13 VLNRISGL-FARRGFNIESLTVGPTEDPGISRITIVVEGDD 52 (72) T ss_pred HHHHHHHH-HHHCCCCEEEEEEEECCCCCEEEEEEEEECCH T ss_conf 99999999-86078556789986137998489999997888 No 66 >KOG3409 consensus Probab=38.56 E-value=28 Score=15.43 Aligned_cols=70 Identities=24% Similarity=0.213 Sum_probs=47.3 Q ss_pred ECCCHHHHHHCCCCCEEEEEECCCCCC-CCCCEEEEEEEE-EECC---CC-C--CCCCCCCEEEEEEECCCCCCCCCCCC Q ss_conf 503111210002465026761254567-851101688885-3033---43-3--22245434799985288875200063 Q gi|254780874|r 168 SGRGVFSRMKFESGVHRVQRVPATEAS-GRVHTSAATVAV-LPEA---AE-I--DVDIPLEDIRIDTMRASGSGGQHVNT 239 (357) Q Consensus 168 ~G~~aYg~Lk~E~GvHRvqRip~~es~-gR~hTS~a~V~V-~P~~---~~-~--~~~i~~~dl~i~~~rssG~GGQ~vN~ 239 (357) -|+-.|+ +.=+|+-|.|-|-+||.. -.+.-||--=++ +..+ ++ . =+.+..+||-+-+.|||+.|+|-|-. T Consensus 91 V~d~~lk--~~FrglirkqdvR~tEkdrv~v~ksFrPgDiVlAkVis~~~~~~y~LTtAeneLGVV~a~as~~g~~M~pv 168 (193) T KOG3409 91 VGDKPLK--KSFRGLIRKQDVRATEKDRVKVYKSFRPGDIVLAKVISLGDGSNYLLTTAENELGVVFARASETGEPMVPV 168 (193) T ss_pred ECCEEHH--HHHCCEEEHHHCCCCCCCHHHHHHCCCCCCEEEEEEEECCCCCCEEEEEECCCCEEEEEECCCCCCCEEEC T ss_conf 8679700--11211433543564440112163436988688888851477872799875164238999514668821451 No 67 >pfam00509 Hemagglutinin Hemagglutinin. Hemagglutinin from influenza virus causes membrane fusion of the viral membrane with the host membrane. Fusion occurs after the host cell internalises the virus by endocytosis. The drop of pH causes release of a hydrophobic fusion peptide and a large conformational change leading to membrane fusion. Probab=38.47 E-value=29 Score=15.32 Aligned_cols=17 Identities=12% Similarity=0.272 Sum_probs=5.9 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 89899999999999998 Q gi|254780874|r 6 HKQICDLKNRFAEIELR 22 (357) Q Consensus 6 ~~kL~~l~~rl~ELE~~ 22 (357) |+-|+.|..++..|-.+ T Consensus 370 Q~AId~It~KlN~iiek 386 (550) T pfam00509 370 QKAIDQITGKLNSLIEK 386 (550) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 99999887788889998 No 68 >KOG1760 consensus Probab=37.76 E-value=30 Score=15.24 Aligned_cols=70 Identities=19% Similarity=0.236 Sum_probs=37.6 Q ss_pred CHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-----------------HHHHHHHHHHHHHHH Q ss_conf 15799999---9999879999999999999999899999997203785-----------------356654332223320 Q gi|254780874|r 28 SVDAYIKL---TEEYAAISPIISKISIYDQKKQEEQDLHTVIGDHNSD-----------------SEIRDLAQIEVLAIE 87 (357) Q Consensus 28 ~w~d~~kl---sKe~s~L~~iv~~~~~~~~~~~ei~el~eLl~e~e~D-----------------~El~elAeeEl~~l~ 87 (357) .|.|.+++ ++-+++...+...++.-.....+++|+..-+....+| +.+.++.++--..+. T Consensus 15 t~EDQq~iN~Fsrl~~R~~~lk~dik~~k~~~enledA~~EieL~Dedd~~Ip~~vGdvF~~~~~~~~~~~LEe~ke~l~ 94 (131) T KOG1760 15 TFEDQQNINEFSRLNSRKDDLKADIKEAKTEIENLEDASNEIELLDEDDEDIPFKVGDVFIHVKLDKLQDQLEEKKETLE 94 (131) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEHHHHHEECCHHHHHHHHHHHHHHHH T ss_conf 48879989999998735889999999999999879987766732276656466031010025157889999999999999 Q ss_pred HHHHHHHHHH Q ss_conf 0057788899 Q gi|254780874|r 88 KEIRELENEI 97 (357) Q Consensus 88 ~~l~~le~eL 97 (357) +.++.++.++ T Consensus 95 k~i~~les~~ 104 (131) T KOG1760 95 KEIEELESEL 104 (131) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 69 >KOG1290 consensus Probab=36.37 E-value=13 Score=17.84 Aligned_cols=10 Identities=30% Similarity=0.840 Sum_probs=3.6 Q ss_pred HHHHHHHHHH Q ss_conf 6999999999 Q gi|254780874|r 336 IDDIINPLLT 345 (357) Q Consensus 336 l~~~i~~~~~ 345 (357) +.+|+-++|. T Consensus 526 fsdFL~PmLe 535 (590) T KOG1290 526 FSDFLSPMLE 535 (590) T ss_pred HHHHHHHHHH T ss_conf 9998878875 No 70 >TIGR00651 pta phosphate acetyltransferase; InterPro: IPR004614 Phosphate acetyltransferase (2.3.1.8 from EC) catalyses the conversion of acetyl-CoA and phosphate to CoA and acetyl phosphate in the last two steps in the conversion of actetae to acetyl-CoA.; GO: 0016407 acetyltransferase activity. Probab=36.14 E-value=2.9 Score=22.60 Aligned_cols=119 Identities=19% Similarity=0.301 Sum_probs=69.3 Q ss_pred CEEEEEEEEEECCCC-CCCC---------CCCCEEEE-----EEECCCCCCCCCCCCCCCEEEEEECCCCEEEEECCCCC Q ss_conf 101688885303343-3222---------45434799-----98528887520006321215889649844888436541 Q gi|254780874|r 198 HTSAATVAVLPEAAE-IDVD---------IPLEDIRI-----DTMRASGSGGQHVNTTDSAVRITHIPTGIMVTSSEKSQ 262 (357) Q Consensus 198 hTS~a~V~V~P~~~~-~~~~---------i~~~dl~i-----~~~rssG~GGQ~vN~t~savri~H~ptgi~v~~~eRsQ 262 (357) -=+=|+|.|.|.+++ -+|- +...++|+ -|.=|||--|..|-|+-.||||.|- T Consensus 161 ~F~DCAv~~~P~a~~LAeIA~~sA~sak~f~~~epkVAlLSYST~gSgGd~g~~veKV~~A~~I~~E------------- 227 (322) T TIGR00651 161 VFADCAVNVDPNAEQLAEIAIQSAKSAKSFGEIEPKVALLSYSTKGSGGDSGEEVEKVREATRIAKE------------- 227 (322) T ss_pred EEEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHC------------- T ss_conf 8851245029887899999999999999846899608998302455655541333899999999853------------- Q ss_pred HHHHHHHH-HHHHHHH-HHHHHHHHHHHHHHHHHHHCCCCCCCCCEEECCCCCCCEEEEECCCEECCHHHHHCC--CHHH Q ss_conf 35899999-9999999-999999999999874465224674238221044899811243216441787787288--6699 Q gi|254780874|r 263 HQNRLRAM-KVLRARL-YDVERKRMANERSANRKLQIGSGDRSERIRTYNFSQGRITDHRINLTLYKLEYVLQG--YIDD 338 (357) Q Consensus 263 ~~Nk~~A~-~~L~~~l-~~~~~~~~~~~~~~~r~~~~g~~~rs~~iRtY~~~~~rv~DhR~~~~~~~~~~~l~G--~l~~ 338 (357) |.-=| +.+-.-| |+...-+...+. ..+.+.|. | +--.|+|| |.-+|-..|.+-.=+.+ -+.+ T Consensus 228 ---k~pdL~~~~dGELQ~DAA~v~~Va~~-KaP~S~v~-G----~AnvfvFP-----dL~aGNigYK~vQR~~~~~AiGP 293 (322) T TIGR00651 228 ---KRPDLKATIDGELQFDAAVVEKVAEK-KAPNSPVA-G----KANVFVFP-----DLDAGNIGYKIVQRLADAEAIGP 293 (322) T ss_pred ---CCHHHHHHCCCCHHHHHHHHHHHHHH-HCCCCCCC-C----CEEEEECC-----CCCCCHHHHHHHHHHCCCEEECC T ss_conf ---87137640587301114323888964-07898655-6----35588716-----87740578888876237723440 Q ss_pred HHHHH Q ss_conf 99999 Q gi|254780874|r 339 IINPL 343 (357) Q Consensus 339 ~i~~~ 343 (357) |+..+ T Consensus 294 iLQGl 298 (322) T TIGR00651 294 ILQGL 298 (322) T ss_pred HHHHH T ss_conf 67521 No 71 >TIGR00106 TIGR00106 conserved hypothetical protein TIGR00106; InterPro: IPR002767 This entry contains several hypothetical proteins of unknown function found in archaebacteria, eukaryotes and eubacteria. The structures of YBL001c from Saccharomyces cerevisiae and its homologue MTH1187 from the archaea Methanobacterium thermoautotrophicum have been determined . These proteins have a ferredoxin-like alpha/beta sandwich structure with anti-parallel beta-sheets. Generally, they have two domains that form a single beta-sheet dimer, where two dimers pack sheet-to-sheet into a tetramer, some proteins having an extra C-terminal helix. . Probab=36.10 E-value=25 Score=15.75 Aligned_cols=32 Identities=16% Similarity=0.403 Sum_probs=23.5 Q ss_pred EECCCEECCHHHHHCC-CHHHHHHHHHHHHHHH Q ss_conf 3216441787787288-6699999999999999 Q gi|254780874|r 319 HRINLTLYKLEYVLQG-YIDDIINPLLTAHQAK 350 (357) Q Consensus 319 hR~~~~~~~~~~~l~G-~l~~~i~~~~~~~~~~ 350 (357) .-+.++++.....+.| +||+++.++..-|++- T Consensus 35 ~~lk~~l~~mGT~iEgEdld~l~~~~~~~Hea~ 67 (103) T TIGR00106 35 EGLKYELHAMGTLIEGEDLDELFEAIKAIHEAV 67 (103) T ss_pred CCCCEEECCCCCEEECCCHHHHHHHHHHHHHHH T ss_conf 577434425765230469789999999986547 No 72 >pfam02298 Cu_bind_like Plastocyanin-like domain. This family represents a domain found in flowering plants related to the copper binding protein plastocyanin. Some members of this family may not bind copper due to the lack of key residues. Probab=35.55 E-value=15 Score=17.51 Aligned_cols=34 Identities=24% Similarity=0.541 Sum_probs=25.0 Q ss_pred HHHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEECCCHHHHHHCCCCCEEEEEECCCC Q ss_conf 999987649840013211354477211137775031112100024650267612545 Q gi|254780874|r 136 YERYAALRKWKVEVLSSSDNDDGGYKEIVATISGRGVFSRMKFESGVHRVQRVPATE 192 (357) Q Consensus 136 Y~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~G~~aYg~Lk~E~GvHRvqRip~~e 192 (357) |..||..+.|.+ ||. | .+++.+|.|-|+.|+..+ T Consensus 9 Y~~Wa~~~~F~v-------GD~-------L---------vF~Y~~~~h~V~~V~~~~ 42 (84) T pfam02298 9 YTLWASGKTFRV-------GDT-------L---------VFNYDKGFHNVVEVTKAD 42 (84) T ss_pred HHHHHCCCCEEC-------CCE-------E---------EEECCCCCCCEEEECHHH T ss_conf 789767997838-------999-------9---------995169875189978688 No 73 >PRK11895 ilvH acetolactate synthase 3 regulatory subunit; Reviewed Probab=35.44 E-value=26 Score=15.65 Aligned_cols=61 Identities=20% Similarity=0.258 Sum_probs=44.0 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEECCC-----HHHHHHCCCCCEEEEEECCCC Q ss_conf 99999999987649840013211354477211137775031-----112100024650267612545 Q gi|254780874|r 131 DLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIVATISGRG-----VFSRMKFESGVHRVQRVPATE 192 (357) Q Consensus 131 ~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~G~~-----aYg~Lk~E~GvHRvqRip~~e 192 (357) -|.|+=. -..++||.++-+...|++..|+...|+.+.|+. .-..|.-=--||.|.-+.+.+ T Consensus 15 vL~Risg-lFsrRg~NI~SL~v~~te~~~iSR~Tiv~~g~~~~i~QI~kQL~KLidVi~V~dlt~~~ 80 (161) T PRK11895 15 VLSRVAG-LFSRRGYNIESLTVAPTEDPGLSRITIVTSGDERVLEQITKQLNKLIDVLKVVDLTEEA 80 (161) T ss_pred HHHHHHH-HHHCCCCCEEEEEEECCCCCCCEEEEEEEECCHHHHHHHHHHHHCCCCEEEEEECCCCC T ss_conf 9999999-98506865466665024799815999999689999999999996321403466458854 No 74 >KOG0081 consensus Probab=34.89 E-value=32 Score=15.00 Aligned_cols=45 Identities=22% Similarity=0.390 Sum_probs=22.6 Q ss_pred CCCCCEEEEEECCCCCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEEEECCCCCCC Q ss_conf 024650267612545678511016888853033433222454347999852888752 Q gi|254780874|r 178 FESGVHRVQRVPATEASGRVHTSAATVAVLPEAAEIDVDIPLEDIRIDTMRASGSGG 234 (357) Q Consensus 178 ~E~GvHRvqRip~~es~gR~hTS~a~V~V~P~~~~~~~~i~~~dl~i~~~rssG~GG 234 (357) ..|||-.--=.-. =+.|+-||-|.|- +-|+..++-+ +|+|||||| T Consensus 16 GDSGVGKTs~Ly~-YTD~~F~~qFIsT--------VGIDFreKrv---vY~s~gp~g 60 (219) T KOG0081 16 GDSGVGKTSFLYQ-YTDGKFNTQFIST--------VGIDFREKRV---VYNSSGPGG 60 (219) T ss_pred CCCCCCCEEEEEE-ECCCCCCCEEEEE--------EECCCCCCEE---EEECCCCCC T ss_conf 4677773478987-3278533435777--------4013440047---883468998 No 75 >PHA00159 endonuclease I Probab=34.58 E-value=21 Score=16.33 Aligned_cols=15 Identities=7% Similarity=0.049 Sum_probs=12.0 Q ss_pred HHHHHHHHCCCCCCH Q ss_conf 999998764984001 Q gi|254780874|r 135 MYERYAALRKWKVEV 149 (357) Q Consensus 135 MY~r~ae~~gwk~ei 149 (357) -|-.||+++||.+-- T Consensus 108 TYa~WCeKhgf~wa~ 122 (149) T PHA00159 108 SYAEWCEKHGILFAD 122 (149) T ss_pred CHHHHHHHCCCCCCC T ss_conf 399999874997457 No 76 >PRK08566 DNA-directed RNA polymerase subunit alpha; Validated Probab=34.15 E-value=34 Score=14.85 Aligned_cols=31 Identities=32% Similarity=0.308 Sum_probs=14.6 Q ss_pred CHHHHHHCCCCCEEEEEECCCCCCCCCCEEEEEEEE Q ss_conf 111210002465026761254567851101688885 Q gi|254780874|r 171 GVFSRMKFESGVHRVQRVPATEASGRVHTSAATVAV 206 (357) Q Consensus 171 ~aYg~Lk~E~GvHRvqRip~~es~gR~hTS~a~V~V 206 (357) +..-.|+.-.|-=| -.=...|+-=|.-||.+ T Consensus 299 ~i~q~L~gK~GrfR-----~nlmGKRVd~saRSVIs 329 (881) T PRK08566 299 TLAQRLKGKEGRFR-----GNLSGKRVNFSARTVIS 329 (881) T ss_pred CHHHHHCCCCCEEE-----CCCCCCCCCCCCEEEEC T ss_conf 18888546663010-----02244533554303432 No 77 >pfam05367 Phage_endo_I Phage endonuclease I. The bacteriophage endonuclease I is a nuclease that is selective for the structure of the four-way Holliday DNA junction. Probab=33.73 E-value=22 Score=16.17 Aligned_cols=14 Identities=14% Similarity=0.394 Sum_probs=11.3 Q ss_pred HHHHHHHHCCCCCC Q ss_conf 99999876498400 Q gi|254780874|r 135 MYERYAALRKWKVE 148 (357) Q Consensus 135 MY~r~ae~~gwk~e 148 (357) .|-.||+++||++- T Consensus 108 TYa~WCeKhGf~wa 121 (149) T pfam05367 108 TYAEWCEKHGFKFA 121 (149) T ss_pred CHHHHHHHCCCCCC T ss_conf 39999987499745 No 78 >pfam10275 Peptidase_C65 Peptidase C65 Otubain. This family of proteins conserved from plants to humans is a highly specific ubiquitin iso-peptidase that removes ubiquitin from proteins. The modification of cellular proteins by ubiquitin (Ub) is an important event that underlies protein stability and function in eukaryote being a dynamic and reversible process. Otubain carries several key conserved domains: (i) the OTU (ovarian tumour domain) in which there is an active cysteine protease triad (ii) a nuclear localisation signal, (iii) a Ub interaction motif (UIM)-like motif phi-xx-A-xxxs-xx-Ac (where phi indicates an aromatic amino acid, x indicates any amino acid and Ac indicates an acidic amino acid), (iv) a Ub-associated (UBA)-like domain and (v) the LxxLL motif. Probab=33.52 E-value=35 Score=14.78 Aligned_cols=118 Identities=13% Similarity=0.108 Sum_probs=44.5 Q ss_pred HHHHHHHHHHHHHHHHHCCCC-CHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 989999999999999853853-15799999999987999999999--999999989999999720378535665433222 Q gi|254780874|r 7 KQICDLKNRFAEIELRMSESP-SVDAYIKLTEEYAAISPIISKIS--IYDQKKQEEQDLHTVIGDHNSDSEIRDLAQIEV 83 (357) Q Consensus 7 ~kL~~l~~rl~ELE~~Lsdp~-~w~d~~klsKe~s~L~~iv~~~~--~~~~~~~ei~el~eLl~e~e~D~El~elAeeEl 83 (357) .|+..|.+++..+-.--.|.+ ||-.+. ++=|+.++...+ ++......+..+...+....-+..+.+..-++. T Consensus 31 ~Ki~~L~~~y~~~RrvRgDGNCFyRaf~-----F~ylE~Ll~~~~~~e~~~f~~~i~~~~~~l~~~g~~~~~~edf~~~f 105 (237) T pfam10275 31 KKIKDLAKKYSGIRRVRGDGNCFYRAFG-----FSYLEILLEKKDQAEINRFIARIESLKELLVALGFDEFTFEDFVDEF 105 (237) T ss_pred HHHHHHHHHHHHHEEECCCCCHHHHHHH-----HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 9999999876661201578834799999-----99999998589999999999999998999998599479999999999 Q ss_pred HHHHHHHHHHHHHH-HHHHCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHH-HHHHHHHHCCCC Q ss_conf 33200057788899-9864000113554037886226885217889999999-999998764984 Q gi|254780874|r 84 LAIEKEIRELENEI-NYLLLPKDTDDDKSCILEIRAGTGGSEAALFVGDLFR-MYERYAALRKWK 146 (357) Q Consensus 84 ~~l~~~l~~le~eL-~~~LLpsde~D~~nailEI~aGaGG~EA~dwA~~L~r-MY~r~ae~~gwk 146 (357) ..+.+++....... +++-.++++.-+.. .=+.|| +...|...+-.. T Consensus 106 ~~~l~~i~~~~~~~~~ll~~fnd~~~s~y-----------------iv~flRlltS~~l~~~~~~ 153 (237) T pfam10275 106 IELLEKLEKGGETEEELLKAFNDPEYSDY-----------------IVIFFRLLTSAYIKENADE 153 (237) T ss_pred HHHHHHHHCCCCCHHHHHHHHHCCCCCCH-----------------HHHHHHHHHHHHHHHCHHH T ss_conf 99999874578658999999847576511-----------------8999999999999988999 No 79 >PRK11546 zraP zinc resistance protein; Provisional Probab=33.42 E-value=35 Score=14.77 Aligned_cols=48 Identities=10% Similarity=0.137 Sum_probs=28.8 Q ss_pred HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999998999999972037853566543322233200057788899986 Q gi|254780874|r 53 DQKKQEEQDLHTVIGDHNSDSEIRDLAQIEVLAIEKEIRELENEINYL 100 (357) Q Consensus 53 ~~~~~ei~el~eLl~e~e~D~El~elAeeEl~~l~~~l~~le~eL~~~ 100 (357) .++...-.|+..|+.-.+.|.+-..-+..|+..|..++.+.--+.... T Consensus 64 qqL~aKr~ElNALL~at~PD~~kI~avakEis~LR~kl~e~rv~~dv~ 111 (139) T PRK11546 64 QQLVSKRYEYNALLTANPPDSSKINAVAKEMETLRQSLDEQRVKRDIA 111 (139) T ss_pred HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999998999999980699779999999999999999999999999999 No 80 >CHL00100 ilvH acetohydroxyacid synthase small subunit Probab=33.33 E-value=35 Score=14.76 Aligned_cols=65 Identities=18% Similarity=0.236 Sum_probs=50.4 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEECCC-----HHHHHHCCCCCEEEEEECCCCCCCC Q ss_conf 99999999987649840013211354477211137775031-----1121000246502676125456785 Q gi|254780874|r 131 DLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIVATISGRG-----VFSRMKFESGVHRVQRVPATEASGR 196 (357) Q Consensus 131 ~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~G~~-----aYg~Lk~E~GvHRvqRip~~es~gR 196 (357) .|.|+=.- +.++||.++=+...|++..|+...|+-+.|+. .-..|.-=--||+|.-+++.++=.| T Consensus 15 vL~RV~gl-FsrRgyNIeSL~V~~te~~~iSR~Tiv~~gd~~~ieQI~kQL~KLidVi~V~dlt~~~~v~R 84 (172) T CHL00100 15 VLTRIAGL-FARRGFNIESLAVGPAEQIGISRITMVVPGDNRTIEQLTKQLYKLVNILKVQDITNIPSVER 84 (172) T ss_pred HHHHHHHH-HHCCCCCEEEEEEECCCCCCEEEEEEEEECCHHHHHHHHHHHHCCCCEEEEEECCCCCHHHH T ss_conf 99999999-85178672369972148998159999996899999999999955827026430678445643 No 81 >TIGR02898 spore_YhcN_YlaJ sporulation lipoprotein, YhcN/YlaJ family; InterPro: IPR014247 This entry contains YhcN and YlaJ, which are predicted lipoproteins that have been detected as spore proteins but not vegetative proteins in Bacillus subtilis. Both appear to be expressed under control of the RNA polymerase sigma-G factor. The YlaJ-like members of this family have a low-complexity, strongly acidic, 40-residue C-terminal domain. . Probab=33.30 E-value=20 Score=16.47 Aligned_cols=17 Identities=24% Similarity=0.458 Sum_probs=9.9 Q ss_pred CCCCCEEEEEEECCCHH Q ss_conf 47721113777503111 Q gi|254780874|r 157 DGGYKEIVATISGRGVF 173 (357) Q Consensus 157 ~aGik~v~~~I~G~~aY 173 (357) .-|||+|++-|.|.+|| T Consensus 84 ~~gvk~A~vvi~g~~A~ 100 (185) T TIGR02898 84 VKGVKDATVVITGNRAY 100 (185) T ss_pred CCCCCEEEEEEECCEEE T ss_conf 78862114887468158 No 82 >COG3378 Phage associated DNA primase [General function prediction only] Probab=33.20 E-value=35 Score=14.74 Aligned_cols=12 Identities=8% Similarity=0.099 Sum_probs=6.2 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999999 Q gi|254780874|r 338 DIINPLLTAHQA 349 (357) Q Consensus 338 ~~i~~~~~~~~~ 349 (357) .+-++|+.|..+ T Consensus 441 ~ly~~y~~w~e~ 452 (517) T COG3378 441 ELYEAYQEWCEA 452 (517) T ss_pred HHHHHHHHHHHH T ss_conf 899999999975 No 83 >PRK00132 rpsI 30S ribosomal protein S9; Reviewed Probab=33.20 E-value=30 Score=15.24 Aligned_cols=30 Identities=20% Similarity=0.361 Sum_probs=20.2 Q ss_pred CCCCCCCCEEEEEEEEEECCCCCCCCCCCCEE Q ss_conf 45678511016888853033433222454347 Q gi|254780874|r 191 TEASGRVHTSAATVAVLPEAAEIDVDIPLEDI 222 (357) Q Consensus 191 ~es~gR~hTS~a~V~V~P~~~~~~~~i~~~dl 222 (357) |-+-|||.||.|-|.+.|-.- .|.||..++ T Consensus 6 ~~~~GrRKtaiArv~l~~G~G--~i~INgk~~ 35 (130) T PRK00132 6 YYGTGRRKSAVARVRLKPGSG--KITVNGRDL 35 (130) T ss_pred EEEECCCCCEEEEEEEECCCC--EEEECCEEH T ss_conf 998557605899999986885--799958129 No 84 >pfam12112 DUF3579 Protein of unknown function (DUF3579). This family of proteins is functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 98 to 126 amino acids in length. This protein has a conserved FRP sequence motif. Probab=33.14 E-value=22 Score=16.14 Aligned_cols=64 Identities=13% Similarity=0.106 Sum_probs=35.2 Q ss_pred EEEEECCCCCC-HHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEEEEE----ECCCHHHHHHC Q ss_conf 78862268852-17889999999999998764984001321135447721113777----50311121000 Q gi|254780874|r 113 ILEIRAGTGGS-EAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIVATI----SGRGVFSRMKF 178 (357) Q Consensus 113 ilEI~aGaGG~-EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~~~I----~G~~aYg~Lk~ 178 (357) |..|+. .|-+ --.|||+.|.-+...|--.+.+.+... ..|.-.+|+|.+.+.- .-+.||..+.. T Consensus 10 I~GiT~-~Gk~FRPSDWaERL~Gv~a~f~~~~r~~YSp~-~~P~~~~G~kcvvvd~~L~~~~P~a~~fvm~ 78 (93) T pfam12112 10 IQGVTR-AGKTFRPSDWAERLCGVMSSFRPGGRLSYSPY-VRPGVIGGVKCVVVDARLRDIEPMAFDFLMN 78 (93) T ss_pred EEEECC-CCCCCCCCHHHHHHHHHHHHCCCCCCEEECCC-CCCEEECCEEEEEECCHHHHCCHHHHHHHHH T ss_conf 996738-99982885299997411540089994762675-4542658857999860743519599999998 No 85 >COG3978 Acetolactate synthase (isozyme II), small (regulatory) subunit [Function unknown] Probab=32.60 E-value=35 Score=14.79 Aligned_cols=52 Identities=15% Similarity=0.169 Sum_probs=44.0 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEECCCHHHHHHC Q ss_conf 8999999999999876498400132113544772111377750311121000 Q gi|254780874|r 127 LFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIVATISGRGVFSRMKF 178 (357) Q Consensus 127 dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~G~~aYg~Lk~ 178 (357) .|+..++.=-.|--+.+||.+--++.++.-.||.-++.+.|.|+.++..|.+ T Consensus 11 r~~pe~leRVLrvtrhRGF~vcamnmt~~~da~~~nie~tV~s~R~~~lL~~ 62 (86) T COG3978 11 RFNPETLERVLRVTRHRGFRVCAMNMTAAVDAGNANIELTVDSDRSVDLLTS 62 (86) T ss_pred CCCHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCCEEEEEECCCCCHHHHHH T ss_conf 6795899999988651575999852531123663059999858988689999 No 86 >KOG2150 consensus Probab=31.26 E-value=38 Score=14.53 Aligned_cols=48 Identities=15% Similarity=0.230 Sum_probs=32.0 Q ss_pred HHHHHHHHHHHHHHHHCCCCCHHHH----HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8999999999999985385315799----99999998799999999999999 Q gi|254780874|r 8 QICDLKNRFAEIELRMSESPSVDAY----IKLTEEYAAISPIISKISIYDQK 55 (357) Q Consensus 8 kL~~l~~rl~ELE~~Lsdp~~w~d~----~klsKe~s~L~~iv~~~~~~~~~ 55 (357) |+.+-..-|+++-.++.+.++.+.- ..|.||.++|..+-+.++.|.+. T Consensus 16 Kv~Egve~Fd~i~ek~~~~~n~sqkeK~e~DLKkEIKKLQRlRdQIKtW~ss 67 (575) T KOG2150 16 KVDEGVEIFDEIYEKLHSANNVSQKEKLESDLKKEIKKLQRLRDQIKTWQSS 67 (575) T ss_pred HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 8654688999999998724780477888889999999999999999864144 No 87 >TIGR02153 gatD_arch glutamyl-tRNA(Gln) amidotransferase, subunit D; InterPro: IPR011878 This peptide is found only in the Archaea. It is part of a heterodimer, with GatD (IPR004414 from INTERPRO), that acts as an amidotransferase on misacylated Glu-tRNA(Gln) to produce Gln-tRNA(Gln). The analogous amidotransferase found in bacteria is the GatABC system, although GatABC homologs in the Archaea appear to act instead on Asp-tRNA(Asn) .; GO: 0006450 regulation of translational fidelity. Probab=31.22 E-value=16 Score=17.30 Aligned_cols=111 Identities=20% Similarity=0.345 Sum_probs=55.4 Q ss_pred CCCCHHHHHHH-HHHHHHHHHHHHCCCCCCHH--------HHCCCC-------------CCCCCEEEEEEECC--CHHHH Q ss_conf 88521788999-99999999987649840013--------211354-------------47721113777503--11121 Q gi|254780874|r 120 TGGSEAALFVG-DLFRMYERYAALRKWKVEVL--------SSSDND-------------DGGYKEIVATISGR--GVFSR 175 (357) Q Consensus 120 aGG~EA~dwA~-~L~rMY~r~ae~~gwk~eii--------~~~~ge-------------~aGik~v~~~I~G~--~aYg~ 175 (357) |=||+=+-|++ =| -|+=+.|..+=|+ |.-.+| .|=|.+|++--=|. .-|- T Consensus 149 aHGTDTM~YTAaAL-----SFm~~~Gl~~PvVlVGAQRSSDRPSSDAa~NL~~A~~~A~~a~~aEV~VvMHG~tsD~yc- 222 (413) T TIGR02153 149 AHGTDTMAYTAAAL-----SFMFKTGLPVPVVLVGAQRSSDRPSSDAALNLIAAVRAATSAPIAEVVVVMHGETSDTYC- 222 (413) T ss_pred EECCCHHHHHHHHH-----HHHHHCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCEE- T ss_conf 41587468999999-----999971699988998574358887313799999999983369710038884012488679- Q ss_pred HHCCCCCEEEEEECCCCCCCCCCEE----EEEEEEEECCC--CCCCC-CCCCEEEEEEECCCCC-CCCCCCCCCCEEEEE Q ss_conf 0002465026761254567851101----68888530334--33222-4543479998528887-520006321215889 Q gi|254780874|r 176 MKFESGVHRVQRVPATEASGRVHTS----AATVAVLPEAA--EIDVD-IPLEDIRIDTMRASGS-GGQHVNTTDSAVRIT 247 (357) Q Consensus 176 Lk~E~GvHRvqRip~~es~gR~hTS----~a~V~V~P~~~--~~~~~-i~~~dl~i~~~rssG~-GGQ~vN~t~savri~ 247 (357) =|||=|||= +.||| |=||.-.|.+. ..++. =.-.-|+ +.||..|- +=-==++-|..|=|+ T Consensus 223 -----~~HRGVkVR------KMHTSRRDAF~SiN~~PiA~~~~~~~~~~~~~~L~-~dYr~r~~~~le~~~~~EekVaLv 290 (413) T TIGR02153 223 -----LVHRGVKVR------KMHTSRRDAFQSINSIPIAKVDPKSLKEKEIEKLR-EDYRRRGEKELELDDKFEEKVALV 290 (413) T ss_pred -----EEECCCEEE------CCCCCCHHHHHCCCCCCCEEECCCCCCCCEEEEEE-EEEEECCCCCCCCCCCCCCEEEEE T ss_conf -----996582350------36588102553158865358817778744077754-211313776532068877137999 Q ss_pred E Q ss_conf 6 Q gi|254780874|r 248 H 248 (357) Q Consensus 248 H 248 (357) - T Consensus 291 K 291 (413) T TIGR02153 291 K 291 (413) T ss_pred E T ss_conf 8 No 88 >pfam04800 ETC_C1_NDUFA4 ETC complex I subunit conserved region. Family of pankaryotic NADH-ubiquinone oxidoreductase subunits (EC:1.6.5.3) (EC:1.6.99.3) from complex I of the electron transport chain initially identified in Neurospora crassa as a 21 kDa protein. Probab=31.21 E-value=21 Score=16.35 Aligned_cols=18 Identities=17% Similarity=0.394 Sum_probs=14.4 Q ss_pred HHHHHHCCCCCCHHHHCC Q ss_conf 999876498400132113 Q gi|254780874|r 137 ERYAALRKWKVEVLSSSD 154 (357) Q Consensus 137 ~r~ae~~gwk~eii~~~~ 154 (357) ..||+++||.++|++-.. T Consensus 62 i~YA~k~gi~y~V~ep~~ 79 (101) T pfam04800 62 IAFAERQGWEYDVEEPNA 79 (101) T ss_pred HHHHHHCCCEEEEECCCC T ss_conf 999998498289957886 No 89 >pfam01403 Sema Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor and the human Plexin-A3 precursor. Probab=30.15 E-value=39 Score=14.40 Aligned_cols=77 Identities=19% Similarity=0.205 Sum_probs=44.2 Q ss_pred ECCCHHHHHHCCCC-------C-EEEEEECCCCCCCCCC------EEEEEEEE---EEC----------CCCCCC---CC Q ss_conf 50311121000246-------5-0267612545678511------01688885---303----------343322---24 Q gi|254780874|r 168 SGRGVFSRMKFESG-------V-HRVQRVPATEASGRVH------TSAATVAV---LPE----------AAEIDV---DI 217 (357) Q Consensus 168 ~G~~aYg~Lk~E~G-------v-HRvqRip~~es~gR~h------TS~a~V~V---~P~----------~~~~~~---~i 217 (357) .|+++|=++.-.+. + =||.||-.-|..|... |||.-.-. .|- +-++-+ .- T Consensus 182 ~~~~vYFff~e~~~e~~~~~~~~SRVaRvCk~D~gg~~~~e~~~wttFlKarL~Cs~~g~~~~~~~fN~lQ~a~~~~~~~ 261 (434) T pfam01403 182 SDDKVYFFFRETAVEDSNCKAIHSRVARVCKNDPGGRSYLELNKWTTFLKARLNCSVPGREGIPFYFNELQAAFVLPTGA 261 (434) T ss_pred CCCEEEEEEEEECCCCCCCCEEEEEEEEECCCCCCCCCEEEECCEEEECCEEEEECCCCCCCCCCCCCHHEEEEECCCCC T ss_conf 79999999988852367885468889994788888752455112576000147614788666776561313888546888 Q ss_pred CCCEEEEEEECCCCCCCCCCCCCCCEEEEEEC Q ss_conf 54347999852888752000632121588964 Q gi|254780874|r 218 PLEDIRIDTMRASGSGGQHVNTTDSAVRITHI 249 (357) Q Consensus 218 ~~~dl~i~~~rssG~GGQ~vN~t~savri~H~ 249 (357) +..++-.-+|..+.+++. .|||++-.+ T Consensus 262 ~~~~vlygvFst~~~~~~-----~SAvCvfsl 288 (434) T pfam01403 262 DTDPVLYGVFTTSSNSSA-----GSAVCAFSM 288 (434) T ss_pred CCCCEEEEEECCCCCCCC-----CEEEEEEEH T ss_conf 878689999707889998-----418999789 No 90 >KOG4044 consensus Probab=30.02 E-value=39 Score=14.38 Aligned_cols=126 Identities=18% Similarity=0.228 Sum_probs=53.5 Q ss_pred HHHHHHHHHHHCCCCCCHHHHCCC--CCCCCCEEEEEEECCCHHHHHHCCCCCEEEEEECCCCCCCC----CCEEEEEEE Q ss_conf 999999998764984001321135--44772111377750311121000246502676125456785----110168888 Q gi|254780874|r 132 LFRMYERYAALRKWKVEVLSSSDN--DDGGYKEIVATISGRGVFSRMKFESGVHRVQRVPATEASGR----VHTSAATVA 205 (357) Q Consensus 132 L~rMY~r~ae~~gwk~eii~~~~g--e~aGik~v~~~I~G~~aYg~Lk~E~GvHRvqRip~~es~gR----~hTS~a~V~ 205 (357) |..|-.+|.-.-.|=-++++.+.- ..|-+=++-+.+.-.|.-|.=+. ||..|..|- -.|-|.-| T Consensus 20 lCDmQEKFrpai~yf~~iIs~~~rLl~aaril~vP~ivTEqYP~gLG~T---------V~eLd~~g~~~~~~KT~FSM~- 89 (201) T KOG4044 20 LCDMQEKFRPAIPYFPSIISVTTRLLAAARILQVPVIVTEQYPEGLGKT---------VPELDIEGLKLNLSKTKFSMV- 89 (201) T ss_pred EECHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCC---------CHHHCHHHHCCCCCCCCEEEE- T ss_conf 9651766262341367899999999976555177478520053200665---------313031100002564111355- Q ss_pred EEECCCCCCCCC-CCCEEEEEEECCCCCCCCCCCCCCCEEEEEE--CCCCEEE-EECCCCCHHHHHHHHHHHHH Q ss_conf 530334332224-5434799985288875200063212158896--4984488-84365413589999999999 Q gi|254780874|r 206 VLPEAAEIDVDI-PLEDIRIDTMRASGSGGQHVNTTDSAVRITH--IPTGIMV-TSSEKSQHQNRLRAMKVLRA 275 (357) Q Consensus 206 V~P~~~~~~~~i-~~~dl~i~~~rssG~GGQ~vN~t~savri~H--~ptgi~v-~~~eRsQ~~Nk~~A~~~L~~ 275 (357) +|+.++.--+| +.+++..=-.- -||---.+|.-+.- +---||+ ||..||+ +.|..|+..+|. T Consensus 90 -~p~v~~s~~~i~~~k~VvL~GiE------thvCv~qTa~dLl~rgl~VhvVaDacSSRs~-~DR~~Al~r~rq 155 (201) T KOG4044 90 -LPPVEDSLKDIFGGKTVVLFGIE------THVCVLQTALDLLERGLNVHVVADACSSRSN-QDRDLALERMRQ 155 (201) T ss_pred -CCHHHHHHHHCCCCCEEEEEECC------HHEEHHHHHHHHHHCCCEEEEEEEHHCCCCC-HHHHHHHHHHHH T ss_conf -72678888753599749998310------0000488799998679649998540024200-007899999986 No 91 >pfam11103 DUF2887 Protein of unknown function (DUF2887). This bacterial family of proteins has no known function. Probab=29.50 E-value=40 Score=14.32 Aligned_cols=121 Identities=20% Similarity=0.309 Sum_probs=66.8 Q ss_pred HHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCE-EEEEECCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 999972037853566543322233200057788899986400011355403-7886226885217889999999999998 Q gi|254780874|r 62 LHTVIGDHNSDSEIRDLAQIEVLAIEKEIRELENEINYLLLPKDTDDDKSC-ILEIRAGTGGSEAALFVGDLFRMYERYA 140 (357) Q Consensus 62 l~eLl~e~e~D~El~elAeeEl~~l~~~l~~le~eL~~~LLpsde~D~~na-ilEI~aGaGG~EA~dwA~~L~rMY~r~a 140 (357) +.+|+.....+++--..-.-| +++..-.|.-..+|+.+..+.++ ++|++--.--.==.-+.+++|+--..+. T Consensus 16 ~feLi~~~~~~a~~Y~F~s~e-------vK~~~fRlDGvf~P~~~~~~~Piyf~EvQfQ~D~~fy~RlfAEiflyL~q~~ 88 (200) T pfam11103 16 LFELIGEPPPEAEKYRFSSVE-------VKETAFRIDGVFLPPLESPDGPIYFLEVQFQKDEGFYYRLFAEIFLYLRQYP 88 (200) T ss_pred HHHHCCCCCCCHHCCEEECHH-------HHHHCCCCCEEEECCCCCCCCCEEEEEEEECCCHHHHHHHHHHHHHHHHHCC T ss_conf 999838994322033672566-------6401141563672688888997799998713677789999999999997478 Q ss_pred HHCCCCCCHHHHCCCCCCCCCEEEEEEECCCHHHHHHCCCCCEEEEEECCCCCCCCCCEEE Q ss_conf 7649840013211354477211137775031112100024650267612545678511016 Q gi|254780874|r 141 ALRKWKVEVLSSSDNDDGGYKEIVATISGRGVFSRMKFESGVHRVQRVPATEASGRVHTSA 201 (357) Q Consensus 141 e~~gwk~eii~~~~ge~aGik~v~~~I~G~~aYg~Lk~E~GvHRvqRip~~es~gR~hTS~ 201 (357) ..+.|.+-|+ |.+-.+...-...|.-|=.- +||+||--.|-..+...|- T Consensus 89 ~~~~W~~VVI---------yp~R~le~~~~~~~~~lL~~---~rV~rIYLdEL~~~~~~sl 137 (200) T pfam11103 89 PANDWRGVVI---------YPSRSLELSNPQPYRELLNS---QRVQRIYLDELGEAELLSL 137 (200) T ss_pred CCCCEEEEEE---------ECCCCCCCCCCCHHHHHHCC---CCEEEEEECCCCCCCCCCC T ss_conf 7678179998---------23655577775258987565---6569998342135788996 No 92 >KOG0581 consensus Probab=29.48 E-value=40 Score=14.32 Aligned_cols=54 Identities=28% Similarity=0.386 Sum_probs=32.9 Q ss_pred CCCEEEEEEEEEECCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEEECCCCEEEE Q ss_conf 5110168888530334332224543479998528887520006321215889649844888 Q gi|254780874|r 196 RVHTSAATVAVLPEAAEIDVDIPLEDIRIDTMRASGSGGQHVNTTDSAVRITHIPTGIMVT 256 (357) Q Consensus 196 R~hTS~a~V~V~P~~~~~~~~i~~~dl~i~~~rssG~GGQ~vN~t~savri~H~ptgi~v~ 256 (357) +.-+|........+.++.+..|...||+.--.=.+|+|| +-....|+|||...+ T Consensus 55 ~~~~~~~~~~~~~~l~~~~~~i~~~dle~~~~lG~G~gG-------~V~kv~Hk~t~~i~A 108 (364) T KOG0581 55 LSKISLSSSSANSELSEDDNGISLSDLERLGVLGSGNGG-------TVYKVRHKPTGKIYA 108 (364) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCC-------EEEEEEECCCCEEEE T ss_conf 433322454432234665333377775423112467873-------799999737885899 No 93 >smart00787 Spc7 Spc7 kinetochore protein. This domain is found in cell division proteins which are required for kinetochore-spindle association. Probab=29.23 E-value=41 Score=14.29 Aligned_cols=35 Identities=26% Similarity=0.268 Sum_probs=20.9 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHH Q ss_conf 378862268852178899999999999987649840013 Q gi|254780874|r 112 CILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVL 150 (357) Q Consensus 112 ailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii 150 (357) -+++.+-|-.-.| +.-|.+=|..+=..+||++.=+ T Consensus 260 ~~~~~~r~~t~~E----i~~Lk~~~~~Le~l~gw~~~~~ 294 (312) T smart00787 260 KKLEQCRGFTFKE----IEKLKEQLKLLQSLTGWKITKL 294 (312) T ss_pred HHHHHCCCCCHHH----HHHHHHHHHHHHHHHCCEEEEE T ss_conf 9998717999999----9999999999999869805753 No 94 >cd06145 REX1_like This subfamily is composed of RNA exonuclease 1 (REX1 or Rex1p), REX3 (or Rex3p), and similar eukaryotic proteins. In yeast, REX1 and REX3 are required for 5S rRNA and MRP (mitochondrial RNA processing) RNA maturation, respectively. They are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. REX1 is the major exonuclease responsible for pre-tRNA trail trimming and may also be involved in nuclear CCA turnover. REX proteins function in the processing and maturation of many RNA species, similar to the function of E.coli RNase T. Probab=28.97 E-value=24 Score=15.97 Aligned_cols=13 Identities=23% Similarity=0.601 Sum_probs=8.8 Q ss_pred EEEEECCCCCCCC Q ss_conf 2676125456785 Q gi|254780874|r 184 RVQRVPATEASGR 196 (357) Q Consensus 184 RvqRip~~es~gR 196 (357) -|.||+..|..|. T Consensus 15 ~LaRvsvVd~~g~ 27 (150) T cd06145 15 ELTRVTVVDENGK 27 (150) T ss_pred EEEEEEEECCCCC T ss_conf 8999999938998 No 95 >pfam01447 Peptidase_M4 Thermolysin metallopeptidase, catalytic domain. Probab=28.65 E-value=41 Score=14.22 Aligned_cols=28 Identities=7% Similarity=0.269 Sum_probs=25.4 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 6885217889999999999998764984 Q gi|254780874|r 119 GTGGSEAALFVGDLFRMYERYAALRKWK 146 (357) Q Consensus 119 GaGG~EA~dwA~~L~rMY~r~ae~~gwk 146 (357) =+.+.+|.-+|+..+.||..|..|+++. T Consensus 66 ~saavDAh~~a~~~~d~yk~~~~rns~d 93 (150) T pfam01447 66 DSAAVDAHYYAKVVYDYYKDWFGRNSLD 93 (150) T ss_pred CCCHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 5504556663367899988874888827 No 96 >COG0440 IlvH Acetolactate synthase, small (regulatory) subunit [Amino acid transport and metabolism] Probab=28.31 E-value=42 Score=14.18 Aligned_cols=110 Identities=17% Similarity=0.264 Sum_probs=72.6 Q ss_pred HHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEECC-----CHHHHHHCCCCCEEEEEECCCCCCCCCCEEEEEEEEEECC- Q ss_conf 9998764984001321135447721113777503-----1112100024650267612545678511016888853033- Q gi|254780874|r 137 ERYAALRKWKVEVLSSSDNDDGGYKEIVATISGR-----GVFSRMKFESGVHRVQRVPATEASGRVHTSAATVAVLPEA- 210 (357) Q Consensus 137 ~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~G~-----~aYg~Lk~E~GvHRvqRip~~es~gR~hTS~a~V~V~P~~- 210 (357) ..++.++||..+-+...++|..++..+|+.+.|+ -++..|.-=--|++|+.+.+. .-..|+---.-|.--|.. T Consensus 22 ~glfsrRG~NIeSltv~~tE~~~~SRiTivv~g~~~~~EQi~kQL~kLidV~kV~d~~~~-~~veRel~LiKv~~~~~~R 100 (163) T COG0440 22 TGLFSRRGYNIESLTVGPTETPGLSRITIVVSGDEQVLEQIIKQLNKLIDVLKVLDLTSE-PHVERELALIKVSAEGSER 100 (163) T ss_pred HHHHHHCCCCCCEEEEEECCCCCCEEEEEEECCCCCHHHHHHHHHHHHCCCEEEEECCCC-CHHHEEEEEEEEECCCCCH T ss_conf 688875176632079960278981589999827851689999998754350268984776-3010165789973582006 Q ss_pred CC----------CCCCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEEECCCCEEE Q ss_conf 43----------3222454347999852888752000632121588964984488 Q gi|254780874|r 211 AE----------IDVDIPLEDIRIDTMRASGSGGQHVNTTDSAVRITHIPTGIMV 255 (357) Q Consensus 211 ~~----------~~~~i~~~dl~i~~~rssG~GGQ~vN~t~savri~H~ptgi~v 255 (357) .+ .-++++++.+.++.-..++ |.|.=.++.- |.||.- T Consensus 101 ~ei~~~~~ifr~~vvDvs~~~~~~eltG~~~-------ki~afi~~l~-~~gi~e 147 (163) T COG0440 101 GEIARITEIFRASVVDVSPESLTIELTGDEE-------KIEAFIRLLR-PYGIIE 147 (163) T ss_pred HHHHHHHHHHCCEEEECCCCEEEEEEECCHH-------HHHHHHHHHC-CCCEEE T ss_conf 8999999874766874472128999907828-------9999999844-456167 No 97 >PRK08091 ribulose-phosphate 3-epimerase; Validated Probab=27.64 E-value=43 Score=14.10 Aligned_cols=59 Identities=15% Similarity=0.144 Sum_probs=32.9 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCE----------EEEEEECCCHHHH Q ss_conf 378862268852178899999999999987649840013211354477211----------1377750311121 Q gi|254780874|r 112 CILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKE----------IVATISGRGVFSR 175 (357) Q Consensus 112 ailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~----------v~~~I~G~~aYg~ 175 (357) -+|.+.||-||..=..-+-+=.+-..+|...+++.+.|. . .||++. |.+.|.|.+.|+. T Consensus 152 LvMtV~PGfgGQ~fi~~~l~KI~~l~~~~~~~~~~~~I~-V----DGGI~~~ti~~~~~aGad~~V~GS~iF~~ 220 (235) T PRK08091 152 QLLTLDPRYGSKMRSSDLHERVAQLLCLLGDKREGKLIV-I----DGSMTQDQLPSLIAQGIDWVVSGSALFSD 220 (235) T ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEE-E----ECCCCHHHHHHHHHCCCCEEEECHHHCCC T ss_conf 998766898888678789999999999999649991599-8----48989888999998399999978243379 No 98 >pfam00606 Glycoprotein_B Herpesvirus Glycoprotein B. This family of proteins contains a transmembrane region. Probab=27.59 E-value=8.7 Score=19.12 Aligned_cols=45 Identities=24% Similarity=0.324 Sum_probs=35.2 Q ss_pred CCCCCCCCEEECCCCCCCEEEEECCCEECCHHHHHCCCHHHHHHHH Q ss_conf 4674238221044899811243216441787787288669999999 Q gi|254780874|r 298 GSGDRSERIRTYNFSQGRITDHRINLTLYKLEYVLQGYIDDIINPL 343 (357) Q Consensus 298 g~~~rs~~iRtY~~~~~rv~DhR~~~~~~~~~~~l~G~l~~~i~~~ 343 (357) +--|-.+-.|-||++..|+.|.|..+...+-+.+|+| |++|++.| T Consensus 588 nvfD~e~~~ReyN~y~q~~~~l~~~~~~~~~~~~~~g-l~~~~~~L 632 (711) T pfam00606 588 NVFDLEEIFREYNYYAQRFYDIDKDIDNDNRPAFVRG-LDDFFQGL 632 (711) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH-HHHHHHHH T ss_conf 7868999998877888877527777426884279998-99998654 No 99 >CHL00079 rps9 ribosomal protein S9 Probab=27.40 E-value=38 Score=14.52 Aligned_cols=31 Identities=23% Similarity=0.229 Sum_probs=20.7 Q ss_pred CCCCCCCCCEEEEEEEEEECCCCCCCCCCCCEE Q ss_conf 545678511016888853033433222454347 Q gi|254780874|r 190 ATEASGRVHTSAATVAVLPEAAEIDVDIPLEDI 222 (357) Q Consensus 190 ~~es~gR~hTS~a~V~V~P~~~~~~~~i~~~dl 222 (357) .+-+.|||.||.|-|.+.|-.-. |.||..++ T Consensus 8 ~~y~tGkRKtsiArv~i~~GsG~--i~VN~k~l 38 (134) T CHL00079 8 LYYGTGRRKCAVAQVRLVPGTGE--IIINGKPG 38 (134) T ss_pred EEEEECCCCCEEEEEEEECCCCE--EEEECCCH T ss_conf 99986687058999999869836--99959709 No 100 >pfam11926 DUF3444 Domain of unknown function (DUF3444). This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is about 210 amino acids in length. This domain is found associated with pfam00226. This domain has two conserved sequence motifs: FSH and FSH. Probab=27.35 E-value=26 Score=15.68 Aligned_cols=54 Identities=13% Similarity=0.333 Sum_probs=27.5 Q ss_pred CCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHH------HHCCCCCCHHHHCC--CCCCCCCEEEEE-EECC Q ss_conf 554037886226885217889999999999998------76498400132113--544772111377-7503 Q gi|254780874|r 108 DDKSCILEIRAGTGGSEAALFVGDLFRMYERYA------ALRKWKVEVLSSSD--NDDGGYKEIVAT-ISGR 170 (357) Q Consensus 108 D~~nailEI~aGaGG~EA~dwA~~L~rMY~r~a------e~~gwk~eii~~~~--ge~aGik~v~~~-I~G~ 170 (357) ....-..+|.|-.|=+ || ||..|. ....++++++++.. +++.|++=+-+. |+|- T Consensus 114 ~~~r~~y~IyP~kGEV----WA-----lYknW~~~~~~~~~~~~~Y~iVEVLs~~~~~~gi~V~~L~Kv~Gf 176 (216) T pfam11926 114 KGRRNEYRIYPRKGEV----WA-----LYKNWSPDWSASDEKKYEYEIVEILSDFDEESGISVAPLVKVEGF 176 (216) T ss_pred CCCCCEEEEECCCCCE----EE-----EECCCCCCCCCCCCCCCCEEEEEEEECCCCCCCEEEEEEEEECCE T ss_conf 5897679997998766----25-----741768998856466541689999502567777799997874664 No 101 >TIGR00593 pola DNA polymerase I; InterPro: IPR002298 DNA carries the biological information that instructs cells how to exist in an ordered fashion. Accurate replication is thus one of the most important events in the cell life cycle. This function is mediated by DNA-directed DNA polymerases, which add nucleotide triphosphate (dNTP) residues to the 5'-end of the growing DNA chain, using a complementary DNA as template. Small RNA molecules are generally used as primers for chain elongation, although terminal proteins may also be used. DNA-dependent DNA polymerases have been grouped into families, denoted A, B and X, on the basis of sequence similarities , . Members of family A, which includes bacterial and bacteriophage polymerases, share significant similarity to Escherichia coli polymerase I; hence family A is also known as the pol I family. The bacterial polymerases also contain an exonuclease activity, which is coded for in the N-terminal portion. Three motifs, A, B and C , are seen to be conserved across all DNA polymerases, with motifs A and C also seen in RNA polymerases. They are centred on invariant residues, and their structural significance was implied from the Klenow (E. coli) structure. Motif A contains a strictly-conserved aspartate at the junction of a beta-strand and an alpha-helix; motif B contains an alpha-helix with positive charges; and motif C has a doublet of negative charges, located in a beta-turn-beta secondary structure .; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=27.21 E-value=16 Score=17.16 Aligned_cols=34 Identities=15% Similarity=0.400 Sum_probs=25.2 Q ss_pred HHHHHHHHHHCCCC-CH--------HHHHHHHHHHHHHHHHHH Q ss_conf 99999999853853-15--------799999999987999999 Q gi|254780874|r 14 NRFAEIELRMSESP-SV--------DAYIKLTEEYAAISPIIS 47 (357) Q Consensus 14 ~rl~ELE~~Lsdp~-~w--------~d~~klsKe~s~L~~iv~ 47 (357) ..+.++...+.|+. +. +-+.+|-+++..|+.+.+ T Consensus 178 ~Q~~D~~AL~GD~SDNiPGV~GIGeKTA~kLL~~fgsLe~iy~ 220 (1005) T TIGR00593 178 SQLVDLKALVGDSSDNIPGVKGIGEKTAAKLLQEFGSLENIYE 220 (1005) T ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 5788762046888878599897376568999987210899998 No 102 >PRK06945 flgK flagellar hook-associated protein FlgK; Validated Probab=26.34 E-value=45 Score=13.94 Aligned_cols=19 Identities=5% Similarity=0.174 Sum_probs=11.1 Q ss_pred HHHHHHHHHHHHHHHHCCC Q ss_conf 8999999999999985385 Q gi|254780874|r 8 QICDLKNRFAEIELRMSES 26 (357) Q Consensus 8 kL~~l~~rl~ELE~~Lsdp 26 (357) ..+.....+.+||..+.+| T Consensus 84 ~~~~~~~~l~~ie~~f~e~ 102 (649) T PRK06945 84 SLSTYYSLVAQLNNYLADP 102 (649) T ss_pred HHHHHHHHHHHHHHHHCCC T ss_conf 9999999999999984898 No 103 >KOG0895 consensus Probab=26.22 E-value=41 Score=14.28 Aligned_cols=16 Identities=13% Similarity=0.250 Sum_probs=6.3 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 8999999999999999 Q gi|254780874|r 265 NRLRAMKVLRARLYDV 280 (357) Q Consensus 265 Nk~~A~~~L~~~l~~~ 280 (357) |-..-+..+++.+|++ T Consensus 998 nen~~~~~~~~m~~~~ 1013 (1101) T KOG0895 998 NENAFLLTCKSMVYQL 1013 (1101) T ss_pred CHHHHHHHHHHHHHHH T ss_conf 4247999999999986 No 104 >PRK09722 allulose-6-phosphate 3-epimerase; Provisional Probab=26.13 E-value=46 Score=13.92 Aligned_cols=38 Identities=11% Similarity=0.141 Sum_probs=31.2 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCH Q ss_conf 37886226885217889999999999998764984001 Q gi|254780874|r 112 CILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEV 149 (357) Q Consensus 112 ailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~ei 149 (357) -+|.+.||-||.-=.--+-+=.+....|...+|..+.| T Consensus 133 LvMsV~PGf~GQ~Fi~~~l~KI~~lr~~~~~~~~~~~I 170 (227) T PRK09722 133 TVMTVDPGFAGQPFIPEMLDKIAELKAWREREGLEYEI 170 (227) T ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEE T ss_conf 99988899987656688999999999999825998269 No 105 >PRK11152 ilvM acetolactate synthase 2 regulatory subunit; Provisional Probab=25.86 E-value=40 Score=14.35 Aligned_cols=42 Identities=17% Similarity=0.179 Sum_probs=30.2 Q ss_pred HHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEECCCHHHHHHC Q ss_conf 999876498400132113544772111377750311121000 Q gi|254780874|r 137 ERYAALRKWKVEVLSSSDNDDGGYKEIVATISGRGVFSRMKF 178 (357) Q Consensus 137 ~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~G~~aYg~Lk~ 178 (357) .|-.-.+||++.-++......++.-.+++.|+++.+...|.. T Consensus 21 LRVvRHRGF~v~~m~m~~~~~~~~~~i~ltV~SdR~i~lL~~ 62 (76) T PRK11152 21 LRVVRHRGFQVCQMNMTANSDAQNINIELTVASERAIDLLSS 62 (76) T ss_pred EEEEEECCEEEEEEEEEECCCCCEEEEEEEEECCCHHHHHHH T ss_conf 347841684999976267389988999999905773999999 No 106 >TIGR02090 LEU1_arch isopropylmalate/citramalate/homocitrate synthases; InterPro: IPR011830 Methanogenic archaea contain three closely related homologs of the 2-isopropylmalate synthases (LeuA) represented by IPR005671 from INTERPRO. Two of these in Methanococcus janaschii (MJ1392 - CimA ; MJ0503 - AksA ) have been characterised as catalyzing alternative reactions leaving the third (MJ1195) as the presumptive LeuA enzyme. CimA is citramalate (2-methylmalate) synthase, which condenses acetyl-CoA with pyruvate. This enzyme is believed to be involved in the biosynthesis of isoleucine in methanogens and possibly other species lacking threonine dehydratase. AksA is a homocitrate synthase which also produces (homo)2-citrate and (homo)3-citrate in the biosynthesis of Coenzyme B which is restricted solely to methanogenic archaea. Methanogens, then should and apparently do contain all three of these enzymes. Unfortunately, phylogenetic trees do not resolve into three unambiguous clades, making assignment of function to particular genes problematic. Other archaea, which lack a threonine dehydratase (mainly Euryarchaeota), should contain both CimA and LeuA genes. This is true for archaeoglobus fulgidis, but not for the Pyrococci which have none in this clade, but one in IPR005671 from INTERPRO and one in IPR005675 from INTERPRO which may fulfil these roles. Proteins from other species, which have only one hit to this entry and lack threonine dehydratase, are very likely to be LeuA enzymes.; GO: 0046912 transferase activity transferring acyl groups acyl groups converted into alkyl on transfer, 0019752 carboxylic acid metabolic process. Probab=25.40 E-value=20 Score=16.51 Aligned_cols=14 Identities=50% Similarity=0.952 Sum_probs=7.6 Q ss_pred EECCCHHHHHHCCCCCE Q ss_conf 75031112100024650 Q gi|254780874|r 167 ISGRGVFSRMKFESGVH 183 (357) Q Consensus 167 I~G~~aYg~Lk~E~GvH 183 (357) |=|+|||+ +|||+| T Consensus 276 IVGeNAFa---HEsGIH 289 (371) T TIGR02090 276 IVGENAFA---HESGIH 289 (371) T ss_pred CCCCCCCC---CCCHHH T ss_conf 32565000---110355 No 107 >COG0011 Uncharacterized conserved protein [Function unknown] Probab=25.39 E-value=45 Score=13.97 Aligned_cols=24 Identities=21% Similarity=0.418 Sum_probs=9.5 Q ss_pred CEECCHHHHHCCCHHHHHHHHHHH Q ss_conf 441787787288669999999999 Q gi|254780874|r 323 LTLYKLEYVLQGYIDDIINPLLTA 346 (357) Q Consensus 323 ~~~~~~~~~l~G~l~~~i~~~~~~ 346 (357) |.++....++.|++|++++.+... T Consensus 37 y~~~pm~T~iEg~~del~~~ik~~ 60 (100) T COG0011 37 YQLGPMGTVIEGELDELMEAVKEA 60 (100) T ss_pred EEECCCCEEEEECHHHHHHHHHHH T ss_conf 260676218986299999999999 No 108 >PRK07300 consensus Probab=24.70 E-value=23 Score=16.13 Aligned_cols=42 Identities=17% Similarity=0.143 Sum_probs=28.8 Q ss_pred CCCCCEEEEECCCEECCHHHHHCC--CHHHHHHHHHHHHHHHHHHH Q ss_conf 899811243216441787787288--66999999999999999874 Q gi|254780874|r 311 FSQGRITDHRINLTLYKLEYVLQG--YIDDIINPLLTAHQAKMIGS 354 (357) Q Consensus 311 ~~~~rv~DhR~~~~~~~~~~~l~G--~l~~~i~~~~~~~~~~~l~~ 354 (357) +|....+--.||++|. .+||.. +..+++..+|.|+.-.+|.. T Consensus 546 LP~~~~KKTKTGySTd--~evLe~L~~~hpi~~~ILeyR~l~KLkS 589 (880) T PRK07300 546 LPLEMTKKTKTGYSTA--VDVLERLAPIAPIVAKILEYRQITKLQS 589 (880) T ss_pred CCCCCCCCCCCCCCCH--HHHHHHHHHCCCCHHHHHHHHHHHHHHH T ss_conf 9876777788887857--9999997661874899999848999999 No 109 >TIGR00263 trpB tryptophan synthase, beta subunit; InterPro: IPR006654 Tryptophan synthase catalyzes the last step in the biosynthesis of tryptophan , :L-serine + 1-(indol-3-yl)glycerol 3-phosphate = L-tryptophan + glyceraldehyde 3-phosphate + H_2O It has two functional domains, each found in bacteria and plants on a separate subunit: alpha chain (IPR002028 from INTERPRO) is for the aldol cleavage of indoleglycerol phosphate to indole and glyceraldehyde 3-phosphate and beta chain is for the synthesis of tryptophan from indole and serine. In fungi the two domains are fused together on a single multifunctional protein . The beta chain of the enzyme, represented here, requires pyridoxal-phosphate as a cofactor. The pyridoxal-phosphate group is attached to a lysine residue. The region around this lysine residue also contains two histidine residues which are part of the pyridoxal-phosphate binding site.; GO: 0004834 tryptophan synthase activity, 0006568 tryptophan metabolic process. Probab=24.55 E-value=49 Score=13.72 Aligned_cols=32 Identities=16% Similarity=0.262 Sum_probs=21.8 Q ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 899999999999998538531579999999998 Q gi|254780874|r 8 QICDLKNRFAEIELRMSESPSVDAYIKLTEEYA 40 (357) Q Consensus 8 kL~~l~~rl~ELE~~Lsdp~~w~d~~klsKe~s 40 (357) -|++|+..|+.+... +|+.||.++..+-++|. T Consensus 21 a~~eLe~af~~a~~D-nD~~F~~El~~lL~~Y~ 52 (412) T TIGR00263 21 ALEELEAAFEKAKKD-NDPEFKAELEELLKNYA 52 (412) T ss_pred HHHHHHHHHHHHHHC-CCHHHHHHHHHHHHHCC T ss_conf 999999999998622-56788999999998607 No 110 >PRK08434 consensus Probab=24.10 E-value=25 Score=15.79 Aligned_cols=52 Identities=10% Similarity=0.126 Sum_probs=33.7 Q ss_pred HHHHHHHHHHCCCC-----CH----HHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHH Q ss_conf 99999999853853-----15----7999999999879999999999999--99989999999 Q gi|254780874|r 14 NRFAEIELRMSESP-----SV----DAYIKLTEEYAAISPIISKISIYDQ--KKQEEQDLHTV 65 (357) Q Consensus 14 ~rl~ELE~~Lsdp~-----~w----~d~~klsKe~s~L~~iv~~~~~~~~--~~~ei~el~eL 65 (357) .++-++-..+.|+. .. +-+.++-.++..|+.+.+.++.... ..+.+.+-++. T Consensus 170 ~qv~D~lAL~GDsSDNIPGVpGIG~KTA~kLL~eyGsLE~I~~n~d~Ik~~K~re~L~~~~e~ 232 (887) T PRK08434 170 KQIRDFLALCGDSADNIPGVKGIGAKGAKKLLDEFGSLEGIYENLELVRNERIRNLLLEGKEN 232 (887) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 999999998188645789999856899999999828889999988712407799999854788 No 111 >KOG3389 consensus Probab=23.49 E-value=34 Score=14.84 Aligned_cols=18 Identities=17% Similarity=0.405 Sum_probs=12.7 Q ss_pred HHHHHHHCCCCCCHHHHC Q ss_conf 999987649840013211 Q gi|254780874|r 136 YERYAALRKWKVEVLSSS 153 (357) Q Consensus 136 Y~r~ae~~gwk~eii~~~ 153 (357) -..|||++||.+.|-+-. T Consensus 137 A~sFaEkngW~ydveep~ 154 (178) T KOG3389 137 AKSFAEKNGWDYDVEEPN 154 (178) T ss_pred HHHHHHHCCCCCCCCCCC T ss_conf 999998728841035788 No 112 >KOG0544 consensus Probab=22.90 E-value=36 Score=14.64 Aligned_cols=34 Identities=18% Similarity=0.209 Sum_probs=24.3 Q ss_pred HCCCCCCHHHHCCCCCCCCCEEEEEEECCCHHHHHHCCC Q ss_conf 649840013211354477211137775031112100024 Q gi|254780874|r 142 LRKWKVEVLSSSDNDDGGYKEIVATISGRGVFSRMKFES 180 (357) Q Consensus 142 ~~gwk~eii~~~~ge~aGik~v~~~I~G~~aYg~Lk~E~ 180 (357) .+||.--+.-.|-|+. +-+.|+.++|||--.+-. T Consensus 57 IkGwdegv~qmsvGek-----akLti~pd~aYG~~G~p~ 90 (108) T KOG0544 57 IKGWDEGVAQMSVGEK-----AKLTISPDYAYGPRGHPG 90 (108) T ss_pred EECHHHCCHHCCCCCC-----CEEEECCCCCCCCCCCCC T ss_conf 6032232101153640-----101657631227788887 No 113 >cd06143 PAN2_exo PAN2 is the catalytic subunit of poly(A) nuclease (PAN), a Pab1p-dependent 3'-5' exoribonuclease which plays an important role in the posttranscriptional maturation of pre-mRNAs. PAN catalyzes the deadenylation of poly(A) tails, which are initially synthesized to default lengths of 70 to 90, to mRNA-specific lengths of 55 to 71. Pab1p and PAN also play a role in the export and decay of mRNA. PAN2 contains a DEDDh-type DnaQ-like 3'-5' exonuclease domain with three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. Probab=22.56 E-value=34 Score=14.89 Aligned_cols=26 Identities=12% Similarity=0.161 Sum_probs=14.8 Q ss_pred EEEEEECCCCCCCCCCEEE-EEEEEEE Q ss_conf 0267612545678511016-8888530 Q gi|254780874|r 183 HRVQRVPATEASGRVHTSA-ATVAVLP 208 (357) Q Consensus 183 HRvqRip~~es~gR~hTS~-a~V~V~P 208 (357) |.|-|||-.++.|-.|-.. --..|-| T Consensus 31 ~aLARvSiV~~~g~~~G~vl~D~yVkP 57 (174) T cd06143 31 MSLARVSVVRGEGELEGVPFIDDYIST 57 (174) T ss_pred CCCEEEEEECCCCCCCCCEEEEEEECC T ss_conf 643027898445871688885551789 No 114 >COG2126 RPL37A Ribosomal protein L37E [Translation, ribosomal structure and biogenesis] Probab=22.48 E-value=24 Score=15.94 Aligned_cols=21 Identities=33% Similarity=0.619 Sum_probs=14.5 Q ss_pred CCCCCEEECCCCCCCEEEEEC Q ss_conf 423822104489981124321 Q gi|254780874|r 301 DRSERIRTYNFSQGRITDHRI 321 (357) Q Consensus 301 ~rs~~iRtY~~~~~rv~DhR~ 321 (357) .||.+||+|+.+--.+.-+|+ T Consensus 40 grs~r~R~y~W~~K~~~r~~~ 60 (61) T COG2126 40 GRSARMRSYNWQAKKKRRKTI 60 (61) T ss_pred CCCCCCCCCHHHHHHHCCEEC T ss_conf 876100350366654037015 No 115 >PTZ00086 40S ribosomal protein S16; Provisional Probab=22.48 E-value=53 Score=13.44 Aligned_cols=29 Identities=14% Similarity=0.150 Sum_probs=19.9 Q ss_pred CCCCCCCEEEEEEEEEECCCCCCCCCCCCEE Q ss_conf 5678511016888853033433222454347 Q gi|254780874|r 192 EASGRVHTSAATVAVLPEAAEIDVDIPLEDI 222 (357) Q Consensus 192 es~gR~hTS~a~V~V~P~~~~~~~~i~~~dl 222 (357) .+.|||.||.|.|.+.|=.- .|.||..++ T Consensus 12 ~~~GrRKtAvArv~l~~G~G--~I~IN~k~~ 40 (147) T PTZ00086 12 QTFGKKKTAVAVATVTKGKG--LIKVNGVPL 40 (147) T ss_pred EEECCCCCEEEEEEEECCCC--EEEECCEEH T ss_conf 97754742799999988975--099989149 No 116 >KOG2148 consensus Probab=22.06 E-value=54 Score=13.39 Aligned_cols=14 Identities=29% Similarity=0.458 Sum_probs=10.9 Q ss_pred CCCHHHHHHHHHHH Q ss_conf 85217889999999 Q gi|254780874|r 121 GGSEAALFVGDLFR 134 (357) Q Consensus 121 GG~EA~dwA~~L~r 134 (357) -|.|||.||+.-++ T Consensus 306 ~~ieact~aA~al~ 319 (867) T KOG2148 306 QGIEACTWAAKALR 319 (867) T ss_pred HHHHHHHHHHHHHH T ss_conf 10799999999999 No 117 >KOG2019 consensus Probab=22.03 E-value=39 Score=14.44 Aligned_cols=14 Identities=14% Similarity=0.000 Sum_probs=6.0 Q ss_pred CCCCCCEEEEEEEE Q ss_conf 67851101688885 Q gi|254780874|r 193 ASGRVHTSAATVAV 206 (357) Q Consensus 193 s~gR~hTS~a~V~V 206 (357) -|=+|||.-.||.+ T Consensus 624 qqI~rkTGGiS~~p 637 (998) T KOG2019 624 QQIGRKTGGISVSP 637 (998) T ss_pred HHHHHHCCCEEECE T ss_conf 98602058655110 No 118 >pfam07962 Swi3 Replication Fork Protection Component Swi3. Replication fork pausing is required to initiate a recombination events. More specifically, Swi1 is required for recombination near the mat1 locus. Swi3 has been found to co-purify with Swi1 Swi3, together with Swi1, define a fork protection complex that coordinates leading- and lagging-strand synthesis and stabilizes stalled replication forks. The Swi1-Swi3 complex is required for accurate replication, fork protection and replication checkpoint signalling Probab=21.82 E-value=55 Score=13.35 Aligned_cols=27 Identities=19% Similarity=0.257 Sum_probs=18.0 Q ss_pred EECCCCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 6226885217889999999999998764 Q gi|254780874|r 116 IRAGTGGSEAALFVGDLFRMYERYAALR 143 (357) Q Consensus 116 I~aGaGG~EA~dwA~~L~rMY~r~ae~~ 143 (357) ++..--|.|+.|.+ -|+.||+.||... T Consensus 23 ~k~kgkghE~~DL~-~ll~~Yq~W~h~l 49 (84) T pfam07962 23 FKFKGKGHEYSDLA-RLLQFYQLWAHDL 49 (84) T ss_pred CCCCCCCCHHHHHH-HHHHHHHHHHHHH T ss_conf 78189998088999-9999999999800 No 119 >COG2960 Uncharacterized protein conserved in bacteria [Function unknown] Probab=21.73 E-value=48 Score=13.75 Aligned_cols=83 Identities=18% Similarity=0.203 Sum_probs=37.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 96786898999999999999985385315799999999987999999999999999989999999720378535665433 Q gi|254780874|r 1 MVFLSHKQICDLKNRFAEIELRMSESPSVDAYIKLTEEYAAISPIISKISIYDQKKQEEQDLHTVIGDHNSDSEIRDLAQ 80 (357) Q Consensus 1 m~mi~~~kL~~l~~rl~ELE~~Lsdp~~w~d~~klsKe~s~L~~iv~~~~~~~~~~~ei~el~eLl~e~e~D~El~elAe 80 (357) |+|++..+++++..++.+.-..-. +.+-.+..+. -...+... ++.++-...+--|. -.+|.--+- T Consensus 8 ~~m~~~~~~e~~~~ql~e~~a~~~-~~~~~evE~~--~r~~~q~~---lnkLDlVsREEFdv---------q~qvl~rtR 72 (103) T COG2960 8 MTMIGPNRFEDIAAQLSEDAAGAA-QEVRAEVEKA--FRAQLQRQ---LNKLDLVSREEFDV---------QRQVLLRTR 72 (103) T ss_pred HHCCCHHHHHHHHHHHHHHCCCCC-HHHHHHHHHH--HHHHHHHH---HHHHHHHHHHHHHH---------HHHHHHHHH T ss_conf 204777789999999998731000-1037999999--99999988---77602243999999---------999999999 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 222332000577888999 Q gi|254780874|r 81 IEVLAIEKEIRELENEIN 98 (357) Q Consensus 81 eEl~~l~~~l~~le~eL~ 98 (357) +++..|+..+..||.+|. T Consensus 73 ~kl~~Leari~~LEarl~ 90 (103) T COG2960 73 EKLAALEARIEELEARLA 90 (103) T ss_pred HHHHHHHHHHHHHHHHHC T ss_conf 999999999999998742 No 120 >TIGR02660 nifV_homocitr homocitrate synthase; InterPro: IPR013477 Most NifV-type homocitrate synthases are encoded within operons for nitrogen fixation. They are homologous to enzymes that include 2-isopropylmalate synthase, (R)-citramalate synthase, and homocitrate synthases associated with other processes. The homocitrate made by these enzymes becomes a part of the iron-molybdenum cofactor of nitrogenase.; GO: 0004410 homocitrate synthase activity, 0051188 cofactor biosynthetic process. Probab=21.70 E-value=26 Score=15.71 Aligned_cols=12 Identities=25% Similarity=0.556 Sum_probs=5.1 Q ss_pred CCCCEEEEEEEC Q ss_conf 772111377750 Q gi|254780874|r 158 GGYKEIVATISG 169 (357) Q Consensus 158 aGik~v~~~I~G 169 (357) ||-++|...|.| T Consensus 210 AGA~~vntTV~G 221 (369) T TIGR02660 210 AGATHVNTTVNG 221 (369) T ss_pred HCCEEEEEEECC T ss_conf 085776335433 No 121 >TIGR02375 pseudoazurin pseudoazurin; InterPro: IPR012745 Pseudoazurin, also called cupredoxin, is a small, blue periplasmic protein with a single bound copper atom. Pseudoazurin is related to plastocyanins. Several examples of pseudoazurin are encoded by a neighbouring gene for, or have been shown to transfer electrons to, copper-containing nitrite reductases (IPR012746 from INTERPRO) of the same species.; GO: 0005507 copper ion binding. Probab=21.61 E-value=46 Score=13.87 Aligned_cols=34 Identities=21% Similarity=0.331 Sum_probs=20.8 Q ss_pred HCCCCCCHH-HHCCCCCCCCC-----EEEEEEECCCHHHH Q ss_conf 649840013-21135447721-----11377750311121 Q gi|254780874|r 142 LRKWKVEVL-SSSDNDDGGYK-----EIVATISGRGVFSR 175 (357) Q Consensus 142 ~~gwk~eii-~~~~ge~aGik-----~v~~~I~G~~aYg~ 175 (357) -+|+.+|.| ...|-..-.+| ++++.++-+|+||+ T Consensus 34 DKGHNve~iKgm~P~GA~~Fk~k~Ne~~~vt~~~eGvY~~ 73 (123) T TIGR02375 34 DKGHNVETIKGMIPEGAEKFKSKINEEYTVTLTKEGVYGV 73 (123) T ss_pred CCCCCHHCCCCCCCCCHHHCCCCCCCCEEEEEECCCEEEE T ss_conf 8876700002336522121025578755888614852588 No 122 >TIGR01149 mtrG tetrahydromethanopterin S-methyltransferase, subunit G; InterPro: IPR005866 This model describes the N5-methyltetrahydromethanopterin: coenzyme M methyltransferase subunit G in methanogenic archaea. This methyltranferase is a membrane-associated enzyme complex that uses methyl-transfer reaction to drive a sodium-ion pump. Archaea have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This transferase is involved in the transfer of a methyl group from N5-methyltetrahydromethanopterin to coenzyme M. In an accompanying reaction, methane is produced by two-electron reduction of the methyl moiety in methyl-coenzyme M by another enzyme methyl-coenzyme M reductase.; GO: 0030269 tetrahydromethanopterin S-methyltransferase activity, 0015948 methanogenesis, 0016021 integral to membrane. Probab=21.14 E-value=56 Score=13.26 Aligned_cols=34 Identities=18% Similarity=0.297 Sum_probs=20.6 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCCHHH-HHHHHH Q ss_conf 678689899999999999998538531579-999999 Q gi|254780874|r 2 VFLSHKQICDLKNRFAEIELRMSESPSVDA-YIKLTE 37 (357) Q Consensus 2 ~mi~~~kL~~l~~rl~ELE~~Lsdp~~w~d-~~klsK 37 (357) +.++....+++.+||++||+++. ..|.. ++++.| T Consensus 9 ~~~~~~ef~~~~~rLDeie~kve--Fv~~E~~Qr~Gk 43 (72) T TIGR01149 9 VLVEPDEFNEVKKRLDEIEEKVE--FVNGEVAQRIGK 43 (72) T ss_pred HCCCHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHCC T ss_conf 10597669999987417867887--899998886076 No 123 >PRK00252 alaS alanyl-tRNA synthetase; Reviewed Probab=21.08 E-value=57 Score=13.25 Aligned_cols=35 Identities=26% Similarity=0.526 Sum_probs=21.5 Q ss_pred CCEEEEEEEC---CCCCCCCCCCCCCC--EEEEEE---CCCCE Q ss_conf 4347999852---88875200063212--158896---49844 Q gi|254780874|r 219 LEDIRIDTMR---ASGSGGQHVNTTDS--AVRITH---IPTGI 253 (357) Q Consensus 219 ~~dl~i~~~r---ssG~GGQ~vN~t~s--avri~H---~ptgi 253 (357) ++.+||-.+- .--.||-||..|.- .++|+- +-.|| T Consensus 639 ~d~VRVV~ig~~s~ElCGGTHv~~TgeIg~~kI~~e~~i~~Gv 681 (864) T PRK00252 639 GDEVRVVSMGDFSIELCGGTHVSNTGDIGLFKIVSESGVAAGV 681 (864) T ss_pred CCEEEEEEECCEEEECCCCCCCCCCCEEEEEEEEEECCCCCCC T ss_conf 8806999989926856789889970523549999845635881 No 124 >PRK11522 putrescine--2-oxoglutarate aminotransferase; Provisional Probab=21.06 E-value=57 Score=13.25 Aligned_cols=87 Identities=18% Similarity=0.227 Sum_probs=50.6 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEECCCHHHHHHCCCCCEEEEEECCCCCCCC--- Q ss_conf 88521788999999999999876498400132113544772111377750311121000246502676125456785--- Q gi|254780874|r 120 TGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIVATISGRGVFSRMKFESGVHRVQRVPATEASGR--- 196 (357) Q Consensus 120 aGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~G~~aYg~Lk~E~GvHRvqRip~~es~gR--- 196 (357) .+|.||..-|--|.|.|.+ .+| +.+++....+=. |---.++++.|...|.. .|...+-.+.++|-.+...- T Consensus 157 nSGSEAvE~AiKlAr~y~~---~~G-r~~IIs~~~sYH-G~T~galS~tg~~~~~~-~f~pl~p~~~~vp~~d~~~~~~~ 230 (468) T PRK11522 157 NSGTESVEAALKLAKAYQS---PRG-KFTFIATSGAFH-GKSLGALSATAKSTFRK-PFMPLLPGFRHVPFGNIEAMRTA 230 (468) T ss_pred CCCHHHHHHHHHHHHHHHH---HHC-CCEEEEECCCCC-CCCCHHHEECCCHHCCC-CCCCCCCCCEECCCCCHHHHHHH T ss_conf 9728999999999999777---409-918999717867-76422203127621025-77789998456378999999999 Q ss_pred ------CCEEEEEEEEEECCCC Q ss_conf ------1101688885303343 Q gi|254780874|r 197 ------VHTSAATVAVLPEAAE 212 (357) Q Consensus 197 ------~hTS~a~V~V~P~~~~ 212 (357) .--..|+|.+=|..-+ T Consensus 231 ~~~~~~~~~~vAAvIvEPIqG~ 252 (468) T PRK11522 231 LSECKKTGDDVAAVILEPIQGE 252 (468) T ss_pred HHHHHHCCCCEEEEEEECCCCC T ss_conf 9986514896899999777677 No 125 >TIGR01875 cas_MJ0381 CRISPR-associated autoregulator DevR family; InterPro: IPR010154 CRISPR is a term for Clustered Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR associated) proteins. This entry represents one such family, represented by MJ0381 of Methanococcus jannaschii. This family includes the DevR protein of Myxococcus xanthus, a protein whose expression appears to regulated through a number of means, including both location and autorepression; DevR mutants are incapable of fruiting body development.. Probab=20.98 E-value=45 Score=13.93 Aligned_cols=107 Identities=19% Similarity=0.209 Sum_probs=58.8 Q ss_pred HHHHHHHHHHHHHH-------CCCCCCHHHH--CCCCCCCCCEE------------------EEEEECC---CHHHHHHC Q ss_conf 99999999999876-------4984001321--13544772111------------------3777503---11121000 Q gi|254780874|r 129 VGDLFRMYERYAAL-------RKWKVEVLSS--SDNDDGGYKEI------------------VATISGR---GVFSRMKF 178 (357) Q Consensus 129 A~~L~rMY~r~ae~-------~gwk~eii~~--~~ge~aGik~v------------------~~~I~G~---~aYg~Lk~ 178 (357) .+||-.++...+.. ..|+..-... ..+..-.++++ ...---. .++|||.. T Consensus 50 G~alkh~~~~~~~~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~e~~~~~k~r~fe~~~~~~~~~~fefDl~Gfm~~ 129 (300) T TIGR01875 50 GEALKHALRVTLVEFAKKDGNLLCELSKAYATRKSGNEDVIKKATSKEPEKKDKIREFEKEEILKYIDAFEFDLFGFMSA 129 (300) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCHHHHCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 78899999999999986428866520011143001341055146655764320233231454542034421235654680 Q ss_pred C---CCC-EEEEEECCCCCCCCCCEEEEEEEEEECC---CCCCCCCCCCEEEEEE--ECCCC-CCCCCCCCCCC Q ss_conf 2---465-0267612545678511016888853033---4332224543479998--52888-75200063212 Q gi|254780874|r 179 E---SGV-HRVQRVPATEASGRVHTSAATVAVLPEA---AEIDVDIPLEDIRIDT--MRASG-SGGQHVNTTDS 242 (357) Q Consensus 179 E---~Gv-HRvqRip~~es~gR~hTS~a~V~V~P~~---~~~~~~i~~~dl~i~~--~rssG-~GGQ~vN~t~s 242 (357) + .|- .-+.|.||+ -+|+| |.+.|.. ..++..++.+-+...+ ++..+ .+||.+=.+|. T Consensus 130 ~aK~~~~~~~~~R~S~v------~~~~a-~s~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~y~~E~ 196 (300) T TIGR01875 130 EAKIEGKGATVRRTSPV------KVSYA-ISVTPFINEQAVLDRDFGVNSIDATNRAIKKQGEEEGQMIYETEY 196 (300) T ss_pred HHCCCCCCCCCEEEEEE------EECCC-CCCCCCCCCHHHEEEECCCCCCCCCCCEEECCCCCCCCCCEECCE T ss_conf 11003588972468888------85133-203623110101001002343566664022056546653032011 No 126 >cd04911 ACT_AKiii-YclM-BS_1 ACT domains located C-terminal to the catalytic domain of the lysine plus threonine-sensitive aspartokinase isoenzyme AKIII. This CD includes the first of two ACT domains located C-terminal to the catalytic domain of the lysine plus threonine-sensitive aspartokinase isoenzyme AKIII, a monofunctional class enzyme found in Bacilli (Bacillus subtilis (BS) YclM) and Clostridia species. Aspartokinase is the first enzyme in the aspartate metabolic pathway and catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. Bacillus subtilis YclM is reported to be a single polypeptide of 50 kD. AKIII from Bacillus subtilis strain 168 is induced by lysine and repressed by threonine and it is synergistically inhibited by lysine and threonine. Members of this CD belong to the superfamily of ACT regulatory domains. Probab=20.85 E-value=35 Score=14.71 Aligned_cols=23 Identities=22% Similarity=0.488 Sum_probs=13.0 Q ss_pred EEEEEECCCCE-EEEE-CCCCCHHH Q ss_conf 15889649844-8884-36541358 Q gi|254780874|r 243 AVRITHIPTGI-MVTS-SEKSQHQN 265 (357) Q Consensus 243 avri~H~ptgi-~v~~-~eRsQ~~N 265 (357) .|..-|+|||| .++- =...|+.+ T Consensus 29 ~IS~EHmPSGID~~siiv~~~~l~~ 53 (76) T cd04911 29 GISYEHMPSGIDDISIIIRDNQLTD 53 (76) T ss_pred CCCEEECCCCCCEEEEEEEHHHCCC T ss_conf 9986455888660899988687470 No 127 >TIGR02566 cas_Csy3 CRISPR-associated protein, Csy3 family; InterPro: IPR013399 CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) is a widespread family of prokaryotic direct repeats with spacers of unique sequence between consecutive repeats. This entry, typified by YPO2463 of Yersinia pestis, is a CRISPR-associated (Cas) entry strictly associated with the Ypest subtype of CRISPR/Cas locus. It is designated Csy3, for CRISPR/Cas Subtype Ypest protein 3.. Probab=20.54 E-value=58 Score=13.17 Aligned_cols=137 Identities=17% Similarity=0.141 Sum_probs=76.9 Q ss_pred HHHHHCCCCCCCCCCEE--EEEECCCCCCHH---HHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCC---------CCCCE Q ss_conf 99864000113554037--886226885217---88999999999999876498400132113544---------77211 Q gi|254780874|r 97 INYLLLPKDTDDDKSCI--LEIRAGTGGSEA---ALFVGDLFRMYERYAALRKWKVEVLSSSDNDD---------GGYKE 162 (357) Q Consensus 97 L~~~LLpsde~D~~nai--lEI~aGaGG~EA---~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~---------aGik~ 162 (357) .-..+|| ...|...|- |-+-||.|+-.+ +.|=..|+..=..|.+..||+.=.-=|...=. -|=-+ T Consensus 81 vD~~~l~-~d~dtl~~~F~l~vl~~~~~P~~Cnd~~~~~~l~~~v~~Y~~~~G~~ELA~RYa~NiaNGRwLWRNR~gAE~ 159 (370) T TIGR02566 81 VDVAALP-YDADTLKLSFTLKVLASLAKPSKCNDAEYKQALLQTVKLYIEEQGFEELARRYAENIANGRWLWRNRVGAES 159 (370) T ss_pred CCEEECC-CCCCEEEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCEEHHHCCEEEEE T ss_conf 0034327-997468899999974157888666798999999999998764157477888888640043032017655667 Q ss_pred EEEEEECCCHHHHHHCCCCCEEEEEECCCCCCCCCCEEEE-EEE----EEECCCCCCCCCC-CC---EEEEEEECCCCCC Q ss_conf 1377750311121000246502676125456785110168-888----5303343322245-43---4799985288875 Q gi|254780874|r 163 IVATISGRGVFSRMKFESGVHRVQRVPATEASGRVHTSAA-TVA----VLPEAAEIDVDIP-LE---DIRIDTMRASGSG 233 (357) Q Consensus 163 v~~~I~G~~aYg~Lk~E~GvHRvqRip~~es~gR~hTS~a-~V~----V~P~~~~~~~~i~-~~---dl~i~~~rssG~G 233 (357) +.+.|+---. .-...+.+||+..-.=-+|+ +-. +--.+..+.--|. ++ =|.|+-|=-=|.| T Consensus 160 i~i~v~~~~~----------~~~~~~~~FDa~~~nL~~F~a~f~~~~~~~~la~lI~~gl~ge~~~~~l~V~A~v~~G~g 229 (370) T TIGR02566 160 IEIRVTPLPA----------EDEKAIWSFDALRFNLRSFAAHFQPSKDLDALAKLIAQGLSGESGFVILEVTAYVRLGTG 229 (370) T ss_pred EEEEEEECCC----------CCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEEEEEEEECCC T ss_conf 7898875677----------776543342145531121023678647899999999997069885489999999981899 Q ss_pred CCCCCCCCCEEEEEECCCCEEE Q ss_conf 2000632121588964984488 Q gi|254780874|r 234 GQHVNTTDSAVRITHIPTGIMV 255 (357) Q Consensus 234 GQ~vN~t~savri~H~ptgi~v 255 (357) ||-| =|+=--| T Consensus 230 GqEV-----------yPSQel~ 240 (370) T TIGR02566 230 GQEV-----------YPSQELV 240 (370) T ss_pred CEEE-----------CCCCEEE T ss_conf 7101-----------4651035 No 128 >TIGR01461 greB transcription elongation factor GreB; InterPro: IPR006358 Bacterial GreA IPR006359 from INTERPRO and GreB promote transcription elongation by stimulating an endogenous, endonucleolytic transcript cleavage activity of the RNA polymerase, allowing RNA transcription to continue past template-encoded arresting sites. GreA and GreB are sequence homologues and have homologues in every known bacterial genome. GreA and GreB stimulate transcript cleavage in different ways; GreA induces cleavage of 3'-RNA fragments 23 nt in length and can only prevent the formation of arrested complexes, whereas GreB induces cleavage of fragments up to 18 nt in length and can rescue preexisting arrested complexes . A 15 Å resolution helical reconstruction of the Escherichia coli core RNA polymerase (RNAP)/GreB complex that allows fitting of high-resolution RNAP and GreB structures. The model of the complex reveals a remarkable binding mode for GreB; the globular C-terminal domain binds RNAP at the edge of the active site channel, while the N-terminal coiled-coil domain extends 45 Å into a channel directly to the RNAP active site. The results point to a key role for conserved acidic residues at the tip of the Gre factor coiled coil in modifying the RNAP active site to catalyse the transcript cleavage reaction, and mutational studies confirm that these positions are critical for Gre factor function. Functional differences between GreA and GreB correlate with the distribution of positively charged residues on one face of the N-terminal coiled coil.; GO: 0003677 DNA binding, 0003711 transcription elongation regulator activity, 0006355 regulation of transcription DNA-dependent. Probab=20.34 E-value=58 Score=13.14 Aligned_cols=60 Identities=18% Similarity=0.201 Sum_probs=44.9 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEECCCHHHHHHCCCCCEEEEEEC Q ss_conf 40378862268852178899999999999987649840013211354477211137775031112100024650267612 Q gi|254780874|r 110 KSCILEIRAGTGGSEAALFVGDLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIVATISGRGVFSRMKFESGVHRVQRVP 189 (357) Q Consensus 110 ~nailEI~aGaGG~EA~dwA~~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~G~~aYg~Lk~E~GvHRvqRip 189 (357) ..|-|+|--|-|-+. .|.|+=+.+=+.+ .--++|..+-|=.++|.|-|--=|+|-| T Consensus 87 FGAwV~le~~DG~~~---------------------r~rIVG~DEid~r---~n~ISidSPlARaL~kke~gdevvvnTP 142 (157) T TIGR01461 87 FGAWVELEDDDGREQ---------------------RYRIVGIDEIDTR---KNKISIDSPLARALIKKEVGDEVVVNTP 142 (157) T ss_pred EEEEEEEECCCCCCC---------------------CEEEEECCCCCCC---CCCEEEECHHHHHHHCCCCCCEEEEECC T ss_conf 225888653888320---------------------2688605521235---7720142667888630125877787578 Q ss_pred CCCC Q ss_conf 5456 Q gi|254780874|r 190 ATEA 193 (357) Q Consensus 190 ~~es 193 (357) .=+. T Consensus 143 aGe~ 146 (157) T TIGR01461 143 AGEE 146 (157) T ss_pred CCCC T ss_conf 8850 No 129 >PRK13562 acetolactate synthase 1 regulatory subunit; Provisional Probab=20.18 E-value=59 Score=13.12 Aligned_cols=37 Identities=11% Similarity=0.251 Sum_probs=29.4 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHCCCCCCCCCEEEEEEE Q ss_conf 99999999987649840013211354477211137775 Q gi|254780874|r 131 DLFRMYERYAALRKWKVEVLSSSDNDDGGYKEIVATIS 168 (357) Q Consensus 131 ~L~rMY~r~ae~~gwk~eii~~~~ge~aGik~v~~~I~ 168 (357) -|-|.-. -.-+++|.++-+..-+++..||...+|.+. T Consensus 15 vLNRIT~-lF~RrqfNI~sltVg~te~~gIS~mt~vv~ 51 (84) T PRK13562 15 TLNRITS-AFVRLQYNIDTLHVTHSEQPGISNMEIQVD 51 (84) T ss_pred HHHHHHH-HHHHHCCCEEEEEECCCCCCCCCCEEEEEE T ss_conf 8889899-999851782268863678988342699996 No 130 >KOG4657 consensus Probab=20.16 E-value=59 Score=13.12 Aligned_cols=97 Identities=15% Similarity=0.227 Sum_probs=47.0 Q ss_pred HHHHHHHHHHHHHHHCCCC--CHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 9999999999999853853--1579----999999998799999999999999998999999972037853566543322 Q gi|254780874|r 9 ICDLKNRFAEIELRMSESP--SVDA----YIKLTEEYAAISPIISKISIYDQKKQEEQDLHTVIGDHNSDSEIRDLAQIE 82 (357) Q Consensus 9 L~~l~~rl~ELE~~Lsdp~--~w~d----~~klsKe~s~L~~iv~~~~~~~~~~~ei~el~eLl~e~e~D~El~elAeeE 82 (357) ..++.+....+..-+.+|- ..+. .++++....++..++..+.++.+ ++..-.+|+.+. .+ ...-.+.| T Consensus 21 ~~~i~n~~s~~D~f~q~~r~~~~nS~~efar~lS~~~~e~e~l~~~l~eten---e~~~~neL~~ek-~~--~q~~ieqe 94 (246) T KOG4657 21 EKDIHNQRSKIDSFIQSPRRRSMNSLVEFARALSQSQVELENLKADLRETEN---ELVKVNELKTEK-EA--RQMGIEQE 94 (246) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHH-HH--HHHHHHHH T ss_conf 9999998988999998799999999999999988878999999999999999---999999999888-88--88878999 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEE Q ss_conf 23320005778889998640001135540378 Q gi|254780874|r 83 VLAIEKEIRELENEINYLLLPKDTDDDKSCIL 114 (357) Q Consensus 83 l~~l~~~l~~le~eL~~~LLpsde~D~~nail 114 (357) +..+..+++.+...++.+ +.++|+.-.|+ T Consensus 95 ik~~q~elEvl~~n~Q~l---keE~dd~keiI 123 (246) T KOG4657 95 IKATQSELEVLRRNLQLL---KEEKDDSKEII 123 (246) T ss_pred HHHHHHHHHHHHHHHHHH---HHHHHHHHHHH T ss_conf 888899999999988888---98751699999 No 131 >COG1162 Predicted GTPases [General function prediction only] Probab=20.00 E-value=50 Score=13.65 Aligned_cols=25 Identities=24% Similarity=0.470 Sum_probs=13.0 Q ss_pred CCCCCCCCCCCCCCCEEEEEECC-CCEEE Q ss_conf 28887520006321215889649-84488 Q gi|254780874|r 228 RASGSGGQHVNTTDSAVRITHIP-TGIMV 255 (357) Q Consensus 228 rssG~GGQ~vN~t~savri~H~p-tgi~v 255 (357) +....=|+| |-+++++.|+| .|.++ T Consensus 196 S~~~~rGkH---TTt~~~l~~l~~gG~ii 221 (301) T COG1162 196 SEKLGRGRH---TTTHVELFPLPGGGWII 221 (301) T ss_pred CCCCCCCCC---CCCEEEEEECCCCCEEE T ss_conf 044789887---63248999838998897 Done!