Query gi|254780143|ref|YP_003064556.1| DNA-directed RNA polymerase subunit beta [Candidatus Liberibacter asiaticus str. psy62] Match_columns 1386 No_of_seqs 242 out of 1573 Neff 7.7 Searched_HMMs 39220 Date Sun May 22 15:11:55 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780143.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR02013 rpoB DNA-directed RN 100.0 0 0 3744.8 82.9 1352 10-1363 1-1449(1449) 2 PRK09603 DNA-directed RNA poly 100.0 0 0 3176.6 96.1 1347 8-1369 6-1377(2890) 3 PRK00405 rpoB DNA-directed RNA 100.0 0 0 2886.6 85.0 1127 3-1365 1-1127(1127) 4 CHL00001 rpoB RNA polymerase b 100.0 0 0 2638.8 80.2 1040 21-1371 2-1058(1065) 5 COG0085 RpoB DNA-directed RNA 100.0 0 0 2590.7 72.5 1041 4-1369 2-1060(1060) 6 PRK08565 DNA-directed RNA poly 100.0 0 0 2012.3 71.2 937 25-1368 10-1100(1101) 7 cd00653 RNA_pol_B_RPB2 RNA pol 100.0 0 0 1833.3 63.5 801 29-1364 1-866 (866) 8 KOG0214 consensus 100.0 0 0 1602.8 35.4 1005 17-1366 6-1140(1141) 9 KOG0215 consensus 100.0 0 0 1481.7 29.3 912 26-1369 56-1152(1153) 10 TIGR03670 rpoB_arch DNA-direct 100.0 0 0 1248.3 28.1 506 594-1368 50-598 (599) 11 PRK07225 DNA-directed RNA poly 100.0 0 0 1234.5 28.6 505 594-1368 56-604 (605) 12 KOG0216 consensus 100.0 0 0 1158.1 46.2 929 22-1365 11-1110(1111) 13 pfam00562 RNA_pol_Rpb2_6 RNA p 100.0 0 0 958.1 22.8 369 673-1288 1-373 (373) 14 PRK09606 DNA-directed RNA poly 100.0 0 0 678.9 33.4 417 26-602 15-480 (491) 15 pfam04563 RNA_pol_Rpb2_1 RNA p 100.0 0 0 486.3 26.0 355 28-520 1-394 (394) 16 pfam04561 RNA_pol_Rpb2_2 RNA p 100.0 2.3E-34 5.9E-39 265.4 15.9 177 159-465 1-180 (180) 17 pfam04560 RNA_pol_Rpb2_7 RNA p 99.9 8.1E-27 2.1E-31 211.0 6.8 77 1290-1366 1-78 (78) 18 pfam04565 RNA_pol_Rpb2_3 RNA p 99.8 3E-21 7.6E-26 170.8 3.5 68 524-593 1-68 (68) 19 KOG0214 consensus 99.4 1.4E-15 3.6E-20 129.9 -9.1 309 903-1321 736-1047(1141) 20 pfam10385 RNA_pol_Rpb2_45 RNA 98.9 2.4E-09 6.1E-14 84.9 4.7 66 602-667 1-66 (66) 21 COG0085 RpoB DNA-directed RNA 98.5 2.3E-07 5.8E-12 70.6 6.4 70 609-706 576-645 (1060) 22 PRK09603 DNA-directed RNA poly 97.8 0.00013 3.2E-09 50.8 7.3 127 777-903 876-1021(2890) 23 PRK08565 DNA-directed RNA poly 97.6 1.8E-05 4.6E-10 56.9 0.9 193 679-942 537-734 (1101) 24 TIGR02013 rpoB DNA-directed RN 96.2 0.00031 8E-09 48.0 -2.7 266 722-1019 705-995 (1449) 25 TIGR02386 rpoC_TIGR DNA-direct 88.7 0.52 1.3E-05 24.7 3.5 14 882-895 1289-1302(1552) 26 cd06232 Peptidase_M14-like_5 P 87.4 0.11 2.7E-06 29.7 -0.8 49 1082-1135 103-151 (240) 27 pfam04941 LEF-8 Late expressio 87.3 0.5 1.3E-05 24.9 2.6 31 1072-1102 706-736 (748) 28 KOG0215 consensus 86.3 0.38 9.8E-06 25.7 1.6 59 410-477 441-499 (1153) 29 TIGR02876 spore_yqfD sporulati 72.9 2 5E-05 20.6 1.6 21 779-799 211-231 (406) 30 COG4942 Membrane-bound metallo 70.4 3.7 9.5E-05 18.6 2.5 62 721-800 328-393 (420) 31 TIGR00432 arcsn_tRNA_tgt archa 70.1 1.7 4.3E-05 21.1 0.7 45 96-145 44-91 (658) 32 pfam06898 YqfD Putative stage 68.5 3.9 9.9E-05 18.4 2.3 22 779-800 209-230 (383) 33 PRK13487 chemoreceptor glutami 67.3 6.2 0.00016 17.0 3.1 27 302-328 147-173 (201) 34 TIGR01945 rnfC electron transp 64.2 3.7 9.4E-05 18.6 1.5 107 774-902 39-154 (444) 35 TIGR02248 mutH_TIGR DNA mismat 57.8 3.7 9.4E-05 18.6 0.6 14 200-213 113-126 (220) 36 KOG0318 consensus 56.0 8.7 0.00022 15.9 2.2 41 536-583 318-358 (603) 37 pfam04567 RNA_pol_Rpb2_5 RNA p 55.1 6.2 0.00016 16.9 1.4 24 610-633 6-29 (46) 38 KOG0677 consensus 54.7 6.1 0.00016 17.0 1.3 55 1304-1363 272-327 (389) 39 pfam04564 U-box U-box domain. 54.6 5.8 0.00015 17.2 1.1 44 1236-1284 5-50 (74) 40 TIGR01812 sdhA_frdA_Gneg succi 54.4 10 0.00027 15.3 2.4 26 790-815 395-424 (636) 41 pfam07508 Recombinase Recombin 52.0 12 0.00031 14.9 3.9 48 1212-1259 16-63 (101) 42 pfam11963 DUF3477 Protein of u 50.0 10 0.00026 15.4 1.8 12 834-845 196-207 (355) 43 PRK10871 nlpD lipoprotein NlpD 47.9 14 0.00036 14.4 4.0 20 777-796 324-343 (374) 44 PRK11637 hypothetical protein; 46.4 15 0.00037 14.3 2.8 62 720-796 311-373 (404) 45 PRK10556 hypothetical protein; 45.4 7.9 0.0002 16.2 0.6 36 1107-1142 76-111 (111) 46 COG0511 AccB Biotin carboxyl c 44.9 15 0.00039 14.1 2.2 35 779-813 83-118 (140) 47 TIGR01176 fum_red_Fp fumarate 44.3 12 0.00031 14.8 1.5 14 870-883 553-566 (585) 48 KOG0772 consensus 44.2 11 0.00028 15.1 1.2 13 1116-1128 511-523 (641) 49 PRK13497 chemoreceptor glutami 40.2 18 0.00046 13.6 4.2 43 1231-1285 137-179 (184) 50 pfam10258 RNA_GG_bind PHAX RNA 40.1 10 0.00027 15.3 0.6 20 140-166 43-63 (87) 51 TIGR00407 proA gamma-glutamyl 39.9 9.7 0.00025 15.6 0.4 42 434-475 67-112 (415) 52 pfam11783 Cytochrome_cB Cytoch 39.8 18 0.00047 13.6 3.8 44 1214-1271 113-156 (166) 53 PRK04049 30S ribosomal protein 39.6 18 0.00047 13.6 1.9 18 1089-1106 109-126 (127) 54 TIGR02388 rpoC2_cyan DNA-direc 39.3 19 0.00047 13.5 3.1 52 297-348 232-283 (1252) 55 PRK06302 acetyl-CoA carboxylas 39.2 19 0.00048 13.5 2.9 38 778-815 98-136 (155) 56 cd01778 RASSF1_RA RASSF1 (also 37.9 16 0.00042 13.9 1.3 23 1122-1144 18-40 (96) 57 TIGR02645 ARCH_P_rylase putati 37.7 12 0.0003 15.0 0.5 53 725-796 420-473 (499) 58 TIGR02002 PTS-II-BC-glcB PTS s 37.4 16 0.00041 14.0 1.2 28 862-889 471-498 (518) 59 cd01787 GRB7_RA Grb7_RA The R 36.9 20 0.00052 13.3 3.2 32 1111-1147 8-39 (85) 60 cd01784 rasfadin_RA rasfadin_R 36.2 21 0.00053 13.2 3.4 30 1111-1145 7-37 (87) 61 TIGR03477 DMSO_red_II_gam DMSO 35.8 21 0.00053 13.2 1.6 24 803-826 168-191 (205) 62 pfam07304 SRA1 Steroid recepto 35.7 21 0.00054 13.1 4.6 12 473-484 125-136 (157) 63 cd02809 alpha_hydroxyacid_oxid 35.2 12 0.0003 14.9 0.2 17 328-344 181-197 (299) 64 PRK11633 hypothetical protein; 34.7 22 0.00056 13.0 2.8 13 1217-1229 156-168 (218) 65 PRK00750 lysK lysyl-tRNA synth 33.9 22 0.00057 12.9 2.3 50 1296-1354 434-486 (513) 66 COG2352 Ppc Phosphoenolpyruvat 33.7 23 0.00058 12.9 8.4 15 1084-1098 719-733 (910) 67 PRK09613 thiH thiamine biosynt 33.5 23 0.00058 12.9 2.4 18 429-446 113-130 (471) 68 COG4172 ABC-type uncharacteriz 31.9 20 0.00051 13.3 1.0 11 575-585 311-321 (534) 69 TIGR02782 TrbB_P P-type conjug 31.9 14 0.00035 14.5 0.1 21 470-490 245-265 (315) 70 TIGR02924 ICDH_alpha isocitrat 31.7 24 0.00062 12.7 1.4 70 428-520 269-340 (481) 71 KOG3439 consensus 31.2 25 0.00063 12.6 2.1 21 131-152 37-57 (116) 72 KOG0158 consensus 30.6 22 0.00056 13.0 1.0 21 464-484 289-309 (499) 73 TIGR01926 peroxid_rel uncharac 30.5 25 0.00065 12.6 1.8 16 366-381 112-127 (179) 74 PRK03427 cell division protein 30.5 14 0.00036 14.4 -0.0 12 813-824 199-210 (331) 75 PRK00269 zipA cell division pr 30.3 14 0.00036 14.4 -0.0 18 811-828 157-174 (295) 76 PHA00430 tail fiber protein 30.3 26 0.00065 12.5 2.7 10 216-225 92-101 (568) 77 PRK01741 cell division protein 29.2 16 0.0004 14.1 0.0 14 813-826 217-230 (342) 78 PRK13534 7-cyano-7-deazaguanin 29.0 22 0.00056 13.0 0.7 54 86-146 39-93 (630) 79 PRK05035 electron transport co 28.6 27 0.00069 12.3 2.9 18 142-159 55-72 (725) 80 pfam09400 DUF2002 Protein of u 28.1 28 0.00071 12.3 1.2 32 1108-1139 77-108 (110) 81 cd04737 LOX_like_FMN L-Lactate 27.9 13 0.00033 14.6 -0.6 51 1289-1355 282-332 (351) 82 pfam04221 RelB RelB antitoxin. 27.7 28 0.00072 12.2 2.5 35 200-234 2-36 (83) 83 TIGR02384 RelB_DinJ addiction 27.7 28 0.00072 12.2 3.0 34 199-232 5-38 (96) 84 PRK02597 DNA-directed RNA poly 27.2 29 0.00073 12.2 3.3 47 299-346 240-286 (1295) 85 PRK13533 7-cyano-7-deazaguanin 27.2 22 0.00055 13.0 0.5 50 86-146 40-92 (486) 86 PHA01630 putative group 1 glyc 26.8 9.3 0.00024 15.7 -1.5 14 28-41 8-21 (333) 87 pfam04104 DNA_primase_lrg Euka 26.8 29 0.00074 12.1 2.6 19 213-231 124-142 (217) 88 pfam11547 E3_UbLigase_EDD E3 u 26.7 29 0.00075 12.1 1.2 18 1346-1363 22-39 (51) 89 KOG1647 consensus 26.5 30 0.00075 12.1 1.4 10 475-484 195-204 (255) 90 PRK04335 cell division protein 26.5 18 0.00046 13.6 -0.1 16 812-827 185-200 (319) 91 PRK13133 consensus 25.9 30 0.00077 12.0 2.0 17 1089-1105 115-131 (267) 92 TIGR02881 spore_V_K stage V sp 25.6 12 0.00032 14.8 -1.0 22 401-422 191-212 (261) 93 pfam02449 Glyco_hydro_42 Beta- 25.5 31 0.00078 12.0 2.4 81 1274-1357 292-376 (376) 94 PRK05352 Na(+)-translocating N 25.3 31 0.00079 11.9 1.3 23 144-166 44-70 (448) 95 PRK09282 pyruvate carboxylase 25.2 31 0.00079 11.9 4.0 25 779-803 524-548 (580) 96 COG0427 ACH1 Acetyl-CoA hydrol 25.1 31 0.0008 11.9 2.3 43 1096-1139 404-452 (501) 97 KOG4507 consensus 24.5 32 0.00081 11.9 0.9 28 549-576 512-539 (886) 98 TIGR01163 rpe ribulose-phospha 24.5 25 0.00064 12.6 0.4 29 316-344 166-194 (216) 99 TIGR01677 pln_FAD_oxido plant- 24.1 17 0.00042 13.9 -0.6 17 568-584 409-425 (577) 100 TIGR01369 CPSaseII_lrg carbamo 24.1 33 0.00083 11.8 2.3 82 866-947 825-917 (1089) 101 PRK04570 cell division protein 23.7 22 0.00056 13.0 -0.0 13 80-92 24-36 (244) 102 PRK05070 DNA mismatch repair p 22.9 34 0.00087 11.6 1.0 14 201-214 112-125 (218) 103 pfam06925 MGDG_synth Monogalac 22.5 33 0.00085 11.7 0.7 21 1116-1136 88-108 (169) 104 pfam05059 Orbi_VP4 Orbivirus V 22.2 35 0.0009 11.5 4.1 19 88-106 34-52 (642) 105 PRK11029 FtsH protease regulat 22.1 35 0.0009 11.5 2.8 29 139-167 16-56 (334) 106 smart00454 SAM Sterile alpha m 21.7 11 0.00029 15.1 -1.8 30 1216-1245 6-35 (68) 107 PRK13121 consensus 21.6 36 0.00092 11.4 2.0 18 1088-1105 112-129 (265) 108 COG1386 scpB Chromosome segreg 21.6 36 0.00092 11.4 1.1 11 1295-1305 93-103 (184) 109 TIGR01217 ac_ac_CoA_syn acetoa 21.5 36 0.00093 11.4 3.6 28 1090-1121 613-640 (676) 110 KOG0030 consensus 21.4 37 0.00093 11.4 1.1 18 215-232 34-53 (152) 111 PRK12999 pyruvate carboxylase; 21.4 37 0.00093 11.4 4.1 21 1240-1260 1011-1031(1147) 112 TIGR00003 TIGR00003 copper ion 21.3 37 0.00094 11.4 2.2 23 1204-1229 42-64 (66) 113 PRK13139 consensus 21.1 37 0.00094 11.4 2.0 45 1085-1131 106-161 (254) 114 pfam01070 FMN_dh FMN-dependent 21.1 32 0.00081 11.8 0.3 46 326-381 176-221 (301) 115 PRK13124 consensus 21.0 37 0.00095 11.4 2.0 22 1088-1111 103-124 (257) 116 pfam08587 UBA_2 Ubiquitin asso 20.9 37 0.00095 11.3 2.5 20 213-232 5-24 (46) 117 PRK13134 consensus 20.5 38 0.00097 11.3 2.0 23 1088-1112 113-135 (257) 118 cd03404 Band_7_HflK Band_7_Hfl 20.4 38 0.00098 11.3 2.8 27 140-166 12-43 (266) 119 cd02552 PseudoU_synth_TruD_lik 20.2 38 0.00098 11.2 3.2 16 213-228 47-62 (232) 120 TIGR00337 PyrG CTP synthase; I 20.2 39 0.00098 11.2 1.3 52 464-515 151-206 (571) 121 pfam02037 SAP SAP domain. The 20.1 39 0.00099 11.2 2.8 22 1212-1233 2-23 (35) No 1 >TIGR02013 rpoB DNA-directed RNA polymerase, beta subunit; InterPro: IPR010243 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. This entry describes orthologues of the beta subunit of bacterial RNA polymerase. The core enzyme consists of two alpha chains, one beta chain, and one beta' subunit.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006350 transcription. Probab=100.00 E-value=0 Score=3744.77 Aligned_cols=1352 Identities=55% Similarity=0.889 Sum_probs=1264.3 Q ss_pred CEEECCCHHHCCCCCCCCCHHHHHHHHHHHHHHCC---------CCCCCCCCHHHHHHHHHHCCCCCCCCCEEEEEEE-E Q ss_conf 63321013104666788708899999999986246---------6522111225899976637807679858999997-8 Q gi|254780143|r 10 LGRVRKFFGKNPEIIDIPDLIEVQKASYDHFLMMN---------IAPDERPNEGLQAAFKSVFPITAFSGAAMLEFVS-Y 79 (1386) Q Consensus 10 ~~~~R~~f~k~~~~~~~P~Li~iQ~~Sf~~Flq~~---------~~~~~r~~~GL~~v~~~~fPI~d~~~~~~Lef~~-y 79 (1386) |++.|.||+|+++.+++||||++|++||+||||.+ ..++.|+++||++||+|+|||+|++|++.|||++ | T Consensus 1 k~~~R~~f~ki~~~l~~P~Lle~Q~~Sy~~FL~~~~~~~~~A~~~~~~~R~~~GLe~vF~~~FPI~d~~g~~~LEY~sDY 80 (1449) T TIGR02013 1 KKRERIDFAKIPEILELPNLLEVQLDSYDWFLQSDRNKASGADETPPEERKEEGLEEVFKSIFPIEDYNGNITLEYLSDY 80 (1449) T ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCEEEEEECCC T ss_conf 98720143435543568618899999999875201246421344778874234699997522884024687688870201 Q ss_pred EECCCCCCHH-HHHHCCCCEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCEECCCCCEEECCEEEEEEEEEC Q ss_conf 9809858899-999839975335899999999317876655200012236787510000268962898682146866512 Q gi|254780143|r 80 EFDPPKFDVD-DCLWRDLTYAVPLKITLRLIVFDVDEFTGAKSIKDIKEQSIYMGDLPLMTKDGTFVIKGIQRIVVSQLH 158 (1386) Q Consensus 80 ~l~~Pk~tp~-ECRlR~lTYsapL~V~i~l~v~~~~~~~~~k~~~~ike~~V~lG~IPiMt~~GyFIING~ERVIVsQl~ 158 (1386) +|++|||+++ ||..||+|||+||+|+++|++++.+.+++.+.+++|++|+||||+||+||++|||||||+||||||||| T Consensus 81 ~l~ePky~v~~EC~~RG~TYs~pLkvk~rL~~~e~~~e~g~~~~~eIk~q~VymG~iP~MTd~GTFIINGaERVvVSQlH 160 (1449) T TIGR02013 81 ELGEPKYSVEEECKERGLTYSAPLKVKLRLINKEKDEETGEKNTKEIKEQDVYMGDIPLMTDRGTFIINGAERVVVSQLH 160 (1449) T ss_pred CCCCCCCCHHHHHHHCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCEEEECEEEEEEEEEE T ss_conf 16898878789986258950202348898877477787777353563464566437676457854688041888887778 Q ss_pred CCCCEEECCCCCCCCCCCCEEEEEEEECCCCCEEEEEEC-CCCEEEEEECCCC---CHHHHHHHHHCCCCHHHHHHHHCC Q ss_conf 278521202346515778567999981188723689975-8982999971878---703998998809984799997387 Q gi|254780143|r 159 RSPGIHFDHDKGRASLSGKLLYACRIIPDQGLWMDIEFD-SKDIIHVRIDRRR---KVPVTSFLMALGMDSEEILSTFYP 234 (1386) Q Consensus 159 RSPGVyf~~~k~k~~~s~k~~ysa~IIP~RGSwLe~e~d-~kd~iyvrIdr~r---KIPi~ilLrALG~ssdeIl~~f~~ 234 (1386) |||||||+++++|++.+||.+|||+|||+|||||||||| +||.||||||||| |+|+|||||||||++|+||..||. T Consensus 161 RSPGV~F~~~~~K~~~~Gk~~fsa~IIP~RGSWLeFe~Dq~kD~~yvrIDrkrrrrK~~aTvlLrAlg~~~~~~i~~~f~ 240 (1449) T TIGR02013 161 RSPGVFFSSEKDKHTKKGKKLFSARIIPYRGSWLEFETDQKKDLLYVRIDRKRRRRKLPATVLLRALGYEKDEDIIEFFD 240 (1449) T ss_pred ECCCEEECCCCCCCCCCCCEEEEEEEECCCCCCHHHCCCCCCCEEEEEECCCCCCCCCCHHHHHHHCCCHHHHHHHHHHC T ss_conf 55857616644565766756788888078687010100452475899976600354660413332508302478874103 Q ss_pred CEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEECCCCHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCCCCCCC Q ss_conf 40577306731023545666421012232112445845440221020799887876360001026677347410000136 Q gi|254780143|r 235 KIVYSQRGDFWCFPLSAADLMVGAKVSSSLVDIDTGEQVIESGKKLTSGLLKSLKEKGVKFLGITSDCLCGLYVAEDIVN 314 (1386) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~l~~~~~~~~~~d 314 (1386) ........+.+...+..... . .+..+++.+.+++..++..++.++++..+.+.+...+........+.......+..+ T Consensus 241 ~v~~~~~~~~~~~~~~~~~~-r-~~~eFdl~d~~~g~~~~~~g~~itar~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (1449) T TIGR02013 241 LVEKEKVSKKLLKKYVPERL-R-ERAEFDLKDADGGKILLAKGKKITARIKKKLENKSLKRELLLEELLAEDLVDEDAGE 318 (1449) T ss_pred EEEEEEECCHHHHHHHHHHC-C-CCCCCCCCCCCCCEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCC T ss_conf 02355403202345533302-4-666776324888879842785004899998731313446666888866653333145 Q ss_pred CCCCEEEE---ECCCCCCHHHHHHHHHCCCCCEEEEEECC---CCCCCCCCHHHHCCCCCCHHH---HHHHHHHHHCCCC Q ss_conf 66776999---62345998999999865654102442024---445520000011023458899---9999887605776 Q gi|254780143|r 315 GETGEIYI---EAGDVIDEKSLEEIFHSEIRDIPILYVDS---VNNNAYIRNTLVTDKNKDRKD---ALLDIYRVMRPGD 385 (1386) Q Consensus 315 ~~~gei~~---~~~~~~~~~~l~~~~~~~~~~~~~l~~~~---~~~~~~i~~~~~~d~~~~~~e---Al~~I~k~lr~~~ 385 (1386) ....+... +.........+.....+......+..... .....++.+++.+|.+.+.++ ||..||++||||+ T Consensus 319 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tL~kD~~~~~~enalAL~~iY~~LRPGE 398 (1449) T TIGR02013 319 VVDAEVILVALDEEALESEEILKQILFSGIDTLILERLGSIIKSVENEYIRNTLEKDPTASEEENALALVEIYRKLRPGE 398 (1449) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCC T ss_conf 11377899998532046799999986111114442026640015642799998622787583676789999998627778 Q ss_pred CCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCC------------------------------------CC Q ss_conf 310456888862024530233345555777654203667767------------------------------------70 Q gi|254780143|r 386 VSTFSVAESMFNFLFFDSDKYDLSTVGRVKMNMRLNLDTPDD------------------------------------VR 429 (1386) Q Consensus 386 ~~~~~~~~~~~~~~~~~~~~y~l~~vgr~~~n~~l~~~~~~~------------------------------------~~ 429 (1386) |+|.+.|++++.++||+++||||+.|||||+|++|..+.++. .. T Consensus 399 PpT~~aA~~~l~~lFFdpkRYDL~~VGRYK~NkKL~~~~~~~~~~L~~~~~~~E~~~~~~E~~~~~~~~~~~~~~~~~~~ 478 (1449) T TIGR02013 399 PPTVEAARSLLESLFFDPKRYDLGRVGRYKLNKKLGLDVPDESAYLEKRLELKELSDILYERLLQLISTLDLEEIDPDIG 478 (1449) T ss_pred CCCHHHHHHHHHHHCCCHHHCCCCCCCHHHHHHHHCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCE T ss_conf 40268999999862788214277656602231543457731357887403561047899988877650588664577611 Q ss_pred EECHHHHHHHHHHHHHHHCC------------CCCCCCCHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-- Q ss_conf 66189988889888763048------------7643440101533233335787898888889999887653113443-- Q gi|254780143|r 430 HIRKEDIIAIIKILVDLRNG------------KGTIDDIDNLGNRRVRSVGEMLKNQYRLGLLRMERSIKERISSVDI-- 495 (1386) Q Consensus 430 ~Lt~~d~~~~i~~L~~l~~g------------~~~~DdiDhlgnkRvr~vgeLl~~~fr~~l~rl~r~i~~~~~~~~~-- 495 (1386) +||.+|++++++||+.|.+| .+..||||||||||+|+|||||+||||+||.||+|.++++|+..+. T Consensus 479 ~Lt~eDii~~iKyLi~l~~g~~~~~~~~G~~~~g~~DDIDHLgNRRvRsVGELl~Nq~r~GL~RMer~vrERM~~~D~~~ 558 (1449) T TIGR02013 479 VLTKEDIIATIKYLIKLRNGEEEVVNVIGLEIKGEIDDIDHLGNRRVRSVGELLQNQFRVGLARMERIVRERMTTQDTAF 558 (1449) T ss_pred EECCHHHHHHHHHHEEECCCCEEEEEECCCCCCEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHEECCCCCCCCC T ss_conf 61313563310322220278702401016642223564105767614117899888788766652101100267332110 Q ss_pred CCCCCCCCCCHHHHHHHHHHHCCCCCCEEECCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCC Q ss_conf 34352112220234655542026675035415542102322001122344444322346655432211110444303566 Q gi|254780143|r 496 DSVMPQDLINAKPVVSAVCEFFCSSQLSQLEEHVNSLSRITHTRRLSALGQGGVARARAGVEMRDVHPTHYGRICPAETS 575 (1386) Q Consensus 496 ~~~~~~~~in~~~i~~~i~~ff~t~~lsq~ld~~n~ls~lth~RR~s~lgpggl~r~~~~~evR~ih~s~~GriCPieTP 575 (1386) ++++|+++||+++|.+++++||||||||||||||||||||||||||||||||||+||||||||||||||||||||||||| T Consensus 559 dt~tP~dLiN~kp~~a~ikeFFG~SQLSQFMDQtNPLaElTHKRRLSALGPGGL~RERAGFEVRDVH~tHYGRiCPIETP 638 (1449) T TIGR02013 559 DTLTPQDLINAKPIVAAIKEFFGSSQLSQFMDQTNPLAELTHKRRLSALGPGGLTRERAGFEVRDVHPTHYGRICPIETP 638 (1449) T ss_pred CCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCHHHCHHHHHCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCC T ss_conf 23277431130024323223222786540004777532120011003437778870116505631178888740775788 Q ss_pred CCCCCEEEECEEEEEEECCCCCCCCEEEEEECCCCC--CCEEECCHHHHCCEEEECCCCEECCCCCCCCCCEEECCCC-C Q ss_conf 765210210001224424687644106986225534--8166429677376379616534114684022200000023-3 Q gi|254780143|r 576 EGHNIGLVSSLTSFARVNAYGFIETPYRKVCDGKVT--NDVVYLSAMEEENRYIAQANSSLDEDGSFTEELVFCRCAG-E 652 (1386) Q Consensus 576 EG~n~GLv~~la~~a~in~~g~ie~py~~v~~~~~~--~~i~~l~~~~e~~~~Ia~~~~~l~~~~~~~~~~~~~r~~~-~ 652 (1386) ||||||||+|||.||+||.+||||||||+|.++.+. ++|.||||++|+++.|||+|+.++++|.+....+.||+++ + T Consensus 639 EGPNIGLI~SLs~yArvN~yGFIETPYr~V~~g~v~~~~~v~YLtA~eEd~~~iAQa~~~lDe~g~i~~d~V~~R~~G~e 718 (1449) T TIGR02013 639 EGPNIGLINSLSTYARVNEYGFIETPYRKVKDGKVVVDDEVDYLTADEEDNYVIAQANAPLDENGRIVEDLVVARYRGDE 718 (1449) T ss_pred CCCCHHHHHHHHHCEEECCCCCCCCCCEEEECCEEEECCCCEECCHHHHCCEEEECCCCCCCCCCCCCCCEEEEEECCCC T ss_conf 88861134323020055577751367678745323333641343512236716703544318897052356888654773 Q ss_pred CCCCCHHHEEECCCCCCCEEEECCCCCCCHHHCCHHHHHHHHHHHHHHHCCCCCCCCEECCCCCHHHHHHCCCEEECCCC Q ss_conf 33225787220236721102311233320110100221122234443210136654111266201110106530102124 Q gi|254780143|r 653 EILVPREKIDFIDASPKQVVSIAASLIPFLENDDSNRVLMGCNMQRQAVPLLKAEAPFVGTGMESVVAKSSGAAIVAKRA 732 (1386) Q Consensus 653 ~~~~~~~~v~~~~i~p~~i~sv~aslIPflehdda~R~l~g~nm~rQav~l~~~~~~~v~tg~E~~~~~~s~~~i~a~~~ 732 (1386) +..+.++.|+||||||.|++|||||||||||||||||||||||||||||||+.+++|+||||||.++|.|||.+|+|+.. T Consensus 719 ~~~~~~~~VdyMDVSP~Q~VSVaAaLIPFLEHDDANRALMGsNMQRQAVPLL~seaP~VGTGmE~~~A~DSG~~i~A~~~ 798 (1449) T TIGR02013 719 ITLVSPDEVDYMDVSPKQIVSVAAALIPFLEHDDANRALMGSNMQRQAVPLLRSEAPLVGTGMEAKVARDSGAVIVAKRA 798 (1449) T ss_pred CCCCCCCCEEEEEECCHHHHHHHHHCCCCCCCCHHHHHHHHCCCHHHCCCCCCCCCCCCCCHHHHHHHHCCCEEEEECCC T ss_conf 22116760247651832355665542633235414566541253123677787798823202789886235408997069 Q ss_pred CCCCCCCCCCEEEECCCCCCCCCCC------CCEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCEECCCCCCCCCCCC Q ss_conf 4343356553025215665564446------6301114655544547322445304479772078520355223578602 Q gi|254780143|r 733 GIVEQVDAIRIVIRSVEGDLDPSTS------GVDIYRLMKFQRSNQNTCVNQRPLVKVGDEVRRNDIIADGPSTDLGDLA 806 (1386) Q Consensus 733 g~v~~vd~~~i~i~~~~~~~~~~~~------~~~~y~~~~~~~~~~~~~~~q~p~V~~g~~~~~~~~l~~~~~~~~~el~ 806 (1386) |+|.|||+.+|+|+........... ...+|+|.||.||||+||+||+|||+.||.|.+||||||||||+.|||| T Consensus 799 GvV~~Vda~~I~v~~~~~~~~~~~~g~DPd~~~~~Y~L~Ky~RSNQ~TC~nQ~PiV~~GDrV~~GdvlADGPsT~~GELA 878 (1449) T TIGR02013 799 GVVEYVDAKRIVVRYKEKEEEETVSGDDPDAAIDIYRLLKYQRSNQDTCINQRPIVSVGDRVEAGDVLADGPSTDLGELA 878 (1449) T ss_pred CEEEEEECCEEEEEECCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCEECCEEECCCCCEECCCCEEECCCCCCCCCCC T ss_conf 78999847788993147766655577883220257504676314788401453550148681021277347666443201 Q ss_pred CCCCCEEEEEECCCCCCCCCEECCHHHHHCCCCCEEEEEEEEHHHHHCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCC Q ss_conf 22375155531355444442000013442587310346665311211478840024666546867841024137413773 Q gi|254780143|r 807 LGRNMLVAFMPWHGYNFEDSMLISERMVSEDVFTSIHIEEFEVMARDTKLGPEEITRDIPNVSEEGLKNIDECGIICVGA 886 (1386) Q Consensus 807 ~G~N~~VA~m~~~GYN~EDaiiin~~~v~rg~~~s~h~~~y~~~~~~~~~g~~~~~~~~~~~~~~~~~~ld~~Giv~~G~ 886 (1386) ||+|++||||||+|||||||||||+|+|++|+||||||+||++++|+||+|+||||+||||||+++++||||+|||+||| T Consensus 879 LGrNvlVAFMPW~GYNyEDAIliSERlVkdDvFTSIHI~E~e~~aRdTKLG~EEiTrDIPNVsE~ALrnLDE~GIvrIGA 958 (1449) T TIGR02013 879 LGRNVLVAFMPWNGYNYEDAILISERLVKDDVFTSIHIEEFEVEARDTKLGPEEITRDIPNVSEEALRNLDENGIVRIGA 958 (1449) T ss_pred CCCCEEEEEECCCCCCHHHHHHHHHHHEECCCCEEEEEEEEEECCEECCCCCCCCCCCCCCCCHHHHHCCCCCCEEEEEE T ss_conf 16710367522788653356355000101376203789999830103478870123457984088882589577588733 Q ss_pred EECCCCCEEECCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 31367611101246777767703466543035555443233202278853220012101456542026678888999999 Q gi|254780143|r 887 EVNPGDILVGKITPKGESPMTPEEKLLRAIFGEKAVDVRDTSLRVPSGVSGTVVDVRIFNRHGIDKNERSISVEREQIEL 966 (1386) Q Consensus 887 ~V~~gDilvgk~tp~~~~~~~pe~~~l~~i~~~~~~~~~d~~~~~~~g~~g~v~~~~~~~r~~~~~~~~~~~~~~~~~~~ 966 (1386) +|++|||||||+|||+|+++||||+|||||||+||.||+|+||++|||++|+|+||++|+|.+..++.+...++.+.+.+ T Consensus 959 eV~~GDILVGKvTPKGEs~~TPEEkLLRAIFGEKA~dVrD~SL~vP~G~~G~ViDVkvF~R~g~~K~~~~~~~~~~~~~k 1038 (1449) T TIGR02013 959 EVKAGDILVGKVTPKGESELTPEEKLLRAIFGEKARDVRDTSLRVPPGVEGTVIDVKVFSREGKEKDKRDLKIEEEELDK 1038 (1449) T ss_pred EECCCCEEEEEECCCCCCCCCHHHHHHHHCCCCCHHHHCCCCCCCCCCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHHH T ss_conf 87077477721218898888756775420033411455276630579972079986885401233306899999999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC--CH------HHHHCCCHHHHHE-EEECCHHHHHHHHH Q ss_conf 987288899988788898888862585345665555543211--35------7640148543100-14158678999999 Q gi|254780143|r 967 LARDKDDEQVILDRNIYSRLMEILCGQNAVSGPKGFKKSTVL--SS------DLISEYPRSQWWQ-FAVQDEKVQRNVES 1037 (1386) Q Consensus 967 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~------~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 1037 (1386) ...++.++..+.......++...+..............+... .. ..+.......+.. +...++........ T Consensus 1039 ~~~~~~~~l~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lle~~~L~~l~~~~~~~~~~~~~~ 1118 (1449) T TIGR02013 1039 LKKKREDELTILLREELKRLLKLLLLAIVESALSVELRGEGEAEQEAKLEKAEWLELLESISLSDLLKLEAKENENLLEI 1118 (1449) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 76438999999999999888765545321242122235311234145554443466653343566530301566789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCE Q ss_conf 99989999999999989887631258867747276999999875588552314336678735888630000787935871 Q gi|254780143|r 1038 LKVQYETSKSILEDRFKNKIEKIQWGDDMPPGVLRVVKVFVAMKRPIQSGDKMAGRHGNKGIVSRILPCEDMPFLKDGTP 1117 (1386) Q Consensus 1038 ~~~~~~~~~~~~~~~~~~k~~ki~~~~~~~~gv~~~vKV~ir~~R~p~iGDKfasRHGqKGVis~i~p~eDMPf~~dG~~ 1117 (1386) +...+...+.......+.|..++..||+|||||+++||||||+||++|+|||||||||||||||+|+|+|||||++|||| T Consensus 1119 ~~~~~~~~k~~~~~~~~~k~~~~~~GDeLppGV~~~VkVYiA~KRKiqvGDKMAGRHGNKGvvSkIlP~EDMPfL~DGtP 1198 (1449) T TIGR02013 1119 LREYFLELKKLLDERKEEKKSKLEEGDELPPGVNKLVKVYIAQKRKIQVGDKMAGRHGNKGVVSKILPIEDMPFLEDGTP 1198 (1449) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCE T ss_conf 99999888889887888887652178878765236889988861205888843555578415655245247786954387 Q ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCH---HHHH Q ss_conf 5698668986750708999999999999871962243334322221024677787763154322100001352---5677 Q gi|254780143|r 1118 VDIVLNPLGVPSRMNVGQIFETHLGWACVGLGKKIKSLINDYKANGDISPLRSFLEKVIGTGSHTEKISDYDD---DSVL 1194 (1386) Q Consensus 1118 pDIIlNPhgvPSRMtIGqllE~~lGka~~~~G~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 1194 (1386) +|||||||||||||||||+|||||||||+.+|.++..+.-+.............+..++..........++.+ +.++ T Consensus 1199 VDivLNPLGVPSRMNiGQilE~hLG~Ag~~~g~~i~~~le~~q~~~~k~~~~~~l~~~~a~~e~~pd~~~~~enCde~~~ 1278 (1449) T TIGR02013 1199 VDIVLNPLGVPSRMNIGQILETHLGWAGKKLGKQIARMLEDKQKDAKKELLVKRLEELNAINEKDPDTIHALENCDEELV 1278 (1449) T ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHH T ss_conf 77897887897657267999999999986206478888898766789999999999716842014135455543025654 Q ss_pred HHHHHHCCCCEECCCCCCCCCHHHHHHHHHHCCCCCC--CCEEEECCCCCCCCCCCEEEEEEEEEECCCHHHHHCCCCCC Q ss_conf 7665302684002565578899999999998686899--86999868988402685048846454101102110000236 Q gi|254780143|r 1195 RVAEQWKSGVPVSTPVFDGADEEAINSMLRMADLDES--GQSILYDGLTGEPFDRPVTVGYIYMLKLNHMVSDKVYARST 1272 (1386) Q Consensus 1195 ~~~~~~~~g~~~aTP~F~g~~~~~i~~~L~~aG~~~~--Gke~lydG~TGe~~~~~IfvG~~YyqkL~HMV~DKiHARst 1272 (1386) .++.+|..|+++|||||+|++.++|.++|.++|+|.+ ||..||||||||+|+.|||||+|||+||+|||||||||||| T Consensus 1279 ~~A~~~~~G~~~atPVFdGa~~~~~~~~l~~ag~p~~~~GK~~LyDGRTGE~F~~pVtVGy~YMLKL~HLVDDK~HARSt 1358 (1449) T TIGR02013 1279 ELAKDLSKGVKIATPVFDGASEEEIKELLEKAGLPRDNSGKVVLYDGRTGEQFDNPVTVGYMYMLKLHHLVDDKIHARST 1358 (1449) T ss_pred CCHHHCCCCEEEECCCCCCCCHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCCCCEEEEEEEHHHHCCCHHHCCCCCCC T ss_conf 03011148854525611698879999999846879899831688227888706883166220011102340004243312 Q ss_pred CCCCCEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHH Q ss_conf 87311030799862331784320789999999869899998611100110159999887638787899786677899999 Q gi|254780143|r 1273 GSYSLVTQQPLGGKSNRGGQRLGEMEVWCIQAYGAAYVLQEMLTIKSDDVVGRTRVYESIVAGNDTFETGTPESFNVLVK 1352 (1386) Q Consensus 1273 GP~sllTrQP~eGRsr~GGlRfGEMErwaL~AyGAa~~LqE~Lt~kSDdv~gr~~~~~~iv~g~~~~~~~~pesf~vl~~ 1352 (1386) |||||||||||||||||||||||||||||||||||||+||||||||||||.||+|+|+|||||+++|+||||||||||+| T Consensus 1359 GPYSLVTQQPLGGKAQFGGQRFGEMEVWALEAYGAAy~LQE~LTVKSDDv~GR~K~YeAIVKGe~~~epGiPESFnVL~k 1438 (1449) T TIGR02013 1359 GPYSLVTQQPLGGKAQFGGQRFGEMEVWALEAYGAAYTLQEMLTVKSDDVTGRTKAYEAIVKGENLPEPGIPESFNVLIK 1438 (1449) T ss_pred CCCCCEEECCCCCEECCCCCCCHHHHHHHHHHHHHHHHHHHHCCEECCCCCCHHHHHHHHCCCCCCCCCCCCCCHHHHHH T ss_conf 88455243687620035884140688999999878999887413230566501565532104688765446841579999 Q ss_pred HHHHCCCCEEE Q ss_conf 99854200276 Q gi|254780143|r 1353 EMQALGLSIDL 1363 (1386) Q Consensus 1353 El~~l~l~~~~ 1363 (1386) |||||||||++ T Consensus 1439 ElqsLgLdi~~ 1449 (1449) T TIGR02013 1439 ELQSLGLDIEI 1449 (1449) T ss_pred HHHHHCCCCCC T ss_conf 99760354459 No 2 >PRK09603 DNA-directed RNA polymerase subunit beta/beta'; Reviewed Probab=100.00 E-value=0 Score=3176.59 Aligned_cols=1347 Identities=44% Similarity=0.757 Sum_probs=1256.7 Q ss_pred CCCEEECCCHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCCCCCEEEEEEEEEECCCCCC Q ss_conf 18633210131046667887088999999999862466522111225899976637807679858999997898098588 Q gi|254780143|r 8 NGLGRVRKFFGKNPEIIDIPDLIEVQKASYDHFLMMNIAPDERPNEGLQAAFKSVFPITAFSGAAMLEFVSYEFDPPKFD 87 (1386) Q Consensus 8 ~~~~~~R~~f~k~~~~~~~P~Li~iQ~~Sf~~Flq~~~~~~~r~~~GL~~v~~~~fPI~d~~~~~~Lef~~y~l~~Pk~t 87 (1386) +=|||+|+||||+++++++|||+++|++||+|||| +.+|+++||+++|+++|||+|+++++.|||++|++++|+|+ T Consensus 6 ~~~~r~R~~f~k~~~~~~~PnLi~iQ~~Sy~~Fl~----~~~~~~~Gl~~vf~~ifPI~~~~~~~~lef~~y~~~~pky~ 81 (2890) T PRK09603 6 PLKNRLRADFTKTPTDLEVPNLLLLQRDSYDSFLY----SKDGKESGIEKVFKSIFPIQDEHNRITLEYAGCEFGKSKYT 81 (2890) T ss_pred CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHC----CCCCCCHHHHHHHHHHCCEECCCCCEEEEEECCCCCCCCCC T ss_conf 77665302514788778998551888999999826----88553525999987518827799988999841486899899 Q ss_pred HHHHHHCCCCEEEEEEEEEEEEEECCCCCCCCC-CCEEEEEEEEEEEECCEECCCCCEEECCEEEEEEEEECCCCCEEEC Q ss_conf 999998399753358999999993178766552-0001223678751000026896289868214686651227852120 Q gi|254780143|r 88 VDDCLWRDLTYAVPLKITLRLIVFDVDEFTGAK-SIKDIKEQSIYMGDLPLMTKDGTFVIKGIQRIVVSQLHRSPGIHFD 166 (1386) Q Consensus 88 p~ECRlR~lTYsapL~V~i~l~v~~~~~~~~~k-~~~~ike~~V~lG~IPiMt~~GyFIING~ERVIVsQl~RSPGVyf~ 166 (1386) ++||+.||+||||||||+++|++++.+.+++.+ .+++++||+||||+||+||++|||||||+||||||||||||||||. T Consensus 82 ~~Ec~~r~~tY~~pl~v~~rl~~~~~d~~tge~~~~k~ikeq~v~~gdiPlMT~~GtFIING~ERViVsQl~RSPGvyf~ 161 (2890) T PRK09603 82 VREAMERGITYSIPLKIKVRLILWEKDTKSGEKNGIKDIKEQSIFIREIPLMTERTSFIINGVERVVVNQLHRSPGVIFK 161 (2890) T ss_pred HHHHHHCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCEEEECCCCEEEEECCCCCCEEEEE T ss_conf 99998649914540799999998257767665056676614369960765327997699669855787633439922051 Q ss_pred CCCCCCCCCCCEEEEEEEECCCCCEEEEEECCCCEEEEEECCCCCHHHHHHHHHCCCCHHHHHHHHCCCEEEEECCCCCC Q ss_conf 23465157785679999811887236899758982999971878703998998809984799997387405773067310 Q gi|254780143|r 167 HDKGRASLSGKLLYACRIIPDQGLWMDIEFDSKDIIHVRIDRRRKVPVTSFLMALGMDSEEILSTFYPKIVYSQRGDFWC 246 (1386) Q Consensus 167 ~~k~k~~~s~k~~ysa~IIP~RGSwLe~e~d~kd~iyvrIdr~rKIPi~ilLrALG~ssdeIl~~f~~~~~~~~~~~~~~ 246 (1386) ++++++ .++|.+|+|+|||+|||||+||+|.||.+||||||+||||+|+|||||||+++||++.||.........+.+. T Consensus 162 ~~~~~~-~~~k~~y~a~iIP~rGsWle~e~d~kd~i~vrIDr~rK~p~t~lLrAlG~~~~~il~~f~~~~~~~~~~~~~~ 240 (2890) T PRK09603 162 EEESST-SLNKLIYTGQIIPDRGSWLYFEYDSKDVLYARINKRRKVPVTILFRAMDYQKQDIIKMFYPLVKVRYENDKYL 240 (2890) T ss_pred CCCCCC-CCCCEEEEEEEECCCCCCEEEEECCCCEEEEEEECCCCCCHHHHHHHCCCCHHHHHHHHHHHHHEECCCCCCC T ss_conf 455888-8885789999916997628989868986999981888612999998829998999998502221110366422 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCCCCEEECCCCHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCCCCCCCCCCCEEEEECCC Q ss_conf 23545666421012232112445845440221020799887876360001026677347410000136667769996234 Q gi|254780143|r 247 FPLSAADLMVGAKVSSSLVDIDTGEQVIESGKKLTSGLLKSLKEKGVKFLGITSDCLCGLYVAEDIVNGETGEIYIEAGD 326 (1386) Q Consensus 247 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~l~~~~~~~~~~d~~~gei~~~~~~ 326 (1386) ......+. +.....++.+.. +..++..++.+..+..+.+.+.++..+.+....+.+++++.++++.+ +++++++. T Consensus 241 ~~~~~~~~--~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~i~~~~--ei~~~~~~ 315 (2890) T PRK09603 241 IPFASLDA--NQRMEFDLKDPQ-GKLILLAGKKLTSRKIKELKENHLEWVEYPMDILLNRHLAEPVMVGK--EVLLDMLT 315 (2890) T ss_pred CCCCHHHC--CCCCCHHHHCCC-CCEEECCCCHHHHHHHHHHHHCCCCCCCCCHHHHHCCHHCCCCCCCH--HEEEECCC T ss_conf 33440110--453110000245-55222145302288887888615440115267652201011225701--21320233 Q ss_pred CCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHCCCC------------CCHHHHHHHHHHHHCCCCCCHHHHHHH Q ss_conf 59989999998656541024420244455200000110234------------588999999887605776310456888 Q gi|254780143|r 327 VIDEKSLEEIFHSEIRDIPILYVDSVNNNAYIRNTLVTDKN------------KDRKDALLDIYRVMRPGDVSTFSVAES 394 (1386) Q Consensus 327 ~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~i~~~~~~d~~------------~~~~eAl~~I~k~lr~~~~~~~~~~~~ 394 (1386) .++++.++.+.+.+...+.+.+.........+.+++..|.. .....|+..||+++||++|++.+.++. T Consensus 316 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~a~~~iy~~~rpgep~~~~~a~~ 395 (2890) T PRK09603 316 QLDKNKLEKIHDLGVQEFVIINDLALGHDASIIHSFSADSESLKLLKQTEKIDDENALAAIRIHKVMKPGDPVTTEVAKQ 395 (2890) T ss_pred CCCHHHHHHHHHCCCCEEEEECCCCCCCCCHHHCCCCCCCCHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHH T ss_conf 01588899998647633787403224555101012013530121101344556235678999998627999987899999 Q ss_pred HHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCCCCCHHHHHHHH Q ss_conf 86202453023334555577765420366776770661899888898887630487643440101533233335787898 Q gi|254780143|r 395 MFNFLFFDSDKYDLSTVGRVKMNMRLNLDTPDDVRHIRKEDIIAIIKILVDLRNGKGTIDDIDNLGNRRVRSVGEMLKNQ 474 (1386) Q Consensus 395 ~~~~~~~~~~~y~l~~vgr~~~n~~l~~~~~~~~~~Lt~~d~~~~i~~L~~l~~g~~~~DdiDhlgnkRvr~vgeLl~~~ 474 (1386) ++.++||++++|+|+.+||+++|++++++.+...++|++.|++++++||+++..|.+..||||||||||+|++|||+++| T Consensus 396 ~~~~lff~~~rydL~~vGR~k~N~kl~l~~~~~~~~Lt~~Dii~~i~yli~l~~g~g~~DDIDhLgNRRvRsVGELl~nQ 475 (2890) T PRK09603 396 FVKKLFFDPERYDLTMVGRMKMNHKLGLHVPDYITTLTHEDIITTVKYLMKIKNNQGKIDDRDHLGNRRIRAVGELLANE 475 (2890) T ss_pred HHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHH T ss_conf 99874268011333777678777531899886644458999999999987303389987766565666106299999999 Q ss_pred HHHHHHHHHHHHHHHHCCCCC--CCCCCCCCCCHHHHHHHHHHHCCCCCCEEECCCCCCHHHHHHCCCCCCCCCCCCCCC Q ss_conf 888889999887653113443--343521122202346555420266750354155421023220011223444443223 Q gi|254780143|r 475 YRLGLLRMERSIKERISSVDI--DSVMPQDLINAKPVVSAVCEFFCSSQLSQLEEHVNSLSRITHTRRLSALGQGGVARA 552 (1386) Q Consensus 475 fr~~l~rl~r~i~~~~~~~~~--~~~~~~~~in~~~i~~~i~~ff~t~~lsq~ld~~n~ls~lth~RR~s~lgpggl~r~ 552 (1386) ||+|+.||+|.++++|+..+. ..++|++++|++++++++++||++|||||||||+|||||||||||+||||||||+|+ T Consensus 476 ~riGL~Rmer~irerms~~~~~~~~~~p~~liN~kpi~a~ikeFFgssqLSQFMDQtNPLaElTHKRRlSaLGPGGL~Re 555 (2890) T PRK09603 476 LHSGLVKMQKTIKDKLTTMSGAFDSLMPHDLVNSKMITSTIMEFFMGGQLSQFMDQTNPLSEVTHKRRLSALGEGGLVKD 555 (2890) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHCCHHHHHHHHHHHCCCCCCHHCCCCCHHHHHHHCCCCCCCCCCCCCCC T ss_conf 99999999999998860200454656705410646799999998688963120215583666651341126389888854 Q ss_pred CCCCCCCCCCCCCEEEEEECCCCCCCCCEEEECEEEEEEECCCCCCCCEEEEEECCCCCCCEEECCHHHHCCEEEECCCC Q ss_conf 46655432211110444303566765210210001224424687644106986225534816642967737637961653 Q gi|254780143|r 553 RAGVEMRDVHPTHYGRICPAETSEGHNIGLVSSLTSFARVNAYGFIETPYRKVCDGKVTNDVVYLSAMEEENRYIAQANS 632 (1386) Q Consensus 553 ~~~~evR~ih~s~~GriCPieTPEG~n~GLv~~la~~a~in~~g~ie~py~~v~~~~~~~~i~~l~~~~e~~~~Ia~~~~ 632 (1386) ||+||||||||||||||||||||||||||||+|||+||+||++||++|||++|.++.+++++.|++|.+|+.+.||++++ T Consensus 556 rAgfEVRDVH~THYGRICPIETPEGqNiGLI~SLA~yA~Vn~~GfieTPy~kV~~g~vt~ei~yLsA~eE~~~~iA~a~~ 635 (2890) T PRK09603 556 RVGFEARDVHPTHYGRICPIETPEGQNIGLINTLSTFTRVNDLGFIEAPYKKVVDGKVVGETIYLTAIQEDSHIIAPAST 635 (2890) T ss_pred CCCCCHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHEEECCCCCCCCCEEEEECCEECCCEEEECCCCCCCCEEECCCE T ss_conf 57985312877523465877799865322487777657777898644872998587242533653650025727821554 Q ss_pred EECCCCCCCCCCEEECCCCCCCCCCHHHEEECCCCCCCEEEECCCCCCCHHHCCHHHHHHHHHHHHHHHCCCCCCCCEEC Q ss_conf 41146840222000000233332257872202367211023112333201101002211222344432101366541112 Q gi|254780143|r 633 SLDEDGSFTEELVFCRCAGEEILVPREKIDFIDASPKQVVSIAASLIPFLENDDSNRVLMGCNMQRQAVPLLKAEAPFVG 712 (1386) Q Consensus 633 ~l~~~~~~~~~~~~~r~~~~~~~~~~~~v~~~~i~p~~i~sv~aslIPflehdda~R~l~g~nm~rQav~l~~~~~~~v~ 712 (1386) .++.+|.+.+..+.+|+++++....+++++++|++|.|++|++||||||||||||||||||||||||||||+.+++|+|| T Consensus 636 ~~~~~g~~~~~~v~~R~~ge~~~~~~~~vd~~dvsp~Q~vSvaasLIPFLEhDDANRALMGSNMQRQAVPLl~~eaPiVg 715 (2890) T PRK09603 636 PIDEEGNILGDLIETRVEGEIVLNEKSKVTLMDLSSSMLVGVAASLIPFLEHDDANRALMGTNMQRQAVPLLRSDAPIVG 715 (2890) T ss_pred EECCCCCEEEEEEEEEECCCEEECCCCCCEEEEECCCEEECCCCEEEEEEEECCCCCCCCCCCCCHHHEEEECCCCCCCC T ss_conf 87559817233664257683530251000357505314653550467776316887401045555100330047886457 Q ss_pred CCCCHHHHHHCCCEEECCCCCCCCCCCCCCEEEECCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCC Q ss_conf 66201110106530102124434335655302521566556444663011146555445473224453044797720785 Q gi|254780143|r 713 TGMESVVAKSSGAAIVAKRAGIVEQVDAIRIVIRSVEGDLDPSTSGVDIYRLMKFQRSNQNTCVNQRPLVKVGDEVRRND 792 (1386) Q Consensus 713 tg~E~~~~~~s~~~i~a~~~g~v~~vd~~~i~i~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~q~p~V~~g~~~~~~~ 792 (1386) ||||.+++.||+.++.|..+|+|.|+|+.+|.|...... ...+.+|.+.+|.||||+||+||+|+|+.|+++++|| T Consensus 716 TG~E~~va~DSg~~i~a~~~GiVe~vDa~~I~i~~~~~~----~~~id~Y~l~k~~rSNq~TcinqrPiV~~G~~V~~G~ 791 (2890) T PRK09603 716 TGIEKIIARDSWGAIKANRAGVVEKIDSKNIYILGEGKE----EAYIDAYSLQKNLRTNQNTSFNQVPIVKVGDKVEAGQ 791 (2890) T ss_pred CCCHHEEEECCCEEEEECCCCEEEEEECCEEEEEECCCC----CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCEEECCC T ss_conf 776210555376289844795899998988799305888----6226898710025567874047777334798786381 Q ss_pred EECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEECCHHHHHCCCCCEEEEEEEEHHHHHCCCCCCCCCCCCCCCCHHH Q ss_conf 20355223578602223751555313554444420000134425873103466653112114788400246665468678 Q gi|254780143|r 793 IIADGPSTDLGDLALGRNMLVAFMPWHGYNFEDSMLISERMVSEDVFTSIHIEEFEVMARDTKLGPEEITRDIPNVSEEG 872 (1386) Q Consensus 793 ~l~~~~~~~~~el~~G~N~~VA~m~~~GYN~EDaiiin~~~v~rg~~~s~h~~~y~~~~~~~~~g~~~~~~~~~~~~~~~ 872 (1386) ++|||++|..||||+|+|++||||||.||||||||++|++++.+|+|||+|+++|++++|++++|+|++|+++|+++++. T Consensus 792 iiADGpst~~GELALGkNvlvAfmpW~GyNfEDAIliSErlvkeD~~TSiHIee~e~~~R~tklG~EeiTrdiPnv~e~~ 871 (2890) T PRK09603 792 IIADGPSMDRGELALGKNVRVAFMPWNGYNFEDAIVVSERITKDDIFTSTHIYEKEVDARELKHGVEEFTADIPDVKEEA 871 (2890) T ss_pred EECCCCCCCCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCHHHHHCCCCCCCHHH T ss_conf 86268677577101575218997288997765246653433033567662353230033123554144405688868787 Q ss_pred HHHCCCCCCCCCCCEECCCCCEEECCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCH Q ss_conf 41024137413773313676111012467777677034665430355554432332022788532200121014565420 Q gi|254780143|r 873 LKNIDECGIICVGAEVNPGDILVGKITPKGESPMTPEEKLLRAIFGEKAVDVRDTSLRVPSGVSGTVVDVRIFNRHGIDK 952 (1386) Q Consensus 873 ~~~ld~~Giv~~G~~V~~gDilvgk~tp~~~~~~~pe~~~l~~i~~~~~~~~~d~~~~~~~g~~g~v~~~~~~~r~~~~~ 952 (1386) +++||++|||++|++|++|||||||+||+++++++||++|++||||+++.+++|+|+++|+|.+|+|+|+++|+|.+.++ T Consensus 872 l~~LDe~GIi~iGa~V~~gDILVGKvtPkget~~tpeekLLraifGeka~~v~dtSLrvp~g~eG~VIdv~~f~r~g~~k 951 (2890) T PRK09603 872 LAHLDESGIVKVGTYVSAGMILVGKTSPKGEIKSTPEERLLRAIFGDKAGHVVNKSLYCPPSLEGTVIDVKVFTKKGYEK 951 (2890) T ss_pred HHCCCCCCCEEECCEECCCCEEEEEECCCCCCCCCHHHHHHHHHHCCCHHHHHHHHCCCCCCCCCEEEEEEEEECCCCCH T ss_conf 63555037988621754688899843678878778688987765178366675754546899987699999974246421 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC----CCCCCCCCCCCCCHHHHHCCCHHHHHEEE-EC Q ss_conf 26678888999999987288899988788898888862585345----66555554321135764014854310014-15 Q gi|254780143|r 953 NERSISVEREQIELLARDKDDEQVILDRNIYSRLMEILCGQNAV----SGPKGFKKSTVLSSDLISEYPRSQWWQFA-VQ 1027 (1386) Q Consensus 953 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 1027 (1386) +.+...++.++...+..+..++..++......++...+.+.... .++....+|.....+.+...+...+.... .. T Consensus 952 ~~r~~~~~~~e~~~l~~~~~d~~~~~~~~~~~rl~~~~~~~kl~~d~~~~~~~~~~G~~i~~~~~~~i~~~~~~~~~~~~ 1031 (2890) T PRK09603 952 DARVLSAYEEEKAKLDMEHFDRLTMLNREELLRVSSLLSQAILEEPFSHNGKDYKEGDQIPKEEIASINRFTLASLVKKY 1031 (2890) T ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHCCCCCHHHHHHHHHCC T ss_conf 35566667889887410056665310177787764211123235544347754344541032120456734543321001 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCC Q ss_conf 86789999999998999999999998988763125886774727699999987558855231433667873588863000 Q gi|254780143|r 1028 DEKVQRNVESLKVQYETSKSILEDRFKNKIEKIQWGDDMPPGVLRVVKVFVAMKRPIQSGDKMAGRHGNKGIVSRILPCE 1107 (1386) Q Consensus 1028 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~ki~~~~~~~~gv~~~vKV~ir~~R~p~iGDKfasRHGqKGVis~i~p~e 1107 (1386) ....+..+..+..+|.++...+...+.++..++..+|+++|||+++||||||++|+||+|||||||||||||||+|+|+| T Consensus 1032 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~k~~~~~~~d~l~~Gv~k~VKV~Ia~kRkiQvGDKmAGRHGNKGVIs~Ilp~E 1111 (2890) T PRK09603 1032 SKEVQNHYEITKNNFLEQKKVLGEEHEEKLSILEKDDILPNGVIKKVKLYIATKRKLKVGDKMAGRHGNKGIVSNIVPVA 1111 (2890) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCEEEEECCCC T ss_conf 25565689987777888888876666553020346776787872799999965169988412466688863677675601 Q ss_pred CCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCC-----CCC Q ss_conf 078793587156986689867507089999999999998719622433343222210246777877631543-----221 Q gi|254780143|r 1108 DMPFLKDGTPVDIVLNPLGVPSRMNVGQIFETHLGWACVGLGKKIKSLINDYKANGDISPLRSFLEKVIGTG-----SHT 1182 (1386) Q Consensus 1108 DMPf~~dG~~pDIIlNPhgvPSRMtIGqllE~~lGka~~~~G~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~-----~~~ 1182 (1386) |||||+||||||||||||||||||||||||||||||||..+|+++..++|+.... .+...+.++.+++... ... T Consensus 1112 DMPfl~DGtpvDIILNPLGVPSRMNVGQIlE~hLG~Aa~~lG~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 1190 (2890) T PRK09603 1112 DMPYTADGEPVDIVLNPLGVPSRMNIGQILEMHLGLVGKEFGKQIASMLEDKTKD-FAKELRAKMLEIANAINEKDPLTI 1190 (2890) T ss_pred CCCCCCCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCHHHHHHHHHHCHH-HHHHHHHHHHHHHHCCCCCCCCCH T ss_conf 4997989966427689898876540999999889999986332444333310002-456666666655301354553201 Q ss_pred CCCCCCCHHHHHHHHHHHCCCCEECCCCCCCCCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCEEEEEEEEEECCCH Q ss_conf 00001352567776653026840025655788999999999986868998699986898840268504884645410110 Q gi|254780143|r 1183 EKISDYDDDSVLRVAEQWKSGVPVSTPVFDGADEEAINSMLRMADLDESGQSILYDGLTGEPFDRPVTVGYIYMLKLNHM 1262 (1386) Q Consensus 1183 ~~~~~~~~~~~~~~~~~~~~g~~~aTP~F~g~~~~~i~~~L~~aG~~~~Gke~lydG~TGe~~~~~IfvG~~YyqkL~HM 1262 (1386) ..+.++++++++.++++|+.|++||||||+|+++++|.++|..||++.+||++||||||||+|+++|||||||||||+|| T Consensus 1191 ~~v~~~~~~~ll~~~~~~~~G~~~atPvFdg~~~~~i~~~l~~ag~~~~Gk~~LyDGrTGe~fd~~VtVG~~YmlKL~Hm 1270 (2890) T PRK09603 1191 HALENCSDEELLEYAKDWSKGVKMAIPVFEGISQEKFYKLFELAKIAMDGKMDLYDGRTGEKMRERVNVGYMYMIKLHHL 1270 (2890) T ss_pred HHHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCEEEEHHHHHCCCHH T ss_conf 22211366889987665405973137766887889999999972999999889788998880578667856565325122 Q ss_pred HHHHCCCCCCCCCCCEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCCCC Q ss_conf 21100002368731103079986233178432078999999986989999861110011015999988763878789978 Q gi|254780143|r 1263 VSDKVYARSTGSYSLVTQQPLGGKSNRGGQRLGEMEVWCIQAYGAAYVLQEMLTIKSDDVVGRTRVYESIVAGNDTFETG 1342 (1386) Q Consensus 1263 V~DKiHARstGP~sllTrQP~eGRsr~GGlRfGEMErwaL~AyGAa~~LqE~Lt~kSDdv~gr~~~~~~iv~g~~~~~~~ 1342 (1386) ||||||||||||||+|||||||||||+||||||||||||||||||||+||||||||||||.||+++|+|||||+++|+|| T Consensus 1271 VDDKIHARS~GPYSLvTQQPLgGKAq~GGQRFGEMEVWALEAYGAAytLQE~LTvKSDDv~GR~k~yeaIvkG~~~~~pg 1350 (2890) T PRK09603 1271 VDEKVHARSTGPYSLVTHQPVGGKALFGGQRFGEMEVWALEAYGAAHTLKEMLTIKSDDIRGRENAYRAIAKGEQVGESE 1350 (2890) T ss_pred HCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCCC T ss_conf 20263244768652421588885136788314689999988989999998885037413415899999875889898987 Q ss_pred CCHHHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 667789999999854200276406654 Q gi|254780143|r 1343 TPESFNVLVKEMQALGLSIDLENSRTK 1369 (1386) Q Consensus 1343 ~pesf~vl~~El~~l~l~~~~~~~~~~ 1369 (1386) +|||||||+||||||||||+|+.++-. T Consensus 1351 iPESF~VL~kElqsL~Ldv~~~~~~~~ 1377 (2890) T PRK09603 1351 IPETFYVLTKELQSLALDINIFGDDVD 1377 (2890) T ss_pred CCCCHHHHHHHHHHCCCCEEECCCCCC T ss_conf 895299999998757125354355410 No 3 >PRK00405 rpoB DNA-directed RNA polymerase subunit beta; Reviewed Probab=100.00 E-value=0 Score=2886.55 Aligned_cols=1127 Identities=64% Similarity=1.053 Sum_probs=1053.9 Q ss_pred CCEEECCCEEECCCHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCCCCCEEEEEEEEEEC Q ss_conf 63134186332101310466678870889999999998624665221112258999766378076798589999978980 Q gi|254780143|r 3 KGVVFNGLGRVRKFFGKNPEIIDIPDLIEVQKASYDHFLMMNIAPDERPNEGLQAAFKSVFPITAFSGAAMLEFVSYEFD 82 (1386) Q Consensus 3 ~~~~~~~~~~~R~~f~k~~~~~~~P~Li~iQ~~Sf~~Flq~~~~~~~r~~~GL~~v~~~~fPI~d~~~~~~Lef~~y~l~ 82 (1386) +.++++||||+|++|||+++.+++|||+++|++|||||||.+++++ .+||+++|+++|||+|+++++.|+|.+|+|+ T Consensus 1 ~~~~~~~kk~~R~~F~ki~~~l~~PdLv~iQ~dSF~~FLq~~~~~~---~~GL~~v~~~ifPI~d~~~~~~Lef~~y~lg 77 (1127) T PRK00405 1 MAYSYTGKKRIRKSFGKIKEVLELPNLLEIQLDSYDWFLQEDVPPE---KKGLEEVFRSIFPIEDFNGNLELEFVSYRLG 77 (1127) T ss_pred CCCCCCCCCEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCH---HHHHHHHHHHCCCEECCCCCEEEEEEEEEEC T ss_conf 9644356622455720466578898878999999999973777404---5679999976498088899889999889958 Q ss_pred CCCCCHHHHHHCCCCEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCEECCCCCEEECCEEEEEEEEECCCCC Q ss_conf 98588999998399753358999999993178766552000122367875100002689628986821468665122785 Q gi|254780143|r 83 PPKFDVDDCLWRDLTYAVPLKITLRLIVFDVDEFTGAKSIKDIKEQSIYMGDLPLMTKDGTFVIKGIQRIVVSQLHRSPG 162 (1386) Q Consensus 83 ~Pk~tp~ECRlR~lTYsapL~V~i~l~v~~~~~~~~~k~~~~ike~~V~lG~IPiMt~~GyFIING~ERVIVsQl~RSPG 162 (1386) +|++||+|||+||+||||||||+++++.. .++++++|+||||+||+||++|||||||+|||||+||||||| T Consensus 78 ~Pk~tp~ECRlRdlTYsApLyV~i~l~~~---------~~~eikeq~V~iG~IPiMT~~GyFIINGsERVIVsQl~RSPG 148 (1127) T PRK00405 78 EPKYDVEECKERGLTYSAPLRVKLRLINK---------ETGEIKEQEVYMGDIPLMTENGTFIINGTERVIVSQLHRSPG 148 (1127) T ss_pred CCCCCHHHHHHCCCCEEEEEEEEEEEEEC---------CCCEEEEEEEEEECCCEECCCEEEEECCCCEEEEEEECCCCC T ss_conf 87799999985199342128999999988---------744699988998325529988289987873789998157896 Q ss_pred EEECCCCCCCCCCCCEEEEEEEECCCCCEEEEEECCCCEEEEEECCCCCHHHHHHHHHCCCCHHHHHHHHCCCEEEEECC Q ss_conf 21202346515778567999981188723689975898299997187870399899880998479999738740577306 Q gi|254780143|r 163 IHFDHDKGRASLSGKLLYACRIIPDQGLWMDIEFDSKDIIHVRIDRRRKVPVTSFLMALGMDSEEILSTFYPKIVYSQRG 242 (1386) Q Consensus 163 Vyf~~~k~k~~~s~k~~ysa~IIP~RGSwLe~e~d~kd~iyvrIdr~rKIPi~ilLrALG~ssdeIl~~f~~~~~~~~~~ 242 (1386) |||.++..++ ++++.+|+|+|||+|||||+||+|+|+.+||||||+||||+++||||||++.+||+++||......... T Consensus 149 Vyf~~~~~k~-~s~k~~ysa~IIP~RGSWLe~E~D~K~~i~vrIdrkkKIPitilLrALGlSd~eIl~~f~~~~~~~~~~ 227 (1127) T PRK00405 149 VYFDHDKDKT-SSGKLLYSARIIPYRGSWLEFEFDPKDILYVRIDRRRKLPVTVLLRALGYSDEEILDLFYEKVTFGIDE 227 (1127) T ss_pred EEEEECCCCC-CCCCEEEEEEEECCCCCCEEEEECCCCEEEEEECCCCCEEHHHHHHHCCCCHHHHHHHHCCCHHHHHHH T ss_conf 2686067777-778578999993588763899986898599998798855499999871999899998754501220124 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEECCCCHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCCCCCCCCCCCEEEE Q ss_conf 73102354566642101223211244584544022102079988787636000102667734741000013666776999 Q gi|254780143|r 243 DFWCFPLSAADLMVGAKVSSSLVDIDTGEQVIESGKKLTSGLLKSLKEKGVKFLGITSDCLCGLYVAEDIVNGETGEIYI 322 (1386) Q Consensus 243 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~l~~~~~~~~~~d~~~gei~~ 322 (1386) ..... ........ ...++.++++...+++.. T Consensus 228 ~~~~~-----------------------------~~~~~~~~--------------------~~~~~~d~i~~~~~ei~~ 258 (1127) T PRK00405 228 VIVEA-----------------------------GRRITARH--------------------IRQLAKDIVDEDTGEVIA 258 (1127) T ss_pred HHHHH-----------------------------CCHHHHHH--------------------HHHHHHHHCCHHHCHHHH T ss_conf 55541-----------------------------00345788--------------------887777631610110214 Q ss_pred ECCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCC Q ss_conf 62345998999999865654102442024445520000011023458899999988760577631045688886202453 Q gi|254780143|r 323 EAGDVIDEKSLEEIFHSEIRDIPILYVDSVNNNAYIRNTLVTDKNKDRKDALLDIYRVMRPGDVSTFSVAESMFNFLFFD 402 (1386) Q Consensus 323 ~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~i~~~~~~d~~~~~~eAl~~I~k~lr~~~~~~~~~~~~~~~~~~~~ 402 (1386) .......... .............++.+++.++...+.++|+.+||+++||+++++.+.|+.++.+.||+ T Consensus 259 ~~~~~~~~~~-----------~~~~~~~~~~~~~~i~~tl~kd~~~t~~eAL~~Iyk~lRpgep~t~e~A~~~l~~lFFn 327 (1127) T PRK00405 259 EANDEITEGI-----------KEIETLYTNDLDHYISNTLEKDPTSSREEALVEIYRRLRPGEPPTVEAARSLLENLFFD 327 (1127) T ss_pred HHHHHHHHHH-----------HHEEECCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCC T ss_conf 2123554221-----------00000034303589999886446899999999999970799998478999999985378 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 02333455557776542036677677066189988889888763048764344010153323333578789888888999 Q gi|254780143|r 403 SDKYDLSTVGRVKMNMRLNLDTPDDVRHIRKEDIIAIIKILVDLRNGKGTIDDIDNLGNRRVRSVGEMLKNQYRLGLLRM 482 (1386) Q Consensus 403 ~~~y~l~~vgr~~~n~~l~~~~~~~~~~Lt~~d~~~~i~~L~~l~~g~~~~DdiDhlgnkRvr~vgeLl~~~fr~~l~rl 482 (1386) +++|+|+.+||+++|++++++.+...++|++.|+++++++|+++..|.+.+||+|||||||++++||||++||+.+|.|| T Consensus 328 pkrYDLg~VGR~klNkKL~l~~~~~~~~Lt~~Di~~~i~~Li~l~~g~~~~DDiDhlgNKRvr~aGeLL~~~fr~~l~rl 407 (1127) T PRK00405 328 PKRYDLSRVGRYKFNKKLGLDEDEGVRVLTKEDIIAVIKYLINLRNGKGEVDDIDHLGNRRVRSVGELLENQFRIGLARM 407 (1127) T ss_pred CCCCCHHHHHHHHHHHHCCCCCCCCCCEECHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEECCHHHHHHHHHHHHHHHH T ss_conf 10012556546661441178877566685299999999988888608998765200146233237799999999999999 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCEEECCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 98876531134433435211222023465554202667503541554210232200112234444432234665543221 Q gi|254780143|r 483 ERSIKERISSVDIDSVMPQDLINAKPVVSAVCEFFCSSQLSQLEEHVNSLSRITHTRRLSALGQGGVARARAGVEMRDVH 562 (1386) Q Consensus 483 ~r~i~~~~~~~~~~~~~~~~~in~~~i~~~i~~ff~t~~lsq~ld~~n~ls~lth~RR~s~lgpggl~r~~~~~evR~ih 562 (1386) +|.++++|...+...++|++++|++++++++++||++|||||||||+||||+|||+||+||||||||+|++++||||+|| T Consensus 408 ~k~iker~~~~~~~~~~~~~li~~~~i~~~i~~ffg~~~lSQfmdqtNpLs~ltH~RRlSaLGpGGLtR~~a~fevR~lH 487 (1127) T PRK00405 408 ERAVKERMSLQDLDTLTPQDLINAKPVSAAIKEFFGSSQLSQFMDQTNPLSELTHKRRLSALGPGGLTRERAGFEVRDVH 487 (1127) T ss_pred HHHHHHHHHCCCHHHCCHHHHHCHHHHHHHHHHHHCCCCHHHHHHCCCHHHHHHHHHEECCCCCCCCCCCCCCCEEEECC T ss_conf 99998876202421066565425288999999876646232556524737765311034255787655455786432024 Q ss_pred CCCEEEEEECCCCCCCCCEEEECEEEEEEECCCCCCCCEEEEEECCCCCCCEEECCHHHHCCEEEECCCCEECCCCCCCC Q ss_conf 11104443035667652102100012244246876441069862255348166429677376379616534114684022 Q gi|254780143|r 563 PTHYGRICPAETSEGHNIGLVSSLTSFARVNAYGFIETPYRKVCDGKVTNDVVYLSAMEEENRYIAQANSSLDEDGSFTE 642 (1386) Q Consensus 563 ~s~~GriCPieTPEG~n~GLv~~la~~a~in~~g~ie~py~~v~~~~~~~~i~~l~~~~e~~~~Ia~~~~~l~~~~~~~~ 642 (1386) |||||||||+|||||+|||||+|||+||+||.+||+++||++|.++.++++++|++|.+|+.+.||+++..+..++.+.+ T Consensus 488 pSh~GriCPieTPEG~n~GLVknLA~~a~in~~GfietP~~~v~~g~~~~~i~yl~a~eE~~~~ia~~~~~~~~~g~~~~ 567 (1127) T PRK00405 488 PTHYGRICPIETPEGPNIGLINSLATYARVNEYGFIETPYRKVVDGKVTDEIEYLTADEEDNYVIAQANAPLDEDGRFVD 567 (1127) T ss_pred CHHCCCCCCCCCCCCCCEEEEECEEEEEECCCCCEEECCCEEEECCEEECCEEEECHHHCCCCEEEECCCCCCCCCCCCC T ss_conf 10023316753899874301301022664066772313546774774505458957677277069834752877886635 Q ss_pred CCEEECCCCCCCCCCHHHEEECCCCCCCEEEECCCCCCCHHHCCHHHHHHHHHHHHHHHCCCCCCCCEECCCCCHHHHHH Q ss_conf 20000002333322578722023672110231123332011010022112223444321013665411126620111010 Q gi|254780143|r 643 ELVFCRCAGEEILVPREKIDFIDASPKQVVSIAASLIPFLENDDSNRVLMGCNMQRQAVPLLKAEAPFVGTGMESVVAKS 722 (1386) Q Consensus 643 ~~~~~r~~~~~~~~~~~~v~~~~i~p~~i~sv~aslIPflehdda~R~l~g~nm~rQav~l~~~~~~~v~tg~E~~~~~~ 722 (1386) ..+.+|+++++..+.+++++|+|++|.|++|++||+|||||||||||||||||||||||||+.+++|+||||+|..++.| T Consensus 568 ~~~~~r~~~e~~~~~~~~i~~~disp~qi~sv~aslIPFleHnda~RalmgsnMqrQAvpLl~~e~p~VgTG~E~~~~~d 647 (1127) T PRK00405 568 ELVTARYKGEFILVPPEEVDYMDVSPKQVVSVAASLIPFLEHDDANRALMGSNMQRQAVPLLRPEAPLVGTGMERRVARD 647 (1127) T ss_pred CEEEEEECCCCEEECHHHEEEEECCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCEECCCHHHHHHHH T ss_conf 51577873740575557826883475607788633355543576215666555443213445677861033177787764 Q ss_pred CCCEEECCCCCCCCCCCCCCEEEECCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCEECCCCCCCC Q ss_conf 65301021244343356553025215665564446630111465554454732244530447977207852035522357 Q gi|254780143|r 723 SGAAIVAKRAGIVEQVDAIRIVIRSVEGDLDPSTSGVDIYRLMKFQRSNQNTCVNQRPLVKVGDEVRRNDIIADGPSTDL 802 (1386) Q Consensus 723 s~~~i~a~~~g~v~~vd~~~i~i~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~q~p~V~~g~~~~~~~~l~~~~~~~~ 802 (1386) |+.++.|..+|+|.|+|+++|.|..+.. .........+|+|.+|.++||+||+||||+|+.|+++++++++||+++|.+ T Consensus 648 s~~~~~a~~~G~v~yvd~~~I~i~~~~~-~~~~~~~~~~y~l~~y~~sNq~T~~~Q~PlV~~~~~v~~~~~iad~~~~~~ 726 (1127) T PRK00405 648 SGAVVVAKRDGVVEYVDASRIVVRVNDD-LAGDEGGVDIYNLTKFTRSNQNTCINQRPIVKVGDRVEKGDVLADGPSTDN 726 (1127) T ss_pred HCCEEEECCCCEEEEEECCEEEEEECCC-CCCCCCCEEEEECCCCCCCCCCCCCCEEEEEECCCEEECCCEECCCCCCCC T ss_conf 1704785479559997478899982654-567777637996131366666753564037733988833878146763216 Q ss_pred CCCCCCCCCEEEEEECCCCCCCCCEECCHHHHHCCCCCEEEEEEEEHHHHHCCCCCCCCCCCCCCCCHHHHHHCCCCCCC Q ss_conf 86022237515553135544444200001344258731034666531121147884002466654686784102413741 Q gi|254780143|r 803 GDLALGRNMLVAFMPWHGYNFEDSMLISERMVSEDVFTSIHIEEFEVMARDTKLGPEEITRDIPNVSEEGLKNIDECGII 882 (1386) Q Consensus 803 ~el~~G~N~~VA~m~~~GYN~EDaiiin~~~v~rg~~~s~h~~~y~~~~~~~~~g~~~~~~~~~~~~~~~~~~ld~~Giv 882 (1386) ||||+|+|++||||||+|||||||||||+++++||+|||+|+++|++++++++.|.|++|+++|+++.+.+++||+|||+ T Consensus 727 ~el~~G~N~~Va~m~~~GYn~EDaiiin~~~v~~~~f~s~~~~~y~~~~~~~~~g~e~it~~ip~~~~~~~~~Ld~~Gi~ 806 (1127) T PRK00405 727 GELALGQNVLVAFMPWNGYNFEDAILISERLVKEDVFTSIHIEEYEIEARDTKLGPEEITRDIPNVSEEALRNLDESGIV 806 (1127) T ss_pred CCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCEEEEEEEEEEEEEEECCCCCCEECCCCCCCCHHHHCCCCCCCCC T ss_conf 80356633168773567736446455546777437515888999876753067896201247999887871015556863 Q ss_pred CCCCEECCCCCEEECCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 37733136761110124677776770346654303555544323320227885322001210145654202667888899 Q gi|254780143|r 883 CVGAEVNPGDILVGKITPKGESPMTPEEKLLRAIFGEKAVDVRDTSLRVPSGVSGTVVDVRIFNRHGIDKNERSISVERE 962 (1386) Q Consensus 883 ~~G~~V~~gDilvgk~tp~~~~~~~pe~~~l~~i~~~~~~~~~d~~~~~~~g~~g~v~~~~~~~r~~~~~~~~~~~~~~~ 962 (1386) .+|++|++|||||||+||+++++++||+++++||||+++.+++|+|+++|+|++|+|+++.++++. T Consensus 807 ~~G~~v~~gdilvgK~~P~~~~~~~pe~kll~~ifg~~~~~~~d~sl~~~~~~~G~v~~v~~~~~~-------------- 872 (1127) T PRK00405 807 RIGAEVKPGDILVGKVTPKGETELTPEEKLLRAIFGEKARDVKDTSLRVPHGEEGTVIDVKVFTRE-------------- 872 (1127) T ss_pred CCCCEECCCCEEEEEECCCCCCCCCHHHHHHHHHHCCCCCCCEECEEECCCCCCEEEEEEEEEECC-------------- T ss_conf 689787589889998458876777878876777744446751068064389995799999999677-------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHCCCHHHHHEEEECCHHHHHHHHHHHHHH Q ss_conf 99999872888999887888988888625853456655555432113576401485431001415867899999999989 Q gi|254780143|r 963 QIELLARDKDDEQVILDRNIYSRLMEILCGQNAVSGPKGFKKSTVLSSDLISEYPRSQWWQFAVQDEKVQRNVESLKVQY 1042 (1386) Q Consensus 963 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1042 (1386) T Consensus 873 -------------------------------------------------------------------------------- 872 (1127) T PRK00405 873 -------------------------------------------------------------------------------- 872 (1127) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCEEEEEE Q ss_conf 99999999998988763125886774727699999987558855231433667873588863000078793587156986 Q gi|254780143|r 1043 ETSKSILEDRFKNKIEKIQWGDDMPPGVLRVVKVFVAMKRPIQSGDKMAGRHGNKGIVSRILPCEDMPFLKDGTPVDIVL 1122 (1386) Q Consensus 1043 ~~~~~~~~~~~~~k~~ki~~~~~~~~gv~~~vKV~ir~~R~p~iGDKfasRHGqKGVis~i~p~eDMPf~~dG~~pDIIl 1122 (1386) .+|++++|++++|||+||++|+||||||||||||||||||+|+||||||||+||+|||||| T Consensus 873 -------------------~~~~l~~~~~~~vkv~i~~~R~~~iGDK~asRHGqKGvi~~i~~~eDMPf~~dG~~pDii~ 933 (1127) T PRK00405 873 -------------------QGDELPPGVNKLVKVYIAQKRKIQVGDKMAGRHGNKGVISKILPVEDMPYLEDGTPVDIVL 933 (1127) T ss_pred -------------------CCCCCCCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCEEEE T ss_conf -------------------7764675763699999986327887730001357772555451420299688998763878 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCC Q ss_conf 68986750708999999999999871962243334322221024677787763154322100001352567776653026 Q gi|254780143|r 1123 NPLGVPSRMNVGQIFETHLGWACVGLGKKIKSLINDYKANGDISPLRSFLEKVIGTGSHTEKISDYDDDSVLRVAEQWKS 1202 (1386) Q Consensus 1123 NPhgvPSRMtIGqllE~~lGka~~~~G~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1202 (1386) ||||||||||||||+||++||||+.+|..++ T Consensus 934 NPhg~PSRMtvGql~E~~~G~a~~~~g~~~~------------------------------------------------- 964 (1127) T PRK00405 934 NPLGVPSRMNIGQILETHLGWAAKGLGKKIK------------------------------------------------- 964 (1127) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHCCCCCC------------------------------------------------- T ss_conf 8876776562999999998899874554457------------------------------------------------- Q ss_pred CCEECCCCCCCCCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCEEEEEEEEEECCCHHHHHCCCCCCCCCCCEEECC Q ss_conf 84002565578899999999998686899869998689884026850488464541011021100002368731103079 Q gi|254780143|r 1203 GVPVSTPVFDGADEEAINSMLRMADLDESGQSILYDGLTGEPFDRPVTVGYIYMLKLNHMVSDKVYARSTGSYSLVTQQP 1282 (1386) Q Consensus 1203 g~~~aTP~F~g~~~~~i~~~L~~aG~~~~Gke~lydG~TGe~~~~~IfvG~~YyqkL~HMV~DKiHARstGP~sllTrQP 1282 (1386) |++||||+|+|+++++|++.|.++||+++||++||||+||++|+++||+||+|||||+|||+|||||||||||++||||| T Consensus 965 ~~~~~tp~F~~~~~~~i~~~L~~~g~~~~gk~~l~~G~TG~~~~~~i~~G~~yy~kL~HmV~DKihaRs~Gp~~~lTrQP 1044 (1127) T PRK00405 965 GVPFATPVFDGAKEEEIKELLEEAGLPEDGKTVLYDGRTGEPFDRPVTVGYMYMLKLHHLVDDKIHARSTGPYSLVTQQP 1044 (1127) T ss_pred CCEECCCCCCCCCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCEEEEHHHHHHHHHHHCCCCCCCCCCCCCCEECCC T ss_conf 82334677799789999999998596998888987899888536748871578654544421030004638985520489 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCCEE Q ss_conf 98623317843207899999998698999986111001101599998876387878997866778999999985420027 Q gi|254780143|r 1283 LGGKSNRGGQRLGEMEVWCIQAYGAAYVLQEMLTIKSDDVVGRTRVYESIVAGNDTFETGTPESFNVLVKEMQALGLSID 1362 (1386) Q Consensus 1283 ~eGRsr~GGlRfGEMErwaL~AyGAa~~LqE~Lt~kSDdv~gr~~~~~~iv~g~~~~~~~~pesf~vl~~El~~l~l~~~ 1362 (1386) +|||||+|||||||||||||||||||++||||||+|||||+||+++|+|||+|+++++||+|||||||+||||||||+++ T Consensus 1045 ~~Grs~~GG~R~GEME~~~l~a~Gas~~L~E~L~~~SDd~~gr~~~~~~i~~g~~~~~~~~p~sfklL~~El~~l~l~~~ 1124 (1127) T PRK00405 1045 LGGKAQFGGQRFGEMEVWALEAYGAAYTLQEMLTVKSDDVVGRTKVYEAIVKGENIPEPGIPESFNVLVKELQSLGLDVE 1124 (1127) T ss_pred CCCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCEE T ss_conf 99865779840236689999998799999998425674401068998766468878899998227799999986757829 Q ss_pred EEE Q ss_conf 640 Q gi|254780143|r 1363 LEN 1365 (1386) Q Consensus 1363 ~~~ 1365 (1386) |++ T Consensus 1125 l~~ 1127 (1127) T PRK00405 1125 LLD 1127 (1127) T ss_pred ECC T ss_conf 619 No 4 >CHL00001 rpoB RNA polymerase beta subunit Probab=100.00 E-value=0 Score=2638.78 Aligned_cols=1040 Identities=43% Similarity=0.724 Sum_probs=964.5 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCCCCCEEEEEE--EEEECCCCCCHHHHHHCCCCE Q ss_conf 666788708899999999986246652211122589997663780767985899999--789809858899999839975 Q gi|254780143|r 21 PEIIDIPDLIEVQKASYDHFLMMNIAPDERPNEGLQAAFKSVFPITAFSGAAMLEFV--SYEFDPPKFDVDDCLWRDLTY 98 (1386) Q Consensus 21 ~~~~~~P~Li~iQ~~Sf~~Flq~~~~~~~r~~~GL~~v~~~~fPI~d~~~~~~Lef~--~y~l~~Pk~tp~ECRlR~lTY 98 (1386) .+...+|||+++|++|||||| ++||+++|++++||+|++++++++|. +|++++|+++|+|||+||+|| T Consensus 2 ~~~~~~PdLv~~Q~dSFn~Fl----------~~gL~ei~~~~~pI~d~~~~~~l~~~~~~y~l~~P~~t~~Ecr~r~lTY 71 (1065) T CHL00001 2 EGMSTIPDFLQIQFESFCRFI----------DQGLTEELSKFPKIEDTDQEIEFQLFGETYQLVEPLIKERDAVYESLTY 71 (1065) T ss_pred CCCCCCCCHHHHHHHHHHHHH----------HHHHHHHHHHCCCEECCCCCEEEEEEECEEEECCCCCCHHHHHHHCCCC T ss_conf 884679977999999999999----------9748999985598784799889999705799848968999998619931 Q ss_pred EEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCEECCCCCEEECCEEEEEEEEECCCCCEEECCCCCCCCCCCCE Q ss_conf 33589999999931787665520001223678751000026896289868214686651227852120234651577856 Q gi|254780143|r 99 AVPLKITLRLIVFDVDEFTGAKSIKDIKEQSIYMGDLPLMTKDGTFVIKGIQRIVVSQLHRSPGIHFDHDKGRASLSGKL 178 (1386) Q Consensus 99 sapL~V~i~l~v~~~~~~~~~k~~~~ike~~V~lG~IPiMt~~GyFIING~ERVIVsQl~RSPGVyf~~~k~k~~~s~k~ 178 (1386) ||||||+++++.. ..+++++++||||+||+||++|||||||+|||||+|++|||||||..+.++ +++. T Consensus 72 saplyV~v~l~~~---------~~~~ik~~~V~iG~iPiMt~~GyFIING~ERVIV~Qi~rsPGiyf~~~~d~---~~~~ 139 (1065) T CHL00001 72 SSELYVPAGLIWK---------KSRDIQEQTIFIGNIPLMTSLGTFIINGIYRVVINQILRSPGIYYRSELDH---NGIS 139 (1065) T ss_pred CCCEEEEEEEEEC---------CCCEEEEEEEEEEEECEECCCCEEEECCCEEEEEEEEECCCCCEEEEEECC---CCCE T ss_conf 1018999999987---------774799999999873779999789990852489898614898378766358---9967 Q ss_pred EEEEEEECCCCCEEEEEECCCCEEEEEECCCCCHHHHHHHHHCCCCHHHHHHHHCCCEEEEECCCCCCCCCCHHHHHHHH Q ss_conf 79999811887236899758982999971878703998998809984799997387405773067310235456664210 Q gi|254780143|r 179 LYACRIIPDQGLWMDIEFDSKDIIHVRIDRRRKVPVTSFLMALGMDSEEILSTFYPKIVYSQRGDFWCFPLSAADLMVGA 258 (1386) Q Consensus 179 ~ysa~IIP~RGSwLe~e~d~kd~iyvrIdr~rKIPi~ilLrALG~ssdeIl~~f~~~~~~~~~~~~~~~~~~~~~~~~~~ 258 (1386) .|+|+|||+||+|++||+|+|+.+|+|||+++|||+++||||||++.+||++.++.... . T Consensus 140 ~~~a~ii~~rG~wl~~e~~~k~~i~vri~k~~kIPi~ilLrALG~sd~eI~~~i~~~e~-----------------~--- 199 (1065) T CHL00001 140 VYTGTIISDWGGRLKLEIDKKARIWARVSKKQKISILVLLSAMGLNLREILDNVCYPEI-----------------F--- 199 (1065) T ss_pred EEEEEEEECCCCEEEEEECCCCEEEEEECCCCCEEHHHHHHHHCCCHHHHHHHHCCHHH-----------------H--- T ss_conf 99999996898649999837986999937988340999998869799999988356688-----------------8--- Q ss_pred HHHHHHCCCCCCCEEECCCCHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHH Q ss_conf 12232112445845440221020799887876360001026677347410000136667769996234599899999986 Q gi|254780143|r 259 KVSSSLVDIDTGEQVIESGKKLTSGLLKSLKEKGVKFLGITSDCLCGLYVAEDIVNGETGEIYIEAGDVIDEKSLEEIFH 338 (1386) Q Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~l~~~~~~~~~~d~~~gei~~~~~~~~~~~~l~~~~~ 338 (1386) T Consensus 200 -------------------------------------------------------------------------------- 199 (1065) T CHL00001 200 -------------------------------------------------------------------------------- 199 (1065) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred CCCCCEEEEEECCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHH--CCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 5654102442024445520000011023458899999988760--57763104568888620245302333455557776 Q gi|254780143|r 339 SEIRDIPILYVDSVNNNAYIRNTLVTDKNKDRKDALLDIYRVM--RPGDVSTFSVAESMFNFLFFDSDKYDLSTVGRVKM 416 (1386) Q Consensus 339 ~~~~~~~~l~~~~~~~~~~i~~~~~~d~~~~~~eAl~~I~k~l--r~~~~~~~~~~~~~~~~~~~~~~~y~l~~vgr~~~ 416 (1386) .....++...++++|+.++|+++ +++++.+...++.++...||+ ++|+|+.+||+++ T Consensus 200 --------------------~~~~~~~~i~t~e~Al~ei~~~~~~~~~~~~~~~~a~~~l~~~ff~-~rydLg~vGR~kl 258 (1065) T CHL00001 200 --------------------LSFKEKKKIGSKENAILEFYKQFACVGGDPVFSESLCKELQKKFFQ-QRCELGRIGRRNI 258 (1065) T ss_pred --------------------HHHHHHCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHH-HHCCCHHHHHHHH T ss_conf --------------------8776540458889999999986325789862168999999987644-3046113416776 Q ss_pred HHHCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCC- Q ss_conf 5420366776770661899888898887630487643440101533233335787898888889999887653113443- Q gi|254780143|r 417 NMRLNLDTPDDVRHIRKEDIIAIIKILVDLRNGKGTIDDIDNLGNRRVRSVGEMLKNQYRLGLLRMERSIKERISSVDI- 495 (1386) Q Consensus 417 n~~l~~~~~~~~~~Lt~~d~~~~i~~L~~l~~g~~~~DdiDhlgnkRvr~vgeLl~~~fr~~l~rl~r~i~~~~~~~~~- 495 (1386) |++++++.+...++|++.|+++++++|+.+..|.+.+||+|||||||++++||||++||+++|.||++.++++|..... T Consensus 259 n~kl~l~i~~~~~~L~~~dil~~i~~Li~l~~g~~~~DDiDhlgNKRlrlaGeLL~~~fr~~l~rl~r~ik~~m~~~~~~ 338 (1065) T CHL00001 259 NKKLNLDIPQNNTFLLPQDILAAADYLIGLKFGMGTLDDIDHLKNKRIRSVADLLQDQFGLALNRLENVVRETICGAIRH 338 (1065) T ss_pred HHHHCCCCCCCCEEECHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 66306898843117639999999999998650899876501205844355899999999999999999999999776424 Q ss_pred -CCCCCCCCCCHHHHHHHHHHHCCCCCCEEECCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCC Q ss_conf -3435211222023465554202667503541554210232200112234444432234665543221111044430356 Q gi|254780143|r 496 -DSVMPQDLINAKPVVSAVCEFFCSSQLSQLEEHVNSLSRITHTRRLSALGQGGVARARAGVEMRDVHPTHYGRICPAET 574 (1386) Q Consensus 496 -~~~~~~~~in~~~i~~~i~~ff~t~~lsq~ld~~n~ls~lth~RR~s~lgpggl~r~~~~~evR~ih~s~~GriCPieT 574 (1386) ...+|.++++++++++.+++||++|+|||||||+||||+|||+||+|+|||||++|++|+|+||+|||||||||||+|| T Consensus 339 ~~~~~~~~li~~~~it~~ik~ffgs~~LSQfmdqtNpLseltHkRRlSalGPGGltr~~a~fevR~lHpShwGriCPiET 418 (1065) T CHL00001 339 KLIPTPQNLVTSTPLTTTYEEFFGSHPLSQFLDQTNPLTEIVHKRKLSYLGPGGLTRRTASFRVRDIHPSHYGRICPIET 418 (1065) T ss_pred CCCCCHHHHCCCHHHHHHHHHHHCCCCHHHHHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCC T ss_conf 54478888427478999999987566431443425888887766653235887655455676100036654630224558 Q ss_pred CCCCCCEEEECEEEEEEECCCCCCCCEEEEEECCCCCCC--EEECCHHHHCCEEEECCCCEECCCCCCCCCCEEECCCCC Q ss_conf 676521021000122442468764410698622553481--664296773763796165341146840222000000233 Q gi|254780143|r 575 SEGHNIGLVSSLTSFARVNAYGFIETPYRKVCDGKVTND--VVYLSAMEEENRYIAQANSSLDEDGSFTEELVFCRCAGE 652 (1386) Q Consensus 575 PEG~n~GLv~~la~~a~in~~g~ie~py~~v~~~~~~~~--i~~l~~~~e~~~~Ia~~~~~l~~~~~~~~~~~~~r~~~~ 652 (1386) |||+|||||+|||+||+||.+||+|+||++|.++.+..+ +.|+++.+|+.+.||+++......+.+....+.+|.+++ T Consensus 419 PEG~n~GLVknLAl~a~I~~~G~ietP~~~v~~~~~~~~~~~~~l~~~~e~~~~ia~~~~~~~~~~~~~~~~~~~r~~~e 498 (1065) T CHL00001 419 SEGINAGLIGSLAIHARINHWGFLESPFYKISEGKVSKEERMVYLSPGRDEYYMIAAGNSLALNQGIQEEQVVPARYRQE 498 (1065) T ss_pred CCCCCCCHHHHHHHEEECCCCCCCCCCCEEEECCEEECCCEEEEECCCCCCCEEEECCCEEECCCCCCCCCEEEEECCCC T ss_conf 88765421432211220367775578858920570642562898465222763880274345257826354021001465 Q ss_pred CCCCCHHHEEECCCCCCCEEEECCCCCCCHHHCCHHHHHHHHHHHHHHHCCCCCCCCEECCCCCHHHHHHCCCEEECCCC Q ss_conf 33225787220236721102311233320110100221122234443210136654111266201110106530102124 Q gi|254780143|r 653 EILVPREKIDFIDASPKQVVSIAASLIPFLENDDSNRVLMGCNMQRQAVPLLKAEAPFVGTGMESVVAKSSGAAIVAKRA 732 (1386) Q Consensus 653 ~~~~~~~~v~~~~i~p~~i~sv~aslIPflehdda~R~l~g~nm~rQav~l~~~~~~~v~tg~E~~~~~~s~~~i~a~~~ 732 (1386) +..+.++++++++++|.|++|++||+|||+||||||||+|||||||||||++.+++|+||||+|.+++.||+.++.|..+ T Consensus 499 ~~~~~~~~v~~~~i~p~q~~sv~aslIPflehnda~R~lmgsnMqrQavpll~~e~p~Vgtg~e~~~a~ds~~~~~a~~~ 578 (1065) T CHL00001 499 FLTIAWEQVHFRSIFPFQYFSIGASLIPFLEHNDANRALMGSNMQRQAVPLLRSEKCIVGTGLERQVALDSGVVAIAEHE 578 (1065) T ss_pred CEEECCCEEEEEECCCCEEEEEEEEEECCHHCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHCCCCEEEECCC T ss_conf 04503222578860566476433224210321873233333320355011013656501267254443204736884268 Q ss_pred CCCCCCCCCCEEEECCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCEECCCCCCCCCCCCCCCCCE Q ss_conf 43433565530252156655644466301114655544547322445304479772078520355223578602223751 Q gi|254780143|r 733 GIVEQVDAIRIVIRSVEGDLDPSTSGVDIYRLMKFQRSNQNTCVNQRPLVKVGDEVRRNDIIADGPSTDLGDLALGRNML 812 (1386) Q Consensus 733 g~v~~vd~~~i~i~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~q~p~V~~g~~~~~~~~l~~~~~~~~~el~~G~N~~ 812 (1386) |+|.|+|+++|.+..... ....|+|.+|.||||+||++|+|+|+.|++++++|++||+++|..||||+|+|++ T Consensus 579 G~v~y~d~~~i~~~~~~~-------~~~~y~l~~~~rsn~~t~~~q~P~V~~g~~v~~g~ilad~~~t~~~el~~G~N~~ 651 (1065) T CHL00001 579 GKIIYVDADKIILSGNKG-------DTLSIPLVKYQRSNQNTCMHQKPIVWRGECIKKGQILADGAATVGGELALGKNVL 651 (1065) T ss_pred CEEEEEECCEEEEECCCC-------CEEEEEECCCCCCCCCCCCCCCEEEECCCEEECCCEECCCCHHHCCEECCCCCEE T ss_conf 628885076689963788-------5578752566668755542512056248677128374356321078501221037 Q ss_pred EEEEECCCCCCCCCEECCHHHHHCCCCCEEEEEEEEHHHHHCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCEECCCC Q ss_conf 55531355444442000013442587310346665311211478840024666546867841024137413773313676 Q gi|254780143|r 813 VAFMPWHGYNFEDSMLISERMVSEDVFTSIHIEEFEVMARDTKLGPEEITRDIPNVSEEGLKNIDECGIICVGAEVNPGD 892 (1386) Q Consensus 813 VA~m~~~GYN~EDaiiin~~~v~rg~~~s~h~~~y~~~~~~~~~g~~~~~~~~~~~~~~~~~~ld~~Giv~~G~~V~~gD 892 (1386) ||||||+|||||||||||+++|+||+|||+|+++|++++++++.|+|++++++|++++..+++||+|||+++|++|++|| T Consensus 652 VA~m~~~GYN~EDAiiin~~~v~~~~ftSih~~~ye~~~~~~~~g~e~it~~ip~~~~~~~~~LD~dGii~~G~~V~~gD 731 (1065) T CHL00001 652 VAYMPWEGYNFEDAVLISERLVYEDIYTSIHIEKYEIETRVTSQGPERITKEIPHLEAHLLRNLDENGIVMLGSWVETGD 731 (1065) T ss_pred EEEECCCCCCHHHHEEECHHHHHCCCEEEEEEEEEEEEEEECCCCCCEECCCCCCCCHHHHHHCCCCCCCCCCCEEECCC T ss_conf 89834576026681330224543575059999998899984478874313669997778885486567707887884797 Q ss_pred CEEECCCCC--CCCCCCHHHHHHHHHCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 111012467--777677034665430355554432332022788532200121014565420266788889999999872 Q gi|254780143|r 893 ILVGKITPK--GESPMTPEEKLLRAIFGEKAVDVRDTSLRVPSGVSGTVVDVRIFNRHGIDKNERSISVEREQIELLARD 970 (1386) Q Consensus 893 ilvgk~tp~--~~~~~~pe~~~l~~i~~~~~~~~~d~~~~~~~g~~g~v~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~ 970 (1386) |||||+||+ .+++.+||+++++||||+++.+++|+|+++|+|++|+|+|++++++...+ T Consensus 732 ilvgK~tp~~~~~~~~~pe~~ll~~ifg~k~~~~kd~Sl~~~~g~~G~Vidv~~~~~~~~~------------------- 792 (1065) T CHL00001 732 ILVGKLTPQEAEESSYAPEGKLLRAIFGIQVSTSKETCLKLPIGGRGRVIDVRWIQKKGGS------------------- 792 (1065) T ss_pred EEEEEECCCCCCCCCCCHHHHHHHHHHCCCCCCEEECEEECCCCCCEEEEEEEEEECCCCC------------------- T ss_conf 7999745778775555757766777634666741014275369985689999999757888------------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHCCCHHHHHEEEECCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 88899988788898888862585345665555543211357640148543100141586789999999998999999999 Q gi|254780143|r 971 KDDEQVILDRNIYSRLMEILCGQNAVSGPKGFKKSTVLSSDLISEYPRSQWWQFAVQDEKVQRNVESLKVQYETSKSILE 1050 (1386) Q Consensus 971 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1050 (1386) T Consensus 793 -------------------------------------------------------------------------------- 792 (1065) T CHL00001 793 -------------------------------------------------------------------------------- 792 (1065) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred HHHHHHHHHHCCCCCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCEEEEEECCCCCCCC Q ss_conf 99898876312588677472769999998755885523143366787358886300007879358715698668986750 Q gi|254780143|r 1051 DRFKNKIEKIQWGDDMPPGVLRVVKVFVAMKRPIQSGDKMAGRHGNKGIVSRILPCEDMPFLKDGTPVDIVLNPLGVPSR 1130 (1386) Q Consensus 1051 ~~~~~k~~ki~~~~~~~~gv~~~vKV~ir~~R~p~iGDKfasRHGqKGVis~i~p~eDMPf~~dG~~pDIIlNPhgvPSR 1130 (1386) .+. .++|||+|||+|+||||||||||||||||||+|+||||||||+||||||||||||||||| T Consensus 793 --------------~~~---~~~vkv~i~~~R~p~iGDKfasRHGqKGvig~I~p~eDMPf~~dG~~pDiIlNPhgvPSR 855 (1065) T CHL00001 793 --------------SYN---PETIRVYILQKRKIQVGDKVAGRHGNKGIISKILPRQDMPYLQDGTPVDMVLNPLGVPSR 855 (1065) T ss_pred --------------CCC---CEEEEEEEEECCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCEEECCCCCCCC T ss_conf --------------767---569999997405897553102145777554566043029979899875087888867666 Q ss_pred CCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEECCCC Q ss_conf 70899999999999987196224333432222102467778776315432210000135256777665302684002565 Q gi|254780143|r 1131 MNVGQIFETHLGWACVGLGKKIKSLINDYKANGDISPLRSFLEKVIGTGSHTEKISDYDDDSVLRVAEQWKSGVPVSTPV 1210 (1386) Q Consensus 1131 MtIGqllE~~lGka~~~~G~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~aTP~ 1210 (1386) |||||||||||||||+.+|.++++|||++.+..++++.+.+ T Consensus 856 MtvGqllE~~lG~a~~~~g~~~~~tpF~~~~~~e~s~~li~--------------------------------------- 896 (1065) T CHL00001 856 MNVGQIFECLLGLAGDLLNRHYRIAPFDERYEQEASRKLVF--------------------------------------- 896 (1065) T ss_pred CCHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHH--------------------------------------- T ss_conf 71999999998888874387044467766420345677889--------------------------------------- Q ss_pred CCCCCHHHHHHHHHHC------CCCCCCCEEEECCCCCCCCCCCEEEEEEEEEECCCHHHHHCCCCCCCCCCCEEECCCC Q ss_conf 5788999999999986------8689986999868988402685048846454101102110000236873110307998 Q gi|254780143|r 1211 FDGADEEAINSMLRMA------DLDESGQSILYDGLTGEPFDRPVTVGYIYMLKLNHMVSDKVYARSTGSYSLVTQQPLG 1284 (1386) Q Consensus 1211 F~g~~~~~i~~~L~~a------G~~~~Gke~lydG~TGe~~~~~IfvG~~YyqkL~HMV~DKiHARstGP~sllTrQP~e 1284 (1386) +.+.+...++ +++++||++||||+|||+|+++|||||+|||||+|||+||||||||||||+||||||| T Consensus 897 ------~~l~~~~~~~~~~~~f~~~~~Gk~~lydG~TGe~~~~~I~vG~~YyqkL~HmV~DKiHaRs~GP~s~lTrQP~~ 970 (1065) T CHL00001 897 ------SELYEASKQTANPWLFEPEYPGKSRLFDGRTGDPFEQPVTIGKAYILKLIHQVDDKIHARSTGPYSLVTQQPLR 970 (1065) T ss_pred ------HHHHHHHHHHCCCCCCCCCCCCCEEEECCCCCCCCCCCEEEEHHHHHHHHHHHCCCCCCCCCCCCCCEECCCCC T ss_conf ------99998665406652246689998798889988814673887277865434452436513663999531028999 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCCC-CCCHHHHHHHHHHHHCCCCEEE Q ss_conf 623317843207899999998698999986111001101599998876387878997-8667789999999854200276 Q gi|254780143|r 1285 GKSNRGGQRLGEMEVWCIQAYGAAYVLQEMLTIKSDDVVGRTRVYESIVAGNDTFET-GTPESFNVLVKEMQALGLSIDL 1363 (1386) Q Consensus 1285 GRsr~GGlRfGEMErwaL~AyGAa~~LqE~Lt~kSDdv~gr~~~~~~iv~g~~~~~~-~~pesf~vl~~El~~l~l~~~~ 1363 (1386) ||||+||||||||||||||||||||+||||||+|||||.||+++|+|||||+++|+| |+|||||||++|||||||||++ T Consensus 971 Grsr~GG~R~GEME~~aL~ayGAa~~L~E~L~~kSDd~~~r~~~~~~i~~g~~~~~~~g~pesf~vl~~El~~l~l~~~~ 1050 (1065) T CHL00001 971 GRAKQGGQRVGEMEVWALEGFGAAYILQEMLTYKSDHIRARQEVLGAIIKGGTIPKPEGAPESFRLLVRELRSLALELNH 1050 (1065) T ss_pred CCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEE T ss_conf 74467986004788899999889999999844547463426688888766998888899995176899999854205167 Q ss_pred EECCCCCC Q ss_conf 40665431 Q gi|254780143|r 1364 ENSRTKNS 1371 (1386) Q Consensus 1364 ~~~~~~~~ 1371 (1386) +..+.++. T Consensus 1051 ~~~~~~~~ 1058 (1065) T CHL00001 1051 FLVSEKNF 1058 (1065) T ss_pred EEECCCCC T ss_conf 77145455 No 5 >COG0085 RpoB DNA-directed RNA polymerase, beta subunit/140 kD subunit [Transcription] Probab=100.00 E-value=0 Score=2590.67 Aligned_cols=1041 Identities=58% Similarity=0.933 Sum_probs=955.6 Q ss_pred CEEECCCEEECCCHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCCCCCEEEEEEEEEECC Q ss_conf 31341863321013104666788708899999999986246652211122589997663780767985899999789809 Q gi|254780143|r 4 GVVFNGLGRVRKFFGKNPEIIDIPDLIEVQKASYDHFLMMNIAPDERPNEGLQAAFKSVFPITAFSGAAMLEFVSYEFDP 83 (1386) Q Consensus 4 ~~~~~~~~~~R~~f~k~~~~~~~P~Li~iQ~~Sf~~Flq~~~~~~~r~~~GL~~v~~~~fPI~d~~~~~~Lef~~y~l~~ 83 (1386) ..++++|+|+|++|+++++.+++|||+++|++|||||+ ++|||++|+++|||++++++++|+|++|++++ T Consensus 2 ~~~~~~~~r~r~~~~~i~~~~~~p~Lv~iQldSyn~ff----------~~gLq~i~~~i~PI~~~~~~~~Lef~~~~l~~ 71 (1060) T COG0085 2 VDSYTGKKRIRDSFGKIPEFLDLPNLVEIQLDSYNAFF----------LEGLQEVFREIFPIESYNGNTELEYGSYRLGE 71 (1060) T ss_pred CCCCCCCCCCCCCHHHCCCCCCCCCHHHHHHHHHHHHH----------HHHHHHHHHHCCCCCCCCCCEEEEEEEEEECC T ss_conf 64210123101366543100578357888888999999----------98899999731862268997899997788368 Q ss_pred C-CCCHHHHHHCCCCEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCEECCCCCEEECCEEEEEEEEECCCCC Q ss_conf 8-588999998399753358999999993178766552000122367875100002689628986821468665122785 Q gi|254780143|r 84 P-KFDVDDCLWRDLTYAVPLKITLRLIVFDVDEFTGAKSIKDIKEQSIYMGDLPLMTKDGTFVIKGIQRIVVSQLHRSPG 162 (1386) Q Consensus 84 P-k~tp~ECRlR~lTYsapL~V~i~l~v~~~~~~~~~k~~~~ike~~V~lG~IPiMt~~GyFIING~ERVIVsQl~RSPG 162 (1386) | +++|+|||+||+||||||||+++|++++.++. +++++||||+||+||++|||||||+|||||||+||||| T Consensus 72 p~k~~p~EcR~R~lTYsapLyv~lrlv~~~~~e~--------~k~~eV~iG~iPlMt~gGyFIINGsERVIVsQ~~rSPg 143 (1060) T COG0085 72 PPKFYPEECRLRGLTYSAPLYVKLRLVVNETGEE--------IKEQEVYMGDIPLMTRGGYFIINGTERVIVSQEHRSPG 143 (1060) T ss_pred CCCCCHHHHHHCCCCCCCCEEEEEEEEECCCCCC--------CCCCEEEECCCCEECCCCEEEECCEEEEEEEEEEECCC T ss_conf 8778989998508821442699999997776643--------43014995467676378679987837899987730698 Q ss_pred EEECCCCCCCCCCCCEEEEEEEECCCCCEEEEEECCCCEEEEEECCCCCHHHHHHHHHCCCCHH-HHHHHHCCCEEEEEC Q ss_conf 2120234651577856799998118872368997589829999718787039989988099847-999973874057730 Q gi|254780143|r 163 IHFDHDKGRASLSGKLLYACRIIPDQGLWMDIEFDSKDIIHVRIDRRRKVPVTSFLMALGMDSE-EILSTFYPKIVYSQR 241 (1386) Q Consensus 163 Vyf~~~k~k~~~s~k~~ysa~IIP~RGSwLe~e~d~kd~iyvrIdr~rKIPi~ilLrALG~ssd-eIl~~f~~~~~~~~~ 241 (1386) +||.++..|++. +..|+|++||+||+|++|++|+|+.+|++||+++|||+++||||||+++| ||++.|+........ T Consensus 144 vif~~~~~k~~~--~~~~~a~vIp~rG~wl~~e~d~~d~~~~rId~~rkiPvtilLRALG~~sDeeI~~~~~~~~~~~~~ 221 (1060) T COG0085 144 VIFVEKKDKTGS--KVLYVARVIPYRGSWLEFEFDPKDNLYVRIDRKRKIPVTILLRALGLETDEEIIEAFGGDELTDLV 221 (1060) T ss_pred EEEECCCCCCCC--CEEEEEEEECCCCCEEEEEECCCCEEEEEEEEEEEEEHHHHHHHHCCCCHHHHHHHHCCCCCCCCC T ss_conf 487314566775--157899990577643899983576078871067883287879870798579999985342123555 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEECCCCHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCCCCCCCCCCCEEE Q ss_conf 67310235456664210122321124458454402210207998878763600010266773474100001366677699 Q gi|254780143|r 242 GDFWCFPLSAADLMVGAKVSSSLVDIDTGEQVIESGKKLTSGLLKSLKEKGVKFLGITSDCLCGLYVAEDIVNGETGEIY 321 (1386) Q Consensus 242 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~l~~~~~~~~~~d~~~gei~ 321 (1386) . T Consensus 222 ~------------------------------------------------------------------------------- 222 (1060) T COG0085 222 P------------------------------------------------------------------------------- 222 (1060) T ss_pred C------------------------------------------------------------------------------- T ss_conf 2------------------------------------------------------------------------------- Q ss_pred EECCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCC Q ss_conf 96234599899999986565410244202444552000001102345889999998876057763104568888620245 Q gi|254780143|r 322 IEAGDVIDEKSLEEIFHSEIRDIPILYVDSVNNNAYIRNTLVTDKNKDRKDALLDIYRVMRPGDVSTFSVAESMFNFLFF 401 (1386) Q Consensus 322 ~~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~i~~~~~~d~~~~~~eAl~~I~k~lr~~~~~~~~~~~~~~~~~~~ 401 (1386) ....+++..+++... +.+++...+..++...++ T Consensus 223 ----------------------------------------------~~~~~~ll~~~~~~~-~~~i~~~~a~~~i~~r~~ 255 (1060) T COG0085 223 ----------------------------------------------PEGEEALLEIYEEAK-GEKITARNALELIGSRVF 255 (1060) T ss_pred ----------------------------------------------HHHHHHHHHHHHHCC-CCCCCHHHHHHHHHCCCC T ss_conf ----------------------------------------------278998688776414-788506899998754244 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCC----------CCC-CCEECHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCCCCCHHHH Q ss_conf 30233345555777654203667----------767-7066189988889888763048764344010153323333578 Q gi|254780143|r 402 DSDKYDLSTVGRVKMNMRLNLDT----------PDD-VRHIRKEDIIAIIKILVDLRNGKGTIDDIDNLGNRRVRSVGEM 470 (1386) Q Consensus 402 ~~~~y~l~~vgr~~~n~~l~~~~----------~~~-~~~Lt~~d~~~~i~~L~~l~~g~~~~DdiDhlgnkRvr~vgeL 470 (1386) ...+|+ ...||++.+..+.... ... .++.+..|+++++++|+++..|.+..||+|||||||+|++||| T Consensus 256 ~~~~~~-~~~gR~k~~~~vl~~~~L~hL~~~~~~~~l~~~~k~~di~~mi~~l~~L~~G~~~~DDiDHlGNKRlrlvGeL 334 (1060) T COG0085 256 VVKRYD-AKEGRYKRAKYVLDKELLPHLGEAGERYDLSRVGKAKDIIAMIKYLIELRLGKGEEDDIDHLGNRRLRLVGEL 334 (1060) T ss_pred CCCCCC-CCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHH T ss_conf 111343-2001256655545566776634256644235566899999999999876428999888866565200308999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCC-CCCCCCCCCCHHHHHHHHHHHCCCCCCEEECCCCCCHHHHHHCCCCCCCCCCCC Q ss_conf 7898888889999887653113443-343521122202346555420266750354155421023220011223444443 Q gi|254780143|r 471 LKNQYRLGLLRMERSIKERISSVDI-DSVMPQDLINAKPVVSAVCEFFCSSQLSQLEEHVNSLSRITHTRRLSALGQGGV 549 (1386) Q Consensus 471 l~~~fr~~l~rl~r~i~~~~~~~~~-~~~~~~~~in~~~i~~~i~~ff~t~~lsq~ld~~n~ls~lth~RR~s~lgpggl 549 (1386) +++|||++|.||+|.++++|...+. ..++|++++|.+++.+.++.||+++||||||||+||||+||||||+||| || T Consensus 335 l~n~fR~gl~rmer~ikerm~~~~~r~~i~p~dlIn~k~~~Alitgffg~sqlSQfmDqtNpLSeLSHkRRIsAl---gL 411 (1060) T COG0085 335 LENLFRVGLSRMERDVKERLEKADKRDTLVPQDLINAKPIHALITGFFGRSQLSQFMDQTNPLSELSHKRRLSAL---GL 411 (1060) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCCHHHHHHHHHCCCCCCCEEEEEECCCHHHHHHHHHCEECC---CC T ss_conf 999999999999999999986443131157255425316888673352314331673247848764320002046---75 Q ss_pred CCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEEEECEEEEEEECCCCCCCCEEEEEECC-CCCCCEEECCHHHHCCEEEE Q ss_conf 22346655432211110444303566765210210001224424687644106986225-53481664296773763796 Q gi|254780143|r 550 ARARAGVEMRDVHPTHYGRICPAETSEGHNIGLVSSLTSFARVNAYGFIETPYRKVCDG-KVTNDVVYLSAMEEENRYIA 628 (1386) Q Consensus 550 ~r~~~~~evR~ih~s~~GriCPieTPEG~n~GLv~~la~~a~in~~g~ie~py~~v~~~-~~~~~i~~l~~~~e~~~~Ia 628 (1386) +|++|+||||||||||||||||+|||||+|||||+|||+||+||++||+++||++|.++ .++++++|+++.+++.+.+| T Consensus 412 sRera~fEvRDvHpTHyGRiCPiETPEGpNiGLI~sLA~~A~Vn~~Gf~etPy~kv~~g~~~v~~i~~l~~~~~~~~~~~ 491 (1060) T COG0085 412 SRERAGFEVRDVHPTHYGRICPIETPEGPNIGLIKSLALYARINEYGFLETPYRKVLDGSLVVDEIEYLSADEEDVYVIG 491 (1060) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHEEECCCCCCCCCEEEEEECCCCCCCEEEECCCCCCCEEEE T ss_conf 76679985001670003354788998986434788798672734668877746999705534333078524102508986 Q ss_pred CCCCEECCCCCCCCCCEEECCC--CCCCCCCHHHEEECCCCCCCEEEECCCCCCCHHHCCHHHHHHHHHHHHHHHCCCCC Q ss_conf 1653411468402220000002--33332257872202367211023112333201101002211222344432101366 Q gi|254780143|r 629 QANSSLDEDGSFTEELVFCRCA--GEEILVPREKIDFIDASPKQVVSIAASLIPFLENDDSNRVLMGCNMQRQAVPLLKA 706 (1386) Q Consensus 629 ~~~~~l~~~~~~~~~~~~~r~~--~~~~~~~~~~v~~~~i~p~~i~sv~aslIPflehdda~R~l~g~nm~rQav~l~~~ 706 (1386) ++++.++.++.+++..+.+|+. .+.....++.++|+|++|.|++++++++|||++|||++|++||+||||||+|++.+ T Consensus 492 ~~~~~~~~~~~~~~~~~~~R~~~~~~~~~~~~~~~~~~dv~~~~~~~~~~~~~P~l~~dd~~~~l~~~~mqr~~~~l~~~ 571 (1060) T COG0085 492 QANGTLDEPGELVEELVECRRGGSGEVSVADPEGVDYMDVSPKQVVSVGRSLIPFLEHDDANRALMGSNMQRQAVPLLRT 571 (1060) T ss_pred EEEEEECCCCCEEEEEEEEEECCCCCCEEECCCCCCEEECCCEEEECCCCCCCCEEEECCCHHHHHHHHHHHHCCCCCCC T ss_conf 75103568872420177775236665135567654202025448833543567616874856766431056531665466 Q ss_pred CCCEECCCCCHHHHHHCCCEEECCCCCCCCCCCCCCEEEECCCCCCCCC-CCCCEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 5411126620111010653010212443433565530252156655644-466301114655544547322445304479 Q gi|254780143|r 707 EAPFVGTGMESVVAKSSGAAIVAKRAGIVEQVDAIRIVIRSVEGDLDPS-TSGVDIYRLMKFQRSNQNTCVNQRPLVKVG 785 (1386) Q Consensus 707 ~~~~v~tg~E~~~~~~s~~~i~a~~~g~v~~vd~~~i~i~~~~~~~~~~-~~~~~~y~~~~~~~~~~~~~~~q~p~V~~g 785 (1386) ++|+|+||+|+.+|.+++.+++|.++|.+.++++..+.|.......... ......|++.++.++||+||++|+|+|+.| T Consensus 572 ~~~lv~tG~E~~~a~e~~~~~ia~~~~~~~~ve~~~~~I~~~~~~~~~~~~~n~~~~n~~~~~~~~Q~~~~~~~~~~~~~ 651 (1060) T COG0085 572 EAPLVGTGMEYLDAEDSGAAVIAKRPGVVTHVEISPIVILGIEASLIPYPEHNQSPYNLYKFARSNQATGINQRPLVKRG 651 (1060) T ss_pred CCCCCCCCCEEECCCCCCCEEEECCCCCEEEEEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCEECC T ss_conf 56401278544323435442676048937999520359996325666665568676788888664013477656502216 Q ss_pred CEEECCCEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEECCHHHHHCCCCCEEEEEEEEHHHHHCCCCCCCCCCCC Q ss_conf 77207852035522357860222375155531355444442000013442587310346665311211478840024666 Q gi|254780143|r 786 DEVRRNDIIADGPSTDLGDLALGRNMLVAFMPWHGYNFEDSMLISERMVSEDVFTSIHIEEFEVMARDTKLGPEEITRDI 865 (1386) Q Consensus 786 ~~~~~~~~l~~~~~~~~~el~~G~N~~VA~m~~~GYN~EDaiiin~~~v~rg~~~s~h~~~y~~~~~~~~~g~~~~~~~~ 865 (1386) |.+..+++++|+|+++.||||+|||++||||||+||||||||||||++|+||+|||+||++|++++|++++|+|++ +++ T Consensus 652 d~~~~~~~~~~~P~~~~~e~~~GqN~~VA~m~~~GYn~EDAiiin~~~v~~~~~ts~~~~~~~~~~r~~~~g~e~~-~~i 730 (1060) T COG0085 652 DTVEKGLVYADGPSVDTGELALGQNALVAFMPWNGYNYEDAIIISERSVERDLFTSIHIEEYETEARDTKLGPEEI-RDI 730 (1060) T ss_pred CCEECCCEECCCCCCCCCCCCCCCEEEEEEECCCCCCHHHHEECCCCHHHCCCCEEEEEEEEEEEEECCCCCCCCE-ECC T ss_conf 6300152636878656675658740399996536868666121010044347615999997766510258995304-038 Q ss_pred CCCCHHHHHHCCCCCCCCCCCEECCCCCEEECCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCEECCCCCCCCCCCCCCC Q ss_conf 54686784102413741377331367611101246777767703466543035555443233202278853220012101 Q gi|254780143|r 866 PNVSEEGLKNIDECGIICVGAEVNPGDILVGKITPKGESPMTPEEKLLRAIFGEKAVDVRDTSLRVPSGVSGTVVDVRIF 945 (1386) Q Consensus 866 ~~~~~~~~~~ld~~Giv~~G~~V~~gDilvgk~tp~~~~~~~pe~~~l~~i~~~~~~~~~d~~~~~~~g~~g~v~~~~~~ 945 (1386) |+++++++++||++||+++|++|++|||||||+||+++++.+||+++++ +|+++ ++|+|+++|+|+.|+|++|++| T Consensus 731 P~~~~~~~~~Lde~Gii~ig~~V~~gdilvgk~tP~~~~~~~~ee~ll~-i~~ek---~rdtsl~~~~g~~G~V~~V~~~ 806 (1060) T COG0085 731 PNVSEEALRNLDEDGIIRIGAEVKGGDILVGKVTPKGETELTPEERLLR-IFGEK---VRDTSLRVPHGEEGIVDDVQVF 806 (1060) T ss_pred CCCCHHHHHHCCCCCCCCCCCEECCCCEEEEEECCCCCCCCCCHHHHHC-CCCCE---EECCCEEECCCCCEEEEEEEEE T ss_conf 9968899840774576357558747988898647998776780243202-44530---2025046149997489999997 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHCCCHHHHHEEE Q ss_conf 45654202667888899999998728889998878889888886258534566555554321135764014854310014 Q gi|254780143|r 946 NRHGIDKNERSISVEREQIELLARDKDDEQVILDRNIYSRLMEILCGQNAVSGPKGFKKSTVLSSDLISEYPRSQWWQFA 1025 (1386) Q Consensus 946 ~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1025 (1386) ++.. T Consensus 807 ~~~~---------------------------------------------------------------------------- 810 (1060) T COG0085 807 TRED---------------------------------------------------------------------------- 810 (1060) T ss_pred ECCC---------------------------------------------------------------------------- T ss_conf 3467---------------------------------------------------------------------------- Q ss_pred ECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEC Q ss_conf 15867899999999989999999999989887631258867747276999999875588552314336678735888630 Q gi|254780143|r 1026 VQDEKVQRNVESLKVQYETSKSILEDRFKNKIEKIQWGDDMPPGVLRVVKVFVAMKRPIQSGDKMAGRHGNKGIVSRILP 1105 (1386) Q Consensus 1026 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~ki~~~~~~~~gv~~~vKV~ir~~R~p~iGDKfasRHGqKGVis~i~p 1105 (1386) -.||++++|||+||++|+||+|||||||||||||||+||| T Consensus 811 ----------------------------------------~~~g~~~~vkV~v~~~R~~~~GDK~a~RHG~KGVis~i~p 850 (1060) T COG0085 811 ----------------------------------------GDPGVNKLVKVYVAQKRKPQIGDKMAGRHGNKGVVSKIVP 850 (1060) T ss_pred ----------------------------------------CCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEECC T ss_conf ----------------------------------------8877428999999863267766523445788745444447 Q ss_pred CCCCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 00078793587156986689867507089999999999998719622433343222210246777877631543221000 Q gi|254780143|r 1106 CEDMPFLKDGTPVDIVLNPLGVPSRMNVGQIFETHLGWACVGLGKKIKSLINDYKANGDISPLRSFLEKVIGTGSHTEKI 1185 (1386) Q Consensus 1106 ~eDMPf~~dG~~pDIIlNPhgvPSRMtIGqllE~~lGka~~~~G~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1185 (1386) +||||||+||||||||||||||||||||||+|||||||||+++|.. T Consensus 851 ~eDMPf~~~G~~~DiilNP~gvPSRM~iGqilE~~lG~a~~~~G~~---------------------------------- 896 (1060) T COG0085 851 QEDMPFLEDGTPPDIILNPLGVPSRMNIGQILETHLGKAAALLGIP---------------------------------- 896 (1060) T ss_pred CCCCCCCCCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCE---------------------------------- T ss_conf 5668949889766478789878765519999999988999854961---------------------------------- Q ss_pred CCCCHHHHHHHHHHHCCCCEECCCCCCCCCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCEEEEEEEEEECCCHHHH Q ss_conf 01352567776653026840025655788999999999986868998699986898840268504884645410110211 Q gi|254780143|r 1186 SDYDDDSVLRVAEQWKSGVPVSTPVFDGADEEAINSMLRMADLDESGQSILYDGLTGEPFDRPVTVGYIYMLKLNHMVSD 1265 (1386) Q Consensus 1186 ~~~~~~~~~~~~~~~~~g~~~aTP~F~g~~~~~i~~~L~~aG~~~~Gke~lydG~TGe~~~~~IfvG~~YyqkL~HMV~D 1265 (1386) ++||+|+|+++++|+++|.+|||+++||++||||+|||+|+++||||+||||||+|||+| T Consensus 897 --------------------~~~~~F~g~~~e~~~~~l~~~g~~~~Gk~~lydG~TGe~~~~~i~vG~~Y~~kL~HmV~d 956 (1060) T COG0085 897 --------------------VDTPVFDGAPEEDIRELLKEAGFPYSGKEVLYDGRTGEPFDAPIFVGVMYYQKLHHMVDD 956 (1060) T ss_pred --------------------ECCCCCCCCCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCEEEEEHHHHHHHHHHCC T ss_conf --------------------024873788889999999974899889778445888870135179972477767766334 Q ss_pred HCCCCCCCCCCCEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCCCCCCH Q ss_conf 00002368731103079986233178432078999999986989999861110011015999988763878789978667 Q gi|254780143|r 1266 KVYARSTGSYSLVTQQPLGGKSNRGGQRLGEMEVWCIQAYGAAYVLQEMLTIKSDDVVGRTRVYESIVAGNDTFETGTPE 1345 (1386) Q Consensus 1266 KiHARstGP~sllTrQP~eGRsr~GGlRfGEMErwaL~AyGAa~~LqE~Lt~kSDdv~gr~~~~~~iv~g~~~~~~~~pe 1345 (1386) |||||||||||+|||||||||||+||||||||||||||||||||+||||||+||||+|||+++|+|||||++++++|+|| T Consensus 957 K~HaRs~GP~s~lT~QP~~Gka~~GG~RfGEME~~aL~ayGAa~~LqE~L~~~SD~~~G~~~~y~~~v~g~~~~~~~ip~ 1036 (1060) T COG0085 957 KIHARSTGPYSLVTQQPLGGKAQFGGQRFGEMEVWALEAYGAAYTLQERLTVKSDDVCGRIKIYECIVKGENIPEVGIPE 1036 (1060) T ss_pred CCEEECCCCCEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHCCCCHHHHCCCCCCCCCCCCCCCH T ss_conf 21010337854554178886324588131305889999875999999986414400066023102732577778888977 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 789999999854200276406654 Q gi|254780143|r 1346 SFNVLVKEMQALGLSIDLENSRTK 1369 (1386) Q Consensus 1346 sf~vl~~El~~l~l~~~~~~~~~~ 1369 (1386) |||||++|||||||+++|+.++.+ T Consensus 1037 sFk~L~~El~sl~i~~~l~~~~~~ 1060 (1060) T COG0085 1037 SFKVLLKELRSLGIDVRLELEDGK 1060 (1060) T ss_pred HHHHHHHHHHHCCCCEEEEECCCC T ss_conf 999999999977886498605789 No 6 >PRK08565 DNA-directed RNA polymerase subunit beta; Provisional Probab=100.00 E-value=0 Score=2012.33 Aligned_cols=937 Identities=28% Similarity=0.447 Sum_probs=760.6 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCCCCCEEEEEEEEEECCCCC----------CHHHHHHC Q ss_conf 88708899999999986246652211122589997663780767985899999789809858----------89999983 Q gi|254780143|r 25 DIPDLIEVQKASYDHFLMMNIAPDERPNEGLQAAFKSVFPITAFSGAAMLEFVSYEFDPPKF----------DVDDCLWR 94 (1386) Q Consensus 25 ~~P~Li~iQ~~Sf~~Flq~~~~~~~r~~~GL~~v~~~~fPI~d~~~~~~Lef~~y~l~~Pk~----------tp~ECRlR 94 (1386) .-..|+++|+||||||| ++||+++|++++||++..+++.|+|.+|++++|++ +|.|||+| T Consensus 10 ~~~gLv~~qidSFn~Fi----------~~gL~~i~~~~~~I~~~~~~~~l~~~~~~i~~P~~~e~~~~~~~l~P~ecR~r 79 (1101) T PRK08565 10 KEKGLVRQHLDSFNDFL----------DRGLQEIVDEFGEIKTEIPGLKIKFGKIRVGEPEIKEADGSERPITPMEARLR 79 (1101) T ss_pred CCCCCHHHHHHHHHHHH----------HHHHHHHHHHCCCEECCCCCEEEEEEEEEEECCEEECCCCCCCCCCHHHHHHC T ss_conf 89985799999999999----------97189999853975726998899998899808814436675576898999856 Q ss_pred CCCEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCEECCC------------------------CCEEECCEE Q ss_conf 99753358999999993178766552000122367875100002689------------------------628986821 Q gi|254780143|r 95 DLTYAVPLKITLRLIVFDVDEFTGAKSIKDIKEQSIYMGDLPLMTKD------------------------GTFVIKGIQ 150 (1386) Q Consensus 95 ~lTYsapL~V~i~l~v~~~~~~~~~k~~~~ike~~V~lG~IPiMt~~------------------------GyFIING~E 150 (1386) ++||||||+|++++++.+ .+.++++|++|+|||||+| |||||||+| T Consensus 80 ~~TYs~~l~v~v~~~~~~----------~~~~~~~v~iG~iPiMv~S~~C~L~~~~~~el~~~gE~~~d~GGyFIInG~E 149 (1101) T PRK08565 80 NLTYAAPLYLEMTLVEDG----------IEYETEEVKIGDLPIMVKSKACPLSGLSPDELIEKGEDPKDPGGYFIINGSE 149 (1101) T ss_pred CCCEEEEEEEEEEEEECC----------EEEEEEEEEEEECCEEECCCCCCCCCCCHHHHHHCCCCCCCCCCEEEECCEE T ss_conf 996568899999999999----------3888999998658779688854467999899986088877899679989928 Q ss_pred EEEEEEECCCCCEEECCCCCCCCCCCCEEEEEEEECCCCCE---EEEEECCCCEEEEEECCC-CCHHHHHHHHHCCCCHH Q ss_conf 46866512278521202346515778567999981188723---689975898299997187-87039989988099847 Q gi|254780143|r 151 RIVVSQLHRSPGIHFDHDKGRASLSGKLLYACRIIPDQGLW---MDIEFDSKDIIHVRIDRR-RKVPVTSFLMALGMDSE 226 (1386) Q Consensus 151 RVIVsQl~RSPGVyf~~~k~k~~~s~k~~ysa~IIP~RGSw---Le~e~d~kd~iyvrIdr~-rKIPi~ilLrALG~ssd 226 (1386) ||||+|++||||.+|... .....+..|.++++|.||+| +.+++++++.+|+++++. ++||+++||||||+++| T Consensus 150 RVIv~q~~~~~n~~~~~~---~~~~~~~~~~~~i~s~~~~~~~~~~~~~~~~g~i~~~i~~~~~~IPi~illraLG~~sD 226 (1101) T PRK08565 150 RVIVSQEDLAPNRILVDK---GEAGSSVTHVAKVFSTRAGYRAQLTVERKKDGTIYVSIPAVPGKIPFVILMRALGLETD 226 (1101) T ss_pred EEEEEEEECCCCCEEEEE---CCCCCCEEEEEEEEECCCCCCEEEEEEECCCCEEEEEECCCCCEEEHHHHHHHHCCCCH T ss_conf 999999626899689986---57899368999999778876045899992798799998783547639999998379978 Q ss_pred -HHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEECCCCHHHHHHHHHHHHCCCCCCCCCHHHHCC Q ss_conf -9999738740577306731023545666421012232112445845440221020799887876360001026677347 Q gi|254780143|r 227 -EILSTFYPKIVYSQRGDFWCFPLSAADLMVGAKVSSSLVDIDTGEQVIESGKKLTSGLLKSLKEKGVKFLGITSDCLCG 305 (1386) Q Consensus 227 -eIl~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~l~~ 305 (1386) ||++.++.+... . . T Consensus 227 ~eI~~~i~~d~~~------------------------------------------~----~------------------- 241 (1101) T PRK08565 227 EDIVDAVSLDPEI------------------------------------------Q----Q------------------- 241 (1101) T ss_pred HHHHHHHCCCHHH------------------------------------------H----H------------------- T ss_conf 9999873278899------------------------------------------9----9------------------- Q ss_pred CCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHCCCC Q ss_conf 41000013666776999623459989999998656541024420244455200000110234588999999887605776 Q gi|254780143|r 306 LYVAEDIVNGETGEIYIEAGDVIDEKSLEEIFHSEIRDIPILYVDSVNNNAYIRNTLVTDKNKDRKDALLDIYRVMRPGD 385 (1386) Q Consensus 306 ~~~~~~~~d~~~gei~~~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~i~~~~~~d~~~~~~eAl~~I~k~lr~~~ 385 (1386) ..+..+.. ..+...++++|+.+++++++++. T Consensus 242 -------------------------~l~~~l~~------------------------~~~~~~t~e~al~yi~~~~~~~~ 272 (1101) T PRK08565 242 -------------------------ELLPSLED------------------------ASDIAITREDALDYIGKRVAPGQ 272 (1101) T ss_pred -------------------------HHHHHHHH------------------------CCCCCCCHHHHHHHHHHHHCCCC T ss_conf -------------------------99999996------------------------33667899999999997606899 Q ss_pred CCHH--HHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCC Q ss_conf 3104--56888862024530233345555777654203667767706618998888988876304876434401015332 Q gi|254780143|r 386 VSTF--SVAESMFNFLFFDSDKYDLSTVGRVKMNMRLNLDTPDDVRHIRKEDIIAIIKILVDLRNGKGTIDDIDNLGNRR 463 (1386) Q Consensus 386 ~~~~--~~~~~~~~~~~~~~~~y~l~~vgr~~~n~~l~~~~~~~~~~Lt~~d~~~~i~~L~~l~~g~~~~DdiDhlgnkR 463 (1386) +... +.+..++.+.++ ..+ ......+..+..++..|+++|+.+..|...+||+|||+||| T Consensus 273 ~~~~~~~~~~~~l~~~~l-------pHl-----------g~~~~~~~~K~~~L~~mi~kLl~~~~g~~~~DD~D~l~NkR 334 (1101) T PRK08565 273 PREFRIRRAEQLLDNYLL-------PHL-----------GTSPEDRIKKAYFLGQMARKLLELYLGRRKPDDKDHYANKR 334 (1101) T ss_pred CCCHHHHHHHHHHHHHCC-------CCC-----------CCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEE T ss_conf 972569999999997384-------766-----------98830257889999999999999875998889863103738 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHCCC--CCCCCCCCCCCCHHHHHHHHHHHCCCCC-------CEEECCCCCCHHH Q ss_conf 333357878988888899998876531134--4334352112220234655542026675-------0354155421023 Q gi|254780143|r 464 VRSVGEMLKNQYRLGLLRMERSIKERISSV--DIDSVMPQDLINAKPVVSAVCEFFCSSQ-------LSQLEEHVNSLSR 534 (1386) Q Consensus 464 vr~vgeLl~~~fr~~l~rl~r~i~~~~~~~--~~~~~~~~~~in~~~i~~~i~~ff~t~~-------lsq~ld~~n~ls~ 534 (1386) ++++|+|++++|+.++.++.+.+++++... ......+..++++..++..+++||+|++ |||||||+|||++ T Consensus 335 v~~~G~Ll~~lfr~~l~~~~k~ik~~l~~~~~~~~~~~~~~~i~~~~It~~i~~~fsTGnw~~~~sGlSQ~ldr~N~ls~ 414 (1101) T PRK08565 335 LRLAGDLLAQLFRTAFKQLVKDLKYQLEKTYARGRRIDIRTIVRPDIITERIRHALATGNWVGGRTGVSQLLDRTNYLST 414 (1101) T ss_pred ECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCHHHH T ss_conf 85688999999999999999999999998863178788899508255678999998536656787517898536767665 Q ss_pred HHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEEEECEEEEEEECCCCCCCCEEEEEECCCCC--- Q ss_conf 22001122344444322346655432211110444303566765210210001224424687644106986225534--- Q gi|254780143|r 535 ITHTRRLSALGQGGVARARAGVEMRDVHPTHYGRICPAETSEGHNIGLVSSLTSFARVNAYGFIETPYRKVCDGKVT--- 611 (1386) Q Consensus 535 lth~RR~s~lgpggl~r~~~~~evR~ih~s~~GriCPieTPEG~n~GLv~~la~~a~in~~g~ie~py~~v~~~~~~--- 611 (1386) |||+||+++ ++.|++++|+||+|||||||||||+|||||+|||||+|||++|+|| .++.++|++.+...... T Consensus 415 lSH~RRv~~----~l~r~~~~~~vR~lHpS~yG~iCPieTPEG~n~GLi~~LA~~a~Is-~~~~~~~i~~~l~~~g~~~~ 489 (1101) T PRK08565 415 LSHLRRVVS----PLSRTQPHFEARDLHGTQWGRICPFETPEGQNCGLVKNLALLAQIT-VGVDEEEVEELLYDLGVVPV 489 (1101) T ss_pred HHHHHHCCC----CCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCEEEEEEEEEEC-CCCCCHHHHHHHHHCCCEEH T ss_conf 102433035----3354566664421477645255565489987512222003589970-58880778999986395665 Q ss_pred ---------CCEEECC---------HHH--------------HCCEEEEC---C--C-CEECCC-CCCCCCCEEE----- Q ss_conf ---------8166429---------677--------------37637961---6--5-341146-8402220000----- Q gi|254780143|r 612 ---------NDVVYLS---------AME--------------EENRYIAQ---A--N-SSLDED-GSFTEELVFC----- 647 (1386) Q Consensus 612 ---------~~i~~l~---------~~~--------------e~~~~Ia~---~--~-~~l~~~-~~~~~~~~~~----- 647 (1386) ....+++ +.. .....|+. . + ..+..+ ++.+...... T Consensus 490 ~~~~~~~~~~~~v~lnG~~iG~~~~~~~~~~~lr~~rr~~~i~~~~si~~~~~~~~~~i~i~sd~gR~~rPll~~~~~~~ 569 (1101) T PRK08565 490 EEAREEGKRGARVYLNGRLIGYHPDGEELAETIRELRRSGKISDEVNVAYYRTGEINEVYVNCDAGRVRRPLIVVENGKP 569 (1101) T ss_pred HHHCCCCCCCCEEEECCEEEEEECCHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCEEEEECCCCCCCCEEEEECCCCC T ss_conf 77042323562799747799997468999999998874388574699997437655658996477621140587227864 Q ss_pred --------------------------------CCCCCCCCCCHH----HEEECCCCCCCEEEECCCCCCCHHHCCHHHHH Q ss_conf --------------------------------002333322578----72202367211023112333201101002211 Q gi|254780143|r 648 --------------------------------RCAGEEILVPRE----KIDFIDASPKQVVSIAASLIPFLENDDSNRVL 691 (1386) Q Consensus 648 --------------------------------r~~~~~~~~~~~----~v~~~~i~p~~i~sv~aslIPflehdda~R~l 691 (1386) ......+...++ ..+|++++|.+++|++|++|||++|||+||++ T Consensus 570 ~~~~~~~~~~~~~~~~~~~l~~~g~ieyid~~e~~~~~Ia~~~~~~~~~~th~Ei~p~~ilsv~asliPf~~hNqspRn~ 649 (1101) T PRK08565 570 KLTREHVEKLKKGELTFDDLVKMGVVEYLDAEEEENAYIALDPWDVTKEHTHLEIWPPAILGVVASIIPYPEHNQSPRNT 649 (1101) T ss_pred CCCHHHHHHHHCCCCCHHHHHHCCCEEEECCCCCEEEEEEECHHHCCCCCCEEEEEHHHEEEEEECCCCCCCCCCCHHHH T ss_conf 23488888877488204556407846986554240058996567735665248850136135531146435667215665 Q ss_pred HHHHHHHHHHCCCCCCCCEECCCCCHHHHHHCCCEEECCCCCCCCCCCCCCEEEECCCCCCCCCCCCCEEECCCCCCCCC Q ss_conf 22234443210136654111266201110106530102124434335655302521566556444663011146555445 Q gi|254780143|r 692 MGCNMQRQAVPLLKAEAPFVGTGMESVVAKSSGAAIVAKRAGIVEQVDAIRIVIRSVEGDLDPSTSGVDIYRLMKFQRSN 771 (1386) Q Consensus 692 ~g~nm~rQav~l~~~~~~~v~tg~E~~~~~~s~~~i~a~~~g~v~~vd~~~i~i~~~~~~~~~~~~~~~~y~~~~~~~~~ 771 (1386) |||+|+||||++...+.. .+.++. . T Consensus 650 yq~~m~KQa~g~~~~n~~-----------------------------------------~r~D~~--------------~ 674 (1101) T PRK08565 650 YQAAMAKQSLGLYAANFR-----------------------------------------IRVDTR--------------G 674 (1101) T ss_pred HHHCCCCCCCCCCCCCCE-----------------------------------------EEECCC--------------C T ss_conf 421244333565410011-----------------------------------------576155--------------5 Q ss_pred CCCCCCCCCCCCCCCEEECCCEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEECCHHHHHCCCCCEEEEEEEEHHH Q ss_conf 47322445304479772078520355223578602223751555313554444420000134425873103466653112 Q gi|254780143|r 772 QNTCVNQRPLVKVGDEVRRNDIIADGPSTDLGDLALGRNMLVAFMPWHGYNFEDSMLISERMVSEDVFTSIHIEEFEVMA 851 (1386) Q Consensus 772 ~~~~~~q~p~V~~g~~~~~~~~l~~~~~~~~~el~~G~N~~VA~m~~~GYN~EDaiiin~~~v~rg~~~s~h~~~y~~~~ 851 (1386) +.++++|+|+|++. .++. ..++++|+|+|++||||||+||||||||||||++++||+|||+|+++|++++ T Consensus 675 ~~l~ypQ~PlV~t~----~~~~------~~~~~~p~G~N~iVAvmsy~GYN~EDAIIink~sv~rg~f~s~~~~~y~~~~ 744 (1101) T PRK08565 675 HLLHYPQRPLVQTR----GLEL------IGYNDRPAGQNAVVAVLSYTGYNIEDAIIMNKASIERGLARSTFFRTYETEE 744 (1101) T ss_pred CEEECCCCCEEEEC----HHEE------ECCCCCCCCEEEEEEEECCCCCCHHHHHEECCCHHHCCCEEEEEEEEEEEEE T ss_conf 37971777626502----0014------3457788976689999767786754510002111123870689999999998 Q ss_pred HHCCCCCC-CCCCCCCCCC----HHHHHHCCCCCCCCCCCEECCCCCEEECCCCCCCCCCCHHHHHHHHHCCCCCCCCCC Q ss_conf 11478840-0246665468----678410241374137733136761110124677776770346654303555544323 Q gi|254780143|r 852 RDTKLGPE-EITRDIPNVS----EEGLKNIDECGIICVGAEVNPGDILVGKITPKGESPMTPEEKLLRAIFGEKAVDVRD 926 (1386) Q Consensus 852 ~~~~~g~~-~~~~~~~~~~----~~~~~~ld~~Giv~~G~~V~~gDilvgk~tp~~~~~~~pe~~~l~~i~~~~~~~~~d 926 (1386) ++++.|.+ .++.+.|+++ +..+++||+||+|++|++|++|||||||++|+...+..++...+ ...+| T Consensus 745 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~LD~dGii~~G~~v~~gDilvGKvsp~~~~~~~~~~~~~--------~~~~d 816 (1101) T PRK08565 745 RRYPGGQEDKIEIPEPNVRGYRGEEAYRKLDEDGIVSPEVYVEGGDVLIGKTSPPRFLEELEELSGL--------QERRD 816 (1101) T ss_pred EECCCCCCEEEECCCCCCCCCCCHHHHHCCCCCCCCCCCCEECCCCEEEEECCCCCCCCCCCCCCCC--------CEEEE T ss_conf 7146997327856897645666466773168568737998966899899963687756542112564--------12531 Q ss_pred CCEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 32022788532200121014565420266788889999999872888999887888988888625853456655555432 Q gi|254780143|r 927 TSLRVPSGVSGTVVDVRIFNRHGIDKNERSISVEREQIELLARDKDDEQVILDRNIYSRLMEILCGQNAVSGPKGFKKST 1006 (1386) Q Consensus 927 ~~~~~~~g~~g~v~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1006 (1386) +|+++++|+.|+|.+|.+... T Consensus 817 ~s~~~~~~~~g~Vd~v~~~~~----------------------------------------------------------- 837 (1101) T PRK08565 817 TSVAVRHGEKGIVDTVIITES----------------------------------------------------------- 837 (1101) T ss_pred CCEECCCCCCCEEEEEEEEEC----------------------------------------------------------- T ss_conf 551226898734789999853----------------------------------------------------------- Q ss_pred CCCHHHHHCCCHHHHHEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEEEEECCCCC Q ss_conf 11357640148543100141586789999999998999999999998988763125886774727699999987558855 Q gi|254780143|r 1007 VLSSDLISEYPRSQWWQFAVQDEKVQRNVESLKVQYETSKSILEDRFKNKIEKIQWGDDMPPGVLRVVKVFVAMKRPIQS 1086 (1386) Q Consensus 1007 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~ki~~~~~~~~gv~~~vKV~ir~~R~p~i 1086 (1386) ++++++|||++|++|+||| T Consensus 838 -------------------------------------------------------------~~~~~~vkv~ir~~R~p~i 856 (1101) T PRK08565 838 -------------------------------------------------------------PEGNKLVKVRVRDLRIPEL 856 (1101) T ss_pred -------------------------------------------------------------CCCCEEEEEEEEEECCCCC T ss_conf -------------------------------------------------------------8884689999841147765 Q ss_pred CCCCCCCCCCCCEEEEEECCCCCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHH Q ss_conf 23143366787358886300007879358715698668986750708999999999999871962243334322221024 Q gi|254780143|r 1087 GDKMAGRHGNKGIVSRILPCEDMPFLKDGTPVDIVLNPLGVPSRMNVGQIFETHLGWACVGLGKKIKSLINDYKANGDIS 1166 (1386) Q Consensus 1087 GDKfasRHGqKGVis~i~p~eDMPf~~dG~~pDIIlNPhgvPSRMtIGqllE~~lGka~~~~G~~~~~~~f~~~~~~~~~ 1166 (1386) |||||||||||||||+|||+||||||+||++|||||||||||||||||||+|||+||||+++|..+| T Consensus 857 GDKfasRHGqKGvis~i~p~eDMPf~~dG~~pDiI~NPhg~PSRMtiGql~E~~~gk~~~~~g~~~d------------- 923 (1101) T PRK08565 857 GDKFASRHGQKGVIGMLVPQEDMPFTEEGIVPDLILNPHAIPSRMTVGQLLEAIAGKVAALTGRFVD------------- 923 (1101) T ss_pred CCEECCCCCCCCEEEEECCHHCCCCCCCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCC------------- T ss_conf 5431445567633320235301996889988518878987877670999999987688872597311------------- Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEECCCCCCCCCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCC Q ss_conf 67778776315432210000135256777665302684002565578899999999998686899869998689884026 Q gi|254780143|r 1167 PLRSFLEKVIGTGSHTEKISDYDDDSVLRVAEQWKSGVPVSTPVFDGADEEAINSMLRMADLDESGQSILYDGLTGEPFD 1246 (1386) Q Consensus 1167 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~aTP~F~g~~~~~i~~~L~~aG~~~~Gke~lydG~TGe~~~ 1246 (1386) ||| |++.++++++++|.++||+++|+|.||||+|||+|+ T Consensus 924 ----------------------------------------~tp-F~~~~~~~~~~~l~~~g~~~~G~e~ly~G~tG~~~~ 962 (1101) T PRK08565 924 ----------------------------------------ATP-FEGEPEEELRKELLKLGFKPSGKEVMYDGRTGEKLK 962 (1101) T ss_pred ----------------------------------------CCC-CCCCCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCC T ss_conf ----------------------------------------489-899889999999997598999998967799998605 Q ss_pred CCEEEEEEEEEECCCHHHHHCCCCCCCCCCCEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCH---- Q ss_conf 8504884645410110211000023687311030799862331784320789999999869899998611100110---- Q gi|254780143|r 1247 RPVTVGYIYMLKLNHMVSDKVYARSTGSYSLVTQQPLGGKSNRGGQRLGEMEVWCIQAYGAAYVLQEMLTIKSDDV---- 1322 (1386) Q Consensus 1247 ~~IfvG~~YyqkL~HMV~DKiHARstGP~sllTrQP~eGRsr~GGlRfGEMErwaL~AyGAa~~LqE~Lt~kSDdv---- 1322 (1386) ++|||||+|||||+|||+|||||||||||++|||||++||||+|||||||||||||+|||||++||||||++||++ T Consensus 963 ~~i~~G~~yy~rL~HmV~DK~haRs~Gp~~~lT~QP~~Gr~~~GG~R~GEME~dal~a~Gaa~~L~erl~~~SD~~~~~v 1042 (1101) T PRK08565 963 ADIFIGVVYYQKLHHMVADKIHARARGPVQILTRQPTEGRAREGGLRFGEMERDVLIGHGAAMLLKERLLDSSDKTVIYV 1042 (1101) T ss_pred CEEEEEHHHHHCCHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCCCCEEHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEE T ss_conf 62899736732251242427664515898772379999766789843546789999998499999999755489732450 Q ss_pred ---HH-------HHHHHHHHHCCC--CCCCCCCCHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf ---15-------999988763878--78997866778999999985420027640665 Q gi|254780143|r 1323 ---VG-------RTRVYESIVAGN--DTFETGTPESFNVLVKEMQALGLSIDLENSRT 1368 (1386) Q Consensus 1323 ---~g-------r~~~~~~iv~g~--~~~~~~~pesf~vl~~El~~l~l~~~~~~~~~ 1368 (1386) || +++.|.|.++|+ ++.++++|||||||++||+||||+++|..+|+ T Consensus 1043 c~~cg~~~~~~~~~~~~~c~~c~~~~~~~~~~iPy~fk~L~~EL~sm~i~~~l~~~d~ 1100 (1101) T PRK08565 1043 CELCGHIAWYDRRKNKPVCPVHGDKGRISPVEVSYAFKLLLQELMSMGIRPRLELGDK 1100 (1101) T ss_pred CCCCCCEEEEECCCCCEECCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCC T ss_conf 0366751443045673576677987861316798899999999987888649984687 No 7 >cd00653 RNA_pol_B_RPB2 RNA polymerase beta subunit. RNA polymerases catalyse the DNA dependent polymerization of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Each RNA polymerase complex contains two related members of this family, in each case they are the two largest subunits.The clamp is a mobile structure that grips DNA during elongation. Probab=100.00 E-value=0 Score=1833.32 Aligned_cols=801 Identities=49% Similarity=0.791 Sum_probs=711.3 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCC--CCCEEEEEEEEEECCCC---------CCHHHHHHCCCC Q ss_conf 8899999999986246652211122589997663780767--98589999978980985---------889999983997 Q gi|254780143|r 29 LIEVQKASYDHFLMMNIAPDERPNEGLQAAFKSVFPITAF--SGAAMLEFVSYEFDPPK---------FDVDDCLWRDLT 97 (1386) Q Consensus 29 Li~iQ~~Sf~~Flq~~~~~~~r~~~GL~~v~~~~fPI~d~--~~~~~Lef~~y~l~~Pk---------~tp~ECRlR~lT 97 (1386) |+++|++|||+|| ++||++++++++||++. +.++.++|.++++++|. ++|+|||+|++| T Consensus 1 Lv~~qidSFn~Fi----------~~gl~~ii~~~~~i~~~~~~~~~~i~~~~i~i~~P~~~~~~~~~~l~P~eaRlr~lT 70 (866) T cd00653 1 LVKQQIDSFNYFL----------NVGLQEIVKSIPPITDTDDDGRLKLKFGDIYLGKPKVEEGGVTRKLTPNECRLRDLT 70 (866) T ss_pred CCHHHHHHHHHHH----------HHHHHHHHHHCCCEEEECCCCEEEEEEEEEEECCCCCCCCCCCCCCCHHHHHHCCCC T ss_conf 9427889999999----------983999998569889708995599999889980780157897564699999854995 Q ss_pred EEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCEECC------------------------CCCEEECCEEEEE Q ss_conf 5335899999999317876655200012236787510000268------------------------9628986821468 Q gi|254780143|r 98 YAVPLKITLRLIVFDVDEFTGAKSIKDIKEQSIYMGDLPLMTK------------------------DGTFVIKGIQRIV 153 (1386) Q Consensus 98 YsapL~V~i~l~v~~~~~~~~~k~~~~ike~~V~lG~IPiMt~------------------------~GyFIING~ERVI 153 (1386) |||||+|++++++.+.. ..++++|++|+||||++ +|||||||+|||| T Consensus 71 Ys~~l~v~i~~~~~~~~---------~~~~~~v~iG~IPIMvkS~~C~L~~~~~~~l~~~gEcp~D~GGYFIInG~EKVI 141 (866) T cd00653 71 YSAPLYVDIRLTVNDKG---------KIKEQEVFIGEIPIMLRSKLCNLNGLTPEELIKLGECPLDPGGYFIINGTEKVI 141 (866) T ss_pred EEEEEEEEEEEEECCCC---------CEEEEEEEEEECCEEECCCCCCCCCCCHHHHHHCCCCCCCCCEEEEECCEEEEE T ss_conf 30479999999997777---------179999998417378278743367999889976188877898089987789999 Q ss_pred EEEECCCCCEEECCCCCCCCCCCCEEEEEEEEC----CCCCEEEEEECC-CCEEEEEECCCCCHHHHHHHHHCCCCHHHH Q ss_conf 665122785212023465157785679999811----887236899758-982999971878703998998809984799 Q gi|254780143|r 154 VSQLHRSPGIHFDHDKGRASLSGKLLYACRIIP----DQGLWMDIEFDS-KDIIHVRIDRRRKVPVTSFLMALGMDSEEI 228 (1386) Q Consensus 154 VsQl~RSPGVyf~~~k~k~~~s~k~~ysa~IIP----~RGSwLe~e~d~-kd~iyvrIdr~rKIPi~ilLrALG~ssdeI 228 (1386) ++|++++||.+|..+. .++..|..+.++ .++||++++.+. ++.+|++++.. T Consensus 142 i~qe~~~~N~~~~~~~-----~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~i~~~i~~~------------------- 197 (866) T cd00653 142 INQEQRSPNVIIVEDS-----KGKRIYTKTSIPSYSPYRGSWLEVKSDKKKDRIYVRIDLK------------------- 197 (866) T ss_pred EEEECCCCCEEEEECC-----CCCEEEEEEECCCCCCCCCEEEEEEECCCCCEEEEEECCC------------------- T ss_conf 9999058987999748-----9837999998656677760699999536687699984158------------------- Q ss_pred HHHHCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEECCCCHHHHHHHHHHHHCCCCCCCCCHHHHCCCCC Q ss_conf 99738740577306731023545666421012232112445845440221020799887876360001026677347410 Q gi|254780143|r 229 LSTFYPKIVYSQRGDFWCFPLSAADLMVGAKVSSSLVDIDTGEQVIESGKKLTSGLLKSLKEKGVKFLGITSDCLCGLYV 308 (1386) Q Consensus 229 l~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~l~~~~~ 308 (1386) T Consensus 198 -------------------------------------------------------------------------------- 197 (866) T cd00653 198 -------------------------------------------------------------------------------- 197 (866) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred CCCCCCCCCCEEEEECCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHCCCCCCH Q ss_conf 00013666776999623459989999998656541024420244455200000110234588999999887605776310 Q gi|254780143|r 309 AEDIVNGETGEIYIEAGDVIDEKSLEEIFHSEIRDIPILYVDSVNNNAYIRNTLVTDKNKDRKDALLDIYRVMRPGDVST 388 (1386) Q Consensus 309 ~~~~~d~~~gei~~~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~i~~~~~~d~~~~~~eAl~~I~k~lr~~~~~~ 388 (1386) .+++|+.+|++++ T Consensus 198 ------------------------------------------------------------~~e~al~yIg~k~------- 210 (866) T cd00653 198 ------------------------------------------------------------RQEEALKYIGKRF------- 210 (866) T ss_pred ------------------------------------------------------------CHHHHHHHHHHHH------- T ss_conf ------------------------------------------------------------8899999999999------- Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCCCCCHH Q ss_conf 45688886202453023334555577765420366776770661899888898887630487643440101533233335 Q gi|254780143|r 389 FSVAESMFNFLFFDSDKYDLSTVGRVKMNMRLNLDTPDDVRHIRKEDIIAIIKILVDLRNGKGTIDDIDNLGNRRVRSVG 468 (1386) Q Consensus 389 ~~~~~~~~~~~~~~~~~y~l~~vgr~~~n~~l~~~~~~~~~~Lt~~d~~~~i~~L~~l~~g~~~~DdiDhlgnkRvr~vg 468 (1386) .+++.|+++|+.+..|...+||+||++|||++++| T Consensus 211 ---------------------------------------------~~L~~mi~kLl~~~~g~~~~DD~D~~~NKRv~~~G 245 (866) T cd00653 211 ---------------------------------------------EDLIYMIRKLILLVLGKGKLDDIDHLGNKRVRLAG 245 (866) T ss_pred ---------------------------------------------HHHHHHHHHHHHHHCCCCCCCCCCCCCCCEECCHH T ss_conf ---------------------------------------------99999999999986699998886434587764599 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCC--CCCCCCCCCCCCHHHHHHHHHHHCCCCC------------CEEECCCCCCHHH Q ss_conf 7878988888899998876531134--4334352112220234655542026675------------0354155421023 Q gi|254780143|r 469 EMLKNQYRLGLLRMERSIKERISSV--DIDSVMPQDLINAKPVVSAVCEFFCSSQ------------LSQLEEHVNSLSR 534 (1386) Q Consensus 469 eLl~~~fr~~l~rl~r~i~~~~~~~--~~~~~~~~~~in~~~i~~~i~~ff~t~~------------lsq~ld~~n~ls~ 534 (1386) +|++++|+.++.++.+.++.++... ......+...+++..|+..+++||+|++ |||+|||+||+++ T Consensus 246 ~Ll~~lFr~~l~~~~~~ik~~i~k~~~~~~~~~~~~~~~~~~It~~i~~~~~TGnw~~~~~~~~~~GvsQ~l~r~N~ls~ 325 (866) T cd00653 246 ELLQNLFRSGLKRLEREVKEKLQKQLSKKKDLTPQLLINSKPITSGIKEFLATGNWGSKRFLMQRSGLSQVLDRLNPLSE 325 (866) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCCHHHH T ss_conf 99999999999999999999998764036657888863762789999999971887666776565418999536888998 Q ss_pred HHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEEEECEEEEEEECCCCCCCCEEEEEECCCCCCCE Q ss_conf 22001122344444322346655432211110444303566765210210001224424687644106986225534816 Q gi|254780143|r 535 ITHTRRLSALGQGGVARARAGVEMRDVHPTHYGRICPAETSEGHNIGLVSSLTSFARVNAYGFIETPYRKVCDGKVTNDV 614 (1386) Q Consensus 535 lth~RR~s~lgpggl~r~~~~~evR~ih~s~~GriCPieTPEG~n~GLv~~la~~a~in~~g~ie~py~~v~~~~~~~~i 614 (1386) ++|+||+++ |++.++++.++||+||||||||+||+|||||+|||||+|||++|+|| |++++||+.|. T Consensus 326 lSh~Rri~~---g~~~~~~k~~~~R~Lh~s~~G~iCPveTPEG~~cGLvknLal~a~It--Grl~rP~~~v~-------- 392 (866) T cd00653 326 LSHKRRISS---LGLFRERKGFEVRDLHPSHWGRICPIETPEGENCGLVKNLALMARIS--GRIERPYRIVE-------- 392 (866) T ss_pred HHHHHEECC---CCCCCCCCCCCCCCCCHHHCEEECCCCCCCCCCEEEEEEEEEEEEEE--EEEECCEEEEC-------- T ss_conf 666642136---78474556775533460126242355589998431000001048864--69854479946-------- Q ss_pred EECCHHHHCCEEEECCCCEECCCCCCCCCCEEECCCCCCCCCCHHHEEECCCCCCCEEEECCCCCCCHHHCCHHHHHHHH Q ss_conf 64296773763796165341146840222000000233332257872202367211023112333201101002211222 Q gi|254780143|r 615 VYLSAMEEENRYIAQANSSLDEDGSFTEELVFCRCAGEEILVPREKIDFIDASPKQVVSIAASLIPFLENDDSNRVLMGC 694 (1386) Q Consensus 615 ~~l~~~~e~~~~Ia~~~~~l~~~~~~~~~~~~~r~~~~~~~~~~~~v~~~~i~p~~i~sv~aslIPflehdda~R~l~g~ 694 (1386) ..++|+|++|.+++|++||+|||++|||||||+||| T Consensus 393 --------------------------------------------~~~th~ei~p~~~l~~~aslIPF~~hNqspRn~yq~ 428 (866) T cd00653 393 --------------------------------------------KEVTHIEISPSQILSVAASLIPFPEHNQSPRNLYQS 428 (866) T ss_pred --------------------------------------------CCEEEEECCHHHHHHHHHHCCCCCCCCHHHHHHHHH T ss_conf --------------------------------------------860799728778344664246773468136777766 Q ss_pred HHHHHHHCCCCCCCCEECCCCCHHHHHHCCCEEECCCCCCCCCCCCCCEEEECCCCCCCCCCCCCEEECCCCCCCCCCCC Q ss_conf 34443210136654111266201110106530102124434335655302521566556444663011146555445473 Q gi|254780143|r 695 NMQRQAVPLLKAEAPFVGTGMESVVAKSSGAAIVAKRAGIVEQVDAIRIVIRSVEGDLDPSTSGVDIYRLMKFQRSNQNT 774 (1386) Q Consensus 695 nm~rQav~l~~~~~~~v~tg~E~~~~~~s~~~i~a~~~g~v~~vd~~~i~i~~~~~~~~~~~~~~~~y~~~~~~~~~~~~ 774 (1386) ||+|||+|++..+.. .+.++.. +-+ T Consensus 429 ~M~kQa~g~~~~n~~-----------------------------------------~r~D~~~--------------~~L 453 (866) T cd00653 429 NMQKQAVGTPALNQQ-----------------------------------------YRMDTKL--------------YLL 453 (866) T ss_pred HHHHCCCCEEECCCC-----------------------------------------EECCCCC--------------EEE T ss_conf 644031420101340-----------------------------------------4403510--------------353 Q ss_pred CCCCCCCCCCCCEEECCCEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEECCHHHHHCCCCCEEEEEEEEHHHHHC Q ss_conf 22445304479772078520355223578602223751555313554444420000134425873103466653112114 Q gi|254780143|r 775 CVNQRPLVKVGDEVRRNDIIADGPSTDLGDLALGRNMLVAFMPWHGYNFEDSMLISERMVSEDVFTSIHIEEFEVMARDT 854 (1386) Q Consensus 775 ~~~q~p~V~~g~~~~~~~~l~~~~~~~~~el~~G~N~~VA~m~~~GYN~EDaiiin~~~v~rg~~~s~h~~~y~~~~~~~ 854 (1386) +++|+|+|+++.. ..+.++|+|+|+|++||||||+|||||||||+||++++||+|||+|+++|+++++.+ T Consensus 454 ~ypQ~Plv~t~~~----------~~~~~~e~p~G~N~iVAv~~y~GYn~EDAiIiNk~si~rG~f~s~~~~~~~~~~~~~ 523 (866) T cd00653 454 LYPQKPLVGTGIE----------EYIAFGELPLGQNAIVAVMSYSGYNFEDAIIINKSSVDRGFFRSIHYKKYEIELRKT 523 (866) T ss_pred CCCCCCEEECCHH----------HHHCCCCCCCCEEEEEEEECCCCCCCCCEEEECHHHHHHHHCEEEEEEEEEEEEEEC T ss_conf 3687754405768----------874898677745579999657787743058961989873213169999889998715 Q ss_pred CCCCCCCCC-CCCCCCHHHHHHCCCCCCCCCCCEECCCCCEEECCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCEECCC Q ss_conf 788400246-6654686784102413741377331367611101246777767703466543035555443233202278 Q gi|254780143|r 855 KLGPEEITR-DIPNVSEEGLKNIDECGIICVGAEVNPGDILVGKITPKGESPMTPEEKLLRAIFGEKAVDVRDTSLRVPS 933 (1386) Q Consensus 855 ~~g~~~~~~-~~~~~~~~~~~~ld~~Giv~~G~~V~~gDilvgk~tp~~~~~~~pe~~~l~~i~~~~~~~~~d~~~~~~~ 933 (1386) ..+.++.+. ++++.+...+.+||+||++++|++|.+|||+|||++|.......+ +++....+.+|+|++++. T Consensus 524 ~~~~~~~~~~~~~~~~~~~~~~Ld~dGi~~~g~~v~~gdvligk~~~~~~~~~~~-------~~~~~~~~~~d~s~~~~~ 596 (866) T cd00653 524 KNGPEEITRGDIPNVSEEKLKNLDEDGIIRPGARVEPGDILVGKITPKGETESTP-------IFGEKARDVRDTSLKYPG 596 (866) T ss_pred CCCCCEECCCCCCCCCHHHHHHCCCCCCCCCCCEECCCCEEEEEECCCCCCCCCC-------CCCCCCCEEEEEEEEECC T ss_conf 8986313468999898678742374577579989779998999966787666664-------445566503575799359 Q ss_pred CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 85322001210145654202667888899999998728889998878889888886258534566555554321135764 Q gi|254780143|r 934 GVSGTVVDVRIFNRHGIDKNERSISVEREQIELLARDKDDEQVILDRNIYSRLMEILCGQNAVSGPKGFKKSTVLSSDLI 1013 (1386) Q Consensus 934 g~~g~v~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1013 (1386) ++.|+|.+|.++... T Consensus 597 ~~~g~V~~V~~~~~~----------------------------------------------------------------- 611 (866) T cd00653 597 GEKGIVDDVKIFSRE----------------------------------------------------------------- 611 (866) T ss_pred CCCEEEEEEEEECCC----------------------------------------------------------------- T ss_conf 984899999995255----------------------------------------------------------------- Q ss_pred HCCCHHHHHEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEEEEECCCCCCCCCCCC Q ss_conf 01485431001415867899999999989999999999989887631258867747276999999875588552314336 Q gi|254780143|r 1014 SEYPRSQWWQFAVQDEKVQRNVESLKVQYETSKSILEDRFKNKIEKIQWGDDMPPGVLRVVKVFVAMKRPIQSGDKMAGR 1093 (1386) Q Consensus 1014 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~ki~~~~~~~~gv~~~vKV~ir~~R~p~iGDKfasR 1093 (1386) ...++.++|||++|+.|+|+|||||||| T Consensus 612 ----------------------------------------------------~~~~~~~~vkV~~r~~R~p~iGDKfssR 639 (866) T cd00653 612 ----------------------------------------------------LNDGGNKLVKVYIRQKRKPQIGDKFASR 639 (866) T ss_pred ----------------------------------------------------CCCCCCEEEEEEEEEECCCCCCCCCCCC T ss_conf ----------------------------------------------------6878737999998153466546521245 Q ss_pred CCCCCEEEEEECCCCCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 67873588863000078793587156986689867507089999999999998719622433343222210246777877 Q gi|254780143|r 1094 HGNKGIVSRILPCEDMPFLKDGTPVDIVLNPLGVPSRMNVGQIFETHLGWACVGLGKKIKSLINDYKANGDISPLRSFLE 1173 (1386) Q Consensus 1094 HGqKGVis~i~p~eDMPf~~dG~~pDIIlNPhgvPSRMtIGqllE~~lGka~~~~G~~~~~~~f~~~~~~~~~~~~~~~~ 1173 (1386) ||||||||+|||+||||||+||++|||||||||||||||||||+||++||||+++|..+| T Consensus 640 HGqKGvi~~i~~~~DmPft~~G~~pDiI~NPhg~PSRMtiGql~E~~~gk~~~~~g~~~d-------------------- 699 (866) T cd00653 640 HGQKGVISKILPQEDMPFTEDGIPPDIILNPHGFPSRMTIGQLLESLLGKAGALLGKFGD-------------------- 699 (866) T ss_pred CCCCCEEEEEECCCCCCCCCCCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCEE-------------------- T ss_conf 576622567875203982889987717446665755561999999998899985497353-------------------- Q ss_pred HHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEECCCCCCCCCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCEEEEE Q ss_conf 63154322100001352567776653026840025655788999999999986868998699986898840268504884 Q gi|254780143|r 1174 KVIGTGSHTEKISDYDDDSVLRVAEQWKSGVPVSTPVFDGADEEAINSMLRMADLDESGQSILYDGLTGEPFDRPVTVGY 1253 (1386) Q Consensus 1174 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~aTP~F~g~~~~~i~~~L~~aG~~~~Gke~lydG~TGe~~~~~IfvG~ 1253 (1386) ||| |++..+++|.+.|.++||+++|+|.||||+||++|+++||+|| T Consensus 700 ---------------------------------~t~-F~~~~~~~i~~~l~~~g~~~~G~e~l~~G~tG~~~~~~i~~G~ 745 (866) T cd00653 700 ---------------------------------ATP-FDGAEEEDISELLGEAGLNYYGKEVLYDGRTGEPLEAPIFVGP 745 (866) T ss_pred ---------------------------------ECC-CCCCCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCEEEEE T ss_conf ---------------------------------278-8999889999999985989999989865999988237489962 Q ss_pred EEEEECCCHHHHHCCCCCCCCCCCEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 64541011021100002368731103079986233178432078999999986989999861110011015999988763 Q gi|254780143|r 1254 IYMLKLNHMVSDKVYARSTGSYSLVTQQPLGGKSNRGGQRLGEMEVWCIQAYGAAYVLQEMLTIKSDDVVGRTRVYESIV 1333 (1386) Q Consensus 1254 ~YyqkL~HMV~DKiHARstGP~sllTrQP~eGRsr~GGlRfGEMErwaL~AyGAa~~LqE~Lt~kSDdv~gr~~~~~~iv 1333 (1386) +|||||+|||+|||||||||||++|||||++||||+|||||||||||||+|||||++|+||||.+||+..+|.-..-.++ T Consensus 746 ~yyqrL~Hmv~dK~~~R~~Gp~~~lT~QP~~Gr~~~GG~R~GEMErd~l~ahGas~~l~erl~~~SD~~~~~vc~~cg~~ 825 (866) T cd00653 746 VYYQRLKHMVDDKIHARSTGPYSLLTRQPLKGRSRGGGQRFGEMERDALIAHGAAYLLQERLTIKSDDVVARVCVKCGII 825 (866) T ss_pred HHHHCCCHHHHHCCEEECCCCCCCCCCCCCCCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCC T ss_conf 47640212330061853528997613189997646898501477999999872999999997637887158840367612 Q ss_pred ----------CCCCCCCCCCCHHHHHHHHHHHHCCCCEEEE Q ss_conf ----------8787899786677899999998542002764 Q gi|254780143|r 1334 ----------AGNDTFETGTPESFNVLVKEMQALGLSIDLE 1364 (1386) Q Consensus 1334 ----------~g~~~~~~~~pesf~vl~~El~~l~l~~~~~ 1364 (1386) ++.+++++++|||||+|++||+|||++++|. T Consensus 826 ~~~~~c~~c~~~~~~~~~~~Py~~kll~~EL~~m~i~~~l~ 866 (866) T cd00653 826 LSANLCRLCKKGTNISKVGIPYAFKLLFQELQSMNIDPRLK 866 (866) T ss_pred CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEC T ss_conf 24543344679981670689889999999998788777609 No 8 >KOG0214 consensus Probab=100.00 E-value=0 Score=1602.85 Aligned_cols=1005 Identities=25% Similarity=0.357 Sum_probs=768.6 Q ss_pred HHHCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCC---------CCCEEEEEEEEEECCCCC- Q ss_conf 3104666788708899999999986246652211122589997663780767---------985899999789809858- Q gi|254780143|r 17 FGKNPEIIDIPDLIEVQKASYDHFLMMNIAPDERPNEGLQAAFKSVFPITAF---------SGAAMLEFVSYEFDPPKF- 86 (1386) Q Consensus 17 f~k~~~~~~~P~Li~iQ~~Sf~~Flq~~~~~~~r~~~GL~~v~~~~fPI~d~---------~~~~~Lef~~y~l~~Pk~- 86 (1386) ..-++...+.|-|+.+|+|||++|+| .++|+++.+..+|++- -.+|.|.|..|+|.+|.+ T Consensus 6 w~vis~~f~ekglvrqQldsFdeFiq----------~~~qeiv~~~p~i~l~~~~qh~~~~~~r~~l~f~qiylskP~~~ 75 (1141) T KOG0214 6 WDVISAYFEEKGLVRQQLDSFDEFIQ----------QTLQEIVADSPKIELQDEAQHTQENPARYQLFFEQIYLSKPTHT 75 (1141) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHH----------HHHHHHHHCCCCCEEHHHHHHHCCCCCCEEEEEEEEEEECCCCC T ss_conf 77877651243268887776999999----------88999974189731226556521576314888656998046301 Q ss_pred ---------CHHHHHHCCCCEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCEEC------------------ Q ss_conf ---------89999983997533589999999931787665520001223678751000026------------------ Q gi|254780143|r 87 ---------DVDDCLWRDLTYAVPLKITLRLIVFDVDEFTGAKSIKDIKEQSIYMGDLPLMT------------------ 139 (1386) Q Consensus 87 ---------tp~ECRlR~lTYsapL~V~i~l~v~~~~~~~~~k~~~~ike~~V~lG~IPiMt------------------ 139 (1386) -|+|||+|+|||||||||+++..+..... ++.+.+++.||||+||||+ T Consensus 76 e~dg~~~~m~p~eARlrnLTYSspLyVd~~k~~~~~~~-----~~~~~~~qkvfIGkIPiMlrS~~C~l~~~~d~dl~~l 150 (1141) T KOG0214 76 ESDGSTSTMFPNEARLRNLTYSSPLYVDATKIVKTSRD-----EVEDMQEQKVFIGKIPIMLRSSYCLLSGLTDKDLTEL 150 (1141) T ss_pred CCCCCCCCCCCHHHHHHHCCCCCCEEEEEEEEEEECCC-----CCCCCCCCEEEEECCCEEEEHHHHHHCCCCHHHHHHC T ss_conf 46787334571666764044466428976799850332-----0002451204673253333100211046660444540 Q ss_pred ------CCCCEEECCEEEEEEEEECCCCCEEECCCCCCCCCCCCEEEEEEEEC--CC----CCEEEEEECCCCE-----E Q ss_conf ------89628986821468665122785212023465157785679999811--88----7236899758982-----9 Q gi|254780143|r 140 ------KDGTFVIKGIQRIVVSQLHRSPGIHFDHDKGRASLSGKLLYACRIIP--DQ----GLWMDIEFDSKDI-----I 202 (1386) Q Consensus 140 ------~~GyFIING~ERVIVsQl~RSPGVyf~~~k~k~~~s~k~~ysa~IIP--~R----GSwLe~e~d~kd~-----i 202 (1386) .+|||||||+|||+|+|+..|+++++... ....++..|+++|++ .| +|.|..+.+.++. + T Consensus 151 ~eCp~DqgGyfIINGseKVlIAQe~matn~vyvf~---k~~p~~~ay~~EirS~le~~sR~~stl~v~~~ar~~~gq~i~ 227 (1141) T KOG0214 151 GECPYDQGGYFIINGSEKVLIAQEKMATNIVYVFK---KAHPGTYAYTGEIRSCLERSSRFISTLSVNMDARAGKGQEII 227 (1141) T ss_pred CCCCCCCCCEEEECCCEEEEEEHHHCCCCEEEEEE---CCCCCCEEEEEEEEHHHHCCCCCCCEEEEEECCCCCCCCEEE T ss_conf 47875788659983720033125550686699974---488984567643201565168875455643101467762799 Q ss_pred EEEECCCCCHHHHHHHHHCCCCHH-HHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEECCCCHHH Q ss_conf 999718787039989988099847-9999738740577306731023545666421012232112445845440221020 Q gi|254780143|r 203 HVRIDRRRKVPVTSFLMALGMDSE-EILSTFYPKIVYSQRGDFWCFPLSAADLMVGAKVSSSLVDIDTGEQVIESGKKLT 281 (1386) Q Consensus 203 yvrIdr~rKIPi~ilLrALG~ssd-eIl~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 281 (1386) +...-.+.+|||+|+|||||+.+| |||+++|++.........+..+..+........ T Consensus 228 ~tlpyikqeIPI~ilfrALG~v~dreILehIcYd~~D~em~e~lkpsieeafviq~qn---------------------- 285 (1141) T KOG0214 228 ATLPYIKQEIPILILFRALGFVSDREILEHICYDFDDAEMFESLKPSIEEAFVIQEQN---------------------- 285 (1141) T ss_pred EEEEEECCCCCEEEEEHHHCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH---------------------- T ss_conf 9853321567489880352677608899750478873999997551056654332057---------------------- Q ss_pred HHHHHHHHHCCCCCCCCCHHHHCCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHH Q ss_conf 79988787636000102667734741000013666776999623459989999998656541024420244455200000 Q gi|254780143|r 282 SGLLKSLKEKGVKFLGITSDCLCGLYVAEDIVNGETGEIYIEAGDVIDEKSLEEIFHSEIRDIPILYVDSVNNNAYIRNT 361 (1386) Q Consensus 282 ~~~~~~~~~~~~~~~~v~~~~l~~~~~~~~~~d~~~gei~~~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~i~~~ 361 (1386) T Consensus 286 -------------------------------------------------------------------------------- 285 (1141) T KOG0214 286 -------------------------------------------------------------------------------- 285 (1141) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred HHCCCCCCHHHHHHHHHHHH-CCC--CCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEECHHHHHH Q ss_conf 11023458899999988760-577--631045688886202453023334555577765420366776770661899888 Q gi|254780143|r 362 LVTDKNKDRKDALLDIYRVM-RPG--DVSTFSVAESMFNFLFFDSDKYDLSTVGRVKMNMRLNLDTPDDVRHIRKEDIIA 438 (1386) Q Consensus 362 ~~~d~~~~~~eAl~~I~k~l-r~~--~~~~~~~~~~~~~~~~~~~~~y~l~~vgr~~~n~~l~~~~~~~~~~Lt~~d~~~ 438 (1386) -||.+|.++- ..| .......++.+++..++ - +....+.-++-.+.++.. T Consensus 286 ----------~ALdfig~rga~~Gvtkekri~yakdiLqKe~l-------P-----------hi~~~e~~etkKA~fLGY 337 (1141) T KOG0214 286 ----------VALDFIGQRGACIGVTKEKRIKYAKEILQKELL-------P-----------HVGLGEICETKKAYFLGY 337 (1141) T ss_pred ----------HHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHH-------H-----------CCCCCHHHHHHHHHHHHH T ss_conf ----------899998861788788830356899999998741-------0-----------313000323667788885 Q ss_pred HHHHHHHHHCCCCCCCCCHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCC--CCCCCCCCCCHHHHHHHHHHH Q ss_conf 898887630487643440101533233335787898888889999887653113443--343521122202346555420 Q gi|254780143|r 439 IIKILVDLRNGKGTIDDIDNLGNRRVRSVGEMLKNQYRLGLLRMERSIKERISSVDI--DSVMPQDLINAKPVVSAVCEF 516 (1386) Q Consensus 439 ~i~~L~~l~~g~~~~DdiDhlgnkRvr~vgeLl~~~fr~~l~rl~r~i~~~~~~~~~--~~~~~~~~in~~~i~~~i~~f 516 (1386) |++.|+-...+...+||.||+||||+.++|+|++.+||..|++|.+.+...|..+-. ..+.....++.+.|++.++.. T Consensus 338 mi~rlll~aLgRr~~ddRDHfG~KRLDlaGpLLa~lfR~lf~~l~rd~~~~mQk~le~~~dfni~~aika~iIt~gl~ys 417 (1141) T KOG0214 338 MVHRLLLAALGRRELDDRDHFGNKRLDLAGPLLAKLFRSLFAKLLRDVTRYMQKCLENGVDFNIELAIKAKIITNGLNYA 417 (1141) T ss_pred HHHHHHHHHHCCCCCCCHHHCCCCCHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEEECCCCEEE T ss_conf 69888888707887420111055533212068999999999999999999999998638651278888754515640124 Q ss_pred CCCCC-------------CEEECCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEEE Q ss_conf 26675-------------03541554210232200112234444432234665543221111044430356676521021 Q gi|254780143|r 517 FCSSQ-------------LSQLEEHVNSLSRITHTRRLSALGQGGVARARAGVEMRDVHPTHYGRICPAETSEGHNIGLV 583 (1386) Q Consensus 517 f~t~~-------------lsq~ld~~n~ls~lth~RR~s~lgpggl~r~~~~~evR~ih~s~~GriCPieTPEG~n~GLv 583 (1386) ++|++ +||+|+++||+++|+|+||+++ +..|+.+...+|++|||||||+||+|||||++|||| T Consensus 418 laTgnwgdq~ka~~~ragVsQVLnR~t~~sTLsHlRR~n~----Pigr~GKlAkpRqlHnthwGmvCPaeTPEGqacGLV 493 (1141) T KOG0214 418 LATGNWGDQKKAMQGRAGVSQVLNRLNPLSTLSHLRRTNS----PIGRTGKLAKPRQLHNSHWGMVCPAETPEGQACGLV 493 (1141) T ss_pred ECCCCCCCHHHHHHCCCCHHHHHCCCCHHHHHHHHHHCCC----CCCCCCCCCCCCCCCCCCCCEECCCCCCCCCEEECC T ss_conf 1468753166775331048888514566888887764178----655556557855567552200024668876132020 Q ss_pred ECEEEEEEECCCCCCCCEEEEEECCCCCCCEEE--CCHHHHCCEEEECCCCEECCCCCCCCCCEEECCCCCCCCCCHHHE Q ss_conf 000122442468764410698622553481664--296773763796165341146840222000000233332257872 Q gi|254780143|r 584 SSLTSFARVNAYGFIETPYRKVCDGKVTNDVVY--LSAMEEENRYIAQANSSLDEDGSFTEELVFCRCAGEEILVPREKI 661 (1386) Q Consensus 584 ~~la~~a~in~~g~ie~py~~v~~~~~~~~i~~--l~~~~e~~~~Ia~~~~~l~~~~~~~~~~~~~r~~~~~~~~~~~~v 661 (1386) +|||+||.|++ |.++.|+.....-..++.++. .++.....++++.++ .+.....+.......+.......+..+.. T Consensus 494 KNLSLmayIsv-GS~~sPi~EfLeewgmE~le~~~ps~~~datkvfvNG~-wvGlhrdp~~l~~tlr~lRR~~di~~Evs 571 (1141) T KOG0214 494 KNLSLMAYISV-GSLESPIVEFLEEWGMENLEEISPSPSPDATKVFVNGV-WVGLHRDPEELVATLKRLRRQFDIIAEVS 571 (1141) T ss_pred CCCEEEEEEEC-CCCCHHHHHHHHHHCHHHHHHCCCCCCCCCEEEEECCE-EEEECCCHHHHHHHHHHHHHHHCCCHHHH T ss_conf 23257899851-88860799999985735576438876887308997360-77642798999999999987505640221 Q ss_pred EECCCCCCCE---EEECCCCCCCHHHCCH--------HHHHHHH--HHHHHHHCCCCCCCCEECCCCCHHHHHHCCCEEE Q ss_conf 2023672110---2311233320110100--------2211222--3444321013665411126620111010653010 Q gi|254780143|r 662 DFIDASPKQV---VSIAASLIPFLENDDS--------NRVLMGC--NMQRQAVPLLKAEAPFVGTGMESVVAKSSGAAIV 728 (1386) Q Consensus 662 ~~~~i~p~~i---~sv~aslIPflehdda--------~R~l~g~--nm~rQav~l~~~~~~~v~tg~E~~~~~~s~~~i~ 728 (1386) -..|+.+.++ .+.+++++||+.|+|+ +|+||++ ||+++++++-.....+++|+.|..+..+++.-.+ T Consensus 572 ~vRDIr~kE~ri~tDaGR~~rPLliven~~l~~~k~hi~~L~~~~~~~~~w~~lv~~G~ie~idteeEEtvmiam~~~dL 651 (1141) T KOG0214 572 MVRDIRDKEIRIFTDAGRSLRPLLIVENAKLLIKKRHIRALKQSKPNEYRWAHLLSSGEVEYIDTEEEETVMIAMGPKDL 651 (1141) T ss_pred HCCCCCHHHEEEEECCCCCCCCEEEEECHHHHHHHHHHHHHHCCCHHHCCHHHHCCCCCEEEECCCHHHHHHHHCCHHHH T ss_conf 00235620346641277533435998060665427789998608332134455303560564055055565673598887 Q ss_pred CC------------CCCCCCCCCCCCEEEECCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCEECC Q ss_conf 21------------244343356553025215665564446630111465554454732244530447977207852035 Q gi|254780143|r 729 AK------------RAGIVEQVDAIRIVIRSVEGDLDPSTSGVDIYRLMKFQRSNQNTCVNQRPLVKVGDEVRRNDIIAD 796 (1386) Q Consensus 729 a~------------~~g~v~~vd~~~i~i~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~q~p~V~~g~~~~~~~~l~~ 796 (1386) +. +++.+.++++..|... ++.++...+|+-.+. .+..+|+.++|.|..+...+.+...++ T Consensus 652 ~~~~~~~~~THCEiHpsmILgv~asiIpFp------dh~qspr~tYQsa~~--kqamg~y~tn~~vR~dtl~~~l~ypqk 723 (1141) T KOG0214 652 AESAYPKTYTHCEIHPSMILGVCASIIPFP------DHNQSPRNTYQSAMG--KQAMGVYHTNPQVRMDTLAKVLYYPQK 723 (1141) T ss_pred HHCCCCCCCCCCCCCHHHHHHHCEEEEECC------CCCCCHHHHHHHHHH--HHCCEEEEECCHHHHHHHHHCCCCCCC T ss_conf 425677654211356767222200365358------888852899999864--002024551504655666542436666 Q ss_pred CCC-------CCCCCCCCCCCCEEEEEECCCCCCCCCEECCHHHHHCCCCCEEEEEEEEHHHHHCCCCCCCCCCCCCC-- Q ss_conf 522-------35786022237515553135544444200001344258731034666531121147884002466654-- Q gi|254780143|r 797 GPS-------TDLGDLALGRNMLVAFMPWHGYNFEDSMLISERMVSEDVFTSIHIEEFEVMARDTKLGPEEITRDIPN-- 867 (1386) Q Consensus 797 ~~~-------~~~~el~~G~N~~VA~m~~~GYN~EDaiiin~~~v~rg~~~s~h~~~y~~~~~~~~~g~~~~~~~~~~-- 867 (1386) +.+ ..+.|||+|+|++||++||+|||||||+|||+++++||+|||.|++.|+.++..+..++++.+...+. T Consensus 724 pl~tt~~~e~l~~~eL~aG~NaiVAi~~~~GYNqEDsvimn~s~v~rg~FrS~~~RsYk~q~~~~~~~~ee~~~~~~~~~ 803 (1141) T KOG0214 724 PLVTTRAMEYLRFRELPAGQNAIVAIACYSGYNQEDSVIMNQSSVDRGLFRSFFIRSYKDQEHKKDQGPEEIFEEPPRGE 803 (1141) T ss_pred CHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC T ss_conf 14566677652254314461238998424674577788876655420204321246766655203555210003655100 Q ss_pred ---CCHHHHHHCCCCCCCCCCCEECCCCCEEECCCCC--CCCCCCHHHHHHHHHCCCCCCCCCCCCEECCCCCCCCCCCC Q ss_conf ---6867841024137413773313676111012467--77767703466543035555443233202278853220012 Q gi|254780143|r 868 ---VSEEGLKNIDECGIICVGAEVNPGDILVGKITPK--GESPMTPEEKLLRAIFGEKAVDVRDTSLRVPSGVSGTVVDV 942 (1386) Q Consensus 868 ---~~~~~~~~ld~~Giv~~G~~V~~gDilvgk~tp~--~~~~~~pe~~~l~~i~~~~~~~~~d~~~~~~~g~~g~v~~~ 942 (1386) .....|.+||+||++.+|++|..+||+|||+||. .+.+..++.+.. .-+|+|+.+++++.| ++|+ T Consensus 804 ~~~mr~~~~dkLdddG~i~~G~~vs~~Dv~iGk~t~~~~~~~~~~~~~~~~---------t~~d~s~~Lr~~e~G-ivd~ 873 (1141) T KOG0214 804 GRGMRNGKYDKLDDDGIIMPGSRVSGGDVLIGKTTPQPAKEDESGPEDRLY---------TKRDHSTKLRHTERG-IVDQ 873 (1141) T ss_pred CCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCCCCCCCCCHHCCCCCCCC---------CCCCCEEECCCCCCC-EEEE T ss_conf 201222443332225775676233128888024567766500126433433---------455412440237861-6899 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHCCCHHHHH Q ss_conf 10145654202667888899999998728889998878889888886258534566555554321135764014854310 Q gi|254780143|r 943 RIFNRHGIDKNERSISVEREQIELLARDKDDEQVILDRNIYSRLMEILCGQNAVSGPKGFKKSTVLSSDLISEYPRSQWW 1022 (1386) Q Consensus 943 ~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1022 (1386) .+.+. T Consensus 874 V~vt~--------------------------------------------------------------------------- 878 (1141) T KOG0214 874 VWVTK--------------------------------------------------------------------------- 878 (1141) T ss_pred EEEEC--------------------------------------------------------------------------- T ss_conf 99804--------------------------------------------------------------------------- Q ss_pred EEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEE Q ss_conf 01415867899999999989999999999989887631258867747276999999875588552314336678735888 Q gi|254780143|r 1023 QFAVQDEKVQRNVESLKVQYETSKSILEDRFKNKIEKIQWGDDMPPGVLRVVKVFVAMKRPIQSGDKMAGRHGNKGIVSR 1102 (1386) Q Consensus 1023 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~ki~~~~~~~~gv~~~vKV~ir~~R~p~iGDKfasRHGqKGVis~ 1102 (1386) +..+.+++||++||.|.||+|||||||||||||||+ T Consensus 879 --------------------------------------------n~~G~kF~kv~vr~~ripqiGDKfasrHgqKG~ig~ 914 (1141) T KOG0214 879 --------------------------------------------NSEGPKFVKVRVRQVRIPQIGDKFASRHGQKGTIGI 914 (1141) T ss_pred --------------------------------------------CCCCCCEEEEEEEECCCCCCCCHHCCCCCCCCCCCC T ss_conf --------------------------------------------777871278997621344323300133566751241 Q ss_pred EECCCCCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCC Q ss_conf 63000078793587156986689867507089999999999998719622433343222210246777877631543221 Q gi|254780143|r 1103 ILPCEDMPFLKDGTPVDIVLNPLGVPSRMNVGQIFETHLGWACVGLGKKIKSLINDYKANGDISPLRSFLEKVIGTGSHT 1182 (1386) Q Consensus 1103 i~p~eDMPf~~dG~~pDIIlNPhgvPSRMtIGqllE~~lGka~~~~G~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~ 1182 (1386) +++|||||||.+|++||||+||||||||||||||+||++||.++..|..++ T Consensus 915 ~~~qedmpft~eGi~pDiiiNPhaiPSRmtig~liEc~lgk~~a~~~e~~~----------------------------- 965 (1141) T KOG0214 915 TYRQEDMPFTIEGIVPDIIINPHAIPSRMTIGQLIECLLGKVAAYEGEEGD----------------------------- 965 (1141) T ss_pred EEECCCCCCCCCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCCC----------------------------- T ss_conf 333488984225777646877655863356156678763245540342256----------------------------- Q ss_pred CCCCCCCHHHHHHHHHHHCCCCEECCCCCCCCCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCEEEEEEEEEECCCH Q ss_conf 00001352567776653026840025655788999999999986868998699986898840268504884645410110 Q gi|254780143|r 1183 EKISDYDDDSVLRVAEQWKSGVPVSTPVFDGADEEAINSMLRMADLDESGQSILYDGLTGEPFDRPVTVGYIYMLKLNHM 1262 (1386) Q Consensus 1183 ~~~~~~~~~~~~~~~~~~~~g~~~aTP~F~g~~~~~i~~~L~~aG~~~~Gke~lydG~TGe~~~~~IfvG~~YyqkL~HM 1262 (1386) ||| |.+++++.|+..|+.+||+++|+|+||||+||++|.++||+||+|||||+|| T Consensus 966 ------------------------atp-Fs~v~v~~is~~l~~~g~~~~G~e~~ynGrtG~~~~~~if~GptyyqrL~Hm 1020 (1141) T KOG0214 966 ------------------------ATP-FSDVTVSKISANLHVYGYQYRGNERMYNGRTGRKLRAQIFIGPTYYQRLKHM 1020 (1141) T ss_pred ------------------------CCC-CCCCCHHCCCCCHHHHCCCCCCCEEEECCCCCCEEEEEEECCCHHHHHHHHH T ss_conf ------------------------777-6653210014117763643579788853888863024352284399877776 Q ss_pred HHHHCCCCCCCCCCCEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH---HHHHHC----- Q ss_conf 211000023687311030799862331784320789999999869899998611100110159999---887638----- Q gi|254780143|r 1263 VSDKVYARSTGSYSLVTQQPLGGKSNRGGQRLGEMEVWCIQAYGAAYVLQEMLTIKSDDVVGRTRV---YESIVA----- 1334 (1386) Q Consensus 1263 V~DKiHARstGP~sllTrQP~eGRsr~GGlRfGEMErwaL~AyGAa~~LqE~Lt~kSDdv~gr~~~---~~~iv~----- 1334 (1386) ||||||+|++||+++|||||++||||+|||||||||||||||||||+.|+|||+.+||...-+.-. |.+|.. T Consensus 1021 vd~kih~R~~Gp~q~ltRQP~~gRsr~GGlRfGEMErdc~iahGaa~~L~ERL~~~SD~~~~~~c~~c~l~~i~~~~~n~ 1100 (1141) T KOG0214 1021 VDDKIHSRARGPVQILTRQPVEGRSRDGGLRFGEMERDCLIAHGAAAFLKERLFEKSDAYRVHICVLCGLTAIAGLIPNS 1100 (1141) T ss_pred HHHEEEECCCCCEEEEECCCCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCHHHHHHCCCCCC T ss_conf 43225301468703443165446643478253246788998751788999886404766158871201111221157786 Q ss_pred ----C----CCCCCCCCCHHHHHHHHHHHHCCCCEEEEEC Q ss_conf ----7----8789978667789999999854200276406 Q gi|254780143|r 1335 ----G----NDTFETGTPESFNVLVKEMQALGLSIDLENS 1366 (1386) Q Consensus 1335 ----g----~~~~~~~~pesf~vl~~El~~l~l~~~~~~~ 1366 (1386) | ..+...-+||++|||.+||.|+++.+++... T Consensus 1101 ~~ck~c~n~~~v~~v~ipya~kLl~qelmsmni~pr~~~~ 1140 (1141) T KOG0214 1101 FECRGCENKTLVVRVYIPYAAKLLFQELMSMNIAPRRKTK 1140 (1141) T ss_pred CCCCCCCCCCCEEEEEHHHHHHHHHHHHHHCCCCCCEECC T ss_conf 5443313711146676045899999999844576430047 No 9 >KOG0215 consensus Probab=100.00 E-value=0 Score=1481.71 Aligned_cols=912 Identities=24% Similarity=0.366 Sum_probs=685.9 Q ss_pred CCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCC-CCCCCEEEEEEEEEECCCC-----------CCHHHHHH Q ss_conf 87088999999999862466522111225899976637807-6798589999978980985-----------88999998 Q gi|254780143|r 26 IPDLIEVQKASYDHFLMMNIAPDERPNEGLQAAFKSVFPIT-AFSGAAMLEFVSYEFDPPK-----------FDVDDCLW 93 (1386) Q Consensus 26 ~P~Li~iQ~~Sf~~Flq~~~~~~~r~~~GL~~v~~~~fPI~-d~~~~~~Lef~~y~l~~Pk-----------~tp~ECRl 93 (1386) +--|+.+++||||+|++ .+|..+.++..-|+ |.+..++|+|.++++++|. .+|+|||+ T Consensus 56 vkglvkqhldsfnyfv~----------~~ik~iv~an~~itsd~dp~fylky~dirvg~Ps~~~~~~~~~~~i~p~ecrl 125 (1153) T KOG0215 56 VKGLVKQHLDSFNYFVD----------VDIKKIVKANQLITSDVDPSFYLKYLDIRVGKPSIEEGNNVTNDNITPHECRL 125 (1153) T ss_pred HHHHHHHHHHHHHHHHH----------HHHHHHHHHHCCCCCCCCCHHHHEEEEEECCCCCHHHCCCCCCCCCCCHHEEE T ss_conf 65489987765424544----------12899998515667676842111001342157402203353124678541240 Q ss_pred CCCCEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCEEC------------------------CCCCEEECCE Q ss_conf 3997533589999999931787665520001223678751000026------------------------8962898682 Q gi|254780143|r 94 RDLTYAVPLKITLRLIVFDVDEFTGAKSIKDIKEQSIYMGDLPLMT------------------------KDGTFVIKGI 149 (1386) Q Consensus 94 R~lTYsapL~V~i~l~v~~~~~~~~~k~~~~ike~~V~lG~IPiMt------------------------~~GyFIING~ 149 (1386) |++|||||+||+++++..+ ..+...+|.||++|+|+ ++||||++|+ T Consensus 126 rd~tysa~i~vdieytrg~----------~~~~~~~v~igrmpimlrs~rcvl~~~~e~~~a~~~ecpldpggyfiv~G~ 195 (1153) T KOG0215 126 RDMTYSAPIYVDIEYTRGR----------QIIAKRDVIIGRMPIMLRSSKCVLRGKDEEELARLNECPLDPGGYFIVKGT 195 (1153) T ss_pred HHHHCCCCEEEEEEEECCC----------CEEEECCCCCCCCHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCCEEEECCC T ss_conf 0332046315777884286----------057742535466346652365425784688998854498788754676263 Q ss_pred EEEEEEEECCCCC-EEECCCCCCCCCCCCEEEEEEEEC---CCCCEEEEEECCCCEEEEEECC-CCCHHHHHHHHHCCCC Q ss_conf 1468665122785-212023465157785679999811---8872368997589829999718-7870399899880998 Q gi|254780143|r 150 QRIVVSQLHRSPG-IHFDHDKGRASLSGKLLYACRIIP---DQGLWMDIEFDSKDIIHVRIDR-RRKVPVTSFLMALGMD 224 (1386) Q Consensus 150 ERVIVsQl~RSPG-Vyf~~~k~k~~~s~k~~ysa~IIP---~RGSwLe~e~d~kd~iyvrIdr-~rKIPi~ilLrALG~s 224 (1386) ||||+.|++.|.+ +..+.++.+. ..|.+-+ .|+|. .+.+.+++..|.++.. ..+||+++.|+|+|+. T Consensus 196 ekViLiqeqlsknrii~~~~k~~~-------~~~svtsst~e~ksk-t~v~~k~~k~ylk~~~~~d~ipiviv~ka~g~~ 267 (1153) T KOG0215 196 EKVILIQEQLSKNRIIVEEDKKGN-------LQASVTSSTHERKSK-TYVTTKKGKYYLKHNSFTDDIPIVIVLKAMGLE 267 (1153) T ss_pred EEEEEEHHHHHCCCEEECCCCCCC-------CCCCCCCCEEEECCE-EEEEECCCEEEEEHHHHHCCCCEEEEEEECCCC T ss_conf 368986663410570544466785-------100345441440433-799960651221013221347779998400342 Q ss_pred HH-HHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEECCCCHHHHHHHHHHHHCCCCCCCCCHHHH Q ss_conf 47-99997387405773067310235456664210122321124458454402210207998878763600010266773 Q gi|254780143|r 225 SE-EILSTFYPKIVYSQRGDFWCFPLSAADLMVGAKVSSSLVDIDTGEQVIESGKKLTSGLLKSLKEKGVKFLGITSDCL 303 (1386) Q Consensus 225 sd-eIl~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~l 303 (1386) +| ||.++.+.+..+... +..++...- T Consensus 268 sd~ei~~~vg~~~~y~~~---~~~s~ee~~-------------------------------------------------- 294 (1153) T KOG0215 268 SDQEIVQLVGGDSKYQDI---FAPSLEECV-------------------------------------------------- 294 (1153) T ss_pred CHHHHHHHHHCCHHHHHH---HCCCHHHHH-------------------------------------------------- T ss_conf 048899886066678876---254166525-------------------------------------------------- Q ss_pred CCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHCC Q ss_conf 47410000136667769996234599899999986565410244202444552000001102345889999998876057 Q gi|254780143|r 304 CGLYVAEDIVNGETGEIYIEAGDVIDEKSLEEIFHSEIRDIPILYVDSVNNNAYIRNTLVTDKNKDRKDALLDIYRVMRP 383 (1386) Q Consensus 304 ~~~~~~~~~~d~~~gei~~~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~i~~~~~~d~~~~~~eAl~~I~k~lr~ 383 (1386) .....++..|+.++..++.. T Consensus 295 ------------------------------------------------------------~~~v~tqqqal~y~~~kvk~ 314 (1153) T KOG0215 295 ------------------------------------------------------------SLGVYTQQQALEYIGSKVKV 314 (1153) T ss_pred ------------------------------------------------------------HHHHHHHHHHHHHHHHEEEE T ss_conf ------------------------------------------------------------54257888999876525773 Q ss_pred CC---CCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCCCCCCHHCC Q ss_conf 76---310456888862024530233345555777654203667767706618998888988876304876434401015 Q gi|254780143|r 384 GD---VSTFSVAESMFNFLFFDSDKYDLSTVGRVKMNMRLNLDTPDDVRHIRKEDIIAIIKILVDLRNGKGTIDDIDNLG 460 (1386) Q Consensus 384 ~~---~~~~~~~~~~~~~~~~~~~~y~l~~vgr~~~n~~l~~~~~~~~~~Lt~~d~~~~i~~L~~l~~g~~~~DdiDhlg 460 (1386) .. ++....+..+....+ ..+...+...-.-....+..|.+.++....+.-..||.|..| T Consensus 315 ~~~t~~~~~~e~~~~l~~~~------------------lahv~v~~~~~r~K~~yi~~m~rr~~~a~l~~~~~ddrd~~g 376 (1153) T KOG0215 315 KRGTTPTKDEEALEVLSTTV------------------LAHVPVPDLNFRMKAIYIGLMVRRVIQAMLNKLAMDDRDYVG 376 (1153) T ss_pred CCCCCCCCHHHHHHHHHHHH------------------EECEECCCCCCCCEEEHHHHHHHHHHHHHHCCCCCCCHHHHC T ss_conf 26888880489999997630------------------201222578976000028999999999874865444133305 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC--CCCCCC-----CCCHHHHHHHHHHHCCCCC------------ Q ss_conf 332333357878988888899998876531134433--435211-----2220234655542026675------------ Q gi|254780143|r 461 NRRVRSVGEMLKNQYRLGLLRMERSIKERISSVDID--SVMPQD-----LINAKPVVSAVCEFFCSSQ------------ 521 (1386) Q Consensus 461 nkRvr~vgeLl~~~fr~~l~rl~r~i~~~~~~~~~~--~~~~~~-----~in~~~i~~~i~~ff~t~~------------ 521 (1386) |+|+.++|.|+..+|...|+++....+......... .....+ -.....|+.++..+.++++ T Consensus 377 nkrlelagqllsllfeD~fk~~n~e~k~~~dk~l~k~~ra~~fD~~~~~n~~~~~it~~l~ra~stgnw~iKrfrmer~g 456 (1153) T KOG0215 377 NKRLELAGQLLSLLFEDLFKRFNSELKKNIDKILKKPIRAQEFDALKHLNVRANMITSGLERAISTGNWSIKRFRMERAG 456 (1153) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCC T ss_conf 42488888999999888888878999888998734502321567999988889888887787765275177776565405 Q ss_pred CEEECCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEEEECEEEEEEEC-------- Q ss_conf 035415542102322001122344444322346655432211110444303566765210210001224424-------- Q gi|254780143|r 522 LSQLEEHVNSLSRITHTRRLSALGQGGVARARAGVEMRDVHPTHYGRICPAETSEGHNIGLVSSLTSFARVN-------- 593 (1386) Q Consensus 522 lsq~ld~~n~ls~lth~RR~s~lgpggl~r~~~~~evR~ih~s~~GriCPieTPEG~n~GLv~~la~~a~in-------- 593 (1386) +.|++.+.++++.|..+.|+++ +++++|+...+|.+||||||++||.+||||++||||+|||++++|+ T Consensus 457 vt~VlsrlSyisaLgmmtri~s----~fektrkvsgprSlq~sqwgmlcp~dtpegeacglvknlalmthittd~ee~pi 532 (1153) T KOG0215 457 VTQVLSRLSYISALGMMTRINS----QFEKTRKVSGPRSLQPSQWGMLCPSDTPEGEACGLVKNLALMTHITTDSEEKPI 532 (1153) T ss_pred CEEEEHHHHHHHHHHHHEEHHH----HHEEEHHCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCHH T ss_conf 2032024565655335201301----000211016753247112253067789876444110266877112678775316 Q ss_pred ----------------------------------------------------------------------------CCCC Q ss_conf ----------------------------------------------------------------------------6876 Q gi|254780143|r 594 ----------------------------------------------------------------------------AYGF 597 (1386) Q Consensus 594 ----------------------------------------------------------------------------~~g~ 597 (1386) +.|. T Consensus 533 ~~~~~k~gv~di~~v~~~e~~~~~~~~v~Lng~iig~~~~~~~~v~~~r~lrr~G~i~~fvsv~~~~~q~~v~i~sdggr 612 (1153) T KOG0215 533 LKLCYKLGVEDIHVVSGRELHTPDSFLVFLNGLIIGITRRPQYFVNSFRRLRRKGKIGEFVSVFTNTTQRCVYIASDGGR 612 (1153) T ss_pred HHHHHHHCCEEEEEECCCCCCCCCEEEEEECCEEECCCCCHHHHHHHHHHHHHCCCCCCEEEEEEEEEEEEEEEECCCCE T ss_conf 78998706222354035435887438998355332233465777999999987076444165334001379999537976 Q ss_pred CCCEEEEEECCCC----------------------CCCEEECCHHHHCCEEEECCCCEECCCCCCCCCCEEECCCCCCCC Q ss_conf 4410698622553----------------------481664296773763796165341146840222000000233332 Q gi|254780143|r 598 IETPYRKVCDGKV----------------------TNDVVYLSAMEEENRYIAQANSSLDEDGSFTEELVFCRCAGEEIL 655 (1386) Q Consensus 598 ie~py~~v~~~~~----------------------~~~i~~l~~~~e~~~~Ia~~~~~l~~~~~~~~~~~~~r~~~~~~~ 655 (1386) ++.|+..|.++.. .+.++|++..||.+..||.....+. T Consensus 613 ~crp~Iiv~~~~~~v~~~h~~el~~g~r~F~DFl~~glvEYLDVNEEND~~Ialye~~I~-------------------- 672 (1153) T KOG0215 613 VCRPLIIVDNGRPRVKQHHMDELLDGKRTFDDFLKDGLIEYLDVNEENDSLIALYEEDIG-------------------- 672 (1153) T ss_pred EEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHCHHHEEECCCCCCCEEEECHHHCC-------------------- T ss_conf 750499970896312365689987755317888873403303146566724774141258-------------------- Q ss_pred CCHHHEEECCCCCCCEEEECCCCCCCHHHCCHHHHHHHHHHHHHHHCCCCCCCCEECCCCCHHHHHHCCCEEECCCCCCC Q ss_conf 25787220236721102311233320110100221122234443210136654111266201110106530102124434 Q gi|254780143|r 656 VPREKIDFIDASPKQVVSIAASLIPFLENDDSNRVLMGCNMQRQAVPLLKAEAPFVGTGMESVVAKSSGAAIVAKRAGIV 735 (1386) Q Consensus 656 ~~~~~v~~~~i~p~~i~sv~aslIPflehdda~R~l~g~nm~rQav~l~~~~~~~v~tg~E~~~~~~s~~~i~a~~~g~v 735 (1386) ...+|.++.|..+++++|+||||++|||+||++|||+|+|||++.+ T Consensus 673 ---~~TTHLEIEPFTiLGv~AGLIPYPHHNQSPRNTYQCAMGKQAmG~I------------------------------- 718 (1153) T KOG0215 673 ---PETTHLEIEPFTILGVVAGLIPYPHHNQSPRNTYQCAMGKQAMGAI------------------------------- 718 (1153) T ss_pred ---CCCCEEEECCHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHH------------------------------- T ss_conf ---8763355313115666424667777789984066643215565254------------------------------- Q ss_pred CCCCCCCEEEECCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCEECCCCCCCCCCCCCCCCCEEEE Q ss_conf 33565530252156655644466301114655544547322445304479772078520355223578602223751555 Q gi|254780143|r 736 EQVDAIRIVIRSVEGDLDPSTSGVDIYRLMKFQRSNQNTCVNQRPLVKVGDEVRRNDIIADGPSTDLGDLALGRNMLVAF 815 (1386) Q Consensus 736 ~~vd~~~i~i~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~q~p~V~~g~~~~~~~~l~~~~~~~~~el~~G~N~~VA~ 815 (1386) .+++..+.++. -..++|+|+|+|++... ..+++++||+||||.||+ T Consensus 719 ----------aYNQ~~RiDtl--------------mYll~YPq~PmVkTKTI----------ELi~ydKLPAGQNAtVAV 764 (1153) T KOG0215 719 ----------AYNQKKRIDSL--------------LYLLVYPQRPMVKTKTI----------ELINYDKLPAGQNATVAV 764 (1153) T ss_pred ----------HHHHHHHHHHH--------------HHHHHCCCCCCCCCEEE----------EEECCCCCCCCCCCEEEE T ss_conf ----------45345347788--------------99885587764220158----------752235478887617999 Q ss_pred EECCCCCCCCCEECCHHHHHCCCCCEEEEEEEEHHHHHCCCCCC-CCCC----CCCCCCHHHHHHCCCCCCCCCCCEECC Q ss_conf 31355444442000013442587310346665311211478840-0246----665468678410241374137733136 Q gi|254780143|r 816 MPWHGYNFEDSMLISERMVSEDVFTSIHIEEFEVMARDTKLGPE-EITR----DIPNVSEEGLKNIDECGIICVGAEVNP 890 (1386) Q Consensus 816 m~~~GYN~EDaiiin~~~v~rg~~~s~h~~~y~~~~~~~~~g~~-~~~~----~~~~~~~~~~~~ld~~Giv~~G~~V~~ 890 (1386) |||+||++|||+++|++|+|||++||..++..++..+++.++.- .+.. ..++........||+||++.+|..|.+ T Consensus 765 MSYSGYDIEDALVLNKsSlDRGfGRC~Vyk~~~~~~kkY~N~T~Drimgp~~d~~t~kpi~kh~vLd~DGl~~pG~~V~~ 844 (1153) T KOG0215 765 MSYSGYDIEDALVLNKSSIDRGFGRCEVYKKTTTTLKKYANGTFDRIMGPQLDPNTRKPIWKHQVLDDDGLATPGERVQP 844 (1153) T ss_pred EECCCCCHHHHHHCCCCHHCCCCCEEEEEEEEEEEEEECCCCCHHHHCCCCCCCCCCCCCHHHCCCCCCCCCCCCCEECC T ss_conf 94068765566540400011575248998600666640578751332365447876783022115375557787537326 Q ss_pred CCCEEECCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 76111012467777677034665430355554432332022788532200121014565420266788889999999872 Q gi|254780143|r 891 GDILVGKITPKGESPMTPEEKLLRAIFGEKAVDVRDTSLRVPSGVSGTVVDVRIFNRHGIDKNERSISVEREQIELLARD 970 (1386) Q Consensus 891 gDilvgk~tp~~~~~~~pe~~~l~~i~~~~~~~~~d~~~~~~~g~~g~v~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~ 970 (1386) |+|+|.|..|.....+.|..+..+. ..+++++..+.-+.+.+ T Consensus 845 ~qi~iNK~mP~vt~~~~~~~~~~~~-------~Yk~~pitykgpepsyi------------------------------- 886 (1153) T KOG0215 845 GQIYINKQMPTVTGTSLPGLSASQV-------QYKAVPITYKGPEPSYI------------------------------- 886 (1153) T ss_pred CCEEEECCCCCCCCCCCCCCCCCCC-------CCCCCCCEECCCCCCHH------------------------------- T ss_conf 8579825678755665777776553-------33216414338982321------------------------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHCCCHHHHHEEEECCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 88899988788898888862585345665555543211357640148543100141586789999999998999999999 Q gi|254780143|r 971 KDDEQVILDRNIYSRLMEILCGQNAVSGPKGFKKSTVLSSDLISEYPRSQWWQFAVQDEKVQRNVESLKVQYETSKSILE 1050 (1386) Q Consensus 971 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1050 (1386) T Consensus 887 -------------------------------------------------------------------------------- 886 (1153) T KOG0215 887 -------------------------------------------------------------------------------- 886 (1153) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred HHHHHHHHHHCCCCCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCEEEEEECCCCCCCC Q ss_conf 99898876312588677472769999998755885523143366787358886300007879358715698668986750 Q gi|254780143|r 1051 DRFKNKIEKIQWGDDMPPGVLRVVKVFVAMKRPIQSGDKMAGRHGNKGIVSRILPCEDMPFLKDGTPVDIVLNPLGVPSR 1130 (1386) Q Consensus 1051 ~~~~~k~~ki~~~~~~~~gv~~~vKV~ir~~R~p~iGDKfasRHGqKGVis~i~p~eDMPf~~dG~~pDIIlNPhgvPSR 1130 (1386) +|+....+- .-..++||.+||+|+||+|||||||||||||||+|++||||||...|||||||||||||||| T Consensus 887 -------dkVmls~n~--~dq~LIK~llRQTRrPElGDKFSSRHGQKGVcGlIv~QEDMPFnD~GIcPDiIMNPHGFPSR 957 (1153) T KOG0215 887 -------DRVMLTSND--EDQFLIKVLLRQTRRPELGDKFSSRHGQKGVCGLIVQQEDMPFNDQGICPDIIMNPHGFPSR 957 (1153) T ss_pred -------HEEEEECCC--CCCHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf -------226762274--32189999986436864234210004777402688633678875467786301188888641 Q ss_pred CCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEECCCC Q ss_conf 70899999999999987196224333432222102467778776315432210000135256777665302684002565 Q gi|254780143|r 1131 MNVGQIFETHLGWACVGLGKKIKSLINDYKANGDISPLRSFLEKVIGTGSHTEKISDYDDDSVLRVAEQWKSGVPVSTPV 1210 (1386) Q Consensus 1131 MtIGqllE~~lGka~~~~G~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~aTP~ 1210 (1386) ||||+++|.+.||||.+.|+..++| + T Consensus 958 MTVGK~iELlsGKAGVl~G~~hYGT------------------------------------------------------a 983 (1153) T KOG0215 958 MTVGKMIELLSGKAGVLEGTFHYGT------------------------------------------------------A 983 (1153) T ss_pred CHHHHHHHHHCCCCCEEEEEEEECC------------------------------------------------------C T ss_conf 1188899986366434501575112------------------------------------------------------2 Q ss_pred CCCCCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCEEEEEEEEEECCCHHHHHCCCCCCCCCCCEEECCCCCCCCCC Q ss_conf 57889999999999868689986999868988402685048846454101102110000236873110307998623317 Q gi|254780143|r 1211 FDGADEEAINSMLRMADLDESGQSILYDGLTGEPFDRPVTVGYIYMLKLNHMVSDKVYARSTGSYSLVTQQPLGGKSNRG 1290 (1386) Q Consensus 1211 F~g~~~~~i~~~L~~aG~~~~Gke~lydG~TGe~~~~~IfvG~~YyqkL~HMV~DKiHARstGP~sllTrQP~eGRsr~G 1290 (1386) |.|.+++|+++.|.++||+|.||+.+|||+||||++++||+||+|||||||||.|||||||+||...|||||||||||+| T Consensus 984 FGgskVed~~~~Lv~hGfnY~GKD~ltSGITGepLeAYIffGPiYYQKLKHMVlDKMHARARGPRAvLTRQPTEGRSrdG 1063 (1153) T KOG0215 984 FGGSKVEDISEELVEHGFNYSGKDMLTSGITGEPLEAYIFFGPIYYQKLKHMVLDKMHARARGPRAVLTRQPTEGRSRDG 1063 (1153) T ss_pred CCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCCC T ss_conf 48761989999999836575573100167778740026773558999999998988765224872234317987767778 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHCCCC----CHHHHHH--HHHHH----HCCCCCCCCCCCHHHHHHHHHHHHCCCC Q ss_conf 843207899999998698999986111001----1015999--98876----3878789978667789999999854200 Q gi|254780143|r 1291 GQRLGEMEVWCIQAYGAAYVLQEMLTIKSD----DVVGRTR--VYESI----VAGNDTFETGTPESFNVLVKEMQALGLS 1360 (1386) Q Consensus 1291 GlRfGEMErwaL~AyGAa~~LqE~Lt~kSD----dv~gr~~--~~~~i----v~g~~~~~~~~pesf~vl~~El~~l~l~ 1360 (1386) |+|+||||||||||||||++|-||||++|| |||+--- .|..- -.+.++.+..+||++|||.|||||+++- T Consensus 1064 GLRLGEMERDCLIaYGASmLl~ERLMiSSDaFeVdVC~~CGllgykgwC~~Ckss~~v~~~~iPYAcKLLFQEL~SMNi~ 1143 (1153) T KOG0215 1064 GLRLGEMERDCLIAYGASMLLLERLMISSDAFEVDVCRQCGLLGYKGWCTTCKSSKNVAKMKIPYACKLLFQELQSMNIV 1143 (1153) T ss_pred CCCCCHHHHHHHHHCCHHHHHHHHHHHCCCCEEEEECCCCCCCEECHHHHHCCCCCCEEEEECCHHHHHHHHHHHHCCCC T ss_conf 81013134435355168899999986147651355110036210111113245877504431308899999998755761 Q ss_pred EEEEECCCC Q ss_conf 276406654 Q gi|254780143|r 1361 IDLENSRTK 1369 (1386) Q Consensus 1361 ~~~~~~~~~ 1369 (1386) .+|..++.+ T Consensus 1144 PrL~L~~~~ 1152 (1153) T KOG0215 1144 PRLKLEDYF 1152 (1153) T ss_pred CEEEECCCC T ss_conf 001010024 No 10 >TIGR03670 rpoB_arch DNA-directed RNA polymerase subunit B. This model represents the archaeal version of DNA-directed RNA polymerase subunit B (rpoB) and is observed in all archaeal genomes. Probab=100.00 E-value=0 Score=1248.33 Aligned_cols=506 Identities=35% Similarity=0.520 Sum_probs=435.1 Q ss_pred CCCCCCCEEEEEECCCC----------------------CCCEEECCHHHHCCEEEECCCCEECCCCCCCCCCEEECCCC Q ss_conf 68764410698622553----------------------48166429677376379616534114684022200000023 Q gi|254780143|r 594 AYGFIETPYRKVCDGKV----------------------TNDVVYLSAMEEENRYIAQANSSLDEDGSFTEELVFCRCAG 651 (1386) Q Consensus 594 ~~g~ie~py~~v~~~~~----------------------~~~i~~l~~~~e~~~~Ia~~~~~l~~~~~~~~~~~~~r~~~ 651 (1386) +.|.+..|+..|.+++. .+.|+|++++||++..||.....+. T Consensus 50 D~GR~~RPl~vv~~~k~~~~~~~~~~l~~~~~~w~~l~~~g~IEyiD~~Ee~~~~Ia~~~~~~~---------------- 113 (599) T TIGR03670 50 DAGRIRRPLIVVENGKPKLTREHVEKLKEGELTWDDLVKQGVIEYLDAEEEENAYIALDPEELT---------------- 113 (599) T ss_pred CCCEEEEEEEEEECCCCCCCHHHHHHHHCCCCCHHHHHHCCCEEEECHHHHCCCEEEECHHHCC---------------- T ss_conf 8986466889987995736699999876189888999756988981777721838981688937---------------- Q ss_pred CCCCCCHHHEEECCCCCCCEEEECCCCCCCHHHCCHHHHHHHHHHHHHHHCCCCCCCCEECCCCCHHHHHHCCCEEECCC Q ss_conf 33322578722023672110231123332011010022112223444321013665411126620111010653010212 Q gi|254780143|r 652 EEILVPREKIDFIDASPKQVVSIAASLIPFLENDDSNRVLMGCNMQRQAVPLLKAEAPFVGTGMESVVAKSSGAAIVAKR 731 (1386) Q Consensus 652 ~~~~~~~~~v~~~~i~p~~i~sv~aslIPflehdda~R~l~g~nm~rQav~l~~~~~~~v~tg~E~~~~~~s~~~i~a~~ 731 (1386) ...+|+|++|.+++|++||+|||++||||||++|||+|+|||++++..+.. T Consensus 114 -------~~~TH~EI~Ps~ilgv~AslIPF~~hNqsPRn~yq~~M~KQA~G~~~~n~~---------------------- 164 (599) T TIGR03670 114 -------PEHTHLEIDPSAILGIIASTIPYPEHNQSPRNTMGAAMAKQSLGLYAANYR---------------------- 164 (599) T ss_pred -------CCCEEEEECHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCC---------------------- T ss_conf -------774598757899766654356675678862456554312242364433586---------------------- Q ss_pred CCCCCCCCCCCEEEECCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCEECCCCCCCCCCCCCCCCC Q ss_conf 44343356553025215665564446630111465554454732244530447977207852035522357860222375 Q gi|254780143|r 732 AGIVEQVDAIRIVIRSVEGDLDPSTSGVDIYRLMKFQRSNQNTCVNQRPLVKVGDEVRRNDIIADGPSTDLGDLALGRNM 811 (1386) Q Consensus 732 ~g~v~~vd~~~i~i~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~q~p~V~~g~~~~~~~~l~~~~~~~~~el~~G~N~ 811 (1386) .+.++...+ ++++|+|+|++..+ ..+.++++|+|+|+ T Consensus 165 -------------------~R~Dt~~~~--------------L~yPQ~PlV~T~~~----------~~~~~~~~P~G~Na 201 (599) T TIGR03670 165 -------------------IRLDTRGHL--------------LHYPQKPLVKTRVL----------ELIGYDDRPAGQNF 201 (599) T ss_pred -------------------EEEECCCEE--------------EECCCCCEEEEEEE----------ECCCCCCCCCCEEE T ss_conf -------------------787245149--------------95288756997631----------31575558898127 Q ss_pred EEEEEECCCCCCCCCEECCHHHHHCCCCCEEEEEEEEHHHHHCCCCCCC-CCCCCCCC----CHHHHHHCCCCCCCCCCC Q ss_conf 1555313554444420000134425873103466653112114788400-24666546----867841024137413773 Q gi|254780143|r 812 LVAFMPWHGYNFEDSMLISERMVSEDVFTSIHIEEFEVMARDTKLGPEE-ITRDIPNV----SEEGLKNIDECGIICVGA 886 (1386) Q Consensus 812 ~VA~m~~~GYN~EDaiiin~~~v~rg~~~s~h~~~y~~~~~~~~~g~~~-~~~~~~~~----~~~~~~~ld~~Giv~~G~ 886 (1386) +||||||+|||||||||||+++++||+|||+||++|+++++++..|.++ +..+.+++ ++..|.+||+||++++|+ T Consensus 202 iVAImsYtGYNqEDAIIiNksSvdRGlfrs~~yrty~~ee~~~~~g~~e~~~~p~~~v~~~k~~~~y~~LD~dGii~~G~ 281 (599) T TIGR03670 202 VVAVMSYEGYNIEDALIMNKASIERGLARSTFFRTYEAEERRYPGGQEDRFEIPEPDVRGYRGEEAYKHLDEDGIVYPEV 281 (599) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHCCCCEEEEEEEEEEEEEECCCCCCEEEECCCCCCCCCCCHHHHHHCCCCCCCCCCC T ss_conf 99997977868889888877788708836999999999977257986127747986434677766675368578837997 Q ss_pred EECCCCCEEECCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 31367611101246777767703466543035555443233202278853220012101456542026678888999999 Q gi|254780143|r 887 EVNPGDILVGKITPKGESPMTPEEKLLRAIFGEKAVDVRDTSLRVPSGVSGTVVDVRIFNRHGIDKNERSISVEREQIEL 966 (1386) Q Consensus 887 ~V~~gDilvgk~tp~~~~~~~pe~~~l~~i~~~~~~~~~d~~~~~~~g~~g~v~~~~~~~r~~~~~~~~~~~~~~~~~~~ 966 (1386) +|.+|||+|||++|..... ....++......+|+|+++++++.|+|..|.+.. T Consensus 282 ~V~~gDvlIgK~~p~~~~~-------~~~~~~~~~~~~~d~S~~~k~~e~g~VD~V~i~~-------------------- 334 (599) T TIGR03670 282 EVKGGDVLIGKTSPPRFLE-------ELRELGLVTERRRDTSVTVRHGEKGIVDKVIITE-------------------- 334 (599) T ss_pred EECCCCEEEEEECCCCCCC-------CCCCCCCCCCEEEECCEECCCCCCCEEEEEEEEE-------------------- T ss_conf 8779998998635864221-------2211156443687663663699852146999984-------------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHCCCHHHHHEEEECCHHHHHHHHHHHHHHHHHH Q ss_conf 98728889998878889888886258534566555554321135764014854310014158678999999999899999 Q gi|254780143|r 967 LARDKDDEQVILDRNIYSRLMEILCGQNAVSGPKGFKKSTVLSSDLISEYPRSQWWQFAVQDEKVQRNVESLKVQYETSK 1046 (1386) Q Consensus 967 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1046 (1386) T Consensus 335 -------------------------------------------------------------------------------- 334 (599) T TIGR03670 335 -------------------------------------------------------------------------------- 334 (599) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCEEEEEECCCC Q ss_conf 99999989887631258867747276999999875588552314336678735888630000787935871569866898 Q gi|254780143|r 1047 SILEDRFKNKIEKIQWGDDMPPGVLRVVKVFVAMKRPIQSGDKMAGRHGNKGIVSRILPCEDMPFLKDGTPVDIVLNPLG 1126 (1386) Q Consensus 1047 ~~~~~~~~~k~~ki~~~~~~~~gv~~~vKV~ir~~R~p~iGDKfasRHGqKGVis~i~p~eDMPf~~dG~~pDIIlNPhg 1126 (1386) ..++.++|||++|+.|+|+||||||||||||||||+||||||||||+||++||||||||| T Consensus 335 --------------------~~~g~~~vkVriR~~R~P~iGDKFsSRHGQKGvig~i~~qeDMPFt~dGi~PDIIiNPHa 394 (599) T TIGR03670 335 --------------------TEEGNKLVKVRVRDLRIPELGDKFASRHGQKGVIGMIVPQEDMPFTEDGIVPDLIINPHA 394 (599) T ss_pred --------------------CCCCCEEEEEEEEEECCCCCHHHHCCCCCCCCEEEEECCHHHCCCCCCCCCCCEEECCCC T ss_conf --------------------478867999998102056532233123466633320126753997889998708989987 Q ss_pred CCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEE Q ss_conf 67507089999999999998719622433343222210246777877631543221000013525677766530268400 Q gi|254780143|r 1127 VPSRMNVGQIFETHLGWACVGLGKKIKSLINDYKANGDISPLRSFLEKVIGTGSHTEKISDYDDDSVLRVAEQWKSGVPV 1206 (1386) Q Consensus 1127 vPSRMtIGqllE~~lGka~~~~G~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 1206 (1386) ||||||||||+||++||||++.|..+| T Consensus 395 ~PSRMTIGqllE~l~gK~~~~~G~~~d----------------------------------------------------- 421 (599) T TIGR03670 395 IPSRMTVGQLLEMIAGKVAALEGRRVD----------------------------------------------------- 421 (599) T ss_pred CCCCCCHHHHHHHHHHHHHHHCCCCEE----------------------------------------------------- T ss_conf 876550899999998999996698422----------------------------------------------------- Q ss_pred CCCCCCCCCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCEEEEEEEEEECCCHHHHHCCCCCCCCCCCEEECCCCCC Q ss_conf 25655788999999999986868998699986898840268504884645410110211000023687311030799862 Q gi|254780143|r 1207 STPVFDGADEEAINSMLRMADLDESGQSILYDGLTGEPFDRPVTVGYIYMLKLNHMVSDKVYARSTGSYSLVTQQPLGGK 1286 (1386) Q Consensus 1207 aTP~F~g~~~~~i~~~L~~aG~~~~Gke~lydG~TGe~~~~~IfvG~~YyqkL~HMV~DKiHARstGP~sllTrQP~eGR 1286 (1386) ||| |++.+++++++.|+++||+++|+|+||||+||++|+++||+||+|||||+|||+||||||||||+++|||||++|| T Consensus 422 ~tp-F~~~~~~~~~~~L~~~g~~~~G~e~my~G~tG~~~~~~IfiG~~yYqRL~HmV~DK~~~R~~Gp~~~lTrQP~~GR 500 (599) T TIGR03670 422 GTP-FEGEPEEELRKELLKLGFKPDGKEVMYDGITGEKLEAEIFIGVIYYQKLHHMVADKIHARSRGPVQVLTRQPTEGR 500 (599) T ss_pred ECC-CCCCCHHHHHHHHHHHCCCCCCCEEEECCCCCCEECCEEEEEHHHHHHHHHHHCCCEEEECCCCCCCEECCCCCCC T ss_conf 478-9999999999999981999889988753878978401489866787756654101203322288755302889865 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCH-------HHHH-------HHHHHHHCCC--CCCCCCCCHHHHHH Q ss_conf 331784320789999999869899998611100110-------1599-------9988763878--78997866778999 Q gi|254780143|r 1287 SNRGGQRLGEMEVWCIQAYGAAYVLQEMLTIKSDDV-------VGRT-------RVYESIVAGN--DTFETGTPESFNVL 1350 (1386) Q Consensus 1287 sr~GGlRfGEMErwaL~AyGAa~~LqE~Lt~kSDdv-------~gr~-------~~~~~iv~g~--~~~~~~~pesf~vl 1350 (1386) ||+|||||||||||||+|||||++|||||+.+||.. ||.. +.+.|..++. ++.+..+|||||+| T Consensus 501 ~r~GGlR~GEMErd~liahGas~~l~erl~~~SD~~~~~vC~~CG~i~~~~~~~~~~~C~~C~~~~~i~~v~iPya~Kll 580 (599) T TIGR03670 501 AREGGLRFGEMERDVLIGHGAAMLLKERLLDESDKYVVYVCENCGHIAWEDKRKGTAYCPVCGETGDISPVEMSYAFKLL 580 (599) T ss_pred CCCCCCCEEHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCEEEECCCCCEECCCCCCCCCEEEEECCHHHHHH T ss_conf 56898013206789999876999999987425887168840267840354135797489888997826787678999999 Q ss_pred HHHHHHCCCCEEEEECCC Q ss_conf 999985420027640665 Q gi|254780143|r 1351 VKEMQALGLSIDLENSRT 1368 (1386) Q Consensus 1351 ~~El~~l~l~~~~~~~~~ 1368 (1386) ++||+|||+.++|..+++ T Consensus 581 ~qEL~sm~I~~rl~~ed~ 598 (599) T TIGR03670 581 LDELKSLGISPRLELGDK 598 (599) T ss_pred HHHHHHCCCCEEEEECCC T ss_conf 999987898338985577 No 11 >PRK07225 DNA-directed RNA polymerase subunit B'; Validated Probab=100.00 E-value=0 Score=1234.53 Aligned_cols=505 Identities=32% Similarity=0.507 Sum_probs=433.6 Q ss_pred CCCCCCCEEEEEECCCC----------------------CCCEEECCHHHHCCEEEECCCCEECCCCCCCCCCEEECCCC Q ss_conf 68764410698622553----------------------48166429677376379616534114684022200000023 Q gi|254780143|r 594 AYGFIETPYRKVCDGKV----------------------TNDVVYLSAMEEENRYIAQANSSLDEDGSFTEELVFCRCAG 651 (1386) Q Consensus 594 ~~g~ie~py~~v~~~~~----------------------~~~i~~l~~~~e~~~~Ia~~~~~l~~~~~~~~~~~~~r~~~ 651 (1386) +.|.+..|+..|.+++. .+-|||++++||++..||.....+. T Consensus 56 D~GR~~RPl~iv~n~k~~~~~~~~~~l~~~~~~~~dl~~~g~IEyiD~~Ee~~~~Ia~~~~~l~---------------- 119 (605) T PRK07225 56 DAGRARRPLIVVENGVPKLTEEHIEKLKNGELTFDDLVKQGVIEYLDAEEEENAYIALYEEDLT---------------- 119 (605) T ss_pred CCCEEEEEEEEEECCCCCCCHHHHHHHHCCCCCHHHHHHCCCEEEECHHHHCCEEEECCHHHCC---------------- T ss_conf 8982256699998994747799999987389898999755988997644500818971789958---------------- Q ss_pred CCCCCCHHHEEECCCCCCCEEEECCCCCCCHHHCCHHHHHHHHHHHHHHHCCCCCCCCEECCCCCHHHHHHCCCEEECCC Q ss_conf 33322578722023672110231123332011010022112223444321013665411126620111010653010212 Q gi|254780143|r 652 EEILVPREKIDFIDASPKQVVSIAASLIPFLENDDSNRVLMGCNMQRQAVPLLKAEAPFVGTGMESVVAKSSGAAIVAKR 731 (1386) Q Consensus 652 ~~~~~~~~~v~~~~i~p~~i~sv~aslIPflehdda~R~l~g~nm~rQav~l~~~~~~~v~tg~E~~~~~~s~~~i~a~~ 731 (1386) ...+|+|++|..++|++||+|||++||||||++|||+|+|||++++.++.. T Consensus 120 -------~~~TH~EI~P~~ilgv~AslIPF~~hNqsPRn~yqc~M~KQA~G~~~~N~~---------------------- 170 (605) T PRK07225 120 -------PEHTHLEIDPSLILGIGAGMIPYPEHNASPRNTMGAGMIKQALGLPAANYK---------------------- 170 (605) T ss_pred -------CCCEEEEECHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCC---------------------- T ss_conf -------774588636888655643156674678760667766533300276434565---------------------- Q ss_pred CCCCCCCCCCCEEEECCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCEECCCCCCCCCCCCCCCCC Q ss_conf 44343356553025215665564446630111465554454732244530447977207852035522357860222375 Q gi|254780143|r 732 AGIVEQVDAIRIVIRSVEGDLDPSTSGVDIYRLMKFQRSNQNTCVNQRPLVKVGDEVRRNDIIADGPSTDLGDLALGRNM 811 (1386) Q Consensus 732 ~g~v~~vd~~~i~i~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~q~p~V~~g~~~~~~~~l~~~~~~~~~el~~G~N~ 811 (1386) .+.++... .+.++|+|+|++... ..+.++++|+|+|+ T Consensus 171 -------------------~R~Dt~~~--------------~L~YPQkPLV~T~~~----------~~~~~d~~P~G~Na 207 (605) T PRK07225 171 -------------------LRPDTRGH--------------LLHYPQVPLVRTQTQ----------EIIGFDDRPAGQNF 207 (605) T ss_pred -------------------EEECCCEE--------------EEECCCCCEEEEEEC----------HHCCCCCCCCCEEE T ss_conf -------------------66224015--------------996177766887400----------10254447897127 Q ss_pred EEEEEECCCCCCCCCEECCHHHHHCCCCCEEEEEEEEHHHHHCCCCCCCCCCCCCCC------CHHHHHHCCCCCCCCCC Q ss_conf 155531355444442000013442587310346665311211478840024666546------86784102413741377 Q gi|254780143|r 812 LVAFMPWHGYNFEDSMLISERMVSEDVFTSIHIEEFEVMARDTKLGPEEITRDIPNV------SEEGLKNIDECGIICVG 885 (1386) Q Consensus 812 ~VA~m~~~GYN~EDaiiin~~~v~rg~~~s~h~~~y~~~~~~~~~g~~~~~~~~~~~------~~~~~~~ld~~Giv~~G 885 (1386) +||||||+|||||||||||++|++||+|||+|+++|+++++.+..|.++.+ .+|+. .+..|.+||+||++++| T Consensus 208 iVAVmsYtGYNqEDAIIiNKsSidRGlfrs~~ykty~~eek~~~~g~~e~~-~~P~~~~~~~~~~~~y~~LD~dGii~~g 286 (605) T PRK07225 208 VVAVMSYEGYNIEDALIMNKASIERGLGRSHFFRTYEGEERRYPGGQEDKF-EIPEKEVRGYRGEEAYRHLDDDGLVNPE 286 (605) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHCCCEEEEEEEEEEEEEEECCCCCEEEE-ECCCCCCCCCCCHHHHHHCCCCCCCCCC T ss_conf 999978778688888887666675187179999999999750689850486-1798311367766678537878873899 Q ss_pred CEECCCCCEEECCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 33136761110124677776770346654303555544323320227885322001210145654202667888899999 Q gi|254780143|r 886 AEVNPGDILVGKITPKGESPMTPEEKLLRAIFGEKAVDVRDTSLRVPSGVSGTVVDVRIFNRHGIDKNERSISVEREQIE 965 (1386) Q Consensus 886 ~~V~~gDilvgk~tp~~~~~~~pe~~~l~~i~~~~~~~~~d~~~~~~~g~~g~v~~~~~~~r~~~~~~~~~~~~~~~~~~ 965 (1386) ++|.+|||||||++|+...+.. ..++......+|+|+.+++++.|+|..|.+.. T Consensus 287 ~~v~~gDVliGK~~P~~~~~~~-------~~~~~~~~~~~d~S~~~k~~e~g~VD~V~~~~------------------- 340 (605) T PRK07225 287 TEVKSGDVLIGKTSPPRFLEEP-------EDFGISVEQRRETSVTMRSGEEGIVDTVILTE------------------- 340 (605) T ss_pred CEECCCCEEEEECCCCCCCCCC-------CCCCCCCCEEEEEEEECCCCCCCEEEEEEEEE------------------- T ss_conf 8978998899951587544454-------33255453576621871589853465999984------------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHCCCHHHHHEEEECCHHHHHHHHHHHHHHHHH Q ss_conf 99872888999887888988888625853456655555432113576401485431001415867899999999989999 Q gi|254780143|r 966 LLARDKDDEQVILDRNIYSRLMEILCGQNAVSGPKGFKKSTVLSSDLISEYPRSQWWQFAVQDEKVQRNVESLKVQYETS 1045 (1386) Q Consensus 966 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1045 (1386) T Consensus 341 -------------------------------------------------------------------------------- 340 (605) T PRK07225 341 -------------------------------------------------------------------------------- 340 (605) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCEEEEEECCC Q ss_conf 99999998988763125886774727699999987558855231433667873588863000078793587156986689 Q gi|254780143|r 1046 KSILEDRFKNKIEKIQWGDDMPPGVLRVVKVFVAMKRPIQSGDKMAGRHGNKGIVSRILPCEDMPFLKDGTPVDIVLNPL 1125 (1386) Q Consensus 1046 ~~~~~~~~~~k~~ki~~~~~~~~gv~~~vKV~ir~~R~p~iGDKfasRHGqKGVis~i~p~eDMPf~~dG~~pDIIlNPh 1125 (1386) ..++.++|||++|+.|+|+||||||||||||||||+||||||||||+||++|||||||| T Consensus 341 ---------------------~~~g~~~vKVriR~~R~P~iGDKFsSRHGQKGvig~i~~~eDMPFt~dGi~PDiIiNPH 399 (605) T PRK07225 341 ---------------------TEEGSRLAKVRVRDLRIPELGDKFASRHGQKGVIGLIVPQEDMPFTESGVVPDLIINPH 399 (605) T ss_pred ---------------------CCCCCEEEEEEECHHCCCCCHHHHCCCCCCCCEEECCCCCCCCCCCCCCCCCCEEECCC T ss_conf ---------------------47787899999802117543002200246763121102540399388999871888988 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCE Q ss_conf 86750708999999999999871962243334322221024677787763154322100001352567776653026840 Q gi|254780143|r 1126 GVPSRMNVGQIFETHLGWACVGLGKKIKSLINDYKANGDISPLRSFLEKVIGTGSHTEKISDYDDDSVLRVAEQWKSGVP 1205 (1386) Q Consensus 1126 gvPSRMtIGqllE~~lGka~~~~G~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 1205 (1386) |||||||||||+||++||||++.|..+| T Consensus 400 ~~PSRMTIGqllE~l~gK~~~~~G~~~d---------------------------------------------------- 427 (605) T PRK07225 400 AIPSRMTIGHVLEMIGGKVGSLEGRRVD---------------------------------------------------- 427 (605) T ss_pred CCCCCCCHHHHHHHHHHHHHHHCCCEEE---------------------------------------------------- T ss_conf 6876560899999987788874496353---------------------------------------------------- Q ss_pred ECCCCCCCCCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCEEEEEEEEEECCCHHHHHCCCCCCCCCCCEEECCCCC Q ss_conf 02565578899999999998686899869998689884026850488464541011021100002368731103079986 Q gi|254780143|r 1206 VSTPVFDGADEEAINSMLRMADLDESGQSILYDGLTGEPFDRPVTVGYIYMLKLNHMVSDKVYARSTGSYSLVTQQPLGG 1285 (1386) Q Consensus 1206 ~aTP~F~g~~~~~i~~~L~~aG~~~~Gke~lydG~TGe~~~~~IfvG~~YyqkL~HMV~DKiHARstGP~sllTrQP~eG 1285 (1386) ||| |.+.+++++++.|+++||+++|+|+||||+||++|+++||+||+|||||+|||+||||||||||+++|||||++| T Consensus 428 -~tp-F~~~~~~~~~~~L~~~g~~~~G~e~ly~G~tG~~~~~~If~G~~yYqRLkHmV~DK~~~R~~Gp~~~lTrQP~~G 505 (605) T PRK07225 428 -GTA-FSGEPEEDLRESLKKLGFEHTGKEVMYDGITGEKIEADIFVGVIYYQKLHHMVASKMHARSRGPVQVLTRQPTEG 505 (605) T ss_pred -ECC-CCCCCHHHHHHHHHHHCCCCCCCEEEECCCCCCEECCEEEEEHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCC T ss_conf -078-899888999999998389989988866587897702248998578663442220042675007874530388986 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCC-------HHHHHH-------HHHHHHCCC--CCCCCCCCHHHHH Q ss_conf 233178432078999999986989999861110011-------015999-------988763878--7899786677899 Q gi|254780143|r 1286 KSNRGGQRLGEMEVWCIQAYGAAYVLQEMLTIKSDD-------VVGRTR-------VYESIVAGN--DTFETGTPESFNV 1349 (1386) Q Consensus 1286 Rsr~GGlRfGEMErwaL~AyGAa~~LqE~Lt~kSDd-------v~gr~~-------~~~~iv~g~--~~~~~~~pesf~v 1349 (1386) |||+|||||||||||||+|||||++|+|||+.+||. -||... .+.|..++. ++.+..+|||||+ T Consensus 506 R~r~GGlR~GEMErd~liahGas~~l~erl~~~SD~~~~~vC~~CG~i~~~~~~~~~~~C~~C~~~~~i~~v~iPYa~Kl 585 (605) T PRK07225 506 RAREGGLRFGEMERDVLIGHGAAMLLKERLLDESDKVEIYVCAKCGMIAIYDKKRNMKYCPICGEETDIYPVEMSYAFKL 585 (605) T ss_pred CCCCCCCCEEHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCEEEEECCCCCEECCCCCCCCCEEEEECCHHHHH T ss_conf 45689801420578999987699999998742588846873105797024303579748989899883568537899999 Q ss_pred HHHHHHHCCCCEEEEECCC Q ss_conf 9999985420027640665 Q gi|254780143|r 1350 LVKEMQALGLSIDLENSRT 1368 (1386) Q Consensus 1350 l~~El~~l~l~~~~~~~~~ 1368 (1386) |++||+|||+.++|..++. T Consensus 586 l~qEL~sm~I~~rl~~~d~ 604 (605) T PRK07225 586 LLDELKSLGIAPRLELEDK 604 (605) T ss_pred HHHHHHHCCCEEEEEECCC T ss_conf 9999987898048984268 No 12 >KOG0216 consensus Probab=100.00 E-value=0 Score=1158.13 Aligned_cols=929 Identities=23% Similarity=0.342 Sum_probs=634.5 Q ss_pred CCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHH----CCCCCCCCCEEEEEEEEEECCCCC----------- Q ss_conf 66788708899999999986246652211122589997663----780767985899999789809858----------- Q gi|254780143|r 22 EIIDIPDLIEVQKASYDHFLMMNIAPDERPNEGLQAAFKSV----FPITAFSGAAMLEFVSYEFDPPKF----------- 86 (1386) Q Consensus 22 ~~~~~P~Li~iQ~~Sf~~Flq~~~~~~~r~~~GL~~v~~~~----fPI~d~~~~~~Lef~~y~l~~Pk~----------- 86 (1386) .--++++.++-.++|||.-. +.||..+-.+. +|- -..++.+..+.++.+.+|.. T Consensus 11 ~fp~l~~a~~pHi~sfnal~----------~~gll~~~v~~~~ek~~~-~~g~kis~~ve~i~iakP~l~~~~~ss~~r~ 79 (1111) T KOG0216 11 SFPELQDAASPHIDSFNALT----------NGGLLNAGVAGIAEKVPL-KAGDKISMKVESIQIAKPMLSDKVHSSDTRK 79 (1111) T ss_pred CCHHHHHHHHHHHCCCCCHH----------HHHHHHHHHHCHHHHCCC-CCCCCEEEEEEEEEECCCCCCCCCCCCCCCC T ss_conf 43114656301011453300----------205788775022432354-5688236898777853887786641101220 Q ss_pred -CHHHHHHCCCCEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCEECCC------------------------ Q ss_conf -8999998399753358999999993178766552000122367875100002689------------------------ Q gi|254780143|r 87 -DVDDCLWRDLTYAVPLKITLRLIVFDVDEFTGAKSIKDIKEQSIYMGDLPLMTKD------------------------ 141 (1386) Q Consensus 87 -tp~ECRlR~lTYsapL~V~i~l~v~~~~~~~~~k~~~~ike~~V~lG~IPiMt~~------------------------ 141 (1386) -|.|||+|++||+..+.|++.|.+++.... + ++..+|.+|||+.| T Consensus 80 lyPaEcRqR~~TY~Gkl~v~v~wsVNg~~~~--------~--e~~dlG~vPIMlrSklChL~g~sp~eLV~hkEe~~EmG 149 (1111) T KOG0216 80 LYPAECRQRGLTYKGKLVVRVSWSVNGGHVV--------I--EKRDLGHVPIMLRSKLCHLNGASPKELVKHKEESSEMG 149 (1111) T ss_pred CCCHHHHHCCCEECCEEEEEEEEEECCEEEE--------E--EEEECCCCCEEEECCCCCCCCCCHHHHHHCCCCHHHCC T ss_conf 0556676456603004999999998885532--------3--55434766568761535668989589863567654418 Q ss_pred CCEEECCEEEEE---EEEECCCCCEEECCCCCCCCCCCCEEEEEEEE------CCCCCE---EEEEECCCCEEEEEEC-C Q ss_conf 628986821468---66512278521202346515778567999981------188723---6899758982999971-8 Q gi|254780143|r 142 GTFVIKGIQRIV---VSQLHRSPGIHFDHDKGRASLSGKLLYACRII------PDQGLW---MDIEFDSKDIIHVRID-R 208 (1386) Q Consensus 142 GyFIING~ERVI---VsQl~RSPGVyf~~~k~k~~~s~k~~ysa~II------P~RGSw---Le~e~d~kd~iyvrId-r 208 (1386) |||||||+|||| |+| -|.--+.+....-|. ....||-.-+ ++.-|- |.|- .++..-+++- | T Consensus 150 GYFIvNGIEKvIRmLIm~-RRNHPiAiIR~Sfk~---RG~~yS~yGVsmRcVReDqSsvtn~LHYL--nnGtv~~~F~~r 223 (1111) T KOG0216 150 GYFIVNGIEKVIRMLIMP-RRNHPIAIIRPSFKD---RGSSYSDYGVSMRCVREDQSSVTNVLHYL--NNGTVMFRFSHR 223 (1111) T ss_pred CEEEECCHHHHHHHHEEC-CCCCCEEEECHHHHH---CCCCCCCCCEEEEEECCCCCCCEEEEEEE--CCCCEEEEEEEE T ss_conf 589981747736320103-578965784242553---57772102369988556511112478994--388189999985 Q ss_pred CC--CHHHHHHHHHCCCCHH-HHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEECCCCHHHHHHH Q ss_conf 78--7039989988099847-99997387405773067310235456664210122321124458454402210207998 Q gi|254780143|r 209 RR--KVPVTSFLMALGMDSE-EILSTFYPKIVYSQRGDFWCFPLSAADLMVGAKVSSSLVDIDTGEQVIESGKKLTSGLL 285 (1386) Q Consensus 209 ~r--KIPi~ilLrALG~ssd-eIl~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 285 (1386) ++ -+|+..+||||-=++| ||.+.+... ...+. .+..++. T Consensus 224 K~eylvPv~lILKAL~~tsDeEIf~~lv~g----~~gdq----------------------------------fl~~rv~ 265 (1111) T KOG0216 224 KREYLVPVVLILKALTNTTDEEIFEGLVGG----DEGDQ----------------------------------FLTSRVE 265 (1111) T ss_pred EEEEHHHHHHHHHHHHCCCHHHHHHHHHCC----CCCCE----------------------------------EHHHHHH T ss_conf 033113499999998645789999987178----65762----------------------------------0899999 Q ss_pred HHHHHCCCCCCCCCHHHHCCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHCC Q ss_conf 87876360001026677347410000136667769996234599899999986565410244202444552000001102 Q gi|254780143|r 286 KSLKEKGVKFLGITSDCLCGLYVAEDIVNGETGEIYIEAGDVIDEKSLEEIFHSEIRDIPILYVDSVNNNAYIRNTLVTD 365 (1386) Q Consensus 286 ~~~~~~~~~~~~v~~~~l~~~~~~~~~~d~~~gei~~~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~i~~~~~~d 365 (1386) ..+. .+... T Consensus 266 ~mLr-----------------------------------------------------------------------~v~ee 274 (1111) T KOG0216 266 LMLR-----------------------------------------------------------------------EVQEE 274 (1111) T ss_pred HHHH-----------------------------------------------------------------------HHHHC T ss_conf 9999-----------------------------------------------------------------------97644 Q ss_pred CCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEECHHHHHHHHHHHHH Q ss_conf 34588999999887605776310456888862024530233345555777654203667767706618998888988876 Q gi|254780143|r 366 KNKDRKDALLDIYRVMRPGDVSTFSVAESMFNFLFFDSDKYDLSTVGRVKMNMRLNLDTPDDVRHIRKEDIIAIIKILVD 445 (1386) Q Consensus 366 ~~~~~~eAl~~I~k~lr~~~~~~~~~~~~~~~~~~~~~~~y~l~~vgr~~~n~~l~~~~~~~~~~Lt~~d~~~~i~~L~~ 445 (1386) ...++++++.+|..+.|.- ..+-++ +.-..+|+..++...-........ .-..++.|+++|.. T Consensus 275 ~l~trtqcl~yLGs~FRaV--------------l~~v~~-~~D~evg~fILr~~VlvHLd~~~D--KF~lLifmiqKL~s 337 (1111) T KOG0216 275 NLFTRTQCLQYLGSRFRAV--------------LDFVPE-QPDLEVGRFILREYVLVHLDSDED--KFNLLIFMIQKLYS 337 (1111) T ss_pred CCCCHHHHHHHHHHHHHHH--------------HCCCCC-CCHHHHHHHHHHHHEEEEECCCHH--HHHHHHHHHHHHHH T ss_conf 6403999999986553545--------------167888-852788999987426887366178--78889999999999 Q ss_pred HHCCCCCCCCCHHCCCCCCCCHHHHHHHHHHHHHHHHHH-------HHHHHHCCCCCC----CCCCCCCCCHHHHHHHHH Q ss_conf 304876434401015332333357878988888899998-------876531134433----435211222023465554 Q gi|254780143|r 446 LRNGKGTIDDIDNLGNRRVRSVGEMLKNQYRLGLLRMER-------SIKERISSVDID----SVMPQDLINAKPVVSAVC 514 (1386) Q Consensus 446 l~~g~~~~DdiDhlgnkRvr~vgeLl~~~fr~~l~rl~r-------~i~~~~~~~~~~----~~~~~~~in~~~i~~~i~ 514 (1386) +-.|...+|+.|...|..+.+.|.|.....+.-+....+ ...+++...... .++..-.-++..|...+. T Consensus 338 lv~ge~~~dnpDs~q~qEil~~ghl~~~~LkEriEe~l~~~~~~vr~~l~~~~~~~~~~~~~~i~~~~~r~~~~Ig~~me 417 (1111) T KOG0216 338 LVDGECTPDNPDSPQNQEILLPGHLYGSILKERIEEWLRFISAQVRRDLYKLGFKEDFLDISAIRKVFMRTNGNIGRKME 417 (1111) T ss_pred HHCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHH T ss_conf 85587578998973122204461159999999999999999999999998616783067699999998535630778899 Q ss_pred HHCCCCCC--------------EEECCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCC Q ss_conf 20266750--------------3541554210232200112234444432234665543221111044430356676521 Q gi|254780143|r 515 EFFCSSQL--------------SQLEEHVNSLSRITHTRRLSALGQGGVARARAGVEMRDVHPTHYGRICPAETSEGHNI 580 (1386) Q Consensus 515 ~ff~t~~l--------------sq~ld~~n~ls~lth~RR~s~lgpggl~r~~~~~evR~ih~s~~GriCPieTPEG~n~ 580 (1386) .|++|++| +-..++.|.+..++|.| +++.|.+..+.+...||.|.|..||++||++||+|..| T Consensus 418 ~fLsTGnl~s~sgldLqQ~sG~tv~AEriNf~RflShFR---avhRGa~f~~mrTTtvRKLlPEsWGFlCPVHTPDG~PC 494 (1111) T KOG0216 418 YFLSTGNLVSRSGLDLQQTSGYTVVAERINFYRFLSHFR---AVHRGAFFAEMRTTTVRKLLPESWGFLCPVHTPDGAPC 494 (1111) T ss_pred HHHHCCCEEECCCHHHHHHCCCEEEHHHHHHHHHHHHHH---HHHCCCCHHHEEEHHHHHHCCHHHCCCCCCCCCCCCCH T ss_conf 988508732122155676529088654456899998888---77424530110204257638032021154518899841 Q ss_pred EEEECEEEEEEECCCC----C-------------------CCCEEEEEECCCCCCCEEECCHHH----------H----- Q ss_conf 0210001224424687----6-------------------441069862255348166429677----------3----- Q gi|254780143|r 581 GLVSSLTSFARVNAYG----F-------------------IETPYRKVCDGKVTNDVVYLSAME----------E----- 622 (1386) Q Consensus 581 GLv~~la~~a~in~~g----~-------------------ie~py~~v~~~~~~~~i~~l~~~~----------e----- 622 (1386) ||.+++|..|+|...- + -+..|-...+|++.+-+.+--+.. + T Consensus 495 GlLnH~t~~c~I~t~~dd~s~ip~~L~~~Gm~p~~~~~~~G~~l~~VlldGk~vG~~s~~~a~~i~~~lR~~Kv~~p~~i 574 (1111) T KOG0216 495 GLLNHMTRTCRIVTRPDDVSFIPSILFELGMVPSSHLVEAGEPLYPVLLDGKVVGWVSSPLAKKIVDYLRYYKVEAPAVI 574 (1111) T ss_pred HHHHHHHHHEEEEECCCCHHHHHHHHHHCCCCCHHCCCCCCCCEEEEEECCEEEEEECCHHHHHHHHHHHHHHHCCCCCC T ss_conf 67776455256760576211206899854777501024679841579977878876351777899999998752276558 Q ss_pred -CCEEEECCCCEECCCCCCCCCCEE------------ECCC----------------CCCCCCCHHHEEECCCCCCCEEE Q ss_conf -763796165341146840222000------------0002----------------33332257872202367211023 Q gi|254780143|r 623 -ENRYIAQANSSLDEDGSFTEELVF------------CRCA----------------GEEILVPREKIDFIDASPKQVVS 673 (1386) Q Consensus 623 -~~~~Ia~~~~~l~~~~~~~~~~~~------------~r~~----------------~~~~~~~~~~v~~~~i~p~~i~s 673 (1386) ...-|+- ++...++.....++. .... .......+...+|.+++|..++| T Consensus 575 P~~leIg~--Vp~s~~gqyPglyi~s~~aR~~RpV~nl~~~~~e~iG~fEQvym~i~~~~~e~~~~v~th~E~~pt~~ls 652 (1111) T KOG0216 575 PNDLEIGY--VPTSTNGQYPGLYIFSGPARMIRPVKNLSLDSVEWIGPFEQVYMNIAIDPKEIFPDVTTHVELHPTGILS 652 (1111) T ss_pred CCCCEEEE--EECCCCCCCCEEEEECCCHHHHHEHHCCCCCCHHHCCCHHHHHHHHCCCHHHCCCCCCEEEEECCCCHHH T ss_conf 87535866--6267899786369833716650110036557243204398763120025000279851157863410576 Q ss_pred ECCCCCCCHHHCCHHHHHHHHHHHHHHHCCCCCCCCEECCCCCHHHHHHCCCEEECCCCCCCCCCCCCCEEEECCCCCCC Q ss_conf 11233320110100221122234443210136654111266201110106530102124434335655302521566556 Q gi|254780143|r 674 IAASLIPFLENDDSNRVLMGCNMQRQAVPLLKAEAPFVGTGMESVVAKSSGAAIVAKRAGIVEQVDAIRIVIRSVEGDLD 753 (1386) Q Consensus 674 v~aslIPflehdda~R~l~g~nm~rQav~l~~~~~~~v~tg~E~~~~~~s~~~i~a~~~g~v~~vd~~~i~i~~~~~~~~ 753 (1386) +.|.+|||.+|||+||++|||+|.||+|+.-...-+ .+ T Consensus 653 ~~anliP~sD~NQSPRNmYQCQMgKQtmg~p~~a~~-----------------------------------------~r- 690 (1111) T KOG0216 653 IVANLIPFSDHNQSPRNMYQCQMGKQTMGTPGHALR-----------------------------------------TR- 690 (1111) T ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCH-----------------------------------------HC- T ss_conf 862574464458882067776520121278511021-----------------------------------------02- Q ss_pred CCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEECCHHH Q ss_conf 44466301114655544547322445304479772078520355223578602223751555313554444420000134 Q gi|254780143|r 754 PSTSGVDIYRLMKFQRSNQNTCVNQRPLVKVGDEVRRNDIIADGPSTDLGDLALGRNMLVAFMPWHGYNFEDSMLISERM 833 (1386) Q Consensus 754 ~~~~~~~~y~~~~~~~~~~~~~~~q~p~V~~g~~~~~~~~l~~~~~~~~~el~~G~N~~VA~m~~~GYN~EDaiiin~~~ 833 (1386) ..-..|.+ ..+|.|+|+...+- ....+++|.|.||+||+++|+||++|||+||||++ T Consensus 691 ---adnklYrl----------qt~qsP~vr~~~y~----------~y~~d~yp~GtNaiVAVisyTgyDMeDAmiiNK~s 747 (1111) T KOG0216 691 ---ADNKLYRL----------QTPQSPIVRPELYD----------TYGMDDYPNGTNAIVAVISYTGYDMEDAMIINKSS 747 (1111) T ss_pred ---CCCCEEEE----------CCCCCCEEECCCCC----------CCCCCCCCCCCCEEEEEEEECCCCHHHHHHHCHHH T ss_conf ---56704882----------27998532011134----------43545588875239999951466704544203105 Q ss_pred HHCCCCCEEEEEEEEHHHHHCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCEECCCCCEEECCCCCCCCCCCHHHHHH Q ss_conf 42587310346665311211478840024666546867841024137413773313676111012467777677034665 Q gi|254780143|r 834 VSEDVFTSIHIEEFEVMARDTKLGPEEITRDIPNVSEEGLKNIDECGIICVGAEVNPGDILVGKITPKGESPMTPEEKLL 913 (1386) Q Consensus 834 v~rg~~~s~h~~~y~~~~~~~~~g~~~~~~~~~~~~~~~~~~ld~~Giv~~G~~V~~gDilvgk~tp~~~~~~~pe~~~l 913 (1386) .+||++.-..|++-....-+... ....+.-.|+..+ +.+||.||++.+|+++..||.+....--. T Consensus 748 ~eRGf~~G~vykte~i~L~~~~~-r~~~F~~~p~~~~--~~~ld~dgLP~~G~kl~~~dp~~~y~d~~------------ 812 (1111) T KOG0216 748 YERGFAYGTVYKTEKIDLSKKRS-RSKHFGRSPGEPE--LKKLDADGLPSIGQKLEYGDPYYAYFDEE------------ 812 (1111) T ss_pred HHCCCCCEEEEEEEEECHHHCCC-CEEEEEECCCCCC--CCCCCCCCCCCCCCCCCCCCCEEEEECCC------------ T ss_conf 65064011688643533221321-0034433799842--01358888987752345799479997266------------ Q ss_pred HHHCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 43035555443233202278853220012101456542026678888999999987288899988788898888862585 Q gi|254780143|r 914 RAIFGEKAVDVRDTSLRVPSGVSGTVVDVRIFNRHGIDKNERSISVEREQIELLARDKDDEQVILDRNIYSRLMEILCGQ 993 (1386) Q Consensus 914 ~~i~~~~~~~~~d~~~~~~~g~~g~v~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 993 (1386) .. +.-..+....+.+.|..++++. T Consensus 813 -------t~--~~~~~~~~~~ep~~vd~vr~~~----------------------------------------------- 836 (1111) T KOG0216 813 -------TG--KTRIKKYHGTEPGIVDEVRVLG----------------------------------------------- 836 (1111) T ss_pred -------CC--CEEEEEECCCCCEEEEEEEECC----------------------------------------------- T ss_conf -------77--2678885278985667878704----------------------------------------------- Q ss_pred CCCCCCCCCCCCCCCCHHHHHCCCHHHHHEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEE Q ss_conf 34566555554321135764014854310014158678999999999899999999999898876312588677472769 Q gi|254780143|r 994 NAVSGPKGFKKSTVLSSDLISEYPRSQWWQFAVQDEKVQRNVESLKVQYETSKSILEDRFKNKIEKIQWGDDMPPGVLRV 1073 (1386) Q Consensus 994 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~ki~~~~~~~~gv~~~ 1073 (1386) .+..+++ +. T Consensus 837 ---------------------------------------------------------------------~~~~~~~--k~ 845 (1111) T KOG0216 837 ---------------------------------------------------------------------NDMGDQE--KC 845 (1111) T ss_pred ---------------------------------------------------------------------CCCCCCC--CE T ss_conf ---------------------------------------------------------------------6678731--06 Q ss_pred EEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCC Q ss_conf 99999875588552314336678735888630000787935871569866898675070899999999999987196224 Q gi|254780143|r 1074 VKVFVAMKRPIQSGDKMAGRHGNKGIVSRILPCEDMPFLKDGTPVDIVLNPLGVPSRMNVGQIFETHLGWACVGLGKKIK 1153 (1386) Q Consensus 1074 vKV~ir~~R~p~iGDKfasRHGqKGVis~i~p~eDMPf~~dG~~pDIIlNPhgvPSRMtIGqllE~~lGka~~~~G~~~~ 1153 (1386) +-|.+|..|.|.||||||||||||||||+.||.+||||||.|++||||+||||||||||||+++|+++||||+++|..+| T Consensus 846 ~~i~~Ri~R~p~IGDKFsSRhGQKGicS~~wP~~dmPFtesGm~PDii~NPH~FPSRMTIgM~iEs~AgK~~alhG~~~D 925 (1111) T KOG0216 846 ATITLRIPRNPIIGDKFSSRHGQKGICSQKWPTIDMPFTESGMVPDIIINPHAFPSRMTIGMLIESMAGKAGALHGNAQD 925 (1111) T ss_pred EEEEEEECCCCCCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCHHHHHHHHHCHHHCCCCCCCC T ss_conf 89999704798533211232366540225477777785535767656667877842004999999873312104554345 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEECCCCCCCCCHHHHHHHHHHCCCCCCCC Q ss_conf 33343222210246777877631543221000013525677766530268400256557889999999999868689986 Q gi|254780143|r 1154 SLINDYKANGDISPLRSFLEKVIGTGSHTEKISDYDDDSVLRVAEQWKSGVPVSTPVFDGADEEAINSMLRMADLDESGQ 1233 (1386) Q Consensus 1154 ~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~aTP~F~g~~~~~i~~~L~~aG~~~~Gk 1233 (1386) +|||.+.+++. ..++++++|.+|||||+|+ T Consensus 926 atpf~~~E~~t--------------------------------------------------~~dyfg~~L~~~GyNyyGn 955 (1111) T KOG0216 926 ATPFIFSEENT--------------------------------------------------AIDYFGEMLKKAGYNYYGN 955 (1111) T ss_pred CCCEEECCCCC--------------------------------------------------HHHHHHHHHHHCCCCCCCC T ss_conf 88334357663--------------------------------------------------8899999998727674478 Q ss_pred EEEECCCCCCCCCCCEEEEEEEEEECCCHHHHHCCCCCCCCCCCEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 99986898840268504884645410110211000023687311030799862331784320789999999869899998 Q gi|254780143|r 1234 SILYDGLTGEPFDRPVTVGYIYMLKLNHMVSDKVYARSTGSYSLVTQQPLGGKSNRGGQRLGEMEVWCIQAYGAAYVLQE 1313 (1386) Q Consensus 1234 e~lydG~TGe~~~~~IfvG~~YyqkL~HMV~DKiHARstGP~sllTrQP~eGRsr~GGlRfGEMErwaL~AyGAa~~LqE 1313 (1386) |.||+|.+|+.|+++||+|++|||||+|||.||++.|||||+..+|+||++||+|.||+||||||||||+||||||+||+ T Consensus 956 E~~YSGv~G~e~~adIf~GvVyYQRLrHMv~DKfQVRstG~v~~~T~QPvkGRkr~GGiRfGEMERDali~HGtsfllqD 1035 (1111) T KOG0216 956 EPMYSGVDGREMRADIFFGVVYYQRLRHMVSDKFQVRSTGPVDSLTHQPVKGRKRGGGIRFGEMERDALIAHGTSFLLQD 1035 (1111) T ss_pred CCCCCCCCCCEEEEEEEEEEHHHHHHHHHHCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCHHHHHHH T ss_conf 54304656533344489864228888987424026650267441234876676057872101022666765452763265 Q ss_pred HHHCCCC----CHHHHHH----HHHHHHC---------------CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEE Q ss_conf 6111001----1015999----9887638---------------7878997866778999999985420027640 Q gi|254780143|r 1314 MLTIKSD----DVVGRTR----VYESIVA---------------GNDTFETGTPESFNVLVKEMQALGLSIDLEN 1365 (1386) Q Consensus 1314 ~Lt~kSD----dv~gr~~----~~~~iv~---------------g~~~~~~~~pesf~vl~~El~~l~l~~~~~~ 1365 (1386) ||.-.|| +||++-. +-..+++ |+.+.-..+|+.||-|.-||-|+++.+++.. T Consensus 1036 RL~~~SD~~~a~vC~~cgsil~~~~~l~~~~~~~~~~~~C~~c~~~~~~~v~~P~vfkYL~aEL~amnIk~~l~l 1110 (1111) T KOG0216 1036 RLLNSSDYTVAEVCRTCGSILSTQQKLIKEPGGSTSTVTCRSCDGKGVTTVAMPYVFKYLTAELAAMNIKMRLDL 1110 (1111) T ss_pred HHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCEEEEECCHHHHHHHHHHHHCCEEEEECC T ss_conf 551676236999987766564066554115678888626884488734799711899999999862751599615 No 13 >pfam00562 RNA_pol_Rpb2_6 RNA polymerase Rpb2, domain 6. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This domain represents the hybrid binding domain and the wall domain. The hybrid binding domain binds the nascent RNA strand / template DNA strand in the Pol II transcription elongation complex. This domain contains the important structural motifs, switch 3 and the flap loop and binds an active site metal ion. This domain is also involved in binding to Rpb1 and Rpb3. Many of the bacterial members contain large insertions within this domain, as region known as dispensable region 2 (DRII). Probab=100.00 E-value=0 Score=958.06 Aligned_cols=369 Identities=57% Similarity=0.903 Sum_probs=335.4 Q ss_pred EECCCCCCCHHHCCHHHHHHHHHHHHHHHCCCCCCCCEECCCCCHHHHHHCCCEEECCCCCCCCCCCCCCEEEECCCCCC Q ss_conf 31123332011010022112223444321013665411126620111010653010212443433565530252156655 Q gi|254780143|r 673 SIAASLIPFLENDDSNRVLMGCNMQRQAVPLLKAEAPFVGTGMESVVAKSSGAAIVAKRAGIVEQVDAIRIVIRSVEGDL 752 (1386) Q Consensus 673 sv~aslIPflehdda~R~l~g~nm~rQav~l~~~~~~~v~tg~E~~~~~~s~~~i~a~~~g~v~~vd~~~i~i~~~~~~~ 752 (1386) +++||+|||++|||++|++|||+|+|||+++ T Consensus 1 gi~As~iPf~~hNqspRn~yq~~m~KQaig~------------------------------------------------- 31 (373) T pfam00562 1 GIVASLIPFVDHNQSPRITYQCAMGKQAIGI------------------------------------------------- 31 (373) T ss_pred CCEEEECCCCCCCCHHHHHHHHHHHHCCCCH------------------------------------------------- T ss_conf 9411301487578206999987663114372------------------------------------------------- Q ss_pred CCCCCCCEEECCCCCCCCCCCC---CCCCCCCCCCCCEEECCCEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEEC Q ss_conf 6444663011146555445473---2244530447977207852035522357860222375155531355444442000 Q gi|254780143|r 753 DPSTSGVDIYRLMKFQRSNQNT---CVNQRPLVKVGDEVRRNDIIADGPSTDLGDLALGRNMLVAFMPWHGYNFEDSMLI 829 (1386) Q Consensus 753 ~~~~~~~~~y~~~~~~~~~~~~---~~~q~p~V~~g~~~~~~~~l~~~~~~~~~el~~G~N~~VA~m~~~GYN~EDaiii 829 (1386) |.+..+.|.++++ +++|+|+|+++.. ..++++++|+|+|++||||||+|||||||||| T Consensus 32 ---------y~~n~~~R~D~~~~~l~ypQ~PlV~t~~~----------~~~~~~e~p~G~N~iVAi~sy~GYN~EDaIIi 92 (373) T pfam00562 32 ---------YTLNKYNRSDQNTYLLCYPQKPLVKTGAV----------EKGGFGELPLGQNALVAVMSYTGYNQEDAIII 92 (373) T ss_pred ---------HHHCCCEEECCCCCEECCCCCCEEEEEEH----------CCCCCCCCCCCCCEEEEEECCCCCCHHHHHHH T ss_conf ---------45442301068651102588765885011----------00588656787206999977678663453123 Q ss_pred CHHHHHCCCCCEEEEEEEEHHHHHCCCCC-CCCCCCCCCCCHHHHHHCCCCCCCCCCCEECCCCCEEECCCCCCCCCCCH Q ss_conf 01344258731034666531121147884-00246665468678410241374137733136761110124677776770 Q gi|254780143|r 830 SERMVSEDVFTSIHIEEFEVMARDTKLGP-EEITRDIPNVSEEGLKNIDECGIICVGAEVNPGDILVGKITPKGESPMTP 908 (1386) Q Consensus 830 n~~~v~rg~~~s~h~~~y~~~~~~~~~g~-~~~~~~~~~~~~~~~~~ld~~Giv~~G~~V~~gDilvgk~tp~~~~~~~p 908 (1386) ||++++||+|+|+|+++|+++++.++.+. ++++.+.++.+...+.+||+||++++|++|.+|||||||++|.. T Consensus 93 Nkssi~RGlf~s~~~~~y~~~~~~~~~~~~e~~~~~~~~~~~~~y~~Ld~dGii~~g~~v~~gDvligK~~~~~------ 166 (373) T pfam00562 93 NKSSVDRGLFTSIHIKEYEIEARKTKLGPIEEITRDPPNVSEEAYRKLDEDGIVRVGAEVKPGDILVGKVTPKG------ 166 (373) T ss_pred HHHHHHHCCCEEEEEEEEEEEEECCCCCCCEEECCCCCCCCHHHHHHCCCCCCCCCCCEECCCCEEEEEECCCC------ T ss_conf 18888623564578986434420257998208725898876677653550147479979748988999953676------ Q ss_pred HHHHHHHHCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 34665430355554432332022788532200121014565420266788889999999872888999887888988888 Q gi|254780143|r 909 EEKLLRAIFGEKAVDVRDTSLRVPSGVSGTVVDVRIFNRHGIDKNERSISVEREQIELLARDKDDEQVILDRNIYSRLME 988 (1386) Q Consensus 909 e~~~l~~i~~~~~~~~~d~~~~~~~g~~g~v~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 988 (1386) +++..++++......+|+|++++.++.|+|.+|.+.. T Consensus 167 -~~~~~~~~~~~~~~~~d~S~~~~~~e~G~Vd~V~~~~------------------------------------------ 203 (373) T pfam00562 167 -EKLLRAIFGEKARDVKDTSLKVKHGEEGRVDDVKIDL------------------------------------------ 203 (373) T ss_pred -CCCCCCCCCCCCCEEEEEEEEECCCCCEEEEEEEECC------------------------------------------ T ss_conf -4320002466676257825981599846999999703------------------------------------------ Q ss_pred HHCCCCCCCCCCCCCCCCCCCHHHHHCCCHHHHHEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 62585345665555543211357640148543100141586789999999998999999999998988763125886774 Q gi|254780143|r 989 ILCGQNAVSGPKGFKKSTVLSSDLISEYPRSQWWQFAVQDEKVQRNVESLKVQYETSKSILEDRFKNKIEKIQWGDDMPP 1068 (1386) Q Consensus 989 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~ki~~~~~~~~ 1068 (1386) .. T Consensus 204 ------------------------------------------------------------------------------~~ 205 (373) T pfam00562 204 ------------------------------------------------------------------------------NP 205 (373) T ss_pred ------------------------------------------------------------------------------CC T ss_conf ------------------------------------------------------------------------------88 Q ss_pred CCEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 72769999998755885523143366787358886300007879358715698668986750708999999999999871 Q gi|254780143|r 1069 GVLRVVKVFVAMKRPIQSGDKMAGRHGNKGIVSRILPCEDMPFLKDGTPVDIVLNPLGVPSRMNVGQIFETHLGWACVGL 1148 (1386) Q Consensus 1069 gv~~~vKV~ir~~R~p~iGDKfasRHGqKGVis~i~p~eDMPf~~dG~~pDIIlNPhgvPSRMtIGqllE~~lGka~~~~ 1148 (1386) ++.++|||++|+.|+|+||||||||||||||||+|||+||||||+||++||||||||||||||||||||||++||||++. T Consensus 206 ~~~~~vkv~ir~~R~p~iGDKfssRhGQKGvig~i~~~~DMPft~dGi~PDiIiNPh~iPSRMTiGqllE~~~gk~~~~~ 285 (373) T pfam00562 206 GGIKKVKVYIRQKRKPQVGDKFASRHGQKGVVSKILPQEDMPFTEDGIPPDIILNPHGVPSRMTIGQLLESLLGKAAALL 285 (373) T ss_pred CCCEEEEEEEEECCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHHHC T ss_conf 87589999993135887411004555786165444464029948889967474266656554628899999988999745 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEECCCCCCCCCHHHHHHHHHHCCC Q ss_conf 96224333432222102467778776315432210000135256777665302684002565578899999999998686 Q gi|254780143|r 1149 GKKIKSLINDYKANGDISPLRSFLEKVIGTGSHTEKISDYDDDSVLRVAEQWKSGVPVSTPVFDGADEEAINSMLRMADL 1228 (1386) Q Consensus 1149 G~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~aTP~F~g~~~~~i~~~L~~aG~ 1228 (1386) |..+|++||+... ..+++|.++|.++|| T Consensus 286 G~~~d~t~F~~~~----------------------------------------------------~~~~~i~~~L~~~g~ 313 (373) T pfam00562 286 GKFIDATPFDGAS----------------------------------------------------EVVEDIGELLKEAGY 313 (373) T ss_pred CCCEEECCCCCCC----------------------------------------------------CHHHHHHHHHHHCCC T ss_conf 9436768989987----------------------------------------------------449999999997599 Q ss_pred CCCCCEEEECCCCCCCCCCCEEEEEEEEEECCCHHHHHCCCCCCCCCCCEEECCCCCCCC Q ss_conf 899869998689884026850488464541011021100002368731103079986233 Q gi|254780143|r 1229 DESGQSILYDGLTGEPFDRPVTVGYIYMLKLNHMVSDKVYARSTGSYSLVTQQPLGGKSN 1288 (1386) Q Consensus 1229 ~~~Gke~lydG~TGe~~~~~IfvG~~YyqkL~HMV~DKiHARstGP~sllTrQP~eGRsr 1288 (1386) +++|+|+||||+||++|+++||+||+|||||+|||+|||||||||||++|||||++|||| T Consensus 314 ~~~G~e~l~~G~TG~~~~~~if~G~~yy~rL~HmV~dK~~~Rs~Gp~~~lT~QP~~Grsr 373 (373) T pfam00562 314 NAYGKEVLYDGRTGEPFKAPIFVGPIYYQKLKHMVDDKIHARSTGPYSLLTQQPLGGRSR 373 (373) T ss_pred CCCCCEEEECCCCCCEECCEEEEEHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCCCC T ss_conf 999998988899998805608885658554444543077956728986503599998879 No 14 >PRK09606 DNA-directed RNA polymerase subunit beta''; Validated Probab=100.00 E-value=0 Score=678.91 Aligned_cols=417 Identities=23% Similarity=0.439 Sum_probs=335.8 Q ss_pred CCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCC-CCCCCEEEEEEEEEECCCC----------CCHHHHHHC Q ss_conf 87088999999999862466522111225899976637807-6798589999978980985----------889999983 Q gi|254780143|r 26 IPDLIEVQKASYDHFLMMNIAPDERPNEGLQAAFKSVFPIT-AFSGAAMLEFVSYEFDPPK----------FDVDDCLWR 94 (1386) Q Consensus 26 ~P~Li~iQ~~Sf~~Flq~~~~~~~r~~~GL~~v~~~~fPI~-d~~~~~~Lef~~y~l~~Pk----------~tp~ECRlR 94 (1386) -..|+++|+||||||| ++||+++|++++||+ +..+++.|+|.++++++|. ++|.|||+| T Consensus 15 ~~~Lv~~qidSfn~Fi----------~~gl~~ii~~~~pi~~~~~~~~~l~~~~~~i~~P~~~e~~~~~~~l~P~ecR~r 84 (491) T PRK09606 15 EEPLVRHHIDSYNDFI----------DNGLQKIIDEQEPIETEIEGGYYVELGKIRVGKPIVKEADGSVREIYPMEARLR 84 (491) T ss_pred CCCCHHHHHHHHHHHH----------HHHHHHHHHHCCCEEECCCCCEEEEEEEEEECCCCEECCCCCCCCCCHHHHHHC T ss_conf 8997089999999999----------974799998529758757996899997899848824337787564698999963 Q ss_pred CCCEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCEECCC------------------------CCEEECCEE Q ss_conf 99753358999999993178766552000122367875100002689------------------------628986821 Q gi|254780143|r 95 DLTYAVPLKITLRLIVFDVDEFTGAKSIKDIKEQSIYMGDLPLMTKD------------------------GTFVIKGIQ 150 (1386) Q Consensus 95 ~lTYsapL~V~i~l~v~~~~~~~~~k~~~~ike~~V~lG~IPiMt~~------------------------GyFIING~E 150 (1386) ++||+|||+|+++++.++.. ...++|++|+||||+.| |||||||+| T Consensus 85 ~lTYs~~l~v~i~~~~~~~~----------~~~~~v~iG~iPIMv~S~~C~L~~~~~~~~~~~gE~~~d~GGYFIInG~E 154 (491) T PRK09606 85 NLTYSAPLYLEMTLVEGGEE----------EEPEEVYIGELPVMVGSKICNLYGLSEEELIEVGEDPLDPGGYFIVNGSE 154 (491) T ss_pred CCCCCCEEEEEEEEEECCCC----------CEEEEEEECCCCEECCCCCCCCCCCCHHHHHHCCCCCCCCCCEEEECCEE T ss_conf 99643148999999989940----------20578996115457257754589979799986187766798679997878 Q ss_pred EEEEEEECCCCCEEECCCCCCCCCCCCEEEEEEEECCCCCE---EEEEECCCCEEEEEECC-CCCHHHHHHHHHCCCCHH Q ss_conf 46866512278521202346515778567999981188723---68997589829999718-787039989988099847 Q gi|254780143|r 151 RIVVSQLHRSPGIHFDHDKGRASLSGKLLYACRIIPDQGLW---MDIEFDSKDIIHVRIDR-RRKVPVTSFLMALGMDSE 226 (1386) Q Consensus 151 RVIVsQl~RSPGVyf~~~k~k~~~s~k~~ysa~IIP~RGSw---Le~e~d~kd~iyvrIdr-~rKIPi~ilLrALG~ssd 226 (1386) ||||+|++++||.+|.... ...++..|+++++|.+++| +.++.+.++.+++++++ +++||+++||||||+++| T Consensus 155 rVIi~Qe~~~~n~~~~~~~---~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~i~v~~~~~~~~IPl~illrALG~~sD 231 (491) T PRK09606 155 RVLMTLEDLAPNKILVEKI---ERYNDRIYVAKVFSQRRGYRALVTVERNRDGLLEVSFPSVPGKIPFVILMRALGLETD 231 (491) T ss_pred EEEEEEHHCCCCEEEEEEC---CCCCCEEEEEEEEECCCCCEEEEEEEECCCCEEEEEECCCCCEEEHHHHHHHHCCCCH T ss_conf 8998611207983899753---6799547999999358874368999986896799998143757669999998568868 Q ss_pred -HHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEECCCCHHHHHHHHHHHHCCCCCCCCCHHHHCC Q ss_conf -9999738740577306731023545666421012232112445845440221020799887876360001026677347 Q gi|254780143|r 227 -EILSTFYPKIVYSQRGDFWCFPLSAADLMVGAKVSSSLVDIDTGEQVIESGKKLTSGLLKSLKEKGVKFLGITSDCLCG 305 (1386) Q Consensus 227 -eIl~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~l~~ 305 (1386) +|++.+.... +. .. T Consensus 232 ~eI~~~i~~d~----------------~~------------------------------~~------------------- 246 (491) T PRK09606 232 EEIVEAVSDDP----------------EI------------------------------VK------------------- 246 (491) T ss_pred HHHHHHHCCCH----------------HH------------------------------HH------------------- T ss_conf 99999864789----------------99------------------------------99------------------- Q ss_pred CCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHCCCC Q ss_conf 41000013666776999623459989999998656541024420244455200000110234588999999887605776 Q gi|254780143|r 306 LYVAEDIVNGETGEIYIEAGDVIDEKSLEEIFHSEIRDIPILYVDSVNNNAYIRNTLVTDKNKDRKDALLDIYRVMRPGD 385 (1386) Q Consensus 306 ~~~~~~~~d~~~gei~~~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~i~~~~~~d~~~~~~eAl~~I~k~lr~~~ 385 (1386) ..+ ..+......++++|+.+|+++++++. T Consensus 247 -------------------------~ll--------------------------~~l~~~~i~t~e~Al~yIg~~~~~~~ 275 (491) T PRK09606 247 -------------------------FML--------------------------ENLEEAEVDTQEDALEYIGKRVAPGQ 275 (491) T ss_pred -------------------------HHH--------------------------HHHHHCCCCCHHHHHHHHHHHCCCCC T ss_conf -------------------------999--------------------------99987589999999999997557998 Q ss_pred CCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCCCC Q ss_conf 31045688886202453023334555577765420366776770661899888898887630487643440101533233 Q gi|254780143|r 386 VSTFSVAESMFNFLFFDSDKYDLSTVGRVKMNMRLNLDTPDDVRHIRKEDIIAIIKILVDLRNGKGTIDDIDNLGNRRVR 465 (1386) Q Consensus 386 ~~~~~~~~~~~~~~~~~~~~y~l~~vgr~~~n~~l~~~~~~~~~~Lt~~d~~~~i~~L~~l~~g~~~~DdiDhlgnkRvr 465 (1386) +......... +...+|.+. ++............+++.|+++|+.+..|...+||+||++|||++ T Consensus 276 ~~~~~~~~~~-----~~l~~~~lp-----------Hlg~~~~~~~~K~~~L~~mi~kLl~l~~g~~~~DD~D~l~NkRv~ 339 (491) T PRK09606 276 TKEYQIKRAE-----YVLDNYLLP-----------HLGVDKEVRIAKAHYLGRMAEACFELALGRREEDDKDHYANKRLK 339 (491) T ss_pred CHHHHHHHHH-----HHHHHHHCC-----------CCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHCCEEEE T ss_conf 8167999999-----999864056-----------679860234678888999999999985699888884132372671 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCCC--CCCCCCCCCCCHHHHHHHHHHHCCCCC-------CEEECCCCCCHHHHH Q ss_conf 33578789888888999988765311344--334352112220234655542026675-------035415542102322 Q gi|254780143|r 466 SVGEMLKNQYRLGLLRMERSIKERISSVD--IDSVMPQDLINAKPVVSAVCEFFCSSQ-------LSQLEEHVNSLSRIT 536 (1386) Q Consensus 466 ~vgeLl~~~fr~~l~rl~r~i~~~~~~~~--~~~~~~~~~in~~~i~~~i~~ff~t~~-------lsq~ld~~n~ls~lt 536 (1386) ++|+|++++|+.++.++.|.++.++.... .....+..+++++.++..++.||+||+ |||+|||+||++++| T Consensus 340 ~~G~Ll~~lfr~~~~~~~k~ik~~l~~~~~~~~~~~~~~~~~~~~it~~~~~~~~tGnw~~~~~glsQ~l~r~n~l~~lS 419 (491) T PRK09606 340 LAGDLMEDLFRVAFNRLARDIKYQLERAAMRNRELSIQTAVRSDVLTERLRHALATGNWVGGRTGVSQLLDRTDYMATLS 419 (491) T ss_pred EHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCHHHHHH T ss_conf 17789999999999999999999999987508877889962735577889999873775578862678833778878888 Q ss_pred HCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEEEECEEEEEEECCCCCCCCEE Q ss_conf 001122344444322346655432211110444303566765210210001224424687644106 Q gi|254780143|r 537 HTRRLSALGQGGVARARAGVEMRDVHPTHYGRICPAETSEGHNIGLVSSLTSFARVNAYGFIETPY 602 (1386) Q Consensus 537 h~RR~s~lgpggl~r~~~~~evR~ih~s~~GriCPieTPEG~n~GLv~~la~~a~in~~g~ie~py 602 (1386) |+||+++ ++.++++.++||+|||||||||||+|||||+||||++|||++|+||.. ..+.+. T Consensus 420 h~Rrv~~----~~~~~~k~~~~R~lhps~~G~iCPveTPeG~~~GLi~~La~~~~Is~~-~~~~~i 480 (491) T PRK09606 420 HLRRVVS----PLSRSQPHFEARDLHPTQWGRICPSETPEGPNCGLVKNLAQMVEISTG-EDEEEI 480 (491) T ss_pred HHHHCCC----CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCEEEEECC-CCCHHH T ss_conf 7654067----756567887678478121763055779897003077763352899679-991789 No 15 >pfam04563 RNA_pol_Rpb2_1 RNA polymerase beta subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This domain forms one of the two distinctive lobes of the Rpb2 structure. This domain is also known as the protrusion domain. The other lobe (pfam04561) is nested within this domain. Probab=100.00 E-value=0 Score=486.33 Aligned_cols=355 Identities=29% Similarity=0.484 Sum_probs=284.4 Q ss_pred CHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCCCCC--EEEEEEEEEECCCCC----------CHHHHHHCC Q ss_conf 08899999999986246652211122589997663780767985--899999789809858----------899999839 Q gi|254780143|r 28 DLIEVQKASYDHFLMMNIAPDERPNEGLQAAFKSVFPITAFSGA--AMLEFVSYEFDPPKF----------DVDDCLWRD 95 (1386) Q Consensus 28 ~Li~iQ~~Sf~~Flq~~~~~~~r~~~GL~~v~~~~fPI~d~~~~--~~Lef~~y~l~~Pk~----------tp~ECRlR~ 95 (1386) .|+++|++|||||| ++||++++++++||++.+++ +.|+|.++++++|.+ +|+|||+|+ T Consensus 1 gLv~~qi~Sfn~Fi----------~~gl~~ii~~~~~I~~~~~~~~~~l~~~~~~i~~P~~~e~~~~~~~i~P~eaR~r~ 70 (394) T pfam04563 1 GLVEQQLDSFNWFL----------DEGLQEEIDEFPPIEDEDEEPEFSLKVGQIKLAKPKIKESDGKTREIYPREARLRN 70 (394) T ss_pred CCHHHHHHHHHHHH----------HHHHHHHHHHCCCEEECCCCEEEEEEEEEEEECCCCEEECCCCCCCCCHHHHHHCC T ss_conf 94478889999999----------97189999865998815998699999878998378056156755778979999619 Q ss_pred CCEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEEECCEECCC------------------------CCEEECCEEE Q ss_conf 9753358999999993178766552000122367875100002689------------------------6289868214 Q gi|254780143|r 96 LTYAVPLKITLRLIVFDVDEFTGAKSIKDIKEQSIYMGDLPLMTKD------------------------GTFVIKGIQR 151 (1386) Q Consensus 96 lTYsapL~V~i~l~v~~~~~~~~~k~~~~ike~~V~lG~IPiMt~~------------------------GyFIING~ER 151 (1386) +||||||||++++++.+. +++++++|++|+||+||+| |||||||+|| T Consensus 71 lTYs~~i~v~v~~~~~~~---------~~~~~~~V~iG~iPiMv~S~~C~L~~~~~~el~~~~Ec~~D~GGYFIING~Er 141 (394) T pfam04563 71 LTYSSPLYVPAELTVNNT---------EEIEKQKVFIGKIPLMLRSNACILNGASESELVKLGECPLDPGGYFIVNGIEK 141 (394) T ss_pred CCCCEEEEEEEEEEECCC---------CEEEEEEEEEECCCEEECCCEEECCCCCHHHHHHCCCCCCCCCCEEEECCHHH T ss_conf 965304899999999788---------64899999850252460466224689998998652667668980899846777 Q ss_pred EEEEEECCCCCEEECCCCCCCCCCCCEEEEEEEECCCCCEEEEEECCCCEEEEEECCCCCHHHHHHHHHCCCCHHHHHH- Q ss_conf 6866512278521202346515778567999981188723689975898299997187870399899880998479999- Q gi|254780143|r 152 IVVSQLHRSPGIHFDHDKGRASLSGKLLYACRIIPDQGLWMDIEFDSKDIIHVRIDRRRKVPVTSFLMALGMDSEEILS- 230 (1386) Q Consensus 152 VIVsQl~RSPGVyf~~~k~k~~~s~k~~ysa~IIP~RGSwLe~e~d~kd~iyvrIdr~rKIPi~ilLrALG~ssdeIl~- 230 (1386) |||+|++||||++|...+. ..++..|+|+++|.+|+|++++.+.+..+|+++++++++|++++++|+|+....|+. T Consensus 142 VIi~qe~~~~n~~~~~~~~---~~~~~~~~~~i~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 218 (394) T pfam04563 142 VIINQIQRSPNIYYVFKKD---KNGIRIYSASIISNRGRSVRLEITSKGKIYARINSGAKLIIFVLLLALGLNPVEIILI 218 (394) T ss_pred HHHHHHHCCCCCEEEEECC---CCCCEEEEEEEECCCCCCCCCEEEEEEEEEEEECCCCCCCEEEEHHHHCCCCHHHHHH T ss_conf 8788761078744899748---9982788899953677753211440005999936998640588338555886899998 Q ss_pred HHCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEECCCCHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCCC Q ss_conf 73874057730673102354566642101223211244584544022102079988787636000102667734741000 Q gi|254780143|r 231 TFYPKIVYSQRGDFWCFPLSAADLMVGAKVSSSLVDIDTGEQVIESGKKLTSGLLKSLKEKGVKFLGITSDCLCGLYVAE 310 (1386) Q Consensus 231 ~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~l~~~~~~~ 310 (1386) .++..... +. ..... .. T Consensus 219 ~~~~~~~~--------------e~------~~~~~------------------------~~------------------- 235 (394) T pfam04563 219 VLVPEFDL--------------EI------IDDIG------------------------VN------------------- 235 (394) T ss_pred HHCCCCHH--------------HH------HHHHH------------------------CC------------------- T ss_conf 50444449--------------99------99875------------------------24------------------- Q ss_pred CCCCCCCCEEEEECCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHHCCCCCCHHH Q ss_conf 01366677699962345998999999865654102442024445520000011023458899999988760577631045 Q gi|254780143|r 311 DIVNGETGEIYIEAGDVIDEKSLEEIFHSEIRDIPILYVDSVNNNAYIRNTLVTDKNKDRKDALLDIYRVMRPGDVSTFS 390 (1386) Q Consensus 311 ~~~d~~~gei~~~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~i~~~~~~d~~~~~~eAl~~I~k~lr~~~~~~~~ 390 (1386) +. ....... ...........+..+|+.++..+ T Consensus 236 ~~----------------~~~~~~~----------------------~~~~~~~~~~~~~~~~~~~i~~~---------- 267 (394) T pfam04563 236 LE----------------EDEFLTL----------------------KLELEEKFYIQTQDEALSFIGKL---------- 267 (394) T ss_pred CC----------------HHHHHHH----------------------HHHHHHHHCCCCCHHHHHHHHHH---------- T ss_conf 31----------------1156666----------------------78899874179837889999998---------- Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCCCCCHHHH Q ss_conf 68888620245302333455557776542036677677066189988889888763048764344010153323333578 Q gi|254780143|r 391 VAESMFNFLFFDSDKYDLSTVGRVKMNMRLNLDTPDDVRHIRKEDIIAIIKILVDLRNGKGTIDDIDNLGNRRVRSVGEM 470 (1386) Q Consensus 391 ~~~~~~~~~~~~~~~y~l~~vgr~~~n~~l~~~~~~~~~~Lt~~d~~~~i~~L~~l~~g~~~~DdiDhlgnkRvr~vgeL 470 (1386) +...++...++.+..+++...+..++++.....+++++.+++.|+++|+.+..|.+.+||+|||+|||++++|+| T Consensus 268 -----~~~~~~~~~~~~~~~~~~l~~~~lphl~~~~~~~~~k~~~l~~mi~~Ll~~~~g~~~~DD~D~~~NKRv~~~G~L 342 (394) T pfam04563 268 -----GSARGFQRERRILGAVGILRLNVLPHLGVSENKRTLKAQDIGYMIHRLLLLALGRGPLDDIDHLGNKRLRLAGEL 342 (394) T ss_pred -----HHCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHH T ss_conf -----730478811578899899998777098998765404299999999999998639899988511168122759999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCC--CCCCCCCCCCCCCHHHHHHHHHHHCCCC Q ss_conf 7898888889999887653113--4433435211222023465554202667 Q gi|254780143|r 471 LKNQYRLGLLRMERSIKERISS--VDIDSVMPQDLINAKPVVSAVCEFFCSS 520 (1386) Q Consensus 471 l~~~fr~~l~rl~r~i~~~~~~--~~~~~~~~~~~in~~~i~~~i~~ff~t~ 520 (1386) |+++||.++.||+|.++++|.. .....++|..++++++|++++++||+|| T Consensus 343 l~~lFr~~~~r~~r~ik~~~~~~~~~~~~~~~~~~i~~~~it~~i~~~~~TG 394 (394) T pfam04563 343 LQSQFRILLNRLERDVRERIQKCLKKNFDFTLQNLVNSKPITSGIRYFLGTG 394 (394) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHCCCHHHHHHHHHHHCCC T ss_conf 9999999999999999999998860478789999838365999999985389 No 16 >pfam04561 RNA_pol_Rpb2_2 RNA polymerase Rpb2, domain 2. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Rpb2 is the second largest subunit of the RNA polymerase. This domain forms one of the two distinctive lobes of the Rpb2 structure. This domain is also known as the lobe domain. DNA has been demonstrated to bind to the concave surface of the lobe domain, and plays a role in maintaining the transcription bubble. Many of the bacterial members contain large insertions within this domain, as region known as dispensable region 1 (DRI). Probab=100.00 E-value=2.3e-34 Score=265.39 Aligned_cols=177 Identities=31% Similarity=0.494 Sum_probs=142.3 Q ss_pred CCCCEEECCCCCCCCCCCCEEEEEEEECCCCCEEEEEECCCCEEEEEECCCCCHHHHHHHHHCCCCHH-HHHHHHCCCEE Q ss_conf 27852120234651577856799998118872368997589829999718787039989988099847-99997387405 Q gi|254780143|r 159 RSPGIHFDHDKGRASLSGKLLYACRIIPDQGLWMDIEFDSKDIIHVRIDRRRKVPVTSFLMALGMDSE-EILSTFYPKIV 237 (1386) Q Consensus 159 RSPGVyf~~~k~k~~~s~k~~ysa~IIP~RGSwLe~e~d~kd~iyvrIdr~rKIPi~ilLrALG~ssd-eIl~~f~~~~~ 237 (1386) |||||||.++.+++ +.+.+|+|+|||+|||||+||+|+++.+||+|||++|||+++||||||+++| ||++.++.... T Consensus 1 RSPGvy~~~~~~~~--~~~~~y~a~iip~rG~Wl~~e~d~~~~~~v~idr~~kiPi~ilLrALG~~sd~eIl~~i~~~~~ 78 (180) T pfam04561 1 RSNGIYVEKELDKN--GIGATYTSSLISNRGSWLKLEIDGKTLIWSRPSKKRKIPIVIFLKALGLVSDREILDRLCYDFN 78 (180) T ss_pred CCCCEEEECCCCCC--CCCCEEEEEEECCCCCCEEEEECCCCEEEEEECCCCCCCHHHHHHHHCCCCHHHHHHHHCCCCC T ss_conf 99946852243788--8851688999238987179998689879999778775239999999768988999998444420 Q ss_pred EEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEECCCCHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCCCCCCCCCC Q ss_conf 77306731023545666421012232112445845440221020799887876360001026677347410000136667 Q gi|254780143|r 238 YSQRGDFWCFPLSAADLMVGAKVSSSLVDIDTGEQVIESGKKLTSGLLKSLKEKGVKFLGITSDCLCGLYVAEDIVNGET 317 (1386) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~l~~~~~~~~~~d~~~ 317 (1386) .... T Consensus 79 ~~~~---------------------------------------------------------------------------- 82 (180) T pfam04561 79 DPQM---------------------------------------------------------------------------- 82 (180) T ss_pred CHHH---------------------------------------------------------------------------- T ss_conf 0778---------------------------------------------------------------------------- Q ss_pred CEEEEECCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHH--CCCCCCHHHHHHHH Q ss_conf 7699962345998999999865654102442024445520000011023458899999988760--57763104568888 Q gi|254780143|r 318 GEIYIEAGDVIDEKSLEEIFHSEIRDIPILYVDSVNNNAYIRNTLVTDKNKDRKDALLDIYRVM--RPGDVSTFSVAESM 395 (1386) Q Consensus 318 gei~~~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~i~~~~~~d~~~~~~eAl~~I~k~l--r~~~~~~~~~~~~~ 395 (1386) .. .....+...+...++++|+.+||+++ ++++++....++. T Consensus 83 ---------------~~---------------------~~~~~~~~~~~~~t~~~Al~~i~~~~~~~~~~~~~~~~a~~- 125 (180) T pfam04561 83 ---------------LE---------------------LLKPELEEAENIYTQEEALDYIGKGFYLRRGEEPRLQRARE- 125 (180) T ss_pred ---------------HH---------------------HHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHH- T ss_conf ---------------99---------------------98899987167899999999999884247899844999999- Q ss_pred HHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCCCC Q ss_conf 6202453023334555577765420366776770661899888898887630487643440101533233 Q gi|254780143|r 396 FNFLFFDSDKYDLSTVGRVKMNMRLNLDTPDDVRHIRKEDIIAIIKILVDLRNGKGTIDDIDNLGNRRVR 465 (1386) Q Consensus 396 ~~~~~~~~~~y~l~~vgr~~~n~~l~~~~~~~~~~Lt~~d~~~~i~~L~~l~~g~~~~DdiDhlgnkRvr 465 (1386) +++.++|++++.+.+...++|++.|+++|+++|+.+..|.+.+||||||||||+| T Consensus 126 ---------------~l~~~ln~kLg~~~~~~~~~lt~~di~~~i~~Li~l~~g~~~~DDiDhlgNRRvR 180 (180) T pfam04561 126 ---------------ILYSNLNKHLGLNEPFENERLKAQDILYMIDRLLNLKLGRRKPDDIDHLGNKRVR 180 (180) T ss_pred ---------------HHHHHCCCCCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC T ss_conf ---------------9875031126888887765757999999999999985399999767665466689 No 17 >pfam04560 RNA_pol_Rpb2_7 RNA polymerase Rpb2, domain 7. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Rpb2 is the second largest subunit of the RNA polymerase. This domain comprised of the structural domains anchor and clamp. The clamp region (C-terminal) contains a zinc-binding motif. The clamp region is named due to its interaction with the clamp domain found in Rpb1. The domain also contains a region termed "switch 4". The switches within the polymerase are thought to signal different stages of transcription. Probab=99.93 E-value=8.1e-27 Score=210.98 Aligned_cols=77 Identities=57% Similarity=0.941 Sum_probs=71.8 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCC-CCCCHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 784320789999999869899998611100110159999887638787899-78667789999999854200276406 Q gi|254780143|r 1290 GGQRLGEMEVWCIQAYGAAYVLQEMLTIKSDDVVGRTRVYESIVAGNDTFE-TGTPESFNVLVKEMQALGLSIDLENS 1366 (1386) Q Consensus 1290 GGlRfGEMErwaL~AyGAa~~LqE~Lt~kSDdv~gr~~~~~~iv~g~~~~~-~~~pesf~vl~~El~~l~l~~~~~~~ 1366 (1386) |||||||||||||+|||||++|+||||.+||+||+|..++.++.+++..+. ..+|||||||++||+|||+++++..+ T Consensus 1 GGlR~GEMErd~l~~hGas~~LkErl~~~SD~vc~~cg~~~~~c~~~~~~~~v~iPyafKLL~qEL~am~I~~rl~~~ 78 (78) T pfam04560 1 GGQRFGEMEVWALEAYGAAYTLQERLTIKSDDVCGRCGAYAAICKGKTIIEPGDIPESFKLLLQELRSLGLDIRLFLE 78 (78) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHCCCCHHHHHHCCCCCEEECCCCHHHHHHHHHHHHCCCCEEEECC T ss_conf 996301679999999959999999842578733125376989869899634667887999999999977777177339 No 18 >pfam04565 RNA_pol_Rpb2_3 RNA polymerase Rpb2, domain 3. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Domain 3, s also known as the fork domain and is proximal to catalytic site. Probab=99.82 E-value=3e-21 Score=170.82 Aligned_cols=68 Identities=51% Similarity=0.941 Sum_probs=64.9 Q ss_pred EECCCCCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEEEECEEEEEEEC Q ss_conf 5415542102322001122344444322346655432211110444303566765210210001224424 Q gi|254780143|r 524 QLEEHVNSLSRITHTRRLSALGQGGVARARAGVEMRDVHPTHYGRICPAETSEGHNIGLVSSLTSFARVN 593 (1386) Q Consensus 524 q~ld~~n~ls~lth~RR~s~lgpggl~r~~~~~evR~ih~s~~GriCPieTPEG~n~GLv~~la~~a~in 593 (1386) |+|||+|++++++|+||++.. +++.++.+.+++|+||+||||++||+|||||++||||+|||++|+|+ T Consensus 1 Q~l~r~n~ls~lShlRri~~~--~~~~~~~k~~~~R~lh~s~~G~iCp~eTPEG~~~GLvk~La~~~~Is 68 (68) T pfam04565 1 QVLDQTNWLSELSHKRRVNRL--GGLSKERKTFEVRDLHPSQYGRICPIETPEGANCGLVNSLALYARIN 68 (68) T ss_pred CCCCCCCHHHHHHHHHHCCCC--CCCCCCCCCCCCCCCCHHHCEEECCCCCCCCCCEEEEEEEEEEEEEC T ss_conf 945225789999972601777--88664466766252587678044444288998355132124577709 No 19 >KOG0214 consensus Probab=99.36 E-value=1.4e-15 Score=129.88 Aligned_cols=309 Identities=11% Similarity=-0.040 Sum_probs=204.4 Q ss_pred CCCCCHHHHHHHHHCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 77677034665430355554432332022788532200121014565420266788889999999872888999887888 Q gi|254780143|r 903 ESPMTPEEKLLRAIFGEKAVDVRDTSLRVPSGVSGTVVDVRIFNRHGIDKNERSISVEREQIELLARDKDDEQVILDRNI 982 (1386) Q Consensus 903 ~~~~~pe~~~l~~i~~~~~~~~~d~~~~~~~g~~g~v~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 982 (1386) ..+++...+.+.|+....+++..|+.+....+.++.+.. +.+.|...+.........++..++. ......-..... T Consensus 736 ~~eL~aG~NaiVAi~~~~GYNqEDsvimn~s~v~rg~Fr-S~~~RsYk~q~~~~~~~~ee~~~~~---~~~~~~~mr~~~ 811 (1141) T KOG0214 736 FRELPAGQNAIVAIACYSGYNQEDSVIMNQSSVDRGLFR-SFFIRSYKDQEHKKDQGPEEIFEEP---PRGEGRGMRNGK 811 (1141) T ss_pred HHHCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCHHH-HHHHHHHHHHHHCCCCCCCCCCCCC---CCCCCCCCCCCC T ss_conf 543144612389984246745777888766554202043-2124676665520355521000365---510020122244 Q ss_pred HHH---HHHHHCCCCCCCCCCCCCCCCCCCHHHHHCCCHHHHHEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 988---88862585345665555543211357640148543100141586789999999998999999999998988763 Q gi|254780143|r 983 YSR---LMEILCGQNAVSGPKGFKKSTVLSSDLISEYPRSQWWQFAVQDEKVQRNVESLKVQYETSKSILEDRFKNKIEK 1059 (1386) Q Consensus 983 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~k 1059 (1386) +++ -+.+..|.+...++..+.|.+.......+...... ...+++. T Consensus 812 ~dkLdddG~i~~G~~vs~~Dv~iGk~t~~~~~~~~~~~~~~--------------------------------~~t~~d~ 859 (1141) T KOG0214 812 YDKLDDDGIIMPGSRVSGGDVLIGKTTPQPAKEDESGPEDR--------------------------------LYTKRDH 859 (1141) T ss_pred CCCCCCCCCCCCCCEEECCCEEECCCCCCCCCCHHCCCCCC--------------------------------CCCCCCC T ss_conf 33322257756762331288880245677665001264334--------------------------------3345541 Q ss_pred HCCCCCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCEEEEEECCCCCCCCCCHHHHHHH Q ss_conf 12588677472769999998755885523143366787358886300007879358715698668986750708999999 Q gi|254780143|r 1060 IQWGDDMPPGVLRVVKVFVAMKRPIQSGDKMAGRHGNKGIVSRILPCEDMPFLKDGTPVDIVLNPLGVPSRMNVGQIFET 1139 (1386) Q Consensus 1060 i~~~~~~~~gv~~~vKV~ir~~R~p~iGDKfasRHGqKGVis~i~p~eDMPf~~dG~~pDIIlNPhgvPSRMtIGqllE~ 1139 (1386) .......+.| .+.+|.+.. ++.|+||+.+|++++-+.+|-...++++-..||++++.-+++.-+++|.|-..+++ T Consensus 860 s~~Lr~~e~G--ivd~V~vt~---n~~G~kF~kv~vr~~ripqiGDKfasrHgqKG~ig~~~~qedmpft~eGi~pDiii 934 (1141) T KOG0214 860 STKLRHTERG--IVDQVWVTK---NSEGPKFVKVRVRQVRIPQIGDKFASRHGQKGTIGITYRQEDMPFTIEGIVPDIII 934 (1141) T ss_pred EEECCCCCCC--EEEEEEEEC---CCCCCCEEEEEEEECCCCCCCCHHCCCCCCCCCCCCEEECCCCCCCCCCCCCCEEE T ss_conf 2440237861--689999804---77787127899762134432330013356675124133348898422577764687 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEECCCCCCCCCHHHH Q ss_conf 99999987196224333432222102467778776315432210000135256777665302684002565578899999 Q gi|254780143|r 1140 HLGWACVGLGKKIKSLINDYKANGDISPLRSFLEKVIGTGSHTEKISDYDDDSVLRVAEQWKSGVPVSTPVFDGADEEAI 1219 (1386) Q Consensus 1140 ~lGka~~~~G~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~aTP~F~g~~~~~i 1219 (1386) -.-...+.. ++..+ T Consensus 935 NPhaiPSRm------------------------------------------------------------------tig~l 948 (1141) T KOG0214 935 NPHAIPSRM------------------------------------------------------------------TIGQL 948 (1141) T ss_pred CCCCCCCCC------------------------------------------------------------------CHHHH T ss_conf 765586335------------------------------------------------------------------61566 Q ss_pred HHHHHHCCCCCCCCEEEECCCCCCCCCCCEEEEEEEEEECCCHHHHHCCCCCCCCCCCEEECCCCCCCCCCCCCCHHHHH Q ss_conf 99999868689986999868988402685048846454101102110000236873110307998623317843207899 Q gi|254780143|r 1220 NSMLRMADLDESGQSILYDGLTGEPFDRPVTVGYIYMLKLNHMVSDKVYARSTGSYSLVTQQPLGGKSNRGGQRLGEMEV 1299 (1386) Q Consensus 1220 ~~~L~~aG~~~~Gke~lydG~TGe~~~~~IfvG~~YyqkL~HMV~DKiHARstGP~sllTrQP~eGRsr~GGlRfGEMEr 1299 (1386) -+.|. |-...++..++|+.++.. -.-.+++..+++...|++.++.|++.+++..+.+|+++|+....++.+++||++ T Consensus 949 iEc~l--gk~~a~~~e~~~atpFs~-v~v~~is~~l~~~g~~~~G~e~~ynGrtG~~~~~~if~GptyyqrL~Hmvd~ki 1025 (1141) T KOG0214 949 IECLL--GKVAAYEGEEGDATPFSD-VTVSKISANLHVYGYQYRGNERMYNGRTGRKLRAQIFIGPTYYQRLKHMVDDKI 1025 (1141) T ss_pred HHHHH--HHHHHCCCCCCCCCCCCC-CCHHCCCCCHHHHCCCCCCCEEEECCCCCCEEEEEEECCCHHHHHHHHHHHHEE T ss_conf 78763--245540342256777665-321001411776364357978885388886302435228439987777643225 Q ss_pred HHHHHHHHHHHHHHHHHCCCCC Q ss_conf 9999986989999861110011 Q gi|254780143|r 1300 WCIQAYGAAYVLQEMLTIKSDD 1321 (1386) Q Consensus 1300 waL~AyGAa~~LqE~Lt~kSDd 1321 (1386) .|=..++.+-+.++.+---|+| T Consensus 1026 h~R~~Gp~q~ltRQP~~gRsr~ 1047 (1141) T KOG0214 1026 HSRARGPVQILTRQPVEGRSRD 1047 (1141) T ss_pred EECCCCCEEEEECCCCCCCCCC T ss_conf 3014687034431654466434 No 20 >pfam10385 RNA_pol_Rpb2_45 RNA polymerase beta subunit external 1 domain. RNA polymerases catalyse the DNA-dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared with three in eukaryotes (not including mitochondrial or chloroplast polymerases). This domain in prokaryotes spans the gap between domains 4 and 5 of the yeast protein. It is also known as the external 1 region of the polymerase and is bound in association with the external 2 region. Probab=98.87 E-value=2.4e-09 Score=84.89 Aligned_cols=66 Identities=58% Similarity=0.922 Sum_probs=64.7 Q ss_pred EEEEECCCCCCCEEECCHHHHCCEEEECCCCEECCCCCCCCCCEEECCCCCCCCCCHHHEEECCCC Q ss_conf 698622553481664296773763796165341146840222000000233332257872202367 Q gi|254780143|r 602 YRKVCDGKVTNDVVYLSAMEEENRYIAQANSSLDEDGSFTEELVFCRCAGEEILVPREKIDFIDAS 667 (1386) Q Consensus 602 y~~v~~~~~~~~i~~l~~~~e~~~~Ia~~~~~l~~~~~~~~~~~~~r~~~~~~~~~~~~v~~~~i~ 667 (1386) ||+|.+|+++++++||||.+|+++.|||+++.+..+|.+.+..+.+|+++++..+.+++|+|||+| T Consensus 1 YrkV~~G~Vt~~i~YLsA~eE~~~~IAqa~~~~d~~g~i~~~~V~~R~~ge~~~~~~~~VdyiDVS 66 (66) T pfam10385 1 YRKVKNGKVTDEIVYLTADEEEGYVIAQANAPLDEDGKFVDDLVIARYRGEFVLVPPEEVDYMDVS 66 (66) T ss_pred CEEECCCEECCCEEEECHHHHCCCEEEECCCCCCCCCCCCCCCEEEEECCCEEEECHHHCEEEECC T ss_conf 969738998696699677983886799657569899979345175888582016897674179629 No 21 >COG0085 RpoB DNA-directed RNA polymerase, beta subunit/140 kD subunit [Transcription] Probab=98.53 E-value=2.3e-07 Score=70.63 Aligned_cols=70 Identities=27% Similarity=0.347 Sum_probs=60.8 Q ss_pred CCCCCEEECCHHHHCCEEEECCCCEECCCCCCCCCCEEECCCCCCCCCCHHHEEECCCCCCCEEEECCCCCCCHHHCCHH Q ss_conf 53481664296773763796165341146840222000000233332257872202367211023112333201101002 Q gi|254780143|r 609 KVTNDVVYLSAMEEENRYIAQANSSLDEDGSFTEELVFCRCAGEEILVPREKIDFIDASPKQVVSIAASLIPFLENDDSN 688 (1386) Q Consensus 609 ~~~~~i~~l~~~~e~~~~Ia~~~~~l~~~~~~~~~~~~~r~~~~~~~~~~~~v~~~~i~p~~i~sv~aslIPflehdda~ 688 (1386) ..++ ++|+.|.++....||..... ++|.|.+|..+++.+++++||++||+++ T Consensus 576 v~tG-~E~~~a~e~~~~~ia~~~~~---------------------------~~~ve~~~~~I~~~~~~~~~~~~~n~~~ 627 (1060) T COG0085 576 VGTG-MEYLDAEDSGAAVIAKRPGV---------------------------VTHVEISPIVILGIEASLIPYPEHNQSP 627 (1060) T ss_pred CCCC-CEEECCCCCCCEEEECCCCC---------------------------EEEEEEEEEEEEEECCCCCCCCCCCCCH T ss_conf 1278-54432343544267604893---------------------------7999520359996325666665568676 Q ss_pred HHHHHHHHHHHHHCCCCC Q ss_conf 211222344432101366 Q gi|254780143|r 689 RVLMGCNMQRQAVPLLKA 706 (1386) Q Consensus 689 R~l~g~nm~rQav~l~~~ 706 (1386) |++|+|.|.+|+..+... T Consensus 628 ~n~~~~~~~~Q~~~~~~~ 645 (1060) T COG0085 628 YNLYKFARSNQATGINQR 645 (1060) T ss_pred HHHHHHHHHHCCCCCCCC T ss_conf 788888664013477656 No 22 >PRK09603 DNA-directed RNA polymerase subunit beta/beta'; Reviewed Probab=97.77 E-value=0.00013 Score=50.83 Aligned_cols=127 Identities=15% Similarity=0.107 Sum_probs=85.7 Q ss_pred CCCCCCCCCCEEECCCEECCCC-------CCCCC---CCCCCCCC--EEEEEECCCCCCCCCEECCHHHHHCCCCCEEEE Q ss_conf 4453044797720785203552-------23578---60222375--155531355444442000013442587310346 Q gi|254780143|r 777 NQRPLVKVGDEVRRNDIIADGP-------STDLG---DLALGRNM--LVAFMPWHGYNFEDSMLISERMVSEDVFTSIHI 844 (1386) Q Consensus 777 ~q~p~V~~g~~~~~~~~l~~~~-------~~~~~---el~~G~N~--~VA~m~~~GYN~EDaiiin~~~v~rg~~~s~h~ 844 (1386) ...-+|+.|.+|+.||+|.--. .|... ..+.|+++ +++-.++.+|++|+.+|.++...++|.+++... T Consensus 876 De~GIi~iGa~V~~gDILVGKvtPkget~~tpeekLLraifGeka~~v~dtSLrvp~g~eG~VIdv~~f~r~g~~k~~r~ 955 (2890) T PRK09603 876 DESGIVKVGTYVSAGMILVGKTSPKGEIKSTPEERLLRAIFGDKAGHVVNKSLYCPPSLEGTVIDVKVFTKKGYEKDARV 955 (2890) T ss_pred CCCCCEEECCEECCCCEEEEEECCCCCCCCCHHHHHHHHHHCCCHHHHHHHHCCCCCCCCCEEEEEEEEECCCCCHHHHH T ss_conf 50379886217546888998436788787786889877651783666757545468999876999999742464213556 Q ss_pred E-EEEHHHHHCCCCC-CCCC---CC-CCCC-CHHHHHHCCCCCCCCCCCEECCCCCEEECCCCCCC Q ss_conf 6-6531121147884-0024---66-6546-86784102413741377331367611101246777 Q gi|254780143|r 845 E-EFEVMARDTKLGP-EEIT---RD-IPNV-SEEGLKNIDECGIICVGAEVNPGDILVGKITPKGE 903 (1386) Q Consensus 845 ~-~y~~~~~~~~~g~-~~~~---~~-~~~~-~~~~~~~ld~~Giv~~G~~V~~gDilvgk~tp~~~ 903 (1386) . .|+.+.+...... ++++ ++ .... ....+.+|+++|++.++..+.+.||..+++++... T Consensus 956 ~~~~~~e~~~l~~~~~d~~~~~~~~~~~rl~~~~~~~kl~~d~~~~~~~~~~G~~i~~~~~~~i~~ 1021 (2890) T PRK09603 956 LSAYEEEKAKLDMEHFDRLTMLNREELLRVSSLLSQAILEEPFSHNGKDYKEGDQIPKEEIASINR 1021 (2890) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHCCCCCH T ss_conf 666788988741005666531017778776421112323554434775434454103212045673 No 23 >PRK08565 DNA-directed RNA polymerase subunit beta; Provisional Probab=97.59 E-value=1.8e-05 Score=56.94 Aligned_cols=193 Identities=15% Similarity=0.167 Sum_probs=105.1 Q ss_pred CCCHHHCCHHHHHHHHHHHHHHHCCCCCCCCEECCCCCHHHHHHCCCEEEC--CCCCCCCCCCCCCE---EEECCCCCCC Q ss_conf 320110100221122234443210136654111266201110106530102--12443433565530---2521566556 Q gi|254780143|r 679 IPFLENDDSNRVLMGCNMQRQAVPLLKAEAPFVGTGMESVVAKSSGAAIVA--KRAGIVEQVDAIRI---VIRSVEGDLD 753 (1386) Q Consensus 679 IPflehdda~R~l~g~nm~rQav~l~~~~~~~v~tg~E~~~~~~s~~~i~a--~~~g~v~~vd~~~i---~i~~~~~~~~ 753 (1386) |+|++|++++++.|.++|+|++.|++..+......+.|.....+++...+. ...|+++|+|++.. .|+....... T Consensus 537 i~~~~~~~~~~i~i~sd~gR~~rPll~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~ieyid~~e~~~~~Ia~~~~~~~ 616 (1101) T PRK08565 537 VAYYRTGEINEVYVNCDAGRVRRPLIVVENGKPKLTREHVEKLKKGELTFDDLVKMGVVEYLDAEEEENAYIALDPWDVT 616 (1101) T ss_pred EEEECCCCCCEEEEECCCCCCCCEEEEECCCCCCCCHHHHHHHHCCCCCHHHHHHCCCEEEECCCCCEEEEEEECHHHCC T ss_conf 99743765565899647762114058722786423488888877488204556407846986554240058996567735 Q ss_pred CCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEECCHHH Q ss_conf 44466301114655544547322445304479772078520355223578602223751555313554444420000134 Q gi|254780143|r 754 PSTSGVDIYRLMKFQRSNQNTCVNQRPLVKVGDEVRRNDIIADGPSTDLGDLALGRNMLVAFMPWHGYNFEDSMLISERM 833 (1386) Q Consensus 754 ~~~~~~~~y~~~~~~~~~~~~~~~q~p~V~~g~~~~~~~~l~~~~~~~~~el~~G~N~~VA~m~~~GYN~EDaiiin~~~ 833 (1386) . .-|.. -+.+.. -..++.+..||.-|||-..-+..... T Consensus 617 ~-----------------~~th~----------Ei~p~~---------------ilsv~asliPf~~hNqspRn~yq~~m 654 (1101) T PRK08565 617 K-----------------EHTHL----------EIWPPA---------------ILGVVASIIPYPEHNQSPRNTYQAAM 654 (1101) T ss_pred C-----------------CCCEE----------EEEHHH---------------EEEEEECCCCCCCCCCCHHHHHHHCC T ss_conf 6-----------------65248----------850136---------------13553114643566721566542124 Q ss_pred HHCCCCCEEEEEEEEHHHHHCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCEECCCCCEEECCCCCCCCCCCHHHHHH Q ss_conf 42587310346665311211478840024666546867841024137413773313676111012467777677034665 Q gi|254780143|r 834 VSEDVFTSIHIEEFEVMARDTKLGPEEITRDIPNVSEEGLKNIDECGIICVGAEVNPGDILVGKITPKGESPMTPEEKLL 913 (1386) Q Consensus 834 v~rg~~~s~h~~~y~~~~~~~~~g~~~~~~~~~~~~~~~~~~ld~~Giv~~G~~V~~gDilvgk~tp~~~~~~~pe~~~l 913 (1386) .++-+....+ ....|....+.-..+.+.|-+. ....|.+ +..+.+.+.+++ T Consensus 655 ~KQa~g~~~~----n~~~r~D~~~~~l~ypQ~PlV~------------------t~~~~~~-------~~~~~p~G~N~i 705 (1101) T PRK08565 655 AKQSLGLYAA----NFRIRVDTRGHLLHYPQRPLVQ------------------TRGLELI-------GYNDRPAGQNAV 705 (1101) T ss_pred CCCCCCCCCC----CCEEEECCCCCEEECCCCCEEE------------------ECHHEEE-------CCCCCCCCEEEE T ss_conf 4333565410----0115761555379717776265------------------0200143-------457788976689 Q ss_pred HHHCCCCCCCCCCCCEECCCCCCCCCCCC Q ss_conf 43035555443233202278853220012 Q gi|254780143|r 914 RAIFGEKAVDVRDTSLRVPSGVSGTVVDV 942 (1386) Q Consensus 914 ~~i~~~~~~~~~d~~~~~~~g~~g~v~~~ 942 (1386) .|++...+++.+|+.+......+..+... T Consensus 706 VAvmsy~GYN~EDAIIink~sv~rg~f~s 734 (1101) T PRK08565 706 VAVLSYTGYNIEDAIIMNKASIERGLARS 734 (1101) T ss_pred EEEECCCCCCHHHHHEECCCHHHCCCEEE T ss_conf 99976778675451000211112387068 No 24 >TIGR02013 rpoB DNA-directed RNA polymerase, beta subunit; InterPro: IPR010243 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. This entry describes orthologues of the beta subunit of bacterial RNA polymerase. The core enzyme consists of two alpha chains, one beta chain, and one beta' subunit.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006350 transcription. Probab=96.20 E-value=0.00031 Score=47.99 Aligned_cols=266 Identities=20% Similarity=0.161 Sum_probs=148.6 Q ss_pred HCCCEEECCCCC-CCCCCCCCCEEEECCCCCCCCCCCCCEEECCCC-----------CCCCCCCCCCCCCCCCCCCCEEE Q ss_conf 065301021244-343356553025215665564446630111465-----------55445473224453044797720 Q gi|254780143|r 722 SSGAAIVAKRAG-IVEQVDAIRIVIRSVEGDLDPSTSGVDIYRLMK-----------FQRSNQNTCVNQRPLVKVGDEVR 789 (1386) Q Consensus 722 ~s~~~i~a~~~g-~v~~vd~~~i~i~~~~~~~~~~~~~~~~y~~~~-----------~~~~~~~~~~~q~p~V~~g~~~~ 789 (1386) .....+.|...| -...+..+.+......+.+.-+.+.-++=.|-+ ++|--==+.-++.|+|=+|.--+ T Consensus 705 i~~d~V~~R~~G~e~~~~~~~~VdyMDVSP~Q~VSVaAaLIPFLEHDDANRALMGsNMQRQAVPLL~seaP~VGTGmE~~ 784 (1449) T TIGR02013 705 IVEDLVVARYRGDEITLVSPDEVDYMDVSPKQIVSVAAALIPFLEHDDANRALMGSNMQRQAVPLLRSEAPLVGTGMEAK 784 (1449) T ss_pred CCCCEEEEEECCCCCCCCCCCCEEEEEECCHHHHHHHHHCCCCCCCCHHHHHHHHCCCHHHCCCCCCCCCCCCCCHHHHH T ss_conf 52356888654773221167602476518323556655426332354145665412531236777877988232027898 Q ss_pred CCCEECCCCCCCCCC-CCCCCCCEEEEEECCCCCCCCCEECCHHHHHCCCCCEEEEEEEEHHHHHCCCCC----CCCCCC Q ss_conf 785203552235786-022237515553135544444200001344258731034666531121147884----002466 Q gi|254780143|r 790 RNDIIADGPSTDLGD-LALGRNMLVAFMPWHGYNFEDSMLISERMVSEDVFTSIHIEEFEVMARDTKLGP----EEITRD 864 (1386) Q Consensus 790 ~~~~l~~~~~~~~~e-l~~G~N~~VA~m~~~GYN~EDaiiin~~~v~rg~~~s~h~~~y~~~~~~~~~g~----~~~~~~ 864 (1386) . -.| -|- .-+=++-+|.+.. ...|.++....+...+..|. .+.+.. T Consensus 785 ~---A~D-----SG~~i~A~~~GvV~~Vd---------------------a~~I~v~~~~~~~~~~~~g~DPd~~~~~Y~ 835 (1449) T TIGR02013 785 V---ARD-----SGAVIVAKRAGVVEYVD---------------------AKRIVVRYKEKEEEETVSGDDPDAAIDIYR 835 (1449) T ss_pred H---HHC-----CCEEEEECCCCEEEEEE---------------------CCEEEEEECCCCCCCCCCCCCCCCCEEEEE T ss_conf 8---623-----54089970697899984---------------------778899314776665557788322025750 Q ss_pred CCC-CCHHHHHHCCCCCCCCCCCEECCCCCEE-ECCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCEECCC------CCC Q ss_conf 654-6867841024137413773313676111-01246777767703466543035555443233202278------853 Q gi|254780143|r 865 IPN-VSEEGLKNIDECGIICVGAEVNPGDILV-GKITPKGESPMTPEEKLLRAIFGEKAVDVRDTSLRVPS------GVS 936 (1386) Q Consensus 865 ~~~-~~~~~~~~ld~~Giv~~G~~V~~gDilv-gk~tp~~~~~~~pe~~~l~~i~~~~~~~~~d~~~~~~~------g~~ 936 (1386) ..+ .+...-.+++..-||.+|.+|..||||. |--|-.| |+.-+-|.+.|.|-+.+++..|+.+.... .++ T Consensus 836 L~Ky~RSNQ~TC~nQ~PiV~~GDrV~~GdvlADGPsT~~G--ELALGrNvlVAFMPW~GYNyEDAIliSERlVkdDvFTS 913 (1449) T TIGR02013 836 LLKYQRSNQDTCINQRPIVSVGDRVEAGDVLADGPSTDLG--ELALGRNVLVAFMPWNGYNYEDAILISERLVKDDVFTS 913 (1449) T ss_pred CCCCCCCCCCCEECCEEECCCCCEECCCCEEECCCCCCCC--CCCCCCCEEEEEECCCCCCHHHHHHHHHHHEECCCCEE T ss_conf 4676314788401453550148681021277347666443--20116710367522788653356355000101376203 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHCC Q ss_conf 22001210145654202667888899999998728889998878889888886258534566555554321135764014 Q gi|254780143|r 937 GTVVDVRIFNRHGIDKNERSISVEREQIELLARDKDDEQVILDRNIYSRLMEILCGQNAVSGPKGFKKSTVLSSDLISEY 1016 (1386) Q Consensus 937 g~v~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1016 (1386) =+|-...+..|..............+.-|....+. ||..|+.-+...+-++++.|+..+.|......-.++..++|..+ T Consensus 914 IHI~E~e~~aRdTKLG~EEiTrDIPNVsE~ALrnL-DE~GIvrIGAeV~~GDILVGKvTPKGEs~~TPEEkLLRAIFGEK 992 (1449) T TIGR02013 914 IHIEEFEVEARDTKLGPEEITRDIPNVSEEALRNL-DENGIVRIGAEVKAGDILVGKVTPKGESELTPEEKLLRAIFGEK 992 (1449) T ss_pred EEEEEEEECCEECCCCCCCCCCCCCCCCHHHHHCC-CCCCEEEEEEEECCCCEEEEEECCCCCCCCCHHHHHHHHCCCCC T ss_conf 78999983010347887012345798408888258-95775887338707747772121889888875677542003341 Q ss_pred CHH Q ss_conf 854 Q gi|254780143|r 1017 PRS 1019 (1386) Q Consensus 1017 ~~~ 1019 (1386) .++ T Consensus 993 A~d 995 (1449) T TIGR02013 993 ARD 995 (1449) T ss_pred HHH T ss_conf 145 No 25 >TIGR02386 rpoC_TIGR DNA-directed RNA polymerase, beta' subunit; InterPro: IPR012754 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. This entry represents the beta-prime subunit, RpoC, found in most bacteria. It excludes some, mainly cyanobacterial, species where RpoC is replaced by two homologous proteins that include an additional domain. One arm of the "claw" is predominantly formed by this subunit, the other being predominantly formed by the beta subunit. The active site of the enzyme is defined by three invariant aspartate residues within the beta-prime subunit .; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006350 transcription. Probab=88.69 E-value=0.52 Score=24.73 Aligned_cols=14 Identities=50% Similarity=0.857 Sum_probs=9.6 Q ss_pred CCCCCEECCCCCEE Q ss_conf 13773313676111 Q gi|254780143|r 882 ICVGAEVNPGDILV 895 (1386) Q Consensus 882 v~~G~~V~~gDilv 895 (1386) |.-|..|++||||. T Consensus 1289 V~dG~~v~~GDIlA 1302 (1552) T TIGR02386 1289 VEDGQKVKPGDILA 1302 (1552) T ss_pred ECCCCCCCCCCEEE T ss_conf 21587547474788 No 26 >cd06232 Peptidase_M14-like_5 Peptidase M14-like domain of a functionally uncharacterized subgroup of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding carboxypeptidases (CPs) which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. Two major subfamilies of the M14 family, defined based on sequence and structural homology, are the A/B and N/E subfamilies. Enzymes belonging to the A/B subfamily are normally synthesized as inactive precursors containing preceding signal peptide, followed by an N-terminal pro-region linked to the enzyme; these proenzymes are called procarboxypeptidases. The A/B enzymes can be further divided based on their substrate specificity; Carboxypeptidase A-like (CPA-like) enzymes favor hydrophobic residues while carboxypeptidase B-like (CPB-like) enzymes only cleave the basic residues lysine or arginine. The Probab=87.41 E-value=0.11 Score=29.75 Aligned_cols=49 Identities=16% Similarity=0.043 Sum_probs=34.5 Q ss_pred CCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCEEEEEECCCCCCCCCCHHH Q ss_conf 588552314336678735888630000787935871569866898675070899 Q gi|254780143|r 1082 RPIQSGDKMAGRHGNKGIVSRILPCEDMPFLKDGTPVDIVLNPLGVPSRMNVGQ 1135 (1386) Q Consensus 1082 R~p~iGDKfasRHGqKGVis~i~p~eDMPf~~dG~~pDIIlNPhgvPSRMtIGq 1135 (1386) |---+||-+.-||+++ ....+-.+-..+=.-||+++|+||+||+==+-. T Consensus 103 RytALGdd~~~r~~~~-----~~E~~~r~~a~~~~ga~lhvnlHGyPaHEWtrP 151 (240) T cd06232 103 RYTALGDDLEYREFPP-----FGEREARHQALAKSGAQLHVNLHGYPAHEWTRP 151 (240) T ss_pred HHHHHHHHHHHHCCCC-----CHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCC T ss_conf 6521013565312687-----317788999999735334785788874120146 No 27 >pfam04941 LEF-8 Late expression factor 8 (LEF-8). Late expression factor 8 (LEF-8) is one of the primary components of RNA polymerase produced by polyhedrosis viruses. LEF-8 shows homology to the second largest subunit of prokaryotic DNA-directed RNA polymerase. Probab=87.26 E-value=0.5 Score=24.88 Aligned_cols=31 Identities=26% Similarity=0.444 Sum_probs=23.8 Q ss_pred EEEEEEEEEECCCCCCCCCCCCCCCCCEEEE Q ss_conf 6999999875588552314336678735888 Q gi|254780143|r 1072 RVVKVFVAMKRPIQSGDKMAGRHGNKGIVSR 1102 (1386) Q Consensus 1072 ~~vKV~ir~~R~p~iGDKfasRHGqKGVis~ 1102 (1386) -.+|+.+-..-.=-.|=|.+|=||||||... T Consensus 706 v~vK~~~v~s~~dleGlKICgiHGQKGVlN~ 736 (748) T pfam04941 706 VYLKITIVTSTNDLEGVKICGIHGQKGVLNG 736 (748) T ss_pred EEEEEEEEEEECCCCEEEEEEECCCCCCCCC T ss_conf 9999999998167560588445066534367 No 28 >KOG0215 consensus Probab=86.26 E-value=0.38 Score=25.69 Aligned_cols=59 Identities=10% Similarity=0.264 Sum_probs=40.4 Q ss_pred HHHHHHHHHHCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCCCCCHHHHHHHHHHH Q ss_conf 55577765420366776770661899888898887630487643440101533233335787898888 Q gi|254780143|r 410 TVGRVKMNMRLNLDTPDDVRHIRKEDIIAIIKILVDLRNGKGTIDDIDNLGNRRVRSVGEMLKNQYRL 477 (1386) Q Consensus 410 ~vgr~~~n~~l~~~~~~~~~~Lt~~d~~~~i~~L~~l~~g~~~~DdiDhlgnkRvr~vgeLl~~~fr~ 477 (1386) ..|+|.+ +++..+......+|+....+.++..+..++.+..+. |.+-.+..|...||+. T Consensus 441 stgnw~i-Krfrmer~gvt~VlsrlSyisaLgmmtri~s~fekt--------rkvsgprSlq~sqwgm 499 (1153) T KOG0215 441 STGNWSI-KRFRMERAGVTQVLSRLSYISALGMMTRINSQFEKT--------RKVSGPRSLQPSQWGM 499 (1153) T ss_pred HCCCHHH-HHHHHHHCCCEEEEHHHHHHHHHHHHEEHHHHHEEE--------HHCCCCCCCCHHHCCC T ss_conf 5275177-776565405203202456565533520130100021--------1016753247112253 No 29 >TIGR02876 spore_yqfD sporulation protein YqfD; InterPro: IPR010690 This family consists of several putative bacterial stage IV sporulation (SpoIV) proteins. YqfD of Bacillus subtilis (P54469 from SWISSPROT) is known to be essential for efficient sporulation although its exact function is unknown .. Probab=72.88 E-value=2 Score=20.56 Aligned_cols=21 Identities=33% Similarity=0.602 Sum_probs=18.6 Q ss_pred CCCCCCCCEEECCCEECCCCC Q ss_conf 530447977207852035522 Q gi|254780143|r 779 RPLVKVGDEVRRNDIIADGPS 799 (1386) Q Consensus 779 ~p~V~~g~~~~~~~~l~~~~~ 799 (1386) .|+|+.||+|++||+|..|.. T Consensus 211 ~~~Vk~GD~VkkGd~Li~G~~ 231 (406) T TIGR02876 211 EAVVKKGDVVKKGDLLISGIL 231 (406) T ss_pred EEEECCCCEECCCCEEEEECC T ss_conf 648548887657717871110 No 30 >COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning] Probab=70.38 E-value=3.7 Score=18.58 Aligned_cols=62 Identities=32% Similarity=0.252 Sum_probs=40.1 Q ss_pred HHCCCEEECCCCCCCCCCCCCCE----EEECCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCEECC Q ss_conf 10653010212443433565530----25215665564446630111465554454732244530447977207852035 Q gi|254780143|r 721 KSSGAAIVAKRAGIVEQVDAIRI----VIRSVEGDLDPSTSGVDIYRLMKFQRSNQNTCVNQRPLVKVGDEVRRNDIIAD 796 (1386) Q Consensus 721 ~~s~~~i~a~~~g~v~~vd~~~i----~i~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~q~p~V~~g~~~~~~~~l~~ 796 (1386) -..|..+.|..+|.|.|.|.-+. +|.. +....+..| -++|...|+.|++|..|+.++. T Consensus 328 a~~Ga~V~A~AdG~VvyA~~l~GYG~vvIld------hG~gy~sly------------g~~~~i~v~~G~~V~AGepIa~ 389 (420) T COG4942 328 ASAGATVKAIADGRVVYADWLRGYGLVVILD------HGGGYHSLY------------GGNQSILVNPGQFVKAGEPIAL 389 (420) T ss_pred CCCCCEEEECCCCEEEECHHHCCCCEEEEEE------CCCCCEEEE------------CCCCEEEECCCCEEECCCCHHH T ss_conf 5999825620695699543325675699997------488537886------------1664221068977556971532 Q ss_pred CCCC Q ss_conf 5223 Q gi|254780143|r 797 GPST 800 (1386) Q Consensus 797 ~~~~ 800 (1386) .-.+ T Consensus 390 ~G~s 393 (420) T COG4942 390 VGSS 393 (420) T ss_pred CCCC T ss_conf 2677 No 31 >TIGR00432 arcsn_tRNA_tgt archaeosine tRNA-ribosyltransferase; InterPro: IPR004804 The archaeosine tRNA-guanine transglycosylase (tgt) differs from the tgt of Escherichia coli and other bacteria in the site of action and the modification that results. It exchanges 7-cyano-7-deazaguanine (preQ0) with guanine at position 15 of archaeal tRNA; this nucleotide is subsequently converted to archaeosine, found exclusively in the archaea. In contrast, bacterial tgt catalyzes the exchange of preQ0 or preQ1 for the guanine base at position 34; this nucleotide is subsequently modified to queuosine (IPR004803 from INTERPRO). Archaeoglobus fulgidus has both enzymes. . Probab=70.09 E-value=1.7 Score=21.06 Aligned_cols=45 Identities=20% Similarity=0.352 Sum_probs=29.9 Q ss_pred CCEEEEEEEEEEEEEECCCCC---CCCCCCEEEEEEEEEEEECCEECCCCCEE Q ss_conf 975335899999999317876---65520001223678751000026896289 Q gi|254780143|r 96 LTYAVPLKITLRLIVFDVDEF---TGAKSIKDIKEQSIYMGDLPLMTKDGTFV 145 (1386) Q Consensus 96 lTYsapL~V~i~l~v~~~~~~---~~~k~~~~ike~~V~lG~IPiMt~~GyFI 145 (1386) .-+.|.+-++=.+++|...+. .-.+.+.. +.=-|+||||+||+|= T Consensus 44 K~~GA~~VITN~YIIYR~PelRE~AL~~GVH~-----~~~~D~P~MTDSGSyQ 91 (658) T TIGR00432 44 KKFGAEIVITNAYIIYRSPELRERALEDGVHR-----LLDFDGPVMTDSGSYQ 91 (658) T ss_pred CCCCCCEEEECEEEEECCHHHHHHHHHCCCCE-----EECCCCCEEECCCCEE T ss_conf 67776279832066754813588986347644-----4207886430576311 No 32 >pfam06898 YqfD Putative stage IV sporulation protein YqfD. This family consists of several putative bacterial stage IV sporulation (SpoIV) proteins. YqfD of Bacillus subtilis is known to be essential for efficient sporulation although its exact function is unknown. Probab=68.52 E-value=3.9 Score=18.44 Aligned_cols=22 Identities=41% Similarity=0.699 Sum_probs=16.9 Q ss_pred CCCCCCCCEEECCCEECCCCCC Q ss_conf 5304479772078520355223 Q gi|254780143|r 779 RPLVKVGDEVRRNDIIADGPST 800 (1386) Q Consensus 779 ~p~V~~g~~~~~~~~l~~~~~~ 800 (1386) .|+|+.||.|++||+|..|..- T Consensus 209 ~p~Vk~GD~VkkGqiLVsG~i~ 230 (383) T pfam06898 209 TAVVKVGDVVKKGDILVSGQIG 230 (383) T ss_pred EEEECCCCEECCCCEEEEEEEC T ss_conf 1776589987789899963565 No 33 >PRK13487 chemoreceptor glutamine deamidase CheD; Provisional Probab=67.27 E-value=6.2 Score=16.97 Aligned_cols=27 Identities=15% Similarity=0.139 Sum_probs=11.3 Q ss_pred HHCCCCCCCCCCCCCCCEEEEECCCCC Q ss_conf 734741000013666776999623459 Q gi|254780143|r 302 CLCGLYVAEDIVNGETGEIYIEAGDVI 328 (1386) Q Consensus 302 ~l~~~~~~~~~~d~~~gei~~~~~~~~ 328 (1386) .+-|..--.-.+++.+|+++.+..... T Consensus 147 DlGG~~gRkV~F~p~tG~v~~k~l~~~ 173 (201) T PRK13487 147 DLLDIYPRKVYFFPTTGKVLVKKLKHA 173 (201) T ss_pred ECCCCCCCEEEEECCCCEEEEEECCCC T ss_conf 659997757999899980877862653 No 34 >TIGR01945 rnfC electron transport complex, RnfABCDGE type, C subunit; InterPro: IPR010208 The six subunit complex RnfABCDGE in Rhodobacter capsulatus encodes an apparent NADH oxidoreductase responsible for electron transport to nitrogenase, necessary for nitrogen fixation . A closely related complex in Escherichia coli, RsxABCDGE (Reducer of SoxR), reduces the 2Fe-2S-containing superoxide sensor SoxR, active as a transcription factor when oxidized . This family of putative NADH oxidoreductase complexes exists in many of the same species as the related NQR, a Na(+)-translocating NADH-quinone reductase (IPR003667 from INTERPRO), but is distinct. This entry describes the C subunit.; GO: 0051539 4 iron 4 sulfur cluster binding, 0006118 electron transport, 0016020 membrane. Probab=64.17 E-value=3.7 Score=18.60 Aligned_cols=107 Identities=22% Similarity=0.295 Sum_probs=49.4 Q ss_pred CCCCCCCCCCCCCEEECCCEECCCCCCCCCCC---CCCC-CCEEEEEE--CCCCCCCCCEECCHHH--HHCCCCCEEEEE Q ss_conf 32244530447977207852035522357860---2223-75155531--3554444420000134--425873103466 Q gi|254780143|r 774 TCVNQRPLVKVGDEVRRNDIIADGPSTDLGDL---ALGR-NMLVAFMP--WHGYNFEDSMLISERM--VSEDVFTSIHIE 845 (1386) Q Consensus 774 ~~~~q~p~V~~g~~~~~~~~l~~~~~~~~~el---~~G~-N~~VA~m~--~~GYN~EDaiiin~~~--v~rg~~~s~h~~ 845 (1386) .=.+=.|+|+.||.|.+||.||++..+-.--+ -.|. -.+..+.. -+|++.. ||+|..-. -||..=+ T Consensus 39 IG~p~~p~V~~GD~VLkGq~Ia~~~G~~sap~HaPtSG~v~~I~~~~~pH~sGlp~~-~i~I~~Dgd~~e~w~e~----- 112 (444) T TIGR01945 39 IGAPAEPIVKVGDKVLKGQLIAKADGFVSAPIHAPTSGTVVAIEERVVPHASGLPVP-AIVIEPDGDEEERWIER----- 112 (444) T ss_pred CCCCCCCEECCCCEEECCCEECCCCCEEEEEEECCCCCEEEEEEEEECCCCCCCCCC-EEEECCCCCCHHHCCCC----- T ss_conf 787777300278652066110067740787000781106887410311578888744-58867789802105765----- Q ss_pred EEEHHHHHCCCCCCCCCCCCCCC-CHHHHHHCCCCCCCCCCCEECCCCCEEECCCCCC Q ss_conf 65311211478840024666546-8678410241374137733136761110124677 Q gi|254780143|r 846 EFEVMARDTKLGPEEITRDIPNV-SEEGLKNIDECGIICVGAEVNPGDILVGKITPKG 902 (1386) Q Consensus 846 ~y~~~~~~~~~g~~~~~~~~~~~-~~~~~~~ld~~Giv~~G~~V~~gDilvgk~tp~~ 902 (1386) .+.-.+-.+. .+..+...-+.||+-.|=-..|-=+ |+.|+. T Consensus 113 -------------~~~~~~~~~l~~~~i~~~I~~AGIvGlGGAtFPthv---KL~~pp 154 (444) T TIGR01945 113 -------------LEPIDDESSLSPEEILEKIRAAGIVGLGGATFPTHV---KLNPPP 154 (444) T ss_pred -------------CCCCCCHHHCCHHHHHHHHHHCCCCCCCCCHHHHHH---HCCCCC T ss_conf -------------467887331487999999997086556755201011---038586 No 35 >TIGR02248 mutH_TIGR DNA mismatch repair endonuclease MutH; InterPro: IPR004230 MutS, MutL and MutH are the three essential proteins for initiation of methyl-directed DNA mismatch repair to correct mistakes made during DNA replication in Escherichia coli. MutH cleaves a newly synthesized and unmethylated daughter strand 5' to the sequence d(GATC) in a hemi-methylated duplex. Activation of MutH requires the recognition of a DNA mismatch by MutS and MutL .; GO: 0004519 endonuclease activity, 0006304 DNA modification. Probab=57.81 E-value=3.7 Score=18.60 Aligned_cols=14 Identities=21% Similarity=0.693 Sum_probs=5.5 Q ss_pred CEEEEEECCCCCHH Q ss_conf 82999971878703 Q gi|254780143|r 200 DIIHVRIDRRRKVP 213 (1386) Q Consensus 200 d~iyvrIdr~rKIP 213 (1386) -+||+=|.=.|-|| T Consensus 113 ~vLWiPieG~R~ip 126 (220) T TIGR02248 113 RVLWIPIEGERHIP 126 (220) T ss_pred EEEEEEEECCCEEE T ss_conf 24430331240111 No 36 >KOG0318 consensus Probab=55.99 E-value=8.7 Score=15.90 Aligned_cols=41 Identities=20% Similarity=0.266 Sum_probs=24.2 Q ss_pred HHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCEEE Q ss_conf 200112234444432234665543221111044430356676521021 Q gi|254780143|r 536 THTRRLSALGQGGVARARAGVEMRDVHPTHYGRICPAETSEGHNIGLV 583 (1386) Q Consensus 536 th~RR~s~lgpggl~r~~~~~evR~ih~s~~GriCPieTPEG~n~GLv 583 (1386) .|.+-+++|. +..+.+.+ +-.||-|.||--+.--|.++-|. T Consensus 318 GHnK~ITaLt---v~~d~~~i----~SgsyDG~I~~W~~~~g~~~~~~ 358 (603) T KOG0318 318 GHNKSITALT---VSPDGKTI----YSGSYDGHINSWDSGSGTSDRLA 358 (603) T ss_pred CCCCCEEEEE---ECCCCCEE----EEECCCCEEEEEECCCCCCCCCC T ss_conf 4564316999---86999779----95045756788763776125425 No 37 >pfam04567 RNA_pol_Rpb2_5 RNA polymerase Rpb2, domain 5. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Domain 5, is also known as the external 2 domain. Probab=55.13 E-value=6.2 Score=16.95 Aligned_cols=24 Identities=33% Similarity=0.368 Sum_probs=19.2 Q ss_pred CCCCEEECCHHHHCCEEEECCCCE Q ss_conf 348166429677376379616534 Q gi|254780143|r 610 VTNDVVYLSAMEEENRYIAQANSS 633 (1386) Q Consensus 610 ~~~~i~~l~~~~e~~~~Ia~~~~~ 633 (1386) ..+.|+|++++|++..+||..... T Consensus 6 ~~gvIEyiD~eEe~~~~IA~~~~d 29 (46) T pfam04567 6 KEGVIEYLDAEEEETAMIAMSPED 29 (46) T ss_pred HCCCEEEECHHHHHCCEEEECHHH T ss_conf 449789876434106079837878 No 38 >KOG0677 consensus Probab=54.66 E-value=6.1 Score=17.02 Aligned_cols=55 Identities=36% Similarity=0.482 Sum_probs=31.7 Q ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHH-CCCCCCCCCCCHHHHHHHHHHHHCCCCEEE Q ss_conf 986989999861110011015999988763-878789978667789999999854200276 Q gi|254780143|r 1304 AYGAAYVLQEMLTIKSDDVVGRTRVYESIV-AGNDTFETGTPESFNVLVKEMQALGLSIDL 1363 (1386) Q Consensus 1304 AyGAa~~LqE~Lt~kSDdv~gr~~~~~~iv-~g~~~~~~~~pesf~vl~~El~~l~l~~~~ 1363 (1386) +-|.|-+| .=||.+-||.-|...|..|| .|-...-||.|- -|-+||+-|-|+--| T Consensus 272 ~~G~aell--F~~iQaaDiD~R~~lYkhIVLSGGstMYPGLPS---RLEkElkqlyl~rVL 327 (389) T KOG0677 272 GPGVAELL--FNTIQAADIDIRSELYKHIVLSGGSTMYPGLPS---RLEKELKQLYLDRVL 327 (389) T ss_pred CCCHHHHH--HHHHHHHCCCHHHHHHHHEEECCCCCCCCCCCH---HHHHHHHHHHHHHHH T ss_conf 87689999--877777333228888867564388524899737---899999999999997 No 39 >pfam04564 U-box U-box domain. This domain is related to the Ring finger pfam00097 but lacks the zinc binding residues. Probab=54.59 E-value=5.8 Score=17.17 Aligned_cols=44 Identities=25% Similarity=0.315 Sum_probs=19.5 Q ss_pred EECCCCCCCCCCCEEE--EEEEEEECCCHHHHHCCCCCCCCCCCEEECCCC Q ss_conf 9868988402685048--846454101102110000236873110307998 Q gi|254780143|r 1236 LYDGLTGEPFDRPVTV--GYIYMLKLNHMVSDKVYARSTGSYSLVTQQPLG 1284 (1386) Q Consensus 1236 lydG~TGe~~~~~Ifv--G~~YyqkL~HMV~DKiHARstGP~sllTrQP~e 1284 (1386) +.+.+|++-|..||.. |.+|=.. .+.. |-++.|+.+.+||||+. T Consensus 5 f~CPIt~~iM~dPV~~~~G~tyER~---~I~~--wl~~~~~~~P~T~~~l~ 50 (74) T pfam04564 5 FLDPITLELMKDPVILPSGITYDRS---TIER--HLLSVDPTDPFTREPLT 50 (74) T ss_pred CCCCCCCCHHCCCEECCCCCEECHH---HHHH--HHHHCCCCCCCCCCCCC T ss_conf 3886854652297098999888799---9999--99978996887787488 No 40 >TIGR01812 sdhA_frdA_Gneg succinate dehydrogenase or fumarate reductase, flavoprotein subunit; InterPro: IPR014006 Succinate:quinone oxidoreductase (1.3.5.1 from EC) refers collectively to succinate:quinone reductase (SQR, or Complex II) and quinol:fumarate reductase (QFR) . SQR is found in aerobic organisms, and catalyses the oxidation of succinate to fumarate in the citric acid cycle and donates the electrons to quinone in the membrane. QFR can be found in anaerobic cells respiring with fumarate as terminal electron acceptor. SQR and QFR are very similar in composition and structure, despite catalysing opposite reactions in vivo. They are thought to have evolved from a common ancestor, and in Escherichia coli they are capable of functionally replacing each other . Succinate:quinone oxidoreductases consist of a peripheral domain, exposed to the cytoplasm in bacteria and to the matrix in mitochondria, and a membrane-integral anchor domain that spans the membrane (Fig. 1). The peripheral part, which contains the dicarboxylate binding site, is composed of a flavoprotein subunit, with one covalently bound FAD, and an iron-sulphur protein subunit containing three iron-sulphur clusters. The membrane-integral domain functions to anchor the peripheral domain to the membrane and is required for quinone reduction and oxidation. The anchor domain shows the largest variability in composition and primary sequence, being composed either of one large subunit, or two smaller subunits, which may, or may not, contain protoheme groups. This entry represents the flavoprotein subunit found in both the SQR and QFR enzymes. This subunit contains an N-terminal domain which binds the FAD cofactor, a central catalytic domain with an unsual fold, and a C-terminal domain whose role is unclear , , . The dicarboxylate binding site is located between the FAD and catalytic domains.. Probab=54.39 E-value=10 Score=15.33 Aligned_cols=26 Identities=27% Similarity=0.335 Sum_probs=13.9 Q ss_pred CCCEECC----CCCCCCCCCCCCCCCEEEE Q ss_conf 7852035----5223578602223751555 Q gi|254780143|r 790 RNDIIAD----GPSTDLGDLALGRNMLVAF 815 (1386) Q Consensus 790 ~~~~l~~----~~~~~~~el~~G~N~~VA~ 815 (1386) .|+++.. +...-.|=+|+|-=++|.+ T Consensus 395 ~G~~~~~D~~~~~~iv~GLfAaGE~ACVSV 424 (636) T TIGR01812 395 RGQVIGEDAKNNDSIVKGLFAAGECACVSV 424 (636) T ss_pred CCEEECCCCCCCCCCCCCEEEEEEEEECCC T ss_conf 424870167888862101022010222034 No 41 >pfam07508 Recombinase Recombinase. This domain is usually found associated with pfam00239 in putative integrases/recombinases of mobile genetic elements of diverse bacteria and phages. Probab=51.99 E-value=12 Score=14.86 Aligned_cols=48 Identities=13% Similarity=0.131 Sum_probs=35.7 Q ss_pred CCCCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCEEEEEEEEEEC Q ss_conf 788999999999986868998699986898840268504884645410 Q gi|254780143|r 1212 DGADEEAINSMLRMADLDESGQSILYDGLTGEPFDRPVTVGYIYMLKL 1259 (1386) Q Consensus 1212 ~g~~~~~i~~~L~~aG~~~~Gke~lydG~TGe~~~~~IfvG~~YyqkL 1259 (1386) +|.+...|.+.|.+.|+...+.-......-=..+.+++++|.+.|.|- T Consensus 16 ~g~s~~~Ia~~Ln~~g~~~~~~~~w~~~~I~~iL~N~~Y~G~~~~~~~ 63 (101) T pfam07508 16 EGKSLREIARYLNERGIPTPRGKKWTKSTVRRILTNPAYIGRLVWGKT 63 (101) T ss_pred CCCCHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHCCCEEEEEEEECCE T ss_conf 389999999999874996778982464435566208615899997652 No 42 >pfam11963 DUF3477 Protein of unknown function (DUF3477). This family of proteins is functionally uncharacterized. This protein is found in viruses. Proteins in this family are typically between 246 to 7162 amino acids in length. This protein is found associated with pfam08716, pfam01661, pfam05409, pfam08717, pfam01831, pfam08715, pfam08710. Probab=49.95 E-value=10 Score=15.36 Aligned_cols=12 Identities=25% Similarity=0.177 Sum_probs=6.0 Q ss_pred HHCCCCCEEEEE Q ss_conf 425873103466 Q gi|254780143|r 834 VSEDVFTSIHIE 845 (1386) Q Consensus 834 v~rg~~~s~h~~ 845 (1386) -..|-++|-|++ T Consensus 196 G~KGs~~s~h~r 207 (355) T pfam11963 196 GNKGSVTSDHFR 207 (355) T ss_pred CCCCCCCCCCEE T ss_conf 766774455244 No 43 >PRK10871 nlpD lipoprotein NlpD; Provisional Probab=47.92 E-value=14 Score=14.43 Aligned_cols=20 Identities=35% Similarity=0.416 Sum_probs=11.2 Q ss_pred CCCCCCCCCCEEECCCEECC Q ss_conf 44530447977207852035 Q gi|254780143|r 777 NQRPLVKVGDEVRRNDIIAD 796 (1386) Q Consensus 777 ~q~p~V~~g~~~~~~~~l~~ 796 (1386) +++.+|+.|+.|++||.|+. T Consensus 324 n~~~lVkeg~~V~~Gq~Ia~ 343 (374) T PRK10871 324 NDTMLVREQQEVKAGQKIAT 343 (374) T ss_pred CCCCCCCCCCEECCCCEEEE T ss_conf 66266788899889998986 No 44 >PRK11637 hypothetical protein; Provisional Probab=46.43 E-value=15 Score=14.27 Aligned_cols=62 Identities=26% Similarity=0.235 Sum_probs=33.8 Q ss_pred HHHCCCEEECCCCCCCCCCCCCCEEEECCCCCCCCCCCCCEEECCCCCCCCCCCCC-CCCCCCCCCCCEEECCCEECC Q ss_conf 01065301021244343356553025215665564446630111465554454732-244530447977207852035 Q gi|254780143|r 720 AKSSGAAIVAKRAGIVEQVDAIRIVIRSVEGDLDPSTSGVDIYRLMKFQRSNQNTC-VNQRPLVKVGDEVRRNDIIAD 796 (1386) Q Consensus 720 ~~~s~~~i~a~~~g~v~~vd~~~i~i~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~-~~q~p~V~~g~~~~~~~~l~~ 796 (1386) +-..|..|.|..+|.|.+++..... .+....+|.... ...| .++...|..|++|.+|++|+. T Consensus 311 ~a~~Gt~V~Av~~G~Vv~a~~~~gy--G~~ViIdHG~g~-------------~TlYah~s~l~v~~Gq~V~~Gq~Ig~ 373 (404) T PRK11637 311 GASEGTEVKAIADGRVILADWLQGY--GLVVVVEHGKGD-------------MSLYGYNQSALVSVGAQVRAGQPIAL 373 (404) T ss_pred ECCCCCEEEEECCEEEEEEEECCCC--CCEEEEECCCCC-------------EEECCCCCCCCCCCCCEECCCCEEEE T ss_conf 5699980541017699991140888--857999869946-------------57152889588899799899996987 No 45 >PRK10556 hypothetical protein; Provisional Probab=45.42 E-value=7.9 Score=16.23 Aligned_cols=36 Identities=19% Similarity=0.358 Sum_probs=25.5 Q ss_pred CCCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHH Q ss_conf 007879358715698668986750708999999999 Q gi|254780143|r 1107 EDMPFLKDGTPVDIVLNPLGVPSRMNVGQIFETHLG 1142 (1386) Q Consensus 1107 eDMPf~~dG~~pDIIlNPhgvPSRMtIGqllE~~lG 1142 (1386) +..|---.|-.++-.==||||-|||-.-..|+++.| T Consensus 76 ~~FPlyl~g~~~~~yGIpHGFsSR~~L~ryl~~mf~ 111 (111) T PRK10556 76 QQFPLYLAGERHEHYGIPHGFSSRVALERYLNGLFG 111 (111) T ss_pred HCCCCCCCCCCHHHHCCCCCCCHHHHHHHHHHHHCC T ss_conf 026876365536551788562359999999998539 No 46 >COG0511 AccB Biotin carboxyl carrier protein [Lipid metabolism] Probab=44.91 E-value=15 Score=14.11 Aligned_cols=35 Identities=23% Similarity=0.423 Sum_probs=26.5 Q ss_pred CCCCCCCCEEECCCEECCCCCCCC-CCCCCCCCCEE Q ss_conf 530447977207852035522357-86022237515 Q gi|254780143|r 779 RPLVKVGDEVRRNDIIADGPSTDL-GDLALGRNMLV 813 (1386) Q Consensus 779 ~p~V~~g~~~~~~~~l~~~~~~~~-~el~~G~N~~V 813 (1386) ++.|+.||.|++||.++--.+|++ +++.+...-+| T Consensus 83 ~~~V~vGd~V~~Gq~l~IiEAMKmeneI~A~~~G~V 118 (140) T COG0511 83 KPFVEVGDTVKAGQTLAIIEAMKMENEIEAPADGVV 118 (140) T ss_pred EEEECCCCEECCCCEEEEEEEEECCCEECCCCCCEE T ss_conf 987657999758999999982001553228999689 No 47 >TIGR01176 fum_red_Fp fumarate reductase, flavoprotein subunit; InterPro: IPR005884 In bacteria two distinct, membrane-bound, enzyme complexes are responsible for the interconversion of fumarate and succinate (1.3.99.1 from EC): fumaratereductase (Frd) is used in anaerobic growth, and succinate dehydrogenase (Sdh) is used in aerobic growth. Both complexes consist of two main components: a membrane-extrinsic component composed of a FAD-binding flavoprotein and an iron-sulphur protein; and an hydrophobic component composed of a membrane anchor protein and/or a cytochrome B. In eukaryotes mitochondrial succinate dehydrogenase (ubiquinone) (1.3.5.1 from EC) is an enzyme composed of two subunits: a FAD flavoprotein and and iron-sulphur protein. The flavoprotein subunit is a protein of about 60 to 70 Kd to which FAD is covalently bound to a histidine residue which is located in the N-terminal section of the protein . The sequence around that histidine is well conserved in Frd and Sdh from various bacterial and eukaryotic species . The terms succinate dehydrogenase and fumarate reductase may be used interchangeably in certain systems. However, a number of species have distinct complexes, with the fumarate reductase active under anaerobic conditions. This model represents the fumarate reductase flavoprotein subunit from several such species in which a distinct succinate dehydrogenase is also found. ; GO: 0016491 oxidoreductase activity, 0006118 electron transport, 0009061 anaerobic respiration. Probab=44.28 E-value=12 Score=14.81 Aligned_cols=14 Identities=14% Similarity=0.119 Sum_probs=6.2 Q ss_pred HHHHHHCCCCCCCC Q ss_conf 67841024137413 Q gi|254780143|r 870 EEGLKNIDECGIIC 883 (1386) Q Consensus 870 ~~~~~~ld~~Giv~ 883 (1386) ...+.+.++||-.+ T Consensus 553 ~HtL~f~~~d~T~r 566 (585) T TIGR01176 553 KHTLAFRESDGTLR 566 (585) T ss_pred HHHHHEECCCCEEE T ss_conf 65331021788124 No 48 >KOG0772 consensus Probab=44.20 E-value=11 Score=15.14 Aligned_cols=13 Identities=31% Similarity=0.777 Sum_probs=8.9 Q ss_pred CEEEEEECCCCCC Q ss_conf 7156986689867 Q gi|254780143|r 1116 TPVDIVLNPLGVP 1128 (1386) Q Consensus 1116 ~~pDIIlNPhgvP 1128 (1386) ...|+|+|||+.| T Consensus 511 ~~~~~ii~phalp 523 (641) T KOG0772 511 LSEDVIINPHALP 523 (641) T ss_pred CCCCEEECCCCCH T ss_conf 4766677555401 No 49 >PRK13497 chemoreceptor glutamine deamidase CheD; Provisional Probab=40.20 E-value=18 Score=13.62 Aligned_cols=43 Identities=16% Similarity=0.233 Sum_probs=19.1 Q ss_pred CCCEEEECCCCCCCCCCCEEEEEEEEEECCCHHHHHCCCCCCCCCCCEEECCCCC Q ss_conf 9869998689884026850488464541011021100002368731103079986 Q gi|254780143|r 1231 SGQSILYDGLTGEPFDRPVTVGYIYMLKLNHMVSDKVYARSTGSYSLVTQQPLGG 1285 (1386) Q Consensus 1231 ~Gke~lydG~TGe~~~~~IfvG~~YyqkL~HMV~DKiHARstGP~sllTrQP~eG 1285 (1386) +|..+.|+..||+.+ ..||.+.-.+.+-.|..=+ .+-++|++| T Consensus 137 ~gRkI~F~p~tGrv~----------~~~l~~~~~~~~~~~e~~~--~~~~~p~~g 179 (184) T PRK13497 137 HGRKLEYWPVSGRAR----------QYPLTGAETQRTVALEQRP--AAPQKPVET 179 (184) T ss_pred CCCEEEEECCCCCEE----------EEECCCCCHHHHHHHHHHC--CCCCCCCCC T ss_conf 775799988998173----------4533564427889875330--144689998 No 50 >pfam10258 RNA_GG_bind PHAX RNA-binding domain. RNA_GG_bind is the highly conserved U3 snoRNA-binding domain of PHAX (phosphorylated adaptor for RNA export) whose function is to transport U3 snoRNA from the nucleus after transcription. It is characterized by having two pairs of adjacent glycines, as GGx12GG. Probab=40.07 E-value=10 Score=15.32 Aligned_cols=20 Identities=30% Similarity=0.797 Sum_probs=11.6 Q ss_pred CCCCEEECCEEEEEEEEECCCCC-EEEC Q ss_conf 89628986821468665122785-2120 Q gi|254780143|r 140 KDGTFVIKGIQRIVVSQLHRSPG-IHFD 166 (1386) Q Consensus 140 ~~GyFIING~ERVIVsQl~RSPG-Vyf~ 166 (1386) .+|.++-||+-| |+|| |||. T Consensus 43 ~GG~~t~dG~Rr-------RTpGGVF~~ 63 (87) T pfam10258 43 NGGQLTADGSRR-------RTPGGVFLN 63 (87) T ss_pred CCCEECCCCCCC-------CCCCCCHHH T ss_conf 499876479736-------778641458 No 51 >TIGR00407 proA gamma-glutamyl phosphate reductase; InterPro: IPR000965 Gamma-glutamyl phosphate reductase (1.2.1.41 from EC) (GPR) is the enzyme that catalyzes the second step in the biosynthesis of proline from glutamate, the NADP-dependent reduction of L-glutamate 5-phosphate into L-glutamate 5-semialdehyde and phosphate. In bacteria (gene proA) and yeast (gene PRO2), GPR is a monofunctional protein, while in plants and mammals, it is a bifunctional enzyme (P5CS) that consists of two domains, an N-terminal glutamate 5-kinase domain (2.7.2.11 from EC) and a C-terminal GPR domain. ; GO: 0004350 glutamate-5-semialdehyde dehydrogenase activity, 0006561 proline biosynthetic process. Probab=39.90 E-value=9.7 Score=15.58 Aligned_cols=42 Identities=26% Similarity=0.236 Sum_probs=18.4 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCH----HCCCCCCCCHHHHHHHHH Q ss_conf 998888988876304876434401----015332333357878988 Q gi|254780143|r 434 EDIIAIIKILVDLRNGKGTIDDID----NLGNRRVRSVGEMLKNQY 475 (1386) Q Consensus 434 ~d~~~~i~~L~~l~~g~~~~DdiD----hlgnkRvr~vgeLl~~~f 475 (1386) .+++.-++.+.+|....|.+-+-- -|.-.|++.+=--+...| T Consensus 67 ~~i~~~V~~v~~L~DPvG~v~~~~~lD~GL~l~rv~~PLGV~GvIy 112 (415) T TIGR00407 67 KGIADGVKDVIELADPVGKVIDGRELDSGLVLERVRVPLGVLGVIY 112 (415) T ss_pred HHHHHHHHHHHHCCCCCCHHHCCCCCCCCCEEEEEECCEEEEEEEE T ss_conf 9999999998513777010202415147867876524612577663 No 52 >pfam11783 Cytochrome_cB Cytochrome c bacterial. This is a family of long bacterial cytochrome c proteins, found in Proteobacteria and Chlorobi families. Probab=39.82 E-value=18 Score=13.58 Aligned_cols=44 Identities=30% Similarity=0.221 Sum_probs=32.7 Q ss_pred CCHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCEEEEEEEEEECCCHHHHHCCCCC Q ss_conf 8999999999986868998699986898840268504884645410110211000023 Q gi|254780143|r 1214 ADEEAINSMLRMADLDESGQSILYDGLTGEPFDRPVTVGYIYMLKLNHMVSDKVYARS 1271 (1386) Q Consensus 1214 ~~~~~i~~~L~~aG~~~~Gke~lydG~TGe~~~~~IfvG~~YyqkL~HMV~DKiHARs 1271 (1386) .-...|..-+..+|++++|+- =||=-..|+.|.|||.=|=.|=+ T Consensus 113 Dw~~Ai~~G~~~~gl~~sg~~--------------~fv~T~~y~~inHmVaPKe~AL~ 156 (166) T pfam11783 113 DWDKAIAAGMKAAGLKYSGPY--------------GFVETEMYWPINHMVAPKEKALK 156 (166) T ss_pred CHHHHHHHHHHHCCCCCCCCC--------------CEEEEEEEEEECCCCCCHHHCCC T ss_conf 899999999986299855641--------------30999898540430687897226 No 53 >PRK04049 30S ribosomal protein S8e; Validated Probab=39.57 E-value=18 Score=13.55 Aligned_cols=18 Identities=33% Similarity=0.595 Sum_probs=12.5 Q ss_pred CCCCCCCCCCEEEEEECC Q ss_conf 143366787358886300 Q gi|254780143|r 1089 KMAGRHGNKGIVSRILPC 1106 (1386) Q Consensus 1089 KfasRHGqKGVis~i~p~ 1106 (1386) |..||-||-|||.-++=+ T Consensus 109 ~VTsRPGQ~G~vnavLi~ 126 (127) T PRK04049 109 KVTSRPGQDGVVNAVLIE 126 (127) T ss_pred EEECCCCCCCEEEEEEEC T ss_conf 982699987678799944 No 54 >TIGR02388 rpoC2_cyan DNA-directed RNA polymerase, beta'' subunit; InterPro: IPR012756 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. The family consists of the product of the rpoC2 gene, a subunit of DNA-directed RNA polymerase of cyanobacteria and chloroplasts. RpoC2 corresponds largely to the C-terminal region of the RpoC (the beta' subunit) of other bacteria. Members of this family are designated beta'' in chloroplasts/plastids, and beta' (confusingly) in Cyanobacteria, where RpoC1 is called beta' in chloroplasts/plastids and gamma in Cyanobacteria. We prefer to name this family beta'', after its organellar members, to emphasize that this RpoC1 and RpoC2 together replace RpoC in other bacteria.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006350 transcription. Probab=39.30 E-value=19 Score=13.52 Aligned_cols=52 Identities=31% Similarity=0.404 Sum_probs=42.4 Q ss_pred CCCHHHHCCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHCCCCCEEEEE Q ss_conf 0266773474100001366677699962345998999999865654102442 Q gi|254780143|r 297 GITSDCLCGLYVAEDIVNGETGEIYIEAGDVIDEKSLEEIFHSEIRDIPILY 348 (1386) Q Consensus 297 ~v~~~~l~~~~~~~~~~d~~~gei~~~~~~~~~~~~l~~~~~~~~~~~~~l~ 348 (1386) .-...+++|+.++.|++++++++++...+..+..+..+.+..+.+..+.+-- T Consensus 232 ~~l~~RL~GR~~~~Dv~~p~tge~i~~~N~~I~~~Lak~i~~~~~~~V~vRS 283 (1252) T TIGR02388 232 IKLADRLLGRLVAEDVLHPETGEVIVPKNTAIDEDLAKKIEKAGIEEVVVRS 283 (1252) T ss_pred EEECCCCEEEEEHHHCCCCCCCEEEECCCCCCCHHHHHHHHHHHHCEEEECC T ss_conf 9862540223321002476546066116760218999999751115768716 No 55 >PRK06302 acetyl-CoA carboxylase biotin carboxyl carrier protein subunit; Validated Probab=39.16 E-value=19 Score=13.51 Aligned_cols=38 Identities=18% Similarity=0.344 Sum_probs=28.7 Q ss_pred CCCCCCCCCEEECCCEECCCCCCC-CCCCCCCCCCEEEE Q ss_conf 453044797720785203552235-78602223751555 Q gi|254780143|r 778 QRPLVKVGDEVRRNDIIADGPSTD-LGDLALGRNMLVAF 815 (1386) Q Consensus 778 q~p~V~~g~~~~~~~~l~~~~~~~-~~el~~G~N~~VA~ 815 (1386) ..|-|+.|+.|++||+++--.+|+ +++..+.....|.- T Consensus 98 ~~pFV~vGd~V~~Gq~v~iIEaMK~mneI~a~~~G~I~~ 136 (155) T PRK06302 98 APPFVEVGDTVKEGQTLCIIEAMKMMNEIEADKSGVVKE 136 (155) T ss_pred CCCCCCCCCEECCCCEEEEEEECCCCCEEECCCCCEEEE T ss_conf 997424686724898899998424353240698848999 No 56 >cd01778 RASSF1_RA RASSF1 (also known as RASSF3 and NORE1) is a tumour suppressor protein with a C-terminal Ras-associating (RA) domain that binds Ras. RASSF1 also binds the proapoptotic protein kinase MST1 and is thus thought to regulate the proapoptotic signalling pathway. RASSF1 also associates with microtubule-associated proteins like MAP1B and regulates tubulin polymerization. RASSF1 also binds CDC20 and regulates mitosis by inhibiting the anaphase-promoting complex and preventing degradation of cyclin A and cyclin B until the spindle checkpoint becomes fully operational. Probab=37.91 E-value=16 Score=13.94 Aligned_cols=23 Identities=22% Similarity=0.248 Sum_probs=16.5 Q ss_pred ECCCCCCCCCCHHHHHHHHHHHH Q ss_conf 66898675070899999999999 Q gi|254780143|r 1122 LNPLGVPSRMNVGQIFETHLGWA 1144 (1386) Q Consensus 1122 lNPhgvPSRMtIGqllE~~lGka 1144 (1386) ++.+-|-|+||..++|+.+|.|- T Consensus 18 ~k~lhIsS~tTt~eVI~~LL~KF 40 (96) T cd01778 18 AKHLHISSKTTVREVIEALLKKF 40 (96) T ss_pred EEEEEEECCCCHHHHHHHHHHHC T ss_conf 02899825686999999999853 No 57 >TIGR02645 ARCH_P_rylase putative thymidine phosphorylase; InterPro: IPR013466 Proteins in this entry are closely related to characterised examples of thymidine phosphorylase (2.4.2.4 from EC) and pyrimidine nucleoside phosphorylase (2.4.2.2 from EC). Most examples are found in the archaea, but other examples are found in bacteria such as Legionella pneumophila (strain Paris) and Rhodopseudomonas palustris CGA009.; GO: 0009032 thymidine phosphorylase activity. Probab=37.67 E-value=12 Score=14.98 Aligned_cols=53 Identities=36% Similarity=0.559 Sum_probs=28.3 Q ss_pred CEEECCCCCCCCCCCCCCE-EEECCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCEECC Q ss_conf 3010212443433565530-25215665564446630111465554454732244530447977207852035 Q gi|254780143|r 725 AAIVAKRAGIVEQVDAIRI-VIRSVEGDLDPSTSGVDIYRLMKFQRSNQNTCVNQRPLVKVGDEVRRNDIIAD 796 (1386) Q Consensus 725 ~~i~a~~~g~v~~vd~~~i-~i~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~q~p~V~~g~~~~~~~~l~~ 796 (1386) ..|.|..+|+|...|+..+ .|+-.-+-+.+...++.. .||.|+.|++|++|-. T Consensus 420 ~DI~A~~DG~Vt~IDN~~i~~IAr~AGAP~DKgAGv~l-------------------hvK~G~~Vk~GdPL~T 473 (499) T TIGR02645 420 ADIHAETDGYVTEIDNKRITRIARLAGAPNDKGAGVEL-------------------HVKVGDKVKKGDPLYT 473 (499) T ss_pred EEEEECCCCEEHHHHHHHHHHHHHHHCCCCCCCCCEEE-------------------EEEECCEEECCCCCEE T ss_conf 77873688615042388999999871888767684388-------------------8654677203870158 No 58 >TIGR02002 PTS-II-BC-glcB PTS system, glucose-specific IIBC component; InterPro: IPR011299 This entry represents the combined B and C domains of the PTS transport system enzyme II specific for glucose transport . Many of the genes in this family also include an A domain as part of the same polypeptide and thus should be given the name 'PTS system, glucose-specific IIABC component' while the Bacillus subtilus enzyme also contains an enzyme III domain which appears to act independently of the enzyme II domains . This group is most closely related to the N-acetylglucosamine-specific PTS enzymes (IPR010974 from INTERPRO).; GO: 0005355 glucose transmembrane transporter activity, 0015758 glucose transport, 0016021 integral to membrane. Probab=37.44 E-value=16 Score=13.99 Aligned_cols=28 Identities=32% Similarity=0.646 Sum_probs=20.2 Q ss_pred CCCCCCCCHHHHHHCCCCCCCCCCCEEC Q ss_conf 4666546867841024137413773313 Q gi|254780143|r 862 TRDIPNVSEEGLKNIDECGIICVGAEVN 889 (1386) Q Consensus 862 ~~~~~~~~~~~~~~ld~~Giv~~G~~V~ 889 (1386) .+|+.+|....+++|...|+.-+|..|+ T Consensus 471 V~d~~~Vd~~~LK~LGA~GVLvvGnn~Q 498 (518) T TIGR02002 471 VKDIKKVDKAKLKKLGAAGVLVVGNNVQ 498 (518) T ss_pred EHHHCCCCHHHHHHCCCCCEEEECCCEE T ss_conf 4000013436665216771388868701 No 59 >cd01787 GRB7_RA Grb7_RA The RA (RAS-associated like) domain of Grb7. Grb7 is an adaptor molecule that mediates signal transduction from multiple cell surface receptors to various downstream signaling pathways. Grb7 and its related family members Grb10 and Grb14 share a conserved domain architecture that includes an amino-terminal proline-rich region, a central segment termed the GM region (for Grb and Mig) which includes the RA, PIR, and PH domains, and a carboxyl-terminal SH2 domain. Grb7/10/14 family proteins are phosphorylated on serine/threonine as well as tyrosine residues and are mainly localized to the cytoplasm. Probab=36.89 E-value=20 Score=13.27 Aligned_cols=32 Identities=16% Similarity=0.264 Sum_probs=23.4 Q ss_pred CCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 7935871569866898675070899999999999987 Q gi|254780143|r 1111 FLKDGTPVDIVLNPLGVPSRMNVGQIFETHLGWACVG 1147 (1386) Q Consensus 1111 f~~dG~~pDIIlNPhgvPSRMtIGqllE~~lGka~~~ 1147 (1386) |++||+.=-+. ||+|||++.+...+.-|.-+. T Consensus 8 ~~eDgssK~l~-----Vde~mtar~Vc~~L~~KnH~~ 39 (85) T cd01787 8 YSEDGASKSLE-----VDERMTARDVCQLLVDKNHCQ 39 (85) T ss_pred ECCCCCCEEEE-----ECCCCCHHHHHHHHHHHHCCC T ss_conf 83589815899-----879883999999999985768 No 60 >cd01784 rasfadin_RA rasfadin_RA Rasfadin (RASSF2) belongs to a family of Ras effectors/tumor suppressors that includes RASSF1 and NORE1. RASSF2 binds directly to K-Ras in a GTP-dependent manner via its RA (RAS-associated) domain. RASSF2 promotes apoptosis and cell cycle arrest and is frequently down-regulated in lung tumor cell lines Probab=36.21 E-value=21 Score=13.19 Aligned_cols=30 Identities=20% Similarity=0.262 Sum_probs=19.7 Q ss_pred CCCC-CCEEEEEECCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 7935-8715698668986750708999999999999 Q gi|254780143|r 1111 FLKD-GTPVDIVLNPLGVPSRMNVGQIFETHLGWAC 1145 (1386) Q Consensus 1111 f~~d-G~~pDIIlNPhgvPSRMtIGqllE~~lGka~ 1145 (1386) ||+. |.+--+= |-|+||..|+|+.+|.|-- T Consensus 7 FtP~~gS~t~v~-----i~S~~tt~eVI~~LL~KFk 37 (87) T cd01784 7 FTPAYGSVTNVR-----INSTMTTPQVLKLLLNKFK 37 (87) T ss_pred ECCCCCCEEEEE-----EECCCCHHHHHHHHHHHHE T ss_conf 658888547999-----9445769999999998520 No 61 >TIGR03477 DMSO_red_II_gam DMSO reductase family type II enzyme, heme b subunit. This model represents a heme b-binding subunit, typically called the gamma subunit, of various proteins that also contain a molybdopterin subunit and an iron-sulfur protein. The group includes two distinct but very closely related periplasmic proteins of anaerobic respiration, selenate reductase and chlorate reductase. Other members of this family include dimethyl sulphide dehydrogenase and ethylbenzene dehydrogenase. Probab=35.82 E-value=21 Score=13.15 Aligned_cols=24 Identities=42% Similarity=0.601 Sum_probs=20.1 Q ss_pred CCCCCCCCCEEEEEECCCCCCCCC Q ss_conf 860222375155531355444442 Q gi|254780143|r 803 GDLALGRNMLVAFMPWHGYNFEDS 826 (1386) Q Consensus 803 ~el~~G~N~~VA~m~~~GYN~EDa 826 (1386) -.|..|..+-|||.-|.|.|.|-. T Consensus 168 ~~l~~g~~~~vAFAVWdG~n~ER~ 191 (205) T TIGR03477 168 ASLQAGGDSKVAFAVWNGGNAERS 191 (205) T ss_pred CCCCCCCCEEEEEEEECCCCCCCC T ss_conf 321469825579999548676346 No 62 >pfam07304 SRA1 Steroid receptor RNA activator (SRA1). This family consists of several hypothetical mammalian steroid receptor RNA activator proteins. SRA-RNAs likely to encode stable proteins are widely expressed in breast cancer cell lines. SRA-RNA is a steroid receptor co-activator which acts as a functional RNA and is classified as belonging to the growing family of functional non-coding RNAs. Probab=35.74 E-value=21 Score=13.14 Aligned_cols=12 Identities=25% Similarity=0.509 Sum_probs=5.7 Q ss_pred HHHHHHHHHHHH Q ss_conf 988888899998 Q gi|254780143|r 473 NQYRLGLLRMER 484 (1386) Q Consensus 473 ~~fr~~l~rl~r 484 (1386) .+|-+|++||.- T Consensus 125 g~WmvGVKRLI~ 136 (157) T pfam07304 125 SQWMVGVKRLIA 136 (157) T ss_pred HHHHHHHHHHHH T ss_conf 888999999999 No 63 >cd02809 alpha_hydroxyacid_oxid_FMN Family of homologous FMN-dependent alpha-hydroxyacid oxidizing enzymes. This family occurs in both prokaryotes and eukaryotes. Members of this family include flavocytochrome b2 (FCB2), glycolate oxidase (GOX), lactate monooxygenase (LMO), mandelate dehydrogenase (MDH), and long chain hydroxyacid oxidase (LCHAO). In green plants, glycolate oxidase is one of the key enzymes in photorespiration where it oxidizes glycolate to glyoxylate. LMO catalyzes the oxidation of L-lactate to acetate and carbon dioxide. MDH oxidizes (S)-mandelate to phenylglyoxalate. It is an enzyme in the mandelate pathway that occurs in several strains of Pseudomonas which converts (R)-mandelate to benzoate. Probab=35.21 E-value=12 Score=14.92 Aligned_cols=17 Identities=6% Similarity=0.069 Sum_probs=7.3 Q ss_pred CCHHHHHHHHHCCCCCE Q ss_conf 99899999986565410 Q gi|254780143|r 328 IDEKSLEEIFHSEIRDI 344 (1386) Q Consensus 328 ~~~~~l~~~~~~~~~~~ 344 (1386) ++.+......+.+.+-+ T Consensus 181 ~~~~DA~~a~~~G~dgI 197 (299) T cd02809 181 LTPEDALRAVDAGADGI 197 (299) T ss_pred CCHHHHHHHHHCCCCEE T ss_conf 88999999998599889 No 64 >PRK11633 hypothetical protein; Provisional Probab=34.69 E-value=22 Score=13.03 Aligned_cols=13 Identities=23% Similarity=0.363 Sum_probs=4.5 Q ss_pred HHHHHHHHHCCCC Q ss_conf 9999999986868 Q gi|254780143|r 1217 EAINSMLRMADLD 1229 (1386) Q Consensus 1217 ~~i~~~L~~aG~~ 1229 (1386) +.+.+-|+++||+ T Consensus 156 ~~L~~kLr~aGy~ 168 (218) T PRK11633 156 NEIVGKLRLSGYR 168 (218) T ss_pred HHHHHHHHHCCCC T ss_conf 9999999987984 No 65 >PRK00750 lysK lysyl-tRNA synthetase; Reviewed Probab=33.87 E-value=22 Score=12.94 Aligned_cols=50 Identities=22% Similarity=0.258 Sum_probs=24.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCHHH---HHHHHHHHHCCCCCCCCCCCHHHHHHHHHH Q ss_conf 78999999986989999861110011015---999988763878789978667789999999 Q gi|254780143|r 1296 EMEVWCIQAYGAAYVLQEMLTIKSDDVVG---RTRVYESIVAGNDTFETGTPESFNVLVKEM 1354 (1386) Q Consensus 1296 EMErwaL~AyGAa~~LqE~Lt~kSDdv~g---r~~~~~~iv~g~~~~~~~~pesf~vl~~El 1354 (1386) |-|+-||.. |.|.|.-..++..+ -+.+|+ +-|.... + ..-+-|++|=+=| T Consensus 434 e~ek~aL~~------L~~~L~~~~~~~~~E~Iq~~Iy~-i~K~~~~-~-~~rd~F~~lY~vL 486 (513) T PRK00750 434 EEERAALED------LLDGLESLPDDASAEEIQNLVYD-VGKKHGF-E-PLRDWFKALYEVL 486 (513) T ss_pred HHHHHHHHH------HHHHHHHCCCCCCHHHHHHHHHH-HHHHCCC-C-CHHHHHHHHHHHH T ss_conf 899999999------99999738778998999999999-9886288-6-7899999999998 No 66 >COG2352 Ppc Phosphoenolpyruvate carboxylase [Energy production and conversion] Probab=33.68 E-value=23 Score=12.92 Aligned_cols=15 Identities=27% Similarity=0.507 Sum_probs=7.3 Q ss_pred CCCCCCCCCCCCCCC Q ss_conf 855231433667873 Q gi|254780143|r 1084 IQSGDKMAGRHGNKG 1098 (1386) Q Consensus 1084 p~iGDKfasRHGqKG 1098 (1386) ..||-.=|+|-+.+| T Consensus 719 LniGSRPA~Rk~~~~ 733 (910) T COG2352 719 LNIGSRPASRKPTTG 733 (910) T ss_pred CCCCCCCCCCCCCCC T ss_conf 887888767899877 No 67 >PRK09613 thiH thiamine biosynthesis protein ThiH; Reviewed Probab=33.52 E-value=23 Score=12.90 Aligned_cols=18 Identities=22% Similarity=0.436 Sum_probs=7.6 Q ss_pred CEECHHHHHHHHHHHHHH Q ss_conf 066189988889888763 Q gi|254780143|r 429 RHIRKEDIIAIIKILVDL 446 (1386) Q Consensus 429 ~~Lt~~d~~~~i~~L~~l 446 (1386) ++|+...+..-+..|... T Consensus 113 k~Lt~eEi~~E~~al~~~ 130 (471) T PRK09613 113 KKLTQEEIREEVKALESM 130 (471) T ss_pred CCCCHHHHHHHHHHHHHC T ss_conf 378999999999999976 No 68 >COG4172 ABC-type uncharacterized transport system, duplicated ATPase component [General function prediction only] Probab=31.94 E-value=20 Score=13.29 Aligned_cols=11 Identities=36% Similarity=0.785 Sum_probs=5.4 Q ss_pred CCCCCCEEEEC Q ss_conf 67652102100 Q gi|254780143|r 575 SEGHNIGLVSS 585 (1386) Q Consensus 575 PEG~n~GLv~~ 585 (1386) ..|+..|+|.. T Consensus 311 ~~gqTlGlVGE 321 (534) T COG4172 311 RRGQTLGLVGE 321 (534) T ss_pred CCCCEEEEEEC T ss_conf 38976777705 No 69 >TIGR02782 TrbB_P P-type conjugative transfer ATPase TrbB; InterPro: IPR014149 This entry represents TrbB, a protein, which is encoded in the trb locus of Agrobacterium Ti plasmids where it is involved in the type IV secretion system for plasmid conjugative transfer . TrbB is a homologue of the vir system VirB11 ATPase , and the Flp pilus system ATPase TadA .. Probab=31.94 E-value=14 Score=14.45 Aligned_cols=21 Identities=29% Similarity=0.322 Sum_probs=12.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 878988888899998876531 Q gi|254780143|r 470 MLKNQYRLGLLRMERSIKERI 490 (1386) Q Consensus 470 Ll~~~fr~~l~rl~r~i~~~~ 490 (1386) +.+|--..+|.|++.-+.+-- T Consensus 245 iHAn~a~~aL~RLeQLi~E~s 265 (315) T TIGR02782 245 IHANNAKAALRRLEQLIAEVS 265 (315) T ss_pred ECCCCHHHHHHHHHHHHHHHC T ss_conf 314886689999999998535 No 70 >TIGR02924 ICDH_alpha isocitrate dehydrogenase; InterPro: IPR014273 This entry represents a group of isocitrate dehydrogenases found mainly in the alphaproteobacteria. Many of the species containing these proteins appear to have a TCA cycle lacking only a determined isocitrate dehydrogenase. The precise identity of the cofactor (NADH -- 1.1.1.41 from EC vs. NADPH -- 1.1.1.42 from EC) is unclear.. Probab=31.72 E-value=24 Score=12.70 Aligned_cols=70 Identities=13% Similarity=0.299 Sum_probs=31.1 Q ss_pred CCEECHHHHHH-HHHHHHHHHCCCCCCCCCHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC-CCCCCCCCC Q ss_conf 70661899888-8988876304876434401015332333357878988888899998876531134433-435211222 Q gi|254780143|r 428 VRHIRKEDIIA-IIKILVDLRNGKGTIDDIDNLGNRRVRSVGEMLKNQYRLGLLRMERSIKERISSVDID-SVMPQDLIN 505 (1386) Q Consensus 428 ~~~Lt~~d~~~-~i~~L~~l~~g~~~~DdiDhlgnkRvr~vgeLl~~~fr~~l~rl~r~i~~~~~~~~~~-~~~~~~~in 505 (1386) +..-.|+.+++ ++..|..+.. ....+|..+-|- |++.+...+.|.. ....+.-+. T Consensus 269 knIANPSGLLNAAi~MLvyigQ----------------~d~A~liynAwL-------KTLEdGvHTADIy~~k~Sk~KVg 325 (481) T TIGR02924 269 KNIANPSGLLNAAIQMLVYIGQ----------------KDIAQLIYNAWL-------KTLEDGVHTADIYNEKTSKKKVG 325 (481) T ss_pred CCCCCHHHHHHHHHHHHHHHCH----------------HHHHHHHHHHHH-------HHHHCCCCHHHHCCCCCCCCCCC T ss_conf 6766700579999998640045----------------468999987764-------44213743133212336666665 Q ss_pred HHHHHHHHHHHCCCC Q ss_conf 023465554202667 Q gi|254780143|r 506 AKPVVSAVCEFFCSS 520 (1386) Q Consensus 506 ~~~i~~~i~~ff~t~ 520 (1386) .+-.+..+-.=||.+ T Consensus 326 TkeFA~aV~~nlG~~ 340 (481) T TIGR02924 326 TKEFAEAVVKNLGKK 340 (481) T ss_pred CHHHHHHHHHHHCCC T ss_conf 257999999861678 No 71 >KOG3439 consensus Probab=31.21 E-value=25 Score=12.64 Aligned_cols=21 Identities=33% Similarity=0.623 Sum_probs=17.0 Q ss_pred EEEECCEECCCCCEEECCEEEE Q ss_conf 7510000268962898682146 Q gi|254780143|r 131 YMGDLPLMTKDGTFVIKGIQRI 152 (1386) Q Consensus 131 ~lG~IPiMt~~GyFIING~ERV 152 (1386) -+|+.|+| +.-.|-||++.+| T Consensus 37 aiG~~Pil-K~~k~~i~~t~tf 57 (116) T KOG3439 37 AIGDAPIL-KKSKFKINPTQTF 57 (116) T ss_pred CCCCCCCE-ECCEEEECCCHHH T ss_conf 15787501-0325886730426 No 72 >KOG0158 consensus Probab=30.60 E-value=22 Score=13.00 Aligned_cols=21 Identities=14% Similarity=0.146 Sum_probs=12.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHH Q ss_conf 333357878988888899998 Q gi|254780143|r 464 VRSVGEMLKNQYRLGLLRMER 484 (1386) Q Consensus 464 vr~vgeLl~~~fr~~l~rl~r 484 (1386) .-+..|+.+..|-..++.++. T Consensus 289 ~lt~dei~aQafvFl~AGfeT 309 (499) T KOG0158 289 ALTDDEIAAQAFVFLLAGFET 309 (499) T ss_pred CCCHHHHHHHHHHHHHHHHHH T ss_conf 769999999999999851574 No 73 >TIGR01926 peroxid_rel uncharacterized peroxidase-related enzyme; InterPro: IPR010195 Members of this family are conserved hypothetical proteins of around 200 amino acids in length. Many of them contain an akylhydroperoxidase (AhpD) domain. . Probab=30.51 E-value=25 Score=12.56 Aligned_cols=16 Identities=25% Similarity=0.299 Sum_probs=7.9 Q ss_pred CCCCHHHHHHHHHHHH Q ss_conf 3458899999988760 Q gi|254780143|r 366 KNKDRKDALLDIYRVM 381 (1386) Q Consensus 366 ~~~~~~eAl~~I~k~l 381 (1386) ....++.|+......| T Consensus 112 ~l~~re~A~~~fA~~L 127 (179) T TIGR01926 112 DLSPRERAMLDFAVKL 127 (179) T ss_pred CCCHHHHHHHHHHHHH T ss_conf 7998899999999998 No 74 >PRK03427 cell division protein ZipA; Provisional Probab=30.46 E-value=14 Score=14.39 Aligned_cols=12 Identities=17% Similarity=0.410 Sum_probs=4.8 Q ss_pred EEEEECCCCCCC Q ss_conf 555313554444 Q gi|254780143|r 813 VAFMPWHGYNFE 824 (1386) Q Consensus 813 VA~m~~~GYN~E 824 (1386) .-++.-.|..|. T Consensus 199 l~v~a~~~~~~~ 210 (331) T PRK03427 199 MNVAAHHGSELN 210 (331) T ss_pred EEEEECCCCCCC T ss_conf 999708997101 No 75 >PRK00269 zipA cell division protein ZipA; Reviewed Probab=30.34 E-value=14 Score=14.35 Aligned_cols=18 Identities=11% Similarity=0.337 Sum_probs=8.5 Q ss_pred CEEEEEECCCCCCCCCEE Q ss_conf 515553135544444200 Q gi|254780143|r 811 MLVAFMPWHGYNFEDSML 828 (1386) Q Consensus 811 ~~VA~m~~~GYN~EDaii 828 (1386) +++-+++-.|..|.-.-+ T Consensus 157 lvl~v~a~~~~~~~G~~L 174 (295) T PRK00269 157 LVISVIARDEGGFKGPAL 174 (295) T ss_pred EEEEEECCCCCCCCHHHH T ss_conf 999997289996455999 No 76 >PHA00430 tail fiber protein Probab=30.33 E-value=26 Score=12.54 Aligned_cols=10 Identities=30% Similarity=0.375 Sum_probs=4.0 Q ss_pred HHHHHCCCCH Q ss_conf 8998809984 Q gi|254780143|r 216 SFLMALGMDS 225 (1386) Q Consensus 216 ilLrALG~ss 225 (1386) -+|||..+.- T Consensus 92 silraydlni 101 (568) T PHA00430 92 SILRAYDLNI 101 (568) T ss_pred CEEEEECCCH T ss_conf 0433102677 No 77 >PRK01741 cell division protein ZipA; Provisional Probab=29.15 E-value=16 Score=14.09 Aligned_cols=14 Identities=7% Similarity=0.133 Sum_probs=6.1 Q ss_pred EEEEECCCCCCCCC Q ss_conf 55531355444442 Q gi|254780143|r 813 VAFMPWHGYNFEDS 826 (1386) Q Consensus 813 VA~m~~~GYN~EDa 826 (1386) .-+++-.|..|.-+ T Consensus 217 l~v~a~~~~~~~G~ 230 (342) T PRK01741 217 LYVVAPENQQFNGA 230 (342) T ss_pred EEEECCCCCEECHH T ss_conf 99875899720389 No 78 >PRK13534 7-cyano-7-deazaguanine tRNA-ribosyltransferase; Provisional Probab=28.99 E-value=22 Score=13.01 Aligned_cols=54 Identities=24% Similarity=0.379 Sum_probs=31.2 Q ss_pred CCHHHHHHCCCCEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEEEE-ECCEECCCCCEEE Q ss_conf 889999983997533589999999931787665520001223678751-0000268962898 Q gi|254780143|r 86 FDVDDCLWRDLTYAVPLKITLRLIVFDVDEFTGAKSIKDIKEQSIYMG-DLPLMTKDGTFVI 146 (1386) Q Consensus 86 ~tp~ECRlR~lTYsapL~V~i~l~v~~~~~~~~~k~~~~ike~~V~lG-~IPiMt~~GyFII 146 (1386) .+|.|-+ -|.+.+.++=.++++..+.-......+-+-+ +|| +=||||+||.|=+ T Consensus 39 i~p~el~----~~G~~~iITNsYi~~~~~~~~~~a~~~GlH~---~l~~dg~ImTDSG~fQ~ 93 (630) T PRK13534 39 VDPKELK----KMGFDIVITNSYIIYKNPELREKALEKGIHS---LIDFDGPIMTDSGSFQL 93 (630) T ss_pred CCHHHHH----HCCCCEEEEEEEEEEECCCHHHHHHHHCHHH---HCCCCCCEEECCCCEEE T ss_conf 4999998----6186589750057750762346666404798---71899857977886377 No 79 >PRK05035 electron transport complex protein RnfC; Provisional Probab=28.61 E-value=27 Score=12.34 Aligned_cols=18 Identities=17% Similarity=0.246 Sum_probs=10.7 Q ss_pred CCEEECCEEEEEEEEECC Q ss_conf 628986821468665122 Q gi|254780143|r 142 GTFVIKGIQRIVVSQLHR 159 (1386) Q Consensus 142 GyFIING~ERVIVsQl~R 159 (1386) +.=+|+=-++|.+-|++= T Consensus 55 ~~p~VkvGD~VlkGQ~I~ 72 (725) T PRK05035 55 GELLVSVGDRVLKGQPLT 72 (725) T ss_pred CCCCCCCCCEECCCCEEE T ss_conf 714147899976888745 No 80 >pfam09400 DUF2002 Protein of unknown function (DUF2002). This is a family of putative cytoplasmic proteins. The structure of these proteins form an antiparallel beta and sheet and contain some alpha helical regions. Probab=28.12 E-value=28 Score=12.28 Aligned_cols=32 Identities=19% Similarity=0.320 Sum_probs=19.9 Q ss_pred CCCCCCCCCEEEEEECCCCCCCCCCHHHHHHH Q ss_conf 07879358715698668986750708999999 Q gi|254780143|r 1108 DMPFLKDGTPVDIVLNPLGVPSRMNVGQIFET 1139 (1386) Q Consensus 1108 DMPf~~dG~~pDIIlNPhgvPSRMtIGqllE~ 1139 (1386) ..|---.|-.++-.==||||-|||-.-..|+. T Consensus 77 ~FPlyl~g~~~~hyGIpHGFsSR~~L~ryl~~ 108 (110) T pfam09400 77 EFPLYLGGETHEHYGIPHGFSSREALERYLNR 108 (110) T ss_pred CCCCCCCCCCHHHCCCCCCCCHHHHHHHHHHH T ss_conf 36765477743210787563469999999875 No 81 >cd04737 LOX_like_FMN L-Lactate oxidase (LOX) FMN-binding domain. LOX is a member of the family of FMN-containing alpha-hydroxyacid oxidases and catalyzes the oxidation of l-lactate using molecular oxygen to generate pyruvate and H2O2. This family occurs in both prokaryotes and eukaryotes. Members of this family include flavocytochrome b2 (FCB2), glycolate oxidase (GOX), lactate monooxygenase (LMO), mandelate dehydrogenase (MDH), and long chain hydroxyacid oxidase (LCHAO). Probab=27.94 E-value=13 Score=14.64 Aligned_cols=51 Identities=33% Similarity=0.436 Sum_probs=26.7 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 1784320789999999869899998611100110159999887638787899786677899999998 Q gi|254780143|r 1289 RGGQRLGEMEVWCIQAYGAAYVLQEMLTIKSDDVVGRTRVYESIVAGNDTFETGTPESFNVLVKEMQ 1355 (1386) Q Consensus 1289 ~GGlRfGEMErwaL~AyGAa~~LqE~Lt~kSDdv~gr~~~~~~iv~g~~~~~~~~pesf~vl~~El~ 1355 (1386) +||.|=|.==+=| +|-||.+++ .||.-+|---..|+ .|+--.+.+|..||. T Consensus 282 DgGIR~G~DV~KA-LALGA~aV~-----------iGRp~l~glaa~G~----~GV~~~l~iL~~El~ 332 (351) T cd04737 282 DSGVRRGEHVFKA-LASGADAVA-----------VGRPVLYGLALGGA----QGVASVLEHLNKELK 332 (351) T ss_pred CCCCCCHHHHHHH-HHCCCCEEE-----------ECHHHHHHHHHCCH----HHHHHHHHHHHHHHH T ss_conf 6986746899999-976998897-----------57899998871338----999999999999999 No 82 >pfam04221 RelB RelB antitoxin. RelE and RelB form a toxin-antitoxin system. RelE represses translation, probably through binding ribosomes. RelB stably binds RelE, presumably deactivating it. Probab=27.71 E-value=28 Score=12.23 Aligned_cols=35 Identities=20% Similarity=0.407 Sum_probs=28.6 Q ss_pred CEEEEEECCCCCHHHHHHHHHCCCCHHHHHHHHCC Q ss_conf 82999971878703998998809984799997387 Q gi|254780143|r 200 DIIHVRIDRRRKVPVTSFLMALGMDSEEILSTFYP 234 (1386) Q Consensus 200 d~iyvrIdr~rKIPi~ilLrALG~ssdeIl~~f~~ 234 (1386) ..+-+|||..-|=-+.-+|..||++--+-+.+|+. T Consensus 2 ~ti~~RiD~~lK~~A~~i~~~lGlt~S~Ai~~fl~ 36 (83) T pfam04221 2 GSLNLRIDDDTKAAAYDVLERMGLTPSQAIRLFLT 36 (83) T ss_pred CEEEEEECHHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 80688749899999999999949998999999999 No 83 >TIGR02384 RelB_DinJ addiction module antitoxin, RelB/DinJ family; InterPro: IPR007337 Plasmids may be maintained stably in bacterial populations through the action of addiction modules, in which a toxin and antidote are encoded in a cassette on the plasmid. In any daughter cell that lacks the plasmid, the toxin persists and is lethal after the antidote protein is depleted. Toxin/antitoxin pairs are also found on main chromosomes, and likely represent selfish DNA. Sequences in the seed for this alignment all were found adjacent to toxin genes. Several toxin/antitoxin pairs may occur in a single species. RelE and RelB form a toxin-antitoxin system; RelE represses translation, probably through binding ribosomes , . RelB stably binds RelE, presumably deactivating it.. Probab=27.70 E-value=28 Score=12.23 Aligned_cols=34 Identities=26% Similarity=0.412 Sum_probs=29.9 Q ss_pred CCEEEEEECCCCCHHHHHHHHHCCCCHHHHHHHH Q ss_conf 9829999718787039989988099847999973 Q gi|254780143|r 199 KDIIHVRIDRRRKVPVTSFLMALGMDSEEILSTF 232 (1386) Q Consensus 199 kd~iyvrIdr~rKIPi~ilLrALG~ssdeIl~~f 232 (1386) |..|.+|||..=|.-+.=+|..||++.-+-+++| T Consensus 5 k~~~~~RiD~~~K~~A~~v~~~lGl~~S~Air~f 38 (96) T TIGR02384 5 KATISIRIDEELKKEAYAVFEELGLTMSTAIRMF 38 (96) T ss_pred CCEEEEECCHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 3215751667668999999987489988999999 No 84 >PRK02597 DNA-directed RNA polymerase subunit beta'; Provisional Probab=27.18 E-value=29 Score=12.17 Aligned_cols=47 Identities=28% Similarity=0.437 Sum_probs=32.8 Q ss_pred CHHHHCCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHCCCCCEEE Q ss_conf 667734741000013666776999623459989999998656541024 Q gi|254780143|r 299 TSDCLCGLYVAEDIVNGETGEIYIEAGDVIDEKSLEEIFHSEIRDIPI 346 (1386) Q Consensus 299 ~~~~l~~~~~~~~~~d~~~gei~~~~~~~~~~~~l~~~~~~~~~~~~~ 346 (1386) ....+.|+.++.+++++. +++++.++..++++.++.+..++...+.+ T Consensus 240 l~~r~~Gr~~~~~v~~~~-~~~i~~~~~~i~~~~~~~~~~~~~~~v~i 286 (1295) T PRK02597 240 LGDRLVGRVTAEDVVDPD-GEVIAERNTEIDPDLAKKIEKAGVEEVMV 286 (1295) T ss_pred ECCEEEEEEEHHHEECCC-CCEEEECCCCCCHHHHHHHHHCCCCEEEE T ss_conf 103556463875678899-96899798675999999999869978997 No 85 >PRK13533 7-cyano-7-deazaguanine tRNA-ribosyltransferase; Provisional Probab=27.18 E-value=22 Score=13.04 Aligned_cols=50 Identities=18% Similarity=0.331 Sum_probs=28.7 Q ss_pred CCHHHHHHCCCCEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEE--EEE-ECCEECCCCCEEE Q ss_conf 889999983997533589999999931787665520001223678--751-0000268962898 Q gi|254780143|r 86 FDVDDCLWRDLTYAVPLKITLRLIVFDVDEFTGAKSIKDIKEQSI--YMG-DLPLMTKDGTFVI 146 (1386) Q Consensus 86 ~tp~ECRlR~lTYsapL~V~i~l~v~~~~~~~~~k~~~~ike~~V--~lG-~IPiMt~~GyFII 146 (1386) .+|+|=+. +.+.+.++=.++.+....+. ..++.+ +|| +=||||+||.|=+ T Consensus 40 i~p~el~~----~Ga~iIItNtY~l~~rpg~~-------a~~~GLH~fm~wdgpImTDSGgFQv 92 (486) T PRK13533 40 IPPEELKE----FGFEALITNSYIIYRSERER-------ALEKGLHELLGFDGVIMTDSGSYQL 92 (486) T ss_pred CCHHHHHH----HCCCEEEEECCCEECCCCHH-------HHHCCHHHHCCCCCCEEECCCCCEE T ss_conf 18999998----17989931011000276066-------7760708760999973766887643 No 86 >PHA01630 putative group 1 glycosyl transferase Probab=26.84 E-value=9.3 Score=15.70 Aligned_cols=14 Identities=14% Similarity=0.121 Sum_probs=7.0 Q ss_pred CHHHHHHHHHHHHH Q ss_conf 08899999999986 Q gi|254780143|r 28 DLIEVQKASYDHFL 41 (1386) Q Consensus 28 ~Li~iQ~~Sf~~Fl 41 (1386) ++.++.--||..-+ T Consensus 8 ~y~~h~~v~~k~l~ 21 (333) T PHA01630 8 DYPDHSFVRQKKLL 21 (333) T ss_pred CCCCCCHHHHHHHH T ss_conf 36654316789999 No 87 >pfam04104 DNA_primase_lrg Eukaryotic and archaeal DNA primase, large subunit. DNA primase is the polymerase that synthesizes small RNA primers for the Okazaki fragments made during discontinuous DNA replication. DNA primase is a heterodimer of two subunits, the small subunit Pri1 (48 kDa in yeast), and the large subunit Pri2 (58 kDa in the yeast S. cerevisiae). The large subunit of DNA primase forms interactions with the small subunit and the structure implicates that it is not directly involved in catalysis, but plays roles in correctly positioning the primase/DNA complex, and in the transfer of RNA to DNA polymerase. Probab=26.77 E-value=29 Score=12.12 Aligned_cols=19 Identities=37% Similarity=0.614 Sum_probs=6.9 Q ss_pred HHHHHHHHCCCCHHHHHHH Q ss_conf 3998998809984799997 Q gi|254780143|r 213 PVTSFLMALGMDSEEILST 231 (1386) Q Consensus 213 Pi~ilLrALG~ssdeIl~~ 231 (1386) -++.||+.+|++.||++.. T Consensus 124 ~l~~FLk~iG~~~~e~l~~ 142 (217) T pfam04104 124 QLTLFLKGIGLSLDEILEF 142 (217) T ss_pred HHHHHHHHCCCCHHHHHHH T ss_conf 9999998679989999999 No 88 >pfam11547 E3_UbLigase_EDD E3 ubiquitin ligase EDD. EDD, the ER ubiquitin ligase from the HECT ligases, contains an N-terminal ubiquitin-associated domain which binds ubiquitin. Ubiquitin is recognized by helices alpha-1 and -3 in in the UBA domain. EDD is involved in DNA damage repair pathways and binds to mono-ubiquitinated proteins. Probab=26.66 E-value=29 Score=12.10 Aligned_cols=18 Identities=39% Similarity=0.671 Sum_probs=13.5 Q ss_pred HHHHHHHHHHHCCCCEEE Q ss_conf 789999999854200276 Q gi|254780143|r 1346 SFNVLVKEMQALGLSIDL 1363 (1386) Q Consensus 1346 sf~vl~~El~~l~l~~~~ 1363 (1386) |=+|.++|||--+|||.+ T Consensus 22 SR~vIiRELQrtnLdVN~ 39 (51) T pfam11547 22 SRNVIIRELQRTNLDVNL 39 (51) T ss_pred CHHHHHHHHHHCCCCHHH T ss_conf 289999999981763999 No 89 >KOG1647 consensus Probab=26.50 E-value=30 Score=12.08 Aligned_cols=10 Identities=20% Similarity=0.424 Sum_probs=4.0 Q ss_pred HHHHHHHHHH Q ss_conf 8888899998 Q gi|254780143|r 475 YRLGLLRMER 484 (1386) Q Consensus 475 fr~~l~rl~r 484 (1386) =|--|.||.| T Consensus 195 eRedF~RLKK 204 (255) T KOG1647 195 EREDFYRLKK 204 (255) T ss_pred HHHHHHHHHH T ss_conf 7899999999 No 90 >PRK04335 cell division protein ZipA; Provisional Probab=26.46 E-value=18 Score=13.62 Aligned_cols=16 Identities=6% Similarity=-0.079 Sum_probs=6.5 Q ss_pred EEEEEECCCCCCCCCE Q ss_conf 1555313554444420 Q gi|254780143|r 812 LVAFMPWHGYNFEDSM 827 (1386) Q Consensus 812 ~VA~m~~~GYN~EDai 827 (1386) +..+++-.|..|.-.- T Consensus 185 vl~v~a~~~~~~~G~~ 200 (319) T PRK04335 185 VLNVHCAGNEPFVGTK 200 (319) T ss_pred EEEEECCCCCCCCHHH T ss_conf 9999728998546699 No 91 >PRK13133 consensus Probab=25.89 E-value=30 Score=12.01 Aligned_cols=17 Identities=12% Similarity=0.243 Sum_probs=6.8 Q ss_pred CCCCCCCCCCEEEEEEC Q ss_conf 14336678735888630 Q gi|254780143|r 1089 KMAGRHGNKGIVSRILP 1105 (1386) Q Consensus 1089 KfasRHGqKGVis~i~p 1105 (1386) ||+.+--+-||-|.|+| T Consensus 115 ~F~~~~~~aGvdGlIip 131 (267) T PRK13133 115 CFLADAVKAGVDGLLIP 131 (267) T ss_pred HHHHHHHHCCCCEEECC T ss_conf 99999998698788778 No 92 >TIGR02881 spore_V_K stage V sporulation protein K; InterPro: IPR014232 Proteins in this entry include the stage V sporulation protein K (SpoVK), a close homologue of the Rubisco expression protein CbbX (IPR000470 from INTERPRO), and are members of an ATPase family associated with various cellular activities. These proteins are strictly limited to bacterial endospore-forming species, but are not found universally among members of this group; they are missing from the Clostridium species.. Probab=25.61 E-value=12 Score=14.78 Aligned_cols=22 Identities=18% Similarity=0.314 Sum_probs=12.1 Q ss_pred CCHHHHHHHHHHHHHHHHHCCC Q ss_conf 5302333455557776542036 Q gi|254780143|r 401 FDSDKYDLSTVGRVKMNMRLNL 422 (1386) Q Consensus 401 ~~~~~y~l~~vgr~~~n~~l~~ 422 (1386) .+..-|.|+.-..+.+...+.. T Consensus 191 ~~~ReY~Lt~~A~~~lr~~l~~ 212 (261) T TIGR02881 191 VKEREYKLTEEAKWKLREHLAK 212 (261) T ss_pred HHHHHHCCCHHHHHHHHHHHHH T ss_conf 8646422578899999999741 No 93 >pfam02449 Glyco_hydro_42 Beta-galactosidase. This group of beta-galactosidase enzymes belong to the glycosyl hydrolase 42 family. The enzyme catalyses the hydrolysis of terminal, non-reducing terminal beta-D-galactosidase residues. Probab=25.53 E-value=31 Score=11.96 Aligned_cols=81 Identities=19% Similarity=0.240 Sum_probs=40.4 Q ss_pred CCCCEEECCCCC--CCCCCCCCCHHHHHHHHH--HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCCCCCCCCHHHHH Q ss_conf 731103079986--233178432078999999--9869899998611100110159999887638787899786677899 Q gi|254780143|r 1274 SYSLVTQQPLGG--KSNRGGQRLGEMEVWCIQ--AYGAAYVLQEMLTIKSDDVVGRTRVYESIVAGNDTFETGTPESFNV 1349 (1386) Q Consensus 1274 P~sllTrQP~eG--Rsr~GGlRfGEMErwaL~--AyGAa~~LqE~Lt~kSDdv~gr~~~~~~iv~g~~~~~~~~pesf~v 1349 (1386) |+-+.-|||--+ +..+.=-|=|+|.+|+++ ||||-.++-=+. =.-..|.-+...+++.-+..+..-+-+-..- T Consensus 292 pf~vmE~q~g~vnw~~~n~~~~pG~~~l~s~~~vA~GAd~v~yf~W---r~~~~G~E~~h~gll~hdg~~~~r~~~Ev~~ 368 (376) T pfam02449 292 PFWVMEQSPSPVNWAPYNPAKRPGMMRLWSLQAVAHGADAVCYFQW---RQSRGGSEKFHSGVLDHDGREDTRVFREVAE 368 (376) T ss_pred CEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEECC---CCCCCCHHHHCEEECCCCCCCCCCHHHHHHH T ss_conf 7799747898888877889999779999999999663665764206---7878730311512048799998625999999 Q ss_pred HHHHHHHC Q ss_conf 99999854 Q gi|254780143|r 1350 LVKEMQAL 1357 (1386) Q Consensus 1350 l~~El~~l 1357 (1386) |-+||+.| T Consensus 369 ~g~el~~l 376 (376) T pfam02449 369 VGEELKKL 376 (376) T ss_pred HHHHHHHC T ss_conf 99999729 No 94 >PRK05352 Na(+)-translocating NADH-quinone reductase subunit A; Provisional Probab=25.32 E-value=31 Score=11.94 Aligned_cols=23 Identities=22% Similarity=0.459 Sum_probs=11.8 Q ss_pred EEECCEEEEEEEEEC----CCCCEEEC Q ss_conf 898682146866512----27852120 Q gi|254780143|r 144 FVIKGIQRIVVSQLH----RSPGIHFD 166 (1386) Q Consensus 144 FIING~ERVIVsQl~----RSPGVyf~ 166 (1386) .+|+=-++|-.-|-. ..|+|.|. T Consensus 44 l~VkeGD~V~~Gq~Lf~dK~~~~v~~~ 70 (448) T PRK05352 44 MKVKEGDKVKKGQPLFEDKKNPGVKFT 70 (448) T ss_pred EEECCCCEECCCCEEEECCCCCCEEEE T ss_conf 685579997479855653898962696 No 95 >PRK09282 pyruvate carboxylase subunit B; Validated Probab=25.15 E-value=31 Score=11.91 Aligned_cols=25 Identities=28% Similarity=0.437 Sum_probs=16.0 Q ss_pred CCCCCCCCEEECCCEECCCCCCCCC Q ss_conf 5304479772078520355223578 Q gi|254780143|r 779 RPLVKVGDEVRRNDIIADGPSTDLG 803 (1386) Q Consensus 779 ~p~V~~g~~~~~~~~l~~~~~~~~~ 803 (1386) +..|+.|+.|++||.++--++|++. T Consensus 524 ~V~Vk~Gd~V~~Gd~L~vlEAMKME 548 (580) T PRK09282 524 KVLVKEGDKVKEGDVLLILEAMKME 548 (580) T ss_pred EEEECCCCEECCCCEEEEEEHHCCC T ss_conf 9997899987899989998510474 No 96 >COG0427 ACH1 Acetyl-CoA hydrolase [Energy production and conversion] Probab=25.08 E-value=31 Score=11.90 Aligned_cols=43 Identities=28% Similarity=0.421 Sum_probs=28.7 Q ss_pred CCCEEEEEECCCCCCCCCCCCEEEEEECC------CCCCCCCCHHHHHHH Q ss_conf 87358886300007879358715698668------986750708999999 Q gi|254780143|r 1096 NKGIVSRILPCEDMPFLKDGTPVDIVLNP------LGVPSRMNVGQIFET 1139 (1386) Q Consensus 1096 qKGVis~i~p~eDMPf~~dG~~pDIIlNP------hgvPSRMtIGqllE~ 1139 (1386) -+|.||.|+|+-+= -+-.=.-+|+|+-= .|.+.|--.-.|+|| T Consensus 404 ~~G~IS~IVP~~~h-Vd~~rhdvdvvVTE~GiAdLRGlsp~ERA~~iI~~ 452 (501) T COG0427 404 KGGTISRIVPMLSH-VDHTRHDVDVVVTEYGIADLRGLSPRERAAAIIEC 452 (501) T ss_pred CCCCEEEEEECCCC-CCCCCCCEEEEEEEECHHHHCCCCHHHHHHHHHHH T ss_conf 79943688844787-66566513699970023543389979999999986 No 97 >KOG4507 consensus Probab=24.55 E-value=32 Score=11.87 Aligned_cols=28 Identities=14% Similarity=0.354 Sum_probs=18.6 Q ss_pred CCCCCCCCCCCCCCCCCEEEEEECCCCC Q ss_conf 3223466554322111104443035667 Q gi|254780143|r 549 VARARAGVEMRDVHPTHYGRICPAETSE 576 (1386) Q Consensus 549 l~r~~~~~evR~ih~s~~GriCPieTPE 576 (1386) +..+.++..+..+-.+|||----..||. T Consensus 512 ~~pen~g~~i~el~s~~~~~~~~~~~p~ 539 (886) T KOG4507 512 LPPENKGLRIHELSSDDYSTEEEAQTPD 539 (886) T ss_pred CCCCCCCEEEEEHHCCCCCCCCCCCCCC T ss_conf 2766676464330014434201234899 No 98 >TIGR01163 rpe ribulose-phosphate 3-epimerase; InterPro: IPR000056 Ribulose-phosphate 3-epimerase (5.1.3.1 from EC) (also known as pentose-5-phosphate 3-epimerase or PPE) is the enzyme that converts D-ribulose 5-phosphate into D-xylulose 5-phosphate in Calvin's reductive pentose phosphate cycle. In Alcaligenes eutrophus two copies of the gene coding for PPE are known , one is chromosomally encoded P40117 from SWISSPROT, the other one is on a plasmid Q04539 from SWISSPROT. PPE has been found in a wide range of bacteria, archaebacteria, fungi and plants. All the proteins have from 209 to 241 amino acid residues. The enzyme has a TIM barrel structure.; GO: 0004750 ribulose-phosphate 3-epimerase activity, 0005975 carbohydrate metabolic process. Probab=24.45 E-value=25 Score=12.61 Aligned_cols=29 Identities=17% Similarity=0.193 Sum_probs=18.1 Q ss_pred CCCEEEEECCCCCCHHHHHHHHHCCCCCE Q ss_conf 67769996234599899999986565410 Q gi|254780143|r 316 ETGEIYIEAGDVIDEKSLEEIFHSEIRDI 344 (1386) Q Consensus 316 ~~gei~~~~~~~~~~~~l~~~~~~~~~~~ 344 (1386) ....+.++-.+-...+.+..+..++.+.+ T Consensus 166 ~~~~~~ieVDGGv~~~ni~~~~~AGAD~~ 194 (216) T TIGR01163 166 LGLSILIEVDGGVNEDNIAEVAEAGADIL 194 (216) T ss_pred CCCCEEEEECCCCCHHHHHHHHHCCCCEE T ss_conf 79955899717989767999997589899 No 99 >TIGR01677 pln_FAD_oxido plant-specific FAD-dependent oxidoreductase; InterPro: IPR010030 This entry identifies a family of uncharacterised plant-specific FAD-dependent oxidoreductases. At least seven distinct members are found in Arabidopsis thaliana. The family shows considerable sequence similarity to three different enzymes of ascorbic acid biosynthesis: L-galactono-1,4-lactone dehydrogenase (1.3.2.3 from EC) from higher plants, D-arabinono-1,4-lactone oxidase (1.1.3.37 from EC) from Saccharomyces cerevisiae, and L-gulonolactone oxidase (1.1.3.8 from EC) from mouse, as well as to a bacterial sorbitol oxidase. The class of compound acted on by members of this family is unknown.. Probab=24.11 E-value=17 Score=13.89 Aligned_cols=17 Identities=24% Similarity=0.411 Sum_probs=7.9 Q ss_pred EEEECCCCCCCCCEEEE Q ss_conf 44303566765210210 Q gi|254780143|r 568 RICPAETSEGHNIGLVS 584 (1386) Q Consensus 568 riCPieTPEG~n~GLv~ 584 (1386) ++|=+|-=.|-=+--|| T Consensus 409 ~LCGvd~Y~GiLiRyvk 425 (577) T TIGR01677 409 SLCGVDLYNGILIRYVK 425 (577) T ss_pred CCCCCEEECCEEEEEEC T ss_conf 01431300763799871 No 100 >TIGR01369 CPSaseII_lrg carbamoyl-phosphate synthase, large subunit; InterPro: IPR006275 Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates , . CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate . The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC). Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain . CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites . The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain. Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein . The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP . There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia . CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains . This entry represents glutamine-dependent CPSase (6.3.5.5 from EC) from prokaryotes and eukaryotes (CPSase II). ; GO: 0004086 carbamoyl-phosphate synthase activity, 0006807 nitrogen compound metabolic process. Probab=24.10 E-value=33 Score=11.78 Aligned_cols=82 Identities=16% Similarity=0.314 Sum_probs=47.0 Q ss_pred CCCCHHHHHHCCCCCCCCCCCEECC-CCCEEECCCCCCCCCC--------CH-HHHHHHHHCCCCCCCCCC-CCEECCCC Q ss_conf 5468678410241374137733136-7611101246777767--------70-346654303555544323-32022788 Q gi|254780143|r 866 PNVSEEGLKNIDECGIICVGAEVNP-GDILVGKITPKGESPM--------TP-EEKLLRAIFGEKAVDVRD-TSLRVPSG 934 (1386) Q Consensus 866 ~~~~~~~~~~ld~~Giv~~G~~V~~-gDilvgk~tp~~~~~~--------~p-e~~~l~~i~~~~~~~~~d-~~~~~~~g 934 (1386) .......=+.|.--|+-.+.--|+. |++.|==+=|.-.... -| -.-..++++|.+-...-- ..+..++. T Consensus 825 ~~~~~~iA~~L~v~Gl~NiQf~~~~E~~~yVIE~NpRASRtVPFvSKa~Gipl~~~A~~~~~G~~l~~~~~~~gv~~~~~ 904 (1089) T TIGR01369 825 KDIVRKIAKELNVKGLFNIQFVVKDEGEVYVIEVNPRASRTVPFVSKATGIPLAKLAVRVMLGKKLEELGKDLGVGKEKE 904 (1089) T ss_pred HHHHHHHHHHCCCCCCEEEEEEEECCCCEEEEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHCCCCCCCCCCCC T ss_conf 99999999870660722245556169967999971742066541321037887999999970882010275401123268 Q ss_pred CCCCCCCCCCCCC Q ss_conf 5322001210145 Q gi|254780143|r 935 VSGTVVDVRIFNR 947 (1386) Q Consensus 935 ~~g~v~~~~~~~r 947 (1386) ....-+.+-.|+. T Consensus 905 ~~~vavK~~vFSF 917 (1089) T TIGR01369 905 SKYVAVKVPVFSF 917 (1089) T ss_pred CCEEEEEEEECCC T ss_conf 8727996423771 No 101 >PRK04570 cell division protein ZipA; Provisional Probab=23.66 E-value=22 Score=13.00 Aligned_cols=13 Identities=23% Similarity=0.161 Sum_probs=5.4 Q ss_pred EECCCCCCHHHHH Q ss_conf 9809858899999 Q gi|254780143|r 80 EFDPPKFDVDDCL 92 (1386) Q Consensus 80 ~l~~Pk~tp~ECR 92 (1386) .+|.||-+|+-.| T Consensus 24 ~~~r~k~~~~~~~ 36 (244) T PRK04570 24 LFGRPKKSPQGRR 36 (244) T ss_pred EEECCCCCHHHHH T ss_conf 7605676601220 No 102 >PRK05070 DNA mismatch repair protein; Provisional Probab=22.88 E-value=34 Score=11.62 Aligned_cols=14 Identities=14% Similarity=0.710 Sum_probs=5.6 Q ss_pred EEEEEECCCCCHHH Q ss_conf 29999718787039 Q gi|254780143|r 201 IIHVRIDRRRKVPV 214 (1386) Q Consensus 201 ~iyvrIdr~rKIPi 214 (1386) ++||-+.-.|-||. T Consensus 112 VLwipve~~r~ip~ 125 (218) T PRK05070 112 VLWIPVEGERSIPL 125 (218) T ss_pred EEEEEEECCCCCCH T ss_conf 79999726789988 No 103 >pfam06925 MGDG_synth Monogalactosyldiacylglycerol (MGDG) synthase. This family represents a conserved region of approximately 180 residues within plant and bacterial monogalactosyldiacylglycerol (MGDG) synthase (EC:2.4.1.46). In Arabidopsis, there are two types of MGDG synthase which differ in their N-terminal portion: type A and type B. Probab=22.55 E-value=33 Score=11.69 Aligned_cols=21 Identities=14% Similarity=0.350 Sum_probs=13.9 Q ss_pred CEEEEEECCCCCCCCCCHHHH Q ss_conf 715698668986750708999 Q gi|254780143|r 1116 TPVDIVLNPLGVPSRMNVGQI 1136 (1386) Q Consensus 1116 ~~pDIIlNPhgvPSRMtIGql 1136 (1386) .-||+|+.-|-+|+.|....| T Consensus 88 ~~PD~IV~Thp~~~~~~l~~l 108 (169) T pfam06925 88 FQPDIIISTHPLPAAVPLSVL 108 (169) T ss_pred HCCCEEEECCHHHHHHHHHHH T ss_conf 493999999762667899999 No 104 >pfam05059 Orbi_VP4 Orbivirus VP4 core protein. Orbiviruses are double stranded RNA retroviruses of which the bluetongue virus is a member. The core of bluetongue virus (BTV) is a multienzyme complex composed of two major proteins (VP7 and VP3) and three minor proteins (VP1, VP4 and VP6) in addition to the viral genome. VP4 has been shown to perform all RNA capping activities and has both methyltransferase type 1 and type 2 activities associated with it. Probab=22.19 E-value=35 Score=11.52 Aligned_cols=19 Identities=16% Similarity=0.085 Sum_probs=9.0 Q ss_pred HHHHHHCCCCEEEEEEEEE Q ss_conf 9999983997533589999 Q gi|254780143|r 88 VDDCLWRDLTYAVPLKITL 106 (1386) Q Consensus 88 p~ECRlR~lTYsapL~V~i 106 (1386) +++-=+-+-+|+..+|+.= T Consensus 34 lNdLW~~~Gky~tDiYa~G 52 (642) T pfam05059 34 LNDLWLERGKYATDIYAYG 52 (642) T ss_pred HHHHHHHCCCCCCCEEEEC T ss_conf 7899996184122048851 No 105 >PRK11029 FtsH protease regulator HflC; Provisional Probab=22.15 E-value=35 Score=11.52 Aligned_cols=29 Identities=31% Similarity=0.496 Sum_probs=23.0 Q ss_pred CCCCCEEECCEEEEEEEEE---CC---------CCCEEECC Q ss_conf 6896289868214686651---22---------78521202 Q gi|254780143|r 139 TKDGTFVIKGIQRIVVSQL---HR---------SPGIHFDH 167 (1386) Q Consensus 139 t~~GyFIING~ERVIVsQl---~R---------SPGVyf~~ 167 (1386) .-++.|||+=.|+.||-|. +| .||.+|.- T Consensus 16 l~~s~yiV~e~e~aVVlrFGk~vr~~~~~~~v~ePGLhfki 56 (334) T PRK11029 16 LYMSVFVVKEGERGITLRFGKVLRDDDNKPLVYEPGLHFKI 56 (334) T ss_pred HHHEEEEECCCEEEEEEECCCEECCCCCCCCCCCCCEEEEC T ss_conf 98568997577089999669552366777643589805873 No 106 >smart00454 SAM Sterile alpha motif. Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerisation. Probab=21.73 E-value=11 Score=15.09 Aligned_cols=30 Identities=17% Similarity=0.359 Sum_probs=11.9 Q ss_pred HHHHHHHHHHCCCCCCCCEEEECCCCCCCC Q ss_conf 999999999868689986999868988402 Q gi|254780143|r 1216 EEAINSMLRMADLDESGQSILYDGLTGEPF 1245 (1386) Q Consensus 1216 ~~~i~~~L~~aG~~~~Gke~lydG~TGe~~ 1245 (1386) .+++.++|...|++.+.....-+|++|..+ T Consensus 6 ~~~V~~WL~~~gl~~y~~~F~~~~i~g~~l 35 (68) T smart00454 6 PESVADWLESIGLEQYADNFRKNGIDGALL 35 (68) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHCCCHHHH T ss_conf 999999998797899899999816058999 No 107 >PRK13121 consensus Probab=21.61 E-value=36 Score=11.44 Aligned_cols=18 Identities=17% Similarity=0.255 Sum_probs=7.5 Q ss_pred CCCCCCCCCCCEEEEEEC Q ss_conf 314336678735888630 Q gi|254780143|r 1088 DKMAGRHGNKGIVSRILP 1105 (1386) Q Consensus 1088 DKfasRHGqKGVis~i~p 1105 (1386) |||+.+--+-||-|.|+| T Consensus 112 e~F~~~~~~aGvdGlIip 129 (265) T PRK13121 112 DAFAAAARAAGVDGVLVV 129 (265) T ss_pred HHHHHHHHHCCCCEEECC T ss_conf 999999987298734348 No 108 >COG1386 scpB Chromosome segregation and condensation protein B [DNA replication, recombination and repair] Probab=21.58 E-value=36 Score=11.44 Aligned_cols=11 Identities=36% Similarity=0.576 Sum_probs=6.4 Q ss_pred HHHHHHHHHHH Q ss_conf 07899999998 Q gi|254780143|r 1295 GEMEVWCIQAY 1305 (1386) Q Consensus 1295 GEMErwaL~Ay 1305 (1386) ++||+=|.||| T Consensus 93 aalEtLAiIAY 103 (184) T COG1386 93 AALETLAIIAY 103 (184) T ss_pred HHHHHHHHHHH T ss_conf 99999999998 No 109 >TIGR01217 ac_ac_CoA_syn acetoacetyl-CoA synthase; InterPro: IPR005914 Isoprenoids are a large class of compounds, with more than 20,000 structures currently known, which are found in all living organisms. Some play essential physiological roles, such as sterols that stabilise cell membranes or carotenoids involved in photosynthesis, while the function of many others is not well understood. In all eukaryotes and some prokaryotes, isoprenoids are synthesised by the mevalonate pathway, while most prokaryotes use a mevalonate-independent pathway , . This entry represents acetoacetyl-CoA synthase (6.2.1.16 from EC), catalysing the first step of the mevalonate pathway of isoprenoid biosynthesis via isopentenyl diphosphate.ATP + acetoacetate + CoA = AMP + diphosphate + acetoacetyl-CoA A Sinorhizobium protein in this entry is also required for growth on polyhydroxybutyrate, a commonly used carbon storage molecule in bacteria .. Probab=21.47 E-value=36 Score=11.42 Aligned_cols=28 Identities=21% Similarity=0.516 Sum_probs=22.4 Q ss_pred CCCCCCCCCEEEEEECCCCCCCCCCCCEEEEE Q ss_conf 43366787358886300007879358715698 Q gi|254780143|r 1090 MAGRHGNKGIVSRILPCEDMPFLKDGTPVDIV 1121 (1386) Q Consensus 1090 fasRHGqKGVis~i~p~eDMPf~~dG~~pDII 1121 (1386) .|-|| |=+.|+...|.|+|..|-.+.+= T Consensus 613 lSpRH----VP~~iiev~gIP~T~~GK~vEva 640 (676) T TIGR01217 613 LSPRH----VPDKIIEVAGIPRTLSGKKVEVA 640 (676) T ss_pred CCCCC----CCHHHHCCCCCCCCCCCCEEEEC T ss_conf 88766----73455016888725677678850 No 110 >KOG0030 consensus Probab=21.40 E-value=37 Score=11.41 Aligned_cols=18 Identities=33% Similarity=0.551 Sum_probs=8.9 Q ss_pred HHHHHHCCCC--HHHHHHHH Q ss_conf 9899880998--47999973 Q gi|254780143|r 215 TSFLMALGMD--SEEILSTF 232 (1386) Q Consensus 215 ~ilLrALG~s--sdeIl~~f 232 (1386) -=.|||||.. .+++.+.. T Consensus 34 gdvlRalG~nPT~aeV~k~l 53 (152) T KOG0030 34 GDVLRALGQNPTNAEVLKVL 53 (152) T ss_pred HHHHHHHCCCCCHHHHHHHH T ss_conf 99999846999678999997 No 111 >PRK12999 pyruvate carboxylase; Reviewed Probab=21.38 E-value=37 Score=11.41 Aligned_cols=21 Identities=24% Similarity=0.434 Sum_probs=11.1 Q ss_pred CCCCCCCCCEEEEEEEEEECC Q ss_conf 988402685048846454101 Q gi|254780143|r 1240 LTGEPFDRPVTVGYIYMLKLN 1260 (1386) Q Consensus 1240 ~TGe~~~~~IfvG~~YyqkL~ 1260 (1386) ..||.+...|--|-+|..||. T Consensus 1011 ~~gEe~~veie~GKtl~Ikl~ 1031 (1147) T PRK12999 1011 RPGEEIEVEIEPGKTLIIKLE 1031 (1147) T ss_pred CCCCEEEEECCCCCEEEEEEE T ss_conf 889548887268937999962 No 112 >TIGR00003 TIGR00003 copper ion binding protein; InterPro: IPR006122 Proteins that transport heavy metals in micro-organisms and eukaryotes share similarities in their sequences and structures. These proteins provide an important focus for research, some being involved in bacterial resistance to toxic metals, such as lead and cadmium, while others are involved in inherited human syndromes, such as Wilson's and Menke's diseases . A conserved 30-residue domain has been found in a number of these heavy metal transport or detoxification proteins . The domain, which has been termed Heavy-Metal-Associated (HMA), contains two conserved cysteines that are probably involved in metal binding. This sub-domain is found in copper-binding proteins. ; GO: 0005507 copper ion binding, 0006825 copper ion transport. Probab=21.30 E-value=37 Score=11.40 Aligned_cols=23 Identities=17% Similarity=0.284 Sum_probs=17.2 Q ss_pred CEECCCCCCCCCHHHHHHHHHHCCCC Q ss_conf 40025655788999999999986868 Q gi|254780143|r 1204 VPVSTPVFDGADEEAINSMLRMADLD 1229 (1386) Q Consensus 1204 ~~~aTP~F~g~~~~~i~~~L~~aG~~ 1229 (1386) +.|.+| .++.++|.+.+..+||+ T Consensus 42 V~Fd~~---~v~~~~I~~Ai~d~GY~ 64 (66) T TIGR00003 42 VEFDAP---KVSAKEIKEAILDAGYE 64 (66) T ss_pred EEECCC---CCCHHHHHHHHHHCCCC T ss_conf 875375---34467788898736653 No 113 >PRK13139 consensus Probab=21.13 E-value=37 Score=11.38 Aligned_cols=45 Identities=22% Similarity=0.414 Sum_probs=29.7 Q ss_pred CCC-CCCCCCCCCCCEEEEEECCCCCCCCC----------CCCEEEEEECCCCCCCCC Q ss_conf 552-31433667873588863000078793----------587156986689867507 Q gi|254780143|r 1085 QSG-DKMAGRHGNKGIVSRILPCEDMPFLK----------DGTPVDIVLNPLGVPSRM 1131 (1386) Q Consensus 1085 ~iG-DKfasRHGqKGVis~i~p~eDMPf~~----------dG~~pDIIlNPhgvPSRM 1131 (1386) +.| |||+.+--+-||-|.|+| |+|+-| .|+.+=.++.|.--+.|| T Consensus 106 ~~G~e~F~~~~~~~Gv~GvIip--DLP~eE~~~~~~~~~~~gl~~I~lvaPtt~~~Ri 161 (254) T PRK13139 106 KYGVERFIDEVADIGVKGLIVP--DLPPEQAQDYIAQCRAKGMAPIGIYAPTSTDERM 161 (254) T ss_pred HCCHHHHHHHHHHCCCCEEECC--CCCHHHHHHHHHHHHHCCCCEEEEECCCCCHHHH T ss_conf 7099999999997599858647--9997889999999984697579994589998999 No 114 >pfam01070 FMN_dh FMN-dependent dehydrogenase. Probab=21.07 E-value=32 Score=11.85 Aligned_cols=46 Identities=13% Similarity=0.161 Sum_probs=24.5 Q ss_pred CCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHCCCCCCHHHHHHHHHHHH Q ss_conf 45998999999865654102442024445520000011023458899999988760 Q gi|254780143|r 326 DVIDEKSLEEIFHSEIRDIPILYVDSVNNNAYIRNTLVTDKNKDRKDALLDIYRVM 381 (1386) Q Consensus 326 ~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~i~~~~~~d~~~~~~eAl~~I~k~l 381 (1386) ..++.+......+.+..-+.+.+--. ..-|.....-+++..|-+.+ T Consensus 176 GI~s~eDA~~a~~~Gv~~I~VSnHGG----------RqlD~~~~t~~~L~eI~~~v 221 (301) T pfam01070 176 GILSPEDAKRAVEAGVDGIVVSNHGG----------RQLDGAPATIDALPEIVAAV 221 (301) T ss_pred CCCCHHHHHHHHHCCCCEEEECCCCC----------CCCCCCCCHHHHHHHHHHHH T ss_conf 28999999999985999999649985----------44688867999999999985 No 115 >PRK13124 consensus Probab=21.05 E-value=37 Score=11.36 Aligned_cols=22 Identities=27% Similarity=0.600 Sum_probs=12.0 Q ss_pred CCCCCCCCCCCEEEEEECCCCCCC Q ss_conf 314336678735888630000787 Q gi|254780143|r 1088 DKMAGRHGNKGIVSRILPCEDMPF 1111 (1386) Q Consensus 1088 DKfasRHGqKGVis~i~p~eDMPf 1111 (1386) |+|+.+--+-|+-|.|+| |+|| T Consensus 103 e~F~~~~~~~Gv~GvIip--DLP~ 124 (257) T PRK13124 103 EKFFALARENGIDGLLIP--DLPL 124 (257) T ss_pred HHHHHHHHHCCCCEEECC--CCCH T ss_conf 999999997599847778--9997 No 116 >pfam08587 UBA_2 Ubiquitin associated domain (UBA). This is a UBA (ubiquitin associated) domain. Ubiquitin is involved in intracellular proteolysis. Probab=20.93 E-value=37 Score=11.35 Aligned_cols=20 Identities=25% Similarity=0.521 Sum_probs=16.3 Q ss_pred HHHHHHHHCCCCHHHHHHHH Q ss_conf 39989988099847999973 Q gi|254780143|r 213 PVTSFLMALGMDSEEILSTF 232 (1386) Q Consensus 213 Pi~ilLrALG~ssdeIl~~f 232 (1386) -++.|-++|||..|+|++.+ T Consensus 5 vv~~Ls~TMGYdkdeI~eal 24 (46) T pfam08587 5 VVSKLSKTMGYDKDEIVEAL 24 (46) T ss_pred HHHHHHHHHCCCHHHHHHHH T ss_conf 99999998088799999999 No 117 >PRK13134 consensus Probab=20.49 E-value=38 Score=11.28 Aligned_cols=23 Identities=22% Similarity=0.621 Sum_probs=11.8 Q ss_pred CCCCCCCCCCCEEEEEECCCCCCCC Q ss_conf 3143366787358886300007879 Q gi|254780143|r 1088 DKMAGRHGNKGIVSRILPCEDMPFL 1112 (1386) Q Consensus 1088 DKfasRHGqKGVis~i~p~eDMPf~ 1112 (1386) |+|+..--.-||-|.|+| |+|+- T Consensus 113 e~F~~~~~~aGvdGvIip--DLP~e 135 (257) T PRK13134 113 ERFVRDAADAGVAGCIIP--DLPLD 135 (257) T ss_pred HHHHHHHHHCCCCEEEEC--CCCHH T ss_conf 999999986798759946--99977 No 118 >cd03404 Band_7_HflK Band_7_HflK: The band 7 domain of flotillin (reggie) like proteins. This group includes proteins similar to prokaryotic HlfK (High frequency of lysogenization K). Although many members of the band 7 family are lipid raft associated, prokaryote plasma membranes lack cholesterol and are unlikely to have lipid raft domains. Individual proteins of this band 7 domain family may cluster to form membrane microdomains which may in turn recruit multiprotein complexes. Escherichia coli HflK is an integral membrane protein which may localize to the plasma membrane. HflK associates with another band 7 family member (HflC) to form an HflKC complex. HflKC interacts with FtsH in a large complex termed the FtsH holo-enzyme. FtsH is an AAA ATP-dependent protease which exerts progressive proteolysis against membrane-embedded and soluble substrate proteins. HflKC can modulate the activity of FtsH. HflKC plays a role in the decision between lysogenic and lytic cycle growth during la Probab=20.36 E-value=38 Score=11.27 Aligned_cols=27 Identities=26% Similarity=0.551 Sum_probs=19.6 Q ss_pred CCCCEEECCEEEEEEEEE-----CCCCCEEEC Q ss_conf 896289868214686651-----227852120 Q gi|254780143|r 140 KDGTFVIKGIQRIVVSQL-----HRSPGIHFD 166 (1386) Q Consensus 140 ~~GyFIING~ERVIVsQl-----~RSPGVyf~ 166 (1386) -+|.|||+-.|+.||-+. +..||.+|. T Consensus 12 ~ss~~~V~~~e~~Vv~rfGk~~~~~~pGlhf~ 43 (266) T cd03404 12 LSGFYIVQPGERGVVLRFGKYSRTVEPGLHWK 43 (266) T ss_pred HHEEEEECCCEEEEEEECCCCCCCCCCCEEEE T ss_conf 96689988984899887698667348976899 No 119 >cd02552 PseudoU_synth_TruD_like PseudoU_synth_TruD_like: Pseudouridine synthase, TruD family. This group consists of eukaryotic, bacterial and archeal pseudouridine synthases similar to Escherichia coli TruD and Saccharomyces cerevisiae Pus7. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). E. coli TruD and S. cerevisiae Pus7 make psi13 in cytoplasmic tRNAs. In addition S. cerevisiae Pus7 makes psi35 in U2 small nuclear RNA (U2 snRNA) and psi35 in pre-tRNATyr. Psi35 in U2 snRNA and psi13 in tRNAs are highly phylogenetically conserved. Psi34 is the mammalian U2 snRNA counterpart of yeast U2 snRNA psi35. Probab=20.24 E-value=38 Score=11.25 Aligned_cols=16 Identities=25% Similarity=0.457 Sum_probs=9.0 Q ss_pred HHHHHHHHCCCCHHHH Q ss_conf 3998998809984799 Q gi|254780143|r 213 PVTSFLMALGMDSEEI 228 (1386) Q Consensus 213 Pi~ilLrALG~ssdeI 228 (1386) -+-.+-|++|+...+| T Consensus 47 a~~~lA~~l~i~~~~i 62 (232) T cd02552 47 ALREIAKALGVPPRDI 62 (232) T ss_pred HHHHHHHHHCCCHHHE T ss_conf 9999999819986663 No 120 >TIGR00337 PyrG CTP synthase; InterPro: IPR004468 CTP synthase is involved in pyrimidine ribonucleotide/ribonucleoside metabolism. The enzyme catalyzes the reaction L-glutamine + H2O + UTP + ATP = CTP + phosphate + ADP + L-glutamate. The enzyme exists as a dimer of identical chains that aggregates as a tetramer. This gene has been found circa 500 bp 5 upstream of enolase in both beta (Nitrosomonas europaea) and gamma (Escherichia coli) subdivisions of Proteobacterium .; GO: 0003883 CTP synthase activity, 0006221 pyrimidine nucleotide biosynthetic process. Probab=20.19 E-value=39 Score=11.24 Aligned_cols=52 Identities=13% Similarity=0.210 Sum_probs=28.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC----CCCCCHHHHHHHHHH Q ss_conf 3333578789888888999988765311344334352----112220234655542 Q gi|254780143|r 464 VRSVGEMLKNQYRLGLLRMERSIKERISSVDIDSVMP----QDLINAKPVVSAVCE 515 (1386) Q Consensus 464 vr~vgeLl~~~fr~~l~rl~r~i~~~~~~~~~~~~~~----~~~in~~~i~~~i~~ 515 (1386) =-+||+.-..-|=-++.+|...+-..--..---++.| ..-+..||...++|+ T Consensus 151 GGTVGDIEs~PFLEAiRQ~~~e~G~Env~~iHvTLVP~i~aagE~KTKPTQhSVKe 206 (571) T TIGR00337 151 GGTVGDIESLPFLEAIRQLKKEVGRENVLFIHVTLVPYIAAAGELKTKPTQHSVKE 206 (571) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCCCCCHHHHHH T ss_conf 77000003625899999999873898679998400263144874787751278999 No 121 >pfam02037 SAP SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA/RNA binding domain found in diverse nuclear and cytoplasmic proteins. Probab=20.08 E-value=39 Score=11.23 Aligned_cols=22 Identities=18% Similarity=0.250 Sum_probs=16.2 Q ss_pred CCCCHHHHHHHHHHCCCCCCCC Q ss_conf 7889999999999868689986 Q gi|254780143|r 1212 DGADEEAINSMLRMADLDESGQ 1233 (1386) Q Consensus 1212 ~g~~~~~i~~~L~~aG~~~~Gk 1233 (1386) ...+++++.+.|.+.|++..|+ T Consensus 2 ~~ltv~eLk~~l~~~gL~~~G~ 23 (35) T pfam02037 2 SKLTVAELKEELKKRGLPTSGK 23 (35) T ss_pred CCCCHHHHHHHHHHCCCCCCCC T ss_conf 7260999999999869898888 Done!