Query 537021.9.peg.753_1 Match_columns 1033 No_of_seqs 206 out of 3477 Neff 6.8 Searched_HMMs 39220 Date Wed May 25 03:23:55 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_753.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK09074 consensus 100.0 0 0 2570.6 99.2 1024 5-1033 1-1028(1149) 2 PRK05673 dnaE DNA polymerase I 100.0 0 0 2557.5 98.9 1014 8-1033 1-1020(1143) 3 PRK08512 consensus 100.0 0 0 2537.1 96.2 1014 9-1033 2-1049(1185) 4 PRK06826 dnaE DNA polymerase I 100.0 0 0 2513.8 97.8 1007 8-1033 3-1037(1157) 5 PRK07374 dnaE DNA polymerase I 100.0 0 0 2514.6 96.0 1012 10-1033 2-1042(1171) 6 PRK07892 consensus 100.0 0 0 2510.0 96.9 1017 5-1033 1-1058(1180) 7 PRK06658 consensus 100.0 0 0 2505.3 94.8 1010 8-1033 3-1019(1145) 8 PRK06637 consensus 100.0 0 0 2495.2 96.1 1003 8-1033 2-1063(1182) 9 PRK07012 consensus 100.0 0 0 2494.7 94.1 1006 7-1033 2-1023(1173) 10 PRK08957 consensus 100.0 0 0 2483.2 97.3 1009 8-1033 3-1031(1159) 11 PRK06326 consensus 100.0 0 0 2473.3 97.4 1016 10-1033 2-1071(1240) 12 PRK08744 consensus 100.0 0 0 2472.6 94.7 1008 7-1033 2-1061(1195) 13 PRK06920 dnaE DNA polymerase I 100.0 0 0 2449.8 93.1 980 10-1033 2-984 (1107) 14 COG0587 DnaE DNA polymerase II 100.0 0 0 2451.7 87.6 1005 8-1033 1-1018(1139) 15 PRK05672 dnaE2 error-prone DNA 100.0 0 0 2434.4 92.3 981 9-1033 3-996 (1050) 16 PRK07279 dnaE DNA polymerase I 100.0 0 0 2288.5 83.5 922 11-1033 2-925 (1033) 17 PRK05898 dnaE DNA polymerase I 100.0 0 0 2250.9 83.4 930 11-1033 2-939 (971) 18 PRK07135 dnaE DNA polymerase I 100.0 0 0 2222.3 83.0 939 10-1033 2-949 (987) 19 PRK09532 DNA polymerase III su 100.0 0 0 2002.8 66.5 798 10-823 2-836 (874) 20 TIGR00594 polc DNA polymerase 100.0 0 0 1985.7 74.8 1018 11-1032 1-1108(1108) 21 PRK00448 polC DNA polymerase I 100.0 0 0 1545.8 62.3 791 11-896 333-1428(1436) 22 pfam07733 DNA_pol3_alpha Bacte 100.0 0 0 1103.3 28.8 378 309-740 1-381 (381) 23 TIGR01405 polC_Gram_pos DNA po 100.0 0 0 1027.5 48.3 796 11-896 112-1257(1264) 24 COG2176 PolC DNA polymerase II 100.0 0 0 931.2 56.6 796 11-896 336-1437(1444) 25 PRK07373 DNA polymerase III su 100.0 0 0 628.4 29.8 276 758-1033 35-319 (447) 26 LOAD_php consensus 100.0 1.7E-30 4.4E-35 276.3 20.5 200 11-225 1-205 (206) 27 pfam02811 PHP PHP domain. The 99.9 1.3E-25 3.3E-30 235.1 18.9 163 12-191 1-165 (173) 28 smart00481 POLIIIAc DNA polyme 99.6 1.9E-15 4.9E-20 149.1 7.5 66 13-78 1-66 (67) 29 PRK06361 hypothetical protein; 98.8 3.2E-07 8.1E-12 79.6 14.8 180 12-231 2-189 (216) 30 COG1387 HIS2 Histidinol phosph 98.7 1.2E-06 3.1E-11 74.6 13.4 198 13-239 4-217 (237) 31 PRK08392 hypothetical protein; 98.6 2.8E-07 7.2E-12 80.1 8.4 176 12-222 1-188 (215) 32 COG0613 Predicted metal-depend 98.5 2.1E-07 5.3E-12 81.1 5.2 68 12-81 4-72 (258) 33 PRK08609 hypothetical protein; 98.5 5.7E-06 1.5E-10 69.0 11.9 46 792-841 497-542 (570) 34 PRK07945 hypothetical protein; 98.4 1.1E-05 2.8E-10 66.5 13.0 195 13-238 99-314 (335) 35 PRK09248 putative hydrolase; V 98.4 8.5E-07 2.2E-11 76.0 6.6 177 11-223 4-200 (246) 36 PRK07328 histidinol-phosphatas 98.4 2.2E-05 5.6E-10 64.0 13.0 40 12-52 3-42 (268) 37 cd04485 DnaE_OBF DnaE_OBF: A s 98.1 3.6E-06 9.3E-11 70.6 5.1 38 996-1033 1-38 (84) 38 PRK00912 ribonuclease P protei 98.0 5.9E-05 1.5E-09 60.4 9.3 70 9-82 4-73 (237) 39 PRK08123 histidinol-phosphatas 98.0 0.00011 2.8E-09 58.2 9.7 68 13-80 4-94 (266) 40 TIGR01259 comE comEA protein; 96.8 0.0019 4.9E-08 47.6 4.2 47 837-886 72-119 (124) 41 PRK05588 histidinol-phosphatas 96.5 0.0033 8.4E-08 45.6 3.8 37 13-51 4-40 (256) 42 COG1555 ComEA DNA uptake prote 96.5 0.0059 1.5E-07 43.5 5.0 45 839-886 99-144 (149) 43 COG1491 Predicted RNA-binding 95.9 0.0099 2.5E-07 41.6 3.9 38 838-875 131-168 (202) 44 PRK04036 DNA polymerase II sma 95.7 0.091 2.3E-06 33.4 8.1 12 18-29 20-31 (497) 45 pfam01336 tRNA_anti OB-fold nu 95.6 0.026 6.6E-07 38.1 4.9 35 995-1033 1-35 (75) 46 pfam04919 DUF655 Protein of un 95.5 0.019 4.9E-07 39.2 3.9 36 838-873 117-152 (181) 47 TIGR00426 TIGR00426 competence 95.0 0.031 7.8E-07 37.4 3.9 42 840-884 21-63 (70) 48 TIGR01856 hisJ_fam histidinol 94.5 0.025 6.3E-07 38.2 2.3 37 14-50 3-40 (301) 49 PRK07329 hypothetical protein; 94.0 0.095 2.4E-06 33.3 4.5 34 14-52 4-37 (246) 50 cd04492 YhaM_OBF_like YhaM_OBF 94.0 0.094 2.4E-06 33.3 4.4 34 999-1032 4-37 (83) 51 cd04317 EcAspRS_like_N EcAspRS 93.2 0.3 7.6E-06 29.1 5.9 46 981-1031 3-48 (135) 52 COG1379 PHP family phosphoeste 93.1 0.77 2E-05 25.6 14.5 185 12-222 5-199 (403) 53 PRK02983 lysS lysyl-tRNA synth 92.7 0.61 1.5E-05 26.5 6.8 45 730-779 833-877 (1099) 54 cd04316 ND_PkAspRS_like_N ND_P 92.6 0.27 6.8E-06 29.5 5.0 45 983-1032 3-47 (108) 55 cd04490 PolII_SU_OBF PolII_SU_ 92.5 0.29 7.3E-06 29.3 5.0 35 994-1032 1-35 (79) 56 PRK10917 ATP-dependent DNA hel 90.7 1 2.6E-05 24.6 6.2 106 778-895 494-605 (677) 57 cd04483 hOBFC1_like hOBFC1_lik 90.7 0.58 1.5E-05 26.6 5.0 32 997-1033 2-33 (92) 58 pfam12228 DUF3604 Protein of u 90.1 0.33 8.4E-06 28.7 3.3 54 6-59 3-74 (592) 59 cd04488 RecG_wedge_OBF RecG_we 89.0 1.2 3E-05 24.0 5.5 36 996-1032 1-36 (75) 60 PRK13122 consensus 87.2 2.2 5.6E-05 21.8 7.8 13 55-67 92-104 (242) 61 KOG3056 consensus 86.9 1.9 4.9E-05 22.3 5.5 29 530-565 191-219 (578) 62 PRK06740 histidinol-phosphatas 86.8 2.3 5.9E-05 21.6 13.0 16 880-895 205-220 (338) 63 CHL00200 trpA tryptophan synth 86.5 2.4 6E-05 21.5 10.8 11 56-66 32-42 (263) 64 PRK13115 consensus 86.4 2.4 6.1E-05 21.4 8.4 43 27-69 9-54 (269) 65 cd04484 polC_OBF polC_OBF: A s 86.1 2.5 6.3E-05 21.4 6.0 39 994-1032 1-39 (82) 66 cd04478 RPA2_DBD_D RPA2_DBD_D: 85.9 2.1 5.3E-05 22.0 5.2 33 995-1032 2-34 (95) 67 PRK09532 DNA polymerase III su 85.0 0.64 1.6E-05 26.3 2.2 12 57-68 23-34 (874) 68 pfam06695 Sm_multidrug_ex Puta 84.5 1.9 4.8E-05 22.3 4.5 49 336-395 57-105 (121) 69 PRK00484 lysS lysyl-tRNA synth 84.1 3 7.6E-05 20.6 7.3 18 306-323 226-243 (491) 70 TIGR02628 fuculo_kin_coli L-fu 83.8 0.71 1.8E-05 25.9 2.0 41 751-800 126-174 (473) 71 COG1107 Archaea-specific RecJ- 83.3 2.6 6.6E-05 21.1 4.8 51 730-800 518-568 (715) 72 cd04323 AsnRS_cyto_like_N AsnR 82.9 3.3 8.3E-05 20.3 5.2 34 994-1032 1-34 (84) 73 pfam01120 Alpha_L_fucos Alpha- 82.3 3.4 8.7E-05 20.1 5.4 17 359-375 140-156 (344) 74 cd03524 RPA2_OBF_family RPA2_O 82.2 3.5 8.8E-05 20.1 5.1 36 996-1033 1-37 (75) 75 COG2183 Tex Transcriptional ac 81.9 0.85 2.2E-05 25.2 1.9 79 801-894 483-562 (780) 76 PRK06461 single-stranded DNA-b 81.8 3.6 9.1E-05 20.0 6.5 50 980-1032 4-58 (130) 77 pfam01710 Transposase_14 Trans 81.3 3.7 9.4E-05 19.8 5.1 85 422-532 3-88 (120) 78 cd04489 ExoVII_LU_OBF ExoVII_L 81.1 3.7 9.6E-05 19.8 4.9 36 994-1033 1-36 (78) 79 PRK12445 lysyl-tRNA synthetase 80.9 3.8 9.7E-05 19.7 9.6 19 305-323 238-256 (505) 80 PRK13139 consensus 80.7 3.8 9.8E-05 19.7 7.0 13 54-66 31-43 (254) 81 PRK13125 trpA tryptophan synth 79.8 4.1 0.0001 19.5 8.2 56 385-450 82-143 (247) 82 cd04319 PhAsnRS_like_N PhAsnRS 79.6 4.1 0.00011 19.4 5.4 16 1015-1030 17-32 (103) 83 cd04724 Tryptophan_synthase_al 79.3 4.2 0.00011 19.4 8.1 35 40-74 76-112 (242) 84 TIGR01430 aden_deam adenosine 79.0 1.2 3.2E-05 23.9 1.9 180 12-222 4-216 (346) 85 PRK13123 consensus 78.3 4.5 0.00011 19.1 7.9 40 29-69 2-45 (256) 86 TIGR02237 recomb_radB DNA repa 78.2 2.4 6.2E-05 21.4 3.2 27 198-224 139-165 (223) 87 cd04100 Asp_Lys_Asn_RS_N Asp_L 77.6 4.7 0.00012 19.0 5.0 33 994-1031 1-33 (85) 88 PRK13129 consensus 77.6 4.7 0.00012 19.0 9.9 43 26-68 3-48 (267) 89 PRK13117 consensus 77.2 4.8 0.00012 18.9 5.8 44 26-70 1-48 (268) 90 pfam04157 EAP30 EAP30/Vps36 fa 76.8 4.9 0.00013 18.8 8.8 42 849-890 173-215 (219) 91 KOG3108 consensus 76.7 4.9 0.00013 18.8 4.9 14 215-228 149-162 (265) 92 PRK02515 psbU photosystem II c 76.0 3.5 9.1E-05 20.0 3.5 56 811-885 60-115 (144) 93 COG1200 RecG RecG-like helicas 75.9 5.1 0.00013 18.6 6.5 110 777-899 499-615 (677) 94 COG1031 Uncharacterized Fe-S o 74.5 2.5 6.4E-05 21.3 2.4 34 839-875 518-551 (560) 95 PRK13121 consensus 74.5 5.5 0.00014 18.4 7.5 43 26-69 1-47 (265) 96 PRK13116 consensus 74.4 5.6 0.00014 18.4 5.5 44 26-70 1-48 (278) 97 PTZ00124 adenosine deaminase; 74.3 1.7 4.4E-05 22.6 1.6 31 12-48 38-68 (362) 98 PRK13901 ruvA Holliday junctio 73.6 5.8 0.00015 18.2 4.5 61 838-910 108-170 (196) 99 cd06546 GH18_CTS3_chitinase GH 73.3 5.9 0.00015 18.2 5.6 20 355-374 56-75 (256) 100 KOG3416 consensus 73.0 5.9 0.00015 18.1 5.9 34 999-1032 21-55 (134) 101 COG0159 TrpA Tryptophan syntha 72.9 6 0.00015 18.1 5.9 38 29-67 4-45 (265) 102 TIGR03632 bact_S11 30S ribosom 72.8 2.4 6.2E-05 21.4 2.0 43 359-401 48-101 (108) 103 pfam09547 Spore_IV_A Stage IV 72.6 6.1 0.00015 18.0 10.7 144 99-256 99-260 (492) 104 pfam10391 DNA_pol_lambd_f Fing 72.6 6.1 0.00015 18.0 4.4 41 839-885 4-44 (52) 105 PRK13131 consensus 72.1 6.2 0.00016 17.9 5.3 21 425-446 127-147 (257) 106 COG2766 PrkA Putative Ser prot 72.0 6.2 0.00016 17.9 5.7 46 605-650 339-391 (649) 107 PRK13137 consensus 71.9 6.3 0.00016 17.9 5.9 39 24-63 7-49 (266) 108 cd01990 Alpha_ANH_like_I This 71.7 6.3 0.00016 17.9 6.7 58 144-215 2-59 (202) 109 TIGR03631 bact_S13 30S ribosom 71.7 5.3 0.00014 18.5 3.6 47 830-885 8-54 (113) 110 PRK07571 bidirectional hydroge 71.6 6.2 0.00016 18.0 3.9 55 424-478 17-77 (169) 111 smart00812 Alpha_L_fucos Alpha 71.1 6.5 0.00017 17.8 5.3 18 359-376 129-146 (384) 112 pfam00416 Ribosomal_S13 Riboso 70.7 6.6 0.00017 17.7 4.2 48 830-886 8-55 (106) 113 COG1038 PycA Pyruvate carboxyl 70.4 6.1 0.00015 18.0 3.6 52 18-70 58-109 (1149) 114 PRK13119 consensus 70.3 6.7 0.00017 17.6 5.8 12 436-447 142-153 (261) 115 PRK05309 30S ribosomal protein 70.2 2.5 6.3E-05 21.3 1.6 42 360-401 65-117 (126) 116 PRK13118 consensus 70.1 6.8 0.00017 17.6 4.8 40 27-67 2-45 (269) 117 PRK13120 consensus 70.1 6.8 0.00017 17.6 5.7 44 25-69 3-51 (285) 118 cd07230 Pat_TGL4-5_like Triacy 69.6 1.9 4.7E-05 22.4 0.8 33 366-398 94-126 (421) 119 PRK13112 consensus 69.3 7 0.00018 17.5 5.9 42 27-69 3-48 (279) 120 PRK08418 chlorohydrolase; Prov 69.2 6.5 0.00017 17.8 3.6 21 9-29 58-78 (407) 121 PRK05179 rpsM 30S ribosomal pr 68.8 6.6 0.00017 17.7 3.5 47 831-886 11-57 (122) 122 cd01320 ADA Adenosine deaminas 68.6 4.3 0.00011 19.3 2.5 26 12-42 5-30 (325) 123 pfam00962 A_deaminase Adenosin 68.4 4.2 0.00011 19.4 2.4 26 12-42 5-30 (329) 124 pfam06514 PsbU Photosystem II 68.2 7.4 0.00019 17.3 5.2 44 836-885 64-107 (135) 125 COG1237 Metal-dependent hydrol 67.7 7.5 0.00019 17.2 7.9 78 132-224 157-236 (259) 126 PRK13140 consensus 67.3 7.6 0.00019 17.2 6.1 14 434-447 139-152 (257) 127 CHL00137 rps13 ribosomal prote 67.2 7.7 0.0002 17.2 4.3 28 830-857 10-37 (122) 128 TIGR03629 arch_S13P archaeal r 67.1 7.7 0.0002 17.2 4.2 47 831-886 15-61 (144) 129 PRK13113 consensus 67.0 7.7 0.0002 17.1 5.8 40 27-67 2-45 (263) 130 cd07232 Pat_PLPL Patain-like p 66.4 1.9 5E-05 22.2 0.4 39 367-411 89-127 (407) 131 PRK04053 rps13p 30S ribosomal 66.4 7.9 0.0002 17.1 4.5 47 831-886 19-65 (149) 132 TIGR00612 ispG_gcpE 4-hydroxy- 65.9 8.1 0.00021 17.0 4.8 221 149-403 21-268 (633) 133 cd04318 EcAsnRS_like_N EcAsnRS 65.6 8.2 0.00021 16.9 5.1 32 995-1031 2-35 (82) 134 PTZ00134 40S ribosomal protein 64.9 5.2 0.00013 18.6 2.4 47 831-886 24-70 (154) 135 pfam00411 Ribosomal_S11 Riboso 64.6 3.4 8.7E-05 20.2 1.4 43 359-401 48-101 (109) 136 cd04320 AspRS_cyto_N AspRS_cyt 64.4 8.6 0.00022 16.8 5.4 34 995-1032 2-35 (102) 137 cd04321 ScAspRS_mt_like_N ScAs 64.2 8.6 0.00022 16.7 5.5 34 995-1032 2-36 (86) 138 PRK00286 xseA exodeoxyribonucl 64.1 8.6 0.00022 16.7 5.1 18 96-113 90-107 (443) 139 TIGR01179 galE UDP-glucose 4-e 64.1 6.3 0.00016 17.9 2.6 23 49-71 101-123 (341) 140 PRK07228 N-ethylammeline chlor 64.1 8.7 0.00022 16.7 6.7 64 9-74 56-146 (445) 141 PRK13124 consensus 64.0 8.7 0.00022 16.7 5.5 11 436-446 135-145 (257) 142 TIGR00160 MGSA methylglyoxal s 63.3 6.5 0.00017 17.8 2.6 52 146-215 54-110 (143) 143 TIGR00010 TIGR00010 hydrolase, 63.3 8.9 0.00023 16.6 11.3 215 17-319 5-245 (269) 144 PRK09358 adenosine deaminase; 62.7 6.4 0.00016 17.8 2.5 26 12-42 12-37 (333) 145 COG0099 RpsM Ribosomal protein 62.7 8.2 0.00021 16.9 3.0 47 831-886 11-57 (121) 146 pfam08288 PIGA PIGA (GPI ancho 62.7 5.4 0.00014 18.5 2.1 23 197-219 64-86 (90) 147 PRK13132 consensus 62.4 9.2 0.00023 16.5 7.8 47 395-448 94-147 (246) 148 cd04497 hPOT1_OB1_like hPOT1_O 62.2 9.3 0.00024 16.5 6.6 33 992-1024 14-47 (138) 149 COG1311 HYS2 Archaeal DNA poly 62.0 9.3 0.00024 16.4 6.0 21 432-452 332-352 (481) 150 PRK08402 replication factor A; 61.8 9.4 0.00024 16.4 6.1 90 552-655 113-214 (357) 151 cd02071 MM_CoA_mut_B12_BD meth 60.6 9.8 0.00025 16.3 6.6 43 26-68 36-80 (122) 152 PRK00116 ruvA Holliday junctio 60.6 5.3 0.00014 18.5 1.8 16 839-854 110-125 (198) 153 PRK00945 acetyl-CoA decarbonyl 60.6 9.8 0.00025 16.3 3.3 45 193-237 48-98 (171) 154 pfam07393 Sec10 Exocyst comple 60.5 9.9 0.00025 16.3 9.4 27 875-903 660-686 (703) 155 TIGR02236 recomb_radA DNA repa 60.4 9.9 0.00025 16.2 5.6 75 839-913 35-124 (333) 156 COG3835 CdaR Sugar diacid util 60.2 10 0.00025 16.2 6.5 16 106-121 98-113 (376) 157 PRK13114 consensus 60.0 10 0.00026 16.2 5.9 31 41-71 91-123 (266) 158 PRK09482 xni exonuclease IX; P 59.9 10 0.00026 16.2 3.4 42 838-886 183-224 (256) 159 PRK10353 3-methyl-adenine DNA 59.6 10 0.00026 16.1 5.0 144 670-826 5-166 (189) 160 PRK08654 pyruvate carboxylase 59.1 10 0.00026 16.1 4.5 68 42-121 3-72 (497) 161 CHL00041 rps11 ribosomal prote 58.5 5.7 0.00015 18.3 1.6 42 360-401 62-114 (116) 162 PRK11805 N5-glutamine S-adenos 58.4 11 0.00027 16.0 4.2 75 358-449 14-96 (307) 163 pfam03934 GspK General secreti 58.2 11 0.00027 16.0 3.2 32 842-873 200-231 (282) 164 pfam08732 HIM1 HIM1. HIM1 (hig 57.9 8.1 0.00021 17.0 2.3 49 726-779 321-369 (400) 165 PRK13134 consensus 57.5 11 0.00028 15.9 5.8 45 25-70 2-50 (257) 166 COG1154 Dxs Deoxyxylulose-5-ph 57.0 11 0.00028 15.8 8.4 34 429-476 260-293 (627) 167 pfam01989 DUF126 Protein of un 56.9 5.9 0.00015 18.1 1.5 27 377-405 37-63 (83) 168 TIGR02706 P_butyryltrans phosp 56.4 11 0.00028 15.9 2.8 85 234-319 112-200 (295) 169 COG0632 RuvA Holliday junction 56.3 10 0.00026 16.2 2.6 68 838-908 109-180 (201) 170 pfam04007 DUF354 Protein of un 56.1 11 0.00029 15.8 2.8 18 200-217 95-112 (335) 171 COG1570 XseA Exonuclease VII, 56.0 12 0.00029 15.7 4.8 23 95-117 89-111 (440) 172 cd01312 Met_dep_hydrolase_D Me 55.7 7.9 0.0002 17.1 2.0 62 9-70 31-92 (381) 173 TIGR00593 pola DNA polymerase 55.7 12 0.0003 15.6 3.0 65 511-585 639-714 (1005) 174 TIGR01011 rpsB_bact ribosomal 55.7 9.6 0.00024 16.4 2.4 14 881-894 116-129 (227) 175 COG0100 RpsK Ribosomal protein 55.6 7.8 0.0002 17.1 1.9 27 375-401 90-119 (129) 176 TIGR03628 arch_S11P archaeal r 55.2 6.7 0.00017 17.7 1.5 27 375-401 83-112 (114) 177 PRK12366 replication factor A; 55.2 12 0.0003 15.6 6.3 14 529-542 189-202 (649) 178 TIGR00315 cdhB CO dehydrogenas 55.0 7.9 0.0002 17.0 1.9 27 193-219 42-68 (170) 179 PRK13138 consensus 54.8 12 0.00031 15.5 5.4 16 52-67 26-41 (264) 180 TIGR00617 rpa1 replication fac 54.1 12 0.00031 15.5 5.0 54 168-223 236-291 (671) 181 cd01356 AcnX_swivel Putative A 53.4 6.3 0.00016 17.9 1.2 27 377-405 52-78 (123) 182 TIGR02002 PTS-II-BC-glcB PTS s 53.1 4.5 0.00012 19.1 0.4 12 175-186 240-251 (518) 183 PRK03955 hypothetical protein; 52.9 6.7 0.00017 17.6 1.2 22 377-398 60-81 (132) 184 TIGR02104 pulA_typeI pullulana 52.9 11 0.00028 15.8 2.3 62 569-633 337-407 (655) 185 PRK06380 metal-dependent hydro 52.8 6.7 0.00017 17.7 1.2 59 9-70 54-115 (418) 186 cd07229 Pat_TGL3_like Triacylg 52.8 11 0.00029 15.8 2.4 43 364-412 102-144 (391) 187 KOG1111 consensus 52.5 12 0.00032 15.4 2.6 44 14-64 93-136 (426) 188 PRK12833 acetyl-CoA carboxylas 52.5 13 0.00033 15.3 4.8 30 40-71 4-33 (458) 189 TIGR02386 rpoC_TIGR DNA-direct 52.5 13 0.00033 15.3 2.8 31 509-539 260-292 (1552) 190 COG1786 Swiveling domain assoc 52.3 6.7 0.00017 17.7 1.1 22 377-398 59-80 (131) 191 COG1880 CdhB CO dehydrogenase/ 52.1 13 0.00033 15.2 2.9 27 193-219 48-74 (170) 192 PRK13135 consensus 51.7 13 0.00034 15.2 5.5 40 27-67 2-45 (267) 193 cd07206 Pat_TGL3-4-5_SDP1 Tria 51.4 8.6 0.00022 16.7 1.6 32 365-396 89-120 (298) 194 TIGR01235 pyruv_carbox pyruvat 51.0 14 0.00034 15.1 3.5 52 18-70 56-107 (1169) 195 COG4955 Uncharacterized protei 50.8 14 0.00035 15.1 3.0 37 606-650 247-283 (343) 196 KOG3142 consensus 50.3 14 0.00035 15.0 4.4 93 749-842 39-132 (187) 197 pfam02679 ComA (2R)-phospho-3- 50.0 14 0.00036 15.0 5.9 12 430-441 210-221 (245) 198 COG0556 UvrB Helicase subunit 49.8 13 0.00033 15.3 2.3 229 162-422 65-351 (663) 199 pfam00289 CPSase_L_chain Carba 49.8 14 0.00036 14.9 4.0 26 44-71 4-29 (109) 200 TIGR02283 MltB_2 lytic murein 49.0 14 0.00037 14.9 4.1 83 710-792 59-149 (313) 201 pfam01301 Glyco_hydro_35 Glyco 48.9 14 0.00037 14.8 5.7 49 30-78 27-88 (317) 202 KOG0140 consensus 48.9 10 0.00026 16.1 1.7 51 713-763 319-370 (408) 203 PRK08462 biotin carboxylase; V 48.9 14 0.00037 14.8 4.5 70 40-121 3-74 (446) 204 cd00128 XPG Xeroderma pigmento 48.8 12 0.0003 15.6 2.0 42 398-440 264-305 (316) 205 PRK05298 excinuclease ABC subu 48.7 6.3 0.00016 17.9 0.5 204 162-389 66-330 (657) 206 COG4956 Integral membrane prot 48.5 3.4 8.6E-05 20.2 -0.9 35 955-990 252-287 (356) 207 TIGR02472 sucr_P_syn_N sucrose 48.5 9.5 0.00024 16.4 1.4 15 201-215 130-144 (445) 208 pfam00091 Tubulin Tubulin/FtsZ 48.2 11 0.00028 15.8 1.7 46 334-393 102-147 (210) 209 KOG2499 consensus 48.1 11 0.00028 15.8 1.7 15 360-374 252-266 (542) 210 PRK08463 acetyl-CoA carboxylas 48.1 15 0.00038 14.7 4.4 68 42-121 3-71 (478) 211 PRK07075 isochorismate-pyruvat 48.0 15 0.00038 14.7 5.3 35 721-755 50-84 (101) 212 cd03320 OSBS o-Succinylbenzoat 47.6 15 0.00038 14.7 5.1 25 350-374 206-230 (263) 213 KOG0369 consensus 47.4 15 0.00039 14.7 3.7 47 23-70 89-135 (1176) 214 pfam09194 Endonuc-BsobI Restri 47.3 9.7 0.00025 16.3 1.3 34 603-642 106-140 (316) 215 PRK11337 DNA-binding transcrip 47.2 12 0.00031 15.5 1.8 17 460-476 52-68 (293) 216 COG0337 AroB 3-dehydroquinate 47.1 15 0.00039 14.6 5.4 18 803-820 289-306 (360) 217 TIGR00506 ribB 3,4-dihydroxy-2 46.8 11 0.00029 15.8 1.6 31 344-374 185-220 (230) 218 smart00475 53EXOc 5'-3' exonuc 46.5 16 0.0004 14.6 3.3 31 838-873 187-217 (259) 219 TIGR00387 glcD glycolate oxida 46.5 11 0.00027 15.9 1.5 135 341-531 256-397 (437) 220 cd00008 53EXOc 5'-3' exonuclea 46.4 16 0.0004 14.6 3.5 30 839-873 185-214 (240) 221 PRK05395 3-dehydroquinate dehy 46.1 12 0.0003 15.6 1.6 19 346-364 122-140 (143) 222 TIGR00689 rpiB_lacA_lacB sugar 45.7 11 0.00028 15.9 1.4 36 197-232 13-48 (146) 223 KOG3717 consensus 45.5 5.8 0.00015 18.2 -0.0 53 390-455 141-194 (612) 224 PRK08104 consensus 45.4 16 0.00041 14.4 6.4 119 27-162 6-128 (212) 225 TIGR00644 recJ single-stranded 45.4 16 0.00041 14.4 4.7 52 171-223 75-129 (705) 226 PTZ00129 40S ribosomal protein 45.3 12 0.00031 15.5 1.6 27 375-401 108-137 (150) 227 pfam01367 5_3_exonuc 5'-3' exo 45.3 16 0.00041 14.4 2.9 31 838-873 19-49 (100) 228 pfam08423 Rad51 Rad51. Rad51 i 45.3 16 0.00041 14.4 2.9 23 197-219 172-194 (261) 229 COG0052 RpsB Ribosomal protein 45.2 16 0.00041 14.4 4.1 64 404-469 27-91 (252) 230 PRK11507 hypothetical protein; 45.0 16 0.00042 14.4 4.6 57 961-1017 4-61 (70) 231 cd05152 MPH2' Macrolide 2'-Pho 45.0 12 0.0003 15.7 1.4 15 335-349 46-60 (276) 232 pfam00633 HHH Helix-hairpin-he 44.9 12 0.00031 15.4 1.6 18 838-855 12-29 (30) 233 PRK04301 radA DNA repair and r 44.9 16 0.00042 14.4 2.9 18 200-217 236-253 (318) 234 COG2231 Uncharacterized protei 44.7 12 0.0003 15.7 1.4 12 165-176 46-57 (215) 235 COG2426 Predicted membrane pro 44.6 17 0.00042 14.3 3.0 68 308-394 46-114 (142) 236 COG0384 Predicted epimerase, P 44.1 12 0.0003 15.6 1.3 90 334-424 170-270 (291) 237 pfam03659 Glyco_hydro_71 Glyco 44.0 17 0.00043 14.3 3.9 63 11-75 3-69 (386) 238 PRK05929 consensus 43.7 17 0.00043 14.2 3.2 72 893-972 720-800 (870) 239 COG1148 HdrA Heterodisulfide r 43.7 17 0.00043 14.2 4.8 36 26-64 15-53 (622) 240 PRK07898 consensus 43.6 17 0.00044 14.2 3.7 76 893-975 749-833 (902) 241 pfam03344 Daxx Daxx Family. Th 43.5 17 0.00044 14.2 3.9 55 402-465 280-336 (715) 242 PRK05586 biotin carboxylase; V 43.3 17 0.00044 14.2 4.9 68 42-121 3-72 (447) 243 COG1263 PtsG Phosphotransferas 43.1 11 0.00027 16.0 1.0 14 425-438 128-141 (393) 244 cd00530 PTE Phosphotriesterase 43.0 17 0.00044 14.2 5.7 74 141-218 6-82 (293) 245 cd00466 DHQase_II Dehydroquina 42.9 14 0.00037 14.8 1.7 17 346-362 120-136 (140) 246 PRK11302 DNA-binding transcrip 42.8 17 0.00045 14.1 2.1 36 179-216 175-210 (284) 247 TIGR00021 rpiA ribose 5-phosph 42.7 18 0.00045 14.1 3.1 63 804-868 55-120 (236) 248 PRK05907 hypothetical protein; 42.5 12 0.0003 15.7 1.1 18 607-624 62-79 (311) 249 PRK07178 acetyl-CoA carboxylas 42.4 18 0.00045 14.1 4.5 68 42-121 3-71 (471) 250 PRK08097 ligB NAD-dependent DN 42.1 18 0.00046 14.1 4.4 18 839-856 523-540 (563) 251 TIGR03314 Se_ssnA putative sel 42.1 18 0.00046 14.1 5.9 22 632-654 362-383 (441) 252 PRK09875 putative hydrolase; P 42.0 18 0.00046 14.1 6.1 72 141-218 12-84 (292) 253 KOG0932 consensus 41.8 18 0.00046 14.0 5.3 14 425-438 372-385 (774) 254 COG0821 gcpE 1-hydroxy-2-methy 41.7 18 0.00046 14.0 2.7 164 163-374 31-197 (361) 255 cd07218 Pat_iPLA2 Calcium-inde 41.4 16 0.0004 14.5 1.7 22 376-398 33-54 (245) 256 TIGR01012 Sa_S2_E_A ribosomal 41.4 13 0.00032 15.3 1.2 20 198-217 122-141 (197) 257 TIGR00873 gnd 6-phosphoglucona 41.4 11 0.00027 16.0 0.8 78 393-475 268-364 (480) 258 cd04491 SoSSB_OBF SoSSB_OBF: A 41.3 18 0.00047 14.0 4.3 36 997-1032 2-41 (82) 259 COG0402 SsnA Cytosine deaminas 41.3 18 0.00047 14.0 5.8 61 9-69 57-119 (421) 260 smart00146 PI3Kc Phosphoinosit 41.3 13 0.00032 15.3 1.1 26 378-405 91-116 (202) 261 cd02742 GH20_hexosaminidase Be 41.3 9.2 0.00023 16.5 0.4 16 360-375 74-89 (303) 262 PRK03980 flap endonuclease-1; 41.2 18 0.00047 14.0 3.9 58 397-465 226-283 (295) 263 PTZ00254 40S ribosomal protein 41.1 18 0.00047 13.9 2.6 22 196-217 129-150 (242) 264 smart00278 HhH1 Helix-hairpin- 41.0 15 0.00038 14.8 1.5 19 839-857 3-21 (26) 265 TIGR00884 guaA_Cterm GMP synth 40.9 19 0.00047 13.9 5.5 112 141-295 17-133 (319) 266 PRK05973 replicative DNA helic 40.8 16 0.00042 14.4 1.7 19 198-216 173-191 (237) 267 PRK12999 pyruvate carboxylase; 40.4 19 0.00048 13.9 4.5 45 24-69 62-106 (1147) 268 cd00142 PI3Kc_like Phosphoinos 40.0 12 0.0003 15.6 0.9 26 378-405 118-143 (219) 269 cd00443 ADA_AMPD Adenosine/AMP 39.9 14 0.00035 15.0 1.2 22 12-38 4-25 (305) 270 pfam06026 Rib_5-P_isom_A Ribos 39.6 19 0.00049 13.8 2.3 52 805-857 10-63 (172) 271 TIGR01325 O_suc_HS_sulf O-succ 39.6 5.3 0.00014 18.5 -1.0 30 596-626 119-148 (386) 272 PRK06111 acetyl-CoA carboxylas 39.5 19 0.00049 13.8 4.5 69 42-122 3-73 (449) 273 TIGR00639 PurN phosphoribosylg 39.4 19 0.0005 13.7 2.1 159 28-249 13-180 (215) 274 KOG0555 consensus 39.3 20 0.0005 13.7 3.1 109 307-437 289-400 (545) 275 PRK07556 consensus 39.1 20 0.0005 13.7 3.8 74 893-973 821-903 (977) 276 KOG0538 consensus 39.1 20 0.0005 13.7 2.7 29 193-221 132-161 (363) 277 pfam09862 DUF2089 Protein of u 38.9 18 0.00045 14.1 1.6 20 453-472 47-66 (113) 278 KOG0741 consensus 38.8 16 0.00041 14.4 1.4 12 401-412 380-391 (744) 279 pfam10481 Cenp-F_N Cenp-F N-te 38.7 20 0.00051 13.7 5.3 131 269-419 123-274 (288) 280 cd07220 Pat_PNPLA2 Patatin-lik 38.7 15 0.00039 14.6 1.3 21 378-399 41-61 (249) 281 PRK13978 ribose-5-phosphate is 38.4 20 0.00051 13.6 2.8 52 805-857 60-113 (228) 282 cd00893 PI4Kc_III Phosphoinosi 38.4 12 0.0003 15.6 0.6 17 378-394 129-145 (289) 283 PRK08393 N-ethylammeline chlor 38.3 20 0.00051 13.6 2.5 61 9-71 52-115 (419) 284 KOG3666 consensus 38.2 20 0.00052 13.6 3.9 14 729-742 873-886 (1141) 285 TIGR01381 E1_like_apg7 E1-like 38.0 20 0.00052 13.6 5.1 29 209-237 220-248 (689) 286 PRK07625 consensus 38.0 20 0.00052 13.6 3.6 29 939-967 818-846 (922) 287 PRK00702 ribose-5-phosphate is 37.9 20 0.00052 13.6 2.4 53 804-857 57-111 (222) 288 PRK13690 hypothetical protein; 37.8 21 0.00052 13.6 6.0 73 141-217 24-102 (183) 289 PRK11282 glcE glycolate oxidas 37.8 15 0.00039 14.7 1.1 22 360-382 7-28 (351) 290 pfam10706 Aminoglyc_resit Amin 37.7 21 0.00052 13.5 2.0 156 359-585 8-170 (174) 291 TIGR02855 spore_yabG sporulati 37.7 21 0.00052 13.5 3.2 35 189-226 118-152 (292) 292 KOG3172 consensus 37.6 7 0.00018 17.5 -0.6 16 368-383 87-102 (119) 293 PRK07539 NADH dehydrogenase su 37.5 21 0.00053 13.5 4.2 48 431-478 10-63 (154) 294 PRK11557 putative DNA-binding 37.4 21 0.00053 13.5 1.8 17 199-215 197-213 (282) 295 COG1649 Uncharacterized protei 37.4 18 0.00047 13.9 1.5 67 163-229 62-149 (418) 296 pfam07570 consensus 37.3 18 0.00045 14.1 1.4 25 365-390 9-33 (45) 297 TIGR03573 WbuX N-acetyl sugar 37.2 21 0.00053 13.5 9.0 124 28-218 44-170 (343) 298 PRK08591 acetyl-CoA carboxylas 37.0 21 0.00054 13.5 4.7 68 42-121 3-72 (449) 299 COG0083 ThrB Homoserine kinase 36.9 18 0.00045 14.1 1.4 27 256-282 173-202 (299) 300 PRK12311 rpsB 30S ribosomal pr 36.9 21 0.00054 13.5 2.6 12 340-351 145-156 (332) 301 cd01398 RPI_A RPI_A: Ribose 5- 36.9 21 0.00054 13.5 2.3 51 804-855 53-105 (213) 302 KOG3311 consensus 36.9 21 0.00054 13.5 2.5 48 832-888 23-70 (152) 303 TIGR02836 spore_IV_A stage IV 36.9 21 0.00054 13.5 5.5 109 100-219 100-220 (492) 304 COG0120 RpiA Ribose 5-phosphat 36.6 21 0.00054 13.4 2.9 53 804-857 57-111 (227) 305 PRK07300 consensus 36.6 21 0.00054 13.4 3.9 34 939-972 776-809 (880) 306 PRK08786 consensus 36.6 21 0.00054 13.4 3.4 33 939-971 823-855 (927) 307 COG4294 Uve UV damage repair e 36.4 21 0.00055 13.4 4.0 41 196-236 218-263 (347) 308 PRK07213 chlorohydrolase; Prov 36.3 21 0.00054 13.5 1.6 62 9-70 50-115 (378) 309 pfam04219 DUF413 Protein of un 36.3 22 0.00055 13.4 3.2 49 702-752 15-71 (94) 310 cd05168 PI4Kc_III_beta Phospho 36.3 13 0.00034 15.2 0.6 17 378-394 131-147 (293) 311 TIGR01370 cysRS possible cyste 36.2 20 0.00052 13.6 1.6 81 333-433 147-228 (325) 312 pfam01152 Bac_globin Bacterial 36.0 22 0.00055 13.4 7.1 72 692-763 43-120 (120) 313 CHL00067 rps2 ribosomal protei 35.7 22 0.00056 13.3 3.1 19 98-116 39-57 (227) 314 PRK10760 murein hydrolase B; P 35.5 22 0.00056 13.3 3.1 27 366-397 198-224 (357) 315 PRK09045 N-ethylammeline chlor 35.5 18 0.00046 14.0 1.2 19 9-27 67-85 (444) 316 COG1748 LYS9 Saccharopine dehy 35.5 22 0.00056 13.3 4.0 91 304-401 168-265 (389) 317 smart00483 POLXc DNA polymeras 35.5 22 0.00056 13.3 4.6 12 388-399 106-117 (334) 318 cd01321 ADGF Adenosine deamina 35.4 22 0.00056 13.3 2.0 24 11-39 27-50 (345) 319 cd07222 Pat_PNPLA4 Patatin-lik 35.3 22 0.00057 13.3 2.6 24 376-400 34-57 (246) 320 pfam04260 DUF436 Protein of un 35.3 22 0.00057 13.3 5.9 73 141-217 17-95 (172) 321 COG0122 AlkA 3-methyladenine D 35.2 22 0.00057 13.3 3.5 97 738-856 115-217 (285) 322 PRK09239 chorismate mutase; Pr 34.7 23 0.00058 13.2 6.0 32 724-755 56-87 (104) 323 pfam01049 Cadherin_C Cadherin 34.7 11 0.00029 15.8 0.0 12 376-387 102-113 (145) 324 pfam00728 Glyco_hydro_20 Glyco 34.7 23 0.00058 13.2 2.0 16 360-375 72-87 (335) 325 COG1737 RpiR Transcriptional r 34.6 23 0.00058 13.2 1.9 37 179-217 177-213 (281) 326 TIGR01448 recD_rel helicase, R 34.5 22 0.00056 13.3 1.5 20 568-587 453-472 (769) 327 pfam07579 DUF1548 Domain of Un 34.5 23 0.00058 13.2 2.7 74 379-474 25-107 (135) 328 pfam08440 Poty_PP Potyviridae 34.5 17 0.00043 14.3 0.9 50 359-412 121-172 (274) 329 KOG0400 consensus 34.5 23 0.00058 13.2 2.5 18 738-755 77-94 (151) 330 PRK07203 putative chlorohydrol 34.4 23 0.00058 13.2 5.2 22 631-653 362-383 (442) 331 TIGR02198 rfaE_dom_I rfaE bifu 34.2 23 0.00059 13.1 1.7 30 186-215 151-182 (321) 332 COG4865 Glutamate mutase epsil 34.1 23 0.00059 13.1 3.2 78 677-778 234-314 (485) 333 TIGR02248 mutH_TIGR DNA mismat 33.9 23 0.00058 13.2 1.5 25 53-77 43-73 (220) 334 cd00896 PI3Kc_III Phosphoinosi 33.9 15 0.00039 14.7 0.6 43 571-624 108-150 (350) 335 cd01425 RPS2 Ribosomal protein 33.9 23 0.00059 13.1 2.3 67 401-469 16-83 (193) 336 TIGR02025 BchH magnesium chela 33.8 23 0.00059 13.1 1.9 710 196-1026 366-1256(1384) 337 cd05164 PIKKc Phosphoinositide 33.8 19 0.00049 13.8 1.1 26 377-404 120-145 (222) 338 PRK00074 guaA GMP synthase; Re 33.7 23 0.0006 13.1 6.6 16 458-473 372-387 (513) 339 pfam00454 PI3_PI4_kinase Phosp 33.6 20 0.0005 13.7 1.1 21 377-397 121-141 (233) 340 cd00561 CobA_CobO_BtuR ATP:cor 33.6 23 0.0006 13.1 2.7 51 344-400 103-157 (159) 341 pfam07905 PucR Purine cataboli 33.6 23 0.0006 13.1 2.7 69 138-216 38-106 (122) 342 PTZ00010 beta tubulin; Provisi 33.6 23 0.0006 13.1 1.6 46 333-392 112-157 (443) 343 cd01996 Alpha_ANH_like_III Thi 33.5 24 0.0006 13.0 6.3 101 50-217 10-111 (154) 344 PRK13126 consensus 33.2 24 0.00061 13.0 5.6 32 415-448 101-132 (237) 345 TIGR00331 hrcA heat-inducible 33.2 24 0.00061 13.0 3.9 111 424-547 1-144 (343) 346 KOG2836 consensus 33.2 24 0.00061 13.0 4.3 33 44-76 19-52 (173) 347 COG5249 RER1 Golgi protein inv 33.0 16 0.00042 14.4 0.7 23 383-411 57-79 (180) 348 cd02186 alpha_tubulin The tubu 33.0 18 0.00047 14.0 0.9 45 333-391 113-157 (434) 349 cd05166 PI3Kc_II Phosphoinosit 33.0 16 0.00042 14.4 0.6 44 570-624 102-145 (353) 350 KOG2805 consensus 32.8 24 0.00061 13.0 2.8 214 144-441 9-242 (377) 351 cd01305 archeal_chlorohydrolas 32.8 24 0.00062 13.0 2.2 64 9-72 4-71 (263) 352 PRK02227 hypothetical protein; 32.7 24 0.00062 13.0 5.5 16 360-375 169-184 (239) 353 COG1448 TyrB Aspartate/tyrosin 32.7 24 0.00062 12.9 6.4 125 741-888 263-391 (396) 354 cd05173 PI3Kc_IA_beta Phosphoi 32.6 20 0.00052 13.6 1.1 87 521-624 55-149 (362) 355 TIGR02392 rpoH_proteo alternat 32.5 19 0.00048 13.9 0.9 221 333-635 33-277 (279) 356 TIGR01755 flav_wrbA flavoprote 32.4 24 0.00062 12.9 4.4 142 50-230 8-163 (205) 357 pfam00318 Ribosomal_S2 Ribosom 32.4 24 0.00062 12.9 3.3 67 401-469 16-83 (205) 358 KOG3718 consensus 32.3 24 0.00062 12.9 2.1 19 358-379 133-151 (609) 359 COG2089 SpsE Sialic acid synth 32.3 25 0.00063 12.9 8.2 19 50-68 27-45 (347) 360 cd01295 AdeC Adenine deaminase 32.2 25 0.00063 12.9 4.1 60 8-72 8-73 (422) 361 PRK02261 methylaspartate mutas 32.2 25 0.00063 12.9 4.9 13 333-345 122-134 (137) 362 cd00891 PI3Kc Phosphoinositide 32.2 17 0.00043 14.2 0.6 44 570-624 102-145 (352) 363 cd05167 PI4Kc_III_alpha Phosph 32.2 21 0.00054 13.5 1.1 15 609-623 93-107 (311) 364 KOG3120 consensus 32.2 24 0.00061 13.0 1.4 14 224-237 121-134 (256) 365 TIGR02479 FliA_WhiG RNA polyme 32.1 25 0.00063 12.9 2.6 21 456-476 195-215 (227) 366 KOG0023 consensus 32.0 25 0.00063 12.9 2.6 48 577-624 305-353 (360) 367 COG4464 CapC Capsular polysacc 32.0 25 0.00063 12.9 9.0 188 12-232 2-210 (254) 368 cd05013 SIS_RpiR RpiR-like pro 31.9 25 0.00063 12.9 2.2 41 178-220 59-99 (139) 369 PRK07217 replication factor A; 31.8 25 0.00063 12.8 4.2 21 686-710 229-249 (311) 370 KOG3818 consensus 31.7 25 0.00064 12.8 3.3 91 563-657 250-368 (525) 371 pfam08585 DUF1767 Domain of un 31.6 25 0.00064 12.8 3.4 45 805-851 42-86 (89) 372 TIGR02257 cobalto_cobN cobalto 31.6 25 0.00064 12.8 4.2 66 157-226 223-294 (1310) 373 COG1070 XylB Sugar (pentulose 31.3 25 0.00065 12.8 2.9 13 355-367 156-168 (502) 374 PRK08434 consensus 31.2 25 0.00065 12.8 3.9 19 947-965 793-811 (887) 375 PRK13015 3-dehydroquinate dehy 31.2 19 0.00047 13.9 0.7 23 346-368 122-144 (148) 376 TIGR00314 cdhA CO dehydrogenas 31.2 25 0.00065 12.8 2.1 253 108-451 65-337 (795) 377 cd05176 PI3Kc_C2_alpha Phospho 31.1 22 0.00057 13.3 1.1 87 521-624 51-145 (353) 378 cd07231 Pat_SDP1-like Sugar-De 31.1 23 0.00059 13.1 1.2 29 365-393 88-116 (323) 379 cd05165 PI3Kc_I Phosphoinositi 31.0 18 0.00047 14.0 0.6 87 520-624 55-153 (366) 380 PRK08835 consensus 30.7 26 0.00066 12.7 3.5 25 941-965 828-852 (931) 381 cd00892 PIKKc_ATR ATR (Ataxia 30.6 23 0.00059 13.1 1.1 27 377-405 128-154 (237) 382 PRK10428 putative stress-respo 30.6 21 0.00053 13.5 0.8 28 414-441 23-50 (69) 383 TIGR02354 thiF_fam2 thiamine b 30.6 8.7 0.00022 16.7 -1.1 17 145-161 26-42 (200) 384 cd02189 delta_tubulin The tubu 30.5 23 0.00058 13.2 1.0 49 333-395 108-156 (446) 385 PRK05797 consensus 30.5 26 0.00066 12.7 3.9 26 940-965 766-791 (869) 386 PRK07456 consensus 30.5 26 0.00066 12.7 3.9 10 804-813 833-842 (975) 387 KOG1095 consensus 30.3 26 0.00067 12.7 2.9 74 624-723 778-863 (1025) 388 cd02922 FCB2_FMN Flavocytochro 30.3 26 0.00067 12.7 3.1 22 198-219 84-105 (344) 389 cd02190 epsilon_tubulin The tu 30.3 24 0.00061 13.0 1.1 46 334-393 82-127 (379) 390 TIGR01859 fruc_bis_ald_ fructo 30.2 25 0.00064 12.8 1.2 165 27-251 2-197 (339) 391 TIGR01307 pgm_bpd_ind 2,3-bisp 30.2 26 0.00067 12.6 3.0 17 196-212 133-149 (529) 392 TIGR01496 DHPS dihydropteroate 30.2 26 0.00067 12.6 3.1 14 200-213 117-130 (268) 393 PRK10053 hypothetical protein; 30.1 26 0.00067 12.6 5.9 47 977-1031 46-92 (130) 394 pfam12116 SpoIIID Stage III sp 30.1 17 0.00045 14.1 0.4 32 441-473 6-37 (82) 395 COG0757 AroQ 3-dehydroquinate 30.1 25 0.00064 12.8 1.2 22 346-367 121-142 (146) 396 PRK06687 chlorohydrolase; Vali 30.0 27 0.00068 12.6 6.2 60 9-70 58-120 (422) 397 cd02072 Glm_B12_BD B12 binding 30.0 27 0.00068 12.6 4.9 41 27-68 37-80 (128) 398 COG1087 GalE UDP-glucose 4-epi 29.9 27 0.00068 12.6 2.7 39 429-468 236-274 (329) 399 pfam03352 Adenine_glyco Methyl 29.9 27 0.00068 12.6 4.4 63 675-737 5-75 (179) 400 KOG0556 consensus 29.8 27 0.00068 12.6 5.3 27 732-762 470-496 (533) 401 cd06364 PBP1_CaSR Ligand-bindi 29.6 27 0.00068 12.6 6.0 80 371-460 121-206 (510) 402 pfam02310 B12-binding B12 bind 29.6 27 0.00069 12.6 5.7 18 29-46 40-57 (121) 403 PRK10702 endonuclease III; Pro 29.6 27 0.00069 12.6 2.4 17 375-392 116-132 (211) 404 cd02809 alpha_hydroxyacid_oxid 29.6 27 0.00069 12.6 3.1 13 348-360 138-150 (299) 405 PRK11230 glycolate oxidase sub 29.5 27 0.00069 12.5 1.4 20 360-380 68-87 (499) 406 cd01916 ACS_1 Acetyl-CoA synth 29.5 27 0.00069 12.5 1.8 76 356-449 209-295 (731) 407 pfam03918 CcmH Cytochrome C bi 29.4 27 0.00069 12.5 1.6 16 196-211 27-42 (145) 408 PRK10144 formate-dependent nit 29.3 27 0.00069 12.5 1.6 13 198-210 32-44 (126) 409 KOG2781 consensus 29.2 27 0.00069 12.5 2.1 56 334-389 93-154 (290) 410 cd04487 RecJ_OBF2_like RecJ_OB 29.2 27 0.00069 12.5 3.6 33 995-1032 1-33 (73) 411 pfam03248 Rer1 Rer1 family. RE 29.1 15 0.00038 14.8 -0.1 23 385-413 51-73 (175) 412 COG3066 MutH DNA mismatch repa 29.1 27 0.0007 12.5 1.8 16 693-708 15-30 (229) 413 TIGR03147 cyt_nit_nrfF cytochr 29.1 27 0.0007 12.5 1.6 14 197-210 31-44 (126) 414 cd06569 GH20_Sm-chitobiase-lik 29.0 27 0.0007 12.5 1.4 23 360-382 99-126 (445) 415 cd05171 PIKKc_ATM Ataxia telan 29.0 5.2 0.00013 18.6 -2.5 43 377-423 170-212 (279) 416 PRK07997 consensus 28.9 28 0.0007 12.5 3.5 29 939-967 824-852 (928) 417 pfam08549 SWI-SNF_Ssr4 Fungal 28.8 28 0.0007 12.5 2.8 19 424-442 290-308 (669) 418 COG1190 LysU Lysyl-tRNA synthe 28.8 28 0.00071 12.5 5.5 12 308-319 237-248 (502) 419 KOG3719 consensus 28.7 28 0.00071 12.5 4.8 25 627-651 431-457 (638) 420 TIGR01787 squalene_cyclas squa 28.5 28 0.00071 12.4 1.8 54 553-614 356-415 (716) 421 cd05174 PI3Kc_IA_delta Phospho 28.5 27 0.00068 12.6 1.1 87 521-624 55-149 (361) 422 pfam01257 Complex1_24kDa Respi 28.5 28 0.00071 12.4 2.5 39 430-480 18-56 (145) 423 PRK05070 DNA mismatch repair p 28.4 28 0.00071 12.4 1.9 13 49-61 40-52 (218) 424 pfam07340 Herpes_IE1 Cytomegal 28.4 24 0.00062 12.9 0.9 167 221-413 77-263 (392) 425 PRK13913 3-methyladenine DNA g 28.3 28 0.00072 12.4 2.5 11 256-266 166-176 (218) 426 TIGR00855 L12 ribosomal protei 28.2 23 0.0006 13.1 0.8 45 694-757 77-125 (131) 427 PRK04207 glyceraldehyde-3-phos 28.1 28 0.00072 12.4 2.4 45 426-473 234-278 (338) 428 PRK09177 xanthine-guanine phos 28.1 28 0.00072 12.4 1.3 26 374-399 36-62 (156) 429 KOG1091 consensus 28.0 28 0.00072 12.4 1.3 32 355-393 295-327 (625) 430 PRK06182 short chain dehydroge 28.0 28 0.00073 12.4 5.2 47 742-793 223-269 (273) 431 cd07224 Pat_like Patatin-like 27.9 28 0.00073 12.4 4.1 40 356-399 13-54 (233) 432 cd00895 PI3Kc_C2_beta Phosphoi 27.9 27 0.0007 12.5 1.1 87 521-624 52-146 (354) 433 COG4034 Uncharacterized protei 27.8 29 0.00073 12.3 5.4 53 23-75 87-140 (328) 434 TIGR00631 uvrb excinuclease AB 27.8 29 0.00073 12.3 3.4 208 161-389 61-327 (667) 435 pfam10087 DUF2325 Uncharacteri 27.8 29 0.00073 12.3 2.3 19 197-215 62-80 (96) 436 KOG0624 consensus 27.8 29 0.00073 12.3 2.7 43 16-62 22-64 (504) 437 TIGR00789 flhB_rel FlhB domain 27.7 29 0.00073 12.3 2.9 36 195-240 27-62 (84) 438 TIGR01326 OAH_OAS_sulfhy O-ace 27.6 29 0.00073 12.3 4.1 223 171-467 133-373 (434) 439 pfam05301 Mec-17 Touch recepto 27.6 29 0.00074 12.3 2.6 34 403-439 67-100 (120) 440 cd00894 PI3Kc_IB_gamma Phospho 27.6 26 0.00067 12.6 1.0 44 570-624 111-154 (365) 441 pfam03102 NeuB NeuB family. Ne 27.6 29 0.00074 12.3 7.7 70 164-240 125-197 (240) 442 PRK12460 2-keto-3-deoxyglucona 27.6 29 0.00074 12.3 1.8 42 354-398 229-270 (308) 443 COG1735 Php Predicted metal-de 27.5 29 0.00074 12.3 3.2 20 199-218 79-98 (316) 444 cd03332 LMO_FMN L-Lactate 2-mo 27.5 29 0.00074 12.3 2.9 18 200-217 107-124 (383) 445 cd00080 HhH2_motif Helix-hairp 27.5 29 0.00074 12.3 2.3 30 839-873 24-53 (75) 446 pfam01026 TatD_DNase TatD rela 27.5 29 0.00074 12.3 5.5 57 15-74 4-62 (255) 447 pfam02638 DUF187 Uncharacteriz 27.4 29 0.00074 12.3 1.7 30 196-225 111-140 (394) 448 COG0313 Predicted methyltransf 27.4 29 0.00074 12.3 5.3 100 346-466 81-180 (275) 449 PRK10434 srlR DNA-bindng trans 27.4 29 0.00074 12.3 2.3 35 423-473 3-37 (256) 450 cd05170 PIKKc_SMG1 Suppressor 27.3 29 0.00074 12.3 1.1 27 377-405 198-224 (307) 451 TIGR00191 thrB homoserine kina 27.3 29 0.00074 12.3 1.3 28 256-283 226-256 (359) 452 PRK00941 acetyl-CoA decarbonyl 27.2 29 0.00075 12.2 1.8 66 356-435 246-322 (779) 453 pfam09128 RGS-like Regulator o 27.1 29 0.00075 12.2 3.4 21 158-178 42-62 (188) 454 TIGR01265 tyr_nico_aTase tyros 27.1 29 0.00075 12.2 2.9 22 192-213 201-222 (424) 455 PRK08928 consensus 27.1 29 0.00075 12.2 3.7 26 940-965 760-785 (861) 456 pfam03811 Ins_element1 Inserti 27.1 29 0.00075 12.2 2.0 17 457-473 65-81 (88) 457 PRK06887 consensus 27.0 29 0.00075 12.2 1.8 38 938-975 850-887 (954) 458 TIGR03533 L3_gln_methyl protei 26.8 30 0.00076 12.2 2.5 76 357-449 1-84 (284) 459 TIGR00853 pts-lac PTS system, 26.8 30 0.00076 12.2 4.2 26 197-222 107-132 (142) 460 cd05175 PI3Kc_IA_alpha Phospho 26.8 30 0.00075 12.2 1.1 87 520-624 54-153 (366) 461 KOG0781 consensus 26.7 30 0.00076 12.2 4.5 110 765-894 391-500 (587) 462 KOG0638 consensus 26.7 30 0.00076 12.2 5.7 14 95-108 17-30 (381) 463 COG0329 DapA Dihydrodipicolina 26.7 30 0.00076 12.2 11.2 90 140-238 38-129 (299) 464 COG3088 CcmH Uncharacterized p 26.6 30 0.00076 12.2 1.2 12 199-210 37-48 (153) 465 cd01123 Rad51_DMC1_radA Rad51_ 26.6 30 0.00076 12.2 3.1 23 197-219 149-171 (235) 466 TIGR02865 spore_II_E stage II 26.5 27 0.0007 12.5 0.9 415 285-767 136-646 (794) 467 PRK10425 DNase TatD; Provision 26.5 30 0.00077 12.2 5.6 57 15-74 5-62 (258) 468 TIGR03674 fen_arch flap struct 26.4 30 0.00077 12.2 3.7 51 400-461 273-323 (338) 469 cd00288 Pyruvate_Kinase Pyruva 26.3 30 0.00077 12.1 14.2 39 30-68 17-60 (480) 470 PRK00257 erythronate-4-phospha 26.3 30 0.00077 12.1 2.1 27 365-391 75-101 (379) 471 PRK07084 fructose-bisphosphate 26.3 30 0.00077 12.1 6.3 49 25-74 8-56 (321) 472 COG4008 Predicted metal-bindin 26.2 22 0.00057 13.3 0.4 35 366-403 52-86 (153) 473 PRK07562 ribonucleotide-diphos 26.2 30 0.00077 12.1 2.2 14 725-738 868-881 (1177) 474 pfam08901 DUF1847 Protein of u 26.2 30 0.00077 12.1 2.6 22 27-48 41-62 (157) 475 cd01313 Met_dep_hydrolase_E Me 26.1 30 0.00078 12.1 2.1 62 9-70 42-108 (418) 476 KOG2214 consensus 26.1 31 0.00078 12.1 1.1 34 377-412 206-239 (543) 477 cd04736 MDH_FMN Mandelate dehy 26.0 31 0.00078 12.1 2.9 22 199-220 85-106 (361) 478 PRK06450 threonine synthase; V 25.9 31 0.00078 12.1 5.2 38 32-71 86-123 (336) 479 cd07311 terB_like_1 tellurium 25.9 31 0.00078 12.1 3.2 23 453-475 108-130 (150) 480 cd06564 GH20_DspB_LnbB-like Gl 25.9 25 0.00065 12.8 0.6 73 746-818 16-100 (326) 481 pfam04076 BOF Bacterial OB fol 25.9 31 0.00078 12.1 5.4 45 979-1031 44-88 (126) 482 TIGR00707 argD acetylornithine 25.7 29 0.00073 12.3 0.8 60 577-636 137-212 (402) 483 cd05169 PIKKc_TOR TOR (Target 25.6 27 0.00068 12.6 0.7 14 381-394 174-187 (280) 484 PRK02101 hypothetical protein; 25.6 31 0.00079 12.0 6.4 64 684-774 20-83 (257) 485 TIGR00542 hxl6Piso_put hexulos 25.6 31 0.00079 12.0 2.5 30 195-224 94-125 (290) 486 COG2055 Malate/L-lactate dehyd 25.6 31 0.00079 12.0 3.9 63 8-70 76-138 (349) 487 TIGR01803 CM-like chorismate m 25.6 31 0.00079 12.0 3.9 32 722-753 43-74 (82) 488 TIGR02275 DHB_AMP_lig 2,3-dihy 25.5 26 0.00067 12.7 0.6 48 118-174 408-455 (534) 489 pfam09420 Nop16 Ribosome bioge 25.4 27 0.00069 12.5 0.7 22 424-445 127-148 (173) 490 PTZ00075 S-adenosyl-L-homocyst 25.3 16 0.00042 14.4 -0.5 23 377-399 266-289 (476) 491 cd01303 GDEase Guanine deamina 25.3 31 0.0008 12.0 6.8 64 9-72 65-156 (429) 492 TIGR01063 gyrA DNA gyrase, A s 25.2 31 0.0008 12.0 1.0 56 610-693 158-220 (864) 493 PRK05826 pyruvate kinase; Prov 25.2 31 0.0008 12.0 15.4 174 30-299 176-354 (461) 494 pfam08667 BetR BetR domain. Th 25.2 26 0.00067 12.7 0.6 42 441-482 8-50 (146) 495 cd05177 PI3Kc_C2_gamma Phospho 25.2 32 0.0008 12.0 1.1 14 381-394 196-209 (354) 496 cd07204 Pat_PNPLA_like Patatin 25.1 32 0.00081 12.0 1.5 33 379-412 37-69 (243) 497 PRK05377 fructose-1,6-bisphosp 25.1 32 0.00081 12.0 2.4 98 51-163 156-255 (296) 498 cd06449 ACCD Aminocyclopropane 25.1 32 0.00081 12.0 1.9 111 351-470 152-274 (307) 499 PRK08517 DNA polymerase III su 25.1 32 0.00081 12.0 2.7 115 254-369 130-249 (259) 500 cd00740 MeTr MeTr subgroup of 25.0 32 0.00081 12.0 3.4 49 21-69 100-158 (252) No 1 >PRK09074 consensus Probab=100.00 E-value=0 Score=2570.60 Aligned_cols=1024 Identities=59% Similarity=0.965 Sum_probs=982.6 Q ss_pred CCCCCCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCC Q ss_conf 66898813343314447644329989999999978998899811651774999999999769963899999985476643 Q 537021.9.peg.7 5 LEKSPSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLK 84 (1033) Q Consensus 5 ~~~~~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~ 84 (1033) |-.+|+||||||||+||+|+|+++|++||++|+++||+||||||||||||+++|+++|+++|||||+|||+++..++... T Consensus 1 ~m~~p~FvhLhvhS~YSlLdg~~~i~eLv~~A~e~G~~AlALTDhgnl~GaveFy~aakk~GIKPIIG~El~v~~~~~~~ 80 (1149) T PRK09074 1 MMAHPGFVHLRVHSAYSLLEGAIKLKKIIGLAAADGMPAIALTDTNNLFGALEFSQKASGAGIQPIIGCQLAVDFGDADP 80 (1149) T ss_pred CCCCCCCHHHCCCCCCCHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCC T ss_conf 99999945441576271304075999999999985999899984770525999999999859985876489983687665 Q ss_pred CCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCH Q ss_conf 34456566667753999956968999999999899732487787400789998634898899819767967888877997 Q 537021.9.peg.7 85 MTNQGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSS 164 (1033) Q Consensus 85 ~~~~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~ 164 (1033) .........+|++|||||++||+|||+|+|.+|+++..+.+|++++++|.+ |++||||+|||..|++++++..++. T Consensus 81 ---~~~~~~~~~~~LvLLAkN~~GY~NL~kL~S~a~l~g~~~~~Pri~~e~L~~-~~eGLI~Lsg~~~g~v~~~l~~g~~ 156 (1149) T PRK09074 81 ---GERNAAADPAPIVLLAANEAGYANLMRLVSRAYLDTPDGQPPHITLDWLEA-HAEGLIALTGGPDGPIGRLLAAGQR 156 (1149) T ss_pred ---CCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH-CCCCEEEEECCCCCHHHHHHHCCCH T ss_conf ---422356788739999799899999999999999705578888468999984-6899799938966769999876998 Q ss_pred HHHHHHHHHHHHHCCCCEEEEEECCCHHHH-HHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 899999999987659868999943991158-9999999999981898998088300787889999876688469857653 Q 537021.9.peg.7 165 QVAEKRLLTFKKLFGDRLYVNLQRHRGYDR-YRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQK 243 (1033) Q Consensus 165 ~~a~~~l~~l~~~Fgd~~ylEl~~~~~~e~-~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~ 243 (1033) ++|++.+.+|+++||+|||+|+|+|+..++ .++..++++|+++++|+||||||||++++|+++|++|+||++|++++++ T Consensus 157 ~~A~~~l~~l~~iFgd~~YlElq~hg~~~e~~~~~~Li~LA~~~giPlVATNdvhYl~~eD~~ahdvL~cI~~g~~L~d~ 236 (1149) T PRK09074 157 AEAEARLARLAEAFGDRLYVELQRHGLEYERRVEPGLIDLAYAMDLPLVATNEAFFPKPEDYEAHDALLCIAEGAYVSED 236 (1149) T ss_pred HHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCC T ss_conf 99999999999876884899994289858999999999999981999898478640687578999999999769903751 Q ss_pred CCCCCCCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 10024534323434788751220358998788641110001332210234555756887768988999876541034201 Q 537021.9.peg.7 244 DRPRVTPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRL 323 (1033) Q Consensus 244 ~~~~~~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~ 323 (1033) ++.++++++||||++||.++|+++|+|++||.+||++|++.+++.+++||+||+|++.+...++.++|+++|++|+++|| T Consensus 237 ~r~~~~~e~yLKS~eEM~~lF~d~PEAl~NT~~IA~rC~~~l~~~~~~lP~fp~~~~~~~~~~~~~~L~~~~~eGl~~R~ 316 (1149) T PRK09074 237 DRRRLTPDHYFKSRAEMAALFADLPEALENTVEIARRCAFRAKTRKPILPRFTGGSDDAAGAAEAEELRRQAEEGLAKRL 316 (1149) T ss_pred CCCCCCHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 13458976538999999999753199999999999750010135887799999987898777989999999999999987 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCH Q ss_conf 24643245783899999887789898606305578889999985104764334444204667765544037563332310 Q 537021.9.peg.7 324 AQGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLL 403 (1033) Q Consensus 324 ~~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~Ll 403 (1033) ++...+.+...++|++||+|||+||++|||++|||||||+|+|||++||+||||||||||||||||||||+||||+|||| T Consensus 317 ~~~~~~~~~~~~~y~~RL~~EL~vI~~mgf~~YFLIV~D~i~~Ak~~gI~vGpGRGSaAGSLVaY~LgIT~vDPi~y~Ll 396 (1149) T PRK09074 317 AVIGLAEGYTEEDYRERLDFELGIITRMGFPGYFLIVADFIKWAKAHGIPVGPGRGSGAGSLVAYALTITDLDPLRFSLL 396 (1149) T ss_pred HHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCCCHHCCCC T ss_conf 54489866777999999999999998779976760999999999963886478877278899999827876580204741 Q ss_pred HHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCC Q ss_conf 56762100145787533365687899999998303802155422077877889999999752799889886543101458 Q 537021.9.peg.7 404 FERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDP 483 (1033) Q Consensus 404 FERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~ 483 (1033) |||||||+|++||||||||+++||++||+||++|||.++||||+||+||++|+||||||||||+|+.++|++||+||..+ T Consensus 397 FERFLNpeR~smPDIDiDF~~~rR~eVi~Yv~~kYG~~~VAqI~Tfgt~~ak~aiRDvgRvlg~~~~~vd~iak~ip~~~ 476 (1149) T PRK09074 397 FERFLNPERVSMPDFDIDFCQDRREEVIRYVQEKYGRDQVAQIITFGTLQARAVLRDVGRVLQMPYGQVDRLCKLVPNNP 476 (1149) T ss_pred HHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCC T ss_conf 88761444567899984436130899999999985846545420187799999999999985899899999985277578 Q ss_pred CCCCCH-HHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEECCCCCCCCCEEEEEECHH Q ss_conf 982443-4452298878864307420268898720112210024474403352566666200123676886688510213 Q 537021.9.peg.7 484 AHPVSL-QAIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLPVTQFNMKW 562 (1033) Q Consensus 484 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~~~ 562 (1033) +...++ ++++++++|+++..++|..++++++|.+|||+|||+|+||||||||++||+++||++++++++.++|||||++ T Consensus 477 ~~~~tl~~a~~~~~~l~~~~~~~~~~~~l~~~A~~leGl~R~~s~HAaGvVIs~~pL~~~vPl~~~~~~~~~vtQ~d~~~ 556 (1149) T PRK09074 477 ANPVSLAQAIEDEPRLQEERDKDPVVARLLDIALKLEGLYRHASTHAAGIVIGDRPLSELVPLYRDPRSDMPVTQFNMKW 556 (1149) T ss_pred CCCCCHHHHHHCCHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCEEEECCCCCCCCCCEEECCCCCCEEEEEECHH T ss_conf 88765898873486788776448899999999998634623224467616733766132155688788886268876514 Q ss_pred HHHHCCCHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCCCHHHHHHHHHHCCCHHHH Q ss_conf 34311232135654178999999998874025434332344358899876201410122114534788887717404888 Q 537021.9.peg.7 563 IEKAGLIKFDFLGLKTLTLLQKSLDFLAQRGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLESSGMRQALEGMQPDCIED 642 (1033) Q Consensus 563 ~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l~~~~p~~~~d 642 (1033) ||++||||||||||||||+|++++++|+++|.++|+.+||+||++||+||++|+|.||||+||+|||++|++++|++|+| T Consensus 557 vE~~GLlK~DfLGLr~LtvI~~a~~~I~~~g~~idl~~ipldD~~t~~ll~~g~T~GvFQ~Es~g~~~~l~~~~P~~f~d 636 (1149) T PRK09074 557 VEQAGLVKFDFLGLKTLTVLQRAVKLLAKRGIEIDLSTIPLDDAKTYEMLARGETVGVFQVESAGMRDALRGMKPDRIED 636 (1149) T ss_pred HHHCCCHHHHHHCCCCHHHHHHHHHHHHHHCCCCCHHHCCCCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHH T ss_conf 67628623566526715899999999998599888775798679999999576858845235779999998539998999 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCC Q ss_conf 87665411411342003688850465785026966733410335530012568988788632684130788976201211 Q 537021.9.peg.7 643 IIALVSLYRPGPIDNIVVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEADVLRRAMGKKIK 722 (1033) Q Consensus 643 l~~~~al~RPGp~~~~~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ead~lRra~~kk~~ 722 (1033) |||++||||||||++++.|++||||+++|+|+||.|++||++|||||||||||||||+.+||||+||||.||||||||+. T Consensus 637 lva~~aL~RPGp~~~i~~yi~rk~g~~~~~y~hp~l~~iL~~TyGvivyQEQvM~ia~~~AgfslgeAD~lRram~KK~~ 716 (1149) T PRK09074 637 IIALVALYRPGPMENIPTYCARKHGEEEIESLHPKIDHILKETQGVIVYQEQVMQIAQVMAGYSLGEADLLRRAMGKKIR 716 (1149) T ss_pred HHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCEEEEHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH T ss_conf 99999744787401367899975699670268777998862358768769999999999855882467999987512017 Q ss_pred CHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHH Q ss_conf 01677878999998624999999999999999750001331014788899999999997302666777510023542025 Q 537021.9.peg.7 723 EEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVEKI 802 (1033) Q Consensus 723 ~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~~~ 802 (1033) ++|++.+++|++||.+||++++.|++||++|++||+||||||||+|||++||||||||+|||+|||||+||+++++++|+ T Consensus 717 ~~m~~~~~~F~~ga~~~g~~~~~a~~i~~~i~~Fa~YgFnksHa~aYa~~ayqtAylK~hyP~ef~aall~~~~~~~~k~ 796 (1149) T PRK09074 717 EEMDKQRARFVEGAVERGVDKAQADEIFDLLAKFADYGFNKSHAAAYALVSYQTAYMKANYPVEFLAASMTLDMSNTDKL 796 (1149) T ss_pred HHHHHHHHHHHCCHHHCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCHHH T ss_conf 79999999996310213999999999999999873646674057899999999999986284999986433376755589 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHH Q ss_conf 89998888636850355233421003531986985026858969899999999983799999789997348247998999 Q 537021.9.peg.7 803 KKFCQDARQFNIQIMPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVL 882 (1033) Q Consensus 803 ~~~i~e~~~~gi~vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~l 882 (1033) ..|++||+++||+|+|||||+|.++|++++++||+||++|||||+++++.|+++|++|||+|++||+.|++.+.+||+++ T Consensus 797 ~~~~~e~~~~gi~vlppdin~S~~~f~~~~~~Ir~gl~~ikgvg~~~~~~Iv~~R~~g~f~s~~Df~~Rv~~~~~nk~~l 876 (1149) T PRK09074 797 ADFRQEARRLGIEVVPPSVNRSGATFEVGDGAIYYALAALKGVGEAAMDHIVAARGDKPFADLADFAARVDPKIVNKRVL 876 (1149) T ss_pred HHHHHHHHHCCCCEECCEEECCCCCEEEECCEEEECCHHHCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCCHHHH T ss_conf 99999999879816486440347833772887887433326688899999997544699999999998715034899999 Q ss_pred HHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCEE Q ss_conf 98886767234790389999989999999998865431277786666555565577666899998889998578872621 Q 537021.9.peg.7 883 ESLVFAGALDCFGYSRMQLLQSLDNIQKYAQWVEKNRTNKHENIFAHEKGTFSDKITLEKFSVENSSVRFENEQRVLGFY 962 (1033) Q Consensus 883 e~Li~aGAfD~~~~~R~~ll~~i~~~~~~~~~~~~~~~~~q~~LF~~~~~~~~~~~~~~~~~~~~~~e~l~~E~e~LGf~ 962 (1033) |+||+|||||+||.||++++++++.++.++....+++..+|.+||+..... ...+.++..++|+..|++.+|+|+|||| T Consensus 877 e~Li~aGAfD~~~~~R~~l~~~~~~~~~~~~~~~~~~~~gq~~LF~~~~~~-~~~~~~~~~~~~~~~e~l~~E~e~lG~y 955 (1149) T PRK09074 877 ESLARAGAFDCLDPNRARVFAGADALLAYAQRAHENAASGQSDLFGGAGAD-AEPLRLPQTEPWLPAERLHREFQAIGFY 955 (1149) T ss_pred HHHHHCCCHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC-CCCCCCCCCCCCCHHHHHHHHHHHHCCC T ss_conf 999974873220727999999599999999998798756886655777666-5678888999999789999999975717 Q ss_pred ECCCCHHHHHHHHHHCCCCCHHHHHCCC--CCCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 1638558999999874897688985055--8988999999961121015888769999998497049999539 Q 537021.9.peg.7 963 FSGHPLDVYKPILRKMGIKGYEESVATI--RGNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 963 lS~HPl~~~~~~l~~~~~~~~~~~~~~~--~g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iFP 1033 (1033) +|+|||+.|+..++..+++++.++.+.. .++.|+++|+|+++|+++||+|++|+|+|+||+||.+|+++|| T Consensus 956 ls~HPl~~~~~~l~~~~~~~~~~l~~~~~~~~~~v~vaG~v~~~~~~~tk~G~~maf~tleD~tg~~e~vvfp 1028 (1149) T PRK09074 956 LSAHPLDDYATALKRLRVQTWAEFSAAVKRGATAGRLAGTVVSRQERKTKTGNKMGIVQLSDPTGQFEAVLFS 1028 (1149) T ss_pred CCCCCHHHHHHHHHHCCCCCHHHHHHHHHCCCEEEEEEEEEEEEEECCCCCCCEEEEEEEEECCCCEEEEECH T ss_conf 6638088999999981995699987564269849999999999885358999978999999799858999988 No 2 >PRK05673 dnaE DNA polymerase III subunit alpha; Validated Probab=100.00 E-value=0 Score=2557.46 Aligned_cols=1014 Identities=50% Similarity=0.822 Sum_probs=973.1 Q ss_pred CCCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCC Q ss_conf 98813343314447644329989999999978998899811651774999999999769963899999985476643344 Q 537021.9.peg.7 8 SPSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTN 87 (1033) Q Consensus 8 ~~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~ 87 (1033) .|+||||||||+||+|+|+++|++||++|+++||+||||||||||||+++|+++|+++|||||+|||+||.+++..... T Consensus 1 ~~~FvhLhvhS~ySlLdg~~~~~eLv~~A~~~G~~AlAiTD~~nl~Gav~F~~aak~~gIKPIiG~e~~v~~~~~~~~~- 79 (1143) T PRK05673 1 DPRFVHLHVHSEYSLLDGAAKIKPLVKKAAELGMPAVALTDHGNLFGAVEFYKAAKGAGIKPIIGCEAYVAPEDKDDVS- 79 (1143) T ss_pred CCCCCEECCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCC- T ss_conf 9982100337637120316699999999998599989997277050599999999986998378889997268533455- Q ss_pred CCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHH Q ss_conf 56566667753999956968999999999899732487787400789998634898899819767967888877997899 Q 537021.9.peg.7 88 QGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVA 167 (1033) Q Consensus 88 ~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a 167 (1033) .....+|++|||||++||+|||+|+|.+++++..+++|++++++|.+ |++||||+|||..|++++++..++.++| T Consensus 80 ----~~~~~~hLvlLAkN~~GY~NL~kL~S~a~~~g~~~~~Prid~e~L~~-~~eGLI~lsgc~~g~i~~~l~~~~~~~A 154 (1143) T PRK05673 80 ----GGGAYTHLTLLAKNETGYRNLMKLSSRAYLEGQRYYRPRIDREWLAE-HSEGLIALSGCPSGEVGTALLAGNYEEA 154 (1143) T ss_pred ----CCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHH-HCCCEEEECCCCCCHHHHHHHCCCHHHH T ss_conf ----67877508999589999999999999999706578888358999987-0699699618865669999866999999 Q ss_pred HHHHHHHHHHCCCCEEEEEECCCHHH-HHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 99999998765986899994399115-89999999999981898998088300787889999876688469857653100 Q 537021.9.peg.7 168 EKRLLTFKKLFGDRLYVNLQRHRGYD-RYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKDRP 246 (1033) Q Consensus 168 ~~~l~~l~~~Fgd~~ylEl~~~~~~e-~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~~~ 246 (1033) ++.+.+|+++||+|||+|+|+|+..+ +.++..++++|+++++|+||||||||++++|+.+|++|.||++|++++++++. T Consensus 155 ~~~~~~l~~iFgd~fylElq~hg~~~~~~~~~~L~~lA~~~~iplVATndvhYl~~eD~~~hdvL~aI~~g~~l~d~~r~ 234 (1143) T PRK05673 155 EEAAAFYQEIFGDRFYLELQRHGLPDEERVEHALLELAKELGLPLVATNDVHFLTPEDFEAHEALLCIAEGKTLDDPKRF 234 (1143) T ss_pred HHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCHH T ss_conf 99999999876986899984489827999999999999982999899688651786368999999999769900436200 Q ss_pred CC-CCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 24-53432343478875122035899878864111000133221023455575688776898899987654103420124 Q 537021.9.peg.7 247 RV-TPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRLAQ 325 (1033) Q Consensus 247 ~~-~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~~ 325 (1033) +. ++++||||++||.++|+++|+|++||.+||++|++++++.+++||+|++|++ .++++||+++|++|+++||+. T Consensus 235 ~~~~~e~yLKS~eEM~~lF~d~PeAl~NT~~IAe~C~~~l~~~~~~lP~f~~p~g----~~~~~~L~~l~~~Gl~~R~~~ 310 (1143) T PRK05673 235 KFYSPEQYLKSPEEMAELFADLPEALENTVEIAERCNVTVRLGKPFLPNFPTPEG----MTEEDYLRKEAKEGLEERLAF 310 (1143) T ss_pred CCCCHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCEEEECCCCCCCCCCCCCCC----CCHHHHHHHHHHHHHHHHHHH T ss_conf 0377221689999999997536999999999984403640557777998889999----999999999999999998530 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHH Q ss_conf 64324578389999988778989860630557888999998510476433444420466776554403756333231056 Q 537021.9.peg.7 326 GAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLFE 405 (1033) Q Consensus 326 ~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFE 405 (1033) +.. ....++|++||+|||+||++|||++|||||||+|+|||++||+||||||||||||||||||||+||||+|||||| T Consensus 311 ~~~--~~~~~~y~~RL~~EL~vI~~mgf~~YFLIV~D~v~~Ar~~gI~vGpGRGSaAgSLVaY~LgIT~VDPi~~~LlFE 388 (1143) T PRK05673 311 LFP--ADEREEYRERLEYELDVIIQMGFPGYFLIVADFIKWAKDNGIPVGPGRGSGAGSLVAYALGITDLDPLRYDLLFE 388 (1143) T ss_pred CCC--CCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCHH T ss_conf 588--411199999999999999868984658999999999995388757887753889999983787658211665486 Q ss_pred HHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCC Q ss_conf 76210014578753336568789999999830380215542207787788999999975279988988654310145898 Q 537021.9.peg.7 406 RFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDPAH 485 (1033) Q Consensus 406 RFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~~~ 485 (1033) |||||+|++||||||||+++||++||+||++|||+++||||+||+||++|+||||||||||+|+.++|++||+||..++. T Consensus 389 RFLnpeR~smPDIDiDf~~~rR~eVi~Yv~~kYG~~~VAqI~Tf~t~~ak~AiRDv~Rvlg~~~~~~d~iak~ip~~~~~ 468 (1143) T PRK05673 389 RFLNPERVSMPDFDIDFCQDRRDEVIDYVAEKYGRDAVAQIITFGTMKAKAVIRDVGRVLGMPYGFVDRITKLIPPDPGK 468 (1143) T ss_pred HHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCC T ss_conf 61571014788766554223189999999998564757888546677787899999998689999999998527766789 Q ss_pred CCCHHHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEECCCCCCCCCEEEEEECHHHHH Q ss_conf 24434452298878864307420268898720112210024474403352566666200123676886688510213343 Q 537021.9.peg.7 486 PVSLQAIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLPVTQFNMKWIEK 565 (1033) Q Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~~~~e~ 565 (1033) ....++++++++|+++..++|.+++++++|++|||+|||+|+||||||||++||++++|++++++++.++|||||++||+ T Consensus 469 ~tl~~a~~~~~~l~~~~~~~~~~~~l~~~a~~leG~~r~~~~Ha~GvVis~~~l~~~~Pl~~~~~~~~~~tq~d~~~~e~ 548 (1143) T PRK05673 469 ITLAKAYEEEPELRELYESDPEVKRLIDMARKLEGLTRNAGVHAAGVVISPTPLTDFVPLYKDPKSGMPVTQFDMKDVEA 548 (1143) T ss_pred CCHHHHHHHCHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCEEEECCCCCCCCCCEEECCCCCCEEEEECHHHHHH T ss_conf 88999987499999888549889999999999848854335678838951776222067677588995578721898997 Q ss_pred HCCCHHHHHHHCCHHHHHHHHHHHHH-HCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCCCHHHHHHHHHHCCCHHHHHH Q ss_conf 11232135654178999999998874-02543433234435889987620141012211453478888771740488887 Q 537021.9.peg.7 566 AGLIKFDFLGLKTLTLLQKSLDFLAQ-RGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLESSGMRQALEGMQPDCIEDII 644 (1033) Q Consensus 566 ~gl~K~D~Lgl~~L~~i~~~~~~i~~-~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l~~~~p~~~~dl~ 644 (1033) +||||||||||+|||+|++++++|++ +|.++|+.+||+||++||+||++|+|.||||+||+|||++|++++|++|+||| T Consensus 549 ~GllK~D~Lgl~~Lt~i~~~~~~i~~~~~~~~d~~~ipldD~~t~~ll~~g~T~GvFQ~ES~g~~~~l~~l~P~~f~dlv 628 (1143) T PRK05673 549 AGLVKFDFLGLRTLTIIDWALKLIKKRGGIDVDLAAIPLDDKKTYELLQRGETLGVFQLESRGMRDLLKRLKPDCFEDII 628 (1143) T ss_pred CCCCCCHHHCCCCHHHHHHHHHHHHHHCCCCCCHHHCCCCHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHH T ss_conf 48841003234607899999999998349988865578873999999845785773113576799989755999899978 Q ss_pred HHHHHCCCHHHHH--HHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCC Q ss_conf 6654114113420--03688850465785026966733410335530012568988788632684130788976201211 Q 537021.9.peg.7 645 ALVSLYRPGPIDN--IVVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEADVLRRAMGKKIK 722 (1033) Q Consensus 645 ~~~al~RPGp~~~--~~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ead~lRra~~kk~~ 722 (1033) |++||||||||++ ++.|++||||+|+++|+||.|++||++|||||||||||||||+.+||||+||||.||||||||+. T Consensus 629 a~~aL~RPGP~~~~~i~~yi~rk~g~e~i~y~hp~l~~iL~~TyGv~vyQEQvM~ia~~~agfslgeAD~lRram~KK~~ 708 (1143) T PRK05673 629 ALVALYRPGPMESGMIPDFIDRKHGREEIEYPHPELEPILKETYGIIVYQEQVMQIAQVLAGYSLGGADLLRRAMGKKKP 708 (1143) T ss_pred HHHHHCCCCCHHHHCCCHHHHHHCCCCCCCCCCHHHHHHHCCCCCEEEEHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCH T ss_conf 88741067742530371789964799763068778998862237778739999999999737987888999987611356 Q ss_pred CHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHH Q ss_conf 01677878999998624999999999999999750001331014788899999999997302666777510023542025 Q 537021.9.peg.7 723 EEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVEKI 802 (1033) Q Consensus 723 ~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~~~ 802 (1033) +.|++++++|++||.+||+++++|++||++|++||+||||||||+|||++||||||||+|||+|||||+||+++++++|+ T Consensus 709 ~~m~~~~~~F~~ga~~~G~~~~~a~~i~~~i~~Fa~YgFnkSHa~aYa~iayqtAyLKahyP~eF~aalL~~~~~~~~k~ 788 (1143) T PRK05673 709 EEMAKQREIFVEGAKKNGIDKELADKIFDLLEKFAGYGFNKSHAAAYALVSYQTAYLKAHYPAEFMAALLTSDMDNTDKV 788 (1143) T ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCHHH T ss_conf 78999999999989876999999999999999771435574235899999999999856586999999974077884679 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHH Q ss_conf 89998888636850355233421003531986985026858969899999999983799999789997348247998999 Q 537021.9.peg.7 803 KKFCQDARQFNIQIMPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVL 882 (1033) Q Consensus 803 ~~~i~e~~~~gi~vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~l 882 (1033) +.|++|||++||+|+|||||+|.++|++++++||+||++|||||++++++|+++|++|||+|+.||+.|++.+.+||+++ T Consensus 789 ~~~i~ea~~~gi~vlppdin~S~~~f~~~~~~Ir~gL~~Ikgvg~~~~~~Iv~~R~~g~f~s~~Df~~Rv~~~~~nk~~l 868 (1143) T PRK05673 789 AVYLDECRRMGIKVLPPDVNESLYRFTVVGGDIRYGLGAIKGVGEGAVEAIVEAREKGPFKDLFDFCARVDLKKVNKRVL 868 (1143) T ss_pred HHHHHHHHHCCCCEECCEECCCCCCCEECCCEEEEHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCCCHHHH T ss_conf 99999999879917587431356873653998987167506787999999999863499999899999823112699999 Q ss_pred HHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCEE Q ss_conf 98886767234790389999989999999998865431277786666555565577666899998889998578872621 Q 537021.9.peg.7 883 ESLVFAGALDCFGYSRMQLLQSLDNIQKYAQWVEKNRTNKHENIFAHEKGTFSDKITLEKFSVENSSVRFENEQRVLGFY 962 (1033) Q Consensus 883 e~Li~aGAfD~~~~~R~~ll~~i~~~~~~~~~~~~~~~~~q~~LF~~~~~~~~~~~~~~~~~~~~~~e~l~~E~e~LGf~ 962 (1033) |+||+|||||+||.||++++++++.++.++...++++..+|.+||+.........+.++.+++|+..|++++|+|+|||| T Consensus 869 e~Li~aGAfD~~~~~R~~ll~~~~~~~~~~~~~~~~~~~gq~~LF~~~~~~~~~~~~~~~~~e~~~~e~l~~E~e~lG~y 948 (1143) T PRK05673 869 ESLIKAGAFDSLGPNRAALLASLEDAIDAADQHAKAEASGQFDLFGVEPEAEDVEQSLPDVPEWPDKERLAGERETLGLY 948 (1143) T ss_pred HHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCC T ss_conf 99997798200160199999969999999999757775487222445555555567778999999778999999984707 Q ss_pred ECCCCHHHHHHHHHHCCCCCHHHHHCCCCCC-EEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 1638558999999874897688985055898-8999999961121015888769999998497049999539 Q 537021.9.peg.7 963 FSGHPLDVYKPILRKMGIKGYEESVATIRGN-NIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 963 lS~HPl~~~~~~l~~~~~~~~~~~~~~~~g~-~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iFP 1033 (1033) +|+|||+.|++.++..+.+++.++.+...+. .|+|+|+|+++|++.||||++|+|+|+||++|.+|+++|| T Consensus 949 lS~HPl~~~~~~l~~~~~~~~~~l~~~~~~~~~v~vaG~v~~~~~~~TkkG~~maf~tleD~tg~~e~vvfp 1020 (1143) T PRK05673 949 LSGHPLDGYEDELRRLVVTRLADLEPAERGKVAVTVAGLIVSVRVRVTKRGNKMAIVTLEDRSGRIEVMLFS 1020 (1143) T ss_pred CCCCHHHHHHHHHHHHCCCCHHHHHCCCCCCEEEEEEEEEEEEEECCCCCCCEEEEEEEEECCCCEEEEECH T ss_conf 663818899999998578669998505489849999999989785117998989999999899848999979 No 3 >PRK08512 consensus Probab=100.00 E-value=0 Score=2537.11 Aligned_cols=1014 Identities=39% Similarity=0.662 Sum_probs=957.5 Q ss_pred CCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCC Q ss_conf 88133433144476443299899999999789988998116517749999999997699638999999854766433445 Q 537021.9.peg.7 9 PSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQ 88 (1033) Q Consensus 9 ~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~ 88 (1033) ++||||||||+||+|||+++|++||++|+++||+||||||||||||+++|+++|+++|||||+|||+|+........ T Consensus 2 ~~FvhLhvhS~YSlLdga~~i~~Lv~~AkelG~~AlAlTDhgnm~Gav~F~~a~kk~gIKPIiG~E~yv~~~~~~~~--- 78 (1185) T PRK08512 2 SQFTHLHLHTEYSLLDGANKIKELAKRLKELGMTSVAMTDHGNMFGAIDFYKTMKKEGIKPIIGMEAYIHNHDELDD--- 78 (1185) T ss_pred CCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCC--- T ss_conf 99740244774711050449999999999869998999608778589999999998699827753899836875566--- Q ss_pred CCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHC------- Q ss_conf 6566667753999956968999999999899732487787400789998634898899819767967888877------- Q 537021.9.peg.7 89 GKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFF------- 161 (1033) Q Consensus 89 ~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~------- 161 (1033) ......+|++|||||++||+|||+|+|.++.+ +++++||+++++|.+ |++|||++|||+.|+++..+.. T Consensus 79 --~~~~~~~hl~LLAkN~~GY~NL~kLsS~a~~~-g~y~~Prid~~~L~~-~~eGLI~~s~cl~Gev~~~L~~~~e~~~~ 154 (1185) T PRK08512 79 --KSSKQRFHLCLFAKNEIGYKNLMYLSSMSYIE-GFYYYPRINKKLLRE-HSEGLICSSACLQGEVNWHLNTLSERNVK 154 (1185) T ss_pred --CCCCCCCEEEEEECCHHHHHHHHHHHHHHHHC-CCCCCCCCCHHHHHH-HCCCCEEECCCCCCCHHHHHHCCCHHHHH T ss_conf --56566412999948999999999999999974-786788689999997-14998994355555156665122244454 Q ss_pred ---CCHHHHHHHHHHHHHHCCCCEEEEEECCCHHHH-HHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCC Q ss_conf ---997899999999987659868999943991158-9999999999981898998088300787889999876688469 Q 537021.9.peg.7 162 ---NSSQVAEKRLLTFKKLFGDRLYVNLQRHRGYDR-YRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHS 237 (1033) Q Consensus 162 ---~~~~~a~~~l~~l~~~Fgd~~ylEl~~~~~~e~-~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~ 237 (1033) +++++|++.+.+|+++||+|||+|+|+|+..++ .++..++++|+++++|+||||||||++++|+.+|++|.||++| T Consensus 155 ~~a~~~e~A~~~~~~~~~iFgd~fylEl~~hg~~~q~~~n~~l~~lA~~~~iplVATndvhY~~~eD~~~hdvL~cI~~g 234 (1185) T PRK08512 155 FGAKGYEEAKEVALEYKEIFGDDFYLEIMRHGIGDQRFIDDQIIRLSKETGIKIIATNDTHYTFKEDADAHEVFMCIAMG 234 (1185) T ss_pred HCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCHHHHHHHHHHHHCC T ss_conf 32455899999999999865886899984189807999999999999981999999788650773429999999999777 Q ss_pred CCCCCCCCCCC-CCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC--------------CCCCCC Q ss_conf 85765310024-534323434788751220358998788641110001332210234555--------------756887 Q 537021.9.peg.7 238 TVVSQKDRPRV-TPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRFI--------------EKECDD 302 (1033) Q Consensus 238 ~~l~~~~~~~~-~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~--------------~p~~~~ 302 (1033) +++++++++++ +.++||||++||.++|.++|+|++||.+||++|++++++..+.+|+|+ .|+... T Consensus 235 ~~l~d~~r~~~~~~e~ylks~eEM~~lf~d~peal~NT~~IAe~C~~~l~~~~~~lP~f~~~~~~~~~~~~~~p~~~~~~ 314 (1185) T PRK08512 235 KTLNDPNRLRHSVHEFYVKSPEEMAELFADIPEAIENTQEIADKCNLEIKLGNPTPPNFKFTREYAKEEGLTLPEPTKEY 314 (1185) T ss_pred CCCCCCCCCCCCCHHHCCCCHHHHHHHHHCHHHHHHHHHHHHHHCEEEECCCCCCCCCCCCCCCCCCHHCCCCCCCCCCC T ss_conf 93175863236853203699999999861469999999999984300403588669876654344411036778876544 Q ss_pred CCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHH Q ss_conf 76898899987654103420124643245783899999887789898606305578889999985104764334444204 Q 537021.9.peg.7 303 IQKEEENELRNKAVAGLEVRLAQGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGA 382 (1033) Q Consensus 303 ~~~~~~~~Lr~l~~~Gl~~R~~~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAa 382 (1033) ...+++.||+++|++|+++||+... ...+++|++||+|||+||++|||++|||||||+|+|||++||+||||||||| T Consensus 315 ~~~~~~~yL~~l~~~Gl~~R~~~~~---~~~~~~y~~RL~~EL~vI~~mgf~~YFLIV~D~i~~Ar~~gI~VGPGRGSAA 391 (1185) T PRK08512 315 SFENDAVYFEYKCKKGLEERLKFVP---EEKHEEYKDRLEVEIEIINNMKFPGYMLIVWDFINVAKDRGIPVGPGRGSAA 391 (1185) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHCC---CCCCHHHHHHHHHHHHHHHHCCCCEEHHHHHHHHHHHHHCCCEECCCCCCHH T ss_conf 5457889999999999999863488---6432999999999999998789950049999999999966841268877348 Q ss_pred HHHHHHHHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHH Q ss_conf 66776554403756333231056762100145787533365687899999998303802155422077877889999999 Q 537021.9.peg.7 383 GSVTAYALTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVG 462 (1033) Q Consensus 383 gSLVaY~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~ 462 (1033) |||||||||||+||||+|||||||||||+|++||||||||+++||++||+||++|||.++||||+||+||++|+|||||| T Consensus 392 GSLVaY~LgIT~vDPi~y~LLFERFLNpeR~smPDIDiDF~~~rR~eVi~Yv~~kYG~~~VAqI~Tfgt~~ak~AiRDv~ 471 (1185) T PRK08512 392 GSLVAYSLKITDIDPIPYNLLFERFLNPERVSMPDIDVDFCQRRRGEIIDYVIEKYGKYNVAQVITFGKLLAKGVIRDVA 471 (1185) T ss_pred HHHHHHHHCCCCCCCHHCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEECHHHHHHHHHHHHHHH T ss_conf 88999994680118011377689863855677888765543231999999999984615399730177799889999999 Q ss_pred HHCCCCHHHHHHHHHHCCCCCCCCCCHHHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHH Q ss_conf 75279988988654310145898244344522988788643074202688987201122100244744033525666662 Q 537021.9.peg.7 463 RALQMPYSQVDRLCKLIPNDPAHPVSLQAIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQL 542 (1033) Q Consensus 463 r~~g~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~ 542 (1033) ||||+|+.++|++||+||...+.++. ++++.++++++++..+|..++++++|++|||+|||+|+||||||||++||+++ T Consensus 472 Rvlg~~~~~~d~iaK~ip~~~~~tl~-~a~~~~~~l~~~~~~~~~~~~l~~~A~~LEGl~R~~s~HAaGvVIs~~pL~~~ 550 (1185) T PRK08512 472 RVLDMPYSEADAMAKLIPDELGITLD-GAYEKEPKIKELIESNPKAKQVWEYALALEGLNRNAGMHAAGVVISNEELWHK 550 (1185) T ss_pred HHHCCCHHHHHHHHHCCCCCCCCCHH-HHHHHCHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCEEEECCCCCCCC T ss_conf 98299989999986227633588889-99872805666664386664899999998377566565677268427762402 Q ss_pred EECCCCCCCCCEEEEEECHHHHHHCCCHHHHHHHCCHHHHHHHHHHHHH-HCCCCCCCCCCCCHHHHHHHHHCCCCCHHH Q ss_conf 0012367688668851021334311232135654178999999998874-025434332344358899876201410122 Q 537021.9.peg.7 543 VPMYRDVRSDLPVTQFNMKWIEKAGLIKFDFLGLKTLTLLQKSLDFLAQ-RGVKVDLSLIPFDDHETYHLLTTKGTLGIF 621 (1033) Q Consensus 543 ~p~~~~~~~~~~~~q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~-~~~~~d~~~ip~~D~~~~~l~~~g~t~GvF 621 (1033) +|+++.++++.++|||||++||++||||||||||||||+|++++++|++ +|.++|+++||+||++||+||++|+|.||| T Consensus 551 vPl~~~~~~~~~vtQ~d~~~vE~lGLlKmDfLGLr~LtvI~~a~~~I~~~~g~~id~~~ipldD~~t~~ll~~G~T~GvF 630 (1185) T PRK08512 551 TPLFRQSKNDELVTQYSKKYLEDVDLIKFDFLGLKTLTVIDNAIKLIKKRYNKDIDWETIDVNDPKVYKTIQSGNTLGIF 630 (1185) T ss_pred CCEEECCCCCCEEEEECHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHHCCCCCCHHHCCCCCHHHHHHHHCCCCCCCC T ss_conf 54234588995457632466987798157766888256899999999983299767654387639999999603469976 Q ss_pred HCCCHHHHHHHHHHCCCHHHHHHHHHHHCCCHHHHH--HHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHH Q ss_conf 114534788887717404888876654114113420--036888504657850269667334103355300125689887 Q 537021.9.peg.7 622 QLESSGMRQALEGMQPDCIEDIIALVSLYRPGPIDN--IVVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIA 699 (1033) Q Consensus 622 Q~es~~~~~~l~~~~p~~~~dl~~~~al~RPGp~~~--~~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia 699 (1033) |+||+||+++|++++|++|+||||++||||||||++ ++.||+||||+++++|+||.|++||++||||||||||||||| T Consensus 631 Q~ES~gm~~~l~~lkP~~feDiva~~ALyRPGPm~~~~i~~yi~RK~g~e~i~y~hp~Le~IL~~TyGv~vYQEQVMqia 710 (1185) T PRK08512 631 QIESGGMQSLNKRLKPSSFEDIIAVLALYRPGPMESGMLDDFIDRKHGLAKIEYAFDELEPILEPTYGVIVYQEQVMQIV 710 (1185) T ss_pred CCCCHHHHHHHHHCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCEEEEHHHHHHHH T ss_conf 24360899999976998798867875301577765564579998616998878888899998467787687799999999 Q ss_pred HHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH Q ss_conf 88632684130788976201211016778789999986249999999999999997500013310147888999999999 Q 537021.9.peg.7 700 QLLSGYSLSEADVLRRAMGKKIKEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWM 779 (1033) Q Consensus 700 ~~~ag~~~~ead~lRra~~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~ayl 779 (1033) +.+||||+||||.||||||||+.+.|++++++|++||.+||++++.|++||++|++||+||||||||+|||++||||||| T Consensus 711 ~~lAGfslgeAD~LRRAmgKK~~~~m~~~~~~Fi~Ga~~~G~~~~~a~~i~~~i~~FA~YgFNKSHaaAYa~iaYqtAyL 790 (1185) T PRK08512 711 QTIGGFSLGGADLVRRAMGKKDPEEMAKLKTEFADGAKKQGFDRAKAEDLFDLIVKFAGYGFNKSHSAAYAMITFQTAYL 790 (1185) T ss_pred HHHHCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH T ss_conf 99827997778999998531768789999999999998769999999999999999852266854027989999999998 Q ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECC----CEEEEEHHHHCCCCHHHHHHHHH Q ss_conf 97302666777510023542025899988886368503552334210035319----86985026858969899999999 Q 537021.9.peg.7 780 KTHYPVEFLAASMTLEMDNVEKIKKFCQDARQFNIQIMPPSVNTPCVDFKVGD----NRIYYSLAAIKGVGTTTARHIME 855 (1033) Q Consensus 780 K~~yP~eF~aa~L~~~~~~~~~~~~~i~e~~~~gi~vl~PdIN~S~~~f~~~~----~~Ir~Gl~~Ikgvg~~~~~~Ive 855 (1033) |+|||+|||||+||+++++++|++.|+.|||++||+|+|||||+|.++|++.+ ++|||||++|||||++++++|++ T Consensus 791 KahYP~eF~aAlL~s~~~~~~k~~~~i~ear~~GI~VlpPdIN~S~~~ftv~~~~~~~~IrfGL~aIkgvg~~~i~~Iv~ 870 (1185) T PRK08512 791 KTYYPAEFMAALLTSEENNIDKVAKYIDEVKRLGIKLLPPSINKSIREFSVAEFDGKDAIIFGLGAIKGVGGAAIENIIE 870 (1185) T ss_pred HHCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCEEECCCCCCCCCCCEEECCCCCCEEEECHHHHCCCCHHHHHHHHH T ss_conf 76789999999852688996269999999998799684882046667643543798753773047647898999999998 Q ss_pred HHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 98379999978999734824799899998886767234790389999989999999998865431277786666555565 Q 537021.9.peg.7 856 ASADKPFDSLEDFCSRVDSKKLNRRVLESLVFAGALDCFGYSRMQLLQSLDNIQKYAQWVEKNRTNKHENIFAHEKGTFS 935 (1033) Q Consensus 856 ~r~~g~f~sl~df~~rv~~~~~nk~~le~Li~aGAfD~~~~~R~~ll~~i~~~~~~~~~~~~~~~~~q~~LF~~~~~~~~ 935 (1033) +|++|||+|+.||+.|++.+.+||+++|+||+|||||+||.||++|+++++.+++.+...++.+..+|.+||++...... T Consensus 871 ~R~~Gpf~sl~Df~~Rv~~~~~nk~~~e~Li~aGafD~~g~~R~~l~~~~~~~~~~~~~~~~~~~~~q~~lf~~~~~~~~ 950 (1185) T PRK08512 871 AREDGEFKDLEDFLSRIDPSKVNKKVLESLIKAGAFDEFGYTRKALLDNIENIVDASKKAAEARKNAANSLFGEDEEMTD 950 (1185) T ss_pred HHCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC T ss_conf 53589999999999873211279999999997796112072399999989999999999899886376455688754445 Q ss_pred CC-CCCCCCCCCCHHHHHHHHHHHHCEEECCCCHHHHHHHHHHCCCCCHHHHHCCCCCCEEEEEEEEEEEEEECCCCCCE Q ss_conf 57-76668999988899985788726211638558999999874897688985055898899999996112101588876 Q 537021.9.peg.7 936 DK-ITLEKFSVENSSVRFENEQRVLGFYFSGHPLDVYKPILRKMGIKGYEESVATIRGNNIRLAAMVVSKQQKKTRKGSR 1014 (1033) Q Consensus 936 ~~-~~~~~~~~~~~~e~l~~E~e~LGf~lS~HPl~~~~~~l~~~~~~~~~~~~~~~~g~~v~iaG~V~~i~~~~TKkG~~ 1014 (1033) .. +..+..++|+..|+|.+|+|+||||+|+|||+.|+..++..+.++..++.+..+|+.|+++|+|+++|++.||+|++ T Consensus 951 ~~~~~~~~~~e~~~~e~l~~E~e~lG~y~s~HPl~~~~~~l~~~~~~~~~~l~~~~~~~~v~~~G~i~~~~~~~tk~G~~ 1030 (1185) T PRK08512 951 VQVNLEPKNEEFDVKELLEFEKETLGIYVSGHPLDEFKEEINKINYTKSSDFENLEDGSEALFVGKIEDIKKKISKKGKK 1030 (1185) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCHHHHHHCCCCCEEEEEEEEEEEEECCCCCCCE T ss_conf 56677888899998799999999734287607189999999873995689996188998899999997867521799898 Q ss_pred EEEEEEECCCCEEEEEECC Q ss_conf 9999998497049999539 Q 537021.9.peg.7 1015 IGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 1015 mafltleD~~g~~e~~iFP 1033 (1033) |+|+++||++|.+|+++|| T Consensus 1031 ~a~~~leD~~g~~e~~~F~ 1049 (1185) T PRK08512 1031 FGIINILDFHGNIELMVFE 1049 (1185) T ss_pred EEEEEEEECCCCEEEEECC T ss_conf 9999998489858999811 No 4 >PRK06826 dnaE DNA polymerase III DnaE; Reviewed Probab=100.00 E-value=0 Score=2513.79 Aligned_cols=1007 Identities=40% Similarity=0.686 Sum_probs=963.4 Q ss_pred CCCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCC Q ss_conf 98813343314447644329989999999978998899811651774999999999769963899999985476643344 Q 537021.9.peg.7 8 SPSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTN 87 (1033) Q Consensus 8 ~~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~ 87 (1033) +|+||||||||+||||+|+++|++||++|+++||+||||||||||||+++|+++|+++|||||||||+|+.++....+. T Consensus 3 ~~~FvhLhvhS~YSlLdga~~i~eLv~~Ake~G~~AlALTD~~nl~Gav~Fy~aakk~GIKPIIG~E~~v~~~~~~~k~- 81 (1157) T PRK06826 3 KKSFVHLHVHTEYSLLDGSGKIKDLIKRAKELGMKSIAITDHGVMYGVVDFYKAAKKQGIKPIIGCEVYVAPRSMHDKE- 81 (1157) T ss_pred CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCCCC- T ss_conf 9983121566637110506599999999998599989997388586899999999987998144558996687411256- Q ss_pred CCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHH Q ss_conf 56566667753999956968999999999899732487787400789998634898899819767967888877997899 Q 537021.9.peg.7 88 QGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVA 167 (1033) Q Consensus 88 ~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a 167 (1033) .......+|++|||||++||+|||+|+|.++.+ +++.+|++++++|.+ |++|||++|||..|+++++++.++.++| T Consensus 82 --~~~~~~~~~LvLLAkN~~GY~nL~kL~S~a~~e-g~~~kPrid~~~L~~-~~eGLI~lsgcl~G~i~~~ll~~~~e~A 157 (1157) T PRK06826 82 --PDIDNETYHLVLLAKNEIGYKNLMKIVSKASIE-GFYYKPRVDHELLKE-HSEGLIALSACLAGEVQSLILKGNYEKA 157 (1157) T ss_pred --CCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHC-CCCCCCCCCHHHHHH-CCCCCEEECCCCCCHHHHHHHCCCHHHH T ss_conf --677787523999958999999999999999871-786788789999996-0799699358754538898856999999 Q ss_pred HHHHHHHHHHCC-CCEEEEEECCCHHHH-HHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 999999987659-868999943991158-999999999998189899808830078788999987668846985765310 Q 537021.9.peg.7 168 EKRLLTFKKLFG-DRLYVNLQRHRGYDR-YRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKDR 245 (1033) Q Consensus 168 ~~~l~~l~~~Fg-d~~ylEl~~~~~~e~-~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~~ 245 (1033) ++.+.+|+++|| +|||+|+|+|+..++ .++..++++|+++++|+||||||||++++|+.+|++|+||++|++++++++ T Consensus 158 ~~~~~~l~~iFg~d~fyLElq~~g~~~~~~~n~~l~~lA~~~~iplVATndvhYl~~eD~~~hdvL~cI~~g~tl~d~~r 237 (1157) T PRK06826 158 KEAALFYKDIFGKENFYLELQDHGIPEQKKVNEELIKLSKELGIPLVATNDVHYIRKEDAKAHDVLLCIQTGKTVDDENR 237 (1157) T ss_pred HHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 99999999985889699997218986899999999999998499989978866578736999999999867996052740 Q ss_pred CCC-CCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 024-5343234347887512203589987886411100013322102345557568877689889998765410342012 Q 537021.9.peg.7 246 PRV-TPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRLA 324 (1033) Q Consensus 246 ~~~-~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~ 324 (1033) .++ +.++||||++||.++|+++|+|++||.+||++|+|++++.+++||+|++|++ .++++||+++|++|+++||+ T Consensus 238 ~~~~~~e~yLKs~~EM~~lF~d~peAl~NT~~IA~rC~~~l~~~~~~lP~~~~p~g----~~~~~~L~~l~~~Gl~~r~~ 313 (1157) T PRK06826 238 MRFPSDEFYLKSPEEMYELFSYVPEALENTVKIAEECNVEFEFGESKLPKFPLPEG----VDPYEYLRELCYEGLIKRYE 313 (1157) T ss_pred CCCCCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC----CCHHHHHHHHHHHHHHHHCC T ss_conf 01787334679999999874015999999999997331013568878987778889----99899999999977887527 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHH Q ss_conf 46432457838999998877898986063055788899999851047643344442046677655440375633323105 Q 537021.9.peg.7 325 QGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLF 404 (1033) Q Consensus 325 ~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlF 404 (1033) + .+++|++||+|||+||.+|||++|||||||+|+|||++||+||||||||||||||||||||+||||+||||| T Consensus 314 ~-------~~~~~~~Rl~~EL~vI~~~gf~~YFLIV~D~i~~Ar~~gI~vGpGRGSaAgSlVaY~LgIT~vDPi~y~LlF 386 (1157) T PRK06826 314 N-------PSEELIERLEYELSVIKQMGYVDYFLIVWDFIKFAKENGIPVGPGRGSAAGSLVAYTLGITKIDPIKYNLLF 386 (1157) T ss_pred C-------CCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCEECCCCCCHHHHHHHHHHCCCCCCHHHCCCCH T ss_conf 9-------989999999999999985187712368999999998668323688772898999998077624802138769 Q ss_pred HHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCC Q ss_conf 67621001457875333656878999999983038021554220778778899999997527998898865431014589 Q 537021.9.peg.7 405 ERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDPA 484 (1033) Q Consensus 405 ERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~~ 484 (1033) ||||||+|++||||||||+++||++||+||++|||+++||||+||+||++|+||||||||||+|+.++|+++|+||..++ T Consensus 387 ERFLnpeR~~mPDIDiDf~~~~R~~vi~yv~~kyG~~~va~i~Tf~t~~~k~airdv~rvlg~~~~~~d~~~k~ip~~~~ 466 (1157) T PRK06826 387 ERFLNPERVSMPDIDSDFCYERRQEVIDYVVEKYGKDNVAQIITFGTMAARAAIRDVGRALNYPYAEVDRIAKMIPTVLG 466 (1157) T ss_pred HHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEEHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCC T ss_conf 98717566788987666650127999999999848555898732000268899999889859999999789873787679 Q ss_pred CCCCHHHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEECCCCCCCCCEEEEEECHHHH Q ss_conf 82443445229887886430742026889872011221002447440335256666620012367688668851021334 Q 537021.9.peg.7 485 HPVSLQAIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLPVTQFNMKWIE 564 (1033) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~~~~e 564 (1033) .++. ++++.++++++.+..++.+++++++|++|||+|||+|+||||||||++||+++||++++ ++..+|||||+++| T Consensus 467 ~~l~-~a~~~~~~l~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~Pl~~~--~~~~~tq~d~~~~e 543 (1157) T PRK06826 467 ITIE-KALELNPELKELYENDERVKELIDTARALEGMPRHASTHAAGVVIASEPLVEYVPLQKN--EDSIVTQFTMTTLE 543 (1157) T ss_pred CCHH-HHHHCCHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCEEEECCCCHHHCCCEECC--CCCEEEEEEHHHHH T ss_conf 9889-99751988998874598899999999997176245577777179606551111780128--98726542678898 Q ss_pred HHCCCHHHHHHHCCHHHHHHHHHHHHH-HCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCCCHHHHHHHHHHCCCHHHHH Q ss_conf 311232135654178999999998874-0254343323443588998762014101221145347888877174048888 Q 537021.9.peg.7 565 KAGLIKFDFLGLKTLTLLQKSLDFLAQ-RGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLESSGMRQALEGMQPDCIEDI 643 (1033) Q Consensus 565 ~~gl~K~D~Lgl~~L~~i~~~~~~i~~-~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l~~~~p~~~~dl 643 (1033) ++||+|||||||+|||+|++++++|++ +|.++|+++||+||++||+||++|+|.|||||||+||+++|++++|++|+|| T Consensus 544 ~~GllK~D~Lgl~~Lt~i~~~~~~i~~~~~~~~d~~~ipldD~~t~~ll~~g~t~GvFQ~Es~g~~~~l~~~~P~~f~dl 623 (1157) T PRK06826 544 ELGLLKMDFLGLRTLTVIRDAVELIKKNRGIDIDLDKIDYDDKEVYKMIGEGKTVGVFQLESAGMTSFMKELKPDSLEDI 623 (1157) T ss_pred HCCCCCEEECCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHCCCCCHHHH T ss_conf 73880022004674899999999999833997671457856699999983667377376677279999986599979998 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCC Q ss_conf 76654114113420036888504657850269667334103355300125689887886326841307889762012110 Q 537021.9.peg.7 644 IALVSLYRPGPIDNIVVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEADVLRRAMGKKIKE 723 (1033) Q Consensus 644 ~~~~al~RPGp~~~~~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ead~lRra~~kk~~~ 723 (1033) +|++||||||||++++.|++||||+++++|+||.|++||++|||||||||||||||+.+||||++|||.||||||||+.+ T Consensus 624 ~a~~aL~RPGp~~~i~~yi~rk~g~~~v~y~hp~l~~iL~~TyGv~vyQEQvM~ia~~~agfs~~eaD~lRram~KK~~~ 703 (1157) T PRK06826 624 IAGISLYRPGPMDEIPRYIANKNNPEKIEYLHPKLEPILSVTYGCIVYQEQVMQIVRDLAGYSMGRSDLVRRAMSKKKHD 703 (1157) T ss_pred HHHHHCCCCCCCCCHHHHHHHCCCCCCCCCCCHHHHHHHCCCCCEEEEHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCHH T ss_conf 86573037750002468998546986777887899976501167796789999999998369867789999870667488 Q ss_pred HHHHHHHHHH------------HHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 1677878999------------9986249999999999999997500013310147888999999999973026667775 Q 537021.9.peg.7 724 EMDKQKERFI------------SGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAAS 791 (1033) Q Consensus 724 ~~~~~~~~f~------------~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~ 791 (1033) +|++++++|+ +||.+||++++.|++||++|++||+||||||||+|||++||||||||+|||+|||||+ T Consensus 704 ~~~~~~~~F~~g~~~~~~~~~v~g~~~~g~~~~~a~~i~~~i~~Fa~YgFnkSHa~aYa~iay~~AylK~hyP~ef~aa~ 783 (1157) T PRK06826 704 VMEEERKNFIYGIVDENGNVEVPGCIRNGIDEEVANKIFDSMMDFASYAFNKSHAAAYAVVAYQTAYLKRYYPVEFMAAL 783 (1157) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHH T ss_conf 99999998650330234567776798869999999999999999723255743236859999999999862879999998 Q ss_pred HCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHHHH-HCCCCCCHHHHHH Q ss_conf 100235420258999888863685035523342100353198698502685896989999999998-3799999789997 Q 537021.9.peg.7 792 MTLEMDNVEKIKKFCQDARQFNIQIMPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIMEAS-ADKPFDSLEDFCS 870 (1033) Q Consensus 792 L~~~~~~~~~~~~~i~e~~~~gi~vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r-~~g~f~sl~df~~ 870 (1033) ||+.+++.+++..|+.||+++||+|+|||||+|.++|++++++||+||++|||||++++++|+++| ++|||+|++||+. T Consensus 784 l~~~~~~~~k~~~~~~e~~~~gi~vlpPdIn~S~~~f~~~~~~Ir~gl~~Ikgvg~~~~~~I~~~R~~~G~f~s~~Df~~ 863 (1157) T PRK06826 784 LNSVMGNSDKVAFYIEECKRLGIEVLPPDINESYSKFTVKGNKIRFGLAAVKNVGENAIDNIVKAREKKGKFTSLVDFCN 863 (1157) T ss_pred HHHCCCCCCHHHHHHHHHHHCCCEEECCEECCCCCCEEEECCEEEEHHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHH T ss_conf 65436895068999999998799597973311388138879978875566268888899999999987799999899998 Q ss_pred HCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCCCCCCCCH Q ss_conf 348247998999988867672347903899999899999999988654312777866665555--655776668999988 Q 537021.9.peg.7 871 RVDSKKLNRRVLESLVFAGALDCFGYSRMQLLQSLDNIQKYAQWVEKNRTNKHENIFAHEKGT--FSDKITLEKFSVENS 948 (1033) Q Consensus 871 rv~~~~~nk~~le~Li~aGAfD~~~~~R~~ll~~i~~~~~~~~~~~~~~~~~q~~LF~~~~~~--~~~~~~~~~~~~~~~ 948 (1033) |++.+.+||+++|+||+|||||+||.||++|+++++.+++++....+.+..+|.+||+..... ......++.+++|+. T Consensus 864 Rv~~~~~nk~~lE~Li~aGAfD~~g~~R~~ll~~l~~~l~~~~~~~~~~~~gq~~LF~~~~~~~~~~~~~~~p~~~e~~~ 943 (1157) T PRK06826 864 RVDTSIINKRAVESLIKAGAFDSFGVYRSQLLAVYEKILDSISKQKKNNIEGQISLFDDIMDMEESSIEIKYPNIKEFDK 943 (1157) T ss_pred HHCCCCCCHHHHHHHHHCCCHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCH T ss_conf 72542169999999986584222020199999999999999999878655587563346655655555567889999998 Q ss_pred HHHHHHHHHHHCEEECCCCHHHHHHHHHHCCCCCHHHHH---------CCCCCCEEEEEEEEEEEEEECCCCCCEEEEEE Q ss_conf 899985788726211638558999999874897688985---------05589889999999611210158887699999 Q 537021.9.peg.7 949 SVRFENEQRVLGFYFSGHPLDVYKPILRKMGIKGYEESV---------ATIRGNNIRLAAMVVSKQQKKTRKGSRIGWVT 1019 (1033) Q Consensus 949 ~e~l~~E~e~LGf~lS~HPl~~~~~~l~~~~~~~~~~~~---------~~~~g~~v~iaG~V~~i~~~~TKkG~~maflt 1019 (1033) .|+|.+|+|+||||+|+|||+.|++.++....+++.++. ...+|++|+|+|+|+++|+++||||++|+|+| T Consensus 944 ~e~L~~Eke~lG~ylS~HPl~~y~~~l~~~~~~~~~~l~~~~~~~~~~~~~~g~~V~vaG~i~~~k~~~TkkG~~maf~t 1023 (1157) T PRK06826 944 KELLAMEKEMLGLYISGHPLEEYEELLKKQTSHKIADIISSEELEGNSELKDGDRVIVGGIITEVKRKVTRNNDMMAFIT 1023 (1157) T ss_pred HHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCHHHHHHCCCCCCCCCCCCCCEEEEEEEEEEEEECCCCCCCEEEEEE T ss_conf 89999989872017661718899999987178339997520011110136899889999999887753179999789999 Q ss_pred EECCCCEEEEEECC Q ss_conf 98497049999539 Q 537021.9.peg.7 1020 FSEPAKEYEANFFP 1033 (1033) Q Consensus 1020 leD~~g~~e~~iFP 1033 (1033) +||++|.+|++||| T Consensus 1024 leD~tG~~e~vvFp 1037 (1157) T PRK06826 1024 LEDLYGTIEVIVFP 1037 (1157) T ss_pred EEECCCCEEEEECH T ss_conf 99689868999988 No 5 >PRK07374 dnaE DNA polymerase III subunit alpha; Validated Probab=100.00 E-value=0 Score=2514.57 Aligned_cols=1012 Identities=39% Similarity=0.659 Sum_probs=957.8 Q ss_pred CCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCC Q ss_conf 81334331444764432998999999997899889981165177499999999976996389999998547664334456 Q 537021.9.peg.7 10 SFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQG 89 (1033) Q Consensus 10 ~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~ 89 (1033) +||||||||+||+|||+++|++||++|+++||+||||||||||||+++|+++|+++|||||+|||+++..+..... T Consensus 2 sFvhLh~hS~YSlLdg~~~i~~Lv~~A~~~G~~AlAlTD~~nl~Gav~F~~aa~~~gIKPIiG~e~~v~~~~~~d~---- 77 (1171) T PRK07374 2 AFVPLHNHSDYSLLDGASQLPKMVERAKELGMPAIALTDHGVMYGAIELLKLCKSAGIKPIIGNEMYVINGSIDDP---- 77 (1171) T ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHCCCCCEEEEEEEEECCCCCCC---- T ss_conf 7515353763711051639999999999859998998628868789999999997699833676899737976566---- Q ss_pred CCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCC----CCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHH Q ss_conf 5666677539999569689999999998997324----877874007899986348988998197679678888779978 Q 537021.9.peg.7 90 KTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLD----QGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQ 165 (1033) Q Consensus 90 ~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~----~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~ 165 (1033) .......+|++|||||++||+|||+|+|.+|+++ +++.+|+++.++|.+ |++|||++|||+.|+++++++.++.+ T Consensus 78 ~~~~~~~~hlvlLAkN~~GY~nL~kL~S~a~~~g~~g~g~~~~p~id~~~L~~-~~eGLI~lsg~~~Gei~~~ll~~~~~ 156 (1171) T PRK07374 78 QPKKEKRYHLVVLAKNATGYKNLVKLTTISHLNGMRGRGIFSRPCIDKELLKQ-YSEGLIVSTACLGGEIPQAILRGRPD 156 (1171) T ss_pred CCCCCCCCCEEEECCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHH-CCCCCEEECCCCCCHHHHHHHCCCHH T ss_conf 55656775189955898999999999999997204578651577689999985-68991991477556488998679988 Q ss_pred HHHHHHHHHHHHCCCCEEEEEECCCHHHH-HHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 99999999987659868999943991158-99999999999818989980883007878899998766884698576531 Q 537021.9.peg.7 166 VAEKRLLTFKKLFGDRLYVNLQRHRGYDR-YRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKD 244 (1033) Q Consensus 166 ~a~~~l~~l~~~Fgd~~ylEl~~~~~~e~-~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~ 244 (1033) .|++.+.||+++||++||+|+|+|+..++ .++..++++|+++++|+||||||||++++|+++|++|+||++|+++++++ T Consensus 157 ~a~~~~~~~~~iFg~~fylEiq~~g~~~~~~~n~~l~~la~~~~iplVATndv~y~~~~D~~ahd~l~cI~~g~~l~d~~ 236 (1171) T PRK07374 157 VARDVAAWYKEVFGDDFYLEIQDHGSIEDRIVNVELVRIAKELGIKLIATNDAHYLSKNDVEAHDALLCVLTGKLISDEK 236 (1171) T ss_pred HHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 99999999998638868999824898478999999999999809798996886317842588999999997799378754 Q ss_pred CCCCCCCCCCCCHHHHHHHCCH------HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 0024534323434788751220------3589987886411100013322102345557568877689889998765410 Q 537021.9.peg.7 245 RPRVTPDHYLKNRSEMVSIFSD------LPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRNKAVAG 318 (1033) Q Consensus 245 ~~~~~~~~yLks~eEM~~~f~~------~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~G 318 (1033) +.++++++||||++||.++|.+ +|+|++||.+||++|++.+.+.+++||+||+|++ .++++||+++|++| T Consensus 237 r~~~~~~~ylks~eEM~~lF~d~~~~~~~peAl~NT~~IA~rc~~~~~~~~~~lP~fp~p~g----~t~~~~L~~l~~~G 312 (1171) T PRK07374 237 RLRYTGTEYIKSEEEMLRLFRDHLDPEVIQEAIANTVEVAEKVEEYDILGTYRMPRFPIPEG----HTAVSYLTEVTEQG 312 (1171) T ss_pred CCCCCHHHHCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC----CCHHHHHHHHHHHH T ss_conf 45589766188999999987630262248999999999998532310137866888888889----99899999999988 Q ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHH Q ss_conf 34201246432457838999998877898986063055788899999851047643344442046677655440375633 Q 537021.9.peg.7 319 LEVRLAQGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPL 398 (1033) Q Consensus 319 l~~R~~~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi 398 (1033) |++||+.... ...+++|++||+|||+||++|||++|||||||+|+|||++||+||||||||||||||||||||+|||| T Consensus 313 l~~R~~~~~~--~~~~~~y~~RL~~EL~vI~~mGf~~YFLIV~D~i~~Ak~~gI~vGPGRGSaAGSLVaY~LgIT~VDPl 390 (1171) T PRK07374 313 LRKRLKLASL--DEIDENYKERLSSELKIIEQMGFPTYFLVVWDYIRFAREQGIPVGPGRGSAAGSLVAYALGITNIDPV 390 (1171) T ss_pred HHHHHCCCCC--CCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCCHH T ss_conf 9998175786--43679999999999999997799253699999999999739713678771476799998348555812 Q ss_pred HCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 32310567621001457875333656878999999983038021554220778778899999997527998898865431 Q 537021.9.peg.7 399 RFSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKL 478 (1033) Q Consensus 399 ~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~ 478 (1033) +|||||||||||+|++||||||||+++||++||+||++|||+++||||+||+||++|+||||||||||+|+.++|++||+ T Consensus 391 ~y~LLFERFLNpeR~smPDIDiDF~~~rR~eVI~Yv~~kYG~d~VAqI~Tf~t~~akaaiRDVgRvlg~p~~~~d~iaKl 470 (1171) T PRK07374 391 KNGLLFERFLNPERKSMPDIDTDFCIERRGEVIDYVTRRYGEDKVAQIITFNRMTSKAVLKDVARVLDIPYGEADKLAKL 470 (1171) T ss_pred HCCCCHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHCCCHHHHHHHHHC T ss_conf 13886876058777788886755440048999999998838755686421032224799999999868987789999960 Q ss_pred CCCCCCCCCCHHH-HHC---CHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEECCCCCCCCCE Q ss_conf 0145898244344-522---988788643074202688987201122100244744033525666662001236768866 Q 537021.9.peg.7 479 IPNDPAHPVSLQA-IAD---DSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLP 554 (1033) Q Consensus 479 i~~~~~~~~~~~~-~~~---~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~ 554 (1033) ||...+...+++. +.+ .++|++.+..+|..++++++|++|||+|||+|+||||||||++||++++|++.+. +|.+ T Consensus 471 ip~~~g~~~~l~~~~~~~~~~~~l~~~~~~d~~~~~~~~~A~~LEGlpR~~g~HAaGvVIs~~pL~~~vPl~~~~-~g~~ 549 (1171) T PRK07374 471 IPVVRGKPAKLAAMIGEESPSPEFREKYENDPRVKKWVDMAMRIEGTNKTFGVHAAGVVIASDPLDELVPLQRNN-DGQI 549 (1171) T ss_pred CCCCCCCCCCHHHHHHCCCCCHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCCCCCCCEEECC-CCCE T ss_conf 875579960099996405669899987733999999999999863873321235652573377743136513578-8984 Q ss_pred EEEEECHHHHHHCCCHHHHHHHCCHHHHHHHHHHHHH-HCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCCCHHHHHHHH Q ss_conf 8851021334311232135654178999999998874-025434332344358899876201410122114534788887 Q 537021.9.peg.7 555 VTQFNMKWIEKAGLIKFDFLGLKTLTLLQKSLDFLAQ-RGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLESSGMRQALE 633 (1033) Q Consensus 555 ~~q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~-~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l~ 633 (1033) +|||||++||++||+|||||||||||+|++++++|++ +|.++|+++||+||++||+||++|+|.||||+||+||+++|+ T Consensus 550 vtQ~dk~dvE~~GLlK~DfLGLr~LtiI~~a~~~I~~~~g~~idl~~IpldD~~t~~ll~~g~T~GVFQlES~gm~~~l~ 629 (1171) T PRK07374 550 ITQYFMEDIESLGLLKMDFLGLKNLTMIEKTVELVEQSTGERIDPDNLPLEDEKTFELLGRGDLEGIFQLESSGMRQIVR 629 (1171) T ss_pred EECCCHHHHHHCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCHHHHHHHH T ss_conf 23035655887698366676368679999999999985499777133686619999998554878834136679999999 Q ss_pred HHCCCHHHHHHHHHHHCCCHHHHHH--HHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHH Q ss_conf 7174048888766541141134200--36888504657850269667334103355300125689887886326841307 Q 537021.9.peg.7 634 GMQPDCIEDIIALVSLYRPGPIDNI--VVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEAD 711 (1033) Q Consensus 634 ~~~p~~~~dl~~~~al~RPGp~~~~--~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ead 711 (1033) +++|++|+||||++||||||||+++ +.||+||||+|+++|+||.|++||++|||||||||||||||+.+||||+|||| T Consensus 630 ~lkP~~feDiva~~AL~RPGP~~~gmi~~yi~Rk~g~e~i~y~hp~L~~IL~~TyGvivYQEQVMqia~~~aGfslgeAD 709 (1171) T PRK07374 630 DLKPSSLEDISSILALYRPGPLDAGLIPKFINRKHGREAIDFAHPLLEPILTETYGIMVYQEQIMKIAQDLAGYSLGQAD 709 (1171) T ss_pred HCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCCCCHHHHHHHCCCCCEEEEHHHHHHHHHHHHCCCHHHHH T ss_conf 71999899999888424787554443389998517997888998999865424378787599999999986069978999 Q ss_pred HHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 88976201211016778789999986249999999999999997500013310147888999999999973026667775 Q 537021.9.peg.7 712 VLRRAMGKKIKEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAAS 791 (1033) Q Consensus 712 ~lRra~~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~ 791 (1033) .||||||||+.++|++.+++|++||.+||+++++|++||++|++||+||||||||+|||++||||||||+|||+|||||+ T Consensus 710 ~LRRAmgKKk~~~m~~~r~~Fi~Ga~~~G~~~~~A~~if~~i~~Fa~YgFnkSHaaAYa~~aYqtAyLK~hyP~eF~aal 789 (1171) T PRK07374 710 LLRRAMGKKKVSEMQKHRGIFVEGASKRGVDEKVADELFDQMVLFAEYCFNKSHSTAYGAVTYQTAYLKAHYPVAYMAAL 789 (1171) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHH T ss_conf 99998754868899999999997898759999999999999996468787842268999999999999977899999998 Q ss_pred HCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHHHH-HCCCCCCHHHHHH Q ss_conf 100235420258999888863685035523342100353198698502685896989999999998-3799999789997 Q 537021.9.peg.7 792 MTLEMDNVEKIKKFCQDARQFNIQIMPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIMEAS-ADKPFDSLEDFCS 870 (1033) Q Consensus 792 L~~~~~~~~~~~~~i~e~~~~gi~vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r-~~g~f~sl~df~~ 870 (1033) ||+++++++|+..|+.||+++||+|+|||||+|.++|++++++||+||++|||||++++++|+++| ++|||+|++||+. T Consensus 790 L~~~~~~~~k~~~~i~ea~~~gi~vlpPdIN~S~~~f~~~~~~I~~gl~~Ikgvg~~~~~~Iv~~R~~~Gpf~sl~Df~~ 869 (1171) T PRK07374 790 LTVNAGSSDKVQRYISNCNSMGIEVMPPDINASGIDFTPKGNRILFGLSAVKNLGDGAIRNLIASRDSDGPFKSLADLCD 869 (1171) T ss_pred HCCCCCCCHHHHHHHHHHHHCCCCEECCCEEEECCCEEECCCCEEHHHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHH T ss_conf 50588997479999999998799162883322178216759822146567437339999999999985859798999996 Q ss_pred HCCCCCCCHHHHHHHHHCCCCCCCCC--CHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC--------CCCCCCC Q ss_conf 34824799899998886767234790--3899999899999999988654312777866665555--------6557766 Q 537021.9.peg.7 871 RVDSKKLNRRVLESLVFAGALDCFGY--SRMQLLQSLDNIQKYAQWVEKNRTNKHENIFAHEKGT--------FSDKITL 940 (1033) Q Consensus 871 rv~~~~~nk~~le~Li~aGAfD~~~~--~R~~ll~~i~~~~~~~~~~~~~~~~~q~~LF~~~~~~--------~~~~~~~ 940 (1033) |++.+.+||+++|+||+|||||+|+. ||++|+++++.+++++....+++..+|.+||+..... ....+.. T Consensus 870 Rv~~~~~nr~~le~Li~aGAfD~l~~~~~R~~l~~~~~~~~~~a~~~~~~~~~gq~~lf~~~~~~~~~~~~~~~~~~p~~ 949 (1171) T PRK07374 870 RLPSNVLNRRSLESLIHCGALDAFSPNANRAQLIADLDLVIDWASSRARDRASGQGNLFDLLAGASEEEASNDLSSAPKA 949 (1171) T ss_pred HCCCCCCCHHHHHHHHHCCCCCCCCCCCCHHHHHHCHHHHHHHHHHHHHHHHCCCCCCHHCCCCCCCCCCCCCCCCCCCC T ss_conf 40631289899999997796011798888999986649999999997455433674400002455433333322237767 Q ss_pred CCCCCCCHHHHHHHHHHHHCEEECCCCHHHHHHHHHHCCCCCHHHHHCCCCCCEEEEEEEEEEEEEECCCCCCEEEEEEE Q ss_conf 68999988899985788726211638558999999874897688985055898899999996112101588876999999 Q 537021.9.peg.7 941 EKFSVENSSVRFENEQRVLGFYFSGHPLDVYKPILRKMGIKGYEESVATIRGNNIRLAAMVVSKQQKKTRKGSRIGWVTF 1020 (1033) Q Consensus 941 ~~~~~~~~~e~l~~E~e~LGf~lS~HPl~~~~~~l~~~~~~~~~~~~~~~~g~~v~iaG~V~~i~~~~TKkG~~mafltl 1020 (1033) +.+++|+..|++.+|+|+||||+|+|||+.|++.++.....++.++.+..+++.|+++|+|+.+|++.||+|++|+|+|+ T Consensus 950 ~~~~~~~~~e~l~~E~e~lG~yls~HPl~~~~~~~~~~~~~~~~~l~~~~~~~~v~v~g~v~~~~~~~Tk~G~~maf~tl 1029 (1171) T PRK07374 950 APVPDYPPTEKLKLEKELLGFYLSDHPLKQLTEPAKLLAPISLSSLEEQKDKAKVSVIAMIPEMRQVTTRKGDRMAILQL 1029 (1171) T ss_pred CCCCCCCHHHHHHHHHHHHEECCCCCHHHHHHHHHHHHCCCCHHHHHHCCCCCEEEEEEEEEEEEECCCCCCCEEEEEEE T ss_conf 88889987789999888863421417099999999972883799985077998899999997625267789996899999 Q ss_pred ECCCCEEEEEECC Q ss_conf 8497049999539 Q 537021.9.peg.7 1021 SEPAKEYEANFFP 1033 (1033) Q Consensus 1021 eD~~g~~e~~iFP 1033 (1033) ||+||.+|+++|| T Consensus 1030 eD~tg~~evvvFp 1042 (1171) T PRK07374 1030 EDLTGSCEAVVFP 1042 (1171) T ss_pred EECCCCEEEEECH T ss_conf 9899868999978 No 6 >PRK07892 consensus Probab=100.00 E-value=0 Score=2510.01 Aligned_cols=1017 Identities=37% Similarity=0.611 Sum_probs=950.8 Q ss_pred CCCCCCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCC Q ss_conf 66898813343314447644329989999999978998899811651774999999999769963899999985476643 Q 537021.9.peg.7 5 LEKSPSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLK 84 (1033) Q Consensus 5 ~~~~~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~ 84 (1033) |-++|+||||||||+||+|+|+++|++||++|+++||+||||||||||||+++|+++|+++|||||||||+|+.++.+.. T Consensus 1 mms~~~FVHLhvhS~YSlLdGa~~ieeLV~~Ake~G~~AlALTD~~nl~GaveFy~ackk~GIKPIIG~E~yv~p~sr~d 80 (1180) T PRK07892 1 MMSGDSFVHLHNHTEYSMLDGAAKIKPLFAEVARLGMPAVAMTDHGNMFGAYEFYNTAKKAGIKPIIGIEAYVAPGSRFD 80 (1180) T ss_pred CCCCCCEECCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCC T ss_conf 99999810114666372105065999999999985999899950886867999999999879982852179965764111 Q ss_pred CCCC----------CCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCH Q ss_conf 3445----------656666775399995696899999999989973248778740078999863489889981976796 Q 537021.9.peg.7 85 MTNQ----------GKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGP 154 (1033) Q Consensus 85 ~~~~----------~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~ 154 (1033) .... ........+|++|||||++||+|||+|+|.++.++.++..|+++.++|.+ |++||||+|||..|+ T Consensus 81 ~~~~~~g~~~~~~~d~s~~~~~~hLiLLAkN~~GY~NL~kL~S~a~~eg~~~k~Prid~elL~~-~~eGLI~lsgc~~g~ 159 (1180) T PRK07892 81 KRRVRWGDPSQKSDDVSGGGAYTHMTMLAENATGLRNLFKLSSLASLEGQLGKWPRMDAELIAE-YAEGIIATTGCPSGE 159 (1180) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHH-HCCCEEEECCCCCCH T ss_conf 1000136654344443455655449999898799999999999998677667888689999997-369949965887658 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHHCC-CCEEEEEECCCHH-HHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHH Q ss_conf 7888877997899999999987659-8689999439911-5899999999999818989980883007878899998766 Q 537021.9.peg.7 155 IDRAFFFNSSQVAEKRLLTFKKLFG-DRLYVNLQRHRGY-DRYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLM 232 (1033) Q Consensus 155 i~~~l~~~~~~~a~~~l~~l~~~Fg-d~~ylEl~~~~~~-e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~ 232 (1033) +++.+..+++++|.+.+.+|+++|| +|||+|+|+|+.. ++..++.++++|+++++|+||||||||++++|+.+|++|+ T Consensus 160 v~~~l~~g~~~eA~~~~~~~~~iFg~d~fyLElq~hg~~~e~~~~~~L~~lAk~~~iplVATndvhYl~~eD~~~hdvL~ 239 (1180) T PRK07892 160 VQTRLRLGQYDEALEAAAKWQDIFGKENFFLELMDHGLDIERRVRDGLLEIARKLGLPPLATNDSHYVTEDDAQAHEALL 239 (1180) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHH T ss_conf 99998769989999999999998588979999823897589999999999999819998996887216861689999999 Q ss_pred HHHCCCCCCCCCCCCCC-CCCCCCCHHHHHHHCC-HHHHHHHHHHHHHHHHHHHHH--HHHCCCCCCCCCCCCCCCCCHH Q ss_conf 88469857653100245-3432343478875122-035899878864111000133--2210234555756887768988 Q 537021.9.peg.7 233 AVAHSTVVSQKDRPRVT-PDHYLKNRSEMVSIFS-DLPEALENTVEIARRCSFILQ--VHAPILPRFIEKECDDIQKEEE 308 (1033) Q Consensus 233 aI~~~~~l~~~~~~~~~-~~~yLks~eEM~~~f~-~~peAi~NT~~IAe~c~~~l~--~~~~~lP~f~~p~~~~~~~~~~ 308 (1033) ||++|++++++++.++. .++||||++||.++|. ++|+|++||.+||++|++.++ .....||+|++|++ .+++ T Consensus 240 cI~~g~tl~d~~r~~~~~~e~yLKS~eEM~~lF~~d~peAl~NT~~IAerc~~~~~~~~~~~~lP~f~~p~g----~~~~ 315 (1180) T PRK07892 240 CVQTGKTLSDPNRFKFDGDGYYLKSAEEMRALWDDDVPEACDNTLLIAERVQSYDDVWEPRDRMPRFPVPEG----ETET 315 (1180) T ss_pred HHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC----CCHH T ss_conf 996699637585123786324579999999987765167778999999985201223567555998899999----9999 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHH Q ss_conf 99987654103420124643245783899999887789898606305578889999985104764334444204667765 Q 537021.9.peg.7 309 NELRNKAVAGLEVRLAQGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAY 388 (1033) Q Consensus 309 ~~Lr~l~~~Gl~~R~~~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY 388 (1033) +||+++|++||++||+++ .+++|++||+|||+||.+|||++|||||||+|+|||++||+||||||||||||||| T Consensus 316 ~~L~~l~~~Gl~~Ry~~~------~~~ey~~RL~~EL~vI~~mgf~~YFLIV~D~i~~Ak~~gI~vGPGRGSaAGSLVaY 389 (1180) T PRK07892 316 SWLRKEVEAGLERRFPGG------VPDEVRERAEYEIGVICQMGFPSYFLVVADFINWAKSNGIRVGPGRGSAAGSLVAY 389 (1180) T ss_pred HHHHHHHHHHHHHHCCCC------CCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHH T ss_conf 999999998899876889------98999999999999998648716999999999999864886279877358889999 Q ss_pred HHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCC Q ss_conf 54403756333231056762100145787533365687899999998303802155422077877889999999752799 Q 537021.9.peg.7 389 ALTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMP 468 (1033) Q Consensus 389 ~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~ 468 (1033) |||||+||||+|||||||||||+|++||||||||+++||++||+||++|||+++||||+||+||++|+||||||||||+| T Consensus 390 ~LgIT~vDPi~y~LlFERFLNpeR~smPDIDiDf~~~rR~eVi~Yv~~kYG~~~VaqI~Tf~t~~ar~aiRDv~Rvlg~~ 469 (1180) T PRK07892 390 ALGITDLDPLEHGLLFERFLNPERVSMPDIDIDFDDRRRGEVIRYVTEKYGEDRVAQIITFGTIKAKAALKDSARVLGYP 469 (1180) T ss_pred HHCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEECHHHHHHHHHHHHHHHHHCCC T ss_conf 84563338401575599870334677888765666240999999999985767489851101055889999999985999 Q ss_pred HHHHHHHHHHCCCCC-CCCCCH--------HHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCH Q ss_conf 889886543101458-982443--------44522988788643074202688987201122100244744033525666 Q 537021.9.peg.7 469 YSQVDRLCKLIPNDP-AHPVSL--------QAIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPL 539 (1033) Q Consensus 469 ~~~~~~~~~~i~~~~-~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l 539 (1033) +.++|++||+||... +...++ +.+.+.++|++++.++|..++++++|.+|||+|||+|+||||||||++|| T Consensus 470 ~~~~d~iak~ip~~~~~~~~~l~~~~~~~~~~~~e~~~lr~~~~~~~~~~~l~~~A~~leGl~R~~s~HaaGvvis~~~l 549 (1180) T PRK07892 470 FAIGDRITKALPPAVMGKDIPLSGIFDPSHERYKEAGEVRQLIETDPDVRKIYDTARGLEGLIRQAGVHAAAVIMSSEPL 549 (1180) T ss_pred HHHHHHHHHHCCCHHCCCCCCHHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEEECCCC T ss_conf 88999998728802115887655404853077886699987763398999999999983598665677788799527876 Q ss_pred HHHEECCCCCCCCCEEEEEECHHHHHHCCCHHHHHHHCCHHHHHHHHHHHHH-HCCCCCCCCCCCCHHHHHHHHHCCCCC Q ss_conf 6620012367688668851021334311232135654178999999998874-025434332344358899876201410 Q 537021.9.peg.7 540 SQLVPMYRDVRSDLPVTQFNMKWIEKAGLIKFDFLGLKTLTLLQKSLDFLAQ-RGVKVDLSLIPFDDHETYHLLTTKGTL 618 (1033) Q Consensus 540 ~~~~p~~~~~~~~~~~~q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~-~~~~~d~~~ip~~D~~~~~l~~~g~t~ 618 (1033) ++++|++..+++|.++|||||++||++||+|||||||+|||+|++++++|++ +|.++|+.+||+||++||+||++|+|. T Consensus 550 ~~~vPl~~~~~~~~~vtq~d~~~~e~~GllK~D~LGl~~Lt~i~~~~~~i~~~~~~~~d~~~ipldD~~t~~ll~~g~t~ 629 (1180) T PRK07892 550 IDAIPLWKRPQDGAIITGFDYPSCEALGLLKMDFLGLRNLTIIGDAIENIKANRGIDLDLENLPLDDPATYELLGRGDTL 629 (1180) T ss_pred CCCCCEEECCCCCCEEEEEECCHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCC T ss_conf 65567043278996588742004655698301010478468999999999984498401056887619999998504669 Q ss_pred HHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHCCCHHHHHHH--HHHHHHCCCCCCCCCCHHHH----HHHCCCCCCCCHH Q ss_conf 12211453478888771740488887665411411342003--68885046578502696673----3410335530012 Q 537021.9.peg.7 619 GIFQLESSGMRQALEGMQPDCIEDIIALVSLYRPGPIDNIV--VYNNRKNGKEKIVSIHPLID----PMLKETQGVIIYQ 692 (1033) Q Consensus 619 GvFQ~es~~~~~~l~~~~p~~~~dl~~~~al~RPGp~~~~~--~yi~rk~g~e~~~~~hp~l~----~iL~~T~Gv~vyQ 692 (1033) ||||+||+|||++|++++|++|+||+|++||||||||++++ .|++||||+++++|+||.++ +||++|||||||| T Consensus 630 GvFQ~Es~g~~~~l~~~~p~~f~di~a~~aL~RPGp~~~~~~~~yi~rk~g~~~~~~~hp~l~~~l~~iL~~TyGv~vyQ 709 (1180) T PRK07892 630 GVFQLDGGPMRSLLRRMQPTNFEDISAVLALYRPGPMGANAHTDYADRKNGRQEITPIHPELEEPLKEILGETYGLIVYQ 709 (1180) T ss_pred CCCCCCCCHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEH T ss_conf 97845566388999856999899989998630677566663568999756998877899537889999984046747647 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH Q ss_conf 56898878863268413078897620121101677878999998624999999999999999750001331014788899 Q 537021.9.peg.7 693 EQVMQIAQLLSGYSLSEADVLRRAMGKKIKEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVI 772 (1033) Q Consensus 693 EQvm~ia~~~ag~~~~ead~lRra~~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~ 772 (1033) |||||||+++||||+||||.||||||||+.++|++++++|++||.+||+++++|++||++|++||+||||||||+|||++ T Consensus 710 EQvm~ia~~~agfs~g~ad~lRram~Kk~~~~~~~~~~~F~~g~~~~g~~~~~a~~i~~~i~~Fa~YgFnksHs~aYa~~ 789 (1180) T PRK07892 710 EQVMAIAQKVAGYSLGQADILRRAMGKKKKEVLDKEFAGFSEGMKANGYSDAAIKALWDTLLPFAGYAFNKSHAAGYGLV 789 (1180) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHCHHHCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHH T ss_conf 89999999864798778999999861174167899999997221215999999999999998762645575312787999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHH Q ss_conf 99999999730266677751002354202589998888636850355233421003531986985026858969899999 Q 537021.9.peg.7 773 SYQTAWMKTHYPVEFLAASMTLEMDNVEKIKKFCQDARQFNIQIMPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARH 852 (1033) Q Consensus 773 ~y~~aylK~~yP~eF~aa~L~~~~~~~~~~~~~i~e~~~~gi~vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~ 852 (1033) ||||||||+|||+|||||+||++++++++++.|+.|||++||+|+|||||+|.++|++++++||+||++|||||+.++++ T Consensus 790 ayqtAyLK~hyP~ef~aa~l~~~~~~~~k~~~~~~e~~~~gi~vlpPdIN~S~~~f~~~~~~Ir~gL~~Ikgvg~~~~~~ 869 (1180) T PRK07892 790 SYWTAYLKANYPAEYMAALLTSVGDDKDKSAVYLAECRRMGITVLPPDVNESSLNFTAVGDDIRFGLGAVRNVGANVVDS 869 (1180) T ss_pred HHHHHHHHHCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEECCCCCCCCCCCEECCCEEEECCCCCCCCCHHHHHH T ss_conf 99999986578999999998548673168999999999879907368754467765526980470662037888889999 Q ss_pred HHHHH-HCCCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 99998-37999997899973482479989999888676723479038999998999999999886543127778666655 Q 537021.9.peg.7 853 IMEAS-ADKPFDSLEDFCSRVDSKKLNRRVLESLVFAGALDCFGYSRMQLLQSLDNIQKYAQWVEKNRTNKHENIFAHEK 931 (1033) Q Consensus 853 Ive~r-~~g~f~sl~df~~rv~~~~~nk~~le~Li~aGAfD~~~~~R~~ll~~i~~~~~~~~~~~~~~~~~q~~LF~~~~ 931 (1033) |+++| +||||+|++||+.|++.+.+||+++|+||+|||||+||.||++|+++++.++..+...++....+|.+||+... T Consensus 870 Ii~~R~~~G~f~s~~Df~~R~~~~~~nk~~lE~Li~aGAfD~~~~~R~~ll~~~~~~l~~~~~~~~~~~~gq~~Lf~~~~ 949 (1180) T PRK07892 870 IIAAREEKGKFTDFSDFLDKVDAVACNKRVTESLIKAGAFDSLGHTRRGLFLVHEDAVDSVVGTKRAEAVGQFDLFGGGD 949 (1180) T ss_pred HHHHHHHCCCCCCHHHHHHHHCCCCCCHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 99999857999988999987165424899999998669523307459999997999999999999987668878667776 Q ss_pred CCCCCC------CCCCCCCCCCHHHHHHHHHHHHCEEECCCCHHHHHHHHHHCCCCCHHHHH--CCCCCCEEEEEEEEEE Q ss_conf 556557------76668999988899985788726211638558999999874897688985--0558988999999961 Q 537021.9.peg.7 932 GTFSDK------ITLEKFSVENSSVRFENEQRVLGFYFSGHPLDVYKPILRKMGIKGYEESV--ATIRGNNIRLAAMVVS 1003 (1033) Q Consensus 932 ~~~~~~------~~~~~~~~~~~~e~l~~E~e~LGf~lS~HPl~~~~~~l~~~~~~~~~~~~--~~~~g~~v~iaG~V~~ 1003 (1033) ...... ..+| .++|+..|+|.+|+|+||||+|+|||+.|+..++....+++.++. ...+|+.|+|+|+|++ T Consensus 950 ~~~~~~~~~~~~~~~p-~~e~~~~e~L~~Eke~LG~ylS~HPl~~y~~~l~~~~~~~i~~l~~~~~~~~~~v~vaG~V~~ 1028 (1180) T PRK07892 950 DDGEDASDSVFTIKVP-DDEWDKKHKLALEREMLGLYVSDHPLDGLEHLLAAQVDTSIAALLDGDVPNDGQVTIGGIITS 1028 (1180) T ss_pred CCCCCCCCCCCCCCCC-CCCCCHHHHHHHHHHHHEECCCCCHHHHHHHHHHHHCCCCHHHHHHCCCCCCCEEEEEEEEEE T ss_conf 5555444554456899-765897889998888732533508299999999984783199984121579988999999978 Q ss_pred EEEECCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 121015888769999998497049999539 Q 537021.9.peg.7 1004 KQQKKTRKGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 1004 i~~~~TKkG~~mafltleD~~g~~e~~iFP 1033 (1033) +|++.||||++|+|+|+||++|.+|++||| T Consensus 1029 ~~~~~TkkG~~maf~tleD~tG~~e~vvFp 1058 (1180) T PRK07892 1029 VQRRVNKNGNPWAIATLEDLDGSIEVLFFP 1058 (1180) T ss_pred EEECCCCCCCEEEEEEEEECCCCEEEEECH T ss_conf 885417899989999999799868999868 No 7 >PRK06658 consensus Probab=100.00 E-value=0 Score=2505.27 Aligned_cols=1010 Identities=40% Similarity=0.685 Sum_probs=962.6 Q ss_pred CCCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCC Q ss_conf 98813343314447644329989999999978998899811651774999999999769963899999985476643344 Q 537021.9.peg.7 8 SPSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTN 87 (1033) Q Consensus 8 ~~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~ 87 (1033) .++||||||||+||+|+|+++|++||++|+++||+||||||||||||+++|+++|+++|||||||||+|+.++.+..+.. T Consensus 3 ~~~FvhLhvhS~ySlLdg~~~i~~Lv~~AkelG~~AlAITD~~nL~Gav~Fy~aakk~GIKPIIG~E~~v~~~~r~~~~~ 82 (1145) T PRK06658 3 RAKFVHLHVHSDYSLLDGAASISDLISKAKKLGQSALALTDHGNMFGAIKFYKECKKAGIKPIIGCEAYVAPESRFLKKE 82 (1145) T ss_pred CCCCHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCC T ss_conf 89843114546271205075999999999985999999951886989999999999779992445389966864124566 Q ss_pred CCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHH Q ss_conf 56566667753999956968999999999899732487787400789998634898899819767967888877997899 Q 537021.9.peg.7 88 QGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVA 167 (1033) Q Consensus 88 ~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a 167 (1033) .......+|++|||||++||+|||+|+|.++.+ +++++|++++++|.+ +++|||++|||..|+++++++.++.+.| T Consensus 83 --~~~~~~~~hLvLLAkN~~GY~NL~kL~S~a~~e-g~~~kPrid~e~L~~-~~eGLI~LSgcl~G~v~~~ll~g~~e~A 158 (1145) T PRK06658 83 --DDLGKMYYHLILLAKNETGYRNLMVLTSKSYLE-GMYYKPRIDDELLAQ-YAEGLICLSACLAGLLPRLLLAGRKEDA 158 (1145) T ss_pred --CCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHC-CCCCCCCCCHHHHHH-CCCCCEEECCCCCCCHHHHHHCCCHHHH T ss_conf --877665330899828999999999999999872-776888689999996-0699699247644606688857998999 Q ss_pred HHHHHHHHHHCC-CCEEEEEECCCHHHH-HHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 999999987659-868999943991158-999999999998189899808830078788999987668846985765310 Q 537021.9.peg.7 168 EKRLLTFKKLFG-DRLYVNLQRHRGYDR-YRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKDR 245 (1033) Q Consensus 168 ~~~l~~l~~~Fg-d~~ylEl~~~~~~e~-~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~~ 245 (1033) ++.+.+|+++|| ++||+|+|+|+..++ .++..++++|+++++|+||||||||++++|+.+|++|+||++|++++++++ T Consensus 159 ~~~~~~~~~iFg~d~fylElq~~g~~~~~~~n~~L~~lA~~~gIPlVATNdvhYl~~eD~~ahdvL~cI~~g~~l~d~~r 238 (1145) T PRK06658 159 ENHIRKYRKVFGVNNYFIELQDHGLKEEKKVNELLIEYARELGVPLVVTNDSHYVEQEDAVAQDILVCIGTKKKRSDENR 238 (1145) T ss_pred HHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCH T ss_conf 99999999971989599997058982799999999999998399989978762268535799999999866997254430 Q ss_pred CCC-CCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 024-5343234347887512203589987886411100013322102345557568877689889998765410342012 Q 537021.9.peg.7 246 PRV-TPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRLA 324 (1033) Q Consensus 246 ~~~-~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~ 324 (1033) .++ ++++||||++||.++|+++|+|++||.+||++|++++++..++||+|++|++. .++++||+++|++|+++||+ T Consensus 239 ~~~~~~e~ylKs~eEM~~lF~d~PeAl~NT~~IAerCn~~l~~~~~~lP~f~~P~~~---~~~~~~L~~l~~~Gl~~Ry~ 315 (1145) T PRK06658 239 LKFETNEFYIKSQEEMAQLFPDYPEALLNTVRIAEKCNFKIPQPGPILPDYQIPIEF---SSKEEYIRHLTLEGLKFRYK 315 (1145) T ss_pred HCCCCHHHHCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC---CCHHHHHHHHHHHHHHHHCC T ss_conf 026775541799999998831559999989999973542114788768877789877---89999999999999998678 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHH Q ss_conf 46432457838999998877898986063055788899999851047643344442046677655440375633323105 Q 537021.9.peg.7 325 QGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLF 404 (1033) Q Consensus 325 ~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlF 404 (1033) . .+++|++||+|||+||.+|||++|||||||+|+|||++||+||||||||||||||||||||+||||+||||| T Consensus 316 ~-------~~~~~~~RL~~EL~vI~~mgf~~YFLIV~D~i~~Ak~~gI~vGpGRGSaAGSLVaY~LgIT~VDPi~y~LlF 388 (1145) T PRK06658 316 V-------LTSEIKDRAFYELDVIIRMDFVGYFLIVWDFIKWAKDNDIPVGPGRGSGAGSIVAYALRITDIDPLKYNLLF 388 (1145) T ss_pred C-------CCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCEECCCCCCHHHHHHHHHHCCCCCCCHHCCCCH T ss_conf 8-------869999999999999986798452799999999999668222788774698999999579344801039868 Q ss_pred HHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCC Q ss_conf 67621001457875333656878999999983038021554220778778899999997527998898865431014589 Q 537021.9.peg.7 405 ERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDPA 484 (1033) Q Consensus 405 ERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~~ 484 (1033) ||||||+|++||||||||+++||++||+||++|||+++||||+||+||++|+||||||||||+|+.++|++||+||...+ T Consensus 389 ERFLnpeR~smPDIDiDF~~~rR~eVi~Yv~~kYG~~~VaqI~Tf~t~~ak~aiRDvgRvlg~~~~~~d~iaK~ip~~~~ 468 (1145) T PRK06658 389 ERFLNPERISMPDFDIDFCFERRQEVIEYVRNKYGEDNVGQIITFGTLKPKAVVKDVGRVLDIPLAEVNEITKLIPDGPK 468 (1145) T ss_pred HHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCC T ss_conf 87448888888876654540048999999998857666786410888999999999999849999999999861689987 Q ss_pred CCCCHHHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEECCCCCCCCCEEEEEECHHHH Q ss_conf 82443445229887886430742026889872011221002447440335256666620012367688668851021334 Q 537021.9.peg.7 485 HPVSLQAIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLPVTQFNMKWIE 564 (1033) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~~~~e 564 (1033) .++. ++++..+ +++...++|.+++++++|++|||+|||+|+||||||||++||+++||+++++++|.++|||||+++| T Consensus 469 ~~l~-~a~~~~~-~~~~~~~~~~~~~l~~~A~~leGl~R~~s~HaaGvVIs~~~L~~~vPl~~~~~~~~~~tq~d~~~~E 546 (1145) T PRK06658 469 VTLK-EALDDNS-LKEYMTSKPVYKELFNIALVLEGMNRNASLHAAGIVIGKTALTDYVPLYKDSKTGKVSTQFTMDLIE 546 (1145) T ss_pred CCHH-HHHHCCH-HHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCCCCEEEEECCHHH T ss_conf 6899-9852336-6665412878999999999844984444567774898377744124344457689730541064277 Q ss_pred HHCCCHHHHHHHCCHHHHHHHHHHHHHH-C--CCCCCCCCCCCHHHHHHHHHCCCCCHHHHCCCHHHHHHHHHHCCCHHH Q ss_conf 3112321356541789999999988740-2--543433234435889987620141012211453478888771740488 Q 537021.9.peg.7 565 KAGLIKFDFLGLKTLTLLQKSLDFLAQR-G--VKVDLSLIPFDDHETYHLLTTKGTLGIFQLESSGMRQALEGMQPDCIE 641 (1033) Q Consensus 565 ~~gl~K~D~Lgl~~L~~i~~~~~~i~~~-~--~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l~~~~p~~~~ 641 (1033) ++||+|||||||+|||+|++++++|+++ | .++|+.+||+||++||+||++|+|.||||+||+|||++|++++|++|+ T Consensus 547 ~~GLlK~D~LGl~~Lt~i~~~~~li~~~~~~~~~~~~~~ip~~D~~t~~ll~~g~t~GvFQ~ES~g~~~~l~~~~P~~f~ 626 (1145) T PRK06658 547 ECGLVKMDFLGLKTLTLIKNTENLIRSVGGDYTTFSIKDIPDNDVKTFKMLGEGKSAAVFQFESEGMQQILKRAKPSKME 626 (1145) T ss_pred HHCCCCCCEECCCCHHHHHHHHHHHHHHCCCCCCCCHHHCCCCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHH T ss_conf 71895501016782789999999999847987777876588556999998617676673302576799999854999799 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCC Q ss_conf 88766541141134200368885046578502696673341033553001256898878863268413078897620121 Q 537021.9.peg.7 642 DIIALVSLYRPGPIDNIVVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEADVLRRAMGKKI 721 (1033) Q Consensus 642 dl~~~~al~RPGp~~~~~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ead~lRra~~kk~ 721 (1033) ||||++||||||||++++.|++||||+++++|+||.|++||++|||||||||||||||+.+||||++|||.||||||||+ T Consensus 627 Dlva~~aL~RPGp~~~i~~yi~rk~g~~~i~y~hp~l~~iL~~TyGv~vyQEQvM~ia~~~agfs~~eaD~lRram~kK~ 706 (1145) T PRK06658 627 DLIALNALYRPGPMQFIPQFIESKKDVKKIKYPDPCLEDILSETYGVIVYQEQVMQVAQRIAGFSLGKADILRRAMGKKK 706 (1145) T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCEEEEHHHHHHHHHHHCCCCHHHHHHHHHHHCCCC T ss_conf 98898764068804566999999739978788988898764445887987278664567760689778899998756485 Q ss_pred CCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHH Q ss_conf 10167787899999862499999999999999975000133101478889999999999730266677751002354202 Q 537021.9.peg.7 722 KEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVEK 801 (1033) Q Consensus 722 ~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~~ 801 (1033) .++|++++++|++||.+||++++.|++||++|++||+||||||||+|||++||||||||+|||+|||||+||+++++++| T Consensus 707 ~~~m~~~~~~F~~g~~~~g~~~~~a~~i~~~i~~Fa~YgFnksHa~aYa~iay~tAyLK~hyP~ef~aa~l~~~~~~~~k 786 (1145) T PRK06658 707 EDVMNEMKVDFLRGAIEKGFDKEIASEIFEILIPFAGYGFNKSHAAAYSVIAYQTAYLKANFPAEFMAANLTNEINSNDK 786 (1145) T ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCHH T ss_conf 66789999999820211698999999999999998440567430169599999999970567999999985058898657 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHHHH-HCCCCCCHHHHHHHCCCCCCCHH Q ss_conf 58999888863685035523342100353198698502685896989999999998-37999997899973482479989 Q 537021.9.peg.7 802 IKKFCQDARQFNIQIMPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIMEAS-ADKPFDSLEDFCSRVDSKKLNRR 880 (1033) Q Consensus 802 ~~~~i~e~~~~gi~vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r-~~g~f~sl~df~~rv~~~~~nk~ 880 (1033) ++.|+.|||++||+|+|||||+|.+.|++++++||+||++|||||++++++|+++| ++|||+|++||+.|++.+.+||+ T Consensus 787 ~~~~~~e~~~~gi~vlppdIn~S~~~f~~~~~~Ir~gl~~Ikgvg~~~~~~Iv~~R~~~G~f~s~~Df~~Rv~~~~~nk~ 866 (1145) T PRK06658 787 LPYYIAEAEKMGLNILKPDVNRSFPYFSVTDGGISYGLLGIKGLGEQVADLIIDEREKNGKYSSFEDFLRRVDDKVINKK 866 (1145) T ss_pred HHHHHHHHHHCCCCCCCCEEECCCCCCEECCCCCEEHHHHCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCHH T ss_conf 99999999985994638722216886366289542333445787789999999999981899988999987211037999 Q ss_pred HHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 99988867672347903899999899999999988654312777866665555655776668999988899985788726 Q 537021.9.peg.7 881 VLESLVFAGALDCFGYSRMQLLQSLDNIQKYAQWVEKNRTNKHENIFAHEKGTFSDKITLEKFSVENSSVRFENEQRVLG 960 (1033) Q Consensus 881 ~le~Li~aGAfD~~~~~R~~ll~~i~~~~~~~~~~~~~~~~~q~~LF~~~~~~~~~~~~~~~~~~~~~~e~l~~E~e~LG 960 (1033) ++|+||+|||||+||.||++|+++++.+++++....++...+|.+||+...........++.+++|+..|+|++|+|+|| T Consensus 867 ~le~Li~aGAfD~~g~~R~~ll~~~~~~~~~~~~~~~~~~~gq~~Lf~~~~~~~~~~~~~~~~~~~~~~e~l~~Eke~lG 946 (1145) T PRK06658 867 NLESMIKSGCFDSLGQNRKTLFENLDRAIEYVSEKKNNKSLGQNSLFDVLESQEQSSFNFQVFEEYSYSELLRFEKELMG 946 (1145) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHE T ss_conf 99999977972333035999998199999999997366545787776776655555667778899996669997898830 Q ss_pred EEECCCCHHHHHHHHHHCCCCCHHHHHCCCCCCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 2116385589999998748976889850558988999999961121015888769999998497049999539 Q 537021.9.peg.7 961 FYFSGHPLDVYKPILRKMGIKGYEESVATIRGNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 961 f~lS~HPl~~~~~~l~~~~~~~~~~~~~~~~g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iFP 1033 (1033) ||+|+|||+.|+..++..+..++.++.....++.|+++|+|+++|+++||+|++|+|+|+||++|.+|+++|| T Consensus 947 ~yls~HPl~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~G~i~~~~~~~tk~g~~maf~~leD~tg~~e~~~fp 1019 (1145) T PRK06658 947 FYISGHPLDEYRKVIDNFATLDVLHDLAAKSDSIYQFVGILNSVKVIQTKRGAKMAFGVFEDLKGAIDIVVFP 1019 (1145) T ss_pred ECCCCCHHHHHHHHHHHCCCCCHHHHHCCCCCCEEEEEEEEEEEEECCCCCCCEEEEEEEEECCCCEEEEECH T ss_conf 2775070999999998718836999840679988999999988674357999978999999689868999978 No 8 >PRK06637 consensus Probab=100.00 E-value=0 Score=2495.23 Aligned_cols=1003 Identities=41% Similarity=0.702 Sum_probs=943.6 Q ss_pred CCCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCC Q ss_conf 98813343314447644329989999999978998899811651774999999999769963899999985476643344 Q 537021.9.peg.7 8 SPSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTN 87 (1033) Q Consensus 8 ~~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~ 87 (1033) .|+||||||||+||||||+++|++||++|+++||+||||||||||||+++|+++|+++|||||+|||+++..+. T Consensus 2 ~~~FvhLhvhS~ySlLdG~~~i~eLv~~A~~~g~~AlAlTD~~nl~Gav~F~~aa~~~GIKPIiG~el~~~~~~------ 75 (1182) T PRK06637 2 QPEFIHLRTQSSYSFLESALTIEKVVELASSNKMPAICLADKGNLFGSLEFALYAVKKGLQPIHGVILNIKYDI------ 75 (1182) T ss_pred CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHCCCCCEEEEEEEEECCC------ T ss_conf 99710104756472105075999999999985999899984876747999999999869994024899876588------ Q ss_pred CCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHH Q ss_conf 56566667753999956968999999999899732487787400789998634898899819767967888877997899 Q 537021.9.peg.7 88 QGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVA 167 (1033) Q Consensus 88 ~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a 167 (1033) +..+|++|||||++||+|||+|+|.++..+.++..|++++++|.+ |++|||++|||..|+++++++.++.+.| T Consensus 76 ------~~~~~lvLLAkn~~GY~nL~kL~S~a~~~~~~~~~p~i~~~~L~~-~~eGLI~lsg~~~g~v~~~ll~~~~~~a 148 (1182) T PRK06637 76 ------DIFAQILLIAKDETGYKNLLKLSSLTFTKNDRKICDHINFEDLIE-YQEGLIALCCYTDGIVGKCLLARNEEQA 148 (1182) T ss_pred ------CCCCCEEEEECCHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH-HCCCCEEEECCCCCHHHHHHHCCCHHHH T ss_conf ------886438998079999999999999999845778888588999997-5799599836865678898856998999 Q ss_pred HHHHHHHHHHCCCCEEEEEECCCHHHH-HHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 999999987659868999943991158-9999999999981898998088300787889999876688469857653100 Q 537021.9.peg.7 168 EKRLLTFKKLFGDRLYVNLQRHRGYDR-YRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKDRP 246 (1033) Q Consensus 168 ~~~l~~l~~~Fgd~~ylEl~~~~~~e~-~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~~~ 246 (1033) ...+.+|+++||++||+|+|+|+.+++ .++..++++|+++++|+||||||||++++|+++|++|+||++|.+++++++. T Consensus 149 ~~~~~~l~~ifgd~~ylEl~~h~~~~~~~~~~~l~~lA~~~~iplVATndvhy~~~ed~~ahdvL~cI~~g~~~~~~~r~ 228 (1182) T PRK06637 149 MLFARKLQEILGDRFYFEIMRHDLPEEQFIEDSYIRIAAELAIPLVATNKVLFSEKSMHDAHDVLLCISAGVTKEYPDRK 228 (1182) T ss_pred HHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 99999999871886899985489857899999999999980999898476434786068999999999679984652235 Q ss_pred CCCCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCC Q ss_conf 24534323434788751220358998788641110001332210234555756887768988999876541034201246 Q 537021.9.peg.7 247 RVTPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRLAQG 326 (1033) Q Consensus 247 ~~~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~~~ 326 (1033) ..+.++||||++||.++|.++|+|++||.+||++|+|++...++.||+|++|+. ++.++|+++|++|+.+||... T Consensus 229 ~~~~~~ylks~~EM~~lF~d~PeAl~NT~~IA~rC~~~~~~~~~~LP~f~~~~~-----~~~~~l~~~~~~Gl~~Rl~~~ 303 (1182) T PRK06637 229 TVSENCYFKSPHEMIELFSDLPSAIQNTVNLRERCYFAAHANPPMLPNFATKDI-----SETDLIRKDAKEGLLARLATK 303 (1182) T ss_pred CCCHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHHEEECCCCCCCCCCCCCCCC-----CHHHHHHHHHHHHHHHHHHHC T ss_conf 678221049999999883666999999999997403313148886999999998-----878999999999999986530 Q ss_pred CCC-------CCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHH Q ss_conf 432-------4578389999988778989860630557888999998510476433444420466776554403756333 Q 537021.9.peg.7 327 AIA-------KGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLR 399 (1033) Q Consensus 327 ~~~-------~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~ 399 (1033) ... ....+++|.+||+|||+||++|||++|||||||+|+|||++||+||||||||||||||||||||+||||+ T Consensus 304 ~~~~~~~~~~~~~~~~~y~~RL~~EL~vI~~mgf~~YFLIV~D~I~~Ak~~gI~vGPGRGSaAGSlVaY~LgIT~VDPl~ 383 (1182) T PRK06637 304 FKSENISLENQEELKTEYFARLNYELDIICNMNFAGYFLIVSDFIKWSKKEGILVGPGRGSGAGSVVAWSLLITDLDPIK 383 (1182) T ss_pred CCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCCHHH T ss_conf 45334661000002189999999999999877996241799999999997697116887623777999983687418134 Q ss_pred CCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHC Q ss_conf 23105676210014578753336568789999999830380215542207787788999999975279988988654310 Q 537021.9.peg.7 400 FSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLI 479 (1033) Q Consensus 400 ~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i 479 (1033) |||||||||||+|+|||||||||+++||++||+||++|||+++||||+||+||++|+||||||||||+|+.++|++||+| T Consensus 384 y~LLFERFLNpeR~smPDIDiDF~~~rReeVI~YV~~kYG~d~VAqI~Tfgt~~aksaiRDvgRvlg~p~~~~d~iaKli 463 (1182) T PRK06637 384 FGLLFERFLNPERISMPDFDIDFCQERREEVINYVRSKYGNNRVGQIITFGKMQAKAVIKDVARVLSLPYKFADYLTELV 463 (1182) T ss_pred CCCCHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCC T ss_conf 17529986043567889998554500199999999998632657875321228989999999998499989999998448 Q ss_pred CCCCCCCCCH-HHHHCCHHHHHHHH---------CCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEECCCCC Q ss_conf 1458982443-44522988788643---------0742026889872011221002447440335256666620012367 Q 537021.9.peg.7 480 PNDPAHPVSL-QAIADDSRFREARL---------ADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVPMYRDV 549 (1033) Q Consensus 480 ~~~~~~~~~~-~~~~~~~~~~~~~~---------~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~ 549 (1033) |+......++ +++.+.+++++... ..+..+.++++|++|||+|||+|+||||||||++||++++|+++++ T Consensus 464 p~~~~~~~~l~~a~~~~pel~~~~~~~~l~~~~~~~~~~~~l~~~A~~LEGlpR~~s~HAaGvVIs~~pL~~~vPl~~~~ 543 (1182) T PRK06637 464 PFSAVNPVSLEQAMREVPELANAAKGNGLYNLEGEAELIKLVLDTSLILEGLHRHSSTHAAGIVIAGTDLVDIVPVYKDA 543 (1182) T ss_pred CCCCCCCCCHHHHHHHCHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCHHHCCCEEECC T ss_conf 87678976899998768778877640120135531089999999999854885443456652684276502107654668 Q ss_pred CCCCEEEEEECHHHHHHCCCHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCCCHHHH Q ss_conf 68866885102133431123213565417899999999887402543433234435889987620141012211453478 Q 537021.9.peg.7 550 RSDLPVTQFNMKWIEKAGLIKFDFLGLKTLTLLQKSLDFLAQRGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLESSGMR 629 (1033) Q Consensus 550 ~~~~~~~q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~es~~~~ 629 (1033) +++.++|||||++||++||||||||||+|||+|++++++|+++|.++|+.+||+||++||+||++|+|.||||+||+||+ T Consensus 544 ~~~~~vtQ~d~~~~E~~GLlK~DfLGL~tLtvI~~a~~li~~~g~~idl~~ipldD~~ty~ll~~g~T~GVFQlES~gm~ 623 (1182) T PRK06637 544 NSDMLVVGYSMKYSEIAGLIKFDFLGLQTLTVITDCKKLLKEQGIEVDFNNMTFDDNKTYQMLCKGKGVGVFQFESIGMK 623 (1182) T ss_pred CCCCCEEECCCHHHHHHCCCCEEECCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHCCCCCCCCHHCCHHHH T ss_conf 89854331355888871870031032685599999999999619845612167677999999960777992020366999 Q ss_pred HHHHHHCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHH Q ss_conf 88877174048888766541141134200368885046578502696673341033553001256898878863268413 Q 537021.9.peg.7 630 QALEGMQPDCIEDIIALVSLYRPGPIDNIVVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSE 709 (1033) Q Consensus 630 ~~l~~~~p~~~~dl~~~~al~RPGp~~~~~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~e 709 (1033) ++|+++||++|+||||++||||||||+++++|++||||+++++|+||.|++||++|||||||||||||||+.+||||+|| T Consensus 624 ~~L~~lkP~~feDlva~vAL~RPGPm~~i~~yi~rk~g~e~~~y~hp~L~~IL~~TyGVivYQEQVMqia~~~AGfslge 703 (1182) T PRK06637 624 DALRRLKPDSIHDLIALGALYRPGPMENIPTYIACKHKLQQPDYLHELLKPILEETYGVVIYQEQVQRIAQVLAGYTLGA 703 (1182) T ss_pred HHHHHHCCCCHHHHHHHHHHCCCCCCCCCCHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCEEHHHHHHHHHHHHCCCHHH T ss_conf 99997099869999888830167752322028998569989998779999985227854456798999999996799878 Q ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 07889762012110167787899999862499999999999999975000133101478889999999999730266677 Q 537021.9.peg.7 710 ADVLRRAMGKKIKEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLA 789 (1033) Q Consensus 710 ad~lRra~~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~a 789 (1033) ||.||||||||+.++|++.+++|++||++||++++.|++||++|++||+||||||||+|||++||||||||+|||+|||| T Consensus 704 AD~LRRAmgKK~~~~m~~~r~~Fi~Ga~~~G~~~~~A~~if~~i~~FA~YGFnKSHaaAYAllaYqtAyLKahYP~eF~a 783 (1182) T PRK06637 704 ADLLRRAMGKKIKKEMEEQEEIFVKGAIANNISESQAKSIFATVAKFAGYGFNKAHAAAYGVISYQTAYLKANYPAEFLV 783 (1182) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHHCCHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCHHHHHH T ss_conf 99999997468634699999999732754599999999999999987345889862277799999999999728999999 Q ss_pred HHHCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEC-------------------------------------C Q ss_conf 751002354202589998888636850355233421003531-------------------------------------9 Q 537021.9.peg.7 790 ASMTLEMDNVEKIKKFCQDARQFNIQIMPPSVNTPCVDFKVG-------------------------------------D 832 (1033) Q Consensus 790 a~L~~~~~~~~~~~~~i~e~~~~gi~vl~PdIN~S~~~f~~~-------------------------------------~ 832 (1033) |+||++++++||++.|+.|||+|||+|+|||||+|.+.|+++ + T Consensus 784 AlL~s~~~~~~k~~~~i~ear~~Gi~VlpPdIN~S~~~f~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 863 (1182) T PRK06637 784 ACLNLELNNHDKINLFLQEAKDNGIKIIAPNINISEGYFSVKFSDTVIPHSVKPVIPRLDRGIQEISKDTTVVKPRYDIA 863 (1182) T ss_pred HHHCCCCCCCHHHHHHHHHHHHCCCEECCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 98516878865799999999987895818840577775035125554432223334555422111122322234456888 Q ss_pred CEEEEEHHHHCCCCHHHHHHHHHHH-HCCCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 8698502685896989999999998-379999978999734824799899998886767234790389999989999999 Q 537021.9.peg.7 833 NRIYYSLAAIKGVGTTTARHIMEAS-ADKPFDSLEDFCSRVDSKKLNRRVLESLVFAGALDCFGYSRMQLLQSLDNIQKY 911 (1033) Q Consensus 833 ~~Ir~Gl~~Ikgvg~~~~~~Ive~r-~~g~f~sl~df~~rv~~~~~nk~~le~Li~aGAfD~~~~~R~~ll~~i~~~~~~ 911 (1033) ++|||||++|||||+.+++.|+++| ++|||+|+.||+.|++.+.+||+++|+||+|||||+||.||++++|+++.++.+ T Consensus 864 ~~Ir~GL~aIKgvg~~~~~~Iv~~R~~~Gpf~sl~Df~~Rv~~~~~nkr~lE~Li~aGAfD~~~~~R~~l~~~~~~~~~~ 943 (1182) T PRK06637 864 GAIIFALGAIKGVTPNFGKLVTDERKARGAFKSITDFIERLPPKSINSKLLENLIKSGCFDELHDNRLQLFLSIPKLLAY 943 (1182) T ss_pred CCEEEEHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 83886107544887999999999998287989999999743622399999999986899756670199999999999999 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH-HHHHHHCEEECCCCHHHHHHHHHHCCCCCHHHHHCC- Q ss_conf 998865431277786666555565577666899998889998-578872621163855899999987489768898505- Q 537021.9.peg.7 912 AQWVEKNRTNKHENIFAHEKGTFSDKITLEKFSVENSSVRFE-NEQRVLGFYFSGHPLDVYKPILRKMGIKGYEESVAT- 989 (1033) Q Consensus 912 ~~~~~~~~~~~q~~LF~~~~~~~~~~~~~~~~~~~~~~e~l~-~E~e~LGf~lS~HPl~~~~~~l~~~~~~~~~~~~~~- 989 (1033) +...++.+..+|.+||+.... .+..+...+|...+++. +|+|+||||+|+|||+.|++.++..++.+..++.+. T Consensus 944 ~~~~~~~~~~~q~~lf~~~~~----~~~~~~~~~~~~~~~l~~~E~e~lG~yls~HPl~~~~~~l~~~~~~~~~~l~~~~ 1019 (1182) T PRK06637 944 STSYHEEQESNQFSLIKVSSL----SPTILVSSDYADKNTLAFYEFEAMGLFISNHPLTEYQEIFSRLNILNTADLHNNL 1019 (1182) T ss_pred HHHHHHHHHCCCCCCCCCCCC----CCCCCCCCCCCCHHHHHHHHHHHEEEEECCCCHHHHHHHHHHCCCCCHHHHHHHC T ss_conf 999887554379887677777----8776777665647688887675637800639638899999983896389987546 Q ss_pred CCC-CEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 589-88999999961121015888769999998497049999539 Q 537021.9.peg.7 990 IRG-NNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 990 ~~g-~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iFP 1033 (1033) .+| ++|+|||+|+.+|++.||+|+ |+|+|+||+||.+|+++|| T Consensus 1020 ~~~~~~v~vaG~v~~~q~~~t~~G~-~~~~~leD~tG~~e~~vf~ 1063 (1182) T PRK06637 1020 PDGTNRVNLAGVIQKKDSRMSARGR-FVTLVLSDPENIFELSIFS 1063 (1182) T ss_pred CCCCEEEEEEEEEEEEEEECCCCCC-EEEEEEEECCCCEEEEEEC T ss_conf 6887079999999661443278886-8999999689868999958 No 9 >PRK07012 consensus Probab=100.00 E-value=0 Score=2494.72 Aligned_cols=1006 Identities=41% Similarity=0.707 Sum_probs=951.7 Q ss_pred CCCCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCC Q ss_conf 89881334331444764432998999999997899889981165177499999999976996389999998547664334 Q 537021.9.peg.7 7 KSPSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMT 86 (1033) Q Consensus 7 ~~~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~ 86 (1033) ..|.||||||||+||+|||+++|++||++|+++||+||||||||||||+++|+++|+++|||||||||+++...+.. T Consensus 2 ~~~~FvhLhvhS~ySlldg~~~i~eLv~~A~~~G~~AlAlTD~~nl~Gav~F~~aa~~~gIKPIiG~e~~v~~~~~~--- 78 (1173) T PRK07012 2 SDPRFVHLRVHSEFSIADGIVRLDDIVKAAAADGQGALALTDLGNAFGLVRFYKEARGAGVKPIAGCDVWITNPDDR--- 78 (1173) T ss_pred CCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCCEEEEEEEEECCCCC--- T ss_conf 69981121466737220507499999999998599999885188687899999999977998336779997678777--- Q ss_pred CCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHH Q ss_conf 45656666775399995696899999999989973248778740078999863489889981976796788887799789 Q 537021.9.peg.7 87 NQGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQV 166 (1033) Q Consensus 87 ~~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~ 166 (1033) ...+|++|||||++||+|||+|+|.+|+.++++..|+++.++|.+.+++|||++|||..|++++++..++.+. T Consensus 79 -------~~~~~l~LLAkn~~GY~nL~rL~S~a~~~~~~~~~~~~~~~~l~~~~~egLI~lsg~~~g~v~~~l~~~~~~~ 151 (1173) T PRK07012 79 -------DKPSRLLLLVKDKRGYLNLCELLSKAWLTNQYRGRAELDASWLEGELAEGLLALSGAQQGDIGLALAAGNEEA 151 (1173) T ss_pred -------CCCCCEEEECCCHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHH T ss_conf -------7764289978988999999999999998357789875799999624179979965886676999874799999 Q ss_pred HHHHHHHHHHHCCCCEEEEEECCCHH-HHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 99999999876598689999439911-58999999999998189899808830078788999987668846985765310 Q 537021.9.peg.7 167 AEKRLLTFKKLFGDRLYVNLQRHRGY-DRYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKDR 245 (1033) Q Consensus 167 a~~~l~~l~~~Fgd~~ylEl~~~~~~-e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~~ 245 (1033) |++.+.+|+++|+++||+|+|+|+.+ ++.++..++++|+++++|+||||||||++++|+.+|++|+||++|++++++++ T Consensus 152 a~~~~~~~~~iF~~~~ylEl~~~g~~~~~~~~~~l~~lA~~~~iplVATndvhy~~~~D~~ahdvl~~I~~g~~l~d~~r 231 (1173) T PRK07012 152 ARRHAQRWARVFPGGFYIELQRYGQPGAEAYIQQAVTLAAELKLPVVATHPVQFMTADDFTAHEARVCISEGDILANPRR 231 (1173) T ss_pred HHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 99999999997699789998538983689999999999998098989847765368756799999999977691155851 Q ss_pred CC-CCCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 02-45343234347887512203589987886411100013322102345557568877689889998765410342012 Q 537021.9.peg.7 246 PR-VTPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRLA 324 (1033) Q Consensus 246 ~~-~~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~ 324 (1033) .+ ++.++||||++||.++|+++|+|++||.+||++|++++++.+++||+||+|+|. ++++||+++|++||++||+ T Consensus 232 ~~~~~~e~ylks~~EM~~lf~d~peAl~NT~~IA~rC~~~l~~~~~~lP~f~~p~g~----t~~~~L~~l~~~Gl~~R~~ 307 (1173) T PRK07012 232 QKRFTTDQYFRTQDEMAALFADLPSALANTVEIAKRCNLTLELGKPKLPLFPTPDGM----SLDDYLVQLSKEGLEKRLA 307 (1173) T ss_pred CCCCCHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCCCCCCCCC----CHHHHHHHHHHHHHHHHHH T ss_conf 033787543799999999876239999999999986110173087668888899999----9899999999999999865 Q ss_pred CCCC---CCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCC Q ss_conf 4643---2457838999998877898986063055788899999851047643344442046677655440375633323 Q 537021.9.peg.7 325 QGAI---AKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFS 401 (1033) Q Consensus 325 ~~~~---~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~ 401 (1033) .... .+....++|++||+|||+||++|||++|||||||+|+|||++||+||||||||||||||||||||+||||+|| T Consensus 308 ~~~p~~~~~~~~~~~y~~RL~~EL~vI~~mGf~~YFLIV~D~i~~Ak~~gI~vGPGRGSaAGSLVaY~LgIT~vDPi~y~ 387 (1173) T PRK07012 308 QLYPDEAEREAQRDTYYERLEFECGTIIKMGFPGYFLIVADFINWAKNNGVPVGPGRGSGAGSLVAYALGITDLDPLRYN 387 (1173) T ss_pred HCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEHHHHHHHHHHHHCCCEECCCCCCHHHHHHHHHHCCCCCCCCCCC T ss_conf 40896033332117899999999999987699614079999999999769722688770376799998447443842258 Q ss_pred CHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCC Q ss_conf 10567621001457875333656878999999983038021554220778778899999997527998898865431014 Q 537021.9.peg.7 402 LLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPN 481 (1033) Q Consensus 402 LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~ 481 (1033) |||||||||+|++||||||||+++||++||+||++|||+++||||+||+||++|+||||||||||+|+.++|++||+||. T Consensus 388 LLFERFLNpeR~smPDIDiDF~~~rR~eVi~Yv~~kYG~d~VAqI~Tf~T~~ak~AiRDvgRvlg~~~~~~d~iaK~ip~ 467 (1173) T PRK07012 388 LLFERFLNPERVSMPDFDIDFCQHGRDRVIQYVKEKYGADAVSQIATFGTMAAKAAVRDIGRVLDLGYMFTDGVAKLIPF 467 (1173) T ss_pred CCHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCC T ss_conf 63788616677889999865775452999999999863021114666888999999999999858998888778621887 Q ss_pred CCCCCCCH-HHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEECCCCCCCCCEEEEEEC Q ss_conf 58982443-44522988788643074202688987201122100244744033525666662001236768866885102 Q 537021.9.peg.7 482 DPAHPVSL-QAIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLPVTQFNM 560 (1033) Q Consensus 482 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~ 560 (1033) .++..+++ +++.+.+++++....++..++++++|++|||+|||+|+||||||||++||+++||++..++++.++||||| T Consensus 468 ~~~~~~tl~~a~~~~~~l~~~~~~~~~~~~l~~~A~~LeGl~R~~g~HAaGvVIs~~~L~~~vPl~~~~~~~~~vtQ~d~ 547 (1173) T PRK07012 468 KPGKHVTIADAMKEEPLLQERYDNEDEVHQLLDLAQRVEGLTRNVGMHAGGVLIAPGKLTDFCPLYTQGDEGGVVSQYDK 547 (1173) T ss_pred CCCCCCCHHHHHHHCHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCCCCCEEECCCCHHHCCCEEECCCCCCEEEEECH T ss_conf 78886778999774744565531689999999999998278531003667258634431000253771689963672031 Q ss_pred HHHHHHCCCHHHHHHHCCHHHHHHHHHHHHH---HCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCCCHHHHHHHHHHCC Q ss_conf 1334311232135654178999999998874---0254343323443588998762014101221145347888877174 Q 537021.9.peg.7 561 KWIEKAGLIKFDFLGLKTLTLLQKSLDFLAQ---RGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLESSGMRQALEGMQP 637 (1033) Q Consensus 561 ~~~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~---~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l~~~~p 637 (1033) ++||++||+|||||||||||+|++++++|++ ++.++|+.+||+||++||+||++|+|.||||+||+|||++|++++| T Consensus 548 ~~vE~~GLlK~DfLGLr~LtvI~~a~~~i~~~~~~~~~~dl~~ipldD~~ty~ll~~g~T~GvFQ~ES~gmr~~l~~l~P 627 (1173) T PRK07012 548 DDVEAVGLVKFDFLGLTTLTILDWAERYIRRLDPSKADWSLAQVPLDDPASFQILKKANTVAVFQLESRGMQGMLKDAQP 627 (1173) T ss_pred HHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHCCCCCCCCHHHCCCCCHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCC T ss_conf 25987499541211478348999999999983687878766637976599998746778898452458189999986599 Q ss_pred CHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 04888876654114113420036888504657850269667334103355300125689887886326841307889762 Q 537021.9.peg.7 638 DCIEDIIALVSLYRPGPIDNIVVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEADVLRRAM 717 (1033) Q Consensus 638 ~~~~dl~~~~al~RPGp~~~~~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ead~lRra~ 717 (1033) ++|+||||++||||||||++++.|++||||+++++|+||.|++||++|||||||||||||||+.+||||++|||.||||| T Consensus 628 ~~f~dlva~~AL~RPGP~~~i~~yi~rk~g~e~v~y~hp~le~iL~~TyGvivyQEQvm~ia~~~agfs~g~aD~lRram 707 (1173) T PRK07012 628 DRFEDIIALVALYRPGPMDLIPSFCARKHGREKVDYPDPRVEPVLKETYGIMVYQEQVMQMAQIIGGYSLGGADLLRRAM 707 (1173) T ss_pred CCHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCEEEEHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 98999999998517880666799999865998888989899987764368787679999999997099978899999987 Q ss_pred CCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 01211016778789999986249999999999999997500013310147888999999999973026667775100235 Q 537021.9.peg.7 718 GKKIKEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMD 797 (1033) Q Consensus 718 ~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~ 797 (1033) |||++++|++.+++|++||.+||+++++|++||++|++||+||||||||+|||++||||||||+|||+|||||+||++++ T Consensus 708 gkK~~~~~~~~~~~f~~ga~~~G~~~~~a~~i~~~i~~Fa~YgFnksHaaaYa~~ay~tAyLK~hyP~eF~aAlLs~~~~ 787 (1173) T PRK07012 708 GKKKPEEMAKHREIFAEGAAKNGLTREKADEIFDLMEKFAGYGFNKSHAAAYALLAYYTAWLKAHHPAEFMAANMSLAMD 787 (1173) T ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCC T ss_conf 02883679999999984113259899999999999999854255805789999999999999873899999998376879 Q ss_pred CCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCC-------EEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHH Q ss_conf 420258999888863685035523342100353198-------6985026858969899999999983799999789997 Q 537021.9.peg.7 798 NVEKIKKFCQDARQFNIQIMPPSVNTPCVDFKVGDN-------RIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCS 870 (1033) Q Consensus 798 ~~~~~~~~i~e~~~~gi~vl~PdIN~S~~~f~~~~~-------~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~ 870 (1033) +++|+..|+.|||+|||+|+|||||+|.++|++.++ +||+||++|||||++++++|+++|++|||+|++|||. T Consensus 788 ~~~k~~~~i~ear~~gi~VlpPdIN~S~~~f~~~~~~~~~~~~~Ir~GL~aIkgvg~~~~~~Iv~~R~~g~f~sl~Df~~ 867 (1173) T PRK07012 788 DTDKVKILFDDCLVNGLAVLPPDINASHYRFEPVAEADGKRSRTIRYGLGAVKGSGQNAIEEILRAREEKPFADLFDFCE 867 (1173) T ss_pred CCHHHHHHHHHHHHCCCEECCCCCCCCCCEEEECCCCCCCCCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHH T ss_conf 97589999999998789164884225577025525766676763898435415898789999999984599999899999 Q ss_pred HCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 34824799899998886767234790389999989999999998865431277786666555565577666899998889 Q 537021.9.peg.7 871 RVDSKKLNRRVLESLVFAGALDCFGYSRMQLLQSLDNIQKYAQWVEKNRTNKHENIFAHEKGTFSDKITLEKFSVENSSV 950 (1033) Q Consensus 871 rv~~~~~nk~~le~Li~aGAfD~~~~~R~~ll~~i~~~~~~~~~~~~~~~~~q~~LF~~~~~~~~~~~~~~~~~~~~~~e 950 (1033) |++.+.+||+++|+||+|||||+||.||++|+++++.+++.+.+ +.+..+|.+||+..... ...+.++..++|+..| T Consensus 868 Rv~~~~~nk~~le~Li~aGAfD~~~~~R~~ll~~l~~a~~~~~~--~~~~~~q~~lf~~~~~~-~~~~~~~~~~~~~~~e 944 (1173) T PRK07012 868 RVDRRVVNRRTIEALIRAGAFDSLNPNRAQLLASVPLAMEAAEQ--AAANANQAGLFDMGAGP-AHAHELVDEPAWDDKR 944 (1173) T ss_pred HHHHCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHH--HHHHCCCCCCCCCCCCC-CCCCCCCCCCCCCHHH T ss_conf 85305799999999997699624111299999975999999998--87526888877777665-4566667899988555 Q ss_pred HHHHHHHHHCEEECCCCHHHHHHHHHHCCCCCHHHHHCCCCCCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEE Q ss_conf 99857887262116385589999998748976889850558988999999961121015888769999998497049999 Q 537021.9.peg.7 951 RFENEQRVLGFYFSGHPLDVYKPILRKMGIKGYEESVATIRGNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEAN 1030 (1033) Q Consensus 951 ~l~~E~e~LGf~lS~HPl~~~~~~l~~~~~~~~~~~~~~~~g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~ 1030 (1033) ++.+|+|+||||+|+||++.|+..++.....++.+ ..+++.++|+|+|+++|+++||+|+ |+|+|+||++|.+|++ T Consensus 945 ~l~~E~e~lG~yls~HPl~~~~~~l~~~~~~~l~~---l~~~~~~~vaG~v~~~~~~~tk~G~-maf~tleD~tg~~ev~ 1020 (1173) T PRK07012 945 RLQEEKGALGFYLSGHLFDAYRDEVRRFVRQKLGE---LKEGRDKVVAGVIASLRTQMTQRGK-MVIALLDDGTGQCEIT 1020 (1173) T ss_pred HHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCHHH---CCCCCEEEEEEEEEEEEECCCCCCC-EEEEEEEECCCCEEEE T ss_conf 99988998642505170899999998749032865---7789816999999886878888777-8999999799878999 Q ss_pred ECC Q ss_conf 539 Q 537021.9.peg.7 1031 FFP 1033 (1033) Q Consensus 1031 iFP 1033 (1033) ||| T Consensus 1021 vfp 1023 (1173) T PRK07012 1021 VFN 1023 (1173) T ss_pred ECH T ss_conf 978 No 10 >PRK08957 consensus Probab=100.00 E-value=0 Score=2483.16 Aligned_cols=1009 Identities=44% Similarity=0.733 Sum_probs=956.6 Q ss_pred CCCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCC Q ss_conf 98813343314447644329989999999978998899811651774999999999769963899999985476643344 Q 537021.9.peg.7 8 SPSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTN 87 (1033) Q Consensus 8 ~~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~ 87 (1033) .|.||||||||+||||+|+++|++||++|+++||+||||||||||||+++|+++|+++|||||+|||+++..+.. T Consensus 3 ~~~FVHLhvhS~YSLLdGa~~i~eLV~~Ake~G~~AlALTD~~nlyGaveFy~ackk~GIKPIIG~El~v~~~~~----- 77 (1159) T PRK08957 3 EPRFVHLRVHSDFSMIDGLAKVKPLVKKAAALGMPALALTDFTNLCGLVKFYGAAHGAGIKPIIGADFWVQSPLL----- 77 (1159) T ss_pred CCCCHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCC----- T ss_conf 998312226773722030759999999999859998999727736259999999998699879777999736877----- Q ss_pred CCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHH Q ss_conf 56566667753999956968999999999899732487787400789998634898899819767967888877997899 Q 537021.9.peg.7 88 QGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVA 167 (1033) Q Consensus 88 ~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a 167 (1033) ....+|++|||||++||+|||+|+|.++.++.++..|++++++|.+ |++|||++|||..|+++++++.++.+++ T Consensus 78 -----~~~~~~LvLLAkN~~GY~NL~kLiS~a~~~g~~~~~p~id~e~L~~-~~eGLI~Lsg~~~G~i~~~Ll~g~~~~a 151 (1159) T PRK08957 78 -----GDELTHLTVLAANNEGYQNLTLLISKAYQRGYGAAGPVIDRDWLIE-LNEGLILLSGGRMGDVGKALLKGNQAQV 151 (1159) T ss_pred -----CCCCCCEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHH-CCCCEEEEECCCCCHHHHHHHCCCHHHH T ss_conf -----7873428999589999999999999998624568887478999987-2799499838865568999865999999 Q ss_pred HHHHHHHHHHCCCCEEEEEECCCHHH-HHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 99999998765986899994399115-89999999999981898998088300787889999876688469857653100 Q 537021.9.peg.7 168 EKRLLTFKKLFGDRLYVNLQRHRGYD-RYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKDRP 246 (1033) Q Consensus 168 ~~~l~~l~~~Fgd~~ylEl~~~~~~e-~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~~~ 246 (1033) ++.+.+|+++|+++||+|+|+|+..+ +.+++.++++|+++++|+||||||||++++|+.+|++|.||++|++++++++. T Consensus 152 ~~~l~~~~~iF~d~fyLElq~~g~~~e~~~~~~li~LA~~~giPlVATndv~Yl~~eD~~ahdvL~~I~~g~tL~d~~r~ 231 (1159) T PRK08957 152 ESCVAFYQEHFPDRYYLELIRTGRPDEERYLHAAVELAEERGLPVVATNDVVFLKPEDFDAHEIRVAIHDGFTLDDPKRP 231 (1159) T ss_pred HHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCEEEECCCEECCHHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 99999999867986899983579837999999999999983999899688411685167899999999769921658630 Q ss_pred CC-CCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 24-53432343478875122035899878864111000133221023455575688776898899987654103420124 Q 537021.9.peg.7 247 RV-TPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRLAQ 325 (1033) Q Consensus 247 ~~-~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~~ 325 (1033) +. ++++||||++||.++|.++|+|++||.+||++|++.+++.+++||+||+++ .++++||+++|++||++||.. T Consensus 232 ~~~s~e~ylKS~eEM~~lF~d~peAl~NT~~IAerCnv~~~~~~~~LP~fp~~~-----~~~~~~L~~l~~~Gl~~R~~~ 306 (1159) T PRK08957 232 KNYSPQQYLRSEEEMCELFADIPEALENTVEIAKRCNVTVRLGEYFLPQFPTGD-----MSTEDFLVKKSKEGLEERLEF 306 (1159) T ss_pred CCCCHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCCCCCCCCCCCCC-----CCHHHHHHHHHHHHHHHHHHH T ss_conf 128975438999999999755489898899999845531401887699999899-----998999999999999987533 Q ss_pred CCC---CCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCC Q ss_conf 643---24578389999988778989860630557888999998510476433444420466776554403756333231 Q 537021.9.peg.7 326 GAI---AKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSL 402 (1033) Q Consensus 326 ~~~---~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~L 402 (1033) ... .+....++|++||+|||+||++|||++|||||||+|+|||++||+||||||||||||||||||||+||||+||| T Consensus 307 ~~~~~~~~~~~~~~y~~RL~~EL~vI~~mgf~~YFLIV~D~i~~Ak~~gI~vGpGRGSaAGSLVaY~LgIT~VDPi~y~L 386 (1159) T PRK08957 307 LFPDPEVRAEKRPEYDERLDIELQVINQMGFPGYFLIVMEFIQWSKDNGIPVGPGRGSGAGSLVAYALKITDLDPLEFDL 386 (1159) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCCHHHCCC T ss_conf 28961332221278999999999999876993538999999999995587637887736989999995794659021388 Q ss_pred HHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCC Q ss_conf 05676210014578753336568789999999830380215542207787788999999975279988988654310145 Q 537021.9.peg.7 403 LFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPND 482 (1033) Q Consensus 403 lFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~ 482 (1033) ||||||||+|++||||||||+++||++||+||++|||.++||||+||+||++|+||||||||||+|+.++|+++|+||.. T Consensus 387 lFERFLNpeR~smPDIDiDf~~~rR~~Vi~yv~~kYG~~~VA~I~Tf~t~~ak~aiRDv~rvlg~~~~~~d~i~k~ip~~ 466 (1159) T PRK08957 387 LFERFLNPERVSMPDFDVDFCMDKRDQVIDHVAEMYGRDAVSQIITFGTMAAKAVIRDVGRVLGHPYGFVDRISKLIPPD 466 (1159) T ss_pred CHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCC T ss_conf 58884488778888776556401289999999988574545521035646888999999998489888999886328998 Q ss_pred CCCCCCHHHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEECCCCCCCCCEEEEEECHH Q ss_conf 89824434452298878864307420268898720112210024474403352566666200123676886688510213 Q 537021.9.peg.7 483 PAHPVSLQAIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLPVTQFNMKW 562 (1033) Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~~~ 562 (1033) ++.++. ++++.++++++.+..++.+++++++|++|||+|||+|+||||||||++||++++|++.+++++.++|||||++ T Consensus 467 ~~~tl~-~~~~~~~~~~~~~~~~~~~~~~~~~a~~leG~~r~~~~HaaGvvis~~~l~~~~Pl~~~~~~~~~~tq~d~~~ 545 (1159) T PRK08957 467 PGMTLA-KAFEAEPQLPEIYEADEEVKALIDMARKLEGVTRNAGKHAGGVVIAPTKITDFAPLYCDAEGKHPVTQFDKND 545 (1159) T ss_pred CCCCHH-HHHHHCHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEEECCCCCCCCCEEECCCCCCEEEEEECCH T ss_conf 998889-9987586679887538788999999998619610035687637975798655013368788995368731303 Q ss_pred HHHHCCCHHHHHHHCCHHHHHHHHHHHHHH-CC----CCCCCCCCCCHHHHHHHHHCCCCCHHHHCCCHHHHHHHHHHCC Q ss_conf 343112321356541789999999988740-25----4343323443588998762014101221145347888877174 Q 537021.9.peg.7 563 IEKAGLIKFDFLGLKTLTLLQKSLDFLAQR-GV----KVDLSLIPFDDHETYHLLTTKGTLGIFQLESSGMRQALEGMQP 637 (1033) Q Consensus 563 ~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~~-~~----~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l~~~~p 637 (1033) ||++||+|||||||+|||+|++++++|+++ +. ++|+++||+||++||+||++|+|.||||+||+|||++|++++| T Consensus 546 ~e~~GllK~D~LGl~~Lt~i~~~~~~i~~~~~~~~~~~~d~~~ipldD~~t~~ll~~g~t~GvFQ~ES~g~~~~l~~~~P 625 (1159) T PRK08957 546 VEYAGLVKFDFLGLRTLTIIDWALEMINKRRAKNGKPPVDIAAIPLDDKKSFDLLQRSETTAVFQLESRGMKDLIKRLQP 625 (1159) T ss_pred HHHCCCEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCCCCHHHCCCCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCC T ss_conf 87759801321036638999999999998623257886687767964699999996766378573377889999985399 Q ss_pred CHHHHHHHHHHHCCCHHHHHH--HHHHHHHCCCCCCCCC-----CHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHH Q ss_conf 048888766541141134200--3688850465785026-----966733410335530012568988788632684130 Q 537021.9.peg.7 638 DCIEDIIALVSLYRPGPIDNI--VVYNNRKNGKEKIVSI-----HPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEA 710 (1033) Q Consensus 638 ~~~~dl~~~~al~RPGp~~~~--~~yi~rk~g~e~~~~~-----hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ea 710 (1033) ++|+||||++||||||||+++ +.||+||||+++++|+ ||.|++||++||||||||||||+||+.+||||++|| T Consensus 626 ~~f~dl~a~~aL~RPGP~~~g~i~~yi~rk~g~~~v~y~~~~~~h~~l~~iL~~TyGvivyQEQvM~ia~~~agfsl~ea 705 (1159) T PRK08957 626 DCFEDMIALVALFRPGPLQSGMVDNFIDRKHGREEISYPDAQWQHESLKPVLEPTYGIILYQEQVMQIAQVLSGYTLGGA 705 (1159) T ss_pred CCHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCCCCCCCCHHCCHHHHHHHCCCCCEEEEHHHHHHHHHHHCCCCHHHH T ss_conf 98999999997417875434551679987529878778870002489999753538868865999999998706998889 Q ss_pred HHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 78897620121101677878999998624999999999999999750001331014788899999999997302666777 Q 537021.9.peg.7 711 DVLRRAMGKKIKEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAA 790 (1033) Q Consensus 711 d~lRra~~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa 790 (1033) |.||||||||+.++|++.+++|++||.+||+++++|++||++|++||+||||||||+|||++||||||||+|||+||||| T Consensus 706 D~lRram~KK~~~~m~~~~~~F~~ga~~~g~~~~~a~~i~~~i~~Fa~YgFnkSHa~aYa~iayqtAyLK~hyP~ef~aa 785 (1159) T PRK08957 706 DMLRRAMGKKKPEEMAKQRSVFEEGAIKNGIDGELAMKIFDLVEKFAGYGFNKSHSAAYALVSYQTLWLKTHYPAEFMAA 785 (1159) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCHHHHHHH T ss_conf 99999875067788999999998401225999999999999999984317674168999999999999997184999999 Q ss_pred HHCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECC-CEEEEEHHHHCCCCHHHHHHHHHHH-HCCCCCCHHHH Q ss_conf 510023542025899988886368503552334210035319-8698502685896989999999998-37999997899 Q 537021.9.peg.7 791 SMTLEMDNVEKIKKFCQDARQFNIQIMPPSVNTPCVDFKVGD-NRIYYSLAAIKGVGTTTARHIMEAS-ADKPFDSLEDF 868 (1033) Q Consensus 791 ~L~~~~~~~~~~~~~i~e~~~~gi~vl~PdIN~S~~~f~~~~-~~Ir~Gl~~Ikgvg~~~~~~Ive~r-~~g~f~sl~df 868 (1033) +||+++++++++..|+.||+++||+|+|||||+|.+.|++++ +.||+||++|||||+++++.|+++| ++|||+|++|| T Consensus 786 ~L~~~~~~~~k~~~~~~e~~~~gi~vlpPdIN~S~~~f~~~~~~~I~~gL~~Ikgvg~~~~~~Iv~~R~~~g~f~s~~Df 865 (1159) T PRK08957 786 VMTADMDNTDKIVGLVDECWRMGLKILPPDINSGLYHFHVNDDGEIVYGIGAIKGVGEGPIEAILEARNKGGYFKDLFDL 865 (1159) T ss_pred HHHHCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEECCCCCEEECHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHH T ss_conf 75315798267999999999859952488506778864676899652472764578788999999999967998988999 Q ss_pred HHHCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC-CCCCCCCCCCCCCC Q ss_conf 97348247998999988867672347903899999899999999988654312777866665555-65577666899998 Q 537021.9.peg.7 869 CSRVDSKKLNRRVLESLVFAGALDCFGYSRMQLLQSLDNIQKYAQWVEKNRTNKHENIFAHEKGT-FSDKITLEKFSVEN 947 (1033) Q Consensus 869 ~~rv~~~~~nk~~le~Li~aGAfD~~~~~R~~ll~~i~~~~~~~~~~~~~~~~~q~~LF~~~~~~-~~~~~~~~~~~~~~ 947 (1033) +.|++.+.+||+++|+||+|||||+||.||++|+++++.++..+....+.+..||.+||+..... ......++..++|+ T Consensus 866 ~~Rv~~~~~nk~~le~Li~aGAfD~~~~~R~~l~~~~~~~l~~~~~~~~~~~~gq~~lf~~~~~~~~~~~~~~~~~~~~~ 945 (1159) T PRK08957 866 CARVDLKKLNRRVLEKLIMSGAFDRLGPHRAALMASLGDALKAADQHAKAEAIGQADMFGVLAEEPEQVEQSYANCPPWP 945 (1159) T ss_pred HHHHCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 98725533699999999975885655542999998799999999987577755896655555567432334456899988 Q ss_pred HHHHHHHHHHHHCEEECCCCHHHHHHHHHHCCCCCHHHHHCCCCCCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEE Q ss_conf 88999857887262116385589999998748976889850558988999999961121015888769999998497049 Q 537021.9.peg.7 948 SSVRFENEQRVLGFYFSGHPLDVYKPILRKMGIKGYEESVATIRGNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEY 1027 (1033) Q Consensus 948 ~~e~l~~E~e~LGf~lS~HPl~~~~~~l~~~~~~~~~~~~~~~~g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~ 1027 (1033) ..|+|++|+|+||||+|+|||+.|++.++.....++.++.....|+.|+++|+|+++|+++||||++|+|+|+||++|.+ T Consensus 946 ~~e~l~~Eke~lG~y~s~HPl~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~G~i~~~~~~~tkkg~~maf~~leD~tg~~ 1025 (1159) T PRK08957 946 EKVVLDGERETLGLYLTGHPINQYLKELERYTSGRLKDMHPTERGKVTTAAGLVVAARVMVTKRGNRIGICTLDDRSGRL 1025 (1159) T ss_pred HHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCHHHHHCCCCCCEEEEEEEEEEEEECCCCCCCEEEEEEEEECCCCE T ss_conf 66799988998632742160899999998738853898621459988999999999775578998988999999899868 Q ss_pred EEEECC Q ss_conf 999539 Q 537021.9.peg.7 1028 EANFFP 1033 (1033) Q Consensus 1028 e~~iFP 1033 (1033) |++||| T Consensus 1026 e~vvfp 1031 (1159) T PRK08957 1026 EVMLFT 1031 (1159) T ss_pred EEEECH T ss_conf 999978 No 11 >PRK06326 consensus Probab=100.00 E-value=0 Score=2473.26 Aligned_cols=1016 Identities=37% Similarity=0.625 Sum_probs=947.7 Q ss_pred CCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCC Q ss_conf 81334331444764432998999999997899889981165177499999999976996389999998547664334456 Q 537021.9.peg.7 10 SFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQG 89 (1033) Q Consensus 10 ~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~ 89 (1033) +||||||||+||||+|+++|++||++|+++||+||||||||||||+++|+++|+++|||||||||+|+.++...+... T Consensus 2 sFvhLhvhS~YSlLdGa~~i~eLv~~A~elG~~AlAiTDh~nL~Gav~Fy~aak~~gIKPIIG~El~v~~~~~~dk~~-- 79 (1240) T PRK06326 2 TWIPLHCHSQYSILDATSSIKDFVAKAVEYQIPALALTDHGNLFGAVDFYKECKQNAIKPIIGCELYVAPSSRFDKKK-- 79 (1240) T ss_pred CCCCCCCCCCCCHHCCCCCHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCC-- T ss_conf 430635577581206054999999999985999999861890879999999999769990345489965875223444-- Q ss_pred CCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHHHH Q ss_conf 56666775399995696899999999989973248778740078999863489889981976796788887799789999 Q 537021.9.peg.7 90 KTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVAEK 169 (1033) Q Consensus 90 ~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a~~ 169 (1033) .......+|++|||||++||+|||+|+|.++.+ ++++.|++++++|.+ |++||||+|||..|++++.++.+. +.+++ T Consensus 80 ~~~~~~~~hlvLLAkN~~GY~NL~kL~S~a~~e-g~y~~Prid~~~L~~-~seGLI~lsg~l~g~v~~~il~g~-e~~~~ 156 (1240) T PRK06326 80 EKKSRVANHLILLCKNEQGYRNLCLLSSLAYTE-GFYYFPRIDRDLLSQ-HSEGLICLSACLSGSVAQAALESE-EALEK 156 (1240) T ss_pred CCCCCCCCCEEEECCCHHHHHHHHHHHHHHHHC-CCCCCCCCCHHHHHH-HCCCCEEECCCCCCHHHHHHHCCH-HHHHH T ss_conf 556677773899849989999999999999874-786788689999986-069949943875445889885593-99999 Q ss_pred HHHHHHHHCCCCEEEEEECCCHHHH----------------------HHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHH Q ss_conf 9999987659868999943991158----------------------999999999998189899808830078788999 Q 537021.9.peg.7 170 RLLTFKKLFGDRLYVNLQRHRGYDR----------------------YRESQVVQLAYTHELPLVATNNSLFLSEEDYEA 227 (1033) Q Consensus 170 ~l~~l~~~Fgd~~ylEl~~~~~~e~----------------------~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~ 227 (1033) .+.||+++||++||+|+|+|+..++ ..+..++++|+++++|+||||||||++++|+.+ T Consensus 157 ~l~~~~~iFgd~fylEiq~h~~~~e~~~~~~~~~l~~~~~~~i~~q~~vn~~l~~lA~~~giplVATnDvhY~~~eD~~~ 236 (1240) T PRK06326 157 DLLWYQDLFKDDFFSEVQLHKMSEEKIALFEEEWLKQEYYQFIEKQIKVNEAVLATSKRLGIPSVATNDIHYINPDDWLA 236 (1240) T ss_pred HHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHH T ss_conf 99999986487568852106883665422245544333321026778999999999998198989958860078625899 Q ss_pred HHHHHHHHCCCCCC---------CCCCCCC-CCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 98766884698576---------5310024-5343234347887512203589987886411100013322102345557 Q 537021.9.peg.7 228 HDVLMAVAHSTVVS---------QKDRPRV-TPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRFIE 297 (1033) Q Consensus 228 h~iL~aI~~~~~l~---------~~~~~~~-~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~ 297 (1033) |++|+||++|++++ ++++.++ +.++||||++||.++|.++|+|++||.+||++|++++++..+++|.|. T Consensus 237 hdvL~cI~~g~tl~~~~~~~~l~d~~r~~~~~~e~ylKs~eEM~~lF~d~PeAl~NT~~IA~rC~~~l~~~~~~~P~~~- 315 (1240) T PRK06326 237 HEILLNVQSGEPIRIAKQNTYIPNPKRKTYPSREYYFKSPQEMAELFADHPETISNTLEVAERCNLELDFSTKHYPIYV- 315 (1240) T ss_pred HHHHHHHHCCCCCHHHHHCCCCCCHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCEECCCCCCCCCCCC- T ss_conf 9999998769960024433656880532477721137999999999764599999999999962045023655488879- Q ss_pred CCCCC---------CCCCHHHHHHHHHHHHHHHCCCCCCC-------CCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 56887---------76898899987654103420124643-------245783899999887789898606305578889 Q 537021.9.peg.7 298 KECDD---------IQKEEENELRNKAVAGLEVRLAQGAI-------AKGYHAQDYRQRLDFELNVIARMKFSGYFLIVA 361 (1033) Q Consensus 298 p~~~~---------~~~~~~~~Lr~l~~~Gl~~R~~~~~~-------~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~ 361 (1033) |++.. ...++++||+++|++|+++||..... +.....+.|++||+|||+||++|||++|||||| T Consensus 316 P~~~~~~~~~~~~~~~~~~~~~L~~l~~~Gl~~Ry~~e~~~~~~~~~p~~~p~e~v~~RL~~EL~vI~~mgf~~YFLIV~ 395 (1240) T PRK06326 316 PESLKKKGSYTEEERYKASSAFLEELCEEGLSKKYTPELLAHIAKKFPDRDPLDLVKERLDMEMSIIIPKGMCDYLLIVW 395 (1240) T ss_pred CCCCCCCCCCCHHHHCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 98777666654122001489999999999999871512344566535786657999999999999998779808599999 Q ss_pred HHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 99998510476433444420466776554403756333231056762100145787533365687899999998303802 Q 537021.9.peg.7 362 DFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHE 441 (1033) Q Consensus 362 Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~ 441 (1033) |+|+|||++||+||||||||||||||||||||+||||+|||||||||||+|++||||||||+++||++||+||++|||++ T Consensus 396 D~V~~Ar~~gI~vGpGRGSAAGSLVaY~LgIT~VDPi~y~LLFERFLNpeR~smPDIDiDF~~~rR~eVI~Yv~~kYG~d 475 (1240) T PRK06326 396 DIINWAKDNGIPVGPGRGSGAGSVMLFLLGITEIEPIRFDLFFERFINPERLSYPDIDIDICMAGRERVINYAIERHGKD 475 (1240) T ss_pred HHHHHHHHCCCEECCCCCCHHHHHHHHHHCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCC T ss_conf 99999997794436888731878999995474418431587588744766789998507388324799999999985877 Q ss_pred CEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCHHHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHH Q ss_conf 15542207787788999999975279988988654310145898244344522988788643074202688987201122 Q 537021.9.peg.7 442 RVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDPAHPVSLQAIADDSRFREARLADPSVDRLLEISQKLEGL 521 (1033) Q Consensus 442 ~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~ 521 (1033) +||||+||+||++|+||||||||||+|+.++|++||+||. .+.++. ++++.+++++++..+++..++++++|.+|||+ T Consensus 476 ~VAqI~Tf~T~~ak~AiRDvgRvlg~~~~~~d~iaK~ip~-~~~tl~-~a~~~~~~lr~~~~~d~~~~~l~~~A~~LeGl 553 (1240) T PRK06326 476 NVAQIITFGTMKAKMAVKDVGRTLDVPLSKVNHIAKHIPD-LNTTLS-KALETDPDLRQLYINDAEAAQVIDMAKKLEGS 553 (1240) T ss_pred CEEEEECHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCC-CCCCHH-HHHHCCHHHHHHHCCCHHHHHHHHHHHHHCCC T ss_conf 5798601231378899999999849998899999831776-886277-77642988897750688899999999985398 Q ss_pred HHCCCCCCCEEEECCCCHHHHEECCCCCCCCCEEEEEECHHHHHHCCCHHHHHHHCCHHHHHHHHHHHHH-HCCCCCCCC Q ss_conf 1002447440335256666620012367688668851021334311232135654178999999998874-025434332 Q 537021.9.peg.7 522 YRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLPVTQFNMKWIEKAGLIKFDFLGLKTLTLLQKSLDFLAQ-RGVKVDLSL 600 (1033) Q Consensus 522 ~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~-~~~~~d~~~ 600 (1033) |||+|+||||||||++||+++||++.+++++.++|||||++||++||+|||||||+|||+|++++++|++ +|.++|+.+ T Consensus 554 pR~~s~HAaGvVIs~~pL~~~vPl~~~~~~~~~vtQ~d~~~vE~~GLlK~D~LGL~~Lt~i~~~~~~I~~~~g~~id~~~ 633 (1240) T PRK06326 554 IRNTGVHAAGVIICGDPLTNHIPICISKDSTMITTQYSMKPVESVGMLKVDFLGLKTLTSIHIAMQAIEKKTGKLLAMAT 633 (1240) T ss_pred CCCCCCCCCEEEEECCCCCCCCCEEECCCCCCEEECCCHHHHHHCCCCHHHHHCCCCHHHHHHHHHHHHHHHCCCCCHHC T ss_conf 66656666526861675234476354488994132162789987698022565578079999999999985297457201 Q ss_pred CCCCHHHHHHHHHCCCCCHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHH Q ss_conf 34435889987620141012211453478888771740488887665411411342003688850465785026966733 Q 537021.9.peg.7 601 IPFDDHETYHLLTTKGTLGIFQLESSGMRQALEGMQPDCIEDIIALVSLYRPGPIDNIVVYNNRKNGKEKIVSIHPLIDP 680 (1033) Q Consensus 601 ip~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l~~~~p~~~~dl~~~~al~RPGp~~~~~~yi~rk~g~e~~~~~hp~l~~ 680 (1033) ||+||++||+||++|+|.||||+||+||+++|++++|++|+||||++||||||||++++.|++||||++++.|+||.|++ T Consensus 634 ipldD~~t~~ll~~g~T~GvFQ~ES~g~~~~l~~lkP~~f~dlva~~aL~RPGP~~~i~~yi~rk~g~~~v~y~hp~l~~ 713 (1240) T PRK06326 634 LPLDDKTTFSLLHQGKTMGIFQMESKGMQELAKNLRPDAFEEIIAIGALYRPGPMDMIPSFINRKHGKEIIEYDHPLMES 713 (1240) T ss_pred CCCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCCCCHHHHH T ss_conf 78463999999836666896703362799999866999999988999532777234579999986699787888889998 Q ss_pred HHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCC Q ss_conf 41033553001256898878863268413078897620121101677878999998624999999999999999750001 Q 537021.9.peg.7 681 MLKETQGVIIYQEQVMQIAQLLSGYSLSEADVLRRAMGKKIKEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYG 760 (1033) Q Consensus 681 iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ead~lRra~~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~ 760 (1033) ||++||||||||||||+||+.+||||++|||.||||||||+.+.|++.+++|++||.+||+++++|++||++|++||+|| T Consensus 714 iL~~TyGv~vyQEQvm~ia~~~agfs~~eaD~lRram~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~Fa~Yg 793 (1240) T PRK06326 714 ILKETYGIMVYQEQVMQIAGSLASYSLGEGDVLRRAMGKKDHQQMVKEREKFCSRACANGIDPELATTIFDKMEKFASYG 793 (1240) T ss_pred HHHCCCCEEEEHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCC T ss_conf 76020787676999999999970888888889999855387256899999999999984999999999999998885649 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCEEEEEHH Q ss_conf 33101478889999999999730266677751002354202589998888636850355233421003531986985026 Q 537021.9.peg.7 761 FNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVEKIKKFCQDARQFNIQIMPPSVNTPCVDFKVGDNRIYYSLA 840 (1033) Q Consensus 761 FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~~~~~~i~e~~~~gi~vl~PdIN~S~~~f~~~~~~Ir~Gl~ 840 (1033) ||||||+|||++||||||||+|||+|||||+||++++++++++.|+.|||++||+|+|||||+|.++|++++++||+||+ T Consensus 794 FnksHa~aYa~~ay~tAyLK~~yP~ef~aAlL~s~~~~~~k~~~~i~ear~~Gi~VlpPdIN~S~~~f~~~~~~Ir~GL~ 873 (1240) T PRK06326 794 FNKSHAAAYGLITYTTAYLKANYPKEWLAALLTCDSDDIEKVGKLIREAHSMNIPILPPDINESGQDFVATDEGIRFAMG 873 (1240) T ss_pred CCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCEEECCCCCCCCCCCEECCCCEEECHH T ss_conf 89762478899999999998668799999984378799658999999999879968488536677863770893787636 Q ss_pred HHCCCCHHHHHHHHHHH-HCCCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 85896989999999998-37999997899973482479989999888676723479038999998999999999886543 Q 537021.9.peg.7 841 AIKGVGTTTARHIMEAS-ADKPFDSLEDFCSRVDSKKLNRRVLESLVFAGALDCFGYSRMQLLQSLDNIQKYAQWVEKNR 919 (1033) Q Consensus 841 ~Ikgvg~~~~~~Ive~r-~~g~f~sl~df~~rv~~~~~nk~~le~Li~aGAfD~~~~~R~~ll~~i~~~~~~~~~~~~~~ 919 (1033) +|||||++++++|+++| ++|||+|+.||+.|++.+.+||+++|+||+|||||+||.||++++++++.+++.+...+++. T Consensus 874 aIKgvg~~~~~~Iv~~R~~~Gpf~s~~Df~~R~~~~~~nk~~lE~Li~aGAfD~~g~~R~~ll~~~~~~~~~~~~~~~~~ 953 (1240) T PRK06326 874 AIKGVGRGLVESIVEEREKHGPYKSIQDFIQRSDLKKVTKKQLESLVDAGCFDCFEPNRDLALASLEPLYETISKEKKEA 953 (1240) T ss_pred HHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 54788699999999999868698999999986187448999999998679842527449999998999999999989987 Q ss_pred HCCCCCCCCCCCCC---CCCCCCCCCCCCCCHHHHHHHHHHHHCEEECCCCHHHHHHHHHHCCCCCHHHHHCCCCCCEEE Q ss_conf 12777866665555---655776668999988899985788726211638558999999874897688985055898899 Q 537021.9.peg.7 920 TNKHENIFAHEKGT---FSDKITLEKFSVENSSVRFENEQRVLGFYFSGHPLDVYKPILRKMGIKGYEESVATIRGNNIR 996 (1033) Q Consensus 920 ~~~q~~LF~~~~~~---~~~~~~~~~~~~~~~~e~l~~E~e~LGf~lS~HPl~~~~~~l~~~~~~~~~~~~~~~~g~~v~ 996 (1033) ..+|.+||+..... .......+++++|+..|+|.+|+|+||||+|+|||+.|++.+...++.++.++.+..+|+.|+ T Consensus 954 ~~gq~~lf~~~~~~~~~~~~~~~~~~~~~~~~~e~L~~Eke~lG~yls~HPl~~~~~~l~~~~~~~~~~~~~~~~~~~v~ 1033 (1240) T PRK06326 954 ATGVMTFFSLNAMDREPVVPTCSPENVVRRSKKELLKKEKELLGIYLTEHPMDAVKDILPRLSVVPAGEFENLPHGSVIR 1033 (1240) T ss_pred HCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHEEEECCCCHHHHHHHHHHHCCCCHHHHHHCCCCCEEE T ss_conf 54886655666544454435688666556885559999799847315546289999999870886699996278998799 Q ss_pred EEEEEEEEEE-ECCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 9999961121-015888769999998497049999539 Q 537021.9.peg.7 997 LAAMVVSKQQ-KKTRKGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 997 iaG~V~~i~~-~~TKkG~~mafltleD~~g~~e~~iFP 1033 (1033) ++|+|..+++ .+||+|++|||+|+||++|.+|++||| T Consensus 1034 ~~g~i~~v~~r~~~k~~~~maf~tleD~tg~~e~vvfp 1071 (1240) T PRK06326 1034 TVFIIDKVTTKISSKEQKKFALLRVSDGVDSYELPIWP 1071 (1240) T ss_pred EEEEEEEEEEEEECCCCCEEEEEEEEECCCCEEEEECH T ss_conf 99998468999867999869999999689858999874 No 12 >PRK08744 consensus Probab=100.00 E-value=0 Score=2472.59 Aligned_cols=1008 Identities=41% Similarity=0.712 Sum_probs=931.5 Q ss_pred CCCCCHHHHHHCHHHHHHHCCCHHH--------------HHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 8988133433144476443299899--------------99999978998899811651774999999999769963899 Q 537021.9.peg.7 7 KSPSFVHLQVHSSYSLLEGALSLNN--------------ILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIG 72 (1033) Q Consensus 7 ~~~~FvhLh~hS~ySlldg~~~~e~--------------LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG 72 (1033) .+++||||||||+||+|||++++++ ||++|+++||+||||||||||||+++||++|+++|||||+| T Consensus 2 ~~~~FvHLh~HS~ySlldg~~ri~e~~~~~~~~~~~~~dLv~~A~e~Gm~AlALTD~~nl~Gav~Fy~aa~~~GIKPIiG 81 (1195) T PRK08744 2 STSRFVHLHVHTEFSLADSTIRVPEKPDQADPKKAKQANLLSRAVELDLPALAVTDLNNLFALVKFYKAAEGVGIKPIAG 81 (1195) T ss_pred CCCCCCHHCCCCCCHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHCCCCEEEE T ss_conf 99996520147700232243125632222111112199999999986999899850886879999999999869982850 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCC-CCCCCCHHHHHHHCCCCEEEECCCC Q ss_conf 999985476643344565666677539999569689999999998997324877-8740078999863489889981976 Q 537021.9.peg.7 73 CQLDIDMQDNLKMTNQGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGK-QSVRICLSWLQEIGTEGLIMLTGGS 151 (1033) Q Consensus 73 ~E~~v~~~~~~~~~~~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~-~~p~i~~~~L~e~~~egLIvlsg~~ 151 (1033) ||+++...+. ..+|++|||||++||+|||+|+|.+++++... ..+.++.++|.+ |++|||++||+ T Consensus 82 ~E~~v~~~~~------------~~~~l~LLAkN~~GY~NL~~L~S~a~~~g~kpr~~~~i~~e~l~~-~~~gli~lsG~- 147 (1195) T PRK08744 82 ADVMIATPDM------------TPWRMTLLCRDREGYLSLSRLLTRAWMEGHRPEGGVAIHPEWLQA-GHANLFALAGR- 147 (1195) T ss_pred EEEEEECCCC------------CCCCEEEECCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHH-CCCCEEEEECC- T ss_conf 0899865888------------877679980888999999999999998257888885679999984-58998999788- Q ss_pred CCHHHHHHHCCCHHHHHHHHHHHHHHCCCCEEEEEECCCHH-HHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHH Q ss_conf 79678888779978999999999876598689999439911-58999999999998189899808830078788999987 Q 537021.9.peg.7 152 SGPIDRAFFFNSSQVAEKRLLTFKKLFGDRLYVNLQRHRGY-DRYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDV 230 (1033) Q Consensus 152 ~g~i~~~l~~~~~~~a~~~l~~l~~~Fgd~~ylEl~~~~~~-e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~i 230 (1033) .+.+.+++..++.+.|++.+.+|+++||++||+|+|+|+.. ++.++..++++|+++++|+||||||||++++|+.+|++ T Consensus 148 ~~~~~~~l~~g~~d~a~~~l~~~~~iFgd~~ylElqr~g~~~e~~~~~~l~~la~~~~iplVATndv~y~~~~D~~ahd~ 227 (1195) T PRK08744 148 DSLAGRLFAEGRADLAEQQLADWQRVFGDGLHLELTRTGREGEERFNQFALHAAGVRGLPVVASNDVRFLYASDFNAHEA 227 (1195) T ss_pred CCHHHHHHHCCCHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCCCEEEECCCEECCHHHHHHHHH T ss_conf 53688988678989999999999997088689997058980789999999999998199989858740378636889999 Q ss_pred HHHHHCCCCCCCCCCCCC-CCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHH Q ss_conf 668846985765310024-5343234347887512203589987886411100013322102345557568877689889 Q 537021.9.peg.7 231 LMAVAHSTVVSQKDRPRV-TPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEEN 309 (1033) Q Consensus 231 L~aI~~~~~l~~~~~~~~-~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~ 309 (1033) |+||++|++++++++.+. +.++||||++||.++|.++|+|++||.+||++|++++++.+++||+||+|++ .++++ T Consensus 228 l~cI~~g~~l~d~~r~r~~~~e~ylks~~EM~~lF~d~peAl~NT~~IA~rC~~~l~~~~~~lP~f~~p~g----~t~~~ 303 (1195) T PRK08744 228 RVCISSGRVLDDPKRPRDYSDQQYLKSSEEMAALFADIPDAIDNTHALAQRCNIEMRLGTYFLPAYPVPED----ETLDS 303 (1195) T ss_pred HHHHHCCCCCCCCCCCCCCCHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHHEEEHHCCCCCCCCCCCCCC----CCHHH T ss_conf 99996789416587523289866589999999883389999999999998531251138866888889999----88679 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHH Q ss_conf 99876541034201246432457838999998877898986063055788899999851047643344442046677655 Q 537021.9.peg.7 310 ELRNKAVAGLEVRLAQGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYA 389 (1033) Q Consensus 310 ~Lr~l~~~Gl~~R~~~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~ 389 (1033) ||+++|++||++||.+.....+..+++|.+||+|||+||++|||++|||||||+|+|||++||+|||||||||||||||| T Consensus 304 ~Lr~l~~~Gl~~R~~~~~~~~~~~~~~y~~RL~~EL~vI~~mgF~~YFLIV~D~I~~Ak~~gI~VGPGRGSaAGSLVaY~ 383 (1195) T PRK08744 304 WIRSQSRDGLAARLEKNPIAPGKTRQDYVDRLEFELDTIIKMGFPGYFLIVADFIQWGKNQGIPIGPGRGSGAGSLVAWA 383 (1195) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHH T ss_conf 99999998899997427898653559999999999999997799367999999999999769732788771387899998 Q ss_pred HHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCH Q ss_conf 44037563332310567621001457875333656878999999983038021554220778778899999997527998 Q 537021.9.peg.7 390 LTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPY 469 (1033) Q Consensus 390 LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~ 469 (1033) ||||+||||+|||||||||||+|+|||||||||+++||++||+||.+|||+++||||+||+||++|+||||||||||+|+ T Consensus 384 LgIT~vDPi~y~LLFERFLNpeR~smPDIDiDF~~~rR~eVI~Yv~~kYG~d~VAqI~TfgT~~akaaiRDvgRvlg~p~ 463 (1195) T PRK08744 384 LQITDLDPLPYNLLFERFLNPERVSMPDFDIDFCMDRRDEVIDYVARKYGRERVSQIITYGTMAAKAVVRDAGRVLGFTY 463 (1195) T ss_pred HCCCCCCCHHCCCCHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHHCCCH T ss_conf 46876480013753998706567788876543431339999999998858220786311111588899999999858997 Q ss_pred HHHHHHHHHCCCCCCCCCCHHHHHC-------CHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHH Q ss_conf 8988654310145898244344522-------988788643074202688987201122100244744033525666662 Q 537021.9.peg.7 470 SQVDRLCKLIPNDPAHPVSLQAIAD-------DSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQL 542 (1033) Q Consensus 470 ~~~~~~~~~i~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~ 542 (1033) .++|++||+||..++.++. +++.+ .+++++.+..++.+++++++|++|||+|||+|+||||||||++||+++ T Consensus 464 ~~~d~iaK~iP~~~g~tl~-~a~~e~~~~~~~~~el~~~~~~~~~~~~l~~~A~~LEGl~R~~g~HAaGVVIs~~pL~d~ 542 (1195) T PRK08744 464 GLVDSVAKLIPNILGITLK-DAMGEGKDSEMASPELIQRYQVEDDVRDLMDLARQLEDLTRNAGKHAGGVVIAPEPLSEF 542 (1195) T ss_pred HHHHHHHHHCCCCCCCCHH-HHHHHHHCCCCCCHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCHHHC T ss_conf 7888998626665688889-997640202324889998740487899999999986388443234567158427751100 Q ss_pred EECCCC----CCCCCEEEEEECHHHHHHCCCHHHHHHHCCHHHHHHHHHHHHHH----CCC-CCCCCCCCCHHHHHH-HH Q ss_conf 001236----76886688510213343112321356541789999999988740----254-343323443588998-76 Q 537021.9.peg.7 543 VPMYRD----VRSDLPVTQFNMKWIEKAGLIKFDFLGLKTLTLLQKSLDFLAQR----GVK-VDLSLIPFDDHETYH-LL 612 (1033) Q Consensus 543 ~p~~~~----~~~~~~~~q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~~----~~~-~d~~~ip~~D~~~~~-l~ 612 (1033) ||++.. +.++.++|||||++||++||||||||||||||+|++++++|+++ |.+ +|+..||+||++||+ || T Consensus 543 vPl~~~~~~~~~~~~~vtQ~d~~~vE~~GLlK~DfLGLrtLtvI~~a~~~I~~~~~~~g~~~~dl~~ipldD~~t~~~ll 622 (1195) T PRK08744 543 CPLFAEHDEDGRGKNPVTQFDKNDVEEVGLVKFDFLGLRTLTIIDWAVKAINVRHARAGIDPVDITAIPLDDVPTYKGVF 622 (1195) T ss_pred CCCEECCCCCCCCCCEEEEEEHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCCCCCCCHHHCCCCCHHHHHHHH T ss_conf 56210466566678616984045498779804742788648999999999986215467885676437854599999999 Q ss_pred HCCCCCHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHH Q ss_conf 20141012211453478888771740488887665411411342003688850465785026966733410335530012 Q 537021.9.peg.7 613 TTKGTLGIFQLESSGMRQALEGMQPDCIEDIIALVSLYRPGPIDNIVVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQ 692 (1033) Q Consensus 613 ~~g~t~GvFQ~es~~~~~~l~~~~p~~~~dl~~~~al~RPGp~~~~~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQ 692 (1033) ++|+|.||||+||+||+++|++++|++|+||+|++||||||||++++.|++||||++++.|+||.|++||++|||||||| T Consensus 623 ~~g~T~GvFQlES~gm~~~l~~lkP~~feDiia~~ALyRPGP~~~i~~yi~rk~g~e~i~y~hp~le~iL~~TyGvivYQ 702 (1195) T PRK08744 623 ASGNTGAVFQFESSGMRRLLKDARPDRFEDLIALVSLYRPGPMDLIPDFNARKHGQQEIIYPDPRTEVILKDTYGIMVYQ 702 (1195) T ss_pred HCCCCCCEEEECCHHHHHHHHHCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCEEEEH T ss_conf 56775413330776899999970999899999999742777566658999982799776589878998751358777529 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH Q ss_conf 56898878863268413078897620121101677878999998624999999999999999750001331014788899 Q 537021.9.peg.7 693 EQVMQIAQLLSGYSLSEADVLRRAMGKKIKEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVI 772 (1033) Q Consensus 693 EQvm~ia~~~ag~~~~ead~lRra~~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~ 772 (1033) |||||||+.+||||+||||.||||||||++++|++.+++|++||.+||++++.|++||++|++||+||||||||+|||++ T Consensus 703 EQvMqia~~~agfslg~AD~lRrAmgKK~~~~m~~~~~~F~~Ga~~~G~~~~~a~~if~~i~~Fa~YgFnKSHaaaYa~i 782 (1195) T PRK08744 703 EQVMQMAQIVGDYSLGGADLLRRAMGKKVPAEMAKHREIFREGAAKGGVSAAKADEIFDLMEKFAGYGFNKSHAAAYALV 782 (1195) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHH T ss_conf 99999999995799878999999854367889999999999999876999999999999999860468797613788999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEC-CCEEEEEHHHHCCCCHHHHH Q ss_conf 99999999730266677751002354202589998888636850355233421003531-98698502685896989999 Q 537021.9.peg.7 773 SYQTAWMKTHYPVEFLAASMTLEMDNVEKIKKFCQDARQFNIQIMPPSVNTPCVDFKVG-DNRIYYSLAAIKGVGTTTAR 851 (1033) Q Consensus 773 ~y~~aylK~~yP~eF~aa~L~~~~~~~~~~~~~i~e~~~~gi~vl~PdIN~S~~~f~~~-~~~Ir~Gl~~Ikgvg~~~~~ 851 (1033) ||||||||+|||+|||||+||+++++++|+..|+.|||++||+|+|||||+|.+.|++. +++|||||++|||||+++++ T Consensus 783 aYqtAyLKahyP~eFmaA~L~~~~~~~~k~~~~i~ear~~gi~vlpPdIN~S~~~f~~~~~~~Ir~GL~aIkgvg~~~v~ 862 (1195) T PRK08744 783 SYQTAWLKRHYPAEFMAATLSSDMDNTDKVVGFLDEVRNLGLTVLPPRVNESAYMFEAASPDTIQYGLGAIKGVGQGACE 862 (1195) T ss_pred HHHHHHHHHCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCEEECCCCCCCCCCEEEECCCCEEECCCCCCCCCHHHHH T ss_conf 99999998658999999984468899758999999999879969588414678821760799556432211377599999 Q ss_pred HHHHHH-HCCCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 999998-3799999789997348247998999988867672347903899999899999999988654312777866665 Q 537021.9.peg.7 852 HIMEAS-ADKPFDSLEDFCSRVDSKKLNRRVLESLVFAGALDCFGYSRMQLLQSLDNIQKYAQWVEKNRTNKHENIFAHE 930 (1033) Q Consensus 852 ~Ive~r-~~g~f~sl~df~~rv~~~~~nk~~le~Li~aGAfD~~~~~R~~ll~~i~~~~~~~~~~~~~~~~~q~~LF~~~ 930 (1033) +|+++| ++|||+|++|||.|++.+.+||+++|+||+|||||+||.||++|+++++.++..+.+..+++.+||.+||+.. T Consensus 863 ~Iv~~R~~~Gpf~sl~Df~~Rv~~~~~nkr~lE~LI~aGAfD~~g~~R~~ll~~~~~~~~~~~~~~~~~~~gq~~LF~~~ 942 (1195) T PRK08744 863 AIVEERLRGGPYTTLLDFCTRVGTAKLNRRTLEAMINAGAMDGLGKNRASLMLQLPEVMKATEQMARERASGQNSLFGGP 942 (1195) T ss_pred HHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 99999984899999999997435456999999999867887576857999999899999999997787755886766888 Q ss_pred CCCC-CCCCCCCCCCCCCHHHHHHHHHHHHCEEECCCCHHHHHHHHHHCCCCCHHHHHCCCC-----------C----CE Q ss_conf 5556-557766689999888999857887262116385589999998748976889850558-----------9----88 Q 537021.9.peg.7 931 KGTF-SDKITLEKFSVENSSVRFENEQRVLGFYFSGHPLDVYKPILRKMGIKGYEESVATIR-----------G----NN 994 (1033) Q Consensus 931 ~~~~-~~~~~~~~~~~~~~~e~l~~E~e~LGf~lS~HPl~~~~~~l~~~~~~~~~~~~~~~~-----------g----~~ 994 (1033) .... .....+|+.++|+..|+|..|+|+||||+|+|||+.|+..++....+++..+..... + .+ T Consensus 943 ~~~~~~~~~~~p~~~e~~~~e~L~~Eke~LG~ylSgHPl~~y~~~l~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~ 1022 (1195) T PRK08744 943 DPSAPAMRLDLPESKEWPLGQLLTGERETLGFYLSGHPFDPHREEVRELVGCDLSALEKILASQQRGGGGGGDGEKRAWR 1022 (1195) T ss_pred CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCCCHHCCC T ss_conf 76665445788887779877899998988136665184488999999852476688888755543034334442100134 Q ss_pred EEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 999999961121015888769999998497049999539 Q 537021.9.peg.7 995 IRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 995 v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iFP 1033 (1033) ..+.++|..+....+|+|++|+|+|+||.||.+|+++|| T Consensus 1023 ~~~~~~iag~~~~~~k~G~~maf~tleD~tg~~e~~vF~ 1061 (1195) T PRK08744 1023 PEVSAILAGQVVGVRRKGDSQVFVQLEDGRGRVECSAFS 1061 (1195) T ss_pred CCCEEEEEEEEEEEEECCCEEEEEEEEECCCCEEEEECH T ss_conf 551289988899997269879999999689738999878 No 13 >PRK06920 dnaE DNA polymerase III DnaE; Reviewed Probab=100.00 E-value=0 Score=2449.83 Aligned_cols=980 Identities=38% Similarity=0.678 Sum_probs=934.8 Q ss_pred CCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCC Q ss_conf 81334331444764432998999999997899889981165177499999999976996389999998547664334456 Q 537021.9.peg.7 10 SFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQG 89 (1033) Q Consensus 10 ~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~ 89 (1033) +||||||||+||+|||+++|++||++|+++||+||||||||||||+++|+++|+++|||||+|||+++..++. T Consensus 2 ~FvhLh~hS~ySlLdg~~~i~eLv~~A~e~G~~AlALTDh~~l~Gav~Fy~a~kk~gIKPIiG~e~~v~~~~~------- 74 (1107) T PRK06920 2 KFVHLQCQTVYSLLKSACKIDELVVRAKELGYSSLAITDENVMYGVIPFYKACKKHGIKPIIGLTASIFSEEE------- 74 (1107) T ss_pred CCCCCCCCCCCCHHCCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCCEEEEEEEEECCCC------- T ss_conf 2626363672812050759999999999759999999608848789999999997799803542799843897------- Q ss_pred CCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHHHH Q ss_conf 56666775399995696899999999989973248778740078999863489889981976796788887799789999 Q 537021.9.peg.7 90 KTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVAEK 169 (1033) Q Consensus 90 ~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a~~ 169 (1033) ...+|++|||||++||+|||+|+|.++.++ +|+++.++|.+ +++|||++|||..|++++++..+..++|++ T Consensus 75 ----~~~~~LiLLAkN~~GY~nL~kLsS~~~~~~----~~~i~~~~L~~-~~~gLi~ls~~~~g~i~~~l~~~~~~~a~~ 145 (1107) T PRK06920 75 ----EKSYPLVLLAENEIGYQNLLKISSSIMTKS----KEGIPKKWLAH-YAKGLIAISPGKDGEIEQLLLEDKEEQAEE 145 (1107) T ss_pred ----CCCCCEEEEECCHHHHHHHHHHHHHHHHCC----CCCCCHHHHHH-HCCCCEEEECCCCCHHHHHHHCCCHHHHHH T ss_conf ----775787999349999999999999998658----99989999997-479979982886587999984699999999 Q ss_pred HHHHHHHHCCCCEEEEEECCCHH-HHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 99999876598689999439911-58999999999998189899808830078788999987668846985765310024 Q 537021.9.peg.7 170 RLLTFKKLFGDRLYVNLQRHRGY-DRYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKDRPRV 248 (1033) Q Consensus 170 ~l~~l~~~Fgd~~ylEl~~~~~~-e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~~~~~ 248 (1033) .+.+|+++|| ++|++++.|+.. ++.++..++++|+++++|+||||||||++++|+.+|++|.||++|++++++++.+. T Consensus 146 ~~~~~~~iFg-~~~~~l~~~~~~~~~~~~~~l~~la~~~~iplVATndvhYl~~eD~~~~dvL~~I~~g~~l~d~~~~~~ 224 (1107) T PRK06920 146 VARAYQNMFG-NFYMSLQHHAIQDELLLQEKLPEFSNRVNIPVVATNDVRYINQSDALVHECLLSVESGTKMTDPDRPRL 224 (1107) T ss_pred HHHHHHHHHC-CCEEEEEECCCCCHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCC T ss_conf 9999999738-747999968985459999999999998498989968865478626999999999976996587860115 Q ss_pred -CCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCC Q ss_conf -5343234347887512203589987886411100013322102345557568877689889998765410342012464 Q 537021.9.peg.7 249 -TPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRLAQGA 327 (1033) Q Consensus 249 -~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~~~~ 327 (1033) +.++||||++||.++|.++|+|++||.+||++|++++++..++||+|++|++ .++++||+++|++|+++||+. T Consensus 225 ~~~e~ylks~~EM~~lf~~~peal~NT~~IAerC~~~l~~~~~~lP~f~~p~g----~t~~~~Lr~l~~~Gl~~Ry~~-- 298 (1107) T PRK06920 225 KTDQYYLKSSDEMEALFSHVPEAIYNTVEIAERCRVEIPFHVNQLPKFPVPSN----ETADDYLRRVCEEGLQKRYGT-- 298 (1107) T ss_pred CCCCCCCCCHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC----CCHHHHHHHHHHHHHHHCCCC-- T ss_conf 88102389999999885115999999999997521011248876877779998----778999999999656640379-- Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHH Q ss_conf 32457838999998877898986063055788899999851047643344442046677655440375633323105676 Q 537021.9.peg.7 328 IAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERF 407 (1033) Q Consensus 328 ~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERF 407 (1033) .+++|++||+|||+||++|||++|||||||+|+|||++||+||||||||||||||||||||+||||+|||||||| T Consensus 299 -----~~~~y~~RL~~EL~vI~~mgf~~YFLIV~D~i~~Ak~~~I~vGPGRGSAAGSLVaY~LgIT~VDPi~y~LLFERF 373 (1107) T PRK06920 299 -----PKEVHINRLNHELNVISRMGFSDYFLIVWDFMKYAHENHILTGPGRGSAAGSLVAYVLEITDIDPIEYDLLFERF 373 (1107) T ss_pred -----CCHHHHHHHHHHHHHHHHCCCCCEEEHHHHHHHHHHHCCCEECCCCCCHHHHHHHHHHCCCCCCCCCCCCCHHHH T ss_conf -----959999999999999986488878742999999998779413688873588899998568033841258739986 Q ss_pred HHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCC Q ss_conf 21001457875333656878999999983038021554220778778899999997527998898865431014589824 Q 537021.9.peg.7 408 LNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDPAHPV 487 (1033) Q Consensus 408 Lnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~~~~~ 487 (1033) |||+|++||||||||+++||++||+||++|||.++||||+||+||++|+||||||||||+|+.++|++||+||..++.++ T Consensus 374 LNpeR~smPDIDiDf~~~rR~eVi~Yv~~kYG~~~VAqI~Tf~t~~ak~aiRDv~Rvlg~~~~~~d~iak~ip~~~g~tl 453 (1107) T PRK06920 374 LNPERVTLPDIDIDFPDTRRDEMIRYVKDKYGQLRVAQIVTFGTLAAKAAIRDIARVMGLPPRDIDIFSKLIPSKLGITL 453 (1107) T ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCH T ss_conf 58877888887877650129999999999866675897500241498899999988708998999998732788788887 Q ss_pred CHHHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEECCCCCCCCCEEEEEECHHHHHHC Q ss_conf 43445229887886430742026889872011221002447440335256666620012367688668851021334311 Q 537021.9.peg.7 488 SLQAIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLPVTQFNMKWIEKAG 567 (1033) Q Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~~~~e~~g 567 (1033) . ++++.++++++++..++.+++++++|++|||+|||+|+||||||||++||++++|++.+. +|.++|||||+++|++| T Consensus 454 ~-~a~~~~~~l~~~~~~~~~~~~l~~~A~~lEGlpR~~s~HAaGvVIs~~pL~~~vPl~~~~-~~~~vtQ~d~~~~E~~G 531 (1107) T PRK06920 454 K-EAYEESQSLREFIQGNLLHERVFEIAKRVEGLPRHTSIHAAGVIMSQEPLTGSVAIQEGH-NDVYVTQYPADALEELG 531 (1107) T ss_pred H-HHHHHCHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCCCCCCCEEECC-CCCEEEEEECCCHHHCC T ss_conf 9-999857557776634978899999999964984544456773363176755346514679-99657764430357509 Q ss_pred CCHHHHHHHCCHHHHHHHHHHHHH-HCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCCCHHHHHHHHHHCCCHHHHHHHH Q ss_conf 232135654178999999998874-0254343323443588998762014101221145347888877174048888766 Q 537021.9.peg.7 568 LIKFDFLGLKTLTLLQKSLDFLAQ-RGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLESSGMRQALEGMQPDCIEDIIAL 646 (1033) Q Consensus 568 l~K~D~Lgl~~L~~i~~~~~~i~~-~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l~~~~p~~~~dl~~~ 646 (1033) |||||||||||||+|++++++|++ +|.++|+++||+||++||+||++|+|.||||+||+|||++|++++|++|+||||+ T Consensus 532 LlK~DfLGLr~Ltvi~~a~~~I~~~~~~~id~~~ipldD~~t~~ll~~g~T~GvFQ~ES~g~~~~l~~~~P~~f~diva~ 611 (1107) T PRK06920 532 LLKMDFLGLRNLTLLENIIKFIEKKTGKEIDIRNLPLQDEKTFQLLGRGDTTGVFQLESGGMRNVLRGLKPNEFEDIVAV 611 (1107) T ss_pred CCCHHHHCCCCHHHHHHHHHHHHHHHCCCCCHHHCCCCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHH T ss_conf 80034434772899999999999833997671307845499999985778777074557579999986599989996763 Q ss_pred HHHCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHH Q ss_conf 54114113420036888504657850269667334103355300125689887886326841307889762012110167 Q 537021.9.peg.7 647 VSLYRPGPIDNIVVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEADVLRRAMGKKIKEEMD 726 (1033) Q Consensus 647 ~al~RPGp~~~~~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ead~lRra~~kk~~~~~~ 726 (1033) +||||||||++++.|++||||+++++|+||.|++||++|||||||||||||||+.+||||++|||.||||||||+.+.|+ T Consensus 612 ~aL~RPGP~~~i~~yi~rk~g~~~~~y~hp~l~~iL~~TyGvivyQEQvM~ia~~~agfs~~eaD~lRram~KK~~~~m~ 691 (1107) T PRK06920 612 NSLYRPGPMEQIPTFIESKHGRRKIEYLHPDLKPILERTYGVIVYQEQIMQIASKLAGFSLGEADLLRRAVSKKNRDILD 691 (1107) T ss_pred EEEECCCCCCCHHHHHHHCCCCCCCCCCCHHHHHHHHHCCCEEEEHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHH T ss_conf 00205775200169999708997888899779998742477487799999989987369867899999984437588899 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHH Q ss_conf 78789999986249999999999999997500013310147888999999999973026667775100235420258999 Q 537021.9.peg.7 727 KQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVEKIKKFC 806 (1033) Q Consensus 727 ~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~~~~~~i 806 (1033) +++++|++||.+||+++++|++||++|++||+||||||||+|||++||||||||+|||+|||||+||+++++++++..|+ T Consensus 692 ~~~~~F~~g~~~~g~~~~~a~~i~~~i~~Fa~YgFnkSHa~aYa~iay~~AyLK~hyP~eF~aalL~~~~~~~~k~~~~i 771 (1107) T PRK06920 692 QERKHFVQGCLKNGYDETSAEKIYDLIVRFANYGFNRSHAVAYSMIGYQLAYLKANYPLEFMTALLSSAIGNEDKIVQYI 771 (1107) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCHHHHHHH T ss_conf 99999986676659999999999999998751287878999999999999999875759999999764468856799999 Q ss_pred HHHHHCCCCCCCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 88886368503552334210035319869850268589698999999999837999997899973482479989999888 Q 537021.9.peg.7 807 QDARQFNIQIMPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLV 886 (1033) Q Consensus 807 ~e~~~~gi~vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li 886 (1033) .||+++||+|+|||||+|.++|++++++||+||++|||||++++++|+++|++|||+|+.||+.|++.+.+||+++|+|| T Consensus 772 ~e~~~~gi~vlppdin~S~~~f~~~~~~Ir~gl~~ikgvg~~~~~~Iv~~R~~~~f~~l~Df~~Rv~~~~~~~~~le~Li 851 (1107) T PRK06920 772 RETKRKGFHVLPPSLQRSGYNFQIEGNAIRYSLLSIRNIGMATVTALYEEREKKPFEDLFEFCLRMPSKFVTERNLEAFV 851 (1107) T ss_pred HHHHHCCCCCCCCEEEECCCCEEEECCEEEEHHHHHCCCCHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCHHHHHHHH T ss_conf 99998699514973442388508878906636556456669999999999964899998999985465448999999999 Q ss_pred HCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCEEECCC Q ss_conf 67672347903899999899999999988654312777866665555655776668999988899985788726211638 Q 537021.9.peg.7 887 FAGALDCFGYSRMQLLQSLDNIQKYAQWVEKNRTNKHENIFAHEKGTFSDKITLEKFSVENSSVRFENEQRVLGFYFSGH 966 (1033) Q Consensus 887 ~aGAfD~~~~~R~~ll~~i~~~~~~~~~~~~~~~~~q~~LF~~~~~~~~~~~~~~~~~~~~~~e~l~~E~e~LGf~lS~H 966 (1033) +|||||+||.||++|+++++.+++.+.. |..... ..+.+.++..++|+..|++++|+|+||||+|+| T Consensus 852 ~aGAfD~~g~~R~~l~~~l~~~~~~~~~------------~~~~~~-~~~~~~~~~~~~~~~~e~l~~Eke~lG~yls~H 918 (1107) T PRK06920 852 WSGCFDDFGVSRTNLWASLKGALEYANL------------ARDLGD-AVPKSKYVQGEELSFIEQLNKEKEALGFYLSSY 918 (1107) T ss_pred HCCCCHHHCCCHHHHHHHHHHHHHHHHH------------CCCCCC-CCCCCCCCCCCCCCHHHHHHHHHHHEEEEECCC T ss_conf 7786200164899999979987500433------------136632-347777778899986889997687573321527 Q ss_pred CHHHHHHHHHHCCCCCHHHHHCCCCCCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 5589999998748976889850558988999999961121015888769999998497049999539 Q 537021.9.peg.7 967 PLDVYKPILRKMGIKGYEESVATIRGNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 967 Pl~~~~~~l~~~~~~~~~~~~~~~~g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iFP 1033 (1033) ||+.|++.++..+..++.++... .++.++++|+|+++++++||+|++|+|+|+||+||.+|++||| T Consensus 919 Pl~~~~~~l~~~~~~~~~~~~~~-~~~~~~~~g~i~~~~~~~tk~g~~maf~~leD~tg~~e~vvFp 984 (1107) T PRK06920 919 PTAQYRKLLKELEIPSLAQAMRH-KKKVQRAIVYITSVKVIRTKKGQKMAFITFCDQNDEMEAVVFP 984 (1107) T ss_pred HHHHHHHHHHHCCCCCHHHHHHC-CCCEEEEEEEEEEEEEEECCCCCEEEEEEEEECCCCEEEEECH T ss_conf 29899999987489779999746-8988999999988898636889969999999799868999997 No 14 >COG0587 DnaE DNA polymerase III, alpha subunit [DNA replication, recombination, and repair] Probab=100.00 E-value=0 Score=2451.72 Aligned_cols=1005 Identities=49% Similarity=0.809 Sum_probs=936.0 Q ss_pred CCCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCC Q ss_conf 98813343314447644329989999999978998899811651774999999999769963899999985476643344 Q 537021.9.peg.7 8 SPSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTN 87 (1033) Q Consensus 8 ~~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~ 87 (1033) .++||||||||+||||||++++++||++|+++||+||||||||||||+++|+++|+++|||||||||+|+..++.. T Consensus 1 ~~~fv~Lh~hS~YS~Ldga~~i~~Lv~~A~~~g~~AlaiTD~~nl~Gav~Fy~~ak~~gikpIiG~e~~v~~~~~~---- 76 (1139) T COG0587 1 EMSFVHLHVHSEYSLLDGASKIEELVKKAKELGMPALALTDHNNLYGAVEFYKAAKKAGIKPIIGCEAYVANGDGF---- 76 (1139) T ss_pred CCCEEECCCCCCCCHHCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEHHHHHHHHHHCCCEEEEEEEEEEECCCCC---- T ss_conf 9942641214543131245689999999998699769995277632389999999986981775207997246544---- Q ss_pred CCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHH-HHCCCCEEEECCCCCCHHHHHHHCCCHHH Q ss_conf 56566667753999956968999999999899732487787400789998-63489889981976796788887799789 Q 537021.9.peg.7 88 QGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQ-EIGTEGLIMLTGGSSGPIDRAFFFNSSQV 166 (1033) Q Consensus 88 ~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~-e~~~egLIvlsg~~~g~i~~~l~~~~~~~ 166 (1033) .....+|++|||||++||+|||+|+|.++.++ +++.|++++++|. + |++|||++|||..|+++.+++.++.+. T Consensus 77 ----~~~~~~~l~llAkn~~GY~nL~~LsS~a~~~g-~~~~~~i~~~~l~~~-~~eglI~~s~~~~g~v~~~ll~~~~~~ 150 (1139) T COG0587 77 ----RGRERPHLLLLAKNNEGYKNLVKLSSIAYLEG-EKGKPRIDKDLLELE-YSEGLIALSACLGGEVPQLLLKGNEDL 150 (1139) T ss_pred ----CCCCCCCEEEEECCHHHHHHHHHHHHHHHHCC-CCCCCCCCHHHHHHH-CCCCEEEECCCCCCCHHHHHHCCCHHH T ss_conf ----55677527999678899999999988998547-678864488997433-238818941544553214441365889 Q ss_pred HHHHHHHHHHHCCCCEEEEEECCC-HHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 999999998765986899994399-1158999999999998189899808830078788999987668846985765310 Q 537021.9.peg.7 167 AEKRLLTFKKLFGDRLYVNLQRHR-GYDRYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKDR 245 (1033) Q Consensus 167 a~~~l~~l~~~Fgd~~ylEl~~~~-~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~~ 245 (1033) |++.+.||+++||++||+|+|+|+ ..++.++..++.+|.++++|+||||||||++++|+.+|++++||++|.++++.++ T Consensus 151 a~~~~~~~~~~Fg~~~ylelq~~~~~~~~~~~~~~~~la~~~~ip~vaTndv~y~~~~d~~ahd~~l~i~~~~~l~~~~~ 230 (1139) T COG0587 151 AEEALAWYKEVFGDDFYLELQRHGSPEDRRRNDALIKLARELGIPLVATNDVHYINPEDREAHDALLCIRTGKTLSDDKR 230 (1139) T ss_pred HHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEECCCEECCCCHHHHHHHHHHHHCCCCCCCCCC T ss_conf 99999999998076406764115670667889999999987499868736531158113789999999863980056643 Q ss_pred CCC-CCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 024-5343234347887512203589987886411100013322102345557568877689889998765410342012 Q 537021.9.peg.7 246 PRV-TPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRLA 324 (1033) Q Consensus 246 ~~~-~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~ 324 (1033) .+. +.++||||++||.++|+++|+||+||.+||++|+|+++.. +.+|+||+|++ .++++||+++|++|+++||. T Consensus 231 ~~~~~~e~ylks~eeM~~lf~d~peAl~nT~~Ia~rc~~~l~~~-~~lP~fp~~~~----~~~~~~L~~~~~~Gl~~r~~ 305 (1139) T COG0587 231 LRYSSAEQYLKSPEEMARLFADIPEALANTVEIAERCNFELDLG-PRLPNFPTPPG----KSAAEYLRKLAEEGLEERYK 305 (1139) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCEEHHCC-CCCCCCCCCCC----CCHHHHHHHHHHHHHHHHHH T ss_conf 02467325569999999986605999999999998522242026-65787999888----89899999999977999875 Q ss_pred CCCCCCCC--CHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCC Q ss_conf 46432457--8389999988778989860630557888999998510476433444420466776554403756333231 Q 537021.9.peg.7 325 QGAIAKGY--HAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSL 402 (1033) Q Consensus 325 ~~~~~~~~--~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~L 402 (1033) ....+... ...+|++||+|||+||++|||+||||||||+|+|||++||+||||||||||||||||||||+|||++||| T Consensus 306 ~~~~~~~~~~~~~~y~~RL~~EL~vI~~mgf~~YFLiV~D~I~~Ak~~gI~vGPGRGSaAGSLVaY~L~IT~IDPl~~~L 385 (1139) T COG0587 306 ERLAPEEVPEKVREYKERLEYELDVINKMGFPGYFLIVWDFIKFARDNGIPVGPGRGSAAGSLVAYALGITDIDPLKYDL 385 (1139) T ss_pred HCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCC T ss_conf 22476455323688999999999999866999857729999999997898647987625768999984688768532575 Q ss_pred HHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCC Q ss_conf 05676210014578753336568789999999830380215542207787788999999975279988988654310145 Q 537021.9.peg.7 403 LFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPND 482 (1033) Q Consensus 403 lFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~ 482 (1033) ||||||||+|+||||||||||++||++||+||++|||+++||||+||+||++|+||||||||||+|+.++|+|||+||.+ T Consensus 386 LFERFLNpeR~smPDIDIDf~~~rReeVI~YV~ekYG~d~VAqIiTFgt~~aKaaiRDVgRvlg~~~~~~d~lsK~Ip~~ 465 (1139) T COG0587 386 LFERFLNPERVSMPDIDIDFCDERREEVIQYVYEKYGRDRVAQIITFGTLRAKAAIRDVGRVLGLPYGEVDKLAKLIPFW 465 (1139) T ss_pred CHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEEEEHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCC T ss_conf 06540698789998777177623499999999998551217888641005688999989987099889999997269986 Q ss_pred CCCCCCHHHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEECCCCCCCCCEEEEEECHH Q ss_conf 89824434452298878864307420268898720112210024474403352566666200123676886688510213 Q 537021.9.peg.7 483 PAHPVSLQAIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLPVTQFNMKW 562 (1033) Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~~~ 562 (1033) ++.. ...++++++.+++....+|.+++++++|++|||+|||+|+||||||||++||++++|++...+++ .+|||||++ T Consensus 466 ~~~~-l~~a~~~~~~l~~~~~~d~~~~~~~e~A~kLeGlpRh~s~HAaGvVIs~~~L~~~vPl~~~~~~~-~vtQ~dm~d 543 (1139) T COG0587 466 PGLT-LAVAYEEEPELKELLDSDPEVKRLIELARKLEGLPRHLSTHAAGVVISDDPLTDLVPLYKDKNRD-GVTQYDMDD 543 (1139) T ss_pred CCCC-HHHHHCCCHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCCCCCCCEEECCCCC-CEEEECHHH T ss_conf 6650-76653015888865035688999999999964999887876776996388643338805548888-224435387 Q ss_pred HHHHCCCHHHHHHHCCHHHHHHHHHHHHH-HCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCCCHHHHHHHHHHCCCHHH Q ss_conf 34311232135654178999999998874-02543433234435889987620141012211453478888771740488 Q 537021.9.peg.7 563 IEKAGLIKFDFLGLKTLTLLQKSLDFLAQ-RGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLESSGMRQALEGMQPDCIE 641 (1033) Q Consensus 563 ~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~-~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l~~~~p~~~~ 641 (1033) ||++||+|||||||+|||+|++++++|++ +++++|+.+||+||++||+||++|+|+||||+||+||+++|+++||++|+ T Consensus 544 lE~~GLvK~DfLGL~tLT~I~~a~~~I~~~~~~~i~~~~IpldD~~~y~ml~~gdT~GVFQlES~gm~~~l~rlkP~~fe 623 (1139) T COG0587 544 LEAVGLLKFDFLGLKTLTIIQRALDLIKEKRGIDIDLASIPLDDPKTYEMLAKGDTLGVFQLESRGMKSLLKRLKPDNFE 623 (1139) T ss_pred HHHCCCEEEEHHCCCHHHHHHHHHHHHHHCCCCCCCHHHCCCCCHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCCHH T ss_conf 98747802302054287899999999986359775642088556899999853681589862468999988745998699 Q ss_pred HHHHHHHHCCCHHHHH--HHHHHHHHCCCCCCCCCCH-HHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHC Q ss_conf 8876654114113420--0368885046578502696-673341033553001256898878863268413078897620 Q 537021.9.peg.7 642 DIIALVSLYRPGPIDN--IVVYNNRKNGKEKIVSIHP-LIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEADVLRRAMG 718 (1033) Q Consensus 642 dl~~~~al~RPGp~~~--~~~yi~rk~g~e~~~~~hp-~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ead~lRra~~ 718 (1033) ||||++||||||||++ +++|++||||+|++.|+|| .|++||++|||||||||||||||+.+||||+||||.|||||| T Consensus 624 Diva~vALyRPGPm~~gmI~~fi~RK~g~e~i~y~~p~~Le~IL~~TYGVivYQEQVMqIAq~~agfSlgeAD~LRRAMg 703 (1139) T COG0587 624 DIVALVALYRPGPMQGGMIPPFINRKHGREEIEYPHPEPLEPILKETYGVIVYQEQVMQIAQVLAGFSLGEADLLRRAMG 703 (1139) T ss_pred HHHHHHHHCCCCCCCCCCCCHHHHCCCCCCCCCCCCCHHHHHHHCCCCCCEEEHHHHHHHHHHHCCCCCHHHHHHHHHHC T ss_conf 99999974188875779865677545788888889806689874045573652999999999971899426789999871 Q ss_pred CCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 12110167787899999862499999999999999975000133101478889999999999730266677751002354 Q 537021.9.peg.7 719 KKIKEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDN 798 (1033) Q Consensus 719 kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~ 798 (1033) ||+.++|++.+++|++||++||++++.|++||++|++||+||||||||+|||++||||||||+|||+|||||+||++++| T Consensus 704 KK~~~~m~~~r~~F~~Ga~~~G~~~~~a~~ifd~i~kFA~YGFNkSHAaAYa~isYqtAyLKahYP~eF~aall~s~~~n 783 (1139) T COG0587 704 KKKAEEMEKQREKFIEGAVKNGYDKEFAEKIFDLIEKFAGYGFNKSHAAAYALLSYQTAYLKAHYPAEFMAALLTSEPMN 783 (1139) T ss_pred CCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCC T ss_conf 47789999999999978987699989999999999988404877289999999999999998717799999997447788 Q ss_pred CHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECC-CEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCC Q ss_conf 2025899988886368503552334210035319-869850268589698999999999837999997899973482479 Q 537021.9.peg.7 799 VEKIKKFCQDARQFNIQIMPPSVNTPCVDFKVGD-NRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKL 877 (1033) Q Consensus 799 ~~~~~~~i~e~~~~gi~vl~PdIN~S~~~f~~~~-~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~ 877 (1033) ++|++.|++|||++||+|+|||||.|.++|++++ ++|||||++|||||..++++|+++|+.+||+|++|||.|++.+.+ T Consensus 784 ~~k~~~~i~ear~~Gi~VlpPdIN~S~~~f~~~~~~~I~~gL~~IKGvg~~~i~~Iv~~R~~~~~~~~~df~~r~~~~~l 863 (1139) T COG0587 784 FDKVAQYIQEARRMGIEVLPPDINRSGWDFTVEEKKAIRLGLGAIKGVGEDAIEEIVEARKEKPFKSLEDFCDRIDRKGL 863 (1139) T ss_pred CCCHHHHHHHHHHCCCEEECCCCCCCCCCCEECCCCCEEEHHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCC T ss_conf 62699999999985976737852666776464478727871465568768799999997631577767669987524058 Q ss_pred CHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 98999988867672347903899999899999999988654312777866665555655776668999988899985788 Q 537021.9.peg.7 878 NRRVLESLVFAGALDCFGYSRMQLLQSLDNIQKYAQWVEKNRTNKHENIFAHEKGTFSDKITLEKFSVENSSVRFENEQR 957 (1033) Q Consensus 878 nk~~le~Li~aGAfD~~~~~R~~ll~~i~~~~~~~~~~~~~~~~~q~~LF~~~~~~~~~~~~~~~~~~~~~~e~l~~E~e 957 (1033) ||+++|+||+|||||+|+.||+++||+++.++..+....++ .+|.+||+............+..++|+..|++..|+| T Consensus 864 ~kr~lE~LikaGAfd~~~~~R~~ll~~~~~~~~~~~~~~~~--~~q~~lF~~~~~~~~~~~~~~~~~~~~~~e~l~~E~e 941 (1139) T COG0587 864 NKRVLESLIKAGAFDSFGKNRAQLLAALDDLLDAASGTAKN--SGQLSLFGAAAAGESEQVSYVALPEWSEKEKLALEKE 941 (1139) T ss_pred CHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH--CCCCCCCCHHHCCCCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 78889999974864211157999999999999887665763--1666662112124332124445667886678888888 Q ss_pred HHCEEECCCCHHHHHHHHHHCCC--CCHHHHHCCCCCCEEEEEEEEEEEEEECCC-CCCEEEEEEEECCCCEEEEEECC Q ss_conf 72621163855899999987489--768898505589889999999611210158-88769999998497049999539 Q 537021.9.peg.7 958 VLGFYFSGHPLDVYKPILRKMGI--KGYEESVATIRGNNIRLAAMVVSKQQKKTR-KGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 958 ~LGf~lS~HPl~~~~~~l~~~~~--~~~~~~~~~~~g~~v~iaG~V~~i~~~~TK-kG~~mafltleD~~g~~e~~iFP 1033 (1033) +||+|+|+||++.+...+..... .+..+.... +....++|.|+.++.++|| +|++|+|+|++|++|.+|+++|| T Consensus 942 ~lG~yls~HPl~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~i~~vr~~~tk~~G~~~~f~tl~D~~g~~e~v~f~ 1018 (1139) T COG0587 942 TLGLYLSGHPLDFLYEDLLARGLTPIRLLDLVED--GRRVVLAGGIVAVRQRPTKAKGNKMAFLTLEDETGILEVVVFP 1018 (1139) T ss_pred HHCEECCCCCCHHHHHHHHHCCCCCCCHHHHHCC--CCEEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCCEEEEECH T ss_conf 7600156788547899998602332014554014--6226999999998996326789779999983378857999868 No 15 >PRK05672 dnaE2 error-prone DNA polymerase; Validated Probab=100.00 E-value=0 Score=2434.41 Aligned_cols=981 Identities=31% Similarity=0.503 Sum_probs=915.5 Q ss_pred CCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCC Q ss_conf 88133433144476443299899999999789988998116517749999999997699638999999854766433445 Q 537021.9.peg.7 9 PSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQ 88 (1033) Q Consensus 9 ~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~ 88 (1033) |+||||||||+||+|+|+++|++||++|+++||+||||||||||||+++|+++|+++|||||+|||+++..... T Consensus 3 ~~FvhLhvhS~ySlLdg~~~i~~Lv~~A~~~G~~AlALTD~~~l~Gav~F~~a~~~~GIKPIiG~El~v~~~~~------ 76 (1050) T PRK05672 3 PPYAELHCHSNFSFLDGASHPEELVERAARLGLRALAITDECGLAGVVRAAEAAKELGLRLVIGAELSLGPDPD------ 76 (1050) T ss_pred CCCHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHCCCCEEEEEEEEEEECCC------ T ss_conf 99502202763722040769999999999859998999737766459999999998499867888999861788------ Q ss_pred CCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCE-EEECCCCCCHHHHHHHCCCHHHH Q ss_conf 65666677539999569689999999998997324877874007899986348988-99819767967888877997899 Q 537021.9.peg.7 89 GKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGL-IMLTGGSSGPIDRAFFFNSSQVA 167 (1033) Q Consensus 89 ~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egL-Ivlsg~~~g~i~~~l~~~~~~~a 167 (1033) ...+|++|||||++||+|||+|+|.+|++ +++++|++++++|.++ ++|+ ++++||..|++.+.+..+....+ T Consensus 77 -----~~~~hLvLLAkN~~GY~NL~kL~S~a~l~-~~~~~pri~~~~L~~~-~~gliiilsg~~~g~~~~~l~~~~~~~~ 149 (1050) T PRK05672 77 -----PGGPHLVVLARDREGYGRLSRLITRARLR-AGKGEYRLDLDDLAEP-AGGHWAHLTGCRKGLVILAPPYGGPAAA 149 (1050) T ss_pred -----CCCCCEEEEECCHHHHHHHHHHHHHHHHH-CCCCCCCCCHHHHHHH-CCCCEEEECCCCCCHHHHHHHCCCCHHH T ss_conf -----88866899818989999999999999983-5899723489999843-6999899328865679897516994799 Q ss_pred HHH-HHHHHHHCCCCEEEEEECCCHHH-HHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 999-99998765986899994399115-8999999999998189899808830078788999987668846985765310 Q 537021.9.peg.7 168 EKR-LLTFKKLFGDRLYVNLQRHRGYD-RYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKDR 245 (1033) Q Consensus 168 ~~~-l~~l~~~Fgd~~ylEl~~~~~~e-~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~~ 245 (1033) .+. ..|+...|++|||+|+|+|+..+ ..++..+.++|+++++|+||||||||++++|+.+|++|.||++|+++++.++ T Consensus 150 ~~~~~~~l~~~f~d~~ylElq~~~~~~~~~~n~~l~~lA~~~~iPlVATndvhY~~~eD~~~~dvL~~I~~~~~l~~~~~ 229 (1050) T PRK05672 150 LAAQAAWLRAFFAERVWLELTLHGRPDDDRRNARLAALAARAGVPLVATGDVHMHHRSRRRLQDAMTAIRARRSVAEAGG 229 (1050) T ss_pred HHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 99999999985898589998328984689999999999998299989847864078888999999998764986231443 Q ss_pred CCC-CCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 024-5343234347887512203589987886411100013322102345557568877689889998765410342012 Q 537021.9.peg.7 246 PRV-TPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRLA 324 (1033) Q Consensus 246 ~~~-~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~ 324 (1033) .++ +.++||||++||.++|+++|+|++||.+||++|+|++++.+++||+|++|++ .++++||+++|++|+++||+ T Consensus 230 ~~~~~~e~ylks~~EM~~lF~~~peal~NT~~Ia~~c~~~l~~~~~~lP~~~~p~g----~~~~~~Lr~l~~~Gl~~Ry~ 305 (1050) T PRK05672 230 WLAPNGERHLRSGEEMARLFPDYPEALAETLRLAERCAFDLDLLAYEYPDEPVPAG----HTPASWLRHLTEAGARRRYP 305 (1050) T ss_pred CCCCCHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHCCCCCCCCCCCCC----CCHHHHHHHHHHHHHHHHCC T ss_conf 35898654269999999880788899998999998526316318756888889999----99999999999977998707 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHH Q ss_conf 46432457838999998877898986063055788899999851047643344442046677655440375633323105 Q 537021.9.peg.7 325 QGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLF 404 (1033) Q Consensus 325 ~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlF 404 (1033) ++ .+++|++||+|||+||++|||++|||||||+|+|||++|| ||||||||||||||||||||+||||+||||| T Consensus 306 ~~------~~~~~~~RL~~EL~vI~~mgf~~YFLIV~D~i~~Ak~~gI-v~pGRGSaAGSLVaY~LgIT~VDPi~y~LlF 378 (1050) T PRK05672 306 PG------IPPKARAQIEHELALIAELGYEGYFLTVHDIVRFARSQGI-LCQGRGSAANSAVCYALGITEVDPVQSDLLF 378 (1050) T ss_pred CC------CCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCC-CCCCCCCHHHHHHHHHHCCCCCCCCCCCCCH T ss_conf 88------8789999999999999725887432799999999985574-1478884188999998478565833368847 Q ss_pred HHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCC Q ss_conf 67621001457875333656878999999983038021554220778778899999997527998898865431014589 Q 537021.9.peg.7 405 ERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDPA 484 (1033) Q Consensus 405 ERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~~ 484 (1033) ||||||+|++||||||||+++||++||+||++|||+++||||+||+||++|+||||||||||+|+.++|++||+||+..+ T Consensus 379 ERFLNpeR~smPDIDiDf~~~rR~eVi~Yv~~kYG~~~VAqI~Tf~T~~ak~AiRDvgRvlg~~~~~~d~iak~i~~~~~ 458 (1050) T PRK05672 379 ERFLSRERDEPPDIDVDFEHDRREEVIQYVYRRYGRDRAALVANVITYRPRSALRDVAKALGLSPGQVDAWAKQVSWWDG 458 (1050) T ss_pred HHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEEEEEHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCC T ss_conf 87358777788876655542308999999999848776787555554127789999999849999999999850777666 Q ss_pred CCCCHHHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEECCCCCCCCCEEEEEECHHHH Q ss_conf 82443445229887886430742026889872011221002447440335256666620012367688668851021334 Q 537021.9.peg.7 485 HPVSLQAIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLPVTQFNMKWIE 564 (1033) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~~~~e 564 (1033) ..... +..........+|..++++++|++|||+|||+|+||||||||++||+++||++++++++.++|||||++|| T Consensus 459 ~~~~~----~~~~~~~~~~~~~~~~~~~~~a~~leGl~R~~s~HaaGvVIs~~pL~~~vPl~~~~~~~~~vtQ~d~~~~E 534 (1050) T PRK05672 459 SADDL----QRLRQAGLDPDSPIPRRVVELAAQLIGFPRHLSQHSGGFVISDRPLARLVPVENAAMAGRSVIQWDKDDLD 534 (1050) T ss_pred CCHHH----HHHHHCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEEECCCHHCCCCCCCCCCCCCEEEEEECCHHH T ss_conf 52147----77876055646768999999999853986544566765998677520037844457899746875465187 Q ss_pred HHCCCHHHHHHHCCHHHHHHHHHHHHH-HCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCCCHHHHHHHHHHCCCHHHHH Q ss_conf 311232135654178999999998874-0254343323443588998762014101221145347888877174048888 Q 537021.9.peg.7 565 KAGLIKFDFLGLKTLTLLQKSLDFLAQ-RGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLESSGMRQALEGMQPDCIEDI 643 (1033) Q Consensus 565 ~~gl~K~D~Lgl~~L~~i~~~~~~i~~-~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l~~~~p~~~~dl 643 (1033) ++||||||||||||||+|++++++|++ +|.++|+.+||+||++||+||++|+|.||||+||+||+++|++++|++|+|| T Consensus 535 ~~GLlK~D~LGL~~Lt~i~~~~~~I~~~~g~~~dl~~ipldD~~t~~ll~~g~T~GvFQlES~g~~~~l~~l~P~~feDl 614 (1050) T PRK05672 535 AVGLVKVDVLALGMLSALRRAFDLIAEHRGRRLTLADIPLDDPAVYDMLCRADTVGVFQVESRAQMAMLPRLRPREFYDL 614 (1050) T ss_pred HCCCEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHH T ss_conf 77982242015673799999999999710997662138977789999873667578560147899999986399865342 Q ss_pred HHHHHHCCCHHHHHH--HHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHC-CC Q ss_conf 766541141134200--368885046578502696673341033553001256898878863268413078897620-12 Q 537021.9.peg.7 644 IALVSLYRPGPIDNI--VVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEADVLRRAMG-KK 720 (1033) Q Consensus 644 ~~~~al~RPGp~~~~--~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ead~lRra~~-kk 720 (1033) ||++||||||||+++ +.||+||||+|+++|+||.|++||++|||||||||||||||+.+||||+||||.|||||| || T Consensus 615 va~~ALyRPGP~~~~~i~~yi~rk~g~e~i~y~hp~L~~IL~~TyGvivYQEQvM~ia~~~aGfslgeAD~lRramgsKK 694 (1050) T PRK05672 615 VVEVAIVRPGPIQGGMVHPYLRRRNGEEPVTYPSPELEPVLERTLGVPLFQEQVMQIAIDAAGFTPGEADQLRRAMAAWR 694 (1050) T ss_pred HHHHHEECCCCCCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHC T ss_conf 35531015766545642899987349876568988899998615760589999999999974999899999999987543 Q ss_pred CCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCH Q ss_conf 11016778789999986249999999999999997500013310147888999999999973026667775100235420 Q 537021.9.peg.7 721 IKEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVE 800 (1033) Q Consensus 721 ~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~ 800 (1033) +.+.|++++++|++||.+||+++++|++||++|++||+||||||||+|||++||||||||+|||+|||||+||+++++++ T Consensus 695 ~~~~m~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~fa~YgFnksHa~aYa~~ayq~AylK~hyP~ef~aalL~~~~~~~~ 774 (1050) T PRK05672 695 RVGRLERLRERLVDGMLARGYTGEFAERIFEQIKGFGSYGFPESHAASFAKLVYASSWLKCHHPAAFCAALLNSQPMGFY 774 (1050) T ss_pred CCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCC T ss_conf 72228899999986798759999999999998764434477779999999999999999873999999998567876776 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCEEEEECC-----CEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCC Q ss_conf 25899988886368503552334210035319-----8698502685896989999999998379999978999734824 Q 537021.9.peg.7 801 KIKKFCQDARQFNIQIMPPSVNTPCVDFKVGD-----NRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSK 875 (1033) Q Consensus 801 ~~~~~i~e~~~~gi~vl~PdIN~S~~~f~~~~-----~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~ 875 (1033) ++..+++|||++||+|+|||||+|+++|+++. ++||+||++|||||+++++.|+++|++|||+|++||+.|++ T Consensus 775 ~~~~~~~e~~~~gi~vlppdin~S~~~~~~~~~~~~~~~ir~gl~~ikg~g~~~~~~I~~~R~~g~f~s~~Df~~R~~-- 852 (1050) T PRK05672 775 SPQQLVQDARRHGVEVLPVDVNASGWDATLEPRPGGGPAVRLGLRLVRGLGEEAAERIVAARARGPFTSVEDLARRAG-- 852 (1050) T ss_pred CHHHHHHHHHHCCCCEECCEEECCCCCCEEEECCCCCCEEEEHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHC-- T ss_conf 689999999985992618742025775356314788864874578755570999999999971699999999998858-- Q ss_pred CCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 79989999888676723479038999998999999999886543127778666655556557766689999888999857 Q 537021.9.peg.7 876 KLNRRVLESLVFAGALDCFGYSRMQLLQSLDNIQKYAQWVEKNRTNKHENIFAHEKGTFSDKITLEKFSVENSSVRFENE 955 (1033) Q Consensus 876 ~~nk~~le~Li~aGAfD~~~~~R~~ll~~i~~~~~~~~~~~~~~~~~q~~LF~~~~~~~~~~~~~~~~~~~~~~e~l~~E 955 (1033) +||+++|+||+|||||+||.||++++|.+..+.. ...++.++|.... ......+.+|+|+..|++.+| T Consensus 853 -~~~~~~e~Li~aGafD~~g~~R~~~l~~~~~~~~--------~~~~~~~~~~~~~---~~~~~~~~lp~~~~~e~l~~E 920 (1050) T PRK05672 853 -LDRRALEALADAGALRSLGGHRRQALWAAGAAAT--------ERPPRLPGLAAAE---IPEEHAPALPGMSEAEDVAAD 920 (1050) T ss_pred -CCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHH--------CCCCCCCCCCCCC---CCCCCCCCCCCCCHHHHHHHH T ss_conf -9999999998679840016059999999999764--------1645454433566---565567889996866799999 Q ss_pred HHHHCEEECCCCHHHHHHHHHHCCCCCHHHHHCCCCCCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 887262116385589999998748976889850558988999999961121015888769999998497049999539 Q 537021.9.peg.7 956 QRVLGFYFSGHPLDVYKPILRKMGIKGYEESVATIRGNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 956 ~e~LGf~lS~HPl~~~~~~l~~~~~~~~~~~~~~~~g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iFP 1033 (1033) +|+||||+|+|||+.|++.++..++.++.++.+..+|+.|+|+|+|+.+|+++||+| |+|+|+||++|.+|+|||| T Consensus 921 ~e~LG~yls~HPl~~~~~~l~~~~~~~~~~l~~~~~g~~v~vaG~v~~~~~~~tkkG--maF~tleD~tG~ievvvFp 996 (1050) T PRK05672 921 YWTTGLSLDSHPLALLRERLDARGVVSAAELLDVEDGRRVRVAGLVTHRQRPGTASG--VTFLTLEDETGMVNVVVWP 996 (1050) T ss_pred HHHHCCCCCCCHHHHHHHHHHHCCCCCHHHHHCCCCCCEEEEEEEEEEEEECCCCCC--EEEEEEEECCCCEEEEECH T ss_conf 998683776180899999999739810999840799988999999998476577886--1899998689738999988 No 16 >PRK07279 dnaE DNA polymerase III DnaE; Reviewed Probab=100.00 E-value=0 Score=2288.51 Aligned_cols=922 Identities=32% Similarity=0.556 Sum_probs=856.8 Q ss_pred CHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCC Q ss_conf 13343314447644329989999999978998899811651774999999999769963899999985476643344565 Q 537021.9.peg.7 11 FVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGK 90 (1033) Q Consensus 11 FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~ 90 (1033) ||||||||+||+|||+++|++||++|+++||+||||||||||||+++|+++|+++|||||||||+++..++ T Consensus 2 FvhLh~hS~YSlldg~~~i~~lv~~A~~~g~~AlAlTDh~nm~Gav~F~~~a~~~gIKPIiG~E~~v~~~~--------- 72 (1033) T PRK07279 2 FAQLDTKTVYSFMDSLIDLNHYVERAKELGYHTIGIMDKDNLYGAYHFIKGAQKNGLQPVLGLELNIFYQE--------- 72 (1033) T ss_pred CCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCCEEEEEEEEECCC--------- T ss_conf 56435767474015155999999999975999999964794889999999999759894788899997489--------- Q ss_pred CCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHHHHH Q ss_conf 66667753999956968999999999899732487787400789998634898899819767967888877997899999 Q 537021.9.peg.7 91 TPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVAEKR 170 (1033) Q Consensus 91 ~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a~~~ 170 (1033) ..+|++|||||++||+|||+|+|.++ .+++++++|.+ |++||++++++..+.- T Consensus 73 ----~~~hl~LLAkn~~Gy~nL~kLss~~~-------~~r~d~~~L~~-~~~gl~~l~~~~~~~~--------------- 125 (1033) T PRK07279 73 ----RQVLLRLIAKNTQGYKNLLKISTAKM-------SGKLHFSYLQQ-HLEGIAVIIPYFDWSD--------------- 125 (1033) T ss_pred ----CCEEEEEECCCHHHHHHHHHHHHHHC-------CCCCCHHHHHH-HCCCCEEEECCCCHHH--------------- T ss_conf ----62369998169999999999999856-------79999999997-6399699937856065--------------- Q ss_pred HHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 99998765986899994399115899999999999818989980883007878899998766884698576531002453 Q 537021.9.peg.7 171 LLTFKKLFGDRLYVNLQRHRGYDRYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKDRPRVTP 250 (1033) Q Consensus 171 l~~l~~~Fgd~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~~~~~~~ 250 (1033) .-.|++++|+|++.+. .+.++++|+||||+|||++++|+++|++|.||++|+++++..+ .+. T Consensus 126 ----~~~~~~~~yl~i~~~~------------~~~~~~~plvat~~v~y~~~~D~~~~~vl~~I~~~~~l~d~~~--~~~ 187 (1033) T PRK07279 126 ----TLVLPFDYYIGVYTDT------------SGMDFKRPILPLRTVRYFEQADRETLQMLHAIRDNLSLAEVPV--VSS 187 (1033) T ss_pred ----HHCCCCCEEEEECCCC------------HHHHCCCCCEECCCCEECCHHHHHHHHHHHHHHCCCCCHHCCC--CCC T ss_conf ----6207801175200577------------2877399938717650678606999999999975998001676--881 Q ss_pred CCCCCCHHHHHHHCCH-HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 4323434788751220-358998788641110001332210234555756887768988999876541034201246432 Q 537021.9.peg.7 251 DHYLKNRSEMVSIFSD-LPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRLAQGAIA 329 (1033) Q Consensus 251 ~~yLks~eEM~~~f~~-~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~~~~~~ 329 (1033) ++||+|++||.++|.+ +|+|++||.+||++|+++++ ..+.||+|+. + .++.++|+++|++|+++|.. T Consensus 188 ~~~lks~~em~~lf~~~~peal~Nt~~Ia~~~~~~~~-~~~~lP~f~~-~-----~~~~e~L~~l~~~Gl~~R~~----- 255 (1033) T PRK07279 188 NQELASCQQLETFFQERFPQALQNLEDLVSGISYDFD-TNLKLPRFNR-D-----RPAVEELRELAEAGLKEKGL----- 255 (1033) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHEEECCC-CCCCCCCCCC-C-----CCHHHHHHHHHHHHHHHHCC----- T ss_conf 2016999999999987509999999999973046068-8788999999-9-----88699999999988998379----- Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHHHH Q ss_conf 45783899999887789898606305578889999985104764334444204667765544037563332310567621 Q 537021.9.peg.7 330 KGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERFLN 409 (1033) Q Consensus 330 ~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERFLn 409 (1033) ..++|++||+|||+||++|||++|||||||+|+|||++||+||||||||||||||||||||+||||+|||||||||| T Consensus 256 ---~~~~y~~RL~~EL~vI~~mGf~~YFLIV~D~i~~Ak~~gI~vGPGRGSAAGSLVaY~LgIT~vDPi~y~LlFERFLN 332 (1033) T PRK07279 256 ---WSEPYQERLDKELSIIHDMGFDDYFLIVWDLLRFGRSQGYYMGMGRGSAAGSLVAYALNITGIDPVKHNLLFERFLN 332 (1033) T ss_pred ---CCHHHHHHHHHHHHHHHHCCCCCEEEHHHHHHHHHHHCCCEECCCCCCHHHHHHHHHHCCCCCCHHHCCCCHHHHCC T ss_conf ---96689999999999998679962541999999999866934368887448789999852633384424886898659 Q ss_pred HCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCH Q ss_conf 00145787533365687899999998303802155422077877889999999752799889886543101458982443 Q 537021.9.peg.7 410 PDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDPAHPVSL 489 (1033) Q Consensus 410 p~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~~~~~~~ 489 (1033) |+|++||||||||+++||++||+||.+|||+++||||+||+||++|+||||||||+|+|+.++|+++|+||... + .. T Consensus 333 peR~smPDIDiDF~~~rR~eVI~Yv~~kYG~d~VAqI~TfgT~~ak~AiRDv~Rvlg~~~~~~~~i~k~i~~~~--~-l~ 409 (1033) T PRK07279 333 KERYSMPDIDIDLPDIYRSEFLRYVRNRYGSDHSAQIVTFSTFGAKQAIRDVFKRFGVPEYELSNLTKKISFRD--S-LA 409 (1033) T ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCC--C-HH T ss_conf 88888998422066135999999999986887638854204468689999999981999899999995088888--6-67 Q ss_pred HHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEECCCCCCCCCEEEEEECHHHHHHCCC Q ss_conf 44522988788643074202688987201122100244744033525666662001236768866885102133431123 Q 537021.9.peg.7 490 QAIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLPVTQFNMKWIEKAGLI 569 (1033) Q Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~~~~e~~gl~ 569 (1033) ++++.++++++.+.+++++++++++|++|||+|||+|+||||||||++||++++|++.+ ++.++|||||++||++||+ T Consensus 410 ~~~e~~~~~~~~~~~~~~~~~l~~~A~~LEGlpR~~s~HAaGVVIs~~pL~~~vPl~~~--~~~~vtQ~d~~~~E~~GLl 487 (1033) T PRK07279 410 TVYEKNISFRQVINSKLEYQKAFEIAKRIEGNPRQTSIHAAGVVMSDDDLTNHIPLKSG--EDMMITQYDAHAVEANGLL 487 (1033) T ss_pred HHHHHCHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEEECCCCCCEEEEEEC--CCCEEEEEEHHHHHHCCCC T ss_conf 88761887886663377899999999985598555566677699967986766646767--9863774354889873990 Q ss_pred HHHHHHHCCHHHHHHHHHHHHH-HCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCCCHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 2135654178999999998874-025434332344358899876201410122114534788887717404888876654 Q 537021.9.peg.7 570 KFDFLGLKTLTLLQKSLDFLAQ-RGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLESSGMRQALEGMQPDCIEDIIALVS 648 (1033) Q Consensus 570 K~D~Lgl~~L~~i~~~~~~i~~-~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l~~~~p~~~~dl~~~~a 648 (1033) |||||||||||+|++++++|++ +|.++|+++||+||++||+||++|+|.||||+||+||+++|++++|++|+||||++| T Consensus 488 K~D~LGL~~LtvI~~a~~~i~~~~~~~id~~~ipldD~~t~~ll~~g~T~GvFQ~ES~gm~~~l~~l~P~~feDiva~~A 567 (1033) T PRK07279 488 KMDFLGLRNLTFVQKMQEKVAKDYGIDIDIEAIDLEDKQTLALFAAGDTKGIFQFEQPGAINLLKRIKPQRFEEIVATTS 567 (1033) T ss_pred HHHHHCCCHHHHHHHHHHHHHHHCCCCCCHHHCCCCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 35554567589999999999985598776010585629999999667776776344758999998669996999889877 Q ss_pred HCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHH Q ss_conf 11411342003688850465785026966733410335530012568988788632684130788976201211016778 Q 537021.9.peg.7 649 LYRPGPIDNIVVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEADVLRRAMGKKIKEEMDKQ 728 (1033) Q Consensus 649 l~RPGp~~~~~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ead~lRra~~kk~~~~~~~~ 728 (1033) ||||||+++++.|++||||+++++|+||.|++||++|||||||||||||||+++||||++|||.||||||||+.++|+++ T Consensus 568 L~RPGP~~~i~~yi~rk~g~~~i~y~hp~l~~IL~~TyGvivYQEQvMqia~~~Agfs~~eAD~lRrAm~KK~~~~m~~~ 647 (1033) T PRK07279 568 LNRPGASDYTTNFVKRRHGQEKVDLIDPVIAPILEPTYGIMLYQEQVMQIAQVFAGFSLGKADLLRRAMSKKNASEMQKM 647 (1033) T ss_pred HCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCCEEEEHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCHHHHHHH T ss_conf 40788634679999984699787526278998744778869865999999999839998999999999713758789999 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHH Q ss_conf 78999998624999999999999999750001331014788899999999997302666777510023542025899988 Q 537021.9.peg.7 729 KERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVEKIKKFCQD 808 (1033) Q Consensus 729 ~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~~~~~~i~e 808 (1033) +++|++||.+||++++.|++||++|++||+||||||||+|||++||||||||+|||+|||||+||+ +...|+.| T Consensus 648 ~~~Fi~ga~~~G~~~~~a~~if~~i~~Fa~YgFNKSHaaAYa~iaYqtAyLKahYP~eF~aalL~~------~~~~yi~e 721 (1033) T PRK07279 648 EEDFIAGAKHLGHAEEKARELFDRMEKFAGYGFNRSHAFAYSALAFQLAYFKAHYPAVFYDIMLNY------SSSDYITD 721 (1033) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHCC------CHHHHHHH T ss_conf 999998788739999999999999987525677864036789999999999844889999998457------77899999 Q ss_pred HHHCCCCCCCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHHC Q ss_conf 88636850355233421003531986985026858969899999999983799999789997348247998999988867 Q 537021.9.peg.7 809 ARQFNIQIMPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLVFA 888 (1033) Q Consensus 809 ~~~~gi~vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li~a 888 (1033) |+++||+|+|||||+|.+.|++++++|||||++|||||++++++|+++| ||+|+.||+.|++.+.+||+++|+||+| T Consensus 722 a~~~gi~vlpPdIN~S~~~f~~~~~~Ir~GL~aIkgvg~~~v~~Iv~~R---pf~sl~Df~~Rv~~~~~n~~~~e~Li~a 798 (1033) T PRK07279 722 ALEFGFEVAQLSINTIPYHDKIENKKIYLGLKNIKGLPRDLAYWIIENR---PFNSIEDFLTRLPEKYQKKVFLEPLIKI 798 (1033) T ss_pred HHHCCCCCCCCCCCCCCCCCEECCCEEEEHHHHHCCCCHHHHHHHHHCC---CCCCHHHHHHHHCCHHCCHHHHHHHHHC T ss_conf 9984997458865336776511388188737866587699999999758---9998899998602010498999999977 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCEEECCCCH Q ss_conf 67234790389999989999999998865431277786666555565577666899998889998578872621163855 Q 537021.9.peg.7 889 GALDCFGYSRMQLLQSLDNIQKYAQWVEKNRTNKHENIFAHEKGTFSDKITLEKFSVENSSVRFENEQRVLGFYFSGHPL 968 (1033) Q Consensus 889 GAfD~~~~~R~~ll~~i~~~~~~~~~~~~~~~~~q~~LF~~~~~~~~~~~~~~~~~~~~~~e~l~~E~e~LGf~lS~HPl 968 (1033) ||||+||.||++++++++.++.++.. ..+||++.. ...++.++|+..|++.+|+|+||||+|+||| T Consensus 799 GAfD~~~~~R~~ll~~l~~~~~~~~~--------~~~l~~~~~------~~~~~~~~~~~~e~l~~Eke~lG~yls~HPl 864 (1033) T PRK07279 799 GLFDSFEPNRQKILDNLDNLFVFVNE--------LGSLFADSS------YSWVEAEDYSETEKYSLEQELLGVGVSKHPL 864 (1033) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHH--------CCCCCCCCC------CCCCCCCCCCHHHHHHHHHHHHEEEECCCHH T ss_conf 98411260199999818888987875--------023346766------6777888998545899988871663454548 Q ss_pred HHHHHHHHHCCCCCHHHHHCCCCCCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 89999998748976889850558988999999961121015888769999998497049999539 Q 537021.9.peg.7 969 DVYKPILRKMGIKGYEESVATIRGNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 969 ~~~~~~l~~~~~~~~~~~~~~~~g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iFP 1033 (1033) +.++..... ..+++. +...+++++++|+|..+++++||+|++|+|+|+||++|.+|++||| T Consensus 865 ~~~~~~~~~-~~~~~~---~l~~~~~~~~~g~i~~v~~~~tk~g~~maf~tleD~tg~ie~vvFp 925 (1033) T PRK07279 865 QDIAEKSSQ-PFTPIS---QLVKNSEATILVQIQSIRIIRTKTGQQMAFLSVNDTKKKLDVTLFP 925 (1033) T ss_pred HHHHHHHHH-CCCCHH---HHHCCCCCEEEEEEEEEEEEECCCCCEEEEEEEEECCCCEEEEECH T ss_conf 999988640-654088---7505896279999988899877989969999999899868999867 No 17 >PRK05898 dnaE DNA polymerase III DnaE; Validated Probab=100.00 E-value=0 Score=2250.93 Aligned_cols=930 Identities=32% Similarity=0.540 Sum_probs=854.0 Q ss_pred CHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCC Q ss_conf 13343314447644329989999999978998899811651774999999999769963899999985476643344565 Q 537021.9.peg.7 11 FVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGK 90 (1033) Q Consensus 11 FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~ 90 (1033) ||||||||+||||||+++|++||++|+++||+||||||||||||+++|+++|+++|||||+|||++.. T Consensus 2 FvhLh~hS~YSlLdg~~~i~~lv~~A~e~g~~alALTD~~nl~Gav~Fy~~akk~gIKPIiG~E~~~~------------ 69 (971) T PRK05898 2 FINLNTHSHYSLLSSTLSIDDIIKFALDNNQPYVCLTDLNNLYGCIEFYDKAKAHNLIPIIGLEIEYQ------------ 69 (971) T ss_pred CCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHCCCCEEEEEEEECC------------ T ss_conf 26767887363016246999999999985999899986888879999999999879944989999827------------ Q ss_pred CCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHHHHH Q ss_conf 66667753999956968999999999899732487787400789998634898899819767967888877997899999 Q 537021.9.peg.7 91 TPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVAEKR 170 (1033) Q Consensus 91 ~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a~~~ 170 (1033) ...+|++|||||++||+|||+|+|.++....+ .+.++ .++++++ |..|.+ T Consensus 70 ---~~~~~LvLLAkN~~GY~NL~kLsS~~~~~~~~---------~l~~~-~~~l~ii--~~~g~~--------------- 119 (971) T PRK05898 70 ---STNATLVLYAKNYNGYLNLIKISSFIMTNKEF---------EIQDY-LDDLFIV--CKKGTF--------------- 119 (971) T ss_pred ---CCCCEEEEECCCHHHHHHHHHHHHHHHHCCCC---------HHHHH-CCCCEEE--ECCCCC--------------- T ss_conf ---99943999849899999999999998608963---------47665-2982899--279980--------------- Q ss_pred HHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 99998765986899994399115899999999999818989980883007878899998766884698576531002453 Q 537021.9.peg.7 171 LLTFKKLFGDRLYVNLQRHRGYDRYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKDRPRVTP 250 (1033) Q Consensus 171 l~~l~~~Fgd~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~~~~~~~ 250 (1033) .|. ........+..+.|.||||||||++++|+.+|++|.||+++.++++..+.+... T Consensus 120 ------~~~-----------------~~~~~~~~~~~~~~~Vatndv~y~~~eD~~~~~~l~~I~~~~~l~~~~~~~~~~ 176 (971) T PRK05898 120 ------VFK-----------------SPNFYQTHNQNAPNAIAFNSVFYANKNDKIVFNAMLAIKNDLKIDELKNCQDFD 176 (971) T ss_pred ------CCC-----------------CHHHHHHHHCCCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCC T ss_conf ------547-----------------637999974579981775776417876699999999981699647640024476 Q ss_pred CCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 43234347887512203589987886411100013322102345557568877689889998765410342012464324 Q 537021.9.peg.7 251 DHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRLAQGAIAK 330 (1033) Q Consensus 251 ~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~~~~~~~ 330 (1033) ++|++|++||.++|. |+|++||.+||++|++++++.+++||+|++|++. ++++||+++|++||++||++. . T Consensus 177 ~~~~~s~~em~~lf~--~~al~Nt~~Ia~~C~~~l~~~~~~lP~f~~p~g~----~~~eyLr~l~~~Gl~~R~~~~---~ 247 (971) T PRK05898 177 NNHFLNDNEAQSLFS--PIQLDNLNKVLNELKVEIHDLPINIIKYDKQNSI----ISSEILKQLCISGLNKRLNAN---D 247 (971) T ss_pred CCCCCCHHHHHHHCC--CHHHHHHHHHHHHHEEECCCCCCCCCCCCCCCCC----CHHHHHHHHHHHHHHHHCCCC---C T ss_conf 243689999988687--1999999999997335502478779999888999----989999999998999861677---5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHHHHH Q ss_conf 57838999998877898986063055788899999851047643344442046677655440375633323105676210 Q 537021.9.peg.7 331 GYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERFLNP 410 (1033) Q Consensus 331 ~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERFLnp 410 (1033) ...+++|++||+|||+||++|||+||||||||+|+|||++||+||||||||||||||||||||+||||+||||||||||| T Consensus 248 ~~~~~~y~eRL~~EL~VI~~mGf~~YFLIV~D~I~~Ak~~gI~vGPGRGSaAGSLVaY~LgIT~iDPi~~~LlFERFLnp 327 (971) T PRK05898 248 GQVKKIYVKRLKYELDIINEKQFDDYFLIVYDFINFAKSNGIIIGPGRGSAAGSLIAYLLHITDIDPIKYNLIFERFLNP 327 (971) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCCHHHCCCCHHHHCCC T ss_conf 32339999999999999997599874689999999999759721788762698999999748775822158747765686 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCHH Q ss_conf 01457875333656878999999983038021554220778778899999997527998898865431014589824434 Q 537021.9.peg.7 411 DRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDPAHPVSLQ 490 (1033) Q Consensus 411 ~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~~~~~~~~ 490 (1033) +|.+||||||||+++||++||+||++|||.++||||+||+||++|+||||||||+|+|+.++|+++|+||...+.++. . T Consensus 328 eR~smPDIDiDf~~~rR~eVi~Yv~~kYG~~~va~I~Tf~t~~ak~aiRDv~rvlg~~~~~~d~i~k~i~~~~~~~l~-~ 406 (971) T PRK05898 328 TRKSMPDIDTDIMDERRDEVVEYLFEKYGNDHVAHIITFQRIKAKMAIRDVGRILGIDLKVIDKICKNIKPDYEEDLD-L 406 (971) T ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCHH-H T ss_conf 657899864338854499999999999498668987320213688999999997299999999999736888787989-9 Q ss_pred HHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEECCCCCCCCCEEEEEECHHHHHHCCCH Q ss_conf 45229887886430742026889872011221002447440335256666620012367688668851021334311232 Q 537021.9.peg.7 491 AIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLPVTQFNMKWIEKAGLIK 570 (1033) Q Consensus 491 ~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~~~~e~~gl~K 570 (1033) ++..++.+++.... .++++++|.+|||+|||+|+||||||||++||++++|++.+. +|.++|||||+++|++||+| T Consensus 407 a~~~~~~l~~~~~~---~~~l~~~a~~leG~~R~~s~HAaGvVIs~~~l~~~vPl~~~~-~~~~vtQ~d~~~~E~~GLlK 482 (971) T PRK05898 407 AIKKNTILKEMYVL---HKELFDLAKKIINAPRQIGTHAAGVVLSNSLLTNIIPIQLGI-NDRPLSQYSMEYLERFGLIK 482 (971) T ss_pred HHHHCCCCHHHHHH---HHHHHHHHHHHCCCCCCCCCCCCEEEECCCCCCCCCCEEECC-CCCEEEEEEHHHHHHCCCCC T ss_conf 97637021023366---899999999853984433556662583077745036614668-89846874668788659800 Q ss_pred HHHHHHCCHHHHHHHHHHHHH-HCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 135654178999999998874-0254343323443588998762014101221145347888877174048888766541 Q 537021.9.peg.7 571 FDFLGLKTLTLLQKSLDFLAQ-RGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLESSGMRQALEGMQPDCIEDIIALVSL 649 (1033) Q Consensus 571 ~D~Lgl~~L~~i~~~~~~i~~-~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l~~~~p~~~~dl~~~~al 649 (1033) ||||||||||+|++++++|++ +|.++|+.+||+||++||+||++|+|.||||+||+||+++|++++|++|+||+|++|| T Consensus 483 ~DfLGLr~LtvI~~~~~~i~~~~~~~id~~~ipldD~~t~~ll~~G~T~GvFQlES~gm~~~l~~lkP~~feDlia~~AL 562 (971) T PRK05898 483 MDLLGLKNLTIIDNVLKLIKENQNKKIDLFNINLNDKNVFEDLAKGRTNGIFQLESPGMKKVLKKVKPQNIEDISIVSAL 562 (971) T ss_pred HHHHCCCHHHHHHHHHHHHHHHCCCCCCHHHHCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHCCCCCHHHHHHHHHH T ss_conf 23434038999999999999851987657762325299999996176663502077899999997599979999999986 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHH Q ss_conf 14113420036888504657850269667334103355300125689887886326841307889762012110167787 Q 537021.9.peg.7 650 YRPGPIDNIVVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEADVLRRAMGKKIKEEMDKQK 729 (1033) Q Consensus 650 ~RPGp~~~~~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ead~lRra~~kk~~~~~~~~~ 729 (1033) ||||||++++.|++||||+|+++|+||.|++||++|||||||||||||||+.+||||++|||.||||||||+.++|++++ T Consensus 563 ~RPGP~~~i~~yi~rk~g~e~v~y~hp~l~~IL~~TyGvivYQEQVMqia~~lagfslgeAD~lRram~KK~~~~m~~~~ 642 (971) T PRK05898 563 FRPGPQQNIKTFVERRFKREEFSYWNEATKKILEPTHGIIVYQEQVINLVKTIANFDIATADNFRRAISKKDEKILIQLK 642 (971) T ss_pred CCCCCCCCCHHHHHHHCCCCCCCCCCHHHHHHHCCCCCEEEEHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 18776556437887654898888889999986036785686599999999999499988999999986558777999999 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH Q ss_conf 89999986249999999999999997500013310147888999999999973026667775100235420258999888 Q 537021.9.peg.7 730 ERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVEKIKKFCQDA 809 (1033) Q Consensus 730 ~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~~~~~~i~e~ 809 (1033) ++|++||++||++++.|++||++|++||+||||||||+|||++||||||||+|||+|||||+||+++++++++..|+.|| T Consensus 643 ~~Fi~ga~~~g~~~~~a~~if~~i~~Fa~YgFNkSHs~aYa~iayqtAyLKahyP~eF~aalL~~~~~~~~k~~~~i~ea 722 (971) T PRK05898 643 KDFIEGALKNNYKQPLVNQIFEYIFSFADYGFNHSHSLAYSYISYWMAYLKHYYPLEFLSILLSHTSASKDKLLSYLNEA 722 (971) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCHHHHHHHHHH T ss_conf 99999999769999999999999998606676689999999999999999854789999998468999975799999999 Q ss_pred HHCCCCCCCCCCCCCCEEEEECC--CEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHH Q ss_conf 86368503552334210035319--8698502685896989999999998379999978999734824799899998886 Q 537021.9.peg.7 810 RQFNIQIMPPSVNTPCVDFKVGD--NRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLVF 887 (1033) Q Consensus 810 ~~~gi~vl~PdIN~S~~~f~~~~--~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li~ 887 (1033) +++||+|+|||||+|.++|++++ +.||+||++|||||++++++|+++|++|||+|+.||+.|++.+.+||+++|+||+ T Consensus 723 ~~~gi~vlpPdIN~S~~~f~~~~~~~~Ir~GL~aIKgvG~~~~~~Iv~~R~~g~f~~~~df~~Rv~~~~~nk~~lE~LI~ 802 (971) T PRK05898 723 KEFNISIKKPDINYSSNSFVLDTQKQIIRFGFNTIKGFGDELLKKIKSALQNKTFSDFISYIDALKKNNVSLSNIEILIN 802 (971) T ss_pred HHCCCEECCCCCCCCCCCEEEECCCCEEEEEHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHHHHCCHHCCCHHHHHHHHH T ss_conf 98699273882257778527857998699764760785599999999986689999999999853700089999999997 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHH-HHHHHHHHCEEECCC Q ss_conf 7672347903899999899999999988654312777866665555655776668999988899-985788726211638 Q 537021.9.peg.7 888 AGALDCFGYSRMQLLQSLDNIQKYAQWVEKNRTNKHENIFAHEKGTFSDKITLEKFSVENSSVR-FENEQRVLGFYFSGH 966 (1033) Q Consensus 888 aGAfD~~~~~R~~ll~~i~~~~~~~~~~~~~~~~~q~~LF~~~~~~~~~~~~~~~~~~~~~~e~-l~~E~e~LGf~lS~H 966 (1033) |||||+||.||++|+.+++.++..+.. ..+|.... ....+...+++..|+ +++|+|+||||+|+| T Consensus 803 aGAfD~fg~nR~~ll~~l~~~~~~~~~---------~~~~~~~~-----~~~~~~~~~~~~~e~~l~~Eke~lG~yls~h 868 (971) T PRK05898 803 VGTFDSFKLSRLFLLNNLPEIFEKTSL---------NGHFFELN-----LVGLEYAKDMSVKERFLKKEIQYLGIDLNSL 868 (971) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHH---------CCCCCCCC-----CCCCCCCCCCCHHHHHHHHHHHHHEEEECCC T ss_conf 188757783599999858999998740---------36411212-----2352125532145788898799875331478 Q ss_pred CHHHHHHHHHHC----CCCCHHHHHCCCCCCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 558999999874----8976889850558988999999961121015888769999998497049999539 Q 537021.9.peg.7 967 PLDVYKPILRKM----GIKGYEESVATIRGNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 967 Pl~~~~~~l~~~----~~~~~~~~~~~~~g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iFP 1033 (1033) |++.|+..+... ....+..+.....+++++++|.|..+|+++||||+.|||+++.|.++.+++|+|| T Consensus 869 pl~~y~~~~~~~~~~~~~~~~~~i~~~~~~~~~~i~~~~~~ik~irtk~g~~maf~~~~d~~~~~~~t~f~ 939 (971) T PRK05898 869 NYANYKNEIDYNELKYTIKDIKEIELKNTNYETNIFAQVYEIKQSKTKNGNLSFYLETLINNKKKTITIFQ 939 (971) T ss_pred CHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCEEEEEEEEEEEEEEECCCCCEEEEEEEECCCCCEEEEECC T ss_conf 77999999987665530125661444245750488878612577862577536899986378736899785 No 18 >PRK07135 dnaE DNA polymerase III DnaE; Validated Probab=100.00 E-value=0 Score=2222.29 Aligned_cols=939 Identities=32% Similarity=0.529 Sum_probs=851.6 Q ss_pred CCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCC Q ss_conf 81334331444764432998999999997899889981165177499999999976996389999998547664334456 Q 537021.9.peg.7 10 SFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQG 89 (1033) Q Consensus 10 ~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~ 89 (1033) +||||||||+||+|||+++|++||++|+++||+||||||||||||+++|+++|+++|||||+|||+++. T Consensus 2 ~FvHLh~HS~YSlLdg~~~i~~Lv~~A~e~g~~AlALTDh~nl~Gav~F~~~ak~~gIKPIiG~e~~~~----------- 70 (987) T PRK07135 2 KLINLHTNTEYSFLESAIKLNDLIKLAKENNLKTLVLTDHNNMFGVPKFYKLCKKNKIKPIIGLDIEVE----------- 70 (987) T ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHCCCCEEEEEEEEEC----------- T ss_conf 031767688483105104899999999985999999864887989999999999859976768999978----------- Q ss_pred CCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHHHH Q ss_conf 56666775399995696899999999989973248778740078999863489889981976796788887799789999 Q 537021.9.peg.7 90 KTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVAEK 169 (1033) Q Consensus 90 ~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a~~ 169 (1033) .+|++|||||++||+|||+|+|.++.+ +.+.+.+|.+ ++++++++++..|...... . T Consensus 71 ------~~~liLLAkN~~GY~NL~kL~S~~~~~------~~~~l~~l~~--~~~liil~~~~~g~~~~~~---------~ 127 (987) T PRK07135 71 ------NFRFILLAKNYSGYKLLNELSSKKSNN------EEIFLSDLAE--SDNLFIIDHPTKGFYAKKK---------E 127 (987) T ss_pred ------CCEEEEEECCHHHHHHHHHHHHHHHCC------CCCCHHHHHH--CCCEEEECCCCCCHHHHHH---------H T ss_conf ------971899958879999999998674338------9978898830--6998998176254666767---------6 Q ss_pred HHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 99999876598689999439911589999999999981898998088300787889999876688469857653100245 Q 537021.9.peg.7 170 RLLTFKKLFGDRLYVNLQRHRGYDRYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKDRPRVT 249 (1033) Q Consensus 170 ~l~~l~~~Fgd~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~~~~~~ 249 (1033) ....+ .++|++.+.+... ..|++++++|++++|+.+|++|.||+++..- ...+ T Consensus 128 ~~~~~-----~~~~~~~~~~~~~-----------------~~v~~~~~~~l~~ed~~~~~~l~~i~~~~~~----~~~~- 180 (987) T PRK07135 128 QESEL-----KNYYINSNNPKIP-----------------NAVYVKENKLLDKEDNETLEILNKIKNNTKE----NTNF- 180 (987) T ss_pred HHHHH-----HHHHHHCHHCCCC-----------------CHHHHCHHHHCCHHHHHHHHHHHHHHCCCHH----CCCC- T ss_conf 89888-----8898632003885-----------------3001120000467579999999987416320----1276- Q ss_pred CCCCCCCHHHHHHHCCH-HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 34323434788751220-35899878864111000133221023455575688776898899987654103420124643 Q 537021.9.peg.7 250 PDHYLKNRSEMVSIFSD-LPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRLAQGAI 328 (1033) Q Consensus 250 ~~~yLks~eEM~~~f~~-~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~~~~~ 328 (1033) .+++. +.+ .+++++||.+||++|++.++...++||+|++|++ .++++||+++|++|+++|+.... T Consensus 181 --~~~~~-------~~~~~e~~l~NT~~IA~rC~~~~~~~~~~lP~f~~~~g----~~~~~~L~~l~~~g~~~r~~~~~- 246 (987) T PRK07135 181 --FDFEE-------EQEIDEIILKRINYIVDNCNVEFPKIEIKLPDFDHNLG----IESELFLKKILKEAVQEKKILLS- 246 (987) T ss_pred --CCHHH-------HCCCHHHHHHHHHHHHHHCEEEECCCCCCCCCCCCCCC----CCHHHHHHHHHHHHHHHHHCCCC- T ss_conf --45132-------24184999999999998567871678877999999999----99999999999848987500589- Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHHH Q ss_conf 24578389999988778989860630557888999998510476433444420466776554403756333231056762 Q 537021.9.peg.7 329 AKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERFL 408 (1033) Q Consensus 329 ~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERFL 408 (1033) ..++|++||+|||+||++|||++|||||||+|+|||++||+||||||||||||||||||||+||||+||||||||| T Consensus 247 ----~~~~y~~RL~~EL~VI~~mGF~~YFLIV~DfI~~Ak~~gI~VGPGRGSaAGSLVaY~LgIT~VDPi~y~LLFERFL 322 (987) T PRK07135 247 ----NYDNVKERINYELSVIKKLKFTDYFLIIWDILKWARKNKISIGPGRGSASGSLVSYLLEITSVNPLKYDLLFERFL 322 (987) T ss_pred ----CCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCEECCCCCCHHHHHHHHHHCCCCCCHHHCCCCHHHHC T ss_conf ----7089999999999999976998634399999998755494546887737889999996687638765487799873 Q ss_pred HHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCC Q ss_conf 10014578753336568789999999830380215542207787788999999975279988988654310145898244 Q 537021.9.peg.7 409 NPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDPAHPVS 488 (1033) Q Consensus 409 np~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~~~~~~ 488 (1033) ||+|+||||||||||++||++||+||++|||.++||||+||+||++|+||||||||||+|+.++|++||+||...+.... T Consensus 323 NPeR~smPDIDIDF~~~rR~eVI~YV~~KYG~d~VAqIiTfgT~~aKaAiRDVgRvlg~p~~~vd~iaK~Ip~~~~~~~l 402 (987) T PRK07135 323 NPDRITMPDIDIDIQDDRRDEVLDYLFEKYGPQKVATITTFQTLGAKSAIRDVGRMLGIPESDINKISKLINANANLSQL 402 (987) T ss_pred CCCCCCCCCCEEECCCCCHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCH T ss_conf 67788999823048866699999999999687648873007767788999999998289999999986348877786746 Q ss_pred HHHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEECCCCCCCCCEEEEEECHHHHHHCC Q ss_conf 34452298878864307420268898720112210024474403352566666200123676886688510213343112 Q 537021.9.peg.7 489 LQAIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLPVTQFNMKWIEKAGL 568 (1033) Q Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~~~~e~~gl 568 (1033) .+++.++++|++.+.+++.+++++++|.+|||+|||+|+||||||||++||++++|++++. ++.++|||||++||++|| T Consensus 403 ~~~~~~~~~l~~~~~~~~~~~~l~~~a~~LEGl~R~~s~HAaGvVIs~~pl~~~vPl~~~~-~~~~~tq~d~~~~E~~GL 481 (987) T PRK07135 403 YEQNKKNSKFRALVSKGDDYEKLYKIAAKIEGLPRQTGTHAAGIIISNKDITNYVPVFNSK-ENLNQVQYSMEFIEDFGL 481 (987) T ss_pred HHHHHHCHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCCHHHCCEEECC-CCCEEEEEECCCHHHCCC T ss_conf 9887529889988733868999999999964985544557771685076500205636879-998568877752665687 Q ss_pred CHHHHHHHCCHHHHHHHHHHHHH-HCCCCC----CCCCCCCHHHHHHHHHCCCCCHHHHCCCHHHHHHHHHHCCCHHHHH Q ss_conf 32135654178999999998874-025434----3323443588998762014101221145347888877174048888 Q 537021.9.peg.7 569 IKFDFLGLKTLTLLQKSLDFLAQ-RGVKVD----LSLIPFDDHETYHLLTTKGTLGIFQLESSGMRQALEGMQPDCIEDI 643 (1033) Q Consensus 569 ~K~D~Lgl~~L~~i~~~~~~i~~-~~~~~d----~~~ip~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l~~~~p~~~~dl 643 (1033) +|||||||+|||+|++++++|++ ++.++| +..||+||++||+||++|+|.||||+||+|||++|++++|++|+|| T Consensus 482 lK~D~LGL~~Ltvi~~~~~~i~~~~~~~~d~~~~~~~~pldD~~~~~ll~~g~T~GvFQ~ES~gm~~~l~~lkP~~feDl 561 (987) T PRK07135 482 LKIDLLGLKNLTIIKNIEAKISDGHKITFDELINQNLNPIIDNKTFNLLSDGKTEGIFQLESPGMTATIKKVGISSFEDI 561 (987) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHCCCCCCCCCCHHHCCCCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHH T ss_conf 34456524668999999999986058885422543116887599999997257799553247599999997499979998 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCC Q ss_conf 76654114113420036888504657850269667334103355300125689887886326841307889762012110 Q 537021.9.peg.7 644 IALVSLYRPGPIDNIVVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEADVLRRAMGKKIKE 723 (1033) Q Consensus 644 ~~~~al~RPGp~~~~~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ead~lRra~~kk~~~ 723 (1033) ||++||||||||+++++|++||||+++++|+||.|++||++|||||||||||||||+.+||||++|||.||||||||+.+ T Consensus 562 ia~~AL~RPGP~~~i~~yi~rk~g~~~v~y~~p~l~~iL~~TyGvivYQEQvM~ia~~~agfs~geaD~lRram~KK~~~ 641 (987) T PRK07135 562 YAIISLYRPGPIQYIPTYAKNKKDPSNIEKIFPEYDKIVAPTFGIIIYQEQIMEIAQKVAGFSFAQADILRRAISKKNET 641 (987) T ss_pred HHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCEEEEHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCHH T ss_conf 89876205760321278998614887778889899988886378677599999999998089988999999986558711 Q ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHH Q ss_conf 16778789999986249999999999999997500013310147888999999999973026667775100235420258 Q 537021.9.peg.7 724 EMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVEKIK 803 (1033) Q Consensus 724 ~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~~~~ 803 (1033) +|++++++|++||.+||++++.|++||++|++||+||||||||+|||++||||||||+|||+|||||+||++++++++++ T Consensus 642 ~m~~~~~~F~~ga~~~g~~~~~a~~i~~~i~~Fa~YgFnkSHa~aYa~iayqtAyLK~~yP~eF~aa~L~~~~~~~~k~~ 721 (987) T PRK07135 642 ELDKYKEKFIEGGIKNGYSPKLVEHIYSKIEKFAQYGFNKSHAVAYATLAYKMAYYKARYPLYFYSVLISNSNGSQEKIK 721 (987) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCHHHH T ss_conf 59999999996145469999999999999998624466678999999999999999854889999998457989864499 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHHHH-HCCCCCCHHHHHHHCCCCCCCHHHH Q ss_conf 999888863685035523342100353198698502685896989999999998-3799999789997348247998999 Q 537021.9.peg.7 804 KFCQDARQFNIQIMPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIMEAS-ADKPFDSLEDFCSRVDSKKLNRRVL 882 (1033) Q Consensus 804 ~~i~e~~~~gi~vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r-~~g~f~sl~df~~rv~~~~~nk~~l 882 (1033) .|+.||+++||+|+|||||+|.++|+++++.||+||++|||||+.+++.|+++| ++|||+|++||+.|++.+.+||+++ T Consensus 722 ~~i~e~~~~gi~vlpPdiN~S~~~~~~~~~~I~~gl~~Ikgvg~~~~~~Iv~~R~~~g~f~~~~df~~r~~~~~~~k~~~ 801 (987) T PRK07135 722 KYVGEAKNNNFKIYSPDINFSTENAVFDKGKIFLPLLMIKGLGTIAIKKIIDERNENGKYKSFFDFIKRLKFVGIKKAAI 801 (987) T ss_pred HHHHHHHHCCCEEECCCCCCCCCCCEEECCEEEEHHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHH T ss_conf 99999998899897998446888446579989980474367769999999999985879899999998647434899999 Q ss_pred HHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC--CCCCCCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 988867672347903899999899999999988654312777866665555--655776668999988899985788726 Q 537021.9.peg.7 883 ESLVFAGALDCFGYSRMQLLQSLDNIQKYAQWVEKNRTNKHENIFAHEKGT--FSDKITLEKFSVENSSVRFENEQRVLG 960 (1033) Q Consensus 883 e~Li~aGAfD~~~~~R~~ll~~i~~~~~~~~~~~~~~~~~q~~LF~~~~~~--~~~~~~~~~~~~~~~~e~l~~E~e~LG 960 (1033) |+||+|||||+|| ||++++.+++.+++++....++...+|.++|+..... ....+.++..++|+..|++.+|+|+|| T Consensus 802 e~Li~aGafd~~g-~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~l~~E~e~lG 880 (987) T PRK07135 802 EKLIFANALRSFG-NMQTLIHNFELLWNHAFFVLSKKEQTDGDLINTTDFFGLELSFENILEESKRNLEEEYKNEVEYLG 880 (987) T ss_pred HHHHHCCCCCCCC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHEEEE T ss_conf 9998568843579-979999879999999999998887514420355533344455776445689997889851050168 Q ss_pred EEECCCCHHHHHHHHHHCCCCCHHHHHCCCCCCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 2116385589999998748976889850558988999999961121015888769999998497049999539 Q 537021.9.peg.7 961 FYFSGHPLDVYKPILRKMGIKGYEESVATIRGNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 961 f~lS~HPl~~~~~~l~~~~~~~~~~~~~~~~g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iFP 1033 (1033) ||+|+||++.|+..++........++.. ..+..+.+++++ .+ +.+|+|+.|+|+|++|.||.+|+++|| T Consensus 881 ~yls~hPl~~y~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~~--r~~~~~~~~af~~l~D~tg~~e~~vf~ 949 (987) T PRK07135 881 MSFNAIDTVKYENKYLLNNQIRLNLLRI-GNNFRLILEIII-VI--RLRKKNKEYKKVIFADDTVEIKIFVND 949 (987) T ss_pred EECCCCCHHHHHHHHHHHCCCCHHHHCC-CCCCEEEEEEEE-EE--EECCCCCCEEEEEEECCCCCEEEEECH T ss_conf 7416698699999987624221777444-667428999999-99--982189757999998688658999868 No 19 >PRK09532 DNA polymerase III subunit alpha; Reviewed Probab=100.00 E-value=0 Score=2002.77 Aligned_cols=798 Identities=39% Similarity=0.649 Sum_probs=754.8 Q ss_pred CCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCC Q ss_conf 81334331444764432998999999997899889981165177499999999976996389999998547664334456 Q 537021.9.peg.7 10 SFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQG 89 (1033) Q Consensus 10 ~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~ 89 (1033) +||||||||+||||+|+++|++||++|+++||+||||||||||||+++|+++|+++|||||+|||+++..++.... T Consensus 2 sFvhLh~hS~yS~L~Ga~~~~eLv~~A~elG~~AlALTD~~~l~Gav~F~~a~~~~gIkPIiG~el~v~~~~~~~~---- 77 (874) T PRK09532 2 SFVGLHIHSDYSLLDGASQLPALIDRAIELGMPAIALTDHGVMYGAIELLKVCRNKGIKPIIGNEMYVINGDIEKQ---- 77 (874) T ss_pred CCCCCCCCCCCCHHCCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCCEEEEEEEEECCCCCCC---- T ss_conf 2215254663723063619999999999849999988508968899999999997799864562899717874444---- Q ss_pred CCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCC----CCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHH Q ss_conf 56666775399995696899999999989973248----77874007899986348988998197679678888779978 Q 537021.9.peg.7 90 KTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQ----GKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQ 165 (1033) Q Consensus 90 ~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~----~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~ 165 (1033) .....+|++|||||++||+|||+|+|.+++++. .+.+|++++++|.+ +++|||++|||..|+++++++.++.+ T Consensus 78 --~~~~~~hlvLLAkN~~GY~NL~kL~S~a~l~g~~~k~~~~~P~id~e~L~~-~~egLI~ls~cl~gev~~~ll~~~~~ 154 (874) T PRK09532 78 --KRHRKYHQVVLAKNTQGYKNLVKLTTISHLQGFQGKGIFARPCINKELLEQ-YHEGLIVTSACLGGEIPQAILAGRPD 154 (874) T ss_pred --CCCCCCCEEEEECCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHH-CCCCEEEECCCCCCHHHHHHHCCCHH T ss_conf --556677779972887999999999999987143578650577589999985-47994994587666277998679999 Q ss_pred HHHHHHHHHHHHCCCCEEEEEECCCHHHH-HHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 99999999987659868999943991158-99999999999818989980883007878899998766884698576531 Q 537021.9.peg.7 166 VAEKRLLTFKKLFGDRLYVNLQRHRGYDR-YRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKD 244 (1033) Q Consensus 166 ~a~~~l~~l~~~Fgd~~ylEl~~~~~~e~-~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~ 244 (1033) .|++.+.||+++||+|||+|+|+|+..++ .++..++++|+++++|+||||||||++++|+++|++|+||++|+++++.+ T Consensus 155 ~a~~~~~~~~~iFg~~~ylEiq~hg~~~~~~~n~~L~~lA~~~~iplVATndvhY~~~~D~~~~dvL~cI~~g~~l~~~~ 234 (874) T PRK09532 155 AARKVAKWYKDLFGDDFYLEIQDHGSQEDRIVNVEIVKIARELGIKIVATNDSHFISCYDVEAHDALLCIQTGKLITEDK 234 (874) T ss_pred HHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEECCCCEECCHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 99999999998648878999974898588999999999999819998861865015754536999998886098644212 Q ss_pred CCCCCCCCCCCCHHHHHHHCCH------HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 0024534323434788751220------3589987886411100013322102345557568877689889998765410 Q 537021.9.peg.7 245 RPRVTPDHYLKNRSEMVSIFSD------LPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRNKAVAG 318 (1033) Q Consensus 245 ~~~~~~~~yLks~eEM~~~f~~------~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~G 318 (1033) +.++++++||||++||.++|.+ +|+|++||.+||++|++.+.+.++.||+|++|++ .++++||+++|++| T Consensus 235 r~~~~~~~ylks~~EM~~lF~d~~~~~~~~eAl~NT~~IA~~c~~~~~~~~~~lP~f~~p~g----~t~~~~L~~l~~~G 310 (874) T PRK09532 235 RLRYSGTEYLKSAEEMRLLFRDHLPDDVIAEAIANTLEVADKIEPYNILGEPRIPNYPVPSG----HTADTYVEEVAWQG 310 (874) T ss_pred CCCCCHHHHCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC----CCHHHHHHHHHHHH T ss_conf 23688878688999999987641441348999997899998710013128866888778789----98689999999975 Q ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHH Q ss_conf 34201246432457838999998877898986063055788899999851047643344442046677655440375633 Q 537021.9.peg.7 319 LEVRLAQGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPL 398 (1033) Q Consensus 319 l~~R~~~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi 398 (1033) |.+||+... ....+++|++||+|||+||++|||++|||||||+|+|||++||+||||||||||||||||||||+|||| T Consensus 311 l~~R~~~~~--~~~~~~~y~~RL~~EL~vI~~mgf~~YFLIV~D~v~~Ar~~gI~vGpGRGSaAGSLVaY~LgIT~VDPi 388 (874) T PRK09532 311 LLERLKCKS--RAEVEPVYKERLEYELKMLQQMGFSTYFLVVWDYIKYARDNNIPVGPGRGSAAGSLVAYCLGITNIDPV 388 (874) T ss_pred HHHHCCCCC--CCCCCHHHHHHHHHHHHHHHHCCCCCEEHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHCCCCCCCCC T ss_conf 998706555--334779999999999999985487754149999999999769711578761143499997068767824 Q ss_pred HCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 32310567621001457875333656878999999983038021554220778778899999997527998898865431 Q 537021.9.peg.7 399 RFSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKL 478 (1033) Q Consensus 399 ~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~ 478 (1033) +|||||||||||+|++||||||||+++||++||+||++|||+++||||+||+||++|+||||||||||+|+.++|++||+ T Consensus 389 ~~~LLFERFLNpeR~smPDIDiDF~~~rReeVI~Yv~~KYG~~~VAqI~Tf~t~~ak~aiRDvgRvlg~p~~~~d~lak~ 468 (874) T PRK09532 389 HHGLLFERFLNPERKSMPDIDTDFCIERRDEMIKYVTEKYGEDRVAQIITFNRMTSKAVLKDVARVLDIPYGEADKMAKL 468 (874) T ss_pred CCCCCHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHCCCHHHHHHHHHC T ss_conf 26755220378877889998751770029999999997858577899985234243689999999849988899999963 Q ss_pred CCCCCCCCCCHHHH----HCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEECCCCCCCCCE Q ss_conf 01458982443445----22988788643074202688987201122100244744033525666662001236768866 Q 537021.9.peg.7 479 IPNDPAHPVSLQAI----ADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLP 554 (1033) Q Consensus 479 i~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~ 554 (1033) ||...+....++.. ...++|++.+..+|..++++++|.+|||+|||+|+||||||||++||++++|+++++ +|.+ T Consensus 469 ip~~~g~~~~l~~~~~~~~~~~~~~~~~~~~p~~~~~~~~a~~LeGlpR~~s~HAaGvVIs~~pL~~~vPl~~~~-~g~~ 547 (874) T PRK09532 469 IPVSRGKPTKLKVMISDETPEPEFKEKYDNDPRVRRWLDMAIRIEGTNKTFGVHAAGVVISSEPLDEIVPLQKNN-DGAV 547 (874) T ss_pred CCCCCCCCCHHHHHHHCCCCCHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCCCCEEEECCCCCCCCCCEEECC-CCCE T ss_conf 874468852377875225759899976415899999999999883474420025660585277744226624779-8981 Q ss_pred EEEEECHHHHHHCCCHHHHHHHCCHHHHHHHHHHHHH-HCCCCCCCCCCCCH-------------------HHHHHHHHC Q ss_conf 8851021334311232135654178999999998874-02543433234435-------------------889987620 Q 537021.9.peg.7 555 VTQFNMKWIEKAGLIKFDFLGLKTLTLLQKSLDFLAQ-RGVKVDLSLIPFDD-------------------HETYHLLTT 614 (1033) Q Consensus 555 ~~q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~-~~~~~d~~~ip~~D-------------------~~~~~l~~~ 614 (1033) +|||||+++|++||||||||||||||+|++++++|++ +|.++|+++||+|| ++||+||++ T Consensus 548 vtQ~d~~d~E~~GLlK~D~LGL~~Lt~I~~a~~lI~~~~g~~~dl~~ip~dD~~~~~~~~~~~~~~ip~dd~~t~~ll~~ 627 (874) T PRK09532 548 ITQYFMEDLESLGLLKMDFLGLKNLTMIQKTADLIKENRGQRIDLDQLPLDERKALEIIAKGEAKKLPKDVQKTHQLLER 627 (874) T ss_pred EEEECHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHCCCCCHHHHCCCCCCCCCCCCCCCHHHHHHHHC T ss_conf 36615765998499643720557688999999999983189878532885421121012333211367650899999863 Q ss_pred CCCCHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHCCCHHHHH--HHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHH Q ss_conf 1410122114534788887717404888876654114113420--03688850465785026966733410335530012 Q 537021.9.peg.7 615 KGTLGIFQLESSGMRQALEGMQPDCIEDIIALVSLYRPGPIDN--IVVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQ 692 (1033) Q Consensus 615 g~t~GvFQ~es~~~~~~l~~~~p~~~~dl~~~~al~RPGp~~~--~~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQ 692 (1033) |+|.||||+||+||+++|+++||++|+||||++||||||||++ ++.|++||||+|+++|+||.|++||++|||||||| T Consensus 628 g~T~GvFQ~ES~gm~~~l~~lkP~~f~Dlva~~ALyRPGP~~~gmi~~yi~rk~g~e~v~y~hp~Le~IL~~TyGV~vYQ 707 (874) T PRK09532 628 GDLEGIFQLESSGMKQIVRDLKPSNIEDISSILALYRPGPLDAGLIPKFINRKHGREPIDYQHQLLEPILNETYGVLVYQ 707 (874) T ss_pred CCEEEEEEECCHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCCCCHHHHHHHCCCCCEEEEH T ss_conf 88558998568899999997699979998999874277877778548999851899888889978998744578756549 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH Q ss_conf 56898878863268413078897620121101677878999998624999999999999999750001331014788899 Q 537021.9.peg.7 693 EQVMQIAQLLSGYSLSEADVLRRAMGKKIKEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVI 772 (1033) Q Consensus 693 EQvm~ia~~~ag~~~~ead~lRra~~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~ 772 (1033) |||||||+.+||||+||||.||||||||+.++|++++++|++||++||+++++|++||++|++||+|+| ||++||+.+ T Consensus 708 EQVMqia~~~AGfslgeAD~LRRAm~KK~~~~m~~~r~~Fi~ga~~~G~~~~~a~~if~~i~~Fa~Y~~--s~~~~~~~~ 785 (874) T PRK09532 708 EQIMKMAQDLAGYSLGEADLLRRAMGKKKISEMQKHREKFIDGAAKNGVPKKIAENLFDQMVKFAEYCL--SYDTEVLTV 785 (874) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCHHH--HHHHHHHHH T ss_conf 999999999978992899999998727988999999999999999869999999999999998736199--999999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 999999997302666777510023542025899988886368503552334 Q 537021.9.peg.7 773 SYQTAWMKTHYPVEFLAASMTLEMDNVEKIKKFCQDARQFNIQIMPPSVNT 823 (1033) Q Consensus 773 ~y~~aylK~~yP~eF~aa~L~~~~~~~~~~~~~i~e~~~~gi~vl~PdIN~ 823 (1033) .|++||+|+|||.+|+|+++|++++++.....+.+.+.+.+.+|.+-+++. T Consensus 786 ~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 836 (874) T PRK09532 786 EYGPLPIGKIVEENIECTVYSVDPNGFVYTQPIAQWHQRGEQEVFEYELED 836 (874) T ss_pred HHHHHHHHHHCCHHHEEEECCCCCCCEECCCHHHHHHHHCCEEEEEEECCC T ss_conf 753677632141322332224578862035333678871870489876158 No 20 >TIGR00594 polc DNA polymerase III, alpha subunit; InterPro: IPR004805 All proteins in this family for which functions are known are DNA polymerases. The group defines the alpha subunit of DNA polymerase III (2.7.7.7 from EC). DNA polymerase III is a complex, multichain enzyme responsible for most of the replicative synthesis in bacteria. This DNA polymerase also exhibits 3' to 5' exonuclease activity.; GO: 0003889 alpha DNA polymerase activity, 0008408 3'-5' exonuclease activity, 0006260 DNA replication, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=1985.66 Aligned_cols=1018 Identities=45% Similarity=0.741 Sum_probs=947.9 Q ss_pred CHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCC---- Q ss_conf 1334331444764432998999999997899889981165177499999999976996389999998547664334---- Q 537021.9.peg.7 11 FVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMT---- 86 (1033) Q Consensus 11 FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~---- 86 (1033) |||||+||.||++||+++++.+|++|+++||+|+|+||||||||+++||++|++.|||||+|||+|+..+++.... T Consensus 1 f~hlh~h~~yslldga~~~~~l~~~a~~~g~~~~~~tdhgn~~Ga~~fy~~~~~~gikPiiG~e~y~~~~~~~~~~~~lw 80 (1108) T TIGR00594 1 FVHLHVHSDYSLLDGAAKIKPLVKKAKELGMPALALTDHGNLFGAVEFYKACKAEGIKPIIGCEAYVAPGSRFDKKRILW 80 (1108) T ss_pred CCCCEECCCCHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHCCCCCCEECCEEEECCCCCHHHCCCCC T ss_conf 93210135401343354468999999981884023204530233688887653067862221223312563011000002 Q ss_pred ----C--CCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCC---CCCCCCHHHHHHHCCCCEEEECCCCCCHHHH Q ss_conf ----4--565666677539999569689999999998997324877---8740078999863489889981976796788 Q 537021.9.peg.7 87 ----N--QGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGK---QSVRICLSWLQEIGTEGLIMLTGGSSGPIDR 157 (1033) Q Consensus 87 ----~--~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~---~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~ 157 (1033) . .........+|+++||||++||+||++|+|.++.++.+. +.|+++.++|.+ |++||||+|+|+.|.+++ T Consensus 81 gdp~~~~~d~~~~~~~~hl~~lak~~~Gy~nl~~l~~~~~~~g~~~g~~~~p~~d~~~l~~-~~eGli~~~~c~~G~~~~ 159 (1108) T TIGR00594 81 GDPTKKSDDESLGKEAYHLILLAKNETGYRNLLKLSSLAYLEGFYGGIFYKPRIDKELLEE-YSEGLIALSACLGGEVPY 159 (1108) T ss_pred CCCCCCCCHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHCCCCCEEECCCCCHHHHHH-HCCCEEEECCCCCCHHHH T ss_conf 6876542012103310113688426532256888877765400023401034448899997-538828861543431577 Q ss_pred HHHCCCHHHHHHHHHHHHHHCCCC-EEEEEECCCHHHH-HHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHH Q ss_conf 887799789999999998765986-8999943991158-99999999999818989980883007878899998766884 Q 537021.9.peg.7 158 AFFFNSSQVAEKRLLTFKKLFGDR-LYVNLQRHRGYDR-YRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVA 235 (1033) Q Consensus 158 ~l~~~~~~~a~~~l~~l~~~Fgd~-~ylEl~~~~~~e~-~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~ 235 (1033) +++.|..+.+++++.||+++||++ ||+|+|+|+..++ ..+..++++++++++|+|||||+||.+++|+.+|+++.||+ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~fg~d~~~le~~~~g~~~~~~~~~~~~~~~~~~g~p~~atnd~hy~~~~d~~a~~~l~~~~ 239 (1108) T TIGR00594 160 LLLLGEEREAEEAALWYKELFGDDNFYLELQDHGLPDERVVNEALLELAKELGIPLVATNDVHYINPEDALAHEILLCIQ 239 (1108) T ss_pred HHHCCCHHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHCCCEEECCCCCCCCCCHHHHHHHHHHHH T ss_conf 87405466789999999873388633664311577056777699999998718861530640004730467889998752 Q ss_pred CCCCCCCCCCCCC-CC--CCCCCCHHHHHHHCCH------HHHHHHHHHHHHHHHH-----HHHHHHHCCCCCC------ Q ss_conf 6985765310024-53--4323434788751220------3589987886411100-----0133221023455------ Q 537021.9.peg.7 236 HSTVVSQKDRPRV-TP--DHYLKNRSEMVSIFSD------LPEALENTVEIARRCS-----FILQVHAPILPRF------ 295 (1033) Q Consensus 236 ~~~~l~~~~~~~~-~~--~~yLks~eEM~~~f~~------~peAi~NT~~IAe~c~-----~~l~~~~~~lP~f------ 295 (1033) .+.+++++++.++ .+ ++|++|++||.++|.+ +|+|+.||.+||++|+ ++.....+.+|.| T Consensus 240 ~g~~~~d~~~~~~~~~~~~~y~k~~ee~~~lf~~~~~~d~~p~a~~nt~~~a~~~~~~~d~~~~~~~~~~~P~~~~~~~~ 319 (1108) T TIGR00594 240 TGKTLSDPKRLKFKSDPEEFYLKSPEEMAELFADHLPGDEIPEALLNTLEIAEKCNLELDLVDVKLGPPRLPSYLFKKGA 319 (1108) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCEEEECCCCCCCCHHHHHCCC T ss_conf 47655654445688985211237889999985420576310478899999988642210103423787536403431256 Q ss_pred -CCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCC---CCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCC Q ss_conf -575688776898899987654103420124643---2457838999998877898986063055788899999851047 Q 537021.9.peg.7 296 -IEKECDDIQKEEENELRNKAVAGLEVRLAQGAI---AKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNN 371 (1033) Q Consensus 296 -~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~~~~~---~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~g 371 (1033) ..|.. ..+.++||+++|++||..||..... .+......|++||++||+||++|||++|||||||+|+||+++| T Consensus 320 ~~~~~~---~~~~~~yl~~~~~~Gl~~r~~~~~~~~~~~~~~~~~y~erl~~el~~i~~~gf~~yflivwd~i~~a~~~g 396 (1108) T TIGR00594 320 YQIPPD---FTSSEDYLEHLAEEGLRERLALGPPGEFERAKDRAEYKERLEYELDVINKMGFPGYFLIVWDFIKWAKDNG 396 (1108) T ss_pred CCCCCC---CCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCC T ss_conf 567888---87548899999987677530467876332210104678888888999861687506788999999887668 Q ss_pred C-CCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHH Q ss_conf 6-433444420466776554403756333231056762100145787533365687899999998303802155422077 Q 537021.9.peg.7 372 I-PVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFG 450 (1033) Q Consensus 372 I-~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~ 450 (1033) | +|||||||||||||+|+|+||+||||+|+|+|||||||+|++|||||+|||.++|++||+|+.+|||+++||||+||+ T Consensus 397 i~pvGPGrGs~aGslvay~l~itd~dP~~~~llferflnPer~s~Pd~d~dfc~~~r~~~i~y~~~~yG~d~vaq~itfg 476 (1108) T TIGR00594 397 IRPVGPGRGSGAGSLVAYALKITDIDPLKHGLLFERFLNPERISMPDIDIDFCDERRDEVIEYVAEKYGKDNVAQIITFG 476 (1108) T ss_pred CCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHEEECC T ss_conf 70047876640467888876403546432231112104743245774434445330579999999841552100012201 Q ss_pred HHHHHHHHHHHHHH-CCCCHHHHHHHHHHCCCCCCCCCCHH-HHHCC-------------------HHHHHHHHCCCCCH Q ss_conf 87788999999975-27998898865431014589824434-45229-------------------88788643074202 Q 537021.9.peg.7 451 SLQAKAALRDVGRA-LQMPYSQVDRLCKLIPNDPAHPVSLQ-AIADD-------------------SRFREARLADPSVD 509 (1033) Q Consensus 451 t~~~k~aird~~r~-~g~~~~~~~~~~~~i~~~~~~~~~~~-~~~~~-------------------~~~~~~~~~~~~~~ 509 (1033) |+++|+++|||||+ +|+|+.+.|+++|.||..++...+++ +...+ +++++.+.++|..+ T Consensus 477 ~~~~k~~~~dv~r~~~~~p~~~~d~~~k~~P~~~~~~~~~~~~~~~~~~~~~~~~g~~~~d~~~~~~~~~~~~~~~~~~~ 556 (1108) T TIGR00594 477 TLKAKAALRDVGRVHLDIPYAEADKLAKLIPPGPGLWLTLKEALEASPEGPLWELGEHRTDIYDEEPQLKELYEEDPEVK 556 (1108) T ss_pred CHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCHHHHCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHH T ss_conf 00034677655554227866678888873687567642122111455125787612100102333567766522678899 Q ss_pred HHHHHHHHHHHHHHCCCCCCCEEEE-CCCCHHHHEECCCCCCCCCEEEEEECHHHHHHCCCHHHHHHHCCHHHHHHHHHH Q ss_conf 6889872011221002447440335-256666620012367688668851021334311232135654178999999998 Q 537021.9.peg.7 510 RLLEISQKLEGLYRHASTHAAGIVI-GDRPLSQLVPMYRDVRSDLPVTQFNMKWIEKAGLIKFDFLGLKTLTLLQKSLDF 588 (1033) Q Consensus 510 ~~~~~a~~leg~~r~~~~Ha~Gvvi-~~~~l~~~~p~~~~~~~~~~~~q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~~~~ 588 (1033) .++++|.+|||++||+|+||||||| +++||++++|++.+.++|..+|||+|+++|++||+|||||||+|||+|+.+.++ T Consensus 557 ~~~~~a~~leg~~r~~~~haag~v~~~~~pl~~~~Pl~~~~~~g~~~tq~~~~~~e~~gl~k~dflGl~~lt~~~~~~~~ 636 (1108) T TIGR00594 557 ELIDMALKLEGLNRNAGVHAAGVVICSPEPLTDYVPLYKDEEGGAISTQYDMDDLEDLGLLKMDFLGLKTLTLIKDALEL 636 (1108) T ss_pred HHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCEECCCCCEEEECCCHHHHHHCCCHHHHHHCCHHHHHHHHHHHH T ss_conf 99999986246653333110146751764100046630156677124113634343214301111110256788999999 Q ss_pred HHH-HC----CCCCCCCCCCCHHHHHHHHHCCCCCHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHCCCHHH-H--HHHH Q ss_conf 874-02----54343323443588998762014101221145347888877174048888766541141134-2--0036 Q 537021.9.peg.7 589 LAQ-RG----VKVDLSLIPFDDHETYHLLTTKGTLGIFQLESSGMRQALEGMQPDCIEDIIALVSLYRPGPI-D--NIVV 660 (1033) Q Consensus 589 i~~-~~----~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l~~~~p~~~~dl~~~~al~RPGp~-~--~~~~ 660 (1033) |++ +| .++++..||+||++||.++.+|+|.||||+||.||+.++++++|.+|+|+++++|||||||| + .++. T Consensus 637 ~~~~~g~~~~~~~~~~~~p~dd~~~~~~l~~g~~~g~fq~es~g~~~~~k~l~P~~~ed~~a~~alyrPGP~~~~g~~~~ 716 (1108) T TIGR00594 637 IRKKRGKNGKIDLDLASIPLDDPKTFKLLQEGDTTGVFQLESRGMKDLLKRLKPDGFEDIIALNALYRPGPMESGGLIPD 716 (1108) T ss_pred HHHHCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCEEEEECCCCHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCHH T ss_conf 87624753433344101676523678876346520232203555789996417762467888875148886656665426 Q ss_pred HHHHHCCC-CCCCCC-----CHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 88850465-785026-----966733410335530012568988788632684130788976201211016778789999 Q 537021.9.peg.7 661 YNNRKNGK-EKIVSI-----HPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEADVLRRAMGKKIKEEMDKQKERFIS 734 (1033) Q Consensus 661 yi~rk~g~-e~~~~~-----hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ead~lRra~~kk~~~~~~~~~~~f~~ 734 (1033) |++||||+ +++.|+ ||.|++||++||||+|||||||+|++.+||||+|+||.||||||||+.++|.+.+.+|++ T Consensus 717 ~~~rk~g~~~~~~yp~~~~~~~~l~~~l~~tyG~~vyqeq~m~~a~~~~g~~lg~ad~lrra~gkkk~~~~~~~~~~f~~ 796 (1108) T TIGR00594 717 FIDRKHGRVEPIEYPDHPWLHPDLEPILKETYGVIVYQEQVMQIAQRLAGFSLGEADLLRRAMGKKKAEEMAKEREKFVE 796 (1108) T ss_pred HHHHCCCCCCCCCCCCHHHCCHHHHHHHCCCCCEEEHHHHHHHHHHHHHCCCCCHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 77622685343222210111322476520011333107889999987503565425677765255668899988888875 Q ss_pred HHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCH-HHHHHHHHHHHCC Q ss_conf 986249999999999999997500013310147888999999999973026667775100235420-2589998888636 Q 537021.9.peg.7 735 GASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVE-KIKKFCQDARQFN 813 (1033) Q Consensus 735 g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~-~~~~~i~e~~~~g 813 (1033) ||.++|++++.|.++|+.+.+||+||||||||+||++++|||||||+|||++|||++|+++.++.+ ++..|+.||+.+| T Consensus 797 g~~~~g~~~~~a~~~~d~~~~fa~ygfnksh~~ay~~~~y~taylka~yp~e~~a~~lt~~~~~~~~k~~~~~~~~~~~g 876 (1108) T TIGR00594 797 GAEKNGYDPEIAENLFDLIEKFAGYGFNKSHAAAYGLISYQTAYLKANYPAEFMAALLTSEIDDNESKVAKYIDEAKKLG 876 (1108) T ss_pred HHHHCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHCC T ss_conf 67650788899999999887652234450256677777776666541262788777653024521467899999997738 Q ss_pred CCCCCCCCCCCCEEE-EECCCE-EEEEHHHHCCCCHHHHHHHHHHHH--CCCCCCHHHHHHHCCCCCCCHHHHHHHHHCC Q ss_conf 850355233421003-531986-985026858969899999999983--7999997899973482479989999888676 Q 537021.9.peg.7 814 IQIMPPSVNTPCVDF-KVGDNR-IYYSLAAIKGVGTTTARHIMEASA--DKPFDSLEDFCSRVDSKKLNRRVLESLVFAG 889 (1033) Q Consensus 814 i~vl~PdIN~S~~~f-~~~~~~-Ir~Gl~~Ikgvg~~~~~~Ive~r~--~g~f~sl~df~~rv~~~~~nk~~le~Li~aG 889 (1033) |.|+|||||.|..+| ++.+++ |+|||++|||||+.+++.|+++|+ +|+|+++.|||.|++.+.+||+.+|.||++| T Consensus 877 ~~~lpPd~n~s~~~f~~~~~~~~i~~gl~~~k~~G~~~~~~~~~~r~~~~g~~~~l~d~~~~~~~~~~n~k~~e~l~~~G 956 (1108) T TIGR00594 877 IEVLPPDINKSGRDFTTVEGGGEIRYGLGAIKGVGEGVVKSIIEARENKNGPFKSLADFCNRVDFKKLNKKVLEALIKAG 956 (1108) T ss_pred CEECCCCCCCCCCCCCEECCCCEEEEECHHHCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHC T ss_conf 54107756656565410126850575100111033367899998752157762027898742563021067788887511 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHH-HHCCCCCCCCCCCCCCCCCCCC--CCCCCCCHHHH-HHHHHHHHCEEECC Q ss_conf 72347903899999899999999988654-3127778666655556557766--68999988899-98578872621163 Q 537021.9.peg.7 890 ALDCFGYSRMQLLQSLDNIQKYAQWVEKN-RTNKHENIFAHEKGTFSDKITL--EKFSVENSSVR-FENEQRVLGFYFSG 965 (1033) Q Consensus 890 AfD~~~~~R~~ll~~i~~~~~~~~~~~~~-~~~~q~~LF~~~~~~~~~~~~~--~~~~~~~~~e~-l~~E~e~LGf~lS~ 965 (1033) |||+++.+|..++.+++.++.++....+. ...+|.+||+............ +...+|+..++ +..|+|+||+|+|+ T Consensus 957 ~fd~~~~~r~~l~~~~~~~~~~~~~~~~~~~~~g~~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~e~e~lG~y~~g 1036 (1108) T TIGR00594 957 AFDSLGPNRKTLLASLDDALDAASRKKKAEEALGQNSLFGALDEGEKPEYVETYPPDEEWSDQELLLKLEKELLGFYVSG 1036 (1108) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCEEEEC T ss_conf 21222134889998888999987665567765021111244334643111123212233104789875434451223305 Q ss_pred CCHHHHH---HHHHHCCCC-CHHHHHCCCCCCEEEEEEEEEEEEEEC-CCCCCEEEEEEEECCCCEEEEEEC Q ss_conf 8558999---999874897-688985055898899999996112101-588876999999849704999953 Q 537021.9.peg.7 966 HPLDVYK---PILRKMGIK-GYEESVATIRGNNIRLAAMVVSKQQKK-TRKGSRIGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 966 HPl~~~~---~~l~~~~~~-~~~~~~~~~~g~~v~iaG~V~~i~~~~-TKkG~~mafltleD~~g~~e~~iF 1032 (1033) ||++.|+ ..+...... ...+.........+.++|.+.++.+.. +|+|++|+|++++|.+|.+|+++| T Consensus 1037 hPl~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~a~~~~~d~~g~~e~~~f 1108 (1108) T TIGR00594 1037 HPLDAYEHLLKLLSNLATPLVALDDLEVREDSKVLVLGVLNSVKKVITTKNGKPMAFLTLEDLTGSIELVVF 1108 (1108) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEHHHHHHHHHHEEECCCCCEEEEEEEECCCCCEEEEEC T ss_conf 753689998876542100233343321664310101222333321001025760367775103574433418 No 21 >PRK00448 polC DNA polymerase III PolC; Validated Probab=100.00 E-value=0 Score=1545.78 Aligned_cols=791 Identities=23% Similarity=0.300 Sum_probs=664.8 Q ss_pred CHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCC---- Q ss_conf 1334331444764432998999999997899889981165177499999999976996389999998547664334---- Q 537021.9.peg.7 11 FVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMT---- 86 (1033) Q Consensus 11 FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~---- 86 (1033) =|+||+||.||.+||++++++||++|+++||+||||||||+++|.++|+.+|+++|||||+|||+|+.++...... T Consensus 333 RVELH~HT~mS~mDg~~~~~~lv~~A~~wGh~AiAITDHg~vq~fPe~~~~~kk~~iK~IyG~E~y~v~d~~~iv~n~~~ 412 (1436) T PRK00448 333 RVELHLHTKMSTMDAIPSVSELVKQAAKWGHKAIAITDHGVVQAFPEAYEAAKKAGIKVIYGVEANLVDDGVPIVYNEVD 412 (1436) T ss_pred EEEEEECCCCCHHCCCCCHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCEEECCCC T ss_conf 48986147751020688999999999984998799943650647999999876559848976479974277741337877 Q ss_pred -------------------------------------------------------------------------------- Q ss_conf -------------------------------------------------------------------------------- Q 537021.9.peg.7 87 -------------------------------------------------------------------------------- 86 (1033) Q Consensus 87 -------------------------------------------------------------------------------- 86 (1033) T Consensus 413 ~~l~~~~yVvFDiETTGls~~~d~IiEigAvki~~g~iid~f~~fi~P~~~i~~~i~~lT~Itd~mv~~~~~~~evl~~f 492 (1436) T PRK00448 413 RDLKDATYVVFDVETTGLSAVYDEIIEIGAVKIKNGEIIDKFEEFIKPGHPLSAFTTELTGITDDMVKDAPSIEEVLPKF 492 (1436) T ss_pred CCCCCCCEEEEEECCCCCCCCCCCEEEEEEEEEECCEEEEEEHHHCCCCCCCCHHHHHCCCCCHHHHCCCCCHHHHHHHH T ss_conf 74676637998600688886667236875798637817541021038999797545321478847863898789999999 Q ss_pred --------------------------------------------------------------------CCCC-------- Q ss_conf --------------------------------------------------------------------4565-------- Q 537021.9.peg.7 87 --------------------------------------------------------------------NQGK-------- 90 (1033) Q Consensus 87 --------------------------------------------------------------------~~~~-------- 90 (1033) ...+ T Consensus 493 ~~f~~d~vlVAHNa~FD~~Fi~~~~~~~~~~~~~~p~iDTl~lsr~l~p~~k~~~L~~l~k~~~v~~~~hHRA~~Da~~t 572 (1436) T PRK00448 493 KEFCGDSVLVAHNASFDVGFINTNYEKLGLEKIKNPVIDTLELSRFLYPEYKSHRLNTLAKKFGVELEHHHRADYDAEAT 572 (1436) T ss_pred HHHHCCCEEEEECCCCCHHHHHHHHHHCCCCCCCCCEECHHHHHHHHCCCCCCCCHHHHHHHHCCCCCCCCCCCCCHHHH T ss_conf 99828977998458656889999999808986679723029989875510111447889988488865454432238889 Q ss_pred -------------------------------CCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHH Q ss_conf -------------------------------6666775399995696899999999989973248778740078999863 Q 537021.9.peg.7 91 -------------------------------TPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEI 139 (1033) Q Consensus 91 -------------------------------~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~ 139 (1033) -.....+|+++||||++||+||++|+|.||++ +||++||++++.|.+ T Consensus 573 ~~i~~~~l~~~~~~~i~~~~~ln~~~~~~~~~~~~~~yHlilLAkN~~G~kNL~KLvS~s~~e-gFy~kPRIdk~lL~~- 650 (1436) T PRK00448 573 AYLLIKFLKDLKEKGITNLDELNKKLGSEDAYKKARPKHATILAKNQVGLKNLFKLVSLSNTK-YFYRVPRIPRSLLDK- 650 (1436) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHCCCHHHHCCCCCEEEEEEECHHHHHHHHHHHHHHHHC-CCCCCCCCCHHHHHH- T ss_conf 999999999998748743788865313411331377735999995578899999999999864-675789989999998- Q ss_pred CCCCEEEECCCCCCHHHHHHHCCCHHHHHHHHHHHH--HHCC-CCEEEEEECCCHHHH----HHHHHHHHHHHHHCCCEE Q ss_conf 489889981976796788887799789999999998--7659-868999943991158----999999999998189899 Q 537021.9.peg.7 140 GTEGLIMLTGGSSGPIDRAFFFNSSQVAEKRLLTFK--KLFG-DRLYVNLQRHRGYDR----YRESQVVQLAYTHELPLV 212 (1033) Q Consensus 140 ~~egLIvlsg~~~g~i~~~l~~~~~~~a~~~l~~l~--~~Fg-d~~ylEl~~~~~~e~----~~~~~l~~lA~~~~iPlV 212 (1033) |++|||++|||..|+++++++.++.++|++++.+|. ++|| ++|+.+++++...++ ..+..++++|+++++|+| T Consensus 651 yreGLI~~SaC~~GEv~~~il~~~~eea~~~a~~YDyiEI~p~~n~~~~i~~~~i~~e~~l~~in~~li~lak~~~~pvV 730 (1436) T PRK00448 651 YREGLLIGSACEEGEVFDAMLQKGDEEVEEIAKFYDFIEIQPPANYQHLIERELIKDEEELEEIIKNLIELGKKLNKPVV 730 (1436) T ss_pred HCCCCEEECCCCCCHHHHHHHCCCHHHHHHHHHHCCCEEECCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCEE T ss_conf 53991994787657999999818999999999753713347833358999757982299999999999999999699889 Q ss_pred EECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCHHHH------HHHHHHHHHHHHH-HHH Q ss_conf 808830078788999987668846985765310024534323434788751220358------9987886411100-013 Q 537021.9.peg.7 213 ATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKDRPRVTPDHYLKNRSEMVSIFSDLPE------ALENTVEIARRCS-FIL 285 (1033) Q Consensus 213 aTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~~~~~~~~~yLks~eEM~~~f~~~pe------Ai~NT~~IAe~c~-~~l 285 (1033) |||||||++|+|+.+|++|.|++.+.... ++. ..+++||||++||.+.|+.+|+ +++||++||++|+ +.+ T Consensus 731 ATnDvHYl~~eD~~~~diLl~~q~~~~~~--~r~-~~~~~y~kTteEMl~~F~~l~e~~a~eivi~NT~~IAd~~e~i~p 807 (1436) T PRK00448 731 ATGDVHYLDPEDKIYRKILVASQGGGNPL--NRF-PLPELHFRTTDEMLDEFSFLGEELAKEIVVENTNKIADMIEDIEP 807 (1436) T ss_pred EECCCCCCCHHHHHHHHHHHCCCCCCCCC--CCC-CCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCEE T ss_conf 97887779877899999995116689823--467-788852089999999860389878668987669999986026331 Q ss_pred HHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 32210234555756887768988999876541034201246432457838999998877898986063055788899999 Q 537021.9.peg.7 286 QVHAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRLAQGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQ 365 (1033) Q Consensus 286 ~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~ 365 (1033) .....+.|+|+ +++++|+++|++|+++||++.. ++.|.+||+|||++|.+|||++|||||||+|+ T Consensus 808 ~~~~l~~P~~e---------~a~e~lr~l~~~~a~~~YG~~l------Peiv~~RLe~EL~vIi~~GFa~yyLIv~dlV~ 872 (1436) T PRK00448 808 IKDKLYTPKIE---------GAEEEIRELTYKKAHEIYGEPL------PEIVEKRIEKELNSIIGNGFAVIYLISQKLVK 872 (1436) T ss_pred CCCCCCCCCCC---------CHHHHHHHHHHHHHHHHCCCCC------CHHHHHHHHHHHHHHHHCCCCEEEHHHHHHHH T ss_conf 78887899989---------9799999999867898718999------89999999999999984699567328999999 Q ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHH-----------------------------------------CCCHH Q ss_conf 8510476433444420466776554403756333-----------------------------------------23105 Q 537021.9.peg.7 366 WAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLR-----------------------------------------FSLLF 404 (1033) Q Consensus 366 ~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~-----------------------------------------~~LlF 404 (1033) ||+++||+||| |||||||+|||+||||+||||+ |||+| T Consensus 873 ~s~~~Gy~VGp-RGSvgsSlVAy~lgITeVnPL~pHY~Cp~Cky~ef~~~g~~~sGyDLpdk~CP~Cg~~l~kdGhdIpF 951 (1436) T PRK00448 873 KSLEDGYLVGS-RGSVGSSFVATMIGITEVNPLPPHYVCPNCKYSEFFTDGSVGSGYDLPDKDCPKCGTKLKKDGHDIPF 951 (1436) T ss_pred HHHHCCCEECC-CCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCH T ss_conf 99867972478-86389999999827776479972225765540000246654657778656686434300257788671 Q ss_pred HHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCC Q ss_conf 67621001457875333656878999999983038021554220778778899999997527998898865431014589 Q 537021.9.peg.7 405 ERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDPA 484 (1033) Q Consensus 405 ERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~~ 484 (1033) ||||||+|++|||||+||++++|++|++|+.++||+++|+||+||+|+++|+|..-| -++.. ..+ T Consensus 952 ErFLgfer~k~PDIDldFs~e~q~~vh~Yv~~~yG~d~V~ragTigT~a~KtA~G~v---k~y~e------------~~~ 1016 (1436) T PRK00448 952 ETFLGFKGDKVPDIDLNFSGEYQPVAHNYTKVLFGEDHVFRAGTIGTVAEKTAYGYV---KGYEE------------DRN 1016 (1436) T ss_pred HHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHHH---HHHHH------------HHC T ss_conf 430288999899988676823568999999998496657886306668998999999---86588------------718 Q ss_pred CCCCHHHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCC--CHHHHEECCCCC---CCCCEEEEEE Q ss_conf 82443445229887886430742026889872011221002447440335256--666620012367---6886688510 Q 537021.9.peg.7 485 HPVSLQAIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDR--PLSQLVPMYRDV---RSDLPVTQFN 559 (1033) Q Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~--~l~~~~p~~~~~---~~~~~~~q~d 559 (1033) .. .. ..++-++|+++||++|++|+|||||||+|. ++.+++|++.++ .+++.+|||| T Consensus 1017 ~~----------------~~---~aei~rlA~~leGvkRttGqHaaGIVIvP~~~~i~dftPlq~padd~~~~~~tThFd 1077 (1436) T PRK00448 1017 KF----------------YR---GAEIDRLAKGCTGVKRTTGQHPGGIIVVPDYMDIYDFTPIQYPADDVNSEWKTTHFD 1077 (1436) T ss_pred CC----------------CC---CHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCEEEEEEC T ss_conf 86----------------76---268999998677999876828988998289876120055004677888884897102 Q ss_pred CHHHHHHCCCHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHH--------------CCC Q ss_conf 213343112321356541789999999988740254343323443588998762014101221--------------145 Q 537021.9.peg.7 560 MKWIEKAGLIKFDFLGLKTLTLLQKSLDFLAQRGVKVDLSLIPFDDHETYHLLTTKGTLGIFQ--------------LES 625 (1033) Q Consensus 560 ~~~~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ--------------~es 625 (1033) |+.+++ ||||||||||++||+|+...++. | +|+.+||+|||+||+||++|+|.|||| |++ T Consensus 1078 ~~si~~-~LLKmDfLGhd~pTvIr~l~dlt---g--iD~~~IPldD~~t~~Lfss~e~lGV~~~~~~~~tGtlGipEFgT 1151 (1436) T PRK00448 1078 FHSIHD-NLLKLDILGHDDPTMIRMLQDLT---G--IDPKTIPMDDPKVMSLFSSTEALGVTPEQIGEKTGTLGIPEFGT 1151 (1436) T ss_pred CCCHHC-CEEEEECCCCCCHHHHHHHHHHH---C--CCHHHCCCCCHHHHHHHHCCCCCCCCHHHHCCCCCCCCCCCCCC T ss_conf 330315-81465337899568999999863---9--88442899988999987087746758777177656304643465 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHCCCHHHHHHH--HHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHH Q ss_conf 3478888771740488887665411411342003--68885046578502696673341033553001256898878863 Q 537021.9.peg.7 626 SGMRQALEGMQPDCIEDIIALVSLYRPGPIDNIV--VYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIAQLLS 703 (1033) Q Consensus 626 ~~~~~~l~~~~p~~~~dl~~~~al~RPGp~~~~~--~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~a 703 (1033) +.||+||++++|++|+||+++++|.+++++-.++ .+|.+. ++.|. + |+.+|||||.-... . T Consensus 1152 ~fvR~mL~~~kP~~F~DLi~IsgLsHGtdVwl~Naq~lI~~~---------~~~L~----e---vI~~rddIM~yLi~-~ 1214 (1436) T PRK00448 1152 KFVRQMLEETKPKTFAELVQISGLSHGTDVWLGNAQELIKEG---------IATLS----D---VIGCRDDIMVYLIH-K 1214 (1436) T ss_pred HHHHHHHHHCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHCC---------CCCCC----C---EEEEHHHHHHHHHH-H T ss_conf 889999997699979999999704578644433578998708---------87603----5---56417879999999-6 Q ss_pred CCCHHHH----HHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH Q ss_conf 2684130----788976201211016778789999986249999999999999997500013310147888999999999 Q 537021.9.peg.7 704 GYSLSEA----DVLRRAMGKKIKEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWM 779 (1033) Q Consensus 704 g~~~~ea----d~lRra~~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~ayl 779 (1033) |+.+..| +.+||+.| +.+++++.+.++|+++|..+.+ +++ +||||||||+||+++|||+||| T Consensus 1215 gl~~~~AF~IME~vRKGkg---------~~~e~~~~m~~~~Vp~wyidsc----~KI-kY~FnKaHAaAY~lmAyrtAyl 1280 (1436) T PRK00448 1215 GLEPKMAFKIMEKVRKGKG---------LTEEEEELMKENNVPDWYIESC----KKI-KYMFPKAHAAAYVLMALRIAYF 1280 (1436) T ss_pred HCCCCHHHHHHHHHHHHCC---------CCHHHHHHHHHCCCCHHHHHHH----HHC-CCCCCCHHHHHHHHHHHHHHHH T ss_conf 5061058999999986668---------9878999999869958999888----870-8388828999999999999999 Q ss_pred HHHHHHHHHHHHHCCCCCCCHH---------------------------------HHHHHHHHHHCCCCCCCCCCCCCCE Q ss_conf 9730266677751002354202---------------------------------5899988886368503552334210 Q 537021.9.peg.7 780 KTHYPVEFLAASMTLEMDNVEK---------------------------------IKKFCQDARQFNIQIMPPSVNTPCV 826 (1033) Q Consensus 780 K~~yP~eF~aa~L~~~~~~~~~---------------------------------~~~~i~e~~~~gi~vl~PdIN~S~~ 826 (1033) |+|||+|||||+||...++++. .-.++.||+.+||+|+|||||+|.+ T Consensus 1281 K~hYP~efyAA~fT~r~~~fD~~~~~~g~~~i~~~i~ei~~~~~~~~~ke~~~~~~le~~~Em~~rGi~~lppDinkS~a 1360 (1436) T PRK00448 1281 KVHYPLAYYAAYFSVRADDFDLETMSKGKEAIKAKMEEINAKGNDASKKEKDLLTVLEIALEMLERGFKFQKVDLYKSDA 1360 (1436) T ss_pred HHCCHHHHHHHHHHCCCCCCCHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCEECCCCCCCCCC T ss_conf 83778999999841553443277762438889999999987301222001114779999999997798453884024578 Q ss_pred -EEEECCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCC Q ss_conf -0353198698502685896989999999998379999978999734824799899998886767234790 Q 537021.9.peg.7 827 -DFKVGDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLVFAGALDCFGY 896 (1033) Q Consensus 827 -~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li~aGAfD~~~~ 896 (1033) .|++++++||+||++|||||++++++||++|++|||+|++||+.|. ++||+.+|.|+++||||++.. T Consensus 1361 ~~F~i~~~~Ir~pl~AIkgvG~~aa~~IveaR~eg~F~S~eDf~~R~---kvnKtviE~L~~~G~ld~Lpe 1428 (1436) T PRK00448 1361 TEFIIEGDTLIPPFNAIPGLGENVAKSIVEAREEGEFLSKEDLRKRT---KVSKTLIEKLDELGILDDLPE 1428 (1436) T ss_pred CCEEECCCEEEECHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHC---CCCHHHHHHHHHCCCCCCCCH T ss_conf 54077299077674664461199999999998079989989999876---786899999997699789982 No 22 >pfam07733 DNA_pol3_alpha Bacterial DNA polymerase III alpha subunit. Probab=100.00 E-value=0 Score=1103.26 Aligned_cols=378 Identities=58% Similarity=0.981 Sum_probs=366.9 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHH Q ss_conf 99987654103420124643245783899999887789898606305578889999985104764334444204667765 Q 537021.9.peg.7 309 NELRNKAVAGLEVRLAQGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAY 388 (1033) Q Consensus 309 ~~Lr~l~~~Gl~~R~~~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY 388 (1033) +||+++|++|+++||++ ..+|++||+|||+||.+|||++|||||||+|+|||++||+||||||||||||||| T Consensus 1 ~~L~~l~~~g~~~r~~~--------~~~~~~Rl~~EL~vI~~~gf~~YFLiv~d~v~~a~~~gi~vgpgRGSaagSlV~y 72 (381) T pfam07733 1 EYLRKLCEEGLKERYLP--------GIPVRERLEKELNVIIKMGFAGYFLIVWDLVKWAKDNGIPVGPGRGSAAGSLVAY 72 (381) T ss_pred CHHHHHHHHHHHHHCCC--------CCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCEECCCCCCHHHHHHHH T ss_conf 97899999999987699--------8409999999999998779951479999999999987961379987379999999 Q ss_pred HHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCC Q ss_conf 54403756333231056762100145787533365687899999998303802155422077877889999999752799 Q 537021.9.peg.7 389 ALTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMP 468 (1033) Q Consensus 389 ~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~ 468 (1033) |||||+||||+|||+|||||||+|.+|||||+||++++|++|++|+++|||.++||+|+||+|+++|+|+||+||+ T Consensus 73 ~LgIT~vDPi~~~LlFERFLn~~R~~~PDIDiDf~~~~R~~vi~y~~~~yG~~~va~i~t~~t~~~ksairdv~r~---- 148 (381) T pfam07733 73 LLGITEVDPLKHDLLFERFLNPERDSMPDIDIDFEDERREEVIDYVKEKYGEDRVAQIATFGTLAAKSAIRDVGRA---- 148 (381) T ss_pred HHCCCCCCCCCCCCCHHHHCCCCCCCCCCEEEECCCCCHHHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHHH---- T ss_conf 8568240863147539987298889899823478853218999999998487878998516544799999999999---- Q ss_pred HHHHHHHHHHCCCCCCCCCCHHHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEECCCC Q ss_conf 88988654310145898244344522988788643074202688987201122100244744033525666662001236 Q 537021.9.peg.7 469 YSQVDRLCKLIPNDPAHPVSLQAIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVPMYRD 548 (1033) Q Consensus 469 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~ 548 (1033) +++++|++|||+|||+|+||||||||++||.+++|+++. T Consensus 149 -----------------------------------------~~~~~a~~leG~~R~~g~HaaGvVIs~~pl~~~~Pl~~~ 187 (381) T pfam07733 149 -----------------------------------------ELIELAKKLEGLPRHTGQHAGGVVISDDPLTDFVPLQKP 187 (381) T ss_pred -----------------------------------------HHHHHHHHHCCCCCCCCCCCCEEEEECCCCCHHCCCEEC T ss_conf -----------------------------------------999999983697665787888899868751221262358 Q ss_pred CCCCCEEEEEECHHHHHHCCCHHHHHHHCCHHHHHHHHHHHHH-HCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCCCHH Q ss_conf 7688668851021334311232135654178999999998874-025434332344358899876201410122114534 Q 537021.9.peg.7 549 VRSDLPVTQFNMKWIEKAGLIKFDFLGLKTLTLLQKSLDFLAQ-RGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLESSG 627 (1033) Q Consensus 549 ~~~~~~~~q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~-~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~es~~ 627 (1033) ++ +.++|||||+++|++||+|||||||+|||+|++++++|++ +|.++|++.||+||++||+||++|+|.||||+||+| T Consensus 188 ~~-~~~~tq~d~~~~E~~GLlK~D~LGL~~LsvI~~~~~~i~~~~~~~~dl~~ipldD~kt~~ll~~G~T~GiFQ~ES~g 266 (381) T pfam07733 188 AD-DRPVTQFDKDDLEDLGLLKFDFLGLRTLTIIRDALDLIKENRGIDIDLATIPLDDPKTYKLLSSGDTLGVFQFESRG 266 (381) T ss_pred CC-CCEEEEEEHHHHHHCCCCCEEECCCCCHHHHHHHHHHHHHHCCCCCCHHHCCCCHHHHHHHHHCCCCCEEEEECCHH T ss_conf 99-98589720355887598225566888479999999999974498777776472469999998658712253436677 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHCCCHHHHHH--HHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCC Q ss_conf 7888877174048888766541141134200--36888504657850269667334103355300125689887886326 Q 537021.9.peg.7 628 MRQALEGMQPDCIEDIIALVSLYRPGPIDNI--VVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIAQLLSGY 705 (1033) Q Consensus 628 ~~~~l~~~~p~~~~dl~~~~al~RPGp~~~~--~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~ 705 (1033) |+++|++++|++|+||++++||||||||+++ +.|++||||+++++|+||.|++||++||||+|||||||+||+.+||| T Consensus 267 mr~~l~~~kP~~f~Dlva~~AL~RPGp~~~g~i~~Yi~rk~g~~~~~~~hp~l~~iL~~TyGvivYQEQiM~ia~~~aGf 346 (381) T pfam07733 267 MRSMLKRLKPDTFEDLVALSALYRPGPMQGGNVDDYIKRKHGKEKIEYPHPDLEPILKETYGVIVYQEQVMQIAQILAGF 346 (381) T ss_pred HHHHHHHCCCCCHHHHHHHHHHCCCCCHHHCCHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCEEEHHHHHHHHHHHHCC T ss_conf 88889863999888878876640677302116379999754997888899889987356689187689999999999798 Q ss_pred CHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 84130788976201211016778789999986249 Q 537021.9.peg.7 706 SLSEADVLRRAMGKKIKEEMDKQKERFISGASKNG 740 (1033) Q Consensus 706 ~~~ead~lRra~~kk~~~~~~~~~~~f~~g~~~~g 740 (1033) |++|||.||||||||+.++|++++++|++||.+|| T Consensus 347 s~~eAD~lRram~KK~~~~m~~~~~~Fi~ga~~~G 381 (381) T pfam07733 347 SLGEADLLRRAMGKKKPEEMEKLREKFIEGAKENG 381 (381) T ss_pred CHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 97999999999738999999999999998888468 No 23 >TIGR01405 polC_Gram_pos DNA polymerase III, alpha subunit, Gram-positive type; InterPro: IPR006308 These are the polypeptide chains of DNA polymerase III. Full-length homologs of this protein are restricted to the Gram-positive lineages, including the Mycoplasmas. This protein is designated alpha chain and given the gene symbol polC, but is not a full-length homolog of other polC genes. The N-terminal region of about 200 amino acids is rich in low-complexity sequence and poorly alignable. ; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=1027.49 Aligned_cols=796 Identities=23% Similarity=0.348 Sum_probs=661.4 Q ss_pred CHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCC--CCEEEEEEEEEECCCCCCCCCC Q ss_conf 13343314447644329989999999978998899811651774999999999769--9638999999854766433445 Q 537021.9.peg.7 11 FVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAG--IQPIIGCQLDIDMQDNLKMTNQ 88 (1033) Q Consensus 11 FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~g--IKPIiG~E~~v~~~~~~~~~~~ 88 (1033) =|+||.||.+|-+||..++.+||++|+++|++|+|||||||+.+.|+||++++|+| ||+|+|+|.++.++..+-..+. T Consensus 112 RVEL~~HTkMS~mDaI~s~~e~v~~A~~wg~kaIAiTDh~~VQAfP~~y~~a~K~gPD~K~IYG~E~~lv~D~~Piv~n~ 191 (1264) T TIGR01405 112 RVELHFHTKMSQMDAITSVQEYVKQAKKWGHKAIAITDHGVVQAFPEAYKAAKKDGPDIKIIYGLEANLVDDRVPIVYNP 191 (1264) T ss_pred CEEEECCHHHHHHCCCCCHHHHHHHHHHCCCCEEEEECCCCEECCHHHHHHHHHCCCCCEEEECCEEEEECCCCCEEECC T ss_conf 34541121201010457889999999975897589854873230058987786508972166252578604886637666 Q ss_pred CCCC------C--------------------------------------------------------------------- Q ss_conf 6566------6--------------------------------------------------------------------- Q 537021.9.peg.7 89 GKTP------F--------------------------------------------------------------------- 93 (1033) Q Consensus 89 ~~~~------~--------------------------------------------------------------------- 93 (1033) .... . T Consensus 192 ~~~~~ldykl~DaTyVVfD~ETTGLsp~~d~iIE~gA~k~~nGk~ii~~~~~FikP~~~l~~~~~elT~ITq~ml~n~~~ 271 (1264) T TIGR01405 192 KDQKFLDYKLKDATYVVFDIETTGLSPQYDEIIEFGAVKVKNGKRIIDKFQFFIKPKEPLSAFVTELTGITQDMLENAPE 271 (1264) T ss_pred CCCCHHCEEECCCEEEEEEEEECCCCCCCCCEEEEEEEEEECCCEEEEEHHHHCCCCCCCCCCEEEECCCCHHHHCCCCC T ss_conf 34510002423620799603515888861424678758874685762312323288754675426624664797379841 Q ss_pred -------------------------------------------------------------------------------- Q ss_conf -------------------------------------------------------------------------------- Q 537021.9.peg.7 94 -------------------------------------------------------------------------------- 93 (1033) Q Consensus 94 -------------------------------------------------------------------------------- 93 (1033) T Consensus 272 ~~~vL~k~~~f~~d~ilVAHNGasFD~~Fl~~~~~k~~~~~~~~p~IDTL~Lar~lnP~y~sh~Lg~~~~Kl~v~~~~e~ 351 (1264) T TIGR01405 272 IEEVLEKFKEFLKDSILVAHNGASFDIGFLNTNFEKVGLEPLENPVIDTLELARALNPEYKSHRLGEIAKKLGVDLDDER 351 (1264) T ss_pred HHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCHHHHHCCCCCCCCC T ss_conf 78999999997697588862761036688988987627742336511388999983731134454223332055215330 Q ss_pred --------------------------------------------------CCCCEEEEEECCHHHHHHHHHHHHHHHHCC Q ss_conf --------------------------------------------------677539999569689999999998997324 Q 537021.9.peg.7 94 --------------------------------------------------PTVPSIVLLVSTAEGYQRLIELVSRMYLLD 123 (1033) Q Consensus 94 --------------------------------------------------~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~ 123 (1033) ....|.++||||+.|++||++|+|.|++ . T Consensus 352 ~HRA~YDa~~t~~vF~~~~~~~~e~gGi~~l~~lni~e~~~l~~~~~y~r~~~~~~~i~~kNq~GLKnLykLvS~s~~-~ 430 (1264) T TIGR01405 352 HHRADYDAEATAKVFIVMVEQLKEKGGITNLEELNIQELDKLSSEELYKRLRPNHIIILVKNQAGLKNLYKLVSISLT-K 430 (1264) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHCCHHHHHHHCCHHHHHHCCCCEEEEEEECCHHHHHHHHHHHHHHH-H T ss_conf 124334799999999999999875037211234057889842052345423745157877630347889999879986-5 Q ss_pred CCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHH-HHHHHHHHHHHHCCCCEEEEEECCCHH---------- Q ss_conf 877874007899986348988998197679678888779978-999999999876598689999439911---------- Q 537021.9.peg.7 124 QGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQ-VAEKRLLTFKKLFGDRLYVNLQRHRGY---------- 192 (1033) Q Consensus 124 ~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~-~a~~~l~~l~~~Fgd~~ylEl~~~~~~---------- 192 (1033) .||..|||.++.|.+ ++||||+.|+|..|++.++++.+..+ ++++.+.+|- |+|||.-+.+ T Consensus 431 Yfy~~Pri~~s~L~~-~~egLl~Gsac~~Gelf~a~l~~~~~~e~e~iaK~YD-------fIei~Plg~y~~l~~~e~~~ 502 (1264) T TIGR01405 431 YFYTRPRILRSLLKK-YREGLLIGSACSEGELFDALLSKTQDDELEEIAKRYD-------FIEIQPLGNYAHLIEKEQVK 502 (1264) T ss_pred HHCCCCCCCHHHHHH-HCCCEEECCCCCCCHHHHHHHCCCCHHHHHHHHHHCC-------EEEECCCCCHHHHHHHCCCC T ss_conf 314689751788986-4387077177743227687433885899999875447-------45627876507898613787 Q ss_pred --HH--HHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCC-CCCCCC------CCCCCCCCCCCCHHHHH Q ss_conf --58--99999999999818989980883007878899998766884698-576531------00245343234347887 Q 537021.9.peg.7 193 --DR--YRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHST-VVSQKD------RPRVTPDHYLKNRSEMV 261 (1033) Q Consensus 193 --e~--~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~-~l~~~~------~~~~~~~~yLks~eEM~ 261 (1033) |+ .....|+++|+++|.|+|||.|+||.+|+|...|+||.|-+=+. .++..- +..+-++.|+|+++||. T Consensus 503 d~E~~~~~~~~li~~a~~lNK~Vv~tG~~hy~~~~d~i~r~il~~s~G~~~~~nr~~~~k~~~~~~~~P~~~~rTT~EML 582 (1264) T TIGR01405 503 DKEALKEIIKKLIKLAKELNKPVVATGDVHYIEPEDKIYRKILVASQGLENKLNRLANYKEDAREVYLPELHFRTTEEML 582 (1264) T ss_pred CHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECHHHHH T ss_conf 77789999999999998619779996784217867876453100277888654766666543553338705861648899 Q ss_pred HHCCHH--HH------HHHHHHHHHHHHHHH-H-HH-HHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 512203--58------998788641110001-3-32-2102345557568877689889998765410342012464324 Q 537021.9.peg.7 262 SIFSDL--PE------ALENTVEIARRCSFI-L-QV-HAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRLAQGAIAK 330 (1033) Q Consensus 262 ~~f~~~--pe------Ai~NT~~IAe~c~~~-l-~~-~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~~~~~~~ 330 (1033) +-|+-+ ++ .++||++||++|+-+ + +. .+.+-|.+ ..+++.+++++|+.+.+.|++.. T Consensus 583 ~ef~fLgee~~~a~eiVveNtN~ia~~~e~~di~Pi~~~LytP~i---------eg~~e~~~~~~Y~~A~~iYGdpL--- 650 (1264) T TIGR01405 583 DEFSFLGEEKLLAREIVVENTNKIADQIEEDDIQPIKDKLYTPKI---------EGADEKIRDLTYEKAKKIYGDPL--- 650 (1264) T ss_pred HHHHHHCCHHHHHHHHHCCCCHHHHHHHCCCCCEECCCCCCCCCC---------CCCCHHHHHHHHHHHHHHCCCCC--- T ss_conf 998740464676644200360257755130586333788887800---------38527899999999897427985--- Q ss_pred CCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHH----------- Q ss_conf 578389999988778989860630557888999998510476433444420466776554403756333----------- Q 537021.9.peg.7 331 GYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLR----------- 399 (1033) Q Consensus 331 ~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~----------- 399 (1033) |+.+.+||++||+-|.+-||+--+||.+.+|+.+-+.|++|| .|||.++||||-++|||+||||. T Consensus 651 ---P~IVe~RieKEL~sII~nGF~ViYlIS~~LV~KS~~DGYlVG-SRGSVGSSlVAtl~gITEVNPL~PHY~Cp~Cky~ 726 (1264) T TIGR01405 651 ---PEIVEQRIEKELKSIIGNGFAVIYLISQLLVKKSLQDGYLVG-SRGSVGSSLVATLTGITEVNPLPPHYLCPNCKYS 726 (1264) T ss_pred ---HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCEEC-CCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCEE T ss_conf ---689999999874565406626778999998887415884663-7743114578542166115757687508787355 Q ss_pred --------------------------------CCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECC Q ss_conf --------------------------------231056762100145787533365687899999998303802155422 Q 537021.9.peg.7 400 --------------------------------FSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQII 447 (1033) Q Consensus 400 --------------------------------~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~ 447 (1033) ||..||.||..+-...||||+.|+++.+..+++||++-||+++|-+.| T Consensus 727 Ef~~D~~~~~GfDLp~K~CP~Cgak~pl~kDG~~IPFETFLGF~GDKVPDIDLNFSGEYQ~~AH~~vk~lFGe~~~fRAG 806 (1264) T TIGR01405 727 EFVTDGSVGSGFDLPDKDCPKCGAKAPLKKDGQDIPFETFLGFKGDKVPDIDLNFSGEYQAKAHNYVKELFGEDHTFRAG 806 (1264) T ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECC T ss_conf 30037877887768578888888777634573699850016778884686446787224778889999860897223146 Q ss_pred CHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCHHHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 07787788999999975279988988654310145898244344522988788643074202688987201122100244 Q 537021.9.peg.7 448 TFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDPAHPVSLQAIADDSRFREARLADPSVDRLLEISQKLEGLYRHAST 527 (1033) Q Consensus 448 t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~ 527 (1033) |.+|...|+|. |+ + +.|.++.--......+ .++-.+.+++-|..|+||+ T Consensus 807 TIgTvAeKTAy-------GY----V-----------------K~Y~en~~~~~~~~r~---aei~rl~~~~tGvKRTTGQ 855 (1264) T TIGR01405 807 TIGTVAEKTAY-------GY----V-----------------KKYFENIIDQGKHYRD---AEIERLVQKCTGVKRTTGQ 855 (1264) T ss_pred CHHHHHHHHHH-------HH----H-----------------HHHHHHHHHCCCCCCC---HHHHHHHCCCCCCCCCCCC T ss_conf 51535534221-------02----5-----------------6667775422765550---4678765056322215786 Q ss_pred CCCEEEECCC--CHHHHEECCCCCCC---CCEEEEEECHHHHHHCCCHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 7440335256--66662001236768---866885102133431123213565417899999999887402543433234 Q 537021.9.peg.7 528 HAAGIVIGDR--PLSQLVPMYRDVRS---DLPVTQFNMKWIEKAGLIKFDFLGLKTLTLLQKSLDFLAQRGVKVDLSLIP 602 (1033) Q Consensus 528 Ha~Gvvi~~~--~l~~~~p~~~~~~~---~~~~~q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~~~~~~d~~~ip 602 (1033) |||||||.|+ .++|+||++.++.+ ++.+||||.+++.+. |||+|+||....|+|+...++ +| ||+.+|| T Consensus 856 HPGGIiivPk~~dvyDFtP~qyPAdD~~s~w~TTHF~fh~ihdn-lLKLDILGHDDPT~ikmL~DL---tG--Idp~~IP 929 (1264) T TIGR01405 856 HPGGIIIVPKEMDVYDFTPVQYPADDTNSDWKTTHFDFHSIHDN-LLKLDILGHDDPTMIKMLQDL---TG--IDPKTIP 929 (1264) T ss_pred CCCCEEEECCCEEEEECCCCCCCCCCCCCCCEECCCCHHCCCCC-EEEEEECCCCCHHHHHHHHHC---CC--CCCCCCC T ss_conf 88765887687016521777878788988515413240202785-564520488876899850001---67--5876788 Q ss_pred CCHHHHHHHHHCCCCCHHH--HCC-CH------------HHHHHHHHHCCCHHHHHHHHHHHCCCHHHHHH-HHHHHHHC Q ss_conf 4358899876201410122--114-53------------47888877174048888766541141134200-36888504 Q 537021.9.peg.7 603 FDDHETYHLLTTKGTLGIF--QLE-SS------------GMRQALEGMQPDCIEDIIALVSLYRPGPIDNI-VVYNNRKN 666 (1033) Q Consensus 603 ~~D~~~~~l~~~g~t~GvF--Q~e-s~------------~~~~~l~~~~p~~~~dl~~~~al~RPGp~~~~-~~yi~rk~ 666 (1033) .+||+|.++|+.-+.+||- |++ ++ -.|+||.+.||++|.|||-++.| +.| ++|+.+.+ T Consensus 930 ~dD~~vm~~FSs~~~Lgv~p~~i~~~~TG~~GIPEFGT~FVR~ML~eTkPktFadLv~ISGL------SHGtDVWL~NA~ 1003 (1264) T TIGR01405 930 MDDKEVMKIFSSVKALGVTPEEILNEKTGTLGIPEFGTKFVRGMLEETKPKTFADLVRISGL------SHGTDVWLGNAQ 1003 (1264) T ss_pred CCCHHHHHHCCCCCCCCCCHHHHCCCCCCCEECCCCCCCCHHHHHHCCCCCCCHHHHEECCC------CCCCCHHHHHHH T ss_conf 88867875217876678737770785543312587565330410400689760113101254------267302268899 Q ss_pred CCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHH----HHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 65785026966733410335530012568988788632684130----78897620121101677878999998624999 Q 537021.9.peg.7 667 GKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEA----DVLRRAMGKKIKEEMDKQKERFISGASKNGIS 742 (1033) Q Consensus 667 g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ea----d~lRra~~kk~~~~~~~~~~~f~~g~~~~g~~ 742 (1033) .. +..-.-.|.++.+ =-..||--.. -+|+.+.-| +.+|| ||+-..- ++ +.+.+.-|++++++ T Consensus 1004 ~l--I~~g~~~L~~VI~-------CRDDIM~YLI-~kGlep~~AF~IME~VRK--Gk~L~~~-~~-~~e~~~~mk~~~VP 1069 (1264) T TIGR01405 1004 DL--IKSGIKTLSDVIG-------CRDDIMVYLI-HKGLEPKLAFKIMEKVRK--GKGLKPS-KK-RAEYIELMKENKVP 1069 (1264) T ss_pred HH--HHCCCCCCCCEEE-------ECCHHHHHHH-HCCCCHHHHHHHHHHHCC--CCCCCCH-HH-HHHHHHHHHHCCCC T ss_conf 99--9615875010464-------1424578998-648885035575132115--6678854-66-89999999866897 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCH---------------------- Q ss_conf 9999999999997500013310147888999999999973026667775100235420---------------------- Q 537021.9.peg.7 743 KTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVE---------------------- 800 (1033) Q Consensus 743 ~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~---------------------- 800 (1033) +|..+.| ..|+ |+|+|||||||.++|++.||+|.|||++||||.+|.....|| T Consensus 1070 eWYIeSc-~KIK----YmFPKAHAaAYVlMA~RiAyFKVhyPl~YYAaYFsiRa~~FD~~~m~~Gk~~ik~k~~El~~~~ 1144 (1264) T TIGR01405 1070 EWYIESC-LKIK----YMFPKAHAAAYVLMAWRIAYFKVHYPLEYYAAYFSIRADAFDLETMIKGKEFIKQKLKELNERE 1144 (1264) T ss_pred CHHHHHH-HHCC----CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCHHHHHHHHHHHHHHH T ss_conf 1478752-2103----5661369999998888887887632278888664231100263543416689999999998874 Q ss_pred ----------------HHHHHHHHHHHCCCCCCCCCCCCCCEE-EEECCCE--EEEEHHHHCCCCHHHHHHHHHHH-HCC Q ss_conf ----------------258999888863685035523342100-3531986--98502685896989999999998-379 Q 537021.9.peg.7 801 ----------------KIKKFCQDARQFNIQIMPPSVNTPCVD-FKVGDNR--IYYSLAAIKGVGTTTARHIMEAS-ADK 860 (1033) Q Consensus 801 ----------------~~~~~i~e~~~~gi~vl~PdIN~S~~~-f~~~~~~--Ir~Gl~~Ikgvg~~~~~~Ive~r-~~g 860 (1033) ..-.++.|+..+|+++.|+||.+|.+. |-++++. ++.||.+|.|+|+.+|+.||++| +++ T Consensus 1145 ~~R~~~~~~~KEk~~~~~LEi~lEM~~RGF~~~~idl~~S~At~Fli~~~~~~LIpPF~~i~GlGe~vA~~Iv~ARq~Ek 1224 (1264) T TIGR01405 1145 KLRDQNKASPKEKDLLTVLEIVLEMLARGFKFQPIDLYKSQATEFLIEGDTKALIPPFNAIPGLGENVAESIVEARQNEK 1224 (1264) T ss_pred HHHHHHCCCCCCHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCEECCCHHHHHHHHHHHHHCCC T ss_conf 20322011210054325699999999634224787602115640377778831057610117746899999998643178 Q ss_pred CCCCHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCC Q ss_conf 999978999734824799899998886767234790 Q 537021.9.peg.7 861 PFDSLEDFCSRVDSKKLNRRVLESLVFAGALDCFGY 896 (1033) Q Consensus 861 ~f~sl~df~~rv~~~~~nk~~le~Li~aGAfD~~~~ 896 (1033) ||.|++||.+|. ++|+..+|.|-.-|.|+.+-. T Consensus 1225 ~F~S~eDL~kRt---ki~~~~iE~l~~~G~L~~lpE 1257 (1264) T TIGR01405 1225 PFLSVEDLKKRT---KISKTLIEKLDSLGVLDNLPE 1257 (1264) T ss_pred CCCCHHHHHHHC---CCCHHHHHHHHHCCCCCCCCC T ss_conf 747689876531---877779998876377778876 No 24 >COG2176 PolC DNA polymerase III, alpha subunit (gram-positive type) [DNA replication, recombination, and repair] Probab=100.00 E-value=0 Score=931.24 Aligned_cols=796 Identities=23% Similarity=0.312 Sum_probs=650.1 Q ss_pred CHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCC Q ss_conf 13343314447644329989999999978998899811651774999999999769963899999985476643344565 Q 537021.9.peg.7 11 FVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGK 90 (1033) Q Consensus 11 FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~ 90 (1033) =|+||.||..|-+||..++++||++|+++|++|+|||||||+.+.++||.+++|+|||+|+|+|.++.++..+...+... T Consensus 336 RVELh~HTkMS~mDai~sv~~~vk~A~kwghkaIAITDh~~VqafP~~y~~akK~giK~IyG~EanlvdD~vpiv~N~~d 415 (1444) T COG2176 336 RVELHFHTKMSQMDAITSVEELVKQAKKWGHKAIAITDHGVVQAFPEAYKAAKKYGIKAIYGLEANLVDDGVPIVYNPDD 415 (1444) T ss_pred EEEEEECCCHHHHCCCCCHHHHHHHHHHCCCCEEEEECCCCHHHCHHHHHHHHHCCCEEEEEEEEEECCCCCCEECCCCC T ss_conf 48998515324421667899999999975996488844762332668887666549618974346640578842307544 Q ss_pred CC------------------------------------------------------------------------------ Q ss_conf 66------------------------------------------------------------------------------ Q 537021.9.peg.7 91 TP------------------------------------------------------------------------------ 92 (1033) Q Consensus 91 ~~------------------------------------------------------------------------------ 92 (1033) .. T Consensus 416 ~~l~datyVVfDiETTGLs~~~d~iIE~aAvKikng~iId~f~~Fi~P~~pl~~~~telTgITdeml~~a~~i~~vL~kf 495 (1444) T COG2176 416 QKLDDATYVVFDIETTGLSPVYDEIIEIAAVKIKNGRIIDKFQFFIKPGRPLSATITELTGITDEMLENAPEIEEVLEKF 495 (1444) T ss_pred CCCCCCCEEEEEEECCCCCCCCCHHHHHEEEEEECCCCHHHHHHHCCCCCCCCHHHHHCCCCCHHHHCCCCCHHHHHHHH T ss_conf 54565508999855377675420156200044338804377897508999676445420356799874883399999999 Q ss_pred -------------------------------------------------------------------------------- Q ss_conf -------------------------------------------------------------------------------- Q 537021.9.peg.7 93 -------------------------------------------------------------------------------- 92 (1033) Q Consensus 93 -------------------------------------------------------------------------------- 92 (1033) T Consensus 496 ~~~~~d~IlVAHNasFD~gFl~~~~~k~~~~~~~~pvIDTL~lar~L~P~~ksh~Lg~l~kk~~v~le~hHRA~yDaeat 575 (1444) T COG2176 496 REFIGDSILVAHNASFDMGFLNTNYEKYGLEPLTNPVIDTLELARALNPEFKSHRLGTLCKKLGVELERHHRADYDAEAT 575 (1444) T ss_pred HHHHCCCEEEECCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCHHHHHCCHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 99844858996167556457888899838851027501199999875745432466778887486378753101008999 Q ss_pred ---------------------------------CCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHH Q ss_conf ---------------------------------66775399995696899999999989973248778740078999863 Q 537021.9.peg.7 93 ---------------------------------FPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEI 139 (1033) Q Consensus 93 ---------------------------------~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~ 139 (1033) .....|.++||||+.|++||++|+|.+++ ..|++.|||+.+.|.+ T Consensus 576 ~~vf~~f~~~~ke~Gi~~l~eln~~l~~~~~ykr~r~~h~~i~vknq~GlKnLykLvS~s~~-kyf~~~PrI~~S~L~~- 653 (1444) T COG2176 576 AKVFFVFLKDLKEKGITNLSELNDKLSSEDLYKRLRPKHATIYVKNQVGLKNLYKLVSISHT-KYFYGRPRIPRSVLKK- 653 (1444) T ss_pred HHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHHHHH-HHHCCCCCCCHHHHHH- T ss_conf 99999999999884556599986764016788632566269999531758999999888888-7642688771999997- Q ss_pred CCCCEEEECCCCCCHHHHHHHCCCHHHHHHHHHHHH--HHCCCCEEEEEEC-CCHH-HH---HHHHHHHHHHHHHCCCEE Q ss_conf 489889981976796788887799789999999998--7659868999943-9911-58---999999999998189899 Q 537021.9.peg.7 140 GTEGLIMLTGGSSGPIDRAFFFNSSQVAEKRLLTFK--KLFGDRLYVNLQR-HRGY-DR---YRESQVVQLAYTHELPLV 212 (1033) Q Consensus 140 ~~egLIvlsg~~~g~i~~~l~~~~~~~a~~~l~~l~--~~Fgd~~ylEl~~-~~~~-e~---~~~~~l~~lA~~~~iPlV 212 (1033) ++||||+.|+|..|++.++.+....+++++.+..|. ++.+-+.|-++.. .+.. .+ .....++++++++|.|+| T Consensus 654 ~regLligsac~~Gelf~a~l~k~~ee~e~iak~YDfIei~pp~~y~~li~~e~~~~~e~l~~~~k~li~~g~~l~K~Vv 733 (1444) T COG2176 654 NREGLLIGSACSEGELFDAALQKPDEEVEEIAKFYDFIEIQPPANYAHLIEREGLKDKEALKEIIKKLIKLGKKLNKPVV 733 (1444) T ss_pred HHCCEEECCCCCCHHHHHHHHHCCHHHHHHHHHHCCEEEECCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCEE T ss_conf 41271643686512799999827999999998755758714820014678750367299999999999999998499089 Q ss_pred EECCCCCCCHHHHHHHHHHHHHHCC-CCCCCCCCCCCCCCCCCCCHHHHHHHCCHHHH------HHHHHHHHHHHHHHHH Q ss_conf 8088300787889999876688469-85765310024534323434788751220358------9987886411100013 Q 537021.9.peg.7 213 ATNNSLFLSEEDYEAHDVLMAVAHS-TVVSQKDRPRVTPDHYLKNRSEMVSIFSDLPE------ALENTVEIARRCSFIL 285 (1033) Q Consensus 213 aTndv~Y~~~~D~~~h~iL~aI~~~-~~l~~~~~~~~~~~~yLks~eEM~~~f~~~pe------Ai~NT~~IAe~c~~~l 285 (1033) ||.+|||++|+|...++++.|-.-+ ..++..-.....++.||++.+||.+.|+-+.| .++||++||++|+--- T Consensus 734 atgn~hyl~p~d~iyr~ilv~s~G~~~~~~r~~~~~~~P~~~frTT~EML~efsfLgee~a~eiVv~Ntn~iad~ie~i~ 813 (1444) T COG2176 734 ATGNVHYLDPEDKIYRKILVASQGLGNPLNRTFNEQTLPEVHFRTTDEMLQEFSFLGEEKAYEIVVENTNKIADMIEDIQ 813 (1444) T ss_pred EECCCEECCHHHHHHHHHHEECCCCCCCCCCCCCCCCCCCEEEECHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCE T ss_conf 95774106857878888612157767655545565568850565179999998765287777886458288998866022 Q ss_pred HHHH-CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 3221-023455575688776898899987654103420124643245783899999887789898606305578889999 Q 537021.9.peg.7 286 QVHA-PILPRFIEKECDDIQKEEENELRNKAVAGLEVRLAQGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFI 364 (1033) Q Consensus 286 ~~~~-~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div 364 (1033) +... ..-|++ ..+++.+++++++.+.+.|++.. |+.+.+||++||+-|...||+--+||.+.+| T Consensus 814 Pik~~LytPki---------e~a~e~v~~lty~~A~~iYG~pL------P~IVe~RiEKEL~sII~nGFaViYlISq~LV 878 (1444) T COG2176 814 PIKDKLYTPKI---------EGAEEKVRDLTYEKAHKIYGDPL------PEIVEQRIEKELNSIIGNGFAVIYLISQKLV 878 (1444) T ss_pred ECCCCCCCCCC---------CCHHHHHHHHHHHHHHHHHCCCC------HHHHHHHHHHHHHHHHCCCEEEHHHHHHHHH T ss_conf 36667779861---------77779999999999999867970------6999999999999997288465358889999 Q ss_pred HHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHH-----------------------------------------CCCH Q ss_conf 98510476433444420466776554403756333-----------------------------------------2310 Q 537021.9.peg.7 365 QWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLR-----------------------------------------FSLL 403 (1033) Q Consensus 365 ~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~-----------------------------------------~~Ll 403 (1033) +.+-++|++|| .|||.++||||.++|||+|+||. ||.. T Consensus 879 kkS~~dGYlVG-SRGSVGSSlVAtm~gITEVNPL~PHY~Cp~Cky~Ef~~d~svgsGfDLpdK~CPkCg~pl~kDG~dIP 957 (1444) T COG2176 879 KKSLDDGYLVG-SRGSVGSSLVATMIGITEVNPLPPHYLCPECKYSEFIDDGSVGSGFDLPDKDCPKCGTPLKKDGHDIP 957 (1444) T ss_pred HHHCCCCEEEC-CCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 86114770765-78750488998763441247899653288874046623787677888899989867994356898787 Q ss_pred HHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCC Q ss_conf 56762100145787533365687899999998303802155422077877889999999752799889886543101458 Q 537021.9.peg.7 404 FERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDP 483 (1033) Q Consensus 404 FERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~ 483 (1033) ||.||.......||||+.|+++.+...++|+++-||+++|-+.+|.+|...|+|.- -+ T Consensus 958 FETFlGF~GDKVPDIDLNFSGeYQ~~AH~yvk~lFGe~~~fRAGTIgTvAEKTA~G-----------yV----------- 1015 (1444) T COG2176 958 FETFLGFKGDKVPDIDLNFSGEYQPKAHNYVKELFGEDYVFRAGTIGTVAEKTAYG-----------YV----------- 1015 (1444) T ss_pred HHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHH-----------HH----------- T ss_conf 25532778887887667678644178889999971886515342045577887888-----------99----------- Q ss_pred CCCCCHHHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCC--CHHHHEECCCCCCC---CCEEEEE Q ss_conf 982443445229887886430742026889872011221002447440335256--66662001236768---8668851 Q 537021.9.peg.7 484 AHPVSLQAIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDR--PLSQLVPMYRDVRS---DLPVTQF 558 (1033) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~--~l~~~~p~~~~~~~---~~~~~q~ 558 (1033) +.+.++... . -.-.++-.+++++.|..|.+|+|||||||.|+ .++|+||++.++.+ .+.+||| T Consensus 1016 ------k~Y~ed~~~--~----~r~aEidRl~~~~tGvKRTTGQHPGGIiIvP~~~dVydFtPvqfPAdD~~sewkTTHF 1083 (1444) T COG2176 1016 ------KKYFEDYNK--F----YRDAEIDRLVQGCTGVKRTTGQHPGGIIIVPKYMDVYDFTPVQFPADDTNSEWKTTHF 1083 (1444) T ss_pred ------HHHHHHCCC--C----CCHHHHHHHHHHCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEE T ss_conf ------878876275--3----2068999999754266625688998279935976511016778888888775444003 Q ss_pred ECHHHHHHCCCHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHH--------------HHCC Q ss_conf 02133431123213565417899999999887402543433234435889987620141012--------------2114 Q 537021.9.peg.7 559 NMKWIEKAGLIKFDFLGLKTLTLLQKSLDFLAQRGVKVDLSLIPFDDHETYHLLTTKGTLGI--------------FQLE 624 (1033) Q Consensus 559 d~~~~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~~~~~~d~~~ip~~D~~~~~l~~~g~t~Gv--------------FQ~e 624 (1033) |.+++.+. +||+|+||....|+|+...++- | ||+.+||+|||.|..+|+..+..|| --|. T Consensus 1084 dfh~ihdn-lLKLDILGHDDPT~ikmLqdlt---g--Idp~~IP~dD~~vm~iFss~~~Lgvtpe~i~~~tGtlGIPEFG 1157 (1444) T COG2176 1084 DFHAIHDN-LLKLDILGHDDPTMIKMLQDLT---G--IDPKTIPMDDPEVMKIFSSTESLGVTPEQIGEKTGTLGIPEFG 1157 (1444) T ss_pred EHHHHCCC-EEEEECCCCCCHHHHHHHHHHC---C--CCCCCCCCCCHHHHHHHCCCCCCCCCHHHHCCCCCCCCCCCCC T ss_conf 23320110-1000004789868999988722---9--9945578888768987168865788888819866654677422 Q ss_pred CHHHHHHHHHHCCCHHHHHHHHHHHCCCHHHHHH-HHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHH Q ss_conf 5347888877174048888766541141134200-368885046578502696673341033553001256898878863 Q 537021.9.peg.7 625 SSGMRQALEGMQPDCIEDIIALVSLYRPGPIDNI-VVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIAQLLS 703 (1033) Q Consensus 625 s~~~~~~l~~~~p~~~~dl~~~~al~RPGp~~~~-~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~a 703 (1033) ..-.|+||.+.+|++|.||+-++.| +.| ++|+.+.+. -+..---.|.++.. -...||--.. -. T Consensus 1158 T~FVR~ML~eTkPktFaeLv~ISGL------SHGtdVWlgNAq~--lIk~g~~~Ls~VI~-------CRDDIMvYLI-~k 1221 (1444) T COG2176 1158 TRFVRQMLEETKPKTFAELVRISGL------SHGTDVWLGNAQD--LIKSGIATLSDVIG-------CRDDIMVYLI-HK 1221 (1444) T ss_pred HHHHHHHHHHCCCCCHHHHHHHCCC------CCCCCCCCCCHHH--HHHHCCCCHHHHEE-------EHHHHHHHHH-HC T ss_conf 4899999875499778988865167------6774102466999--99828762313305-------0078999999-84 Q ss_pred CCCHHHH----HHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH Q ss_conf 2684130----788976201211016778789999986249999999999999997500013310147888999999999 Q 537021.9.peg.7 704 GYSLSEA----DVLRRAMGKKIKEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWM 779 (1033) Q Consensus 704 g~~~~ea----d~lRra~~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~ayl 779 (1033) |+.+.-| +.+||.++.|. ++.++-|+++++++|..+.| ..|+ |+|+|+||+||.++|++.||+ T Consensus 1222 Glep~~AFkIME~VRKGk~lk~--------~e~~~~Mke~~Vp~WyIeSC-~KIk----YMFPKAHAaAYVlMA~RIAyF 1288 (1444) T COG2176 1222 GLEPSLAFKIMEFVRKGKGLKP--------AEYEELMKENKVPEWYIESC-LKIK----YMFPKAHAAAYVLMAWRIAYF 1288 (1444) T ss_pred CCCCCHHHHHHHHHHCCCCCCH--------HHHHHHHHHCCCCHHHHHHH-HHHH----HHCCHHHHHHHHHHHHHHHHH T ss_conf 8993057999999855678885--------89999998769918999876-5465----308468899999999999999 Q ss_pred HHHHHHHHHHHHHCCCCCCCH----------------------------------HHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 973026667775100235420----------------------------------2589998888636850355233421 Q 537021.9.peg.7 780 KTHYPVEFLAASMTLEMDNVE----------------------------------KIKKFCQDARQFNIQIMPPSVNTPC 825 (1033) Q Consensus 780 K~~yP~eF~aa~L~~~~~~~~----------------------------------~~~~~i~e~~~~gi~vl~PdIN~S~ 825 (1033) |.|||.+||||++|....++| .+-.++.|+..+|+++.++||-+|. T Consensus 1289 KVhhPl~YYAayfSira~~FDi~~m~~Gke~ik~k~~Ei~~~~~~~~~~kEk~l~t~lEi~~EM~aRGf~f~~idly~S~ 1368 (1444) T COG2176 1289 KVHHPLEYYAAYFSIRADDFDIETMSKGKEAIKAKMEEINKRKGNKASPKEKNLLTVLEIVLEMLARGFKFQKIDLYKSD 1368 (1444) T ss_pred HHCCHHHHHHHHHEEEHHHCCHHHHHCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECC T ss_conf 84453998888841115531889885609999999999861204568834542575999999999736764475125136 Q ss_pred EE-EEECCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCC Q ss_conf 00-353198698502685896989999999998379999978999734824799899998886767234790 Q 537021.9.peg.7 826 VD-FKVGDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLVFAGALDCFGY 896 (1033) Q Consensus 826 ~~-f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li~aGAfD~~~~ 896 (1033) +. |.++++.++.||.+|.|+|+++|+.||++|++|||.|.+||.+|. ++++..+|.|-.-|+|+.+-. T Consensus 1369 At~Fvid~~~LipPFi~i~GlGe~vA~~IV~AR~Ek~FlS~eDlkkRt---kis~t~ie~~~~~G~l~~lpe 1437 (1444) T COG2176 1369 ATEFVIDGDTLIPPFIAIPGLGENVAKSIVEAREEKEFLSKEDLKKRT---KISKTHIEKLDEMGCLEGLPE 1437 (1444) T ss_pred CEEEEEECCEECCCEECCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHC---CCCHHHHHHHHHCCCCCCCCC T ss_conf 727997588241760015871689999999986348867789998741---755889999986086357984 No 25 >PRK07373 DNA polymerase III subunit alpha; Reviewed Probab=100.00 E-value=0 Score=628.36 Aligned_cols=276 Identities=30% Similarity=0.524 Sum_probs=261.7 Q ss_pred HCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCEEEE Q ss_conf 00133101478889999999999730266677751002354202589998888636850355233421003531986985 Q 537021.9.peg.7 758 DYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVEKIKKFCQDARQFNIQIMPPSVNTPCVDFKVGDNRIYY 837 (1033) Q Consensus 758 ~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~~~~~~i~e~~~~gi~vl~PdIN~S~~~f~~~~~~Ir~ 837 (1033) .-.||||||||||++||||||||+|||+|||||+||+++++++|+..|+.|||+|||+|+|||||+|.++|++++++||| T Consensus 35 ~~~FNKSHsaAYaliaYqtAyLKahYP~eFmAA~Lss~~~n~dKi~~~i~e~r~~gI~vlpPdIN~S~~~F~~~~~~Irf 114 (447) T PRK07373 35 SNCFNKSHSTAYAYVTYQTAYLKANYPVEYMAALLTANSGDQDKVQKYIENCQKMGIEVEPPDINRSGVDFTPVGDKILF 114 (447) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCEEECCCCCCCCCCCEECCCEEEE T ss_conf 07778789999999999999987438499999970478788468999999999879919898111467862898984998 Q ss_pred EHHHHCCCCHHHHHHHHHHH-HCCCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 02685896989999999998-37999997899973482479989999888676723479038999998999999999886 Q 537021.9.peg.7 838 SLAAIKGVGTTTARHIMEAS-ADKPFDSLEDFCSRVDSKKLNRRVLESLVFAGALDCFGYSRMQLLQSLDNIQKYAQWVE 916 (1033) Q Consensus 838 Gl~~Ikgvg~~~~~~Ive~r-~~g~f~sl~df~~rv~~~~~nk~~le~Li~aGAfD~~~~~R~~ll~~i~~~~~~~~~~~ 916 (1033) ||++|||||++++++|+++| ++|||+|++|||.|++.+.+||++||+||+|||||+||.||++|+.+++.+++++.... T Consensus 115 GL~AIKgVG~~ai~~Iv~eR~~~G~F~sl~DF~~Rvd~~~vNkr~lEsLIkaGAfD~~g~nRa~Ll~~ld~al~~a~~~~ 194 (447) T PRK07373 115 GLSAVRNLGEGAIEAILKAREEGGEFKSLADFCDRVDLRVVNRRALESLIYCGAFDKIEANRHQLIKDLELVIDWAQSRA 194 (447) T ss_pred EHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 73774487599999999999866997669999975452214999999999708464433419999997999999999988 Q ss_pred HHHHCCCCCCCCCCC--------CCCCCCCCCCCCCCCCHHHHHHHHHHHHCEEECCCCHHHHHHHHHHCCCCCHHHHHC Q ss_conf 543127778666655--------556557766689999888999857887262116385589999998748976889850 Q 537021.9.peg.7 917 KNRTNKHENIFAHEK--------GTFSDKITLEKFSVENSSVRFENEQRVLGFYFSGHPLDVYKPILRKMGIKGYEESVA 988 (1033) Q Consensus 917 ~~~~~~q~~LF~~~~--------~~~~~~~~~~~~~~~~~~e~l~~E~e~LGf~lS~HPl~~~~~~l~~~~~~~~~~~~~ 988 (1033) +++..||.+||+... ......+.++.+++|+..|+|.+|+|+||||+|+|||+.|+..++.....++.++.. T Consensus 195 ~~~~~gQ~~LF~~~~~~~~~~~~~~~~~~p~~~~v~ew~~~ekL~~EKE~LGfYlSgHPL~~y~~~~~~~~~~~~~~l~~ 274 (447) T PRK07373 195 KDRASGQLNLFDLLGSTSNGDANSAFDTAPKAPPVADFPLQEKLKLEKELLGFYVSEHPLKSLQRPAPLLSPINLSQLEE 274 (447) T ss_pred HHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHEEEECCCHHHHHHHHHHHHCCCCHHHHHH T ss_conf 99875984522223556555443222246778876679988899877887506862073789999988838713899640 Q ss_pred CCCCCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 558988999999961121015888769999998497049999539 Q 537021.9.peg.7 989 TIRGNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 989 ~~~g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iFP 1033 (1033) ...++.|+++|+|+++++++||+|++|||+|+||.+|.+|++||| T Consensus 275 ~~~~~~V~~~g~I~~vr~~~TKkG~~MAfvtLeD~tG~iEvvvFp 319 (447) T PRK07373 275 QKERTKVSAVVMLNGVKKIVTKKGDPMAFLQLEDLSGQSEAVVFP 319 (447) T ss_pred CCCCCEEEEEEEEEEEEEEECCCCCEEEEEEEEECCCCEEEEEEH T ss_conf 248978999999999999870489659999999789748999808 No 26 >LOAD_php consensus Probab=99.98 E-value=1.7e-30 Score=276.33 Aligned_cols=200 Identities=30% Similarity=0.401 Sum_probs=169.0 Q ss_pred CHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHH----CCCCEEEEEEEEEECCCCCCCC Q ss_conf 133433144476443299899999999789988998116517749999999997----6996389999998547664334 Q 537021.9.peg.7 11 FVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACA----AGIQPIIGCQLDIDMQDNLKMT 86 (1033) Q Consensus 11 FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk----~gIKPIiG~E~~v~~~~~~~~~ 86 (1033) ||||||||.||++||+++|++||++|+++||+|+|||||++|+|+++|+++|++ .|||||+|||+++..+...... T Consensus 1 ~v~LH~HS~~S~~dg~~~~~elv~~A~~~G~~aiaiTDh~~~~g~~~~~~~~~~~~~~~gik~I~G~E~~~~~~~~~~~~ 80 (206) T LOAD_php 1 FIDLHVHSVYSLLDGALTLEELVEEAKKLGLEAIAITDHGNLFGALEFYKYAKEIREKKGIKPIIGVEAYILDDGEFDLL 80 (206) T ss_pred CCCCCCCCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCH T ss_conf 96642052225543117999999999987998899958997435999999998753206963788879997589862210 Q ss_pred CCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHH Q ss_conf 45656666775399995696899999999989973248778740078999863489889981976796788887799789 Q 537021.9.peg.7 87 NQGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQV 166 (1033) Q Consensus 87 ~~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~ 166 (1033) ......+..|+.+++.+.+.... ......++++.+++.. +.+|+++++||..+++......+ .+. T Consensus 81 ------------~~~~~~~~~~~~h~~~~~~~~~~-~~~~~~~~~~~~~~~~-~~~g~i~~~~~~~~~~~~~~~~~-~~~ 145 (206) T LOAD_php 81 ------------ALKELDGVVASVHLVLLAKNEEG-YKNYLRGYIDEELLIA-LEEGLIAIIAHPGGPVRLALLAR-EEA 145 (206) T ss_pred ------------HHHHCCCCHHHHHHHHHHHHHHH-CCCCCCCCCCHHHHHH-CCCCCEEEECCCCCHHHHHHHCC-CHH T ss_conf ------------03420420445678873114453-4577766588999985-30997899768775677764068-088 Q ss_pred HHHHHHHHHHHCCCCEEEEEECCCHH-HHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHH Q ss_conf 99999999876598689999439911-589999999999981898998088300787889 Q 537021.9.peg.7 167 AEKRLLTFKKLFGDRLYVNLQRHRGY-DRYRESQVVQLAYTHELPLVATNNSLFLSEEDY 225 (1033) Q Consensus 167 a~~~l~~l~~~Fgd~~ylEl~~~~~~-e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~ 225 (1033) +.....+++..|++++|+|+++++.. +...+..++++|+++++|+|+||||||++|+|. T Consensus 146 ~~~~~~~~~~~~~~~~~lEi~~~~~~~~~~~~~~l~~la~~~~lp~vat~Dvh~~~~~d~ 205 (206) T LOAD_php 146 AKLAALVYQAIFEDGFALEINTHGLREDERYEEALLKLAEELGLPIVAGSDAHYPNPEDV 205 (206) T ss_pred HHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHC T ss_conf 999999999863678399972799837899999999999980999999871256860107 No 27 >pfam02811 PHP PHP domain. The PHP (Polymerase and Histidinol Phosphatase) domain is a putative phosphoesterase domain. Probab=99.95 E-value=1.3e-25 Score=235.12 Aligned_cols=163 Identities=30% Similarity=0.367 Sum_probs=140.6 Q ss_pred HHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCC--CCEEEEEEEEEECCCCCCCCCCC Q ss_conf 3343314447644329989999999978998899811651774999999999769--96389999998547664334456 Q 537021.9.peg.7 12 VHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAG--IQPIIGCQLDIDMQDNLKMTNQG 89 (1033) Q Consensus 12 vhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~g--IKPIiG~E~~v~~~~~~~~~~~~ 89 (1033) ||||+||.||++||+.+|+++|++|++.||+|+|||||+||+|+++|+++|++.| ||||+|||+++.++...... T Consensus 1 v~LH~HT~~S~~dg~~~i~~lv~~a~~~g~~alaiTDh~~~~g~~~~~~~~~~~g~~ik~I~G~E~~i~~~~~~~~~--- 77 (173) T pfam02811 1 VDLHVHTDFSLLDGALSIEELVKAAKELGLEAIAITDHDNLFGAPEFYEAAKLEKAGLKPIIGAEINIVDDEDELDL--- 77 (173) T ss_pred CCCCCCCCCHHHHHCCCHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHCCCCCEEEEEEEEEECCCCCCCCC--- T ss_conf 98705773715443679999999999869987999667750519999999985478980788879996078775667--- Q ss_pred CCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHHHH Q ss_conf 56666775399995696899999999989973248778740078999863489889981976796788887799789999 Q 537021.9.peg.7 90 KTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVAEK 169 (1033) Q Consensus 90 ~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a~~ 169 (1033) ........+++++++|. |+||++++|.+++. .+.+..++++|+|++|||..|++.+++..+..++|++ T Consensus 78 ~~~~~~~~~l~~~~~~~--y~nl~~~~s~~~~~----------~~~l~~~~~~gli~lsg~~~g~~~~~~~~~~~~~a~~ 145 (173) T pfam02811 78 EDDLLKSLDLVLLSVHE--VRNLPELSSAAYTE----------DELLEAVLEEGLIIILAHPEGYVGTALLGGPLEEAEE 145 (173) T ss_pred CCCCCCCCCEEEEHHHH--HHHHHHHHHHHHCC----------HHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHHHH T ss_conf 65645636876457667--88899974498649----------8999987489979989999898899998699999999 Q ss_pred HHHHHHHHCCCCEEEEEECCCH Q ss_conf 9999987659868999943991 Q 537021.9.peg.7 170 RLLTFKKLFGDRLYVNLQRHRG 191 (1033) Q Consensus 170 ~l~~l~~~Fgd~~ylEl~~~~~ 191 (1033) .+.+|...++...++ +++.. T Consensus 146 ~~~~~~~~~~~~~~~--~~~~~ 165 (173) T pfam02811 146 LLEEYIGEDGFYLEI--NNSGL 165 (173) T ss_pred HHHHHHHHCCCCEEE--ECCCC T ss_conf 999999862974686--21686 No 28 >smart00481 POLIIIAc DNA polymerase alpha chain like domain. DNA polymerase alpha chain like domain, incl. family of hypothetical proteins Probab=99.61 E-value=1.9e-15 Score=149.09 Aligned_cols=66 Identities=50% Similarity=0.707 Sum_probs=64.3 Q ss_pred HHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEE Q ss_conf 343314447644329989999999978998899811651774999999999769963899999985 Q 537021.9.peg.7 13 HLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDID 78 (1033) Q Consensus 13 hLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~ 78 (1033) .||+||.||++||+.+|++++++|+++|++++|||||||++|+++|+++|++.|||||+|||+++. T Consensus 1 dLh~HT~~S~~dg~~~~~~lv~~a~~~g~~~vaiTDh~~~~g~~~~~~~~~~~gi~~I~G~Ei~~~ 66 (67) T smart00481 1 DLHVHSDYSLLDGALSPEELVKRAKELGLKAIAITDHGNLFGAVEFYKAAKKAGIKPIIGLEANIV 66 (67) T ss_pred CCCCCCCCCHHHCCCCHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHCCCEEEEEEEEEEE T ss_conf 948876585656368999999999987998899967886446999999999869979999999986 No 29 >PRK06361 hypothetical protein; Provisional Probab=98.84 E-value=3.2e-07 Score=79.58 Aligned_cols=180 Identities=17% Similarity=0.153 Sum_probs=99.2 Q ss_pred HHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHH----H----HHCCCCEEEEEEEEEECCCCC Q ss_conf 33433144476443299899999999789988998116517749999999----9----976996389999998547664 Q 537021.9.peg.7 12 VHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQK----A----CAAGIQPIIGCQLDIDMQDNL 83 (1033) Q Consensus 12 vhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~a----c----kk~gIKPIiG~E~~v~~~~~~ 83 (1033) +-||+||.|| ||..+++++++.|++.|+..+|||||+...+.-+.... . +-.||+-.-|+|+.+.+++.. T Consensus 2 iDlH~HT~~S--DG~~si~Ema~aA~~~G~~yiaITDHs~~~~~~~~l~~~~~~~~~~~~~~~i~iL~GiEv~il~~g~l 79 (216) T PRK06361 2 IDLHTHTIFS--DGELIPSELVRRARVLGYRAIAITDHADASNLEEILEKLVPAAEELSLYWDIEVIPGVELTHVPPELI 79 (216) T ss_pred CCCCCCCCCC--CCCCCHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCC T ss_conf 5004275676--99988999999999869909999638886678889999999999875158977998789703687654 Q ss_pred CCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCC Q ss_conf 33445656666775399995696899999999989973248778740078999863489889981976796788887799 Q 537021.9.peg.7 84 KMTNQGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNS 163 (1033) Q Consensus 84 ~~~~~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~ 163 (1033) +.........+ .. ++..-..... .|......+....+ ..+=+.|+++ .+ . T Consensus 80 d~~~~~l~~ld--~~-iasvH~~~~~------------------~~~~~~~~~~Aien-p~VdII~HPg-~i-------~ 129 (216) T PRK06361 80 PKLAKKARDLG--AE-IVVVHGETIV------------------EPVAPGTNLAAISC-EDVDILAHPG-LI-------T 129 (216) T ss_pred CCHHHHHHHCC--CE-EEEEECCCCH------------------HHHHHHHHHHHHHC-CCCCEEECCC-CC-------C T ss_conf 53578998679--76-9998166201------------------15676899999858-9955873898-42-------5 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHH Q ss_conf 78999999999876598689999439911589999999999981898998088300787889999876 Q 537021.9.peg.7 164 SQVAEKRLLTFKKLFGDRLYVNLQRHRGYDRYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVL 231 (1033) Q Consensus 164 ~~~a~~~l~~l~~~Fgd~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL 231 (1033) .+.++... ..+..+||...... ...+..++++|+++|+|++..+|.|-...=+..-+-+- T Consensus 130 ~e~a~~Aa-------e~gtaLEINa~p~r-ld~~~~~~r~A~~~Gv~i~I~sDAH~~~~l~~~~~gv~ 189 (216) T PRK06361 130 EEDAALAA-------ENGVFLEITARKGH-NLTNGHVARLAREAGAPLVINTDTHAPSDLITYETARK 189 (216) T ss_pred HHHHHHHH-------HHCCEEEEECCCCC-CCCCHHHHHHHHHCCCEEEEECCCCCCCCCCCHHHHHH T ss_conf 89999999-------81958999689776-47779999999986996999789998000368899999 No 30 >COG1387 HIS2 Histidinol phosphatase and related hydrolases of the PHP family [Amino acid transport and metabolism / General function prediction only] Probab=98.67 E-value=1.2e-06 Score=74.64 Aligned_cols=198 Identities=14% Similarity=0.129 Sum_probs=115.4 Q ss_pred HHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHH--H-----HHHHHH----HHHHCCCCEEEEEEEEEECCC Q ss_conf 34331444764432998999999997899889981165177--4-----999999----999769963899999985476 Q 537021.9.peg.7 13 HLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLF--S-----ALEFSQ----KACAAGIQPIIGCQLDIDMQD 81 (1033) Q Consensus 13 hLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~--G-----av~F~~----ackk~gIKPIiG~E~~v~~~~ 81 (1033) -||+||.|| +|...+.++|++|.+.|++.+|+|||.=.. + ..++.+ .-+...|+.+.|+|+.+.+.. T Consensus 4 D~H~HT~~s--dg~~~~~e~~~~A~~~g~~~~~iTdH~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~G~E~~~~~~~ 81 (237) T COG1387 4 DLHTHTVFS--DGEATPEEMVEAAIELGLEYIAITDHAPFLRVGLDAELLKYFIEEIRELKKEYDIKILIGIEVDILPDG 81 (237) T ss_pred CCCCCCCCC--CCCCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCC T ss_conf 321256643--677899999999997399659996366443457768899999999999986448404765678751787 Q ss_pred CCCCCCCCCCCCCCCCEEEEEECCHHH-----HHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHH Q ss_conf 643344565666677539999569689-----999999998997324877874007899986348988998197679678 Q 537021.9.peg.7 82 NLKMTNQGKTPFPTVPSIVLLVSTAEG-----YQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPID 156 (1033) Q Consensus 82 ~~~~~~~~~~~~~~~~~lvlLAkN~~G-----y~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~ 156 (1033) ..+..+..... ..+++..-..-+ .....+.+..+. +.+-+-..|.++++.- T Consensus 82 ~~d~~~~~~~~----lD~vi~svH~~~~~~~~~~~~~~~~~~a~--------------------~~~~v~il~H~~~~~~ 137 (237) T COG1387 82 SLDFLDEILKE----LDYVIASVHELNFEDQDEEDYTERLIAAM--------------------SNGAVDILAHPGGRLL 137 (237) T ss_pred CCCCCHHHHHH----CCEEEEEEECCCCCCCCHHHHHHHHHHHH--------------------CCCCCCEEECCCCCEE T ss_conf 75644546743----69599997158766636999999999997--------------------3887547726887513 Q ss_pred HHHHCCCHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHC Q ss_conf 88877997899999999987659868999943991158999999999998189899808830078788999987668846 Q 537021.9.peg.7 157 RAFFFNSSQVAEKRLLTFKKLFGDRLYVNLQRHRGYDRYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAH 236 (1033) Q Consensus 157 ~~l~~~~~~~a~~~l~~l~~~Fgd~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~ 236 (1033) .-...+.+....+.+-.+...+ +-.+|+..... ....+..+++++++.|++++...|.|....-....+.+..+.+. T Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~--g~aleins~~~-~~~~~~~~~~~~~e~G~~~~i~tDaH~~~~lg~~~~~~~~~~~a 214 (237) T COG1387 138 GRIDRGAYKEDIEELIELAEKN--GKALEINSRPG-RLDPNSEILRLARELGVKLAIGTDAHRPGDLGDMYFGVKIARRA 214 (237) T ss_pred EEECCCCCHHHHHHHHHHHHHH--CCEEEECCCCC-CCCCCHHHHHHHHHCCCEEEEECCCCCHHHCCCCHHHHHHHHHC T ss_conf 2322452057899999999985--96898538856-66760999999997195799735546700125542869999872 Q ss_pred CCC Q ss_conf 985 Q 537021.9.peg.7 237 STV 239 (1033) Q Consensus 237 ~~~ 239 (1033) +.. T Consensus 215 ~~~ 217 (237) T COG1387 215 GLT 217 (237) T ss_pred CCC T ss_conf 687 No 31 >PRK08392 hypothetical protein; Provisional Probab=98.61 E-value=2.8e-07 Score=80.06 Aligned_cols=176 Identities=13% Similarity=0.109 Sum_probs=95.1 Q ss_pred HHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHH---HHHHHHHHHH----HCCCCEEEEEEEEEECCCCCC Q ss_conf 334331444764432998999999997899889981165177---4999999999----769963899999985476643 Q 537021.9.peg.7 12 VHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLF---SALEFSQKAC----AAGIQPIIGCQLDIDMQDNLK 84 (1033) Q Consensus 12 vhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~---Gav~F~~ack----k~gIKPIiG~E~~v~~~~~~~ 84 (1033) |-||+||.|| +|.++++|++++|++.|++.+|||||+-.. ....+....+ +.+|+-..|+|.++-++. .+ T Consensus 1 iDlHtHT~~S--dg~~ti~E~~~aA~~~Gl~~i~ITDH~~~~~~~~~~~~~~~i~~~~~~~~i~iL~GiE~nIl~g~-lD 77 (215) T PRK08392 1 MDLHTHTVYS--DGIGSVRDNVAWAEKRGLKLLGISDHIHYFTPSKFNRYINEIRRWGEESEIVVLAGIEANITENG-PD 77 (215) T ss_pred CCCCCCCCCC--CCCCCHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCC-CC T ss_conf 9665587844--67787999999999879948998069998882899999999998544069779997763504898-78 Q ss_pred CCCCCCCCCC---CCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHC Q ss_conf 3445656666---7753999956968999999999899732487787400789998634898899819767967888877 Q 537021.9.peg.7 85 MTNQGKTPFP---TVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFF 161 (1033) Q Consensus 85 ~~~~~~~~~~---~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~ 161 (1033) ..+......+ .+.|-.+...+.+-|-++ +..+-. + +.+-+.|.+.+.++. .. T Consensus 78 ~~~~~l~~lD~vIasvH~~~~~~~~e~~~~~---~~~A~~-n-------------------~~v~iigHp~~~~~~--~~ 132 (215) T PRK08392 78 ITDDFAKKLDYVIASVHEWFGRDTPHEYIEL---VKLALM-D-------------------ENVDIIGHFGNSFPW--IG 132 (215) T ss_pred CCHHHHHHCCEEEEEEEECCCCCCHHHHHHH---HHHHHH-C-------------------CCCCEEECCCCCCCC--CC T ss_conf 6799998699799998737889988999999---999996-7-------------------998899589987677--77 Q ss_pred CCH-HHHHHHHHHHHHHCCCCEEEEEECCC-HHHHHHHHHHHHHHHHHCCCEEEECCCCCCCH Q ss_conf 997-89999999998765986899994399-11589999999999981898998088300787 Q 537021.9.peg.7 162 NSS-QVAEKRLLTFKKLFGDRLYVNLQRHR-GYDRYRESQVVQLAYTHELPLVATNNSLFLSE 222 (1033) Q Consensus 162 ~~~-~~a~~~l~~l~~~Fgd~~ylEl~~~~-~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~ 222 (1033) -.. ...++.++.-++ .+..+|+.... .++ ..++++|+++|+|++..+|.|.... T Consensus 133 ~p~~~~~~~v~~~a~~---~~~~lEIns~~~~~~----~~~~~~a~~~Gv~i~i~SDAH~~~~ 188 (215) T PRK08392 133 YPTEEEIREVLELAEA---YGKAFEISSRYKVPD----LEFIRECIKRGIKLTFASDAHRPED 188 (215) T ss_pred CCHHHHHHHHHHHHHH---HCCEEECCCCCCCCC----HHHHHHHHHCCCEEEEECCCCCCCC T ss_conf 7558899999999999---698774159888998----9999999984996999789998454 No 32 >COG0613 Predicted metal-dependent phosphoesterases (PHP family) [General function prediction only] Probab=98.49 E-value=2.1e-07 Score=81.15 Aligned_cols=68 Identities=25% Similarity=0.382 Sum_probs=62.0 Q ss_pred HHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH-HHHHHCCCCEEEEEEEEEECCC Q ss_conf 334331444764432998999999997899889981165177499999-9999769963899999985476 Q 537021.9.peg.7 12 VHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFS-QKACAAGIQPIIGCQLDIDMQD 81 (1033) Q Consensus 12 vhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~-~ackk~gIKPIiG~E~~v~~~~ 81 (1033) +-|||||.+| ||..+|.+++++|++.|+..+|||||+++.|..++. .+...-||+-|.|+|++..... T Consensus 4 ~DLHvHSt~S--dg~~~p~~vv~~A~~~g~~vlAiTDHdt~~g~~~a~~~~~~~l~i~vipG~Ei~t~~~~ 72 (258) T COG0613 4 ADLHVHTTAS--DGGLTPREVVERAKAKGVDVLAITDHDTVRGLLEARRAAGLRLGITVIPGIEISTTWGG 72 (258) T ss_pred EEEEEECCCC--CCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHHHHCCCCCEEECCEEEEECCCC T ss_conf 5433712678--98679899999999779977997686444660889998616688336444796413366 No 33 >PRK08609 hypothetical protein; Provisional Probab=98.45 E-value=5.7e-06 Score=68.98 Aligned_cols=46 Identities=11% Similarity=0.174 Sum_probs=27.1 Q ss_pred HCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCEEEEEHHH Q ss_conf 10023542025899988886368503552334210035319869850268 Q 537021.9.peg.7 792 MTLEMDNVEKIKKFCQDARQFNIQIMPPSVNTPCVDFKVGDNRIYYSLAA 841 (1033) Q Consensus 792 L~~~~~~~~~~~~~i~e~~~~gi~vl~PdIN~S~~~f~~~~~~Ir~Gl~~ 841 (1033) +|+.+..-|-...++..|++.|+++ .|| |+++-.-+=+.++||... T Consensus 497 INa~p~rlDl~~~~~~~A~~~gv~~---~i~-sDAH~~~~l~~~~~Gv~~ 542 (570) T PRK08609 497 LNANPNRLDLSAEHLKKAQAAGVKL---AIN-TDAHHTEMLDFMEYGVAA 542 (570) T ss_pred EECCCCCCCCCHHHHHHHHHCCCEE---EEE-CCCCCCCCCCHHHHHHHH T ss_conf 8099876798989999999869969---983-798981313138888999 No 34 >PRK07945 hypothetical protein; Provisional Probab=98.44 E-value=1.1e-05 Score=66.55 Aligned_cols=195 Identities=15% Similarity=0.142 Sum_probs=113.3 Q ss_pred HHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCH---HHH------HHHHHHHH----HH-CCCCEEEEEEEEEE Q ss_conf 343314447644329989999999978998899811651---774------99999999----97-69963899999985 Q 537021.9.peg.7 13 HLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNN---LFS------ALEFSQKA----CA-AGIQPIIGCQLDID 78 (1033) Q Consensus 13 hLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~n---l~G------av~F~~ac----kk-~gIKPIiG~E~~v~ 78 (1033) -||+||.|| ||..+++++++.|+++|+.-+|||||.- +.+ ..+...+- ++ .+|+-.-|+|+.+- T Consensus 99 DLH~HT~wS--DG~~sieeMa~aA~~lGyeYlaITDHS~~l~vA~GLs~erL~~Q~~eI~~ln~~~~~~rIL~GiEvDIL 176 (335) T PRK07945 99 DLHLHSDWS--DGGSPIEEMMATAAALGHEYCALTDHSPRLTVANGLSPERLRKQLDVIAELNERLAPFRILTGIEVDIL 176 (335) T ss_pred CCCCCCCCC--CCCCHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEEC T ss_conf 653378845--797869999999998398279980488534566897999999999999999854489279975353214 Q ss_pred CCCCCCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCC-CCCHHHHHHHCCCCEEEECCCCCCHHHH Q ss_conf 476643344565666677539999569689999999998997324877874-0078999863489889981976796788 Q 537021.9.peg.7 79 MQDNLKMTNQGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSV-RICLSWLQEIGTEGLIMLTGGSSGPIDR 157 (1033) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p-~i~~~~L~e~~~egLIvlsg~~~g~i~~ 157 (1033) .+...+..+...... .+++=+- -.+ |.... .+....+....+. .+-+.|.+.| + T Consensus 177 ~DGsLD~~DevL~~L----D~VVASV-Hs~----------------f~~~~~~mT~Ril~Ai~nP-~V~IlgHPTG---R 231 (335) T PRK07945 177 EDGSLDQEPELLDRL----DVVVASV-HSK----------------LRMDRAAMTRRMLAAVANP-HTDVLGHCTG---R 231 (335) T ss_pred CCCCCCCCHHHHHHC----CEEEEEE-CCC----------------CCCCHHHHHHHHHHHHHCC-CCCEEECCCC---C T ss_conf 898878998999757----9899970-134----------------5799999999999997299-8888537764---5 Q ss_pred HHH-----CCCHH-HHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHH Q ss_conf 887-----79978-999999999876598689999439911589999999999981898998088300787889999876 Q 537021.9.peg.7 158 AFF-----FNSSQ-VAEKRLLTFKKLFGDRLYVNLQRHRGYDRYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVL 231 (1033) Q Consensus 158 ~l~-----~~~~~-~a~~~l~~l~~~Fgd~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL 231 (1033) +++ +..++ .+++++..-.+. +..+||..+... -.....+++.|++.|+++...+|.|-...=|+--+-+. T Consensus 232 ll~gr~G~R~~~~vD~e~V~~aaae~---gtalEINa~P~R-LDL~d~~~r~A~~~Gv~~sI~TDAH~~~~L~~m~yGv~ 307 (335) T PRK07945 232 LVTGNRGIRPESKFDAEAVFTACREH---GTAVEINSRPER-RDPPTRLLRLALDAGCVFSIDTDAHAPGQLDFLGYGCE 307 (335) T ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHH---CCEEEECCCCCC-CCCCHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHH T ss_conf 03576787887657999999999997---978997799746-79998999999987996998589898113056999999 Q ss_pred HHHHCCC Q ss_conf 6884698 Q 537021.9.peg.7 232 MAVAHST 238 (1033) Q Consensus 232 ~aI~~~~ 238 (1033) +|-+-+. T Consensus 308 ~Arrgwl 314 (335) T PRK07945 308 RALDAGV 314 (335) T ss_pred HHHHCCC T ss_conf 9997299 No 35 >PRK09248 putative hydrolase; Validated Probab=98.41 E-value=8.5e-07 Score=75.99 Aligned_cols=177 Identities=12% Similarity=0.103 Sum_probs=99.0 Q ss_pred CHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCH-HHHHH---HHHHH----HHHCCCCEEEEEEEEEEC-CC Q ss_conf 13343314447644329989999999978998899811651-77499---99999----997699638999999854-76 Q 537021.9.peg.7 11 FVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNN-LFSAL---EFSQK----ACAAGIQPIIGCQLDIDM-QD 81 (1033) Q Consensus 11 FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~n-l~Gav---~F~~a----ckk~gIKPIiG~E~~v~~-~~ 81 (1033) .+-||+||.+| -.|.+++++++++|++.|+..+|||||+- |.|.. .|... -.-.||+-..|+|+++.+ +. T Consensus 4 ~~DlH~HT~~S-~ha~~ti~E~~~aA~~~Gl~~i~iTDH~p~~~~~~~~~~~~~~~~~~r~~~~i~Il~GiE~dIl~~~G 82 (246) T PRK09248 4 LVDTHTHTIAS-GHAYSTLHENAAEAKQKGIKLFAITDHGPDMPGAPHYWHFINLRVLPRFVDGVGILRGIEANIKNIDG 82 (246) T ss_pred CEEECCCCCCC-CCCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCC T ss_conf 55857788876-89979699999999986996899898996668971067886699989876174177544414468888 Q ss_pred CCCCCCCCCCCCCCCCEEEEEECCHHHHH------HHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHH Q ss_conf 64334456566667753999956968999------999999899732487787400789998634898899819767967 Q 537021.9.peg.7 82 NLKMTNQGKTPFPTVPSIVLLVSTAEGYQ------RLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPI 155 (1033) Q Consensus 82 ~~~~~~~~~~~~~~~~~lvlLAkN~~Gy~------nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i 155 (1033) ..+........ ..+++-.--..+|. +--+++. .. ..+.+-+.|.++.+- T Consensus 83 ~lD~~~~~l~~----lD~vIaSvH~~~~~~~~~~~~t~~~i~--------------------ai-~n~~vdiigHp~~~~ 137 (246) T PRK09248 83 EIDLPGSMLKK----LDIVIAGFHEPVFAPGDKETNTQAMIN--------------------AI-KSGRVDIIGHPGNPK 137 (246) T ss_pred CCCCHHHHHHH----CCEEEEEEECCCCCCCCHHHHHHHHHH--------------------HH-HCCCCCEEECCCCCC T ss_conf 74851989963----999999831444687379999999999--------------------98-579976898999998 Q ss_pred HHHHHCCCHHHHHHHHHHHHHHCCCCEEEEEECCCH----HH-HHHHHHHHHHHHHHCCCEEEECCCCCCCHH Q ss_conf 888877997899999999987659868999943991----15-899999999999818989980883007878 Q 537021.9.peg.7 156 DRAFFFNSSQVAEKRLLTFKKLFGDRLYVNLQRHRG----YD-RYRESQVVQLAYTHELPLVATNNSLFLSEE 223 (1033) Q Consensus 156 ~~~l~~~~~~~a~~~l~~l~~~Fgd~~ylEl~~~~~----~e-~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~ 223 (1033) ... ..++++...++ .+..+||...+. .+ ......++++++++|+|++..+|.|....= T Consensus 138 ------~~~-D~e~v~~aa~e---~g~~lEIN~s~~r~~r~~~~~~~~~il~~a~~~Gv~i~i~SDAH~~~~v 200 (246) T PRK09248 138 ------YPI-DIEAVVKAAAE---HNVALEINNSSFKHSRKGSEDNCRAIAALCKKAGVWVALGSDAHIAFDI 200 (246) T ss_pred ------CCC-CHHHHHHHHHH---HCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHCCCEEEEECCCCCHHHH T ss_conf ------885-89999999998---4969999588666677876446899999999819939985898887775 No 36 >PRK07328 histidinol-phosphatase; Provisional Probab=98.36 E-value=2.2e-05 Score=64.03 Aligned_cols=40 Identities=15% Similarity=0.184 Sum_probs=35.8 Q ss_pred HHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHH Q ss_conf 33433144476443299899999999789988998116517 Q 537021.9.peg.7 12 VHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNL 52 (1033) Q Consensus 12 vhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl 52 (1033) +-+|+||.|| -+|..+++++|++|.+.|+..+|+|||.-+ T Consensus 3 ~D~H~HT~~s-~~a~~~~ee~v~~Ai~~Gl~~i~~TdH~p~ 42 (268) T PRK07328 3 VDSHLHTPLC-GHATGTPEEYVEEARAKGLAEIGFTDHLPM 42 (268) T ss_pred EEECCCCCCC-CCCCCCHHHHHHHHHHCCCCEEEECCCCCC T ss_conf 4745479988-998881999999999879998997279997 No 37 >cd04485 DnaE_OBF DnaE_OBF: A subfamily of OB folds corresponding to the C-terminal OB-fold nucleic acid binding domain of Thermus aquaticus and Escherichia coli type C replicative DNA polymerase III alpha subunit (DnaE). The DNA polymerase holoenzyme of E. coli contains two copies of this replicative polymerase, each of which copies a different DNA strand. This group also contains Bacillus subtilis DnaE. Replication in B. subtilis and Staphylococcus aureus requires two different type C polymerases, polC and DnaE, both of which are thought to be included in the DNA polymerase holoenzyme. At the B. subtilis replication fork, polC appears to be involved in leading strand synthesis and DnaE in lagging strand synthesis. Probab=98.15 E-value=3.6e-06 Score=70.64 Aligned_cols=38 Identities=32% Similarity=0.612 Sum_probs=36.8 Q ss_pred EEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 99999961121015888769999998497049999539 Q 537021.9.peg.7 996 RLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 996 ~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iFP 1033 (1033) +++|+|+++|.+.||+|++|+|++++|.+|.+|+++|| T Consensus 1 ~iaG~I~~~k~~~tk~G~~~a~~tleD~~g~~e~~~F~ 38 (84) T cd04485 1 TVAGLVTSVRRRRTKKGKRMAFVTLEDLTGSIEVVVFP 38 (84) T ss_pred CEEEEEEEEEEEECCCCCEEEEEEEECCCCCEEEEECH T ss_conf 98999999899855799989999994378749999969 No 38 >PRK00912 ribonuclease P protein component 3; Provisional Probab=98.03 E-value=5.9e-05 Score=60.39 Aligned_cols=70 Identities=13% Similarity=0.214 Sum_probs=52.7 Q ss_pred CCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCC Q ss_conf 88133433144476443299899999999789988998116517749999999997699638999999854766 Q 537021.9.peg.7 9 PSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDN 82 (1033) Q Consensus 9 ~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~ 82 (1033) +.|--||||+ || ||-+++.+++..|+++|+.++|||||.+-+.-..- ..+...||+.+-|+|+...+... T Consensus 4 ~~fyDl~vh~-~p--dG~~t~~ela~~A~~lGy~~iait~h~d~~~~~~~-~~~~~~~i~v~~gvei~~~dp~~ 73 (237) T PRK00912 4 PKFYDLNVHP-VP--DGYDTVLELAEEASHLGFSGICLSYHSDKYPDLKS-ILPETEDFEIFRGVEIVEENPSK 73 (237) T ss_pred CCCEECCCCC-CC--CCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCC-CCCCCCCCEEEEEEEEEECCHHH T ss_conf 7514425677-78--98747999999999869868998067533354333-35777785688567997249788 No 39 >PRK08123 histidinol-phosphatase; Reviewed Probab=97.96 E-value=0.00011 Score=58.17 Aligned_cols=68 Identities=15% Similarity=0.095 Sum_probs=49.7 Q ss_pred HHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHH--------------------HHHHHHHHHHH---CCCCE Q ss_conf 34331444764432998999999997899889981165177--------------------49999999997---69963 Q 537021.9.peg.7 13 HLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLF--------------------SALEFSQKACA---AGIQP 69 (1033) Q Consensus 13 hLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~--------------------Gav~F~~ackk---~gIKP 69 (1033) -+|+||.||-.++-.+++++|++|.+.|+..+|+|||.-+. .-++-.+.+++ ..|+- T Consensus 4 D~H~HT~~s~h~~~~~lee~v~~Ai~~Gl~~ig~TdH~p~~~~~~~~~~~~~~~m~~~~~~~Y~~~i~~lkekY~~~I~I 83 (266) T PRK08123 4 DGHTHTPFCPHGSKDSLEAYIERAIELGFTEITFTEHAPLPPGFIDPTPRQDSAMAMEQLERYIKELNELKKKYKGQIKI 83 (266) T ss_pred EECCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEE T ss_conf 44558899999888759999999998799989974799886433576511023478889999999999999982678818 Q ss_pred EEEEEEEEECC Q ss_conf 89999998547 Q 537021.9.peg.7 70 IIGCQLDIDMQ 80 (1033) Q Consensus 70 IiG~E~~v~~~ 80 (1033) -+|+|+...++ T Consensus 84 ~~GiE~dy~~~ 94 (266) T PRK08123 84 RIGLEVDYIEG 94 (266) T ss_pred EEEEEECCCCC T ss_conf 99887444676 No 40 >TIGR01259 comE comEA protein; InterPro: IPR004787 The comE locus is obligatory for bacterial cell competence - the process of internalizing the exogenous added DNA. comEA and comEC are required for transformability, whereas the products of comEB and of the overlapping comER, which is transcribed in the reverse direction, are dispensable . ComEA has been shown to be an integral membrane protein, as predicted from hydropathy analysis, with its C-terminal domain outside the cytoplasmic membrane. This C-terminal domain possesses a sequence with similarity to those of several proteins known to be involved in nucleic acid transactions including UvrC and a human protein that binds to the replication origin of the Human herpesvirus 4 (Epstein-Barr virus) .. Probab=96.76 E-value=0.0019 Score=47.63 Aligned_cols=47 Identities=30% Similarity=0.548 Sum_probs=41.1 Q ss_pred EEHHHHCCCCHHHHHHHHHHH-HCCCCCCHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 502685896989999999998-37999997899973482479989999888 Q 537021.9.peg.7 837 YSLAAIKGVGTTTARHIMEAS-ADKPFDSLEDFCSRVDSKKLNRRVLESLV 886 (1033) Q Consensus 837 ~Gl~~Ikgvg~~~~~~Ive~r-~~g~f~sl~df~~rv~~~~~nk~~le~Li 886 (1033) -=|.+|+|||.+-|++|++.| +||.|+|++|+. +| +++..+++|.|- T Consensus 72 ~EL~~l~GiGP~kA~aIi~YRe~nG~F~SvddL~-kV--sGIG~k~~eKLk 119 (124) T TIGR01259 72 EELQALPGIGPAKAKAIIEYREENGAFKSVDDLT-KV--SGIGEKSLEKLK 119 (124) T ss_pred HHHHHCCCCCCHHHHHHHHHHHHCCCCCCHHHHH-CC--CCCCHHHHHHHH T ss_conf 9986369998133799999998569977755500-35--788546687420 No 41 >PRK05588 histidinol-phosphatase; Provisional Probab=96.46 E-value=0.0033 Score=45.64 Aligned_cols=37 Identities=16% Similarity=0.390 Sum_probs=33.1 Q ss_pred HHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCH Q ss_conf 343314447644329989999999978998899811651 Q 537021.9.peg.7 13 HLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNN 51 (1033) Q Consensus 13 hLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~n 51 (1033) -.|+||.|| .+|..+++++|++|.+.|+.-+ +|||.- T Consensus 4 D~H~HT~~s-~da~~~~ee~v~~Ai~~Gl~~i-~TdH~d 40 (256) T PRK05588 4 DTHIHTEFS-TDSKMKIEEAIKKAKKNNLGII-ITEHMD 40 (256) T ss_pred EECCCCCCC-CCCCCCHHHHHHHHHHCCCCEE-EECCCC T ss_conf 763689998-9987769999999998699889-966888 No 42 >COG1555 ComEA DNA uptake protein and related DNA-binding proteins [DNA replication, recombination, and repair] Probab=96.46 E-value=0.0059 Score=43.53 Aligned_cols=45 Identities=31% Similarity=0.595 Sum_probs=38.9 Q ss_pred HHHHCCCCHHHHHHHHHHH-HCCCCCCHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 2685896989999999998-37999997899973482479989999888 Q 537021.9.peg.7 839 LAAIKGVGTTTARHIMEAS-ADKPFDSLEDFCSRVDSKKLNRRVLESLV 886 (1033) Q Consensus 839 l~~Ikgvg~~~~~~Ive~r-~~g~f~sl~df~~rv~~~~~nk~~le~Li 886 (1033) |..++|||.+.+++|++.| ++|||+|++|+.++ ++++.+.+|.|- T Consensus 99 L~~lpgIG~~kA~aIi~yRe~~G~f~sv~dL~~v---~GiG~~~~ekl~ 144 (149) T COG1555 99 LQALPGIGPKKAQAIIDYREENGPFKSVDDLAKV---KGIGPKTLEKLK 144 (149) T ss_pred HHHCCCCCHHHHHHHHHHHHHCCCCCCHHHHCCC---CCCCHHHHHHHH T ss_conf 9886798999999999999973997657887107---777899999877 No 43 >COG1491 Predicted RNA-binding protein [Translation, ribosomal structure and biogenesis] Probab=95.92 E-value=0.0099 Score=41.60 Aligned_cols=38 Identities=37% Similarity=0.612 Sum_probs=34.1 Q ss_pred EHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCC Q ss_conf 02685896989999999998379999978999734824 Q 537021.9.peg.7 838 SLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSK 875 (1033) Q Consensus 838 Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~ 875 (1033) .|...+|||.+.+.+|+|+|+.+||+||+|+-.||... T Consensus 131 ~LELLpGiGkK~m~~IleERkkkpFeSFeDi~~Rv~~~ 168 (202) T COG1491 131 QLELLPGIGKKTMWAILEERKKKPFESFEDIKERVKGL 168 (202) T ss_pred HHHHCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHCCC T ss_conf 88753120499999999987428886689999880567 No 44 >PRK04036 DNA polymerase II small subunit; Validated Probab=95.68 E-value=0.091 Score=33.44 Aligned_cols=12 Identities=25% Similarity=0.326 Sum_probs=4.9 Q ss_pred CHHHHHHHCCCH Q ss_conf 444764432998 Q 537021.9.peg.7 18 SSYSLLEGALSL 29 (1033) Q Consensus 18 S~ySlldg~~~~ 29 (1033) +.|.++.+...+ T Consensus 20 eA~~~L~~~~~~ 31 (497) T PRK04036 20 EAYELLKEQDDP 31 (497) T ss_pred HHHHHHHHCCCH T ss_conf 899999718977 No 45 >pfam01336 tRNA_anti OB-fold nucleic acid binding domain. This family contains OB-fold domains that bind to nucleic acids. The family includes the anti-codon binding domain of lysyl, aspartyl, and asparaginyl -tRNA synthetases (See pfam00152). Aminoacyl-tRNA synthetases catalyse the addition of an amino acid to the appropriate tRNA molecule EC:6.1.1.-. This family also includes part of RecG helicase involved in DNA repair. Replication factor A is a heterotrimeric complex, that contains a subunit in this family. This domain is also found at the C-terminus of bacterial DNA polymerase III alpha chain. Probab=95.56 E-value=0.026 Score=38.08 Aligned_cols=35 Identities=20% Similarity=0.451 Sum_probs=31.4 Q ss_pred EEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 999999961121015888769999998497049999539 Q 537021.9.peg.7 995 IRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 995 v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iFP 1033 (1033) +++.|.|.+++ ++|++|+|++++|.+|.+++++|| T Consensus 1 v~v~G~V~~~~----~~~~~~~~~~l~D~tg~i~~~~~~ 35 (75) T pfam01336 1 VTVAGRVTSVR----RSGGKVAFLTLRDGTGSIQVVLFK 35 (75) T ss_pred CEEEEEEEEEE----ECCCCEEEEEEEECCEEEEEEEEH T ss_conf 99999999518----879998999999799399999952 No 46 >pfam04919 DUF655 Protein of unknown function, DUF655. This family includes several uncharacterized archaeal proteins. Probab=95.45 E-value=0.019 Score=39.19 Aligned_cols=36 Identities=36% Similarity=0.633 Sum_probs=32.8 Q ss_pred EHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCC Q ss_conf 026858969899999999983799999789997348 Q 537021.9.peg.7 838 SLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVD 873 (1033) Q Consensus 838 Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~ 873 (1033) .|.-+.|||.+.+..|+|+|+.+||+|++|+-.||. T Consensus 117 ~leLLPGIGkK~~~~ileeR~~~~FeSFedi~~Rv~ 152 (181) T pfam04919 117 QLELLPGIGKKMMWAILEERKKKPFESFEDIKERVK 152 (181) T ss_pred HHHHCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHC T ss_conf 887533403899999999965599889999998835 No 47 >TIGR00426 TIGR00426 competence protein ComEA helix-hairpin-helix repeat region; InterPro: IPR004509 This domain is found in competence protein ComEA and closely related proteins from a number of species that exhibit competence for transformation by exongenous DNA, including Streptococcus pneumoniae, Bacillus subtilis, Neisseria meningitidis, and Haemophilus influenzae. This domain represents a region of two tandem copies of a helix-hairpin-helix domain, each about 30 residues in length. Limited sequence similarity can be found among some members of this family N-terminal to this domain.. Probab=95.04 E-value=0.031 Score=37.44 Aligned_cols=42 Identities=29% Similarity=0.472 Sum_probs=35.6 Q ss_pred HHHCCCCHHHHHHHHHHH-HCCCCCCHHHHHHHCCCCCCCHHHHHH Q ss_conf 685896989999999998-379999978999734824799899998 Q 537021.9.peg.7 840 AAIKGVGTTTARHIMEAS-ADKPFDSLEDFCSRVDSKKLNRRVLES 884 (1033) Q Consensus 840 ~~Ikgvg~~~~~~Ive~r-~~g~f~sl~df~~rv~~~~~nk~~le~ 884 (1033) .+|.|||.+-|++||..| ++|+|.+++|+.. | +++.+..+|. T Consensus 21 ~~~~GvG~kKAeAIv~YREe~G~F~t~Edl~~-V--~GiG~~~~Ek 63 (70) T TIGR00426 21 KALSGVGAKKAEAIVAYREEYGRFKTVEDLKK-V--SGIGEKLLEK 63 (70) T ss_pred HHHCCCCHHHHHHHHHHHHCCCCCCCHHHHHH-C--CCCCHHHHHH T ss_conf 76428872378999887532779576222321-4--7876245555 No 48 >TIGR01856 hisJ_fam histidinol phosphate phosphatase HisJ family; InterPro: IPR010140 This entry represents the histidinol phosphate phosphatase, HisJ of Bacillus subtilis, and related proteins from a number of species within a larger family of phosphatases in the PHP hydrolase family. HisJ catalyses the penultimate step of histidine biosynthesis but shows no homology to the functionally equivalent sequence in Escherichia coli, a domain of the bifunctional HisB protein. Note, however, that many species have two members and that Clostridium perfringens, predicted not to make histidine, has five members of this family; not all members of this family may act on HisJ.; GO: 0004401 histidinol-phosphatase activity, 0000105 histidine biosynthetic process. Probab=94.51 E-value=0.025 Score=38.24 Aligned_cols=37 Identities=14% Similarity=0.218 Sum_probs=33.1 Q ss_pred HHHHC-HHHHHHHCCCHHHHHHHHHHCCCCEEEEECCC Q ss_conf 43314-44764432998999999997899889981165 Q 537021.9.peg.7 14 LQVHS-SYSLLEGALSLNNILDKIAADQQPAIAITDTN 50 (1033) Q Consensus 14 Lh~hS-~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~ 50 (1033) -|.|| +||..-+.-.++++|+.|.+.|+..+.+|||- T Consensus 3 ~H~HsG~Fc~Hg~~~~~~~~v~~Ai~~gf~~~cfTEH~ 40 (301) T TIGR01856 3 SHSHSGPFCAHGSTDTLEEVVQEAIQKGFEEICFTEHA 40 (301) T ss_pred CCEECCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCC T ss_conf 73003577889998888999999997078812210368 No 49 >PRK07329 hypothetical protein; Provisional Probab=94.04 E-value=0.095 Score=33.31 Aligned_cols=34 Identities=18% Similarity=0.407 Sum_probs=29.4 Q ss_pred HHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHH Q ss_conf 433144476443299899999999789988998116517 Q 537021.9.peg.7 14 LQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNL 52 (1033) Q Consensus 14 Lh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl 52 (1033) .|+||.||. ||..+++++|++|.. .+++|||-.+ T Consensus 4 ~H~HT~fS~-ds~~~~e~~i~~a~~----~i~~TdH~d~ 37 (246) T PRK07329 4 QHLHTHFSF-DSDAKFEDYLTHFKG----EIVTTEHLDL 37 (246) T ss_pred CCCCCCCCC-CCCCCHHHHHHHHHC----CEEEEECCCC T ss_conf 357899989-877589999998514----8799755889 No 50 >cd04492 YhaM_OBF_like YhaM_OBF_like: A subfamily of OB folds similar to that found in Bacillus subtilis YhaM and Staphylococcus aureus cmp-binding factor-1 (SaCBF1). Both these proteins are 3'-to-5'exoribonucleases. YhaM requires Mn2+ or Co2+ for activity and is inactive in the presence of Mg2+. YhaM also has a Mn2+ dependent 3'-to-5'single-stranded DNA exonuclease activity. SaCBF is also a double-stranded DNA binding protein, binding specifically to cmp, the replication enhancer found in S. aureus plasmid pT181. Proteins in this group combine an N-terminal OB fold with a C-terminal HD domain. The HD domain is found in metal-dependent phosphohydrolases. Probab=93.97 E-value=0.094 Score=33.32 Aligned_cols=34 Identities=21% Similarity=0.343 Sum_probs=31.5 Q ss_pred EEEEEEEEECCCCCCEEEEEEEECCCCEEEEEEC Q ss_conf 9996112101588876999999849704999953 Q 537021.9.peg.7 999 AMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 999 G~V~~i~~~~TKkG~~mafltleD~~g~~e~~iF 1032 (1033) .+|.+.+.+.||+|++|..++|.|.||.+++.+| T Consensus 4 fli~~~~~~~tk~G~~Yl~l~L~D~tG~I~ak~W 37 (83) T cd04492 4 FLIKSKELRTAKNGKPYLALTLQDKTGEIEAKLW 37 (83) T ss_pred EEEEEHHHHCCCCCCCEEEEEEECCCCEEEEEEC T ss_conf 9999801000579991789999878876888885 No 51 >cd04317 EcAspRS_like_N EcAspRS_like_N: N-terminal, anticodon recognition domain of the type found in Escherichia coli aspartyl-tRNA synthetase (AspRS), the human mitochondrial (mt) AspRS-2, the discriminating (D) Thermus thermophilus AspRS-1, and the nondiscriminating (ND) Helicobacter pylori AspRS. These homodimeric enzymes are class2b aminoacyl-tRNA synthetases (aaRSs). This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Eukaryotes contain 2 sets of aaRSs, both of which are encoded by the nuclear genome. One set concerns with cytoplasmic synthesis, wh Probab=93.21 E-value=0.3 Score=29.09 Aligned_cols=46 Identities=13% Similarity=0.217 Sum_probs=34.4 Q ss_pred CCHHHHHCCCCCCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEE Q ss_conf 768898505589889999999611210158887699999984970499995 Q 537021.9.peg.7 981 KGYEESVATIRGNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANF 1031 (1033) Q Consensus 981 ~~~~~~~~~~~g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~i 1031 (1033) +.+.++.....|++|+++|+|.+++.. |+ ++|+.|-|.+|.+.+++ T Consensus 3 h~~~el~~~~~G~~V~v~GwV~~~R~~----gk-i~Fi~LRD~tg~iQ~v~ 48 (135) T cd04317 3 HYCGELRESHVGQEVTLCGWVQRRRDH----GG-LIFIDLRDRYGIVQVVF 48 (135) T ss_pred EEHHHCCHHHCCCEEEEEEEEEEEEEC----CC-EEEEEEECCCCEEEEEE T ss_conf 354218958899999999999999717----97-99999852883588999 No 52 >COG1379 PHP family phosphoesterase with a Zn ribbon [General function prediction only] Probab=93.07 E-value=0.77 Score=25.62 Aligned_cols=185 Identities=19% Similarity=0.215 Sum_probs=97.9 Q ss_pred HHHHHHCHHHHHHH-CCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHH---CCCCEEEEEEEEEECCCCCCCCC Q ss_conf 33433144476443-299899999999789988998116517749999999997---69963899999985476643344 Q 537021.9.peg.7 12 VHLQVHSSYSLLEG-ALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACA---AGIQPIIGCQLDIDMQDNLKMTN 87 (1033) Q Consensus 12 vhLh~hS~ySlldg-~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk---~gIKPIiG~E~~v~~~~~~~~~~ 87 (1033) +-||.||.||..-| ...++.+++.|+.-|+.-|+--|- .--...+-.+.... .|..-+-|+.+-.+.+. . T Consensus 5 aDLHiHs~ys~a~S~~m~~~~Ia~~A~~KGl~lvGtgD~-~hP~w~e~ik~~~e~~e~g~~e~~~~~fi~t~ev----E- 78 (403) T COG1379 5 ADLHIHSHYSGATSKLMVLPNIAEYAKLKGLDLVGTGDC-LHPEWLEEIKKSIESDEDGTFEVKGVRFILTAEV----E- 78 (403) T ss_pred CCEEEEECCCCCCCCCCCHHHHHHHHHHCCCCEECCCCC-CCHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEEE----C- T ss_conf 420464011332465413487776655346644513555-7868999999876502346655430489887751----3- Q ss_pred CCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCH--HHHHHHCC--CCEEEECCCCCCHHHHHHHCCC Q ss_conf 5656666775399995696899999999989973248778740078--99986348--9889981976796788887799 Q 537021.9.peg.7 88 QGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICL--SWLQEIGT--EGLIMLTGGSSGPIDRAFFFNS 163 (1033) Q Consensus 88 ~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~--~~L~e~~~--egLIvlsg~~~g~i~~~l~~~~ 163 (1033) .... -|.+++--+-.--..|...++....+=.....||+.. ..|++..+ +|||.-..+. -+...+ .+ T Consensus 79 ----d~~r-VHhLlilPSl~~~~El~~~l~~~skni~~~grprv~~tg~el~e~v~dlggL~gPaHaF-tPwtsl--Yk- 149 (403) T COG1379 79 ----DSRR-VHHLLILPSLSAAEELSEWLSKYSKNIETEGRPRVYLTGAELAEIVKDLGGLIGPAHAF-TPWTSL--YK- 149 (403) T ss_pred ----CCCC-EEEEEECCCHHHHHHHHHHHHHHHCCCCCCCCCEEECCHHHHHHHHHHCCCCEECCCCC-CCCHHH--HH- T ss_conf ----6772-46999718588999999999874147643588555236899999998726604032216-861776--63- Q ss_pred HHHHHHHHHHHHHHCCCCE--EEEEECCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCH Q ss_conf 7899999999987659868--9999439911589999999999981898998088300787 Q 537021.9.peg.7 164 SQVAEKRLLTFKKLFGDRL--YVNLQRHRGYDRYRESQVVQLAYTHELPLVATNNSLFLSE 222 (1033) Q Consensus 164 ~~~a~~~l~~l~~~Fgd~~--ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~ 222 (1033) .-..+.+.||+.. |+||-. ..|..... .++.-..+|..-.+|.|-..+ T Consensus 150 ------~~dSl~e~yg~a~iDfvELGL--SADtdmAD---~I~el~~~pFLtNSDAHSp~p 199 (403) T COG1379 150 ------KYDSLKECYGDAMIDFVELGL--SADTDMAD---MIEELHRLPFLTNSDAHSPYP 199 (403) T ss_pred ------HHCHHHHHHCCCCHHHHHHCC--CCCCHHHH---HHHHHCCCCCCCCCCCCCCCH T ss_conf ------021198871743111777446--45714789---999843488633666678763 No 53 >PRK02983 lysS lysyl-tRNA synthetase; Provisional Probab=92.66 E-value=0.61 Score=26.50 Aligned_cols=45 Identities=22% Similarity=0.414 Sum_probs=31.0 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH Q ss_conf 89999986249999999999999997500013310147888999999999 Q 537021.9.peg.7 730 ERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWM 779 (1033) Q Consensus 730 ~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~ayl 779 (1033) +.|.+..+--|++ ++|+.-.-|-+=+.-..|.-=|.++=+++||. T Consensus 833 ELflKRLmVGGfe-----RVFEI~RcFRNEglradHnPEFTmLE~Y~Ay~ 877 (1099) T PRK02983 833 ELYLKRLCVGGVE-----RVFELGRAFRNEGVDFSHNPEFTLLEAYQAHA 877 (1099) T ss_pred HHHHHHHHHCCCC-----CCEEECCCCCCCCCCCCCCCCHHHHHHHHHHC T ss_conf 8999999864863-----11343254467999987494079999999847 No 54 >cd04316 ND_PkAspRS_like_N ND_PkAspRS_like_N: N-terminal, anticodon recognition domain of the type found in the homodimeric non-discriminating (ND) Pyrococcus kodakaraensis aspartyl-tRNA synthetase (AspRS). This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. P. kodakaraensis AspRS is a class 2b aaRS. aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. P. kodakaraensis ND-AspRS can charge both tRNAAsp and tRNAAsn. Some of the enzymes in this group may be discriminating, based on the presence of homologs of asparaginyl-tRNA synthetase (AsnRS) in their completed genomes. Probab=92.64 E-value=0.27 Score=29.50 Aligned_cols=45 Identities=16% Similarity=0.203 Sum_probs=35.8 Q ss_pred HHHHHCCCCCCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEEC Q ss_conf 88985055898899999996112101588876999999849704999953 Q 537021.9.peg.7 983 YEESVATIRGNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 983 ~~~~~~~~~g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iF 1032 (1033) +.++.....|++|++.|+|.++|.. | .++|+.|-|.+|.+.+++- T Consensus 3 ~g~l~~~~~G~~V~i~Gwv~~~R~~----g-~~~Fi~LRD~~G~vQ~v~~ 47 (108) T cd04316 3 SAEITPELDGEEVTVAGWVHEIRDL----G-GIKFVILRDREGIVQVTAP 47 (108) T ss_pred CCCCCHHHCCCEEEEEEEEEEEECC----C-CEEEEEEECCCEEEEEEEE T ss_conf 4759838999999998999823217----9-8799999809877999993 No 55 >cd04490 PolII_SU_OBF PolII_SU_OBF: A subfamily of OB folds corresponding to the OB fold found in Pyrococcus abyssi DNA polymerase II (PolII) small subunit. PolII is a family D DNA polymerase, having a 3-prime to 5-prime exonuclease activity. P. abyssi PolII is heterodimeric. The large subunit appears to be the polymerase, and the small subunit may be the exonuclease. The small subunit contains a calcineurin-like phosphatase superfamily domain C-terminal to this OB-fold domain. Probab=92.49 E-value=0.29 Score=29.26 Aligned_cols=35 Identities=17% Similarity=0.304 Sum_probs=28.7 Q ss_pred EEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEEC Q ss_conf 899999996112101588876999999849704999953 Q 537021.9.peg.7 994 NIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 994 ~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iF 1032 (1033) ++.++|+|..++ .||||+. .++|||.||.+.+++. T Consensus 1 ev~iIGmV~d~r--~TknG~~--~~~lED~TG~i~vl~~ 35 (79) T cd04490 1 EVSIIGMVNDVR--STKNGHR--IVELEDTTGRITVLLT 35 (79) T ss_pred CEEEEEEECCCE--ECCCCCE--EEEEECCCCEEEEEEE T ss_conf 969999993327--6168878--9999879998999996 No 56 >PRK10917 ATP-dependent DNA helicase RecG; Provisional Probab=90.75 E-value=1 Score=24.63 Aligned_cols=106 Identities=13% Similarity=0.081 Sum_probs=57.9 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCC------CCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHH Q ss_conf 9997302666777510023542025899988886368503------5523342100353198698502685896989999 Q 537021.9.peg.7 778 WMKTHYPVEFLAASMTLEMDNVEKIKKFCQDARQFNIQIM------PPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTAR 851 (1033) Q Consensus 778 ylK~~yP~eF~aa~L~~~~~~~~~~~~~i~e~~~~gi~vl------~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~ 851 (1033) +||.++|. +-.+++... +..+.....+.+-+.-.+.|| -.-|+-..+..-+-+++=||||+++-.+...+- T Consensus 494 ~l~~~~~~-~~v~~~hG~-m~~~ek~~~m~~F~~g~~~iLvsTtviEvGvdvpna~~mvi~~aerfGlsqLhQLRGRVg- 570 (677) T PRK10917 494 ELQKALPE-LRVGLLHGR-MKPAEKDAVMAAFKAGEIDILVATTVIEVGVDVPNATVMVIENAERFGLAQLHQLRGRVG- 570 (677) T ss_pred HHHHHCCC-CEEEEECCC-CCHHHHHHHHHHHHCCCCCEEEECCEEECCCCCCCCCEEEEECCCCCCHHHHHHHCCCCC- T ss_conf 99844899-759983078-987899999999983999999989897558678888589997701053678877427436- Q ss_pred HHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCC Q ss_conf 99999837999997899973482479989999888676723479 Q 537021.9.peg.7 852 HIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLVFAGALDCFG 895 (1033) Q Consensus 852 ~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li~aGAfD~~~ 895 (1033) .|...|+--++..-+.+...++-|+.|... -|.|. T Consensus 571 -------Rg~~~~~c~l~~~~~~~~~~~~Rl~~~~~~--~dGf~ 605 (677) T PRK10917 571 -------RGAAQSYCVLLYKDPLSETARERLKIMRET--NDGFE 605 (677) T ss_pred -------CCCCCEEEEEEECCCCCHHHHHHHHHHHHH--CCCHH T ss_conf -------788845899983899997899999999985--87499 No 57 >cd04483 hOBFC1_like hOBFC1_like: A subfamily of OB folds similar to that found in human OB fold containing protein 1 (hOBFC1). Members of this group belong to the Replication protein A subunit 2 (RPA2) family of OB folds. RPA is a nuclear ssDNA binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). The OB fold domain of RPA2 has dual roles in ssDNA binding and trimerization. Probab=90.73 E-value=0.58 Score=26.64 Aligned_cols=32 Identities=9% Similarity=0.131 Sum_probs=27.9 Q ss_pred EEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 9999961121015888769999998497049999539 Q 537021.9.peg.7 997 LAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 997 iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iFP 1033 (1033) |.|+|++++++. .+...+++|.||.+++++|+ T Consensus 2 IlG~VVs~~ere-----~f~~y~vDDgTGcI~Cv~W~ 33 (92) T cd04483 2 ILGTVVSRRERE-----TFYSFGVDDGTGVVNCVCWK 33 (92) T ss_pred EEEEEEEEEEEC-----CEEEEEEECCCCEEEEEEEE T ss_conf 379999997605-----71699860898669989960 No 58 >pfam12228 DUF3604 Protein of unknown function (DUF3604). This family of proteins is found in bacteria. Proteins in this family are typically between 621 and 693 amino acids in length. Probab=90.13 E-value=0.33 Score=28.74 Aligned_cols=54 Identities=19% Similarity=0.260 Sum_probs=40.1 Q ss_pred CCCCCCHHHHHHCHHHH---HHH-CCCHHHHHHHHH--------------HCCCCEEEEECCCHHHHHHHHH Q ss_conf 68988133433144476---443-299899999999--------------7899889981165177499999 Q 537021.9.peg.7 6 EKSPSFVHLQVHSSYSL---LEG-ALSLNNILDKIA--------------ADQQPAIAITDTNNLFSALEFS 59 (1033) Q Consensus 6 ~~~~~FvhLh~hS~ySl---ldg-~~~~e~LV~~A~--------------~~G~~AiAITD~~nl~Gav~F~ 59 (1033) .|+.-|=-|||||.||+ +-| ...|++-...|+ ..-+..+|+|||.-..|.++-. T Consensus 3 ~kqi~FGDlHVHTtyS~DAfl~G~~~~P~dAYrFArGe~v~~~~G~~~ql~rPLDF~aVTDHAE~lG~~~~~ 74 (592) T pfam12228 3 PRNAYFGDLHLHTTLSFDAFAFGTRLTPDDAYRFAKGEPVTHPTGQPVQLRRPLDFLAVTDHAELLGLVRAL 74 (592) T ss_pred CCCCCCCCCCCCCCCCCHHHCCCCCCCHHHHHHHHCCCCCCCCCCCEEECCCCCCEEEECHHHHHHHHHHHH T ss_conf 886413664404786123334689999899999866895338998861218972234774303345268996 No 59 >cd04488 RecG_wedge_OBF RecG_wedge_OBF: A subfamily of OB folds corresponding to the OB fold found in the N-terminal (wedge) domain of Escherichia coli RecG. RecG is a branched-DNA-specific helicase, which catalyzes the interconversion of a DNA replication fork to a four-stranded (Holliday) junction in vivo and in vitro. This interconversion provides a route to repair stalled forks. The RecG monomer contains three domains. The N-terminal domain is named for its wedge structure, and may provide the specificity of RecG for binding branched-DNA structures. During the reversal of fork to Holliday junction, the wedge domain is fixed at the junction of the fork where the leading and lagging strand duplex arms meet, and is thought to promote the unwinding of the nascent leading and lagging strands. In order to form the Holliday junction, these nascent strands would be annealed, and the parental strands reannealed. The wedge domain may also be a processivity factor of RecG on these branched cha Probab=88.98 E-value=1.2 Score=24.00 Aligned_cols=36 Identities=31% Similarity=0.308 Sum_probs=30.7 Q ss_pred EEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEEC Q ss_conf 9999996112101588876999999849704999953 Q 537021.9.peg.7 996 RLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 996 ~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iF 1032 (1033) ++.|.|.+.+....+ +..+..++++|.+|.+++++| T Consensus 1 ti~g~I~~~~~~~~~-~r~~l~~~v~D~tg~i~l~~F 36 (75) T cd04488 1 TVEGTVVSVEVVPRR-GRRRLKVTLSDGTGTLTLVFF 36 (75) T ss_pred CEEEEEEEEEEECCC-CCEEEEEEEEECCCEEEEEEE T ss_conf 999999998854589-970899999969988999998 No 60 >PRK13122 consensus Probab=87.22 E-value=2.2 Score=21.79 Aligned_cols=13 Identities=23% Similarity=0.406 Sum_probs=4.3 Q ss_pred HHHHHHHHHHCCC Q ss_conf 9999999997699 Q 537021.9.peg.7 55 ALEFSQKACAAGI 67 (1033) Q Consensus 55 av~F~~ackk~gI 67 (1033) .-+|.+.|+++|+ T Consensus 92 ~~~F~~~~~~~Gv 104 (242) T PRK13122 92 EQAFFEKCRDTGV 104 (242) T ss_pred HHHHHHHHHHCCC T ss_conf 9999999987699 No 61 >KOG3056 consensus Probab=86.94 E-value=1.9 Score=22.28 Aligned_cols=29 Identities=7% Similarity=0.186 Sum_probs=15.6 Q ss_pred CEEEECCCCHHHHEECCCCCCCCCEEEEEECHHHHH Q ss_conf 403352566666200123676886688510213343 Q 537021.9.peg.7 530 AGIVIGDRPLSQLVPMYRDVRSDLPVTQFNMKWIEK 565 (1033) Q Consensus 530 ~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~~~~e~ 565 (1033) .|||.+..+ ..... .|.+..-|--.|+.+ T Consensus 191 ~GvI~~K~~------~K~t~-~G~~y~iwkL~dLk~ 219 (578) T KOG3056 191 MGVIVEKSD------PKFTS-NGNPYSIWKLTDLKD 219 (578) T ss_pred EEEEEECCC------CCCCC-CCCCEEEEEEEECCC T ss_conf 988862387------63134-788357898650576 No 62 >PRK06740 histidinol-phosphatase; Validated Probab=86.76 E-value=2.3 Score=21.60 Aligned_cols=16 Identities=19% Similarity=0.281 Sum_probs=8.6 Q ss_pred HHHHHHHHCCCCCCCC Q ss_conf 9999888676723479 Q 537021.9.peg.7 880 RVLESLVFAGALDCFG 895 (1033) Q Consensus 880 ~~le~Li~aGAfD~~~ 895 (1033) ..++.++++|-||-+| T Consensus 205 ~~ve~~A~SGLFDIiG 220 (338) T PRK06740 205 KTVECAVRSELFDIIA 220 (338) T ss_pred HHHHHHHHCCCCCEEC T ss_conf 9999999759972623 No 63 >CHL00200 trpA tryptophan synthase alpha subunit; Provisional Probab=86.55 E-value=2.4 Score=21.51 Aligned_cols=11 Identities=9% Similarity=-0.133 Sum_probs=3.5 Q ss_pred HHHHHHHHHCC Q ss_conf 99999999769 Q 537021.9.peg.7 56 LEFSQKACAAG 66 (1033) Q Consensus 56 v~F~~ackk~g 66 (1033) .+..++..++| T Consensus 32 ~~~~~~l~~~G 42 (263) T CHL00200 32 KKALKILDKKG 42 (263) T ss_pred HHHHHHHHHCC T ss_conf 99999999769 No 64 >PRK13115 consensus Probab=86.38 E-value=2.4 Score=21.45 Aligned_cols=43 Identities=12% Similarity=0.157 Sum_probs=18.4 Q ss_pred CCHHHHHHHHHHCCCCEEE--EE-CCCHHHHHHHHHHHHHHCCCCE Q ss_conf 9989999999978998899--81-1651774999999999769963 Q 537021.9.peg.7 27 LSLNNILDKIAADQQPAIA--IT-DTNNLFSALEFSQKACAAGIQP 69 (1033) Q Consensus 27 ~~~e~LV~~A~~~G~~AiA--IT-D~~nl~Gav~F~~ackk~gIKP 69 (1033) +++++..++.++.|-+|+- +| -.-++-...++..+..++|.-- T Consensus 9 sRi~~~F~~lk~~~k~alI~yitaG~P~~e~t~~~i~~l~~~GaDi 54 (269) T PRK13115 9 SRLGPAFDAARAEGRAALIGYLPAGFPDVDTSIAAMTALVESGCDI 54 (269) T ss_pred HHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCCE T ss_conf 3899999999976994488785273899899999999999669999 No 65 >cd04484 polC_OBF polC_OBF: A subfamily of OB folds corresponding to the N-terminal OB-fold nucleic acid binding domain of Bacillus subtilis type C replicative DNA polymerase III alpha subunit (polC). Replication in B. subtilis and Staphylococcus aureus requires two different polymerases, polC and DnaE. The holoenzyme is thought to include the two different polymerases. At the B. subtilis replication fork, polC appears to be involved in leading strand synthesis and DnaE in lagging strand synthesis. Probab=86.13 E-value=2.5 Score=21.35 Aligned_cols=39 Identities=13% Similarity=0.204 Sum_probs=35.6 Q ss_pred EEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEEC Q ss_conf 899999996112101588876999999849704999953 Q 537021.9.peg.7 994 NIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 994 ~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iF 1032 (1033) .|.|.|.|-.+..+.+|+|+....+.+.|.|+++-+-+| T Consensus 1 nV~i~G~IF~~E~relk~gk~i~~~~iTD~t~Si~~K~F 39 (82) T cd04484 1 NVVVEGEVFDLEIRELKSGRKILTFKVTDYTSSITVKKF 39 (82) T ss_pred CEEEEEEEEEEEEEECCCCCEEEEEEEEECCCCEEEEEE T ss_conf 989999999789897248989999999808897999998 No 66 >cd04478 RPA2_DBD_D RPA2_DBD_D: A subfamily of OB folds corresponding to the OB fold of the central ssDNA-binding domain (DBD)-D of human RPA2 (also called RPA32). RPA2 is a subunit of Replication protein A (RPA). RPA is a nuclear ssDNA-binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). The major DNA binding activity of RPA is associated with RPA1 DBD-A and DBD-B; RPA2 DBD-D is a weak ssDNA-binding domain. RPA2 DBD-D is also involved in trimerization. The ssDNA binding mechanism is believed to be multistep and to involve conformational change. N-terminal to human RPA2 DBD-D is a domain containing all the known phosphorylation sites of RPA. Human RPA2 is phosphorylated in a cell cycle dependent manner in response to DNA dam Probab=85.92 E-value=2.1 Score=21.96 Aligned_cols=33 Identities=15% Similarity=0.176 Sum_probs=24.4 Q ss_pred EEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEEC Q ss_conf 99999996112101588876999999849704999953 Q 537021.9.peg.7 995 IRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 995 v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iF 1032 (1033) |++.|.|.++....|+ .-.+++|.||.+++..| T Consensus 2 V~~VG~V~~v~~~~t~-----~~y~idDgTG~i~~~~w 34 (95) T cd04478 2 VTLVGVVRNVEEQSTN-----ITYTIDDGTGTIEVRQW 34 (95) T ss_pred EEEEEEEEEEEECCCE-----EEEEEECCCCCEEEEEE T ss_conf 8999999987772557-----99999879873999996 No 67 >PRK09532 DNA polymerase III subunit alpha; Reviewed Probab=85.00 E-value=0.64 Score=26.29 Aligned_cols=12 Identities=17% Similarity=0.357 Sum_probs=4.7 Q ss_pred HHHHHHHHCCCC Q ss_conf 999999976996 Q 537021.9.peg.7 57 EFSQKACAAGIQ 68 (1033) Q Consensus 57 ~F~~ackk~gIK 68 (1033) +..+.|++.|.+ T Consensus 23 eLv~~A~elG~~ 34 (874) T PRK09532 23 ALIDRAIELGMP 34 (874) T ss_pred HHHHHHHHCCCC T ss_conf 999999984999 No 68 >pfam06695 Sm_multidrug_ex Putative small multi-drug export protein. This family contains a small number of putative small multi-drug export proteins. Probab=84.53 E-value=1.9 Score=22.29 Aligned_cols=49 Identities=31% Similarity=0.421 Sum_probs=37.5 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCC Q ss_conf 999998877898986063055788899999851047643344442046677655440375 Q 537021.9.peg.7 336 DYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDI 395 (1033) Q Consensus 336 ~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~V 395 (1033) .+.+|.++--+-|++-||-+-+|-|. ||. ||-|.=.|||+||+||+..- T Consensus 57 ~~~~k~~k~~~~iekyg~~gL~lFVa----------IPL-PgTGawTGsl~A~llg~~~k 105 (121) T pfam06695 57 WLEERARKGSGKVEKYGFLGLILFVA----------IPL-PGTGAWTGSLAASLLGLDFK 105 (121) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH----------CCC-CCCCHHHHHHHHHHHCCCHH T ss_conf 99999985407899999999999995----------788-87508999999999799799 No 69 >PRK00484 lysS lysyl-tRNA synthetase; Reviewed Probab=84.07 E-value=3 Score=20.65 Aligned_cols=18 Identities=28% Similarity=0.290 Sum_probs=10.7 Q ss_pred CHHHHHHHHHHHHHHHCC Q ss_conf 988999876541034201 Q 537021.9.peg.7 306 EEENELRNKAVAGLEVRL 323 (1033) Q Consensus 306 ~~~~~Lr~l~~~Gl~~R~ 323 (1033) +++-||+.+.-.|+.+=| T Consensus 226 SPqLylk~l~vgG~ervf 243 (491) T PRK00484 226 APELYLKRLIVGGFERVF 243 (491) T ss_pred CHHHHHHHHHHCCCHHHE T ss_conf 878987778762720222 No 70 >TIGR02628 fuculo_kin_coli L-fuculokinase; InterPro: IPR013450 Proteins in this entry are L-fuculokinases from the clade that includes the Escherichia coli enzyme P11553 from SWISSPROT. They catalyze the second step in fucose catabolism and belong to the FGGY family of carbohydrate kinases. These enzymes are encoded by the kinase (K) gene of the fucose (fuc) operon.; GO: 0008737 L-fuculokinase activity, 0005975 carbohydrate metabolic process. Probab=83.77 E-value=0.71 Score=25.89 Aligned_cols=41 Identities=27% Similarity=0.490 Sum_probs=28.9 Q ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHCCCCCCCH Q ss_conf 9999750001331014788899999999997302666--------7775100235420 Q 537021.9.peg.7 751 ELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEF--------LAASMTLEMDNVE 800 (1033) Q Consensus 751 ~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF--------~aa~L~~~~~~~~ 800 (1033) .....-+.|+|| .=|..-|||.|+|--| .+++||+..-|.. T Consensus 126 y~~nGiG~ySFN---------TLyKL~Wlk~h~Pqlfe~m~kfvFISSmithrLTG~f 174 (473) T TIGR02628 126 YRRNGIGDYSFN---------TLYKLRWLKEHKPQLFEKMDKFVFISSMITHRLTGEF 174 (473) T ss_pred HHHCCCCCCHHH---------HHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 886488633156---------8899988632782589887666316565325537510 No 71 >COG1107 Archaea-specific RecJ-like exonuclease, contains DnaJ-type Zn finger domain [DNA replication, recombination, and repair] Probab=83.27 E-value=2.6 Score=21.14 Aligned_cols=51 Identities=12% Similarity=0.214 Sum_probs=30.6 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCH Q ss_conf 89999986249999999999999997500013310147888999999999973026667775100235420 Q 537021.9.peg.7 730 ERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVE 800 (1033) Q Consensus 730 ~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~ 800 (1033) +.+++=+.++||+++..++|-..+ .|..=||+.-.-..-++-+|.. .++.+ T Consensus 518 ~~Ylela~~~gyd~e~L~~ia~av-------------------d~EaFylrf~~gr~ii~DIL~~-~gd~~ 568 (715) T COG1107 518 EQYLELAAERGYDREDLEKIALAV-------------------DYEAFYLRFMDGRGIIADILGT-TGDAD 568 (715) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHH-------------------HHHHHHHHHCCCCHHHHHHHHC-CCCHH T ss_conf 899998876278989999999987-------------------6798886516654079998631-35325 No 72 >cd04323 AsnRS_cyto_like_N AsnRS_cyto_like_N: N-terminal, anticodon recognition domain of the type found in human and Saccharomyces cerevisiae cytoplasmic asparaginyl-tRNA synthetase (AsnRS), in Brugia malayai AsnRs and, in various putative bacterial AsnRSs. This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. The enzymes in this group are homodimeric class2b aminoacyl-tRNA synthetases (aaRSs). aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Eukaryotes contain 2 sets of aaRSs, both of which are encoded by the nuclear genome. One set concerns with cytoplasmic synthesis, whereas the other exclusively with Probab=82.94 E-value=3.3 Score=20.30 Aligned_cols=34 Identities=6% Similarity=0.156 Sum_probs=26.4 Q ss_pred EEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEEC Q ss_conf 899999996112101588876999999849704999953 Q 537021.9.peg.7 994 NIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 994 ~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iF 1032 (1033) +|++.|+|-++|.. + .++|+.|-|.+|.+.+++. T Consensus 1 kV~l~GWV~~~R~~---g--~l~Fi~LRD~~G~iQ~V~~ 34 (84) T cd04323 1 RVKVFGWVHRLRSQ---K--KLMFLVLRDGTGFLQCVLS 34 (84) T ss_pred CEEEEEEEEEEECC---C--CCEEEEEEECCEEEEEEEC T ss_conf 99999989986238---9--9289999508879999988 No 73 >pfam01120 Alpha_L_fucos Alpha-L-fucosidase. Probab=82.33 E-value=3.4 Score=20.13 Aligned_cols=17 Identities=35% Similarity=0.597 Sum_probs=11.0 Q ss_pred HHHHHHHHHHCCCCCCC Q ss_conf 88999998510476433 Q 537021.9.peg.7 359 IVADFIQWAKKNNIPVG 375 (1033) Q Consensus 359 iV~Div~~ak~~gI~vG 375 (1033) ||.+++..||++|+.+| T Consensus 140 iv~el~~A~rk~Glk~G 156 (344) T pfam01120 140 IVGELAKACRKNGIKFG 156 (344) T ss_pred HHHHHHHHHHHCCCEEE T ss_conf 69999999998598679 No 74 >cd03524 RPA2_OBF_family RPA2_OBF_family: A family of oligonucleotide binding (OB) folds with similarity to the OB fold of the single strand (ss) DNA-binding domain (DBD)-D of human RPA2 (also called RPA32). RPA2 is a subunit of Replication protein A (RPA). RPA is a nuclear ssDNA-binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). RPA contains six OB folds, which are involved in ssDNA binding and in trimerization. The ssDNA binding mechanism is believed to be multistep and to involve conformational change. This family also includes OB folds similar to those found in Escherichia coli SSB, the wedge domain of E. coli RecG (a branched-DNA-specific helicase), E. coli ssDNA specific exodeoxyribonuclease VII large subunit, Pyroco Probab=82.20 E-value=3.5 Score=20.09 Aligned_cols=36 Identities=14% Similarity=0.297 Sum_probs=29.3 Q ss_pred EEEEEEEEEEEECCCCCCEEEEEEEECCC-CEEEEEECC Q ss_conf 99999961121015888769999998497-049999539 Q 537021.9.peg.7 996 RLAAMVVSKQQKKTRKGSRIGWVTFSEPA-KEYEANFFP 1033 (1033) Q Consensus 996 ~iaG~V~~i~~~~TKkG~~mafltleD~~-g~~e~~iFP 1033 (1033) ++.|+|+.+..+.+ +..+..+++.|.+ +.+++++|+ T Consensus 1 ~i~g~V~~~~~~~~--~~~~~~~~l~D~t~~~i~v~~~~ 37 (75) T cd03524 1 TIVGIVVAVEEIRT--EGKVLIFTLTDGTGGTIRVTLFG 37 (75) T ss_pred CEEEEEEEEEEEEC--CCEEEEEEEECCCCCEEEEEEEE T ss_conf 97899999842003--86499999976999799999998 No 75 >COG2183 Tex Transcriptional accessory protein [Transcription] Probab=81.93 E-value=0.85 Score=25.24 Aligned_cols=79 Identities=20% Similarity=0.248 Sum_probs=51.5 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHHHH-HCCCCCCHHHHHHHCCCCCCCH Q ss_conf 258999888863685035523342100353198698502685896989999999998-3799999789997348247998 Q 537021.9.peg.7 801 KIKKFCQDARQFNIQIMPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIMEAS-ADKPFDSLEDFCSRVDSKKLNR 879 (1033) Q Consensus 801 ~~~~~i~e~~~~gi~vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r-~~g~f~sl~df~~rv~~~~~nk 879 (1033) +++..+.-+-..=+.-.+.|+|... ---|+.|.|++...+++|++.| ++|+|++-.++.+ .+.+.. T Consensus 483 ~L~~~Ld~vved~VN~VGVdvNtAs----------a~lL~~VsGL~kt~A~nIv~~r~~~g~f~~Rk~L~k---v~rlg~ 549 (780) T COG2183 483 KLAESLDAVVEDCVNAVGVDVNTAS----------ASLLSYVSGLNKTLAKNIVAYRDENGAFDNRKQLKK---VPRLGP 549 (780) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCC----------HHHHHHHHHHCHHHHHHHHHHHHHCCCCCCHHHHHC---CCCCCH T ss_conf 9999999999987310261134278----------999987740256678999997753288440898724---887672 Q ss_pred HHHHHHHHCCCCCCC Q ss_conf 999988867672347 Q 537021.9.peg.7 880 RVLESLVFAGALDCF 894 (1033) Q Consensus 880 ~~le~Li~aGAfD~~ 894 (1033) +.+|.. ||.|.-. T Consensus 550 k~Feq~--aGFLrI~ 562 (780) T COG2183 550 KAFEQC--AGFLRIP 562 (780) T ss_pred HHHHHC--CEEEEEC T ss_conf 236543--1236726 No 76 >PRK06461 single-stranded DNA-binding protein; Reviewed Probab=81.78 E-value=3.6 Score=19.98 Aligned_cols=50 Identities=12% Similarity=0.233 Sum_probs=36.0 Q ss_pred CCCHHHHHCCCCC-CEEEEEEEEEEE---EEECCCCCC-EEEEEEEECCCCEEEEEEC Q ss_conf 9768898505589-889999999611---210158887-6999999849704999953 Q 537021.9.peg.7 980 IKGYEESVATIRG-NNIRLAAMVVSK---QQKKTRKGS-RIGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 980 ~~~~~~~~~~~~g-~~v~iaG~V~~i---~~~~TKkG~-~mafltleD~~g~~e~~iF 1032 (1033) ++.++++ ..| +.|.+.|.|+++ ++..++.|. .++-+.+-|+||.+.+++| T Consensus 4 ~~KI~dL---~pg~~~V~v~~rVl~~~~~r~~~~~~g~~~V~~~~V~DeTG~I~~tlW 58 (130) T PRK06461 4 ITKIKDL---KPGMRRVNVTGRVLEVGEPKVIQTKGGPRTISEAVVGDETGRVKLTLW 58 (130) T ss_pred EEEHHHC---CCCCCCEEEEEEEEECCCCCEEECCCCCEEEEEEEEECCCCEEEEEEE T ss_conf 0589888---899898269999997689727862899668999999879986999994 No 77 >pfam01710 Transposase_14 Transposase. Transposase proteins are necessary for efficient DNA transposition. This family includes insertion sequences from Synechocystis PCC 6803 three of which are characterized as homologous to bacterial IS5- and IS4- and to several members of the IS630-Tc1-mariner superfamily. Probab=81.26 E-value=3.7 Score=19.85 Aligned_cols=85 Identities=16% Similarity=0.244 Sum_probs=51.9 Q ss_pred CCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCH-HHHHCCHHHHH Q ss_conf 65687899999998303802155422077877889999999752799889886543101458982443-44522988788 Q 537021.9.peg.7 422 FCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDPAHPVSL-QAIADDSRFRE 500 (1033) Q Consensus 422 f~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~ 500 (1033) .+-+-|+.|+.++.+=- .++++++.|+++-..+.+|-+......-..... ..--+...|++ T Consensus 3 YS~DlR~rVl~~ve~G~------------------S~~eaA~~F~Is~~Tv~rWlkr~~~~~~~~~~r~~~Kid~~~L~~ 64 (120) T pfam01710 3 YSLDLRKKVIDYIESGG------------------SITEASKVFQIGRATIYRWLKREKLGALKVEPRGRHKIDRDALKN 64 (120) T ss_pred CCHHHHHHHHHHHHCCC------------------CHHHHHHHHCCCHHHHHHHHHCHHCCCCCCCCCCCCCCCHHHHHH T ss_conf 76899999999998699------------------999999993702999999985500147788888988778999999 Q ss_pred HHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEE Q ss_conf 64307420268898720112210024474403 Q 537021.9.peg.7 501 ARLADPSVDRLLEISQKLEGLYRHASTHAAGI 532 (1033) Q Consensus 501 ~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gv 532 (1033) .+..+|+.- +.++|.++ |||+.+| T Consensus 65 ~V~~~pD~t-l~E~A~~~-------gVs~~tI 88 (120) T pfam01710 65 DVKDNPDAY-LKERAKKF-------GVTPSSI 88 (120) T ss_pred HHHHCCCCH-HHHHHHHC-------CCCHHHH T ss_conf 999886720-99999983-------9879999 No 78 >cd04489 ExoVII_LU_OBF ExoVII_LU_OBF: A subfamily of OB folds corresponding to the N-terminal OB-fold domain of Escherichia coli exodeoxyribonuclease VII (ExoVII) large subunit. E. coli ExoVII is composed of two non-identical subunits. E. coli ExoVII is a single-strand-specific exonuclease which degrades ssDNA from both 3-prime and 5-prime ends. ExoVII plays a role in methyl-directed mismatch repair in vivo. ExoVII may also guard the genome from mutagenesis by removing excess ssDNA, since the build up of ssDNA would lead to SOS induction and PolIV-dependent mutagenesis. Probab=81.09 E-value=3.7 Score=19.80 Aligned_cols=36 Identities=6% Similarity=0.170 Sum_probs=29.1 Q ss_pred EEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf 8999999961121015888769999998497049999539 Q 537021.9.peg.7 994 NIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 994 ~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iFP 1033 (1033) .++|-|.|++.+.+ ++| ..+++|.|..+.+.+++|. T Consensus 1 ~vwV~GEIs~~~~~--~sG--h~Yf~LkD~~a~i~~v~~~ 36 (78) T cd04489 1 RVWVEGEISNLKRP--SSG--HLYFTLKDEDASIRCVMWR 36 (78) T ss_pred CEEEEEEECCCEEC--CCC--EEEEEEECCCEEEEEEEEH T ss_conf 98999998855828--996--3999994788199999966 No 79 >PRK12445 lysyl-tRNA synthetase; Reviewed Probab=80.86 E-value=3.8 Score=19.74 Aligned_cols=19 Identities=26% Similarity=0.254 Sum_probs=13.1 Q ss_pred CCHHHHHHHHHHHHHHHCC Q ss_conf 8988999876541034201 Q 537021.9.peg.7 305 KEEENELRNKAVAGLEVRL 323 (1033) Q Consensus 305 ~~~~~~Lr~l~~~Gl~~R~ 323 (1033) .+++-|++.+...|+.+=| T Consensus 238 qSPQLykk~LmvgGfdRvf 256 (505) T PRK12445 238 IAPELYLKRLVVGGFERVF 256 (505) T ss_pred CCCHHHHHHHHHCCCCCEE T ss_conf 6939999999857974457 No 80 >PRK13139 consensus Probab=80.71 E-value=3.8 Score=19.70 Aligned_cols=13 Identities=23% Similarity=0.268 Sum_probs=4.8 Q ss_pred HHHHHHHHHHHCC Q ss_conf 4999999999769 Q 537021.9.peg.7 54 SALEFSQKACAAG 66 (1033) Q Consensus 54 Gav~F~~ackk~g 66 (1033) ...++..+..++| T Consensus 31 ~s~~~~~~l~~~G 43 (254) T PRK13139 31 ANREVIRAMVAGG 43 (254) T ss_pred HHHHHHHHHHHCC T ss_conf 9999999999669 No 81 >PRK13125 trpA tryptophan synthase subunit alpha; Provisional Probab=79.76 E-value=4.1 Score=19.47 Aligned_cols=56 Identities=21% Similarity=0.244 Sum_probs=31.6 Q ss_pred HHHHHHHCCCCCHHHCCCHHHHHHHHCCC------CCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHH Q ss_conf 77655440375633323105676210014------5787533365687899999998303802155422077 Q 537021.9.peg.7 385 VTAYALTITDIDPLRFSLLFERFLNPDRM------SMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFG 450 (1033) Q Consensus 385 LVaY~LgIT~VDPi~~~LlFERFLnp~R~------~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~ 450 (1033) |++|. +|.+|+ .|+|++.-+. =.||.=+|++.+.. -+.-..+++|-+.+..++.-+ T Consensus 82 lM~Y~------N~~~~g--~e~F~~~~~~~GvdGvIipDLP~e~~ee~~--~~~~~~~~~gl~~I~lvsPtt 143 (247) T PRK13125 82 IMTYL------EDYVGS--LDDLLNTAKEVGARGVLFPDLLIDFPDELE--KYVELIRRYGLAPVFFTSPKF 143 (247) T ss_pred EHHHH------HHHHHH--HHHHHHHHHHCCCCEEEECCCCCCCHHHHH--HHHHHHHHCCCCEEEEECCCC T ss_conf 72988------999976--999999999859975883388875467899--999999976984699957998 No 82 >cd04319 PhAsnRS_like_N PhAsnRS_like_N: N-terminal, anticodon recognition domain of the type found in Pyrococcus horikoshii AsnRS asparaginyl-tRNA synthetase (AsnRS). This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. The archeal enzymes in this group are homodimeric class2b aminoacyl-tRNA synthetases (aaRSs). aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Probab=79.62 E-value=4.1 Score=19.44 Aligned_cols=16 Identities=6% Similarity=0.160 Sum_probs=5.5 Q ss_pred EEEEEEECCCCEEEEE Q ss_conf 9999998497049999 Q 537021.9.peg.7 1015 IGWVTFSEPAKEYEAN 1030 (1033) Q Consensus 1015 mafltleD~~g~~e~~ 1030 (1033) ++|+.|-|.+|.+.++ T Consensus 17 i~Fi~LRD~tG~iQ~v 32 (103) T cd04319 17 KAFIVLRDSTGIVQAV 32 (103) T ss_pred EEEEEECCCCEEEEEE T ss_conf 7999982698279999 No 83 >cd04724 Tryptophan_synthase_alpha Ttryptophan synthase (TRPS) alpha subunit (TSA). TPRS is a bifunctional tetrameric enzyme (2 alpha and 2 beta subunits) that catalyzes the last two steps of L-tryptophan biosynthesis. Alpha and beta subunit catalyze two distinct reactions which are both strongly stimulated by the formation of the complex. The alpha subunit catalyzes the cleavage of indole 3-glycerol phosphate (IGP) to indole and d-glyceraldehyde 3-phosphate (G3P). Indole is then channeled to the active site of the beta subunit, a PLP-dependent enzyme that catalyzes a replacement reaction to convert L-serine into L-tryptophan. Probab=79.35 E-value=4.2 Score=19.38 Aligned_cols=35 Identities=20% Similarity=0.249 Sum_probs=17.8 Q ss_pred CCCEEEEECCCHHH--HHHHHHHHHHHCCCCEEEEEE Q ss_conf 99889981165177--499999999976996389999 Q 537021.9.peg.7 40 QQPAIAITDTNNLF--SALEFSQKACAAGIQPIIGCQ 74 (1033) Q Consensus 40 G~~AiAITD~~nl~--Gav~F~~ackk~gIKPIiG~E 74 (1033) +.|-+.+|-.|.++ |.-+|.+.|+++|+.-++=.+ T Consensus 76 ~~pivlM~Y~N~i~~~G~e~F~~~~~~~Gv~GviipD 112 (242) T cd04724 76 TIPIVLMGYYNPILQYGLERFLRDAKEAGVDGLIIPD 112 (242) T ss_pred CCCEEEEEECHHHHHHCHHHHHHHHHHCCCCEEEECC T ss_conf 9888999844576652899999999975997587069 No 84 >TIGR01430 aden_deam adenosine deaminase; InterPro: IPR006330 This family includes the experimentally verified adenosine deaminases of mammals and Escherichia coli. Other members of this family are predicted also to be adenosine deaminase, an enzyme of nucleotide degradation. This family is distantly related to AMP deaminase. ; GO: 0004000 adenosine deaminase activity, 0009168 purine ribonucleoside monophosphate biosynthetic process. Probab=78.98 E-value=1.2 Score=23.85 Aligned_cols=180 Identities=21% Similarity=0.191 Sum_probs=91.4 Q ss_pred HHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCC Q ss_conf 33433144476443299899999999789988998116517749999999997699638999999854766433445656 Q 537021.9.peg.7 12 VHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKT 91 (1033) Q Consensus 12 vhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~ 91 (1033) +||||| |+|+++|+-+...|+++|.+ +. |.-++-=...++.++.. ..+-...-+ T Consensus 4 ~ELH~H-----leG~~~P~l~l~la~k~gi~-LP--~~~~~~e~~~~~~~~~~--------------L~~FL~~yd---- 57 (346) T TIGR01430 4 AELHLH-----LEGSLRPELLLELAQKNGIP-LP--DLQSLEELKEAYEKFRD--------------LQDFLAKYD---- 57 (346) T ss_pred CCCCCC-----CCCCCCHHHHHHHHHHCCCC-CC--CCCCCCHHHHHHCCCCC--------------HHHHHHHHH---- T ss_conf 310157-----43255678899888643898-88--63121002321112678--------------789999999---- Q ss_pred CCCCCCEEEEEECCHHHHHHH-HHHHHHHHHCCCCCCCCCCCHHHHHHHCCC--------CEEEEC--------CCCC-- Q ss_conf 666775399995696899999-999989973248778740078999863489--------889981--------9767-- Q 537021.9.peg.7 92 PFPTVPSIVLLVSTAEGYQRL-IELVSRMYLLDQGKQSVRICLSWLQEIGTE--------GLIMLT--------GGSS-- 152 (1033) Q Consensus 92 ~~~~~~~lvlLAkN~~Gy~nL-~kL~S~a~~~~~~~~~p~i~~~~L~e~~~e--------gLIvls--------g~~~-- 152 (1033) +-+.+|.-.++-++.| ...++....++-.|-.-+.++....+.+.. |=+.+. |+.. T Consensus 58 -----~~~~~L~G~~~d~~rlAy~~~~~~a~dG~~y~Ev~fdP~~~t~~g~~PiPWnqaeGdlt~~~vv~~~~~g~~~a~ 132 (346) T TIGR01430 58 -----FGVEVLRGTEDDFKRLAYEYVEKAAKDGVVYAEVFFDPQLHTERGISPIPWNQAEGDLTPETVVEAVLDGLDEAE 132 (346) T ss_pred -----HHHHHHCCCHHHHHHHHHHHHHHHHHCCCEEEEEEECHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf -----998874289989999999999999865964899960869998506898874446676567899999999999767 Q ss_pred ---CHHHHHHH----CCCHHHHHHHHHHHHHHCCCCEEEE--EECCC-HHH--HHHHHHHHHHHH--HHCCCEEEECCCC Q ss_conf ---96788887----7997899999999987659868999--94399-115--899999999999--8189899808830 Q 537021.9.peg.7 153 ---GPIDRAFF----FNSSQVAEKRLLTFKKLFGDRLYVN--LQRHR-GYD--RYRESQVVQLAY--THELPLVATNNSL 218 (1033) Q Consensus 153 ---g~i~~~l~----~~~~~~a~~~l~~l~~~Fgd~~ylE--l~~~~-~~e--~~~~~~l~~lA~--~~~iPlVaTndv~ 218 (1033) |.-+++|. .-+.+.+++.+..-...+++.--++ |-... ... -....+..+.|+ +.|+++.+-.+.| T Consensus 133 ~dfGi~~~~I~c~~R~~~~~~~~~~~~~a~~~~~~~~ivg~gLagdE~~~ps~~~~F~~~f~~Arsl~~Gl~~T~HAGlh 212 (346) T TIGR01430 133 RDFGIKSRLILCGLRHVQPEAAEETLELALPFYKEQGIVGFGLAGDERGGPSSPEKFVRAFAIARSLELGLKLTVHAGLH 212 (346) T ss_pred HHHCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEECCCC T ss_conf 75164541344421356878999999999886347856898526546688988778999999987651698356303753 Q ss_pred CCCH Q ss_conf 0787 Q 537021.9.peg.7 219 FLSE 222 (1033) Q Consensus 219 Y~~~ 222 (1033) =..+ T Consensus 213 E~~g 216 (346) T TIGR01430 213 ELGG 216 (346) T ss_pred CCCC T ss_conf 4577 No 85 >PRK13123 consensus Probab=78.31 E-value=4.5 Score=19.14 Aligned_cols=40 Identities=23% Similarity=0.173 Sum_probs=19.7 Q ss_pred HHHHHHHHHHCCCCEEE----EECCCHHHHHHHHHHHHHHCCCCE Q ss_conf 89999999978998899----811651774999999999769963 Q 537021.9.peg.7 29 LNNILDKIAADQQPAIA----ITDTNNLFSALEFSQKACAAGIQP 69 (1033) Q Consensus 29 ~e~LV~~A~~~G~~AiA----ITD~~nl~Gav~F~~ackk~gIKP 69 (1033) +++-.+++++.|-+|+- .-|. ++-...++..+..+.|+-- T Consensus 2 i~~~f~~~~~~~~~ali~yitaG~P-~~~~~~~~i~~l~~~GaDi 45 (256) T PRK13123 2 LTEKLKAIKENGKGAFVPYIMAGDG-GLDGLKETIRFLEEAGVSA 45 (256) T ss_pred HHHHHHHHHHCCCCEEEEEECCCCC-CHHHHHHHHHHHHHCCCCE T ss_conf 7999999998699648888618689-9789999999999769999 No 86 >TIGR02237 recomb_radB DNA repair and recombination protein RadB; InterPro: IPR011939 This family consists exclusively of archaeal RadB protein, a homolog of bacterial RecA, eukaryotic RAD51 (IPR011941 from INTERPRO) and DMC1 (IPR011940 from INTERPRO), and archaeal RadA (IPR011938 from INTERPRO) ,, .; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0008094 DNA-dependent ATPase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=78.22 E-value=2.4 Score=21.40 Aligned_cols=27 Identities=22% Similarity=0.262 Sum_probs=22.0 Q ss_pred HHHHHHHHHHCCCEEEECCCCCCCHHH Q ss_conf 999999998189899808830078788 Q 537021.9.peg.7 198 SQVVQLAYTHELPLVATNNSLFLSEED 224 (1033) Q Consensus 198 ~~l~~lA~~~~iPlVaTndv~Y~~~~D 224 (1033) ..|+.||++.+||+|.||.||+---+| T Consensus 139 ~~Ll~lArk~~~AVviTNQvy~d~~~g 165 (223) T TIGR02237 139 TLLLSLARKKDLAVVITNQVYTDVNNG 165 (223) T ss_pred HHHHHHHHHCCCCEEEEEEEEEEECCC T ss_conf 999999876499789971158863779 No 87 >cd04100 Asp_Lys_Asn_RS_N Asp_Lys_Asn_RS_N: N-terminal, anticodon recognition domain of class 2b aminoacyl-tRNA synthetases (aaRSs). This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. Class 2b aaRSs include the homodimeric aspartyl-, asparaginyl-, and lysyl-tRNA synthetases (AspRS, AsnRS, and LysRS). aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Eukaryotes contain 2 sets of aaRSs, both of which are encoded by the nuclear genome. One set concerns with cytoplasmic protein synthesis, whereas the other exclusively with mitochondrial protein synthesis. Included in this group are archeal and archeal-like A Probab=77.56 E-value=4.7 Score=18.98 Aligned_cols=33 Identities=15% Similarity=0.303 Sum_probs=23.6 Q ss_pred EEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEE Q ss_conf 89999999611210158887699999984970499995 Q 537021.9.peg.7 994 NIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANF 1031 (1033) Q Consensus 994 ~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~i 1031 (1033) .|+++|.|.++|.. |+ ++|+.|-|.+|.+.+++ T Consensus 1 ~V~v~Gwv~~~R~~----g~-~~Fi~lrD~~g~iQvv~ 33 (85) T cd04100 1 EVTLAGWVHSRRDH----GG-LIFIDLRDGSGIVQVVV 33 (85) T ss_pred CEEEEEEEEEEECC----CC-EEEEEEECCCCEEEEEE T ss_conf 99999987904328----98-89958503895089999 No 88 >PRK13129 consensus Probab=77.55 E-value=4.7 Score=18.98 Aligned_cols=43 Identities=9% Similarity=0.138 Sum_probs=23.1 Q ss_pred CCCHHHHHHHHHHCCCCEEE--EE-CCCHHHHHHHHHHHHHHCCCC Q ss_conf 29989999999978998899--81-165177499999999976996 Q 537021.9.peg.7 26 ALSLNNILDKIAADQQPAIA--IT-DTNNLFSALEFSQKACAAGIQ 68 (1033) Q Consensus 26 ~~~~e~LV~~A~~~G~~AiA--IT-D~~nl~Gav~F~~ackk~gIK 68 (1033) ++++++..+++++.|-+|+- +| -.-++-...++..+..++|.- T Consensus 3 m~ri~~~F~~~k~~~~~ali~yitaG~P~~e~s~~~~~~l~~~GaD 48 (267) T PRK13129 3 MSAISECFEQLKKEGRCALMPFLMAGDPDLETTAEALLILQENGAD 48 (267) T ss_pred HHHHHHHHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHHHHHCCCC T ss_conf 2599999999997699558988707189989999999999977999 No 89 >PRK13117 consensus Probab=77.15 E-value=4.8 Score=18.89 Aligned_cols=44 Identities=14% Similarity=0.215 Sum_probs=29.0 Q ss_pred CCCHHHHHHHHHHCCCCEEE----EECCCHHHHHHHHHHHHHHCCCCEE Q ss_conf 29989999999978998899----8116517749999999997699638 Q 537021.9.peg.7 26 ALSLNNILDKIAADQQPAIA----ITDTNNLFSALEFSQKACAAGIQPI 70 (1033) Q Consensus 26 ~~~~e~LV~~A~~~G~~AiA----ITD~~nl~Gav~F~~ackk~gIKPI 70 (1033) ++++++..+++++.|-+|+- .-| -++-...++..+..+.|.--| T Consensus 1 Mnri~~~F~~~k~~~~~ali~yitaG~-P~~~~t~~~~~~l~~~GaDii 48 (268) T PRK13117 1 MNRYQTLFAQLKAKKEGAFVPFVTLGD-PSPELSLKIIDTLIEAGADAL 48 (268) T ss_pred CCHHHHHHHHHHHCCCCEEEEEECCCC-CCHHHHHHHHHHHHHCCCCEE T ss_conf 964899999999759914898872708-997999999999996699989 No 90 >pfam04157 EAP30 EAP30/Vps36 family. This family includes EAP30 as well as the Vps36 protein. Vps36 is involved in Golgi to endosome trafficking. EAP30 is a subunit of the ELL complex. The ELL is an 80-kDa RNA polymerase II transcription factor. ELL interacts with three other proteins to form the complex known as ELL complex. The ELL complex is capable of increasing that catalytic rate of transcription elongation, but is unable to repress initiation of transcription by RNA polymerase II as is the case of ELL. EAP30 is thought to lead to the derepression of ELL's transcriptional inhibitory activity. Probab=76.76 E-value=4.9 Score=18.82 Aligned_cols=42 Identities=14% Similarity=0.147 Sum_probs=24.7 Q ss_pred HHHHHHHHHHCCCCCCHHHHHHHCCCC-CCCHHHHHHHHHCCC Q ss_conf 999999998379999978999734824-799899998886767 Q 537021.9.peg.7 849 TARHIMEASADKPFDSLEDFCSRVDSK-KLNRRVLESLVFAGA 890 (1033) Q Consensus 849 ~~~~Ive~r~~g~f~sl~df~~rv~~~-~~nk~~le~Li~aGA 890 (1033) ....|++.....++.+..++....++. .+-+..|+.+++.|- T Consensus 173 ~~~~il~~~~~~~~vt~~~l~~~l~ws~~~a~e~L~~~~~~G~ 215 (219) T pfam04157 173 DQTKILELAEELGGVTASELAEKLGWSIGRAKEVLEKAEKEGL 215 (219) T ss_pred HHHHHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCC T ss_conf 9999999997279948999999979889999999999997499 No 91 >KOG3108 consensus Probab=76.71 E-value=4.9 Score=18.80 Aligned_cols=14 Identities=29% Similarity=0.050 Sum_probs=4.7 Q ss_pred CCCCCCCHHHHHHH Q ss_conf 88300787889999 Q 537021.9.peg.7 215 NNSLFLSEEDYEAH 228 (1033) Q Consensus 215 ndv~Y~~~~D~~~h 228 (1033) |+|+++.-|=-.+| T Consensus 149 Nevt~h~LE~i~~h 162 (265) T KOG3108 149 NEVTTHFLEVINAH 162 (265) T ss_pred CCEEEEEEHHHHHH T ss_conf 72257731457899 No 92 >PRK02515 psbU photosystem II complex extrinsic protein precursor U; Provisional Probab=75.97 E-value=3.5 Score=20.00 Aligned_cols=56 Identities=29% Similarity=0.387 Sum_probs=40.2 Q ss_pred HCCCCCCCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHH Q ss_conf 636850355233421003531986985026858969899999999983799999789997348247998999988 Q 537021.9.peg.7 811 QFNIQIMPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESL 885 (1033) Q Consensus 811 ~~gi~vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~L 885 (1033) ..|=+| |+|-+.. -.|...+|+....+..|| .||||.|.+|.+.- ..+..++-+.| T Consensus 60 ~~g~KI---DlNNa~v----------r~f~q~pGmYPtlA~kIv---~~~PY~sVeDvl~i---pgLse~qK~~l 115 (144) T PRK02515 60 ERGEKI---DLNNSSV----------RAFRQFPGMYPTLAGKIV---KNAPYDSVEDVLNL---PGLSERQKELL 115 (144) T ss_pred HHCCCC---CCCCHHH----------HHHHHCCCCCHHHHHHHH---HCCCCCCHHHHHHC---CCCCHHHHHHH T ss_conf 715640---2562749----------999868884679999998---47997879999717---89999999999 No 93 >COG1200 RecG RecG-like helicase [DNA replication, recombination, and repair / Transcription] Probab=75.93 E-value=5.1 Score=18.65 Aligned_cols=110 Identities=18% Similarity=0.237 Sum_probs=59.5 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCC------CCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHH Q ss_conf 99997302666777510023542025899988886368503------552334210035319869850268589698999 Q 537021.9.peg.7 777 AWMKTHYPVEFLAASMTLEMDNVEKIKKFCQDARQFNIQIM------PPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTA 850 (1033) Q Consensus 777 aylK~~yP~eF~aa~L~~~~~~~~~~~~~i~e~~~~gi~vl------~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~ 850 (1033) ++|+.+|| ++-.+++-..+.+.+| ...+.+-+...+.|| -.-||-..+..-+-+++=||||+++-.+...+- T Consensus 499 ~~L~~~~~-~~~vgL~HGrm~~~eK-d~vM~~Fk~~e~~ILVaTTVIEVGVdVPnATvMVIe~AERFGLaQLHQLRGRVG 576 (677) T COG1200 499 EELKSFLP-ELKVGLVHGRMKPAEK-DAVMEAFKEGEIDILVATTVIEVGVDVPNATVMVIENAERFGLAQLHQLRGRVG 576 (677) T ss_pred HHHHHHCC-CCEEEEEECCCCHHHH-HHHHHHHHCCCCCEEEEEEEEEECCCCCCCEEEEEECHHHHHHHHHHHHCCCCC T ss_conf 99998705-4636777568986779-999999980887689981389952357887079996543303788887526557 Q ss_pred HHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHH-HHCCCCCCCCCCHH Q ss_conf 99999983799999789997348247998999988-86767234790389 Q 537021.9.peg.7 851 RHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESL-VFAGALDCFGYSRM 899 (1033) Q Consensus 851 ~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~L-i~aGAfD~~~~~R~ 899 (1033) .|.-.|+ |--+......+..-+.| |....-|.|-.... T Consensus 577 --------RG~~qSy---C~Ll~~~~~~~~a~~RL~im~~t~DGF~IAE~ 615 (677) T COG1200 577 --------RGDLQSY---CVLLYKPPLSEVAKQRLKIMRETTDGFVIAEE 615 (677) T ss_pred --------CCCCCEE---EEEEECCCCCHHHHHHHHHHHHCCCCCEEHHH T ss_conf --------8875448---99996798775689999988744876010465 No 94 >COG1031 Uncharacterized Fe-S oxidoreductase [Energy production and conversion] Probab=74.51 E-value=2.5 Score=21.27 Aligned_cols=34 Identities=41% Similarity=0.675 Sum_probs=29.7 Q ss_pred HHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCC Q ss_conf 2685896989999999998379999978999734824 Q 537021.9.peg.7 839 LAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSK 875 (1033) Q Consensus 839 l~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~ 875 (1033) |.+|.|||.+++..|+.. .||+|.++|++.+... T Consensus 518 l~~ipgig~~~~~~I~~~---Rp~~s~e~~l~~v~~~ 551 (560) T COG1031 518 LRAIPGIGKKTLRKILAE---RPFKSSEEFLKLVPDG 551 (560) T ss_pred HHHCCCCHHHHHHHHHHC---CCCCCHHHHHHCCCCC T ss_conf 986656116667778743---8865548887305841 No 95 >PRK13121 consensus Probab=74.47 E-value=5.5 Score=18.37 Aligned_cols=43 Identities=14% Similarity=0.203 Sum_probs=29.3 Q ss_pred CCCHHHHHHHHHHCCCCEEE----EECCCHHHHHHHHHHHHHHCCCCE Q ss_conf 29989999999978998899----811651774999999999769963 Q 537021.9.peg.7 26 ALSLNNILDKIAADQQPAIA----ITDTNNLFSALEFSQKACAAGIQP 69 (1033) Q Consensus 26 ~~~~e~LV~~A~~~G~~AiA----ITD~~nl~Gav~F~~ackk~gIKP 69 (1033) ++++++..+.+++.|-+|+- .-| -++--..++..+..+.|.-- T Consensus 1 M~ri~~~f~~~k~~~~~ali~y~taG~-P~~~~s~~~~~~l~~~GaDi 47 (265) T PRK13121 1 MSRIQQTFAALAAQGRKALIPFITAGD-PDPAKTVELMHALVEGGADV 47 (265) T ss_pred CCHHHHHHHHHHHCCCCEEEEEEECCC-CCHHHHHHHHHHHHHCCCCE T ss_conf 975899999999769955898870718-99899999999999769999 No 96 >PRK13116 consensus Probab=74.42 E-value=5.6 Score=18.36 Aligned_cols=44 Identities=14% Similarity=0.242 Sum_probs=27.3 Q ss_pred CCCHHHHHHHHHHCCCCEEEE----ECCCHHHHHHHHHHHHHHCCCCEE Q ss_conf 299899999999789988998----116517749999999997699638 Q 537021.9.peg.7 26 ALSLNNILDKIAADQQPAIAI----TDTNNLFSALEFSQKACAAGIQPI 70 (1033) Q Consensus 26 ~~~~e~LV~~A~~~G~~AiAI----TD~~nl~Gav~F~~ackk~gIKPI 70 (1033) ++++++..+.+++.|-+|+.- -|. ++-...++..+..++|.--| T Consensus 1 Msri~~~F~~lk~~~~~alI~yitaG~P-~~~~s~~~l~~l~~~GaDii 48 (278) T PRK13116 1 MSRYDDLFARLDTAGEGAFVPFIMLSDP-SPEEAFQIISTAIEAGADAL 48 (278) T ss_pred CCHHHHHHHHHHHCCCCEEEEEECCCCC-CHHHHHHHHHHHHHCCCCEE T ss_conf 9628999999997599358988548489-98999999999996699999 No 97 >PTZ00124 adenosine deaminase; Provisional Probab=74.33 E-value=1.7 Score=22.62 Aligned_cols=31 Identities=23% Similarity=0.370 Sum_probs=22.0 Q ss_pred HHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEEC Q ss_conf 3343314447644329989999999978998899811 Q 537021.9.peg.7 12 VHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITD 48 (1033) Q Consensus 12 vhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD 48 (1033) |+|||| |+|+++|+-+.+.|+++|.+ ...++ T Consensus 38 vELH~H-----LdGslrp~tl~eLA~~~~i~-~~~~~ 68 (362) T PTZ00124 38 CELHCH-----LDICFSVDFFIDIIRKYNEQ-PNLSD 68 (362) T ss_pred HHHHCC-----CCCCCCHHHHHHHHHHHCCC-CCCCH T ss_conf 787707-----62888999999999981989-99996 No 98 >PRK13901 ruvA Holliday junction DNA helicase motor protein; Provisional Probab=73.63 E-value=5.8 Score=18.22 Aligned_cols=61 Identities=16% Similarity=0.336 Sum_probs=35.6 Q ss_pred EHHHHCCCCHHHHHHHH-HHHHCCCCCCHHHHHHHCCCCCC-CHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 02685896989999999-99837999997899973482479-989999888676723479038999998999999 Q 537021.9.peg.7 838 SLAAIKGVGTTTARHIM-EASADKPFDSLEDFCSRVDSKKL-NRRVLESLVFAGALDCFGYSRMQLLQSLDNIQK 910 (1033) Q Consensus 838 Gl~~Ikgvg~~~~~~Iv-e~r~~g~f~sl~df~~rv~~~~~-nk~~le~Li~aGAfD~~~~~R~~ll~~i~~~~~ 910 (1033) .|..++|||.++|+.|+ |-+. +...... ...... .....++ +-++|.+|.+.-..+.++.. T Consensus 108 ~L~~vpGIG~KtA~rIi~ELk~--Kl~~~~~----~~~~~~~~~e~~~A------L~~LGy~~~~a~~al~~i~~ 170 (196) T PRK13901 108 LISKVKGIGNKMAGKIFLKLRG--KLVKNDE----LESSLFKFKELEQS------IVNMGFDRKLVNSAIKEIML 170 (196) T ss_pred HHHHCCCCCHHHHHHHHHHHHH--HHCCCCC----CCCCCCCHHHHHHH------HHHCCCCHHHHHHHHHHHHH T ss_conf 9831999589999999999976--5315665----56553448999999------99849998999999999985 No 99 >cd06546 GH18_CTS3_chitinase GH18 domain of CTS3 (chitinase 3), an uncharacterized protein from the human fungal pathogen Coccidioides posadasii. CTS3 has a chitinase-like glycosyl hydrolase family 18 (GH18) domain; and has homologs in bacteria as well as fungi. Probab=73.27 E-value=5.9 Score=18.15 Aligned_cols=20 Identities=10% Similarity=0.076 Sum_probs=10.6 Q ss_pred HHHHHHHHHHHHHHCCCCCC Q ss_conf 55788899999851047643 Q 537021.9.peg.7 355 GYFLIVADFIQWAKKNNIPV 374 (1033) Q Consensus 355 ~YFLiV~Div~~ak~~gI~v 374 (1033) .+|=.+|+=++-++..||-| T Consensus 56 ~~f~~lW~e~~~l~~~GikV 75 (256) T cd06546 56 PRFTTLWTELAILQSSGVKV 75 (256) T ss_pred HHHHHHHHHHHHHHHCCCEE T ss_conf 56899999999999779979 No 100 >KOG3416 consensus Probab=73.01 E-value=5.9 Score=18.11 Aligned_cols=34 Identities=12% Similarity=0.272 Sum_probs=23.5 Q ss_pred EEEEE-EEEECCCCCCEEEEEEEECCCCEEEEEEC Q ss_conf 99961-12101588876999999849704999953 Q 537021.9.peg.7 999 AMVVS-KQQKKTRKGSRIGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 999 G~V~~-i~~~~TKkG~~mafltleD~~g~~e~~iF 1032 (1033) .+|.. -...+||.|+.+....+-|+||.+++-+| T Consensus 21 fIvl~~g~~tkTkdg~~v~~~kVaD~TgsI~isvW 55 (134) T KOG3416 21 FIVLEYGRATKTKDGHEVRSCKVADETGSINISVW 55 (134) T ss_pred EEEEEECEEEECCCCCEEEEEEEECCCCEEEEEEE T ss_conf 99986140301368988889998066552899983 No 101 >COG0159 TrpA Tryptophan synthase alpha chain [Amino acid transport and metabolism] Probab=72.93 E-value=6 Score=18.09 Aligned_cols=38 Identities=24% Similarity=0.410 Sum_probs=15.4 Q ss_pred HHHHHHHHHHCC----CCEEEEECCCHHHHHHHHHHHHHHCCC Q ss_conf 899999999789----988998116517749999999997699 Q 537021.9.peg.7 29 LNNILDKIAADQ----QPAIAITDTNNLFSALEFSQKACAAGI 67 (1033) Q Consensus 29 ~e~LV~~A~~~G----~~AiAITD~~nl~Gav~F~~ackk~gI 67 (1033) +++..+.++..+ .+.+-+-|. ++-...++.+...+.|. T Consensus 4 ~~~~F~~l~~~~~~a~i~yit~GdP-~~e~s~e~i~~L~~~Ga 45 (265) T COG0159 4 LDQKFAQLKAENRGALIPYVTAGDP-DLETSLEIIKTLVEAGA 45 (265) T ss_pred HHHHHHHHHHHCCCCEEEEEECCCC-CHHHHHHHHHHHHHCCC T ss_conf 8999999997378974888948899-98999999999986798 No 102 >TIGR03632 bact_S11 30S ribosomal protein S11. This model describes the bacterial 30S ribosomal protein S11. Cutoffs are set such that the model excludes archaeal and eukaryotic ribosomal proteins, but many chloroplast and mitochondrial equivalents of S11 are detected. Probab=72.78 E-value=2.4 Score=21.41 Aligned_cols=43 Identities=26% Similarity=0.358 Sum_probs=24.6 Q ss_pred HHHHHHHHHHCCCC--------CCCCCCCCHHHHHHHH---HHHCCCCCHHHCC Q ss_conf 88999998510476--------4334444204667765---5440375633323 Q 537021.9.peg.7 359 IVADFIQWAKKNNI--------PVGPGRGSGAGSVTAY---ALTITDIDPLRFS 401 (1033) Q Consensus 359 iV~Div~~ak~~gI--------~vGpGRGSAagSLVaY---~LgIT~VDPi~~~ 401 (1033) +...+.+.|++.|| -.||||-||--.|..- .+-|+|+-||.|| T Consensus 48 aa~~~~~~~~~~gi~~v~V~ikG~G~Gr~~air~l~~~glkI~~I~D~Tpiphn 101 (108) T TIGR03632 48 AAEDAAKKAKEFGMKTVDVYVKGPGAGRESAIRALQAAGLEVTSIKDVTPIPHN 101 (108) T ss_pred HHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHCCCEEEEEEECCCCCCC T ss_conf 999999999970961999999689974799999999889889999988999989 No 103 >pfam09547 Spore_IV_A Stage IV sporulation protein A (spore_IV_A). SpoIVA is designated stage IV sporulation protein A. It acts in the mother cell compartment and plays a role in spore coat morphogenesis. A comparative genome analysis of all sequenced genomes of Firmicutes shows that the proteins are strictly conserved among the sub-set of endospore-forming species. Probab=72.65 E-value=6.1 Score=18.04 Aligned_cols=144 Identities=24% Similarity=0.340 Sum_probs=79.3 Q ss_pred EEEEECCHHHHHHH--HHHHHHHHHCCCCCCCCCCCHHHHHHHC---------CCCEEEECCCCCCHHHHHHHCCCHHHH Q ss_conf 99995696899999--9999899732487787400789998634---------898899819767967888877997899 Q 537021.9.peg.7 99 IVLLVSTAEGYQRL--IELVSRMYLLDQGKQSVRICLSWLQEIG---------TEGLIMLTGGSSGPIDRAFFFNSSQVA 167 (1033) Q Consensus 99 lvlLAkN~~Gy~nL--~kL~S~a~~~~~~~~~p~i~~~~L~e~~---------~egLIvlsg~~~g~i~~~l~~~~~~~a 167 (1033) +-...+...||..= -|+++--|.+ -.|+++.-++.. +=||+++|-|.-+.+++- +|-+| T Consensus 99 VGY~V~gA~Gy~e~~~~RmV~TPWfd------~eIPF~eAAeiGT~KVI~dHSTIGiVVTTDGSi~dIpRe----~Y~eA 168 (492) T pfam09547 99 VGYIVEGALGYEEEDGPRMVKTPWFD------EEIPFEEAAEIGTRKVITEHSTIGLVVTTDGSITDIPRE----NYVEA 168 (492) T ss_pred CCEEECCCCCCCCCCCCEEECCCCCC------CCCCHHHHHHHCCCCEECCCCCEEEEEECCCCCCCCCHH----HHHHH T ss_conf 25675575555468975130489878------887868887646541331577144999568875676644----42357 Q ss_pred HH-HHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCC--- Q ss_conf 99-99999876598689999439911589999999999981898998088300787889999876688469857653--- Q 537021.9.peg.7 168 EK-RLLTFKKLFGDRLYVNLQRHRGYDRYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQK--- 243 (1033) Q Consensus 168 ~~-~l~~l~~~Fgd~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~--- 243 (1033) ++ .+..|+++ |.-|-+=|....+..+....-.-+|.++|++|+++.|=. -+..+ +.+.+|..+=---++.+. T Consensus 169 EervI~ELk~i-~KPFiillNs~~P~s~et~~L~~eL~ekY~v~Vl~vnc~-~m~~~--DI~~Il~~vLyEFPV~Ei~~~ 244 (492) T pfam09547 169 EERVIEELKEI-GKPFIILLNSQRPYSEETKELRDELEEKYDVPVLPVNCA-QMTEE--DITEILEEVLYEFPVKEININ 244 (492) T ss_pred HHHHHHHHHHH-CCCEEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEHH-HCCHH--HHHHHHHHHHHCCCCEEEEEE T ss_conf 99999999970-998899983899899899999999999859957996779-75999--999999998633882499862 Q ss_pred --CCC-CCCCCCCCCC Q ss_conf --100-2453432343 Q 537021.9.peg.7 244 --DRP-RVTPDHYLKN 256 (1033) Q Consensus 244 --~~~-~~~~~~yLks 256 (1033) .+. ....+||||. T Consensus 245 lP~Wve~L~~~Hwlk~ 260 (492) T pfam09547 245 LPKWVEELETEHWLKQ 260 (492) T ss_pred CCHHHHHCCCCCHHHH T ss_conf 7368863388764779 No 104 >pfam10391 DNA_pol_lambd_f Fingers domain of DNA polymerase lambda. DNA polymerases catalyse the addition of dNMPs onto the 3-prime ends of DNA chains. There is a general polymerase fold consisting of three subdomains that have been likened to the fingers, palm, and thumb of a right hand. DNA_pol_lambd_f is the central three-helical region of DNA polymerase lambda referred to as the F and G helices of the fingers domain. Contacts with DNA involve this conserved helix-hairpin-helix motif in the fingers region which interacts with the primer strand. This motif is common to several DNA binding proteins and confers a sequence-independent interaction with the DNA backbone. Probab=72.58 E-value=6.1 Score=18.03 Aligned_cols=41 Identities=32% Similarity=0.422 Sum_probs=30.2 Q ss_pred HHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHH Q ss_conf 26858969899999999983799999789997348247998999988 Q 537021.9.peg.7 839 LAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESL 885 (1033) Q Consensus 839 l~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~L 885 (1033) |..|=|||.+.|+..++ .| +++++|+-. +...+|..|.-.| T Consensus 4 f~~I~GvGp~~A~~~~~---~G-~~ti~dL~~--~~~~L~~~Q~iGL 44 (52) T pfam10391 4 FTNIWGVGPKTARKWYR---QG-IRTLEDLRE--NKASLTRAQQIGL 44 (52) T ss_pred HHHCCCCCHHHHHHHHH---HC-CCCHHHHHH--CCCCCCHHHHHHH T ss_conf 66354406999999999---47-888999974--6310899999999 No 105 >PRK13131 consensus Probab=72.08 E-value=6.2 Score=17.94 Aligned_cols=21 Identities=14% Similarity=0.105 Sum_probs=7.5 Q ss_pred HHHHHHHHHHHHHCCCCCEEEC Q ss_conf 8789999999830380215542 Q 537021.9.peg.7 425 DRRDEVIRYVQNKYGHERVAQI 446 (1033) Q Consensus 425 ~~R~~vi~y~~~kyG~~~va~i 446 (1033) +..+++.+. .+++|-+.+..+ T Consensus 127 eE~~~~~~~-~~~~~l~~I~lv 147 (257) T PRK13131 127 IEKELVIKS-AQKHQIKQIFIA 147 (257) T ss_pred HHHHHHHHH-HHHCCCCEEEEE T ss_conf 788999999-997798479972 No 106 >COG2766 PrkA Putative Ser protein kinase [Signal transduction mechanisms] Probab=71.96 E-value=6.2 Score=17.92 Aligned_cols=46 Identities=20% Similarity=0.328 Sum_probs=21.4 Q ss_pred HHHHHH-HHHCCCCCHH----HHCCCHHHHHHHHHHCCCHHH--HHHHHHHHC Q ss_conf 588998-7620141012----211453478888771740488--887665411 Q 537021.9.peg.7 605 DHETYH-LLTTKGTLGI----FQLESSGMRQALEGMQPDCIE--DIIALVSLY 650 (1033) Q Consensus 605 D~~~~~-l~~~g~t~Gv----FQ~es~~~~~~l~~~~p~~~~--dl~~~~al~ 650 (1033) ..+.|+ |+..-+..-+ --++-.+|..++.++++..=. +|..-..+| T Consensus 339 E~kIYEKll~~s~ls~~h~APhTL~~aa~FsVlTRLkep~~~~~~ll~Kmr~Y 391 (649) T COG2766 339 EAKIYEKLLQNSELSDAHCAPHTLEVAALFSVLTRLKEPDNSDIDLLSKMRLY 391 (649) T ss_pred HHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH T ss_conf 88999998254666656658058999998888862368665407799999986 No 107 >PRK13137 consensus Probab=71.85 E-value=6.3 Score=17.91 Aligned_cols=39 Identities=13% Similarity=0.148 Sum_probs=21.8 Q ss_pred HHCCCHHHHHHHHHHCCCCEEEE----ECCCHHHHHHHHHHHHH Q ss_conf 43299899999999789988998----11651774999999999 Q 537021.9.peg.7 24 EGALSLNNILDKIAADQQPAIAI----TDTNNLFSALEFSQKAC 63 (1033) Q Consensus 24 dg~~~~e~LV~~A~~~G~~AiAI----TD~~nl~Gav~F~~ack 63 (1033) .|++++++..+.+++.|-+|+-- -|. ++-...+...+.. T Consensus 7 ~~m~Ri~~~F~~~k~~~~~ali~yitaG~P-~~~~s~~~~~~l~ 49 (266) T PRK13137 7 RGVQRLHAAFARARAEGRAAFIPFLTAGYP-SAAGFLQVAEELL 49 (266) T ss_pred HHHHHHHHHHHHHHHCCCCEEEEEECCCCC-CHHHHHHHHHHHH T ss_conf 048899999999997699538878668188-8789999999997 No 108 >cd01990 Alpha_ANH_like_I This is a subfamily of Adenine nucleotide alpha hydrolases superfamily. Adenine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins probably binds ATP. This domain is about 200 amino acids long with a strongly conserved motif SGGKD at the N terminus. Probab=71.75 E-value=6.3 Score=17.89 Aligned_cols=58 Identities=14% Similarity=0.157 Sum_probs=28.5 Q ss_pred EEEECCCCCCHHHHHHHCCCHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEEC Q ss_conf 899819767967888877997899999999987659868999943991158999999999998189899808 Q 537021.9.peg.7 144 LIMLTGGSSGPIDRAFFFNSSQVAEKRLLTFKKLFGDRLYVNLQRHRGYDRYRESQVVQLAYTHELPLVATN 215 (1033) Q Consensus 144 LIvlsg~~~g~i~~~l~~~~~~~a~~~l~~l~~~Fgd~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTn 215 (1033) ++++|||.+|-+--. -..+..|++.-.=...++..-+.-.+...++|+++|++.+..+ T Consensus 2 ~vA~SGGVDSs~la~--------------la~~alG~~v~aV~~~s~~~~~~E~e~a~~~a~~~gi~~~~i~ 59 (202) T cd01990 2 AVAFSGGVDSTLLLK--------------AAVDALGDRVLAVTATSPLFPRRELEEAKRLAKEIGIRHEVIE 59 (202) T ss_pred EEECCCCHHHHHHHH--------------HHHHHHCCCEEEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEE T ss_conf 898238599999999--------------9999768787999972798997899999999986398579973 No 109 >TIGR03631 bact_S13 30S ribosomal protein S13. This model describes bacterial ribosomal protein S13, to the exclusion of the homologous archaeal S13P and eukaryotic ribosomal protein S18. This model identifies some (but not all) instances of chloroplast and mitochondrial S13, which is of bacterial type. Probab=71.73 E-value=5.3 Score=18.52 Aligned_cols=47 Identities=30% Similarity=0.475 Sum_probs=32.2 Q ss_pred ECCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHH Q ss_conf 31986985026858969899999999983799999789997348247998999988 Q 537021.9.peg.7 830 VGDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESL 885 (1033) Q Consensus 830 ~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~L 885 (1033) .++..|+++|..|.|||...+..|+..-.=.|-.-+ ..++..+++.| T Consensus 8 ~~~K~v~~ALt~I~GIG~~~A~~Ic~~lgId~~~k~---------g~Ls~~qi~~I 54 (113) T TIGR03631 8 PNNKRVEIALTYIYGIGRTRARKILEKAGIDPDKRV---------KDLTEEELNAI 54 (113) T ss_pred CCCCCHHHHHHCEECCCHHHHHHHHHHCCCCCCCCH---------HHCCHHHHHHH T ss_conf 999674606520027589999999999299987786---------44999999999 No 110 >PRK07571 bidirectional hydrogenase complex protein HoxE; Reviewed Probab=71.59 E-value=6.2 Score=17.95 Aligned_cols=55 Identities=24% Similarity=0.278 Sum_probs=43.9 Q ss_pred HHHHHHHHHHHHHHCCCCCEEECCCHHHHH------HHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 687899999998303802155422077877------8899999997527998898865431 Q 537021.9.peg.7 424 QDRRDEVIRYVQNKYGHERVAQIITFGSLQ------AKAALRDVGRALQMPYSQVDRLCKL 478 (1033) Q Consensus 424 ~~~R~~vi~y~~~kyG~~~va~i~t~~t~~------~k~aird~~r~~g~~~~~~~~~~~~ 478 (1033) ++++-..++-+.+||+.++-|.|--..-.+ +..|++.+|+.+|+|..++-.++.. T Consensus 17 ~~~~~~~i~~il~ry~~~~salipiL~~iQe~~Gyip~eai~~vA~~l~l~~a~V~~VaTF 77 (169) T PRK07571 17 GDKRFKVLEATMKRNQYRQDALIEVLHKAQELFGYLERDLLLYVARQLKLPLSRVYGVATF 77 (169) T ss_pred CCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 3564999999998849986689999999999829989999999999979699999999879 No 111 >smart00812 Alpha_L_fucos Alpha-L-fucosidase. O-Glycosyl hydrolases (EC 3.2.1.-) are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site PUBMED:. Because the fold of proteins is better conserved than their sequences, some of the families can be grouped in 'clans'. Family 29 encompasses alpha-L-fucosidases, which is a lysosomal enzyme responsible for hydrolyzing the alpha-1,6-linked fucose joined to the reducing-end N-acetylglucosamine of the carbohydrate moieties of glycoproteins. Deficiency of alpha-L-fucosidase results in the lysosomal storage disease fucosidosis. Probab=71.08 E-value=6.5 Score=17.78 Aligned_cols=18 Identities=17% Similarity=0.394 Sum_probs=12.0 Q ss_pred HHHHHHHHHHCCCCCCCC Q ss_conf 889999985104764334 Q 537021.9.peg.7 359 IVADFIQWAKKNNIPVGP 376 (1033) Q Consensus 359 iV~Div~~ak~~gI~vGp 376 (1033) ||.++++.||++|+.+|. T Consensus 129 iv~el~~A~rk~Glk~G~ 146 (384) T smart00812 129 LVGELADAVRKRGLKFGL 146 (384) T ss_pred HHHHHHHHHHHCCCEEEE T ss_conf 799999999876976999 No 112 >pfam00416 Ribosomal_S13 Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes. Probab=70.67 E-value=6.6 Score=17.71 Aligned_cols=48 Identities=25% Similarity=0.421 Sum_probs=31.7 Q ss_pred ECCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 319869850268589698999999999837999997899973482479989999888 Q 537021.9.peg.7 830 VGDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLV 886 (1033) Q Consensus 830 ~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li 886 (1033) .+++.|.++|..|.|||...+..|...-.=.|-. ++ +.++..+++.|. T Consensus 8 ~~~K~v~~ALt~I~GIG~~~A~~Ic~~lgi~~~~-------k~--~~Lt~~qi~~l~ 55 (106) T pfam00416 8 DGNKKIEIALTYIKGIGRRKANQILKKAGVDKDK-------RV--GELTEEEIDRIR 55 (106) T ss_pred CCCCEEEEEECCCCCCCHHHHHHHHHHCCCCCCC-------CC--CCCCHHHHHHHH T ss_conf 9996864441121052899999999991959775-------71--549999999999 No 113 >COG1038 PycA Pyruvate carboxylase [Energy production and conversion] Probab=70.45 E-value=6.1 Score=18.03 Aligned_cols=52 Identities=25% Similarity=0.237 Sum_probs=41.8 Q ss_pred CHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEE Q ss_conf 44476443299899999999789988998116517749999999997699638 Q 537021.9.peg.7 18 SSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPI 70 (1033) Q Consensus 18 S~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPI 70 (1033) ..++--++..++++++..|++.|-.|+-=- +|-|+--++|..+|.++||.-| T Consensus 58 ~~~~Pi~aYL~IdeII~iAk~~gaDaIhPG-YGfLSEn~efA~~c~eaGI~FI 109 (1149) T COG1038 58 EGKGPVEAYLSIDEIIRIAKRSGADAIHPG-YGFLSENPEFARACAEAGITFI 109 (1149) T ss_pred CCCCCHHHHCCHHHHHHHHHHCCCCEECCC-CCCCCCCHHHHHHHHHCCCEEE T ss_conf 788726874329999999997088731378-4324479899999997597896 No 114 >PRK13119 consensus Probab=70.27 E-value=6.7 Score=17.65 Aligned_cols=12 Identities=17% Similarity=0.207 Sum_probs=5.2 Q ss_pred HHCCCCCEEECC Q ss_conf 303802155422 Q 537021.9.peg.7 436 NKYGHERVAQII 447 (1033) Q Consensus 436 ~kyG~~~va~i~ 447 (1033) ++||-+.+..|+ T Consensus 142 ~~~gl~~I~lva 153 (261) T PRK13119 142 KDNGVDCIFLIA 153 (261) T ss_pred HHCCCCEEEEEC T ss_conf 975997644307 No 115 >PRK05309 30S ribosomal protein S11; Validated Probab=70.25 E-value=2.5 Score=21.35 Aligned_cols=42 Identities=26% Similarity=0.415 Sum_probs=22.3 Q ss_pred HHHHHHHHHCCCCC--------CCCCCCCHHHHHHHH---HHHCCCCCHHHCC Q ss_conf 89999985104764--------334444204667765---5440375633323 Q 537021.9.peg.7 360 VADFIQWAKKNNIP--------VGPGRGSGAGSVTAY---ALTITDIDPLRFS 401 (1033) Q Consensus 360 V~Div~~ak~~gI~--------vGpGRGSAagSLVaY---~LgIT~VDPi~~~ 401 (1033) ..++.+.|++.||- .||||-+|--+|..- ..-|+|+-|+.|| T Consensus 65 a~~~~~~~~~~Gi~~v~V~ikG~G~GR~~air~L~~~gikI~~I~D~Tpiphn 117 (126) T PRK05309 65 AEDAAKKAKEHGMKTVEVFVKGPGSGRESAIRALQAAGLEVTSIKDVTPIPHN 117 (126) T ss_pred HHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHCCCEEEEEEECCCCCCC T ss_conf 99999999972971999999778833899999998789889999989999978 No 116 >PRK13118 consensus Probab=70.14 E-value=6.8 Score=17.62 Aligned_cols=40 Identities=15% Similarity=0.225 Sum_probs=17.0 Q ss_pred CCHHHHHHHHHHCCCCEEE----EECCCHHHHHHHHHHHHHHCCC Q ss_conf 9989999999978998899----8116517749999999997699 Q 537021.9.peg.7 27 LSLNNILDKIAADQQPAIA----ITDTNNLFSALEFSQKACAAGI 67 (1033) Q Consensus 27 ~~~e~LV~~A~~~G~~AiA----ITD~~nl~Gav~F~~ackk~gI 67 (1033) +++++..+..+..|-+|+. .-|. ++-...++.++..+.|. T Consensus 2 ~ri~~~f~~lk~~~~~ali~yitaG~P-~~e~t~~~~~~l~~~Ga 45 (269) T PRK13118 2 SRIDTRFAELKAQNRKALVPFVTAGDP-SPEASVPLLHGLVAAGA 45 (269) T ss_pred CHHHHHHHHHHHCCCCEEEEEECCCCC-CHHHHHHHHHHHHHCCC T ss_conf 748999999997699559988717189-98999999999997699 No 117 >PRK13120 consensus Probab=70.05 E-value=6.8 Score=17.61 Aligned_cols=44 Identities=9% Similarity=0.135 Sum_probs=23.4 Q ss_pred HCCCHHHHHHHHHHCCC-CEEE----EECCCHHHHHHHHHHHHHHCCCCE Q ss_conf 32998999999997899-8899----811651774999999999769963 Q 537021.9.peg.7 25 GALSLNNILDKIAADQQ-PAIA----ITDTNNLFSALEFSQKACAAGIQP 69 (1033) Q Consensus 25 g~~~~e~LV~~A~~~G~-~AiA----ITD~~nl~Gav~F~~ackk~gIKP 69 (1033) .+.++++..+++++.|- +|+- --| -++-...++.++..+.|.-- T Consensus 3 ~~~Ri~~~f~~~k~~~r~~alI~yitaG~-P~~~~t~~~l~~l~~~GaDi 51 (285) T PRK13120 3 TTDRIAAAFARVSEAGRAAALIPYIAAGD-PSPQATVPLMHALVRAGADL 51 (285) T ss_pred CCHHHHHHHHHHHHCCCCCEEEEEECCCC-CCHHHHHHHHHHHHHCCCCE T ss_conf 20289999999998499822887857858-99899999999999769999 No 118 >cd07230 Pat_TGL4-5_like Triacylglycerol lipase 4 and 5. TGL4 and TGL5 are triacylglycerol lipases that are involved in triacylglycerol mobilization and degradation; they are found in lipid particles. Tgl4 is a functional ortholog of mammalian adipose TG lipase (ATGL) and is phosphorylated and activated by cyclin-dependent kinase 1 (Cdk1/Cdc28). TGL4 is 30% homologus to TGL3, whereas TGL5 is 26% homologus to TGL3. This family includes TGL4 (STC1) and TGL5 (STC2) from Saccharomyces cerevisiae. Probab=69.56 E-value=1.9 Score=22.39 Aligned_cols=33 Identities=24% Similarity=0.141 Sum_probs=21.7 Q ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHH Q ss_conf 851047643344442046677655440375633 Q 537021.9.peg.7 366 WAKKNNIPVGPGRGSGAGSVTAYALTITDIDPL 398 (1033) Q Consensus 366 ~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi 398 (1033) -=-++|.+=--==||+|||+||=+++...=|=| T Consensus 94 aL~e~~LLPrVIsGSSaGaiVAal~~t~tdeEL 126 (421) T cd07230 94 ALFEANLLPRIISGSSAGSIVAAILCTHTDEEI 126 (421) T ss_pred HHHHCCCCCCEECCCCHHHHHHHHHHCCCHHHH T ss_conf 999748886420157535888989862877999 No 119 >PRK13112 consensus Probab=69.27 E-value=7 Score=17.49 Aligned_cols=42 Identities=19% Similarity=0.320 Sum_probs=21.9 Q ss_pred CCHHHHHHHHHHCCCCEEEE----ECCCHHHHHHHHHHHHHHCCCCE Q ss_conf 99899999999789988998----11651774999999999769963 Q 537021.9.peg.7 27 LSLNNILDKIAADQQPAIAI----TDTNNLFSALEFSQKACAAGIQP 69 (1033) Q Consensus 27 ~~~e~LV~~A~~~G~~AiAI----TD~~nl~Gav~F~~ackk~gIKP 69 (1033) +++++..+.+++.|-+|+.- -| -++-...+...+..++|.-- T Consensus 3 ~ri~~~F~~~k~~~r~ali~yitaG~-P~~~~s~~~l~~l~~~GaDi 48 (279) T PRK13112 3 TRIDTRFAELKAEGRPALVTYFMGGD-PDLETSLKIMKALPKAGADI 48 (279) T ss_pred CHHHHHHHHHHHCCCCEEEEEEECCC-CCHHHHHHHHHHHHHCCCCE T ss_conf 57999999999769955898860738-99789999999998779998 No 120 >PRK08418 chlorohydrolase; Provisional Probab=69.18 E-value=6.5 Score=17.76 Aligned_cols=21 Identities=29% Similarity=0.569 Sum_probs=16.6 Q ss_pred CCCHHHHHHCHHHHHHHCCCH Q ss_conf 881334331444764432998 Q 537021.9.peg.7 9 PSFVHLQVHSSYSLLEGALSL 29 (1033) Q Consensus 9 ~~FvhLh~hS~ySlldg~~~~ 29 (1033) |-||..|+|..||...+.... T Consensus 58 PGlVNaH~Hl~~~~~~~~~~~ 78 (407) T PRK08418 58 PAFINAHVHLEFSANKTTLDY 78 (407) T ss_pred ECCEECCCCCCCCCCCCCCCC T ss_conf 796050438120142133345 No 121 >PRK05179 rpsM 30S ribosomal protein S13; Validated Probab=68.78 E-value=6.6 Score=17.72 Aligned_cols=47 Identities=30% Similarity=0.460 Sum_probs=32.0 Q ss_pred CCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 19869850268589698999999999837999997899973482479989999888 Q 537021.9.peg.7 831 GDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLV 886 (1033) Q Consensus 831 ~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li 886 (1033) ++..|.++|..|.|||...+..|++.-.=.|-.-+ ..++..+++.|. T Consensus 11 ~~K~v~~ALt~I~GIG~~~A~~Ic~~lgId~~~k~---------g~Ls~~qi~~I~ 57 (122) T PRK05179 11 RNKRVVIALTYIYGIGRTRAKEILAAAGIDPDTRV---------KDLTDEELDKLR 57 (122) T ss_pred CCCCHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCH---------HHCCHHHHHHHH T ss_conf 99786847730027589999999998299976688---------765699999999 No 122 >cd01320 ADA Adenosine deaminase (ADA) is a monomeric zinc dependent enzyme which catalyzes the irreversible hydrolytic deamination of both adenosine, as well as desoxyadenosine, to ammonia and inosine or desoxyinosine, respectively. ADA plays an important role in the purine pathway. Low, as well as high levels of ADA activity have been linked to several diseases. Probab=68.57 E-value=4.3 Score=19.32 Aligned_cols=26 Identities=27% Similarity=0.478 Sum_probs=23.2 Q ss_pred HHHHHHCHHHHHHHCCCHHHHHHHHHHCCCC Q ss_conf 3343314447644329989999999978998 Q 537021.9.peg.7 12 VHLQVHSSYSLLEGALSLNNILDKIAADQQP 42 (1033) Q Consensus 12 vhLh~hS~ySlldg~~~~e~LV~~A~~~G~~ 42 (1033) ++||+| |+|+++++.+.+.|+++|.+ T Consensus 5 ~eLH~H-----L~Gsi~~~tl~~la~~~~~~ 30 (325) T cd01320 5 AELHLH-----LDGSLRPETILELAKKNGIT 30 (325) T ss_pred HHHHCC-----CCCCCCHHHHHHHHHHCCCC T ss_conf 444407-----77888999999999971999 No 123 >pfam00962 A_deaminase Adenosine/AMP deaminase. Probab=68.36 E-value=4.2 Score=19.40 Aligned_cols=26 Identities=23% Similarity=0.423 Sum_probs=23.6 Q ss_pred HHHHHHCHHHHHHHCCCHHHHHHHHHHCCCC Q ss_conf 3343314447644329989999999978998 Q 537021.9.peg.7 12 VHLQVHSSYSLLEGALSLNNILDKIAADQQP 42 (1033) Q Consensus 12 vhLh~hS~ySlldg~~~~e~LV~~A~~~G~~ 42 (1033) |+||+| |+|+++++-+.+.|+++|.+ T Consensus 5 vELH~H-----L~Gsi~~~tl~ela~~~~~~ 30 (329) T pfam00962 5 VDLHLH-----LDGSMNPDTLLRLAKRKGII 30 (329) T ss_pred HHHHCC-----CCCCCCHHHHHHHHHHCCCC T ss_conf 777648-----76799999999999971999 No 124 >pfam06514 PsbU Photosystem II 12 kDa extrinsic protein (PsbU). This family consists of several photosystem II 12 kDa extrinsic protein (PsbU) proteins from cyanobacteria and algae. PsbU is an extrinsic protein of the photosystem II complex of cyanobacteria and red algae. PsbU is known to stabilize the oxygen-evolving machinery of the photosystem II complex against heat-induced inactivation. Probab=68.18 E-value=7.4 Score=17.32 Aligned_cols=44 Identities=25% Similarity=0.344 Sum_probs=34.4 Q ss_pred EEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHH Q ss_conf 85026858969899999999983799999789997348247998999988 Q 537021.9.peg.7 836 YYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESL 885 (1033) Q Consensus 836 r~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~L 885 (1033) .-.|...+|+-...+..|| .||||.|.+|.+.- .++..++-+.| T Consensus 64 vr~f~q~pGmYPtlA~kIv---~~~PY~~v~Dvl~i---pgLse~qk~~l 107 (135) T pfam06514 64 VRAFRQLPGMYPTLAGKIV---KNAPYKSVEDVLNI---PGLSERQKALL 107 (135) T ss_pred HHHHHHCCCCCHHHHHHHH---HCCCCCCHHHHHCC---CCCCHHHHHHH T ss_conf 9999868884678999997---37996779999718---89899999999 No 125 >COG1237 Metal-dependent hydrolases of the beta-lactamase superfamily II [General function prediction only] Probab=67.68 E-value=7.5 Score=17.24 Aligned_cols=78 Identities=21% Similarity=0.173 Sum_probs=50.4 Q ss_pred CHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHHHHHHHHHHHHCCCCEEEEEEC-CC-HHHHHHHHHHHHHHHHHCC Q ss_conf 789998634898899819767967888877997899999999987659868999943-99-1158999999999998189 Q 537021.9.peg.7 132 CLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVAEKRLLTFKKLFGDRLYVNLQR-HR-GYDRYRESQVVQLAYTHEL 209 (1033) Q Consensus 132 ~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a~~~l~~l~~~Fgd~~ylEl~~-~~-~~e~~~~~~l~~lA~~~~i 209 (1033) +..+..+ ..+||+++|||..+-| .+.+++-++.++++.|.=+-- |. +....+..+.++.-+++++ T Consensus 157 e~aLi~~-~~~GlvvItGCsH~GI------------~niv~~~~~~~g~rv~~ViGGFHL~~~~~~~l~~~~~~l~el~v 223 (259) T COG1237 157 EQALIVE-TEKGLVVITGCSHPGI------------VNIVEWAKERSGDRVKAVIGGFHLIGASEERLEEVADYLKELGV 223 (259) T ss_pred CEEEEEE-CCCCEEEEECCCCCCH------------HHHHHHHHHHCCCEEEEEEEEECCCCCCHHHHHHHHHHHHHCCC T ss_conf 4179996-4786399982786438------------99999999860644578962102378768899999999985588 Q ss_pred CEEEECCCCCCCHHH Q ss_conf 899808830078788 Q 537021.9.peg.7 210 PLVATNNSLFLSEED 224 (1033) Q Consensus 210 PlVaTndv~Y~~~~D 224 (1033) ..|+.. |+.-... T Consensus 224 ~~i~pc--HCTg~~a 236 (259) T COG1237 224 EKIYPC--HCTGEKA 236 (259) T ss_pred CEEEEC--CCCCHHH T ss_conf 769820--7788899 No 126 >PRK13140 consensus Probab=67.28 E-value=7.6 Score=17.18 Aligned_cols=14 Identities=29% Similarity=0.544 Sum_probs=5.9 Q ss_pred HHHHCCCCCEEECC Q ss_conf 98303802155422 Q 537021.9.peg.7 434 VQNKYGHERVAQII 447 (1033) Q Consensus 434 ~~~kyG~~~va~i~ 447 (1033) +.++||-+.|-.|+ T Consensus 139 ~~~~~~i~~I~lva 152 (257) T PRK13140 139 IFEKYGLKNIMLIT 152 (257) T ss_pred HHHHCCCCEEEEEC T ss_conf 99986997799868 No 127 >CHL00137 rps13 ribosomal protein S13; Validated Probab=67.23 E-value=7.7 Score=17.18 Aligned_cols=28 Identities=36% Similarity=0.557 Sum_probs=24.1 Q ss_pred ECCCEEEEEHHHHCCCCHHHHHHHHHHH Q ss_conf 3198698502685896989999999998 Q 537021.9.peg.7 830 VGDNRIYYSLAAIKGVGTTTARHIMEAS 857 (1033) Q Consensus 830 ~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r 857 (1033) ++++.|+++|..|.|||...+..|...- T Consensus 10 ~~~K~v~~aLt~I~GIG~~~A~~Ic~~l 37 (122) T CHL00137 10 PRNKRIEIALTYIYGIGLTSAKKILEKA 37 (122) T ss_pred CCCCEEEEHHHHHCCCCHHHHHHHHHHC T ss_conf 9997731311100061899999999984 No 128 >TIGR03629 arch_S13P archaeal ribosomal protein S13P. This model describes exclusively the archaeal ribosomal protein S13P. It excludes the homologous eukaryotic 40S ribosomal protein S18 and bacterial 30S ribosomal protein S13. Probab=67.12 E-value=7.7 Score=17.16 Aligned_cols=47 Identities=30% Similarity=0.503 Sum_probs=31.3 Q ss_pred CCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 19869850268589698999999999837999997899973482479989999888 Q 537021.9.peg.7 831 GDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLV 886 (1033) Q Consensus 831 ~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li 886 (1033) ++..|.++|.+|+|||...+..|++.-.-.| -.++ ..++..+++.|. T Consensus 15 ~~K~v~~aLt~I~GIG~~~A~~Ic~~~gid~-------~~k~--~~Lt~~ei~~i~ 61 (144) T TIGR03629 15 GNKPVEYALTGIKGIGRRFARAIARKLGVDP-------NAKL--GYLDDEEIEKLE 61 (144) T ss_pred CCCEEEEEEEEECCCCHHHHHHHHHHHCCCC-------CCCC--CCCCHHHHHHHH T ss_conf 9968988722123728999999999909993-------5360--419999999999 No 129 >PRK13113 consensus Probab=67.03 E-value=7.7 Score=17.15 Aligned_cols=40 Identities=20% Similarity=0.337 Sum_probs=17.8 Q ss_pred CCHHHHHHHHHHCCCCEEEE----ECCCHHHHHHHHHHHHHHCCC Q ss_conf 99899999999789988998----116517749999999997699 Q 537021.9.peg.7 27 LSLNNILDKIAADQQPAIAI----TDTNNLFSALEFSQKACAAGI 67 (1033) Q Consensus 27 ~~~e~LV~~A~~~G~~AiAI----TD~~nl~Gav~F~~ackk~gI 67 (1033) +++++..+++++.|-+|+-. -|. ++--..++.++..+.|. T Consensus 2 ~ri~~~f~~lk~~~~~ali~yitaG~P-~~e~s~~~~~~l~~~Ga 45 (263) T PRK13113 2 TRIDDTFARLRAEGKKAFVAYVMAGDP-DYDTSLEVMRGLPAAGV 45 (263) T ss_pred CHHHHHHHHHHHCCCCEEEEEECCCCC-CHHHHHHHHHHHHHCCC T ss_conf 748999999997699548988738289-97999999999997699 No 130 >cd07232 Pat_PLPL Patain-like phospholipase. Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants and fungi. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein, but it also has the enzymatic activity of a lipid acyl hydrolase, catalyzing the cleavage of fatty acids from membrane lipids. Members of this family have been found also in vertebrates. Probab=66.43 E-value=1.9 Score=22.20 Aligned_cols=39 Identities=21% Similarity=0.277 Sum_probs=24.5 Q ss_pred HHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHC Q ss_conf 510476433444420466776554403756333231056762100 Q 537021.9.peg.7 367 AKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERFLNPD 411 (1033) Q Consensus 367 ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERFLnp~ 411 (1033) =-++|.+=--==||+|||+||=+++.-.=|=| ++||.|+ T Consensus 89 L~e~~LLPrVIsGSSaGaiVAa~l~t~tdeEL------~~~l~~~ 127 (407) T cd07232 89 LLDADLLPNVISGTSGGSLVAALLCTRTDEEL------KQLLVPE 127 (407) T ss_pred HHHCCCCCCEECCCCHHHHHHHHHHCCCHHHH------HHHHHHH T ss_conf 99748886432057535788989851777899------9873477 No 131 >PRK04053 rps13p 30S ribosomal protein S13P; Reviewed Probab=66.41 E-value=7.9 Score=17.06 Aligned_cols=47 Identities=30% Similarity=0.507 Sum_probs=31.5 Q ss_pred CCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 19869850268589698999999999837999997899973482479989999888 Q 537021.9.peg.7 831 GDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLV 886 (1033) Q Consensus 831 ~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li 886 (1033) +++.|.++|..|.|||...+..|++.-.-.|- .|+ +.++..+++.|. T Consensus 19 ~~K~i~~ALt~IyGIG~~~A~~Ic~~lgId~~-------~k~--~~Lte~qi~~l~ 65 (149) T PRK04053 19 GTKPVEYALTGIKGIGRRTARAIARKLGLDPH-------AKL--GYLSDEEIEKIE 65 (149) T ss_pred CCCEEEEECCCCCCCCHHHHHHHHHHHCCCCC-------CCC--CCCCHHHHHHHH T ss_conf 99686444111148489999999999189987-------730--749999999999 No 132 >TIGR00612 ispG_gcpE 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase; InterPro: IPR004588 This protein previously of unknown biochemical function is essential in Escherichia coli. It has now been characterised as 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase, which converts 2C-methyl-D-erythritol 2,4-cyclodiphosphate (ME-2,4CPP) into 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate in the sixth step of nonmevalonate terpenoid biosynthesis. The family is restricted to bacteria, where it is widely but not universally distributed. No homology can be detected between this family and other proteins.; GO: 0016114 terpenoid biosynthetic process. Probab=65.93 E-value=8.1 Score=16.99 Aligned_cols=221 Identities=14% Similarity=0.134 Sum_probs=94.0 Q ss_pred CCCCCHHHHHHHCCCHH--HHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHH-----HHCCCEEEECCCCCC- Q ss_conf 97679678888779978--9999999998765986899994399115899999999999-----818989980883007- Q 537021.9.peg.7 149 GGSSGPIDRAFFFNSSQ--VAEKRLLTFKKLFGDRLYVNLQRHRGYDRYRESQVVQLAY-----THELPLVATNNSLFL- 220 (1033) Q Consensus 149 g~~~g~i~~~l~~~~~~--~a~~~l~~l~~~Fgd~~ylEl~~~~~~e~~~~~~l~~lA~-----~~~iPlVaTndv~Y~- 220 (1033) |+-....-|...+-... .+...+.+...+.--+ -||.|---++..-.+.+.++.+ ..++|||| |+||. T Consensus 21 GGdaPi~vQSMt~t~T~GaD~~~tv~qi~~L~e~G--cdIVRvtVp~~e~a~A~~~I~~rL~al~~~vPLVA--DiHf~G 96 (633) T TIGR00612 21 GGDAPIVVQSMTNTDTLGADVDATVEQIRALEEAG--CDIVRVTVPDKESAEALEEIKERLQALGLNVPLVA--DIHFAG 96 (633) T ss_pred CCCCCCCEECCCCCCCHHHHHHHHHHHHHHHHHCC--CCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCEEE--EECCCC T ss_conf 78875220007764501310899999999999719--94899960884679999999999986268887488--730256 Q ss_pred --CHHHHH-HHHHHHHHHCC--CCCCCCCCCCCCCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf --878899-99876688469--8576531002453432343478875122035899878864111000133221023455 Q 537021.9.peg.7 221 --SEEDYE-AHDVLMAVAHS--TVVSQKDRPRVTPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRF 295 (1033) Q Consensus 221 --~~~D~~-~h~iL~aI~~~--~~l~~~~~~~~~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f 295 (1033) -+.+.. ..+.+.|+|-| ....+.. .+..+-| |.++..+.-..++|-+.. |.++|.-. ++ T Consensus 97 hk~~~~~~~~~~~v~kvRINPGNig~k~~--~~~~~~Y--~~~~yA~~~~~~~e~v~~---~Ve~ar~~---G~------ 160 (633) T TIGR00612 97 HKYKKAALAVAKGVAKVRINPGNIGFKKK--DAKRKIY--SDAKYAEILLKVEERVRD---VVEKARRH---GK------ 160 (633) T ss_pred CCHHHHHHHHHHCCCEEEECCCCCCCCCC--HHHCCCC--CHHHHHHHHCCHHHHHHH---HHHHHHHC---CC------ T ss_conf 61468999986117825437876434670--1200456--425788863004678899---99999857---89------ Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCC---CHH----------HHHHHHHHH-HHHHHHHCCHHHHHHHH Q ss_conf 5756887768988999876541034201246432457---838----------999998877-89898606305578889 Q 537021.9.peg.7 296 IEKECDDIQKEEENELRNKAVAGLEVRLAQGAIAKGY---HAQ----------DYRQRLDFE-LNVIARMKFSGYFLIVA 361 (1033) Q Consensus 296 ~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~~~~~~~~~---~~~----------~y~~RLe~E-L~vI~~~gf~~YFLiV~ 361 (1033) ++--|-+...=+...++++ -.+|+++..+-.. .++ ++.+.+++. =+++-+|.-++=+-+|. T Consensus 161 a~RIGVN~GSL~~~~~~~l-----l~~Yg~~~~~~~~qev~~EA~V~SAl~~a~~~e~~~f~rd~~v~S~KaS~v~~~va 235 (633) T TIGR00612 161 AMRIGVNHGSLEQVLKRRL-----LEKYGDKGFSLTAQEVTAEAMVQSALESAEILEKLDFRRDNVVLSMKASDVAETVA 235 (633) T ss_pred CEEEECCCCCCHHHHHHHH-----HHHHCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCHHHHHH T ss_conf 6875035676608999999-----99715468321743158789999999999999850894460799984187678999 Q ss_pred HHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCH Q ss_conf 999985104764334444204667765544037563332310 Q 537021.9.peg.7 362 DFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLL 403 (1033) Q Consensus 362 Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~Ll 403 (1033) =.-.+|+..+-+ | -++.=-||||+-=+..+|.. T Consensus 236 Ay~~lA~~~D~~-G--------~~~PLHLGvTEAG~g~~Giv 268 (633) T TIGR00612 236 AYRLLAERSDEL-G--------MLYPLHLGVTEAGMGVKGIV 268 (633) T ss_pred HHHHHHHHHHHC-C--------CCCCCCCCCEECCCCCCCHH T ss_conf 999999998650-3--------52524254120288887345 No 133 >cd04318 EcAsnRS_like_N EcAsnRS_like_N: N-terminal, anticodon recognition domain of the type found in Escherichia coli asparaginyl-tRNA synthetase (AsnRS) and, in Arabidopsis thaliana and Saccharomyces cerevisiae mitochondrial (mt) AsnRS. This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. The enzymes in this group are homodimeric class2b aminoacyl-tRNA synthetases (aaRSs). aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Eukaryotes contain 2 sets of aaRSs, both of which are encoded by the nuclear genome. One set concerns with cytoplasmic protein synthesis, whereas the other exclusively with mitochondrial Probab=65.65 E-value=8.2 Score=16.95 Aligned_cols=32 Identities=13% Similarity=0.256 Sum_probs=21.3 Q ss_pred EEEEEEEEEEEEECCCCCCEEEEEEEECCCCE--EEEEE Q ss_conf 99999996112101588876999999849704--99995 Q 537021.9.peg.7 995 IRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKE--YEANF 1031 (1033) Q Consensus 995 v~iaG~V~~i~~~~TKkG~~mafltleD~~g~--~e~~i 1031 (1033) |.+.|+|.++|..+ .++|+.|.|.|+. +.+++ T Consensus 2 V~i~GWV~~~R~~~-----~i~Fi~L~DGS~~~~lQvV~ 35 (82) T cd04318 2 VTVNGWVRSVRDSK-----KISFIELNDGSCLKNLQVVV 35 (82) T ss_pred EEEEEEEEEEECCC-----CEEEEEEECCCCCCCEEEEE T ss_conf 99998688886489-----74999997898644389998 No 134 >PTZ00134 40S ribosomal protein S18; Provisional Probab=64.92 E-value=5.2 Score=18.58 Aligned_cols=47 Identities=26% Similarity=0.460 Sum_probs=32.1 Q ss_pred CCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 19869850268589698999999999837999997899973482479989999888 Q 537021.9.peg.7 831 GDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLV 886 (1033) Q Consensus 831 ~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li 886 (1033) .+..|.++|..|+|||..++..|++.- | .|--.|+ ..++..+++.|. T Consensus 24 g~K~v~~aLt~I~GIG~~~A~~Ic~~~--g-----id~~~r~--g~Lteeei~~i~ 70 (154) T PTZ00134 24 GREKVTIALTAIKGIGRRFATVVCKQA--G-----VDVTKRA--GELTQEEINKIV 70 (154) T ss_pred CCCEEEEEEEEECCCCHHHHHHHHHHC--C-----CCCCCCC--CCCCHHHHHHHH T ss_conf 995889985322064899999999980--9-----8945421--649999999999 No 135 >pfam00411 Ribosomal_S11 Ribosomal protein S11. Probab=64.62 E-value=3.4 Score=20.15 Aligned_cols=43 Identities=26% Similarity=0.391 Sum_probs=22.3 Q ss_pred HHHHHHHHHHCCCCC--------CCCCCCCHHHHHHHH---HHHCCCCCHHHCC Q ss_conf 889999985104764--------334444204667765---5440375633323 Q 537021.9.peg.7 359 IVADFIQWAKKNNIP--------VGPGRGSGAGSVTAY---ALTITDIDPLRFS 401 (1033) Q Consensus 359 iV~Div~~ak~~gI~--------vGpGRGSAagSLVaY---~LgIT~VDPi~~~ 401 (1033) +..++.+.|++.||. .||||-+|--+|-.- .+-|+|+-|+.|| T Consensus 48 aa~~~~~~~~~~gi~~v~v~ikG~G~Gr~~air~l~~~gl~I~~I~D~Tpiphn 101 (109) T pfam00411 48 AAENAAKIAKEQGIKALEVKIKGPGPGRESALRALARSGLRIGRIRDVTPIPHN 101 (109) T ss_pred HHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHHHHCCCEEEEEEECCCCCCC T ss_conf 999999999981985999999889987899999998779989999985999989 No 136 >cd04320 AspRS_cyto_N AspRS_cyto_N: N-terminal, anticodon recognition domain of the type found in Saccharomyces cerevisiae and human cytoplasmic aspartyl-tRNA synthetase (AspRS). This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. The enzymes in this group are homodimeric class2b aminoacyl-tRNA synthetases (aaRSs). aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Eukaryotes contain 2 sets of aaRSs, both of which are encoded by the nuclear genome. One set concerns with cytoplasmic protein synthesis, whereas the other exclusively with mitochondrial protein synthesis. Probab=64.39 E-value=8.6 Score=16.77 Aligned_cols=34 Identities=9% Similarity=0.289 Sum_probs=16.4 Q ss_pred EEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEEC Q ss_conf 99999996112101588876999999849704999953 Q 537021.9.peg.7 995 IRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 995 v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iF 1032 (1033) |++.|.|-++|. .|.+++|+.|-|.+|.+.++++ T Consensus 2 V~~rGwv~~~R~----~g~kl~Fl~LRd~~g~iQ~V~~ 35 (102) T cd04320 2 VLIRARVHTSRA----QGAKLAFLVLRQQGYTIQGVLA 35 (102) T ss_pred EEEEEEEEEECC----CCCEEEEEEEECCCEEEEEEEE T ss_conf 899999972021----7971799999158718999998 No 137 >cd04321 ScAspRS_mt_like_N ScAspRS_mt_like_N: N-terminal, anticodon recognition domain of the type found in Saccharomyces cerevisiae mitochondrial (mt) aspartyl-tRNA synthetase (AspRS). This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. The enzymes in this fungal group are homodimeric class2b aminoacyl-tRNA synthetases (aaRSs). aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Eukaryotes contain 2 sets of aaRSs, both of which are encoded by the nuclear genome. One set concerns with cytoplasmic protein synthesis, whereas the other exclusively with mitochondrial protein synthesis. Mutations in the gene for Probab=64.17 E-value=8.6 Score=16.74 Aligned_cols=34 Identities=9% Similarity=0.158 Sum_probs=15.1 Q ss_pred EEEEEEEEEEEEECCCCCCEEEEEEEECCCC-EEEEEEC Q ss_conf 9999999611210158887699999984970-4999953 Q 537021.9.peg.7 995 IRLAAMVVSKQQKKTRKGSRIGWVTFSEPAK-EYEANFF 1032 (1033) Q Consensus 995 v~iaG~V~~i~~~~TKkG~~mafltleD~~g-~~e~~iF 1032 (1033) |++.|+|-..++. .| .+.|+.|-|.+| .+.++.. T Consensus 2 VtL~GWv~~rrd~---~g-~l~FidLRD~~G~~vQiV~~ 36 (86) T cd04321 2 VTLNGWIDRKPRI---VK-KLSFADLRDPNGDIIQLVST 36 (86) T ss_pred EEEEEEEEEEECC---CC-CEEEEEEECCCCCEEEEEEC T ss_conf 9999738745136---89-86999998399999999948 No 138 >PRK00286 xseA exodeoxyribonuclease VII large subunit; Reviewed Probab=64.13 E-value=8.6 Score=16.73 Aligned_cols=18 Identities=11% Similarity=0.041 Sum_probs=8.5 Q ss_pred CCEEEEEECCHHHHHHHH Q ss_conf 753999956968999999 Q 537021.9.peg.7 96 VPSIVLLVSTAEGYQRLI 113 (1033) Q Consensus 96 ~~~lvlLAkN~~Gy~nL~ 113 (1033) .+.+.+-.-...|-..|. T Consensus 90 ~~ql~v~~i~~~g~G~L~ 107 (443) T PRK00286 90 DYQLIVEDIEPAGIGALA 107 (443) T ss_pred CEEEEEEEEEECCCCHHH T ss_conf 189999781277921899 No 139 >TIGR01179 galE UDP-glucose 4-epimerase; InterPro: IPR005886 Synonym: UDP-galactose 4-epimerase UDP-glucose 4-epimerase (5.1.3.2 from EC) interconverts UDP-glucose and UDP-galactose which are precursors of glucose- and galactose-containing exopolysaccharides (EPS). A set of related proteins, some of which are tentatively identified as UDP-glucose-4-epimerase in Thermotoga maritima, Bacillus halodurans, and several archaea, but deeply branched from this set and lacking experimental evidence, are excluded and described by a separate model. ; GO: 0003978 UDP-glucose 4-epimerase activity, 0006012 galactose metabolic process. Probab=64.12 E-value=6.3 Score=17.92 Aligned_cols=23 Identities=22% Similarity=0.436 Sum_probs=15.2 Q ss_pred CCHHHHHHHHHHHHHHCCCCEEE Q ss_conf 65177499999999976996389 Q 537021.9.peg.7 49 TNNLFSALEFSQKACAAGIQPII 71 (1033) Q Consensus 49 ~~nl~Gav~F~~ackk~gIKPIi 71 (1033) +||+.|.+...++|+++|+|.|| T Consensus 101 ~NNv~nTl~L~~~m~~~GV~~~i 123 (341) T TIGR01179 101 RNNVVNTLNLLEAMQETGVKKFI 123 (341) T ss_pred HHHHHHHHHHHHHHHHHCCCCEE T ss_conf 00468999999999981897415 No 140 >PRK07228 N-ethylammeline chlorohydrolase; Provisional Probab=64.08 E-value=8.7 Score=16.73 Aligned_cols=64 Identities=14% Similarity=0.281 Sum_probs=36.7 Q ss_pred CCCHHHHHHCHHHHHHHCCC--------------------HHHHHHHH-------HHCCCCEEEEECCCHHHHHHHHHHH Q ss_conf 88133433144476443299--------------------89999999-------9789988998116517749999999 Q 537021.9.peg.7 9 PSFVHLQVHSSYSLLEGALS--------------------LNNILDKI-------AADQQPAIAITDTNNLFSALEFSQK 61 (1033) Q Consensus 9 ~~FvhLh~hS~ySlldg~~~--------------------~e~LV~~A-------~~~G~~AiAITD~~nl~Gav~F~~a 61 (1033) |-||.-|+|...|+.+|... ++++...+ .+.|..+ +.|+.+.+..-...++ T Consensus 56 PGlIdaH~H~~~~~~rg~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~a~~~~~e~l~~G~Tt--v~d~~~~~~~~~~~~a 133 (445) T PRK07228 56 PGLIQGHIHLCQTLFRGQADDLELLDWLKDRIWPLEASHDAESMYISADLGIAELIESGTTT--IVDMESVHHTDSAFEA 133 (445) T ss_pred ECHHHHHHCHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCEE--EEECCCHHHHHHHHHH T ss_conf 45350876953042331458877799999857765523899999999999999999769289--9714320136999999 Q ss_pred HHHCCCCEEEEEE Q ss_conf 9976996389999 Q 537021.9.peg.7 62 ACAAGIQPIIGCQ 74 (1033) Q Consensus 62 ckk~gIKPIiG~E 74 (1033) +.+.|++.+.|-. T Consensus 134 ~~~~GiR~~~~~~ 146 (445) T PRK07228 134 IADSGIRAVLGKV 146 (445) T ss_pred HHHHCCCEEEEEE T ss_conf 9981984588778 No 141 >PRK13124 consensus Probab=64.03 E-value=8.7 Score=16.72 Aligned_cols=11 Identities=27% Similarity=0.534 Sum_probs=3.7 Q ss_pred HHCCCCCEEEC Q ss_conf 30380215542 Q 537021.9.peg.7 436 NKYGHERVAQI 446 (1033) Q Consensus 436 ~kyG~~~va~i 446 (1033) ++||-+.+-.| T Consensus 135 ~~~gl~~I~lv 145 (257) T PRK13124 135 DKYGIYLIPLV 145 (257) T ss_pred HHCCCCEEEEE T ss_conf 86687357884 No 142 >TIGR00160 MGSA methylglyoxal synthase; InterPro: IPR004363 Methylglyoxal synthase (MGS) catalyzes the conversion of dihydroxyacetone phosphate to methylglyoxal and phosphate:Glycerone phosphate = methylglyoxal + phosphate It provides bacteria with an alternative to triosephosphate isomerase for metabolizing dihydroxyacetone phosphate. MGS is a small protein of about 13 to 17 kDa. An aspartate residue is involved in the catalytic mechanism. Methylglyoxal synthase contains a domain shared by other enzymes. Other proteins containing this domain include purine biosynthesis protein PurH and carbamoyl phosphate synthetase.; GO: 0008929 methylglyoxal synthase activity, 0019242 methylglyoxal biosynthetic process, 0005737 cytoplasm. Probab=63.31 E-value=6.5 Score=17.76 Aligned_cols=52 Identities=23% Similarity=0.374 Sum_probs=30.7 Q ss_pred EECCCCCCH--HHHHHHCCCHHHHHHHHHHHHHHCCCCEE---EEEECCCHHHHHHHHHHHHHHHHHCCCEEEEC Q ss_conf 981976796--78888779978999999999876598689---99943991158999999999998189899808 Q 537021.9.peg.7 146 MLTGGSSGP--IDRAFFFNSSQVAEKRLLTFKKLFGDRLY---VNLQRHRGYDRYRESQVVQLAYTHELPLVATN 215 (1033) Q Consensus 146 vlsg~~~g~--i~~~l~~~~~~~a~~~l~~l~~~Fgd~~y---lEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTn 215 (1033) ++||+++|. |+-.+..|..+ -. + || +--|.|.++ .+.|++||.=++|| |||| T Consensus 54 ~~SGPmGGDqQiGa~Iaeg~Id-------~~--I----Ff~~PL~AqPHEPD----V~ALlRLc~V~nIP-lAtN 110 (143) T TIGR00160 54 LLSGPMGGDQQIGALIAEGKID-------AV--I----FFRDPLNAQPHEPD----VKALLRLCDVYNIP-LATN 110 (143) T ss_pred CCCCCCCHHHHHHHHHHHCHHH-------HH--H----HHHHHCCCCCCCCC----HHHHHHHHHHCCCH-HHHH T ss_conf 0568996246789998612265-------44--5----03300147888800----78898786450250-3440 No 143 >TIGR00010 TIGR00010 hydrolase, TatD family; InterPro: IPR015991 Escherichia coli TatD (synonym MttC) has been shown experimentally to be a DNase . On the basis of operon structure, it was originally thought to be involved in the Sec-independent export of protein precursors bearing twin arginine signal peptides by the Tat system . This alternative membrane targeting and translocation system (Mtt), also called twin arginine translocase (Tat) (MttA1A2BC or TatABCD), has been shown to transport fully folded proteins to, or across, the membrane , . However, it was later determined that TatD is not required for this translocation process , . Furthermore, it was noted that no exact correlation exists between the ability of an organism to synthesise a TatD-like protein and whether that organism has a Tat protein transport pathway . Also, with the exception of Escherichia coli , the genetic linkage between TatD and other Tat genes (operon structure) is not maintained. Members of this group may be distantly related to a large 3D fold-based domain superfamily of metalloenzymes . The description of this fold superfamily was based on an analysis of conservation patterns in three dimensions, and the discovery that the same active-site architecture occurs in a large set of enzymes involved primarily in nucleotide metabolism. The group is thought to include urease, dihydroorotase, allantoinase, hydantoinase, AMP-, adenine- and cytosine- deaminases, imidazolonepropionase, aryldialkylphosphatase, chlorohydrolase, formylmethanofuran dehydrogenase, and other enzymes .. Probab=63.28 E-value=8.9 Score=16.62 Aligned_cols=215 Identities=15% Similarity=0.186 Sum_probs=114.8 Q ss_pred HCHHHHHHHCC---CHHHHHHHHHHCCCCEEE-EECCCHHHHHH-HHHHHHHHCC---CCEEEEEEEEEECCCCCCCCCC Q ss_conf 14447644329---989999999978998899-81165177499-9999999769---9638999999854766433445 Q 537021.9.peg.7 17 HSSYSLLEGAL---SLNNILDKIAADQQPAIA-ITDTNNLFSAL-EFSQKACAAG---IQPIIGCQLDIDMQDNLKMTNQ 88 (1033) Q Consensus 17 hS~ySlldg~~---~~e~LV~~A~~~G~~AiA-ITD~~nl~Gav-~F~~ackk~g---IKPIiG~E~~v~~~~~~~~~~~ 88 (1033) |-+-+.++--. ..++.+++|++.|..++- +++ ++.++- +.+..+++.. +.+-+|.- + T Consensus 5 HcHL~~~~~~~~~~d~~~v~~ra~~~~v~~~v~vg~--~l~~~~W~~~~l~~~~~g~~v~~~~G~H--------P----- 69 (269) T TIGR00010 5 HCHLDLLDFEELHVDVEEVIERAKAEGVTAVVDVGT--DLEDLNWRALELAEKYPGDNVYAAVGVH--------P----- 69 (269) T ss_pred EECCCCCCCCHHCCCHHHHHHHHHHCCCEEEEEECC--CCCCCHHHHHHHHHHCCCCEEEEEECCC--------C----- T ss_conf 013562222122035899999999749929997346--7002069999999847997699982268--------7----- Q ss_pred CCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHHH Q ss_conf 65666677539999569689999999998997324877874007899986348988998197679678888779978999 Q 537021.9.peg.7 89 GKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVAE 168 (1033) Q Consensus 89 ~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a~ 168 (1033) .=+=..+..=...|.+++.. +.+.++++ | |+ T Consensus 70 ----------~d~~~~~~~~~~~l~~l~~~---------------------~~~~v~Ai-G----E~------------- 100 (269) T TIGR00010 70 ----------LDVEDDTKEDIKELEKLAKN---------------------AHPKVVAI-G----ET------------- 100 (269) T ss_pred ----------CCHHHHHHHHHHHHHHHHHH---------------------HCCCEEEE-E----EE------------- T ss_conf ----------22204208999999999985---------------------06954798-5----10------------- Q ss_pred HHHHHHHHHCCCCEEEEEECCC-HHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 9999998765986899994399-115899999999999818989980883007878899998766884698576531002 Q 537021.9.peg.7 169 KRLLTFKKLFGDRLYVNLQRHR-GYDRYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKDRPR 247 (1033) Q Consensus 169 ~~l~~l~~~Fgd~~ylEl~~~~-~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~~~~ 247 (1033) |=+||-.-...+ ..++.....-++||+++++|+|.=+ .-.+.+..++|.....-..-...+-+ T Consensus 101 ----------GLDY~~~~~~~e~~~Q~e~F~~q~~lA~~~~~P~iiH~-----RdA~~d~~~iL~~~y~~~~~~~~g~~- 164 (269) T TIGR00010 101 ----------GLDYYKADEKKERRRQKEVFRAQLQLAEELNLPVIIHA-----RDAEEDVLDILREYYAEKAPKVGGVL- 164 (269) T ss_pred ----------CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEC-----CCHHHHHHHHHHHHHHCCCCCCCEEE- T ss_conf ----------44043068752178999999999999997199689978-----84579999999999842787787579- Q ss_pred CCCCCCCCCHHHHHHHCCHHH---H------H--H---HHHHHHHHHHHHH---HHHHHCCCCCCCCCCCCCCCCCHHHH Q ss_conf 453432343478875122035---8------9--9---8788641110001---33221023455575688776898899 Q 537021.9.peg.7 248 VTPDHYLKNRSEMVSIFSDLP---E------A--L---ENTVEIARRCSFI---LQVHAPILPRFIEKECDDIQKEEENE 310 (1033) Q Consensus 248 ~~~~~yLks~eEM~~~f~~~p---e------A--i---~NT~~IAe~c~~~---l~~~~~~lP~f~~p~~~~~~~~~~~~ 310 (1033) |.+-.+-|+.+.+.+.. - . . .+-.+++.++..+ .+.+.|.|=.-|... ..++..+ T Consensus 165 ----HCftg~~E~a~~~ld~gd~~~yiSisG~vTFknA~~l~~~v~~iP~e~LL~ETDSPyLaP~P~RG----K~N~P~~ 236 (269) T TIGR00010 165 ----HCFTGDAELAKKLLDLGDLLFYISISGIVTFKNAKSLREVVKKIPLERLLVETDSPYLAPVPYRG----KRNEPAF 236 (269) T ss_pred ----EECCCCHHHHHHHHCCCCEEEEEECCCCEEEECCHHHHHHHHHCCCCCCEEEECCCCCCCCCCCC----CCCCCCH T ss_conf ----81488889999996069879999728602210626789999745964323201674457889889----8567405 Q ss_pred HHHHHHHHH Q ss_conf 987654103 Q 537021.9.peg.7 311 LRNKAVAGL 319 (1033) Q Consensus 311 Lr~l~~~Gl 319 (1033) .+.-|.+=+ T Consensus 237 v~~v~~~iA 245 (269) T TIGR00010 237 VRYVVEAIA 245 (269) T ss_pred HHHHHHHHH T ss_conf 999999999 No 144 >PRK09358 adenosine deaminase; Provisional Probab=62.75 E-value=6.4 Score=17.82 Aligned_cols=26 Identities=27% Similarity=0.463 Sum_probs=23.7 Q ss_pred HHHHHHCHHHHHHHCCCHHHHHHHHHHCCCC Q ss_conf 3343314447644329989999999978998 Q 537021.9.peg.7 12 VHLQVHSSYSLLEGALSLNNILDKIAADQQP 42 (1033) Q Consensus 12 vhLh~hS~ySlldg~~~~e~LV~~A~~~G~~ 42 (1033) ++||+| |+|+++|+.+.+.|++.|.+ T Consensus 12 ~ELH~H-----L~Gsi~~etl~~la~~~~~~ 37 (333) T PRK09358 12 AELHLH-----LDGSLRPETILELARRNGID 37 (333) T ss_pred HHHHHC-----CCCCCCHHHHHHHHHHCCCC T ss_conf 898707-----76888999999999982999 No 145 >COG0099 RpsM Ribosomal protein S13 [Translation, ribosomal structure and biogenesis] Probab=62.66 E-value=8.2 Score=16.91 Aligned_cols=47 Identities=30% Similarity=0.434 Sum_probs=32.8 Q ss_pred CCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 19869850268589698999999999837999997899973482479989999888 Q 537021.9.peg.7 831 GDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLV 886 (1033) Q Consensus 831 ~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li 886 (1033) .++.+.++|.+|+|||...+..|++.- | .|.-.|+ +.+...+++.|- T Consensus 11 ~~K~v~iALt~IyGIG~~~a~~I~~~~--g-----i~~~~r~--~~Lteeei~~ir 57 (121) T COG0099 11 GNKRVVIALTYIYGIGRRRAKEICKKA--G-----IDPDKRV--GELTEEEIERLR 57 (121) T ss_pred CCCEEEEHHHHHCCCCHHHHHHHHHHC--C-----CCHHHHH--CCCCHHHHHHHH T ss_conf 982576504630353699999999991--9-----8866764--529999999999 No 146 >pfam08288 PIGA PIGA (GPI anchor biosynthesis). This domain is found on phosphatidylinositol n-acetylglucosaminyltransferase proteins. These proteins are involved in GPI anchor biosynthesis and are associated with disease the paroxysmal nocturnal haemoglobinuria. Probab=62.65 E-value=5.4 Score=18.47 Aligned_cols=23 Identities=35% Similarity=0.356 Sum_probs=12.8 Q ss_pred HHHHHHHHHHHCCCEEEECCCCC Q ss_conf 99999999981898998088300 Q 537021.9.peg.7 197 ESQVVQLAYTHELPLVATNNSLF 219 (1033) Q Consensus 197 ~~~l~~lA~~~~iPlVaTndv~Y 219 (1033) ..+.+--|+..|++.|-|--+.| T Consensus 64 ~hEailhA~tmGlktVfTDHSLF 86 (90) T pfam08288 64 AHEAILHARTMGLKTVFTDHSLF 86 (90) T ss_pred HHHHHHHHHHCCCCEEEECCCCC T ss_conf 88999999866981785245323 No 147 >PRK13132 consensus Probab=62.40 E-value=9.2 Score=16.50 Aligned_cols=47 Identities=19% Similarity=0.412 Sum_probs=19.6 Q ss_pred CCHH-HCCCHHHHHHHHCCCC------CCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCC Q ss_conf 5633-3231056762100145------7875333656878999999983038021554220 Q 537021.9.peg.7 395 IDPL-RFSLLFERFLNPDRMS------MPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIIT 448 (1033) Q Consensus 395 VDPi-~~~LlFERFLnp~R~~------~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t 448 (1033) .+|+ +|+ +|+|+..-... .||. |.+..++..+ ..+++|-+.+-.|+- T Consensus 94 ~N~i~~~G--~e~F~~~~~~~GvdGlIipDL----P~ee~~~~~~-~~~~~~i~~I~lvaP 147 (246) T PRK13132 94 YNLIFAYG--LEKFVKKAKELGISGLIVPDL----PFEESEELIK-ECEKYNIALIPLISV 147 (246) T ss_pred CHHHHHCC--HHHHHHHHHHCCCCEEECCCC----CHHHHHHHHH-HHHHCCCCEEEEECC T ss_conf 10887729--999999998769985775799----9789899999-999859970144257 No 148 >cd04497 hPOT1_OB1_like hPOT1_OB1_like: A subfamily of OB folds similar to the first OB fold (OB1) of human protection of telomeres 1 protein (hPOT1), the single OB fold of the N-terminal domain of Schizosaccharomyces pombe POT1 (SpPOT1), and the first OB fold of the N-terminal domain of the alpha subunit (OB1Nalpha) of Oxytricha nova telomere end binding protein (OnTEBP). POT1 proteins recognize single-stranded (ss) 3-prime ends of the telomere. A 3-prime ss overhang is conserved in ciliated protozoa, yeast, and mammals. SpPOT1 is essential for telomere maintenance. It binds specifically to the ss G-rich telomeric sequence (GGTTAC) of S. pombe. hPOT1 binds specifically to ss telomeric DNA repeats ending with the sequence GGTTAG. Deletion of the S. pombe pot1+ gene results in a rapid loss of telomere sequences, chromosome mis-segregation and chromosome circularization. hPOT1 is implicated in telomere length regulation. The hPOT1 monomer consists of two closely connected OB folds (OB1-OB Probab=62.19 E-value=9.3 Score=16.47 Aligned_cols=33 Identities=21% Similarity=0.380 Sum_probs=14.1 Q ss_pred CCEEEEEEEEEEEEEECCCCC-CEEEEEEEECCC Q ss_conf 988999999961121015888-769999998497 Q 537021.9.peg.7 992 GNNIRLAAMVVSKQQKKTRKG-SRIGWVTFSEPA 1024 (1033) Q Consensus 992 g~~v~iaG~V~~i~~~~TKkG-~~mafltleD~~ 1024 (1033) |+.|.+.|+|+...-++-.+| +-+..++|-|.+ T Consensus 14 ~~~VnvigVV~~~~~p~~s~g~d~~~tl~I~D~S 47 (138) T cd04497 14 GGSVNVIGVVVDAGPPVRSKGTDYCCTLTITDPS 47 (138) T ss_pred CCEEEEEEEEEECCCCEECCCCEEEEEEEEECCC T ss_conf 9858899999865796446997299999999289 No 149 >COG1311 HYS2 Archaeal DNA polymerase II, small subunit/DNA polymerase delta, subunit B [DNA replication, recombination, and repair] Probab=62.00 E-value=9.3 Score=16.45 Aligned_cols=21 Identities=0% Similarity=0.078 Sum_probs=9.9 Q ss_pred HHHHHHCCCCCEEECCCHHHH Q ss_conf 999830380215542207787 Q 537021.9.peg.7 432 RYVQNKYGHERVAQIITFGSL 452 (1033) Q Consensus 432 ~y~~~kyG~~~va~i~t~~t~ 452 (1033) ++++.+|-.+++..+.+-.++ T Consensus 332 ~~~kslf~~~n~~~v~NP~~~ 352 (481) T COG1311 332 ELIKSLFSLNNLLFVSNPALV 352 (481) T ss_pred HHHCCCCCCCCEEECCCCCEE T ss_conf 120233102653761797379 No 150 >PRK08402 replication factor A; Reviewed Probab=61.78 E-value=9.4 Score=16.42 Aligned_cols=90 Identities=11% Similarity=0.200 Sum_probs=41.6 Q ss_pred CCEEEEEECHHHHHHCCCHH-HHHHHCCHHHHHHHHHHHHH--HCC-CCCCC---C--CCCCHHHHHHHHHCCCCCHHHH Q ss_conf 86688510213343112321-35654178999999998874--025-43433---2--3443588998762014101221 Q 537021.9.peg.7 552 DLPVTQFNMKWIEKAGLIKF-DFLGLKTLTLLQKSLDFLAQ--RGV-KVDLS---L--IPFDDHETYHLLTTKGTLGIFQ 622 (1033) Q Consensus 552 ~~~~~q~d~~~~e~~gl~K~-D~Lgl~~L~~i~~~~~~i~~--~~~-~~d~~---~--ip~~D~~~~~l~~~g~t~GvFQ 622 (1033) -..++-||-+..+.+.=+.. |+.-..+.++ ++ +|. ++.+. . +..+||.|-+.-..- + T Consensus 113 ~aRLvLWD~~v~~yy~~l~~GDVIKv~~~~V--------res~~g~peLhv~fr~riI~npedprVe~ipple------e 178 (357) T PRK08402 113 RARVVLWDAKVAEYYNKLEVGDVIKVLDAQV--------RESLSGLPELHINFRARIILNPEDPRVEEIPPLE------E 178 (357) T ss_pred CEEEEEECHHHHHHHHHCCCCCEEEEECHHH--------HHHCCCCHHEEECCEEEEEECCCCCCCCCCCCHH------H T ss_conf 3899983387898897537686699833134--------4311797120340135687178887634668356------5 Q ss_pred CCC-HHHHHHHHHHCCCH-HHHHHH-HHHHCCCHHH Q ss_conf 145-34788887717404-888876-6541141134 Q 537021.9.peg.7 623 LES-SGMRQALEGMQPDC-IEDIIA-LVSLYRPGPI 655 (1033) Q Consensus 623 ~es-~~~~~~l~~~~p~~-~~dl~~-~~al~RPGp~ 655 (1033) +-| +..|+-+..+++-. |-++-- +..+||+=.. T Consensus 179 Vr~~~y~RkkI~eL~~Gd~fvEvrGtIa~vYR~~~Y 214 (357) T PRK08402 179 VRSATYTRKKIKELEGGDRFVEVRGTIAKVYRVLVY 214 (357) T ss_pred HCCCHHHHHHHHHHCCCCEEEEEEEEEHHHHHHHHH T ss_conf 204414777898746786379975130867788998 No 151 >cd02071 MM_CoA_mut_B12_BD methylmalonyl CoA mutase B12 binding domain. This domain binds to B12 (adenosylcobamide), which initiates the conversion of succinyl CoA and methylmalonyl CoA by forming an adenosyl radical, which then undergoes a rearrangement exchanging a hydrogen atom with a group attached to a neighboring carbon atom. This family is present in both mammals and bacteria. Bacterial members are heterodimers and involved in the fermentation of pyruvate to propionate. Mammalian members are homodimers and responsible for the conversion of odd-chain fatty acids and branched-chain amino acids via propionyl CoA to succinyl CoA for further degradation. Probab=60.63 E-value=9.8 Score=16.27 Aligned_cols=43 Identities=9% Similarity=0.098 Sum_probs=30.4 Q ss_pred CCCHHHHHHHHHHCCCCEEEEECCC--HHHHHHHHHHHHHHCCCC Q ss_conf 2998999999997899889981165--177499999999976996 Q 537021.9.peg.7 26 ALSLNNILDKIAADQQPAIAITDTN--NLFSALEFSQKACAAGIQ 68 (1033) Q Consensus 26 ~~~~e~LV~~A~~~G~~AiAITD~~--nl~Gav~F~~ackk~gIK 68 (1033) -.+|+++++.|.+.+-.+++++-.. .+--+.++.++++++|+. T Consensus 36 ~~s~eeiv~~A~~e~ad~IglSsL~g~h~~~~~~l~~~L~e~G~~ 80 (122) T cd02071 36 RQTPEEIVEAAIQEDVDVIGLSSLSGGHMTLFPEVIELLRELGAG 80 (122) T ss_pred CCCHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHCCCC T ss_conf 889999999999739989999646554478999999999976999 No 152 >PRK00116 ruvA Holliday junction DNA helicase RuvA; Reviewed Probab=60.62 E-value=5.3 Score=18.50 Aligned_cols=16 Identities=44% Similarity=0.729 Sum_probs=9.4 Q ss_pred HHHHCCCCHHHHHHHH Q ss_conf 2685896989999999 Q 537021.9.peg.7 839 LAAIKGVGTTTARHIM 854 (1033) Q Consensus 839 l~~Ikgvg~~~~~~Iv 854 (1033) |+.|+|||.++|+.|+ T Consensus 110 L~~vpGIG~KtA~rIi 125 (198) T PRK00116 110 LTKVPGVGKKTAERIV 125 (198) T ss_pred HCCCCCCCHHHHHHHH T ss_conf 7068897889999999 No 153 >PRK00945 acetyl-CoA decarbonylase/synthase complex subunit epsilon; Provisional Probab=60.60 E-value=9.8 Score=16.26 Aligned_cols=45 Identities=16% Similarity=0.189 Sum_probs=29.7 Q ss_pred HHHHHHHHHHHHHHHCCCEEEECCCCCC------CHHHHHHHHHHHHHHCC Q ss_conf 5899999999999818989980883007------87889999876688469 Q 537021.9.peg.7 193 DRYRESQVVQLAYTHELPLVATNNSLFL------SEEDYEAHDVLMAVAHS 237 (1033) Q Consensus 193 e~~~~~~l~~lA~~~~iPlVaTndv~Y~------~~~D~~~h~iL~aI~~~ 237 (1033) |+...+.+++++++.++|++||..+.=. +++...+|.+-.+.+.- T Consensus 48 d~elle~~iki~k~~~ipi~aTg~s~~~l~e~g~~~~~i~~~~l~~~L~dp 98 (171) T PRK00945 48 DEELLDRAIKIAKAKNIPVAATGSSYKGLIEKGVKSKYIGLHELTNYLKDP 98 (171) T ss_pred CHHHHHHHHHHHHHCCCCEEEECCCCHHHHHCCCCHHHEEHHHHHHHHHCC T ss_conf 889999999999874985687445205456458871212099999885188 No 154 >pfam07393 Sec10 Exocyst complex component Sec10. This family contains the Sec10 component (approximately 650 residues long) of the eukaryotic exocyst complex, which specifically affects the synthesis and delivery of secretory and basolateral plasma membrane proteins. Probab=60.52 E-value=9.9 Score=16.25 Aligned_cols=27 Identities=22% Similarity=0.271 Sum_probs=16.3 Q ss_pred CCCCHHHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf 47998999988867672347903899999 Q 537021.9.peg.7 875 KKLNRRVLESLVFAGALDCFGYSRMQLLQ 903 (1033) Q Consensus 875 ~~~nk~~le~Li~aGAfD~~~~~R~~ll~ 903 (1033) -.+.-..+..++..|++-.+ +|..+.. T Consensus 660 f~V~pe~l~~l~~~~~l~~~--~~~~v~~ 686 (703) T pfam07393 660 FLVQPDLLKELATEGALANL--DRELIRE 686 (703) T ss_pred EEECHHHHHHHHHHHHHCCC--CHHHHHH T ss_conf 25779899999753111269--9999999 No 155 >TIGR02236 recomb_radA DNA repair and recombination protein RadA; InterPro: IPR011938 This family consists exclusively of archaeal RadA protein, a homolog of bacterial RecA, eukaryotic RAD51 (IPR011941 from INTERPRO), and archaeal RadB (IPR011939 from INTERPRO). This protein is involved in DNA repair and recombination. The member from Pyrococcus horikoshii contains an intein .; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0008094 DNA-dependent ATPase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=60.38 E-value=9.9 Score=16.23 Aligned_cols=75 Identities=27% Similarity=0.443 Sum_probs=49.1 Q ss_pred HHHHCCCCHHHHHHHHH-HHH--C-CCCCCHHHHHHH-CCCCCC--CHHHHHHHHHCC--------CCCCCCCCHHHHHH Q ss_conf 26858969899999999-983--7-999997899973-482479--989999888676--------72347903899999 Q 537021.9.peg.7 839 LAAIKGVGTTTARHIME-ASA--D-KPFDSLEDFCSR-VDSKKL--NRRVLESLVFAG--------ALDCFGYSRMQLLQ 903 (1033) Q Consensus 839 l~~Ikgvg~~~~~~Ive-~r~--~-g~f~sl~df~~r-v~~~~~--nk~~le~Li~aG--------AfD~~~~~R~~ll~ 903 (1033) |+.+.|+|+.+|..||+ +|+ + |.|.+-.+++.| -+.+++ +.+.|..|+--| -|-.||.-..|+.+ T Consensus 35 L~~~~gI~E~~A~kiI~AAR~a~~~~~F~~a~~vl~rR~~v~kitTgs~~LDeLLGGGiETqaiTE~~GEFGSGKTQ~~H 114 (333) T TIGR02236 35 LSEIAGIGEGTAAKIIQAARKAADLGGFETADDVLERRKSVGKITTGSKELDELLGGGIETQAITEVFGEFGSGKTQICH 114 (333) T ss_pred HHHHCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCEEECCCHHHHHHCCCCCCCEEEEEEEEEECCCHHHHHH T ss_conf 53203787778999999999984676721489999987207504448636765607960000588877100786578766 Q ss_pred HHHHHHHHHH Q ss_conf 8999999999 Q 537021.9.peg.7 904 SLDNIQKYAQ 913 (1033) Q Consensus 904 ~i~~~~~~~~ 913 (1033) .+--.+|... T Consensus 115 qLAVnVQlP~ 124 (333) T TIGR02236 115 QLAVNVQLPE 124 (333) T ss_pred HHHHHHCCCC T ss_conf 7655322871 No 156 >COG3835 CdaR Sugar diacid utilization regulator [Transcription / Signal transduction mechanisms] Probab=60.20 E-value=10 Score=16.21 Aligned_cols=16 Identities=13% Similarity=0.501 Sum_probs=8.5 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 6899999999989973 Q 537021.9.peg.7 106 AEGYQRLIELVSRMYL 121 (1033) Q Consensus 106 ~~Gy~nL~kL~S~a~~ 121 (1033) =.-|.+|.|+.+..-. T Consensus 98 Vr~y~ELVrm~AElli 113 (376) T COG3835 98 VRKYGELVRMTAELLI 113 (376) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 8999999999999999 No 157 >PRK13114 consensus Probab=60.01 E-value=10 Score=16.19 Aligned_cols=31 Identities=19% Similarity=0.279 Sum_probs=12.2 Q ss_pred CCEEEEECCCHH--HHHHHHHHHHHHCCCCEEE Q ss_conf 988998116517--7499999999976996389 Q 537021.9.peg.7 41 QPAIAITDTNNL--FSALEFSQKACAAGIQPII 71 (1033) Q Consensus 41 ~~AiAITD~~nl--~Gav~F~~ackk~gIKPIi 71 (1033) .|-+-+|-.|.+ +|.-+|.+.|+++|+.-+| T Consensus 91 ~PivlM~Y~N~i~~~G~~~F~~~~~~aGvdG~I 123 (266) T PRK13114 91 VPLVLMGYANPMVRRGPDWFAAECKKAGVDGVI 123 (266) T ss_pred CCEEEEEEHHHHHHHHHHHHHHHHHHCCCCEEE T ss_conf 887998630199986499999999974997798 No 158 >PRK09482 xni exonuclease IX; Provisional Probab=59.93 E-value=10 Score=16.17 Aligned_cols=42 Identities=24% Similarity=0.496 Sum_probs=27.3 Q ss_pred EHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 0268589698999999999837999997899973482479989999888 Q 537021.9.peg.7 838 SLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLV 886 (1033) Q Consensus 838 Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li 886 (1033) ++-.|+|||+++|..+++ .|.|++..+..++. ++++.-+.|. T Consensus 183 NIPGV~GIG~KtA~~LL~-----~fgsle~i~~n~d~--i~~k~~~~l~ 224 (256) T PRK09482 183 KIPGVAGIGPKSAAELLN-----QFRSLEGIYESLDA--LPEKWRKKLE 224 (256) T ss_pred CCCCCCCCCHHHHHHHHH-----HHCCHHHHHHHHHH--CCHHHHHHHH T ss_conf 899999858889999999-----85509999983775--5689999999 No 159 >PRK10353 3-methyl-adenine DNA glycosylase I; Provisional Probab=59.55 E-value=10 Score=16.13 Aligned_cols=144 Identities=12% Similarity=0.105 Sum_probs=68.5 Q ss_pred CCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHH----HHCCCHHHH----HHHHHHHCCCCCCHHHHHHHHHHHHHHHC-C Q ss_conf 85026966733410335530012568988788----632684130----78897620121101677878999998624-9 Q 537021.9.peg.7 670 KIVSIHPLIDPMLKETQGVIIYQEQVMQIAQL----LSGYSLSEA----DVLRRAMGKKIKEEMDKQKERFISGASKN-G 740 (1033) Q Consensus 670 ~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~----~ag~~~~ea----d~lRra~~kk~~~~~~~~~~~f~~g~~~~-g 740 (1033) +|..-+|.+...=.+-.|++++-+|.+-=... -||+|+.-. +.+|+|+.-=+++.+..+.++=++..+.+ | T Consensus 5 ~W~~~~~l~~~YHD~EWG~P~~dd~~LFE~L~LE~~QaGLSW~tiL~KRe~fr~AF~~Fd~~~VA~~~e~die~Ll~n~~ 84 (189) T PRK10353 5 GWVSQDPLYIAYHDNEWGVPETDSKKLFEMICLEGQQAGLSWITVLKKRENYRACFHQFDPVRVAAMQEEDVERLVQDAG 84 (189) T ss_pred CCCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHHCCCC T ss_conf 89999817898860468985888799999999998524167999999899999998089989996389999998854621 Q ss_pred C--CH-------HHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH Q ss_conf 9--99-------99999999999750001331014788899999999997302666777510023542025899988886 Q 537021.9.peg.7 741 I--SK-------TIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVEKIKKFCQDARQ 811 (1033) Q Consensus 741 ~--~~-------~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~~~~~~i~e~~~ 811 (1033) + .+ .-|..+-++.+.|++ |. .|-=.|. -|-|..--...+...+..+..-..+-+++|+ T Consensus 85 IIRNr~KI~AvI~NA~~~l~i~~e~gs--f~----------~ylW~fv-~~~p~~~~~~~~~~vP~~t~lS~~iskdLKk 151 (189) T PRK10353 85 IIRHRGKIQAIIGNARAYLQMEQNGEP--FA----------DFVWSFV-NHQPQVTQATTLSEIPTSTPASDALSKALKK 151 (189) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCC--HH----------HHHHHCC-CCCCCCCCCCCHHHCCCCCHHHHHHHHHHHH T ss_conf 246189999999999999999985599--99----------9998727-9988768845664367888879999999997 Q ss_pred CCCCCCCCCCCCCCE Q ss_conf 368503552334210 Q 537021.9.peg.7 812 FNIQIMPPSVNTPCV 826 (1033) Q Consensus 812 ~gi~vl~PdIN~S~~ 826 (1033) +|++..+|-|=.|.. T Consensus 152 rGFkFVGpt~~ysfm 166 (189) T PRK10353 152 RGFKFVGTTICYSFM 166 (189) T ss_pred CCCEEECHHHHHHHH T ss_conf 889361679999999 No 160 >PRK08654 pyruvate carboxylase subunit A; Validated Probab=59.15 E-value=10 Score=16.07 Aligned_cols=68 Identities=10% Similarity=0.090 Sum_probs=35.2 Q ss_pred CEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEE--ECCHHHHHHHHHHHHHH Q ss_conf 8899811651774999999999769963899999985476643344565666677539999--56968999999999899 Q 537021.9.peg.7 42 PAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLL--VSTAEGYQRLIELVSRM 119 (1033) Q Consensus 42 ~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlL--AkN~~Gy~nL~kL~S~a 119 (1033) +-|-|+.+|- -++|..++|++.||+.+.= |-..+.... .. ...-+.+.+ +...+-|-|.-++++.+ T Consensus 3 ~kvLIANRGE--IA~RiiRa~relGi~tVaV---yS~~D~~s~----hv---~~ADe~~~ig~~~~~~sYLni~~Ii~~A 70 (497) T PRK08654 3 KKILIANRGE--IAIRVMRACRELGIKTVAV---YSEADKNAL----FV---KYADEAYPIGPAPPSKSYLNMERILEVA 70 (497) T ss_pred CEEEEECCCH--HHHHHHHHHHHCCCEEEEE---CCHHHHCCC----CH---HHCCEEEECCCCCCCCCCCCHHHHHHHH T ss_conf 6089987879--9999999999859939998---883785489----97---8889899848988321216899999999 Q ss_pred HH Q ss_conf 73 Q 537021.9.peg.7 120 YL 121 (1033) Q Consensus 120 ~~ 121 (1033) .. T Consensus 71 ~~ 72 (497) T PRK08654 71 KK 72 (497) T ss_pred HH T ss_conf 98 No 161 >CHL00041 rps11 ribosomal protein S11 Probab=58.50 E-value=5.7 Score=18.27 Aligned_cols=42 Identities=17% Similarity=0.309 Sum_probs=19.7 Q ss_pred HHHHHHHHHCCCCC--------CCCCCCCHHHHHHHH---HHHCCCCCHHHCC Q ss_conf 89999985104764--------334444204667765---5440375633323 Q 537021.9.peg.7 360 VADFIQWAKKNNIP--------VGPGRGSGAGSVTAY---ALTITDIDPLRFS 401 (1033) Q Consensus 360 V~Div~~ak~~gI~--------vGpGRGSAagSLVaY---~LgIT~VDPi~~~ 401 (1033) ...+.+.|++.||- .||||=+|--+|..- .+-|+|+-|+.|| T Consensus 62 a~~~~~~a~~~Gi~~v~v~vkG~G~GR~~aik~l~~~glkI~~I~D~TpiphN 114 (116) T CHL00041 62 AENAIRTVIDQGMKRAEVMIKGPGLGRDTALRAIRRSGLKLSSIRDVTPMPHN 114 (116) T ss_pred HHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHCCCEEEEEEECCCCCCC T ss_conf 99999999973984999999788845899999998789889999989999999 No 162 >PRK11805 N5-glutamine S-adenosyl-L-methionine-dependent methyltransferase; Provisional Probab=58.42 E-value=11 Score=15.98 Aligned_cols=75 Identities=15% Similarity=0.278 Sum_probs=45.2 Q ss_pred HHHHHHHHHHHC----CCCCCCCCCCCH---HHHHHHHHHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHH Q ss_conf 888999998510----476433444420---4667765544037563332310567621001457875333656878999 Q 537021.9.peg.7 358 LIVADFIQWAKK----NNIPVGPGRGSG---AGSVTAYALTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFCQDRRDEV 430 (1033) Q Consensus 358 LiV~Div~~ak~----~gI~vGpGRGSA---agSLVaY~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~v 430 (1033) -++.|+++||.+ .||..|.|.=|+ |--|||.+||++.... -..+++-|+ .+.+..+ T Consensus 14 ~Ti~~~L~~a~~~~~~a~i~~ghG~ds~~~eA~~Ll~~~L~~~r~~l---~~~~d~~l~--------------~~e~~~~ 76 (307) T PRK11805 14 HTIRDLLRWAVSRFNAAGLFFGHGTDNAWDEAVQLVLHALHLPLDIP---EPFLDARLT--------------PSEKERI 76 (307) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCHHHH---HHHHCCCCC--------------HHHHHHH T ss_conf 99999999999999967987678999999999999999859695566---774158999--------------9999999 Q ss_pred HHHHHHHC-CCCCEEECCCH Q ss_conf 99998303-80215542207 Q 537021.9.peg.7 431 IRYVQNKY-GHERVAQIITF 449 (1033) Q Consensus 431 i~y~~~ky-G~~~va~i~t~ 449 (1033) .+.+.+|- ++.=++.|... T Consensus 77 ~~lv~rR~~~r~PlaYI~G~ 96 (307) T PRK11805 77 LELIERRINERIPAAYLTNE 96 (307) T ss_pred HHHHHHHHHCCCCHHHHHCC T ss_conf 99999998379858999487 No 163 >pfam03934 GspK General secretion pathway protein K. Members of this family are involved in the general secretion pathway. The family includes proteins such as ExeK, PulK, OutX and XcpX. Probab=58.17 E-value=11 Score=15.95 Aligned_cols=32 Identities=22% Similarity=0.478 Sum_probs=27.5 Q ss_pred HCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCC Q ss_conf 58969899999999983799999789997348 Q 537021.9.peg.7 842 IKGVGTTTARHIMEASADKPFDSLEDFCSRVD 873 (1033) Q Consensus 842 Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~ 873 (1033) +.|++...++.|++.|..++|+|..||..+.. T Consensus 200 ~~~ls~~~A~~li~~R~~~gf~~v~df~~~~~ 231 (282) T pfam03934 200 FEGLSLSQAQRLLAARPADGFESVDDFLALPA 231 (282) T ss_pred CCCCCHHHHHHHHHHCCCCCCCCHHHHHHCHH T ss_conf 47999999999998375557999999972622 No 164 >pfam08732 HIM1 HIM1. HIM1 (high induction of mutagenesis protein 1) plays a role in the control of spontaneous and induced mutagenesis. It is thought to participate in the control of processing of mutational intermediates appearing during error-prone bypass of DNA damage. Probab=57.86 E-value=8.1 Score=16.96 Aligned_cols=49 Identities=16% Similarity=0.256 Sum_probs=28.9 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH Q ss_conf 778789999986249999999999999997500013310147888999999999 Q 537021.9.peg.7 726 DKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWM 779 (1033) Q Consensus 726 ~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~ayl 779 (1033) -..+..|++..++-|+..+..|-+-. + .|--+-|-=++|++-|-..||. T Consensus 321 l~~k~~~l~e~r~vG~~tK~SElvAk----~-~Yr~PGs~llGY~ipA~KVA~v 369 (400) T pfam08732 321 LHHYVRRMNEWKDVGIATKTSELVAQ----F-SYRMPGSFLVGYPIPALKVAYV 369 (400) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHH----H-HHHCCCCHHCCCCCHHHHHHHH T ss_conf 99999999999861741008999999----9-8618870220143338889999 No 165 >PRK13134 consensus Probab=57.54 E-value=11 Score=15.87 Aligned_cols=45 Identities=20% Similarity=0.092 Sum_probs=29.7 Q ss_pred HCCCHHHHHHHHHHCCCCEEE----EECCCHHHHHHHHHHHHHHCCCCEE Q ss_conf 329989999999978998899----8116517749999999997699638 Q 537021.9.peg.7 25 GALSLNNILDKIAADQQPAIA----ITDTNNLFSALEFSQKACAAGIQPI 70 (1033) Q Consensus 25 g~~~~e~LV~~A~~~G~~AiA----ITD~~nl~Gav~F~~ackk~gIKPI 70 (1033) +.+++++.++.+++.|-+|+. .-|. ++-...++.++..+.|.--| T Consensus 2 ~~sri~~~f~~~k~~~~~ali~yitaG~P-~~e~s~~~i~~l~~~GaDii 50 (257) T PRK13134 2 SASRLERRIREAQAAGRPALIPFLTAGFP-TSERFWDELEALDAAGADII 50 (257) T ss_pred CCCHHHHHHHHHHHCCCCEEEEEEECCCC-CHHHHHHHHHHHHHCCCCEE T ss_conf 72389999999998599579988707079-97999999999997799999 No 166 >COG1154 Dxs Deoxyxylulose-5-phosphate synthase [Coenzyme metabolism / Lipid metabolism] Probab=56.99 E-value=11 Score=15.81 Aligned_cols=34 Identities=9% Similarity=0.141 Sum_probs=16.0 Q ss_pred HHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 999999830380215542207787788999999975279988988654 Q 537021.9.peg.7 429 EVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLC 476 (1033) Q Consensus 429 ~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~ 476 (1033) .+++.+++.=| --+-.|.|- |. -|++++|-|... T Consensus 260 ~~Lk~~kd~~g-PvllHv~T~---KG----------KGY~pAE~d~~~ 293 (627) T COG1154 260 PTLKNAKDLKG-PVLLHVVTK---KG----------KGYKPAEEDPIK 293 (627) T ss_pred HHHHHHHCCCC-CEEEEEEEC---CC----------CCCCHHHCCHHH T ss_conf 99999853799-889999965---88----------888823549653 No 167 >pfam01989 DUF126 Protein of unknown function DUF126. This archaebacterial protein family has no known function. Probab=56.92 E-value=5.9 Score=18.13 Aligned_cols=27 Identities=37% Similarity=0.695 Sum_probs=22.4 Q ss_pred CCCCHHHHHHHHHHHCCCCCHHHCCCHHH Q ss_conf 44420466776554403756333231056 Q 537021.9.peg.7 377 GRGSGAGSVTAYALTITDIDPLRFSLLFE 405 (1033) Q Consensus 377 GRGSAagSLVaY~LgIT~VDPi~~~LlFE 405 (1033) ||||.+||.|-|-|-....-|. .++|+ T Consensus 37 ~kGSt~gs~vL~~l~~~g~aP~--AiI~~ 63 (83) T pfam01989 37 GKGSTVGSYVLYELAKNGTAPA--AIVFE 63 (83) T ss_pred CCCCCHHHHHHHHHHHCCCCCH--HHHHC T ss_conf 9876468999999998699974--12224 No 168 >TIGR02706 P_butyryltrans phosphate butyryltransferase; InterPro: IPR014079 Members of this family are phosphate butyryltransferase enzymes, also called phosphotransbutyrylase. In general, this enzyme is found in butyrate-producing anaerobic bacteria, encoded next to the gene for butyrate kinase. Together, these two enzymes represent what may be the less common of two pathways for butyrate production from butyryl-CoA. The alternative is transfer of the CoA group to acetate by butyryl-CoA:acetate CoA transferase. This entry does not include the homologue from Thermotoga maritima, whose activity on butyryl-CoA is only 30 0f its activity with acetyl-CoA.. Probab=56.39 E-value=11 Score=15.88 Aligned_cols=85 Identities=15% Similarity=0.131 Sum_probs=46.4 Q ss_pred HHCCCCCCCCCCCCCC-CCCCCCCHHHHHHHCCHHH---HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHH Q ss_conf 8469857653100245-3432343478875122035---89987886411100013322102345557568877689889 Q 537021.9.peg.7 234 VAHSTVVSQKDRPRVT-PDHYLKNRSEMVSIFSDLP---EALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEEN 309 (1033) Q Consensus 234 I~~~~~l~~~~~~~~~-~~~yLks~eEM~~~f~~~p---eAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~ 309 (1033) .|+|+++++--.|... -++-|-=+|---..|+++. +.|+|+..+|..|-.+.|.-.+.=+..-+-+..+. .=... T Consensus 112 LRTGk~lSHVAVFe~P~~dRLl~lTDaAfN~yP~Lk~K~~ii~NaV~VAha~Gi~~PkVA~l~AVEvVNP~M~a-TvDAA 190 (295) T TIGR02706 112 LRTGKVLSHVAVFEVPGFDRLLFLTDAAFNIYPELKDKVQIINNAVKVAHAIGIEVPKVAVLAAVEVVNPKMPA-TVDAA 190 (295) T ss_pred CCCCCCCCCEEEECCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEECCEEEECCCCHH-HHHHH T ss_conf 10487122067715898551432312665028787899999764755023217886861153026356688715-79999 Q ss_pred HHHHHHHHHH Q ss_conf 9987654103 Q 537021.9.peg.7 310 ELRNKAVAGL 319 (1033) Q Consensus 310 ~Lr~l~~~Gl 319 (1033) .|.++...|- T Consensus 191 ~LakM~~RGQ 200 (295) T TIGR02706 191 ALAKMSDRGQ 200 (295) T ss_pred HHHHHCCCCC T ss_conf 9840020684 No 169 >COG0632 RuvA Holliday junction resolvasome, DNA-binding subunit [DNA replication, recombination, and repair] Probab=56.32 E-value=10 Score=16.15 Aligned_cols=68 Identities=24% Similarity=0.320 Sum_probs=33.8 Q ss_pred EHHHHCCCCHHHHHHHHHHHHCCCCCCHH--HHHHH-CCCCCCCHHHHHHHHH-CCCCCCCCCCHHHHHHHHHHH Q ss_conf 02685896989999999998379999978--99973-4824799899998886-767234790389999989999 Q 537021.9.peg.7 838 SLAAIKGVGTTTARHIMEASADKPFDSLE--DFCSR-VDSKKLNRRVLESLVF-AGALDCFGYSRMQLLQSLDNI 908 (1033) Q Consensus 838 Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~--df~~r-v~~~~~nk~~le~Li~-aGAfD~~~~~R~~ll~~i~~~ 908 (1033) -|+.++|||.++|+.|+-+-+ +.+..+. +...- .+.. .... ...+.. ..|+-.+|.++.++-..+..+ T Consensus 109 ~L~k~PGIGkKtAerivleLk-~K~~~~~~~~~~~~~~~~~-~~~~-~~~~~~~v~AL~~LGy~~~e~~~av~~v 180 (201) T COG0632 109 ALSKIPGIGKKTAERIVLELK-GKLAAFLKGDGGSPAEDLS-LDES-SPALEEAVEALVALGYKEKEIKKAVKKV 180 (201) T ss_pred HHHCCCCCCHHHHHHHHHHHH-HHHHHHCCCCCCCCCCCCC-CCCC-CHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 644189877889999999976-0564113211355323456-5655-1235689999998499999999999998 No 170 >pfam04007 DUF354 Protein of unknown function (DUF354). Members of this family are around 350 amino acids in length. They are found in archaebacteria and have no known function. Probab=56.13 E-value=11 Score=15.78 Aligned_cols=18 Identities=11% Similarity=0.351 Sum_probs=11.6 Q ss_pred HHHHHHHHCCCEEEECCC Q ss_conf 999999818989980883 Q 537021.9.peg.7 200 VVQLAYTHELPLVATNNS 217 (1033) Q Consensus 200 l~~lA~~~~iPlVaTndv 217 (1033) +...|.-+|+|.|+.+|. T Consensus 95 a~~va~~LgipsI~f~Dt 112 (335) T pfam04007 95 LPRVAFGLRIPSIIVLDN 112 (335) T ss_pred HHHHHHHCCCCEEEEECC T ss_conf 999998829987999477 No 171 >COG1570 XseA Exonuclease VII, large subunit [DNA replication, recombination, and repair] Probab=55.97 E-value=12 Score=15.68 Aligned_cols=23 Identities=17% Similarity=-0.013 Sum_probs=15.4 Q ss_pred CCCEEEEEECCHHHHHHHHHHHH Q ss_conf 77539999569689999999998 Q 537021.9.peg.7 95 TVPSIVLLVSTAEGYQRLIELVS 117 (1033) Q Consensus 95 ~~~~lvlLAkN~~Gy~nL~kL~S 117 (1033) +.|.+++-..-..|-.+|....- T Consensus 89 G~YQi~~~~~~p~G~G~L~~~~E 111 (440) T COG1570 89 GDYQIVAESMEPAGLGALYLAFE 111 (440) T ss_pred CCEEEEEECCCCCCHHHHHHHHH T ss_conf 85699983278287258999999 No 172 >cd01312 Met_dep_hydrolase_D Metallo-dependent hydrolases, subgroup D is part of the superfamily of metallo-dependent hydrolases, a large group of proteins that show conservation in their 3-dimensional fold (TIM barrel) and in details of their active site. The vast majority of the members have a conserved metal binding site, involving four histidines and one aspartic acid residue. In the common reaction mechanism, the metal ion (or ions) deprotonate a water molecule for a nucleophilic attack on the substrate. The function of this subgroup is unknown. Probab=55.75 E-value=7.9 Score=17.08 Aligned_cols=62 Identities=8% Similarity=0.054 Sum_probs=38.7 Q ss_pred CCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEE Q ss_conf 88133433144476443299899999999789988998116517749999999997699638 Q 537021.9.peg.7 9 PSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPI 70 (1033) Q Consensus 9 ~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPI 70 (1033) |-||..|+|..+|.+.|.....++..--...--..=+++....-.++-.....+.+.|+..+ T Consensus 31 PGfVnaH~H~~~~~~~~~~~~~~l~~wl~~~~~~~~~~~~~~~~~~~~~~~~e~l~~G~Ttv 92 (381) T cd01312 31 PGLINAHTHLEFSANVAQFTYGRFRAWLLSVINSRDELLKQPWEEAIRQGIRQMLESGTTSI 92 (381) T ss_pred ECCCCHHHCHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCEE T ss_conf 58721504854464551158886999998734778741879999999999999998596166 No 173 >TIGR00593 pola DNA polymerase I; InterPro: IPR002298 DNA carries the biological information that instructs cells how to exist in an ordered fashion. Accurate replication is thus one of the most important events in the cell life cycle. This function is mediated by DNA-directed DNA polymerases, which add nucleotide triphosphate (dNTP) residues to the 5'-end of the growing DNA chain, using a complementary DNA as template. Small RNA molecules are generally used as primers for chain elongation, although terminal proteins may also be used. DNA-dependent DNA polymerases have been grouped into families, denoted A, B and X, on the basis of sequence similarities , . Members of family A, which includes bacterial and bacteriophage polymerases, share significant similarity to Escherichia coli polymerase I; hence family A is also known as the pol I family. The bacterial polymerases also contain an exonuclease activity, which is coded for in the N-terminal portion. Three motifs, A, B and C , are seen to be conserved across all DNA polymerases, with motifs A and C also seen in RNA polymerases. They are centred on invariant residues, and their structural significance was implied from the Klenow (E. coli) structure. Motif A contains a strictly-conserved aspartate at the junction of a beta-strand and an alpha-helix; motif B contains an alpha-helix with positive charges; and motif C has a doublet of negative charges, located in a beta-turn-beta secondary structure .; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=55.70 E-value=12 Score=15.65 Aligned_cols=65 Identities=22% Similarity=0.279 Sum_probs=38.5 Q ss_pred HHHHHHHHHHHHHCCCCCCCEEEECCCCHHHH------EECCCCCCCCCEEEEEECHHHHHH----CC-CHHHHHHHCCH Q ss_conf 88987201122100244744033525666662------001236768866885102133431----12-32135654178 Q 537021.9.peg.7 511 LLEISQKLEGLYRHASTHAAGIVIGDRPLSQL------VPMYRDVRSDLPVTQFNMKWIEKA----GL-IKFDFLGLKTL 579 (1033) Q Consensus 511 ~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~------~p~~~~~~~~~~~~q~d~~~~e~~----gl-~K~D~Lgl~~L 579 (1033) +-+-+-.+-|..=|++ ||+.|.++ .|..+..+.|.+.| |.+.+|++ .. |--.+|--|.| T Consensus 639 lE~~Iy~~aG~~FNin--------SPKQL~~vLF~KL~Lp~~kK~KtG~YST--d~~vLe~L~~~h~~~i~~~Ll~yR~L 708 (1005) T TIGR00593 639 LEEEIYELAGEEFNIN--------SPKQLGEVLFEKLGLPVGKKTKTGSYST--DADVLEKLAEKHPIHIALILLEYRQL 708 (1005) T ss_pred HHHHHHHHCCCCCCCC--------CHHHHHHHHHHHHCCCCCCCCCCCCCEE--CHHHHHHHHHCCCCCHHHHHHHHHHH T ss_conf 9999998749520688--------7335558869984788200346786213--38899988742888368999876666 Q ss_pred HHHHHH Q ss_conf 999999 Q 537021.9.peg.7 580 TLLQKS 585 (1033) Q Consensus 580 ~~i~~~ 585 (1033) |-++.| T Consensus 709 ~KLksT 714 (1005) T TIGR00593 709 TKLKST 714 (1005) T ss_pred HHHHHC T ss_conf 543101 No 174 >TIGR01011 rpsB_bact ribosomal protein S2; InterPro: IPR005706 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . This family describes the bacterial, mitochondrial and chloroplast forms of ribosomal protein S2.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015935 small ribosomal subunit. Probab=55.66 E-value=9.6 Score=16.36 Aligned_cols=14 Identities=29% Similarity=0.444 Sum_probs=4.8 Q ss_pred HHHHHHHCCCCCCC Q ss_conf 99988867672347 Q 537021.9.peg.7 881 VLESLVFAGALDCF 894 (1033) Q Consensus 881 ~le~Li~aGAfD~~ 894 (1033) .||..-..|.||.+ T Consensus 116 ~lE~~~~~g~~~~L 129 (227) T TIGR01011 116 KLEKMEEDGTFDVL 129 (227) T ss_pred HHHHHHHHCCCCCC T ss_conf 99987761331247 No 175 >COG0100 RpsK Ribosomal protein S11 [Translation, ribosomal structure and biogenesis] Probab=55.58 E-value=7.8 Score=17.13 Aligned_cols=27 Identities=33% Similarity=0.647 Sum_probs=21.6 Q ss_pred CCCCCCHHHHHHHHHHHCCC---CCHHHCC Q ss_conf 34444204667765544037---5633323 Q 537021.9.peg.7 375 GPGRGSGAGSVTAYALTITD---IDPLRFS 401 (1033) Q Consensus 375 GpGRGSAagSLVaY~LgIT~---VDPi~~~ 401 (1033) ||||=+|-.+|.+=-|.||. |-||.|| T Consensus 90 G~GreaAiraL~~ag~~i~~I~DvTPiphn 119 (129) T COG0100 90 GPGREAAIRALAAAGLKITRIEDVTPIPHN 119 (129) T ss_pred CCCHHHHHHHHHHCCCEEEEEEECCCCCCC T ss_conf 975899999998736558788886877889 No 176 >TIGR03628 arch_S11P archaeal ribosomal protein S11P. This model describes exclusively the archaeal ribosomal protein S11P. It excludes homologous ribosomal proteins S14 from eukaryotes and S11 from bacteria. Probab=55.18 E-value=6.7 Score=17.69 Aligned_cols=27 Identities=26% Similarity=0.564 Sum_probs=13.3 Q ss_pred CCCCCCHHHHHHHHHHH---CCCCCHHHCC Q ss_conf 34444204667765544---0375633323 Q 537021.9.peg.7 375 GPGRGSGAGSVTAYALT---ITDIDPLRFS 401 (1033) Q Consensus 375 GpGRGSAagSLVaY~Lg---IT~VDPi~~~ 401 (1033) ||||=||--+|..==|. |.||-||.|| T Consensus 83 GpGresAiral~~~Gl~I~~I~DvTPiPHN 112 (114) T TIGR03628 83 GPGAQAAIRALARAGLRIGRIEDVTPIPHD 112 (114) T ss_pred CCCHHHHHHHHHHCCCEEEEEEECCCCCCC T ss_conf 862899999998689799998987999989 No 177 >PRK12366 replication factor A; Reviewed Probab=55.17 E-value=12 Score=15.58 Aligned_cols=14 Identities=14% Similarity=0.273 Sum_probs=5.7 Q ss_pred CCEEEECCCCHHHH Q ss_conf 44033525666662 Q 537021.9.peg.7 529 AAGIVIGDRPLSQL 542 (1033) Q Consensus 529 a~Gvvi~~~~l~~~ 542 (1033) --|.|++..|+.++ T Consensus 189 i~G~V~~~~~irtF 202 (649) T PRK12366 189 INGEITKAYPKKEF 202 (649) T ss_pred EEEEEEECCCCEEE T ss_conf 99999643686353 No 178 >TIGR00315 cdhB CO dehydrogenase/acetyl-CoA synthase complex, epsilon subunit; InterPro: IPR003704 Carbon monoxide dehydrogenase (Cdh) from Methanosarcina frisia Go1 is a Ni2+-, Fe2+-, and S2-containing alpha2beta2 heterotetramer . The CO dehydrogenase enzyme complex from Methanosarcina thermophila contains a corrinoid/iron-sulphur enzyme composed of two subunits (delta and gamma) . This family consists of carbon monoxide dehydrogenase I/II beta subunit 1.2.99.2 from EC and CO dehydrogenase (acetyl-CoA synthase epsilon subunit).; GO: 0003824 catalytic activity, 0008152 metabolic process. Probab=55.04 E-value=7.9 Score=17.04 Aligned_cols=27 Identities=30% Similarity=0.375 Sum_probs=21.8 Q ss_pred HHHHHHHHHHHHHHHCCCEEEECCCCC Q ss_conf 589999999999981898998088300 Q 537021.9.peg.7 193 DRYRESQVVQLAYTHELPLVATNNSLF 219 (1033) Q Consensus 193 e~~~~~~l~~lA~~~~iPlVaTndv~Y 219 (1033) |+...+.+.++.+++|||+|||.+..- T Consensus 42 ~Ek~~E~~~K~iekF~l~vvaTa~~~~ 68 (170) T TIGR00315 42 DEKLLELVVKFIEKFDLPVVATADAIK 68 (170) T ss_pred CCHHHHHHHHHHHHCCCCEEECCHHHH T ss_conf 121345555556432886474731138 No 179 >PRK13138 consensus Probab=54.78 E-value=12 Score=15.54 Aligned_cols=16 Identities=13% Similarity=0.146 Sum_probs=6.5 Q ss_pred HHHHHHHHHHHHHCCC Q ss_conf 7749999999997699 Q 537021.9.peg.7 52 LFSALEFSQKACAAGI 67 (1033) Q Consensus 52 l~Gav~F~~ackk~gI 67 (1033) +-...++.++..++|. T Consensus 26 ~e~t~~~~~~l~~~Ga 41 (264) T PRK13138 26 YDSCIIWADALIRGGA 41 (264) T ss_pred HHHHHHHHHHHHHCCC T ss_conf 8999999999997799 No 180 >TIGR00617 rpa1 replication factor-a protein 1 (rpa1); InterPro: IPR004591 All proteins in this family for which functions are known are part of a multiprotein complex made up of homologs of RPA1, RPA2 and RPA3 that bind single-stranded DNA and function in the recognition of DNA damage for nucleotide excision repair. The complex binds to single-stranded DNA sequences participating in DNA replication in addition to those mediating transcriptional repression and activation, and stimulates the activity of cognate strand exchange protein Sep1. It cooperates with T-AG and DNA topoisomerase I to unwind template DNA containing the Simian virus 40 origin of replication. This family is Replication factor-a protein 1 (RPA1); GO: 0003677 DNA binding, 0006260 DNA replication, 0005634 nucleus. Probab=54.15 E-value=12 Score=15.46 Aligned_cols=54 Identities=17% Similarity=0.303 Sum_probs=29.4 Q ss_pred HHHHHHHHHHCCC-CEE-EEEECCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHH Q ss_conf 9999999876598-689-9994399115899999999999818989980883007878 Q 537021.9.peg.7 168 EKRLLTFKKLFGD-RLY-VNLQRHRGYDRYRESQVVQLAYTHELPLVATNNSLFLSEE 223 (1033) Q Consensus 168 ~~~l~~l~~~Fgd-~~y-lEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~ 223 (1033) +..+..|.+-=|+ -+| +||.+-.+.| .+.-..-+.|+++- |++--+.|||..+. T Consensus 236 K~~irtw~N~RGEGKlFn~~L~D~~sge-IRaTaFNe~ad~FY-~~i~~g~vYYiSkg 291 (671) T TIGR00617 236 KSEIRTWSNARGEGKLFNVELLDSESGE-IRATAFNEQADKFY-DLIQVGKVYYISKG 291 (671) T ss_pred HHCCCCCCCCCCCCEEEEEEEECCCCCE-EEEECHHHHHHHHC-CEEEECCEEEEECC T ss_conf 0013302378888106555747179784-67742478875321-32433678799737 No 181 >cd01356 AcnX_swivel Putative Aconitase X swivel domain. It is predicted by comparative genomic analysis. The proteins are mainly found in archaea and proteobacteria. They are distantly related to Aconitase family of proteins by sequence similarity and seconary structure prediction. The functions have not yet been experimentally characterized. Thus, the prediction should be treated with caution. Probab=53.38 E-value=6.3 Score=17.89 Aligned_cols=27 Identities=37% Similarity=0.692 Sum_probs=19.5 Q ss_pred CCCCHHHHHHHHHHHCCCCCHHHCCCHHH Q ss_conf 44420466776554403756333231056 Q 537021.9.peg.7 377 GRGSGAGSVTAYALTITDIDPLRFSLLFE 405 (1033) Q Consensus 377 GRGSAagSLVaY~LgIT~VDPi~~~LlFE 405 (1033) ||||.+||.|-|.|--...-|. .++|+ T Consensus 52 ~kGSt~gs~vL~~l~~~g~aP~--Aii~~ 78 (123) T cd01356 52 GKGSTVGSYVLYELARNGTAPA--AIVFE 78 (123) T ss_pred CCCCCHHHHHHHHHHHCCCCCE--EEEEC T ss_conf 9866478999999998699971--54305 No 182 >TIGR02002 PTS-II-BC-glcB PTS system, glucose-specific IIBC component; InterPro: IPR011299 This entry represents the combined B and C domains of the PTS transport system enzyme II specific for glucose transport . Many of the genes in this family also include an A domain as part of the same polypeptide and thus should be given the name 'PTS system, glucose-specific IIABC component' while the Bacillus subtilus enzyme also contains an enzyme III domain which appears to act independently of the enzyme II domains . This group is most closely related to the N-acetylglucosamine-specific PTS enzymes (IPR010974 from INTERPRO).; GO: 0005355 glucose transmembrane transporter activity, 0015758 glucose transport, 0016021 integral to membrane. Probab=53.13 E-value=4.5 Score=19.09 Aligned_cols=12 Identities=8% Similarity=0.415 Sum_probs=5.3 Q ss_pred HHHCCCCEEEEE Q ss_conf 876598689999 Q 537021.9.peg.7 175 KKLFGDRLYVNL 186 (1033) Q Consensus 175 ~~~Fgd~~ylEl 186 (1033) +.+|.-=||.|+ T Consensus 240 HHIf~aPFwfE~ 251 (518) T TIGR02002 240 HHIFYAPFWFEF 251 (518) T ss_pred HHEECCCCEEEC T ss_conf 110038754555 No 183 >PRK03955 hypothetical protein; Reviewed Probab=52.89 E-value=6.7 Score=17.64 Aligned_cols=22 Identities=36% Similarity=0.596 Sum_probs=9.8 Q ss_pred CCCCHHHHHHHHHHHCCCCCHH Q ss_conf 4442046677655440375633 Q 537021.9.peg.7 377 GRGSGAGSVTAYALTITDIDPL 398 (1033) Q Consensus 377 GRGSAagSLVaY~LgIT~VDPi 398 (1033) ||||.+||.|-|.|--..+-|- T Consensus 60 ~kGSt~gs~vL~~l~~~g~AP~ 81 (132) T PRK03955 60 GKGSTVGSYVIYQLAKNGTAPK 81 (132) T ss_pred CCCCCHHHHHHHHHHHCCCCCC T ss_conf 9876468999999998699970 No 184 >TIGR02104 pulA_typeI pullulanase, type I; InterPro: IPR011840 Pullulan is an unusual, industrially important polysaccharide in which short alpha-1,4 chains (maltotriose) are connected in alpha-1,6 linkages. Enzymes that cleave alpha-1,6 linkages in pullulan and release maltotriose are called pullulanases although pullulan itself may not be the natural substrate. This family consists of pullulanases related to the entries in IPR011838 from INTERPRO and IPR011839 from INTERPRO but having a different domain architecture with shorter sequences. Members are called type I pullulanases .. Probab=52.89 E-value=11 Score=15.85 Aligned_cols=62 Identities=24% Similarity=0.431 Sum_probs=37.4 Q ss_pred CHHHHHHH---CCHHHHHHHHHHHHHH----CCCCCCCC--CCCCHHHHHHHHHCCCCCHHHHCCCHHHHHHHH Q ss_conf 32135654---1789999999988740----25434332--344358899876201410122114534788887 Q 537021.9.peg.7 569 IKFDFLGL---KTLTLLQKSLDFLAQR----GVKVDLSL--IPFDDHETYHLLTTKGTLGIFQLESSGMRQALE 633 (1033) Q Consensus 569 ~K~D~Lgl---~~L~~i~~~~~~i~~~----~~~~d~~~--ip~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l~ 633 (1033) ..||++|+ .|+-.|++++..|..+ |+-+|+.. ||.+++++-+-..+=--+|.| ..-.|--+| T Consensus 337 FRFDLMGihD~~TMn~ir~~l~~Idp~IlLyGEGWDl~~tpL~~~~KA~~~NA~~mp~Ia~F---ND~fRDalK 407 (655) T TIGR02104 337 FRFDLMGIHDIETMNEIRKALNKIDPSILLYGEGWDLGGTPLPPEQKATKANAYQMPGIAFF---NDEFRDALK 407 (655) T ss_pred CHHHHCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEC---CCCCCCCCC T ss_conf 20101234369999999999853398379974577876788883223589889855832301---563236500 No 185 >PRK06380 metal-dependent hydrolase; Provisional Probab=52.83 E-value=6.7 Score=17.66 Aligned_cols=59 Identities=25% Similarity=0.311 Sum_probs=31.3 Q ss_pred CCCHHHHHHCHHHHHHHCCC---HHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEE Q ss_conf 88133433144476443299---899999999789988998116517749999999997699638 Q 537021.9.peg.7 9 PSFVHLQVHSSYSLLEGALS---LNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPI 70 (1033) Q Consensus 9 ~~FvhLh~hS~ySlldg~~~---~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPI 70 (1033) |-||..|+|...|+++|... +.++++..-.. .-.+|+.....++..-...+-+.|+..+ T Consensus 54 PGfIn~H~Hl~~~~~rg~~~~~~l~~~l~~~~~~---~~~~~~~~~~~~a~~~~~e~l~~GvTt~ 115 (418) T PRK06380 54 PGFINTHAHVGMSASKGLFDDVDLSDFLEKTFDY---DSSRTDNGIFNSALLGMYEMINSGITTF 115 (418) T ss_pred ECCCCHHHCHHHHHHHHHCCCCCHHHHHHHHHHH---HHHCCHHHHHHHHHHHHHHHHHCCCEEE T ss_conf 3700975485307667543899899999978888---7866979999999999999996696489 No 186 >cd07229 Pat_TGL3_like Triacylglycerol lipase 3. Triacylglycerol lipase 3 (TGL3) are responsible for all the TAG lipase activity of the lipid particle. Triacylglycerol (TAG) lipases are also necessary for the mobilization of TAG stored in lipid particles. TGL3 contains the consensus sequence motif GXSXG, which is found in lipolytic enzymes. This family includes Tgl3p from Saccharomyces cerevisiae. Probab=52.77 E-value=11 Score=15.76 Aligned_cols=43 Identities=19% Similarity=0.200 Sum_probs=26.9 Q ss_pred HHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHCC Q ss_conf 9985104764334444204667765544037563332310567621001 Q 537021.9.peg.7 364 IQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERFLNPDR 412 (1033) Q Consensus 364 v~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERFLnp~R 412 (1033) |+.=-++|.+=--==||+|||+||=+++...=|-+ ++||+++. T Consensus 102 vkaL~e~~LLPrVIsGsS~GaiVAal~~t~tdeEL------~~~l~~~~ 144 (391) T cd07229 102 VKALWLRGLLPRIITGTATGALIAALVGVHTDEEL------LRFLDGDG 144 (391) T ss_pred HHHHHHCCCCCCEECCCCHHHHHHHHHHCCCHHHH------HHHHCCCC T ss_conf 99999738886420157536788989861777999------98834431 No 187 >KOG1111 consensus Probab=52.52 E-value=12 Score=15.40 Aligned_cols=44 Identities=20% Similarity=0.245 Sum_probs=33.8 Q ss_pred HHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHH Q ss_conf 433144476443299899999999789988998116517749999999997 Q 537021.9.peg.7 14 LQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACA 64 (1033) Q Consensus 14 Lh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk 64 (1033) .|-||.||.+- -+-+-.|..+|. -...||| ++||+..+-..--. T Consensus 93 vhghs~fS~la-----he~l~hartMGl-ktVfTdH-SlfGfad~~si~~n 136 (426) T KOG1111 93 VHGHSPFSYLA-----HEALMHARTMGL-KTVFTDH-SLFGFADIGSILTN 136 (426) T ss_pred EECCCHHHHHH-----HHHHHHHHHCCC-EEEEECC-CCCCCCCHHHHHHC T ss_conf 96577088899-----999998874582-5998524-31142331244211 No 188 >PRK12833 acetyl-CoA carboxylase biotin carboxylase subunit; Provisional Probab=52.50 E-value=13 Score=15.26 Aligned_cols=30 Identities=10% Similarity=0.263 Sum_probs=20.0 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHHHCCCCEEE Q ss_conf 99889981165177499999999976996389 Q 537021.9.peg.7 40 QQPAIAITDTNNLFSALEFSQKACAAGIQPII 71 (1033) Q Consensus 40 G~~AiAITD~~nl~Gav~F~~ackk~gIKPIi 71 (1033) .++-|-|+.+|- -+++..++|++.||+.+. T Consensus 4 ~ikkvLIANRGE--IA~Riirt~relgi~tVa 33 (458) T PRK12833 4 RIRTVLVANRGE--IAVRVIRAAHELGMRAVA 33 (458) T ss_pred CCCEEEEECCCH--HHHHHHHHHHHCCCCEEE T ss_conf 787899977869--999999999983998999 No 189 >TIGR02386 rpoC_TIGR DNA-directed RNA polymerase, beta' subunit; InterPro: IPR012754 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. This entry represents the beta-prime subunit, RpoC, found in most bacteria. It excludes some, mainly cyanobacterial, species where RpoC is replaced by two homologous proteins that include an additional domain. One arm of the "claw" is predominantly formed by this subunit, the other being predominantly formed by the beta subunit. The active site of the enzyme is defined by three invariant aspartate residues within the beta-prime subunit .; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006350 transcription. Probab=52.49 E-value=13 Score=15.26 Aligned_cols=31 Identities=13% Similarity=0.343 Sum_probs=17.6 Q ss_pred HHHHHHHHHHH--HHHHCCCCCCCEEEECCCCH Q ss_conf 26889872011--22100244744033525666 Q 537021.9.peg.7 509 DRLLEISQKLE--GLYRHASTHAAGIVIGDRPL 539 (1033) Q Consensus 509 ~~~~~~a~~le--g~~r~~~~Ha~Gvvi~~~~l 539 (1033) ++--++.+||| --.|+.|-||.=||+.--|+ T Consensus 260 ~~~kkllKRLe~~e~F~~Sgn~PeWMvl~~~PV 292 (1552) T TIGR02386 260 QKRKKLLKRLEILEAFKNSGNRPEWMVLDVIPV 292 (1552) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCEECCCCCC T ss_conf 688888898788999971378875303531575 No 190 >COG1786 Swiveling domain associated with predicted aconitase [Energy production and conversion] Probab=52.27 E-value=6.7 Score=17.69 Aligned_cols=22 Identities=41% Similarity=0.610 Sum_probs=11.1 Q ss_pred CCCCHHHHHHHHHHHCCCCCHH Q ss_conf 4442046677655440375633 Q 537021.9.peg.7 377 GRGSGAGSVTAYALTITDIDPL 398 (1033) Q Consensus 377 GRGSAagSLVaY~LgIT~VDPi 398 (1033) ||||-.||-|-|-|--+..-|. T Consensus 59 grGStvGSyVl~~l~~~G~AP~ 80 (131) T COG1786 59 GRGSTVGSYVLYELAKNGRAPA 80 (131) T ss_pred CCCCCCCHHHHHHHHHCCCCCH T ss_conf 9876502899999998499844 No 191 >COG1880 CdhB CO dehydrogenase/acetyl-CoA synthase epsilon subunit [Energy production and conversion] Probab=52.08 E-value=13 Score=15.21 Aligned_cols=27 Identities=26% Similarity=0.287 Sum_probs=22.7 Q ss_pred HHHHHHHHHHHHHHHCCCEEEECCCCC Q ss_conf 589999999999981898998088300 Q 537021.9.peg.7 193 DRYRESQVVQLAYTHELPLVATNNSLF 219 (1033) Q Consensus 193 e~~~~~~l~~lA~~~~iPlVaTndv~Y 219 (1033) |++..+.+.+++++.++|+|||.+..= T Consensus 48 dee~~E~~vKi~ekfnipivaTa~~~~ 74 (170) T COG1880 48 DEELLELAVKIIEKFNIPIVATASSMG 74 (170) T ss_pred CHHHHHHHHHHHHHCCCCEEECCHHHC T ss_conf 889999999999864874475321102 No 192 >PRK13135 consensus Probab=51.71 E-value=13 Score=15.17 Aligned_cols=40 Identities=10% Similarity=0.179 Sum_probs=17.4 Q ss_pred CCHHHHHHHHHHCCCCEEE----EECCCHHHHHHHHHHHHHHCCC Q ss_conf 9989999999978998899----8116517749999999997699 Q 537021.9.peg.7 27 LSLNNILDKIAADQQPAIA----ITDTNNLFSALEFSQKACAAGI 67 (1033) Q Consensus 27 ~~~e~LV~~A~~~G~~AiA----ITD~~nl~Gav~F~~ackk~gI 67 (1033) +++++..+..++.|-+|+- .-|. ++-...++.++..++|. T Consensus 2 ~ri~~~f~~lk~~~~~ali~yitaG~P-~~~~s~~~l~~l~~~Ga 45 (267) T PRK13135 2 SRITGTFAELKQRNTKALVTFITAGDP-DLATTEALIPLLAESGA 45 (267) T ss_pred CHHHHHHHHHHHCCCCEEEEEECCCCC-CHHHHHHHHHHHHHCCC T ss_conf 748999999997699548988717189-98999999999997599 No 193 >cd07206 Pat_TGL3-4-5_SDP1 Triacylglycerol lipase 3, 4, and 5 and Sugar-Dependent 1 lipase. Triacylglycerol lipases are involved in triacylglycerol mobilization and degradation; they are found in lipid particles. TGL4 is 30% homologus to TGL3, whereas TGL5 is 26% homologus to TGL3. Sugar-Dependent 1 (SDP1) lipase has a patatin-like acyl-hydrolase domain that initiates the breakdown of storage oil in germinating Arabidopsis seeds. This family includes subfamilies of proteins: TGL3, TGL4, TGL5, and SDP1. Probab=51.38 E-value=8.6 Score=16.74 Aligned_cols=32 Identities=19% Similarity=0.115 Sum_probs=21.1 Q ss_pred HHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 98510476433444420466776554403756 Q 537021.9.peg.7 365 QWAKKNNIPVGPGRGSGAGSVTAYALTITDID 396 (1033) Q Consensus 365 ~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VD 396 (1033) +.=-++|.+=--==||+|||+||=++++..=+ T Consensus 89 kaL~e~~LLPrVIsGsS~GaivAa~~~t~tde 120 (298) T cd07206 89 KALWEQDLLPRVISGSSAGAIVAALLGTHTDE 120 (298) T ss_pred HHHHHCCCCCCEECCCCHHHHHHHHHHCCCHH T ss_conf 99997488875310575578889898516718 No 194 >TIGR01235 pyruv_carbox pyruvate carboxylase; InterPro: IPR005930 In the postabsorptive state, blood glucose concentration is kept constant by a combination of glycogenolysis and gluconeogenesis. During prolonged fasting, when hepatic glycogen is exhausted, gluconeogenesis becomes the only source of plasma glucose. Gluconeogenesis (the production of new glucose molecules) occurs mainly in liver and, to a small extent, in kidneys. Pyruvate carboxylase (6.4.1.1 from EC), a member of the biotin-dependent enzyme family, catalyses the ATP-dependent carboxylation of pyruvate to oxaloacetate, thus playing a crucial role in gluconeogenesis.ATP + pyruvate + HCO3- = ADP + phosphate + oxaloacetate Most well characterised forms of active enzyme consist of four identical subunits arranged in a tetrahedron-like structure. Each subunit contains three functional domains: the biotin carboxylation domain, the transcarboxylation domain and the biotin carboxyl carrier domain.; GO: 0004736 pyruvate carboxylase activity, 0006094 gluconeogenesis, 0005737 cytoplasm. Probab=51.01 E-value=14 Score=15.09 Aligned_cols=52 Identities=25% Similarity=0.249 Sum_probs=40.4 Q ss_pred CHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEE Q ss_conf 44476443299899999999789988998116517749999999997699638 Q 537021.9.peg.7 18 SSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPI 70 (1033) Q Consensus 18 S~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPI 70 (1033) -.+.--++..+|++++.-|++++..||=== +|=|+=-.+|.++|.++||.-| T Consensus 56 rd~~Pv~AYL~I~eiI~vAk~~~vDaiHPG-YGfLSE~~~Fa~~v~~aGi~FI 107 (1169) T TIGR01235 56 RDLKPVEAYLSIDEIIRVAKKNKVDAIHPG-YGFLSENSEFADAVVKAGIVFI 107 (1169) T ss_pred CCCCCCCCCCCHHHHHHHHHCCCCCEEECC-CCCCCCCHHHHHHHHHCCCEEE T ss_conf 026872130471569899600789777088-5622488789999986895673 No 195 >COG4955 Uncharacterized protein conserved in bacteria [Function unknown] Probab=50.84 E-value=14 Score=15.07 Aligned_cols=37 Identities=27% Similarity=0.258 Sum_probs=25.1 Q ss_pred HHHHHHHHCCCCCHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHC Q ss_conf 889987620141012211453478888771740488887665411 Q 537021.9.peg.7 606 HETYHLLTTKGTLGIFQLESSGMRQALEGMQPDCIEDIIALVSLY 650 (1033) Q Consensus 606 ~~~~~l~~~g~t~GvFQ~es~~~~~~l~~~~p~~~~dl~~~~al~ 650 (1033) .+||.|+..|-|+ .+..-++++|..+++|=++.++|- T Consensus 247 ~kT~tm~~qgf~l--------e~Ia~ir~LKtsTIeDHfVEI~lh 283 (343) T COG4955 247 YKTYTMLLQGFTL--------EDIAAIRQLKTSTIEDHFVEILLH 283 (343) T ss_pred HHHHHHHHHCCCH--------HHHHHHHHHHHHHHHHHHEEEEEC T ss_conf 9999999926988--------899999865030165560024304 No 196 >KOG3142 consensus Probab=50.29 E-value=14 Score=15.00 Aligned_cols=93 Identities=10% Similarity=0.075 Sum_probs=56.5 Q ss_pred HHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC-HHHHHHHHHHHHCCCCCCCCCCCCCCEE Q ss_conf 999999750001331014788899999999997302666777510023542-0258999888863685035523342100 Q 537021.9.peg.7 749 IFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNV-EKIKKFCQDARQFNIQIMPPSVNTPCVD 827 (1033) Q Consensus 749 i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~-~~~~~~i~e~~~~gi~vl~PdIN~S~~~ 827 (1033) =|..+-.++..+++.|-+-|.+.+..-.+|+|.||-.-+...+.-+-.... .=+...+--+-=+.+-.+. |=..--.. T Consensus 39 pW~ef~d~~~fs~P~s~s~a~sRi~~Nl~yF~~NY~~iv~~~~~~sLi~~P~~Livl~~lv~~w~~LY~~r-d~pLvlfg 117 (187) T KOG3142 39 PWSEFFDRSAFSRPRSLSDATSRIKRNLSYFRVNYVIIVAILLFLSLITHPLSLIVLLALVAAWLFLYFLR-DEPLVLFG 117 (187) T ss_pred CHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEEEC-CCCEEEEE T ss_conf 89999731134788657899999998789999859999999999999986799999999999988403425-89858840 Q ss_pred EEECCCEEEEEHHHH Q ss_conf 353198698502685 Q 537021.9.peg.7 828 FKVGDNRIYYSLAAI 842 (1033) Q Consensus 828 f~~~~~~Ir~Gl~~I 842 (1033) +.+.+.....||.-+ T Consensus 118 r~i~d~~~l~~L~~~ 132 (187) T KOG3142 118 RQISDREVLIGLVLI 132 (187) T ss_pred EEECCCCHHHHHHHH T ss_conf 783682213437999 No 197 >pfam02679 ComA (2R)-phospho-3-sulfolactate synthase (ComA). In methanobacteria (2R)-phospho-3-sulfolactate synthase (ComA) catalyses the first step of the biosynthesis of coenzyme M from phosphoenolpyruvate (P-enolpyruvate). This novel enzyme catalyses the stereospecific Michael addition of sulfite to P-enolpyruvate, forming L-2-phospho-3-sulfolactate (PSL). It is suggested that the ComA-catalysed reaction is analogous to those reactions catalysed by beta-elimination enzymes that proceed through an enolate intermediate. Probab=50.03 E-value=14 Score=14.97 Aligned_cols=12 Identities=8% Similarity=0.398 Sum_probs=5.0 Q ss_pred HHHHHHHHCCCC Q ss_conf 999998303802 Q 537021.9.peg.7 430 VIRYVQNKYGHE 441 (1033) Q Consensus 430 vi~y~~~kyG~~ 441 (1033) +..|..++||.+ T Consensus 210 ~q~~~I~~~G~~ 221 (245) T pfam02679 210 QQEWFIKRFGPN 221 (245) T ss_pred HHHHHHHHHCCC T ss_conf 999999997989 No 198 >COG0556 UvrB Helicase subunit of the DNA excision repair complex [DNA replication, recombination, and repair] Probab=49.83 E-value=13 Score=15.27 Aligned_cols=229 Identities=19% Similarity=0.276 Sum_probs=106.5 Q ss_pred CCHHHHHHHHHHHHHHCCCC---EEEEEECC---------------------CHHHHHHHHHHHHHHHHHCCCEEEECCC Q ss_conf 99789999999998765986---89999439---------------------9115899999999999818989980883 Q 537021.9.peg.7 162 NSSQVAEKRLLTFKKLFGDR---LYVNLQRH---------------------RGYDRYRESQVVQLAYTHELPLVATNNS 217 (1033) Q Consensus 162 ~~~~~a~~~l~~l~~~Fgd~---~ylEl~~~---------------------~~~e~~~~~~l~~lA~~~~iPlVaTndv 217 (1033) .+...|.+....+++.|++| +|+.-.+. ...|+.+......|...-++=+||+=.| T Consensus 65 hNKTLAaQLy~Efk~fFP~NaVEYFVSYYDYYQPEAYvp~tDtyIEKdasiNdeId~mR~SAt~sLleR~DVIVVaSVSc 144 (663) T COG0556 65 HNKTLAAQLYSEFKEFFPENAVEYFVSYYDYYQPEAYVPSTDTYIEKDASINDEIDRLRHSATRSLLERRDVIVVASVSC 144 (663) T ss_pred CCHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCCCCCCCEEECCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEE T ss_conf 55547999999999768676458886400146851345777735753650678999999998887750478699998765 Q ss_pred CCC--CHHHHHHHHHHHHHHCCCCCCCC--------------------CCCCCCCC---CCCCCHHHH---HHHCCHHHH Q ss_conf 007--87889999876688469857653--------------------10024534---323434788---751220358 Q 537021.9.peg.7 218 LFL--SEEDYEAHDVLMAVAHSTVVSQK--------------------DRPRVTPD---HYLKNRSEM---VSIFSDLPE 269 (1033) Q Consensus 218 ~Y~--~~~D~~~h~iL~aI~~~~~l~~~--------------------~~~~~~~~---~yLks~eEM---~~~f~~~pe 269 (1033) -|- +|+++. +....+..|..++.. +.++..++ -|.-+.++. .+.|.|--+ T Consensus 145 IYGlG~P~~y~--~~~~~l~vG~~i~~~~ll~~Lv~~qY~Rnd~~f~rG~FRvrGDvveIfPa~~~~~~~RieffgDEie 222 (663) T COG0556 145 IYGLGSPEEYL--KMMLSLEVGQEIDRRELLRKLVDLQYERNDVDFQRGTFRVRGDVVEIFPAYYDDEAIRIEFFGDEIE 222 (663) T ss_pred ECCCCCHHHHH--HHEEEEECCCCCCHHHHHHHHHHHCEEECCCCCCCCCEEEECCEEEECCCCCCCEEEEEEECCCHHH T ss_conf 10489869987--5379983577348999999999726111455546671585176799721666760599997275303 Q ss_pred HHHHHHHHHHHHHHHHHHHH--CCCCCCCCCCCCCCCCCHHHHHHH---HHHHHHHHCCCCCCC-CCCCCHHHHHHHHHH Q ss_conf 99878864111000133221--023455575688776898899987---654103420124643-245783899999887 Q 537021.9.peg.7 270 ALENTVEIARRCSFILQVHA--PILPRFIEKECDDIQKEEENELRN---KAVAGLEVRLAQGAI-AKGYHAQDYRQRLDF 343 (1033) Q Consensus 270 Ai~NT~~IAe~c~~~l~~~~--~~lP~f~~p~~~~~~~~~~~~Lr~---l~~~Gl~~R~~~~~~-~~~~~~~~y~~RLe~ 343 (1033) +| +.++.-.+. -.++.|..-+... ..++.+.+.+ ...+-|++|+..... .+....+...+|-+| T Consensus 223 ~i---------~~~dpltg~~~~~~~~~~iypash-yvt~~~~~~~Ai~~Ik~EL~eRl~~~~~~~kllEaqRL~qRT~~ 292 (663) T COG0556 223 RI---------SEFDPLTGQVIGELDRFTIYPASH-YVTPRERLEQAIESIKEELEERLKFFEKNNKLLEAQRLEQRTEY 292 (663) T ss_pred EE---------EEECCCCCCCCCCCCCEEEECCCC-CCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 35---------344456685213455189805633-14788899999999999999999999877953788888877775 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCC Q ss_conf 7898986063055788899999851047643344442046677655440375633323105676210014578753336 Q 537021.9.peg.7 344 ELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERFLNPDRMSMPDFDIDF 422 (1033) Q Consensus 344 EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf 422 (1033) -|+.+.++||+. -|=||+|- +.|-.-|...-+|.-|.= +--|+ |+...-+++|-|-==+ T Consensus 293 DlEMl~e~G~C~------GIENYSRh---l~gr~~Ge~P~tL~DYfp--------~d~Ll---~IDESHvTvPQi~gMy 351 (663) T COG0556 293 DLEMLRETGYCS------GIENYSRH---LTGRKPGEPPYTLFDYFP--------DDFLL---FIDESHVTVPQIGGMY 351 (663) T ss_pred HHHHHHHHCCCC------CHHHHHHH---HCCCCCCCCCCCHHHHCC--------CCEEE---EEECCCCCHHHHHCHH T ss_conf 399999828776------61333565---058999957984788588--------41389---9824446507653201 No 199 >pfam00289 CPSase_L_chain Carbamoyl-phosphate synthase L chain, N-terminal domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosynthesis of arginine and/or pyrimidines. The carbamoyl-phosphate synthase (CPS) enzyme in prokaryotes is a heterodimer of a small and large chain. The small chain promotes the hydrolysis of glutamine to ammonia, which is used by the large chain to synthesize carbamoyl phosphate. See pfam00988. The small chain has a GATase domain in the carboxyl terminus. See pfam00117. Probab=49.81 E-value=14 Score=14.95 Aligned_cols=26 Identities=12% Similarity=0.344 Sum_probs=13.1 Q ss_pred EEEECCCHHHHHHHHHHHHHHCCCCEEE Q ss_conf 9981165177499999999976996389 Q 537021.9.peg.7 44 IAITDTNNLFSALEFSQKACAAGIQPII 71 (1033) Q Consensus 44 iAITD~~nl~Gav~F~~ackk~gIKPIi 71 (1033) +-|...|-. +++..++|++.||+.+. T Consensus 4 vLIanrGei--A~ri~rt~re~gi~~v~ 29 (109) T pfam00289 4 VLVANRGEI--AVRIIRALRELGIETVA 29 (109) T ss_pred EEEECCCHH--HHHHHHHHHHCCCCEEE T ss_conf 999888799--99999999986997999 No 200 >TIGR02283 MltB_2 lytic murein transglycosylase; InterPro: IPR011970 Proteins of this entries are closely related to the MltB family lytic murein transglycosylases described by IPR011757 from INTERPRO and are likewise all proteobacterial, although that family and this one form clearly distinct clades. Several species have one member of each family. Many proteins of this entry(unlike the MltB family) contain an additional C-terminal domain, a putative peptidoglycan binding domain (IPR002477 from INTERPRO), not included in region described by this model. Many sequences appear to contain N-terminal lipoprotein attachment sites, as does Esherichia coli MltB in IPR011757 from INTERPRO.. Probab=49.04 E-value=14 Score=14.86 Aligned_cols=83 Identities=20% Similarity=0.283 Sum_probs=54.2 Q ss_pred HHHHHHHHCCCCCC---HHHHHHHHHHHHHHH-CCCCHHHHHHHHHHHHHHHHCCCC----CCHHHHHHHHHHHHHHHHH Q ss_conf 07889762012110---167787899999862-499999999999999975000133----1014788899999999997 Q 537021.9.peg.7 710 ADVLRRAMGKKIKE---EMDKQKERFISGASK-NGISKTIAVNIFELLAKFADYGFN----KSHAAAYAVISYQTAWMKT 781 (1033) Q Consensus 710 ad~lRra~~kk~~~---~~~~~~~~f~~g~~~-~g~~~~~a~~i~~~i~~f~~Y~Fn----ksHa~aya~~~y~~aylK~ 781 (1033) .|-+++.++.+... .|....+.++...-+ =|++..+.-.||-+=-.|+.|.=+ .-==-|-|.|||-++==+. T Consensus 59 ~~Y~~~~~~~~r~~~G~~~~~~~~~~L~~iE~~yGVp~~~l~AiWGmEs~fG~~~g~~~~M~dv~~sLATLAy~~~RR~~ 138 (313) T TIGR02283 59 WDYLSRRVSPKRIAIGRALLQRYAALLARIEKRYGVPAEILLAIWGMESDFGAYQGKDDVMFDVIRSLATLAYDGARRKD 138 (313) T ss_pred HHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHEEHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCHH T ss_conf 99976301478999999999999999999875238881041000113413103116751013389999998624665556 Q ss_pred HHHHHHHHHHH Q ss_conf 30266677751 Q 537021.9.peg.7 782 HYPVEFLAASM 792 (1033) Q Consensus 782 ~yP~eF~aa~L 792 (1033) +|-.||++||= T Consensus 139 ~F~~EL~aALk 149 (313) T TIGR02283 139 FFRTELIAALK 149 (313) T ss_pred HHHHHHHHHHH T ss_conf 76689999999 No 201 >pfam01301 Glyco_hydro_35 Glycosyl hydrolases family 35. Probab=48.95 E-value=14 Score=14.85 Aligned_cols=49 Identities=22% Similarity=0.165 Sum_probs=37.1 Q ss_pred HHHHHHHHHCCCCEEEE-------------ECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEE Q ss_conf 99999999789988998-------------11651774999999999769963899999985 Q 537021.9.peg.7 30 NNILDKIAADQQPAIAI-------------TDTNNLFSALEFSQKACAAGIQPIIGCQLDID 78 (1033) Q Consensus 30 e~LV~~A~~~G~~AiAI-------------TD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~ 78 (1033) ++.++++|..|+.+|+. -|-....-..+|.+.|+++|+.-|+=.-=|++ T Consensus 27 ~drL~K~KA~GlN~V~tYv~Wn~HE~~~G~~dF~G~~Dl~~Fl~la~~~GL~VilRpGPYic 88 (317) T pfam01301 27 PDRLQKAKALGLNTIETYVFWNLHEPEPGQYDFSGILDLVKFIKLAQEAGLYVILRPGPYIC 88 (317) T ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCCCCEECCCCHHHHHHHHHHHHHCCCEEEECCCCEEE T ss_conf 99999999739978998557634589788401235678999999999869879952688563 No 202 >KOG0140 consensus Probab=48.89 E-value=10 Score=16.11 Aligned_cols=51 Identities=22% Similarity=0.244 Sum_probs=29.0 Q ss_pred HHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHH-HHHHHHHHCCCCC Q ss_conf 89762012110167787899999862499999999999-9999750001331 Q 537021.9.peg.7 713 LRRAMGKKIKEEMDKQKERFISGASKNGISKTIAVNIF-ELLAKFADYGFNK 763 (1033) Q Consensus 713 lRra~~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~-~~i~~f~~Y~Fnk 763 (1033) +-|-|-.+...+++.-+..--.-++.+=+..+.|..+- +.++-|+|||||+ T Consensus 319 ~aRL~~~~aa~evd~~r~~sy~aSiAK~fA~D~an~~at~AvQifGG~Gfn~ 370 (408) T KOG0140 319 LARLMTRNAAWEVDNGRRNSYYASIAKLFATDTANQAATNAVQIFGGNGFNK 370 (408) T ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 9999999889986268861278999998752058899877787544677564 No 203 >PRK08462 biotin carboxylase; Validated Probab=48.87 E-value=14 Score=14.84 Aligned_cols=70 Identities=13% Similarity=0.140 Sum_probs=34.5 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEE--ECCHHHHHHHHHHHH Q ss_conf 998899811651774999999999769963899999985476643344565666677539999--569689999999998 Q 537021.9.peg.7 40 QQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLL--VSTAEGYQRLIELVS 117 (1033) Q Consensus 40 G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlL--AkN~~Gy~nL~kL~S 117 (1033) .++-|-|...|- -+++-.++|++-||+.+.= |-..+....... ..-..+.+ +...+-|-|.-+++. T Consensus 3 ~~kkvLIANRGE--IA~Ri~ra~~elgi~tVav---ys~~D~~a~~v~-------~ad~~~~ig~~~~~~sYln~~~Ii~ 70 (446) T PRK08462 3 ELKRILIANRGE--IALRAIRTIQEMGKEAIAI---YSTADKDALYLK-------YADAKICIGGAKSSESYLNIPAIIS 70 (446) T ss_pred CCCEEEEECCHH--HHHHHHHHHHHCCCCEEEE---ECHHHHCCCHHH-------HCCEEEECCCCCCCCCCCCHHHHHH T ss_conf 788899989609--9999999999829969999---376771676287-------5989898289984334148999999 Q ss_pred HHHH Q ss_conf 9973 Q 537021.9.peg.7 118 RMYL 121 (1033) Q Consensus 118 ~a~~ 121 (1033) .+.. T Consensus 71 ~a~~ 74 (446) T PRK08462 71 AAEI 74 (446) T ss_pred HHHH T ss_conf 9999 No 204 >cd00128 XPG Xeroderma pigmentosum G N- and I-regions (XPGN, XPGI); contains the HhH2 motif; domain in nucleases. XPG is a eukaryotic enzyme that functions in nucleotide-excision repair and transcription-coupled repair of oxidative DNA damage. Functionally/structurally related to FEN-1; divalent metal ion-dependent exo- and endonuclease, and bacterial and bacteriophage 5'3' exonucleases. Probab=48.78 E-value=12 Score=15.60 Aligned_cols=42 Identities=19% Similarity=0.353 Sum_probs=24.9 Q ss_pred HHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCC Q ss_conf 3323105676210014578753336568789999999830380 Q 537021.9.peg.7 398 LRFSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGH 440 (1033) Q Consensus 398 i~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~ 440 (1033) -.+.-..|=|+||.=.+ ..+++.+..-..+.+++++.+..|- T Consensus 264 ~~~~~~~~~f~~p~v~~-~~~~~~~~~pd~~~l~~fl~~e~~f 305 (316) T cd00128 264 FPLKEAREFFLNPEVTD-DFIDLRWRDPDEEGIIEFLCKEHGF 305 (316) T ss_pred CCHHHHHHHHCCCCCCC-CCCCCCCCCCCHHHHHHHHHHCCCC T ss_conf 99799999857999999-8667656999989999997436698 No 205 >PRK05298 excinuclease ABC subunit B; Provisional Probab=48.73 E-value=6.3 Score=17.91 Aligned_cols=204 Identities=19% Similarity=0.292 Sum_probs=100.9 Q ss_pred CCHHHHHHHHHHHHHHCCCC---EEEEEECC-CH--------------------HHHHHHHHHHHHHHHHCCCEEEECCC Q ss_conf 99789999999998765986---89999439-91--------------------15899999999999818989980883 Q 537021.9.peg.7 162 NSSQVAEKRLLTFKKLFGDR---LYVNLQRH-RG--------------------YDRYRESQVVQLAYTHELPLVATNNS 217 (1033) Q Consensus 162 ~~~~~a~~~l~~l~~~Fgd~---~ylEl~~~-~~--------------------~e~~~~~~l~~lA~~~~iPlVaTndv 217 (1033) .+...|.+....|++.||+| +|+.-.++ .+ -|..+......|-..-++=+||+=.| T Consensus 66 hNKTLAAQLy~Efk~fFP~NaVEYFVSYyDYYQPEAYip~tDtYIEKdssIN~eId~lR~sAT~SLl~RrDVIVVASVSC 145 (657) T PRK05298 66 HNKTLAAQLYGEFKEFFPENAVEYFVSYYDYYQPEAYVPSSDTYIEKDSSINEEIERLRLSATKSLLERRDVIVVASVSC 145 (657) T ss_pred CCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCCEEECCCHHHHHHHHHHHHHHHHHHCCCCEEEEEEHHH T ss_conf 65889999999999768887179985144566971114788852423634669999999999998854798699973676 Q ss_pred CCC--CHHHHHHHHHHHHHHCCCCCCCCCCCC-CCCCCCCCC-----------HHHHHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 007--878899998766884698576531002-453432343-----------478875122035899878864111000 Q 537021.9.peg.7 218 LFL--SEEDYEAHDVLMAVAHSTVVSQKDRPR-VTPDHYLKN-----------RSEMVSIFSDLPEALENTVEIARRCSF 283 (1033) Q Consensus 218 ~Y~--~~~D~~~h~iL~aI~~~~~l~~~~~~~-~~~~~yLks-----------~eEM~~~f~~~peAi~NT~~IAe~c~~ 283 (1033) -|- +|+|+.-.-+ .+..|..++.....+ +-.-+|-++ .-+..+.|+-. ....|=+++| T Consensus 146 IYGLGsPe~Y~~~~l--~l~~G~~i~r~~ll~~LV~~qY~RnD~~~~RG~FRVrGDvVdI~Pa~------~ed~aiRIef 217 (657) T PRK05298 146 IYGLGSPEEYLKMVL--HLRVGQEIDRRELLRRLVELQYERNDIDFQRGTFRVRGDVIDIFPAE------YEDRAIRIEF 217 (657) T ss_pred CCCCCCHHHHHHCEE--EEECCCEECHHHHHHHHHHHCCCCCCCCCCCCEEEEECCEEEEECCC------CCCEEEEEEE T ss_conf 168999799973437--77589771799999999982877676511685557736647883377------7761699999 Q ss_pred -----------HHHH--------HHCCCCCCCCCCCCCCCCCHHHHHH---HHHHHHHHHCCCCCCC-CCCCCHHHHHHH Q ss_conf -----------1332--------2102345557568877689889998---7654103420124643-245783899999 Q 537021.9.peg.7 284 -----------ILQV--------HAPILPRFIEKECDDIQKEEENELR---NKAVAGLEVRLAQGAI-AKGYHAQDYRQR 340 (1033) Q Consensus 284 -----------~l~~--------~~~~lP~f~~p~~~~~~~~~~~~Lr---~l~~~Gl~~R~~~~~~-~~~~~~~~y~~R 340 (1033) +.-. .-..+|. ... .++.+.+. +...+-|++|+..... .+....+...+| T Consensus 218 fgDeIE~I~~~Dpltg~~i~~~~~~~IfPA---shy----Vt~~e~l~~Ai~~I~~EL~eRl~~f~~~gKllEAqRL~qR 290 (657) T PRK05298 218 FGDEIERISEFDPLTGEVLGELDRVTIYPA---SHY----VTPRERLERAIESIKEELEERLKELEKEGKLLEAQRLEQR 290 (657) T ss_pred ECCCEEEEEEECCCCCCCCCCCCEEEEECC---CCC----CCCHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 578062899982778835243334998178---676----8997999999999999999999999976777988899999 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHH Q ss_conf 8877898986063055788899999851047643344442046677655 Q 537021.9.peg.7 341 LDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYA 389 (1033) Q Consensus 341 Le~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~ 389 (1033) -+|-|+.|.++||+. -|=||+|- +-|---|+..-+|.-|. T Consensus 291 T~yDlEMl~E~GyCs------GIENYSRh---l~gR~pGe~P~tLlDYf 330 (657) T PRK05298 291 TRYDLEMLRELGYCS------GIENYSRH---LSGRAPGEPPPTLLDYF 330 (657) T ss_pred HHHHHHHHHHCCCCC------CHHHHHHH---CCCCCCCCCCCHHHHHC T ss_conf 887999999828775------60121565---06999887894488757 No 206 >COG4956 Integral membrane protein (PIN domain superfamily) [General function prediction only] Probab=48.52 E-value=3.4 Score=20.21 Aligned_cols=35 Identities=9% Similarity=0.122 Sum_probs=13.7 Q ss_pred HHHHHCEEECC-CCHHHHHHHHHHCCCCCHHHHHCCC Q ss_conf 78872621163-8558999999874897688985055 Q 537021.9.peg.7 955 EQRVLGFYFSG-HPLDVYKPILRKMGIKGYEESVATI 990 (1033) Q Consensus 955 E~e~LGf~lS~-HPl~~~~~~l~~~~~~~~~~~~~~~ 990 (1033) -+++=|.-++. --|+..-+ ++...+.++.|+.+.. T Consensus 252 Ak~~~g~lvTND~NLnKVae-~qgV~vLNINDLAnAV 287 (356) T COG4956 252 AKVTGGKLVTNDFNLNKVAE-LQGVQVLNINDLANAV 287 (356) T ss_pred HHHHCCEEEECCCCHHHHHH-HCCCCEECHHHHHHHH T ss_conf 99968889724676888776-4488463088888873 No 207 >TIGR02472 sucr_P_syn_N sucrose-phosphate synthase, putative, glycosyltransferase domain; InterPro: IPR012822 This family consists of the N-terminal regions, or in some cases the entirety, of bacterial proteins closely related to plant sucrose-phosphate synthases (SPS). The C-terminal domain (IPR012821 from INTERPRO), found with most members of this family, resembles both bona fide plant sucrose-phosphate phosphatases (SPP) and the SPP-like domain of plant sucrose phosphate synthase (SPS). At least two members of this family lack the SPP-like domain, which may have binding or regulatory rather than enzymatic activity by analogy to plant SPS. This enzyme produces sucrose 6-phosphate and UDP from UDP-glucose and D-fructose 6-phosphate, and may be encoded near the gene for fructokinase.. Probab=48.49 E-value=9.5 Score=16.39 Aligned_cols=15 Identities=33% Similarity=0.408 Sum_probs=10.8 Q ss_pred HHHHHHHCCCEEEEC Q ss_conf 999998189899808 Q 537021.9.peg.7 201 VQLAYTHELPLVATN 215 (1033) Q Consensus 201 ~~lA~~~~iPlVaTn 215 (1033) .++|..++||+|-|. T Consensus 130 ~~ls~~L~vPlvfTG 144 (445) T TIGR02472 130 ARLSRLLGVPLVFTG 144 (445) T ss_pred HHHHHHCCCCEEEEC T ss_conf 999862589848837 No 208 >pfam00091 Tubulin Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial cell division. It is part of a ring in the middle of the dividing cell that is required for constriction of cell membrane and cell envelope to yield two daughter cells. FtsZ and tubulin are GTPases. FtsZ can polymerize into tubes, sheets, and rings in vitro and is ubiquitous in eubacteria and archaea. Tubulin is the major component of microtubules. Probab=48.20 E-value=11 Score=15.83 Aligned_cols=46 Identities=33% Similarity=0.354 Sum_probs=31.1 Q ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCC Q ss_conf 389999988778989860630557888999998510476433444420466776554403 Q 537021.9.peg.7 334 AQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTIT 393 (1033) Q Consensus 334 ~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT 393 (1033) .++..+|+.+|++- .+-.+.|++++ + +|=|=||++||+++..|.-. T Consensus 102 ~~~~~~~irk~~E~---cd~~~gf~i~~---------S--lgGGTGSG~gs~l~~~l~~~ 147 (210) T pfam00091 102 AEESLEEIRKELEK---CDGLDGFFITA---------G--LGGGTGSGAAPVIAEILKEL 147 (210) T ss_pred HHHHHHHHHHHHHH---CCCCCEEEEEE---------E--ECCCCCCCHHHHHHHHHHHH T ss_conf 99999999999982---58845188772---------2--26877655899999999997 No 209 >KOG2499 consensus Probab=48.13 E-value=11 Score=15.82 Aligned_cols=15 Identities=27% Similarity=0.587 Sum_probs=13.9 Q ss_pred HHHHHHHHHCCCCCC Q ss_conf 899999851047643 Q 537021.9.peg.7 360 VADFIQWAKKNNIPV 374 (1033) Q Consensus 360 V~Div~~ak~~gI~v 374 (1033) |.++|+|||-+||.| T Consensus 252 v~evV~yarlRGIRV 266 (542) T KOG2499 252 VSEVVEYARLRGIRV 266 (542) T ss_pred HHHHHHHHHHCCCEE T ss_conf 999999988626366 No 210 >PRK08463 acetyl-CoA carboxylase; Validated Probab=48.05 E-value=15 Score=14.74 Aligned_cols=68 Identities=12% Similarity=0.060 Sum_probs=33.9 Q ss_pred CEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEEE-CCHHHHHHHHHHHHHHH Q ss_conf 88998116517749999999997699638999999854766433445656666775399995-69689999999998997 Q 537021.9.peg.7 42 PAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLLV-STAEGYQRLIELVSRMY 120 (1033) Q Consensus 42 ~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlLA-kN~~Gy~nL~kL~S~a~ 120 (1033) +-|-|+.+|- -+++..++|++-||+++.= |-..+...... + ..+ ..+.+- ...+-|-|.-++++.+. T Consensus 3 ~kvLIANRGE--IA~Riirt~relgi~tVav---ys~~D~~s~hv---~-~Ad---ea~~lg~~~~~sYLn~~~Ii~~A~ 70 (478) T PRK08463 3 HKILIANRGE--IAVRIIRACRDLHIKSVAI---YTEPDRECLHV---K-IAD---EAYRIGTDPIKGYLDVKRIVEIAK 70 (478) T ss_pred CEEEEECCCH--HHHHHHHHHHHCCCCEEEE---ECHHHCCCCHH---H-HCC---EEEECCCCHHHHHCCHHHHHHHHH T ss_conf 6178966869--9999999999839978999---78576578347---7-578---766528873453069999999999 Q ss_pred H Q ss_conf 3 Q 537021.9.peg.7 121 L 121 (1033) Q Consensus 121 ~ 121 (1033) . T Consensus 71 ~ 71 (478) T PRK08463 71 A 71 (478) T ss_pred H T ss_conf 8 No 211 >PRK07075 isochorismate-pyruvate lyase; Reviewed Probab=48.04 E-value=15 Score=14.74 Aligned_cols=35 Identities=14% Similarity=0.152 Sum_probs=26.7 Q ss_pred CCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 11016778789999986249999999999999997 Q 537021.9.peg.7 721 IKEEMDKQKERFISGASKNGISKTIAVNIFELLAK 755 (1033) Q Consensus 721 ~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~ 755 (1033) .++-.+++-+.-..-|.+.|.+++.++.+|..|-. T Consensus 50 ~p~R~~~vi~~~r~~A~~~Gl~pd~vE~l~r~li~ 84 (101) T PRK07075 50 APERVAAMLPERRRWAEQAGLDADFVEKLFAQLIH 84 (101) T ss_pred CHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 90799999999999999859699999999999999 No 212 >cd03320 OSBS o-Succinylbenzoate synthase (OSBS) catalyzes the conversion of 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate (SHCHC) to 4-(2'-carboxyphenyl)-4-oxobutyrate (o-succinylbenzoate or OSB), a reaction in the menaquinone biosynthetic pathway. Menaquinone is an essential cofactor for anaerobic growth in eubacteria and some archaea. OSBS belongs to the enolase superfamily of enzymes, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion. Probab=47.61 E-value=15 Score=14.69 Aligned_cols=25 Identities=16% Similarity=0.197 Sum_probs=16.7 Q ss_pred HHCCHHHHHHHHHHHHHHHCCCCCC Q ss_conf 6063055788899999851047643 Q 537021.9.peg.7 350 RMKFSGYFLIVADFIQWAKKNNIPV 374 (1033) Q Consensus 350 ~~gf~~YFLiV~Div~~ak~~gI~v 374 (1033) +..-.+=+.=...+++.|+++|+.+ T Consensus 206 K~~~~GGi~~~~~i~~~a~~~gi~~ 230 (263) T cd03320 206 KPALLGGPRALLELAEEARARGIPA 230 (263) T ss_pred CCHHCCCHHHHHHHHHHHHHCCCCE T ss_conf 7201089999999999999859969 No 213 >KOG0369 consensus Probab=47.36 E-value=15 Score=14.66 Aligned_cols=47 Identities=21% Similarity=0.237 Sum_probs=22.9 Q ss_pred HHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEE Q ss_conf 443299899999999789988998116517749999999997699638 Q 537021.9.peg.7 23 LEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPI 70 (1033) Q Consensus 23 ldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPI 70 (1033) -.....++++++.|++++..++-=- +|-++---+|.++|+++||.-| T Consensus 89 V~AYL~ideii~iak~~~vdavHPG-YGFLSErsdFA~av~~AGi~fi 135 (1176) T KOG0369 89 VGAYLAIDEIISIAKKHNVDAVHPG-YGFLSERSDFAQAVQDAGIRFI 135 (1176) T ss_pred HHHHHCHHHHHHHHHHCCCCEECCC-CCCCCCCHHHHHHHHHCCCEEE T ss_conf 6765147999999987288732577-4212212489999986596696 No 214 >pfam09194 Endonuc-BsobI Restriction endonuclease BsobI. Members of this family of prokaryotic restriction endonucleases recognize the double-stranded sequence CYCGRG (where Y = T/C, and R = A/G) and cleave after C-1. They catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Probab=47.30 E-value=9.7 Score=16.31 Aligned_cols=34 Identities=29% Similarity=0.346 Sum_probs=19.6 Q ss_pred CCHHHHHH-HHHCCCCCHHHHCCCHHHHHHHHHHCCCHHHH Q ss_conf 43588998-76201410122114534788887717404888 Q 537021.9.peg.7 603 FDDHETYH-LLTTKGTLGIFQLESSGMRQALEGMQPDCIED 642 (1033) Q Consensus 603 ~~D~~~~~-l~~~g~t~GvFQ~es~~~~~~l~~~~p~~~~d 642 (1033) +=|+-||+ ||.+|||.|= .||.+-.-+.-+.|.- T Consensus 106 fveELvyRfLLtrGDtLGG------~MRNigG~laqrK~tR 140 (316) T pfam09194 106 FVEELVYRFLLTRGDTLGG------SMRNIGGSLAQRKFTR 140 (316) T ss_pred HHHHHHHHHHHHCCCCCCC------HHHHHHHHHHHHHHHH T ss_conf 8999999999963542350------3553367888999999 No 215 >PRK11337 DNA-binding transcriptional repressor RpiR; Provisional Probab=47.16 E-value=12 Score=15.49 Aligned_cols=17 Identities=24% Similarity=0.389 Sum_probs=5.5 Q ss_pred HHHHHCCCCHHHHHHHH Q ss_conf 99975279988988654 Q 537021.9.peg.7 460 DVGRALQMPYSQVDRLC 476 (1033) Q Consensus 460 d~~r~~g~~~~~~~~~~ 476 (1033) |+|+..|.+...+-+++ T Consensus 52 eLA~~~~vS~aTVvRf~ 68 (293) T PRK11337 52 DVAEALAVSEAMIVKVA 68 (293) T ss_pred HHHHHHCCCHHHHHHHH T ss_conf 99989599888999999 No 216 >COG0337 AroB 3-dehydroquinate synthetase [Amino acid transport and metabolism] Probab=47.09 E-value=15 Score=14.63 Aligned_cols=18 Identities=6% Similarity=0.279 Sum_probs=8.5 Q ss_pred HHHHHHHHHCCCCCCCCC Q ss_conf 899988886368503552 Q 537021.9.peg.7 803 KKFCQDARQFNIQIMPPS 820 (1033) Q Consensus 803 ~~~i~e~~~~gi~vl~Pd 820 (1033) ..+.+.+++.|+++..|+ T Consensus 289 ~r~~~~L~~~gLP~~~~~ 306 (360) T COG0337 289 ERILNLLKRYGLPTSLPD 306 (360) T ss_pred HHHHHHHHHCCCCCCCCC T ss_conf 999999998599956877 No 217 >TIGR00506 ribB 3,4-dihydroxy-2-butanone 4-phosphate synthase; InterPro: IPR000422 3,4-Dihydroxy-2-butanone 4-phosphate is biosynthesized from ribulose 5-phosphate and serves as the biosynthetic precursor for the xylene ring of riboflavin . It is sometimes found as a bifunctional enzyme with GTP cyclohydrolase II that catalyses the first committed step in the biosynthesis of riboflavin (IPR000926 from INTERPRO). No sequences with significant homology to DHBP synthase are found in the metazoa.; GO: 0008686 34-dihydroxy-2-butanone-4-phosphate synthase activity, 0009231 riboflavin biosynthetic process. Probab=46.82 E-value=11 Score=15.78 Aligned_cols=31 Identities=19% Similarity=0.375 Sum_probs=19.4 Q ss_pred HHHHHHHHCCH---HHHHHHHH--HHHHHHCCCCCC Q ss_conf 78989860630---55788899--999851047643 Q 537021.9.peg.7 344 ELNVIARMKFS---GYFLIVAD--FIQWAKKNNIPV 374 (1033) Q Consensus 344 EL~vI~~~gf~---~YFLiV~D--iv~~ak~~gI~v 374 (1033) ...|||+|==+ |=.+-.-| +++||+++|+.+ T Consensus 185 pa~VICE~~nd~edG~mar~~ekf~~~yA~Kh~l~~ 220 (230) T TIGR00506 185 PAGVICEIMNDEEDGTMARKPEKFLVEYAKKHGLKL 220 (230) T ss_pred CCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHCCCC T ss_conf 728998623772033103666488999998819850 No 218 >smart00475 53EXOc 5'-3' exonuclease. Probab=46.50 E-value=16 Score=14.57 Aligned_cols=31 Identities=29% Similarity=0.647 Sum_probs=22.4 Q ss_pred EHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCC Q ss_conf 026858969899999999983799999789997348 Q 537021.9.peg.7 838 SLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVD 873 (1033) Q Consensus 838 Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~ 873 (1033) ++..|+|||+++|..++. .|.|+++.+..++ T Consensus 187 nIpGV~GIG~KtA~kLL~-----~ygsle~i~~~~d 217 (259) T smart00475 187 NIPGVPGIGEKTAAKLLK-----EFGSLENILENLD 217 (259) T ss_pred CCCCCCCCCHHHHHHHHH-----HCCCHHHHHHHHH T ss_conf 899999847899999999-----8399999998398 No 219 >TIGR00387 glcD glycolate oxidase, subunit GlcD; InterPro: IPR004490 The glc locus of Escherichia coli encodes the subunits of glycolate oxidase and the glc regulator protein. The subunit GlcD is similar to that of several D-lactate dehydrogenases, including that of E. coli. Glycolate oxidase has been found to have some D-lactate dehydrogenase activity.; GO: 0008891 glycolate oxidase activity, 0009339 glycolate oxidase complex. Probab=46.46 E-value=11 Score=15.93 Aligned_cols=135 Identities=19% Similarity=0.168 Sum_probs=71.1 Q ss_pred HHHHHHHHHHH-CCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHCCCCCCCCC Q ss_conf 88778989860-63055788899999851047643344442046677655440375633323105676210014578753 Q 537021.9.peg.7 341 LDFELNVIARM-KFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERFLNPDRMSMPDFD 419 (1033) Q Consensus 341 Le~EL~vI~~~-gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERFLnp~R~~~PDID 419 (1033) +..+++-|.+. +-+.=+=+-+|=-+ .......||=+|-. +++-+-|--.-+ -|+. | T Consensus 256 ~~~~~e~~~~~n~~a~~~~~Aq~E~n----~~~~~~~~R~~A~~-a~~~~~Gtkk~~----------~~~e--------D 312 (437) T TIGR00387 256 LLEKIEKICKKNAGAVDVQIAQDEEN----ERLKLWAGRRNAFK-AASKLSGTKKPK----------YLIE--------D 312 (437) T ss_pred HHHHHHHHHHHCCCEEHHHHHHHHHH----HHHHHHHHHHHHHH-HHHHHHCCCCCC----------EEEE--------C T ss_conf 99999999862155202477763688----99998862288999-998740555755----------2562--------3 Q ss_pred CCCCHHHHHHH---HHHHHHHCCCCCEEE--CCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCHHH-HH Q ss_conf 33656878999---999983038021554--2207787788999999975279988988654310145898244344-52 Q 537021.9.peg.7 420 IDFCQDRRDEV---IRYVQNKYGHERVAQ--IITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDPAHPVSLQA-IA 493 (1033) Q Consensus 420 iDf~~~~R~~v---i~y~~~kyG~~~va~--i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~~~~~~~~~-~~ 493 (1033) +=+|-+.=.++ |.-+-.||+.-+||. |.+|+.-+. +--+-. +- T Consensus 313 ~tVPR~~Lp~~l~~I~~ia~Ky~~~~~ANF~i~~~gHaGD-------------------------------GNlHP~i~~ 361 (437) T TIGR00387 313 GTVPRSKLPEVLKGIADIASKYDDLAVANFKIFIVGHAGD-------------------------------GNLHPLILT 361 (437) T ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEEEEECCC-------------------------------CCCCCEEEC T ss_conf 3004567789999999999751863454026778874177-------------------------------875432223 Q ss_pred CCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCE Q ss_conf 29887886430742026889872011221002447440 Q 537021.9.peg.7 494 DDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAG 531 (1033) Q Consensus 494 ~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~G 531 (1033) +...-.++-..++-+.++|+.|-+|+|... |=|==| T Consensus 362 d~~~~~~~er~e~~~~EI~e~a~~LGGtiS--GEHGIG 397 (437) T TIGR00387 362 DEEDKGEMERVEELGGEILELAIELGGTIS--GEHGIG 397 (437) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHCCCCEE--EECCCC T ss_conf 788666899999999999998722277055--311546 No 220 >cd00008 53EXOc 5'-3' exonuclease; T5 type 5'-3' exonuclease domains may co-occur with DNA polymerase I (Pol I) domains, or be part of Pol I containing complexes. They digest dsDNA and ssDNA, releasing mono-,di- and tri-nucleotides, as well as oligonucleotides, and have also been reported to possess RNase H activity. Also called 5' nuclease family, involved in structure-specific cleavage of flaps formed by Pol I activity (similar to mammalian flap endonuclease I, FEN-1). A single nucleic acid strand may be threaded through the 5' nuclease enzyme before cleavage occurs. The domain binds two divalent metal ions which are necessary for activity. Probab=46.39 E-value=16 Score=14.55 Aligned_cols=30 Identities=27% Similarity=0.614 Sum_probs=18.2 Q ss_pred HHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCC Q ss_conf 26858969899999999983799999789997348 Q 537021.9.peg.7 839 LAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVD 873 (1033) Q Consensus 839 l~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~ 873 (1033) +..|+|||.+++..++. .|.|+++.+..++ T Consensus 185 IpGV~GiG~KtA~kLl~-----~~gsle~i~~~~~ 214 (240) T cd00008 185 IPGVPGIGEKTAAKLLK-----EYGSLEGILENLD 214 (240) T ss_pred CCCCCCCCHHHHHHHHH-----HHCCHHHHHHHHH T ss_conf 89999857899999999-----8099999998298 No 221 >PRK05395 3-dehydroquinate dehydratase; Provisional Probab=46.10 E-value=12 Score=15.61 Aligned_cols=19 Identities=26% Similarity=0.497 Sum_probs=9.3 Q ss_pred HHHHHHCCHHHHHHHHHHH Q ss_conf 9898606305578889999 Q 537021.9.peg.7 346 NVIARMKFSGYFLIVADFI 364 (1033) Q Consensus 346 ~vI~~~gf~~YFLiV~Div 364 (1033) .+|..+|..+|+|-+.-++ T Consensus 122 g~I~G~G~~gY~lAl~~l~ 140 (143) T PRK05395 122 GVICGFGADGYLLALEALA 140 (143) T ss_pred EEEECCCHHHHHHHHHHHH T ss_conf 8995288478999999999 No 222 >TIGR00689 rpiB_lacA_lacB sugar-phosphate isomerases, RpiB/LacA/LacB family; InterPro: IPR003500 This entry represents the sugar isomerase enzymes ribose 5-phosphate isomerase B (rpiB), galactose isomerase subunit A (LacA) and galactose isomerase subunit B (LacB). Galactose-6-phosphate isomerase (5.3.1.26 from EC) is a heteromultimeric protein consisting of subunits LacA and LacB, and catalyses the conversion of D-galactose 6-phosphate to D-tagatose and 6-phosphate in the tagatose 6-phosphate pathway of lactose catabolism . Galactose-6-phosphate isomerase is induced by galactose or lactose. This entry represents the LacB subunit. Ribose 5-phosphate isomerase (5.3.1.6 from EC) forms a homodimer and catalyses the interconversion of D-ribose 5-phosphate and D-ribulose 5-phosphate in the non-oxidative branch of the pentose phosphate pathway. This reaction permits the synthesis of ribose from other sugars, as well as the recycling of sugars from nucleotide breakdown. Two unrelated enzymes can catalyse this reaction: RpiA (found in most organisms) and RpiB (found in some bacteria and eukaryotes). RpiB is also involved in metabolism of the rare sugar, allose, in addition to ribose sugars. The structures of RpiA and RpiB are distinct, RpiB having a Rossmann-type alpha/beta/alpha sandwich topology .; GO: 0005975 carbohydrate metabolic process. Probab=45.69 E-value=11 Score=15.91 Aligned_cols=36 Identities=11% Similarity=0.190 Sum_probs=16.3 Q ss_pred HHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHH Q ss_conf 999999999818989980883007878899998766 Q 537021.9.peg.7 197 ESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLM 232 (1033) Q Consensus 197 ~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~ 232 (1033) .+.+.++=++.++.++=...-.|-...||+.+-.++ T Consensus 13 K~~I~~hL~~~~~~~iD~G~~~~~~~~DYPdYA~~V 48 (146) T TIGR00689 13 KEKIIEHLEQKGLEVIDLGTLEYDESVDYPDYAKLV 48 (146) T ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCCCCHHHHHHH T ss_conf 999999874368415750676788998874789999 No 223 >KOG3717 consensus Probab=45.54 E-value=5.8 Score=18.18 Aligned_cols=53 Identities=15% Similarity=0.079 Sum_probs=25.6 Q ss_pred HHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHH-HHHCCCCCEEECCCHHHHHHH Q ss_conf 440375633323105676210014578753336568789999999-830380215542207787788 Q 537021.9.peg.7 390 LTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYV-QNKYGHERVAQIITFGSLQAK 455 (1033) Q Consensus 390 LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~-~~kyG~~~va~i~t~~t~~~k 455 (1033) +|=-..|==+|..+|---=-|++. .|.++.+. .-.++.+||.-|..-..|... T Consensus 141 ~~~~plcM~Qy~~lf~t~R~Pg~~-------------~D~~~~~~~~~n~e~~hV~Vi~~nqff~L~ 194 (612) T KOG3717 141 LGGQPLCMNQYYRLFSSCRVPGVG-------------QDTIVNQSKTYNPEPEHVTVICRNQFFELD 194 (612) T ss_pred HCCCCCCHHHHHHHHHCCCCCCCC-------------CCCCCCCCCCCCCCCCEEEEEECCCEEEEE T ss_conf 568963389998775134379998-------------543101234358898659999858379999 No 224 >PRK08104 consensus Probab=45.44 E-value=16 Score=14.44 Aligned_cols=119 Identities=15% Similarity=0.180 Sum_probs=47.4 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEE---E Q ss_conf 9989999999978998899811651774999999999769963899999985476643344565666677539999---5 Q 537021.9.peg.7 27 LSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLL---V 103 (1033) Q Consensus 27 ~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlL---A 103 (1033) +++++++++ .+-+++--..+..-+.+..+++.+.||+- +|+++......+.....+.. .+++++= . T Consensus 6 ~~~~~ll~~-----~~iipVir~~~~~~a~~la~al~~gGi~~---iEiTlrt~~a~~~I~~l~~~---~p~~~vGaGTV 74 (212) T PRK08104 6 TSAEDIFTR-----GPVVPVIVINKLEHAVPLAKALVAGGVRV---LEVTLRTPCALEAIRAIAKE---VPEAIVGAGTV 74 (212) T ss_pred CCHHHHHHH-----CCEEEEEECCCHHHHHHHHHHHHHCCCCE---EEEECCCCHHHHHHHHHHHH---CCCCEEEEEEC T ss_conf 899999974-----98689997799999999999999879988---99968881499999999986---89856854202 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCC-CHHHHHHHCC Q ss_conf 6968999999999899732487787400789998634898899819767-9678888779 Q 537021.9.peg.7 104 STAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSS-GPIDRAFFFN 162 (1033) Q Consensus 104 kN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~-g~i~~~l~~~ 162 (1033) .|.+-.+ +.+. .-..|--.|.++.+.+..-+..++.++.||.. +++.+++..| T Consensus 75 ~~~e~~~---~ai~---aGA~FiVSP~~~~~v~~~a~~~~i~~iPGv~TpsEi~~A~~~G 128 (212) T PRK08104 75 LNPQQLA---EVTE---AGAQFAISPGLTEELLKAATEGTIPLIPGISTVSELMLGMDYG 128 (212) T ss_pred CCHHHHH---HHHH---CCCCEEECCCCCHHHHHHHHHCCCCEECCCCCHHHHHHHHHCC T ss_conf 6799999---9998---5999998489999999999982997656769999999999879 No 225 >TIGR00644 recJ single-stranded-DNA-specific exonuclease RecJ; InterPro: IPR004610 All proteins in this family are 5'-3' single-strand DNA exonucleases. These proteins are used in some aspects of mismatch repair, recombination, and recombinational repair. RecJ is a single-stranded DNA-specific endonuclease which is required for many types of recombination events, although the stringency of the requirement for RecJ appears to vary with the type of recombinational event monitored, and the other recombination gene products which are available.; GO: 0008409 5'-3' exonuclease activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=45.39 E-value=16 Score=14.44 Aligned_cols=52 Identities=15% Similarity=0.222 Sum_probs=23.7 Q ss_pred HHHHHHHCC--CCEEEEEECCCHHHHH-HHHHHHHHHHHHCCCEEEECCCCCCCHH Q ss_conf 999987659--8689999439911589-9999999999818989980883007878 Q 537021.9.peg.7 171 LLTFKKLFG--DRLYVNLQRHRGYDRY-RESQVVQLAYTHELPLVATNNSLFLSEE 223 (1033) Q Consensus 171 l~~l~~~Fg--d~~ylEl~~~~~~e~~-~~~~l~~lA~~~~iPlVaTndv~Y~~~~ 223 (1033) +..+-+.|| .++..-|. ++..|.. .+..++.-+.+.|+-||.|=|+=-..-+ T Consensus 75 l~~~L~~lGp~~~~~~~IP-nR~~egYG~~~~~~~~~~~~G~~LiiTVD~Gi~a~~ 129 (705) T TIGR00644 75 LVEFLRELGPYVQVDYYIP-NRFTEGYGLSPEALRELKENGVSLIITVDNGISAHE 129 (705) T ss_pred HHHHHHHCCCCCEEEEEEC-CCCCCCCCCCHHHHHHHHHCCCEEEEECCCCHHHHH T ss_conf 9999995099732466626-635777788989999998669839998268742699 No 226 >PTZ00129 40S ribosomal protein S14; Provisional Probab=45.32 E-value=12 Score=15.46 Aligned_cols=27 Identities=33% Similarity=0.567 Sum_probs=10.6 Q ss_pred CCCCCCHHHHHHHHHHH---CCCCCHHHCC Q ss_conf 34444204667765544---0375633323 Q 537021.9.peg.7 375 GPGRGSGAGSVTAYALT---ITDIDPLRFS 401 (1033) Q Consensus 375 GpGRGSAagSLVaY~Lg---IT~VDPi~~~ 401 (1033) ||||=||--.|-+==|. |.||-||.|| T Consensus 108 GpGresAIRaL~~~Gl~I~~I~DVTPiPHn 137 (150) T PTZ00129 108 GPGAQSALRSLARSGLKIGRIEDVTPIPTD 137 (150) T ss_pred CCCHHHHHHHHHHCCCEEEEEEECCCCCCC T ss_conf 864667789987479889898877999999 No 227 >pfam01367 5_3_exonuc 5'-3' exonuclease, C-terminal SAM fold. Probab=45.26 E-value=16 Score=14.42 Aligned_cols=31 Identities=26% Similarity=0.637 Sum_probs=24.7 Q ss_pred EHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCC Q ss_conf 026858969899999999983799999789997348 Q 537021.9.peg.7 838 SLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVD 873 (1033) Q Consensus 838 Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~ 873 (1033) ++-.|+|||+++|..++. .|.|++.++...+ T Consensus 19 nIPGv~GiG~KtA~~Ll~-----~~gsle~i~~n~d 49 (100) T pfam01367 19 NIPGVPGIGEKTAAKLLK-----EYGSLENIYENLD 49 (100) T ss_pred CCCCCCCCCCHHHHHHHH-----HCCCHHHHHHHHH T ss_conf 889999988168999999-----8198999998198 No 228 >pfam08423 Rad51 Rad51. Rad51 is a DNA repair and recombination protein and is a homologue of the bacterial ATPase RecA protein. Probab=45.26 E-value=16 Score=14.42 Aligned_cols=23 Identities=22% Similarity=0.199 Sum_probs=18.9 Q ss_pred HHHHHHHHHHHCCCEEEECCCCC Q ss_conf 99999999981898998088300 Q 537021.9.peg.7 197 ESQVVQLAYTHELPLVATNNSLF 219 (1033) Q Consensus 197 ~~~l~~lA~~~~iPlVaTndv~Y 219 (1033) ...|..+|.++++++|.||.|.- T Consensus 172 l~~L~~lA~~~~~aVvvTNQV~~ 194 (261) T pfam08423 172 LRSLQRLADEFGVAVVITNQVVA 194 (261) T ss_pred HHHHHHHHHHCCCEEEEEEEEEE T ss_conf 99999999980958999604798 No 229 >COG0052 RpsB Ribosomal protein S2 [Translation, ribosomal structure and biogenesis] Probab=45.25 E-value=16 Score=14.42 Aligned_cols=64 Identities=14% Similarity=0.217 Sum_probs=29.5 Q ss_pred HHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCC-CCCEEECCCHHHHHHHHHHHHHHHHCCCCH Q ss_conf 567621001457875333656878999999983038-021554220778778899999997527998 Q 537021.9.peg.7 404 FERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYG-HERVAQIITFGSLQAKAALRDVGRALQMPY 469 (1033) Q Consensus 404 FERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG-~~~va~i~t~~t~~~k~aird~~r~~g~~~ 469 (1033) .-+|+-.+|...==||+.=...+=.+..++|.+-=. ...|.-++|- -+++-+|++.|+..|.++ T Consensus 27 m~~fIf~~RngihIIDL~kT~~~l~~A~~~v~~~~~~~g~ILfVgTK--~~a~~~V~~~A~r~g~~y 91 (252) T COG0052 27 MKPFIFGERNGIHIIDLQKTLERLREAYKFLRRIAANGGKILFVGTK--KQAQEPVKEFAERTGAYY 91 (252) T ss_pred CCCCCEEECCCCEEEEHHHHHHHHHHHHHHHHHHHCCCCEEEEEECH--HHHHHHHHHHHHHHCCCE T ss_conf 46650000277179879998999999999999997289979999520--887899999999959951 No 230 >PRK11507 hypothetical protein; Provisional Probab=45.03 E-value=16 Score=14.40 Aligned_cols=57 Identities=16% Similarity=0.282 Sum_probs=41.5 Q ss_pred EEECCCCHHHHHHHHHHCCCC-CHHHHHCCCCCCEEEEEEEEEEEEEECCCCCCEEEE Q ss_conf 211638558999999874897-688985055898899999996112101588876999 Q 537021.9.peg.7 961 FYFSGHPLDVYKPILRKMGIK-GYEESVATIRGNNIRLAAMVVSKQQKKTRKGSRIGW 1017 (1033) Q Consensus 961 f~lS~HPl~~~~~~l~~~~~~-~~~~~~~~~~g~~v~iaG~V~~i~~~~TKkG~~maf 1017 (1033) |.|-+||.=..-..|+-.+.. +-.+.........|.+-|.|..+++++-..|+...| T Consensus 4 f~L~~~~yIeL~~lLK~~glv~sGg~AK~~Ia~G~V~VnGevEtRKr~Ki~~GdvV~f 61 (70) T PRK11507 4 FSLGKHPHVELCDLLKLEGWSESGAQAKIAIAEGQVKVDGAVETRKRCKIVAGQTVSF 61 (70) T ss_pred EECCCCCCEEHHHHHHHCCCCCCCHHHHHHEECCEEEECCEEEEEEEEEEECCCEEEE T ss_conf 8638987100888884717204658778654187689999996331025108999999 No 231 >cd05152 MPH2' Macrolide 2'-Phosphotransferase (MPH2'). MPH2' is part of a larger superfamily that includes the catalytic domains of other kinases, such as the typical serine/threonine/tyrosine protein kinases (PKs), RIO kinases, actin-fragmin kinase (AFK), and phosphoinositide 3-kinase (PI3K). MPH2' catalyzes the transfer of the gamma-phosphoryl group from ATP to the 2'-hydroxyl of macrolide antibiotics such as erythromycin, clarithromycin, and azithromycin, among others. Macrolides penetrate the bacterial cell and bind to ribosomes, where it interrupts protein elongation, leading ultimately to the demise of the bacterium. Phosphorylation of macrolides leads to their inactivation. Based on substrate specificity and amino acid sequence, MPH2' is divided into types I and II, encoded by mphA and mphB genes, respectively. MPH2'I inactivates 14-membered ring macrolides while MPH2'II inactivates both 14- and 16-membered ring macrolides. Enzymatic inactivation of macrolides has been reported Probab=44.96 E-value=12 Score=15.66 Aligned_cols=15 Identities=33% Similarity=0.361 Sum_probs=6.2 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 899999887789898 Q 537021.9.peg.7 335 QDYRQRLDFELNVIA 349 (1033) Q Consensus 335 ~~y~~RLe~EL~vI~ 349 (1033) +++.+|++.|..+.. T Consensus 46 ~e~~~~l~~E~~~L~ 60 (276) T cd05152 46 PDVSERAAAEKRVLA 60 (276) T ss_pred HHHHHHHHHHHHHHH T ss_conf 789999999999999 No 232 >pfam00633 HHH Helix-hairpin-helix motif. The helix-hairpin-helix DNA-binding motif is found to be duplicated in the central domain of RuvA. The HhH domain of DisA, a bacterial checkpoint control protein, is a DNA-binding domain. Probab=44.92 E-value=12 Score=15.42 Aligned_cols=18 Identities=44% Similarity=0.665 Sum_probs=14.1 Q ss_pred EHHHHCCCCHHHHHHHHH Q ss_conf 026858969899999999 Q 537021.9.peg.7 838 SLAAIKGVGTTTARHIME 855 (1033) Q Consensus 838 Gl~~Ikgvg~~~~~~Ive 855 (1033) -|.+|+|||+++++.|.+ T Consensus 12 eL~~lpGVG~~tA~~I~~ 29 (30) T pfam00633 12 ELLALPGVGPKTAEAILS 29 (30) T ss_pred HHHHCCCCCHHHHHHHHC T ss_conf 997288977688998853 No 233 >PRK04301 radA DNA repair and recombination protein RadA; Validated Probab=44.91 E-value=16 Score=14.38 Aligned_cols=18 Identities=28% Similarity=0.307 Sum_probs=10.2 Q ss_pred HHHHHHHHCCCEEEECCC Q ss_conf 999999818989980883 Q 537021.9.peg.7 200 VVQLAYTHELPLVATNNS 217 (1033) Q Consensus 200 l~~lA~~~~iPlVaTndv 217 (1033) |.++|+++++++|.||.| T Consensus 236 L~~lA~~~niaVvvTNQV 253 (318) T PRK04301 236 LLRLADLYNAAVVVTNQV 253 (318) T ss_pred HHHHHHHCCCEEEEEEEE T ss_conf 999999859579996136 No 234 >COG2231 Uncharacterized protein related to Endonuclease III [DNA replication, recombination, and repair] Probab=44.70 E-value=12 Score=15.65 Aligned_cols=12 Identities=33% Similarity=0.371 Sum_probs=4.6 Q ss_pred HHHHHHHHHHHH Q ss_conf 899999999987 Q 537021.9.peg.7 165 QVAEKRLLTFKK 176 (1033) Q Consensus 165 ~~a~~~l~~l~~ 176 (1033) +.+++.+..++. T Consensus 46 knvekAlenLk~ 57 (215) T COG2231 46 KNVEKALENLKN 57 (215) T ss_pred HHHHHHHHHHHH T ss_conf 999999999988 No 235 >COG2426 Predicted membrane protein [Function unknown] Probab=44.60 E-value=17 Score=14.35 Aligned_cols=68 Identities=26% Similarity=0.380 Sum_probs=45.7 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHH-HHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHH Q ss_conf 8999876541034201246432457838999998877-898986063055788899999851047643344442046677 Q 537021.9.peg.7 308 ENELRNKAVAGLEVRLAQGAIAKGYHAQDYRQRLDFE-LNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVT 386 (1033) Q Consensus 308 ~~~Lr~l~~~Gl~~R~~~~~~~~~~~~~~y~~RLe~E-L~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLV 386 (1033) +....++++.+++. .. .....|.+|-.+- =..+++-||.+--+-|+ |+. ||-|-=.|.|- T Consensus 46 d~im~kl~~~rl~r-~~-------~lY~~~~~r~~rka~~yVER~G~iGL~iFvA----------IPL-P~TG~wtgaLa 106 (142) T COG2426 46 DRIMLKLKWTRLQR-PA-------CLYDWLVNRTRRKAKGYVERYGFIGLIIFVA----------IPL-PGTGAWTGALA 106 (142) T ss_pred HHHHHHHHHCCCCC-HH-------HHHHHHHHHHHHHCCCCHHHHHHHHHHHEEE----------CCC-CCCCHHHHHHH T ss_conf 99999986114576-29-------9999999999986057076653525032245----------368-87407689999 Q ss_pred HHHHHCCC Q ss_conf 65544037 Q 537021.9.peg.7 387 AYALTITD 394 (1033) Q Consensus 387 aY~LgIT~ 394 (1033) ||+|||.+ T Consensus 107 A~llgI~~ 114 (142) T COG2426 107 AYLLGIRE 114 (142) T ss_pred HHHHCCCH T ss_conf 99976862 No 236 >COG0384 Predicted epimerase, PhzC/PhzF homolog [General function prediction only] Probab=44.06 E-value=12 Score=15.63 Aligned_cols=90 Identities=18% Similarity=0.252 Sum_probs=52.3 Q ss_pred HHHHHHHHHHHHHHHHHHCCHH--HHHHHHHHH---------HHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCC Q ss_conf 3899999887789898606305--578889999---------98510476433444420466776554403756333231 Q 537021.9.peg.7 334 AQDYRQRLDFELNVIARMKFSG--YFLIVADFI---------QWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSL 402 (1033) Q Consensus 334 ~~~y~~RLe~EL~vI~~~gf~~--YFLiV~Div---------~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~L 402 (1033) ..+-.++++--++.+......+ ++...+.-. .||-..||.==|.-|||+|+|.+|+-.--.... +.++ T Consensus 170 s~~av~~~~pd~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~R~FaP~~Gi~EDPaTGSA~~~La~yl~~~~~~~~-~~~~ 248 (291) T COG0384 170 SLEALDALRPDFSALTELSAGGGGVYVFAREGAGAEADFHARMFAPGIGVVEDPATGSAAGALAAYLAKHGGLPD-KLRL 248 (291) T ss_pred CHHHHHHCCCCHHHHHHHCCCCCCEEEEEECCCCCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCC-CCCE T ss_conf 889997669887998752336662599974057887758998135324887777764546589999998688887-6538 Q ss_pred HHHHHHHHCCCCCCCCCCCCCH Q ss_conf 0567621001457875333656 Q 537021.9.peg.7 403 LFERFLNPDRMSMPDFDIDFCQ 424 (1033) Q Consensus 403 lFERFLnp~R~~~PDIDiDf~~ 424 (1033) ++|-=--.+|.+.=.+-++-++ T Consensus 249 ~~~QG~~~gR~s~i~~~~~~~~ 270 (291) T COG0384 249 LIEQGTELGRPSRIEVRVDEDG 270 (291) T ss_pred EEEECCCCCCCCEEEEEEECCC T ss_conf 9982323699848999996368 No 237 >pfam03659 Glyco_hydro_71 Glycosyl hydrolase family 71. Family of alpha-1,3-glucanases. Probab=44.04 E-value=17 Score=14.28 Aligned_cols=63 Identities=17% Similarity=0.184 Sum_probs=45.5 Q ss_pred CHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEE----ECCCHHHHHHHHHHHHHHCCCCEEEEEEE Q ss_conf 133433144476443299899999999789988998----11651774999999999769963899999 Q 537021.9.peg.7 11 FVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAI----TDTNNLFSALEFSQKACAAGIQPIIGCQL 75 (1033) Q Consensus 11 FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAI----TD~~nl~Gav~F~~ackk~gIKPIiG~E~ 75 (1033) |+|..|--.++ .....-+.=|+.|++.|+.+.|| +|.-+..-+...+++|.+.|.|..+-.+. T Consensus 3 fAHfmvgn~~~--~t~~dw~~Di~~A~aagIDgFALNig~~d~~~~~~l~~ay~aA~~~gFkLFlSfD~ 69 (386) T pfam03659 3 FAHFMVGNTYN--YTPSDWESDIQLAQAAGIDAFALNIGSVDSYTDTQLALAYEAAAANGFKLFLSFDY 69 (386) T ss_pred EEEEEECCCCC--CCHHHHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHHHHHHHCCCEEEEECCC T ss_conf 79998607888--99899999999999849986776056788774999999999998669689995765 No 238 >PRK05929 consensus Probab=43.72 E-value=17 Score=14.25 Aligned_cols=72 Identities=6% Similarity=0.077 Sum_probs=31.8 Q ss_pred CCCCCHHHHHHH-------HHHHHHHHHHHHH--HHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCEEE Q ss_conf 479038999998-------9999999998865--4312777866665555655776668999988899985788726211 Q 537021.9.peg.7 893 CFGYSRMQLLQS-------LDNIQKYAQWVEK--NRTNKHENIFAHEKGTFSDKITLEKFSVENSSVRFENEQRVLGFYF 963 (1033) Q Consensus 893 ~~~~~R~~ll~~-------i~~~~~~~~~~~~--~~~~~q~~LF~~~~~~~~~~~~~~~~~~~~~~e~l~~E~e~LGf~l 963 (1033) .+|.+|.++-.- .+.+..+.....+ .+......||+= .-.+|++..- ...+-..|+.++--.+ T Consensus 720 ~L~is~~eA~~~i~~Yf~~yp~v~~y~~~~~~~a~~~Gyv~Tl~gR-------rr~ip~i~~~-~~~r~~~eR~AiN~pi 791 (870) T PRK05929 720 ILKISVSEAQGLIDAYFARYPEAAEFITETIEQASKNLRVTTMLGR-------ERIIDDWEEF-PGSRAASGRLAVNTRI 791 (870) T ss_pred HHCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCEECCCCC-------CCCCCCCCCC-HHHHHHHHHHHHCCCC T ss_conf 8498999999999999986912999999999999976967676886-------0028865677-4567788898818766 Q ss_pred CCCCHHHHH Q ss_conf 638558999 Q 537021.9.peg.7 964 SGHPLDVYK 972 (1033) Q Consensus 964 S~HPl~~~~ 972 (1033) -|.-=|..| T Consensus 792 QGtAADiiK 800 (870) T PRK05929 792 QGSAAELIK 800 (870) T ss_pred CHHHHHHHH T ss_conf 648999999 No 239 >COG1148 HdrA Heterodisulfide reductase, subunit A and related polyferredoxins [Energy production and conversion] Probab=43.67 E-value=17 Score=14.24 Aligned_cols=36 Identities=6% Similarity=0.064 Sum_probs=20.2 Q ss_pred CCCHHHHHHHHHHCCCCEEEEECCCHHHH---HHHHHHHHHH Q ss_conf 29989999999978998899811651774---9999999997 Q 537021.9.peg.7 26 ALSLNNILDKIAADQQPAIAITDTNNLFS---ALEFSQKACA 64 (1033) Q Consensus 26 ~~~~e~LV~~A~~~G~~AiAITD~~nl~G---av~F~~ackk 64 (1033) ...+++|.+.|+..|. ...||--|.+ .-.-.+..++ T Consensus 15 ~vDi~~l~~~ak~~gv---~v~d~~~~CSd~gqe~i~~dike 53 (622) T COG1148 15 VVDIEALKEFAKLEGV---VVADYPYMCSDPGQEMIKKDIKE 53 (622) T ss_pred EEEHHHHHHHHHCCCE---EEEEEEECCCCHHHHHHHHHHHH T ss_conf 4579999998740784---88633111486568999999997 No 240 >PRK07898 consensus Probab=43.55 E-value=17 Score=14.23 Aligned_cols=76 Identities=18% Similarity=0.274 Sum_probs=39.3 Q ss_pred CCCCCHHHHHHHH-------HHHHHHHHHHH-HHHHCC-CCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCEEE Q ss_conf 4790389999989-------99999999886-543127-77866665555655776668999988899985788726211 Q 537021.9.peg.7 893 CFGYSRMQLLQSL-------DNIQKYAQWVE-KNRTNK-HENIFAHEKGTFSDKITLEKFSVENSSVRFENEQRVLGFYF 963 (1033) Q Consensus 893 ~~~~~R~~ll~~i-------~~~~~~~~~~~-~~~~~~-q~~LF~~~~~~~~~~~~~~~~~~~~~~e~l~~E~e~LGf~l 963 (1033) .+|.+|.++-.-+ +.+..|..... ..+..| ...||+= .-.+|++..-+...+-..|+.++-..+ T Consensus 749 ~l~i~~~eA~~~i~~yf~~yp~v~~~~~~~~~~a~~~gyv~Tl~gR-------rr~~p~i~s~n~~~r~~~eR~A~N~pi 821 (902) T PRK07898 749 QLKISTEEAKELMDAYFARFGGVRDYLREVVEQARKDGYTETLFGR-------RRYLPDLTSDNRQRREAAERAALNAPI 821 (902) T ss_pred HHCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCEEECCCCC-------CCCCCCCCCCCHHHHHHHHHHHHCCCC T ss_conf 7298999999999999986906999999999999977906787877-------113887778897789899999828666 Q ss_pred CCCCHHHHHHHH Q ss_conf 638558999999 Q 537021.9.peg.7 964 SGHPLDVYKPIL 975 (1033) Q Consensus 964 S~HPl~~~~~~l 975 (1033) -|.-=|..|-.+ T Consensus 822 QGtaADiiK~AM 833 (902) T PRK07898 822 QGSAADIIKVAM 833 (902) T ss_pred CHHHHHHHHHHH T ss_conf 606999999999 No 241 >pfam03344 Daxx Daxx Family. The Daxx protein (also known as the Fas-binding protein) is thought to play a role in apoptosis, but precise role played by Daxx remains to be determined. Daxx forms a complex with Axin. Probab=43.46 E-value=17 Score=14.22 Aligned_cols=55 Identities=24% Similarity=0.341 Sum_probs=28.0 Q ss_pred CHHHHHHHH--CCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHC Q ss_conf 105676210--0145787533365687899999998303802155422077877889999999752 Q 537021.9.peg.7 402 LLFERFLNP--DRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRAL 465 (1033) Q Consensus 402 LlFERFLnp--~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~ 465 (1033) =--|||+|. .+..+||+- |-.| ++.-..+|++ ...-.-....-|+-|.||||-.| T Consensus 280 rrier~in~Pe~~~~fPDy~----Dil~--~v~~an~rh~---L~l~~~~~~~lAqeaFr~vG~~l 336 (715) T pfam03344 280 RRIERFINKPEAQDTPPDYG----DILQ--AVERANERHG---LGLSRKQLQQMAQDAFREVGNRL 336 (715) T ss_pred HHHHHHHCCCCCCCCCCCHH----HHHH--HHHHHHHHHC---CCCCHHHHHHHHHHHHHHHHHHH T ss_conf 99999863855466898779----9999--9998877643---59889999999999999988999 No 242 >PRK05586 biotin carboxylase; Validated Probab=43.29 E-value=17 Score=14.20 Aligned_cols=68 Identities=15% Similarity=0.241 Sum_probs=35.2 Q ss_pred CEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEE--ECCHHHHHHHHHHHHHH Q ss_conf 8899811651774999999999769963899999985476643344565666677539999--56968999999999899 Q 537021.9.peg.7 42 PAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLL--VSTAEGYQRLIELVSRM 119 (1033) Q Consensus 42 ~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlL--AkN~~Gy~nL~kL~S~a 119 (1033) +-|-|+..|- -+++..++|++-||+++.- |-..+....... ..-..+.+ +...+-|-|.-+++..+ T Consensus 3 ~kvLIANRGE--IA~Ri~rt~~~lgi~tVav---ys~~D~~a~hv~-------~Ade~~~lg~~~~~~sYln~~~ii~~A 70 (447) T PRK05586 3 KKILIANRGE--IAVRIIRACREMGIETVAV---YSEIDKDALHVQ-------LADEAVCIGPASSKDSYLNIYNILSAT 70 (447) T ss_pred CEEEEECCCH--HHHHHHHHHHHCCCCEEEE---CCCCCCCCHHHH-------HCCEEEECCCCCHHHHHCCHHHHHHHH T ss_conf 5499989879--9999999999839959999---170336785476-------498998739999565114899999999 Q ss_pred HH Q ss_conf 73 Q 537021.9.peg.7 120 YL 121 (1033) Q Consensus 120 ~~ 121 (1033) .. T Consensus 71 ~~ 72 (447) T PRK05586 71 VL 72 (447) T ss_pred HH T ss_conf 98 No 243 >COG1263 PtsG Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific [Carbohydrate transport and metabolism] Probab=43.05 E-value=11 Score=16.02 Aligned_cols=14 Identities=21% Similarity=0.320 Sum_probs=6.5 Q ss_pred HHHHHHHHHHHHHC Q ss_conf 87899999998303 Q 537021.9.peg.7 425 DRRDEVIRYVQNKY 438 (1033) Q Consensus 425 ~~R~~vi~y~~~ky 438 (1033) -.+.-|+-++.+|| T Consensus 128 ~~~G~V~~~l~~~~ 141 (393) T COG1263 128 IYQGTVIPILYNRF 141 (393) T ss_pred CEEHHHHHHHHHHH T ss_conf 50038999999999 No 244 >cd00530 PTE Phosphotriesterase (PTE) catalyzes the hydrolysis of organophosphate nerve agents, including the chemical warfare agents VX, soman, and sarin as well as the insecticide paraoxon. PTE exists as a homodimer with one active site per monomer. The active site is located next to a binuclear metal center, at the C-terminal end of a TIM alpha- beta barrel motif. The native enzyme contains two zinc ions at the active site however these can be replaced with other metals such as cobalt, cadmium, nickel or manganese and the enzyme remains active. Probab=43.01 E-value=17 Score=14.17 Aligned_cols=74 Identities=22% Similarity=0.285 Sum_probs=41.6 Q ss_pred CCCEEEECCCCCCHHH--HHHHCCCHHHHHHHHHHHHHHCCCCEEEEEECCC-HHHHHHHHHHHHHHHHHCCCEEEECCC Q ss_conf 8988998197679678--8887799789999999998765986899994399-115899999999999818989980883 Q 537021.9.peg.7 141 TEGLIMLTGGSSGPID--RAFFFNSSQVAEKRLLTFKKLFGDRLYVNLQRHR-GYDRYRESQVVQLAYTHELPLVATNNS 217 (1033) Q Consensus 141 ~egLIvlsg~~~g~i~--~~l~~~~~~~a~~~l~~l~~~Fgd~~ylEl~~~~-~~e~~~~~~l~~lA~~~~iPlVaTndv 217 (1033) .|+|++-+++.....+ ........+.+.+.+..|++ .|.+-.++.+..+ +.+ ...+.++|++.|+.+|++.-- T Consensus 6 HEHl~~d~~~~~~~~~~~~~~~l~~~~~~~~el~~~k~-~Gg~tiVD~T~~g~GRd---~~~L~~is~~tgv~IV~~TG~ 81 (293) T cd00530 6 HEHLIIDSSGFVRDPPEVDDFDLADVEAAKEELKRFRA-HGGRTIVDATPPGIGRD---VEKLAEVARATGVNIVAATGF 81 (293) T ss_pred CCCEEECCHHHHCCCCCCCCHHHCCHHHHHHHHHHHHH-HCCCEEEECCCCCCCCC---HHHHHHHHHHCCCEEEECCCC T ss_conf 48867557886258988864002059999999999998-48987995478666769---999999999639828967878 Q ss_pred C Q ss_conf 0 Q 537021.9.peg.7 218 L 218 (1033) Q Consensus 218 ~ 218 (1033) | T Consensus 82 y 82 (293) T cd00530 82 Y 82 (293) T ss_pred C T ss_conf 8 No 245 >cd00466 DHQase_II Dehydroquinase (DHQase), type II. Dehydroquinase (or 3-dehydroquinate dehydratase) catalyzes the reversible dehydration of 3-dehydroquinate to form 3-dehydroshikimate. This reaction is part of two metabolic pathways: the biosynthetic shikimate pathway and the catabolic quinate pathway. There are two types of DHQases, which are distinct from each other in amino acid sequence and three-dimensional structure. Type I enzymes usually catalyze the biosynthetic reaction using a syn elimination mechanism. In contrast, type II enzymes, found in the quinate pathway of fungi and in the shikimate pathway of many bacteria, are dodecameric enzymes that employ an anti elimination reaction mechanism. Probab=42.86 E-value=14 Score=14.84 Aligned_cols=17 Identities=35% Similarity=0.491 Sum_probs=8.0 Q ss_pred HHHHHHCCHHHHHHHHH Q ss_conf 98986063055788899 Q 537021.9.peg.7 346 NVIARMKFSGYFLIVAD 362 (1033) Q Consensus 346 ~vI~~~gf~~YFLiV~D 362 (1033) .+|..+|..+|+|-+.- T Consensus 120 g~I~G~G~~gY~lAl~~ 136 (140) T cd00466 120 GVIAGLGADGYRLALEA 136 (140) T ss_pred EEEECCCHHHHHHHHHH T ss_conf 89970785789999999 No 246 >PRK11302 DNA-binding transcriptional regulator HexR; Provisional Probab=42.83 E-value=17 Score=14.14 Aligned_cols=36 Identities=11% Similarity=0.113 Sum_probs=16.9 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEECC Q ss_conf 98689999439911589999999999981898998088 Q 537021.9.peg.7 179 GDRLYVNLQRHRGYDRYRESQVVQLAYTHELPLVATNN 216 (1033) Q Consensus 179 gd~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTnd 216 (1033) ++++.+.+...+... -...+.++|++.|+|+|+..+ T Consensus 175 ~~Dvvi~iS~sG~t~--e~i~~~~~Ak~~ga~vIaIT~ 210 (284) T PRK11302 175 DGDVVVLISHTGRTK--SLVELAQLARENGATVIGITA 210 (284) T ss_pred CCCEEEEECCCCCCH--HHHHHHHHHHHCCCCEEEECC T ss_conf 988899973799987--999999999987995899779 No 247 >TIGR00021 rpiA ribose 5-phosphate isomerase A; InterPro: IPR004788 Ribose 5-phosphate isomerase (5.3.1.6 from EC), also known as phosphoriboisomerase, catalyses the conversion of D-ribose 5-phosphate to D-ribulose 5-phosphate in the non-oxidative branch of the pentose phosphate pathway.; GO: 0004751 ribose-5-phosphate isomerase activity, 0009052 pentose-phosphate shunt non-oxidative branch. Probab=42.74 E-value=18 Score=14.13 Aligned_cols=63 Identities=21% Similarity=0.332 Sum_probs=42.7 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCEEEEECC-CEEE-EEHHHHCCCCHHHH-HHHHHHHHCCCCCCHHHH Q ss_conf 99988886368503552334210035319-8698-50268589698999-999999837999997899 Q 537021.9.peg.7 804 KFCQDARQFNIQIMPPSVNTPCVDFKVGD-NRIY-YSLAAIKGVGTTTA-RHIMEASADKPFDSLEDF 868 (1033) Q Consensus 804 ~~i~e~~~~gi~vl~PdIN~S~~~f~~~~-~~Ir-~Gl~~Ikgvg~~~~-~~Ive~r~~g~f~sl~df 868 (1033) .--+-|+..||++-..| ..+..+=+|++ +.|= -.|..|||-|.... |+||+.... .|==+-|- T Consensus 55 ~t~~lA~~~GIpl~~l~-~~~~lDltIDGADEvdl~~l~lIKGGGgALlrEKIva~~s~-~~ivIaDe 120 (236) T TIGR00021 55 QTAELARELGIPLSSLD-EVPELDLTIDGADEVDLPNLNLIKGGGGALLREKIVASASK-RFIVIADE 120 (236) T ss_pred HHHHHHHHCCCEEECCC-CCCEEEEEEECCCCCCCCCCCEEECCCHHHHHHHHHHHHCC-EEEEEECC T ss_conf 99999998298040225-57720068606341030337643132067889999987528-06999727 No 248 >PRK05907 hypothetical protein; Provisional Probab=42.55 E-value=12 Score=15.65 Aligned_cols=18 Identities=50% Similarity=0.859 Sum_probs=10.0 Q ss_pred HHHHHHHCCCCCHHHHCC Q ss_conf 899876201410122114 Q 537021.9.peg.7 607 ETYHLLTTKGTLGIFQLE 624 (1033) Q Consensus 607 ~~~~l~~~g~t~GvFQ~e 624 (1033) .+|.||.+-+|+||||.| T Consensus 62 e~~glFaS~evi~I~q~e 79 (311) T PRK05907 62 ETFGLFASKETIGIYQAE 79 (311) T ss_pred HHHCCCCCCCEEEEHHHH T ss_conf 862241223010003197 No 249 >PRK07178 acetyl-CoA carboxylase; Validated Probab=42.38 E-value=18 Score=14.09 Aligned_cols=68 Identities=13% Similarity=0.110 Sum_probs=34.6 Q ss_pred CEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEE-ECCHHHHHHHHHHHHHHH Q ss_conf 8899811651774999999999769963899999985476643344565666677539999-569689999999998997 Q 537021.9.peg.7 42 PAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLL-VSTAEGYQRLIELVSRMY 120 (1033) Q Consensus 42 ~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlL-AkN~~Gy~nL~kL~S~a~ 120 (1033) +-|-|+.+|- -++|..++|++-||+++.= |-..+..... ....+ ..+.+ +...+-|-|.-+++..+. T Consensus 3 ~kvLIANRGE--IA~RiiRt~~elgi~tVav---ys~~D~~a~h----v~~AD---ea~~ig~~~~~sYln~~~Ii~~A~ 70 (471) T PRK07178 3 KKILIANRGE--IAVRIVRACAEMGIRSVAI---YSEADRHALH----VKRAD---EAHSIGADPLAGYLNPRKLVNLAV 70 (471) T ss_pred CEEEEECCCH--HHHHHHHHHHHCCCCEEEE---ECHHHCCCHH----HHHCC---EEEECCCCHHHHHCCHHHHHHHHH T ss_conf 6089977869--9999999999839948999---0837566836----88488---888718872665449999999999 Q ss_pred H Q ss_conf 3 Q 537021.9.peg.7 121 L 121 (1033) Q Consensus 121 ~ 121 (1033) . T Consensus 71 ~ 71 (471) T PRK07178 71 E 71 (471) T ss_pred H T ss_conf 9 No 250 >PRK08097 ligB NAD-dependent DNA ligase LigB; Reviewed Probab=42.12 E-value=18 Score=14.06 Aligned_cols=18 Identities=22% Similarity=0.506 Sum_probs=13.3 Q ss_pred HHHHCCCCHHHHHHHHHH Q ss_conf 268589698999999999 Q 537021.9.peg.7 839 LAAIKGVGTTTARHIMEA 856 (1033) Q Consensus 839 l~~Ikgvg~~~~~~Ive~ 856 (1033) |.+|.|||+..+++|++. T Consensus 523 l~~i~gIG~~~A~si~~f 540 (563) T PRK08097 523 WQQLPGIGEGRARQLIAF 540 (563) T ss_pred HHCCCCCCHHHHHHHHHH T ss_conf 955798489999999999 No 251 >TIGR03314 Se_ssnA putative selenium metabolism protein SsnA. Members of this protein family are found exclusively in genomes that contain putative set of labile selenium-dependent enzyme accessory proteins as well as homologs of a labile selenium-dependent purine hydroxylase. A mutant in this gene in Escherichia coli had improved stationary phase viability. The function is unknown. Probab=42.08 E-value=18 Score=14.06 Aligned_cols=22 Identities=23% Similarity=0.493 Sum_probs=11.3 Q ss_pred HHHHCCCHHHHHHHHHHHCCCHH Q ss_conf 87717404888876654114113 Q 537021.9.peg.7 632 LEGMQPDCIEDIIALVSLYRPGP 654 (1033) Q Consensus 632 l~~~~p~~~~dl~~~~al~RPGp 654 (1033) +..+.|-.--||+.+- +-.|.| T Consensus 362 iGsL~~Gk~ADlv~~d-~~~p~~ 383 (441) T TIGR03314 362 FGRLEPGAKADLIIVD-YNAPTP 383 (441) T ss_pred CEEECCCCCCCEEEEC-CCCCCC T ss_conf 2513787605789994-999865 No 252 >PRK09875 putative hydrolase; Provisional Probab=42.03 E-value=18 Score=14.05 Aligned_cols=72 Identities=13% Similarity=0.158 Sum_probs=40.4 Q ss_pred CCCEEEECCCCCCHHHHHHHCCCHHHHHHHHHHHHHHCCCCEEEEEECCC-HHHHHHHHHHHHHHHHHCCCEEEECCCC Q ss_conf 89889981976796788887799789999999998765986899994399-1158999999999998189899808830 Q 537021.9.peg.7 141 TEGLIMLTGGSSGPIDRAFFFNSSQVAEKRLLTFKKLFGDRLYVNLQRHR-GYDRYRESQVVQLAYTHELPLVATNNSL 218 (1033) Q Consensus 141 ~egLIvlsg~~~g~i~~~l~~~~~~~a~~~l~~l~~~Fgd~~ylEl~~~~-~~e~~~~~~l~~lA~~~~iPlVaTndv~ 218 (1033) .|+|++-+++..+..... ....+.+.+.+..|+.. |.+-.++.+..+ +.+ ...+.++|++.|+.+|++.--| T Consensus 12 HEHl~id~~~~~~~~d~~--ld~~~~~~~El~~~k~~-Gg~tiVd~T~~g~GRd---~~~l~~is~~tGv~IV~~TG~y 84 (292) T PRK09875 12 HEHLHIDLSGFKNNVDCR--LDQYAFICQEMNDLMTR-GVRNVIEMTNRYMGRN---AQFMLDVMRETGINVVACTGYY 84 (292) T ss_pred CCCEEECCHHHCCCCCCC--CCCHHHHHHHHHHHHHC-CCCEEEECCCCCCCCC---HHHHHHHHHHCCCCEEECCCCC T ss_conf 567352475343187844--36399999999999980-9997997788776859---9999999997299099817778 No 253 >KOG0932 consensus Probab=41.84 E-value=18 Score=14.03 Aligned_cols=14 Identities=29% Similarity=0.688 Sum_probs=5.6 Q ss_pred HHHHHHHHHHHHHC Q ss_conf 87899999998303 Q 537021.9.peg.7 425 DRRDEVIRYVQNKY 438 (1033) Q Consensus 425 ~~R~~vi~y~~~ky 438 (1033) ..|+.|+..--.|| T Consensus 372 qeRERvL~hFS~Ry 385 (774) T KOG0932 372 QERERVLRHFSRRY 385 (774) T ss_pred HHHHHHHHHHHHHH T ss_conf 88899999987776 No 254 >COG0821 gcpE 1-hydroxy-2-methyl-2-(e)-butenyl 4-diphosphate synthase [Lipid metabolism] Probab=41.69 E-value=18 Score=14.01 Aligned_cols=164 Identities=20% Similarity=0.234 Sum_probs=75.0 Q ss_pred CHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCC Q ss_conf 97899999999987659868999943991158999999999998189899808830078788999987668846985765 Q 537021.9.peg.7 163 SSQVAEKRLLTFKKLFGDRLYVNLQRHRGYDRYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQ 242 (1033) Q Consensus 163 ~~~~a~~~l~~l~~~Fgd~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~ 242 (1033) ....++..+.+.+.+.. .=-++.|-..++..-...+.++.++.++|+|| |.||..+ -++.|+..+ T Consensus 31 ~T~Dv~aTv~QI~~L~~--aG~dIVRvtv~~~e~A~A~~~Ik~~~~vPLVa--DiHf~~r------la~~~~~~g----- 95 (361) T COG0821 31 DTADVEATVAQIKALER--AGCDIVRVTVPDMEAAEALKEIKQRLNVPLVA--DIHFDYR------LALEAAECG----- 95 (361) T ss_pred CCCCHHHHHHHHHHHHH--CCCCEEEEECCCHHHHHHHHHHHHHCCCCEEE--EEECCHH------HHHHHHHCC----- T ss_conf 73017999999999998--18978999369978999999999847998798--7305689------999756427----- Q ss_pred CCCCCCCCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH-HHCCCCCC--CCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 310024534323434788751220358998788641110001332-21023455--575688776898899987654103 Q 537021.9.peg.7 243 KDRPRVTPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQV-HAPILPRF--IEKECDDIQKEEENELRNKAVAGL 319 (1033) Q Consensus 243 ~~~~~~~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~-~~~~lP~f--~~p~~~~~~~~~~~~Lr~l~~~Gl 319 (1033) .+..|.|+.-+- +.+ .|.++-++.... -++=|+-+.+-- .+..+-+| |+|+ . | +...+ T Consensus 96 ~~k~RINPGNig-~~~----~v~~vVe~Ak~~-g~piRIGVN~GSLek~~~~ky~~pt~e---------a-l---veSAl 156 (361) T COG0821 96 VDKVRINPGNIG-FKD----RVREVVEAAKDK-GIPIRIGVNAGSLEKRLLEKYGGPTPE---------A-L---VESAL 156 (361) T ss_pred CCEEEECCCCCC-CHH----HHHHHHHHHHHC-CCCEEEECCCCCHHHHHHHHHCCCCHH---------H-H---HHHHH T ss_conf 426987786567-367----799999999975-998799526686169999985479878---------9-9---99999 Q ss_pred HHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCC Q ss_conf 4201246432457838999998877898986063055788899999851047643 Q 537021.9.peg.7 320 EVRLAQGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPV 374 (1033) Q Consensus 320 ~~R~~~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~v 374 (1033) ..-++.++|.+- ++|-.+.-++-|+.|.-+-..|+.-++++ T Consensus 157 -------------~~a~~~e~l~f~-~i~iS~K~Sdv~~~v~aYr~lA~~~dyPL 197 (361) T COG0821 157 -------------EHAELLEELGFD-DIKVSVKASDVQLMVAAYRLLAKRCDYPL 197 (361) T ss_pred -------------HHHHHHHHCCCC-CEEEEEECCCHHHHHHHHHHHHHHCCCCC T ss_conf -------------999999977998-67999873878999999999998668772 No 255 >cd07218 Pat_iPLA2 Calcium-independent phospholipase A2; Classified as Group IVA-1 PLA2. Calcium-independent phospholipase A2; otherwise known as Group IVA-1 PLA2. It contains the lipase consensus sequence (Gly-X-Ser-X-Gly);mutagenesis experiments confirm the role of this serine as a nucleophile. Some members of this group show triacylglycerol lipase activity (EC 3:1:1:3). Members include iPLA-1, iPLA-2, and iPLA-3 from Aedes aegypti and show acylglycerol transacylase/lipase activity. Also includes putative iPLA2-eta from Pediculus humanus corporis which shows patatin-like phospholipase activity. Probab=41.42 E-value=16 Score=14.53 Aligned_cols=22 Identities=27% Similarity=0.305 Sum_probs=15.0 Q ss_pred CCCCCHHHHHHHHHHHCCCCCHH Q ss_conf 44442046677655440375633 Q 537021.9.peg.7 376 PGRGSGAGSVTAYALTITDIDPL 398 (1033) Q Consensus 376 pGRGSAagSLVaY~LgIT~VDPi 398 (1033) +-=||+||||+|.+|- +++|+- T Consensus 33 ~~~GaSAGaL~A~~l~-~~~~~~ 54 (245) T cd07218 33 KISGASAGALAACCLL-CDLPLG 54 (245) T ss_pred EEEEEHHHHHHHHHHH-CCCCHH T ss_conf 5788829999999998-399999 No 256 >TIGR01012 Sa_S2_E_A ribosomal protein S2; InterPro: IPR005707 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . This family describes the ribosomal protein of the eukaryotic cytosol and of Archaea, homologous to S2 of bacteria. It is designated typically as Sa in eukaryotes and Sa or S2 in the archaea. ; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015935 small ribosomal subunit. Probab=41.41 E-value=13 Score=15.31 Aligned_cols=20 Identities=25% Similarity=0.340 Sum_probs=12.0 Q ss_pred HHHHHHHHHHCCCEEEECCC Q ss_conf 99999999818989980883 Q 537021.9.peg.7 198 SQVVQLAYTHELPLVATNNS 217 (1033) Q Consensus 198 ~~l~~lA~~~~iPlVaTndv 217 (1033) ++.+.=|.+-|||+||..|. T Consensus 122 ~Qa~~EA~~vgiP~vALcDT 141 (197) T TIGR01012 122 HQALKEASEVGIPIVALCDT 141 (197) T ss_pred CCHHHHHHCCCCCEEEEECC T ss_conf 21212000158966898758 No 257 >TIGR00873 gnd 6-phosphogluconate dehydrogenase, decarboxylating; InterPro: IPR006113 6-Phosphogluconate dehydrogenase (1.1.1.44 from EC) (6PGD) is an oxidative carboxylase that catalyses the decarboxylating reduction of 6-phosphogluconate into ribulose 5-phosphate in the presence of NADP. This reaction is a component of the hexose mono-phosphate shunt and pentose phosphate pathways (PPP) , . Prokaryotic and eukaryotic 6PGD are proteins of about 470 amino acids whose sequence are highly conserved . The protein is a homodimer in which the monomers act independently : each contains a large, mainly alpha-helical domain and a smaller beta-alpha-beta domain, containing a mixed parallel and anti-parallel 6-stranded beta sheet . NADP is bound in a cleft in the small domain, the substrate binding in an adjacent pocket . This model does not specify whether the cofactor is NADP only, NAD only, or both.; GO: 0004616 phosphogluconate dehydrogenase (decarboxylating) activity, 0050661 NADP binding, 0006098 pentose-phosphate shunt. Probab=41.37 E-value=11 Score=16.00 Aligned_cols=78 Identities=18% Similarity=0.373 Sum_probs=51.0 Q ss_pred CCCCHHHCCC--------HHHHHHHH---CCC----CCCCCCCCCCHH--HHHHHHHHHHHHCCCCCEEECCCHHHHHHH Q ss_conf 3756333231--------05676210---014----578753336568--789999999830380215542207787788 Q 537021.9.peg.7 393 TDIDPLRFSL--------LFERFLNP---DRM----SMPDFDIDFCQD--RRDEVIRYVQNKYGHERVAQIITFGSLQAK 455 (1033) Q Consensus 393 T~VDPi~~~L--------lFERFLnp---~R~----~~PDIDiDf~~~--~R~~vi~y~~~kyG~~~va~i~t~~t~~~k 455 (1033) |.+|.|..|. .|-||||- +|. .+|-+=.=++.. .+++-|++|++ .=+++.|+.|. |.= T Consensus 268 T~~~ALd~G~P~tlI~EsVfAR~lSs~K~~R~~A~~~l~g~p~~~~~~~~D~~~fi~~vr~---ALY~sKI~SYA--QGF 342 (480) T TIGR00873 268 TAIDALDLGVPLTLITESVFARYLSSLKEERVAASKLLSGDPLAEEKALQDKEEFIEDVRQ---ALYASKIVSYA--QGF 342 (480) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHH---HHHHHHHHHHH--HHH T ss_conf 7667653389820799999999884128999999973488965677863466889999999---98877799899--989 Q ss_pred HHHHHHHHHCC--CCHHHHHHH Q ss_conf 99999997527--998898865 Q 537021.9.peg.7 456 AALRDVGRALQ--MPYSQVDRL 475 (1033) Q Consensus 456 ~aird~~r~~g--~~~~~~~~~ 475 (1033) +-+|.+++-+| +++.++-.+ T Consensus 343 ~ll~~aS~~yGWdLnlg~IAli 364 (480) T TIGR00873 343 MLLKEASEEYGWDLNLGEIALI 364 (480) T ss_pred HHHHHHHHHCCCCCCCHHHHHH T ss_conf 9999999872889981358877 No 258 >cd04491 SoSSB_OBF SoSSB_OBF: A subfamily of OB folds similar to the OB fold of the crenarchaeote Sulfolobus solfataricus single-stranded (ss) DNA-binding protein (SSoSSB). SSoSSB has a single OB fold, and it physically and functionally interacts with RNA polymerase. In vitro, SSoSSB can substitute for the basal transcription factor TBP, stimulating transcription from promoters under conditions in which TBP is limiting, and supporting transcription when TBP is absent. SSoSSB selectively melts the duplex DNA of promoter sequences. It also relieves transcriptional repression by the chromatin Alba. In addition, SSoSSB activates reverse gyrase activity, which involves DNA binding, DNA cleavage, strand passage and ligation. SSoSSB stimulates all these steps in the presence of the chromatin protein, Sul7d. SSoSSB antagonizes the inhibitory effect of Sul7d on reverse gyrase supercoiling activity. It also physically and functionally interacts with Mini-chromosome Maintenance (MCM), stimulating Probab=41.33 E-value=18 Score=13.97 Aligned_cols=36 Identities=14% Similarity=0.255 Sum_probs=24.2 Q ss_pred EEEEEEEEEEEC--CCCC--CEEEEEEEECCCCEEEEEEC Q ss_conf 999996112101--5888--76999999849704999953 Q 537021.9.peg.7 997 LAAMVVSKQQKK--TRKG--SRIGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 997 iaG~V~~i~~~~--TKkG--~~mafltleD~~g~~e~~iF 1032 (1033) +.|.|+++-.++ +++| ..+.-+.+.|+||.+.+++| T Consensus 2 v~~~V~~~~~~r~f~~dg~~~~v~~~~i~D~TG~ir~t~W 41 (82) T cd04491 2 VEGKVLSISEPREFTRDGSEGKVQSGLVGDETGTIRFTLW 41 (82) T ss_pred EEEEEEECCCCEEEEECCCEEEEEEEEEECCCCEEEEEEE T ss_conf 8999998778879973997689999999989987999996 No 259 >COG0402 SsnA Cytosine deaminase and related metal-dependent hydrolases [Nucleotide transport and metabolism / General function prediction only] Probab=41.28 E-value=18 Score=13.97 Aligned_cols=61 Identities=23% Similarity=0.210 Sum_probs=26.8 Q ss_pred CCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEE--ECCCHHHHHHHHHHHHHHCCCCE Q ss_conf 88133433144476443299899999999789988998--11651774999999999769963 Q 537021.9.peg.7 9 PSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAI--TDTNNLFSALEFSQKACAAGIQP 69 (1033) Q Consensus 9 ~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAI--TD~~nl~Gav~F~~ackk~gIKP 69 (1033) |.||..|+|...|+..|......+..--...-++..+. +....-.++..+...+.++|+.- T Consensus 57 Pgfvn~H~H~~~t~~~g~~~~~~l~~wl~~~~~~~~~~~~~~e~~~~~a~~~~~e~l~~G~t~ 119 (421) T COG0402 57 PGFVNAHTHLDQTLLRGLADDLPLLEWLERYVWPREARLLTEEDLYARALLALLEMLRNGTTT 119 (421) T ss_pred CCCCCCCCCHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCE T ss_conf 696224528577601234666532366766434555414999999999999999999669768 No 260 >smart00146 PI3Kc Phosphoinositide 3-kinase, catalytic domain. Phosphoinositide 3-kinase isoforms participate in a variety of processes, including cell motility, the Ras pathway, vesicle trafficking and secretion, and apoptosis. These homologues may be either lipid kinases and/or protein kinases: the former phosphorylate the 3-position in the inositol ring of inositol phospholipids. The ataxia telangiectesia-mutated gene produced, the targets of rapamycin (TOR) and the DNA-dependent kinase have not been found to possess lipid kinase activity. Some of this family possess PI-4 kinase activities. Probab=41.27 E-value=13 Score=15.35 Aligned_cols=26 Identities=23% Similarity=0.365 Sum_probs=18.7 Q ss_pred CCCHHHHHHHHHHHCCCCCHHHCCCHHH Q ss_conf 4420466776554403756333231056 Q 537021.9.peg.7 378 RGSGAGSVTAYALTITDIDPLRFSLLFE 405 (1033) Q Consensus 378 RGSAagSLVaY~LgIT~VDPi~~~LlFE 405 (1033) +--|+.|+|+|+|||.+=-|= |+++. T Consensus 91 ~S~A~~s~~~yilglgDRH~~--Nili~ 116 (202) T smart00146 91 RSCAGYSVITYILGLGDRHND--NIMLD 116 (202) T ss_pred HHHHHHHHHHHHHCCCCCCCC--CEEEE T ss_conf 999999999998216889866--37881 No 261 >cd02742 GH20_hexosaminidase Beta-N-acetylhexosaminidases of glycosyl hydrolase family 20 (GH20) catalyze the removal of beta-1,4-linked N-acetyl-D-hexosamine residues from the non-reducing ends of N-acetyl-beta-D-hexosaminides including N-acetylglucosides and N-acetylgalactosides. These enzymes are broadly distributed in microorganisms, plants and animals, and play roles in various key physiological and pathological processes. These processes include cell structural integrity, energy storage, cellular signaling, fertilization, pathogen defense, viral penetration, the development of carcinomas, inflammatory events and lysosomal storage disorders. The GH20 enzymes include the eukaryotic beta-N-acetylhexosaminidases A and B, the bacterial chitobiases, dispersin B, and lacto-N-biosidase. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by the solvent or the enzyme, but by the substrate itself. Probab=41.26 E-value=9.2 Score=16.52 Aligned_cols=16 Identities=31% Similarity=0.451 Sum_probs=12.1 Q ss_pred HHHHHHHHHCCCCCCC Q ss_conf 8999998510476433 Q 537021.9.peg.7 360 VADFIQWAKKNNIPVG 375 (1033) Q Consensus 360 V~Div~~ak~~gI~vG 375 (1033) +.+||+||+++||.|= T Consensus 74 i~eiv~yA~~rgI~vI 89 (303) T cd02742 74 LKDIIEYAAARGIEVI 89 (303) T ss_pred HHHHHHHHHHCCCEEE T ss_conf 9999999998499897 No 262 >PRK03980 flap endonuclease-1; Provisional Probab=41.23 E-value=18 Score=13.96 Aligned_cols=58 Identities=22% Similarity=0.449 Sum_probs=41.0 Q ss_pred HHHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHC Q ss_conf 333231056762100145787533365687899999998303802155422077877889999999752 Q 537021.9.peg.7 397 PLRFSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRAL 465 (1033) Q Consensus 397 Pi~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~ 465 (1033) |-.|.-.+|=|+||.=+ +|+++.+..-..+.+++|+.+.+| |+.=+-+.++..+-++. T Consensus 226 ~~~~~e~r~lF~~p~v~--~~~~l~~~~pd~e~l~~fL~~e~~---------fse~RV~~~i~kl~k~~ 283 (295) T PRK03980 226 PVDPEEIREFFLNPPVT--DDYELKWKKPDKEGIIEFLVEEHD---------FSEDRVEKALERLKKAL 283 (295) T ss_pred CCCHHHHHHHHCCCCCC--CCCCCCCCCCCHHHHHHHHHHCCC---------CCHHHHHHHHHHHHHHH T ss_conf 99979999984799988--878878799998999999876429---------89999999999999874 No 263 >PTZ00254 40S ribosomal protein SA; Provisional Probab=41.11 E-value=18 Score=13.95 Aligned_cols=22 Identities=18% Similarity=0.345 Sum_probs=15.8 Q ss_pred HHHHHHHHHHHHCCCEEEECCC Q ss_conf 9999999999818989980883 Q 537021.9.peg.7 196 RESQVVQLAYTHELPLVATNNS 217 (1033) Q Consensus 196 ~~~~l~~lA~~~~iPlVaTndv 217 (1033) ...+.+.=|.+.+||+||..|. T Consensus 129 ~d~~AV~EA~~~nIPvIal~DT 150 (242) T PTZ00254 129 TDHQAIREASYVNIPVIALCDT 150 (242) T ss_pred CCHHHHHHHHHCCCCEEEEECC T ss_conf 4408999999859988986169 No 264 >smart00278 HhH1 Helix-hairpin-helix DNA-binding motif class 1. Probab=40.98 E-value=15 Score=14.76 Aligned_cols=19 Identities=42% Similarity=0.704 Sum_probs=15.9 Q ss_pred HHHHCCCCHHHHHHHHHHH Q ss_conf 2685896989999999998 Q 537021.9.peg.7 839 LAAIKGVGTTTARHIMEAS 857 (1033) Q Consensus 839 l~~Ikgvg~~~~~~Ive~r 857 (1033) |..|+|||++.++.|++.. T Consensus 3 L~~v~GIG~k~A~~ll~~~ 21 (26) T smart00278 3 LLKVPGIGPKTAEKILEAX 21 (26) T ss_pred CCCCCCCCCHHHHHHHHHH T ss_conf 1017998811599999976 No 265 >TIGR00884 guaA_Cterm GMP synthase, C-terminal domain; InterPro: IPR001674 The amidotransferase family of enzymes utilises the ammonia derived from the hydrolysis of glutamine for a subsequent chemical reaction catalyzed by the same enzyme. The ammonia intermediate does not dissociate into solution during the chemical transformations . GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. The C-terminal domain is specific to the GMP synthases 6.3.5.2 from EC. In prokaryotes this domain mediates dimerisation. Eukaryotic GMP synthases are monomers. This domain in eukaryotes includes several large insertions that may form globular domains .; GO: 0003922 GMP synthase (glutamine-hydrolyzing) activity, 0005524 ATP binding, 0006164 purine nucleotide biosynthetic process, 0006177 GMP biosynthetic process. Probab=40.89 E-value=19 Score=13.92 Aligned_cols=112 Identities=17% Similarity=0.256 Sum_probs=62.7 Q ss_pred CCCEEEECCCCCCHHHHHHHCCCHHHHHHHHHHHHHHCCCCEEEEEECCC-HHHHHHHHHHHHHHHHHCCCEEEECCCCC Q ss_conf 89889981976796788887799789999999998765986899994399-11589999999999981898998088300 Q 537021.9.peg.7 141 TEGLIMLTGGSSGPIDRAFFFNSSQVAEKRLLTFKKLFGDRLYVNLQRHR-GYDRYRESQVVQLAYTHELPLVATNNSLF 219 (1033) Q Consensus 141 ~egLIvlsg~~~g~i~~~l~~~~~~~a~~~l~~l~~~Fgd~~ylEl~~~~-~~e~~~~~~l~~lA~~~~iPlVaTndv~Y 219 (1033) ++-|++||||.+|-+.-.|. ++..|||++-=--+|+ ....+...=...|...+++|++ | T Consensus 17 ~~vi~ALSGGVDSsV~A~L~--------------hrAIGD~L~~vFVD~GLlR~gE~E~V~~~F~~~lg~nl~------~ 76 (319) T TIGR00884 17 AKVIIALSGGVDSSVAAVLL--------------HRAIGDRLTCVFVDHGLLRKGEAERVVKTFSDKLGLNLV------V 76 (319) T ss_pred CEEEEEECCCHHHHHHHHHH--------------HHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHCCCCE------E T ss_conf 46899810881589999999--------------864227604898227888766378999998753089827------8 Q ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHH----HHHHHHHCCCCCC Q ss_conf 787889999876688469857653100245343234347887512203589987886411100----0133221023455 Q 537021.9.peg.7 220 LSEEDYEAHDVLMAVAHSTVVSQKDRPRVTPDHYLKNRSEMVSIFSDLPEALENTVEIARRCS----FILQVHAPILPRF 295 (1033) Q Consensus 220 ~~~~D~~~h~iL~aI~~~~~l~~~~~~~~~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~----~~l~~~~~~lP~f 295 (1033) .+.+|..+... .|.+=.+.+|.- =+..|. +-|. +.|++++ +++-.-...+|-. T Consensus 77 VDA~e~FL~~L-----~GV~DPE~KRKI-IG~~FI-------~VFE----------~~A~~~~~~~~a~yL~QGTlYPDv 133 (319) T TIGR00884 77 VDAKERFLSAL-----KGVTDPEEKRKI-IGRVFI-------EVFE----------REAEKIGDKKKAEYLVQGTLYPDV 133 (319) T ss_pred ECCCHHHHHHC-----CCCCCCHHHCCH-HHHHHH-------HHHH----------HHHHHCCCCCEEEEEEECCCCCCE T ss_conf 76107988752-----889870341311-225667-------7788----------899850799779999734344746 No 266 >PRK05973 replicative DNA helicase; Provisional Probab=40.75 E-value=16 Score=14.39 Aligned_cols=19 Identities=16% Similarity=0.137 Sum_probs=12.8 Q ss_pred HHHHHHHHHHCCCEEEECC Q ss_conf 9999999981898998088 Q 537021.9.peg.7 198 SQVVQLAYTHELPLVATNN 216 (1033) Q Consensus 198 ~~l~~lA~~~~iPlVaTnd 216 (1033) ..+..+|+++++|+||... T Consensus 173 RsLK~lAkEl~vPVvaLSQ 191 (237) T PRK05973 173 RALKSFARKRGVILVFISQ 191 (237) T ss_pred HHHHHHHHHHCCCEEEEEC T ss_conf 9999999986993999400 No 267 >PRK12999 pyruvate carboxylase; Reviewed Probab=40.42 E-value=19 Score=13.87 Aligned_cols=45 Identities=24% Similarity=0.234 Sum_probs=22.3 Q ss_pred HHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCE Q ss_conf 4329989999999978998899811651774999999999769963 Q 537021.9.peg.7 24 EGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQP 69 (1033) Q Consensus 24 dg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKP 69 (1033) ++...++.+++.|++.|..||-=- +|=|+--.+|.++|.++||.- T Consensus 62 ~sYL~~~~ii~~A~~~~~dAiHPG-YGFLSEn~~Fa~~~~~~gi~f 106 (1147) T PRK12999 62 EAYLDIDEIIRVAKQAGVDAIHPG-YGFLSENPEFARACAEAGITF 106 (1147) T ss_pred HHHCCHHHHHHHHHHHCCCEEECC-CCHHHCCHHHHHHHHHCCCEE T ss_conf 111299999999999498999779-622007999999999878999 No 268 >cd00142 PI3Kc_like Phosphoinositide 3-kinase (PI3K)-like family, catalytic domain; The PI3K-like catalytic domain family is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. Members of the family include PI3K, phosphoinositide 4-kinase (PI4K), PI3K-related protein kinases (PIKKs), and TRansformation/tRanscription domain-Associated Protein (TRRAP). PI3Ks catalyze the transfer of the gamma-phosphoryl group from ATP to the 3-hydroxyl of the inositol ring of D-myo-phosphatidylinositol (PtdIns) or its derivatives, while PI4K catalyze the phosphorylation of the 4-hydroxyl of PtdIns. PIKKs are protein kinases that catalyze the phosphorylation of serine/threonine residues, especially those that are followed by a glutamine. PI3Ks play an important role in a variety of fundamental cellular processes, including cell motility, the Probab=40.02 E-value=12 Score=15.56 Aligned_cols=26 Identities=27% Similarity=0.400 Sum_probs=18.7 Q ss_pred CCCHHHHHHHHHHHCCCCCHHHCCCHHH Q ss_conf 4420466776554403756333231056 Q 537021.9.peg.7 378 RGSGAGSVTAYALTITDIDPLRFSLLFE 405 (1033) Q Consensus 378 RGSAagSLVaY~LgIT~VDPi~~~LlFE 405 (1033) +--|+.|+++|+|||-+=-|= |+++. T Consensus 118 ~S~A~~sv~~yvlgigDRH~~--NIli~ 143 (219) T cd00142 118 SSLAGYSVAGYILGIGDRHPD--NIMID 143 (219) T ss_pred HHHHHHHHHHHHEECCCCCCC--CEEEE T ss_conf 999999999983132689987--18898 No 269 >cd00443 ADA_AMPD Adenosine/AMP deaminase. Adenosine deaminases (ADAs) are present in pro- and eukaryotic organisms and catalyze the zinc dependent irreversible deamination of adenosine nucleosides to inosine nucleosides and ammonia. The eukaryotic AMP deaminase catalyzes a similar reaction leading to the hydrolytic removal of an amino group at the 6 position of the adenine nucleotide ring, a branch point in the adenylate catabolic pathway. Probab=39.92 E-value=14 Score=15.00 Aligned_cols=22 Identities=36% Similarity=0.632 Sum_probs=20.1 Q ss_pred HHHHHHCHHHHHHHCCCHHHHHHHHHH Q ss_conf 334331444764432998999999997 Q 537021.9.peg.7 12 VHLQVHSSYSLLEGALSLNNILDKIAA 38 (1033) Q Consensus 12 vhLh~hS~ySlldg~~~~e~LV~~A~~ 38 (1033) |+||+| |+|+++|+-+.+.|++ T Consensus 4 vELH~H-----L~Gsi~~~tl~ela~~ 25 (305) T cd00443 4 VELHAH-----LSGSISPETLLELIKK 25 (305) T ss_pred EEEEEC-----CCCCCCHHHHHHHHHH T ss_conf 357745-----6689899999999998 No 270 >pfam06026 Rib_5-P_isom_A Ribose 5-phosphate isomerase A (phosphoriboisomerase A). This family consists of several ribose 5-phosphate isomerase A or phosphoriboisomerase A (EC:5.3.1.6) from bacteria, eukaryotes and archaea. Probab=39.65 E-value=19 Score=13.78 Aligned_cols=52 Identities=17% Similarity=0.234 Sum_probs=33.8 Q ss_pred HHHHHHHCCCCCCCCCCCCCCEEEEECC-CEEEEEHHHHCCCCHHHH-HHHHHHH Q ss_conf 9988886368503552334210035319-869850268589698999-9999998 Q 537021.9.peg.7 805 FCQDARQFNIQIMPPSVNTPCVDFKVGD-NRIYYSLAAIKGVGTTTA-RHIMEAS 857 (1033) Q Consensus 805 ~i~e~~~~gi~vl~PdIN~S~~~f~~~~-~~Ir~Gl~~Ikgvg~~~~-~~Ive~r 857 (1033) --..|++.||++..++= .+..+..+++ +.|---|..|||-|.... |+|+... T Consensus 10 T~~~a~~~Gi~l~~l~~-~~~iDl~iDGADevd~~l~lIKGGGgallrEKivA~~ 63 (172) T pfam06026 10 TEELAKELGIPLSDLDE-VDELDLAIDGADEVDPNLNLIKGGGGALLREKIVASA 63 (172) T ss_pred HHHHHHHCCCEEECCCC-CCCCEEEECCHHHHCCCCCEEEECCHHHHHHHHHHHH T ss_conf 99999986996865211-7840156357677486768897081898889999986 No 271 >TIGR01325 O_suc_HS_sulf O-succinylhomoserine sulfhydrylase; InterPro: IPR006234 These sequences represent O-succinylhomoserine sulfhydrylase, one of several related pyridoxal phosphate-dependent enzymes of cysteine and methionine metabolism. This enzyme is part of an alternative pathway of homocysteine biosynthesis, a step in methionine biosynthesis.. Probab=39.61 E-value=5.3 Score=18.51 Aligned_cols=30 Identities=20% Similarity=0.345 Sum_probs=18.2 Q ss_pred CCCCCCCCCHHHHHHHHHCCCCCHHHHCCCH Q ss_conf 3433234435889987620141012211453 Q 537021.9.peg.7 596 VDLSLIPFDDHETYHLLTTKGTLGIFQLESS 626 (1033) Q Consensus 596 ~d~~~ip~~D~~~~~l~~~g~t~GvFQ~es~ 626 (1033) |...-.|.+|.+-|+-=++-+|.-+| +|+| T Consensus 119 v~v~fv~~~Dl~~WeaA~~~nTkl~f-~EtP 148 (386) T TIGR01325 119 VEVSFVDATDLKAWEAALKPNTKLVF-VETP 148 (386) T ss_pred EEEEECCCCCHHHHHHHCCCCCCEEE-EECC T ss_conf 06751786787888985699950788-6368 No 272 >PRK06111 acetyl-CoA carboxylase biotin carboxylase subunit; Validated Probab=39.53 E-value=19 Score=13.76 Aligned_cols=69 Identities=13% Similarity=0.156 Sum_probs=37.1 Q ss_pred CEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEEECC--HHHHHHHHHHHHHH Q ss_conf 8899811651774999999999769963899999985476643344565666677539999569--68999999999899 Q 537021.9.peg.7 42 PAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLLVST--AEGYQRLIELVSRM 119 (1033) Q Consensus 42 ~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlLAkN--~~Gy~nL~kL~S~a 119 (1033) +-|-|+.+|- -++|..++|++-||+++.= |-..+...... + ..-..+.+-.+ .+-|-|.-+++..+ T Consensus 3 ~kvLIANRGE--IA~Ri~rt~~elgi~tvav---ys~~D~~a~hv---~----~Ade~v~ig~~~~~~sYln~~~Ii~~A 70 (449) T PRK06111 3 QKVLIANRGE--IAVRIIRTCQKLGIRTVAI---YSEADEDALHV---K----LADEAYLIGGPRVQESYLNLEKIIEIA 70 (449) T ss_pred CEEEEECCCH--HHHHHHHHHHHCCCCEEEE---CCHHHCCCHHH---H----HCCEEEEECCCCHHHHHCCHHHHHHHH T ss_conf 6189987879--9999999999849979999---17364568358---7----498988808997454333999999999 Q ss_pred HHC Q ss_conf 732 Q 537021.9.peg.7 120 YLL 122 (1033) Q Consensus 120 ~~~ 122 (1033) ... T Consensus 71 ~~~ 73 (449) T PRK06111 71 KKT 73 (449) T ss_pred HHH T ss_conf 983 No 273 >TIGR00639 PurN phosphoribosylglycinamide formyltransferase; InterPro: IPR004607 This family describes phosphoribosylglycinamide formyltransferase (GAR transformylase), one of several proteins found in formyltransferase. This enzyme uses formyl tetrahydrofolate as a formyl group donor to produce 5'-phosphoribosyl-N-formylglycinamide. PurT, a different GAR transformylase, uses ATP and formate rather than formyl tetrahydrofolate. Experimental proof includes complementation of Escherichia coli purN mutants by orthologs from vertebrates (where it is a domain of a multifunctional protein), Bacillus subtilis, and Arabidopsis thaliana. In phylogenetic analyses, the member from Saccharomyces cerevisiae shows a long branch length but membership in the family, while the formyltetrahydrofolate deformylases form a closely related outgroup. ; GO: 0004644 phosphoribosylglycinamide formyltransferase activity, 0006189 'de novo' IMP biosynthetic process. Probab=39.38 E-value=19 Score=13.75 Aligned_cols=159 Identities=19% Similarity=0.266 Sum_probs=79.9 Q ss_pred CHHHHHHHHHH--CCCCE---EEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 98999999997--89988---99811651774999999999769963899999985476643344565666677539999 Q 537021.9.peg.7 28 SLNNILDKIAA--DQQPA---IAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLL 102 (1033) Q Consensus 28 ~~e~LV~~A~~--~G~~A---iAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlL 102 (1033) ..+-|++++++ .+++| +-|+|.-.-+|.-| |++++|.--. |+.+ .+++ =+ T Consensus 13 NLqAiiDA~~~~~~~~~A~v~lVisn~p~A~g~~R----A~~~~iP~~v-~~~~---------------~f~s-----Gi 67 (215) T TIGR00639 13 NLQAIIDAIKEGQGKIDASVVLVISNKPKAYGLER----AAQAGIPTKV-LSLK---------------DFDS-----GI 67 (215) T ss_pred HHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHH----HHHCCCCEEE-EECC---------------CCCC-----CC T ss_conf 79999999852127867368999865866266777----8875894677-4216---------------7887-----75 Q ss_pred ECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHHHHHHHHHHHHCCCCE Q ss_conf 56968999999999899732487787400789998634898899819767967888877997899999999987659868 Q 537021.9.peg.7 103 VSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVAEKRLLTFKKLFGDRL 182 (1033) Q Consensus 103 AkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a~~~l~~l~~~Fgd~~ 182 (1033) ||..+.-+.=. ++.|.. .| -+.|.+ +.=.|||+-|-+.- +. ..|..-|.++= T Consensus 68 akeDkaaraka--------r~~~d~--~i-~~~l~~-~~vdlvvlAGfMRI------L~----------~~Fl~~f~grP 119 (215) T TIGR00639 68 AKEDKAARAKA--------REAFDQ--AI-AEELKA-AEVDLVVLAGFMRI------LG----------PTFLSAFAGRP 119 (215) T ss_pred CCCHHHHHHHH--------HHHHHH--HH-HHHHHH-HCCCEEEECCHHHH------CC----------HHHHHHHCCCC T ss_conf 31016788887--------899999--99-999996-09969998461444------16----------68898627998 Q ss_pred EEEEECC--CHHHH-H-HHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 9999439--91158-9-999999999981898998088300787889999876688469857653100245 Q 537021.9.peg.7 183 YVNLQRH--RGYDR-Y-RESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKDRPRVT 249 (1033) Q Consensus 183 ylEl~~~--~~~e~-~-~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~~~~~~ 249 (1033) -|-+..- +.+.. . -..+.++...+.+-|+.+-.-|||.+.+ +.+|-.+=+...+-+. T Consensus 120 ~lNIHPSLLP~F~Gt~ha~~qAl~~G~k~nkPlt~GcTVH~V~e~----------vD~Gpii~Qa~VPil~ 180 (215) T TIGR00639 120 ILNIHPSLLPAFPGTLHAVEQALEAGVKENKPLTSGCTVHYVDEE----------VDTGPIIVQAKVPILP 180 (215) T ss_pred EEEECCCCCCCCCCCHHHHHHHHHHCCCCCCCCCCCCEEEEEECC----------CCCCCEEEEECCCCCC T ss_conf 475075446677775578999998355437863477528888254----------6788757874020249 No 274 >KOG0555 consensus Probab=39.30 E-value=20 Score=13.74 Aligned_cols=109 Identities=20% Similarity=0.186 Sum_probs=45.3 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHH---HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHH Q ss_conf 88999876541034201246432457838999---998877898986063055788899999851047643344442046 Q 537021.9.peg.7 307 EENELRNKAVAGLEVRLAQGAIAKGYHAQDYR---QRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAG 383 (1033) Q Consensus 307 ~~~~Lr~l~~~Gl~~R~~~~~~~~~~~~~~y~---~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAag 383 (1033) ++.||.+-++--|+.-++..... .-+.+.|+ .|-..-|+--...-+.==||.--|+++-- - T Consensus 289 EeAyLTQSSQLYLEtclpAlgdv-y~I~~SyRAEkSrTRRHLsEytHVEaE~afltfd~ll~~i---------------E 352 (545) T KOG0555 289 EEAYLTQSSQLYLETCLPALGDV-YCIQQSYRAEKSRTRRHLSEYTHVEAECAFLTFDDLLDRI---------------E 352 (545) T ss_pred CHHHCCCHHHHHHHHHHHHCCCE-EEECHHHHHHHHHHHHHHHHHEEEEEECCCCCHHHHHHHH---------------H T ss_conf 40330223588898765520762-6732756655666666553311102324636488999999---------------9 Q ss_pred HHHHHHHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 677655440375633323105676210014578753336568789999999830 Q 537021.9.peg.7 384 SVTAYALTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNK 437 (1033) Q Consensus 384 SLVaY~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~k 437 (1033) -|||+...=---||+... =-+|||+=.-+ - --|-.-.-.+.|.|+.+. T Consensus 353 ~lvc~~vdr~l~dp~~~l---i~~lnP~f~~P-~--~PFkRm~Y~dAI~wLke~ 400 (545) T KOG0555 353 ALVCDSVDRLLEDPIAPL---IKQLNPDFKAP-K--RPFKRMNYSDAIEWLKEH 400 (545) T ss_pred HHHHHHHHHHHHCHHHHH---HHHHCCCCCCC-C--CCHHCCCHHHHHHHHHHC T ss_conf 999999999986701566---88749898888-8--812107889999999863 No 275 >PRK07556 consensus Probab=39.11 E-value=20 Score=13.72 Aligned_cols=74 Identities=20% Similarity=0.295 Sum_probs=38.2 Q ss_pred CCCCCHHHHHH-------HHHHHHHHHHHHHH-H-HHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCEEE Q ss_conf 47903899999-------89999999998865-4-312777866665555655776668999988899985788726211 Q 537021.9.peg.7 893 CFGYSRMQLLQ-------SLDNIQKYAQWVEK-N-RTNKHENIFAHEKGTFSDKITLEKFSVENSSVRFENEQRVLGFYF 963 (1033) Q Consensus 893 ~~~~~R~~ll~-------~i~~~~~~~~~~~~-~-~~~~q~~LF~~~~~~~~~~~~~~~~~~~~~~e~l~~E~e~LGf~l 963 (1033) .+|.+|.++-. ..+.+..+.....+ . +......||+= .-.+|++..-+...+-..|+..+-..+ T Consensus 821 ~l~i~~~eA~~~i~~yf~~y~~v~~~~~~~~~~a~~~gyv~Tl~gR-------rr~~p~i~s~n~~~r~~~eR~a~N~pi 893 (977) T PRK07556 821 QLGIPREEAGAYIKRYFERFPGIRAYMDATKAFAREHGYVETLFGR-------RIHYPDIKASNPSVRAFNERAAINAPI 893 (977) T ss_pred HCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEECCCCC-------EECCCCCCCCCHHHHHHHHHHHHCCCC T ss_conf 7198999999999999986803999999999999977977888787-------032875568998899799999827764 Q ss_pred CCCCHHHHHH Q ss_conf 6385589999 Q 537021.9.peg.7 964 SGHPLDVYKP 973 (1033) Q Consensus 964 S~HPl~~~~~ 973 (1033) -|.-=+..|- T Consensus 894 QGtaAdiiK~ 903 (977) T PRK07556 894 QGTAADIIRR 903 (977) T ss_pred CHHHHHHHHH T ss_conf 3559999999 No 276 >KOG0538 consensus Probab=39.06 E-value=20 Score=13.71 Aligned_cols=29 Identities=31% Similarity=0.234 Sum_probs=16.2 Q ss_pred HHHHHHHHHHHHHHHCCC-EEEECCCCCCC Q ss_conf 589999999999981898-99808830078 Q 537021.9.peg.7 193 DRYRESQVVQLAYTHELP-LVATNNSLFLS 221 (1033) Q Consensus 193 e~~~~~~l~~lA~~~~iP-lVaTndv~Y~~ 221 (1033) |+.++.++++=|++.|.+ +|.|=|..++- T Consensus 132 dr~It~~Lv~raEk~GfkAlvlTvDtP~lG 161 (363) T KOG0538 132 DRDITEQLVKRAEKAGFKALVLTVDTPRLG 161 (363) T ss_pred CHHHHHHHHHHHHHCCCEEEEEEECCCCCC T ss_conf 446899999999972966999983461126 No 277 >pfam09862 DUF2089 Protein of unknown function(DUF2089). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=38.94 E-value=18 Score=14.08 Aligned_cols=20 Identities=20% Similarity=0.599 Sum_probs=11.5 Q ss_pred HHHHHHHHHHHHCCCCHHHH Q ss_conf 78899999997527998898 Q 537021.9.peg.7 453 QAKAALRDVGRALQMPYSQV 472 (1033) Q Consensus 453 ~~k~aird~~r~~g~~~~~~ 472 (1033) +.|+.+++++|.||++|.++ T Consensus 47 ~~~Gnlke~~~~lgiSYpTv 66 (113) T pfam09862 47 KCRGNIKEVEKELGISYPTV 66 (113) T ss_pred HHCCCHHHHHHHHCCCCHHH T ss_conf 91688999999978881889 No 278 >KOG0741 consensus Probab=38.84 E-value=16 Score=14.44 Aligned_cols=12 Identities=42% Similarity=0.681 Sum_probs=5.6 Q ss_pred CCHHHHHHHHCC Q ss_conf 310567621001 Q 537021.9.peg.7 401 SLLFERFLNPDR 412 (1033) Q Consensus 401 ~LlFERFLnp~R 412 (1033) ||+=|--|.|+| T Consensus 380 DlIDEALLRPGR 391 (744) T KOG0741 380 DLIDEALLRPGR 391 (744) T ss_pred HHHHHHHCCCCC T ss_conf 667887558871 No 279 >pfam10481 Cenp-F_N Cenp-F N-terminal domain. Mitosin or centromere-associated protein-F (Cenp-F) is found bound across the centromere as one of the proteins of the outer layer of the kinetochore. Most of the kinetochore/centromere functions appear to depend upon binding of the C-terminal par to f the molecule, whereas the N-terminal part, here, may be a cytoplasmic player in controlling the function of microtubules and dynein. Probab=38.74 E-value=20 Score=13.67 Aligned_cols=131 Identities=16% Similarity=0.173 Sum_probs=75.0 Q ss_pred HHHHHHHHHHHHHHHHHHH-HHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 8998788641110001332-210234555756887768988999876541034201246432457838999998877898 Q 537021.9.peg.7 269 EALENTVEIARRCSFILQV-HAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRLAQGAIAKGYHAQDYRQRLDFELNV 347 (1033) Q Consensus 269 eAi~NT~~IAe~c~~~l~~-~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~~~~~~~~~~~~~y~~RLe~EL~v 347 (1033) .-++.+..-+..|++.+.. ..|. -.|-+| .++..|-..-.++-|+++|.+. -+-++||+-|+.+ T Consensus 123 ~elErsQq~~~sad~s~~~~~TPQ-K~F~~P------lTP~~~~~dskyedL~EKYnke--------VeerKrLE~e~k~ 187 (288) T pfam10481 123 SELERSQQAAQSADVSLNPCSTPQ-KIFATP------LTPSQYYSGSKYEDLKEKYNKE--------VEERKRLEAEVKA 187 (288) T ss_pred HHHHHHHHHHHCCCCCCCCCCCCH-HHCCCC------CCCHHHCCCCCHHHHHHHHHHH--------HHHHHHHHHHHHH T ss_conf 999888775410133446468822-223677------8831312552089999999988--------9998989999999 Q ss_pred HHHHC----CHHHHHHHHHHHHHHHCCCC----------------CCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHH Q ss_conf 98606----30557888999998510476----------------43344442046677655440375633323105676 Q 537021.9.peg.7 348 IARMK----FSGYFLIVADFIQWAKKNNI----------------PVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERF 407 (1033) Q Consensus 348 I~~~g----f~~YFLiV~Div~~ak~~gI----------------~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERF 407 (1033) |.-.. ++---+--.||-|.--+..+ +--|.|++.+. ++.++=-++-|.+-.+.+||- T Consensus 188 l~~kk~~q~~~Qst~~HrdIARhQasSSvFsWQqe~tpsr~ss~~~~tp~rr~~sa---s~f~~e~e~tp~k~~~~~~~~ 264 (288) T pfam10481 188 LQAKKASQTIPQATMNHRDIARHQASSSVFSWQQEKTPSRLSSNALKTPLRRDFSA---SHFSGEQEVTPSRSTLQIGKG 264 (288) T ss_pred HHCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCHHHHCCC T ss_conf 85044565454010228999876325654311113687654555445753356554---568753346888602341345 Q ss_pred HHHCCCCCCCCC Q ss_conf 210014578753 Q 537021.9.peg.7 408 LNPDRMSMPDFD 419 (1033) Q Consensus 408 Lnp~R~~~PDID 419 (1033) .--.++||-+ T Consensus 265 --~~~~s~~~~~ 274 (288) T pfam10481 265 --DANSSFPDNS 274 (288) T ss_pred --CCCCCCCCCC T ss_conf --6666788886 No 280 >cd07220 Pat_PNPLA2 Patatin-like phospholipase domain containing protein 2. PNPLA2 plays a key role in hydrolysis of stored triacylglecerols and is also known as adipose triglyceride lipase (ATGL). Members of this family share a patain domain, initially discovered in potato tubers. ATGL is expressed in white and brown adipose tissue in high mRNA levels. Mutations in PNPLA2 encoding adipose triglyceride lipase (ATGL) leads to neutral lipid storage disease (NLSD) which is characterized by the accumulation of triglycerides in multiple tissues. ATGL mutations are also commonly associated with severe forms of skeletal- and cardio-myopathy. This family includes patatin-like proteins: TTS-2.2 (transport-secretion protein 2.2), PNPLA2 (Patatin-like phospholipase domain-containing protein 2), and iPLA2-zeta (Calcium-independent phospholipase A2) from Homo sapiens. Probab=38.68 E-value=15 Score=14.62 Aligned_cols=21 Identities=38% Similarity=0.428 Sum_probs=15.5 Q ss_pred CCCHHHHHHHHHHHCCCCCHHH Q ss_conf 4420466776554403756333 Q 537021.9.peg.7 378 RGSGAGSVTAYALTITDIDPLR 399 (1033) Q Consensus 378 RGSAagSLVaY~LgIT~VDPi~ 399 (1033) -|+.||||+|-+| +++++|-+ T Consensus 41 ~GaSAGAL~Aa~l-~~~~~~~~ 61 (249) T cd07220 41 YGASAGALTATAL-VTGVCLGE 61 (249) T ss_pred EEECHHHHHHHHH-HHCCCHHH T ss_conf 6654878999999-83788999 No 281 >PRK13978 ribose-5-phosphate isomerase A; Provisional Probab=38.43 E-value=20 Score=13.64 Aligned_cols=52 Identities=19% Similarity=0.317 Sum_probs=30.9 Q ss_pred HHHHHHHCCCCCCCCCCCCCCEEEEECC-CEEEEEHHHHCCCCHHHH-HHHHHHH Q ss_conf 9988886368503552334210035319-869850268589698999-9999998 Q 537021.9.peg.7 805 FCQDARQFNIQIMPPSVNTPCVDFKVGD-NRIYYSLAAIKGVGTTTA-RHIMEAS 857 (1033) Q Consensus 805 ~i~e~~~~gi~vl~PdIN~S~~~f~~~~-~~Ir~Gl~~Ikgvg~~~~-~~Ive~r 857 (1033) --..|+++||++.+++ .....+..+++ +.|-..|..|||-|.... |.|+... T Consensus 60 T~~~a~~~Gi~l~~l~-~~~~iDl~iDGADEvd~~lnlIKGgGgal~rEKivA~~ 113 (228) T PRK13978 60 IAFLAKELGIKICEIN-DVDHIDLAIDGADEVDPSLNIIKGGGGALFREKVIDEM 113 (228) T ss_pred HHHHHHHCCCEEECCC-CCCCCCEEEECCCCCCCCCCEEEECCHHHHHHHHHHHC T ss_conf 9999998798076455-68734679808864178878898784999999999970 No 282 >cd00893 PI4Kc_III Phosphoinositide 4-kinase (PI4K), Type III, catalytic domain; The PI4K catalytic domain family is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. PI4Ks catalyze the transfer of the gamma-phosphoryl group from ATP to the 4-hydroxyl of the inositol ring of D-myo-phosphatidylinositol (PtdIns) to generate PtdIns(4)P, the major precursor in the synthesis of other phosphoinositides including PtdIns(4,5)P2, PtdIns(3,4)P2, and PtdIns(3,4,5)P3. There are two types of PI4Ks, types II and III. Type II PI4Ks lack the characteristic catalytic kinase domain present in PI3Ks and type III PI4Ks, and are excluded from this family. Two isoforms of type III PI4K, alpha and beta, exist in most eukaryotes. Probab=38.41 E-value=12 Score=15.58 Aligned_cols=17 Identities=29% Similarity=0.374 Sum_probs=8.0 Q ss_pred CCCHHHHHHHHHHHCCC Q ss_conf 44204667765544037 Q 537021.9.peg.7 378 RGSGAGSVTAYALTITD 394 (1033) Q Consensus 378 RGSAagSLVaY~LgIT~ 394 (1033) |-.||-|+|||+|||-+ T Consensus 129 ~S~AgYsvitYiLgigD 145 (289) T cd00893 129 ESMAGYSLLCYLLQIKD 145 (289) T ss_pred HHHHHHHHHHHHHCCCC T ss_conf 99999999999842576 No 283 >PRK08393 N-ethylammeline chlorohydrolase; Provisional Probab=38.29 E-value=20 Score=13.62 Aligned_cols=61 Identities=16% Similarity=0.221 Sum_probs=32.2 Q ss_pred CCCHHHHHHCHHHHHHHCCCH---HHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEE Q ss_conf 881334331444764432998---999999997899889981165177499999999976996389 Q 537021.9.peg.7 9 PSFVHLQVHSSYSLLEGALSL---NNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPII 71 (1033) Q Consensus 9 ~~FvhLh~hS~ySlldg~~~~---e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIi 71 (1033) |-||..|+|+..++++|...- .+++.....- . .-.+|......++.-....+.+.|+.-++ T Consensus 52 PGlIn~H~H~~~~~~rg~~~d~~l~~~l~~~~~~-~-~~~l~~e~~~~~~~~~~~e~l~~G~Tt~~ 115 (419) T PRK08393 52 PGFINAHTHSPMVLLRGLAEDVPLMEWLQNYIWP-R-ERKLKRKDVYWGAYLGLLEMARSGTTTFV 115 (419) T ss_pred CCCCCHHHCHHHHHHHHCCCCCCHHHHHHHHHHH-H-HHCCCHHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 0810075586668787514899879999873436-6-76479899999999999999868976999 No 284 >KOG3666 consensus Probab=38.22 E-value=20 Score=13.61 Aligned_cols=14 Identities=29% Similarity=0.173 Sum_probs=5.6 Q ss_pred HHHHHHHHHHCCCC Q ss_conf 78999998624999 Q 537021.9.peg.7 729 KERFISGASKNGIS 742 (1033) Q Consensus 729 ~~~f~~g~~~~g~~ 742 (1033) -..|+..+.++|.+ T Consensus 873 L~~~ls~~qki~~S 886 (1141) T KOG3666 873 LQNFLSMFQKIILS 886 (1141) T ss_pred HHHHHHHHHHHCCC T ss_conf 99999999985233 No 285 >TIGR01381 E1_like_apg7 E1-like protein-activating enzyme Gsa7p/Apg7p; InterPro: IPR006285 This is a family of eukaryotic proteins found in animals, plants, and yeasts that includes Apg7p (YHR171W) from Saccharomyces cerevisiae and GSA7 from Pichia pastoris. Members are about 650 to 700 residues in length and include a central domain of about 150 residues shared with the ThiF/MoeB/HesA family of proteins. A low level of similarity to ubiquitin-activating enzyme E1 is described in a paper on peroxisome autophagy mediated by GSA7, and is the basis of the name ubiquitin activating enzyme E1-like protein. Members of the family appear to be involved in protein lipidation events analogous to ubiquitination and required for membrane fusion events during autophagy. . Probab=37.98 E-value=20 Score=13.58 Aligned_cols=29 Identities=24% Similarity=0.180 Sum_probs=14.1 Q ss_pred CCEEEECCCCCCCHHHHHHHHHHHHHHCC Q ss_conf 98998088300787889999876688469 Q 537021.9.peg.7 209 LPLVATNNSLFLSEEDYEAHDVLMAVAHS 237 (1033) Q Consensus 209 iPlVaTndv~Y~~~~D~~~h~iL~aI~~~ 237 (1033) |-++..-.+|-+..--+.++-+|-+|+.- T Consensus 220 l~~gf~Dps~~~~~~gw~lRN~La~~a~~ 248 (689) T TIGR01381 220 LLVGFIDPSSVASKAGWQLRNVLALVAYL 248 (689) T ss_pred EEEEEECCCCCCCCCCCHHHHHHHHHHHH T ss_conf 88998626644446782578999999861 No 286 >PRK07625 consensus Probab=37.97 E-value=20 Score=13.58 Aligned_cols=29 Identities=17% Similarity=0.165 Sum_probs=16.9 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCEEECCCC Q ss_conf 66689999888999857887262116385 Q 537021.9.peg.7 939 TLEKFSVENSSVRFENEQRVLGFYFSGHP 967 (1033) Q Consensus 939 ~~~~~~~~~~~e~l~~E~e~LGf~lS~HP 967 (1033) .+|++..-+...+-..|+.++-..+-|.- T Consensus 818 ~~p~i~s~n~~~r~~~eR~AiN~piQGsa 846 (922) T PRK07625 818 WLPEINGGNGPRRQAAERAAINAPMQGTA 846 (922) T ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCCHHH T ss_conf 17655688878887999999650244679 No 287 >PRK00702 ribose-5-phosphate isomerase A; Provisional Probab=37.94 E-value=20 Score=13.58 Aligned_cols=53 Identities=23% Similarity=0.320 Sum_probs=32.4 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCEEEEECC-CEEEEEHHHHCCCCHHHH-HHHHHHH Q ss_conf 99988886368503552334210035319-869850268589698999-9999998 Q 537021.9.peg.7 804 KFCQDARQFNIQIMPPSVNTPCVDFKVGD-NRIYYSLAAIKGVGTTTA-RHIMEAS 857 (1033) Q Consensus 804 ~~i~e~~~~gi~vl~PdIN~S~~~f~~~~-~~Ir~Gl~~Ikgvg~~~~-~~Ive~r 857 (1033) ..-..|+++||++.++| .....+..+++ +.|--.|..|||-|.... |.|+... T Consensus 57 ~T~~~a~~~Gi~l~~l~-~~~~iDlaiDGADevd~~l~lIKGgGgallrEKivA~~ 111 (222) T PRK00702 57 ASTELAKELGIPVFDLN-EVDSLDLYVDGADEIDPHLNLIKGGGAALTREKIVAAA 111 (222) T ss_pred HHHHHHHHCCCEEECCC-CCCCCCEEECCHHHHCCCCCEEECCHHHHHHHHHHHHH T ss_conf 99999998798686500-07852057436566466754785270999999999984 No 288 >PRK13690 hypothetical protein; Provisional Probab=37.78 E-value=21 Score=13.56 Aligned_cols=73 Identities=18% Similarity=0.154 Sum_probs=50.5 Q ss_pred CCCEEEECCCCCCHHHHHH--HCCCHHHHHHHHHHHHHHCC-CCEEEEEECCCHHHHHHHHHHH---HHHHHHCCCEEEE Q ss_conf 8988998197679678888--77997899999999987659-8689999439911589999999---9999818989980 Q 537021.9.peg.7 141 TEGLIMLTGGSSGPIDRAF--FFNSSQVAEKRLLTFKKLFG-DRLYVNLQRHRGYDRYRESQVV---QLAYTHELPLVAT 214 (1033) Q Consensus 141 ~egLIvlsg~~~g~i~~~l--~~~~~~~a~~~l~~l~~~Fg-d~~ylEl~~~~~~e~~~~~~l~---~lA~~~~iPlVaT 214 (1033) .+|=|+..||..|++.-.. ..+..+.++..+..+.+... -.+|+-.|. .+.+|..|+ +.|++++++.|.. T Consensus 24 ~~g~i~VvGcSTSEV~G~~IG~~~s~eva~~i~~~l~~~~~~~gi~LA~Qc----CEHlNRALvvEr~~a~~~~le~V~V 99 (183) T PRK13690 24 KPGQIFVLGCSTSEVLGERIGTAGSMEVAEAIVEALLEVLNETGIHLAVQG----CEHLNRALVVEREVAEKYGLEIVTV 99 (183) T ss_pred CCCCEEEEECCHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEC----HHHHHHHHHHHHHHHHHCCCCEEEE T ss_conf 999889996045773677567777799999999999999875382787613----6776788998699899839926999 Q ss_pred CCC Q ss_conf 883 Q 537021.9.peg.7 215 NNS 217 (1033) Q Consensus 215 ndv 217 (1033) -++ T Consensus 100 vP~ 102 (183) T PRK13690 100 VPV 102 (183) T ss_pred EEC T ss_conf 725 No 289 >PRK11282 glcE glycolate oxidase FAD binding subunit; Provisional Probab=37.77 E-value=15 Score=14.67 Aligned_cols=22 Identities=23% Similarity=0.468 Sum_probs=16.6 Q ss_pred HHHHHHHHHCCCCCCCCCCCCHH Q ss_conf 89999985104764334444204 Q 537021.9.peg.7 360 VADFIQWAKKNNIPVGPGRGSGA 382 (1033) Q Consensus 360 V~Div~~ak~~gI~vGpGRGSAa 382 (1033) |.++||+|.+++++|-| ||.+- T Consensus 7 v~e~Vr~A~~~~~pv~p-~G~gT 28 (351) T PRK11282 7 LLERVRQAAADGTPLRI-RGGGS 28 (351) T ss_pred HHHHHHHHHHCCCCEEE-ECCCC T ss_conf 99999999987992899-89997 No 290 >pfam10706 Aminoglyc_resit Aminoglycoside-2''-adenylyltransferase. This family is conserved in Bacteria. It confers resistance to kanamycin, gentamicin, and tobramycin. The protein is also produced by plasmids in various bacterial species and confers resistance to essentially all clinically available aminoglycosides except streptomycin, and it eliminates the synergism between aminoglycosides and cell-wall active agents. Probab=37.66 E-value=21 Score=13.55 Aligned_cols=156 Identities=22% Similarity=0.299 Sum_probs=77.8 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHC Q ss_conf 88999998510476433444420466776554403756333231056762100145787533365687899999998303 Q 537021.9.peg.7 359 IVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKY 438 (1033) Q Consensus 359 iV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~ky 438 (1033) .++-+...+.+.|+++--+-|=|- | .+..|--.+ --|||++||.+|+.+...-+. -+ T Consensus 8 lI~~il~~ad~~~lp~Wi~GGWaI-------------D-----A~lGRiTR~----H~DIDl~~P~d~~ae~~~ll~-~~ 64 (174) T pfam10706 8 LIHQIFAAADKINLPLWIGGGWAI-------------D-----ARLGRITRE----HDDIDLTFPGDRHAEFECLLH-AF 64 (174) T ss_pred HHHHHHHHHHHCCCCEEECCCEEE-------------E-----EEECCCCCC----CCCCCCCCCCHHHHHHHHHHH-HC T ss_conf 999999988856985676373010-------------0-----000100455----667111376244789999999-74 Q ss_pred CCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCHHHHHCCHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 80215542207787788999999975279988988654310145898244344522988788643074202688987201 Q 537021.9.peg.7 439 GHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDPAHPVSLQAIADDSRFREARLADPSVDRLLEISQKL 518 (1033) Q Consensus 439 G~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~l 518 (1033) |-- |.--. ++...-+.- + . +.+..|-+ .++-+-.| T Consensus 65 g~~----i~eq~-----------------dYGFLaq~~-------g-----~----------LlDcEP~~--~~~~~YEi 99 (174) T pfam10706 65 GGV----ITEQT-----------------DYGFLAQSQ-------G-----I----------LLDCEPAV--RIDDAYEI 99 (174) T ss_pred CCE----EEEEC-----------------CCEEEEECC-------C-----E----------EECCCCCE--ECCCCCCC T ss_conf 888----86641-----------------512112018-------7-----6----------87056524--41764424 Q ss_pred HHHHHCCCCCCCEEEECCCCHHHH--EEC-----CCCCCCCCEEEEEECHHHHHHCCCHHHHHHHCCHHHHHHH Q ss_conf 122100244744033525666662--001-----2367688668851021334311232135654178999999 Q 537021.9.peg.7 519 EGLYRHASTHAAGIVIGDRPLSQL--VPM-----YRDVRSDLPVTQFNMKWIEKAGLIKFDFLGLKTLTLLQKS 585 (1033) Q Consensus 519 eg~~r~~~~Ha~Gvvi~~~~l~~~--~p~-----~~~~~~~~~~~q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~ 585 (1033) +|+|-..-.-+.-=||..+|+.-. -.+ +. .+.++.++|--++.++.-|.+ |-||-.+.-.++.. T Consensus 100 ~~~P~GsCP~~~eG~i~gkpvrC~SW~Ai~wdyfyY--~~Evp~s~W~~K~~~S~~lac-~slg~~~v~~lr~q 170 (174) T pfam10706 100 EGLPPGSCPLEPEGVIGGKPVRCNSWEAILWDYFYY--LDEVPQSEWPPKHFHSFALAC-DSLGEAAVHNLREQ 170 (174) T ss_pred CCCCCCCCCCCCCCCCCCCEEEECCHHHHHHHHHHH--HHCCCHHHCCCCCHHHHHHHH-HHHHHHHHHHHHHH T ss_conf 899998898886653368234302089999998876--400872118821014566788-54427899999998 No 291 >TIGR02855 spore_yabG sporulation peptidase YabG; InterPro: IPR008764 Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This is a group of peptidases belong to MEROPS peptidase family U57 (clan U-). The type example is the YabG protein of Bacillus subtilis. This is a protease involved in the synthesis and maturation of the spore coat proteins SpoIVA and YrbA of Bacillus subtilis .. Probab=37.65 E-value=21 Score=13.55 Aligned_cols=35 Identities=20% Similarity=0.086 Sum_probs=18.3 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHH Q ss_conf 99115899999999999818989980883007878899 Q 537021.9.peg.7 189 HRGYDRYRESQVVQLAYTHELPLVATNNSLFLSEEDYE 226 (1033) Q Consensus 189 ~~~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~ 226 (1033) |-.-|..+.+.=+++=++.|+|++ -+|..+++.-+ T Consensus 118 H~DGD~~YL~~C~~~Y~~~gv~V~---G~~~~E~emPe 152 (292) T TIGR02855 118 HIDGDPEYLRKCLKLYKKLGVPVV---GIHCKEKEMPE 152 (292) T ss_pred EECCCHHHHHHHHHHHHHCCCCEE---EEEEEECCCCH T ss_conf 422888899999998866197279---99984121808 No 292 >KOG3172 consensus Probab=37.59 E-value=7 Score=17.53 Aligned_cols=16 Identities=38% Similarity=0.692 Sum_probs=8.0 Q ss_pred HCCCCCCCCCCCCHHH Q ss_conf 1047643344442046 Q 537021.9.peg.7 368 KKNNIPVGPGRGSGAG 383 (1033) Q Consensus 368 k~~gI~vGpGRGSAag 383 (1033) +.++...|||||-|+. T Consensus 87 ~~~~~g~~~~RG~~~~ 102 (119) T KOG3172 87 KSRSLGGGPGRGRARR 102 (119) T ss_pred CCCCCCCCCCCCCCCC T ss_conf 6787888887462122 No 293 >PRK07539 NADH dehydrogenase subunit E; Validated Probab=37.51 E-value=21 Score=13.53 Aligned_cols=48 Identities=27% Similarity=0.335 Sum_probs=28.0 Q ss_pred HHHHHHHCCCCCEEECCCHHHHH------HHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 99998303802155422077877------8899999997527998898865431 Q 537021.9.peg.7 431 IRYVQNKYGHERVAQIITFGSLQ------AKAALRDVGRALQMPYSQVDRLCKL 478 (1033) Q Consensus 431 i~y~~~kyG~~~va~i~t~~t~~------~k~aird~~r~~g~~~~~~~~~~~~ 478 (1033) |+-+..||+.++-|.|--..-.+ +..|++.+|+.+|+|..++..++.. T Consensus 10 i~~ii~~yp~~~~allp~L~~iQ~~~G~i~~ea~~~iA~~l~i~~~~V~~vatF 63 (154) T PRK07539 10 IEHEIAKYPRPRSAVIPALKIVQEQRGWVPDEAIEAVADYLGMPAIDVEEVATF 63 (154) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 999999779977899999999999879939999999999979799999999998 No 294 >PRK11557 putative DNA-binding transcriptional regulator; Provisional Probab=37.37 E-value=21 Score=13.52 Aligned_cols=17 Identities=12% Similarity=-0.056 Sum_probs=7.4 Q ss_pred HHHHHHHHHCCCEEEEC Q ss_conf 99999998189899808 Q 537021.9.peg.7 199 QVVQLAYTHELPLVATN 215 (1033) Q Consensus 199 ~l~~lA~~~~iPlVaTn 215 (1033) .+++.|++.|.|+|+.. T Consensus 197 ~~~~~Ak~~Ga~iIaIT 213 (282) T PRK11557 197 LAADEALRVGGKVLAIT 213 (282) T ss_pred HHHHHHHHCCCEEEEEC T ss_conf 99999998799399972 No 295 >COG1649 Uncharacterized protein conserved in bacteria [Function unknown] Probab=37.36 E-value=18 Score=13.95 Aligned_cols=67 Identities=15% Similarity=0.008 Sum_probs=39.2 Q ss_pred CHHHHHHHHHHHHHHCCCCEEEEEECCC---------------------HHHHHHHHHHHHHHHHHCCCEEEECCCCCCC Q ss_conf 9789999999998765986899994399---------------------1158999999999998189899808830078 Q 537021.9.peg.7 163 SSQVAEKRLLTFKKLFGDRLYVNLQRHR---------------------GYDRYRESQVVQLAYTHELPLVATNNSLFLS 221 (1033) Q Consensus 163 ~~~~a~~~l~~l~~~Fgd~~ylEl~~~~---------------------~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~ 221 (1033) ...+....+..+..+=-+-+|+.+-+.+ .....-...++..|.+.++++.|==+....- T Consensus 62 ~~~el~~~ld~l~~ln~NTv~~qV~~~G~~lypS~~~p~s~~~~~~~~~~~g~DpLa~~I~~AHkr~l~v~aWf~~~~~a 141 (418) T COG1649 62 QRQELKDILDDLQKLNFNTVYPQVWNDGDALYPSAVLPWSDGLPGVLGVDPGYDPLAFVIAEAHKRGLEVHAWFNPYRMA 141 (418) T ss_pred CHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCEEEECHHHCCCC T ss_conf 69999999999997098526899965855013422356545767655778897868999999986498321141010358 Q ss_pred HHHHHHHH Q ss_conf 78899998 Q 537021.9.peg.7 222 EEDYEAHD 229 (1033) Q Consensus 222 ~~D~~~h~ 229 (1033) +.....+. T Consensus 142 ~~~s~~~~ 149 (418) T COG1649 142 PPTSPLTK 149 (418) T ss_pred CCCCHHHH T ss_conf 77775676 No 296 >pfam07570 consensus Probab=37.34 E-value=18 Score=14.14 Aligned_cols=25 Identities=36% Similarity=0.486 Sum_probs=17.7 Q ss_pred HHHHCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 98510476433444420466776554 Q 537021.9.peg.7 365 QWAKKNNIPVGPGRGSGAGSVTAYAL 390 (1033) Q Consensus 365 ~~ak~~gI~vGpGRGSAagSLVaY~L 390 (1033) ++..-..+..| ||||.-+|-++--| T Consensus 9 ~~~~~y~v~kG-grgS~KS~~~a~~~ 33 (45) T pfam07570 9 NSSHFYIVAKG-GRGSGKSSVIALKI 33 (45) T ss_pred HCCCCEEEEEC-CCCCCCHHHHHHHH T ss_conf 00685077545-76766027899999 No 297 >TIGR03573 WbuX N-acetyl sugar amidotransferase. This enzyme has been implicated in the formation of the acetamido moiety (sugar-NC(=NH)CH3) which is found on some exopolysaccharides and is positively charged at neutral pH. The reaction involves ligation of ammonia with a sugar N-acetyl group, displacing water. In E. coli (O145 strain) and Pseudomonas aeruginosa (O12 strain) this gene is known as wbuX and ifnA respectively and likely acts on sialic acid. In Campylobacter jejuni, the gene is known as pseA and acts on pseudaminic acid in the process of flagellin glycosylation. In other Pseudomonas strains and various organisms it is unclear what the identity of the sugar substrate is, and in fact, the phylogenetic tree of this family sports a considerably deep branching suggestive of possible major differences in substrate structure. Nevertheless, the family is characterized by a conserved tetracysteine motif (CxxC.....[GN]xCxxC) possibly indicative of a metal binding site, as well as an Probab=37.23 E-value=21 Score=13.50 Aligned_cols=124 Identities=20% Similarity=0.227 Sum_probs=62.7 Q ss_pred CHHHHHHHHHHCC---CCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEEEC Q ss_conf 9899999999789---9889981165177499999999976996389999998547664334456566667753999956 Q 537021.9.peg.7 28 SLNNILDKIAADQ---QPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLLVS 104 (1033) Q Consensus 28 ~~e~LV~~A~~~G---~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlLAk 104 (1033) ..++|+++.+..| +..|-=- -|+-=+..-.+..+++.|++|+.= ++. . --- T Consensus 44 ~l~~l~~~~r~~~~~~YDCiv~v-SGGkDS~y~~~~l~~~~gl~pL~v---t~d--~--------------------~~~ 97 (343) T TIGR03573 44 ELEELVDKIKKKGGGRYDCIIGV-SGGKDSTYQAHVLKKKLGLNPLLV---TVD--P--------------------GWN 97 (343) T ss_pred HHHHHHHHHHHCCCCCCCEEEEC-CCCHHHHHHHHHHHHHHCCCEEEE---EEC--C--------------------CCC T ss_conf 99999999971489988689868-877289999999999829925999---835--9--------------------877 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHHHHHHHHHHHHCCCCEEE Q ss_conf 96899999999989973248778740078999863489889981976796788887799789999999998765986899 Q 537021.9.peg.7 105 TAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVAEKRLLTFKKLFGDRLYV 184 (1033) Q Consensus 105 N~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a~~~l~~l~~~Fgd~~yl 184 (1033) +..|.+|+.+|+...-. | +|..++ +.+..++.....-..+|+-+ T Consensus 98 t~~g~~Ni~~l~~~lgv--------------------D-~i~~~~-------------n~~~~k~l~k~~~~~~gd~~-- 141 (343) T TIGR03573 98 TELGVKNLNNLIKKLGF--------------------D-LHTITI-------------NPETFRKLQRAYFKKVGDPE-- 141 (343) T ss_pred CHHHHHHHHHHHHHCCC--------------------C-EEEECC-------------CHHHHHHHHHHHHHHCCCHH-- T ss_conf 98999999999983699--------------------8-587469-------------99999999999998668946-- Q ss_pred EEECCCHHHHHHHHHHHHHHHHHCCCEEEECCCC Q ss_conf 9943991158999999999998189899808830 Q 537021.9.peg.7 185 NLQRHRGYDRYRESQVVQLAYTHELPLVATNNSL 218 (1033) Q Consensus 185 El~~~~~~e~~~~~~l~~lA~~~~iPlVaTndv~ 218 (1033) -+.+........++|.+++||+|...... T Consensus 142 -----~~~~~~i~~~~~~iA~k~~IplIi~Gen~ 170 (343) T TIGR03573 142 -----WPQDHAIFASVYQVALKFNIPLIIWGENI 170 (343) T ss_pred -----HHHHHHHHHHHHHHHHHCCCCEEEECCCC T ss_conf -----99999999999999998199989975670 No 298 >PRK08591 acetyl-CoA carboxylase biotin carboxylase subunit; Validated Probab=36.97 E-value=21 Score=13.47 Aligned_cols=68 Identities=13% Similarity=0.169 Sum_probs=37.4 Q ss_pred CEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEE--ECCHHHHHHHHHHHHHH Q ss_conf 8899811651774999999999769963899999985476643344565666677539999--56968999999999899 Q 537021.9.peg.7 42 PAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLL--VSTAEGYQRLIELVSRM 119 (1033) Q Consensus 42 ~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlL--AkN~~Gy~nL~kL~S~a 119 (1033) +-|-|.+.| =-+++..++|++-||+.+.= |-..+...... + ..-+.+.+ +.+.+-|-|.-+++..+ T Consensus 3 ~kvLIANRG--EIA~RiiRt~~elgi~tVav---ys~~D~~a~hv---~----~ADeav~ig~~~~~~sYln~~~Ii~~A 70 (449) T PRK08591 3 DKILIANRG--EIALRILRACKELGIKTVAV---HSTADRDALHV---Q----LADEAVCIGPAPSKKSYLNIPAIISAA 70 (449) T ss_pred CEEEEECCC--HHHHHHHHHHHHCCCCEEEE---CCHHHCCCCHH---H----HCCEEEEECCCCCCCCCCCHHHHHHHH T ss_conf 448896784--99999999999849949998---68575278528---8----598889958988433304899999999 Q ss_pred HH Q ss_conf 73 Q 537021.9.peg.7 120 YL 121 (1033) Q Consensus 120 ~~ 121 (1033) .. T Consensus 71 ~~ 72 (449) T PRK08591 71 EI 72 (449) T ss_pred HH T ss_conf 98 No 299 >COG0083 ThrB Homoserine kinase [Amino acid transport and metabolism] Probab=36.95 E-value=18 Score=14.08 Aligned_cols=27 Identities=15% Similarity=0.116 Sum_probs=15.7 Q ss_pred CHHHHHHHCCH---HHHHHHHHHHHHHHHH Q ss_conf 34788751220---3589987886411100 Q 537021.9.peg.7 256 NRSEMVSIFSD---LPEALENTVEIARRCS 282 (1033) Q Consensus 256 s~eEM~~~f~~---~peAi~NT~~IAe~c~ 282 (1033) |+.|+++..+. +.+++.|+...|-.+. T Consensus 173 sT~~aR~vLP~~~~~~daV~n~s~~a~lv~ 202 (299) T COG0083 173 STAEARKVLPKSYSRKDAVFNLSRAALLVA 202 (299) T ss_pred CHHHHHHHCCCCCCHHHHHHHHHHHHHHHH T ss_conf 579999746566888999999999999999 No 300 >PRK12311 rpsB 30S ribosomal protein S2; Provisional Probab=36.92 E-value=21 Score=13.46 Aligned_cols=12 Identities=42% Similarity=0.476 Sum_probs=4.0 Q ss_pred HHHHHHHHHHHH Q ss_conf 988778989860 Q 537021.9.peg.7 340 RLDFELNVIARM 351 (1033) Q Consensus 340 RLe~EL~vI~~~ 351 (1033) +|++-|.=|..| T Consensus 145 KL~k~lgGIk~M 156 (332) T PRK12311 145 KLDRSLGGIKDM 156 (332) T ss_pred HHHHHHHHHHHH T ss_conf 999854625420 No 301 >cd01398 RPI_A RPI_A: Ribose 5-phosphate isomerase type A (RPI_A) subfamily; RPI catalyzes the reversible conversion of ribose-5-phosphate to ribulose 5-phosphate, the first step of the non-oxidative branch of the pentose phosphate pathway. This reaction leads to the conversion of phosphosugars into glycolysis intermediates, which are precursors for the synthesis of amino acids, vitamins, nucleotides, and cell wall components. In plants, RPI is part of the Calvin cycle as ribulose 5-phosphate is the carbon dioxide receptor in the first dark reaction of photosynthesis. There are two unrelated types of RPIs (A and B), which catalyze the same reaction, at least one type of RPI is present in an organism. RPI_A is more widely distributed than RPI_B in bacteria, eukaryotes, and archaea. Probab=36.91 E-value=21 Score=13.46 Aligned_cols=51 Identities=22% Similarity=0.328 Sum_probs=24.6 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCEEEEECC-CEEEEEHHHHCCCCHHHH-HHHHH Q ss_conf 99988886368503552334210035319-869850268589698999-99999 Q 537021.9.peg.7 804 KFCQDARQFNIQIMPPSVNTPCVDFKVGD-NRIYYSLAAIKGVGTTTA-RHIME 855 (1033) Q Consensus 804 ~~i~e~~~~gi~vl~PdIN~S~~~f~~~~-~~Ir~Gl~~Ikgvg~~~~-~~Ive 855 (1033) ..-..|+++||++.++| .....++.+++ +.|-.-|..|||=|.... |.|+. T Consensus 53 ~T~~~~~~~Gi~v~~~~-~~~~iDi~iDGaDevd~~l~lIKGgGgal~rEKiva 105 (213) T cd01398 53 QTEELARELGIPLTDLD-EVPRLDLAIDGADEVDPDLNLIKGGGGALLREKIVA 105 (213) T ss_pred HHHHHHHHCCCCEECHH-HCCCEEEEECCHHHHCCCCCEEEECCHHHHHHHHHH T ss_conf 99999998799742755-678300675374664547668970828989899999 No 302 >KOG3311 consensus Probab=36.88 E-value=21 Score=13.46 Aligned_cols=48 Identities=19% Similarity=0.390 Sum_probs=34.4 Q ss_pred CCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHHC Q ss_conf 986985026858969899999999983799999789997348247998999988867 Q 537021.9.peg.7 832 DNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLVFA 888 (1033) Q Consensus 832 ~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li~a 888 (1033) ...|.|.|.+|+|||...++.|.. +-+ .|...|+. .+...+++.++.. T Consensus 23 ~~~V~fAl~~i~Gig~~~A~~ic~--K~~-----~~~~~r~g--elt~~qi~~i~~i 70 (152) T KOG3311 23 KRKVTFALTSIKGIGRRYAEIVCK--KAD-----LDLTKRAG--ELTEEQILRILQI 70 (152) T ss_pred CCEEEEEEEEEEEECHHHHHHHHH--HCC-----CCHHHHHC--CCCHHHHHHHHHH T ss_conf 800689888886511224444442--037-----40555413--1249999999998 No 303 >TIGR02836 spore_IV_A stage IV sporulation protein A; InterPro: IPR014201 This entry is designated stage IV sporulation protein A. It acts in the mother cell compartment and plays a role in spore coat morphogenesis . A comparative genome analysis of all sequenced genomes of Firmicutes shows that the proteins are strictly conserved among the sub-set of endospore-forming species. . Probab=36.87 E-value=21 Score=13.45 Aligned_cols=109 Identities=23% Similarity=0.240 Sum_probs=66.5 Q ss_pred EEEECCHHHHHHH--HHHHHHHHHCCCCCCCCCCCHHHHHHHCC---------CCEEEECCCCCCHHHHHHHCCCHHHH- Q ss_conf 9995696899999--99998997324877874007899986348---------98899819767967888877997899- Q 537021.9.peg.7 100 VLLVSTAEGYQRL--IELVSRMYLLDQGKQSVRICLSWLQEIGT---------EGLIMLTGGSSGPIDRAFFFNSSQVA- 167 (1033) Q Consensus 100 vlLAkN~~Gy~nL--~kL~S~a~~~~~~~~~p~i~~~~L~e~~~---------egLIvlsg~~~g~i~~~l~~~~~~~a- 167 (1033) --.++...||..= -|++|-=|.+ -.|+++.-+|..+ =|++|+|-|.=+.|+| .+|=+| T Consensus 100 GY~V~GA~GY~ee~kprMv~TPWYd------~eIPF~EAAEIGTrKVI~dHSTiG~lvTTDGsI~dI~R----edYv~AE 169 (492) T TIGR02836 100 GYIVKGALGYEEEDKPRMVSTPWYD------EEIPFEEAAEIGTRKVISDHSTIGLLVTTDGSITDIPR----EDYVEAE 169 (492) T ss_pred CCCCCCCCCCCCCCCCEECCCCCCC------CCCCHHHHHHCCCEEECCCCCEEEEEEECCCCCCCCCC----CCCHHHH T ss_conf 3015785450027864011588888------88872355123771011577507899711784034673----1340033 Q ss_pred HHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEECCCCC Q ss_conf 9999999876598689999439911589999999999981898998088300 Q 537021.9.peg.7 168 EKRLLTFKKLFGDRLYVNLQRHRGYDRYRESQVVQLAYTHELPLVATNNSLF 219 (1033) Q Consensus 168 ~~~l~~l~~~Fgd~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTndv~Y 219 (1033) +++++.|+++= .-|-+=|....+....-.+-.-+|.++|++|+++.+=..+ T Consensus 170 ERVveELKe~~-KPFIilLNs~~P~~pET~~L~~eLe~KYDvpV~~~~v~~m 220 (492) T TIGR02836 170 ERVVEELKELN-KPFIILLNSTHPLAPETKALRKELEEKYDVPVLVVDVLEM 220 (492) T ss_pred HHHHHHHHCCC-CCEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCC T ss_conf 57767764079-9679997688978865799999865208844589640002 No 304 >COG0120 RpiA Ribose 5-phosphate isomerase [Carbohydrate transport and metabolism] Probab=36.64 E-value=21 Score=13.43 Aligned_cols=53 Identities=19% Similarity=0.300 Sum_probs=36.2 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCEEEEECC-CEEEEEHHHHCCCCHHHH-HHHHHHH Q ss_conf 99988886368503552334210035319-869850268589698999-9999998 Q 537021.9.peg.7 804 KFCQDARQFNIQIMPPSVNTPCVDFKVGD-NRIYYSLAAIKGVGTTTA-RHIMEAS 857 (1033) Q Consensus 804 ~~i~e~~~~gi~vl~PdIN~S~~~f~~~~-~~Ir~Gl~~Ikgvg~~~~-~~Ive~r 857 (1033) .--..|++.||++..++ ..+..+-++++ +.|-..+..|||-|.... |.|+... T Consensus 57 ~t~~l~~~~GI~v~~l~-~~~~lDl~iDGADEvd~~~~lIKGGGgAl~rEKIva~~ 111 (227) T COG0120 57 QTEELARELGIPVSSLN-EVDSLDLAIDGADEVDPNLNLIKGGGGALLREKIVASA 111 (227) T ss_pred HHHHHHHHCCCEECCCC-CCCCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHH T ss_conf 99999998297643733-36766657646411188887670375898898999974 No 305 >PRK07300 consensus Probab=36.63 E-value=21 Score=13.43 Aligned_cols=34 Identities=24% Similarity=0.278 Sum_probs=19.9 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCEEECCCCHHHHH Q ss_conf 6668999988899985788726211638558999 Q 537021.9.peg.7 939 TLEKFSVENSSVRFENEQRVLGFYFSGHPLDVYK 972 (1033) Q Consensus 939 ~~~~~~~~~~~e~l~~E~e~LGf~lS~HPl~~~~ 972 (1033) .+|++..-+...+-..|+.++-..+-|.-=|..| T Consensus 776 ~ip~i~s~n~~~r~~~eR~AiN~piQGtAADiiK 809 (880) T PRK07300 776 ELPDINSRNFNVRSFAERTAINSPIQGSAADILK 809 (880) T ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 5898678887788799898816024178999999 No 306 >PRK08786 consensus Probab=36.61 E-value=21 Score=13.42 Aligned_cols=33 Identities=15% Similarity=0.211 Sum_probs=17.4 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCEEECCCCHHHH Q ss_conf 666899998889998578872621163855899 Q 537021.9.peg.7 939 TLEKFSVENSSVRFENEQRVLGFYFSGHPLDVY 971 (1033) Q Consensus 939 ~~~~~~~~~~~e~l~~E~e~LGf~lS~HPl~~~ 971 (1033) .+|.+..-+...+-..|+.++-..+-|.-=|.. T Consensus 823 ~~p~i~s~n~~~r~~aeR~A~N~piQGsAADii 855 (927) T PRK08786 823 YLDFINAGSQGQRAGAERAAINAPMQGTAADII 855 (927) T ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHH T ss_conf 288667888778878999985834435799999 No 307 >COG4294 Uve UV damage repair endonuclease [DNA replication, recombination, and repair] Probab=36.40 E-value=21 Score=13.40 Aligned_cols=41 Identities=10% Similarity=0.190 Sum_probs=29.3 Q ss_pred HHHHHHHHHHHHCCCEEEECCCCCCCH-----HHHHHHHHHHHHHC Q ss_conf 999999999981898998088300787-----88999987668846 Q 537021.9.peg.7 196 RESQVVQLAYTHELPLVATNNSLFLSE-----EDYEAHDVLMAVAH 236 (1033) Q Consensus 196 ~~~~l~~lA~~~~iPlVaTndv~Y~~~-----~D~~~h~iL~aI~~ 236 (1033) -...++.+.++.+||+|---.-|+.++ ++..+...+..|+. T Consensus 218 s~eelL~lCek~~iPlVfD~HHh~v~~~l~~~~~~sl~~~~~r~r~ 263 (347) T COG4294 218 STEELLPLCEKLNIPLVFDAHHHNVHPGLDREDSPSLMELIPRIRE 263 (347) T ss_pred CHHHHHHHHHHHCCCEEEECHHHHCCCCCCCCCCHHHHHHHHHHHH T ss_conf 6999999988738987985012330688865578149999999997 No 308 >PRK07213 chlorohydrolase; Provisional Probab=36.32 E-value=21 Score=13.47 Aligned_cols=62 Identities=15% Similarity=0.158 Sum_probs=37.4 Q ss_pred CCCHHHHHHCHHHHHHHCC---CHHHHHHHHHHCCCCEE-EEECCCHHHHHHHHHHHHHHCCCCEE Q ss_conf 8813343314447644329---98999999997899889-98116517749999999997699638 Q 537021.9.peg.7 9 PSFVHLQVHSSYSLLEGAL---SLNNILDKIAADQQPAI-AITDTNNLFSALEFSQKACAAGIQPI 70 (1033) Q Consensus 9 ~~FvhLh~hS~ySlldg~~---~~e~LV~~A~~~G~~Ai-AITD~~nl~Gav~F~~ackk~gIKPI 70 (1033) |-||.-|+|...|+++|.. ..++++........+.+ ..++.....++......+.+.|+.-+ T Consensus 50 PGfVNtHtHl~~s~~rg~~~~~~L~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~e~l~~G~Tt~ 115 (378) T PRK07213 50 PPLINAHTHIGDSSIKDIGIGKSLDELVKPPNGLKHKFLNSCSDKELVEGMSEGLYDMENNGIKAF 115 (378) T ss_pred ECCEEHHHCHHHHHHHHCCCCCCHHHHHHCCCCHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCEEE T ss_conf 398186658645766525489989999746311457987628999999999999999984781999 No 309 >pfam04219 DUF413 Protein of unknown function, DUF. Probab=36.29 E-value=22 Score=13.38 Aligned_cols=49 Identities=16% Similarity=0.324 Sum_probs=32.7 Q ss_pred HHCCCHHHHHHHHHH--------HCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 632684130788976--------201211016778789999986249999999999999 Q 537021.9.peg.7 702 LSGYSLSEADVLRRA--------MGKKIKEEMDKQKERFISGASKNGISKTIAVNIFEL 752 (1033) Q Consensus 702 ~ag~~~~ead~lRra--------~~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~ 752 (1033) -+.||..||+.|.+- .|+..++ .+.++.|++-|.-..-+....+.+|-. T Consensus 15 sGdFTi~Ea~lLe~yG~~~~~L~~G~~~P~--t~eE~~Fv~v~~Ge~~p~s~~Ek~W~K 71 (94) T pfam04219 15 SGDFTIKEAELLEQYGCAMLALESGEREPE--TEEEEQFVAVCKGEQAPSTEIEKVWLK 71 (94) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCCCCCC--CHHHHHHHHHHCCCCCCCCHHHHHHHH T ss_conf 787679999999999699999983994889--988999999983888898999999999 No 310 >cd05168 PI4Kc_III_beta Phosphoinositide 4-kinase (PI4K), Type III, beta isoform, catalytic domain; The PI4K catalytic domain family is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. PI4Ks catalyze the transfer of the gamma-phosphoryl group from ATP to the 4-hydroxyl of the inositol ring of D-myo-phosphatidylinositol (PtdIns) to generate PtdIns(4)P, the major precursor in the synthesis of other phosphoinositides including PtdIns(4,5)P2, PtdIns(3,4)P2, and PtdIns(3,4,5)P3. Two isoforms of type III PI4K, alpha and beta, exist in most eukaryotes. PI4KIIIbeta (also called Pik1p in yeast) is a 110 kDa protein that is localized to the Golgi and the nucleus. It is required for maintaining the structural integrity of the Golgi complex (GC), and is a key regulator of protein transport from the GC to the plasma membrane. PI4KII Probab=36.26 E-value=13 Score=15.17 Aligned_cols=17 Identities=29% Similarity=0.374 Sum_probs=11.6 Q ss_pred CCCHHHHHHHHHHHCCC Q ss_conf 44204667765544037 Q 537021.9.peg.7 378 RGSGAGSVTAYALTITD 394 (1033) Q Consensus 378 RGSAagSLVaY~LgIT~ 394 (1033) |-.||=|+++|+|||-+ T Consensus 131 ~S~AgYsv~tYiLgIgD 147 (293) T cd05168 131 ESLAGYSLICYLLQIKD 147 (293) T ss_pred HHHHHHHHHHHHHCCCC T ss_conf 99999999999813566 No 311 >TIGR01370 cysRS possible cysteinyl-tRNA synthetase; InterPro: IPR016063 The prediction that proteins in this entry are cysteinyl-tRNA synthetases is supported by but challenged by . The proteins from Deinococcus radiodurans and Methanococcus jannaschii, species lacking a conventional cysteinyl-tRNA synthetase (Cys--tRNA ligase), have been indicated to be a novel form of that enzyme, perhaps distantly related to class I tRNA ligases. The protein from Thermotoga maritima is presumed to be a second isozyme of cysteinyl-tRNA synthetase. A number of homologous but more distantly related proteins are annotated as putative alpha-1,4 polygalactosaminidases.. Probab=36.16 E-value=20 Score=13.59 Aligned_cols=81 Identities=21% Similarity=0.389 Sum_probs=50.7 Q ss_pred CHHHHHHHHHHHHHHHHHHCCHH-HHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHC Q ss_conf 83899999887789898606305-57888999998510476433444420466776554403756333231056762100 Q 537021.9.peg.7 333 HAQDYRQRLDFELNVIARMKFSG-YFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERFLNPD 411 (1033) Q Consensus 333 ~~~~y~~RLe~EL~vI~~~gf~~-YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERFLnp~ 411 (1033) +.++-++=+=--|+=|.+.||.| |.=-|=.+=+||.+ |+ --|-|||--+|+|++.|.+ =- T Consensus 147 W~peWk~I~fsyLDrvI~QGF~GvYlD~iD~feywA~~-Gv---~~r~~aa~eMi~fv~~Ia~---------------y~ 207 (325) T TIGR01370 147 WDPEWKAIVFSYLDRVIAQGFDGVYLDKIDAFEYWAEE-GV---DNRDGAAKEMIAFVVEIAE---------------YA 207 (325) T ss_pred CCHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHC-CC---CCCHHHHHHHHHHHHHHHH---------------HH T ss_conf 75789999999743675527872440442146888754-77---6844689999999999999---------------98 Q ss_pred CCCCCCCCCCCCHHHHHHHHHH Q ss_conf 1457875333656878999999 Q 537021.9.peg.7 412 RMSMPDFDIDFCQDRRDEVIRY 433 (1033) Q Consensus 412 R~~~PDIDiDf~~~~R~~vi~y 433 (1033) |..-||-=|=+|.+. +++|+| T Consensus 208 R~~~p~sf~IIPQNg-e~ll~~ 228 (325) T TIGR01370 208 REKKPDSFVIIPQNG-EELLEY 228 (325) T ss_pred HHCCCCEEEEECCCH-HHHCCC T ss_conf 605898179835673-775157 No 312 >pfam01152 Bac_globin Bacterial-like globin. This family of heme binding proteins are found mainly in bacteria. However they can also be found in some protozoa and plants as well. Probab=36.00 E-value=22 Score=13.35 Aligned_cols=72 Identities=19% Similarity=0.252 Sum_probs=42.2 Q ss_pred HHHHHHHHHHHHCCCHHHHH----HHHHHHCC-CC-CCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 25689887886326841307----88976201-21-101677878999998624999999999999999750001331 Q 537021.9.peg.7 692 QEQVMQIAQLLSGYSLSEAD----VLRRAMGK-KI-KEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNK 763 (1033) Q Consensus 692 QEQvm~ia~~~ag~~~~ead----~lRra~~k-k~-~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~Fnk 763 (1033) +-|...+++.++|=...... .+|++-.. +. .+..+.+-+.|.+-+.+.|++++..++++..++..++.--|| T Consensus 43 ~~~~~fl~~~~GGP~~Y~g~~g~~~m~~~H~~~~I~~~~fd~~l~~~~~al~e~~v~~~~~~e~l~~l~~~~~~~vnr 120 (120) T pfam01152 43 NKLYEFLAQALGGPPHYSGRRGHPRLRERHAPFPITNAEFDAWLRHLADALADAGVHAILREEILKVMEFLAHTALNK 120 (120) T ss_pred HHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCC T ss_conf 999999999908986777787875699987289999999999999999999883989999999999999999984498 No 313 >CHL00067 rps2 ribosomal protein S2 Probab=35.65 E-value=22 Score=13.31 Aligned_cols=19 Identities=32% Similarity=0.293 Sum_probs=6.7 Q ss_pred EEEEEECCHHHHHHHHHHH Q ss_conf 3999956968999999999 Q 537021.9.peg.7 98 SIVLLVSTAEGYQRLIELV 116 (1033) Q Consensus 98 ~lvlLAkN~~Gy~nL~kL~ 116 (1033) |++=|-++-...+.-++.+ T Consensus 39 hIIdL~kT~~~L~~A~~~i 57 (227) T CHL00067 39 HIINLTQTARFLSEACDLV 57 (227) T ss_pred EEECHHHHHHHHHHHHHHH T ss_conf 7985999999999999999 No 314 >PRK10760 murein hydrolase B; Provisional Probab=35.51 E-value=22 Score=13.29 Aligned_cols=27 Identities=26% Similarity=0.179 Sum_probs=10.6 Q ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCH Q ss_conf 85104764334444204667765544037563 Q 537021.9.peg.7 366 WAKKNNIPVGPGRGSGAGSVTAYALTITDIDP 397 (1033) Q Consensus 366 ~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDP 397 (1033) .+++.++....=+||=|| ++|.|+-=| T Consensus 198 i~~~~~~dp~~l~GSwAG-----AMG~~QFMP 224 (357) T PRK10760 198 MARDEQDDPLNLKGSFAG-----AMGYGQFMP 224 (357) T ss_pred HHHHCCCCHHHCCCCCCC-----CCCCCCCCC T ss_conf 998579986654676541-----247765086 No 315 >PRK09045 N-ethylammeline chlorohydrolase; Provisional Probab=35.50 E-value=18 Score=14.00 Aligned_cols=19 Identities=37% Similarity=0.656 Sum_probs=11.9 Q ss_pred CCCHHHHHHCHHHHHHHCC Q ss_conf 8813343314447644329 Q 537021.9.peg.7 9 PSFVHLQVHSSYSLLEGAL 27 (1033) Q Consensus 9 ~~FvhLh~hS~ySlldg~~ 27 (1033) |-||..|+|..+|+++|.. T Consensus 67 PGlVd~H~H~~~~~~rG~~ 85 (444) T PRK09045 67 PGLVNAHTHAAMSLLRGLA 85 (444) T ss_pred ECHHHHHHCHHHHHHHHHC T ss_conf 4241396697877176402 No 316 >COG1748 LYS9 Saccharopine dehydrogenase and related proteins [Amino acid transport and metabolism] Probab=35.49 E-value=22 Score=13.29 Aligned_cols=91 Identities=15% Similarity=0.090 Sum_probs=38.6 Q ss_pred CCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCC------CCCCCC Q ss_conf 68988999876541034201246432457838999998877898986063055788899999851047------643344 Q 537021.9.peg.7 304 QKEEENELRNKAVAGLEVRLAQGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNN------IPVGPG 377 (1033) Q Consensus 304 ~~~~~~~Lr~l~~~Gl~~R~~~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~g------I~vGpG 377 (1033) .-+++..|++.+..+ |+.+...+..-.+ ...|.++|+.++... ..|++---++.-.+|... +..+.+ T Consensus 168 tws~e~~l~e~~~p~---~~~~~Gk~~~v~~--~~~~~~~~~~~~G~~--~~y~~~~~el~sL~~~i~~~~~~~~~~t~r 240 (389) T COG1748 168 TWSPEINLREYTRPA---RYWENGKWVEVDP--LEEREVFEFPVIGYG--DVYAFYHDELRSLVKTIPGVVRTRFEMTFR 240 (389) T ss_pred EECHHHHHHHHCCCE---EEEECCEEEEECC--CCCCCCCCCCCCCCE--EEEECCCCCHHHHHHHCCCCCEEEEEEECC T ss_conf 625788689754755---9984787998166--563233246778732--589537822777977575400046776048 Q ss_pred -CCCHHHHHHHHHHHCCCCCHHHCC Q ss_conf -442046677655440375633323 Q 537021.9.peg.7 378 -RGSGAGSVTAYALTITDIDPLRFS 401 (1033) Q Consensus 378 -RGSAagSLVaY~LgIT~VDPi~~~ 401 (1033) +|=++==.+--=||.++.+|+++. T Consensus 241 ~~g~~~~i~~L~~lGll~~~~v~~~ 265 (389) T COG1748 241 YPGHLEVIKALRDLGLLSREPVKVQ 265 (389) T ss_pred CHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 6409999999987578865543335 No 317 >smart00483 POLXc DNA polymerase X family. includes vertebrate polymerase beta and terminal deoxynucleotidyltransferases Probab=35.48 E-value=22 Score=13.29 Aligned_cols=12 Identities=25% Similarity=0.542 Sum_probs=7.1 Q ss_pred HHHHCCCCCHHH Q ss_conf 554403756333 Q 537021.9.peg.7 388 YALTITDIDPLR 399 (1033) Q Consensus 388 Y~LgIT~VDPi~ 399 (1033) |-+||+.+|=|+ T Consensus 106 ~~~Gi~tl~dL~ 117 (334) T smart00483 106 YRKGIRTLEELK 117 (334) T ss_pred HHCCCCCHHHHH T ss_conf 984988799999 No 318 >cd01321 ADGF Adenosine deaminase-related growth factors (ADGF), a novel family of secreted growth-factors with sequence similarty to adenosine deaminase. Probab=35.43 E-value=22 Score=13.28 Aligned_cols=24 Identities=17% Similarity=0.113 Sum_probs=18.9 Q ss_pred CHHHHHHCHHHHHHHCCCHHHHHHHHHHC Q ss_conf 13343314447644329989999999978 Q 537021.9.peg.7 11 FVHLQVHSSYSLLEGALSLNNILDKIAAD 39 (1033) Q Consensus 11 FvhLh~hS~ySlldg~~~~e~LV~~A~~~ 39 (1033) =++||+| ++|+.+|+-+++.|... T Consensus 27 GaeLH~H-----leG~l~pe~l~~la~~~ 50 (345) T cd01321 27 GALLHVH-----DTAMVSSDWLIKNATYR 50 (345) T ss_pred CHHHCCC-----CCCCCCHHHHHHHHHHH T ss_conf 0756117-----52577999999999999 No 319 >cd07222 Pat_PNPLA4 Patatin-like phospholipase domain containing protein 4. PNPLA4, also known as GS2 (gene sequence-2), shows both lipase and transacylation activities. GS2 lipase is expressed in various tissues, predominantly in muscle and adipocytes tissue. It is also expressed in keratinocytes and shows retinyl ester hydrolase, acylglycerol, TG hydrolase, and PLA2 activity. This family includes patatin-like proteins: GS2 from mammals, PNPLA4 (Patatin-like phospholipase domain-containing protein 4), and iPLA2-eta (Calcium-independent phospholipase A2) from Homo sapiens. Probab=35.33 E-value=22 Score=13.27 Aligned_cols=24 Identities=29% Similarity=0.314 Sum_probs=16.6 Q ss_pred CCCCCHHHHHHHHHHHCCCCCHHHC Q ss_conf 4444204667765544037563332 Q 537021.9.peg.7 376 PGRGSGAGSVTAYALTITDIDPLRF 400 (1033) Q Consensus 376 pGRGSAagSLVaY~LgIT~VDPi~~ 400 (1033) +-=|++||||||-++ .+..|+++. T Consensus 34 ~~~GaSAGAlvaa~l-~~~~~~l~~ 57 (246) T cd07222 34 RFAGASAGSLVAAVL-LTAPEKIEE 57 (246) T ss_pred EEEEEHHHHHHHHHH-HHCCHHHHH T ss_conf 677777999999999-819457999 No 320 >pfam04260 DUF436 Protein of unknown function (DUF436). Family of bacterial proteins with undetermined function. Probab=35.27 E-value=22 Score=13.26 Aligned_cols=73 Identities=19% Similarity=0.182 Sum_probs=50.2 Q ss_pred CCCEEEECCCCCCHHHHHH--HCCCHHHHHHHHHHHHHHCC-CCEEEEEECCCHHHHHHHHHHH---HHHHHHCCCEEEE Q ss_conf 8988998197679678888--77997899999999987659-8689999439911589999999---9999818989980 Q 537021.9.peg.7 141 TEGLIMLTGGSSGPIDRAF--FFNSSQVAEKRLLTFKKLFG-DRLYVNLQRHRGYDRYRESQVV---QLAYTHELPLVAT 214 (1033) Q Consensus 141 ~egLIvlsg~~~g~i~~~l--~~~~~~~a~~~l~~l~~~Fg-d~~ylEl~~~~~~e~~~~~~l~---~lA~~~~iPlVaT 214 (1033) .+|=|+..||..|++.-.. ..+..+.++..+..+.+... -.+|+-.|. .+.+|..++ +.|++++++.|.. T Consensus 17 ~~g~i~VvGcSTSEV~G~~IG~~ss~eva~~i~~~l~~~~~~~gi~lA~Q~----CEHlNRALvvEr~~a~~~~le~V~V 92 (172) T pfam04260 17 KKGDLFVLGCSTSEVLGGRIGTASSLEVAEAIVETLLEILNETGIYLAVQG----CEHLNRALVVEREVAEAYGLEEVTV 92 (172) T ss_pred CCCCEEEEECCHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEC----HHHHHHHHHHHHHHHHHCCCCEEEE T ss_conf 999889996045874766467768699999999999999876193799855----5675678886298798739807998 Q ss_pred CCC Q ss_conf 883 Q 537021.9.peg.7 215 NNS 217 (1033) Q Consensus 215 ndv 217 (1033) -++ T Consensus 93 vP~ 95 (172) T pfam04260 93 VPV 95 (172) T ss_pred ECC T ss_conf 637 No 321 >COG0122 AlkA 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair] Probab=35.23 E-value=22 Score=13.26 Aligned_cols=97 Identities=15% Similarity=0.130 Sum_probs=45.3 Q ss_pred HCCCCHHHHHHHHHHHHHHHHC------CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH Q ss_conf 2499999999999999975000------1331014788899999999997302666777510023542025899988886 Q 537021.9.peg.7 738 KNGISKTIAVNIFELLAKFADY------GFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVEKIKKFCQDARQ 811 (1033) Q Consensus 738 ~~g~~~~~a~~i~~~i~~f~~Y------~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~~~~~~i~e~~~ 811 (1033) ..-++-+.|.++|+.+....|= .|+.....+ .-=+..+-... ....| +.|+.++-+ T Consensus 115 ~QqvS~~~A~~i~~rl~~~~g~~~~~~~~fptpe~l~------------~~~~~~l~~~g-----~s~~K-a~yi~~~A~ 176 (285) T COG0122 115 SQQVSVAAAAKIWARLVSLYGNALEIYHSFPTPEQLA------------AADEEALRRCG-----LSGRK-AEYIISLAR 176 (285) T ss_pred HHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHH------------HCCHHHHHHHC-----CCHHH-HHHHHHHHH T ss_conf 7650599999999999998187666566798999998------------47999998837-----85778-999999999 Q ss_pred CCCCCCCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHHH Q ss_conf 368503552334210035319869850268589698999999999 Q 537021.9.peg.7 812 FNIQIMPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIMEA 856 (1033) Q Consensus 812 ~gi~vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~ 856 (1033) .=..-. ||.+.. ...+++.++=-|.+|||||.-+++-+.-. T Consensus 177 ~~~~g~-~~~~~l---~~~~~e~a~e~L~~i~GIG~WTAe~~llf 217 (285) T COG0122 177 AAAEGE-LDLSEL---KPLSDEEAIEELTALKGIGPWTAEMFLLF 217 (285) T ss_pred HHHCCC-CCHHHH---CCCCHHHHHHHHHCCCCCCHHHHHHHHHH T ss_conf 998599-656766---25889999999873788679999999998 No 322 >PRK09239 chorismate mutase; Provisional Probab=34.75 E-value=23 Score=13.20 Aligned_cols=32 Identities=19% Similarity=0.255 Sum_probs=24.3 Q ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 16778789999986249999999999999997 Q 537021.9.peg.7 724 EMDKQKERFISGASKNGISKTIAVNIFELLAK 755 (1033) Q Consensus 724 ~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~ 755 (1033) --+++-++...-|.+.|.+++.++++|..|-. T Consensus 56 RE~~vl~~~~~~a~~~gLdpdfae~l~~~Ii~ 87 (104) T PRK09239 56 REARQIERLRQLAKDANLDPDFAEKFLNFIIK 87 (104) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 99999999999998849699999999999999 No 323 >pfam01049 Cadherin_C Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic binding units. A key determinant to the strength of the binding that it is mediated by cadherins is the juxtamembrane region of the cadherin. This region induces clustering and also binds to the protein p120ctn. Probab=34.72 E-value=11 Score=15.77 Aligned_cols=12 Identities=58% Similarity=0.886 Sum_probs=10.1 Q ss_pred CCCCCHHHHHHH Q ss_conf 444420466776 Q 537021.9.peg.7 376 PGRGSGAGSVTA 387 (1033) Q Consensus 376 pGRGSAagSLVa 387 (1033) -|+||.||||-+ T Consensus 102 EG~GS~AgSLSS 113 (145) T pfam01049 102 EGRGSVAGSLSS 113 (145) T ss_pred CCCCCCCCCCCC T ss_conf 368875444022 No 324 >pfam00728 Glyco_hydro_20 Glycosyl hydrolase family 20, catalytic domain. This domain has a TIM barrel fold. Probab=34.68 E-value=23 Score=13.19 Aligned_cols=16 Identities=19% Similarity=0.374 Sum_probs=11.8 Q ss_pred HHHHHHHHHCCCCCCC Q ss_conf 8999998510476433 Q 537021.9.peg.7 360 VADFIQWAKKNNIPVG 375 (1033) Q Consensus 360 V~Div~~ak~~gI~vG 375 (1033) +.++|+||+.+||.|= T Consensus 72 i~~lv~yA~~rgI~vi 87 (335) T pfam00728 72 IREIVAYAAARGIEVI 87 (335) T ss_pred HHHHHHHHHHCCCEEE T ss_conf 9999999998599898 No 325 >COG1737 RpiR Transcriptional regulators [Transcription] Probab=34.64 E-value=23 Score=13.19 Aligned_cols=37 Identities=11% Similarity=0.076 Sum_probs=16.5 Q ss_pred CCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEECCC Q ss_conf 986899994399115899999999999818989980883 Q 537021.9.peg.7 179 GDRLYVNLQRHRGYDRYRESQVVQLAYTHELPLVATNNS 217 (1033) Q Consensus 179 gd~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTndv 217 (1033) ++++.+.+..-+-.. -...+.+.|++.|+|+|+..+. T Consensus 177 ~~Dv~i~iS~sG~t~--e~i~~a~~ak~~ga~vIaiT~~ 213 (281) T COG1737 177 PGDVVIAISFSGYTR--EIVEAAELAKERGAKVIAITDS 213 (281) T ss_pred CCCEEEEEECCCCCH--HHHHHHHHHHHCCCEEEEEECC T ss_conf 998899984899968--9999999999779929998399 No 326 >TIGR01448 recD_rel helicase, RecD/TraA family; InterPro: IPR006345 These sequences represent a family similar to RecD, the exodeoxyribonuclease V alpha chain of IPR006344 from INTERPRO. Members of this family, however, are not found in a context of RecB and RecC and are longer by about 200 amino acids at the amino end. Chlamydia muridarum has both a member of this family and a RecD. . Probab=34.55 E-value=22 Score=13.28 Aligned_cols=20 Identities=20% Similarity=0.230 Sum_probs=12.8 Q ss_pred CCHHHHHHHCCHHHHHHHHH Q ss_conf 23213565417899999999 Q 537021.9.peg.7 568 LIKFDFLGLKTLTLLQKSLD 587 (1033) Q Consensus 568 l~K~D~Lgl~~L~~i~~~~~ 587 (1033) -+--|||=..-.+||++.+- T Consensus 453 ~~~~DL~IvDE~SM~Dt~L~ 472 (769) T TIGR01448 453 PIDADLLIVDESSMVDTWLA 472 (769) T ss_pred CCCCCEEEEECCCHHHHHHH T ss_conf 47877699814621889999 No 327 >pfam07579 DUF1548 Domain of Unknown Function (DUF1548). This family appears to be found only in a small family of Chlamydia proteins. Probab=34.54 E-value=23 Score=13.18 Aligned_cols=74 Identities=26% Similarity=0.371 Sum_probs=45.1 Q ss_pred CCHHHHHHHHHHHCCCCC-HHH---CCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHH Q ss_conf 420466776554403756-333---2310567621001457875333656878999999983038021554220778778 Q 537021.9.peg.7 379 GSGAGSVTAYALTITDID-PLR---FSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQA 454 (1033) Q Consensus 379 GSAagSLVaY~LgIT~VD-Pi~---~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~ 454 (1033) -+-+--|-+|-|..|..+ |-+ |+.+|-||+|..|.|- ...++|+.+-+=..- =-- T Consensus 25 d~l~~~L~~lTl~~Ts~~~~~~~~~Y~~~~~~Fln~Y~~S~------------~~LV~~i~~q~l~ss---------~e~ 83 (135) T pfam07579 25 DHLNRNLDALTLRQTSRLYPQKHELYDRLKKRFLNAYRNSG------------SNLVNYILDQFLTSS---------PEQ 83 (135) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHHHCCC---------HHH T ss_conf 68874542667998740677648889999999999999706------------689999999986499---------999 Q ss_pred HHH-----HHHHHHHCCCCHHHHHH Q ss_conf 899-----99999752799889886 Q 537021.9.peg.7 455 KAA-----LRDVGRALQMPYSQVDR 474 (1033) Q Consensus 455 k~a-----ird~~r~~g~~~~~~~~ 474 (1033) +++ +-|+ +++++|+.+.-. T Consensus 84 ~~~l~~~ll~~l-~~i~lPE~~~~~ 107 (135) T pfam07579 84 KAALRNYLLDDL-TAINLPETEHAD 107 (135) T ss_pred HHHHHHHHHHHH-HHCCCCHHHHHH T ss_conf 999999999999-983897688998 No 328 >pfam08440 Poty_PP Potyviridae polyprotein. This domain is found in polyproteins of the viral Potyviridae taxon. Probab=34.52 E-value=17 Score=14.26 Aligned_cols=50 Identities=24% Similarity=0.353 Sum_probs=28.3 Q ss_pred HHHHHHHHHHCCCCCCCCCCC-CHHHHHHHHHHHCCCCCHHHCCC-HHHHHHHHCC Q ss_conf 889999985104764334444-20466776554403756333231-0567621001 Q 537021.9.peg.7 359 IVADFIQWAKKNNIPVGPGRG-SGAGSVTAYALTITDIDPLRFSL-LFERFLNPDR 412 (1033) Q Consensus 359 iV~Div~~ak~~gI~vGpGRG-SAagSLVaY~LgIT~VDPi~~~L-lFERFLnp~R 412 (1033) -+||.|.-.|.. .|.||. |+--+-|||-|- |+..-|.--+ +-+.-|+.|| T Consensus 121 ~lw~av~~~k~d---~~f~r~~s~~a~kiAytL~-TD~~sI~rTi~iId~Li~~E~ 172 (274) T pfam08440 121 KLWEAVLKYKPD---AGFGRLSSASACKIAYTLS-TDPTSIPRTIAIIDELLAEER 172 (274) T ss_pred HHHHHHHHHCCC---CCCCCCCCCCCCEEEEECC-CCHHHHHHHHHHHHHHHHHHH T ss_conf 999999984787---5678766332130668746-687689899999999999999 No 329 >KOG0400 consensus Probab=34.51 E-value=23 Score=13.17 Aligned_cols=18 Identities=28% Similarity=0.671 Sum_probs=8.7 Q ss_pred HCCCCHHHHHHHHHHHHH Q ss_conf 249999999999999997 Q 537021.9.peg.7 738 KNGISKTIAVNIFELLAK 755 (1033) Q Consensus 738 ~~g~~~~~a~~i~~~i~~ 755 (1033) .||..++.-+.++.+|.+ T Consensus 77 ~~Gl~PeiPeDLy~likk 94 (151) T KOG0400 77 SNGLAPEIPEDLYHLIKK 94 (151) T ss_pred HCCCCCCCCHHHHHHHHH T ss_conf 769988993889999999 No 330 >PRK07203 putative chlorohydrolase/aminohydrolase; Validated Probab=34.41 E-value=23 Score=13.16 Aligned_cols=22 Identities=14% Similarity=0.231 Sum_probs=11.3 Q ss_pred HHHHHCCCHHHHHHHHHHHCCCH Q ss_conf 88771740488887665411411 Q 537021.9.peg.7 631 ALEGMQPDCIEDIIALVSLYRPG 653 (1033) Q Consensus 631 ~l~~~~p~~~~dl~~~~al~RPG 653 (1033) -+..+.|-..-|||.+ -+-.|- T Consensus 362 ~iGsL~~Gk~ADlv~i-d~~~~~ 383 (442) T PRK07203 362 KFGLLEEGYKADLIIV-DYNPPT 383 (442) T ss_pred CCEEECCCCCCCEEEE-CCCCCC T ss_conf 8310278772478998-599985 No 331 >TIGR02198 rfaE_dom_I rfaE bifunctional protein, domain I; InterPro: IPR011913 RfaE is a protein involved in the biosynthesis of ADP-L-glycero-D-manno-heptose, a precursor for LPS inner core biosynthesis. RfaE is a bifunctional protein in Escherichia coli, and separate proteins in some other genome. The longer, N-terminal domain I (this family) is suggested to act in D-glycero-D-manno-heptose 1-phosphate biosynthesis, while domain II (IPR011914 from INTERPRO) adds ADP to yield ADP-D-glycero-D-manno-heptose .; GO: 0016773 phosphotransferase activity alcohol group as acceptor, 0016779 nucleotidyltransferase activity, 0005975 carbohydrate metabolic process. Probab=34.23 E-value=23 Score=13.14 Aligned_cols=30 Identities=13% Similarity=0.061 Sum_probs=19.7 Q ss_pred EECCC--HHHHHHHHHHHHHHHHHCCCEEEEC Q ss_conf 94399--1158999999999998189899808 Q 537021.9.peg.7 186 LQRHR--GYDRYRESQVVQLAYTHELPLVATN 215 (1033) Q Consensus 186 l~~~~--~~e~~~~~~l~~lA~~~~iPlVaTn 215 (1033) |.++. .-...+...++++|++.++|++.=+ T Consensus 151 LSDYaKGvLt~~v~~~~I~~Ar~~~~pVlVDP 182 (321) T TIGR02198 151 LSDYAKGVLTPSVVQEVIAAAREAGKPVLVDP 182 (321) T ss_pred EEECCCCCCCHHHHHHHHHHHHHCCCCEEEEC T ss_conf 98668763585789999999996689199807 No 332 >COG4865 Glutamate mutase epsilon subunit [Amino acid transport and metabolism] Probab=34.12 E-value=23 Score=13.13 Aligned_cols=78 Identities=23% Similarity=0.378 Sum_probs=44.5 Q ss_pred HHHHHHCCCCCC---CCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 673341033553---00125689887886326841307889762012110167787899999862499999999999999 Q 537021.9.peg.7 677 LIDPMLKETQGV---IIYQEQVMQIAQLLSGYSLSEADVLRRAMGKKIKEEMDKQKERFISGASKNGISKTIAVNIFELL 753 (1033) Q Consensus 677 ~l~~iL~~T~Gv---~vyQEQvm~ia~~~ag~~~~ead~lRra~~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i 753 (1033) .+|..|+-|.|| -|=--|+=.+.|++|- +| | |.++-..++. .-||..-....+|+++ T Consensus 234 iiEgLLA~eQGVksitiGygQvGNltQDvAa--------~~-A--------l~e~~~eYL~---~~~~~d~~isTVfhQW 293 (485) T COG4865 234 IIEGLLALEQGVKSITVGYGQVGNLTQDVAA--------IQ-A--------LRELAHEYLQ---SYGYTDYELSTVFHQW 293 (485) T ss_pred HHHHHHHHHHCCCEEEECCHHHCCHHHHHHH--------HH-H--------HHHHHHHHHC---CCCCCCEEEHHHHHHH T ss_conf 9999999983762277510110427889999--------99-9--------9999998711---2466613522688887 Q ss_pred HHHHHCCCCCCHHHHHHHHHHHHHH Q ss_conf 9750001331014788899999999 Q 537021.9.peg.7 754 AKFADYGFNKSHAAAYAVISYQTAW 778 (1033) Q Consensus 754 ~~f~~Y~FnksHa~aya~~~y~~ay 778 (1033) . =||++-.+-||++++|-+|- T Consensus 294 m----GGFPedE~kA~~~Is~~~a~ 314 (485) T COG4865 294 M----GGFPEDESKAFAIISWGAAV 314 (485) T ss_pred H----CCCCHHHHHHHEEHHHHHHH T ss_conf 5----48976665453111077788 No 333 >TIGR02248 mutH_TIGR DNA mismatch repair endonuclease MutH; InterPro: IPR004230 MutS, MutL and MutH are the three essential proteins for initiation of methyl-directed DNA mismatch repair to correct mistakes made during DNA replication in Escherichia coli. MutH cleaves a newly synthesized and unmethylated daughter strand 5' to the sequence d(GATC) in a hemi-methylated duplex. Activation of MutH requires the recognition of a DNA mismatch by MutS and MutL .; GO: 0004519 endonuclease activity, 0006304 DNA modification. Probab=33.95 E-value=23 Score=13.21 Aligned_cols=25 Identities=32% Similarity=0.388 Sum_probs=11.4 Q ss_pred HHHHHHHHHHHH-CCCCE-----EEEEEEEE Q ss_conf 749999999997-69963-----89999998 Q 537021.9.peg.7 53 FSALEFSQKACA-AGIQP-----IIGCQLDI 77 (1033) Q Consensus 53 ~Gav~F~~ackk-~gIKP-----IiG~E~~v 77 (1033) .-.+|..--|-. +|=|| -+|+|+-- T Consensus 43 G~lLE~~LGa~aLHGsK~EqDF~~LGvELKt 73 (220) T TIGR02248 43 GKLLELILGATALHGSKPEQDFAHLGVELKT 73 (220) T ss_pred HHHHHHHHCCCEECCCCCCCCHHHCCEEEEE T ss_conf 5798886463122167732262451736220 No 334 >cd00896 PI3Kc_III Phosphoinositide 3-kinase (PI3K), class III, catalytic domain; The PI3K catalytic domain family is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. PI3Ks catalyze the transfer of the gamma-phosphoryl group from ATP to the 3-hydroxyl of the inositol ring of D-myo-phosphatidylinositol (PtdIns) or its derivatives. PI3Ks play an important role in a variety of fundamental cellular processes, including cell motility, the Ras pathway, vesicle trafficking and secretion, immune cell activation and apoptosis. They can be divided into three main classes (I, II, and III), defined by their substrate specificity, regulation, and domain structure. Class III PI3Ks, also called Vps34 (vacuolar protein sorting 34), contain an N-terminal lipid binding C2 domain, a PI3K homology domain of unknown function, and a C-termin Probab=33.94 E-value=15 Score=14.66 Aligned_cols=43 Identities=28% Similarity=0.480 Sum_probs=23.9 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCC Q ss_conf 135654178999999998874025434332344358899876201410122114 Q 537021.9.peg.7 571 FDFLGLKTLTLLQKSLDFLAQRGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLE 624 (1033) Q Consensus 571 ~D~Lgl~~L~~i~~~~~~i~~~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~e 624 (1033) -|.|-+-..++++..+ ++.|. |+.- .+|+.+..|...|+-++- T Consensus 108 QD~l~lQli~lm~~i~---~~~~l--dl~l------~~Y~Vi~t~~~~GlIE~V 150 (350) T cd00896 108 QDQLVIQIISLMDRLL---KKENL--DLKL------TPYKVLATSPTDGLVEFI 150 (350) T ss_pred HHHHHHHHHHHHHHHH---HHCCC--CCEE------EEEEEEECCCCCEEEEEE T ss_conf 8899999999999999---97697--7410------136999746994379996 No 335 >cd01425 RPS2 Ribosomal protein S2 (RPS2), involved in formation of the translation initiation complex, where it might contact the messenger RNA and several components of the ribosome. It has been shown that in Escherichia coli RPS2 is essential for the binding of ribosomal protein S1 to the 30s ribosomal subunit. In humans, most likely in all vertebrates, and perhaps in all metazoans, the protein also functions as the 67 kDa laminin receptor (LAMR1 or 67LR), which is formed from a 37 kDa precursor, and is overexpressed in many tumors. 67LR is a cell surface receptor which interacts with a variety of ligands, laminin-1 and others. It is assumed that the ligand interactions are mediated via the conserved C-terminus, which becomes extracellular as the protein undergoes conformational changes which are not well understood. Specifically, a conserved palindromic motif, LMWWML, may participate in the interactions. 67LR plays essential roles in the adhesion of cells to the basement membrane an Probab=33.90 E-value=23 Score=13.10 Aligned_cols=67 Identities=12% Similarity=0.166 Sum_probs=27.4 Q ss_pred CCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCC-CCCEEECCCHHHHHHHHHHHHHHHHCCCCH Q ss_conf 310567621001457875333656878999999983038-021554220778778899999997527998 Q 537021.9.peg.7 401 SLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYG-HERVAQIITFGSLQAKAALRDVGRALQMPY 469 (1033) Q Consensus 401 ~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG-~~~va~i~t~~t~~~k~aird~~r~~g~~~ 469 (1033) +--++.|+-..|....=||++-....=.++.+++.+--. ...+.-|+|- .+++.+|+..++..|.++ T Consensus 16 np~m~~yI~g~r~g~~IidL~~T~~~L~~A~~~i~~~~~~~g~ILfVgtk--~~~~~~v~~~a~~~~~~y 83 (193) T cd01425 16 NPKMKPYIYGERNGIHIIDLEKTLEKLRLALNFIANIAAKGGKILFVGTK--PQAQRAVKKFAERTGSFY 83 (193) T ss_pred CCCCCCHHHCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHCCCEEEEEECC--HHHHHHHHHHHHHCCCEE T ss_conf 98720022245198199779999999999999999999759959999757--899999999999839945 No 336 >TIGR02025 BchH magnesium chelatase, H subunit; InterPro: IPR011771 This entry represents the H subunit of the magnesium chelatase complex responsible for magnesium insertion into the protoporphyrin IX ring in the biosynthesis of both chlorophyll and bacteriochlorophyll. In chlorophyll-utilizing species, this gene is known as ChlH, while in bacteriochlorophyll-utilizing spoecies it is called BchH. Subunit H is the largest (~140kDa) of the three subunits (the others being BchD/ChlD and BchI/ChlI), and is known to bind protoporphyrin IX. Subunit H is homologous to the CobN subunit of cobaltochelatase and by anology with that enzyme, subunit H is believed to also bind the magnesium ion which is inserted into the ring. In conjunction with the hydrolysis of ATP by subunits I and D, a conformation change is believed to happen in subunit H causing the magnesium ion insertion into the distorted protoporphyrin ring .; GO: 0016851 magnesium chelatase activity, 0015995 chlorophyll biosynthetic process. Probab=33.79 E-value=23 Score=13.09 Aligned_cols=710 Identities=16% Similarity=0.215 Sum_probs=304.1 Q ss_pred HHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHH-H--------HHH-----------------HCCCCCCCCCCCCCC Q ss_conf 99999999998189899808830078788999987-6--------688-----------------469857653100245 Q 537021.9.peg.7 196 RESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDV-L--------MAV-----------------AHSTVVSQKDRPRVT 249 (1033) Q Consensus 196 ~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~i-L--------~aI-----------------~~~~~l~~~~~~~~~ 249 (1033) ..+...++=+++|+|.++..+.-+.+-++|..-+. | .|| ..|++-... . T Consensus 366 ~~E~A~~iL~~ldVPY~~A~PL~fQ~~~~W~~~~~Gl~P~Qval~vAiPELDGAi~P~ilgGl~~~~gttCaGC-----d 440 (1384) T TIGR02025 366 AVEAAVEILKKLDVPYIVAIPLLFQTIEEWKASDLGLSPLQVALSVAIPELDGAIEPVILGGLEGGAGTTCAGC-----D 440 (1384) T ss_pred HHHHHHHHHHHCCCCEEECCCCEECCCCCCHHHHCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCEECCC-----C T ss_conf 89999998741588303206720126211103214886234323440112366415510135247888610244-----5 Q ss_pred CCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHH--HHH-----HHHHCCCCCCCCCC----------------------- Q ss_conf 343234347887512203589987886411100--013-----32210234555756----------------------- Q 537021.9.peg.7 250 PDHYLKNRSEMVSIFSDLPEALENTVEIARRCS--FIL-----QVHAPILPRFIEKE----------------------- 299 (1033) Q Consensus 250 ~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~--~~l-----~~~~~~lP~f~~p~----------------------- 299 (1033) ...-|-|.+.-.+..+ +| +.+..||+|+. +.| -+++.-.=-|.-|+ T Consensus 441 kGCqfsssdns~~~~~-~~---eR~~~La~R~~r~v~LR~~~~~ekKvAiv~f~FPP~~Gn~GTAAyLdVf~SL~~~L~~ 516 (1384) T TIGR02025 441 KGCQFSSSDNSAKAIP-VQ---ERLEALAERVKRWVRLRKKPNAEKKVAIVVFNFPPDLGNVGTAAYLDVFESLYELLKR 516 (1384) T ss_pred CCCCCCCCCCCCCEEE-CH---HHHHHHHHHHHHHHHHHHCCHHCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 5664445533244122-07---8999999999999997412000471789970588885124332342036899999999 Q ss_pred ----CCCCC--CCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCC Q ss_conf ----88776--898899987654103420124643245783899999887789898606305578889999985104764 Q 537021.9.peg.7 300 ----CDDIQ--KEEENELRNKAVAGLEVRLAQGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIP 373 (1033) Q Consensus 300 ----~~~~~--~~~~~~Lr~l~~~Gl~~R~~~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~ 373 (1033) |+++. -.+.+.|++.+.++-+.-|.... .+...+ +.|-..+--..|-.......+-...=| T Consensus 517 LK~~GY~V~~lPeS~e~L~~~v~~~~~~~~~~~~-nkPfP~----------~~v~~~~~v~~~~~~~~~~~rIE~~Wg-- 583 (1384) T TIGR02025 517 LKDEGYDVGDLPESVEELIEAVLEGNEEQYGSAD-NKPFPS----------LNVAARIPVEQYEKLYPELERIEEQWG-- 583 (1384) T ss_pred HHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCC-CCCCCH----------HHHCCCCHHHHHHHHHHHHHHHHHHCC-- T ss_conf 8626887245778888999998865898714436-788780----------331033304566765267899986027-- Q ss_pred CCCCCCCHHHHHHHHHHHCC--------------CCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCC Q ss_conf 33444420466776554403--------------7563332310567621001457875333656878999999983038 Q 537021.9.peg.7 374 VGPGRGSGAGSVTAYALTIT--------------DIDPLRFSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYG 439 (1033) Q Consensus 374 vGpGRGSAagSLVaY~LgIT--------------~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG 439 (1033) --||++---||==-+++|-- +=||+ .|||+|=..|.-. =---..||.+-|+ T Consensus 584 ~~PG~~~~dG~~~l~i~G~~fGNvfIGvQP~~GyegDPM--RLlF~r~~~PhH~-------------f~AFY~wL~~~f~ 648 (1384) T TIGR02025 584 KAPGELLTDGSDELFILGAQFGNVFIGVQPSFGYEGDPM--RLLFERDAAPHHA-------------FAAFYRWLDREFK 648 (1384) T ss_pred CCCCCCCCCCCCCEEEEEEEECCEEEEECCCCCCCCCCH--HHHHHCCCCCCHH-------------HHHHHHHHHHHHC T ss_conf 898774354653002000000426788668888367606--9997204787745-------------8999999868648 Q ss_pred CCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCHHHHHCCHHHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 02155422077877889999999752799889886543101458982443445229887886430742026889872011 Q 537021.9.peg.7 440 HERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDPAHPVSLQAIADDSRFREARLADPSVDRLLEISQKLE 519 (1033) Q Consensus 440 ~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~le 519 (1033) .|=|-+.+|.|++== =-|+-+| -.+..|+..-+.+-|.|--+.-++|.-.. +|+| T Consensus 649 ADAllHFGtHGslEf-----MPG~Q~G---------------ltg~CWPD~LlG~lPnfY~YavNNPSEa~---IAKR-- 703 (1384) T TIGR02025 649 ADALLHFGTHGSLEF-----MPGKQVG---------------LTGECWPDRLLGDLPNFYLYAVNNPSEAT---IAKR-- 703 (1384) T ss_pred CCEEEEECCCCCEEE-----CCCCCCC---------------CCCCCCCCHHCCCCCCCEECCCCCHHHHH---HHHH-- T ss_conf 973887457653222-----6778758---------------88777772010367510002788803789---9988-- Q ss_pred HHHHCCCCCCCEEEECC-CCHHHHEECCCCCCCCCEEE-EEECHHHHHHCCCHHHHHHHCCHHHHHHHHHHHHHHCCCCC Q ss_conf 22100244744033525-66666200123676886688-51021334311232135654178999999998874025434 Q 537021.9.peg.7 520 GLYRHASTHAAGIVIGD-RPLSQLVPMYRDVRSDLPVT-QFNMKWIEKAGLIKFDFLGLKTLTLLQKSLDFLAQRGVKVD 597 (1033) Q Consensus 520 g~~r~~~~Ha~Gvvi~~-~~l~~~~p~~~~~~~~~~~~-q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~~~~~~d 597 (1033) |. .+..||- .| |+.+ .|.+-= +==++-+++.+=.+--==.=+.-.+...+.+.+...+.+-| T Consensus 704 ---Rg-----~AtlvSy~~P-----P~~~---AGLYK~L~~LkeLi~~yr~~~~~~~~a~~~~~~e~I~~ka~~l~Ld~d 767 (1384) T TIGR02025 704 ---RG-----YATLVSYLTP-----PLER---AGLYKGLRELKELIAEYRELPSKNAEARGAEILEAIMKKAEELNLDTD 767 (1384) T ss_pred ---HH-----HHHHHHHCCC-----CHHH---CCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf ---65-----2232432358-----2342---365033799999999840377010002468999999999997178887 Q ss_pred CCCCC-----CCHHHHHHHHHCCCCCHHHHCCCHHHHHHH-HHHCCCHHHHHHH----HHHHCCC--HHHH--------- Q ss_conf 33234-----435889987620141012211453478888-7717404888876----6541141--1342--------- Q 537021.9.peg.7 598 LSLIP-----FDDHETYHLLTTKGTLGIFQLESSGMRQAL-EGMQPDCIEDIIA----LVSLYRP--GPID--------- 656 (1033) Q Consensus 598 ~~~ip-----~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l-~~~~p~~~~dl~~----~~al~RP--Gp~~--------- 656 (1033) ...++ +|| .=.+-+-.=-+.-+..+|++=+=.=| --=+|-++++.++ +++.-|| |-++ T Consensus 768 ~p~~~inaeemdq-~~~~~~v~rL~~~l~~lE~RLip~GLHV~G~a~~~~~~~~~L~~~~~~~Re~~Geld~~sLp~l~~ 846 (1384) T TIGR02025 768 VPEPKINAEEMDQ-EQFEDFVGRLYIYLTELENRLIPLGLHVLGEAPSAESLVDTLMEIASVDREDLGELDERSLPELVL 846 (1384) T ss_pred CCCCCCCHHHCCC-CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHH T ss_conf 8787777211368-703566766411231245512235760347897677899999999842686555412455689999 Q ss_pred --HH--HHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH--CCCCCCH------ Q ss_conf --00--36888504657850269667334103355300125689887886326841307889762--0121101------ Q 537021.9.peg.7 657 --NI--VVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEADVLRRAM--GKKIKEE------ 724 (1033) Q Consensus 657 --~~--~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ead~lRra~--~kk~~~~------ 724 (1033) .| +.++.|++-.+... --..++.|...++ +.|-+++.+.. ..+-|.- ..|.... T Consensus 847 ~~lG~~~~~~~r~se~g~~~-~~~~~~~~~~~~~------~~~~~~v~e~~-------~~~~~~~~f~~~~~~~~~~~ks 912 (1384) T TIGR02025 847 TTLGKNFEEISRRSEEGILE-EIESLEKVTEEAA------VAVKRLVEEAL-------EEDERGTMFAEKLESELKVAKS 912 (1384) T ss_pred HHHCCCHHHHHHHCCCCCHH-HHHHHHHHHHHHH------HHHHHHHHHHH-------HHCCCCEEEEECCCCEEEHHHH T ss_conf 76212556775313533278-9999999999999------99999999998-------7423530255214651202466 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHH-HHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHH Q ss_conf 6778789999986249999999999999-997500013310147888999999999973026667775100235420258 Q 537021.9.peg.7 725 MDKQKERFISGASKNGISKTIAVNIFEL-LAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVEKIK 803 (1033) Q Consensus 725 ~~~~~~~f~~g~~~~g~~~~~a~~i~~~-i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~~~~ 803 (1033) +.+.-...+.++..-.+++..++.+|+. +..- +|.---..=|.+++.. ..|.|-+. T Consensus 913 ~~~~~~~~~~~s~~pe~~~~~~~~l~~~GL~~~----------------------~~~~~~~~E~~gl~~a-L~G~Yi~P 969 (1384) T TIGR02025 913 KKEELEAVLQKSKQPELDKEKLDRLFQYGLERI----------------------LKLLLEDEEMNGLLKA-LEGEYIPP 969 (1384) T ss_pred HHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHH----------------------HHHHHHHCCHHHHHHH-HCCCEECC T ss_conf 789999997347875556478768999999999----------------------9988620222347876-26860278 Q ss_pred HHHHHHHHCCCCCCCCCCCC-CCEEEEEC-----------------------CC-------EEEEEHHHHCCCCHHHHHH Q ss_conf 99988886368503552334-21003531-----------------------98-------6985026858969899999 Q 537021.9.peg.7 804 KFCQDARQFNIQIMPPSVNT-PCVDFKVG-----------------------DN-------RIYYSLAAIKGVGTTTARH 852 (1033) Q Consensus 804 ~~i~e~~~~gi~vl~PdIN~-S~~~f~~~-----------------------~~-------~Ir~Gl~~Ikgvg~~~~~~ 852 (1033) .==.|.-|-|-.|||-==|- |---|++- ++ .|..|+-+||-=|+..+.. T Consensus 970 gPGGDLvR~Gp~vLPTGrNIHalDP~rIPs~~A~k~G~~~Ad~Ll~~~~~e~~G~yPEtiA~vLWGtDniKT~GE~~aq~ 1049 (1384) T TIGR02025 970 GPGGDLVRNGPNVLPTGRNIHALDPYRIPSPAAVKRGTRIADKLLEKHRAENGGNYPETIAVVLWGTDNIKTKGESVAQA 1049 (1384) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCHHHHH T ss_conf 85678342667658898642366875465289999899999999999998636897366775654201100154289999 Q ss_pred HHHHHHCC--CCCCHHHHHHHCC---CCCCCHHHHHHH-HHCCCCCCCCCCHHHHHHHHHHH---------HHHHHHH-- Q ss_conf 99998379--9999789997348---247998999988-86767234790389999989999---------9999988-- Q 537021.9.peg.7 853 IMEASADK--PFDSLEDFCSRVD---SKKLNRRVLESL-VFAGALDCFGYSRMQLLQSLDNI---------QKYAQWV-- 915 (1033) Q Consensus 853 Ive~r~~g--~f~sl~df~~rv~---~~~~nk~~le~L-i~aGAfD~~~~~R~~ll~~i~~~---------~~~~~~~-- 915 (1033) |- -=| |-.|-.==+.++. +-.++|--|.-+ --+|-|..+=.+-..||..+-++ +++.++. T Consensus 1050 l~---L~GA~P~~D~~Gri~~~~L~PLe~LGRPRIDVvv~lSGIFRD~f~~q~~LLD~avk~AA~ADEP~emNfvRKHaL 1126 (1384) T TIGR02025 1050 LA---LMGAEPVPDALGRIARYELIPLEKLGRPRIDVVVTLSGIFRDLFVNQMELLDEAVKLAAKADEPLEMNFVRKHAL 1126 (1384) T ss_pred HH---HCCCCCCCCCCCCEEEEEECCHHHCCCCEEEEEEECCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHH T ss_conf 99---818705634556301005534767379606888822844877899999999999999851277844661678899 Q ss_pred -HH-----HHHCCCCCCCCCCCCCCCCCCC-CCCCCCCCHHHHHHH---HHHHHCEEECCCC-HHHHHHHHHHCCCCCHH Q ss_conf -65-----4312777866665555655776-668999988899985---7887262116385-58999999874897688 Q 537021.9.peg.7 916 -EK-----NRTNKHENIFAHEKGTFSDKIT-LEKFSVENSSVRFEN---EQRVLGFYFSGHP-LDVYKPILRKMGIKGYE 984 (1033) Q Consensus 916 -~~-----~~~~~q~~LF~~~~~~~~~~~~-~~~~~~~~~~e~l~~---E~e~LGf~lS~HP-l~~~~~~l~~~~~~~~~ 984 (1033) .+ +-..--.-+|.+..+...-.+. .-+-..|...+.|+. -+.+.-|.=.+.+ ++.-.++|++. ..+.. T Consensus 1127 ~~~~e~Gi~~e~Aa~R~FsNA~G~YGsnVN~~vE~s~WE~e~eL~d~f~~RksFAYg~~~~gmm~~~~Eil~~~-L~~vD 1205 (1384) T TIGR02025 1127 AQAEEGGIDVEEAAARVFSNAPGSYGSNVNDLVENSAWEDEGELADTFIKRKSFAYGRQGKGMMEQKPEILQSA-LKNVD 1205 (1384) T ss_pred HHHHHCCCCHHHHHHHCCCCCCCCCCCCCCCCEECCCCCCHHHHHHHHHHCCEEECCCCCCCHHHCCHHHHHHH-HHCCC T ss_conf 99974597678886220468898755330352226888876679877564030220788787100035899987-20325 Q ss_pred HHHCCCCCCEE----------EEEEEEEEEEEECCCCCCEEEEEEEECCCCE Q ss_conf 98505589889----------9999996112101588876999999849704 Q 537021.9.peg.7 985 ESVATIRGNNI----------RLAAMVVSKQQKKTRKGSRIGWVTFSEPAKE 1026 (1033) Q Consensus 985 ~~~~~~~g~~v----------~iaG~V~~i~~~~TKkG~~mafltleD~~g~ 1026 (1033) ..-...|..++ ...|+-.++++..+|.. ..--+-++|.|+. T Consensus 1206 ~~fQn~DS~E~GltDIdhYy~~~Gg~~~av~~~~Gkts-~~~~~yv~d~t~~ 1256 (1384) T TIGR02025 1206 LTFQNVDSVEIGLTDIDHYYESLGGVSKAVRRAKGKTS-KAPAVYVEDTTKD 1256 (1384) T ss_pred EEEEEEEEEEEEECCCHHHCCCCCHHHHHHHHCCCCCC-CCCCCCEECCCCC T ss_conf 57775303898520100000246414334101168733-0553001014758 No 337 >cd05164 PIKKc Phosphoinositide 3-kinase-related protein kinase (PIKK) subfamily, catalytic domain; The PIKK catalytic domain subfamily is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. Members include ATM (Ataxia telangiectasia mutated), ATR (Ataxia telangiectasia and Rad3-related), TOR (Target of rapamycin), SMG-1 (Suppressor of morphogenetic effect on genitalia-1), and DNA-PK (DNA-dependent protein kinase). PIKKs have intrinsic serine/threonine kinase activity and are distinguished from other PKs by their unique catalytic domain, similar to that of lipid PI3K, and their large molecular weight (240-470 kDa). They show strong preference for phosphorylating serine/threonine residues followed by a glutamine and are also referred to as (S/T)-Q-directed kinases. They all contain a FATC (FRAP, ATM and TRRAP, C-terminal) d Probab=33.78 E-value=19 Score=13.78 Aligned_cols=26 Identities=23% Similarity=0.323 Sum_probs=19.2 Q ss_pred CCCCHHHHHHHHHHHCCCCCHHHCCCHH Q ss_conf 4442046677655440375633323105 Q 537021.9.peg.7 377 GRGSGAGSVTAYALTITDIDPLRFSLLF 404 (1033) Q Consensus 377 GRGSAagSLVaY~LgIT~VDPi~~~LlF 404 (1033) .+-.|+-|+|+|+|||.+=-|= |+|+ T Consensus 120 ~~S~A~~sv~~yilglgDRH~~--NIl~ 145 (222) T cd05164 120 TRSTAVMSIVGYILGLGDRHLD--NILI 145 (222) T ss_pred HHHHHHHHHHHHHHCCCCCCCC--CEEE T ss_conf 9999999999997125888866--5777 No 338 >PRK00074 guaA GMP synthase; Reviewed Probab=33.74 E-value=23 Score=13.08 Aligned_cols=16 Identities=31% Similarity=0.476 Sum_probs=7.7 Q ss_pred HHHHHHHCCCCHHHHH Q ss_conf 9999975279988988 Q 537021.9.peg.7 458 LRDVGRALQMPYSQVD 473 (1033) Q Consensus 458 ird~~r~~g~~~~~~~ 473 (1033) ||.+||.||+|.+.++ T Consensus 372 VR~lg~~Lglp~~~~~ 387 (513) T PRK00074 372 VRKLGLELGLPEEIVY 387 (513) T ss_pred HHHHHHHHCCCHHHHC T ss_conf 9999999689777602 No 339 >pfam00454 PI3_PI4_kinase Phosphatidylinositol 3- and 4-kinase. Some members of this family probably do not have lipid kinase activity and are protein kinases. Probab=33.61 E-value=20 Score=13.73 Aligned_cols=21 Identities=29% Similarity=0.158 Sum_probs=16.0 Q ss_pred CCCCHHHHHHHHHHHCCCCCH Q ss_conf 444204667765544037563 Q 537021.9.peg.7 377 GRGSGAGSVTAYALTITDIDP 397 (1033) Q Consensus 377 GRGSAagSLVaY~LgIT~VDP 397 (1033) .+-.|+-|+|+|+|||.+=-| T Consensus 121 ~~S~A~~sv~~YilglgDRH~ 141 (233) T pfam00454 121 VRSCAGMSVLDYILGNGDRHL 141 (233) T ss_pred HHHHHHHHHHHHHHCCCCCCC T ss_conf 987899999999812788887 No 340 >cd00561 CobA_CobO_BtuR ATP:corrinoid adenosyltransferase BtuR/CobO/CobP. This family consists of the BtuR, CobO, CobP proteins all of which are Cob(I)alamin (vitamin B12) adenosyltransferase, which is involved in cobalamin (vitamin B12) biosynthesis. This enzyme is a homodimer, which catalyzes the adenosylation reaction: ATP + cob(I)alamin + H2O <= phosphate + diphosphate + adenosylcobalamin. Probab=33.60 E-value=23 Score=13.06 Aligned_cols=51 Identities=18% Similarity=0.249 Sum_probs=32.4 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHC----CCCCCCCCCCCHHHHHHHHHHHCCCCCHHHC Q ss_conf 78989860630557888999998510----4764334444204667765544037563332 Q 537021.9.peg.7 344 ELNVIARMKFSGYFLIVADFIQWAKK----NNIPVGPGRGSGAGSVTAYALTITDIDPLRF 400 (1033) Q Consensus 344 EL~vI~~~gf~~YFLiV~Div~~ak~----~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~ 400 (1033) |+....+.|+.+ ..|+++.-++ ..+ |=.||+ +.-.|+..+=-||++-++|| T Consensus 103 Ei~~a~~~gli~----~~~v~~~l~~rp~~~ev-VlTGr~-~p~~L~e~ADlVTEm~~vKH 157 (159) T cd00561 103 EINYALGYGLLD----VEEVVDLLKAKPEDLEL-VLTGRN-APKELIEAADLVTEMREVKH 157 (159) T ss_pred HHHHHHHCCCCC----HHHHHHHHHHCCCCCEE-EEECCC-CCHHHHHHCCEEEEEEECCC T ss_conf 689999859917----99999999848999789-996999-99999985750246231156 No 341 >pfam07905 PucR Purine catabolism regulatory protein-like family. The bacterial proteins found in this family are similar to the purine catabolism regulatory protein expressed by Bacillus subtilis (PucR). PucR is thought to be a transcriptional activator involved in the induction of the purine degradation pathway, and may contain a LysR-like DNA-binding domain. It is similar to LysR-type regulators in that it represses its own expression. The other members of this family are also annotated as being putative regulatory proteins. Probab=33.58 E-value=23 Score=13.06 Aligned_cols=69 Identities=14% Similarity=0.116 Sum_probs=39.8 Q ss_pred HHCCCCEEEECCCCCCHHHHHHHCCCHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEECC Q ss_conf 6348988998197679678888779978999999999876598689999439911589999999999981898998088 Q 537021.9.peg.7 138 EIGTEGLIMLTGGSSGPIDRAFFFNSSQVAEKRLLTFKKLFGDRLYVNLQRHRGYDRYRESQVVQLAYTHELPLVATNN 216 (1033) Q Consensus 138 e~~~egLIvlsg~~~g~i~~~l~~~~~~~a~~~l~~l~~~Fgd~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTnd 216 (1033) ++-+.|-+++|.+.. ...+.+...+.+..+.+.----+-+.+.++- ...-..++++|+++++|++..+. T Consensus 38 ~~l~~gElvlTtg~~-------~~~~~~~~~~~i~~L~~~g~agL~i~~g~~~---~~iP~~~i~~a~~~~~Pli~iP~ 106 (122) T pfam07905 38 PWLRGGELLLTTGYG-------LKDDPEALREFVRELAEAGAAGLGIKTGRYI---PEIPEELIAAANRLGLPLIELPR 106 (122) T ss_pred HHCCCCEEEEEECCC-------CCCCHHHHHHHHHHHHHCCCEEEEEECCCCC---CCCCHHHHHHHHHCCCCEEEECC T ss_conf 963698599972554-------4799999999999999789649999425554---66999999999974997799459 No 342 >PTZ00010 beta tubulin; Provisional Probab=33.57 E-value=23 Score=13.06 Aligned_cols=46 Identities=20% Similarity=0.183 Sum_probs=31.7 Q ss_pred CHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHC Q ss_conf 838999998877898986063055788899999851047643344442046677655440 Q 537021.9.peg.7 333 HAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTI 392 (1033) Q Consensus 333 ~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgI 392 (1033) ..+++.+|+.+|++- -+--+.|++.+ . +|=|=||+-||+++-.|.- T Consensus 112 ~~~~i~d~irk~~E~---cD~l~GF~i~h---------S--l~GGTGSGlGs~l~e~L~d 157 (443) T PTZ00010 112 LIDSVLDVCRKEAES---CDCLQGFQLSH---------S--LGGGTGSGMGTLLISKLRE 157 (443) T ss_pred HHHHHHHHHHHHHHH---CCCCCCEEEEE---------E--CCCCCCCCHHHHHHHHHHH T ss_conf 899999999999980---78767558895---------2--6887766188999999986 No 343 >cd01996 Alpha_ANH_like_III This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins is predicted to bind ATP. This domain has a strongly conserved motif SGGKD at the N terminus. Probab=33.48 E-value=24 Score=13.05 Aligned_cols=101 Identities=18% Similarity=0.251 Sum_probs=52.0 Q ss_pred CHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 51774999999999769963899999985476643344565666677539999569689999999998997324877874 Q 537021.9.peg.7 50 NNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSV 129 (1033) Q Consensus 50 ~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p 129 (1033) |.-=+.+-.+.+.++.|++|+.= ++. . + -.+..|.+||-+++.. ++ T Consensus 10 GGKDS~y~~~~lk~kyglnpL~V---t~~--~----------------~----~~t~~g~~Nl~nl~~~-----g~---- 55 (154) T cd01996 10 GGKDSSYALYLLKEKYGLNPLAV---TVD--N----------------G----FNSEEAVKNIKNLIKK-----GL---- 55 (154) T ss_pred CCHHHHHHHHHHHHHHCCCEEEE---EEC--C----------------C----CCCHHHHHHHHHHHHC-----CC---- T ss_conf 88189999999999949965999---967--9----------------8----7546899999999974-----99---- Q ss_pred CCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHHHH-HHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHHC Q ss_conf 0078999863489889981976796788887799789999-999998765986899994399115899999999999818 Q 537021.9.peg.7 130 RICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVAEK-RLLTFKKLFGDRLYVNLQRHRGYDRYRESQVVQLAYTHE 208 (1033) Q Consensus 130 ~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a~~-~l~~l~~~Fgd~~ylEl~~~~~~e~~~~~~l~~lA~~~~ 208 (1033) +++.++.. .+..+. ....+..-+|+-++ +...-.....+++|.+++ T Consensus 56 -------------D~~~~~~~-------------~~~~~~l~~~~~~~~~gd~~~-------~~~~~i~~~~~~~A~k~~ 102 (154) T cd01996 56 -------------DLDHLVIN-------------PEEMKDLQLARFKAKVGDPCW-------PCDTAIFTSLYKVALKFG 102 (154) T ss_pred -------------CEEEEECC-------------HHHHHHHHHHHHHHCCCCCHH-------HHHHHHHHHHHHHHHHCC T ss_conf -------------81699328-------------889999999999844799348-------899879989999999939 Q ss_pred CCEEEECCC Q ss_conf 989980883 Q 537021.9.peg.7 209 LPLVATNNS 217 (1033) Q Consensus 209 iPlVaTndv 217 (1033) ||+|..... T Consensus 103 IplIi~Gen 111 (154) T cd01996 103 IPLIITGEN 111 (154) T ss_pred CCEEEECCC T ss_conf 998997278 No 344 >PRK13126 consensus Probab=33.25 E-value=24 Score=13.02 Aligned_cols=32 Identities=28% Similarity=0.299 Sum_probs=12.5 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCC Q ss_conf 7875333656878999999983038021554220 Q 537021.9.peg.7 415 MPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIIT 448 (1033) Q Consensus 415 ~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t 448 (1033) .||.-+|.+.+-. -..-..++||-+.+..+.+ T Consensus 101 ipDLP~e~~ee~~--~~~~~~~~~gl~~I~lv~p 132 (237) T PRK13126 101 APDLLIDFPGDLE--RYLELSREYGLAPSFFIPS 132 (237) T ss_pred ECCCCCCCCHHHH--HHHHHHHHCCCCEEEEECC T ss_conf 3688877817789--9999999769977997389 No 345 >TIGR00331 hrcA heat-inducible transcription repressor HrcA; InterPro: IPR002571 In response to elevated temperature, both prokaryotic and eukaryotic cells increase expression of a small family of chaperones. The regulatory network that functions to control the transcription of the heat shock genes in bacteria includes unique structural motifs in the promoter region of these genes and the expression of alternate sigma factors. One of the conserved structural motifs, the inverted repeat CIRCE element, is found in the 5' region of many heat shock operons . For Bacillus subtilis three classes of heat shock genes regulated by different mechanisms have been described. Regulation of class I heat shock genes (dnaK and groE operons) involves an inverted repeat (CIRCE element) which most probably serves as an operator for a repressor .; GO: 0006355 regulation of transcription DNA-dependent. Probab=33.19 E-value=24 Score=13.01 Aligned_cols=111 Identities=16% Similarity=0.277 Sum_probs=62.6 Q ss_pred HHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHH----------------HHHCCCCCCCCC Q ss_conf 6878999999983038021554220778778899999997527998898865----------------431014589824 Q 537021.9.peg.7 424 QDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRL----------------CKLIPNDPAHPV 487 (1033) Q Consensus 424 ~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~----------------~~~i~~~~~~~~ 487 (1033) ++|+..|++-+.+.| |-|=...+.|.-++..+ ++++.+++... |..||...|... T Consensus 1 t~R~~~iL~~Iv~~Y-------~~~~~PVGSk~L~~~~~--l~~SSATIRN~Ma~LE~~Gfi~k~HtSsGRiPt~~GYRy 71 (343) T TIGR00331 1 TERQRKILKAIVEEY-------IKTGEPVGSKTLLEKYN--LGLSSATIRNDMADLEELGFIEKPHTSSGRIPTDKGYRY 71 (343) T ss_pred CCHHHHHHHHHHHHH-------HHCCCCCHHHHHHHHCC--CCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHH T ss_conf 906889999999997-------33489620476775617--999970365779998750220047415787522300689 Q ss_pred CHHHHHCCHH--------HHH-HHHCCCCCH--HHHHHHHHHHHHHHCCCCCCCEEEECCCCH------HHHEECCC Q ss_conf 4344522988--------788-643074202--688987201122100244744033525666------66200123 Q 537021.9.peg.7 488 SLQAIADDSR--------FRE-ARLADPSVD--RLLEISQKLEGLYRHASTHAAGIVIGDRPL------SQLVPMYR 547 (1033) Q Consensus 488 ~~~~~~~~~~--------~~~-~~~~~~~~~--~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l------~~~~p~~~ 547 (1033) ..+.+.+... .+. +........ +++..|..+-.-. +|..|||+.|.+= .+++|+.. T Consensus 72 YVd~l~~~~~~~~~e~~~I~~~~~~~~~~~~~~~~l~~A~~~LS~l----~~~~~vv~~p~~~~~~~~~i~Li~l~~ 144 (343) T TIGR00331 72 YVDHLLKVDSLTEEEKRRIQNQLLQERFELEDVKVLKLAASLLSEL----TNYTAVVLGPRLSQSKLKHIELIPLDP 144 (343) T ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH----HCCEEEEEEECCCCCCEEEEEEEEECC T ss_conf 9997447888787899999998635312402389999999999997----657388870022444110356675366 No 346 >KOG2836 consensus Probab=33.18 E-value=24 Score=13.01 Aligned_cols=33 Identities=15% Similarity=0.278 Sum_probs=20.9 Q ss_pred EEEECCCHHHHHHHHHHHHHHCCCCEEEE-EEEE Q ss_conf 99811651774999999999769963899-9999 Q 537021.9.peg.7 44 IAITDTNNLFSALEFSQKACAAGIQPIIG-CQLD 76 (1033) Q Consensus 44 iAITD~~nl~Gav~F~~ackk~gIKPIiG-~E~~ 76 (1033) .-|||.-+=+..-.|...-+|+|...++- ||.+ T Consensus 19 FLIThnPtnaTln~fieELkKygvttvVRVCe~T 52 (173) T KOG2836 19 FLITHNPTNATLNKFIEELKKYGVTTVVRVCEPT 52 (173) T ss_pred EEEECCCCCHHHHHHHHHHHHCCCEEEEEECCCC T ss_conf 9985598711499999999963972899951464 No 347 >COG5249 RER1 Golgi protein involved in Golgi-to-ER retrieval [Intracellular trafficking and secretion] Probab=33.05 E-value=16 Score=14.39 Aligned_cols=23 Identities=30% Similarity=0.488 Sum_probs=13.9 Q ss_pred HHHHHHHHHCCCCCHHHCCCHHHHHHHHC Q ss_conf 66776554403756333231056762100 Q 537021.9.peg.7 383 GSVTAYALTITDIDPLRFSLLFERFLNPD 411 (1033) Q Consensus 383 gSLVaY~LgIT~VDPi~~~LlFERFLnp~ 411 (1033) =-+|||||||--.+. |=-||-|. T Consensus 57 wY~icY~LgiyLLn~------flaFLTPK 79 (180) T COG5249 57 WYLICYCLGIYLLNA------FLAFLTPK 79 (180) T ss_pred HHHHHHHHHHHHHHH------HHHHHCCC T ss_conf 356899899999999------99994887 No 348 >cd02186 alpha_tubulin The tubulin superfamily includes five distinct families, the alpha-, beta-, gamma-, delta-, and epsilon-tubulins and a sixth family (zeta-tubulin) which is present only in kinetoplastid protozoa. The alpha- and beta-tubulins are the major components of microtubules, while gamma-tubulin plays a major role in the nucleation of microtubule assembly. The delta- and epsilon-tubulins are widespread but unlike the alpha, beta, and gamma-tubulins they are not ubiquitous among eukaryotes. The alpha/beta-tubulin heterodimer is the structural subunit of microtubules. The alpha- and beta-tubulins share 40% amino-acid sequence identity, exist in several isotype forms, and undergo a variety of posttranslational modifications. The structures of alpha- and beta-tubulin are basically identical: each monomer is formed by a core of two beta-sheets surrounded by alpha-helices. The monomer structure is very compact, but can be divided into three regions based on function: the amino Probab=33.02 E-value=18 Score=13.96 Aligned_cols=45 Identities=27% Similarity=0.211 Sum_probs=31.8 Q ss_pred CHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 83899999887789898606305578889999985104764334444204667765544 Q 537021.9.peg.7 333 HAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALT 391 (1033) Q Consensus 333 ~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~Lg 391 (1033) ..+++.+|+.+|++-- +.-..|++++ . +|=|=||+-||+++-.|. T Consensus 113 ~~d~i~d~IRkevE~C---D~l~gf~~~h---------S--l~GGTGSGlGs~lle~L~ 157 (434) T cd02186 113 IIDLVLDRIRKLADNC---TGLQGFLIFH---------S--FGGGTGSGFGSLLLERLS 157 (434) T ss_pred HHHHHHHHHHHHHHCC---CCCCCEEEEE---------E--CCCCCCCHHHHHHHHHHH T ss_conf 7889999999999738---9866436785---------3--688762329999999987 No 349 >cd05166 PI3Kc_II Phosphoinositide 3-kinase (PI3K), class II, catalytic domain; The PI3K catalytic domain family is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. PI3Ks catalyze the transfer of the gamma-phosphoryl group from ATP to the 3-hydroxyl of the inositol ring of D-myo-phosphatidylinositol (PtdIns) or its derivatives. PI3Ks play an important role in a variety of fundamental cellular processes, including cell motility, the Ras pathway, vesicle trafficking and secretion, immune cell activation and apoptosis. They can be divided into three main classes (I, II, and III), defined by their substrate specificity, regulation, and domain structure. Class II PI3Ks preferentially use PtdIns as a substrate to produce PtdIns(3)P, but can also phosphorylate PtdIns(4)P. They function as monomers and do not associate with any Probab=32.98 E-value=16 Score=14.39 Aligned_cols=44 Identities=30% Similarity=0.459 Sum_probs=25.5 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCC Q ss_conf 2135654178999999998874025434332344358899876201410122114 Q 537021.9.peg.7 570 KFDFLGLKTLTLLQKSLDFLAQRGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLE 624 (1033) Q Consensus 570 K~D~Lgl~~L~~i~~~~~~i~~~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~e 624 (1033) .=|.|.+-.+++++...+. .| +|+.-. +|+.+..|...|+-++- T Consensus 102 RQD~L~lQli~~md~i~~~---~g--ldl~l~------pY~vlatg~~~G~IE~V 145 (353) T cd05166 102 RQDMLVLQMINIMDKIWLQ---EG--LDLRMI------TFRCLSTGYDRGMVELV 145 (353) T ss_pred CHHHHHHHHHHHHHHHHHH---CC--CCEEEE------EEEEEEECCCCEEEEEE T ss_conf 2899999999999999997---59--983888------76999935995138980 No 350 >KOG2805 consensus Probab=32.79 E-value=24 Score=12.96 Aligned_cols=214 Identities=16% Similarity=0.198 Sum_probs=94.1 Q ss_pred EEEECCCCCCHHHHHHHCCC-HHHHHHHHHHHHHHCCCCEEE----EEECCC--HHHHHHHHHHHHHHHHHCCCEEEECC Q ss_conf 89981976796788887799-789999999998765986899----994399--11589999999999981898998088 Q 537021.9.peg.7 144 LIMLTGGSSGPIDRAFFFNS-SQVAEKRLLTFKKLFGDRLYV----NLQRHR--GYDRYRESQVVQLAYTHELPLVATNN 216 (1033) Q Consensus 144 LIvlsg~~~g~i~~~l~~~~-~~~a~~~l~~l~~~Fgd~~yl----El~~~~--~~e~~~~~~l~~lA~~~~iPlVaTnd 216 (1033) ++.+||+.+|-++-.++... ++. --+|. ++..++ .+-+.-...+...++.++||+= . T Consensus 9 vvamSgGVDSsVaa~Ll~~~g~~v-------------~gv~M~nWd~~de~~s~cp~e~D~~da~~Vc~~LnI~~~---~ 72 (377) T KOG2805 9 VVAMSGGVDSSVAARLLAARGYNV-------------TGVFMKNWDSLDEFGSQCPAERDWKDAKRVCKQLNIPLH---Q 72 (377) T ss_pred EEEECCCCHHHHHHHHHHHCCCCE-------------EEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHCCEEE---E T ss_conf 999537711899999997418871-------------699662201222356689811208999999987097027---8 Q ss_pred CCCCCHHHHHH--HHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHCC-HHHHHHHHHH--HHHH----HHHHHHH- Q ss_conf 30078788999--98766884698576531002453432343478875122-0358998788--6411----1000133- Q 537021.9.peg.7 217 SLFLSEEDYEA--HDVLMAVAHSTVVSQKDRPRVTPDHYLKNRSEMVSIFS-DLPEALENTV--EIAR----RCSFILQ- 286 (1033) Q Consensus 217 v~Y~~~~D~~~--h~iL~aI~~~~~l~~~~~~~~~~~~yLks~eEM~~~f~-~~peAi~NT~--~IAe----~c~~~l~- 286 (1033) |+| .+|-|.. -..|.--+.|.+-+ ++ ..-+.+.| |. -+..|++|-. .||. +..+... T Consensus 73 Vnf-~kEYW~~Vfs~~L~~Y~~G~TPN-PD---I~CN~~IK--------Fg~~~~~a~en~~~d~latGHYAr~~~~~~~ 139 (377) T KOG2805 73 VNF-VKEYWNDVFSPFLEEYENGRTPN-PD---ILCNKHIK--------FGKFFKHAIENLGYDWLATGHYARVVLEDED 139 (377) T ss_pred EEE-HHHHHHHHHHHHHHHHHCCCCCC-CC---CCCCCEEE--------CCHHHHHHHHHCCCCEEEEEEEEEEECCCCC T ss_conf 743-79999999999988876589999-97---13445142--------1178999987558873774212144037556 Q ss_pred --HHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH-CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf --22102345557568877689889998765410342-012464324578389999988778989860630557888999 Q 537021.9.peg.7 287 --VHAPILPRFIEKECDDIQKEEENELRNKAVAGLEV-RLAQGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADF 363 (1033) Q Consensus 287 --~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~-R~~~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Di 363 (1033) +.....|..+.. +..-+|..+..+-|+. +++-+ ...+ .++ T Consensus 140 ~~~~~l~~~~d~~K-------DQt~FL~~in~~~L~r~lfPlg-----~~~K-------------------------~eV 182 (377) T KOG2805 140 NAESHLLISKDMVK-------DQTYFLSTINQTQLKRLLFPLG-----CLTK-------------------------SEV 182 (377) T ss_pred CCCEEEEECCCCCC-------CCEEEEECCCHHHHHHHHCCCC-----CCCH-------------------------HHH T ss_conf 76235761256667-------7326763036999986206676-----5677-------------------------999 Q ss_pred HHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 998510476433444420466776554403756333231056762100145787533365687899999998303802 Q 537021.9.peg.7 364 IQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHE 441 (1033) Q Consensus 364 v~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~ 441 (1033) -+.|++.|.++.----| .||-=|.++++ |.-||-..=.++|-+=+++... .-+.+-+|.. T Consensus 183 k~lA~~~gf~~aeK~eS---------qGICFvgk~~~---F~dFl~~yi~~~~g~Il~i~~g------s~vg~h~G~h 242 (377) T KOG2805 183 KKLAKQAGFPNAEKPES---------QGICFVGKIKH---FSDFLQRYIGSSPGPILEIDSG------SVVGNHRGIH 242 (377) T ss_pred HHHHHHCCCCCCCCCCC---------CEEEEECCCHH---HHHHHHHHCCCCCCCEEEECCC------CCCCCCCCEE T ss_conf 99998659864568665---------02688536234---7899998547899871883288------6223332357 No 351 >cd01305 archeal_chlorohydrolases Predicted chlorohydrolases. These metallo-dependent hydrolases from archea are part of the superfamily of metallo-dependent hydrolases, a large group of proteins that show conservation in their 3-dimensional fold (TIM barrel) and in details of their active site. They have a conserved metal binding site, involving four histidines and one aspartic acid residue. In the common reaction mechanism, the metal ion (or ions) deprotonate a water molecule for a nucleophilic attack on the substrate. Some members of this subgroup are predicted to be chlorohyrolases. Probab=32.76 E-value=24 Score=12.96 Aligned_cols=64 Identities=13% Similarity=0.158 Sum_probs=36.2 Q ss_pred CCCHHHHHHCHHHHHHHCCC---HHHHHHHHHHCCCCEE-EEECCCHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 88133433144476443299---8999999997899889-9811651774999999999769963899 Q 537021.9.peg.7 9 PSFVHLQVHSSYSLLEGALS---LNNILDKIAADQQPAI-AITDTNNLFSALEFSQKACAAGIQPIIG 72 (1033) Q Consensus 9 ~~FvhLh~hS~ySlldg~~~---~e~LV~~A~~~G~~Ai-AITD~~nl~Gav~F~~ackk~gIKPIiG 72 (1033) |-||.-|+|...|+++|... .++++.-.....++-. ++++..-..++......+.+.|+.-+.. T Consensus 4 PGlVNaH~Hl~~~~~rg~~~d~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~t~~~d 71 (263) T cd01305 4 PALVNAHTHLGDSAIKEVGDGLPLDDLVAPPDGLKHRLLAQADDRELAEAMRKVLRDMRETGIGAFAD 71 (263) T ss_pred CCCCCHHHCHHHHHHHHCCCCCCHHHHHHCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 58211886878787862569999899830635426899874899999999999999999759969996 No 352 >PRK02227 hypothetical protein; Provisional Probab=32.73 E-value=24 Score=12.96 Aligned_cols=16 Identities=19% Similarity=0.412 Sum_probs=8.1 Q ss_pred HHHHHHHHHCCCCCCC Q ss_conf 8999998510476433 Q 537021.9.peg.7 360 VADFIQWAKKNNIPVG 375 (1033) Q Consensus 360 V~Div~~ak~~gI~vG 375 (1033) ...+|..||++|..+| T Consensus 169 L~~fv~~a~~~gl~~g 184 (239) T PRK02227 169 LAEFVAEARAHGLMTA 184 (239) T ss_pred HHHHHHHHHHCCCEEE T ss_conf 9999999997599399 No 353 >COG1448 TyrB Aspartate/tyrosine/aromatic aminotransferase [Amino acid transport and metabolism] Probab=32.66 E-value=24 Score=12.95 Aligned_cols=125 Identities=13% Similarity=0.197 Sum_probs=72.3 Q ss_pred CCHHHHHHHHHHHHHH--HHCCCCCCHHHHHHHHHHHHHHHHHHHHHHH--HHHHHCCCCCCCHHHHHHHHHHHHCCCCC Q ss_conf 9999999999999975--0001331014788899999999997302666--77751002354202589998888636850 Q 537021.9.peg.7 741 ISKTIAVNIFELLAKF--ADYGFNKSHAAAYAVISYQTAWMKTHYPVEF--LAASMTLEMDNVEKIKKFCQDARQFNIQI 816 (1033) Q Consensus 741 ~~~~~a~~i~~~i~~f--~~Y~FnksHa~aya~~~y~~aylK~~yP~eF--~aa~L~~~~~~~~~~~~~i~e~~~~gi~v 816 (1033) -+.+++.++.++++.- +.|+=+-+|.++-.-...-...||+.+-.|- |-..|. .-...++++++..+ T Consensus 263 ~~~~~a~~v~sqlk~~iR~~ySnPP~~Ga~vva~IL~~p~Lra~W~~El~~Mr~Ri~------~mR~~lv~~L~~~~--- 333 (396) T COG1448 263 EDAEEADRVLSQLKAIIRTNYSNPPAHGAAVVATILNNPELRAEWEQELEEMRQRIL------EMRQALVDALKALG--- 333 (396) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHC--- T ss_conf 788999999999999998635898412399999994899999999999999999999------99999999998627--- Q ss_pred CCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHHC Q ss_conf 355233421003531986985026858969899999999983799999789997348247998999988867 Q 537021.9.peg.7 817 MPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLVFA 888 (1033) Q Consensus 817 l~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li~a 888 (1033) +.-.++|.+...+ +=+--|++.+-+..+-|+ -+-|..-.. |++..++|++.++.|++| T Consensus 334 -----~~~~f~~i~~Q~G----MFsy~Gls~~QV~rLree--~~IY~v~sG---Ri~vaGl~~~ni~~va~a 391 (396) T COG1448 334 -----APRNFDFIISQRG----MFSYTGLSPEQVDRLREE--FGIYLVASG---RINVAGLNTSNIDYVAKA 391 (396) T ss_pred -----CCCCCCHHHHCCC----EEECCCCCHHHHHHHHHH--CCEEEECCC---EEEECCCCHHHHHHHHHH T ss_conf -----8866224754675----244589999999999985--667995378---146536874408999999 No 354 >cd05173 PI3Kc_IA_beta Phosphoinositide 3-kinase (PI3K), class IA, beta isoform, catalytic domain; The PI3K catalytic domain family is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. PI3Ks catalyze the transfer of the gamma-phosphoryl group from ATP to the 3-hydroxyl of the inositol ring of D-myo-phosphatidylinositol (PtdIns) or its derivatives. PI3Ks can be divided into three main classes (I, II, and III), defined by their substrate specificity, regulation, and domain structure. Class I PI3Ks are the only enzymes capable of converting PtdIns(4,5)P2 to the critical second messenger PtdIns(3,4,5)P3. Class I enzymes are heterodimers and exist in multiple isoforms consisting of one catalytic subunit (out of four isoforms) and one of several regulatory subunits. They are further classified into class IA (alpha, beta and de Probab=32.59 E-value=20 Score=13.59 Aligned_cols=87 Identities=21% Similarity=0.275 Sum_probs=46.5 Q ss_pred HHHCCCCCCCEEEECCCCH--HHHEECCCC------CCCCCEEEEEECHHHHHHCCCHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 2100244744033525666--662001236------76886688510213343112321356541789999999988740 Q 537021.9.peg.7 521 LYRHASTHAAGIVIGDRPL--SQLVPMYRD------VRSDLPVTQFNMKWIEKAGLIKFDFLGLKTLTLLQKSLDFLAQR 592 (1033) Q Consensus 521 ~~r~~~~Ha~Gvvi~~~~l--~~~~p~~~~------~~~~~~~~q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~~ 592 (1033) +|-+.++..+|+++..-.+ ....|+... ..+...++-=..|| |.=|.|.+-.+++++...+. . T Consensus 55 lPldP~~~~~~i~~~~c~v~~S~~~Pl~l~f~n~~~~~~~~~vIfK~GDD------LRQD~L~lQli~lmd~iwk~---~ 125 (362) T cd05173 55 SPLNPSIILSELNVEKCKYMDSKMKPLWIVYNNKLFGGDSLGIIFKNGDD------LRQDMLTLQILRLMDTLWKE---A 125 (362) T ss_pred CCCCCCEEEEEEECCCCEEECCCCCCEEEEEECCCCCCCCEEEEEECCCC------CCHHHHHHHHHHHHHHHHHH---C T ss_conf 99996016843752445773214796489974258899816899987987------04999999999999999997---5 Q ss_pred CCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCC Q ss_conf 25434332344358899876201410122114 Q 537021.9.peg.7 593 GVKVDLSLIPFDDHETYHLLTTKGTLGIFQLE 624 (1033) Q Consensus 593 ~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~e 624 (1033) | +|+.-.| |+.++.|...|+-++- T Consensus 126 g--LDL~l~P------Y~vlaTg~~~GlIE~V 149 (362) T cd05173 126 G--LDLRIVP------YGCLATGDRSGLIEVV 149 (362) T ss_pred C--CCEEEEE------EEEEEECCCCEEEEEE T ss_conf 9--9859998------7999975995069980 No 355 >TIGR02392 rpoH_proteo alternative sigma factor RpoH; InterPro: IPR012759 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , . This entry represents the proteobacterial clade of sigma factors called RpoH. This protein may be called sigma-32, sigma factor H, heat shock sigma factor, and alternative sigma factor RpoH. Note that in some species the single locus rpoH may be replaced by two or more differentially regulated stress response sigma factors.; GO: 0003677 DNA binding, 0003700 transcription factor activity, 0008270 zinc ion binding, 0016987 sigma factor activity, 0006352 transcription initiation, 0006355 regulation of transcription DNA-dependent, 0009408 response to heat. Probab=32.52 E-value=19 Score=13.85 Aligned_cols=221 Identities=19% Similarity=0.295 Sum_probs=110.1 Q ss_pred CHHHHHHHHHHHHHHH--HHHCCHHHHHHHHHHHHHHHCCCCCC-------CCCCCCHHHHHHHHHHHCCCCCHHHCCCH Q ss_conf 8389999988778989--86063055788899999851047643-------34444204667765544037563332310 Q 537021.9.peg.7 333 HAQDYRQRLDFELNVI--ARMKFSGYFLIVADFIQWAKKNNIPV-------GPGRGSGAGSVTAYALTITDIDPLRFSLL 403 (1033) Q Consensus 333 ~~~~y~~RLe~EL~vI--~~~gf~~YFLiV~Div~~ak~~gI~v-------GpGRGSAagSLVaY~LgIT~VDPi~~~Ll 403 (1033) +.+.-++=+---|..+ .-+||.||=|-..|+|. +=||-+ =|.||= =||+|+. T Consensus 33 Dl~AA~~Lv~SHLRfVv~iArgY~GYGL~~aDLIQ---EGNIGLMkAVkRFdP~~Gv---RLvSfAv------------- 93 (279) T TIGR02392 33 DLDAAKKLVLSHLRFVVKIARGYRGYGLPQADLIQ---EGNIGLMKAVKRFDPERGV---RLVSFAV------------- 93 (279) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHH---HHHHHHHHHHHHCCCCCCC---CHHHHHH------------- T ss_conf 77899987466247875532036777744244342---2226655334421789872---2047888------------- Q ss_pred HHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHH-----HHHHHHHHHHHHHC-C-CCHHHHHHHH Q ss_conf 567621001457875333656878999999983038021554220778-----77889999999752-7-9988988654 Q 537021.9.peg.7 404 FERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGS-----LQAKAALRDVGRAL-Q-MPYSQVDRLC 476 (1033) Q Consensus 404 FERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t-----~~~k~aird~~r~~-g-~~~~~~~~~~ 476 (1033) +=-|-++++||-.=| +-..|+|--- |.-|++-+-+.+.- | +++.+++.+| T Consensus 94 --------------------HWIkAeIHEyILrnW---RlVKvATTkaQrKLFFNLRk~K~ki~~l~~gw~~~~e~~~~A 150 (279) T TIGR02392 94 --------------------HWIKAEIHEYILRNW---RLVKVATTKAQRKLFFNLRKMKKKITRLQDGWLNPEEVEAIA 150 (279) T ss_pred --------------------HHHHHHHHHHHHHHH---HHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHH T ss_conf --------------------999999866899755---598885277678887658889998631015798989999999 Q ss_pred HHCCCCCCCCCCHHHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECC-CCHHHHEECCC--CCCCCC Q ss_conf 310145898244344522988788643074202688987201122100244744033525-66666200123--676886 Q 537021.9.peg.7 477 KLIPNDPAHPVSLQAIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGD-RPLSQLVPMYR--DVRSDL 553 (1033) Q Consensus 477 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~-~~l~~~~p~~~--~~~~~~ 553 (1033) +..- - + ..++.++=.||.|.=-.+-.+... .. +....+.|+.. +..... T Consensus 151 ~~L~----V--~-------------------~~eV~eMe~RlsG~D~~L~~~~~~---~~~~~~~~~~~~~yL~D~~~~~ 202 (279) T TIGR02392 151 EELG----V--S-------------------EEEVLEMESRLSGRDISLNASIDD---DEDDGGSDFAPIAYLADKATSD 202 (279) T ss_pred HHCC----C--C-------------------HHHHHHHHHCCCCCCCCCCCCCCC---CCCCCCCHHHHHHHHCCCCCCC T ss_conf 7638----9--8-------------------889988751455377334788765---5345631145788630477778 Q ss_pred EEEEEECHHHHHHCCCHHHHHHHCCHHHHHHHHHHHHHHCCCCCCC-CCCCCH-HHHHHHHHCCCCC---HHHHCCCHHH Q ss_conf 6885102133431123213565417899999999887402543433-234435-8899876201410---1221145347 Q 537021.9.peg.7 554 PVTQFNMKWIEKAGLIKFDFLGLKTLTLLQKSLDFLAQRGVKVDLS-LIPFDD-HETYHLLTTKGTL---GIFQLESSGM 628 (1033) Q Consensus 554 ~~~q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~~~~~~d~~-~ip~~D-~~~~~l~~~g~t~---GvFQ~es~~~ 628 (1033) +-.+...++-++. . -..|+.|+....++-.+|=-. .+. || |.|++-|+.-.-+ =|=|+|..+| T Consensus 203 ~~~~l~~~~~~~~----------~-~~~L~~AL~~Ld~RsR~I~~~RwL~-d~~~~TL~~LA~eygvSaERiRQiE~~Am 270 (279) T TIGR02392 203 PEATLEEEQWEEL----------Q-TQKLANALESLDARSRRIIEARWLD-DDGKLTLHELAAEYGVSAERIRQIEKNAM 270 (279) T ss_pred HHHHHHHHHHHHH----------H-HHHHHHHHHHHHHHHHHHHHHHCCC-CCCCCCHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 8899999899999----------9-9999999885228879999873027-88996478999884874667999999999 Q ss_pred HHHHHHH Q ss_conf 8888771 Q 537021.9.peg.7 629 RQALEGM 635 (1033) Q Consensus 629 ~~~l~~~ 635 (1033) +++=.-+ T Consensus 271 ~K~k~~~ 277 (279) T TIGR02392 271 KKLKAAL 277 (279) T ss_pred HHHHHHH T ss_conf 9999984 No 356 >TIGR01755 flav_wrbA flavoprotein WrbA; InterPro: IPR010089 This entry represents a protein, WrbA, related to and slightly larger than flavodoxin. In Escherichia coli, this protein is produced during stationary phase, binds to the trp repressor, and makes trp operon repression more efficient. WrbA does not interact with the trp operator by itself. Three copies are found in Sinorhizobium meliloti, and members are found in species in which homologs of the Escherichia coli trp operon repressor TrpR (P03032 from SWISSPROT) are not detected.; GO: 0010181 FMN binding, 0016481 negative regulation of transcription. Probab=32.43 E-value=24 Score=12.92 Aligned_cols=142 Identities=16% Similarity=0.198 Sum_probs=81.6 Q ss_pred CHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHH-HHCCCCCCC Q ss_conf 5177499999999976996389999998547664334456566667753999956968999999999899-732487787 Q 537021.9.peg.7 50 NNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLLVSTAEGYQRLIELVSRM-YLLDQGKQS 128 (1033) Q Consensus 50 ~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a-~~~~~~~~~ 128 (1033) =++||+++-.-.|-..|.+-+=|+|+.|.. .++..+ .++.+-+-.. +. .-+..- T Consensus 8 YS~YGh~e~MA~AvAeGa~~~~GAEv~vkr------------VPE~~P------------eev~~~~Gaie~~-K~dq~a 62 (205) T TIGR01755 8 YSMYGHIETMAKAVAEGAREVEGAEVVVKR------------VPELVP------------EEVAEKSGAIEHK-KLDQEA 62 (205) T ss_pred ECCHHHHHHHHHHHHCCCCCCCCEEEEEEE------------CCCCCC------------HHHHHHCCCCCCC-CCCCCC T ss_conf 144016899999986246436851798873------------788887------------7999851331255-544337 Q ss_pred CCCCHHH----HHHHCCCCEEEECCCCCCHHHHHHHCCCHHHHHHHHH-----HHHHHCC---CCEEEE-EECCCHHHHH Q ss_conf 4007899----9863489889981976796788887799789999999-----9987659---868999-9439911589 Q 537021.9.peg.7 129 VRICLSW----LQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVAEKRLL-----TFKKLFG---DRLYVN-LQRHRGYDRY 195 (1033) Q Consensus 129 p~i~~~~----L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a~~~l~-----~l~~~Fg---d~~ylE-l~~~~~~e~~ 195 (1033) |....+. |.+| |-||+-++.++|-+..-. ++.++ |.+..|= ...|.. -..|++.|+- T Consensus 63 PvaT~~eneevLA~y--DAiI~Gt~TRyG~masQM--------r~fLDqtGGLW~~gaL~GKvGSvf~SsaTqHGGqE~t 132 (205) T TIGR01755 63 PVATPEENEEVLADY--DAIIFGTPTRYGNMASQM--------RNFLDQTGGLWASGALVGKVGSVFTSSATQHGGQEST 132 (205) T ss_pred CCCCCCCCHHHHCCC--CEEEECCCCCCCCCHHHH--------HHHHHCCCCCCCCCCCCCCEEEEEEECCCCCCHHHHH T ss_conf 626834340210057--637772887334504688--------8874105770201540212255764257474416789 Q ss_pred HHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHH Q ss_conf 99999999998189899808830078788999987 Q 537021.9.peg.7 196 RESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDV 230 (1033) Q Consensus 196 ~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~i 230 (1033) +..-...|.. +|-=+| ..-|..++....-.| T Consensus 133 i~~~~~tl~H-hGMi~V---gL~YA~~~q~~~d~v 163 (205) T TIGR01755 133 ILSFIVTLLH-HGMIIV---GLPYAEQEQMGVDEV 163 (205) T ss_pred HHHHHHHHHH-CCEEEE---CCCHHHHHHCCCCEE T ss_conf 9999999976-586774---666243532477766 No 357 >pfam00318 Ribosomal_S2 Ribosomal protein S2. Probab=32.36 E-value=24 Score=12.91 Aligned_cols=67 Identities=10% Similarity=0.189 Sum_probs=31.3 Q ss_pred CCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCC-CCEEECCCHHHHHHHHHHHHHHHHCCCCH Q ss_conf 3105676210014578753336568789999999830380-21554220778778899999997527998 Q 537021.9.peg.7 401 SLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGH-ERVAQIITFGSLQAKAALRDVGRALQMPY 469 (1033) Q Consensus 401 ~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~-~~va~i~t~~t~~~k~aird~~r~~g~~~ 469 (1033) +=-++.|+-..|....=||++.-...=.++.+++.+---. .++.-|+|- -+++.+|+..|+..|.++ T Consensus 16 np~m~~yIy~~r~gi~IidL~~T~~~L~~A~~~i~~i~~~~~~iLfVgTk--~~~~~~v~~~a~~~~~~y 83 (205) T pfam00318 16 NPKMKPYIYGERNGIHIIDLEKTLEKLRKAANFIKEIAAKGGKILFVGTK--KQAQEAVKKFAKRTGQFY 83 (205) T ss_pred CCCCHHHHHCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHCCCEEEEEECC--HHHHHHHHHHHHHHCCCC T ss_conf 99812010144599279729999999999999999999669828999777--899999999999839974 No 358 >KOG3718 consensus Probab=32.33 E-value=24 Score=12.91 Aligned_cols=19 Identities=42% Similarity=0.541 Sum_probs=12.2 Q ss_pred HHHHHHHHHHHCCCCCCCCCCC Q ss_conf 8889999985104764334444 Q 537021.9.peg.7 358 LIVADFIQWAKKNNIPVGPGRG 379 (1033) Q Consensus 358 LiV~Div~~ak~~gI~vGpGRG 379 (1033) |-+||++|..| |+|--.|| T Consensus 133 LsyWdl~Rkek---ipvtssrg 151 (609) T KOG3718 133 LSYWDLTRKEK---IPVTSSRG 151 (609) T ss_pred HHHHHHHHHCC---CCEECCCC T ss_conf 68999875045---65004689 No 359 >COG2089 SpsE Sialic acid synthase [Cell envelope biogenesis, outer membrane] Probab=32.28 E-value=25 Score=12.90 Aligned_cols=19 Identities=32% Similarity=0.279 Sum_probs=10.6 Q ss_pred CHHHHHHHHHHHHHHCCCC Q ss_conf 5177499999999976996 Q 537021.9.peg.7 50 NNLFSALEFSQKACAAGIQ 68 (1033) Q Consensus 50 ~nl~Gav~F~~ackk~gIK 68 (1033) |.|-=+.+..++|+++|-. T Consensus 27 G~le~A~~lIdaAk~aGAD 45 (347) T COG2089 27 GDLERAKELIDAAKEAGAD 45 (347) T ss_pred CCHHHHHHHHHHHHHCCCC T ss_conf 7589999999999973866 No 360 >cd01295 AdeC Adenine deaminase (AdeC) directly deaminates adenine to form hypoxanthine. This reaction is part of one of the adenine salvage pathways, as well as the degradation pathway. It is important for adenine utilization as a purine, as well as a nitrogen source in bacteria and archea. Probab=32.24 E-value=25 Score=12.90 Aligned_cols=60 Identities=10% Similarity=0.155 Sum_probs=37.8 Q ss_pred CCCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCH---HH---HHHHHHHHHHHCCCCEEEE Q ss_conf 98813343314447644329989999999978998899811651---77---4999999999769963899 Q 537021.9.peg.7 8 SPSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNN---LF---SALEFSQKACAAGIQPIIG 72 (1033) Q Consensus 8 ~~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~n---l~---Gav~F~~ackk~gIKPIiG 72 (1033) -|-|+-.|+|-+- +..+|.++.+.+...|..++- +|-.. +. |+-.+.+.+++..++-.++ T Consensus 8 ~PGfID~HvHies----s~~tP~~~a~~~l~~GtTTvv-~Dpheianv~G~~gi~~~l~~~~~~p~~~~~~ 73 (422) T cd01295 8 VPGFIDAHLHIES----SMLTPSEFAKAVLPHGTTTVI-ADPHEIANVAGVDGIEFMLEDAKKTPLDIFWM 73 (422) T ss_pred CCCEEECCCCCCC----CCCCHHHHHHHHHHCCCEEEE-CCCHHHHHHCCHHHHHHHHHHHHCCCCEEEEE T ss_conf 8487431278866----654999999999856827994-78789987378999999999861288618997 No 361 >PRK02261 methylaspartate mutase subunit S; Provisional Probab=32.21 E-value=25 Score=12.89 Aligned_cols=13 Identities=15% Similarity=0.248 Sum_probs=5.0 Q ss_pred CHHHHHHHHHHHH Q ss_conf 8389999988778 Q 537021.9.peg.7 333 HAQDYRQRLDFEL 345 (1033) Q Consensus 333 ~~~~y~~RLe~EL 345 (1033) ..++..+-|..+| T Consensus 122 ~~~~ii~~i~~~l 134 (137) T PRK02261 122 DLEEVIDDLKADL 134 (137) T ss_pred CHHHHHHHHHHHH T ss_conf 9999999999986 No 362 >cd00891 PI3Kc Phosphoinositide 3-kinase (PI3K), catalytic domain; The PI3K catalytic domain family is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. PI3Ks catalyze the transfer of the gamma-phosphoryl group from ATP to the 3-hydroxyl of the inositol ring of D-myo-phosphatidylinositol (PtdIns) or its derivatives. PI3Ks play an important role in a variety of fundamental cellular processes, including cell motility, the Ras pathway, vesicle trafficking and secretion, immune cell activation and apoptosis. They can be divided into three main classes (I, II, and III), defined by their substrate specificity, regulation, and domain structure. Class I PI3Ks are the only enzymes capable of converting PtdIns(4,5)P2 to the critical second messenger PtdIns(3,4,5)P3. Class I enzymes are heterodimers and exist in multiple isoforms c Probab=32.21 E-value=17 Score=14.25 Aligned_cols=44 Identities=25% Similarity=0.520 Sum_probs=23.5 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCC Q ss_conf 2135654178999999998874025434332344358899876201410122114 Q 537021.9.peg.7 570 KFDFLGLKTLTLLQKSLDFLAQRGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLE 624 (1033) Q Consensus 570 K~D~Lgl~~L~~i~~~~~~i~~~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~e 624 (1033) .-|.|-+-.+.+++... ++.|. |+.- .+|+.+..|...|+-++- T Consensus 102 RQD~l~lQli~lm~~i~---~~~~l--dl~l------~~Y~vl~t~~~~GlIE~V 145 (352) T cd00891 102 RQDMLTLQMIRLMDKIW---KKEGL--DLRM------TPYGCIATGDGVGMIEVV 145 (352) T ss_pred HHHHHHHHHHHHHHHHH---HHCCC--CEEE------EEEEEEEECCCCEEEEEC T ss_conf 59999999999999999---86799--7178------876899705985268986 No 363 >cd05167 PI4Kc_III_alpha Phosphoinositide 4-kinase (PI4K), Type III, alpha isoform, catalytic domain; The PI4K catalytic domain family is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. PI4Ks catalyze the transfer of the gamma-phosphoryl group from ATP to the 4-hydroxyl of the inositol ring of D-myo-phosphatidylinositol (PtdIns) to generate PtdIns(4)P, the major precursor in the synthesis of other phosphoinositides including PtdIns(4,5)P2, PtdIns(3,4)P2, and PtdIns(3,4,5)P3. Two isoforms of type III PI4K, alpha and beta, exist in most eukaryotes. PI4KIIIalpha is a 220 kDa protein found in the plasma membrane and the endoplasmic reticulum (ER). The role of PI4KIIIalpha in the ER remains unclear. In the plasma membrane, it provides PtdIns(4)P, which is then converted by PI5Ks to PtdIns(4,5)P2, an important signaling mole Probab=32.18 E-value=21 Score=13.46 Aligned_cols=15 Identities=20% Similarity=0.534 Sum_probs=5.9 Q ss_pred HHHHHCCCCCHHHHC Q ss_conf 987620141012211 Q 537021.9.peg.7 609 YHLLTTKGTLGIFQL 623 (1033) Q Consensus 609 ~~l~~~g~t~GvFQ~ 623 (1033) |+.++.|.+.|+-++ T Consensus 93 Y~vl~tg~~~G~IE~ 107 (311) T cd05167 93 YRVVATGPGCGVIEV 107 (311) T ss_pred EEEEECCCCCEEEEE T ss_conf 799980698436998 No 364 >KOG3120 consensus Probab=32.16 E-value=24 Score=12.98 Aligned_cols=14 Identities=14% Similarity=0.249 Sum_probs=6.2 Q ss_pred HHHHHHHHHHHHCC Q ss_conf 89999876688469 Q 537021.9.peg.7 224 DYEAHDVLMAVAHS 237 (1033) Q Consensus 224 D~~~h~iL~aI~~~ 237 (1033) -+..|+....|-+| T Consensus 121 a~~~~d~F~~IfTN 134 (256) T KOG3120 121 AAGIHDLFSEIFTN 134 (256) T ss_pred HCCHHHHHHHHHCC T ss_conf 72579999998369 No 365 >TIGR02479 FliA_WhiG RNA polymerase sigma factor, FliA/WhiG family; InterPro: IPR012845 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , . Most members of this family are the flagellar operon sigma factor FliA, controlling transcription of bacterial flagellar genes by RNA polymerase. An exception is the sigma factor WhiG in the genus Streptomyces, involved in the production of sporulating aerial mycelium.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0016987 sigma factor activity, 0006352 transcription initiation, 0045449 regulation of transcription. Probab=32.11 E-value=25 Score=12.88 Aligned_cols=21 Identities=24% Similarity=0.512 Sum_probs=11.4 Q ss_pred HHHHHHHHHCCCCHHHHHHHH Q ss_conf 999999975279988988654 Q 537021.9.peg.7 456 AALRDVGRALQMPYSQVDRLC 476 (1033) Q Consensus 456 ~aird~~r~~g~~~~~~~~~~ 476 (1033) --+|++|.|||+++.-|-+|- T Consensus 195 LnLKEIg~VL~lsESRvSQiH 215 (227) T TIGR02479 195 LNLKEIGEVLGLSESRVSQIH 215 (227) T ss_pred CCHHHHHHHHCCCHHHHHHHH T ss_conf 668899988378705799999 No 366 >KOG0023 consensus Probab=32.02 E-value=25 Score=12.87 Aligned_cols=48 Identities=21% Similarity=0.342 Sum_probs=38.1 Q ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCCCCH-HHHHHHHHCCCCCHHHHCC Q ss_conf 17899999999887402543433234435-8899876201410122114 Q 537021.9.peg.7 577 KTLTLLQKSLDFLAQRGVKVDLSLIPFDD-HETYHLLTTKGTLGIFQLE 624 (1033) Q Consensus 577 ~~L~~i~~~~~~i~~~~~~~d~~~ip~~D-~~~~~l~~~g~t~GvFQ~e 624 (1033) .+.--.+.+++++.+.++..+++-+|++| +.+|+.+.+|+..+=|-++ T Consensus 305 G~~ket~E~Ldf~a~~~ik~~IE~v~~~~v~~a~erm~kgdV~yRfVvD 353 (360) T KOG0023 305 GSRKETQEALDFVARGLIKSPIELVKLSEVNEAYERMEKGDVRYRFVVD 353 (360) T ss_pred CCHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHHHCCEEEEEEEE T ss_conf 5488799999999717886746997542778999998716701699997 No 367 >COG4464 CapC Capsular polysaccharide biosynthesis protein [Carbohydrate transport and metabolism / Cell envelope biogenesis, outer membrane] Probab=32.00 E-value=25 Score=12.87 Aligned_cols=188 Identities=12% Similarity=0.148 Sum_probs=98.1 Q ss_pred HHHHHHCHHHHHHHCCCHHH---HHHHHHHCCCCEEEEECCCHHHHHHH------------HHHHHH--HCCCCEEEEEE Q ss_conf 33433144476443299899---99999978998899811651774999------------999999--76996389999 Q 537021.9.peg.7 12 VHLQVHSSYSLLEGALSLNN---ILDKIAADQQPAIAITDTNNLFSALE------------FSQKAC--AAGIQPIIGCQ 74 (1033) Q Consensus 12 vhLh~hS~ySlldg~~~~e~---LV~~A~~~G~~AiAITD~~nl~Gav~------------F~~ack--k~gIKPIiG~E 74 (1033) +-.|||=-.-+=||..+.++ +.+.|.+.|...+.-|-|. +.|.++ .-...+ .-.|+-+.|-| T Consensus 2 IDIH~HIlp~iDDGp~s~eesl~ml~~A~~qGvt~iVaTsHh-~~g~y~n~~~~v~~~~~~ln~~~~~~aidl~v~pGQE 80 (254) T COG4464 2 IDIHSHILPDIDDGPKSLEESLAMLREAVRQGVTKIVATSHH-LHGRYENPIEKVKEKANQLNEILKKEAIDLKVLPGQE 80 (254) T ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCCC-CCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEECCCCE T ss_conf 450134667878898758999999999997576079513332-4776689199999999999988886267826616866 Q ss_pred EEEECCCCCCCCCCC-CCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCC Q ss_conf 998547664334456-5666677539999569689999999998997324877874007899986348988998197679 Q 537021.9.peg.7 75 LDIDMQDNLKMTNQG-KTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSG 153 (1033) Q Consensus 75 ~~v~~~~~~~~~~~~-~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g 153 (1033) +++...--.+..+.. ..-.++.|-++=+..| ..||-.-+.+.+....|++-+.+... T Consensus 81 IrIt~~vl~~l~~g~I~tindskYlLIEF~~~---------------------~v~~ya~~lf~elq~kGi~PIIAHPE- 138 (254) T COG4464 81 IRITGDVLDDLDKGIILTINDSKYLLIEFPMN---------------------HVPRYADQLFFELQSKGIIPIIAHPE- 138 (254) T ss_pred EEECHHHHHHHHCCCCCCCCCCCEEEEECCCC---------------------CCHHHHHHHHHHHHHCCCEEEEECHH- T ss_conf 89750888787628434033663499974688---------------------62245999999999779503664625- Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHHCCCCEEEEEECC---CHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHH Q ss_conf 678888779978999999999876598689999439---91158999999999998189899808830078788999987 Q 537021.9.peg.7 154 PIDRAFFFNSSQVAEKRLLTFKKLFGDRLYVNLQRH---RGYDRYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDV 230 (1033) Q Consensus 154 ~i~~~l~~~~~~~a~~~l~~l~~~Fgd~~ylEl~~~---~~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~i 230 (1033) ++....+-...+.++-..+.|..+... +.......+..+++-+. ++--+..+|.|-+...-+...++ T Consensus 139 ---------Rn~~i~kn~~~lyeLid~ga~sQvts~Sl~GlfGK~ikK~a~~~iE~-~L~hFiASDAHn~~~R~f~~~ea 208 (254) T COG4464 139 ---------RNRAIQKNPYLLYELIDKGAYSQVTSSSLAGLFGKKIKKFALQLIEA-NLVHFIASDAHNVDKRPFHMQEA 208 (254) T ss_pred ---------HHHHHHHCHHHHHHHHHCCCCEEECHHHHHHHHHHHHHHHHHHHHHC-CCCEEEECCCCCCCCCCCCHHHH T ss_conf ---------67888859699999985562214225767766658899999999974-63125403343357889338999 Q ss_pred HH Q ss_conf 66 Q 537021.9.peg.7 231 LM 232 (1033) Q Consensus 231 L~ 232 (1033) +. T Consensus 209 ~~ 210 (254) T COG4464 209 FH 210 (254) T ss_pred HH T ss_conf 99 No 368 >cd05013 SIS_RpiR RpiR-like protein. RpiR contains a SIS (Sugar ISomerase) domain, which is found in many phosphosugar isomerases and phosphosugar binding proteins. In E. coli, rpiR negatively regulates the expression of rpiB gene. Both rpiB and rpiA are ribose phosphate isomerases that catalyze the reversible reactions of ribose 5-phosphate into ribulose 5-phosphate. Probab=31.92 E-value=25 Score=12.86 Aligned_cols=41 Identities=7% Similarity=0.022 Sum_probs=25.3 Q ss_pred CCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCC Q ss_conf 5986899994399115899999999999818989980883007 Q 537021.9.peg.7 178 FGDRLYVNLQRHRGYDRYRESQVVQLAYTHELPLVATNNSLFL 220 (1033) Q Consensus 178 Fgd~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~ 220 (1033) -++++.+-+...+.. ....++++.|++.|+|+|+-.+...- T Consensus 59 ~~~d~~i~iS~sg~~--~~~~~~~~~ak~~g~~ii~IT~~~~s 99 (139) T cd05013 59 TPGDVVIAISFSGET--KETVEAAEIAKERGAKVIAITDSANS 99 (139) T ss_pred CCCCEEEEECCCCCC--HHHHHHHHHHHHCCCEEEEEECCCCC T ss_conf 999999997686363--78999999999869979999799999 No 369 >PRK07217 replication factor A; Reviewed Probab=31.84 E-value=25 Score=12.85 Aligned_cols=21 Identities=48% Similarity=0.616 Sum_probs=9.0 Q ss_pred CCCCCHHHHHHHHHHHHHCCCHHHH Q ss_conf 5530012568988788632684130 Q 537021.9.peg.7 686 QGVIIYQEQVMQIAQLLSGYSLSEA 710 (1033) Q Consensus 686 ~Gv~vyQEQvm~ia~~~ag~~~~ea 710 (1033) +-+++-+|.. -.+.|.++.+| T Consensus 229 ~~~i~~re~t----e~l~G~~le~a 249 (311) T PRK07217 229 QEVIINREAT----EELTGITLEEA 249 (311) T ss_pred EEEEECHHHH----HHHCCCCHHHH T ss_conf 7899633666----66508789999 No 370 >KOG3818 consensus Probab=31.68 E-value=25 Score=12.83 Aligned_cols=91 Identities=24% Similarity=0.306 Sum_probs=47.9 Q ss_pred HHHHCCCHHHHHHHCCHHHHHHH--HHHHHHHCCC---CCCCCCCCCHHHHHHHHHC---CC-----C----CHHHHCCC Q ss_conf 34311232135654178999999--9988740254---3433234435889987620---14-----1----01221145 Q 537021.9.peg.7 563 IEKAGLIKFDFLGLKTLTLLQKS--LDFLAQRGVK---VDLSLIPFDDHETYHLLTT---KG-----T----LGIFQLES 625 (1033) Q Consensus 563 ~e~~gl~K~D~Lgl~~L~~i~~~--~~~i~~~~~~---~d~~~ip~~D~~~~~l~~~---g~-----t----~GvFQ~es 625 (1033) .+.+| -+|+||...-+..+-. ++..++.+.+ +-+.++-+||++|++-+.. |. | .|=| .-| T Consensus 250 r~~~g--N~n~~Gg~~~~~~k~sA~L~~lE~~~~d~~fVfLSdV~LD~~~vm~aL~kifqgy~~~pP~~iIlcG~F-tS~ 326 (525) T KOG3818 250 RKELG--NLNWLGGDSKIAFKCSARLRSLEAENTDTSFVFLSDVFLDDKKVMEALRKIFQGYKDAPPTAIILCGSF-TSS 326 (525) T ss_pred HHHHC--CCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEHHCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCC-CCC T ss_conf 98734--575237860455678899999997474744999751103659999999999831568998389995465-566 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHH-----------CCCHHHHH Q ss_conf 347888877174048888766541-----------14113420 Q 537021.9.peg.7 626 SGMRQALEGMQPDCIEDIIALVSL-----------YRPGPIDN 657 (1033) Q Consensus 626 ~~~~~~l~~~~p~~~~dl~~~~al-----------~RPGp~~~ 657 (1033) |-...-..++ -+.|.-|++.+.- +=|||-|- T Consensus 327 p~~~~s~~~~-k~~f~~LA~~l~~~~~~~ekT~fIFVPGP~Dp 368 (525) T KOG3818 327 PRQTSSSDQL-KDGFRWLAAQLTCFRKDYEKTQFIFVPGPNDP 368 (525) T ss_pred CCCCCHHHHH-HHHHHHHHHHCCCCCCCCCCCEEEEECCCCCC T ss_conf 5565368999-98899998642323453101238994599887 No 371 >pfam08585 DUF1767 Domain of unknown function (DUF1767). Eukaryotic domain of unknown function. This domain is found to the N-terminus of the nucleic acid binding domain. Probab=31.58 E-value=25 Score=12.82 Aligned_cols=45 Identities=11% Similarity=0.306 Sum_probs=32.2 Q ss_pred HHHHHHHCCCCCCCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHH Q ss_conf 99888863685035523342100353198698502685896989999 Q 537021.9.peg.7 805 FCQDARQFNIQIMPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTAR 851 (1033) Q Consensus 805 ~i~e~~~~gi~vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~ 851 (1033) +..|.+..|-+++|.+|+. ...+++ .+.+.+=+..|..||..... T Consensus 42 L~sDl~~~~~~~LP~~i~~-~~~~~l-~Gp~vlQv~~i~dIs~p~~~ 86 (89) T pfam08585 42 LNSDLRDIGSGVLPDDIAE-VQKITL-EGPIVLQVNSIRDISQPLYN 86 (89) T ss_pred HHHHHHHCCCCCCCCHHHH-CCCCEE-CCCEEEEEHHHHHCCCCHHH T ss_conf 9857987188878950431-654064-78679884203314653020 No 372 >TIGR02257 cobalto_cobN cobaltochelatase, CobN subunit; InterPro: IPR011953 This family of proteins catalyzes the insertion of cobalt into the corrin ring of hydrogenobyrinic acid a,c-diamide. This aerobic branch of corrin ring synthesis is part of the adenosylcobalamin biosynthetic pathway .. Probab=31.58 E-value=25 Score=12.82 Aligned_cols=66 Identities=15% Similarity=0.183 Sum_probs=45.6 Q ss_pred HHHHCCCHHHHHHHHHHHHHH--CCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCC----CHHHHH Q ss_conf 888779978999999999876--5986899994399115899999999999818989980883007----878899 Q 537021.9.peg.7 157 RAFFFNSSQVAEKRLLTFKKL--FGDRLYVNLQRHRGYDRYRESQVVQLAYTHELPLVATNNSLFL----SEEDYE 226 (1033) Q Consensus 157 ~~l~~~~~~~a~~~l~~l~~~--Fgd~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~----~~~D~~ 226 (1033) ..++.|+..-+...++.|..- -+--+|+. ...|......+..+=++.++.+|.|.-.+=. ..+... T Consensus 223 ~~l~Ag~t~~~~~L~~~L~~~GL~P~p~fVS----SLKd~~~q~~~~~~~~~~~~~~v~~tT~FA~slnv~~~~a~ 294 (1310) T TIGR02257 223 SLLLAGDTALIEALIDALRQRGLRPRPVFVS----SLKDPAVQAGVLDLLKEEDVALVITTTGFASSLNVSNEQAD 294 (1310) T ss_pred HHHHHCCHHHHHHHHHHHHHCCCCCCCEEEE----CCCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHCCCCCCCC T ss_conf 8877403589999999998637986212010----00587899999999962896089832034553203577776 No 373 >COG1070 XylB Sugar (pentulose and hexulose) kinases [Carbohydrate transport and metabolism] Probab=31.27 E-value=25 Score=12.78 Aligned_cols=13 Identities=15% Similarity=0.680 Sum_probs=7.2 Q ss_pred HHHHHHHHHHHHH Q ss_conf 5578889999985 Q 537021.9.peg.7 355 GYFLIVADFIQWA 367 (1033) Q Consensus 355 ~YFLiV~Div~~a 367 (1033) .+||...|+++|. T Consensus 156 ~~il~~~dyl~~r 168 (502) T COG1070 156 AKILLIKDYLRYR 168 (502) T ss_pred HHEECHHHHHHHH T ss_conf 5512689999986 No 374 >PRK08434 consensus Probab=31.24 E-value=25 Score=12.77 Aligned_cols=19 Identities=16% Similarity=0.242 Sum_probs=7.0 Q ss_pred CHHHHHHHHHHHHCEEECC Q ss_conf 8889998578872621163 Q 537021.9.peg.7 947 NSSVRFENEQRVLGFYFSG 965 (1033) Q Consensus 947 ~~~e~l~~E~e~LGf~lS~ 965 (1033) +...+-..|+.++-..+-| T Consensus 793 ~~~~~~~~eR~A~N~piQG 811 (887) T PRK08434 793 TPMQKAMYEREAVNTIFQG 811 (887) T ss_pred CHHHHHHHHHHHHCCCCCH T ss_conf 9889989999983823366 No 375 >PRK13015 3-dehydroquinate dehydratase; Reviewed Probab=31.23 E-value=19 Score=13.91 Aligned_cols=23 Identities=26% Similarity=0.345 Sum_probs=11.7 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 98986063055788899999851 Q 537021.9.peg.7 346 NVIARMKFSGYFLIVADFIQWAK 368 (1033) Q Consensus 346 ~vI~~~gf~~YFLiV~Div~~ak 368 (1033) .+|..+|..+|.|-+.-+++.-| T Consensus 122 g~I~G~G~~gY~lAl~~l~~ll~ 144 (148) T PRK13015 122 GVICGLGTEGYRLALRRLATLFK 144 (148) T ss_pred EEEECCCHHHHHHHHHHHHHHHH T ss_conf 99971777889999999999997 No 376 >TIGR00314 cdhA CO dehydrogenase/acetyl-CoA synthase complex, epsilon subunit; InterPro: IPR004460 Acetyl-CoA decarbonylase/synthase (ACDS) is a multienzyme complex (carbon monoxide dehydrogenase is a synonym). The ACDS complex carries out an unusual reaction involving the reversible cleavage and synthesis of acetyl-CoA in methanogens. The family contains the 4Fe-4S ferredoxin domain [C-x(2)-C-x(2)-C-x(3)-C-[PEG]] between residues 448-462 of the model.; GO: 0018492 carbon-monoxide dehydrogenase (acceptor) activity, 0006084 acetyl-CoA metabolic process. Probab=31.16 E-value=25 Score=12.76 Aligned_cols=253 Identities=15% Similarity=0.111 Sum_probs=132.0 Q ss_pred HHHHHHHHHHHHHHC--CCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHHHHHHHHHHHHCC---CCE Q ss_conf 999999999899732--487787400789998634898899819767967888877997899999999987659---868 Q 537021.9.peg.7 108 GYQRLIELVSRMYLL--DQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVAEKRLLTFKKLFG---DRL 182 (1033) Q Consensus 108 Gy~nL~kL~S~a~~~--~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a~~~l~~l~~~Fg---d~~ 182 (1033) -...+|-|-|-...+ .+.++-..|++.- ..+=||+.+|.=|--. +..-|+..+..+.+.|| .+| T Consensus 65 P~cD~CcLCT~GkCdL~~~K~GACGid~~~-----q~aR~V~iAC~IG~sa------H~gHaRHLv~hlie~~Gkkl~d~ 133 (795) T TIGR00314 65 PICDMCCLCTMGKCDLSRNKRGACGIDLKA-----QQARIVLIACCIGASA------HTGHARHLVDHLIEELGKKLEDY 133 (795) T ss_pred CHHHHHHHCCCCCCCCCCCCCCCCCCCHHH-----CCCCEEHHHHHHHHHH------HHHHHHHHHHHHHHHHCCHHHCC T ss_conf 443364415798650467785533003653-----1353102366776766------55556778999998715300014 Q ss_pred EEEEECCCHHHHHH--------------HHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 99994399115899--------------9999999998189899808830078788999987668846985765310024 Q 537021.9.peg.7 183 YVNLQRHRGYDRYR--------------ESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKDRPRV 248 (1033) Q Consensus 183 ylEl~~~~~~e~~~--------------~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~~~~~ 248 (1033) =+.|-..=..+.-+ ..++++.| +...-++|.|.++|.= .+...| T Consensus 134 piDLG~~v~~eAPi~rtv~GikPkTLgDL~~~L~Ya-------------------Eeqi~~~l~a~HtGqE---g~~lDy 191 (795) T TIGR00314 134 PIDLGSEVDVEAPITRTVTGIKPKTLGDLREALDYA-------------------EEQIVHVLSAVHTGQE---GDYLDY 191 (795) T ss_pred CCCCCCCCCEECCHHHEECCCCCCCHHHHHHHHHHH-------------------HHHHHHHHHHEECCCC---CCCCHH T ss_conf 544688852446622202046778778999999998-------------------8888877641001676---653205 Q ss_pred -CCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCC Q ss_conf -5343234347887512203589987886411100013322102345557568877689889998765410342012464 Q 537021.9.peg.7 249 -TPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRLAQGA 327 (1033) Q Consensus 249 -~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~~~~ 327 (1033) +...|+ .+.++ .=...-.||.-|.|.+|....--|=..+ |+..-..++-.+ + -.++.. T Consensus 192 ESKAlH~-------gm~D~---l~~EvaDlAQIvAy~~PKg~~d~PLve~--GfG~~D~sKp~i--l-------viGHnv 250 (795) T TIGR00314 192 ESKALHA-------GMLDD---LGREVADLAQIVAYNLPKGEEDAPLVEL--GFGVIDKSKPVI--L-------VIGHNV 250 (795) T ss_pred HHHHHHH-------HHHHH---HHHHHHHHHHHHHCCCCCCCCCCCEEEC--CCCEEECCCCEE--E-------EECCCC T ss_conf 6799888-------67656---6678888999986377888887773320--672451678889--9-------966856 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHH Q ss_conf 32457838999998877898986063055788899999851047643344442046677655440375633323105676 Q 537021.9.peg.7 328 IAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERF 407 (1033) Q Consensus 328 ~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERF 407 (1033) . ....+.+-| .|-+.+.+...+|-==+..|+.||..+--| || ||-+-|-| T Consensus 251 ~----p~~~i~dYl-eEn~~eD~ve~~GiCCtA~D~TRy~~kaKv-vg---------------------pls~QL~~--- 300 (795) T TIGR00314 251 L----PGADIADYL-EENGMEDEVELCGICCTAIDMTRYNEKAKV-VG---------------------PLSRQLRV--- 300 (795) T ss_pred C----CHHHHHHHH-HHCCCCCCEEEECCCCCCCCHHHHCCCCCE-EC---------------------CCHHCEEE--- T ss_conf 4----512578887-516840304530300043101001047834-53---------------------20101044--- Q ss_pred HHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHH Q ss_conf 21001457875333656878999999983038021554220778 Q 537021.9.peg.7 408 LNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGS 451 (1033) Q Consensus 408 Lnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t 451 (1033) =|...||+=|==++==|-+|++.+.+ ...+.|+|-.. T Consensus 301 ---iRaG~aDV~v~DEQCiRaD~Le~~~K----~g~~~IaT~~k 337 (795) T TIGR00314 301 ---IRAGVADVIVVDEQCIRADILEEVKK----LGIPLIATNDK 337 (795) T ss_pred ---EECCCCCEEEECCCCHHHHHHHHHHH----CCCCEEECCHH T ss_conf ---35178737997473012789999986----58606734178 No 377 >cd05176 PI3Kc_C2_alpha Phosphoinositide 3-kinase (PI3K), class II, alpha isoform, catalytic domain; The PI3K catalytic domain family is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. PI3Ks catalyze the transfer of the gamma-phosphoryl group from ATP to the 3-hydroxyl of the inositol ring of D-myo-phosphatidylinositol (PtdIns) or its derivatives. PI3Ks play an important role in a variety of fundamental cellular processes, including cell motility, the Ras pathway, vesicle trafficking and secretion, immune cell activation and apoptosis. They can be divided into three main classes (I, II, and III), defined by their substrate specificity, regulation, and domain structure. Class II PI3Ks preferentially use PtdIns as a substrate to produce PtdIns(3)P, but can also phosphorylate PtdIns(4)P. They function as monomers and do n Probab=31.14 E-value=22 Score=13.27 Aligned_cols=87 Identities=24% Similarity=0.283 Sum_probs=45.7 Q ss_pred HHHCCCCCCCEEEECCCCH--HHHEECCC-----CC-CCCCEEEEEECHHHHHHCCCHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 2100244744033525666--66200123-----67-6886688510213343112321356541789999999988740 Q 537021.9.peg.7 521 LYRHASTHAAGIVIGDRPL--SQLVPMYR-----DV-RSDLPVTQFNMKWIEKAGLIKFDFLGLKTLTLLQKSLDFLAQR 592 (1033) Q Consensus 521 ~~r~~~~Ha~Gvvi~~~~l--~~~~p~~~-----~~-~~~~~~~q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~~ 592 (1033) +|-..++..+||++..-.+ ....|+.. ++ .+...++-=..|| +.=|.|.+-.+++++...+ +. T Consensus 51 lPl~P~~~~~~i~~~~c~vf~S~~~Pl~l~f~~~d~~g~~~~vIfK~GDD------LRQD~L~lQli~lmd~i~k---~~ 121 (353) T cd05176 51 LPLSPSLVAKELNIKVCSFFSSNAVPLKIALVNADPLGEEINVMFKVGED------LRQDMLALQMIKIMDKIWL---QE 121 (353) T ss_pred CCCCCHHEEEEEECCCCEEECCCCCCEEEEEEECCCCCCEEEEEEECCCC------CCHHHHHHHHHHHHHHHHH---HC T ss_conf 89996104335850353453225795689996238889826899977975------2088999999999999998---67 Q ss_pred CCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCC Q ss_conf 25434332344358899876201410122114 Q 537021.9.peg.7 593 GVKVDLSLIPFDDHETYHLLTTKGTLGIFQLE 624 (1033) Q Consensus 593 ~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~e 624 (1033) | +|+.-.| |+.++.|...|+-|+- T Consensus 122 g--LDL~l~p------Y~vlaTg~~~G~IE~V 145 (353) T cd05176 122 G--LDLRMVI------FKCLSTGKDRGMVELV 145 (353) T ss_pred C--CCEEEEE------EEEEEECCCCCCEEEC T ss_conf 9--9758887------6899815987653541 No 378 >cd07231 Pat_SDP1-like Sugar-Dependent 1 like lipase. Sugar-Dependent 1 (SDP1) lipase has a patatin-like acyl-hydrolase domain that initiates the breakdown of storage oil in germinating Arabidopsis seeds. This acyl-hydrolase domain is homologus to yeast triacylglycerol lipase 3 and human adipose triglyceride lipase. This family includes SDP1 from Arabidopsis thaliana. Probab=31.06 E-value=23 Score=13.09 Aligned_cols=29 Identities=14% Similarity=0.121 Sum_probs=18.8 Q ss_pred HHHHCCCCCCCCCCCCHHHHHHHHHHHCC Q ss_conf 98510476433444420466776554403 Q 537021.9.peg.7 365 QWAKKNNIPVGPGRGSGAGSVTAYALTIT 393 (1033) Q Consensus 365 ~~ak~~gI~vGpGRGSAagSLVaY~LgIT 393 (1033) +.=-++|.+=--==||+|||+||=+++.- T Consensus 88 kaL~e~~LLPrVIsGsS~GaivAal~~t~ 116 (323) T cd07231 88 RTLVEHQLLPRVIAGSSVGSIVCAIIATR 116 (323) T ss_pred HHHHHCCCCCCEECCCCHHHHHHHHHHCC T ss_conf 99997489875311675888999998608 No 379 >cd05165 PI3Kc_I Phosphoinositide 3-kinase (PI3K), class I, catalytic domain; The PI3K catalytic domain family is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. PI3Ks catalyze the transfer of the gamma-phosphoryl group from ATP to the 3-hydroxyl of the inositol ring of D-myo-phosphatidylinositol (PtdIns) or its derivatives. PI3Ks play an important role in a variety of fundamental cellular processes, including cell motility, the Ras pathway, vesicle trafficking and secretion, immune cell activation and apoptosis. They can be divided into three main classes (I, II, and III), defined by their substrate specificity, regulation, and domain structure. Class I PI3Ks are the only enzymes capable of converting PtdIns(4,5)P2 to the critical second messenger PtdIns(3,4,5)P3. In vitro, they can also phosphorylate the substrates P Probab=31.02 E-value=18 Score=13.95 Aligned_cols=87 Identities=20% Similarity=0.285 Sum_probs=44.0 Q ss_pred HHHHCCCCCCCEEEECCCCHHHHEECCCCCCCCCEEEEEECHHH-----HHHC-CC------HHHHHHHCCHHHHHHHHH Q ss_conf 22100244744033525666662001236768866885102133-----4311-23------213565417899999999 Q 537021.9.peg.7 520 GLYRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLPVTQFNMKWI-----EKAG-LI------KFDFLGLKTLTLLQKSLD 587 (1033) Q Consensus 520 g~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~~~~-----e~~g-l~------K~D~Lgl~~L~~i~~~~~ 587 (1033) .+|-+.++..+|+++..- .+..+.+...+ ..|.-.+- +..+ .. .=|.|.+-.+.+++...+ T Consensus 55 ~lPl~P~~~~~~i~~~~c------~v~~S~~~Pl~-l~f~~~d~~~~~~~~~~vIfK~GDDLRQD~L~lQli~lmd~i~k 127 (366) T cd05165 55 QSPLNPSLKLGELRIEKC------KVMDSKKKPLW-LVFENADPTALSNENVGIIFKNGDDLRQDMLTLQILRIMDSIWK 127 (366) T ss_pred CCCCCHHHEEEEEEECCC------EEECCCCCCEE-EEEECCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHH T ss_conf 999990453514861440------55121468637-99865774445799757999789870588999999999999999 Q ss_pred HHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCC Q ss_conf 8874025434332344358899876201410122114 Q 537021.9.peg.7 588 FLAQRGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLE 624 (1033) Q Consensus 588 ~i~~~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~e 624 (1033) +.| +|+.- .+|+.+..|...|+-|+- T Consensus 128 ---~~g--LdL~l------~pY~vlat~~~~GlIE~V 153 (366) T cd05165 128 ---EEG--LDLRM------LPYGCLSTGDKIGLIEVV 153 (366) T ss_pred ---HCC--CCCEE------EEEEEEEECCCCCCEEEC T ss_conf ---669--98325------777899845987742236 No 380 >PRK08835 consensus Probab=30.67 E-value=26 Score=12.70 Aligned_cols=25 Identities=16% Similarity=0.206 Sum_probs=10.6 Q ss_pred CCCCCCCHHHHHHHHHHHHCEEECC Q ss_conf 6899998889998578872621163 Q 537021.9.peg.7 941 EKFSVENSSVRFENEQRVLGFYFSG 965 (1033) Q Consensus 941 ~~~~~~~~~e~l~~E~e~LGf~lS~ 965 (1033) |++..-+...+-..|+..+-..+-| T Consensus 828 p~i~s~n~~~r~~~eR~a~N~piQG 852 (931) T PRK08835 828 PEITSRNGMRRKAAERAAINAPMQG 852 (931) T ss_pred CCCCCCCHHHHHHHHHHHHCCCCCH T ss_conf 8656888778868999995712445 No 381 >cd00892 PIKKc_ATR ATR (Ataxia telangiectasia and Rad3-related), catalytic domain; The ATR catalytic domain subfamily is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. ATR is also referred to as Mei-41 (Drosophila), Esr1/Mec1p (Saccharomyces cerevisiae), Rad3 (Schizosaccharomyces pombe), and FRAP-related protein (human). ATR is a member of the phosphoinositide 3-kinase-related protein kinase (PIKK) subfamily. PIKKs have intrinsic serine/threonine kinase activity and are distinguished from other PKs by their unique catalytic domain, similar to that of lipid PI3K, and their large molecular weight (240-470 kDa). ATR contains a UME domain of unknown function, a FAT (FRAP, ATM and TRRAP) domain, a catalytic domain, and a FATC domain at the C-terminus. Together with its downstream effector kinase, Chk1, ATR plays a central Probab=30.60 E-value=23 Score=13.10 Aligned_cols=27 Identities=26% Similarity=0.423 Sum_probs=19.2 Q ss_pred CCCCHHHHHHHHHHHCCCCCHHHCCCHHH Q ss_conf 44420466776554403756333231056 Q 537021.9.peg.7 377 GRGSGAGSVTAYALTITDIDPLRFSLLFE 405 (1033) Q Consensus 377 GRGSAagSLVaY~LgIT~VDPi~~~LlFE 405 (1033) .+--|+-|+|+|+|||.+=-|= |+|+. T Consensus 128 ~~S~A~~sv~~YilglGDRH~~--NIli~ 154 (237) T cd00892 128 TRSTAVMSMVGYILGLGDRHGE--NILFD 154 (237) T ss_pred HHHHHHHHHHHHHHCCCCCCCC--CEEEE T ss_conf 9999999999986034788876--58998 No 382 >PRK10428 putative stress-response protein; Provisional Probab=30.57 E-value=21 Score=13.51 Aligned_cols=28 Identities=25% Similarity=0.597 Sum_probs=21.2 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 5787533365687899999998303802 Q 537021.9.peg.7 414 SMPDFDIDFCQDRRDEVIRYVQNKYGHE 441 (1033) Q Consensus 414 ~~PDIDiDf~~~~R~~vi~y~~~kyG~~ 441 (1033) .+.|=|+|...-+|++.+.-|.+|||.. T Consensus 23 ~LTdDdl~~i~G~re~L~GkiQerYG~s 50 (69) T PRK10428 23 KLTDDDMTVIEGKRDQLVGKIQERYGYQ 50 (69) T ss_pred CCCCCHHHHHCCCHHHHHHHHHHHHCCC T ss_conf 2654288884586999987999997837 No 383 >TIGR02354 thiF_fam2 thiamine biosynthesis protein ThiF; InterPro: IPR012729 Members of the HesA/MoeB/ThiF family of proteins (IPR000594 from INTERPRO) include a number of members encoded in the midst of thiamine biosynthetic operons. This mix of known and putative ThiF proteins shows a deep split in phylogenetic trees. The Escherichia coli ThiF and MoeB proteins are seemingly more closely related than the E. coli ThiF and Campylobacter (for example) ThiF. This entry represents the divergent clade of putative ThiF proteins as found in Campylobacter.. Probab=30.56 E-value=8.7 Score=16.72 Aligned_cols=17 Identities=18% Similarity=0.100 Sum_probs=7.5 Q ss_pred EEECCCCCCHHHHHHHC Q ss_conf 99819767967888877 Q 537021.9.peg.7 145 IMLTGGSSGPIDRAFFF 161 (1033) Q Consensus 145 Ivlsg~~~g~i~~~l~~ 161 (1033) ||-.||++|.+.-.+.+ T Consensus 26 icGlGGLGS~~AinLAR 42 (200) T TIGR02354 26 ICGLGGLGSNVAINLAR 42 (200) T ss_pred EEECCCCHHHHHHHHHH T ss_conf 97016116899999987 No 384 >cd02189 delta_tubulin The tubulin superfamily includes five distinct families, the alpha-, beta-, gamma-, delta-, and epsilon-tubulins and a sixth family (zeta-tubulin) which is present only in kinetoplastid protozoa. The alpha- and beta-tubulins are the major components of microtubules, while gamma-tubulin plays a major role in the nucleation of microtubule assembly. The delta- and epsilon-tubulins are widespread but unlike the alpha, beta, and gamma-tubulins they are not ubiquitous among eukaryotes. Delta-tubulin plays an essential role in forming the triplet microtubules of centrioles and basal bodies. Probab=30.54 E-value=23 Score=13.21 Aligned_cols=49 Identities=22% Similarity=0.185 Sum_probs=33.7 Q ss_pred CHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCC Q ss_conf 838999998877898986063055788899999851047643344442046677655440375 Q 537021.9.peg.7 333 HAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDI 395 (1033) Q Consensus 333 ~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~V 395 (1033) ..+.+.+|+.+|++--. -..-|++++ . +|=|=||+-||+++-.|.-.=- T Consensus 108 ~~d~i~d~IRk~~E~CD---~lqGF~i~h---------S--lgGGTGSGlGs~lle~L~d~yp 156 (446) T cd02189 108 IKEDILDLIRKEVEKCD---SFEGFLVLH---------S--LAGGTGSGLGSRVTELLRDEYP 156 (446) T ss_pred HHHHHHHHHHHHHHCCC---CCCCEEEEE---------E--CCCCCCCHHHHHHHHHHHHHCC T ss_conf 89999999999997357---757667784---------0--5786754177788899887668 No 385 >PRK05797 consensus Probab=30.49 E-value=26 Score=12.68 Aligned_cols=26 Identities=19% Similarity=0.218 Sum_probs=13.8 Q ss_pred CCCCCCCCHHHHHHHHHHHHCEEECC Q ss_conf 66899998889998578872621163 Q 537021.9.peg.7 940 LEKFSVENSSVRFENEQRVLGFYFSG 965 (1033) Q Consensus 940 ~~~~~~~~~~e~l~~E~e~LGf~lS~ 965 (1033) +|++..-+...+-..|+.++-..+-| T Consensus 766 ~p~i~s~n~~~r~~~eR~AiN~piQG 791 (869) T PRK05797 766 IPEIKSSNKIVRAFGERLAMNTPIQG 791 (869) T ss_pred CCCCCCCCHHHHHHHHHHHHCCCCCH T ss_conf 88777889889977888782746556 No 386 >PRK07456 consensus Probab=30.49 E-value=26 Score=12.68 Aligned_cols=10 Identities=10% Similarity=-0.419 Sum_probs=3.9 Q ss_pred HHHHHHHHCC Q ss_conf 9998888636 Q 537021.9.peg.7 804 KFCQDARQFN 813 (1033) Q Consensus 804 ~~i~e~~~~g 813 (1033) ..+..|++.| T Consensus 833 ~~~~~a~~~g 842 (975) T PRK07456 833 LQERLALSKG 842 (975) T ss_pred HHHHHHHHCC T ss_conf 9999999769 No 387 >KOG1095 consensus Probab=30.29 E-value=26 Score=12.65 Aligned_cols=74 Identities=18% Similarity=0.230 Sum_probs=40.1 Q ss_pred CCHHHHHHHHHHCCC----------HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHH--HHCCCCCCCCH Q ss_conf 453478888771740----------488887665411411342003688850465785026966733--41033553001 Q 537021.9.peg.7 624 ESSGMRQALEGMQPD----------CIEDIIALVSLYRPGPIDNIVVYNNRKNGKEKIVSIHPLIDP--MLKETQGVIIY 691 (1033) Q Consensus 624 es~~~~~~l~~~~p~----------~~~dl~~~~al~RPGp~~~~~~yi~rk~g~e~~~~~hp~l~~--iL~~T~Gv~vy 691 (1033) |..-.+++|++.+|. .+-+++..+| .|+.|++++| ++|.+|.- +|-....+ T Consensus 778 ~gGDL~sflr~~r~~~~~~~~L~~~dLl~~a~dvA--------~G~~YLe~~~------fvHRDLAaRNCLL~~~r~--- 840 (1025) T KOG1095 778 EGGDLLSFLRESRPAPFQPSNLSMRDLLAFALDVA--------KGMNYLESKH------FVHRDLAARNCLLDERRV--- 840 (1025) T ss_pred CCCCHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHH--------HHHHHHHHCC------CCCCCHHHHHEEECCCCC--- T ss_conf 46728999874366667778788999999999886--------4457898679------767551044323215674--- Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCC Q ss_conf 25689887886326841307889762012110 Q 537021.9.peg.7 692 QEQVMQIAQLLSGYSLSEADVLRRAMGKKIKE 723 (1033) Q Consensus 692 QEQvm~ia~~~ag~~~~ead~lRra~~kk~~~ 723 (1033) +++|+|.+++ |..++-+-.|..+ T Consensus 841 --------VKIaDFGlAr-Diy~~~yyr~~~~ 863 (1025) T KOG1095 841 --------VKIADFGLAR-DIYDKDYYRKHGE 863 (1025) T ss_pred --------EEECCCCHHH-HHHHCHHEECCCC T ss_conf --------7874441137-6640100431674 No 388 >cd02922 FCB2_FMN Flavocytochrome b2 (FCB2) FMN-binding domain. FCB2 (AKA L-lactate:cytochrome c oxidoreductase) is a respiratory enzyme located in the intermembrane space of fungal mitochondria which catalyzes the oxidation of L-lactate to pyruvate. FCB2 also participates in a short electron-transport chain involving cytochrome c and cytochrome oxidase which ultimately directs the reducing equivalents gained from L-lactate oxidation to oxygen, yielding one molecule of ATP for every L-lactate molecule consumed. FCB2 is composed of 2 domains: a C-terminal flavin-binding domain, which includes the active site for lacate oxidation, and an N-terminal b2-cytochrome domain, required for efficient cytochrome c reduction. FCB2 is a homotetramer and contains two noncovalently bound cofactors, FMN and heme per subunit. Probab=30.28 E-value=26 Score=12.65 Aligned_cols=22 Identities=14% Similarity=0.126 Sum_probs=12.5 Q ss_pred HHHHHHHHHHCCCEEEECCCCC Q ss_conf 9999999981898998088300 Q 537021.9.peg.7 198 SQVVQLAYTHELPLVATNNSLF 219 (1033) Q Consensus 198 ~~l~~lA~~~~iPlVaTndv~Y 219 (1033) ..+.+-|.+.++|.+.+.-..+ T Consensus 84 ~a~AraA~~~gi~~~lSt~ss~ 105 (344) T cd02922 84 LNLARAAGKHGILQMISTNASC 105 (344) T ss_pred HHHHHHHHHCCCCEEEECCCCC T ss_conf 9999999974886574057778 No 389 >cd02190 epsilon_tubulin The tubulin superfamily includes five distinct families, the alpha-, beta-, gamma-, delta-, and epsilon-tubulins and a sixth family (zeta-tubulin) which is present only in kinetoplastid protozoa. The epsilon-tubulins which are widespread but not ubiquitous among eukaryotes play a role in basal body/centriole morphogenesis. Probab=30.26 E-value=24 Score=12.99 Aligned_cols=46 Identities=20% Similarity=0.255 Sum_probs=32.7 Q ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCC Q ss_conf 389999988778989860630557888999998510476433444420466776554403 Q 537021.9.peg.7 334 AQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTIT 393 (1033) Q Consensus 334 ~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT 393 (1033) .+++.+|+.+|++--. .-+.|++++ . +|=|=||+-||+++-.|.-. T Consensus 82 ~d~i~d~IrkevE~CD---~l~Gf~~~h---------S--l~GGTGSGlGs~l~e~L~d~ 127 (379) T cd02190 82 IDSILEKIRKAAEKCD---SLQSFFILH---------S--LGGGTGSGLGTYVLELLADE 127 (379) T ss_pred HHHHHHHHHHHHHCCC---CCCCEEEEE---------E--CCCCCCCCHHHHHHHHHHHH T ss_conf 9999999999997169---967645773---------0--48877651888878988976 No 390 >TIGR01859 fruc_bis_ald_ fructose-1,6-bisphosphate aldolase, class II; InterPro: IPR011289 Fructose-1,6-bisphosphate (FBP) aldolases reversibly cleave FBP to two triose phosphates, glycerone phophate and D-glyceraldehyde 3-phosphate. They play a key role in both glycolysis (FBP cleavage) and gluconeogenesis (FBP synthesis). These enzymes can be divide into two classes based on their mode of catalysis. Class I FBP aldolases form a Schiff-base intermediate between glycerone phophate and an active site lysine residue, while the class II enzymes use a divalent cation such as Zn(2+) for catalysis . This entry represents class II FBP aldolases found in Gram-positive bacteria, a variety of Gram-negative baceria, and amitochondriate protists. These enzymes are homotetramers where each momomer forms a TIM barrel fold .; GO: 0004332 fructose-bisphosphate aldolase activity, 0008270 zinc ion binding, 0006096 glycolysis, 0030388 fructose 16-bisphosphate metabolic process. Probab=30.23 E-value=25 Score=12.79 Aligned_cols=165 Identities=16% Similarity=0.161 Sum_probs=87.9 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEEEC-- Q ss_conf 998999999997899889981165177499999999976996389999998547664334456566667753999956-- Q 537021.9.peg.7 27 LSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLLVS-- 104 (1033) Q Consensus 27 ~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlLAk-- 104 (1033) .+-+++++.|++.|| ||+==..|||==.=--.+||++..=.-|+.+.= + .+=|+= T Consensus 2 V~~k~~L~~A~~~gY-AvGaFN~NNlE~~qAIl~AAee~~SPVI~q~Se-------------G---------A~KY~GkG 58 (339) T TIGR01859 2 VNGKEILQKAKKEGY-AVGAFNINNLEWTQAILQAAEEERSPVIVQVSE-------------G---------AIKYMGKG 58 (339) T ss_pred CCHHHHHHHHHHCCC-CCCCCCHHHHHHHHHHHHHHHHHCCCEEEEHHH-------------H---------HHHCCCCC T ss_conf 648899999864683-114201216899999999998736875531202-------------7---------87404853 Q ss_pred -CHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCC-CHHHHHHHHHHHHHHCCCCE Q ss_conf -9689999999998997324877874007899986348988998197679678888779-97899999999987659868 Q 537021.9.peg.7 105 -TAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFN-SSQVAEKRLLTFKKLFGDRL 182 (1033) Q Consensus 105 -N~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~-~~~~a~~~l~~l~~~Fgd~~ 182 (1033) -+.||+....|+-....+ ..-..|. .-.|.-| .++.+.+.+.- | T Consensus 59 d~~~G~~~~~~mV~~L~E~-~~~tVPV-------------------------aLHLDHG~s~E~c~~AI~a-----G--- 104 (339) T TIGR01859 59 DATLGYKVAVAMVKTLIES-MSITVPV-------------------------ALHLDHGSSYESCIKAIKA-----G--- 104 (339) T ss_pred HHHHHHHHHHHHHHHHHHH-CCCCEEE-------------------------EEECCCCCCHHHHHHHHHC-----C--- T ss_conf 1778899999999999986-7993206-------------------------8646887648999999964-----9--- Q ss_pred EEEEEC---CCHHHHH--HHHHHHHHHHHHC--CC--------------EEEEC-CCCCCCHHHHHHHHH-----HHHHH Q ss_conf 999943---9911589--9999999999818--98--------------99808-830078788999987-----66884 Q 537021.9.peg.7 183 YVNLQR---HRGYDRY--RESQVVQLAYTHE--LP--------------LVATN-NSLFLSEEDYEAHDV-----LMAVA 235 (1033) Q Consensus 183 ylEl~~---~~~~e~~--~~~~l~~lA~~~~--iP--------------lVaTn-dv~Y~~~~D~~~h~i-----L~aI~ 235 (1033) |-.+|= |.+.|+. ..++++++|...| |- ++... +++|.+|+|..-.=- .+|+. T Consensus 105 fsSVMIDgShlP~EEN~~~Tkkvve~Ah~~GDYvSVEaElG~~gG~ED~i~V~~~~~~ladp~Ea~~fV~eT~~D~LA~a 184 (339) T TIGR01859 105 FSSVMIDGSHLPFEENVALTKKVVEIAHAKGDYVSVEAELGTIGGIEDGIVVVEKEGELADPDEAEQFVKETGVDYLAAA 184 (339) T ss_pred CCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEEEECEECCEECCEEEEECCCEECCHHHHHHHHHHCCCCEECCC T ss_conf 81672024268788888888999999985199799999843035276273787156367587999999976387754010 Q ss_pred CCCCCCCCCCCCCCCC Q ss_conf 6985765310024534 Q 537021.9.peg.7 236 HSTVVSQKDRPRVTPD 251 (1033) Q Consensus 236 ~~~~l~~~~~~~~~~~ 251 (1033) -|+... ++.|..+ T Consensus 185 IGt~HG---a~kykge 197 (339) T TIGR01859 185 IGTSHG---AFKYKGE 197 (339) T ss_pred CCCCCC---CCCCCCC T ss_conf 035023---5788866 No 391 >TIGR01307 pgm_bpd_ind 2,3-bisphosphoglycerate-independent phosphoglycerate mutase; InterPro: IPR005995 This 2,3-bisphosphoglycerate-independent phosphoglycerate mutase (iPGAM) is a metalloenzyme found particularly in eubacteria and higher plants. It is distantly related to archaeal iPGAM (IPR004456 from INTERPRO) and distinct from the unrelated cofactor-dependent PGAM (PIRSF001490 from PIRSF). Activity has been demonstrated for proteins from a variety of organisms, including Pseudomonas syringae pv. tomato , Bacillus subtilis , Bacillus stearothermophilus , maize , castor bean , and Trypanosoma brucei . The structure of the B. stearothermophilus enzyme (PDB:1EJJ) has two domains . Residues 1-76 and 311-511 form the phosphatase domain, containing the active site residue and two metal-binding sites. This domain is similar to alkaline phosphatase (PDB:1ALK) and arylsulphatase (PDB:1AUK), which are members of the SCOP alkaline phosphatase-like superfamily, but there is meagre sequence similarity outside of the metal-binding segments. Residues 77-310 form the phosphotransferase domain, which is poorly conserved (or perhaps unrelated) in the archaeal enzymes.; GO: 0004619 phosphoglycerate mutase activity, 0006007 glucose catabolic process. Probab=30.23 E-value=26 Score=12.65 Aligned_cols=17 Identities=18% Similarity=0.196 Sum_probs=6.9 Q ss_pred HHHHHHHHHHHHCCCEE Q ss_conf 99999999998189899 Q 537021.9.peg.7 196 RESQVVQLAYTHELPLV 212 (1033) Q Consensus 196 ~~~~l~~lA~~~~iPlV 212 (1033) +.-.|+++|++.+++-| T Consensus 133 Hl~AL~~~A~~~G~~k~ 149 (529) T TIGR01307 133 HLIALIELAAERGIEKV 149 (529) T ss_pred HHHHHHHHHHHCCCEEE T ss_conf 99999999966893179 No 392 >TIGR01496 DHPS dihydropteroate synthase; InterPro: IPR006390 This domain is present in sequences representing dihydropteroate synthase, the enzyme that catalyzes the second to last step in folic acid biosynthesis. Dihydropteroate synthase (2.5.1.15 from EC) (DHPS) catalyses the condensation of 6-hydroxymethyl-7,8-dihydropteridine pyrophosphate to para-aminobenzoic acid to form 7,8-dihydropteroate. This is the second step in the three-step pathway leading from 6-hydroxymethyl-7,8-dihydropterin to 7,8-dihydrofolate. DHPS is the target of sulphonamides, which are substrate analogues that compete with para-aminobenzoic acid. Bacterial DHPS (gene sul or folP) is a protein of about 275 to 315 amino acid residues that is either chromosomally encoded or found on various antibiotic resistance plasmids. In the lower eukaryote Pneumocystis carinii, DHPS is the C-terminal domain of a multifunctional folate synthesis enzyme (gene fas) .; GO: 0004156 dihydropteroate synthase activity, 0009396 folic acid and derivative biosynthetic process. Probab=30.19 E-value=26 Score=12.64 Aligned_cols=14 Identities=29% Similarity=0.468 Sum_probs=6.0 Q ss_pred HHHHHHHHCCCEEE Q ss_conf 99999981898998 Q 537021.9.peg.7 200 VVQLAYTHELPLVA 213 (1033) Q Consensus 200 l~~lA~~~~iPlVa 213 (1033) +.+++.++++|+|. T Consensus 117 m~~vaae~~~~~vl 130 (268) T TIGR01496 117 MLEVAAEYGVPLVL 130 (268) T ss_pred HHHHHHHHCCCEEE T ss_conf 89999984898898 No 393 >PRK10053 hypothetical protein; Provisional Probab=30.13 E-value=26 Score=12.63 Aligned_cols=47 Identities=13% Similarity=0.214 Sum_probs=36.8 Q ss_pred HCCCCCHHHHHCCCCCCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEE Q ss_conf 7489768898505589889999999611210158887699999984970499995 Q 537021.9.peg.7 977 KMGIKGYEESVATIRGNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANF 1031 (1033) Q Consensus 977 ~~~~~~~~~~~~~~~g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~i 1031 (1033) +-+..+......++++++|.+-|.|++. ..++.| .|.|.||.+++-| T Consensus 46 ~a~~~TV~qAk~m~Dda~V~L~GnIv~~-----lg~d~Y---~FrD~TG~I~VeI 92 (130) T PRK10053 46 DARKMTVEQAKTMHDGATVSLRGNLIDH-----KGDDRY---VFRDKSGEINVII 92 (130) T ss_pred CCCEEHHHHHHHCCCCCEEEEEEEEEEE-----CCCCEE---EEECCCCCEEEEE T ss_conf 6633469988646699849998888861-----488348---9977997599996 No 394 >pfam12116 SpoIIID Stage III sporulation protein D. This stage III sporulation protein is a small DNA-binding family that is essential for gene expression of the mother-cell compartment during sporulation. The domain is found in bacteria and viruses, and is about 40 amino acids in length. It has a conserved RGG sequence motif. Probab=30.07 E-value=17 Score=14.15 Aligned_cols=32 Identities=28% Similarity=0.494 Sum_probs=23.5 Q ss_pred CCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 215542207787788999999975279988988 Q 537021.9.peg.7 441 ERVAQIITFGSLQAKAALRDVGRALQMPYSQVD 473 (1033) Q Consensus 441 ~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~ 473 (1033) ++|..|+.| -+..++-||.+|++||++...+- T Consensus 6 eR~~~i~~Y-Ii~~~aTVR~tAk~FGvSKSTVH 37 (82) T pfam12116 6 ERVLEIANY-IIENKATVRQAAKVFGVSKSTVH 37 (82) T ss_pred HHHHHHHHH-HHHCCHHHHHHHHHHCCCHHHHH T ss_conf 999999999-99626289999999696098898 No 395 >COG0757 AroQ 3-dehydroquinate dehydratase II [Amino acid transport and metabolism] Probab=30.05 E-value=25 Score=12.83 Aligned_cols=22 Identities=23% Similarity=0.458 Sum_probs=10.6 Q ss_pred HHHHHHCCHHHHHHHHHHHHHH Q ss_conf 9898606305578889999985 Q 537021.9.peg.7 346 NVIARMKFSGYFLIVADFIQWA 367 (1033) Q Consensus 346 ~vI~~~gf~~YFLiV~Div~~a 367 (1033) .+|+.+|..+|-|-+.-+++.. T Consensus 121 GvI~GlG~~GY~lAl~~l~~~~ 142 (146) T COG0757 121 GVICGLGAQGYLLALRALVNIL 142 (146) T ss_pred EEEECCCHHHHHHHHHHHHHHH T ss_conf 3684376888999999999986 No 396 >PRK06687 chlorohydrolase; Validated Probab=30.00 E-value=27 Score=12.62 Aligned_cols=60 Identities=20% Similarity=0.135 Sum_probs=33.2 Q ss_pred CCCHHHHHHCHHHHHHHCCC---HHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEE Q ss_conf 88133433144476443299---899999999789988998116517749999999997699638 Q 537021.9.peg.7 9 PSFVHLQVHSSYSLLEGALS---LNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPI 70 (1033) Q Consensus 9 ~~FvhLh~hS~ySlldg~~~---~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPI 70 (1033) |-||..|+|..+|.++|... ..++++.... .+. -.+|......++.-....+.+.|+--+ T Consensus 58 PGlVnaH~Hl~~~~~rg~~~~~~l~~wl~~~~~-~~~-~~~~~~~~~~~~~~~~~e~l~~G~Tt~ 120 (422) T PRK06687 58 PGLVNTHTHVVMSLLRGIGDDMLLQPWLETRIW-PLE-SQFTPELAVASTELGLLEMVKSGTTTF 120 (422) T ss_pred ECCCCHHHCHHHHHHHHCCCCCCHHHHHHHHHH-HHH-CCCCHHHHHHHHHHHHHHHHHCCCEEE T ss_conf 485077558755625414589988999999776-655-138999999999999999997597498 No 397 >cd02072 Glm_B12_BD B12 binding domain of glutamate mutase (Glm). Glutamate mutase catalysis the conversion of (S)-glutamate with (2S,3S)-3-methylaspartate. The rearrangement reaction is initiated by the extraction of a hydrogen from the protein-bound substrate by a 5'-desoxyadenosyl radical, which is generated by the homolytic cleavage of the organometallic bond of the cofactor B12. Glm is a heterotetrameric molecule consisting of two alpha and two epsilon polypeptide chains. Probab=30.00 E-value=27 Score=12.62 Aligned_cols=41 Identities=15% Similarity=0.231 Sum_probs=17.8 Q ss_pred CCHHHHHHHHHHCCCCEEEEECCCHHHH---HHHHHHHHHHCCCC Q ss_conf 9989999999978998899811651774---99999999976996 Q 537021.9.peg.7 27 LSLNNILDKIAADQQPAIAITDTNNLFS---ALEFSQKACAAGIQ 68 (1033) Q Consensus 27 ~~~e~LV~~A~~~G~~AiAITD~~nl~G---av~F~~ackk~gIK 68 (1033) .+|+++++.|.+.+-.+++++-.++ .+ ..++.+.++++|++ T Consensus 37 ~~~e~~~~~a~e~~ad~i~vSsl~g-~~~~~~~~l~~~L~e~G~~ 80 (128) T cd02072 37 SPQEEFIDAAIETDADAILVSSLYG-HGEIDCKGLREKCDEAGLK 80 (128) T ss_pred CCHHHHHHHHHHCCCCEEEECCCCC-CCCHHHHHHHHHHHHCCCC T ss_conf 9999999999873999999823202-5624899999999967999 No 398 >COG1087 GalE UDP-glucose 4-epimerase [Cell envelope biogenesis, outer membrane] Probab=29.93 E-value=27 Score=12.61 Aligned_cols=39 Identities=5% Similarity=0.135 Sum_probs=23.6 Q ss_pred HHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCC Q ss_conf 9999998303802155422077877889999999752799 Q 537021.9.peg.7 429 EVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMP 468 (1033) Q Consensus 429 ~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~ 468 (1033) ..++|+.+ =|..++...+|=.-+.-+..|+-+.||-|.+ T Consensus 236 ~Al~~L~~-~g~~~~~NLG~G~G~SV~evi~a~~~vtg~~ 274 (329) T COG1087 236 LALKYLKE-GGSNNIFNLGSGNGFSVLEVIEAAKKVTGRD 274 (329) T ss_pred HHHHHHHH-CCCEEEEECCCCCCEEHHHHHHHHHHHHCCC T ss_conf 99999981-8960489746897524999999999986995 No 399 >pfam03352 Adenine_glyco Methyladenine glycosylase. The DNA-3-methyladenine glycosylase I is constitutively expressed and is specific for the alkylated 3-methyladenine DNA. Probab=29.91 E-value=27 Score=12.61 Aligned_cols=63 Identities=17% Similarity=0.148 Sum_probs=32.8 Q ss_pred CHHHHHHHCCCCCCCCHHHHHHHHHH----HHHCCCHHHH----HHHHHHHCCCCCCHHHHHHHHHHHHHH Q ss_conf 96673341033553001256898878----8632684130----788976201211016778789999986 Q 537021.9.peg.7 675 HPLIDPMLKETQGVIIYQEQVMQIAQ----LLSGYSLSEA----DVLRRAMGKKIKEEMDKQKERFISGAS 737 (1033) Q Consensus 675 hp~l~~iL~~T~Gv~vyQEQvm~ia~----~~ag~~~~ea----d~lRra~~kk~~~~~~~~~~~f~~g~~ 737 (1033) +|.+...=.+-.|++++-+|-+-=.. --||+|+.-. +.+|+|+.-=+++.+..+.++=++..+ T Consensus 5 ~~~~~~YHD~EWG~P~~dd~~LFE~L~LE~~QaGLSW~tIL~KR~~fr~aF~~Fd~~~VA~~~e~~ie~Ll 75 (179) T pfam03352 5 DPLYVAYHDEEWGVPVHDDRKLFELLCLEGFQAGLSWITILKKREAFREAFAGFDPEKVAAFTEADVERLL 75 (179) T ss_pred CHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCHHHHHCCCHHHHHHHH T ss_conf 85799887426898488879999999998973647899999879999999828999999648999999985 No 400 >KOG0556 consensus Probab=29.82 E-value=27 Score=12.59 Aligned_cols=27 Identities=26% Similarity=0.539 Sum_probs=11.7 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHCCCC Q ss_conf 9999862499999999999999975000133 Q 537021.9.peg.7 732 FISGASKNGISKTIAVNIFELLAKFADYGFN 762 (1033) Q Consensus 732 f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~Fn 762 (1033) .++.|.+.|+++. .|-.+|.+| .||-+ T Consensus 470 L~era~~hGid~~---~i~~YidsF-ryG~P 496 (533) T KOG0556 470 LVERAKEHGIDPS---KISTYIDSF-RYGAP 496 (533) T ss_pred HHHHHHHCCCCHH---HHHHHHHHH-CCCCC T ss_conf 9999997398878---999888874-05899 No 401 >cd06364 PBP1_CaSR Ligand-binding domain of the CaSR calcium-sensing receptor, which is a member of the family C receptors within the G-protein coupled receptor superfamily. Ligand-binding domain of the CaSR calcium-sensing receptor, which is a member of the family C receptors within the G-protein coupled receptor superfamily. CaSR provides feedback control of extracellular calcium homeostasis by responding sensitively to acute fluctuations in extracellular ionized Ca2+ concentration. This ligand-binding domain has homology to the bacterial leucine-isoleucine-valine binding protein (LIVBP) and a leucine binding protein (LBP). CaSR is widely expressed in mammalian tissues and is active in tissues that are not directly involved in extracellular calcium homeostasis. Moreover, CaSR responds to aromatic, aliphatic, and polar amino acids, but not to positively charged or branched chain amino acids, which suggests that changes in plasma amino acid levels are likely to modulate whole body calci Probab=29.63 E-value=27 Score=12.57 Aligned_cols=80 Identities=18% Similarity=0.255 Sum_probs=54.1 Q ss_pred CCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCC----H--HHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEE Q ss_conf 76433444420466776554403756333231----0--56762100145787533365687899999998303802155 Q 537021.9.peg.7 371 NIPVGPGRGSGAGSVTAYALTITDIDPLRFSL----L--FERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVA 444 (1033) Q Consensus 371 gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~L----l--FERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va 444 (1033) ++ +| +.+|+....||-+|+.-.|==|-|+= | =+||=+.-|+.+|| ..+-.++--+..+||...|+ T Consensus 121 av-iG-~~~S~~S~~vA~lL~l~~iPqISy~sts~~LSdk~~fp~F~RtvPsD-------~~qa~amv~li~~F~W~wV~ 191 (510) T cd06364 121 AV-VG-ATGSGVSTAVANLLGLFYIPQVSYASSSRLLSNKNQFKSFLRTIPND-------EHQATAMADIIEYFRWNWVG 191 (510) T ss_pred EE-EC-CCCCCHHHHHHHHHCCCCCCEECCCCCCCCCCCCCCCCCCEECCCCC-------HHHHHHHHHHHHHCCCEEEE T ss_conf 99-88-99764899999997066784440366871117811066410117887-------89999999999977965999 Q ss_pred ECCCHHHHHHHHHHHH Q ss_conf 4220778778899999 Q 537021.9.peg.7 445 QIITFGSLQAKAALRD 460 (1033) Q Consensus 445 ~i~t~~t~~~k~aird 460 (1033) -|++=..++ +..+++ T Consensus 192 ~V~sdd~yG-~~~~~~ 206 (510) T cd06364 192 TIAADDDYG-RPGIEK 206 (510) T ss_pred EEECCCHHH-HHHHHH T ss_conf 885078378-999999 No 402 >pfam02310 B12-binding B12 binding domain. This domain binds to B12 (adenosylcobamide), it is found in several enzymes, such as glutamate mutase, methionine synthase and methylmalonyl-CoA mutase. Probab=29.59 E-value=27 Score=12.57 Aligned_cols=18 Identities=17% Similarity=0.440 Sum_probs=5.7 Q ss_pred HHHHHHHHHHCCCCEEEE Q ss_conf 899999999789988998 Q 537021.9.peg.7 29 LNNILDKIAADQQPAIAI 46 (1033) Q Consensus 29 ~e~LV~~A~~~G~~AiAI 46 (1033) ++++++.+++..-..+|+ T Consensus 40 ~~~i~~~i~~~~pdiVgi 57 (121) T pfam02310 40 PEDIVEAIRAENPDVVGL 57 (121) T ss_pred HHHHHHHHHHCCCCEEEE T ss_conf 999999999809999999 No 403 >PRK10702 endonuclease III; Provisional Probab=29.58 E-value=27 Score=12.56 Aligned_cols=17 Identities=35% Similarity=0.388 Sum_probs=7.7 Q ss_pred CCCCCCHHHHHHHHHHHC Q ss_conf 344442046677655440 Q 537021.9.peg.7 375 GPGRGSGAGSVTAYALTI 392 (1033) Q Consensus 375 GpGRGSAagSLVaY~LgI 392 (1033) |-||=. |+++.+++.|. T Consensus 116 GIG~kT-A~aIl~~a~~~ 132 (211) T PRK10702 116 GVGRKT-ANVVLNTAFGW 132 (211) T ss_pred CCCHHH-HHHHHHHHCCC T ss_conf 635889-99999998499 No 404 >cd02809 alpha_hydroxyacid_oxid_FMN Family of homologous FMN-dependent alpha-hydroxyacid oxidizing enzymes. This family occurs in both prokaryotes and eukaryotes. Members of this family include flavocytochrome b2 (FCB2), glycolate oxidase (GOX), lactate monooxygenase (LMO), mandelate dehydrogenase (MDH), and long chain hydroxyacid oxidase (LCHAO). In green plants, glycolate oxidase is one of the key enzymes in photorespiration where it oxidizes glycolate to glyoxylate. LMO catalyzes the oxidation of L-lactate to acetate and carbon dioxide. MDH oxidizes (S)-mandelate to phenylglyoxalate. It is an enzyme in the mandelate pathway that occurs in several strains of Pseudomonas which converts (R)-mandelate to benzoate. Probab=29.57 E-value=27 Score=12.56 Aligned_cols=13 Identities=15% Similarity=0.243 Sum_probs=5.8 Q ss_pred HHHHCCHHHHHHH Q ss_conf 9860630557888 Q 537021.9.peg.7 348 IARMKFSGYFLIV 360 (1033) Q Consensus 348 I~~~gf~~YFLiV 360 (1033) .++.||..-+|+| T Consensus 138 A~~aG~~al~lTv 150 (299) T cd02809 138 AEAAGYKALVLTV 150 (299) T ss_pred HHHCCCCEEEEEC T ss_conf 9985999899970 No 405 >PRK11230 glycolate oxidase subunit GlcD; Provisional Probab=29.47 E-value=27 Score=12.55 Aligned_cols=20 Identities=25% Similarity=0.493 Sum_probs=16.1 Q ss_pred HHHHHHHHHCCCCCCCCCCCC Q ss_conf 899999851047643344442 Q 537021.9.peg.7 360 VADFIQWAKKNNIPVGPGRGS 380 (1033) Q Consensus 360 V~Div~~ak~~gI~vGpGRGS 380 (1033) |..+|+||.+++|+|=| ||. T Consensus 68 V~~iv~~a~~~~ipvvp-rGg 87 (499) T PRK11230 68 VQALLAVCHRLRVPVVA-RGA 87 (499) T ss_pred HHHHHHHHHHCCCCEEE-ECC T ss_conf 99999999987990999-789 No 406 >cd01916 ACS_1 Acetyl-CoA synthase (ACS), also known as acetyl-CoA decarbonylase, is found in acetogenic and methanogenic organisms and is responsible for the synthesis and breakdown of acetyl-CoA. ACS forms a heterotetramer with carbon monoxide dehydrogenase (CODH) consisting of two ACS and two CODH subunits. CODH reduces carbon dioxide to carbon monoxide and ACS then synthesizes acetyl-CoA from carbon monoxide, CoA, and a methyl group donated by another protein (CoFeSP). ACS has three structural domains, an N-terminal rossman fold domain with a helical region at its N-terminus which interacts with CODH, and two alpha + beta fold domains. A Ni-Fe-S center referred to as the A-cluster is located in the C-terminal domain. A large cavity exists between the three domains which may bind CoA. Probab=29.46 E-value=27 Score=12.55 Aligned_cols=76 Identities=22% Similarity=0.278 Sum_probs=41.3 Q ss_pred HHHHHHHHHHHHHCCCC----CCCCCCCCHHHHHHHHHHHCCCCC-------HHHCCCHHHHHHHHCCCCCCCCCCCCCH Q ss_conf 57888999998510476----433444420466776554403756-------3332310567621001457875333656 Q 537021.9.peg.7 356 YFLIVADFIQWAKKNNI----PVGPGRGSGAGSVTAYALTITDID-------PLRFSLLFERFLNPDRMSMPDFDIDFCQ 424 (1033) Q Consensus 356 YFLiV~Div~~ak~~gI----~vGpGRGSAagSLVaY~LgIT~VD-------Pi~~~LlFERFLnp~R~~~PDIDiDf~~ 424 (1033) .++-+++++.|-+++|. -|| | +-|...+||..+ |+-+-|-| -|..+||+=+==++ T Consensus 209 Nv~p~~~i~dY~ee~~L~d~vEv~-G-------iCCTAiD~tRy~~~aKvvgpls~qlk~------iRsGi~DViVvDEQ 274 (731) T cd01916 209 NVAPGAEIMDYLEENGLEDKVEVG-G-------ICCTAIDLTRYNEKAKVVGPLSRQLKV------VRSGIADVVVVDEQ 274 (731) T ss_pred CCCHHHHHHHHHHHCCCCHHEEEE-E-------EEEEEEECCCCCCCCEEECHHHHHHHH------HHCCCCCEEEECCC T ss_conf 753178999999975971222574-1-------323011013314676464167999999------87169957996453 Q ss_pred HHHHHHHHHHHHHCCCCCEEECCCH Q ss_conf 8789999999830380215542207 Q 537021.9.peg.7 425 DRRDEVIRYVQNKYGHERVAQIITF 449 (1033) Q Consensus 425 ~~R~~vi~y~~~kyG~~~va~i~t~ 449 (1033) =-|-+|++--.+ -....|+|- T Consensus 275 Cir~Dileea~k----~~~pvIAt~ 295 (731) T cd01916 275 CIRADILEEAQK----LGIPVIATN 295 (731) T ss_pred CCCHHHHHHHHH----CCCCEEEEC T ss_conf 221218999986----399489826 No 407 >pfam03918 CcmH Cytochrome C biogenesis protein. Members of this family include NrfF, CcmH, CycL, Ccl2. Probab=29.43 E-value=27 Score=12.55 Aligned_cols=16 Identities=19% Similarity=0.351 Sum_probs=8.2 Q ss_pred HHHHHHHHHHHHCCCE Q ss_conf 9999999999818989 Q 537021.9.peg.7 196 RESQVVQLAYTHELPL 211 (1033) Q Consensus 196 ~~~~l~~lA~~~~iPl 211 (1033) ...+..+++++..=|+ T Consensus 27 ~e~r~~~l~~~lRCpv 42 (145) T pfam03918 27 QEARARELSKELRCPV 42 (145) T ss_pred HHHHHHHHHHHCCCCC T ss_conf 9999999998278888 No 408 >PRK10144 formate-dependent nitrite reductase complex subunit NrfF; Provisional Probab=29.35 E-value=27 Score=12.53 Aligned_cols=13 Identities=23% Similarity=0.345 Sum_probs=5.2 Q ss_pred HHHHHHHHHHCCC Q ss_conf 9999999981898 Q 537021.9.peg.7 198 SQVVQLAYTHELP 210 (1033) Q Consensus 198 ~~l~~lA~~~~iP 210 (1033) .+..+|+++..=| T Consensus 32 ~r~~~L~~elRCp 44 (126) T PRK10144 32 QQALNIASQLRCP 44 (126) T ss_pred HHHHHHHHHCCCC T ss_conf 9999999827887 No 409 >KOG2781 consensus Probab=29.24 E-value=27 Score=12.52 Aligned_cols=56 Identities=18% Similarity=0.221 Sum_probs=37.7 Q ss_pred HHHHHHHHHHHHHHHH--HHCCHHHHHHHHHHHHHHHCCCC----CCCCCCCCHHHHHHHHH Q ss_conf 3899999887789898--60630557888999998510476----43344442046677655 Q 537021.9.peg.7 334 AQDYRQRLDFELNVIA--RMKFSGYFLIVADFIQWAKKNNI----PVGPGRGSGAGSVTAYA 389 (1033) Q Consensus 334 ~~~y~~RLe~EL~vI~--~~gf~~YFLiV~Div~~ak~~gI----~vGpGRGSAagSLVaY~ 389 (1033) +-.=..+..+||+.|- ...+.-==+.|..+|..||++|+ .|.--||--+|-+||++ T Consensus 93 PSsrL~~FaKelkLvfPNaqr~nRG~~~~~~lv~a~ra~~~Td~iivHEhRG~PdgL~vshl 154 (290) T KOG2781 93 PSSRLKMFAKELKLVFPNAQRLNRGNYVVGELVDAARANGVTDLIIVHEHRGIPDGLVVSHL 154 (290) T ss_pred CHHHHHHHHHHHEEECCCHHHHCCCCEEHHHHHHHHHHCCCCEEEEEECCCCCCCCEEEEEC T ss_conf 50789999976407446725413664347999999987798149999526898770278705 No 410 >cd04487 RecJ_OBF2_like RecJ_OBF2_like: A subfamily of OB folds corresponding to the second OB fold (OBF2) of archaeal-specific proteins with similarity to eubacterial RecJ. RecJ is an ssDNA-specific exonuclease. Although the overall sequence similarity of these proteins to eubacterial RecJ proteins is marginal, they appear to carry motifs, which have been shown to be essential for nuclease function in Escherichia coli RecJ. In addition to this OB fold, most proteins in this subfamily contain: i) an N-terminal OB fold belonging to a different domain family (the ribosomal S1-like RNA-binding family); and ii) a domain, C-terminal to OBF2, characteristic of DHH family proteins. DHH family proteins include E. coli RecJ, and are predicted to have a phosphoesterase function. Probab=29.19 E-value=27 Score=12.51 Aligned_cols=33 Identities=21% Similarity=0.262 Sum_probs=22.3 Q ss_pred EEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEEC Q ss_conf 99999996112101588876999999849704999953 Q 537021.9.peg.7 995 IRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 995 v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iF 1032 (1033) |++-|.|++++.. .|- .++++.|++|...+-.| T Consensus 1 V~i~GeV~qikQT---~GP--TIFti~Detg~v~~AAF 33 (73) T cd04487 1 VHIEGEVVQIKQT---SGP--TIFTLRDETGTVWAAAF 33 (73) T ss_pred CEEEEEEEEEEEC---CCC--EEEEEECCCCCEEHHHH T ss_conf 9799999998988---999--99999759996982051 No 411 >pfam03248 Rer1 Rer1 family. RER1 family protein are involved in involved in the retrieval of some endoplasmic reticulum membrane proteins from the early golgi compartment. The C terminus of yeast Rer1p interacts with a coatomer complex. Probab=29.13 E-value=15 Score=14.78 Aligned_cols=23 Identities=43% Similarity=0.605 Sum_probs=15.7 Q ss_pred HHHHHHHCCCCCHHHCCCHHHHHHHHCCC Q ss_conf 77655440375633323105676210014 Q 537021.9.peg.7 385 VTAYALTITDIDPLRFSLLFERFLNPDRM 413 (1033) Q Consensus 385 LVaY~LgIT~VDPi~~~LlFERFLnp~R~ 413 (1033) +|||.|||--+| +|=.||+|-.. T Consensus 51 iV~Y~LgIylLn------lfi~FLtPk~d 73 (175) T pfam03248 51 VVSYALGIYLLN------LFLAFLTPKFD 73 (175) T ss_pred EEHHHHHHHHHH------HHHHHHCCCCC T ss_conf 203989999999------99998396778 No 412 >COG3066 MutH DNA mismatch repair protein [DNA replication, recombination, and repair] Probab=29.13 E-value=27 Score=12.51 Aligned_cols=16 Identities=50% Similarity=0.646 Sum_probs=5.3 Q ss_pred HHHHHHHHHHHCCCHH Q ss_conf 5689887886326841 Q 537021.9.peg.7 693 EQVMQIAQLLSGYSLS 708 (1033) Q Consensus 693 EQvm~ia~~~ag~~~~ 708 (1033) ||.++-||.++||+++ T Consensus 15 ~~Ll~~Aq~isG~t~g 30 (229) T COG3066 15 EQLLAQAQQLSGLTLG 30 (229) T ss_pred HHHHHHHHHHHCCCHH T ss_conf 9999999987076099 No 413 >TIGR03147 cyt_nit_nrfF cytochrome c nitrite reductase, accessory protein NrfF. Probab=29.13 E-value=27 Score=12.51 Aligned_cols=14 Identities=29% Similarity=0.408 Sum_probs=5.6 Q ss_pred HHHHHHHHHHHCCC Q ss_conf 99999999981898 Q 537021.9.peg.7 197 ESQVVQLAYTHELP 210 (1033) Q Consensus 197 ~~~l~~lA~~~~iP 210 (1033) ..+..+|+++..=| T Consensus 31 e~r~~~L~~~lRCp 44 (126) T TIGR03147 31 RTRAVALAKSLRCP 44 (126) T ss_pred HHHHHHHHHHCCCC T ss_conf 99999999816887 No 414 >cd06569 GH20_Sm-chitobiase-like The chitobiase of Serratia marcescens is a beta-N-1,4-acetylhexosaminidase with a glycosyl hydrolase family 20 (GH20) domain that hydrolyzes the beta-1,4-glycosidic linkages in oligomers derived from chitin. Chitin is degraded by a two step process: i) a chitinase hydrolyzes the chitin to oligosaccharides and disaccharides such as di-N-acetyl-D-glucosamine and chitobiose, ii) chitobiase then further degrades these oligomers into monomers. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself. Probab=28.98 E-value=27 Score=12.49 Aligned_cols=23 Identities=30% Similarity=0.556 Sum_probs=18.5 Q ss_pred HHHHHHHHHCCCCCC-----CCCCCCHH Q ss_conf 899999851047643-----34444204 Q 537021.9.peg.7 360 VADFIQWAKKNNIPV-----GPGRGSGA 382 (1033) Q Consensus 360 V~Div~~ak~~gI~v-----GpGRGSAa 382 (1033) +.|||+||+++||.| -||--+|| T Consensus 99 i~eiV~YA~~RgI~VIPEID~PGH~~aa 126 (445) T cd06569 99 YIEILKYAKARHIEVIPEIDMPGHARAA 126 (445) T ss_pred HHHHHHHHHHCCCEEEECCCCCCCHHHH T ss_conf 9999999997599886506786216899 No 415 >cd05171 PIKKc_ATM Ataxia telangiectasia mutated (ATM), catalytic domain; The ATM catalytic domain subfamily is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. ATM is a member of the phosphoinositide 3-kinase-related protein kinase (PIKK) subfamily. PIKKs have intrinsic serine/threonine kinase activity and are distinguished from other PKs by their unique catalytic domain, similar to that of lipid PI3K, and their large molecular weight (240-470 kDa). ATM contains a FAT (FRAP, ATM and TRRAP) domain, a catalytic domain, and a FATC domain at the C-terminus. ATM is critical in the response to DNA double strand breaks (DSBs) caused by radiation. It is activated at the site of a DSB and phosphorylates key substrates that trigger pathways that regulate DNA repair and cell cycle checkpoints at the G1/S, S phase, and G2/M transi Probab=28.96 E-value=5.2 Score=18.63 Aligned_cols=43 Identities=21% Similarity=0.357 Sum_probs=27.3 Q ss_pred CCCCHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCC Q ss_conf 44420466776554403756333231056762100145787533365 Q 537021.9.peg.7 377 GRGSGAGSVTAYALTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFC 423 (1033) Q Consensus 377 GRGSAagSLVaY~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~ 423 (1033) .+=.|+-|+|.|+|||.+=-|- |+|+.+ +.+++---|.+.-|. T Consensus 170 t~S~A~~s~vgYILGLGDRH~~--NIli~~--~tG~vvhIDF~~~f~ 212 (279) T cd05171 170 TRSVATSSIVGYILGLGDRHAN--NILIDE--KTAEVVHIDLGIAFE 212 (279) T ss_pred HHHHHHHHHHHHHHCCCCCCCC--CEEEEC--CCCCEEEECCHHHHC T ss_conf 9999999999984215788865--278875--875199972477633 No 416 >PRK07997 consensus Probab=28.88 E-value=28 Score=12.47 Aligned_cols=29 Identities=17% Similarity=0.224 Sum_probs=17.5 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCEEECCCC Q ss_conf 66689999888999857887262116385 Q 537021.9.peg.7 939 TLEKFSVENSSVRFENEQRVLGFYFSGHP 967 (1033) Q Consensus 939 ~~~~~~~~~~~e~l~~E~e~LGf~lS~HP 967 (1033) .+|++..-+...+-..|+..+-..+-|.- T Consensus 824 ~~~~i~s~n~~~r~~~eR~a~N~piQgta 852 (928) T PRK07997 824 YLPDIKSSNAARRAAAERAAINAPMQGTA 852 (928) T ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCCHHH T ss_conf 58876789988996899999581133789 No 417 >pfam08549 SWI-SNF_Ssr4 Fungal domain of unknown function (DUF1750). This is a fungal domain of unknown function. Probab=28.79 E-value=28 Score=12.46 Aligned_cols=19 Identities=5% Similarity=0.079 Sum_probs=8.5 Q ss_pred HHHHHHHHHHHHHHCCCCC Q ss_conf 6878999999983038021 Q 537021.9.peg.7 424 QDRRDEVIRYVQNKYGHER 442 (1033) Q Consensus 424 ~~~R~~vi~y~~~kyG~~~ 442 (1033) ..|=+.=++|..|-|++=+ T Consensus 290 ~~RYqqhHEWMEEIfsSPY 308 (669) T pfam08549 290 KMRYQQHHEWMEEILASPY 308 (669) T ss_pred HHHHHHHHHHHHHHHCCCC T ss_conf 9988887899999862875 No 418 >COG1190 LysU Lysyl-tRNA synthetase (class II) [Translation, ribosomal structure and biogenesis] Probab=28.76 E-value=28 Score=12.46 Aligned_cols=12 Identities=33% Similarity=0.404 Sum_probs=4.7 Q ss_pred HHHHHHHHHHHH Q ss_conf 899987654103 Q 537021.9.peg.7 308 ENELRNKAVAGL 319 (1033) Q Consensus 308 ~~~Lr~l~~~Gl 319 (1033) +-||++++-.|+ T Consensus 237 ELyLKRliVGG~ 248 (502) T COG1190 237 ELYLKRLIVGGF 248 (502) T ss_pred HHHHHHHHHCCC T ss_conf 899998875372 No 419 >KOG3719 consensus Probab=28.75 E-value=28 Score=12.46 Aligned_cols=25 Identities=24% Similarity=0.518 Sum_probs=18.0 Q ss_pred HHHHHHH--HHCCCHHHHHHHHHHHCC Q ss_conf 4788887--717404888876654114 Q 537021.9.peg.7 627 GMRQALE--GMQPDCIEDIIALVSLYR 651 (1033) Q Consensus 627 ~~~~~l~--~~~p~~~~dl~~~~al~R 651 (1033) +-+..++ ++.|+++-.++.++|-|+ T Consensus 431 ~~r~~~K~~klsPDs~vQlafQ~A~~a 457 (638) T KOG3719 431 GGRSAIKKLKLSPDSVVQLAFQMAYYA 457 (638) T ss_pred CCHHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 786788763889079999999999999 No 420 >TIGR01787 squalene_cyclas squalene/oxidosqualene cyclases; InterPro: IPR002365 Several enzymes catalyze mechanistically related reactions which involve the highly complex cyclic rearrangement of squalene or its 2,3 oxide. Lanosterol synthase (5.4.99.7 from EC) (oxidosqualene--lanosterol cyclase) catalyzes the cyclization of (S)-2,3-epoxysqualene to lanosterol, the initial precursor of cholesterol, steroid hormones and vitamin D in vertebrates and of ergosterol in fungi (gene ERG7). Cycloartenol synthase (5.4.99.8 from EC) (2,3-epoxysqualene--cycloartenol cyclase), is a plant enzyme that catalyzes the cyclization of (S)-2,3-epoxysqualene to cycloartenol, and hopene synthase (5.4.99 from EC) (squalene--hopene cyclase), is a bacterial enzyme that catalyzes the cyclization of squalene into hopene, a key step in hopanoid (triterpenoid) metabolism. These enzymes are evolutionary related proteins of about 70 to 85 kD.; GO: 0016829 lyase activity, 0008152 metabolic process. Probab=28.50 E-value=28 Score=12.42 Aligned_cols=54 Identities=22% Similarity=0.287 Sum_probs=26.6 Q ss_pred CEEEEEECHHH----HHHCCCH-HHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHH-HHHHHHC Q ss_conf 66885102133----4311232-1356541789999999988740254343323443588-9987620 Q 537021.9.peg.7 553 LPVTQFNMKWI----EKAGLIK-FDFLGLKTLTLLQKSLDFLAQRGVKVDLSLIPFDDHE-TYHLLTT 614 (1033) Q Consensus 553 ~~~~q~d~~~~----e~~gl~K-~D~Lgl~~L~~i~~~~~~i~~~~~~~d~~~ip~~D~~-~~~l~~~ 614 (1033) ++=|-|.-..+ =+-|+.. .| .-...|.++.+.++++-+ .+=+.+|-+ +|+-..+ T Consensus 356 vWDT~f~~~ALqeva~~sG~~~r~~----~~~p~l~Ka~efL~~~Qi----~~~~~gd~~sv~R~~~k 415 (716) T TIGR01787 356 VWDTAFAVQALQEVALDSGLDLRLD----EFHPALVKALEFLLKKQI----KDDAYGDWKSVYRHNLK 415 (716) T ss_pred CHHHHHHHHHHHHHHHHHHHHHCCC----CCHHHHHHHHHHHHHCCC----CCCCCCCHHHHHCCCCC T ss_conf 0477899999999999976752670----211678778998986378----88878630453136788 No 421 >cd05174 PI3Kc_IA_delta Phosphoinositide 3-kinase (PI3K), class IA, delta isoform, catalytic domain; The PI3K catalytic domain family is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. PI3Ks catalyze the transfer of the gamma-phosphoryl group from ATP to the 3-hydroxyl of the inositol ring of D-myo-phosphatidylinositol (PtdIns) or its derivatives. PI3Ks can be divided into three main classes (I, II, and III), defined by their substrate specificity, regulation, and domain structure. Class I PI3Ks are the only enzymes capable of converting PtdIns(4,5)P2 to the critical second messenger PtdIns(3,4,5)P3. Class I enzymes are heterodimers and exist in multiple isoforms consisting of one catalytic subunit (out of four isoforms) and one of several regulatory subunits. They are further classified into class IA (alpha, beta and Probab=28.49 E-value=27 Score=12.61 Aligned_cols=87 Identities=18% Similarity=0.285 Sum_probs=48.8 Q ss_pred HHHCCCCCCCEEEECCCCH--HHHEECCC------CCCCCCEEEEEECHHHHHHCCCHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 2100244744033525666--66200123------676886688510213343112321356541789999999988740 Q 537021.9.peg.7 521 LYRHASTHAAGIVIGDRPL--SQLVPMYR------DVRSDLPVTQFNMKWIEKAGLIKFDFLGLKTLTLLQKSLDFLAQR 592 (1033) Q Consensus 521 ~~r~~~~Ha~Gvvi~~~~l--~~~~p~~~------~~~~~~~~~q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~~ 592 (1033) +|-+.++..+||++..-.+ ....|+.. ...+...++-=..|| +.=|.|.+-.+++++...+. . T Consensus 55 lPl~P~~~~~~i~~~~c~v~~S~~~PL~l~f~~~d~~g~~~~vIfK~GDD------LRQD~L~lQli~lmd~i~k~---~ 125 (361) T cd05174 55 SPLSPSIILCEVCVDQCTFMDSKMKPLWIMYKNEEAGGGSVGIIFKNGDD------LRQDMLTLQMIQLMDVLWKQ---E 125 (361) T ss_pred CCCCCCEEEEEEECCCCEEECCCCCCEEEEEECCCCCCCCEEEEEECCCC------CCHHHHHHHHHHHHHHHHHH---C T ss_conf 99996307812871444773335795689996258889966799978976------30999999999999999997---7 Q ss_pred CCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCC Q ss_conf 25434332344358899876201410122114 Q 537021.9.peg.7 593 GVKVDLSLIPFDDHETYHLLTTKGTLGIFQLE 624 (1033) Q Consensus 593 ~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~e 624 (1033) | +|+.-.| |+.++.|...|+-++- T Consensus 126 g--LDL~l~p------Y~vlaTg~~~G~IE~V 149 (361) T cd05174 126 G--LDLRMTP------YGCLSTGDKTGLIEVV 149 (361) T ss_pred C--CCEEEEE------EEEEEECCCCEEEEEE T ss_conf 9--9837898------6899954996049994 No 422 >pfam01257 Complex1_24kDa Respiratory-chain NADH dehydrogenase 24 Kd subunit. Probab=28.48 E-value=28 Score=12.42 Aligned_cols=39 Identities=18% Similarity=0.387 Sum_probs=27.8 Q ss_pred HHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCC Q ss_conf 999998303802155422077877889999999752799889886543101 Q 537021.9.peg.7 430 VIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIP 480 (1033) Q Consensus 430 vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~ 480 (1033) +++.++++||. =+..|++.+|+.+|+|..++..++..-+ T Consensus 18 ~L~~iQ~~~Gy------------ip~~~~~~iA~~l~v~~~~V~~v~tFY~ 56 (145) T pfam01257 18 LLHLAQEQYGY------------LPDEAIEYIAELLGIPPARVYEVATFYS 56 (145) T ss_pred HHHHHHHHCCC------------CCHHHHHHHHHHHCCCHHHHHHHHHHCC T ss_conf 99999998499------------6999999999998889999999998642 No 423 >PRK05070 DNA mismatch repair protein; Provisional Probab=28.41 E-value=28 Score=12.41 Aligned_cols=13 Identities=15% Similarity=0.015 Sum_probs=5.1 Q ss_pred CCHHHHHHHHHHH Q ss_conf 6517749999999 Q 537021.9.peg.7 49 TNNLFSALEFSQK 61 (1033) Q Consensus 49 ~~nl~Gav~F~~a 61 (1033) .|.+.-.++++-- T Consensus 40 KGwvGqliE~~LG 52 (218) T PRK05070 40 KGWVGQLLELALG 52 (218) T ss_pred CCHHHHHHHHHHC T ss_conf 5549999999848 No 424 >pfam07340 Herpes_IE1 Cytomegalovirus IE1 protein. Expression from a human cytomegalovirus early promoter (E1.7) has been shown to be activated in trans by the IE2 gene product. Although the IE1 gene product alone had no effect on this early viral promoter, maximal early promoter activity was detected when both IE1 and IE2 gene products were present. The IE1 protein from cytomegalovirus is also known as UL123. Probab=28.35 E-value=24 Score=12.94 Aligned_cols=167 Identities=14% Similarity=0.176 Sum_probs=79.8 Q ss_pred CHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC----HHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHH------- Q ss_conf 878899998766884698576531002453432343----478875122035899878864111000133221------- Q 537021.9.peg.7 221 SEEDYEAHDVLMAVAHSTVVSQKDRPRVTPDHYLKN----RSEMVSIFSDLPEALENTVEIARRCSFILQVHA------- 289 (1033) Q Consensus 221 ~~~D~~~h~iL~aI~~~~~l~~~~~~~~~~~~yLks----~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~------- 289 (1033) +|.+..++.+..-|+...-. .|. ...+++|.- .++|...|......+.|+..|.++..--|...+ T Consensus 77 npg~~ilq~lV~QIKlRV~r---~r~-~~k~~~l~q~tqir~~f~~~F~~~~~~lqna~diLdKv~epF~dmKcI~~t~~ 152 (392) T pfam07340 77 NPEKDILAELVKQIKVRVDI---VRH-KIKEHMLKKYTQMDEKFTAAFNMMGGCLQNALDILDKVHEPFEDMKCIGLTMQ 152 (392) T ss_pred CHHHHHHHHHHHHHHHHHHH---HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCCCHHHHHHH T ss_conf 75779999999999988899---888-99999999999999999999999999988899998741541304542989999 Q ss_pred CCCCCCCCCCCCCCCCCHHHH---HHHHHH--HHHHHCCCCCCCCCCCCHHHHHHHHHHHH-HHHHHHCCHHHHHHHHHH Q ss_conf 023455575688776898899---987654--10342012464324578389999988778-989860630557888999 Q 537021.9.peg.7 290 PILPRFIEKECDDIQKEEENE---LRNKAV--AGLEVRLAQGAIAKGYHAQDYRQRLDFEL-NVIARMKFSGYFLIVADF 363 (1033) Q Consensus 290 ~~lP~f~~p~~~~~~~~~~~~---Lr~l~~--~Gl~~R~~~~~~~~~~~~~~y~~RLe~EL-~vI~~~gf~~YFLiV~Di 363 (1033) ..+|.|++|++. .+.+ +++++. .|+..+++. ....|++--= ++=.++-|.-|=-++ T Consensus 153 ~m~~~y~vP~~~-----~~k~~~cvK~l~d~avn~a~kle~----------al~~Kv~~KK~~l~~~~~Y~~~ky~~--- 214 (392) T pfam07340 153 NMYENYIVPEDK-----RDLWLACIKDLHDVAKGAANKLGG----------ALKAKAQAKKDELNRKMMYIALKHIE--- 214 (392) T ss_pred HHCCCCCCCCCH-----HHHHHHHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHHHHHHHHHH--- T ss_conf 866476589206-----889999999999999999999999----------99999998799999999999999999--- Q ss_pred HHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHH---CCCHHHHHHHHCCC Q ss_conf 998510476433444420466776554403756333---23105676210014 Q 537021.9.peg.7 364 IQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLR---FSLLFERFLNPDRM 413 (1033) Q Consensus 364 v~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~---~~LlFERFLnp~R~ 413 (1033) ++-+ ++ .+|----+-.-..+++=|..+-||.. |++--=.-|..||. T Consensus 215 --~~~~-~~-~~PKt~ng~~~Ai~fl~~lpq~dpe~~~~~g~~i~k~lD~E~~ 263 (392) T pfam07340 215 --FFTK-NS-AFPKTTNGCSAAIAALQNFHQCSPDEIKAHAQKIFKILDEERD 263 (392) T ss_pred --HHHC-CC-CCCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCHH T ss_conf --9970-56-7887656488999998458879868899989999873230246 No 425 >PRK13913 3-methyladenine DNA glycosylase; Provisional Probab=28.27 E-value=28 Score=12.39 Aligned_cols=11 Identities=18% Similarity=0.434 Sum_probs=6.3 Q ss_pred CHHHHHHHCCH Q ss_conf 34788751220 Q 537021.9.peg.7 256 NRSEMVSIFSD 266 (1033) Q Consensus 256 s~eEM~~~f~~ 266 (1033) +++|++++|.+ T Consensus 166 ~Ydelq~~fe~ 176 (218) T PRK13913 166 DYDELQHFFEK 176 (218) T ss_pred CHHHHHHHHHH T ss_conf 99999999996 No 426 >TIGR00855 L12 ribosomal protein L7/L12; InterPro: IPR000206 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . This family of large subunit ribosomal proteins is called the L7/L12 family. L7/L12 is present in each 50S subunit in four copies organised as two dimers. The L8 protein complex consisting of two dimers of L7/L12 and L10 in Escherichia coli ribosomes is assembled on the conserved region of 23 S rRNA termed the GTPase-associated domain . The L7/L12 dimer probably interacts with EF-Tu. L7 and L12 only differ in a single post translational modification of the addition an acetyl group to the N terminus of L7.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome. Probab=28.19 E-value=23 Score=13.08 Aligned_cols=45 Identities=29% Similarity=0.398 Sum_probs=34.6 Q ss_pred HHHHHHHH-HHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHH---HHHCCCCHHHHHHHHHHHHHHH Q ss_conf 68988788-6326841307889762012110167787899999---8624999999999999999750 Q 537021.9.peg.7 694 QVMQIAQL-LSGYSLSEADVLRRAMGKKIKEEMDKQKERFISG---ASKNGISKTIAVNIFELLAKFA 757 (1033) Q Consensus 694 Qvm~ia~~-~ag~~~~ead~lRra~~kk~~~~~~~~~~~f~~g---~~~~g~~~~~a~~i~~~i~~f~ 757 (1033) +|+++.+. +.|+.|.||=. .+++ .++.|++++.|++|=..++.-| T Consensus 77 ~ViKvVR~~ITGLGLkEAK~-------------------LVe~aPk~~KE~v~KeeAE~~KkkLeeaG 125 (131) T TIGR00855 77 AVIKVVREVITGLGLKEAKD-------------------LVEKAPKALKEGVSKEEAEEIKKKLEEAG 125 (131) T ss_pred EEEEEECHHHCCCHHHHHHH-------------------HHHHCCCHHHCCCCHHHHHHHHHHHHHCC T ss_conf 21233200021423688999-------------------98606801223889768999999997477 No 427 >PRK04207 glyceraldehyde-3-phosphate dehydrogenase; Provisional Probab=28.09 E-value=28 Score=12.37 Aligned_cols=45 Identities=22% Similarity=0.368 Sum_probs=22.4 Q ss_pred HHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 789999999830380215542207787788999999975279988988 Q 537021.9.peg.7 426 RRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVD 473 (1033) Q Consensus 426 ~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~ 473 (1033) .+++|++-+.+. .+++.+.+-..+.+-+.++|.+|-+|-|-..+. T Consensus 234 t~~eVl~~~~~~---~Ri~~~~~~~g~~sta~i~e~ardlgr~r~Dl~ 278 (338) T PRK04207 234 TKDDVLEALENT---PRILLISASDGIDSTAELIEYARDLGRPRGDLY 278 (338) T ss_pred CHHHHHHHHHHC---CCEEEEECCCCCCCHHHHHHHHHHCCCCCCCHH T ss_conf 889999999638---958999765687238999999998199867704 No 428 >PRK09177 xanthine-guanine phosphoribosyltransferase; Validated Probab=28.06 E-value=28 Score=12.37 Aligned_cols=26 Identities=23% Similarity=0.304 Sum_probs=14.3 Q ss_pred CCCCCCC-HHHHHHHHHHHCCCCCHHH Q ss_conf 3344442-0466776554403756333 Q 537021.9.peg.7 374 VGPGRGS-GAGSVTAYALTITDIDPLR 399 (1033) Q Consensus 374 vGpGRGS-AagSLVaY~LgIT~VDPi~ 399 (1033) ||-+||- ..+.++|..|||-.++.+. T Consensus 36 V~IaRGGliPA~~ls~~L~i~~i~~i~ 62 (156) T PRK09177 36 IAVTRGGLVPAAILARELGIRLVDTVC 62 (156) T ss_pred EEECCCCCCHHHHHHHHHCCCEEEEEE T ss_conf 999147831599999985984057999 No 429 >KOG1091 consensus Probab=28.02 E-value=28 Score=12.36 Aligned_cols=32 Identities=25% Similarity=0.199 Sum_probs=19.2 Q ss_pred HHHHHHHHHHH-HHHCCCCCCCCCCCCHHHHHHHHHHHCC Q ss_conf 55788899999-8510476433444420466776554403 Q 537021.9.peg.7 355 GYFLIVADFIQ-WAKKNNIPVGPGRGSGAGSVTAYALTIT 393 (1033) Q Consensus 355 ~YFLiV~Div~-~ak~~gI~vGpGRGSAagSLVaY~LgIT 393 (1033) -|.|||||-+. +-.-+++ =+++=+|+-+|.|- T Consensus 295 ~dLLiVWD~~l~~d~pr~~-------Lv~~m~VsmLL~IR 327 (625) T KOG1091 295 QDLLIVWDHVLIFDSPRGI-------LVACMFVSMLLYIR 327 (625) T ss_pred HHHHHHHHHHHHCCCCHHH-------HHHHHHHHHHHHHH T ss_conf 7789985442312684188-------88899999999999 No 430 >PRK06182 short chain dehydrogenase; Validated Probab=28.00 E-value=28 Score=12.36 Aligned_cols=47 Identities=13% Similarity=0.027 Sum_probs=27.1 Q ss_pred CHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 9999999999999750001331014788899999999997302666777510 Q 537021.9.peg.7 742 SKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMT 793 (1033) Q Consensus 742 ~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~ 793 (1033) ++.+|+.|+..+.. ..+|.|= -.+.-++...+|+...|.-++-.+|. T Consensus 223 p~~vA~~i~~a~~~----~~p~~ry-~~G~~a~~~~~l~r~lP~~l~d~il~ 269 (273) T PRK06182 223 PQVIADAISKAVTA----KRPKTRY-AVGFGAKPLIALRRILPDRAFDRLIM 269 (273) T ss_pred HHHHHHHHHHHHHC----CCCCEEE-ECCHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 99999999999828----9998378-76854899999999888999999998 No 431 >cd07224 Pat_like Patatin-like phospholipase. Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein, but it also has the enzymatic activity of lipid acyl hydrolase, catalysing the cleavage of fatty acids from membrane lipids. Members of this family have been found also in vertebrates. Probab=27.93 E-value=28 Score=12.35 Aligned_cols=40 Identities=28% Similarity=0.449 Sum_probs=22.7 Q ss_pred HHHHHHHHHHHHHCCCC--CCCCCCCCHHHHHHHHHHHCCCCCHHH Q ss_conf 57888999998510476--433444420466776554403756333 Q 537021.9.peg.7 356 YFLIVADFIQWAKKNNI--PVGPGRGSGAGSVTAYALTITDIDPLR 399 (1033) Q Consensus 356 YFLiV~Div~~ak~~gI--~vGpGRGSAagSLVaY~LgIT~VDPi~ 399 (1033) |.+=|.... +++|+ ..-+=-|+.|||||| ++-...++|-+ T Consensus 13 YHlGV~~~L---~e~g~~~~~~~~aGaSAGAl~a-a~~~~g~~~~~ 54 (233) T cd07224 13 YHLGVLSLL---IEAGVINETTPLAGASAGSLAA-ACSASGLSPEE 54 (233) T ss_pred HHHHHHHHH---HHCCCCCCCCEEEEECHHHHHH-HHHHCCCCHHH T ss_conf 999999999---9717323356478865889999-99974999999 No 432 >cd00895 PI3Kc_C2_beta Phosphoinositide 3-kinase (PI3K), class II, beta isoform, catalytic domain; The PI3K catalytic domain family is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. PI3Ks catalyze the transfer of the gamma-phosphoryl group from ATP to the 3-hydroxyl of the inositol ring of D-myo-phosphatidylinositol (PtdIns) or its derivatives. PI3Ks play an important role in a variety of fundamental cellular processes, including cell motility, the Ras pathway, vesicle trafficking and secretion, immune cell activation and apoptosis. They can be divided into three main classes (I, II, and III), defined by their substrate specificity, regulation, and domain structure. Class II PI3Ks preferentially use PtdIns as a substrate to produce PtdIns(3)P, but can also phosphorylate PtdIns(4)P. They function as monomers and do not Probab=27.91 E-value=27 Score=12.51 Aligned_cols=87 Identities=21% Similarity=0.259 Sum_probs=46.6 Q ss_pred HHHCCCCCCCEEEECCCCH--HHHEECCCC-----C-CCCCEEEEEECHHHHHHCCCHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 2100244744033525666--662001236-----7-6886688510213343112321356541789999999988740 Q 537021.9.peg.7 521 LYRHASTHAAGIVIGDRPL--SQLVPMYRD-----V-RSDLPVTQFNMKWIEKAGLIKFDFLGLKTLTLLQKSLDFLAQR 592 (1033) Q Consensus 521 ~~r~~~~Ha~Gvvi~~~~l--~~~~p~~~~-----~-~~~~~~~q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~~ 592 (1033) +|-+.++..+||++-.-.+ ....|+... + .+...++-=..|| +.=|.|.+-.+.+++...+. . T Consensus 52 LPl~P~~~v~gi~~~~~~~f~S~~~Pl~l~F~~~d~~~~~~~vIfK~GDD------LRQD~L~lQli~lmd~i~~~---~ 122 (354) T cd00895 52 LPLSPSLLVKGIVPRDCSYFNSNAVPLKLSFQNVDPLGENIRVIFKCGDD------LRQDMLTLQMIRIMNKIWVQ---E 122 (354) T ss_pred CCCCCCEEEEEEECCCCEEECCCCCCEEEEEECCCCCCCCEEEEEECCCC------HHHHHHHHHHHHHHHHHHHH---C T ss_conf 78698637988964763670435784489986058888855799976973------40889999999999999997---6 Q ss_pred CCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCC Q ss_conf 25434332344358899876201410122114 Q 537021.9.peg.7 593 GVKVDLSLIPFDDHETYHLLTTKGTLGIFQLE 624 (1033) Q Consensus 593 ~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~e 624 (1033) | +|+.-.| |+.++.|...|+-|+- T Consensus 123 g--LDL~l~p------Y~vlatg~~~G~IE~V 146 (354) T cd00895 123 G--LDMRMVI------FRCFSTGRGRGMVEMI 146 (354) T ss_pred C--CCEEEEE------EEEEEECCCCCEEEEC T ss_conf 9--9826787------7999855987427751 No 433 >COG4034 Uncharacterized protein conserved in archaea [Function unknown] Probab=27.79 E-value=29 Score=12.33 Aligned_cols=53 Identities=19% Similarity=0.162 Sum_probs=41.6 Q ss_pred HHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHH-HHHCCCCEEEEEEE Q ss_conf 443299899999999789988998116517749999999-99769963899999 Q 537021.9.peg.7 23 LEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQK-ACAAGIQPIIGCQL 75 (1033) Q Consensus 23 ldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~a-ckk~gIKPIiG~E~ 75 (1033) ..+---..+++..|+.+|..++++.-.++.-|..+-.++ +++.||-.|+|++. T Consensus 87 rg~~~~~pqi~raaralg~e~v~v~~~~gv~gl~e~lk~lv~~~~id~IvgvD~ 140 (328) T COG4034 87 RGGRRVKPQIVRAARALGKEAVAVDLVDGVRGLAEGLKALVEKEGIDLIVGVDV 140 (328) T ss_pred CCCCEECHHHHHHHHHHCCEEEEEECCCCCCCHHHHHHHHHHHHCCCEEEEECC T ss_conf 278253188999999809616999625773118899999998618557999615 No 434 >TIGR00631 uvrb excinuclease ABC, B subunit; InterPro: IPR004807 All proteins in this family for which functions are known are DNA helicases that function in the nucleotide excision repair and are endonucleases that make the 3' incision next to DNA damage. They are part of a pathway requiring UvrA, UvrB, UvrC, and UvrD homologs.; GO: 0009381 excinuclease ABC activity, 0006289 nucleotide-excision repair, 0005737 cytoplasm, 0009380 excinuclease repair complex. Probab=27.78 E-value=29 Score=12.33 Aligned_cols=208 Identities=19% Similarity=0.289 Sum_probs=110.8 Q ss_pred CCCHHHHHHHHHHHHHHCCCC---EEEEEEC---------------------CCHHHHHHHHHHHHHHHHHCCCEEEECC Q ss_conf 799789999999998765986---8999943---------------------9911589999999999981898998088 Q 537021.9.peg.7 161 FNSSQVAEKRLLTFKKLFGDR---LYVNLQR---------------------HRGYDRYRESQVVQLAYTHELPLVATNN 216 (1033) Q Consensus 161 ~~~~~~a~~~l~~l~~~Fgd~---~ylEl~~---------------------~~~~e~~~~~~l~~lA~~~~iPlVaTnd 216 (1033) ..|...|.+....|++.||+| +|+.-.+ |..-|+.+..+...|.+.-+|=+|||=. T Consensus 61 aHNKTLAAQLY~EfKefFPeNAVEYFvSYYDYYQPEAYvP~~DtyIEKdaSINdeIerlR~SAT~SLl~RrDVIVVASVS 140 (667) T TIGR00631 61 AHNKTLAAQLYNEFKEFFPENAVEYFVSYYDYYQPEAYVPSKDTYIEKDASINDEIERLRLSATKSLLERRDVIVVASVS 140 (667) T ss_pred CCCHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCCCEEECCCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEE T ss_conf 57776799999999863867724525520323787321479884130455300467677889888642378789998755 Q ss_pred CCCC--CHHHHHHHHHHHHHHCCCCCCC--------------------CCCCCCCCC---CCC-CCHHH-HH--HHCCHH Q ss_conf 3007--8788999987668846985765--------------------310024534---323-43478-87--512203 Q 537021.9.peg.7 217 SLFL--SEEDYEAHDVLMAVAHSTVVSQ--------------------KDRPRVTPD---HYL-KNRSE-MV--SIFSDL 267 (1033) Q Consensus 217 v~Y~--~~~D~~~h~iL~aI~~~~~l~~--------------------~~~~~~~~~---~yL-ks~eE-M~--~~f~~~ 267 (1033) |=|- +|++|.-.-+ .+..|..++. .+.||.-++ -+. -..++ .. ++|.|- T Consensus 141 cIYGLG~P~~Y~~~~~--~l~vG~~~~~~~ll~~LV~lqY~RNd~~f~RG~FRvrGDVvEIfP~~~~~~~~~RiEfF~DE 218 (667) T TIGR00631 141 CIYGLGSPEEYLKMVL--HLEVGKEIDRRELLRRLVELQYERNDVDFQRGTFRVRGDVVEIFPAAYEDEFAVRIEFFGDE 218 (667) T ss_pred EECCCCCCHHHHCCEE--EEEECCCCCHHHHHHHHHHHCEEECCEECCCCCEEEEEEEEEEECCCCCCCCEEEEEEECCE T ss_conf 2068888255642278--65407855889999886340305712340577125763189863477677724899852554 Q ss_pred HHHHHHH----HHHHHHHHHHHHHHHCCCCC--CCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHH Q ss_conf 5899878----86411100013322102345--55756887768988999876541034201246432457838999998 Q 537021.9.peg.7 268 PEALENT----VEIARRCSFILQVHAPILPR--FIEKECDDIQKEEENELRNKAVAGLEVRLAQGAIAKGYHAQDYRQRL 341 (1033) Q Consensus 268 peAi~NT----~~IAe~c~~~l~~~~~~lP~--f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~~~~~~~~~~~~~y~~RL 341 (1033) =|.|..- .++....+ .-..+|. |.+|.. . -..+=+.+++-..+.++.-=.++. ....+--.+|= T Consensus 219 ie~I~~~dpLtg~~~~~~~-----~~~ifPA~HyV~p~e-~-~~~Ai~~I~~EL~eRl~~f~~~~k---llEaQRL~qRT 288 (667) T TIGR00631 219 IERISLVDPLTGEVLRELD-----SFTIFPASHYVTPEE-R-LERAIKEIEKELEERLKYFEEQGK---LLEAQRLKQRT 288 (667) T ss_pred EEEHHHHCCCCCCHHHHCC-----CEEEECCCEECCCCH-H-HHHHHHHHHHHHHHHHHHHHHCCC---HHHHHHHHHHH T ss_conf 3202120511051110056-----158605400025804-7-999999999999999999987795---17888898868 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHH Q ss_conf 877898986063055788899999851047643344442046677655 Q 537021.9.peg.7 342 DFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYA 389 (1033) Q Consensus 342 e~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~ 389 (1033) +|=|+.|.+|||+.= |. |||| .+-|-.-||---||.=|. T Consensus 289 ~yDLEMl~e~G~C~G---IE---NYSR---hl~GR~~GEpP~tLlDYF 327 (667) T TIGR00631 289 EYDLEMLREMGYCSG---IE---NYSR---HLSGRKPGEPPYTLLDYF 327 (667) T ss_pred HHHHHHHHHCCCCCC---CC---HHCC---CCCCCCCCCCCCCHHHHC T ss_conf 877888750673377---00---0000---216777631596477650 No 435 >pfam10087 DUF2325 Uncharacterized protein conserved in bacteria (DUF2325). Members of this family of hypothetical bacterial proteins have no known function. Probab=27.77 E-value=29 Score=12.33 Aligned_cols=19 Identities=21% Similarity=0.214 Sum_probs=12.4 Q ss_pred HHHHHHHHHHHCCCEEEEC Q ss_conf 9999999998189899808 Q 537021.9.peg.7 197 ESQVVQLAYTHELPLVATN 215 (1033) Q Consensus 197 ~~~l~~lA~~~~iPlVaTn 215 (1033) ...+.+.|++.++|+|... T Consensus 62 ~~~vK~~akk~~~p~v~~~ 80 (96) T pfam10087 62 MWRVKEEAKKRGIPVVFSR 80 (96) T ss_pred HHHHHHHHHHHCCCEEEEC T ss_conf 9999999998499789976 No 436 >KOG0624 consensus Probab=27.75 E-value=29 Score=12.33 Aligned_cols=43 Identities=19% Similarity=0.101 Sum_probs=17.8 Q ss_pred HHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHH Q ss_conf 31444764432998999999997899889981165177499999999 Q 537021.9.peg.7 16 VHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKA 62 (1033) Q Consensus 16 ~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ac 62 (1033) +-+.|-...++.++.+. ++--++|-.-+| .+-++.++--|.+| T Consensus 22 l~~~~e~a~~~~~~adv-ekhlElGk~lla---~~Q~sDALt~yHaA 64 (504) T KOG0624 22 LELFLEGAESTASPADV-EKHLELGKELLA---RGQLSDALTHYHAA 64 (504) T ss_pred HHHHHHHHHHCCCHHHH-HHHHHHHHHHHH---HHHHHHHHHHHHHH T ss_conf 99998787725999999-999998899997---34677899998888 No 437 >TIGR00789 flhB_rel FlhB domain protein; InterPro: IPR004683 This group describes a short protein (80-93 residues) homologous to the C-terminus of the flagellar biosynthetic protein FlhB. It is found so far only in species that also have FlhB. In a phylogenetic tree based on alignment of both this family and the homologous region of FlhB and its homologs, the members of this family form a monophyletic set.. Probab=27.74 E-value=29 Score=12.33 Aligned_cols=36 Identities=25% Similarity=0.435 Sum_probs=26.4 Q ss_pred HHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCC Q ss_conf 9999999999981898998088300787889999876688469857 Q 537021.9.peg.7 195 YRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVV 240 (1033) Q Consensus 195 ~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l 240 (1033) ...++++++|+++|||+. +|..+-++|.-.+.+..+ T Consensus 27 evA~~II~~AK~~gipi~----------ed~~L~~~L~~l~l~~~I 62 (84) T TIGR00789 27 EVAEKIIELAKKHGIPIK----------EDEDLVDVLLKLDLDDEI 62 (84) T ss_pred HHHHHHHHHHHHCCCCCC----------CCHHHHHHHHHCCCCCCC T ss_conf 578899999976389976----------582899987440545768 No 438 >TIGR01326 OAH_OAS_sulfhy O-acetylhomoserine aminocarboxypropyltransferase/cysteine synthase; InterPro: IPR006235 This family of sequences is a distinct clade of the Cys/Met metabolism pyridoxal phosphate-dependent enzyme superfamily. Members include examples of OAH/OAS sulfhydrylase, an enzyme with activity both as O-acetylhomoserine sulfhydrylase (OAH, 2.5.1.49 from EC) and O-acetylserine sulfhydrylase (OAS, 2.5.1.47 from EC). An alternate name for OAH sulfhydrylase is homocysteine synthase.; GO: 0016765 transferase activity transferring alkyl or aryl (other than methyl) groups, 0006520 amino acid metabolic process. Probab=27.62 E-value=29 Score=12.31 Aligned_cols=223 Identities=17% Similarity=0.228 Sum_probs=100.9 Q ss_pred HHHHHHHCCCC---EEEEEECCCHHHHHHHHHHHHHHHHH--CCCEEEECCC----CCCCHHHHHHHHHHHHHHCCCCCC Q ss_conf 99998765986---89999439911589999999999981--8989980883----007878899998766884698576 Q 537021.9.peg.7 171 LLTFKKLFGDR---LYVNLQRHRGYDRYRESQVVQLAYTH--ELPLVATNNS----LFLSEEDYEAHDVLMAVAHSTVVS 241 (1033) Q Consensus 171 l~~l~~~Fgd~---~ylEl~~~~~~e~~~~~~l~~lA~~~--~iPlVaTndv----~Y~~~~D~~~h~iL~aI~~~~~l~ 241 (1033) .+.|++...+| +|+|---+...+=.=.+++.++|++. +||+|+=|-+ ++++|=|+=+==|.+ T Consensus 133 pe~~~k~id~nTKAvf~EtIgNP~~~v~Die~~a~~Ah~~PhgvPliVDNT~atpGYL~rPi~hGADIVvh--------- 203 (434) T TIGR01326 133 PEELEKAIDENTKAVFAETIGNPALNVPDIEAVAEVAHAHPHGVPLIVDNTFATPGYLCRPIDHGADIVVH--------- 203 (434) T ss_pred HHHHHHHCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHCCCCEEEE--------- T ss_conf 78999760667518984012387767678589999998678983488747868764100645649867996--------- Q ss_pred CCCCCCCCCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHH-HHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 531002453432343478875122035899878864111000-1332210234555756887768988999876541034 Q 537021.9.peg.7 242 QKDRPRVTPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSF-ILQVHAPILPRFIEKECDDIQKEEENELRNKAVAGLE 320 (1033) Q Consensus 242 ~~~~~~~~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~-~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~ 320 (1033) +..=| ..-.--+|- .-|.|.=+| +|..+.-++|.|-+|+- ++=|++ T Consensus 204 -------SaTK~----------iGGHG~aiG--G~ivD~G~FndW~~~~gkFP~f~~Pdp--------------sYHGl~ 250 (434) T TIGR01326 204 -------SATKY----------IGGHGTAIG--GVIVDGGKFNDWANNPGKFPLFTTPDP--------------SYHGLV 250 (434) T ss_pred -------EECCC----------CCCCEEEEE--EEEEECCCCCCCCCCCCCCCCCCCCCC--------------CCCCCE T ss_conf -------10010----------126314650--478826755630147874777888888--------------658712 Q ss_pred HCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHC Q ss_conf 20124643245783899999887789898606305578889999985104764334444204667765544037563332 Q 537021.9.peg.7 321 VRLAQGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRF 400 (1033) Q Consensus 321 ~R~~~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~ 400 (1033) |.+-. .--+.+|...-==|+.=-|.|=|+- |+|.+=-=||+|- =.++=| T Consensus 251 --f~E~~------------------g~~~~~G~~~~Afi~~~Rv~~LRD~--------Gaa~sPFnAFLll-QGLETL-- 299 (434) T TIGR01326 251 --FTETF------------------GEFEGLGLGNIAFIVKARVQLLRDL--------GAALSPFNAFLLL-QGLETL-- 299 (434) T ss_pred --EEECC------------------CCCCCCCCCCHHEEEEEEEEECHHC--------CCCCCHHHHHHHH-HHHHHH-- T ss_conf --65315------------------5757778500000344467422100--------3340178999999-866678-- Q ss_pred CCHHHHHH-HHCCC-----CCCCCC-CCCCHHHHHHHHHHHHHHCCCCC-EEECCCHHHHHHHHHHHHHHHHCCC Q ss_conf 31056762-10014-----578753-33656878999999983038021-5542207787788999999975279 Q 537021.9.peg.7 401 SLLFERFL-NPDRM-----SMPDFD-IDFCQDRRDEVIRYVQNKYGHER-VAQIITFGSLQAKAALRDVGRALQM 467 (1033) Q Consensus 401 ~LlFERFL-np~R~-----~~PDID-iDf~~~~R~~vi~y~~~kyG~~~-va~i~t~~t~~~k~aird~~r~~g~ 467 (1033) -|--||-. |-.++ +=|||- +.+|.-.= -...-+.+||.... .+.|.||+==+.+.+=+-+..++.+ T Consensus 300 sLR~ERH~~NA~kVA~fL~~Hp~V~wV~YPGL~s-~~~h~lAkkYl~~G~~G~vLsFe~kGG~ea~~kfidalkL 373 (434) T TIGR01326 300 SLRMERHVENALKVAEFLEAHPKVAWVNYPGLAS-HPHHALAKKYLPKGLFGAVLSFEIKGGREAGKKFIDALKL 373 (434) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC-CHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHH T ss_conf 7678888899999999972788445134888788-7537889987417942678741321828899999877677 No 439 >pfam05301 Mec-17 Touch receptor neuron protein Mec-17. Mec-17 is the protein product of one of the 18 genes required for the development and function of the touch receptor neuron for gentle touch. Mec-17 is specifically required for maintaining the differentiation of the touch receptor. This family is conserved to higher eukaryotes. Probab=27.61 E-value=29 Score=12.31 Aligned_cols=34 Identities=18% Similarity=0.516 Sum_probs=13.8 Q ss_pred HHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCC Q ss_conf 0567621001457875333656878999999983038 Q 537021.9.peg.7 403 LFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYG 439 (1033) Q Consensus 403 lFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG 439 (1033) |||.+|..+..++-.+-+|=|+ ...+.++.+.|| T Consensus 67 Lf~~ML~~e~~~p~~la~DrPS---~Kll~FL~Khy~ 100 (120) T pfam05301 67 LFDEMLKKENVEPHQCAVDRPS---AKLLQFLSKHYG 100 (120) T ss_pred HHHHHHHHCCCCHHHCCCCCCC---HHHHHHHHHHCC T ss_conf 9999998719981346105898---999999998569 No 440 >cd00894 PI3Kc_IB_gamma Phosphoinositide 3-kinase (PI3K), class IB, gamma isoform, catalytic domain; The PI3K catalytic domain family is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. PI3Ks catalyze the transfer of the gamma-phosphoryl group from ATP to the 3-hydroxyl of the inositol ring of D-myo-phosphatidylinositol (PtdIns) or its derivatives. PI3Ks can be divided into three main classes (I, II, and III), defined by their substrate specificity, regulation, and domain structure. Class I PI3Ks are the only enzymes capable of converting PtdIns(4,5)P2 to the critical second messenger PtdIns(3,4,5)P3. Class I enzymes are heterodimers and exist in multiple isoforms consisting of one catalytic subunit (out of four isoforms) and one of several regulatory subunits. They are further classified into class IA (alpha, beta and Probab=27.60 E-value=26 Score=12.63 Aligned_cols=44 Identities=25% Similarity=0.543 Sum_probs=27.1 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCC Q ss_conf 2135654178999999998874025434332344358899876201410122114 Q 537021.9.peg.7 570 KFDFLGLKTLTLLQKSLDFLAQRGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLE 624 (1033) Q Consensus 570 K~D~Lgl~~L~~i~~~~~~i~~~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~e 624 (1033) .=|.|.+-.+.+++... ++.| +|+.-.| |+.++.|...|+-++- T Consensus 111 RQD~L~lQli~lMd~i~---k~~g--LDL~l~p------Y~vlatg~~~G~IE~V 154 (365) T cd00894 111 RQDMLILQILRIMESIW---ETES--LDLCLLP------YGCISTGDKIGMIEIV 154 (365) T ss_pred CHHHHHHHHHHHHHHHH---HHCC--CCCCEEE------EEEEEECCCCEEEEEC T ss_conf 59999999999999999---8669--9854067------7899955992289983 No 441 >pfam03102 NeuB NeuB family. NeuB is the prokaryotic N-acetylneuraminic acid (Neu5Ac) synthase. It catalyses the direct formation of Neu5Ac (the most common sialic acid) by condensation of phosphoenolpyruvate (PEP) and N-acetylmannosamine (ManNAc). This reaction has only been observed in prokaryotes; eukaryotes synthesize the 9-phosphate form, Neu5Ac-9-P, and utilize ManNAc-6-P instead of ManNAc. Such eukaryotic enzymes are not present in this family. This family also contains SpsE spore coat polysaccharide biosynthesis proteins. Probab=27.57 E-value=29 Score=12.30 Aligned_cols=70 Identities=13% Similarity=0.077 Sum_probs=30.5 Q ss_pred HHHHHHHHHHHHHHCCCCEEE--EEECCCHHHH-HHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCC Q ss_conf 789999999998765986899--9943991158-9999999999981898998088300787889999876688469857 Q 537021.9.peg.7 164 SQVAEKRLLTFKKLFGDRLYV--NLQRHRGYDR-YRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVV 240 (1033) Q Consensus 164 ~~~a~~~l~~l~~~Fgd~~yl--El~~~~~~e~-~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l 240 (1033) .++.++.+..+......++-+ -++.+..+.+ ...+.+..|.+.++.|+ .-.| |-.. ..-.+.|+..|..+ T Consensus 125 ~~eI~~~i~~~~~~~~~~i~llhCvS~YPt~~~~~nL~~I~~lk~~f~~~i-G~Sd-Hs~g-----~~~~~~A~~~GA~i 197 (240) T pfam03102 125 LEEIEEAVETLREAGNEDITLLHCTSEYPAPFEDVNLRAIPTLKEAFGVPV-GYSD-HTLG-----IEAPIAAVALGASV 197 (240) T ss_pred HHHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCE-EECC-CCCC-----CHHHHHHHHCCCCE T ss_conf 999999999999634567768665888999746600999889998679968-8479-8897-----07779999829939 No 442 >PRK12460 2-keto-3-deoxygluconate permease; Provisional Probab=27.55 E-value=29 Score=12.30 Aligned_cols=42 Identities=36% Similarity=0.527 Sum_probs=34.1 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHH Q ss_conf 055788899999851047643344442046677655440375633 Q 537021.9.peg.7 354 SGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPL 398 (1033) Q Consensus 354 ~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi 398 (1033) .++|++..|.. .+.+|+ .|-+=||.||.-|+-=.-|-++||- T Consensus 229 tg~~~~~~dr~--~g~~g~-aG~A~sstAGnAvatPa~vA~~dPs 270 (308) T PRK12460 229 GGFFNIRADRL--VGGTGI-AGAAASSTAGNAVATPLAIAQADPS 270 (308) T ss_pred HHHHHHHHHHH--HCCCCC-HHHHHHHHHHHHHHHHHHHHHHCCH T ss_conf 83999999999--589972-2468977777877538999874903 No 443 >COG1735 Php Predicted metal-dependent hydrolase with the TIM-barrel fold [General function prediction only] Probab=27.55 E-value=29 Score=12.30 Aligned_cols=20 Identities=20% Similarity=0.235 Sum_probs=9.1 Q ss_pred HHHHHHHHHCCCEEEECCCC Q ss_conf 99999998189899808830 Q 537021.9.peg.7 199 QVVQLAYTHELPLVATNNSL 218 (1033) Q Consensus 199 ~l~~lA~~~~iPlVaTndv~ 218 (1033) .+.++|+++++-+|++...| T Consensus 79 ~m~~vs~atglnIV~~TGfy 98 (316) T COG1735 79 KMRRVAEATGLNIVAATGFY 98 (316) T ss_pred HHHHHHHHHCCCEEEECCCC T ss_conf 99999997198589704533 No 444 >cd03332 LMO_FMN L-Lactate 2-monooxygenase (LMO) FMN-binding domain. LMO is a FMN-containing enzyme that catalyzes the conversion of L-lactate and oxygen to acetate, carbon dioxide, and water. LMO is a member of the family of alpha-hydroxy acid oxidases. It is thought to be a homooctamer with two- and four- fold axes in the center of the octamer. Probab=27.54 E-value=29 Score=12.30 Aligned_cols=18 Identities=17% Similarity=0.242 Sum_probs=7.6 Q ss_pred HHHHHHHHCCCEEEECCC Q ss_conf 999999818989980883 Q 537021.9.peg.7 200 VVQLAYTHELPLVATNNS 217 (1033) Q Consensus 200 l~~lA~~~~iPlVaTndv 217 (1033) +.+-|.+.|+|.+.++.. T Consensus 107 ~AraA~~~g~~~~lSt~s 124 (383) T cd03332 107 TARAAAELGVPYILSTAS 124 (383) T ss_pred HHHHHHHHCCCEECCCCC T ss_conf 999999835862205776 No 445 >cd00080 HhH2_motif Helix-hairpin-helix class 2 (Pol1 family) motif. HhH2 domains are found in Rad2 family of prokaryotic and eukaryotic replication and repair nucleases, i.e., DNA polymerase I, Taq DNA polymerase, DNA repair protein Rad2 endonuclease, flap endonuclease, exonuclease I and IX, 5'-3' exonuclease and also bacteriophage Rnase H. These nucleases degrade RNA-DNA or DNA-DNA duplexes, or both and play essential roles in DNA duplication, repair, and recombination. Probab=27.50 E-value=29 Score=12.29 Aligned_cols=30 Identities=27% Similarity=0.633 Sum_probs=22.4 Q ss_pred HHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCC Q ss_conf 26858969899999999983799999789997348 Q 537021.9.peg.7 839 LAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVD 873 (1033) Q Consensus 839 l~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~ 873 (1033) +..|+|||.++|..++. .|.|+++.+...+ T Consensus 24 ipGV~GIG~ktA~~ll~-----~~~~l~~i~~~~~ 53 (75) T cd00080 24 IPGVPGIGPKTALKLLK-----EYGSLENLLENLD 53 (75) T ss_pred CCCCCCCCHHHHHHHHH-----HCCCHHHHHHHHH T ss_conf 87758637999999999-----9099999998387 No 446 >pfam01026 TatD_DNase TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme. Probab=27.49 E-value=29 Score=12.29 Aligned_cols=57 Identities=11% Similarity=0.141 Sum_probs=40.0 Q ss_pred HHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCC--CCEEEEEE Q ss_conf 3314447644329989999999978998899811651774999999999769--96389999 Q 537021.9.peg.7 15 QVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAG--IQPIIGCQ 74 (1033) Q Consensus 15 h~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~g--IKPIiG~E 74 (1033) |||-.+. +-....++.+++|++.|...+.++-. +.....+..+.|++++ |.+-+|+- T Consensus 4 H~HL~~~--~~~~d~~~vl~~a~~~gv~~ii~~~~-~~~~~~~~~~la~~~~~~v~~~~GiH 62 (255) T pfam01026 4 HCHLDFK--KFDGDRDEVIERAREAGVTAVVVVGT-DLKDFERALELARKYPGKVYAAVGVH 62 (255) T ss_pred EECCCCH--HHCCCHHHHHHHHHHCCCCEEEEECC-CHHHHHHHHHHHHHCCCCEEEEECCC T ss_conf 1378981--43569999999999879998999569-99999999999976876678995358 No 447 >pfam02638 DUF187 Uncharacterized BCR, COG1649. Probab=27.42 E-value=29 Score=12.28 Aligned_cols=30 Identities=10% Similarity=-0.126 Sum_probs=18.5 Q ss_pred HHHHHHHHHHHHCCCEEEECCCCCCCHHHH Q ss_conf 999999999981898998088300787889 Q 537021.9.peg.7 196 RESQVVQLAYTHELPLVATNNSLFLSEEDY 225 (1033) Q Consensus 196 ~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~ 225 (1033) -...+++-|.+.|+.+.|==++.+..+.+. T Consensus 111 PL~~~I~EaHkRGlevhAWF~~~~~~~~~s 140 (394) T pfam02638 111 PLAFMIDEAHKRNLRVHPWFEFGFNAPALS 140 (394) T ss_pred HHHHHHHHHHHCCCEEEEEEEECCCCCCCH T ss_conf 899999999876998989987035778612 No 448 >COG0313 Predicted methyltransferases [General function prediction only] Probab=27.42 E-value=29 Score=12.28 Aligned_cols=100 Identities=20% Similarity=0.249 Sum_probs=49.5 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHH Q ss_conf 98986063055788899999851047643344442046677655440375633323105676210014578753336568 Q 537021.9.peg.7 346 NVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFCQD 425 (1033) Q Consensus 346 ~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~ 425 (1033) .+|.+-|++.-====+.+|++|++.||.|-|==|- +|+++ +|=.....+= ..+||=||...- .+ T Consensus 81 alVSDAG~P~ISDPG~~LV~~a~~~gi~V~~lPG~--sA~~t-AL~~SGl~~~--~F~F~GFLP~k~-----------~~ 144 (275) T COG0313 81 ALVSDAGTPLISDPGYELVRAAREAGIRVVPLPGP--SALIT-ALSASGLPSQ--RFLFEGFLPRKS-----------KE 144 (275) T ss_pred EEEECCCCCCCCCCCHHHHHHHHHCCCCEEECCCC--CHHHH-HHHHCCCCCC--CEEEECCCCCCC-----------CH T ss_conf 99805899766685099999999869918966883--49999-9997589999--726841179983-----------27 Q ss_pred HHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCC Q ss_conf 78999999983038021554220778778899999997527 Q 537021.9.peg.7 426 RRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQ 466 (1033) Q Consensus 426 ~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g 466 (1033) ||+. ++-+.+. .-++|.==+..+-...+.|..-++| T Consensus 145 R~~~-l~~l~~~----~~t~IfyEsphRl~~tL~d~~~~~g 180 (275) T COG0313 145 RRKR-LEALANE----PRTLIFYESPHRLLATLEDIVEVLG 180 (275) T ss_pred HHHH-HHHHHHC----CCEEEEEECCHHHHHHHHHHHHHCC T ss_conf 8999-9998745----7739999663658999999999718 No 449 >PRK10434 srlR DNA-bindng transcriptional repressor SrlR; Provisional Probab=27.39 E-value=29 Score=12.28 Aligned_cols=35 Identities=14% Similarity=0.415 Sum_probs=25.3 Q ss_pred CHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 568789999999830380215542207787788999999975279988988 Q 537021.9.peg.7 423 CQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVD 473 (1033) Q Consensus 423 ~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~ 473 (1033) +.+||+++++||.+ .| +..+.|+++.||++..++. T Consensus 3 ~~eR~~~Il~~L~~-~g---------------~v~v~eLa~~l~VS~~TIR 37 (256) T PRK10434 3 PRQRQAAILEYLQK-QG---------------KCSVEELAQYFDTTGTTIR 37 (256) T ss_pred HHHHHHHHHHHHHH-CC---------------CEEHHHHHHHHCCCHHHHH T ss_conf 89999999999998-69---------------7999999998796998999 No 450 >cd05170 PIKKc_SMG1 Suppressor of morphogenetic effect on genitalia-1 (SMG-1), catalytic domain; The SMG-1 catalytic domain subfamily is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. SMG-1 is a member of the phosphoinositide 3-kinase-related protein kinase (PIKK) subfamily. PIKKs have intrinsic serine/threonine kinase activity and are distinguished from other PKs by their unique catalytic domain, similar to that of lipid PI3K, and their large molecular weight (240-470 kDa). In addition to its catalytic domain, SMG-1 contains a FATC (FRAP, ATM and TRRAP, C-terminal) domain at the C-terminus. SMG-1 plays a critical role in the mRNA surveillance mechanism known as non-sense mediated mRNA decay (NMD). NMD protects the cells from the accumulation of aberrant mRNAs with premature termination codons (PTCs) generated by geno Probab=27.34 E-value=29 Score=12.29 Aligned_cols=27 Identities=19% Similarity=0.322 Sum_probs=20.9 Q ss_pred CCCCHHHHHHHHHHHCCCCCHHHCCCHHH Q ss_conf 44420466776554403756333231056 Q 537021.9.peg.7 377 GRGSGAGSVTAYALTITDIDPLRFSLLFE 405 (1033) Q Consensus 377 GRGSAagSLVaY~LgIT~VDPi~~~LlFE 405 (1033) .|-.|+.|+|.|.|||-+=-|= |+|+. T Consensus 198 t~S~A~~SivgYIlGLGDRH~~--NILi~ 224 (307) T cd05170 198 ARSTAVMSMIGYVIGLGDRHLD--NVLID 224 (307) T ss_pred HHHHHHHHHHHHHHCCCCCCCC--EEEEE T ss_conf 9989999999997236789864--08998 No 451 >TIGR00191 thrB homoserine kinase; InterPro: IPR000870 Saccharomyces cerevisiae strains containing the erg8-1 mutation are temperature sensitive for growth due to a defect in phosphomevalonate kinase, an enzyme of isoprene and ergosterol biosynthesis. Subcloning and DNA sequencing have defined the functional ERG8 regulon as an 850bp upstream region and an adjacent 1,272bp open reading frame. The deduced ERG8 protein contains 424 residues and shows no similarity to known proteins, except within a putative ATP-binding domain present in many kinases . Enzymes that share the N-terminal Gly/Ser-rich putative ATP-binding region include galactokinase, homoserine kinase, mevalonate kinase and phosphomevalonate kinase. Homoserine kinase is a homodimeric enzyme involved in threonine biosynthesis. Sequence comparison of the yeast enzyme with the corresponding proteins from bacterial sources reveals the presence of several highly conserved regions, the pattern of occurrence of which suggests that the ancestral sequences might have been composed from separate (functional) domains. A block of similar residues, found towards the C-terminus, is also present in many other proteins involved in threonine (or serine) metabolism; this motif may therefore represent the binding site for the hydroxy-amino acids. Limited similarity was detected between a motif conserved among the homoserine kinases and consensus sequences found in other mono- or dinucleotide-binding proteins .; GO: 0004413 homoserine kinase activity, 0005524 ATP binding, 0006566 threonine metabolic process. Probab=27.27 E-value=29 Score=12.26 Aligned_cols=28 Identities=11% Similarity=0.100 Sum_probs=15.8 Q ss_pred CHHHHHHHCCH---HHHHHHHHHHHHHHHHH Q ss_conf 34788751220---35899878864111000 Q 537021.9.peg.7 256 NRSEMVSIFSD---LPEALENTVEIARRCSF 283 (1033) Q Consensus 256 s~eEM~~~f~~---~peAi~NT~~IAe~c~~ 283 (1033) |++|-++.-+. .-+++.|..+++-++.. T Consensus 226 sT~~AR~vLPk~Y~~~d~v~n~~~l~~l~~A 256 (359) T TIGR00191 226 STAEARAVLPKAYPRQDLVFNLSHLAGLVHA 256 (359) T ss_pred CHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 5688962055546788999988778777756 No 452 >PRK00941 acetyl-CoA decarbonylase/synthase complex subunit alpha; Validated Probab=27.17 E-value=29 Score=12.25 Aligned_cols=66 Identities=24% Similarity=0.346 Sum_probs=33.8 Q ss_pred HHHHHHHHHHHHHCCCC----CCCCCCCCHHHHHHHHHHHCCCCC-------HHHCCCHHHHHHHHCCCCCCCCCCCCCH Q ss_conf 57888999998510476----433444420466776554403756-------3332310567621001457875333656 Q 537021.9.peg.7 356 YFLIVADFIQWAKKNNI----PVGPGRGSGAGSVTAYALTITDID-------PLRFSLLFERFLNPDRMSMPDFDIDFCQ 424 (1033) Q Consensus 356 YFLiV~Div~~ak~~gI----~vGpGRGSAagSLVaY~LgIT~VD-------Pi~~~LlFERFLnp~R~~~PDIDiDf~~ 424 (1033) .++-+++++.|-+++|. -++ | +-|...+||..+ |+-+-|-| -|..+||+=+==++ T Consensus 246 Nv~p~~~i~dY~ee~~L~d~vEi~-G-------iCCTAiD~tRy~~~aKVvGpls~qlk~------iRsGi~DViVvDEQ 311 (779) T PRK00941 246 NVAPGTEIIDYLEENGLTDKVEVA-G-------ICCTAIDITRYNEKAKVVGPLSKQLKF------IRSGIPDVIVVDEQ 311 (779) T ss_pred CCCHHHHHHHHHHHCCCHHHEEEE-E-------EEEEEECCCCCCCCCEEECHHHHHHHH------HHCCCCCEEEECCC T ss_conf 753278999999876842432674-2-------332110134436776574267999999------87169957996453 Q ss_pred HHHHHHHHHHH Q ss_conf 87899999998 Q 537021.9.peg.7 425 DRRDEVIRYVQ 435 (1033) Q Consensus 425 ~~R~~vi~y~~ 435 (1033) =-|.+|++--. T Consensus 312 Cvr~Dileea~ 322 (779) T PRK00941 312 CVRTDILEEAK 322 (779) T ss_pred CCCHHHHHHHH T ss_conf 22121899998 No 453 >pfam09128 RGS-like Regulator of G protein signalling-like domain. Members of this family adopt a structure consisting of twelve helices that fold into a compact domain that contains the overall structural scaffold observed in other RGS proteins and three additional helical elements that pack closely to it. Helices 1-9 comprise the RGS (pfam00615) fold, in which helices 4-7 form a classic antiparallel bundle adjacent to the other helices. Like other RGS structures, helices 7 and 8 span the length of the folded domain and form essentially one continuous helix with a kink in the middle. Helices 10-12 form an apparently stable C-terminal extension of the structural domain, and although other RGS proteins lack this structure, these elements are intimately associated with the rest of the structural framework by hydrophobic interactions. Members of the family bind to active G-alpha proteins, promoting GTP hydrolysis by the alpha subunit of heterotrimeric G proteins, thereby inactivating the G Probab=27.13 E-value=29 Score=12.24 Aligned_cols=21 Identities=14% Similarity=0.283 Sum_probs=8.9 Q ss_pred HHHCCCHHHHHHHHHHHHHHC Q ss_conf 887799789999999998765 Q 537021.9.peg.7 158 AFFFNSSQVAEKRLLTFKKLF 178 (1033) Q Consensus 158 ~l~~~~~~~a~~~l~~l~~~F 178 (1033) +...++.++.++|+-.+...| T Consensus 42 lyk~~~~Ke~rk~~~ei~stF 62 (188) T pfam09128 42 LYKQGTSKEARRWFMEIHSTF 62 (188) T ss_pred HHHCCCHHHHHHHHHHHHHHH T ss_conf 985368689999999999987 No 454 >TIGR01265 tyr_nico_aTase tyrosine/nicotianamine aminotransferases; InterPro: IPR005958 This entry describes tyrosine aminotransferase (2.6.1.5 from EC) as found in animals and Trypanosoma cruzi (Euglenozoa). It is the first enzyme of a pathway of tyrosine degradation via homogentisate.L-tyrosine + 2-oxoglutarate = 4-hydroxyphenylpyruvate + L-glutamate. Several plant enzyme designated as probable tyrosine aminotransferases are very closely related to an experimentally demonstrated nicotianamine aminotransferase, an enzyme in a siderophore (iron uptake chelator) biosynthesis pathway are included in this family. ; GO: 0008483 transaminase activity, 0030170 pyridoxal phosphate binding, 0006519 amino acid and derivative metabolic process. Probab=27.12 E-value=29 Score=12.24 Aligned_cols=22 Identities=18% Similarity=0.462 Sum_probs=17.4 Q ss_pred HHHHHHHHHHHHHHHHCCCEEE Q ss_conf 1589999999999981898998 Q 537021.9.peg.7 192 YDRYRESQVVQLAYTHELPLVA 213 (1033) Q Consensus 192 ~e~~~~~~l~~lA~~~~iPlVa 213 (1033) +.+++.+.++++|++++||++| T Consensus 201 F~~~HL~k~~~~A~kl~i~iIA 222 (424) T TIGR01265 201 FSRDHLEKIAEVARKLGIPIIA 222 (424) T ss_pred CCHHHHHHHHHHHHHHCCCEEE T ss_conf 7888899999999870982687 No 455 >PRK08928 consensus Probab=27.12 E-value=29 Score=12.24 Aligned_cols=26 Identities=12% Similarity=0.185 Sum_probs=13.1 Q ss_pred CCCCCCCCHHHHHHHHHHHHCEEECC Q ss_conf 66899998889998578872621163 Q 537021.9.peg.7 940 LEKFSVENSSVRFENEQRVLGFYFSG 965 (1033) Q Consensus 940 ~~~~~~~~~~e~l~~E~e~LGf~lS~ 965 (1033) +|++..-+...+-..|+.++-..+-| T Consensus 760 ~p~i~s~n~~~r~~~eR~A~N~piQG 785 (861) T PRK08928 760 IPLINSKKHTLRQFAERAAINAPIQG 785 (861) T ss_pred CCCCCCCCHHHHHHHHHHHHCHHHHH T ss_conf 87555778778858999997512128 No 456 >pfam03811 Ins_element1 Insertion element protein. Probab=27.08 E-value=29 Score=12.24 Aligned_cols=17 Identities=24% Similarity=0.255 Sum_probs=10.7 Q ss_pred HHHHHHHHCCCCHHHHH Q ss_conf 99999975279988988 Q 537021.9.peg.7 457 ALRDVGRALQMPYSQVD 473 (1033) Q Consensus 457 aird~~r~~g~~~~~~~ 473 (1033) -|||+||+|+++...+= T Consensus 65 GiRdtaRvl~Is~nTVl 81 (88) T pfam03811 65 GCRYTARTLKIGINTVI 81 (88) T ss_pred CHHHHHHHHCCCHHHHH T ss_conf 54044787375689999 No 457 >PRK06887 consensus Probab=26.99 E-value=29 Score=12.23 Aligned_cols=38 Identities=18% Similarity=0.242 Sum_probs=24.2 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCEEECCCCHHHHHHHH Q ss_conf 76668999988899985788726211638558999999 Q 537021.9.peg.7 938 ITLEKFSVENSSVRFENEQRVLGFYFSGHPLDVYKPIL 975 (1033) Q Consensus 938 ~~~~~~~~~~~~e~l~~E~e~LGf~lS~HPl~~~~~~l 975 (1033) -.+|++...+...+-..|+.++-..+-|.-=|..|-.+ T Consensus 850 r~~p~i~s~n~~~r~~~eR~aiN~piQGtAADiiK~AM 887 (954) T PRK06887 850 LYLPDINSSNAMRRKGAERVAINAPMQGTAADIIKRAM 887 (954) T ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHH T ss_conf 06876668998899789999837565267999999999 No 458 >TIGR03533 L3_gln_methyl protein-(glutamine-N5) methyltransferase, ribosomal protein L3-specific. Members of this protein family methylate ribosomal protein L3 on a glutamine side chain. This family is related to HemK, a protein-glutamine methyltranferase for peptide chain release factors. Probab=26.84 E-value=30 Score=12.20 Aligned_cols=76 Identities=16% Similarity=0.250 Sum_probs=39.5 Q ss_pred HHHHHHHHHHHHC----CCCCCCCCCCCH---HHHHHHHHHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHH Q ss_conf 7888999998510----476433444420---466776554403756333231056762100145787533365687899 Q 537021.9.peg.7 357 FLIVADFIQWAKK----NNIPVGPGRGSG---AGSVTAYALTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFCQDRRDE 429 (1033) Q Consensus 357 FLiV~Div~~ak~----~gI~vGpGRGSA---agSLVaY~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~ 429 (1033) |-++.|+++||.+ .||..|.|--|+ |-=|+|++||++....+ +.+++=| +...++. T Consensus 1 m~T~~~~L~~a~~~l~~agi~~~~G~~~prldAe~LLa~~L~~~r~~l~---~~~~~~l--------------~~~e~~~ 63 (284) T TIGR03533 1 LHTIRDFLRWAVSRFNAAGLFFGHGTDNAWDEAVYLVLHALHLPLDILE---PFLDARL--------------TPSEKER 63 (284) T ss_pred CCCHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCHHHHH---HHHCCCC--------------CHHHHHH T ss_conf 9088999999999999779977889999999999999999695975677---6324889--------------9999999 Q ss_pred HHHHHHHHC-CCCCEEECCCH Q ss_conf 999998303-80215542207 Q 537021.9.peg.7 430 VIRYVQNKY-GHERVAQIITF 449 (1033) Q Consensus 430 vi~y~~~ky-G~~~va~i~t~ 449 (1033) .-+++.+|- |+.=++.|... T Consensus 64 ~~~lv~RR~~~r~Pl~YI~G~ 84 (284) T TIGR03533 64 ILELIERRIEERIPVAYLTNE 84 (284) T ss_pred HHHHHHHHHHCCCCHHHHHCC T ss_conf 999999998279858998486 No 459 >TIGR00853 pts-lac PTS system, lactose/cellobiose family IIB component; InterPro: IPR014350 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) , is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC . These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose , , , . According to structural and sequence analyses, the PTS EIIB domain (2.7.1.69 from EC) can be divided in five groups , , : The PTS EIIB type 1 domain, which is found in the Glucose class of PTS, has an average length of about 80 amino acids. It forms a split alpha/beta sandwich composed of an antiparallel sheet (beta 1 to beta 4) and three alpha helices superimposed onto one side of the sheet. The phosphorylation site (Cys) is located at the end of the first beta strand on a protrusion formed by the edge of beta 1 and the reverse turn between beta 1 and beta 2 . The PTS EIIB type 2 domain, which is found in the Mannitol class of PTS, has an average length of about 100 amino acids. It consists of a four stranded parallel beta sheet flanked by two alpha helices (alpha 1 and 3) on one face and helix alpha 2 on the opposite face, with a characteristic Rossmann fold comprising two right-handed beta-alpha-beta motifs. The phosphorylation site (Cys) is located at the N-terminus of the domain, in the first beta strand. The PTS EIIB type 3 domain, which is found in the Lactose class of PTS, has an average length of about 100 amino acids. It is composed of a central four-stranded parallel open twisted beta sheet, which is flanked by three alpha helices on the concave side and two on the convex side of the beta sheet. The phosphorylation site (Cys) is located in the C-terminal end of the first beta strand . The PTS EIIB type 4 domain, which is found in the Mannose class of PTS, has an average length of about 160 amino acids. It has a central core of seven parallel beta strands surrounded by a total of six alpha-helices. Three helices cover the front face, one the back face with the remaining two capping the central beta sheet at the top and bottom. The phosphorylation site (His) is located at the suface exposed loop between strand 1 and helix 1 . The PTS EIIB type 5 domain, which is found in the Sorbitol class of PTS, has an average length of about 190 amino acids. The phosphorylation site (Cys) is located in the N-terminus of the domain. The EIIB type 3 protein is often found downstream of EIIC type 3. This entry hits fewer proteins than IPR013012 from INTERPRO. . Probab=26.80 E-value=30 Score=12.20 Aligned_cols=26 Identities=8% Similarity=0.086 Sum_probs=21.6 Q ss_pred HHHHHHHHHHHCCCEEEECCCCCCCH Q ss_conf 99999999981898998088300787 Q 537021.9.peg.7 197 ESQVVQLAYTHELPLVATNNSLFLSE 222 (1033) Q Consensus 197 ~~~l~~lA~~~~iPlVaTndv~Y~~~ 222 (1033) ...+.+..+.+|||+..|+.++|-.= T Consensus 107 ~~~lk~~t~~~Gip~~~i~~~~Yg~~ 132 (142) T TIGR00853 107 LPDLKKETDKKGIPVEVINGAQYGKL 132 (142) T ss_pred HHHHHHHCCCCCCCEEEECCCCCEEE T ss_conf 79999851358975366387002004 No 460 >cd05175 PI3Kc_IA_alpha Phosphoinositide 3-kinase (PI3K), class IA, alpha isoform, catalytic domain; The PI3K catalytic domain family is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. PI3Ks catalyze the transfer of the gamma-phosphoryl group from ATP to the 3-hydroxyl of the inositol ring of D-myo-phosphatidylinositol (PtdIns) or its derivatives. PI3Ks can be divided into three main classes (I, II, and III), defined by their substrate specificity, regulation, and domain structure. Class I PI3Ks are the only enzymes capable of converting PtdIns(4,5)P2 to the critical second messenger PtdIns(3,4,5)P3. Class I enzymes are heterodimers and exist in multiple isoforms consisting of one catalytic subunit (out of four isoforms) and one of several regulatory subunits. They are further classified into class IA (alpha, beta and Probab=26.76 E-value=30 Score=12.21 Aligned_cols=87 Identities=14% Similarity=0.296 Sum_probs=45.3 Q ss_pred HHHHCCCCCCCEEEECCCCHHHHEECCCCCCCCCEEEEEECHHH-HH-H-----CCC------HHHHHHHCCHHHHHHHH Q ss_conf 22100244744033525666662001236768866885102133-43-1-----123------21356541789999999 Q 537021.9.peg.7 520 GLYRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLPVTQFNMKWI-EK-A-----GLI------KFDFLGLKTLTLLQKSL 586 (1033) Q Consensus 520 g~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~~~~-e~-~-----gl~------K~D~Lgl~~L~~i~~~~ 586 (1033) .+|-+.++..+|+++..-. +..+++...+++ |...+. .+ . =.. .=|.|.|-.+++++... T Consensus 54 ~lPl~P~~~~~~i~~~~c~------v~~S~~~PL~l~-f~n~d~~~~~~~~~~~vIfK~GDDLRQD~L~lQli~lmd~i~ 126 (366) T cd05175 54 TSPLNPAHQLGNLRLEECR------IMSSAKRPLWLN-WENPDIMSELLFQNNEIIFKNGDDLRQDMLTLQIIRIMENIW 126 (366) T ss_pred CCCCCCHHEECCEEECCCE------EECCCCCCEEEE-EECCCCCCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHH T ss_conf 9998813256517613450------221036841678-524785444457884289977975409999999999999999 Q ss_pred HHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCC Q ss_conf 98874025434332344358899876201410122114 Q 537021.9.peg.7 587 DFLAQRGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLE 624 (1033) Q Consensus 587 ~~i~~~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~e 624 (1033) +. .| +|+.-.| |+.++.|...|+-|+- T Consensus 127 k~---~g--LDL~l~p------Y~vlaTg~~~G~IE~V 153 (366) T cd05175 127 QN---QG--LDLRMLP------YGCLSIGDCVGLIEVV 153 (366) T ss_pred HH---CC--CCEEEEE------EEEEEECCCCCCEEEC T ss_conf 97---69--9859998------7887605887603535 No 461 >KOG0781 consensus Probab=26.75 E-value=30 Score=12.19 Aligned_cols=110 Identities=18% Similarity=0.153 Sum_probs=56.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCEEEEEHHHHCC Q ss_conf 14788899999999997302666777510023542025899988886368503552334210035319869850268589 Q 537021.9.peg.7 765 HAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVEKIKKFCQDARQFNIQIMPPSVNTPCVDFKVGDNRIYYSLAAIKG 844 (1033) Q Consensus 765 Ha~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~~~~~~i~e~~~~gi~vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikg 844 (1033) -|...|.++|| |--|--...+||+=+...|-.+.+..-++-....+= +...+.-.+ || |. T Consensus 391 KSTNLAKIayW---LlqNkfrVlIAACDTFRsGAvEQLrtHv~rl~~l~~---------~~v~lfekG----Yg----kd 450 (587) T KOG0781 391 KSTNLAKIAYW---LLQNKFRVLIAACDTFRSGAVEQLRTHVERLSALHG---------TMVELFEKG----YG----KD 450 (587) T ss_pred CCCHHHHHHHH---HHHCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHCC---------CHHHHHHHH----CC----CC T ss_conf 51329999999---985783699986243124478999999999987455---------204888610----47----78 Q ss_pred CCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCCCCC Q ss_conf 69899999999983799999789997348247998999988867672347 Q 537021.9.peg.7 845 VGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLVFAGALDCF 894 (1033) Q Consensus 845 vg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li~aGAfD~~ 894 (1033) ...-+-++|.+.|.+|-=-=|.|-.-|+.....=-+.|+.||++.--|-+ T Consensus 451 ~a~vak~AI~~a~~~gfDvvLiDTAGRmhn~~~LM~~L~kl~~~n~pD~i 500 (587) T KOG0781 451 AAGVAKEAIQEARNQGFDVVLIDTAGRMHNNAPLMTSLAKLIKVNKPDLI 500 (587) T ss_pred HHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEE T ss_conf 28999999999986698789983544334780678999999744798659 No 462 >KOG0638 consensus Probab=26.74 E-value=30 Score=12.19 Aligned_cols=14 Identities=21% Similarity=0.214 Sum_probs=10.0 Q ss_pred CCCEEEEEECCHHH Q ss_conf 77539999569689 Q 537021.9.peg.7 95 TVPSIVLLVSTAEG 108 (1033) Q Consensus 95 ~~~~lvlLAkN~~G 108 (1033) ...|+.+.+-|.+- T Consensus 17 ~f~Hi~F~vgna~q 30 (381) T KOG0638 17 RFHHIEFWVGNAKQ 30 (381) T ss_pred EEEEEEEEECCCHH T ss_conf 46679999367178 No 463 >COG0329 DapA Dihydrodipicolinate synthase/N-acetylneuraminate lyase [Amino acid transport and metabolism / Cell envelope biogenesis, outer membrane] Probab=26.71 E-value=30 Score=12.19 Aligned_cols=90 Identities=13% Similarity=0.182 Sum_probs=45.8 Q ss_pred CCCCEEEECCCCCCHHHHHHHCCCHHHHHHHHHHHHHHCCCC--EEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEECCC Q ss_conf 489889981976796788887799789999999998765986--899994399115899999999999818989980883 Q 537021.9.peg.7 140 GTEGLIMLTGGSSGPIDRAFFFNSSQVAEKRLLTFKKLFGDR--LYVNLQRHRGYDRYRESQVVQLAYTHELPLVATNNS 217 (1033) Q Consensus 140 ~~egLIvlsg~~~g~i~~~l~~~~~~~a~~~l~~l~~~Fgd~--~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTndv 217 (1033) ..+||++. |..|+.+.+ ..+|-.+.++..++.-+++ +..++..+...| .-.+.+.|++.|+--+..-.. T Consensus 38 Gv~gi~~~--GttGE~~~L----s~eEr~~v~~~~v~~~~grvpviaG~g~~~t~e---ai~lak~a~~~Gad~il~v~P 108 (299) T COG0329 38 GVDGLVVL--GTTGESPTL----TLEERKEVLEAVVEAVGGRVPVIAGVGSNSTAE---AIELAKHAEKLGADGILVVPP 108 (299) T ss_pred CCCEEEEC--CCCCCCHHC----CHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHH---HHHHHHHHHHCCCCEEEEECC T ss_conf 99889979--866572216----999999999999999689777898628777999---999999999709999998489 Q ss_pred CCCCHHHHHHHHHHHHHHCCC Q ss_conf 007878899998766884698 Q 537021.9.peg.7 218 LFLSEEDYEAHDVLMAVAHST 238 (1033) Q Consensus 218 ~Y~~~~D~~~h~iL~aI~~~~ 238 (1033) +|..+.+..+.+-+.+|...+ T Consensus 109 yY~k~~~~gl~~hf~~ia~a~ 129 (299) T COG0329 109 YYNKPSQEGLYAHFKAIAEAV 129 (299) T ss_pred CCCCCCHHHHHHHHHHHHHHC T ss_conf 788989799999999999851 No 464 >COG3088 CcmH Uncharacterized protein involved in biosynthesis of c-type cytochromes [Posttranslational modification, protein turnover, chaperones] Probab=26.60 E-value=30 Score=12.17 Aligned_cols=12 Identities=17% Similarity=0.163 Sum_probs=4.6 Q ss_pred HHHHHHHHHCCC Q ss_conf 999999981898 Q 537021.9.peg.7 199 QVVQLAYTHELP 210 (1033) Q Consensus 199 ~l~~lA~~~~iP 210 (1033) +...|++++.-| T Consensus 37 ra~~Lt~~LRCp 48 (153) T COG3088 37 RARALTEELRCP 48 (153) T ss_pred HHHHHHHHCCCC T ss_conf 999999863887 No 465 >cd01123 Rad51_DMC1_radA Rad51_DMC1_radA,B. This group of recombinases includes the eukaryotic proteins RAD51, RAD55/57 and the meiosis-specific protein DMC1, and the archaeal proteins radA and radB. They are closely related to the bacterial RecA group. Rad51 proteins catalyze a similiar recombination reaction as RecA, using ATP-dependent DNA binding activity and a DNA-dependent ATPase. However, this reaction is less efficient and requires accessory proteins such as RAD55/57 . Probab=26.59 E-value=30 Score=12.17 Aligned_cols=23 Identities=22% Similarity=0.173 Sum_probs=18.5 Q ss_pred HHHHHHHHHHHCCCEEEECCCCC Q ss_conf 99999999981898998088300 Q 537021.9.peg.7 197 ESQVVQLAYTHELPLVATNNSLF 219 (1033) Q Consensus 197 ~~~l~~lA~~~~iPlVaTndv~Y 219 (1033) ...|..+|+++++++|.||.|.- T Consensus 149 ~~~L~~lA~~~~~aVvvtNqvt~ 171 (235) T cd01123 149 LRTLKRLADEFNVAVVITNQVTA 171 (235) T ss_pred HHHHHHHHHHCCCEEEEECEEEE T ss_conf 99999999980979999680655 No 466 >TIGR02865 spore_II_E stage II sporulation protein E; InterPro: IPR014221 This entry contains the stage II sporulation protein E (SpoIIE), which is a multiple membrane spanning protein with two separable functions. It plays a role in the switch to polar cell division during sporulation and it has phosphatase activity, located in the C-terminal region, which is required to activate sigma-F. All members of this entry are found in endospore-forming Gram-positive bacteria. A homologous protein, not a member of this entry, is found in the Cyanobacterium-like (and presumably non-spore-forming) photosynthesizer Heliobacillus mobilis.. Probab=26.50 E-value=27 Score=12.50 Aligned_cols=415 Identities=16% Similarity=0.187 Sum_probs=178.8 Q ss_pred HHHHHCCCCCCCCCCCCCCC-CCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 33221023455575688776-89889998765410342012464324578389999988778989860630557888999 Q 537021.9.peg.7 285 LQVHAPILPRFIEKECDDIQ-KEEENELRNKAVAGLEVRLAQGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADF 363 (1033) Q Consensus 285 l~~~~~~lP~f~~p~~~~~~-~~~~~~Lr~l~~~Gl~~R~~~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Di 363 (1033) +...-+.+-..+++...... .=+---|--++..|+.-- .++.|++ ++-| ++.||-+.-=+ T Consensus 136 f~~s~~~l~~~~~k~~~~~EEivs~~Il~asvl~G~~gl-------------~i~gR~s--~~~i----~a~~~VL~~sy 196 (794) T TIGR02865 136 FNYSIPCLKNGRTKHLLSNEEIVSLIILIASVLTGLRGL-------------SIWGRLS--LENI----IARLAVLLISY 196 (794) T ss_pred HHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC-------------CCCCHHH--HHHH----HHHHHHHHHHH T ss_conf 998898742898775566430689999999999840577-------------4014121--8889----99999999985 Q ss_pred HHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHC-CCH--------------HHHHHHHCCCCCCCCCCCCCHHHHH Q ss_conf 9985104764334444204667765544037563332-310--------------5676210014578753336568789 Q 537021.9.peg.7 364 IQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRF-SLL--------------FERFLNPDRMSMPDFDIDFCQDRRD 428 (1033) Q Consensus 364 v~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~-~Ll--------------FERFLnp~R~~~PDIDiDf~~~~R~ 428 (1033) + =|+|=|||+|-.+-=.++++.++++.+ +++ |-=||+=- |=+-+|. T Consensus 197 ~---------gGs~~G~A~Gvv~G~i~~l~~~~~~~~~g~~gPPtLsDsqamaYvf~GLLgG~----------Fk~~~K~ 257 (794) T TIGR02865 197 I---------GGSGAGAAIGVVIGVILGLANNANLIQIGVFGPPTLSDSQAMAYVFAGLLGGI----------FKELGKI 257 (794) T ss_pred H---------HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH----------HHHHHHH T ss_conf 4---------23214789999999999864112201331134865203679999999999989----------8752179 Q ss_pred H------HHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCC---CCCCHHHHHCCHHHH Q ss_conf 9------9999983038021554220778778899999997527998898865431014589---824434452298878 Q 537021.9.peg.7 429 E------VIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDPA---HPVSLQAIADDSRFR 499 (1033) Q Consensus 429 ~------vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~~---~~~~~~~~~~~~~~~ 499 (1033) - |-.-+..-|+.-+++...+. |..+.=-++=-+.+|..-.+++.+.+++... .....+.+.+ +.| T Consensus 258 gt~~gylvg~~i~~~Y~~~~~~~~~~l----A~e~l~A~~lF~l~P~~~~~~~~~~l~~~~~aPP~~~~~~~Y~~--k~r 331 (794) T TIGR02865 258 GTAIGYLVGFLILAFYTQGSVAFSLAL----AYEVLIAALLFLLIPNKIYKKLERELDGEKKAPPQKDLQEDYIR--KVR 331 (794) T ss_pred HHHHHHHHHHHHHHHHHCCCHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHH--HHH T ss_conf 999999999999999806513344446----89999999999986899999989757863257784552222555--678 Q ss_pred HHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEE---CCCCCCC------CCEEEEEECHHHHHHCCCH Q ss_conf 864307420268898720112210024474403352566666200---1236768------8668851021334311232 Q 537021.9.peg.7 500 EARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVP---MYRDVRS------DLPVTQFNMKWIEKAGLIK 570 (1033) Q Consensus 500 ~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p---~~~~~~~------~~~~~q~d~~~~e~~gl~K 570 (1033) +...+ ++-+.+.-+.-+=++..+ .--+.+|+.+.+.-- +...=.+ ...-.+|+.+-=+..- T Consensus 332 ~I~~~-----kl~~fS~vl~~ls~tf~~--~~~~~~~e~~~~~~~~~~l~e~l~~~vCq~C~m~~~CW~Re~~~TY~--- 401 (794) T TIGR02865 332 EIAAE-----KLEEFSEVLLELSKTFLE--VEALASNEKLTMKKKSSYLLEKLAERVCQSCNMKHICWKREFYYTYS--- 401 (794) T ss_pred HHHHH-----HHHHHHHHHHHHHHHHHH--HHHHCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHH--- T ss_conf 99999-----999999999998678998--77531883023444489999842243478641122243136367899--- Q ss_pred HHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCC--------------------CCHH---HHHH-HHHCCCCCHHHHCC-- Q ss_conf 13565417899999999887402543433234--------------------4358---8998-76201410122114-- Q 537021.9.peg.7 571 FDFLGLKTLTLLQKSLDFLAQRGVKVDLSLIP--------------------FDDH---ETYH-LLTTKGTLGIFQLE-- 624 (1033) Q Consensus 571 ~D~Lgl~~L~~i~~~~~~i~~~~~~~d~~~ip--------------------~~D~---~~~~-l~~~g~t~GvFQ~e-- 624 (1033) .+.+..+.+++ +.++-...+| +++. ..|+ -++.++-.-==|+. T Consensus 402 ----------am~El~~~~e~-~~~l~~s~~p~~~e~kC~k~~~l~~~~~~~~~~y~~~~~w~~rl~e~r~l~a~Ql~gv 470 (794) T TIGR02865 402 ----------AMEELIENLEE-KKDLPNSKLPDELERKCIKRKELVKTTEEILNNYKINEMWRKRLEEGRELVAEQLKGV 470 (794) T ss_pred ----------HHHHHHHHHHC-CCCCCCCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf ----------99999999614-7676665564588851475899999999998768888999999889899999997519 Q ss_pred CHHHHHHHHHH----CCCHHHHHHHHHHHCCCH-HHHHHHH------------HHHHHCC-CCCCCCCCHHHHHHHCCCC Q ss_conf 53478888771----740488887665411411-3420036------------8885046-5785026966733410335 Q 537021.9.peg.7 625 SSGMRQALEGM----QPDCIEDIIALVSLYRPG-PIDNIVV------------YNNRKNG-KEKIVSIHPLIDPMLKETQ 686 (1033) Q Consensus 625 s~~~~~~l~~~----~p~~~~dl~~~~al~RPG-p~~~~~~------------yi~rk~g-~e~~~~~hp~l~~iL~~T~ 686 (1033) |..|-.+.+.+ +=+.-.+-...-+|..-| |.+-+.. ++++=.| .+-++-+-|.|.+++.+- T Consensus 471 ~~~~~~~~~ei~~~~~f~~~~E~~I~~~L~k~gI~~~~v~~y~~~~g~~~v~~~~~~c~g~~~~~k~I~p~i~~~~g~~- 549 (794) T TIGR02865 471 AESVEDIAKEINKEISFNQLLEEKIIRALNKNGIEVEDVLIYNTEGGRIIVELTIAACGGKGECEKKIAPIISEVTGEL- 549 (794) T ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCEEEEEEEECCCCCEEEEEEECCCCCCCCCCCCCCCHHHHHCCCE- T ss_conf 9999999998630122016679999999844389055798887599825899873378898765102541324511434- Q ss_pred CCCCHHHHHHHHHH-HHHCCCHHHHHHHHHHH--CC--CCCCHHHH--------HHHHHH----HHHHHCC-CCHHHHHH Q ss_conf 53001256898878-86326841307889762--01--21101677--------878999----9986249-99999999 Q 537021.9.peg.7 687 GVIIYQEQVMQIAQ-LLSGYSLSEADVLRRAM--GK--KIKEEMDK--------QKERFI----SGASKNG-ISKTIAVN 748 (1033) Q Consensus 687 Gv~vyQEQvm~ia~-~~ag~~~~ead~lRra~--~k--k~~~~~~~--------~~~~f~----~g~~~~g-~~~~~a~~ 748 (1033) +.|=||+.-.--. -.+.++.+++..++=+. ++ |..+...- --.+|. +|| -+| -......+ T Consensus 550 -~~v~~~~c~~~~~ng~C~~~F~~~~~y~v~t~~~~~~K~Ge~vSGDSY~~~~L~~Gky~~~iSDGM-G~G~~A~~ES~a 627 (794) T TIGR02865 550 -MVVKDERCSIDPKNGRCHLTFEETPKYHVSTGVAKAAKDGELVSGDSYSFGKLSAGKYAVAISDGM-GSGPEAAQESSA 627 (794) T ss_pred -EEEECCCCCCCCCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCCCCEEEEEECCCCEEEEEEECCC-CCHHHHHHHHHH T ss_conf -565223358689987136750578023655565533789994016556776317970799985479-981889999999 Q ss_pred HHHHHHHHHHCCCCCCHHH Q ss_conf 9999997500013310147 Q 537021.9.peg.7 749 IFELLAKFADYGFNKSHAA 767 (1033) Q Consensus 749 i~~~i~~f~~Y~FnksHa~ 767 (1033) +-+++|+|..-|||+.-|+ T Consensus 628 ~~~LlEk~l~~Gf~~~~AI 646 (794) T TIGR02865 628 CVRLLEKFLESGFDREVAI 646 (794) T ss_pred HHHHHHHHHHCCCCHHHHH T ss_conf 9999999997078876899 No 467 >PRK10425 DNase TatD; Provisional Probab=26.49 E-value=30 Score=12.16 Aligned_cols=57 Identities=19% Similarity=0.150 Sum_probs=41.8 Q ss_pred HHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHC-CCCEEEEEE Q ss_conf 331444764432998999999997899889981165177499999999976-996389999 Q 537021.9.peg.7 15 QVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAA-GIQPIIGCQ 74 (1033) Q Consensus 15 h~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~-gIKPIiG~E 74 (1033) |||=.++-. -...++++++|++.|...+-+.=. ++...-+..+.|+++ ++.+.+|+- T Consensus 5 H~HL~~~~f--~~d~~~vl~rA~~~gV~~ii~~g~-~~~~~~~~~~la~~~~~v~~~~GiH 62 (258) T PRK10425 5 GVNLTSSQF--AKDRDDVVARAFAAGVNGLLITGT-NLRESQQAQKLARQYPSCWSTAGVH 62 (258) T ss_pred EECCCCHHH--HCCHHHHHHHHHHCCCCEEEEECC-CHHHHHHHHHHHHHCCCCEEEECCC T ss_conf 758898777--578999999999879998999469-9999999999998688735983358 No 468 >TIGR03674 fen_arch flap structure-specific endonuclease. Endonuclease that cleaves the 5'-overhanging flap structure that is generated by displacement synthesis when DNA polymerase encounters the 5'-end of a downstream Okazaki fragment. Has 5'-endo-/exonuclease and 5'-pseudo-Y-endonuclease activities. Cleaves the junction between single and double-stranded regions of flap DNA Probab=26.44 E-value=30 Score=12.15 Aligned_cols=51 Identities=20% Similarity=0.464 Sum_probs=36.4 Q ss_pred CCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHH Q ss_conf 23105676210014578753336568789999999830380215542207787788999999 Q 537021.9.peg.7 400 FSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDV 461 (1033) Q Consensus 400 ~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~ 461 (1033) |.-.+|=|+||.=+ +++++.+..-..+.+++++.+.+| |+-=+.+.++.-+ T Consensus 273 ~~~~r~lF~~p~v~--d~~~~~~~~pd~e~l~~fL~~e~~---------fse~RV~~~i~kl 323 (338) T TIGR03674 273 YDEIREFFLNPPVT--DDYELKWRKPDKEGIIEFLCDEHD---------FSEDRVERALERL 323 (338) T ss_pred HHHHHHHHCCCCCC--CCCCCCCCCCCHHHHHHHHHHCCC---------CCHHHHHHHHHHH T ss_conf 99999984899989--877776799998999999553559---------7999999999999 No 469 >cd00288 Pyruvate_Kinase Pyruvate kinase (PK): Large allosteric enzyme that regulates glycolysis through binding of the substrate, phosphoenolpyruvate, and one or more allosteric effectors. Like other allosteric enzymes, PK has a high substrate affinity R state and a low affinity T state. PK exists as several different isozymes, depending on organism and tissue type. In mammals, there are four PK isozymes: R, found in red blood cells, L, found in liver, M1, found in skeletal muscle, and M2, found in kidney, adipose tissue, and lung. PK forms a homotetramer, with each subunit containing three domains. The T state to R state transition of PK is more complex than in most allosteric enzymes, involving a concerted rotation of all 3 domains of each monomer in the homotetramer. Probab=26.34 E-value=30 Score=12.14 Aligned_cols=39 Identities=10% Similarity=-0.072 Sum_probs=22.7 Q ss_pred HHHHHHHHHCCCCEEEE-ECCCHHHHHHHH---H-HHHHHCCCC Q ss_conf 99999999789988998-116517749999---9-999976996 Q 537021.9.peg.7 30 NNILDKIAADQQPAIAI-TDTNNLFSALEF---S-QKACAAGIQ 68 (1033) Q Consensus 30 e~LV~~A~~~G~~AiAI-TD~~nl~Gav~F---~-~ackk~gIK 68 (1033) .+.++...+.||...=| .-|++.--+.+- . +++++.|.. T Consensus 17 ~~~l~~li~aG~~v~RiN~SHg~~e~~~~~i~~iR~~~~~~~~~ 60 (480) T cd00288 17 VENLKKLIKAGMNVARMNFSHGSHEYHQSRIDNVREAAEKTGGP 60 (480) T ss_pred HHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHCCCC T ss_conf 99999999879999999899999999999999999999971998 No 470 >PRK00257 erythronate-4-phosphate dehydrogenase; Validated Probab=26.28 E-value=30 Score=12.13 Aligned_cols=27 Identities=22% Similarity=0.335 Sum_probs=20.2 Q ss_pred HHHHCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 985104764334444204667765544 Q 537021.9.peg.7 365 QWAKKNNIPVGPGRGSGAGSVTAYALT 391 (1033) Q Consensus 365 ~~ak~~gI~vGpGRGSAagSLVaY~Lg 391 (1033) .||+++||.+.-.-|.-|.|.+-|+++ T Consensus 75 ~~l~~~gI~v~naPG~Na~sVaEyvl~ 101 (379) T PRK00257 75 DYFAEAGITWSNAPGCNARGVVDYVLG 101 (379) T ss_pred HHHHHCCCEEEECCCCCHHHHHHHHHH T ss_conf 999869978996998777999999999 No 471 >PRK07084 fructose-bisphosphate aldolase; Provisional Probab=26.28 E-value=30 Score=12.13 Aligned_cols=49 Identities=16% Similarity=0.146 Sum_probs=40.3 Q ss_pred HCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 32998999999997899889981165177499999999976996389999 Q 537021.9.peg.7 25 GALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQ 74 (1033) Q Consensus 25 g~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E 74 (1033) +..+..++++.|++.|| ||+=-+-.|+-.+--...+|++.+-.-|+++. T Consensus 8 ~LV~~k~lL~~A~~~~y-AV~AfNv~~~e~~~Avi~AAee~~sPvIlq~s 56 (321) T PRK07084 8 GLVNTRELFAKAVKGGY-AIPAYNFNNLEQLQAIIMACVETKSPVILQVS 56 (321) T ss_pred ECCCHHHHHHHHHHCCC-EEEEEEECCHHHHHHHHHHHHHHCCCEEEEEC T ss_conf 10329999999998893-79987779999999999999997999899927 No 472 >COG4008 Predicted metal-binding transcription factor [Transcription] Probab=26.24 E-value=22 Score=13.25 Aligned_cols=35 Identities=34% Similarity=0.494 Sum_probs=28.0 Q ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCH Q ss_conf 85104764334444204667765544037563332310 Q 537021.9.peg.7 366 WAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLL 403 (1033) Q Consensus 366 ~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~Ll 403 (1033) |||. -++|-|-+.--|||| ||-.||.-.|++-.-| T Consensus 52 Fak~--T~Lg~Ge~TCFGSLV-WCCKitKPCplRD~~l 86 (153) T COG4008 52 FAKR--TMLGYGENTCFGSLV-WCCKITKPCPLRDGVL 86 (153) T ss_pred HHHC--CCCCCCCCCCCCEEE-EEECCCCCCCCCHHHH T ss_conf 8503--634467776000024-6623789866207788 No 473 >PRK07562 ribonucleotide-diphosphate reductase subunit alpha; Validated Probab=26.19 E-value=30 Score=12.12 Aligned_cols=14 Identities=43% Similarity=0.515 Sum_probs=5.9 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 67787899999862 Q 537021.9.peg.7 725 MDKQKERFISGASK 738 (1033) Q Consensus 725 ~~~~~~~f~~g~~~ 738 (1033) |.+.-+.|++||+- T Consensus 868 MqAA~QpfidgAIS 881 (1177) T PRK07562 868 MMAAAQPFISGAIS 881 (1177) T ss_pred HHHHHHHHHCCCCC T ss_conf 99997776068762 No 474 >pfam08901 DUF1847 Protein of unknown function (DUF1847). This family of proteins are functionally uncharacterized. THey contain 4 N-terminal cysteines that may form a zinc binding domain. Probab=26.17 E-value=30 Score=12.12 Aligned_cols=22 Identities=14% Similarity=0.069 Sum_probs=11.8 Q ss_pred CCHHHHHHHHHHCCCCEEEEEC Q ss_conf 9989999999978998899811 Q 537021.9.peg.7 27 LSLNNILDKIAADQQPAIAITD 48 (1033) Q Consensus 27 ~~~e~LV~~A~~~G~~AiAITD 48 (1033) .++++.++.|+++|++-++|+= T Consensus 41 tRvEEii~FAk~mgykkiGiAf 62 (157) T pfam08901 41 TRVEEIIEFAKRMGYKKIGIAF 62 (157) T ss_pred HHHHHHHHHHHHCCCCEEEEHH T ss_conf 6599999999985997553022 No 475 >cd01313 Met_dep_hydrolase_E Metallo-dependent hydrolases, subgroup D is part of the superfamily of metallo-dependent hydrolases, a large group of proteins that show conservation in their 3-dimensional fold (TIM barrel) and in details of their active site. The vast majority of the members have a conserved metal binding site, involving four histidines and one aspartic acid residue. In the common reaction mechanism, the metal ion (or ions) deprotonate a water molecule for a nucleophilic attack on the substrate. The function of this subgroup is unknown. Probab=26.15 E-value=30 Score=12.11 Aligned_cols=62 Identities=16% Similarity=0.100 Sum_probs=36.0 Q ss_pred CCCHHHHHHCHHHHHHHCCC----HHHHHHHHHHCCCC-EEEEECCCHHHHHHHHHHHHHHCCCCEE Q ss_conf 88133433144476443299----89999999978998-8998116517749999999997699638 Q 537021.9.peg.7 9 PSFVHLQVHSSYSLLEGALS----LNNILDKIAADQQP-AIAITDTNNLFSALEFSQKACAAGIQPI 70 (1033) Q Consensus 9 ~~FvhLh~hS~ySlldg~~~----~e~LV~~A~~~G~~-AiAITD~~nl~Gav~F~~ackk~gIKPI 70 (1033) |-||..|+|...|+++|... +.+......+.-++ +-.+|....-.++......+.+.|+.-+ T Consensus 42 PGfVn~H~H~~q~~~rG~~~~~~~~~~~~~~w~~~~~~~~~~lt~e~~~~~a~~~~~E~L~sG~Ttv 108 (418) T cd01313 42 PGMPNLHSHAFQRAMAGLTEYRGSAADSFWTWRELMYRFAARLTPEQIEAIARQLYIEMLLAGITAV 108 (418) T ss_pred ECCHHHHHCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCEEE T ss_conf 7865377586405366220336763103568998775888847999999999999999997696599 No 476 >KOG2214 consensus Probab=26.06 E-value=31 Score=12.10 Aligned_cols=34 Identities=26% Similarity=0.341 Sum_probs=22.8 Q ss_pred CCCCHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHCC Q ss_conf 444204667765544037563332310567621001 Q 537021.9.peg.7 377 GRGSGAGSVTAYALTITDIDPLRFSLLFERFLNPDR 412 (1033) Q Consensus 377 GRGSAagSLVaY~LgIT~VDPi~~~LlFERFLnp~R 412 (1033) =-||+||++||=+++.-.=.-+ .-||+-|++..= T Consensus 206 IsGsS~GaivAsl~~v~~~eEl--~~Ll~~~~~~~~ 239 (543) T KOG2214 206 ISGSSAGAIVASLVGVRSNEEL--KQLLTNFLHSLF 239 (543) T ss_pred HCCCCHHHHHHHHHHHCCHHHH--HHHHCCCHHHHH T ss_conf 2377536788888752666889--988532157654 No 477 >cd04736 MDH_FMN Mandelate dehydrogenase (MDH)-like FMN-binding domain. MDH is part of a widespread family of homologous FMN-dependent a-hydroxy acid oxidizing enzymes that oxidizes (S)-mandelate to phenylglyoxalate. MDH is an enzyme in the mandelate pathway that occurs in several strains of Pseudomonas which converts (R)-mandelate to benzoate. This family occurs in both prokaryotes and eukaryotes. Members of this family include flavocytochrome b2 (FCB2), glycolate oxidase (GOX), lactate monooxygenase (LMO), mandelate dehydrogenase (MDH), and long chain hydroxyacid oxidase (LCHAO). Probab=26.05 E-value=31 Score=12.10 Aligned_cols=22 Identities=18% Similarity=0.240 Sum_probs=14.1 Q ss_pred HHHHHHHHHCCCEEEECCCCCC Q ss_conf 9999999818989980883007 Q 537021.9.peg.7 199 QVVQLAYTHELPLVATNNSLFL 220 (1033) Q Consensus 199 ~l~~lA~~~~iPlVaTndv~Y~ 220 (1033) .+.+-|.+.|+|.+.++-..+. T Consensus 85 a~AraA~~~gi~~~lSt~ss~s 106 (361) T cd04736 85 ALARAAAKAGIPFVLSTASNMS 106 (361) T ss_pred HHHHHHHHCCCCEEECCCCCCC T ss_conf 9999999879878967999999 No 478 >PRK06450 threonine synthase; Validated Probab=25.94 E-value=31 Score=12.08 Aligned_cols=38 Identities=21% Similarity=0.231 Sum_probs=0.0 Q ss_pred HHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEE Q ss_conf 9999997899889981165177499999999976996389 Q 537021.9.peg.7 32 ILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPII 71 (1033) Q Consensus 32 LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIi 71 (1033) .|.+|++.|.+.|+..=.||...++-++ |...||+.+| T Consensus 86 ~v~~a~~~G~~~vv~aSsGN~g~slAay--aa~~Gi~~~I 123 (336) T PRK06450 86 LISYLADRGIKEISEDSSGNAGASIAAY--GAAAGIKVKI 123 (336) T ss_pred HHHHHHHHCCCEEEEECCHHHHHHHHHH--HHHCCCCEEE T ss_conf 9999998098845763851999999999--9984996899 No 479 >cd07311 terB_like_1 tellurium resistance terB-like protein, subgroup 1. This family includes several uncharacterized bacterial proteins. The prototype of this CD is tellurite resistance protein from Nostoc punctiforme that belongs to COG3793. Its precise biological function and its mechanism responsible for tellurium resistance still remains rather poorly understood. Probab=25.92 E-value=31 Score=12.08 Aligned_cols=23 Identities=26% Similarity=0.359 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 78899999997527998898865 Q 537021.9.peg.7 453 QAKAALRDVGRALQMPYSQVDRL 475 (1033) Q Consensus 453 ~~k~aird~~r~~g~~~~~~~~~ 475 (1033) +-+.+|++.++.||++.+++.++ T Consensus 108 ~E~~~i~~~a~~LGI~ed~V~~l 130 (150) T cd07311 108 GEVAAVRKAASLLGISEDEVQKL 130 (150) T ss_pred HHHHHHHHHHHHHCCCHHHHHHH T ss_conf 89999999999929979999999 No 480 >cd06564 GH20_DspB_LnbB-like Glycosyl hydrolase family 20 (GH20) catalytic domain of dispersin B (DspB), lacto-N-biosidase (LnbB) and related proteins. Dispersin B is a soluble beta-N-acetylglucosamidase found in bacteria that hydrolyzes the beta-1,6-linkages of PGA (poly-beta-(1,6)-N-acetylglucosamine), a major component of the extracellular polysaccharide matrix. Lacto-N-biosidase hydrolyzes lacto-N-biose (LNB) type I oligosaccharides at the nonreducing terminus to produce lacto-N-biose as part of the GNB/LNB (galacto-N-biose/lacto-N-biose I) degradation pathway. The lacto-N-biosidase from Bifidobacterium bifidum has this GH20 domain, a carbohydrate binding module 32, and a bacterial immunoglobulin-like domain 2, as well as a YSIRK signal peptide and a G5 membrane anchor at the N and C termini, respectively. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by solvent or the enzyme, but by the substrate itself. Probab=25.92 E-value=25 Score=12.76 Aligned_cols=73 Identities=12% Similarity=0.110 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHCCCCCCH------------HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCC Q ss_conf 99999999975000133101------------478889999999999730266677751002354202589998888636 Q 537021.9.peg.7 746 AVNIFELLAKFADYGFNKSH------------AAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVEKIKKFCQDARQFN 813 (1033) Q Consensus 746 a~~i~~~i~~f~~Y~FnksH------------a~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~~~~~~i~e~~~~g 813 (1033) .+.|-+.|...|-|.+|+=| -.++.....+.+......|.....+.-....-..+.+..+++-|+.+| T Consensus 16 ~~~ik~~ID~Ma~~KlN~lH~HLtDdqgw~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~YT~~di~eiv~YA~~rg 95 (326) T cd06564 16 MDFLKDIIKTMSWYKMNDLQLHLNDNLIFNLDDMSTTVNNATYASDDVKSGNNYYNLTANDGYYTKEEFKELIAYAKDRG 95 (326) T ss_pred HHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCC T ss_conf 99999999999980998788888679854225666333233333445566655678778887068999999999999849 Q ss_pred CCCCC Q ss_conf 85035 Q 537021.9.peg.7 814 IQIMP 818 (1033) Q Consensus 814 i~vl~ 818 (1033) |+|+| T Consensus 96 I~VIP 100 (326) T cd06564 96 VNIIP 100 (326) T ss_pred CEEEE T ss_conf 98986 No 481 >pfam04076 BOF Bacterial OB fold (BOF) protein. Proteins in this family form an OB-fold. Analysis of the predicted binding site of BOF family proteins implies that they lack nucleic acid-binding properties. They contain an predicted N-terminal signal peptide which indicates that they localize in the periplasm where they may function to bind proteins, small molecules, or other typical OB-fold ligands. As hypothesized for the distantly related OB-fold containing bacterial enterotoxins, the loss of nucleotide-binding function and the rapid evolution of the BOF ligand-binding site may be associated with the presence of BOF proteins in mobile genetic elements and their potential role in bacterial pathogenicity. Probab=25.88 E-value=31 Score=12.08 Aligned_cols=45 Identities=7% Similarity=0.138 Sum_probs=0.0 Q ss_pred CCCCHHHHHCCCCCCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEE Q ss_conf 89768898505589889999999611210158887699999984970499995 Q 537021.9.peg.7 979 GIKGYEESVATIRGNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANF 1031 (1033) Q Consensus 979 ~~~~~~~~~~~~~g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~i 1031 (1033) ...+..+..+..+++.|.+-|.|+ +...++.| .|.|.+|.+.+-| T Consensus 44 ~~~TV~~a~~~~Dd~~V~L~G~Iv-----~~l~~d~Y---~F~D~TG~I~VeI 88 (126) T pfam04076 44 TKTTVKQAKSAADDAWVSLEGNIV-----RQIGDDEY---EFRDASGTIKVDI 88 (126) T ss_pred CEEHHHHHHHCCCCCCEEEEEEEE-----EEECCCEE---EEECCCCCEEEEE T ss_conf 488399886587999399988999-----68489538---9978996399996 No 482 >TIGR00707 argD acetylornithine and succinylornithine aminotransferases; InterPro: IPR004636 This family of proteins, for which ornithine aminotransferases form an outgroup, consists mostly of proteins designated acetylornithine aminotransferase. However, the two very closely related members from Escherichia coli are assigned different enzymatic activities. One is acetylornithine aminotransferase (2.6.1.11 from EC), ArgD, an enzyme of arginine biosynthesis, while another is succinylornithine aminotransferase, an enzyme of the arginine succinyltransferase pathway, an ammonia-generating pathway of arginine catabolism . Members of this family may also act on ornithine, like ornithine aminotransferase (2.6.1.13 from EC) and on succinyldiaminopimelate, like N-succinyldiaminopmelate-aminotransferase (2.6.1.17 from EC, DapC, an enzyme of lysine biosynthesis) . ; GO: 0008483 transaminase activity, 0006525 arginine metabolic process. Probab=25.70 E-value=29 Score=12.34 Aligned_cols=60 Identities=17% Similarity=0.237 Sum_probs=0.0 Q ss_pred CCHHHHHHHHHHHHHH---CCCCCCCCCCCCH-HHHHHHHHCCC-----CCHHH----HCCC---HHHHHHHHHHC Q ss_conf 1789999999988740---2543433234435-88998762014-----10122----1145---34788887717 Q 537021.9.peg.7 577 KTLTLLQKSLDFLAQR---GVKVDLSLIPFDD-HETYHLLTTKG-----TLGIF----QLES---SGMRQALEGMQ 636 (1033) Q Consensus 577 ~~L~~i~~~~~~i~~~---~~~~d~~~ip~~D-~~~~~l~~~g~-----t~GvF----Q~es---~~~~~~l~~~~ 636 (1033) |||+.|+-|-+-=-++ -..-+...+|+|| +...+-.=..+ |.+|| |=|| |+-...|+.++ T Consensus 137 RT~g~LsaTgqpKy~~~F~PLvpgf~y~~yNDe~~~l~~~i~~~~eg~rtAAvivEpiQGEgGV~pa~~~fL~a~r 212 (402) T TIGR00707 137 RTMGALSATGQPKYQKPFEPLVPGFKYVPYNDEIEALKEAIDDEKEGDRTAAVIVEPIQGEGGVVPADKEFLKALR 212 (402) T ss_pred CHHHHHHCCCCCCCCCCCCCCCCCCEEECCCCCHHHHHHHHHCCCCCCCEEEEEEEEECCCCCCCCCCHHHHHHHH T ss_conf 0045430047764568578842381244488978999998502577871889998532178653348778999999 No 483 >cd05169 PIKKc_TOR TOR (Target of rapamycin), catalytic domain; The TOR catalytic domain subfamily is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. TOR is a member of the phosphoinositide 3-kinase-related protein kinase (PIKK) subfamily. PIKKs have intrinsic serine/threonine kinase activity and are distinguished from other PKs by their unique catalytic domain, similar to that of lipid PI3K, and their large molecular weight (240-470 kDa). TOR contains a rapamycin binding domain, a catalytic domain, and a FATC (FRAP, ATM and TRRAP, C-terminal) domain at the C-terminus. It is also called FRAP (FK506 binding protein 12-rapamycin associated protein). TOR is a central component of the eukaryotic growth regulatory network. It controls the expression of many genes transcribed by all three RNA polymerases. It associates with Probab=25.63 E-value=27 Score=12.58 Aligned_cols=14 Identities=29% Similarity=0.423 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHCCC Q ss_conf 04667765544037 Q 537021.9.peg.7 381 GAGSVTAYALTITD 394 (1033) Q Consensus 381 AagSLVaY~LgIT~ 394 (1033) |+.|+|+|+|||-+ T Consensus 174 A~~SvvgYIlGlGD 187 (280) T cd05169 174 AVMSMVGYILGLGD 187 (280) T ss_pred HHHHHHHHEEECCC T ss_conf 99999988013168 No 484 >PRK02101 hypothetical protein; Validated Probab=25.63 E-value=31 Score=12.04 Aligned_cols=64 Identities=17% Similarity=0.185 Sum_probs=0.0 Q ss_pred CCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 33553001256898878863268413078897620121101677878999998624999999999999999750001331 Q 537021.9.peg.7 684 ETQGVIIYQEQVMQIAQLLSGYSLSEADVLRRAMGKKIKEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNK 763 (1033) Q Consensus 684 ~T~Gv~vyQEQvm~ia~~~ag~~~~ead~lRra~~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~Fnk 763 (1033) +....+.|+|+.-+++..+..+|..| |.+.|+ ++++.|++-|+.++.| .-.|+. T Consensus 20 ~~~s~P~f~~~t~~l~~~Lk~~s~~e---l~~lm~----------------------is~~lA~~n~~r~q~~-~~~~~~ 73 (257) T PRK02101 20 DRLTQPELLDQSQALIDELRQLSPDQ---LASLMK----------------------ISDKLAGLNAQRFQDW-QPDFTT 73 (257) T ss_pred CCCCCCCHHHHHHHHHHHHHHCCHHH---HHHHHC----------------------CCHHHHHHHHHHHHHH-CCCCCC T ss_conf 66787672799999999998379999---999978----------------------7999999999999983-545676 Q ss_pred CHHHHHHHHHH Q ss_conf 01478889999 Q 537021.9.peg.7 764 SHAAAYAVISY 774 (1033) Q Consensus 764 sHa~aya~~~y 774 (1033) +++.. |+.+| T Consensus 74 ~~~~p-Ai~~y 83 (257) T PRK02101 74 ANARP-AILAF 83 (257) T ss_pred CHHHH-HHHHC T ss_conf 33566-99982 No 485 >TIGR00542 hxl6Piso_put hexulose-6-phosphate isomerase, putative; InterPro: IPR004560 This family shows similarity to other isomerases. Putative identification as hexulose-6-phosphate isomerase has been reported. This family is conserved at better than 40 0dentity among the four known examples from three species: Escherichia coli (SgbU and SgaU), Haemophilus influenzae, and Mycoplasma pneumoniae. The rarity of the family, high level of conservation, and proposed catabolic role suggests lateral transfer may be a part of the evolutionary history of this protein.; GO: 0016861 intramolecular oxidoreductase activity interconverting aldoses and ketoses, 0005975 carbohydrate metabolic process. Probab=25.63 E-value=31 Score=12.04 Aligned_cols=30 Identities=13% Similarity=0.220 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHCCCEE--EECCCCCCCHHH Q ss_conf 999999999998189899--808830078788 Q 537021.9.peg.7 195 YRESQVVQLAYTHELPLV--ATNNSLFLSEED 224 (1033) Q Consensus 195 ~~~~~l~~lA~~~~iPlV--aTndv~Y~~~~D 224 (1033) .+..+.+.||+++||.+| |.-||||-..+. T Consensus 94 eIm~KAi~La~~lGIR~IQLAgYDVYYe~~d~ 125 (290) T TIGR00542 94 EIMEKAIQLARDLGIRIIQLAGYDVYYEEHDE 125 (290) T ss_pred HHHHHHHHHHHHCCCEEEEECCCCCCCCCCCC T ss_conf 99999999997579606622333003355760 No 486 >COG2055 Malate/L-lactate dehydrogenases [Energy production and conversion] Probab=25.61 E-value=31 Score=12.04 Aligned_cols=63 Identities=16% Similarity=0.307 Sum_probs=0.0 Q ss_pred CCCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEE Q ss_conf 988133433144476443299899999999789988998116517749999999997699638 Q 537021.9.peg.7 8 SPSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPI 70 (1033) Q Consensus 8 ~~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPI 70 (1033) .|+.+-++-+..|.-.-.-.-.+.++++|++.|.-++|+-+-+....+--|...|-..|.--| T Consensus 76 ~~a~~~~Da~~g~G~~a~~~am~~aie~Ak~~Gia~vav~ns~H~G~~g~y~~~aA~~GlIgi 138 (349) T COG2055 76 APAVAVLDADGGFGQVAAKKAMELAIEKAKQHGIAAVAVRNSNHFGALGYYAEQAAEAGLIGI 138 (349) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHCEEEEEEECCCCCCHHHHHHHHHHHCCCEEE T ss_conf 570899847777324789999999999999709069998437761137999999997897699 No 487 >TIGR01803 CM-like chorismate mutase related enzymes; InterPro: IPR008241 Isochorismate pyruvate-lyase (IPL; PchB) catalyses the second reaction in the pyochelin biosynthetic pathway of Pseudomonas aeruginosa, conversion of isochorismate to salicylate plus pyruvate (following the initial PchA-dependent conversion of chorismate to isochorismate) . This enzyme can also carry out the chorismate mutase (CM) reaction, but with a low catalytic efficiency. It is unlikely that PchB plays a significant role in aromatic amino acid biosynthesis. This enzyme is a stand-alone version of a chorismate mutase domain of the AroQ class. The three types of CM are AroQ class, Prokaryotic type (e.g., IPR008239 from INTERPRO and others); AroQ class, Eukaryotic type (IPR008238 from INTERPRO); and AroH class (IPR008243 from INTERPRO). They fall into two structural folds (AroQ class and AroH class) which are completely unrelated . The two types of the AroQ structural class (the Escherichia coli CM dimer and the yeast CM monomer) can be structurally superimposed, and the topology of the four-helix bundle forming the active site is conserved . The PchB-type of chorismate mutase domain, while sharing conserved residues and the predicted secondary structure with the other subgroups of the AroQ class (Prokaryotic type), has isochorismate pyruvate-lyase (IPL) as a primary catalytic activity. PchB can still use the same active site either for the IPL or for the CM reaction. It has been suggested that PchB was derived from an AroQ-class CM by a gene duplication event followed by selection for efficient IPL function in the course of the evolution of the pyochelin siderophore pathway, with only residual CM activity remaining. It can be further speculated that contemporary CMs may already possess (weak) IPL activity . For additional information please see , , .. Probab=25.59 E-value=31 Score=12.04 Aligned_cols=32 Identities=16% Similarity=0.266 Sum_probs=0.0 Q ss_pred CCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 10167787899999862499999999999999 Q 537021.9.peg.7 722 KEEMDKQKERFISGASKNGISKTIAVNIFELL 753 (1033) Q Consensus 722 ~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i 753 (1033) |+-+.++++.=---|.++|+++...+.+|.+| T Consensus 43 P~RvA~v~~naaR~A~e~GlDp~F~e~l~~~i 74 (82) T TIGR01803 43 PERVAEVKENAARYAAELGLDPDFLETLYDQI 74 (82) T ss_pred CHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 34689862877766786389988999999999 No 488 >TIGR02275 DHB_AMP_lig 2,3-dihydroxybenzoate-AMP ligase; InterPro: IPR011963 Siderophores are low molecular weight iron-chelating compounds synthesised by many bacteria to aid in the aquisition of this vital trace element . Proteins in this entry are adenylation components of non-ribosomal peptide synthases (NRPSs) involved in the biosynthesis of siderophores. These proteins belong to the AMP-binding family and are mostly thought to activate 2,3-dihydroxybenzoate (DHB) by ligation of AMP from ATP with the release of pyrophosphate. One of these proteins, however, is believed to differ in function, activating salicylate rather than DHB. The crystal structure of 2,3-dihydroxybenzoate-AMP ligase (P40871 from SWISSPROT) from Bacillus subtilis has been examined . This protein is composed of a large N-terminal domain (~420 aa) and a more compact C-terminal domain (~110 aa), with an overall "hammer-and-anvil" fold similar to that of firefly luciferase . The active site is located in a deep compartment located at the interface of the domains, with a p-loop thought to be involved in catalysis located at the entrance to the cavity. Relatively little conformational change was observed during catalysis.; GO: 0008668 (23-dihydroxybenzoyl)adenylate synthase activity, 0019290 siderophore biosynthetic process. Probab=25.52 E-value=26 Score=12.66 Aligned_cols=48 Identities=13% Similarity=0.183 Sum_probs=0.0 Q ss_pred HHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHHHHHHHHH Q ss_conf 997324877874007899986348988998197679678888779978999999999 Q 537021.9.peg.7 118 RMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVAEKRLLTF 174 (1033) Q Consensus 118 ~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a~~~l~~l 174 (1033) .++..++||.. -++....+||-|++.| ..-.-|.+|+++.|.++++.+ T Consensus 408 ~~Fd~~GFY~t-----GD~v~~t~EGYi~V~G----R~KDQINRGGEKiAAEEiEN~ 455 (534) T TIGR02275 408 KAFDAEGFYYT-----GDLVKLTPEGYIVVVG----RAKDQINRGGEKIAAEEIENL 455 (534) T ss_pred HHHCCCCCEEC-----CCEEEECCCCCEEEEE----CCCCCCCCCCCEEHHHHHHHH T ss_conf 97447896441-----5167886886178860----001440577410057779867 No 489 >pfam09420 Nop16 Ribosome biogenesis protein Nop16. Nop16 is a protein involved in ribosome biogenesis. Probab=25.42 E-value=27 Score=12.52 Aligned_cols=22 Identities=23% Similarity=0.320 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHCCCCCEEE Q ss_conf 6878999999983038021554 Q 537021.9.peg.7 424 QDRRDEVIRYVQNKYGHERVAQ 445 (1033) Q Consensus 424 ~~~R~~vi~y~~~kyG~~~va~ 445 (1033) ..+-.+.++|+.+|||.|+.|+ T Consensus 127 s~~e~~~v~~Lv~KhGdDy~aM 148 (173) T pfam09420 127 SKREEEWVERLVEKHGDDYKAM 148 (173) T ss_pred CHHHHHHHHHHHHHHCCCHHHH T ss_conf 8779999999999965309998 No 490 >PTZ00075 S-adenosyl-L-homocysteine hydrolase; Provisional Probab=25.28 E-value=16 Score=14.39 Aligned_cols=23 Identities=26% Similarity=0.492 Sum_probs=0.0 Q ss_pred CCCCHHHH-HHHHHHHCCCCCHHH Q ss_conf 44420466-776554403756333 Q 537021.9.peg.7 377 GRGSGAGS-VTAYALTITDIDPLR 399 (1033) Q Consensus 377 GRGSAagS-LVaY~LgIT~VDPi~ 399 (1033) |||+|..- -.-.-.-||+||||+ T Consensus 266 GkG~A~~~rg~GA~ViVtEiDPi~ 289 (476) T PTZ00075 266 GKGCAQALRGFGARVVVTEIDPIC 289 (476) T ss_pred CHHHHHHHCCCCCEEEEECCCCHH T ss_conf 121476653589789996168078 No 491 >cd01303 GDEase Guanine deaminase (GDEase). Guanine deaminase is an aminohydrolase responsible for the conversion of guanine to xanthine and ammonia, the first step to utilize guanine as a nitrogen source. This reaction also removes the guanine base from the pool and therefore can play a role in the regulation of cellular GTP and the guanylate nucleotide pool. Probab=25.27 E-value=31 Score=11.99 Aligned_cols=64 Identities=19% Similarity=0.240 Sum_probs=0.0 Q ss_pred CCCHHHHHHCHHHHHHHCCC----------------------------HHHHHHHHHHCCCCEEEEECCCHHHHHHHHHH Q ss_conf 88133433144476443299----------------------------89999999978998899811651774999999 Q 537021.9.peg.7 9 PSFVHLQVHSSYSLLEGALS----------------------------LNNILDKIAADQQPAIAITDTNNLFSALEFSQ 60 (1033) Q Consensus 9 ~~FvhLh~hS~ySlldg~~~----------------------------~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ 60 (1033) |-||..|+|...+.+.|... ....+..+.+.|..++.---....-..-...+ T Consensus 65 PGlVd~HtH~~~~~~rg~~~d~~l~~wl~~~~~p~~~~~~~~~~~~~~~~~~~~e~l~~G~TTv~~~~~~~~~~~~~~~~ 144 (429) T cd01303 65 PGFIDTHIHAPQYANIGSGLGEPLLDWLETYTFPEEAKFADPAYAREVYGRFLDELLRNGTTTACYFATIHPESTEALFE 144 (429) T ss_pred ECCHHHHHCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHH T ss_conf 78340876886360674158995799987567998886289999999999999999867916999715778689999999 Q ss_pred HHHHCCCCEEEE Q ss_conf 999769963899 Q 537021.9.peg.7 61 KACAAGIQPIIG 72 (1033) Q Consensus 61 ackk~gIKPIiG 72 (1033) ++.+.||+.++| T Consensus 145 a~~~~GiR~~~g 156 (429) T cd01303 145 EAAKRGQRAIAG 156 (429) T ss_pred HHHHCCCEEEEE T ss_conf 999829959998 No 492 >TIGR01063 gyrA DNA gyrase, A subunit; InterPro: IPR005743 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis , . DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA . Type II topoisomerases are ATP-dependent enzymes, and can be subdivided according to their structure and reaction mechanisms: type IIA (topoisomerase II or gyrase, and topoisomerase IV) and type IIB (topoisomerase VI). These enzymes are responsible for relaxing supercoiled DNA as well as for introducing both negative and positive supercoils . Topoisomerase II (called gyrase in bacteria) primarily introduces negative supercoils into DNA. In bacteria, topoisomerase II consists of two polypeptide subunits, gyrA and gyrB, which form a heterotetramer: (BA)2. In most eukaryotes, topoisomerase II consists of a single polypeptide, where the N- and C-terminal regions correspond to gyrB and gyrA, respectively. This entry represents the A subunit (gyrA) as found predominantly in bacteria, but does not include the topoisomerase II enzymes composed of a single polypeptide, as are found in most eukaryotes. GyrA has two functional domains: an N-terminal that forms the covalent DNA-protein bridge that is responsible for the breaking- and rejoining function, and a C-terminal that can bind DNA non-specifically . More information about this protein can be found at Protein of the Month: DNA Topoisomerase .; GO: 0003677 DNA binding, 0003918 DNA topoisomerase (ATP-hydrolyzing) activity, 0005524 ATP binding, 0006265 DNA topological change, 0006268 DNA unwinding during replication, 0005694 chromosome. Probab=25.24 E-value=31 Score=11.99 Aligned_cols=56 Identities=21% Similarity=0.262 Sum_probs=0.0 Q ss_pred HHHHCCCCCHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHC------ Q ss_conf 87620141012211453478888771740488887665411411342003688850465785026966733410------ Q 537021.9.peg.7 610 HLLTTKGTLGIFQLESSGMRQALEGMQPDCIEDIIALVSLYRPGPIDNIVVYNNRKNGKEKIVSIHPLIDPMLK------ 683 (1033) Q Consensus 610 ~l~~~g~t~GvFQ~es~~~~~~l~~~~p~~~~dl~~~~al~RPGp~~~~~~yi~rk~g~e~~~~~hp~l~~iL~------ 683 (1033) .||++|= |.=...|...+-|+++.+|++.+.+ -+..+.=.++++|+ T Consensus 158 nLLvNGS--------sGIAVGMATNIPPHNL~Eiida~~~--------------------~i~Np~~~i~eLl~~i~GPD 209 (864) T TIGR01063 158 NLLVNGS--------SGIAVGMATNIPPHNLGEIIDAILA--------------------YIDNPDISIEELLEVIKGPD 209 (864) T ss_pred HHHCCCC--------CCCCCCCCCCCCCCCHHHHHHHHHH--------------------HHCCCCCCHHHHHCCCCCCC T ss_conf 4561324--------7600022257887336899999999--------------------87189988679956077113 Q ss_pred -CCCCCCCHHH Q ss_conf -3355300125 Q 537021.9.peg.7 684 -ETQGVIIYQE 693 (1033) Q Consensus 684 -~T~Gv~vyQE 693 (1033) ||-|+|+|.+ T Consensus 210 FPTgg~I~G~~ 220 (864) T TIGR01063 210 FPTGGIILGRS 220 (864) T ss_pred CCHHHHHCCCC T ss_conf 71056430500 No 493 >PRK05826 pyruvate kinase; Provisional Probab=25.23 E-value=31 Score=11.99 Aligned_cols=174 Identities=17% Similarity=0.108 Sum_probs=0.0 Q ss_pred HHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEEECCHHHH Q ss_conf 99999999789988998116517749999999997699638999999854766433445656666775399995696899 Q 537021.9.peg.7 30 NNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLLVSTAEGY 109 (1033) Q Consensus 30 e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlLAkN~~Gy 109 (1033) .+-++.|.++|...+|+.=-.+-.-+-+.-+.+++.|-+.+. ++.=-.+.+|+ T Consensus 176 ~~di~fa~~~~vD~ialSFVrsa~DV~~lr~~l~~~~~~~~~---------------------------IIaKIE~~~al 228 (461) T PRK05826 176 KADIKFAAEQGVDYIAVSFVRNAEDVEEARRLLREAGNCDAK---------------------------IIAKIERAEAV 228 (461) T ss_pred HHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHCCCCCCE---------------------------EEEEECCHHHH T ss_conf 999999997699999746889867899999999972999844---------------------------89983688998 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHHHHHHHHHHHHCCCCEEEEEECC Q ss_conf 99999998997324877874007899986348988998197679678888779978999999999876598689999439 Q 537021.9.peg.7 110 QRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVAEKRLLTFKKLFGDRLYVNLQRH 189 (1033) Q Consensus 110 ~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a~~~l~~l~~~Fgd~~ylEl~~~ 189 (1033) +||-.++. .+||+++==| ++=+|+... T Consensus 229 ~NldeIi~----------------------~sDgIMIARG-------------------------------DLgvEi~~e 255 (461) T PRK05826 229 DNLDEIIE----------------------ASDGIMVARG-------------------------------DLGVEIPDA 255 (461) T ss_pred HHHHHHHH----------------------HCCEEEEECC-------------------------------CCCCCCCHH T ss_conf 73999998----------------------6589999778-------------------------------530326876 Q ss_pred CHHHHHHHHHHHHHHHHHCCCEEEECC-----CCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHC Q ss_conf 911589999999999981898998088-----300787889999876688469857653100245343234347887512 Q 537021.9.peg.7 190 RGYDRYRESQVVQLAYTHELPLVATNN-----SLFLSEEDYEAHDVLMAVAHSTVVSQKDRPRVTPDHYLKNRSEMVSIF 264 (1033) Q Consensus 190 ~~~e~~~~~~l~~lA~~~~iPlVaTnd-----v~Y~~~~D~~~h~iL~aI~~~~~l~~~~~~~~~~~~yLks~eEM~~~f 264 (1033) ..+ ...+.+++.+.+.+.|++.... ++-..|..++.-|+-.||-.|+ +.-+.|.|--.-.| T Consensus 256 ~vp--~~Qk~Ii~~c~~~gkpvivATqmLeSM~~~p~PTRAEv~DVanAv~dG~------------DavmLS~ETA~G~y 321 (461) T PRK05826 256 EVP--GLQKKIIRKAREAGKPVITATQMLESMIENPRPTRAEVSDVANAVLDGT------------DAVMLSGETAAGKY 321 (461) T ss_pred HHH--HHHHHHHHHHHHCCCEEEEECCCHHHHCCCCCCCHHHHHHHHHHHHHCC------------CEEEECHHHCCCCC T ss_conf 749--9999999988754998999765689760799997277878999998368------------78997432237878 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 20358998788641110001332210234555756 Q 537021.9.peg.7 265 SDLPEALENTVEIARRCSFILQVHAPILPRFIEKE 299 (1033) Q Consensus 265 ~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~ 299 (1033) +- ++++-...|+...+-.+.......+.-..+. T Consensus 322 Pv--~aV~~m~~I~~~aE~~~~~~~~~~~~~~~~~ 354 (461) T PRK05826 322 PV--EAVEAMARICKGAEKEFLNVSKHRLDSTFFD 354 (461) T ss_pred HH--HHHHHHHHHHHHHHHCHHHHCCHHCCCCCCC T ss_conf 89--9999999999999723233100000124589 No 494 >pfam08667 BetR BetR domain. This family includes an N-terminal helix-turn-helix domain. Probab=25.19 E-value=26 Score=12.66 Aligned_cols=42 Identities=24% Similarity=0.381 Sum_probs=0.0 Q ss_pred CCEEECCCHHHHHHHHHHHHHHHHCCCCHHHH-HHHHHHCCCC Q ss_conf 21554220778778899999997527998898-8654310145 Q 537021.9.peg.7 441 ERVAQIITFGSLQAKAALRDVGRALQMPYSQV-DRLCKLIPND 482 (1033) Q Consensus 441 ~~va~i~t~~t~~~k~aird~~r~~g~~~~~~-~~~~~~i~~~ 482 (1033) ++|+...|-.....|+-...++|+||+++... .++...+||. T Consensus 8 ~rvr~lL~r~GI~kRqh~s~is~iL~LSfS~a~RKLkG~spW~ 50 (146) T pfam08667 8 ERVRKLLTRTGIAERQHNTTIANILGLSFSQAYRKLKGSIPWN 50 (146) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHCCCHHHHHHHHCCCCCCC T ss_conf 9999999863997567788999997874898878735899744 No 495 >cd05177 PI3Kc_C2_gamma Phosphoinositide 3-kinase (PI3K), class II, gamma isoform, catalytic domain; The PI3K catalytic domain family is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. PI3Ks catalyze the transfer of the gamma-phosphoryl group from ATP to the 3-hydroxyl of the inositol ring of D-myo-phosphatidylinositol (PtdIns) or its derivatives. PI3Ks play an important role in a variety of fundamental cellular processes, including cell motility, the Ras pathway, vesicle trafficking and secretion, immune cell activation and apoptosis. They can be divided into three main classes (I, II, and III), defined by their substrate specificity, regulation, and domain structure. Class II PI3Ks preferentially use PtdIns as a substrate to produce PtdIns(3)P, but can also phosphorylate PtdIns(4)P. They function as monomers and do n Probab=25.17 E-value=32 Score=11.98 Aligned_cols=14 Identities=21% Similarity=0.373 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHCCC Q ss_conf 04667765544037 Q 537021.9.peg.7 381 GAGSVTAYALTITD 394 (1033) Q Consensus 381 AagSLVaY~LgIT~ 394 (1033) ||=|+++|+|||-+ T Consensus 196 AgYcV~TYiLgIgD 209 (354) T cd05177 196 AGWCVVTFILGVCD 209 (354) T ss_pred HHHHHHHHHHCCCC T ss_conf 99999998711466 No 496 >cd07204 Pat_PNPLA_like Patatin-like phospholipase domain containing protein family. Members of this family share a patain domain, initially discovered in potato tubers. PNPLA protein members show non-specific hydrolase activity with a variety of substrates such as triacylglycerol, phospholipids, and retinylesters. It contains the lipase consensus sequence (Gly-X-Ser-X-Gly). Nomenclature of PNPLA family could be misleading as some of the mammalian members of this family show hydrolase, but no phospholipase activity. Probab=25.14 E-value=32 Score=11.97 Aligned_cols=33 Identities=18% Similarity=0.349 Sum_probs=0.0 Q ss_pred CCHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHCC Q ss_conf 4204667765544037563332310567621001 Q 537021.9.peg.7 379 GSGAGSVTAYALTITDIDPLRFSLLFERFLNPDR 412 (1033) Q Consensus 379 GSAagSLVaY~LgIT~VDPi~~~LlFERFLnp~R 412 (1033) |+.||||+| ++-.+.+++-+.-=.+.++....| T Consensus 37 GaSAGAl~a-a~~~~g~~~~~~~~~~~~~~~~~~ 69 (243) T cd07204 37 GASAGAIVA-AVVLCGVSMEEACSFILKVVSEAR 69 (243) T ss_pred EEHHHHHHH-HHHHHCCCHHHHHHHHHHHHHHHH T ss_conf 985889999-999928999999999999999998 No 497 >PRK05377 fructose-1,6-bisphosphate aldolase; Reviewed Probab=25.13 E-value=32 Score=11.97 Aligned_cols=98 Identities=19% Similarity=0.232 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHH--HHHHCCCCCCC Q ss_conf 1774999999999769963899999985476643344565666677539999569689999999998--99732487787 Q 537021.9.peg.7 51 NLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLLVSTAEGYQRLIELVS--RMYLLDQGKQS 128 (1033) Q Consensus 51 nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S--~a~~~~~~~~~ 128 (1033) |+....+.-+.|+++|+-||+==|+.+.-.+...... --...=+.+|.+|.+ ...++.--... T Consensus 156 ~v~qq~~yA~~iq~~GLVPIVEPEV~id~~~k~e~E~---------------~l~~~il~~L~~l~~d~~VmLKlTlPe~ 220 (296) T PRK05377 156 VVAQQFEVAKQILAAGLVPIIEPEVDINSPDKAEAEA---------------LLKAEILKQLDSLPEDQQVMLKLTIPTE 220 (296) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHH---------------HHHHHHHHHHHHCCCCCCEEEECCCCCC T ss_conf 9999999999999879832106653568822678999---------------9999999999735788717986247651 Q ss_pred CCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCC Q ss_conf 40078999863489889981976796788887799 Q 537021.9.peg.7 129 VRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNS 163 (1033) Q Consensus 129 p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~ 163 (1033) |-...+.....+--.+++|||+.+-+-....+..+ T Consensus 221 ~~~y~~l~~hp~V~rVVaLSGGys~~eA~~~L~~n 255 (296) T PRK05377 221 ANLYKELIDHPKVLRVVALSGGYSRDEANELLARN 255 (296) T ss_pred CCHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHCC T ss_conf 00156633587743587735887879999998439 No 498 >cd06449 ACCD Aminocyclopropane-1-carboxylate deaminase (ACCD): Pyridoxal phosphate (PLP)-dependent enzyme which catalyzes the conversion of 1-aminocyclopropane-L-carboxylate (ACC), a precursor of the plant hormone ethylene, to alpha-ketobutyrate and ammonia. Probab=25.11 E-value=32 Score=11.97 Aligned_cols=111 Identities=14% Similarity=0.102 Sum_probs=0.0 Q ss_pred HCCHHHHHHHHHHHHHHHCCC-----CCCCCCCCCHHHHHHHHH------HHCCCCCHHHC-CCHHHHHHHHCCCCCCCC Q ss_conf 063055788899999851047-----643344442046677655------44037563332-310567621001457875 Q 537021.9.peg.7 351 MKFSGYFLIVADFIQWAKKNN-----IPVGPGRGSGAGSVTAYA------LTITDIDPLRF-SLLFERFLNPDRMSMPDF 418 (1033) Q Consensus 351 ~gf~~YFLiV~Div~~ak~~g-----I~vGpGRGSAagSLVaY~------LgIT~VDPi~~-~LlFERFLnp~R~~~PDI 418 (1033) ++..+|--.+.|+..++++.+ |.+..|=|.-+.-|++++ -.|..|++... .-...+-++.....+.++ T Consensus 152 ~~~~Gy~~~~~Ei~~q~~~~~~~~d~iv~~~GtGgT~aGL~~g~~~~~~~~~viGv~v~~~~~~~~~~~~~~~~~~~~~~ 231 (307) T cd06449 152 LGGLGYVGFVLEIAQQEEELGFKFDSIVVCSVTGSTHAGLSVGLAALGRQRRVIGIDASAKPEKTKAQVLRIAQAKLAEE 231 (307) T ss_pred CCCHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCHHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHH T ss_conf 75407999999999999964999886999348755699999999974999977999836857889999999999999971 Q ss_pred CCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHH Q ss_conf 3336568789999999830380215542207787788999999975279988 Q 537021.9.peg.7 419 DIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYS 470 (1033) Q Consensus 419 DiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~ 470 (1033) .++.+...-...-+|....|| ..+=....|++++++.-|+..+ T Consensus 232 ~~~~~~~~~~~~~~~~g~gYg---------~~~~e~~~ai~~~a~~eGI~lD 274 (307) T cd06449 232 GLEVKEEDVVLDDDYAAPEYG---------IPNDETIEAIKLCARLEGIITD 274 (307) T ss_pred CCCCCHHHEEEECCCCCCCCC---------CCCHHHHHHHHHHHHHHCCCCC T ss_conf 668987883420256388427---------4799999999999998291237 No 499 >PRK08517 DNA polymerase III subunit epsilon; Provisional Probab=25.09 E-value=32 Score=11.97 Aligned_cols=115 Identities=16% Similarity=0.182 Sum_probs=0.0 Q ss_pred CCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCC-----CCC Q ss_conf 343478875122035899878864111000133221023455575688776898899987654103420124-----643 Q 537021.9.peg.7 254 LKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRLAQ-----GAI 328 (1033) Q Consensus 254 Lks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~~-----~~~ 328 (1033) ++......+..+++-+-+.++.-+|...+|++.+....+-+.-.+.-.+...+.-. |.+.++.+.+.++.. +.. T Consensus 130 v~~aP~~~evl~~f~~Fi~d~ilVaHNa~FD~~FL~~~l~r~g~~~l~n~~lDTl~-LAR~~~p~~~ykL~~La~~~~I~ 208 (259) T PRK08517 130 LENAPSLKEVLEDFRLFLGDSVFVAHNVNFDYNFISRSLEEIGLGPLLNRKLCTID-LAKRTIESPRYGLSFLKELLGIE 208 (259) T ss_pred HHCCCCHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHCCCCCCCCCEEEEHH-HHHHHCCCCCCCHHHHHHHCCCC T ss_conf 82799889999999999688959993727609999999998399976898566089-99987587787889999975989 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHC Q ss_conf 24578389999988778989860630557888999998510 Q 537021.9.peg.7 329 AKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKK 369 (1033) Q Consensus 329 ~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~ 369 (1033) ....++-..-.+.-.|+=.+.=..++-|.-++.|+++|+|. T Consensus 209 ~~~~HRAl~DA~aTaevF~~~L~~l~~~v~t~~dli~fsk~ 249 (259) T PRK08517 209 TEVHHRAYADALAAYEIFEICLLNLPSYIKTTEDLIDFSKT 249 (259) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH T ss_conf 99997845889999999999998566556649999998765 No 500 >cd00740 MeTr MeTr subgroup of pterin binding enzymes. This family includes cobalamin-dependent methyltransferases such as methyltetrahydrofolate, corrinoid iron-sulfur protein methyltransferase (MeTr) and methionine synthase (MetH). Cobalamin-dependent methyltransferases catalyze the transfer of a methyl group via a methyl- cob(III)amide intermediate. These include MeTr, a functional heterodimer, and the folate binding domain of MetH. Probab=24.98 E-value=32 Score=11.95 Aligned_cols=49 Identities=12% Similarity=0.173 Sum_probs=0.0 Q ss_pred HHHHHCCCHHHHHHHHHHCCCCEEEEE-CCCHHHH--------HHHHHHHHHH-CCCCE Q ss_conf 764432998999999997899889981-1651774--------9999999997-69963 Q 537021.9.peg.7 21 SLLEGALSLNNILDKIAADQQPAIAIT-DTNNLFS--------ALEFSQKACA-AGIQP 69 (1033) Q Consensus 21 Slldg~~~~e~LV~~A~~~G~~AiAIT-D~~nl~G--------av~F~~ackk-~gIKP 69 (1033) |+-+|.-+.+.++..|++.|-.-|||| |...+.- +-+.+..+.+ +||.| T Consensus 100 s~e~g~er~~~i~pLakkyga~vI~L~~de~Gip~t~e~R~~ia~~i~~~~~~~~Gi~~ 158 (252) T cd00740 100 NLEDGEERFLKVARLAKEHGAAVVVLAFDEQGQAKTRDKKVEIAERAYEALTEFVGFPP 158 (252) T ss_pred CCCCCHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCCH T ss_conf 34454889999999998709989999528999989999999999999999998569988 Done!