Query gi|254781004|ref|YP_003065417.1| threonyl-tRNA synthetase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 652 No_of_seqs 297 out of 2612 Neff 6.8 Searched_HMMs 39220 Date Mon May 30 04:17:25 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781004.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK12444 threonyl-tRNA synthet 100.0 0 0 1706.1 58.5 632 1-646 1-636 (639) 2 PRK04483 threonyl-tRNA synthet 100.0 0 0 1699.0 60.1 630 4-646 1-633 (634) 3 PRK03772 threonyl-tRNA synthet 100.0 0 0 1696.9 56.5 636 4-652 1-639 (642) 4 PRK00413 thrS threonyl-tRNA sy 100.0 0 0 1691.6 58.0 631 6-648 1-633 (639) 5 PRK12304 thrS threonyl-tRNA sy 100.0 0 0 1595.4 54.4 598 34-648 1-602 (604) 6 PRK12305 thrS threonyl-tRNA sy 100.0 0 0 1485.0 47.6 557 70-644 4-574 (576) 7 TIGR00418 thrS threonyl-tRNA s 100.0 0 0 1445.2 40.3 561 74-645 1-595 (595) 8 COG0441 ThrS Threonyl-tRNA syn 100.0 0 0 1289.1 43.0 583 37-650 1-585 (589) 9 KOG1637 consensus 100.0 0 0 1142.1 24.9 546 6-643 2-553 (560) 10 PRK03991 threonyl-tRNA synthet 100.0 0 0 846.8 29.0 539 54-646 34-593 (613) 11 cd00771 ThrRS_core Threonyl-tR 100.0 0 0 781.7 21.5 296 245-552 1-298 (298) 12 PRK12325 prolyl-tRNA synthetas 100.0 0 0 753.4 26.3 366 256-642 30-437 (438) 13 PRK09194 prolyl-tRNA synthetas 100.0 0 0 652.5 21.9 392 240-643 17-566 (570) 14 TIGR00409 proS_fam_II prolyl-t 100.0 0 0 624.9 15.2 378 257-643 32-619 (620) 15 PRK08661 prolyl-tRNA synthetas 100.0 0 0 547.1 26.8 337 262-643 34-384 (478) 16 COG0442 ProS Prolyl-tRNA synth 100.0 0 0 519.9 15.2 376 257-642 31-499 (500) 17 cd00778 ProRS_core_arch_euk Pr 100.0 0 0 506.8 14.2 245 252-540 12-260 (261) 18 TIGR00408 proS_fam_I prolyl-tR 100.0 0 0 468.4 20.2 359 234-644 12-413 (533) 19 KOG2324 consensus 100.0 0 0 466.6 17.8 379 254-643 33-428 (457) 20 cd00779 ProRS_core_prok Prolyl 100.0 0 0 432.2 13.1 250 241-541 2-255 (255) 21 cd00772 ProRS_core Prolyl-tRNA 100.0 0 0 424.3 12.4 241 261-541 20-264 (264) 22 PRK04173 glycyl-tRNA synthetas 100.0 0 0 368.6 22.6 346 262-645 29-457 (460) 23 KOG4163 consensus 100.0 0 0 368.8 14.4 371 217-644 54-447 (551) 24 PRK00037 hisS histidyl-tRNA sy 100.0 0 0 353.0 22.9 357 262-644 7-415 (417) 25 TIGR00389 glyS_dimeric glycyl- 100.0 0 0 359.8 17.1 379 263-644 27-605 (606) 26 cd00770 SerRS_core Seryl-tRNA 100.0 0 0 348.6 13.6 269 240-553 15-296 (297) 27 COG0423 GRS1 Glycyl-tRNA synth 100.0 0 0 339.5 18.5 376 262-645 29-551 (558) 28 cd00670 Gly_His_Pro_Ser_Thr_tR 100.0 2E-40 5.6E-45 298.8 14.5 232 272-539 1-235 (235) 29 cd00768 class_II_aaRS-like_cor 100.0 2.9E-38 7.4E-43 283.9 11.0 209 275-534 1-211 (211) 30 KOG2298 consensus 100.0 8.6E-33 2.2E-37 246.1 16.2 382 255-644 22-591 (599) 31 pfam00587 tRNA-synt_2b tRNA sy 100.0 2.6E-34 6.5E-39 256.6 7.4 162 275-443 1-168 (170) 32 cd00858 GlyRS_anticodon GlyRS 99.9 3.7E-27 9.5E-32 207.2 13.5 99 545-644 19-119 (121) 33 CHL00201 syh histidine-tRNA sy 99.9 8.9E-25 2.3E-29 190.8 25.3 362 262-646 7-422 (424) 34 COG0124 HisS Histidyl-tRNA syn 99.9 5E-24 1.3E-28 185.6 24.0 365 262-644 7-427 (429) 35 PRK12420 histidyl-tRNA synthet 99.9 5.9E-22 1.5E-26 171.3 22.5 350 261-635 6-420 (421) 36 cd00862 ProRS_anticodon_zinc P 99.9 3.6E-23 9.3E-28 179.7 12.7 101 543-643 1-108 (202) 37 cd00861 ProRS_anticodon_short 99.9 4.3E-23 1.1E-27 179.2 11.9 91 552-642 1-94 (94) 38 cd00860 ThrRS_anticodon ThrRS 99.9 7.5E-23 1.9E-27 177.5 12.1 91 552-642 1-91 (91) 39 PRK05431 seryl-tRNA synthetase 99.9 1.6E-22 4E-27 175.3 12.9 266 242-553 135-413 (422) 40 cd00738 HGTP_anticodon HGTP an 99.9 1.8E-22 4.5E-27 175.0 12.2 90 552-641 1-93 (94) 41 COG0172 SerS Seryl-tRNA synthe 99.8 2.2E-20 5.6E-25 160.5 13.1 284 211-553 118-418 (429) 42 cd02426 Pol_gamma_b_Cterm C-te 99.8 6.6E-19 1.7E-23 150.3 11.5 100 545-645 20-126 (128) 43 pfam03129 HGTP_anticodon Antic 99.8 9.4E-19 2.4E-23 149.2 11.6 90 554-643 1-92 (93) 44 cd00774 GlyRS-like_core Glycyl 99.8 3.8E-19 9.8E-24 151.9 8.2 154 262-431 21-178 (254) 45 cd00859 HisRS_anticodon HisRS 99.7 1.4E-15 3.5E-20 127.4 11.5 90 553-642 2-91 (91) 46 PRK12292 hisZ ATP phosphoribos 99.6 3.2E-13 8.1E-18 111.1 20.6 320 261-609 5-383 (388) 47 COG0442 ProS Prolyl-tRNA synth 99.6 4.2E-15 1.1E-19 124.0 9.0 240 399-641 109-384 (500) 48 KOG1936 consensus 99.6 7E-13 1.8E-17 108.7 19.9 390 224-644 26-514 (518) 49 PRK00252 alaS alanyl-tRNA synt 99.6 4.9E-14 1.3E-18 116.7 13.6 135 72-225 550-689 (864) 50 PRK13902 alaS lanyl-tRNA synth 99.6 9.2E-14 2.3E-18 114.8 12.7 148 73-239 591-742 (898) 51 COG0013 AlaS Alanyl-tRNA synth 99.6 2E-13 5E-18 112.5 14.3 162 69-255 560-726 (879) 52 KOG2509 consensus 99.5 2.8E-14 7.2E-19 118.4 8.5 294 210-560 131-441 (455) 53 cd01667 TGS_ThrRS_N TGS _ThrRS 99.5 2.3E-14 5.8E-19 119.0 7.5 60 5-64 1-61 (61) 54 PRK01584 alanyl-tRNA synthetas 99.5 6.4E-14 1.6E-18 115.9 9.7 132 75-224 456-591 (593) 55 TIGR00442 hisS histidyl-tRNA s 99.5 8.2E-12 2.1E-16 101.3 17.0 366 261-640 6-445 (446) 56 PRK00960 seryl-tRNA synthetase 99.3 2.4E-11 6.2E-16 98.1 8.8 249 261-549 210-498 (516) 57 TIGR00414 serS seryl-tRNA synt 99.2 1.3E-10 3.4E-15 93.0 9.6 283 226-553 147-459 (460) 58 PRK12421 ATP phosphoribosyltra 99.0 8.2E-08 2.1E-12 73.8 17.5 152 263-430 11-166 (391) 59 KOG0188 consensus 99.0 7.6E-09 1.9E-13 80.9 11.1 184 71-276 566-761 (895) 60 COG2872 Predicted metal-depend 99.0 1.7E-09 4.4E-14 85.3 7.4 138 71-224 95-234 (241) 61 pfam02824 TGS TGS domain. The 99.0 1.9E-09 4.8E-14 85.1 6.6 59 5-63 1-60 (60) 62 cd00773 HisRS-like_core Class 99.0 6.4E-09 1.6E-13 81.4 9.3 144 273-430 2-147 (261) 63 PRK12293 hisZ ATP phosphoribos 98.9 1.1E-08 2.9E-13 79.7 9.3 139 259-423 5-144 (281) 64 cd01616 TGS The TGS domain, na 98.9 6.5E-09 1.7E-13 81.3 6.9 59 5-63 1-60 (60) 65 PRK12295 hisZ ATP phosphoribos 98.8 1.3E-08 3.3E-13 79.3 7.8 135 276-430 7-145 (373) 66 smart00863 tRNA_SAD Threonyl a 98.5 2.1E-08 5.4E-13 77.8 0.1 43 171-221 1-44 (44) 67 pfam07973 tRNA_SAD Threonyl an 98.5 3.7E-08 9.4E-13 76.1 0.8 43 171-221 1-44 (44) 68 PRK12820 bifunctional aspartyl 98.5 4.6E-07 1.2E-11 68.6 6.3 116 272-411 155-273 (706) 69 TIGR00459 aspS_bact aspartyl-t 98.4 9.9E-07 2.5E-11 66.3 7.4 125 272-420 150-282 (653) 70 PRK09537 pylS pyrolysyl-tRNA s 98.4 2.8E-07 7.2E-12 70.0 4.3 102 276-389 209-311 (420) 71 COG0173 AspS Aspartyl-tRNA syn 98.4 7.9E-07 2E-11 67.0 6.6 116 272-418 140-262 (585) 72 PRK00484 lysS lysyl-tRNA synth 98.4 1.5E-06 3.8E-11 65.1 8.0 116 273-411 171-288 (491) 73 PRK09350 lysyl-tRNA synthetase 98.4 1.6E-06 4E-11 64.9 7.9 126 267-411 6-138 (325) 74 TIGR00415 serS_MJ seryl-tRNA s 98.3 4.9E-06 1.2E-10 61.5 8.5 164 257-421 206-397 (520) 75 cd01668 TGS_RelA_SpoT TGS_RelA 98.2 2.6E-06 6.5E-11 63.4 5.5 58 6-63 2-60 (60) 76 PRK05159 aspC aspartyl-tRNA sy 98.2 1.3E-05 3.4E-10 58.5 8.8 114 272-411 136-253 (434) 77 PRK10872 relA GDP/GTP pyrophos 98.2 4E-06 1E-10 62.1 5.8 25 169-193 177-212 (743) 78 cd00775 LysRS_core Lys_tRNA sy 98.2 6.2E-06 1.6E-10 60.8 6.6 115 273-411 8-125 (329) 79 PRK03932 asnC asparaginyl-tRNA 98.2 8.1E-06 2.1E-10 60.0 7.1 125 273-410 135-268 (462) 80 PRK11092 bifunctional (p)ppGpp 98.1 6.2E-06 1.6E-10 60.8 6.0 119 178-305 181-317 (702) 81 KOG2411 consensus 98.1 5.3E-06 1.3E-10 61.3 5.5 110 270-408 175-293 (628) 82 pfam00152 tRNA-synt_2 tRNA syn 98.1 1.8E-05 4.5E-10 57.6 8.2 118 272-413 21-141 (341) 83 PRK07080 hypothetical protein; 98.1 0.00011 2.7E-09 52.3 11.8 246 261-549 36-310 (318) 84 PRK02983 lysS lysyl-tRNA synth 98.1 2E-05 5.1E-10 57.3 7.8 115 273-410 776-892 (1099) 85 PRK00476 aspS aspartyl-tRNA sy 98.1 7.3E-06 1.9E-10 60.3 5.1 115 273-411 142-259 (587) 86 COG0317 SpoT Guanosine polypho 98.0 2.1E-05 5.3E-10 57.1 7.2 12 569-580 642-653 (701) 87 PRK12445 lysyl-tRNA synthetase 98.0 1E-05 2.6E-10 59.2 5.5 116 272-411 183-301 (505) 88 TIGR00344 alaS alanyl-tRNA syn 97.9 3.3E-05 8.4E-10 55.8 6.7 141 58-205 583-730 (914) 89 cd00776 AsxRS_core Asx tRNA sy 97.9 4.4E-05 1.1E-09 54.9 7.3 114 272-410 23-139 (322) 90 cd00669 Asp_Lys_Asn_RS_core As 97.9 6.1E-05 1.6E-09 53.9 7.7 115 273-411 1-118 (269) 91 COG3705 HisZ ATP phosphoribosy 97.9 4.7E-05 1.2E-09 54.7 6.8 152 260-428 4-158 (390) 92 cd00777 AspRS_core Asp tRNA sy 97.9 4.2E-05 1.1E-09 55.0 6.4 117 273-412 1-119 (280) 93 PRK06462 asparagine synthetase 97.8 0.00014 3.5E-09 51.5 7.8 146 230-411 1-149 (332) 94 COG2269 Truncated, possibly in 97.7 0.00033 8.5E-09 48.8 8.6 133 263-413 2-140 (322) 95 TIGR02367 PylS pyrrolysyl-tRNA 97.6 3.1E-05 8E-10 55.9 2.0 124 274-409 240-367 (453) 96 TIGR00691 spoT_relA RelA/SpoT 97.6 0.00019 4.8E-09 50.5 5.9 72 3-74 380-454 (741) 97 PRK07440 hypothetical protein; 97.2 0.0018 4.6E-08 43.8 7.1 61 1-64 1-66 (70) 98 COG1190 LysU Lysyl-tRNA synthe 96.8 0.0085 2.2E-07 39.1 8.0 131 250-417 165-296 (502) 99 PRK04172 pheS phenylalanyl-tRN 96.7 0.0075 1.9E-07 39.5 7.1 142 272-430 243-425 (501) 100 COG0017 AsnS Aspartyl/asparagi 96.7 0.015 3.8E-07 37.4 8.6 112 273-409 134-247 (435) 101 PTZ00326 phenylalanyl-tRNA syn 96.6 0.02 5.1E-07 36.6 8.5 141 273-429 241-430 (505) 102 KOG1885 consensus 96.2 0.025 6.4E-07 35.9 7.2 97 273-392 225-322 (560) 103 KOG2105 consensus 96.1 0.018 4.5E-07 36.9 6.1 145 50-206 86-234 (415) 104 COG0016 PheS Phenylalanyl-tRNA 96.0 0.15 3.9E-06 30.4 11.0 158 242-421 78-246 (335) 105 COG2104 ThiS Sulfur transfer p 95.9 0.036 9.2E-07 34.8 6.7 58 4-64 2-64 (68) 106 PRK06253 O-phosphoseryl-tRNA s 95.6 0.029 7.4E-07 35.5 5.4 69 355-430 202-270 (527) 107 TIGR00457 asnS asparaginyl-tRN 95.5 0.098 2.5E-06 31.8 7.7 255 273-582 146-442 (495) 108 cd00496 PheRS_alpha_core Pheny 95.5 0.14 3.6E-06 30.8 8.5 121 277-419 4-132 (218) 109 PRK05659 sulfur carrier protei 95.4 0.062 1.6E-06 33.2 6.3 57 5-64 1-62 (66) 110 cd01669 TGS_Ygr210_C TGS_Ygr21 95.3 0.017 4.4E-07 37.0 3.4 53 9-63 21-76 (76) 111 PRK07696 sulfur carrier protei 95.2 0.08 2E-06 32.4 6.4 57 5-64 1-63 (67) 112 TIGR00462 genX lysyl-tRNA synt 95.0 0.16 4E-06 30.4 7.4 119 275-411 23-143 (330) 113 PRK06083 sulfur carrier protei 94.9 0.11 2.7E-06 31.5 6.5 60 1-64 1-65 (69) 114 PRK07569 bidirectional hydroge 94.7 0.28 7.2E-06 28.6 8.1 79 1-83 1-95 (236) 115 PRK11840 bifunctional sulfur c 94.3 0.13 3.4E-06 30.9 5.7 58 5-65 1-63 (327) 116 PRK05863 sulfur carrier protei 94.2 0.15 3.9E-06 30.5 5.8 57 5-64 1-61 (65) 117 PRK06944 sulfur carrier protei 94.0 0.19 4.9E-06 29.8 6.1 56 5-64 1-61 (65) 118 TIGR00471 pheT_arch phenylalan 93.8 0.068 1.7E-06 32.9 3.5 115 258-387 373-500 (605) 119 PRK08053 sulfur carrier protei 93.1 0.42 1.1E-05 27.4 6.5 57 5-64 1-62 (66) 120 pfam02597 ThiS ThiS family. Th 93.0 0.33 8.4E-06 28.2 5.9 53 11-65 11-67 (70) 121 cd00565 ThiS ThiS (ThiaminS) i 92.9 0.29 7.3E-06 28.6 5.4 55 7-64 2-61 (65) 122 PRK06488 sulfur carrier protei 92.8 0.32 8.2E-06 28.3 5.6 56 5-64 1-61 (65) 123 cd00769 PheRS_beta_core Phenyl 92.3 0.088 2.2E-06 32.1 2.1 91 276-374 2-93 (198) 124 TIGR01683 thiS thiamine biosyn 91.6 0.53 1.3E-05 26.8 5.5 52 11-64 4-61 (65) 125 PRK09616 pheT phenylalanyl-tRN 91.4 0.52 1.3E-05 26.8 5.3 92 273-373 355-448 (546) 126 pfam01409 tRNA-synt_2d tRNA sy 91.1 1.3 3.3E-05 24.1 8.9 111 272-394 15-131 (243) 127 TIGR00458 aspS_arch aspartyl-t 89.9 1.2 3E-05 24.4 6.0 106 273-402 155-263 (466) 128 KOG1547 consensus 89.6 0.63 1.6E-05 26.2 4.4 94 508-627 160-267 (336) 129 TIGR00470 sepS O-phosphoseryl- 87.5 1 2.6E-05 24.8 4.4 218 335-610 185-432 (558) 130 KOG0554 consensus 86.7 2.6 6.5E-05 22.0 6.3 103 273-396 132-240 (446) 131 TIGR00443 hisS_second histidyl 86.5 2.6 6.7E-05 22.0 6.7 157 259-428 3-161 (414) 132 PRK00488 pheS phenylalanyl-tRN 85.4 3 7.6E-05 21.6 8.0 116 261-395 95-216 (338) 133 cd00144 PP2Ac Protein phosphat 84.6 3.2 8.2E-05 21.4 6.8 94 114-222 1-106 (269) 134 PRK07860 NADH dehydrogenase su 83.3 3.6 9.3E-05 21.0 8.4 80 2-82 14-109 (809) 135 COG0072 PheT Phenylalanyl-tRNA 83.0 0.95 2.4E-05 25.0 2.4 94 272-374 349-444 (650) 136 cd01666 TGS_DRG_C TGS_DRG_C: 82.8 1.7 4.4E-05 23.2 3.7 51 13-63 17-75 (75) 137 COG1034 NuoG NADH dehydrogenas 81.3 4.3 0.00011 20.5 8.1 76 4-82 1-92 (693) 138 KOG0556 consensus 81.3 3.7 9.5E-05 20.9 4.9 93 274-395 229-328 (533) 139 COG5019 CDC3 Septin family pro 80.7 4.5 0.00011 20.4 5.4 20 564-583 180-199 (373) 140 PTZ00239 serine/threonine prot 80.5 4.5 0.00012 20.3 6.5 99 110-223 12-122 (303) 141 PRK08493 NADH dehydrogenase su 80.1 4.7 0.00012 20.2 7.8 69 4-76 1-83 (819) 142 cd00754 MoaD MoaD family. Memb 79.6 4.8 0.00012 20.1 5.4 52 14-65 19-77 (80) 143 PRK06437 hypothetical protein; 76.7 5.8 0.00015 19.6 4.7 61 1-63 1-62 (67) 144 COG2024 Phenylalanyl-tRNA synt 76.5 0.99 2.5E-05 24.9 0.8 179 351-578 199-392 (536) 145 cd01206 Homer Homer type EVH1 74.7 3.1 7.8E-05 21.5 2.9 34 207-244 76-109 (111) 146 COG3383 Uncharacterized anaero 74.1 6.8 0.00017 19.1 8.7 69 1-72 1-84 (978) 147 PRK12814 putative NADPH-depend 73.9 4.4 0.00011 20.4 3.6 30 1-32 1-30 (652) 148 PTZ00244 serine/threonine-prot 73.1 7.1 0.00018 19.0 6.3 98 111-223 22-131 (294) 149 pfam10752 DUF2533 Protein of u 69.5 7.4 0.00019 18.8 3.9 41 109-149 40-80 (84) 150 PRK09129 NADH dehydrogenase su 69.0 8.8 0.00022 18.4 7.9 76 4-82 1-92 (777) 151 KOG0555 consensus 69.0 8.2 0.00021 18.6 4.0 113 273-416 243-364 (545) 152 pfam07801 DUF1647 Protein of u 68.7 8.9 0.00023 18.3 5.4 64 44-125 46-110 (142) 153 PRK08364 sulfur carrier protei 67.8 4.3 0.00011 20.5 2.4 50 13-64 13-63 (67) 154 PRK02645 ppnK inorganic polyph 66.7 9.7 0.00025 18.1 6.8 62 580-644 221-289 (304) 155 pfam00735 Septin Septin. Membe 66.2 9.9 0.00025 18.0 4.9 75 550-625 138-223 (280) 156 PRK00629 pheT phenylalanyl-tRN 65.8 6.5 0.00016 19.3 3.0 90 273-372 483-574 (786) 157 PRK09130 NADH dehydrogenase su 63.8 11 0.00028 17.7 7.7 75 5-82 2-93 (680) 158 PRK08166 NADH dehydrogenase su 63.5 8.2 0.00021 18.5 3.2 75 5-82 2-97 (858) 159 TIGR00499 lysS_bact lysyl-tRNA 62.9 11 0.00029 17.6 6.7 109 271-402 188-298 (538) 160 pfam00568 WH1 WH1 domain. WASp 61.9 12 0.0003 17.5 3.7 29 207-241 82-110 (111) 161 PRK03708 ppnK inorganic polyph 61.3 12 0.00031 17.4 6.6 61 580-644 208-273 (278) 162 PRK03372 ppnK inorganic polyph 61.2 12 0.00031 17.4 6.8 64 580-646 224-292 (303) 163 CHL00192 syfB phenylalanyl-tRN 61.0 9.6 0.00024 18.1 3.1 88 272-373 392-480 (702) 164 PRK03147 thiol-disulfide oxido 59.9 13 0.00033 17.2 4.5 79 554-645 95-176 (176) 165 cd00837 EVH1 EVH1 (Enabled, Va 59.4 13 0.00033 17.2 3.7 29 207-241 75-103 (104) 166 COG1163 DRG Predicted GTPase [ 58.3 13 0.00034 17.1 3.5 49 530-582 276-333 (365) 167 PRK01777 hypothetical protein; 58.1 14 0.00035 17.0 4.5 63 1-64 1-76 (95) 168 PRK11872 antC anthranilate dio 57.0 14 0.00036 16.9 3.8 35 1-35 1-37 (340) 169 cd04938 TGS_Obg-like TGS_Obg-l 56.4 6.5 0.00017 19.2 1.6 51 9-62 22-75 (76) 170 COG1854 LuxS LuxS protein invo 56.1 15 0.00038 16.8 7.2 77 54-131 33-112 (161) 171 PRK05265 pyridoxine 5'-phospha 55.0 15 0.00039 16.7 5.8 45 531-584 75-131 (240) 172 KOG2784 consensus 54.7 3.1 7.9E-05 21.5 -0.3 139 263-422 206-388 (483) 173 PRK03604 moaC bifunctional mol 53.7 12 0.0003 17.4 2.6 75 545-620 138-223 (301) 174 cd01207 Ena-Vasp Enabled-VASP- 53.2 16 0.00042 16.5 3.6 40 198-243 67-108 (111) 175 cd01990 Alpha_ANH_like_I This 52.5 14 0.00035 17.0 2.7 59 238-296 127-192 (202) 176 PRK04759 consensus 51.5 17 0.00044 16.3 6.7 40 604-644 241-286 (294) 177 cd01850 CDC_Septin CDC/Septin. 50.6 18 0.00046 16.2 5.0 67 554-624 143-223 (276) 178 TIGR00326 eubact_ribD riboflav 50.6 9.9 0.00025 18.0 1.8 20 627-646 298-317 (393) 179 COG3222 Uncharacterized protei 50.5 16 0.00042 16.5 2.9 13 572-584 172-184 (211) 180 pfam02590 SPOUT_MTase Predicte 49.9 18 0.00047 16.1 5.7 84 527-611 7-106 (155) 181 smart00166 UBX Domain present 48.8 19 0.00048 16.0 3.2 31 3-33 5-37 (80) 182 smart00156 PP2Ac Protein phosp 47.3 20 0.00051 15.9 6.1 95 114-223 1-107 (271) 183 cd00003 PNPsynthase Pyridoxine 47.2 20 0.00051 15.9 5.8 63 531-604 72-146 (234) 184 cd01226 PH_exo84 Exocyst compl 46.4 15 0.00039 16.7 2.2 40 190-243 60-99 (100) 185 PRK08299 isocitrate dehydrogen 45.6 21 0.00054 15.7 5.4 15 630-644 383-397 (403) 186 smart00295 B41 Band 4.1 homolo 45.5 21 0.00054 15.7 3.2 33 2-34 3-37 (207) 187 TIGR01122 ilvE_I branched-chai 44.4 5.7 0.00014 19.7 -0.2 34 509-543 193-228 (302) 188 PRK02889 tolB translocation pr 43.6 23 0.00058 15.5 6.2 14 124-137 55-68 (430) 189 PRK11180 rluD 23S rRNA pseudou 43.5 23 0.00058 15.5 3.4 58 1-61 1-67 (325) 190 smart00461 WH1 WASP homology r 42.8 23 0.00059 15.4 3.7 29 207-241 77-105 (106) 191 PRK04922 tolB translocation pr 42.5 24 0.0006 15.4 5.3 18 120-137 53-70 (439) 192 cd00824 PTBI IRS-like phosphot 42.4 24 0.0006 15.4 4.2 47 243-289 48-100 (104) 193 pfam03740 PdxJ Pyridoxal phosp 42.2 24 0.00061 15.4 5.4 64 531-605 73-148 (239) 194 cd01772 SAKS1_UBX SAKS1 (SAPK- 41.8 24 0.00062 15.3 3.1 30 3-32 5-36 (79) 195 TIGR00472 pheT_bact phenylalan 41.7 19 0.00049 16.0 2.1 86 282-373 526-614 (848) 196 TIGR01344 malate_syn_A malate 41.7 4.2 0.00011 20.5 -1.2 23 378-400 145-168 (522) 197 PRK02270 consensus 41.6 22 0.00055 15.7 2.4 72 25-98 20-92 (327) 198 pfam01582 TIR TIR domain. The 40.9 25 0.00063 15.2 3.2 13 112-124 41-53 (139) 199 TIGR01687 moaD_arch MoaD famil 40.8 25 0.00064 15.2 4.9 51 14-64 19-89 (93) 200 PRK04539 ppnK inorganic polyph 40.6 25 0.00064 15.2 6.6 41 604-644 243-288 (296) 201 pfam08921 DUF1904 Domain of un 40.4 25 0.00065 15.2 4.9 78 218-295 3-90 (107) 202 TIGR01857 FGAM-synthase phosph 39.9 7.7 0.0002 18.7 -0.1 230 11-247 548-866 (1279) 203 pfam12362 DUF3646 DNA polymera 39.2 26 0.00067 15.0 2.8 20 75-94 94-113 (117) 204 TIGR00106 TIGR00106 conserved 38.8 27 0.00068 15.0 3.7 58 530-587 16-84 (103) 205 pfam06646 Mycoplasma_p37 High 37.4 20 0.00051 15.9 1.7 19 114-132 52-70 (383) 206 KOG1261 consensus 37.0 18 0.00046 16.2 1.4 33 252-284 200-243 (552) 207 PRK02260 S-ribosylhomocysteina 36.9 29 0.00073 14.8 7.4 72 58-131 37-112 (163) 208 COG3894 Uncharacterized metal- 36.9 29 0.00073 14.8 2.6 58 1-67 1-61 (614) 209 pfam01963 TraB TraB family. pA 36.4 19 0.00049 16.0 1.5 45 568-612 162-209 (225) 210 PRK07199 phosphoribosylpyropho 36.4 8 0.0002 18.6 -0.5 72 23-98 15-88 (301) 211 PRK01911 ppnK inorganic polyph 36.4 29 0.00074 14.8 6.5 41 604-644 237-281 (290) 212 pfam10254 Pacs-1 PACS-1 cytoso 35.9 30 0.00076 14.7 3.8 18 505-522 324-341 (413) 213 PRK01185 ppnK inorganic polyph 35.6 30 0.00076 14.7 5.1 101 525-650 164-270 (272) 214 pfam11543 UN_NPL4 Nuclear pore 35.5 30 0.00077 14.7 3.2 35 1-35 1-38 (80) 215 TIGR02943 Sig70_famx1 RNA poly 35.4 21 0.00053 15.8 1.5 82 230-313 10-108 (194) 216 TIGR02037 degP_htrA_DO proteas 35.1 21 0.00053 15.8 1.5 22 422-443 116-140 (484) 217 KOG3063 consensus 35.1 25 0.00063 15.2 1.8 88 2-95 11-101 (301) 218 KOG3851 consensus 34.5 26 0.00067 15.1 1.9 53 195-249 199-251 (446) 219 PRK13255 thiopurine S-methyltr 34.0 32 0.00081 14.5 3.8 31 126-156 52-82 (218) 220 cd01817 RGS12_RBD RGS12 (regul 33.6 32 0.00082 14.5 4.1 56 6-61 3-67 (73) 221 KOG3309 consensus 33.4 32 0.00083 14.4 3.2 48 505-558 43-93 (159) 222 PRK02649 ppnK inorganic polyph 32.7 33 0.00084 14.4 5.4 41 604-644 243-288 (305) 223 PHA01622 CRISPR-associated Cas 32.6 33 0.00085 14.3 4.9 146 222-413 18-169 (204) 224 pfam06616 BsuBI_PstI_RE BsuBI/ 32.5 33 0.00085 14.3 4.1 18 219-236 259-276 (306) 225 pfam04571 Lipin_N lipin, N-ter 32.5 33 0.00085 14.3 2.7 61 4-67 30-91 (111) 226 PRK03378 ppnK inorganic polyph 32.3 34 0.00086 14.3 5.8 96 525-644 182-284 (292) 227 COG1977 MoaD Molybdopterin con 32.2 34 0.00086 14.3 3.4 47 19-65 26-81 (84) 228 cd01202 FRS2 Fibroblast growth 32.2 34 0.00086 14.3 4.5 43 243-285 46-94 (102) 229 TIGR02224 recomb_XerC tyrosine 32.1 24 0.00062 15.3 1.4 15 507-521 175-189 (313) 230 cd00727 malate_synt_A Malate s 32.0 33 0.00083 14.4 2.0 191 101-379 88-306 (511) 231 pfam08428 Rib Rib/alpha-like r 31.7 26 0.00066 15.1 1.5 14 5-18 45-58 (65) 232 COG4829 CatC1 Muconolactone de 30.9 28 0.00072 14.9 1.6 39 302-358 41-79 (98) 233 PRK00103 SPOUT methyltransfera 30.8 36 0.00091 14.1 6.1 84 527-611 7-105 (156) 234 TIGR02225 recomb_XerD tyrosine 30.8 24 0.00061 15.3 1.2 18 401-418 62-79 (305) 235 PRK05259 consensus 30.3 29 0.00075 14.7 1.6 10 80-89 69-78 (310) 236 TIGR02393 RpoD_Cterm RNA polym 30.3 10 0.00026 17.9 -0.8 74 17-90 100-175 (240) 237 PRK09255 malate synthase; Vali 29.9 12 0.0003 17.5 -0.5 43 253-295 189-244 (531) 238 cd02958 UAS UAS family; UAS is 29.8 37 0.00094 14.0 6.8 66 569-643 43-109 (114) 239 PRK09601 translation-associate 29.6 37 0.00095 14.0 4.1 16 117-132 135-150 (364) 240 cd01770 p47_UBX p47_UBX p47 i 29.4 38 0.00096 14.0 2.5 54 4-57 6-67 (79) 241 PRK05282 peptidase E; Validate 28.9 38 0.00097 13.9 4.8 15 441-455 46-60 (233) 242 TIGR01381 E1_like_apg7 E1-like 28.5 39 0.00099 13.9 4.0 37 546-582 597-644 (689) 243 TIGR00439 ftsX putative protei 28.1 39 0.001 13.8 3.3 68 83-166 61-130 (314) 244 KOG1434 consensus 26.9 41 0.0011 13.7 2.2 43 488-543 127-174 (335) 245 pfam07476 MAAL_C Methylasparta 26.9 41 0.0011 13.7 3.8 64 522-585 72-143 (249) 246 PRK02458 ribose-phosphate pyro 26.4 37 0.00093 14.1 1.5 72 25-98 24-96 (323) 247 cd01203 DOK_PTB Downstream of 26.4 42 0.0011 13.6 4.9 44 244-287 49-98 (104) 248 COG0854 PdxJ Pyridoxal phospha 26.2 42 0.0011 13.6 4.1 55 550-606 83-149 (243) 249 cd04886 ACT_ThrD-II-like C-ter 25.7 43 0.0011 13.6 5.3 55 529-583 10-72 (73) 250 pfam12387 Peptidase_C74 Pestiv 25.4 44 0.0011 13.5 1.7 12 531-542 86-97 (200) 251 PRK05809 3-hydroxybutyryl-CoA 25.4 44 0.0011 13.5 4.6 15 57-71 12-26 (260) 252 pfam04895 DUF651 Archaeal prot 25.3 44 0.0011 13.5 1.9 32 248-281 29-61 (110) 253 PRK02812 ribose-phosphate pyro 25.0 29 0.00075 14.7 0.8 78 19-98 26-108 (331) 254 TIGR02635 RhaI_grampos L-rhamn 24.9 45 0.0011 13.5 8.8 167 340-542 55-232 (382) 255 pfam04914 DltD_C DltD C-termin 24.1 46 0.0012 13.4 4.9 52 528-588 35-97 (130) 256 KOG2655 consensus 24.1 46 0.0012 13.3 4.6 14 295-308 161-174 (366) 257 PRK01742 tolB translocation pr 23.8 47 0.0012 13.3 5.2 14 124-137 54-67 (430) 258 pfam00789 UBX UBX domain. This 23.8 47 0.0012 13.3 3.5 29 3-31 7-37 (81) 259 pfam10605 3HBOH 3HB-oligomer h 23.4 48 0.0012 13.3 2.4 23 590-612 542-564 (689) 260 KOG0373 consensus 23.4 15 0.00038 16.8 -1.0 27 377-403 215-242 (306) 261 TIGR02331 rib_alpha Rib/alpha/ 23.1 46 0.0012 13.4 1.5 19 3-21 48-66 (83) 262 PRK06143 enoyl-CoA hydratase; 23.1 48 0.0012 13.2 3.7 46 56-101 16-63 (260) 263 cd03314 MAL Methylaspartate am 23.0 49 0.0012 13.2 2.7 71 522-592 194-274 (369) 264 pfam07447 VP40 Matrix protein 22.6 27 0.0007 14.9 0.3 12 262-273 89-100 (295) 265 PRK04043 tolB translocation pr 22.3 50 0.0013 13.1 5.7 21 120-140 44-64 (419) 266 PRK05137 tolB translocation pr 22.1 50 0.0013 13.1 6.1 15 123-137 56-70 (437) 267 PRK11130 moaD molybdopterin sy 22.1 51 0.0013 13.1 4.2 50 15-64 19-77 (81) 268 pfam06183 DinI DinI-like famil 21.8 51 0.0013 13.0 6.0 47 80-131 12-58 (65) 269 KOG1643 consensus 21.7 51 0.0013 13.0 2.3 34 56-89 84-117 (247) 270 pfam03658 UPF0125 Uncharacteri 21.7 51 0.0013 13.0 4.2 56 9-64 9-73 (83) 271 pfam09837 DUF2064 Uncharacteri 21.0 53 0.0014 12.9 3.6 27 79-106 77-103 (121) 272 KOG3859 consensus 20.9 53 0.0014 12.9 2.5 46 229-276 116-163 (406) 273 PRK05658 RNA polymerase sigma 20.9 40 0.001 13.8 0.9 47 379-425 418-488 (620) 274 pfam04343 DUF488 Protein of un 20.9 53 0.0014 12.9 3.2 18 570-587 4-21 (123) 275 PRK12775 putative trifunctiona 20.9 53 0.0014 12.9 7.3 73 543-619 560-645 (993) 276 pfam01216 Calsequestrin Calseq 20.8 54 0.0014 12.9 9.7 145 123-287 57-224 (350) 277 KOG0372 consensus 20.6 54 0.0014 12.9 3.2 106 113-223 15-122 (303) 278 TIGR00559 pdxJ pyridoxal phosp 20.1 55 0.0014 12.8 5.8 63 531-606 73-151 (265) No 1 >PRK12444 threonyl-tRNA synthetase; Reviewed Probab=100.00 E-value=0 Score=1706.12 Aligned_cols=632 Identities=45% Similarity=0.830 Sum_probs=618.6 Q ss_pred CCC-CEEEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECCEEEC-CCEEEEECCCCHHHHHHHHHHHH Q ss_conf 988-7089928998874189888999998636110112899998988997745303-67079862899999999999899 Q gi|254781004|r 1 MPS-DIKLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLSDPVR-EGSIEIITPEDPRSLAVIRHSCA 78 (652) Q Consensus 1 M~~-~i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~~~l~-d~~i~~i~~~~~eg~~i~~hS~a 78 (652) |++ +|+||||||++++++.|+|+.|||++|++++++++|||+|||+++||++||+ ||+|+++|++|++|+.++||||| T Consensus 1 m~~~mi~ItLpDGs~~e~~~~~t~~~iA~~I~~~Lak~avaa~vng~l~Dl~~~i~~d~~le~it~~~~eg~~v~rHS~A 80 (639) T PRK12444 1 MKEQMIEIKFPDGSVKEFVKGITLEEIAGSISSSLKKKAVAGKVNDELYDLRRNIEENAEVEIITIDSNEGVEIARHSAA 80 (639) T ss_pred CCCCCEEEECCCCCEEECCCCCCHHHHHHHHCHHHHHHEEEEEECCEEEECCCCCCCCCEEEEECCCCHHHHHHHHHHHH T ss_conf 98776079828998456789999999999877666656289998999972785168997899955899778999988899 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCCHHH Q ss_conf 99999999977998899724358855788408888889899999999999996699804853276785445441671135 Q gi|254781004|r 79 HIMAEAVQSIWPEVQVAIGPIIEDGFYYDFDKEQPFSSDELAQIEKKMQEIIARDSPFLKQYLSHEKARELFESKKEAYK 158 (652) Q Consensus 79 hlL~~Av~~l~p~~kl~igp~i~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA~~~F~~~~~~~K 158 (652) ||||+||+++||++|++|||+|++||||||+.++++|++||++||++|++||++|+||+|.++|+++|+++|+.++++|| T Consensus 81 Hvla~Av~~l~p~~kl~iGP~ie~GFYYDf~~~~~~t~~Dl~~IE~~M~~iik~~~~f~r~~~s~~eA~~~f~~~~~~yK 160 (639) T PRK12444 81 HILAQAVKRLYGDVNLGVGPVIENGFYYDMDLPSSVNVEDLRKIEKEMKKIINENIKIERVEVSREEAAKLFQEMNDRLK 160 (639) T ss_pred HHHHHHHHHHCCCCEEEECCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHCCCHHH T ss_conf 99999999977996388789888832475346999998999999999999873789749998589999999986384777 Q ss_pred HHHHHHCCCCCCCEEECCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEEECCHHHHHHHHHHH Q ss_conf 56654203455421302575201000235521444222333333211101577564206899999837988999999999 Q gi|254781004|r 159 VEILESIPAKENVTLYRQGEWFDLCRGPHVRSTGQVKKFFKLMKVAGAYWRGDSNRPMLSRIYGTAWNTQQELTQYLYFL 238 (652) Q Consensus 159 ~~li~~~~~~~~v~~y~~g~~~Dlc~GphvpsTg~ik~~fkL~~~sgayw~gd~~~~~l~Riyg~af~~~~~l~~~~~~~ 238 (652) +++++.+++++.+|+|+||+|+|||+||||||||.||+ |||+++||||||||++|+||||||||||||+++|++||+++ T Consensus 161 ~eli~~~~~~~~is~Y~~g~f~DLCrGPHv~~t~~ika-fkL~~vsgAYWrGd~~n~~LqRIYG~af~~k~~L~~yl~~l 239 (639) T PRK12444 161 LELLEAIPSGESVTLYKQGEFVDLCRGPHLPSTGYLKA-FQLTHVSGAYWRGDSNNQVLQRIYGVAFSSQKELEEYLHFV 239 (639) T ss_pred HHHHHCCCCCCCEEEEEECCEEEECCCCCCCCCCCCEE-EEEEEEECCEECCCCCCCCEEEEEEEEECCHHHHHHHHHHH T ss_conf 78875178998432899489889668877787454407-89734002142788768221689999858857799999999 Q ss_pred HHHHHHHHHHHHHHCCEEEECCCCCCEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9753200887541144044226787435650107899999999999887-689638555432443100011210012555 Q gi|254781004|r 239 EESEKRDHRKLAREMDLFHIAEDGSGVIFWHRKGWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRAN 317 (652) Q Consensus 239 eea~~rdHr~lg~~~~lf~~~~~~~G~~~wlP~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~ 317 (652) ||||+||||||||+|+||+|+++|||+|||||+|++|++.|++|+|++. ++||+||.||+|++++||++||||++|+++ T Consensus 240 eEakkRDHRklGkel~LF~f~~~~pG~~fwlP~G~~i~~~le~~~r~~~~~~GY~eV~TP~i~~~~Lw~~SGH~~~Y~e~ 319 (639) T PRK12444 240 EEAAKRNHRKLGNELELFMFSEEAPGMPFYLPKGQIIRNELEAFLREIQKEYNYQEVRTPFMMNQELWERSGHWGHYKDN 319 (639) T ss_pred HHHHHCCHHHHHHHCCEEEECCCCCCCEEEECCHHHHHHHHHHHHHHHHHHCCCEEECCCEEEHHHHHHHCCHHHHHHHC T ss_conf 99886478876532272882277897447846786999999999999999839878228666215687616604333302 Q ss_pred CCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCH Q ss_conf 42010014567778237743431257777652012222310432000100012787645200010563533442101053 Q gi|254781004|r 318 MFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTK 397 (652) Q Consensus 318 mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~ 397 (652) ||.++ .++++|+|||||||+||+||++.+|||||||+|++|+|++||||.||+|+||||||+|||||||+||++ T Consensus 320 Mf~~~------~d~~~~~lKPMNCP~H~~if~~~~rSYRdLPlR~aEfg~vhR~E~SG~L~GL~RVR~FtqDDAHIFct~ 393 (639) T PRK12444 320 MYFSE------VDNKSFALKPMNCPGHMLMFKNKLHSYRELPIRMCEFGQVHRHEFSGALNGLLRVRTFCQDDAHLFVTP 393 (639) T ss_pred CCCCC------CCCCEEECCCCCCHHHHHHHHCCCCCHHHCCHHHHHCCCEECCCCCCCCCCCCCCCCCCCCCCEEECCH T ss_conf 75545------454033126768588999986056214436522243364633778766457677889987982465388 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 76788789999998876655213321000146531125860357789999999998502121256675205555554554 Q gi|254781004|r 398 EQMFNECLKIHNLIISIYKDFGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGP 477 (652) Q Consensus 398 eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygp 477 (652) ||+++|+.++++++..+|.+|||+ +.+.+||+|++++|+++.|+.||..|+++|+..+ ++|.+++|+|||||| T Consensus 394 dQi~~Ei~~~~~~~~~vy~~fGf~-~~~~LStrpek~~G~de~W~~Ae~~L~~aL~~~g------~~y~~~~GegAFYGP 466 (639) T PRK12444 394 EQIEDEIKSVMAQIDYVYKTFGFE-YEVELSTRPEDSMGDDKLWEQAEAALENVLQSLN------YKYRLNEGDGAFYGP 466 (639) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCE-EEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCC------CCEEECCCCCCCCCC T ss_conf 999999999999999999747972-9998167863345888999999999999999749------975867686364032 Q ss_pred CCCCCHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEECCCEEEEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEE Q ss_conf 32110021103310012110111022346741673179721379964034103789999999973870887337808999 Q gi|254781004|r 478 KFEYILKDAIGRDWQCGTIQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFGSIERFIGIMIENFKGNLPLWLSPIQAIV 557 (652) Q Consensus 478 kid~~~~d~~gr~~~~~tiq~df~~p~~f~l~y~~~dg~~~~pvmihr~~~Gs~eR~ia~liE~~~g~~P~wLAP~QV~I 557 (652) |||++++|++||.|||+|+|+||++|+||+++|+++||++++|||||||++||+|||||+|||||+|+||+||||+||+| T Consensus 467 KiD~~v~DalgR~wq~~TIQlDF~lPeRF~l~Y~~~dg~~~rPVmIHRAilGS~ERFigiLiEh~~G~fP~WLaP~Qv~I 546 (639) T PRK12444 467 KIDFHIKDALNRSHQCGTIQLDFQMPEKFDLNYIDEKNEKVRPVVIHRAVLGSLDRFLAILIEHFGGAFPAWVAPVQVKV 546 (639) T ss_pred EEEEEEEECCCCEEECCEEEEEECCHHHCCCEEECCCCCCCCCEEEEECCCCCHHHHHHHHHHHCCCCCCCCCCCCEEEE T ss_conf 65689980788724512345520566335877787899825988999578777899999999972798985657200699 Q ss_pred EECCHH-HHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCEEEECHHH Q ss_conf 857966-8999999999999879889997589663676665543489889998742532581899988988663342999 Q gi|254781004|r 558 TTITSS-AVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGSTTTQKVSLLD 636 (652) Q Consensus 558 ipi~e~-~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~~i~lee 636 (652) |||+++ +.+||++|++.|+++|+||++|+|++++|+|||+|++.+|||++|||++|+++++|+||.|++++++.+++++ T Consensus 547 ipi~~~~~~~Ya~~v~~~L~~~giRv~vD~~~e~l~~KIR~a~~~kiPy~lVVGdkE~e~~~VsvR~r~~~~~~~~~~~~ 626 (639) T PRK12444 547 IPVSNAVHVQYADEVADKLAQAGVRVERDERDEKLGYKIREAQMQKIPYVLVIGDKEMENGAVNVRKYGEEKSEVVELDM 626 (639) T ss_pred EECCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEEEHHHHHCCEEEEEECCCCCCCEEEHHH T ss_conf 95377027999999999999789889998999977599998743389989998415653595899977887347067999 Q ss_pred HHHHHHHHCC Q ss_conf 9999987314 Q gi|254781004|r 637 AIPILTKESL 646 (652) Q Consensus 637 ~i~~l~~e~~ 646 (652) |++.|.+|.. T Consensus 627 fi~~l~~ei~ 636 (639) T PRK12444 627 FVESIKEEIK 636 (639) T ss_pred HHHHHHHHHH T ss_conf 9999999997 No 2 >PRK04483 threonyl-tRNA synthetase; Validated Probab=100.00 E-value=0 Score=1698.99 Aligned_cols=630 Identities=46% Similarity=0.899 Sum_probs=616.2 Q ss_pred CEEEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECCEEEC-CCEEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 7089928998874189888999998636110112899998988997745303-670798628999999999998999999 Q gi|254781004|r 4 DIKLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLSDPVR-EGSIEIITPEDPRSLAVIRHSCAHIMA 82 (652) Q Consensus 4 ~i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~~~l~-d~~i~~i~~~~~eg~~i~~hS~ahlL~ 82 (652) +|+||||||++++++.|+|+.|||++|++++++++|||+|||+++||++||+ ||+|+|+|++|++|++|+||||||||| T Consensus 1 MI~ItLPDGs~ke~~~g~T~~dIA~~Is~~Lak~avaa~VNG~l~DL~~pL~~D~~leiiT~~d~egl~v~rHS~AHvla 80 (634) T PRK04483 1 MITITLPDGSRREFESPVSVMQVAQSIGAGLAKATIAGQVDGQLVDASDLIEHDASLRIITAKDAEGVEIIRHSCAHLVG 80 (634) T ss_pred CCEEECCCCCEEECCCCCCHHHHHHHHCHHHHHHHEEEEECCEEEECCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHH T ss_conf 96799089984777899999999998675666661899989999707850689978999569997899999999999999 Q ss_pred HHHHHHCCCCEEEEECCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCCHHHHHHH Q ss_conf 99999779988997243588557884088888898999999999999966998048532767854454416711355665 Q gi|254781004|r 83 EAVQSIWPEVQVAIGPIIEDGFYYDFDKEQPFSSDELAQIEKKMQEIIARDSPFLKQYLSHEKARELFESKKEAYKVEIL 162 (652) Q Consensus 83 ~Av~~l~p~~kl~igp~i~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA~~~F~~~~~~~K~~li 162 (652) +||+++||++|++|||+|++||||||+.+.++|++||++||++|++||++|+||+|.++|+++|+++|++.|++||++++ T Consensus 81 ~Av~~l~p~~kl~iGP~ie~GFYYDf~~~~~~t~~Dl~~IE~~M~~iik~~~~~~r~~~s~~eA~~~f~~~~~~yK~eli 160 (634) T PRK04483 81 HAVKQLYPEVKMVIGPVIAEGFYYDIYSERPFTPEDMAAIEQRMQELIAQDYDVIKKVTPRAEVIEVFAQRGEEYKLRLI 160 (634) T ss_pred HHHHHHCCCCEEEECCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHCCCHHHHHHH T ss_conf 99999779975887897788445732689989988999999999998607787599980799999999876977878798 Q ss_pred HHCCCC-CCCEEECCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEEECCHHHHHHHHHHHHHH Q ss_conf 420345-5421302575201000235521444222333333211101577564206899999837988999999999975 Q gi|254781004|r 163 ESIPAK-ENVTLYRQGEWFDLCRGPHVRSTGQVKKFFKLMKVAGAYWRGDSNRPMLSRIYGTAWNTQQELTQYLYFLEES 241 (652) Q Consensus 163 ~~~~~~-~~v~~y~~g~~~Dlc~GphvpsTg~ik~~fkL~~~sgayw~gd~~~~~l~Riyg~af~~~~~l~~~~~~~eea 241 (652) +.+.++ +.+++|+||+|+|||+||||||||.||+ |||+++||||||||++|+||||||||||||+++|++||+++||| T Consensus 161 ~~~~~~~~~is~Y~~g~f~DLCrGPHv~~T~~ika-fkL~~vsgaYWrGd~~n~~LqRIYG~af~~k~~L~~yl~~~eEa 239 (634) T PRK04483 161 EDMSDDITAMGLYYHQEYVDMCRGPHVPNTRFLKA-FKLTRISGAYWRGDAKNEQLQRIYGTAWADKKQLDAYILRMEEA 239 (634) T ss_pred HHCCCCCCEEEEEECCCEEEECCCCCCCCCCCCCH-HEEEEHHHHEECCCCCCCCEEEEEEEEECCHHHHHHHHHHHHHH T ss_conf 64488873588997388758379998897111311-12301102112688757102899999508989999999999998 Q ss_pred HHHHHHHHHHHCCEEEECCCCCCEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 3200887541144044226787435650107899999999999887-689638555432443100011210012555420 Q gi|254781004|r 242 EKRDHRKLAREMDLFHIAEDGSGVIFWHRKGWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFA 320 (652) Q Consensus 242 ~~rdHr~lg~~~~lf~~~~~~~G~~~wlP~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~ 320 (652) ||||||||||+|+||+|+++|||+|||||+|++|++.|++|+|+++ ++||+||.||+|++++||++||||++|+++||. T Consensus 240 kkRDHRklGkel~LF~f~~~~pG~~~wlP~G~~i~~~le~~~r~~~~~~Gy~eV~tP~i~~~~Lw~~SGH~~~Y~e~Mf~ 319 (634) T PRK04483 240 DKRDHRKIGKAQDLFHLQEEAPGLVFWHPKGWSLWQVVEQYMRKVYRDSGYGEVRCPQILDVSLWQKSGHWDNYQDAMFF 319 (634) T ss_pred HHCCHHHHHHHCCEEEECCCCCCCEEECCCHHHHHHHHHHHHHHHHHHCCCEEEECCEECCHHHHHHCCCHHHHHHCCCE T ss_conf 86588887542110673377886438816689999999999999999829959988625137787625616455303713 Q ss_pred EECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHH Q ss_conf 10014567778237743431257777652012222310432000100012787645200010563533442101053767 Q gi|254781004|r 321 VKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQM 400 (652) Q Consensus 321 ~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~ 400 (652) . +.++++|+|||||||+||+||++.+|||||||+|++|+|++||||.||+|+||||||+|+|||||+||++||+ T Consensus 320 ~------~~~~~~~~lKPMNCP~H~~if~~~~rSYRdLPlR~aE~g~~~R~E~SG~L~GL~RvR~FtqdDaHIFc~~dQi 393 (634) T PRK04483 320 T------ESEKRTYAVKPMNCPGHVQVFNQGLHSYRDLPIRYGEFGACHRNEPSGALHGILRVRGFTQDDGHVFCLESQI 393 (634) T ss_pred E------CCCCCEEEECCCCCHHHHHHHHCCCCCHHHCCHHHHHCCCEEECCCCCCCCCCCCCCCCCCCCCEEECCHHHH T ss_conf 4------2554035515768564879987177444434366554330220567765457456889988983356389999 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 88789999998876655213321000146531125860357789999999998502121256675205555554554321 Q gi|254781004|r 401 FNECLKIHNLIISIYKDFGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFE 480 (652) Q Consensus 401 ~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid 480 (652) ++|+..+++++..+|..|||..+.+++|++|++++|+++.|+.||..|+++|+..+ ++|.+++|++|||||||| T Consensus 394 ~~Ei~~~l~~~~~vy~~fGf~~~~~~lStrpek~~G~~~~W~~AE~~L~~al~~~g------~~y~~~~GegAFYGPKID 467 (634) T PRK04483 394 EAEVTAFHQQALAVYTAFGFDDIQIKIALRPEKRLGDDATWDKAEAALRSALGVCG------VEWQELPGEGAFYGPKIE 467 (634) T ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCC------CCCEECCCCCCEECCCEE T ss_conf 99999999999999986399538999635762345875568899999999999749------870776797010566228 Q ss_pred CCHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEECCCEEEEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEC Q ss_conf 10021103310012110111022346741673179721379964034103789999999973870887337808999857 Q gi|254781004|r 481 YILKDAIGRDWQCGTIQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFGSIERFIGIMIENFKGNLPLWLSPIQAIVTTI 560 (652) Q Consensus 481 ~~~~d~~gr~~~~~tiq~df~~p~~f~l~y~~~dg~~~~pvmihr~~~Gs~eR~ia~liE~~~g~~P~wLAP~QV~Iipi 560 (652) ++++|++||.|||+|+|+||++|+||+++|+++||++++|||||||++||+|||||+|||||+|+||+||||+||+|||| T Consensus 468 ~~v~DalgR~~q~~TiQlDf~lP~rF~l~Y~~~dg~~~rPvmIHRAilGS~ERfi~iLiEh~~G~fP~WLaP~Qv~iipv 547 (634) T PRK04483 468 YHLKDAIGRTWQLGTMQVDFMMPGRLGAEYVDENSQKKHPVMLHRAIVGSMERFIGILIEHHAGAFPAWLAPVQVVVANI 547 (634) T ss_pred EEEEECCCCEEECCEEEEECCCCCCCCCEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHCCCCCCCCCCCCEEEEEEC T ss_conf 99992799867411457621453436778985689838608999457677899999999973798985768735899955 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCEEEECHHHHHHH Q ss_conf 96689999999999998798899975896636766655434898899987425325818999889886633429999999 Q gi|254781004|r 561 TSSAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGSTTTQKVSLLDAIPI 640 (652) Q Consensus 561 ~e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~~i~lee~i~~ 640 (652) ++++.+||++|++.|+++|+||++|+|++++|+|||+|++.+|||++|||+||+++++|+||.|+++++..+++++|++. T Consensus 548 ~~~~~~ya~~v~~~L~~~g~rv~vD~~~e~lg~KIR~a~~~kiPy~lVvG~kE~e~~~VsvR~r~~~~~~~~~~~~fi~~ 627 (634) T PRK04483 548 TDAQAEYVDSVRKTLANQGFRVSADLRNEKIGYKIREHTLQRVPYLLVVGDREKENGAVAVRTRSGEDLGTMTVSAFIER 627 (634) T ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEECHHHHHCCEEEEEECCCCCCCEEEHHHHHHH T ss_conf 80489999999999997897899978998574999987433899899981256645978999778974571569999999 Q ss_pred HHHHCC Q ss_conf 987314 Q gi|254781004|r 641 LTKESL 646 (652) Q Consensus 641 l~~e~~ 646 (652) |++|+. T Consensus 628 l~~e~a 633 (634) T PRK04483 628 LQAEQA 633 (634) T ss_pred HHHHHC T ss_conf 998762 No 3 >PRK03772 threonyl-tRNA synthetase; Validated Probab=100.00 E-value=0 Score=1696.90 Aligned_cols=636 Identities=47% Similarity=0.903 Sum_probs=620.4 Q ss_pred CEEEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECCEEEC-CCEEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 7089928998874189888999998636110112899998988997745303-670798628999999999998999999 Q gi|254781004|r 4 DIKLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLSDPVR-EGSIEIITPEDPRSLAVIRHSCAHIMA 82 (652) Q Consensus 4 ~i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~~~l~-d~~i~~i~~~~~eg~~i~~hS~ahlL~ 82 (652) +|+||||||++++++.|+|++|||++|++++++++|||+|||+++||++||+ ||+|+|+|++|++|++++||||||||| T Consensus 1 M~~ItLPDGs~~~~~~~~t~~dIA~~Is~~Lak~avaa~vng~l~Dl~~~i~~d~~le~it~~d~eg~~v~rHS~AHvla 80 (642) T PRK03772 1 MPVITLPDGSQRHFDHPVSPMDVALDIGPGLAKACIAGRVNGELVDACDLIENDAQLSIITAKDEEGLEIIRHSCAHLLG 80 (642) T ss_pred CCEEECCCCCEEECCCCCCHHHHHHHHCHHHHHHHEEEEECCEEEECCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHH T ss_conf 97798789983567899999999999786776662899989999866861579977999358997899999888999999 Q ss_pred HHHHHHCCCCEEEEECCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCCHHHHHHH Q ss_conf 99999779988997243588557884088888898999999999999966998048532767854454416711355665 Q gi|254781004|r 83 EAVQSIWPEVQVAIGPIIEDGFYYDFDKEQPFSSDELAQIEKKMQEIIARDSPFLKQYLSHEKARELFESKKEAYKVEIL 162 (652) Q Consensus 83 ~Av~~l~p~~kl~igp~i~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA~~~F~~~~~~~K~~li 162 (652) +||+++||++|++|||+|++||||||+.++++|++||++||++|++||++|+||+|.++++++|+++|++.+++||++++ T Consensus 81 ~Av~~l~p~~kl~iGP~ie~GFYYDf~~~~~~t~~Dl~~IE~~M~~iik~~~~~~r~~~s~~eA~~~f~~~~~~yK~eli 160 (642) T PRK03772 81 HAIKQLWPHTKMAIGPVIDNGFYYDVDLDRTLTQEDLEALEKRMHELAKTNYDVIKKKVSWQEARDTFAARGESYKVAIL 160 (642) T ss_pred HHHHHHCCCCEEEECCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHCCCHHHHHHH T ss_conf 99999769967845786577157722479999989999999999999846998499996889999999852863554555 Q ss_pred H-HCCCCCCCEEECCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEEECCHHHHHHHHHHHHHH Q ss_conf 4-203455421302575201000235521444222333333211101577564206899999837988999999999975 Q gi|254781004|r 163 E-SIPAKENVTLYRQGEWFDLCRGPHVRSTGQVKKFFKLMKVAGAYWRGDSNRPMLSRIYGTAWNTQQELTQYLYFLEES 241 (652) Q Consensus 163 ~-~~~~~~~v~~y~~g~~~Dlc~GphvpsTg~ik~~fkL~~~sgayw~gd~~~~~l~Riyg~af~~~~~l~~~~~~~eea 241 (652) + .+++++.+++|+||+|+|||+||||||||.||+ |||+++||||||||++|+|||||||||||++++|++||+++||| T Consensus 161 ~e~~~~~~~is~Y~~g~f~DLCrGPHvp~t~~ika-fkl~~vagAYWrGd~~n~~LqRIYG~af~~k~~L~~~l~~~eEa 239 (642) T PRK03772 161 DENISHDDKPGLYHHEEYVDMCRGPHVPNMRFCHH-FKLMKTAGAYWRGDSNNKMLQRIYGTAWADKKALNAYLQRLEEA 239 (642) T ss_pred HHCCCCCCCEEEEECCCEEEECCCCCCCCCCCCCE-EEEEECCHHEECCCCCCCHHEEEEEEECCCHHHHHHHHHHHHHH T ss_conf 42277888267993598789558887787555532-57731232114777657000799998618989999999999999 Q ss_pred HHHHHHHHHHHCCEEEECCCCCCEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 3200887541144044226787435650107899999999999887-689638555432443100011210012555420 Q gi|254781004|r 242 EKRDHRKLAREMDLFHIAEDGSGVIFWHRKGWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFA 320 (652) Q Consensus 242 ~~rdHr~lg~~~~lf~~~~~~~G~~~wlP~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~ 320 (652) ||||||||||+|+||+|+++|||+|||||+|++|++.|++|+|+++ ++||+||.||+|++.+||++||||++|++|||. T Consensus 240 kkRDHRklGkel~LF~f~~~~pG~~fwlP~G~~i~~~le~~~r~~~~~~Gy~eV~TP~i~~~~Lw~~SGH~~~Y~e~Mf~ 319 (642) T PRK03772 240 AKRDHRKIGKQLDLYHMQEEAPGMVFWHNDGWTIFRELEVFVRSKLKEYQYQEVKGPFMMDRVLWEKTGHWDNYADAMFT 319 (642) T ss_pred HHCCHHHHHHHCCCEEECCCCCCCEEECCCHHHHHHHHHHHHHHHHHHCCCEEEECCHHCCHHHHHHCCHHHHHHCCCCC T ss_conf 86588888775220475465887636658588999999999999999869859855432137788734447664103422 Q ss_pred EECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHH Q ss_conf 10014567778237743431257777652012222310432000100012787645200010563533442101053767 Q gi|254781004|r 321 VKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQM 400 (652) Q Consensus 321 ~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~ 400 (652) ++. ++++|+|||||||+||+||++.+|||||||+|++|+|++||||.||+|+||+|||.|+|||||+||++||+ T Consensus 320 ~~~------e~~ey~lKPMNCP~H~~iy~~~~rSYRdLPlR~aEfg~~~R~E~SG~L~GL~RVR~FtqdDaHIFC~~dQi 393 (642) T PRK03772 320 TSS------ENREYAIKPMNCPGHVQIFNQGLKSYRDLPLRMAEFGSCHRNEPSGSLHGLMRVRGFTQDDAHIFCTEEQI 393 (642) T ss_pred CCC------CHHHHCCCCCCCHHHHHHHHCCCCCHHHCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCEEECCHHHH T ss_conf 342------01443046778734889986166434436545554113431666655347456889987884355378999 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 88789999998876655213321000146531125860357789999999998502121256675205555554554321 Q gi|254781004|r 401 FNECLKIHNLIISIYKDFGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFE 480 (652) Q Consensus 401 ~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid 480 (652) ++|+.++++++..+|..|||+.+.+++|++|++++|+++.|+.||..|+++|+..+ ++|.+.+|+||||||||| T Consensus 394 ~~Ei~~~l~~~~~vy~~fGf~~~~~~LStrpek~~G~~e~W~~AE~~L~~aL~~~g------~~y~~~~GegAFYGPKID 467 (642) T PRK03772 394 RDEVNSCIKMVYDMYSTFGFEKIVVKLSTRPEKRIGSDEMWDRAEADLAVALEENN------IPFEYQPGEGAFYGPKIE 467 (642) T ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEEECCCCHHCCCCHHHHHHHHHHHHHHHHHCC------CCEEECCCCCCCCCCCCC T ss_conf 99999999999999997398638999626822025888899999999999999739------970455676344455302 Q ss_pred CCHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEECCCEEEEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEC Q ss_conf 10021103310012110111022346741673179721379964034103789999999973870887337808999857 Q gi|254781004|r 481 YILKDAIGRDWQCGTIQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFGSIERFIGIMIENFKGNLPLWLSPIQAIVTTI 560 (652) Q Consensus 481 ~~~~d~~gr~~~~~tiq~df~~p~~f~l~y~~~dg~~~~pvmihr~~~Gs~eR~ia~liE~~~g~~P~wLAP~QV~Iipi 560 (652) ++++|++||.|||+|+|+||++|+||+++|+++||++++|||||||++||+|||||+|||||+|+||+||||+||+|||| T Consensus 468 ~~v~DalgR~wq~~TiQlDF~lP~RF~l~Y~~~dg~~~~PVmIHRAilGS~ERfiaiLiEh~~G~~P~WLaP~Qv~VipI 547 (642) T PRK03772 468 FTLYDCLDRAWQCGTVQLDFSLPGRLSASYVGEDNERKVPVMIHRAILGSMERFIGILTEEYAGFFPTWLAPVQVVVMNI 547 (642) T ss_pred EEEECCCCCEEEECCCEEECCCHHHCCCEEECCCCCCCCCEEEEECCCCCHHHHHHHHHHHCCCCCCCCCCCCEEEEEEC T ss_conf 59970789768802324300135442888887899857858999246477899999999980799985647604799954 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCEEEECHHHHHHH Q ss_conf 96689999999999998798899975896636766655434898899987425325818999889886633429999999 Q gi|254781004|r 561 TSSAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGSTTTQKVSLLDAIPI 640 (652) Q Consensus 561 ~e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~~i~lee~i~~ 640 (652) ++++.+||++|++.|.++|+||++|+|++++|+|||+|++.+|||++|||++|+++++|+||.|++++++.+++++|++. T Consensus 548 s~~~~~YA~~v~~~L~~~giRv~vD~~~e~l~~KIR~a~~~kiPy~lVvGdkE~~~~~VsvR~r~~~~~~~~~l~~fi~~ 627 (642) T PRK03772 548 TDSQSDYVQEVTKKLQNAGIRVKADLRNEKIGFKIREHTLRRVPYMLVCGDKEVEAGKVAVRTRRGKDLGSMDVNDFIEK 627 (642) T ss_pred CCHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEEEHHHHHCCEEEEEECCCCCCCEEEHHHHHHH T ss_conf 81589999999999997897899978998187999998761799899980246525958999778973571679999999 Q ss_pred HHHHCCCHHHCC Q ss_conf 987314356229 Q gi|254781004|r 641 LTKESLPPDCQK 652 (652) Q Consensus 641 l~~e~~~~d~~~ 652 (652) |++|.....+++ T Consensus 628 l~~ei~~r~l~~ 639 (642) T PRK03772 628 LQQEIRSRSLKQ 639 (642) T ss_pred HHHHHHHHHHHH T ss_conf 999998602544 No 4 >PRK00413 thrS threonyl-tRNA synthetase; Reviewed Probab=100.00 E-value=0 Score=1691.63 Aligned_cols=631 Identities=54% Similarity=0.997 Sum_probs=618.4 Q ss_pred EEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECCEEE-CCCEEEEECCCCHHHHHHHHHHHHHHHHHH Q ss_conf 8992899887418988899999863611011289999898899774530-367079862899999999999899999999 Q gi|254781004|r 6 KLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLSDPV-REGSIEIITPEDPRSLAVIRHSCAHIMAEA 84 (652) Q Consensus 6 ~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~~~l-~d~~i~~i~~~~~eg~~i~~hS~ahlL~~A 84 (652) +||||||++++++.|+|+.|||++|++++++++|||+|||+++||++|| +||+|+|+|++|++|+.|+|||||||||+| T Consensus 1 tItlPDGs~~~~~~~~t~~~iA~~Is~~Lak~avaa~vng~l~Dl~~~l~~d~~leiit~~d~egl~v~rHS~AHvla~A 80 (639) T PRK00413 1 SITFPDGSVREFPKGVTAAEVAASISPSLAKKAVAGKVNGELVDLSTPIEEDATVEIVTADDEDGLEIIRHSAAHVLAQA 80 (639) T ss_pred CEECCCCCEEECCCCCCHHHHHHHHCHHHHHHHEEEEECCEEEECCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHH T ss_conf 96799998246789999999999979676656089998999974686058997899946999789999888899999999 Q ss_pred HHHHCCCCEEEEECCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 99977998899724358855788408888889899999999999996699804853276785445441671135566542 Q gi|254781004|r 85 VQSIWPEVQVAIGPIIEDGFYYDFDKEQPFSSDELAQIEKKMQEIIARDSPFLKQYLSHEKARELFESKKEAYKVEILES 164 (652) Q Consensus 85 v~~l~p~~kl~igp~i~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA~~~F~~~~~~~K~~li~~ 164 (652) |+++||++|+++||+|++||||||+.++++|++||++||++|++||++|+||+|.++|+++|+++|++.+++||+++++. T Consensus 81 v~~l~p~~kl~iGP~ie~GFYYDf~~~~~it~~Dl~~IEk~M~~iik~~~~~~r~~vs~~eA~~~F~~~~~~yK~eli~~ 160 (639) T PRK00413 81 VQRLFPDAKLGIGPVIENGFYYDFDVEEPFTPEDLPAIEKEMKKIIKENLPFTREVVSREEALELFADIGEPYKLELIED 160 (639) T ss_pred HHHHCCCCEEEECCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCCHHHHHHHHH T ss_conf 99977997787789757703672126998998999999999999984799849999189999999986186689999863 Q ss_pred CCCCCCCEEECCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEEECCHHHHHHHHHHHHHHHHH Q ss_conf 03455421302575201000235521444222333333211101577564206899999837988999999999975320 Q gi|254781004|r 165 IPAKENVTLYRQGEWFDLCRGPHVRSTGQVKKFFKLMKVAGAYWRGDSNRPMLSRIYGTAWNTQQELTQYLYFLEESEKR 244 (652) Q Consensus 165 ~~~~~~v~~y~~g~~~Dlc~GphvpsTg~ik~~fkL~~~sgayw~gd~~~~~l~Riyg~af~~~~~l~~~~~~~eea~~r 244 (652) +++++.+++|+||+|+|||+||||||||.||+ |||+++||||||||++|+|||||||||||++++|++||+++|||||| T Consensus 161 ~~~~~~is~Y~~g~f~DLCrGPHv~~t~~ika-fKL~~vsgaYWrGd~~n~~LqRIYG~af~~k~~L~~~l~~~eEakkR 239 (639) T PRK00413 161 IPEGEEITIYRQGEFVDLCRGPHVPSTGKIKA-FKLLKVAGAYWRGDSKNEMLQRIYGTAWADKEDLDAYLHRLEEAEKR 239 (639) T ss_pred CCCCCCEEEEEECCEEEECCCCCCCCCCCCCE-EEEEECCHHHCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHHHHHHC T ss_conf 67998579999155789668887765220445-78753141110788667303899998519989999999999998852 Q ss_pred HHHHHHHHCCEEEECCCCCCEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEEC Q ss_conf 0887541144044226787435650107899999999999887-689638555432443100011210012555420100 Q gi|254781004|r 245 DHRKLAREMDLFHIAEDGSGVIFWHRKGWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKC 323 (652) Q Consensus 245 dHr~lg~~~~lf~~~~~~~G~~~wlP~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~ 323 (652) |||||||+|+||+|+++|||+|||||+|++|++.|++|+|+++ ++||+||.||+|++++||++||||++|++|||+++. T Consensus 240 DHRklGkel~LF~f~~~~pGl~~w~P~G~~i~~~le~~~r~~~~~~Gy~eV~TP~i~~~~Lw~~SGHw~~Y~e~Mf~~~~ 319 (639) T PRK00413 240 DHRKLGKELDLFSFQEEGPGLPFWHPKGWIIRNELEDYMRRRHLKAGYQEVNTPHILDRSLWETSGHWDHYRENMFPTMS 319 (639) T ss_pred CHHHHHHHCCCEEECCCCCCCEEECCCHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHCCCHHHHCCCCCCCEE T ss_conf 88888774351570366887646637689999999999999999869859836413468899654742232254223240 Q ss_pred CCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHH Q ss_conf 14567778237743431257777652012222310432000100012787645200010563533442101053767887 Q gi|254781004|r 324 ADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNE 403 (652) Q Consensus 324 ~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e 403 (652) .++++|+|||||||+|++||++.+|||||||+|++|+|++||||.||+|+||||||+|+|||||+||++||+++| T Consensus 320 -----~~~~~~~lKPMNCP~H~~if~~~~~SYRdLPlR~aEfg~~~R~E~SG~L~GL~RvR~FtqdDaHIFc~~dQi~~E 394 (639) T PRK00413 320 -----DDGEDYALKPMNCPGHILIFKSGLRSYRELPLRLAEFGTVHRYEKSGALHGLMRVRGFTQDDAHIFCTPEQIKDE 394 (639) T ss_pred -----CCCCEEEECCCCCCCHHHHHHCCCCCHHHCCHHHHHHHHHHCCCCCCCCCCEEEEEEEECCCCEEECCHHHHHHH T ss_conf -----152103204668700335644355355547556664113205677874456423300001584132478999999 Q ss_pred HHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 89999998876655213321000146531125860357789999999998502121256675205555554554321100 Q gi|254781004|r 404 CLKIHNLIISIYKDFGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYIL 483 (652) Q Consensus 404 ~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~~ 483 (652) +.++++++..+|..|||..+.+++|++|++++|+++.|+.||..|+++|+..+ ++|.+++|+|||||||||+++ T Consensus 395 i~~~l~~~~~vy~~fGf~~~~~~lStrpek~~g~~e~W~~Ae~~L~~al~~~g------~~y~~~~GegAFYGPKID~~~ 468 (639) T PRK00413 395 LLSVLDLILSVYKDFGLTDYYLELSTRPEKRVGSDELWEKATEALREALEESG------LEYVENPGEGAFYGPKIDVQL 468 (639) T ss_pred HHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCC------CCEEECCCCCCEECCEEEEEE T ss_conf 99999999999997499803899964751135888999999999999999749------980878886111101667785 Q ss_pred HHHCCCCCCCCCCCHHHHCCCCCCCEEEEECCCEEEEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCHH Q ss_conf 21103310012110111022346741673179721379964034103789999999973870887337808999857966 Q gi|254781004|r 484 KDAIGRDWQCGTIQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFGSIERFIGIMIENFKGNLPLWLSPIQAIVTTITSS 563 (652) Q Consensus 484 ~d~~gr~~~~~tiq~df~~p~~f~l~y~~~dg~~~~pvmihr~~~Gs~eR~ia~liE~~~g~~P~wLAP~QV~Iipi~e~ 563 (652) +|++||.|||+|+|+||++|+||+++|+++||++++|||||||++||+|||||+|||||+|+||+||||+||+||||+++ T Consensus 469 ~DalgR~wq~~TiQlDf~lPerF~l~Y~~~dg~~~~PVmIHRAi~GS~ERfiaiLiEh~~G~~P~WLaP~Qv~IipV~~~ 548 (639) T PRK00413 469 KDAIGRTWQCGTIQLDFNLPERFDLEYIAADGEKHRPVMIHRAIFGSIERFIGILIEHYAGAFPLWLAPVQVVVIPITDE 548 (639) T ss_pred ECCCCCEEECCHHHHHHCCHHHCCCEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHCCCCCCCCCCCEEEEEECCHH T ss_conf 14468713200233222245424888998889816308999525565899999999980898873657615899961847 Q ss_pred HHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCEEEECHHHHHHHHHH Q ss_conf 89999999999998798899975896636766655434898899987425325818999889886633429999999987 Q gi|254781004|r 564 AVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGSTTTQKVSLLDAIPILTK 643 (652) Q Consensus 564 ~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~~i~lee~i~~l~~ 643 (652) +.+||++|++.|+++|+||++|+|++++|+|||+|++.+|||++|||++|+++++|+||.|++++++.+++++|++.|++ T Consensus 549 ~~~ya~~v~~~l~~~giRv~vD~~~e~l~~KIR~a~~~kiPy~~VvGdkE~~~~~VsvR~r~~~~~~~~~~~efi~~l~~ 628 (639) T PRK00413 549 HADYAKEVAKALRDAGIRVEVDLRNEKIGYKIREAQLQKVPYMLVVGDKEVEAGTVSVRRRGGGDLGTMPVDEFVEKIKE 628 (639) T ss_pred HHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEEEHHHHHCCEEEEEECCCCCCCEEEHHHHHHHHHH T ss_conf 89999999999997898899978998575999998653799899980036535958999778972465689999999999 Q ss_pred HCCCH Q ss_conf 31435 Q gi|254781004|r 644 ESLPP 648 (652) Q Consensus 644 e~~~~ 648 (652) |.... T Consensus 629 ei~~r 633 (639) T PRK00413 629 EIARR 633 (639) T ss_pred HHHHH T ss_conf 99875 No 5 >PRK12304 thrS threonyl-tRNA synthetase; Reviewed Probab=100.00 E-value=0 Score=1595.41 Aligned_cols=598 Identities=40% Similarity=0.761 Sum_probs=580.3 Q ss_pred CCCCEEEEEECCEEEECCEEE-CCCEEEEECCC-CHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCEEEEEECC Q ss_conf 011289999898899774530-36707986289-9999999999899999999999779988997243588557884088 Q gi|254781004|r 34 LAKKAVAVAINGKVCDLSDPV-REGSIEIITPE-DPRSLAVIRHSCAHIMAEAVQSIWPEVQVAIGPIIEDGFYYDFDKE 111 (652) Q Consensus 34 ~~~~~vaa~vng~l~dL~~~l-~d~~i~~i~~~-~~eg~~i~~hS~ahlL~~Av~~l~p~~kl~igp~i~~GfYyD~~~~ 111 (652) |++++|||+|||+++||++|+ .||++++++++ |++|+.++|||||||||+||+++||++|++|||+|++||||||+.+ T Consensus 1 Lak~~va~~vng~l~Dl~~~~~~d~~~~~i~~~~s~eg~~v~rHS~AHvla~Av~~l~p~~k~~iGP~ie~GFYYDf~~~ 80 (604) T PRK12304 1 MSSELIAYKKDEQIIDLQSAKVLGNGGEPIYFDNSPEALEVIRHSCAHLMAQAIKSLYPDAKFFVGPVVEEGFYYDFRVE 80 (604) T ss_pred CCHHCEEEEECCEEEECCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCEEEEEECCC T ss_conf 93250899989999717876678964899826999789999998799999999999779976987897588115851569 Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCCHHHHHHHHHCCCCCCCEEECCCCCEEECCCCCCCHH Q ss_conf 88889899999999999996699804853276785445441671135566542034554213025752010002355214 Q gi|254781004|r 112 QPFSSDELAQIEKKMQEIIARDSPFLKQYLSHEKARELFESKKEAYKVEILESIPAKENVTLYRQGEWFDLCRGPHVRST 191 (652) Q Consensus 112 ~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA~~~F~~~~~~~K~~li~~~~~~~~v~~y~~g~~~Dlc~GphvpsT 191 (652) .++|++||++||++|++||++|+||+|.++++++|+++|++ ++||+++++.+++ +.+++|+||+|+|||+||||||| T Consensus 81 ~~~t~~Dl~~IE~~M~~iik~~~~f~r~~~~~~eA~~~f~~--~~yK~eli~~~~~-~~is~Y~~g~f~DLCrGPHv~~T 157 (604) T PRK12304 81 SKIGEEDLPKIEKKMKELAKAKLEITKYEITKEEALEKFKN--DDLKQAVLSRIPD-DAVSIYKQGEFEDLCRGPHLPNT 157 (604) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHCC--CCCCHHHHHCCCC-CCEEEEEECCEEEECCCCCCCCC T ss_conf 98998999999999999996899859999779999998605--8404588843678-85699984896686588766772 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCC-CCCEEEEEC Q ss_conf 4422233333321110157756420689999983798899999999997532008875411440442267-874356501 Q gi|254781004|r 192 GQVKKFFKLMKVAGAYWRGDSNRPMLSRIYGTAWNTQQELTQYLYFLEESEKRDHRKLAREMDLFHIAED-GSGVIFWHR 270 (652) Q Consensus 192 g~ik~~fkL~~~sgayw~gd~~~~~l~Riyg~af~~~~~l~~~~~~~eea~~rdHr~lg~~~~lf~~~~~-~~G~~~wlP 270 (652) |.||+ |||+++||||||||++|+||||||||||||+++|++||+++||||||||||||++|+||+|+++ |||+||||| T Consensus 158 ~~ika-fKL~~vsgaYWrGd~~n~~LqRIYG~af~~k~~L~~~l~~~eEakkRDHRklGkel~LF~f~~~~g~G~~~wlP 236 (604) T PRK12304 158 KFLRF-FKLTRVAGAYLGGDEKNEMLTRIYGIAFADKESLKDYLFIIEEAKKRDHRKLGVEMKLFTFDDEIGAGLPIWLP 236 (604) T ss_pred CCCCE-EEEEECCCEEECCCCCCCCEEEEEEEECCCHHHHHHHHHHHHHHHHCCHHHHHHHCCCEECCCCCCCCCEEECC T ss_conf 22626-79853375177578867103899998419999999999999999866788887745515625545887536837 Q ss_pred CHHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHH Q ss_conf 07899999999999887-68963855543244310001121001255542010014567778237743431257777652 Q gi|254781004|r 271 KGWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFN 349 (652) Q Consensus 271 ~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~ 349 (652) +|++|++.|++|+|+++ ++||+||.||+|++.+||++||||++|++|||..+ .++++|+|||||||+||+||+ T Consensus 237 ~G~~i~~~le~~~r~~~~~~Gy~eV~TP~i~~~~Lw~~SGH~~~Y~e~Mf~~~------~d~~~y~lKPMNCP~H~~if~ 310 (604) T PRK12304 237 KGARLRSKLEHLLYKAHRLRGYEPVRGPEILKSDVWKISGHYANYKENMYFTT------IDEQEYGIKPMNCVGHIKVYQ 310 (604) T ss_pred CHHHHHHHHHHHHHHHHHHCCCEEECCCCCCCHHHHHCCCHHHHHCCCCCEEE------CCCCEEECCCCCCHHHHHHHH T ss_conf 75499999999999999982996803865455888730551532105764450------577433036779574999986 Q ss_pred HCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCC Q ss_conf 01222231043200010001278764520001056353344210105376788789999998876655213321000146 Q gi|254781004|r 350 HGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDFGFEKIMVKLST 429 (652) Q Consensus 350 ~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~ 429 (652) +.+|||||||+|++|+|++||||.||+|+||||||+|+|||||+||++||+++|+.++++++..+|..|||. +.+.+|| T Consensus 311 ~~~rSYRdLPlR~aEfg~~hR~E~SG~L~GL~RvR~FtqDDaHIFc~~dQi~~Ei~~~l~~~~~vy~~fGf~-~~~~lst 389 (604) T PRK12304 311 SDLRSYRDLPLKFFEYGVVHRHEKSGVLHGLFRVREFTQDDAHIFCMPSQIKEEVLEILDFVDKIMKAFDFS-YEMEIST 389 (604) T ss_pred CCCCCHHHCCHHHHHCCCEECCCCCCCCCCCCCCCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHCCCE-EEEEECC T ss_conf 277444435555454353202677766346455778977884465489999999999999999999745961-9999768 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCCCCE Q ss_conf 53112586035778999999999850212125667520555555455432110021103310012110111022346741 Q gi|254781004|r 430 RPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYILKDAIGRDWQCGTIQVDFNLPSRFNAF 509 (652) Q Consensus 430 ~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~~~d~~gr~~~~~tiq~df~~p~~f~l~ 509 (652) +|++++|+++.|+.||..|+++|+..+ ++|.+++|+|||||||||++++|++||.|||+|+|+||++|+||+++ T Consensus 390 rpek~~G~de~W~~Ae~~L~~aL~~~g------~~y~~~~GegAFYGPKID~~v~DalgR~wq~~TIQlDf~lPeRF~l~ 463 (604) T PRK12304 390 KPAKAIGDDEVWEKATKALKEALDENG------LKYGIDEGGGAFYGPKIDIKITDALKRKWQCGTIQVDFNLPERFKLE 463 (604) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHCC------CCCCCCCCCCCCEECCEEEEEECCCCCHHHHHHHHHHHHCHHHCCCE T ss_conf 863347988999999999999999649------98665748552032110179862248898877888876364315738 Q ss_pred EEEECCCEEEEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEECCCC Q ss_conf 67317972137996403410378999999997387088733780899985796689999999999998798899975896 Q gi|254781004|r 510 YVNSHSEKCHPVMIHRAVFGSIERFIGIMIENFKGNLPLWLSPIQAIVTTITSSAVEYAQEIANLLKSHHLSIETDFRNE 589 (652) Q Consensus 510 y~~~dg~~~~pvmihr~~~Gs~eR~ia~liE~~~g~~P~wLAP~QV~Iipi~e~~~eya~~i~~~L~~~girV~~Ddr~~ 589 (652) |+++||++++|||||||++||+|||||+|||||+|+||+||||+||+||||++++.+||++|++.|.++|+||++|+|++ T Consensus 464 Y~~~dg~~~rPVmIHRAilGS~ERfiaiLiEh~~G~~P~WLaP~Qv~IipV~~~~~~YA~~v~~~L~~~girv~~D~~~e 543 (604) T PRK12304 464 YTDENNEKKQPVMIHRAILGSFERFIGILTEHFAGEFPFFIAPTQVAIVPINEAHHVYAKELQKELLKLDIDSEVYDKNE 543 (604) T ss_pred EECCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEECCCC T ss_conf 98889983677899720357789999999998089998575761689996077789999999999998898799738999 Q ss_pred CCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCEEEECHHHHHHHHHHHCCCH Q ss_conf 63676665543489889998742532581899988988663342999999998731435 Q gi|254781004|r 590 TINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGSTTTQKVSLLDAIPILTKESLPP 648 (652) Q Consensus 590 ~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~~i~lee~i~~l~~e~~~~ 648 (652) ++|+|||+|++.+|||++|||+||+++++|+||.|+++++..+++++|++.|++|+..+ T Consensus 544 ~l~~KIR~a~~~kiPy~lVvGdkE~~~~~vsvR~r~~~~~~~~~~~efi~~l~~eI~~~ 602 (604) T PRK12304 544 SLNKKIRTAEKQRVPMILVLGDEEVENRSVAIRDRRAREQYNLSLDEFLNLVKEKMSEV 602 (604) T ss_pred CHHHHHHHHHHCCCCEEEEEEHHHHHCCEEEEEECCCCCCCEEEHHHHHHHHHHHHHHC T ss_conf 77699999764389989998446765596889977887457062999999999997442 No 6 >PRK12305 thrS threonyl-tRNA synthetase; Reviewed Probab=100.00 E-value=0 Score=1484.98 Aligned_cols=557 Identities=39% Similarity=0.721 Sum_probs=534.5 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHH Q ss_conf 99999989999999999977998899724358855788408888889899999999999996699804853276785445 Q gi|254781004|r 70 LAVIRHSCAHIMAEAVQSIWPEVQVAIGPIIEDGFYYDFDKEQPFSSDELAQIEKKMQEIIARDSPFLKQYLSHEKAREL 149 (652) Q Consensus 70 ~~i~~hS~ahlL~~Av~~l~p~~kl~igp~i~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA~~~ 149 (652) ..-+|||||||||+||++|||++|+++||+|++||||||+.+.+||++||++||++|++||++++||+|.++++ +. T Consensus 4 ~~~lRHS~AHila~Av~~l~p~~kl~iGP~ie~GFYYDf~~~~~~t~~Dl~~IE~~m~~iik~~~~~~r~~~~~----~~ 79 (576) T PRK12305 4 DKKLNHSTSHLLAAAILKLYPNVKLGIGPAIDEGFYYDFDFEDPLSESDLLKIEKLMKKLASQNLKMVKVDGAN----YD 79 (576) T ss_pred CCCCCHHHHHHHHHHHHHHCCCCEEEECCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEHHHH----HH T ss_conf 52003359999999999977997699789868866686246999998999999999999985899879833507----86 Q ss_pred HHHCCCHHHHHHHHHCCC-CCCCEEECCCC------CEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEE Q ss_conf 441671135566542034-55421302575------20100023552144422233333321110157756420689999 Q gi|254781004|r 150 FESKKEAYKVEILESIPA-KENVTLYRQGE------WFDLCRGPHVRSTGQVKKFFKLMKVAGAYWRGDSNRPMLSRIYG 222 (652) Q Consensus 150 F~~~~~~~K~~li~~~~~-~~~v~~y~~g~------~~Dlc~GphvpsTg~ik~~fkL~~~sgayw~gd~~~~~l~Riyg 222 (652) |+ +++||+++++.++. ++.+++|.+|+ |+|||+||||||||.||+ |||+++||||||||++|+||||||| T Consensus 80 f~--~~~yK~eli~~~~~~~~~i~~y~~~~~~~~~~f~DLCrGpHv~~t~~ika-fkL~~vsgaYWrGd~~n~~LqRIYG 156 (576) T PRK12305 80 FK--NQPYKKELYEELKKKGEEITFYSIVDPKNKELFFDLCAGNHVESTKEIKN-FKLLSIAGAYWRGDSKNKQLTRIYG 156 (576) T ss_pred HC--CCCHHHHHHHHCCCCCCEEEEEECCCCCCCCCEEECCCCCCCCCCCCCCE-EEEEECCCCEECCCCCCCEEEEEEE T ss_conf 75--89079999852734696488996367666772658789988897232424-6886436630357875810089999 Q ss_pred EEECCHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCC-CCCEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEECCCCC Q ss_conf 983798899999999997532008875411440442267-87435650107899999999999887-6896385554324 Q gi|254781004|r 223 TAWNTQQELTQYLYFLEESEKRDHRKLAREMDLFHIAED-GSGVIFWHRKGWKIFQTLISYMRRKI-KDDYEEINTPQVL 300 (652) Q Consensus 223 ~af~~~~~l~~~~~~~eea~~rdHr~lg~~~~lf~~~~~-~~G~~~wlP~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~ 300 (652) |||||+++|++||+++||||||||||||++|+||+|+++ |||+|||||+|++|++.|++|+|+++ ++||+||.||+|+ T Consensus 157 ~af~~k~~L~~yl~~~eEakkRDHRklGkel~LF~f~~~~gpG~~~wlP~G~~i~~~le~~~r~~~~~~Gy~eV~TP~i~ 236 (576) T PRK12305 157 TSWESKEELDKYLAILQERKERDHRKIGKDLKLFTFSSLAGQGFPIWLENGMIIKNAIRKFILELDRKYGFDEVLTPHFG 236 (576) T ss_pred EEECCHHHHHHHHHHHHHHHHCCHHHHHHHCCEEEECCCCCCCCEEECCCHHHHHHHHHHHHHHHHHHCCCEEEECCCCC T ss_conf 97179889999999999998657787642156587234468876467476789999999999999998699898464214 Q ss_pred CCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEE Q ss_conf 43100011210012555420100145677782377434312577776520122223104320001000127876452000 Q gi|254781004|r 301 DQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGL 380 (652) Q Consensus 301 ~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL 380 (652) +++||++||||++|++|||+++. .++++|+|||||||+||+||++.+|||||||+|++|+|++||||.||+++|| T Consensus 237 ~~~Lw~~SGH~~~Y~e~Mf~~~~-----~~~e~~~lKPMNCP~H~~if~~~~rSYRdLPlR~aEfg~~~R~E~SG~L~GL 311 (576) T PRK12305 237 EKKLYKTSGHLDHYKDDMFKPLK-----IENEELIPRPMTCPHHIILYSNELRSYRDLPIRLSEQSRLYRYEKSGALTGL 311 (576) T ss_pred CHHHHHHCCCHHHHHHHCCCCEE-----CCCCEEEECCCCCHHHHHHHHCCCCCHHHCCHHHHCCCCEECCCCCCCCCCC T ss_conf 28899766646556642573063-----1564165357786326777642775444324220004435447888655575 Q ss_pred EECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCC----CCCCCCCHHHHHHHHHHHHHHHHHHC Q ss_conf 10563533442101053767887899999988766552133210001465----31125860357789999999998502 Q gi|254781004|r 381 MRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDFGFEKIMVKLSTR----PEKRVGSDALWDDAENIMKGVLDTIK 456 (652) Q Consensus 381 ~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~----~~~~~g~~~~~~~ae~~l~~~l~~~~ 456 (652) ||||+|+|||||+||++||+.+|+.++++++..+|..|||+.+.+.+|++ +++++|+++.|+.||.+|+++|+..+ T Consensus 312 ~RvR~FtqDDAHIFct~dQi~~Ei~~~l~~i~~vy~~fGf~~~~~~lS~r~~~~~ek~~g~de~W~~Ae~~L~~al~~~g 391 (576) T PRK12305 312 ERVRAMDLTEGHIFVRKDQIESEVKHLFKMIQEALSIFKIKIFYISLSLRDPNDKEKFFDDDQMWEQAEADLKKMLKDNK 391 (576) T ss_pred CCCCCCCCCCCCEEECHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHCC T ss_conf 54668876871212089999999999999999999853987315899943898875345972769999999999999749 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEECCCEEEEEEEEECCCHHHHHHHH Q ss_conf 12125667520555555455432110021103310012110111022346741673179721379964034103789999 Q gi|254781004|r 457 NSSQDRINTGVLLGEGAFYGPKFEYILKDAIGRDWQCGTIQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFGSIERFIG 536 (652) Q Consensus 457 ~~~~~~~~~~~~~g~~afygpkid~~~~d~~gr~~~~~tiq~df~~p~~f~l~y~~~dg~~~~pvmihr~~~Gs~eR~ia 536 (652) ++|.+++|++||||||||++++|++||.|||+|+|+||++|+||+++|+++||++++|||||||++||+||||| T Consensus 392 ------~~y~~~~GegAFYGPKID~~v~DalgR~wq~~TIQlDf~lPeRF~l~Y~~~dg~~~rPVmIHRAilGS~ERfig 465 (576) T PRK12305 392 ------INYKEMIGEAAFYGPKIDFQVKTVLNKEITVSTIQLDFLLPEKFNISYINENNEKETPVLIHRGLIGTYERFIA 465 (576) T ss_pred ------CCCEECCCCCCEECCCCCEEEECCCCCEEEEEEEEEEECCHHHCCCEEECCCCCCCCCEEEEECCCCCHHHHHH T ss_conf ------97142306501056763469850678678742456663458557988877888767875987335474899999 Q ss_pred HHHHHHCCCCCCCCCCEEEEEEECCHH-HHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHH Q ss_conf 999973870887337808999857966-8999999999999879889997589663676665543489889998742532 Q gi|254781004|r 537 IMIENFKGNLPLWLSPIQAIVTTITSS-AVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIICGDKEAS 615 (652) Q Consensus 537 ~liE~~~g~~P~wLAP~QV~Iipi~e~-~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e 615 (652) +|||||+|+||+||||+||+||||+++ +.+||++|++.|.++|+||++|+|++++|+|||+|++.+|||++|||+||++ T Consensus 466 iLiEh~~G~~P~WLaP~Qv~IipV~e~~~~~ya~~v~~~L~~~giRv~vD~~~e~lg~KIR~a~~~kiPy~lVvGdkE~~ 545 (576) T PRK12305 466 ILLEQTKGNLPFWLSPKQVIIIPVNENLNTDYAKELNKKLFKLGFRSEIDLRNERLSKKIREAQISKIKFQIIIGDEEIK 545 (576) T ss_pred HHHHHCCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEEEHHHHH T ss_conf 99997079998362730379995386035999999999999789879997899967699999753499989998003554 Q ss_pred CCCEEEEECCCCCEEEECHHHHHHHHHHH Q ss_conf 58189998898866334299999999873 Q gi|254781004|r 616 ERSIGIRRFGSTTTQKVSLLDAIPILTKE 644 (652) Q Consensus 616 ~~~Vtir~R~~~~~~~i~lee~i~~l~~e 644 (652) +++|+||.|+++++..|++++|++.|++- T Consensus 546 ~~~VsvR~r~~~~~~~~~l~efi~~l~~l 574 (576) T PRK12305 546 NNTITYRKYGSEETTTVSIEDFIKMLKNL 574 (576) T ss_pred CCEEEEEECCCCCCCEEEHHHHHHHHHHH T ss_conf 59689997788855546899999999973 No 7 >TIGR00418 thrS threonyl-tRNA synthetase; InterPro: IPR002320 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Threonyl-tRNA synthetase (6.1.1.3 from EC) exists as a monomer and belongs to class IIa. The enzyme from Escherichia coli represses the translation of its own mRNA. The crystal structure of the complex between tRNA(Thr) and ThrRS show structural features that reveal novel strategies for providing specificity in tRNA selection. These include an amino-terminal domain containing a novel protein fold that makes minor groove contacts with the tRNA acceptor stem. The enzyme induces a large deformation of the anticodon loop, resulting in an interaction between two adjacent anticodon bases, which accounts for their prominent role in tRNA identity and translational regulation. A zinc ion found in the active site is implicated in amino acid recognition/discrimination . The zinc ion may act to ensure that only amino acids that possess a hydroxyl group attached to the beta-position are activated .; GO: 0004829 threonine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006435 threonyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=1445.19 Aligned_cols=561 Identities=48% Similarity=0.900 Sum_probs=545.1 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHC Q ss_conf 99899999999999779988997243588557884088888898999999999999966998048532767854454416 Q gi|254781004|r 74 RHSCAHIMAEAVQSIWPEVQVAIGPIIEDGFYYDFDKEQPFSSDELAQIEKKMQEIIARDSPFLKQYLSHEKARELFESK 153 (652) Q Consensus 74 ~hS~ahlL~~Av~~l~p~~kl~igp~i~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA~~~F~~~ 153 (652) |||+||||++||+++||++++++||++++||||||+.+.++|++||++||++|++++++++||.|.++|+++|.++|+.. T Consensus 1 rh~~Ah~l~~a~k~~~~~~~~~~gp~~e~GFyyD~~~~~~~~~~dl~~ie~~m~~~~~~~~~~~~~~~s~~~A~~~f~~~ 80 (595) T TIGR00418 1 RHSIAHLLAEAVKQLYPDVKLAVGPVIEDGFYYDFEVDKSITEEDLEKIEKEMKEIIKKNYPIARLSVSLEEALEAFKEV 80 (595) T ss_pred CCCHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCHHCCHHHHHHHHHHHHHHHHHHCCEEEEEECHHHHHHHHHHH T ss_conf 95012463899970478718998255478754552000002778999999999999864057025772478999999862 Q ss_pred CCHHHHHHHHH-CCCCCCCEEECCCC-CEEECCCCCCCHHHHHH---HHHHHHHHHHHCCCCCCCCCEEEEEEE----EE Q ss_conf 71135566542-03455421302575-20100023552144422---233333321110157756420689999----98 Q gi|254781004|r 154 KEAYKVEILES-IPAKENVTLYRQGE-WFDLCRGPHVRSTGQVK---KFFKLMKVAGAYWRGDSNRPMLSRIYG----TA 224 (652) Q Consensus 154 ~~~~K~~li~~-~~~~~~v~~y~~g~-~~Dlc~GphvpsTg~ik---~~fkL~~~sgayw~gd~~~~~l~Riyg----~a 224 (652) +++||.+|++. .+.+..+++|.+++ |+|+|+|||||+||.++ ++|||+++|||||+||++|+||||||| || T Consensus 81 ~~~~k~~l~~~~~~~~~~~~~y~~~~~f~dlC~GpHlp~~~~~~~~~~~fkL~~~~gaYw~Gd~~n~~LqRiyGn~~~~~ 160 (595) T TIGR00418 81 GEPLKLELLDEIIPNGEAVTVYGQGEAFVDLCRGPHLPNTSRIKPAQKAFKLEKVAGAYWRGDSKNEMLQRIYGNIDITA 160 (595) T ss_pred CCHHHHHHHHHCCCCCCCEEEECCCCEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEECCCEEEEC T ss_conf 13122677851045666420422521345615797683235667640345665432331026889853788857254501 Q ss_pred ECCHHHHHHHHHHHHHHHHH--HHHHHHHHCCEEEECC-CCCCEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEECCCCC Q ss_conf 37988999999999975320--0887541144044226-787435650107899999999999887-6896385554324 Q gi|254781004|r 225 WNTQQELTQYLYFLEESEKR--DHRKLAREMDLFHIAE-DGSGVIFWHRKGWKIFQTLISYMRRKI-KDDYEEINTPQVL 300 (652) Q Consensus 225 f~~~~~l~~~~~~~eea~~r--dHr~lg~~~~lf~~~~-~~~G~~~wlP~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~ 300 (652) |+++++|++|+.++|||++| |||||||+|+||+|++ .|||+|+|||+|++|++.|++|++.+. ++||++|.||+|+ T Consensus 161 ~~~k~~L~~yl~~~eEa~~r~PDHRklGKeL~Lf~f~~~~gpG~~~WlPkG~~ir~~ledf~~~~~~~~Gy~~V~TP~m~ 240 (595) T TIGR00418 161 FADKKQLAEYLKRLEEAKKREPDHRKLGKELELFSFEPEIGPGLPFWLPKGALIRNLLEDFVREKQIKRGYEEVETPIMY 240 (595) T ss_pred CCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCCCCCCCCCCCEEECCCHHHHHHHHHHHHHHHHHHCCCCCCCCCEEC T ss_conf 45878999999999997426857256754531234583204874146576467899999999999987588641574121 Q ss_pred CCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHE-EEECCCCCCCCEE Q ss_conf 431000112100125554201001456777823774343125777765201222231043200010-0012787645200 Q gi|254781004|r 301 DQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFG-SVYRNEPSGSLHG 379 (652) Q Consensus 301 ~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~-~~~R~E~sg~l~G 379 (652) +.+||++|||||||+|+||+.+.. ++++|+||||||||||+||++..+||||||+|+||+| ++||+|.||+|+| T Consensus 241 ~~~l~~~sGH~~~Y~e~Mf~~~~~-----~~~~~~LkPMnCpgH~~i~k~~~~SYR~LP~R~aE~g~~~hR~E~sG~L~G 315 (595) T TIGR00418 241 DSELWEISGHWDHYKERMFPFTEE-----DNQEFMLKPMNCPGHILIFKSSLRSYRDLPLRIAELGTTVHRYEKSGALHG 315 (595) T ss_pred CCCCHHHHHHHHHCCCCCCCCEEE-----CCCCCCCCCCCCHHHHHHHCCCCCCHHHCCCHHHHCCCEEEEEECCCCCCH T ss_conf 565424530545034235401010-----353033456775445330017777732335034331860478842542101 Q ss_pred EEECCCCCCCCCEEECC-HHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCC-CC-C--CCCCHHHHHHHHHHHHHHHHH Q ss_conf 01056353344210105-3767887899999988766552133210001465-31-1--258603577899999999985 Q gi|254781004|r 380 LMRVRGFTQDDAHVFCT-KEQMFNECLKIHNLIISIYKDFGFEKIMVKLSTR-PE-K--RVGSDALWDDAENIMKGVLDT 454 (652) Q Consensus 380 L~R~ReF~~~Dah~F~~-~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~-~~-~--~~g~~~~~~~ae~~l~~~l~~ 454 (652) |+|||+|||+||||||+ .||+.+|+.++++++..||.+|||+.+.+.||+| |+ . ++|+++.|+.|+..|++++.. T Consensus 316 L~RvR~FT~dDaHifc~d~dQi~~E~~~~~~l~~~v~~~fgf~~~~~~LS~R~Pe~~~Ky~g~d~~W~~~~~~l~~~l~~ 395 (595) T TIGR00418 316 LKRVRGFTQDDAHIFCTDEDQIKSEIKNQFRLIQKVYSKFGFSNYKYELSTRDPENDKKYIGEDELWEKAESALEEALKE 395 (595) T ss_pred HHHHHHCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHC T ss_conf 01644306998524537867799999999999999988728842168998717775455458878999999999999850 Q ss_pred HCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEC--CCEEEEEEEEECCCHHHH Q ss_conf 021212566752055555545543211002110331001211011102234674167317--972137996403410378 Q gi|254781004|r 455 IKNSSQDRINTGVLLGEGAFYGPKFEYILKDAIGRDWQCGTIQVDFNLPSRFNAFYVNSH--SEKCHPVMIHRAVFGSIE 532 (652) Q Consensus 455 ~~~~~~~~~~~~~~~g~~afygpkid~~~~d~~gr~~~~~tiq~df~~p~~f~l~y~~~d--g~~~~pvmihr~~~Gs~e 532 (652) .+ ++|++.+|++||||||||+++.|+++|.|||+|||+||++|+||+++|+++| |+.++|||||||++||+| T Consensus 396 ~~------~~yei~~G~~AFYGPKiDf~~~DaL~R~~q~~TvQlDf~lPeRF~l~Y~~~dqd~~~~~Pv~IHra~~GS~E 469 (595) T TIGR00418 396 LG------VDYEIDEGEGAFYGPKIDFQIKDALGREWQCATVQLDFELPERFDLTYINEDQDNKEKRPVMIHRAILGSIE 469 (595) T ss_pred CC------CCEEEECCCCEEECCEEEEEEEECCCCHHHCCCEEEECCCCCCCCCEEEEECCCCCCCCCEEEECCCCHHHH T ss_conf 79------840550788113022268887510176425881488715887203148420367870067789804400589 Q ss_pred HHHHHHHHHHCCCC-------CCCCCCEEEEEEECC-HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCC Q ss_conf 99999999738708-------873378089998579-6689999999999998798899975896636766655434898 Q gi|254781004|r 533 RFIGIMIENFKGNL-------PLWLSPIQAIVTTIT-SSAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIP 604 (652) Q Consensus 533 R~ia~liE~~~g~~-------P~wLAP~QV~Iipi~-e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP 604 (652) ||||+|+|||+|.| |+||||+||+||||+ ++|.+||.+|+++|+++||||.+|+|++|+|+|||+|++.+|| T Consensus 470 Rfi~iL~E~~~G~fde~~PMLP~WLaP~QV~viPV~i~~h~~yA~kv~~~L~~~giRv~~D~r~e~lgkKIR~a~~~kiP 549 (595) T TIGR00418 470 RFIAILLEKYAGAFDERLPMLPLWLAPVQVVVIPVSIEKHLDYAKKVAEKLKKEGIRVDVDDRNERLGKKIREAQKQKIP 549 (595) T ss_pred HHHHHHHHHHHHHHHCCCCCCCEECCCEEEEEEECCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHCCCCC T ss_conf 99999998752221146898854217623899705778899999999999985797798838888013320220132206 Q ss_pred EEEEECHHHHHC--CCEEEEECC---CCCEEEECHHHHHHHHHHHC Q ss_conf 899987425325--818999889---88663342999999998731 Q gi|254781004|r 605 IIIICGDKEASE--RSIGIRRFG---STTTQKVSLLDAIPILTKES 645 (652) Q Consensus 605 ~~ivIG~ke~e~--~~Vtir~R~---~~~~~~i~lee~i~~l~~e~ 645 (652) |++|||++|+++ ++||||.|+ +++...|+++||+++|+.++ T Consensus 550 y~~V~Gd~E~e~~K~~v~vR~~~t~k~~~~~~m~~~E~~~~~~~~~ 595 (595) T TIGR00418 550 YMLVVGDKEMEELKEAVNVRTRKTLKGQKLEKMSLDEFLEKVRKEV 595 (595) T ss_pred EEEEECCHHHHCCCCEEEECCHHHHCCCCEEEECHHHHHHHHHHCC T ss_conf 7999873023138966852240210463101430999999997339 No 8 >COG0441 ThrS Threonyl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=1289.10 Aligned_cols=583 Identities=51% Similarity=0.935 Sum_probs=560.6 Q ss_pred CEEEEEECCEEEECCEEECCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCEEEEEECCCCCCH Q ss_conf 28999989889977453036707986289999999999989999999999977998899724358855788408888889 Q gi|254781004|r 37 KAVAVAINGKVCDLSDPVREGSIEIITPEDPRSLAVIRHSCAHIMAEAVQSIWPEVQVAIGPIIEDGFYYDFDKEQPFSS 116 (652) Q Consensus 37 ~~vaa~vng~l~dL~~~l~d~~i~~i~~~~~eg~~i~~hS~ahlL~~Av~~l~p~~kl~igp~i~~GfYyD~~~~~~it~ 116 (652) .++++.+|| .+|+.. +.++..+.+|+.++|||+||||++||+++||+ +++||+|++||||||+.+.+||+ T Consensus 1 ~~~~~~~~~-~~~~~~-------~~~~~~~~~~~~~~rhs~ah~l~~av~~l~p~--~~~gp~ie~gfyyd~~~~~~~~~ 70 (589) T COG0441 1 KALAIHVDG-ELDLKD-------EIITAEDEEGLEIIRHSCAHVLAQAVKRLYPD--VTIGPVIEEGFYYDFDVKEPITP 70 (589) T ss_pred CEEEEECCC-CEECCC-------CCCCCCCCCCHHHEEHHHHHHHHHHHHHHCCC--CCCCCCCCCEEEEEECCCCCCCH T ss_conf 905897266-142220-------42576674213430066889999999975888--65467613405884025888898 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCCHHHHHHHHHCCCCCCCEEECCCCCEEECCCCCCCHHHHHHH Q ss_conf 89999999999999669980485327678544544167113556654203455421302575201000235521444222 Q gi|254781004|r 117 DELAQIEKKMQEIIARDSPFLKQYLSHEKARELFESKKEAYKVEILESIPAKENVTLYRQGEWFDLCRGPHVRSTGQVKK 196 (652) Q Consensus 117 ~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA~~~F~~~~~~~K~~li~~~~~~~~v~~y~~g~~~Dlc~GphvpsTg~ik~ 196 (652) +||.+||++|++|++++++|+|..+|+++|++.|. +||+++|+..+ ..+++|++|+|+|||+||||||||++ + T Consensus 71 ~dl~~ie~~m~~i~~~~~~~~~~~~~~e~a~~~f~----~yK~~~i~~~~--~~~s~y~~~~~~dlc~gph~~~t~~i-~ 143 (589) T COG0441 71 EDLLKIEKEMKEIAKENLPIEREVVSREEARAPFG----PYKAELIDCKG--HPLSEYSQGEFVDLCRGPHVPSTGKI-A 143 (589) T ss_pred HHHHHHHHHHHHHHHHCCCEEEEEECHHHHHHHHH----HHHHHHHHCCC--CCCCEEECCCCCCCCCCCCCCCCCEE-E T ss_conf 89999999999999835772899804999987766----65799986279--87506735760003678888765337-8 Q ss_pred HHHHHHHHHHCCCCCCCCCEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECC-CCCCEEEEECCHHHH Q ss_conf 3333332111015775642068999998379889999999999753200887541144044226-787435650107899 Q gi|254781004|r 197 FFKLMKVAGAYWRGDSNRPMLSRIYGTAWNTQQELTQYLYFLEESEKRDHRKLAREMDLFHIAE-DGSGVIFWHRKGWKI 275 (652) Q Consensus 197 ~fkL~~~sgayw~gd~~~~~l~Riyg~af~~~~~l~~~~~~~eea~~rdHr~lg~~~~lf~~~~-~~~G~~~wlP~G~~i 275 (652) |||+++|||||+||++|+||||||||||+++++|++|++++|||++|||||||++|+||+|++ .+||+++|+|+|++| T Consensus 144 -fkl~~~~~ayw~gd~~~~~l~riygta~~~~~~l~~~l~~~eea~krdHrklg~el~LF~~~~~~~~G~~~~~pkG~~i 222 (589) T COG0441 144 -FKLLKLAGAYWRGDENNEMLQRIYGTAFADKKELEAYLKRLEEAKKRDHRKLGKELDLFSFSPEEGPGLPFWHPKGATI 222 (589) T ss_pred -EEEEEECCHHHCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHCCEEECCCCCCCCEEECCCCCCH T ss_conf -8988812453336877721067733355788999999766665027864768776214341444588506987884669 Q ss_pred HHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCC Q ss_conf 999999999887-6896385554324431000112100125554201001456777823774343125777765201222 Q gi|254781004|r 276 FQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKS 354 (652) Q Consensus 276 ~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~S 354 (652) ++.|++|+|... .+||++|.||+|.+.+||++||||++|+++||.++.. +++|+|||||||+|++||++..+| T Consensus 223 r~~le~y~~~~~~~~Gy~~V~TP~~~~~~l~~~SGH~~~y~e~mf~~~~~------~~~~~lKpmNCpgh~~ifk~~~~S 296 (589) T COG0441 223 RNLLEDYVRTKLRSYGYQEVKTPVLADLELWELSGHWDNYKEDMFLTESD------DREYALKPMNCPGHILIFKSGLRS 296 (589) T ss_pred HHHHHHHHHHHHHHCCCEEECCCEEEECCCCHHCCCHHHCCCCCEEECCC------CHHHEEEECCCHHHHHHHHCCCCC T ss_conf 99999999989875585271287363000010112045513660662137------723306410787689998537861 Q ss_pred HHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCC Q ss_conf 23104320001000127876452000105635334421010537678878999999887665521332100014653112 Q gi|254781004|r 355 YRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDFGFEKIMVKLSTRPEKR 434 (652) Q Consensus 355 YrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~ 434 (652) ||+||+|++|+|.+||+|+||+++||||+|+|||+|||+||+++|+.+|+.++++++..+|.+||+..+.+.+|++| ++ T Consensus 297 YR~LP~r~~E~g~v~R~E~SGal~GL~RvR~ftqdD~Hifc~~dQi~~E~~~~~~~i~~v~~~fg~~~y~~~ls~r~-k~ 375 (589) T COG0441 297 YRELPLRLAEFGYVYRYEKSGALHGLMRVRGFTQDDAHIFCTPDQIKDEFKGILELILEVYKDFGFTDYEVKLSTRP-KF 375 (589) T ss_pred EECCCHHHHHCCEEECCCCCCHHHCCCCCCCEEECCCCEECCHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCC-CC T ss_conf 10044043314422115676422224001132245620140467889999999999999998469753799996477-64 Q ss_pred CCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEEC Q ss_conf 58603577899999999985021212566752055555545543211002110331001211011102234674167317 Q gi|254781004|r 435 VGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYILKDAIGRDWQCGTIQVDFNLPSRFNAFYVNSH 514 (652) Q Consensus 435 ~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~~~d~~gr~~~~~tiq~df~~p~~f~l~y~~~d 514 (652) +|+++.|+.|++.|.++++..+ +++...+|++||||||+|+++.|++||.|||+|+|+||++|+||+++|+++| T Consensus 376 ig~d~~W~~a~~~l~~al~~~~------~~~~~~~G~gaFyGPKid~~v~Dalgr~~q~~TIQlDf~lpeRF~l~Yv~~d 449 (589) T COG0441 376 IGSDEMWDKAEAALREALKEIG------VEYVEEPGEGAFYGPKIDFQVKDALGREWQLGTIQLDFNLPERFDLEYVDED 449 (589) T ss_pred CCCHHHHHHHHHHHHHHHHHHC------CEEEECCCCEEEECCCCCEEEEECCCCCEECCEEEEECCCHHHCEEEEECCC T ss_conf 6873556778899998887608------6036468735797766424887356861005617870377242147998577 Q ss_pred CCEEEEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHH Q ss_conf 97213799640341037899999999738708873378089998579668999999999999879889997589663676 Q gi|254781004|r 515 SEKCHPVMIHRAVFGSIERFIGIMIENFKGNLPLWLSPIQAIVTTITSSAVEYAQEIANLLKSHHLSIETDFRNETINYK 594 (652) Q Consensus 515 g~~~~pvmihr~~~Gs~eR~ia~liE~~~g~~P~wLAP~QV~Iipi~e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~K 594 (652) |++++|||||||++||+|||+|+|+||++|.||+||||+||+||||++++.+||.+|++.|+++|+||++|+|++++|+| T Consensus 450 ~~~~~PvmiHrai~GSiERfi~iLiE~~~G~~P~WLaPvQv~VipV~~~~~~ya~~v~~~L~~~giRvdvD~~~e~l~kK 529 (589) T COG0441 450 GEKKRPVIIHRAILGSIERFIGILLEHYAGALPTWLAPVQVRVIPVADEHLDYAKEVAEKLRKAGIRVDIDDRNEKLGKK 529 (589) T ss_pred CCCCCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEECHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHH T ss_conf 87538789992464148999999998646988765785179999937377789999999999729702412566606899 Q ss_pred HHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCEEEECHHHHHHHHHHHCCCHHH Q ss_conf 66554348988999874253258189998898866334299999999873143562 Q gi|254781004|r 595 IREHSIKKIPIIIICGDKEASERSIGIRRFGSTTTQKVSLLDAIPILTKESLPPDC 650 (652) Q Consensus 595 ir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~~i~lee~i~~l~~e~~~~d~ 650 (652) ||+|...+|||+||||++|+++++|++|+|++.++..|++++|++.++++...... T Consensus 530 IR~a~~~kipyiiVvG~kE~e~~~v~vR~r~~~~~~~~~l~e~i~~ik~e~~~~~~ 585 (589) T COG0441 530 IREAGTQKIPYVIVVGDKEVETGTVVVRRRGGKQQKSMTLEELVEELKKEIEGRPN 585 (589) T ss_pred HHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCC T ss_conf 99877628787999731432038717997267755634699999999987644557 No 9 >KOG1637 consensus Probab=100.00 E-value=0 Score=1142.08 Aligned_cols=546 Identities=39% Similarity=0.737 Sum_probs=524.1 Q ss_pred EEECCCCCEEE-CCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECCEEEC-CCEEEEECCCCHHHHHHHHHHHHHHHHH Q ss_conf 89928998874-189888999998636110112899998988997745303-6707986289999999999989999999 Q gi|254781004|r 6 KLTFPDGSIKN-FPVHATGSDVAESISRSLAKKAVAVAINGKVCDLSDPVR-EGSIEIITPEDPRSLAVIRHSCAHIMAE 83 (652) Q Consensus 6 ~I~lpDG~~~~-~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~~~l~-d~~i~~i~~~~~eg~~i~~hS~ahlL~~ 83 (652) .++||||.+.+ ...+|||++||.. +++++++++.++|||.+|||++|++ || +++++++|++|..+||||+||+|+. T Consensus 2 ~~~Lpdg~~~~~~~w~ttp~~ia~~-s~~la~~~~~~~vn~~~~Dl~rp~e~~~-lell~f~~~~~k~vfwhssahvlg~ 79 (560) T KOG1637 2 IIVLPDGKVVEGVSWETTPYDIACQ-SKGLADDAVIAKVNGVLWDLDRPLEGDC-LELLKFDDDEGKDVFWHSSAHVLGE 79 (560) T ss_pred EEECCCCCEEEEEECCCCHHHHHHH-CCCHHHHHHHHHHCCCEECCCCCCHHHH-HHHCCCCCCCCCEEEEEHHHHHHHH T ss_conf 0556885565112336784677641-0221666587762681312578550557-8870678723301244255667668 Q ss_pred HHHHHCCCCEEEEECCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCCHHHHHHHH Q ss_conf 99997799889972435885578840888888989999999999999669980485327678544544167113556654 Q gi|254781004|r 84 AVQSIWPEVQVAIGPIIEDGFYYDFDKEQPFSSDELAQIEKKMQEIIARDSPFLKQYLSHEKARELFESKKEAYKVEILE 163 (652) Q Consensus 84 Av~~l~p~~kl~igp~i~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA~~~F~~~~~~~K~~li~ 163 (652) |+.+.|+ +++++||++++|||||+ . ..++.+|++.||. T Consensus 80 a~e~~~g-~~lc~Gpp~~~gf~yd~-~-~~i~~~d~~~~e~--------------------------------------- 117 (560) T KOG1637 80 ALEQEYG-AHLCIGPPIEEGFYYDM-L-DEISSNDFPSIEA--------------------------------------- 117 (560) T ss_pred HHHHHCC-EEEEECCCCCCCEEHHH-H-CCCCCCCCCCCCC--------------------------------------- T ss_conf 8887608-46750798767530000-0-3335222542235--------------------------------------- Q ss_pred HCCCCCCCEEECCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEEECCHHHHHHHHHHHHHHHH Q ss_conf 20345542130257520100023552144422233333321110157756420689999983798899999999997532 Q gi|254781004|r 164 SIPAKENVTLYRQGEWFDLCRGPHVRSTGQVKKFFKLMKVAGAYWRGDSNRPMLSRIYGTAWNTQQELTQYLYFLEESEK 243 (652) Q Consensus 164 ~~~~~~~v~~y~~g~~~Dlc~GphvpsTg~ik~~fkL~~~sgayw~gd~~~~~l~Riyg~af~~~~~l~~~~~~~eea~~ 243 (652) |||++||.+|+ ||+++.|++||.|| +||++++|++|++++|||++ T Consensus 118 ----------------------phi~~tg~ika-~k~~~nss~yw~~~------------sfp~~k~~k~~~~~~~Ea~~ 162 (560) T KOG1637 118 ----------------------PHIRHTGKIKA-FKILKNSSAYWEGD------------SFPDPKQLKEWEKFQEEAKK 162 (560) T ss_pred ----------------------CCCCCCCEEEE-EEEECCCHHHHCCC------------CCCCHHHHHHHHHHCHHHHH T ss_conf ----------------------55554340121-23303522333166------------68987899999864555441 Q ss_pred HHHHHHHHHCCEEEECCCCCCEEEEECCHHHHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEE Q ss_conf 008875411440442267874356501078999999999998876-8963855543244310001121001255542010 Q gi|254781004|r 244 RDHRKLAREMDLFHIAEDGSGVIFWHRKGWKIFQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVK 322 (652) Q Consensus 244 rdHr~lg~~~~lf~~~~~~~G~~~wlP~G~~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~ 322 (652) ||||+||++|+||||++.|||.+||||+|++|+|+|.+++|.+++ +||+||.||+|+++.||++||||+||.+|||.++ T Consensus 163 rdHRkig~~qeLfff~~lSPGS~FflP~G~~iyN~Lv~fir~ey~~rGf~EVitPniy~~~LWe~SGHwqnY~enmF~~e 242 (560) T KOG1637 163 RDHRKIGKEQELFFFHELSPGSCFFLPHGTRIYNTLVDFIRAEYRKRGFTEVITPNIYNKKLWETSGHWQNYSENMFKFE 242 (560) T ss_pred HHHHHHHHHHHHEEECCCCCCCEEECCCCCHHHHHHHHHHHHHHHHCCCCEECCCCHHHHHHHHHCCCHHHHHHHCEEEE T ss_conf 00555402222024135798624531486427889999999998753874322722121566653252555443101231 Q ss_pred CCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHH Q ss_conf 01456777823774343125777765201222231043200010001278764520001056353344210105376788 Q gi|254781004|r 323 CADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFN 402 (652) Q Consensus 323 ~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~ 402 (652) . ++++|+|||||||+||+||++..||||+||+|++.||.+||||.||+++||+|+|+|.||||||||+++|+.+ T Consensus 243 ~------eke~~~LKPMNCPgHcLmf~~r~rS~reLPlR~aDFg~LHRnE~SGaLsGLTRvRrFqQDDaHIFCt~~Qi~~ 316 (560) T KOG1637 243 V------EKEEFALKPMNCPGHCLMFAHRDRSYRELPLRFADFGVLHRNEASGALSGLTRVRRFQQDDAHIFCTPDQVKE 316 (560) T ss_pred E------CHHHHCCCCCCCCCCCCCCCCCCCCHHHCCCCCCCCCEEEECCCCCCCCCCCEEEEECCCCCEEEECCCCHHH T ss_conf 0------1033225866897530133667742744784226752253054564232210110110157158855763799 Q ss_pred HHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 78999999887665521332100014653112586035778999999999850212125667520555555455432110 Q gi|254781004|r 403 ECLKIHNLIISIYKDFGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYI 482 (652) Q Consensus 403 e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~ 482 (652) |+.++++++..+|..|||. +.+.+||||++++|+.+.|+.||..|+++|+..|. ++.+++|+|||||||||++ T Consensus 317 Eik~~l~fl~~vY~~fgf~-f~l~lSTRPe~~lG~l~~Wd~AE~~L~~al~e~g~------pw~lN~GDGAFYGPKIDi~ 389 (560) T KOG1637 317 EIKGCLDFLDYVYGVFGFT-FKLNLSTRPEKFLGDLETWDEAEFKLEEALNESGE------PWVLNPGDGAFYGPKIDIT 389 (560) T ss_pred HHHHHHHHHHHHHHHCCCC-CEEEECCCHHHHCCCHHHHHHHHHHHHHHHHHHCC------CCEECCCCCCEECCEEEEE T ss_conf 9987999999998612542-02574167587665788899999999999997099------7142289853026433257 Q ss_pred HHHHCCCCCCCCCCCHHHHCCCCCCCEEEEECCCEEEEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCH Q ss_conf 02110331001211011102234674167317972137996403410378999999997387088733780899985796 Q gi|254781004|r 483 LKDAIGRDWQCGTIQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFGSIERFIGIMIENFKGNLPLWLSPIQAIVTTITS 562 (652) Q Consensus 483 ~~d~~gr~~~~~tiq~df~~p~~f~l~y~~~dg~~~~pvmihr~~~Gs~eR~ia~liE~~~g~~P~wLAP~QV~Iipi~e 562 (652) +.|+++|.+||.|||+||++|.||+|.|.+++|..++|||||||++||+||++|+|+|+++|+||+||+|.|+.||||++ T Consensus 390 l~Dal~r~hQcaTIQLDFqLP~rFdL~y~~~~g~~erPVmIHRAIlGSvERmiaiL~E~~~gkwPFWlSPRq~~vIpVse 469 (560) T KOG1637 390 LDDALGRKHQCATIQLDFQLPIRFDLEYETEDGDLERPVMIHRAILGSVERMIAILLESYGGKWPFWLSPRQAVVIPVSE 469 (560) T ss_pred HHHHCCCCCCEEEEEECCCCHHHCCCEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCEEEEEECCC T ss_conf 24332761010256631367333175110466652312358998752199999999997079998445462389997787 Q ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCC---EEEECHHHHHH Q ss_conf 689999999999998798899975896636766655434898899987425325818999889886---63342999999 Q gi|254781004|r 563 SAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGSTT---TQKVSLLDAIP 639 (652) Q Consensus 563 ~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~---~~~i~lee~i~ 639 (652) ....||.+|...|..+|+-|++|.++.++++|+|+|++..+.|++|||+||++.++|+||.|++.+ +..++++++++ T Consensus 470 ~~~~ya~~V~~ql~~a~f~~Dld~t~~tl~kkir~Aqla~yn~i~VVGdkE~~~~~vnVr~Rd~~~~~~~~~~tie~~~~ 549 (560) T KOG1637 470 GPLDYATSVQKQLEEAGFYVDLDPTDSTLRKKIRNAQLAHYNFIFVVGDKEVETGRVNVRTRDNRDNKTESEMTIEELSD 549 (560) T ss_pred CHHHHHHHHHHHHHHHHCEEECCCCCCHHHHHHHHHHHCCCEEEEEECHHHHHCCCEEEEECCCCCCCCCHHEEHHHHHH T ss_conf 40338999999998620066147763568888866232250299997545653274789943575543210232999999 Q ss_pred HHHH Q ss_conf 9987 Q gi|254781004|r 640 ILTK 643 (652) Q Consensus 640 ~l~~ 643 (652) .+.+ T Consensus 550 ~~~~ 553 (560) T KOG1637 550 EFKE 553 (560) T ss_pred HHHH T ss_conf 9998 No 10 >PRK03991 threonyl-tRNA synthetase; Validated Probab=100.00 E-value=0 Score=846.79 Aligned_cols=539 Identities=22% Similarity=0.374 Sum_probs=416.5 Q ss_pred ECCCEEEEECCCCHHH---HHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHH Q ss_conf 0367079862899999---9999998999999999997799889972435885578840888888989999999999999 Q gi|254781004|r 54 VREGSIEIITPEDPRS---LAVIRHSCAHIMAEAVQSIWPEVQVAIGPIIEDGFYYDFDKEQPFSSDELAQIEKKMQEII 130 (652) Q Consensus 54 l~d~~i~~i~~~~~eg---~~i~~hS~ahlL~~Av~~l~p~~kl~igp~i~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii 130 (652) +++|-|-|++....|. ..+....+.-+.-.| .++ .-.++.+-| |--.+.+- =+++.-.+|=+.|.+.. T Consensus 34 ~e~~lv~F~aVE~~De~~p~~v~~~av~ei~~~~-~~v-k~~~ivlYP------yAHLSs~L-a~P~~A~~vL~~le~~L 104 (613) T PRK03991 34 LEEALVVFTAVEKGDESNPEEVVEKAVEEIIKVA-EKV-KAENIVLYP------YAHLSSDL-ASPKVAVEVLKKLEEEL 104 (613) T ss_pred CCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHHH-HHC-CCCEEEEEC------CCCCCCCC-CCHHHHHHHHHHHHHHH T ss_conf 3321899998716887898999999999999999-855-987799945------10025876-89689999999999998 Q ss_pred -HCCCCEEEEECCHHHHHHHHHHCCCHHHHHHHHHCCCCCCCEEECCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHCCC Q ss_conf -6699804853276785445441671135566542034554213025752010002355214442223333332111015 Q gi|254781004|r 131 -ARDSPFLKQYLSHEKARELFESKKEAYKVEILESIPAKENVTLYRQGEWFDLCRGPHVRSTGQVKKFFKLMKVAGAYWR 209 (652) Q Consensus 131 -~~~~pi~r~~vs~~eA~~~F~~~~~~~K~~li~~~~~~~~v~~y~~g~~~Dlc~GphvpsTg~ik~~fkL~~~sgayw~ 209 (652) +.++.+.|--.-.-.|.++= -.|-|. .||.+.+...+.-.. ..-..++ .||. T Consensus 105 ~~~~~eV~rAPFGwyK~F~i~-ckGHPL-sElSR~i~~~~~~~~---------------~~~~~~~----------~~~~ 157 (613) T PRK03991 105 KSEGYEVYRAPFGWYKAFKIS-CKGHPL-SELSRTITPEEEEVV---------------KEEEEIR----------SYWY 157 (613) T ss_pred HHCCCEEEECCCCCCEEEEEE-ECCCCC-HHHCCCCCCCCCHHH---------------HHHHHCC----------CEEE T ss_conf 647954997176420225777-357440-896420366610222---------------3133134----------4079 Q ss_pred CCCCCCEEEEEEEEEECCHHHHHHHHH-----HHHHHHHHHHHHHHHHCCEEEECCCC-CCEEEEECCHHHHHHHHHHHH Q ss_conf 775642068999998379889999999-----99975320088754114404422678-743565010789999999999 Q gi|254781004|r 210 GDSNRPMLSRIYGTAWNTQQELTQYLY-----FLEESEKRDHRKLAREMDLFHIAEDG-SGVIFWHRKGWKIFQTLISYM 283 (652) Q Consensus 210 gd~~~~~l~Riyg~af~~~~~l~~~~~-----~~eea~~rdHr~lg~~~~lf~~~~~~-~G~~~wlP~G~~i~~~ie~~i 283 (652) --...-.+.-+--..|..-+.|+...+ ....-+...|.+|.++++|.-..+.| +|.-.|+|+|+.|++.|++|+ T Consensus 158 il~pdg~~~~~~~~~~~~~~~l~~l~~~e~~~~~~~~~epphv~~mr~~~~~dyEp~Sd~G~~rw~PkG~~i~~lle~Yv 237 (613) T PRK03991 158 ILTPDGELHDVEDFDFTGYEDLKALVDKEVFKKELVGGEPPHVELMRKKELADYEPGSDVGHMRYYPKGRLMKDLLEDYV 237 (613) T ss_pred EECCCCCCCCHHHHCCCCCHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCEEECCCHHHHHHHHHHHH T ss_conf 98899875464462311357799999987413445778983889998717655574787775123554789999999999 Q ss_pred HHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHH Q ss_conf 9887-689638555432443100011210012555420100145677782377434312577776520122223104320 Q gi|254781004|r 284 RRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRL 362 (652) Q Consensus 284 r~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl 362 (652) |... .+||+||.||+|++.+||+.||||++|+++||.+.. ++++|+|||||||+|++||+++.+||||||+|+ T Consensus 238 ~~~~~~~G~~eV~TP~m~d~~~~~~sgH~~~f~e~my~~~~------d~~~~~LrpmnCpgh~~i~~~~~~SYRdLPlRl 311 (613) T PRK03991 238 YNLVKELGAMPVETPNMYDLSHPAIREHADKFGERQYRVKS------DKKDLMLRFAACFGQFAMLKDMTISYKNLPLKM 311 (613) T ss_pred HHHHHHCCCEEEECCCEEECCCCHHHHHHHHHCCCCEEEEC------CCHHHCCCCCCCCCCEEEECCCCCCHHHCCHHH T ss_conf 99998639668457616405540255688761113425632------763323044578631235102445743264677 Q ss_pred HHHEEE-ECCCCCCCCEEEEECCCCCCCCCEEECCH-HHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCHHH Q ss_conf 001000-12787645200010563533442101053-7678878999999887665521332100014653112586035 Q gi|254781004|r 363 AEFGSV-YRNEPSGSLHGLMRVRGFTQDDAHVFCTK-EQMFNECLKIHNLIISIYKDFGFEKIMVKLSTRPEKRVGSDAL 440 (652) Q Consensus 363 ~e~~~~-~R~E~sg~l~GL~R~ReF~~~Dah~F~~~-eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~~g~~~~ 440 (652) ||+|++ ||||+||+|+||+|||+|||+||||||++ +|+.+|+.++++++..+|.+||++. .+..+. +.++ T Consensus 312 ~E~g~~syR~E~SG~L~GL~RVR~FTqdDaHIFC~~~dQi~~E~~~~~~li~~v~~~fG~~y-~~~~r~-------t~e~ 383 (613) T PRK03991 312 YELSDYSFRLEQRGELVGLKRLRAFTMPDMHTLCKDMDEAMEEFEKQHKLILETGRDLGRDY-EVAFRF-------TEDF 383 (613) T ss_pred HHHCCCEEECCCCCCEECEEEEECCCCCCCEECCCCHHHHHHHHHHHHHHHHHHHHHCCCCE-EEEECC-------CHHH T ss_conf 65224202125677444401332356686113168778999999999999999998639763-799707-------8888 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEECCCEEEE Q ss_conf 77899999999985021212566752055555545543211002110331001211011102234674167317972137 Q gi|254781004|r 441 WDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYILKDAIGRDWQCGTIQVDFNLPSRFNAFYVNSHSEKCHP 520 (652) Q Consensus 441 ~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~~~d~~gr~~~~~tiq~df~~p~~f~l~y~~~dg~~~~p 520 (652) |+.++.+|.++++..+.+. .+...+|+++|||||||+++.|++||.|||+|+|+||++|+||+++|+++||++++| T Consensus 384 w~~ae~~l~~~~~~~g~~~----~~e~~~g~~~fyGPKIDf~v~DalGR~wq~~TiQlDf~lp~RF~l~Yv~edg~~~~P 459 (613) T PRK03991 384 YEENKDWIVELAKREGKPV----LVEILPERKHYWVLKVEFAIIDSLGRPIENPTVQIDVENAERFGIKYVDENGKEKYP 459 (613) T ss_pred HHHHHHHHHHHHHHCCCCC----EEEECCCCCCEECCCEEEEEEECCCCCEEEEEEEEECCCHHHCCCEEECCCCCCCCC T ss_conf 8899999999999729971----476658763143141689998468972450158983278322687899899989588 Q ss_pred EEEEECCCHHHHHHHHHHHHHHCC--------CCCCCCCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCC Q ss_conf 996403410378999999997387--------088733780899985796689999999999998798899975896636 Q gi|254781004|r 521 VMIHRAVFGSIERFIGIMIENFKG--------NLPLWLSPIQAIVTTITSSAVEYAQEIANLLKSHHLSIETDFRNETIN 592 (652) Q Consensus 521 vmihr~~~Gs~eR~ia~liE~~~g--------~~P~wLAP~QV~Iipi~e~~~eya~~i~~~L~~~girV~~Ddr~~~~G 592 (652) ||||||++||+|||||+||||++| .||+||||+||+||||++++.+||++|++.|+++||||++|||++++| T Consensus 460 vmiHrAi~GS~ERfi~~LiE~~ag~~~~Gk~p~~P~WLaP~Qv~iiPVs~~~~~ya~~v~~~L~~~giRv~vD~r~e~lg 539 (613) T PRK03991 460 VILHCSPTGSIERVIYALLEKAAKEEKEGKVPMLPTWLSPVQVRVIPVSEEHLDYAEEVADKLEAAGIRVDVDDRDESLG 539 (613) T ss_pred EEEECCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEECHHHHHHHHHHHHHHHHCCCEEEEECCCCCHH T ss_conf 58972665639999999999850101247878888787860379997073689999999999997798899877888561 Q ss_pred HHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCEEEECHHHHHHHHHHHCC Q ss_conf 766655434898899987425325818999889886633429999999987314 Q gi|254781004|r 593 YKIREHSIKKIPIIIICGDKEASERSIGIRRFGSTTTQKVSLLDAIPILTKESL 646 (652) Q Consensus 593 ~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~~i~lee~i~~l~~e~~ 646 (652) +|||+|++.+|||++|||++|+++|+|+||.|+++++..|++|+|++.|++|+. T Consensus 540 kKIR~A~~~kiPyi~VvGdkE~e~gtvsVR~R~~~~~~~msleeli~~i~~ev~ 593 (613) T PRK03991 540 KKIRDAGKEWVPYVVVIGDKEMESGKLTVTVREESEKEEMTLEELIERIKEETK 593 (613) T ss_pred HHHHHHHHCCCCEEEEEEHHHHHCCEEEEEECCCCCCCCCCHHHHHHHHHHHHC T ss_conf 899987543588899993135424937999877886161229999999999971 No 11 >cd00771 ThrRS_core Threonyl-tRNA synthetase (ThrRS) class II core catalytic domain. ThrRS is a homodimer. It is responsible for the attachment of threonine to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs in the core domain. Probab=100.00 E-value=0 Score=781.67 Aligned_cols=296 Identities=53% Similarity=1.008 Sum_probs=287.9 Q ss_pred HHHHHHHHCCEEEECC-CCCCEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEE Q ss_conf 0887541144044226-787435650107899999999999887-68963855543244310001121001255542010 Q gi|254781004|r 245 DHRKLAREMDLFHIAE-DGSGVIFWHRKGWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVK 322 (652) Q Consensus 245 dHr~lg~~~~lf~~~~-~~~G~~~wlP~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~ 322 (652) ||||||++|+||+|.+ .|||+|+|||+|++|+++|++++|+++ ++||+||.||+|++++||++||||++|++|||.++ T Consensus 1 dHrklg~el~lf~~~~~~g~G~~~w~P~G~~i~~~L~~~ir~~~~~~Gy~eV~tP~i~~~~Lw~~SGH~~~y~e~mf~~~ 80 (298) T cd00771 1 DHRRLGGELELFFFFDEAGPGLPFWLPKGAIIRNELEDFLRELQRKRGYQEVETPIIYNKELWETSGHWDHYRENMFPFE 80 (298) T ss_pred CHHHHCCCCCCCCCCCCCCCCCEEECCCHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHCCCCCCCCCCEEEEE T ss_conf 94574401157511777667657883888999999999999999986998998861266556754776311368568888 Q ss_pred CCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHH Q ss_conf 01456777823774343125777765201222231043200010001278764520001056353344210105376788 Q gi|254781004|r 323 CADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFN 402 (652) Q Consensus 323 ~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~ 402 (652) . ++++|+|||||||+|++||++.+|||||||+|++|+|++||||+||+++||+|+|+|+|+|||+||++||+.+ T Consensus 81 ~------~~~~~~LkPmnCp~h~~i~~~~~~SYrdLPlR~~E~g~~~R~E~sG~L~GL~RvR~FtqdDaHiFc~~eQ~~~ 154 (298) T cd00771 81 E------EDEEYGLKPMNCPGHCLIFKSKPRSYRDLPLRLAEFGTVHRYEQSGALHGLTRVRGFTQDDAHIFCTPDQIKE 154 (298) T ss_pred E------CCCCEEEECCCCHHHHHHHHCCCCCHHHCCHHHHCCCCEECCCCCCCCCCCCCCCCCCCCCCEEECCHHHHHH T ss_conf 6------2541688506868899998635567343467652075055476776755744555674576324416889999 Q ss_pred HHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 78999999887665521332100014653112586035778999999999850212125667520555555455432110 Q gi|254781004|r 403 ECLKIHNLIISIYKDFGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYI 482 (652) Q Consensus 403 e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~ 482 (652) |+.++++++..+|.+||+..+.+++|++|++++|+++.|+.|+..|+++++..+ ++|.+++|++||||||||++ T Consensus 155 E~~~~~~~~~~vy~~fG~~~~~~~ls~r~ek~~g~~e~W~~ae~~l~~~l~~~~------~~~~~~~ge~AFyGPKiD~~ 228 (298) T cd00771 155 EIKGVLDLIKEVYSDFGFFDYKVELSTRPEKFIGSDEVWEKAEAALREALEEIG------LPYEINEGEGAFYGPKIDFH 228 (298) T ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCC------CCCEECCCCCEEEEEEEEEE T ss_conf 999999999999998288732489964787677998999999999999999759------98575788600130058999 Q ss_pred HHHHCCCCCCCCCCCHHHHCCCCCCCEEEEECCCEEEEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 0211033100121101110223467416731797213799640341037899999999738708873378 Q gi|254781004|r 483 LKDAIGRDWQCGTIQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFGSIERFIGIMIENFKGNLPLWLSP 552 (652) Q Consensus 483 ~~d~~gr~~~~~tiq~df~~p~~f~l~y~~~dg~~~~pvmihr~~~Gs~eR~ia~liE~~~g~~P~wLAP 552 (652) ++|++||.|||+|+|+||++|+||+++|+++||++++|||||||++||+|||||+||||++|+||+|||| T Consensus 229 ~~D~lgR~~q~~TiQlDf~lp~rF~l~Y~~~dg~~~~Pv~iHrai~GS~ERfia~LiE~~~G~~P~WLaP 298 (298) T cd00771 229 VKDALGREWQCSTIQLDFNLPERFDLTYIGEDGEKKRPVMIHRAILGSIERFIGILIEHYAGKFPLWLAP 298 (298) T ss_pred EEECCCCEEEEEEEEEEECCHHHCCCEEECCCCCCCCCEEEEECCCCHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 8606897687014688506743269889989998648989995576709999999998607998866593 No 12 >PRK12325 prolyl-tRNA synthetase; Provisional Probab=100.00 E-value=0 Score=753.43 Aligned_cols=366 Identities=20% Similarity=0.280 Sum_probs=315.1 Q ss_pred EEECCCCCCEEEEECCHHHHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEE Q ss_conf 442267874356501078999999999998876-8963855543244310001121001255542010014567778237 Q gi|254781004|r 256 FHIAEDGSGVIFWHRKGWKIFQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTF 334 (652) Q Consensus 256 f~~~~~~~G~~~wlP~G~~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~ 334 (652) =+|+++|||+|+|||+|++|+++|+++||+++. .|||||.||+|++++||++||||++|+++||.+ .++++++| T Consensus 30 g~i~~~~~Gl~~wlP~G~~i~~~le~~ir~e~~~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~-----~d~~~~~~ 104 (438) T PRK12325 30 GMIRQQAAGIYSWLPLGLKVLKKIENIVREEQNRAGAIEILMPTIQPADLWRESGRYDAYGKEMLRI-----KDRHDREM 104 (438) T ss_pred CCCCCCCCCCCEECCCHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHCCHHHHHHHHCEE-----ECCCCCEE T ss_conf 7756346873221605899999999999999997499799877658778998628764555742121-----34777743 Q ss_pred EECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEE-CCHHHHHHHHHHHHHHHHH Q ss_conf 743431257777652012222310432000100012787645200010563533442101-0537678878999999887 Q gi|254781004|r 335 ALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVF-CTKEQMFNECLKIHNLIIS 413 (652) Q Consensus 335 ~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F-~~~eq~~~e~~~~~~~~~~ 413 (652) ||||||||+|+.||++.++||||||+|+||+|++||||+|| ++||+|+|||+|+|||+| |+.+++.+++..+...|.. T Consensus 105 ~l~Pmnc~~~~~i~~~~~~SYRdLPlrl~q~g~~~R~E~sp-~~GLlRvReF~m~Dah~F~~~~e~~~~~~~~v~~~y~~ 183 (438) T PRK12325 105 LYGPTNEEMITDIFRSYVKSYKDLPLNLYHIQWKFRDEIRP-RFGVMRGREFLMKDAYSFDLDEEGARKSYNRMFVAYLR 183 (438) T ss_pred CCCCCCCHHHHHHHHHCCCCHHHCCHHHHHHHHEECCCCCC-CCCCCEEEEEECCCHHHCCCCHHHHHHHHHHHHHHHHH T ss_conf 13899849999999740342654257887644401120267-64410122333041543246889999999999999999 Q ss_pred HHHHHCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCC------------ Q ss_conf 66552133210001465311258603577899999999985021212566752055555545543211------------ Q gi|254781004|r 414 IYKDFGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEY------------ 481 (652) Q Consensus 414 v~~~~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~------------ 481 (652) +|.+||+..+.+++++++.+...++++|..++.+...++.+ ...++.+++++++++ T Consensus 184 if~~lgl~~~~v~ad~g~igg~~s~Ef~~~~~~ged~i~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 251 (438) T PRK12325 184 TFKRLGLKAIPMRADTGPIGGDLSHEFHILAETGESTVFYD------------KDFLDLPVPGEDIDYDADLQDIVDEWT 251 (438) T ss_pred HHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHCCCHHEEEC------------HHHHHCCCCCCCCCHHHHHHHHHHHHH T ss_conf 99980994489982367667751267476652265024661------------333422567655462356678998887 Q ss_pred ---CHHHHCCCCCCCC-----------------CCCHHHHCCCCCCCEEEEECCCEEEEEEEEECCCH-HHHHHHHHHHH Q ss_conf ---0021103310012-----------------11011102234674167317972137996403410-37899999999 Q gi|254781004|r 482 ---ILKDAIGRDWQCG-----------------TIQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFG-SIERFIGIMIE 540 (652) Q Consensus 482 ---~~~d~~gr~~~~~-----------------tiq~df~~p~~f~l~y~~~dg~~~~pvmihr~~~G-s~eR~ia~liE 540 (652) ...+..+..++|+ ++|++..++++|+++|.++||+++++ |+|||| |+|||||+||| T Consensus 252 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~iEvG~if~lg~~ys~~~~~~~~~~dg~~~~~---~m~~~GIgieR~ia~liE 328 (438) T PRK12325 252 SLYAATDEMHDEAKFAAVPEERRLSARGIEVGHIFYFGTKYSEPMNAKVQGPDGKEVPV---HMGSYGIGVSRLVAAIIE 328 (438) T ss_pred HHHHHHHHHCCHHHCCCCCHHHHEECCCEEEEEEEECCCCCCHHCCCCCCCCCCCCCCE---EEECCCCCHHHHHHHHHH T ss_conf 65034664346755376675663011330331243045301522067444877773222---554133428789999999 Q ss_pred HH----CCCCCCCCCCEEEEEEEC---CHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHH Q ss_conf 73----870887337808999857---96689999999999998798899975896636766655434898899987425 Q gi|254781004|r 541 NF----KGNLPLWLSPIQAIVTTI---TSSAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIICGDKE 613 (652) Q Consensus 541 ~~----~g~~P~wLAP~QV~Iipi---~e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke 613 (652) ++ |++||+||||+||+|||| ++++.+||++|++.|.++|+||.+|||++|+|+|||+|+++||||+||||+|| T Consensus 329 ~~~d~~G~~wP~wlAP~QV~Iipi~~~~~~~~~~A~~l~~~L~~~girv~~Ddr~e~lG~Kir~a~l~giPy~ivVG~ke 408 (438) T PRK12325 329 ASHDDKGIIWPESVAPFKVGIINLKQGDAACDAACEKLYAALTAAGVDVLYDDTDERAGAKFATMDLIGLPWQIIVGPKG 408 (438) T ss_pred HHCCCCCCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEEECCH T ss_conf 83646885467674865799997069979999999999999997899899989999972999999866999899990780 Q ss_pred HHCCCEEEEECCCCCEEEECHHHHHHHHH Q ss_conf 32581899988988663342999999998 Q gi|254781004|r 614 ASERSIGIRRFGSTTTQKVSLLDAIPILT 642 (652) Q Consensus 614 ~e~~~Vtir~R~~~~~~~i~lee~i~~l~ 642 (652) +++++|+||.|++++++.|++++|++.|+ T Consensus 409 ~e~g~V~vR~R~~~~~~~v~ide~i~~lk 437 (438) T PRK12325 409 LAEGKVELKDRKTGEREELSVEAAINRLK 437 (438) T ss_pred HHCCEEEEEECCCCCEEEEEHHHHHHHHC T ss_conf 54698999998899606764999999860 No 13 >PRK09194 prolyl-tRNA synthetase; Provisional Probab=100.00 E-value=0 Score=652.53 Aligned_cols=392 Identities=19% Similarity=0.279 Sum_probs=324.2 Q ss_pred HHHHHHHHHHHHHCCEEEECCCCCCEEEEECCHHHHHHHHHHHHHHHHHC-CCEEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 75320088754114404422678743565010789999999999988768-96385554324431000112100125554 Q gi|254781004|r 240 ESEKRDHRKLAREMDLFHIAEDGSGVIFWHRKGWKIFQTLISYMRRKIKD-DYEEINTPQVLDQHLWQQSGHWDSYRANM 318 (652) Q Consensus 240 ea~~rdHr~lg~~~~lf~~~~~~~G~~~wlP~G~~i~~~ie~~ir~~~~~-G~~eV~tP~l~~~~Lw~~SGh~~~y~~~m 318 (652) +|+-..|+-|=+ .=++.+.++|+|.|||+|+|++++|+++||+||++ |+|||.||.|+|++||++||||+.|+++| T Consensus 17 dae~~Sh~Ll~r---aG~Ir~~~sGiy~~LPlg~rvl~Kie~IIr~em~~iGa~Ev~mp~l~p~elW~~sgR~~~~g~el 93 (570) T PRK09194 17 DAEVISHQLMLR---AGYIRKLASGIYTYLPLGLRVLRKIENIVREEMNKIGAQEVLMPALQPAELWQESGRWEKYGPEL 93 (570) T ss_pred CCCHHHHHHHHH---CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHCCCHHHCCHHH T ss_conf 324566999987---06734143772554243699999999999999986498798535679778987618821236134 Q ss_pred CEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECC-H Q ss_conf 201001456777823774343125777765201222231043200010001278764520001056353344210105-3 Q gi|254781004|r 319 FAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCT-K 397 (652) Q Consensus 319 f~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~-~ 397 (652) |. .+||++++|+|+||||+..+.+.++.++||||||+++||||+|||||+|| |+||||+|||+|||||||+. . T Consensus 94 ~r-----~kDR~~~~~~L~PThEE~it~lv~~~i~SYkqLP~~lYQIqtKfRDE~RP-R~GllR~REF~MKDaYSFd~~~ 167 (570) T PRK09194 94 LR-----LKDRHGRDFVLGPTHEEVITDLVRNEIKSYKQLPLNLYQIQTKFRDEIRP-RFGLMRGREFIMKDAYSFHADE 167 (570) T ss_pred EE-----EECCCCCEEEECCCCHHHHHHHHHHHHCCHHHCCHHEECCCCEECCCCCC-CCCCHHHHHHHHHCCCCCCCCH T ss_conf 69-----85278985653787389999999985113764680200132333477887-6762345653451245666998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCC------------- Q ss_conf 7678878999999887665521332100014653112586035778999999999850212125667------------- Q gi|254781004|r 398 EQMFNECLKIHNLIISIYKDFGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRIN------------- 464 (652) Q Consensus 398 eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~------------- 464 (652) +++...|..+.+.|..+|.++|+....+...++..++..++|++..++.+...++.+..+.+..+.+ T Consensus 168 e~l~~tY~~~~~AY~~IF~rlgL~~~~v~ADsG~iGG~~ShEF~~~s~~GED~i~~C~~c~yaaN~E~A~~~~~~~~~~~ 247 (570) T PRK09194 168 ESLDETYDAMYQAYSRIFTRLGLDFRAVEADSGAIGGSASHEFMVLAESGEDTIVYSDESDYAANIEKAEALPPPEPRAA 247 (570) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCEEEEEECCCCCCEEEEECCCCHHHHHHHHHCCCCCCCCCC T ss_conf 99999999999999999998098629999146675886652378763789717999689760564877511477666754 Q ss_pred -----------------------------------C-------------------CCC---------------------- Q ss_conf -----------------------------------5-------------------205---------------------- Q gi|254781004|r 465 -----------------------------------T-------------------GVL---------------------- 468 (652) Q Consensus 465 -----------------------------------~-------------------~~~---------------------- 468 (652) | +++ T Consensus 248 ~~~~~e~v~TP~~~TIe~l~~~~~v~~~~~iKtl~~~~~~~~~~~~v~v~irGD~evNe~Kl~~~~~~~~l~~a~~eei~ 327 (570) T PRK09194 248 PTEELEKVDTPNAKTIEELAEFLNVPAEKTVKTLLVKADEEGEGELVAVLVRGDHELNEVKLENLLGAAPLELATEEEIR 327 (570) T ss_pred CCCCCEEECCCCCCHHHHHHHHHCCCHHHHEEEEEEEEECCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCCCCCHHHHH T ss_conf 30003042599962199999874999888135689984217887358998226520048988740375312248999999 Q ss_pred ---CCCCCCCCC------------------------CCCCCHHH--------------------------------HCCC Q ss_conf ---555554554------------------------32110021--------------------------------1033 Q gi|254781004|r 469 ---LGEGAFYGP------------------------KFEYILKD--------------------------------AIGR 489 (652) Q Consensus 469 ---~g~~afygp------------------------kid~~~~d--------------------------------~~gr 489 (652) .....|.|| +-|+++.. .... T Consensus 328 ~~~g~~~G~iGPv~~~v~ii~D~~v~~~~n~v~GAN~~d~H~~~vn~~RD~~~~~~~Di~~v~~GD~Cp~c~~~L~~~rg 407 (570) T PRK09194 328 KLFGAPPGFLGPVNLPIPIIADRSVADMSNFVVGANEDDKHYVGVNWGRDFPVPEVADLRNVVEGDPSPDGGGTLKIARG 407 (570) T ss_pred HHCCCCCCCCCCCCCCCEEEEEHHHHHHCCHHCCCCCCCCEEECCCCCCCCCCCCCCCHHHCCCCCCCCCCCCEEEEEEE T ss_conf 75189877556667881499703452100010035887645406765656786431125552568889889971277403 Q ss_pred CCCCCCCCHHHHCCCCCCCEEEEECCCEEEEEEEEECCCHH-HHHHHHHHHHHH----CCCCCCCCCCEEEEEEECC--- Q ss_conf 10012110111022346741673179721379964034103-789999999973----8708873378089998579--- Q gi|254781004|r 490 DWQCGTIQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFGS-IERFIGIMIENF----KGNLPLWLSPIQAIVTTIT--- 561 (652) Q Consensus 490 ~~~~~tiq~df~~p~~f~l~y~~~dg~~~~pvmihr~~~Gs-~eR~ia~liE~~----~g~~P~wLAP~QV~Iipi~--- 561 (652) ++.++++|++..+++.|+++|.|++|+.++++| ||||+ ++|++||+||++ |.+||..+|||||.|||++ T Consensus 408 IEVGHiF~LGtkYS~~m~a~~~d~~Gk~~p~~M---GCYGIGVsRllaAiiEq~~De~Gi~WP~~IAPf~V~Ii~~~~~~ 484 (570) T PRK09194 408 IEVGHIFQLGTKYSEAMNATFLDENGKAQPLTM---GCYGIGVSRLVAAAIEQNHDERGIIWPKAIAPFDVHIVPVNMKD 484 (570) T ss_pred EEEEEEECCCCCCCHHCCCEEECCCCCCCCCEE---EEEECCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEEECCCCC T ss_conf 688876312330146209879998898976145---62002467899999998377775407888688499999679998 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCEEEECHHHHHHHH Q ss_conf 66899999999999987988999758966367666554348988999874253258189998898866334299999999 Q gi|254781004|r 562 SSAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGSTTTQKVSLLDAIPIL 641 (652) Q Consensus 562 e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~~i~lee~i~~l 641 (652) +++.+.|++||+.|.++|++|.+|||++|+|+||+||||+|||++||||+|.++++.|+++.|.++++..++++++++.+ T Consensus 485 ~~~~~~ae~iy~~L~~~GidVllDDR~er~G~Kf~DaDLIGiP~~vvvGkk~l~~g~vEvk~R~~~~~~~v~~~el~~~v 564 (570) T PRK09194 485 EEVRELAEKLYAELLAAGIDVLLDDRKERPGVKFADAELIGIPHRIVVGKRGLAEGIVEYKNRRTGEKEEVKLDELVEFL 564 (570) T ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCCCCEEEEEEECCCCEEEECHHHHHHHH T ss_conf 89999999999999978998999899988536667516418997999926740158599999068975886199999999 Q ss_pred HH Q ss_conf 87 Q gi|254781004|r 642 TK 643 (652) Q Consensus 642 ~~ 643 (652) .+ T Consensus 565 ~~ 566 (570) T PRK09194 565 KE 566 (570) T ss_pred HH T ss_conf 99 No 14 >TIGR00409 proS_fam_II prolyl-tRNA synthetase; InterPro: IPR004500 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Prolyl-tRNA synthetase (6.1.1.15 from EC) exists in two forms, which are loosely related. The first form, is present in the majority of eubacteria species. The second one, present in some eubacteria, is essentially present in archaea and eukaryota. Prolyl-tRNA synthetase belongs to class IIa. This family includes the enzyme from Escherichia coli that contains all three of the conserved consensus motifs characteristic of class II aminoacyl-tRNA synthetases and the enzyme from the spirochete Borrelia burgdorferi.; GO: 0000166 nucleotide binding, 0004827 proline-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006433 prolyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=624.85 Aligned_cols=378 Identities=19% Similarity=0.318 Sum_probs=326.2 Q ss_pred EECCCCCCEEEEECCHHHHHHHHHHHHHHHHHC-C-CEEEECCCCCC-CCCCCCCCCCCCCCC---CCCEEECCCCCCCC Q ss_conf 422678743565010789999999999988768-9-63855543244-310001121001255---54201001456777 Q gi|254781004|r 257 HIAEDGSGVIFWHRKGWKIFQTLISYMRRKIKD-D-YEEINTPQVLD-QHLWQQSGHWDSYRA---NMFAVKCADDTIKD 330 (652) Q Consensus 257 ~~~~~~~G~~~wlP~G~~i~~~ie~~ir~~~~~-G-~~eV~tP~l~~-~~Lw~~SGh~~~y~~---~mf~~~~~~~~~~~ 330 (652) ||.+.++|+|.|||+|+||+++|++++|++|+. | ++||.||.+++ ++||+.|||||+|++ +||. .+||. T Consensus 32 fIR~~~SGlY~~LPL~~rVL~Kv~~ivreeM~~~GGA~E~~lp~~~~la~LW~~SGRW~~YG~seGELl~-----lkDR~ 106 (620) T TIGR00409 32 FIRRLGSGLYNWLPLGLRVLKKVENIVREEMNKDGGAIEVLLPALQPLAELWQESGRWDTYGPSEGELLR-----LKDRK 106 (620) T ss_pred EEEECCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEE-----EEECC T ss_conf 4772434144565789999998899999862267853346676677899987540753246865433667-----65268 Q ss_pred CCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECC-HHHHHHHHHHHHH Q ss_conf 823774343125777765201222231043200010001278764520001056353344210105-3767887899999 Q gi|254781004|r 331 LRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCT-KEQMFNECLKIHN 409 (652) Q Consensus 331 ~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~-~eq~~~e~~~~~~ 409 (652) +++|||+||+|+.+|.|.++.++|||+||+.||||++|||||+|| ||||||+|||+|||||||++ .|.+.++|..+.+ T Consensus 107 ~r~f~LgpT~EEvIT~l~~~~i~SYKqLP~~lYQI~tKFRDEiRP-RFGl~RgREFiMKDAYSFH~~~ESL~~ty~~m~~ 185 (620) T TIGR00409 107 GREFVLGPTHEEVITDLVRNEIKSYKQLPLNLYQIQTKFRDEIRP-RFGLLRGREFIMKDAYSFHSDEESLDATYQKMDQ 185 (620) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEECCCCCCCCCCCC-CCCCCCCHHHHCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 887356786655689999998754530683010045121442048-8760001232302425544787679999889999 Q ss_pred HHHHHHHHHCCCCCE-ECCCCC-CCCCCCCHHHHHHHHHHHHHHHHHH-CCCCCC-CCCCCCCCCC-----CC------- Q ss_conf 988766552133210-001465-3112586035778999999999850-212125-6675205555-----55------- Q gi|254781004|r 410 LIISIYKDFGFEKIM-VKLSTR-PEKRVGSDALWDDAENIMKGVLDTI-KNSSQD-RINTGVLLGE-----GA------- 473 (652) Q Consensus 410 ~~~~v~~~~g~~~~~-~~ls~~-~~~~~g~~~~~~~ae~~l~~~l~~~-~~~~~~-~~~~~~~~g~-----~a------- 473 (652) .|.++|.++|++..+ +.+.++ ..++..++|+...++.+..+++-+. ...+.. +++-...... .+ T Consensus 186 aY~~IF~rlGldFR~iV~ADSGdaIGG~~S~EF~vL~~~GEDti~~~~h~~dyaAlNiE~A~~~~~~~P~~~~~~~~~~~ 265 (620) T TIGR00409 186 AYSNIFSRLGLDFRPIVQADSGDAIGGSASHEFMVLAESGEDTIVYSDHESDYAALNIELAEALKPGEPRNAPTAELKKV 265 (620) T ss_pred HHHHHHHHCCCCCCEEEEECCCCCCCCCCCHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHCCCCCCCCHHHCCCCC T ss_conf 99999986589821068715787578886256788664589617886257651232367776425788750113314454 Q ss_pred -------------------------------------------------------------------------------- Q ss_conf -------------------------------------------------------------------------------- Q gi|254781004|r 474 -------------------------------------------------------------------------------- 473 (652) Q Consensus 474 -------------------------------------------------------------------------------- 473 (652) T Consensus 266 ~tPN~~ti~~~~~~f~~~~~~~~K~ll~ka~~~~~K~~~v~l~vRGD~ielnevK~~n~l~klaPhy~~~~~le~~~~~~ 345 (620) T TIGR00409 266 DTPNTKTIAELVEFFNLPAEKVVKTLLVKAVDKSGKEPLVALLVRGDDIELNEVKAKNLLNKLAPHYLVAQVLELATGEE 345 (620) T ss_pred CCCCCHHHHHHHHHHCCCHHHHHCEEEEEEEECCCCEEEEEEEEECCCCCHHHHCCCCCCCHHCCCHHHCCEEECCCCHH T ss_conf 77441118999998678876831031466651378606888897367013221113466202134101034011147147 Q ss_pred -------------------------------CCCC---C--CCC---CH------------------------------- Q ss_conf -------------------------------4554---3--211---00------------------------------- Q gi|254781004|r 474 -------------------------------FYGP---K--FEY---IL------------------------------- 483 (652) Q Consensus 474 -------------------------------fygp---k--id~---~~------------------------------- 483 (652) |.++ . .|. .+ T Consensus 346 ~~~~~~~~~~~~~g~~~~~~sddyiatnkqv~~~~~~~~khvd~iif~~~~~~~f~~~A~~~~~h~~~vn~~~~~~~~~~ 425 (620) T TIGR00409 346 EIEQKIASKPDLLGPVKINGSDDYIATNKQVFIDIPVLIKHVDQIIFTVALMSDFAAGANADDKHYKNVNWDRDVAIPEV 425 (620) T ss_pred HHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCEEECCCCCCCCHHHH T ss_conf 99998864067555656665310233101344203566644646621263526850103454565030263100211322 Q ss_pred --------------HHH------CCCCCCCCCCCHHHHCCCCCCCEEEEECCCEE-EEEEEEECCCHH-HHHHHHHHHHH Q ss_conf --------------211------03310012110111022346741673179721-379964034103-78999999997 Q gi|254781004|r 484 --------------KDA------IGRDWQCGTIQVDFNLPSRFNAFYVNSHSEKC-HPVMIHRAVFGS-IERFIGIMIEN 541 (652) Q Consensus 484 --------------~d~------~gr~~~~~tiq~df~~p~~f~l~y~~~dg~~~-~pvmihr~~~Gs-~eR~ia~liE~ 541 (652) -|. ...++.++.+||+..|++.++++|+|+||+.+ ...| ||||+ |+|+++|++|+ T Consensus 426 ~~D~r~~~eGDrP~Pdg~G~L~~~~gIEVGHIF~LG~kYS~al~A~f~de~Gk~~t~~~M---GCYGIGVSRl~~Ai~Eq 502 (620) T TIGR00409 426 VADIRKVKEGDRPSPDGKGTLKIAKGIEVGHIFKLGTKYSKALKATFLDENGKEQTFLTM---GCYGIGVSRLVSAIAEQ 502 (620) T ss_pred HHHHHHEEECCCCCCCCCCEEEECCCCEEEEEECCCHHHHHHHCCEEECCCCCCCCEEEC---CCCCCHHHHHHHHHHHH T ss_conf 200100024777884387405640562341133022033333065564668850252560---55571389999999985 Q ss_pred H----CCCCCCCCCCEEEEEEECC----HHHHHHHHHHHHHHH-----HCCCEEEEECCCCCCCHHHHH--HHHCCCCEE Q ss_conf 3----8708873378089998579----668999999999999-----879889997589663676665--543489889 Q gi|254781004|r 542 F----KGNLPLWLSPIQAIVTTIT----SSAVEYAQEIANLLK-----SHHLSIETDFRNETINYKIRE--HSIKKIPII 606 (652) Q Consensus 542 ~----~g~~P~wLAP~QV~Iipi~----e~~~eya~~i~~~L~-----~~girV~~Ddr~~~~G~Kir~--a~l~giP~~ 606 (652) + |.+||..+|||+|+|+|.+ +++.++|++||.+|+ ++|++|++|||+|++|.|+|| .+|+|+|++ T Consensus 503 ~~D~~G~~WP~~~APy~V~Iv~~N~~~se~~~~~A~~~Y~~L~ndPtm~~g~dvllDDR~ER~Gv~f~Dkl~eLIG~P~~ 582 (620) T TIGR00409 503 HYDERGIIWPKAIAPYDVVIVVMNMKDSEEQKQLAEEVYSELLNDPTMAQGVDVLLDDRNERAGVKFRDKLSELIGIPLR 582 (620) T ss_pred CCCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHCCCCEE T ss_conf 05878636764538814899617898427899999999998502711365335664101023555156677775598719 Q ss_pred EEECHHHHHCCCEEEEECCCCCEEEECHHHHHHHHHH Q ss_conf 9987425325818999889886633429999999987 Q gi|254781004|r 607 IICGDKEASERSIGIRRFGSTTTQKVSLLDAIPILTK 643 (652) Q Consensus 607 ivIG~ke~e~~~Vtir~R~~~~~~~i~lee~i~~l~~ 643 (652) ||||++.++++.|+++.|.+++++.++.+++.+.|.+ T Consensus 583 v~~G~~~l~~~~~Ev~~R~~gEk~~i~~~~~~~~~~~ 619 (620) T TIGR00409 583 VVVGKKNLDNGEIEVKKRRNGEKQLIKKDELVEVLEE 619 (620) T ss_pred EEECCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHC T ss_conf 9975621677516898615510457778999998622 No 15 >PRK08661 prolyl-tRNA synthetase; Provisional Probab=100.00 E-value=0 Score=547.08 Aligned_cols=337 Identities=22% Similarity=0.325 Sum_probs=290.9 Q ss_pred CCCEEEEECCHHHHHHHHHHHHHHHHHC-CCEEEECCCCCC-CCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCC Q ss_conf 8743565010789999999999988768-963855543244-31000112100125554201001456777823774343 Q gi|254781004|r 262 GSGVIFWHRKGWKIFQTLISYMRRKIKD-DYEEINTPQVLD-QHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPM 339 (652) Q Consensus 262 ~~G~~~wlP~G~~i~~~ie~~ir~~~~~-G~~eV~tP~l~~-~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPm 339 (652) -+|+++|+|+|++||++|++++|++++. |.+++++|.++| ..||++++||+.|++++|+++. .+.++.++.+||||| T Consensus 34 VkG~~v~~P~g~~iw~~i~~~~~~~~~~~g~~~~~fP~liP~~~l~kE~~hvegF~pE~~~VT~-~G~~~l~~~l~lrPT 112 (478) T PRK08661 34 VKGCMVIRPYGYAIWENIQKILDEMFKETGHENVYFPLLIPESLLQKEAEHIEGFAPEVAWVTH-GGGEKLEEPLALRPT 112 (478) T ss_pred CCEEEEECCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHCCCCCEEEEEEC-CCCCCCCCCCCCCCC T ss_conf 8727997874899999999999999986398087302406689998666566156740489954-787555676310888 Q ss_pred CHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECC-HHHHHHHHHHHHHHHHHHHH-H Q ss_conf 125777765201222231043200010001278764520001056353344210105-37678878999999887665-5 Q gi|254781004|r 340 NCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCT-KEQMFNECLKIHNLIISIYK-D 417 (652) Q Consensus 340 ncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~-~eq~~~e~~~~~~~~~~v~~-~ 417 (652) ++..++.+|+.|++||||||+++|||+++||||++| |. ++|+|||+|+|||+|+. .+++.+++..+++.|..++. . T Consensus 113 sEt~i~~~~~~wI~SyrdLPl~l~Qw~~v~R~E~r~-r~-flR~rEFl~qe~ht~h~t~eea~~~~~~~~~~Y~~~~e~~ 190 (478) T PRK08661 113 SETIIYPMFKKWIQSYRDLPLLYNQWVNVVRWEKKT-RP-FLRTREFLWQEGHTAHATEEEAEEETLEMLEIYKEFAEDY 190 (478) T ss_pred CHHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCC-CC-HHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 799999999998523652690002004458657889-63-1434263542120113999999999999999999999983 Q ss_pred HCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCC- Q ss_conf 2133210001465311258603577899999999985021212566752055555545543211002110331001211- Q gi|254781004|r 418 FGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYILKDAIGRDWQCGTI- 496 (652) Q Consensus 418 ~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~~~d~~gr~~~~~ti- 496 (652) +++..+.. ..+..+++.|.+. .-.++..+.| |+..|+||+ T Consensus 191 LaiPvi~g-~k~~~ekFaGa~~------------------------------------T~~iea~m~d--GkaLQ~gtsH 231 (478) T PRK08661 191 LAIPVIIG-KKTEWEKFAGADY------------------------------------TYTIEAMMPD--GKALQIGTSH 231 (478) T ss_pred CCCCEECC-CCCCCCCCCCCCC------------------------------------CCEEEEECCC--CCEEECCCEE T ss_conf 78755637-7783021789865------------------------------------4205887268--8631143142 Q ss_pred CHHHHCCCCCCCEEEEECCCEEEEEEEEECCCHHHHHHHHHHHHHHC----CCCCCCCCCEEEEEEECC-----HHHHHH Q ss_conf 01110223467416731797213799640341037899999999738----708873378089998579-----668999 Q gi|254781004|r 497 QVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFGSIERFIGIMIENFK----GNLPLWLSPIQAIVTTIT-----SSAVEY 567 (652) Q Consensus 497 q~df~~p~~f~l~y~~~dg~~~~pvmihr~~~Gs~eR~ia~liE~~~----g~~P~wLAP~QV~Iipi~-----e~~~ey 567 (652) +|+.++++.|+++|.|++|+.++++| +|||..+|+||++|+.|+ .+||+.|||+||+||||. +++.++ T Consensus 232 ~LGq~fskaf~i~f~d~~g~~~~~~~---ts~GistR~iga~Im~H~Dd~GlvlPp~iAP~qVvIvPi~~~~~~~~v~~~ 308 (478) T PRK08661 232 YLGQNFAKAFDIKFQDKDGKLEYVHQ---TSWGVSTRLIGALIMVHGDDKGLVLPPKIAPIQVVIIPILKKKEEEEVLEY 308 (478) T ss_pred CCCHHHHHHCCCEEECCCCCEEEEEE---ECCCHHHHHHHHHHHHHCCCCCCCCCHHHCCCCEEEEEECCCCCHHHHHHH T ss_conf 16223567549699889998867887---534542789999999847998651784559830899984578878999999 Q ss_pred HHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCEEEECHHHHHHHHHH Q ss_conf 9999999998798899975896636766655434898899987425325818999889886633429999999987 Q gi|254781004|r 568 AQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGSTTTQKVSLLDAIPILTK 643 (652) Q Consensus 568 a~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~~i~lee~i~~l~~ 643 (652) |++|++.|+++|+||.+|||++++|+||.+|++.|||++|+||+|++++++|++.+||++++..++++++.+.+.+ T Consensus 309 ~~~i~~~L~~~girv~~Ddrd~~pG~Kf~~~elkGvPlRieiGprDle~~~v~v~RRDt~eK~~v~~~~l~~~v~~ 384 (478) T PRK08661 309 AKELAEELKAAGIRVKLDDRDKTPGWKFYEWELKGVPLRIEIGPRDLENNTVVLVRRDTLEKETVSLDELVEEVPE 384 (478) T ss_pred HHHHHHHHHHCCEEEEEECCCCCHHHHHHHHHHCCCCEEEEECCCHHCCCEEEEEECCCCCCEEECHHHHHHHHHH T ss_conf 9999999987790799817665665666661014785899977201028979999867986146659999999999 No 16 >COG0442 ProS Prolyl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=519.92 Aligned_cols=376 Identities=18% Similarity=0.289 Sum_probs=299.5 Q ss_pred EECCCCCCEEEEECCHHHHHHHHHHHHHHHHHC-CCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEE Q ss_conf 422678743565010789999999999988768-9638555432443100011210012555420100145677782377 Q gi|254781004|r 257 HIAEDGSGVIFWHRKGWKIFQTLISYMRRKIKD-DYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFA 335 (652) Q Consensus 257 ~~~~~~~G~~~wlP~G~~i~~~ie~~ir~~~~~-G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~ 335 (652) ++++.++|+|.|||+|++|+++|++++|++|++ |++||++|.|+|++||++||||+.|+++||++ +++++++|| T Consensus 31 ~i~~~~~G~y~~lP~g~rv~~kI~~iir~em~~~G~~Evl~P~L~p~eLwkEs~r~~~f~~El~~v-----~drg~~~l~ 105 (500) T COG0442 31 MIRKPVKGLYVWLPLGLRVLEKIENIIREEMDKIGAQEVLFPTLIPAELWKESGRWEGFGPELFRV-----KDRGDRPLA 105 (500) T ss_pred CEEECCCCEEEECCCHHHHHHHHHHHHHHHHHHCCCEEEECHHCCHHHHHHHHCHHHCCCHHHEEE-----ECCCCCEEE T ss_conf 602025716997765899999999999999986085088311058789998748563036444899-----716996343 Q ss_pred ECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECC-HHHHHHHHHHHHHHHHHH Q ss_conf 4343125777765201222231043200010001278764520001056353344210105-376788789999998876 Q gi|254781004|r 336 LKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCT-KEQMFNECLKIHNLIISI 414 (652) Q Consensus 336 LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~-~eq~~~e~~~~~~~~~~v 414 (652) |+||||+.++.||+++++||||||+++|||++|||||++| ++||||+|||+|+|||||+. .+++.+++..+.+.|.++ T Consensus 106 L~PTsEe~it~~~~~~i~SYkdLPl~lYQi~~kfRdE~rp-r~gllR~REF~mkdaySfh~~~e~a~~~y~~~~~~Y~~i 184 (500) T COG0442 106 LRPTSEEVITDMFRKWIRSYKDLPLKLYQIQSKFRDEKRP-RFGLLRGREFLMKDAYSFHADEEDAEETYEKMLDAYSRI 184 (500) T ss_pred ECCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCC-CCCCCCHHEEEECCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 5788689999999998655651881210134588433468-877300020330111210289899999999999999999 Q ss_pred HHHHCCCCCEECCCCCCCCCCCCHHHHHHH-HH-------------HHHHHH-HHHCCCCC-CCCCCCC----------- Q ss_conf 655213321000146531125860357789-99-------------999999-85021212-5667520----------- Q gi|254781004|r 415 YKDFGFEKIMVKLSTRPEKRVGSDALWDDA-EN-------------IMKGVL-DTIKNSSQ-DRINTGV----------- 467 (652) Q Consensus 415 ~~~~g~~~~~~~ls~~~~~~~g~~~~~~~a-e~-------------~l~~~l-~~~~~~~~-~~~~~~~----------- 467 (652) |..+|+....+...++..+...++++...+ ++ -.+.+. +....... ....+.. T Consensus 185 f~~i~l~~~~~~ad~g~~Gg~~S~eF~~l~pd~ge~qi~ts~~y~aN~e~a~~~~~~~~~~~~~~~~v~t~s~~~s~r~~ 264 (500) T COG0442 185 FLRLPLIFGPVPADEGFIGGSYSHEFEALMPDGGEDQIATSHHYGANFEKAFIDIKFEDEEEGELEYVHTTSYGISTRII 264 (500) T ss_pred HHHCCCEEEEECCCCCCCCCCCCEEEEEECCCCCCCEEEEECCHHHHHHHHCCCCCCCCCCCCCCEEECCCCEEEEEEEE T ss_conf 98589538862566778887653699998568985479980626666877446777556665541475146268874001 Q ss_pred -----CCCC-CCCCCCCC---------------------------CCC-----------HH----HHCCC-------CCC Q ss_conf -----5555-55455432---------------------------110-----------02----11033-------100 Q gi|254781004|r 468 -----LLGE-GAFYGPKF---------------------------EYI-----------LK----DAIGR-------DWQ 492 (652) Q Consensus 468 -----~~g~-~afygpki---------------------------d~~-----------~~----d~~gr-------~~~ 492 (652) -.|+ .-+.||.+ |+. .. +..|. ... T Consensus 265 ~~~i~i~GDn~G~v~Pp~vA~~qV~~~~~~~~ga~~h~~~~~~~rd~~~~l~~~~~~~~~D~~~~~~~G~kl~~~e~ieV 344 (500) T COG0442 265 GAAILIHGDNEGLVLPPIVADIQVVIVPIFIKGANEHYKVVNYGRDVAEPLEKLGIRVEGDDRSPDGPGFKLNIWEGIEV 344 (500) T ss_pred EEEEEEECCCCCCCCCCHHCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCEEEECCCCCCCCCCCEEEEEECCCC T ss_conf 58999714887764875004534899751336741013343045444566543025884245689999721102102445 Q ss_pred CCCCCHHHHCCCCCCCEEEEECCCEEEEEEEEECCCHH-HHHHHHHHHHHH----CCCCCCCCCCEEEEEEECCH---HH Q ss_conf 12110111022346741673179721379964034103-789999999973----87088733780899985796---68 Q gi|254781004|r 493 CGTIQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFGS-IERFIGIMIENF----KGNLPLWLSPIQAIVTTITS---SA 564 (652) Q Consensus 493 ~~tiq~df~~p~~f~l~y~~~dg~~~~pvmihr~~~Gs-~eR~ia~liE~~----~g~~P~wLAP~QV~Iipi~e---~~ 564 (652) ...++++..+++.+++++.+.||+.+..+| +|||. ++|++++++|++ +..||..+||++|.|++++. +. T Consensus 345 ghif~lG~kyse~~~a~v~~r~g~~~~~~m---g~ygigvsr~v~a~ieq~~d~~gi~w~~a~apf~~~iv~~n~~~~~~ 421 (500) T COG0442 345 GHIFELGTKYSEAMNATVLDRDGKEQPKTM---GCYGIGVSRLVAALLEQIHDENGIIWPKAIAPFDVHIVPVNTKDFKQ 421 (500) T ss_pred CEEEEECCHHHHHCEEEEEECCCCCCCEEE---ECHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEECCCHHHHH T ss_conf 779998741254180499924887144178---70000034599999997362226716001475000799748635999 Q ss_pred HHHHHHHHHHHHHCC-CEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCEEEECHHHHHHHHH Q ss_conf 999999999999879-889997589663676665543489889998742532581899988988663342999999998 Q gi|254781004|r 565 VEYAQEIANLLKSHH-LSIETDFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGSTTTQKVSLLDAIPILT 642 (652) Q Consensus 565 ~eya~~i~~~L~~~g-irV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~~i~lee~i~~l~ 642 (652) ...|++++..|...| .+|.+|||++++|.|+.+|+++|+|+++++|++ .+++.|+++.|.+++...++.+.+++++. T Consensus 422 ~~~~~~~~~~l~~~G~~e~~~ddr~er~g~k~~~a~liGiP~~~~~g~~-~~~g~~e~k~r~~ge~~~~~~~~l~~~~~ 499 (500) T COG0442 422 AEAAEKLYVELPWCGTVEVLLDDRDERPGVKFADADLIGIPLRIVVGKR-LAEGEVEVKCRKCGEKEAVTIEALFARLY 499 (500) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCEEECCCCEEEECCC-CCCCCEEEEECCCCCHHHCCHHHHHHHHH T ss_conf 9875658998775786214454220146750148747104403310334-46882368864778610154998888860 No 17 >cd00778 ProRS_core_arch_euk Prolyl-tRNA synthetase (ProRS) class II core catalytic domain. ProRS is a homodimer. It is responsible for the attachment of proline to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs in the core domain. This subfamily contains the core domain of ProRS from archaea, the cytoplasm of eukaryotes and some bacteria. Probab=100.00 E-value=0 Score=506.78 Aligned_cols=245 Identities=23% Similarity=0.407 Sum_probs=213.8 Q ss_pred HCCEEEECCCCCCEEEEECCHHHHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCC-CCCCCCCCCCCCCCEEECCCCCCC Q ss_conf 1440442267874356501078999999999998876-89638555432443100-011210012555420100145677 Q gi|254781004|r 252 EMDLFHIAEDGSGVIFWHRKGWKIFQTLISYMRRKIK-DDYEEINTPQVLDQHLW-QQSGHWDSYRANMFAVKCADDTIK 329 (652) Q Consensus 252 ~~~lf~~~~~~~G~~~wlP~G~~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw-~~SGh~~~y~~~mf~~~~~~~~~~ 329 (652) ..+|+--. --||+|+|||+|++++++|++++|++++ .|||||.||+|+++++| ++||||++|+++||.++. .+.++ T Consensus 12 ~~~l~d~~-pv~G~~~~lP~G~~i~~~l~~~~r~~~~~~G~~ev~~P~l~~~~~~~k~sgh~~~y~~emf~v~~-~g~~~ 89 (261) T cd00778 12 KAELIDYG-PVKGCMVFRPYGYAIWENIQKILDKEIKETGHENVYFPLLIPESELEKEKEHIEGFAPEVAWVTH-GGLEE 89 (261) T ss_pred HHCCCCCC-CCCCEEEECCCHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHCHHHCCCCCCEEEEC-CCCCC T ss_conf 81772247-99830888632899999999999999998699699883327889999742814305765335521-68520 Q ss_pred CCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHHH Q ss_conf 782377434312577776520122223104320001000127876452000105635334421-0105376788789999 Q gi|254781004|r 330 DLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIH 408 (652) Q Consensus 330 ~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~ 408 (652) ++++|||||||||+|+.||++++|||||||+|+||+|++||||+|| ++||||+|||+||||| +||++||+.+++..++ T Consensus 90 ~d~~y~LkPmne~~~~~i~~~~~~SYRdLPlrl~q~g~~~R~E~s~-~~gL~RvReF~~dDahs~~~t~eq~~~e~~~~~ 168 (261) T cd00778 90 LEEPLALRPTSETAIYPMFSKWIRSYRDLPLKINQWVNVFRWETKT-TRPFLRTREFLWQEGHTAHATEEEAEEEVLQIL 168 (261) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHCCCEECCCCCC-CCCCCEEEEEEEECCCCCCCCHHHHHHHHHHHH T ss_conf 1356034788869999999735054022782555304224377888-876532542523423365589999999999999 Q ss_pred HHHHHHHHHHCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCC Q ss_conf 99887665521332100014653112586035778999999999850212125667520555555455432110021103 Q gi|254781004|r 409 NLIISIYKDFGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYILKDAIG 488 (652) Q Consensus 409 ~~~~~v~~~~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~~~d~~g 488 (652) +++..+|.+++.. ++..+.+++ |+ ...||||+|++|+.+.| | T Consensus 169 ~~y~~i~~~l~~~--~~~~~~~~~--------~d--------------------------k~~Ga~y~~~id~~~~d--g 210 (261) T cd00778 169 DLYKEFYEDLLAI--PVVKGRKTE--------WE--------------------------KFAGADYTYTIEAMMPD--G 210 (261) T ss_pred HHHHHHHHHHCCE--EEEEECCCH--------HC--------------------------CCCCCCCCEEEEEECCC--C T ss_conf 9999999984781--578605652--------11--------------------------16785100014787478--8 Q ss_pred CCCCCCCC-CHHHHCCCCCCCEEEEECCCEEEEEEEEECCCHHHHHHHHHHHH Q ss_conf 31001211-01110223467416731797213799640341037899999999 Q gi|254781004|r 489 RDWQCGTI-QVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFGSIERFIGIMIE 540 (652) Q Consensus 489 r~~~~~ti-q~df~~p~~f~l~y~~~dg~~~~pvmihr~~~Gs~eR~ia~liE 540 (652) |.|||+|+ |+|+++|++|+++|+++||++++| |||||||.||||||||| T Consensus 211 r~~q~gti~~l~~~~s~~f~l~y~~~dg~~~~p---hr~~~G~~eR~ig~lI~ 260 (261) T cd00778 211 RALQSGTSHNLGQNFSKAFDIKYQDKDGQKEYV---HQTSWGISTRLIGAIIM 260 (261) T ss_pred CEEEEEHHHHCCCCHHHHCCCEEECCCCCCCCC---EEEEEEHHHHHHHHHHC T ss_conf 704675102027653563798999999998085---57412499899889734 No 18 >TIGR00408 proS_fam_I prolyl-tRNA synthetase; InterPro: IPR004499 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Prolyl-tRNA synthetase is a class II tRNA synthetase and is recognized by IPR002314 from INTERPRO, which recognises tRNA synthetases for Gly, His, Ser, and Pro. The prolyl-tRNA synthetases are divided into two widely divergent families. This family includes the archaeal enzyme, the Pro-specific domain of a human multifunctional tRNA ligase, and the enzyme from the spirochete Borrelia burgdorferi (Lyme desease spirochete). The other family, IPR004500 from INTERPRO, includes enzymes from Escherichia coli, Bacillus subtilis, Synechocystis sp. (strain PCC6803), and one of the two prolyl-tRNA synthetases of Saccharomyces cerevisiae (Baker's yeast).; GO: 0000166 nucleotide binding, 0004827 proline-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006433 prolyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=468.41 Aligned_cols=359 Identities=22% Similarity=0.358 Sum_probs=310.4 Q ss_pred HHHHHHHHHHHHHHHHHHHCCEEEECCCCCCEEEEECCHHHHHHHHHHHHHHHHHC--------------------CCEE Q ss_conf 99999975320088754114404422678743565010789999999999988768--------------------9638 Q gi|254781004|r 234 YLYFLEESEKRDHRKLAREMDLFHIAEDGSGVIFWHRKGWKIFQTLISYMRRKIKD--------------------DYEE 293 (652) Q Consensus 234 ~~~~~eea~~rdHr~lg~~~~lf~~~~~~~G~~~wlP~G~~i~~~ie~~ir~~~~~--------------------G~~e 293 (652) |...+++|+--||| -.=.|+++|||+|+.||.+|.+++|+-+++ |.++ T Consensus 12 Y~~il~kAel~d~R------------ypVKG~~v~~P~g~~iw~~iq~~~~~~~~~~~~~~~~~L~~F~P~~~~~Y~h~~ 79 (533) T TIGR00408 12 YDQILQKAELIDYR------------YPVKGCYVFLPYGFKIWENIQKILRDKLDEIKLKDKVILKKFIPIESLPYGHEE 79 (533) T ss_pred HHHHHHCCEEEEEC------------CCCCCEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHH T ss_conf 89987305576641------------760103784598778999999999999987643546766405677777777001 Q ss_pred EECCCCCCCCCCCC-CCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCC Q ss_conf 55543244310001-12100125554201001456777823774343125777765201222231043200010001278 Q gi|254781004|r 294 INTPQVLDQHLWQQ-SGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNE 372 (652) Q Consensus 294 V~tP~l~~~~Lw~~-SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E 372 (652) |+.|.|.+.+++++ ..|.+.|.+|.|+++. .+....++.++||||+|-..+.||+.|++||+||||++|||..+|||| T Consensus 80 v~fP~L~p~~~L~KEk~HikGF~pEv~wi~~-~G~~~L~~pLaLRPTSEt~~y~m~~kWv~sy~DLPlk~nQw~~vfR~E 158 (533) T TIGR00408 80 VYFPMLIPESELEKEKDHIKGFEPEVYWITH-GGLKKLDEPLALRPTSETAMYNMYKKWVKSYRDLPLKINQWVNVFRYE 158 (533) T ss_pred EECCCCCCHHHHHHHCCCCCCCCCHHHHEEC-CCCCCCCCCCEECCCCHHHHHHHHHHHHHHCCCCCEEEEEEEEEEECC T ss_conf 0056445255786321356885840021010-377724675012586178887657878512158876898435889802 Q ss_pred CCCCCEEEEECCCCC-CCCCEE-ECCHHHHHHHHHHHHHHHHHHHH-HHCCCCCEECCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 764520001056353-344210-10537678878999999887665-521332100014653112586035778999999 Q gi|254781004|r 373 PSGSLHGLMRVRGFT-QDDAHV-FCTKEQMFNECLKIHNLIISIYK-DFGFEKIMVKLSTRPEKRVGSDALWDDAENIMK 449 (652) Q Consensus 373 ~sg~l~GL~R~ReF~-~~Dah~-F~~~eq~~~e~~~~~~~~~~v~~-~~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~ 449 (652) .+.+ .+|+|+|||+ +++||+ |.+.+++++++..++++|..+|. .+++.... .+-+-.++|.|+..-| T Consensus 159 ~kHT-RPflR~rEf~twqE~HT~H~t~~eAee~v~~~~~~Yk~f~~~~LaIP~~~-g~k~E~eKFaGA~~T~-------- 228 (533) T TIGR00408 159 KKHT-RPFLRTREFLTWQEAHTAHATAEEAEEEVLKALDLYKEFIENLLAIPYFV-GRKSEKEKFAGAEYTW-------- 228 (533) T ss_pred CCCC-CCCHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCEEE-ECCCCCCCCCCCCCCE-------- T ss_conf 1787-86102333210100013143777889999999998899975028864687-1488755478876223-------- Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCC-HHHHCCCCCCCEEEEECCCEEEEEEEEECCC Q ss_conf 999850212125667520555555455432110021103310012110-1110223467416731797213799640341 Q gi|254781004|r 450 GVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYILKDAIGRDWQCGTIQ-VDFNLPSRFNAFYVNSHSEKCHPVMIHRAVF 528 (652) Q Consensus 450 ~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~~~d~~gr~~~~~tiq-~df~~p~~f~l~y~~~dg~~~~pvmihr~~~ 528 (652) .+|..+- ..||.-||+|+. |+.+.++.|+++|.+.+|....-...|..|| T Consensus 229 ----------------------------~~e~imP-DgG~~LQ~ATsH~LGqnF~K~F~i~fe~p~Gs~~~~~~ayq~s~ 279 (533) T TIGR00408 229 ----------------------------TFETIMP-DGGRTLQIATSHDLGQNFAKTFDIKFENPTGSDMDKVYAYQTSY 279 (533) T ss_pred ----------------------------EEEEECC-CCCEEHHHHHHHHHHCCCEEEEEEEEECCCCCCCCCCCCEECCC T ss_conf ----------------------------5675247-76500122121133035402036788778889764101001013 Q ss_pred HHHHHHHHHHHHHH----CCCCCCCCCCEEEEEEE-C----C-----HHHHHHHHHHHHHHHHCCCEEEEECCCC-CCCH Q ss_conf 03789999999973----87088733780899985-7----9-----6689999999999998798899975896-6367 Q gi|254781004|r 529 GSIERFIGIMIENF----KGNLPLWLSPIQAIVTT-I----T-----SSAVEYAQEIANLLKSHHLSIETDFRNE-TINY 593 (652) Q Consensus 529 Gs~eR~ia~liE~~----~g~~P~wLAP~QV~Iip-i----~-----e~~~eya~~i~~~L~~~girV~~Ddr~~-~~G~ 593 (652) |..+|+|||||.-| |.+||+-|||+||+||| + + +++.|+|++|.+.|++.+|||.+|+++. ++|+ T Consensus 280 G~StR~igali~~HsD~~GLVLPp~vAp~Qvv~~Pv~IFkK~s~E~~~~~~e~a~~v~~~Lk~~~iRv~~D~~~~~~pG~ 359 (533) T TIGR00408 280 GISTRVIGALILIHSDEKGLVLPPRVAPIQVVIIPVIIFKKKSDEENKKVMELARKVEDKLKKLGIRVHLDKRDNYRPGK 359 (533) T ss_pred CHHHHHHHHEEEEECCCCCCCCCCCCCCEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCC T ss_conf 35788754500012267775167012733589998677558752147789999999998773041007970355753231 Q ss_pred HHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCC--EEEECHHHHHHH-HHHH Q ss_conf 66655434898899987425325818999889886--633429999999-9873 Q gi|254781004|r 594 KIREHSIKKIPIIIICGDKEASERSIGIRRFGSTT--TQKVSLLDAIPI-LTKE 644 (652) Q Consensus 594 Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~--~~~i~lee~i~~-l~~e 644 (652) ||.++|+.|||++++||+|+++.|.|++-+|++++ +..++++++.+. +.+. T Consensus 360 kf~~~E~kGiPlR~evGP~D~~~n~~~~~rR~~~~kfK~~~~~~~L~~~k~~~~ 413 (533) T TIGR00408 360 KFSKWELKGIPLRLEVGPNDIEKNIVVVSRRDTKRKFKYQVSLDELEERKVVEL 413 (533) T ss_pred HHHCCCEEECCEEEEECCCCCCCCEEEEEEECCCCEEEEEEEHHHHHHHHHHHH T ss_conf 320453330426998777510437689999617713557720545446789999 No 19 >KOG2324 consensus Probab=100.00 E-value=0 Score=466.60 Aligned_cols=379 Identities=18% Similarity=0.241 Sum_probs=312.2 Q ss_pred CEEEECCCCCCEEEEECCHHHHHHHHHHHHHHHHHC-CCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCC Q ss_conf 404422678743565010789999999999988768-9638555432443100011210012555420100145677782 Q gi|254781004|r 254 DLFHIAEDGSGVIFWHRKGWKIFQTLISYMRRKIKD-DYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLR 332 (652) Q Consensus 254 ~lf~~~~~~~G~~~wlP~G~~i~~~ie~~ir~~~~~-G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~ 332 (652) ++=|+.+.+||+|-|||+|.|+.+++.+.++.+|+. |+++|.+|+|.+++||+++||||..+.+||. ..||.++ T Consensus 33 e~GfI~ps~~G~yq~LPlg~R~~~K~~~~l~~~mqs~Ga~kIslp~ls~~~LWekTgRw~~~gsEl~r-----l~Dr~gk 107 (457) T KOG2324 33 EVGFIRPSSPGLYQLLPLGLRVLNKLCRLLDNEMQSGGAQKISLPILSSKELWEKTGRWDAMGSELFR-----LHDRKGK 107 (457) T ss_pred HHCCCCCCCCCCEEECCCHHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHCCCCCCCCHHHEE-----EECCCCC T ss_conf 70710148987012130037999999999999987236705740446868998761851124604537-----5104787 Q ss_pred EEEECCCCHHHHHHHHHHCCC-CHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECC-HHHHHHHHHHHHHH Q ss_conf 377434312577776520122-2231043200010001278764520001056353344210105-37678878999999 Q gi|254781004|r 333 TFALKPMNCPGHVAVFNHGLK-SYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCT-KEQMFNECLKIHNL 410 (652) Q Consensus 333 ~~~LkPmncp~h~~i~~~~~~-SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~-~eq~~~e~~~~~~~ 410 (652) +|||.||+++.++.+.++.+. |||+||+++||+|.|||||++| |+||||+|||+|||+|+|.+ +|.+.+.|..+.+. T Consensus 108 q~cL~pThEE~iT~lmat~~~lsykqlPi~vYQigrKfRDElrp-RfGLlRgREFlMKDmYsFd~~~etA~qTy~~v~~a 186 (457) T KOG2324 108 QMCLTPTHEEDITALMATYIPLSYKQLPIRVYQIGRKFRDELRP-RFGLLRGREFLMKDMYSFDSDEETAQQTYQLVDQA 186 (457) T ss_pred EECCCCCHHHHHHHHHHHCCCCCCCCCCEEEEEECHHHHHCCCC-CCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH T ss_conf 73057754899999998627432202867766505454413475-42301247888765330258888999999999999 Q ss_pred HHHHHHHHCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCC Q ss_conf 88766552133210001465311258603577899999999985021212566752055555545543211002110331 Q gi|254781004|r 411 IISIYKDFGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYILKDAIGRD 490 (652) Q Consensus 411 ~~~v~~~~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~~~d~~gr~ 490 (652) |..+|+.+|+..+.+.+.++..++.-++|++-..+......+.+..+.+..+.+-...++- +. -|+..-.--...... T Consensus 187 Y~~iFkqL~~pfVkv~AdsG~iGG~vShEfhl~~~vgED~l~~C~~C~~s~n~e~~~~sk~-~~-Cp~C~~~~L~~~~~I 264 (457) T KOG2324 187 YDRIFKQLGLPFVKVWADSGDIGGEVSHEFHLIHPVGEDTLMSCPSCGYSKNSEDLDLSKI-AS-CPKCNEGRLTKTKSI 264 (457) T ss_pred HHHHHHHCCCCEEEEEECCCCCCCEEEEEEECCCCCCCCCEEECCCCCCCCCHHHHCCCCC-CC-CCCCCCCCCCCCCCE T ss_conf 9999997399769986035664761012575257667500344676775576012137765-66-876567775112434 Q ss_pred CCCCCCCHHHHCCCCCCCEEEEECCCEEEEEEEEECCCHH-HHHHHHHHHHH----HCCCCCCCCCCEEEEEEEC-CHHH Q ss_conf 0012110111022346741673179721379964034103-78999999997----3870887337808999857-9668 Q gi|254781004|r 491 WQCGTIQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFGS-IERFIGIMIEN----FKGNLPLWLSPIQAIVTTI-TSSA 564 (652) Q Consensus 491 ~~~~tiq~df~~p~~f~l~y~~~dg~~~~pvmihr~~~Gs-~eR~ia~liE~----~~g~~P~wLAP~QV~Iipi-~e~~ 564 (652) ..++|+.++..+++.|+++|++-.|+.+. +|+||||+ ++|++||.+|- .+.+||.-||||+|++|+. +... T Consensus 265 EVgHtF~LG~kYS~~lna~f~~~~gKpe~---l~MgCyGIGVtRllaAa~evls~~~~lrwP~~iAPy~vcli~pk~~~~ 341 (457) T KOG2324 265 EVGHTFLLGTKYSKPLNAKFVNVEGKPEF---LHMGCYGIGVTRLLAAAAEVLSDDKGLRWPSLIAPYKVCLIGPKKGSK 341 (457) T ss_pred EEEEEEEECCCCCCCCCCEEEEECCCCCE---EEECCEECCHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEECCCCCCH T ss_conf 77778871430154347655440598406---874020004898999999981565543150002751347853688402 Q ss_pred HHHHHHHHHHHHHCC------CEEEEECCCC-CCCHHHHHHHHCCCCEEEEEC-HHHHHCCCEEEEECCCCCEEEECHHH Q ss_conf 999999999999879------8899975896-636766655434898899987-42532581899988988663342999 Q gi|254781004|r 565 VEYAQEIANLLKSHH------LSIETDFRNE-TINYKIREHSIKKIPIIIICG-DKEASERSIGIRRFGSTTTQKVSLLD 636 (652) Q Consensus 565 ~eya~~i~~~L~~~g------irV~~Ddr~~-~~G~Kir~a~l~giP~~ivIG-~ke~e~~~Vtir~R~~~~~~~i~lee 636 (652) ..-+.+|+.+|.... =.+++|||.+ ++|+|+++|..+|+|++|||| .+-..+..++|+.+..++...+..+. T Consensus 342 ~q~~~ev~~el~~~~~~~~l~~~iLlddr~~ltiG~Ri~dA~~lG~PfviVvg~s~~~~~~~~EV~~~~~ge~~~l~~~~ 421 (457) T KOG2324 342 SQRAQEVISELLNDEAVGNLHGEILLDDREELTIGKRIKDANRLGIPFVIVVGNSASWDNPEIEVRTIRWGESAELDKDG 421 (457) T ss_pred HHHHHHHHHHHHCCHHHHHHCCCEECCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCEEEEEEEECCCEECCCHHH T ss_conf 31399999876043234443245001306663267766567754998799970334678965899982047320252566 Q ss_pred HHHHHHH Q ss_conf 9999987 Q gi|254781004|r 637 AIPILTK 643 (652) Q Consensus 637 ~i~~l~~ 643 (652) +.+.+.+ T Consensus 422 ~~~l~~~ 428 (457) T KOG2324 422 FMKLLSE 428 (457) T ss_pred HHHHHHH T ss_conf 8887432 No 20 >cd00779 ProRS_core_prok Prolyl-tRNA synthetase (ProRS) class II core catalytic domain. ProRS is a homodimer. It is responsible for the attachment of proline to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs in the core domain. This subfamily contains the core domain of ProRS from prokaryotes and from the mitochondria of eukaryotes. Probab=100.00 E-value=0 Score=432.20 Aligned_cols=250 Identities=23% Similarity=0.392 Sum_probs=209.1 Q ss_pred HHHHHHHHHHHHCCEEEECCCCCCEEEEECCHHHHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 532008875411440442267874356501078999999999998876-8963855543244310001121001255542 Q gi|254781004|r 241 SEKRDHRKLAREMDLFHIAEDGSGVIFWHRKGWKIFQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYRANMF 319 (652) Q Consensus 241 a~~rdHr~lg~~~~lf~~~~~~~G~~~wlP~G~~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf 319 (652) |+...|+-|= .. =+|++.+||+|+|+|+|++|+++|++++|++++ .||+||.||.|.|++||++||||++|+++|| T Consensus 2 ~~~~s~~ll~-~a--g~i~~~~~G~~~~lP~G~~i~~~i~~~i~~~~~~~G~~ev~~P~l~~~~l~~~sg~~~~f~~emf 78 (255) T cd00779 2 AEIISHKLLL-RA--GFIRQTSSGLYSWLPLGLRVLKKIENIIREEMNKIGAQEILMPILQPAELWKESGRWDAYGPELL 78 (255) T ss_pred CCHHHHHHHH-HC--CCCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHCCCCCCCCCEE T ss_conf 4256799998-75--88765788627885029999999999999999985998997653473888744253013687315 Q ss_pred EEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECC-HH Q ss_conf 01001456777823774343125777765201222231043200010001278764520001056353344210105-37 Q gi|254781004|r 320 AVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCT-KE 398 (652) Q Consensus 320 ~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~-~e 398 (652) .+. ++++++|+|+|||||.|+.+|+++++||||||+++|||++|||||+|| ++||+|+|||+|+|||+||. ++ T Consensus 79 ~v~-----d~~~~~~~L~PT~E~~~~~~~~~~i~SyrdLPl~~~q~~~~fR~E~r~-~~GllR~rEF~~~d~hsf~~~~e 152 (255) T cd00779 79 RLK-----DRHGKEFLLGPTHEEVITDLVANEIKSYKQLPLNLYQIQTKFRDEIRP-RFGLMRGREFLMKDAYSFDIDEE 152 (255) T ss_pred EEE-----CCCCCEEEECCCCHHHHHHHHHHHCCCHHHCCEEEECCCCEECCCCCC-CCCCCEEEEEEEEHHHCCCCCHH T ss_conf 445-----378877731688748899999865036122574541353354256678-86632431377002203569989 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 67887899999988766552133210001465311258603577899999999985021212566752055555545543 Q gi|254781004|r 399 QMFNECLKIHNLIISIYKDFGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPK 478 (652) Q Consensus 399 q~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpk 478 (652) ++.+++..+.+.|..+++.+|+....+.+++++.++..+++++..+. T Consensus 153 ~a~~~~~~~~~~y~~i~~~Lglp~~~v~~~~g~~g~~~s~ef~~~~~--------------------------------- 199 (255) T cd00779 153 SLEETYEKMYQAYSRIFKRLGLPFVKVEADSGAIGGSLSHEFHVLSP--------------------------------- 199 (255) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCEEEEEEEC--------------------------------- T ss_conf 99999999999999999983998799854766778874579999824--------------------------------- Q ss_pred CCCCHHHHCCCCCCCC-CCCHHHHCCCCCCCEEEEECCCEEEEEEEEECCCH-HHHHHHHHHHHH Q ss_conf 2110021103310012-11011102234674167317972137996403410-378999999997 Q gi|254781004|r 479 FEYILKDAIGRDWQCG-TIQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFG-SIERFIGIMIEN 541 (652) Q Consensus 479 id~~~~d~~gr~~~~~-tiq~df~~p~~f~l~y~~~dg~~~~pvmihr~~~G-s~eR~ia~liE~ 541 (652) + ..++..+++ +++++.+++++|+++|.|+||++++|+| +||| |++|+||||||+ T Consensus 200 ----l--~~~~~lEvg~~~~lG~~~s~~~~i~y~d~dg~~~~~~m---~s~Gi~i~R~i~AliEq 255 (255) T cd00779 200 ----L--KITKGIEVGHIFQLGTKYSKALGATFLDENGKPKPLEM---GCYGIGVSRLLAAIIEQ 255 (255) T ss_pred ----C--CCCCEEEEEEEECCCCHHHHHCCCEEECCCCCEEEEEE---ECCCCHHHHHHHHHCCC T ss_conf ----7--77756899745505625578559999999999884697---06504898888884049 No 21 >cd00772 ProRS_core Prolyl-tRNA synthetase (ProRS) class II core catalytic domain. ProRS is a homodimer. It is responsible for the attachment of proline to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs in the core domain. Probab=100.00 E-value=0 Score=424.30 Aligned_cols=241 Identities=19% Similarity=0.250 Sum_probs=202.1 Q ss_pred CCCCEEEEECCHHHHHHHHHHHHHHHHHC-CCEEEECCCCCCCCCCCCCCCCCCCCC-CCCEEECCCCCCCCCCEEEECC Q ss_conf 78743565010789999999999988768-963855543244310001121001255-5420100145677782377434 Q gi|254781004|r 261 DGSGVIFWHRKGWKIFQTLISYMRRKIKD-DYEEINTPQVLDQHLWQQSGHWDSYRA-NMFAVKCADDTIKDLRTFALKP 338 (652) Q Consensus 261 ~~~G~~~wlP~G~~i~~~ie~~ir~~~~~-G~~eV~tP~l~~~~Lw~~SGh~~~y~~-~mf~~~~~~~~~~~~~~~~LkP 338 (652) -++|+|+|||+|++|+++|++++|+++++ |+|||.||+|+|.+||++||||+.|++ +||.++. +..+..+++|+||| T Consensus 20 ~~~G~~~~lP~G~~i~~ki~~~i~~e~~~~G~qev~~P~l~p~~lw~ksgr~~~~~~~e~~~~~~-~~~~~~e~~~~L~P 98 (264) T cd00772 20 PGRGIINFLPLAKAILDKIENVLDKMFKEHGAQNALFPFFILASFLEKEAEHDEGFSKELAVFKD-AGDEELEEDFALRP 98 (264) T ss_pred CCCCEEEECHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHCCCHHHCCCCCEEEEEE-CCCCCCCCCCCCCC T ss_conf 98875888905999999999999999998299498125679989999769844437852379973-48742123520167 Q ss_pred CCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEE-CCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 31257777652012222310432000100012787645200010563533442101-05376788789999998876655 Q gi|254781004|r 339 MNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVF-CTKEQMFNECLKIHNLIISIYKD 417 (652) Q Consensus 339 mncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F-~~~eq~~~e~~~~~~~~~~v~~~ 417 (652) ||||.|+.+|+++++||||||+++|||++|||||+|| ++||||+|||+|+|||+| ++.+++.+++..++++|.++|.+ T Consensus 99 T~Ee~~~~~~~~~i~SyrdLPl~lyQ~~~~fR~E~r~-r~GllR~REFlmkDahsf~~t~e~a~~~y~~~~~~Y~~if~~ 177 (264) T cd00772 99 TLEENIGEIAAKFIKSWKDLPQHLNQIGNKFRDEIRP-RFGFLRAREFIMKDGHSAHADAEEADEEFLNMLSAYAEIARD 177 (264) T ss_pred CCCHHHHHHHHHHCCCHHHCCHHHHHHHHHHHCCCCC-CCCCHHHEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 7868999999864076655797787655786057885-543323204341103532599999999999999999999999 Q ss_pred HCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCC Q ss_conf 21332100014653112586035778999999999850212125667520555555455432110021103310012110 Q gi|254781004|r 418 FGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYILKDAIGRDWQCGTIQ 497 (652) Q Consensus 418 ~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~~~d~~gr~~~~~tiq 497 (652) +|...+....++.+++++|.+..|+. +.....|.+ .....+++++ T Consensus 178 lg~lp~~~~~~~~g~kf~Ga~~s~ef--------------------~~~~~~G~~---------------~~~ev~H~f~ 222 (264) T cd00772 178 LAAIDFIEGEADEGAKFAGASKSREF--------------------EALMEDGKA---------------KQAETGHIFG 222 (264) T ss_pred HCCCCCEEEECCCCCCCCCCEEEEEE--------------------EEECCCCCC---------------CEEEEEEECC T ss_conf 66985167644778856885016899--------------------998788871---------------3644688815 Q ss_pred HHHHCCCCCCCEEEEECCCEEEEEEEEECCCH-HHHHHHHHHHHH Q ss_conf 11102234674167317972137996403410-378999999997 Q gi|254781004|r 498 VDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFG-SIERFIGIMIEN 541 (652) Q Consensus 498 ~df~~p~~f~l~y~~~dg~~~~pvmihr~~~G-s~eR~ia~liE~ 541 (652) ++|.++..++++|.++||+.+.++| +||| |++|+|||+||+ T Consensus 223 ~g~~~s~~~~~~f~~~dg~~~~v~m---~syGIgitRligaiIeq 264 (264) T cd00772 223 EGFARAFDLKAKFLDKDGKEKFFEM---GCWGIGISRFIGAIIEQ 264 (264) T ss_pred CCCCEECCCCCEEECCCCCEEEEEE---CCCCCHHHHHHHEEECC T ss_conf 7850122566578889999822580---36778177444405529 No 22 >PRK04173 glycyl-tRNA synthetase; Provisional Probab=100.00 E-value=0 Score=368.59 Aligned_cols=346 Identities=19% Similarity=0.278 Sum_probs=272.1 Q ss_pred CCCEEEEECCHHHHHHHHHHHHHHHH--HC-CCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECC-------------- Q ss_conf 87435650107899999999999887--68-96385554324431000112100125554201001-------------- Q gi|254781004|r 262 GSGVIFWHRKGWKIFQTLISYMRRKI--KD-DYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCA-------------- 324 (652) Q Consensus 262 ~~G~~~wlP~G~~i~~~ie~~ir~~~--~~-G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~-------------- 324 (652) -+|+|=|=|+|..+.++|.+.||+++ .+ +..+|-+|+|.+...|+.|||+++|.|-|.-.... T Consensus 29 ~ag~~DyGPlG~~LK~Ni~~~W~~~fv~~~~~~~~id~~ii~~~~V~~ASGHv~~F~D~~v~c~~~~~r~RaD~Lie~~~ 108 (460) T PRK04173 29 LAGFWDYGPLGVELKNNIKKAWWKSFVQERPDVVGIDSPIIMPPEVWKASGHVDNFSDPLVECKKCKKRYRADHLIEEYL 108 (460) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHCCCCCCCCCCCEEECCCCCCEECHHHHHHHHH T ss_conf 12361558535999999999999998652587788614444898785446774656762347576783630167766653 Q ss_pred ---------------------------------------------CCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHH-C Q ss_conf ---------------------------------------------456777823774343125777765201222231-0 Q gi|254781004|r 325 ---------------------------------------------DDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRE-L 358 (652) Q Consensus 325 ---------------------------------------------~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrd-L 358 (652) .........-.|||-++-++..-|++-.++.|. | T Consensus 109 ~~~~~~~~~~~l~~~i~~~~i~cP~cg~~~lt~~~~FNLMF~T~iG~~~~~~~~~YLRPETAQGiFvnFk~v~~~~r~Kl 188 (460) T PRK04173 109 GIDAEGLSNEELEELIEENNIKCPECGGENWTEVRQFNLMFKTFIGPVEDSSSLGYLRPETAQGIFVNFKNVLRTARKKL 188 (460) T ss_pred HHCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCEEECHHHHHHHHHHHHHHHHHCCCCC T ss_conf 20123358999999999708879999996775443120555751355667775023160441045661199999717788 Q ss_pred CHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCH Q ss_conf 43200010001278764520001056353344210105376788789999998876655213321000146531125860 Q gi|254781004|r 359 PVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDFGFEKIMVKLSTRPEKRVGSD 438 (652) Q Consensus 359 Plrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~~g~~ 438 (652) |+.++|+|+.||||+|| ++|++|+|||+|.+...||.|++..+++..++......+..+|+....++.-..+.. T Consensus 189 PFgiaQIGKaFRNEIsP-r~~lfR~REFeQmEiE~F~~P~~~~e~~~yw~~~~~~fl~~lGi~~~~lR~r~h~~~----- 262 (460) T PRK04173 189 PFGIAQIGKSFRNEITP-RNFIFRTREFEQMELEFFVKPGTDEEWFAYWIALRKNFLLDLGIDPENLRFREHLPE----- 262 (460) T ss_pred CEEEEECCCCCCCCCCC-CCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHEEEEECCCH----- T ss_conf 80431004001243065-545122345555243310586745899999999999999983998899478630613----- Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC----CCCCCHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEE- Q ss_conf 357789999999998502121256675205555554554----321100211033100121101110223467416731- Q gi|254781004|r 439 ALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGP----KFEYILKDAIGRDWQCGTIQVDFNLPSRFNAFYVNS- 513 (652) Q Consensus 439 ~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygp----kid~~~~d~~gr~~~~~tiq~df~~p~~f~l~y~~~- 513 (652) +...+|. ....+++...-|.....|. ..|...... . ..-+++|.++ T Consensus 263 elAHYa~-------------~~~D~E~~~~~Gw~E~~Gia~RtdyDL~~H~~---------------~-Sg~~L~~~d~~ 313 (460) T PRK04173 263 ELAHYSK-------------ATWDIEYKFPFGWGELEGIANRTDYDLSRHSK---------------H-SGEDLSYFDQE 313 (460) T ss_pred HHCCCCH-------------HHEEEEEECCCCEEEEEEECCCCHHHHHHHHH---------------H-CCCCEEEEECC T ss_conf 4200022-------------00558994589828876640642102788987---------------4-48871798225 Q ss_pred CCCEEEEEEEEECCCHHHHHHHHHHHHH-H------C------CCCCCCCCCEEEEEEEC--CHHHHHHHHHHHHHHHHC Q ss_conf 7972137996403410378999999997-3------8------70887337808999857--966899999999999987 Q gi|254781004|r 514 HSEKCHPVMIHRAVFGSIERFIGIMIEN-F------K------GNLPLWLSPIQAIVTTI--TSSAVEYAQEIANLLKSH 578 (652) Q Consensus 514 dg~~~~pvmihr~~~Gs~eR~ia~liE~-~------~------g~~P~wLAP~QV~Iipi--~e~~~eya~~i~~~L~~~ 578 (652) .|++-.|.+|- .++| ++|++.+++++ | . .+||+.|||+||+|+|+ +++..+.|++|++.|++. T Consensus 314 ~~~k~iPhVIE-pS~G-vdR~~la~L~~ay~ee~~~~~e~R~vL~l~p~lAPikvaVlPL~kk~~l~~~a~~i~~~L~~~ 391 (460) T PRK04173 314 TGEKYIPYVIE-PSFG-LDRLFLAFLEDAYTEEELGDGDKRTVLRLPPALAPVKVAVLPLVKKDKLSEKAREIYAELRKD 391 (460) T ss_pred CCCEECCEEEE-ECCC-CCHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHC T ss_conf 58561340564-0467-108999999876520004787505899626645771699996556612589999999999877 Q ss_pred CCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCEEEECHHHHHHHHHHHC Q ss_conf 9889997589663676665543489889998742532581899988988663342999999998731 Q gi|254781004|r 579 HLSIETDFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGSTTTQKVSLLDAIPILTKES 645 (652) Q Consensus 579 girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~~i~lee~i~~l~~e~ 645 (652) |++|.+|+++ |||+|+|++|.+|+||+|+|+.+.++++|||||.||+++|..|+++++.+.|.+.. T Consensus 392 ~~~~~~D~sg-sIGkRY~R~DeiGtPf~ITiD~~tled~tVTiR~RDt~~Q~Rv~i~~l~~~l~~~~ 457 (460) T PRK04173 392 GFNVDYDDSG-SIGKRYRRQDEIGTPFCVTVDFDTLEDNTVTIRDRDTMEQVRVKIDELEDYLAEKL 457 (460) T ss_pred CCEEEEECCC-CHHHHHHHCCCCCCCEEEEECCCCCCCCCEEEEECCCCCEEEEEHHHHHHHHHHHH T ss_conf 9759996899-85454453102689879997688676894897835875348989999999999986 No 23 >KOG4163 consensus Probab=100.00 E-value=0 Score=368.75 Aligned_cols=371 Identities=19% Similarity=0.304 Sum_probs=302.7 Q ss_pred EEEEEEEEECCHHHHHHHHHHH-HHHHHHHHHHHHHHCCEEEECCC-CCCEEEEECCHHHHHHHHHHHHHHHHHC-CCEE Q ss_conf 6899999837988999999999-97532008875411440442267-8743565010789999999999988768-9638 Q gi|254781004|r 217 LSRIYGTAWNTQQELTQYLYFL-EESEKRDHRKLAREMDLFHIAED-GSGVIFWHRKGWKIFQTLISYMRRKIKD-DYEE 293 (652) Q Consensus 217 l~Riyg~af~~~~~l~~~~~~~-eea~~rdHr~lg~~~~lf~~~~~-~~G~~~wlP~G~~i~~~ie~~ir~~~~~-G~~e 293 (652) =++..|+.-...+++.+|--.. ..+ | |++=. -+|+|+..|.|+.||+.|+.|...+.+. |.++ T Consensus 54 ~~~~~gl~~~k~~df~~WY~qVitk~----------e----mieYydvsGcyilRP~s~aIWe~Iq~wfd~~ik~lGv~n 119 (551) T KOG4163 54 KATLLGLTAKKEEDFPEWYSQVITKG----------E----MIEYYDVSGCYILRPWSYAIWEAIQDWFDAEIKKLGVKN 119 (551) T ss_pred HHHHCCCCCCCCCCHHHHHHHHHHHH----------H----HHEEECCCCEEEECCHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 55421553341123178999985434----------1----100233541489553478999999999987888736434 Q ss_pred EECCCCCCCCCCCC-CCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCC Q ss_conf 55543244310001-12100125554201001456777823774343125777765201222231043200010001278 Q gi|254781004|r 294 INTPQVLDQHLWQQ-SGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNE 372 (652) Q Consensus 294 V~tP~l~~~~Lw~~-SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E 372 (652) .+.|.+.+...-++ ..|.+.|.++.=+++.. +....++..++|||++..+...|..|++|||||||+|.||.++.||| T Consensus 120 cYFPmfVs~~~LEkEk~Hve~FaPEvAwVTr~-G~seLeepiaiRPTSETvmyp~yakWi~ShRDLPlkLNQW~nVvRWE 198 (551) T KOG4163 120 CYFPMFVSKSVLEKEKDHVEGFAPEVAWVTRA-GNSELEEPIAIRPTSETVMYPYYAKWIQSHRDLPLKLNQWCNVVRWE 198 (551) T ss_pred CEEEEECCHHHHHHHHHHHCCCCCCEEEEEEC-CCCCCCCCEEECCCCCCEECHHHHHHHHHHCCCCHHHHHHHHHEEEE T ss_conf 01033247888765565541578532799703-88411464223467521103888998876046754665432223530 Q ss_pred CCCCCEEEEECCCCCCCCCEE-ECCHHHHHHHHHHHHHHHHHHHHHHCCC-CCEECCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 764520001056353344210-1053767887899999988766552133-21000146531125860357789999999 Q gi|254781004|r 373 PSGSLHGLMRVRGFTQDDAHV-FCTKEQMFNECLKIHNLIISIYKDFGFE-KIMVKLSTRPEKRVGSDALWDDAENIMKG 450 (652) Q Consensus 373 ~sg~l~GL~R~ReF~~~Dah~-F~~~eq~~~e~~~~~~~~~~v~~~~g~~-~~~~~ls~~~~~~~g~~~~~~~ae~~l~~ 450 (652) .++. .+.+|+|||++.++|+ |.+++.+++|+..++++|..+|..+--. .+..+. +..+++.|.+. T Consensus 199 fk~p-~PFlRtrEFLWQEGHTAfat~~eA~eEvlqiLdlYa~vy~ellAiPVvkGrK-se~EkFaGgd~----------- 265 (551) T KOG4163 199 FKHP-QPFLRTREFLWQEGHTAFATPEEAEEEVLQILDLYARVYEELLAIPVVKGRK-SEKEKFAGGDY----------- 265 (551) T ss_pred CCCC-CCCHHHHHHHHHCCCCHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC-CHHHHCCCCCC----------- T ss_conf 2489-7512355777745730107976778999999999999987620343014763-45330257750----------- Q ss_pred HHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCC-CHHHHCCCCCCCEEEEE-CCCEEEEEEEEECCC Q ss_conf 9985021212566752055555545543211002110331001211-01110223467416731-797213799640341 Q gi|254781004|r 451 VLDTIKNSSQDRINTGVLLGEGAFYGPKFEYILKDAIGRDWQCGTI-QVDFNLPSRFNAFYVNS-HSEKCHPVMIHRAVF 528 (652) Q Consensus 451 ~l~~~~~~~~~~~~~~~~~g~~afygpkid~~~~d~~gr~~~~~ti-q~df~~p~~f~l~y~~~-dg~~~~pvmihr~~~ 528 (652) ...++ .+....||..|++|. .++.|+++.|++.|.+. +|+++.++ +.++ T Consensus 266 ---------TttvE-----------------a~i~~~GrgiQgaTSH~LGQNFSkmF~i~~ed~~~g~~~fv~---QnSW 316 (551) T KOG4163 266 ---------TTTVE-----------------AFIPCSGRGIQGATSHHLGQNFSKMFEIVFEDPGEGEKEFVW---QNSW 316 (551) T ss_pred ---------EEEEE-----------------EEECCCCCCCCCCCHHHHHHHHHHHHCEEECCCCCCCHHHEE---ECCC T ss_conf ---------47776-----------------652466655554320256677998623441588741122033---1255 Q ss_pred HHHHHHHHHHHHHHC----CCCCCCCCCEEEEEEECC----------HHHHHHHHHHHHHHHHCCCEEEEECCCC-CCCH Q ss_conf 037899999999738----708873378089998579----------6689999999999998798899975896-6367 Q gi|254781004|r 529 GSIERFIGIMIENFK----GNLPLWLSPIQAIVTTIT----------SSAVEYAQEIANLLKSHHLSIETDFRNE-TINY 593 (652) Q Consensus 529 Gs~eR~ia~liE~~~----g~~P~wLAP~QV~Iipi~----------e~~~eya~~i~~~L~~~girV~~Ddr~~-~~G~ 593 (652) |..+|+||++++.|| .++|+.+||+||+|||+. +...++|+.+..+|..+|+||..|+||+ ++|+ T Consensus 317 g~sTRtiGvmiM~HgDdkGLvLPPrVA~vQvVvvP~git~~~s~~~~~~l~~a~~~v~~~L~~~giR~~~D~rDnytpGw 396 (551) T KOG4163 317 GLSTRTIGVMIMTHGDDKGLVLPPRVAPVQVVVVPVGITDATSEEDKQELLDACSAVESRLLGAGIRAEADLRDNYTPGW 396 (551) T ss_pred CCCCCEEEEEEEEECCCCCCCCCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCC T ss_conf 53421134799983178653048752413799995663566746778999999999999985168167503434678885 Q ss_pred HHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCEEEECHHHHHHHHHHH Q ss_conf 666554348988999874253258189998898866334299999999873 Q gi|254781004|r 594 KIREHSIKKIPIIIICGDKEASERSIGIRRFGSTTTQKVSLLDAIPILTKE 644 (652) Q Consensus 594 Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~~i~lee~i~~l~~e 644 (652) ||.++++.|||++|.||+++++++.|.+.+||++++..|+++++.+.+++. T Consensus 397 KfnhwElkGVPlRiEiGPrD~~~~qv~~VrRd~~~K~~v~~~~l~~~v~el 447 (551) T KOG4163 397 KFNHWELKGVPLRIEIGPRDLASNQVVAVRRDTGEKKDVSLGDLEKTVKEL 447 (551) T ss_pred CCCCEEECCCEEEEEECCCHHHHCEEEEEECCCCCCCCCCHHHHHHHHHHH T ss_conf 333004327416887563302008389998357753455778899999999 No 24 >PRK00037 hisS histidyl-tRNA synthetase; Reviewed Probab=100.00 E-value=0 Score=353.01 Aligned_cols=357 Identities=17% Similarity=0.253 Sum_probs=250.8 Q ss_pred CCCEEEEECCHHHHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCC--CCCCCCCCCCCEEECCCCCCCCCCEEEECC Q ss_conf 874356501078999999999998876-89638555432443100011--210012555420100145677782377434 Q gi|254781004|r 262 GSGVIFWHRKGWKIFQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQS--GHWDSYRANMFAVKCADDTIKDLRTFALKP 338 (652) Q Consensus 262 ~~G~~~wlP~G~~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~S--Gh~~~y~~~mf~~~~~~~~~~~~~~~~LkP 338 (652) -.|+.=|+|.++++++.|++.+|..++ +||+||.||++-..++|..+ ++++.+.++||.+. |+.++.++||| T Consensus 7 p~G~rD~lP~~~~~~~~i~~~i~~~~~~~Gy~~I~tP~lE~~d~~~~~~g~~~d~~~k~~y~f~-----D~~g~~l~LRp 81 (417) T PRK00037 7 PRGTRDILPEESAKWQYVEDTIREVFERYGFSEIRTPIFEYTELFKRKIGESTDIVEKEMYTFQ-----DKGGRSLTLRP 81 (417) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCHHHHHCCCCCCCCCHHHHHEEEEE-----CCCCCEEEECC T ss_conf 9987767988999999999999999998599585775215098862457763220133205677-----59998898567 Q ss_pred CCHHHHHH-HHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 31257777-65201222231043200010001278764520001056353344210105376788789999998876655 Q gi|254781004|r 339 MNCPGHVA-VFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKD 417 (652) Q Consensus 339 mncp~h~~-i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~ 417 (652) +.||.++- +..+...+|++||+|+|++|++||+|.++ .| |.|||+|.|+|+|+.++...+ ..++.++..++.. T Consensus 82 D~T~~iaR~~~~~~~~~~~~~P~r~~y~g~vfR~e~p~--~G--R~REf~Q~g~eiiG~~~~~aD--aEvi~l~~~~l~~ 155 (417) T PRK00037 82 EGTAPVVRAVIENKLYNELPKPFKLYYIGPMFRYERPQ--KG--RYRQFHQFGVEVIGSDSPAAD--AEVIALAADLLKA 155 (417) T ss_pred CCCHHHHHHHHHHHHHCCCCCCEEEEEECCEEECCCCC--CC--CCCCEEECCHHHHCCCCHHHH--HHHHHHHHHHHHH T ss_conf 76729999999721010688765899872466047898--87--635245635277466417888--9999999999998 Q ss_pred HCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCC----CCC-------CCCCCCCCCC--CCCC-CCCH Q ss_conf 213321000146531125860357789999999998502121256----675-------2055555545--5432-1100 Q gi|254781004|r 418 FGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDR----INT-------GVLLGEGAFY--GPKF-EYIL 483 (652) Q Consensus 418 ~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~----~~~-------~~~~g~~afy--gpki-d~~~ 483 (652) +|+....+.+.. +|+.+.+......|.+.+.......... ... ...++..++. .|++ ++.. T Consensus 156 lGl~~~~i~i~~-----~g~~~~~~~~r~~L~~~l~~~~~~l~~~~~~~~~~~~l~~ld~~~~~~~~~~~~a~~l~~~l~ 230 (417) T PRK00037 156 LGLKGLVLLLNS-----LGDFEIRANYRKALVGYLEKHLDELDEDSKRRLETNPLRILDKKDKEDQEILADAPKLLDYLD 230 (417) T ss_pred CCCCCCCCCCCC-----CCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHCHHHHHHHC T ss_conf 399652001056-----776888999999999999997643387887766542778764238879999973625453327 Q ss_pred HHHCCCCCCCCC--------CCHHHHCCCCCC----CE---EEEECC-------------------CEEEEEEEEECCCH Q ss_conf 211033100121--------101110223467----41---673179-------------------72137996403410 Q gi|254781004|r 484 KDAIGRDWQCGT--------IQVDFNLPSRFN----AF---YVNSHS-------------------EKCHPVMIHRAVFG 529 (652) Q Consensus 484 ~d~~gr~~~~~t--------iq~df~~p~~f~----l~---y~~~dg-------------------~~~~pvmihr~~~G 529 (652) .+++.+.+++.. +.+|+.+...++ +. |....| .+..|-+. .|+ T Consensus 231 ~~~~~~l~~l~~~l~~~~~~~~~Dl~lvRgldYYTG~vFe~~~~~~~~~~si~gGGRYD~L~~~fgg~~~PAvG-fa~-- 307 (417) T PRK00037 231 EESKAHFEELKELLDALGIPYVIDPRLVRGLDYYTGTVFEFVTDGLGAQGTVCGGGRYDGLVEQFGGPPTPAVG-FAI-- 307 (417) T ss_pred HHHHHHHHHHHHHHHHCCCCEEECHHHCCCCCCCCCEEEEEEECCCCCCCEEECCCCHHHHHHHHCCCCCCEEE-EEE-- T ss_conf 78899999999999975996998624505766534338999937876501042587315789984899787478-986-- Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEE Q ss_conf 37899999999738708873378089998579668999999999999879889997589663676665543489889998 Q gi|254781004|r 530 SIERFIGIMIENFKGNLPLWLSPIQAIVTTITSSAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIIC 609 (652) Q Consensus 530 s~eR~ia~liE~~~g~~P~wLAP~QV~Iipi~e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivI 609 (652) ++||++++|.|+ |++|.||.|+|++++...+|.+|++.|+++|++|++|.++.++|++|+.|+..|+||++|| T Consensus 308 gieRi~~~l~~~-------~~~~~~v~v~~~~~~~~~~a~~i~~~Lr~~gi~v~~~~~~~~l~k~~~~A~~~~~~~~viv 380 (417) T PRK00037 308 GIERLLLLLEEL-------GLESVDVYVVVLGEEAEAAALKLAEKLRAAGIRVELDLGGRKLKKQFKYADKSGARFALIL 380 (417) T ss_pred CHHHHHHHHHHC-------CCCCCCEEEEECCHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEE T ss_conf 399999999964-------8787767999659899999999999999889969995899899999999998797999998 Q ss_pred CHHHHHCCCEEEEECCCCCEEEECHHHHHHHHHHH Q ss_conf 74253258189998898866334299999999873 Q gi|254781004|r 610 GDKEASERSIGIRRFGSTTTQKVSLLDAIPILTKE 644 (652) Q Consensus 610 G~ke~e~~~Vtir~R~~~~~~~i~lee~i~~l~~e 644 (652) |++|+++|+|+||++.++++.+++++++++.|++. T Consensus 381 G~~E~~~~~v~vk~l~~g~q~~v~~~~l~~~lk~~ 415 (417) T PRK00037 381 GEDELANGTVTLKDLRTGEQETVALDELVEFLKEL 415 (417) T ss_pred CCHHHHCCEEEEEECCCCCEEEEEHHHHHHHHHHH T ss_conf 74488679689998999971796599999999997 No 25 >TIGR00389 glyS_dimeric glycyl-tRNA synthetase; InterPro: IPR002315 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . In eubacteria, glycyl-tRNA synthetase (6.1.1.14 from EC) is an alpha2/beta2 tetramer composed of 2 different subunits , , . In some eubacteria, in archaea and eukaryota, glycyl-tRNA synthetase is an alpha2 dimer, this family. It belongs to class IIc and is one of the most complex synthetases. What is most interesting is the lack of similarity between the two types: divergence at the sequence level is so great that it is impossible to infer descent from common genes. The alpha (see IPR002310 from INTERPRO) and beta subunits (see IPR002311 from INTERPRO) also lack significant sequence similarity. However, they are translated from a single mRNA , and a single chain glycyl-tRNA synthetase from Chlamydia trachomatis has been found to have significant similarity with both domains, suggesting divergence from a single polypeptide chain . The sequence and crystal structure of the homodimeric glycyl-tRNA synthetase from Thermus thermophilus, shows that each monomer consists of an active site strongly resembling that of the aspartyl and seryl enzymes, a C-terminal anticodon recognition domain of 100 residues and a third domain unusually inserted between motifs 1 and 2 almost certainly interacting with the acceptor arm of tRNA(Gly). The C-terminal domain has a novel five-stranded parallel-antiparallel beta-sheet structure with three surrounding helices. The active site residues most probably responsible for substrate recognition, in particular in the Gly binding pocket, can be identified by inference from aspartyl-tRNA synthetase due to the conserved nature of the class II active site , .; GO: 0000166 nucleotide binding, 0004820 glycine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006426 glycyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=359.76 Aligned_cols=379 Identities=22% Similarity=0.289 Sum_probs=279.5 Q ss_pred CCEEEEECCHHHHHHHHHHHHHHHH--HCC-CEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCC------------ Q ss_conf 7435650107899999999999887--689-6385554324431000112100125554201001456------------ Q gi|254781004|r 263 SGVIFWHRKGWKIFQTLISYMRRKI--KDD-YEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDT------------ 327 (652) Q Consensus 263 ~G~~~wlP~G~~i~~~ie~~ir~~~--~~G-~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~------------ 327 (652) +|++=|=|+|..+.|+|.+.||+.+ +.+ ..+|-||+|++.+.|++|||.|+|.|-|.-....... T Consensus 27 aG~~DyGPLG~~Lk~~I~~~wr~~fi~~e~d~~~id~P~i~~~evlKASGHvd~F~D~~v~Ck~C~~~fRADHl~e~~~~ 106 (606) T TIGR00389 27 AGFWDYGPLGAVLKNNIKNAWRKFFIINERDVLEIDSPIITPEEVLKASGHVDNFTDPMVDCKKCKERFRADHLIEEKLG 106 (606) T ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCHHHHHHCCCCCCCCCCEEECCCCCCEECHHHHHHHHHH T ss_conf 20010157247899999998887511103653464245337257997528645673122673788804010123256651 Q ss_pred --CCC----------------------------------------------------------------------CCEEE Q ss_conf --777----------------------------------------------------------------------82377 Q gi|254781004|r 328 --IKD----------------------------------------------------------------------LRTFA 335 (652) Q Consensus 328 --~~~----------------------------------------------------------------------~~~~~ 335 (652) ..+ .+.=+ T Consensus 107 ~~~~~dqqiK~V~~~D~K~~E~iL~~iDG~~~~eL~E~m~~~~i~CP~C~g~~nl~~v~~FNLMF~T~IG~~~~~~~~gY 186 (606) T TIGR00389 107 KDVEGDQQIKDVKLDDVKEYEKILAKIDGETGPELKEVMEKYDINCPNCGGENNLTEVRSFNLMFQTEIGVVGDSKRKGY 186 (606) T ss_pred HHCCCCCCCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCHHCCCCCEEEECCCCCCCCEEE T ss_conf 00047610012685457779989755078711567877634278898223545613501204541102113347762241 Q ss_pred ECCCCHHHHHHHHHHCCCCHH-HCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCH-HHHHHHHHHHHHHHHH Q ss_conf 434312577776520122223-10432000100012787645200010563533442101053-7678878999999887 Q gi|254781004|r 336 LKPMNCPGHVAVFNHGLKSYR-ELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTK-EQMFNECLKIHNLIIS 413 (652) Q Consensus 336 LkPmncp~h~~i~~~~~~SYr-dLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~-eq~~~e~~~~~~~~~~ 413 (652) |||=+--|+..=|++-+.+-| .||+.++|||+.||||+|| +.||+|+|||+|.++..||.| ++....+..+.+.... T Consensus 187 LRPETAQG~FinFk~LL~~~r~klPFgvAQiGKSfRNEIsP-r~gl~R~REF~QaE~E~FV~P~~k~h~~F~~~~~~~~~ 265 (606) T TIGR00389 187 LRPETAQGIFINFKRLLQFFRNKLPFGVAQIGKSFRNEISP-RQGLIRLREFEQAEIEFFVDPEDKSHPKFEEVKQEVVP 265 (606) T ss_pred CCCHHCCCCCHHHHHHHHHHCCCCCEEEEEECCEEECCCCC-CCCEEEECHHHHHHHHHCCCCCCCCCCCHHHHHHCCCC T ss_conf 47601047512168999974589964899744136040038-98757103154466874259600167200145405001 Q ss_pred HHHHH------C--CCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHC-CCCCCCCCCCCCCCCCCCCCCC-CCCCH Q ss_conf 66552------1--3321000146531125860357789999999998502-1212566752055555545543-21100 Q gi|254781004|r 414 IYKDF------G--FEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIK-NSSQDRINTGVLLGEGAFYGPK-FEYIL 483 (652) Q Consensus 414 v~~~~------g--~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~-~~~~~~~~~~~~~g~~afygpk-id~~~ 483 (652) .+..+ | -..+.+.+...-++++=+.+.-.+.-+.....|...| ......-.......+-|.|..+ .|+-+ T Consensus 266 Llp~~R~~~E~g~~~~~v~~t~gEAVe~G~i~ne~L~YFia~~~~FL~~igkI~~dklRFrqH~~~E~AHYA~dcwD~E~ 345 (606) T TIGR00389 266 LLPAERQMQESGESEKPVKMTIGEAVEKGIIENETLAYFIALVKRFLLEIGKINPDKLRFRQHLKNEMAHYAKDCWDAEI 345 (606) T ss_pred CCCCCCCCHHCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHEEEEE T ss_conf 36764540001576405444178886346521006899999999999865278878852331360045565553623556 Q ss_pred HHH-CCCCCCCC-------------------------------------------------------------------- Q ss_conf 211-03310012-------------------------------------------------------------------- Q gi|254781004|r 484 KDA-IGRDWQCG-------------------------------------------------------------------- 494 (652) Q Consensus 484 ~d~-~gr~~~~~-------------------------------------------------------------------- 494 (652) ... -|-.+-.| T Consensus 346 ~t~~yGwiE~~GiADR~~yDL~~Hsk~s~~~L~Vf~~~DeP~~~~~~~i~~~~k~~G~~FrkdAk~ies~L~~lse~dl~ 425 (606) T TIGR00389 346 LTSRYGWIEVVGIADRTDYDLTQHSKFSGKDLSVFQELDEPREVTKREIELNKKKLGPKFRKDAKKIESALENLSEDDLE 425 (606) T ss_pred ECCCCCCEEEEEEECCCCCCHHHHHHHHCCCCEEEECCCCCEEEEEEEEEEECCCCCCHHHCHHHHHHHHHHHCCHHHHH T ss_conf 65798738998740677221688999849896163057887257788987301346843201178999999732874789 Q ss_pred ------------CCCHHH---H-CCCCCCCE--EEEECCCEEEEEEEEECCCHHHHHHHHHHHHHH-------C-----C Q ss_conf ------------110111---0-22346741--673179721379964034103789999999973-------8-----7 Q gi|254781004|r 495 ------------TIQVDF---N-LPSRFNAF--YVNSHSEKCHPVMIHRAVFGSIERFIGIMIENF-------K-----G 544 (652) Q Consensus 495 ------------tiq~df---~-~p~~f~l~--y~~~dg~~~~pvmihr~~~Gs~eR~ia~liE~~-------~-----g 544 (652) |++.|- . .+.-+.++ -.-..+++-.|.+| -.++| ++|++.+|++|- + . T Consensus 426 ~~~~~l~~~G~f~~~~dG~~ve~~~~~v~i~~~~~v~~~ek~iP~VI-EPS~G-i~Ri~Y~~l~hsy~~~~~d~e~R~vL 503 (606) T TIGR00389 426 EVEKELDEEGKFIIEVDGKEVEILKDLVEIKEVEEVVTGEKYIPHVI-EPSFG-IDRIIYSLLEHSYQEEVLDGEEREVL 503 (606) T ss_pred HHHHHHHCCCEEEEEECCEEEEECHHHEEEEEEEEEEEEEEEECCEE-ECCCH-HHHHHHHHHHHHHCCCCCCCCCEEEE T ss_conf 99998634986999985818987121242666678885348806076-16750-37899999987432351027861374 Q ss_pred CCCCCCCCEEEEEEECC--HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEE Q ss_conf 08873378089998579--6689999999999998798899975896636766655434898899987425325818999 Q gi|254781004|r 545 NLPLWLSPIQAIVTTIT--SSAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIR 622 (652) Q Consensus 545 ~~P~wLAP~QV~Iipi~--e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir 622 (652) +||+||||+||.|+|+. ++..+-|.+|++.|++.++++.+|||+-|||||||++|.+|+||+|+|.-+.+|++||||| T Consensus 504 ~Lp~~~aP~kv~VlPLvnk~~l~~~a~~i~~~Lr~~~i~~~~DdSG~SIGkRYrR~DEIGtPF~vT~DfetlED~tVT~R 583 (606) T TIGR00389 504 RLPPHVAPIKVAVLPLVNKEELKEIAKEIEQALRKSGIIIEYDDSGTSIGKRYRRADEIGTPFCVTVDFETLEDETVTIR 583 (606) T ss_pred ECCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHCCCCCCCCCEEEECCCCCCCCEEEEE T ss_conf 15866285679996311565407899999999862697699877898610031014436887158742531107646773 Q ss_pred ECCCCCEEEECHHHHHHHHHHH Q ss_conf 8898866334299999999873 Q gi|254781004|r 623 RFGSTTTQKVSLLDAIPILTKE 644 (652) Q Consensus 623 ~R~~~~~~~i~lee~i~~l~~e 644 (652) .||+..|.+++++|++..|++. T Consensus 584 eRds~~QvRv~I~eL~~~~~~~ 605 (606) T TIGR00389 584 ERDSLKQVRVKIEELVSYLKKL 605 (606) T ss_pred ECCCCEEEEEEHHHHHHHHHHC T ss_conf 0788404536634389999860 No 26 >cd00770 SerRS_core Seryl-tRNA synthetase (SerRS) class II core catalytic domain. SerRS is responsible for the attachment of serine to the 3' OH group of ribose of the appropriate tRNA. This domain It is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs in the core domain. SerRS synthetase is a homodimer. Probab=100.00 E-value=0 Score=348.57 Aligned_cols=269 Identities=19% Similarity=0.332 Sum_probs=223.3 Q ss_pred HHHHHHHHHHHHHCCEEEECC---C-CCCEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 753200887541144044226---7-87435650107899999999999887-689638555432443100011210012 Q gi|254781004|r 240 ESEKRDHRKLAREMDLFHIAE---D-GSGVIFWHRKGWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSY 314 (652) Q Consensus 240 ea~~rdHr~lg~~~~lf~~~~---~-~~G~~~wlP~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y 314 (652) .-++||||+||++|+||.|+. . |+|+|+|.|.|+++.+.|++|+++.+ ++||+||.||.|.+.++|+.||||++| T Consensus 15 ~f~~rdH~~Lg~~L~L~dfe~~akvsG~rf~~l~g~ga~Le~AL~~f~~d~~~~~Gy~eV~tP~iv~~~~~~~sG~~~~~ 94 (297) T cd00770 15 DFKPKDHVELGEKLDILDFERGAKVSGSRFYYLKGDGALLERALINFALDFLTKRGFTPVIPPFLVRKEVMEGTGQLPKF 94 (297) T ss_pred CCCCCCHHHHHHHCCCEECCCCCCCCCCEEEEECCHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCHHHHHHHHCCCCCC T ss_conf 99988999999977881023224689877799888899999999999999999879989988530138998543578763 Q ss_pred CCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCC---CEEEEECCCCCCCCC Q ss_conf 55542010014567778237743431257777652012222310432000100012787645---200010563533442 Q gi|254781004|r 315 RANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGS---LHGLMRVRGFTQDDA 391 (652) Q Consensus 315 ~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~---l~GL~R~ReF~~~Da 391 (652) .++||... +++++|.||+|+.++.+|++.+.||+|||+|++.++.+||.|.++. ++||+|+|+|++.|+ T Consensus 95 ~e~~y~i~--------~~~~~Li~tae~~l~~~~~~~~~~~~dLPlr~~~~s~cfR~Eags~G~d~~GL~RvhqF~kvE~ 166 (297) T cd00770 95 DEQLYKVE--------GEDLYLIATAEVPLAALHRDEILEEEELPLKYAGYSPCFRKEAGSAGRDTRGLFRVHQFEKVEQ 166 (297) T ss_pred HHHHHCCC--------CHHHHHHHCCCCCEEEEEEEEECCHHHCCCCEEECCCHHHHCCCCCCCCCCCEEEEEECCEEEE T ss_conf 56521035--------5055554405666223442003177558865022463413203777755554578872110037 Q ss_pred EEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 10105376788789999998876655213321000146531125860357789999999998502121256675205555 Q gi|254781004|r 392 HVFCTKEQMFNECLKIHNLIISIYKDFGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGE 471 (652) Q Consensus 392 h~F~~~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~ 471 (652) |+||+|||+.+++..++.....+++.+|+....+.+++++-+..... +.+++ .+.++. T Consensus 167 ~~~c~pe~s~~~~e~~~~~~e~~l~~L~lpyrvv~~~tgdlg~~a~~---------------------~~DiE-~W~P~~ 224 (297) T cd00770 167 FVFTKPEESWEELEELISNAEEILQELGLPYRVVNICTGDLGFAAAK---------------------KYDIE-AWMPGQ 224 (297) T ss_pred EEECCHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCHHH---------------------EEEEE-EEEHHH T ss_conf 76168788999999999999999998155014501246555732222---------------------07656-310541 Q ss_pred CCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEECCCEEEEEEEEECCCHHHHHHHHHHHHHHC-----CCC Q ss_conf 554554321100211033100121101110223467416731797213799640341037899999999738-----708 Q gi|254781004|r 472 GAFYGPKFEYILKDAIGRDWQCGTIQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFGSIERFIGIMIENFK-----GNL 546 (652) Q Consensus 472 ~afygpkid~~~~d~~gr~~~~~tiq~df~~p~~f~l~y~~~dg~~~~pvmihr~~~Gs~eR~ia~liE~~~-----g~~ 546 (652) +++ ++..-.+...||+ +.|+++.|.+++|.+..+++.+.|+.++++|+|+||+|+|- ... T Consensus 225 ~~y--------------~EvsS~Snc~DfQ-arRl~iry~~~~~~~~~~~htlngt~~A~~R~l~allEn~q~~dg~v~i 289 (297) T cd00770 225 GKY--------------REISSCSNCTDFQ-ARRLNIRYRDKKDGKKQYVHTLNGTALATPRTIVAILENYQTEDGSVVI 289 (297) T ss_pred CCE--------------EEEEECCCCHHHH-HHHCCCEEECCCCCCEEEEEEECCCCCHHHHHHHHHHHHCCCCCCCEEC T ss_conf 976--------------8861315400066-6523677734899952654881363208888999999727389987868 Q ss_pred CCCCCCE Q ss_conf 8733780 Q gi|254781004|r 547 PLWLSPI 553 (652) Q Consensus 547 P~wLAP~ 553 (652) |.-|-|| T Consensus 290 P~~L~~y 296 (297) T cd00770 290 PEVLRPY 296 (297) T ss_pred CCCCCCC T ss_conf 8003131 No 27 >COG0423 GRS1 Glycyl-tRNA synthetase (class II) [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=339.47 Aligned_cols=376 Identities=22% Similarity=0.285 Sum_probs=254.1 Q ss_pred CCCEEEEECCHHHHHHHHHHHHHHHH--H-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCC----------- Q ss_conf 87435650107899999999999887--6-896385554324431000112100125554201001456----------- Q gi|254781004|r 262 GSGVIFWHRKGWKIFQTLISYMRRKI--K-DDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDT----------- 327 (652) Q Consensus 262 ~~G~~~wlP~G~~i~~~ie~~ir~~~--~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~----------- 327 (652) .+|+|=|=|+|..+.++|++.||+.+ + .+..+|.||+|.|.+.|+.|||.|+|.|-|......... T Consensus 29 ~~GfyDYGPlG~~LK~nI~~~Wrk~fV~~~e~~~eIdtpii~p~~V~kASGHvd~FsDplv~c~~c~~~yRADHLiEe~l 108 (558) T COG0423 29 LAGFYDYGPLGVELKNNIKEAWRKSFVTEREDVVEIDTPIILPEEVWKASGHVDKFSDPLVECKKCGERYRADHLIEEYL 108 (558) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHEECCCCEEEECCCCCCCHHHHHHCCCCCCCCCCEEECCCCCHHHHHHHHHHHHH T ss_conf 34653257751899999999999986624687678425544868886434761346562043462453453368889876 Q ss_pred ----------------------------------------------CCCCCEEEECCCCHHHHHHHHHHCCCCHH-HCCH Q ss_conf ----------------------------------------------77782377434312577776520122223-1043 Q gi|254781004|r 328 ----------------------------------------------IKDLRTFALKPMNCPGHVAVFNHGLKSYR-ELPV 360 (652) Q Consensus 328 ----------------------------------------------~~~~~~~~LkPmncp~h~~i~~~~~~SYr-dLPl 360 (652) -..+..-.|||-+-.++..-|++-.++.| .||+ T Consensus 109 ~~~~~~~~~~~e~~~ii~~~~ir~p~~g~~l~~v~~FNLMF~T~IGp~~~~~~YLRPETAQGiFvnFk~l~~~~r~klPF 188 (558) T COG0423 109 GKDGHGNMSPEELTEIIREYDIRCPECGGELNEVREFNLMFKTTIGPVEDSLGYLRPETAQGIFVNFKNLLEFARNKLPF 188 (558) T ss_pred HHCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEEEEEEECCCCCCCEEECCCCCCHHHHHHHHHHHHHCCCCCE T ss_conf 41034557878888888861775887688668750344666731457788632456510021215369999975368984 Q ss_pred HHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHH-------------------H--HH--------------HHH Q ss_conf 200010001278764520001056353344210105376-------------------7--88--------------789 Q gi|254781004|r 361 RLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQ-------------------M--FN--------------ECL 405 (652) Q Consensus 361 rl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq-------------------~--~~--------------e~~ 405 (652) .++|+|+.||||+|| ++||+|+|||+|.++..||.|+. . .+ +.. T Consensus 189 giaQIGKsfRNEISP-r~gl~R~REF~QaEiE~Fv~P~~k~~p~f~~v~~~~l~l~~~~~q~~~~~~EAv~~g~~~n~~~ 267 (558) T COG0423 189 GIAQIGKSFRNEISP-RNGLFRTREFEQAEIEFFVDPEEKEHPKFNEVKDEKLPLLPREAQEEGTEEEAVEEGIVENETL 267 (558) T ss_pred EEEEECHHHCCCCCC-CCCEEEHHHHHHHEEEEEECCCCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHCCEECCHHH T ss_conf 888623101465276-6536651234312013588777556863333321002466477764100345544154504438 Q ss_pred -HHHHHHHHHHHHHCCCCCEECCCCCCCCC-----------------CCCHHHH---HHHHHHHHHHHHHHCCCCCCCCC Q ss_conf -99999887665521332100014653112-----------------5860357---78999999999850212125667 Q gi|254781004|r 406 -KIHNLIISIYKDFGFEKIMVKLSTRPEKR-----------------VGSDALW---DDAENIMKGVLDTIKNSSQDRIN 464 (652) Q Consensus 406 -~~~~~~~~v~~~~g~~~~~~~ls~~~~~~-----------------~g~~~~~---~~ae~~l~~~l~~~~~~~~~~~~ 464 (652) .++.....+...+|+..-.++.-.++..- .|-.|.| +...-.|..-.+..+.+.....+ T Consensus 268 ~y~~~~~~~fl~~lGI~~e~lRfrqh~~~E~AHYa~~twD~E~~~~~~gWiE~~GiAdRtdYDL~~H~k~s~~~l~v~~~ 347 (558) T COG0423 268 AYFIARTKFFLEDLGIDPEKLRFRQHLPEELAHYSKDTWDAEYKFPFGGWIELVGIADRTDYDLSRHSKFSGEDLTVFRE 347 (558) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHCCHHHHHHHHHCCEEEEEECCCCCEEEEEEEECCCCCCHHHHHHHCCCCCEEEEC T ss_conf 99999999999980999898066523908776654402327996378866999986135544556565403666223321 Q ss_pred CC--------CCCCCCCCCCCCCCCCHHH---HCC------CCCCCCCCCHHHHCCCCCCCEEEEECCCEEEEEEEEECC Q ss_conf 52--------0555555455432110021---103------310012110111022346741673179721379964034 Q gi|254781004|r 465 TG--------VLLGEGAFYGPKFEYILKD---AIG------RDWQCGTIQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAV 527 (652) Q Consensus 465 ~~--------~~~g~~afygpkid~~~~d---~~g------r~~~~~tiq~df~~p~~f~l~y~~~dg~~~~pvmihr~~ 527 (652) +. ...-+-.+.||+..-..+. ++. +.-...++-.|.. .+...- ...|++-.|.+|- .+ T Consensus 348 ~~ePk~v~~~~~~~~~~~~gp~~k~~a~~v~~~~se~~~~~~~~~~~~~~~~~~---~i~~~~-~~~~e~~iP~VIE-PS 422 (558) T COG0423 348 YDEPKEVERWHSKVDLKFLGPEFKKDAKKVAEALSELEELEKELNGYEVSKDLV---IIEEVE-KVTGEKYIPHVIE-PS 422 (558) T ss_pred CCCCCEEEEEECCCCHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHH---HHEEEE-ECCCCEECCCEEC-CC T ss_conf 477622331001466210381445566777654456666653047621353100---100110-0247550671565-67 Q ss_pred CHHHHHHHHHHHHHH-----C------CCCCCCCCCEEEEEEECC--HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHH Q ss_conf 103789999999973-----8------708873378089998579--668999999999999879889997589663676 Q gi|254781004|r 528 FGSIERFIGIMIENF-----K------GNLPLWLSPIQAIVTTIT--SSAVEYAQEIANLLKSHHLSIETDFRNETINYK 594 (652) Q Consensus 528 ~Gs~eR~ia~liE~~-----~------g~~P~wLAP~QV~Iipi~--e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~K 594 (652) +| ++|.+.+++||- . .+||++|||+||+|+|+. ++..+.|.+|++.|++.|+.|.+||++ +||++ T Consensus 423 fG-idRi~y~~l~~ay~~ee~~e~R~vLrl~p~lAPikvaVlPL~~k~~l~~~a~~i~~~L~~~~~~v~yDdsG-sIGrR 500 (558) T COG0423 423 FG-IDRIFYALLEHAYTEEEVEEKRIVLRLPPDLAPIKVAVLPLVKKDELVEIAKEIFEKLRELGFNVDYDDSG-SIGRR 500 (558) T ss_pred CC-CHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCEEEEEEEEECCCCHHHHHHHHHHHHHHCCCEEEECCCC-CHHHH T ss_conf 77-11899999987620113566505995586558527999960224214689999999998539669862888-37665 Q ss_pred HHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCEEEECHHHHHHHHHHHC Q ss_conf 665543489889998742532581899988988663342999999998731 Q gi|254781004|r 595 IREHSIKKIPIIIICGDKEASERSIGIRRFGSTTTQKVSLLDAIPILTKES 645 (652) Q Consensus 595 ir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~~i~lee~i~~l~~e~ 645 (652) +|.+|.+|+||+|+|..+.++++|||||.||+++|..++++++...|.+.. T Consensus 501 YrR~DEIGtPfcVTvD~eTleD~tVTiReRDs~~QvRv~i~el~~~l~~~~ 551 (558) T COG0423 501 YRRQDEIGTPFCVTVDFETLEDNTVTIRERDSMEQVRVKIEELADYLRELI 551 (558) T ss_pred HHHCCCCCCCEEEEECCCCCCCCCEEEEECCCHHEEEEEHHHHHHHHHHHH T ss_conf 553344589618995477566784798625761204545999999999984 No 28 >cd00670 Gly_His_Pro_Ser_Thr_tRS_core Gly_His_Pro_Ser_Thr_tRNA synthetase class II core domain. This domain is the core catalytic domain of tRNA synthetases of the subgroup containing glycyl, histidyl, prolyl, seryl and threonyl tRNA synthetases. It is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. These enzymes belong to class II aminoacyl-tRNA synthetases (aaRS) based upon their structure and the presence of three characteristic sequence motifs in the core domain. This domain is also found at the C-terminus of eukaryotic GCN2 protein kinase and at the N-terminus of the ATP phosphoribosyltransferase accessory subunit, HisZ and the accessory subunit of mitochondrial polymerase gamma (Pol gamma b) . Most class II tRNA synthetases are dimers, with this subgroup consisting of mostly homodimers. These enzymes attach a specific amino acid to the 3' OH group of ribose of the appropriate tRNA. Probab=100.00 E-value=2e-40 Score=298.82 Aligned_cols=232 Identities=25% Similarity=0.347 Sum_probs=177.1 Q ss_pred HHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHH Q ss_conf 7899999999999887-689638555432443100011210012555420100145677782377434312577776520 Q gi|254781004|r 272 GWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNH 350 (652) Q Consensus 272 G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~ 350 (652) |++|+++|++++++.+ ++||++|.+|.|.+.++|+.||||++|+++||.+... ..+..+++|+|+||+|+.|+.+|++ T Consensus 1 Ga~L~~aL~~~~~~~~~~~G~~~v~~P~l~~~~~~~~sg~~~~f~~~~y~v~~~-~~~~~~~~~~L~PTsE~~~~~~~~~ 79 (235) T cd00670 1 GTALWRALERFLDDRMAEYGYQEILFPFLAPTVLFFKGGHLDGYRKEMYTFEDK-GRELRDTDLVLRPAACEPIYQIFSG 79 (235) T ss_pred CHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHCCCCCCCCCCEEEEECC-CCCCCCCCEEEEECCCHHHHHHHHC T ss_conf 988999999999999998699899898557899992238675452052898237-7666787389930786999999948 Q ss_pred CCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCC Q ss_conf 12222310432000100012787645200010563533442101053767887899999988766552133210001465 Q gi|254781004|r 351 GLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDFGFEKIMVKLSTR 430 (652) Q Consensus 351 ~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~ 430 (652) ++.||||||+|++|+++|||+|.++ .+||+|+|+|+|+|+|+||+++|+.+++..+++.+..++..||+....+ +... T Consensus 80 ~i~s~~dLPlr~~~~s~~fR~E~~~-~~Gl~Rv~eF~k~E~~~~~~~e~~~~~~~~~~~~~~~i~~~L~l~y~v~-~~~~ 157 (235) T cd00670 80 EILSYRALPLRLDQIGPCFRHEPSG-RRGLMRVREFRQVEYVVFGEPEEAEEERREWLELAEEIARELGLPVRVV-VADD 157 (235) T ss_pred CCCCHHHCCEEEEEECCEECCCCCC-CCCCEEEEEEEEEEEEEEECHHHHHHHHHHHHHHHHHHHHHCCCCEEEE-ECCC T ss_conf 3287687995454456533367899-9998367899985079996889999999999999999999869977999-7586 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHC-CCCCCCCCCC-HHHHCCCCCCC Q ss_conf 311258603577899999999985021212566752055555545543211002110-3310012110-11102234674 Q gi|254781004|r 431 PEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYILKDAI-GRDWQCGTIQ-VDFNLPSRFNA 508 (652) Q Consensus 431 ~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~~~d~~-gr~~~~~tiq-~df~~p~~f~l 508 (652) ++.+.+.... . +.++ ..+.|+.+..-. |+..++++.. .....+.+++. T Consensus 158 ~d~g~~~~~~--~--------------------------~~~a--~~~~d~E~~~p~~~~~~e~~s~S~~~~~q~r~~~~ 207 (235) T cd00670 158 PFFGRGGKRG--L--------------------------DAGR--ETVVEFELLLPLPGRAKETAVGSANVHLDHFGASF 207 (235) T ss_pred CCCCCCCCCC--C--------------------------CCCC--CCCEEEEEEECCCCCEEEEEEEEECCCHHHCCCCC T ss_conf 1337664100--0--------------------------2565--55646999977899467899930257844430253 Q ss_pred EEEEECCCEEEEEEEEECCCHHHHHHHHHHH Q ss_conf 1673179721379964034103789999999 Q gi|254781004|r 509 FYVNSHSEKCHPVMIHRAVFGSIERFIGIMI 539 (652) Q Consensus 509 ~y~~~dg~~~~pvmihr~~~Gs~eR~ia~li 539 (652) .|.+.+|+..+.. .+..|.++|+|+||| T Consensus 208 ~~~~~~~~~~ht~---~~s~~a~~R~l~All 235 (235) T cd00670 208 KIDEDGGGRAHTG---CGGAGGEERLVLALL 235 (235) T ss_pred EEECCCCCEEEEE---CCCCCHHHHHHHHHC T ss_conf 0146999689997---263288999999869 No 29 >cd00768 class_II_aaRS-like_core Class II tRNA amino-acyl synthetase-like catalytic core domain. Class II amino acyl-tRNA synthetases (aaRS) share a common fold and generally attach an amino acid to the 3' OH of ribose of the appropriate tRNA. PheRS is an exception in that it attaches the amino acid at the 2'-OH group, like class I aaRSs. These enzymes are usually homodimers. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. The substrate specificity of this reaction is further determined by additional domains. Intererestingly, this domain is also found is asparagine synthase A (AsnA), in the accessory subunit of mitochondrial polymerase gamma and in the bacterial ATP phosphoribosyltransferase regulatory subunit HisZ. Probab=100.00 E-value=2.9e-38 Score=283.87 Aligned_cols=209 Identities=26% Similarity=0.367 Sum_probs=176.1 Q ss_pred HHHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCC Q ss_conf 99999999998876-89638555432443100011210012555420100145677782377434312577776520122 Q gi|254781004|r 275 IFQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLK 353 (652) Q Consensus 275 i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~ 353 (652) ++++|++++|+.++ +||+||.||.|.+.++|+.||||+++...+ .+..+++|+|+||+||+|+.+|++++ T Consensus 1 ~r~~ie~~~r~~~~~~G~~ev~~P~l~~~~~~~~sg~~~~~~~~~--------~~~~~~~~~L~pt~e~~~~~~~~~~~- 71 (211) T cd00768 1 IRSKIEQKLRRFMAELGFQEVETPIVEREPLLEKAGHEPKDLLPV--------GAENEEDLYLRPTLEPGLVRLFVSHI- 71 (211) T ss_pred CHHHHHHHHHHHHHHCCCEEEECCEECCHHHHHHCCCCHHHHHHH--------CCCCCCEEEEECCCCHHHHHHHHHHH- T ss_conf 978999999999998699899998342799997506762223011--------04679769995168499999975247- Q ss_pred CHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCC Q ss_conf 22310432000100012787645200010563533442101053767887899999988766552133210001465311 Q gi|254781004|r 354 SYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDFGFEKIMVKLSTRPEK 433 (652) Q Consensus 354 SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~ 433 (652) ||||+|++|++++||+|.++ .||+|+|+|+|+|+|+||+++|..+++..++..+.+++..+|+....+.....+ T Consensus 72 --~dLP~r~~~~~~~fR~E~~~--~Gl~RvreF~~~e~~~f~~~~~~~~~~~~~~~~~~~i~~~lgl~~~~~~~~~~~-- 145 (211) T cd00768 72 --RKLPLRLAEIGPAFRNEGGR--RGLRRVREFTQLEGEVFGEDGEEASEFEELIELTEELLRALGIKLDIVFVEKTP-- 145 (211) T ss_pred --HHCCHHHHEECCEECCCCCC--CCCCCCCCEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECC-- T ss_conf --65777864320056057888--884460468898479998848889999999999999999749988869999778-- Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHH-CCCCCCCEEEE Q ss_conf 25860357789999999998502121256675205555554554321100211033100121101110-22346741673 Q gi|254781004|r 434 RVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYILKDAIGRDWQCGTIQVDFN-LPSRFNAFYVN 512 (652) Q Consensus 434 ~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~~~d~~gr~~~~~tiq~df~-~p~~f~l~y~~ 512 (652) ....+++++++.|+.+....|+.+++++++.... .+.+|+++|.+ T Consensus 146 ----------------------------------~~~~~~~a~~~~Die~~~p~~~~~Ev~s~~~~~d~qsr~~~i~y~~ 191 (211) T cd00768 146 ----------------------------------GEFSPGGAGPGFEIEVDHPEGRGLEIGSGGYRQDEQARAADLYFLD 191 (211) T ss_pred ----------------------------------CCCCCCCCEEEEEEEEECCCCCEEEEECEECCCCHHHHHCCCEEEC T ss_conf ----------------------------------6766645235455789837897999935461677878846989999 Q ss_pred ECCCEEEEEEEEECCCHHHHHH Q ss_conf 1797213799640341037899 Q gi|254781004|r 513 SHSEKCHPVMIHRAVFGSIERF 534 (652) Q Consensus 513 ~dg~~~~pvmihr~~~Gs~eR~ 534 (652) +||+...|.|+++|+ +++|+ T Consensus 192 ~dg~~~~~~~~~~gl--~v~Rl 211 (211) T cd00768 192 EALEYRYPPTIGFGL--GLERL 211 (211) T ss_pred CCCCEEEEEEEECEE--CCCCC T ss_conf 999998755630324--13669 No 30 >KOG2298 consensus Probab=100.00 E-value=8.6e-33 Score=246.12 Aligned_cols=382 Identities=19% Similarity=0.229 Sum_probs=235.7 Q ss_pred EEEECC----C--CCCEEEEECCHHHHHHHHHHHHHHHH--HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCC Q ss_conf 044226----7--87435650107899999999999887--689638555432443100011210012555420100145 Q gi|254781004|r 255 LFHIAE----D--GSGVIFWHRKGWKIFQTLISYMRRKI--KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADD 326 (652) Q Consensus 255 lf~~~~----~--~~G~~~wlP~G~~i~~~ie~~ir~~~--~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~ 326 (652) .||+++ . -+|+|=+-|.|..+.++|.+.||+++ ..+-.||-.|.+.|....+.|||.|+|.+=|..-..-.. T Consensus 22 rfFy~~sf~IYgGVsGLyD~GP~Gcalk~Nil~~WRkhFilEE~MlEvdct~ltP~~VlkaSGHVdkF~D~mvkD~ktGe 101 (599) T KOG2298 22 RFFYDPSFAIYGGVSGLYDFGPPGCALKSNILSLWRKHFILEEDMLEVDCTMLTPEPVLKASGHVDKFADWMVKDEKTGE 101 (599) T ss_pred CEEECCCHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCEEECCCCCCCHHHHHCCCCHHHHHHHHHCCCCCCC T ss_conf 22125404653473322125998605677699999998741302044016757808876346646666677740755652 Q ss_pred --------------------------------------------------------------------------CCCCCC Q ss_conf --------------------------------------------------------------------------677782 Q gi|254781004|r 327 --------------------------------------------------------------------------TIKDLR 332 (652) Q Consensus 327 --------------------------------------------------------------------------~~~~~~ 332 (652) ....+- T Consensus 102 cfRADHLvk~~~~rl~~~~~~~~~~e~e~iLa~~d~~s~~el~~~~~kyni~sP~tgn~Ls~p~~FNLMF~T~IGpsG~~ 181 (599) T KOG2298 102 CFRADHLVKDAEERLKKKASAEVKAEMEKILAKLDGYSGQELGELISKYNIKSPVTGNDLSEPRQFNLMFETQIGPSGGL 181 (599) T ss_pred EEHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCC T ss_conf 22056788999986201122677888999998750787588888998615789876887677711110011121677785 Q ss_pred EEEECCCCHHHHHHHHHHCCC-CHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHH-HHHHHHHHHH Q ss_conf 377434312577776520122-22310432000100012787645200010563533442101053767-8878999999 Q gi|254781004|r 333 TFALKPMNCPGHVAVFNHGLK-SYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQM-FNECLKIHNL 410 (652) Q Consensus 333 ~~~LkPmncp~h~~i~~~~~~-SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~-~~e~~~~~~~ 410 (652) .-.|||-+-.++.+-|+.-.- .-+.||+--+|+|..||||+|| |.||+|||||+|.+.+-|+.|+.- ...+..+... T Consensus 182 kgyLRPETAQG~FlNFkrlle~N~~KlPFA~AqiG~~fRNEISp-RsGLlRvrEF~maEIEHFvdP~~K~h~kF~~V~~~ 260 (599) T KOG2298 182 KGYLRPETAQGQFLNFKRLLEFNQGKLPFASAQIGKSFRNEISP-RSGLLRVREFTMAEIEHFVDPLLKSHPKFSLVAAE 260 (599) T ss_pred CCCCCCCCCCCCCCCHHHHHHHCCCCCCCHHHHHCHHHHHCCCC-CCCCEEEEEEEHHHHHCCCCCCCCCCHHHHHHHHH T ss_conf 54327400123101489998752887763488746576513375-55743678754677653489777787366655431 Q ss_pred HHH-----------------------------------------HHHHHCCCCCEECCCCCCCCCCCCHHHHHHHHHHH- Q ss_conf 887-----------------------------------------66552133210001465311258603577899999- Q gi|254781004|r 411 IIS-----------------------------------------IYKDFGFEKIMVKLSTRPEKRVGSDALWDDAENIM- 448 (652) Q Consensus 411 ~~~-----------------------------------------v~~~~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l- 448 (652) -.. ....+|+....++...+- +.++..+|..-+ T Consensus 261 ~l~l~~~~~q~~g~~a~~~~lgEaV~kg~V~netlGyfi~Ri~~fL~~lGid~~rlRFRqH~-----~nEMAHYA~DCWD 335 (599) T KOG2298 261 KLRLFPRDKQLSGQSAQKVALGEAVAKGTVNNETLGYFIGRIYLFLNKLGIDKERLRFRQHM-----ANEMAHYAFDCWD 335 (599) T ss_pred HHHHCCHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHCCHHHHH-----HHHHHHHHCCCCC T ss_conf 55423323443465056747989987604453036788999999999847655321067776-----5333333015322 Q ss_pred -----------------HHHHHHHCCCCCCC--------------CCCCCCCCCCCCCCCCCCCC---HHHHCCCCCCCC Q ss_conf -----------------99998502121256--------------67520555555455432110---021103310012 Q gi|254781004|r 449 -----------------KGVLDTIKNSSQDR--------------INTGVLLGEGAFYGPKFEYI---LKDAIGRDWQCG 494 (652) Q Consensus 449 -----------------~~~l~~~~~~~~~~--------------~~~~~~~g~~afygpkid~~---~~d~~gr~~~~~ 494 (652) +.+.+........+ ++....+-.....|+++.-. +..++...++.- T Consensus 336 aEi~tSYGWIEcVG~ADRs~yDL~~Hs~~t~~~Lv~~~kl~ePkeve~~~i~~~kk~~g~~fk~~ak~v~~~l~~~s~~e 415 (599) T KOG2298 336 AEIKTSYGWIECVGCADRAAYDLSTHSRASKVWLVAELRLREPKEVEFAVIPPNKKELGCAFKKDAKGVNETLIFPSGKE 415 (599) T ss_pred HHHHHCCCCEEEEECCCHHHEEEECCCCCCCCCEEEHHHCCCCCEEEEEEECCCHHHCCHHHHHCCCCHHHHHHCCCHHH T ss_conf 32421027578741043544233247655677655111048860688988644244306123413510357760587889 Q ss_pred CCCHHHHCC--CCCCCEEEEEC------CC-EEE-------EEEEEECCCHHHHHHHHHHHHHH---C-------CCCCC Q ss_conf 110111022--34674167317------97-213-------79964034103789999999973---8-------70887 Q gi|254781004|r 495 TIQVDFNLP--SRFNAFYVNSH------SE-KCH-------PVMIHRAVFGSIERFIGIMIENF---K-------GNLPL 548 (652) Q Consensus 495 tiq~df~~p--~~f~l~y~~~d------g~-~~~-------pvmihr~~~Gs~eR~ia~liE~~---~-------g~~P~ 548 (652) .+..--++. ..+.+...+.+ .. .+. |-+| ..+|| +-|++..|+||. . -.||+ T Consensus 416 ~~~~~~~L~~~g~~~~~v~~l~k~~v~i~~~~k~~hV~e~~P~VI-EPSfG-iGRI~Y~l~EHsF~R~~de~R~~~sfpp 493 (599) T KOG2298 416 LIETLENLGDHGLLHMYVVNLSKLHVRIKRKTKVPHVREVNPDVI-EPSFG-IGRISYQLTEHSFTRKGDEQRKVLSFPP 493 (599) T ss_pred HHHHHHHHHCCCCEEEEEEECCCCEEEEEEEEEEEEEEEECCCCC-CCCCH-HHHHHHHHHHHHHCCCCCCCCCEECCCC T ss_conf 999999762177389998740321045520242245786168755-45400-4678899876551567654564331476 Q ss_pred CCCCEEEEEEECC--HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCC Q ss_conf 3378089998579--66899999999999987988999758966367666554348988999874253258189998898 Q gi|254781004|r 549 WLSPIQAIVTTIT--SSAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGS 626 (652) Q Consensus 549 wLAP~QV~Iipi~--e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~ 626 (652) -+||++|.|.|++ ++..+++++|.+.|.++|+.+.+||++-|||+|++..|.+|||+-|+|....+++++||+|.||+ T Consensus 494 ~vAPiK~~v~pls~n~ef~pv~~~ls~~L~~~gis~kvdds~~SIGrrYAr~DElgipFgiTvDfdtlk~~tVTlReRdS 573 (599) T KOG2298 494 LVAPIKVALDPLSPNLEFRPVCQGLSNELTENGISVKVDDSSSSIGRRYARTDELGIPFGVTVDFDTLKNGTVTLRERDS 573 (599) T ss_pred CCCCEEEEEEECCCCHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHCCCCCCCCEEEEECHHHHCCCEEEEEECCC T ss_conf 53634899972367325789999999999867876996577874765321355336743799744241376079861542 Q ss_pred CCEEEECHHHHHHHHHHH Q ss_conf 866334299999999873 Q gi|254781004|r 627 TTTQKVSLLDAIPILTKE 644 (652) Q Consensus 627 ~~~~~i~lee~i~~l~~e 644 (652) +.|..+++++++..+.+- T Consensus 574 ~~QvR~~i~e~~s~v~~~ 591 (599) T KOG2298 574 TMQVRMHISKLKSFLIKY 591 (599) T ss_pred HHHHHHHHHHHHHHHHHH T ss_conf 887876399899999999 No 31 >pfam00587 tRNA-synt_2b tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synthetases. Probab=100.00 E-value=2.6e-34 Score=256.64 Aligned_cols=162 Identities=33% Similarity=0.574 Sum_probs=144.3 Q ss_pred HHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCC Q ss_conf 9999999999887-689638555432443100011210012555420100145677782377434312577776520122 Q gi|254781004|r 275 IFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLK 353 (652) Q Consensus 275 i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~ 353 (652) ++++|++++++.+ ++||++|.||+|.+.++|+.||||++|.++||.+. ++++++++|+||+|++++.+|++..+ T Consensus 1 l~~~l~~~~~~~~~~~G~~~i~tP~l~~~~~~~~sg~~~~~~~~my~~~-----d~~~~~l~Lrp~~~~~~~~~~~~~~~ 75 (170) T pfam00587 1 LRNALENFIRDLFKRYGYQEVDTPILEPKELWEGSGHWDDYFDEMYRFK-----DRGGEELYLRPTAEVGITRLFKNEIL 75 (170) T ss_pred CHHHHHHHHHHHHHHCCCEEEECCEEEEHHHHHHCCCCCCCHHHEEEEE-----CCCCCEEEEECCCCHHHHHHHHHHCC T ss_conf 9799999999999985998999991756789834477655753125673-----58997688732574588899886200 Q ss_pred CHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCC- Q ss_conf 2231043200010001278764520001056353344210105376788789999998876655213321000146531- Q gi|254781004|r 354 SYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDFGFEKIMVKLSTRPE- 432 (652) Q Consensus 354 SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~- 432 (652) ||++||+|+|++|++||+|.++ +.||+|+|||+|.|+|+||+++|+.++...++.++.++++.+|+. +.+.++.+.+ T Consensus 76 ~~~~lP~r~~~~~~~fR~E~~~-~~Gl~R~reF~q~d~~~~~~~e~s~~~~~e~~~~~~~~~~~lgl~-~~i~~~~~g~l 153 (170) T pfam00587 76 SYRDLPLKLYQIGPCFRYEARP-RRGLGRVREFTQVDAEIFGTPEQSEEEDEELLKLAEEILQDLGLP-YRVVLNTRGDL 153 (170) T ss_pred CCCCCCCEEECCCCCCCCCCCC-CCCCCEEEEEEEECEEEEECCCHHHHHHHHHHHHHHHHHHHCCCC-EEEEECCCCCC T ss_conf 3235860133065300677776-667743678898047999670042999999999999999987996-79997267778 Q ss_pred ----CCCCCHHHHHH Q ss_conf ----12586035778 Q gi|254781004|r 433 ----KRVGSDALWDD 443 (652) Q Consensus 433 ----~~~g~~~~~~~ 443 (652) .+.++.+.|.. T Consensus 154 ~~~~~~~~d~E~w~~ 168 (170) T pfam00587 154 GGYASKTGDLEAWLP 168 (170) T ss_pred CCHHHHCCCCCEECC T ss_conf 986786078520578 No 32 >cd00858 GlyRS_anticodon GlyRS Glycyl-anticodon binding domain. GlyRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only. Probab=99.95 E-value=3.7e-27 Score=207.22 Aligned_cols=99 Identities=21% Similarity=0.270 Sum_probs=95.7 Q ss_pred CCCCCCCCEEEEEEECC--HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEE Q ss_conf 08873378089998579--6689999999999998798899975896636766655434898899987425325818999 Q gi|254781004|r 545 NLPLWLSPIQAIVTTIT--SSAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIR 622 (652) Q Consensus 545 ~~P~wLAP~QV~Iipi~--e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir 622 (652) +||+||||+||+|+||+ +++.+||++|++.|+++|+||.+|+|+ ++|+|||+|+++||||+|+||++++++|+||+| T Consensus 19 ~lP~~lAP~qv~IiPi~~~d~~~~~a~~i~~~L~~~gi~v~~Dd~~-s~G~K~~~~d~~giP~~ivIg~~~le~~~VtiR 97 (121) T cd00858 19 RLPPALAPIKVAVLPLVKRDELVEIAKEISEELRELGFSVKYDDSG-SIGRRYARQDEIGTPFCVTVDFDTLEDGTVTIR 97 (121) T ss_pred ECCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCC-CHHHHHHHHHHHCCCEEEEEECCEECCCEEEEE T ss_conf 8788638837999974160777999999999998779779970388-831444678874898399992761037879999 Q ss_pred ECCCCCEEEECHHHHHHHHHHH Q ss_conf 8898866334299999999873 Q gi|254781004|r 623 RFGSTTTQKVSLLDAIPILTKE 644 (652) Q Consensus 623 ~R~~~~~~~i~lee~i~~l~~e 644 (652) +|+++++..|+++++++.+++. T Consensus 98 ~Rdt~~q~~v~i~el~~~i~~l 119 (121) T cd00858 98 ERDSMRQVRVKIEELPSYLREL 119 (121) T ss_pred ECCCCCEEEEEHHHHHHHHHHH T ss_conf 9899957899999999999998 No 33 >CHL00201 syh histidine-tRNA synthetase; Provisional Probab=99.95 E-value=8.9e-25 Score=190.79 Aligned_cols=362 Identities=19% Similarity=0.239 Sum_probs=226.6 Q ss_pred CCCEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCC--CCCCCCCCEEECCCCCCCCCCEEEECC Q ss_conf 87435650107899999999999887-689638555432443100011210--012555420100145677782377434 Q gi|254781004|r 262 GSGVIFWHRKGWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHW--DSYRANMFAVKCADDTIKDLRTFALKP 338 (652) Q Consensus 262 ~~G~~~wlP~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~--~~y~~~mf~~~~~~~~~~~~~~~~LkP 338 (652) -.|+-=|+|.-..+++.|++.+++-. .+||++|.||++=..+++..++-- |-...+||.+. |+.++.++||| T Consensus 7 ~kG~rD~lp~e~~~~~~i~~~i~~~f~~~Gf~~I~TP~lE~~e~~~~~~g~~~d~i~k~~y~f~-----D~~g~~l~LRp 81 (424) T CHL00201 7 LRGTKDILPDEIEYWQFIHEKAATLLKLANYQEIRTPIFENTELFDRGIGEETDIVNKEMYRFF-----DRGNRSITLRP 81 (424) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHCCCCCCCCHHHHHHHEEEE-----CCCCCEEEECC T ss_conf 9999878999999999999999999998599250475404198862357874205666532545-----59998888678 Q ss_pred CCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 31257777652012222310432000100012787645200010563533442101053767887899999988766552 Q gi|254781004|r 339 MNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDF 418 (652) Q Consensus 339 mncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~ 418 (652) =--+..+-++..........|+|+|++|.+||+|... .| |-|||+|-++.+|=.++.. .-..++.+..+++..+ T Consensus 82 D~T~piaR~~~~~~~~~~~~P~r~~y~g~vfR~e~pq--~G--R~REF~Q~gvEiiG~~~~~--aDaEvi~l~~~~l~~l 155 (424) T CHL00201 82 EGTASIVRAFIENRMAYHMRPQRLWYSGPMFRYERPQ--SG--RQRQFHQLGIEFIGSSDPR--ADMEVIHLAMTLFNEL 155 (424) T ss_pred CCHHHHHHHHHHCCCCCCCCCEEEEEECCEEECCCCC--CC--CCCHHHHCCEEEECCCCHH--HHHHHHHHHHHHHHHC T ss_conf 7709999999974765579987899875478058987--88--6421041462000689828--7899999999999865 Q ss_pred CCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHH----HHCCCCCCCCCCC----CC---CCCCCC--CCCCCCCCH-H Q ss_conf 13321000146531125860357789999999998----5021212566752----05---555554--554321100-2 Q gi|254781004|r 419 GFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLD----TIKNSSQDRINTG----VL---LGEGAF--YGPKFEYIL-K 484 (652) Q Consensus 419 g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~----~~~~~~~~~~~~~----~~---~g~~af--ygpkid~~~-~ 484 (652) |+..+.+.+.... ..+.-..+...+.+.+. .........+... .. ....+. -.|.+.... . T Consensus 156 gl~~~~i~in~~g-----~~~~~~~~~~~l~~~l~~~~~~l~~~~~~~l~~~~~~~l~~~~~~~~~~l~~~~~l~~~~~~ 230 (424) T CHL00201 156 PVHRLVLDINSIG-----KMEDRALYRDKLTEYLSQYNQDLDEDSQNRLSSNPLRILDSKNGATQEILDMAPLLSDFLSL 230 (424) T ss_pred CCCCEEHHCCCCC-----CHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHCCHHHHHHHHCHHHHHHHCH T ss_conf 9652010003325-----59988889999999999987543778775321148999873060479998644320102048 Q ss_pred HHC-------CCCCC-CCCCCHHHHCCCCC--------CC---------------EE---EEECCCEEEEEEEEECCCHH Q ss_conf 110-------33100-12110111022346--------74---------------16---73179721379964034103 Q gi|254781004|r 485 DAI-------GRDWQ-CGTIQVDFNLPSRF--------NA---------------FY---VNSHSEKCHPVMIHRAVFGS 530 (652) Q Consensus 485 d~~-------gr~~~-~~tiq~df~~p~~f--------~l---------------~y---~~~dg~~~~pvmihr~~~Gs 530 (652) .+. ..... .-.+++|+.+...+ .. -| +..=|.+..|-+ |.--+ T Consensus 231 ~~~~~l~~l~~~l~~~~v~~~~D~slvRGLdYYTG~VFE~~~~~~~~~~si~gGGRYD~Lv~~fgg~~~Pav---GfaiG 307 (424) T CHL00201 231 ESEEHFADVCEYLANLNIPYKINPKLVRGLDYYNDTAFEIKTLGSDAQDTVCGGGRYDSLVEQLGGPKTPAV---GWAIG 307 (424) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCHHHHCCCCCCCCCEEEEEECCCCCCCCCCCCCCCHHHHHHCCCCCCCEE---EEEEC T ss_conf 889999999999984387645287771576764573699996788862342378770778997299999857---78843 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEEC Q ss_conf 78999999997387088733780899985796689999999999998798899975896636766655434898899987 Q gi|254781004|r 531 IERFIGIMIENFKGNLPLWLSPIQAIVTTITSSAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIICG 610 (652) Q Consensus 531 ~eR~ia~liE~~~g~~P~wLAP~QV~Iipi~e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG 610 (652) +||++.++ |. .+.+| -.+.+|.|++.+++...+|-+|.+.|+++|+.|+++-.+.++++.++.|+..|+||+|||| T Consensus 308 ieRl~~~l-~~-~~~~~--~~~~~~~v~~~~~~~~~~~~~ia~~LR~~gi~~e~~~~~~~l~kql~~A~k~~~~~~iiiG 383 (424) T CHL00201 308 IERLLLIA-KD-NILLP--KPSIDFYIATKGNVAIKIAFSIQRFLHNQGLKSELDLSSSNFSKQIKQAAKKRALACIILG 383 (424) T ss_pred HHHHHHHH-HH-CCCCC--CCCCCEEEEECCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHCCCCEEEEEC T ss_conf 99999999-73-37777--7788589997784999999999999998899399970899999999999987999999985 Q ss_pred HHHHHCCCEEEEECCCCCEE---EECHHHHHHHHHHHCC Q ss_conf 42532581899988988663---3429999999987314 Q gi|254781004|r 611 DKEASERSIGIRRFGSTTTQ---KVSLLDAIPILTKESL 646 (652) Q Consensus 611 ~ke~e~~~Vtir~R~~~~~~---~i~lee~i~~l~~e~~ 646 (652) ++|+++++|+||+-.+++|+ .++++|++..|++..+ T Consensus 384 ~~E~~~~~v~lK~l~sg~Q~~~~~~~l~~~i~~lkk~~~ 422 (424) T CHL00201 384 DNEIARGAITIKWLDSQIQENSQYLNFKDFISYLKKKIL 422 (424) T ss_pred CHHHHCCEEEEEECCCCCCCCEECCCHHHHHHHHHHHHH T ss_conf 158868968998689988752302479999999999872 No 34 >COG0124 HisS Histidyl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=99.94 E-value=5e-24 Score=185.61 Aligned_cols=365 Identities=19% Similarity=0.252 Sum_probs=233.7 Q ss_pred CCCEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCC-CCCCCC-CCCCCCEEECCCCCCCCCCEEEECC Q ss_conf 87435650107899999999999887-68963855543244310001-121001-2555420100145677782377434 Q gi|254781004|r 262 GSGVIFWHRKGWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQ-SGHWDS-YRANMFAVKCADDTIKDLRTFALKP 338 (652) Q Consensus 262 ~~G~~~wlP~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~-SGh~~~-y~~~mf~~~~~~~~~~~~~~~~LkP 338 (652) =.|+.=|+|.=...++.|++.+|+-. ++||.||.||++=.++|+.. +|--.. -..+||.+. |+.++.++||| T Consensus 7 prG~~D~lp~d~~~~~~i~~~~~~v~~~yGf~eI~TPifE~telf~r~~Ge~td~v~kemY~F~-----Dkggr~laLRp 81 (429) T COG0124 7 PRGTRDFLPEDMALREYIESTIRKVFESYGFSEIRTPIFEYTELFARKSGEETDVVEKEMYTFK-----DKGGRSLALRP 81 (429) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEECCCCCCCCHHHHHHCCCCCCCCCCCCEEEEE-----ECCCCEEEECC T ss_conf 9986655867789999999999999998099765575511166762016974333321028999-----48998898446 Q ss_pred CCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 31257777652012222310432000100012787645200010563533442101053767887899999988766552 Q gi|254781004|r 339 MNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDF 418 (652) Q Consensus 339 mncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~ 418 (652) ---....-.+..+...+. .|+|+|.+|.+||+|... =-|-|+|.|-|..+|=.++ ...-..++.+...++..+ T Consensus 82 e~Tapv~R~~~en~~~~~-~p~k~yy~g~vfRyErPQ----~GR~RqF~Q~g~E~iG~~~--~~~DAEvi~l~~~~l~~l 154 (429) T COG0124 82 ELTAPVARAVAENKLDLP-KPLKLYYFGPVFRYERPQ----KGRYRQFYQFGVEVIGSDS--PDADAEVIALAVEILEAL 154 (429) T ss_pred CCCHHHHHHHHHCCCCCC-CCEEEEEECCEECCCCCC----CCCCEEEEECCEEEECCCC--CCCCHHHHHHHHHHHHHC T ss_conf 674999999995622036-872599833562378888----8875036876767767998--653899999999999974 Q ss_pred CCCCCEECCCCCCCC-----CCCCH--HH----HHHHHHHHHHHHHHHCCCC--CCCCCCCCCCCC--CCCC--CCCC-C Q ss_conf 133210001465311-----25860--35----7789999999998502121--256675205555--5545--5432-1 Q gi|254781004|r 419 GFEKIMVKLSTRPEK-----RVGSD--AL----WDDAENIMKGVLDTIKNSS--QDRINTGVLLGE--GAFY--GPKF-E 480 (652) Q Consensus 419 g~~~~~~~ls~~~~~-----~~g~~--~~----~~~ae~~l~~~l~~~~~~~--~~~~~~~~~~g~--~afy--gpki-d 480 (652) |+..+.+.+..+..- +.|-+ +. -+.-+..+...|....... ...+.......+ .++. -|.+ + T Consensus 155 Gi~~~~l~iN~~g~l~~~~~~~gi~~~~~l~~~ldk~~k~~~~~L~e~~~~r~~~n~lr~ld~k~~~~~~~~~~ap~i~~ 234 (429) T COG0124 155 GIGGFTLEINSRGILEGRLEYLGIDQREALLRYLDKLDKIGKLELDEDSKRRLKTNPLRVLDSKKDSDQELLKNAPELLD 234 (429) T ss_pred CCCCEEEEECCCCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHCCCHHHHHHHCCHHHHH T ss_conf 99867999817644788877506116899999974654678987606655343114489998616328999852077640 Q ss_pred C-------------CHHHHCCCCCC--CC-CCCHHHHCCCCCCCEEEE------------------ECCCEEEEEEEEEC Q ss_conf 1-------------00211033100--12-110111022346741673------------------17972137996403 Q gi|254781004|r 481 Y-------------ILKDAIGRDWQ--CG-TIQVDFNLPSRFNAFYVN------------------SHSEKCHPVMIHRA 526 (652) Q Consensus 481 ~-------------~~~d~~gr~~~--~~-tiq~df~~p~~f~l~y~~------------------~dg~~~~pvmihr~ 526 (652) + ..-+++|.... .+ .-.+|+....-|...... .=|....|=++ .| T Consensus 235 ~~~~e~~~~~~~v~~~L~~~g~~~~id~~lvRGLDYYtg~VFE~~~~~~~~~~sI~gGGRYD~Lv~~~gG~~~pavG-Fa 313 (429) T COG0124 235 YLDEESLEHLEELLALLDALGISYEIDPSLVRGLDYYTGTVFEAVTDGLGAQGSVCGGGRYDGLVEEFGGKPTPAVG-FA 313 (429) T ss_pred HCCHHHHHHHHHHHHHHHHCCCCEEECCCEECCHHHCCCEEEEEEECCCCCCCCEECCCCCHHHHHHHCCCCCCCEE-EE T ss_conf 11288999999999999983997798663003613206668999976886443155276615899984899887136-75 Q ss_pred CCHHHHHHHHHHHHHHCCCCCCC-CCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCE Q ss_conf 41037899999999738708873-37808999857966899999999999987988999758966367666554348988 Q gi|254781004|r 527 VFGSIERFIGIMIENFKGNLPLW-LSPIQAIVTTITSSAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPI 605 (652) Q Consensus 527 ~~Gs~eR~ia~liE~~~g~~P~w-LAP~QV~Iipi~e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~ 605 (652) + ++||++.+|-+... +.+ -+|.+|.|+|+.+....+|-++.+.|+++|+.|++|-.+.++++.++.|+..|+++ T Consensus 314 i--GveRl~~~l~~~~~---~~~~~~~~~v~v~~~~~~~~~~a~~la~~LR~~g~~~~~~~~~r~~k~q~k~A~~~g~~~ 388 (429) T COG0124 314 I--GVERLILALEEEGK---EDPVETRVDVYVVPLGEDAEPEALKLAQKLRAAGISVEVDYSGRKLKKQFKYADKLGARF 388 (429) T ss_pred H--HHHHHHHHHHHCCC---CCCCCCCCCEEEEECCCHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHCCCCE T ss_conf 1--79999999997188---777677788899984861599999999999975994899855656899999998779988 Q ss_pred EEEECHHHHHCCCEEEEECCCCCEEEECHHHHHHHHHHH Q ss_conf 999874253258189998898866334299999999873 Q gi|254781004|r 606 IIICGDKEASERSIGIRRFGSTTTQKVSLLDAIPILTKE 644 (652) Q Consensus 606 ~ivIG~ke~e~~~Vtir~R~~~~~~~i~lee~i~~l~~e 644 (652) .|+||++|+++++|+|++-.++++..++++++.+.+... T Consensus 389 ~viiGe~E~~~g~v~vKdl~t~eq~~v~~~~~~~~~~~~ 427 (429) T COG0124 389 AVILGEDELANGVVTVKDLATGEQEEVPLDELVEELKEL 427 (429) T ss_pred EEEECCHHHHCCCEEEEECCCCCCCEECHHHHHHHHHHH T ss_conf 999852588659789734777863334499999998742 No 35 >PRK12420 histidyl-tRNA synthetase; Provisional Probab=99.91 E-value=5.9e-22 Score=171.35 Aligned_cols=350 Identities=15% Similarity=0.204 Sum_probs=209.2 Q ss_pred CCCCEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCC-CCCCCCCCCCCEEECCCCCCCCCCEEEECC Q ss_conf 787435650107899999999999887-689638555432443100011-210012555420100145677782377434 Q gi|254781004|r 261 DGSGVIFWHRKGWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQS-GHWDSYRANMFAVKCADDTIKDLRTFALKP 338 (652) Q Consensus 261 ~~~G~~~wlP~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~S-Gh~~~y~~~mf~~~~~~~~~~~~~~~~LkP 338 (652) --.|+-=++|.=...++.|++.+++.. ++||+||.||++=..+|+..+ |.-+.-..+||.+. |+.++.++||| T Consensus 6 ~~rG~rD~lp~e~~~~~~i~~~~~~~~~~~Gy~~I~tP~~E~~elf~~~~g~~~~i~kemy~f~-----D~~gr~l~LRP 80 (421) T PRK12420 6 NVKGTKDYLPEEQVLRNKIKRACEDTFERYGCKPLETPTLNMYELMSYKYGGGDEILKEIYTLQ-----DQGKRDLALRY 80 (421) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCHHHHHCCCCCCCCHHHHHEEEEE-----CCCCCEEEECC T ss_conf 9998887899999999999999999999859947247650118764244788442241107888-----18998798567 Q ss_pred CCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHH-HHHHHHHHHHHHHHHHHHH Q ss_conf 312577776520122223104320001000127876452000105635334421010537-6788789999998876655 Q gi|254781004|r 339 MNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKE-QMFNECLKIHNLIISIYKD 417 (652) Q Consensus 339 mncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~e-q~~~e~~~~~~~~~~v~~~ 417 (652) =--+..+-++... .. -.+|+|+|.+|.+||+|. |. .| |-|||+|-+..+|=.++ .+.. .++.+..++++. T Consensus 81 e~Ta~iaR~~a~~-~~-~~~P~k~~y~g~vfR~er-pq-~G--R~ReF~Q~GvEiiG~~~~~aDa---EiI~la~~~l~~ 151 (421) T PRK12420 81 DLTIPFAKVVAMN-PN-IRLPFKRYEIGKVFRDGP-IK-QG--RFREFIQCDVDIVGVESVMAEA---ELMSMAFELFRT 151 (421) T ss_pred CCCHHHHHHHHHC-CC-CCCCEEEEEECCEEECCC-CC-CC--CCCEEEECCEEEECCCCHHHHH---HHHHHHHHHHHH T ss_conf 7658999999977-57-778745888767785389-87-88--5551343575420788778899---999999999996 Q ss_pred HCCCCCEECCCCCCC-----CCCCCHHHHH-HH-----------HHHHHHHHHHHC--------------CCCCCCCC-C Q ss_conf 213321000146531-----1258603577-89-----------999999998502--------------12125667-5 Q gi|254781004|r 418 FGFEKIMVKLSTRPE-----KRVGSDALWD-DA-----------ENIMKGVLDTIK--------------NSSQDRIN-T 465 (652) Q Consensus 418 ~g~~~~~~~ls~~~~-----~~~g~~~~~~-~a-----------e~~l~~~l~~~~--------------~~~~~~~~-~ 465 (652) +|++ +.+.++.+.. ...|...... .. ...+...+...+ ......+. . T Consensus 152 lgl~-~~i~i~~~~l~~~ll~~~~~~~e~~~~~~~~ldk~~k~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (421) T PRK12420 152 LNLE-VTIQYNNRKLLNGILQAINIPTELTSDVILSLDKIEKIGIDGVRKDVLERGISEEMADTICNTVLSCLQLSIADF 230 (421) T ss_pred CCCC-EEEEECCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHH T ss_conf 5986-599955299999888870999999999999988765316788999988668888999879998735680138998 Q ss_pred CCCCCC-------C---------CCCCC--CCCCCHHHHCCCCCCCCCCCHHH------H----CCCCCCCEEEEECCCE Q ss_conf 205555-------5---------54554--32110021103310012110111------0----2234674167317972 Q gi|254781004|r 466 GVLLGE-------G---------AFYGP--KFEYILKDAIGRDWQCGTIQVDF------N----LPSRFNAFYVNSHSEK 517 (652) Q Consensus 466 ~~~~g~-------~---------afygp--kid~~~~d~~gr~~~~~tiq~df------~----~p~~f~l~y~~~dg~~ 517 (652) ...... . .-+|. ++.+...-.-|-....+|+.--+ . ---|+|--...-.|.. T Consensus 231 ~~~~~~~~~~e~~~~l~~l~~~L~~~gi~~~i~~D~~lVRGLdYYTg~VFE~~~~~~~~~~~i~gGGRYD~Li~~fgg~~ 310 (421) T PRK12420 231 KEAFNNPLVADGVNELQQLQQYLIALGINENTIFNPFLARGLTMYTGTVYEIFLKDGSITSSIGSGGRYDNIIGAFRGDN 310 (421) T ss_pred HHHHCCHHHHHHHHHHHHHHHHHHHCCCCCEEEECHHHHCCCCCEEEEEEEEEECCCCCCCCCCCCCCHHHHHHHHCCCC T ss_conf 63205855599999999999999976999518776767416664200589999558764434435665054898818998 Q ss_pred -EEEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCHHHHHHHHHHHHHHHHC-CCEEEEECCCCCCCHHH Q ss_conf -1379964034103789999999973870887337808999857966899999999999987-98899975896636766 Q gi|254781004|r 518 -CHPVMIHRAVFGSIERFIGIMIENFKGNLPLWLSPIQAIVTTITSSAVEYAQEIANLLKSH-HLSIETDFRNETINYKI 595 (652) Q Consensus 518 -~~pvmihr~~~Gs~eR~ia~liE~~~g~~P~wLAP~QV~Iipi~e~~~eya~~i~~~L~~~-girV~~Ddr~~~~G~Ki 595 (652) ..|-+ |.--++||++.+|-+... . -++..|.|+|+..+. .|-.+.+.|+++ |++|++|..+.++++.+ T Consensus 311 ~~~Pav---Gfa~Gierl~~~L~~~~~--~---~~~~~v~v~~~~~~~--~a~~ia~~LR~~~gi~~~~~~~~~~lkkql 380 (421) T PRK12420 311 MNYPTV---GISFGLDVIYTALSQKET--I---SSTADVFIIPLGTEL--QCLQIAQQLRSTTSLKVELELAGRKLKRAL 380 (421) T ss_pred CCCCEE---EEEEHHHHHHHHHHCCCC--C---CCCCCEEEEECCCHH--HHHHHHHHHHHCCCCEEEEECCCCCHHHHH T ss_conf 999747---998519999999855045--7---988988999787789--999999999845996699965999999999 Q ss_pred HHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCEEEECHH Q ss_conf 6554348988999874253258189998898866334299 Q gi|254781004|r 596 REHSIKKIPIIIICGDKEASERSIGIRRFGSTTTQKVSLL 635 (652) Q Consensus 596 r~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~~i~le 635 (652) +.|+..|++|.||||++|+++|+|+||+-.++++++|+++ T Consensus 381 k~A~k~~~~~~iiiG~~E~~~~~v~lK~l~tg~q~~v~l~ 420 (421) T PRK12420 381 NYANKENIPYVLIIGEEELSTETVMLRNMKEGSEVKVPLS 420 (421) T ss_pred HHHHHCCCCEEEEECCHHHHCCEEEEEECCCCCEEEEECC T ss_conf 9998879999999876388789789998899973798288 No 36 >cd00862 ProRS_anticodon_zinc ProRS Prolyl-anticodon binding domain, long version found predominantly in eukaryotes and archaea. ProRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only, and an additional C-terminal zinc-binding domain specific to this subfamily of aaRSs. Probab=99.90 E-value=3.6e-23 Score=179.69 Aligned_cols=101 Identities=24% Similarity=0.393 Sum_probs=94.8 Q ss_pred CCCCCCCCCCEEEEEEECC------HHHHHHHHHHHHHHHHCCCEEEEECCCC-CCCHHHHHHHHCCCCEEEEECHHHHH Q ss_conf 8708873378089998579------6689999999999998798899975896-63676665543489889998742532 Q gi|254781004|r 543 KGNLPLWLSPIQAIVTTIT------SSAVEYAQEIANLLKSHHLSIETDFRNE-TINYKIREHSIKKIPIIIICGDKEAS 615 (652) Q Consensus 543 ~g~~P~wLAP~QV~Iipi~------e~~~eya~~i~~~L~~~girV~~Ddr~~-~~G~Kir~a~l~giP~~ivIG~ke~e 615 (652) |.+||+.|||+||+||||- +.+.+||++|++.|+++|+||.+|+|++ |+|+||.+|++.|||++|.||+|+++ T Consensus 1 GLvlPP~iAP~QVVIiPi~~k~~~~~~v~~~~~~i~~~L~~~~iRv~iD~rd~~~pG~K~~~wE~kGVPiRIEIGprDle 80 (202) T cd00862 1 GLVLPPRVAPIQVVIVPIGIKDEKREEVLEAADELAERLKAAGIRVHVDDRDNYTPGWKFNDWELKGVPLRIEIGPRDLE 80 (202) T ss_pred CCCCCCCCCCCEEEEEEECCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHCCCEEEEECHHHHH T ss_conf 97368420585499997348723289999999999999986798899977878688888899998089779998715863 Q ss_pred CCCEEEEECCCCCEEEECHHHHHHHHHH Q ss_conf 5818999889886633429999999987 Q gi|254781004|r 616 ERSIGIRRFGSTTTQKVSLLDAIPILTK 643 (652) Q Consensus 616 ~~~Vtir~R~~~~~~~i~lee~i~~l~~ 643 (652) +++|++.+||++++..++++++.+.+.+ T Consensus 81 ~~~v~v~rRD~~~K~~v~~~el~~~i~~ 108 (202) T cd00862 81 KNTVVIVRRDTGEKKTVPLAELVEKVPE 108 (202) T ss_pred CCCEEEEEECCCCEEEEEHHHHHHHHHH T ss_conf 7918999917995789788999999999 No 37 >cd00861 ProRS_anticodon_short ProRS Prolyl-anticodon binding domain, short version found predominantly in bacteria. ProRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only. Probab=99.90 E-value=4.3e-23 Score=179.20 Aligned_cols=91 Identities=21% Similarity=0.335 Sum_probs=88.4 Q ss_pred CEEEEEEEC---CHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCC Q ss_conf 808999857---96689999999999998798899975896636766655434898899987425325818999889886 Q gi|254781004|r 552 PIQAIVTTI---TSSAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGSTT 628 (652) Q Consensus 552 P~QV~Iipi---~e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~ 628 (652) |+||+|||+ ++++.+||++||+.|+++|+||.+|||++++|+||++|+++|+||+||||++++++++|++|+|++++ T Consensus 1 P~qv~Iipi~~~~~~~~~~a~~l~~~L~~~gi~v~~Ddr~~~~G~K~~~ael~GiP~~iiiG~k~~~~~~v~vr~R~~~~ 80 (94) T cd00861 1 PFDVVIIPMNMKDEVQQELAEKLYAELQAAGVDVLLDDRNERPGVKFADADLIGIPYRIVVGKKSAAEGIVEIKVRKTGE 80 (94) T ss_pred CEEEEEEECCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCEEEEECCCCHHCCEEEEEECCCCC T ss_conf 94999997489988999999999999988798999987997277999999974998799990662117999999978995 Q ss_pred EEEECHHHHHHHHH Q ss_conf 63342999999998 Q gi|254781004|r 629 TQKVSLLDAIPILT 642 (652) Q Consensus 629 ~~~i~lee~i~~l~ 642 (652) +..++++++++.|+ T Consensus 81 ~~~v~~~~~i~~ik 94 (94) T cd00861 81 KEEISIDELLEFLQ 94 (94) T ss_pred EEEECHHHHHHHHC T ss_conf 68971999999759 No 38 >cd00860 ThrRS_anticodon ThrRS Threonyl-anticodon binding domain. ThrRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only. Probab=99.90 E-value=7.5e-23 Score=177.50 Aligned_cols=91 Identities=34% Similarity=0.577 Sum_probs=89.5 Q ss_pred CEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCEEE Q ss_conf 80899985796689999999999998798899975896636766655434898899987425325818999889886633 Q gi|254781004|r 552 PIQAIVTTITSSAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGSTTTQK 631 (652) Q Consensus 552 P~QV~Iipi~e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~~ 631 (652) |+||+||||++++.+||.+|++.|+++|+||++|+|++++|+||++|++.|+||++|||++|+++++|++|+|++++++. T Consensus 1 P~QV~Iipi~~~~~~ya~~i~~~Lr~~girv~~D~~~~~lg~ki~~a~~~~ip~~iiiG~~E~~~~~v~vr~r~~~~q~~ 80 (91) T cd00860 1 PVQVVVIPVTDEHLDYAKEVAKKLSDAGIRVEVDLRNEKLGKKIREAQLQKIPYILVVGDKEVETGTVSVRTRDGGDLGS 80 (91) T ss_pred CEEEEEEEECHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHCCCEEEEECHHHHHCCEEEEEECCCCCEEE T ss_conf 92899998888799999999999998899899965889875999999981999899996046538979999989993678 Q ss_pred ECHHHHHHHHH Q ss_conf 42999999998 Q gi|254781004|r 632 VSLLDAIPILT 642 (652) Q Consensus 632 i~lee~i~~l~ 642 (652) +++++|++.|+ T Consensus 81 v~~~~~i~~lk 91 (91) T cd00860 81 MSLDEFIEKLK 91 (91) T ss_pred EEHHHHHHHHC T ss_conf 77999999749 No 39 >PRK05431 seryl-tRNA synthetase; Provisional Probab=99.89 E-value=1.6e-22 Score=175.31 Aligned_cols=266 Identities=18% Similarity=0.321 Sum_probs=210.1 Q ss_pred HHHHHHHHHHHCCEEEECC----CCCCEEEEECCHHHHHHHHHHHHHH-HHHCCCEEEECCCCCCCCCCCCCCCCCCCCC Q ss_conf 3200887541144044226----7874356501078999999999998-8768963855543244310001121001255 Q gi|254781004|r 242 EKRDHRKLAREMDLFHIAE----DGSGVIFWHRKGWKIFQTLISYMRR-KIKDDYEEINTPQVLDQHLWQQSGHWDSYRA 316 (652) Q Consensus 242 ~~rdHr~lg~~~~lf~~~~----~~~G~~~wlP~G~~i~~~ie~~ir~-~~~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~ 316 (652) +-+||-.||+.++|+-|.. .|++.|++.-.|+++-+.|.+|+-. ..++||++|.+|.|.+.+...-+|...++.+ T Consensus 135 ~~k~H~elge~l~liDfe~aaKiSGsRF~~Lkg~~A~LerALi~f~ld~~~~~Gy~~v~~P~lv~~~~~~gtGqlPkfe~ 214 (422) T PRK05431 135 EPKDHWELGEKLGILDFERAAKVSGSRFYVLKGDGARLERALIQFMLDLHTEEGYTEVIPPYLVNEESMYGTGQLPKFEE 214 (422) T ss_pred CCCCHHHHHHHCCCCCHHHHHHHCCCCEEEEHHHHHHHHHHHHHHHHHHHHHCCCEEEECHHHHHHHHHHHHCCCCCCHH T ss_conf 88778899876476104667755078704600489999999999999999987997996737636788744537974487 Q ss_pred CCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCC---CCEEEEECCCCCCCCCEE Q ss_conf 54201001456777823774343125777765201222231043200010001278764---520001056353344210 Q gi|254781004|r 317 NMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSG---SLHGLMRVRGFTQDDAHV 393 (652) Q Consensus 317 ~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg---~l~GL~R~ReF~~~Dah~ 393 (652) +||... +++++|=||.|-..+-+|++.+-...+||+++.=++.|||-|.-. -..||+|+.+|.--+... T Consensus 215 ~~Y~i~--------~~dl~Li~TAEvpL~~~~~~eil~~~~LPik~~~~S~CFR~EAGs~GkdtrGl~RvHQF~KVE~f~ 286 (422) T PRK05431 215 DLFKIE--------DDDLYLIPTAEVPLTNLHRDEILDEEELPIKYTAYSPCFRSEAGSAGRDTRGLIRQHQFDKVELVK 286 (422) T ss_pred HHHHHC--------CCCEEEECCCCCCHHHHHHCCCCCHHHCCHHHEEECCHHHHCCCCCCCCCCCCEEEEEECEEEEEE T ss_conf 754105--------887176056532123344011336543984452788365303133675567612443000034335 Q ss_pred ECCHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 10537678878999999887665521332100014653112586035778999999999850212125667520555555 Q gi|254781004|r 394 FCTKEQMFNECLKIHNLIISIYKDFGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGA 473 (652) Q Consensus 394 F~~~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~a 473 (652) ||.||+..+++..++.....++..+|+....+.+++++-++..... .+++ .+.+|... T Consensus 287 ~~~pe~S~~~~e~~~~~~e~i~~~L~lpyrvv~~~sgdlg~~a~~~---------------------yDiE-~W~P~~~~ 344 (422) T PRK05431 287 FTKPEDSYAELEELTGNAEEILQKLELPYRVVLLCTGDMGFSAAKT---------------------YDLE-VWLPGQNT 344 (422) T ss_pred ECCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHH---------------------EEEE-EEECCCCC T ss_conf 3586779999999999999999875985023567876557665654---------------------5457-87156685 Q ss_pred CCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEECCCEEEEEEEEECCCHHHHHHHHHHHHHH----C-CCCCC Q ss_conf 455432110021103310012110111022346741673179721379964034103789999999973----8-70887 Q gi|254781004|r 474 FYGPKFEYILKDAIGRDWQCGTIQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFGSIERFIGIMIENF----K-GNLPL 548 (652) Q Consensus 474 fygpkid~~~~d~~gr~~~~~tiq~df~~p~~f~l~y~~~dg~~~~pvmihr~~~Gs~eR~ia~liE~~----~-g~~P~ 548 (652) + +-.--|+. ..||+ +.|.++.|.+.+|+...+.++.-+.+ .+.|+++||+|.| | ...|. T Consensus 345 y-------------~EvsS~Sn-c~d~Q-srRl~iry~~~~~~~~~~htlNgt~~-A~~R~l~ailEn~q~~dg~i~iP~ 408 (422) T PRK05431 345 Y-------------REISSCSN-CTDFQ-ARRANIRYRDEDGKPELVHTLNGSGL-AVGRTLVAILENYQQADGSVTIPE 408 (422) T ss_pred E-------------EEEEEECC-CCHHH-HHHCCCEEECCCCCEEEEEECCCCHH-HHHHHHHHHHHHCCCCCCCEECCH T ss_conf 4-------------52400015-01488-75302655179998664043115121-887899999884648998884781 Q ss_pred CCCCE Q ss_conf 33780 Q gi|254781004|r 549 WLSPI 553 (652) Q Consensus 549 wLAP~ 553 (652) -|-|| T Consensus 409 ~L~~y 413 (422) T PRK05431 409 VLRPY 413 (422) T ss_pred HHHHH T ss_conf 31201 No 40 >cd00738 HGTP_anticodon HGTP anticodon binding domain, as found at the C-terminus of histidyl, glycyl, threonyl and prolyl tRNA synthetases, which are classified as a group of class II aminoacyl-tRNA synthetases (aaRS). In aaRSs, the anticodon binding domain is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only. This domain is also found in the accessory subunit of mitochondrial polymerase gamma (Pol gamma b). Probab=99.89 E-value=1.8e-22 Score=174.97 Aligned_cols=90 Identities=22% Similarity=0.419 Sum_probs=87.3 Q ss_pred CEEEEEEECCH---HHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCC Q ss_conf 80899985796---689999999999998798899975896636766655434898899987425325818999889886 Q gi|254781004|r 552 PIQAIVTTITS---SAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGSTT 628 (652) Q Consensus 552 P~QV~Iipi~e---~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~ 628 (652) |+||+||||++ ++.+||.+|+++|+++|+||.+|+|++++|+||++|+++|+||++|||++|+++++|++|+|++++ T Consensus 1 P~qv~Iipi~~~~~e~~~~a~~l~~~L~~~gi~v~~Ddr~~~~g~k~~~a~~~giP~~iiiG~~e~e~~~v~vr~r~~~e 80 (94) T cd00738 1 PIDVAIVPLTDPRVEAREYAQKLLNALLANGIRVLYDDRERKIGKKFREADLRGVPFAVVVGEDELENGKVTVKSRDTGE 80 (94) T ss_pred CEEEEEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEECCHHHHCCEEEEEECCCCC T ss_conf 91999998438977899999999999998899899987999975999999974998999988003437989999989997 Q ss_pred EEEECHHHHHHHH Q ss_conf 6334299999999 Q gi|254781004|r 629 TQKVSLLDAIPIL 641 (652) Q Consensus 629 ~~~i~lee~i~~l 641 (652) ++.++++++++.| T Consensus 81 q~~v~~~e~~~~l 93 (94) T cd00738 81 SETLHVDELPEFL 93 (94) T ss_pred EEEEEHHHHHHHH T ss_conf 7898999988865 No 41 >COG0172 SerS Seryl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=99.85 E-value=2.2e-20 Score=160.48 Aligned_cols=284 Identities=20% Similarity=0.351 Sum_probs=212.4 Q ss_pred CCCCCEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECC----CCCCEEEEECCHHHHHHHHHHHHHH- Q ss_conf 75642068999998379889999999999753200887541144044226----7874356501078999999999998- Q gi|254781004|r 211 DSNRPMLSRIYGTAWNTQQELTQYLYFLEESEKRDHRKLAREMDLFHIAE----DGSGVIFWHRKGWKIFQTLISYMRR- 285 (652) Q Consensus 211 d~~~~~l~Riyg~af~~~~~l~~~~~~~eea~~rdHr~lg~~~~lf~~~~----~~~G~~~wlP~G~~i~~~ie~~ir~- 285 (652) |+..-...|..|..-.. .-+-+-+||.-||..+++--|.. .|++.+++.+.|+++-+.|.+|+=. T Consensus 118 de~~n~~vr~~g~~~~~----------~~~f~pk~H~~lge~l~~~Df~~aaKvsGsrf~~~~~~~a~L~rAL~~f~ld~ 187 (429) T COG0172 118 DEDDNVEVRRWGEPPVF----------VFDFEPKDHVELGEKLGLLDFERAAKVSGSRFYFYKGKGARLERALIQFMLDL 187 (429) T ss_pred CCCCCEEEEEEECCCCC----------CCCCCCCHHHHHHHHCCCCHHHHHCCCCCCCEEEECCHHHHHHHHHHHHHHHH T ss_conf 86565488997337654----------44678530998765427624566502078743898077899999999999999 Q ss_pred HHHCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHH Q ss_conf 87689638555432443100011210012555420100145677782377434312577776520122223104320001 Q gi|254781004|r 286 KIKDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEF 365 (652) Q Consensus 286 ~~~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~ 365 (652) +-++||++|++|.|.+.+.-..+|.|.+|.++||.+.. .+|+|=||.|-..+.+++..+-+-.+||++++-+ T Consensus 188 ~~~~Gf~e~~~P~lv~~e~m~gtgqlpkf~e~~y~v~~--------~~~~LipTaEvpl~~l~~~Eil~~~~LP~k~~~~ 259 (429) T COG0172 188 HTKHGFTEVLPPYLVNLESMFGTGQLPKFEEDLYKVED--------PDLYLIPTAEVPLTNLHRDEILDEEDLPIKYTAY 259 (429) T ss_pred HHHCCCEEEECCEEECHHHHHCCCCCCCCCCCCEEECC--------CCEEEEECCHHHHHHHHCCCCCCCCCCCEEEEEE T ss_conf 98769658657606059886223789888012158458--------9879970202156786516201521278026787 Q ss_pred EEEECCCCCC---CCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCHHHHH Q ss_conf 0001278764---5200010563533442101053767887899999988766552133210001465311258603577 Q gi|254781004|r 366 GSVYRNEPSG---SLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDFGFEKIMVKLSTRPEKRVGSDALWD 442 (652) Q Consensus 366 ~~~~R~E~sg---~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~~g~~~~~~ 442 (652) ++|||.|... -.-||.|+.+|..-+.-.||.|++..++...+.....+++..+++....+.+.+++.++..... T Consensus 260 S~cFR~EAGs~GrdtrGliRvHQF~KVE~v~~~~Pe~S~~~~E~m~~~ae~il~~LeLPyRvv~lctGDlgf~a~kk--- 336 (429) T COG0172 260 SPCFRSEAGSAGKDTRGLIRVHQFDKVELVVITKPEESEEELEEMLGNAEEVLQELELPYRVVNLCTGDLGFSAAKK--- 336 (429) T ss_pred CHHHHCCCCCCCCCCCCEEEEEEEEEEEEEEEECCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCC--- T ss_conf 72542145656643553014664355899997070116999999999999999970898368432567768864673--- Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCCCCEEEE-ECCCEEEEE Q ss_conf 8999999999850212125667520555555455432110021103310012110111022346741673-179721379 Q gi|254781004|r 443 DAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYILKDAIGRDWQCGTIQVDFNLPSRFNAFYVN-SHSEKCHPV 521 (652) Q Consensus 443 ~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~~~d~~gr~~~~~tiq~df~~p~~f~l~y~~-~dg~~~~pv 521 (652) ..++ .+-|+.+.+ .....|+. -.||+ +.|+++.|-+ .+|+...+ T Consensus 337 ------------------YDlE-vWlP~q~~y-------------rEisScSn-c~DfQ-aRR~~~Ryr~~~~~k~~~v- 381 (429) T COG0172 337 ------------------YDLE-VWLPGQNKY-------------REISSCSN-CTDFQ-ARRLNIRYRDKEEGKREFV- 381 (429) T ss_pred ------------------EEEE-EEECCCCCC-------------EEEEEEEC-CCCHH-HHHHHCCCCCCCCCCCEEE- T ss_conf ------------------2379-985377874-------------02124434-54288-8987540563657996699- Q ss_pred EEEECCCHH---HHHHHHHHHHHHC---C--CCCCCCCCE Q ss_conf 964034103---7899999999738---7--088733780 Q gi|254781004|r 522 MIHRAVFGS---IERFIGIMIENFK---G--NLPLWLSPI 553 (652) Q Consensus 522 mihr~~~Gs---~eR~ia~liE~~~---g--~~P~wLAP~ 553 (652) | +|=|| +.|.+.|++|.|. | .-|--|=|| T Consensus 382 --h-TLNGsglA~~R~l~AilENyq~~dG~v~IPevL~~y 418 (429) T COG0172 382 --H-TLNGSGLAVGRTLVAILENYQQEDGSVKIPEVLRPY 418 (429) T ss_pred --E-ECCCHHHHHHHHHHHHHHCCCCCCCCEECCHHHHHH T ss_conf --9-546427799999999997153789986344788732 No 42 >cd02426 Pol_gamma_b_Cterm C-terminal domain of mitochondrial DNA polymerase gamma B subunit, which is required for processivity. Polymerase gamma replicates and repairs mitochondrial DNA. The c-terminal domain of its B subunit is strikingly similar to the anticodon-binding domain of glycyl tRNA synthetase. Probab=99.80 E-value=6.6e-19 Score=150.30 Aligned_cols=100 Identities=12% Similarity=0.161 Sum_probs=91.4 Q ss_pred CCCCCCCCEEEEEEECC---HHHHHHHHHHHHHHHHCCCEE----EEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCC Q ss_conf 08873378089998579---668999999999999879889----99758966367666554348988999874253258 Q gi|254781004|r 545 NLPLWLSPIQAIVTTIT---SSAVEYAQEIANLLKSHHLSI----ETDFRNETINYKIREHSIKKIPIIIICGDKEASER 617 (652) Q Consensus 545 ~~P~wLAP~QV~Iipi~---e~~~eya~~i~~~L~~~girV----~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~ 617 (652) +||+.|||+||+|+|++ ++..+.|++|++.|+++|+.| .+|+ ..++|+++|++|.+|+||+|+|+.+.+++| T Consensus 20 ~Lhp~LAP~Kvai~~~~~~~~eL~~~a~~l~~~L~~~gi~v~~~~~~d~-~~sIGkrY~r~DeiGtP~citvD~~TLedg 98 (128) T cd02426 20 KLHPCLAPYKVAIDCGKGDTAELRDLCQGLKNELREAGLSVWPGYLETQ-HSSLEQLLDKYDEMGVLFTLLISEQTLENG 98 (128) T ss_pred ECCCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC-CCCHHHHHHHHHHCCCCEEEEECCCCCCCC T ss_conf 4488747717999703688188899999999999873994354635454-347889887766428847999778743479 Q ss_pred CEEEEECCCCCEEEECHHHHHHHHHHHC Q ss_conf 1899988988663342999999998731 Q gi|254781004|r 618 SIGIRRFGSTTTQKVSLLDAIPILTKES 645 (652) Q Consensus 618 ~Vtir~R~~~~~~~i~lee~i~~l~~e~ 645 (652) +||+|+||+++|+.|+++++.+.|.+.. T Consensus 99 tVtlR~RDT~~QerI~I~dl~~yl~~~i 126 (128) T cd02426 99 LLQLRSRDTTLKETIHISDLPDYLLRYI 126 (128) T ss_pred CEEEEECCCCHHEEEEHHHHHHHHHHHH T ss_conf 5898854760213576999899999986 No 43 >pfam03129 HGTP_anticodon Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain. Probab=99.80 E-value=9.4e-19 Score=149.24 Aligned_cols=90 Identities=27% Similarity=0.451 Sum_probs=87.0 Q ss_pred EEEEEECCHH--HHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCEEE Q ss_conf 8999857966--89999999999998798899975896636766655434898899987425325818999889886633 Q gi|254781004|r 554 QAIVTTITSS--AVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGSTTTQK 631 (652) Q Consensus 554 QV~Iipi~e~--~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~~ 631 (652) ||+|||++++ +.+||.+|++.|+++|++|.+|+++.++|+||++|++.|+||+++||++|+++++|++|+|+++++.. T Consensus 1 qV~Vipi~~~~~~~~~a~~i~~~Lr~~gi~v~~d~~~~~~~~k~~~a~~~g~p~~iiiG~~E~~~~~v~ik~r~~~~~~~ 80 (93) T pfam03129 1 QVVVIPLGEKDELEDYAQKLAEELREAGIRVELDDRNESLGKKFRDADLIGIPFRLVVGEKELENGTVTVRDRDTGEKET 80 (93) T ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHCCCEEEEECHHHHHCCEEEEEECCCCCEEE T ss_conf 99999822867689999999999987899799988999978999989870999078897788858989999999997889 Q ss_pred ECHHHHHHHHHH Q ss_conf 429999999987 Q gi|254781004|r 632 VSLLDAIPILTK 643 (652) Q Consensus 632 i~lee~i~~l~~ 643 (652) ++++++++.|++ T Consensus 81 v~~~~~~~~i~~ 92 (93) T pfam03129 81 VSLEELVEKLKE 92 (93) T ss_pred EEHHHHHHHHHH T ss_conf 879999999974 No 44 >cd00774 GlyRS-like_core Glycyl-tRNA synthetase (GlyRS)-like class II core catalytic domain. GlyRS functions as a homodimer in eukaryotes, archaea and some bacteria and as a heterotetramer in the remainder of prokaryotes. It is responsible for the attachment of glycine to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for ATP binding and hydrolysis. This alignment contains only sequences from the GlyRS form which homodimerizes. The heterotetramer glyQ is in a different family of class II aaRS. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs. This domain is also found at the N-terminus of the accessory subunit of mitochondrial polymerase gamma (Pol gamma b). Pol gamma b stimulates processive DNA synthesis and is functional as a homodimer, which can associate with the catalytic subunit Pol gamma alpha to form a heterotrimer. Despite significant both structural and sequence similarity with Gly Probab=99.79 E-value=3.8e-19 Score=151.93 Aligned_cols=154 Identities=23% Similarity=0.300 Sum_probs=129.3 Q ss_pred CCCEEEEECCHHHHHHHHHHHHHHHH--HC-CCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECC Q ss_conf 87435650107899999999999887--68-9638555432443100011210012555420100145677782377434 Q gi|254781004|r 262 GSGVIFWHRKGWKIFQTLISYMRRKI--KD-DYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKP 338 (652) Q Consensus 262 ~~G~~~wlP~G~~i~~~ie~~ir~~~--~~-G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkP 338 (652) .+|+|=|=|+|..++++|.+.||+++ .+ +..+|.+|+|.|.- ||.+.. ...+..+..-.||| T Consensus 21 ~ag~yDyGPlG~~LK~Ni~~~W~~~fv~~~~~~~~id~~ii~p~L--------------mF~T~i-G~~~~~~~~~yLRP 85 (254) T cd00774 21 VAGFYDYGPLGVELKNNIKSAWRKSFVLEEEDMLEIDSPIITPEL--------------MFKTSI-GPVESGGNLGYLRP 85 (254) T ss_pred HHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCC--------------CEECCC-CCCCCCCCCEEECH T ss_conf 310615482379999999999999987437987995410036632--------------200121-56558986035363 Q ss_pred CCHHHHHHHHHHCCCCHH-HCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 312577776520122223-1043200010001278764520001056353344210105376788789999998876655 Q gi|254781004|r 339 MNCPGHVAVFNHGLKSYR-ELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKD 417 (652) Q Consensus 339 mncp~h~~i~~~~~~SYr-dLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~ 417 (652) -++.++..-|++-.++.| .||+.++|+|++||||+|| ++||+|+|||+|.+...||.|++....+..+.+.....+.. T Consensus 86 ETAQGifvnFk~~~~~~r~kLPFGiaQIGk~FRNEIsP-r~~l~R~REF~q~EiE~F~~P~~~~~~~~~w~~~~l~~~~~ 164 (254) T cd00774 86 ETAQGIFVNFKNLLEFNRRKLPFGVAQIGKSFRNEISP-RNGLFRVREFTQAEIEFFVDPEKSHPWFDYWADQRLKWLPK 164 (254) T ss_pred HHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCCC-CCCCCCHHCHHHHEEEEEECCCCCHHHHHHHHHHHHHHHHH T ss_conf 45347798789999964888884456542022673477-76754000002125788857986425789999999866387 Q ss_pred HCCCCCEECCCCCC Q ss_conf 21332100014653 Q gi|254781004|r 418 FGFEKIMVKLSTRP 431 (652) Q Consensus 418 ~g~~~~~~~ls~~~ 431 (652) +|......+.+..+ T Consensus 165 ~~~~~~~~~~~~~~ 178 (254) T cd00774 165 FAQSPENLRLTDHE 178 (254) T ss_pred HCCCCCCEEEECCC T ss_conf 38996653673046 No 45 >cd00859 HisRS_anticodon HisRS Histidyl-anticodon binding domain. HisRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only. Probab=99.66 E-value=1.4e-15 Score=127.43 Aligned_cols=90 Identities=16% Similarity=0.283 Sum_probs=87.6 Q ss_pred EEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCEEEE Q ss_conf 08999857966899999999999987988999758966367666554348988999874253258189998898866334 Q gi|254781004|r 553 IQAIVTTITSSAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGSTTTQKV 632 (652) Q Consensus 553 ~QV~Iipi~e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~~i 632 (652) +||.|+|++++..+||.+|.+.|+++|++|.+|.++.+++++++.|+..|+||+||||++|+++++|++|++.++++..+ T Consensus 2 vdv~ii~~~~~~~~~a~~i~~~LR~~gi~v~~~~~~~~l~kqlk~A~k~~~~~~iiiG~~E~~~~~v~iK~~~t~~q~~i 81 (91) T cd00859 2 VDVYVVPLGEGALSEALELAEQLRDAGIKAEIDYGGRKLKKQFKYADRSGARFAVILGEDELAAGVVTVKDLETGEQETV 81 (91) T ss_pred CEEEEEEECHHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCEEEEECCHHHHCCEEEEEECCCCCEEEE T ss_conf 48999981869999999999999988993999736887889999999709988999865142169799999999958998 Q ss_pred CHHHHHHHHH Q ss_conf 2999999998 Q gi|254781004|r 633 SLLDAIPILT 642 (652) Q Consensus 633 ~lee~i~~l~ 642 (652) +++++++.|+ T Consensus 82 ~~~eli~~lK 91 (91) T cd00859 82 ALDELVEELK 91 (91) T ss_pred EHHHHHHHHC T ss_conf 8999999759 No 46 >PRK12292 hisZ ATP phosphoribosyltransferase regulatory subunit; Provisional Probab=99.63 E-value=3.2e-13 Score=111.10 Aligned_cols=320 Identities=16% Similarity=0.171 Sum_probs=179.7 Q ss_pred CCCCEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCC-CCCCCCCCCCCCEEECCCCCCCCCCEEEECC Q ss_conf 787435650107899999999999887-68963855543244310001-1210012555420100145677782377434 Q gi|254781004|r 261 DGSGVIFWHRKGWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQ-SGHWDSYRANMFAVKCADDTIKDLRTFALKP 338 (652) Q Consensus 261 ~~~G~~~wlP~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~-SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkP 338 (652) .-.|+-=++|.=...++.+++.+++.+ ++||++|.||.+-..+++.. +|. .-..+||.+.. +..++.++||| T Consensus 5 ~P~G~~D~lp~e~~~~~~i~~~~~~~f~~~Gy~~I~tP~lE~~e~f~~~~g~--~~~~~~~~f~D----~~~Gr~l~LRp 78 (388) T PRK12292 5 LPEGIRDLLPEEARKIEEIRRRLLDVFRLWGYEEVITPTLEYLDTLLTGGGA--DLDLRTFKLVD----QLSGRTLGLRP 78 (388) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHCCCCCC--CCCHHEEEEEE----CCCCCEEEECC T ss_conf 9999866799999999999999999999879967357660629885445775--02110489982----69997898568 Q ss_pred CCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 31257777652012222310432000100012787645200010563533442101053767887899999988766552 Q gi|254781004|r 339 MNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDF 418 (652) Q Consensus 339 mncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~ 418 (652) =--+..+-++.+..++ ...|+|+|.+|.+||+|... . -|-|||+|-++.+|=.++ .. .-..++.+..++++.+ T Consensus 79 D~T~~iaR~~~~~~~~-~~~p~r~~Y~g~vfR~e~pq--~--Gr~REf~Q~G~EiiG~~~-~~-aDaEvI~l~~~~l~~l 151 (388) T PRK12292 79 DMTPQIARIAATRLAN-RPGPLRLCYAGNVFRAQPRG--L--GRSREFLQSGVELIGDAG-IE-ADAEVLSLLLDALKAL 151 (388) T ss_pred CCCHHHHHHHHHHHHC-CCCCEEEEEEEEEEEECCCC--C--CCCCEEEECCEEEECCCC-HH-HHHHHHHHHHHHHHHC T ss_conf 7719999999973104-67876899852268705898--9--961023425755317888-89-9999999999999976 Q ss_pred CCCCCEECCCCCCCC-----CCC-CHHHHHHHHHHHHHHHHHHCCCCCCCC--C-----------------CCCCCCCCC Q ss_conf 133210001465311-----258-603577899999999985021212566--7-----------------520555555 Q gi|254781004|r 419 GFEKIMVKLSTRPEK-----RVG-SDALWDDAENIMKGVLDTIKNSSQDRI--N-----------------TGVLLGEGA 473 (652) Q Consensus 419 g~~~~~~~ls~~~~~-----~~g-~~~~~~~ae~~l~~~l~~~~~~~~~~~--~-----------------~~~~~g~~a 473 (652) |+..+.+.+.....- ..+ +.+.. ..+..++..........+ . ......... T Consensus 152 gl~~~~i~i~~~~i~~~ll~~~~~~~~~~----~~l~~~l~~kd~~~l~~l~~~~~~~~~l~~l~~l~g~~~~~~~~~~~ 227 (388) T PRK12292 152 GLPNFTLDLGHVGLFRALLAALGLPEELE----EALRTALANKDYVALEELDLPEELRARLLALPRLRGGPEVLLEARKL 227 (388) T ss_pred CCCCEEEEECCHHHHHHHHHHCCCCHHHH----HHHHHHHHHHCHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 99864999778889999999839999999----99999998748666876399999999999887745999999999976 Q ss_pred CCCCCC----C-C-CHHHHCCCCCCCCCCCHHHHCC--------CCCCCEEEEEC-----------------CCEEEEEE Q ss_conf 455432----1-1-0021103310012110111022--------34674167317-----------------97213799 Q gi|254781004|r 474 FYGPKF----E-Y-ILKDAIGRDWQCGTIQVDFNLP--------SRFNAFYVNSH-----------------SEKCHPVM 522 (652) Q Consensus 474 fygpki----d-~-~~~d~~gr~~~~~tiq~df~~p--------~~f~l~y~~~d-----------------g~~~~pvm 522 (652) +.+++. + + .+.+.+........+.+|+.+- .-|.. |.... |.. .|-+ T Consensus 228 ~~~~~~~~~l~~l~~l~~~l~~~~~~~~i~iD~~lvRGldYYTG~vFE~-~~~~~~~~i~gGGRYD~L~~~fG~~-~PAv 305 (388) T PRK12292 228 LPGSPAKRALDELKALAEALEKAGYGIYLQLDLGLLRHYDYYTGIVFQG-YVDGVGNPIARGGRYDDLGGRFGRA-RPAT 305 (388) T ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCCEEEECCHHCCCCCCCCCEEEEE-EECCCCCEECCCCCHHHHHHHHCCC-CCEE T ss_conf 6795568999999999999984499964720400035766656439999-9189888250575664799983899-9715 Q ss_pred EEECCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHH-HC Q ss_conf 64034103789999999973870887337808999857966899999999999987988999758966367666554-34 Q gi|254781004|r 523 IHRAVFGSIERFIGIMIENFKGNLPLWLSPIQAIVTTITSSAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHS-IK 601 (652) Q Consensus 523 ihr~~~Gs~eR~ia~liE~~~g~~P~wLAP~QV~Iipi~e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~-l~ 601 (652) |.--++||++.++-+.- -+|.++.||+..++..++|-++.+.|+++|++|++|.++.....-.+-+. +. T Consensus 306 ---Gfa~g~eRl~~~l~~~~-------~~~~~~~~v~~~~~~~~~a~~~a~~LR~~G~~v~~~l~~~~~~~~~~~~~~~~ 375 (388) T PRK12292 306 ---GFSLDLDRLRELLPEEE-------ERKQKDLLILAPWERLEAALAAAQALRKKGEIVVLALPGHDDAEAEAADRQLV 375 (388) T ss_pred ---EEEECHHHHHHHHHHCC-------CCCCCEEEEECCHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHH T ss_conf ---89963999999864205-------67886799985867699999999999977988999899998688988779986 Q ss_pred CCCEEEEE Q ss_conf 89889998 Q gi|254781004|r 602 KIPIIIIC 609 (652) Q Consensus 602 giP~~ivI 609 (652) -.....++ T Consensus 376 ~~~~~w~~ 383 (388) T PRK12292 376 CREGVWVV 383 (388) T ss_pred HCCCCEEE T ss_conf 15881899 No 47 >COG0442 ProS Prolyl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=99.60 E-value=4.2e-15 Score=124.03 Aligned_cols=240 Identities=20% Similarity=0.205 Sum_probs=165.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCC----------------CCCHHHHHHHHHHHHHHHHHHCCC-CCC Q ss_conf 678878999999887665521332100014653112----------------586035778999999999850212-125 Q gi|254781004|r 399 QMFNECLKIHNLIISIYKDFGFEKIMVKLSTRPEKR----------------VGSDALWDDAENIMKGVLDTIKNS-SQD 461 (652) Q Consensus 399 q~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~----------------~g~~~~~~~ae~~l~~~l~~~~~~-~~~ 461 (652) ..++-+..++.....-|+++-+..+.+..-.|.+.. ...+..++.|+....+.++.-..- ... T Consensus 109 TsEe~it~~~~~~i~SYkdLPl~lYQi~~kfRdE~rpr~gllR~REF~mkdaySfh~~~e~a~~~y~~~~~~Y~~if~~i 188 (500) T COG0442 109 TSEEVITDMFRKWIRSYKDLPLKLYQIQSKFRDEKRPRFGLLRGREFLMKDAYSFHADEEDAEETYEKMLDAYSRIFLRL 188 (500) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCCCCCCHHEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 86899999999986556518812101345884334688773000203301112102898999999999999999999858 Q ss_pred CCCCCCCCCCCCCCCCCC--CCCHHHHCCCCCCCCCCC-HHHHCCCCC-CCEEEEEC-CCEEEEEEEEECCCHHHHHHHH Q ss_conf 667520555555455432--110021103310012110-111022346-74167317-9721379964034103789999 Q gi|254781004|r 462 RINTGVLLGEGAFYGPKF--EYILKDAIGRDWQCGTIQ-VDFNLPSRF-NAFYVNSH-SEKCHPVMIHRAVFGSIERFIG 536 (652) Q Consensus 462 ~~~~~~~~g~~afygpki--d~~~~d~~gr~~~~~tiq-~df~~p~~f-~l~y~~~d-g~~~~pvmihr~~~Gs~eR~ia 536 (652) .+.+...+.+..|-|... .|.+.---+++-|++|.. +.-|..+.| +.+|.+++ +... ..|..+||..+|.++ T Consensus 189 ~l~~~~~~ad~g~~Gg~~S~eF~~l~pd~ge~qi~ts~~y~aN~e~a~~~~~~~~~~~~~~~---~v~t~s~~~s~r~~~ 265 (500) T COG0442 189 PLIFGPVPADEGFIGGSYSHEFEALMPDGGEDQIATSHHYGANFEKAFIDIKFEDEEEGELE---YVHTTSYGISTRIIG 265 (500) T ss_pred CCEEEEECCCCCCCCCCCCEEEEEECCCCCCCEEEEECCHHHHHHHHCCCCCCCCCCCCCCE---EECCCCEEEEEEEEE T ss_conf 95388625667788876536999985689854799806266668774467775566655414---751462688740015 Q ss_pred HHHHHHC----CCCCCCCCCEEEEEEECC-------HHHHHHHHHHHHHHHHCCCEEEEECC-CCCCCHHHHHHHHCCCC Q ss_conf 9999738----708873378089998579-------66899999999999987988999758-96636766655434898 Q gi|254781004|r 537 IMIENFK----GNLPLWLSPIQAIVTTIT-------SSAVEYAQEIANLLKSHHLSIETDFR-NETINYKIREHSIKKIP 604 (652) Q Consensus 537 ~liE~~~----g~~P~wLAP~QV~Iipi~-------e~~~eya~~i~~~L~~~girV~~Ddr-~~~~G~Kir~a~l~giP 604 (652) ++|..+| ...|+-+||.||+++|+- ....+|++++...|...++||..|++ +.++|+|+..++.+||| T Consensus 266 ~~i~i~GDn~G~v~Pp~vA~~qV~~~~~~~~ga~~h~~~~~~~rd~~~~l~~~~~~~~~D~~~~~~~G~kl~~~e~ieVg 345 (500) T COG0442 266 AAILIHGDNEGLVLPPIVADIQVVIVPIFIKGANEHYKVVNYGRDVAEPLEKLGIRVEGDDRSPDGPGFKLNIWEGIEVG 345 (500) T ss_pred EEEEEECCCCCCCCCCHHCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCEEEECCCCCCCCCCCEEEEEECCCCC T ss_conf 89997148877648750045348997513367410133430454445665430258842456899997211021024457 Q ss_pred EEEEECHHHHHCCCEEEEECCCCC--EEEECHHHHHHHH Q ss_conf 899987425325818999889886--6334299999999 Q gi|254781004|r 605 IIIICGDKEASERSIGIRRFGSTT--TQKVSLLDAIPIL 641 (652) Q Consensus 605 ~~ivIG~ke~e~~~Vtir~R~~~~--~~~i~lee~i~~l 641 (652) .++.+|.|..++.++++-+|++.+ ....+....+.++ T Consensus 346 hif~lG~kyse~~~a~v~~r~g~~~~~~mg~ygigvsr~ 384 (500) T COG0442 346 HIFELGTKYSEAMNATVLDRDGKEQPKTMGCYGIGVSRL 384 (500) T ss_pred EEEEECCHHHHHCEEEEEECCCCCCCEEEECHHHHHHHH T ss_conf 799987412541804999248871441787000003459 No 48 >KOG1936 consensus Probab=99.60 E-value=7e-13 Score=108.73 Aligned_cols=390 Identities=15% Similarity=0.219 Sum_probs=228.4 Q ss_pred EECCHHHHHHHHHHHHHHHHHHHHHHHHHCCEEE---ECCCCCCEEEEECCHHHHHHHHHHHHHHHHH-CCCEEEECCCC Q ss_conf 8379889999999999753200887541144044---2267874356501078999999999998876-89638555432 Q gi|254781004|r 224 AWNTQQELTQYLYFLEESEKRDHRKLAREMDLFH---IAEDGSGVIFWHRKGWKIFQTLISYMRRKIK-DDYEEINTPQV 299 (652) Q Consensus 224 af~~~~~l~~~~~~~eea~~rdHr~lg~~~~lf~---~~~~~~G~~~wlP~G~~i~~~ie~~ir~~~~-~G~~eV~tP~l 299 (652) +-.++.+.++--+.+-+++.-- |..-..|. .-....|--=|-|.-..+++.|-+.+.+-++ +|+..|-||++ T Consensus 26 ~~a~~~~~~~~va~~~~~~~~~----~~~~~~~~~k~~lKtPKGTrD~~p~qm~lRe~if~~i~~vFkrhGa~~iDTPVF 101 (518) T KOG1936 26 SCASPEQISEEVALILALKNGL----GRSGSIFKKKFSLKTPKGTRDFSPEQMALREKIFSTIKEVFKRHGAETIDTPVF 101 (518) T ss_pred CCCCHHHHHHHHHHHHHHHCCC----CCCCCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEECCCCCH T ss_conf 4588667528999999874136----777777676232069998776888888999999999999999719744466004 Q ss_pred CCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECC-CCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCE Q ss_conf 443100011210012555420100145677782377434-3125777765201222231043200010001278764520 Q gi|254781004|r 300 LDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKP-MNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLH 378 (652) Q Consensus 300 ~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkP-mncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~ 378 (652) --+++.- .+|+|+- ...-+..|..+|-..||+ -+-|..-.+..+++.| +..|||+.+||... |+. T Consensus 102 ElkeiL~-----gKYGEds--kLiYdlkDQGGEl~SLRYDLTVPfARylAmNki~s-----ikRy~iAkVyRRd~-P~m- 167 (518) T KOG1936 102 ELKEILT-----GKYGEDS--KLIYDLKDQGGELCSLRYDLTVPFARYLAMNKITS-----IKRYHIAKVYRRDQ-PAM- 167 (518) T ss_pred HHHHHHH-----HHCCCCC--CEEEEHHHCCCCEEEEECCCCCHHHHHHHHCCCCC-----CEEEEEEEEEECCC-CHH- T ss_conf 5999986-----6405532--30675232488588741356647999998714552-----11046787875068-144- Q ss_pred EEEECCCCCCCCCEEECCHHHH--HHHHHHHHHHHHHHHHHHCCCCCEECCCCCCC------------CCCC-------- Q ss_conf 0010563533442101053767--88789999998876655213321000146531------------1258-------- Q gi|254781004|r 379 GLMRVRGFTQDDAHVFCTKEQM--FNECLKIHNLIISIYKDFGFEKIMVKLSTRPE------------KRVG-------- 436 (652) Q Consensus 379 GL~R~ReF~~~Dah~F~~~eq~--~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~------------~~~g-------- 436 (652) -==|-|||.|-|.-|--.-+++ ..| ++..+.+++..+|+.++.+.++.|.. ...+ T Consensus 168 trGR~REFYQcDFDIAG~~d~M~pdaE---~lkiv~e~L~~l~Igd~~iKvNhRkiLdgmf~v~GVp~~~frtICSsIDK 244 (518) T KOG1936 168 TRGRYREFYQCDFDIAGQFDPMIPDAE---CLKIVVEILSRLGIGDYGIKVNHRKILDGMFAVCGVPEDKFRTICSSIDK 244 (518) T ss_pred HCHHHHHHHCCCCCCCCCCCCCCCHHH---HHHHHHHHHHHCCCCCEEEEECHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 230245565067521266777786099---99999999865275515997147899998998809997896248776545 Q ss_pred -CHHHHHHHHH-----------H---H---------HHHHHHHCCCCCCCCCCCCCCCCCCC--------------CCCC Q ss_conf -6035778999-----------9---9---------99998502121256675205555554--------------5543 Q gi|254781004|r 437 -SDALWDDAEN-----------I---M---------KGVLDTIKNSSQDRINTGVLLGEGAF--------------YGPK 478 (652) Q Consensus 437 -~~~~~~~ae~-----------~---l---------~~~l~~~~~~~~~~~~~~~~~g~~af--------------ygpk 478 (652) +...|+.-.. . + .+.++.+..... + .....+-.++ +..+ T Consensus 245 LdK~pwedVkkEmv~eKGlsee~ad~igeyv~~~g~~eL~e~l~~d~~--l-~~n~~a~eal~dlk~Lf~y~~~fg~s~~ 321 (518) T KOG1936 245 LDKMPWEDVKKEMVFEKGLSEEAADRIGEYVSLKGLDELLEKLIADPK--L-SQNEAAKEALADLKQLFEYLEIFGISER 321 (518) T ss_pred HHCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHHHHHHHCCCC--C-CCCHHHHHHHHHHHHHHHHHHHCCCCCE T ss_conf 503889999999877417899999999878621668889999854875--4-3338899989999999999997298633 Q ss_pred CCCCHHHHCCCCCCCCCCCHHHH---------------CC------------CCCCCEEE--EECCCEEEEEEEEECCCH Q ss_conf 21100211033100121101110---------------22------------34674167--317972137996403410 Q gi|254781004|r 479 FEYILKDAIGRDWQCGTIQVDFN---------------LP------------SRFNAFYV--NSHSEKCHPVMIHRAVFG 529 (652) Q Consensus 479 id~~~~d~~gr~~~~~tiq~df~---------------~p------------~~f~l~y~--~~dg~~~~pvmihr~~~G 529 (652) |.|.+.-+-|-....|.|+-.+. .+ -|+|--.. +..+.. . .|.| T Consensus 322 isfDlSLARGLDYYTGvIyEav~~~~~~~~~a~~~~~~~~~e~~~vGSvaaGGRYDnLv~mf~~k~~~-v------PcvG 394 (518) T KOG1936 322 ISFDLSLARGLDYYTGVIYEAVLRGLRLICPAGRYDQAGSTEPGGVGSVAAGGRYDNLVGMFDKKGDK-V------PCVG 394 (518) T ss_pred EEEEHHHHCCCHHHHCEEEEEEECCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCC-C------CCCC T ss_conf 78525875340133123445652254235761122203665677754512476323689875124574-8------7454 Q ss_pred ---HHHHHHHHHHHHHCC-CCCCCCCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEECC-CCCCCHHHHHHHHCCCC Q ss_conf ---378999999997387-0887337808999857966899999999999987988999758-96636766655434898 Q gi|254781004|r 530 ---SIERFIGIMIENFKG-NLPLWLSPIQAIVTTITSSAVEYAQEIANLLKSHHLSIETDFR-NETINYKIREHSIKKIP 604 (652) Q Consensus 530 ---s~eR~ia~liE~~~g-~~P~wLAP~QV~Iipi~e~~~eya~~i~~~L~~~girV~~Ddr-~~~~G~Kir~a~l~giP 604 (652) ++||+..++=+...- .--..=.-.||-|..-........-++.+.|..+||-.++=-. +-++-+-+..++..||| T Consensus 395 ~S~GVeRiFsile~r~~~~~~~iR~t~t~V~V~~~~k~~l~eR~k~v~~Lw~agI~aE~~yk~~~~~~~q~~~~e~~~ip 474 (518) T KOG1936 395 QSVGVERIFSILEQRAATVATKIRTTETQVYVAAAGKNLLFERLKVVNALWDAGIKAEYLYKANPKLLKQFQYAEEAGIP 474 (518) T ss_pred EEEHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCHHHHHHHHHHHHHHCCCCHHHHHHCCCCHHHHHHHHHHCCCC T ss_conf 04007689999999887654102567628999836774089999999999863755200010484377888788863997 Q ss_pred EEEEECHHHHHCCCEEEEECCCCCEEEECHHHHHHHHHHH Q ss_conf 8999874253258189998898866334299999999873 Q gi|254781004|r 605 IIIICGDKEASERSIGIRRFGSTTTQKVSLLDAIPILTKE 644 (652) Q Consensus 605 ~~ivIG~ke~e~~~Vtir~R~~~~~~~i~lee~i~~l~~e 644 (652) +.+|||++|+.++.|.||+-.+.+.+.++.|+|+..|++- T Consensus 475 ~~vi~Ge~El~~G~V~vk~~~~re~~~v~~e~~v~~l~~~ 514 (518) T KOG1936 475 LAVILGEEELADGSVKVKNVGTREEVLVKREQFVSELKDL 514 (518) T ss_pred EEEEECHHHHCCCEEEEEECCCCCCEECCHHHHHHHHHHH T ss_conf 6999344565077079876145533103399999999877 No 49 >PRK00252 alaS alanyl-tRNA synthetase; Reviewed Probab=99.59 E-value=4.9e-14 Score=116.68 Aligned_cols=135 Identities=26% Similarity=0.483 Sum_probs=106.0 Q ss_pred HHHHHHHHHHHHHHHHHCCCCEE-EEECCC-CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHH Q ss_conf 99998999999999997799889-972435-8855788408888889899999999999996699804853276785445 Q gi|254781004|r 72 VIRHSCAHIMAEAVQSIWPEVQV-AIGPII-EDGFYYDFDKEQPFSSDELAQIEKKMQEIIARDSPFLKQYLSHEKAREL 149 (652) Q Consensus 72 i~~hS~ahlL~~Av~~l~p~~kl-~igp~i-~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA~~~ 149 (652) .-.||++|||.+|++++++. ++ .-|--+ .+.+-.||....++|++++.+||+.++++|.+|+|+....++.++|.+. T Consensus 550 ~~nHTatHlL~~alr~vlG~-hv~Q~GS~v~~~~lrfDfsh~~~lt~eel~~iE~~vN~~I~~n~~V~~~~~~~~eA~~~ 628 (864) T PRK00252 550 ARNHSATHLLHAALRQVLGE-HVTQKGSLVAPDRLRFDFSHFEALTPEELRQVEDLVNEQIRENLPVETEEMDIEEAKKM 628 (864) T ss_pred HHHCHHHHHHHHHHHHHHCC-CEEECCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCHHHHHHH T ss_conf 86357999999999998487-70866633577607997316888999999999999999999679658985698999752 Q ss_pred H--HHCCCHHHHHHHHHCCCCCCCEEECCCCC-EEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEEE Q ss_conf 4--41671135566542034554213025752-0100023552144422233333321110157756420689999983 Q gi|254781004|r 150 F--ESKKEAYKVEILESIPAKENVTLYRQGEW-FDLCRGPHVRSTGQVKKFFKLMKVAGAYWRGDSNRPMLSRIYGTAW 225 (652) Q Consensus 150 F--~~~~~~~K~~li~~~~~~~~v~~y~~g~~-~Dlc~GphvpsTg~ik~~fkL~~~sgayw~gd~~~~~l~Riyg~af 225 (652) - ...+++| ++.|.+...|+| +.+|.|+||.+||.|+. ||+++.++- ..-..||..++= T Consensus 629 ga~alfgekY----------~d~VRVV~ig~~s~ElCGGTHv~~TgeIg~-~kI~~e~~i-------~~GvrRIeavtG 689 (864) T PRK00252 629 GAMALFGEKY----------GDEVRVVSMGDFSIELCGGTHVSNTGDIGL-FKIVSESGV-------AAGVRRIEAVTG 689 (864) T ss_pred CHHHHCCCCC----------CCEEEEEEECCEEEECCCCCCCCCCCEEEE-EEEEEECCC-------CCCCEEEEEEEC T ss_conf 3121014214----------880699998992685678988997052354-999984563-------588179999715 No 50 >PRK13902 alaS lanyl-tRNA synthetase; Provisional Probab=99.56 E-value=9.2e-14 Score=114.80 Aligned_cols=148 Identities=19% Similarity=0.397 Sum_probs=110.4 Q ss_pred HHHHHHHHHHHHHHHHCCCCEE-EEECCC-CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHH Q ss_conf 9998999999999997799889-972435-88557884088888898999999999999966998048532767854454 Q gi|254781004|r 73 IRHSCAHIMAEAVQSIWPEVQV-AIGPII-EDGFYYDFDKEQPFSSDELAQIEKKMQEIIARDSPFLKQYLSHEKARELF 150 (652) Q Consensus 73 ~~hS~ahlL~~Av~~l~p~~kl-~igp~i-~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA~~~F 150 (652) -.||++|||.+|++++++. ++ .-|..+ .+-+-.||....++|++++.+||+.++++|.+|+|+....++.++|.+.+ T Consensus 591 ~nHTaTHlL~~alr~vLG~-hv~Q~Gs~v~~~~lRfDfsh~~~lt~eel~~IE~~vN~~I~~n~~V~~~~~~~~eA~~~~ 669 (898) T PRK13902 591 RHHTATHILLAAARKVLGD-HVWQAGAQKDTDKARLDITHYKRITREEVKEIERLANRIVMENRPVKVEWMDRNEAEQKY 669 (898) T ss_pred HHHHHHHHHHHHHHHHHCC-CEEECCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECHHHHHHHC T ss_conf 6003888877999987277-568403233798089962488878989999999999999984896279994789997615 Q ss_pred HHCCCHHHHHHHH-HCCCCCCCEEECCCCC-EEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEEECCH Q ss_conf 4167113556654-2034554213025752-0100023552144422233333321110157756420689999983798 Q gi|254781004|r 151 ESKKEAYKVEILE-SIPAKENVTLYRQGEW-FDLCRGPHVRSTGQVKKFFKLMKVAGAYWRGDSNRPMLSRIYGTAWNTQ 228 (652) Q Consensus 151 ~~~~~~~K~~li~-~~~~~~~v~~y~~g~~-~Dlc~GphvpsTg~ik~~fkL~~~sgayw~gd~~~~~l~Riyg~af~~~ 228 (652) .- .|.. ..+.++.|.+...|+| +.+|.|.||.+||.|+. ||+++.++. ..-..||..++=. T Consensus 670 a~-------alf~gG~kygd~VRVV~ig~~s~ElCGGTHV~nTgeIG~-fkIise~si-------~~GVRRIEavtG~-- 732 (898) T PRK13902 670 GF-------RLYQGGVVPGKEIRVVEIEGWDVEACGGTHVKNTGEIGP-IKILKTERI-------QDGVERLEFAAGD-- 732 (898) T ss_pred CH-------HHHCCCCCCCCCEEEEEECCCEEECCCCCCCCCCCCEEE-EEEEEECCE-------ECCCEEEEEEECH-- T ss_conf 40-------332278788873589865780365237987787031343-899985526-------3781699998078-- Q ss_pred HHHHHHHHHHH Q ss_conf 89999999999 Q gi|254781004|r 229 QELTQYLYFLE 239 (652) Q Consensus 229 ~~l~~~~~~~e 239 (652) ..-+|+...+ T Consensus 733 -~A~~~~~~~~ 742 (898) T PRK13902 733 -AAVEYIQELE 742 (898) T ss_pred -HHHHHHHHHH T ss_conf -9999999999 No 51 >COG0013 AlaS Alanyl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=99.56 E-value=2e-13 Score=112.52 Aligned_cols=162 Identities=22% Similarity=0.450 Sum_probs=117.0 Q ss_pred HHHH-HHHHHHHHHHHHHHHHCCCCEEEEECCC-CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHH Q ss_conf 9999-9998999999999997799889972435-8855788408888889899999999999996699804853276785 Q gi|254781004|r 69 SLAV-IRHSCAHIMAEAVQSIWPEVQVAIGPII-EDGFYYDFDKEQPFSSDELAQIEKKMQEIIARDSPFLKQYLSHEKA 146 (652) Q Consensus 69 g~~i-~~hS~ahlL~~Av~~l~p~~kl~igp~i-~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA 146 (652) -.++ -.||++|||.+|++++.+.--..-|--+ .+-.-.||....++|.++|.+||...+++|.+|+|+....++.++| T Consensus 560 R~~~~~nHTATHLLh~ALr~VLG~hV~QaGSlv~~~~lRfDfsH~~~it~Eel~~IE~~vNe~I~~n~~V~~~~m~~~eA 639 (879) T COG0013 560 RRRLMRNHTATHLLHAALRKVLGDHVWQAGSLVDPEKLRFDFSHYKALTAEELKEIERLVNEIIRENLPVKTEEMDLDEA 639 (879) T ss_pred HHHHHHHCHHHHHHHHHHHHHHCCCCEECCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHH T ss_conf 99998627699999999999737321206764677706885317888999999999999999998289414897369999 Q ss_pred HHH--HHHCCCHHHHHHHHHCCCCCCCEEECCCCC-EEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEE Q ss_conf 445--441671135566542034554213025752-01000235521444222333333211101577564206899999 Q gi|254781004|r 147 REL--FESKKEAYKVEILESIPAKENVTLYRQGEW-FDLCRGPHVRSTGQVKKFFKLMKVAGAYWRGDSNRPMLSRIYGT 223 (652) Q Consensus 147 ~~~--F~~~~~~~K~~li~~~~~~~~v~~y~~g~~-~Dlc~GphvpsTg~ik~~fkL~~~sgayw~gd~~~~~l~Riyg~ 223 (652) .+. ....|++| . +.|.+...|+| +.+|.|.||.+||.|+. ||+++.+|- +. -.-||-.+ T Consensus 640 k~~Ga~alfGekY---------~-~~VRVV~ig~~SvElCGGTHv~~Tg~IG~-fkI~~e~~i-----~a--GVrRIeav 701 (879) T COG0013 640 KAKGAMALFGEKY---------G-DEVRVVEIGDFSVELCGGTHVSNTGDIGL-FKIISEEGI-----AA--GVRRIEAV 701 (879) T ss_pred HHHCHHHHCCCCC---------C-CEEEEEEECCCEEECCCCCCCCCCCCEEE-EEEEEECCC-----CC--CEEEEEEE T ss_conf 9719296528867---------9-86899998991566137745677565011-799876413-----06--75899998 Q ss_pred EECCHHHHHHHHHHHHHHHHHHHHHHHHHCCE Q ss_conf 83798899999999997532008875411440 Q gi|254781004|r 224 AWNTQQELTQYLYFLEESEKRDHRKLAREMDL 255 (652) Q Consensus 224 af~~~~~l~~~~~~~eea~~rdHr~lg~~~~l 255 (652) + ....+ +|++..++ .-+.+...|+- T Consensus 702 t--G~~a~-~~~~~~~~----~l~~~a~~lk~ 726 (879) T COG0013 702 T--GEAAL-EYLQEQED----LLKEIASLLKV 726 (879) T ss_pred C--CHHHH-HHHHHHHH----HHHHHHHHHCC T ss_conf 2--78999-99999999----99999998689 No 52 >KOG2509 consensus Probab=99.54 E-value=2.8e-14 Score=118.35 Aligned_cols=294 Identities=18% Similarity=0.256 Sum_probs=208.7 Q ss_pred CCCCCCEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECC---C-CCCEEEEECCHHHHHHHHHHHHHH Q ss_conf 775642068999998379889999999999753200887541144044226---7-874356501078999999999998 Q gi|254781004|r 210 GDSNRPMLSRIYGTAWNTQQELTQYLYFLEESEKRDHRKLAREMDLFHIAE---D-GSGVIFWHRKGWKIFQTLISYMRR 285 (652) Q Consensus 210 gd~~~~~l~Riyg~af~~~~~l~~~~~~~eea~~rdHr~lg~~~~lf~~~~---~-~~G~~~wlP~G~~i~~~ie~~ir~ 285 (652) +|+.+..++|++|-.-.+. +-+.|-.|+..+++.-... . |.-.|+..-.|..+-..|.+|--. T Consensus 131 ~ee~~~~v~~~~~~~~~~~-------------~~~~~~~l~~~~~~~~~~~~a~vsG~r~Yyl~g~~a~LeqALi~yal~ 197 (455) T KOG2509 131 NEEQANVVQRVGGPLEFEF-------------KLSHHDELVRALGFIDLERGAKVSGHRGYYLKGAGAFLEQALINYALD 197 (455) T ss_pred CHHHHHHHHHHCCCCCCCC-------------CCCCHHHHHHHCCCCCCCCHHHCCCCCCEEECCHHHHHHHHHHHHHHH T ss_conf 3235268888638753024-------------665388999860663400010014665158757799999999999999 Q ss_pred -HHHCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHH Q ss_conf -8768963855543244310001121001255542010014567778237743431257777652012222310432000 Q gi|254781004|r 286 -KIKDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAE 364 (652) Q Consensus 286 -~~~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e 364 (652) -.++||..+.+|.|.++++.+..|-...+-+..+.... .. .+-+|--|.+-..+-++++..-.-.+||+++.= T Consensus 198 ~l~~kGy~pl~~P~i~rkeVm~~cg~~~~~d~~~~y~ld-----~~-~~~~LiaTaE~plAa~~~~e~~~~~~lPiK~vg 271 (455) T KOG2509 198 FLNAKGYTPLTTPDILRKEVMQKCGQLPRFDEEQYYVLD-----GG-DEKYLIATAEQPLAAYHRDEWLEEDQLPIKYVG 271 (455) T ss_pred HHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCEEEEEC-----CC-CCCEEEEECCCHHHHHHCCCCCCCCCCCEEEEE T ss_conf 999769824668145369999985657678832178606-----88-642267621110666503640361127601343 Q ss_pred HEEEECCCC-CC--CCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCHHHH Q ss_conf 100012787-64--520001056353344210105376788789999998876655213321000146531125860357 Q gi|254781004|r 365 FGSVYRNEP-SG--SLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDFGFEKIMVKLSTRPEKRVGSDALW 441 (652) Q Consensus 365 ~~~~~R~E~-sg--~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~~g~~~~~ 441 (652) .++|||-|. |. .--||+||.+|+--+....|.|++..+++..++....++|..+|+..-.+...+.. .|.... T Consensus 272 ~S~CfR~EaGs~G~d~~GlyRVHqF~KVE~Fvit~Pe~S~~~~eEmi~~~eef~qsLgip~rvl~m~S~e---Lg~aAa- 347 (455) T KOG2509 272 VSRCFRAEAGSHGKDTKGLYRVHQFEKVEQFVITGPEDSWEMLEEMINNQEEFYQSLGLPYRVLNMPSGE---LGAAAA- 347 (455) T ss_pred HHHHHHHHHHHCCCCCCCCEEEEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCHH---HCCHHH- T ss_conf 0378887760046545551466520025788865853569999999999999999809960673477423---376777- Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCCCCEEE---EECCCEE Q ss_conf 7899999999985021212566752055555545543211002110331001211011102234674167---3179721 Q gi|254781004|r 442 DDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYILKDAIGRDWQCGTIQVDFNLPSRFNAFYV---NSHSEKC 518 (652) Q Consensus 442 ~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~~~d~~gr~~~~~tiq~df~~p~~f~l~y~---~~dg~~~ 518 (652) . +.+++ -+.+|-++|- +..-...-.||+ +.|+++.|- ..||+.. T Consensus 348 -------------k----KyDiE-AWmPgrg~yg--------------El~ScSNCTDyQ-SRRL~IRy~~~k~~~~~~~ 394 (455) T KOG2509 348 -------------K----KYDIE-AWMPGRGAYG--------------ELVSCSNCTDYQ-SRRLGIRYGQKKTNDGEKK 394 (455) T ss_pred -------------H----HCCHH-HHCCCCCCCC--------------CCCCCCCCHHHH-HHHHHHHCCCCCCCCCCCC T ss_conf -------------5----12666-5267655323--------------301014652577-7654202052324677541 Q ss_pred EEEEEEECCCHHHHHHHHHHHHHH----CCCCCCCCCCEEEE--EEEC Q ss_conf 379964034103789999999973----87088733780899--9857 Q gi|254781004|r 519 HPVMIHRAVFGSIERFIGIMIENF----KGNLPLWLSPIQAI--VTTI 560 (652) Q Consensus 519 ~pvmihr~~~Gs~eR~ia~liE~~----~g~~P~wLAP~QV~--Iipi 560 (652) ++.|+.-+. --+.|.|-+|+|.| |+..|.-|-||==. .+|. T Consensus 395 yvHtLN~Ta-cA~~R~l~aiLEnyQ~edGi~VPe~Lr~ym~~~~~ip~ 441 (455) T KOG2509 395 YVHTLNGTA-CATPRALCAILENYQTEDGIEVPEVLRPYMGGQEFIPF 441 (455) T ss_pred EEEECCHHH-HHHHHHHHHHHHHCCCCCCCCCCHHHHHHCCCCCCCCC T ss_conf 554304066-76467887767534677881677767740688640212 No 53 >cd01667 TGS_ThrRS_N TGS _ThrRS_N: ThrRS (threonyl-tRNA Synthetase) is a class II tRNA synthetase that couples threonine to its cognate tRNA. In addition to its catalytic and anticodon-binding domains, ThrRS has an N-terminal TGS domain, named after the ThrRS, GTPase, and SpoT proteins where it occurs. The TGS domain is thought to interact with the tRNA acceptor arm along with an adjacent N-terminal domain. The specific function of TGS is not well understood. Probab=99.53 E-value=2.3e-14 Score=118.97 Aligned_cols=60 Identities=55% Similarity=0.803 Sum_probs=58.1 Q ss_pred EEEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECCEEEC-CCEEEEECC Q ss_conf 089928998874189888999998636110112899998988997745303-670798628 Q gi|254781004|r 5 IKLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLSDPVR-EGSIEIITP 64 (652) Q Consensus 5 i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~~~l~-d~~i~~i~~ 64 (652) |+||||||++++|+.|+|++|||++|++++++++++|+|||+++||++||+ ||+|+|+|+ T Consensus 1 i~ItlpDgs~~~~~~~~T~~eiA~~i~~~l~k~~vaa~vng~l~DL~~~i~~d~~v~~iT~ 61 (61) T cd01667 1 IKITLPDGSVKEFPKGTTPLDIAKSISPGLAKKAVAAKVNGELVDLSRPLEEDCELEIITF 61 (61) T ss_pred CEEECCCCCEEECCCCCCHHHHHHHHCHHHHCCEEEEEECCEEECCCCCCCCCCEEEEEEC T ss_conf 9898899996887899899999998596634137999999999835741269999999849 No 54 >PRK01584 alanyl-tRNA synthetase; Provisional Probab=99.53 E-value=6.4e-14 Score=115.88 Aligned_cols=132 Identities=26% Similarity=0.443 Sum_probs=105.4 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECCC-CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHH-- Q ss_conf 98999999999997799889972435-885578840888888989999999999999669980485327678544544-- Q gi|254781004|r 75 HSCAHIMAEAVQSIWPEVQVAIGPII-EDGFYYDFDKEQPFSSDELAQIEKKMQEIIARDSPFLKQYLSHEKARELFE-- 151 (652) Q Consensus 75 hS~ahlL~~Av~~l~p~~kl~igp~i-~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA~~~F~-- 151 (652) ||++|||.+|++++.++.--.-|-.+ .+.+-+||....++|++++.+||..++++|.+|.|+...++++++|.+... T Consensus 456 HTATHLLh~ALR~VLG~hV~Q~GS~v~~~~lRfDfth~~~lt~eei~~IE~~vN~~I~~~~~V~~~~~~~~eA~~~ga~a 535 (593) T PRK01584 456 HTATHLLHKALQLVLGDHVRQKGSNITAERLRFDFSHPRKMTDDEIKKVEDIVNLQIKNDLSVKKEVMSLEEAKEKGAMA 535 (593) T ss_pred CHHHHHHHHHHHHHCCCCEEECCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECHHHHHHCCCEE T ss_conf 54899999999986287559878545798179996279999999999999999999983896259982499999749926 Q ss_pred HCCCHHHHHHHHHCCCCCCCEEECCCCC-EEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEE Q ss_conf 1671135566542034554213025752-010002355214442223333332111015775642068999998 Q gi|254781004|r 152 SKKEAYKVEILESIPAKENVTLYRQGEW-FDLCRGPHVRSTGQVKKFFKLMKVAGAYWRGDSNRPMLSRIYGTA 224 (652) Q Consensus 152 ~~~~~~K~~li~~~~~~~~v~~y~~g~~-~Dlc~GphvpsTg~ik~~fkL~~~sgayw~gd~~~~~l~Riyg~a 224 (652) -.|++| ++.|.+...|+| +.+|.|.||.|||.|+. ||+++.++. + .-..||.+++ T Consensus 536 lfgekY----------~d~VRVv~i~~~S~ElCGGTHv~~TgeIg~-fkI~~e~si-----~--~GvrRIeAvt 591 (593) T PRK01584 536 LFGEKY----------EDVVSVYEIDGFSIEVCGGPHVENTSELGT-FKIQKEQSS-----S--SGVRRIKAIL 591 (593) T ss_pred ECCCCC----------CCEEEEEEECCEEEECCCCCCCCCHHHCCC-EEEEEECCC-----C--CCCEEEEEEE T ss_conf 228999----------986689988996786578613387733446-799976766-----0--7847899981 No 55 >TIGR00442 hisS histidyl-tRNA synthetase; InterPro: IPR015807 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Histidyl-tRNA synthetase (6.1.1.21 from EC) is an alpha2 dimer that belongs to class IIa. Every completed genome includes a histidyl-tRNA synthetase. Apparent second copies from Bacillus subtilis, Synechocystis sp., and Aquifex aeolicus are slightly shorter, more closely related to each other than to other hisS proteins, and not demonstrated to act as histidyl-tRNA synthetases (see IPR004517 from INTERPRO). The regulatory protein kinase GCN2 of Saccharomyces cerevisiae (YDR283c), and related proteins from other species designated eIF-2 alpha kinase, have a domain closely related to histidyl-tRNA synthetase that may serve to detect and respond to uncharged tRNA(his), an indicator of amino acid starvation, but these regulatory proteins are not orthologous.; GO: 0000166 nucleotide binding, 0004821 histidine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006427 histidyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=99.48 E-value=8.2e-12 Score=101.34 Aligned_cols=366 Identities=17% Similarity=0.177 Sum_probs=222.1 Q ss_pred CCCCEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCC-C-CCCCCCCCCCCEEECCCCCCCCCCEEEEC Q ss_conf 787435650107899999999999887-68963855543244310001-1-21001255542010014567778237743 Q gi|254781004|r 261 DGSGVIFWHRKGWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQ-S-GHWDSYRANMFAVKCADDTIKDLRTFALK 337 (652) Q Consensus 261 ~~~G~~~wlP~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~-S-Gh~~~y~~~mf~~~~~~~~~~~~~~~~Lk 337 (652) .-.|.--|+|.-...++.++..++.-. ++||.+|.||++-..+|+.. . .-.+....+||. ..|+.+++.+|+ T Consensus 6 ~~~G~~d~~p~~~~~~~~~~~~~~~~~~~~g~~~~~~P~~e~~~~~~~~~~~~~~~~~~~~y~-----f~d~~~~~~~lr 80 (446) T TIGR00442 6 KPRGTKDWLPGELVYRQWIEAKLRSVLESYGYSEIRTPIFEYTELFARGIGKSGDEVEKELYT-----FKDKGGRDLALR 80 (446) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHEE-----EECCCCCEEEEC T ss_conf 345520025415789999999999999751772001211444566653025301334444301-----212788414422 Q ss_pred CCCHHHHHHHHHHCCCCH-HHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHH Q ss_conf 431257777652012222-3104320001000127876452000105635334421010537678878999999887665 Q gi|254781004|r 338 PMNCPGHVAVFNHGLKSY-RELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYK 416 (652) Q Consensus 338 Pmncp~h~~i~~~~~~SY-rdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~ 416 (652) |--.......+......+ ...|++++.+|.+||+|..- . --|-|+|.|-+..++-..+ ...-..++.+....+. T Consensus 81 p~~t~~~~r~~~~~~~~~~~~~p~~~~~~g~~~r~e~p~--~-~gr~r~f~~~~~~~~G~~~--~~~~~~~~~~~~~~~~ 155 (446) T TIGR00442 81 PELTAPVVRLVAENKLLLPANKPLRLYYIGPVFRYERPQ--L-KGRYREFWQFGCEVIGSES--PLADAEVLSLAVEGLK 155 (446) T ss_pred CCHHHHHHHHHHHHHHHCCCCCCEEEEEEECHHHCCCHH--H-HCCHHHHHHCCEEEECCCC--CHHHHHHHHHHHHHHH T ss_conf 200368999998754310467642432320101022412--3-0101133211504643665--2036899999999898 Q ss_pred HHCCCC-CEECCCCCCCC-----CCC---------------CHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC Q ss_conf 521332-10001465311-----258---------------603577899999999985021212566752055555545 Q gi|254781004|r 417 DFGFEK-IMVKLSTRPEK-----RVG---------------SDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFY 475 (652) Q Consensus 417 ~~g~~~-~~~~ls~~~~~-----~~g---------------~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afy 475 (652) .+|+.. +.+.+...... +.+ +...|+..+..+...+...........+.....+ +... T Consensus 156 ~~g~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 234 (446) T TIGR00442 156 ALGLEGHFKLKLNHLGILEGILEYRGALEEDVYLTPIKDKLDKDDQERLEEELTNPLRILDEKVDELLELLKLKG-APEI 234 (446) T ss_pred HHCCCCEEEEEECCCHHHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-CHHH T ss_conf 607663278874130014566665211023456656666623011567899876544321222456777763256-2257 Q ss_pred CCCCCCCHHHHCCCCCC----------CCCCC--------------HHHHCCCCCCC---------------EE---EEE Q ss_conf 54321100211033100----------12110--------------11102234674---------------16---731 Q gi|254781004|r 476 GPKFEYILKDAIGRDWQ----------CGTIQ--------------VDFNLPSRFNA---------------FY---VNS 513 (652) Q Consensus 476 gpkid~~~~d~~gr~~~----------~~tiq--------------~df~~p~~f~l---------------~y---~~~ 513 (652) ...++.......+.... +..++ +|+....-|.. .| +.. T Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~rGldyy~g~~~e~~~~~~~~~~~~~~GG~yd~l~~~ 314 (446) T TIGR00442 235 LDYLDELLDESEGHFEGLKDLEELFQLLEDLGIDDLYELNLGLARGLDYYTGTVFEIVTNALGAQGTVCGGGRYDNLVEE 314 (446) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHHCCEEEEEECCCCCCCCCEECCCCHHHHHHH T ss_conf 88888874123578889989999999986505421022051133100111132344310112432100046510125665 Q ss_pred CCCEEEEEEEEECCCHHHHHHHHHHHHHHCC--CCCCCCCCEEEEEEECCHHHHHHHHHHHHHHHHC--CCEEEEECC-- Q ss_conf 7972137996403410378999999997387--0887337808999857966899999999999987--988999758-- Q gi|254781004|r 514 HSEKCHPVMIHRAVFGSIERFIGIMIENFKG--NLPLWLSPIQAIVTTITSSAVEYAQEIANLLKSH--HLSIETDFR-- 587 (652) Q Consensus 514 dg~~~~pvmihr~~~Gs~eR~ia~liE~~~g--~~P~wLAP~QV~Iipi~e~~~eya~~i~~~L~~~--girV~~Ddr-- 587 (652) -|....|-+ |.--+++|++.++-..... .-+.--+...+.+++..++...++-.+...+++. |+.+.++-. T Consensus 315 ~~~~~~~~~---G~~~g~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 391 (446) T TIGR00442 315 FGGPPTPAV---GFAFGLERLINLLDAEKPSFPLAKVVDARVFVLVVGLGEEALPEALRLAEKLREALPGISAELYPKNN 391 (446) T ss_pred HCCCCCCCC---HHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEEECCHHHHHHHHHHHHHHHHHCCCCEEEEEECCC T ss_conf 067545410---02330899999875654310000111110257786064245789999999998733785168852211 Q ss_pred CCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCC-CEEEECHHHHHHH Q ss_conf 9663676665543489889998742532581899988988-6633429999999 Q gi|254781004|r 588 NETINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGST-TTQKVSLLDAIPI 640 (652) Q Consensus 588 ~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~-~~~~i~lee~i~~ 640 (652) ..++.+.++.|+..|.-+.+++|+.|+.++++++++-.++ ++..++.++++.. T Consensus 392 ~~~~~~~~~~a~~~g~~~~~~~G~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~ 445 (446) T TIGR00442 392 GRKLKKQFSYADKLGARFAVVLGEEELKEGKVTVKDLETGFEQTTLALDELIDI 445 (446) T ss_pred CHHHHHHHHHHHHCCCEEEEEECCCCCCCCCEEEHHHHHHHHHHHHHHHHHHHC T ss_conf 001345554564348548998453100134122012231245677757877512 No 56 >PRK00960 seryl-tRNA synthetase; Provisional Probab=99.28 E-value=2.4e-11 Score=98.11 Aligned_cols=249 Identities=19% Similarity=0.236 Sum_probs=161.7 Q ss_pred CCCCEEEEECCHHHHHHHHHHHHHHH-HHC-CCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCC-------- Q ss_conf 78743565010789999999999988-768-96385554324431000112100125554201001456777-------- Q gi|254781004|r 261 DGSGVIFWHRKGWKIFQTLISYMRRK-IKD-DYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKD-------- 330 (652) Q Consensus 261 ~~~G~~~wlP~G~~i~~~ie~~ir~~-~~~-G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~-------- 330 (652) .|-|..|+-|.-+++.+..++++-++ .+. ||.|...|-|.|-+.|.+|||++.--++||.+--. + |+ T Consensus 210 ~gRGQW~y~P~~t~LFRafE~ii~Ee~i~~lGf~e~lFPKLiplevm~kmryleGlp~gmYYVCpP--K-RDPe~feef~ 286 (516) T PRK00960 210 PGRGQWFYTPPMAKLFRAFEEIILEECIEKLGFDECLFPKLIPLDVMYKMRYLEGLPEGMYYVCPP--K-RDPEMFEEFV 286 (516) T ss_pred CCCCCEEECCHHHHHHHHHHHHHHHHHHHHCCCHHHHCCCCCHHHHHHHHCCCCCCCCCCEEECCC--C-CCHHHHHHHH T ss_conf 888742437758899999999999997432376100013455299997733012699872786699--8-9868999997 Q ss_pred -------------------CCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHH-HEEEECCCCCCCCEEEEECCCCCCCC Q ss_conf -------------------8237743431257777652012222310432000-10001278764520001056353344 Q gi|254781004|r 331 -------------------LRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAE-FGSVYRNEPSGSLHGLMRVRGFTQDD 390 (652) Q Consensus 331 -------------------~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e-~~~~~R~E~sg~l~GL~R~ReF~~~D 390 (652) +--|+|-|.-||.-...++...-+--+||++++. -||.||||--| -+||-||-||+--+ T Consensus 287 ~e~~i~~EiPi~~lKe~l~~P~yvlapAQCePFy~fl~~e~v~~d~lPik~FDrSGwtyRwE~GG-~~GleRV~EF~RiE 365 (516) T PRK00960 287 NEMMIKKEIPIDLLKEKLRDPGYVLAPAQCEPFYQFLDHELVDVDELPIKFFDKSGWTYRWEGGG-AKGLDRVNEFLRIE 365 (516) T ss_pred HHHEEECCCCHHHHHHHHCCCCCEEEECCCCCHHHHHCCCCCCCCCCCEEEEECCCCEEEECCCC-CCCHHHHHHHHEEE T ss_conf 77334053668999986128861351024993788736772565558657885588626404787-43235550101477 Q ss_pred CEEECCHHHHHHHHHHH-HHHHHHHHHHHCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 21010537678878999-99988766552133210001465311258603577899999999985021212566752055 Q gi|254781004|r 391 AHVFCTKEQMFNECLKI-HNLIISIYKDFGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLL 469 (652) Q Consensus 391 ah~F~~~eq~~~e~~~~-~~~~~~v~~~~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~ 469 (652) .-=--+|+ ..++.... ......+-..+.++ +.......| |-.+...+. .. ++++. T Consensus 366 ~VW~gtpe-~ve~~r~~~~~y~~~~ae~LdLE-~w~ev~ddP---------fylegr~~e----~r------~iE~p--- 421 (516) T PRK00960 366 CVWMGTPE-FVEEVRDDTLKYAHILAEKLDLE-YWTEVGDDP---------FYLEGRKLE----DR------GIEFP--- 421 (516) T ss_pred EEEECCHH-HHHHHHHHHHHHHHHHHHHHCHH-HHEEECCCC---------HHHCCCCCC----CC------CCCCC--- T ss_conf 78846889-99999888999999988774324-422305675---------032256544----55------64368--- Q ss_pred CCCCCCCCCCCCCHHHHC---CCCCCCC--CCCHHHHCCCCCCCEEEEECCCEEEEEEEEECCCH-HHHHHHHHHHHHHC Q ss_conf 555545543211002110---3310012--11011102234674167317972137996403410-37899999999738 Q gi|254781004|r 470 GEGAFYGPKFEYILKDAI---GRDWQCG--TIQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFG-SIERFIGIMIENFK 543 (652) Q Consensus 470 g~~afygpkid~~~~d~~---gr~~~~~--tiq~df~~p~~f~l~y~~~dg~~~~pvmihr~~~G-s~eR~ia~liE~~~ 543 (652) -.|+.|..+.--- .+.|... ..-.+.++|+-|+++- ..|.. +..||-| +++|.++..+-|+| T Consensus 422 -----~vp~ye~~~~LP~~~~e~k~v~v~s~nVhG~hf~eGF~VK~--~~g~~-----~WtGCtG~GL~RWv~gFLAQ~G 489 (516) T PRK00960 422 -----DVPKYEMRLWLPHIKEERKGVAVTSANIHGTHFVEGFNVKD--YKGRK-----LWTGCTGVGLTRWVIGFLAQKG 489 (516) T ss_pred -----CCCCEEEEEECCCCCCCCCEEEEEEEEECCCCCCCCCCCEE--CCCCE-----EECCCCCCCHHHHHHHHHHHCC T ss_conf -----88854679965764776772688886314753567652200--35873-----1137764457899999998707 Q ss_pred C---CCCCC Q ss_conf 7---08873 Q gi|254781004|r 544 G---NLPLW 549 (652) Q Consensus 544 g---~~P~w 549 (652) - .||.- T Consensus 490 fd~~~WPe~ 498 (516) T PRK00960 490 FDPDNWPEE 498 (516) T ss_pred CCCCCCCHH T ss_conf 981216499 No 57 >TIGR00414 serS seryl-tRNA synthetase; InterPro: IPR002317 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Seryl-tRNA synthetase (6.1.1.11 from EC) exists as monomer and belongs to class IIa .; GO: 0000166 nucleotide binding, 0004828 serine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006434 seryl-tRNA aminoacylation, 0005737 cytoplasm. Probab=99.20 E-value=1.3e-10 Score=92.98 Aligned_cols=283 Identities=20% Similarity=0.356 Sum_probs=209.9 Q ss_pred CCHHHHHHHHH--HHHHHHHH--HHHHHHHHCCEEEECC----CCCCEEEEECCHHHHHHHHHHHHHH-HH-HCCCEEEE Q ss_conf 79889999999--99975320--0887541144044226----7874356501078999999999998-87-68963855 Q gi|254781004|r 226 NTQQELTQYLY--FLEESEKR--DHRKLAREMDLFHIAE----DGSGVIFWHRKGWKIFQTLISYMRR-KI-KDDYEEIN 295 (652) Q Consensus 226 ~~~~~l~~~~~--~~eea~~r--dHr~lg~~~~lf~~~~----~~~G~~~wlP~G~~i~~~ie~~ir~-~~-~~G~~eV~ 295 (652) ..++....... .....+-. +|..+|..+++.-+.. .|+-.+++.-.|.++.+.+.+++-. .. +.||+++. T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~h~~~~~~~~~~d~~~~~~~~g~~~~~~~~~~~~~~~al~~~~l~~~~~~~g~~~~~ 226 (460) T TIGR00414 147 YDEEVKKWGTPPLPSFDFKPKDLPHWELGEKLGLLDFERAVKLAGSRFYYLKGDGAKLERALINFMLDLLREKNGYSELY 226 (460) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCCCCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEC T ss_conf 01356664100122001331112156788770651013333441010121321367899999999998644322640101 Q ss_pred CCCCCCCCCCCCCC---CCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCH--HHCCHHHHHHEEEEC Q ss_conf 54324431000112---1001255542010014567778237743431257777652012222--310432000100012 Q gi|254781004|r 296 TPQVLDQHLWQQSG---HWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKSY--RELPVRLAEFGSVYR 370 (652) Q Consensus 296 tP~l~~~~Lw~~SG---h~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SY--rdLPlrl~e~~~~~R 370 (652) +|.+.+.++-..+| ....+.++.|.... .+++.|.|-|+.+-....++++.+-+- .+||+++.-++.+|| T Consensus 227 ~p~~~~~~~~~~~g~~~~~~~~~~~~~~~~~-----~~~~~~~l~~t~e~~~~~~~~~~~~~~~~~~lp~~~~~~~~cfr 301 (460) T TIGR00414 227 PPYLVNEELLDGTGFNPQLPKFEEDLFKLED-----TDDEKLYLIPTAEVPLTNLHRDEILDLPSEELPIKYTGYSPCFR 301 (460) T ss_pred CCHHHHHHHHHHCCCCCCCCCHHHHHHHHCC-----CCCCEEEEEECCHHHHHHHHHHHHCCCCCCCCCEEHCCCCHHHH T ss_conf 4035423443102456556530234432126-----77750465201000122222000102652103400000220233 Q ss_pred CCCCC---CCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCC----HHHHHH Q ss_conf 78764---52000105635334421010537678878999999887665521332100014653112586----035778 Q gi|254781004|r 371 NEPSG---SLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDFGFEKIMVKLSTRPEKRVGS----DALWDD 443 (652) Q Consensus 371 ~E~sg---~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~~g~----~~~~~~ 443 (652) .|... ...|+.|+.+|..-+...||.|++...+...+......++..+++....+.+.+++.++... .+.|.. T Consensus 302 ~eag~~g~~~~g~~r~h~f~k~e~~~~~~p~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~gd~g~~~~~~yd~e~W~p 381 (460) T TIGR00414 302 SEAGSYGKDTKGLIRVHQFNKVELVKFCKPEESWEELEELLSDAEKILQELELPYRVLNLCTGDLGFGAAKKYDLEVWLP 381 (460) T ss_pred HHCCCCCCCCHHHHHHHHCCCEEEEEEECCHHHHHHHHHHHHHHHHHHHHHCCCHHHHEECCCCCCCCCCCCEEEEEEEE T ss_conf 32134554100012111003102456635401489999998789999997437511100012434642011010013410 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEECCCEEEEEEE Q ss_conf 99999999985021212566752055555545543211002110331001211011102234674167317972137996 Q gi|254781004|r 444 AENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYILKDAIGRDWQCGTIQVDFNLPSRFNAFYVNSHSEKCHPVMI 523 (652) Q Consensus 444 ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~~~d~~gr~~~~~tiq~df~~p~~f~l~y~~~dg~~~~pvmi 523 (652) +.. | +.+ +..-.|+. ..|| .+.+.++.|.+.++.....-.+ T Consensus 382 ~~~-----------------------g-g~y-------------~e~~~~~~-~~d~-~~~~~~~~~~~~~~~~~~~~~~ 422 (460) T TIGR00414 382 GQN-----------------------G-GTY-------------REISSCSN-CTDF-QARRLNIRYKDKGGGKRENEYV 422 (460) T ss_pred CCC-----------------------C-CCC-------------CEECCCCC-CHHH-HHHHHHHHHHHCCCCCCCCCEE T ss_conf 467-----------------------8-741-------------10001233-1145-6665443332203665321000 Q ss_pred EECCCH---HHHHHHHHHHHHHC---C--CCCCCCCCE Q ss_conf 403410---37899999999738---7--088733780 Q gi|254781004|r 524 HRAVFG---SIERFIGIMIENFK---G--NLPLWLSPI 553 (652) Q Consensus 524 hr~~~G---s~eR~ia~liE~~~---g--~~P~wLAP~ 553 (652) | .+-| .+.|.+++++|+|. | ..|..|.|+ T Consensus 423 ~-~lng~~~~~~r~~~~~~en~~~~~g~~~~p~~l~~~ 459 (460) T TIGR00414 423 H-TLNGTALAVGRTLVAILENYQDEDGSVEVPEVLRKY 459 (460) T ss_pred E-ECCCCHHHHHHHHHHHHHHHCCCCCCEECHHHHHCC T ss_conf 1-003312345678887776410557745412554113 No 58 >PRK12421 ATP phosphoribosyltransferase regulatory subunit; Provisional Probab=99.03 E-value=8.2e-08 Score=73.75 Aligned_cols=152 Identities=18% Similarity=0.185 Sum_probs=114.0 Q ss_pred CCEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCC-CCCCCCCCCCCCEEECCCCCCC-CCCEEEECCC Q ss_conf 7435650107899999999999887-68963855543244310001-1210012555420100145677-7823774343 Q gi|254781004|r 263 SGVIFWHRKGWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQ-SGHWDSYRANMFAVKCADDTIK-DLRTFALKPM 339 (652) Q Consensus 263 ~G~~~wlP~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~-SGh~~~y~~~mf~~~~~~~~~~-~~~~~~LkPm 339 (652) .|+-=+||--...++.|.+-+.+-+ ++||++|.||.+-..+.... +|+ .-...||.+. |. .++.++|||= T Consensus 11 ~G~~D~lP~ea~~~~~l~~~l~~~f~~~Gy~~V~tP~lE~~e~l~~~~g~--~~~~~~f~~~-----D~~sGr~l~LRpD 83 (391) T PRK12421 11 DGVADLLPEEAQKIERLRRRLLDLFASRGYQLVMPPFIEYLESLLTGAGQ--DLDLQTFKLI-----DQLSGRLMGVRAD 83 (391) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHCCCCCC--CCCEEEEEEE-----ECCCCCEEEECCC T ss_conf 99875899999999999999999999869976216760728785656687--3414799996-----1799978655653 Q ss_pred CHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHH-HHHHHHHHHHHHHHHHHHHH Q ss_conf 12577776520122223104320001000127876452000105635334421010537-67887899999988766552 Q gi|254781004|r 340 NCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKE-QMFNECLKIHNLIISIYKDF 418 (652) Q Consensus 340 ncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~e-q~~~e~~~~~~~~~~v~~~~ 418 (652) --|..+-+..+. ..++.|.|++.+|.+||+|.. +.-|.|||+|-++.+|=..+ ++. ..++.++.+.+..+ T Consensus 84 ~T~~vARi~a~~--~~~~~p~Rl~Y~G~V~R~~~~----~~gr~Ref~Q~G~EliG~~~~~aD---~EvI~l~~~~L~~l 154 (391) T PRK12421 84 ITPQVARIDAHL--LNREGVARYCYAGSVLHTLPQ----GLNASRAPLQLGAELYGHAGIEAD---IEIIDLMLGLLRNA 154 (391) T ss_pred CCHHHHHHHHHH--CCCCCCEEEEEECEEEECCCC----CCCCCCCCEECCEEEECCCCHHHH---HHHHHHHHHHHHHC T ss_conf 206889999874--046886579996517751688----888766521505898569987989---99999999999976 Q ss_pred CCCCCEECCCCC Q ss_conf 133210001465 Q gi|254781004|r 419 GFEKIMVKLSTR 430 (652) Q Consensus 419 g~~~~~~~ls~~ 430 (652) |+..+.+.++.. T Consensus 155 Gl~~~~l~lgh~ 166 (391) T PRK12421 155 GVEGLHLDLGHV 166 (391) T ss_pred CCCCEEEEECCH T ss_conf 998069996874 No 59 >KOG0188 consensus Probab=99.00 E-value=7.6e-09 Score=80.89 Aligned_cols=184 Identities=17% Similarity=0.370 Sum_probs=125.2 Q ss_pred HHH-HHHHHHHHHHHHHHHCCCCEEEEECCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHH Q ss_conf 999-9989999999999977998899724358855788408888889899999999999996699804853276785445 Q gi|254781004|r 71 AVI-RHSCAHIMAEAVQSIWPEVQVAIGPIIEDGFYYDFDKEQPFSSDELAQIEKKMQEIIARDSPFLKQYLSHEKAREL 149 (652) Q Consensus 71 ~i~-~hS~ahlL~~Av~~l~p~~kl~igp~i~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA~~~ 149 (652) .+. .||++|||..|++++++++.-.-...-.+-+-+||....++|.++|.++|..|++.|.++.++.-.+++..+|.++ T Consensus 566 ~im~nHTaTHlLN~aLr~~l~~t~QkGSlV~pdklRfDf~~k~~lt~eql~~vE~~~ne~I~~~~~V~~~e~~l~~ak~i 645 (895) T KOG0188 566 LIMRNHTATHLLNFALRQVLKGTDQKGSLVAPDKLRFDFSTKGALTKEQLKKVEDKINEFIQKNAPVYAKELPLAEAKKI 645 (895) T ss_pred HHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHCC T ss_conf 77631409999999999862652456664575670564236777899999999999999986388569886578887327 Q ss_pred HHHCCCHHHHHHHHHCCCCCCCEEECCCC----CEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEEE Q ss_conf 44167113556654203455421302575----20100023552144422233333321110157756420689999983 Q gi|254781004|r 150 FESKKEAYKVEILESIPAKENVTLYRQGE----WFDLCRGPHVRSTGQVKKFFKLMKVAGAYWRGDSNRPMLSRIYGTAW 225 (652) Q Consensus 150 F~~~~~~~K~~li~~~~~~~~v~~y~~g~----~~Dlc~GphvpsTg~ik~~fkL~~~sgayw~gd~~~~~l~Riyg~af 225 (652) =.-.-.. -+.++ +.|.+...|+ -+.+|.|.|+.+||.|+. |-+.+..| -++-..||-+++= T Consensus 646 ~glr~v~-----~E~YP--dpVRVVsvg~~~~~svE~CgGTH~~~t~~i~~-fvI~se~~-------iakgirRivavTG 710 (895) T KOG0188 646 PGLRAVF-----DEVYP--DPVRVVSVGAGQLTSVEFCGGTHLTNTSHIGD-FVIVSEEG-------IAKGIRRIVAVTG 710 (895) T ss_pred CCCCCCC-----CCCCC--CCEEEEEECCCCCCCEEECCCCCCCCCCCCCE-EEEECHHH-------HHHCEEEEEEECC T ss_conf 7710025-----77458--86489983366543101027744466555442-89963044-------4105067878336 Q ss_pred CCHHHHHHHHHHHHH-------HHHHHHHHHHHHCCEEEECCCCCCEEEEECCHHHHH Q ss_conf 798899999999997-------532008875411440442267874356501078999 Q gi|254781004|r 226 NTQQELTQYLYFLEE-------SEKRDHRKLAREMDLFHIAEDGSGVIFWHRKGWKIF 276 (652) Q Consensus 226 ~~~~~l~~~~~~~ee-------a~~rdHr~lg~~~~lf~~~~~~~G~~~wlP~G~~i~ 276 (652) +...+...+-+-+.+ +..++-.+||..++ .+-+|.|--.-.+.+ T Consensus 711 ~eA~~a~~~a~~~~~~~~~~~~~~~kei~~l~~~~~-------~a~iP~~~k~~~~~~ 761 (895) T KOG0188 711 KEAQEALRLAEDLDKDLRKTDKALEKEIQELGENLD-------HAVIPYWQKDEIREL 761 (895) T ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHC-------CCCCHHHHHHHHHHH T ss_conf 889999998887876540265256899999998631-------453507759999999 No 60 >COG2872 Predicted metal-dependent hydrolases related to alanyl-tRNA synthetase HxxxH domain [General function prediction only] Probab=98.99 E-value=1.7e-09 Score=85.34 Aligned_cols=138 Identities=24% Similarity=0.389 Sum_probs=104.0 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHH Q ss_conf 99999899999999999779988997243588557884088888898999999999999966998048532767854454 Q gi|254781004|r 71 AVIRHSCAHIMAEAVQSIWPEVQVAIGPIIEDGFYYDFDKEQPFSSDELAQIEKKMQEIIARDSPFLKQYLSHEKARELF 150 (652) Q Consensus 71 ~i~~hS~ahlL~~Av~~l~p~~kl~igp~i~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA~~~F 150 (652) ..=.||+.|||..++...|+ +..+.+...++-=+-||+.+. +++++..+++..++++++|+|+.-..++++||.+++ T Consensus 95 ~mr~HTa~Hlls~~~~~~~~-~~~~g~~i~~d~~~iD~~~e~--~~~~~~~v~~~~ne~v~~~~~v~~~~i~~EE~~~~p 171 (241) T COG2872 95 HMRMHTALHLLSAVLYKVYG-ALTTGFEIGEDYARIDFDGED--TEDEIEEVEALANELVKENLPVIIYFIPREEAEKLP 171 (241) T ss_pred HHHHHHHHHHHHHHHHCCCC-CEEECCEECCCCCCCCCCCCC--CHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHHHCC T ss_conf 87560799999987501246-313112003663020246663--055699999999899875973024552789986484 Q ss_pred HHCCCHHHHHHHHHCCCCC-CCEEECCCCC-EEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEE Q ss_conf 4167113556654203455-4213025752-010002355214442223333332111015775642068999998 Q gi|254781004|r 151 ESKKEAYKVEILESIPAKE-NVTLYRQGEW-FDLCRGPHVRSTGQVKKFFKLMKVAGAYWRGDSNRPMLSRIYGTA 224 (652) Q Consensus 151 ~~~~~~~K~~li~~~~~~~-~v~~y~~g~~-~Dlc~GphvpsTg~ik~~fkL~~~sgayw~gd~~~~~l~Riyg~a 224 (652) .... .+. ...+... .+.+..++++ +--|.|+||.+|+.|+. +|++++. .+.+.+.|||=++ T Consensus 172 ~~~~--~~~---~~~~~~~g~iRiV~I~did~qpCgGtHVk~T~EIG~-Iki~k~e-------~k~k~~~Rv~f~~ 234 (241) T COG2872 172 GLVK--LKN---KVPPDVEGKIRIVEIGDIDVQPCGGTHVKNTGEIGE-IKILKTE-------KKGKGNRRVYFTL 234 (241) T ss_pred CHHH--HCC---CCCCCCCCEEEEEEECCEECCCCCCCCCCCCCEEEE-EEEEEEE-------EECCCEEEEEEEE T ss_conf 7102--324---588655651899998990316788852288620677-9999999-------8057248999993 No 61 >pfam02824 TGS TGS domain. The TGS domain is named after ThrRS, GTPase, and SpoT. Interestingly, TGS domain was detected also at the amino terminus of the uridine kinase from the spirochaete Treponema pallidum (but not any other organism, including the related spirochaete Borrelia burgdorferi). TGS is a small domain that consists of ~50 amino acid residues and is predicted to possess a predominantly beta-sheet structure. There is no direct information on the functions of the TGS domain, but its presence in two types of regulatory proteins (the GTPases and guanosine polyphosphate phosphohydrolases/synthetases) suggests a ligand (most likely nucleotide)-binding, regulatory role. Probab=98.96 E-value=1.9e-09 Score=85.07 Aligned_cols=59 Identities=31% Similarity=0.528 Sum_probs=55.9 Q ss_pred EEEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECCEEECC-CEEEEEC Q ss_conf 0899289988741898889999986361101128999989889977453036-7079862 Q gi|254781004|r 5 IKLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLSDPVRE-GSIEIIT 63 (652) Q Consensus 5 i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~~~l~d-~~i~~i~ 63 (652) |.|.+|||+.+++|.|+|+.|+|..|+.++++.+++|+|||+++||+++|+| +.|+++| T Consensus 1 i~V~tP~G~~~~lp~GaT~~D~A~~Ih~~lg~~~~~A~VnG~~v~l~~~L~~gd~V~IiT 60 (60) T pfam02824 1 IRVYTPDGKVPELPRGSTPEDFAYAIHTDLGKKFIGAKVNGQRVGLDHVLEDGDVVEIVT 60 (60) T ss_pred CEEECCCCCEEECCCCCCHHHHHHHHCHHHHHEEEEEEECCEECCCCCCCCCCCEEEEEC T ss_conf 999899998687899896999998869786411767899988876211248999999879 No 62 >cd00773 HisRS-like_core Class II Histidinyl-tRNA synthetase (HisRS)-like catalytic core domain. HisRS is a homodimer. It is responsible for the attachment of histidine to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs. This domain is also found at the C-terminus of eukaryotic GCN2 protein kinase and at the N-terminus of the ATP phosphoribosyltransferase accessory subunit, HisZ. HisZ along with HisG catalyze the first reaction in histidine biosynthesis. HisZ is found only in a subset of bacteria and differs from HisRS in lacking a C-terminal anti-codon binding domain. Probab=98.96 E-value=6.4e-09 Score=81.41 Aligned_cols=144 Identities=25% Similarity=0.348 Sum_probs=107.9 Q ss_pred HHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCC-CCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHH Q ss_conf 899999999999887-68963855543244310001-1210012555420100145677782377434312577776520 Q gi|254781004|r 273 WKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQ-SGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNH 350 (652) Q Consensus 273 ~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~-SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~ 350 (652) .++++.|++-+++-+ ++||++|.||.+-+.+++.. +| +...++||.+. |..++.++|||=--+..+-++.+ T Consensus 2 ~~~r~~i~~~l~~~f~~~Gy~~I~~P~lE~~d~~~~~~~--~~~~~~~~~f~-----D~~g~~l~LRpD~T~~iaR~~~~ 74 (261) T cd00773 2 AALRRYIEDTLREVFERYGYEEIDTPVFEYTELFLRKSG--DEVSKEMYRFK-----DKGGRDLALRPDLTAPVARAVAE 74 (261) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHCCCCC--CCCHHCEEEEE-----CCCCCEEEECCCCCHHHHHHHHH T ss_conf 689999999999999986997866776452767524465--31021169999-----79999897678662799999997 Q ss_pred CCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCC Q ss_conf 12222310432000100012787645200010563533442101053767887899999988766552133210001465 Q gi|254781004|r 351 GLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDFGFEKIMVKLSTR 430 (652) Q Consensus 351 ~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~ 430 (652) ...+ ..+|+|++.+|.+||++.++ .-|-|||+|-++.+|-...- ..-..++.+..++++.+|+..+.+.++.. T Consensus 75 ~~~~-~~~p~r~~Y~g~VfR~~~~~----~g~~rE~~Q~G~EiiG~~~~--~ad~Eii~l~~~~l~~lg~~~~~i~i~~~ 147 (261) T cd00773 75 NLLS-LPLPLKLYYIGPVFRYERPQ----KGRYREFYQVGVEIIGSDSP--LADAEVIALAVEILEALGLKDFQIKINHR 147 (261) T ss_pred HHCC-CCCCEEEEEECCEEEECCCC----CCCCCCCCCEEEEEECCCCH--HHHHHHHHHHHHHHHHCCCCCEEEEEECH T ss_conf 4153-78885899973189976899----99844322146999728975--88999999999999976998649998088 No 63 >PRK12293 hisZ ATP phosphoribosyltransferase regulatory subunit; Provisional Probab=98.91 E-value=1.1e-08 Score=79.72 Aligned_cols=139 Identities=15% Similarity=0.195 Sum_probs=107.1 Q ss_pred CCCCCCEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEEC Q ss_conf 26787435650107899999999999887-68963855543244310001121001255542010014567778237743 Q gi|254781004|r 259 AEDGSGVIFWHRKGWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALK 337 (652) Q Consensus 259 ~~~~~G~~~wlP~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~Lk 337 (652) |+.=.|+--|+|.=..+++.|++-+++-. ++||+||.||++=-.+ ..+ +...++||.+ .|+.++.++|| T Consensus 5 ~~lP~GtrD~~p~~~~~~~~Ie~~~~~~f~~~Gy~eI~TP~~E~~e---~~~--~~~~kemy~f-----~D~~gr~l~LR 74 (281) T PRK12293 5 HEIPQGSRLYFGKSAKLKRDIENVASEILYKEGFEEIVTPSFSYHQ---HLS--VADEKELLRF-----SDEKNHEISLR 74 (281) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEE---CCC--CCCCCEEEEE-----ECCCCCEEEEC T ss_conf 7899978166819999999999999999998599485378568644---346--6563017999-----87999879977 Q ss_pred CCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 43125777765201222231043200010001278764520001056353344210105376788789999998876655 Q gi|254781004|r 338 PMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKD 417 (652) Q Consensus 338 Pmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~ 417 (652) |=.-+..+-++.++.. -...|+|+|-+|.+||+|. |||+|-...++=.++- ..++.+...+++. T Consensus 75 Pd~Ta~iaR~v~~~l~-~~~~p~rl~Y~g~vFRye~----------rEf~Q~GvEliG~~s~-----~Evi~la~~~l~~ 138 (281) T PRK12293 75 ADSTVDVVRIITKRLG-RSTEHKKWFYIQPVFRYPT----------TEIYQIGAEIIGESDL-----SKVLNIAAEIFNE 138 (281) T ss_pred CCCCHHHHHHHHHCCC-CCCCCEEEEEECCEEECCC----------CCCEEECEEEECCCCH-----HHHHHHHHHHHHH T ss_conf 8887899999996554-4689757998756353467----------7746847488779999-----9999999999997 Q ss_pred HCCCCC Q ss_conf 213321 Q gi|254781004|r 418 FGFEKI 423 (652) Q Consensus 418 ~g~~~~ 423 (652) +|+... T Consensus 139 lgl~~~ 144 (281) T PRK12293 139 LEIEPV 144 (281) T ss_pred CCCCCE T ss_conf 699624 No 64 >cd01616 TGS The TGS domain, named after the ThrRS, GTPase, and SpoT/RelA proteins where it occurs, is structurally similar to ubiquitin. TGS is a small domain of about 50 amino acid residues with a predominantly beta-sheet structure. There is no direct information on the function of the TGS domain, but its presence in two types of regulatory proteins (the GTPases and guanosine polyphosphate phosphohydrolases/synthetases) suggests a ligand (most likely nucleotide)-binding, regulatory role. Probab=98.87 E-value=6.5e-09 Score=81.34 Aligned_cols=59 Identities=32% Similarity=0.508 Sum_probs=55.3 Q ss_pred EEEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECCEEECC-CEEEEEC Q ss_conf 0899289988741898889999986361101128999989889977453036-7079862 Q gi|254781004|r 5 IKLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLSDPVRE-GSIEIIT 63 (652) Q Consensus 5 i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~~~l~d-~~i~~i~ 63 (652) |-|-.|||+++++|.|+|+.|+|..|+.++++.+++|+|||+++||+++|++ +.|+++| T Consensus 1 i~vftP~G~~~~lP~GaT~~D~A~~Iht~lg~~~~~a~Vng~~~~l~~~L~~gd~V~IiT 60 (60) T cd01616 1 MIIFTPDGSAVELPKGATAMDFALKIHTDLGKGFIGALVNGQLVDLSYTLQDGDTVSIVT 60 (60) T ss_pred CEEECCCCCEEECCCCCCHHHHHHHHCCCHHHEEEEEEECCEECCCCCCCCCCCEEEEEC T ss_conf 999899998798899898999999876361545999999999965166999999999979 No 65 >PRK12295 hisZ ATP phosphoribosyltransferase regulatory subunit; Provisional Probab=98.85 E-value=1.3e-08 Score=79.25 Aligned_cols=135 Identities=16% Similarity=0.209 Sum_probs=96.6 Q ss_pred HHHHHHHHHHHH-HCCCEEEECCCCCCCCCCC-CCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCC Q ss_conf 999999999887-6896385554324431000-11210012555420100145677782377434312577776520122 Q gi|254781004|r 276 FQTLISYMRRKI-KDDYEEINTPQVLDQHLWQ-QSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLK 353 (652) Q Consensus 276 ~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~-~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~ 353 (652) ++++.+-++.-+ .+||++|.||.+-..+++. .||- .....||.+. |..++.++|||=--+..+-++.. T Consensus 7 ~~~~~~~i~~~f~~~Gy~~I~tP~lE~~d~~~~~~Ge--~~~~~~y~f~-----D~~g~~l~LRpDlT~piaR~~~~--- 76 (373) T PRK12295 7 SAALAEALLASFEAAGAVRVDPPILQPAEPFLDLSGE--DIRRRIFVTS-----DENGEELCLRPDFTIPVCRRHLA--- 76 (373) T ss_pred HHHHHHHHHHHHHHCCCEEECCCCCCCHHHHCCCCCC--HHHEEEEEEE-----CCCCCEEEEECCCCHHHHHHHHH--- T ss_conf 9999999999999869968568760417764143563--3110048899-----89989899817888999999998--- Q ss_pred CHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHH--HHHHHHHHHHHHHHHHHHHHCCCCCEECCCCC Q ss_conf 223104320001000127876452000105635334421010537--67887899999988766552133210001465 Q gi|254781004|r 354 SYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKE--QMFNECLKIHNLIISIYKDFGFEKIMVKLSTR 430 (652) Q Consensus 354 SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~e--q~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~ 430 (652) +....|+|+|.+|.+||+. +| |.|||+|-++.+|=..+ .+.. .++.+..+.+..+|+..+.+.++.. T Consensus 77 ~~~~~p~R~~Y~G~VfR~q-~g------r~rEf~Q~GvEiiG~~~~~~aDa---Evi~la~~~l~~lgl~~~~i~ig~~ 145 (373) T PRK12295 77 SNAGEPARYSYLGEVFRQR-RD------RASEFLQAGIESFGRADPAAADA---EVLALALEALAALGPADLEIRLGDV 145 (373) T ss_pred HCCCCCEEEEEECCEEECC-CC------CCCCEEEEEEEEECCCCCCCCCH---HHHHHHHHHHHHCCCCCEEEEECCH T ss_conf 3899980078773322457-99------98851670157857898210649---9999999999975997639996678 No 66 >smart00863 tRNA_SAD Threonyl and Alanyl tRNA synthetase second additional domain. The catalytically active form of threonyl/alanyl tRNA synthetase is a dimer. Within the tRNA synthetase class II dimer, the bound tRNA interacts with both monomers making specific interactions with the catalytic domain, the C-terminal domain, and this SAD domain (the second additional domain). The second additional domain is comprised of a pair of perpendicularly orientated antiparallel beta sheets, of four and three strands, respectively, that surround a central alpha helix that forms the core of the domain. Probab=98.49 E-value=2.1e-08 Score=77.80 Aligned_cols=43 Identities=47% Similarity=1.019 Sum_probs=39.5 Q ss_pred CEEECCCCC-EEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEE Q ss_conf 213025752-010002355214442223333332111015775642068999 Q gi|254781004|r 171 VTLYRQGEW-FDLCRGPHVRSTGQVKKFFKLMKVAGAYWRGDSNRPMLSRIY 221 (652) Q Consensus 171 v~~y~~g~~-~Dlc~GphvpsTg~ik~~fkL~~~sgayw~gd~~~~~l~Riy 221 (652) +++|++|++ +|+|.||||+||+.+++ |||++++++||+ +|||| T Consensus 1 vrvv~i~~~~~~~C~GtHv~~T~~ig~-~ki~~~~~~~~~-------~~RI~ 44 (44) T smart00863 1 VRVVSIGDFDVELCGGTHVPNTGEIGA-FKILSVSGAYWG-------LQRIY 44 (44) T ss_pred CEEEEECCEEEECCCCCCCCCCCEEEE-EEEEEECHHCCC-------CEECC T ss_conf 979999997987878966367062436-999995621328-------59249 No 67 >pfam07973 tRNA_SAD Threonyl and Alanyl tRNA synthetase second additional domain. The catalytically active from of threonyl/alanyl tRNA synthetase is a dimer. Within the tRNA synthetase class II dimer, the bound tRNA interacts with both monomers making specific interactions with the catalytic domain, the C-terminal domain, and this domain (the second additional domain). The second additional domain is comprised of a pair of perpendicularly orientated antiparallel beta sheets, of four and three strands, respectively, that surround a central alpha helix that forms the core of the domain. Probab=98.46 E-value=3.7e-08 Score=76.13 Aligned_cols=43 Identities=47% Similarity=0.839 Sum_probs=39.5 Q ss_pred CEEECCCC-CEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEE Q ss_conf 21302575-2010002355214442223333332111015775642068999 Q gi|254781004|r 171 VTLYRQGE-WFDLCRGPHVRSTGQVKKFFKLMKVAGAYWRGDSNRPMLSRIY 221 (652) Q Consensus 171 v~~y~~g~-~~Dlc~GphvpsTg~ik~~fkL~~~sgayw~gd~~~~~l~Riy 221 (652) +.+|.+|+ ++|+|.||||+||+.|++ |||+ ++++.++++|||| T Consensus 1 vrvv~ig~~~~~~C~GtHV~nT~~I~~-fki~-------k~~~~~~g~~RI~ 44 (44) T pfam07973 1 VRVVRIGDFDVDLCGGTHVPNTGEIGA-FKIL-------KGDSKNKGLQRIY 44 (44) T ss_pred CEEEEECCEEEECCCCCCCCCCCCCCE-EEEE-------ECCCCCCCCEECC T ss_conf 979998991987877958788252012-9999-------7144828759629 No 68 >PRK12820 bifunctional aspartyl-tRNA synthetase/aspartyl/glutamyl-tRNA amidotransferase subunit C; Provisional Probab=98.46 E-value=4.6e-07 Score=68.59 Aligned_cols=116 Identities=19% Similarity=0.237 Sum_probs=74.8 Q ss_pred HHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHH Q ss_conf 7899999999999887-689638555432443100011210012555420100145677782377434312577776520 Q gi|254781004|r 272 GWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNH 350 (652) Q Consensus 272 G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~ 350 (652) =.+++.++...+|+.+ +.||.||.||.|+... -+ | +.+.+.+.... .+.-|+|. ..-++|++ T Consensus 155 ~~~~Rskv~~~iR~~l~~~gF~EVETP~L~~st-pE--G-----ArdflvPsr~~----~~~fyaLp-----QSPQlyKQ 217 (706) T PRK12820 155 HLAKRHRIIKCARDFLDSRGFLEIETPILTKST-PE--G-----ARDYLVPSRIH----PKEFYALP-----QSPQLFKQ 217 (706) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEECCEECCCC-CC--C-----CCCCCCCEECC----CCCEECCC-----CCHHHHHH T ss_conf 999999999999999987798896687044567-53--2-----45750024458----87650056-----68299999 Q ss_pred CCC-CHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHHHHHH Q ss_conf 122-223104320001000127876452000105635334421-0105376788789999998 Q gi|254781004|r 351 GLK-SYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIHNLI 411 (652) Q Consensus 351 ~~~-SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~~~~ 411 (652) .+- | -+ =|+|||+.|||||-+.+ -|--||||-|++ +|...+++.+-+..++..+ T Consensus 218 lLMvg--G~-dRyFqIarcFRdEd~r~----dRqPEFTqlDiEmSF~d~edvm~l~E~li~~~ 273 (706) T PRK12820 218 LLMIA--GF-ERYFQLARCFRDEDLRP----NRQPEFTQLDIEASFIDEEFIFELIEELTARM 273 (706) T ss_pred HHHHC--CC-CEEEEEEEEECCCCCCC----CCCCCEEEEEEEHHCCCHHHHHHHHHHHHHHH T ss_conf 99862--86-61799600112899998----87983406874213489999999999999999 No 69 >TIGR00459 aspS_bact aspartyl-tRNA synthetase; InterPro: IPR004524 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Aspartyl tRNA synthetase 6.1.1.12 from EC is an alpha2 dimer that belongs to class IIb. Structural analysis combined with mutagenesis and enzymology data on the yeast enzyme point to a tRNA binding process that starts by a recognition event between the tRNA anticodon loop and the synthetase anticodon binding module . This family represents aspartyl-tRNA synthetases from the bacteria and from mitochondria. In some species, this enzyme aminoacylates tRNA for both Asp and Asn; Asp-tRNA(asn) is subsequently transamidated to Asn-tRNA(asn). ; GO: 0000166 nucleotide binding, 0004815 aspartate-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006422 aspartyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=98.43 E-value=9.9e-07 Score=66.29 Aligned_cols=125 Identities=18% Similarity=0.255 Sum_probs=87.0 Q ss_pred HHHHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHH Q ss_conf 78999999999998876-89638555432443100011210012555420100145677782377434312577776520 Q gi|254781004|r 272 GWKIFQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNH 350 (652) Q Consensus 272 G~~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~ 350 (652) =+.+|.+|...||.-++ .||.||.||+|.++. =+.=+|-+.+ ...+-.|..++-|||- + T Consensus 150 ~L~lR~~v~~~~Rn~l~~~gFleiETP~LtKsT-------PEGARDyLVP-sRv~rsdG~g~FYALp------------Q 209 (653) T TIGR00459 150 RLKLRHKVTKAVRNFLDQQGFLEIETPILTKST-------PEGARDYLVP-SRVHRSDGKGEFYALP------------Q 209 (653) T ss_pred HHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCC-------CCCHHHCCCC-CCCCCCCCCCCEECCC------------C T ss_conf 986688989999866642787065067777888-------4311015775-5300368588451468------------9 Q ss_pred CCCCHHHCCH-----HHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHHHHHHH-HHHHHHCC Q ss_conf 1222231043-----20001000127876452000105635334421-01053767887899999988-76655213 Q gi|254781004|r 351 GLKSYRELPV-----RLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIHNLII-SIYKDFGF 420 (652) Q Consensus 351 ~~~SYrdLPl-----rl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~~~~~-~v~~~~g~ 420 (652) .+.=||+|=+ |||||..|||||=+.+ =|=-||||=|+. +||+.+++.+-++.++.-+- .+..+-|+ T Consensus 210 SPQlfKQLLMvsG~drYyQIArCFRDEDLRA----DRQPEFTQiD~E~SF~~~~~vm~l~E~l~~~vf~~v~~krGl 282 (653) T TIGR00459 210 SPQLFKQLLMVSGVDRYYQIARCFRDEDLRA----DRQPEFTQIDMEMSFMTQEDVMDLIEKLVSGVFKEVKGKRGL 282 (653) T ss_pred CHHHHHHHHHHCCCCCCEEEEEEECCCCCCC----CCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCE T ss_conf 7568999875125553135788976625666----577873244263112893568999999999999997078513 No 70 >PRK09537 pylS pyrolysyl-tRNA synthetase; Reviewed Probab=98.41 E-value=2.8e-07 Score=70.04 Aligned_cols=102 Identities=20% Similarity=0.390 Sum_probs=72.3 Q ss_pred HHHHHHHHHHH-HHCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCC Q ss_conf 99999999988-76896385554324431000112100125554201001456777823774343125777765201222 Q gi|254781004|r 276 FQTLISYMRRK-IKDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKS 354 (652) Q Consensus 276 ~~~ie~~ir~~-~~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~S 354 (652) +.+||+-+.+- .++||.|+.+|.+.|.+--+.-|- |+- .-.. ....+-++.+|||||-.|.-+.+.+.-.|- T Consensus 209 lgkler~it~ffvdrgfleikspilip~eyiermgi-d~d--~els----kQiFrvd~n~CLRPMLAPnLYnylrkL~R~ 281 (420) T PRK09537 209 LGKLERDITKFFVDRGFLEIKSPILIPAEYIERMGI-DND--TELS----KQIFRVDKNFCLRPMLAPNLYNYLRKLDRA 281 (420) T ss_pred HHHHHHHHHHHHHCCCCEEECCCEECCHHHHHHHCC-CCH--HHHH----HHHEEECCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 767788889985414513514850421999998578-860--5655----451674345332410142699999999861 Q ss_pred HHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCC Q ss_conf 23104320001000127876452000105635334 Q gi|254781004|r 355 YRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQD 389 (652) Q Consensus 355 YrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~ 389 (652) .-| |++++|+|.+||.|--|. --.-||||- T Consensus 282 lpd-PIrIFEIGpCyRKESdG~----~HLeEFTMl 311 (420) T PRK09537 282 LPD-PIKIFEIGPCYRKESDGK----EHLEEFTMV 311 (420) T ss_pred CCC-CEEEEEECCCCCCCCCCH----HHHHHHHHH T ss_conf 688-715886204303444106----788777777 No 71 >COG0173 AspS Aspartyl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=98.41 E-value=7.9e-07 Score=66.97 Aligned_cols=116 Identities=20% Similarity=0.330 Sum_probs=78.0 Q ss_pred HHHHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHH Q ss_conf 78999999999998876-89638555432443100011210012555420100145677782377434312577776520 Q gi|254781004|r 272 GWKIFQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNH 350 (652) Q Consensus 272 G~~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~ 350 (652) -.++|.++...+|..+. .||.||.||.|..+.= + .-|+|..---..|+...---+ T Consensus 140 ~l~lR~kv~~~iR~~ld~~gF~EiETPiLtkSTP-------E-----------------GARDfLVPSRv~~G~FYALPQ 195 (585) T COG0173 140 NLKLRSKVTKAIRNFLDDQGFLEIETPILTKSTP-------E-----------------GARDFLVPSRVHPGKFYALPQ 195 (585) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCC-------C-----------------CCCCCCCCCCCCCCCEEECCC T ss_conf 9999999999999977654985734674146798-------6-----------------554430134558996530688 Q ss_pred CCCCHHHCCH-----HHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 1222231043-----20001000127876452000105635334421-01053767887899999988766552 Q gi|254781004|r 351 GLKSYRELPV-----RLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIHNLIISIYKDF 418 (652) Q Consensus 351 ~~~SYrdLPl-----rl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~~~~~~v~~~~ 418 (652) .+.-||+|=+ |+|||..|||||-..+ =|--||||-|.. ||++.+++.+-...++ ..+++.+ T Consensus 196 SPQlfKQLLMvsGfdRYyQIarCFRDEDlRa----DRQPEFTQiD~EmSF~~~edv~~~~E~l~---~~vf~~~ 262 (585) T COG0173 196 SPQLFKQLLMVAGFDRYYQIARCFRDEDLRA----DRQPEFTQIDLEMSFVDEEDVMELIEKLL---RYVFKEV 262 (585) T ss_pred CHHHHHHHHHHHCCCCEEEEEEEECCCCCCC----CCCCCCEEEEEEEECCCHHHHHHHHHHHH---HHHHHHH T ss_conf 8799999998805330023565404664444----46875336767764388999999999999---9999985 No 72 >PRK00484 lysS lysyl-tRNA synthetase; Reviewed Probab=98.41 E-value=1.5e-06 Score=65.07 Aligned_cols=116 Identities=17% Similarity=0.204 Sum_probs=73.2 Q ss_pred HHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHC Q ss_conf 899999999999887-6896385554324431000112100125554201001456777823774343125777765201 Q gi|254781004|r 273 WKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHG 351 (652) Q Consensus 273 ~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~ 351 (652) .+++.+|.+.+|+.+ ++||.||.||.|++.. .| .+-.++....+ ..+.+++|+= .-++|++. T Consensus 171 f~~Rs~ii~~iR~~l~~~gF~EVeTPiL~~~~----gG------A~ArpF~t~~n--~l~~~~yL~~-----SPqLylk~ 233 (491) T PRK00484 171 FRKRSKIISAIRRFLDNRGFLEVETPMLQPIP----GG------AAARPFITHHN--ALDIDLYLRI-----APELYLKR 233 (491) T ss_pred HHHHHHHHHHHHHHHHHCCEEEEECCCCCCCC----CC------CCCCCCCCCCC--CCCCCEEECC-----CHHHHHHH T ss_conf 99999999999999986768998678776668----87------56676556445--6685544206-----87898777 Q ss_pred CCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHHHHHH Q ss_conf 22223104320001000127876452000105635334421-0105376788789999998 Q gi|254781004|r 352 LKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIHNLI 411 (652) Q Consensus 352 ~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~~~~ 411 (652) .-- --+ =|+||||.+||||-+.+ .|.-||||-|+| +|+..+++.+....++..+ T Consensus 234 l~v-gG~-ervfeI~r~FR~E~~~~----rH~pEFT~lE~e~af~d~~dvm~l~E~li~~v 288 (491) T PRK00484 234 LIV-GGF-ERVFEIGRNFRNEGIDT----RHNPEFTMIEFYQAYADYNDMMDLTEELIRHL 288 (491) T ss_pred HHH-CCC-HHHEEHHHHHHCCCCCC----CCCCCEEEEEEEEECCCHHHHHHHHHHHHHHH T ss_conf 876-272-02222487662575543----41840102136772479999999999999999 No 73 >PRK09350 lysyl-tRNA synthetase; Provisional Probab=98.40 E-value=1.6e-06 Score=64.89 Aligned_cols=126 Identities=15% Similarity=0.155 Sum_probs=77.4 Q ss_pred EEECCH----HHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCH Q ss_conf 650107----899999999999887-689638555432443100011210012555420100145677782377434312 Q gi|254781004|r 267 FWHRKG----WKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNC 341 (652) Q Consensus 267 ~wlP~G----~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmnc 341 (652) .|.|-+ +++|.+|...+|+.+ ++||.||.||.|++..- .-+|.+-+.-+-+.+. ...+..+.|.... T Consensus 6 ~w~~~as~~~l~~Rs~i~~~iR~ff~~~gFlEVeTP~L~~~t~--~~~~~~~f~~~~~~~~-----~~~~~~~yL~~SP- 77 (325) T PRK09350 6 SWQPSASIKNLLKRAAIIAEIRRFFADRGVLEVETPAMSQATV--TDIHLVPFETRFVGPG-----HSQGKTLWLMTSP- 77 (325) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCC--CCCCCCCEEEECCCCC-----CCCCCCEEEECCH- T ss_conf 7777147999999999999999999988968977984257788--7646762221025655-----4567664550791- Q ss_pred HHHHHHHHHCC-CCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHHHHHH Q ss_conf 57777652012-2223104320001000127876452000105635334421-0105376788789999998 Q gi|254781004|r 342 PGHVAVFNHGL-KSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIHNLI 411 (652) Q Consensus 342 p~h~~i~~~~~-~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~~~~ 411 (652) ++|++.. -|. + =|+||||.+||+|.+++ .|.-||||-|.| +|+..+++.++...++..+ T Consensus 78 ----ql~~k~l~~~G--~-~rvfqI~~~FR~E~~~~----~H~pEFtmlE~~~~~~d~~d~m~~~e~ll~~~ 138 (325) T PRK09350 78 ----EYHMKRLLAAG--S-GPIFQLCRSFRNEEAGR----YHNPEFTMLEWYRPHYDMYRLMNEVDDLLQQV 138 (325) T ss_pred ----HHHHHHHHHCC--C-CCEEEEEHHHCCCCCCC----CCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf ----99999998667--8-85589611322899987----65737788899981548999999999999999 No 74 >TIGR00415 serS_MJ seryl-tRNA synthetase; InterPro: IPR004503 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Seryl-tRNA synthetase (6.1.1.11 from EC) exists as monomer and belongs to class IIa . The seryl-tRNA synthetases from a few of the archaea that belong to this group are different from the set of mutually more closely related seryl-tRNA synthetases from eubacteria, eukaryotes, and other archaea (IPR002317 from INTERPRO).; GO: 0000166 nucleotide binding, 0004828 serine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006434 seryl-tRNA aminoacylation, 0005737 cytoplasm. Probab=98.30 E-value=4.9e-06 Score=61.50 Aligned_cols=164 Identities=16% Similarity=0.226 Sum_probs=135.1 Q ss_pred EECCC-CCCEEEEECCHHHHHHHHHHHHHHH-HHC-CCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCC----- Q ss_conf 42267-8743565010789999999999988-768-963855543244310001121001255542010014567----- Q gi|254781004|r 257 HIAED-GSGVIFWHRKGWKIFQTLISYMRRK-IKD-DYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTI----- 328 (652) Q Consensus 257 ~~~~~-~~G~~~wlP~G~~i~~~ie~~ir~~-~~~-G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~----- 328 (652) .+... |.|..++-|.=..+.+.+++++-++ .+. |+++...|.+.|.++.......+..-+.||......... T Consensus 206 w~~~~pg~g~w~~~p~~~~~~~~~~~~~~~~~~~~~g~~~c~~p~~~p~~~~~~~~~~~g~p~g~~~~~~p~~~p~~~~~ 285 (520) T TIGR00415 206 WVKKFPGRGQWFYGPPITALFRALEELFVEEVVKPLGFVECLFPKLIPLDILYKLRYLEGLPEGLYYCCPPKRDPELFEE 285 (520) T ss_pred CHHCCCCCCCEECCCCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCHHHHHH T ss_conf 01126887620016506788888998889887654011345310010267777767641365452552277556357898 Q ss_pred -------------------CCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHH-HEEEECCCCCCCCEEEEECCCCCC Q ss_conf -------------------778237743431257777652012222310432000-100012787645200010563533 Q gi|254781004|r 329 -------------------KDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAE-FGSVYRNEPSGSLHGLMRVRGFTQ 388 (652) Q Consensus 329 -------------------~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e-~~~~~R~E~sg~l~GL~R~ReF~~ 388 (652) -.+..|.+.|..|+.....+...+-+-.|+|+.+++ -||.||+|..| ..||-|+.+|.. T Consensus 286 ~~~~~~~~~~~p~~~~~~~~~dpg~~~~p~~c~p~~~~~~~~~~~~~d~p~~~~d~~gw~~~w~~gg-~~g~d~~~~~~~ 364 (520) T TIGR00415 286 FKNELLINKEVPLDLLKNGLKDPGYVLAPAQCEPFYQFLEGEVVDAEDLPVKFFDRSGWTYRWEGGG-AKGLDRVNEFLR 364 (520) T ss_pred HHHHHHHHCCCCHHHHHHHHCCCCCEECCCCCCHHHHHHCCCCCCCCCCCEEEECCCCCEEEECCCC-CCCHHHHHHHHH T ss_conf 7665542003557898753048770430311003567541211000014311220577424423766-321245545331 Q ss_pred CCCEEECCHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 442101053767887899999988766552133 Q gi|254781004|r 389 DDAHVFCTKEQMFNECLKIHNLIISIYKDFGFE 421 (652) Q Consensus 389 ~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~g~~ 421 (652) -+.--...|+..++.-...+.+.......+.+. T Consensus 365 ~~~~w~~~p~~~~~~~d~~~~~~~~~~~~~~~~ 397 (520) T TIGR00415 365 VELVWLGEPGFTEETRDKTLELAEDLADELDLE 397 (520) T ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 113551475201233344566555555541001 No 75 >cd01668 TGS_RelA_SpoT TGS_RelA_SpoT: The RelA (SpoT) protein, also referred to as ppGpp hydrolase/synthetase, is a ribosome-associated protein that is activated during amino acid starvation and thought to mediate the stringent response. RelA contains a TGS domain, named after the Threonyl-tRNA Synthetase, GTPase, and SpoT proteins where it occurs. The function of the TGS domain is unknown. Probab=98.22 E-value=2.6e-06 Score=63.44 Aligned_cols=58 Identities=34% Similarity=0.532 Sum_probs=53.2 Q ss_pred EEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECCEEECCC-EEEEEC Q ss_conf 8992899887418988899999863611011289999898899774530367-079862 Q gi|254781004|r 6 KLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLSDPVREG-SIEIIT 63 (652) Q Consensus 6 ~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~~~l~d~-~i~~i~ 63 (652) -+=-|+|.+++.|.|.|+.|.|-.++.++++..++|+|||++.+|+++|+|+ .|+++| T Consensus 2 ~vftP~G~~~~LP~GaT~lDfAy~iht~lG~~~~~a~vNg~~~~l~~~L~~GD~VeIiT 60 (60) T cd01668 2 YVFTPKGEIIELPAGATVLDFAYAIHTEIGNRCVGAKVNGKLVPLSTVLKDGDIVEIIT 60 (60) T ss_pred EEECCCCCEEECCCCCCHHHEHHHHCCCHHHEEEEEEECCEECCCCCCCCCCCEEEEEC T ss_conf 99899997787899981410433554101422689999999989876828999999879 No 76 >PRK05159 aspC aspartyl-tRNA synthetase; Provisional Probab=98.20 E-value=1.3e-05 Score=58.51 Aligned_cols=114 Identities=18% Similarity=0.312 Sum_probs=72.9 Q ss_pred HHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHH Q ss_conf 7899999999999887-689638555432443100011210012555420100145677782377434312577776520 Q gi|254781004|r 272 GWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNH 350 (652) Q Consensus 272 G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~ 350 (652) =.+++.++.+.+|+.+ ++||.||.||.|....-- | +-+.|++... ++++.|+=. -++|++ T Consensus 136 ~~r~Rs~i~~~iR~fl~~~gF~EVeTP~l~~~~~e---G-----ga~~f~~~~~------~~~~yL~~S-----pqly~q 196 (434) T PRK05159 136 IFKIRSEVLRAFREFLYEEGFTEIFTPKIVATGTE---G-----GTELFPVKYF------EKEAFLAQS-----PQLYKQ 196 (434) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCC---C-----CCCCCCEEEC------CCEEEECCC-----CHHHHH T ss_conf 99999999999999998779199978743256788---7-----5565861312------753341468-----379999 Q ss_pred CC-CCHHHCCHHHHHHEEEECCCCCCC-CEEEEECCCCCCCCCE-EECCHHHHHHHHHHHHHHH Q ss_conf 12-222310432000100012787645-2000105635334421-0105376788789999998 Q gi|254781004|r 351 GL-KSYRELPVRLAEFGSVYRNEPSGS-LHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIHNLI 411 (652) Q Consensus 351 ~~-~SYrdLPlrl~e~~~~~R~E~sg~-l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~~~~ 411 (652) .. -| .+. |+||||.+||||-+.+ || .-||||-|+| .|++.+++.+....++..+ T Consensus 197 ~li~~--G~~-rvfeI~~~FR~E~~~t~RH----~pEFT~lE~e~af~d~~d~m~~~E~li~~i 253 (434) T PRK05159 197 MMMAA--GFE-RVFEIGPAFRAEEHNTTRH----LNEAISIDVEMGFIDEEDVMDLLENLLKYV 253 (434) T ss_pred HHHHC--CCC-CEEEECCCEECCCCCCCCC----HHHHHHHHHHHCCCCHHHHHHHHHHHHHHH T ss_conf 98763--568-5599660102365875445----356765666631068999999999999999 No 77 >PRK10872 relA GDP/GTP pyrophosphokinase; Provisional Probab=98.18 E-value=4e-06 Score=62.08 Aligned_cols=25 Identities=28% Similarity=0.329 Sum_probs=16.1 Q ss_pred CCCEEE-----CCC------CCEEECCCCCCCHHHH Q ss_conf 542130-----257------5201000235521444 Q gi|254781004|r 169 ENVTLY-----RQG------EWFDLCRGPHVRSTGQ 193 (652) Q Consensus 169 ~~v~~y-----~~g------~~~Dlc~GphvpsTg~ 193 (652) ++..+| |.| +..|||.-+.=|..-. T Consensus 177 ETl~IYAPLA~RLGi~~iK~ELEDLsF~~l~P~~y~ 212 (743) T PRK10872 177 ECTNIYAPLANRLGIGQLKWELEDYCFRYLHPTEYK 212 (743) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCHHHHH T ss_conf 999999998774173999999999998622999999 No 78 >cd00775 LysRS_core Lys_tRNA synthetase (LysRS) class II core domain. Class II LysRS is a dimer which attaches a lysine to the 3' OH group of ribose of the appropriate tRNA. Its assignment to class II aaRS is based upon its structure and the presence of three characteristic sequence motifs in the core domain. It is found in eukaryotes as well as some prokaryotes and archaea. However, LysRS belongs to class I aaRS's in some prokaryotes and archaea. The catalytic core domain is primarily responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Probab=98.16 E-value=6.2e-06 Score=60.80 Aligned_cols=115 Identities=18% Similarity=0.237 Sum_probs=71.8 Q ss_pred HHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHC Q ss_conf 899999999999887-6896385554324431000112100125554201001456777823774343125777765201 Q gi|254781004|r 273 WKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHG 351 (652) Q Consensus 273 ~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~ 351 (652) .+++.+|...+|+.+ ++||.||.||.|.+.. +|-- -..|.+.. ...+..+.|.-. -++|++. T Consensus 8 ~~~Rs~i~~~iR~ff~~~gF~Ev~TPiL~~~~----~g~~----~~~f~~~~----~~~~~~~yL~qS-----pQl~~q~ 70 (329) T cd00775 8 FIVRSKIISYIRKFLDDRGFLEVETPMLQPIA----GGAA----ARPFITHH----NALDMDLYLRIA-----PELYLKR 70 (329) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEECCCCCCCC----CCCC----CCCEEECC----CCCCCCCCCCCC-----HHHHHHH T ss_conf 99999999999999998898899798655668----8756----75313224----467867230689-----8999999 Q ss_pred CC-CHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHHHHHH Q ss_conf 22-223104320001000127876452000105635334421-0105376788789999998 Q gi|254781004|r 352 LK-SYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIHNLI 411 (652) Q Consensus 352 ~~-SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~~~~ 411 (652) .- | .+. |+||||.+||+|-+++ .+.-||||-|++ +|++.+++.+....++..+ T Consensus 71 l~~~--g~~-rvfqI~p~FR~E~~~~----rHl~EFtmle~E~~f~d~~dvm~~~E~li~~i 125 (329) T cd00775 71 LIVG--GFE-RVYEIGRNFRNEGIDL----THNPEFTMIEFYEAYADYNDMMDLTEDLFSGL 125 (329) T ss_pred HHHC--CCC-CEEEEEECCCCCCCCC----CCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 9856--777-6799810224799997----74734563457774479999999999999999 No 79 >PRK03932 asnC asparaginyl-tRNA synthetase; Validated Probab=98.16 E-value=8.1e-06 Score=59.99 Aligned_cols=125 Identities=14% Similarity=0.207 Sum_probs=70.5 Q ss_pred HHHHHHHHHHHHHHH-HCCCEEEECCCCCCC------CCCCCC-CCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHH Q ss_conf 899999999999887-689638555432443------100011-210012555420100145677782377434312577 Q gi|254781004|r 273 WKIFQTLISYMRRKI-KDDYEEINTPQVLDQ------HLWQQS-GHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGH 344 (652) Q Consensus 273 ~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~------~Lw~~S-Gh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h 344 (652) .+++.+|.+.+|+.+ ++||.||.||.|+.. +++..+ +...+.... ........+|.-+++.-... T Consensus 135 fr~RS~i~~~iR~~l~~~~F~EVeTP~L~~~~~eGaa~~F~v~~~~~~~~~~~-------~~~~~~~~~yf~~~~yL~~S 207 (462) T PRK03932 135 MRVRNTLAQAIHEFFQENGFVWVHTPIITASDCEGAGELFRVTTLDLENLPRT-------DGKVDFSKDFFGKEAFLTVS 207 (462) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCEEECCCCCCCCCCC-------CCCCCCHHHHCCCCCCCCCC T ss_conf 99999999999999976796799788653568764557404214543445433-------44433114542776202657 Q ss_pred HHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHHHHH Q ss_conf 776520122223104320001000127876452000105635334421-010537678878999999 Q gi|254781004|r 345 VAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIHNL 410 (652) Q Consensus 345 ~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~~~ 410 (652) -++|++..-+ -+. |+||||.+||||-+-+. .-.-||||-|+| .|...+++.+-...++.. T Consensus 208 pqLylq~li~--G~e-rVfeIg~~FRnE~~~t~---RH~pEFT~lE~e~a~~d~~d~m~l~E~li~~ 268 (462) T PRK03932 208 GQLYAEAYAM--ALS-KVYTFGPTFRAENSNTR---RHLAEFWMIEPEMAFADLEDNMDLAEDMLKY 268 (462) T ss_pred HHHHHHHHHH--HCC-CEEEEEHHHHHCCCCCC---CCCCCCEEEEEEEECCCHHHHHHHHHHHHHH T ss_conf 6899999875--206-48997332320567754---4543002555311006899999999999999 No 80 >PRK11092 bifunctional (p)ppGpp synthetase II/ guanosine-3',5'-bis pyrophosphate 3'-pyrophosphohydrolase; Provisional Probab=98.13 E-value=6.2e-06 Score=60.80 Aligned_cols=119 Identities=11% Similarity=0.118 Sum_probs=48.7 Q ss_pred CCEEECCCCCCCHHHH-HHHHHHHHHHHH-HCC--------C-CCCCCCEEEEEEEEEECCHHHHHHHHHHHHHHHHHHH Q ss_conf 5201000235521444-222333333211-101--------5-7756420689999983798899999999997532008 Q gi|254781004|r 178 EWFDLCRGPHVRSTGQ-VKKFFKLMKVAG-AYW--------R-GDSNRPMLSRIYGTAWNTQQELTQYLYFLEESEKRDH 246 (652) Q Consensus 178 ~~~Dlc~GphvpsTg~-ik~~fkL~~~sg-ayw--------~-gd~~~~~l~Riyg~af~~~~~l~~~~~~~eea~~rdH 246 (652) +..|||.-.+-|..-. |+...+..+... .|- . =++++ .=-+|+|= .|---.-|.++. +-+ T Consensus 181 ELEDlsf~~l~P~~y~~i~~~l~~~~~~r~~~i~~~~~~l~~~L~~~~-i~~~v~gR---~K~~ySI~~Km~-----~k~ 251 (702) T PRK11092 181 ELEELGFEALYPNRYRVIKEVVKAARGNRKEMIQKILSEIEGRLQEAG-IPCRVSGR---EKHLYSIYCKMV-----LKE 251 (702) T ss_pred HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CEEEEEEC---CCCHHHHHHHHH-----HCC T ss_conf 999999987498999999999998766789999999999999999779-91599812---574737889987-----338 Q ss_pred HHHHHHCCEEEECCC-CCCEEEEECCHH--HH----HHHHHHHHHHHHHCCCEEEECCCCCCCCCC Q ss_conf 875411440442267-874356501078--99----999999999887689638555432443100 Q gi|254781004|r 247 RKLAREMDLFHIAED-GSGVIFWHRKGW--KI----FQTLISYMRRKIKDDYEEINTPQVLDQHLW 305 (652) Q Consensus 247 r~lg~~~~lf~~~~~-~~G~~~wlP~G~--~i----~~~ie~~ir~~~~~G~~eV~tP~l~~~~Lw 305 (652) ..+..-.|||-|.=. ..-.--|.-+|. .. -..+.+||..=-.+|||-+.|.++.|...| T Consensus 252 ~~f~~i~Dl~a~RIiv~~~~~CY~~LG~iH~~~~pipgrfkDYIa~PK~NgYQSLHTtV~gp~g~~ 317 (702) T PRK11092 252 QRFHSIMDIYAFRVIVDDSDTCYRVLGQMHSLYKPRPGRVKDYIAIPKANGYQSLHTSMIGPHGVP 317 (702) T ss_pred CCCCCCCCHHHEEEEECCHHHHHHHHHHHHHHCCCCCCHHHHHCCCCCCCCCCEEEEEEECCCCCE T ss_conf 882321240016999688999999999999736879422665412789887301367633599957 No 81 >KOG2411 consensus Probab=98.12 E-value=5.3e-06 Score=61.28 Aligned_cols=110 Identities=20% Similarity=0.435 Sum_probs=76.1 Q ss_pred CCHHHHHHHHHHHHHHHH-HC-CCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHH-HHHH Q ss_conf 107899999999999887-68-96385554324431000112100125554201001456777823774343125-7777 Q gi|254781004|r 270 RKGWKIFQTLISYMRRKI-KD-DYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCP-GHVA 346 (652) Q Consensus 270 P~G~~i~~~ie~~ir~~~-~~-G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp-~h~~ 346 (652) -+-++++..+..-||+.+ ++ ||.||.||+|+.+. -| ..++|.. |+-.| ++.. T Consensus 175 q~nLrlRS~~v~~iR~yl~n~~GFvevETPtLFkrT----Pg--------------------GA~EFvV-Ptr~~~g~FY 229 (628) T KOG2411 175 QNNLRLRSNVVKKIRRYLNNRHGFVEVETPTLFKRT----PG--------------------GAREFVV-PTRTPRGKFY 229 (628) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCEEECCCCCHHCCC----CC--------------------CCCEEEC-CCCCCCCCEE T ss_conf 878999999999999998610473551572020148----88--------------------6412322-3678988555 Q ss_pred HHHHCCCCHHHCCH-----HHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHHH Q ss_conf 65201222231043-----20001000127876452000105635334421-0105376788789999 Q gi|254781004|r 347 VFNHGLKSYRELPV-----RLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIH 408 (652) Q Consensus 347 i~~~~~~SYrdLPl-----rl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~ 408 (652) -.-+.+.-||+|=+ |+||+..|||||-+.. =|--||||-|+. +|...+.+.+-+...+ T Consensus 230 aLpQSPQQfKQlLMvsGidrYyQiARCfRDEdlR~----DRQPEFTQvD~EMsF~~~~dim~liEdll 293 (628) T KOG2411 230 ALPQSPQQFKQLLMVSGIDRYYQIARCFRDEDLRA----DRQPEFTQVDMEMSFTDQEDIMKLIEDLL 293 (628) T ss_pred ECCCCHHHHHHHHHHHCHHHHHHHHHHHCCCCCCC----CCCCCCEEEEEEEECCCHHHHHHHHHHHH T ss_conf 36889899999998723036776776641345676----56876435444785367787899999999 No 82 >pfam00152 tRNA-synt_2 tRNA synthetases class II (D, K and N). Probab=98.12 E-value=1.8e-05 Score=57.64 Aligned_cols=118 Identities=16% Similarity=0.178 Sum_probs=73.6 Q ss_pred HHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHH Q ss_conf 7899999999999887-689638555432443100011210012555420100145677782377434312577776520 Q gi|254781004|r 272 GWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNH 350 (652) Q Consensus 272 G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~ 350 (652) =++++..|.+.+|+.+ ++||.||.||.|.+.. ..|.. +.|.+... .++...|....+-+.-.+.. T Consensus 21 ~l~~Rs~i~~~iR~ff~~~~f~EV~TP~L~~~~---~e~~a-----~~F~~~~~-----~~~~~yL~~Spel~~k~ll~- 86 (341) T pfam00152 21 NLKLRSKIIRAIREFLDERGFLEVETPILTKST---PEGGA-----RDFLVPKF-----YAKEAYLPQSPQLYKQLLMV- 86 (341) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCC---CCCCC-----CEEEECCC-----CCCCEEECCCHHHHHHHHHH- T ss_conf 999999999999999998898998798200558---98777-----66450267-----89823405488999999986- Q ss_pred CCCCHHHCCHHHHHHEEEECCCCC-CCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHHHHHHHH Q ss_conf 122223104320001000127876-452000105635334421-010537678878999999887 Q gi|254781004|r 351 GLKSYRELPVRLAEFGSVYRNEPS-GSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIHNLIIS 413 (652) Q Consensus 351 ~~~SYrdLPlrl~e~~~~~R~E~s-g~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~~~~~~ 413 (652) + .+ =|+||||.+||+|.+ +.+ -.-||||-|.| .|.+-+++.+....++..+.. T Consensus 87 ---~--g~-~rVfei~~~FR~E~~~t~r----H~~EFtmlE~y~a~~d~~d~m~~~E~li~~i~~ 141 (341) T pfam00152 87 ---A--GF-DRVFQIAPCFRDEDLRTDR----HPPEFTQLDLEMSFVDYEDVMDLTEDLIKYVFK 141 (341) T ss_pred ---C--CC-CCCEEEECHHCCCCCCCCC----CCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH T ss_conf ---5--88-7637972320279898865----507788776755359999999999999999999 No 83 >PRK07080 hypothetical protein; Validated Probab=98.10 E-value=0.00011 Score=52.26 Aligned_cols=246 Identities=15% Similarity=0.154 Sum_probs=147.0 Q ss_pred CCCCEEEEECCHHHHHHHHHHHHHHH-HHCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEEC-CCCC----------- Q ss_conf 78743565010789999999999988-7689638555432443100011210012555420100-1456----------- Q gi|254781004|r 261 DGSGVIFWHRKGWKIFQTLISYMRRK-IKDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKC-ADDT----------- 327 (652) Q Consensus 261 ~~~G~~~wlP~G~~i~~~ie~~ir~~-~~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~-~~~~----------- 327 (652) -..|+|==-..=-.|-..|+.++... -..|.+-...|-+.+...+++||....| +++-.... ..+. T Consensus 36 gvdGlYgRs~~fE~vv~~l~a~i~~~g~~~~~evlrFPPvm~r~~fE~s~YlkSF-P~L~GtV~~f~G~e~~h~~ll~~~ 114 (318) T PRK07080 36 GVDGLYGRSGLFERVVDALDALITRLGADQGAEVLRFPPAMSRAEFERSGYLKSF-PQLAGTVHSFCGNEREHRRLLACL 114 (318) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEECCCCCCHHHHHHCCHHHHC-CHHHCCEECCCCCCHHHHHHHHHH T ss_conf 8874303615499999999999987364458715766998865553330465525-002010010468968999999998 Q ss_pred --------CCCCCEEEECCCCHHHHHHHHHHCCCCHHHCC---HHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECC Q ss_conf --------77782377434312577776520122223104---3200010001278764520001056353344210105 Q gi|254781004|r 328 --------IKDLRTFALKPMNCPGHVAVFNHGLKSYRELP---VRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCT 396 (652) Q Consensus 328 --------~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLP---lrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~ 396 (652) +-.--+.+|-|..|=..+.+..+.- .|| .++--.+.|||+|+|- -+.|...|.|.+---.=+ T Consensus 115 ~~g~dWt~~~~ptdvvLtPAACyPvYP~~a~rG----~LP~~G~~~dv~~yCFRhEPS~---dp~Rmq~FRMrEyV~iGt 187 (318) T PRK07080 115 DRGEDWTESQKPTDVVLTPAACYPVYPVVASRG----PLPAGGRIVDVFSYCFRHEPSL---DPARMQLFRMREYVRIGT 187 (318) T ss_pred HCCCCCCCCCCCCCEEECCCCCCCCCHHHCCCC----CCCCCCEEEEEEEEHHCCCCCC---CHHHHHHHHCEEEEEECC T ss_conf 718984223576645750401142537660278----8988874898764210178988---858888644044577179 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC Q ss_conf 37678878999999887665521332100014653112586035778999999999850212125667520555555455 Q gi|254781004|r 397 KEQMFNECLKIHNLIISIYKDFGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYG 476 (652) Q Consensus 397 ~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afyg 476 (652) ++|+.+.-..++.-...+...+|+.. .+..+..|. .|..- -..+....++.| ++++.. T Consensus 188 ~e~v~a~R~~W~~r~~~~~~~L~L~~-~vd~AnDPF--FGr~G-k~~a~~Qreq~L-----------KfElli------- 245 (318) T PRK07080 188 PEQILAFRQRWIERGTRMADALGLPV-EIDVANDPF--FGRGG-KIVADSQREQNL-----------KFELLI------- 245 (318) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCC-CEECCCCCC--CCCHH-HHHHHHHHHHHC-----------CEEEEE------- T ss_conf 99999999999999999999759985-231157875--46307-777557776421-----------347999------- Q ss_pred CCCCCCHHHHCCCCCCCCC-CCHHHHCCCCCCCEEEEECCCEEEEEEEEECCCH-HHHHHHHHHHHHHCC---CCCCC Q ss_conf 4321100211033100121-1011102234674167317972137996403410-378999999997387---08873 Q gi|254781004|r 477 PKFEYILKDAIGRDWQCGT-IQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFG-SIERFIGIMIENFKG---NLPLW 549 (652) Q Consensus 477 pkid~~~~d~~gr~~~~~t-iq~df~~p~~f~l~y~~~dg~~~~pvmihr~~~G-s~eR~ia~liE~~~g---~~P~w 549 (652) -+.+. .+.-.|.. .+.-.....+|++ .++||. ..|.+|.| ++||+.-||.-|||. .||.. T Consensus 246 -----pi~~~-~~ptAc~S~NyH~dhFG~~~~i--~~adg~-----~AHT~CVgFGlERlalAL~~~HGlD~~~WP~~ 310 (318) T PRK07080 246 -----PIESD-ARPTACMSFNYHMDHFGLTWGI--RTADGA-----VAHTGCVGFGLERLALALFRHHGLDPAAWPAA 310 (318) T ss_pred -----ECCCC-CCCEEEEEECHHHHHCCCCCCC--CCCCCC-----EEEHHHHHCCHHHHHHHHHHHCCCCHHHCCHH T ss_conf -----73689-9971589621145550366674--047887-----10010211159999999998719981437788 No 84 >PRK02983 lysS lysyl-tRNA synthetase; Provisional Probab=98.08 E-value=2e-05 Score=57.26 Aligned_cols=115 Identities=17% Similarity=0.159 Sum_probs=68.2 Q ss_pred HHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHC Q ss_conf 899999999999887-6896385554324431000112100125554201001456777823774343125777765201 Q gi|254781004|r 273 WKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHG 351 (652) Q Consensus 273 ~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~ 351 (652) .+++.++...+|+.+ ++||.||.||.|++. ||- ++--++....+ ..+.+++|+=. -++|++. T Consensus 776 ~~~RS~Vi~aiR~~L~~~GFlEVETPiLq~~-----~GG-----A~ARPFlThsn--a~d~~~YLriA-----PELflKR 838 (1099) T PRK02983 776 IRARSAVLRAVRETLVAKGFLEVETPILQQI-----HGG-----ANARPFLTHIN--AYDLDLYLRIA-----PELYLKR 838 (1099) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEECCCCCCC-----CCC-----CCCCCCCCCCC--CCCCCCEEECC-----HHHHHHH T ss_conf 9999999999999998789889557546778-----887-----56785301033--67866145148-----0899999 Q ss_pred CCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHHHHH Q ss_conf 22223104320001000127876452000105635334421-010537678878999999 Q gi|254781004|r 352 LKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIHNL 410 (652) Q Consensus 352 ~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~~~ 410 (652) +--= -+ =|+|||+.+||||--.+ .|--||||-|.| +|+..+++.+....++.. T Consensus 839 LmVG-Gf-eRVFEI~RcFRNEglra----dHnPEFTmLE~Y~Ay~Dyedmm~ltEeLi~~ 892 (1099) T PRK02983 839 LCVG-GV-ERVFELGRAFRNEGVDF----SHNPEFTLLEAYQAHADYLVMRDLCRELIQN 892 (1099) T ss_pred HHHC-CC-CCCEEECCCCCCCCCCC----CCCCCHHHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 9864-86-31134325446799998----7494079999999847899999999999999 No 85 >PRK00476 aspS aspartyl-tRNA synthetase; Validated Probab=98.05 E-value=7.3e-06 Score=60.31 Aligned_cols=115 Identities=22% Similarity=0.300 Sum_probs=71.3 Q ss_pred HHHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHC Q ss_conf 8999999999998876-896385554324431000112100125554201001456777823774343125777765201 Q gi|254781004|r 273 WKIFQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHG 351 (652) Q Consensus 273 ~~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~ 351 (652) .++|.++...+|+.+. +||.||.||.|+.+.- .| . ++ |.+-. + ..++.-|+|. ..-++|++. T Consensus 142 fr~Rs~i~~~iR~fl~~~gFiEVeTPiL~~s~~---eG-A---~~--F~vps-r--~~~~~fyaL~-----qSPQLykq~ 204 (587) T PRK00476 142 LKLRSKVTSAIRNFLDDNGFLEIETPILTKSTP---EG-A---RD--YLVPS-R--VHPGKFYALP-----QSPQLFKQL 204 (587) T ss_pred HHHHHHHHHHHHHHHHHCCCCEECCCCCCCCCC---CC-C---CC--CCCCC-C--CCCCCEECCC-----CCHHHHHHH T ss_conf 999999999999999975974004754325686---66-6---44--30131-0--5788551356-----797999999 Q ss_pred C-CCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHHHHHH Q ss_conf 2-2223104320001000127876452000105635334421-0105376788789999998 Q gi|254781004|r 352 L-KSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIHNLI 411 (652) Q Consensus 352 ~-~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~~~~ 411 (652) + -| -+ =|+||||.+||+|-+-+. |--||||-|+| .|+..+++.+-...++..+ T Consensus 205 L~vg--G~-erVyeig~~FRnE~~~t~----r~pEFt~lE~E~af~d~~dvm~~~E~li~~v 259 (587) T PRK00476 205 LMVA--GF-DRYYQIARCFRDEDLRAD----RQPEFTQIDIEMSFVTQEDVMDLMEGLIRHV 259 (587) T ss_pred HHHC--CC-CEEEEECCCCCCCCCCCC----CCCHHHEEEHHHHCCCHHHHHHHHHHHHHHH T ss_conf 8851--75-527996431238888766----6823423111111168999999999999999 No 86 >COG0317 SpoT Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription] Probab=98.04 E-value=2.1e-05 Score=57.14 Aligned_cols=12 Identities=8% Similarity=0.534 Sum_probs=4.5 Q ss_pred HHHHHHHHHCCC Q ss_conf 999999998798 Q gi|254781004|r 569 QEIANLLKSHHL 580 (652) Q Consensus 569 ~~i~~~L~~~gi 580 (652) .+|.+.|.+.+. T Consensus 642 ~~i~~~i~~~~~ 653 (701) T COG0317 642 RDVSQVLANEKI 653 (701) T ss_pred HHHHHHHHHCCC T ss_conf 999999985788 No 87 >PRK12445 lysyl-tRNA synthetase; Reviewed Probab=98.03 E-value=1e-05 Score=59.25 Aligned_cols=116 Identities=14% Similarity=0.177 Sum_probs=72.0 Q ss_pred HHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHH Q ss_conf 7899999999999887-689638555432443100011210012555420100145677782377434312577776520 Q gi|254781004|r 272 GWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNH 350 (652) Q Consensus 272 G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~ 350 (652) =.+++.+|...+|+.+ ++||.||.||.|.+. -.|- . ..-|.+-. ...+.+++|+=. -++|++ T Consensus 183 ~~~~Rs~ii~~iR~~l~~~gF~EVeTPil~~~----~gGa-~---Arpf~t~~----~~~~~~~yL~qS-----PQLykk 245 (505) T PRK12445 183 TFVVRSKILAAIRQFMVARGFMEVETPMMQVI----PGGA-S---ARPFITHH----NALDLDMYLRIA-----PELYLK 245 (505) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEECCCCCCC----CCCC-C---CCEEEECC----CCCCCCEEECCC-----CHHHHH T ss_conf 99999999999999999779389978875554----5876-6---61033046----678867434169-----399999 Q ss_pred CC-CCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHHHHHH Q ss_conf 12-2223104320001000127876452000105635334421-0105376788789999998 Q gi|254781004|r 351 GL-KSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIHNLI 411 (652) Q Consensus 351 ~~-~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~~~~ 411 (652) .+ -| .+ =|+||||.+||||-+.+ -|.-||||-|.| +|+..+++.+-...++..+ T Consensus 246 ~Lmvg--Gf-dRvfeI~r~FRnE~~~~----~H~PEFT~lE~e~af~d~~dvm~l~E~li~~i 301 (505) T PRK12445 246 RLVVG--GF-ERVFEINRNFRNEGISV----RHNPEFTMMELYMAYADYHDLIELTESLFRTL 301 (505) T ss_pred HHHHC--CC-CCEEEHHHHHHCCCCCC----CCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 99857--97-44576278760788886----65444543568763278989999999999999 No 88 >TIGR00344 alaS alanyl-tRNA synthetase; InterPro: IPR002318 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Alanyl-tRNA synthetase (6.1.1.7 from EC) is an alpha4 tetramer that belongs to class IIc. ; GO: 0004813 alanine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006419 alanyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=97.94 E-value=3.3e-05 Score=55.77 Aligned_cols=141 Identities=21% Similarity=0.399 Sum_probs=107.3 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCC-CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCE Q ss_conf 0798628999999999998999999999997799889972435-885578840888888989999999999999669980 Q gi|254781004|r 58 SIEIITPEDPRSLAVIRHSCAHIMAEAVQSIWPEVQVAIGPII-EDGFYYDFDKEQPFSSDELAQIEKKMQEIIARDSPF 136 (652) Q Consensus 58 ~i~~i~~~~~eg~~i~~hS~ahlL~~Av~~l~p~~kl~igp~i-~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~~~~pi 136 (652) .+....-........-.||+.|++..|++++....--..|..+ .+-.-+||.....++.+++..|+...++.+.+++|+ T Consensus 583 ~~~~~~~~~~~~~~~~~h~~~h~~~~~~~~~~g~~~~~~g~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 662 (914) T TIGR00344 583 KVTAVVDEKRRLRLKRNHTATHLLHAALKKVLGPHVWQAGSLVSAKKLRFDFSHPRALTKEELEEVEDLVNEQILANLPV 662 (914) T ss_pred EEEEEECHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCC T ss_conf 14553104566655520037899999999864112221012000013210010000267778999999999998752250 Q ss_pred EEEECCHHHHHHHHHHCCCHHHHHHHHHCCCCCCCEEECCCC------CEEECCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 485327678544544167113556654203455421302575------201000235521444222333333211 Q gi|254781004|r 137 LKQYLSHEKARELFESKKEAYKVEILESIPAKENVTLYRQGE------WFDLCRGPHVRSTGQVKKFFKLMKVAG 205 (652) Q Consensus 137 ~r~~vs~~eA~~~F~~~~~~~K~~li~~~~~~~~v~~y~~g~------~~Dlc~GphvpsTg~ik~~fkL~~~sg 205 (652) ....++.+.|.+.+...-+.. ....++.+.+...++ -+.+|.|.|+.+||.++ .|++.+.+| T Consensus 663 ~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~cgg~h~~~~g~~g-~~~~~~~~~ 730 (914) T TIGR00344 663 KVEFLDLDEAKAKGALALFGE------GYGPGEKVRVVSVGDPNGELYSVELCGGTHVENTGDIG-LFKILKESG 730 (914) T ss_pred HHHHHCHHHHHHHHHHHHCCC------CCCCCCEEEEEECCCCCCCEEEEEECCCCCCCCCCCCC-EEEEEEHHH T ss_conf 121201344554300110024------45776405788507754210121001563112322200-023321000 No 89 >cd00776 AsxRS_core Asx tRNA synthetase (AspRS/AsnRS) class II core domain. Assignment to class II aminoacyl-tRNA synthetases (aaRS) based upon its structure and the presence of three characteristic sequence motifs in the core domain. This family includes AsnRS as well as a subgroup of AspRS. AsnRS and AspRS are homodimers, which attach either asparagine or aspartate to the 3'OH group of ribose of the appropriate tRNA. While archaea lack asnRS, they possess a non-discriminating aspRS, which can mischarge Asp-tRNA with Asn. Subsequently, a tRNA-dependent aspartate amidotransferase converts the bound aspartate to asparagine. The catalytic core domain is primarily responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Probab=97.93 E-value=4.4e-05 Score=54.89 Aligned_cols=114 Identities=18% Similarity=0.306 Sum_probs=69.0 Q ss_pred HHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHH Q ss_conf 7899999999999887-689638555432443100011210012555420100145677782377434312577776520 Q gi|254781004|r 272 GWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNH 350 (652) Q Consensus 272 G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~ 350 (652) =.+++.+|...+|+.+ ++||.||.||.|+...- -|..+ .|.+.. -++.+.|+=.. ++|++ T Consensus 23 ~l~~Rs~i~~~iR~fl~~~gF~EVeTP~L~~~~~---eg~a~-----~f~~~~------~~~~~yL~~Sp-----el~Kq 83 (322) T cd00776 23 IFRIRSEVLRAFREFLRENGFTEVHTPKITSTDT---EGGAE-----LFKVSY------FGKPAYLAQSP-----QLYKE 83 (322) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCC---CCCCC-----CCCCCC------CCCCEEECCCH-----HHHHH T ss_conf 9999999999999999988999997984007889---86546-----676210------58760015485-----88999 Q ss_pred CCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EEC-CHHHHHHHHHHHHHH Q ss_conf 122223104320001000127876452000105635334421-010-537678878999999 Q gi|254781004|r 351 GLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFC-TKEQMFNECLKIHNL 410 (652) Q Consensus 351 ~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~-~~eq~~~e~~~~~~~ 410 (652) ..-+ -+. |+||+|.+||||-+.+. .-.-||||-|.+ .|+ .-+++.+.+..++.. T Consensus 84 lli~--G~~-rVfei~~~FR~E~~~t~---rH~pEFTmlE~e~af~~d~~d~m~~~E~li~~ 139 (322) T cd00776 84 MLIA--ALE-RVYEIGPVFRAEKSNTR---RHLSEFWMLEAEMAFIEDYNEVMDLIEELIKY 139 (322) T ss_pred HHHC--CCC-CEEEECCEEECCCCCCC---CCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 9753--424-43887261407999875---34788876525552332899999999999999 No 90 >cd00669 Asp_Lys_Asn_RS_core Asp_Lys_Asn_tRNA synthetase class II core domain. This domain is the core catalytic domain of class II aminoacyl-tRNA synthetases of the subgroup containing aspartyl, lysyl, and asparaginyl tRNA synthetases. It is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs. Nearly all class II tRNA synthetases are dimers and enzymes in this subgroup are homodimers. These enzymes attach a specific amino acid to the 3' OH group of ribose of the appropriate tRNA. Probab=97.91 E-value=6.1e-05 Score=53.94 Aligned_cols=115 Identities=19% Similarity=0.201 Sum_probs=72.9 Q ss_pred HHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHH-HHHH Q ss_conf 899999999999887-68963855543244310001121001255542010014567778237743431257777-6520 Q gi|254781004|r 273 WKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVA-VFNH 350 (652) Q Consensus 273 ~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~-i~~~ 350 (652) ++++..|.+.+|+-+ ++||.||.||.|.... ..|. -++|.+... ..+..+.|.-. =+.|.. +... T Consensus 1 lr~Rs~i~~~iR~ff~~~~f~EV~TP~l~~~~---~~~~-----~~~f~~~~~----~~~~~~~L~~S-pel~~k~ll~~ 67 (269) T cd00669 1 FKVRSKIIKAIRDFMDDRGFLEVETPMLQKIT---GGAG-----ARPFLVKYN----ALGLDYYLRIS-PQLFKKRLMVG 67 (269) T ss_pred CCHHHHHHHHHHHHHHHCCCEEEECCEEECCC---CCCC-----CCEEEEECC----CCCCEEEECCC-HHHHHHHHHHC T ss_conf 91699999999999998898998798530578---9876-----742573137----89940773438-89999999865 Q ss_pred CCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHHHHHH Q ss_conf 122223104320001000127876452000105635334421-0105376788789999998 Q gi|254781004|r 351 GLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIHNLI 411 (652) Q Consensus 351 ~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~~~~ 411 (652) .+ =++||+|.+||+|-++++ ..-||||-|.| .|...+++.+....++..+ T Consensus 68 ------g~-~~if~i~~~FR~e~~~~r----H~~EFtmlE~y~~~~d~~~~m~~~e~li~~~ 118 (269) T cd00669 68 ------GL-DRVFEINRNFRNEDLRAR----HQPEFTMMDLEMAFADYEDVIELTERLVRHL 118 (269) T ss_pred ------CC-CCEEEEECCCCCCCCCCC----CCHHHHHHHHHCCCCCHHHHHHHHHHHHHHH T ss_conf ------88-867998462107898655----4348775787514899999999999999999 No 91 >COG3705 HisZ ATP phosphoribosyltransferase involved in histidine biosynthesis [Amino acid transport and metabolism] Probab=97.89 E-value=4.7e-05 Score=54.71 Aligned_cols=152 Identities=18% Similarity=0.238 Sum_probs=105.9 Q ss_pred CCCCCEEEEECCHHHHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCC-CCCCCCCCCCCEEECCCCCCCCCCEEEEC Q ss_conf 67874356501078999999999998876-89638555432443100011-21001255542010014567778237743 Q gi|254781004|r 260 EDGSGVIFWHRKGWKIFQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQS-GHWDSYRANMFAVKCADDTIKDLRTFALK 337 (652) Q Consensus 260 ~~~~G~~~wlP~G~~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~S-Gh~~~y~~~mf~~~~~~~~~~~~~~~~Lk 337 (652) +.-.|+-.-||.=++..+.|.+.++..+. +||+.|.||+|.+.+....- |+- -...||.+.. ..++.+||| T Consensus 4 ~lp~g~rd~Lp~e~~~~~~i~~~l~~~f~~~Gy~~v~tP~lE~~d~~l~~~g~~--l~~~~f~l~d-----~~g~~l~LR 76 (390) T COG3705 4 QLPEGIRDVLPLEARRKEEIRDQLLALFRAWGYERVETPTLEPADPLLDGAGED--LRRRLFKLED-----ETGGRLGLR 76 (390) T ss_pred CCCCCCHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCHH--HHHHHEEEEC-----CCCCEEEEC T ss_conf 688752010636776479999999999998087404663134026654022255--5522068765-----789768835 Q ss_pred CCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECC-HHHHHHHHHHHHHHHHHHHH Q ss_conf 43125777765201222231043200010001278764520001056353344210105-37678878999999887665 Q gi|254781004|r 338 PMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCT-KEQMFNECLKIHNLIISIYK 416 (652) Q Consensus 338 Pmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~-~eq~~~e~~~~~~~~~~v~~ 416 (652) |-=-+..+.+..+...+ .|.|++=.|++||.. ++ ++| |.+||+|-=...+=. .-++..| ++.....+.+ T Consensus 77 pD~T~pVaR~~~~~~~~---~P~Rl~Y~G~Vfr~~-~~-~~g--~~~Ef~QaGiEllG~~~~~ADaE---vi~la~~~L~ 146 (390) T COG3705 77 PDFTIPVARIHATLLAG---TPLRLSYAGKVFRAR-EG-RHG--RRAEFLQAGIELLGDDSAAADAE---VIALALAALK 146 (390) T ss_pred CCCCHHHHHHHHHHCCC---CCCEEEECCHHHHCC-HH-CCC--CCCCHHHHHHHHHCCCCCHHHHH---HHHHHHHHHH T ss_conf 43308999999982378---871466401454020-10-267--64204553367737872000289---9999999999 Q ss_pred HHCCCCCEECCC Q ss_conf 521332100014 Q gi|254781004|r 417 DFGFEKIMVKLS 428 (652) Q Consensus 417 ~~g~~~~~~~ls 428 (652) ..|+....+.+. T Consensus 147 ~~gl~~~~l~LG 158 (390) T COG3705 147 ALGLADLKLELG 158 (390) T ss_pred HCCCCCEEEEEC T ss_conf 748867489945 No 92 >cd00777 AspRS_core Asp tRNA synthetase (aspRS) class II core domain. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs. AspRS is a homodimer, which attaches a specific amino acid to the 3' OH group of ribose of the appropriate tRNA. The catalytic core domain is primarily responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. AspRS in this family differ from those found in the AsxRS family by a GAD insert in the core domain. Probab=97.88 E-value=4.2e-05 Score=55.05 Aligned_cols=117 Identities=19% Similarity=0.265 Sum_probs=70.5 Q ss_pred HHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHC Q ss_conf 899999999999887-6896385554324431000112100125554201001456777823774343125777765201 Q gi|254781004|r 273 WKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHG 351 (652) Q Consensus 273 ~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~ 351 (652) ++++..|.+.+|+.+ ++||.||.||.|.... .|-. ..|.... ....+..|.|.-.. + +|++. T Consensus 1 lr~Rs~i~~~iR~ff~~~gflEV~TPiL~~~~----~ega-----~~f~~~~---~~~~~~~~~L~~Sp-e----l~~k~ 63 (280) T cd00777 1 LRLRSRVIKAIRNFLDEQGFVEIETPILTKST----PEGA-----RDFLVPS---RLHPGKFYALPQSP-Q----LFKQL 63 (280) T ss_pred CCHHHHHHHHHHHHHHHCCCEEEECCCCCCCC----CCCC-----CCCEECC---CCCCCCCCCCCCCH-H----HHHHH T ss_conf 96699999999999998898998798106878----7767-----6644244---16887603688798-9----99999 Q ss_pred CCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHHHHHHH Q ss_conf 22223104320001000127876452000105635334421-01053767887899999988 Q gi|254781004|r 352 LKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIHNLII 412 (652) Q Consensus 352 ~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~~~~~ 412 (652) .-. -.+ =++||+|.+||+|.+++. |--||||-|+| .|...+++.+....++..+. T Consensus 64 l~~-~g~-~rvf~i~~~FR~E~~~~~----h~~EFtmLE~e~~~~~~~d~m~~~E~li~~i~ 119 (280) T cd00777 64 LMV-SGF-DRYFQIARCFRDEDLRAD----RQPEFTQIDIEMSFVDQEDIMSLIEGLLKYVF 119 (280) T ss_pred HHH-CCC-CCCEEEECEECCCCCCCC----CCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH T ss_conf 985-687-775798451747888876----63477623442268889999999999999999 No 93 >PRK06462 asparagine synthetase A; Reviewed Probab=97.79 E-value=0.00014 Score=51.53 Aligned_cols=146 Identities=15% Similarity=0.105 Sum_probs=83.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCEEEECCCCCCEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCC Q ss_conf 9999999999753200887541144044226787435650107899999999999887-689638555432443100011 Q gi|254781004|r 230 ELTQYLYFLEESEKRDHRKLAREMDLFHIAEDGSGVIFWHRKGWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQS 308 (652) Q Consensus 230 ~l~~~~~~~eea~~rdHr~lg~~~~lf~~~~~~~G~~~wlP~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~S 308 (652) +++.|...+++ +.++|.+- .-++. .-.=.+++.+|.+.+|+.+ ++||.||.||.|++.. + T Consensus 1 ~~~~~~~~~~~--~~~~r~~~------ir~~~-------~~~ifriRs~i~~~iR~f~~~~gF~EV~TPiL~~~~----~ 61 (332) T PRK06462 1 DLERYLKMIEE--WVRYSWKI------ISSEK-------YSLVLKIQSSILRYTREFLDGRGFVEVLPPIISPST----D 61 (332) T ss_pred CHHHHHHHHHH--HHHHHHHC------CCCHH-------HHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCC----C T ss_conf 94888878999--87302440------25878-------999999999999999999988899998797236778----8 Q ss_pred CCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHC-CCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCC Q ss_conf 2100125554201001456777823774343125777765201-222231043200010001278764520001056353 Q gi|254781004|r 309 GHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHG-LKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFT 387 (652) Q Consensus 309 Gh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~-~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~ 387 (652) |-.. +.|.+.. +--++++.|+... ++|++. +-+ .++ |+||+|.+||+|-..+ ....-.-||| T Consensus 62 ~~~~----~~~~~~~----~~~~~~~~L~~Sp-----ql~lk~li~~--g~~-rVfeIg~~FR~E~~d~-~t~rHlpEFt 124 (332) T PRK06462 62 PLMG----DAKPASI----DFYGVEYYLADSM-----IFHKQLMLRL--LKG-KVFYLSPNFRLEPVDK-DTGRHLYEFT 124 (332) T ss_pred CCCC----CCCCCEE----ECCCCCEEECCCH-----HHHHHHHHHC--CCC-CEEEECCHHHCCCCCC-CCCCCCHHHH T ss_conf 7667----7601058----4489863314586-----9999999863--899-7799745231687776-7776545777 Q ss_pred CCCCEE-ECCHHHHHHHHHHHHHHH Q ss_conf 344210-105376788789999998 Q gi|254781004|r 388 QDDAHV-FCTKEQMFNECLKIHNLI 411 (652) Q Consensus 388 ~~Dah~-F~~~eq~~~e~~~~~~~~ 411 (652) |-|+|. |...+++.+....++..+ T Consensus 125 ~lE~y~a~~d~~d~m~~~E~li~~i 149 (332) T PRK06462 125 QLDIEIEGADLEEVMSLAEDLIKYL 149 (332) T ss_pred HHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf 6558787389999999999999999 No 94 >COG2269 Truncated, possibly inactive, lysyl-tRNA synthetase (class II) [Translation, ribosomal structure and biogenesis] Probab=97.69 E-value=0.00033 Score=48.83 Aligned_cols=133 Identities=20% Similarity=0.262 Sum_probs=88.8 Q ss_pred CCEEEEECCH----HHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEEC Q ss_conf 7435650107----899999999999887-68963855543244310001121001255542010014567778237743 Q gi|254781004|r 263 SGVIFWHRKG----WKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALK 337 (652) Q Consensus 263 ~G~~~wlP~G----~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~Lk 337 (652) ++.+.|.|-- ..-|+.|.+-+|.-+ ++|+.||.||.|+..-.- --|+..|.-+-+. ........+.|. T Consensus 2 ~~~~~W~p~~~~~~ll~Ra~i~~~iR~FF~erg~lEVeTp~Ls~a~vt--d~hL~~F~Te~~~-----~~~~~~~~l~L~ 74 (322) T COG2269 2 SETPWWQPSASIDNLLKRAAIIAAIRRFFAERGVLEVETPALSVAPVT--DIHLHPFETEFLG-----PGGAKGKPLWLH 74 (322) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEECCHHHHCCCCC--CCCEEEEEEEEEC-----CCCCCCCEEEEE T ss_conf 988768888888999989999999999998769258533676507898--6423445667654-----676665303665 Q ss_pred CCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHHHHHHHH Q ss_conf 4312577776520122223104320001000127876452000105635334421-010537678878999999887 Q gi|254781004|r 338 PMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIHNLIIS 413 (652) Q Consensus 338 Pmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~~~~~~ 413 (652) +.-+-+.--+.....+ .+||++.+|||+-.|.+| --||||-+=| .+|......+++...++.+.. T Consensus 75 TSPEy~mKrLLAag~~-------~ifql~kvfRN~E~G~~H----~PEFTMLEWYrv~~d~~~lm~e~~~Ll~~vl~ 140 (322) T COG2269 75 TSPEYHMKRLLAAGSG-------PIFQLGKVFRNEEMGRLH----NPEFTMLEWYRVGCDYYRLMNEVDDLLQLVLE 140 (322) T ss_pred CCCHHHHHHHHHCCCC-------CCHHHHHHHHCCCCCCCC----CCCEEEEEEECCCCCHHHHHHHHHHHHHHHHH T ss_conf 6807778999975488-------601454777263325667----98436765100678599999999999999970 No 95 >TIGR02367 PylS pyrrolysyl-tRNA synthetase; InterPro: IPR012739 PylS is the archaeal enzyme responsible for charging the pyrrolysine tRNA, PylT, by ligating a free molecule of pyrrolysine. Pyrrolysine is encoded at an in-frame UAG (amber) at least in several corrinoid-dependent methyltransferases of the archaeal genera Methanosarcina and Methanococcoides, e.g. trimethylamine methyltransferase. . Probab=97.58 E-value=3.1e-05 Score=55.93 Aligned_cols=124 Identities=18% Similarity=0.297 Sum_probs=83.1 Q ss_pred HHHHHHHHHHHH-HHHCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCC Q ss_conf 999999999998-8768963855543244310001121001255542010014567778237743431257777652012 Q gi|254781004|r 274 KIFQTLISYMRR-KIKDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGL 352 (652) Q Consensus 274 ~i~~~ie~~ir~-~~~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~ 352 (652) --+-+||+-+.. -.++||.||..|.|.|.+--+.-|= ++-.+ ......|-+..+|||||-.|.-+...+.-- T Consensus 240 ~ylGkler~it~ffvdrGfleiksPiliP~ey~ermGi-~ndte------lskqifrvdkn~ClrPmlaP~lynylrkld 312 (453) T TIGR02367 240 DYLGKLEREITKFFVDRGFLEIKSPILIPLEYVERMGI-DNDTE------LSKQIFRVDKNLCLRPMLAPTLYNYLRKLD 312 (453) T ss_pred HHHHHHHHHHHHHHHHCCEEEECCCCCCCHHHHHHCCC-CCCHH------HHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 67546766676665312402323631010568865077-64025------666666441001012100056899999987 Q ss_pred CCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCC---CEEECCHHHHHHHHHHHHH Q ss_conf 22231043200010001278764520001056353344---2101053767887899999 Q gi|254781004|r 353 KSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDD---AHVFCTKEQMFNECLKIHN 409 (652) Q Consensus 353 ~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~D---ah~F~~~eq~~~e~~~~~~ 409 (652) |.--+ |++++|+|.+||.|--| ---.-||||-. +-+=|+.|.++.-+..+++ T Consensus 313 r~lP~-Pik~fe~GPCyrkesdG----kehleeftm~nfCqmGsGCtrenle~~i~~fl~ 367 (453) T TIGR02367 313 RILPD-PIKVFEVGPCYRKESDG----KEHLEEFTMLNFCQMGSGCTRENLEALIKEFLD 367 (453) T ss_pred HHCCC-CEEEEEECCCCCCCCCC----HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHH T ss_conf 53789-64798606752435653----356666665554314787326779999999887 No 96 >TIGR00691 spoT_relA RelA/SpoT family protein; InterPro: IPR004811 The functions of Escherichia coli RelA and SpoT differ somewhat. RelA (2.7.6.5 from EC) produces pppGpp (or ppGpp) from ATP and GTP (or GDP). SpoT (3.1.7.2 from EC) degrades ppGpp, but may also act as a secondary ppGpp synthetase. The two proteins are strongly similar. In many species, a single homolog to SpoT and RelA appears responsible for both ppGpp synthesis and ppGpp degradation. (p)ppGpp is a regulatory metabolite of the stringent response, but appears also to be involved in antibiotic biosynthesis in some species. ; GO: 0015969 guanosine tetraphosphate metabolic process. Probab=97.57 E-value=0.00019 Score=50.53 Aligned_cols=72 Identities=25% Similarity=0.430 Sum_probs=61.6 Q ss_pred CCEEEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEE-CCEEECCC-EEEEECCCCHH-HHHHHH Q ss_conf 87089928998874189888999998636110112899998988997-74530367-07986289999-999999 Q gi|254781004|r 3 SDIKLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCD-LSDPVREG-SIEIITPEDPR-SLAVIR 74 (652) Q Consensus 3 ~~i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~d-L~~~l~d~-~i~~i~~~~~e-g~~i~~ 74 (652) +.|-|==|-|.+.+.|.|.|+.|+|-.+........+.|+|||+... |+++|+++ -|+|+|-++.. |.+.-| T Consensus 380 ~~i~VFtP~G~~~~Lp~Gs~~vDfAY~vHtd~G~~~~ga~~ng~~~~Sl~~~L~nG~vVEi~t~k~~~aGP~~~W 454 (741) T TIGR00691 380 EEIYVFTPKGDVVELPSGSTPVDFAYAVHTDVGHKCTGAKVNGKIVPSLDKELENGDVVEIITGKNSNAGPSVDW 454 (741) T ss_pred CCCEEECCCEEEEEECCCCCHHHHHHHHCHHHCCEEEEEEECCEEECCCCEEECCCCEEEEEECCCCCCCCCHHH T ss_conf 542463683008874689956677876330243436746897888034450103786899985788776788798 No 97 >PRK07440 hypothetical protein; Provisional Probab=97.19 E-value=0.0018 Score=43.76 Aligned_cols=61 Identities=25% Similarity=0.388 Sum_probs=51.0 Q ss_pred CCCCEEEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECC----EEECC-CEEEEECC Q ss_conf 988708992899887418988899999863611011289999898899774----53036-70798628 Q gi|254781004|r 1 MPSDIKLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLS----DPVRE-GSIEIITP 64 (652) Q Consensus 1 M~~~i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~----~~l~d-~~i~~i~~ 64 (652) ||+.|+|++ +|..++++.++|+.++.+.. ++....||+.+|+++.+=+ +.|++ +.||++++ T Consensus 1 m~~~i~I~v-NGe~~~~~~~~ti~~LL~~l--~~~~~~vaVE~N~~Iv~r~~~~~~~l~~gD~iEIv~~ 66 (70) T PRK07440 1 MSNPITLQV-NGETRTCSPGTTLPDLLQQL--GFNPRLVAVEYNGEILHRQFWEQTQVQPGDRLEIVTI 66 (70) T ss_pred CCCCCEEEE-CCEEEECCCCCCHHHHHHHC--CCCCCEEEEEECCEEECHHHCCCEECCCCCEEEEEEE T ss_conf 998419999-99887859999899999876--9998769999688670867707106799999999999 No 98 >COG1190 LysU Lysyl-tRNA synthetase (class II) [Translation, ribosomal structure and biogenesis] Probab=96.84 E-value=0.0085 Score=39.15 Aligned_cols=131 Identities=19% Similarity=0.232 Sum_probs=75.1 Q ss_pred HHHCCEEEECCCCCCEEEEECCHHHHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCC Q ss_conf 411440442267874356501078999999999998876-8963855543244310001121001255542010014567 Q gi|254781004|r 250 AREMDLFHIAEDGSGVIFWHRKGWKIFQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTI 328 (652) Q Consensus 250 g~~~~lf~~~~~~~G~~~wlP~G~~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~ 328 (652) .+-+||+.=.+. -.-...|-+|.+.+|+.+. +||.||.||.+++.- |-... ++ |.+ ..+ T Consensus 165 ~RylDLi~N~e~--------r~~f~~Rs~ii~~iR~fl~~~gFlEVETP~lq~i~-----GGA~A-rP--F~T--hhN-- 224 (502) T COG1190 165 QRYLDLIVNPES--------RQTFIKRSKIIRAIREFLDDRGFLEVETPMLQPIP-----GGAAA-RP--FIT--HHN-- 224 (502) T ss_pred HHHHHHHCCHHH--------HHHHHHHHHHHHHHHHHHHHCCCEEECCCCCCCCC-----CCCCC-CC--CEE--EEC-- T ss_conf 888898659999--------99999999999999999987797584160003557-----87322-65--012--202-- Q ss_pred CCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHH Q ss_conf 77823774343125777765201222231043200010001278764520001056353344210105376788789999 Q gi|254781004|r 329 KDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIH 408 (652) Q Consensus 329 ~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~ 408 (652) ..+-++.||=..| +.-+.-+.+ -+ =|.||+|..||||---++ ---||||-++| |+...|..++ T Consensus 225 ald~dlyLRIApE----LyLKRliVG--G~-erVfEIgr~FRNEGid~t----HNPEFTmlE~Y------~AYaDy~D~m 287 (502) T COG1190 225 ALDMDLYLRIAPE----LYLKRLIVG--GF-ERVFEIGRNFRNEGIDTT----HNPEFTMLEFY------QAYADYEDLM 287 (502) T ss_pred CCCCCEEEEECCH----HHHHHHHHC--CC-HHHEEECCCCCCCCCCCC----CCCCHHHHHHH------HHHHHHHHHH T ss_conf 3677568762418----999988753--72-142230552003787666----48423568999------9985799999 Q ss_pred HHHHHHHHH Q ss_conf 998876655 Q gi|254781004|r 409 NLIISIYKD 417 (652) Q Consensus 409 ~~~~~v~~~ 417 (652) ++....++. T Consensus 288 ~ltE~Li~~ 296 (502) T COG1190 288 DLTEELIKE 296 (502) T ss_pred HHHHHHHHH T ss_conf 999999999 No 99 >PRK04172 pheS phenylalanyl-tRNA synthetase subunit alpha; Provisional Probab=96.74 E-value=0.0075 Score=39.53 Aligned_cols=142 Identities=18% Similarity=0.263 Sum_probs=80.7 Q ss_pred HHHHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCC----------- Q ss_conf 78999999999998876-896385554324431000112100125554201001456777823774343----------- Q gi|254781004|r 272 GWKIFQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPM----------- 339 (652) Q Consensus 272 G~~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPm----------- 339 (652) ...-+.++.+.+|+-+- -||+|+..|.+ ..+.|- -|-+|.|+.....|.++--|.=.|. T Consensus 243 ~~Hp~~~~~~~~r~i~~~mGF~e~~g~~v-e~~fwN--------fdaLf~Pq~HPARd~~Dtf~l~~~~~~~lp~~~~~~ 313 (501) T PRK04172 243 KKHPYREFIEEVRDILVEMGFEEVKGPIV-ETEFWN--------FDALFQPQDHPAREMQDTFYLKYPSKGELPEELVER 313 (501) T ss_pred CCCHHHHHHHHHHHHHHHCCCEEEECCCE-ECCCCC--------HHHHCCCCCCCCCCCCCEEEECCCCCCCCCHHHHHH T ss_conf 74889999999999999789768536610-100003--------343058989985554453786576545573788988 Q ss_pred ----CHH------------------------HHHHHHHH-CCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCC Q ss_conf ----125------------------------77776520-1222231043200010001278764520001056353344 Q gi|254781004|r 340 ----NCP------------------------GHVAVFNH-GLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDD 390 (652) Q Consensus 340 ----ncp------------------------~h~~i~~~-~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~D 390 (652) |+- .|+.-... ..-...+-|+|++-+|+|||+|.--+ +..-+|.|-| T Consensus 314 Vk~~HE~Gg~~gS~Gw~y~w~~~~~~~~~lRthtt~~~~r~l~~~~~~p~~~f~~~rvfR~e~~d~----tH~~~F~Q~e 389 (501) T PRK04172 314 VKEVHEHGGDTGSRGWGYKWDEDIAKRLVLRTHTTALSARYLAERPEPPGKYFSIGRVFRPETIDA----THLPEFYQLE 389 (501) T ss_pred HHHHHHCCCCCCCCCCCCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCCCEEEECCCCEEECCCCCC----CCCCEEEEEE T ss_conf 999874277678877677778556643010436746679999746899835735650542688886----5441332224 Q ss_pred CEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCC Q ss_conf 2101053767887899999988766552133210001465 Q gi|254781004|r 391 AHVFCTKEQMFNECLKIHNLIISIYKDFGFEKIMVKLSTR 430 (652) Q Consensus 391 ah~F~~~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~ 430 (652) +-+... .-.+...+.....++..+|+.....+-+.- T Consensus 390 g~v~~~----~~~~~~L~g~l~~f~~~~g~~~~rfrp~yf 425 (501) T PRK04172 390 GIVMGE----DVSFRHLLGILKEFYKRLGFEEVKFRPAYF 425 (501) T ss_pred EEEECC----CCCHHHHHHHHHHHHHHHCCCCEEECCCCC T ss_conf 899848----887999999999999985886078679989 No 100 >COG0017 AsnS Aspartyl/asparaginyl-tRNA synthetases [Translation, ribosomal structure and biogenesis] Probab=96.73 E-value=0.015 Score=37.43 Aligned_cols=112 Identities=17% Similarity=0.286 Sum_probs=71.9 Q ss_pred HHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHC Q ss_conf 899999999999887-6896385554324431000112100125554201001456777823774343125777765201 Q gi|254781004|r 273 WKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHG 351 (652) Q Consensus 273 ~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~ 351 (652) ++|++.|...+|+.+ ++||.||.||.|...+-= =+-++|..... +++..|- ..-++|+.. T Consensus 134 ~kirs~i~~a~~eff~~~gF~eV~tP~i~~~~~E--------Gg~elF~v~yf------~~~a~Lt-----qS~QLyke~ 194 (435) T COG0017 134 FKIRSSILRAIREFFYENGFTEVHTPIITASATE--------GGGELFKVDYF------DKEAYLT-----QSPQLYKEA 194 (435) T ss_pred HHHHHHHHHHHHHHHHHCCCEEECCCEEECCCCC--------CCCEEEEEEEC------CCCEEEE-----CCHHHHHHH T ss_conf 9499999999999997589589659468534789--------98505777506------8566771-----478899999 Q ss_pred CCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHHHH Q ss_conf 22223104320001000127876452000105635334421-01053767887899999 Q gi|254781004|r 352 LKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIHN 409 (652) Q Consensus 352 ~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~~ 409 (652) .-.- | =|.|+||-.||-|.|-++. -+-||+|-|+. .|++-+++.+-...++. T Consensus 195 ~~~a--l-~rVf~igP~FRAE~s~T~R---HL~EF~~ld~Emaf~~~~d~m~l~E~~i~ 247 (435) T COG0017 195 LAAA--L-ERVFTIGPTFRAEKSNTRR---HLSEFWMLDPEMAFADLNDVMDLAEELIK 247 (435) T ss_pred HHHH--H-CCEEEECCCEECCCCCCCC---HHHHHHEECCEECCCCHHHHHHHHHHHHH T ss_conf 9998--5-7648956725547789700---36667330211136768889999999999 No 101 >PTZ00326 phenylalanyl-tRNA synthetase; Provisional Probab=96.60 E-value=0.02 Score=36.56 Aligned_cols=141 Identities=15% Similarity=0.187 Sum_probs=81.5 Q ss_pred HHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCC----------- Q ss_conf 899999999999887-68963855543244310001121001255542010014567778237743431----------- Q gi|254781004|r 273 WKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMN----------- 340 (652) Q Consensus 273 ~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmn----------- 340 (652) ..-+.++.+.+|+-+ .-||.|+.+|.....+.|-- |-+|.|+.....+.++--|.=.|-. T Consensus 241 ~hp~~~~~~~~r~if~~mGF~E~~~~~~ves~fwNf--------DaLf~Pq~HPARd~~DTF~l~~p~~~~~~~~~~~~~ 312 (505) T PTZ00326 241 LHPLLKVRQEFREIFMEMGFEEMPTNRWVESSFWNF--------DALFIPQQHPARDLQDTFFLSKPETSKFLDAEYVER 312 (505) T ss_pred CCHHHHHHHHHHHHHHHCCCEECCCCCEEEECCCCC--------HHHCCCCCCCCCCCCCEEEECCCCCCCCCCHHHHHH T ss_conf 688999999999999977976766897133132131--------110489899866523347744754245687899998 Q ss_pred ---------------------------------HHHHH-HHHH---HCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEEC Q ss_conf ---------------------------------25777-7652---0122223104320001000127876452000105 Q gi|254781004|r 341 ---------------------------------CPGHV-AVFN---HGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRV 383 (652) Q Consensus 341 ---------------------------------cp~h~-~i~~---~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ 383 (652) -+..+ .+++ ..+.+.-.-|.+++-+|.|||+|.-.+ +-. T Consensus 313 Vk~vHe~G~~gs~Gw~y~W~~~~a~~~~LRthTt~vs~r~l~~~~~~~~~~~~~~P~k~fsi~rv~R~e~~d~----th~ 388 (505) T PTZ00326 313 VKAVHSGGGYGSIGWQYNWKLEEARKNVLRTHTTAVSARMLYKLAQQYQKGKGFKPKKYFSIDRVFRNETLDA----THL 388 (505) T ss_pred HHHHHCCCCCCCCCCCCCCCHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCEEEECCCEEECCCCCC----CCC T ss_conf 8888506887888767787977875206365775376999998754046556889814752372641577776----534 Q ss_pred CCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCC Q ss_conf 6353344210105376788789999998876655213321000146 Q gi|254781004|r 384 RGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDFGFEKIMVKLST 429 (652) Q Consensus 384 ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~ 429 (652) -+|.|-|+-+... .-.+.........++..+|......+-+. T Consensus 389 ~~f~Q~eg~~~~~----~~~~~~l~~~l~~f~~~~g~~~~rfrp~y 430 (505) T PTZ00326 389 AEFHQVEGVVIDR----NLSLGDLMGTLREFFRRIGISKLRFKPAF 430 (505) T ss_pred CEEEEEEEEEECC----CCCHHHHHHHHHHHHHHHCCCCEEECCCC T ss_conf 0455435789648----88899999999999998498757868888 No 102 >KOG1885 consensus Probab=96.24 E-value=0.025 Score=35.91 Aligned_cols=97 Identities=23% Similarity=0.262 Sum_probs=57.2 Q ss_pred HHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHC Q ss_conf 899999999999887-6896385554324431000112100125554201001456777823774343125777765201 Q gi|254781004|r 273 WKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHG 351 (652) Q Consensus 273 ~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~ 351 (652) ++++-+|..++|.-+ ++||.||.||.|... +|-... + =|.+ ..+ ..+.++.||=..+-.|-.+.-.. T Consensus 225 f~~RakII~~iRkfld~rgFlEVETPmmn~i-----aGGA~A-k--PFIT--~hn--dldm~LylRiAPEL~lK~LvVGG 292 (560) T KOG1885 225 FRIRAKIISYIRKFLDSRGFLEVETPMMNMI-----AGGATA-K--PFIT--HHN--DLDMDLYLRIAPELYLKMLVVGG 292 (560) T ss_pred HHHHHHHHHHHHHHHHHCCCEEECCHHHCCC-----CCCCCC-C--CEEE--CCC--CCCCCEEEEECHHHHHHHHHHCC T ss_conf 9999999999999865449568444655254-----686325-7--6043--145--56755145636599999898601 Q ss_pred CCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCE Q ss_conf 22223104320001000127876452000105635334421 Q gi|254781004|r 352 LKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH 392 (652) Q Consensus 352 ~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah 392 (652) + -|.||||+.||||-- -++---|||--+.| T Consensus 293 l-------drVYEIGr~FRNEGI----DlTHNPEFTTcEfY 322 (560) T KOG1885 293 L-------DRVYEIGRQFRNEGI----DLTHNPEFTTCEFY 322 (560) T ss_pred H-------HHHHHHHHHHHHCCC----CCCCCCCCCHHHHH T ss_conf 7-------899998787630576----65458774218999 No 103 >KOG2105 consensus Probab=96.15 E-value=0.018 Score=36.92 Aligned_cols=145 Identities=14% Similarity=0.262 Sum_probs=93.5 Q ss_pred CCEEECCCEEEEECCCCHHHH-HHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCEEEEEECC-CCCCHHHHHHHHHHHH Q ss_conf 745303670798628999999-99999899999999999779988997243588557884088-8888989999999999 Q gi|254781004|r 50 LSDPVREGSIEIITPEDPRSL-AVIRHSCAHIMAEAVQSIWPEVQVAIGPIIEDGFYYDFDKE-QPFSSDELAQIEKKMQ 127 (652) Q Consensus 50 L~~~l~d~~i~~i~~~~~eg~-~i~~hS~ahlL~~Av~~l~p~~kl~igp~i~~GfYyD~~~~-~~it~~dl~~Ie~~M~ 127 (652) .+.+|+-++-..|..+.+.-+ ..-+||-.||+.+.+..+|. .+-+ . .-=.||---+..+ .++|.+.+..|++..+ T Consensus 86 ~~~~iePGt~V~V~VD~qkR~D~MQQHsGQHLitAvad~~fK-lkTt-S-WelG~~~sa~e~~~~~mTaeqvaaieqsvN 162 (415) T KOG2105 86 VNDYIEPGTTVEVAVDEQKRMDYMQQHSGQHLITAVADHLFK-LKTT-S-WELGRFRSAIELDTPSMTAEQVAAIEQSVN 162 (415) T ss_pred CCCCCCCCCEEEEEECHHHHHHHHHHHCCHHHHHHHHHHHHC-CCEE-E-EECCCEEECCCCCCCCCCHHHHHHHHHHHH T ss_conf 378779997699985578774898871322699999976403-2200-3-541451440236888746999987899988 Q ss_pred HHHHCCCCEEEEECCHHH-HHHHHHHCCCHHHHHHHHHCCCCCCCEEECCCC-CEEECCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 999669980485327678-544544167113556654203455421302575-201000235521444222333333211 Q gi|254781004|r 128 EIIARDSPFLKQYLSHEK-ARELFESKKEAYKVEILESIPAKENVTLYRQGE-WFDLCRGPHVRSTGQVKKFFKLMKVAG 205 (652) Q Consensus 128 ~ii~~~~pi~r~~vs~~e-A~~~F~~~~~~~K~~li~~~~~~~~v~~y~~g~-~~Dlc~GphvpsTg~ik~~fkL~~~sg 205 (652) +.|..-+|..-.+.+.++ -++--...+.++. ....+.+...++ --.+|-|.||.+|+.++. .|++..-. T Consensus 163 e~I~d~~~~~v~E~sl~d~eVeqVS~~~~~D~--------haG~iRvvnI~~vDSN~CCGTHvs~~Sdl~v-I~ILgtek 233 (415) T KOG2105 163 EKIRDRLPVNVRELSLDDPEVEQVSGRGLPDD--------HAGPIRVVNIEGVDSNMCCGTHVSNLSDLQV-IKILGTEK 233 (415) T ss_pred HHHHHCCCHHEEECCCCCCCHHHCCCCCCCCC--------CCCCEEEEEECCCCCCCCCCCCCCCHHHCCE-EEEEECCC T ss_conf 99873243020101159842200268998544--------6686489853575577640012443311535-89840431 Q ss_pred H Q ss_conf 1 Q gi|254781004|r 206 A 206 (652) Q Consensus 206 a 206 (652) + T Consensus 234 g 234 (415) T KOG2105 234 G 234 (415) T ss_pred C T ss_conf 3 No 104 >COG0016 PheS Phenylalanyl-tRNA synthetase alpha subunit [Translation, ribosomal structure and biogenesis] Probab=95.98 E-value=0.15 Score=30.45 Aligned_cols=158 Identities=13% Similarity=0.130 Sum_probs=91.2 Q ss_pred HHHHHHHHH--HHCCEEEECCCCCCEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 320088754--1144044226787435650107899999999999887-6896385554324431000112100125554 Q gi|254781004|r 242 EKRDHRKLA--REMDLFHIAEDGSGVIFWHRKGWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANM 318 (652) Q Consensus 242 ~~rdHr~lg--~~~~lf~~~~~~~G~~~wlP~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~m 318 (652) ++.....-| +.++.-.++-..||- ...|-+...++.+.+-+++.+ .-||+++..|. ...+.|- .+-+ T Consensus 78 ~~~~l~~~~~~~~l~~e~~dv~lp~~-~~~~G~~Hpl~~~~e~i~~iF~~mGF~~~~gp~-IE~d~~N--------FDaL 147 (335) T COG0016 78 LTPELEAAGLWERLAFEKIDVTLPGR-RIYPGSLHPLTQTIEEIEDIFLGMGFTEVEGPE-IETDFYN--------FDAL 147 (335) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCC-CCCCCCCCHHHHHHHHHHHHHHHCCCEECCCCC-CCCCCCC--------HHHH T ss_conf 88999987898862215777578886-677898670899999999999974966645882-1320225--------2432 Q ss_pred CEEECCCCCCCCCC--------EEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCC Q ss_conf 20100145677782--------3774343125777765201222231043200010001278764520001056353344 Q gi|254781004|r 319 FAVKCADDTIKDLR--------TFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDD 390 (652) Q Consensus 319 f~~~~~~~~~~~~~--------~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~D 390 (652) |.|......+.++- ..+||-=+-|..+-.......+ |+|++-+|.+||++..-+ +-.-+|.|-| T Consensus 148 n~P~dHPARdmqDTFy~~~~~~~~lLRTHTs~vq~R~l~~~~~~----P~k~~~~grvyR~D~~Da----THs~~FhQiE 219 (335) T COG0016 148 NIPQDHPARDMQDTFYLKDDREKLLLRTHTSPVQARTLAENAKI----PIKIFSPGRVYRNDTVDA----THSPEFHQIE 219 (335) T ss_pred CCCCCCCCCCCCCEEEECCCCCCEEECCCCCHHHHHHHHHCCCC----CCEEECCCCEECCCCCCC----CCCHHEEEEE T ss_conf 49999984124443787688773002366827659999847999----915845663411788776----4261013367 Q ss_pred CEEECCHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 2101053767887899999988766552133 Q gi|254781004|r 391 AHVFCTKEQMFNECLKIHNLIISIYKDFGFE 421 (652) Q Consensus 391 ah~F~~~eq~~~e~~~~~~~~~~v~~~~g~~ 421 (652) +-+... .-.+.........+++.|+.. T Consensus 220 Glvvd~----~~s~~~Lkg~L~~f~~~~fg~ 246 (335) T COG0016 220 GLVVDK----NISFADLKGTLEEFAKKFFGE 246 (335) T ss_pred EEEEEC----CCCHHHHHHHHHHHHHHHCCC T ss_conf 899958----956999999999999996377 No 105 >COG2104 ThiS Sulfur transfer protein involved in thiamine biosynthesis [Coenzyme metabolism] Probab=95.89 E-value=0.036 Score=34.80 Aligned_cols=58 Identities=22% Similarity=0.402 Sum_probs=48.2 Q ss_pred CEEEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEE----CCEEECC-CEEEEECC Q ss_conf 7089928998874189888999998636110112899998988997----7453036-70798628 Q gi|254781004|r 4 DIKLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCD----LSDPVRE-GSIEIITP 64 (652) Q Consensus 4 ~i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~d----L~~~l~d-~~i~~i~~ 64 (652) .|+|.+ +|+..+.+.++|+.|+.++.+ +....+|+.+||++.. ..+.+++ +.|+++++ T Consensus 2 ~m~i~~-ng~~~e~~~~~tv~dLL~~l~--~~~~~vav~vNg~iVpr~~~~~~~l~~gD~ievv~~ 64 (68) T COG2104 2 PMTIQL-NGKEVEIAEGTTVADLLAQLG--LNPEGVAVAVNGEIVPRSQWADTILKEGDRIEVVRV 64 (68) T ss_pred CEEEEE-CCEEEECCCCCCHHHHHHHHC--CCCCEEEEEECCEECCCHHHHHCCCCCCCEEEEEEE T ss_conf 589998-998988378873999999729--998559999889982504632013158998999986 No 106 >PRK06253 O-phosphoseryl-tRNA synthetase; Reviewed Probab=95.64 E-value=0.029 Score=35.46 Aligned_cols=69 Identities=14% Similarity=0.273 Sum_probs=47.8 Q ss_pred HHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCC Q ss_conf 2310432000100012787645200010563533442101053767887899999988766552133210001465 Q gi|254781004|r 355 YRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDFGFEKIMVKLSTR 430 (652) Q Consensus 355 YrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~ 430 (652) -+.+|++++-+..|||.|+. -.-.|.++|.|-..-+-. ++ -.+..-.......+..|||+.+..+-... T Consensus 202 k~~~P~klFSIDRcFRrEq~---eD~~hL~~yhsascvv~~-ed---Vn~d~gkav~~~lL~~fGF~~frFrPdek 270 (527) T PRK06253 202 RKPLPFKLFSIDRCFRREQK---EDASRLMTYHSASCVVAG-ED---VTVDDGKAVAEGLLSQFGFTNFRFRPDEK 270 (527) T ss_pred CCCCCEEEEEEEHEEECHHH---CCHHHHHHHHEEEEEEEC-CC---CCHHHHHHHHHHHHHHHCCCEEEECCCCC T ss_conf 67998368741010220000---245563454432269975-98---66777799999999970975378434423 No 107 >TIGR00457 asnS asparaginyl-tRNA synthetase; InterPro: IPR004522 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . AsparaginyltRNA synthetase (6.1.1.22 from EC) is an alpha2 dimer that belongs to class IIb. There is a striking similarity between asparaginyl-tRNA synthetases and archaeal/eukaryotic type aspartyl-tRNA synthetases (IPR004523 from INTERPRO) and a striking divergence of bacterial type aspartyl-tRNA synthetases (IPR004524 from INTERPRO). This family, AsnS, represents asparaginyl-tRNA synthetases from the three domains of life. Some species lack this enzyme and charge tRNA(asn) by misacylation with Asp, followed by transamidation of Asp to Asn.; GO: 0000166 nucleotide binding, 0004816 asparagine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006421 asparaginyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=95.52 E-value=0.098 Score=31.80 Aligned_cols=255 Identities=15% Similarity=0.189 Sum_probs=129.9 Q ss_pred HHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCC-----C---CCCCEEEECC----C Q ss_conf 899999999999887-6896385554324431000112100125554201001456-----7---7782377434----3 Q gi|254781004|r 273 WKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDT-----I---KDLRTFALKP----M 339 (652) Q Consensus 273 ~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~-----~---~~~~~~~LkP----m 339 (652) +|||+.|..-+++.+ ++|+..|.||.|+..+- +. +-+||.+..++.. + .-..+|.=|+ - T Consensus 146 mrvR~~l~~ai~~yF~~~gF~~V~~PilT~~d~-------EG-AGElF~v~TLdl~~lp~~~g~~df~~~fFgk~ayLTV 217 (495) T TIGR00457 146 MRVRNALSQAIHEYFQKNGFVKVSPPILTSNDC-------EG-AGELFKVSTLDLEKLPRNDGKIDFSKDFFGKEAYLTV 217 (495) T ss_pred HHHHHHHHHHHHHHHHCCCCEEECCCEEECCCC-------CC-CCCCCEEEEHHHHCCCCCCCCCCCCCCCCCCEEEEEE T ss_conf 999999999999876047867868863113688-------88-7543322003242153326888763024886012330 Q ss_pred CHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHHHHHHH-HHHHH Q ss_conf 12577776520122223104320001000127876452000105635334421-01053767887899999988-76655 Q gi|254781004|r 340 NCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKIHNLII-SIYKD 417 (652) Q Consensus 340 ncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~~~~~~-~v~~~ 417 (652) +--.|.+++..... ++|-||-.||=|.|-|. .=+=||.|=+.. -||+-++....+..+++.+. .|+.. T Consensus 218 SGQL~~E~~A~AL~-------kvyTfGPTFRAEkS~T~---RHLsEFWMiEPE~AF~~l~d~~~~~E~~~k~~~k~VL~~ 287 (495) T TIGR00457 218 SGQLYAEAYALALS-------KVYTFGPTFRAEKSNTS---RHLSEFWMIEPEMAFANLNDLLQLIETLIKYIIKAVLEN 287 (495) T ss_pred CCHHHHHHHHHHHC-------CCEECCCCEECCCCCCC---CCHHHHHHHHHHEEECCHHHHHHHHHHHHHHHHHHHHCC T ss_conf 22789999999747-------81445874004286487---011122101120000013333899999999999998305 Q ss_pred --------HCCCCC---EECCCCC--------CCCCCCCHHHH---HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC Q ss_conf --------213321---0001465--------31125860357---7899999999985021212566752055555545 Q gi|254781004|r 418 --------FGFEKI---MVKLSTR--------PEKRVGSDALW---DDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFY 475 (652) Q Consensus 418 --------~g~~~~---~~~ls~~--------~~~~~g~~~~~---~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afy 475 (652) |-.... .-.|+.. -...+-++..+ ..|...|++.-... .-+|+.+.. -+ T Consensus 288 ~qellPkqfi~~~enn~~~~L~~~~~~~~~~~l~~~i~~~F~~i~Y~~Ai~iL~~~~~~~------~~~Fe~~d~---~w 358 (495) T TIGR00457 288 QQELLPKQFICSQENNLLKFLEKNFDKDLIKKLENIINNKFARITYTDAIEILKESDNKE------KKNFEYEDF---EW 358 (495) T ss_pred HHHHCCHHHCCCCHHHHHHHHHHHCCHHHHHHHHHHHHCCCCEECHHHHHHHHHHHHHCC------CCCCCCCCC---CC T ss_conf 245042010377713799986641275677522566622676004789999987421114------643588861---34 Q ss_pred CCCCC----CCHHHHCCCCCCCCCCCHHHHCCCCCCCEEEEECCCEEEEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 54321----10021103310012110111022346741673179721379964034103789999999973870887337 Q gi|254781004|r 476 GPKFE----YILKDAIGRDWQCGTIQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFGSIERFIGIMIENFKGNLPLWLS 551 (652) Q Consensus 476 gpkid----~~~~d~~gr~~~~~tiq~df~~p~~f~l~y~~~dg~~~~pvmihr~~~Gs~eR~ia~liE~~~g~~P~wLA 551 (652) |-.+. -.+.+...+.. +.+ =.++|+.+++.|.-.+......|- |+ -=|+ T Consensus 359 G~DL~~eHER~LaE~~Fk~q--Pvf--V~~YPk~~KaFYMk~NS~D~~tV~----a~-------------------DlL~ 411 (495) T TIGR00457 359 GIDLQTEHERFLAEEYFKPQ--PVF--VTDYPKDIKAFYMKLNSDDGKTVA----AV-------------------DLLA 411 (495) T ss_pred CCCCCHHHHHHHHHCCCCCC--CEE--EECCCCCCCCHHCCCCCCCCCEEE----EE-------------------EEEC T ss_conf 88863368998763158988--889--973885558010536678887478----99-------------------9837 Q ss_pred CEEE-EEEECCHHHHHHHHHHHHHHHHCCCEE Q ss_conf 8089-998579668999999999999879889 Q gi|254781004|r 552 PIQA-IVTTITSSAVEYAQEIANLLKSHHLSI 582 (652) Q Consensus 552 P~QV-~Iipi~e~~~eya~~i~~~L~~~girV 582 (652) |--+ -||==|+.-.+ -+++.+++++.|++- T Consensus 412 P~GiGEIiGGS~Re~d-ld~L~~r~k~~g~d~ 442 (495) T TIGR00457 412 PKGIGEIIGGSEREDD-LDKLEERMKEMGLDT 442 (495) T ss_pred CCCEEEEECCCCCHHH-HHHHHHHHHHCCCCC T ss_conf 8962665167543226-899999999648688 No 108 >cd00496 PheRS_alpha_core Phenylalanyl-tRNA synthetase (PheRS) alpha chain catalytic core domain. PheRS belongs to class II aminoacyl-tRNA synthetases (aaRS) based upon its structure and the presence of three characteristic sequence motifs. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. While class II aaRSs generally aminoacylate the 3'-OH ribose of the appropriate tRNA, PheRS is an exception in that it attaches the amino acid at the 2'-OH group, like class I aaRSs. PheRS is an alpha-2/ beta-2 tetramer. Probab=95.51 E-value=0.14 Score=30.75 Aligned_cols=121 Identities=13% Similarity=0.189 Sum_probs=67.6 Q ss_pred HHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCC-------CCEEEECCCCHHHHHHHH Q ss_conf 999999998876-896385554324431000112100125554201001456777-------823774343125777765 Q gi|254781004|r 277 QTLISYMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKD-------LRTFALKPMNCPGHVAVF 348 (652) Q Consensus 277 ~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~-------~~~~~LkPmncp~h~~i~ 348 (652) ..+.+-+++.+. -||+++..|.+.+. |.+| +.++.+......+.. +...+||+.+-++..... T Consensus 4 ~~~~~~i~~~f~~~GF~~~~gp~ie~~--------~~NF-D~Ln~P~dHPaR~~~DTfy~~~~~~~lLRtHTS~~q~r~l 74 (218) T cd00496 4 NKVIEEIEDIFVSMGFTEVEGPEVETD--------FYNF-DALNIPQDHPARDMQDTFYINDPARLLLRTHTSAVQARAL 74 (218) T ss_pred HHHHHHHHHHHHHCCCEEECCCCCCCC--------HHHH-HHHCCCCCCCCCCCCCEEEECCCCCCEECCCCCHHHHHHH T ss_conf 899999999998779868228951231--------0304-5338598875537243278637766021578756679999 Q ss_pred HHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 20122223104320001000127876452000105635334421010537678878999999887665521 Q gi|254781004|r 349 NHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDFG 419 (652) Q Consensus 349 ~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~g 419 (652) ++. .-|+++.-+|.|||+|.-.+ .-.-+|.|-|+-.-... -............++.++ T Consensus 75 ~~~-----~~p~~~~~~G~VyR~D~iDa----tH~~~FhQ~Eg~~v~~~----~~~~~Lk~~l~~~~~~lf 132 (218) T cd00496 75 AKL-----KPPIRIFSIGRVYRNDEIDA----THLPEFHQIEGLVVDKG----LTFADLKGTLEEFAKELF 132 (218) T ss_pred HHC-----CCCCEEEECCCEEECCCCCC----CCCCEEEEEEEEEEECC----CCHHHHHHHHHHHHHHHC T ss_conf 745-----89817994250465388984----31531257637999565----119999999999999973 No 109 >PRK05659 sulfur carrier protein ThiS; Validated Probab=95.37 E-value=0.062 Score=33.18 Aligned_cols=57 Identities=25% Similarity=0.454 Sum_probs=45.4 Q ss_pred EEEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECC----EEECC-CEEEEECC Q ss_conf 08992899887418988899999863611011289999898899774----53036-70798628 Q gi|254781004|r 5 IKLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLS----DPVRE-GSIEIITP 64 (652) Q Consensus 5 i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~----~~l~d-~~i~~i~~ 64 (652) |+|++ +|..++++.++|+.++.+..+ +...-+|..+|+++.+=+ +.|.+ +.|+++++ T Consensus 1 M~I~v-NGe~~~~~~~~tl~~Ll~~l~--~~~~~vAVevN~~ivpr~~~~~~~L~egD~iEIv~~ 62 (66) T PRK05659 1 MNIQL-NGEPRELPDGESVAALLAREG--LAGRRVAVEVNGEIVPRSQHASTALREGDVVEIVHA 62 (66) T ss_pred CEEEE-CCEEEECCCCCCHHHHHHHCC--CCCCCEEEEECCEECCHHHHCCCCCCCCCEEEEEEE T ss_conf 99999-996878699996999998769--999809999899881778957265899999999998 No 110 >cd01669 TGS_Ygr210_C TGS_Ygr210_C: The C-terminal TGS domain of Ygr210 GTP-binding protein which is a member of Obg-like family of GTPases, and present in archaea. Several Obg-like family members possess a C-terminal RNA-binding domain, the TGS domain, which is also present in threonyl-tRNA synthetase and in bacterial guanosine polyphosphatase SpoT. TGS is a small domain of about 50 amino acid residues with a predominantly beta-sheet structure. There is no direct information on the function of the TGS domain, but its presence in two types of regulatory proteins (the GTPases and guanosine polyphosphate phosphohydrolases/synthetases) suggests a ligand (most likely nucleotide)-binding, regulatory role. Probab=95.34 E-value=0.017 Score=37.00 Aligned_cols=53 Identities=19% Similarity=0.302 Sum_probs=43.6 Q ss_pred CCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEE--ECCEEEECCEEECCCE-EEEEC Q ss_conf 2899887418988899999863611011289999--8988997745303670-79862 Q gi|254781004|r 9 FPDGSIKNFPVHATGSDVAESISRSLAKKAVAVA--INGKVCDLSDPVREGS-IEIIT 63 (652) Q Consensus 9 lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~--vng~l~dL~~~l~d~~-i~~i~ 63 (652) ||| ..-.|.|+|+.|+|-.|...+++..+.|. +||+...+.++|++++ |+++| T Consensus 21 LPD--~~llp~GsT~~DfAy~IHTdiG~~f~~Aida~~~r~v~~d~~L~~gDvi~Ivt 76 (76) T cd01669 21 LPD--AFLLPKGSTARDLAYAIHTDIGDGFLHAIDARTGRRVGEDYELKHRDVIKIVS 76 (76) T ss_pred CCC--EEEECCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCCCCCCCEEEEEC T ss_conf 876--68807999489999999999875140113000587858872847999999809 No 111 >PRK07696 sulfur carrier protein ThiS; Provisional Probab=95.18 E-value=0.08 Score=32.41 Aligned_cols=57 Identities=23% Similarity=0.346 Sum_probs=45.3 Q ss_pred EEEECCCCCEEECCCCC-CHHHHHHHHCCCCCCCEEEEEECCEEEECC----EEECC-CEEEEECC Q ss_conf 08992899887418988-899999863611011289999898899774----53036-70798628 Q gi|254781004|r 5 IKLTFPDGSIKNFPVHA-TGSDVAESISRSLAKKAVAVAINGKVCDLS----DPVRE-GSIEIITP 64 (652) Q Consensus 5 i~I~lpDG~~~~~~~g~-t~~dia~~i~~~~~~~~vaa~vng~l~dL~----~~l~d-~~i~~i~~ 64 (652) |+|++ +|..++++.++ |+.++.++++ +....||..+|+++.+=+ +.|.| +.||+|+| T Consensus 1 M~I~v-NGe~~~~~~~~~tv~~Ll~~l~--l~~~~vaVE~N~eIvpr~~~~~~~L~~GD~iEIV~f 63 (67) T PRK07696 1 MNLKI-NGNQIEVPESVKTVAELLTHLE--LDNRIVVVERNKDILQKDDHTDTSVFDGDQIEIVTF 63 (67) T ss_pred CEEEE-CCEEEECCCCCCCHHHHHHHCC--CCCCEEEEEECCEECCHHHHCCEECCCCCEEEEEEE T ss_conf 98999-9988885886545999998669--998669999798575768809447799999999999 No 112 >TIGR00462 genX lysyl-tRNA synthetase homolog GenX; InterPro: IPR004525 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . This entry represents lysyl-tRNA synthetases from bacteria, as well as other related proteins. Escherichia coli, Haemophilus influenzae, and Aquifex aeolicus each have a protein closely homologous to the C-terminal region of lysyl-tRNA synthetase (LysS). Multiple sequence alignment of these proteins with the homologous regions of collected LysS proteins shows that these proteins form a distinct set rather than just similar truncations of LysS, so they appear to be orthologous. The protein is termed PoxA or GenX after its designation in E. coli. Its function is unknown.; GO: 0000166 nucleotide binding, 0004824 lysine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006430 lysyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=94.98 E-value=0.16 Score=30.37 Aligned_cols=119 Identities=17% Similarity=0.188 Sum_probs=63.5 Q ss_pred HHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCC Q ss_conf 9999999999887-689638555432443100011210012555420100145677782377434312577776520122 Q gi|254781004|r 275 IFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLK 353 (652) Q Consensus 275 i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~ 353 (652) -+..|.+-||.-+ .+|+.||.||.|.....-. -|...+.-+. +......+...=|. |+=+.|. |.-+- T Consensus 23 kRa~~i~~iR~FF~e~G~~EV~TP~l~~~~~~D--~~l~p~~~~~-----l~~g~~~~~~~WL~-TSPEY~M---K~LL~ 91 (330) T TIGR00462 23 KRAKIIAEIRKFFKERGLLEVETPLLSDFPVTD--LHLEPFKTEF-----LEPGGEQNKVLWLS-TSPEYHM---KRLLS 91 (330) T ss_pred HHHHHHHHHHHHHHHCCCEEECCCCHHCCCCCC--EEEEEEEEEE-----ECCCCCCCCEECCC-CCCCHHH---HHHHH T ss_conf 899999999998861784253263010156443--0221368888-----42665333312114-5872789---99986 Q ss_pred CHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEE-CCHHHHHHHHHHHHHHH Q ss_conf 22310432000100012787645200010563533442101-05376788789999998 Q gi|254781004|r 354 SYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVF-CTKEQMFNECLKIHNLI 411 (652) Q Consensus 354 SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F-~~~eq~~~e~~~~~~~~ 411 (652) .|+ --++|++++||||--|.+ ---||+|-+=|-- |..-.+.+|+...++.+ T Consensus 92 ~~~---~~iFQi~k~FRN~E~G~~----H~~EF~MLEWY~~~~d~~~Li~E~~~LL~~~ 143 (330) T TIGR00462 92 AGK---GPIFQITKVFRNEEAGRL----HNPEFTMLEWYRPHYDMLRLINEVDDLLQQL 143 (330) T ss_pred HCC---CCEEEEEEHHHHCCCCCC----CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 217---862310000100024675----7753100111211023899999999999997 No 113 >PRK06083 sulfur carrier protein ThiS; Provisional Probab=94.94 E-value=0.11 Score=31.54 Aligned_cols=60 Identities=22% Similarity=0.350 Sum_probs=47.4 Q ss_pred CCCCEEEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECC----EEECC-CEEEEECC Q ss_conf 988708992899887418988899999863611011289999898899774----53036-70798628 Q gi|254781004|r 1 MPSDIKLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLS----DPVRE-GSIEIITP 64 (652) Q Consensus 1 M~~~i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~----~~l~d-~~i~~i~~ 64 (652) |. +|+|++ +|..++++.++|+.++....+ ++..-+|..+|+++..-+ +.|.+ +.|++++. T Consensus 1 m~-~m~I~v-Nge~~~v~~~~tl~~Ll~~l~--l~~~giAvaiN~~iVPRs~W~~~~l~~gD~Iei~~A 65 (69) T PRK06083 1 MN-TITISI-NDQSYQVPTSSSLAQIIEQLS--LPELGCVFAINNQVVPRSEWQSTVLSEGDAISLFQA 65 (69) T ss_pred CC-EEEEEE-CCEEEECCCCCCHHHHHHHHC--CCCCCEEEEECCEEECHHHCCCCCCCCCCEEEEEEE T ss_conf 97-789999-998877389983999999738--998868999778384356716045899999999988 No 114 >PRK07569 bidirectional hydrogenase complex protein HoxU; Validated Probab=94.68 E-value=0.28 Score=28.64 Aligned_cols=79 Identities=23% Similarity=0.276 Sum_probs=50.9 Q ss_pred CCCCEEEECCCCCEEECCCCCCHHHHHHHHCCCCCC-------------CEEEEEECC--EEEEC-CEEECCCEEEEECC Q ss_conf 988708992899887418988899999863611011-------------289999898--89977-45303670798628 Q gi|254781004|r 1 MPSDIKLTFPDGSIKNFPVHATGSDVAESISRSLAK-------------KAVAVAING--KVCDL-SDPVREGSIEIITP 64 (652) Q Consensus 1 M~~~i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~-------------~~vaa~vng--~l~dL-~~~l~d~~i~~i~~ 64 (652) |+ +|+|+. ||+..+++.|+|++|.|+..+-..+. .+-++.|+| ++.-- ++|++|+-+ |.. T Consensus 1 M~-mvtltI-dG~~v~v~~G~tILeaa~~~Gi~IP~LCy~~~l~~~G~CR~C~VEveG~~~lv~sC~t~v~eGM~--V~T 76 (236) T PRK07569 1 MS-VKTLTI-DDQLVSAREGETILQAAREAGIPIPTLCHLEGVSDVGACRLCLVEVEGSNKLLPACVTPVAEGMV--VQT 76 (236) T ss_pred CC-EEEEEE-CCEEEEECCCCHHHHHHHHCCCCCCEECCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCE--EEE T ss_conf 97-389999-99999968989599999985998472546899398886673279989985315663758899848--992 Q ss_pred CCHHHHHHHHHHHHHHHHH Q ss_conf 9999999999989999999 Q gi|254781004|r 65 EDPRSLAVIRHSCAHIMAE 83 (652) Q Consensus 65 ~~~eg~~i~~hS~ahlL~~ 83 (652) +|+...+.=+--+.+||+. T Consensus 77 ~s~~v~~~Rk~~lellL~~ 95 (236) T PRK07569 77 NTPRLQEYRRMIVELLFAE 95 (236) T ss_pred CCHHHHHHHHHHHHHHHHC T ss_conf 9989999999999999972 No 115 >PRK11840 bifunctional sulfur carrier protein/thiazole synthase protein; Provisional Probab=94.29 E-value=0.13 Score=30.89 Aligned_cols=58 Identities=22% Similarity=0.306 Sum_probs=45.9 Q ss_pred EEEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECC----EEECC-CEEEEECCC Q ss_conf 08992899887418988899999863611011289999898899774----53036-707986289 Q gi|254781004|r 5 IKLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLS----DPVRE-GSIEIITPE 65 (652) Q Consensus 5 i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~----~~l~d-~~i~~i~~~ 65 (652) |+|++ +|..++++.++|+.++.... ++....||+.+|+++..-+ +.|.| +.||++++- T Consensus 1 M~Itv-NGE~r~~~~~~Tv~~Ll~~L--~l~p~~VAVe~N~eIvPR~~w~~t~L~eGD~lEIV~fV 63 (327) T PRK11840 1 MRIRL-NGEPREVPAGLTIEALLAEL--GLDPKKVAVERNLEIVPRSEYGSVALSEGDQLEIVHFV 63 (327) T ss_pred CEEEE-CCEEEECCCCCCHHHHHHHC--CCCCCCEEEEECCEECCHHHCCCCCCCCCCEEEEEEEE T ss_conf 97999-99864358878699999974--89976156720470611444265216899889999997 No 116 >PRK05863 sulfur carrier protein ThiS; Provisional Probab=94.19 E-value=0.15 Score=30.46 Aligned_cols=57 Identities=19% Similarity=0.321 Sum_probs=44.6 Q ss_pred EEEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECC---EEECC-CEEEEECC Q ss_conf 08992899887418988899999863611011289999898899774---53036-70798628 Q gi|254781004|r 5 IKLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLS---DPVRE-GSIEIITP 64 (652) Q Consensus 5 i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~---~~l~d-~~i~~i~~ 64 (652) |+|++ +|..++++.++|+.++....+ ++..-+|..||+++..-+ ++|.| +.|++++. T Consensus 1 M~I~v-Nge~~e~~~~~Tva~Ll~~l~--~~~~giAVavN~~iVPRs~w~t~L~eGd~veiv~a 61 (65) T PRK05863 1 MIVVV-NEEQVEVDEQTTVAALLDSLG--FPEKGIAVAVDWSVLPRSEWATKLRDGARLEVVTA 61 (65) T ss_pred CEEEE-CCEEEECCCCCCHHHHHHHCC--CCCCCEEEEECCEEECCHHHCCCCCCCCEEEEEEE T ss_conf 98999-993578699870999998729--99885899978958450684062789999999987 No 117 >PRK06944 sulfur carrier protein ThiS; Provisional Probab=94.02 E-value=0.19 Score=29.78 Aligned_cols=56 Identities=18% Similarity=0.357 Sum_probs=44.1 Q ss_pred EEEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECC----EEECC-CEEEEECC Q ss_conf 08992899887418988899999863611011289999898899774----53036-70798628 Q gi|254781004|r 5 IKLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLS----DPVRE-GSIEIITP 64 (652) Q Consensus 5 i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~----~~l~d-~~i~~i~~ 64 (652) |+|++ +|..++++.|.|+.|+.+.++. . ..+|+.||++++.-+ +.|.+ +.|+++.. T Consensus 1 M~I~v-Nge~~~lp~g~tvadll~~l~~--~-~~~AVavN~~~VPRs~~~~~~L~~GD~ieiv~~ 61 (65) T PRK06944 1 MDIQL-NQQTLSLPDGATVADALAAYGA--R-PPFAVAVNGNFVARTQHAARALAAGDRLDLVHP 61 (65) T ss_pred CEEEE-CCEEEECCCCCCHHHHHHHCCC--C-CCEEEEECCEEECHHHHHHCCCCCCCEEEEEEE T ss_conf 98999-9946427999979999986286--8-977999847980314533312789999999974 No 118 >TIGR00471 pheT_arch phenylalanyl-tRNA synthetase, beta subunit; InterPro: IPR004531 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Phenylalanyl-tRNA synthetase (6.1.1.20 from EC) is an alpha2/beta2 tetramer composed of 2 subunits that belongs to class IIc. In eubacteria, a small subunit (pheS gene) can be designated as beta (E. coli) or alpha subunit (see IPR002319 from INTERPRO). Reciprocally the large subunit (pheT gene) can be designated as alpha (E. coli) or beta. In all other kingdoms the two subunits have equivalent length in eukaryota, and can be identified by specific signatures. The enzyme from Thermus thermophilus has an alpha 2 beta 2 type quaternary structure and is one of the most complicated members of the synthetase family. Identification of phenylalanyl-tRNA synthetase as a member of class II aaRSs was based only on sequence alignment of the small alpha-subunit with other synthetases . This family describes the beta subunit. The beta subunits break into two subfamilies that are considerably different in sequence, length, and pattern of gaps (see also IPR004532 from INTERPRO). This family represents the subfamily that includes the beta subunit from eukaryotic cytosol, the archaea, and spirochetes.; GO: 0000166 nucleotide binding, 0004826 phenylalanine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006432 phenylalanyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=93.84 E-value=0.068 Score=32.90 Aligned_cols=115 Identities=15% Similarity=0.150 Sum_probs=73.0 Q ss_pred ECCCCCCEEEEECCHHHH---HHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCE Q ss_conf 226787435650107899---9999999998876-896385554324431000112100125554201001456777823 Q gi|254781004|r 258 IAEDGSGVIFWHRKGWKI---FQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRT 333 (652) Q Consensus 258 ~~~~~~G~~~wlP~G~~i---~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~ 333 (652) +....|++|.-.-.|-.- |+++.+.+|+-|- .||+||.+=.|.+.+ ++|..+...+.....-- T Consensus 373 ~~~~~pE~P~~~t~g~~~Plp~~~~~~~~R~~mvG~GF~EVi~l~L~s~~-------------~~~~~mR~~d~~~k~~v 439 (605) T TIGR00471 373 YNNIEPELPKILTIGRLKPLPLEKVSDIIREIMVGLGFQEVISLTLTSEE-------------VLFKRMRIEDNNFKLAV 439 (605) T ss_pred HHCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHHHCCCCCHH-------------HHHHHHCCCCCCCHHHH T ss_conf 21068777712222104888777887576542204704342410047806-------------99998075788720033 Q ss_pred EEECCCCHHHHHH--------HHHHCCCCHHHCCHHHHHHEEE-ECCCCCCCCEEEEECCCCC Q ss_conf 7743431257777--------6520122223104320001000-1278764520001056353 Q gi|254781004|r 334 FALKPMNCPGHVA--------VFNHGLKSYRELPVRLAEFGSV-YRNEPSGSLHGLMRVRGFT 387 (652) Q Consensus 334 ~~LkPmncp~h~~--------i~~~~~~SYrdLPlrl~e~~~~-~R~E~sg~l~GL~R~ReF~ 387 (652) =.+.|++.++.+. +=.-...+..+||.++||+|.| +.|+.+-+ -=.|.|-|+ T Consensus 440 ~V~~P~t~e~~~~R~SlLP~LLe~~~~Nk~~~lP~KiFE~GDVV~~d~~~er--k~~~~r~~~ 500 (605) T TIGR00471 440 KVANPLTLEYTIVRTSLLPGLLETLSENKHHELPQKIFEIGDVVVKDDDSER--KETRSRVVT 500 (605) T ss_pred HHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEECCCHHH--HHHCCEEEE T ss_conf 3117876233377656223678888651489988207887438882872012--230562555 No 119 >PRK08053 sulfur carrier protein ThiS; Provisional Probab=93.09 E-value=0.42 Score=27.45 Aligned_cols=57 Identities=16% Similarity=0.250 Sum_probs=44.7 Q ss_pred EEEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECC----EEECC-CEEEEECC Q ss_conf 08992899887418988899999863611011289999898899774----53036-70798628 Q gi|254781004|r 5 IKLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLS----DPVRE-GSIEIITP 64 (652) Q Consensus 5 i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~----~~l~d-~~i~~i~~ 64 (652) |+|++ +|..++++.++|+.++..... +...-+|..||+++..=+ ++|.| +.|++++. T Consensus 1 M~I~i-N~e~~e~~~~~tl~~Ll~~l~--~~~~giAvAiN~~vVPRs~W~~~~L~egD~I~i~~A 62 (66) T PRK08053 1 MQILF-NDQPMQCAAGQTVHELLEQLN--QLQPGAALAINQQIVPREQWAQHIVQDGDQILLFQV 62 (66) T ss_pred CEEEE-CCEEEECCCCCCHHHHHHHHC--CCCCCEEEEECCEEECHHHCCCCCCCCCCEEEEEEE T ss_conf 99999-998878689985999998727--888858999889785578866531689999999984 No 120 >pfam02597 ThiS ThiS family. ThiS (thiaminS) is a 66 aa protein involved in sulphur transfer. ThiS is coded in the thiCEFSGH operon in E. coli. This family of proteins have two conserved Glycines at the COOH terminus. Thiocarboxylate is formed at the last G in the activation process. Sulphur is transferred from ThiI to ThiS in a reaction catalysed by IscS. MoaD, a protein involved sulphur transfer in molybdopterin synthesis, is about the same length and shows limited sequence similarity to ThiS. Both have the conserved GG at the COOH end. Probab=93.01 E-value=0.33 Score=28.18 Aligned_cols=53 Identities=28% Similarity=0.499 Sum_probs=42.3 Q ss_pred CCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEEC---CEEECC-CEEEEECCC Q ss_conf 9988741898889999986361101128999989889977---453036-707986289 Q gi|254781004|r 11 DGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDL---SDPVRE-GSIEIITPE 65 (652) Q Consensus 11 DG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL---~~~l~d-~~i~~i~~~ 65 (652) .|+..+++.++|+.|+...++. ....+++.+||++.+- +++|.| +.|+++++- T Consensus 11 ~G~~~~~~~~~tv~~ll~~l~~--~~~~v~v~vN~~~v~~~~~~~~L~~gD~v~ivppV 67 (70) T pfam02597 11 AGKELELPEGATVAELLEALGL--NPERVAVAVNGEIVPRSQADTPLKDGDEVAIIPPV 67 (70) T ss_pred CCCEEECCCCCCHHHHHHHHCC--CCCCEEEEECCEECCCCCCCCCCCCCCEEEEECCC T ss_conf 5997635999969999998388--88888999999986785677677999999997886 No 121 >cd00565 ThiS ThiS (ThiaminS) is a sulfur carrier protein involved in thiamin biosynthesis in bacteria. The ThiS fold, like those of two closely related proteins MoaD and Urm1, is similar to that of ubiquitin although there is little or no sequence similarity. Probab=92.87 E-value=0.29 Score=28.61 Aligned_cols=55 Identities=27% Similarity=0.432 Sum_probs=43.1 Q ss_pred EECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECC----EEECC-CEEEEECC Q ss_conf 992899887418988899999863611011289999898899774----53036-70798628 Q gi|254781004|r 7 LTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLS----DPVRE-GSIEIITP 64 (652) Q Consensus 7 I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~----~~l~d-~~i~~i~~ 64 (652) |++ +|...+++.++|+.++.+..+- ....+|..+|+++..-+ +.+.| +.|+++++ T Consensus 2 I~i-NG~~~~~~~~~tl~~ll~~~~~--~~~~vaV~~N~~ivpk~~~~~~~l~~gD~iEii~~ 61 (65) T cd00565 2 ITV-NGEPREVEEGTTLAELLEELGL--PPEGVAVEVNGEIVPRSEWADTELQEGDRIEIVTA 61 (65) T ss_pred EEE-CCEEEECCCCCCHHHHHHHCCC--CCCCEEEEECCEECCHHHCCCCCCCCCCEEEEEEE T ss_conf 899-9998886999979999987699--98859999999980878928100899899999987 No 122 >PRK06488 sulfur carrier protein ThiS; Validated Probab=92.82 E-value=0.32 Score=28.27 Aligned_cols=56 Identities=23% Similarity=0.303 Sum_probs=43.2 Q ss_pred EEEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECC----EEECC-CEEEEECC Q ss_conf 08992899887418988899999863611011289999898899774----53036-70798628 Q gi|254781004|r 5 IKLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLS----DPVRE-GSIEIITP 64 (652) Q Consensus 5 i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~----~~l~d-~~i~~i~~ 64 (652) |+|++ +|..++++ ++|+.++....+ +....+|.-|||+++.-+ ++|.+ +.||+++. T Consensus 1 MkI~v-NGe~~e~~-a~tl~~Ll~elg--~~~~~vAvAvN~~~Vpr~~~~~~~L~eGDriEIv~p 61 (65) T PRK06488 1 MKLFV-NGEEIETA-ATTLALLLAELG--YEGNWLATAVNGELVHSEARAQFVLHEGDRIEILSP 61 (65) T ss_pred CEEEE-CCEEEEEC-HHHHHHHHHHCC--CCCCCEEEEECCEEECHHHHCCCCCCCCCEEEEECC T ss_conf 98999-99777700-868999999729--888808999899987424505451799999999646 No 123 >cd00769 PheRS_beta_core Phenylalanyl-tRNA synthetase (PheRS) beta chain core domain. PheRS belongs to class II aminoacyl-tRNA synthetases (aaRS) based upon its structure. While class II aaRSs generally aminoacylate the 3'-OH ribose of the appropriate tRNA, PheRS is an exception in that it attaches the amino acid at the 2'-OH group, like class I aaRSs. PheRS is an alpha-2/ beta-2 tetramer. While the alpha chain contains a catalytic core domain, the beta chain has a non-catalytic core domain. Probab=92.29 E-value=0.088 Score=32.14 Aligned_cols=91 Identities=18% Similarity=0.203 Sum_probs=59.6 Q ss_pred HHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCC Q ss_conf 9999999998876-896385554324431000112100125554201001456777823774343125777765201222 Q gi|254781004|r 276 FQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKS 354 (652) Q Consensus 276 ~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~S 354 (652) ++++++-+|..+. .||+||.|-.+.+.+..+.-+.- .++...... .-..+.=+|||+-=|+.....+...+ T Consensus 2 ~~~l~~~ir~~L~~~Gf~Ev~t~s~~~~~~~~~~~~~---~~~~i~l~N----Pls~e~~~lR~sLlp~LL~~~~~N~~- 73 (198) T cd00769 2 LQKLERKLRRLLAGLGFQEVITYSLTSPEEAELFDGG---LDEAVELSN----PLSEEYSVLRTSLLPGLLDALARNLN- 73 (198) T ss_pred HHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHCCC---CCCEEEEEC----CCCHHHHHHHHHHHHHHHHHHHHHCC- T ss_conf 6799999999999789714130676898999872488---787389708----86566889888779999999986505- Q ss_pred HHHCCHHHHHHEEEECCCCC Q ss_conf 23104320001000127876 Q gi|254781004|r 355 YRELPVRLAEFGSVYRNEPS 374 (652) Q Consensus 355 YrdLPlrl~e~~~~~R~E~s 374 (652) ++..++++||+|.+|..+.. T Consensus 74 r~~~~~~lFEiG~vy~~~~~ 93 (198) T cd00769 74 RKNKPLRLFEIGRVFLKDED 93 (198) T ss_pred CCCCCCEEEEEEEEEECCCC T ss_conf 67777328997638966888 No 124 >TIGR01683 thiS thiamine biosynthesis protein ThiS; InterPro: IPR010035 This entry identifies the ThiS family, which are thiamine-biosynthesis proteins related to MoaD (IPR010034 from INTERPRO). MoaD is a molybdenum cofactor biosynthesis protein. Both ThiS and MoaD proteins are involved in sulphur transfer.. Probab=91.64 E-value=0.53 Score=26.77 Aligned_cols=52 Identities=31% Similarity=0.388 Sum_probs=42.6 Q ss_pred CCCEEECCCCCCHHHHHHHHCCCCCC-CEEEEEECCEEEECC----EEECC-CEEEEECC Q ss_conf 99887418988899999863611011-289999898899774----53036-70798628 Q gi|254781004|r 11 DGSIKNFPVHATGSDVAESISRSLAK-KAVAVAINGKVCDLS----DPVRE-GSIEIITP 64 (652) Q Consensus 11 DG~~~~~~~g~t~~dia~~i~~~~~~-~~vaa~vng~l~dL~----~~l~d-~~i~~i~~ 64 (652) +|..++++.++|+.++..++ ++.. ..||..+|+.+.+=+ +.|.+ +.|+++++ T Consensus 4 NG~~~~~~~~~t~~~LL~~~--~~~~~~~vave~N~~ivp~~~w~~~~l~~gD~iEiv~~ 61 (65) T TIGR01683 4 NGEPVEVEDGTTLAALLESL--GLDPKRRVAVEVNGEIVPRSEWEDTILKEGDRIEIVTF 61 (65) T ss_pred CCCCEEECCCCCHHHHHHHH--CCCCCCEEEEEECCEEECCCCCCCEEECCCCEEEEEEE T ss_conf 99124625888878988860--75358748998698340834567510158985889976 No 125 >PRK09616 pheT phenylalanyl-tRNA synthetase subunit beta; Reviewed Probab=91.45 E-value=0.52 Score=26.81 Aligned_cols=92 Identities=13% Similarity=0.138 Sum_probs=57.6 Q ss_pred HHHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCC-CCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHH Q ss_conf 8999999999998876-89638555432443100-011210012555420100145677782377434312577776520 Q gi|254781004|r 273 WKIFQTLISYMRRKIK-DDYEEINTPQVLDQHLW-QQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNH 350 (652) Q Consensus 273 ~~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw-~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~ 350 (652) ..-.+++.+-+|..+. .||+||.|=.+.+.+.. +..+..+ ++...... .-.++.=+|||+-=|+.....+. T Consensus 355 ~~~~~~~~~~ir~~L~~~G~~Evitysf~s~~~~~~~~~~~~---~~~v~l~N----Pis~e~s~mRtsLlp~LL~~~~~ 427 (546) T PRK09616 355 LHPIEELSRAIRDLMVGLGFQEVMNFTLTSEEVLFEKMNLEE---DDPVEVLN----PISEDYTVLRTSLLPSLLEFLSI 427 (546) T ss_pred CCHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHCCCC---CCEEEECC----CCCHHHHHHHHHHHHHHHHHHHH T ss_conf 998999999999999968961276225689899987616886---87489648----87456777775538999999998 Q ss_pred CCCCHHHCCHHHHHHEEEECCCC Q ss_conf 12222310432000100012787 Q gi|254781004|r 351 GLKSYRELPVRLAEFGSVYRNEP 373 (652) Q Consensus 351 ~~~SYrdLPlrl~e~~~~~R~E~ 373 (652) ..+ |..|+++||+|.+|.... T Consensus 428 N~~--r~~~~~lFEiG~Vf~~~~ 448 (546) T PRK09616 428 NKH--REYPQKIFEIGDVVLIDE 448 (546) T ss_pred HHC--CCCCEEEEEEECEEECCC T ss_conf 466--899845997405580488 No 126 >pfam01409 tRNA-synt_2d tRNA synthetases class II core domain (F). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only phenylalanyl-tRNA synthetases. This is the core catalytic domain. Probab=91.07 E-value=1.3 Score=24.07 Aligned_cols=111 Identities=18% Similarity=0.289 Sum_probs=62.4 Q ss_pred HHHHHHHHHHHHHHHHH-CCCEEEECCCCCCC-----CCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHH Q ss_conf 78999999999998876-89638555432443-----1000112100125554201001456777823774343125777 Q gi|254781004|r 272 GWKIFQTLISYMRRKIK-DDYEEINTPQVLDQ-----HLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHV 345 (652) Q Consensus 272 G~~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~-----~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~ 345 (652) +.-.+..+.+-+++.+. .||+++..|.+.+. .|.--..|=.--..+-|.... +..++..+||.=+-++.. T Consensus 15 ~~HPi~~~~~~i~~~f~~~GF~~~~g~eves~~~NFDaLn~P~dHPaR~~~DTfy~~~----~~~~~~~vLRtHTS~~q~ 90 (243) T pfam01409 15 GLHPLTRVLEEIRDIFLSMGFEEVEGPEVESDFYNFDALNIPQDHPARDMQDTFYLKK----PLKARRLLLRTHTTPVQA 90 (243) T ss_pred CCCHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHCCCCCCCCCCCCCEEEECC----CCCCCCEEECCCCCHHHH T ss_conf 8585999999999999877986876983564277777557898898767676034336----666768657687988999 Q ss_pred HHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEE Q ss_conf 7652012222310432000100012787645200010563533442101 Q gi|254781004|r 346 AVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVF 394 (652) Q Consensus 346 ~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F 394 (652) ...+ ...+.|+|++-+|.|||+|.-.+ .-.-+|.|-|+-.. T Consensus 91 r~l~----~~~~~p~r~~~~grvyR~d~~Da----tH~~~FhQ~Eg~vv 131 (243) T pfam01409 91 RTLA----EKNKPPIKIFSIGRVFRRDQVDA----THLPEFHQVEGLVV 131 (243) T ss_pred HHHH----HCCCCCEEEECCCEEEECCCCCC----CCCCEEEEEEEEEE T ss_conf 9997----42799848981431332588984----31531057557896 No 127 >TIGR00458 aspS_arch aspartyl-tRNA synthetase; InterPro: IPR004523 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Aspartyl tRNA synthetase 6.1.1.12 from EC is an alpha2 dimer that belongs to class IIb. Structural analysis combined with mutagenesis and enzymology data on the yeast enzyme point to a tRNA binding process that starts by a recognition event between the tRNA anticodon loop and the synthetase anticodon binding module . This family represents aspartyl-tRNA synthetases from the eukaryotic cytosol and from the archaea. In some species, this enzyme aminoacylates tRNA for both Asp and Asn; Asp-tRNA(asn) is subsequently transamidated to Asn-tRNA(asn). ; GO: 0000166 nucleotide binding, 0004815 aspartate-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006422 aspartyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=89.89 E-value=1.2 Score=24.38 Aligned_cols=106 Identities=17% Similarity=0.359 Sum_probs=67.4 Q ss_pred HHHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHC Q ss_conf 8999999999998876-896385554324431000112100125554201001456777823774343125777765201 Q gi|254781004|r 273 WKIFQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHG 351 (652) Q Consensus 273 ~~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~ 351 (652) ++|+..+.+.+|+.+. .||.||.||-|..+- --=|-++|...... .+-| |. ..=++||+. T Consensus 155 F~I~~~~~~~vR~~L~~~~F~Ev~TPKlv~~~--------tEGGt~lF~i~YF~-----~~Af-L~-----QSPQlyKQ~ 215 (466) T TIGR00458 155 FRIRSEVLESVREFLAEEGFIEVHTPKLVASA--------TEGGTELFPITYFE-----REAF-LG-----QSPQLYKQV 215 (466) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEECCCEEECC--------CCCCCCCCEEEEEC-----CCCC-CC-----CCCHHHHHH T ss_conf 55689999999998730882563076214216--------68885332012212-----3000-13-----570356687 Q ss_pred CCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCH-HHHHH Q ss_conf 22223104320001000127876452000105635334421-01053-76788 Q gi|254781004|r 352 LKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTK-EQMFN 402 (652) Q Consensus 352 ~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~-eq~~~ 402 (652) +-+ -+|- |.||||-+||-|-+-+. --.-|++--|.. +|... .++.+ T Consensus 216 l~~-~g~~-rVyEI~PiFRAE~~nT~---RHl~E~~siDiE~~F~dhy~~v~d 263 (466) T TIGR00458 216 LMA-AGLE-RVYEIGPIFRAEEHNTR---RHLNEAISIDIEMAFEDHYEDVMD 263 (466) T ss_pred HHH-CCCC-CEEEECCEEECCCCCCH---HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 765-2652-03550552205666521---455444442244442753789999 No 128 >KOG1547 consensus Probab=89.57 E-value=0.63 Score=26.23 Aligned_cols=94 Identities=21% Similarity=0.390 Sum_probs=64.0 Q ss_pred CEEEEECCCEEEEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEC---C-----HHHHHHHHHHHHHHHHCC Q ss_conf 41673179721379964034103789999999973870887337808999857---9-----668999999999999879 Q gi|254781004|r 508 AFYVNSHSEKCHPVMIHRAVFGSIERFIGIMIENFKGNLPLWLSPIQAIVTTI---T-----SSAVEYAQEIANLLKSHH 579 (652) Q Consensus 508 l~y~~~dg~~~~pvmihr~~~Gs~eR~ia~liE~~~g~~P~wLAP~QV~Iipi---~-----e~~~eya~~i~~~L~~~g 579 (652) +.|+-..|..-+|.=| -|+--|.| -|.|||| + ++-.++-+.|.++|..+| T Consensus 160 lyFi~ptGhsLrplDi---------eflkrLt~-------------vvNvvPVIakaDtlTleEr~~FkqrI~~el~~~~ 217 (336) T KOG1547 160 LYFIPPTGHSLRPLDI---------EFLKRLTE-------------VVNVVPVIAKADTLTLEERSAFKQRIRKELEKHG 217 (336) T ss_pred EEEECCCCCCCCCCCH---------HHHHHHHH-------------HHEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 9996798875670449---------99998865-------------5204335750664427779999999999999659 Q ss_pred CEEEE------ECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCC Q ss_conf 88999------7589663676665543489889998742532581899988988 Q gi|254781004|r 580 LSIET------DFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGST 627 (652) Q Consensus 580 irV~~------Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~ 627 (652) |+|.- |+-+..++.|+|+. ||+.+|=.++|..-|.-.++-|.+. T Consensus 218 i~vYPq~~fded~ed~~lN~kvR~~----iPFAVVGsd~e~~vnG~~vlGRktr 267 (336) T KOG1547 218 IDVYPQDSFDEDLEDKTLNDKVRES----IPFAVVGSDKEIQVNGRRVLGRKTR 267 (336) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHH----CCEEEECCCCEEEECCEEEECCCCC T ss_conf 4003554333445677877988850----8747853662388868475235564 No 129 >TIGR00470 sepS O-phosphoseryl-tRNA(Cys) synthetase; InterPro: IPR005246 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . All known phenylalanyl-tRNA synthetases, excepting a monomeric mitochondrial form, have an alpha 2 beta 2 heterotetrameric structure. This family of archaeal proteins resembles known phenylalanyl-tRNA synthetase alpha chains but is longer and of unknown function; more generally, it resembles class 2 aminoacyl tRNA ligases. . Probab=87.55 E-value=1 Score=24.76 Aligned_cols=218 Identities=20% Similarity=0.295 Sum_probs=116.4 Q ss_pred EECCCCHH----HHHH----HHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEE-C--CHHHHHHH Q ss_conf 74343125----7777----652012222310432000100012787645200010563533442101-0--53767887 Q gi|254781004|r 335 ALKPMNCP----GHVA----VFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVF-C--TKEQMFNE 403 (652) Q Consensus 335 ~LkPmncp----~h~~----i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F-~--~~eq~~~e 403 (652) -|||.+-. .|.. |--+++..-+-|||+||=|-.|||-|++-= |+|=+|= ||- | ..+++ . T Consensus 185 dLKP~S~tlTLRSHMTsGWFItLs~i~dk~~lPlKLFSIDRCFRREQ~ED-----~shLmtY---hSASCVvVdeev--s 254 (558) T TIGR00470 185 DLKPVSSTLTLRSHMTSGWFITLSSIADKLKLPLKLFSIDRCFRREQKED-----ASHLMTY---HSASCVVVDEEV--S 254 (558) T ss_pred CCCCCCCCCCCHHHCCCHHHHHHHHHHHHCCCCCEEECCCCCCCCCCCCC-----CCCCCCE---EEEEEEEECCCC--C T ss_conf 16886554320010100278889878751589711220032314466435-----3100113---454667857712--4 Q ss_pred HHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 89999998876655213321000146531125860357789999999998502121256675205555554554321100 Q gi|254781004|r 404 CLKIHNLIISIYKDFGFEKIMVKLSTRPEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYIL 483 (652) Q Consensus 404 ~~~~~~~~~~v~~~~g~~~~~~~ls~~~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~~ 483 (652) +..=.....-.+..|||..+..+-.....||.--+.. .|--.|=||+.-.. T Consensus 255 vDdGKaVAEgLL~qfGF~~FrFrpDEKkSKYYiP~TQ-----------------------------TEVyAyHPkL~Gss 305 (558) T TIGR00470 255 VDDGKAVAEGLLEQFGFEKFRFRPDEKKSKYYIPDTQ-----------------------------TEVYAYHPKLKGSS 305 (558) T ss_pred CCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC-----------------------------CEEEEECCCCCCCC T ss_conf 3642688999998708544750576464543578885-----------------------------05677626456664 Q ss_pred HHHCCCCCCCCCCCHHHHCCCCCCCEEEEECCCEEEEEEEEECCCH-HHHHHHHHHHH----------HHCCCCCCC--- Q ss_conf 2110331001211011102234674167317972137996403410-37899999999----------738708873--- Q gi|254781004|r 484 KDAIGRDWQCGTIQVDFNLPSRFNAFYVNSHSEKCHPVMIHRAVFG-SIERFIGIMIE----------NFKGNLPLW--- 549 (652) Q Consensus 484 ~d~~gr~~~~~tiq~df~~p~~f~l~y~~~dg~~~~pvmihr~~~G-s~eR~ia~liE----------~~~g~~P~w--- 549 (652) ..--..+...+|+ +...|.++...=++. ||| =+| ++||+=.||=- |.-|.|-+. T Consensus 306 tkysdeWiEvATF--G~YSPiALa~Y~Id~------pVM----NLGlGVERlAMIlygy~DVR~MVYPQiyGE~rLSDrD 373 (558) T TIGR00470 306 TKYSDEWIEVATF--GVYSPIALAKYDIDV------PVM----NLGLGVERLAMILYGYEDVRKMVYPQIYGEYRLSDRD 373 (558) T ss_pred CCCCCCCEEEEEC--CCCHHHHHHHCCCCC------CEE----ECCCHHHHHHHHHHCHHHHHHCCCCCCCCEEECCHHH T ss_conf 4104764798743--431046774168496------522----2563187678778140334322688421223123366 Q ss_pred CC-CEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEECCCCC-CCHHH-HHHHHCCCC--EEEEEC Q ss_conf 37-808999857966899999999999987988999758966-36766-655434898--899987 Q gi|254781004|r 550 LS-PIQAIVTTITSSAVEYAQEIANLLKSHHLSIETDFRNET-INYKI-REHSIKKIP--IIIICG 610 (652) Q Consensus 550 LA-P~QV~Iipi~e~~~eya~~i~~~L~~~girV~~Ddr~~~-~G~Ki-r~a~l~giP--~~ivIG 610 (652) +| =++|.-+|+.+.-...|++|-+...++. |. .| =+.-. |+-+.+|=. ..+-|= T Consensus 374 IA~~irv~~vP~t~~g~~~a~~iv~~~~aha------d~-pSPCSfev~regef~Grrrvi~V~~~ 432 (558) T TIGR00470 374 IAEMIRVDKVPVTAEGLKLAEEIVETAEAHA------DE-PSPCSFEVYREGEFLGRRRVIKVEVL 432 (558) T ss_pred HHHHHEECCCCCCHHHHHHHHHHHHHHHHCC------CC-CCCCEEEEEECCCCCCCCEEEEEEEE T ss_conf 6745010467522889999999999999607------78-88941899842200366138999999 No 130 >KOG0554 consensus Probab=86.71 E-value=2.6 Score=22.03 Aligned_cols=103 Identities=21% Similarity=0.321 Sum_probs=69.7 Q ss_pred HHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCC----CHHHHHHH Q ss_conf 899999999999887-6896385554324431000112100125554201001456777823774343----12577776 Q gi|254781004|r 273 WKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPM----NCPGHVAV 347 (652) Q Consensus 273 ~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPm----ncp~h~~i 347 (652) +|+++.+..-.+..+ .++|..|.||.|...|- +. .-+||.+..+-. ...+|.=+|| +--.|... T Consensus 132 ~RvRs~~~~a~h~ffq~~~F~~i~tPiiTtnDC-------EG-aGE~F~Vtt~~d---~~~~fFgrp~fLTVSgQLhlEa 200 (446) T KOG0554 132 LRVRSALAFATHSFFQSHDFTYINTPIITTNDC-------EG-AGEVFQVTTLTD---YSKDFFGRPAFLTVSGQLHLEA 200 (446) T ss_pred HHHHHHHHHHHHHHHHHCCCEEECCCEEECCCC-------CC-CCCEEEEEECCC---CCCCCCCCCEEEEEECEEHHHH T ss_conf 999989999999999875946733867634577-------78-740589874686---5433148850799713215998 Q ss_pred HHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEE-ECC Q ss_conf 5201222231043200010001278764520001056353344210-105 Q gi|254781004|r 348 FNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHV-FCT 396 (652) Q Consensus 348 ~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~-F~~ 396 (652) +...+ =|.|-+|..||-|.|-+- .-.-||.|-++.+ |+. T Consensus 201 ~a~aL-------srvyt~gP~FRAE~s~t~---rHLaEFwmlE~E~af~~ 240 (446) T KOG0554 201 MACAL-------SRVYTFGPTFRAENSHTS---RHLAEFWMLEAELAFAE 240 (446) T ss_pred HHHHH-------CCEEEECCCEECCCCCCH---HHHHHHHHHHHHHHHHH T ss_conf 87552-------323761661322678754---67765652427887888 No 131 >TIGR00443 hisS_second histidyl-tRNA synthetase 2, putative; InterPro: IPR004517 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Histidyl-tRNA synthetase (6.1.1.21 from EC) is an alpha2 dimer that belongs to class IIa (see IPR004516 from INTERPRO). Every completed genome includes a histidyl-tRNA synthetase. Apparent second copies from Bacillus subtilis, Synechocystis spp., and Aquifex aeolicus are slightly shorter, more closely related to each other than to other hisS proteins, and not demonstrated to act as histidyl-tRNA synthetases. The regulatory protein kinase GCN2 of Saccharomyces cerevisiae (YDR283c), and related proteins from other species designated eIF-2 alpha kinase, have a domain closely related to histidyl-tRNA synthetase that may serve to detect and respond to uncharged tRNA(his), an indicator of amino acid starvation, but these regulatory proteins are not orthologous. An unusual feature is that this putative second HisS is shorter at the C-terminus. It shows greater similarity to archaeal HisS than to other bacterial HisS. There is reason by analogy to suspect this second HisS may have a function other than (or in addition to) ligating His to its tRNA: the protein kinase DGCN2 of Drosophila has a C-terminal domain related to HisS that appears to detect uncharged tRNA(his), an indicator of amino acid starvation, and respond by phosphorylating eIF-2alpha.; GO: 0000105 histidine biosynthetic process, 0005737 cytoplasm. Probab=86.55 E-value=2.6 Score=21.98 Aligned_cols=157 Identities=17% Similarity=0.113 Sum_probs=98.0 Q ss_pred CCCCCCEEEEECCHHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCC-CCEEEE Q ss_conf 26787435650107899999999999887-6896385554324431000112100125554201001456777-823774 Q gi|254781004|r 259 AEDGSGVIFWHRKGWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKD-LRTFAL 336 (652) Q Consensus 259 ~~~~~G~~~wlP~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~-~~~~~L 336 (652) +..-.|.-.++|.-.....++.+-+..-. +.||+.+.+|++..-+.-...+-... .+.+. ..+.. ++.+.| T Consensus 3 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~--~~~~~-----~~d~~~~~~~~l 75 (414) T TIGR00443 3 FEPPAGERDLLPGEYELKKKINKKLTDLFDKWGYQFIETSTLEWYDTLGLGGAIGA--KELIK-----LLDSGTGGTLGL 75 (414) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHCCCCCCHHHHHHHCCCCCH--HHHHE-----EECCCCCCCEEE T ss_conf 66555403444567778889988999999861401210132211233321011000--02100-----000345540211 Q ss_pred CCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHH Q ss_conf 34312577776520122223104320001000127876452000105635334421010537678878999999887665 Q gi|254781004|r 337 KPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYK 416 (652) Q Consensus 337 kPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~ 416 (652) +|--.........++.+. ..+|.|++-.+.+||..+.+. + -|.-+|.+.-....-. .....-..++.+...... T Consensus 76 ~~~~~~~~~~~~~~~~~~-~~~p~~~~~~~~~~~~~p~~~-~--g~~~~~~~~g~~l~~~--~~~~~d~~~~~~~~~~l~ 149 (414) T TIGR00443 76 RPDLTGPIARAAVSKLLD-NGLPERVGYFANVFRLPPEGY-G--GRGFEFLQAGVELLGD--GGVLADAEVLLLLTGCLK 149 (414) T ss_pred CCCCHHHHHHHHHHHHHH-CCCCHHHHHHHHHHCCCCCCC-C--CCCCHHHHHHHHHHCC--CCCHHHHHHHHHHHHHHH T ss_conf 222014688999988762-366113322211102675334-5--6541234420232103--430125789999999886 Q ss_pred HHCCCCCEECCC Q ss_conf 521332100014 Q gi|254781004|r 417 DFGFEKIMVKLS 428 (652) Q Consensus 417 ~~g~~~~~~~ls 428 (652) ..|+....+.++ T Consensus 150 ~~gl~~~~~~~~ 161 (414) T TIGR00443 150 SLGLADWKLILG 161 (414) T ss_pred HCCCHHHHHHHH T ss_conf 315013455541 No 132 >PRK00488 pheS phenylalanyl-tRNA synthetase subunit alpha; Validated Probab=85.36 E-value=3 Score=21.58 Aligned_cols=116 Identities=16% Similarity=0.227 Sum_probs=61.9 Q ss_pred CCCCEEEEECCHHHHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCC-----CCEE Q ss_conf 7874356501078999999999998876-896385554324431000112100125554201001456777-----8237 Q gi|254781004|r 261 DGSGVIFWHRKGWKIFQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKD-----LRTF 334 (652) Q Consensus 261 ~~~G~~~wlP~G~~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~-----~~~~ 334 (652) .-||-+.+ +-.......+.+-++.-+. -||..+.-|.+-.. . |. | +-++.|......+.. +... T Consensus 95 TlP~~~~~-~G~~HPit~~~~ei~~iF~~~Gf~v~~GpeiE~~-~------~N-F-daLn~P~~HPAR~~~DTFy~~~~~ 164 (338) T PRK00488 95 TLPGRRIE-LGGLHPLTRTIERIEDIFVGLGFEVAEGPEIEDD-Y------YN-F-EALNIPKDHPARDMQDTFYIDDRL 164 (338) T ss_pred CCCCCCCC-CCCCCHHHHHHHHHHHHHHHCCCEEECCCCEEEH-H------HH-H-HHHCCCCCCCCCCCCCEEEECCCC T ss_conf 78788888-7664769999999999999869769618831004-5------46-8-875699788543646628976871 Q ss_pred EECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEEC Q ss_conf 7434312577776520122223104320001000127876452000105635334421010 Q gi|254781004|r 335 ALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFC 395 (652) Q Consensus 335 ~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~ 395 (652) +||.-+-|..+-...+ ...|+|+.-.|.|||+|.--+ +-.-.|.|-|+-... T Consensus 165 lLRTHTS~~qir~m~~-----~~pp~~~i~~GrvyR~d~~Da----tH~~~FhQ~Egl~vd 216 (338) T PRK00488 165 LLRTHTSPVQIRTMEK-----QKPPIRIIAPGRVYRNDSDDA----THSPMFHQVEGLVVD 216 (338) T ss_pred EECCCCCHHHHHHHHH-----CCCCEEEEECCCCCCCCCCCC----CCCEEEEEEEEEEEE T ss_conf 3435687888999972-----699869995266315899987----535046640058971 No 133 >cd00144 PP2Ac Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, including PP1, PP2A and PP2B (calcineurin) family members. Probab=84.60 E-value=3.2 Score=21.36 Aligned_cols=94 Identities=19% Similarity=0.388 Sum_probs=65.1 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCEEEEECC----------HHHHHHHHHHCCCHHHHHHHHHCCCCCCCEEECCCCCEEEC Q ss_conf 88989999999999999669980485327----------67854454416711355665420345542130257520100 Q gi|254781004|r 114 FSSDELAQIEKKMQEIIARDSPFLKQYLS----------HEKARELFESKKEAYKVEILESIPAKENVTLYRQGEWFDLC 183 (652) Q Consensus 114 it~~dl~~Ie~~M~~ii~~~~pi~r~~vs----------~~eA~~~F~~~~~~~K~~li~~~~~~~~v~~y~~g~~~Dlc 183 (652) ++++|+..|-++.++|.++.-.+.+..-+ ..+.+.+|+..|.+ +...+.-.|||+| T Consensus 1 l~~~~i~~l~~~~~~il~~ep~ll~i~~pv~VvGDlHG~~~DL~~i~~~~g~p------------~~~~ylFLGDYVD-- 66 (269) T cd00144 1 LSEEEVLLLIRAVKEIFRQEPNLVEVSAPVTVCGDIHGQFYDLLRLFRKGGFP------------PDDNYLFLGDYVD-- 66 (269) T ss_pred CCHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEEECCCCHHHHHHHHHHCCCC------------CCCCEEECCCCCC-- T ss_conf 99899999999999999859985996699899973889999999999965999------------8553883465306-- Q ss_pred CCCC-CCHHHHHHHHHHHHHHHHHC-CCCCCCCCEEEEEEE Q ss_conf 0235-52144422233333321110-157756420689999 Q gi|254781004|r 184 RGPH-VRSTGQVKKFFKLMKVAGAY-WRGDSNRPMLSRIYG 222 (652) Q Consensus 184 ~Gph-vpsTg~ik~~fkL~~~sgay-w~gd~~~~~l~Riyg 222 (652) ||++ |.---.+-+ .|+..=.-.| -||+..-..+...|| T Consensus 67 RG~~slEvi~lL~~-lKi~~P~~v~LLRGNHE~~~~n~~yg 106 (269) T cd00144 67 RGPQSIEVILLLFA-LKILYPNNFFLLRGNHESPSINKIYG 106 (269) T ss_pred CCCCCCHHHHHHHH-HHHHCCCEEEEECCCCCCCCCCCCCC T ss_conf 77877499999999-99859884896077762620123347 No 134 >PRK07860 NADH dehydrogenase subunit G; Validated Probab=83.28 E-value=3.6 Score=20.99 Aligned_cols=80 Identities=19% Similarity=0.225 Sum_probs=52.9 Q ss_pred CCCEEEECCCCCEEECCCCCCHHHHHHHHCCCCC---------C----CEEEEEECCEEEEC---CEEECCCEEEEECCC Q ss_conf 8870899289988741898889999986361101---------1----28999989889977---453036707986289 Q gi|254781004|r 2 PSDIKLTFPDGSIKNFPVHATGSDVAESISRSLA---------K----KAVAVAINGKVCDL---SDPVREGSIEIITPE 65 (652) Q Consensus 2 ~~~i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~---------~----~~vaa~vng~l~dL---~~~l~d~~i~~i~~~ 65 (652) +++|+||. ||+..+++.|+|+++.|+..+-..+ . ...++.|+|.-+-. .+|+.|+-+-....+ T Consensus 14 ~~mv~l~I-dg~~v~v~~g~til~aa~~~Gi~IP~fC~h~~L~~~g~CR~ClVev~g~~~~~~sC~tpv~~Gm~V~T~~~ 92 (809) T PRK07860 14 PELVTLTI-DGVEVSVPKGTLVIRAAELIGIQIPRFCDHPLLDPVGACRQCLVEVEGQRKPLASCTTTVTDGMVVRTQLT 92 (809) T ss_pred CCCEEEEE-CCEEEEECCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCEEEECCC T ss_conf 88069999-99999969999899999974998565679999598886645289989996788061876899988874389 Q ss_pred CHHHHHHHHHHHHHHHH Q ss_conf 99999999998999999 Q gi|254781004|r 66 DPRSLAVIRHSCAHIMA 82 (652) Q Consensus 66 ~~eg~~i~~hS~ahlL~ 82 (652) |+...+.=+--+-+||+ T Consensus 93 s~~~~~~r~~vlE~ll~ 109 (809) T PRK07860 93 SPVADKAQHGVMELLLI 109 (809) T ss_pred CHHHHHHHHHHHHHHHH T ss_conf 99999999999999985 No 135 >COG0072 PheT Phenylalanyl-tRNA synthetase beta subunit [Translation, ribosomal structure and biogenesis] Probab=83.05 E-value=0.95 Score=24.99 Aligned_cols=94 Identities=20% Similarity=0.249 Sum_probs=60.0 Q ss_pred HHHHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHH Q ss_conf 78999999999998876-89638555432443100011210012555420100145677782377434312577776520 Q gi|254781004|r 272 GWKIFQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNH 350 (652) Q Consensus 272 G~~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~ 350 (652) +..-.+...+.+|..+. .||+||.|-.+.+.+.-..-+.-. ++ ..... ..-..+.-++|++--|+-....+. T Consensus 349 ~~~~~~~~~r~vr~~l~~~G~~Evitysl~s~e~~~~~~~~~---~~--~~~l~--NPiS~e~s~mR~sLlp~LL~~~~~ 421 (650) T COG0072 349 GLTPLQKFRRKVRRALVGLGFQEVITYSLTSPEEAKLFGLEN---DE--ALELA--NPISEEYSVLRTSLLPGLLEALSY 421 (650) T ss_pred CCCHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHCCCC---CC--CEEEC--CCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 656678999999999985897599861258989999742377---75--26856--867623888887778889999998 Q ss_pred CCCCHHHCC-HHHHHHEEEECCCCC Q ss_conf 122223104-320001000127876 Q gi|254781004|r 351 GLKSYRELP-VRLAEFGSVYRNEPS 374 (652) Q Consensus 351 ~~~SYrdLP-lrl~e~~~~~R~E~s 374 (652) ..+ |..| .++||+|.+|..... T Consensus 422 N~~--r~~~~~~iFEiG~v~~~~~~ 444 (650) T COG0072 422 NKN--RKNPDVRIFEIGDVFVKDEE 444 (650) T ss_pred HHC--CCCCCEEEEEEEEEEECCCC T ss_conf 753--36886468866745705886 No 136 >cd01666 TGS_DRG_C TGS_DRG_C: DRG (developmentally regulated GTP-binding protein) represents a family of GTP-binding proteins that includes two members, DRG1 and DRG2. DRG1 and DRG2 have a C-terminal TGS domain (named after the ThrRS, GTPase, and SpoT proteins where it occurs) with a predominantly beta-sheet structure. The function of TGS is unknown but its presence in two types of regulatory proteins (the DRG GTPases and guanosine polyphosphate phosphohydrolases/synthetases) suggests a ligand (most likely nucleotide)-binding, regulatory role. Probab=82.79 E-value=1.7 Score=23.22 Aligned_cols=51 Identities=16% Similarity=0.205 Sum_probs=38.6 Q ss_pred CEEECCCCCCHHHHHHHHCCCCCCCEEEEEE-------CCEEEECCEEECCCE-EEEEC Q ss_conf 8874189888999998636110112899998-------988997745303670-79862 Q gi|254781004|r 13 SIKNFPVHATGSDVAESISRSLAKKAVAVAI-------NGKVCDLSDPVREGS-IEIIT 63 (652) Q Consensus 13 ~~~~~~~g~t~~dia~~i~~~~~~~~vaa~v-------ng~l~dL~~~l~d~~-i~~i~ 63 (652) ...-...|+|+.|+|..|..++.++.--|.| +++-+.+++.|+|.+ |+++. T Consensus 17 ~P~il~~gsTV~d~~~~iH~~l~~~fkyA~VWG~S~K~~~QrVG~~H~L~D~DVV~Iv~ 75 (75) T cd01666 17 EPVILRRGSTVEDVCNKIHKDLVKQFKYALVWGSSVKHSPQRVGLDHVLEDEDVVQIVK 75 (75) T ss_pred CCEEECCCCCHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCEECCCCEEEEEC T ss_conf 89785699959999999999998547889995677573853738985736898999849 No 137 >COG1034 NuoG NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G) [Energy production and conversion] Probab=81.29 E-value=4.3 Score=20.50 Aligned_cols=76 Identities=17% Similarity=0.281 Sum_probs=50.2 Q ss_pred CEEEECCCCCEEECCCCCCHHHHHHHHCCCCCC-------------CEEEEEECC--EEEEC-CEEECCCEEEEECCCCH Q ss_conf 708992899887418988899999863611011-------------289999898--89977-45303670798628999 Q gi|254781004|r 4 DIKLTFPDGSIKNFPVHATGSDVAESISRSLAK-------------KAVAVAING--KVCDL-SDPVREGSIEIITPEDP 67 (652) Q Consensus 4 ~i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~-------------~~vaa~vng--~l~dL-~~~l~d~~i~~i~~~~~ 67 (652) ++||+. ||+..++++|.|+++.|.+-+...+. ...++.|+| ++.-- .+|..|+.+ +..+|+ T Consensus 1 m~tI~I-DG~ei~v~~g~tvLqAa~~aGi~IP~fCyh~~ls~~GaCRmClVEveg~~k~~~SC~tpv~dGM~--I~T~s~ 77 (693) T COG1034 1 MVTITI-DGKEIEVPEGETVLQAAREAGIDIPTFCYHPRLSIAGACRMCLVEVEGAPKLVASCATPVTDGMV--ISTNSE 77 (693) T ss_pred CEEEEE-CCEEEECCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCEE--EECCCH T ss_conf 939998-99898618972899999985998772103578874453257799966888625301650678818--852899 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999999998999999 Q gi|254781004|r 68 RSLAVIRHSCAHIMA 82 (652) Q Consensus 68 eg~~i~~hS~ahlL~ 82 (652) +....-+--+.+||. T Consensus 78 ~vk~~R~~vmE~LLi 92 (693) T COG1034 78 EVKKAREGVMEFLLI 92 (693) T ss_pred HHHHHHHHHHHHHHH T ss_conf 999999999999981 No 138 >KOG0556 consensus Probab=81.26 E-value=3.7 Score=20.91 Aligned_cols=93 Identities=24% Similarity=0.403 Sum_probs=56.4 Q ss_pred HHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCC Q ss_conf 999999999998876-8963855543244310001121001255542010014567778237743431257777652012 Q gi|254781004|r 274 KIFQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGL 352 (652) Q Consensus 274 ~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~ 352 (652) +|-.-+...-|+.++ +||.||.||-|..+. | --+.|.|.+..... |. ...+.+ T Consensus 229 riq~gvc~~FRe~L~~kgF~EIhTpKli~as----S----EGGanvF~v~Yfk~-----------~A-------~LAQSP 282 (533) T KOG0556 229 RIQAGVCFAFREYLRSKGFVEIHTPKLIGAS----S----EGGANVFRVSYFKQ-----------KA-------YLAQSP 282 (533) T ss_pred EHHHHHHHHHHHHHHHCCCCEECCCCCCCCC----C----CCCCEEEEEEECCC-----------CC-------HHHCCH T ss_conf 0688899999999975686043263201466----7----77722689986067-----------50-------320585 Q ss_pred CCHHHCCH-----HHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EEC Q ss_conf 22231043-----20001000127876452000105635334421-010 Q gi|254781004|r 353 KSYRELPV-----RLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFC 395 (652) Q Consensus 353 ~SYrdLPl-----rl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~ 395 (652) .=||++-+ |.|++|.+||-|-|-+- .-.-||+--|.. -|| T Consensus 283 QLyKQMaI~gdf~rVyeIGpVfRAEdSnth---RhltEFvGLD~EMaf~ 328 (533) T KOG0556 283 QLYKQMAICGDFERVYEIGPVFRAEDSNTH---RHLTEFVGLDLEMAFN 328 (533) T ss_pred HHHHHHHHHCCHHHEEEECCEEECCCCCHH---HHHHHHHCCCHHHHHH T ss_conf 888778876061104652355660446416---6667761742254777 No 139 >COG5019 CDC3 Septin family protein [Cell division and chromosome partitioning / Cytoskeleton] Probab=80.73 E-value=4.5 Score=20.37 Aligned_cols=20 Identities=15% Similarity=0.275 Sum_probs=8.2 Q ss_pred HHHHHHHHHHHHHHCCCEEE Q ss_conf 89999999999998798899 Q gi|254781004|r 564 AVEYAQEIANLLKSHHLSIE 583 (652) Q Consensus 564 ~~eya~~i~~~L~~~girV~ 583 (652) ..++-+.|.+.|...+|+|. T Consensus 180 l~~~K~~I~~~i~~~nI~vf 199 (373) T COG5019 180 LAEFKERIREDLEQYNIPVF 199 (373) T ss_pred HHHHHHHHHHHHHHHCCCEE T ss_conf 99999999999998187622 No 140 >PTZ00239 serine/threonine protein phosphatase 2A; Provisional Probab=80.50 E-value=4.5 Score=20.32 Aligned_cols=99 Identities=15% Similarity=0.198 Sum_probs=69.9 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECC----------HHHHHHHHHHCCCHHHHHHHHHCCCCCCCEEECCCCC Q ss_conf 888888989999999999999669980485327----------6785445441671135566542034554213025752 Q gi|254781004|r 110 KEQPFSSDELAQIEKKMQEIIARDSPFLKQYLS----------HEKARELFESKKEAYKVEILESIPAKENVTLYRQGEW 179 (652) Q Consensus 110 ~~~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs----------~~eA~~~F~~~~~~~K~~li~~~~~~~~v~~y~~g~~ 179 (652) ....++++|+..|-+++++|.++.-.+.+...+ ..+.+.+|+..|.+ ......-.||| T Consensus 12 ~g~~l~~~~i~~lc~~~~~i~~~Ep~ll~i~~Pv~VvGDiHGq~~DLl~if~~~G~p------------~~~~YlFLGDY 79 (303) T PTZ00239 12 NGGCLPERDLKLVCERAKEIFLEESNVQPVKAPVNVCGDIHGQFYDLQALMKEGGDI------------PEHNYIFIGDF 79 (303) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEEECCCCHHHHHHHHHHCCCC------------CCCCEEECCCC T ss_conf 489999999999999999999859975885799799944788989999999973999------------86658847863 Q ss_pred EEECCCCCCCHHH-HHHHHHHHHHHHHHC-CCCCCCCCEEEEEEEE Q ss_conf 0100023552144-422233333321110-1577564206899999 Q gi|254781004|r 180 FDLCRGPHVRSTG-QVKKFFKLMKVAGAY-WRGDSNRPMLSRIYGT 223 (652) Q Consensus 180 ~Dlc~GphvpsTg-~ik~~fkL~~~sgay-w~gd~~~~~l~Riyg~ 223 (652) +| ||++=--|- .+-+ +|+..=...| -||+..-.++.+.||. T Consensus 80 VD--RG~~s~Evi~lL~a-lKv~yP~~v~LLRGNHE~~~~n~~yGF 122 (303) T PTZ00239 80 VD--RGYNSVETMEYLLC-LKVKYPGNIVLLRGNHESRQCTQVYGF 122 (303) T ss_pred CC--CCCCCCHHHHHHHH-HHHHCCCCEEECCCCCCCCCCCCCCCC T ss_conf 26--78874289999999-997589827860567747355666795 No 141 >PRK08493 NADH dehydrogenase subunit G; Validated Probab=80.10 E-value=4.7 Score=20.23 Aligned_cols=69 Identities=16% Similarity=0.119 Sum_probs=44.6 Q ss_pred CEEEECCCCCEEECCCCCCHHHHHHHHCCCCC-------------CCEEEEEECCEEEEC-CEEECCCEEEEECCCCHHH Q ss_conf 70899289988741898889999986361101-------------128999989889977-4530367079862899999 Q gi|254781004|r 4 DIKLTFPDGSIKNFPVHATGSDVAESISRSLA-------------KKAVAVAINGKVCDL-SDPVREGSIEIITPEDPRS 69 (652) Q Consensus 4 ~i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~-------------~~~vaa~vng~l~dL-~~~l~d~~i~~i~~~~~eg 69 (652) +|+||+ ||+..+++.|+|+++.|+..+-..+ =...++.|+|++.-- ++|+.|+-+ |..+|++. T Consensus 1 mv~i~I-dg~~~~~~~g~til~aa~~~gi~IP~lCy~~~l~p~g~Cr~C~VEv~G~~~~sC~t~v~~GM~--V~T~s~~v 77 (819) T PRK08493 1 MITINI-NGKEIECQEGEYILNAARRNGIFIPAICYLSGCSPTLACRLCMVEADGKRVYSCNTKAKEGMN--ILTNTPNL 77 (819) T ss_pred CCEEEE-CCEEEEECCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCE--EEECCHHH T ss_conf 918999-999999699997999999739985614489992968877663898789482674888899999--99799999 Q ss_pred HHHHHHH Q ss_conf 9999998 Q gi|254781004|r 70 LAVIRHS 76 (652) Q Consensus 70 ~~i~~hS 76 (652) .+. |+. T Consensus 78 ~~~-r~~ 83 (819) T PRK08493 78 NDE-RNA 83 (819) T ss_pred HHH-HHH T ss_conf 999-999 No 142 >cd00754 MoaD MoaD family. Members of this family are involved in biosynthesis of the molybdenum cofactor (Moco), an essential cofactor of a diverse group of redox enzymes. Moco biosynthesis is an evolutionarily conserved pathway present in eubacteria, archaea and eukaryotes. Moco contains a tricyclic pyranopterin, termed molybdopterin (MPT), that contains the cis-dithiolene group responsible for molybdenum ligation. This dithiolene group is generated by MPT synthase, the second major step in Moco biosynthesis. MPT synthase consists of a large (MoeE) and small (MoaD) subunit. The small subunit is inserted into the lare subunit to form the active site. The small subunit, which is structurally similar to ubiquitin, contains a C-terminal thiocarboxylated glycine residue that serves as a sulfur donor for the synthesis of the MPT dithiolene group. Probab=79.59 E-value=4.8 Score=20.12 Aligned_cols=52 Identities=23% Similarity=0.450 Sum_probs=38.1 Q ss_pred EEECCCCCCHHHHHHHHCCCCCC------CEEEEEECCEEEECCEEECC-CEEEEECCC Q ss_conf 87418988899999863611011------28999989889977453036-707986289 Q gi|254781004|r 14 IKNFPVHATGSDVAESISRSLAK------KAVAVAINGKVCDLSDPVRE-GSIEIITPE 65 (652) Q Consensus 14 ~~~~~~g~t~~dia~~i~~~~~~------~~vaa~vng~l~dL~~~l~d-~~i~~i~~~ 65 (652) ..+++.++|+.++.+.+...+.. +.+...||++..+.+++|+| ++|.|+..- T Consensus 19 ~~~~~~~~Tv~~L~~~L~~~~~~~~~~~~~~~~iavN~~~~~~~~~l~~gdEva~~PPv 77 (80) T cd00754 19 ELELPEGATVGELLDALEARYPGLLEELLARVRIAVNGEYVRLDTPLKDGDEVAIIPPV 77 (80) T ss_pred EEECCCCCCHHHHHHHHHHHCCHHHHHHHHCEEEEECCEECCCCCCCCCCCEEEEECCC T ss_conf 99889998899999999987817676431040998887983677156999999997998 No 143 >PRK06437 hypothetical protein; Provisional Probab=76.66 E-value=5.8 Score=19.61 Aligned_cols=61 Identities=16% Similarity=0.168 Sum_probs=45.0 Q ss_pred CCCCEEEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECCEEE-CCCEEEEEC Q ss_conf 988708992899887418988899999863611011289999898899774530-367079862 Q gi|254781004|r 1 MPSDIKLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLSDPV-REGSIEIIT 63 (652) Q Consensus 1 M~~~i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~~~l-~d~~i~~i~ 63 (652) |-++|+|.-.-.+..+++...|+.||.++++- ...-.++.|||...--...+ ++.++.|+- T Consensus 1 ~~~MI~vkG~~~k~i~id~e~~v~Di~k~~~l--dee~yvvIvNGsP~~~D~~Vk~eDdv~~LE 62 (67) T PRK06437 1 MIAMIRVKGHINKTIEIDHELTVNDIIKDLGL--DEEEYVVIVNGSPVLEDHNVKKEDDVLILE 62 (67) T ss_pred CEEEEEEECEECEEEEEECCCCHHHHHHHCCC--CCCCEEEEECCCCCCCCCCCCCCCCEEEEE T ss_conf 95799981100306984401328789998199--921079998698544445348556789999 No 144 >COG2024 Phenylalanyl-tRNA synthetase alpha subunit (archaeal type) [Translation, ribosomal structure and biogenesis] Probab=76.55 E-value=0.99 Score=24.89 Aligned_cols=179 Identities=21% Similarity=0.266 Sum_probs=84.4 Q ss_pred CCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCC Q ss_conf 12222310432000100012787645200010563533442101053767887899999988766552133210001465 Q gi|254781004|r 351 GLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDFGFEKIMVKLSTR 430 (652) Q Consensus 351 ~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~g~~~~~~~ls~~ 430 (652) .+.--+++|+.++-|..|||-|++--.+-|| |---|-.....+.. .+..-.....-.+..|||+.+..+-... T Consensus 199 ~i~~r~~~PlklFSIDRCFRREQ~ED~shLm-----tYhSASCVvvde~v--tvD~GKaVAEglL~qfGFe~F~FrpDEK 271 (536) T COG2024 199 EILKREDPPLKLFSIDRCFRREQREDASHLM-----TYHSASCVVVDEDV--TVDDGKAVAEGLLRQFGFEKFRFRPDEK 271 (536) T ss_pred HHHHCCCCCCEEEEHHHHHHHHHHCCHHHHH-----HHCCCEEEEECCCC--CCCCCHHHHHHHHHHHCCCCEEECCCCC T ss_conf 8872348984164256776655310156664-----32034079975864--4354389999999971940035466200 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCC-CCCCCCCCCHHHHCCCCCCCE Q ss_conf 3112586035778999999999850212125667520555555455432110021103-310012110111022346741 Q gi|254781004|r 431 PEKRVGSDALWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYILKDAIG-RDWQCGTIQVDFNLPSRFNAF 509 (652) Q Consensus 431 ~~~~~g~~~~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~~~d~~g-r~~~~~tiq~df~~p~~f~l~ 509 (652) ..++.--+.. .+--.|-||+.-... ..+ .+....|+. ...|..+.- T Consensus 272 ~SKYYvP~TQ-----------------------------TEVyAyHPkL~gs~~-kysdgWiEiATFG--lYSP~ALae- 318 (536) T COG2024 272 KSKYYVPGTQ-----------------------------TEVYAYHPKLVGSIE-KYSDGWIEIATFG--LYSPIALAE- 318 (536) T ss_pred CCCCCCCCCC-----------------------------CEEEEECCCCCCCCC-CCCCCCEEEEEEC--CCCHHHHHH- T ss_conf 3553479963-----------------------------047774540036521-2588708999603--567277987- Q ss_pred EEEECCCEEEEEEEEECCCH-HHHHHHHHHHHH---HCC---CCCCC-CCC------EEEEEEECCHHHHHHHHHHHHHH Q ss_conf 67317972137996403410-378999999997---387---08873-378------08999857966899999999999 Q gi|254781004|r 510 YVNSHSEKCHPVMIHRAVFG-SIERFIGIMIEN---FKG---NLPLW-LSP------IQAIVTTITSSAVEYAQEIANLL 575 (652) Q Consensus 510 y~~~dg~~~~pvmihr~~~G-s~eR~ia~liE~---~~g---~~P~w-LAP------~QV~Iipi~e~~~eya~~i~~~L 575 (652) | .--.||| -+| ++||+-.+|--. -.. .++.| +|- +.|--.|+...-...|++|...+ T Consensus 319 Y-----~Id~pVM----NLGlGVERlaMIl~g~~DVR~mvYpqi~ew~lSD~diA~mi~v~~~P~t~~g~~~a~~vV~~~ 389 (536) T COG2024 319 Y-----GIDYPVM----NLGLGVERLAMILHGADDVRSMVYPQIYEWRLSDRDIARMIRVDKVPATSEGRRLAREVVRTC 389 (536) T ss_pred C-----CCCCCEE----ECCHHHHHHHHHHHCCHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH T ss_conf 2-----9997012----032208999999817447766513453445666588886640045667747899999999999 Q ss_pred HHC Q ss_conf 987 Q gi|254781004|r 576 KSH 578 (652) Q Consensus 576 ~~~ 578 (652) ..+ T Consensus 390 ~~h 392 (536) T COG2024 390 EAH 392 (536) T ss_pred HHH T ss_conf 970 No 145 >cd01206 Homer Homer type EVH1 domain. Homer is a synaptic scaffolding protein, involved in neuronal signaling. It contains an EVH1 domain, which binds to both neurotransmitter receptors, such as the metabotropic glutamate receptor (mGluR) and to other scaffolding proteins via PPXXF motifs, in order to target them to the synaptic junction. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains. Probab=74.68 E-value=3.1 Score=21.51 Aligned_cols=34 Identities=21% Similarity=0.542 Sum_probs=28.1 Q ss_pred CCCCCCCCCEEEEEEEEEECCHHHHHHHHHHHHHHHHH Q ss_conf 01577564206899999837988999999999975320 Q gi|254781004|r 207 YWRGDSNRPMLSRIYGTAWNTQQELTQYLYFLEESEKR 244 (652) Q Consensus 207 yw~gd~~~~~l~Riyg~af~~~~~l~~~~~~~eea~~r 244 (652) -|+..-.|. +||.-|.++++|.++-..++|+++. T Consensus 76 QW~D~Rant----VyGLgFsSe~e~~~Fa~~f~e~~ea 109 (111) T cd01206 76 QWADSRANT----VYGLGFSSEQQLTKFAEKFQEVKEA 109 (111) T ss_pred CCCCCCCCC----EEECCCCCHHHHHHHHHHHHHHHHH T ss_conf 424344452----6723658889999999999998875 No 146 >COG3383 Uncharacterized anaerobic dehydrogenase [General function prediction only] Probab=74.08 E-value=6.8 Score=19.12 Aligned_cols=69 Identities=23% Similarity=0.306 Sum_probs=46.7 Q ss_pred CCCC-EEEECCCCCEEECCCCCCHHHHHHHHCCCCC-------------CCEEEEEECCEEEEC-CEEECCCEEEEECCC Q ss_conf 9887-0899289988741898889999986361101-------------128999989889977-453036707986289 Q gi|254781004|r 1 MPSD-IKLTFPDGSIKNFPVHATGSDVAESISRSLA-------------KKAVAVAINGKVCDL-SDPVREGSIEIITPE 65 (652) Q Consensus 1 M~~~-i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~-------------~~~vaa~vng~l~dL-~~~l~d~~i~~i~~~ 65 (652) |+.. |++|+ ||...+++.|+|+++.+..-+-..+ =+..++.+||++.-- +++++|+-+ +..+ T Consensus 1 m~~~~i~vti-dg~~~~v~~G~tiL~a~~~~gI~iP~iCy~~~l~pi~sCd~ClVEidG~l~rsCsT~v~dGm~--v~t~ 77 (978) T COG3383 1 MAEKMITVTI-DGRSIEVEEGTTILRAANRNGIEIPHICYHESLGPIGSCDTCLVEIDGKLVRSCSTPVEDGMV--VRTN 77 (978) T ss_pred CCCEEEEEEE-CCEEEECCCCHHHHHHHHHCCCCCCCEECCCCCCCCCCCCEEEEEECCCEECCCCCCCCCCCE--EECC T ss_conf 9730699998-786875278708999999659855512236787865555368998568261321554267828--9616 Q ss_pred CHHHHHH Q ss_conf 9999999 Q gi|254781004|r 66 DPRSLAV 72 (652) Q Consensus 66 ~~eg~~i 72 (652) ++.-..+ T Consensus 78 s~rvk~~ 84 (978) T COG3383 78 SERVKEA 84 (978) T ss_pred CHHHHHH T ss_conf 5889999 No 147 >PRK12814 putative NADPH-dependent glutamate synthase small subunit; Provisional Probab=73.94 E-value=4.4 Score=20.40 Aligned_cols=30 Identities=23% Similarity=0.174 Sum_probs=26.3 Q ss_pred CCCCEEEECCCCCEEECCCCCCHHHHHHHHCC Q ss_conf 98870899289988741898889999986361 Q gi|254781004|r 1 MPSDIKLTFPDGSIKNFPVHATGSDVAESISR 32 (652) Q Consensus 1 M~~~i~I~lpDG~~~~~~~g~t~~dia~~i~~ 32 (652) |+ +|+|++ ||+....+.|.|+++.|++.+- T Consensus 1 m~-~i~~~i-~g~~~~~~~g~til~~a~~~gi 30 (652) T PRK12814 1 MN-TISLTI-NGRSVTAAPGTSILEAAASAGI 30 (652) T ss_pred CC-CEEEEE-CCEEEEECCCCCHHHHHHHCCC T ss_conf 97-069998-8989876899809999997599 No 148 >PTZ00244 serine/threonine-protein phosphatase PP1; Provisional Probab=73.13 E-value=7.1 Score=18.96 Aligned_cols=98 Identities=15% Similarity=0.268 Sum_probs=69.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCEEEEECC----------HHHHHHHHHHCCCHHHHHHHHHCCCCCCCEEECCCCCE Q ss_conf 88888989999999999999669980485327----------67854454416711355665420345542130257520 Q gi|254781004|r 111 EQPFSSDELAQIEKKMQEIIARDSPFLKQYLS----------HEKARELFESKKEAYKVEILESIPAKENVTLYRQGEWF 180 (652) Q Consensus 111 ~~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs----------~~eA~~~F~~~~~~~K~~li~~~~~~~~v~~y~~g~~~ 180 (652) ..-++++|+..|-.++++|.++.-.+.+..-+ ..+.+++|+..|.| +..+....|||+ T Consensus 22 ~~~i~~~~i~~L~~~~~~i~~~Ep~ll~i~~Pv~VvGDiHGq~~DLl~if~~~G~P------------~~~~YlFLGDYV 89 (294) T PTZ00244 22 QILIREEDIRAVLTEVREIFMSQPMLLEIRPPVRVCGDTHGQYYDLLRIFEKCGFP------------PYSNYLFLGDYV 89 (294) T ss_pred CCCCCHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEEECCCCHHHHHHHHHHCCCC------------CCCCEEECCCCC T ss_conf 20359999999999999999859985885799799988999989999999974999------------865688215535 Q ss_pred EECCCCCC-CHHHHHHHHHHHHHHHHHC-CCCCCCCCEEEEEEEE Q ss_conf 10002355-2144422233333321110-1577564206899999 Q gi|254781004|r 181 DLCRGPHV-RSTGQVKKFFKLMKVAGAY-WRGDSNRPMLSRIYGT 223 (652) Q Consensus 181 Dlc~Gphv-psTg~ik~~fkL~~~sgay-w~gd~~~~~l~Riyg~ 223 (652) | ||++= .---.+-+ .|+.-=.-.| -||+..-..+.+.||. T Consensus 90 D--RG~~SlEvi~lL~a-lKi~yP~~v~LLRGNHE~~~~n~~yGF 131 (294) T PTZ00244 90 D--RGKHSVETITLQFC-YKIVYPENFFLLRGNHECASINKMYGF 131 (294) T ss_pred C--CCCCCCHHHHHHHH-HHHHCCCCEEECCCCCCCCCCCCCCCC T ss_conf 7--77653099999999-986488808962787647355677886 No 149 >pfam10752 DUF2533 Protein of unknown function (DUF2533). This family of proteins with unknown function appears to be restricted to Bacillus spp. Probab=69.54 E-value=7.4 Score=18.83 Aligned_cols=41 Identities=20% Similarity=0.436 Sum_probs=35.7 Q ss_pred ECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHH Q ss_conf 08888889899999999999996699804853276785445 Q gi|254781004|r 109 DKEQPFSSDELAQIEKKMQEIIARDSPFLKQYLSHEKAREL 149 (652) Q Consensus 109 ~~~~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA~~~ 149 (652) ..+.+||-+.+..|.++|+++.+..+--.|+.|+.+-..+. T Consensus 40 ~~g~~Fs~d~IN~iTk~mNelAk~givP~Rk~VT~eMV~EY 80 (84) T pfam10752 40 RQGKPFTTDAINEITKQMNQLAKKGIVPTRKLVTVEMVKEY 80 (84) T ss_pred HCCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH T ss_conf 77998729999999999999998487874044239999999 No 150 >PRK09129 NADH dehydrogenase subunit G; Validated Probab=68.96 E-value=8.8 Score=18.35 Aligned_cols=76 Identities=11% Similarity=0.228 Sum_probs=47.7 Q ss_pred CEEEECCCCCEEECCCCCCHHHHHHHHCCCCCC-------------CEEEEEECCE--EEEC-CEEECCCEEEEECCCCH Q ss_conf 708992899887418988899999863611011-------------2899998988--9977-45303670798628999 Q gi|254781004|r 4 DIKLTFPDGSIKNFPVHATGSDVAESISRSLAK-------------KAVAVAINGK--VCDL-SDPVREGSIEIITPEDP 67 (652) Q Consensus 4 ~i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~-------------~~vaa~vng~--l~dL-~~~l~d~~i~~i~~~~~ 67 (652) +|+||. ||+..++++|+|+++.|+..+-..+. ...++.|+|. +.-- ++|+.|+-+ |..+|+ T Consensus 1 mv~~~i-dg~~~~~~~g~til~aa~~~gi~iP~~C~~~~l~~~g~Cr~C~Vev~~~~~~~~aC~~~~~~gm~--v~t~~~ 77 (777) T PRK09129 1 MVEIEI-DGKKVEVPEGSMVIEAADKAGIYIPHFCYHKKLSIAANCRMCLVEVEKAPKPLPACATPVTDGMK--VRTRSE 77 (777) T ss_pred CCEEEE-CCEEEEECCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCE--EEECCH T ss_conf 948999-99999969999699999975998775678999498985714799989984656141878899988--994999 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999999998999999 Q gi|254781004|r 68 RSLAVIRHSCAHIMA 82 (652) Q Consensus 68 eg~~i~~hS~ahlL~ 82 (652) +..+.=+.-+-+||+ T Consensus 78 ~~~~~r~~~~e~ll~ 92 (777) T PRK09129 78 KAVKAQKSVMEFLLI 92 (777) T ss_pred HHHHHHHHHHHHHHH T ss_conf 999999999999985 No 151 >KOG0555 consensus Probab=68.96 E-value=8.2 Score=18.56 Aligned_cols=113 Identities=18% Similarity=0.330 Sum_probs=70.9 Q ss_pred HHHHHHHHHHHHHHH-HCCCEEEECCCCCCCC------CCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHH Q ss_conf 899999999999887-6896385554324431------000112100125554201001456777823774343125777 Q gi|254781004|r 273 WKIFQTLISYMRRKI-KDDYEEINTPQVLDQH------LWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHV 345 (652) Q Consensus 273 ~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~------Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~ 345 (652) ++++..+-+.+|..+ .+||.||..|+|.... |++ +|.|+++-|..++.. +.|- |.-|.- T Consensus 243 LK~Ra~~lr~~Rd~y~~~~ytEVtPPtmVQTQVEGGsTLFk----ldYyGEeAyLTQSSQ--------LYLE-tclpAl- 308 (545) T KOG0555 243 LKARAALLRAMRDHYFERGYTEVTPPTMVQTQVEGGSTLFK----LDYYGEEAYLTQSSQ--------LYLE-TCLPAL- 308 (545) T ss_pred HHHHHHHHHHHHHHHHHCCCEECCCCCEEEEEECCCCEEEE----ECCCCCHHHCCCHHH--------HHHH-HHHHHC- T ss_conf 99999999999988874484245797157887517604873----113574033022358--------8898-765520- Q ss_pred HHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCE-EECCHHHHHHHHHHH-HHHHHHHHH Q ss_conf 76520122223104320001000127876452000105635334421-010537678878999-999887665 Q gi|254781004|r 346 AVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAH-VFCTKEQMFNECLKI-HNLIISIYK 416 (652) Q Consensus 346 ~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah-~F~~~eq~~~e~~~~-~~~~~~v~~ 416 (652) =..|-|+-.||-|+|.+|. -.-|+|--+|. -|.+-|+..+.+... ++.+..++. T Consensus 309 --------------gdvy~I~~SyRAEkSrTRR---HLsEytHVEaE~afltfd~ll~~iE~lvc~~vdr~l~ 364 (545) T KOG0555 309 --------------GDVYCIQQSYRAEKSRTRR---HLSEYTHVEAECAFLTFDDLLDRIEALVCDSVDRLLE 364 (545) T ss_pred --------------CCEEEECHHHHHHHHHHHH---HHHHHEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf --------------7626732756655666666---5533111023246364889999999999999999986 No 152 >pfam07801 DUF1647 Protein of unknown function (DUF1647). The sequences making up this family are all derived from hypothetical proteins expressed by C. elegans. The region in question is approximately 160 amino acids long. The GO annotation for this protein indicates the protein to be involved in nematode larval development and to have a positive regulation on growth rate. Probab=68.72 E-value=8.9 Score=18.32 Aligned_cols=64 Identities=19% Similarity=0.373 Sum_probs=48.9 Q ss_pred CCEEEECCEEEC-CCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCEEEEEECCCCCCHHHHHHH Q ss_conf 988997745303-6707986289999999999989999999999977998899724358855788408888889899999 Q gi|254781004|r 44 NGKVCDLSDPVR-EGSIEIITPEDPRSLAVIRHSCAHIMAEAVQSIWPEVQVAIGPIIEDGFYYDFDKEQPFSSDELAQI 122 (652) Q Consensus 44 ng~l~dL~~~l~-d~~i~~i~~~~~eg~~i~~hS~ahlL~~Av~~l~p~~kl~igp~i~~GfYyD~~~~~~it~~dl~~I 122 (652) -+++.||+.+++ +.+|-||+..|.+-.....+|. .+|++.||..|+.. |. -.+++..+.+| T Consensus 46 ~~~~v~l~~~~~n~~~~VFVSAtS~dH~~~~~~s~-----~sirk~yP~~K~IL---------Y~----L~ls~~~i~~L 107 (142) T pfam07801 46 PESFVDLSDTIRNESDVVFVSATSDDHLNNSMKSY-----KSIRKYYPNHKYIL---------YS----LNLSEAYISKL 107 (142) T ss_pred CCCCCCCCHHHCCCCCEEEEEECCHHHHHHHHHHH-----HHHHHHCCCCCEEE---------EE----CCCCHHHHHHH T ss_conf 75435300110588750899704548899999999-----99998688865799---------84----68888899873 Q ss_pred HHH Q ss_conf 999 Q gi|254781004|r 123 EKK 125 (652) Q Consensus 123 e~~ 125 (652) ++. T Consensus 108 ~~~ 110 (142) T pfam07801 108 PKN 110 (142) T ss_pred CCC T ss_conf 437 No 153 >PRK08364 sulfur carrier protein ThiS; Provisional Probab=67.75 E-value=4.3 Score=20.48 Aligned_cols=50 Identities=18% Similarity=0.312 Sum_probs=36.4 Q ss_pred CEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECCEEECCCE-EEEECC Q ss_conf 8874189888999998636110112899998988997745303670-798628 Q gi|254781004|r 13 SIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLSDPVREGS-IEIITP 64 (652) Q Consensus 13 ~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~~~l~d~~-i~~i~~ 64 (652) +..+..+|.+..|+.+..+- ......|+|||+..=-..|++|++ ++.+.. T Consensus 13 keiew~~gmkv~dvlr~vgf--ntesaiakvngkvaled~~v~dgdyvevipv 63 (67) T PRK08364 13 KEIEWRKGMKVADVLRAVGF--NTESAIAKVNGRVALEEEEVKDGDYVEVIPV 63 (67) T ss_pred HHHHHHCCCCHHHHHHHHCC--CCHHHHHEECCEEEECCCCCCCCCEEEEEEE T ss_conf 43246508889999998467--6076655037668732476678973899975 No 154 >PRK02645 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=66.74 E-value=9.7 Score=18.05 Aligned_cols=62 Identities=6% Similarity=-0.154 Sum_probs=27.4 Q ss_pred CEEEEECCCCCCCHHHHHHHHCCCCEEEEECHH---HHHCC-CEEEEECCCCCEEEECH---HHHHHHHHHH Q ss_conf 889997589663676665543489889998742---53258-18999889886633429---9999999873 Q gi|254781004|r 580 LSIETDFRNETINYKIREHSIKKIPIIIICGDK---EASER-SIGIRRFGSTTTQKVSL---LDAIPILTKE 644 (652) Q Consensus 580 irV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~k---e~e~~-~Vtir~R~~~~~~~i~l---ee~i~~l~~e 644 (652) .|..+=..+.++--+.. .-..-+..+.+.-+ +++.+ .|.|+.-... ...+.+ .+|.+.|+++ T Consensus 221 ~RPlVlp~~s~I~i~~~--~~~~~~~~l~~DGq~~~~l~~gd~v~I~~s~~~-~~~i~~~~~~~ff~~Lr~K 289 (304) T PRK02645 221 SRPIVIPPTSRVVIWPL--GDYDLNIKLWKDGVLATSIWPGQRCVIQKARHP-AKFIILRESPSYYRTLREK 289 (304) T ss_pred CCCEEECCCCEEEEEEC--CCCCCCEEEEECCCCCEECCCCCEEEEEECCCE-EEEEECCCCCCHHHHHHHH T ss_conf 88889799984999985--689874799988997348599999999988973-4899829999999998756 No 155 >pfam00735 Septin Septin. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP. As regards the septins, these are polypeptides of 30-65kDa with three characteristic GTPase motifs (G-1, G-3 and G-4) that are similar to those of the Ras family. The G-4 motif is strictly conserved with a unique septin consensus of AKAD. Most septins are thought to have at least one coiled-coil region, which in some cases is necessary for intermolecular interactions that allow septins to polymerize to form rod-shaped complexes. In turn, these are arranged into tandem arrays to form filaments. They are multifunctional proteins, with roles in cytokinesis, sporulation, germ cell development, exocytosis and apoptosis. Probab=66.24 E-value=9.9 Score=17.99 Aligned_cols=75 Identities=15% Similarity=0.209 Sum_probs=42.0 Q ss_pred CCCEEEEEEEC---C-----HHHHHHHHHHHHHHHHCCCEEEE--E-CCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCC Q ss_conf 37808999857---9-----66899999999999987988999--7-589663676665543489889998742532581 Q gi|254781004|r 550 LSPIQAIVTTI---T-----SSAVEYAQEIANLLKSHHLSIET--D-FRNETINYKIREHSIKKIPIIIICGDKEASERS 618 (652) Q Consensus 550 LAP~QV~Iipi---~-----e~~~eya~~i~~~L~~~girV~~--D-dr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~ 618 (652) |+.. |.|||| + ++...+=+.|.+.|.++||++.- | +.++.--......-...+|+.||=.....+++. T Consensus 138 Ls~~-vNvIPVIaKADtLT~~El~~~K~~I~~~l~~~~I~i~~f~~~~~d~~e~~~~~~~~~~~~PFaIigS~~~~~~~g 216 (280) T pfam00735 138 LHEK-VNIIPVIAKADTLTPDELQRFKKRIRAEIERQNIKIYHFPDEESDEDEEKEQTEQLKSSIPFAIVGSNEEIENDG 216 (280) T ss_pred HHCC-CCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCEECCCCCCCCCHHHHHHHHHHCCCCCCEEECCCCEEECCC T ss_conf 7144-653666621455899999999999999999859725468885435577899998740579977988885385289 Q ss_pred EEEEECC Q ss_conf 8999889 Q gi|254781004|r 619 IGIRRFG 625 (652) Q Consensus 619 Vtir~R~ 625 (652) -.+|-|. T Consensus 217 ~~vrgR~ 223 (280) T pfam00735 217 KKVRGRK 223 (280) T ss_pred CEEEEEC T ss_conf 7886654 No 156 >PRK00629 pheT phenylalanyl-tRNA synthetase subunit beta; Reviewed Probab=65.81 E-value=6.5 Score=19.27 Aligned_cols=90 Identities=16% Similarity=0.151 Sum_probs=55.6 Q ss_pred HHHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHC Q ss_conf 8999999999998876-896385554324431000112100125554201001456777823774343125777765201 Q gi|254781004|r 273 WKIFQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHG 351 (652) Q Consensus 273 ~~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~ 351 (652) ..-..++.+.+|..+. .||+||.|=.+.+.+..+.-+- .++...... . -..+.=+|||+-=|+.....+.. T Consensus 483 ~~~~~~~~~~ir~~L~~~Gf~EvitySf~s~~~~~~~~~----~~~~v~l~N--P--ls~e~s~lR~SLlp~LL~~~~~N 554 (786) T PRK00629 483 LTEAQKRRRRVRRALAARGYQEAITYSFVSPEVADLFGL----GEDALKLLN--P--ISSELSVMRTSLLPGLLEALAYN 554 (786) T ss_pred CCHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHCC----CCCCEEECC--C--CCCHHHHHHHHHHHHHHHHHHHH T ss_conf 998999999999999978984241366679799986278----877568537--5--42126888887799999989988 Q ss_pred C-CCHHHCCHHHHHHEEEECCC Q ss_conf 2-22231043200010001278 Q gi|254781004|r 352 L-KSYRELPVRLAEFGSVYRNE 372 (652) Q Consensus 352 ~-~SYrdLPlrl~e~~~~~R~E 372 (652) . |.++ .+++||+|.+|... T Consensus 555 ~~r~~~--~i~lFEiG~vf~~~ 574 (786) T PRK00629 555 QNRKNK--DVALFEIGLVFLPG 574 (786) T ss_pred HHCCCC--CEEEEEEEEEECCC T ss_conf 627998--76799875438999 No 157 >PRK09130 NADH dehydrogenase subunit G; Validated Probab=63.85 E-value=11 Score=17.69 Aligned_cols=75 Identities=21% Similarity=0.298 Sum_probs=49.2 Q ss_pred EEEECCCCCEEECCCCCCHHHHHHHHCCCCCC-------------CEEEEEECCE---EEE-CCEEECCCEEEEECCCCH Q ss_conf 08992899887418988899999863611011-------------2899998988---997-745303670798628999 Q gi|254781004|r 5 IKLTFPDGSIKNFPVHATGSDVAESISRSLAK-------------KAVAVAINGK---VCD-LSDPVREGSIEIITPEDP 67 (652) Q Consensus 5 i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~-------------~~vaa~vng~---l~d-L~~~l~d~~i~~i~~~~~ 67 (652) |+||. ||+..+++.|+|+++.|+..+-..+. ...++.|.|. +.- =.+|+.|+-+ |..+|+ T Consensus 2 v~l~I-dg~~~~v~~g~til~A~~~~Gi~IP~fCyh~~L~~~g~CR~ClVeve~~~pk~v~sC~t~v~~gm~--v~t~s~ 78 (680) T PRK09130 2 VKLTI-DGKEIEVPDGYTLLQACEAAGAEIPRFCYHERLSIAGNCRMCLVEVKGGPPKPVASCAMPVGEGMV--IKTNTP 78 (680) T ss_pred CEEEE-CCEEEEECCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCE--EEECCH T ss_conf 58999-999999699996999999759985625689994989855156999739968767310877899999--997999 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999999998999999 Q gi|254781004|r 68 RSLAVIRHSCAHIMA 82 (652) Q Consensus 68 eg~~i~~hS~ahlL~ 82 (652) ...++=+--+-+||+ T Consensus 79 ~v~~~r~~vle~ll~ 93 (680) T PRK09130 79 MVKKAREGVMEFLLI 93 (680) T ss_pred HHHHHHHHHHHHHHH T ss_conf 999999999999983 No 158 >PRK08166 NADH dehydrogenase subunit G; Validated Probab=63.51 E-value=8.2 Score=18.53 Aligned_cols=75 Identities=15% Similarity=0.181 Sum_probs=44.3 Q ss_pred EEEECCCCCEEECCCCCCHHHHHHHHCCCCCC-------------CEEEEEE-----CC--EEEEC-CEEECCCEEEEEC Q ss_conf 08992899887418988899999863611011-------------2899998-----98--89977-4530367079862 Q gi|254781004|r 5 IKLTFPDGSIKNFPVHATGSDVAESISRSLAK-------------KAVAVAI-----NG--KVCDL-SDPVREGSIEIIT 63 (652) Q Consensus 5 i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~-------------~~vaa~v-----ng--~l~dL-~~~l~d~~i~~i~ 63 (652) ++||. ||+..+++.|+|+++.|+..+-..+. ...++.| +| ++.-= .+|+.|+-+ |. T Consensus 2 v~~~I-dg~~~~v~~g~til~aa~~~Gi~IP~~C~~~~l~~~g~CR~ClVev~~~~~~~~~~~~~sC~t~v~~Gm~--V~ 78 (858) T PRK08166 2 ATIHV-DGKEYEVNGADNLLQACLSLGIDIPYFCWHPALGSVGACRQCAVKQYQNPEDTRGRLVMSCMTPATDGTF--IS 78 (858) T ss_pred CEEEE-CCEEEEECCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCE--EE T ss_conf 48999-9999985899969999997599866367999989887772218997226757887647020777899999--99 Q ss_pred CCCHHHHHHHHHHHHHHHH Q ss_conf 8999999999998999999 Q gi|254781004|r 64 PEDPRSLAVIRHSCAHIMA 82 (652) Q Consensus 64 ~~~~eg~~i~~hS~ahlL~ 82 (652) .+|++..++=+--+-+||+ T Consensus 79 T~s~~v~~~r~~~le~ll~ 97 (858) T PRK08166 79 IDDEEAKAFRASVVEWLMT 97 (858) T ss_pred ECCHHHHHHHHHHHHHHHH T ss_conf 7999999999999999984 No 159 >TIGR00499 lysS_bact lysyl-tRNA synthetase; InterPro: IPR002313 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Lysyl-tRNA synthetase (6.1.1.6 from EC) is an alpha 2 homodimer that belong to both class I and class II. In eubacteria and eukaryota lysyl-tRNA synthetases belong to class II in the same family as aspartyl tRNA synthetase. The class Ic lysyl-tRNA synthetase family is present in archaea and some eubacteria . Moreover in some eubacteria there is a gene X, which is similar to a part of lysyl-tRNA synthetase from class II. Lysyl-tRNA synthetase is duplicated in some species with, for example in E. coli, as a constitutive gene (lysS) and an induced one (lysU). No residues are directly involved in catalysis, but a number of highly conserved amino acids and three metal ions coordinate the substrates and stabilise the pentavalent transition state. Lysine is activated by being attached to the alpha-phosphate of AMP before being transferred to the cognate tRNA. The refined crystal structures give "snapshots" of the active site corresponding to key steps in the aminoacylation reaction and provide the structural framework for understanding the mechanism of lysine activation. The active site of LysU is shaped to position the substrates for the nucleophilic attack of the lysine carboxylate on the ATP alpha-phosphate. No residues are directly involved in catalysis, but a number of highly conserved amino acids and three metal ions coordinate the substrates and stabilise the pentavalent transition state. A loop close to the catalytic pocket, disordered in the lysine-bound structure, becomes ordered upon adenine binding .; GO: 0000166 nucleotide binding, 0004824 lysine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006430 lysyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=62.92 E-value=11 Score=17.58 Aligned_cols=109 Identities=16% Similarity=0.126 Sum_probs=61.5 Q ss_pred CHHHHHHHHHHHHHHHH-HCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHH Q ss_conf 07899999999999887-68963855543244310001121001255542010014567778237743431257777652 Q gi|254781004|r 271 KGWKIFQTLISYMRRKI-KDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFN 349 (652) Q Consensus 271 ~G~~i~~~ie~~ir~~~-~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~ 349 (652) .-...+..+...+|..+ .+|+.||.||.+....--....- |. ...+ ..+-++.++-.++-..-.+.- T Consensus 188 ~~~~~~~~~~~~~~~~~~~~g~~e~e~p~~~~~~gg~~~~p--------f~--~~~n--~~~~~~~~~~~~e~~l~~l~~ 255 (538) T TIGR00499 188 DTFLKRSKIIAAIRRFLDDRGFLEVETPLLQVIPGGANARP--------FI--THHN--ALDLDLYLRIAPELYLKRLVV 255 (538) T ss_pred HHHHHHHHHHHHHHHHHHHCCCEEECCCHHHCCCCCCCCCC--------CE--ECCC--CCCCCHHHHHHHHHHHHHHHH T ss_conf 67788999999999987624842211310101267654465--------01--0023--224421211225778877764 Q ss_pred HCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEE-ECCHHHHHH Q ss_conf 01222231043200010001278764520001056353344210-105376788 Q gi|254781004|r 350 HGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHV-FCTKEQMFN 402 (652) Q Consensus 350 ~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~-F~~~eq~~~ 402 (652) . -+ -++|++|..||+|--. ...--+|++-+.|- +...+.+.. T Consensus 256 g------g~-~~~~e~g~~f~neg~~----~~hnpef~~~e~y~~~~~~~~~~~ 298 (538) T TIGR00499 256 G------GL-EKVYEIGRNFRNEGVD----TTHNPEFTSLEFYQAYADYEDLLD 298 (538) T ss_pred C------CH-HHHHHHHHHHHCCCCC----CCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 0------13-6778764333104665----445753024556776510889999 No 160 >pfam00568 WH1 WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGluR5 bind a protein called homer, which is a WH1 domain homologue. A subset of WH1 domains has been termed a "EVH1" domain and appear to bind a polyproline motif. Probab=61.86 E-value=12 Score=17.46 Aligned_cols=29 Identities=17% Similarity=0.432 Sum_probs=25.0 Q ss_pred CCCCCCCCCEEEEEEEEEECCHHHHHHHHHHHHHH Q ss_conf 01577564206899999837988999999999975 Q gi|254781004|r 207 YWRGDSNRPMLSRIYGTAWNTQQELTQYLYFLEES 241 (652) Q Consensus 207 yw~gd~~~~~l~Riyg~af~~~~~l~~~~~~~eea 241 (652) -|+||. .+||..|.++++.+++.+.++|+ T Consensus 82 ~f~~~~------~~~GLnFase~EA~~F~~~v~~~ 110 (111) T pfam00568 82 TFADSR------CVYGLNFASEEEATKFAKAVQEA 110 (111) T ss_pred EEECCC------EEEEEECCCHHHHHHHHHHHHHH T ss_conf 988487------17986019999999999999854 No 161 >PRK03708 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=61.34 E-value=12 Score=17.39 Aligned_cols=61 Identities=18% Similarity=0.100 Sum_probs=25.9 Q ss_pred CEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHH---HH-CCCEEEEECCCCCE-EEECHHHHHHHHHHH Q ss_conf 8899975896636766655434898899987425---32-58189998898866-334299999999873 Q gi|254781004|r 580 LSIETDFRNETINYKIREHSIKKIPIIIICGDKE---AS-ERSIGIRRFGSTTT-QKVSLLDAIPILTKE 644 (652) Q Consensus 580 irV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke---~e-~~~Vtir~R~~~~~-~~i~lee~i~~l~~e 644 (652) .|..+-..+.++--++... +-+..++++-+. ++ ...|+|+.-....+ ...+ +++.++++++ T Consensus 208 ~RplVv~~~~~I~i~~~~~---~~~~~l~~DG~~~~~l~~gd~v~I~~s~~~~~lir~~-~~~y~~l~~K 273 (278) T PRK03708 208 SRPMVVPGSSRIDVKFLAK---GREIILAIDGQYYEELSPETEITIVKSPRKTKFVRFT-KEIYPKYTMK 273 (278) T ss_pred CCCEEECCCCEEEEEEECC---CCCEEEEECCCCEEECCCCCEEEEEECCCEEEEEEEC-CCCHHHHHHH T ss_conf 8988989998799999168---9847999879730673999999999889748999978-9755899876 No 162 >PRK03372 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=61.21 E-value=12 Score=17.37 Aligned_cols=64 Identities=9% Similarity=0.091 Sum_probs=33.6 Q ss_pred CEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHH---HH-CCCEEEEECCCC-CEEEECHHHHHHHHHHHCC Q ss_conf 8899975896636766655434898899987425---32-581899988988-6633429999999987314 Q gi|254781004|r 580 LSIETDFRNETINYKIREHSIKKIPIIIICGDKE---AS-ERSIGIRRFGST-TTQKVSLLDAIPILTKESL 646 (652) Q Consensus 580 irV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke---~e-~~~Vtir~R~~~-~~~~i~lee~i~~l~~e~~ 646 (652) .|..+-..+.++--++.... -+..+++.-++ ++ ...|.|+.-... ....++-..|.+.|+++-. T Consensus 224 ~RplVl~~~~~i~i~v~~~~---~~~~l~~DGq~~~~l~~gd~v~i~~s~~~~~lir~~~~~f~~~Lr~Kl~ 292 (303) T PRK03372 224 ARPLVVSPESTVAVEILPDT---HDAVLWCDGRREVDLPPGARVEVRRGATPVRLARLDSAPFTDRLVRKFR 292 (303) T ss_pred CCCEEECCCCEEEEEECCCC---CCEEEEECCCCCEECCCCCEEEEEECCCEEEEEEECCCCHHHHHHHHCC T ss_conf 89889899986999980699---8569998798615649999999998898089998089897999999728 No 163 >CHL00192 syfB phenylalanyl-tRNA synthetase beta chain; Provisional Probab=61.03 E-value=9.6 Score=18.09 Aligned_cols=88 Identities=13% Similarity=0.145 Sum_probs=52.6 Q ss_pred HHHHHHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHH Q ss_conf 78999999999998876-89638555432443100011210012555420100145677782377434312577776520 Q gi|254781004|r 272 GWKIFQTLISYMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNH 350 (652) Q Consensus 272 G~~i~~~ie~~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~ 350 (652) ...-.+++.+-+|..+. .||+|+.|=.+.+.+-. .++...... .-..+.=.|||+-=|+.....+. T Consensus 392 ~~~~~~~~~~~lr~~L~~~Gf~E~itysf~s~~~~---------~~~~v~l~N----Pis~e~s~LR~sLlpgLL~~~~~ 458 (702) T CHL00192 392 RLSFLRNFIDKIRSLLRNLGLTELVHYSLVKSYTY---------YKGEIKLYN----PLLQDYSTLRSSLLPGLILAQQY 458 (702) T ss_pred CCCHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHC---------CCCCEEECC----CCCHHHHHHHHHHHHHHHHHHHH T ss_conf 88988999999999999689834775314684445---------677289708----75243788888779999999986 Q ss_pred CCCCHHHCCHHHHHHEEEECCCC Q ss_conf 12222310432000100012787 Q gi|254781004|r 351 GLKSYRELPVRLAEFGSVYRNEP 373 (652) Q Consensus 351 ~~~SYrdLPlrl~e~~~~~R~E~ 373 (652) .. ++..-.+++||+|.+|.... T Consensus 459 N~-~r~~~~i~lFEiG~vf~~~~ 480 (702) T CHL00192 459 NL-KQSNQTIEGFEIGHVFRDNT 480 (702) T ss_pred HH-CCCCCCEEEEEEECCCCCCC T ss_conf 15-35997742897645526887 No 164 >PRK03147 thiol-disulfide oxidoreductase; Provisional Probab=59.87 E-value=13 Score=17.22 Aligned_cols=79 Identities=8% Similarity=0.119 Sum_probs=45.8 Q ss_pred EEEEEECC-HHHHHHHHHHHHHHHHC--CCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHHHHHCCCEEEEECCCCCEE Q ss_conf 89998579-66899999999999987--9889997589663676665543489889998742532581899988988663 Q gi|254781004|r 554 QAIVTTIT-SSAVEYAQEIANLLKSH--HLSIETDFRNETINYKIREHSIKKIPIIIICGDKEASERSIGIRRFGSTTTQ 630 (652) Q Consensus 554 QV~Iipi~-e~~~eya~~i~~~L~~~--girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~ 630 (652) .|.|+.|+ ++..+ .+.+-+.+. .+.+.+|..+ .++.++ ...++|..++|+++ |.|.-+..+ ... T Consensus 95 ~v~vi~i~~d~~~~---~v~~f~~~~~~~~pv~~D~~~-~~~~~~---~v~~~P~t~lId~~----G~I~~~~~G--~i~ 161 (176) T PRK03147 95 GVEIIAVNVDETDI---AVKNFVNQYGLKFPVAIDKGR-QVIDAY---GVGPLPTTFLIDKD----GKVVKVITG--EMT 161 (176) T ss_pred CEEEEEECCCCCHH---HHHHHHHHCCCCCEEEECCCC-HHHHHC---CCCCCCEEEEECCC----CEEEEEEEC--CCC T ss_conf 44785220788788---898889870996228987973-589876---99988869999799----979999978--999 Q ss_pred EECHHHHHHHHHHHC Q ss_conf 342999999998731 Q gi|254781004|r 631 KVSLLDAIPILTKES 645 (652) Q Consensus 631 ~i~lee~i~~l~~e~ 645 (652) .-.+++.++.|++|+ T Consensus 162 ~~~l~~~i~~lk~e~ 176 (176) T PRK03147 162 EEMLEEYLNKIKPEA 176 (176) T ss_pred HHHHHHHHHHHHCCC T ss_conf 999999999874469 No 165 >cd00837 EVH1 EVH1 (Enabled, Vasp-Homology) or WASP Homology (WH1) domain. The EVH1 domain binds to other proteins at proline rich sequences in either FPPPP or PPXXF motifs. It is found in the cytoskeletal reorganization proteins Enabled VASP, and WASP, and in the synaptic scaffolding protein Homer. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains. Probab=59.45 E-value=13 Score=17.17 Aligned_cols=29 Identities=24% Similarity=0.546 Sum_probs=25.2 Q ss_pred CCCCCCCCCEEEEEEEEEECCHHHHHHHHHHHHHH Q ss_conf 01577564206899999837988999999999975 Q gi|254781004|r 207 YWRGDSNRPMLSRIYGTAWNTQQELTQYLYFLEES 241 (652) Q Consensus 207 yw~gd~~~~~l~Riyg~af~~~~~l~~~~~~~eea 241 (652) .|++|. ++||..|.+.++.+++.+.+.+| T Consensus 75 ~f~~d~------~~~GLnFase~EA~~F~~~v~~a 103 (104) T cd00837 75 QWEDDN------CVYGLNFASEEEAAQFRKKVLEA 103 (104) T ss_pred EEECCC------EEEEEEECCHHHHHHHHHHHHHH T ss_conf 997188------78998439999999999999855 No 166 >COG1163 DRG Predicted GTPase [General function prediction only] Probab=58.28 E-value=13 Score=17.10 Aligned_cols=49 Identities=12% Similarity=0.218 Sum_probs=28.5 Q ss_pred HHHHHHHHHHHHHC---------CCCCCCCCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEE Q ss_conf 37899999999738---------708873378089998579668999999999999879889 Q gi|254781004|r 530 SIERFIGIMIENFK---------GNLPLWLSPIQAIVTTITSSAVEYAQEIANLLKSHHLSI 582 (652) Q Consensus 530 s~eR~ia~liE~~~---------g~~P~wLAP~QV~Iipi~e~~~eya~~i~~~L~~~girV 582 (652) .++++.--+-...+ |.-|-.=-| .|+.-+..+.+.|++|...|.++ ||- T Consensus 276 nld~L~e~i~~~L~liRVYtK~~g~~pd~~~P---lIlr~GsTV~Dvc~~IH~~l~~~-Fry 333 (365) T COG1163 276 NLDELKERIWDVLGLIRVYTKPPGEEPDFDEP---LILRRGSTVGDVCRKIHRDLVEN-FRY 333 (365) T ss_pred CHHHHHHHHHHHHCEEEEEECCCCCCCCCCCC---EEEECCCCHHHHHHHHHHHHHHH-CCE T ss_conf 98899999998748189984589998888898---58727980999999998999974-556 No 167 >PRK01777 hypothetical protein; Validated Probab=58.13 E-value=14 Score=17.02 Aligned_cols=63 Identities=22% Similarity=0.366 Sum_probs=37.2 Q ss_pred CCCCEEEE----CCCCC---EEECCCCCCHHHHHHH--HC---CCCCCCEEEEEECCEEEECCEEECCC-EEEEECC Q ss_conf 98870899----28998---8741898889999986--36---11011289999898899774530367-0798628 Q gi|254781004|r 1 MPSDIKLT----FPDGS---IKNFPVHATGSDVAES--IS---RSLAKKAVAVAINGKVCDLSDPVREG-SIEIITP 64 (652) Q Consensus 1 M~~~i~I~----lpDG~---~~~~~~g~t~~dia~~--i~---~~~~~~~vaa~vng~l~dL~~~l~d~-~i~~i~~ 64 (652) |+ +|+|+ +||.. ..+++.|+|+.|.... |. +.+.-.....=|=|++..|+++|.++ .||+.-. T Consensus 1 m~-~i~VeV~YA~p~~Q~~~~l~v~~GtTv~~Ai~~Sgi~~~fpeidl~~~~vGIfgk~~~ld~~L~~GDRVEIYRP 76 (95) T PRK01777 1 MG-KIRVEVVYALPERQYLQRLTLQEGATVEEAIRASGLLELRTDIDLAKNKVGIFSRPVKLTDVLRDGDRVEIYRP 76 (95) T ss_pred CC-EEEEEEEEECCCEEEEEEEECCCCCCHHHHHHHCCCHHHCCCCCCCCCCEEEECCCCCCCCCCCCCCEEEEECC T ss_conf 97-50899999889827999998699993999999829654487655354745078848388772899999998536 No 168 >PRK11872 antC anthranilate dioxygenase reductase; Provisional Probab=57.03 E-value=14 Score=16.90 Aligned_cols=35 Identities=23% Similarity=0.202 Sum_probs=27.2 Q ss_pred CCCCEEEECCCCCEEEC--CCCCCHHHHHHHHCCCCC Q ss_conf 98870899289988741--898889999986361101 Q gi|254781004|r 1 MPSDIKLTFPDGSIKNF--PVHATGSDVAESISRSLA 35 (652) Q Consensus 1 M~~~i~I~lpDG~~~~~--~~g~t~~dia~~i~~~~~ 35 (652) |+-+|+|.|+||..+++ ..+-|++|-|..-+-.+. T Consensus 1 m~~~v~~~~~dg~~~~~~~~~~etiL~aal~~Gi~lp 37 (340) T PRK11872 1 MNHKVALSFADGKTLFIPVNHDELLLDAALRQGINLP 37 (340) T ss_pred CCEEEEEEECCCCEEEEEECCCCHHHHHHHHCCCCCC T ss_conf 9517999967998799983899939999997699863 No 169 >cd04938 TGS_Obg-like TGS_Obg-like: The C-terminal TGS domain of Obg-like GTPases such as those present in DRG (developmentally regulated GTP-binding protein), and GTP-binding proteins Ygr210 and YchF. The TGS domain (named after the ThrRS, GTPase, and SpoT proteins where it occurs) is a small domain of about 50 amino acid residues with a predominantly beta-sheet structure. There is no direct information on the function of the TGS domain, but its presence in two types of regulatory proteins (the GTPases and guanosine polyphosphate phosphohydrolases/synthetases) suggests a ligand (most likely nucleotide)-binding, regulatory role. Probab=56.42 E-value=6.5 Score=19.23 Aligned_cols=51 Identities=20% Similarity=0.200 Sum_probs=32.2 Q ss_pred CCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCE--EEECCEEECCCE-EEEE Q ss_conf 28998874189888999998636110112899998988--997745303670-7986 Q gi|254781004|r 9 FPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGK--VCDLSDPVREGS-IEII 62 (652) Q Consensus 9 lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~--l~dL~~~l~d~~-i~~i 62 (652) ||| ..-++.|+|+.|+|..|...+.+..+-|. ||+ ..+=.+.++|++ |.|. T Consensus 22 lpD--~~Lv~~Gst~~d~A~~ih~d~~~~fi~av-~g~~r~eg~d~~l~dgDIIkf~ 75 (76) T cd04938 22 FRD--CVLVKKGTTVGDVARKIHGDLEKGFIEAV-GGRRRLEGKDVILGKNDILKFK 75 (76) T ss_pred CCC--EEEECCCCCHHHHHHHHHHHHHHHHHHHH-HCEEEECCCCEEECCCCEEEEE T ss_conf 883--69987989789999998767788899886-2537954773660789999984 No 170 >COG1854 LuxS LuxS protein involved in autoinducer AI2 synthesis [Signal transduction mechanisms] Probab=56.05 E-value=15 Score=16.79 Aligned_cols=77 Identities=18% Similarity=0.286 Sum_probs=51.4 Q ss_pred ECCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHCCCC--EEEEECC-CCCCEEEEEECCCCCCHHHHHHHHHHHHHHH Q ss_conf 036707986289999999999989999999999977998--8997243-5885578840888888989999999999999 Q gi|254781004|r 54 VREGSIEIITPEDPRSLAVIRHSCAHIMAEAVQSIWPEV--QVAIGPI-IEDGFYYDFDKEQPFSSDELAQIEKKMQEII 130 (652) Q Consensus 54 l~d~~i~~i~~~~~eg~~i~~hS~ahlL~~Av~~l~p~~--kl~igp~-i~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii 130 (652) |..=+|+|.-.+-+-=-..=-||+-||+|.-+++---+. -+-++|- -..|||.-+-+... +.+-+..+++-|+++. T Consensus 33 It~fDiRf~qPNke~m~~~~iHTlEHL~A~~iR~h~~g~~~iID~SPMGCrTGFYm~l~G~~~-~~~i~~~~~~~m~dvl 111 (161) T COG1854 33 ITKFDIRFCQPNKEIMPPAGIHTLEHLLAGFIRNHLNGNVEIIDISPMGCRTGFYMILIGTPT-SQDIADVLEATMKDVL 111 (161) T ss_pred EEEEEEEEECCCHHHCCCCCHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCEEEEEECCCC-HHHHHHHHHHHHHHHH T ss_conf 898678873687464886213069999999998524574469984276555324999977987-7999999999999997 Q ss_pred H Q ss_conf 6 Q gi|254781004|r 131 A 131 (652) Q Consensus 131 ~ 131 (652) + T Consensus 112 ~ 112 (161) T COG1854 112 K 112 (161) T ss_pred C T ss_conf 1 No 171 >PRK05265 pyridoxine 5'-phosphate synthase; Provisional Probab=55.01 E-value=15 Score=16.68 Aligned_cols=45 Identities=16% Similarity=0.262 Sum_probs=31.0 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCEEEEEEECC------------HHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 7899999999738708873378089998579------------66899999999999987988999 Q gi|254781004|r 531 IERFIGIMIENFKGNLPLWLSPIQAIVTTIT------------SSAVEYAQEIANLLKSHHLSIET 584 (652) Q Consensus 531 ~eR~ia~liE~~~g~~P~wLAP~QV~Iipi~------------e~~~eya~~i~~~L~~~girV~~ 584 (652) .+.|+.+.++ +-|.||+++|-+ .+..+.-.++-+.|+++|+||-+ T Consensus 75 t~e~i~ia~~---------~kP~qvtLVPe~r~e~TTegGld~~~~~~~L~~~i~~lk~~gIrvSL 131 (240) T PRK05265 75 TEEMLDIALE---------IKPHQVTLVPEKREELTTEGGLDVAGQFDKLKPAIARLKDAGIRVSL 131 (240) T ss_pred CHHHHHHHHH---------CCCCEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEE T ss_conf 1889999998---------49985998889988626788937765789999999999865981799 No 172 >KOG2784 consensus Probab=54.72 E-value=3.1 Score=21.47 Aligned_cols=139 Identities=14% Similarity=0.200 Sum_probs=68.7 Q ss_pred CCEEEEECCHHHHHHHHHHHHHHH-HHCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCC- Q ss_conf 743565010789999999999988-768963855543244310001121001255542010014567778237743431- Q gi|254781004|r 263 SGVIFWHRKGWKIFQTLISYMRRK-IKDDYEEINTPQVLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMN- 340 (652) Q Consensus 263 ~G~~~wlP~G~~i~~~ie~~ir~~-~~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmn- 340 (652) |-.-..+|+ .++-.--|.- +.-||.|.-|- -|.+||-|.. +-+|.|+-....|.++--|..-|.. T Consensus 206 p~~G~lHPL-----mKvR~eFRqiF~emGFsEMptn------~yVEssFWNF--DALfqPQqHpARDahDTFfl~~Pa~s 272 (483) T KOG2784 206 PSSGHLHPL-----MKVREEFRQIFFEMGFSEMPTN------NYVESSFWNF--DALFQPQQHPARDAHDTFFLKDPATS 272 (483) T ss_pred CCCCCCCHH-----HHHHHHHHHHHHHCCCCCCCCC------CCHHHCCCCC--HHHCCCCCCCCCCCCCCEEECCHHHC T ss_conf 977765548-----8889999999998062306654------4011024453--00047555875333452476371330 Q ss_pred -------------------------------HHHHHHHHHHCCC--CHHH---------CCHHHHHHEEEECCCCCCCCE Q ss_conf -------------------------------2577776520122--2231---------043200010001278764520 Q gi|254781004|r 341 -------------------------------CPGHVAVFNHGLK--SYRE---------LPVRLAEFGSVYRNEPSGSLH 378 (652) Q Consensus 341 -------------------------------cp~h~~i~~~~~~--SYrd---------LPlrl~e~~~~~R~E~sg~l~ 378 (652) ++..-.+.|.+.. |-|. -|-+++-|..+||||.-.+ T Consensus 273 ~~~p~dY~~rVk~vH~~G~ygs~GY~y~wk~eEaqKnvLRTHTTavSArmLy~LAk~~f~p~K~FSIDrVFRNEtvDa-- 350 (483) T KOG2784 273 TKFPEDYLERVKAVHEQGGYGSIGYRYNWKLEEAQKNVLRTHTTAVSARMLYRLAKKGFKPAKYFSIDRVFRNETVDA-- 350 (483) T ss_pred CCCCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHCCCCCH-- T ss_conf 369899999999887168867765678889899987777650477649999999747998511010413440343205-- Q ss_pred EEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 00105635334421010537678878999999887665521332 Q gi|254781004|r 379 GLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIISIYKDFGFEK 422 (652) Q Consensus 379 GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~v~~~~g~~~ 422 (652) +-.-||.|-++-+-..- -..-..+......+..+|... T Consensus 351 --THLAEFHQVEGviad~g----ltLgdLig~l~~ff~~lg~tn 388 (483) T KOG2784 351 --THLAEFHQVEGVIADKG----LTLGDLIGILMEFFTKLGATN 388 (483) T ss_pred --HHHHHHHHHCEEEECCC----CCHHHHHHHHHHHHHCCCCCC T ss_conf --77877763203651378----769999999999973268766 No 173 >PRK03604 moaC bifunctional molybdenum cofactor biosynthesis protein C/molybdopterin-binding protein; Provisional Probab=53.68 E-value=12 Score=17.42 Aligned_cols=75 Identities=15% Similarity=0.095 Sum_probs=53.6 Q ss_pred CCCCCCCCEEEEEEECCHHH------HHHHHHHHHHHHHCCCEE-----EEECCCCCCCHHHHHHHHCCCCEEEEECHHH Q ss_conf 08873378089998579668------999999999999879889-----9975896636766655434898899987425 Q gi|254781004|r 545 NLPLWLSPIQAIVTTITSSA------VEYAQEIANLLKSHHLSI-----ETDFRNETINYKIREHSIKKIPIIIICGDKE 613 (652) Q Consensus 545 ~~P~wLAP~QV~Iipi~e~~------~eya~~i~~~L~~~girV-----~~Ddr~~~~G~Kir~a~l~giP~~ivIG~ke 613 (652) .|-.-+.|..++||.+|+.. ..-..-+.+.|.++|+.| .=|++ +.+...++.|--.|+-++++.|--. T Consensus 138 ~w~~~~~~~~aAViTvSD~~s~G~reD~SG~~l~e~L~~~G~~V~~~~vVPDd~-~~I~~~l~~~~~~~~dlIlTtGGTG 216 (301) T PRK03604 138 HKRKARYGPSAAVLVLSDSIAAGTKEDRSGKLLVEGLEEAGCEVSHYQIIPDEP-EEIAAAVQAWIAEGVELIITTGGTG 216 (301) T ss_pred CCCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHCCCCCCEEEEECCCH-HHHHHHHHHHHHCCCCEEEEECCCC T ss_conf 745567898669999627313797768456999999998698103028968988-9999999999977997999708977 Q ss_pred HHCCCEE Q ss_conf 3258189 Q gi|254781004|r 614 ASERSIG 620 (652) Q Consensus 614 ~e~~~Vt 620 (652) +...-|| T Consensus 217 ~s~RDvT 223 (301) T PRK03604 217 LGPRDVT 223 (301) T ss_pred CCCCCCC T ss_conf 8999788 No 174 >cd01207 Ena-Vasp Enabled-VASP-type homology (EVH1) domain. The EVH1 domain binds to other proteins at proline rich sequences. It is found in proteins involved in cytoskeletal reorganization such as Enabled and VASP. Ena-VASP type EVH1 domains specifically recognize FPPPP motifs in the focal adhesion proteins zyxin and vinculin, and the ActA surface protein of Listeria monocytogenes. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains. Probab=53.22 E-value=16 Score=16.49 Aligned_cols=40 Identities=15% Similarity=0.414 Sum_probs=28.6 Q ss_pred HHHHHHHHH--CCCCCCCCCEEEEEEEEEECCHHHHHHHHHHHHHHHH Q ss_conf 333332111--0157756420689999983798899999999997532 Q gi|254781004|r 198 FKLMKVAGA--YWRGDSNRPMLSRIYGTAWNTQQELTQYLYFLEESEK 243 (652) Q Consensus 198 fkL~~~sga--yw~gd~~~~~l~Riyg~af~~~~~l~~~~~~~eea~~ 243 (652) ++-++.+.- -|+-+. ++||..|.+++++.+|.+-+.+|-+ T Consensus 67 l~Y~kat~~FHqWrd~r------~v~GL~F~S~~dA~~F~~~v~~Ale 108 (111) T cd01207 67 LKYNQATPTFHQWRDAR------QVYGLNFGSKEDATMFASAMLSALE 108 (111) T ss_pred CEEEECCCCCEEEECCC------EEEECCCCCHHHHHHHHHHHHHHHH T ss_conf 77875477620227388------8962045999999999999999987 No 175 >cd01990 Alpha_ANH_like_I This is a subfamily of Adenine nucleotide alpha hydrolases superfamily. Adenine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins probably binds ATP. This domain is about 200 amino acids long with a strongly conserved motif SGGKD at the N terminus. Probab=52.54 E-value=14 Score=17.04 Aligned_cols=59 Identities=20% Similarity=0.246 Sum_probs=44.0 Q ss_pred HHHHH--HHHHHHHHHHCCEEEECC-CCCCEEEEECCHHHHHHHHHHHHHH-H--HH-CCCEEEEC Q ss_conf 99753--200887541144044226-7874356501078999999999998-8--76-89638555 Q gi|254781004|r 238 LEESE--KRDHRKLAREMDLFHIAE-DGSGVIFWHRKGWKIFQTLISYMRR-K--IK-DDYEEINT 296 (652) Q Consensus 238 ~eea~--~rdHr~lg~~~~lf~~~~-~~~G~~~wlP~G~~i~~~ie~~ir~-~--~~-~G~~eV~t 296 (652) +.|+. |.|=|.|+++|+|-..++ .+|++-.=.|.|..|-...-+.+++ | ++ .|+..|.- T Consensus 127 L~e~gl~K~eVR~la~~lgLp~~~kp~~~CLasRip~g~~it~e~L~~v~~aE~~L~~~G~~~~Rv 192 (202) T cd01990 127 LAEAGLGKAEIRELARELGLPTWDKPAMACLASRIPYGTEITEERLKKVEAAEEFLRSLGFREVRV 192 (202) T ss_pred HHHCCCCHHHHHHHHHHCCCCCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEE T ss_conf 265197889999999983998566899875466657888489999999999999999819982789 No 176 >PRK04759 consensus Probab=51.48 E-value=17 Score=16.31 Aligned_cols=40 Identities=13% Similarity=0.130 Sum_probs=20.4 Q ss_pred CEEEEECHHH---HHC-CCEEEEECCCCCEEEE--CHHHHHHHHHHH Q ss_conf 8899987425---325-8189998898866334--299999999873 Q gi|254781004|r 604 PIIIICGDKE---ASE-RSIGIRRFGSTTTQKV--SLLDAIPILTKE 644 (652) Q Consensus 604 P~~ivIG~ke---~e~-~~Vtir~R~~~~~~~i--~lee~i~~l~~e 644 (652) +..+.+.-+. ++. +.|.|+.-...- ..+ +-.++.+.|+++ T Consensus 241 ~~~l~~DG~~~~~l~~gd~I~I~~s~~~~-~li~~~~~~ff~~Lr~K 286 (294) T PRK04759 241 TQEVSCDGQVSLPVSPGDEIHIYQSPNVL-KLIHPKDYSYYHVLRNK 286 (294) T ss_pred CEEEEECCCCCEECCCCCEEEEEECCCEE-EEEECCCCCHHHHHHHH T ss_conf 57999999975553999999999889658-99956999989999756 No 177 >cd01850 CDC_Septin CDC/Septin. Septins are a conserved family of GTP-binding proteins associated with diverse processes in dividing and non-dividing cells. They were first discovered in the budding yeast S. cerevisiae as a set of genes (CDC3, CDC10, CDC11 and CDC12) required for normal bud morphology. Septins are also present in metazoan cells, where they are required for cytokinesis in some systems, and implicated in a variety of other processes involving organization of the cell cortex and exocytosis. In humans, 12 septin genes generate dozens of polypeptides, many of which comprise heterooligomeric complexes. Since septin mutants are commonly defective in cytokinesis and formation of the neck formation of the neck filaments/septin rings, septins have been considered to be the primary constituents of the neck filaments. Septins belong to the GTPase superfamily for their conserved GTPase motifs and enzymatic activities. Probab=50.59 E-value=18 Score=16.21 Aligned_cols=67 Identities=18% Similarity=0.296 Sum_probs=40.4 Q ss_pred EEEEEEC---C-----HHHHHHHHHHHHHHHHCCCEEE---EECCCC---CCCHHHHHHHHCCCCEEEEECHHHHHCCCE Q ss_conf 8999857---9-----6689999999999998798899---975896---636766655434898899987425325818 Q gi|254781004|r 554 QAIVTTI---T-----SSAVEYAQEIANLLKSHHLSIE---TDFRNE---TINYKIREHSIKKIPIIIICGDKEASERSI 619 (652) Q Consensus 554 QV~Iipi---~-----e~~~eya~~i~~~L~~~girV~---~Ddr~~---~~G~Kir~a~l~giP~~ivIG~ke~e~~~V 619 (652) .|.|||| + ++...+=+.|.+.|.++||++. .|+.++ ....++ ...+|+.||-.....+++.- T Consensus 143 ~vNvIPVIaKADtlT~~El~~~K~~I~~~l~~~~I~if~~~~~~~~~~~~~~~~~i----~~~~PFaIi~S~~~~~~~g~ 218 (276) T cd01850 143 RVNIIPVIAKADTLTPEELKEFKQRIMEDIEEHNIKIYKFPEDEEDDETIEENKKL----RSLIPFAVVGSNEEVEVNGK 218 (276) T ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCEECCCCCCCCHHHHHHHHHH----HCCCCCEEECCCCEEEECCC T ss_conf 56456678516669999999999999999998596355798776528899877777----44798089867862750892 Q ss_pred EEEEC Q ss_conf 99988 Q gi|254781004|r 620 GIRRF 624 (652) Q Consensus 620 tir~R 624 (652) .+|-| T Consensus 219 ~vrgR 223 (276) T cd01850 219 KVRGR 223 (276) T ss_pred EEEEE T ss_conf 88776 No 178 >TIGR00326 eubact_ribD riboflavin biosynthesis protein RibD; InterPro: IPR004794 This entry describes the riboflavin biosynthesis protein (ribD) as found in Escherichia coli. The N-terminal domain includes the conserved zinc-binding site region that is shared by proteins such as cytosine deaminase, mammalian apolipoprotein B mRNA editing protein, blasticidin-S deaminase, and Bacillus subtilis competence protein comEB. The C-terminal domain is homologous to the full length of yeast HTP reductase, a protein required for riboflavin biosynthesis. A number of archaeal proteins that may be related to the riboflavin biosynthesis protein contain only the C-terminal domain.; GO: 0008703 5-amino-6-(5-phosphoribosylamino)uracil reductase activity, 0008835 diaminohydroxyphosphoribosylaminopyrimidine deaminase activity, 0009231 riboflavin biosynthetic process. Probab=50.57 E-value=9.9 Score=17.97 Aligned_cols=20 Identities=10% Similarity=0.318 Sum_probs=12.9 Q ss_pred CCEEEECHHHHHHHHHHHCC Q ss_conf 86633429999999987314 Q gi|254781004|r 627 TTTQKVSLLDAIPILTKESL 646 (652) Q Consensus 627 ~~~~~i~lee~i~~l~~e~~ 646 (652) .+++.+++.+++..|-++.+ T Consensus 298 le~~~~~~~~~~~~L~~r~~ 317 (393) T TIGR00326 298 LEKERLTIEEVMEQLGKRGI 317 (393) T ss_pred CCCCCCCHHHHHHHHHHCCC T ss_conf 22354488999999874475 No 179 >COG3222 Uncharacterized protein conserved in bacteria [Function unknown] Probab=50.48 E-value=16 Score=16.46 Aligned_cols=13 Identities=8% Similarity=0.194 Sum_probs=7.0 Q ss_pred HHHHHHCCCEEEE Q ss_conf 9999987988999 Q gi|254781004|r 572 ANLLKSHHLSIET 584 (652) Q Consensus 572 ~~~L~~~girV~~ 584 (652) .+.|+++|+++.. T Consensus 172 l~~lrqng~~~~l 184 (211) T COG3222 172 LKALRQNGIDVYL 184 (211) T ss_pred HHHHHHCCCCCCC T ss_conf 9999974996113 No 180 >pfam02590 SPOUT_MTase Predicted SPOUT methyltransferase. This family of proteins are predicted to be SPOUT methyltransferases. Probab=49.89 E-value=18 Score=16.14 Aligned_cols=84 Identities=15% Similarity=0.188 Sum_probs=60.3 Q ss_pred CCHH-HHHHHHHHHHHHCCCCCCCCCCEEEEEEECC---H------HHHHHHHHHHHHHHHCCCEEEEECCCCC-----C Q ss_conf 4103-7899999999738708873378089998579---6------6899999999999987988999758966-----3 Q gi|254781004|r 527 VFGS-IERFIGIMIENFKGNLPLWLSPIQAIVTTIT---S------SAVEYAQEIANLLKSHHLSIETDFRNET-----I 591 (652) Q Consensus 527 ~~Gs-~eR~ia~liE~~~g~~P~wLAP~QV~Iipi~---e------~~~eya~~i~~~L~~~girV~~Ddr~~~-----~ 591 (652) +.|. -..++..+++.|.-++|. -.++++.-++-. . .....++.+...+.+..+-|.+|.++.. + T Consensus 7 ~iGk~~~~~~~~~~~~Y~kRl~~-~~~i~~iel~~~~~~~~~~~~~~~~~E~~~i~~~i~~~~~~I~LDe~Gk~~sS~~f 85 (155) T pfam02590 7 AVGKLKEKYVKDGIAEYLKRLSR-YCKLELIELPDEKRPSAADIEAIKEKEGERILAAIPPGSYVIALDERGKELSSEEF 85 (155) T ss_pred EECCCCCHHHHHHHHHHHHHHCC-CCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCHHHH T ss_conf 97275977999999999998174-47885799378778775589999999999999737999989999379983787999 Q ss_pred CHHHHHHHHCCCC-EEEEECH Q ss_conf 6766655434898-8999874 Q gi|254781004|r 592 NYKIREHSIKKIP-IIIICGD 611 (652) Q Consensus 592 G~Kir~a~l~giP-~~ivIG~ 611 (652) ..++.++...|.. +..+||. T Consensus 86 a~~i~~~~~~g~~~i~FiIGG 106 (155) T pfam02590 86 AKLLEDLRLKGKSDITFVIGG 106 (155) T ss_pred HHHHHHHHHCCCCCEEEEEEC T ss_conf 999999983389755999837 No 181 >smart00166 UBX Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p. Probab=48.78 E-value=19 Score=16.03 Aligned_cols=31 Identities=32% Similarity=0.478 Sum_probs=25.1 Q ss_pred CCEEEECCCCCEE--ECCCCCCHHHHHHHHCCC Q ss_conf 8708992899887--418988899999863611 Q gi|254781004|r 3 SDIKLTFPDGSIK--NFPVHATGSDVAESISRS 33 (652) Q Consensus 3 ~~i~I~lpDG~~~--~~~~g~t~~dia~~i~~~ 33 (652) +.|.|.||||+.. .|+...|+.+|..-+... T Consensus 5 ~~iqiRlpdG~~l~~~F~~~dtl~~v~~fV~~~ 37 (80) T smart00166 5 CRLQIRLPDGSRLVRRFPSSDTLRTVYEFVSAA 37 (80) T ss_pred EEEEEEECCCCEEEEECCCCCHHHHHHHHHHHC T ss_conf 799999199998999838978399999999864 No 182 >smart00156 PP2Ac Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, that includes PP1, PP2A and PP2B (calcineurin) family members. Probab=47.26 E-value=20 Score=15.87 Aligned_cols=95 Identities=17% Similarity=0.351 Sum_probs=65.3 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCEEEEECC----------HHHHHHHHHHCCCHHHHHHHHHCCCCCCCEEECCCCCEEEC Q ss_conf 88989999999999999669980485327----------67854454416711355665420345542130257520100 Q gi|254781004|r 114 FSSDELAQIEKKMQEIIARDSPFLKQYLS----------HEKARELFESKKEAYKVEILESIPAKENVTLYRQGEWFDLC 183 (652) Q Consensus 114 it~~dl~~Ie~~M~~ii~~~~pi~r~~vs----------~~eA~~~F~~~~~~~K~~li~~~~~~~~v~~y~~g~~~Dlc 183 (652) ++++|+..|-.+..+|.++.-.+.+..-+ ..+.+.+|+..|.+ ....+.-.|||+| T Consensus 1 ~~~edi~~l~~~~~~il~~ep~ll~i~~pv~VvGDlHG~~~DL~~i~~~~g~p------------~~~~ylFLGDYVD-- 66 (271) T smart00156 1 LYAEEILELLREVKEIFRQEPNLVEVSAPVTVCGDIHGQFDDLLRLFDLNGPP------------PDTNYVFLGDYVD-- 66 (271) T ss_pred CCHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEEECCCCHHHHHHHHHHCCCC------------CCCCEEEECCCCC-- T ss_conf 98899999999999999859985995599899964889999999999974999------------8662895044205-- Q ss_pred CCCCCCHH-HHHHHHHHHHHHHHHC-CCCCCCCCEEEEEEEE Q ss_conf 02355214-4422233333321110-1577564206899999 Q gi|254781004|r 184 RGPHVRST-GQVKKFFKLMKVAGAY-WRGDSNRPMLSRIYGT 223 (652) Q Consensus 184 ~GphvpsT-g~ik~~fkL~~~sgay-w~gd~~~~~l~Riyg~ 223 (652) ||++=--+ -.+-+ +|+..=.-.| -||+..-..+...||. T Consensus 67 RG~~s~Evi~lL~~-lKi~~P~~v~lLRGNHE~~~~~~~ygF 107 (271) T smart00156 67 RGPFSIEVILLLFA-LKILYPNRVVLLRGNHESRSMNEIYGF 107 (271) T ss_pred CCCCCHHHHHHHHH-HHHHCCCEEEECCCCCCCHHHCCCCCH T ss_conf 68774289999999-998699859971688656655467783 No 183 >cd00003 PNPsynthase Pyridoxine 5'-phosphate (PNP) synthase domain; pyridoxal 5'-phosphate is the active form of vitamin B6 that acts as an essential, ubiquitous coenzyme in amino acid metabolism. In bacteria, formation of pyridoxine 5'-phosphate is a step in the biosynthesis of vitamin B6. PNP synthase, a homooctameric enzyme, catalyzes the final step in PNP biosynthesis, the condensation of 1-amino-acetone 3-phosphate and 1-deoxy-D-xylulose 5-phosphate. PNP synthase adopts a TIM barrel topology, intersubunit contacts are mediated by three ''extra'' helices, generating a tetramer of symmetric dimers with shared active sites; the open state has been proposed to accept substrates and to release products, while most of the catalytic events are likely to occur in the closed state; a hydrophilic channel running through the center of the barrel was identified as the essential structural feature that enables PNP synthase to release water molecules produced during the reaction from the closed, Probab=47.19 E-value=20 Score=15.86 Aligned_cols=63 Identities=16% Similarity=0.171 Sum_probs=38.1 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCEEEEEEECC------H------HHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHH Q ss_conf 7899999999738708873378089998579------6------689999999999998798899975896636766655 Q gi|254781004|r 531 IERFIGIMIENFKGNLPLWLSPIQAIVTTIT------S------SAVEYAQEIANLLKSHHLSIETDFRNETINYKIREH 598 (652) Q Consensus 531 ~eR~ia~liE~~~g~~P~wLAP~QV~Iipi~------e------~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a 598 (652) .+.|+.+.++ +-|.||+++|-+ + +..+.-+++-+.|+++|+||-+=--.. -.-+.-| T Consensus 72 ~~emi~ia~~---------~kP~~vtLVPe~r~elTTegGld~~~~~~~L~~~i~~lk~~~IrvSLFIDPd--~~qi~~a 140 (234) T cd00003 72 TEEMLEIALE---------VKPHQVTLVPEKREELTTEGGLDVAGQAEKLKPIIERLKDAGIRVSLFIDPD--PEQIEAA 140 (234) T ss_pred CHHHHHHHHH---------CCCCEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCC--HHHHHHH T ss_conf 3899999998---------4998789878887864178892665478899999999986598279972798--7899999 Q ss_pred HHCCCC Q ss_conf 434898 Q gi|254781004|r 599 SIKKIP 604 (652) Q Consensus 599 ~l~giP 604 (652) ...|.+ T Consensus 141 ~~~Gad 146 (234) T cd00003 141 KEVGAD 146 (234) T ss_pred HHHCCC T ss_conf 984939 No 184 >cd01226 PH_exo84 Exocyst complex 84-kDa subunit Pleckstrin Homology (PH) domain. Exo84 is a subunit of the exocyt complex, which is important in intracellular trafficking. In metazoa, Exo84 has a PH domain towards its N-terminus. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinsases, regulators of G-proteins, endocytotic GTPases, adaptors, a well as cytoskeletal associated molecules and in lipid associated enzymes. Probab=46.38 E-value=15 Score=16.66 Aligned_cols=40 Identities=28% Similarity=0.417 Sum_probs=29.1 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEEECCHHHHHHHHHHHHHHHH Q ss_conf 144422233333321110157756420689999983798899999999997532 Q gi|254781004|r 190 STGQVKKFFKLMKVAGAYWRGDSNRPMLSRIYGTAWNTQQELTQYLYFLEESEK 243 (652) Q Consensus 190 sTg~ik~~fkL~~~sgayw~gd~~~~~l~Riyg~af~~~~~l~~~~~~~eea~~ 243 (652) +.+-+|++|||+-..+ +|+|- -++.+.-++||..+|+||+ T Consensus 60 D~~~~knafKlL~fp~------------sr~fQ--~e~a~~K~eWld~le~Akr 99 (100) T cd01226 60 DRENAKKVLKLLIFPE------------SRIYQ--CESARIKTEWFEELEQAKR 99 (100) T ss_pred CCCCHHHHEEEEECCC------------CCEEE--CCCHHHHHHHHHHHHHHHC T ss_conf 7837332178887798------------62788--5885789999999998752 No 185 >PRK08299 isocitrate dehydrogenase; Validated Probab=45.58 E-value=21 Score=15.70 Aligned_cols=15 Identities=0% Similarity=0.000 Sum_probs=9.6 Q ss_pred EEECHHHHHHHHHHH Q ss_conf 334299999999873 Q gi|254781004|r 630 QKVSLLDAIPILTKE 644 (652) Q Consensus 630 ~~i~lee~i~~l~~e 644 (652) .-+..++|++.+.+. T Consensus 383 ~~l~T~~Fi~~v~~~ 397 (403) T PRK08299 383 KWLTTEGFLDKIDEN 397 (403) T ss_pred CEEEHHHHHHHHHHH T ss_conf 725099999999999 No 186 >smart00295 B41 Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs. Probab=45.48 E-value=21 Score=15.69 Aligned_cols=33 Identities=27% Similarity=0.510 Sum_probs=25.8 Q ss_pred CCCEEEECCCCCEEEC--CCCCCHHHHHHHHCCCC Q ss_conf 8870899289988741--89888999998636110 Q gi|254781004|r 2 PSDIKLTFPDGSIKNF--PVHATGSDVAESISRSL 34 (652) Q Consensus 2 ~~~i~I~lpDG~~~~~--~~g~t~~dia~~i~~~~ 34 (652) +..++|.|+||+.+++ ...+|+.|+...+...+ T Consensus 3 ~~~~~V~l~dg~~~~~~~~~~tt~~e~~~~v~~~l 37 (207) T smart00295 3 PRVLKVYLLDGTTLEFEVDSSTTAEELLETVCRKL 37 (207) T ss_pred CEEEEEEEECCCEEEEEECCCCCHHHHHHHHHHHH T ss_conf 54889981089879999789998999999999994 No 187 >TIGR01122 ilvE_I branched-chain amino acid aminotransferase; InterPro: IPR005785 Aminotransferases share certain mechanistic features with other pyridoxal-phosphate dependent enzymes, such as the covalent binding of the pyridoxal-phosphate group to a lysine residue. On the basis of sequence similarity, these various enzymes can be grouped into subfamilies. One of these, called class-IV, currently consists of proteins of about 270 to 415 amino-acid residues that share a few regions of sequence similarity. Surprisingly, the best conserved region does not include the lysine residue to which the pyridoxal-phosphate group is known to be attached, in ilvE, but is located some 40 residues at the C-terminus side of the PlP-lysine. Among the class IV aminotransferases are two phylogenetically separable groups of branched-chain amino acid aminotransferase (IlvE) (2.6.1.42 from EC). The last common ancestor of the two lineages appears also to have given rise to a family of D-amino acid aminotransferases (DAAT). This model represents the IlvE family more strongly similar to the DAAT family. ; GO: 0004084 branched-chain-amino-acid transaminase activity, 0009081 branched chain family amino acid metabolic process. Probab=44.44 E-value=5.7 Score=19.66 Aligned_cols=34 Identities=9% Similarity=0.225 Sum_probs=20.9 Q ss_pred EEEEECCCEEEEEEEEECCCHHHHH-HHHHHHH-HHC Q ss_conf 1673179721379964034103789-9999999-738 Q gi|254781004|r 509 FYVNSHSEKCHPVMIHRAVFGSIER-FIGIMIE-NFK 543 (652) Q Consensus 509 ~y~~~dg~~~~pvmihr~~~Gs~eR-~ia~liE-~~~ 543 (652) -|+-+||.-.+|-... .++-+++| .|=-|+. ..| T Consensus 193 IF~VK~G~l~TPp~~~-s~L~GITRd~VitlA~ke~G 228 (302) T TIGR01122 193 IFIVKDGVLITPPVSS-SILKGITRDTVITLAKKELG 228 (302) T ss_pred EEEEECCEEECCCCHH-HHCCCCCHHHHHHHHHHCCC T ss_conf 7887086787898704-22488888899999851086 No 188 >PRK02889 tolB translocation protein TolB; Provisional Probab=43.62 E-value=23 Score=15.50 Aligned_cols=14 Identities=14% Similarity=0.199 Sum_probs=8.0 Q ss_pred HHHHHHHHCCCCEE Q ss_conf 99999996699804 Q gi|254781004|r 124 KKMQEIIARDSPFL 137 (652) Q Consensus 124 ~~M~~ii~~~~pi~ 137 (652) ..|.++|..|+... T Consensus 55 ~~i~~II~~DL~~S 68 (430) T PRK02889 55 QQVTSIVRADLARS 68 (430) T ss_pred HHHHHHHHHHHHHC T ss_conf 67999999888728 No 189 >PRK11180 rluD 23S rRNA pseudouridine synthase D; Provisional Probab=43.55 E-value=23 Score=15.49 Aligned_cols=58 Identities=17% Similarity=0.240 Sum_probs=36.9 Q ss_pred CCCCEEEECCCCCEEECCCCCCHHHHHHHHCCCCCCC----E---EEEEECCEEEEC-CEEEC-CCEEEE Q ss_conf 9887089928998874189888999998636110112----8---999989889977-45303-670798 Q gi|254781004|r 1 MPSDIKLTFPDGSIKNFPVHATGSDVAESISRSLAKK----A---VAVAINGKVCDL-SDPVR-EGSIEI 61 (652) Q Consensus 1 M~~~i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~----~---vaa~vng~l~dL-~~~l~-d~~i~~ 61 (652) |+.+|++|| ++-+-..|.++-.+.+..-+++++. . =..+|||+...- ++.+. +..|++ T Consensus 1 ~~~~~~lT~---tV~~~~aG~RLD~fL~~~~~~~srs~i~k~ir~g~I~VNgk~v~k~~~~v~~GD~I~i 67 (325) T PRK11180 1 MAQQVQLTA---TVSESQLGQRLDQALAELFPDYSRSRIKEWILDQRVLVNGKVINKPKEKVLGGEQVAI 67 (325) T ss_pred CCCEEEEEE---EECCCCCCCHHHHHHHHHCCCCCHHHHHHHHHCCCEEECCEECCCCCCCCCCCCEEEE T ss_conf 996378999---9991007973999999757888999999999869889999994782127889999998 No 190 >smart00461 WH1 WASP homology region 1. Region of the Wiskott-Aldrich syndrome protein (WASp) that contains point mutations in the majority of patients with WAS. Unknown function. Ena-like WH1 domains bind polyproline-containing peptides, and that Homer contains a WH1 domain. Probab=42.84 E-value=23 Score=15.42 Aligned_cols=29 Identities=17% Similarity=0.478 Sum_probs=24.3 Q ss_pred CCCCCCCCCEEEEEEEEEECCHHHHHHHHHHHHHH Q ss_conf 01577564206899999837988999999999975 Q gi|254781004|r 207 YWRGDSNRPMLSRIYGTAWNTQQELTQYLYFLEES 241 (652) Q Consensus 207 yw~gd~~~~~l~Riyg~af~~~~~l~~~~~~~eea 241 (652) .|++|. ++||..|.++++.+++.+.+.+| T Consensus 77 ~f~~d~------~~~GLnFase~EA~~F~~~v~~a 105 (106) T smart00461 77 QWADDK------CVYGLNFASEEEAKKFRKKVLKA 105 (106) T ss_pred EEECCC------EEEEEEECCHHHHHHHHHHHHHH T ss_conf 998188------78998419999999999999965 No 191 >PRK04922 tolB translocation protein TolB; Provisional Probab=42.48 E-value=24 Score=15.38 Aligned_cols=18 Identities=6% Similarity=0.099 Sum_probs=12.1 Q ss_pred HHHHHHHHHHHHCCCCEE Q ss_conf 999999999996699804 Q gi|254781004|r 120 AQIEKKMQEIIARDSPFL 137 (652) Q Consensus 120 ~~Ie~~M~~ii~~~~pi~ 137 (652) ...+..+.++|..|+.-. T Consensus 53 ~~~~~~i~~II~~DL~~S 70 (439) T PRK04922 53 TAPQTDVSAVVSADLDRS 70 (439) T ss_pred CCHHHHHHHHHHHHHHHC T ss_conf 550364999998657727 No 192 >cd00824 PTBI IRS-like phosphotyrosine-binding domain (PTBi); This domain has a PH-like fold and is found in insulin receptor substrate molecules and in other eukaryotic signaling molecules such as FRS2 and Dok. IRS and Dok molecules have an N-terminal PH domain, which is followed by an IRS-like PTB domain. FRS2 just has an N-terminal PTBi domain. This PTBi domain is shorter than the PTB domain which is found in SHC, Numb and other proteins. The PTBi domain binds to phosphotyrosines which are in NPXpY motifs. Probab=42.40 E-value=24 Score=15.37 Aligned_cols=47 Identities=19% Similarity=0.342 Sum_probs=36.9 Q ss_pred HHHHHHHHHHCCEEEECC-----CCCCEEEEEC-CHHHHHHHHHHHHHHHHHC Q ss_conf 200887541144044226-----7874356501-0789999999999988768 Q gi|254781004|r 243 KRDHRKLAREMDLFHIAE-----DGSGVIFWHR-KGWKIFQTLISYMRRKIKD 289 (652) Q Consensus 243 ~rdHr~lg~~~~lf~~~~-----~~~G~~~wlP-~G~~i~~~ie~~ir~~~~~ 289 (652) -+.=|+-|.+-..|+|.- .|+|++.+.- .|..|.+.+...|.+.+++ T Consensus 48 l~~lRryG~~~~~F~fEaGR~c~~G~G~f~f~~~~~~~i~~~~~~~I~~~~~~ 100 (104) T cd00824 48 LMFLRRYGYDSNLFSFEAGRRCVTGEGIFTFQTDRAEEIFQNVHETILAAMKA 100 (104) T ss_pred HHHEEEECCCCCEEEEEECCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHH T ss_conf 47713403568899999048889898479999089999999999999999976 No 193 >pfam03740 PdxJ Pyridoxal phosphate biosynthesis protein PdxJ. Members of this family belong to the PdxJ family that catalyses the condensation of 1-deoxy-d-xylulose-5-phosphate (DXP) and 1-amino-3-oxo-4-(phosphohydroxy)propan-2-one to form pyridoxine 5'-phosphate (PNP). This reaction is involved in de novo synthesis of pyridoxine (vitamin B6) and pyridoxal phosphate. Probab=42.16 E-value=24 Score=15.35 Aligned_cols=64 Identities=13% Similarity=0.136 Sum_probs=38.0 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCEEEEEEECC------------HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHH Q ss_conf 7899999999738708873378089998579------------6689999999999998798899975896636766655 Q gi|254781004|r 531 IERFIGIMIENFKGNLPLWLSPIQAIVTTIT------------SSAVEYAQEIANLLKSHHLSIETDFRNETINYKIREH 598 (652) Q Consensus 531 ~eR~ia~liE~~~g~~P~wLAP~QV~Iipi~------------e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a 598 (652) .+.|+.+.++ +-|.||+++|-+ .+..+.-.++-+.|+++|+||-+=--.. -.-+..| T Consensus 73 ~~emi~ia~~---------~kP~qvtLVPE~r~elTTegGld~~~~~~~L~~~i~~lk~~girvSlFIDpd--~~~i~~a 141 (239) T pfam03740 73 TEEMLELALK---------TKPHQVTLVPEKREEITTEGGLDVVAQLEKLKPAIRRLKNAGIRVSLFIDPD--PEQIEAA 141 (239) T ss_pred HHHHHHHHHH---------CCCCEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEECCC--HHHHHHH T ss_conf 4999999998---------4998589888999873568880633406899999999860785389970799--8999999 Q ss_pred HHCCCCE Q ss_conf 4348988 Q gi|254781004|r 599 SIKKIPI 605 (652) Q Consensus 599 ~l~giP~ 605 (652) ...|.+. T Consensus 142 ~~~Gad~ 148 (239) T pfam03740 142 KIVGADR 148 (239) T ss_pred HHCCCCE T ss_conf 9809299 No 194 >cd01772 SAKS1_UBX SAKS1 (SAPK-substrate-1), also known as Y33K, is a widely expressed protein containing N-terminal UBA (ubiquitin-associated) and C-terminal UBX (ubiqiutin-like) domains that was identified as a substrate of stress-activated protein kinases (SAPKs). SAKS1 is related evolutionarily to two other UBA/UBX-containing proteins, p47 and Faf1. The UBA and UBX domains of SAKS1 bind ubiquitin tetramers and valosin-containing protein (VCP), respectively suggesting a role for SAKS1 as an adaptor that directs VCP to polyubiquitinated proteins facilitating its destruction by the proteasome. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins. Probab=41.78 E-value=24 Score=15.31 Aligned_cols=30 Identities=17% Similarity=0.239 Sum_probs=24.4 Q ss_pred CCEEEECCCCCEE--ECCCCCCHHHHHHHHCC Q ss_conf 8708992899887--41898889999986361 Q gi|254781004|r 3 SDIKLTFPDGSIK--NFPVHATGSDVAESISR 32 (652) Q Consensus 3 ~~i~I~lpDG~~~--~~~~g~t~~dia~~i~~ 32 (652) ..|.|.||||+.. .|+...|+.++..-+.. T Consensus 5 t~iqiRLpdG~~l~~~F~~~~tL~~V~~~V~~ 36 (79) T cd01772 5 TRIQIRLLDGTTLKQTFKAREQLAAVRLFVEL 36 (79) T ss_pred EEEEEECCCCCEEEEECCCCCHHHHHHHHHHH T ss_conf 89999959999888880798939999999985 No 195 >TIGR00472 pheT_bact phenylalanyl-tRNA synthetase, beta subunit; InterPro: IPR004532 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Phenylalanyl-tRNA synthetase (6.1.1.20 from EC) is an alpha2/beta2 tetramer composed of 2 subunits that belongs to class IIc. In eubacteria, a small subunit (pheS gene) can be designated as beta (E. coli) or alpha subunit (see IPR002319 from INTERPRO). Reciprocally the large subunit (pheT gene) can be designated as alpha (E. coli) or beta. In all other kingdoms the two subunits have equivalent length in eukaryota, and can be identified by specific signatures. The enzyme from Thermus thermophilus has an alpha 2 beta 2 type quaternary structure and is one of the most complicated members of the synthetase family. Identification of phenylalanyl-tRNA synthetase as a member of class II aaRSs was based only on sequence alignment of the small alpha-subunit with other synthetases . This family describes the beta subunit. The beta subunits break into two subfamilies that are considerably different in sequence, length, and pattern of gaps (see also IPR004531 from INTERPRO). This family represents the subfamily that includes the beta subunit from bacteria other than spirochetes, as well as a chloroplast-encoded form from Porphyra purpurea. The chloroplast-derived sequence is considerably shorter at the N-terminal.; GO: 0000166 nucleotide binding, 0004826 phenylalanine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006432 phenylalanyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=41.70 E-value=19 Score=16.01 Aligned_cols=86 Identities=14% Similarity=0.194 Sum_probs=52.7 Q ss_pred HHHHHHH-CCCEEEECCCCCCCCCCCCCCCCCCCC-CCCCEEECCCCCCCCCCEEEECCCCHHHHHHHH-HHCCCCHHHC Q ss_conf 9998876-896385554324431000112100125-554201001456777823774343125777765-2012222310 Q gi|254781004|r 282 YMRRKIK-DDYEEINTPQVLDQHLWQQSGHWDSYR-ANMFAVKCADDTIKDLRTFALKPMNCPGHVAVF-NHGLKSYREL 358 (652) Q Consensus 282 ~ir~~~~-~G~~eV~tP~l~~~~Lw~~SGh~~~y~-~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~-~~~~~SYrdL 358 (652) -+|.-+. .||+||.|..+.+.+=-+..++-..-. ++.+..... --.+.=+||++.-|+..... .|.-|+-++ T Consensus 526 ~~~~~l~~~G~~E~itYsl~~~~~~~~~~~~~~~~~~~~~~i~NP----~s~e~~~lR~sL~~sLL~~~~~N~~r~~~~- 600 (848) T TIGR00472 526 KLRKLLVGLGLNEVITYSLVSKEKLEKFNFPKLENLDELVEIKNP----LSEERSVLRTSLLPSLLEVLAYNQNRKNKD- 600 (848) T ss_pred HHHHHHHHCCCCEEEEECCCCHHHHHHHCCCCCCCCHHEEEECCC----CCHHHHHHHHHHHHHHHHHHHHHHHCCCCC- T ss_conf 999999848985777301388799987168777861010451589----835478876644688999999998638971- Q ss_pred CHHHHHHEEEECCCC Q ss_conf 432000100012787 Q gi|254781004|r 359 PVRLAEFGSVYRNEP 373 (652) Q Consensus 359 Plrl~e~~~~~R~E~ 373 (652) +++||+|.+|-... T Consensus 601 -~~lFE~G~~f~~~~ 614 (848) T TIGR00472 601 -VKLFEIGKVFAKDG 614 (848) T ss_pred -EEEEEEEEEECCCC T ss_conf -58998876426885 No 196 >TIGR01344 malate_syn_A malate synthase A; InterPro: IPR006252 These sequences represent plant malate synthase and one of two bacterial forms, designated malate synthase A. This enzyme and isocitrate lyase are the two characteristic enzymes of the glyoxylate shunt. The shunt enables the cell to use acetyl-CoA to generate increased levels of TCA cycle intermediates for biosynthetic pathways such as gluconeogenesis.; GO: 0004474 malate synthase activity, 0006097 glyoxylate cycle. Probab=41.66 E-value=4.2 Score=20.54 Aligned_cols=23 Identities=22% Similarity=0.333 Sum_probs=18.4 Q ss_pred EEEEECCCCCCCCCEEE-CCHHHH Q ss_conf 00010563533442101-053767 Q gi|254781004|r 378 HGLMRVRGFTQDDAHVF-CTKEQM 400 (652) Q Consensus 378 ~GL~R~ReF~~~Dah~F-~~~eq~ 400 (652) .-++|.|+..|.|=|+- ....-+ T Consensus 145 vL~~RPRGWHL~ErHl~G~DG~~i 168 (522) T TIGR01344 145 VLIVRPRGWHLPERHLEGIDGEAI 168 (522) T ss_pred EEEEECCCCCCCCCCEECCCCCCC T ss_conf 788844888675551347987602 No 197 >PRK02270 consensus Probab=41.58 E-value=22 Score=15.66 Aligned_cols=72 Identities=14% Similarity=0.146 Sum_probs=32.2 Q ss_pred HHHHHHCCCCCCCEEEEEECCEE-EECCEEECCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 99986361101128999989889-977453036707986289999999999989999999999977998899724 Q gi|254781004|r 25 DVAESISRSLAKKAVAVAINGKV-CDLSDPVREGSIEIITPEDPRSLAVIRHSCAHIMAEAVQSIWPEVQVAIGP 98 (652) Q Consensus 25 dia~~i~~~~~~~~vaa~vng~l-~dL~~~l~d~~i~~i~~~~~eg~~i~~hS~ahlL~~Av~~l~p~~kl~igp 98 (652) +||+.++-.+.+..+--.=||++ ..+...+.+++|-++...++.-..-+-. ..+|..|+++.--..-..+-| T Consensus 20 ~IA~~Lg~~l~~~~~~~F~DGE~~v~i~esvrg~dV~ivqs~~~pvnd~lmE--Lll~idA~rrasA~~It~ViP 92 (327) T PRK02270 20 EISKITNIPLSTIEKTVFADGEVLLKSKETVRNRDVFIVASTSRPVNENIME--LLIFIDSLKRASAKEINVILS 92 (327) T ss_pred HHHHHHCCCCCCEEEEECCCCCEEEEECCCCCCCEEEEECCCCCCCHHHHHH--HHHHHHHHHHCCCCCEEEEEC T ss_conf 9999969973122889989998898427877898089987899993178999--999999998738872589712 No 198 >pfam01582 TIR TIR domain. The Toll/interleukin-1 receptor (TIR) homology domain is an intracellular signalling domain found in MyD88, interleukin 1 receptor and the Toll receptor. It contains three highly-conserved regions, and mediates protein-protein interactions between the Toll-like receptors (TLRs) and signal-transduction components. TIR-like motifs are also found in plant proteins thought to be involved in resistance to disease. When activated, TIR domains recruit cytoplasmic adaptor proteins MyD88 and TOLLIP (Toll interacting protein). In turn, these associate with various kinases to set off signalling cascades. Probab=40.91 E-value=25 Score=15.22 Aligned_cols=13 Identities=23% Similarity=0.340 Sum_probs=4.8 Q ss_pred CCCCHHHHHHHHH Q ss_conf 8888989999999 Q gi|254781004|r 112 QPFSSDELAQIEK 124 (652) Q Consensus 112 ~~it~~dl~~Ie~ 124 (652) .++.++-+..|++ T Consensus 41 ~~i~~~i~~aI~~ 53 (139) T pfam01582 41 ESILENLFEAIEK 53 (139) T ss_pred CCHHHHHHHHHHH T ss_conf 8459999999997 No 199 >TIGR01687 moaD_arch MoaD family protein; InterPro: IPR010038 Members of this family appear to be archaeal and bacterial (proteobacteria and Thermus) versions of MoaD, subunit 1 of molybdopterin converting factor.. Probab=40.79 E-value=25 Score=15.21 Aligned_cols=51 Identities=24% Similarity=0.403 Sum_probs=34.9 Q ss_pred EEECCCCCCHHHHHHHHCCCCC-------CC-------EEEEEECCE-EE----ECCEEECCC-EEEEECC Q ss_conf 8741898889999986361101-------12-------899998988-99----774530367-0798628 Q gi|254781004|r 14 IKNFPVHATGSDVAESISRSLA-------KK-------AVAVAINGK-VC----DLSDPVREG-SIEIITP 64 (652) Q Consensus 14 ~~~~~~g~t~~dia~~i~~~~~-------~~-------~vaa~vng~-l~----dL~~~l~d~-~i~~i~~ 64 (652) ..|.+.|-|+.|+.+.+...++ +. -|...|||+ =. +|+++|+|+ .|.++.. T Consensus 19 eiE~~~g~tv~dll~~l~~~Yp~~~~e~~~et~~~~~~~v~ilvNGran~~~l~GL~~~LkdGD~vai~PP 89 (93) T TIGR01687 19 EIELEDGKTVGDLLEELSSRYPKEFSELFKETGLGLVPNVIILVNGRANVDWLEGLETELKDGDVVAIFPP 89 (93) T ss_pred EEECCCCCCHHHHHHHHHHHCCHHHHHHHCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCEEEEECC T ss_conf 98779998079989998861565566651477887646578985164143220365752327875675069 No 200 >PRK04539 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=40.61 E-value=25 Score=15.19 Aligned_cols=41 Identities=12% Similarity=0.149 Sum_probs=20.0 Q ss_pred CEEEEECHHH---HHC-CCEEEEECCCCCE-EEECHHHHHHHHHHH Q ss_conf 8899987425---325-8189998898866-334299999999873 Q gi|254781004|r 604 PIIIICGDKE---ASE-RSIGIRRFGSTTT-QKVSLLDAIPILTKE 644 (652) Q Consensus 604 P~~ivIG~ke---~e~-~~Vtir~R~~~~~-~~i~lee~i~~l~~e 644 (652) +-.+++.-+. ++. +.|.|+.-...-+ ....-.+|.+.|+++ T Consensus 243 ~~~l~~DGq~~~~l~~gd~I~I~~s~~~~~li~~~~~~f~~~Lr~K 288 (296) T PRK04539 243 DARVHFDGQTHIDVQNLDRITIRRYRNPLRILHPTDYQYFKTLRQK 288 (296) T ss_pred CEEEEECCCCCEECCCCCEEEEEECCCEEEEEECCCCCHHHHHHHH T ss_conf 7899990998537199899999988976999974999989998645 No 201 >pfam08921 DUF1904 Domain of unknown function (DUF1904). This domain is found in a set of hypothetical bacterial proteins. Probab=40.35 E-value=25 Score=15.16 Aligned_cols=78 Identities=10% Similarity=0.205 Sum_probs=47.6 Q ss_pred EEEEEEEECCHHHHHHH-HHHHHHHH--HHHHHHHHHHCCEEEECCC-CCCE----EEEECCHHHHHHHHHHHHHHHHH- Q ss_conf 89999983798899999-99999753--2008875411440442267-8743----56501078999999999998876- Q gi|254781004|r 218 SRIYGTAWNTQQELTQY-LYFLEESE--KRDHRKLAREMDLFHIAED-GSGV----IFWHRKGWKIFQTLISYMRRKIK- 288 (652) Q Consensus 218 ~Riyg~af~~~~~l~~~-~~~~eea~--~rdHr~lg~~~~lf~~~~~-~~G~----~~wlP~G~~i~~~ie~~ir~~~~- 288 (652) .|+.|+....-+.+..- +..+.+.- .|++=.|--.---|+++.. +.|. .+|.|+|-.+..++.+.+++..+ T Consensus 3 lrfRG~~~~~v~~lS~~Li~eLa~i~~~~~e~ftlE~i~s~~~~~G~~~~~ypfVEVlWF~R~qe~~d~vA~~It~~v~~ 82 (107) T pfam08921 3 LRFRGLEEDAVKELSKTLIEELADICKCPPEAFTLEWINSVFFRDGKIDDAYPFVEVLWFPRDQETQDLVAQIITEAVRA 82 (107) T ss_pred EEEECCCHHHHHHHHHHHHHHHHHHHCCCCHHEEEEEEEEEEEECCCCCCCCCEEEEEEECCCHHHHHHHHHHHHHHHHH T ss_conf 88945998999997399999999997798244899983368997375567733599998038989999999999999984 Q ss_pred -CCCEEEE Q ss_conf -8963855 Q gi|254781004|r 289 -DDYEEIN 295 (652) Q Consensus 289 -~G~~eV~ 295 (652) .|++.|. T Consensus 83 ~~~~~~V~ 90 (107) T pfam08921 83 VNGLQDVA 90 (107) T ss_pred CCCCCEEE T ss_conf 38997399 No 202 >TIGR01857 FGAM-synthase phosphoribosylformylglycinamidine synthase; InterPro: IPR010141 This entry represents a single-molecule form of phosphoribosylformylglycinamidine synthase, also called FGAM synthase, an enzyme of purine de novo biosynthesis, which represent a second clade of the enzymes found in Clostridia, Bifidobacteria and Streptococcus species. This enzyme performs the fourth step in IMP biosynthesis (the precursor of all purines) from PRPP.. Probab=39.92 E-value=7.7 Score=18.72 Aligned_cols=230 Identities=19% Similarity=0.277 Sum_probs=114.1 Q ss_pred CCCEEECCCC-------CCHHHHH---HHHCCCCCCCEEEEEE----------CC-EEEECCEE-E-CCCE-----EEEE Q ss_conf 9988741898-------8899999---8636110112899998----------98-89977453-0-3670-----7986 Q gi|254781004|r 11 DGSIKNFPVH-------ATGSDVA---ESISRSLAKKAVAVAI----------NG-KVCDLSDP-V-REGS-----IEII 62 (652) Q Consensus 11 DG~~~~~~~g-------~t~~dia---~~i~~~~~~~~vaa~v----------ng-~l~dL~~~-l-~d~~-----i~~i 62 (652) ||+...+.+. ++..|+= +...+.--.++|.|.| || .++||+|. | ++|- |+.. T Consensus 548 dGTELAISESQERMAVvv~~Ed~d~F~~~~~eENL~AtvvA~VTd~~rL~m~W~G~~IVdl~R~FL~tNGv~~~~~~~V~ 627 (1279) T TIGR01857 548 DGTELAISESQERMAVVVSKEDVDKFLKYAEEENLEATVVATVTDKPRLVMNWKGKTIVDLSREFLDTNGVKQEIDVKVK 627 (1279) T ss_pred CCCEEEEECCCCCCEEECCCCCHHHHHHHHHHCCCCEEEEEEEECCCCEEEEECCCEEEEEEHHHHHHCCCEEEEEEEEE T ss_conf 85133120145511467286565789987532385228999995699706711786578013211300884178989996 Q ss_pred CCCCHHH---HHHHHHHHHHHHHHHHHHHCC----------CCEEEEE----------------------CCC------- Q ss_conf 2899999---999999899999999999779----------9889972----------------------435------- Q gi|254781004|r 63 TPEDPRS---LAVIRHSCAHIMAEAVQSIWP----------EVQVAIG----------------------PII------- 100 (652) Q Consensus 63 ~~~~~eg---~~i~~hS~ahlL~~Av~~l~p----------~~kl~ig----------------------p~i------- 100 (652) +.+.+.. ..+=..|+.-=+.+.+++|-- ++-++-| |.+ T Consensus 628 ~~~~~~~~~~~~~~~~~l~~~~l~~~~~LNv~SqKGL~e~FDSsvG~gTvl~PlGGkyQLTP~~a~v~KlPVl~g~t~tA 707 (1279) T TIGR01857 628 DKDVKLSVEKFKVSEETLEEKWLKVLKDLNVASQKGLVERFDSSVGAGTVLMPLGGKYQLTPTEASVAKLPVLGGETKTA 707 (1279) T ss_pred ECCCCCCCCCEECCCHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCHHHCCCCCCCCCCCHHHHHHHCCCCCCCCCCCE T ss_conf 21255642321136204788887655430121247752120034171021377888645770257873277578865340 Q ss_pred ---CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCC----CHHHHHHHHHCCCCCCCEE Q ss_conf ---885578840888888989999999999999669980485327678544544167----1135566542034554213 Q gi|254781004|r 101 ---EDGFYYDFDKEQPFSSDELAQIEKKMQEIIARDSPFLKQYLSHEKARELFESKK----EAYKVEILESIPAKENVTL 173 (652) Q Consensus 101 ---~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA~~~F~~~~----~~~K~~li~~~~~~~~v~~ 173 (652) --||===+....+++=.-+.-||. .-+||++...+.+-.+|..|=-+...... .|. +.||-.+...--.-+ T Consensus 708 S~~a~GFNPYi~~ws~yhGA~yaVvEs-~AkLvAaG~~y~karLSFQEYFEkL~~~~ekwgkP~-aAlLGA~~aQ~dLG~ 785 (1279) T TIGR01857 708 SAIAWGFNPYIAEWSPYHGAIYAVVES-LAKLVAAGVDYKKARLSFQEYFEKLDKDEEKWGKPF-AALLGALKAQVDLGL 785 (1279) T ss_pred EEEEECCCCCCCCCCCCHHHHHHHHHH-HHHHHHCCCCEEEEEEEEEHHHHHCCCCCCCCCCHH-HHHHHHHHHHHHCCC T ss_conf 244416675013577502367888999-999997289703788620022332377751006708-999889999887088 Q ss_pred E-------CCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEE-----ECCHHHHHHHHHHHHHH Q ss_conf 0-------25752010002355214442223333332111015775642068999998-----37988999999999975 Q gi|254781004|r 174 Y-------RQGEWFDLCRGPHVRSTGQVKKFFKLMKVAGAYWRGDSNRPMLSRIYGTA-----WNTQQELTQYLYFLEES 241 (652) Q Consensus 174 y-------~~g~~~Dlc~GphvpsTg~ik~~fkL~~~sgayw~gd~~~~~l~Riyg~a-----f~~~~~l~~~~~~~eea 241 (652) = =.|.|-||-.-|=+-|=+- . .-+..++-+.=++.=.+ .+.+|-|-+ -++.+.|++-++.+++. T Consensus 786 ~AIGGKDSMSGtF~dL~VPPTLisFAV-~-~~~~~~v~SpEFK~~~~--~i~~i~~~~l~~~~~~d~~~lk~nf~~~~~~ 861 (1279) T TIGR01857 786 PAIGGKDSMSGTFEDLDVPPTLISFAV-A-TANVKRVISPEFKKAGE--KIYLIEGKALEDDLTLDSDELKENFEKIEEL 861 (1279) T ss_pred CCCCCCCCCCCCCCCCCCCCHHHHHHH-H-HHHCCCCCCCHHHCCCC--EEEEECCCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 766863477501588988702553443-2-43111023602330287--5887066202246750678899999999984 Q ss_pred HHHHHH Q ss_conf 320088 Q gi|254781004|r 242 EKRDHR 247 (652) Q Consensus 242 ~~rdHr 247 (652) ..||. T Consensus 862 -~~~~k 866 (1279) T TIGR01857 862 -IKDHK 866 (1279) T ss_pred -HCCCC T ss_conf -05774 No 203 >pfam12362 DUF3646 DNA polymerase III gamma and tau subunits C terminal. This domain family is found in bacteria, and is approximately 120 amino acids in length. The family is found in association with pfam00004. The proteins in this family are frequently annotated as the gamma and tau subunits of DNA polymerase III, however there is little accompanying literature to back this up. Probab=39.18 E-value=26 Score=15.04 Aligned_cols=20 Identities=25% Similarity=0.489 Sum_probs=8.3 Q ss_pred HHHHHHHHHHHHHHCCCCEE Q ss_conf 98999999999997799889 Q gi|254781004|r 75 HSCAHIMAEAVQSIWPEVQV 94 (652) Q Consensus 75 hS~ahlL~~Av~~l~p~~kl 94 (652) .-..|=+.+|+.+.||++++ T Consensus 94 ~~~~~P~v~avl~~FP~A~I 113 (117) T pfam12362 94 DARAHPLVQAVLAAFPGAKI 113 (117) T ss_pred HHHCCHHHHHHHHHCCCCEE T ss_conf 99879899999987989887 No 204 >TIGR00106 TIGR00106 conserved hypothetical protein TIGR00106; InterPro: IPR002767 This entry contains several hypothetical proteins of unknown function found in archaebacteria, eukaryotes and eubacteria. The structures of YBL001c from Saccharomyces cerevisiae and its homologue MTH1187 from the archaea Methanobacterium thermoautotrophicum have been determined . These proteins have a ferredoxin-like alpha/beta sandwich structure with anti-parallel beta-sheets. Generally, they have two domains that form a single beta-sheet dimer, where two dimers pack sheet-to-sheet into a tetramer, some proteins having an extra C-terminal helix. . Probab=38.81 E-value=27 Score=15.01 Aligned_cols=58 Identities=17% Similarity=0.244 Sum_probs=35.6 Q ss_pred HHHHHHHHHHHHHCC-----CCCCCCCCEEEEEEE-CCHHHHHHHHHHHHHHHHCCC-EEE----EECC Q ss_conf 378999999997387-----088733780899985-796689999999999998798-899----9758 Q gi|254781004|r 530 SIERFIGIMIENFKG-----NLPLWLSPIQAIVTT-ITSSAVEYAQEIANLLKSHHL-SIE----TDFR 587 (652) Q Consensus 530 s~eR~ia~liE~~~g-----~~P~wLAP~QV~Iip-i~e~~~eya~~i~~~L~~~gi-rV~----~Ddr 587 (652) |++.|+|.++|...| .+-..|.|.--.|=- ==+++.+..+++.+.+.++|. |+. +|+| T Consensus 16 SvS~YVA~~~k~L~Gyqae~~lk~~l~~mGT~iEgEdld~l~~~~~~~Hea~~~~G~~R~~~~~k~~~R 84 (103) T TIGR00106 16 SVSSYVAAVIKVLKGYQAEEGLKYELHAMGTLIEGEDLDELFEAIKAIHEAVFEKGSDRVYSSLKIDTR 84 (103) T ss_pred CHHHHHHHHHHHHHHHCCCCCCCEEECCCCCEEECCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCC T ss_conf 777899999999851034457743442576523046978999999998654753599515788862068 No 205 >pfam06646 Mycoplasma_p37 High affinity transport system protein p37. This family consists of several high affinity transport system protein p37 sequences which are specific to Mycoplasma species. The p37 gene is part of an operon encoding two additional proteins which are highly similar to components of the periplasmic binding-protein-dependent transport systems of Gram-negative bacteria.It has been suggested that p37 is part of a homologous, high-affinity transport system in M. hyorhinis, a Gram-positive bacterium. Probab=37.42 E-value=20 Score=15.88 Aligned_cols=19 Identities=5% Similarity=0.065 Sum_probs=12.4 Q ss_pred CCHHHHHHHHHHHHHHHHC Q ss_conf 8898999999999999966 Q gi|254781004|r 114 FSSDELAQIEKKMQEIIAR 132 (652) Q Consensus 114 it~~dl~~Ie~~M~~ii~~ 132 (652) +...-|..+.++..+|=++ T Consensus 52 ~~~~Fl~~~~~~fnkLkn~ 70 (383) T pfam06646 52 AINKFLKNFSNEFSKLKKA 70 (383) T ss_pred HHHHHHHHHHHHHHHHHCC T ss_conf 7899999999999886517 No 206 >KOG1261 consensus Probab=36.96 E-value=18 Score=16.16 Aligned_cols=33 Identities=21% Similarity=0.450 Sum_probs=22.7 Q ss_pred HCCEEEECC------CCCCEEEEECCH-----HHHHHHHHHHHH Q ss_conf 144044226------787435650107-----899999999999 Q gi|254781004|r 252 EMDLFHIAE------DGSGVIFWHRKG-----WKIFQTLISYMR 284 (652) Q Consensus 252 ~~~lf~~~~------~~~G~~~wlP~G-----~~i~~~ie~~ir 284 (652) +.+|||||. .|+|-||+||+= +++|+-+-.+.. T Consensus 200 DFGLyffHNAK~li~~GsGPyFYLPKmeh~~EaklWndvF~~ae 243 (552) T KOG1261 200 DFGLYFFHNAKELIKQGSGPYFYLPKMEHHEEAKLWNDVFCVAE 243 (552) T ss_pred EEEEEEEECHHHHHHCCCCCEEECCCCCCHHHHHHHHHHHHHHH T ss_conf 43456771279998548987365632110678888898999888 No 207 >PRK02260 S-ribosylhomocysteinase; Provisional Probab=36.93 E-value=29 Score=14.81 Aligned_cols=72 Identities=14% Similarity=0.330 Sum_probs=47.5 Q ss_pred EEEEECCCCHHHH-HHHHHHHHHHHHHHHHHHCC--CCEEEEECCC-CCCEEEEEECCCCCCHHHHHHHHHHHHHHHH Q ss_conf 0798628999999-99999899999999999779--9889972435-8855788408888889899999999999996 Q gi|254781004|r 58 SIEIITPEDPRSL-AVIRHSCAHIMAEAVQSIWP--EVQVAIGPII-EDGFYYDFDKEQPFSSDELAQIEKKMQEIIA 131 (652) Q Consensus 58 ~i~~i~~~~~eg~-~i~~hS~ahlL~~Av~~l~p--~~kl~igp~i-~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~ 131 (652) +|+|...+.+ .+ ..--||+-||+|--+++-.. .--+-+||-= ..|||.-..++.. +++-+..+++-|+.|.. T Consensus 37 DlRf~qPN~e-~m~~~~~HTlEHL~A~~lRnh~~~~~~iId~sPMGCrTGFYl~~~g~~~-~~~v~~~~~~~l~~I~~ 112 (163) T PRK02260 37 DLRFCQPNKE-AMPTAGLHTLEHLLAGFMRNHLDDGVEIIDISPMGCRTGFYLSLIGTPD-EQDVADALKASLEDVLQ 112 (163) T ss_pred EEEECCCCHH-HCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCCCEEEEEECCCC-HHHHHHHHHHHHHHHHH T ss_conf 7764378832-3786205699999999987472135876985266545303899958999-89999999999999985 No 208 >COG3894 Uncharacterized metal-binding protein [General function prediction only] Probab=36.90 E-value=29 Score=14.81 Aligned_cols=58 Identities=31% Similarity=0.496 Sum_probs=35.3 Q ss_pred CCCCEEEEC-CCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCE-EEECCEE-ECCCEEEEECCCCH Q ss_conf 988708992-8998874189888999998636110112899998988-9977453-03670798628999 Q gi|254781004|r 1 MPSDIKLTF-PDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGK-VCDLSDP-VREGSIEIITPEDP 67 (652) Q Consensus 1 M~~~i~I~l-pDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~-l~dL~~~-l~d~~i~~i~~~~~ 67 (652) || -||| |-|+.-+ +.|+|.+|.|+.++.... .++ .|+ .+.-... +.+++.++++.++. T Consensus 1 ~p---~v~f~psgkr~~-~~g~~il~aar~~gv~i~--s~c---ggk~~cgkc~v~v~~g~~~i~s~~dh 61 (614) T COG3894 1 MP---LVTFMPSGKRGE-DEGTTILDAARRLGVYIR--SVC---GGKGTCGKCQVVVQEGNHKIVSSTDH 61 (614) T ss_pred CC---EEEEECCCCCCC-CCCCHHHHHHHHHCCEEE--EEC---CCCCCCCCEEEEEEECCCEECCCHHH T ss_conf 96---268623787578-899608899986184676--304---78753243399999087035454057 No 209 >pfam01963 TraB TraB family. pAD1 is a hemolysin/bacteriocin plasmid originally identified in Enterococcus faecalis DS16. It encodes a mating response to a peptide sex pheromone, cAD1, secreted by recipient bacteria. Once the plasmid pAD1 is acquired, production of the pheromone ceases--a trait related in part to a determinant designated traB. However a related protein is found in C. elegans, suggesting that members of the TraB family have some more general function. Probab=36.43 E-value=19 Score=16.01 Aligned_cols=45 Identities=13% Similarity=0.201 Sum_probs=32.8 Q ss_pred HHHHHHHHHH---CCCEEEEECCCCCCCHHHHHHHHCCCCEEEEECHH Q ss_conf 9999999998---79889997589663676665543489889998742 Q gi|254781004|r 568 AQEIANLLKS---HHLSIETDFRNETINYKIREHSIKKIPIIIICGDK 612 (652) Q Consensus 568 a~~i~~~L~~---~girV~~Ddr~~~~G~Kir~a~l~giP~~ivIG~k 612 (652) .+++.+.|.+ .-.+|.+|.||.-+..+++++.--|--.+.|||.= T Consensus 162 l~~~~~e~~~~~P~l~~vLIdERD~ymA~~L~~~~~~~~~vVAVVGag 209 (225) T pfam01963 162 LEELMQEFREEFPALYEVLIDERDQYMAHKLLEAADPGKKVVAVVGAG 209 (225) T ss_pred HHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCH T ss_conf 999999999869545415666778999999999865998099998800 No 210 >PRK07199 phosphoribosylpyrophosphate synthetase; Provisional Probab=36.40 E-value=8 Score=18.61 Aligned_cols=72 Identities=14% Similarity=0.233 Sum_probs=36.5 Q ss_pred HHHHHHHHCCCCCCCEEEEEECCEE-EECCEEECCCEEEEECC-CCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 9999986361101128999989889-97745303670798628-9999999999989999999999977998899724 Q gi|254781004|r 23 GSDVAESISRSLAKKAVAVAINGKV-CDLSDPVREGSIEIITP-EDPRSLAVIRHSCAHIMAEAVQSIWPEVQVAIGP 98 (652) Q Consensus 23 ~~dia~~i~~~~~~~~vaa~vng~l-~dL~~~l~d~~i~~i~~-~~~eg~~i~~hS~ahlL~~Av~~l~p~~kl~igp 98 (652) ..+||+..+-.+.+..+--.=||++ ..+...+.+.+|-++.. .++. .-+- =.++|+.|+++.--..-..+-| T Consensus 15 A~~iA~~Lg~~l~~~~~~~F~DGE~~v~i~e~vrg~dV~iiqs~~~pn--d~lm--ELll~idA~r~a~A~~It~ViP 88 (301) T PRK07199 15 AGRLAAALGVEVGRITLHRFPDGESYVRLDSGVAGRTVVLVCSLDQPD--EKLL--PLLFAAEAARELGARRVVLVAP 88 (301) T ss_pred HHHHHHHHCCCCEEEEEEECCCCCEEEEECCCCCCCEEEEECCCCCCC--HHHH--HHHHHHHHHHHCCCCEEEEEEC T ss_conf 999999969970512899879998899617987798389988999973--8899--9999999998748874899815 No 211 >PRK01911 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=36.40 E-value=29 Score=14.75 Aligned_cols=41 Identities=10% Similarity=0.097 Sum_probs=19.9 Q ss_pred CEEEEECHHH--HH-CCCEEEEECCCCCE-EEECHHHHHHHHHHH Q ss_conf 8899987425--32-58189998898866-334299999999873 Q gi|254781004|r 604 PIIIICGDKE--AS-ERSIGIRRFGSTTT-QKVSLLDAIPILTKE 644 (652) Q Consensus 604 P~~ivIG~ke--~e-~~~Vtir~R~~~~~-~~i~lee~i~~l~~e 644 (652) ++.+.++-+. ++ ...|.|+.....-+ ...+-.+|.+.|+++ T Consensus 237 ~~~l~~DG~~~~l~~~d~i~i~ks~~~~~li~~~~~~f~~~Lr~K 281 (290) T PRK01911 237 NFLVSLDGRSETVPNDTELTIKKADYTIKVVKRFNHTFYKTLRNK 281 (290) T ss_pred CEEEEECCCCEECCCCCEEEEEECCCEEEEEECCCCCHHHHHHHH T ss_conf 479998898206199999999989976899976999989998631 No 212 >pfam10254 Pacs-1 PACS-1 cytosolic sorting protein. PACS-1 is a cytosolic sorting protein that directs the localisation of membrane proteins in the trans-Golgi network (TGN)/endosomal system. PACS-1 connects the clathrin adaptor AP-1 to acidic cluster sorting motifs contained in the cytoplasmic domain of cargo proteins such as furin, the cation-independent mannose-6-phosphate receptor and in viral proteins such as human immunodeficiency virus type 1 Nef. Probab=35.85 E-value=30 Score=14.70 Aligned_cols=18 Identities=28% Similarity=0.324 Sum_probs=10.8 Q ss_pred CCCCEEEEECCCEEEEEE Q ss_conf 467416731797213799 Q gi|254781004|r 505 RFNAFYVNSHSEKCHPVM 522 (652) Q Consensus 505 ~f~l~y~~~dg~~~~pvm 522 (652) .|.++|+.++-+++.|+| T Consensus 324 tlsmt~vtKEKkkK~~im 341 (413) T pfam10254 324 TMSMTVVTKEKNKKVPVM 341 (413) T ss_pred EEEEEEEEHHHCCCCCEE T ss_conf 269997323224454310 No 213 >PRK01185 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=35.60 E-value=30 Score=14.67 Aligned_cols=101 Identities=14% Similarity=0.073 Sum_probs=46.2 Q ss_pred ECCCHHHHHHHHHHHHHHCC-CCCCCCCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCC Q ss_conf 03410378999999997387-08873378089998579668999999999999879889997589663676665543489 Q gi|254781004|r 525 RAVFGSIERFIGIMIENFKG-NLPLWLSPIQAIVTTITSSAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKI 603 (652) Q Consensus 525 r~~~Gs~eR~ia~liE~~~g-~~P~wLAP~QV~Iipi~e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~gi 603 (652) -++-||. |-.+ .-|| ...+.+. -+.|.||..- ....|..+=..+..+.-|+...+ . T Consensus 164 sTPTGST----AYsl-SaGGPIv~P~~~--~i~ltPI~Ph-------------~l~~RPlVl~~~s~I~i~v~~~~--~- 220 (272) T PRK01185 164 ATPIGSS----SYSS-SAGGPLLLPTLN--GMVISYLAPY-------------SSRSKPVVVSSKSTVEIKIAGRD--Q- 220 (272) T ss_pred ECCCCHH----HHHH-HCCCCCCCCCCC--EEEEECCCCC-------------CCCCCCEEECCCCEEEEEECCCC--C- T ss_conf 6788437----8897-669942279988--4999816777-------------67899889899986999991589--8- Q ss_pred CEEEEEC-HHH--HH-CCCEEEEECCCCCE-EEECHHHHHHHHHHHCCCHHH Q ss_conf 8899987-425--32-58189998898866-334299999999873143562 Q gi|254781004|r 604 PIIIICG-DKE--AS-ERSIGIRRFGSTTT-QKVSLLDAIPILTKESLPPDC 650 (652) Q Consensus 604 P~~ivIG-~ke--~e-~~~Vtir~R~~~~~-~~i~lee~i~~l~~e~~~~d~ 650 (652) +..++++ ... ++ ...|+|+.-....+ ...+ ++|.+.|+++ +.+|+ T Consensus 221 ~~~l~~DG~~~~~l~~~d~I~I~~s~~~~~li~~~-~~fy~~Lr~K-Ll~~~ 270 (272) T PRK01185 221 RSLLILDGQIEYKLSSGDTVNISVSENGARFISFR-ESFYDRLRDK-VIKHV 270 (272) T ss_pred CEEEEECCCCCEECCCCCEEEEEECCCEEEEEECC-CCHHHHHHHH-HHHHH T ss_conf 67999918851575999899999889758998379-8889999999-76652 No 214 >pfam11543 UN_NPL4 Nuclear pore localisation protein NPL4. Npl4 is part of the heterodimer UN along with Ufd1 which is involved in the recruitment of p97, an AAA ATPase, for tasks involving the ubiquitin pathway. Npl4 has a ubiquitin-like domain which has within its structure a beta-grasp fold with a helical insert. Probab=35.49 E-value=30 Score=14.66 Aligned_cols=35 Identities=23% Similarity=0.423 Sum_probs=25.7 Q ss_pred CCCCEEEEC--CCCCEE-ECCCCCCHHHHHHHHCCCCC Q ss_conf 988708992--899887-41898889999986361101 Q gi|254781004|r 1 MPSDIKLTF--PDGSIK-NFPVHATGSDVAESISRSLA 35 (652) Q Consensus 1 M~~~i~I~l--pDG~~~-~~~~g~t~~dia~~i~~~~~ 35 (652) |..+|-|.. |||+.| +++...|+.++-+.+.+.+. T Consensus 1 m~~~iiiRVqS~dGtkRIev~~~~t~~~l~~kV~~~f~ 38 (80) T pfam11543 1 MHEEIIIRVQSPDGTKRIEISSTSTLSTLLSKVAEELG 38 (80) T ss_pred CCCCEEEEEECCCCCEEEEECCCCHHHHHHHHHHHHHC T ss_conf 98518999988898467760870129999999999968 No 215 >TIGR02943 Sig70_famx1 RNA polymerase sigma-70 factor, TIGR02943 family; InterPro: IPR014289 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , . This group of sigma factors are part of the larger sigma-70 family. From homology, tree building, bidirectional best hits and their one-to-a-genome distribution, they appear to represent a conserved family.. Probab=35.44 E-value=21 Score=15.76 Aligned_cols=82 Identities=24% Similarity=0.385 Sum_probs=43.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCC------EEEECCC--C-CCEEEEECCHHHHHHHHHHHHHHHHHC-C-------CE Q ss_conf 9999999999753200887541144------0442267--8-743565010789999999999988768-9-------63 Q gi|254781004|r 230 ELTQYLYFLEESEKRDHRKLAREMD------LFHIAED--G-SGVIFWHRKGWKIFQTLISYMRRKIKD-D-------YE 292 (652) Q Consensus 230 ~l~~~~~~~eea~~rdHr~lg~~~~------lf~~~~~--~-~G~~~wlP~G~~i~~~ie~~ir~~~~~-G-------~~ 292 (652) ++--+|=+.+.+..|||+-+..|.= -+.=.+. | +-+=.|+ =.+++|||.+.+|...+. - +. T Consensus 10 ~~Rr~LLrFA~lQLRdh~d~AEDaVQEtLl~Al~~~~~F~G~s~lkTWl--~~ILknKiID~LR~~~R~~~v~laT~~d~ 87 (194) T TIGR02943 10 DLRRDLLRFARLQLRDHEDLAEDAVQETLLAALEHADSFAGRSALKTWL--FAILKNKIIDVLRAKGREVKVALATDLDD 87 (194) T ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH--HHHHHHHHHHHHHCCCCEEEEEEECCHHH T ss_conf 9879999998623888267888999999999873577601489999999--99999999999834675454332010220 Q ss_pred EEECCCCCCCCCCCCCCCCCC Q ss_conf 855543244310001121001 Q gi|254781004|r 293 EINTPQVLDQHLWQQSGHWDS 313 (652) Q Consensus 293 eV~tP~l~~~~Lw~~SGh~~~ 313 (652) |..-=......|+..+|||+. T Consensus 88 e~~dE~d~~~~lF~~~GHW~~ 108 (194) T TIGR02943 88 ELDDEADEFDALFTQNGHWAE 108 (194) T ss_pred HCCCCHHHHHHCCCCCCCCCC T ss_conf 126710232431685766144 No 216 >TIGR02037 degP_htrA_DO protease Do; InterPro: IPR011782 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This family consists serine peptidases belonging to MEROPS peptidase family S1, subfamily S1C (protease Do, clan PA(S)). They are variously designated DegP, DegQ, heat shock protein HtrA, MucD and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in Escherichia coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens . The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures .; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis. Probab=35.15 E-value=21 Score=15.77 Aligned_cols=22 Identities=41% Similarity=0.361 Sum_probs=9.7 Q ss_pred CCEECCCCCC---CCCCCCHHHHHH Q ss_conf 2100014653---112586035778 Q gi|254781004|r 422 KIMVKLSTRP---EKRVGSDALWDD 443 (652) Q Consensus 422 ~~~~~ls~~~---~~~~g~~~~~~~ 443 (652) .+.+.|+.+- .+-+|.|+..|. T Consensus 116 ~I~V~L~DgrefkAklvG~D~~~D~ 140 (484) T TIGR02037 116 EITVTLSDGREFKAKLVGKDPRTDI 140 (484) T ss_pred EEEEEECCCCEEEEEEECCCCCEEE T ss_conf 7999945994855688666772138 No 217 >KOG3063 consensus Probab=35.06 E-value=25 Score=15.25 Aligned_cols=88 Identities=14% Similarity=0.181 Sum_probs=45.9 Q ss_pred CCCEEEECCCCCEEEC-CCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECCEEEC-CC-EEEEECCCCHHHHHHHHHHHH Q ss_conf 8870899289988741-89888999998636110112899998988997745303-67-079862899999999999899 Q gi|254781004|r 2 PSDIKLTFPDGSIKNF-PVHATGSDVAESISRSLAKKAVAVAINGKVCDLSDPVR-EG-SIEIITPEDPRSLAVIRHSCA 78 (652) Q Consensus 2 ~~~i~I~lpDG~~~~~-~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~~~l~-d~-~i~~i~~~~~eg~~i~~hS~a 78 (652) |.+|.|.|.|+..+.. +..+--...-+ ..--+....|.++||=.+++ +..++ ++ .|+|+.-=+ ..|-|+.. T Consensus 11 ~~di~i~~~~~e~Rk~v~~k~e~g~~e~-~~lf~dgEtv~G~V~l~lk~-gkkleH~GikiefiGqIe----~~~drgn~ 84 (301) T KOG3063 11 SIDIEILFDNEESRKQVDMKTEDGKKEK-HPLFYDGETVSGKVNLRLKD-GKKLEHQGIKIEFIGQIE----MYYDRGNF 84 (301) T ss_pred CEEEEEEECCCHHHEECCCCCCCCCEEE-CCEEECCCEEEEEEEEEECC-CCCCCCCCEEEEEEEEEE----EEECCCCH T ss_conf 7059999768223111001145786032-30573687530189999747-861112764899998799----87247868 Q ss_pred HHHHHHHHHHCCCCEEE Q ss_conf 99999999977998899 Q gi|254781004|r 79 HIMAEAVQSIWPEVQVA 95 (652) Q Consensus 79 hlL~~Av~~l~p~~kl~ 95 (652) |=+..-+++|-+...++ T Consensus 85 ~eF~~lv~eLa~pGel~ 101 (301) T KOG3063 85 HEFTSLVRELARPGELT 101 (301) T ss_pred HHHHHHHHHHCCCCCEE T ss_conf 89889898636863301 No 218 >KOG3851 consensus Probab=34.49 E-value=26 Score=15.07 Aligned_cols=53 Identities=25% Similarity=0.304 Sum_probs=42.9 Q ss_pred HHHHHHHHHHHHCCCCCCCCCEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2233333321110157756420689999983798899999999997532008875 Q gi|254781004|r 195 KKFFKLMKVAGAYWRGDSNRPMLSRIYGTAWNTQQELTQYLYFLEESEKRDHRKL 249 (652) Q Consensus 195 k~~fkL~~~sgayw~gd~~~~~l~Riyg~af~~~~~l~~~~~~~eea~~rdHr~l 249 (652) +|-=|+.-+|-+|||---.+...-=||.|+-|+-=-.+.|.+.+++.-+ ||.| T Consensus 199 GAPQKi~yise~y~Rk~gvRd~a~iiy~Tsl~~iFgVk~Y~~AL~k~~~--~rni 251 (446) T KOG3851 199 GAPQKIMYISESYFRKRGVRDNANIIYNTSLPTIFGVKHYADALEKVIQ--ERNI 251 (446) T ss_pred CCCHHHHHHHHHHHHHHCCCCCCCEEEECCCCCEECHHHHHHHHHHHHH--HCCE T ss_conf 8853311235999987275556527985176501037889999999987--5453 No 219 >PRK13255 thiopurine S-methyltransferase; Reviewed Probab=33.97 E-value=32 Score=14.50 Aligned_cols=31 Identities=16% Similarity=0.136 Sum_probs=11.4 Q ss_pred HHHHHHCCCCEEEEECCHHHHHHHHHHCCCH Q ss_conf 9999966998048532767854454416711 Q gi|254781004|r 126 MQEIIARDSPFLKQYLSHEKARELFESKKEA 156 (652) Q Consensus 126 M~~ii~~~~pi~r~~vs~~eA~~~F~~~~~~ 156 (652) |.=+.++.+.+.-.++|..-..++|++++.+ T Consensus 52 m~wLa~~G~~VvGvEls~~Av~~ff~e~~l~ 82 (218) T PRK13255 52 MLWLAEQGHEVLGVELSELAVEQFFAENGLT 82 (218) T ss_pred HHHHHHCCCEEEEEECHHHHHHHHHHHCCCC T ss_conf 9999848972699835299999999973898 No 220 >cd01817 RGS12_RBD RGS12 (regulator of G signalling 12), and RGS14, are members of a family of GTPase-activating proteins (GAP's) specific for the G-alpha subunit, which act as key inhibitors of G-protein-mediated cell responses in eukaryotes. Their domain architecture includes tandem RBD domains as well as PDZ , PTB, and RGS, and GoLoco domains. Probab=33.58 E-value=32 Score=14.45 Aligned_cols=56 Identities=16% Similarity=0.240 Sum_probs=32.4 Q ss_pred EEECCCCCEEE--CCCCCCHHHHHHHHCCC--CCCCEEEEEE--CCEEEECCEE---ECCCEEEE Q ss_conf 89928998874--18988899999863611--0112899998--9889977453---03670798 Q gi|254781004|r 6 KLTFPDGSIKN--FPVHATGSDVAESISRS--LAKKAVAVAI--NGKVCDLSDP---VREGSIEI 61 (652) Q Consensus 6 ~I~lpDG~~~~--~~~g~t~~dia~~i~~~--~~~~~vaa~v--ng~l~dL~~~---l~d~~i~~ 61 (652) .|+||||+.-. ...|.|+.|+...+-+. +.-+++-..+ |++.-+|+.+ +.+.+|++ T Consensus 3 ~V~LpDGsttvv~~rpg~tIr~~l~~L~eKRg~~~~a~dvfl~g~~Kpl~L~qdss~L~~~evrl 67 (73) T cd01817 3 RVILPDGSTTVVPTRPGESIRDLLSGLCEKRGINYAAVDLFLVGGDKPLVLDQDSSVLAGQEVRL 67 (73) T ss_pred EEECCCCCEEEEEECCCCCHHHHHHHHHHHCCCCCCCEEEEEECCCCCEECCCCCEEECCCEEEE T ss_conf 79988997688982799889999998998639982036999806884044489703533847888 No 221 >KOG3309 consensus Probab=33.37 E-value=32 Score=14.43 Aligned_cols=48 Identities=17% Similarity=0.146 Sum_probs=27.2 Q ss_pred CCCCEEEEECCCEEEEEEEEECCCHHHHHHHHHHHHHH---CCCCCCCCCCEEEEEE Q ss_conf 46741673179721379964034103789999999973---8708873378089998 Q gi|254781004|r 505 RFNAFYVNSHSEKCHPVMIHRAVFGSIERFIGIMIENF---KGNLPLWLSPIQAIVT 558 (652) Q Consensus 505 ~f~l~y~~~dg~~~~pvmihr~~~Gs~eR~ia~liE~~---~g~~P~wLAP~QV~Ii 558 (652) ..+++|++.||++.... |..|. -++-++-++. .|+.--.||=--..|| T Consensus 43 ~i~Itfv~~dG~~~~i~----g~vGd--tlLd~ah~n~idleGACEgslACSTCHVi 93 (159) T KOG3309 43 DIKITFVDPDGEEIKIK----GKVGD--TLLDAAHENNLDLEGACEGSLACSTCHVI 93 (159) T ss_pred EEEEEEECCCCCEEEEE----EECCH--HHHHHHHHCCCCCCCCCCCCCCCCCEEEE T ss_conf 37899998999788755----20443--79999987498765633562101310899 No 222 >PRK02649 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=32.74 E-value=33 Score=14.36 Aligned_cols=41 Identities=12% Similarity=0.193 Sum_probs=20.9 Q ss_pred CEEEEECHHH---HH-CCCEEEEECCCCCE-EEECHHHHHHHHHHH Q ss_conf 8899987425---32-58189998898866-334299999999873 Q gi|254781004|r 604 PIIIICGDKE---AS-ERSIGIRRFGSTTT-QKVSLLDAIPILTKE 644 (652) Q Consensus 604 P~~ivIG~ke---~e-~~~Vtir~R~~~~~-~~i~lee~i~~l~~e 644 (652) +..++++-++ ++ ...|.|+.-...-+ ...+-.+|.+.|+++ T Consensus 243 ~~~l~~DGq~~~~l~~gd~i~i~~s~~~~~li~~~~~~ff~~Lr~K 288 (305) T PRK02649 243 RLVMVVDGNAGCYVWPEDRVLIRRSPYPVRFIRLQDPEFFRVLREK 288 (305) T ss_pred CEEEEECCCCCEECCCCCEEEEEECCCEEEEEECCCCCHHHHHHHH T ss_conf 6799986997457499999999988973899973999989999876 No 223 >PHA01622 CRISPR-associated Cas4-like protein Probab=32.57 E-value=33 Score=14.34 Aligned_cols=146 Identities=17% Similarity=0.170 Sum_probs=84.7 Q ss_pred EEEECCHHH---HHHHHHHHHHHHHHHHHHHHHHCCEEEECCCCCCEEEEECCHHHHHHHHHHHHHHHHHCCCEEEECCC Q ss_conf 998379889---99999999975320088754114404422678743565010789999999999988768963855543 Q gi|254781004|r 222 GTAWNTQQE---LTQYLYFLEESEKRDHRKLAREMDLFHIAEDGSGVIFWHRKGWKIFQTLISYMRRKIKDDYEEINTPQ 298 (652) Q Consensus 222 g~af~~~~~---l~~~~~~~eea~~rdHr~lg~~~~lf~~~~~~~G~~~wlP~G~~i~~~ie~~ir~~~~~G~~eV~tP~ 298 (652) ||-|||.-- -+.||++.++-++- +|. -.+-.|..+...+|++.|++++. |+.-|. T Consensus 18 dTLFPSE~GICvRkSylaR~~~F~k~----~~~---------------e~~d~G~~~H~~veq~~re~lnC---etEv~I 75 (204) T PHA01622 18 DTLFPSEVGICYRKSYLARNIEFERG----INE---------------IYLDLGEQYHERIEQYFKEKLNC---QTEVEI 75 (204) T ss_pred CCCCCCCEEEEEEHHHHHHHHHHHHH----CCH---------------HHHCCCHHHHHHHHHHHHHHHCC---EEECCC T ss_conf 55373230577655577778999873----032---------------32201269999999999987374---573233 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHH---HHHHHCCCCHHHCCHHHHHHEEEECCCCCC Q ss_conf 24431000112100125554201001456777823774343125777---765201222231043200010001278764 Q gi|254781004|r 299 VLDQHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHV---AVFNHGLKSYRELPVRLAEFGSVYRNEPSG 375 (652) Q Consensus 299 l~~~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~---~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg 375 (652) -.+-+-.+.|||.|.--.|-. ...+..... .-.| ...|. .+|..-.. ..|++++.+=-+|=|-..+ T Consensus 76 s~ei~GI~IsGRiD~iC~Ndl----IEiKTt~~~--~~~p--~~~Hl~Qv~vY~~l~~---~q~~~~~~iYiVYi~R~~~ 144 (204) T PHA01622 76 KDEIEGIKISGRIDIVCNNDL----LEIKTISYN--YFQV--KEYHLYQVALYYHILK---KQNYQINNVYIVYLNRNTR 144 (204) T ss_pred CCCEEEEEEEEEEEEECCCCE----EEEEEEECC--CCCC--HHHHHHHHHHHHHHHC---CCCCCCCCEEEEEEECCCC T ss_conf 554015999725501025744----999963020--3770--5889999999999973---4786334279999954732 Q ss_pred CCEEEEECCCCCCCCCEEECCHHHHHHHHHHHHHHHHH Q ss_conf 52000105635334421010537678878999999887 Q gi|254781004|r 376 SLHGLMRVRGFTQDDAHVFCTKEQMFNECLKIHNLIIS 413 (652) Q Consensus 376 ~l~GL~R~ReF~~~Dah~F~~~eq~~~e~~~~~~~~~~ 413 (652) -++||..++ .+++..+.+..+++.. T Consensus 145 ------eVkef~ide-------svLet~~~~~~~~i~~ 169 (204) T PHA01622 145 ------EVKQFKIDE-------KVLETYYQKVIEWIKK 169 (204) T ss_pred ------HHHHEEECH-------HHHHHHHHHHHHHHHH T ss_conf ------557212049-------9999999999999999 No 224 >pfam06616 BsuBI_PstI_RE BsuBI/PstI restriction endonuclease C-terminus. This family represents the C-terminus of bacterial enzymes similar to type II restriction endonucleases BsuBI and PstI (EC:3.1.21.4). The enzymes of the BsuBI restriction/modification (R/M) system recognize the target sequence 5'CTGCAG and are functionally identical with those of the PstI R/M system. Probab=32.54 E-value=33 Score=14.34 Aligned_cols=18 Identities=22% Similarity=0.467 Sum_probs=7.3 Q ss_pred EEEEEEECCHHHHHHHHH Q ss_conf 999998379889999999 Q gi|254781004|r 219 RIYGTAWNTQQELTQYLY 236 (652) Q Consensus 219 Riyg~af~~~~~l~~~~~ 236 (652) .||.|||++++.++.|+. T Consensus 259 lvfVTAF~dR~~fkk~~~ 276 (306) T pfam06616 259 LVFVTAFPDRSAFKKYLA 276 (306) T ss_pred EEEEEECCCHHHHHHHHH T ss_conf 699995589899999988 No 225 >pfam04571 Lipin_N lipin, N-terminal conserved region. Mutations in the lipin gene lead to fatty liver dystrophy in mice. The protein has been shown to be phosphorylated by the TOR Ser/Thr protein kinases in response to insulin stimulation. The conserved region is found at the N-terminus of the member proteins. Probab=32.46 E-value=33 Score=14.33 Aligned_cols=61 Identities=21% Similarity=0.340 Sum_probs=41.3 Q ss_pred CEEEECCCCCEEECCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECCEEEC-CCEEEEECCCCH Q ss_conf 7089928998874189888999998636110112899998988997745303-670798628999 Q gi|254781004|r 4 DIKLTFPDGSIKNFPVHATGSDVAESISRSLAKKAVAVAINGKVCDLSDPVR-EGSIEIITPEDP 67 (652) Q Consensus 4 ~i~I~lpDG~~~~~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~~~l~-d~~i~~i~~~~~ 67 (652) -|-|.=|||+-+.-|-.+....+ +-+. -...+|-..|||+..|+.-.|. .++.-|+...++ T Consensus 30 VIVVeq~DGsl~~SPfhVRFGK~-~vl~--~~eK~V~i~VNg~~~~~~MkLg~~GEAfFv~e~~~ 91 (111) T pfam04571 30 VIVVEQPDGSLRCSPFHVRFGKF-GVLK--SSEKVVDIEVNGVEVDFHMKLGDSGEAFFVEETED 91 (111) T ss_pred EEEEECCCCCEECCCCEEEEEEE-EEEC--CCCCEEEEEECCEECCCEEEECCCCEEEEEEECCC T ss_conf 99997489988606537996236-7512--66768999999998265488689943999997048 No 226 >PRK03378 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=32.28 E-value=34 Score=14.31 Aligned_cols=96 Identities=15% Similarity=0.116 Sum_probs=43.3 Q ss_pred ECCCHHHHHHHHHHHHHHCC-CCCCCCCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCC Q ss_conf 03410378999999997387-08873378089998579668999999999999879889997589663676665543489 Q gi|254781004|r 525 RAVFGSIERFIGIMIENFKG-NLPLWLSPIQAIVTTITSSAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKI 603 (652) Q Consensus 525 r~~~Gs~eR~ia~liE~~~g-~~P~wLAP~QV~Iipi~e~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~gi 603 (652) -++.||. |-.+ .-|| ...+.+. -+.|.||..-. ...|..+=..+..+--++.+. .- T Consensus 182 sTPTGST----AYsl-SAGGPIv~P~~~--~~~ltPI~PHs-------------Ls~RPlVl~~~~~I~i~v~~~---~~ 238 (292) T PRK03378 182 STPTGST----AYSL-SAGGPILTPSLD--AITLVPMFPHT-------------LSARPLVINSSSTIRLRFSHR---RS 238 (292) T ss_pred ECCCCHH----HHHH-HCCCCCCCCCCC--CEEEEECCCCC-------------CCCCCEEECCCCEEEEEECCC---CC T ss_conf 6687457----7675-269963069987--27998257775-------------678988989998799998369---98 Q ss_pred CEEEEECHHH---HHC-CCEEEEECCCCCEEEEC--HHHHHHHHHHH Q ss_conf 8899987425---325-81899988988663342--99999999873 Q gi|254781004|r 604 PIIIICGDKE---ASE-RSIGIRRFGSTTTQKVS--LLDAIPILTKE 644 (652) Q Consensus 604 P~~ivIG~ke---~e~-~~Vtir~R~~~~~~~i~--lee~i~~l~~e 644 (652) +..+.+.-+. ++. ..|.|+.-... ...+. -.+|.+.|+++ T Consensus 239 ~~~vs~DGq~~~~l~~gd~I~I~~s~~~-~~li~~~~~~ff~~Lr~K 284 (292) T PRK03378 239 DLEISCDSQIALPIQEGEEVLIRRSDYH-LNLIHPKDYSYFNTLSTK 284 (292) T ss_pred CEEEEECCCCCEECCCCCEEEEEECCCE-EEEEECCCCCHHHHHHHH T ss_conf 5799983997436399999999989965-899961999989999755 No 227 >COG1977 MoaD Molybdopterin converting factor, small subunit [Coenzyme metabolism] Probab=32.21 E-value=34 Score=14.30 Aligned_cols=47 Identities=23% Similarity=0.383 Sum_probs=32.0 Q ss_pred CCCCHHHHHHHHCCC-------CC-CCEEEEEECCEEEECCEEECC-CEEEEECCC Q ss_conf 988899999863611-------01-128999989889977453036-707986289 Q gi|254781004|r 19 VHATGSDVAESISRS-------LA-KKAVAVAINGKVCDLSDPVRE-GSIEIITPE 65 (652) Q Consensus 19 ~g~t~~dia~~i~~~-------~~-~~~vaa~vng~l~dL~~~l~d-~~i~~i~~~ 65 (652) .+.|+.++.+.+.+. +. ...+.+.+|..+.+++++|.| ++|.|+..- T Consensus 26 ~~~tv~~L~~~l~~~~~~~~~~~~~~~~v~~~~~~~~~~~~t~L~dGDeVa~~PPV 81 (84) T COG1977 26 VGATVGELEELLPKEGERWLLALEDNIVVNAANNEFLVGLDTPLKDGDEVAFFPPV 81 (84) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCEEECCCCCCCCCCCEEEEECCC T ss_conf 77899999999887612677624666379985034764777526899999994898 No 228 >cd01202 FRS2 Fibroblast growth factor receptor substrate 2 (FRS2/SNT1) Phosphotyrosine-binding domain (IRS1-like). FRS2 mediates signaling downstream of the FGF receptor. It has an N-terminal PTBi domain, which has a PH-like fold and is similiar to the PTB domain that is found in insulin receptor substrate molecules. This PTBi domain is shorter than the PTB domain which is found in SHC, Numb and other proteins. The PTBi domain binds to phosphotyrosines which are in NPXpY motifs. Probab=32.19 E-value=34 Score=14.30 Aligned_cols=43 Identities=23% Similarity=0.458 Sum_probs=32.5 Q ss_pred HHHHHHHHHHCCEEEECC-----CCCCEEEE-ECCHHHHHHHHHHHHHH Q ss_conf 200887541144044226-----78743565-01078999999999998 Q gi|254781004|r 243 KRDHRKLAREMDLFHIAE-----DGSGVIFW-HRKGWKIFQTLISYMRR 285 (652) Q Consensus 243 ~rdHr~lg~~~~lf~~~~-----~~~G~~~w-lP~G~~i~~~ie~~ir~ 285 (652) -+.-|+-|.+-.+|+|.. .|+|++.+ .+.|-.|++.|+.+|.+ T Consensus 46 ~~~LRRYG~~~~~FsFEaGRrC~tGeGif~F~~~~~~eif~~vq~~iq~ 94 (102) T cd01202 46 LLCLRRYGYNSDLFSFESGRRCQTGEGIFAFRCKRAEELFNLLQSYIQE 94 (102) T ss_pred HHHHHHHCCCCCEEEEECCCCCCCCCCEEEEECCCHHHHHHHHHHHHHH T ss_conf 7885641647988999715868999978999739999999999999986 No 229 >TIGR02224 recomb_XerC tyrosine recombinase XerC; InterPro: IPR011931 The phage integrase family describes a number of recombinases with tyrosine active sites that transiently bind covalently to DNA. Many are associated with mobile DNA elements, including phage, transposons, and phase variation loci. This entry represents XerC, which is closely related to the other chromosomal protein XerD (IPR011932 from INTERPRO). XerC and XerD are site-specific recombinases which help resolve chromosome dimers to monomers for cell division after DNA replication. XerC and XerD interact with cell division protein FtsK .; GO: 0003677 DNA binding, 0006310 DNA recombination, 0007049 cell cycle, 0007059 chromosome segregation, 0015074 DNA integration, 0051301 cell division. Probab=32.09 E-value=24 Score=15.27 Aligned_cols=15 Identities=20% Similarity=0.116 Sum_probs=7.0 Q ss_pred CCEEEEECCCEEEEE Q ss_conf 741673179721379 Q gi|254781004|r 507 NAFYVNSHSEKCHPV 521 (652) Q Consensus 507 ~l~y~~~dg~~~~pv 521 (652) +.-.+-.-|.|+|.| T Consensus 175 ~~vrV~GKG~KeR~V 189 (313) T TIGR02224 175 GLVRVRGKGNKERIV 189 (313) T ss_pred CEEEEEECCCCEEEC T ss_conf 767774148813530 No 230 >cd00727 malate_synt_A Malate synthase A (MSA), present in some bacteria, plants and fungi. Prokaryotic MSAs tend to be monomeric, whereas eukaryotic enzymes are homomultimers. In general, malate synthase catalyzes the Claisen condensation of glyoxylate and acetyl-CoA to malyl-CoA, which hydrolyzes to malate and CoA. This reaction is part of the glyoxylate cycle, which allows certain organisms, like plants and fungi, to derive their carbon requirements from two-carbon compounds, by bypassing the two carboxylation steps of the citric acid cycle. Probab=31.99 E-value=33 Score=14.41 Aligned_cols=191 Identities=16% Similarity=0.328 Sum_probs=97.1 Q ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCCHHHHHHHHHCCCCCCCEEECCCCCE Q ss_conf 88557884088888898999999999999966998048532767854454416711355665420345542130257520 Q gi|254781004|r 101 EDGFYYDFDKEQPFSSDELAQIEKKMQEIIARDSPFLKQYLSHEKARELFESKKEAYKVEILESIPAKENVTLYRQGEWF 180 (652) Q Consensus 101 ~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA~~~F~~~~~~~K~~li~~~~~~~~v~~y~~g~~~ 180 (652) .+||-.||+...+.|=+.+-.-...+...+...+.|+.. .|-.|++ .++..+-+.| T Consensus 88 A~~~maDfEDs~sPtW~N~i~gq~Nl~dA~~~~i~f~~~-------------~gk~y~l------~~~~a~L~vR----- 143 (511) T cd00727 88 AKVFMADFEDANAPTWENQVEGQINLRDAVRGTISFTSP-------------EGKEYKL------NDTPATLIVR----- 143 (511) T ss_pred CCEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCC-------------CCCEEEE------CCCCCEEEEE----- T ss_conf 868984366678996688999999999998187765389-------------9964300------7988768861----- Q ss_pred EECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECC Q ss_conf 10002355214442223333332111015775642068999998379889999999999753200887541144044226 Q gi|254781004|r 181 DLCRGPHVRSTGQVKKFFKLMKVAGAYWRGDSNRPMLSRIYGTAWNTQQELTQYLYFLEESEKRDHRKLAREMDLFHIAE 260 (652) Q Consensus 181 Dlc~GphvpsTg~ik~~fkL~~~sgayw~gd~~~~~l~Riyg~af~~~~~l~~~~~~~eea~~rdHr~lg~~~~lf~~~~ 260 (652) .||=|++--... +.=. .|.|.-| +..||+||. T Consensus 144 --pRG~HL~E~hv~---vdG~-----------------p~~g~Lf--------------------------Dfgly~fhN 175 (511) T cd00727 144 --PRGWHLPEKHVL---VDGE-----------------PVSGSLF--------------------------DFGLYFFHN 175 (511) T ss_pred --CCCCCCCCCCEE---ECCE-----------------ECCEEHH--------------------------HHHHHHHHH T ss_conf --886686877755---8785-----------------1512177--------------------------668999820 Q ss_pred ------CCCCEEEEECC-----HHHHHHHHHHHHHHHH--HCCCEEEECCC-----C--CCCCCCCC--------CCCCC Q ss_conf ------78743565010-----7899999999999887--68963855543-----2--44310001--------12100 Q gi|254781004|r 261 ------DGSGVIFWHRK-----GWKIFQTLISYMRRKI--KDDYEEINTPQ-----V--LDQHLWQQ--------SGHWD 312 (652) Q Consensus 261 ------~~~G~~~wlP~-----G~~i~~~ie~~ir~~~--~~G~~eV~tP~-----l--~~~~Lw~~--------SGh~~ 312 (652) .|+|-|||||+ -+++|+.+-.+..+.+ ..|...+..-. . +..=||+. -|||| T Consensus 176 a~~l~~~g~GpyfYLPKlEs~~EArlWndvf~~ae~~lglp~GtIkatvLIETi~AaFemeEILyeLR~h~~GLN~GRWD 255 (511) T cd00727 176 AKALLARGSGPYFYLPKMESHLEARLWNDVFVFAQDYLGLPRGTIKATVLIETLPAAFEMDEILYELRDHSAGLNCGRWD 255 (511) T ss_pred HHHHHHCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH T ss_conf 99998469987584314577899999999999999974888883578863755577772799999999876422464068 Q ss_pred CCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEE Q ss_conf 1255542010014567778237743431257777652012222310432000100012787645200 Q gi|254781004|r 313 SYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHG 379 (652) Q Consensus 313 ~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~G 379 (652) .- |.... .-++..+++|-- .....|-..-.+||-+|=++ +|||--... ..| T Consensus 256 Yi----fS~Ik---~~~~~~~~vlPD---R~~vtM~~pFm~aY~~lLv~-----tCHkRGa~A-iGG 306 (511) T cd00727 256 YI----FSFIK---KFRNHPDFVLPD---RAQVTMTVPFMRAYSELLIK-----TCHRRGAHA-MGG 306 (511) T ss_pred HH----HHHHH---HHCCCCCCCCCC---HHHCCCCCHHHHHHHHHHHH-----HHCCCCCCC-CCC T ss_conf 89----99999---966487671797---66556785999999999999-----970558750-013 No 231 >pfam08428 Rib Rib/alpha-like repeat. The region featured in this family is found repeated in a number of bacterial surface proteins, such as Rib and alpha. These are expressed by group B streptococci, and Rib is thought to confer protective immunity. Probab=31.67 E-value=26 Score=15.10 Aligned_cols=14 Identities=43% Similarity=0.926 Sum_probs=5.6 Q ss_pred EEEECCCCCEEECC Q ss_conf 08992899887418 Q gi|254781004|r 5 IKLTFPDGSIKNFP 18 (652) Q Consensus 5 i~I~lpDG~~~~~~ 18 (652) +.||+||||.-+++ T Consensus 45 V~VTYPDGS~d~V~ 58 (65) T pfam08428 45 VVVTYPDGSKDEVP 58 (65) T ss_pred EEEECCCCCCEEEE T ss_conf 99985899810888 No 232 >COG4829 CatC1 Muconolactone delta-isomerase [Secondary metabolites biosynthesis, transport, and catabolism] Probab=30.91 E-value=28 Score=14.85 Aligned_cols=39 Identities=23% Similarity=0.316 Sum_probs=28.7 Q ss_pred CCCCCCCCCCCCCCCCCCEEECCCCCCCCCCEEEECCCCHHHHHHHHHHCCCCHHHC Q ss_conf 310001121001255542010014567778237743431257777652012222310 Q gi|254781004|r 302 QHLWQQSGHWDSYRANMFAVKCADDTIKDLRTFALKPMNCPGHVAVFNHGLKSYREL 358 (652) Q Consensus 302 ~~Lw~~SGh~~~y~~~mf~~~~~~~~~~~~~~~~LkPmncp~h~~i~~~~~~SYrdL 358 (652) ..||..-|||.+|+- | .+ |-|.+.|.++-+--+.||++- T Consensus 41 ~~lWR~~Geyanysl--F---------------d~-dd~~eLh~~L~~~P~f~ym~~ 79 (98) T COG4829 41 LRLWRRPGEYANYSL--F---------------DA-DDNGELHQLLASMPPFSYMTD 79 (98) T ss_pred HHHHHCCCCCCCEEE--E---------------CC-CCHHHHHHHHHCCCCCCCCCC T ss_conf 999861431023112--3---------------07-861789999962998632244 No 233 >PRK00103 SPOUT methyltransferase superfamily protein; Provisional Probab=30.76 E-value=36 Score=14.14 Aligned_cols=84 Identities=10% Similarity=0.187 Sum_probs=57.5 Q ss_pred CCHH-HHHHHHHHHHHHCCCCCCCCCCEEEEEEEC-C--H------HHHHHHHHHHHHHHHCCCEEEEECCCCC-----C Q ss_conf 4103-789999999973870887337808999857-9--6------6899999999999987988999758966-----3 Q gi|254781004|r 527 VFGS-IERFIGIMIENFKGNLPLWLSPIQAIVTTI-T--S------SAVEYAQEIANLLKSHHLSIETDFRNET-----I 591 (652) Q Consensus 527 ~~Gs-~eR~ia~liE~~~g~~P~wLAP~QV~Iipi-~--e------~~~eya~~i~~~L~~~girV~~Ddr~~~-----~ 591 (652) +.|- -..++-.+++.|.-+++.. +++++.-+|- + . ....-++.|.+.+....+-|.+|.++.. + T Consensus 7 ~iGK~~~~~~~~~i~~Y~kRl~~~-~~i~~~el~~~~~~~~~~~~~~~~~E~~~il~~~~~~~~~I~LDe~Gk~~sS~~f 85 (156) T PRK00103 7 AVGKLMPKWVKEGFAEYAKRLPRD-CKLELIEIPDEKRGKNADIERIKAKEGERILAAIPKGARVVALDIRGKPWTSEQF 85 (156) T ss_pred EEECCCCHHHHHHHHHHHHHHCCC-CCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCHHHH T ss_conf 970769679999999999971742-5885589478777675899999999999999847999989999179980576999 Q ss_pred CHHHHHHHHCCCCEEEEECH Q ss_conf 67666554348988999874 Q gi|254781004|r 592 NYKIREHSIKKIPIIIICGD 611 (652) Q Consensus 592 G~Kir~a~l~giP~~ivIG~ 611 (652) ..++.++...|-.+..+||. T Consensus 86 a~~l~~~~~~g~~i~FiIGG 105 (156) T PRK00103 86 AKELERWRLDGRDVTFVIGG 105 (156) T ss_pred HHHHHHHHHCCCCEEEEEEC T ss_conf 99999998639966999978 No 234 >TIGR02225 recomb_XerD tyrosine recombinase XerD; InterPro: IPR011932 Proteins in this entry are part of the wider so-called "phage" integrase fmaily which describes a number of recombinases with tyrosine active sites that transiently bind covalently to DNA. Many are associated with mobile DNA elements, including phage, transposons, and phase variation loci. This entry represents the chromosomally-encoded polypeptide XerD, which is closely related to, and interacts with, another chromosomally-encoded polypeptide, XerC (IPR011931 from INTERPRO). These proteins are site-specific tyrosine recombinases, which act by catalyzing the cutting and rejoining of the recombining DNA molecules . They form a cyclic heterotetrameric complex composed of two molecules of XerC and two molecules of XerD. The XerC-XerD complex is essential to convert dimers of the bacterial chromosome into monomers to permit their segregation at cell division. It also contributes to the segregational stability of plasmids. XerD is composed of two domains, an N-terminal alpha-helical domain, and a C-terminal domain composed mostly of alpha helices, with a three-stranded antiprallel beta sheet along one edge . The active site is found at the extreme C-terminal end of the polypeptide. Most of the C-terminal domain is structurally homologous to that of other "phage" integrases, but the active site regions of these enzymes are stucturally distinct.; GO: 0003677 DNA binding, 0006310 DNA recombination, 0007049 cell cycle, 0007059 chromosome segregation, 0015074 DNA integration, 0051301 cell division. Probab=30.75 E-value=24 Score=15.32 Aligned_cols=18 Identities=11% Similarity=0.135 Sum_probs=7.7 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 887899999988766552 Q gi|254781004|r 401 FNECLKIHNLIISIYKDF 418 (652) Q Consensus 401 ~~e~~~~~~~~~~v~~~~ 418 (652) ...+...+..+..+|+.+ T Consensus 62 ~~S~AR~lsalR~Fy~fL 79 (305) T TIGR02225 62 ARSIARALSALRSFYRFL 79 (305) T ss_pred CHHHHHHHHHHHHHHHHH T ss_conf 002889999999999999 No 235 >PRK05259 consensus Probab=30.29 E-value=29 Score=14.74 Aligned_cols=10 Identities=30% Similarity=0.355 Sum_probs=4.8 Q ss_pred HHHHHHHHHC Q ss_conf 9999999977 Q gi|254781004|r 80 IMAEAVQSIW 89 (652) Q Consensus 80 lL~~Av~~l~ 89 (652) +|+.|+++.- T Consensus 69 l~i~A~r~~~ 78 (310) T PRK05259 69 IMIDALRRSS 78 (310) T ss_pred HHHHHHHHCC T ss_conf 9999998738 No 236 >TIGR02393 RpoD_Cterm RNA polymerase sigma factor RpoD; InterPro: IPR012760 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , . This entry represents the C-terminal region of sigma 70 (RpoD) which contains the well-conserved regions 2, 3 and 4. Region 2 of sigma-70 is the most conserved region of the entire protein. The high conservation is due to region 2 containing both the -10 promoter recognition helix and the primary core RNA polymerase binding determinant. The core-binding helix, interacts with the clamp domain of the largest polymerase subunit, beta prime , . The aromatic residues of the recognition helix, found at the C terminus of this domain are thought to mediate strand separation, thereby allowing transcription initiation , . Region 3 forms a discrete compact three helical domain within the sigma-factor. Region is not normally involved in the recognition of promoter DNA, but in some specific bacterial promoters containing an extended -10 promoter element, residues within region 3 play an important role. Region 3 primarily is involved in binding the core RNA polymerase in the holoenzyme .Region 4 is involved in binding to the -35 promoter element via a helix-turn-helix motif .; GO: 0003677 DNA binding, 0016987 sigma factor activity, 0006352 transcription initiation, 0006355 regulation of transcription DNA-dependent. Probab=30.27 E-value=10 Score=17.90 Aligned_cols=74 Identities=15% Similarity=0.175 Sum_probs=52.1 Q ss_pred CCCCCCHHHHHHHHCCCCCCCEEEEEECCEEEECCEEE-CCCEEEEECC-CCHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 18988899999863611011289999898899774530-3670798628-99999999999899999999999779 Q gi|254781004|r 17 FPVHATGSDVAESISRSLAKKAVAVAINGKVCDLSDPV-REGSIEIITP-EDPRSLAVIRHSCAHIMAEAVQSIWP 90 (652) Q Consensus 17 ~~~g~t~~dia~~i~~~~~~~~vaa~vng~l~dL~~~l-~d~~i~~i~~-~~~eg~~i~~hS~ahlL~~Av~~l~p 90 (652) .-..-|..+||+.++-...|---..+++-+++.|.+|| +|.+=.|-|| .|++...-.-+....+|...+.+++. T Consensus 100 ~Gr~Pt~EelAe~~g~~~~Kv~~i~~~~~~PiSLe~piG~e~dS~l~DFi~D~~~~~P~~~~~~~~l~~~l~~vL~ 175 (240) T TIGR02393 100 LGREPTDEELAERMGLPVEKVREIKKIAQEPISLETPIGEEEDSSLGDFIEDESIESPEDAAAKELLREQLEEVLE 175 (240) T ss_pred HCCCCCHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 2889987789987089989999999862488656667788777744265257444685899999999999999986 No 237 >PRK09255 malate synthase; Validated Probab=29.86 E-value=12 Score=17.45 Aligned_cols=43 Identities=14% Similarity=0.374 Sum_probs=29.9 Q ss_pred CCEEEEC------CCCCCEEEEECC-----HHHHHHHHHHHHHHHH--HCCCEEEE Q ss_conf 4404422------678743565010-----7899999999999887--68963855 Q gi|254781004|r 253 MDLFHIA------EDGSGVIFWHRK-----GWKIFQTLISYMRRKI--KDDYEEIN 295 (652) Q Consensus 253 ~~lf~~~------~~~~G~~~wlP~-----G~~i~~~ie~~ir~~~--~~G~~eV~ 295 (652) ..||+|| +.|+|-|||||+ -+++||.+-.+..+.+ ..|...+. T Consensus 189 fgLy~fhNa~~l~~~g~GpyfYLPKlEs~~EArlWndvF~~ae~~lglp~GTIkaT 244 (531) T PRK09255 189 FALYFFHNAKELLAKGSGPYFYLPKLESHLEARLWNDVFVFAEDRLGLPRGTIKAT 244 (531) T ss_pred HHHHHHHHHHHHHHCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEE T ss_conf 99999853899983799865741234548899999999999999728998844788 No 238 >cd02958 UAS UAS family; UAS is a domain of unknown function. Most members of this family are uncharacterized proteins with similarity to FAS-associated factor 1 (FAF1) and ETEA because of the presence of a UAS domain N-terminal to a ubiquitin-associated UBX domain. FAF1 is a longer protein, compared to the other members of this family, having additional N-terminal domains, a ubiquitin-associated UBA domain and a nuclear targeting domain. FAF1 is an apoptotic signaling molecule that acts downstream in the Fas signal transduction pathway. It interacts with the cytoplasmic domain of Fas, but not to a Fas mutant that is deficient in signal transduction. ETEA is the protein product of a highly expressed gene in T-cells and eosinophils of atopic dermatitis patients. The presence of the ubiquitin-associated UBX domain in the proteins of this family suggests the possibility of their involvement in ubiquitination. Recently, FAF1 has been shown to interact with valosin-containing protein (VCP), Probab=29.76 E-value=37 Score=14.03 Aligned_cols=66 Identities=11% Similarity=0.207 Sum_probs=42.2 Q ss_pred HHHHHHHHHCCCEEEEECCCCCCCHHHHH-HHHCCCCEEEEECHHHHHCCCEEEEECCCCCEEEECHHHHHHHHHH Q ss_conf 99999999879889997589663676665-5434898899987425325818999889886633429999999987 Q gi|254781004|r 569 QEIANLLKSHHLSIETDFRNETINYKIRE-HSIKKIPIIIICGDKEASERSIGIRRFGSTTTQKVSLLDAIPILTK 643 (652) Q Consensus 569 ~~i~~~L~~~girV~~Ddr~~~~G~Kir~-a~l~giP~~ivIG~ke~e~~~Vtir~R~~~~~~~i~lee~i~~l~~ 643 (652) ++|.+-++++-+-..+|... .-|.++.. ....++|++++|+++ +|.+--|. .+.++.++|+..|.+ T Consensus 43 ~~V~~~i~~~fV~w~~d~~~-~eg~~~~~~y~~~~~P~i~iidp~---tge~v~~~-----~G~~~p~~~l~~L~~ 109 (114) T cd02958 43 ESVKEFIRENFIFWQCDIDS-SEGQRFLQSYKVDKYPHIAIIDPR---TGEVLKVW-----SGNITPEDLLSQLIE 109 (114) T ss_pred HHHHHHHHHHEEEEEEECCC-HHHHHHHHHCCCCCCCEEEEEECC---CCCEEEEE-----ECCCCHHHHHHHHHH T ss_conf 89999997116898631698-579999986699998979999789---99599898-----689998999999999 No 239 >PRK09601 translation-associated GTPase; Reviewed Probab=29.55 E-value=37 Score=14.01 Aligned_cols=16 Identities=19% Similarity=0.553 Sum_probs=9.4 Q ss_pred HHHHHHHHHHHHHHHC Q ss_conf 8999999999999966 Q gi|254781004|r 117 DELAQIEKKMQEIIAR 132 (652) Q Consensus 117 ~dl~~Ie~~M~~ii~~ 132 (652) .|+..||+++.++.++ T Consensus 135 ~Dl~~vek~~~rl~k~ 150 (364) T PRK09601 135 ADLESVEKRLERLEKK 150 (364) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 9999999999999755 No 240 >cd01770 p47_UBX p47_UBX p47 is an adaptor molecule of the cytosolic AAA ATPase p97. The principal role of the p97-p47 complex is to regulate membrane fusion events. Mono-ubiquitin recognition by p47 is crucial for p97-p47-mediated Golgi membrane fusion events. p47 has carboxy-terminal SEP and UBX domains. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins. Probab=29.36 E-value=38 Score=13.99 Aligned_cols=54 Identities=22% Similarity=0.254 Sum_probs=32.5 Q ss_pred CEEEECCCCCEE--ECCCCCCHHHHHHHHCC---CCC-CCEE--EEEECCEEEECCEEECCC Q ss_conf 708992899887--41898889999986361---101-1289--999898899774530367 Q gi|254781004|r 4 DIKLTFPDGSIK--NFPVHATGSDVAESISR---SLA-KKAV--AVAINGKVCDLSDPVREG 57 (652) Q Consensus 4 ~i~I~lpDG~~~--~~~~g~t~~dia~~i~~---~~~-~~~v--aa~vng~l~dL~~~l~d~ 57 (652) .|.|.|+||+.. .|...-|+.||-.-|.. ..+ ..-. .+.=+-.+.|.+..|+|+ T Consensus 6 ~IQIRL~dGsR~v~~FN~~~tV~dv~~fI~~~~p~~~~~~F~L~t~FP~k~l~d~~~Tl~eA 67 (79) T cd01770 6 SIQIRLADGKRLVQKFNSSHRVSDVRDFIVNARPEFAARPFTLMTAFPVKELSDESLTLKEA 67 (79) T ss_pred EEEEEECCCCEEEEEECCCCCHHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCCCCHHHC T ss_conf 89999189988999957978899999999983987789987886579997358787777885 No 241 >PRK05282 peptidase E; Validated Probab=28.95 E-value=38 Score=13.94 Aligned_cols=15 Identities=33% Similarity=0.652 Sum_probs=5.7 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 778999999999850 Q gi|254781004|r 441 WDDAENIMKGVLDTI 455 (652) Q Consensus 441 ~~~ae~~l~~~l~~~ 455 (652) |+..+....+++... T Consensus 46 ~d~Yt~~v~~af~~l 60 (233) T PRK05282 46 WDDYTAKVAEVLAPL 60 (233) T ss_pred HHHHHHHHHHHHHHC T ss_conf 899999999999866 No 242 >TIGR01381 E1_like_apg7 E1-like protein-activating enzyme Gsa7p/Apg7p; InterPro: IPR006285 This is a family of eukaryotic proteins found in animals, plants, and yeasts that includes Apg7p (YHR171W) from Saccharomyces cerevisiae and GSA7 from Pichia pastoris. Members are about 650 to 700 residues in length and include a central domain of about 150 residues shared with the ThiF/MoeB/HesA family of proteins. A low level of similarity to ubiquitin-activating enzyme E1 is described in a paper on peroxisome autophagy mediated by GSA7, and is the basis of the name ubiquitin activating enzyme E1-like protein. Members of the family appear to be involved in protein lipidation events analogous to ubiquitination and required for membrane fusion events during autophagy. . Probab=28.53 E-value=39 Score=13.89 Aligned_cols=37 Identities=16% Similarity=0.019 Sum_probs=16.8 Q ss_pred CCCCCCCEEEEEE--------EC---CHHHHHHHHHHHHHHHHCCCEE Q ss_conf 8873378089998--------57---9668999999999999879889 Q gi|254781004|r 546 LPLWLSPIQAIVT--------TI---TSSAVEYAQEIANLLKSHHLSI 582 (652) Q Consensus 546 ~P~wLAP~QV~Ii--------pi---~e~~~eya~~i~~~L~~~girV 582 (652) =|+=++|.|++=. |. .+.|.+=.+.|..+.++.|-.- T Consensus 597 tvLG~lPhQiRGfl~~F~~i~l~~~~~~~C~ACs~~v~~ey~~rgw~F 644 (689) T TIGR01381 597 TVLGILPHQIRGFLSRFEQIKLEAKRYDSCVACSEAVLAEYKERGWKF 644 (689) T ss_pred CCCCCCCCCHHHHCCCCEEECCCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 521467510110101220001011044642005799999987422468 No 243 >TIGR00439 ftsX putative protein insertion permease FtsX; InterPro: IPR004513 FtsX is an integral membrane protein encoded in the same operon as signal recognition particle docking protein FtsY and FtsE. FtsE is a hydrophilic nucleotide-binding protein that associates with the inner membrane by means of association with FtsX; FtsE mutants are viable only in high salt, supporting possible roles in cell division, as previously indicated, or in transport.; GO: 0007049 cell cycle, 0016021 integral to membrane. Probab=28.05 E-value=39 Score=13.83 Aligned_cols=68 Identities=22% Similarity=0.388 Sum_probs=34.0 Q ss_pred HHHHHHCCCCEEEEECCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEE-EEECCHHHHHHHHHH-CCCHHHHH Q ss_conf 9999977998899724358855788408888889899999999999996699804-853276785445441-67113556 Q gi|254781004|r 83 EAVQSIWPEVQVAIGPIIEDGFYYDFDKEQPFSSDELAQIEKKMQEIIARDSPFL-KQYLSHEKARELFES-KKEAYKVE 160 (652) Q Consensus 83 ~Av~~l~p~~kl~igp~i~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~~~~pi~-r~~vs~~eA~~~F~~-~~~~~K~~ 160 (652) .|.-+.||..++|| ...+.+++++=..+-.++++ +.-++ -..+||+|+++=|+. +|+-..++ T Consensus 61 ~a~t~~ypspqiTv------------Yl~k~l~~~~a~~Vv~~l~~----~kgve~~~y~sred~L~Ef~sWsgfg~~l~ 124 (314) T TIGR00439 61 SALTQLYPSPQITV------------YLEKALAEEDADTVVSKLRR----DKGVEKINYISREDALAEFKSWSGFGELLE 124 (314) T ss_pred HHHHHCCCCCCHHH------------HHHHHHCHHHHHHHHHHHHH----HCCCCHHHCCCHHHHHHHHHCCCCCCHHHH T ss_conf 77420178840357------------87750134468999999987----406202101135667788633464300567 Q ss_pred HHHHCC Q ss_conf 654203 Q gi|254781004|r 161 ILESIP 166 (652) Q Consensus 161 li~~~~ 166 (652) +++.-| T Consensus 125 mLd~NP 130 (314) T TIGR00439 125 MLDDNP 130 (314) T ss_pred HHCCCC T ss_conf 641787 No 244 >KOG1434 consensus Probab=26.92 E-value=41 Score=13.70 Aligned_cols=43 Identities=26% Similarity=0.453 Sum_probs=21.6 Q ss_pred CCCCCCCCCCHHHHCCCCCC-----CEEEEECCCEEEEEEEEECCCHHHHHHHHHHHHHHC Q ss_conf 33100121101110223467-----416731797213799640341037899999999738 Q gi|254781004|r 488 GRDWQCGTIQVDFNLPSRFN-----AFYVNSHSEKCHPVMIHRAVFGSIERFIGIMIENFK 543 (652) Q Consensus 488 gr~~~~~tiq~df~~p~~f~-----l~y~~~dg~~~~pvmihr~~~Gs~eR~ia~liE~~~ 543 (652) |+..-+.|.-++.++|..|+ +-|+|-+|.- -.+|...+. |+++ T Consensus 127 GKTQlshtLcVt~QlPr~~Gg~~Gk~ifIDTEgtF------------rpdRi~~IA-e~~~ 174 (335) T KOG1434 127 GKTQLSHTLCVTVQLPREMGGVGGKAIFIDTEGTF------------RPDRIKDIA-ERFK 174 (335) T ss_pred CCCCEEEEEEEEEECCHHHCCCCCEEEEEECCCCC------------CHHHHHHHH-HHHC T ss_conf 70102358999765643218877428999258861------------458999999-8748 No 245 >pfam07476 MAAL_C Methylaspartate ammonia-lyase C-terminus. Methylaspartate ammonia-lyase EC:4.3.1.2 catalyses the second step of fermentation of glutamate. It is a homodimer. This family represents the C-terminal region of Methylaspartate ammonia-lyase and contains a TIM barrel fold similar to the pfam01188. This family represents the catalytic domain and contains a metal binding site. Probab=26.92 E-value=41 Score=13.70 Aligned_cols=64 Identities=14% Similarity=0.292 Sum_probs=38.0 Q ss_pred EEEECCCHHHHHHHHHHHH---HHCCCCCCCCCCEEEEEE-EC----CHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 9640341037899999999---738708873378089998-57----9668999999999999879889997 Q gi|254781004|r 522 MIHRAVFGSIERFIGIMIE---NFKGNLPLWLSPIQAIVT-TI----TSSAVEYAQEIANLLKSHHLSIETD 585 (652) Q Consensus 522 mihr~~~Gs~eR~ia~liE---~~~g~~P~wLAP~QV~Ii-pi----~e~~~eya~~i~~~L~~~girV~~D 585 (652) .+|--+||++...++-=.+ .|=+.+-..-||+.+.|= |+ .+++.+.-.++.+.|.+.|+.|.+- T Consensus 72 ~lHiDvYGtiG~~f~~d~~~~adYl~~Le~aaap~~L~iEgPmd~g~r~~Qie~l~~Lr~~L~~~G~~v~iV 143 (249) T pfam07476 72 ILHIDVYGTIGDAFDNDLDRMADYLATLEEAAAPFPLRIEGPMDAGSKEAQIERLAALREKLDRRGIGVEIV 143 (249) T ss_pred EEEEECCCHHHHHCCCCHHHHHHHHHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEE T ss_conf 078871106888808978999999999998568972453077566775999999999999999648984798 No 246 >PRK02458 ribose-phosphate pyrophosphokinase; Provisional Probab=26.41 E-value=37 Score=14.06 Aligned_cols=72 Identities=18% Similarity=0.195 Sum_probs=29.0 Q ss_pred HHHHHHCCCCCCCEEEEEECCEEE-ECCEEECCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 999863611011289999898899-77453036707986289999999999989999999999977998899724 Q gi|254781004|r 25 DVAESISRSLAKKAVAVAINGKVC-DLSDPVREGSIEIITPEDPRSLAVIRHSCAHIMAEAVQSIWPEVQVAIGP 98 (652) Q Consensus 25 dia~~i~~~~~~~~vaa~vng~l~-dL~~~l~d~~i~~i~~~~~eg~~i~~hS~ahlL~~Av~~l~p~~kl~igp 98 (652) +||+.++-.+.+..+--.=||++. .+...+.+.+|-++...+..-..-+. -.++|+.|+++.-...-..+-| T Consensus 24 ~ia~~Lg~~l~~~~~~~FpDGE~~v~i~e~Vrg~dV~iiqs~~~p~nd~lm--ELll~idA~r~agA~~It~ViP 96 (323) T PRK02458 24 KIAKAAGIPLGKLSSRQFSDGEIMINIEESVRGDDIYIIQSTSFPVNDHLW--ELLIMIDACKRASANTVNVVLP 96 (323) T ss_pred HHHHHHCCCCCCEEEEECCCCCEEEEECCCCCCCEEEEECCCCCCCHHHHH--HHHHHHHHHHHCCCCEEEEEEC T ss_conf 999996898412188988999889864787789838998689989308999--9999999998738871899832 No 247 >cd01203 DOK_PTB Downstream of tyrosine kinase (DOK) Phosphotyrosine-binding domain. This domain has a PH-like fold and is similiar to the PTB domain that is found in insulin receptor substrate molecules The DOK family of eukaryotic signaling molecules have an N-terminal PH domain, followed by an IRS-like PTB domain. This PTBi domain is shorter than the PTB domain which is found in SHC, Numb and other proteins. The PTBi domain binds to phosphotyrosines which are in NPXpY motifs. Probab=26.40 E-value=42 Score=13.64 Aligned_cols=44 Identities=27% Similarity=0.450 Sum_probs=35.5 Q ss_pred HHHHHHHHHCCEEEECC-----CCCCEE-EEECCHHHHHHHHHHHHHHHH Q ss_conf 00887541144044226-----787435-650107899999999999887 Q gi|254781004|r 244 RDHRKLAREMDLFHIAE-----DGSGVI-FWHRKGWKIFQTLISYMRRKI 287 (652) Q Consensus 244 rdHr~lg~~~~lf~~~~-----~~~G~~-~wlP~G~~i~~~ie~~ir~~~ 287 (652) +--|+-|.+-.+|.|.- .|+|++ |-.+.|-.|++.++..|+... T Consensus 49 ~~lRryG~d~~~FsFEaGRrC~~GeG~f~F~t~~~~~If~~v~~~i~~qk 98 (104) T cd01203 49 RFLRKYGRDKGKFSFEAGRRCTSGEGVFTFDTTQGNEIFRAVEAAIKSQK 98 (104) T ss_pred HHHHHHCCCCCEEEEEECCCCCCCCCEEEEECCCHHHHHHHHHHHHHHHH T ss_conf 88465464786899981487898997799954999999999999999998 No 248 >COG0854 PdxJ Pyridoxal phosphate biosynthesis protein [Coenzyme metabolism] Probab=26.25 E-value=42 Score=13.62 Aligned_cols=55 Identities=13% Similarity=0.141 Sum_probs=24.4 Q ss_pred CCCEEEEEEECC------H------HHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHCCCCEE Q ss_conf 378089998579------6------68999999999999879889997589663676665543489889 Q gi|254781004|r 550 LSPIQAIVTTIT------S------SAVEYAQEIANLLKSHHLSIETDFRNETINYKIREHSIKKIPII 606 (652) Q Consensus 550 LAP~QV~Iipi~------e------~~~eya~~i~~~L~~~girV~~Ddr~~~~G~Kir~a~l~giP~~ 606 (652) +-|.||+++|=+ | +..+.-..+..+|++.|+||-+=--.. -.-|..|-..|.|++ T Consensus 83 ~kP~~vtLVPe~r~evTTegGlD~~~~~~~l~~~v~~L~~~GirVSLFiD~d--~~qi~aa~~~gA~~I 149 (243) T COG0854 83 TKPHQVTLVPEKREEVTTEGGLDVAGQLDKLRDAVRRLKNAGIRVSLFIDPD--PEQIEAAAEVGAPRI 149 (243) T ss_pred CCCCEEEECCCCHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEEECCC--HHHHHHHHHHCCCEE T ss_conf 5987478578964641455563355002469999999985797699972799--899999998499879 No 249 >cd04886 ACT_ThrD-II-like C-terminal ACT domain of biodegradative (catabolic) threonine dehydratase II (ThrD-II) and other related ACT domains. This CD includes the C-terminal ACT domain of biodegradative (catabolic) threonine dehydratase II (ThrD-II) and other related ACT domains. The Escherichia coli tdcB gene product, ThrD-II, anaerobically catalyzes the pyridoxal phosphate-dependent dehydration of L-threonine and L-serine to ammonia and to alpha-ketobutyrate and pyruvate, respectively. Tetrameric ThrD-II is subject to allosteric activation by AMP, inhibition by alpha-keto acids, and catabolite inactivation by several metabolites of glycolysis and the citric acid cycle. Also included in this CD are N-terminal ACT domains present in smaller (~170 a.a.) archaeal proteins of unknown function. Members of this CD belong to the superfamily of ACT regulatory domains. Probab=25.75 E-value=43 Score=13.56 Aligned_cols=55 Identities=16% Similarity=0.198 Sum_probs=33.5 Q ss_pred HHHHHHHHHHHHHHCC-------CCCCCCCCEEEEE-EECCHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 0378999999997387-------0887337808999-85796689999999999998798899 Q gi|254781004|r 529 GSIERFIGIMIENFKG-------NLPLWLSPIQAIV-TTITSSAVEYAQEIANLLKSHHLSIE 583 (652) Q Consensus 529 Gs~eR~ia~liE~~~g-------~~P~wLAP~QV~I-ipi~e~~~eya~~i~~~L~~~girV~ 583 (652) |.+.++..++-+..+= ++-..+.+-.+.| +.+-..-.+.+++|.+.|+++||+|. T Consensus 10 G~L~~l~~~ia~~gaNI~~i~H~R~~~~~~~~~~~v~i~~Et~~~eh~~~i~~~L~~~Gy~v~ 72 (73) T cd04886 10 GQLAKLLAVIAEAGANIIEVSHDRAFKTLPLGEVEVELTLETRGAEHIEEIIAALREAGYDVR 72 (73) T ss_pred CHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCEEEEEEEEECCCHHHHHHHHHHHHHCCCCEE T ss_conf 179999999998698289999986458898876999999984999999999999998699717 No 250 >pfam12387 Peptidase_C74 Pestivirus NS2 peptidase. The pestivirus NS2 peptidase is responsible for single cleavage between NS2 and NS3 of the bovine viral diarrhea virus polyprotein, a cleavage that is correlated with cytopathogenicity. The peptidase is activated by its interaction with 'J-domain protein interacting with viral protein' - Jiv. Probab=25.43 E-value=44 Score=13.54 Aligned_cols=12 Identities=33% Similarity=0.493 Sum_probs=8.2 Q ss_pred HHHHHHHHHHHH Q ss_conf 789999999973 Q gi|254781004|r 531 IERFIGIMIENF 542 (652) Q Consensus 531 ~eR~ia~liE~~ 542 (652) ++|+||+|||-+ T Consensus 86 ~SR~iAalIElN 97 (200) T pfam12387 86 LSRLIAALIELN 97 (200) T ss_pred HHHHHHHHHHHH T ss_conf 999999999863 No 251 >PRK05809 3-hydroxybutyryl-CoA dehydratase; Validated Probab=25.38 E-value=44 Score=13.51 Aligned_cols=15 Identities=33% Similarity=0.740 Sum_probs=5.3 Q ss_pred CEEEEECCCCHHHHH Q ss_conf 707986289999999 Q gi|254781004|r 57 GSIEIITPEDPRSLA 71 (652) Q Consensus 57 ~~i~~i~~~~~eg~~ 71 (652) +.|-.||++.|+.+. T Consensus 12 ~~va~itlnrP~~~N 26 (260) T PRK05809 12 GNIAVVTINRPKALN 26 (260) T ss_pred CCEEEEEECCCCCCC T ss_conf 999999975898778 No 252 >pfam04895 DUF651 Archaeal protein of unknown function (DUF651). This family represents the carboxy terminal region of an archaeal protein of unknown function. Probab=25.29 E-value=44 Score=13.50 Aligned_cols=32 Identities=13% Similarity=0.256 Sum_probs=20.6 Q ss_pred HHHHHCCEEEECCCCCCEEEEECCHH-HHHHHHHH Q ss_conf 75411440442267874356501078-99999999 Q gi|254781004|r 248 KLAREMDLFHIAEDGSGVIFWHRKGW-KIFQTLIS 281 (652) Q Consensus 248 ~lg~~~~lf~~~~~~~G~~~wlP~G~-~i~~~ie~ 281 (652) +++++-....+.+..|| +|.|.|. .|++.+-+ T Consensus 29 ~~kRQA~viv~REi~p~--Y~aPvGVW~VRE~vR~ 61 (110) T pfam04895 29 RRRRQAGAIVLREITPG--YYAPVGVWQVRENVRE 61 (110) T ss_pred HCCCCCEEEEEEEECCC--CEEEEEEEEHHHHHHH T ss_conf 80773128999985487--4311542534899999 No 253 >PRK02812 ribose-phosphate pyrophosphokinase; Provisional Probab=24.96 E-value=29 Score=14.74 Aligned_cols=78 Identities=15% Similarity=0.217 Sum_probs=29.8 Q ss_pred CCCCHHHHHHHHCC----CCCCCEEEEEECCEEE-ECCEEECCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHCCCCE Q ss_conf 98889999986361----1011289999898899-774530367079862899999999999899999999999779988 Q gi|254781004|r 19 VHATGSDVAESISR----SLAKKAVAVAINGKVC-DLSDPVREGSIEIITPEDPRSLAVIRHSCAHIMAEAVQSIWPEVQ 93 (652) Q Consensus 19 ~g~t~~dia~~i~~----~~~~~~vaa~vng~l~-dL~~~l~d~~i~~i~~~~~eg~~i~~hS~ahlL~~Av~~l~p~~k 93 (652) .|++-.++|+.|.+ .+.+..+--.=||++. .....+.+++|-++...+..-..-+. =.++|..|+++.--..- T Consensus 26 sg~s~~~La~~Ia~~Lg~~l~~~~~~kF~DGE~~v~i~esVrg~dV~iiqs~~~pvnd~lm--ELll~idA~krasA~~I 103 (331) T PRK02812 26 SGSSNPALAQEVARYLGIDLGPMIRKRFADGELYVQIQESIRGCDVYLIQPTCAPVNDHLM--ELLIMVDACRRASARQI 103 (331) T ss_pred ECCCCHHHHHHHHHHHCCCCCCEEEEECCCCCEEEEECCCCCCCEEEEECCCCCCCCHHHH--HHHHHHHHHHHCCCCEE T ss_conf 7899999999999996898534078988999889865687788769998379998418999--99999999877578727 Q ss_pred EEEEC Q ss_conf 99724 Q gi|254781004|r 94 VAIGP 98 (652) Q Consensus 94 l~igp 98 (652) ..+-| T Consensus 104 t~ViP 108 (331) T PRK02812 104 TAVIP 108 (331) T ss_pred EEEEC T ss_conf 99832 No 254 >TIGR02635 RhaI_grampos L-rhamnose isomerase; InterPro: IPR013457 The proteins in this entry are closely related to the L-rhamnose isomerases (IPR013451 from INTERPRO) found in Pseudomonas stutzeri and in a number of the Rhizobiales. They are encoded in similar genomic contexts, close to genes apparently involved in rhamnose catabolism such as rhamnulose-1-phosphate aldolase (IPR013454 from INTERPRO), sugar kinases, and sugar transporters.. Probab=24.94 E-value=45 Score=13.46 Aligned_cols=167 Identities=22% Similarity=0.311 Sum_probs=96.0 Q ss_pred CHHHHHH-HHHHCCCCHHHCCHHHHHHEEEECCCCCCCCEEEEECCCCCCCCCEEE---CCHHH-HHHHHHHHHHHHHHH Q ss_conf 1257777-652012222310432000100012787645200010563533442101---05376-788789999998876 Q gi|254781004|r 340 NCPGHVA-VFNHGLKSYRELPVRLAEFGSVYRNEPSGSLHGLMRVRGFTQDDAHVF---CTKEQ-MFNECLKIHNLIISI 414 (652) Q Consensus 340 ncp~h~~-i~~~~~~SYrdLPlrl~e~~~~~R~E~sg~l~GL~R~ReF~~~Dah~F---~~~eq-~~~e~~~~~~~~~~v 414 (652) -||...+ |==..+-.|+.| .+++| ..|-.-|=.-.--| |||.|=| |.++. +..-...-+-.|.++ T Consensus 55 ~cPtVALHIPWD~v~Dy~~L-~~yAE--------~~Gl~~GaiNpNlF-QDDDYKfGSlThp~~~iR~KAi~h~LeCvdI 124 (382) T TIGR02635 55 ICPTVALHIPWDRVDDYEEL-AEYAE--------ELGLKLGAINPNLF-QDDDYKFGSLTHPDKRIRRKAIDHLLECVDI 124 (382) T ss_pred CCCCCCCCCCCCCCCCHHHH-HHHHH--------HCCCCCCCCCCCCC-CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 76410114899896026999-99998--------72884445378767-6854355676573479999999999989999 Q ss_pred HHHHCCCCCEECCCCCCCCCCCCHH---HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCC Q ss_conf 6552133210001465311258603---5778999999999850212125667520555555455432110021103310 Q gi|254781004|r 415 YKDFGFEKIMVKLSTRPEKRVGSDA---LWDDAENIMKGVLDTIKNSSQDRINTGVLLGEGAFYGPKFEYILKDAIGRDW 491 (652) Q Consensus 415 ~~~~g~~~~~~~ls~~~~~~~g~~~---~~~~ae~~l~~~l~~~~~~~~~~~~~~~~~g~~afygpkid~~~~d~~gr~~ 491 (652) .+..|-..+++=|+.+. .+-|-+. --+.-++.|.++...++......++|. -.|-|||-+.+- .| T Consensus 125 a~~tGSkdisLWLADGT-nYPGQdD~R~Rk~RL~EsL~~vY~~l~~dmRlLiEYK--FFEPAfY~TD~p---------DW 192 (382) T TIGR02635 125 AKKTGSKDISLWLADGT-NYPGQDDIRKRKDRLEESLAEVYEHLGDDMRLLIEYK--FFEPAFYHTDIP---------DW 192 (382) T ss_pred HHHHCCCEEEEEEECCC-CCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEEE--CCCCCCCCCCCC---------HH T ss_conf 98608852562210578-7358877544788998889999851698877853220--457534457833---------35 Q ss_pred CCCCCCHHHHCCCCCCCEEEEECCCEEEEE--EEEECCCHH-HHHHHHHHHHHH Q ss_conf 012110111022346741673179721379--964034103-789999999973 Q gi|254781004|r 492 QCGTIQVDFNLPSRFNAFYVNSHSEKCHPV--MIHRAVFGS-IERFIGIMIENF 542 (652) Q Consensus 492 ~~~tiq~df~~p~~f~l~y~~~dg~~~~pv--mihr~~~Gs-~eR~ia~liE~~ 542 (652) -.+ +.+++ +=|.+..++ .+|-+. |+ ||=+||.|+..- T Consensus 193 GtA-----Y~~~~--------kLG~rA~VlVD~GHHa~-GTNIE~IVA~LL~e~ 232 (382) T TIGR02635 193 GTA-----YALSK--------KLGDRALVLVDTGHHAQ-GTNIEFIVATLLDEK 232 (382) T ss_pred HHH-----HHHHH--------HCCCCEEEEEECCCCCC-CCCHHHHHHHHHHCC T ss_conf 799-----99998--------54892389862577987-774899999887267 No 255 >pfam04914 DltD_C DltD C-terminal region. DltD is and integral membrane protein involved in the biosynthesis of D-alanyl-lipoteichoic acid. This is important in controlling the net ionic charge in lipoteichoic acid (LTA). This family is found in bacteria of the Bacillus/Clostridium group. DltD binds Dcp and ligates it with D-alanine. DltD does not ligate acyl carrier protein (ACP) with D-alanine. It also has thioesterase activity for mischarged D-alanyl-acyl carrier protein (ACP). DltD is thought to be responsible for discriminating between Dcp involved in the D-alanylation of LTA, and ACP involved in fatty acid biosynthesis. This family consists of the C-terminal region of DltD. Probab=24.10 E-value=46 Score=13.35 Aligned_cols=52 Identities=12% Similarity=0.268 Sum_probs=35.5 Q ss_pred CHHHHHHHHHHHHHHCCCCCCCCCCEEEEEEECC-----------HHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 1037899999999738708873378089998579-----------668999999999999879889997589 Q gi|254781004|r 528 FGSIERFIGIMIENFKGNLPLWLSPIQAIVTTIT-----------SSAVEYAQEIANLLKSHHLSIETDFRN 588 (652) Q Consensus 528 ~Gs~eR~ia~liE~~~g~~P~wLAP~QV~Iipi~-----------e~~~eya~~i~~~L~~~girV~~Ddr~ 588 (652) |+-++-++-++-|.. +=..++|+||+ +.-.++.++|...+.++|++ .+|.++ T Consensus 35 y~Dlql~L~~~~~~~--------~~vlfVi~PvNgkWydytGl~k~~r~~~y~KI~~~~~~~Gf~-i~Dls~ 97 (130) T pfam04914 35 YNDLQLVLDQFAKAG--------ADVLFVIPPVNGKWYDYTGLSKEMRQQYYKKIKYQLKSQGFN-IADLSD 97 (130) T ss_pred HHHHHHHHHHHHHCC--------CCEEEEECCCCCHHHHHCCCCHHHHHHHHHHHHHHHHHCCCE-EEECCC T ss_conf 068999999999849--------941898547770588870899999999999999999987983-872445 No 256 >KOG2655 consensus Probab=24.05 E-value=46 Score=13.35 Aligned_cols=14 Identities=7% Similarity=0.124 Sum_probs=7.5 Q ss_pred ECCCCCCCCCCCCC Q ss_conf 55432443100011 Q gi|254781004|r 295 NTPQVLDQHLWQQS 308 (652) Q Consensus 295 ~tP~l~~~~Lw~~S 308 (652) ..|+|.+++..... T Consensus 161 iIPVI~KaD~lT~~ 174 (366) T KOG2655 161 LIPVIAKADTLTKD 174 (366) T ss_pred CCCEEECCCCCCHH T ss_conf 10316405668989 No 257 >PRK01742 tolB translocation protein TolB; Provisional Probab=23.80 E-value=47 Score=13.32 Aligned_cols=14 Identities=29% Similarity=0.276 Sum_probs=8.0 Q ss_pred HHHHHHHHCCCCEE Q ss_conf 99999996699804 Q gi|254781004|r 124 KKMQEIIARDSPFL 137 (652) Q Consensus 124 ~~M~~ii~~~~pi~ 137 (652) +.+.++|..|+... T Consensus 54 ~~i~~vI~~DL~~S 67 (430) T PRK01742 54 EDIAKIIAADLRNS 67 (430) T ss_pred HHHHHHHHHHHHCC T ss_conf 78999998777608 No 258 >pfam00789 UBX UBX domain. This domain is present in ubiquitin-regulatory proteins and is a general Cdc48-interacting module. Probab=23.78 E-value=47 Score=13.31 Aligned_cols=29 Identities=24% Similarity=0.409 Sum_probs=22.7 Q ss_pred CCEEEECCCCCEE--ECCCCCCHHHHHHHHC Q ss_conf 8708992899887--4189888999998636 Q gi|254781004|r 3 SDIKLTFPDGSIK--NFPVHATGSDVAESIS 31 (652) Q Consensus 3 ~~i~I~lpDG~~~--~~~~g~t~~dia~~i~ 31 (652) ..|.|.||||+.. .|....|+.++..-+. T Consensus 7 t~I~iRlpdG~r~~r~F~~~~tl~~v~~fV~ 37 (81) T pfam00789 7 CRLQIRLPDGSRLVRRFNSSDPLQDVYDFVD 37 (81) T ss_pred EEEEEECCCCCEEEEEECCCCCHHHHHHHHH T ss_conf 8999998999889999089983999999999 No 259 >pfam10605 3HBOH 3HB-oligomer hydrolase (3HBOH). D-(-)-3-hydroxybutyrate oligomer hydrolase (also known as 3HB-oligomer hydrolase) functions in the degradation of poly-3-hydroxybutyrate (PHB). It catalyses the hydrolysis of D(-)-3-hydroxybutyrate oligomers (3HB-oligomers) into 3HB-monomers. Probab=23.43 E-value=48 Score=13.27 Aligned_cols=23 Identities=22% Similarity=0.358 Sum_probs=13.7 Q ss_pred CCCHHHHHHHHCCCCEEEEECHH Q ss_conf 63676665543489889998742 Q gi|254781004|r 590 TINYKIREHSIKKIPIIIICGDK 612 (652) Q Consensus 590 ~~G~Kir~a~l~giP~~ivIG~k 612 (652) .+..-.+.++|.|-|-+||-|.. T Consensus 542 Gi~ev~~~gnL~GKPAiIV~GRs 564 (689) T pfam10605 542 GVSAVRRSGNLQGKPAIIVHGRA 564 (689) T ss_pred HHHHHHHCCCCCCCCEEEEECCC T ss_conf 49999861656898538983465 No 260 >KOG0373 consensus Probab=23.41 E-value=15 Score=16.79 Aligned_cols=27 Identities=22% Similarity=0.508 Sum_probs=15.7 Q ss_pred CEEEEECCCCCC-CCCEEECCHHHHHHH Q ss_conf 200010563533-442101053767887 Q gi|254781004|r 377 LHGLMRVRGFTQ-DDAHVFCTKEQMFNE 403 (652) Q Consensus 377 l~GL~R~ReF~~-~Dah~F~~~eq~~~e 403 (652) ++|---++||.+ ++.-..|+..|...| T Consensus 215 lFGskVt~eF~~iN~L~LicRaHQLV~E 242 (306) T KOG0373 215 LFGSKVTTEFNHINNLNLICRAHQLVQE 242 (306) T ss_pred EECHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 3130555887740651787868899985 No 261 >TIGR02331 rib_alpha Rib/alpha/Esp surface antigen repeat; InterPro: IPR012706 This entry represents a region of about 79 amino acids found tandemly repeated up to fourteen times within the proteins that contain it. The repeats lack cysteines and are highly conserved, even at the DNA level, within and between proteins . Proteins containing these repeats include the Rib and alpha surface antigens of group B Streptococcus, Esp of Enterococcus faecalis, and related proteins of Lactobacillus. Most members of this protein family also have the cell wall anchor motif, LPXTG, shared by many staphyloccal and streptococcal surface antigens. These repeats are thought to define protective epitopes and may play a role in generating phenotypic and genotypic variation .. Probab=23.12 E-value=46 Score=13.40 Aligned_cols=19 Identities=32% Similarity=0.688 Sum_probs=12.8 Q ss_pred CCEEEECCCCCEEECCCCC Q ss_conf 8708992899887418988 Q gi|254781004|r 3 SDIKLTFPDGSIKNFPVHA 21 (652) Q Consensus 3 ~~i~I~lpDG~~~~~~~g~ 21 (652) ..|.||.|||+.=+++--+ T Consensus 48 a~v~VTYPDGtkDtV~VtV 66 (83) T TIGR02331 48 AVVVVTYPDGTKDTVPVTV 66 (83) T ss_pred CEEEEECCCCCCCCCCCCE T ss_conf 6477844899614125523 No 262 >PRK06143 enoyl-CoA hydratase; Provisional Probab=23.10 E-value=48 Score=13.23 Aligned_cols=46 Identities=15% Similarity=0.225 Sum_probs=24.2 Q ss_pred CCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHH--CCCCEEEEECCCC Q ss_conf 670798628999999999998999999999997--7998899724358 Q gi|254781004|r 56 EGSIEIITPEDPRSLAVIRHSCAHIMAEAVQSI--WPEVQVAIGPIIE 101 (652) Q Consensus 56 d~~i~~i~~~~~eg~~i~~hS~ahlL~~Av~~l--~p~~kl~igp~i~ 101 (652) |+.|-.||++.|+-+.++-.....=|..|+.++ -|++++.|=-.-+ T Consensus 16 e~gVa~ItlnrP~~~Nal~~~m~~eL~~~~~~~~~d~~vrvvVltg~G 63 (260) T PRK06143 16 DRGVATLTIRNAGSLNILGTPVILALTQALRWLAADPDVRVLVLRGAG 63 (260) T ss_pred CCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCC T ss_conf 898899997589887999999999999999988538995599996588 No 263 >cd03314 MAL Methylaspartate ammonia lyase (3-methylaspartase, MAL) is a homodimeric enzyme, catalyzing the magnesium-dependent reversible alpha,beta-elimination of ammonia from L-threo-(2S,3S)-3-methylaspartic acid to mesaconic acid. This reaction is part of the main catabolic pathway for glutamate. MAL belongs to the enolase superfamily of enzymes, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion. Probab=22.96 E-value=49 Score=13.21 Aligned_cols=71 Identities=14% Similarity=0.255 Sum_probs=44.9 Q ss_pred EEEECCCHHHHHHHHHHHH---HHCCCCCCCCCCEEEEEE-EC----CHHHHHHHHHHHHHHHHCCCEEEE--ECCCCCC Q ss_conf 9640341037899999999---738708873378089998-57----966899999999999987988999--7589663 Q gi|254781004|r 522 MIHRAVFGSIERFIGIMIE---NFKGNLPLWLSPIQAIVT-TI----TSSAVEYAQEIANLLKSHHLSIET--DFRNETI 591 (652) Q Consensus 522 mihr~~~Gs~eR~ia~liE---~~~g~~P~wLAP~QV~Ii-pi----~e~~~eya~~i~~~L~~~girV~~--Ddr~~~~ 591 (652) .+|--+||++...++-=.+ .|-+.+-..-+|+.+.|= |+ .+++.+.-.++.+.|.+.|+.|.+ |..-+|+ T Consensus 194 ~lHiDVYGtiG~~f~~d~~r~adyl~~le~aa~p~~L~IEgPmd~g~r~~QIe~l~~Lr~~L~~~G~~v~iVADEWCNT~ 273 (369) T cd03314 194 ILHIDVYGTIGQAFDPDPDRAADYLATLEEAAAPFPLRIEGPMDAGSREAQIERMAALRAELDRRGVGVRIVADEWCNTL 273 (369) T ss_pred CEEEECCHHHHHHHCCCHHHHHHHHHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEHHCCCH T ss_conf 26887210588884898899999999999746897244217757677699999999999999964898469853111888 Q ss_pred C Q ss_conf 6 Q gi|254781004|r 592 N 592 (652) Q Consensus 592 G 592 (652) . T Consensus 274 e 274 (369) T cd03314 274 E 274 (369) T ss_pred H T ss_conf 9 No 264 >pfam07447 VP40 Matrix protein VP40. This family contains viral VP40 matrix proteins that seem to be restricted to the Filoviridae. These play an important role in the assembly process of virus particles by interacting with cellular factors, cellular membranes, and the ribonuclearprotein particle complex. It has been shown that the N-terminal region of VP40 folds into a mixture of hexameric and octameric states - these may have distinct roles. Probab=22.63 E-value=27 Score=14.92 Aligned_cols=12 Identities=25% Similarity=0.432 Sum_probs=6.4 Q ss_pred CCCEEEEECCHH Q ss_conf 874356501078 Q gi|254781004|r 262 GSGVIFWHRKGW 273 (652) Q Consensus 262 ~~G~~~wlP~G~ 273 (652) -.|+|.|||+|- T Consensus 89 mK~iPiWLPLGi 100 (295) T pfam07447 89 MKGIPAWLPLGI 100 (295) T ss_pred HHCCCEEEECCC T ss_conf 406750133132 No 265 >PRK04043 tolB translocation protein TolB; Provisional Probab=22.34 E-value=50 Score=13.13 Aligned_cols=21 Identities=19% Similarity=0.351 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHHCCCCEEEEE Q ss_conf 999999999996699804853 Q gi|254781004|r 120 AQIEKKMQEIIARDSPFLKQY 140 (652) Q Consensus 120 ~~Ie~~M~~ii~~~~pi~r~~ 140 (652) ..+.+.+.+||..|+...-.- T Consensus 44 ~~l~~~i~~II~~DL~rSG~F 64 (419) T PRK04043 44 ANLKLKFFKILVNDLKVSSHF 64 (419) T ss_pred CCHHHHHHHHHHHHHHCCCCC T ss_conf 413689999998753416585 No 266 >PRK05137 tolB translocation protein TolB; Provisional Probab=22.14 E-value=50 Score=13.10 Aligned_cols=15 Identities=20% Similarity=0.200 Sum_probs=7.1 Q ss_pred HHHHHHHHHCCCCEE Q ss_conf 999999996699804 Q gi|254781004|r 123 EKKMQEIIARDSPFL 137 (652) Q Consensus 123 e~~M~~ii~~~~pi~ 137 (652) -+.|.++|..|+... T Consensus 56 ~~~i~~II~~DL~~S 70 (437) T PRK05137 56 GADIAQVIAADLKRS 70 (437) T ss_pred HHHHHHHHHHHHHHC T ss_conf 688999999778638 No 267 >PRK11130 moaD molybdopterin synthase small subunit; Provisional Probab=22.08 E-value=51 Score=13.09 Aligned_cols=50 Identities=12% Similarity=0.322 Sum_probs=33.5 Q ss_pred EECC-CCCCHHHHHHHHCCC-------CCCCEEEEEECCEEEECCEEECC-CEEEEECC Q ss_conf 7418-988899999863611-------01128999989889977453036-70798628 Q gi|254781004|r 15 KNFP-VHATGSDVAESISRS-------LAKKAVAVAINGKVCDLSDPVRE-GSIEIITP 64 (652) Q Consensus 15 ~~~~-~g~t~~dia~~i~~~-------~~~~~vaa~vng~l~dL~~~l~d-~~i~~i~~ 64 (652) .+++ .++|+.++.+.+... +...-+.+-||.++.+.+++|.| ++|-|+.. T Consensus 19 ~~l~~~~~tv~~l~~~L~~~~~~~~~~l~~~~~~~AVNq~~~~~d~~l~~gDEVAffPP 77 (81) T PRK11130 19 LELAADFPTVEALRQHLAQQGDRWALALEDGKLLAAVNQTLVSFDHPLTDGDEVAFFPP 77 (81) T ss_pred EECCCCCCCHHHHHHHHHHHCCHHHHHHCCCCEEEEECCEECCCCCCCCCCCEEEECCC T ss_conf 78789998899999999985835888624685677331377689976699999998089 No 268 >pfam06183 DinI DinI-like family. This family of short proteins includes DNA-damage-inducible protein I (DinI) and related proteins. The SOS response, a set of cellular phenomena exhibited by eubacteria, is initiated by various causes that include DNA damage-induced replication arrest, and is positively regulated by the co- protease activity of RecA. Escherichia coli DinI, a LexA-regulated SOS gene product, shuts off the initiation of the SOS response when overexpressed in vivo. Biochemical and genetic studies indicated that DinI physically interacts with RecA to inhibit its co-protease activity. The structure of DinI is known. Probab=21.76 E-value=51 Score=13.05 Aligned_cols=47 Identities=19% Similarity=0.345 Sum_probs=30.5 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHH Q ss_conf 9999999977998899724358855788408888889899999999999996 Q gi|254781004|r 80 IMAEAVQSIWPEVQVAIGPIIEDGFYYDFDKEQPFSSDELAQIEKKMQEIIA 131 (652) Q Consensus 80 lL~~Av~~l~p~~kl~igp~i~~GfYyD~~~~~~it~~dl~~Ie~~M~~ii~ 131 (652) =|-+=+++.||++.|.|.+.-.+|+= +.+. +.+|-+.|+.-++++-+ T Consensus 12 EL~kRl~~~fPd~~V~Vr~~s~n~l~--v~g~---~k~dK~~i~eiLqE~we 58 (65) T pfam06183 12 ELTKRIQSLYPDTVVSVRKGSANGLS--VTGA---AKDDKERIMEILQETWE 58 (65) T ss_pred HHHHHHHHHCCCCEEEEEECCCCCEE--ECCC---CHHHHHHHHHHHHHHHH T ss_conf 99998653389957987105679705--4387---74039999999999996 No 269 >KOG1643 consensus Probab=21.70 E-value=51 Score=13.04 Aligned_cols=34 Identities=12% Similarity=0.063 Sum_probs=12.0 Q ss_pred CCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 6707986289999999999989999999999977 Q gi|254781004|r 56 EGSIEIITPEDPRSLAVIRHSCAHIMAEAVQSIW 89 (652) Q Consensus 56 d~~i~~i~~~~~eg~~i~~hS~ahlL~~Av~~l~ 89 (652) |+-.+++-..+.|-+++|--|.+|+--++...|- T Consensus 84 d~G~~wVIlGHSERR~~fgEsd~~i~~K~~~Al~ 117 (247) T KOG1643 84 DLGAEWVILGHSERRHVFGESDEFIADKTAHALA 117 (247) T ss_pred HCCCCEEEECCHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 6799789954466643507734789999999997 No 270 >pfam03658 UPF0125 Uncharacterized protein family (UPF0125). Probab=21.69 E-value=51 Score=13.04 Aligned_cols=56 Identities=25% Similarity=0.395 Sum_probs=34.0 Q ss_pred CCCCC---EEECCCCCCHHHHHHH--HC---CCCCCCEEEEEECCEEEECCEEECCC-EEEEECC Q ss_conf 28998---8741898889999986--36---11011289999898899774530367-0798628 Q gi|254781004|r 9 FPDGS---IKNFPVHATGSDVAES--IS---RSLAKKAVAVAINGKVCDLSDPVREG-SIEIITP 64 (652) Q Consensus 9 lpDG~---~~~~~~g~t~~dia~~--i~---~~~~~~~vaa~vng~l~dL~~~l~d~-~i~~i~~ 64 (652) +|+.. ..+++.|+|..|..+. +. +.+.-.....=|=|++..|+++|.++ .||+.-. T Consensus 9 ~p~~q~~~~l~v~~GtTv~~Ai~~Sgi~~~fPeidl~~~~vGIfgk~v~l~~~L~~GDRVEIYRP 73 (83) T pfam03658 9 LPERQYLKSVDVAEGTTVREAILQSGLLRQFPDIDLSKNKVGIFGKPVKLDDPLKDGDRIEIYRP 73 (83) T ss_pred CCCEEEEEEEECCCCCCHHHHHHHCCCHHHCCCCCCCCCCEEEECCCCCCCCCCCCCCEEEEECC T ss_conf 89957999998699894999999759565696655564636468847388770799999999536 No 271 >pfam09837 DUF2064 Uncharacterized protein conserved in bacteria (DUF2064). This domain, found in various prokaryotic proteins, has no known function. Probab=21.00 E-value=53 Score=12.94 Aligned_cols=27 Identities=22% Similarity=0.608 Sum_probs=16.6 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCCCEEE Q ss_conf 9999999997799889972435885578 Q gi|254781004|r 79 HIMAEAVQSIWPEVQVAIGPIIEDGFYY 106 (652) Q Consensus 79 hlL~~Av~~l~p~~kl~igp~i~~GfYy 106 (652) ..|.+|...|- +..+.+||+.+.|||. T Consensus 77 ~~l~~A~~~L~-~~d~VlgPa~DGGy~L 103 (121) T pfam09837 77 ELLAQAFEALE-RHDAVLGPAEDGGYYL 103 (121) T ss_pred HHHHHHHHHHH-CCCEEEEECCCCCEEE T ss_conf 99999999862-1997999748899999 No 272 >KOG3859 consensus Probab=20.92 E-value=53 Score=12.93 Aligned_cols=46 Identities=17% Similarity=0.231 Sum_probs=25.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCEEEECC--CCCCEEEEECCHHHHH Q ss_conf 89999999999753200887541144044226--7874356501078999 Q gi|254781004|r 229 QELTQYLYFLEESEKRDHRKLAREMDLFHIAE--DGSGVIFWHRKGWKIF 276 (652) Q Consensus 229 ~~l~~~~~~~eea~~rdHr~lg~~~~lf~~~~--~~~G~~~wlP~G~~i~ 276 (652) +-+-+|+...=||-.-.--||-+- ||.+|+ .-.-+||--|-|-.+. T Consensus 116 k~iVdyidaQFEaYLQEELKi~Rs--l~~~hDsRiH~CLYFI~PTGH~LK 163 (406) T KOG3859 116 KPIVDYIDAQFEAYLQEELKIRRS--LFTYHDSRIHVCLYFISPTGHSLK 163 (406) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHH--HHHHCCCCEEEEEEEECCCCCCHH T ss_conf 269999999999999998778888--887526725899999568874145 No 273 >PRK05658 RNA polymerase sigma factor RpoD; Validated Probab=20.90 E-value=40 Score=13.76 Aligned_cols=47 Identities=21% Similarity=0.309 Sum_probs=24.7 Q ss_pred EEEECCC-CCCCCCEEECC-----------------------HHHHHHHHHHHHHHHHHHHHHHCCCCCEE Q ss_conf 0010563-53344210105-----------------------37678878999999887665521332100 Q gi|254781004|r 379 GLMRVRG-FTQDDAHVFCT-----------------------KEQMFNECLKIHNLIISIYKDFGFEKIMV 425 (652) Q Consensus 379 GL~R~Re-F~~~Dah~F~~-----------------------~eq~~~e~~~~~~~~~~v~~~~g~~~~~~ 425 (652) ||||.-+ |--.=+|=|.| |-++-+.+.++......+...+|-++.+- T Consensus 418 GLMkAVdKFeyrrGyKFSTYATWWIRQAITRaIaDqaRTIRiPVHmietinkl~r~~r~l~qe~Grept~e 488 (620) T PRK05658 418 GLMKAVDKFEYRRGYKFSTYATWWIRQAITRSIADQARTIRIPVHMIETINKLNRISRQMLQEMGREPTPE 488 (620) T ss_pred HHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHH T ss_conf 99999986284559713577999999999999986177401518899999999999999999828999999 No 274 >pfam04343 DUF488 Protein of unknown function, DUF488. This family includes several proteins of uncharacterized function. Probab=20.88 E-value=53 Score=12.93 Aligned_cols=18 Identities=28% Similarity=0.466 Sum_probs=7.5 Q ss_pred HHHHHHHHCCCEEEEECC Q ss_conf 999999987988999758 Q gi|254781004|r 570 EIANLLKSHHLSIETDFR 587 (652) Q Consensus 570 ~i~~~L~~~girV~~Ddr 587 (652) ++++.|++.|++|.+|-| T Consensus 4 ~f~~~L~~~gi~~lVDVR 21 (123) T pfam04343 4 RFYDLLKADGIRVLVDVR 21 (123) T ss_pred HHHHHHHHCCCEEEEEEC T ss_conf 999999987986999837 No 275 >PRK12775 putative trifunctional 2-polyprenylphenol hydroxylase/glutamate synthase subunit beta/ferritin domain-containing protein; Provisional Probab=20.88 E-value=53 Score=12.93 Aligned_cols=73 Identities=18% Similarity=0.263 Sum_probs=41.3 Q ss_pred CCCCCCCCCC----EEEEEEECCHHHHHHHHHHHHHHHHCCC-EEEEECC---CCCCCH--HHHHHHHCCCCEEEEECHH Q ss_conf 8708873378----0899985796689999999999998798-8999758---966367--6665543489889998742 Q gi|254781004|r 543 KGNLPLWLSP----IQAIVTTITSSAVEYAQEIANLLKSHHL-SIETDFR---NETINY--KIREHSIKKIPIIIICGDK 612 (652) Q Consensus 543 ~g~~P~wLAP----~QV~Iipi~e~~~eya~~i~~~L~~~gi-rV~~Ddr---~~~~G~--Kir~a~l~giP~~ivIG~k 612 (652) +..+|..=.| -.|+||-=++-.++.|+. ..+| |- .|.+=.| .+-++. -+.+|+..||-+..-.-+. T Consensus 560 ~~~~p~~dtPv~~GK~VvVIGGGNTAMDaART-A~RL---GAe~VtivYRRte~EMPA~~eEV~~A~EEGV~F~fL~~Pv 635 (993) T PRK12775 560 GDKFPYLDTPVSVGKSVVVIGAGNTAMDCLRV-ARRL---GAATVRCVYRRSEAEAPARIEEIRHAKEEGVDFFFLHSPV 635 (993) T ss_pred CCCCCCCCCCCCCCCEEEEECCCHHHHHHHHH-HHHC---CCCEEEEEEECCHHHCCCCHHHHHHHHHCCCEEEECCCCC T ss_conf 77677567864579969998994369998899-9976---9986799996885638988999987986593999626971 Q ss_pred HH---HCCCE Q ss_conf 53---25818 Q gi|254781004|r 613 EA---SERSI 619 (652) Q Consensus 613 e~---e~~~V 619 (652) ++ +++.| T Consensus 636 eiigde~G~V 645 (993) T PRK12775 636 EILVTEEGSV 645 (993) T ss_pred EEEECCCCEE T ss_conf 6786588639 No 276 >pfam01216 Calsequestrin Calsequestrin. Probab=20.85 E-value=54 Score=12.92 Aligned_cols=145 Identities=12% Similarity=0.180 Sum_probs=75.6 Q ss_pred HHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCCHHHHHHHHHCCCCCCCEEECCCCCEEECCCCCCCHHHHHHHH-HHHH Q ss_conf 999999996699804853276785445441671135566542034554213025752010002355214442223-3333 Q gi|254781004|r 123 EKKMQEIIARDSPFLKQYLSHEKARELFESKKEAYKVEILESIPAKENVTLYRQGEWFDLCRGPHVRSTGQVKKF-FKLM 201 (652) Q Consensus 123 e~~M~~ii~~~~pi~r~~vs~~eA~~~F~~~~~~~K~~li~~~~~~~~v~~y~~g~~~Dlc~GphvpsTg~ik~~-fkL~ 201 (652) |..-+.+-.+++++-+...+++..+. +..| +...+++.+++.|.-++ +.|+.-..+ |-.+ .|.+ T Consensus 57 elAAq~l~~k~I~l~kVDat~e~~La--~k~g----------v~gypTlkvFr~g~~~e-Y~G~R~ad~--IV~~l~k~~ 121 (350) T pfam01216 57 ELAAQVLEDKGVGFGLVDSEKDAAVA--KKLG----------LDEEDSLYVFKGDETIE-YDGEFSADT--LVEFLLDVL 121 (350) T ss_pred HHHHHHHCCCCCCEEEECCCHHHHHH--HHCC----------CCCCCEEEEEECCEEEC-CCCCCCHHH--HHHHHHHHH T ss_conf 99998741258863885151168899--9829----------87675289988992650-588766789--999999702 Q ss_pred HHHHHCCCCCCCCCEE----EEEEEEEECCHHHHHHHHHHHHHHH-HHHH--------HHHHHHCCEEEECCCCCCEEEE Q ss_conf 3211101577564206----8999998379889999999999753-2008--------8754114404422678743565 Q gi|254781004|r 202 KVAGAYWRGDSNRPML----SRIYGTAWNTQQELTQYLYFLEESE-KRDH--------RKLAREMDLFHIAEDGSGVIFW 268 (652) Q Consensus 202 ~~sgayw~gd~~~~~l----~Riyg~af~~~~~l~~~~~~~eea~-~rdH--------r~lg~~~~lf~~~~~~~G~~~w 268 (652) .-+..--.+...-+.+ .-|-=+.|...++=+.|..+.+-|. -||| ..+-+.+++ ..++..++ T Consensus 122 ~ppv~~i~s~~e~~~f~~~~d~v~vIGyF~~~ds~~~k~F~~aAe~~~~~~~F~~t~d~~vak~~~~-----k~~~Vvly 196 (350) T pfam01216 122 EDPVEIINGKLELQAFENIEDEIKLIGYFKSEDSEHYKAFEDAAEEFHPYIPFFATFDKKVAKKLTL-----KLNEIDFY 196 (350) T ss_pred CCCHHHHCCHHHHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHHHHCCCCCEEEEECCHHHHHHCCC-----CCCCEEEE T ss_conf 6814442588889987613585799999788787789999999985577753788776999997189-----88854572 Q ss_pred ECC---------HHHHHHHHHHHHHHHH Q ss_conf 010---------7899999999999887 Q gi|254781004|r 269 HRK---------GWKIFQTLISYMRRKI 287 (652) Q Consensus 269 lP~---------G~~i~~~ie~~ir~~~ 287 (652) -|- |-.--..|.+|+..+. T Consensus 197 k~F~de~~~~~~~~~~eeel~~fv~~~~ 224 (350) T pfam01216 197 EPFMDEPITIPDKPNSEEEIVEFVKEHQ 224 (350) T ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHCC T ss_conf 5535577657899888899999999767 No 277 >KOG0372 consensus Probab=20.56 E-value=54 Score=12.88 Aligned_cols=106 Identities=15% Similarity=0.185 Sum_probs=45.4 Q ss_pred CCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCC-HHHHHHHHHCCCCCCCEEECCCCCEEECCCCCCCHH Q ss_conf 8889899999999999996699804853276785445441671-135566542034554213025752010002355214 Q gi|254781004|r 113 PFSSDELAQIEKKMQEIIARDSPFLKQYLSHEKARELFESKKE-AYKVEILESIPAKENVTLYRQGEWFDLCRGPHVRST 191 (652) Q Consensus 113 ~it~~dl~~Ie~~M~~ii~~~~pi~r~~vs~~eA~~~F~~~~~-~~K~~li~~~~~~~~v~~y~~g~~~Dlc~GphvpsT 191 (652) -+.+.++..+-.+.++|.-+.-.+.+...+.--.-++ .|+ -+-+++++--..-+.......|||+| ||.---.| T Consensus 15 li~E~eV~~LC~~~~eiL~~E~NV~~i~tPvtvcGDI---HGQf~Dllelf~igG~~~~t~YlFLGDyVD--RG~~SvEt 89 (303) T KOG0372 15 LIAESEVKALCAKVREILVEESNVQRIDTPVTVCGDI---HGQFYDLLELFRIGGDVPETNYLFLGDYVD--RGYYSVET 89 (303) T ss_pred CCCHHHHHHHHHHHHHHHHCCCCCEECCCCCEEEECC---CCHHHHHHHHHHHCCCCCCCCEEEECCHHC--CCCCHHHH T ss_conf 7858899999999999985378944447881786035---303999999997179999886676412010--46016999 Q ss_pred HHHHHHHHHHHHHH-HCCCCCCCCCEEEEEEEE Q ss_conf 44222333333211-101577564206899999 Q gi|254781004|r 192 GQVKKFFKLMKVAG-AYWRGDSNRPMLSRIYGT 223 (652) Q Consensus 192 g~ik~~fkL~~~sg-ayw~gd~~~~~l~Riyg~ 223 (652) =.+..+.|+.--.- .--||+....|++++||. T Consensus 90 ~lLLl~lK~rYP~ritLiRGNHEsRqitqvYGF 122 (303) T KOG0372 90 FLLLLALKVRYPDRITLIRGNHESRQITQVYGF 122 (303) T ss_pred HHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHH T ss_conf 999999960286405775147322123555527 No 278 >TIGR00559 pdxJ pyridoxal phosphate biosynthetic protein PdxJ; InterPro: IPR004569 Pyridoxal phosphate is the active form of vitamin B6 (pyridoxine or pyridoxal). PLP is a versatile catalyst, acting as a coenzyme in a multitude of reactions, including decarboxylation, deamination and transamination , , . PLP-dependent enzymes are primarily involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and in the synthesis or catabolism of neurotransmitters; pyridoxal phosphate can also inhibit DNA polymerases and several steroid receptors . Inadequate levels of pyridoxal phosphate in the brain can cause neurological dysfunction, particularly epilepsy . PLP enzymes exist in their resting state as a Schiff base, the aldehyde group of PLP forming a linkage with the epsilon-amino group of an active site lysine residue on the enzyme. The alpha-amino group of the substrate displaces the lysine epsilon-amino group, in the process forming a new aldimine with the substrate. This aldimine is the common central intermediate for all PLP-catalyzed reactions, enzymatic and non-enzymatic . In Escherichia coli, the pdx genes involved in vitamin B6 have been characterised , , . This entry represents PdxJ, which catalyses the condensation of 1-amino-3-oxo-4-(phosphohydroxy)propan-2-one and 1-deoxy-D-xylulose-5-phosphate to form pyridoxine-5'-phosphate. The product of the PdxJ reaction is then oxidized by PdxH to pyridoxal 5'-phosphate.; GO: 0008615 pyridoxine biosynthetic process, 0005737 cytoplasm. Probab=20.08 E-value=55 Score=12.82 Aligned_cols=63 Identities=16% Similarity=0.232 Sum_probs=44.7 Q ss_pred HHHHHHHHHHHHCCCCCCCCC--CEEEEEEECC------H------HHHHHHHHHHHHHHHCCCEEE--EECCCCCCCHH Q ss_conf 789999999973870887337--8089998579------6------689999999999998798899--97589663676 Q gi|254781004|r 531 IERFIGIMIENFKGNLPLWLS--PIQAIVTTIT------S------SAVEYAQEIANLLKSHHLSIE--TDFRNETINYK 594 (652) Q Consensus 531 ~eR~ia~liE~~~g~~P~wLA--P~QV~Iipi~------e------~~~eya~~i~~~L~~~girV~--~Ddr~~~~G~K 594 (652) -|||+.++.| += |-||.++|=+ + ...+.-.++.+.+.++||+|- +|-..+ . T Consensus 73 ~ee~~~~~~~---------~knkP~~vTlVPe~r~evTtegGLDva~~~dkL~~~~~~~~~~GI~vSlFId~~~d----~ 139 (265) T TIGR00559 73 TEEMIEIAVE---------IKNKPEKVTLVPEKRDEVTTEGGLDVARLKDKLKELVKRLHEAGIEVSLFIDADKD----Q 139 (265) T ss_pred HHHHHHHHHH---------HCCCCCEEEECCCCCCEEECCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECCCHH----H T ss_conf 1899999998---------55899738741698860302644000110467999999998679858774254468----8 Q ss_pred HHHHHHCCCCEE Q ss_conf 665543489889 Q gi|254781004|r 595 IREHSIKKIPII 606 (652) Q Consensus 595 ir~a~l~giP~~ 606 (652) |.-|...|+|.+ T Consensus 140 i~~A~e~g~~~i 151 (265) T TIGR00559 140 ISAAAEVGADRI 151 (265) T ss_pred HHHHHHCCCCEE T ss_conf 899997089848 Done!