Query gi|254780419|ref|YP_003064832.1| aspartyl-tRNA synthetase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 622 No_of_seqs 217 out of 3095 Neff 5.8 Searched_HMMs 39220 Date Sun May 29 16:40:06 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780419.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG0173 AspS Aspartyl-tRNA syn 100.0 0 0 1725.1 48.6 582 4-593 1-584 (585) 2 TIGR00459 aspS_bact aspartyl-t 100.0 0 0 1730.5 39.2 587 4-592 2-647 (653) 3 PRK12820 bifunctional aspartyl 100.0 0 0 1546.6 48.0 599 7-616 7-626 (706) 4 PRK00476 aspS aspartyl-tRNA sy 100.0 0 0 1432.7 50.0 585 2-594 1-587 (587) 5 KOG2411 consensus 100.0 0 0 1337.8 41.6 580 3-596 32-628 (628) 6 PRK02983 lysS lysyl-tRNA synth 100.0 0 0 919.9 31.2 443 4-565 643-1096(1099) 7 PRK12445 lysyl-tRNA synthetase 100.0 0 0 885.6 31.7 431 3-566 39-502 (505) 8 PRK05159 aspC aspartyl-tRNA sy 100.0 0 0 877.6 35.0 416 4-571 2-434 (434) 9 PRK00484 lysS lysyl-tRNA synth 100.0 0 0 868.5 35.3 428 5-565 31-487 (491) 10 PRK03932 asnC asparaginyl-tRNA 100.0 0 0 781.1 34.5 433 8-570 5-461 (462) 11 COG0017 AsnS Aspartyl/asparagi 100.0 0 0 712.3 33.3 408 4-567 2-431 (435) 12 TIGR00458 aspS_arch aspartyl-t 100.0 0 0 691.4 23.3 417 7-567 1-462 (466) 13 cd00777 AspRS_core Asp tRNA sy 100.0 0 0 636.4 19.4 280 142-567 1-280 (280) 14 TIGR00457 asnS asparaginyl-tRN 100.0 0 0 623.3 25.3 444 14-569 12-493 (495) 15 pfam00152 tRNA-synt_2 tRNA syn 100.0 0 0 626.8 19.9 331 121-567 1-341 (341) 16 cd00776 AsxRS_core Asx tRNA sy 100.0 0 0 617.4 20.0 304 119-567 1-322 (322) 17 KOG0556 consensus 100.0 0 0 605.2 26.0 431 8-567 72-529 (533) 18 COG1190 LysU Lysyl-tRNA synthe 100.0 0 0 599.5 27.6 422 5-564 38-497 (502) 19 PRK09350 lysyl-tRNA synthetase 100.0 0 0 599.8 20.0 304 134-566 8-324 (325) 20 cd00775 LysRS_core Lys_tRNA sy 100.0 0 0 595.4 19.2 312 135-566 1-328 (329) 21 PRK06462 asparagine synthetase 100.0 0 0 553.7 16.8 313 121-566 8-326 (332) 22 cd00669 Asp_Lys_Asn_RS_core As 100.0 0 0 514.1 17.7 258 142-567 1-269 (269) 23 KOG1885 consensus 100.0 0 0 511.4 18.3 443 5-564 82-553 (560) 24 KOG0555 consensus 100.0 0 0 459.6 22.5 402 15-565 120-539 (545) 25 KOG0554 consensus 100.0 0 0 461.0 19.6 400 12-569 14-444 (446) 26 TIGR00499 lysS_bact lysyl-tRNA 100.0 1.4E-45 0 345.1 16.5 446 13-566 52-535 (538) 27 cd04317 EcAspRS_like_N EcAspRS 100.0 4.9E-38 1.3E-42 291.6 16.4 134 5-138 1-135 (135) 28 COG2269 Truncated, possibly in 100.0 1E-34 2.6E-39 267.9 15.5 291 140-566 14-321 (322) 29 TIGR00462 genX lysyl-tRNA synt 100.0 2E-28 5.1E-33 222.9 13.7 293 140-563 19-326 (330) 30 cd04316 ND_PkAspRS_like_N ND_P 99.9 7.2E-24 1.8E-28 190.2 14.3 103 7-116 1-107 (108) 31 cd04319 PhAsnRS_like_N PhAsnRS 99.9 1.8E-23 4.5E-28 187.5 12.9 101 20-131 1-103 (103) 32 cd04322 LysRS_N LysRS_N: N-ter 99.9 3.7E-22 9.5E-27 178.0 10.8 104 20-133 1-108 (108) 33 cd04321 ScAspRS_mt_like_N ScAs 99.8 1.1E-19 2.7E-24 160.4 12.2 84 20-106 1-86 (86) 34 cd04320 AspRS_cyto_N AspRS_cyt 99.8 1.1E-17 2.7E-22 146.2 12.7 95 20-116 1-102 (102) 35 cd04323 AsnRS_cyto_like_N AsnR 99.8 1.6E-17 4.1E-22 144.8 11.7 83 20-106 1-84 (84) 36 cd04100 Asp_Lys_Asn_RS_N Asp_L 99.7 1.9E-16 4.9E-21 137.1 11.8 83 20-106 1-85 (85) 37 cd04318 EcAsnRS_like_N EcAsnRS 99.5 1E-13 2.6E-18 117.6 11.2 80 20-106 1-82 (82) 38 pfam02938 GAD GAD domain. This 99.4 2.1E-13 5.4E-18 115.4 6.3 93 314-413 1-94 (94) 39 cd00496 PheRS_alpha_core Pheny 99.2 1.3E-10 3.3E-15 95.4 9.1 89 146-240 4-109 (218) 40 PRK00488 pheS phenylalanyl-tRN 99.2 5.5E-10 1.4E-14 90.9 10.7 107 125-239 86-212 (338) 41 PTZ00326 phenylalanyl-tRNA syn 99.1 4.8E-10 1.2E-14 91.3 10.2 99 140-239 239-396 (505) 42 COG0016 PheS Phenylalanyl-tRNA 99.1 2.9E-10 7.3E-15 92.9 7.4 110 123-239 88-220 (335) 43 pfam01409 tRNA-synt_2d tRNA sy 99.1 1.4E-09 3.6E-14 88.0 10.2 87 145-239 19-128 (243) 44 PRK04172 pheS phenylalanyl-tRN 99.0 3.8E-09 9.7E-14 84.9 11.2 69 451-560 419-490 (501) 45 cd00768 class_II_aaRS-like_cor 99.0 6.9E-10 1.8E-14 90.2 6.4 116 144-261 1-126 (211) 46 PRK12295 hisZ ATP phosphoribos 98.7 1.2E-06 3.1E-11 67.0 13.6 111 144-260 6-133 (373) 47 pfam00587 tRNA-synt_2b tRNA sy 98.6 1.2E-07 3.1E-12 74.1 6.8 113 144-259 1-131 (170) 48 pfam01336 tRNA_anti OB-fold nu 98.6 1.2E-06 3E-11 67.0 10.7 74 21-104 1-75 (75) 49 PRK00037 hisS histidyl-tRNA sy 98.2 3.9E-06 1E-10 63.4 6.0 98 140-241 16-133 (417) 50 CHL00201 syh histidine-tRNA sy 98.1 3.3E-06 8.5E-11 63.9 4.9 99 140-241 16-132 (424) 51 cd00670 Gly_His_Pro_Ser_Thr_tR 98.1 1.8E-05 4.6E-10 58.6 7.6 115 142-259 2-138 (235) 52 cd00771 ThrRS_core Threonyl-tR 98.1 1.5E-05 3.9E-10 59.2 7.1 115 141-261 29-164 (298) 53 PRK12325 prolyl-tRNA synthetas 98.1 1.6E-05 4.2E-10 58.9 7.1 110 141-258 46-179 (438) 54 cd00779 ProRS_core_prok Prolyl 98.1 1.4E-05 3.6E-10 59.4 6.6 110 141-258 30-163 (255) 55 COG0124 HisS Histidyl-tRNA syn 98.0 1.3E-05 3.3E-10 59.7 5.3 111 141-259 17-153 (429) 56 cd00778 ProRS_core_arch_euk Pr 97.9 4.5E-05 1.1E-09 55.8 7.5 121 140-264 30-174 (261) 57 PRK09537 pylS pyrolysyl-tRNA s 97.9 5.2E-06 1.3E-10 62.5 2.2 86 146-238 210-312 (420) 58 PRK12420 histidyl-tRNA synthet 97.8 6.9E-05 1.8E-09 54.5 6.7 98 141-241 17-129 (421) 59 PRK07892 consensus 97.8 0.00034 8.6E-09 49.5 9.8 11 587-597 982-992 (1180) 60 cd00770 SerRS_core Seryl-tRNA 97.8 6.9E-05 1.8E-09 54.4 6.0 124 141-276 51-199 (297) 61 TIGR02367 PylS pyrrolysyl-tRNA 97.7 3.1E-05 7.8E-10 57.0 3.9 110 144-260 241-369 (453) 62 cd00772 ProRS_core Prolyl-tRNA 97.7 0.00018 4.6E-09 51.5 7.3 111 141-258 31-169 (264) 63 PRK12292 hisZ ATP phosphoribos 97.7 9.7E-05 2.5E-09 53.4 5.9 99 141-241 16-128 (388) 64 PRK06658 consensus 97.6 0.0021 5.4E-08 43.8 11.8 20 238-257 403-422 (1145) 65 PRK06826 dnaE DNA polymerase I 97.6 0.0012 2.9E-08 45.7 9.8 11 275-285 465-475 (1157) 66 cd00773 HisRS-like_core Class 97.5 0.00032 8E-09 49.7 6.6 113 142-258 2-133 (261) 67 PRK09074 consensus 97.5 0.0016 4.1E-08 44.7 9.3 11 208-218 344-354 (1149) 68 COG0442 ProS Prolyl-tRNA synth 97.4 0.00038 9.6E-09 49.2 6.0 115 141-265 46-185 (500) 69 PRK05673 dnaE DNA polymerase I 97.4 0.0017 4.3E-08 44.5 8.6 10 586-595 944-953 (1143) 70 PRK07012 consensus 97.4 0.001 2.7E-08 46.0 7.4 15 586-600 952-966 (1173) 71 PRK07374 dnaE DNA polymerase I 97.3 0.0016 4.1E-08 44.7 8.0 16 585-600 966-981 (1171) 72 PRK06637 consensus 97.3 0.0018 4.6E-08 44.3 8.2 15 586-600 987-1001(1182) 73 PRK12305 thrS threonyl-tRNA sy 97.3 0.0013 3.4E-08 45.2 7.1 111 142-260 208-342 (576) 74 PRK12293 hisZ ATP phosphoribos 97.3 0.00078 2E-08 46.9 5.9 110 140-258 17-137 (281) 75 cd00774 GlyRS-like_core Glycyl 97.2 0.00061 1.6E-08 47.7 5.1 95 142-246 32-144 (254) 76 PRK12421 ATP phosphoribosyltra 97.2 0.0009 2.3E-08 46.5 5.8 99 141-241 20-131 (391) 77 PRK06920 dnaE DNA polymerase I 97.2 0.0052 1.3E-07 41.0 9.6 22 196-217 298-326 (1107) 78 PRK08744 consensus 97.2 0.013 3.3E-07 38.1 11.4 11 585-595 970-980 (1195) 79 COG2511 GatE Archaeal Glu-tRNA 97.1 0.0047 1.2E-07 41.3 8.9 116 296-425 279-402 (631) 80 PRK05431 seryl-tRNA synthetase 97.1 0.0041 1.1E-07 41.7 8.0 123 142-277 170-318 (422) 81 PRK06326 consensus 96.9 0.0043 1.1E-07 41.6 7.2 14 587-600 996-1009(1240) 82 COG1107 Archaea-specific RecJ- 96.9 0.0017 4.4E-08 44.5 5.0 86 8-105 204-289 (715) 83 PRK09194 prolyl-tRNA synthetas 96.9 0.0028 7E-08 43.0 5.6 114 141-264 46-188 (570) 84 PRK08512 consensus 96.9 0.0054 1.4E-07 40.9 7.1 15 585-599 973-987 (1185) 85 TIGR00469 pheS_mito phenylalan 96.8 0.00073 1.8E-08 47.1 2.3 67 451-558 266-337 (460) 86 KOG2324 consensus 96.8 0.0023 5.8E-08 43.6 4.8 113 142-263 52-193 (457) 87 PRK12444 threonyl-tRNA synthet 96.7 0.0068 1.7E-07 40.2 6.9 115 142-264 274-414 (639) 88 PRK04483 threonyl-tRNA synthet 96.7 0.0071 1.8E-07 40.1 7.0 113 142-264 271-411 (634) 89 PRK12304 thrS threonyl-tRNA sy 96.7 0.0063 1.6E-07 40.4 6.7 112 142-259 239-371 (604) 90 PRK04028 glutamyl-tRNA(Gln) am 96.6 0.074 1.9E-06 32.7 12.4 202 295-536 279-498 (631) 91 PRK00413 thrS threonyl-tRNA sy 96.6 0.012 3.1E-07 38.4 7.3 119 142-263 268-408 (639) 92 PRK03772 threonyl-tRNA synthet 96.5 0.012 3E-07 38.5 6.8 113 142-260 271-404 (642) 93 PRK04036 DNA polymerase II sma 96.4 0.082 2.1E-06 32.5 10.8 64 16-79 144-209 (497) 94 PRK05425 asparagine synthetase 96.4 0.0044 1.1E-07 41.6 4.0 77 481-561 231-308 (327) 95 cd04483 hOBFC1_like hOBFC1_lik 96.3 0.051 1.3E-06 33.9 9.1 69 23-102 2-90 (92) 96 cd04490 PolII_SU_OBF PolII_SU_ 96.3 0.033 8.3E-07 35.3 8.0 60 20-79 1-62 (79) 97 COG0441 ThrS Threonyl-tRNA syn 96.3 0.013 3.4E-07 38.1 5.9 138 142-290 220-380 (589) 98 KOG1035 consensus 96.1 0.019 4.8E-07 37.0 6.0 106 142-263 932-1058(1351) 99 PRK03991 threonyl-tRNA synthet 96.1 0.017 4.2E-07 37.4 5.7 117 143-262 228-364 (613) 100 TIGR00442 hisS histidyl-tRNA s 96.0 0.017 4.2E-07 37.4 5.4 119 138-261 14-158 (446) 101 cd04487 RecJ_OBF2_like RecJ_OB 95.8 0.045 1.1E-06 34.3 7.0 73 21-104 1-73 (73) 102 COG0172 SerS Seryl-tRNA synthe 95.8 0.061 1.5E-06 33.4 7.7 123 146-281 174-326 (429) 103 cd04485 DnaE_OBF DnaE_OBF: A s 95.6 0.12 3E-06 31.3 8.5 75 22-107 1-80 (84) 104 PRK06253 O-phosphoseryl-tRNA s 95.5 0.014 3.6E-07 37.9 3.3 57 481-564 285-343 (527) 105 cd04478 RPA2_DBD_D RPA2_DBD_D: 95.4 0.25 6.5E-06 28.9 9.5 76 20-105 1-78 (95) 106 cd00645 AsnA Asparagine synthe 95.3 0.033 8.5E-07 35.2 4.7 77 481-561 221-298 (309) 107 COG3705 HisZ ATP phosphoribosy 95.2 0.034 8.6E-07 35.2 4.5 96 142-241 17-125 (390) 108 cd04482 RPA2_OBF_like RPA2_OBF 94.9 0.34 8.7E-06 28.0 9.8 74 22-108 2-78 (91) 109 COG4085 Predicted RNA-binding 94.8 0.25 6.3E-06 29.0 8.0 65 15-79 48-118 (204) 110 KOG2783 consensus 94.6 0.014 3.6E-07 37.9 1.3 75 449-561 247-321 (436) 111 PTZ00213 asparagine synthetase 94.6 0.07 1.8E-06 32.9 4.7 77 481-561 259-336 (350) 112 PRK06253 O-phosphoseryl-tRNA s 94.5 0.028 7E-07 35.8 2.5 65 195-260 177-258 (527) 113 TIGR00418 thrS threonyl-tRNA s 94.3 0.11 2.7E-06 31.6 5.1 108 143-261 213-349 (595) 114 KOG2784 consensus 94.3 0.081 2.1E-06 32.5 4.5 21 149-169 218-238 (483) 115 COG5235 RFA2 Single-stranded D 94.0 0.54 1.4E-05 26.6 8.7 73 20-102 68-141 (258) 116 PRK09348 glyQ glycyl-tRNA synt 94.0 0.037 9.5E-07 34.9 2.3 36 147-192 6-41 (291) 117 cd04492 YhaM_OBF_like YhaM_OBF 93.9 0.56 1.4E-05 26.5 9.7 66 31-108 15-80 (83) 118 pfam02091 tRNA-synt_2e Glycyl- 93.9 0.018 4.6E-07 37.2 0.6 62 147-231 3-66 (284) 119 cd00733 GlyRS_alpha_core Class 93.7 0.022 5.6E-07 36.5 0.8 28 534-561 151-178 (279) 120 pfam04076 BOF Bacterial OB fol 93.5 0.49 1.3E-05 26.9 7.4 77 10-102 49-125 (126) 121 COG3111 Periplasmic protein wi 93.2 0.54 1.4E-05 26.6 7.2 75 14-104 53-127 (128) 122 COG2502 AsnA Asparagine synthe 93.2 0.17 4.4E-06 30.1 4.6 70 488-561 246-315 (330) 123 PRK08661 prolyl-tRNA synthetas 93.0 0.26 6.7E-06 28.8 5.3 115 141-265 44-187 (478) 124 COG3689 Predicted membrane pro 92.7 0.68 1.7E-05 25.8 7.1 11 247-257 157-167 (271) 125 cd04489 ExoVII_LU_OBF ExoVII_L 92.0 1 2.6E-05 24.6 10.2 75 21-104 2-77 (78) 126 PRK00037 hisS histidyl-tRNA sy 90.4 0.4 1E-05 27.5 4.0 54 534-594 300-356 (417) 127 cd00768 class_II_aaRS-like_cor 90.1 0.022 5.6E-07 36.6 -2.8 53 479-546 154-211 (211) 128 PRK10053 hypothetical protein; 89.6 1.7 4.2E-05 23.1 7.3 75 12-102 55-129 (130) 129 PRK12420 histidyl-tRNA synthet 89.5 0.32 8.1E-06 28.2 2.8 43 522-564 299-346 (421) 130 PRK12292 hisZ ATP phosphoribos 89.0 0.68 1.7E-05 25.9 4.2 58 524-592 293-356 (388) 131 COG0752 GlyQ Glycyl-tRNA synth 88.7 0.35 8.9E-06 27.9 2.6 63 146-231 8-72 (298) 132 CHL00201 syh histidine-tRNA sy 88.6 0.53 1.4E-05 26.6 3.5 61 524-592 289-356 (424) 133 COG1570 XseA Exonuclease VII, 88.3 2 5.1E-05 22.5 8.5 76 19-103 24-100 (440) 134 PRK04173 glycyl-tRNA synthetas 88.3 0.23 5.9E-06 29.2 1.4 52 201-254 177-234 (460) 135 cd00769 PheRS_beta_core Phenyl 88.1 2.1 5.3E-05 22.4 6.7 27 145-171 2-28 (198) 136 KOG2509 consensus 87.9 0.45 1.1E-05 27.1 2.7 103 142-259 185-320 (455) 137 TIGR00137 gid gid protein; Int 86.9 0.58 1.5E-05 26.3 2.8 39 385-424 329-378 (444) 138 KOG1936 consensus 86.6 0.44 1.1E-05 27.2 2.0 95 140-242 72-183 (518) 139 pfam08661 Rep_fac-A_3 Replicat 86.6 2.5 6.4E-05 21.8 7.9 67 5-79 7-73 (111) 140 PRK10917 ATP-dependent DNA hel 86.2 2.6 6.6E-05 21.7 10.2 65 13-79 53-120 (677) 141 KOG3108 consensus 85.8 2.7 6.9E-05 21.6 9.2 74 20-103 70-144 (265) 142 COG1571 Predicted DNA-binding 85.8 2.7 7E-05 21.5 8.9 24 20-43 42-66 (421) 143 cd04479 RPA3 RPA3: A subfamily 84.6 3.1 7.9E-05 21.2 8.9 75 9-106 6-80 (101) 144 PRK09537 pylS pyrolysyl-tRNA s 84.2 0.43 1.1E-05 27.3 1.1 29 530-558 376-405 (420) 145 PRK13480 3'-5' exoribonuclease 84.1 3.2 8.2E-05 21.0 11.1 80 17-108 10-94 (314) 146 TIGR00470 sepS O-phosphoseryl- 83.9 0.42 1.1E-05 27.3 0.9 45 126-170 37-84 (558) 147 TIGR00468 pheS phenylalanyl-tR 83.9 1.3 3.2E-05 23.9 3.4 83 449-560 262-348 (362) 148 cd04488 RecG_wedge_OBF RecG_we 81.7 3.9 0.0001 20.4 8.9 56 23-80 2-61 (75) 149 PRK12366 replication factor A; 80.8 4.2 0.00011 20.2 8.2 79 18-107 72-164 (649) 150 COG2024 Phenylalanyl-tRNA synt 80.6 0.86 2.2E-05 25.1 1.5 31 529-559 317-348 (536) 151 TIGR00669 asnA aspartate--ammo 80.2 1.9 4.9E-05 22.6 3.2 70 488-561 246-315 (330) 152 COG1311 HYS2 Archaeal DNA poly 79.5 4.7 0.00012 19.9 7.2 166 17-223 137-305 (481) 153 cd03524 RPA2_OBF_family RPA2_O 78.9 4.8 0.00012 19.8 8.4 69 22-101 1-73 (75) 154 TIGR00409 proS_fam_II prolyl-t 77.2 2.7 6.9E-05 21.6 3.2 109 141-264 47-194 (620) 155 COG0423 GRS1 Glycyl-tRNA synth 76.3 1 2.5E-05 24.7 0.8 31 213-245 190-223 (558) 156 TIGR00389 glyS_dimeric glycyl- 75.6 2.5 6.3E-05 21.9 2.6 39 201-245 199-247 (606) 157 pfam02074 Peptidase_M32 Carbox 74.4 3.2 8.2E-05 21.0 3.0 21 447-467 224-245 (494) 158 pfam04556 DpnII DpnII restrict 73.7 5.8 0.00015 19.2 4.1 29 420-459 231-259 (284) 159 COG0380 OtsA Trehalose-6-phosp 73.3 6.7 0.00017 18.8 5.1 59 143-205 132-202 (486) 160 PRK13902 alaS lanyl-tRNA synth 72.5 1.6 4.1E-05 23.2 1.0 13 94-106 200-212 (898) 161 PRK07275 single-stranded DNA-b 70.5 7.7 0.0002 18.3 10.4 85 20-106 4-105 (163) 162 PRK01584 alanyl-tRNA synthetas 67.9 3.4 8.7E-05 20.8 1.9 21 147-167 5-25 (593) 163 TIGR02422 protocat_beta protoc 66.0 2.8 7.2E-05 21.4 1.2 22 468-489 196-217 (224) 164 TIGR01771 L-LDH-NAD L-lactate 65.6 1.8 4.5E-05 22.9 0.1 72 535-606 226-300 (302) 165 PRK00252 alaS alanyl-tRNA synt 64.7 3.2 8.3E-05 21.0 1.3 21 414-434 561-585 (864) 166 cd04484 polC_OBF polC_OBF: A s 63.9 10 0.00026 17.4 9.9 74 20-103 1-81 (82) 167 TIGR01389 recQ ATP-dependent D 63.1 2.5 6.5E-05 21.8 0.5 106 194-310 187-315 (607) 168 TIGR00614 recQ_fam ATP-depende 62.2 2.7 6.9E-05 21.6 0.5 28 526-557 451-479 (497) 169 TIGR00134 gatE_arch glutamyl-t 61.8 11 0.00028 17.2 6.5 113 294-419 275-396 (631) 170 COG0013 AlaS Alanyl-tRNA synth 60.8 4.5 0.00011 20.0 1.4 27 567-593 773-799 (879) 171 COG5011 Uncharacterized protei 60.5 4.2 0.00011 20.2 1.2 43 240-285 12-56 (228) 172 TIGR01405 polC_Gram_pos DNA po 60.4 12 0.0003 17.0 8.8 80 18-105 7-95 (1264) 173 smart00769 WHy Water Stress an 60.1 3.6 9.2E-05 20.7 0.8 24 474-497 30-53 (100) 174 PRK00286 xseA exodeoxyribonucl 59.7 12 0.00031 16.9 9.9 78 19-105 24-102 (443) 175 PRK07274 single-stranded DNA-b 59.6 12 0.00031 16.9 10.2 83 20-105 4-103 (131) 176 PRK09616 pheT phenylalanyl-tRN 58.6 13 0.00032 16.8 5.8 53 206-261 197-249 (546) 177 pfam02689 Herpes_Helicase Heli 58.5 3.6 9.3E-05 20.7 0.6 10 193-203 223-232 (801) 178 COG2160 AraA L-arabinose isome 58.3 13 0.00032 16.8 4.7 11 479-489 397-407 (497) 179 cd04496 SSB_OBF SSB_OBF: A sub 57.6 13 0.00033 16.7 9.4 56 47-104 45-100 (100) 180 COG0514 RecQ Superfamily II DN 57.2 3.7 9.5E-05 20.6 0.5 13 287-299 293-305 (590) 181 cd06460 M32_Taq Peptidase fami 56.8 11 0.00028 17.2 2.8 32 521-553 349-380 (396) 182 pfam00436 SSB Single-strand bi 56.4 14 0.00035 16.5 6.7 83 20-104 3-104 (104) 183 PRK07211 replication factor A; 56.3 14 0.00035 16.5 8.9 77 20-107 65-153 (473) 184 TIGR02470 sucr_synth sucrose s 55.1 6.4 0.00016 18.9 1.4 32 374-409 371-402 (790) 185 PRK00448 polC DNA polymerase I 54.9 14 0.00037 16.4 8.9 63 17-79 233-301 (1436) 186 pfam03558 TBSV_P22 TBSV core p 54.6 3.3 8.5E-05 20.9 -0.1 62 119-199 99-163 (189) 187 COG0587 DnaE DNA polymerase II 54.5 15 0.00037 16.3 7.9 13 538-550 876-889 (1139) 188 PTZ00274 cytochrome b5 reducta 54.4 9.7 0.00025 17.6 2.2 53 193-250 151-209 (325) 189 cd00673 AlaRS_core Alanyl-tRNA 53.3 8.1 0.00021 18.1 1.7 96 146-263 2-106 (232) 190 COG4930 Predicted ATP-dependen 52.8 4.4 0.00011 20.0 0.3 30 410-439 445-477 (683) 191 cd04474 RPA1_DBD_A RPA1_DBD_A: 52.2 16 0.0004 16.1 8.9 79 20-102 11-100 (104) 192 PRK06752 single-stranded DNA-b 52.0 16 0.00041 16.1 10.1 86 20-107 4-106 (112) 193 TIGR01358 DAHP_synth_II 3-deox 51.6 16 0.00041 16.0 3.9 134 227-444 217-355 (450) 194 KOG1894 consensus 50.1 8.2 0.00021 18.1 1.3 58 145-208 106-175 (412) 195 KOG2653 consensus 49.6 17 0.00044 15.8 4.2 22 155-178 118-139 (487) 196 TIGR02972 TMAO_torE trimethyla 49.3 9.4 0.00024 17.7 1.5 13 524-536 37-50 (50) 197 COG0113 HemB Delta-aminolevuli 49.2 4.4 0.00011 20.0 -0.2 23 148-172 176-198 (330) 198 PRK08182 single-stranded DNA-b 49.0 18 0.00045 15.7 5.6 62 43-106 50-112 (148) 199 COG0095 LplA Lipoate-protein l 48.8 12 0.00029 17.1 1.9 66 427-499 80-146 (248) 200 pfam08022 FAD_binding_8 FAD-bi 48.1 3.9 9.9E-05 20.5 -0.6 23 428-461 31-53 (103) 201 TIGR01622 SF-CC1 splicing fact 48.0 10 0.00025 17.5 1.5 15 250-264 245-259 (531) 202 pfam10778 DehI Halocarboxylic 47.7 13 0.00032 16.8 2.0 11 254-264 13-23 (159) 203 PRK08763 single-stranded DNA-b 47.5 19 0.00047 15.6 10.3 85 20-106 7-110 (160) 204 KOG2298 consensus 47.4 6.7 0.00017 18.7 0.5 91 214-309 213-333 (599) 205 pfam08624 CRC_subunit Chromati 46.8 8.9 0.00023 17.9 1.1 35 215-255 53-89 (139) 206 TIGR01398 FlhA flagellar biosy 45.6 19 0.00048 15.5 2.6 94 338-439 580-675 (713) 207 PRK06958 single-stranded DNA-b 45.6 20 0.0005 15.4 7.3 85 20-106 6-110 (181) 208 COG2317 Zn-dependent carboxype 45.6 5 0.00013 19.7 -0.4 12 449-460 226-237 (497) 209 pfam03223 V-ATPase_C V-ATPase 45.5 20 0.00051 15.4 3.0 11 528-538 307-317 (371) 210 PRK11057 ATP-dependent DNA hel 44.8 7.8 0.0002 18.3 0.5 11 287-297 299-309 (607) 211 COG5608 LEA14-like dessication 44.2 8.4 0.00021 18.1 0.6 10 94-103 73-82 (161) 212 cd04823 ALAD_PBGS_aspartate_ri 43.7 5.5 0.00014 19.4 -0.4 23 148-172 168-190 (320) 213 KOG3358 consensus 43.1 7.7 0.0002 18.3 0.3 43 448-491 100-144 (211) 214 KOG4059 consensus 43.1 11 0.00028 17.3 1.0 25 177-208 125-151 (193) 215 pfam01340 MetJ Met Apo-repress 43.1 18 0.00046 15.7 2.2 42 454-518 60-102 (104) 216 cd01674 Homoaconitase_Swivel H 43.1 9.4 0.00024 17.7 0.7 69 480-554 44-114 (129) 217 cd04491 SoSSB_OBF SoSSB_OBF: A 43.0 22 0.00055 15.1 7.0 59 32-103 20-78 (82) 218 cd06811 PLPDE_III_yhfX_like Ty 42.6 22 0.00056 15.1 8.2 33 430-462 183-215 (382) 219 PRK05264 transcriptional repre 42.4 22 0.00056 15.1 2.6 41 454-517 61-102 (104) 220 PRK02801 primosomal replicatio 42.4 22 0.00056 15.1 7.1 81 19-105 3-101 (101) 221 PRK13384 delta-aminolevulinic 42.4 6.4 0.00016 18.9 -0.2 22 149-172 175-196 (323) 222 TIGR00441 gmhA phosphoheptose 42.1 17 0.00043 15.9 1.9 125 322-456 38-174 (186) 223 cd00384 ALAD_PBGS Porphobilino 41.9 6.3 0.00016 18.9 -0.3 23 148-172 163-185 (314) 224 pfam06934 CTI Fatty acid cis/t 41.8 6.5 0.00017 18.8 -0.2 14 432-445 409-422 (688) 225 PRK07922 N-acetylglutamate syn 41.8 10 0.00026 17.5 0.7 37 154-190 116-160 (170) 226 PRK06751 single-stranded DNA-b 41.6 23 0.00058 15.0 10.8 85 20-106 3-104 (172) 227 PRK06863 single-stranded DNA-b 41.4 23 0.00058 15.0 5.7 85 20-106 6-110 (168) 228 PRK07459 single-stranded DNA-b 41.4 23 0.00058 15.0 10.5 87 20-107 5-104 (121) 229 TIGR00736 nifR3_rel_arch TIM-b 41.0 10 0.00027 17.4 0.7 30 199-239 84-114 (234) 230 KOG1637 consensus 40.5 21 0.00054 15.2 2.2 109 143-259 193-324 (560) 231 PRK00439 leuD 3-isopropylmalat 40.2 16 0.00041 16.0 1.5 20 482-501 49-68 (163) 232 KOG0353 consensus 40.0 9.5 0.00024 17.7 0.4 10 287-296 380-389 (695) 233 PRK06461 single-stranded DNA-b 39.8 24 0.00061 14.8 9.4 86 8-109 6-100 (130) 234 TIGR00787 dctP TRAP transporte 39.6 24 0.00062 14.8 2.4 47 233-282 74-122 (267) 235 pfam00490 ALAD Delta-aminolevu 39.5 6.9 0.00017 18.7 -0.4 23 148-172 171-193 (322) 236 cd00490 Met_repressor_MetJ Met 39.2 23 0.0006 14.9 2.3 41 454-517 60-101 (103) 237 KOG1559 consensus 38.9 9.6 0.00025 17.6 0.2 19 526-544 314-334 (340) 238 PTZ00182 3-methyl-2-oxobutanat 38.9 16 0.0004 16.1 1.3 22 30-51 11-35 (355) 239 cd06546 GH18_CTS3_chitinase GH 38.9 19 0.00048 15.5 1.7 43 209-259 96-139 (256) 240 KOG0188 consensus 38.5 20 0.00051 15.4 1.8 15 35-49 207-222 (895) 241 TIGR01237 D1pyr5carbox2 delta- 38.4 14 0.00035 16.5 1.0 17 62-78 27-43 (518) 242 PRK13867 type IV secretion sys 37.5 7.6 0.00019 18.4 -0.5 14 228-241 31-44 (63) 243 PRK08402 replication factor A; 36.3 27 0.00069 14.4 6.5 78 19-106 75-165 (357) 244 KOG4330 consensus 36.2 13 0.00032 16.8 0.5 49 230-289 143-200 (206) 245 cd01579 AcnA_Bact_Swivel Bacte 35.2 17 0.00043 15.9 1.0 48 483-536 50-100 (121) 246 KOG0230 consensus 35.0 17 0.00044 15.8 1.0 20 528-547 989-1008(1598) 247 PRK06293 single-stranded DNA-b 34.9 28 0.00073 14.3 8.7 61 43-105 40-100 (161) 248 TIGR01993 Pyr-5-nucltdase pyri 34.6 29 0.00073 14.2 3.2 34 380-413 87-122 (205) 249 KOG4079 consensus 34.5 29 0.00074 14.2 3.9 57 175-253 43-103 (169) 250 pfam10105 DUF2344 Uncharacteri 34.3 24 0.00061 14.8 1.7 42 240-284 11-54 (186) 251 cd05707 S1_Rrp5_repeat_sc11 S1 34.1 29 0.00075 14.2 3.8 54 22-79 3-57 (68) 252 PRK02929 L-arabinose isomerase 33.8 30 0.00075 14.1 2.8 53 138-210 249-301 (499) 253 PRK09283 delta-aminolevulinic 33.8 9.3 0.00024 17.7 -0.5 23 148-172 169-191 (321) 254 KOG0351 consensus 33.8 14 0.00035 16.5 0.4 31 123-153 403-443 (941) 255 cd04824 eu_ALAD_PBGS_cysteine_ 33.1 11 0.00028 17.2 -0.2 24 148-172 167-190 (320) 256 PRK11798 ClpXP protease specif 32.9 24 0.00061 14.8 1.5 56 143-206 6-63 (140) 257 CHL00181 cbbX CbbX; Provisiona 32.8 30 0.00077 14.1 2.0 57 378-445 142-198 (287) 258 cd01577 IPMI_Swivel Aconatase- 32.7 18 0.00045 15.8 0.8 50 482-536 18-69 (91) 259 cd05145 RIO1_like RIO kinase f 32.6 31 0.00079 14.0 3.1 28 435-477 156-183 (190) 260 cd01938 ADPGK_ADPPFK ADP-depen 32.4 31 0.00079 14.0 2.5 14 94-107 27-40 (445) 261 cd05147 RIO1_euk RIO kinase fa 32.3 31 0.00079 14.0 3.3 28 435-477 156-183 (190) 262 KOG0142 consensus 32.3 18 0.00046 15.7 0.8 72 500-588 106-184 (225) 263 pfam02610 Arabinose_Isome L-ar 32.0 32 0.0008 13.9 2.8 13 414-426 255-267 (359) 264 COG3655 Predicted transcriptio 31.2 32 0.00083 13.8 2.6 31 376-406 40-70 (73) 265 KOG3567 consensus 31.1 19 0.00049 15.5 0.7 57 186-244 256-323 (501) 266 TIGR02538 type_IV_pilB type IV 30.6 32 0.00082 13.9 1.8 34 496-533 293-328 (577) 267 PRK13411 molecular chaperone D 29.7 32 0.00081 13.9 1.7 31 486-518 328-365 (655) 268 cd05146 RIO3_euk RIO kinase fa 29.6 34 0.00088 13.7 3.3 29 434-477 162-190 (197) 269 cd04246 AAK_AK-DapG-like AAK_A 29.2 28 0.00072 14.3 1.3 12 273-284 189-200 (239) 270 COG1200 RecG RecG-like helicas 29.2 35 0.00089 13.6 10.4 79 15-105 57-139 (677) 271 TIGR02881 spore_V_K stage V sp 28.6 36 0.00091 13.6 2.4 69 363-445 111-180 (261) 272 TIGR02095 glgA glycogen/starch 28.4 16 0.00042 16.0 0.0 15 526-540 422-440 (517) 273 TIGR00408 proS_fam_I prolyl-tR 27.9 37 0.00093 13.5 2.4 121 140-265 37-202 (533) 274 pfam10451 Stn1 Telomere regula 27.9 37 0.00094 13.5 7.6 73 19-105 64-145 (444) 275 COG2844 GlnD UTP:GlnB (protein 27.9 26 0.00066 14.6 0.9 83 170-261 199-297 (867) 276 pfam12109 CXCR4_N CXCR4 Chemok 27.9 21 0.00054 15.2 0.5 11 217-227 21-31 (33) 277 TIGR00388 glyQ glycyl-tRNA syn 27.5 27 0.00068 14.5 1.0 24 146-169 4-27 (295) 278 PRK07080 hypothetical protein; 27.3 34 0.00087 13.7 1.5 41 217-259 160-201 (318) 279 PRK05853 hypothetical protein; 27.0 38 0.00097 13.4 9.4 82 18-101 3-103 (168) 280 TIGR01621 RluA-like pseudourid 26.9 14 0.00035 16.5 -0.6 22 190-212 81-104 (222) 281 PRK10606 btuE putative glutath 26.6 39 0.00098 13.3 4.9 39 326-365 28-67 (183) 282 KOG0100 consensus 26.3 39 0.001 13.3 2.0 74 490-575 367-465 (663) 283 PRK10841 hybrid sensory kinase 26.3 39 0.001 13.3 2.3 13 379-391 628-640 (947) 284 pfam03197 FRD2 Bacteriophage F 26.2 39 0.001 13.3 3.5 41 174-228 21-66 (102) 285 TIGR00092 TIGR00092 GTP-bindin 26.2 22 0.00055 15.1 0.3 134 209-351 98-242 (390) 286 pfam05873 Mt_ATP-synt_D ATP sy 26.1 37 0.00095 13.4 1.5 22 271-292 124-145 (160) 287 pfam06862 DUF1253 Protein of u 25.9 24 0.0006 14.8 0.5 35 228-264 147-182 (435) 288 KOG3706 consensus 25.9 29 0.00075 14.2 1.0 67 340-408 334-400 (629) 289 PRK11260 cystine transporter s 25.9 36 0.00091 13.5 1.4 48 219-292 53-100 (264) 290 KOG3182 consensus 25.8 40 0.001 13.2 1.8 82 141-223 73-166 (212) 291 COG1951 TtdA Tartrate dehydrat 25.7 40 0.001 13.2 2.3 44 524-567 171-223 (297) 292 TIGR02924 ICDH_alpha isocitrat 25.6 28 0.00071 14.3 0.8 166 67-304 26-207 (481) 293 cd04261 AAK_AKii-LysC-BS AAK_A 25.4 39 0.00099 13.3 1.5 11 273-283 189-199 (239) 294 TIGR02708 L_lactate_ox L-lacta 25.3 26 0.00067 14.5 0.6 19 545-563 340-358 (368) 295 COG1982 LdcC Arginine/lysine/o 25.0 28 0.00072 14.3 0.7 44 158-203 203-246 (557) 296 cd04461 S1_Rrp5_repeat_hs8_sc7 24.9 41 0.0011 13.1 3.2 54 17-74 14-68 (83) 297 PRK10203 hypothetical protein; 24.6 28 0.00072 14.3 0.7 11 118-128 37-47 (122) 298 KOG4134 consensus 24.6 37 0.00093 13.5 1.2 22 217-241 128-149 (253) 299 PRK05733 single-stranded DNA-b 24.5 42 0.0011 13.1 6.4 83 20-105 7-112 (172) 300 COG1415 Uncharacterized conser 24.2 43 0.0011 13.0 2.9 28 506-535 279-306 (373) 301 PRK06642 single-stranded DNA-b 24.1 43 0.0011 13.0 5.5 81 20-102 7-108 (152) 302 cd05119 RIO RIO kinase family, 23.9 38 0.00097 13.3 1.2 29 434-477 152-180 (187) 303 TIGR00348 hsdR type I site-spe 23.9 18 0.00046 15.7 -0.4 19 197-215 182-205 (813) 304 KOG3100 consensus 23.8 23 0.00059 14.9 0.1 12 209-220 153-164 (206) 305 pfam01411 tRNA-synt_2c tRNA sy 23.7 43 0.0011 12.9 4.4 96 148-260 2-109 (545) 306 pfam08918 PhoQ_Sensor PhoQ Sen 23.5 44 0.0011 12.9 1.7 35 152-205 101-135 (180) 307 CHL00192 syfB phenylalanyl-tRN 23.2 44 0.0011 12.9 7.1 81 479-587 548-633 (702) 308 cd00144 PP2Ac Protein phosphat 23.2 43 0.0011 13.0 1.4 23 148-170 9-32 (269) 309 COG2037 Ftr Formylmethanofuran 23.1 27 0.00069 14.4 0.3 36 453-490 218-253 (297) 310 COG1031 Uncharacterized Fe-S o 23.1 45 0.0011 12.9 1.6 13 251-263 261-273 (560) 311 pfam06953 ArsD Arsenical resis 22.8 45 0.0012 12.8 3.9 13 378-390 55-67 (119) 312 COG4874 Uncharacterized protei 22.8 35 0.0009 13.6 0.9 14 486-499 295-309 (318) 313 COG1093 SUI2 Translation initi 22.7 45 0.0012 12.8 2.8 36 310-351 136-171 (269) 314 PRK08486 single-stranded DNA-b 22.5 46 0.0012 12.8 7.5 85 20-106 4-107 (180) 315 KOG1491 consensus 22.5 46 0.0012 12.8 4.2 19 207-225 112-130 (391) 316 pfam04930 FUN14 FUN14 family. 22.4 46 0.0012 12.8 2.9 26 518-543 67-92 (97) 317 COG5127 Vacuolar H+-ATPase V1 22.2 34 0.00086 13.7 0.7 14 528-541 306-319 (383) 318 TIGR00414 serS seryl-tRNA synt 22.1 47 0.0012 12.7 5.7 131 142-277 202-359 (460) 319 PRK10838 spr putative outer me 22.1 43 0.0011 12.9 1.2 14 391-404 127-140 (188) 320 TIGR02289 M3_not_pepF oligoend 22.0 47 0.0012 12.7 2.0 39 209-261 249-289 (553) 321 TIGR02151 IPP_isom_2 isopenten 21.9 36 0.00093 13.5 0.8 56 489-560 272-341 (349) 322 pfam11736 DUF3299 Protein of u 21.8 47 0.0012 12.7 8.6 88 10-103 45-142 (144) 323 KOG0352 consensus 21.8 30 0.00077 14.1 0.3 11 287-297 318-328 (641) 324 COG4087 Soluble P-type ATPase 21.7 47 0.0012 12.7 1.4 28 231-258 62-91 (152) 325 COG4944 Uncharacterized protei 21.6 33 0.00083 13.8 0.5 22 522-543 142-166 (213) 326 PRK05813 single-stranded DNA-b 21.5 48 0.0012 12.6 2.3 83 16-105 6-101 (217) 327 cd00340 GSH_Peroxidase Glutath 21.4 48 0.0012 12.6 4.9 40 326-365 25-64 (152) 328 pfam07209 DUF1415 Protein of u 21.4 48 0.0012 12.6 1.4 56 155-223 57-117 (174) 329 TIGR02995 ectoine_ehuB ectoine 21.4 27 0.00069 14.4 0.0 11 154-164 28-38 (285) 330 smart00090 RIO RIO-like kinase 21.3 47 0.0012 12.7 1.2 28 435-477 187-214 (237) 331 TIGR01802 CM_pl-yst chorismate 21.1 40 0.001 13.2 0.9 16 206-221 55-70 (273) 332 pfam06218 NPR2 Nitrogen permea 21.1 42 0.0011 13.0 1.0 26 540-565 324-349 (400) 333 KOG2803 consensus 21.1 44 0.0011 12.9 1.1 75 186-289 20-94 (358) 334 TIGR00987 himA integration hos 21.0 49 0.0012 12.6 4.1 16 248-263 21-36 (96) 335 COG3060 MetJ Transcriptional r 21.0 49 0.0012 12.6 2.2 41 454-517 61-102 (105) 336 COG3122 Uncharacterized protei 21.0 48 0.0012 12.6 1.3 56 150-209 2-58 (215) 337 pfam08415 NRPS Nonribosomal pe 20.9 49 0.0013 12.6 1.4 34 164-199 22-57 (57) 338 PRK11865 pyruvate ferredoxin o 20.8 39 0.001 13.3 0.8 20 440-461 234-253 (298) 339 PRK11107 hybrid sensory histid 20.7 45 0.0011 12.8 1.1 18 245-262 405-422 (920) 340 COG0478 RIO-like serine/threon 20.7 50 0.0013 12.5 1.9 30 434-478 240-269 (304) 341 cd01900 YchF YchF subfamily. 20.3 19 0.00048 15.5 -1.0 19 207-225 90-108 (274) 342 pfam10343 DUF2419 Protein of u 20.2 23 0.00058 14.9 -0.5 39 211-265 130-168 (282) 343 TIGR00175 mito_nad_idh isocitr 20.2 51 0.0013 12.5 2.0 24 244-267 177-200 (348) 344 pfam07711 RabGGT_insert Rab ge 20.1 51 0.0013 12.5 1.6 42 395-448 27-68 (102) 345 TIGR01387 cztR_silR_copR heavy 20.1 39 0.00098 13.3 0.6 24 164-191 69-103 (219) 346 PRK00714 dinucleoside polyphos 20.0 51 0.0013 12.4 2.2 39 225-286 113-151 (156) No 1 >COG0173 AspS Aspartyl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=1725.11 Aligned_cols=582 Identities=47% Similarity=0.802 Sum_probs=564.2 Q ss_pred CCCCCHHHCCHHHCCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCC--CHHHHHHHCCCCCCEEEEEEEEEECC Q ss_conf 7742042369799898899997980044469879999977982599997488--15789985289885899999999758 Q gi|254780419|r 4 YRTHSCGDLGVSDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPD--SACFEIAKSVRSEWVLCIDGLVMSRS 81 (622) Q Consensus 4 ~Rth~cg~l~~~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~--~~~~~~~~~l~~esvv~v~G~V~~r~ 81 (622) ||||+||+|+.+|+|++|+|+|||+++|+||+++|+||||++|++|+|++++ .++++.++.++.|+||.|+|+|++|+ T Consensus 1 ~Rt~~cg~l~~~~vG~~V~L~GWV~r~Rd~GgliFiDLRDr~GivQvv~~~~~~~~~~~~a~~lr~E~vi~V~G~V~~R~ 80 (585) T COG0173 1 MRTHYCGELRESHVGQTVTLSGWVHRRRDHGGLIFIDLRDREGIVQVVFDPEDSPEAFEVASRLRNEFVIQVTGTVRARP 80 (585) T ss_pred CCCEECCCCCHHHCCCEEEEEEEEEECCCCCCEEEEECCCCCCEEEEEECCCCCHHHHHHHHHCCCEEEEEEEEEEEECC T ss_conf 97432220478778987999761654023687689984467870799978876889999997268527999988998467 Q ss_pred CCCCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHCCCE Q ss_conf 65558877786599998899995021179986457667898782112320001211112124589999999875203651 Q gi|254780419|r 82 AETINANIITGQIELSAQKIEILSVAEELPLPVFGEREYPESIRFKYRFLDLRRDTLHKNIVKRTRIINSMRCRMIAENF 161 (622) Q Consensus 82 ~~~~n~~~~tG~~Ei~~~~~~il~~a~~~P~~~~~~~~~~e~~rl~~R~LdLR~~~~~~~l~~Rs~i~~~~R~~l~~~gF 161 (622) ++++|++++||+|||.+++++|||+|++|||++.+..+++||+||+|||||||||.|+.+|++||++++++|+||+++|| T Consensus 81 e~~~N~~l~TGeiEv~a~~i~vln~s~~lPf~i~d~~~~~Ee~RLkYRyLDLRR~~m~~~l~lR~kv~~~iR~~ld~~gF 160 (585) T COG0173 81 EGTINPNLPTGEIEVLAEEIEVLNASKTLPFQIEDETNASEEIRLKYRYLDLRRPEMQKNLKLRSKVTKAIRNFLDDQGF 160 (585) T ss_pred CCCCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 55668888861499996268998647889947788888652220244555636888999999999999999997765498 Q ss_pred EEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEE Q ss_conf 46203321124645664543222221034431123854564433105702331100110377766553800003565410 Q gi|254780419|r 162 IECSTPILTASSPEGARDFLVPSRVNQGSFYALPQAPQQYKQLLMASGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEM 241 (622) Q Consensus 162 ~EveTP~L~ksTpeGArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~Em 241 (622) +|||||+|+|||||||||||||||+|||+||||||||||||||||||||||||||||||||||+|+|||| ||||||+|| T Consensus 161 ~EiETPiLtkSTPEGARDfLVPSRv~~G~FYALPQSPQlfKQLLMvsGfdRYyQIarCFRDEDlRaDRQP-EFTQiD~Em 239 (585) T COG0173 161 LEIETPILTKSTPEGARDFLVPSRVHPGKFYALPQSPQLFKQLLMVAGFDRYYQIARCFRDEDLRADRQP-EFTQIDLEM 239 (585) T ss_pred EEEECCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCC-CCEEEEEEE T ss_conf 5734674146798655443013455899653068887999999988053300235654046644444687-533676776 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCC Q ss_conf 26899999999999999999998087234577321218988751367776444764012443101787524566687403 Q gi|254780419|r 242 SFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFLRICYDDAIRRYGTDKPDLRNPIIMHNVTEHFVHSEFRVFSKILSVN 321 (622) Q Consensus 242 sF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~rmtY~eAm~~YGsDKPDLR~~lei~dit~~~~~~~f~vf~~~~~~~ 321 (622) ||++++|||+++|+|++.+|+++. +++++.|||||||+|||++||||||||||+++|+|++++|++++|++|++++..+ T Consensus 240 SF~~~edv~~~~E~l~~~vf~~~~-~i~l~~pFprmtY~eAm~~YGSDKPDlRf~lel~dvt~~~~~~~F~vF~~~~~~~ 318 (585) T COG0173 240 SFVDEEDVMELIEKLLRYVFKEVK-GIELKTPFPRMTYAEAMRRYGSDKPDLRFPLELIDVTDLFKDSEFKVFSNALANG 318 (585) T ss_pred ECCCHHHHHHHHHHHHHHHHHHHC-CCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCEEHHHHHHHCCCCCCEEHHHHHCC T ss_conf 438899999999999999999850-9754898561059999997389998556883521279874478851303145069 Q ss_pred CCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCHHHHHHHHHHHCCCCCCCEEE Q ss_conf 78479999627876652103567799987642024623552047775333570255325278999999818875572699 Q gi|254780419|r 322 PGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNIVNTEAIRVRLDMQDGDSCFF 401 (622) Q Consensus 322 ~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~~~~~~l~~~~~~~~GD~i~f 401 (622) +.+++..++.. ...||+++|.++++|+.+|++||+|++++.+. ..|||+||++++..++|.+++++++||++|| T Consensus 319 ~~v~ai~vp~~-~~~sRk~id~~~~~ak~~gakGLa~ikv~~~~-----~~gpi~kfl~e~~~~~l~~~~~~~~gD~iff 392 (585) T COG0173 319 GRVKAIVVPGG-ASLSRKQIDELTEFAKIYGAKGLAYIKVEEDG-----LKGPIAKFLSEEILEELIERLGAEDGDIIFF 392 (585) T ss_pred CEEEEEECCCC-CCCCHHHHHHHHHHHHHCCCCCEEEEEEECCC-----CCCHHHHHCCHHHHHHHHHHHCCCCCCEEEE T ss_conf 82799984786-64688999999999997498743899971687-----6661776447899999999828777888999 Q ss_pred EECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEEECCCCCCCCHHHCCCCCCCCCHHHHHCCCHHHHHHHC Q ss_conf 80885411001347889998773003557510288960770535455470002110024766430655418877741313 Q gi|254780419|r 402 IAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYEWNEEEKKIDFAHNPFTMPQGGMESLKGNDLLSIKAFQ 481 (622) Q Consensus 402 ~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe~dee~~r~~s~HHPFTaP~~~~~~l~~~dp~~v~a~~ 481 (622) +||++..+++.||++|.+++++|+|+++++|+||||||||||||+|++++|.|+|||||||++++.++++++|++++|++ T Consensus 393 ~A~~~~~~~~~lGalR~~l~~~l~li~~~~~~f~WVvDFPlfE~~ee~~~~~a~HHPFT~P~~~~~~~l~~~p~~~~a~a 472 (585) T COG0173 393 VADKKKVVNKALGALRLKLGKELGLIDKDQFKFLWVVDFPLFEWDEEEGRYVAAHHPFTMPKPEDLELLEADPESVRARA 472 (585) T ss_pred ECCCHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCCCCCCCCEECCCCCCCCCCCCCHHHHHCCHHHHHHHH T ss_conf 66948899998899999988873877834324899970676677534586740448878998442445416977764532 Q ss_pred CCEEECCEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEEEC Q ss_conf 67777567721251231569999999998599989998667789999860849734222468999999837398403213 Q gi|254780419|r 482 YDLVCNGFEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREVSL 561 (622) Q Consensus 482 YDLVlNG~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdVIa 561 (622) |||||||+||||||||||++++|+++|+.+|++++++++||||||+||+||+|||||||+|||||||||||++||||||| T Consensus 473 YDlVlNG~ElggGSiRIh~~eiQ~~vF~~lg~~~eea~ekFGFll~Af~yGaPPHgGiA~GlDRlvmll~g~~sIReVIA 552 (585) T COG0173 473 YDLVLNGYELGGGSIRIHDPEIQEKVFEILGISPEEAEEKFGFLLEAFKYGAPPHGGIAFGLDRLVMLLTGAESIRDVIA 552 (585) T ss_pred CCEEECCEEECCCEEEECCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHHHHHHHCCCCCHHHEEE T ss_conf 56776027513543640788999999998199989999887789999765899976242459999999828974442131 Q ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHCCCEEEC Q ss_conf 78999871034688672898899870835605 Q gi|254780419|r 562 FPMSQNFCDLLMGSPSTVSVEQLRELGLRIVE 593 (622) Q Consensus 562 FPKt~~g~Dlm~~aPs~v~~~qL~e~~i~~~~ 593 (622) ||||++|.||||+|||.|+++||+||+|++.+ T Consensus 553 FPKt~~a~dllt~APs~v~~~qL~El~i~~~~ 584 (585) T COG0173 553 FPKTQQAADLLTNAPSEVDEKQLKELSIKVRP 584 (585) T ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHCCEECCC T ss_conf 47886555501379975899998750100257 No 2 >TIGR00459 aspS_bact aspartyl-tRNA synthetase; InterPro: IPR004524 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Aspartyl tRNA synthetase 6.1.1.12 from EC is an alpha2 dimer that belongs to class IIb. Structural analysis combined with mutagenesis and enzymology data on the yeast enzyme point to a tRNA binding process that starts by a recognition event between the tRNA anticodon loop and the synthetase anticodon binding module . This family represents aspartyl-tRNA synthetases from the bacteria and from mitochondria. In some species, this enzyme aminoacylates tRNA for both Asp and Asn; Asp-tRNA(asn) is subsequently transamidated to Asn-tRNA(asn). ; GO: 0000166 nucleotide binding, 0004815 aspartate-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006422 aspartyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=1730.52 Aligned_cols=587 Identities=42% Similarity=0.716 Sum_probs=567.4 Q ss_pred CCC-CCHHHCCHHHCCCEEEEEEEEC-CCCCCCC-EEEEEEECCCC-EEEEEECCC--CHHHHHHHCCCCCCEEEEEEEE Q ss_conf 774-2042369799898899997980-0444698-79999977982-599997488--1578998528988589999999 Q gi|254780419|r 4 YRT-HSCGDLGVSDVGSFVRLSGWVH-RVRPHGG-IIFLDIRDHYG-ITQVVANPD--SACFEIAKSVRSEWVLCIDGLV 77 (622) Q Consensus 4 ~Rt-h~cg~l~~~~~g~~V~i~GwV~-~~R~~g~-l~F~~lrD~sG-~~Q~v~~~~--~~~~~~~~~l~~esvv~v~G~V 77 (622) +|| ||||+|+.+++||+|||+|||+ ++|++|+ ++|++|||++| +|||+++++ ...+..++.||.|+||.|+|+| T Consensus 2 ~r~t~~cg~l~~~~~G~~Vtl~GWv~qr~rdlG~nliFidlRD~~GdivQv~~~p~~~~~a~~~a~~lr~E~vv~v~G~v 81 (653) T TIGR00459 2 MRTTHYCGQLSEKHLGKTVTLAGWVKQRRRDLGGNLIFIDLRDRSGDIVQVVCDPDVSKDALELAKGLRNEDVVQVKGKV 81 (653) T ss_pred CCCEEEECCCCCCCCCCEEEEEEEEECCCCCCCCCEEEEEEECCCCCEEEEEECCCCCHHHHHHHHHCCCCEEEEEEEEE T ss_conf 86304501343011687799985431340115881799986258888899986775678899999733552289999999 Q ss_pred EECCCCCCCCCCCCCEEEEEEEEEEEEE-CCCCCCCCCCC-CCCCCHH---HHCCCCCEEECCCCCCCCHHHHHHHHHHH Q ss_conf 9758655588777865999988999950-21179986457-6678987---82112320001211112124589999999 Q gi|254780419|r 78 MSRSAETINANIITGQIELSAQKIEILS-VAEELPLPVFG-EREYPES---IRFKYRFLDLRRDTLHKNIVKRTRIINSM 152 (622) Q Consensus 78 ~~r~~~~~n~~~~tG~~Ei~~~~~~il~-~a~~~P~~~~~-~~~~~e~---~rl~~R~LdLR~~~~~~~l~~Rs~i~~~~ 152 (622) .+|+++..|++++||+|||.+++|+|+| +|+++|+.+.+ +.++.|+ +||+|||||||||.|+.+|++||+|+.++ T Consensus 82 ~~R~~~~~~~~l~tg~~Ei~~~~i~~~NG~s~~~P~~~~Df~~~a~e~~EDvRLkYRYLDLRrp~~~~~L~lR~~v~~~~ 161 (653) T TIGR00459 82 SARPEGSINRNLDTGEIEILAEEITLLNGKSKTPPLIIEDFKTDAEEEREDVRLKYRYLDLRRPEMQQRLKLRHKVTKAV 161 (653) T ss_pred EECCCCCCCCCCCCCEEEEECCCEEEEECCCCCCCCEEECCCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHH T ss_conf 86585344655676348898186268612126879403235200000101666100222204878999986688989999 Q ss_pred HHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCCC----HHHHHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCCCCCCCC Q ss_conf 87520365146203321124645664543222221----03443112385456443310570233110011037776655 Q gi|254780419|r 153 RCRMIAENFIECSTPILTASSPEGARDFLVPSRVN----QGSFYALPQAPQQYKQLLMASGFDRYFQIAPCFRDEDPRSD 228 (622) Q Consensus 153 R~~l~~~gF~EveTP~L~ksTpeGArdFlVPsR~~----~g~fYALpQSPQlyKQlLmv~G~dryfqiarcFRdEd~R~d 228 (622) |+||+++||+|||||||||||||||||||||||+| +|+|||||||||||||||||||+|||||||||||||||||| T Consensus 162 Rn~l~~~gFleiETP~LtKsTPEGARDyLVPsRv~rsdG~g~FYALpQSPQlfKQLLMvsG~drYyQIArCFRDEDLRAD 241 (653) T TIGR00459 162 RNFLDQQGFLEIETPILTKSTPEGARDYLVPSRVHRSDGKGEFYALPQSPQLFKQLLMVSGVDRYYQIARCFRDEDLRAD 241 (653) T ss_pred HHHHHHCCCCEEECCCCCCCCCCCHHHCCCCCCCCCCCCCCCEECCCCCHHHHHHHHHHCCCCCCEEEEEEECCCCCCCC T ss_conf 86664278706506777788843110157755300368588451468975689998751255531357889766256665 Q ss_pred CCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHC---CCCCCCC---CCC-----------CCHHHHHHHCCCCCC Q ss_conf 38000035654102689999999999999999999808---7234577---321-----------218988751367776 Q gi|254780419|r 229 RLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSN---GNTVSNN---FLR-----------ICYDDAIRRYGTDKP 291 (622) Q Consensus 229 Rqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~---~~~i~~p---F~r-----------mtY~eAm~~YGsDKP 291 (622) ||| ||||||+||||++++|||+++|+|+.+||+++.+ -++|+.| ||| |||+|||++|||||| T Consensus 242 RQP-EFTQiD~E~SF~~~~~vm~l~E~l~~~vf~~v~~krGl~~L~~~GtlfP~kkenGtvsifrmty~EAm~~yGsDKP 320 (653) T TIGR00459 242 RQP-EFTQIDMEMSFMTQEDVMDLIEKLVSGVFKEVKGKRGLIDLKKPGTLFPVKKENGTVSIFRMTYQEAMERYGSDKP 320 (653) T ss_pred CCC-CCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCEEEEECCCCEEEEEECCHHHHHHHCCCCCC T ss_conf 778-7324426311289356899999999999999707851375078852411221676079985567999861677687 Q ss_pred CCCCCCCCCCCCCCCC-----CCCHHHHHHHHHCCCCCEEEEEECCCC--CCCCCCHHHHH---HHHHHHHHCCCCCEEE Q ss_conf 4447640124431017-----875245666874037847999962787--66521035677---9998764202462355 Q gi|254780419|r 292 DLRNPIIMHNVTEHFV-----HSEFRVFSKILSVNPGYEVWAIPAKGG--GGNRAFCDRMN---EWARSQGQPGLAYIFW 361 (622) Q Consensus 292 DLR~~lei~dit~~~~-----~~~f~vf~~~~~~~~~~~v~~i~~~~~--~~srk~~D~l~---e~ak~~g~kgL~~i~~ 361 (622) |||||+++.|++++|+ +++|+||++++... +..+.+|..++. .+||+.++.+. ++|+.+|++||||+.+ T Consensus 321 DLR~~~el~dv~d~f~afshld~~F~vF~~~~~~P-~g~v~~i~v~~~~~~ls~~~i~~l~~~~~~a~~~GakGlAy~~v 399 (653) T TIGR00459 321 DLRFGLELIDVSDLFKAFSHLDSEFKVFSNLINAP-GGRVKAIRVKGGDAELSRKSIKELRVLIKFAKEYGAKGLAYLKV 399 (653) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHCC-CCEEEEEECCCCCCCCCHHHHHHHCHHHHHHHHCCCCEEEEEEE T ss_conf 78888346777667777664067534579998488-85577577278642255021222011445676658631566565 Q ss_pred EECCCCCCCCCCCHHHHCCHHHHHHHHHHHCCCCCCCEE--EEECC-CCCCCCHHH--HHHHHHHHHHH--CCCCCCEEE Q ss_conf 204777533357025532527899999981887557269--98088-541100134--78899987730--035575102 Q gi|254780419|r 362 RLDESGNVAGMGPVAKHLNIVNTEAIRVRLDMQDGDSCF--FIAGD-PLKFCSFAA--DARNHIAQELG--IINCNCFEF 434 (622) Q Consensus 362 ~~~~~~~~~~~spi~K~l~~~~~~~l~~~~~~~~GD~i~--f~a~~-~~~~~~~lg--~lR~~ia~~l~--li~~~~~~f 434 (622) +.+..+.....|||+|||+.+..+.|.+.+.++.||++| |.+|. ...+.++|| ++|++++++|+ |++++.|+| T Consensus 400 ~~~~~~~~~~~~~i~kfL~~~~~~~~~~~t~a~~gdillnPfg~~~l~~~v~~~lgGkalRl~l~~~l~sWL~d~~~~~f 479 (653) T TIGR00459 400 NEDAEGLEEVKSPIKKFLDEKKLKKLLERTDAQEGDILLNPFGAGDLKKIVLDALGGKALRLKLGKDLGSWLVDPDAFSF 479 (653) T ss_pred ECCCCCCCCCCCHHHHCCCHHHHHHHHHHHHCCCCCEEECCCCCHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCE T ss_conf 21674102453101202473577889987421268786557786058999998504168899987653000235231471 Q ss_pred EEEEE---------CCCEEECCCCCCCCHHHCCCCCCCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHH Q ss_conf 88960---------770535455470002110024766430655418877741313677775677212512315699999 Q gi|254780419|r 435 CWIMD---------FPFYEWNEEEKKIDFAHNPFTMPQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVML 505 (622) Q Consensus 435 lWV~D---------FPLFe~dee~~r~~s~HHPFTaP~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~ 505 (622) |||+| |||||+|+++++|.|+|||||||+.++.+.+++.|.++.|+|||||+|||||||||||||++++|+ T Consensus 480 lWv~DksGkkeklaFP~fE~d~~E~~~~a~HHpFT~Pk~~~~~~lE~~p~~a~a~aYDlVlNG~ElGGGSiRIh~~~~Q~ 559 (653) T TIGR00459 480 LWVVDKSGKKEKLAFPLFEKDEEEGRLVAAHHPFTMPKPEDLEDLETAPEEALAEAYDLVLNGVELGGGSIRIHDPEVQE 559 (653) T ss_pred EEEEECCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCHHHHHCCHHHHHHHCCCEEEEEEEECCCEEEECCHHHHH T ss_conf 78983788702342765667765275584057889875122787740745541202876883069887104642878999 Q ss_pred HHHH-HCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEEECCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 9999-859998999866778999986084973422246899999983739840321378999871034688672898899 Q gi|254780419|r 506 QAFS-NVGISKEVVENRFGGLYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREVSLFPMSQNFCDLLMGSPSTVSVEQL 584 (622) Q Consensus 506 ~~f~-~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdVIaFPKt~~g~Dlm~~aPs~v~~~qL 584 (622) ++|+ .||++++++.++||||||||+|||||||||||||||||||||+++|||||||||||++|+||||+|||.|+++|| T Consensus 560 ~~F~d~Lgid~eea~EkFGFLLeA~~yGaPPHgG~A~GlDRL~Mll~~~~nIRDVIAFPKt~~a~cL~T~APs~~~~~~L 639 (653) T TIGR00459 560 KVFEDILGIDPEEAREKFGFLLEAFKYGAPPHGGLALGLDRLVMLLTGTDNIRDVIAFPKTAQASCLLTEAPSEIDEKQL 639 (653) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCEECCCCCCCCHHHHHHCCCCCCCHHHH T ss_conf 99987628898998776356875873576851015555899999980899722001788760124341148775798884 Q ss_pred HHCCCEEE Q ss_conf 87083560 Q gi|254780419|r 585 RELGLRIV 592 (622) Q Consensus 585 ~e~~i~~~ 592 (622) +||+|+.. T Consensus 640 ~~l~i~~~ 647 (653) T TIGR00459 640 EELSIKYS 647 (653) T ss_pred HHHHHHHC T ss_conf 11323313 No 3 >PRK12820 bifunctional aspartyl-tRNA synthetase/aspartyl/glutamyl-tRNA amidotransferase subunit C; Provisional Probab=100.00 E-value=0 Score=1546.63 Aligned_cols=599 Identities=37% Similarity=0.646 Sum_probs=555.2 Q ss_pred CCHHHCCHHHCCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCC---HHHHHHHCCCCCCEEEEEEEEEECCCC Q ss_conf 20423697998988999979800444698799999779825999974881---578998528988589999999975865 Q gi|254780419|r 7 HSCGDLGVSDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPDS---ACFEIAKSVRSEWVLCIDGLVMSRSAE 83 (622) Q Consensus 7 h~cg~l~~~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~~---~~~~~~~~l~~esvv~v~G~V~~r~~~ 83 (622) -|||+|+.+|+|++|+|+|||+++|++|+++|++|||++|++|||++++. +.++.+++|+.||||.|+|+|++|+++ T Consensus 7 ~ycg~l~~~~vG~~V~l~GWV~~~R~~G~l~FidLRD~~G~vQvv~~~~~~~~d~~~~~~~l~~E~vI~V~G~V~~r~~~ 86 (706) T PRK12820 7 SFCGHLSLDDTGREVCLAGWVDAFRDHGELLFIHLRDRNGFIQAVFSPEAAPADVYELAASLRAEFCVALQGEVQKRLEE 86 (706) T ss_pred CCCCCCCHHHCCCEEEEEEEEEEEECCCCEEEEEEEECCCCEEEEECCCCCCHHHHHHHHCCCCCCEEEEEEEEEECCCC T ss_conf 45134994889998999786878966898499999728945899986886888899998539986789999899934765 Q ss_pred CCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCC-----------CCCHHHHCCCCCEEECCCCCCCCHHHHHHHHHHH Q ss_conf 55887778659999889999502117998645766-----------7898782112320001211112124589999999 Q gi|254780419|r 84 TINANIITGQIELSAQKIEILSVAEELPLPVFGER-----------EYPESIRFKYRFLDLRRDTLHKNIVKRTRIINSM 152 (622) Q Consensus 84 ~~n~~~~tG~~Ei~~~~~~il~~a~~~P~~~~~~~-----------~~~e~~rl~~R~LdLR~~~~~~~l~~Rs~i~~~~ 152 (622) +.|++++||+|||.+++++|||+|+++||++.++. +++||+||+||||||||++++++|++||++++++ T Consensus 87 ~~n~~~~tGeiEI~v~~l~iLs~a~~lPf~i~~~~~~~~~~~~~~~~~~ee~RLkyRyLDLRr~~~~~~~~~Rskv~~~i 166 (706) T PRK12820 87 TENPHIETGDIEVFVRELSILAASEALPFAISDKAMTAGAGSAGADAVNEDLRLQYRYLDIRRPAMQDHLAKRHRIIKCA 166 (706) T ss_pred CCCCCCCCCCEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH T ss_conf 66888988618999769999877889998856333433344444456678887665677647998999999999999999 Q ss_pred HHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCCCCCCCCCCCC Q ss_conf 87520365146203321124645664543222221034431123854564433105702331100110377766553800 Q gi|254780419|r 153 RCRMIAENFIECSTPILTASSPEGARDFLVPSRVNQGSFYALPQAPQQYKQLLMASGFDRYFQIAPCFRDEDPRSDRLPG 232 (622) Q Consensus 153 R~~l~~~gF~EveTP~L~ksTpeGArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G~dryfqiarcFRdEd~R~dRqp~ 232 (622) |+||+++||+|||||+|++|||||||||+||||.|+|+||||||||||||||||||||||||||||||||||+|+|||| T Consensus 167 R~~l~~~gF~EVETP~L~~stpEGArdflvPsr~~~~~fyaLpQSPQlyKQlLMvgG~dRyFqIarcFRdEd~r~dRqP- 245 (706) T PRK12820 167 RDFLDSRGFLEIETPILTKSTPEGARDYLVPSRIHPKEFYALPQSPQLFKQLLMIAGFERYFQLARCFRDEDLRPNRQP- 245 (706) T ss_pred HHHHHHCCCEEEECCEECCCCCCCCCCCCCCEECCCCCEECCCCCHHHHHHHHHHCCCCEEEEEEEEECCCCCCCCCCC- T ss_conf 9999877988966870445675324575002445887650056682999999986286617996001128999988798- Q ss_pred CEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHH Q ss_conf 00356541026899999999999999999998087234577321218988751367776444764012443101787524 Q gi|254780419|r 233 EFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFLRICYDDAIRRYGTDKPDLRNPIIMHNVTEHFVHSEFR 312 (622) Q Consensus 233 EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~rmtY~eAm~~YGsDKPDLR~~lei~dit~~~~~~~f~ 312 (622) ||||||+||||+|++|||+++|+|++++|+ +. +..++.|||||||+|||++||||||||||+++++|++++|+++.|+ T Consensus 246 EFTqlDiEmSF~d~edvm~l~E~li~~~~~-~~-~~~l~~pFprmtY~eAm~~YGsDKPDlRf~lel~d~t~if~~~~f~ 323 (706) T PRK12820 246 EFTQLDIEASFIDEEFIFELIEELTARMFA-IG-GIALPRPFPRMPYAEAMDTTGSDRPDLRFDLKFADATDIFENTRYG 323 (706) T ss_pred CEEEEEEEHHCCCHHHHHHHHHHHHHHHHH-HC-CCCCCCCCCCCCHHHHHHHHCCCCCCCCCCHHHCCCCHHHCCCCCH T ss_conf 340687421348999999999999999998-53-8454687453419999998689874310230102330122155405 Q ss_pred HHHHHHHCCCCCEEEEEECCC--CCCCCCCH--HHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCHHHHHHHH Q ss_conf 566687403784799996278--76652103--56779998764202462355204777533357025532527899999 Q gi|254780419|r 313 VFSKILSVNPGYEVWAIPAKG--GGGNRAFC--DRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNIVNTEAIR 388 (622) Q Consensus 313 vf~~~~~~~~~~~v~~i~~~~--~~~srk~~--D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~~~~~~l~ 388 (622) +|++++..++ .+.++..++ ...+++.+ +...+.++..|++|++|+++.. + .+.+||+||++++..+++. T Consensus 324 vF~~~l~~gg--~Vk~I~v~~~~~~~s~~~i~~e~~~eia~~~gakGl~w~k~~~---~--~l~~~i~kf~see~~~~l~ 396 (706) T PRK12820 324 IFKQILQRGG--RIKGINIKGQSEKLSKNVLQNEYAKEIAPSFGAKGMTWMRAEA---G--GLDSNIVQFFSADEKEALK 396 (706) T ss_pred HHHHHHHHCC--EEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEC---C--CCCCHHHHCCCHHHHHHHH T ss_conf 8999885088--6889976883566566665669999862430656625898616---8--6463144307999999999 Q ss_pred HHHCCCCCCCEEEEECCC-CCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEEECCCCCCCCHHHCCCCCCCCCHH Q ss_conf 981887557269980885-4110013478899987730035575102889607705354554700021100247664306 Q gi|254780419|r 389 VRLDMQDGDSCFFIAGDP-LKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYEWNEEEKKIDFAHNPFTMPQGGME 467 (622) Q Consensus 389 ~~~~~~~GD~i~f~a~~~-~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe~dee~~r~~s~HHPFTaP~~~~~ 467 (622) +.+++++||+++|+++.. ..++..||.+|.+++++++|++++.|+|||||||||||++++. +|.|+|||||||+.++. T Consensus 397 ~~~~~~~GD~i~~~ad~~~~~v~~~LG~lR~~la~~l~Li~~~~~~flWVtDFPLFE~~ee~-~~~s~HHPFT~P~~ed~ 475 (706) T PRK12820 397 RRFHAEDGDVIIMIADASCAIVLSALGQLRLHLADRLGLIPEGVFHPLWITDFPLFEATDDG-GVTSSHHPFTAPDREDF 475 (706) T ss_pred HHHCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCCCC-CCCCCCCCCCCCCCCCC T ss_conf 98487889889997275066899886267899999848878786316885167422345444-65335489778882336 Q ss_pred HHHC-CCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHHH Q ss_conf 5541-887774131367777567721251231569999999998599989998667789999860849734222468999 Q gi|254780419|r 468 SLKG-NDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMPPHGGIAAGIDRI 546 (622) Q Consensus 468 ~l~~-~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~alGlDRl 546 (622) +.+. +++++++|+||||||||+|||||||||||+++|+++|+.+|+++++++++|||||+||+|||||||||||||||| T Consensus 476 d~~~~~~~~~v~a~aYDLVlNG~ElggGSiRIHd~~iQ~~if~~lGl~~ee~~~kFgf~LeAf~YGaPPHGGialGlDRL 555 (706) T PRK12820 476 DPGDIEELLDLRSRAYDLVVNGEELGGGSIRINDKDIQLRIFAALGLSEEDIEDKFGFFLRAFDFAAPPHGGIALGLDRV 555 (706) T ss_pred CCCCCCCCCHHHCCEEEEEECCEEEEEEEEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEHHHHH T ss_conf 83224553200036460689999961402203899999999997599978988899999999866999867031039999 Q ss_pred HHHHCCCCCCEEEECCCCCCCCCCCCCCCCCCCCHHHHHHCCCEEECCCHHHHHCCCHHH-HHHHCCCEEE Q ss_conf 999837398403213789998710346886728988998708356057301000000135-5542272576 Q gi|254780419|r 547 VMLLLGAKNVREVSLFPMSQNFCDLLMGSPSTVSVEQLRELGLRIVENPKKILKISVTTI-LRNVYGLKII 616 (622) Q Consensus 547 vmll~g~~sIRdVIaFPKt~~g~Dlm~~aPs~v~~~qL~e~~i~~~~~~k~~~~~~~~~~-~~~~~~~~~~ 616 (622) ||+|||++|||||||||||++|+|||++|||+|+++||+||||--.-+-+.++..+.+.+ ++|+.+|.-| T Consensus 556 vMlLtg~~sIRDVIaFPKt~~g~dlm~~aPs~v~~~QL~El~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 626 (706) T PRK12820 556 VSMILQTPSIREVIAFPKNRSAACPLTGAPSEVAQEQLAELGLLDLGDADQLPGDAEKEDLIDHLSWVSRI 626 (706) T ss_pred HHHHCCCCCHHHEECCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 99976998265052278998878600279886899999763774247755466655503457788888752 No 4 >PRK00476 aspS aspartyl-tRNA synthetase; Validated Probab=100.00 E-value=0 Score=1432.73 Aligned_cols=585 Identities=47% Similarity=0.794 Sum_probs=560.8 Q ss_pred CCCCCCCHHHCCHHHCCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCC-CHHHHHHHCCCCCCEEEEEEEEEEC Q ss_conf 877742042369799898899997980044469879999977982599997488-1578998528988589999999975 Q gi|254780419|r 2 HRYRTHSCGDLGVSDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPD-SACFEIAKSVRSEWVLCIDGLVMSR 80 (622) Q Consensus 2 ~~~Rth~cg~l~~~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~-~~~~~~~~~l~~esvv~v~G~V~~r 80 (622) |.|||||||||+++|+|++|+|+|||+++|++||++|++|||++|.+|||++++ .+.++.+++|+.||+|.|+|+|++| T Consensus 1 ~~~rth~~~~l~~~~~g~~V~v~GwV~~~R~~Gkl~F~~LrD~~G~iQiv~~~~~~~~~~~~~~l~~e~iv~V~G~v~~r 80 (587) T PRK00476 1 HMMRTHYCGELRESHVGQTVTLCGWVHRRRDHGGVIFIDLRDREGIVQVVFDPDDAEAFAVAESLRSEYVIQVTGKVRAR 80 (587) T ss_pred CCCCCCCCCCCCHHHCCCEEEEEEEEEEEECCCCCEEEEEEECCCCEEEEECCCCHHHHHHHHCCCCCCEEEEEEEEEEC T ss_conf 99656613128847698989997857536647982999998389788999918877899999669998789999899935 Q ss_pred CCCCCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHCCC Q ss_conf 86555887778659999889999502117998645766789878211232000121111212458999999987520365 Q gi|254780419|r 81 SAETINANIITGQIELSAQKIEILSVAEELPLPVFGEREYPESIRFKYRFLDLRRDTLHKNIVKRTRIINSMRCRMIAEN 160 (622) Q Consensus 81 ~~~~~n~~~~tG~~Ei~~~~~~il~~a~~~P~~~~~~~~~~e~~rl~~R~LdLR~~~~~~~l~~Rs~i~~~~R~~l~~~g 160 (622) |++.+|++++||++||.+++++|||+|+++||+++++.++++++||+|||||||++.++++|++||++++++|+||+++| T Consensus 81 p~~~~n~~~~tGeiei~~~~i~iLska~~lP~~~~~~~~~~e~~Rl~~RyLDLr~~~~~~~fr~Rs~i~~~iR~fl~~~g 160 (587) T PRK00476 81 PEGTVNPNLATGEIEVLASELEILNKSKTLPFQIDDETNVSEELRLKYRYLDLRRPEMQNNLKLRSKVTSAIRNFLDDNG 160 (587) T ss_pred CCCCCCCCCCCCCEEEEEEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 76566877888728999708999876889998865566777888655478887217788999999999999999999759 Q ss_pred EEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEE Q ss_conf 14620332112464566454322222103443112385456443310570233110011037776655380000356541 Q gi|254780419|r 161 FIECSTPILTASSPEGARDFLVPSRVNQGSFYALPQAPQQYKQLLMASGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDME 240 (622) Q Consensus 161 F~EveTP~L~ksTpeGArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~E 240 (622) |+|||||+|++|||||||||+||||+|+|+||||||||||||||||||||+|||||||||||||+|++||| ||||||+| T Consensus 161 FiEVeTPiL~~s~~eGA~~F~vpsr~~~~~fyaL~qSPQLykq~L~vgG~erVyeig~~FRnE~~~t~r~p-EFt~lE~E 239 (587) T PRK00476 161 FLEIETPILTKSTPEGARDYLVPSRVHPGKFYALPQSPQLFKQLLMVAGFDRYYQIARCFRDEDLRADRQP-EFTQIDIE 239 (587) T ss_pred CCEECCCCCCCCCCCCCCCCCCCCCCCCCCEECCCCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCC-HHHEEEHH T ss_conf 74004754325686666443013105788551356797999999885175527996431238888766682-34231111 Q ss_pred ECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHC Q ss_conf 02689999999999999999999808723457732121898875136777644476401244310178752456668740 Q gi|254780419|r 241 MSFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFLRICYDDAIRRYGTDKPDLRNPIIMHNVTEHFVHSEFRVFSKILSV 320 (622) Q Consensus 241 msF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~rmtY~eAm~~YGsDKPDLR~~lei~dit~~~~~~~f~vf~~~~~~ 320 (622) |||+|++|||+++|+|++++|+++.+ ..++.|||||||+|||.+|||||||||+++++.|+++++++++|++|.+.... T Consensus 240 ~af~d~~dvm~~~E~li~~v~~~v~~-~~l~~pf~rit~~eai~~yG~DKPDlR~~~~l~d~~~~~~~~~~~~f~~~~~~ 318 (587) T PRK00476 240 MSFVTQEDVMDLMEGLIRHVFKEVLG-VDLTKPFPRMTYAEAMRRYGSDKPDLRFPLELVDVTDLFKDSGFKVFAGAAND 318 (587) T ss_pred HHCCCHHHHHHHHHHHHHHHHHHHCC-CCCCCCCCEEEHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHC T ss_conf 11168999999999999999999719-74788842456999999857999654576215466887615641678999860 Q ss_pred CCCCEEEEEECCC-CCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCHHHHHHHHHHHCCCCCCCE Q ss_conf 3784799996278-766521035677999876420246235520477753335702553252789999998188755726 Q gi|254780419|r 321 NPGYEVWAIPAKG-GGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNIVNTEAIRVRLDMQDGDSC 399 (622) Q Consensus 321 ~~~~~v~~i~~~~-~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~~~~~~l~~~~~~~~GD~i 399 (622) ....+.+++.+. ...+|+.+|.+.++++..|++||.|+.+... .+.||++|+++++..+.+.+++++++||++ T Consensus 319 -~~~~v~~i~~p~~~~~~r~~id~l~~~~~~~~~~~l~~i~~~~~-----~~~spi~k~~~~~~~~~l~~~~~~~~gd~~ 392 (587) T PRK00476 319 -PGGRVKAIRVPGGASLSRKQIDELTEFAKIYGAKGLAYIKVNED-----GLKGPIAKFLSEEELEALLERTGAKDGDLI 392 (587) T ss_pred -CCCEEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEEECC-----CCCCHHHHHCCHHHHHHHHHHHCCCCCCEE T ss_conf -58869999746867666667999999999818877337998055-----565607764699999999998488778779 Q ss_pred EEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEEECCCCCCCCHHHCCCCCCCCCHHHHHCCCHHHHHH Q ss_conf 99808854110013478899987730035575102889607705354554700021100247664306554188777413 Q gi|254780419|r 400 FFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYEWNEEEKKIDFAHNPFTMPQGGMESLKGNDLLSIKA 479 (622) Q Consensus 400 ~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe~dee~~r~~s~HHPFTaP~~~~~~l~~~dp~~v~a 479 (622) ||++|.+..+++.+|.+|.++++++++++++.++|+|||||||||+++++++|+|+|||||||++++.+++.+||++++| T Consensus 393 ~f~~~~~~~~~~~lg~~r~~~~~~~~li~~~~~~p~fV~dfPlfe~~~~~~~~~a~hhPFt~P~~~~~~~~~~dp~~~~~ 472 (587) T PRK00476 393 FFGADKKKVVNDALGALRLKLGKELGLIDEDKFAFLWVVDFPMFEYDEEEGRYVAAHHPFTMPKDEDLELLETDPGKALA 472 (587) T ss_pred EEECCCCHHHHHHHCHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHCCHHHHHE T ss_conf 99359607888763317999988846666666642689814531144202563102177458885378877538123234 Q ss_pred HCCCEEECCEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEE Q ss_conf 13677775677212512315699999999985999899986677899998608497342224689999998373984032 Q gi|254780419|r 480 FQYDLVCNGFEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREV 559 (622) Q Consensus 480 ~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdV 559 (622) ++||||+||+||||||+||||+++|+++|+.+|+++++++++|+|||+||+||||||||||||||||||+|||++||||| T Consensus 473 ~~yDLv~~G~El~~Gs~Ri~d~~~q~~~~~~~gl~~~~~~~~F~~yl~al~yG~PPhgG~GlGiDRLvmlltg~~nIRDV 552 (587) T PRK00476 473 YAYDLVLNGYELGGGSIRIHRPEIQEKVFEILGISEEEAEEKFGFLLDALKYGAPPHGGIAFGLDRLVMLLAGADSIRDV 552 (587) T ss_pred EEEEEEECCEEECCCEEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCEEEECHHHHHHHHCCCCCHHEE T ss_conf 67405307789513503227999999999985999899988888999997679999763761499999997599955306 Q ss_pred ECCCCCCCCCCCCCCCCCCCCHHHHHHCCCEEECC Q ss_conf 13789998710346886728988998708356057 Q gi|254780419|r 560 SLFPMSQNFCDLLMGSPSTVSVEQLRELGLRIVEN 594 (622) Q Consensus 560 IaFPKt~~g~Dlm~~aPs~v~~~qL~e~~i~~~~~ 594 (622) ||||||++|.|||++||+.|+++||++|||++..+ T Consensus 553 I~FPkt~~g~d~~~~~p~~~~~~~l~~~~~~~~~~ 587 (587) T PRK00476 553 IAFPKTQSAADLLTGAPSPVDEKQLRELGIRVRKK 587 (587) T ss_pred EECCCCCCCCCCCCCCCCCCCHHHHHHCCEEEECC T ss_conf 50789999877335899868999998548585169 No 5 >KOG2411 consensus Probab=100.00 E-value=0 Score=1337.83 Aligned_cols=580 Identities=37% Similarity=0.621 Sum_probs=514.7 Q ss_pred CCCCCCHHHCCHHHCCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCCH--HHHHHHCCCCCCEEEEEEEEEEC Q ss_conf 7774204236979989889999798004446987999997798259999748815--78998528988589999999975 Q gi|254780419|r 3 RYRTHSCGDLGVSDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPDSA--CFEIAKSVRSEWVLCIDGLVMSR 80 (622) Q Consensus 3 ~~Rth~cg~l~~~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~~~--~~~~~~~l~~esvv~v~G~V~~r 80 (622) .+|||+||+|...++|++|.+|||+...|.+|.+.|+.|||.+|++|..++.++- .......++.||+|+|.|+|..| T Consensus 32 ~~~t~~~~el~~~~vg~kv~l~GWl~~~~~~k~~~F~~LRD~~G~vq~lls~~s~~l~~~~~~~v~~e~vv~v~gtvv~R 111 (628) T KOG2411 32 VSRTSLCGELSVNDVGKKVVLCGWLELHRVHKMLTFFNLRDAYGIVQQLLSPDSFPLAQKLENDVPLEDVVQVEGTVVSR 111 (628) T ss_pred HHHHCCCHHHCCCCCCCEEEEEEEEEEEECCCCCEEEEEECCCCCEEEEECCHHHHHHHCCCCCCCHHHEEEEEEEEECC T ss_conf 76641010311576687799963023200025625787502586168985630456776045798754467663357426 Q ss_pred CCCCCCCCCCCCEEEEEEEEEEEEECCC-CCCCCCCCCC----CCCHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHH Q ss_conf 8655588777865999988999950211-7998645766----7898782112320001211112124589999999875 Q gi|254780419|r 81 SAETINANIITGQIELSAQKIEILSVAE-ELPLPVFGER----EYPESIRFKYRFLDLRRDTLHKNIVKRTRIINSMRCR 155 (622) Q Consensus 81 ~~~~~n~~~~tG~~Ei~~~~~~il~~a~-~~P~~~~~~~----~~~e~~rl~~R~LdLR~~~~~~~l~~Rs~i~~~~R~~ 155 (622) |...+|++|+||.+||.++.++++|.+. .+||.+.+.. ..+|.+||+|||||||+|.||.++|+||.+++.+|+| T Consensus 112 p~~sin~km~tg~vev~~e~~~vln~~~~~~p~~v~df~~ld~~~~er~rl~~RyldLR~~kmq~nLrlRS~~v~~iR~y 191 (628) T KOG2411 112 PNESINSKMKTGFVEVVAEKVEVLNPVNKKLPFEVTDFKELDDLAGERIRLRFRYLDLRRPKMQNNLRLRSNVVKKIRRY 191 (628) T ss_pred CCCCCCCCCCCCCEEEEEEEEEEECCCCCCCCCCHHHHHHHHCCCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHH T ss_conf 54546841235613787534488457667786200003221003244013100112124687887899999999999999 Q ss_pred HHCC-CEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCCCCCCCCCCCCCE Q ss_conf 2036-514620332112464566454322222103443112385456443310570233110011037776655380000 Q gi|254780419|r 156 MIAE-NFIECSTPILTASSPEGARDFLVPSRVNQGSFYALPQAPQQYKQLLMASGFDRYFQIAPCFRDEDPRSDRLPGEF 234 (622) Q Consensus 156 l~~~-gF~EveTP~L~ksTpeGArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G~dryfqiarcFRdEd~R~dRqp~EF 234 (622) |.+. ||+|||||+|+|+||||||+|+||||..+|+|||||||||||||||||||+||||||||||||||+|+|||| || T Consensus 192 l~n~~GFvevETPtLFkrTPgGA~EFvVPtr~~~g~FYaLpQSPQQfKQlLMvsGidrYyQiARCfRDEdlR~DRQP-EF 270 (628) T KOG2411 192 LNNRHGFVEVETPTLFKRTPGGAREFVVPTRTPRGKFYALPQSPQQFKQLLMVSGIDRYYQIARCFRDEDLRADRQP-EF 270 (628) T ss_pred HHHHCCEEECCCCCHHCCCCCCCCEEECCCCCCCCCEEECCCCHHHHHHHHHHHCHHHHHHHHHHHCCCCCCCCCCC-CC T ss_conf 86104735515720201488864123223678988555368898999999987230367767766413456765687-64 Q ss_pred EEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 35654102689999999999999999999808723457732121898875136777644476401244310178752456 Q gi|254780419|r 235 YQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFLRICYDDAIRRYGTDKPDLRNPIIMHNVTEHFVHSEFRVF 314 (622) Q Consensus 235 TqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~rmtY~eAm~~YGsDKPDLR~~lei~dit~~~~~~~f~vf 314 (622) |||||||||++++|||+++|+|+.++|.+.. ++.++.|||||||+|||++|||||||+||+++|.|+++.+..+ | T Consensus 271 TQvD~EMsF~~~~dim~liEdll~~~ws~~k-~~~l~~PF~riTY~~Am~~YG~DKPD~Rf~l~l~dv~~~~~~~----~ 345 (628) T KOG2411 271 TQVDMEMSFTDQEDIMKLIEDLLRYVWSEDK-GIQLPVPFPRITYADAMDKYGSDKPDTRFPLKLKDVSESISKS----F 345 (628) T ss_pred EEEEEEEECCCHHHHHHHHHHHHHHHCHHHC-CCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCEECCHHHHHHHH----H T ss_conf 3544478536778789999999997362421-9988888664239999997389998665776732267765665----5 Q ss_pred HHHHHCCCCCEEEEEECCCCCC------CCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCHHHHHHHH Q ss_conf 6687403784799996278766------5210356779998764202462355204777533357025532527899999 Q gi|254780419|r 315 SKILSVNPGYEVWAIPAKGGGG------NRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNIVNTEAIR 388 (622) Q Consensus 315 ~~~~~~~~~~~v~~i~~~~~~~------srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~~~~~~l~ 388 (622) ......++.....+++...... .|+.++.+ .+.+.+|+...... +.+..|..++.++..+....++. T Consensus 346 ~~~~~~d~l~~~~~vv~~~~~~~~s~~~~rk~~e~~----~~~~~~~~~f~~~~---~~~~~w~~~~~~l~~e~~~~el~ 418 (628) T KOG2411 346 EEALLSDGLISVEIVVLGAKKYSNSNYKKRKIYESL----GKSGFVGLPFPIEH---DANKNWFKKFSSLTDEAVAIELI 418 (628) T ss_pred HHHHHCCCCCCEEEEEECCCCCCCHHHHHHHHHHHH----HHCCCCCCEEEEEC---CCHHHHHHHCHHHCCCHHHHHHH T ss_conf 555414885405899951434441256776799987----54346663001532---62011544132422423789999 Q ss_pred HHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHH--HCCCCCCEEEEEEEECCCEEECCCCC-CCCHHHCCCCCCCCC Q ss_conf 981887557269980885411001347889998773--00355751028896077053545547-000211002476643 Q gi|254780419|r 389 VRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQEL--GIINCNCFEFCWIMDFPFYEWNEEEK-KIDFAHNPFTMPQGG 465 (622) Q Consensus 389 ~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l--~li~~~~~~flWV~DFPLFe~dee~~-r~~s~HHPFTaP~~~ 465 (622) ..+++++||++++++|.+..+++.||++|+.+...+ ++.+++.+.|+||||||||++.+|.+ ++.|+|||||||+++ T Consensus 419 ~~~~l~~~Div~~~~g~~~~~~~~LgrlRL~l~~l~~~~l~d~~~~~~lWVvDFPLF~p~~E~~q~L~StHHPFTaP~~~ 498 (628) T KOG2411 419 KQLGLAEGDIVLLCVGPKVSVNTPLGRLRLVLQDLLVKNLRDKSKFSFLWVVDFPLFSPWEEKNQRLESTHHPFTAPHPK 498 (628) T ss_pred HHHCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEECCCCCCCCCCCCEECCCCCCCCCCCHH T ss_conf 98588878889993268766777237789999998751643666534899983355676566775300025887788855 Q ss_pred HHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHH Q ss_conf 06554188777413136777756772125123156999999999859998999866778999986084973422246899 Q gi|254780419|r 466 MESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMPPHGGIAAGIDR 545 (622) Q Consensus 466 ~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~alGlDR 545 (622) |.+++.+.|.+++|+|||||+||+|||||||||||+++|+++++.++..++++.++ ||||+||.+|||||||||+|||| T Consensus 499 Di~~L~s~p~~~rgqhYDlV~NGvElGGGSiRIhn~diQr~vLe~iLk~p~~~~s~-gHLL~ALd~GaPPHGGiAlGlDR 577 (628) T KOG2411 499 DIDLLESAPEKIRGQHYDLVVNGVELGGGSIRIHNPDIQRYVLEDILKIPEDAESK-GHLLNALDMGAPPHGGIALGLDR 577 (628) T ss_pred HHHHHHCCCHHHHCCEEEEEECCEEECCCEEEECCHHHHHHHHHHHHCCCHHHHHH-HHHHHHHHCCCCCCCCEEECHHH T ss_conf 66676518166524300167714762473467528889999999985596666667-78998862699998744412788 Q ss_pred HHHHHCCCCCCEEEECCCCCCCCCCCCCCCCCCCCHHHHHHCCCEEECCCH Q ss_conf 999983739840321378999871034688672898899870835605730 Q gi|254780419|r 546 IVMLLLGAKNVREVSLFPMSQNFCDLLMGSPSTVSVEQLRELGLRIVENPK 596 (622) Q Consensus 546 lvmll~g~~sIRdVIaFPKt~~g~Dlm~~aPs~v~~~qL~e~~i~~~~~~k 596 (622) |||+|||++|||||||||||.+|.|||++|||.|+++||+.|||++.++.+ T Consensus 578 lvaml~~a~sIRDVIAFPKt~~G~Dlls~sPs~ip~e~L~~Y~I~~~~~~~ 628 (628) T KOG2411 578 LVAMLTGAPSIRDVIAFPKTTTGADLLSNSPSEIPEEQLEDYNIRVSNSTK 628 (628) T ss_pred HHHHHCCCCCHHEEEECCCCCCCCCCCCCCCCCCCHHHHHHCCEEECCCCC T ss_conf 999980897522256446657754313489888997883313055047899 No 6 >PRK02983 lysS lysyl-tRNA synthetase; Provisional Probab=100.00 E-value=0 Score=919.86 Aligned_cols=443 Identities=22% Similarity=0.318 Sum_probs=361.5 Q ss_pred CCCCCHHHCCHHHCCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCCHHH----HHHHCCCCCCEEEEEEEEEE Q ss_conf 77420423697998988999979800444698799999779825999974881578----99852898858999999997 Q gi|254780419|r 4 YRTHSCGDLGVSDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPDSACF----EIAKSVRSEWVLCIDGLVMS 79 (622) Q Consensus 4 ~Rth~cg~l~~~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~~~~~----~~~~~l~~esvv~v~G~V~~ 79 (622) .|||||||++.++.|++|+|+|||+++|+|||++|+||||++|.+|+|++++.... +....+..|++|+|+|+|.. T Consensus 643 ~RTHT~~e~~~~~~G~~VtvaGwV~r~RDhGgl~FidLRD~~G~iQvv~~~d~~g~~~~~~~~~~vdlGDvI~V~G~V~~ 722 (1099) T PRK02983 643 PPSHTVAQALAAPDGEEVSVSGRVLRIRDYGGVLFAQLRDWSGELQVLLDNSRLGQGRLADFTAAVDLGDLIEATGTMGA 722 (1099) T ss_pred CCCCCHHHHCCCCCCCEEEEEEEEEEEECCCCEEEEEEEECCCCEEEEEECCCCCHHHHHHHHHHCCCCCEEEEEEEEEC T ss_conf 99741666424799998999987999843788699997418876799997775676899998762577887999858862 Q ss_pred CCCCCCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCHHHHCCCCCEEEC-CCCCCCCHHHHHHHHHHHHHHHHC Q ss_conf 5865558877786599998899995021179986457667898782112320001-211112124589999999875203 Q gi|254780419|r 80 RSAETINANIITGQIELSAQKIEILSVAEELPLPVFGEREYPESIRFKYRFLDLR-RDTLHKNIVKRTRIINSMRCRMIA 158 (622) Q Consensus 80 r~~~~~n~~~~tG~~Ei~~~~~~il~~a~~~P~~~~~~~~~~e~~rl~~R~LdLR-~~~~~~~l~~Rs~i~~~~R~~l~~ 158 (622) + +|||+||.+++++++|+|.. |+|.+.+...++|+|+|||||||| ||+++++|++||++++++|+||++ T Consensus 723 t---------~tGEisV~a~~~~vlsKsL~-PLP~k~~gl~d~E~RlR~RYLDLr~npe~r~~~~~RS~Vi~aiR~~L~~ 792 (1099) T PRK02983 723 S---------RNGTPSLLVTSWRLIGKCLR-PLPDKWKGLTDPEARVRARYLDLAVNPEARDLIRARSAVLRAVRETLVA 792 (1099) T ss_pred C---------CCCCEEEEEEEEEEEECCCC-CCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 7---------89845799748998404567-9997677988776755645565158989999999999999999999987 Q ss_pred CCEEEEECCCCCC-CCCCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCCCCCCCCCCCCCEEEE Q ss_conf 6514620332112-464566454322222103443112385456443310570233110011037776655380000356 Q gi|254780419|r 159 ENFIECSTPILTA-SSPEGARDFLVPSRVNQGSFYALPQAPQQYKQLLMASGFDRYFQIAPCFRDEDPRSDRLPGEFYQL 237 (622) Q Consensus 159 ~gF~EveTP~L~k-sTpeGArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G~dryfqiarcFRdEd~R~dRqp~EFTql 237 (622) +||+|||||||+. ++|+|||||+||+|.++++|| ||||||||||+||||||||||||||||||||+|+|||| ||||| T Consensus 793 ~GFlEVETPiLq~~~GGA~ARPFlThsna~d~~~Y-LriAPELflKRLmVGGfeRVFEI~RcFRNEglradHnP-EFTmL 870 (1099) T PRK02983 793 KGFLEVETPILQQIHGGANARPFLTHINAYDLDLY-LRIAPELYLKRLCVGGVERVFELGRAFRNEGVDFSHNP-EFTLL 870 (1099) T ss_pred CCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCE-EECCHHHHHHHHHHCCCCCCEEECCCCCCCCCCCCCCC-CHHHH T ss_conf 89889557546778887567853010336786614-51480899999986486311343254467999987494-07999 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 54102689999999999999999999808723457732121898875136777644476401244310178752456668 Q gi|254780419|r 238 DMEMSFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFLRICYDDAIRRYGTDKPDLRNPIIMHNVTEHFVHSEFRVFSKI 317 (622) Q Consensus 238 D~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~rmtY~eAm~~YGsDKPDLR~~lei~dit~~~~~~~f~vf~~~ 317 (622) |+||||+|++|+|+++|+|+.++++++.+...+ -|||.+++++..|++..++...+. T Consensus 871 E~Y~Ay~Dyedmm~ltEeLi~~~a~~~~G~~v~------------------~~pd~~~~~~~iDls~~w~~~t~~----- 927 (1099) T PRK02983 871 EAYQAHADYLVMRDLCRELIQNAAQAANGEPVV------------------MRPGTDGVLEPVDISGEWPVKTVH----- 927 (1099) T ss_pred HHHHHHCCHHHHHHHHHHHHHHHHHHHHCCEEE------------------ECCCCCCCCEEEECCCCCEEEEHH----- T ss_conf 999984789999999999999999995196578------------------616877887686569995265399----- Q ss_pred HHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCHHHHHHHHHHHCCCCCC Q ss_conf 74037847999962787665210356779998764202462355204777533357025532527899999981887557 Q gi|254780419|r 318 LSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNIVNTEAIRVRLDMQDGD 397 (622) Q Consensus 318 ~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~~~~~~l~~~~~~~~GD 397 (622) +.+.+ ..|. .+... .+.+...++.++.+.+ T Consensus 928 ------------------------eai~e---~~G~--------------di~~~------~~~e~L~~~a~~~gi~--- 957 (1099) T PRK02983 928 ------------------------EAVSE---ALGE--------------EIDAD------TPLETLRRLCDAAGIP--- 957 (1099) T ss_pred ------------------------HHHHH---HHCC--------------CCCCC------CCHHHHHHHHHHCCCC--- T ss_conf ------------------------99999---8388--------------78999------9999999999977997--- Q ss_pred CEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEEECCCCCCCCHHHCCCCCCCCCHHHHHCCCHHHH Q ss_conf 26998088541100134788999877300355751028896077053545547000211002476643065541887774 Q gi|254780419|r 398 SCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYEWNEEEKKIDFAHNPFTMPQGGMESLKGNDLLSI 477 (622) Q Consensus 398 ~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe~dee~~r~~s~HHPFTaP~~~~~~l~~~dp~~v 477 (622) .......|.+..++.++ +++++...|+||+|||+ .|||||+|+.++ |. T Consensus 958 ---------~~~~~~~G~lv~eL~e~--lVE~~l~~PtFV~DfP~------------e~sPLarp~~~d-------p~-- 1005 (1099) T PRK02983 958 ---------YLTHWDAGAVVLELYEH--LVEDRTTEPTFYTDFPT------------SVSPLTRPHRSD-------PG-- 1005 (1099) T ss_pred ---------CCCCCCHHHHHHHHHHH--HCCCCCCCCEEEECCCC------------CCCCCCCCCCCC-------CC-- T ss_conf ---------78999977999999998--52737667789989972------------239041778999-------68-- Q ss_pred HHHCCCEEECCEEECCCCEECCCHHHHHHHHH-----HCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHHHHHHHCC Q ss_conf 13136777756772125123156999999999-----8599989998667789999860849734222468999999837 Q gi|254780419|r 478 KAFQYDLVCNGFEIASGGIRNHMSEVMLQAFS-----NVGISKEVVENRFGGLYRAFQCGMPPHGGIAAGIDRIVMLLLG 552 (622) Q Consensus 478 ~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~-----~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~alGlDRlvmll~g 552 (622) .++.|||||||+|||+||+|+|||..|+++|. ..+.++|.++ .+..||+||+||||||||+|+|+|||||+||| T Consensus 1006 lAeRfDLvinG~ELgnGysELnDP~eQR~Rf~eQ~~~~~ggD~Eam~-~DEdFL~ALeYGmPPhGGialGIDRLVMLLTG 1084 (1099) T PRK02983 1006 VAERWDLVAWGVELGTAYSELTDPVEQRRRLTEQSLLAAGGDPEAME-LDEDFLQALEYAMPPTGGLGMGVDRLVMLITG 1084 (1099) T ss_pred EEEEEEEEECCEEEECCHHHHCCHHHHHHHHHHHHHHHHCCCCHHHH-CHHHHHHHHCCCCCCCCCEEEHHHHHHHHHCC T ss_conf 27877255558997021035169999999999999998679900233-21789998431899736475449999999629 Q ss_pred CCCCEEEECCCCC Q ss_conf 3984032137899 Q gi|254780419|r 553 AKNVREVSLFPMS 565 (622) Q Consensus 553 ~~sIRdVIaFPKt 565 (622) . ||||||+||-- T Consensus 1085 ~-SIREVIaFPl~ 1096 (1099) T PRK02983 1085 R-SIRETLPFPLA 1096 (1099) T ss_pred C-CHHHHCCCCCC T ss_conf 9-75987438787 No 7 >PRK12445 lysyl-tRNA synthetase; Reviewed Probab=100.00 E-value=0 Score=885.58 Aligned_cols=431 Identities=24% Similarity=0.406 Sum_probs=348.1 Q ss_pred CC-CCCCHHHCCHHH----------CCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCC---HHH-HHHHCCCC Q ss_conf 77-742042369799----------8988999979800444698799999779825999974881---578-99852898 Q gi|254780419|r 3 RY-RTHSCGDLGVSD----------VGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPDS---ACF-EIAKSVRS 67 (622) Q Consensus 3 ~~-Rth~cg~l~~~~----------~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~~---~~~-~~~~~l~~ 67 (622) .| |||+|+++.+.. .+.+|+|+|||+++|++||++|++|||++|.+||+++++. ..+ +..+.+.. T Consensus 39 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~V~vaGrv~~~R~~Gk~~F~~LrD~~G~iQ~~~~~~~~~~~~~~~~~~~~~~ 118 (505) T PRK12445 39 DFRRDHTSDQLHEEFDAKDNQELESLNIEVSVAGRMMTRRIMGKASFVTLQDVGGRIQLYVARDSLPEGVYNDQFKKWDL 118 (505) T ss_pred CCCCCCCHHHHHHHHCCCCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEECCEEEEEEEECCCCCHHHHHHHHHHCCC T ss_conf 89587709999998545684100147988999988987874798399999859843999995886866899999751215 Q ss_pred CCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCHHHHCCCCCEEE-CCCCCCCCHHHHH Q ss_conf 858999999997586555887778659999889999502117998645766789878211232000-1211112124589 Q gi|254780419|r 68 EWVLCIDGLVMSRSAETINANIITGQIELSAQKIEILSVAEELPLPVFGEREYPESIRFKYRFLDL-RRDTLHKNIVKRT 146 (622) Q Consensus 68 esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~~~~~~il~~a~~~P~~~~~~~~~~e~~rl~~R~LdL-R~~~~~~~l~~Rs 146 (622) |++|+|+|++. +.+||++||.|+++++||+|.. |+|.+.+...++|+||+|||||| ||++++++|++|| T Consensus 119 gdii~v~G~~~---------~t~tGe~~i~~~~~~ilsksl~-plP~k~~g~~d~e~R~r~RyLDLir~~~~~~~~~~Rs 188 (505) T PRK12445 119 GDIIGARGTLF---------KTQTGELSIHCTELRLLTKALR-PLPDKFHGLQDQEVRYRQRYLDLIANDKSRQTFVVRS 188 (505) T ss_pred CEEEEEEEEEE---------CCCCCCCCEEEEEEEEECCCCC-CCCCCCCCCCCHHHHHHHHEECCCCCHHHHHHHHHHH T ss_conf 34999963660---------4787762125789999347778-9897455787887763411114365999999999999 Q ss_pred HHHHHHHHHHHCCCEEEEECCCCCCCCCCC--CCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCCCC Q ss_conf 999999875203651462033211246456--645432222210344311238545644331057023311001103777 Q gi|254780419|r 147 RIINSMRCRMIAENFIECSTPILTASSPEG--ARDFLVPSRVNQGSFYALPQAPQQYKQLLMASGFDRYFQIAPCFRDED 224 (622) Q Consensus 147 ~i~~~~R~~l~~~gF~EveTP~L~ksTpeG--ArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G~dryfqiarcFRdEd 224 (622) +|++++|+||+++||+|||||+| ++|||| ||+|+||++.+++.|| ||||||||||+|||||||||||||||||||| T Consensus 189 ~ii~~iR~~l~~~gF~EVeTPil-~~~~gGa~Arpf~t~~~~~~~~~y-L~qSPQLykk~LmvgGfdRvfeI~r~FRnE~ 266 (505) T PRK12445 189 KILAAIRQFMVARGFMEVETPMM-QVIPGGASARPFITHHNALDLDMY-LRIAPELYLKRLVVGGFERVFEINRNFRNEG 266 (505) T ss_pred HHHHHHHHHHHHCCCEEEECCCC-CCCCCCCCCCEEEECCCCCCCCEE-ECCCCHHHHHHHHHCCCCCEEEHHHHHHCCC T ss_conf 99999999999779389978875-554587666103304667886743-4169399999998579744576278760788 Q ss_pred CCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCC---------CCCCCCCCCCHHHHHHHCC--CCCCCC Q ss_conf 66553800003565410268999999999999999999980872---------3457732121898875136--777644 Q gi|254780419|r 225 PRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGN---------TVSNNFLRICYDDAIRRYG--TDKPDL 293 (622) Q Consensus 225 ~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~---------~i~~pF~rmtY~eAm~~YG--sDKPDL 293 (622) +|++||| ||||||+||||+|++|||+++|+|++++++++.+.. .+..||+|+||.||+.+|. +|..|+ T Consensus 267 ~~~~H~P-EFT~lE~e~af~d~~dvm~l~E~li~~i~~~v~g~~~i~y~~~~~d~~~pf~rlt~~eai~~~~~~~~~~~~ 345 (505) T PRK12445 267 ISVRHNP-EFTMMELYMAYADYHDLIELTESLFRTLAQEVLGTTKVTYGEHVFDFGKPFEKLTMREAIKKYRPETDMADL 345 (505) T ss_pred CCCCCCH-HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCEEEECCCHHCCCCCCCCCCHHHHHHHHCCCCCHHHC T ss_conf 8866544-454356876327898999999999999999844883586652012016984103499999985688874533 Q ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCC Q ss_conf 47640124431017875245666874037847999962787665210356779998764202462355204777533357 Q gi|254780419|r 294 RNPIIMHNVTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMG 373 (622) Q Consensus 294 R~~lei~dit~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~s 373 (622) . .++.....++..|.+ + .. T Consensus 346 ---------~------------------------------------~~~~~~~~~~~~g~~------~--~~-------- 364 (505) T PRK12445 346 ---------D------------------------------------NFDAAKALAESIGIT------V--EK-------- 364 (505) T ss_pred ---------C------------------------------------CHHHHHHHHHHCCCC------C--CC-------- T ss_conf ---------4------------------------------------469999999985985------3--44-------- Q ss_pred CHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEEECCCCCCCC Q ss_conf 02553252789999998188755726998088541100134788999877300355751028896077053545547000 Q gi|254780419|r 374 PVAKHLNIVNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYEWNEEEKKID 453 (622) Q Consensus 374 pi~K~l~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe~dee~~r~~ 453 (622) .+.+|.+..++.++ +++++..+|+||+|||. T Consensus 365 ------------------------------------~~~~g~l~~~~~~e--~~e~~l~~P~fv~d~P~----------- 395 (505) T PRK12445 365 ------------------------------------SWGLGRIVTEIFDE--VAEAHLIQPTFITEYPA----------- 395 (505) T ss_pred ------------------------------------CCCHHHHHHHHHHH--HHHHCCCCCCCCCCCCC----------- T ss_conf ------------------------------------43442799999887--77750555542447754----------- Q ss_pred HHHCCCCCCCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHHC----CCCHHHHHHHHHHHHHHH Q ss_conf 2110024766430655418877741313677775677212512315699999999985----999899986677899998 Q gi|254780419|r 454 FAHNPFTMPQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSNV----GISKEVVENRFGGLYRAF 529 (622) Q Consensus 454 s~HHPFTaP~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~l----gl~~ee~~~~Fg~lL~Al 529 (622) .+.||+++++. ||. .+.+||||+||.||||||+|||||+.|+++|+.. +..++++...+++||+|| T Consensus 396 -e~splak~~~~-------~p~--~t~rfdl~i~G~Ei~nG~~el~Dp~~Q~~rf~~q~~~k~~g~~e~~~~de~fl~Al 465 (505) T PRK12445 396 -EVSPLARRNDV-------NPE--ITDRFEFFIGGREIGNGFSELNDAEDQAERFQEQVNAKAAGDDEAMFYDEDYVTAL 465 (505) T ss_pred -CCCCHHCCCCC-------CHH--HHHEEEEEECCEEEEECEECCCCHHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHH T ss_conf -45830016799-------841--40145351189998746022689999999999988777547660001228899996 Q ss_pred HCCCCCCCCEECCHHHHHHHHCCCCCCEEEECCCCCC Q ss_conf 6084973422246899999983739840321378999 Q gi|254780419|r 530 QCGMPPHGGIAAGIDRIVMLLLGAKNVREVSLFPMSQ 566 (622) Q Consensus 530 ~yG~PPHgG~alGlDRlvmll~g~~sIRdVIaFPKt~ 566 (622) +||||||||||||+|||||+|||++||||||+||... T Consensus 466 ~yG~PPhgG~glGiDRLvMlLtg~~sIRdVI~FP~~r 502 (505) T PRK12445 466 EYGLPPTAGLGIGIDRMIMLFTNSHTIRDVILFPAMR 502 (505) T ss_pred HCCCCCHHCCHHHHHHHHHHHCCCCCHHEECCCCCCC T ss_conf 5699850034379999999981999564071699989 No 8 >PRK05159 aspC aspartyl-tRNA synthetase; Provisional Probab=100.00 E-value=0 Score=877.61 Aligned_cols=416 Identities=33% Similarity=0.579 Sum_probs=344.5 Q ss_pred CCCCCHHHCCHHHCCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCC---CHHHHHHHCCCCCCEEEEEEEEEEC Q ss_conf 7742042369799898899997980044469879999977982599997488---1578998528988589999999975 Q gi|254780419|r 4 YRTHSCGDLGVSDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPD---SACFEIAKSVRSEWVLCIDGLVMSR 80 (622) Q Consensus 4 ~Rth~cg~l~~~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~---~~~~~~~~~l~~esvv~v~G~V~~r 80 (622) |||||||||+++++|++|+|+|||+++|++||++|++|||++|.+|||++++ .+.++.+++|+.||+|.|+|+|++. T Consensus 2 ~r~h~~~dl~~~~~g~~V~v~G~v~~~R~~Gkl~Fi~LrD~~g~iQ~~~~~~~~~~~~~~~~~~l~~g~~v~v~G~v~~~ 81 (434) T PRK05159 2 MKRHYTSEVTPELDGKEVTLAGWVHEIRDLGGIKFLLLRDRTGIIQVVVPKKKVDEELFEGIKKLSRESVVSVTGVVKES 81 (434) T ss_pred CCCCCCCCCCHHHCCCEEEEEEEEEEEECCCCCEEEEEEECCCCEEEEEECCCCCHHHHHHHHCCCCCEEEEEEEEEECC T ss_conf 86313154994779999999898872881798599999969802899997998998999998579996299999999858 Q ss_pred CCCCCCCCCCCCEEEEEEEEEEEEECCCC-CCCCCCCCCCCCHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHCC Q ss_conf 86555887778659999889999502117-99864576678987821123200012111121245899999998752036 Q gi|254780419|r 81 SAETINANIITGQIELSAQKIEILSVAEE-LPLPVFGEREYPESIRFKYRFLDLRRDTLHKNIVKRTRIINSMRCRMIAE 159 (622) Q Consensus 81 ~~~~~n~~~~tG~~Ei~~~~~~il~~a~~-~P~~~~~~~~~~e~~rl~~R~LdLR~~~~~~~l~~Rs~i~~~~R~~l~~~ 159 (622) + + .+|++||.+++++|||+|.. +|+.+.++.+.++++|++|||||||++.++++|++||++++++|+||.++ T Consensus 82 ~------~-~~~~~ev~~~~i~ils~a~~~lP~~~~~~~~~~~~~rl~~R~LdLr~~~~~~~~r~Rs~i~~~iR~fl~~~ 154 (434) T PRK05159 82 E------K-APNGVEVQPEEIEVLNKADEPLPLDPSEKVEAELDTRLDNRFLDLRRPRVQAIFKIRSEVLRAFREFLYEE 154 (434) T ss_pred C------C-CCCCEEEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHCHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9------9-99758999989999834777788765455676988886530145239789999999999999999999877 Q ss_pred CEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEE Q ss_conf 51462033211246456645432222210344311238545644331057023311001103777665538000035654 Q gi|254780419|r 160 NFIECSTPILTASSPEGARDFLVPSRVNQGSFYALPQAPQQYKQLLMASGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDM 239 (622) Q Consensus 160 gF~EveTP~L~ksTpeGArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~ 239 (622) ||+|||||+|+++++|||++ +.|...+...+| |+||||||||+||+|||+||||||||||||+++++||.+||||||+ T Consensus 155 gF~EVeTP~l~~~~~eGga~-~f~~~~~~~~~y-L~~Spqly~q~li~~G~~rvfeI~~~FR~E~~~t~RH~pEFT~lE~ 232 (434) T PRK05159 155 GFTEIFTPKIVATGTEGGTE-LFPVKYFEKEAF-LAQSPQLYKQMMMAAGFERVFEIGPAFRAEEHNTTRHLNEAISIDV 232 (434) T ss_pred CCEEEECCCCCCCCCCCCCC-CCCEEECCCEEE-ECCCCHHHHHHHHHCCCCCEEEECCCEECCCCCCCCCHHHHHHHHH T ss_conf 91999787432567887556-586131275334-1468379999987635685599660102365875445356765666 Q ss_pred EECCCCHHHHHHHHHHHHHHHHHHHHCCC------------CCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 10268999999999999999999980872------------345773212189887513677764447640124431017 Q gi|254780419|r 240 EMSFVEQDDILNTMENVLRGVFEEFSNGN------------TVSNNFLRICYDDAIRRYGTDKPDLRNPIIMHNVTEHFV 307 (622) Q Consensus 240 EmsF~~~edvm~l~E~li~~i~~~v~~~~------------~i~~pF~rmtY~eAm~~YGsDKPDLR~~lei~dit~~~~ 307 (622) ||||+|++|+|+++|+|++++|+.+.+.. .+..||+|+||.||++.. T Consensus 233 e~af~d~~d~m~~~E~li~~i~~~v~~~~~~~~~~~~~~~~~~~~~~~rit~~eai~~l--------------------- 291 (434) T PRK05159 233 EMGFIDEEDVMDLLENLLKYVYEDVAENCEKELELLGIELPVPETPIPRITYDEAIEIL--------------------- 291 (434) T ss_pred HHCCCCHHHHHHHHHHHHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCEEEHHHHHHHH--------------------- T ss_conf 63106899999999999999999998656477863586556679986477899999999--------------------- Q ss_pred CCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCHHHHHHH Q ss_conf 87524566687403784799996278766521035677999876420246235520477753335702553252789999 Q gi|254780419|r 308 HSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNIVNTEAI 387 (622) Q Consensus 308 ~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~~~~~~l 387 (622) +..|.+ +.| .+.++.+.... T Consensus 292 -----------------------------------------~~~~~~-~~~-----------------g~dl~~~~e~~- 311 (434) T PRK05159 292 -----------------------------------------KSKGVE-ISW-----------------GDDLDTEGERL- 311 (434) T ss_pred -----------------------------------------HHCCCC-CCC-----------------CCCCCHHHHHH- T ss_conf -----------------------------------------851998-785-----------------55537899999- Q ss_pred HHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEEECCCCCCCCHHHCCC-CCCCCCH Q ss_conf 998188755726998088541100134788999877300355751028896077053545547000211002-4766430 Q gi|254780419|r 388 RVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYEWNEEEKKIDFAHNPF-TMPQGGM 466 (622) Q Consensus 388 ~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe~dee~~r~~s~HHPF-TaP~~~~ 466 (622) +++. +.. -.|+||+||| +.|+|| |+|++++ T Consensus 312 --------------------------------l~~~---~~~--~~~~fI~d~P------------~~~~pfy~~~~~~~ 342 (434) T PRK05159 312 --------------------------------LGKY---VKE--SDFYFITDWP------------TEIRPFYTMPYEDD 342 (434) T ss_pred --------------------------------HHHH---HCC--CCCEEEECCC------------HHCCCCCCCCCCCC T ss_conf --------------------------------9998---475--8988998997------------00086546777888 Q ss_pred HHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHHH Q ss_conf 65541887774131367777567721251231569999999998599989998667789999860849734222468999 Q gi|254780419|r 467 ESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMPPHGGIAAGIDRI 546 (622) Q Consensus 467 ~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~alGlDRl 546 (622) |. .+.+|||++||+||||||+||||++.|+++|+..|++++. |+|||+||+||||||||||||+||| T Consensus 343 -------~~--~~~~fdl~~~G~Ei~~G~~r~~d~~~l~~~~~~~g~~~~~----~~~yl~al~yG~PP~gG~glGiDRL 409 (434) T PRK05159 343 -------PE--ITKSFDLMYRGLEITSGAQRIHDYDMLVESIKEKGLNPES----FEFYLEAFKYGMPPHAGWGLGLERL 409 (434) T ss_pred -------CC--EEEEEEEEECCEEEEEEEEECCCHHHHHHHHHHCCCCHHH----HHHHHHHHHCCCCCCCEEEEHHHHH T ss_conf -------32--0014326776699843101117899999999974989899----9999999667999875587879999 Q ss_pred HHHHCCCCCCEEEECCCCCCCCCCC Q ss_conf 9998373984032137899987103 Q gi|254780419|r 547 VMLLLGAKNVREVSLFPMSQNFCDL 571 (622) Q Consensus 547 vmll~g~~sIRdVIaFPKt~~g~Dl 571 (622) +|+|||++||||||+|||+.+...| T Consensus 410 vm~l~g~~sIrdvi~FPr~~~rl~P 434 (434) T PRK05159 410 TMKLLGLENVREAVLFPRDRTRLTP 434 (434) T ss_pred HHHHCCCCCHHEEECCCCCCCCCCC T ss_conf 9998199966548148899898997 No 9 >PRK00484 lysS lysyl-tRNA synthetase; Reviewed Probab=100.00 E-value=0 Score=868.53 Aligned_cols=428 Identities=25% Similarity=0.399 Sum_probs=349.6 Q ss_pred CCCCHHHCCHHH---------CCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCC---HHHHHHHCCCCCCEEE Q ss_conf 742042369799---------8988999979800444698799999779825999974881---5789985289885899 Q gi|254780419|r 5 RTHSCGDLGVSD---------VGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPDS---ACFEIAKSVRSEWVLC 72 (622) Q Consensus 5 Rth~cg~l~~~~---------~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~~---~~~~~~~~l~~esvv~ 72 (622) |||++.++++.+ .+.+|+|+|||+++|++||++|++|||++|.+|||++++. +.++.++.++.|++|. T Consensus 31 ~~~~~~~~~~~~~~~~~~~~~~~~~V~vaGrv~~~R~~Gk~~F~~lrD~~g~iQ~~~~~~~~~~~~~~~~~~l~~gd~i~ 110 (491) T PRK00484 31 RTHTSAELQAKYADLEKEELEALIEVSVAGRVMLKRVMGKASFATIQDGSGRIQLYVSKDDVGEEAYEAFKKLDLGDIIG 110 (491) T ss_pred CCCCHHHHHHHHHHCCCCCCCCCCEEEEEEEEEEEECCCCEEEEEEEECCCCEEEEEECCCCCHHHHHHHHCCCCCCEEE T ss_conf 86669999998751385334679889999878405056981999999699458999966758999999996068666898 Q ss_pred EEEEEEECCCCCCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCHHHHCCCCCEEE-CCCCCCCCHHHHHHHHHH Q ss_conf 9999997586555887778659999889999502117998645766789878211232000-121111212458999999 Q gi|254780419|r 73 IDGLVMSRSAETINANIITGQIELSAQKIEILSVAEELPLPVFGEREYPESIRFKYRFLDL-RRDTLHKNIVKRTRIINS 151 (622) Q Consensus 73 v~G~V~~r~~~~~n~~~~tG~~Ei~~~~~~il~~a~~~P~~~~~~~~~~e~~rl~~R~LdL-R~~~~~~~l~~Rs~i~~~ 151 (622) |+|+|.+ .+|||+||.|++++|||+|.. |+|.+.+...++|+|++|||||| ||+.++++|++||+++++ T Consensus 111 v~G~v~~---------t~~Gel~v~~~~~~iLsksl~-plP~k~~g~~d~e~r~r~RyLDLi~n~~~~~~f~~Rs~ii~~ 180 (491) T PRK00484 111 VEGTLFK---------TKTGELSVKVTELTLLTKSLR-PLPDKFHGLSDQETRYRQRYLDLIVNEESRETFRKRSKIISA 180 (491) T ss_pred EEEEEEE---------CCCCEEEEECCEEEEECCCCC-CCCCCCCCCCCHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 7879986---------588516640331799316888-998544455675563113302102799999999999999999 Q ss_pred HHHHHHCCCEEEEECCCCCCCCCCC--CCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCCCCCCCCC Q ss_conf 9875203651462033211246456--64543222221034431123854564433105702331100110377766553 Q gi|254780419|r 152 MRCRMIAENFIECSTPILTASSPEG--ARDFLVPSRVNQGSFYALPQAPQQYKQLLMASGFDRYFQIAPCFRDEDPRSDR 229 (622) Q Consensus 152 ~R~~l~~~gF~EveTP~L~ksTpeG--ArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G~dryfqiarcFRdEd~R~dR 229 (622) +|+||+++||+|||||+| ++|||| ||+|+++++.....|| |+||||||||+||||||||||||||||||||++++| T Consensus 181 iR~~l~~~gF~EVeTPiL-~~~~gGA~ArpF~t~~n~l~~~~y-L~~SPqLylk~l~vgG~ervfeI~r~FR~E~~~~rH 258 (491) T PRK00484 181 IRRFLDNRGFLEVETPML-QPIPGGAAARPFITHHNALDIDLY-LRIAPELYLKRLIVGGFERVFEIGRNFRNEGIDTRH 258 (491) T ss_pred HHHHHHHCCEEEEECCCC-CCCCCCCCCCCCCCCCCCCCCCEE-ECCCHHHHHHHHHHCCCHHHEEHHHHHHCCCCCCCC T ss_conf 999998676899867877-666887566765564456685544-206878987778762720222248766257554341 Q ss_pred CCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCC---------CCCCCCCCCCCHHHHHHHC-CCCCCCCCCCCCC Q ss_conf 80000356541026899999999999999999998087---------2345773212189887513-6777644476401 Q gi|254780419|r 230 LPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNG---------NTVSNNFLRICYDDAIRRY-GTDKPDLRNPIIM 299 (622) Q Consensus 230 qp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~---------~~i~~pF~rmtY~eAm~~Y-GsDKPDLR~~lei 299 (622) || ||||||+||||+|++|+|+++|+|++++++++.+. +.+..||+|+||.||+.+| |.|.+... T Consensus 259 ~p-EFT~lE~e~af~d~~dvm~l~E~li~~v~~~v~g~~~~~~~~~~idl~~pf~rit~~eai~~~~g~d~~~~~----- 332 (491) T PRK00484 259 NP-EFTMIEFYQAYADYNDMMDLTEELIRHLAKEVLGTTKITYGGTEIDFGKPFKRLTMVDAIKEYTGVDFTPQD----- 332 (491) T ss_pred CC-CEEEEEEEEECCCHHHHHHHHHHHHHHHHHHHCCCCEEEECCEEECCCCCCEECCHHHHHHHHCCCCCCCCC----- T ss_conf 84-010213677247999999999999999999961995673287763378995022399999997099876556----- Q ss_pred CCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHC Q ss_conf 24431017875245666874037847999962787665210356779998764202462355204777533357025532 Q gi|254780419|r 300 HNVTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHL 379 (622) Q Consensus 300 ~dit~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l 379 (622) ..+.+.+.++..|.+ + .. .+ T Consensus 333 ----------------------------------------~~~~l~~~~~~~g~~------~--~~--~~---------- 352 (491) T PRK00484 333 ----------------------------------------EEEEARAIAKELGVE------V--EK--SW---------- 352 (491) T ss_pred ----------------------------------------CHHHHHHHHHHCCCC------C--CC--CC---------- T ss_conf ----------------------------------------789999999986998------7--88--77---------- Q ss_pred CHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEEECCCCCCCCHHHCCC Q ss_conf 52789999998188755726998088541100134788999877300355751028896077053545547000211002 Q gi|254780419|r 380 NIVNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYEWNEEEKKIDFAHNPF 459 (622) Q Consensus 380 ~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe~dee~~r~~s~HHPF 459 (622) ..|.+..++.+. ++++....|+||+|||. + ..|| T Consensus 353 --------------------------------~~g~l~~~lfe~--~ve~~l~~PtFV~d~P~-----e-------~sPl 386 (491) T PRK00484 353 --------------------------------GLGKLINELFEE--FVEPKLIQPTFITDYPV-----E-------ISPL 386 (491) T ss_pred --------------------------------CHHHHHHHHHHH--HHHHHCCCCEECCCCCC-----C-------CCCC T ss_conf --------------------------------878999998887--76541146400136722-----1-------5875 Q ss_pred CCCCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHHC----CCCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 4766430655418877741313677775677212512315699999999985----999899986677899998608497 Q gi|254780419|r 460 TMPQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSNV----GISKEVVENRFGGLYRAFQCGMPP 535 (622) Q Consensus 460 TaP~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~l----gl~~ee~~~~Fg~lL~Al~yG~PP 535 (622) |+|++++ |. .+.+||||+||+||||||+|||||..|+++|+.. ...++++.....+||+||+||||| T Consensus 387 ak~~~~d-------p~--~~erfdL~i~G~Ei~ng~~el~Dp~~Q~~rf~~q~~~k~~gd~ea~~~Dedfl~Al~yG~PP 457 (491) T PRK00484 387 AKRHRSN-------PG--FTERFELFIGGRELANAFSELNDPIDQRERFEAQVEAKEAGDDEAMFYDEDFLRALEYGMPP 457 (491) T ss_pred CCCCCCC-------CC--CEEEECEEECCEEECCCCHHHCCHHHHHHHHHHHHHHHHCCCCHHHHCCHHHHHHHHCCCCC T ss_conf 5668889-------67--03675643488997365033179999999999999998648832332409999997779999 Q ss_pred CCCEECCHHHHHHHHCCCCCCEEEECCCCC Q ss_conf 342224689999998373984032137899 Q gi|254780419|r 536 HGGIAAGIDRIVMLLLGAKNVREVSLFPMS 565 (622) Q Consensus 536 HgG~alGlDRlvmll~g~~sIRdVIaFPKt 565 (622) |||||||+|||||+|||++||||||+||.- T Consensus 458 hgG~GlGiDRLvMlLtg~~sIRdVI~FP~~ 487 (491) T PRK00484 458 TGGLGIGIDRLVMLLTNSPSIRDVILFPAM 487 (491) T ss_pred CCEEEEHHHHHHHHHCCCCCHHHCCCCCCC T ss_conf 852655699999998399957750168888 No 10 >PRK03932 asnC asparaginyl-tRNA synthetase; Validated Probab=100.00 E-value=0 Score=781.15 Aligned_cols=433 Identities=22% Similarity=0.314 Sum_probs=317.2 Q ss_pred CHHHCC--HHHCCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEE--CCC-CHHHHHHHCCCCCCEEEEEEEEEECCC Q ss_conf 042369--799898899997980044469879999977982599997--488-157899852898858999999997586 Q gi|254780419|r 8 SCGDLG--VSDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVA--NPD-SACFEIAKSVRSEWVLCIDGLVMSRSA 82 (622) Q Consensus 8 ~cg~l~--~~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~--~~~-~~~~~~~~~l~~esvv~v~G~V~~r~~ 82 (622) +..+|- ++++|++|+|+|||+++|++||++|++|||++|.+|+++ +.. ...++.+++|+.||+|.|+|+|..++. T Consensus 5 ~i~~~~~~~~~ig~~V~v~Grv~~~R~~Gkl~Fi~LrD~tg~~qiq~~~~~~~~~~~~~~~~l~~gd~V~V~G~v~~t~~ 84 (462) T PRK03932 5 SIKDILKGKKAVGQEVTIRGWVRTKRDSGKISFLELNDGSCFKQIQVVKNNELPNYGEEVFKLTTGSSVYVTGTVVESPG 84 (462) T ss_pred CHHHHHCCCCCCCCEEEEEEEEEEEECCCCCEEEEEECCCCCEEEEEEECCCCHHHHHHHHCCCCCEEEEEEEEEEECCC T ss_conf 79997658223899899989778278179838999982898789999967976679999966999569999999992599 Q ss_pred CCCCCCCCCCEEEEEEEEEEEEECCC-CCCCCCCCCCCCCHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHCCCE Q ss_conf 55588777865999988999950211-79986457667898782112320001211112124589999999875203651 Q gi|254780419|r 83 ETINANIITGQIELSAQKIEILSVAE-ELPLPVFGEREYPESIRFKYRFLDLRRDTLHKNIVKRTRIINSMRCRMIAENF 161 (622) Q Consensus 83 ~~~n~~~~tG~~Ei~~~~~~il~~a~-~~P~~~~~~~~~~e~~rl~~R~LdLR~~~~~~~l~~Rs~i~~~~R~~l~~~gF 161 (622) ..|++||.++++++||+|. ++|++.+.+ ..+.+++|||||||++.++++|++||++++++|+||+++|| T Consensus 85 -------~~~~~El~~~~i~vls~~~~~~Pl~~~~~---~~e~~~~~r~LdlR~~~~~~ifr~RS~i~~~iR~~l~~~~F 154 (462) T PRK03932 85 -------AGQGYELQATKIEVIGEDPDTYPIQKKRH---SIEFLREIAHLRPRTNTFGAVMRVRNTLAQAIHEFFQENGF 154 (462) T ss_pred -------CCCCEEEEEEEEEEEECCCCCCCCCCCCC---CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf -------99777999838999837986699883323---66777643355514667889999999999999999976796 Q ss_pred EEEECCCCCCCCCCCC-CCCCCCCCC-------CHH-----HHH----HCCCCHHHHHHHHHCCCHHHHEEEHHHHCCCC Q ss_conf 4620332112464566-454322222-------103-----443----11238545644331057023311001103777 Q gi|254780419|r 162 IECSTPILTASSPEGA-RDFLVPSRV-------NQG-----SFY----ALPQAPQQYKQLLMASGFDRYFQIAPCFRDED 224 (622) Q Consensus 162 ~EveTP~L~ksTpeGA-rdFlVPsR~-------~~g-----~fY----ALpQSPQlyKQlLmv~G~dryfqiarcFRdEd 224 (622) +|||||+|++++|||| +.|.|+++. .+. .|| .|+||||||||+ |++||+||||||||||||+ T Consensus 155 ~EVeTP~L~~~~~eGaa~~F~v~~~~~~~~~~~~~~~~~~~~yf~~~~yL~~SpqLylq~-li~G~erVfeIg~~FRnE~ 233 (462) T PRK03932 155 VWVHTPIITASDCEGAGELFRVTTLDLENLPRTDGKVDFSKDFFGKEAFLTVSGQLYAEA-YAMALSKVYTFGPTFRAEN 233 (462) T ss_pred EEEECCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCHHHHHHH-HHHHCCCEEEEEHHHHHCC T ss_conf 799788653568764557404214543445433444331145427762026576899999-8752064899733232056 Q ss_pred CCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCC Q ss_conf 66553800003565410268999999999999999999980872345773212189887513677764447640124431 Q gi|254780419|r 225 PRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFLRICYDDAIRRYGTDKPDLRNPIIMHNVTE 304 (622) Q Consensus 225 ~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~rmtY~eAm~~YGsDKPDLR~~lei~dit~ 304 (622) ++++||.+||||||+||||+|++|+|+++|+|++++|+.+.+... +.++..|.+. T Consensus 234 ~~t~RH~pEFT~lE~e~a~~d~~d~m~l~E~li~~i~~~v~~~~~-----------~el~~~~~~~-------------- 288 (462) T PRK03932 234 SNTRRHLAEFWMIEPEMAFADLEDNMDLAEDMLKYVIKAVLEECP-----------DDLEFLNRDV-------------- 288 (462) T ss_pred CCCCCCCCCCEEEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCH-----------HHHHHHCCCC-------------- T ss_conf 775445430025553110068999999999999999999987586-----------7787624547-------------- Q ss_pred CCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCHHHH Q ss_conf 01787524566687403784799996278766521035677999876420246235520477753335702553252789 Q gi|254780419|r 305 HFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNIVNT 384 (622) Q Consensus 305 ~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~~~~ 384 (622) . ...+...... .+..+.|..++...+..+..|.+- . . ...+ ...+..+.. T Consensus 289 --~---~~~~~~l~~~-----------~~~~~~~it~~eai~~l~~~~~~~------~--~--~~~~----g~dl~~~~E 338 (462) T PRK03932 289 --D---KGLIERLENF-----------IESPFPRITYTEAIEILQKSGKKF------E--F--PVEW----GDDLGSEHE 338 (462) T ss_pred --C---HHHHHHHHHH-----------CCCCCCEEEHHHHHHHHHHCCCCC------C--C--CCCC----CCCCCCHHH T ss_conf --8---3688999986-----------289972646999999999649645------6--7--7211----120354888 Q ss_pred HHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEEECCCCCCCCHHHCCCCCCCC Q ss_conf 99999818875572699808854110013478899987730035575102889607705354554700021100247664 Q gi|254780419|r 385 EAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYEWNEEEKKIDFAHNPFTMPQG 464 (622) Q Consensus 385 ~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe~dee~~r~~s~HHPFTaP~~ 464 (622) ..++++ ..-.|+||+||| +.+.||+|+.. T Consensus 339 ---------------------------------~~l~e~------~~~~P~fi~d~P------------~~~~Pfy~~~~ 367 (462) T PRK03932 339 ---------------------------------RYLTEE------HFKKPVFVTNYP------------KDIKAFYMRLN 367 (462) T ss_pred ---------------------------------HHHHHH------HCCCCEEEECCC------------CCCCCCCCCCC T ss_conf ---------------------------------888888------658788997762------------01385335658 Q ss_pred CHHHHHCCCHHHHHHHCCCEEECCE-EECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCH Q ss_conf 3065541887774131367777567-721251231569999999998599989998667789999860849734222468 Q gi|254780419|r 465 GMESLKGNDLLSIKAFQYDLVCNGF-EIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMPPHGGIAAGI 543 (622) Q Consensus 465 ~~~~l~~~dp~~v~a~~YDLVlNG~-EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~alGl 543 (622) ++ +.. ..+|||++||+ ||+|||+|||+++.|++.++..|+++++ |+|||+|++||||||||||||+ T Consensus 368 ~d-------~~~--~~~~dll~~G~~Ei~~G~~R~~~~~~l~~~~~~~g~~~e~----~~~yl~~~~yG~PP~gG~GiGi 434 (462) T PRK03932 368 PD-------GKT--VAAMDLLAPGIGEIIGGSQREERLDVLDARMKELGLDKED----YWWYLDLRRYGSVPHSGFGLGF 434 (462) T ss_pred CC-------CCE--EEEEEEEECCCEEEECHHHHCCCHHHHHHHHHHCCCCHHH----HHHHHHHHHCCCCCCCEEEEHH T ss_conf 89-------774--6777677358647606555337999999999986999899----9999999767999877376779 Q ss_pred HHHHHHHCCCCCCEEEECCCCCCCCCC Q ss_conf 999999837398403213789998710 Q gi|254780419|r 544 DRIVMLLLGAKNVREVSLFPMSQNFCD 570 (622) Q Consensus 544 DRlvmll~g~~sIRdVIaFPKt~~g~D 570 (622) |||||+|||++||||||+||||.+-.| T Consensus 435 DRLvm~l~~~~sIRdvi~FPr~~~r~~ 461 (462) T PRK03932 435 ERLVAYVTGLENIRDVIPFPRTPGRAE 461 (462) T ss_pred HHHHHHHCCCCCHHEEECCCCCCCCCC T ss_conf 999999829996545864889877577 No 11 >COG0017 AsnS Aspartyl/asparaginyl-tRNA synthetases [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=712.25 Aligned_cols=408 Identities=30% Similarity=0.512 Sum_probs=334.5 Q ss_pred CCCCCHHHCCHHHCCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCC---CHHHHHHHCCCCCCEEEEEEEEEEC Q ss_conf 7742042369799898899997980044469879999977982599997488---1578998528988589999999975 Q gi|254780419|r 4 YRTHSCGDLGVSDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPD---SACFEIAKSVRSEWVLCIDGLVMSR 80 (622) Q Consensus 4 ~Rth~cg~l~~~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~---~~~~~~~~~l~~esvv~v~G~V~~r 80 (622) +++++..++.....|++|+|+|||+++|++|+++|+.|||+||.+|||++++ .+.++ ++.|+.||+|.|+|+|++. T Consensus 2 ~~~~~i~di~~~~~~~~V~v~GWV~~~R~~g~i~Fi~lrDgsg~iQ~v~~~~~~~~~~~~-~~~L~~es~v~V~G~v~~~ 80 (435) T COG0017 2 MKRTYIKDIKPHVGGQEVTVRGWVHNKRDLGKIIFLVLRDGSGFIQAVVPKNKVYEELFK-AKKLTLESSVVVTGIVKAS 80 (435) T ss_pred CCEEEHHHHHCCCCCCEEEEEEEEEEECCCCCEEEEEEECCCCEEEEEEECCCCCHHHHH-HHCCCCCCEEEEEEEEECC T ss_conf 735437765044798579999887550015870899997588379999977877176766-6317876589999999748 Q ss_pred CCCCCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHCCC Q ss_conf 86555887778659999889999502117998645766789878211232000121111212458999999987520365 Q gi|254780419|r 81 SAETINANIITGQIELSAQKIEILSVAEELPLPVFGEREYPESIRFKYRFLDLRRDTLHKNIVKRTRIINSMRCRMIAEN 160 (622) Q Consensus 81 ~~~~~n~~~~tG~~Ei~~~~~~il~~a~~~P~~~~~~~~~~e~~rl~~R~LdLR~~~~~~~l~~Rs~i~~~~R~~l~~~g 160 (622) + +.+ +.+||++++++|+|.|++ |+|++.+..++.|+||++||||||++..+++|++||.+.+++|+||.++| T Consensus 81 ~------~a~-~g~El~v~~i~Vl~~a~~-~~Pi~~~~~~~~e~lld~rhL~lR~~~~~Av~kirs~i~~a~~eff~~~g 152 (435) T COG0017 81 P------KAP-QGFELQVEKIEVLGEADP-PYPIDKKEHSELETLLDNRHLDLRTPKIQAVFKIRSSILRAIREFFYENG 152 (435) T ss_pred C------CCC-CCEEEEEEEEEEEECCCC-CCCCCCCCCCCHHHHHHCHHEECCCCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 8------999-877998878899404688-77768645557888874403020462037789499999999999997589 Q ss_pred EEEEECCCCCCCCCCCCCC-CCCCCCCCHHHHHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEE Q ss_conf 1462033211246456645-432222210344311238545644331057023311001103777665538000035654 Q gi|254780419|r 161 FIECSTPILTASSPEGARD-FLVPSRVNQGSFYALPQAPQQYKQLLMASGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDM 239 (622) Q Consensus 161 F~EveTP~L~ksTpeGArd-FlVPsR~~~g~fYALpQSPQlyKQlLmv~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~ 239 (622) |+||+||+|+.+++|||.+ |-| ..+...+| |.||||||||++|.+ ++|||+|||+||+|.+++.||++||||||+ T Consensus 153 F~eV~tP~i~~~~~EGg~elF~v--~yf~~~a~-LtqS~QLyke~~~~a-l~rVf~igP~FRAE~s~T~RHL~EF~~ld~ 228 (435) T COG0017 153 FTEVHTPIITASATEGGGELFKV--DYFDKEAY-LTQSPQLYKEALAAA-LERVFTIGPTFRAEKSNTRRHLSEFWMLDP 228 (435) T ss_pred CEEECCCEEECCCCCCCCEEEEE--EECCCCEE-EECCHHHHHHHHHHH-HCCEEEECCCEECCCCCCCCHHHHHHEECC T ss_conf 58965946853478998505777--50685667-714788999999998-576489567255477897003666733021 Q ss_pred EECCCCHHHHHHHHHHHHHHHHHHHHCC------------CCC----CCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 1026899999999999999999998087------------234----577321218988751367776444764012443 Q gi|254780419|r 240 EMSFVEQDDILNTMENVLRGVFEEFSNG------------NTV----SNNFLRICYDDAIRRYGTDKPDLRNPIIMHNVT 303 (622) Q Consensus 240 EmsF~~~edvm~l~E~li~~i~~~v~~~------------~~i----~~pF~rmtY~eAm~~YGsDKPDLR~~lei~dit 303 (622) ||||++++|+|+++|+|++++++.+... ..+ +.||||+||.||++.... T Consensus 229 Emaf~~~~d~m~l~E~~i~~i~~~v~e~~~~el~~l~~~~~~l~~~~~~pf~ritY~eAieiL~~--------------- 293 (435) T COG0017 229 EMAFADLNDVMDLAEELIKYLFKKVLEECADELEFLGRDNSELKRPESAPFPRITYKEAIEILEE--------------- 293 (435) T ss_pred EECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCCCEEEHHHHHHHHHH--------------- T ss_conf 11367688899999999999999999971898887614611103444689637779999999986--------------- Q ss_pred CCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCHHH Q ss_conf 10178752456668740378479999627876652103567799987642024623552047775333570255325278 Q gi|254780419|r 304 EHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNIVN 383 (622) Q Consensus 304 ~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~~~ 383 (622) +|.-.+.|.. .++.+. T Consensus 294 --------------------------------------------------~~~e~~~~Gd--------------Dl~~e~ 309 (435) T COG0017 294 --------------------------------------------------KGFEKVEWGD--------------DLGTEH 309 (435) T ss_pred --------------------------------------------------CCCCCCCCCC--------------CCCCHH T ss_conf --------------------------------------------------4776567777--------------568899 Q ss_pred HHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEE-EEEEEECCCEEECCCCCCCCHHHCCCCCC Q ss_conf 99999981887557269980885411001347889998773003557510-28896077053545547000211002476 Q gi|254780419|r 384 TEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFE-FCWIMDFPFYEWNEEEKKIDFAHNPFTMP 462 (622) Q Consensus 384 ~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~-flWV~DFPLFe~dee~~r~~s~HHPFTaP 462 (622) ...+. ++ -++ |+||+|||. +- -||+|. T Consensus 310 Er~l~---------------------------------e~-------~~~~~vfv~~yP~------~~------kpFYm~ 337 (435) T COG0017 310 ERYLG---------------------------------EE-------YFKPPVFVTNYPK------EI------KPFYMR 337 (435) T ss_pred HHHHH---------------------------------HH-------HCCCCEEEEECCC------CC------CCCCCC T ss_conf 99999---------------------------------87-------5799499981764------44------665033 Q ss_pred CCCHHHHHCCCHHHHHHHCCCEEECC-EEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEC Q ss_conf 64306554188777413136777756-77212512315699999999985999899986677899998608497342224 Q gi|254780419|r 463 QGGMESLKGNDLLSIKAFQYDLVCNG-FEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMPPHGGIAA 541 (622) Q Consensus 463 ~~~~~~l~~~dp~~v~a~~YDLVlNG-~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~al 541 (622) ..++ +|. ...+|||.+.| -||.|||+|||++++..+.++..|+++++ |.|||++++||+|||||||| T Consensus 338 ~~~d------~p~--~~~~~Dll~p~~gEIigGsqRe~~~d~L~~ri~~~gl~~e~----~~wYld~~kyG~~PHaGfGl 405 (435) T COG0017 338 PDPD------NPG--TVASFDLLAPGGGEIIGGSQREHDYDLLVERIKEKGLDPES----YEWYLDLRKYGMPPHAGFGL 405 (435) T ss_pred CCCC------CCC--EEEEEEEECCCCEEEECCEECCCCHHHHHHHHHHCCCCHHH----HHHHHHHHHCCCCCCCCCCC T ss_conf 3799------887--69987652478705603623043099999999980999688----18999998738999860033 Q ss_pred CHHHHHHHHCCCCCCEEEECCCCCCC Q ss_conf 68999999837398403213789998 Q gi|254780419|r 542 GIDRIVMLLLGAKNVREVSLFPMSQN 567 (622) Q Consensus 542 GlDRlvmll~g~~sIRdVIaFPKt~~ 567 (622) |+|||+|.+||.+||||||+||++.. T Consensus 406 G~ERlv~~i~gl~nIRea~pFPR~~~ 431 (435) T COG0017 406 GLERLVMYILGLDNIREAIPFPRDPG 431 (435) T ss_pred CHHHHHHHHHCCCCCEECCCCCCCCC T ss_conf 79999999948883201244777778 No 12 >TIGR00458 aspS_arch aspartyl-tRNA synthetase; InterPro: IPR004523 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Aspartyl tRNA synthetase 6.1.1.12 from EC is an alpha2 dimer that belongs to class IIb. Structural analysis combined with mutagenesis and enzymology data on the yeast enzyme point to a tRNA binding process that starts by a recognition event between the tRNA anticodon loop and the synthetase anticodon binding module . This family represents aspartyl-tRNA synthetases from the eukaryotic cytosol and from the archaea. In some species, this enzyme aminoacylates tRNA for both Asp and Asn; Asp-tRNA(asn) is subsequently transamidated to Asn-tRNA(asn). ; GO: 0000166 nucleotide binding, 0004815 aspartate-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006422 aspartyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=691.39 Aligned_cols=417 Identities=34% Similarity=0.587 Sum_probs=335.9 Q ss_pred CCHHHCCHHHCC-CEEEEEEEECCCCCCCC-EEEEEEECCCCEEEEEECCCC------HHHHHHHCCCCCCEEEEEEEEE Q ss_conf 204236979989-88999979800444698-799999779825999974881------5789985289885899999999 Q gi|254780419|r 7 HSCGDLGVSDVG-SFVRLSGWVHRVRPHGG-IIFLDIRDHYGITQVVANPDS------ACFEIAKSVRSEWVLCIDGLVM 78 (622) Q Consensus 7 h~cg~l~~~~~g-~~V~i~GwV~~~R~~g~-l~F~~lrD~sG~~Q~v~~~~~------~~~~~~~~l~~esvv~v~G~V~ 78 (622) |+..|+..+..| ++|++.||||++|+.|+ ++|+.|||+.|++|++++.+. ..++.+++|+.||||.|+|.|+ T Consensus 1 ~~~~D~~~~~dgd~~V~~~g~VH~~R~lG~~l~F~~LRdr~g~~Qi~~~~~~eg~~~~~~~k~~~~l~~Esvv~v~G~V~ 80 (466) T TIGR00458 1 VYSADIKPEMDGDQEVLLMGWVHEIRDLGGTLIFVLLRDREGLIQILVPKKKEGVISKELFKLIKKLKKESVVAVKGKVK 80 (466) T ss_pred CCCCCCCCCCCCCCEEEEEEEEECHHHCCCEEEEEEEECCCCEEEEEECCCCCCCHHHHHHHHHHHCCCCCEEEEEEEEE T ss_conf 96544651017884589986541242145738998873567526776311114402488999986337564889731787 Q ss_pred ECCCCCCCCCCCCCEEEEEEEEEEEEECCC-CCCCCCCC--CCCC----------CHHHHCCCCCEEECCCCCCCCHHHH Q ss_conf 758655588777865999988999950211-79986457--6678----------9878211232000121111212458 Q gi|254780419|r 79 SRSAETINANIITGQIELSAQKIEILSVAE-ELPLPVFG--EREY----------PESIRFKYRFLDLRRDTLHKNIVKR 145 (622) Q Consensus 79 ~r~~~~~n~~~~tG~~Ei~~~~~~il~~a~-~~P~~~~~--~~~~----------~e~~rl~~R~LdLR~~~~~~~l~~R 145 (622) +-. ...+.+.+ +.+||.++++++||.|+ +||+.+.+ +..+ +-+|||++|+||||++..|++||+| T Consensus 81 ~~d-ep~~~~a~-~g~EI~~~~~~v~~~~~~pLPl~~~ea~k~~AE~daaglp~~~ldTRLd~R~~DLR~~~~~AIF~I~ 158 (466) T TIGR00458 81 KVD-EPEKEKAP-GGLEIIPTKLEVLNEAKEPLPLDLTEASKVEAELDAAGLPVVNLDTRLDARFLDLRRPKVQAIFRIR 158 (466) T ss_pred ECC-CCHHCCCC-CCEEEEEEHHHHHHCCCCCCCCCHHHCCCCHHHHHHCCCCEEEECCCHHHHHHHCCCCCHHHHHHHH T ss_conf 514-74100477-7525631000111227767887611102340334313752343001110001102565402125568 Q ss_pred HHHHHHHHHHHHCCCEEEEECCCCCCCCCCC-CCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCCCC Q ss_conf 9999999875203651462033211246456-645432222210344311238545644331057023311001103777 Q gi|254780419|r 146 TRIINSMRCRMIAENFIECSTPILTASSPEG-ARDFLVPSRVNQGSFYALPQAPQQYKQLLMASGFDRYFQIAPCFRDED 224 (622) Q Consensus 146 s~i~~~~R~~l~~~gF~EveTP~L~ksTpeG-ArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G~dryfqiarcFRdEd 224 (622) +.++.++|+||.++||+||+||.|.+|..|| ..-|-| -.+.+++| |.||||+|||+||++||||||+|||.||+|+ T Consensus 159 ~~~~~~vR~~L~~~~F~Ev~TPKlv~~~tEGGt~lF~i--~YF~~~Af-L~QSPQlyKQ~l~~~g~~rVyEI~PiFRAE~ 235 (466) T TIGR00458 159 SEVLESVREFLAEEGFIEVHTPKLVASATEGGTELFPI--TYFEREAF-LGQSPQLYKQVLMAAGLERVYEIGPIFRAEE 235 (466) T ss_pred HHHHHHHHHHHHHCCCEEEECCCEEECCCCCCCCCCEE--EEECCCCC-CCCCCHHHHHHHHHCCCCCEEEECCEEECCC T ss_conf 99999999987308825630762142166888533201--22123000-1357035668776526520355055220566 Q ss_pred CCCCCCCCCEEEEEEEECCCCH-HHHHHHHHHHHHHHHHHHHCCC------------------CC-CCCCCCCCHHHHHH Q ss_conf 6655380000356541026899-9999999999999999980872------------------34-57732121898875 Q gi|254780419|r 225 PRSDRLPGEFYQLDMEMSFVEQ-DDILNTMENVLRGVFEEFSNGN------------------TV-SNNFLRICYDDAIR 284 (622) Q Consensus 225 ~R~dRqp~EFTqlD~EmsF~~~-edvm~l~E~li~~i~~~v~~~~------------------~i-~~pF~rmtY~eAm~ 284 (622) +.|.||.+|+|.|||||||.|+ +|||++.|+|+..+|.++.... ++ ..||.|+||+||+ T Consensus 236 ~nT~RHl~E~~siDiE~~F~dhy~~v~d~L~~lv~~~~~dv~~~~~~e~el~r~~~~~~~~~le~P~~~~~Rl~Y~Ea~- 314 (466) T TIGR00458 236 HNTRRHLNEAISIDIEMAFEDHYEDVMDILEELVVRVFEDVAEKCAKEIELVRKELETLEVKLEVPEEKFERLTYDEAI- 314 (466) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCHHHH- T ss_conf 6521455444442244442753789999999999999999998878889999875210231324888755532207789- Q ss_pred HCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEC Q ss_conf 13677764447640124431017875245666874037847999962787665210356779998764202462355204 Q gi|254780419|r 285 RYGTDKPDLRNPIIMHNVTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLD 364 (622) Q Consensus 285 ~YGsDKPDLR~~lei~dit~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~ 364 (622) +.+...|.. +.+. T Consensus 315 -------------------------------------------------------------e~~~~~G~e----i~~g-- 327 (466) T TIGR00458 315 -------------------------------------------------------------EIANAKGKE----IKWG-- 327 (466) T ss_pred -------------------------------------------------------------HHHHHCCCC----CCCC-- T ss_conf -------------------------------------------------------------999865971----5554-- Q ss_pred CCCCCCCCCCHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHH-HHHHHHHHHCCCCCCEEEEEEEE-CCC Q ss_conf 7775333570255325278999999818875572699808854110013478-89998773003557510288960-770 Q gi|254780419|r 365 ESGNVAGMGPVAKHLNIVNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADA-RNHIAQELGIINCNCFEFCWIMD-FPF 442 (622) Q Consensus 365 ~~~~~~~~spi~K~l~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~l-R~~ia~~l~li~~~~~~flWV~D-FPL 442 (622) | .++-+. .+.+|++ |.+. + ..-||+| ||. T Consensus 328 E------------DLs~~a--------------------------~K~~G~l~re~~---------~--g~yfI~d~wP~ 358 (466) T TIGR00458 328 E------------DLSTEA--------------------------EKALGKLVREEM---------D--GLYFITDKWPT 358 (466) T ss_pred C------------CCCHHH--------------------------HHHHHHHHHHHH---------C--CCEEEEECCCC T ss_conf 6------------654357--------------------------887647887742---------7--61687405876 Q ss_pred EEECCCCCCCCHHHCCC-CCCCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHH Q ss_conf 53545547000211002-47664306554188777413136777756772125123156999999999859998999866 Q gi|254780419|r 443 YEWNEEEKKIDFAHNPF-TMPQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSNVGISKEVVENR 521 (622) Q Consensus 443 Fe~dee~~r~~s~HHPF-TaP~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~ 521 (622) +- .|| |||..++ +|. .|.+|||...+.||.||++|||+.+++...++..|++|++. - T Consensus 359 ------ei------~PFY~MPd~E~------~P~--~s~~fDl~~~~~Ei~SGA~RiH~~dlL~E~iK~~Gl~P~dp--~ 416 (466) T TIGR00458 359 ------EI------RPFYTMPDDED------NPE--ISKSFDLMYRDLEISSGAQRIHLYDLLVEKIKAKGLNPEDP--G 416 (466) T ss_pred ------CC------CCCCCCCCCCC------CCC--CCHHHHHHHCCCHHCCHHHHHHHHHHHHHHHHHCCCCCCCC--C T ss_conf ------46------76213888656------786--00144433311011000345445888899998458898853--3 Q ss_pred HHHHHHHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEEECCCCCCC Q ss_conf 7789999860849734222468999999837398403213789998 Q gi|254780419|r 522 FGGLYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREVSLFPMSQN 567 (622) Q Consensus 522 Fg~lL~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdVIaFPKt~~ 567 (622) |.+|++||+||||||+|||||.||++|.+++.+|||+..+||+..+ T Consensus 417 ~~~Yl~aF~YG~PPHAGwGlG~eR~~M~~~~lkNiR~~~LFPRDr~ 462 (466) T TIGR00458 417 FKDYLEAFKYGMPPHAGWGLGAERLVMVLLGLKNIREAVLFPRDRK 462 (466) T ss_pred HHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCHHHHHHCCCCCCCC T ss_conf 6878733047847888764106788887740101122004888865 No 13 >cd00777 AspRS_core Asp tRNA synthetase (aspRS) class II core domain. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs. AspRS is a homodimer, which attaches a specific amino acid to the 3' OH group of ribose of the appropriate tRNA. The catalytic core domain is primarily responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. AspRS in this family differ from those found in the AsxRS family by a GAD insert in the core domain. Probab=100.00 E-value=0 Score=636.41 Aligned_cols=280 Identities=52% Similarity=0.915 Sum_probs=266.7 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCCHHHHEEEHHHHC Q ss_conf 24589999999875203651462033211246456645432222210344311238545644331057023311001103 Q gi|254780419|r 142 IVKRTRIINSMRCRMIAENFIECSTPILTASSPEGARDFLVPSRVNQGSFYALPQAPQQYKQLLMASGFDRYFQIAPCFR 221 (622) Q Consensus 142 l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTpeGArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G~dryfqiarcFR 221 (622) ||+||++++++|+||.++||+||+||+|++++||||++|.||++.++|.+|.|+||||+|||+||+||++|||||||||| T Consensus 1 lr~Rs~i~~~iR~ff~~~gflEV~TPiL~~~~~ega~~f~~~~~~~~~~~~~L~~Spel~~k~l~~~g~~rvf~i~~~FR 80 (280) T cd00777 1 LRLRSRVIKAIRNFLDEQGFVEIETPILTKSTPEGARDFLVPSRLHPGKFYALPQSPQLFKQLLMVSGFDRYFQIARCFR 80 (280) T ss_pred CCHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCEECCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCEEEECEEC T ss_conf 96699999999999998898998798106878776766442441688760368879899999998568777579845174 Q ss_pred CCCCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 77766553800003565410268999999999999999999980872345773212189887513677764447640124 Q gi|254780419|r 222 DEDPRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFLRICYDDAIRRYGTDKPDLRNPIIMHN 301 (622) Q Consensus 222 dEd~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~rmtY~eAm~~YGsDKPDLR~~lei~d 301 (622) ||++++.||| ||||||.||+|+|++|+|+++|+|++++++.+.+ ..++.||+||||.||+++||. T Consensus 81 ~E~~~~~h~~-EFtmLE~e~~~~~~~d~m~~~E~li~~i~~~~~~-~~~~~~~~rity~ea~~~~g~------------- 145 (280) T cd00777 81 DEDLRADRQP-EFTQIDIEMSFVDQEDIMSLIEGLLKYVFKEVLG-VELTTPFPRMTYAEAMERYGF------------- 145 (280) T ss_pred CCCCCCCCCH-HHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHC-CCCCCCCCEEEHHHHHHHHCC------------- T ss_conf 7888876634-7762344226888999999999999999999819-877999757768899998689------------- Q ss_pred CCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCH Q ss_conf 43101787524566687403784799996278766521035677999876420246235520477753335702553252 Q gi|254780419|r 302 VTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNI 381 (622) Q Consensus 302 it~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~ 381 (622) T Consensus 146 -------------------------------------------------------------------------------- 145 (280) T cd00777 146 -------------------------------------------------------------------------------- 145 (280) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred HHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEEECCCCCCCCHHHCCCCC Q ss_conf 78999999818875572699808854110013478899987730035575102889607705354554700021100247 Q gi|254780419|r 382 VNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYEWNEEEKKIDFAHNPFTM 461 (622) Q Consensus 382 ~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe~dee~~r~~s~HHPFTa 461 (622) +++|++|||+++++++.+++.+.||||++ T Consensus 146 ---------------------------------------------------~~~~~~d~~~~~~~~~~~~~~~~~~pf~~ 174 (280) T cd00777 146 ---------------------------------------------------KFLWIVDFPLFEWDEEEGRLVSAHHPFTA 174 (280) T ss_pred ---------------------------------------------------CCCCCCCCCCCCCCHHHCCCCCCCCCCCC T ss_conf ---------------------------------------------------73311357533462442552222077347 Q ss_pred CCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEC Q ss_conf 66430655418877741313677775677212512315699999999985999899986677899998608497342224 Q gi|254780419|r 462 PQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMPPHGGIAA 541 (622) Q Consensus 462 P~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~al 541 (622) |.+.+......+|..+++.+|||++||+||+|||+|+||+++|+++|+..|+++++..++|.|||+|++||+|||||+|| T Consensus 175 p~~~~~~~~~~~p~~~~~~~fdl~~~G~El~nG~~r~~d~~~~~~~~~~~~~~~~~~~~~~~~~l~a~~~G~pP~~G~gl 254 (280) T cd00777 175 PKEEDLDLLEKDPEDARAQAYDLVLNGVELGGGSIRIHDPDIQEKVFEILGLSEEEAEEKFGFLLEAFKYGAPPHGGIAL 254 (280) T ss_pred CCHHHHHHHHCCCHHHHHHHHCCCCCCEEECCCCCEECCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCEEEH T ss_conf 82441677633802456777413568644235310227999999999985989667788899999997669998743515 Q ss_pred CHHHHHHHHCCCCCCEEEECCCCCCC Q ss_conf 68999999837398403213789998 Q gi|254780419|r 542 GIDRIVMLLLGAKNVREVSLFPMSQN 567 (622) Q Consensus 542 GlDRlvmll~g~~sIRdVIaFPKt~~ 567 (622) |+|||||++||.+||||||+||+|++ T Consensus 255 GiDRLvm~l~g~~~Irdvi~FPR~~n 280 (280) T cd00777 255 GLDRLVMLLTGSESIRDVIAFPKTQN 280 (280) T ss_pred HHHHHHHHHHCCCCHHEEECCCCCCC T ss_conf 89999999928982870734799998 No 14 >TIGR00457 asnS asparaginyl-tRNA synthetase; InterPro: IPR004522 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . AsparaginyltRNA synthetase (6.1.1.22 from EC) is an alpha2 dimer that belongs to class IIb. There is a striking similarity between asparaginyl-tRNA synthetases and archaeal/eukaryotic type aspartyl-tRNA synthetases (IPR004523 from INTERPRO) and a striking divergence of bacterial type aspartyl-tRNA synthetases (IPR004524 from INTERPRO). This family, AsnS, represents asparaginyl-tRNA synthetases from the three domains of life. Some species lack this enzyme and charge tRNA(asn) by misacylation with Asp, followed by transamidation of Asp to Asn.; GO: 0000166 nucleotide binding, 0004816 asparagine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006421 asparaginyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=623.27 Aligned_cols=444 Identities=23% Similarity=0.381 Sum_probs=329.2 Q ss_pred HHHCCCEEEEEEEECCCC-CCCCEEEEEEECCC--CEEEEEECCC--CHHHHHHHCCCCCCEEEEEEEEEE-CCCCCCCC Q ss_conf 799898899997980044-46987999997798--2599997488--157899852898858999999997-58655588 Q gi|254780419|r 14 VSDVGSFVRLSGWVHRVR-PHGGIIFLDIRDHY--GITQVVANPD--SACFEIAKSVRSEWVLCIDGLVMS-RSAETINA 87 (622) Q Consensus 14 ~~~~g~~V~i~GwV~~~R-~~g~l~F~~lrD~s--G~~Q~v~~~~--~~~~~~~~~l~~esvv~v~G~V~~-r~~~~~n~ 87 (622) ...+|++|+|.|||+++| ..|+++|+.|+||| +.+|+|++.+ ...+..+++|+.|++|.|+|+|.. .|.+. T Consensus 12 ~~~~~~~v~~~GW~~~kRDs~~~i~Fl~~~DGSsl~~iQ~V~~~~~~~~~~~~~~~l~~g~~v~v~G~~~~~~P~~~--- 88 (495) T TIGR00457 12 YKAVGEEVTVSGWVRTKRDSSKKIIFLELNDGSSLGPIQAVINGELNEYLFQLLKSLTTGSSVSVTGKVVELSPGKG--- 88 (495) T ss_pred HHHCCCEEEEEEEEEEHHCCCCCEEEEEEECCCEECCEEEEEECCHHHHHHHHHHHHCCCEEEEEEEEEEECCCCCC--- T ss_conf 65309828996534100000276789996258300443789705022689999875202317999768886687887--- Q ss_pred CCCCCEEEEEEE----EEEEEECCC-CCCCCCCCCC-CCCHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHCCCE Q ss_conf 777865999988----999950211-7998645766-7898782112320001211112124589999999875203651 Q gi|254780419|r 88 NIITGQIELSAQ----KIEILSVAE-ELPLPVFGER-EYPESIRFKYRFLDLRRDTLHKNIVKRTRIINSMRCRMIAENF 161 (622) Q Consensus 88 ~~~tG~~Ei~~~----~~~il~~a~-~~P~~~~~~~-~~~e~~rl~~R~LdLR~~~~~~~l~~Rs~i~~~~R~~l~~~gF 161 (622) .| +.+||.++ +++|++.|. +--+|++.+. +..|-+| .+|||.+|.....|++|+||.+.+++++||.++|| T Consensus 89 -aP-q~~EL~~~nPvk~~ev~g~a~~p~~YPl~~K~Ghs~efLr-~~~HLr~Rt~~~~AvmrvR~~l~~ai~~yF~~~gF 165 (495) T TIGR00457 89 -AP-QPVELKVKNPVKKIEVVGEAEDPDDYPLQKKEGHSLEFLR-DIAHLRLRTNTLGAVMRVRNALSQAIHEYFQKNGF 165 (495) T ss_pred -CC-CCEEEEEECCCEEEEEEEECCCCCCCCCCCCCCCCHHHHH-CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf -68-6313687467314688774178877776422368767763-27637778988778999999999999987604786 Q ss_pred EEEECCCCCCCCCCCCCC-CCCCC-------CCC---------HHH-HHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCCC Q ss_conf 462033211246456645-43222-------221---------034-431123854564433105702331100110377 Q gi|254780419|r 162 IECSTPILTASSPEGARD-FLVPS-------RVN---------QGS-FYALPQAPQQYKQLLMASGFDRYFQIAPCFRDE 223 (622) Q Consensus 162 ~EveTP~L~ksTpeGArd-FlVPs-------R~~---------~g~-fYALpQSPQlyKQlLmv~G~dryfqiarcFRdE 223 (622) ++|+|||||.|.||||.| |-|-| |.. .|| .| |.||-||+--.+ +..|.|+|.|+|||||| T Consensus 166 ~~V~~PilT~~d~EGAGElF~v~TLdl~~lp~~~g~~df~~~fFgk~ay-LTVSGQL~~E~~-A~AL~kvyTfGPTFRAE 243 (495) T TIGR00457 166 VKVSPPILTSNDCEGAGELFKVSTLDLEKLPRNDGKIDFSKDFFGKEAY-LTVSGQLYAEAY-ALALSKVYTFGPTFRAE 243 (495) T ss_pred EEECCCEEECCCCCCCCCCCEEEEHHHHCCCCCCCCCCCCCCCCCCEEE-EEECCHHHHHHH-HHHHCCCEECCCCEECC T ss_conf 7868863113688887543322003242153326888763024886012-330227899999-99747814458740042 Q ss_pred CCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCC--CCCCCCCCCCHHH--HHHHCCCCCCCCCCCCCC Q ss_conf 766553800003565410268999999999999999999980872--3457732121898--875136777644476401 Q gi|254780419|r 224 DPRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGN--TVSNNFLRICYDD--AIRRYGTDKPDLRNPIIM 299 (622) Q Consensus 224 d~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~--~i~~pF~rmtY~e--Am~~YGsDKPDLR~~lei 299 (622) +|.|.||++|||||+.||||++.+|.+.++|++|+++++.|++.. .++..| +.=+| +|+..+. T Consensus 244 kS~T~RHLsEFWMiEPE~AF~~l~d~~~~~E~~~k~~~k~VL~~~qellPkqf--i~~~enn~~~~L~~----------- 310 (495) T TIGR00457 244 KSNTSRHLSEFWMIEPEMAFANLNDLLQLIETLIKYIIKAVLENQQELLPKQF--ICSQENNLLKFLEK----------- 310 (495) T ss_pred CCCCCCCHHHHHHHHHHEEECCHHHHHHHHHHHHHHHHHHHHCCHHHHCCHHH--CCCCHHHHHHHHHH----------- T ss_conf 86487011122101120000013333899999999999998305245042010--37771379998664----------- Q ss_pred CCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHC Q ss_conf 24431017875245666874037847999962787665210356779998764202462355204777533357025532 Q gi|254780419|r 300 HNVTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHL 379 (622) Q Consensus 300 ~dit~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l 379 (622) +.+-...+... .+. ...+.+-.+++..++.+....++-. .+..+. . T Consensus 311 --------~~~~~~~~~l~---------~~i--~~~F~~i~Y~~Ai~iL~~~~~~~~~--~Fe~~d-~------------ 356 (495) T TIGR00457 311 --------NFDKDLIKKLE---------NII--NNKFARITYTDAIEILKESDNKEKK--NFEYED-F------------ 356 (495) T ss_pred --------HCCHHHHHHHH---------HHH--HCCCCEECHHHHHHHHHHHHHCCCC--CCCCCC-C------------ T ss_conf --------12756775225---------666--2267600478999998742111464--358886-1------------ Q ss_pred CHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEE-EEEEEECCCEEECCCCCCCCHHHCC Q ss_conf 527899999981887557269980885411001347889998773003557510-2889607705354554700021100 Q gi|254780419|r 380 NIVNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFE-FCWIMDFPFYEWNEEEKKIDFAHNP 458 (622) Q Consensus 380 ~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~-flWV~DFPLFe~dee~~r~~s~HHP 458 (622) +-|+ . .-.. ++..|.+.---+ |++||||| +.++ | T Consensus 357 --------------~wG~--------D-----L~~e------HER~LaE~~Fk~qPvfV~~YP--------k~~K----a 391 (495) T TIGR00457 357 --------------EWGI--------D-----LQTE------HERFLAEEYFKPQPVFVTDYP--------KDIK----A 391 (495) T ss_pred --------------CCCC--------C-----CCHH------HHHHHHHCCCCCCCEEEECCC--------CCCC----C T ss_conf --------------3488--------8-----6336------899876315898888997388--------5558----0 Q ss_pred CCCCCCCHHHHHCCCHHHHHHHCCCEEECC-E-EECCCCEECCCHHHHHHHHHHCCCCHH-HHHHHHHHHHHHHHCCCCC Q ss_conf 247664306554188777413136777756-7-721251231569999999998599989-9986677899998608497 Q gi|254780419|r 459 FTMPQGGMESLKGNDLLSIKAFQYDLVCNG-F-EIASGGIRNHMSEVMLQAFSNVGISKE-VVENRFGGLYRAFQCGMPP 535 (622) Q Consensus 459 FTaP~~~~~~l~~~dp~~v~a~~YDLVlNG-~-EiggGSiRIHd~~iQ~~~f~~lgl~~e-e~~~~Fg~lL~Al~yG~PP 535 (622) |+|=.+. +|+..| .+.||.+.+ + ||.|||+||||.|.+..+|+..|++.+ +. .||++-.|||.-| T Consensus 392 FYMk~NS------~D~~tV--~a~DlL~P~GiGEIiGGS~Re~dld~L~~r~k~~g~d~~G~l----~WY~DlRKyGs~P 459 (495) T TIGR00457 392 FYMKLNS------DDGKTV--AAVDLLAPKGIGEIIGGSEREDDLDKLEERMKEMGLDTDGAL----NWYLDLRKYGSVP 459 (495) T ss_pred HHCCCCC------CCCCEE--EEEEEECCCCEEEEECCCCCHHHHHHHHHHHHHCCCCCCCCC----HHHHHHCCEECCC T ss_conf 1053667------888747--899983789626651675432268999999996486887750----4326415120566 Q ss_pred CCCEECCHHHHHHHHCCCCCCEEEECCCCCCCCC Q ss_conf 3422246899999983739840321378999871 Q gi|254780419|r 536 HGGIAAGIDRIVMLLLGAKNVREVSLFPMSQNFC 569 (622) Q Consensus 536 HgG~alGlDRlvmll~g~~sIRdVIaFPKt~~g~ 569 (622) ||||||||||||+++||..||||||+||.+..-. T Consensus 460 H~GFGLGfERl~ay~~G~~niRD~IPFPR~~~~~ 493 (495) T TIGR00457 460 HSGFGLGFERLVAYITGLENIRDAIPFPRTPGNI 493 (495) T ss_pred CCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCC T ss_conf 4664324899999981787444166778775568 No 15 >pfam00152 tRNA-synt_2 tRNA synthetases class II (D, K and N). Probab=100.00 E-value=0 Score=626.80 Aligned_cols=331 Identities=45% Similarity=0.742 Sum_probs=272.1 Q ss_pred CHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCC-CCCCCCCCCCCHHHHHHCCCCHH Q ss_conf 98782112320001211112124589999999875203651462033211246456-64543222221034431123854 Q gi|254780419|r 121 PESIRFKYRFLDLRRDTLHKNIVKRTRIINSMRCRMIAENFIECSTPILTASSPEG-ARDFLVPSRVNQGSFYALPQAPQ 199 (622) Q Consensus 121 ~e~~rl~~R~LdLR~~~~~~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTpeG-ArdFlVPsR~~~g~fYALpQSPQ 199 (622) ++|+|+++||||||+++++++|++||++++++|+||.++||+||+||+|+++++|| |++|.|+.. ....+| |+|||| T Consensus 1 ~~e~r~~~R~ldlr~~~~~~~l~~Rs~i~~~iR~ff~~~~f~EV~TP~L~~~~~e~~a~~F~~~~~-~~~~~y-L~~Spe 78 (341) T pfam00152 1 SEETRLKYRYLDLRRPKMQANLKLRSKIIRAIREFLDERGFLEVETPILTKSTPEGGARDFLVPKF-YAKEAY-LPQSPQ 78 (341) T ss_pred CHHHHHCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCEEEECCC-CCCCEE-ECCCHH T ss_conf 922322470134148668999999999999999999988989987982005589877766450267-898234-054889 Q ss_pred HHHHHHHCCCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHC---------CCCC Q ss_conf 5644331057023311001103777665538000035654102689999999999999999999808---------7234 Q gi|254780419|r 200 QYKQLLMASGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSN---------GNTV 270 (622) Q Consensus 200 lyKQlLmv~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~---------~~~i 270 (622) +|||+||+||++||||||||||||+++++||++||||||+||+|+|++|+|+++|+|++++++.+.+ +..+ T Consensus 79 l~~k~ll~~g~~rVfei~~~FR~E~~~t~rH~~EFtmlE~y~a~~d~~d~m~~~E~li~~i~~~v~~~~~~~~~~~~~~~ 158 (341) T pfam00152 79 LYKQLLMVAGFDRVFQIAPCFRDEDLRTDRHPPEFTQLDLEMSFVDYEDVMDLTEDLIKYVFKEVKGKTEKGELLLGIEL 158 (341) T ss_pred HHHHHHHHCCCCCCEEEECHHCCCCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCC T ss_conf 99999986588763797232027989886550778877675535999999999999999999998634402441245547 Q ss_pred CCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHH Q ss_conf 57732121898875136777644476401244310178752456668740378479999627876652103567799987 Q gi|254780419|r 271 SNNFLRICYDDAIRRYGTDKPDLRNPIIMHNVTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARS 350 (622) Q Consensus 271 ~~pF~rmtY~eAm~~YGsDKPDLR~~lei~dit~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~ 350 (622) +.||+||||.||+.+|+.++||++++.++.++.+.. T Consensus 159 ~~~f~rity~ea~~~~~~~~~~~~~~~~l~~~~~~~-------------------------------------------- 194 (341) T pfam00152 159 PEPFPRITYAEAIERYGSDKPDLRFGLELKDVTEIK-------------------------------------------- 194 (341) T ss_pred CCCCEEEEHHHHHHHHCCCCCCCCCHHHHHHHHHHH-------------------------------------------- T ss_conf 898447429999999719998868879999999984-------------------------------------------- Q ss_pred HHHCCCCCEEEEECCCCCCCCCCCHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCC Q ss_conf 64202462355204777533357025532527899999981887557269980885411001347889998773003557 Q gi|254780419|r 351 QGQPGLAYIFWRLDESGNVAGMGPVAKHLNIVNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCN 430 (622) Q Consensus 351 ~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~ 430 (622) ... . .... . ....++.+.....+. .|.+.. T Consensus 195 -~~~------~--~~~~--~--------------------------------------~~~~~~~~~~~~~e~-~l~~~~ 224 (341) T pfam00152 195 -FED------A--ALNG--G--------------------------------------SNKLLGALRSELGER-LLGDEN 224 (341) T ss_pred -CCC------C--CCCC--C--------------------------------------HHHHHHHHHHHHHHH-HHHHCC T ss_conf -422------3--3345--5--------------------------------------244899999999888-755303 Q ss_pred CEEEEEEEECCCEEECCCCCCCCHHHCCCCCCCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHH Q ss_conf 51028896077053545547000211002476643065541887774131367777567721251231569999999998 Q gi|254780419|r 431 CFEFCWIMDFPFYEWNEEEKKIDFAHNPFTMPQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSN 510 (622) Q Consensus 431 ~~~flWV~DFPLFe~dee~~r~~s~HHPFTaP~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~ 510 (622) ...|+||+|||.. ..||+++..++ +| ..+.+|||++||.||++||+|+||++.|+++|+. T Consensus 225 ~~~p~fv~dyP~~------------~~pf~~~~~~~------~~--~~a~rFel~~~G~Ei~nG~~el~d~~~~~~rf~~ 284 (341) T pfam00152 225 LDNPVFVTDFPLF------------KRPFYMPKDED------PP--GLAERFDLVLNGGEIGGGSIRIHDPEEQRKRFEE 284 (341) T ss_pred CCCEEEEECCCCC------------CCCCCCCCCCC------CC--CCHHEEEECCCCEEEECCCCCCCCHHHHHHHHHH T ss_conf 7861999888844------------38666867877------76--5012157526974982575345999999999998 Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEEECCCCCCC Q ss_conf 599989998667789999860849734222468999999837398403213789998 Q gi|254780419|r 511 VGISKEVVENRFGGLYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREVSLFPMSQN 567 (622) Q Consensus 511 lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdVIaFPKt~~ 567 (622) .|++++++...+.|||+||+||+|||||||||+|||+|++||.+||||||+||++.+ T Consensus 285 ~~~~~~~~~~~d~~yl~a~~~G~PP~~G~glGidRL~m~l~g~~~Irdv~~FPr~~~ 341 (341) T pfam00152 285 LGLDPEEAEEKFGFYLDALKYGMPPHGGIGLGLDRLVMLLTGLESIREVIAFPKTRK 341 (341) T ss_pred HCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEHHHHHHHHHCCCCCHHHCCCCCCCCC T ss_conf 198902335658999999866979970565789999999808994887267899999 No 16 >cd00776 AsxRS_core Asx tRNA synthetase (AspRS/AsnRS) class II core domain. Assignment to class II aminoacyl-tRNA synthetases (aaRS) based upon its structure and the presence of three characteristic sequence motifs in the core domain. This family includes AsnRS as well as a subgroup of AspRS. AsnRS and AspRS are homodimers, which attach either asparagine or aspartate to the 3'OH group of ribose of the appropriate tRNA. While archaea lack asnRS, they possess a non-discriminating aspRS, which can mischarge Asp-tRNA with Asn. Subsequently, a tRNA-dependent aspartate amidotransferase converts the bound aspartate to asparagine. The catalytic core domain is primarily responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Probab=100.00 E-value=0 Score=617.44 Aligned_cols=304 Identities=28% Similarity=0.499 Sum_probs=236.9 Q ss_pred CCCHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCC-CCCCCCCCCCCHHHHHHCCCC Q ss_conf 7898782112320001211112124589999999875203651462033211246456-645432222210344311238 Q gi|254780419|r 119 EYPESIRFKYRFLDLRRDTLHKNIVKRTRIINSMRCRMIAENFIECSTPILTASSPEG-ARDFLVPSRVNQGSFYALPQA 197 (622) Q Consensus 119 ~~~e~~rl~~R~LdLR~~~~~~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTpeG-ArdFlVPsR~~~g~fYALpQS 197 (622) ++++|+|++|||||||++.++++|++||++++++|+||.++||+|||||+|+++++|| |+.|-+. .+...+| |+|| T Consensus 1 ~~~~e~r~~~R~Ldlr~~~~~~~l~~Rs~i~~~iR~fl~~~gF~EVeTP~L~~~~~eg~a~~f~~~--~~~~~~y-L~~S 77 (322) T cd00776 1 DANLETLLDNRHLDLRTPKVQAIFRIRSEVLRAFREFLRENGFTEVHTPKITSTDTEGGAELFKVS--YFGKPAY-LAQS 77 (322) T ss_pred CCCHHHHHCCCCEECCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCC--CCCCCEE-ECCC T ss_conf 979788761641242898799999999999999999999889999979840078898654667621--0587600-1548 Q ss_pred HHHHHHHHHCCCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEECCC-CHHHHHHHHHHHHHHHHHHHHCCC-------- Q ss_conf 54564433105702331100110377766553800003565410268-999999999999999999980872-------- Q gi|254780419|r 198 PQQYKQLLMASGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEMSFV-EQDDILNTMENVLRGVFEEFSNGN-------- 268 (622) Q Consensus 198 PQlyKQlLmv~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~EmsF~-~~edvm~l~E~li~~i~~~v~~~~-------- 268 (622) ||||||| ||||++||||||||||||+++++||.+||||||+||||+ |++|+|+++|+|++++++.+.... T Consensus 78 pel~Kql-li~G~~rVfei~~~FR~E~~~t~rH~pEFTmlE~e~af~~d~~d~m~~~E~li~~i~~~v~~~~~~~~~~~~ 156 (322) T cd00776 78 PQLYKEM-LIAALERVYEIGPVFRAEKSNTRRHLSEFWMLEAEMAFIEDYNEVMDLIEELIKYIFKRVLERCAKELELVN 156 (322) T ss_pred HHHHHHH-HHCCCCCEEEECCEEECCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 5889999-753424438872614079998753478887652555233289999999999999999999865267889987 Q ss_pred -------CCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCH Q ss_conf -------3457732121898875136777644476401244310178752456668740378479999627876652103 Q gi|254780419|r 269 -------TVSNNFLRICYDDAIRRYGTDKPDLRNPIIMHNVTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFC 341 (622) Q Consensus 269 -------~i~~pF~rmtY~eAm~~YGsDKPDLR~~lei~dit~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~ 341 (622) .+..||+|+||.||++. T Consensus 157 ~~~~~~~~~~~~~~r~~~~ea~~~-------------------------------------------------------- 180 (322) T cd00776 157 QLNRELLKPLEPFPRITYDEAIEL-------------------------------------------------------- 180 (322) T ss_pred HCCCCCCCCCCCCCCCCHHHHHHH-------------------------------------------------------- T ss_conf 512110024799642419999999-------------------------------------------------------- Q ss_pred HHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHH Q ss_conf 56779998764202462355204777533357025532527899999981887557269980885411001347889998 Q gi|254780419|r 342 DRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNIVNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIA 421 (622) Q Consensus 342 D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia 421 (622) .+..+... ...+. ..++.+... .++ T Consensus 181 ------l~~~~~~~------------~~~~~----~dl~~~~e~---------------------------------~l~ 205 (322) T cd00776 181 ------LREKGVEE------------EVKWG----EDLSTEHER---------------------------------LLG 205 (322) T ss_pred ------HHHCCCCC------------CCCCC----CCCCHHHHH---------------------------------HHH T ss_conf ------98569966------------87766----633769999---------------------------------999 Q ss_pred HHHHCCCCCCEEEEEEEECCCEEECCCCCCCCHHHCCCCCCCCCHHHHHCCCHHHHHHHCCCEEECCE-EECCCCEECCC Q ss_conf 77300355751028896077053545547000211002476643065541887774131367777567-72125123156 Q gi|254780419|r 422 QELGIINCNCFEFCWIMDFPFYEWNEEEKKIDFAHNPFTMPQGGMESLKGNDLLSIKAFQYDLVCNGF-EIASGGIRNHM 500 (622) Q Consensus 422 ~~l~li~~~~~~flWV~DFPLFe~dee~~r~~s~HHPFTaP~~~~~~l~~~dp~~v~a~~YDLVlNG~-EiggGSiRIHd 500 (622) +.+ .-.|+||+|||. .++||+|+..+ ++|. .+.+|||++||+ ||+|||+|+|| T Consensus 206 ~~~------~~~p~fv~d~P~------------~~~pfy~~~~~------~~~~--~~~rfel~~~G~~El~~g~~r~~d 259 (322) T cd00776 206 EIV------KGDPVFVTDYPK------------EIKPFYMKPDD------DNPE--TVESFDLLMPGVGEIVGGSQRIHD 259 (322) T ss_pred HHH------CCCCEEEECCHH------------HCCCCCCCCCC------CCHH--HHHHHHEECCCCEEECCHHHHHCC T ss_conf 996------589889979744------------21976588668------8545--545420012892786776664089 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEEECCCCCCC Q ss_conf 9999999998599989998667789999860849734222468999999837398403213789998 Q gi|254780419|r 501 SEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREVSLFPMSQN 567 (622) Q Consensus 501 ~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdVIaFPKt~~ 567 (622) ++.|+++|+..|++++. |+|||+||+||||||||+|||+|||||+|||++||||||+||||.+ T Consensus 260 ~~~l~~r~~~~g~~~~~----~~~yl~al~~G~PP~gG~glGiDRLvmll~g~~~Irdvi~FPr~~~ 322 (322) T cd00776 260 YDELEERIKEHGLDPES----FEWYLDLRKYGMPPHGGFGLGLERLVMWLLGLDNIREAILFPRDPK 322 (322) T ss_pred HHHHHHHHHHCCCCHHH----HHHHHHHHHCCCCCCCEEEEHHHHHHHHHCCCCCHHHEECCCCCCC T ss_conf 99999999985999789----9999999766999972454389999999829972760513899999 No 17 >KOG0556 consensus Probab=100.00 E-value=0 Score=605.16 Aligned_cols=431 Identities=28% Similarity=0.441 Sum_probs=333.9 Q ss_pred CHHHCCHHHCCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCC------HHHHHHHCCCCCCEEEEEEEEEECC Q ss_conf 0423697998988999979800444698799999779825999974881------5789985289885899999999758 Q gi|254780419|r 8 SCGDLGVSDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPDS------ACFEIAKSVRSEWVLCIDGLVMSRS 81 (622) Q Consensus 8 ~cg~l~~~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~~------~~~~~~~~l~~esvv~v~G~V~~r~ 81 (622) ...+|..+..|++|.|+|+||+.|..||++|+.||++..++||++..+. ...+.+++++.||+|.|.|+|.+-+ T Consensus 72 ~v~dl~~~~~~~~V~vRgrVhtsr~~GK~~FlvLRq~~~tVQ~~~~~~~~~~isk~Mvkf~~~is~ESiV~v~g~v~k~~ 151 (533) T KOG0556 72 DVSDLDESNDGSEVLVRGRVHTSRLKGKLCFLVLRQQGSTVQCLVAVNEDGTISKQMVKFAGSISKESIVDVRGVVVKVK 151 (533) T ss_pred EHHHHHHHCCCCEEEEEEEEEECCCCCEEEEEEEECCCCEEEEEEECCCCCHHHHHHHHHHHHCCCCEEEEEEEEEECCC T ss_conf 24342331278369999888513445507999994067359999971787607778999986458120899998871589 Q ss_pred CCCCCCCCCCCEEEEEEEEEEEEECCC-CCCCCCCCCCC-----------------CCHHHHCCCCCEEECCCCCCCCHH Q ss_conf 655588777865999988999950211-79986457667-----------------898782112320001211112124 Q gi|254780419|r 82 AETINANIITGQIELSAQKIEILSVAE-ELPLPVFGERE-----------------YPESIRFKYRFLDLRRDTLHKNIV 143 (622) Q Consensus 82 ~~~~n~~~~tG~~Ei~~~~~~il~~a~-~~P~~~~~~~~-----------------~~e~~rl~~R~LdLR~~~~~~~l~ 143 (622) +. -....+-++||++.++.++|.|. .|||.+.+..- ++.++||+||.||||.|..|++|| T Consensus 152 ~~--i~scT~qdvEi~v~~iyviS~a~~~LPl~veDasrse~~eE~a~~~~~~~~~Vn~dtRLdnRvlDLRtptnqAiFr 229 (533) T KOG0556 152 EP--IKSCTVQDVEIHVRKIYVISIALPNLPLQVEDASRSEPDEEKAAEPESTLARVNLDTRLDNRVLDLRTPTNQAIFR 229 (533) T ss_pred CC--CCCCCCCEEEEEEEEEEEEECCCCCCCEEEHHHCCCCCCHHHHCCCCCCCCEECCCCCCCCEEEECCCCCCHHEEE T ss_conf 85--6664300057899899997136666870322200356303453287665430012230133135313664421000 Q ss_pred HHHHHHHHHHHHHHCCCEEEEECCCCCC-CCCCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCC Q ss_conf 5899999998752036514620332112-464566454322222103443112385456443310570233110011037 Q gi|254780419|r 144 KRTRIINSMRCRMIAENFIECSTPILTA-SSPEGARDFLVPSRVNQGSFYALPQAPQQYKQLLMASGFDRYFQIAPCFRD 222 (622) Q Consensus 144 ~Rs~i~~~~R~~l~~~gF~EveTP~L~k-sTpeGArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G~dryfqiarcFRd 222 (622) +.+.|..++|+||..+||+||+||.|.. |+.+||.-|-|- .+++++| |+|||||||||++||+|+|||+||++||+ T Consensus 230 iq~gvc~~FRe~L~~kgF~EIhTpKli~asSEGGanvF~v~--Yfk~~A~-LAQSPQLyKQMaI~gdf~rVyeIGpVfRA 306 (533) T KOG0556 230 IQAGVCFAFREYLRSKGFVEIHTPKLIGASSEGGANVFRVS--YFKQKAY-LAQSPQLYKQMAICGDFERVYEIGPVFRA 306 (533) T ss_pred HHHHHHHHHHHHHHHCCCCEECCCCCCCCCCCCCCEEEEEE--ECCCCCH-HHCCHHHHHHHHHHCCHHHEEEECCEEEC T ss_conf 68889999999997568604326320146677772268998--6067503-20585888778876061104652355660 Q ss_pred CCCCCCCCCCCEEEEEEEECCC-CHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 7766553800003565410268-999999999999999999980872345773212189887513677764447640124 Q gi|254780419|r 223 EDPRSDRLPGEFYQLDMEMSFV-EQDDILNTMENVLRGVFEEFSNGNTVSNNFLRICYDDAIRRYGTDKPDLRNPIIMHN 301 (622) Q Consensus 223 Ed~R~dRqp~EFTqlD~EmsF~-~~edvm~l~E~li~~i~~~v~~~~~i~~pF~rmtY~eAm~~YGsDKPDLR~~lei~d 301 (622) |||++.||.+||+.||+||+|- +|++||+.+.+|+..+|+.+.. ||.-||.. T Consensus 307 EdSnthRhltEFvGLD~EMaf~~hYhEVm~~i~~lfv~IF~~l~e---------------------------ry~~Eie~ 359 (533) T KOG0556 307 EDSNTHRHLTEFVGLDLEMAFNEHYHEVMDTIGELFVFIFKGLRE---------------------------RYAKEIET 359 (533) T ss_pred CCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------------------------HHHHHHHH T ss_conf 446416666776174225477777999999999999999987888---------------------------88999999 Q ss_pred CCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCH Q ss_conf 43101787524566687403784799996278766521035677999876420246235520477753335702553252 Q gi|254780419|r 302 VTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNI 381 (622) Q Consensus 302 it~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~ 381 (622) +...++-.+|+....++ | +.. T Consensus 360 Vr~qyp~e~fkf~~~~l-------------------r----------------------------------------l~~ 380 (533) T KOG0556 360 VRKQYPFEPFKFLEPPL-------------------R----------------------------------------LTF 380 (533) T ss_pred HHHCCCCCCCCCCCCCE-------------------E----------------------------------------EEH T ss_conf 86108886333578755-------------------7----------------------------------------655 Q ss_pred HHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEEECCCCCCCCHHHCCC-C Q ss_conf 789999998188755726998088541100134788999877300355751028896077053545547000211002-4 Q gi|254780419|r 382 VNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYEWNEEEKKIDFAHNPF-T 460 (622) Q Consensus 382 ~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe~dee~~r~~s~HHPF-T 460 (622) .+.-++....+.+.||. .|-.....+.||.+-.+ .+ .-.|--+-+||+ +-.|| | T Consensus 381 ~e~v~mLreaGvE~g~~----dDlsTe~Ek~LG~lV~e---ky------~tdfyildkyP~------------avRPFYT 435 (533) T KOG0556 381 KEGVAMLREAGVEMGDE----DDLSTESEKKLGQLVRE---KY------DTDFYILDKYPL------------AVRPFYT 435 (533) T ss_pred HHHHHHHHHCCCCCCCC----CCCCCHHHHHHHHHHHH---HH------CCCEEEECCCCC------------CCCCCCC T ss_conf 88999999728635872----23588658899999998---71------784799714754------------4466433 Q ss_pred CCCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEE Q ss_conf 76643065541887774131367777567721251231569999999998599989998667789999860849734222 Q gi|254780419|r 461 MPQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMPPHGGIA 540 (622) Q Consensus 461 aP~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~a 540 (622) ||.+++.. .+++||..+.|.||-||.||||||+++.++.+..|+++... +-|+++|+||||||||.| T Consensus 436 mpd~~~p~---------ySnSyD~fmRGeEIlSGAQRIhdpe~L~era~~hGid~~~i----~~YidsFryG~PPHaGgG 502 (533) T KOG0556 436 MPDPENPR---------YSNSYDFFMRGEEILSGAQRIHDPELLVERAKEHGIDPSKI----STYIDSFRYGAPPHAGGG 502 (533) T ss_pred CCCCCCCC---------CCCCHHHEECHHHHHCCCCCCCCHHHHHHHHHHCCCCHHHH----HHHHHHHCCCCCCCCCCC T ss_conf 67888877---------34414322346666344300378899999999739887899----988887405899877875 Q ss_pred CCHHHHHHHHCCCCCCEEEECCCCCCC Q ss_conf 468999999837398403213789998 Q gi|254780419|r 541 AGIDRIVMLLLGAKNVREVSLFPMSQN 567 (622) Q Consensus 541 lGlDRlvmll~g~~sIRdVIaFPKt~~ 567 (622) +|++|+||+++|..|||-...||..-+ T Consensus 503 IGLERvvmlyl~L~nIR~~SlFPRDPk 529 (533) T KOG0556 503 IGLERVVMLYLGLNNIRKTSLFPRDPK 529 (533) T ss_pred CCHHHHHHHHHCCCCCHHHCCCCCCCC T ss_conf 159999999816775001025788825 No 18 >COG1190 LysU Lysyl-tRNA synthetase (class II) [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=599.51 Aligned_cols=422 Identities=27% Similarity=0.448 Sum_probs=334.4 Q ss_pred CCCCHHHCCHHHCCC----------EEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCC---HHHH-HHHCCCCCCE Q ss_conf 742042369799898----------8999979800444698799999779825999974881---5789-9852898858 Q gi|254780419|r 5 RTHSCGDLGVSDVGS----------FVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPDS---ACFE-IAKSVRSEWV 70 (622) Q Consensus 5 Rth~cg~l~~~~~g~----------~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~~---~~~~-~~~~l~~esv 70 (622) |||++.++++.+.++ +|+++|+|..+|.+||++|++|.|.+|.+|++++++. +.++ ..+.+..+++ T Consensus 38 ~~~~~~~l~~~~~~~~~~el~~~~~~v~vAGRi~~~R~~GK~~F~~i~d~~gkiQ~yi~k~~~~~~~~~~~~~~~dlGDi 117 (502) T COG1190 38 RTHTSADLREKYADKTKEELEALNIEVSVAGRIMTIRNMGKASFADLQDGSGKIQLYVNKDEVGEEVFEALFKKLDLGDI 117 (502) T ss_pred CCCCHHHHHHHHHCCCHHHHHHCCCEEEEECCEEEECCCCCEEEEEEECCCCEEEEEEECCCCCHHHHHHHHHCCCCCCE T ss_conf 64549999999753240233200550588432663202575258999408963899995465464668888632665788 Q ss_pred EEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCHHHHCCCCCEEE-CCCCCCCCHHHHHHHH Q ss_conf 999999997586555887778659999889999502117998645766789878211232000-1211112124589999 Q gi|254780419|r 71 LCIDGLVMSRSAETINANIITGQIELSAQKIEILSVAEELPLPVFGEREYPESIRFKYRFLDL-RRDTLHKNIVKRTRII 149 (622) Q Consensus 71 v~v~G~V~~r~~~~~n~~~~tG~~Ei~~~~~~il~~a~~~P~~~~~~~~~~e~~rl~~R~LdL-R~~~~~~~l~~Rs~i~ 149 (622) |+|+|.+.. .+|||+.|.|+++++|++|.. |+|-+.+...+.|+|+|+||||| -+++.+..|..||+++ T Consensus 118 igv~G~~~~---------T~~GelSv~v~~~~lLsKsL~-pLPeK~hgL~D~E~RyR~RylDLi~N~e~r~~f~~Rs~ii 187 (502) T COG1190 118 IGVEGPLFK---------TKTGELSVSVEELRLLSKSLR-PLPEKFHGLTDKEIRYRQRYLDLIVNPESRQTFIKRSKII 187 (502) T ss_pred EEEEEEEEE---------CCCCCEEEEEEEEEEECCCCC-CCCHHHCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 866420565---------489846899877756202077-7974543786388899988889865999999999999999 Q ss_pred HHHHHHHHCCCEEEEECCCCCCCCCCC--CCCCCCCCCCCHH----HHHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCCC Q ss_conf 999875203651462033211246456--6454322222103----4431123854564433105702331100110377 Q gi|254780419|r 150 NSMRCRMIAENFIECSTPILTASSPEG--ARDFLVPSRVNQG----SFYALPQAPQQYKQLLMASGFDRYFQIAPCFRDE 223 (622) Q Consensus 150 ~~~R~~l~~~gF~EveTP~L~ksTpeG--ArdFlVPsR~~~g----~fYALpQSPQlyKQlLmv~G~dryfqiarcFRdE 223 (622) ++||+||+++||+|||||+|. .-++| ||+|. +|++ .+| |.-+|+||-..|+||||+|||+|+|+|||| T Consensus 188 ~~iR~fl~~~gFlEVETP~lq-~i~GGA~ArPF~----ThhNald~dly-LRIApELyLKRliVGG~erVfEIgr~FRNE 261 (502) T COG1190 188 RAIREFLDDRGFLEVETPMLQ-PIPGGAAARPFI----THHNALDMDLY-LRIAPELYLKRLIVGGFERVFEIGRNFRNE 261 (502) T ss_pred HHHHHHHHHCCCEEECCCCCC-CCCCCCCCCCCE----EEECCCCCCEE-EEECCHHHHHHHHHCCCHHHEEECCCCCCC T ss_conf 999999987797584160003-557873226501----22023677568-762418999988753721422305520037 Q ss_pred CCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCC---------CCCCCCCCCCCHHHHHHHCCC-CCCCC Q ss_conf 76655380000356541026899999999999999999998087---------234577321218988751367-77644 Q gi|254780419|r 224 DPRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNG---------NTVSNNFLRICYDDAIRRYGT-DKPDL 293 (622) Q Consensus 224 d~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~---------~~i~~pF~rmtY~eAm~~YGs-DKPDL 293 (622) +..+.+.| |||+|++.+||+|++|+|+++|+|++++.+++.+. +++..||.|+++.||+..|.. |. T Consensus 262 Gid~tHNP-EFTmlE~Y~AYaDy~D~m~ltE~Li~~~a~~v~gt~~v~y~~~~id~~~pf~ri~m~dal~e~~g~~~--- 337 (502) T COG1190 262 GIDTTHNP-EFTMLEFYQAYADYEDLMDLTEELIKELAKEVNGTTKVTYGGQEIDFSKPFKRITMVDALKEYLGVDF--- 337 (502) T ss_pred CCCCCCCC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCEEEECCCCEEEEEHHHHHHHHHCCCC--- T ss_conf 87666484-23568999998579999999999999999995497378789976754897225664899998718665--- Q ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCC Q ss_conf 47640124431017875245666874037847999962787665210356779998764202462355204777533357 Q gi|254780419|r 294 RNPIIMHNVTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMG 373 (622) Q Consensus 294 R~~lei~dit~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~s 373 (622) .++ ...+.+.+.++..+.+... T Consensus 338 ------~~~------------------------------------~~~e~~~~~ak~~~i~~~~---------------- 359 (502) T COG1190 338 ------DDL------------------------------------FDDEEAKELAKKHGIEVEK---------------- 359 (502) T ss_pred ------CCC------------------------------------CCHHHHHHHHHHCCCCCCC---------------- T ss_conf ------545------------------------------------8879999999980897576---------------- Q ss_pred CHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEEECCCCCCCC Q ss_conf 02553252789999998188755726998088541100134788999877300355751028896077053545547000 Q gi|254780419|r 374 PVAKHLNIVNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYEWNEEEKKID 453 (622) Q Consensus 374 pi~K~l~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe~dee~~r~~ 453 (622) .. . +..|++-.++.++ +++.....|+||+|||- +.. T Consensus 360 --------------------------------~~-~-~~~g~ll~~lFe~--~vE~~liqPTFv~d~P~-----eiS--- 395 (502) T COG1190 360 --------------------------------YG-T-WGLGHLLNELFEE--LVEAKLIQPTFVTDHPV-----EIS--- 395 (502) T ss_pred --------------------------------CC-C-CCHHHHHHHHHHH--HHHHHHCCCCEEECCCC-----CCC--- T ss_conf --------------------------------55-5-3678999999987--76564148824616854-----368--- Q ss_pred HHHCCCCCCCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHH------CCCCHH-HHHHHHHHHH Q ss_conf 211002476643065541887774131367777567721251231569999999998------599989-9986677899 Q gi|254780419|r 454 FAHNPFTMPQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSN------VGISKE-VVENRFGGLY 526 (622) Q Consensus 454 s~HHPFTaP~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~------lgl~~e-e~~~~Fg~lL 526 (622) |.+-++.+ +|. ....|+|.+||.||+-|---.|||..|.++|+. +|-+++ .+++. |+ T Consensus 396 ----PLak~~~~-------~p~--~teRFElfi~g~EiaNaysELNDP~dQ~~RF~~Q~~~k~~Gddea~~~Ded---fv 459 (502) T COG1190 396 ----PLAKRHRS-------NPG--LTERFELFIGGKEIANAYSELNDPVDQRERFEAQVEEKEAGDDEAMELDED---FV 459 (502) T ss_pred ----CCCCCCCC-------CCC--HHHHHEEEECCEEEEECCCHHCCHHHHHHHHHHHHHHHHCCCCCCCCCCHH---HH T ss_conf ----66567988-------865--132210135458763112220699999999999999987299300115799---99 Q ss_pred HHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEEECCCC Q ss_conf 99860849734222468999999837398403213789 Q gi|254780419|r 527 RAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREVSLFPM 564 (622) Q Consensus 527 ~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdVIaFPK 564 (622) +||+|||||+||+|||+|||||+|||+.||||||+||- T Consensus 460 ~ALeyGmPPTgG~GiGIDRLvMllT~~~sIRdVilFP~ 497 (502) T COG1190 460 EALEYGMPPTGGLGIGIDRLVMLLTNSPSIRDVILFPA 497 (502) T ss_pred HHHHCCCCCCCCCCCCHHHHHHHHCCCCCHHHEECCCC T ss_conf 99866999988751257777777739985542221565 No 19 >PRK09350 lysyl-tRNA synthetase; Provisional Probab=100.00 E-value=0 Score=599.82 Aligned_cols=304 Identities=22% Similarity=0.368 Sum_probs=241.1 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCC------CCCCCCCCCHHHHHHCCCCHHHHHHHHHC Q ss_conf 121111212458999999987520365146203321124645664------54322222103443112385456443310 Q gi|254780419|r 134 RRDTLHKNIVKRTRIINSMRCRMIAENFIECSTPILTASSPEGAR------DFLVPSRVNQGSFYALPQAPQQYKQLLMA 207 (622) Q Consensus 134 R~~~~~~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTpeGAr------dFlVPsR~~~g~fYALpQSPQlyKQlLmv 207 (622) |.....+||++||++++++|+||+++||+|||||+|+++||+||+ +|+||++.+++.+| |+||||||||+||+ T Consensus 8 ~~~as~~~l~~Rs~i~~~iR~ff~~~gFlEVeTP~L~~~t~~~~~~~~f~~~~~~~~~~~~~~~y-L~~SPql~~k~l~~ 86 (325) T PRK09350 8 QPSASIKNLLKRAAIIAEIRRFFADRGVLEVETPAMSQATVTDIHLVPFETRFVGPGHSQGKTLW-LMTSPEYHMKRLLA 86 (325) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCEEEECCCCCCCCCCCEE-EECCHHHHHHHHHH T ss_conf 77147999999999999999999988968977984257788764676222102565545676645-50791999999986 Q ss_pred CCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHCC Q ss_conf 57023311001103777665538000035654102689999999999999999999808723457732121898875136 Q gi|254780419|r 208 SGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFLRICYDDAIRRYG 287 (622) Q Consensus 208 ~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~rmtY~eAm~~YG 287 (622) ||++||||||||||||+++++||| ||||||+||||+|++|+|+++|+|++++++ ..||++|||.+||.+|+ T Consensus 87 ~G~~rvfqI~~~FR~E~~~~~H~p-EFtmlE~~~~~~d~~d~m~~~e~ll~~~~~--------~~~~~~~~~~~~~~~~~ 157 (325) T PRK09350 87 AGSGPIFQLCRSFRNEEAGRYHNP-EFTMLEWYRPHYDMYRLMNEVDDLLQQVLD--------CPPAESLSYQQAFLRYL 157 (325) T ss_pred CCCCCEEEEEHHHCCCCCCCCCCC-HHHHHHHHHHCCCHHHHHHHHHHHHHHHHC--------CCCCCEEEHHHHHHHHH T ss_conf 678855896113228999876573-778889998154899999999999999974--------89852641999999981 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCC Q ss_conf 77764447640124431017875245666874037847999962787665210356779998764202462355204777 Q gi|254780419|r 288 TDKPDLRNPIIMHNVTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESG 367 (622) Q Consensus 288 sDKPDLR~~lei~dit~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~ 367 (622) .++|...-. ..+.+.++..+... +. . . T Consensus 158 ~~~~~~~~~---------------------------------------------~~l~~~~~~~~~~~---~~-~--~-- 184 (325) T PRK09350 158 GIDPLSADK---------------------------------------------TQLREVAAKLDLSN---IA-D--T-- 184 (325) T ss_pred CCCCCCCCH---------------------------------------------HHHHHHHHHCCCCC---CC-C--C-- T ss_conf 999654888---------------------------------------------99999999749975---66-7--5-- Q ss_pred CCCCCCCHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEEECC Q ss_conf 53335702553252789999998188755726998088541100134788999877300355751028896077053545 Q gi|254780419|r 368 NVAGMGPVAKHLNIVNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYEWNE 447 (622) Q Consensus 368 ~~~~~spi~K~l~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe~de 447 (622) .+....+...+ . ...+-..+ +.-.|+||+||| T Consensus 185 -------------~~~~~~l~~~l---------------------~---~~~ve~~l-----~~~~P~fv~d~P------ 216 (325) T PRK09350 185 -------------EEDRDTLLQLL---------------------F---TFGVEPNI-----GKEKPTFVYHFP------ 216 (325) T ss_pred -------------CCCHHHHHHHH---------------------H---HHHHHHHC-----CCCCCEEEECCC------ T ss_conf -------------34787999999---------------------9---99888750-----765876995584------ Q ss_pred CCCCCCHHHCCCCCCCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHH-------HCCCCHHHHHH Q ss_conf 54700021100247664306554188777413136777756772125123156999999999-------85999899986 Q gi|254780419|r 448 EEKKIDFAHNPFTMPQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFS-------NVGISKEVVEN 520 (622) Q Consensus 448 e~~r~~s~HHPFTaP~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~-------~lgl~~ee~~~ 520 (622) +.++||+++++++. . .+.+|||++||+||++||+||||++.|+++|+ .+|++++++++ T Consensus 217 ------~~~~~~~~~~~~d~-------~--~a~rfdl~~~G~El~~G~~el~d~~~~~~r~~~~~~~~~~~g~~~~~~d~ 281 (325) T PRK09350 217 ------ASQAALAQISTEDH-------R--VAERFEVYFKGIELANGFHELTDAREQLQRFEQDNRKRAARGLPQQPIDQ 281 (325) T ss_pred ------CCCCHHHCCCCCCC-------C--HHHHHHCCCCCEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHH T ss_conf ------33370323587898-------5--22321010487896676456689999999999999999857998005589 Q ss_pred HHHHHHHHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEEECCCCCC Q ss_conf 6778999986084973422246899999983739840321378999 Q gi|254780419|r 521 RFGGLYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREVSLFPMSQ 566 (622) Q Consensus 521 ~Fg~lL~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdVIaFPKt~ 566 (622) |||+||+||||||||||||+|||||+|||++|||||||||..- T Consensus 282 ---~~l~a~~yG~PPhgG~glGlDRLvm~l~g~~~IRdvi~FP~dR 324 (325) T PRK09350 282 ---NLIAALKAGLPDCSGVALGVDRLIMLALGAESIAEVIAFPVDR 324 (325) T ss_pred ---HHHHHHHCCCCCCCEEEEHHHHHHHHHHCCCCHHHEECCCCCC T ss_conf ---9999966799998357655999999981899388710788554 No 20 >cd00775 LysRS_core Lys_tRNA synthetase (LysRS) class II core domain. Class II LysRS is a dimer which attaches a lysine to the 3' OH group of ribose of the appropriate tRNA. Its assignment to class II aaRS is based upon its structure and the presence of three characteristic sequence motifs in the core domain. It is found in eukaryotes as well as some prokaryotes and archaea. However, LysRS belongs to class I aaRS's in some prokaryotes and archaea. The catalytic core domain is primarily responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Probab=100.00 E-value=0 Score=595.43 Aligned_cols=312 Identities=23% Similarity=0.413 Sum_probs=236.2 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCC--CCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCCHHH Q ss_conf 211112124589999999875203651462033211246456--645432222210344311238545644331057023 Q gi|254780419|r 135 RDTLHKNIVKRTRIINSMRCRMIAENFIECSTPILTASSPEG--ARDFLVPSRVNQGSFYALPQAPQQYKQLLMASGFDR 212 (622) Q Consensus 135 ~~~~~~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTpeG--ArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G~dr 212 (622) ||+++++|++||++++++|+||.++||+||+||+| +|||+| |++|+||++.+.+.+| |+||||||||+||+||++| T Consensus 1 n~~~~~~~~~Rs~i~~~iR~ff~~~gF~Ev~TPiL-~~~~~g~~~~~f~~~~~~~~~~~y-L~qSpQl~~q~l~~~g~~r 78 (329) T cd00775 1 NEEVRQTFIVRSKIISYIRKFLDDRGFLEVETPML-QPIAGGAAARPFITHHNALDMDLY-LRIAPELYLKRLIVGGFER 78 (329) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCC-CCCCCCCCCCCEEECCCCCCCCCC-CCCCHHHHHHHHHHCCCCC T ss_conf 97899999999999999999999889889979865-566887567531322446786723-0689899999998567776 Q ss_pred HEEEHHHHCCCCCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCC---------CCCCCCCCCCCHHHHH Q ss_conf 3110011037776655380000356541026899999999999999999998087---------2345773212189887 Q gi|254780419|r 213 YFQIAPCFRDEDPRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNG---------NTVSNNFLRICYDDAI 283 (622) Q Consensus 213 yfqiarcFRdEd~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~---------~~i~~pF~rmtY~eAm 283 (622) |||||||||||+++++|+| ||||||+||||+|++|||+++|+|++++++++.+. ..+..||+||||.||+ T Consensus 79 vfqI~p~FR~E~~~~rHl~-EFtmle~E~~f~d~~dvm~~~E~li~~i~~~~~~~~~~~~~~~~~d~~~pf~rit~~ea~ 157 (329) T cd00775 79 VYEIGRNFRNEGIDLTHNP-EFTMIEFYEAYADYNDMMDLTEDLFSGLVKKINGKTKIEYGGKELDFTPPFKRVTMVDAL 157 (329) T ss_pred EEEEEECCCCCCCCCCCCC-HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHH T ss_conf 7998102247999977473-456345777447999999999999999999983887531466123568997437699999 Q ss_pred HHC-CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEE Q ss_conf 513-6777644476401244310178752456668740378479999627876652103567799987642024623552 Q gi|254780419|r 284 RRY-GTDKPDLRNPIIMHNVTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWR 362 (622) Q Consensus 284 ~~Y-GsDKPDLR~~lei~dit~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~ 362 (622) .+| |.|-++.... ...+..+..+.. ..+ . T Consensus 158 ~~~~g~~~~~~~~~----------------------------------------------~~~~~~k~~~~~-~~~---~ 187 (329) T cd00775 158 KEKTGIDFPELDLE----------------------------------------------QPEELAKLLAKL-IKE---K 187 (329) T ss_pred HHHHCCCCCCCCCC----------------------------------------------CHHHHHHHHHHH-CCC---C T ss_conf 99858998766732----------------------------------------------389999998775-686---6 Q ss_pred ECCCCCCCCCCCHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCC Q ss_conf 04777533357025532527899999981887557269980885411001347889998773003557510288960770 Q gi|254780419|r 363 LDESGNVAGMGPVAKHLNIVNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPF 442 (622) Q Consensus 363 ~~~~~~~~~~spi~K~l~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPL 442 (622) ... ... .+.+..++.. ..++++...++||+|||. T Consensus 188 ~~~--~~~------------------------------------------~~~~~~~~~~--~~ie~~~~~p~fi~dyP~ 221 (329) T cd00775 188 IEK--PRT------------------------------------------LGKLLDKLFE--EFVEPTLIQPTFIIDHPV 221 (329) T ss_pred CCC--CCC------------------------------------------HHHHHHHHHH--HHHHHHCCCCEEEECCHH T ss_conf 567--877------------------------------------------8999999999--999751379689947701 Q ss_pred EEECCCCCCCCHHHCCCCCCCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHHCC----CCHHHH Q ss_conf 5354554700021100247664306554188777413136777756772125123156999999999859----998999 Q gi|254780419|r 443 YEWNEEEKKIDFAHNPFTMPQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSNVG----ISKEVV 518 (622) Q Consensus 443 Fe~dee~~r~~s~HHPFTaP~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~lg----l~~ee~ 518 (622) ...||+++++++ |. .+.+|||++||+||+||++|+||++.|+.+|+... ....+. T Consensus 222 ------------~~~pf~~~~~~~-------~~--~~~~fdl~~~G~Ei~~G~~el~d~~~q~~r~~~~~~~~~~~~~~~ 280 (329) T cd00775 222 ------------EISPLAKRHRSN-------PG--LTERFELFICGKEIANAYTELNDPFDQRERFEEQAKQKEAGDDEA 280 (329) T ss_pred ------------HCCCHHHCCCCC-------CC--EEEECCCCCCCEEEECCEEEECCHHHHHHHHHHHHHHHHCCCCCH T ss_conf ------------118443129999-------77--010012456877984153311799999999999999985467511 Q ss_pred HHHHHHHHHHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEEECCCCCC Q ss_conf 866778999986084973422246899999983739840321378999 Q gi|254780419|r 519 ENRFGGLYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREVSLFPMSQ 566 (622) Q Consensus 519 ~~~Fg~lL~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdVIaFPKt~ 566 (622) ..-..+||+|++||+|||||||||+|||||+|||.+||||||||||.. T Consensus 281 ~~~de~yl~a~k~G~pPhgG~glGldRLvm~l~g~~nIRdvi~FP~~r 328 (329) T cd00775 281 MMMDEDFVTALEYGMPPTGGLGIGIDRLVMLLTDSNSIRDVILFPAMR 328 (329) T ss_pred HHHHHHHHHHHHCCCCCCCEEEHHHHHHHHHHHCCCHHHHCCCCCCCC T ss_conf 242399999966799986114099999999993898187513799989 No 21 >PRK06462 asparagine synthetase A; Reviewed Probab=100.00 E-value=0 Score=553.70 Aligned_cols=313 Identities=21% Similarity=0.314 Sum_probs=234.5 Q ss_pred CHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCHHH Q ss_conf 98782112320001211112124589999999875203651462033211246456645432222210344311238545 Q gi|254780419|r 121 PESIRFKYRFLDLRRDTLHKNIVKRTRIINSMRCRMIAENFIECSTPILTASSPEGARDFLVPSRVNQGSFYALPQAPQQ 200 (622) Q Consensus 121 ~e~~rl~~R~LdLR~~~~~~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTpeGArdFlVPsR~~~g~fYALpQSPQl 200 (622) .-|+|++||+++||+++++++||+||++++++|+||+++||+|||||+|+.+|++|+.+..+++..+.|+.+.|+||||| T Consensus 8 ~~~~~~~~r~~~ir~~~~~~ifriRs~i~~~iR~f~~~~gF~EV~TPiL~~~~~~~~~~~~~~~~~~~~~~~~L~~Spql 87 (332) T PRK06462 8 MIEEWVRYSWKIISSEKYSLVLKIQSSILRYTREFLDGRGFVEVLPPIISPSTDPLMGDAKPASIDFYGVEYYLADSMIF 87 (332) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCEEECCCCCEEECCCHHH T ss_conf 89998730244025878999999999999999999988899998797236778876677601058448986331458699 Q ss_pred HHHHHHCCCHHHHEEEHHHHCCCC--CCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 644331057023311001103777--665538000035654102689999999999999999999808723457732121 Q gi|254780419|r 201 YKQLLMASGFDRYFQIAPCFRDED--PRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFLRIC 278 (622) Q Consensus 201 yKQlLmv~G~dryfqiarcFRdEd--~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~rmt 278 (622) |||++++||++||||||||||||+ +.+.||.+||||||+||||+|++|+|+++|+|++++++.+.+. T Consensus 88 ~lk~li~~g~~rVfeIg~~FR~E~~d~~t~rHlpEFt~lE~y~a~~d~~d~m~~~E~li~~i~~~v~~~----------- 156 (332) T PRK06462 88 HKQLMLRLLKGKVFYLSPNFRLEPVDKDTGRHLYEFTQLDIEIEGADLEEVMSLAEDLIKYLVKELLRE----------- 156 (332) T ss_pred HHHHHHHCCCCCEEEECCHHHCCCCCCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH----------- T ss_conf 999998638997799745231687776777654577765587873899999999999999999999986----------- Q ss_pred HHHHH-HHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 89887-51367776444764012443101787524566687403784799996278766521035677999876420246 Q gi|254780419|r 279 YDDAI-RRYGTDKPDLRNPIIMHNVTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLA 357 (622) Q Consensus 279 Y~eAm-~~YGsDKPDLR~~lei~dit~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~ 357 (622) |.+.+ ..+|.|.||+..+....+. +...+..+..+... T Consensus 157 ~~~~~~~~~~~~~~~~~~~~~ri~~---------------------------------------~eai~~l~~~~~~~-- 195 (332) T PRK06462 157 HSAELEFFLGRDLPEIKRPFKRITH---------------------------------------KEAIEILNEEGCSG-- 195 (332) T ss_pred HHHHHHHHCCCCCCCCCCCHHHCCH---------------------------------------HHHHHHHHHCCCCC-- T ss_conf 6799998707775235981145469---------------------------------------99999998627656-- Q ss_pred CEEEEECCCCCCCCCCCHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEE Q ss_conf 23552047775333570255325278999999818875572699808854110013478899987730035575102889 Q gi|254780419|r 358 YIFWRLDESGNVAGMGPVAKHLNIVNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWI 437 (622) Q Consensus 358 ~i~~~~~~~~~~~~~spi~K~l~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV 437 (622) .. ...++.+..+ .+.+. .-.|+|| T Consensus 196 ---------------~~-~~~~~~~~e~---------------------------------~l~~~-------~~~P~fv 219 (332) T PRK06462 196 ---------------EE-LEEIGSEGEK---------------------------------SLSEH-------FEEPVWI 219 (332) T ss_pred ---------------CH-HHHHHHHHHH---------------------------------HHHHH-------HCCCEEE T ss_conf ---------------60-7675459999---------------------------------99998-------0897899 Q ss_pred EECCCEEECCCCCCCCHHHCCCCCCCCCHHHHHCCCHHHHHHHCCCEEE-CCE-EECCCCEECCCHHHHHHHHHHCCCCH Q ss_conf 6077053545547000211002476643065541887774131367777-567-72125123156999999999859998 Q gi|254780419|r 438 MDFPFYEWNEEEKKIDFAHNPFTMPQGGMESLKGNDLLSIKAFQYDLVC-NGF-EIASGGIRNHMSEVMLQAFSNVGISK 515 (622) Q Consensus 438 ~DFPLFe~dee~~r~~s~HHPFTaP~~~~~~l~~~dp~~v~a~~YDLVl-NG~-EiggGSiRIHd~~iQ~~~f~~lgl~~ 515 (622) +|||. ...||++.+++ +|. .+.+|||++ ||+ ||.|||+|+|+++.|.+.++..|+++ T Consensus 220 ~d~P~------------~~~Pfy~~~~~-------~~~--~~~~fdl~~~~g~~El~~g~~r~~~~~~l~~~~~~~g~~~ 278 (332) T PRK06462 220 INIPK------------GSREFYDREDE-------RPG--VLRDYDLLLPEGIGEAVSGGEREYEYEEIVERIREHGVDP 278 (332) T ss_pred ECCHH------------HHCHHHHCCCC-------CCC--EEEECCHHHHCCCEEECCHHHHHHHHHHHHHHHHHCCCCH T ss_conf 75625------------52958664899-------988--4875360330666222251767855999999999769997 Q ss_pred HHHHHHHHHHHHHHHCCCC-CCCCEECCHHHHHHHHCCCCCCEEEECCCCCC Q ss_conf 9998667789999860849-73422246899999983739840321378999 Q gi|254780419|r 516 EVVENRFGGLYRAFQCGMP-PHGGIAAGIDRIVMLLLGAKNVREVSLFPMSQ 566 (622) Q Consensus 516 ee~~~~Fg~lL~Al~yG~P-PHgG~alGlDRlvmll~g~~sIRdVIaFPKt~ 566 (622) +. +.|||+|++||+| ||||||||+|||||+|||.+||||||+||+.- T Consensus 279 ~~----~~~yl~~~~~G~P~P~gG~GiGiDRLvm~l~~~~~Irdvi~FPR~p 326 (332) T PRK06462 279 EN----YKWYLEMAKEGYPLPTAGFGIGVERLTRYICGLRDIREVQPFPRVP 326 (332) T ss_pred HH----HHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHCCCCHHEEEECCCCC T ss_conf 89----9999999982899998776148999999980898576174179998 No 22 >cd00669 Asp_Lys_Asn_RS_core Asp_Lys_Asn_tRNA synthetase class II core domain. This domain is the core catalytic domain of class II aminoacyl-tRNA synthetases of the subgroup containing aspartyl, lysyl, and asparaginyl tRNA synthetases. It is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs. Nearly all class II tRNA synthetases are dimers and enzymes in this subgroup are homodimers. These enzymes attach a specific amino acid to the 3' OH group of ribose of the appropriate tRNA. Probab=100.00 E-value=0 Score=514.11 Aligned_cols=258 Identities=34% Similarity=0.563 Sum_probs=213.8 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCC-CCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCCHHHHEEEHHHH Q ss_conf 2458999999987520365146203321124645-664543222221034431123854564433105702331100110 Q gi|254780419|r 142 IVKRTRIINSMRCRMIAENFIECSTPILTASSPE-GARDFLVPSRVNQGSFYALPQAPQQYKQLLMASGFDRYFQIAPCF 220 (622) Q Consensus 142 l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTpe-GArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G~dryfqiarcF 220 (622) ||+||++++++|+||.++||+||+||+|+++++| ||++|.|+.. ..|..+-|+||||+|+|+||++|++|+|||+||| T Consensus 1 lr~Rs~i~~~iR~ff~~~~f~EV~TP~l~~~~~~~~~~~f~~~~~-~~~~~~~L~~Spel~~k~ll~~g~~~if~i~~~F 79 (269) T cd00669 1 FKVRSKIIKAIRDFMDDRGFLEVETPMLQKITGGAGARPFLVKYN-ALGLDYYLRISPQLFKKRLMVGGLDRVFEINRNF 79 (269) T ss_pred CCHHHHHHHHHHHHHHHCCCEEEECCEEECCCCCCCCCEEEEECC-CCCCEEEECCCHHHHHHHHHHCCCCCEEEEECCC T ss_conf 916999999999999988989987985305789876742573137-8994077343889999999865888679984621 Q ss_pred CCCCCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCC---------CCCCCCCCHHHHHHHCCCCCC Q ss_conf 37776655380000356541026899999999999999999998087234---------577321218988751367776 Q gi|254780419|r 221 RDEDPRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGNTV---------SNNFLRICYDDAIRRYGTDKP 291 (622) Q Consensus 221 RdEd~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i---------~~pF~rmtY~eAm~~YGsDKP 291 (622) |||++.+ ||.+||||||.||+|+|++|+|+++|+|++++++++.+.... ..||+|++|.||+++ T Consensus 80 R~e~~~~-rH~~EFtmlE~y~~~~d~~~~m~~~e~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ea~e~------ 152 (269) T cd00669 80 RNEDLRA-RHQPEFTMMDLEMAFADYEDVIELTERLVRHLAREVLGVTAVTYGFELEDFGLPFPRLTYREALER------ 152 (269) T ss_pred CCCCCCC-CCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHCCCCCCCCCCCCEEEHHHHHHH------ T ss_conf 0789865-543487757875148999999999999999999999655524535544445899878579999998------ Q ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCC Q ss_conf 44476401244310178752456668740378479999627876652103567799987642024623552047775333 Q gi|254780419|r 292 DLRNPIIMHNVTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAG 371 (622) Q Consensus 292 DLR~~lei~dit~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~ 371 (622) T Consensus 153 -------------------------------------------------------------------------------- 152 (269) T cd00669 153 -------------------------------------------------------------------------------- 152 (269) T ss_pred -------------------------------------------------------------------------------- T ss_conf -------------------------------------------------------------------------------- Q ss_pred CCCHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEEECCCCCC Q ss_conf 57025532527899999981887557269980885411001347889998773003557510288960770535455470 Q gi|254780419|r 372 MGPVAKHLNIVNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYEWNEEEKK 451 (622) Q Consensus 372 ~spi~K~l~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe~dee~~r 451 (622) + -.|+||+|||. T Consensus 153 ----------------------------------------------------~-------~~p~fi~dyP~--------- 164 (269) T cd00669 153 ----------------------------------------------------Y-------GQPLFLTDYPA--------- 164 (269) T ss_pred ----------------------------------------------------H-------CCCEEEECCCH--------- T ss_conf ----------------------------------------------------4-------99789989850--------- Q ss_pred CCHHHCCC-CCCCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH Q ss_conf 00211002-47664306554188777413136777756772125123156999999999859998999866778999986 Q gi|254780419|r 452 IDFAHNPF-TMPQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQ 530 (622) Q Consensus 452 ~~s~HHPF-TaP~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~ 530 (622) ..+|| +++.++ +|. .+..|||++||+||+|||+|+||++.|+++|+..++++++....+.|||+|++ T Consensus 165 ---~~~~~~~~~~~~-------~~~--~a~rfdl~~~G~El~nG~~r~~d~~~~~~r~~~~~~~~~~~~~~~~~yl~a~~ 232 (269) T cd00669 165 ---EMHSPLASPHDV-------NPE--IADAFDLFINGVEVGNGSSRLHDPDIQAEVFQEQGINKEAGMEYFEFYLKALE 232 (269) T ss_pred ---HCCCCCCCCCCC-------CCC--HHHHCCCCCCCEEECCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH T ss_conf ---107802090799-------966--76650246786067032520488899999999856074455665899999987 Q ss_pred CCCCCCCCEECCHHHHHHHHCCCCCCEEEECCCCCCC Q ss_conf 0849734222468999999837398403213789998 Q gi|254780419|r 531 CGMPPHGGIAAGIDRIVMLLLGAKNVREVSLFPMSQN 567 (622) Q Consensus 531 yG~PPHgG~alGlDRlvmll~g~~sIRdVIaFPKt~~ 567 (622) ||+|||||+|||+|||+|+++|.+||||||+||++++ T Consensus 233 ~G~pp~~G~glG~dRL~m~~~g~~~Irdv~~FPr~rr 269 (269) T cd00669 233 YGLPPHGGLGIGIDRLIMLMTNSPTIREVIAFPKMRR 269 (269) T ss_pred CCCCCCEEEEEHHHHHHHHHHCCCCHHHCCCCCCCCC T ss_conf 7999971663479999999808996886247899889 No 23 >KOG1885 consensus Probab=100.00 E-value=0 Score=511.44 Aligned_cols=443 Identities=24% Similarity=0.391 Sum_probs=308.5 Q ss_pred CCCCHHHCCHHH---------CCCEEEEEEEECCCCCCC-CEEEEEEECCCCEEEEEECCCC----HHHHH-HHCCCCCC Q ss_conf 742042369799---------898899997980044469-8799999779825999974881----57899-85289885 Q gi|254780419|r 5 RTHSCGDLGVSD---------VGSFVRLSGWVHRVRPHG-GIIFLDIRDHYGITQVVANPDS----ACFEI-AKSVRSEW 69 (622) Q Consensus 5 Rth~cg~l~~~~---------~g~~V~i~GwV~~~R~~g-~l~F~~lrD~sG~~Q~v~~~~~----~~~~~-~~~l~~es 69 (622) -|++..++-+.+ .+.+|+|+|+|+++|-+| |++|+||++....+|++++.+. +.|+. .+.|+.++ T Consensus 82 vs~si~~fieky~~l~~ge~~~n~~~svaGRI~s~R~sGsKL~Fydl~~~g~klQvm~~~~~~~~~~~F~~~~~~lkrGD 161 (560) T KOG1885 82 VSISIPDFIEKYLHLATGEHLDNEIVSVAGRIHSKRESGSKLVFYDLHGDGVKLQVMANAKKITSEEDFEQLHKFLKRGD 161 (560) T ss_pred CCCCHHHHHHHHCCCCCCCCCCCCEEEEEEEEEEEECCCCCEEEEEEECCCEEEEEEEEHHHCCCHHHHHHHHHHHHCCC T ss_conf 00408999998557566565665225551067543015774699999469868999972200478788999986642168 Q ss_pred EEEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCHHHHCCCCCEEE-CCCCCCCCHHHHHHH Q ss_conf 8999999997586555887778659999889999502117998645766789878211232000-121111212458999 Q gi|254780419|r 70 VLCIDGLVMSRSAETINANIITGQIELSAQKIEILSVAEELPLPVFGEREYPESIRFKYRFLDL-RRDTLHKNIVKRTRI 148 (622) Q Consensus 70 vv~v~G~V~~r~~~~~n~~~~tG~~Ei~~~~~~il~~a~~~P~~~~~~~~~~e~~rl~~R~LdL-R~~~~~~~l~~Rs~i 148 (622) +|++.|-+- +.+.||++|.++++.+||+|.. |+|-....-.+.|+|+++||||| -+++++.+|++|++| T Consensus 162 iig~~G~pg---------rt~~gELSi~~~~~~lLspcLh-~lP~~~~gLkD~EtRyrqRylDlilN~~~r~~f~~RakI 231 (560) T KOG1885 162 IIGVSGYPG---------RTKSGELSIIPNEIILLSPCLH-MLPHEHFGLKDKETRYRKRYLDLILNPEVRDRFRIRAKI 231 (560) T ss_pred EEEEECCCC---------CCCCCEEEEEECCHHEECCHHC-CCCHHHCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 776515778---------6878517774120202052313-487665177768889998889987088899999999999 Q ss_pred HHHHHHHHHCCCEEEEECCCCCCCCCCC--CCCCCCCCCCCHHH----HHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCC Q ss_conf 9999875203651462033211246456--64543222221034----43112385456443310570233110011037 Q gi|254780419|r 149 INSMRCRMIAENFIECSTPILTASSPEG--ARDFLVPSRVNQGS----FYALPQAPQQYKQLLMASGFDRYFQIAPCFRD 222 (622) Q Consensus 149 ~~~~R~~l~~~gF~EveTP~L~ksTpeG--ArdFlVPsR~~~g~----fYALpQSPQlyKQlLmv~G~dryfqiarcFRd 222 (622) +..+|.||+++||+|||||+|.. -++| ||+|+ +|++. .| |.-+|+||-.+|+|||+||||+|||.||| T Consensus 232 I~~iRkfld~rgFlEVETPmmn~-iaGGA~AkPFI----T~hndldm~Ly-lRiAPEL~lK~LvVGGldrVYEIGr~FRN 305 (560) T KOG1885 232 ISYIRKFLDSRGFLEVETPMMNM-IAGGATAKPFI----THHNDLDMDLY-LRIAPELYLKMLVVGGLDRVYEIGRQFRN 305 (560) T ss_pred HHHHHHHHHHCCCEEECCHHHCC-CCCCCCCCCEE----ECCCCCCCCEE-EEECHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 99999986544956844465525-46863257604----31455675514-56365999998986017899998787630 Q ss_pred CCCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 77665538000035654102689999999999999999999808723457732121898875136777644476401244 Q gi|254780419|r 223 EDPRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFLRICYDDAIRRYGTDKPDLRNPIIMHNV 302 (622) Q Consensus 223 Ed~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~rmtY~eAm~~YGsDKPDLR~~lei~di 302 (622) |+....+.| |||.+++.|||+|++|+|+++|.|+..+.+++.+...+. |-.+.|+-+ .+++ |+ T Consensus 306 EGIDlTHNP-EFTTcEfY~AYady~dlm~~TE~l~s~mv~~i~G~~~i~--------------y~p~~~~~~-~~el-df 368 (560) T KOG1885 306 EGIDLTHNP-EFTTCEFYMAYADYEDLMDMTEELLSGMVKNITGSYKIT--------------YHPNGPEEP-ELEL-DF 368 (560) T ss_pred CCCCCCCCC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEE--------------ECCCCCCCC-CEEE-EC T ss_conf 576654587-742189999875388899999999999999603743675--------------268898887-4366-23 Q ss_pred CCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCHH Q ss_conf 31017875245666874037847999962787665210356779998764202462355204777533357025532527 Q gi|254780419|r 303 TEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNIV 382 (622) Q Consensus 303 t~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~~ 382 (622) +.-|. .+..+.+..+..|.+ +. + ...+..+ T Consensus 369 ~~pfr--------------------------------ri~mi~~L~k~lgi~------l~-----------~-~~~l~~~ 398 (560) T KOG1885 369 TRPFR--------------------------------RIEMIEELEKELGIK------LP-----------P-GSTLHTE 398 (560) T ss_pred CCCEE--------------------------------EEEHHHHHHHHHCCC------CC-----------C-CCCCCCH T ss_conf 68725--------------------------------450789999995887------89-----------8-3335753 Q ss_pred HHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCC-EEECCCCCCCCHHHCCCCC Q ss_conf 899999981887557269980885411001347889998773003557510288960770-5354554700021100247 Q gi|254780419|r 383 NTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPF-YEWNEEEKKIDFAHNPFTM 461 (622) Q Consensus 383 ~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPL-Fe~dee~~r~~s~HHPFTa 461 (622) +..++...+-...+ ..+..+..+...|.++ ..+ .++++..+|.||.|.|- .+ T Consensus 399 e~~~~L~~~~~~~~----v~~p~p~t~arlLdKL----vg~--flE~~cvnPTFi~~hP~imS----------------- 451 (560) T KOG1885 399 ETRELLKSLCVDEA----VECPPPRTTARLLDKL----VGE--FLEPTCVNPTFIIDHPQIMS----------------- 451 (560) T ss_pred HHHHHHHHHHHHCC----CCCCCCCCHHHHHHHH----HHH--HHCCCCCCCEEECCCCHHCC----------------- T ss_conf 16899999877536----6789831089999998----767--64103579706727813318----------------- Q ss_pred CCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHH------CCCCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 6643065541887774131367777567721251231569999999998------5999899986677899998608497 Q gi|254780419|r 462 PQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSN------VGISKEVVENRFGGLYRAFQCGMPP 535 (622) Q Consensus 462 P~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~------lgl~~ee~~~~Fg~lL~Al~yG~PP 535 (622) |-..+ -.++++ ....|.|.++|.||+-----.+||-.|+++|+. .|-+ |+..--.-|.+||+||+|| T Consensus 452 PLAK~---hrs~~g--lteRFElFi~~kEicNAYtElNdP~~Qr~rFe~Q~~~k~~GDD--Ea~~~De~Fc~ALEYGlPP 524 (560) T KOG1885 452 PLAKY---HRSKAG--LTERFELFIAGKEICNAYTELNDPVDQRQRFEQQARDKDAGDD--EAQMVDEDFCTALEYGLPP 524 (560) T ss_pred CCCCC---CCCCCC--HHHHHHHHHCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCC--CCCCCCHHHHHHHHCCCCC T ss_conf 42224---565400--4567787541088754666606889999999999877644983--1136528788897707999 Q ss_pred CCCEECCHHHHHHHHCCCCCCEEEECCCC Q ss_conf 34222468999999837398403213789 Q gi|254780419|r 536 HGGIAAGIDRIVMLLLGAKNVREVSLFPM 564 (622) Q Consensus 536 HgG~alGlDRlvmll~g~~sIRdVIaFPK 564 (622) +||||+|+|||+|+||++.|||||++||- T Consensus 525 tgGwGmGIDRL~MllTds~~I~EVL~Fp~ 553 (560) T KOG1885 525 TGGWGMGIDRLVMLLTDSNNIREVLLFPA 553 (560) T ss_pred CCCCCCCHHHHHHHHCCCCCHHHEEECCC T ss_conf 77665366465332137765322352465 No 24 >KOG0555 consensus Probab=100.00 E-value=0 Score=459.62 Aligned_cols=402 Identities=24% Similarity=0.363 Sum_probs=317.7 Q ss_pred HHCCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEE Q ss_conf 99898899997980044469879999977982599997488157899852898858999999997586555887778659 Q gi|254780419|r 15 SDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQI 94 (622) Q Consensus 15 ~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~ 94 (622) ++.|+.|.+.||||+.|.+++++|+.|||++|.+|||...+-.....+-.|+.||+|.|.|++.+-|+|.. ..|+. T Consensus 120 ~~r~qrVkv~gWVhrlR~qk~l~FivLrdg~gflqCVl~~kl~~~yd~~~Ls~essv~vYG~i~~~p~GK~----apggh 195 (545) T KOG0555 120 ENRGQRVKVFGWVHRLRRQKSLIFIVLRDGTGFLQCVLSDKLCQSYDALTLSTESSVTVYGTIKKLPEGKS----APGGH 195 (545) T ss_pred CCCCCEEEEEHHHHHHHHCCCEEEEEEECCCCEEEEEECCHHHHHHCCCCCCCCCEEEEEEEEECCCCCCC----CCCCC T ss_conf 34585578403667665147349999966884689997446664320101233424899988741767777----99984 Q ss_pred EEEEEEEEEEECCCCC--CCCCCCCCCCCHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCC Q ss_conf 9998899995021179--98645766789878211232000121111212458999999987520365146203321124 Q gi|254780419|r 95 ELSAQKIEILSVAEEL--PLPVFGEREYPESIRFKYRFLDLRRDTLHKNIVKRTRIINSMRCRMIAENFIECSTPILTAS 172 (622) Q Consensus 95 Ei~~~~~~il~~a~~~--P~~~~~~~~~~e~~rl~~R~LdLR~~~~~~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ks 172 (622) |+.|+-++|++.|-.. -++++... ..+..|++|||-||...++..|+.|+.+.+++|+++.+.|+.||..|+|... T Consensus 196 El~vdy~Eiig~Apag~~~n~lne~s--~~~~~LdnrHl~iRge~~s~vLK~Ra~~lr~~Rd~y~~~~ytEVtPPtmVQT 273 (545) T KOG0555 196 ELNVDYWEIIGLAPAGGFDNPLNEES--DVDVLLDNRHLVIRGENASKVLKARAALLRAMRDHYFERGYTEVTPPTMVQT 273 (545) T ss_pred EEEEEEEEEECCCCCCCCCCCCCCCC--CCCEEECCCEEEEECHHHHHHHHHHHHHHHHHHHHHHHCCCEECCCCCEEEE T ss_conf 68756146635567876566456557--8665850552688410478999999999999998887448424579715788 Q ss_pred CCC-CCCCCCCCCCCCHH-HHHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEECCCCHHHHH Q ss_conf 645-66454322222103-4431123854564433105702331100110377766553800003565410268999999 Q gi|254780419|r 173 SPE-GARDFLVPSRVNQG-SFYALPQAPQQYKQLLMASGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEMSFVEQDDIL 250 (622) Q Consensus 173 Tpe-GArdFlVPsR~~~g-~fYALpQSPQlyKQlLmv~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~EmsF~~~edvm 250 (622) --| |+.-|-. .+-| .+| |.||.|||-.-. +..+.++|.|...||+|.||+.||++|||.++.||+|++.+|.+ T Consensus 274 QVEGGsTLFkl---dYyGEeAy-LTQSSQLYLEtc-lpAlgdvy~I~~SyRAEkSrTRRHLsEytHVEaE~afltfd~ll 348 (545) T KOG0555 274 QVEGGSTLFKL---DYYGEEAY-LTQSSQLYLETC-LPALGDVYCIQQSYRAEKSRTRRHLSEYTHVEAECAFLTFDDLL 348 (545) T ss_pred EECCCCEEEEE---CCCCCHHH-CCCHHHHHHHHH-HHHCCCEEEECHHHHHHHHHHHHHHHHHEEEEEECCCCCHHHHH T ss_conf 75176048731---13574033-022358889876-55207626732756655666666553311102324636488999 Q ss_pred HHHHHHHHHHHHHHHCC-------------CCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 99999999999998087-------------23457732121898875136777644476401244310178752456668 Q gi|254780419|r 251 NTMENVLRGVFEEFSNG-------------NTVSNNFLRICYDDAIRRYGTDKPDLRNPIIMHNVTEHFVHSEFRVFSKI 317 (622) Q Consensus 251 ~l~E~li~~i~~~v~~~-------------~~i~~pF~rmtY~eAm~~YGsDKPDLR~~lei~dit~~~~~~~f~vf~~~ 317 (622) +-+|.|+......++.. ...+.||.||.|.|||..- +-+|+.. T Consensus 349 ~~iE~lvc~~vdr~l~dp~~~li~~lnP~f~~P~~PFkRm~Y~dAI~wL-----------ke~~vk~------------- 404 (545) T KOG0555 349 DRIEALVCDSVDRLLEDPIAPLIKQLNPDFKAPKRPFKRMNYSDAIEWL-----------KEHDVKK------------- 404 (545) T ss_pred HHHHHHHHHHHHHHHHCHHHHHHHHHCCCCCCCCCCHHCCCHHHHHHHH-----------HHCCCCC------------- T ss_conf 9999999999999986701566887498988888812107889999999-----------8639767------------- Q ss_pred HHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCHHHHHHHHHHHCCCCCC Q ss_conf 74037847999962787665210356779998764202462355204777533357025532527899999981887557 Q gi|254780419|r 318 LSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNIVNTEAIRVRLDMQDGD 397 (622) Q Consensus 318 ~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~~~~~~l~~~~~~~~GD 397 (622) ++|..- + |.++..++--.+ T Consensus 405 -----------------------------------------------edg~~f------e-fGdDI~eAaER~------- 423 (545) T KOG0555 405 -----------------------------------------------EDGTDF------E-FGDDIPEAAERK------- 423 (545) T ss_pred -----------------------------------------------CCCCCC------C-CCCCHHHHHHHH------- T ss_conf -----------------------------------------------667602------0-466425678876------- Q ss_pred CEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEEECCCCCCCCHHHCCCCCCCCCHHHHHCCCHHHH Q ss_conf 26998088541100134788999877300355751028896077053545547000211002476643065541887774 Q gi|254780419|r 398 SCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYEWNEEEKKIDFAHNPFTMPQGGMESLKGNDLLSI 477 (622) Q Consensus 398 ~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe~dee~~r~~s~HHPFTaP~~~~~~l~~~dp~~v 477 (622) +.+ ..=.|.+.+.||. -|.+|+|++..+...+ T Consensus 424 ----------------------------mtd-tIg~PIfLtrFpv------------eiKsFYM~rc~dd~~l------- 455 (545) T KOG0555 424 ----------------------------MTD-TIGVPIFLTRFPV------------EIKSFYMKRCEDDPRL------- 455 (545) T ss_pred ----------------------------HHH-HCCCCEEEEECCC------------CCCCEEEECCCCCCCC------- T ss_conf ----------------------------555-2288147863333------------3445133116676554------- Q ss_pred HHHCCCEEECCE-EECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHHHHHHHCCCCCC Q ss_conf 131367777567-7212512315699999999985999899986677899998608497342224689999998373984 Q gi|254780419|r 478 KAFQYDLVCNGF-EIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNV 556 (622) Q Consensus 478 ~a~~YDLVlNG~-EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~alGlDRlvmll~g~~sI 556 (622) ..+-|+.+.|+ ||-|||+||.|.+....-|+.-|+++..+ =||++-.+||.-||||.|||++|++|+||+..+| T Consensus 456 -TESvD~LmPnVGEIvGGSMRi~d~eeLlagfkRegId~~pY----YWy~DqrkyGt~pHGGyGLGlERfL~wL~~r~~v 530 (545) T KOG0555 456 -TESVDVLMPNVGEIVGGSMRIDDSEELLAGFKREGIDPDPY----YWYTDQRKYGTCPHGGYGLGLERFLAWLCDRYHV 530 (545) T ss_pred -CEEEEEECCCCCCCCCCEEEECCHHHHHHHHHHCCCCCCCC----EEEEECCCCCCCCCCCCCCCHHHHHHHHHCCCCH T ss_conf -30011114786100366045165999998886558998870----4774010026689885131499999987222315 Q ss_pred EEEECCCCC Q ss_conf 032137899 Q gi|254780419|r 557 REVSLFPMS 565 (622) Q Consensus 557 RdVIaFPKt 565 (622) |||-+||+- T Consensus 531 re~cLyPRf 539 (545) T KOG0555 531 REVCLYPRF 539 (545) T ss_pred HHEEECCHH T ss_conf 231212313 No 25 >KOG0554 consensus Probab=100.00 E-value=0 Score=461.01 Aligned_cols=400 Identities=25% Similarity=0.417 Sum_probs=304.1 Q ss_pred CCHHHCCCEEEEEEEECCCCCCCCEEEEEEECCCC--EEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCC Q ss_conf 69799898899997980044469879999977982--5999974881578998528988589999999975865558877 Q gi|254780419|r 12 LGVSDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYG--ITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANI 89 (622) Q Consensus 12 l~~~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG--~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~ 89 (622) +.....|+.+.|.|||.++|..|+++|+++.|+|. .+|||+++ +..+.+..+++|.++|.+..- .+. T Consensus 14 ~~~~~~g~t~~i~GWvKsvr~~~~~~Fl~i~DGs~~~~lQvVv~~-----~~~q~la~Gt~i~~~g~l~~~-~~~----- 82 (446) T KOG0554 14 LGHPRAGDTISIGGWVKSVRKLKKVTFLDINDGSCPSPLQVVVDS-----EQSQLLATGTCISAEGVLKVS-KGA----- 82 (446) T ss_pred CCCCCCCCCEEECCHHHHCCCCCCEEEEEECCCCCCCCEEEEECH-----HHHHHCCCCCEEEEEEEEEEC-CCH----- T ss_conf 357889980463324541314554289995589887664999562-----784020565269997568734-650----- Q ss_pred CCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCC Q ss_conf 78659999889999502117998645766789878211232000121111212458999999987520365146203321 Q gi|254780419|r 90 ITGQIELSAQKIEILSVAEELPLPVFGEREYPESIRFKYRFLDLRRDTLHKNIVKRTRIINSMRCRMIAENFIECSTPIL 169 (622) Q Consensus 90 ~tG~~Ei~~~~~~il~~a~~~P~~~~~~~~~~e~~rl~~R~LdLR~~~~~~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L 169 (622) .-++|+.++++.+.+..++. +|+.++..++|.+| +.-||..|+..+.+.+|+||.+..++|.||.+++|++|.|||| T Consensus 83 -~q~iel~~eki~~vG~v~~~-ypl~Kk~lt~e~LR-~~~HLR~Rt~~~~av~RvRs~~~~a~h~ffq~~~F~~i~tPii 159 (446) T KOG0554 83 -KQQIELNAEKIKVVGTVDES-YPLQKKKLTPEMLR-DKLHLRSRTAKVGAVLRVRSALAFATHSFFQSHDFTYINTPII 159 (446) T ss_pred -HEEEEEEEEEEEEEEECCCC-CCCCCCCCCHHHHH-HCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEECCCEE T ss_conf -10110200578998534899-87742338988985-3233210025789999999899999999998759467338676 Q ss_pred CCCCCCCCCC-CCCCCCC-CHHHHHH----CCCCHHHHHHHHHCCCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEECC Q ss_conf 1246456645-4322222-1034431----12385456443310570233110011037776655380000356541026 Q gi|254780419|r 170 TASSPEGARD-FLVPSRV-NQGSFYA----LPQAPQQYKQLLMASGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEMSF 243 (622) Q Consensus 170 ~ksTpeGArd-FlVPsR~-~~g~fYA----LpQSPQlyKQlLmv~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~EmsF 243 (622) |.++||||.| |-|-|-+ ..+.||. |.-|-||--- .|++++.|+|.++||||+|++++.||+.|||||+.|||| T Consensus 160 TtnDCEGaGE~F~Vtt~~d~~~~fFgrp~fLTVSgQLhlE-a~a~aLsrvyt~gP~FRAE~s~t~rHLaEFwmlE~E~af 238 (446) T KOG0554 160 TTNDCEGAGEVFQVTTLTDYSKDFFGRPAFLTVSGQLHLE-AMACALSRVYTFGPTFRAENSHTSRHLAEFWMLEAELAF 238 (446) T ss_pred ECCCCCCCCCEEEEEECCCCCCCCCCCCEEEEEECEEHHH-HHHHHHCCEEEECCCEECCCCCCHHHHHHHHHHHHHHHH T ss_conf 3457778740589874686543314885079971321599-887552323761661322678754677656524278878 Q ss_pred CC-HHHHHHHHHHHHHHHHHHHHCCCCC---------------------CCCCCCCCHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 89-9999999999999999998087234---------------------5773212189887513677764447640124 Q gi|254780419|r 244 VE-QDDILNTMENVLRGVFEEFSNGNTV---------------------SNNFLRICYDDAIRRYGTDKPDLRNPIIMHN 301 (622) Q Consensus 244 ~~-~edvm~l~E~li~~i~~~v~~~~~i---------------------~~pF~rmtY~eAm~~YGsDKPDLR~~lei~d 301 (622) ++ .+|.|..+|.+++++.+-++++..- ..+|.||||.||++. T Consensus 239 ~~sl~d~~~~~e~~~k~m~k~~le~~~Edm~l~~~~~~~~~~~rl~~~~~~~~~~itYteAie~---------------- 302 (446) T KOG0554 239 AESLDDLMSCAEAYIKHMIKYLLEKCIEDMELMHKNEDPGSIDRLELVAKENLLRITYTEAIEL---------------- 302 (446) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCHHEEECCCCCCCCCCHHHHHHHHHHHHEEHHHHHHH---------------- T ss_conf 8888888899999999999999860511010100114777531577720102332028999999---------------- Q ss_pred CCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCH Q ss_conf 43101787524566687403784799996278766521035677999876420246235520477753335702553252 Q gi|254780419|r 302 VTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNI 381 (622) Q Consensus 302 it~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~ 381 (622) .+....+ .++. ..+|-- .|+. T Consensus 303 ----------------------------------------------L~~a~t~-----~fk~----~~kwG~----~lst 323 (446) T KOG0554 303 ----------------------------------------------LQKAVTK-----KFKT----PPKWGI----DLST 323 (446) T ss_pred ----------------------------------------------HHHHCCC-----CCCC----CCCCCC----CCCH T ss_conf ----------------------------------------------9984024-----4356----720154----4004 Q ss_pred HHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEEECCCCCCCCHHHCCCCC Q ss_conf 78999999818875572699808854110013478899987730035575102889607705354554700021100247 Q gi|254780419|r 382 VNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYEWNEEEKKIDFAHNPFTM 461 (622) Q Consensus 382 ~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe~dee~~r~~s~HHPFTa 461 (622) +. ...+.+++ .-+|++|+||| +.++ ||+| T Consensus 324 eh---------------------------------e~yL~~~~------~~~PVfV~dYP--------~~iK----pFYM 352 (446) T KOG0554 324 EH---------------------------------EKYLVEEC------FKKPVFVTDYP--------KGIK----PFYM 352 (446) T ss_pred HH---------------------------------HHHHHHHH------CCCCEEEEECC--------CCCC----CEEE T ss_conf 56---------------------------------87899985------68977997064--------4466----4488 Q ss_pred CCCCHHHHHCCCHHHHHHHCCCEEECCE-EECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEE Q ss_conf 6643065541887774131367777567-721251231569999999998599989998667789999860849734222 Q gi|254780419|r 462 PQGGMESLKGNDLLSIKAFQYDLVCNGF-EIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMPPHGGIA 540 (622) Q Consensus 462 P~~~~~~l~~~dp~~v~a~~YDLVlNG~-EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~a 540 (622) -..++. ..| -++||++.|+ ||.|||+|..+ ..+++..|+..++ +.|||+..+||.+|||||| T Consensus 353 r~n~~~-------~tV--aa~DlLVP~vGELiGGSlREe~----~~~l~e~g~~~~~----~eWYldLRryG~vphgGFG 415 (446) T KOG0554 353 RLNDDG-------KTV--AAFDLLVPGVGELIGGSLREER----KARLKERGLTREE----LEWYLDLRRYGSVPHGGFG 415 (446) T ss_pred EECCCC-------CEE--EEEEEECCCCHHHCCCCCCHHH----HHHHHHCCCCCCC----CCEEHHHHHCCCCCCCCCC T ss_conf 855999-------726--8998613663112276531235----8999865999512----1000115543899987314 Q ss_pred CCHHHHHHHHCCCCCCEEEECCCCCCCCC Q ss_conf 46899999983739840321378999871 Q gi|254780419|r 541 AGIDRIVMLLLGAKNVREVSLFPMSQNFC 569 (622) Q Consensus 541 lGlDRlvmll~g~~sIRdVIaFPKt~~g~ 569 (622) |||+|++.+++|.+||||||+||.+..-. T Consensus 416 lGfERmlq~~tG~~nIkd~IPFpR~~~s~ 444 (446) T KOG0554 416 LGFERMLQYLTGNDNIKDVIPFPRYPGSA 444 (446) T ss_pred CCHHHHHHHHHCCCCHHHCEECCCCCCCC T ss_conf 05999999985776542200057886513 No 26 >TIGR00499 lysS_bact lysyl-tRNA synthetase; InterPro: IPR002313 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Lysyl-tRNA synthetase (6.1.1.6 from EC) is an alpha 2 homodimer that belong to both class I and class II. In eubacteria and eukaryota lysyl-tRNA synthetases belong to class II in the same family as aspartyl tRNA synthetase. The class Ic lysyl-tRNA synthetase family is present in archaea and some eubacteria . Moreover in some eubacteria there is a gene X, which is similar to a part of lysyl-tRNA synthetase from class II. Lysyl-tRNA synthetase is duplicated in some species with, for example in E. coli, as a constitutive gene (lysS) and an induced one (lysU). No residues are directly involved in catalysis, but a number of highly conserved amino acids and three metal ions coordinate the substrates and stabilise the pentavalent transition state. Lysine is activated by being attached to the alpha-phosphate of AMP before being transferred to the cognate tRNA. The refined crystal structures give "snapshots" of the active site corresponding to key steps in the aminoacylation reaction and provide the structural framework for understanding the mechanism of lysine activation. The active site of LysU is shaped to position the substrates for the nucleophilic attack of the lysine carboxylate on the ATP alpha-phosphate. No residues are directly involved in catalysis, but a number of highly conserved amino acids and three metal ions coordinate the substrates and stabilise the pentavalent transition state. A loop close to the catalytic pocket, disordered in the lysine-bound structure, becomes ordered upon adenine binding .; GO: 0000166 nucleotide binding, 0004824 lysine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006430 lysyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=100.00 E-value=1.4e-45 Score=345.08 Aligned_cols=446 Identities=23% Similarity=0.402 Sum_probs=303.4 Q ss_pred CHHHC-CCEEEEEEEECCCC-CCCCEEEEEEECCC-----CEEEEEECCCCH------HHHHHHCC---CCCCEEEEEEE Q ss_conf 97998-98899997980044-46987999997798-----259999748815------78998528---98858999999 Q gi|254780419|r 13 GVSDV-GSFVRLSGWVHRVR-PHGGIIFLDIRDHY-----GITQVVANPDSA------CFEIAKSV---RSEWVLCIDGL 76 (622) Q Consensus 13 ~~~~~-g~~V~i~GwV~~~R-~~g~l~F~~lrD~s-----G~~Q~v~~~~~~------~~~~~~~l---~~esvv~v~G~ 76 (622) ..+.. +.+|.++|++..+| ..|+..|+.+.|.+ |.+|+++..+.. .......+ ..++.|.+.|. T Consensus 52 ~~~~~p~~~~~~~g~~~~~~~~~~~~~f~~~~~~~g~~~d~~~~~~~~~~~~gg~~p~~~~~~~~~~~~~~gd~~~~~g~ 131 (538) T TIGR00499 52 ELEEKPELEVSIAGRILALRFSGGKATFITLQDESGLKNDGQLQLYVNKDDLGGPEPDFYEFDEYLNFLDLGDIIGVTGY 131 (538) T ss_pred CCCCCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHCCHHHCCCEEEECCC T ss_conf 21114320001222466555412322578862566544563378887301147876430233320120102541232034 Q ss_pred EEECCCCCCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCHHHHCCCCCEEE-CCCCCCCCHHHHHHHHHHHHHH Q ss_conf 997586555887778659999889999502117998645766789878211232000-1211112124589999999875 Q gi|254780419|r 77 VMSRSAETINANIITGQIELSAQKIEILSVAEELPLPVFGEREYPESIRFKYRFLDL-RRDTLHKNIVKRTRIINSMRCR 155 (622) Q Consensus 77 V~~r~~~~~n~~~~tG~~Ei~~~~~~il~~a~~~P~~~~~~~~~~e~~rl~~R~LdL-R~~~~~~~l~~Rs~i~~~~R~~ 155 (622) +-. ....+|++.+.++++.++.++.. |.|...+.-.+.++|++.||||| -++..+..+..|++++.++|.| T Consensus 132 ~g~-------~~~~~~~l~~~~~~~~~~~~~~~-~~p~~~~gl~d~e~~~~~~~~d~~~n~~~~~~~~~~~~~~~~~~~~ 203 (538) T TIGR00499 132 PGP-------FKTKTGELSLHVTELQLLTKALR-PLPDKWHGLTDQETRYRQRYLDLIVNPDSRDTFLKRSKIIAAIRRF 203 (538) T ss_pred CCC-------CCCCCCEEEEEEHHHHHHHHHHH-CCCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH T ss_conf 453-------22444114554013456677642-2651004665156777765543310502466778899999999998 Q ss_pred HHCCCEEEEECCCCCCCCCCC--CCCCCCCCCCCHH----HHHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCCCCCCCCC Q ss_conf 203651462033211246456--6454322222103----4431123854564433105702331100110377766553 Q gi|254780419|r 156 MIAENFIECSTPILTASSPEG--ARDFLVPSRVNQG----SFYALPQAPQQYKQLLMASGFDRYFQIAPCFRDEDPRSDR 229 (622) Q Consensus 156 l~~~gF~EveTP~L~ksTpeG--ArdFlVPsR~~~g----~fYALpQSPQlyKQlLmv~G~dryfqiarcFRdEd~R~dR 229 (622) ++++||+|||||++. +.|+| ||+|+. |++ .+| |.-+|++|-+.|++||++|+|+++|.||+|+..+.+ T Consensus 204 ~~~~g~~e~e~p~~~-~~~gg~~~~pf~~----~~n~~~~~~~-~~~~~e~~l~~l~~gg~~~~~e~g~~f~neg~~~~h 277 (538) T TIGR00499 204 LDDRGFLEVETPLLQ-VIPGGANARPFIT----HHNALDLDLY-LRIAPELYLKRLVVGGLEKVYEIGRNFRNEGVDTTH 277 (538) T ss_pred HHHCCCEEECCCHHH-CCCCCCCCCCCEE----CCCCCCCCHH-HHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCC T ss_conf 762484221131010-1267654465010----0232244212-112257788777640136778764333104665445 Q ss_pred CCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCC-----CCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCC Q ss_conf 80000356541026899999999999999999998087234-----5773212189887513677764447640124431 Q gi|254780419|r 230 LPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGNTV-----SNNFLRICYDDAIRRYGTDKPDLRNPIIMHNVTE 304 (622) Q Consensus 230 qp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i-----~~pF~rmtY~eAm~~YGsDKPDLR~~lei~dit~ 304 (622) .| |||.+++..+|++++|+|.+.|.++..+.+.+.+...+ +..|+..+|.+.- -|+..+++.....+ T Consensus 278 np-ef~~~e~y~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~p~p~~~~~~~~~~~~-------~~~~~~~~~~~~~~ 349 (538) T TIGR00499 278 NP-EFTSLEFYQAYADYEDLLDLTENLFKFLAKELLGTTKITRLPVPKTFPELTYGDLE-------LDLKKPWKRITMLD 349 (538) T ss_pred CC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCCCCCHHHCCCCC-------CCCCCCCHHHHHHH T ss_conf 75-30245567765108899998999999999875321100112566543112203210-------02355400224567 Q ss_pred CCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCHHHH Q ss_conf 01787524566687403784799996278766521035677999876420246235520477753335702553252789 Q gi|254780419|r 305 HFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNIVNT 384 (622) Q Consensus 305 ~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~~~~ 384 (622) .+..+.. ...|+..- .....+.. T Consensus 350 --------------------------------------~~~~~~~--~~~g~~~~-----------------~~~~~~~~ 372 (538) T TIGR00499 350 --------------------------------------ALKKYDL--LETGIDFD-----------------DLKDDETA 372 (538) T ss_pred --------------------------------------HHHHHHH--HCCCCCCH-----------------HCCHHHHH T ss_conf --------------------------------------7765200--00465400-----------------10025788 Q ss_pred HHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEE--EEEEEECCCEEECCCCCCCCHHHCCCCCC Q ss_conf 9999981887557269980885411001347889998773003557510--28896077053545547000211002476 Q gi|254780419|r 385 EAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFE--FCWIMDFPFYEWNEEEKKIDFAHNPFTMP 462 (622) Q Consensus 385 ~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~--flWV~DFPLFe~dee~~r~~s~HHPFTaP 462 (622) ..+....+.+ .|-.+... .....+.+-.++... .++..... +.+|+++|. +-. |...+ T Consensus 373 ~~~~~~~~~~----~~~~~~~~--~~~~~~~~~~~~~~~--~~e~~~~~~~p~~~~~~p~-----~~~-------pl~~~ 432 (538) T TIGR00499 373 KALAKELGIK----YFEVAEDS--GPLTLGHILNELFEE--FLEHTLIQTLPTFITGYPA-----EIS-------PLAKR 432 (538) T ss_pred HHHHHHHHHH----HHCCCCCC--CCCHHHHHHHHHHHH--HHHHHHHHHHHHHHHCCCH-----HHC-------CHHCC T ss_conf 8888875433----20223457--742068899998888--7755666532143404641-----222-------10005 Q ss_pred CCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHH------HCC--CCHHHHHHHHHHHHHHHHCCCC Q ss_conf 64306554188777413136777756772125123156999999999------859--9989998667789999860849 Q gi|254780419|r 463 QGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFS------NVG--ISKEVVENRFGGLYRAFQCGMP 534 (622) Q Consensus 463 ~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~------~lg--l~~ee~~~~Fg~lL~Al~yG~P 534 (622) ...+|. ....|++.++|-|++.|-...++|..|..+|+ ..| ..++++...-.-|++|++||+| T Consensus 433 -------~~~~p~--~~~~fe~f~~g~e~~n~~~e~~~p~~~~~~~~~~~~~~~~gg~~~~~~~~~~d~~~~~~l~~g~p 503 (538) T TIGR00499 433 -------NPSNPE--LTDRFELFIGGKEIANAFSELNDPLDQRERFEAQLAEKEAGGNKTDDEAPLLDEDFVEALEYGLP 503 (538) T ss_pred -------CCCCCC--HHHHHHHEECCHHHHCCHHHHCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCC T ss_conf -------777753--01111100101122010122046278899999988887506775410344433678888871688 Q ss_pred CCCCEECCHHHHHHHHCCCCCCEEEECCCCCC Q ss_conf 73422246899999983739840321378999 Q gi|254780419|r 535 PHGGIAAGIDRIVMLLLGAKNVREVSLFPMSQ 566 (622) Q Consensus 535 PHgG~alGlDRlvmll~g~~sIRdVIaFPKt~ 566 (622) |.||+|+|+||++|++|+..+||||++||.-. T Consensus 504 p~~g~g~g~d~l~~l~~~~~~~~~~~~fp~~~ 535 (538) T TIGR00499 504 PTGGLGIGIDRLVMLLTDSPSIRDVLLFPALR 535 (538) T ss_pred CCCCCCCCHHHHHHHHHCCCCHHHHHHCCCCC T ss_conf 65654432777877661675101332025313 No 27 >cd04317 EcAspRS_like_N EcAspRS_like_N: N-terminal, anticodon recognition domain of the type found in Escherichia coli aspartyl-tRNA synthetase (AspRS), the human mitochondrial (mt) AspRS-2, the discriminating (D) Thermus thermophilus AspRS-1, and the nondiscriminating (ND) Helicobacter pylori AspRS. These homodimeric enzymes are class2b aminoacyl-tRNA synthetases (aaRSs). This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Eukaryotes contain 2 sets of aaRSs, both of which are encoded by the nuclear genome. One set concerns with cytoplasmic synthesis, wh Probab=100.00 E-value=4.9e-38 Score=291.62 Aligned_cols=134 Identities=46% Similarity=0.835 Sum_probs=128.5 Q ss_pred CCCCHHHCCHHHCCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCC-CHHHHHHHCCCCCCEEEEEEEEEECCCC Q ss_conf 742042369799898899997980044469879999977982599997488-1578998528988589999999975865 Q gi|254780419|r 5 RTHSCGDLGVSDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPD-SACFEIAKSVRSEWVLCIDGLVMSRSAE 83 (622) Q Consensus 5 Rth~cg~l~~~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~-~~~~~~~~~l~~esvv~v~G~V~~r~~~ 83 (622) |||+||||+++++|++|+|+|||+++|++|+++|++|||++|++|||++++ ...++.+++|+.||+|.|+|+|++|+++ T Consensus 1 RTh~~~el~~~~~G~~V~v~GwV~~~R~~gki~Fi~LRD~tg~iQ~v~~~~~~~~~~~~~~l~~EsvV~V~G~V~~r~~~ 80 (135) T cd04317 1 RTHYCGELRESHVGQEVTLCGWVQRRRDHGGLIFIDLRDRYGIVQVVFDPEEAPEFELAEKLRNESVIQVTGKVRARPEG 80 (135) T ss_pred CCEEHHHCCHHHCCCEEEEEEEEEEEEECCCEEEEEEECCCCEEEEEECCCCCHHHHHHHHCCCCCEEEEEEEEEECCCC T ss_conf 96354218958899999999999999717979999985288358899957651678999718976799999999967976 Q ss_pred CCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCCCCCCHHHHCCCCCEEECCCCC Q ss_conf 5588777865999988999950211799864576678987821123200012111 Q gi|254780419|r 84 TINANIITGQIELSAQKIEILSVAEELPLPVFGEREYPESIRFKYRFLDLRRDTL 138 (622) Q Consensus 84 ~~n~~~~tG~~Ei~~~~~~il~~a~~~P~~~~~~~~~~e~~rl~~R~LdLR~~~~ 138 (622) ..|++++||++||.|++++|||+|.++||++.++..+++|+||+|||||||+|+| T Consensus 81 ~~n~~~~tGeiEi~v~~i~ils~a~~lP~~~~~~~~~~~e~Rl~~R~LDLR~~~~ 135 (135) T cd04317 81 TVNPKLPTGEIEVVASELEVLNKAKTLPFEIDDDVNVSEELRLKYRYLDLRRPKM 135 (135) T ss_pred CCCCCCCCCCEEEEEEEEEEEECCCCCCCCCCCCCCCCHHHHHCCEEEEECCCCC T ss_conf 6056656897799997999977288899886556778956620243356258889 No 28 >COG2269 Truncated, possibly inactive, lysyl-tRNA synthetase (class II) [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=1e-34 Score=267.87 Aligned_cols=291 Identities=25% Similarity=0.390 Sum_probs=200.8 Q ss_pred CCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCC------CCCCCCCCCCCHHHHHHCCCCHHHH-HHHHHCCCHHH Q ss_conf 2124589999999875203651462033211246456------6454322222103443112385456-44331057023 Q gi|254780419|r 140 KNIVKRTRIINSMRCRMIAENFIECSTPILTASSPEG------ARDFLVPSRVNQGSFYALPQAPQQY-KQLLMASGFDR 212 (622) Q Consensus 140 ~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTpeG------ArdFlVPsR~~~g~fYALpQSPQly-KQlLmv~G~dr 212 (622) .++..|++++.+||.||.++||+|||||.|..++--- +.+|+-|.-...-.-| |..||+.. |.|| ++|-.+ T Consensus 14 ~~ll~Ra~i~~~iR~FF~erg~lEVeTp~Ls~a~vtd~hL~~F~Te~~~~~~~~~~~l~-L~TSPEy~mKrLL-Aag~~~ 91 (322) T COG2269 14 DNLLKRAAIIAAIRRFFAERGVLEVETPALSVAPVTDIHLHPFETEFLGPGGAKGKPLW-LHTSPEYHMKRLL-AAGSGP 91 (322) T ss_pred HHHHHHHHHHHHHHHHHHHCCCEEECCHHHHCCCCCCCCEEEEEEEEECCCCCCCCEEE-EECCCHHHHHHHH-HCCCCC T ss_conf 99998999999999999876925853367650789864234456676546766653036-6568077789999-754886 Q ss_pred HEEEHHHHCCCCCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHC-CCCCC Q ss_conf 31100110377766553800003565410268999999999999999999980872345773212189887513-67776 Q gi|254780419|r 213 YFQIAPCFRDEDPRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFLRICYDDAIRRY-GTDKP 291 (622) Q Consensus 213 yfqiarcFRdEd~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~rmtY~eAm~~Y-GsDKP 291 (622) .|||++|||||. +..+|-+|||||+-.-.+.|+.-.|+.+.+|+..+.... ++.++||+||..+| |.|-+ T Consensus 92 ifql~kvfRN~E-~G~~H~PEFTMLEWYrv~~d~~~lm~e~~~Ll~~vl~~~--------~~E~ls~~eaF~r~~gid~l 162 (322) T COG2269 92 IFQLGKVFRNEE-MGRLHNPEFTMLEWYRVGCDYYRLMNEVDDLLQLVLECV--------EAERLSYQEAFLRYLGIDPL 162 (322) T ss_pred CHHHHHHHHCCC-CCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHCC--------CCCEEEHHHHHHHHHCCCCC T ss_conf 014547772633-256679843676510067859999999999999997037--------85355599999998388945 Q ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCC Q ss_conf 44476401244310178752456668740378479999627876652103567799987642024623552047775333 Q gi|254780419|r 292 DLRNPIIMHNVTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAG 371 (622) Q Consensus 292 DLR~~lei~dit~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~ 371 (622) - .+ .+.|.+.++.. |+. . . T Consensus 163 ~----------------~~------------------------------~~~L~~~~~~~---~l~---~--~------- 181 (322) T COG2269 163 S----------------AD------------------------------KTELREAAAKL---GLS---A--A------- 181 (322) T ss_pred C----------------CC------------------------------HHHHHHHHHHC---CCC---C--C------- T ss_conf 2----------------42------------------------------99999999855---877---7--7------- Q ss_pred CCCHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCC--CCEEEEEEEECCCEEECCCC Q ss_conf 5702553252789999998188755726998088541100134788999877300355--75102889607705354554 Q gi|254780419|r 372 MGPVAKHLNIVNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINC--NCFEFCWIMDFPFYEWNEEE 449 (622) Q Consensus 372 ~spi~K~l~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~--~~~~flWV~DFPLFe~dee~ 449 (622) +++....|.+.+- .+ .+++ +.-++++|.|||- + T Consensus 182 --------~~~~~d~L~~~lf-------------------------~~------~VEP~lg~~rpt~ly~fP~------~ 216 (322) T COG2269 182 --------TDEDWDTLLQLLF-------------------------VE------GVEPNLGKERPTFLYHFPA------S 216 (322) T ss_pred --------CCCCHHHHHHHHH-------------------------HH------HCCCCCCCCCCEEEEECCH------H T ss_conf --------7558999999988-------------------------86------4075668888448872727------7 Q ss_pred CCCCHHHCCCCCCCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHH-------CCCCHHHHHHHH Q ss_conf 7000211002476643065541887774131367777567721251231569999999998-------599989998667 Q gi|254780419|r 450 KKIDFAHNPFTMPQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSN-------VGISKEVVENRF 522 (622) Q Consensus 450 ~r~~s~HHPFTaP~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~-------lgl~~ee~~~~F 522 (622) . .| --.++++ || ..|..|.|.+.|+||+-|---..|+..|+++|+. .|+..-.+.+ T Consensus 217 q--aa----LA~i~~~-------D~--rVAERFElY~kGiELaNgf~EltDa~EqrrRfe~dn~~r~~~~l~~~piDe-- 279 (322) T COG2269 217 Q--AA----LAQISTG-------DP--RVAERFELYYKGIELANGFHELTDAAEQRRRFEQDNKERARRGLPQYPIDE-- 279 (322) T ss_pred H--HH----HHCCCCC-------CC--CHHHHHHHEEEEEEECCCCHHCCCHHHHHHHHHHHHHHHHHCCCCCCCCCH-- T ss_conf 8--87----6046778-------96--256664320100221024000178899999999988999863788777887-- Q ss_pred HHHHHHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEEECCCCCC Q ss_conf 78999986084973422246899999983739840321378999 Q gi|254780419|r 523 GGLYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREVSLFPMSQ 566 (622) Q Consensus 523 g~lL~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdVIaFPKt~ 566 (622) .||.||+. |||.+|+|||+|||||+++|.++|-|||+||-.. T Consensus 280 -~fl~Al~~-mP~cSGvALG~DRLvmLalg~~~i~~Vi~f~v~~ 321 (322) T COG2269 280 -DFLAALAR-MPPCSGVALGFDRLVMLALGAESIDDVIAFPVAR 321 (322) T ss_pred -HHHHHHHH-CCCCCCCEECHHHHHHHHCCCCHHHHHHHCCCCC T ss_conf -89999985-8986541323889999981821487776131146 No 29 >TIGR00462 genX lysyl-tRNA synthetase homolog GenX; InterPro: IPR004525 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . This entry represents lysyl-tRNA synthetases from bacteria, as well as other related proteins. Escherichia coli, Haemophilus influenzae, and Aquifex aeolicus each have a protein closely homologous to the C-terminal region of lysyl-tRNA synthetase (LysS). Multiple sequence alignment of these proteins with the homologous regions of collected LysS proteins shows that these proteins form a distinct set rather than just similar truncations of LysS, so they appear to be orthologous. The protein is termed PoxA or GenX after its designation in E. coli. Its function is unknown.; GO: 0000166 nucleotide binding, 0004824 lysine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006430 lysyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=99.96 E-value=2e-28 Score=222.88 Aligned_cols=293 Identities=25% Similarity=0.388 Sum_probs=193.6 Q ss_pred CCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCC------CCCCCCCCCCCHHHHHHCCCCHHHH-HHHHHCCCHHH Q ss_conf 2124589999999875203651462033211246456------6454322222103443112385456-44331057023 Q gi|254780419|r 140 KNIVKRTRIINSMRCRMIAENFIECSTPILTASSPEG------ARDFLVPSRVNQGSFYALPQAPQQY-KQLLMASGFDR 212 (622) Q Consensus 140 ~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTpeG------ArdFlVPsR~~~g~fYALpQSPQly-KQlLmv~G~dr 212 (622) .|+..|++++..||+||.++|++||+||+|...+--- ..+||-|-. ..++--=|..||+-- |+|| ++|-.- T Consensus 19 ~n~LkRa~~i~~iR~FF~e~G~~EV~TP~l~~~~~~D~~l~p~~~~~l~~g~-~~~~~~WL~TSPEY~MK~LL-~~~~~~ 96 (330) T TIGR00462 19 KNLLKRAKIIAEIRKFFKERGLLEVETPLLSDFPVTDLHLEPFKTEFLEPGG-EQNKVLWLSTSPEYHMKRLL-SAGKGP 96 (330) T ss_pred HHHHHHHHHHHHHHHHHHHCCCEEECCCCHHCCCCCCEEEEEEEEEEECCCC-CCCCEECCCCCCCHHHHHHH-HHCCCC T ss_conf 6778899999999998861784253263010156443022136888842665-33331211458727899998-621786 Q ss_pred HEEEHHHHCCCCCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHC-CCCCC Q ss_conf 31100110377766553800003565410268999999999999999999980872345773212189887513-67776 Q gi|254780419|r 213 YFQIAPCFRDEDPRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFLRICYDDAIRRY-GTDKP 291 (622) Q Consensus 213 yfqiarcFRdEd~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~rmtY~eAm~~Y-GsDKP 291 (622) +|||.|+||||..-.-+.| |||||+=...-.++.-+|+=+.+|+..+ ++. .+|..+||++|..+| |.|-| T Consensus 97 iFQi~k~FRN~E~G~~H~~-EF~MLEWY~~~~d~~~Li~E~~~LL~~~----ld~----~~~E~l~~~~~F~~y~g~d~l 167 (330) T TIGR00462 97 IFQITKVFRNEEAGRLHNP-EFTMLEWYRPHYDMLRLINEVDDLLQQL----LDC----PEAESLSYQEAFKRYLGLDPL 167 (330) T ss_pred EEEEEEHHHHCCCCCCCCC-CCCHHHHHHHHHHHHHHHHHHHHHHHHH----HCC----CCCCEECHHHHHHHHHCCCCC T ss_conf 2310000100024675775-3100111211023899999999999997----288----630000189999998447987 Q ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCC Q ss_conf 44476401244310178752456668740378479999627876652103567799987642024623552047775333 Q gi|254780419|r 292 DLRNPIIMHNVTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAG 371 (622) Q Consensus 292 DLR~~lei~dit~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~ 371 (622) . .....+.|.+.+.+-..+ .. T Consensus 168 ~----------------------------------------------~e~s~l~E~~~k~~~~~~----~~--------- 188 (330) T TIGR00462 168 S----------------------------------------------AEKSELKELAEKKDLDNV----AE--------- 188 (330) T ss_pred C----------------------------------------------HHHHHHHHHHHHCCCHHH----HC--------- T ss_conf 3----------------------------------------------025789999985473233----20--------- Q ss_pred CCCHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEEECCCCCC Q ss_conf 57025532527899999981887557269980885411001347889998773003557510288960770535455470 Q gi|254780419|r 372 MGPVAKHLNIVNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYEWNEEEKK 451 (622) Q Consensus 372 ~spi~K~l~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe~dee~~r 451 (622) .++..+.|...+- +. .|. .. -+.-+|++|++|| .+ T Consensus 189 --------~~ED~~TLl~~~F---------~~----~VE-------~~---------~G~~rP~~~~~FP--------~~ 223 (330) T TIGR00462 189 --------EDEDRDTLLQLLF---------IE----VVE-------PQ---------IGKERPTFVYHFP--------ER 223 (330) T ss_pred --------CCCCHHHHHHHHH---------HH----HHH-------HC---------CCCCCCEEEEECC--------HH T ss_conf --------3444799999999---------87----640-------10---------3777875788070--------45 Q ss_pred CCHHHCCCCCCCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHH-------HCCCCHHHHHHHHHH Q ss_conf 0021100247664306554188777413136777756772125123156999999999-------859998999866778 Q gi|254780419|r 452 IDFAHNPFTMPQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFS-------NVGISKEVVENRFGG 524 (622) Q Consensus 452 ~~s~HHPFTaP~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~-------~lgl~~ee~~~~Fg~ 524 (622) ..| ....-..|- -.|..|.+.+.|+||+-|=.-.-|+..|+++|+ ..++...+++++| T Consensus 224 ~aa-----------LA~v~~eD~--~~AERFE~y~KGiELANGf~E~t~a~E~r~R~E~~~~kR~~~~~P~~~~D~~~-- 288 (330) T TIGR00462 224 QAA-----------LAQVSTEDN--RVAERFELYYKGIELANGFHELTDAEEQRKRFERDNKKRAKRELPTLPIDEDF-- 288 (330) T ss_pred HHH-----------HHCCCCCCC--CCCHHEEHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHH-- T ss_conf 555-----------520451336--32001001001000027753225848899887688777886258985145999-- Q ss_pred HHHHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEEECCC Q ss_conf 999986084973422246899999983739840321378 Q gi|254780419|r 525 LYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREVSLFP 563 (622) Q Consensus 525 lL~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdVIaFP 563 (622) |+|++.|||-.+|+|||+|||+||.+|.+.|-+||+|- T Consensus 289 -i~A~~~GMP~csGvaLGiDRL~~l~Lg~E~l~~~~~~a 326 (330) T TIGR00462 289 -IKALEDGMPDCSGVALGIDRLVMLALGKEELVEVIAFA 326 (330) T ss_pred -HHHHHCCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf -99985478741440255789999870732688899886 No 30 >cd04316 ND_PkAspRS_like_N ND_PkAspRS_like_N: N-terminal, anticodon recognition domain of the type found in the homodimeric non-discriminating (ND) Pyrococcus kodakaraensis aspartyl-tRNA synthetase (AspRS). This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. P. kodakaraensis AspRS is a class 2b aaRS. aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. P. kodakaraensis ND-AspRS can charge both tRNAAsp and tRNAAsn. Some of the enzymes in this group may be discriminating, based on the presence of homologs of asparaginyl-tRNA synthetase (AsnRS) in their completed genomes. Probab=99.92 E-value=7.2e-24 Score=190.25 Aligned_cols=103 Identities=33% Similarity=0.541 Sum_probs=91.4 Q ss_pred CCHHHCCHHHCCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCC---CHHHHHHHCCCCCCEEEEEEEEEECCCC Q ss_conf 2042369799898899997980044469879999977982599997488---1578998528988589999999975865 Q gi|254780419|r 7 HSCGDLGVSDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPD---SACFEIAKSVRSEWVLCIDGLVMSRSAE 83 (622) Q Consensus 7 h~cg~l~~~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~---~~~~~~~~~l~~esvv~v~G~V~~r~~~ 83 (622) ||||||+.+++|++|+|+|||+++|++|+++|++|||++|++||+++++ .+.++.+++|+.||+|.|+|+|.+++ T Consensus 1 H~~g~l~~~~~G~~V~i~Gwv~~~R~~g~~~Fi~LRD~~G~vQ~v~~~~~~~~~~~~~~~~l~~EsvV~V~G~V~~~~-- 78 (108) T cd04316 1 HYSAEITPELDGEEVTVAGWVHEIRDLGGIKFVILRDREGIVQVTAPKKKVDKELFKTVRKLSRESVISVTGTVKAEP-- 78 (108) T ss_pred CCCCCCCHHHCCCEEEEEEEEEEEECCCCEEEEEEECCCEEEEEEEECCCCCHHHHHHHHCCCCCEEEEEEEEEEECC-- T ss_conf 934759838999999998999823217987999998098779999937888989999986599944999999999489-- Q ss_pred CCCCCCCCCEEEEEEEEEEEEECCCC-CCCCCCC Q ss_conf 55887778659999889999502117-9986457 Q gi|254780419|r 84 TINANIITGQIELSAQKIEILSVAEE-LPLPVFG 116 (622) Q Consensus 84 ~~n~~~~tG~~Ei~~~~~~il~~a~~-~P~~~~~ 116 (622) +.+ |++||.+++++|||+|.+ +||.+.+ T Consensus 79 ----~~~-~giEi~v~~i~ils~a~~plP~~~~~ 107 (108) T cd04316 79 ----KAP-NGVEIIPEEIEVLSEAKTPLPLDPTG 107 (108) T ss_pred ----CCC-CCEEEEEEEEEEEECCCCCCCCCCCC T ss_conf ----899-96758964899985889998968589 No 31 >cd04319 PhAsnRS_like_N PhAsnRS_like_N: N-terminal, anticodon recognition domain of the type found in Pyrococcus horikoshii AsnRS asparaginyl-tRNA synthetase (AsnRS). This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. The archeal enzymes in this group are homodimeric class2b aminoacyl-tRNA synthetases (aaRSs). aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Probab=99.91 E-value=1.8e-23 Score=187.50 Aligned_cols=101 Identities=31% Similarity=0.476 Sum_probs=89.0 Q ss_pred EEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCC--CHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEE Q ss_conf 899997980044469879999977982599997488--157899852898858999999997586555887778659999 Q gi|254780419|r 20 FVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPD--SACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELS 97 (622) Q Consensus 20 ~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~--~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~ 97 (622) +|+|+|||+++|++|+++|++|||+||++|||++++ .+.++.+++++.||+|.|+|+|+++ |+.++| +||. T Consensus 1 eV~l~GWV~~~R~~g~i~Fi~LRD~tG~iQ~v~~~~~~~~~~~~~~~l~~Es~v~v~G~v~~~------~~~~~g-iEi~ 73 (103) T cd04319 1 KVTLAGWVYRKREVGKKAFIVLRDSTGIVQAVFSKDLNEEAYREAKKVGIESSVIVEGAVKAD------PRAPGG-AEVH 73 (103) T ss_pred CEEEEEEEEEEECCCCEEEEEECCCCEEEEEEECCCCCHHHHHHHHHCCCEEEEEEEEEEEEC------CCCCCC-EEEE T ss_conf 999999996763289779999826982799998787797999999856863999997798868------999987-8899 Q ss_pred EEEEEEEECCCCCCCCCCCCCCCCHHHHCCCCCE Q ss_conf 8899995021179986457667898782112320 Q gi|254780419|r 98 AQKIEILSVAEELPLPVFGEREYPESIRFKYRFL 131 (622) Q Consensus 98 ~~~~~il~~a~~~P~~~~~~~~~~e~~rl~~R~L 131 (622) +++++|||.|+++|++ .+.++|+||+|||| T Consensus 74 ~~~ieIl~~a~~~Pi~----~~~~~e~~l~~R~L 103 (103) T cd04319 74 GEKLEIIQNVEFFPIT----EDASDEFLLDVRHL 103 (103) T ss_pred EEEEEEEECCCCCCCC----CCCCCHHHHHCCCC T ss_conf 7289999858798988----98880231101379 No 32 >cd04322 LysRS_N LysRS_N: N-terminal, anticodon recognition domain of lysyl-tRNA synthetases (LysRS). These enzymes are homodimeric class 2b aminoacyl-tRNA synthetases (aaRSs). This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Included in this group are E. coli LysS and LysU. These two isoforms of LysRS are encoded by distinct genes which are differently regulated. Eukaryotes contain 2 sets of aaRSs, both of which encoded by the nuclear genome. One set concerns with cytoplasmic protein synthesis, whereas the other exclusively with mitochondrial protein Probab=99.88 E-value=3.7e-22 Score=178.01 Aligned_cols=104 Identities=27% Similarity=0.369 Sum_probs=87.8 Q ss_pred EEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCC---HHHHHH-HCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEE Q ss_conf 8999979800444698799999779825999974881---578998-528988589999999975865558877786599 Q gi|254780419|r 20 FVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPDS---ACFEIA-KSVRSEWVLCIDGLVMSRSAETINANIITGQIE 95 (622) Q Consensus 20 ~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~~---~~~~~~-~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~E 95 (622) +|+|+|||+++|++|+++|++|||++|.+|++++++. ..|+.+ +.++.|++|+|+|+|.. .+|||+| T Consensus 1 ~V~v~GrV~~~R~~G~l~FidLrD~~g~iQv~~~~~~~~~~~~~~~~~~~~~gDvv~v~G~v~~---------t~tGel~ 71 (108) T cd04322 1 EVSVAGRIMSKRGSGKLSFADLQDESGKIQVYVNKDDLGEEEFEDFKKLLDLGDIIGVTGTPFK---------TKTGELS 71 (108) T ss_pred CEEEEEEEEEEECCCCCEEEEEEECCCCEEEEEECCCCCHHHHHHHHHCCCCCEEEEEEEEEEC---------CCCCEEE T ss_conf 9899999984706798299999969961899997564775677777634443449999978751---------7993499 Q ss_pred EEEEEEEEEECCCCCCCCCCCCCCCCHHHHCCCCCEEE Q ss_conf 99889999502117998645766789878211232000 Q gi|254780419|r 96 LSAQKIEILSVAEELPLPVFGEREYPESIRFKYRFLDL 133 (622) Q Consensus 96 i~~~~~~il~~a~~~P~~~~~~~~~~e~~rl~~R~LdL 133 (622) |.++++++||+|.. |+|...+...++|+|++|||||| T Consensus 72 v~~~~~~lLsk~l~-plP~~~~~~~~~e~r~r~RyLDL 108 (108) T cd04322 72 IFVKEFTLLSKSLR-PLPEKFHGLTDVETRYRQRYLDL 108 (108) T ss_pred EEEEEHHCCCCCCC-CCCCCCCCCCCHHHEEHHHCCCC T ss_conf 99649120066678-99944657779612401200369 No 33 >cd04321 ScAspRS_mt_like_N ScAspRS_mt_like_N: N-terminal, anticodon recognition domain of the type found in Saccharomyces cerevisiae mitochondrial (mt) aspartyl-tRNA synthetase (AspRS). This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. The enzymes in this fungal group are homodimeric class2b aminoacyl-tRNA synthetases (aaRSs). aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Eukaryotes contain 2 sets of aaRSs, both of which are encoded by the nuclear genome. One set concerns with cytoplasmic protein synthesis, whereas the other exclusively with mitochondrial protein synthesis. Mutations in the gene for Probab=99.83 E-value=1.1e-19 Score=160.40 Aligned_cols=84 Identities=27% Similarity=0.446 Sum_probs=78.1 Q ss_pred EEEEEEEECCCCC-CCCEEEEEEECCCC-EEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEE Q ss_conf 8999979800444-69879999977982-599997488157899852898858999999997586555887778659999 Q gi|254780419|r 20 FVRLSGWVHRVRP-HGGIIFLDIRDHYG-ITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELS 97 (622) Q Consensus 20 ~V~i~GwV~~~R~-~g~l~F~~lrD~sG-~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~ 97 (622) +|+|+|||+++|+ ||+++|+||||.+| ++|+|.+++...++.+++++.||+|.|+|+|++|++ |++++||++||. T Consensus 1 kVtL~GWv~~rrd~~g~l~FidLRD~~G~~vQiV~~~~~~~~~~~~~~~~E~vV~V~G~V~~R~~---~~~~~tgeiEi~ 77 (86) T cd04321 1 KVTLNGWIDRKPRIVKKLSFADLRDPNGDIIQLVSTAKKDAFSLLKSITAESPVQVRGKLQLKEA---KSSEKNDEWELV 77 (86) T ss_pred CEEEEEEEEEEECCCCCEEEEEEECCCCCEEEEEECCCHHHHHHHHHCCCCCEEEEEEEEEECCC---CCCCCCCCEEEE T ss_conf 99999738745136898699999839999999994897778899974899668999999997898---899998789999 Q ss_pred EEEEEEEEC Q ss_conf 889999502 Q gi|254780419|r 98 AQKIEILSV 106 (622) Q Consensus 98 ~~~~~il~~ 106 (622) ++++++||. T Consensus 78 v~~i~iLNa 86 (86) T cd04321 78 VDDIQTLNA 86 (86) T ss_pred EEEEEEECC T ss_conf 839998167 No 34 >cd04320 AspRS_cyto_N AspRS_cyto_N: N-terminal, anticodon recognition domain of the type found in Saccharomyces cerevisiae and human cytoplasmic aspartyl-tRNA synthetase (AspRS). This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. The enzymes in this group are homodimeric class2b aminoacyl-tRNA synthetases (aaRSs). aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Eukaryotes contain 2 sets of aaRSs, both of which are encoded by the nuclear genome. One set concerns with cytoplasmic protein synthesis, whereas the other exclusively with mitochondrial protein synthesis. Probab=99.77 E-value=1.1e-17 Score=146.15 Aligned_cols=95 Identities=25% Similarity=0.304 Sum_probs=79.9 Q ss_pred EEEEEEEECCCCCCC-CEEEEEEECCCCEEEEEECCC-----CHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCE Q ss_conf 899997980044469-879999977982599997488-----15789985289885899999999758655588777865 Q gi|254780419|r 20 FVRLSGWVHRVRPHG-GIIFLDIRDHYGITQVVANPD-----SACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQ 93 (622) Q Consensus 20 ~V~i~GwV~~~R~~g-~l~F~~lrD~sG~~Q~v~~~~-----~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~ 93 (622) +|+++|||+++|++| +++|++|||++|++|+|+..+ .+.++.+++|+.||+|.|+|+|++++.. + .+...++ T Consensus 1 ~V~~rGwv~~~R~~g~kl~Fl~LRd~~g~iQ~V~~~~~~~~s~~m~~~~~~l~~ESvV~V~G~V~~~~~~-i-~~~~~~~ 78 (102) T cd04320 1 EVLIRARVHTSRAQGAKLAFLVLRQQGYTIQGVLAASAEGVSKQMVKWAGSLSKESIVDVEGTVKKPEEP-I-KSCTQQD 78 (102) T ss_pred CEEEEEEEEEECCCCCEEEEEEEECCCEEEEEEEEECCCCCCHHHHHHHHCCCCCEEEEEEEEEECCCCC-C-CCCCCCC T ss_conf 9899999972021797179999915871899999807887799999998559970099999999889975-5-6788687 Q ss_pred EEEEEEEEEEEECCC-CCCCCCCC Q ss_conf 999988999950211-79986457 Q gi|254780419|r 94 IELSAQKIEILSVAE-ELPLPVFG 116 (622) Q Consensus 94 ~Ei~~~~~~il~~a~-~~P~~~~~ 116 (622) +||.+++++++|+|. +|||++++ T Consensus 79 iEi~i~~i~vls~a~~~LPf~v~D 102 (102) T cd04320 79 VELHIEKIYVVSEAAEPLPFQLED 102 (102) T ss_pred EEEEEEEEEEEECCCCCCCCCCCC T ss_conf 899999999996689999941269 No 35 >cd04323 AsnRS_cyto_like_N AsnRS_cyto_like_N: N-terminal, anticodon recognition domain of the type found in human and Saccharomyces cerevisiae cytoplasmic asparaginyl-tRNA synthetase (AsnRS), in Brugia malayai AsnRs and, in various putative bacterial AsnRSs. This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. The enzymes in this group are homodimeric class2b aminoacyl-tRNA synthetases (aaRSs). aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Eukaryotes contain 2 sets of aaRSs, both of which are encoded by the nuclear genome. One set concerns with cytoplasmic synthesis, whereas the other exclusively with Probab=99.75 E-value=1.6e-17 Score=144.82 Aligned_cols=83 Identities=31% Similarity=0.443 Sum_probs=73.6 Q ss_pred EEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCC-HHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEE Q ss_conf 8999979800444698799999779825999974881-578998528988589999999975865558877786599998 Q gi|254780419|r 20 FVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPDS-ACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELSA 98 (622) Q Consensus 20 ~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~~-~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~~ 98 (622) +|+|+|||+++|++|+++|++|||+||++|||++++. ..++.+++|+.||+|.|+|+|.++|... +++|.+||.+ T Consensus 1 kV~l~GWV~~~R~~g~l~Fi~LRD~~G~iQ~V~~~~~~~~~~~~~~l~~Es~v~V~G~v~~~~~~~----~~~~~~Ei~~ 76 (84) T cd04323 1 RVKVFGWVHRLRSQKKLMFLVLRDGTGFLQCVLSKKLVTEFYDAKSLTQESSVEVTGEVKEDPRAK----QAPGGYELQV 76 (84) T ss_pred CEEEEEEEEEEECCCCCEEEEEEECCEEEEEEECCCCCCHHHHHHHCCCCEEEEEEEEEEECCCCC----CCCCCEEEEE T ss_conf 999999899862389928999950887999998886573689997388835999999999789777----7997599994 Q ss_pred EEEEEEEC Q ss_conf 89999502 Q gi|254780419|r 99 QKIEILSV 106 (622) Q Consensus 99 ~~~~il~~ 106 (622) ++++|||+ T Consensus 77 ~~i~vl~e 84 (84) T cd04323 77 DYLEIIGE 84 (84) T ss_pred EEEEEECC T ss_conf 39999639 No 36 >cd04100 Asp_Lys_Asn_RS_N Asp_Lys_Asn_RS_N: N-terminal, anticodon recognition domain of class 2b aminoacyl-tRNA synthetases (aaRSs). This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. Class 2b aaRSs include the homodimeric aspartyl-, asparaginyl-, and lysyl-tRNA synthetases (AspRS, AsnRS, and LysRS). aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Eukaryotes contain 2 sets of aaRSs, both of which are encoded by the nuclear genome. One set concerns with cytoplasmic protein synthesis, whereas the other exclusively with mitochondrial protein synthesis. Included in this group are archeal and archeal-like A Probab=99.71 E-value=1.9e-16 Score=137.12 Aligned_cols=83 Identities=48% Similarity=0.803 Sum_probs=75.2 Q ss_pred EEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCCH--HHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEE Q ss_conf 89999798004446987999997798259999748815--7899852898858999999997586555887778659999 Q gi|254780419|r 20 FVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPDSA--CFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELS 97 (622) Q Consensus 20 ~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~~~--~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~ 97 (622) +|+|+|||+++|++|+++|++|||++|.+|||++++.. .++.++.++.||+|.|+|+|.++|. +.+++|++||. T Consensus 1 ~V~v~Gwv~~~R~~g~~~Fi~lrD~~g~iQvv~~~~~~~~~~~~~~~l~~es~I~v~G~v~~~~~----~~~~~g~~Ei~ 76 (85) T cd04100 1 EVTLAGWVHSRRDHGGLIFIDLRDGSGIVQVVVNKEELGEFFEEAEKLRTESVVGVTGTVVKRPE----GNLATGEIELQ 76 (85) T ss_pred CEEEEEEEEEEECCCCEEEEEEECCCCEEEEEECCCCCHHHHHHHHCCCCCCEEEEEEEEEECCC----CCCCCCCEEEE T ss_conf 99999987904328988995850389508999917758578877635898889999999996887----77899889999 Q ss_pred EEEEEEEEC Q ss_conf 889999502 Q gi|254780419|r 98 AQKIEILSV 106 (622) Q Consensus 98 ~~~~~il~~ 106 (622) ++++++||+ T Consensus 77 ~~~i~il~~ 85 (85) T cd04100 77 AEELEVLSK 85 (85) T ss_pred EEEEEEECC T ss_conf 939999869 No 37 >cd04318 EcAsnRS_like_N EcAsnRS_like_N: N-terminal, anticodon recognition domain of the type found in Escherichia coli asparaginyl-tRNA synthetase (AsnRS) and, in Arabidopsis thaliana and Saccharomyces cerevisiae mitochondrial (mt) AsnRS. This domain is a beta-barrel domain (OB fold) involved in binding the tRNA anticodon stem-loop. The enzymes in this group are homodimeric class2b aminoacyl-tRNA synthetases (aaRSs). aaRSs catalyze the specific attachment of amino acids (AAs) to their cognate tRNAs during protein biosynthesis. This 2-step reaction involves i) the activation of the AA by ATP in the presence of magnesium ions, followed by ii) the transfer of the activated AA to the terminal ribose of tRNA. In the case of the class2b aaRSs, the activated AA is attached to the 3'OH of the terminal ribose. Eukaryotes contain 2 sets of aaRSs, both of which are encoded by the nuclear genome. One set concerns with cytoplasmic protein synthesis, whereas the other exclusively with mitochondrial Probab=99.54 E-value=1e-13 Score=117.63 Aligned_cols=80 Identities=26% Similarity=0.454 Sum_probs=71.7 Q ss_pred EEEEEEEECCCCCCCCEEEEEEECCCCE--EEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEE Q ss_conf 8999979800444698799999779825--99997488157899852898858999999997586555887778659999 Q gi|254780419|r 20 FVRLSGWVHRVRPHGGIIFLDIRDHYGI--TQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELS 97 (622) Q Consensus 20 ~V~i~GwV~~~R~~g~l~F~~lrD~sG~--~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~ 97 (622) +|+++|||+++|++|+++|++|||+|+. +|||++++...++.+.+++.|++|.|+|+++++|.+ .+.+|+. T Consensus 1 ~V~i~GWV~~~R~~~~i~Fi~L~DGS~~~~lQvV~~~~~~~~~~~~~l~~gssv~v~G~l~~s~~~-------~q~~El~ 73 (82) T cd04318 1 EVTVNGWVRSVRDSKKISFIELNDGSCLKNLQVVVDKELTNFKEILKLSTGSSIRVEGVLVKSPGA-------KQPFELQ 73 (82) T ss_pred CEEEEEEEEEEECCCCEEEEEEECCCCCCCEEEEECCCCCCHHHHHCCCCCEEEEEEEEEECCCCC-------CCCEEEE T ss_conf 999998688886489749999978986443899983776797798446785399999999908999-------9999999 Q ss_pred EEEEEEEEC Q ss_conf 889999502 Q gi|254780419|r 98 AQKIEILSV 106 (622) Q Consensus 98 ~~~~~il~~ 106 (622) ++++++++. T Consensus 74 ~~~i~i~g~ 82 (82) T cd04318 74 AEKIEVLGE 82 (82) T ss_pred EEEEEEEEC T ss_conf 449999959 No 38 >pfam02938 GAD GAD domain. This domain is found in some members of the GatB and aspartyl tRNA synthetases. Probab=99.44 E-value=2.1e-13 Score=115.35 Aligned_cols=93 Identities=30% Similarity=0.390 Sum_probs=77.5 Q ss_pred HHHHHHCCCCCEEEEEECCC-CCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCHHHHHHHHHHHC Q ss_conf 66687403784799996278-76652103567799987642024623552047775333570255325278999999818 Q gi|254780419|r 314 FSKILSVNPGYEVWAIPAKG-GGGNRAFCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNIVNTEAIRVRLD 392 (622) Q Consensus 314 f~~~~~~~~~~~v~~i~~~~-~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~~~~~~l~~~~~ 392 (622) |++++.. +..|++|++++ +.++|+++|.++++|+.+|++||+|+++..+ .+.|||+||+++++.++|.+.++ T Consensus 1 F~~~l~~--GG~Vkai~vp~~a~~sRk~id~l~~~ak~~GakGL~~ikv~~~-----~~~gpi~Kfl~~e~~~~l~~~~~ 73 (94) T pfam02938 1 FSEALKK--GGSVKALCVPGGAGLSRKQIDELERFAKEFGAKGLAWIKVEGG-----GHTGPIAKFLTEEEVEELLERVG 73 (94) T ss_pred CCHHHHC--CCEEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCC-----CCCCHHHHHCCHHHHHHHHHHHC T ss_conf 9347658--9989999878986225888999999999949884179997278-----76578887569999999999948 Q ss_pred CCCCCCEEEEECCCCCCCCHH Q ss_conf 875572699808854110013 Q gi|254780419|r 393 MQDGDSCFFIAGDPLKFCSFA 413 (622) Q Consensus 393 ~~~GD~i~f~a~~~~~~~~~l 413 (622) +++||++||+||++..+++.| T Consensus 74 a~~GD~iff~Ag~~~~v~~~L 94 (94) T pfam02938 74 AQNGDILLFVADKEKVVNKAL 94 (94) T ss_pred CCCCCEEEEECCCHHHHHHHC T ss_conf 999999999789989999649 No 39 >cd00496 PheRS_alpha_core Phenylalanyl-tRNA synthetase (PheRS) alpha chain catalytic core domain. PheRS belongs to class II aminoacyl-tRNA synthetases (aaRS) based upon its structure and the presence of three characteristic sequence motifs. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. While class II aaRSs generally aminoacylate the 3'-OH ribose of the appropriate tRNA, PheRS is an exception in that it attaches the amino acid at the 2'-OH group, like class I aaRSs. PheRS is an alpha-2/ beta-2 tetramer. Probab=99.20 E-value=1.3e-10 Score=95.45 Aligned_cols=89 Identities=25% Similarity=0.422 Sum_probs=58.5 Q ss_pred HHHHHHHHHHHHCCCEEEEECCCCCC--------CCCCCCCCCCCCCCCCHHHHHH-------CC--CCHHHHHHHHHCC Q ss_conf 99999998752036514620332112--------4645664543222221034431-------12--3854564433105 Q gi|254780419|r 146 TRIINSMRCRMIAENFIECSTPILTA--------SSPEGARDFLVPSRVNQGSFYA-------LP--QAPQQYKQLLMAS 208 (622) Q Consensus 146 s~i~~~~R~~l~~~gF~EveTP~L~k--------sTpeGArdFlVPsR~~~g~fYA-------Lp--QSPQlyKQlLmv~ 208 (622) ++++..++++|.+.||.+++.|.+.. ..|.. =|+|-..-.||. |. .||-|.+-|.--. T Consensus 4 ~~~~~~i~~~f~~~GF~~~~gp~ie~~~~NFD~Ln~P~d-----HPaR~~~DTfy~~~~~~~lLRtHTS~~q~r~l~~~~ 78 (218) T cd00496 4 NKVIEEIEDIFVSMGFTEVEGPEVETDFYNFDALNIPQD-----HPARDMQDTFYINDPARLLLRTHTSAVQARALAKLK 78 (218) T ss_pred HHHHHHHHHHHHHCCCEEECCCCCCCCHHHHHHHCCCCC-----CCCCCCCCEEEECCCCCCEECCCCCHHHHHHHHHCC T ss_conf 899999999998779868228951231030453385988-----755372432786377660215787566799997458 Q ss_pred CHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEE Q ss_conf 70233110011037776655380000356541 Q gi|254780419|r 209 GFDRYFQIAPCFRDEDPRSDRLPGEFYQLDME 240 (622) Q Consensus 209 G~dryfqiarcFRdEd~R~dRqp~EFTqlD~E 240 (622) +=-|+|.+|+|||.|...+.+.| +|.|+|.= T Consensus 79 ~p~~~~~~G~VyR~D~iDatH~~-~FhQ~Eg~ 109 (218) T cd00496 79 PPIRIFSIGRVYRNDEIDATHLP-EFHQIEGL 109 (218) T ss_pred CCCEEEECCCEEECCCCCCCCCC-EEEEEEEE T ss_conf 98179942504653889843153-12576379 No 40 >PRK00488 pheS phenylalanyl-tRNA synthetase subunit alpha; Validated Probab=99.15 E-value=5.5e-10 Score=90.92 Aligned_cols=107 Identities=21% Similarity=0.307 Sum_probs=68.2 Q ss_pred HCCCCCEEEC---CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCC--------CCCCC--CCCCCCCCCCCHHHH Q ss_conf 2112320001---2111121245899999998752036514620332112--------46456--645432222210344 Q gi|254780419|r 125 RFKYRFLDLR---RDTLHKNIVKRTRIINSMRCRMIAENFIECSTPILTA--------SSPEG--ARDFLVPSRVNQGSF 191 (622) Q Consensus 125 rl~~R~LdLR---~~~~~~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~k--------sTpeG--ArdFlVPsR~~~g~f 191 (622) ++..-.+|+- ++.....+--=++++..++++|...||..++-|-+-. ..|+. ||+ -.-.| T Consensus 86 ~L~~E~iDvTlP~~~~~~G~~HPit~~~~ei~~iF~~~Gf~v~~GpeiE~~~~NFdaLn~P~~HPAR~-------~~DTF 158 (338) T PRK00488 86 RLAAETIDVTLPGRRIELGGLHPLTRTIERIEDIFVGLGFEVAEGPEIEDDYYNFEALNIPKDHPARD-------MQDTF 158 (338) T ss_pred HHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEECCCCEEEHHHHHHHHCCCCCCCCCC-------CCCEE T ss_conf 75532463378788888766476999999999999986976961883100454688756997885436-------46628 Q ss_pred HH-----C--CCCHHHHHHHHHCCCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEE Q ss_conf 31-----1--238545644331057023311001103777665538000035654 Q gi|254780419|r 192 YA-----L--PQAPQQYKQLLMASGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDM 239 (622) Q Consensus 192 YA-----L--pQSPQlyKQlLmv~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~ 239 (622) |- | -.||=|-.-|.--..==|++-.|||||.|...+.+.| +|.|+|. T Consensus 159 y~~~~~lLRTHTS~~qir~m~~~~pp~~~i~~GrvyR~d~~DatH~~-~FhQ~Eg 212 (338) T PRK00488 159 YIDDRLLLRTHTSPVQIRTMEKQKPPIRIIAPGRVYRNDSDDATHSP-MFHQVEG 212 (338) T ss_pred EECCCCEECCCCCHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCE-EEEEEEE T ss_conf 97687134356878889999726998699952663158999875350-4664005 No 41 >PTZ00326 phenylalanyl-tRNA synthetase; Provisional Probab=99.15 E-value=4.8e-10 Score=91.34 Aligned_cols=99 Identities=31% Similarity=0.474 Sum_probs=57.8 Q ss_pred CCHHHHHHHHHHHHHHHHCCCEEEEECCCCCC---------CCCCC--CCC----CCC--CC--------------CCC- Q ss_conf 21245899999998752036514620332112---------46456--645----432--22--------------221- Q gi|254780419|r 140 KNIVKRTRIINSMRCRMIAENFIECSTPILTA---------SSPEG--ARD----FLV--PS--------------RVN- 187 (622) Q Consensus 140 ~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~k---------sTpeG--Ard----FlV--Ps--------------R~~- 187 (622) ..+--=.+++..+|+.|.+.||.|++||-+.- -+|.. ||| |.+ |. ++| T Consensus 239 g~~hp~~~~~~~~r~if~~mGF~E~~~~~~ves~fwNfDaLf~Pq~HPARd~~DTF~l~~p~~~~~~~~~~~~~Vk~vHe 318 (505) T PTZ00326 239 GNLHPLLKVRQEFREIFMEMGFEEMPTNRWVESSFWNFDALFIPQQHPARDLQDTFFLSKPETSKFLDAEYVERVKAVHS 318 (505) T ss_pred CCCCHHHHHHHHHHHHHHHCCCEECCCCCEEEECCCCCHHHCCCCCCCCCCCCCEEEECCCCCCCCCCHHHHHHHHHHHC T ss_conf 76688999999999999977976766897133132131110489899866523347744754245687899998888850 Q ss_pred ------HHHHH------H----CC--CCH----HHHH---HHHHCCCH--HHHEEEHHHHCCCCCCCCCCCCCEEEEEE Q ss_conf ------03443------1----12--385----4564---43310570--23311001103777665538000035654 Q gi|254780419|r 188 ------QGSFY------A----LP--QAP----QQYK---QLLMASGF--DRYFQIAPCFRDEDPRSDRLPGEFYQLDM 239 (622) Q Consensus 188 ------~g~fY------A----Lp--QSP----QlyK---QlLmv~G~--dryfqiarcFRdEd~R~dRqp~EFTqlD~ 239 (622) .|--| | |. .|| .||+ |-.-..++ -|||-|+||||+|...+.+.| ||.|++- T Consensus 319 ~G~~gs~Gw~y~W~~~~a~~~~LRthTt~vs~r~l~~~~~~~~~~~~~~P~k~fsi~rv~R~e~~d~th~~-~f~Q~eg 396 (505) T PTZ00326 319 GGGYGSIGWQYNWKLEEARKNVLRTHTTAVSARMLYKLAQQYQKGKGFKPKKYFSIDRVFRNETLDATHLA-EFHQVEG 396 (505) T ss_pred CCCCCCCCCCCCCCHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCEEEECCCEEECCCCCCCCCC-EEEEEEE T ss_conf 68878887677879778752063657753769999987540465568898147523726415777765340-4554357 No 42 >COG0016 PheS Phenylalanyl-tRNA synthetase alpha subunit [Translation, ribosomal structure and biogenesis] Probab=99.10 E-value=2.9e-10 Score=92.95 Aligned_cols=110 Identities=21% Similarity=0.337 Sum_probs=68.9 Q ss_pred HHHCCCCCEEECCCC---CCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCC--------CCCCCCCCCCCCCCCHHHH Q ss_conf 782112320001211---11212458999999987520365146203321124--------6456645432222210344 Q gi|254780419|r 123 SIRFKYRFLDLRRDT---LHKNIVKRTRIINSMRCRMIAENFIECSTPILTAS--------SPEGARDFLVPSRVNQGSF 191 (622) Q Consensus 123 ~~rl~~R~LdLR~~~---~~~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ks--------TpeGArdFlVPsR~~~g~f 191 (622) .-++..+.+|+--|. .....-.=..++..++++|.+.||.+++.|.+-.. -|.. =|+|--+..| T Consensus 88 ~~~l~~e~~dv~lp~~~~~~G~~Hpl~~~~e~i~~iF~~mGF~~~~gp~IE~d~~NFDaLn~P~d-----HPARdmqDTF 162 (335) T COG0016 88 WERLAFEKIDVTLPGRRIYPGSLHPLTQTIEEIEDIFLGMGFTEVEGPEIETDFYNFDALNIPQD-----HPARDMQDTF 162 (335) T ss_pred HHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCEECCCCCCCCCCCCHHHHCCCCC-----CCCCCCCCEE T ss_conf 88622157775788866778986708999999999999749666458821320225243249999-----9841244437 Q ss_pred HHCCC-----------CHHHHHHHHHCCC-HHHHEEEHHHHCCCCCCCCCCCCCEEEEEE Q ss_conf 31123-----------8545644331057-023311001103777665538000035654 Q gi|254780419|r 192 YALPQ-----------APQQYKQLLMASG-FDRYFQIAPCFRDEDPRSDRLPGEFYQLDM 239 (622) Q Consensus 192 YALpQ-----------SPQlyKQlLmv~G-~dryfqiarcFRdEd~R~dRqp~EFTqlD~ 239 (622) | +.- ||=+-..|.--.. ==|+|-++||||.+...+.+.| ||.|+|- T Consensus 163 y-~~~~~~~~lLRTHTs~vq~R~l~~~~~~P~k~~~~grvyR~D~~DaTHs~-~FhQiEG 220 (335) T COG0016 163 Y-LKDDREKLLLRTHTSPVQARTLAENAKIPIKIFSPGRVYRNDTVDATHSP-EFHQIEG 220 (335) T ss_pred E-ECCCCCCEEECCCCCHHHHHHHHHCCCCCCEEECCCCEECCCCCCCCCCH-HEEEEEE T ss_conf 8-76887730023668276599998479999158456634117887764261-0133678 No 43 >pfam01409 tRNA-synt_2d tRNA synthetases class II core domain (F). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only phenylalanyl-tRNA synthetases. This is the core catalytic domain. Probab=99.08 E-value=1.4e-09 Score=87.99 Aligned_cols=87 Identities=30% Similarity=0.473 Sum_probs=56.2 Q ss_pred HHHHHHHHHHHHHCCCEEEEECCCCCC--------CCCC--CCCCCCCCCCCCHHHHHH----------CC--CCHHHHH Q ss_conf 899999998752036514620332112--------4645--664543222221034431----------12--3854564 Q gi|254780419|r 145 RTRIINSMRCRMIAENFIECSTPILTA--------SSPE--GARDFLVPSRVNQGSFYA----------LP--QAPQQYK 202 (622) Q Consensus 145 Rs~i~~~~R~~l~~~gF~EveTP~L~k--------sTpe--GArdFlVPsR~~~g~fYA----------Lp--QSPQlyK 202 (622) =+.++..++++|.+.||.+++.|.+.. ..|. .||+ -.-.||- |. .||=+.. T Consensus 19 i~~~~~~i~~~f~~~GF~~~~g~eves~~~NFDaLn~P~dHPaR~-------~~DTfy~~~~~~~~~~vLRtHTS~~q~r 91 (243) T pfam01409 19 LTRVLEEIRDIFLSMGFEEVEGPEVESDFYNFDALNIPQDHPARD-------MQDTFYLKKPLKARRLLLRTHTTPVQAR 91 (243) T ss_pred HHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHCCCCCCCCCC-------CCCEEEECCCCCCCCEEECCCCCHHHHH T ss_conf 999999999999877986876983564277777557898898767-------6760343366667686576879889999 Q ss_pred HHHHCCC-HHHHEEEHHHHCCCCCCCCCCCCCEEEEEE Q ss_conf 4331057-023311001103777665538000035654 Q gi|254780419|r 203 QLLMASG-FDRYFQIAPCFRDEDPRSDRLPGEFYQLDM 239 (622) Q Consensus 203 QlLmv~G-~dryfqiarcFRdEd~R~dRqp~EFTqlD~ 239 (622) .|..-.. =-|+|.+|||||.|...+.+.| +|.|+|. T Consensus 92 ~l~~~~~~p~r~~~~grvyR~d~~DatH~~-~FhQ~Eg 128 (243) T pfam01409 92 TLAEKNKPPIKIFSIGRVFRRDQVDATHLP-EFHQVEG 128 (243) T ss_pred HHHHCCCCCEEEECCCEEEECCCCCCCCCC-EEEEEEE T ss_conf 997427998489814313325889843153-1057557 No 44 >PRK04172 pheS phenylalanyl-tRNA synthetase subunit alpha; Provisional Probab=99.05 E-value=3.8e-09 Score=84.91 Aligned_cols=69 Identities=36% Similarity=0.517 Sum_probs=46.2 Q ss_pred CCCHHHCCCCCCCCCHHHHHCCCHHHHHHHCCCEEEC--C-EEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 0002110024766430655418877741313677775--6-772125123156999999999859998999866778999 Q gi|254780419|r 451 KIDFAHNPFTMPQGGMESLKGNDLLSIKAFQYDLVCN--G-FEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYR 527 (622) Q Consensus 451 r~~s~HHPFTaP~~~~~~l~~~dp~~v~a~~YDLVlN--G-~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~ 527 (622) |+.++-.|||-|.-|..- -+. | +||||.++ -+|++ ++.+|++ T Consensus 419 rfrp~yfPftEPs~E~~~----------------~~~~~~WiE~~g~G~--~~pev----l~~~g~~------------- 463 (501) T PRK04172 419 KFRPAYFPFTEPSVEVEV----------------YHPGLGWVELGGAGI--FRPEV----TEPLGID------------- 463 (501) T ss_pred EECCCCCCCCCCCEEEEE----------------EECCCCEEEEECCCC--CCHHH----HHHCCCC------------- T ss_conf 867998998998278999----------------957998699826757--88899----9755999------------- Q ss_pred HHHCCCCCCCCEECCHHHHHHHHCCCCCCEEEE Q ss_conf 986084973422246899999983739840321 Q gi|254780419|r 528 AFQCGMPPHGGIAAGIDRIVMLLLGAKNVREVS 560 (622) Q Consensus 528 Al~yG~PPHgG~alGlDRlvmll~g~~sIRdVI 560 (622) +|-..||||+||+.|+..|.++|||-. T Consensus 464 ------~~~~a~G~g~eR~am~~~gi~diR~l~ 490 (501) T PRK04172 464 ------VPVLAWGIGIDRLAMLRLGLDDIRDLY 490 (501) T ss_pred ------CCEEEEEECHHHHHHHHHCCHHHHHHH T ss_conf ------874787515999999982980888885 No 45 >cd00768 class_II_aaRS-like_core Class II tRNA amino-acyl synthetase-like catalytic core domain. Class II amino acyl-tRNA synthetases (aaRS) share a common fold and generally attach an amino acid to the 3' OH of ribose of the appropriate tRNA. PheRS is an exception in that it attaches the amino acid at the 2'-OH group, like class I aaRSs. These enzymes are usually homodimers. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. The substrate specificity of this reaction is further determined by additional domains. Intererestingly, this domain is also found is asparagine synthase A (AsnA), in the accessory subunit of mitochondrial polymerase gamma and in the bacterial ATP phosphoribosyltransferase regulatory subunit HisZ. Probab=99.02 E-value=6.9e-10 Score=90.21 Aligned_cols=116 Identities=26% Similarity=0.253 Sum_probs=73.6 Q ss_pred HHHHHHHHHHHHHHCCCEEEEECCCCCCCCC-CCCCC---CCCCCCCCHHHHHHCCCCHHHHHHHHHCCC----HHHHEE Q ss_conf 5899999998752036514620332112464-56645---432222210344311238545644331057----023311 Q gi|254780419|r 144 KRTRIINSMRCRMIAENFIECSTPILTASSP-EGARD---FLVPSRVNQGSFYALPQAPQQYKQLLMASG----FDRYFQ 215 (622) Q Consensus 144 ~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTp-eGArd---FlVPsR~~~g~fYALpQSPQlyKQlLmv~G----~dryfq 215 (622) +|+++.+.+|++|.+.||.||.||+|.+... +|+.. +.++.....+.-|+|.-+.+...-.++.+. =-|||| T Consensus 1 ~r~~ie~~~r~~~~~~G~~ev~~P~l~~~~~~~~sg~~~~~~~~~~~~~~~~~~L~pt~e~~~~~~~~~~~~dLP~r~~~ 80 (211) T cd00768 1 IRSKIEQKLRRFMAELGFQEVETPIVEREPLLEKAGHEPKDLLPVGAENEEDLYLRPTLEPGLVRLFVSHIRKLPLRLAE 80 (211) T ss_pred CHHHHHHHHHHHHHHCCCEEEECCEECCHHHHHHCCCCHHHHHHHCCCCCCEEEEECCCCHHHHHHHHHHHHHCCHHHHE T ss_conf 97899999999999869989999834279999750676222301104679769995168499999975247657778643 Q ss_pred EHHHHCCCCCCC--CCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 001103777665--5380000356541026899999999999999999 Q gi|254780419|r 216 IAPCFRDEDPRS--DRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVF 261 (622) Q Consensus 216 iarcFRdEd~R~--dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~ 261 (622) +++|||+|.... -|.- ||||+|+..... .++.-+..++|+..+. T Consensus 81 ~~~~fR~E~~~~Gl~Rvr-eF~~~e~~~f~~-~~~~~~~~~~~~~~~~ 126 (211) T cd00768 81 IGPAFRNEGGRRGLRRVR-EFTQLEGEVFGE-DGEEASEFEELIELTE 126 (211) T ss_pred ECCEECCCCCCCCCCCCC-CEEEEEEEEEEC-CHHHHHHHHHHHHHHH T ss_conf 200560578888844604-688984799988-4888999999999999 No 46 >PRK12295 hisZ ATP phosphoribosyltransferase regulatory subunit; Provisional Probab=98.69 E-value=1.2e-06 Score=67.03 Aligned_cols=111 Identities=20% Similarity=0.262 Sum_probs=59.4 Q ss_pred HHHHHHHHHHHHHHCCCEEEEECCCCC-------CCCCCC-CCCCCCCCCCCHHHHHHCC--CCHHHHHHHH--HCCCHH Q ss_conf 589999999875203651462033211-------246456-6454322222103443112--3854564433--105702 Q gi|254780419|r 144 KRTRIINSMRCRMIAENFIECSTPILT-------ASSPEG-ARDFLVPSRVNQGSFYALP--QAPQQYKQLL--MASGFD 211 (622) Q Consensus 144 ~Rs~i~~~~R~~l~~~gF~EveTP~L~-------ksTpeG-ArdFlVPsR~~~g~fYALp--QSPQlyKQlL--mv~G~d 211 (622) -|-.+...+++-|...||-+|+||++- ++.++- .+-|..-. ..|.--+|. -.||+-...+ -...-- T Consensus 6 r~~~~~~~i~~~f~~~Gy~~I~tP~lE~~d~~~~~~Ge~~~~~~y~f~D--~~g~~l~LRpDlT~piaR~~~~~~~~~p~ 83 (373) T PRK12295 6 RSAALAEALLASFEAAGAVRVDPPILQPAEPFLDLSGEDIRRRIFVTSD--ENGEELCLRPDFTIPVCRRHLASNAGEPA 83 (373) T ss_pred HHHHHHHHHHHHHHHCCCEEECCCCCCCHHHHCCCCCCHHHEEEEEEEC--CCCCEEEEECCCCHHHHHHHHHHCCCCCE T ss_conf 7999999999999986996856876041776414356331100488998--99898998178889999999983899980 Q ss_pred HHEEEHHHHCCCCCCCCCCCCCEEEEEEEECC-C----CHHHHHHHHHHHHHHH Q ss_conf 33110011037776655380000356541026-8----9999999999999999 Q gi|254780419|r 212 RYFQIAPCFRDEDPRSDRLPGEFYQLDMEMSF-V----EQDDILNTMENVLRGV 260 (622) Q Consensus 212 ryfqiarcFRdEd~R~dRqp~EFTqlD~EmsF-~----~~edvm~l~E~li~~i 260 (622) |||.+|+|||.... |. -||||.++|.== . ..-+|+.++-+.++.+ T Consensus 84 R~~Y~G~VfR~q~g---r~-rEf~Q~GvEiiG~~~~~~aDaEvi~la~~~l~~l 133 (373) T PRK12295 84 RYSYLGEVFRQRRD---RA-SEFLQAGIESFGRADPAAADAEVLALALEALAAL 133 (373) T ss_pred EEEEECCEEECCCC---CC-CCEEEEEEEEECCCCCCCCCHHHHHHHHHHHHHC T ss_conf 07877332245799---98-8516701578578982106499999999999975 No 47 >pfam00587 tRNA-synt_2b tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synthetases. Probab=98.61 E-value=1.2e-07 Score=74.11 Aligned_cols=113 Identities=29% Similarity=0.400 Sum_probs=67.2 Q ss_pred HHHHHHHHHHHHHHCCCEEEEECCCCCCCC----CCCCCCC---CCCCCCCHHHHHHC-CCCHH----HHHHHHHCCCH- Q ss_conf 589999999875203651462033211246----4566454---32222210344311-23854----56443310570- Q gi|254780419|r 144 KRTRIINSMRCRMIAENFIECSTPILTASS----PEGARDF---LVPSRVNQGSFYAL-PQAPQ----QYKQLLMASGF- 210 (622) Q Consensus 144 ~Rs~i~~~~R~~l~~~gF~EveTP~L~ksT----peGArdF---lVPsR~~~g~fYAL-pQSPQ----lyKQlLmv~G~- 210 (622) +|.++...+|+.+...||.||.||+|.... .++..+| +.......|.-|+| |.+.+ +|++-. .+-- T Consensus 1 l~~~l~~~~~~~~~~~G~~~i~tP~l~~~~~~~~sg~~~~~~~~my~~~d~~~~~l~Lrp~~~~~~~~~~~~~~-~~~~~ 79 (170) T pfam00587 1 LRNALENFIRDLFKRYGYQEVDTPILEPKELWEGSGHWDDYFDEMYRFKDRGGEELYLRPTAEVGITRLFKNEI-LSYRD 79 (170) T ss_pred CHHHHHHHHHHHHHHCCCEEEECCEEEEHHHHHHCCCCCCCHHHEEEEECCCCCEEEEECCCCHHHHHHHHHHC-CCCCC T ss_conf 97999999999999859989999917567898344776557531256735899768873257458889988620-03235 Q ss_pred --HHHEEEHHHHCCCC-CCC--CCCCCCEEEEEEEECCCCHHHHHHHHHHHHHH Q ss_conf --23311001103777-665--53800003565410268999999999999999 Q gi|254780419|r 211 --DRYFQIAPCFRDED-PRS--DRLPGEFYQLDMEMSFVEQDDILNTMENVLRG 259 (622) Q Consensus 211 --dryfqiarcFRdEd-~R~--dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~ 259 (622) =|+||+++|||+|- -+. -|.= ||||.|+|. |.+.++.....++++.. T Consensus 80 lP~r~~~~~~~fR~E~~~~~Gl~R~r-eF~q~d~~~-~~~~e~s~~~~~e~~~~ 131 (170) T pfam00587 80 LPLKLYQIGPCFRYEARPRRGLGRVR-EFTQVDAEI-FGTPEQSEEEDEELLKL 131 (170) T ss_pred CCCEEECCCCCCCCCCCCCCCCCEEE-EEEEECEEE-EECCCHHHHHHHHHHHH T ss_conf 86013306530067777666774367-889804799-96700429999999999 No 48 >pfam01336 tRNA_anti OB-fold nucleic acid binding domain. This family contains OB-fold domains that bind to nucleic acids. The family includes the anti-codon binding domain of lysyl, aspartyl, and asparaginyl -tRNA synthetases (See pfam00152). Aminoacyl-tRNA synthetases catalyse the addition of an amino acid to the appropriate tRNA molecule EC:6.1.1.-. This family also includes part of RecG helicase involved in DNA repair. Replication factor A is a heterotrimeric complex, that contains a subunit in this family. This domain is also found at the C-terminus of bacterial DNA polymerase III alpha chain. Probab=98.56 E-value=1.2e-06 Score=67.04 Aligned_cols=74 Identities=35% Similarity=0.540 Sum_probs=62.4 Q ss_pred EEEEEEECCC-CCCCCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEE Q ss_conf 9999798004-446987999997798259999748815789985289885899999999758655588777865999988 Q gi|254780419|r 21 VRLSGWVHRV-RPHGGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELSAQ 99 (622) Q Consensus 21 V~i~GwV~~~-R~~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~~~ 99 (622) |++.|||.++ |..+++.|+.|+|.+|.+||++..+ ........|..++.|.|.|.+..++. |++++.+. T Consensus 1 v~v~G~V~~~~~~~~~~~~~~l~D~tg~i~~~~~~~-~~~~~~~~l~~g~~v~v~G~v~~~~~---------~~~~i~~~ 70 (75) T pfam01336 1 VTVAGRVTSVRRSGGKVAFLTLRDGTGSIQVVLFKE-EAEKLAKKLKEGDVVLVTGKVKKRPG---------GELELVVE 70 (75) T ss_pred CEEEEEEEEEEECCCCEEEEEEEECCEEEEEEEEHH-HHHHHHHCCCCCCEEEEEEEEEECCC---------CCEEEEEE T ss_conf 999999995188799989999997993999999523-86888632898989999999999749---------98999998 Q ss_pred EEEEE Q ss_conf 99995 Q gi|254780419|r 100 KIEIL 104 (622) Q Consensus 100 ~~~il 104 (622) +++++ T Consensus 71 ~i~~~ 75 (75) T pfam01336 71 EIEVL 75 (75) T ss_pred EEEEC T ss_conf 99979 No 49 >PRK00037 hisS histidyl-tRNA synthetase; Reviewed Probab=98.19 E-value=3.9e-06 Score=63.36 Aligned_cols=98 Identities=21% Similarity=0.341 Sum_probs=62.1 Q ss_pred CCHHHHHHHHHHHHHHHHCCCEEEEECCCCC------CCCCCCCCC------CCCCCCCCHHHHHHCC--CCHHHHHHHH Q ss_conf 2124589999999875203651462033211------246456645------4322222103443112--3854564433 Q gi|254780419|r 140 KNIVKRTRIINSMRCRMIAENFIECSTPILT------ASSPEGARD------FLVPSRVNQGSFYALP--QAPQQYKQLL 205 (622) Q Consensus 140 ~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~------ksTpeGArd------FlVPsR~~~g~fYALp--QSPQlyKQlL 205 (622) .-.+.|..+...+|+.+...||-||+||+|- ++..| ..+ |-.-. ..|.-+||. -.|+.-.-.. T Consensus 16 ~~~~~~~~i~~~i~~~~~~~Gy~~I~tP~lE~~d~~~~~~g~-~~d~~~k~~y~f~D--~~g~~l~LRpD~T~~iaR~~~ 92 (417) T PRK00037 16 EESAKWQYVEDTIREVFERYGFSEIRTPIFEYTELFKRKIGE-STDIVEKEMYTFQD--KGGRSLTLRPEGTAPVVRAVI 92 (417) T ss_pred HHHHHHHHHHHHHHHHHHHCCCEEEECCCCCHHHHHCCCCCC-CCCHHHHHEEEEEC--CCCCEEEECCCCCHHHHHHHH T ss_conf 899999999999999999859958577521509886245776-32201332056775--999889856776729999999 Q ss_pred ---HCCC---HHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEE Q ss_conf ---1057---02331100110377766553800003565410 Q gi|254780419|r 206 ---MASG---FDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEM 241 (622) Q Consensus 206 ---mv~G---~dryfqiarcFRdEd~R~dRqp~EFTqlD~Em 241 (622) +.+. ==|+|++++|||+|...+-|.- ||||.|+|+ T Consensus 93 ~~~~~~~~~~P~r~~y~g~vfR~e~p~~GR~R-Ef~Q~g~ei 133 (417) T PRK00037 93 ENKLYNELPKPFKLYYIGPMFRYERPQKGRYR-QFHQFGVEV 133 (417) T ss_pred HHHHHCCCCCCEEEEEECCEEECCCCCCCCCC-CEEECCHHH T ss_conf 72101068876589987246604789887635-245635277 No 50 >CHL00201 syh histidine-tRNA synthetase; Provisional Probab=98.14 E-value=3.3e-06 Score=63.87 Aligned_cols=99 Identities=21% Similarity=0.275 Sum_probs=59.2 Q ss_pred CCHHHHHHHHHHHHHHHHCCCEEEEECCCCC-------CCCCCCC---CC-CCCCCCCCHHHHHHCC--CCHHHHH---- Q ss_conf 2124589999999875203651462033211-------2464566---45-4322222103443112--3854564---- Q gi|254780419|r 140 KNIVKRTRIINSMRCRMIAENFIECSTPILT-------ASSPEGA---RD-FLVPSRVNQGSFYALP--QAPQQYK---- 202 (622) Q Consensus 140 ~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~-------ksTpeGA---rd-FlVPsR~~~g~fYALp--QSPQlyK---- 202 (622) .-...|..+...+++.+...||-+|+||++- ++.+++. .+ |-..- ..|.-.||. ..|++-. T Consensus 16 ~e~~~~~~i~~~i~~~f~~~Gf~~I~TP~lE~~e~~~~~~g~~~d~i~k~~y~f~D--~~g~~l~LRpD~T~piaR~~~~ 93 (424) T CHL00201 16 DEIEYWQFIHEKAATLLKLANYQEIRTPIFENTELFDRGIGEETDIVNKEMYRFFD--RGNRSITLRPEGTASIVRAFIE 93 (424) T ss_pred HHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHCCCCCCCCHHHHHHHEEEEC--CCCCEEEECCCCHHHHHHHHHH T ss_conf 99999999999999999985992504754041988623578742056665325455--9998888678770999999997 Q ss_pred -HHHHCCCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEE Q ss_conf -433105702331100110377766553800003565410 Q gi|254780419|r 203 -QLLMASGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEM 241 (622) Q Consensus 203 -QlLmv~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~Em 241 (622) ++.....-=|+|++++|||.|..-.-|.= ||||.|+|. T Consensus 94 ~~~~~~~~P~r~~y~g~vfR~e~pq~GR~R-EF~Q~gvEi 132 (424) T CHL00201 94 NRMAYHMRPQRLWYSGPMFRYERPQSGRQR-QFHQLGIEF 132 (424) T ss_pred CCCCCCCCCEEEEEECCEEECCCCCCCCCC-HHHHCCEEE T ss_conf 476557998789987547805898788642-104146200 No 51 >cd00670 Gly_His_Pro_Ser_Thr_tRS_core Gly_His_Pro_Ser_Thr_tRNA synthetase class II core domain. This domain is the core catalytic domain of tRNA synthetases of the subgroup containing glycyl, histidyl, prolyl, seryl and threonyl tRNA synthetases. It is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. These enzymes belong to class II aminoacyl-tRNA synthetases (aaRS) based upon their structure and the presence of three characteristic sequence motifs in the core domain. This domain is also found at the C-terminus of eukaryotic GCN2 protein kinase and at the N-terminus of the ATP phosphoribosyltransferase accessory subunit, HisZ and the accessory subunit of mitochondrial polymerase gamma (Pol gamma b) . Most class II tRNA synthetases are dimers, with this subgroup consisting of mostly homodimers. These enzymes attach a specific amino acid to the 3' OH group of ribose of the appropriate tRNA. Probab=98.08 E-value=1.8e-05 Score=58.58 Aligned_cols=115 Identities=23% Similarity=0.257 Sum_probs=69.7 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCC-C--C----CCC--CCCCCCC--CHHHHHHC-CCCHH----HHHHHH Q ss_conf 245899999998752036514620332112464-5--6----645--4322222--10344311-23854----564433 Q gi|254780419|r 142 IVKRTRIINSMRCRMIAENFIECSTPILTASSP-E--G----ARD--FLVPSRV--NQGSFYAL-PQAPQ----QYKQLL 205 (622) Q Consensus 142 l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTp-e--G----Ard--FlVPsR~--~~g~fYAL-pQSPQ----lyKQlL 205 (622) .+++.++.+.+++.+.+.||.||.||.|.+... + | -++ |.|..+. ..+.=|+| |.|-. +|+.-. T Consensus 2 a~L~~aL~~~~~~~~~~~G~~~v~~P~l~~~~~~~~sg~~~~f~~~~y~v~~~~~~~~~~~~~L~PTsE~~~~~~~~~~i 81 (235) T cd00670 2 TALWRALERFLDDRMAEYGYQEILFPFLAPTVLFFKGGHLDGYRKEMYTFEDKGRELRDTDLVLRPAACEPIYQIFSGEI 81 (235) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHCCCCCCCCCCEEEEECCCCCCCCCCEEEEECCCHHHHHHHHCCC T ss_conf 88999999999999998699899898557899992238675452052898237766678738993078699999994832 Q ss_pred HCCC--H-HHHEEEHHHHCCCCCCC---CCCCCCEEEEEEEECCCCHHHHHHHHHHHHHH Q ss_conf 1057--0-23311001103777665---53800003565410268999999999999999 Q gi|254780419|r 206 MASG--F-DRYFQIAPCFRDEDPRS---DRLPGEFYQLDMEMSFVEQDDILNTMENVLRG 259 (622) Q Consensus 206 mv~G--~-dryfqiarcFRdEd~R~---dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~ 259 (622) .+- + =||+|+++|||+|-+-+ -|.- ||+|.|+. +|.+.++..+..|.++.. T Consensus 82 -~s~~dLPlr~~~~s~~fR~E~~~~~Gl~Rv~-eF~k~E~~-~~~~~e~~~~~~~~~~~~ 138 (235) T cd00670 82 -LSYRALPLRLDQIGPCFRHEPSGRRGLMRVR-EFRQVEYV-VFGEPEEAEEERREWLEL 138 (235) T ss_pred -CCHHHCCEEEEEECCEECCCCCCCCCCEEEE-EEEEEEEE-EEECHHHHHHHHHHHHHH T ss_conf -8768799545445653336789999983678-99985079-996889999999999999 No 52 >cd00771 ThrRS_core Threonyl-tRNA synthetase (ThrRS) class II core catalytic domain. ThrRS is a homodimer. It is responsible for the attachment of threonine to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs in the core domain. Probab=98.07 E-value=1.5e-05 Score=59.16 Aligned_cols=115 Identities=22% Similarity=0.251 Sum_probs=72.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCC---CC------CCCCCCCCCCCHHHHHHC-----CCCHHHHHHHHH Q ss_conf 1245899999998752036514620332112464---56------645432222210344311-----238545644331 Q gi|254780419|r 141 NIVKRTRIINSMRCRMIAENFIECSTPILTASSP---EG------ARDFLVPSRVNQGSFYAL-----PQAPQQYKQLLM 206 (622) Q Consensus 141 ~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTp---eG------ArdFlVPsR~~~g~fYAL-----pQSPQlyKQlLm 206 (622) =.++|..+...+|+...+.||.||.||.|.+..- .| .+=|.| ...+.-|+| |..-++||+.+. T Consensus 29 G~~i~~~L~~~ir~~~~~~Gy~eV~tP~i~~~~Lw~~SGH~~~y~e~mf~~---~~~~~~~~LkPmnCp~h~~i~~~~~~ 105 (298) T cd00771 29 GAIIRNELEDFLRELQRKRGYQEVETPIIYNKELWETSGHWDHYRENMFPF---EEEDEEYGLKPMNCPGHCLIFKSKPR 105 (298) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHCCCCCCCCCCEEEE---EECCCCEEEECCCCHHHHHHHHCCCC T ss_conf 899999999999999998699899886126655675477631136856888---86254168850686889999863556 Q ss_pred CCC--H-HHHEEEHHHHCCCCCCC----CCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 057--0-23311001103777665----5380000356541026899999999999999999 Q gi|254780419|r 207 ASG--F-DRYFQIAPCFRDEDPRS----DRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVF 261 (622) Q Consensus 207 v~G--~-dryfqiarcFRdEd~R~----dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~ 261 (622) |= + =|++|++.|||+|-+-+ -|-- +|||-|.- .|.+.+++.+-+.+++.-+. T Consensus 106 -SYrdLPlR~~E~g~~~R~E~sG~L~GL~RvR-~FtqdDaH-iFc~~eQ~~~E~~~~~~~~~ 164 (298) T cd00771 106 -SYRDLPLRLAEFGTVHRYEQSGALHGLTRVR-GFTQDDAH-IFCTPDQIKEEIKGVLDLIK 164 (298) T ss_pred -CHHHCCHHHHCCCCEECCCCCCCCCCCCCCC-CCCCCCCE-EECCHHHHHHHHHHHHHHHH T ss_conf -7343467652075055476776755744555-67457632-44168899999999999999 No 53 >PRK12325 prolyl-tRNA synthetase; Provisional Probab=98.06 E-value=1.6e-05 Score=58.89 Aligned_cols=110 Identities=21% Similarity=0.251 Sum_probs=71.5 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCC---CC----------------CCCC-CCCCCCCHHHHHHCCCCHHH Q ss_conf 1245899999998752036514620332112464---56----------------6454-32222210344311238545 Q gi|254780419|r 141 NIVKRTRIINSMRCRMIAENFIECSTPILTASSP---EG----------------ARDF-LVPSRVNQGSFYALPQAPQQ 200 (622) Q Consensus 141 ~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTp---eG----------------ArdF-lVPsR~~~g~fYALpQSPQl 200 (622) -.+++.++...+|+-+.+.||-||.||.|.++.- .| -++| |=|+- .++...-..+.+.= T Consensus 46 G~~i~~~le~~ir~e~~~~G~~eV~tP~l~~~~Lw~~SGh~~~y~~~mf~~~d~~~~~~~l~Pmn-c~~~~~i~~~~~~S 124 (438) T PRK12325 46 GLKVLKKIENIVREEQNRAGAIEILMPTIQPADLWRESGRYDAYGKEMLRIKDRHDREMLYGPTN-EEMITDIFRSYVKS 124 (438) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHCCHHHHHHHHCEEECCCCCEECCCCCC-CHHHHHHHHHCCCC T ss_conf 89999999999999999749979987765877899862876455574212134777743138998-49999999740342 Q ss_pred HHHHHHCCCHHHHEEEHHHHCCCCCC-C--CCCCCCEEEEEEEECC-CCHHHHHHHHHHHHH Q ss_conf 64433105702331100110377766-5--5380000356541026-899999999999999 Q gi|254780419|r 201 YKQLLMASGFDRYFQIAPCFRDEDPR-S--DRLPGEFYQLDMEMSF-VEQDDILNTMENVLR 258 (622) Q Consensus 201 yKQlLmv~G~dryfqiarcFRdEd~R-~--dRqp~EFTqlD~EmsF-~~~edvm~l~E~li~ 258 (622) ||||=+ |+|||+.+||||-+- . -|-= ||||-|.. +| .+.++.....+.+.. T Consensus 125 YRdLPl-----rl~q~g~~~R~E~sp~~GLlRvR-eF~m~Dah-~F~~~~e~~~~~~~~v~~ 179 (438) T PRK12325 125 YKDLPL-----NLYHIQWKFRDEIRPRFGVMRGR-EFLMKDAY-SFDLDEEGARKSYNRMFV 179 (438) T ss_pred HHHCCH-----HHHHHHHEECCCCCCCCCCCEEE-EEECCCHH-HCCCCHHHHHHHHHHHHH T ss_conf 654257-----88764440112026764410122-33304154-324688999999999999 No 54 >cd00779 ProRS_core_prok Prolyl-tRNA synthetase (ProRS) class II core catalytic domain. ProRS is a homodimer. It is responsible for the attachment of proline to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs in the core domain. This subfamily contains the core domain of ProRS from prokaryotes and from the mitochondria of eukaryotes. Probab=98.05 E-value=1.4e-05 Score=59.37 Aligned_cols=110 Identities=25% Similarity=0.301 Sum_probs=69.5 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCC-------CC-CC-CCCCCCCCCHHHHHHC-CC-----------CHH Q ss_conf 1245899999998752036514620332112464-------56-64-5432222210344311-23-----------854 Q gi|254780419|r 141 NIVKRTRIINSMRCRMIAENFIECSTPILTASSP-------EG-AR-DFLVPSRVNQGSFYAL-PQ-----------APQ 199 (622) Q Consensus 141 ~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTp-------eG-Ar-dFlVPsR~~~g~fYAL-pQ-----------SPQ 199 (622) -.+++-++...+|+-+.+.|+.||.||.|+...- ++ +. -|-|..| .++-|+| |. .+. T Consensus 30 G~~i~~~i~~~i~~~~~~~G~~ev~~P~l~~~~l~~~sg~~~~f~~emf~v~d~--~~~~~~L~PT~E~~~~~~~~~~i~ 107 (255) T cd00779 30 GLRVLKKIENIIREEMNKIGAQEILMPILQPAELWKESGRWDAYGPELLRLKDR--HGKEFLLGPTHEEVITDLVANEIK 107 (255) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHCCCCCCCCCEEEEECC--CCCEEEECCCCHHHHHHHHHHHCC T ss_conf 999999999999999998599899765347388874425301368731544537--887773168874889999986503 Q ss_pred HHHHHHHCCCHHHHEEEHHHHCCCCCCCC---CCCCCEEEEEEEECCCCHHHHHHHHHHHHH Q ss_conf 56443310570233110011037776655---380000356541026899999999999999 Q gi|254780419|r 200 QYKQLLMASGFDRYFQIAPCFRDEDPRSD---RLPGEFYQLDMEMSFVEQDDILNTMENVLR 258 (622) Q Consensus 200 lyKQlLmv~G~dryfqiarcFRdEd~R~d---Rqp~EFTqlD~EmsF~~~edvm~l~E~li~ 258 (622) =||+|=+ |||||+.|||+|-.-.- |.= ||+|-|+....++.++.....+.+.. T Consensus 108 SyrdLPl-----~~~q~~~~fR~E~r~~~GllR~r-EF~~~d~hsf~~~~e~a~~~~~~~~~ 163 (255) T cd00779 108 SYKQLPL-----NLYQIQTKFRDEIRPRFGLMRGR-EFLMKDAYSFDIDEESLEETYEKMYQ 163 (255) T ss_pred CHHHCCE-----EEECCCCEECCCCCCCCCCCEEE-EEEEEHHHCCCCCHHHHHHHHHHHHH T ss_conf 6122574-----54135335425667886632431-37700220356998999999999999 No 55 >COG0124 HisS Histidyl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=97.99 E-value=1.3e-05 Score=59.68 Aligned_cols=111 Identities=21% Similarity=0.304 Sum_probs=66.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECCCC------CCCCCCCCCC--CCCCCC-CCHHHHHH------------CCCCHH Q ss_conf 12458999999987520365146203321------1246456645--432222-21034431------------123854 Q gi|254780419|r 141 NIVKRTRIINSMRCRMIAENFIECSTPIL------TASSPEGARD--FLVPSR-VNQGSFYA------------LPQAPQ 199 (622) Q Consensus 141 ~l~~Rs~i~~~~R~~l~~~gF~EveTP~L------~ksTpeGArd--FlVPsR-~~~g~fYA------------LpQSPQ 199 (622) -...|..+...+|+-+..-||.||+||++ .++..|..+- -.+-+= -+.|...| +.+.++ T Consensus 17 d~~~~~~i~~~~~~v~~~yGf~eI~TPifE~telf~r~~Ge~td~v~kemY~F~Dkggr~laLRpe~Tapv~R~~~en~~ 96 (429) T COG0124 17 DMALREYIESTIRKVFESYGFSEIRTPIFEYTELFARKSGEETDVVEKEMYTFKDKGGRSLALRPELTAPVARAVAENKL 96 (429) T ss_pred HHHHHHHHHHHHHHHHHHCCCEECCCCCCCCHHHHHHCCCCCCCCCCCCEEEEEECCCCEEEECCCCCHHHHHHHHHCCC T ss_conf 78999999999999999809976557551116676201697433332102899948998898446674999999995622 Q ss_pred HHHHHHHCCCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEEC-----CCCHHHHHHHHHHHHHH Q ss_conf 5644331057023311001103777665538000035654102-----68999999999999999 Q gi|254780419|r 200 QYKQLLMASGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEMS-----FVEQDDILNTMENVLRG 259 (622) Q Consensus 200 lyKQlLmv~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~Ems-----F~~~edvm~l~E~li~~ 259 (622) ...+ --|.|.+++|||-|..-+-|.= ||+|.|+|.- .+| -+|+.++-+++.. T Consensus 97 ~~~~------p~k~yy~g~vfRyErPQ~GR~R-qF~Q~g~E~iG~~~~~~D-AEvi~l~~~~l~~ 153 (429) T COG0124 97 DLPK------PLKLYYFGPVFRYERPQKGRYR-QFYQFGVEVIGSDSPDAD-AEVIALAVEILEA 153 (429) T ss_pred CCCC------CEEEEEECCEECCCCCCCCCCE-EEEECCEEEECCCCCCCC-HHHHHHHHHHHHH T ss_conf 0368------7259983356237888888750-368767677679986538-9999999999997 No 56 >cd00778 ProRS_core_arch_euk Prolyl-tRNA synthetase (ProRS) class II core catalytic domain. ProRS is a homodimer. It is responsible for the attachment of proline to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs in the core domain. This subfamily contains the core domain of ProRS from archaea, the cytoplasm of eukaryotes and some bacteria. Probab=97.95 E-value=4.5e-05 Score=55.78 Aligned_cols=121 Identities=17% Similarity=0.167 Sum_probs=75.5 Q ss_pred CCHHHHHHHHHHHHHHHHCCCEEEEECCCCCC-C----CC---CCCCC--CCCCCCC--CHHHHHHCC-CC----HHHHH Q ss_conf 21245899999998752036514620332112-4----64---56645--4322222--103443112-38----54564 Q gi|254780419|r 140 KNIVKRTRIINSMRCRMIAENFIECSTPILTA-S----SP---EGARD--FLVPSRV--NQGSFYALP-QA----PQQYK 202 (622) Q Consensus 140 ~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~k-s----Tp---eGArd--FlVPsR~--~~g~fYALp-QS----PQlyK 202 (622) .-.+++.++...+|+-+.+.|+-||.||+|.. + +. +|-++ |.|...- ....-|||. -+ .|+|| T Consensus 30 ~G~~i~~~l~~~~r~~~~~~G~~ev~~P~l~~~~~~~k~sgh~~~y~~emf~v~~~g~~~~d~~y~LkPmne~~~~~i~~ 109 (261) T cd00778 30 YGYAIWENIQKILDKEIKETGHENVYFPLLIPESELEKEKEHIEGFAPEVAWVTHGGLEELEEPLALRPTSETAIYPMFS 109 (261) T ss_pred CHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHCHHHCCCCCCEEEECCCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 28999999999999999986996998833278899997428143057653355216852013560347888699999997 Q ss_pred HHHHCCC--H-HHHEEEHHHHCCCCCCCCCCC----CCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 4331057--0-233110011037776655380----000356541026899999999999999999998 Q gi|254780419|r 203 QLLMASG--F-DRYFQIAPCFRDEDPRSDRLP----GEFYQLDMEMSFVEQDDILNTMENVLRGVFEEF 264 (622) Q Consensus 203 QlLmv~G--~-dryfqiarcFRdEd~R~dRqp----~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v 264 (622) .-+ -|- + =|+|||+.|||+|-+ + +++ -||||-|.-..|++.+++.+-++.++. ++..+ T Consensus 110 ~~~-~SYRdLPlrl~q~g~~~R~E~s-~-~~gL~RvReF~~dDahs~~~t~eq~~~e~~~~~~-~y~~i 174 (261) T cd00778 110 KWI-RSYRDLPLKINQWVNVFRWETK-T-TRPFLRTREFLWQEGHTAHATEEEAEEEVLQILD-LYKEF 174 (261) T ss_pred HHH-CCCCCCCHHHHHCCCEECCCCC-C-CCCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHH-HHHHH T ss_conf 350-5402278255530422437788-8-8765325425234233655899999999999999-99999 No 57 >PRK09537 pylS pyrolysyl-tRNA synthetase; Reviewed Probab=97.91 E-value=5.2e-06 Score=62.50 Aligned_cols=86 Identities=28% Similarity=0.507 Sum_probs=53.0 Q ss_pred HHHHHHHHHHHHCCCEEEEECCCCCCCCC-C--CC-C-------CCCCCCCCCHHHHHHCC--CCHHHHH---HHHHC-C Q ss_conf 99999998752036514620332112464-5--66-4-------54322222103443112--3854564---43310-5 Q gi|254780419|r 146 TRIINSMRCRMIAENFIECSTPILTASSP-E--GA-R-------DFLVPSRVNQGSFYALP--QAPQQYK---QLLMA-S 208 (622) Q Consensus 146 s~i~~~~R~~l~~~gF~EveTP~L~ksTp-e--GA-r-------dFlVPsR~~~g~fYALp--QSPQlyK---QlLmv-~ 208 (622) .++-+.|.+||.++||+||..|||...-- | |- . -| |+..+ ++|. -+|.||- .|.-+ - T Consensus 210 gkler~it~ffvdrgfleikspilip~eyiermgid~d~elskQiF----rvd~n--~CLRPMLAPnLYnylrkL~R~lp 283 (420) T PRK09537 210 GKLERDITKFFVDRGFLEIKSPILIPAEYIERMGIDNDTELSKQIF----RVDKN--FCLRPMLAPNLYNYLRKLDRALP 283 (420) T ss_pred HHHHHHHHHHHHCCCCEEECCCEECCHHHHHHHCCCCHHHHHHHHE----EECCC--CCCCCCCCHHHHHHHHHHHHHCC T ss_conf 6778888998541451351485042199999857886056554516----74345--33241014269999999986168 Q ss_pred CHHHHEEEHHHHCCCCCCCCCCCCCEEEEE Q ss_conf 702331100110377766553800003565 Q gi|254780419|r 209 GFDRYFQIAPCFRDEDPRSDRLPGEFYQLD 238 (622) Q Consensus 209 G~dryfqiarcFRdEd~R~dRqp~EFTqlD 238 (622) .==|.|+||+|||-|.--+ -|+.||||+. T Consensus 284 dPIrIFEIGpCyRKESdG~-~HLeEFTMlN 312 (420) T PRK09537 284 DPIKIFEIGPCYRKESDGK-EHLEEFTMVN 312 (420) T ss_pred CCEEEEEECCCCCCCCCCH-HHHHHHHHHH T ss_conf 8715886204303444106-7887777776 No 58 >PRK12420 histidyl-tRNA synthetase; Provisional Probab=97.82 E-value=6.9e-05 Score=54.45 Aligned_cols=98 Identities=20% Similarity=0.255 Sum_probs=57.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECCCC-----CCCCC-CCC---CC-CCCCCCCCHHHHHHCC--CCHHHHHHHHHCC Q ss_conf 12458999999987520365146203321-----12464-566---45-4322222103443112--3854564433105 Q gi|254780419|r 141 NIVKRTRIINSMRCRMIAENFIECSTPIL-----TASSP-EGA---RD-FLVPSRVNQGSFYALP--QAPQQYKQLLMAS 208 (622) Q Consensus 141 ~l~~Rs~i~~~~R~~l~~~gF~EveTP~L-----~ksTp-eGA---rd-FlVPsR~~~g~fYALp--QSPQlyKQlLmv~ 208 (622) -...|..+...+|+.+...||-||+||++ +..+- +|. .+ |-... ..|.-.||. .-+++-.-..+-. T Consensus 17 e~~~~~~i~~~~~~~~~~~Gy~~I~tP~~E~~elf~~~~g~~~~i~kemy~f~D--~~gr~l~LRPe~Ta~iaR~~a~~~ 94 (421) T PRK12420 17 EQVLRNKIKRACEDTFERYGCKPLETPTLNMYELMSYKYGGGDEILKEIYTLQD--QGKRDLALRYDLTIPFAKVVAMNP 94 (421) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEECCCCCHHHHHCCCCCCCCHHHHHEEEEEC--CCCCEEEECCCCCHHHHHHHHHCC T ss_conf 999999999999999998599472476501187642447884422411078881--899879856776589999999775 Q ss_pred CH---HHHEEEHHHHCCCCCCCCCCCCCEEEEEEEE Q ss_conf 70---2331100110377766553800003565410 Q gi|254780419|r 209 GF---DRYFQIAPCFRDEDPRSDRLPGEFYQLDMEM 241 (622) Q Consensus 209 G~---dryfqiarcFRdEd~R~dRqp~EFTqlD~Em 241 (622) .. =|.|.+|+|||.|.-..-|.= ||+|+++|. T Consensus 95 ~~~~P~k~~y~g~vfR~erpq~GR~R-eF~Q~GvEi 129 (421) T PRK12420 95 NIRLPFKRYEIGKVFRDGPIKQGRFR-EFIQCDVDI 129 (421) T ss_pred CCCCCEEEEEECCEEECCCCCCCCCC-EEEECCEEE T ss_conf 77787458887677853898788555-134357542 No 59 >PRK07892 consensus Probab=97.79 E-value=0.00034 Score=49.52 Aligned_cols=11 Identities=36% Similarity=0.751 Sum_probs=6.2 Q ss_pred CCCEEECCCHH Q ss_conf 08356057301 Q gi|254780419|r 587 LGLRIVENPKK 597 (622) Q Consensus 587 ~~i~~~~~~k~ 597 (622) +|+-+..-|=. T Consensus 982 LG~ylS~HPl~ 992 (1180) T PRK07892 982 LGLYVSDHPLD 992 (1180) T ss_pred HEECCCCCHHH T ss_conf 32533508299 No 60 >cd00770 SerRS_core Seryl-tRNA synthetase (SerRS) class II core catalytic domain. SerRS is responsible for the attachment of serine to the 3' OH group of ribose of the appropriate tRNA. This domain It is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs in the core domain. SerRS synthetase is a homodimer. Probab=97.77 E-value=6.9e-05 Score=54.42 Aligned_cols=124 Identities=17% Similarity=0.274 Sum_probs=75.3 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCC-CCCCC----------------CCCCCCCCCHHH-HHHCCCCHHHHH Q ss_conf 124589999999875203651462033211246-45664----------------543222221034-431123854564 Q gi|254780419|r 141 NIVKRTRIINSMRCRMIAENFIECSTPILTASS-PEGAR----------------DFLVPSRVNQGS-FYALPQAPQQYK 202 (622) Q Consensus 141 ~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksT-peGAr----------------dFlVPsR~~~g~-fYALpQSPQlyK 202 (622) ..+++..+++.+++.+.++||.||.||.+.++. -+|.. -||+||--.+-. +|+ ...=-|+ T Consensus 51 ga~Le~AL~~f~~d~~~~~Gy~eV~tP~iv~~~~~~~sG~~~~~~e~~y~i~~~~~~Li~tae~~l~~~~~--~~~~~~~ 128 (297) T cd00770 51 GALLERALINFALDFLTKRGFTPVIPPFLVRKEVMEGTGQLPKFDEQLYKVEGEDLYLIATAEVPLAALHR--DEILEEE 128 (297) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEECCCCCHHHHHHHHCCCCCCHHHHHCCCCHHHHHHHCCCCCEEEEEE--EEECCHH T ss_conf 99999999999999999879989988530138998543578763565210355055554405666223442--0031775 Q ss_pred HHHHCCCHHHHEEEHHHHCCCCCCC--C-----CCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 4331057023311001103777665--5-----38000035654102689999999999999999999808723457732 Q gi|254780419|r 203 QLLMASGFDRYFQIAPCFRDEDPRS--D-----RLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFL 275 (622) Q Consensus 203 QlLmv~G~dryfqiarcFRdEd~R~--d-----Rqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~ 275 (622) +|= =||+.+++|||.|-+-+ | |.- .|||+|+- .|.+.++..+..|+|+...- ++.. .+..||. T Consensus 129 dLP-----lr~~~~s~cfR~Eags~G~d~~GL~Rvh-qF~kvE~~-~~c~pe~s~~~~e~~~~~~e-~~l~--~L~lpyr 198 (297) T cd00770 129 ELP-----LKYAGYSPCFRKEAGSAGRDTRGLFRVH-QFEKVEQF-VFTKPEESWEELEELISNAE-EILQ--ELGLPYR 198 (297) T ss_pred HCC-----CCEEECCCHHHHCCCCCCCCCCCEEEEE-ECCEEEEE-EECCHHHHHHHHHHHHHHHH-HHHH--HHCCCEE T ss_conf 588-----6502246341320377775555457887-21100377-61687889999999999999-9999--8155014 Q ss_pred C Q ss_conf 1 Q gi|254780419|r 276 R 276 (622) Q Consensus 276 r 276 (622) . T Consensus 199 v 199 (297) T cd00770 199 V 199 (297) T ss_pred E T ss_conf 5 No 61 >TIGR02367 PylS pyrrolysyl-tRNA synthetase; InterPro: IPR012739 PylS is the archaeal enzyme responsible for charging the pyrrolysine tRNA, PylT, by ligating a free molecule of pyrrolysine. Pyrrolysine is encoded at an in-frame UAG (amber) at least in several corrinoid-dependent methyltransferases of the archaeal genera Methanosarcina and Methanococcoides, e.g. trimethylamine methyltransferase. . Probab=97.74 E-value=3.1e-05 Score=56.97 Aligned_cols=110 Identities=25% Similarity=0.420 Sum_probs=70.1 Q ss_pred HHHHHHHHHHHHHHCCCEEEEECCCCCCCCC-C--CC-CC-------CCCCCCCCHHHHHHCC--CCHHHHHHHH----H Q ss_conf 5899999998752036514620332112464-5--66-45-------4322222103443112--3854564433----1 Q gi|254780419|r 144 KRTRIINSMRCRMIAENFIECSTPILTASSP-E--GA-RD-------FLVPSRVNQGSFYALP--QAPQQYKQLL----M 206 (622) Q Consensus 144 ~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTp-e--GA-rd-------FlVPsR~~~g~fYALp--QSPQlyKQlL----m 206 (622) .-.++-+.+..||.++||+||..|||...-- | |- .| | |+.++ ++|. -+|.||.-|- + T Consensus 241 ylGkler~it~ffvdrGfleiksPiliP~ey~ermGi~ndtelskqif----rvdkn--~ClrPmlaP~lynylrkldr~ 314 (453) T TIGR02367 241 YLGKLEREITKFFVDRGFLEIKSPILIPLEYVERMGIDNDTELSKQIF----RVDKN--LCLRPMLAPTLYNYLRKLDRI 314 (453) T ss_pred HHHHHHHHHHHHHHHCCEEEECCCCCCCHHHHHHCCCCCCHHHHHHHH----HHHHH--HHCCCCHHHHHHHHHHHHHHH T ss_conf 754676667666531240232363101056886507764025666666----44100--101210005689999998753 Q ss_pred CCCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEE-E-CCCCHHHHHHHHHHHHHHH Q ss_conf 0570233110011037776655380000356541-0-2689999999999999999 Q gi|254780419|r 207 ASGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDME-M-SFVEQDDILNTMENVLRGV 260 (622) Q Consensus 207 v~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~E-m-sF~~~edvm~l~E~li~~i 260 (622) .-|==|+|+||||||-|.- ..-|+-|||++..= | |=.+.|.+-.++.+.+.++ T Consensus 315 lP~Pik~fe~GPCyrkesd-Gkehleeftm~nfCqmGsGCtrenle~~i~~fl~~l 369 (453) T TIGR02367 315 LPDPIKVFEVGPCYRKESD-GKEHLEEFTMLNFCQMGSGCTRENLEALIKEFLDYL 369 (453) T ss_pred CCCCEEEEEECCCCCCCCC-CHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH T ss_conf 7896479860675243565-335666666555431478732677999999988761 No 62 >cd00772 ProRS_core Prolyl-tRNA synthetase (ProRS) class II core catalytic domain. ProRS is a homodimer. It is responsible for the attachment of proline to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs in the core domain. Probab=97.70 E-value=0.00018 Score=51.48 Aligned_cols=111 Identities=19% Similarity=0.193 Sum_probs=73.0 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECCCCCCC------C--CC-CCCCCCCCCCCCHHH---HHHC------------CC Q ss_conf 12458999999987520365146203321124------6--45-664543222221034---4311------------23 Q gi|254780419|r 141 NIVKRTRIINSMRCRMIAENFIECSTPILTAS------S--PE-GARDFLVPSRVNQGS---FYAL------------PQ 196 (622) Q Consensus 141 ~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ks------T--pe-GArdFlVPsR~~~g~---fYAL------------pQ 196 (622) -++++.+|...+|+-|.+.|+-||-+|+|... - .+ |+.++.+.+.....+ =||| .+ T Consensus 31 G~~i~~ki~~~i~~e~~~~G~qev~~P~l~p~~lw~ksgr~~~~~~~e~~~~~~~~~~~~e~~~~L~PT~Ee~~~~~~~~ 110 (264) T cd00772 31 AKAILDKIENVLDKMFKEHGAQNALFPFFILASFLEKEAEHDEGFSKELAVFKDAGDEELEEDFALRPTLEENIGEIAAK 110 (264) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHCCCHHHCCCCCEEEEEECCCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 99999999999999999829949812567998999976984443785237997348742123520167786899999986 Q ss_pred CHHHHHHHHHCCCHHHHEEEHHHHCCCCCCCCCCC----CCEEEEEEEECCCCHHHHHHHHHHHHH Q ss_conf 85456443310570233110011037776655380----000356541026899999999999999 Q gi|254780419|r 197 APQQYKQLLMASGFDRYFQIAPCFRDEDPRSDRLP----GEFYQLDMEMSFVEQDDILNTMENVLR 258 (622) Q Consensus 197 SPQlyKQlLmv~G~dryfqiarcFRdEd~R~dRqp----~EFTqlD~EmsF~~~edvm~l~E~li~ 258 (622) -..-||||= =|+|||+.+||||-. .|+| -||+|-|.....+|.++..+-.+.++. T Consensus 111 ~i~SyrdLP-----l~lyQ~~~~fR~E~r--~r~GllR~REFlmkDahsf~~t~e~a~~~y~~~~~ 169 (264) T cd00772 111 FIKSWKDLP-----QHLNQIGNKFRDEIR--PRFGFLRAREFIMKDGHSAHADAEEADEEFLNMLS 169 (264) T ss_pred HCCCHHHCC-----HHHHHHHHHHHCCCC--CCCCCHHHEEEEEEEEEECCCCHHHHHHHHHHHHH T ss_conf 407665579-----778765578605788--55433232043411035325999999999999999 No 63 >PRK12292 hisZ ATP phosphoribosyltransferase regulatory subunit; Provisional Probab=97.70 E-value=9.7e-05 Score=53.39 Aligned_cols=99 Identities=22% Similarity=0.205 Sum_probs=60.0 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECCCCC-----CCCCCC-C--CCCCCCCCCCHHHHHHCC--CCHHHHH---H-HHH Q ss_conf 124589999999875203651462033211-----246456-6--454322222103443112--3854564---4-331 Q gi|254780419|r 141 NIVKRTRIINSMRCRMIAENFIECSTPILT-----ASSPEG-A--RDFLVPSRVNQGSFYALP--QAPQQYK---Q-LLM 206 (622) Q Consensus 141 ~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~-----ksTpeG-A--rdFlVPsR~~~g~fYALp--QSPQlyK---Q-lLm 206 (622) -.+.|..+...+++.+...||-||+||++- ....+. . .-|.+--+ ..|.-.||. -.||+-. + +.- T Consensus 16 e~~~~~~i~~~~~~~f~~~Gy~~I~tP~lE~~e~f~~~~g~~~~~~~~~f~D~-~~Gr~l~LRpD~T~~iaR~~~~~~~~ 94 (388) T PRK12292 16 EARKIEEIRRRLLDVFRLWGYEEVITPTLEYLDTLLTGGGADLDLRTFKLVDQ-LSGRTLGLRPDMTPQIARIAATRLAN 94 (388) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHCCCCCCCCCHHEEEEEEC-CCCCEEEECCCCCHHHHHHHHHHHHC T ss_conf 99999999999999999879967357660629885445775021104899826-99978985687719999999973104 Q ss_pred CCCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEE Q ss_conf 05702331100110377766553800003565410 Q gi|254780419|r 207 ASGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEM 241 (622) Q Consensus 207 v~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~Em 241 (622) ..+--|+|.+|+|||.|..-.-|.= ||+|+++|. T Consensus 95 ~~~p~r~~Y~g~vfR~e~pq~Gr~R-Ef~Q~G~Ei 128 (388) T PRK12292 95 RPGPLRLCYAGNVFRAQPRGLGRSR-EFLQSGVEL 128 (388) T ss_pred CCCCEEEEEEEEEEEECCCCCCCCC-EEEECCEEE T ss_conf 6787689985226870589899610-234257553 No 64 >PRK06658 consensus Probab=97.62 E-value=0.0021 Score=43.80 Aligned_cols=20 Identities=15% Similarity=0.521 Sum_probs=9.4 Q ss_pred EEEECCCCHHHHHHHHHHHH Q ss_conf 54102689999999999999 Q gi|254780419|r 238 DMEMSFVEQDDILNTMENVL 257 (622) Q Consensus 238 D~EmsF~~~edvm~l~E~li 257 (622) |+-.+.-..++|++.+.+-. T Consensus 403 DiDF~~~rR~eVi~Yv~~kY 422 (1145) T PRK06658 403 DIDFCFERRQEVIEYVRNKY 422 (1145) T ss_pred CCCCCCCCHHHHHHHHHHHH T ss_conf 65454004899999999885 No 65 >PRK06826 dnaE DNA polymerase III DnaE; Reviewed Probab=97.56 E-value=0.0012 Score=45.69 Aligned_cols=11 Identities=27% Similarity=0.377 Sum_probs=6.5 Q ss_pred CCCCHHHHHHH Q ss_conf 21218988751 Q gi|254780419|r 275 LRICYDDAIRR 285 (622) Q Consensus 275 ~rmtY~eAm~~ 285 (622) +-+|..+|++. T Consensus 465 ~~~~l~~a~~~ 475 (1157) T PRK06826 465 LGITIEKALEL 475 (1157) T ss_pred CCCCHHHHHHC T ss_conf 79988999751 No 66 >cd00773 HisRS-like_core Class II Histidinyl-tRNA synthetase (HisRS)-like catalytic core domain. HisRS is a homodimer. It is responsible for the attachment of histidine to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs. This domain is also found at the C-terminus of eukaryotic GCN2 protein kinase and at the N-terminus of the ATP phosphoribosyltransferase accessory subunit, HisZ. HisZ along with HisG catalyze the first reaction in histidine biosynthesis. HisZ is found only in a subset of bacteria and differs from HisRS in lacking a C-terminal anti-codon binding domain. Probab=97.54 E-value=0.00032 Score=49.72 Aligned_cols=113 Identities=22% Similarity=0.338 Sum_probs=64.9 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCC-------CCCCCCC-CCCCCCCCCHHHHHHCC--CCHHHHHHHHHCC--C Q ss_conf 245899999998752036514620332112-------4645664-54322222103443112--3854564433105--7 Q gi|254780419|r 142 IVKRTRIINSMRCRMIAENFIECSTPILTA-------SSPEGAR-DFLVPSRVNQGSFYALP--QAPQQYKQLLMAS--G 209 (622) Q Consensus 142 l~~Rs~i~~~~R~~l~~~gF~EveTP~L~k-------sTpeGAr-dFlVPsR~~~g~fYALp--QSPQlyKQlLmv~--G 209 (622) .++|-.+...+++.|...||-||+||++-. +..+..+ -|-+.. ..|+--||. -.||.-. ++.-. . T Consensus 2 ~~~r~~i~~~l~~~f~~~Gy~~I~~P~lE~~d~~~~~~~~~~~~~~~~f~D--~~g~~l~LRpD~T~~iaR-~~~~~~~~ 78 (261) T cd00773 2 AALRRYIEDTLREVFERYGYEEIDTPVFEYTELFLRKSGDEVSKEMYRFKD--KGGRDLALRPDLTAPVAR-AVAENLLS 78 (261) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHCCCCCCCCHHCEEEEEC--CCCCEEEECCCCCHHHHH-HHHHHHCC T ss_conf 689999999999999986997866776452767524465310211699997--999989767866279999-99974153 Q ss_pred ---HHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEE----CCCCHHHHHHHHHHHHH Q ss_conf ---02331100110377766553800003565410----26899999999999999 Q gi|254780419|r 210 ---FDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEM----SFVEQDDILNTMENVLR 258 (622) Q Consensus 210 ---~dryfqiarcFRdEd~R~dRqp~EFTqlD~Em----sF~~~edvm~l~E~li~ 258 (622) --|+|.+++|||.+..-..|. -||+|+++|. +....-+++.++-+.+. T Consensus 79 ~~~p~r~~Y~g~VfR~~~~~~g~~-rE~~Q~G~EiiG~~~~~ad~Eii~l~~~~l~ 133 (261) T cd00773 79 LPLPLKLYYIGPVFRYERPQKGRY-REFYQVGVEIIGSDSPLADAEVIALAVEILE 133 (261) T ss_pred CCCCEEEEEECCEEEECCCCCCCC-CCCCCEEEEEECCCCHHHHHHHHHHHHHHHH T ss_conf 788858999731899768999984-4322146999728975889999999999999 No 67 >PRK09074 consensus Probab=97.45 E-value=0.0016 Score=44.68 Aligned_cols=11 Identities=45% Similarity=0.851 Sum_probs=7.0 Q ss_pred CCHHHHEEEHH Q ss_conf 57023311001 Q gi|254780419|r 208 SGFDRYFQIAP 218 (622) Q Consensus 208 ~G~dryfqiar 218 (622) -||.-||=|.. T Consensus 344 mgf~~YFLIV~ 354 (1149) T PRK09074 344 MGFPGYFLIVA 354 (1149) T ss_pred CCCCCEEEHHH T ss_conf 79976760999 No 68 >COG0442 ProS Prolyl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=97.45 E-value=0.00038 Score=49.16 Aligned_cols=115 Identities=24% Similarity=0.328 Sum_probs=79.0 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECCCCCCC-------------------CCCCCCCCCC-CCCCCHHHHHHCC-CCHH Q ss_conf 12458999999987520365146203321124-------------------6456645432-2222103443112-3854 Q gi|254780419|r 141 NIVKRTRIINSMRCRMIAENFIECSTPILTAS-------------------SPEGARDFLV-PSRVNQGSFYALP-QAPQ 199 (622) Q Consensus 141 ~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ks-------------------TpeGArdFlV-PsR~~~g~fYALp-QSPQ 199 (622) -++++.+|...+|+.|++-|..||--|+|..+ |-.|.++|.. || +.-.+|.+- +-=. T Consensus 46 g~rv~~kI~~iir~em~~~G~~Evl~P~L~p~eLwkEs~r~~~f~~El~~v~drg~~~l~L~PT--sEe~it~~~~~~i~ 123 (500) T COG0442 46 GLRVLEKIENIIREEMDKIGAQEVLFPTLIPAELWKESGRWEGFGPELFRVKDRGDRPLALRPT--SEEVITDMFRKWIR 123 (500) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEECHHCCHHHHHHHHCHHHCCCHHHEEEECCCCCEEEECCC--CHHHHHHHHHHHHH T ss_conf 8999999999999999860850883110587899987485630364448997169963435788--68999999999865 Q ss_pred HHHHHHHCCCHHHHEEEHHHHCCCCCCCC----CCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 56443310570233110011037776655----3800003565410268999999999999999999980 Q gi|254780419|r 200 QYKQLLMASGFDRYFQIAPCFRDEDPRSD----RLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFS 265 (622) Q Consensus 200 lyKQlLmv~G~dryfqiarcFRdEd~R~d----Rqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~ 265 (622) =||||=+ ++|||...||||- |.. |- -||++=|..-..+|+++...-.+.|+. ++.++. T Consensus 124 SYkdLPl-----~lYQi~~kfRdE~-rpr~gllR~-REF~mkdaySfh~~~e~a~~~y~~~~~-~Y~~if 185 (500) T COG0442 124 SYKDLPL-----KLYQIQSKFRDEK-RPRFGLLRG-REFLMKDAYSFHADEEDAEETYEKMLD-AYSRIF 185 (500) T ss_pred HHHHCCC-----CEEEEEEEEECCC-CCCCCCCCH-HEEEECCCCCCCCCHHHHHHHHHHHHH-HHHHHH T ss_conf 5651881-----2101345884334-688773000-203301112102898999999999999-999999 No 69 >PRK05673 dnaE DNA polymerase III subunit alpha; Validated Probab=97.37 E-value=0.0017 Score=44.52 Aligned_cols=10 Identities=40% Similarity=0.750 Sum_probs=5.7 Q ss_pred HCCCEEECCC Q ss_conf 7083560573 Q gi|254780419|r 586 ELGLRIVENP 595 (622) Q Consensus 586 e~~i~~~~~~ 595 (622) -+|+-+..-| T Consensus 944 ~lG~ylS~HP 953 (1143) T PRK05673 944 TLGLYLSGHP 953 (1143) T ss_pred HHCCCCCCCH T ss_conf 8470766381 No 70 >PRK07012 consensus Probab=97.35 E-value=0.001 Score=46.01 Aligned_cols=15 Identities=13% Similarity=0.219 Sum_probs=7.5 Q ss_pred HCCCEEECCCHHHHH Q ss_conf 708356057301000 Q gi|254780419|r 586 ELGLRIVENPKKILK 600 (622) Q Consensus 586 e~~i~~~~~~k~~~~ 600 (622) -+|+-+..-|=..++ T Consensus 952 ~lG~yls~HPl~~~~ 966 (1173) T PRK07012 952 ALGFYLSGHLFDAYR 966 (1173) T ss_pred HHCCCCCCCCHHHHH T ss_conf 864250517089999 No 71 >PRK07374 dnaE DNA polymerase III subunit alpha; Validated Probab=97.32 E-value=0.0016 Score=44.68 Aligned_cols=16 Identities=31% Similarity=0.522 Sum_probs=8.6 Q ss_pred HHCCCEEECCCHHHHH Q ss_conf 8708356057301000 Q gi|254780419|r 585 RELGLRIVENPKKILK 600 (622) Q Consensus 585 ~e~~i~~~~~~k~~~~ 600 (622) +-+|+-+..-|=..++ T Consensus 966 e~lG~yls~HPl~~~~ 981 (1171) T PRK07374 966 ELLGFYLSDHPLKQLT 981 (1171) T ss_pred HHHEECCCCCHHHHHH T ss_conf 8863421417099999 No 72 >PRK06637 consensus Probab=97.31 E-value=0.0018 Score=44.29 Aligned_cols=15 Identities=27% Similarity=0.439 Sum_probs=8.7 Q ss_pred HCCCEEECCCHHHHH Q ss_conf 708356057301000 Q gi|254780419|r 586 ELGLRIVENPKKILK 600 (622) Q Consensus 586 e~~i~~~~~~k~~~~ 600 (622) -+|+-+..-|=.-+| T Consensus 987 ~lG~yls~HPl~~~~ 1001 (1182) T PRK06637 987 AMGLFISNHPLTEYQ 1001 (1182) T ss_pred HEEEEECCCCHHHHH T ss_conf 637800639638899 No 73 >PRK12305 thrS threonyl-tRNA synthetase; Reviewed Probab=97.27 E-value=0.0013 Score=45.23 Aligned_cols=111 Identities=17% Similarity=0.212 Sum_probs=70.7 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCC---CCCCC-----CCCCCCCCHHHHHHC-----CCCHHHH------- Q ss_conf 245899999998752036514620332112464---56645-----432222210344311-----2385456------- Q gi|254780419|r 142 IVKRTRIINSMRCRMIAENFIECSTPILTASSP---EGARD-----FLVPSRVNQGSFYAL-----PQAPQQY------- 201 (622) Q Consensus 142 l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTp---eGArd-----FlVPsR~~~g~fYAL-----pQSPQly------- 201 (622) .++|..+...+|+-....||.||.||.+.++.- .|--+ ...|.. ..+.-|+| |..-++| T Consensus 208 ~~i~~~le~~~r~~~~~~Gy~eV~TP~i~~~~Lw~~SGH~~~Y~e~Mf~~~~-~~~e~~~lKPMNCP~H~~if~~~~rSY 286 (576) T PRK12305 208 MIIKNAIRKFILELDRKYGFDEVLTPHFGEKKLYKTSGHLDHYKDDMFKPLK-IENEELIPRPMTCPHHIILYSNELRSY 286 (576) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHCCCHHHHHHHCCCCEE-CCCCEEEECCCCCHHHHHHHHCCCCCH T ss_conf 8999999999999999869989846421428899766646556642573063-156416535778632677764277544 Q ss_pred HHHHHCCCHHHHEEEHHHHCCCCCCCC----CCCCCEEEEEEEECCCCHHHHHHHHHHHHHHH Q ss_conf 443310570233110011037776655----38000035654102689999999999999999 Q gi|254780419|r 202 KQLLMASGFDRYFQIAPCFRDEDPRSD----RLPGEFYQLDMEMSFVEQDDILNTMENVLRGV 260 (622) Q Consensus 202 KQlLmv~G~dryfqiarcFRdEd~R~d----Rqp~EFTqlD~EmsF~~~edvm~l~E~li~~i 260 (622) |.|=+ |++++|.|||+|-|-+= |-- .|||=|.- .|++.+++..-+.+++.-+ T Consensus 287 RdLPl-----R~aEfg~~~R~E~SG~L~GL~RvR-~FtqDDAH-IFct~dQi~~Ei~~~l~~i 342 (576) T PRK12305 287 RDLPI-----RLSEQSRLYRYEKSGALTGLERVR-AMDLTEGH-IFVRKDQIESEVKHLFKMI 342 (576) T ss_pred HHCCH-----HHHCCCCEECCCCCCCCCCCCCCC-CCCCCCCC-EEECHHHHHHHHHHHHHHH T ss_conf 43242-----200044354478886555755466-88768712-1208999999999999999 No 74 >PRK12293 hisZ ATP phosphoribosyltransferase regulatory subunit; Provisional Probab=97.26 E-value=0.00078 Score=46.89 Aligned_cols=110 Identities=19% Similarity=0.277 Sum_probs=60.6 Q ss_pred CCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCC---CCC-CCCCCCCCHHHHHHCCC--CHHHHH----HHHHCCC Q ss_conf 2124589999999875203651462033211246456---645-43222221034431123--854564----4331057 Q gi|254780419|r 140 KNIVKRTRIINSMRCRMIAENFIECSTPILTASSPEG---ARD-FLVPSRVNQGSFYALPQ--APQQYK----QLLMASG 209 (622) Q Consensus 140 ~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTpeG---Ard-FlVPsR~~~g~fYALpQ--SPQlyK----QlLmv~G 209 (622) ...+++..+...+|+.+...||-||+||++--...-| ..+ |-.. -+.|.--||.= -+|.-. .+.-... T Consensus 17 ~~~~~~~~Ie~~~~~~f~~~Gy~eI~TP~~E~~e~~~~~~~kemy~f~--D~~gr~l~LRPd~Ta~iaR~v~~~l~~~~~ 94 (281) T PRK12293 17 KSAKLKRDIENVASEILYKEGFEEIVTPSFSYHQHLSVADEKELLRFS--DEKNHEISLRADSTVDVVRIITKRLGRSTE 94 (281) T ss_pred HHHHHHHHHHHHHHHHHHHCCCEEEECCCEEEEECCCCCCCCEEEEEE--CCCCCEEEECCCCCHHHHHHHHHCCCCCCC T ss_conf 999999999999999999859948537856864434665630179998--799987997788878999999965544689 Q ss_pred HHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEE-CCCCHHHHHHHHHHHHH Q ss_conf 02331100110377766553800003565410-26899999999999999 Q gi|254780419|r 210 FDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEM-SFVEQDDILNTMENVLR 258 (622) Q Consensus 210 ~dryfqiarcFRdEd~R~dRqp~EFTqlD~Em-sF~~~edvm~l~E~li~ 258 (622) --|+|-+|+|||-|. .||+|+.+|. .-.+..+|+.++-++++ T Consensus 95 p~rl~Y~g~vFRye~-------rEf~Q~GvEliG~~s~~Evi~la~~~l~ 137 (281) T PRK12293 95 HKKWFYIQPVFRYPT-------TEIYQIGAEIIGESDLSKVLNIAAEIFN 137 (281) T ss_pred CEEEEEECCEEECCC-------CCCEEECEEEECCCCHHHHHHHHHHHHH T ss_conf 757998756353467-------7746847488779999999999999999 No 75 >cd00774 GlyRS-like_core Glycyl-tRNA synthetase (GlyRS)-like class II core catalytic domain. GlyRS functions as a homodimer in eukaryotes, archaea and some bacteria and as a heterotetramer in the remainder of prokaryotes. It is responsible for the attachment of glycine to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for ATP binding and hydrolysis. This alignment contains only sequences from the GlyRS form which homodimerizes. The heterotetramer glyQ is in a different family of class II aaRS. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs. This domain is also found at the N-terminus of the accessory subunit of mitochondrial polymerase gamma (Pol gamma b). Pol gamma b stimulates processive DNA synthesis and is functional as a homodimer, which can associate with the catalytic subunit Pol gamma alpha to form a heterotrimer. Despite significant both structural and sequence similarity with Gly Probab=97.24 E-value=0.00061 Score=47.66 Aligned_cols=95 Identities=32% Similarity=0.534 Sum_probs=57.3 Q ss_pred HHHHHHHHHHHHHHHHC--CCEEEEECCCCCC-------CCC---CCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCC Q ss_conf 24589999999875203--6514620332112-------464---56645432222210344311238545644331057 Q gi|254780419|r 142 IVKRTRIINSMRCRMIA--ENFIECSTPILTA-------SSP---EGARDFLVPSRVNQGSFYALPQAPQQYKQLLMASG 209 (622) Q Consensus 142 l~~Rs~i~~~~R~~l~~--~gF~EveTP~L~k-------sTp---eGArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G 209 (622) ..++.++..+-|++|.. .+++||+||+++. .-| +|..-||=|. +-.|.|- .||.++=..+ T Consensus 32 ~~LK~Ni~~~W~~~fv~~~~~~~~id~~ii~p~LmF~T~iG~~~~~~~~~yLRPE-TAQGifv-------nFk~~~~~~r 103 (254) T cd00774 32 VELKNNIKSAWRKSFVLEEEDMLEIDSPIITPELMFKTSIGPVESGGNLGYLRPE-TAQGIFV-------NFKNLLEFNR 103 (254) T ss_pred HHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCEECCCCCCCCCCCCEEECHH-HHHHHHH-------HHHHHHHHCC T ss_conf 9999999999999987437987995410036632200121565589860353634-5347798-------7899999648 Q ss_pred HH---HHEEEHHHHCCCCC-CCC--CCCCCEEEEEEEECCCCH Q ss_conf 02---33110011037776-655--380000356541026899 Q gi|254780419|r 210 FD---RYFQIAPCFRDEDP-RSD--RLPGEFYQLDMEMSFVEQ 246 (622) Q Consensus 210 ~d---ryfqiarcFRdEd~-R~d--Rqp~EFTqlD~EmsF~~~ 246 (622) -. -+-|||+|||||=+ |.- |.= ||||.+||. |++. T Consensus 104 ~kLPFGiaQIGk~FRNEIsPr~~l~R~R-EF~q~EiE~-F~~P 144 (254) T cd00774 104 RKLPFGVAQIGKSFRNEISPRNGLFRVR-EFTQAEIEF-FVDP 144 (254) T ss_pred CCCCEEEEEEEECCCCCCCCCCCCCCHH-CHHHHEEEE-EECC T ss_conf 8888445654202267347776754000-002125788-8579 No 76 >PRK12421 ATP phosphoribosyltransferase regulatory subunit; Provisional Probab=97.22 E-value=0.0009 Score=46.47 Aligned_cols=99 Identities=15% Similarity=0.195 Sum_probs=62.0 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECCCCC-------CCCCCCCC-CCCCCCCCCHHHHHHCC--CCHHHHH---HHHHC Q ss_conf 124589999999875203651462033211-------24645664-54322222103443112--3854564---43310 Q gi|254780419|r 141 NIVKRTRIINSMRCRMIAENFIECSTPILT-------ASSPEGAR-DFLVPSRVNQGSFYALP--QAPQQYK---QLLMA 207 (622) Q Consensus 141 ~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~-------ksTpeGAr-dFlVPsR~~~g~fYALp--QSPQlyK---QlLmv 207 (622) -.+.+..+...+.+.|...||-+|+||++- ++..+-.. -|-+--+ ..|.--+|. ..||.-. +++-. T Consensus 20 ea~~~~~l~~~l~~~f~~~Gy~~V~tP~lE~~e~l~~~~g~~~~~~~f~~~D~-~sGr~l~LRpD~T~~vARi~a~~~~~ 98 (391) T PRK12421 20 EAQKIERLRRRLLDLFASRGYQLVMPPFIEYLESLLTGAGQDLDLQTFKLIDQ-LSGRLMGVRADITPQVARIDAHLLNR 98 (391) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHCCCCCCCCCEEEEEEEEC-CCCCEEEECCCCCHHHHHHHHHHCCC T ss_conf 99999999999999999869976216760728785656687341479999617-99978655653206889999874046 Q ss_pred CCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEE Q ss_conf 5702331100110377766553800003565410 Q gi|254780419|r 208 SGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEM 241 (622) Q Consensus 208 ~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~Em 241 (622) .+.-|+|.++++||.|.....|.= ||+|+++|. T Consensus 99 ~~p~Rl~Y~G~V~R~~~~~~gr~R-ef~Q~G~El 131 (391) T PRK12421 99 EGVARYCYAGSVLHTLPQGLNASR-APLQLGAEL 131 (391) T ss_pred CCCEEEEEECEEEECCCCCCCCCC-CCEECCEEE T ss_conf 886579996517751688888766-521505898 No 77 >PRK06920 dnaE DNA polymerase III DnaE; Reviewed Probab=97.20 E-value=0.0052 Score=41.01 Aligned_cols=22 Identities=27% Similarity=0.437 Sum_probs=11.0 Q ss_pred CCHHHHHHHH-----H--CCCHHHHEEEH Q ss_conf 3854564433-----1--05702331100 Q gi|254780419|r 196 QAPQQYKQLL-----M--ASGFDRYFQIA 217 (622) Q Consensus 196 QSPQlyKQlL-----m--v~G~dryfqia 217 (622) ..|+.|++.| + --||.-||=|. T Consensus 298 ~~~~~y~~RL~~EL~vI~~mgf~~YFLIV 326 (1107) T PRK06920 298 TPKEVHINRLNHELNVISRMGFSDYFLIV 326 (1107) T ss_pred CCCHHHHHHHHHHHHHHHHCCCCCEEEHH T ss_conf 99599999999999999864888787429 No 78 >PRK08744 consensus Probab=97.17 E-value=0.013 Score=38.14 Aligned_cols=11 Identities=27% Similarity=0.634 Sum_probs=6.4 Q ss_pred HHCCCEEECCC Q ss_conf 87083560573 Q gi|254780419|r 585 RELGLRIVENP 595 (622) Q Consensus 585 ~e~~i~~~~~~ 595 (622) +-+|+-+..-| T Consensus 970 e~LG~ylSgHP 980 (1195) T PRK08744 970 ETLGFYLSGHP 980 (1195) T ss_pred HHHEEEECCCC T ss_conf 88136665184 No 79 >COG2511 GatE Archaeal Glu-tRNAGln amidotransferase subunit E (contains GAD domain) [Translation, ribosomal structure and biogenesis] Probab=97.15 E-value=0.0047 Score=41.34 Aligned_cols=116 Identities=21% Similarity=0.288 Sum_probs=85.0 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCCC--------CCCCHHHHHHHHHHHHHCCCCCEEEEECCCC Q ss_conf 64012443101787524566687403784799996278766--------5210356779998764202462355204777 Q gi|254780419|r 296 PIIMHNVTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGGG--------NRAFCDRMNEWARSQGQPGLAYIFWRLDESG 367 (622) Q Consensus 296 ~lei~dit~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~--------srk~~D~l~e~ak~~g~kgL~~i~~~~~~~~ 367 (622) .-++.|+|+.|.++.-+++++.+.. +++|.++..++... .|.+=..+.++|+.+|.+||.. .|+.. T Consensus 279 ~~~~~DVTdlF~~TkSkiI~~~l~~--gG~V~A~~l~~f~GlvG~EiqpGrRlGtE~AD~ar~~GvgGlfH----sDELP 352 (631) T COG2511 279 SEEVFDVTDLFANTKSKIIRSGLKK--GGKVYAVLLPGFAGLVGREIQPGRRLGTEFADYARAYGVGGLFH----SDELP 352 (631) T ss_pred CCCCCCHHHHHCCCCHHHHHHHHCC--CCEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCC----CCCCC T ss_conf 1444528988655731888875424--87699997477565223002577410357899999827775100----56675 Q ss_pred CCCCCCCHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHH Q ss_conf 5333570255325278999999818875572699808854110013478899987730 Q gi|254780419|r 368 NVAGMGPVAKHLNIVNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELG 425 (622) Q Consensus 368 ~~~~~spi~K~l~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~ 425 (622) ... +++++.+++..++++.++|-+.++|+.+..+..++-.+-..+..-+. T Consensus 353 ~YG--------ITeeEV~~v~~~lga~~~Davvlva~~~~~~~~a~~~V~~Ra~~ai~ 402 (631) T COG2511 353 GYG--------ITEEEVEAVRDALGAGEEDAVVLVAGEEERAKRALEAVIERAKEAIE 402 (631) T ss_pred CCC--------CCHHHHHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHC T ss_conf 678--------79999999999847997774899707788999999999999999871 No 80 >PRK05431 seryl-tRNA synthetase; Provisional Probab=97.07 E-value=0.0041 Score=41.72 Aligned_cols=123 Identities=19% Similarity=0.325 Sum_probs=72.3 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCC-CCCC----------------CCCCCCCCCHHH-HHHCCCCHHHHHH Q ss_conf 245899999998752036514620332112464-5664----------------543222221034-4311238545644 Q gi|254780419|r 142 IVKRTRIINSMRCRMIAENFIECSTPILTASSP-EGAR----------------DFLVPSRVNQGS-FYALPQAPQQYKQ 203 (622) Q Consensus 142 l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTp-eGAr----------------dFlVPsR~~~g~-fYALpQSPQlyKQ 203 (622) -++--.+++.+-+++.++||.||.||.|.++.. +|.. -||+||--.|=. +|+ .+ T Consensus 170 A~LerALi~f~ld~~~~~Gy~~v~~P~lv~~~~~~gtGqlPkfe~~~Y~i~~~dl~Li~TAEvpL~~~~~--------~e 241 (422) T PRK05431 170 ARLERALIQFMLDLHTEEGYTEVIPPYLVNEESMYGTGQLPKFEEDLFKIEDDDLYLIPTAEVPLTNLHR--------DE 241 (422) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEECHHHHHHHHHHHHCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHH--------CC T ss_conf 9999999999999999879979967376367887445379744877541058871760565321233440--------11 Q ss_pred HHHCCCH-HHHEEEHHHHCCCCCCC--C-----CCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 3310570-23311001103777665--5-----38000035654102689999999999999999999808723457732 Q gi|254780419|r 204 LLMASGF-DRYFQIAPCFRDEDPRS--D-----RLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFL 275 (622) Q Consensus 204 lLmv~G~-dryfqiarcFRdEd~R~--d-----Rqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~ 275 (622) ++-.+-+ =||.-+.+|||-|-..+ | |+- +|+-+++ .+|++.++-....|+|+...- ++.. .+..||. T Consensus 242 il~~~~LPik~~~~S~CFR~EAGs~GkdtrGl~RvH-QF~KVE~-f~~~~pe~S~~~~e~~~~~~e-~i~~--~L~lpyr 316 (422) T PRK05431 242 ILDEEELPIKYTAYSPCFRSEAGSAGRDTRGLIRQH-QFDKVEL-VKFTKPEDSYAELEELTGNAE-EILQ--KLELPYR 316 (422) T ss_pred CCCHHHCCHHHEEECCHHHHCCCCCCCCCCCCEEEE-EECEEEE-EEECCHHHHHHHHHHHHHHHH-HHHH--HCCCCCC T ss_conf 336543984452788365303133675567612443-0000343-353586779999999999999-9998--7598502 Q ss_pred CC Q ss_conf 12 Q gi|254780419|r 276 RI 277 (622) Q Consensus 276 rm 277 (622) .+ T Consensus 317 vv 318 (422) T PRK05431 317 VV 318 (422) T ss_pred CC T ss_conf 35 No 81 >PRK06326 consensus Probab=96.94 E-value=0.0043 Score=41.57 Aligned_cols=14 Identities=36% Similarity=0.717 Sum_probs=8.0 Q ss_pred CCCEEECCCHHHHH Q ss_conf 08356057301000 Q gi|254780419|r 587 LGLRIVENPKKILK 600 (622) Q Consensus 587 ~~i~~~~~~k~~~~ 600 (622) +|+-+..-|=..+| T Consensus 996 lG~yls~HPl~~~~ 1009 (1240) T PRK06326 996 LGIYLTEHPMDAVK 1009 (1240) T ss_pred HEEEECCCCHHHHH T ss_conf 47315546289999 No 82 >COG1107 Archaea-specific RecJ-like exonuclease, contains DnaJ-type Zn finger domain [DNA replication, recombination, and repair] Probab=96.93 E-value=0.0017 Score=44.46 Aligned_cols=86 Identities=20% Similarity=0.316 Sum_probs=64.9 Q ss_pred CHHHCCHHHCCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCC Q ss_conf 04236979989889999798004446987999997798259999748815789985289885899999999758655588 Q gi|254780419|r 8 SCGDLGVSDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINA 87 (622) Q Consensus 8 ~cg~l~~~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~ 87 (622) ...+|.. ++|++|+|.|-|..++..++-.-+.|+|.+|.+++-.-.+... -.--.+..+++|.|.|.|..|. T Consensus 204 ~i~did~-~ig~tV~I~GeV~qikqT~GPTVFtltDetg~i~aAAFe~aGv-RAyP~ievGdiV~ViG~V~~r~------ 275 (715) T COG1107 204 LIDDLDE-MIGKTVRIEGEVTQIKQTSGPTVFTLTDETGAIWAAAFEEAGV-RAYPEIEVGDIVEVIGEVTRRD------ 275 (715) T ss_pred CHHHHHH-HCCCEEEEEEEEEEEEECCCCEEEEEECCCCCEEHHHHCCCCC-CCCCCCCCCCEEEEEEEEEECC------ T ss_conf 0777775-3486699977899999758987999806888442666435774-3489987785699987775047------ Q ss_pred CCCCCEEEEEEEEEEEEE Q ss_conf 777865999988999950 Q gi|254780419|r 88 NIITGQIELSAQKIEILS 105 (622) Q Consensus 88 ~~~tG~~Ei~~~~~~il~ 105 (622) |.+.|.+.+++.|. T Consensus 276 ----g~lQiE~~~me~L~ 289 (715) T COG1107 276 ----GRLQIEIEAMEKLT 289 (715) T ss_pred ----CCEEEEEHHHHHHH T ss_conf ----82787624357751 No 83 >PRK09194 prolyl-tRNA synthetase; Provisional Probab=96.86 E-value=0.0028 Score=42.98 Aligned_cols=114 Identities=30% Similarity=0.408 Sum_probs=71.9 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCC--C-C----------------CCCC-CCCCCCCHHHHHHCC-CCHH Q ss_conf 1245899999998752036514620332112464--5-6----------------6454-322222103443112-3854 Q gi|254780419|r 141 NIVKRTRIINSMRCRMIAENFIECSTPILTASSP--E-G----------------ARDF-LVPSRVNQGSFYALP-QAPQ 199 (622) Q Consensus 141 ~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTp--e-G----------------ArdF-lVPsR~~~g~fYALp-QSPQ 199 (622) -+|+..++...+|+-|++-|..||..|+|..+.. + | -++| |-|| |.-.+-.|. +.-. T Consensus 46 g~rvl~Kie~IIr~em~~iGa~Ev~mp~l~p~elW~~sgR~~~~g~el~r~kDR~~~~~~L~PT--hEE~it~lv~~~i~ 123 (570) T PRK09194 46 GLRVLRKIENIVREEMNKIGAQEVLMPALQPAELWQESGRWEKYGPELLRLKDRHGRDFVLGPT--HEEVITDLVRNEIK 123 (570) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHCCCHHHCCHHHEEEECCCCCEEEECCC--CHHHHHHHHHHHHC T ss_conf 6999999999999999864987985356797789876188212361346985278985653787--38999999998511 Q ss_pred HHHHHHHCCCHHHHEEEHHHHCCCCCCCC----CCCCCEEEEEEEECCC-CHHHHHHHHHHH---HHHHHHHH Q ss_conf 56443310570233110011037776655----3800003565410268-999999999999---99999998 Q gi|254780419|r 200 QYKQLLMASGFDRYFQIAPCFRDEDPRSD----RLPGEFYQLDMEMSFV-EQDDILNTMENV---LRGVFEEF 264 (622) Q Consensus 200 lyKQlLmv~G~dryfqiarcFRdEd~R~d----Rqp~EFTqlD~EmsF~-~~edvm~l~E~l---i~~i~~~v 264 (622) =||||=. ++|||..-||||- |.. |-= ||+|-|+. ||- |.++....-+.| ...+|+.. T Consensus 124 SYkqLP~-----~lYQIqtKfRDE~-RPR~GllR~R-EF~MKDaY-SFd~~~e~l~~tY~~~~~AY~~IF~rl 188 (570) T PRK09194 124 SYKQLPL-----NLYQIQTKFRDEI-RPRFGLMRGR-EFIMKDAY-SFHADEESLDETYDAMYQAYSRIFTRL 188 (570) T ss_pred CHHHCCH-----HEECCCCEECCCC-CCCCCCHHHH-HHHHHCCC-CCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 3764680-----2001323334778-8767623456-53451245-666998999999999999999999980 No 84 >PRK08512 consensus Probab=96.86 E-value=0.0054 Score=40.87 Aligned_cols=15 Identities=20% Similarity=0.439 Sum_probs=7.7 Q ss_pred HHCCCEEECCCHHHH Q ss_conf 870835605730100 Q gi|254780419|r 585 RELGLRIVENPKKIL 599 (622) Q Consensus 585 ~e~~i~~~~~~k~~~ 599 (622) +-+|+-+..-|=..+ T Consensus 973 e~lG~y~s~HPl~~~ 987 (1185) T PRK08512 973 ETLGIYVSGHPLDEF 987 (1185) T ss_pred HHHCCCCCCCHHHHH T ss_conf 973428760718999 No 85 >TIGR00469 pheS_mito phenylalanyl-tRNA synthetase; InterPro: IPR004530 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Phenylalanyl-tRNA synthetase (6.1.1.20 from EC) is an alpha2/beta2 tetramer composed of 2 subunits that belongs to class IIc. In eubacteria, a small subunit (pheS gene) can be designated as beta (E. coli) or alpha subunit (nomenclature adopted in InterPro). Reciprocally the large subunit (pheT gene) can be designated as alpha (E. coli) or beta (see IPR004531 from INTERPRO and IPR004532 from INTERPRO). In all other kingdoms the two subunits have equivalent length in eukaryota, and can be identified by specific signatures. The enzyme from Thermus thermophilus has an alpha 2 beta 2 type quaternary structure and is one of the most complicated members of the synthetase family. Identification of phenylalanyl-tRNA synthetase as a member of class II aaRSs was based only on sequence alignment of the small alpha-subunit with other synthetases . This family describes the mitochondrial phenylalanyl-tRNA synthetases. Unlike all other known phenylalanyl-tRNA synthetases, the mitochondrial form demonstrated from yeast is monomeric. It is similar to but longer than the alpha subunit (PheS) of the alpha 2 beta 2 form found in bacteria, Archaea, and eukaryotes, and shares the characteristic motifs of class II aminoacyl-tRNA ligases.; GO: 0000166 nucleotide binding, 0004826 phenylalanine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006432 phenylalanyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=96.81 E-value=0.00073 Score=47.13 Aligned_cols=67 Identities=30% Similarity=0.441 Sum_probs=44.1 Q ss_pred CCCHHHCCCCCCCCCHHHHHCCCHHHHHHHCCCEEECCEEECC-CCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 0002110024766430655418877741313677775677212-512315699999999985999899986677899998 Q gi|254780419|r 451 KIDFAHNPFTMPQGGMESLKGNDLLSIKAFQYDLVCNGFEIAS-GGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAF 529 (622) Q Consensus 451 r~~s~HHPFTaP~~~~~~l~~~dp~~v~a~~YDLVlNG~Eigg-GSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al 529 (622) ||.-+.-|||+|..|.+-....+.+ ||.| |=+| ..++. T Consensus 266 RW~~~YFP~T~PSWE~E~y~~~~WL--------------E~~GCG~~R-------~~~L~-------------------- 304 (460) T TIGR00469 266 RWVDAYFPFTAPSWELEVYFKDEWL--------------ELLGCGLIR-------QDVLL-------------------- 304 (460) T ss_pred EEEECCCCCCCCCCEEEEEECCCEE--------------EEECCCHHH-------HHHHH-------------------- T ss_conf 5750338898884058988837655--------------651551467-------78988-------------------- Q ss_pred HCCCCCC--CC--EECCHHHHHHHHCCCCCCEE Q ss_conf 6084973--42--22468999999837398403 Q gi|254780419|r 530 QCGMPPH--GG--IAAGIDRIVMLLLGAKNVRE 558 (622) Q Consensus 530 ~yG~PPH--gG--~alGlDRlvmll~g~~sIRd 558 (622) +.|.-|- -| ||+|||||.|+|=.-+.||= T Consensus 305 R~G~~~SetIGyAFG~GL~R~AM~LF~IPDIRL 337 (460) T TIGR00469 305 RAGVKQSETIGYAFGLGLDRLAMLLFDIPDIRL 337 (460) T ss_pred HCCCCCCCCCCEEECCCHHHHHHHHHCCCCEEE T ss_conf 458973332200001008899988734886156 No 86 >KOG2324 consensus Probab=96.80 E-value=0.0023 Score=43.59 Aligned_cols=113 Identities=28% Similarity=0.421 Sum_probs=71.8 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCCC-----CC--C--CCCCCCCCCCCCHHHHHHC-------------CCCHH Q ss_conf 2458999999987520365146203321124-----64--5--6645432222210344311-------------23854 Q gi|254780419|r 142 IVKRTRIINSMRCRMIAENFIECSTPILTAS-----SP--E--GARDFLVPSRVNQGSFYAL-------------PQAPQ 199 (622) Q Consensus 142 l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ks-----Tp--e--GArdFlVPsR~~~g~fYAL-------------pQSPQ 199 (622) +|.-.|++..++.-|.+-|=.+|.-|+|+.. |- + |..-|-+ |-.+|+-|+| .+||= T Consensus 52 ~R~~~K~~~~l~~~mqs~Ga~kIslp~ls~~~LWekTgRw~~~gsEl~rl--~Dr~gkq~cL~pThEE~iT~lmat~~~l 129 (457) T KOG2324 52 LRVLNKLCRLLDNEMQSGGAQKISLPILSSKELWEKTGRWDAMGSELFRL--HDRKGKQMCLTPTHEEDITALMATYIPL 129 (457) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHCCCCCCCCHHHEEE--ECCCCCEECCCCCHHHHHHHHHHHCCCC T ss_conf 79999999999999872367057404468689987618511246045375--1047877305775489999999862743 Q ss_pred HHHHHHHCCCHHHHEEEHHHHCCCC-CCCC--CCCCCEEEEEEEECCC-CHHHHHHHH---HHHHHHHHHH Q ss_conf 5644331057023311001103777-6655--3800003565410268-999999999---9999999999 Q gi|254780419|r 200 QYKQLLMASGFDRYFQIAPCFRDED-PRSD--RLPGEFYQLDMEMSFV-EQDDILNTM---ENVLRGVFEE 263 (622) Q Consensus 200 lyKQlLmv~G~dryfqiarcFRdEd-~R~d--Rqp~EFTqlD~EmsF~-~~edvm~l~---E~li~~i~~~ 263 (622) -||||=+ |+|||+|-||||= -|-- |-- ||.|=||. ||. |.|+-|..- -+....+|++ T Consensus 130 sykqlPi-----~vYQigrKfRDElrpRfGLlRgR-EFlMKDmY-sFd~~~etA~qTy~~v~~aY~~iFkq 193 (457) T KOG2324 130 SYKQLPI-----RVYQIGRKFRDELRPRFGLLRGR-EFLMKDMY-SFDSDEETAQQTYQLVDQAYDRIFKQ 193 (457) T ss_pred CCCCCCE-----EEEEECHHHHHCCCCCCCCHHHH-HHHHHHHH-CCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 2202867-----76650545441347542301247-88876533-02588889999999999999999997 No 87 >PRK12444 threonyl-tRNA synthetase; Reviewed Probab=96.74 E-value=0.0068 Score=40.16 Aligned_cols=115 Identities=22% Similarity=0.348 Sum_probs=71.6 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCC---CCC---------------CCCC-CCCCCCHHHHHHCCCCHHHHH Q ss_conf 245899999998752036514620332112464---566---------------4543-222221034431123854564 Q gi|254780419|r 142 IVKRTRIINSMRCRMIAENFIECSTPILTASSP---EGA---------------RDFL-VPSRVNQGSFYALPQAPQQYK 202 (622) Q Consensus 142 l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTp---eGA---------------rdFl-VPsR~~~g~fYALpQSPQlyK 202 (622) .++|..+...+|+-....||.||-||.+.++.- .|- ++|. =|. ..|+...--.+.|-=|| T Consensus 274 ~~i~~~le~~~r~~~~~~GY~eV~TP~i~~~~Lw~~SGH~~~Y~e~Mf~~~~d~~~~~lKPM-NCP~H~~if~~~~rSYR 352 (639) T PRK12444 274 QIIRNELEAFLREIQKEYNYQEVRTPFMMNQELWERSGHWGHYKDNMYFSEVDNKSFALKPM-NCPGHMLMFKNKLHSYR 352 (639) T ss_pred HHHHHHHHHHHHHHHHHCCCEEECCCEEEHHHHHHHCCHHHHHHHCCCCCCCCCCEEECCCC-CCHHHHHHHHCCCCCHH T ss_conf 69999999999999998398782286662156876166043333027554545403312676-85889999860562144 Q ss_pred HHHHCCCHHHHEEEHHHHCCCCCCCC----CCCCCEEEEEEEECCCCHHHHHHHHHHHH---HHHHHHH Q ss_conf 43310570233110011037776655----38000035654102689999999999999---9999998 Q gi|254780419|r 203 QLLMASGFDRYFQIAPCFRDEDPRSD----RLPGEFYQLDMEMSFVEQDDILNTMENVL---RGVFEEF 264 (622) Q Consensus 203 QlLmv~G~dryfqiarcFRdEd~R~d----Rqp~EFTqlD~EmsF~~~edvm~l~E~li---~~i~~~v 264 (622) .|=+ ||+++|.|||+|-|-+= |-= .|||=|.- .|++.+++-+-+.+++ ..+++.+ T Consensus 353 dLPl-----R~aEfg~vhR~E~SG~L~GL~RVR-~FtqDDAH-IFct~dQi~~Ei~~~~~~~~~vy~~f 414 (639) T PRK12444 353 ELPI-----RMCEFGQVHRHEFSGALNGLLRVR-TFCQDDAH-LFVTPEQIEDEIKSVMAQIDYVYKTF 414 (639) T ss_pred HCCH-----HHHHCCCEECCCCCCCCCCCCCCC-CCCCCCCE-EECCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 3652-----224336463377876645767788-99879824-65388999999999999999999747 No 88 >PRK04483 threonyl-tRNA synthetase; Validated Probab=96.74 E-value=0.0071 Score=40.06 Aligned_cols=113 Identities=24% Similarity=0.321 Sum_probs=69.9 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCC---CCCC----C--CCCCCCCCHHHHHHC-----CCCHHH------- Q ss_conf 245899999998752036514620332112464---5664----5--432222210344311-----238545------- Q gi|254780419|r 142 IVKRTRIINSMRCRMIAENFIECSTPILTASSP---EGAR----D--FLVPSRVNQGSFYAL-----PQAPQQ------- 200 (622) Q Consensus 142 l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTp---eGAr----d--FlVPsR~~~g~fYAL-----pQSPQl------- 200 (622) .++|..+...+|+-....||.||-||.+.++.- .|-- | |.+. . .+.-|+| |.--++ T Consensus 271 ~~i~~~le~~~r~~~~~~Gy~eV~tP~i~~~~Lw~~SGH~~~Y~e~Mf~~~--~-~~~~~~lKPMNCP~H~~if~~~~rS 347 (634) T PRK04483 271 WSLWQVVEQYMRKVYRDSGYGEVRCPQILDVSLWQKSGHWDNYQDAMFFTE--S-EKRTYAVKPMNCPGHVQVFNQGLHS 347 (634) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEECCEECCHHHHHHCCCHHHHHHCCCEEC--C-CCCEEEECCCCCHHHHHHHHCCCCC T ss_conf 999999999999999982995998862513778762561645530371342--5-5403551576856487998717744 Q ss_pred HHHHHHCCCHHHHEEEHHHHCCCCCCCC----CCCCCEEEEEEEECCCCHHHHHHHHHHHH---HHHHHHH Q ss_conf 6443310570233110011037776655----38000035654102689999999999999---9999998 Q gi|254780419|r 201 YKQLLMASGFDRYFQIAPCFRDEDPRSD----RLPGEFYQLDMEMSFVEQDDILNTMENVL---RGVFEEF 264 (622) Q Consensus 201 yKQlLmv~G~dryfqiarcFRdEd~R~d----Rqp~EFTqlD~EmsF~~~edvm~l~E~li---~~i~~~v 264 (622) ||-|=+ |+++++.|||+|-|-+= |-= .|||=|.- .|++.+++-+-+.+++ ..+++.+ T Consensus 348 YRdLPl-----R~aE~g~~~R~E~SG~L~GL~RvR-~FtqdDaH-IFc~~dQi~~Ei~~~l~~~~~vy~~f 411 (634) T PRK04483 348 YRDLPI-----RYGEFGACHRNEPSGALHGILRVR-GFTQDDGH-VFCLESQIEAEVTAFHQQALAVYTAF 411 (634) T ss_pred HHHCCH-----HHHHCCCEEECCCCCCCCCCCCCC-CCCCCCCE-EECCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 443436-----655433022056776545745688-99889833-56389999999999999999999863 No 89 >PRK12304 thrS threonyl-tRNA synthetase; Reviewed Probab=96.74 E-value=0.0063 Score=40.43 Aligned_cols=112 Identities=17% Similarity=0.227 Sum_probs=68.1 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCC---CCC----CC--CCCCCCCCHHHHHHC-----CCCHHHHHHHHHC Q ss_conf 245899999998752036514620332112464---566----45--432222210344311-----2385456443310 Q gi|254780419|r 142 IVKRTRIINSMRCRMIAENFIECSTPILTASSP---EGA----RD--FLVPSRVNQGSFYAL-----PQAPQQYKQLLMA 207 (622) Q Consensus 142 l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTp---eGA----rd--FlVPsR~~~g~fYAL-----pQSPQlyKQlLmv 207 (622) .++|..+...+|+-....||.||.||.+.++.- .|- +| |.+ .+ .+.-|+| |..-++||+-+ - T Consensus 239 ~~i~~~le~~~r~~~~~~Gy~eV~TP~i~~~~Lw~~SGH~~~Y~e~Mf~~--~~-d~~~y~lKPMNCP~H~~if~~~~-r 314 (604) T PRK12304 239 ARLRSKLEHLLYKAHRLRGYEPVRGPEILKSDVWKISGHYANYKENMYFT--TI-DEQEYGIKPMNCVGHIKVYQSDL-R 314 (604) T ss_pred HHHHHHHHHHHHHHHHHCCCEEECCCCCCCHHHHHCCCHHHHHCCCCCEE--EC-CCCEEECCCCCCHHHHHHHHCCC-C T ss_conf 49999999999999998299680386545588873055153210576445--05-77433036779574999986277-4 Q ss_pred CC--H-HHHEEEHHHHCCCCCCCC----CCCCCEEEEEEEECCCCHHHHHHHHHHHHHH Q ss_conf 57--0-233110011037776655----3800003565410268999999999999999 Q gi|254780419|r 208 SG--F-DRYFQIAPCFRDEDPRSD----RLPGEFYQLDMEMSFVEQDDILNTMENVLRG 259 (622) Q Consensus 208 ~G--~-dryfqiarcFRdEd~R~d----Rqp~EFTqlD~EmsF~~~edvm~l~E~li~~ 259 (622) |= + =|++++|.|||+|.|-+= |-- .|||=|.- .|++.++|-+-+-+++.. T Consensus 315 SYRdLPlR~aEfg~~hR~E~SG~L~GL~RvR-~FtqDDaH-IFc~~dQi~~Ei~~~l~~ 371 (604) T PRK12304 315 SYRDLPLKFFEYGVVHRHEKSGVLHGLFRVR-EFTQDDAH-IFCMPSQIKEEVLEILDF 371 (604) T ss_pred CHHHCCHHHHHCCCEECCCCCCCCCCCCCCC-CCCCCCCE-EECCHHHHHHHHHHHHHH T ss_conf 4443555545435320267776634645577-89778844-654899999999999999 No 90 >PRK04028 glutamyl-tRNA(Gln) amidotransferase subunit E; Validated Probab=96.60 E-value=0.074 Score=32.75 Aligned_cols=202 Identities=18% Similarity=0.212 Sum_probs=111.4 Q ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCC--------CCCCCHHHHHHHHHHHHHCCCCCEEEEECCC Q ss_conf 76401244310178752456668740378479999627876--------6521035677999876420246235520477 Q gi|254780419|r 295 NPIIMHNVTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGG--------GNRAFCDRMNEWARSQGQPGLAYIFWRLDES 366 (622) Q Consensus 295 ~~lei~dit~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~--------~srk~~D~l~e~ak~~g~kgL~~i~~~~~~~ 366 (622) ...+++|+|+.|+++.-+++++.+...+ .+.++..++-. .+|..-..+.+++|..|.+|+.. .||. T Consensus 279 ~~~~~~d~t~~~~~t~s~ii~~~l~~gg--~v~a~~~~~~~g~lg~e~q~~rR~g~E~~d~~k~~g~~Gi~H----sDEl 352 (631) T PRK04028 279 VVDEIVDVTELFKNTKSKVIKRALKKGG--KVLAVKLPGFKGLLGREIQPGRRLGTELADYAKWAGVGGIFH----SDEL 352 (631) T ss_pred CCCCHHHHHHHHCCCCCHHHHHHHHCCC--EEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCC----CCCC T ss_conf 3432001355531353078876550487--378775156344336566754200056766765415565334----5434 Q ss_pred CCCCCCCCHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEE--EEEECCCEE Q ss_conf 753335702553252789999998188755726998088541100134788999877300355751028--896077053 Q gi|254780419|r 367 GNVAGMGPVAKHLNIVNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFC--WIMDFPFYE 444 (622) Q Consensus 367 ~~~~~~spi~K~l~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~fl--WV~DFPLFe 444 (622) .... ++.++...+..++++...|.++++++++..+..++..+-....+.+.-++...-.+. =-|.| | - T Consensus 353 p~Yg--------i~~~e~~~~~~~l~~~~~Da~~l~~~~~~~~~~~~~~v~~R~~~~~~~vp~EtR~~~~dg~T~~-m-R 422 (631) T PRK04028 353 PGYG--------ITEEEVEALRDALGAGENDAVILVADEEEKAKKALEAVIERAKEALEGVPEETRKALPDGTTRY-L-R 422 (631) T ss_pred CCCC--------CCHHHHHHHHHHHCCCCCCCEEEECCCHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCCEEE-C-C T ss_conf 3446--------5266778888752467554135532737778889999999998754068555415677898742-3-5 Q ss_pred ECCCCCCCCHHHCCCCCCCCCHHHHHCCC--HHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHH-HHCCCCHHHHH-- Q ss_conf 54554700021100247664306554188--77741313677775677212512315699999999-98599989998-- Q gi|254780419|r 445 WNEEEKKIDFAHNPFTMPQGGMESLKGND--LLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAF-SNVGISKEVVE-- 519 (622) Q Consensus 445 ~dee~~r~~s~HHPFTaP~~~~~~l~~~d--p~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f-~~lgl~~ee~~-- 519 (622) +-+...|| -|.+|..++.-++ ...+++.- =|+ |+..+++| +..|+++.++. T Consensus 423 ~k~ga~Ry--------fPEPDLPPi~Is~e~Ie~Ik~~L-------PEL---------P~~kr~Rl~~eygLs~~dA~~L 478 (631) T PRK04028 423 PLPGAARM--------YPETDIPPIRITEELLEKIKKNL-------PEL---------PEEKLERLVKEYGLSEELAEQL 478 (631) T ss_pred CCCCCCCC--------CCCCCCCCEECCHHHHHHHHHHC-------CCC---------HHHHHHHHHHHCCCCHHHHHHH T ss_conf 56753220--------77988774646999999999718-------999---------9999999999709899999999 Q ss_pred ---HHHHHHHHHHHCCCCCC Q ss_conf ---66778999986084973 Q gi|254780419|r 520 ---NRFGGLYRAFQCGMPPH 536 (622) Q Consensus 520 ---~~Fg~lL~Al~yG~PPH 536 (622) ....+|-++.+.++.|- T Consensus 479 ~~~~~l~~FE~~v~~~~~pk 498 (631) T PRK04028 479 ANSERLDLFEELVKKGVKPT 498 (631) T ss_pred HCCCHHHHHHHHHHCCCCHH T ss_conf 74738999999997169999 No 91 >PRK00413 thrS threonyl-tRNA synthetase; Reviewed Probab=96.59 E-value=0.012 Score=38.36 Aligned_cols=119 Identities=26% Similarity=0.365 Sum_probs=72.7 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCC---CCC----CCCCCCCCCCHHHHHHC-----CCCHHHHHHHHHCCC Q ss_conf 245899999998752036514620332112464---566----45432222210344311-----238545644331057 Q gi|254780419|r 142 IVKRTRIINSMRCRMIAENFIECSTPILTASSP---EGA----RDFLVPSRVNQGSFYAL-----PQAPQQYKQLLMASG 209 (622) Q Consensus 142 l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTp---eGA----rdFlVPsR~~~g~fYAL-----pQSPQlyKQlLmv~G 209 (622) -++|..+...+|+-....||.||-||.+.++.- .|- +|=.-|+-.-.+.-||| |.--++||+-. -|= T Consensus 268 ~~i~~~le~~~r~~~~~~Gy~eV~TP~i~~~~Lw~~SGHw~~Y~e~Mf~~~~~~~~~~~lKPMNCP~H~~if~~~~-~SY 346 (639) T PRK00413 268 WIIRNELEDYMRRRHLKAGYQEVNTPHILDRSLWETSGHWDHYRENMFPTMSDDGEDYALKPMNCPGHILIFKSGL-RSY 346 (639) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHCCCHHHHCCCCCCCEECCCCEEEECCCCCCCHHHHHHCCC-CCH T ss_conf 9999999999999999869859836413468899654742232254223240152103204668700335644355-355 Q ss_pred --H-HHHEEEHHHHCCCCCCCC----CCCCCEEEEEEEECCCCHHHHHHHHHHHH---HHHHHH Q ss_conf --0-233110011037776655----38000035654102689999999999999---999999 Q gi|254780419|r 210 --F-DRYFQIAPCFRDEDPRSD----RLPGEFYQLDMEMSFVEQDDILNTMENVL---RGVFEE 263 (622) Q Consensus 210 --~-dryfqiarcFRdEd~R~d----Rqp~EFTqlD~EmsF~~~edvm~l~E~li---~~i~~~ 263 (622) + =||+++|.|||+|-|-+= |-- .|||=|.- -|++.++|.+-+.+++ ..+++. T Consensus 347 RdLPlR~aEfg~~~R~E~SG~L~GL~RvR-~FtqdDaH-IFc~~dQi~~Ei~~~l~~~~~vy~~ 408 (639) T PRK00413 347 RELPLRLAEFGTVHRYEKSGALHGLMRVR-GFTQDDAH-IFCTPEQIKDELLSVLDLILSVYKD 408 (639) T ss_pred HHCCHHHHHHHHHHCCCCCCCCCCEEEEE-EEECCCCE-EECCHHHHHHHHHHHHHHHHHHHHH T ss_conf 54755666411320567787445642330-00015841-3247899999999999999999997 No 92 >PRK03772 threonyl-tRNA synthetase; Validated Probab=96.52 E-value=0.012 Score=38.52 Aligned_cols=113 Identities=22% Similarity=0.314 Sum_probs=71.5 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCC---CCC----CC--CCCCCCCCHHHHHHC-----CCCHHHHHHHHHC Q ss_conf 245899999998752036514620332112464---566----45--432222210344311-----2385456443310 Q gi|254780419|r 142 IVKRTRIINSMRCRMIAENFIECSTPILTASSP---EGA----RD--FLVPSRVNQGSFYAL-----PQAPQQYKQLLMA 207 (622) Q Consensus 142 l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTp---eGA----rd--FlVPsR~~~g~fYAL-----pQSPQlyKQlLmv 207 (622) -++|..+...+|+-....||.||-||.+.+..- .|- +| |.+. ..+.-|+| |..-++||+-+ - T Consensus 271 ~~i~~~le~~~r~~~~~~Gy~eV~TP~i~~~~Lw~~SGH~~~Y~e~Mf~~~---~e~~ey~lKPMNCP~H~~iy~~~~-r 346 (642) T PRK03772 271 WTIFRELEVFVRSKLKEYQYQEVKGPFMMDRVLWEKTGHWDNYADAMFTTS---SENREYAIKPMNCPGHVQIFNQGL-K 346 (642) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEECCHHCCHHHHHHCCHHHHHHCCCCCCC---CCHHHHCCCCCCCHHHHHHHHCCC-C T ss_conf 899999999999999986985985543213778873444766410342234---201443046778734889986166-4 Q ss_pred CC--H-HHHEEEHHHHCCCCCCCC----CCCCCEEEEEEEECCCCHHHHHHHHHHHHHHH Q ss_conf 57--0-233110011037776655----38000035654102689999999999999999 Q gi|254780419|r 208 SG--F-DRYFQIAPCFRDEDPRSD----RLPGEFYQLDMEMSFVEQDDILNTMENVLRGV 260 (622) Q Consensus 208 ~G--~-dryfqiarcFRdEd~R~d----Rqp~EFTqlD~EmsF~~~edvm~l~E~li~~i 260 (622) |= + =|++++|.|||+|-|-+= |-- .|||=|.- -|.+.+++-+-+.+++.-+ T Consensus 347 SYRdLPlR~aEfg~~~R~E~SG~L~GL~RVR-~FtqdDaH-IFC~~dQi~~Ei~~~l~~~ 404 (642) T PRK03772 347 SYRDLPLRMAEFGSCHRNEPSGSLHGLMRVR-GFTQDDAH-IFCTEEQIRDEVNSCIKMV 404 (642) T ss_pred CHHHCCHHHHHHHHHHCCCCCCCCCCCCCCC-CCCCCCCE-EECCHHHHHHHHHHHHHHH T ss_conf 3443654555411343166665534745688-99878843-5537899999999999999 No 93 >PRK04036 DNA polymerase II small subunit; Validated Probab=96.44 E-value=0.082 Score=32.46 Aligned_cols=64 Identities=20% Similarity=0.266 Sum_probs=53.5 Q ss_pred HCCCEEEEEEEECCCCCCC-CEEEEEEECCCCEEEEEECCCC-HHHHHHHCCCCCCEEEEEEEEEE Q ss_conf 9898899997980044469-8799999779825999974881-57899852898858999999997 Q gi|254780419|r 16 DVGSFVRLSGWVHRVRPHG-GIIFLDIRDHYGITQVVANPDS-ACFEIAKSVRSEWVLCIDGLVMS 79 (622) Q Consensus 16 ~~g~~V~i~GwV~~~R~~g-~l~F~~lrD~sG~~Q~v~~~~~-~~~~~~~~l~~esvv~v~G~V~~ 79 (622) .-+++|+|.|-|..+|..+ +-.-+.|-|.+|.+.|++.++. +.++.+..|-...||+|+|++.. T Consensus 144 ~~~~ev~IIGmV~d~r~tknG~~~~~lED~tG~i~vli~k~~~~~~~~a~~ll~DeVIgv~G~~s~ 209 (497) T PRK04036 144 KGEEEVSIIGMVSDIRSTKNGHKIVELEDPTGTFSVLIMKDREDLGELAEDLLPDEVIGVKGTLSG 209 (497) T ss_pred CCCCCEEEEEEECCCEECCCCCEEEEEECCCCEEEEEEECCCHHHHHHHHHCCCCCEEEEEEEECC T ss_conf 689608999997541313677479999878983999994783787777762457757899999869 No 94 >PRK05425 asparagine synthetase AsnA; Provisional Probab=96.40 E-value=0.0044 Score=41.56 Aligned_cols=77 Identities=26% Similarity=0.350 Sum_probs=55.4 Q ss_pred CCCEEEC-CEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEE Q ss_conf 3677775-677212512315699999999985999899986677899998608497342224689999998373984032 Q gi|254780419|r 481 QYDLVCN-GFEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREV 559 (622) Q Consensus 481 ~YDLVlN-G~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdV 559 (622) -|+-|++ .+||+|=+||+.. +.+.+-++..|-.+ ..+-.| -=..+.--.|-+=|-|||=.||+|+|++...|-+| T Consensus 231 vw~~vl~~a~ElSSMGIRVd~-~sL~~QL~~~~~~d-r~~l~~--H~~ll~~~LP~TIGGGIGQSRlcM~lL~KaHIGEV 306 (327) T PRK05425 231 VWNPVLDRAFELSSMGIRVDE-EALKKQLKLTGDED-RLELEW--HQALLNGELPLTIGGGIGQSRLCMLLLRKAHIGEV 306 (327) T ss_pred EEEHHCCCEEEEECCEEEECH-HHHHHHHHHCCCHH-HHCCHH--HHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCE T ss_conf 960230732466235478889-99999999739735-434989--99997499984312762399999999732522115 Q ss_pred EC Q ss_conf 13 Q gi|254780419|r 560 SL 561 (622) Q Consensus 560 Ia 561 (622) .+ T Consensus 307 Q~ 308 (327) T PRK05425 307 QA 308 (327) T ss_pred EE T ss_conf 76 No 95 >cd04483 hOBFC1_like hOBFC1_like: A subfamily of OB folds similar to that found in human OB fold containing protein 1 (hOBFC1). Members of this group belong to the Replication protein A subunit 2 (RPA2) family of OB folds. RPA is a nuclear ssDNA binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). The OB fold domain of RPA2 has dual roles in ssDNA binding and trimerization. Probab=96.30 E-value=0.051 Score=33.90 Aligned_cols=69 Identities=20% Similarity=0.258 Sum_probs=53.0 Q ss_pred EEEEECCCCCCCCEEEEEEECCCCEEEEEECCC-------------------CHHHHHHHCCCCCCEEEEEEEEEE-CCC Q ss_conf 997980044469879999977982599997488-------------------157899852898858999999997-586 Q gi|254780419|r 23 LSGWVHRVRPHGGIIFLDIRDHYGITQVVANPD-------------------SACFEIAKSVRSEWVLCIDGLVMS-RSA 82 (622) Q Consensus 23 i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~-------------------~~~~~~~~~l~~esvv~v~G~V~~-r~~ 82 (622) |.|||..+|..-+...+.+-|+||.+-|++=++ ..+.+....|..++++.|.|.+.. | T Consensus 2 IlG~VVs~~ere~f~~y~vDDgTGcI~Cv~W~~~~~~~~~s~~~~~~~~~~~~~a~~~~~~l~iG~llRVRG~I~~yR-- 79 (92) T cd04483 2 ILGTVVSRRERETFYSFGVDDGTGVVNCVCWKNLSYAEVSSRSDAARILKSALMALKQAKVLEIGDLLRVRGSIRTYR-- 79 (92) T ss_pred EEEEEEEEEEECCEEEEEEECCCCEEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEEEEEEEE-- T ss_conf 379999997605716998608986699899604576776666784454444578887611577523899942688870-- Q ss_pred CCCCCCCCCCEEEEEEEEEE Q ss_conf 55588777865999988999 Q gi|254780419|r 83 ETINANIITGQIELSAQKIE 102 (622) Q Consensus 83 ~~~n~~~~tG~~Ei~~~~~~ 102 (622) |+-||.++.+. T Consensus 80 ---------g~rei~as~~~ 90 (92) T cd04483 80 ---------GEREINASVVY 90 (92) T ss_pred ---------CEEEEEEEEEE T ss_conf ---------44899689999 No 96 >cd04490 PolII_SU_OBF PolII_SU_OBF: A subfamily of OB folds corresponding to the OB fold found in Pyrococcus abyssi DNA polymerase II (PolII) small subunit. PolII is a family D DNA polymerase, having a 3-prime to 5-prime exonuclease activity. P. abyssi PolII is heterodimeric. The large subunit appears to be the polymerase, and the small subunit may be the exonuclease. The small subunit contains a calcineurin-like phosphatase superfamily domain C-terminal to this OB-fold domain. Probab=96.29 E-value=0.033 Score=35.31 Aligned_cols=60 Identities=27% Similarity=0.380 Sum_probs=50.8 Q ss_pred EEEEEEEECCCCCCC-CEEEEEEECCCCEEEEEECCCC-HHHHHHHCCCCCCEEEEEEEEEE Q ss_conf 899997980044469-8799999779825999974881-57899852898858999999997 Q gi|254780419|r 20 FVRLSGWVHRVRPHG-GIIFLDIRDHYGITQVVANPDS-ACFEIAKSVRSEWVLCIDGLVMS 79 (622) Q Consensus 20 ~V~i~GwV~~~R~~g-~l~F~~lrD~sG~~Q~v~~~~~-~~~~~~~~l~~esvv~v~G~V~~ 79 (622) +|++-|-|..+|..+ +-..+.|-|.||.++|++.+++ +.++.+..|-...||.|+|++.. T Consensus 1 ev~iIGmV~d~r~TknG~~~~~lED~TG~i~vl~~kd~~~~~~~~~~ll~DeVIGV~G~~s~ 62 (79) T cd04490 1 EVSIIGMVNDVRSTKNGHRIVELEDTTGRITVLLTKDKEELFEEAEDILPDEVIGVSGTVSK 62 (79) T ss_pred CEEEEEEECCCEECCCCCEEEEEECCCCEEEEEEECCCHHHHHHHHCCCCCCEEEEEEEECC T ss_conf 96999999332761688789999879998999996893677566752568878999999878 No 97 >COG0441 ThrS Threonyl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=96.26 E-value=0.013 Score=38.07 Aligned_cols=138 Identities=23% Similarity=0.330 Sum_probs=79.7 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCC---CCCCCC------CCCCCCCCHHHHHHC-----CCCHHHHHHHHHC Q ss_conf 24589999999875203651462033211246---456645------432222210344311-----2385456443310 Q gi|254780419|r 142 IVKRTRIINSMRCRMIAENFIECSTPILTASS---PEGARD------FLVPSRVNQGSFYAL-----PQAPQQYKQLLMA 207 (622) Q Consensus 142 l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksT---peGArd------FlVPsR~~~g~fYAL-----pQSPQlyKQlLmv 207 (622) .++|..+...+|.-....||.||.||.+.... ..|.-+ |++.+- +.-|+| |.-=|.||+.+-. T Consensus 220 ~~ir~~le~y~~~~~~~~Gy~~V~TP~~~~~~l~~~SGH~~~y~e~mf~~~~~---~~~~~lKpmNCpgh~~ifk~~~~S 296 (589) T COG0441 220 ATIRNLLEDYVRTKLRSYGYQEVKTPVLADLELWELSGHWDNYKEDMFLTESD---DREYALKPMNCPGHILIFKSGLRS 296 (589) T ss_pred CCHHHHHHHHHHHHHHHCCCEEECCCEEEECCCCHHCCCHHHCCCCCEEECCC---CHHHEEEECCCHHHHHHHHCCCCC T ss_conf 66999999999989875585271287363000010112045513660662137---723306410787689998537861 Q ss_pred CC--HHHHEEEHHHHCCCCCCCC----CCCCCEEEEEEEECCCCHHHHHHHHHHHHHH---HHHHHHCCCCCCCCCCCCC Q ss_conf 57--0233110011037776655----3800003565410268999999999999999---9999808723457732121 Q gi|254780419|r 208 SG--FDRYFQIAPCFRDEDPRSD----RLPGEFYQLDMEMSFVEQDDILNTMENVLRG---VFEEFSNGNTVSNNFLRIC 278 (622) Q Consensus 208 ~G--~dryfqiarcFRdEd~R~d----Rqp~EFTqlD~EmsF~~~edvm~l~E~li~~---i~~~v~~~~~i~~pF~rmt 278 (622) == -=|+++++.|||.|.|-+. |= -+|||=|.- -|++.+.+.+-+-+.+.. +++.+. ++.--.+++ T Consensus 297 YR~LP~r~~E~g~v~R~E~SGal~GL~Rv-R~ftqdD~H-ifc~~dQi~~E~~~~~~~i~~v~~~fg----~~~y~~~ls 370 (589) T COG0441 297 YRELPLRLAEFGYVYRYEKSGALHGLMRV-RGFTQDDAH-IFCTPDQIKDEFKGILELILEVYKDFG----FTDYEVKLS 370 (589) T ss_pred EECCCHHHHHCCEEECCCCCCHHHCCCCC-CCEEECCCC-EECCHHHHHHHHHHHHHHHHHHHHHCC----CCEEEEEEE T ss_conf 10044043314422115676422224001-132245620-140467889999999999999998469----753799996 Q ss_pred HHHHHHHCCCCC Q ss_conf 898875136777 Q gi|254780419|r 279 YDDAIRRYGTDK 290 (622) Q Consensus 279 Y~eAm~~YGsDK 290 (622) ..+ ..||+|+ T Consensus 371 ~r~--k~ig~d~ 380 (589) T COG0441 371 TRP--KFIGSDE 380 (589) T ss_pred CCC--CCCCCHH T ss_conf 477--6468735 No 98 >KOG1035 consensus Probab=96.11 E-value=0.019 Score=37.03 Aligned_cols=106 Identities=17% Similarity=0.254 Sum_probs=67.6 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCC-CCC-CCCC-CCCCCCCCCCCHHHHHHCC----C---------CHHHHHHHH Q ss_conf 2458999999987520365146203321-124-6456-6454322222103443112----3---------854564433 Q gi|254780419|r 142 IVKRTRIINSMRCRMIAENFIECSTPIL-TAS-SPEG-ARDFLVPSRVNQGSFYALP----Q---------APQQYKQLL 205 (622) Q Consensus 142 l~~Rs~i~~~~R~~l~~~gF~EveTP~L-~ks-TpeG-ArdFlVPsR~~~g~fYALp----Q---------SPQlyKQlL 205 (622) =.+|..+...+-+-|..+|++|++||-+ +.+ ++-+ +..+.|= .|-|.--+|| + |-+.+| T Consensus 932 ~~l~~~v~e~~~~ifr~Hga~~l~tpp~~~~~~~~~~~~~~v~~l--d~sG~~v~Lp~DLr~pfar~vs~N~~~~~K--- 1006 (1351) T KOG1035 932 NELREYVVEEVVKIFRKHGAIELETPPLSLRNACAYFSRKAVELL--DHSGDVVELPYDLRLPFARYVSRNSVLSFK--- 1006 (1351) T ss_pred HHHHHHHHHHHHHHHHHHCCEECCCCCCCCCCCCCHHCCCEEEEE--CCCCCEEEEECCCCCHHHHHHHHCHHHHHH--- T ss_conf 899999999999999982430216786553345420015403544--279978996300124677886605288877--- Q ss_pred HCCCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEE-----CCCCHHHHHHHHHHHHHHHHHH Q ss_conf 105702331100110377766553800003565410-----2689999999999999999999 Q gi|254780419|r 206 MASGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEM-----SFVEQDDILNTMENVLRGVFEE 263 (622) Q Consensus 206 mv~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~Em-----sF~~~edvm~l~E~li~~i~~~ 263 (622) =|.|.|+||-++ + +||.|+++.|.-. | ...-+++.++-+++..++.+ T Consensus 1007 -------ry~i~rVyr~~~--~-~hP~~~~ec~fDii~~t~s-l~~AE~L~vi~Ei~~~~l~~ 1058 (1351) T KOG1035 1007 -------RYCISRVYRPAI--H-NHPKECLECDFDIIGPTTS-LTEAELLKVIVEITTEILHE 1058 (1351) T ss_pred -------HHHHHEEECCCC--C-CCCCCCCCEEEEEECCCCC-CCHHHHHHHHHHHHHHHHCC T ss_conf -------762320344634--5-8984010015667558777-40789999999999998605 No 99 >PRK03991 threonyl-tRNA synthetase; Validated Probab=96.09 E-value=0.017 Score=37.42 Aligned_cols=117 Identities=20% Similarity=0.230 Sum_probs=72.8 Q ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCCCCC-CC---C-----CCCCCCCCCCC--CHHHHHHCCCCHHHHHHHHHCCC-- Q ss_conf 458999999987520365146203321124-64---5-----66454322222--10344311238545644331057-- Q gi|254780419|r 143 VKRTRIINSMRCRMIAENFIECSTPILTAS-SP---E-----GARDFLVPSRV--NQGSFYALPQAPQQYKQLLMASG-- 209 (622) Q Consensus 143 ~~Rs~i~~~~R~~l~~~gF~EveTP~L~ks-Tp---e-----GArdFlVPsR~--~~g~fYALpQSPQlyKQlLmv~G-- 209 (622) .+|..+..-+|+-..+.|++||+||++... -+ | |+|-|.+-+-. .-=+|-|-|+-=|+|||.+ .|= T Consensus 228 ~i~~lle~Yv~~~~~~~G~~eV~TP~m~d~~~~~~sgH~~~f~e~my~~~~d~~~~~LrpmnCpgh~~i~~~~~-~SYRd 306 (613) T PRK03991 228 LMKDLLEDYVYNLVKELGAMPVETPNMYDLSHPAIREHADKFGERQYRVKSDKKDLMLRFAACFGQFAMLKDMT-ISYKN 306 (613) T ss_pred HHHHHHHHHHHHHHHHCCCEEEECCCEEECCCCHHHHHHHHHCCCCEEEECCCHHHCCCCCCCCCCEEEECCCC-CCHHH T ss_conf 99999999999999863966845761640554025568876111342563276332304457863123510244-57432 Q ss_pred H-HHHEEEHHH-HCCCCCCCC----CCCCCEEEEEEEECCCCH-HHHHHHHHHHHHHHHH Q ss_conf 0-233110011-037776655----380000356541026899-9999999999999999 Q gi|254780419|r 210 F-DRYFQIAPC-FRDEDPRSD----RLPGEFYQLDMEMSFVEQ-DDILNTMENVLRGVFE 262 (622) Q Consensus 210 ~-dryfqiarc-FRdEd~R~d----Rqp~EFTqlD~EmsF~~~-edvm~l~E~li~~i~~ 262 (622) + =|.|++|.| ||.|.|-+= |-- .|||=|+- .|++. +++++-+.+++.-+.. T Consensus 307 LPlRl~E~g~~syR~E~SG~L~GL~RVR-~FTqdDaH-IFC~~~dQi~~E~~~~~~li~~ 364 (613) T PRK03991 307 LPLKMYELSDYSFRLEQRGELVGLKRLR-AFTMPDMH-TLCKDMDEAMEEFEKQHKLILE 364 (613) T ss_pred CCHHHHHHCCCEEECCCCCCEECEEEEE-CCCCCCCE-ECCCCHHHHHHHHHHHHHHHHH T ss_conf 6467765224202125677444401332-35668611-3168778999999999999999 No 100 >TIGR00442 hisS histidyl-tRNA synthetase; InterPro: IPR015807 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Histidyl-tRNA synthetase (6.1.1.21 from EC) is an alpha2 dimer that belongs to class IIa. Every completed genome includes a histidyl-tRNA synthetase. Apparent second copies from Bacillus subtilis, Synechocystis sp., and Aquifex aeolicus are slightly shorter, more closely related to each other than to other hisS proteins, and not demonstrated to act as histidyl-tRNA synthetases (see IPR004517 from INTERPRO). The regulatory protein kinase GCN2 of Saccharomyces cerevisiae (YDR283c), and related proteins from other species designated eIF-2 alpha kinase, have a domain closely related to histidyl-tRNA synthetase that may serve to detect and respond to uncharged tRNA(his), an indicator of amino acid starvation, but these regulatory proteins are not orthologous.; GO: 0000166 nucleotide binding, 0004821 histidine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006427 histidyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=96.02 E-value=0.017 Score=37.41 Aligned_cols=119 Identities=24% Similarity=0.366 Sum_probs=73.0 Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCC---C-CC-CCC------CCCCCCCCHHHHHHCCCCHH------- Q ss_conf 112124589999999875203651462033211246---4-56-645------43222221034431123854------- Q gi|254780419|r 138 LHKNIVKRTRIINSMRCRMIAENFIECSTPILTASS---P-EG-ARD------FLVPSRVNQGSFYALPQAPQ------- 199 (622) Q Consensus 138 ~~~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksT---p-eG-Ard------FlVPsR~~~g~fYALpQSPQ------- 199 (622) +-.-...|..+...+|..+...||.||.||++-... . -| +.+ |.. .-+.|.- +...|+ T Consensus 14 ~p~~~~~~~~~~~~~~~~~~~~g~~~~~~P~~e~~~~~~~~~~~~~~~~~~~~y~f--~d~~~~~--~~lrp~~t~~~~r 89 (446) T TIGR00442 14 LPGELVYRQWIEAKLRSVLESYGYSEIRTPIFEYTELFARGIGKSGDEVEKELYTF--KDKGGRD--LALRPELTAPVVR 89 (446) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHEEE--ECCCCCE--EEECCCHHHHHHH T ss_conf 54157899999999999997517720012114445666530253013344443012--1278841--4422200368999 Q ss_pred HH-HHHHHC--CCHHHHEEEHHHHCCCCCC-CCCCCCCEEEEEEEEC----CCCHHHHHHHHHHHHHHHH Q ss_conf 56-443310--5702331100110377766-5538000035654102----6899999999999999999 Q gi|254780419|r 200 QY-KQLLMA--SGFDRYFQIAPCFRDEDPR-SDRLPGEFYQLDMEMS----FVEQDDILNTMENVLRGVF 261 (622) Q Consensus 200 ly-KQlLmv--~G~dryfqiarcFRdEd~R-~dRqp~EFTqlD~Ems----F~~~edvm~l~E~li~~i~ 261 (622) .+ ...++. ..--|.|.+|+|||-|... +.|.- ||+|.++|.. ..-.-+++.+.-..+..+. T Consensus 90 ~~~~~~~~~~~~~p~~~~~~g~~~r~e~p~~~gr~r-~f~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~g 158 (446) T TIGR00442 90 LVAENKLLLPANKPLRLYYIGPVFRYERPQLKGRYR-EFWQFGCEVIGSESPLADAEVLSLAVEGLKALG 158 (446) T ss_pred HHHHHHHHCCCCCCEEEEEEECHHHCCCHHHHCCHH-HHHHCCEEEECCCCCHHHHHHHHHHHHHHHHHC T ss_conf 998754310467642432320101022412301011-332115046436652036899999999898607 No 101 >cd04487 RecJ_OBF2_like RecJ_OBF2_like: A subfamily of OB folds corresponding to the second OB fold (OBF2) of archaeal-specific proteins with similarity to eubacterial RecJ. RecJ is an ssDNA-specific exonuclease. Although the overall sequence similarity of these proteins to eubacterial RecJ proteins is marginal, they appear to carry motifs, which have been shown to be essential for nuclease function in Escherichia coli RecJ. In addition to this OB fold, most proteins in this subfamily contain: i) an N-terminal OB fold belonging to a different domain family (the ribosomal S1-like RNA-binding family); and ii) a domain, C-terminal to OBF2, characteristic of DHH family proteins. DHH family proteins include E. coli RecJ, and are predicted to have a phosphoesterase function. Probab=95.85 E-value=0.045 Score=34.30 Aligned_cols=73 Identities=22% Similarity=0.386 Sum_probs=54.8 Q ss_pred EEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEE Q ss_conf 99997980044469879999977982599997488157899852898858999999997586555887778659999889 Q gi|254780419|r 21 VRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELSAQK 100 (622) Q Consensus 21 V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~~~~ 100 (622) |.|.|-|..++.-++-.-+.++|.+|.+.|-.-.+.- ....-.+..+++|.|.|.|..|. |++.+.+.+ T Consensus 1 V~i~GeV~qikQT~GPTIFti~Detg~v~~AAFeeaG-vRAyPev~~gdiV~v~G~V~~r~----------g~lQiE~~~ 69 (73) T cd04487 1 VHIEGEVVQIKQTSGPTIFTLRDETGTVWAAAFEEAG-VRAYPEVEVGDIVRVTGEVEPRD----------GQLQIEVES 69 (73) T ss_pred CEEEEEEEEEEECCCCEEEEEECCCCCEEHHHHCCCC-CCCCCCCCCCCEEEEEEEEEEEC----------CEEEEEEEE T ss_conf 9799999998988999999997599969820513067-41288878898899998999708----------858999730 Q ss_pred EEEE Q ss_conf 9995 Q gi|254780419|r 101 IEIL 104 (622) Q Consensus 101 ~~il 104 (622) ++.| T Consensus 70 ~~~L 73 (73) T cd04487 70 LEVL 73 (73) T ss_pred EECC T ss_conf 2509 No 102 >COG0172 SerS Seryl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=95.84 E-value=0.061 Score=33.37 Aligned_cols=123 Identities=20% Similarity=0.315 Sum_probs=72.0 Q ss_pred HHHHHHHHHHH----HCCCEEEEECCCCCCCCC-CCCC----------------CCCCCCCCCHHHHHHCCCCHHHHHHH Q ss_conf 99999998752----036514620332112464-5664----------------54322222103443112385456443 Q gi|254780419|r 146 TRIINSMRCRM----IAENFIECSTPILTASSP-EGAR----------------DFLVPSRVNQGSFYALPQAPQQYKQL 204 (622) Q Consensus 146 s~i~~~~R~~l----~~~gF~EveTP~L~ksTp-eGAr----------------dFlVPsR~~~g~fYALpQSPQlyKQl 204 (622) +.+-+|+.+|| .++||.|+.+|.|.+... -|.+ -||+||.-.|= |+| ..+++ T Consensus 174 a~L~rAL~~f~ld~~~~~Gf~e~~~P~lv~~e~m~gtgqlpkf~e~~y~v~~~~~~LipTaEvpl--~~l-----~~~Ei 246 (429) T COG0172 174 ARLERALIQFMLDLHTKHGFTEVLPPYLVNLESMFGTGQLPKFEEDLYKVEDPDLYLIPTAEVPL--TNL-----HRDEI 246 (429) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEECCEEECHHHHHCCCCCCCCCCCCEEECCCCEEEEECCHHHH--HHH-----HCCCC T ss_conf 99999999999999987696586576060598862237898880121584589879970202156--786-----51620 Q ss_pred HHCCCH-HHHEEEHHHHCCCCCC------C-CCCCCCEEEEEEEE-CCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 310570-2331100110377766------5-53800003565410-2689999999999999999999808723457732 Q gi|254780419|r 205 LMASGF-DRYFQIAPCFRDEDPR------S-DRLPGEFYQLDMEM-SFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFL 275 (622) Q Consensus 205 Lmv~G~-dryfqiarcFRdEd~R------~-dRqp~EFTqlD~Em-sF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~ 275 (622) +-..-+ -+|+-..+|||-|-.- . .|+- +|.-+ || .|++-++-....|+|+... .+++. .+..|+. T Consensus 247 l~~~~LP~k~~~~S~cFR~EAGs~GrdtrGliRvH-QF~KV--E~v~~~~Pe~S~~~~E~m~~~a-e~il~--~LeLPyR 320 (429) T COG0172 247 LDEEDLPIKYTAYSPCFRSEAGSAGKDTRGLIRVH-QFDKV--ELVVITKPEESEEELEEMLGNA-EEVLQ--ELELPYR 320 (429) T ss_pred CCCCCCCEEEEEECHHHHCCCCCCCCCCCCEEEEE-EEEEE--EEEEEECCCHHHHHHHHHHHHH-HHHHH--HHCCCCE T ss_conf 15212780267877254214565664355301466-43558--9999707011699999999999-99999--7089836 Q ss_pred CCCHHH Q ss_conf 121898 Q gi|254780419|r 276 RICYDD 281 (622) Q Consensus 276 rmtY~e 281 (622) .+.+.. T Consensus 321 vv~lct 326 (429) T COG0172 321 VVNLCT 326 (429) T ss_pred EEEECC T ss_conf 843256 No 103 >cd04485 DnaE_OBF DnaE_OBF: A subfamily of OB folds corresponding to the C-terminal OB-fold nucleic acid binding domain of Thermus aquaticus and Escherichia coli type C replicative DNA polymerase III alpha subunit (DnaE). The DNA polymerase holoenzyme of E. coli contains two copies of this replicative polymerase, each of which copies a different DNA strand. This group also contains Bacillus subtilis DnaE. Replication in B. subtilis and Staphylococcus aureus requires two different type C polymerases, polC and DnaE, both of which are thought to be included in the DNA polymerase holoenzyme. At the B. subtilis replication fork, polC appears to be involved in leading strand synthesis and DnaE in lagging strand synthesis. Probab=95.65 E-value=0.12 Score=31.30 Aligned_cols=75 Identities=29% Similarity=0.468 Sum_probs=54.2 Q ss_pred EEEEEECCCCC----CC-CEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEE Q ss_conf 99979800444----69-87999997798259999748815789985289885899999999758655588777865999 Q gi|254780419|r 22 RLSGWVHRVRP----HG-GIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIEL 96 (622) Q Consensus 22 ~i~GwV~~~R~----~g-~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei 96 (622) +++|=|.++|. .| +++|+.|-|.+|.+-+++-++. .......|..+.+|.|+|.|..+. |.+.+ T Consensus 1 ~iaG~I~~~k~~~tk~G~~~a~~tleD~~g~~e~~~F~~~-~~~~~~~l~~~~~v~i~g~v~~~~----------~~~~l 69 (84) T cd04485 1 TVAGLVTSVRRRRTKKGKRMAFVTLEDLTGSIEVVVFPET-YEKYRDLLKEDALLLVEGKVERRD----------GGLRL 69 (84) T ss_pred CEEEEEEEEEEEECCCCCEEEEEEEECCCCCEEEEECHHH-HHHHHHHHCCCCEEEEEEEEEEEC----------CEEEE T ss_conf 9899999989985579998999999437874999996999-999888744698899999999969----------98999 Q ss_pred EEEEEEEEECC Q ss_conf 98899995021 Q gi|254780419|r 97 SAQKIEILSVA 107 (622) Q Consensus 97 ~~~~~~il~~a 107 (622) .++++.-|..+ T Consensus 70 ~~~~i~~l~~~ 80 (84) T cd04485 70 IAERIEDLEDA 80 (84) T ss_pred EEEEEEEHHHH T ss_conf 99994678998 No 104 >PRK06253 O-phosphoseryl-tRNA synthetase; Reviewed Probab=95.49 E-value=0.014 Score=37.92 Aligned_cols=57 Identities=28% Similarity=0.579 Sum_probs=36.7 Q ss_pred CCCEEECCE-EECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCC-CCCCEECCHHHHHHHHCCCCCCEE Q ss_conf 367777567-721251231569999999998599989998667789999860849-734222468999999837398403 Q gi|254780419|r 481 QYDLVCNGF-EIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMP-PHGGIAAGIDRIVMLLLGAKNVRE 558 (622) Q Consensus 481 ~YDLVlNG~-EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~P-PHgG~alGlDRlvmll~g~~sIRd 558 (622) +|-=.+.|| ||+|=+ |.+|++.. .||-| |---||||++||.|++-|.+.||+ T Consensus 285 ~yhp~~~gWvEvatfG--IysP~aL~------------------------~ygI~~PVmnlGlGvERLAMI~~~~~DiR~ 338 (527) T PRK06253 285 AYHPKLDGWVEVATFG--IYSPTALA------------------------EYGIDVPVMNLGLGVERLAMILHNAEDVRE 338 (527) T ss_pred EECCCCCCEEEEECCC--CCCHHHHH------------------------HCCCCCCEEECCCCHHHHHHHHCCCHHHHH T ss_conf 9636678628973142--56754454------------------------319997422001118999999818277887 Q ss_pred EECCCC Q ss_conf 213789 Q gi|254780419|r 559 VSLFPM 564 (622) Q Consensus 559 VIaFPK 564 (622) . .+|- T Consensus 339 l-~ypQ 343 (527) T PRK06253 339 M-VYPQ 343 (527) T ss_pred H-HCCC T ss_conf 6-3345 No 105 >cd04478 RPA2_DBD_D RPA2_DBD_D: A subfamily of OB folds corresponding to the OB fold of the central ssDNA-binding domain (DBD)-D of human RPA2 (also called RPA32). RPA2 is a subunit of Replication protein A (RPA). RPA is a nuclear ssDNA-binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). The major DNA binding activity of RPA is associated with RPA1 DBD-A and DBD-B; RPA2 DBD-D is a weak ssDNA-binding domain. RPA2 DBD-D is also involved in trimerization. The ssDNA binding mechanism is believed to be multistep and to involve conformational change. N-terminal to human RPA2 DBD-D is a domain containing all the known phosphorylation sites of RPA. Human RPA2 is phosphorylated in a cell cycle dependent manner in response to DNA dam Probab=95.39 E-value=0.25 Score=28.93 Aligned_cols=76 Identities=21% Similarity=0.178 Sum_probs=53.9 Q ss_pred EEEEEEEECCCCCCCCEEEEEEECCCCEEEEEE--CCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEE Q ss_conf 899997980044469879999977982599997--488157899852898858999999997586555887778659999 Q gi|254780419|r 20 FVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVA--NPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELS 97 (622) Q Consensus 20 ~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~--~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~ 97 (622) .|++.|+|.++.....-.-+.|-|+||.+.+.. +.+.........+..+.-|.|.|.++.- .|..-|. T Consensus 1 ~V~~VG~V~~v~~~~t~~~y~idDgTG~i~~~~w~~~~~~~~~~~~~i~~g~YVrV~G~lk~f----------~~~~~I~ 70 (95) T cd04478 1 QVTLVGVVRNVEEQSTNITYTIDDGTGTIEVRQWLDDDNDDSSEVEPIEEGTYVRVFGNLKSF----------QGKKSIM 70 (95) T ss_pred CEEEEEEEEEEEECCCEEEEEEECCCCCEEEEEECCCCCCCCCCCCCCCCCCEEEEEEEEEEE----------CCEEEEE T ss_conf 989999999877725579999987987399999637787642224433269899999998002----------8913799 Q ss_pred EEEEEEEE Q ss_conf 88999950 Q gi|254780419|r 98 AQKIEILS 105 (622) Q Consensus 98 ~~~~~il~ 105 (622) +.++.-+. T Consensus 71 ~~~irpv~ 78 (95) T cd04478 71 AFSIRPVT 78 (95) T ss_pred EEEEEEEC T ss_conf 99999937 No 106 >cd00645 AsnA Asparagine synthetase (aspartate-ammonia ligase) (AsnA) catalyses the conversion of L-aspartate to L-asparagine in the presence of ATP and ammonia. AsnA is a homodimeric enzyme which is structurally similiar to the catalytic core domain of class II aminoacyl-tRNA synthetases. Ammonia-dependent AsnA is not homologous to the glutamine-dependent asparagine synthetase AsnB. Probab=95.30 E-value=0.033 Score=35.24 Aligned_cols=77 Identities=25% Similarity=0.321 Sum_probs=55.9 Q ss_pred CCCEEEC-CEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEE Q ss_conf 3677775-677212512315699999999985999899986677899998608497342224689999998373984032 Q gi|254780419|r 481 QYDLVCN-GFEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREV 559 (622) Q Consensus 481 ~YDLVlN-G~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdV 559 (622) -|+-|++ .+||+|=+||+.. +.+++-++..|-.+. .+ +-|-=..+.--.|-+=|-|||=.||+|+|+....|-+| T Consensus 221 vw~~vl~~a~ElSSMGIRVd~-~~L~~QL~~~~~~dr-~~--l~~Hk~ll~~~LP~TIGGGIGQSRlcM~lL~KaHIGEV 296 (309) T cd00645 221 VWNPVLQRAFELSSMGIRVDE-ESLQKQLKLAGDEDR-LE--LPFHKMLLNGELPQTIGGGIGQSRLCMFLLQKAHIGEV 296 (309) T ss_pred EECHHCCCEEEEECCEEEECH-HHHHHHHHHCCCHHH-HC--CHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCE T ss_conf 970230732256346468889-999999997598354-35--98999997499984523760399999999753732236 Q ss_pred EC Q ss_conf 13 Q gi|254780419|r 560 SL 561 (622) Q Consensus 560 Ia 561 (622) .+ T Consensus 297 Q~ 298 (309) T cd00645 297 QA 298 (309) T ss_pred EE T ss_conf 75 No 107 >COG3705 HisZ ATP phosphoribosyltransferase involved in histidine biosynthesis [Amino acid transport and metabolism] Probab=95.21 E-value=0.034 Score=35.21 Aligned_cols=96 Identities=22% Similarity=0.265 Sum_probs=60.1 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCC---C---C--CCCCCCCCCCCHHHHHHCC--CCHHHH---HHHHHCC Q ss_conf 245899999998752036514620332112464---5---6--6454322222103443112--385456---4433105 Q gi|254780419|r 142 IVKRTRIINSMRCRMIAENFIECSTPILTASSP---E---G--ARDFLVPSRVNQGSFYALP--QAPQQY---KQLLMAS 208 (622) Q Consensus 142 l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTp---e---G--ArdFlVPsR~~~g~fYALp--QSPQly---KQlLmv~ 208 (622) .+....+...+++.|...||..|+||+|--..| + + .|-|.+-.. .|.-.+|. --+|.- -. ++.+ T Consensus 17 ~~~~~~i~~~l~~~f~~~Gy~~v~tP~lE~~d~~l~~~g~~l~~~~f~l~d~--~g~~l~LRpD~T~pVaR~~~~-~~~~ 93 (390) T COG3705 17 ARRKEEIRDQLLALFRAWGYERVETPTLEPADPLLDGAGEDLRRRLFKLEDE--TGGRLGLRPDFTIPVARIHAT-LLAG 93 (390) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCHHHHHHHEEEECC--CCCEEEECCCCCHHHHHHHHH-HCCC T ss_conf 7647999999999999808740466313402665402225555220687657--897688354330899999998-2378 Q ss_pred CHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEE Q ss_conf 702331100110377766553800003565410 Q gi|254780419|r 209 GFDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEM 241 (622) Q Consensus 209 G~dryfqiarcFRdEd~R~dRqp~EFTqlD~Em 241 (622) +-.||.=.|++||....+..|+- ||+|+-+|+ T Consensus 94 ~P~Rl~Y~G~Vfr~~~~~~g~~~-Ef~QaGiEl 125 (390) T COG3705 94 TPLRLSYAGKVFRAREGRHGRRA-EFLQAGIEL 125 (390) T ss_pred CCCEEEECCHHHHCCHHCCCCCC-CHHHHHHHH T ss_conf 87146640145402010267642-045533677 No 108 >cd04482 RPA2_OBF_like RPA2_OBF_like: A subgroup of uncharacterized archaeal OB folds with similarity to the OB fold of the central ssDNA-binding domain (DBD)-D of human RPA2 (also called RPA32). RPA2 is a subunit of Replication protein A (RPA). RPA is a nuclear ssDNA-binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). The major DNA binding activity of RPA is associated with RPA1 DBD-A and DBD-B; RPA2 DBD-D is a weak ssDNA-binding domain. RPA2 DBD-D is also involved in trimerization. The ssDNA binding mechanism is believed to be multistep and to involve conformational change. N-terminal to human RPA2 DBD-D is a domain containing all the known phosphorylation sites of RPA. Human RPA2 is phosphorylated in a cell cycle depende Probab=94.93 E-value=0.34 Score=28.02 Aligned_cols=74 Identities=22% Similarity=0.333 Sum_probs=58.6 Q ss_pred EEEEEECCCC--CCCCEEEEEEECCCCEEEEEECCCC-HHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEE Q ss_conf 9997980044--4698799999779825999974881-578998528988589999999975865558877786599998 Q gi|254780419|r 22 RLSGWVHRVR--PHGGIIFLDIRDHYGITQVVANPDS-ACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELSA 98 (622) Q Consensus 22 ~i~GwV~~~R--~~g~l~F~~lrD~sG~~Q~v~~~~~-~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~~ 98 (622) .+.|-|.+.- ..|+=.|+.+.|++|.+.+++-+.. ..-..+..|..++.|.|.|.|.. +. .|.+ T Consensus 2 ~~~G~V~~~P~~i~GGHv~~~v~~~~~~i~~~afepTg~~r~~~r~L~~GD~I~v~G~v~~------------~~-~lnl 68 (91) T cd04482 2 RVTGKVVEEPRTIEGGHVFFKISDGTGEIDCAAYEPTKEFRDVVRLLIPGDEVTVYGSVRP------------GT-TLNL 68 (91) T ss_pred EEEEEEECCEEEECCCEEEEEEEECCCEEEEEEECCCCHHHHHHHHCCCCCEEEEEEEECC------------CC-EEEE T ss_conf 8999997864895287499999939988999999877508899973899999999999878------------98-0889 Q ss_pred EEEEEEECCC Q ss_conf 8999950211 Q gi|254780419|r 99 QKIEILSVAE 108 (622) Q Consensus 99 ~~~~il~~a~ 108 (622) +.+.|++.+. T Consensus 69 Ekl~v~~L~~ 78 (91) T cd04482 69 EKLRVIRLAR 78 (91) T ss_pred EEEEEEECCC T ss_conf 9999632222 No 109 >COG4085 Predicted RNA-binding protein, contains TRAM domain [General function prediction only] Probab=94.80 E-value=0.25 Score=29.01 Aligned_cols=65 Identities=22% Similarity=0.151 Sum_probs=47.8 Q ss_pred HHCCCEEEEEEEECCCC--CCCCEEEEEEECCCCEEEEEECCCCHHHHHHH----CCCCCCEEEEEEEEEE Q ss_conf 99898899997980044--46987999997798259999748815789985----2898858999999997 Q gi|254780419|r 15 SDVGSFVRLSGWVHRVR--PHGGIIFLDIRDHYGITQVVANPDSACFEIAK----SVRSEWVLCIDGLVMS 79 (622) Q Consensus 15 ~~~g~~V~i~GwV~~~R--~~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~----~l~~esvv~v~G~V~~ 79 (622) ..+.+.|++.|-|.+.+ .++++.++.|.|++|.+-++.....+.+-... -+..+.+|+|+|.|.. T Consensus 48 G~l~e~v~vkg~V~~~~n~~~~gi~~l~lndgtGti~vva~~~tee~l~~n~~~p~~~eGe~veVtGrv~~ 118 (204) T COG4085 48 GRLNEEVTVKGEVTADQNAIGGGIESLVLNDGTGTITVVASRSTEETLELNEGMPVTVEGEIVEVTGRVEE 118 (204) T ss_pred CEEECCCEEEEEEEEEECCCCCCEEEEEEECCCCCEEEEEECCHHHHHHHCCCCCCCCCCCEEEEEEEEEE T ss_conf 31202532214897531012366599999779874899990673676864478873226758999888877 No 110 >KOG2783 consensus Probab=94.64 E-value=0.014 Score=37.92 Aligned_cols=75 Identities=21% Similarity=0.296 Sum_probs=46.8 Q ss_pred CCCCCHHHCCCCCCCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 47000211002476643065541887774131367777567721251231569999999998599989998667789999 Q gi|254780419|r 449 EKKIDFAHNPFTMPQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRA 528 (622) Q Consensus 449 ~~r~~s~HHPFTaP~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~A 528 (622) ..||.-+--|||.|.-+..-...++.+.+.+- --++..+.+..|+.+ .-|| T Consensus 247 ~~rwV~~yfpft~ps~eleI~~~~~wlevlgc--------------------gvi~~~il~~ag~~~-----~igw---- 297 (436) T KOG2783 247 EYRWVDAYFPFTHPSWELEIYFKGEWLEVLGC--------------------GVMRQRLLKRAGLNN-----YIGW---- 297 (436) T ss_pred HEEHEEEECCCCCCCEEEEEEECCCCHHHHCC--------------------CHHHHHHHHHCCCCC-----EEEE---- T ss_conf 11011057666898758999965876765155--------------------056778875204110-----0345---- Q ss_pred HHCCCCCCCCEECCHHHHHHHHCCCCCCEEEEC Q ss_conf 860849734222468999999837398403213 Q gi|254780419|r 529 FQCGMPPHGGIAAGIDRIVMLLLGAKNVREVSL 561 (622) Q Consensus 529 l~yG~PPHgG~alGlDRlvmll~g~~sIRdVIa 561 (622) -+|+|||||.|+|-+-+.||=-=. T Consensus 298 ---------afglgLerLAMll~~IpDiRlfWs 321 (436) T KOG2783 298 ---------AFGLGLERLAMLLFDIPDIRLFWS 321 (436) T ss_pred ---------EEECCHHHHHHHHHCCCCHHEEEC T ss_conf ---------650467888888736762210112 No 111 >PTZ00213 asparagine synthetase A; Provisional Probab=94.56 E-value=0.07 Score=32.94 Aligned_cols=77 Identities=25% Similarity=0.285 Sum_probs=55.2 Q ss_pred CCCEEEC-CEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEE Q ss_conf 3677775-677212512315699999999985999899986677899998608497342224689999998373984032 Q gi|254780419|r 481 QYDLVCN-GFEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREV 559 (622) Q Consensus 481 ~YDLVlN-G~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdV 559 (622) -|+-|++ .+||+|=+||+.. +.+++-++..|-.+ . .++-|-=..+.--.|-+=|-|||=-||+|+|+....|-+| T Consensus 259 vw~pvL~~a~ElSSMGIRVd~-~sL~~QL~~~~~~d-r--~~l~~Hk~ll~geLP~TIGGGIGQSRlcM~lL~KaHIGEV 334 (350) T PTZ00213 259 VYNPVLDDVLELSSMGIRVDA-EALRRQLELTGNED-R--LKCVWHQCLLNGEFPQTIGGGIGQSRMVMFMLRKKHIGEV 334 (350) T ss_pred EECHHCCCEEEEECCEEEECH-HHHHHHHHHCCCHH-H--HCCHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCCE T ss_conf 961130751256346478889-99999998759835-5--4498999997599984322762199999999732622226 Q ss_pred EC Q ss_conf 13 Q gi|254780419|r 560 SL 561 (622) Q Consensus 560 Ia 561 (622) .+ T Consensus 335 Qa 336 (350) T PTZ00213 335 QC 336 (350) T ss_pred EE T ss_conf 64 No 112 >PRK06253 O-phosphoseryl-tRNA synthetase; Reviewed Probab=94.45 E-value=0.028 Score=35.82 Aligned_cols=65 Identities=26% Similarity=0.413 Sum_probs=43.0 Q ss_pred CCCHHHHHHHHHCCC--------HH------HHEEEHHHHCCCCC-CCCCCCCCEEEEEEEE--CCCCHHHHHHHHHHHH Q ss_conf 238545644331057--------02------33110011037776-6553800003565410--2689999999999999 Q gi|254780419|r 195 PQAPQQYKQLLMASG--------FD------RYFQIAPCFRDEDP-RSDRLPGEFYQLDMEM--SFVEQDDILNTMENVL 257 (622) Q Consensus 195 pQSPQlyKQlLmv~G--------~d------ryfqiarcFRdEd~-R~dRqp~EFTqlD~Em--sF~~~edvm~l~E~li 257 (622) |+|.++--+--|-|| .+ |.|.|.||||.|-. .+.|-| ||.|..+=. .-++.++...+.++++ T Consensus 177 p~s~~~tLRSHmTsgwfitL~~l~~k~~~P~klFSIDRcFRrEq~eD~~hL~-~yhsascvv~~edVn~d~gkav~~~lL 255 (527) T PRK06253 177 PESSRLTLRSHMTSGWFITLSALWERKPLPFKLFSIDRCFRREQKEDASRLM-TYHSASCVVAGEDVTVDDGKAVAEGLL 255 (527) T ss_pred CCCCCCEEHHCCCCHHHHHHHHHHHCCCCCEEEEEEEHEEECHHHCCHHHHH-HHHEEEEEEECCCCCHHHHHHHHHHHH T ss_conf 5444534100103027887999972679983687410102200002455634-544322699759866777799999999 Q ss_pred HHH Q ss_conf 999 Q gi|254780419|r 258 RGV 260 (622) Q Consensus 258 ~~i 260 (622) .+. T Consensus 256 ~~f 258 (527) T PRK06253 256 SQF 258 (527) T ss_pred HHH T ss_conf 970 No 113 >TIGR00418 thrS threonyl-tRNA synthetase; InterPro: IPR002320 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Threonyl-tRNA synthetase (6.1.1.3 from EC) exists as a monomer and belongs to class IIa. The enzyme from Escherichia coli represses the translation of its own mRNA. The crystal structure of the complex between tRNA(Thr) and ThrRS show structural features that reveal novel strategies for providing specificity in tRNA selection. These include an amino-terminal domain containing a novel protein fold that makes minor groove contacts with the tRNA acceptor stem. The enzyme induces a large deformation of the anticodon loop, resulting in an interaction between two adjacent anticodon bases, which accounts for their prominent role in tRNA identity and translational regulation. A zinc ion found in the active site is implicated in amino acid recognition/discrimination . The zinc ion may act to ensure that only amino acids that possess a hydroxyl group attached to the beta-position are activated .; GO: 0004829 threonine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006435 threonyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=94.26 E-value=0.11 Score=31.59 Aligned_cols=108 Identities=27% Similarity=0.356 Sum_probs=69.6 Q ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCCCCCCC-C--C-C---------------CCC-CCCCCCCHHHHHHCCCCHHHHH Q ss_conf 45899999998752036514620332112464-5--6-6---------------454-3222221034431123854564 Q gi|254780419|r 143 VKRTRIINSMRCRMIAENFIECSTPILTASSP-E--G-A---------------RDF-LVPSRVNQGSFYALPQAPQQYK 202 (622) Q Consensus 143 ~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTp-e--G-A---------------rdF-lVPsR~~~g~fYALpQSPQlyK 202 (622) ++|..+...+|+-..+.||.+|+||+|..+.- + | + ++| |=|. ..||.+ |.|| T Consensus 213 ~ir~~ledf~~~~~~~~Gy~~V~TP~m~~~~l~~~sGH~~~Y~e~Mf~~~~~~~~~~~LkPM-nCpgH~-------~i~k 284 (595) T TIGR00418 213 LIRNLLEDFVREKQIKRGYEEVETPIMYDSELWEISGHWDHYKERMFPFTEEDNQEFMLKPM-NCPGHI-------LIFK 284 (595) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCEECCCCCHHHHHHHHHCCCCCCCCEEECCCCCCCCCC-CCHHHH-------HHHC T ss_conf 78999999999999875886415741215654245305450342354010103530334567-754453-------3001 Q ss_pred HHHHCC--CH-HHHEEEH-HHHCCCCCCCC----CCCCCEEEEEEEECCCC-HHHHHHHHHHHHHHHH Q ss_conf 433105--70-2331100-11037776655----38000035654102689-9999999999999999 Q gi|254780419|r 203 QLLMAS--GF-DRYFQIA-PCFRDEDPRSD----RLPGEFYQLDMEMSFVE-QDDILNTMENVLRGVF 261 (622) Q Consensus 203 QlLmv~--G~-dryfqia-rcFRdEd~R~d----Rqp~EFTqlD~EmsF~~-~edvm~l~E~li~~i~ 261 (622) |-+ -| -+ =|.||++ -|||.|-|=+= |== .|||-|+= =|++ .+++.+=+++.+.-+. T Consensus 285 ~~~-~SYR~LP~R~aE~g~~~hR~E~sG~L~GL~RvR-~FT~dDaH-ifc~d~dQi~~E~~~~~~l~~ 349 (595) T TIGR00418 285 SSL-RSYRDLPLRIAELGTTVHRYEKSGALHGLKRVR-GFTQDDAH-IFCTDEDQIKSEIKNQFRLIQ 349 (595) T ss_pred CCC-CCHHHCCCHHHHCCCEEEEEECCCCCCHHHHHH-HCCCCCCC-EEECCHHHHHHHHHHHHHHHH T ss_conf 777-773233503433186047884254210101644-30699852-453786779999999999999 No 114 >KOG2784 consensus Probab=94.25 E-value=0.081 Score=32.49 Aligned_cols=21 Identities=19% Similarity=0.134 Sum_probs=12.1 Q ss_pred HHHHHHHHHCCCEEEEECCCC Q ss_conf 999987520365146203321 Q gi|254780419|r 149 INSMRCRMIAENFIECSTPIL 169 (622) Q Consensus 149 ~~~~R~~l~~~gF~EveTP~L 169 (622) -..+|+.|.+.||.|.-|--. T Consensus 218 R~eFRqiF~emGFsEMptn~y 238 (483) T KOG2784 218 REEFRQIFFEMGFSEMPTNNY 238 (483) T ss_pred HHHHHHHHHHCCCCCCCCCCC T ss_conf 999999999806230665440 No 115 >COG5235 RFA2 Single-stranded DNA-binding replication protein A (RPA), medium (30 kD) subunit [DNA replication, recombination, and repair] Probab=94.01 E-value=0.54 Score=26.60 Aligned_cols=73 Identities=22% Similarity=0.241 Sum_probs=54.5 Q ss_pred EEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECC-CCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEE Q ss_conf 89999798004446987999997798259999748-81578998528988589999999975865558877786599998 Q gi|254780419|r 20 FVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANP-DSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELSA 98 (622) Q Consensus 20 ~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~-~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~~ 98 (622) .|++-|||.++..+-.-+|+.+-|++|.+-|-.=+ +....+..+.+..+-.|.|.|-++.= .|.+.|.+ T Consensus 68 ~V~fVGvvrni~~~ttn~~~~iEDGTG~Ievr~W~~~~~~~e~~~d~~~~~yvkV~G~lk~F----------~GK~~I~~ 137 (258) T COG5235 68 NVQFVGVVRNIKTSTTNSMFVIEDGTGSIEVRFWPGNSYEEEQCKDLEEQNYVKVNGSLKTF----------NGKRSISA 137 (258) T ss_pred EEEEEEEEEEEEECCCCEEEEEECCCCEEEEEECCCCCHHHHHCCCCCCCCEEEEECCEEEE----------CCEEEEEH T ss_conf 68999999840103454189993487428988568872377754343326679994211220----------78057741 Q ss_pred EEEE Q ss_conf 8999 Q gi|254780419|r 99 QKIE 102 (622) Q Consensus 99 ~~~~ 102 (622) .-|. T Consensus 138 ~~i~ 141 (258) T COG5235 138 SHIS 141 (258) T ss_pred HHEE T ss_conf 0052 No 116 >PRK09348 glyQ glycyl-tRNA synthetase subunit alpha; Validated Probab=94.00 E-value=0.037 Score=34.90 Aligned_cols=36 Identities=17% Similarity=0.173 Sum_probs=18.6 Q ss_pred HHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHH Q ss_conf 9999998752036514620332112464566454322222103443 Q gi|254780419|r 147 RIINSMRCRMIAENFIECSTPILTASSPEGARDFLVPSRVNQGSFY 192 (622) Q Consensus 147 ~i~~~~R~~l~~~gF~EveTP~L~ksTpeGArdFlVPsR~~~g~fY 192 (622) .++..+.+|..++|.+=++.=-+ +-||.-| ||..|. T Consensus 6 ~~I~~Lq~~W~~~GC~i~qpyd~----evGAgT~------~p~Tfl 41 (291) T PRK09348 6 DIILTLQDYWADQGCAILQPYDM----EVGAGTF------HPATFL 41 (291) T ss_pred HHHHHHHHHHHHCCCEEECCCCC----CCCCCCC------CHHHHH T ss_conf 99999999997679788657544----3454658------839888 No 117 >cd04492 YhaM_OBF_like YhaM_OBF_like: A subfamily of OB folds similar to that found in Bacillus subtilis YhaM and Staphylococcus aureus cmp-binding factor-1 (SaCBF1). Both these proteins are 3'-to-5'exoribonucleases. YhaM requires Mn2+ or Co2+ for activity and is inactive in the presence of Mg2+. YhaM also has a Mn2+ dependent 3'-to-5'single-stranded DNA exonuclease activity. SaCBF is also a double-stranded DNA binding protein, binding specifically to cmp, the replication enhancer found in S. aureus plasmid pT181. Proteins in this group combine an N-terminal OB fold with a C-terminal HD domain. The HD domain is found in metal-dependent phosphohydrolases. Probab=93.92 E-value=0.56 Score=26.49 Aligned_cols=66 Identities=17% Similarity=0.260 Sum_probs=48.5 Q ss_pred CCCCCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEEEECCC Q ss_conf 446987999997798259999748815789985289885899999999758655588777865999988999950211 Q gi|254780419|r 31 RPHGGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELSAQKIEILSVAE 108 (622) Q Consensus 31 R~~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~~~~~~il~~a~ 108 (622) |+.+....+.|+|.||.+-..+=...+ +....+..+++|.|.|.|..-. |.+.|.++++..++.++ T Consensus 15 k~G~~Yl~l~L~D~tG~I~ak~W~~~~--~~~~~~~~g~~V~v~G~v~~y~----------g~~Ql~I~~i~~~~~~d 80 (83) T cd04492 15 KNGKPYLALTLQDKTGEIEAKLWDASE--EDEEKFKPGDIVHVKGRVEEYR----------GRLQLKIQRIRLVTEED 80 (83) T ss_pred CCCCCEEEEEEECCCCEEEEEECCCCH--HHHHCCCCCCEEEEEEEEEEEC----------CCEEEEEEEEEECCCCC T ss_conf 799917899998788768888857886--7883078999999999998889----------95108998968998344 No 118 >pfam02091 tRNA-synt_2e Glycyl-tRNA synthetase alpha subunit. Probab=93.92 E-value=0.018 Score=37.15 Aligned_cols=62 Identities=24% Similarity=0.359 Sum_probs=33.6 Q ss_pred HHHHHHHHHHHCCCEEEEECCCCCCCCCC-CCCCCCCCCCCCHHHHH-HCCCCHHHHHHHHHCCCHHHHEEEHHHHCCCC Q ss_conf 99999987520365146203321124645-66454322222103443-11238545644331057023311001103777 Q gi|254780419|r 147 RIINSMRCRMIAENFIECSTPILTASSPE-GARDFLVPSRVNQGSFY-ALPQAPQQYKQLLMASGFDRYFQIAPCFRDED 224 (622) Q Consensus 147 ~i~~~~R~~l~~~gF~EveTP~L~ksTpe-GArdFlVPsR~~~g~fY-ALpQSPQlyKQlLmv~G~dryfqiarcFRdEd 224 (622) .++..+.+|..++|.+=++. .+-| ||.-| ||..|. ||--.| .+ +-=+-||.|--| T Consensus 3 ~ii~~L~~fW~~~GC~i~qp-----yd~e~GAgT~------~p~T~lr~lgp~p--w~----------~aYvqPsrRP~D 59 (284) T pfam02091 3 SMILTLQEYWASQGCVIMQP-----YDMEVGAGTF------NPATFLRALGPEP--WN----------VAYVEPSRRPTD 59 (284) T ss_pred HHHHHHHHHHHHCCCEEECC-----CCCCCCCCCC------CHHHHHHHCCCCC--CE----------EEEECCCCCCCC T ss_conf 89999999998779688777-----6542464658------8899886319986--41----------554246779999 Q ss_pred CCCCCCC Q ss_conf 6655380 Q gi|254780419|r 225 PRSDRLP 231 (622) Q Consensus 225 ~R~dRqp 231 (622) .|=--.| T Consensus 60 gRYGenP 66 (284) T pfam02091 60 GRYGENP 66 (284) T ss_pred CCCCCCC T ss_conf 8777892 No 119 >cd00733 GlyRS_alpha_core Class II Glycyl-tRNA synthetase (GlyRS) alpha subunit core catalytic domain. GlyRS functions as a homodimer in eukaryotes, archaea and some bacteria and as a heterotetramer in the remainder of prokaryotes and in arabidopsis. It is responsible for the attachment of glycine to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. This alignment contains only sequences from the GlyRS form which heterotetramerizes. The homodimer form of GlyRS is in a different family of class II aaRS. Class II assignment is based upon structure and the presence of three characteristic sequence motifs. Probab=93.73 E-value=0.022 Score=36.53 Aligned_cols=28 Identities=36% Similarity=0.579 Sum_probs=18.0 Q ss_pred CCCCCEECCHHHHHHHHCCCCCCEEEEC Q ss_conf 9734222468999999837398403213 Q gi|254780419|r 534 PPHGGIAAGIDRIVMLLLGAKNVREVSL 561 (622) Q Consensus 534 PPHgG~alGlDRlvmll~g~~sIRdVIa 561 (622) |..+-+.-|++||.|.|-|.+||-|++- T Consensus 151 pv~~EiTYGLERiaMylQ~vd~v~dl~w 178 (279) T cd00733 151 PISVEITYGLERIAMYLQGVDNVYDIEW 178 (279) T ss_pred CCCEEEEHHHHHHHHHHHCCCHHEEEEC T ss_conf 5323310119999999857542314123 No 120 >pfam04076 BOF Bacterial OB fold (BOF) protein. Proteins in this family form an OB-fold. Analysis of the predicted binding site of BOF family proteins implies that they lack nucleic acid-binding properties. They contain an predicted N-terminal signal peptide which indicates that they localize in the periplasm where they may function to bind proteins, small molecules, or other typical OB-fold ligands. As hypothesized for the distantly related OB-fold containing bacterial enterotoxins, the loss of nucleotide-binding function and the rapid evolution of the BOF ligand-binding site may be associated with the presence of BOF proteins in mobile genetic elements and their potential role in bacterial pathogenicity. Probab=93.55 E-value=0.49 Score=26.87 Aligned_cols=77 Identities=21% Similarity=0.246 Sum_probs=52.7 Q ss_pred HHCCHHHCCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCC Q ss_conf 23697998988999979800444698799999779825999974881578998528988589999999975865558877 Q gi|254780419|r 10 GDLGVSDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANI 89 (622) Q Consensus 10 g~l~~~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~ 89 (622) .+......+..|+|.|.|.+.-..-. ...||++|.+++-++.+ .+. -..+++++-|.+.|.|-+.- + T Consensus 49 ~~a~~~~Dd~~V~L~G~Iv~~l~~d~---Y~F~D~TG~I~VeId~~--~w~-G~~v~p~~kVrI~GevDk~~----~--- 115 (126) T pfam04076 49 KQAKSAADDAWVSLEGNIVRQIGDDE---YEFRDASGTIKVDIDDR--VWN-GQEVQPKDKVKITGEVDKDL----N--- 115 (126) T ss_pred HHHHHCCCCCCEEEEEEEEEEECCCE---EEEECCCCCEEEEECHH--HCC-CCCCCCCCEEEEEEEECCCC----C--- T ss_conf 98865879993999889996848953---89978996399996888--838-98179999899999994788----8--- Q ss_pred CCCEEEEEEEEEE Q ss_conf 7865999988999 Q gi|254780419|r 90 ITGQIELSAQKIE 102 (622) Q Consensus 90 ~tG~~Ei~~~~~~ 102 (622) ..||.|..|+ T Consensus 116 ---~~eIdV~~I~ 125 (126) T pfam04076 116 ---KAEIDVKFIE 125 (126) T ss_pred ---CEEEEEEEEE T ss_conf ---7179888876 No 121 >COG3111 Periplasmic protein with OB-fold [Function unknown] Probab=93.22 E-value=0.54 Score=26.60 Aligned_cols=75 Identities=23% Similarity=0.239 Sum_probs=52.8 Q ss_pred HHHCCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCE Q ss_conf 79989889999798004446987999997798259999748815789985289885899999999758655588777865 Q gi|254780419|r 14 VSDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQ 93 (622) Q Consensus 14 ~~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~ 93 (622) ..+.+..|+|.|-+.+.-+.- =...||.||.+++.++.+ .| .-..+++++-|.+.|.|-+. .| . T Consensus 53 ~~~Dda~V~l~GnIv~qi~~D---~y~FrD~sGeI~VeIdd~--~w-~g~tv~P~dkV~I~GevDk~----~~------~ 116 (128) T COG3111 53 TLHDDAWVSLEGNIVRQIGDD---RYVFRDASGEINVDIDDK--VW-NGQTVTPKDKVRIQGEVDKD----WN------S 116 (128) T ss_pred CCCCCCEEEEEEEEEEEECCC---EEEEECCCCCEEEEECCC--CC-CCCCCCCCCEEEEEEEECCC----CC------C T ss_conf 302598499975677760783---699975896199981601--13-89706856689997687378----76------6 Q ss_pred EEEEEEEEEEE Q ss_conf 99998899995 Q gi|254780419|r 94 IELSAQKIEIL 104 (622) Q Consensus 94 ~Ei~~~~~~il 104 (622) .||.|..|+.+ T Consensus 117 ~eIdV~~I~k~ 127 (128) T COG3111 117 VEIDVKHIEKL 127 (128) T ss_pred CEEEHHHEEEC T ss_conf 25675155863 No 122 >COG2502 AsnA Asparagine synthetase A [Amino acid transport and metabolism] Probab=93.18 E-value=0.17 Score=30.14 Aligned_cols=70 Identities=26% Similarity=0.300 Sum_probs=50.2 Q ss_pred CEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEEEC Q ss_conf 67721251231569999999998599989998667789999860849734222468999999837398403213 Q gi|254780419|r 488 GFEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREVSL 561 (622) Q Consensus 488 G~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdVIa 561 (622) ++||.|=+||+....+ ++-....|.. + ...+.|--..|.--+|-+-|=|||=-||+|+|+....|-+|-+ T Consensus 246 afElSSMGIRVde~~l-~~Ql~ltgde--D-rl~~~wHq~llng~lP~TIGGGIGQSRl~M~lL~k~HIGeVQ~ 315 (330) T COG2502 246 AFELSSMGIRVDEDAL-KRQLALTGDE--D-RLELEWHQMLLNGELPQTIGGGIGQSRLCMLLLQKKHIGEVQA 315 (330) T ss_pred CEEEECCEEEECHHHH-HHHHHCCCCH--H-HHCCHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCCCCCEEEE T ss_conf 1022124046658999-9998526863--5-4307699999739777110475018899999961232241551 No 123 >PRK08661 prolyl-tRNA synthetase; Provisional Probab=93.00 E-value=0.26 Score=28.84 Aligned_cols=115 Identities=20% Similarity=0.282 Sum_probs=74.7 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCC--------CCC-CCC-CCC---------------CCCCCHHHHHH-C Q ss_conf 124589999999875203651462033211246--------456-645-432---------------22221034431-1 Q gi|254780419|r 141 NIVKRTRIINSMRCRMIAENFIECSTPILTASS--------PEG-ARD-FLV---------------PSRVNQGSFYA-L 194 (622) Q Consensus 141 ~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksT--------peG-Ard-FlV---------------PsR~~~g~fYA-L 194 (622) -+++.-+|-+.+|+-|.+.|--|+--|+|...+ -|| |-+ |.| || +.--+|. . T Consensus 44 g~~iw~~i~~~~~~~~~~~g~~~~~fP~liP~~~l~kE~~hvegF~pE~~~VT~~G~~~l~~~l~lrPT--sEt~i~~~~ 121 (478) T PRK08661 44 GYAIWENIQKILDEMFKETGHENVYFPLLIPESLLQKEAEHIEGFAPEVAWVTHGGGEKLEEPLALRPT--SETIIYPMF 121 (478) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCCC--CHHHHHHHH T ss_conf 899999999999999986398087302406689998666566156740489954787555676310888--799999999 Q ss_pred CCCHHHHHHHHHCCCHHHHEEEHHHHCCCCCCCC---CCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 2385456443310570233110011037776655---3800003565410268999999999999999999980 Q gi|254780419|r 195 PQAPQQYKQLLMASGFDRYFQIAPCFRDEDPRSD---RLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFS 265 (622) Q Consensus 195 pQSPQlyKQlLmv~G~dryfqiarcFRdEd~R~d---Rqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~ 265 (622) ..-=+=||||=+ ++|||+-.||+| .|+. |. .||.+=|...+++|.++...-+..|+ .+++.+. T Consensus 122 ~~wI~SyrdLPl-----~l~Qw~~v~R~E-~r~r~flR~-rEFl~qe~ht~h~t~eea~~~~~~~~-~~Y~~~~ 187 (478) T PRK08661 122 KKWIQSYRDLPL-----LYNQWVNVVRWE-KKTRPFLRT-REFLWQEGHTAHATEEEAEEETLEML-EIYKEFA 187 (478) T ss_pred HHHHHHHHHCCC-----EEEEEEEEEECC-CCCCCHHHH-HHHHHHCCCCCCCCHHHHHHHHHHHH-HHHHHHH T ss_conf 998523652690-----002004458657-889631434-26354212011399999999999999-9999999 No 124 >COG3689 Predicted membrane protein [Function unknown] Probab=92.69 E-value=0.68 Score=25.85 Aligned_cols=11 Identities=36% Similarity=0.751 Sum_probs=4.5 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999999 Q gi|254780419|r 247 DDILNTMENVL 257 (622) Q Consensus 247 edvm~l~E~li 257 (622) +|-++++|.+- T Consensus 157 enyl~~m~~iy 167 (271) T COG3689 157 ENYLDTMEEIY 167 (271) T ss_pred HHHHHHHHHHH T ss_conf 57999999875 No 125 >cd04489 ExoVII_LU_OBF ExoVII_LU_OBF: A subfamily of OB folds corresponding to the N-terminal OB-fold domain of Escherichia coli exodeoxyribonuclease VII (ExoVII) large subunit. E. coli ExoVII is composed of two non-identical subunits. E. coli ExoVII is a single-strand-specific exonuclease which degrades ssDNA from both 3-prime and 5-prime ends. ExoVII plays a role in methyl-directed mismatch repair in vivo. ExoVII may also guard the genome from mutagenesis by removing excess ssDNA, since the build up of ssDNA would lead to SOS induction and PolIV-dependent mutagenesis. Probab=91.97 E-value=1 Score=24.58 Aligned_cols=75 Identities=16% Similarity=0.216 Sum_probs=55.6 Q ss_pred EEEEEEECCCCC-CCCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEE Q ss_conf 999979800444-6987999997798259999748815789985289885899999999758655588777865999988 Q gi|254780419|r 21 VRLSGWVHRVRP-HGGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELSAQ 99 (622) Q Consensus 21 V~i~GwV~~~R~-~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~~~ 99 (622) |.|.|=|.+.+. .++-.|++|+|....+.|++-+... ......+..+.-|.|.|.+.-= .+.|.+.+.++ T Consensus 2 vwV~GEIs~~~~~~sGh~Yf~LkD~~a~i~~v~~~~~~-~~~~~~~~~G~~V~v~g~~~~y--------~~~G~~ql~v~ 72 (78) T cd04489 2 VWVEGEISNLKRPSSGHLYFTLKDEDASIRCVMWRSNA-RRLGFPLEEGMEVLVRGKVSFY--------EPRGGYQLIVE 72 (78) T ss_pred EEEEEEECCCEECCCCEEEEEEECCCEEEEEEEEHHHH-HHCCCCCCCCCEEEEEEEEEEE--------CCCCEEEEEEE T ss_conf 89999988558289963999994788199999966788-3279998899899999999999--------89840999999 Q ss_pred EEEEE Q ss_conf 99995 Q gi|254780419|r 100 KIEIL 104 (622) Q Consensus 100 ~~~il 104 (622) +++.. T Consensus 73 ~i~~~ 77 (78) T cd04489 73 EIEPA 77 (78) T ss_pred EEEEC T ss_conf 99978 No 126 >PRK00037 hisS histidyl-tRNA synthetase; Reviewed Probab=90.45 E-value=0.4 Score=27.51 Aligned_cols=54 Identities=28% Similarity=0.442 Sum_probs=30.0 Q ss_pred CCCCCEECCHHHHHHHHCCCCCCEEEECCCCCCCCCCCCCCCCCCC---CHHHHHHCCCEEECC Q ss_conf 9734222468999999837398403213789998710346886728---988998708356057 Q gi|254780419|r 534 PPHGGIAAGIDRIVMLLLGAKNVREVSLFPMSQNFCDLLMGSPSTV---SVEQLRELGLRIVEN 594 (622) Q Consensus 534 PPHgG~alGlDRlvmll~g~~sIRdVIaFPKt~~g~Dlm~~aPs~v---~~~qL~e~~i~~~~~ 594 (622) -|.-|||+|+||++.+|.... .+|. +-...++.++-... =.++|++-||++.-. T Consensus 300 ~PAvGfa~gieRi~~~l~~~~------~~~~-~v~v~~~~~~~~~~a~~i~~~Lr~~gi~v~~~ 356 (417) T PRK00037 300 TPAVGFAIGIERLLLLLEELG------LESV-DVYVVVLGEEAEAAALKLAEKLRAAGIRVELD 356 (417) T ss_pred CCEEEEEECHHHHHHHHHHCC------CCCC-CEEEEECCHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 874789863999999999648------7877-67999659899999999999999889969995 No 127 >cd00768 class_II_aaRS-like_core Class II tRNA amino-acyl synthetase-like catalytic core domain. Class II amino acyl-tRNA synthetases (aaRS) share a common fold and generally attach an amino acid to the 3' OH of ribose of the appropriate tRNA. PheRS is an exception in that it attaches the amino acid at the 2'-OH group, like class I aaRSs. These enzymes are usually homodimers. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. The substrate specificity of this reaction is further determined by additional domains. Intererestingly, this domain is also found is asparagine synthase A (AsnA), in the accessory subunit of mitochondrial polymerase gamma and in the bacterial ATP phosphoribosyltransferase regulatory subunit HisZ. Probab=90.09 E-value=0.022 Score=36.55 Aligned_cols=53 Identities=28% Similarity=0.490 Sum_probs=37.6 Q ss_pred HHCCCEEEC-----CEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHHH Q ss_conf 313677775-----67721251231569999999998599989998667789999860849734222468999 Q gi|254780419|r 479 AFQYDLVCN-----GFEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMPPHGGIAAGIDRI 546 (622) Q Consensus 479 a~~YDLVlN-----G~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~alGlDRl 546 (622) +..||+.++ +.|+||.|..- +.|.+.|++- |.-+..++..|+++|+|||++|| T Consensus 154 ~~~~Die~~~p~~~~~Ev~s~~~~~---d~qsr~~~i~------------y~~~dg~~~~~~~~~~gl~v~Rl 211 (211) T cd00768 154 GPGFEIEVDHPEGRGLEIGSGGYRQ---DEQARAADLY------------FLDEALEYRYPPTIGFGLGLERL 211 (211) T ss_pred EEEEEEEEECCCCCEEEEECEECCC---CHHHHHCCCE------------EECCCCCEEEEEEEECEECCCCC T ss_conf 3545578983789799993546167---7878846989------------99999999875563032413669 No 128 >PRK10053 hypothetical protein; Provisional Probab=89.61 E-value=1.7 Score=23.11 Aligned_cols=75 Identities=17% Similarity=0.293 Sum_probs=50.7 Q ss_pred CCHHHCCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCC Q ss_conf 69799898899997980044469879999977982599997488157899852898858999999997586555887778 Q gi|254780419|r 12 LGVSDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIIT 91 (622) Q Consensus 12 l~~~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~t 91 (622) ....+.|..|+|.|.|...-..-+.. .||.||.+.+-++.+ .|. -..+++++-|.+.|.|-+.- T Consensus 55 Ak~m~Dda~V~L~GnIv~~lg~d~Y~---FrD~TG~I~VeId~~--~W~-G~~VtP~d~V~I~GeVDke~---------- 118 (130) T PRK10053 55 AKTMHDGATVSLRGNLIDHKGDDRYV---FRDKSGEINVIIPAA--VFD-GREVQPDQMININGSLDKKS---------- 118 (130) T ss_pred HHHCCCCCEEEEEEEEEEECCCCEEE---EECCCCCEEEEECHH--HCC-CCCCCCCCEEEEEEEECCCC---------- T ss_conf 86466998499988888614883489---977997599996777--748-97859998789988972566---------- Q ss_pred CEEEEEEEEEE Q ss_conf 65999988999 Q gi|254780419|r 92 GQIELSAQKIE 102 (622) Q Consensus 92 G~~Ei~~~~~~ 102 (622) +.-.|.|+.++ T Consensus 119 ~~~~vdV~~iq 129 (130) T PRK10053 119 APPVVRVTHLQ 129 (130) T ss_pred CCCEEEEEEEC T ss_conf 88289988741 No 129 >PRK12420 histidyl-tRNA synthetase; Provisional Probab=89.54 E-value=0.32 Score=28.24 Aligned_cols=43 Identities=28% Similarity=0.532 Sum_probs=27.0 Q ss_pred HHHHHHHHHC-CCC-CCCCEECCHHHHHHHHCCCCCCE---EEECCCC Q ss_conf 7789999860-849-73422246899999983739840---3213789 Q gi|254780419|r 522 FGGLYRAFQC-GMP-PHGGIAAGIDRIVMLLLGAKNVR---EVSLFPM 564 (622) Q Consensus 522 Fg~lL~Al~y-G~P-PHgG~alGlDRlvmll~g~~sIR---dVIaFPK 564 (622) ...|++.|-- +.| |.-|||+|+|||+++|....+.. ||..-|- T Consensus 299 YD~Li~~fgg~~~~~PavGfa~Gierl~~~L~~~~~~~~~~~v~v~~~ 346 (421) T PRK12420 299 YDNIIGAFRGDNMNYPTVGISFGLDVIYTALSQKETISSTADVFIIPL 346 (421) T ss_pred HHHHHHHHCCCCCCCCEEEEEEHHHHHHHHHHCCCCCCCCCCEEEEEC T ss_conf 054898818998999747998519999999855045798898899978 No 130 >PRK12292 hisZ ATP phosphoribosyltransferase regulatory subunit; Provisional Probab=89.04 E-value=0.68 Score=25.88 Aligned_cols=58 Identities=28% Similarity=0.339 Sum_probs=34.2 Q ss_pred HHHHHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEEECCCCCCCCCCCCCCCCCCCC------HHHHHHCCCEEE Q ss_conf 899998608497342224689999998373984032137899987103468867289------889987083560 Q gi|254780419|r 524 GLYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREVSLFPMSQNFCDLLMGSPSTVS------VEQLRELGLRIV 592 (622) Q Consensus 524 ~lL~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdVIaFPKt~~g~Dlm~~aPs~v~------~~qL~e~~i~~~ 592 (622) .|++.|- +--|.-|||+|+|||+++|-... ..+..+++.-.|.... .++|+.-|+++. T Consensus 293 ~L~~~fG-~~~PAvGfa~g~eRl~~~l~~~~----------~~~~~~~~v~~~~~~~~~a~~~a~~LR~~G~~v~ 356 (388) T PRK12292 293 DLGGRFG-RARPATGFSLDLDRLRELLPEEE----------ERKQKDLLILAPWERLEAALAAAQALRKKGEIVV 356 (388) T ss_pred HHHHHHC-CCCCEEEEEECHHHHHHHHHHCC----------CCCCCEEEEECCHHHHHHHHHHHHHHHHCCCEEE T ss_conf 7999838-99971589963999999864205----------6788679998586769999999999997798899 No 131 >COG0752 GlyQ Glycyl-tRNA synthetase, alpha subunit [Translation, ribosomal structure and biogenesis] Probab=88.74 E-value=0.35 Score=27.92 Aligned_cols=63 Identities=19% Similarity=0.202 Sum_probs=33.7 Q ss_pred HHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCCHHH--HEEEHHHHCCC Q ss_conf 9999999875203651462033211246456645432222210344311238545644331057023--31100110377 Q gi|254780419|r 146 TRIINSMRCRMIAENFIECSTPILTASSPEGARDFLVPSRVNQGSFYALPQAPQQYKQLLMASGFDR--YFQIAPCFRDE 223 (622) Q Consensus 146 s~i~~~~R~~l~~~gF~EveTP~L~ksTpeGArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G~dr--yfqiarcFRdE 223 (622) ..++..+.+|..++|.+=+..=-+ +-||.-| ||..|. .+-|=+- +==+.||=|-- T Consensus 8 Q~~IltLq~yW~~qGC~i~QpyD~----evGAGT~------hPaTfL-------------ralGpePw~aAYVqPSRRP~ 64 (298) T COG0752 8 QGLILTLQNYWAEQGCTILQPYDM----EVGAGTF------HPATFL-------------RALGPEPWNAAYVQPSRRPT 64 (298) T ss_pred HHHHHHHHHHHHHCCCEEECCCCC----CCCCCCC------CHHHHH-------------HHCCCCCCCEEEECCCCCCC T ss_conf 999999999998769786145534----4566767------768899-------------76099764100105677998 Q ss_pred CCCCCCCC Q ss_conf 76655380 Q gi|254780419|r 224 DPRSDRLP 231 (622) Q Consensus 224 d~R~dRqp 231 (622) |.|---+| T Consensus 65 DGRYGenP 72 (298) T COG0752 65 DGRYGENP 72 (298) T ss_pred CCCCCCCC T ss_conf 88778892 No 132 >CHL00201 syh histidine-tRNA synthetase; Provisional Probab=88.65 E-value=0.53 Score=26.62 Aligned_cols=61 Identities=21% Similarity=0.257 Sum_probs=34.0 Q ss_pred HHHHHHHCCC--CCCCCEECCHHHHHHHHCCCCCCEEEECCCCCCCCCCCCCCCC-----CCCCHHHHHHCCCEEE Q ss_conf 8999986084--9734222468999999837398403213789998710346886-----7289889987083560 Q gi|254780419|r 524 GLYRAFQCGM--PPHGGIAAGIDRIVMLLLGAKNVREVSLFPMSQNFCDLLMGSP-----STVSVEQLRELGLRIV 592 (622) Q Consensus 524 ~lL~Al~yG~--PPHgG~alGlDRlvmll~g~~sIRdVIaFPKt~~g~Dlm~~aP-----s~v~~~qL~e~~i~~~ 592 (622) .|+..| |. -|.-|||+|+|||+.+|-. -..+|+++...-...-.+ +.--.++|+.-||++- T Consensus 289 ~Lv~~f--gg~~~PavGfaiGieRl~~~l~~------~~~~~~~~~~~~v~~~~~~~~~~~~~ia~~LR~~gi~~e 356 (424) T CHL00201 289 SLVEQL--GGPKTPAVGWAIGIERLLLIAKD------NILLPKPSIDFYIATKGNVAIKIAFSIQRFLHNQGLKSE 356 (424) T ss_pred HHHHHC--CCCCCCEEEEEECHHHHHHHHHH------CCCCCCCCCCEEEEECCHHHHHHHHHHHHHHHHCCCEEE T ss_conf 789972--99999857788439999999973------377777788589997784999999999999998899399 No 133 >COG1570 XseA Exonuclease VII, large subunit [DNA replication, recombination, and repair] Probab=88.28 E-value=2 Score=22.48 Aligned_cols=76 Identities=12% Similarity=0.181 Sum_probs=56.1 Q ss_pred CEEEEEEEECCCCCCC-CEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEE Q ss_conf 8899997980044469-879999977982599997488157899852898858999999997586555887778659999 Q gi|254780419|r 19 SFVRLSGWVHRVRPHG-GIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELS 97 (622) Q Consensus 19 ~~V~i~GwV~~~R~~g-~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~ 97 (622) ..|.|.|=|.+.+.+. +-.|+.|.|....++|++-+.... .+--.+..+.-|.|.|.|..=+ +-|++.+. T Consensus 24 ~~V~v~GEISn~t~~~sgH~YFtLKD~~A~i~c~mf~~~~~-~l~f~p~eG~~V~v~G~is~Y~--------~rG~YQi~ 94 (440) T COG1570 24 GQVWVRGEISNFTRPASGHLYFTLKDERAQIRCVMFKGNNR-RLKFRPEEGMQVLVRGKISLYE--------PRGDYQIV 94 (440) T ss_pred CEEEEEEEECCCCCCCCCCEEEEECCCCCEEEEEEECCCCC-CCCCCCCCCCEEEEEEEEEEEC--------CCCCEEEE T ss_conf 85999999637766888607999716771689999747000-2687866798899997788876--------98856999 Q ss_pred EEEEEE Q ss_conf 889999 Q gi|254780419|r 98 AQKIEI 103 (622) Q Consensus 98 ~~~~~i 103 (622) +++++- T Consensus 95 ~~~~~p 100 (440) T COG1570 95 AESMEP 100 (440) T ss_pred EECCCC T ss_conf 832782 No 134 >PRK04173 glycyl-tRNA synthetase; Provisional Probab=88.27 E-value=0.23 Score=29.20 Aligned_cols=52 Identities=29% Similarity=0.380 Sum_probs=30.0 Q ss_pred HHHHHHCCCHH---HHEEEHHHHCCCCC-CCC--CCCCCEEEEEEEECCCCHHHHHHHHH Q ss_conf 64433105702---33110011037776-655--38000035654102689999999999 Q gi|254780419|r 201 YKQLLMASGFD---RYFQIAPCFRDEDP-RSD--RLPGEFYQLDMEMSFVEQDDILNTME 254 (622) Q Consensus 201 yKQlLmv~G~d---ryfqiarcFRdEd~-R~d--Rqp~EFTqlD~EmsF~~~edvm~l~E 254 (622) ||.++=.++.. -+-|||++||||=+ |.- |.= ||+|.++|. |++.++-....+ T Consensus 177 Fk~v~~~~r~KlPFgiaQIGKaFRNEIsPr~~lfR~R-EFeQmEiE~-F~~P~~~~e~~~ 234 (460) T PRK04173 177 FKNVLRTARKKLPFGIAQIGKSFRNEITPRNFIFRTR-EFEQMELEF-FVKPGTDEEWFA 234 (460) T ss_pred HHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCHHHH-HHHHHHHHH-HCCCCCHHHHHH T ss_conf 1999997177888043100400124306554512234-555524331-058674589999 No 135 >cd00769 PheRS_beta_core Phenylalanyl-tRNA synthetase (PheRS) beta chain core domain. PheRS belongs to class II aminoacyl-tRNA synthetases (aaRS) based upon its structure. While class II aaRSs generally aminoacylate the 3'-OH ribose of the appropriate tRNA, PheRS is an exception in that it attaches the amino acid at the 2'-OH group, like class I aaRSs. PheRS is an alpha-2/ beta-2 tetramer. While the alpha chain contains a catalytic core domain, the beta chain has a non-catalytic core domain. Probab=88.08 E-value=2.1 Score=22.40 Aligned_cols=27 Identities=22% Similarity=0.203 Sum_probs=23.1 Q ss_pred HHHHHHHHHHHHHCCCEEEEECCCCCC Q ss_conf 899999998752036514620332112 Q gi|254780419|r 145 RTRIINSMRCRMIAENFIECSTPILTA 171 (622) Q Consensus 145 Rs~i~~~~R~~l~~~gF~EveTP~L~k 171 (622) +.++.+.+|++|...||.||-|-.|+. T Consensus 2 ~~~l~~~ir~~L~~~Gf~Ev~t~s~~~ 28 (198) T cd00769 2 LQKLERKLRRLLAGLGFQEVITYSLTS 28 (198) T ss_pred HHHHHHHHHHHHHHCCCCEEEECCCCC T ss_conf 679999999999978971413067689 No 136 >KOG2509 consensus Probab=87.92 E-value=0.45 Score=27.15 Aligned_cols=103 Identities=17% Similarity=0.331 Sum_probs=54.5 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCCC--------CCCCCC-----------CCCCCCCC------CHHHHHHCCC Q ss_conf 2458999999987520365146203321124--------645664-----------54322222------1034431123 Q gi|254780419|r 142 IVKRTRIINSMRCRMIAENFIECSTPILTAS--------SPEGAR-----------DFLVPSRV------NQGSFYALPQ 196 (622) Q Consensus 142 l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ks--------TpeGAr-----------dFlVPsR~------~~g~fYALpQ 196 (622) ..+-..+++..-+++.++||+-+.||.+.+. .|.+-. -||+-|-- |.++++ .+ T Consensus 185 a~LeqALi~yal~~l~~kGy~pl~~P~i~rkeVm~~cg~~~~~d~~~~y~ld~~~~~~LiaTaE~plAa~~~~e~~--~~ 262 (455) T KOG2509 185 AFLEQALINYALDFLNAKGYTPLTTPDILRKEVMQKCGQLPRFDEEQYYVLDGGDEKYLIATAEQPLAAYHRDEWL--EE 262 (455) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCEEEEECCCHHHHHHCCCCC--CC T ss_conf 9999999999999999769824668145369999985657678832178606886422676211106665036403--61 Q ss_pred CHHHHHHHHHCCCHHHHEEEHHHHCCCC-CCC--C----CCCCCEEEEEEEE-CCCCHHHHHHHHHHHHHH Q ss_conf 8545644331057023311001103777-665--5----3800003565410-268999999999999999 Q gi|254780419|r 197 APQQYKQLLMASGFDRYFQIAPCFRDED-PRS--D----RLPGEFYQLDMEM-SFVEQDDILNTMENVLRG 259 (622) Q Consensus 197 SPQlyKQlLmv~G~dryfqiarcFRdEd-~R~--d----Rqp~EFTqlD~Em-sF~~~edvm~l~E~li~~ 259 (622) -||-+ ||--..+|||.|- ++. . |-- +|+-+ || .+.+-++--.+.|+|+.. T Consensus 263 -~~lPi---------K~vg~S~CfR~EaGs~G~d~~GlyRVH-qF~KV--E~Fvit~Pe~S~~~~eEmi~~ 320 (455) T KOG2509 263 -DQLPI---------KYVGVSRCFRAEAGSHGKDTKGLYRVH-QFEKV--EQFVITGPEDSWEMLEEMINN 320 (455) T ss_pred -CCCCE---------EEEEHHHHHHHHHHHCCCCCCCCEEEE-EEEEE--EEEEECCCCHHHHHHHHHHHH T ss_conf -12760---------134303788877600465455514665-20025--788865853569999999999 No 137 >TIGR00137 gid gid protein; InterPro: IPR004417 The gid proteins are present in relatively few bacteria but very tightly conserved where they occur. The function of these proteins is unknown. They are closely related to gidA (glucose-inhibited division protein A), which appears to be present in all complete eubacterial genomes so far and in Saccharomyces cerevisiae.. Probab=86.94 E-value=0.58 Score=26.35 Aligned_cols=39 Identities=23% Similarity=0.393 Sum_probs=21.3 Q ss_pred HHHHHHHCCCCCCCEEEEECCCCCCC-----CHHH------HHHHHHHHHH Q ss_conf 99999818875572699808854110-----0134------7889998773 Q gi|254780419|r 385 EAIRVRLDMQDGDSCFFIAGDPLKFC-----SFAA------DARNHIAQEL 424 (622) Q Consensus 385 ~~l~~~~~~~~GD~i~f~a~~~~~~~-----~~lg------~lR~~ia~~l 424 (622) +-|...++.++-..+|| ||.-..+. .+.| +.|+.+|+++ T Consensus 329 ~lL~~~l~fk~r~~lFf-AGQ~tGvEGYvas~A~G~lAGiNAarLalGe~l 378 (444) T TIGR00137 329 KLLTASLQFKDREDLFF-AGQLTGVEGYVASAASGLLAGINAARLALGEKL 378 (444) T ss_pred HHHHHHHCCCCCCCEEE-CCEECCCHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 56557530477133133-140003035776653014777778877506610 No 138 >KOG1936 consensus Probab=86.58 E-value=0.44 Score=27.25 Aligned_cols=95 Identities=21% Similarity=0.454 Sum_probs=53.0 Q ss_pred CCHHHHHHHHHHHHHHHHCCCEEEEECCCC-CC------CCCCCCCCCCCCCCCCHHHH----HHCCCCHHHHHH-HHH- Q ss_conf 212458999999987520365146203321-12------46456645432222210344----311238545644-331- Q gi|254780419|r 140 KNIVKRTRIINSMRCRMIAENFIECSTPIL-TA------SSPEGARDFLVPSRVNQGSF----YALPQAPQQYKQ-LLM- 206 (622) Q Consensus 140 ~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L-~k------sTpeGArdFlVPsR~~~g~f----YALpQSPQlyKQ-lLm- 206 (622) .-+.+|.++...+++-|..+|..+|.||++ .+ .-.++---|-. .-+-|.- |-|.-+ |-. +.| T Consensus 72 ~qm~lRe~if~~i~~vFkrhGa~~iDTPVFElkeiL~gKYGEdskLiYdl--kDQGGEl~SLRYDLTVP---fARylAmN 146 (518) T KOG1936 72 EQMALREKIFSTIKEVFKRHGAETIDTPVFELKEILTGKYGEDSKLIYDL--KDQGGELCSLRYDLTVP---FARYLAMN 146 (518) T ss_pred HHHHHHHHHHHHHHHHHHHCCCEECCCCCHHHHHHHHHHCCCCCCEEEEH--HHCCCCEEEEECCCCCH---HHHHHHHC T ss_conf 88899999999999999971974446600459999866405532306752--32488588741356647---99999871 Q ss_pred -CCCHHHHEEEHHHHCCCCC---CCCCCCCCEEEEEEEEC Q ss_conf -0570233110011037776---65538000035654102 Q gi|254780419|r 207 -ASGFDRYFQIAPCFRDEDP---RSDRLPGEFYQLDMEMS 242 (622) Q Consensus 207 -v~G~dryfqiarcFRdEd~---R~dRqp~EFTqlD~Ems 242 (622) +.-+. -||||++||-..- |. |-- ||+|-|+..| T Consensus 147 ki~sik-Ry~iAkVyRRd~P~mtrG-R~R-EFYQcDFDIA 183 (518) T KOG1936 147 KITSIK-RYHIAKVYRRDQPAMTRG-RYR-EFYQCDFDIA 183 (518) T ss_pred CCCCCE-EEEEEEEEECCCCHHHCH-HHH-HHHCCCCCCC T ss_conf 455211-046787875068144230-245-5650675212 No 139 >pfam08661 Rep_fac-A_3 Replication factor A protein 3. Replication factor A is involved in eukaryotic DNA replication, recombination and repair. Probab=86.55 E-value=2.5 Score=21.81 Aligned_cols=67 Identities=25% Similarity=0.239 Sum_probs=45.3 Q ss_pred CCCCHHHCCHHHCCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEE Q ss_conf 742042369799898899997980044469879999977982599997488157899852898858999999997 Q gi|254780419|r 5 RTHSCGDLGVSDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMS 79 (622) Q Consensus 5 Rth~cg~l~~~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~ 79 (622) |..-.+.+=.+++|+.|++.|.|.+.+..|+.+=+.--|+ |.++|...++. .+..+..|+|.|+|.. T Consensus 7 ~pRVn~~~L~~~~gk~VrivGkV~~~~~~g~~~~~~s~d~-~~V~v~l~~~~-------~~~~~~~vEViG~V~~ 73 (111) T pfam08661 7 TPRINGSMLSQFVGKPVRIVGKVESVDPSGETAILSSTDG-GNVTVSLNSPL-------DLEISGWVEVIGKVSP 73 (111) T ss_pred CCEECHHHHHHHCCCEEEEEEEEEEECCCCCEEEEECCCC-CEEEEEECCCC-------CCCCCCEEEEEEEECC T ss_conf 7508899997507982899999977769999799995999-69999958998-------8766878999999848 No 140 >PRK10917 ATP-dependent DNA helicase RecG; Provisional Probab=86.23 E-value=2.6 Score=21.69 Aligned_cols=65 Identities=15% Similarity=0.133 Sum_probs=49.1 Q ss_pred CHHHCCCEEEEEEEECCCC---CCCCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEE Q ss_conf 9799898899997980044---469879999977982599997488157899852898858999999997 Q gi|254780419|r 13 GVSDVGSFVRLSGWVHRVR---PHGGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMS 79 (622) Q Consensus 13 ~~~~~g~~V~i~GwV~~~R---~~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~ 79 (622) .....|+.|++.|.|.+.. ..++..-+.+.|++|.+.+++-.... -..+.++.+.-+.|.|+|.. T Consensus 53 ~el~~Ge~vtI~g~V~~~~~~~~~r~~l~v~v~D~tG~I~lvfFn~~p--yl~k~l~~G~~v~VsGKV~~ 120 (677) T PRK10917 53 AELRPGEKVTVEGTVLSAEVYFGPRRRLRVTLSDGTGVLTLVFFNNQP--YLKKQLKVGKRVAVYGKVKR 120 (677) T ss_pred HHCCCCCEEEEEEEEEEEECCCCCCCEEEEEEEECCEEEEEEEECCCH--HHHHHCCCCCEEEEEEEEEE T ss_conf 897999879999999998437789724999999898799999989847--89963899998999999985 No 141 >KOG3108 consensus Probab=85.83 E-value=2.7 Score=21.56 Aligned_cols=74 Identities=22% Similarity=0.183 Sum_probs=40.7 Q ss_pred EEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECC-CCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEE Q ss_conf 89999798004446987999997798259999748-81578998528988589999999975865558877786599998 Q gi|254780419|r 20 FVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANP-DSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELSA 98 (622) Q Consensus 20 ~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~-~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~~ 98 (622) .|.+-|||.+.-+...-+++.|-|++|.+-+..=. +....+....+..+-.|.|.|.++. ..|...|.+ T Consensus 70 ~v~~VGivr~~e~~~t~i~y~I~D~tg~id~r~W~~~~~~~~e~~~l~~~~yVkv~G~Lk~----------f~Gk~sl~~ 139 (265) T KOG3108 70 AVSIVGIVRNIEKSATNITYEIEDGTGQIDVRQWFHDNAESEEMPALETGTYVKVYGHLKP----------FQGKKSLQV 139 (265) T ss_pred EEEEEEEEEECEECCCCEEEEEECCCCEEEEEECCCCCCHHHHCCCCCCCCEEEEECCCCC----------CCCCEEEEE T ss_conf 8899999970202676149998268650788870355302221741244757996401048----------899526778 Q ss_pred EEEEE Q ss_conf 89999 Q gi|254780419|r 99 QKIEI 103 (622) Q Consensus 99 ~~~~i 103 (622) .++.- T Consensus 140 fkI~p 144 (265) T KOG3108 140 FKIRP 144 (265) T ss_pred EEEEE T ss_conf 76555 No 142 >COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only] Probab=85.76 E-value=2.7 Score=21.53 Aligned_cols=24 Identities=25% Similarity=0.389 Sum_probs=12.6 Q ss_pred EEEEEEEECC-CCCCCCEEEEEEEC Q ss_conf 8999979800-44469879999977 Q gi|254780419|r 20 FVRLSGWVHR-VRPHGGIIFLDIRD 43 (622) Q Consensus 20 ~V~i~GwV~~-~R~~g~l~F~~lrD 43 (622) .|.++-||-. .|..+.+.+..-++ T Consensus 42 LvRlNP~ipyKTRGN~av~l~~~~~ 66 (421) T COG1571 42 LVRLNPNIPYKTRGNAAVLLLVARR 66 (421) T ss_pred CCCCCCCCCEECCCCCEEEEEEECC T ss_conf 4347899703316776168884047 No 143 >cd04479 RPA3 RPA3: A subfamily of OB folds similar to human RPA3 (also called RPA14). RPA3 is the smallest subunit of Replication protein A (RPA). RPA is a nuclear ssDNA binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). RPA3 is believed to have a structural role in assembly of the RPA heterotrimer. Probab=84.60 E-value=3.1 Score=21.17 Aligned_cols=75 Identities=19% Similarity=0.084 Sum_probs=51.0 Q ss_pred HHHCCHHHCCCEEEEEEEECCCCCCCCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCC Q ss_conf 42369799898899997980044469879999977982599997488157899852898858999999997586555887 Q gi|254780419|r 9 CGDLGVSDVGSFVRLSGWVHRVRPHGGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINAN 88 (622) Q Consensus 9 cg~l~~~~~g~~V~i~GwV~~~R~~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~ 88 (622) .+++=.+++|+.|++.|.|.+.+.. -.-+..-|+ +.+|+.++.+. .+..+-.|+|.|+|.. T Consensus 6 n~~~L~~f~gk~VrivGkV~~~~g~--~~~~~s~Dg-~~v~v~l~~~~-------~~~~~~~vEViG~V~~--------- 66 (101) T cd04479 6 NGAMLSQFVGKTVRIVGKVEKVDGD--SLTLISSDG-VNVTVELNRPL-------DLPISGYVEVIGKVSP--------- 66 (101) T ss_pred CHHHHHHCCCCEEEEEEEEEEECCC--EEEEEECCC-CEEEEEECCCC-------CCCCCCEEEEEEEECC--------- T ss_conf 7899965389869999999986598--169992799-98999988998-------7656857999999999--------- Q ss_pred CCCCEEEEEEEEEEEEEC Q ss_conf 778659999889999502 Q gi|254780419|r 89 IITGQIELSAQKIEILSV 106 (622) Q Consensus 89 ~~tG~~Ei~~~~~~il~~ 106 (622) ...|.+.++.-++. T Consensus 67 ----~~~I~~~~~i~~g~ 80 (101) T cd04479 67 ----DLTIRVLSYIDFGD 80 (101) T ss_pred ----CCEEEEEEEEECCC T ss_conf ----99699999987897 No 144 >PRK09537 pylS pyrolysyl-tRNA synthetase; Reviewed Probab=84.19 E-value=0.43 Score=27.26 Aligned_cols=29 Identities=28% Similarity=0.578 Sum_probs=24.5 Q ss_pred HCC-CCCCCCEECCHHHHHHHHCCCCCCEE Q ss_conf 608-49734222468999999837398403 Q gi|254780419|r 530 QCG-MPPHGGIAAGIDRIVMLLLGAKNVRE 558 (622) Q Consensus 530 ~yG-~PPHgG~alGlDRlvmll~g~~sIRd 558 (622) +|| .-|--|.|+|++||+|+.-|..|||- T Consensus 376 ~Wgi~kPWiGaGFGLERLLkV~hg~~nikr 405 (420) T PRK09537 376 EWGIDKPWIGAGFGLERLLKVMHGFKNIKR 405 (420) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHCCHHHHH T ss_conf 557667620233309999999835366777 No 145 >PRK13480 3'-5' exoribonuclease YhaM; Provisional Probab=84.11 E-value=3.2 Score=21.03 Aligned_cols=80 Identities=18% Similarity=0.207 Sum_probs=53.4 Q ss_pred CCCEEEEEEEECCC---C-CCC-CEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCC Q ss_conf 89889999798004---4-469-879999977982599997488157899852898858999999997586555887778 Q gi|254780419|r 17 VGSFVRLSGWVHRV---R-PHG-GIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIIT 91 (622) Q Consensus 17 ~g~~V~i~GwV~~~---R-~~g-~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~t 91 (622) .|+.|.-.--|.++ . ..| ...-+.|+|.||.+.+.+=.+.+ +....+..+.+|.|+|.|..-. T Consensus 10 ~G~~v~~~~lv~~~~~~~tknG~~Yl~l~L~D~tG~I~ak~W~~~~--~~~~~~~~g~~V~v~G~v~~y~---------- 77 (314) T PRK13480 10 VGESVDHFLLIKSATKGVASNGKPFLTLILQDKSGDIEAKLWDVSK--EDEATYVPETIVHVKGDIINYR---------- 77 (314) T ss_pred CCCEEEEEEEEEECHHHCCCCCCCEEEEEEECCCCEEEEEECCCCH--HHHHHCCCCCEEEEEEEEEEEC---------- T ss_conf 9988889999987143318999814799998378768899678986--6686378998899999998768---------- Q ss_pred CEEEEEEEEEEEEECCC Q ss_conf 65999988999950211 Q gi|254780419|r 92 GQIELSAQKIEILSVAE 108 (622) Q Consensus 92 G~~Ei~~~~~~il~~a~ 108 (622) |...+.+..++..+..+ T Consensus 78 g~~Ql~i~~i~~~~~~d 94 (314) T PRK13480 78 GRKQLKVNQIRLATPED 94 (314) T ss_pred CCCCEEEEEEEECCCCC T ss_conf 98625888857678667 No 146 >TIGR00470 sepS O-phosphoseryl-tRNA(Cys) synthetase; InterPro: IPR005246 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . All known phenylalanyl-tRNA synthetases, excepting a monomeric mitochondrial form, have an alpha 2 beta 2 heterotetrameric structure. This family of archaeal proteins resembles known phenylalanyl-tRNA synthetase alpha chains but is longer and of unknown function; more generally, it resembles class 2 aminoacyl tRNA ligases. . Probab=83.95 E-value=0.42 Score=27.34 Aligned_cols=45 Identities=20% Similarity=0.296 Sum_probs=35.0 Q ss_pred CCCCCEEECCCCCCCCHHHH---HHHHHHHHHHHHCCCEEEEECCCCC Q ss_conf 11232000121111212458---9999999875203651462033211 Q gi|254780419|r 126 FKYRFLDLRRDTLHKNIVKR---TRIINSMRCRMIAENFIECSTPILT 170 (622) Q Consensus 126 l~~R~LdLR~~~~~~~l~~R---s~i~~~~R~~l~~~gF~EveTP~L~ 170 (622) |+.||.|-.-|+++-.+=.- -..+.-+|+....-||-|+=-|+|. T Consensus 37 l~~~h~d~~YPRl~f~~GK~Hpl~~TI~rlReAYLr~GFsE~vNPliv 84 (558) T TIGR00470 37 LKEKHIDEKYPRLKFEFGKEHPLFETIERLREAYLRLGFSEMVNPLIV 84 (558) T ss_pred HHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCEECCCCEEE T ss_conf 622786656872455458887568999999999864683130077465 No 147 >TIGR00468 pheS phenylalanyl-tRNA synthetase, alpha subunit; InterPro: IPR004529 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Phenylalanyl-tRNA synthetase (6.1.1.20 from EC) is an alpha2/beta2 tetramer composed of 2 subunits that belongs to class IIc. In eubacteria, a small subunit (pheS gene) can be designated as beta (E. coli) or alpha subunit (nomenclature adopted in InterPro). Reciprocally the large subunit (pheT gene) can be designated as alpha (E. coli) or beta (see IPR004531 from INTERPRO and IPR004532 from INTERPRO). In all other kingdoms the two subunits have equivalent length in eukaryota, and can be identified by specific signatures. The enzyme from Thermus thermophilus has an alpha 2 beta 2 type quaternary structure and is one of the most complicated members of the synthetase family. Identification of phenylalanyl-tRNA synthetase as a member of class II aaRSs was based only on sequence alignment of the small alpha-subunit with other synthetases . This family describes the alpha subunit, which shows some similarity to class II aminoacyl-tRNA ligases. Mitochondrial phenylalanyl-tRNA synthetase is a single polypeptide chain, active as a monomer, and similar to this chain rather than to the beta chain, but excluded from this family. ; GO: 0000166 nucleotide binding, 0004826 phenylalanine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006432 phenylalanyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=83.86 E-value=1.3 Score=23.92 Aligned_cols=83 Identities=27% Similarity=0.429 Sum_probs=53.5 Q ss_pred CCCCCHHHCCCCCCCCCHHHHHCC-CHHHHHHHCCCEEE--CCE-EECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 470002110024766430655418-87774131367777--567-72125123156999999999859998999866778 Q gi|254780419|r 449 EKKIDFAHNPFTMPQGGMESLKGN-DLLSIKAFQYDLVC--NGF-EIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGG 524 (622) Q Consensus 449 ~~r~~s~HHPFTaP~~~~~~l~~~-dp~~v~a~~YDLVl--NG~-EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~ 524 (622) .-|+.++|.|||-|..+..-.... ..+- -|| .|+ |++|.++- .-.+++.+|++++++ T Consensus 262 ~~~~~p~~fp~t~p~~e~~~~~~~c~~gc-------~~c~~~~W~e~~g~g~~------~p~~~~~~g~~~~~~------ 322 (362) T TIGR00468 262 EVRFRPSYFPFTEPSAEVDVYCPECGKGC-------SVCKGTGWLELLGAGIF------RPEVLEPLGIDPEEY------ 322 (362) T ss_pred CEEEECCCCCCCCCCCCEEEEEHHHCCCC-------HHHCCCCEEEEECCCCC------CHHHHHHCCCCCCCC------ T ss_conf 20231033665555530244200001310-------01014650354045532------423554237873101------ Q ss_pred HHHHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEEE Q ss_conf 999986084973422246899999983739840321 Q gi|254780419|r 525 LYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREVS 560 (622) Q Consensus 525 lL~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdVI 560 (622) |--++|+|++|+.|+-.|...||+.. T Consensus 323 ----------~~~~~g~g~~r~~~~~~~~~d~r~~~ 348 (362) T TIGR00468 323 ----------PGLAWGLGIERLAMLKYGLDDLRDLY 348 (362) T ss_pred ----------EEEEECCCHHHHHHHHHCHHHHHHHH T ss_conf ----------13432124012445451335688864 No 148 >cd04488 RecG_wedge_OBF RecG_wedge_OBF: A subfamily of OB folds corresponding to the OB fold found in the N-terminal (wedge) domain of Escherichia coli RecG. RecG is a branched-DNA-specific helicase, which catalyzes the interconversion of a DNA replication fork to a four-stranded (Holliday) junction in vivo and in vitro. This interconversion provides a route to repair stalled forks. The RecG monomer contains three domains. The N-terminal domain is named for its wedge structure, and may provide the specificity of RecG for binding branched-DNA structures. During the reversal of fork to Holliday junction, the wedge domain is fixed at the junction of the fork where the leading and lagging strand duplex arms meet, and is thought to promote the unwinding of the nascent leading and lagging strands. In order to form the Holliday junction, these nascent strands would be annealed, and the parental strands reannealed. The wedge domain may also be a processivity factor of RecG on these branched cha Probab=81.74 E-value=3.9 Score=20.40 Aligned_cols=56 Identities=16% Similarity=0.078 Sum_probs=39.5 Q ss_pred EEEEECCCC----CCCCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEEC Q ss_conf 997980044----4698799999779825999974881578998528988589999999975 Q gi|254780419|r 23 LSGWVHRVR----PHGGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSR 80 (622) Q Consensus 23 i~GwV~~~R----~~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r 80 (622) +.|-|.+.. ..+++.-+.+.|++|.+.+++-.-.. -..+.++.+.-+.+.|+|... T Consensus 2 i~g~I~~~~~~~~~~r~~l~~~v~D~tg~i~l~~Fn~~~--~~~~~~~~G~~v~v~Gkv~~~ 61 (75) T cd04488 2 VEGTVVSVEVVPRRGRRRLKVTLSDGTGTLTLVFFNFQP--YLKKQLPPGTRVRVSGKVKRF 61 (75) T ss_pred EEEEEEEEEEECCCCCEEEEEEEEECCCEEEEEEECCCH--HHHHHCCCCCEEEEEEEEEEC T ss_conf 999999988545899708999999699889999989977--999538899999999999713 No 149 >PRK12366 replication factor A; Reviewed Probab=80.76 E-value=4.2 Score=20.17 Aligned_cols=79 Identities=22% Similarity=0.375 Sum_probs=54.7 Q ss_pred C-CEEEEEEEECCC---C-------CCCCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCC Q ss_conf 9-889999798004---4-------4698799999779825999974881578998528988589999999975865558 Q gi|254780419|r 18 G-SFVRLSGWVHRV---R-------PHGGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETIN 86 (622) Q Consensus 18 g-~~V~i~GwV~~~---R-------~~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n 86 (622) | +.|.+-|.|.++ | .-|+++=+.+-|.||.+-+++=.+.. +.++.+..+.++.|+|..... . T Consensus 72 g~~~V~~~gkV~~Isd~rtF~r~dGs~GrV~nl~VaDeTG~iRltLWdd~a--~li~~le~GdvikI~g~~~r~----~- 144 (649) T PRK12366 72 GQKNVEITGKIIEISNIKEFNRKDGSTGKVASITIGDNTGIIRLTLWNDKT--KLIEGLKEGDVIKIENAFSRK----W- 144 (649) T ss_pred CCCCEEEEEEEEEEECCCEEECCCCCCCEEEEEEEECCCCCEEEEEECCHH--HHHHCCCCCCEEEECCCCCCC----C- T ss_conf 765238999998850561044268985248999997378968999860334--576356777589984631213----5- Q ss_pred CCCCCCEEEEEE---EEEEEEECC Q ss_conf 877786599998---899995021 Q gi|254780419|r 87 ANIITGQIELSA---QKIEILSVA 107 (622) Q Consensus 87 ~~~~tG~~Ei~~---~~~~il~~a 107 (622) .|.+|+.. .+++-+.+. T Consensus 145 ----~g~~El~~g~~~sI~kle~~ 164 (649) T PRK12366 145 ----NGDVELNSGSESSIEKLKEY 164 (649) T ss_pred ----CCCEEEEECCCCEEEECCCC T ss_conf ----88337871688504676734 No 150 >COG2024 Phenylalanyl-tRNA synthetase alpha subunit (archaeal type) [Translation, ribosomal structure and biogenesis] Probab=80.63 E-value=0.86 Score=25.13 Aligned_cols=31 Identities=32% Similarity=0.674 Sum_probs=26.7 Q ss_pred HHCCCC-CCCCEECCHHHHHHHHCCCCCCEEE Q ss_conf 860849-7342224689999998373984032 Q gi|254780419|r 529 FQCGMP-PHGGIAAGIDRIVMLLLGAKNVREV 559 (622) Q Consensus 529 l~yG~P-PHgG~alGlDRlvmll~g~~sIRdV 559 (622) -+||-| |---+|+|++||.|+|-|.+.+|.. T Consensus 317 aeY~Id~pVMNLGlGVERlaMIl~g~~DVR~m 348 (536) T COG2024 317 AEYGIDYPVMNLGLGVERLAMILHGADDVRSM 348 (536) T ss_pred HHCCCCCCEEECCHHHHHHHHHHHCCHHHHHH T ss_conf 87299970120322089999998174477665 No 151 >TIGR00669 asnA aspartate--ammonia ligase; InterPro: IPR004618 Aspartate--ammonia ligase (asparagine synthetase) 6.3.1.1 from EC catalyses the conversion of L-aspartate to L-asparagine in the presence of ATP and ammonia. This family represents one of two non-homologous forms of aspartate--ammonia ligase found in Escherichia coli. This type is also found in Haemophilus influenzae, Treponema pallidum and Lactobacillus delbrueckii, but appears to have a very limited distribution. The fact that the protein from the H. influenzae is more than 70 0dentical to that from the spirochete T. pallidum, but less than 65 0dentical to that from the closely related E. coli, strongly suggests lateral transfer.; GO: 0004071 aspartate-ammonia ligase activity, 0006529 asparagine biosynthetic process, 0005737 cytoplasm. Probab=80.17 E-value=1.9 Score=22.62 Aligned_cols=70 Identities=29% Similarity=0.354 Sum_probs=51.6 Q ss_pred CEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEEEC Q ss_conf 67721251231569999999998599989998667789999860849734222468999999837398403213 Q gi|254780419|r 488 GFEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREVSL 561 (622) Q Consensus 488 G~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdVIa 561 (622) .+|+.+=+||+....+..+ +...| +|-.-+..|.-+.++--+|-.-|-|+|--||.|+|.....|-+|-+ T Consensus 246 afe~ssmGirvd~~~l~~q-~~ltG---ded~l~~~Whq~l~~G~lP~~iGGGiGqsrl~mlll~~~hiG~vq~ 315 (330) T TIGR00669 246 AFELSSMGIRVDEDALRLQ-LALTG---DEDRLELEWHQDLLNGKLPQTIGGGIGQSRLAMLLLQKKHIGEVQA 315 (330) T ss_pred HHHHHHCCEEECHHHHHHH-HHCCC---CCCHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCCHHC T ss_conf 4322221115435787766-42047---6101234778997448776201565307899999862001452000 No 152 >COG1311 HYS2 Archaeal DNA polymerase II, small subunit/DNA polymerase delta, subunit B [DNA replication, recombination, and repair] Probab=79.48 E-value=4.7 Score=19.89 Aligned_cols=166 Identities=20% Similarity=0.229 Sum_probs=88.9 Q ss_pred CCCEEEEEEEECCCCCCCC-EEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEE Q ss_conf 8988999979800444698-799999779825999974881578998528988589999999975865558877786599 Q gi|254780419|r 17 VGSFVRLSGWVHRVRPHGG-IIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIE 95 (622) Q Consensus 17 ~g~~V~i~GwV~~~R~~g~-l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~E 95 (622) .|+.|+|.|=|...|...+ -.-+.+-|-+|.+.+++.++.+....+.++-..-+|.+.|.+..+.. T Consensus 137 ~g~dv~Iig~v~~~r~t~~gh~ii~~ed~tG~v~vvl~k~~e~~~~~~dvl~d~vig~~g~~t~~~~------------- 203 (481) T COG1311 137 GGSDVKIIGEVNDVRETKNGHFIISLEDTTGVVTVVLGKDREAGRFVVDVLFDEVIGVSGPVTPRSS------------- 203 (481) T ss_pred CCCCCEEEEEECCCEEEECCCEEEECCCCCCEEEEEECCCHHHHHHHHHHCCCCCCCCCCCCCCCCC------------- T ss_conf 6888479999712203204648998246566599996166233046776407752144676477544------------- Q ss_pred EEEEEEEEEECCCCCCCCCCCCCCCCHHHHCCCC-CEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCC Q ss_conf 9988999950211799864576678987821123-200012111121245899999998752036514620332112464 Q gi|254780419|r 96 LSAQKIEILSVAEELPLPVFGEREYPESIRFKYR-FLDLRRDTLHKNIVKRTRIINSMRCRMIAENFIECSTPILTASSP 174 (622) Q Consensus 96 i~~~~~~il~~a~~~P~~~~~~~~~~e~~rl~~R-~LdLR~~~~~~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTp 174 (622) .++++.+-. .|....+ ... +.| |.-+...-......++.+....+=+||...+=+--..=.|.- T Consensus 204 -~a~~~~~p~----Vpg~~~~-~~~------~e~v~v~~isDih~GSk~F~~~~f~~fi~wl~g~~~~a~~vkylii--- 268 (481) T COG1311 204 -FADRIYLPD----VPGLSLN-NTG------DERVYVALISDIHRGSKEFLEDEFEKFIDWLNGPGDLASRVKYLII--- 268 (481) T ss_pred -CCCCCEECC----CCCCCCC-CCC------CCCEEEEEEEEEECCCHHHHHHHHHHHHHHHCCCCCCCCCEEEEEE--- T ss_conf -687522346----7555679-988------7655799976111052888888999999986387543212379997--- Q ss_pred CCCCCCCCC-CCCCHHHHHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCCC Q ss_conf 566454322-2221034431123854564433105702331100110377 Q gi|254780419|r 175 EGARDFLVP-SRVNQGSFYALPQAPQQYKQLLMASGFDRYFQIAPCFRDE 223 (622) Q Consensus 175 eGArdFlVP-sR~~~g~fYALpQSPQlyKQlLmv~G~dryfqiarcFRdE 223 (622) .|+ +|+ --++|| |+.-|.++-=+|.|.++|+-+.+. T Consensus 269 agd---~VDGigiYpg----------q~~eL~i~di~~qy~~~A~~L~~v 305 (481) T COG1311 269 AGD---VVDGIGIYPG----------QEEELVIADIYEQYEELAEFLDQV 305 (481) T ss_pred ECC---CCCCCCCCCC----------CCCCCCCCCCHHHHHHHHHHHHHC T ss_conf 055---2134344468----------612100055167899999988538 No 153 >cd03524 RPA2_OBF_family RPA2_OBF_family: A family of oligonucleotide binding (OB) folds with similarity to the OB fold of the single strand (ss) DNA-binding domain (DBD)-D of human RPA2 (also called RPA32). RPA2 is a subunit of Replication protein A (RPA). RPA is a nuclear ssDNA-binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). RPA contains six OB folds, which are involved in ssDNA binding and in trimerization. The ssDNA binding mechanism is believed to be multistep and to involve conformational change. This family also includes OB folds similar to those found in Escherichia coli SSB, the wedge domain of E. coli RecG (a branched-DNA-specific helicase), E. coli ssDNA specific exodeoxyribonuclease VII large subunit, Pyroco Probab=78.86 E-value=4.8 Score=19.76 Aligned_cols=69 Identities=23% Similarity=0.342 Sum_probs=48.4 Q ss_pred EEEEEECCCCCC---CCEEEEEEECCC-CEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEE Q ss_conf 999798004446---987999997798-2599997488157899852898858999999997586555887778659999 Q gi|254780419|r 22 RLSGWVHRVRPH---GGIIFLDIRDHY-GITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELS 97 (622) Q Consensus 22 ~i~GwV~~~R~~---g~l~F~~lrD~s-G~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~ 97 (622) ++.|-|..++.. ++..-+.|.|++ +.+.|++..+... .....+..+.++.+.|.|.... |..++. T Consensus 1 ~i~g~V~~~~~~~~~~~~~~~~l~D~t~~~i~v~~~~~~~~-~~~~~~~~g~~v~i~g~v~~~~----------~~~~l~ 69 (75) T cd03524 1 TIVGIVVAVEEIRTEGKVLIFTLTDGTGGTIRVTLFGELAE-ELENLLKEGQVVYIKGKVKKFR----------GRLQLI 69 (75) T ss_pred CEEEEEEEEEEEECCCEEEEEEEECCCCCEEEEEEEEEHHH-HHHHHHHCCCEEEEEEEEEECC----------CEEEEE T ss_conf 97899999842003864999999769997999999988257-6877874782999980365236----------648999 Q ss_pred EEEE Q ss_conf 8899 Q gi|254780419|r 98 AQKI 101 (622) Q Consensus 98 ~~~~ 101 (622) +.++ T Consensus 70 ~~~~ 73 (75) T cd03524 70 VESI 73 (75) T ss_pred EEEE T ss_conf 9852 No 154 >TIGR00409 proS_fam_II prolyl-tRNA synthetase; InterPro: IPR004500 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Prolyl-tRNA synthetase (6.1.1.15 from EC) exists in two forms, which are loosely related. The first form, is present in the majority of eubacteria species. The second one, present in some eubacteria, is essentially present in archaea and eukaryota. Prolyl-tRNA synthetase belongs to class IIa. This family includes the enzyme from Escherichia coli that contains all three of the conserved consensus motifs characteristic of class II aminoacyl-tRNA synthetases and the enzyme from the spirochete Borrelia burgdorferi.; GO: 0000166 nucleotide binding, 0004827 proline-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006433 prolyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=77.22 E-value=2.7 Score=21.55 Aligned_cols=109 Identities=30% Similarity=0.472 Sum_probs=69.7 Q ss_pred CHHHHHHHHHHHHHHHHCCC-EEEEECCCCCC-------------CCC-CC---------CCCC-CCCCCCCHHHHHHC- Q ss_conf 12458999999987520365-14620332112-------------464-56---------6454-32222210344311- Q gi|254780419|r 141 NIVKRTRIINSMRCRMIAEN-FIECSTPILTA-------------SSP-EG---------ARDF-LVPSRVNQGSFYAL- 194 (622) Q Consensus 141 ~l~~Rs~i~~~~R~~l~~~g-F~EveTP~L~k-------------sTp-eG---------ArdF-lVPsR~~~g~fYAL- 194 (622) .+|.=.||-+.+|+-|++.| =+|+==|.|.. ++| || -|+| |-|| |.--==+| T Consensus 47 ~~rVL~Kv~~ivreeM~~~GGA~E~~lp~~~~la~LW~~SGRW~~YG~seGELl~lkDR~~r~f~LgpT--~EEvIT~l~ 124 (620) T TIGR00409 47 GLRVLKKVENIVREEMNKDGGAIEVLLPALQPLAELWQESGRWDTYGPSEGELLRLKDRKGREFVLGPT--HEEVITDLV 124 (620) T ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEECCCCCCCCCCC--CHHHHHHHH T ss_conf 999999889999986226785334667667789998754075324686543366765268887356786--655689999 Q ss_pred ---CCCHHHHHHHHHCCCHHHHEEEHHHHCCCC------CCCCCCCCCEEEEEEEECCCC-HHHHHH---HHHHHHHHHH Q ss_conf ---238545644331057023311001103777------665538000035654102689-999999---9999999999 Q gi|254780419|r 195 ---PQAPQQYKQLLMASGFDRYFQIAPCFRDED------PRSDRLPGEFYQLDMEMSFVE-QDDILN---TMENVLRGVF 261 (622) Q Consensus 195 ---pQSPQlyKQlLmv~G~dryfqiarcFRdEd------~R~dRqp~EFTqlD~EmsF~~-~edvm~---l~E~li~~i~ 261 (622) -.| ||||=+ ..|||-+=||||= +|+. ||-|=|.+ ||.+ +|+... .+.+..+.|| T Consensus 125 ~~~i~S---YKqLP~-----~lYQI~tKFRDEiRPRFGl~RgR----EFiMKDAY-SFH~~~ESL~~ty~~m~~aY~~IF 191 (620) T TIGR00409 125 RNEIKS---YKQLPL-----NLYQIQTKFRDEIRPRFGLLRGR----EFIMKDAY-SFHSDEESLDATYQKMDQAYSNIF 191 (620) T ss_pred HHHHHH---HHHCCC-----CEECCCCCCCCCCCCCCCCCCCH----HHHCCCCC-CCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 998754---530683-----01004512144204887600012----32302425-544787679999889999999999 Q ss_pred HHH Q ss_conf 998 Q gi|254780419|r 262 EEF 264 (622) Q Consensus 262 ~~v 264 (622) +.. T Consensus 192 ~rl 194 (620) T TIGR00409 192 SRL 194 (620) T ss_pred HHC T ss_conf 865 No 155 >COG0423 GRS1 Glycyl-tRNA synthetase (class II) [Translation, ribosomal structure and biogenesis] Probab=76.35 E-value=1 Score=24.67 Aligned_cols=31 Identities=39% Similarity=0.583 Sum_probs=21.2 Q ss_pred HEEEHHHHCCCCCCCC---CCCCCEEEEEEEECCCC Q ss_conf 3110011037776655---38000035654102689 Q gi|254780419|r 213 YFQIAPCFRDEDPRSD---RLPGEFYQLDMEMSFVE 245 (622) Q Consensus 213 yfqiarcFRdEd~R~d---Rqp~EFTqlD~EmsF~~ 245 (622) +-||||.||||=+--. |- -||||.+||. |++ T Consensus 190 iaQIGKsfRNEISPr~gl~R~-REF~QaEiE~-Fv~ 223 (558) T COG0423 190 IAQIGKSFRNEISPRNGLFRT-REFEQAEIEF-FVD 223 (558) T ss_pred EEEECHHHCCCCCCCCCEEEH-HHHHHHEEEE-EEC T ss_conf 886231014652766536651-2343120135-887 No 156 >TIGR00389 glyS_dimeric glycyl-tRNA synthetase; InterPro: IPR002315 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . In eubacteria, glycyl-tRNA synthetase (6.1.1.14 from EC) is an alpha2/beta2 tetramer composed of 2 different subunits , , . In some eubacteria, in archaea and eukaryota, glycyl-tRNA synthetase is an alpha2 dimer, this family. It belongs to class IIc and is one of the most complex synthetases. What is most interesting is the lack of similarity between the two types: divergence at the sequence level is so great that it is impossible to infer descent from common genes. The alpha (see IPR002310 from INTERPRO) and beta subunits (see IPR002311 from INTERPRO) also lack significant sequence similarity. However, they are translated from a single mRNA , and a single chain glycyl-tRNA synthetase from Chlamydia trachomatis has been found to have significant similarity with both domains, suggesting divergence from a single polypeptide chain . The sequence and crystal structure of the homodimeric glycyl-tRNA synthetase from Thermus thermophilus, shows that each monomer consists of an active site strongly resembling that of the aspartyl and seryl enzymes, a C-terminal anticodon recognition domain of 100 residues and a third domain unusually inserted between motifs 1 and 2 almost certainly interacting with the acceptor arm of tRNA(Gly). The C-terminal domain has a novel five-stranded parallel-antiparallel beta-sheet structure with three surrounding helices. The active site residues most probably responsible for substrate recognition, in particular in the Gly binding pocket, can be identified by inference from aspartyl-tRNA synthetase due to the conserved nature of the class II active site , .; GO: 0000166 nucleotide binding, 0004820 glycine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006426 glycyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=75.60 E-value=2.5 Score=21.87 Aligned_cols=39 Identities=38% Similarity=0.546 Sum_probs=25.4 Q ss_pred HHHHHHCCCHH----HHEEEHHHHCCCCC------CCCCCCCCEEEEEEEECCCC Q ss_conf 64433105702----33110011037776------65538000035654102689 Q gi|254780419|r 201 YKQLLMASGFD----RYFQIAPCFRDEDP------RSDRLPGEFYQLDMEMSFVE 245 (622) Q Consensus 201 yKQlLmv~G~d----ryfqiarcFRdEd~------R~dRqp~EFTqlD~EmsF~~ 245 (622) ||+||-..= . -+=||||.||||=| |. -||+|.+||- |++ T Consensus 199 Fk~LL~~~r-~klPFgvAQiGKSfRNEIsPr~gl~R~----REF~QaE~E~-FV~ 247 (606) T TIGR00389 199 FKRLLQFFR-NKLPFGVAQIGKSFRNEISPRQGLIRL----REFEQAEIEF-FVD 247 (606) T ss_pred HHHHHHHHC-CCCCEEEEEECCEEECCCCCCCCEEEE----CHHHHHHHHH-CCC T ss_conf 689999745-899648997441360400389875710----3154466874-259 No 157 >pfam02074 Peptidase_M32 Carboxypeptidase Taq (M32) metallopeptidase. Probab=74.38 E-value=3.2 Score=21.02 Aligned_cols=21 Identities=24% Similarity=0.298 Sum_probs=14.5 Q ss_pred CCCCCCCHHHCCCCCC-CCCHH Q ss_conf 5547000211002476-64306 Q gi|254780419|r 447 EEEKKIDFAHNPFTMP-QGGME 467 (622) Q Consensus 447 ee~~r~~s~HHPFTaP-~~~~~ 467 (622) -+.+|+.-++||||.- .+.|. T Consensus 224 f~~GRlD~S~HPF~~~~~~~Dv 245 (494) T pfam02074 224 LGTGRLDVTAHPFTTEFGIGDV 245 (494) T ss_pred CCCEEEECCCCCCCCCCCCCCE T ss_conf 7770573478987678897733 No 158 >pfam04556 DpnII DpnII restriction endonuclease. Members of this family are type II restriction enzymes (EC:3.1.21.4). They recognize the double-stranded unmethylated sequence GATC and cleave before G-1. http://rebase.neb.com/rebase/enz/DpnII.html Probab=73.67 E-value=5.8 Score=19.21 Aligned_cols=29 Identities=31% Similarity=0.536 Sum_probs=19.7 Q ss_pred HHHHHHCCCCCCEEEEEEEECCCEEECCCCCCCCHHHCCC Q ss_conf 9877300355751028896077053545547000211002 Q gi|254780419|r 420 IAQELGIINCNCFEFCWIMDFPFYEWNEEEKKIDFAHNPF 459 (622) Q Consensus 420 ia~~l~li~~~~~~flWV~DFPLFe~dee~~r~~s~HHPF 459 (622) +++++. ..+.+.|+|||| -+.|.++++|- T Consensus 231 l~~~~~--~~~~~eFiWITD---------G~GW~sak~~L 259 (284) T pfam04556 231 LAEKIK--ENTNVEFIWITD---------GKGWNSAKNKL 259 (284) T ss_pred HHHHHH--HCCCCEEEEEEC---------CCCHHHHHHHH T ss_conf 999985--169927999836---------80065506789 No 159 >COG0380 OtsA Trehalose-6-phosphate synthase [Carbohydrate transport and metabolism] Probab=73.32 E-value=6.7 Score=18.76 Aligned_cols=59 Identities=20% Similarity=0.266 Sum_probs=30.5 Q ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCC-------CCCCCCCCC-C-CC---CCCCCCHHHHHHCCCCHHHHHHHH Q ss_conf 458999999987520365146203321-------124645664-5-43---222221034431123854564433 Q gi|254780419|r 143 VKRTRIINSMRCRMIAENFIECSTPIL-------TASSPEGAR-D-FL---VPSRVNQGSFYALPQAPQQYKQLL 205 (622) Q Consensus 143 ~~Rs~i~~~~R~~l~~~gF~EveTP~L-------~ksTpeGAr-d-Fl---VPsR~~~g~fYALpQSPQlyKQlL 205 (622) +.-.+....+.+-....+.+-|+-=-| -+.-|.+ + - |+ .||. --|-+||+.-|+.+.|| T Consensus 132 ~vN~~FAd~i~~~~~~gDiIWVhDYhL~L~P~mlR~~~~~~-~IgfFlHiPfPss---Evfr~lP~r~eIl~gll 202 (486) T COG0380 132 KVNRKFADKIVEIYEPGDIIWVHDYHLLLVPQMLRERIPDA-KIGFFLHIPFPSS---EVFRCLPWREEILEGLL 202 (486) T ss_pred HHHHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHHCCCC-EEEEEEECCCCCH---HHHHHCCHHHHHHHHHH T ss_conf 99999999999746789989997041666689999748876-5898896799998---99865730999998753 No 160 >PRK13902 alaS lanyl-tRNA synthetase; Provisional Probab=72.55 E-value=1.6 Score=23.20 Aligned_cols=13 Identities=15% Similarity=0.414 Sum_probs=9.5 Q ss_pred EEEEEEEEEEEEC Q ss_conf 9999889999502 Q gi|254780419|r 94 IELSAQKIEILSV 106 (622) Q Consensus 94 ~Ei~~~~~~il~~ 106 (622) .||.+..++|-|- T Consensus 200 sEi~~~GlEiwNl 212 (898) T PRK13902 200 FEVLVRGLELATL 212 (898) T ss_pred EEEEECCEEEEEE T ss_conf 3565197974422 No 161 >PRK07275 single-stranded DNA-binding protein; Provisional Probab=70.48 E-value=7.7 Score=18.32 Aligned_cols=85 Identities=18% Similarity=0.289 Sum_probs=52.8 Q ss_pred EEEEEEEECC---CC--CCC-CEEEEEE------ECCCC-----EEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCC Q ss_conf 8999979800---44--469-8799999------77982-----599997488157899852898858999999997586 Q gi|254780419|r 20 FVRLSGWVHR---VR--PHG-GIIFLDI------RDHYG-----ITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSA 82 (622) Q Consensus 20 ~V~i~GwV~~---~R--~~g-~l~F~~l------rD~sG-----~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~ 82 (622) +|+|.|+|.+ .| ..| .++-+.| +|..| .+.|++-.+. +....+.|+.++-|.|+|.++.|.= T Consensus 4 kV~LiGrl~~DpElr~t~~G~~v~~fslAv~r~~k~~~ge~~td~~~~v~wg~~-AE~~~~yl~KG~~V~VeGrl~~~~y 82 (163) T PRK07275 4 NVVLVGRMTRDAELRYTPSNVAVATFTLAVNRTFKSQNGEREADFINCVIWRQQ-AENLANWAKKGALIGVTGRIQTRNY 82 (163) T ss_pred EEEEEEECCCCCEEEECCCCCEEEEEEEEECCCEECCCCCCCCEEEEEEECCHH-HHHHHHHHCCCCEEEEEEEEECCCC T ss_conf 699989658886388989998899999984785587889503308999980778-9999987458999999999721553 Q ss_pred CCCCCCCCCCEEEEEEEEEEEEEC Q ss_conf 555887778659999889999502 Q gi|254780419|r 83 ETINANIITGQIELSAQKIEILSV 106 (622) Q Consensus 83 ~~~n~~~~tG~~Ei~~~~~~il~~ 106 (622) ...+-.-. =-.||.|+++++|.. T Consensus 83 ~dkdG~~r-~~teVv~~~~~~l~~ 105 (163) T PRK07275 83 ENQQGQRV-YVTEVVADNFQMLES 105 (163) T ss_pred CCCCCCEE-EEEEEEEEEEEEECC T ss_conf 75899789-999999968897025 No 162 >PRK01584 alanyl-tRNA synthetase; Provisional Probab=67.85 E-value=3.4 Score=20.84 Aligned_cols=21 Identities=5% Similarity=-0.001 Sum_probs=12.7 Q ss_pred HHHHHHHHHHHCCCEEEEECC Q ss_conf 999999875203651462033 Q gi|254780419|r 147 RIINSMRCRMIAENFIECSTP 167 (622) Q Consensus 147 ~i~~~~R~~l~~~gF~EveTP 167 (622) .|-+.+-+||.++|-..|..- T Consensus 5 eiR~~Fl~fF~~kgH~~v~ss 25 (593) T PRK01584 5 ELRKKYIDFFKSKGHVEIAGK 25 (593) T ss_pred HHHHHHHHHHHHCCCEECCCC T ss_conf 999999999987898883787 No 163 >TIGR02422 protocat_beta protocatechuate 3,4-dioxygenase, beta subunit; InterPro: IPR012785 Protocatechuate (3,4-dihydroxybenzene, PCA) is an aromatic compound which is a key intermediate in the degradation of the plant biopolymer lignin and other aromatic compounds. The key step of PCA degradation is the ring-cleavage performed by dioxygenases adding both atoms from molecular oxygen to specific carbon atoms within the ring. This step can be performed by two distinct mechanisms; intradiol cleavage and extradiol cleavage. In intradiol cleavage the oxygen atoms are added to the carbons carrying the hydroxyl groups, producing two carboxylate groups. In extradiol cleavage the oxygens are added to one carbon carrying a hydroxyl group and another carrying a hydrogen, resulting in the formation of a carboxylate group and an aldehydic group. For further information see . PCA dioxygenases fall into the broader category of catechol dioxygenases. These are metalloenzymes which bind non-haeme iron. The extradiol dioxygenases use Fe(II) to activate oxygen for nucleophilic attack on the aromatic substrate, while the intradiol dioxygenases use Fe(III) to activate the aromatic substrate for an electrophilic attack by oxygen . This entry represents the beta subunit of protocatechuate 3,4-dioxygenase, an enzyme which cleaves the PCA ring by an intradiol mechanism. It is composed of two subunits, alpha (IPR012786 from INTERPRO) and beta which are highly similar in structure and are thought to share a common ancestor , , . The core of each subunit is two four-stranded beta-sheets that fold upon each other to form a beta sandwhich. The active site cavity contains the Fe(III)-binding site and is located between the two subunits. All Fe(III) ligands are contributed by the beta subunit.; GO: 0005506 iron ion binding, 0018578 protocatechuate 34-dioxygenase activity, 0019619 protocatechuate catabolic process. Probab=66.02 E-value=2.8 Score=21.42 Aligned_cols=22 Identities=18% Similarity=0.374 Sum_probs=17.9 Q ss_pred HHHCCCHHHHHHHCCCEEECCE Q ss_conf 5541887774131367777567 Q gi|254780419|r 468 SLKGNDLLSIKAFQYDLVCNGF 489 (622) Q Consensus 468 ~l~~~dp~~v~a~~YDLVlNG~ 489 (622) ++..+.|..++|+.||+||.|- T Consensus 196 D~~~t~p~~al~Y~FDIVLRGr 217 (224) T TIGR02422 196 DLDNTQPEDALGYRFDIVLRGR 217 (224) T ss_pred HHHCCCCCCCEEEECCEEECCC T ss_conf 5640681401253120376287 No 164 >TIGR01771 L-LDH-NAD L-lactate dehydrogenase; InterPro: IPR011304 This entry represents the NAD-dependent L-lactate dehydrogenases from bacteria and eukaryotes. This enzyme functions as the final step in anaerobic glycolysis. Although lactate dehydrogenases have in some cases been mistaken for malate dehydrogenases due to the similarity of these two substrates and the apparent ease with which evolution can toggle these activities, critical residues have been identified which can discriminate between the two activities.; GO: 0004459 L-lactate dehydrogenase activity, 0019642 anaerobic glycolysis, 0005737 cytoplasm. Probab=65.56 E-value=1.8 Score=22.90 Aligned_cols=72 Identities=25% Similarity=0.362 Sum_probs=62.6 Q ss_pred CCCCEECCHHHHH-HHHCCCCCCEEEECCCCCCCC-CCCCCCCCCCCCHHHHHH-CCCEEECCCHHHHHCCCHHH Q ss_conf 7342224689999-998373984032137899987-103468867289889987-08356057301000000135 Q gi|254780419|r 535 PHGGIAAGIDRIV-MLLLGAKNVREVSLFPMSQNF-CDLLMGSPSTVSVEQLRE-LGLRIVENPKKILKISVTTI 606 (622) Q Consensus 535 PHgG~alGlDRlv-mll~g~~sIRdVIaFPKt~~g-~Dlm~~aPs~v~~~qL~e-~~i~~~~~~k~~~~~~~~~~ 606 (622) -.=|||+++=|++ ++|.++.+|.=|-++=+-+.| .|..++.|+.|..+=.++ +.+.+.++|++-.+-++.++ T Consensus 226 T~YGIG~~~a~i~~aIl~d~~~ilpvS~~l~G~yG~~dv~ig~Pa~lg~~Gv~~i~e~~L~~~E~~~f~~Sa~~L 300 (302) T TIGR01771 226 TYYGIGAAVARIVEAILKDENRILPVSAYLDGQYGIKDVYIGVPAVLGRNGVEEIIELPLSDEEKEAFQKSAETL 300 (302) T ss_pred CHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCCCEEECCCCCCCCCCCCEEECCCCCHHHHHHHHHHHHHH T ss_conf 339999999999999870687587888764161586631446540122246524242799989999999899996 No 165 >PRK00252 alaS alanyl-tRNA synthetase; Reviewed Probab=64.66 E-value=3.2 Score=21.02 Aligned_cols=21 Identities=14% Similarity=0.379 Sum_probs=10.0 Q ss_pred HHHHHHHHHHH----HCCCCCCEEE Q ss_conf 47889998773----0035575102 Q gi|254780419|r 414 ADARNHIAQEL----GIINCNCFEF 434 (622) Q Consensus 414 g~lR~~ia~~l----~li~~~~~~f 434 (622) +++|..++... .++.++..+| T Consensus 561 ~alr~vlG~hv~Q~GS~v~~~~lrf 585 (864) T PRK00252 561 AALRQVLGEHVTQKGSLVAPDRLRF 585 (864) T ss_pred HHHHHHHCCCEEECCCCCCCCEEEE T ss_conf 9999984877086663357760799 No 166 >cd04484 polC_OBF polC_OBF: A subfamily of OB folds corresponding to the N-terminal OB-fold nucleic acid binding domain of Bacillus subtilis type C replicative DNA polymerase III alpha subunit (polC). Replication in B. subtilis and Staphylococcus aureus requires two different polymerases, polC and DnaE. The holoenzyme is thought to include the two different polymerases. At the B. subtilis replication fork, polC appears to be involved in leading strand synthesis and DnaE in lagging strand synthesis. Probab=63.91 E-value=10 Score=17.43 Aligned_cols=74 Identities=19% Similarity=0.164 Sum_probs=47.0 Q ss_pred EEEEEEEECCC--CC--CC-CEEEEEEECCCCEEEE--EECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCC Q ss_conf 89999798004--44--69-8799999779825999--974881578998528988589999999975865558877786 Q gi|254780419|r 20 FVRLSGWVHRV--RP--HG-GIIFLDIRDHYGITQV--VANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITG 92 (622) Q Consensus 20 ~V~i~GwV~~~--R~--~g-~l~F~~lrD~sG~~Q~--v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG 92 (622) .|+|.|-|-.. |. .| .+.-+.+.|.++++.| ..+++... ..+.+..+..|.|+|.|+--. =.+ T Consensus 1 nV~i~G~IF~~E~relk~gk~i~~~~iTD~t~Si~~K~F~~~~~~~--~~~~~~~G~~v~v~G~v~~D~--------f~~ 70 (82) T cd04484 1 NVVVEGEVFDLEIRELKSGRKILTFKVTDYTSSITVKKFLRKDEKD--KEELKSKGDWVRVRGKVQYDT--------FSK 70 (82) T ss_pred CEEEEEEEEEEEEEECCCCCEEEEEEEEECCCCEEEEEEECCCHHH--HHHHHCCCCEEEEEEEEEECC--------CCC T ss_conf 9899999997898972489899999998088979999983585444--566525897999999996317--------877 Q ss_pred EEEEEEEEEEE Q ss_conf 59999889999 Q gi|254780419|r 93 QIELSAQKIEI 103 (622) Q Consensus 93 ~~Ei~~~~~~i 103 (622) |+-+.+.+++- T Consensus 71 e~v~~~~~i~~ 81 (82) T cd04484 71 ELVLMINDIEE 81 (82) T ss_pred CEEEECCEEEE T ss_conf 15998112478 No 167 >TIGR01389 recQ ATP-dependent DNA helicase RecQ; InterPro: IPR006293 The ATP-dependent DNA helicase RecQ (3.6.1 from EC) is involved in genome maintenance . All homologues tested to date unwind paired DNA, translocating in a 3' to 5' direction and several have a preference for forked or 4-way DNA structures (e.g. Holliday junctions) or for G-quartet DNA. The yeast protein, Sgs1, is present in numerous foci that coincide with sites of de novo synthesis DNA, such as the replication fork, and protein levels peak during S-phase. A model has been proposed for Sgs1p action in the S-phase checkpoint response, both as a 'sensor' for damage during replication and a 'resolvase' for structures that arise at paused forks, such as the four-way 'chickenfoot' structure. The action of Sgs1p may serve to maintain the proper amount and integrity of ss DNA that is necessary for the binding of RPA (replication protein A, the eukaryotic ss DNA-binding protein)DNA pol complexes. Sgs1p would thus function by detecting (or resolving) aberrant DNA structures, and would thus contribute to the full activation of the DNA-dependent protein kinase, Mec1p and the effector kinase, Rad53p. Its ability to bind both the large subunit of RPA and the RecA-like protein Rad51p, place it in a unique position to resolve inappropriate fork structures that can occur when either the leading or lagging strand synthesis is stalled. Thus, RecQ helicases integrate checkpoint activation and checkpoint response. ; GO: 0004003 ATP-dependent DNA helicase activity, 0006310 DNA recombination, 0009432 SOS response. Probab=63.07 E-value=2.5 Score=21.78 Aligned_cols=106 Identities=22% Similarity=0.309 Sum_probs=50.4 Q ss_pred CCCCHHHHHHHHHCCCHHH---HEEEHHHHCCCCCCCCCCCCCEEEEEEE-ECCC----CHHHHHHHHHHHHHHHHHHHH Q ss_conf 1238545644331057023---3110011037776655380000356541-0268----999999999999999999980 Q gi|254780419|r 194 LPQAPQQYKQLLMASGFDR---YFQIAPCFRDEDPRSDRLPGEFYQLDME-MSFV----EQDDILNTMENVLRGVFEEFS 265 (622) Q Consensus 194 LpQSPQlyKQlLmv~G~dr---yfqiarcFRdEd~R~dRqp~EFTqlD~E-msF~----~~edvm~l~E~li~~i~~~v~ 265 (622) |++.+++ ++|||| +|++.+-+- .++|..+|-+...+ -|=+ +-..|=++.+.|-..=...+. T Consensus 187 L~~~~~f------~~SFdRpNl~~~v~~k~n-----~~~~l~~yl~~~~~G~SGIIYa~sR~~VE~~~~~L~s~G~~A~~ 255 (607) T TIGR01389 187 LKDANEF------ISSFDRPNLRFSVVKKNN-----KKKFLLEYLKKHREGQSGIIYASSRKKVEELAERLESQGISALA 255 (607) T ss_pred CCCCCCC------CCCCCCHHHHHHHHHHCC-----CCHHHHHHHHCCCCCCCEEEECCCHHHHHHHHHHHHHCCCCEEE T ss_conf 8986541------256775114334312037-----81368999750799953478767704589999999747962040 Q ss_pred --CCCCC---CCCCCCCCHHHH-----HHHC--CCCCCCCCCCCCCC---CCCCCCCCCC Q ss_conf --87234---577321218988-----7513--67776444764012---4431017875 Q gi|254780419|r 266 --NGNTV---SNNFLRICYDDA-----IRRY--GTDKPDLRNPIIMH---NVTEHFVHSE 310 (622) Q Consensus 266 --~~~~i---~~pF~rmtY~eA-----m~~Y--GsDKPDLR~~lei~---dit~~~~~~~ 310 (622) .|-.- +..=....|+|+ =.=| |+||||.||-..+. ||-.++++++ T Consensus 256 YHAGL~~~~R~e~Q~~Fl~d~~~vmVAT~AFGMGIdKpnVRFViH~d~P~~~EsYYQE~G 315 (607) T TIGR01389 256 YHAGLSNKVRAENQEDFLYDDVKVMVATNAFGMGIDKPNVRFVIHADMPKNIESYYQEAG 315 (607) T ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCCCCEEEEECCCCCHHHHHCCCC T ss_conf 367987768999975444625548888712358747876548886368742111310355 No 168 >TIGR00614 recQ_fam ATP-dependent DNA helicase, RecQ family; InterPro: IPR004589 The ATP-dependent DNA helicase RecQ (3.6.1 from EC) is involved in genome maintenance . All homologues tested to date unwind paired DNA, translocating in a 3' to 5' direction and several have a preference for forked or 4-way DNA structures (e.g. Holliday junctions) or for G-quartet DNA. The yeast protein, Sgs1, is present in numerous foci that coincide with sites of de novo synthesis DNA, such as the replication fork, and protein levels peak during S-phase. A model has been proposed for Sgs1p action in the S-phase checkpoint response, both as a 'sensor' for damage during replication and a 'resolvase' for structures that arise at paused forks, such as the four-way 'chickenfoot' structure. The action of Sgs1p may serve to maintain the proper amount and integrity of ss DNA that is necessary for the binding of RPA (replication protein A, the eukaryotic ss DNA-binding protein)DNA pol complexes. Sgs1p would thus function by detecting (or resolving) aberrant DNA structures, and would thus contribute to the full activation of the DNA-dependent protein kinase, Mec1p and the effector kinase, Rad53p. Its ability to bind both the large subunit of RPA and the RecA-like protein Rad51p, place it in a unique position to resolve inappropriate fork structures that can occur when either the leading or lagging strand synthesis is stalled. Thus, RecQ helicases integrate checkpoint activation and checkpoint response. ; GO: 0008026 ATP-dependent helicase activity, 0006310 DNA recombination. Probab=62.23 E-value=2.7 Score=21.58 Aligned_cols=28 Identities=18% Similarity=0.449 Sum_probs=14.5 Q ss_pred HHHH-HCCCCCCCCEECCHHHHHHHHCCCCCCE Q ss_conf 9998-6084973422246899999983739840 Q gi|254780419|r 526 YRAF-QCGMPPHGGIAAGIDRIVMLLLGAKNVR 557 (622) Q Consensus 526 L~Al-~yG~PPHgG~alGlDRlvmll~g~~sIR 557 (622) .+.+ .-|.|-- ||+| =+|..|.|+.|=| T Consensus 451 ~~~~~~~G~~~~--Fg~~--~~i~~l~G~~~qk 479 (497) T TIGR00614 451 VDRLGNVGTPQK--FGIG--YVIDVLRGSNSQK 479 (497) T ss_pred HHHHHCCCCCCC--CCCC--HHHHHHCCCCCCC T ss_conf 778641486100--3331--1567451875340 No 169 >TIGR00134 gatE_arch glutamyl-tRNA(Gln) amidotransferase, subunit E; InterPro: IPR004414 The aspartyl-tRNA(Asn) amidotransferase, B subunit or designated here gatB_rel, is found only in the archaea. It is paralogous to the gatB-encoded subunit of Glu-tRNA(Gln) amidotransferase. The GatABC system operates in many bacteria to convert Glu-tRNA(Gln) into Gln-tRNA(Gln). However, the homologous system in archaea instead converts Asp-tRNA(Asn) to Asn-tRNA(Asn). Glu-tRNA(Gln) is converted to Gln-tRNA(Gln) by a heterodimeric amidotransferase of GatE (this protein) and GatD. The Archaea have an Asp-tRNA(Asn) amidotransferase instead of an Asp--tRNA ligase, but the genes have not been identified. It is likely that this protein replaces gatB in Asp-tRNA(Asn) amidotransferase but that both enzymes share gatA . ; GO: 0016884 carbon-nitrogen ligase activity with glutamine as amido-N-donor, 0006412 translation. Probab=61.81 E-value=11 Score=17.17 Aligned_cols=113 Identities=19% Similarity=0.287 Sum_probs=77.2 Q ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCCC--------CCCCCHHHHHHHHHHH-HHCCCCCEEEEEC Q ss_conf 476401244310178752456668740378479999627876--------6521035677999876-4202462355204 Q gi|254780419|r 294 RNPIIMHNVTEHFVHSEFRVFSKILSVNPGYEVWAIPAKGGG--------GNRAFCDRMNEWARSQ-GQPGLAYIFWRLD 364 (622) Q Consensus 294 R~~lei~dit~~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~--------~srk~~D~l~e~ak~~-g~kgL~~i~~~~~ 364 (622) +..-++.|++++|.++.-+++.+...... .++..+...+.. ..|..-..+.+.++.. |..|+... + T Consensus 275 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~-g~~~~~~l~g~~g~~g~~~~pg~~lg~~~~d~~~~~~g~~g~~~~----d 349 (631) T TIGR00134 275 EVEEKIFDVTEVFADTKSKIIRNALKKKN-GKVKAVLLRGFGGLVGKEIQPGRRLGTEFADYAKTYAGVGGLFHT----D 349 (631) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CCEEEEEECCCCCCCCCCCCCCCHHCCHHHHHHHHHHCCCCCCCC----C T ss_conf 13455543577753235666643332014-532122110223201331266301010355555554202431112----2 Q ss_pred CCCCCCCCCCHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHH Q ss_conf 7775333570255325278999999818875572699808854110013478899 Q gi|254780419|r 365 ESGNVAGMGPVAKHLNIVNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNH 419 (622) Q Consensus 365 ~~~~~~~~spi~K~l~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ 419 (622) +... --+++++...+....+++.||.+.++++....+...+..+-.. T Consensus 350 ~lp~--------~g~~~~~~~~l~~~~~~~~~d~~~~~~~~~~~~~~~l~~~~~~ 396 (631) T TIGR00134 350 ELPA--------YGITEEEVKGLRDAVGAEEGDAVVLVAGEEVKVERALREVIER 396 (631) T ss_pred CCCC--------CCCCHHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHHHHHH T ss_conf 2443--------4676788877777506666762688515034577899999987 No 170 >COG0013 AlaS Alanyl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=60.84 E-value=4.5 Score=20.00 Aligned_cols=27 Identities=15% Similarity=0.202 Sum_probs=12.2 Q ss_pred CCCCCCCCCCCCCCHHHHHHCCCEEEC Q ss_conf 871034688672898899870835605 Q gi|254780419|r 567 NFCDLLMGSPSTVSVEQLRELGLRIVE 593 (622) Q Consensus 567 ~g~Dlm~~aPs~v~~~qL~e~~i~~~~ 593 (622) .|.-.+...=+.++.+.|+++--+++. T Consensus 773 ~g~~~~~~~~~~~d~~~Lr~~a~~lk~ 799 (879) T COG0013 773 GGVKVLAKEVDGADMKELREIADDLKK 799 (879) T ss_pred CCEEEEEEEECCCCHHHHHHHHHHHHH T ss_conf 887999999447898999999999975 No 171 >COG5011 Uncharacterized protein conserved in bacteria [Function unknown] Probab=60.48 E-value=4.2 Score=20.18 Aligned_cols=43 Identities=19% Similarity=0.320 Sum_probs=31.3 Q ss_pred EECCCCHHHHHHHHHHHHHHHHHHH--HCCCCCCCCCCCCCHHHHHHH Q ss_conf 1026899999999999999999998--087234577321218988751 Q gi|254780419|r 240 EMSFVEQDDILNTMENVLRGVFEEF--SNGNTVSNNFLRICYDDAIRR 285 (622) Q Consensus 240 EmsF~~~edvm~l~E~li~~i~~~v--~~~~~i~~pF~rmtY~eAm~~ 285 (622) -|+|+++=|+|++++..++..=--+ .+| -.|=|||+.++||.- T Consensus 12 ~m~fvShLdlmRlidR~iRRAglpiayT~G---FhP~prmsia~alpv 56 (228) T COG5011 12 NMAFVSHLDLMRLIDRTIRRAGLPIAYTGG---FHPHPRMSIAQALPV 56 (228) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCCEEECCC---CCCCCCEEECCCCCC T ss_conf 356888889999999998864985354378---899974221024433 No 172 >TIGR01405 polC_Gram_pos DNA polymerase III, alpha subunit, Gram-positive type; InterPro: IPR006308 These are the polypeptide chains of DNA polymerase III. Full-length homologs of this protein are restricted to the Gram-positive lineages, including the Mycoplasmas. This protein is designated alpha chain and given the gene symbol polC, but is not a full-length homolog of other polC genes. The N-terminal region of about 200 amino acids is rich in low-complexity sequence and poorly alignable. ; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication, 0005737 cytoplasm. Probab=60.37 E-value=12 Score=17.00 Aligned_cols=80 Identities=10% Similarity=0.191 Sum_probs=60.2 Q ss_pred CCEEEEEEEECCCC----CCC-CEEEEEEECCCC--E--EEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCC Q ss_conf 98899997980044----469-879999977982--5--99997488157899852898858999999997586555887 Q gi|254780419|r 18 GSFVRLSGWVHRVR----PHG-GIIFLDIRDHYG--I--TQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINAN 88 (622) Q Consensus 18 g~~V~i~GwV~~~R----~~g-~l~F~~lrD~sG--~--~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~ 88 (622) ..+|+|.|-|..+- ..| .+.=+.+.|.+- + +|...+.+.+..+.++.|+.+.-|.|+|.|.. + T Consensus 7 e~~~~~~G~iF~~e~~~~ksgr~~~~i~~tD~~~GGSl~~k~F~~~~~eD~~~~~~~K~~~wv~~~g~~~~--------~ 78 (1264) T TIGR01405 7 ENRVKVEGVIFKIEIKELKSGRTLLKIKVTDYTDGGSLILKKFLKSEKEDKEKFDGIKIGKWVRVRGKIEL--------D 78 (1264) T ss_pred CCEEEEEEEEEEEECCCCCCCCEEEEEEEEECCCCCEEEEEEECCCCCCHHHHHHHHCCCCCEEEEEEEEE--------E T ss_conf 34489999986330233577527999998731788606887604899720577734203773599998875--------2 Q ss_pred CCCCEEEEEEEEEEEEE Q ss_conf 77865999988999950 Q gi|254780419|r 89 IITGQIELSAQKIEILS 105 (622) Q Consensus 89 ~~tG~~Ei~~~~~~il~ 105 (622) ..+++..+.+.+|+=+. T Consensus 79 ~~~~~~~~~~~~I~~i~ 95 (1264) T TIGR01405 79 KFSRDLQMIIKDIEEIE 95 (1264) T ss_pred CCCCCCEEEEECCCCCC T ss_conf 21200058775021246 No 173 >smart00769 WHy Water Stress and Hypersensitive response. Probab=60.07 E-value=3.6 Score=20.67 Aligned_cols=24 Identities=38% Similarity=0.693 Sum_probs=17.6 Q ss_pred HHHHHHHCCCEEECCEEECCCCEE Q ss_conf 777413136777756772125123 Q gi|254780419|r 474 LLSIKAFQYDLVCNGFEIASGGIR 497 (622) Q Consensus 474 p~~v~a~~YDLVlNG~EiggGSiR 497 (622) |..+.+.+|+|-+||.+||+|... T Consensus 30 ~l~i~g~~y~l~~~g~~i~~G~~~ 53 (100) T smart00769 30 PIPVNGLSYDLYLNGVELGSGEIP 53 (100) T ss_pred CCEECCEEEEEEECCEEEEEEECC T ss_conf 601024599999999999868739 No 174 >PRK00286 xseA exodeoxyribonuclease VII large subunit; Reviewed Probab=59.65 E-value=12 Score=16.92 Aligned_cols=78 Identities=17% Similarity=0.168 Sum_probs=56.4 Q ss_pred CEEEEEEEECCCCC-CCCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEE Q ss_conf 88999979800444-69879999977982599997488157899852898858999999997586555887778659999 Q gi|254780419|r 19 SFVRLSGWVHRVRP-HGGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELS 97 (622) Q Consensus 19 ~~V~i~GwV~~~R~-~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~ 97 (622) ..|.|.|=|.+.+. .++-++++|.|....+.||+-..... .+.-.+..+.-|.|.|.|.-= .+.|.+.+. T Consensus 24 ~~v~V~GEIS~~~~~~sGH~YF~LkD~~a~i~~v~~~~~~~-~l~~~~~~G~~V~v~g~i~~Y--------~~~g~~ql~ 94 (443) T PRK00286 24 GQVWVRGEISNFTRPSSGHWYFTLKDEQAQIRCVMFRGSAR-RLKFKPEEGMQVLVRGKVSVY--------EPRGDYQLI 94 (443) T ss_pred CCEEEEEEECCCEECCCCEEEEEEEECCCEEEEEEECCHHH-HCCCCCCCCCEEEEEEEEEEE--------CCCCCEEEE T ss_conf 98999999646662899649999981892899999928475-289999899999999999898--------688618999 Q ss_pred EEEEEEEE Q ss_conf 88999950 Q gi|254780419|r 98 AQKIEILS 105 (622) Q Consensus 98 ~~~~~il~ 105 (622) +++++.-+ T Consensus 95 v~~i~~~g 102 (443) T PRK00286 95 VEDIEPAG 102 (443) T ss_pred EEEEEECC T ss_conf 97812779 No 175 >PRK07274 single-stranded DNA-binding protein; Provisional Probab=59.58 E-value=12 Score=16.91 Aligned_cols=83 Identities=16% Similarity=0.184 Sum_probs=51.8 Q ss_pred EEEEEEEECC---CC--CCC-CEEEEEEE------CCCC-----EEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCC Q ss_conf 8999979800---44--469-87999997------7982-----599997488157899852898858999999997586 Q gi|254780419|r 20 FVRLSGWVHR---VR--PHG-GIIFLDIR------DHYG-----ITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSA 82 (622) Q Consensus 20 ~V~i~GwV~~---~R--~~g-~l~F~~lr------D~sG-----~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~ 82 (622) .|+|.|++.+ .| ..| .++-+.|. +..| .+.|++-.. .+-...+.+..++-|.|.|.+..|.= T Consensus 4 ~V~LiGrL~~Dpel~~t~~G~~v~~fslAv~r~~k~~~ge~~td~~~vv~wgk-~Ae~~~~~~~KG~~V~V~G~L~~~~y 82 (131) T PRK07274 4 KVILIGRLTAKPELVKTATDKSVARVTLAVNRRFKNQNGEREADFINVVVWGK-LAETLVSYASKGSLISIDGELRTRKY 82 (131) T ss_pred EEEEEEECCCCCEEEECCCCCEEEEEEEEEECCEECCCCCEEEEEEEEEECCH-HHHHHHHHCCCCCEEEEEEEEECCCC T ss_conf 69999977779769998999889999998713068788978877999998284-79999875169999999999887487 Q ss_pred CCCCCCCCCCEEEEEEEEEEEEE Q ss_conf 55588777865999988999950 Q gi|254780419|r 83 ETINANIITGQIELSAQKIEILS 105 (622) Q Consensus 83 ~~~n~~~~tG~~Ei~~~~~~il~ 105 (622) . .+ ..+-=..||.|++++.|. T Consensus 83 d-kd-G~~~~~~eI~v~~~~~L~ 103 (131) T PRK07274 83 D-KD-GQTHYVTEVLCQSFQLLE 103 (131) T ss_pred C-CC-CCEEEEEEEEEEEEEECC T ss_conf 2-49-979999999997899745 No 176 >PRK09616 pheT phenylalanyl-tRNA synthetase subunit beta; Reviewed Probab=58.61 E-value=13 Score=16.80 Aligned_cols=53 Identities=9% Similarity=0.170 Sum_probs=31.9 Q ss_pred HCCCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 10570233110011037776655380000356541026899999999999999999 Q gi|254780419|r 206 MASGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVF 261 (622) Q Consensus 206 mv~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~ 261 (622) ++.+=+++--+|-+.--+++.-+- .=|-|=+|.|..|.+-+......+...+. T Consensus 197 I~d~~~~v~alAGVmGG~~S~vt~---~T~~IllEsAg~Dp~~v~~al~~~~~~l~ 249 (546) T PRK09616 197 IVDSEGNVLSFPPIINSELTRVTE---GTRNLFIDVTGTDLEAVLLALNIIVTALA 249 (546) T ss_pred EEECCCCEEEECCCCCCCCEEECC---CCCEEEEEEECCCHHHHHHHHHHHHHHHH T ss_conf 992799646414312664146627---86548998515898999999999999999 No 177 >pfam02689 Herpes_Helicase Helicase. This family consists of Helicases from the Herpes viruses. Helicases are responsible for the unwinding of DNA and are essential for replication and completion of the viral life cycle. Probab=58.53 E-value=3.6 Score=20.66 Aligned_cols=10 Identities=30% Similarity=0.624 Sum_probs=6.1 Q ss_pred HCCCCHHHHHH Q ss_conf 11238545644 Q gi|254780419|r 193 ALPQAPQQYKQ 203 (622) Q Consensus 193 ALpQSPQlyKQ 203 (622) |+-++|| |++ T Consensus 223 a~~~Tpl-Yr~ 232 (801) T pfam02689 223 AWLKTPL-YRN 232 (801) T ss_pred HHHCCHH-HHC T ss_conf 7607868-867 No 178 >COG2160 AraA L-arabinose isomerase [Carbohydrate transport and metabolism] Probab=58.33 E-value=13 Score=16.76 Aligned_cols=11 Identities=27% Similarity=0.401 Sum_probs=5.2 Q ss_pred HHCCCEEECCE Q ss_conf 31367777567 Q gi|254780419|r 479 AFQYDLVCNGF 489 (622) Q Consensus 479 a~~YDLVlNG~ 489 (622) +.+|-||||.+ T Consensus 397 g~r~rLlvnev 407 (497) T COG2160 397 GGRYRLLVNEV 407 (497) T ss_pred CCEEEEEEECC T ss_conf 87699997225 No 179 >cd04496 SSB_OBF SSB_OBF: A subfamily of OB folds similar to the OB fold of ssDNA-binding protein (SSB). SSBs bind with high affinity to ssDNA. They bind to and protect ssDNA intermediates during DNA metabolic pathways. All bacterial and eukaryotic SSBs studied to date oligomerize to bring together four OB folds in their active state. The majority (e.g. Escherichia coli SSB) have a single OB fold per monomer, which oligomerize to form a homotetramer. However, Deinococcus and Thermus SSB proteins have two OB folds per monomer, which oligomerize to form a homodimer. Mycobacterium tuberculosis SSB varies in quaternary structure from E. coli SSB. It forms a dimer of dimers having a unique dimer interface, which lends the protein greater stability. Included in this group are OB folds similar to Escherichia coli PriB. E.coli PriB is homodimeric with each monomer having a single OB fold. It does not appear to form higher order oligomers. PriB is an essential protein for the replication restart Probab=57.64 E-value=13 Score=16.69 Aligned_cols=56 Identities=25% Similarity=0.227 Sum_probs=35.7 Q ss_pred EEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEEE Q ss_conf 5999974881578998528988589999999975865558877786599998899995 Q gi|254780419|r 47 ITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELSAQKIEIL 104 (622) Q Consensus 47 ~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~~~~~~il 104 (622) -+.|++-.+ .+-..++.+..++-|.|+|.+..+.=...+.... -.+||.++++.++ T Consensus 45 ~~~v~~wg~-~A~~~~~~l~KG~~V~V~G~l~~~~~~~~~g~~~-~~~~i~a~~i~~l 100 (100) T cd04496 45 WIRVVAFGK-LAENAAKYLKKGDLVYVEGRLRTRSWEDKDGQKR-YGTEVVADRIEFL 100 (100) T ss_pred EEEEEEECH-HHHHHHHHCCCCCEEEEEEEEECCEEECCCCCEE-EEEEEEEEEEEEC T ss_conf 999999970-8987787537998899999988123799999899-9999999999949 No 180 >COG0514 RecQ Superfamily II DNA helicase [DNA replication, recombination, and repair] Probab=57.18 E-value=3.7 Score=20.57 Aligned_cols=13 Identities=54% Similarity=0.754 Sum_probs=9.5 Q ss_pred CCCCCCCCCCCCC Q ss_conf 6777644476401 Q gi|254780419|r 287 GTDKPDLRNPIIM 299 (622) Q Consensus 287 GsDKPDLR~~lei 299 (622) |+||||+||-..+ T Consensus 293 GIdKpdVRfViH~ 305 (590) T COG0514 293 GIDKPDVRFVIHY 305 (590) T ss_pred CCCCCCCEEEEEE T ss_conf 7678884079980 No 181 >cd06460 M32_Taq Peptidase family M32 is a subclass of metallocarboxypeptidases which are distributed mainly in bacteria and archaea, and contain a HEXXH motif that coordinates a divalent cation such as Zn2+ or Co2+, so far only observed in the active site of neutral metallopeptidases but not in carboxypeptidases. M32 includes the thermostable carboxypeptidases (E.C. 3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus furiosus (PfuCP), which have broad specificities toward a wide range of C-terminal substrates that include basic, aromatic, neutral and polar amino acids. These enzymes have a similar fold to the M3 peptidases such as neurolysin and the M2 angiotensin converting enzyme (ACE). Novel peptidases from protozoa Trypanosoma cruzi, a causative agent of Chagas' disease, and Leishmania major, a parasite that causes leishmaniasis, are the first eukaryotic M32 enzymes identified so far, thus making these enzymes an attractive potential target for drug development against these o Probab=56.76 E-value=11 Score=17.23 Aligned_cols=32 Identities=19% Similarity=0.261 Sum_probs=12.2 Q ss_pred HHHHHHHHHHCCCCCCCCEECCHHHHHHHHCCC Q ss_conf 677899998608497342224689999998373 Q gi|254780419|r 521 RFGGLYRAFQCGMPPHGGIAAGIDRIVMLLLGA 553 (622) Q Consensus 521 ~Fg~lL~Al~yG~PPHgG~alGlDRlvmll~g~ 553 (622) .|+-+++=|+--.--||.. ..-+-|+.-.||. T Consensus 349 ~f~~l~~WL~e~Ih~~G~~-~~~~eLl~~~TGe 380 (396) T cd06460 349 DFSPLLEWLRENIHQHGSR-YSPDELLKKATGE 380 (396) T ss_pred CHHHHHHHHHHHHHHCCCC-CCHHHHHHHHHCC T ss_conf 9499999999999860247-9989999998689 No 182 >pfam00436 SSB Single-strand binding protein family. This family includes single stranded binding proteins and also the primosomal replication protein N (PriB). PriB forms a complex with PriA, PriC and ssDNA. Probab=56.36 E-value=14 Score=16.54 Aligned_cols=83 Identities=20% Similarity=0.235 Sum_probs=47.2 Q ss_pred EEEEEEEECCC---CC--CCC-EE-EEEEE-----CC-------CCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEEC Q ss_conf 89999798004---44--698-79-99997-----79-------825999974881578998528988589999999975 Q gi|254780419|r 20 FVRLSGWVHRV---RP--HGG-II-FLDIR-----DH-------YGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSR 80 (622) Q Consensus 20 ~V~i~GwV~~~---R~--~g~-l~-F~~lr-----D~-------sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r 80 (622) .|+|.|.|..- |. .|+ ++ |-.-- |. +--+.|++-.+ .+...++.++.++.|.|.|.+..+ T Consensus 3 ~v~l~G~l~~dpe~r~~~~g~~~~~f~va~~~~~~~~~~~~~~~t~~~~v~~~g~-~A~~~~~~l~KG~~V~V~G~l~~~ 81 (104) T pfam00436 3 KVILVGRLTRDPELRYTPNGNAVANFTLAVNRRFKDQNGESDEETDFIRVVVWGK-LAENAAEYLKKGSLVYVEGRLRTR 81 (104) T ss_pred EEEEEEEECCCCEEEECCCCCEEEEEEEEEEECCCCCCCCCCCCCEEEEEEECCH-HHHHHHHCCCCCCEEEEEEEEEEE T ss_conf 9999999056989999689988999999996021346787024558998773054-465431227899799999997811 Q ss_pred CCCCCCCCCCCCEEEEEEEEEEEE Q ss_conf 865558877786599998899995 Q gi|254780419|r 81 SAETINANIITGQIELSAQKIEIL 104 (622) Q Consensus 81 ~~~~~n~~~~tG~~Ei~~~~~~il 104 (622) .=...+-. +-=.+||.++++++| T Consensus 82 ~~~~~~G~-~r~~~~v~~~~i~~l 104 (104) T pfam00436 82 KYEDQEGQ-KRYVTEIVADNVQFL 104 (104) T ss_pred EEECCCCC-EEEEEEEEEEEEEEC T ss_conf 37999998-999999999999949 No 183 >PRK07211 replication factor A; Reviewed Probab=56.32 E-value=14 Score=16.54 Aligned_cols=77 Identities=29% Similarity=0.329 Sum_probs=53.0 Q ss_pred EEEEEEEECCC-------C---C--CCCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCC Q ss_conf 89999798004-------4---4--6987999997798259999748815789985289885899999999758655588 Q gi|254780419|r 20 FVRLSGWVHRV-------R---P--HGGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINA 87 (622) Q Consensus 20 ~V~i~GwV~~~-------R---~--~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~ 87 (622) .|.+-|.|.++ | + .|.++=+.+-|.||.+-+++=.+. +-.....+..+.|+.|+|..+ ...| T Consensus 65 ~V~F~akV~~v~d~RtF~R~~~dg~~GrV~ni~VaDeTG~IRvtLWDe~-A~a~~~~ievGdVLkI~Gr~k----~G~~- 138 (473) T PRK07211 65 EVKFLAKVTRVGDLRTFERDDEDDDDGRVINVEVADESGDVRVALWDDQ-AVAADDDLEPGDVLRIKGRPK----DGYN- 138 (473) T ss_pred EEEEEEEEEEECCCEEEECCCCCCCCCEEEEEEEECCCCCEEEEEECCH-HHHHHHCCCCCCEEEEEECCC----CCCC- T ss_conf 4799999815148644441688877633899998647895899985334-677762445674899940346----7866- Q ss_pred CCCCCEEEEEEEEEEEEECC Q ss_conf 77786599998899995021 Q gi|254780419|r 88 NIITGQIELSAQKIEILSVA 107 (622) Q Consensus 88 ~~~tG~~Ei~~~~~~il~~a 107 (622) | .||.+.+++....+ T Consensus 139 ----G-~Evs~~kie~d~da 153 (473) T PRK07211 139 ----G-LEVSADEVEPDEDA 153 (473) T ss_pred ----C-CEECHHHCCCCCCC T ss_conf ----6-35302113357676 No 184 >TIGR02470 sucr_synth sucrose synthase; InterPro: IPR012820 This entry represents sucrose synthase an enzyme that despite its name, generally uses rather produces sucrose. Sucrose plus UDP (or ADP) becomes D-fructose plus UDP-glucose (or ADP-glucose), which is then available for cell wall (or starch) biosynthesis. The enzyme is homologous to sucrose phosphate synthase, which catalyses the penultimate step in sucrose synthesis. Sucrose synthase is found, so far, exclusively in plants and cyanobacteria.; GO: 0016157 sucrose synthase activity. Probab=55.15 E-value=6.4 Score=18.88 Aligned_cols=32 Identities=16% Similarity=0.096 Sum_probs=21.0 Q ss_pred CHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCCC Q ss_conf 025532527899999981887557269980885411 Q gi|254780419|r 374 PVAKHLNIVNTEAIRVRLDMQDGDSCFFIAGDPLKF 409 (622) Q Consensus 374 pi~K~l~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~ 409 (622) |--..|..+..++|...++.+| |+|+ |.+... T Consensus 371 PYLE~FA~D~~~e~~~el~g~P-DLII---GNYSDG 402 (790) T TIGR02470 371 PYLETFAEDAEKEILAELQGKP-DLII---GNYSDG 402 (790) T ss_pred CCHHHHHHHHHHHHHHHHCCCC-CEEE---ECCCCH T ss_conf 2045668999999999846899-7067---325635 No 185 >PRK00448 polC DNA polymerase III PolC; Validated Probab=54.89 E-value=14 Score=16.38 Aligned_cols=63 Identities=17% Similarity=0.184 Sum_probs=45.2 Q ss_pred CCCEEEEEEEECCC--CC--CC-CEEEEEEECCCCEEEEEE-CCCCHHHHHHHCCCCCCEEEEEEEEEE Q ss_conf 89889999798004--44--69-879999977982599997-488157899852898858999999997 Q gi|254780419|r 17 VGSFVRLSGWVHRV--RP--HG-GIIFLDIRDHYGITQVVA-NPDSACFEIAKSVRSEWVLCIDGLVMS 79 (622) Q Consensus 17 ~g~~V~i~GwV~~~--R~--~g-~l~F~~lrD~sG~~Q~v~-~~~~~~~~~~~~l~~esvv~v~G~V~~ 79 (622) ....|+|.|-|-.. |. .| .+.=+.+.|.++.+.|-. ..+....+.+..+..+..|.|+|.|+. T Consensus 233 ~~~~v~v~G~vf~~e~~~~k~g~~i~~~~itD~t~si~~k~f~~~~~~~~~~~~i~~g~~v~v~g~~~~ 301 (1436) T PRK00448 233 EERRVVVEGYVFKKEIKELKSGRHILTFKITDYTSSITVKKFLRDKEDLKKFDEIKKGDWVKVRGSVQN 301 (1436) T ss_pred CCCEEEEEEEEEEEEEEECCCCCEEEEEEEEECCCCEEEEEECCCCCHHHHHHHCCCCCEEEEEEEEEE T ss_conf 258189999998667443157987999999826886899994476332789851679978999999851 No 186 >pfam03558 TBSV_P22 TBSV core protein P21/P22. This protein is required for cell-to-cell movement in plants. Furthermore, the membrane-associated protein is dispensable for both replication and transcription. Probab=54.65 E-value=3.3 Score=20.91 Aligned_cols=62 Identities=32% Similarity=0.509 Sum_probs=42.0 Q ss_pred CCCHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCCCHH---HHHHCC Q ss_conf 78987821123200012111121245899999998752036514620332112464566454322222103---443112 Q gi|254780419|r 119 EYPESIRFKYRFLDLRRDTLHKNIVKRTRIINSMRCRMIAENFIECSTPILTASSPEGARDFLVPSRVNQG---SFYALP 195 (622) Q Consensus 119 ~~~e~~rl~~R~LdLR~~~~~~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTpeGArdFlVPsR~~~g---~fYALp 195 (622) +...|+||.-||||..-.. ||.+ -.--||-|.-.|.| ||-.-|-||-|-+.| ..|+|| T Consensus 99 dlgkeirltm~hldfsvs~-------rs~v-------pivfgfeelvspfl-----egrelfsv~~rwq~glsaq~yslp 159 (189) T pfam03558 99 DLGKEIRLSLQHLDFSVSA-------RSAV-------PIVFGFEDLVSPFL-----EGRELFSVPFRWQFGLSAQCYSLP 159 (189) T ss_pred HCCHHHEEECCCCCEEEEE-------CCCC-------CEEECHHHHHHHHH-----CCCEEEEEEEEEEECCCCCCCCCC T ss_conf 2426446630113225540-------5678-------67841677521454-----564356677888841322203489 Q ss_pred CCHH Q ss_conf 3854 Q gi|254780419|r 196 QAPQ 199 (622) Q Consensus 196 QSPQ 199 (622) |++- T Consensus 160 ~~~w 163 (189) T pfam03558 160 PAKW 163 (189) T ss_pred CCCE T ss_conf 8633 No 187 >COG0587 DnaE DNA polymerase III, alpha subunit [DNA replication, recombination, and repair] Probab=54.55 E-value=15 Score=16.34 Aligned_cols=13 Identities=23% Similarity=0.350 Sum_probs=7.0 Q ss_pred CEE-CCHHHHHHHH Q ss_conf 222-4689999998 Q gi|254780419|r 538 GIA-AGIDRIVMLL 550 (622) Q Consensus 538 G~a-lGlDRlvmll 550 (622) -|. +|-+|--++. T Consensus 876 Afd~~~~~R~~ll~ 889 (1139) T COG0587 876 AFDSFGKNRAQLLA 889 (1139) T ss_pred CCCCHHHHHHHHHH T ss_conf 64211157999999 No 188 >PTZ00274 cytochrome b5 reductase; Provisional Probab=54.36 E-value=9.7 Score=17.61 Aligned_cols=53 Identities=23% Similarity=0.349 Sum_probs=34.4 Q ss_pred HCCCCHHHHHHHHHCCC---HHHHEEEHH-HHCCCCCC--CCCCCCCEEEEEEEECCCCHHHHH Q ss_conf 11238545644331057---023311001-10377766--553800003565410268999999 Q gi|254780419|r 193 ALPQAPQQYKQLLMASG---FDRYFQIAP-CFRDEDPR--SDRLPGEFYQLDMEMSFVEQDDIL 250 (622) Q Consensus 193 ALpQSPQlyKQlLmv~G---~dryfqiar-cFRdEd~R--~dRqp~EFTqlD~EmsF~~~edvm 250 (622) .++=.|-+||.+.|++| +.-.|||.| .++|.+-. .|+ |++-+=-+-.+.+||+ T Consensus 151 k~~ykpN~~k~IgMIAGGTGITPMyQvir~Il~nP~d~~~~Dk-----TkvsLlyaN~Te~DIL 209 (325) T PTZ00274 151 KIQYRPNRWKHVGMIAGGTGFTPMLQIIRHSLTEPWDSGEVDR-----TKLSFLFCNRTERHIL 209 (325) T ss_pred CCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHCCCCCCCCCCC-----CEEEEEEECCCHHHHH T ss_conf 7753698666688973587766799999999719765676787-----5899999229878800 No 189 >cd00673 AlaRS_core Alanyl-tRNA synthetase (AlaRS) class II core catalytic domain. AlaRS is a homodimer. It is responsible for the attachment of alanine to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its predicted structure and the presence of three characteristic sequence motifs. Probab=53.31 E-value=8.1 Score=18.15 Aligned_cols=96 Identities=19% Similarity=0.285 Sum_probs=50.1 Q ss_pred HHHHHHHHHHHHCCCEEEEECCCCC-CCCCCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCC----HHHHEEEHHHH Q ss_conf 9999999875203651462033211-246456645432222210344311238545644331057----02331100110 Q gi|254780419|r 146 TRIINSMRCRMIAENFIECSTPILT-ASSPEGARDFLVPSRVNQGSFYALPQAPQQYKQLLMASG----FDRYFQIAPCF 220 (622) Q Consensus 146 s~i~~~~R~~l~~~gF~EveTP~L~-ksTpeGArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G----~dryfqiarcF 220 (622) ++|-..+=+||.++|-..|..--|. ...| .-.| +..| =++||..++ |- ..|.-..-+|. T Consensus 2 ~eiR~~Fl~fF~~~gH~~vpssslvp~~Dp--tllF-----tnAG--------M~~Fkp~fl-G~~~p~~~r~~~~QkCi 65 (232) T cd00673 2 SEIRETFLSFFEKKGHTRVPSSPVVPRDDP--TLLF-----TNAG--------MNQFKPIFL-GEVPPPANRLVNSQKCI 65 (232) T ss_pred HHHHHHHHHHHHHCCCEECCCCCCCCCCCC--CEEE-----EECC--------HHHHHHHHC-CCCCCCCCCCEEEEECC T ss_conf 689999999998789888278772238999--5014-----4114--------688899867-78899987510000123 Q ss_pred CCCC----CCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHH Q ss_conf 3777----665538000035654102689999999999999999999 Q gi|254780419|r 221 RDED----PRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEE 263 (622) Q Consensus 221 RdEd----~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~ 263 (622) |.-| .++.||.+=|-||- --||.|| .-|+.|...|.- T Consensus 66 R~~di~nvG~t~rH~TfFEMLG-nfSFGdY-----fK~eaI~~awe~ 106 (232) T cd00673 66 RAGDIDNVGKTGRHHTFFEMLG-NFSFGDY-----FKEEAIAFAWEL 106 (232) T ss_pred CCCCCCCCCCCCCCCHHHHCCC-CCCHHHH-----HHHHHHHHHHHH T ss_conf 6578324677776402333044-4203145-----899999999998 No 190 >COG4930 Predicted ATP-dependent Lon-type protease [Posttranslational modification, protein turnover, chaperones] Probab=52.82 E-value=4.4 Score=20.03 Aligned_cols=30 Identities=23% Similarity=0.458 Sum_probs=16.1 Q ss_pred CCHHHHHHHHHHHHHHCCCCCCE---EEEEEEE Q ss_conf 00134788999877300355751---0288960 Q gi|254780419|r 410 CSFAADARNHIAQELGIINCNCF---EFCWIMD 439 (622) Q Consensus 410 ~~~lg~lR~~ia~~l~li~~~~~---~flWV~D 439 (622) ......+|..+.+.|..+-...| +|.++.+ T Consensus 445 leyAle~RrRVkeqLkKigg~EFfdvnfSyidn 477 (683) T COG4930 445 LEYALELRRRVKEQLKKIGGGEFFDVNFSYIDN 477 (683) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCEEEEECC T ss_conf 999999999999999751786544332466416 No 191 >cd04474 RPA1_DBD_A RPA1_DBD_A: A subfamily of OB folds corresponding to the second OB fold, the ssDNA-binding domain (DBD)-A, of human RPA1 (also called RPA70). RPA1 is the large subunit of Replication protein A (RPA). RPA is a nuclear ssDNA-binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). In addition to DBD-A, RPA1 contains three other OB folds: DBD-B, DBD-C, and RPA1N. The major DNA binding activity of human RPA (hRPA) and Saccharomyces cerevisiae RPA (ScRPA) is associated with DBD-A and DBD-B of RPA1. RPA1 DBD-C is involved in trimerization. The ssDNA-binding mechanism is believed to be multistep and to involve conformational change. Although ScRPA and the hRPA have similar ssDNA-binding properties, they differ funct Probab=52.25 E-value=16 Score=16.09 Aligned_cols=79 Identities=15% Similarity=0.122 Sum_probs=47.9 Q ss_pred EEEEEEEECC---------CCCCCCEEEEEEECC-CCEEEEEECCCCHHHHHHHCCCCCCEEEEEE-EEEECCCCCCCCC Q ss_conf 8999979800---------444698799999779-8259999748815789985289885899999-9997586555887 Q gi|254780419|r 20 FVRLSGWVHR---------VRPHGGIIFLDIRDH-YGITQVVANPDSACFEIAKSVRSEWVLCIDG-LVMSRSAETINAN 88 (622) Q Consensus 20 ~V~i~GwV~~---------~R~~g~l~F~~lrD~-sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G-~V~~r~~~~~n~~ 88 (622) .-+|.|+|.+ .|.-|++.-++|.|. +|.|.+++-.+. ....-..|..+.|-.+.| .|.. ..+..|+- T Consensus 11 ~w~Ik~RV~~Ks~ir~~~n~~g~gk~f~~~L~De~g~eI~at~f~~~-~dkf~~~l~~G~vY~is~g~Vk~-ank~y~~~ 88 (104) T cd04474 11 KWTIKARVTNKSDIRTWSNARGEGKLFSFDLLDEDGGEIRATFFNDA-VDKFYDLLEVGKVYYISKGSVKV-ANKKFNTL 88 (104) T ss_pred CEEEEEEEEECCCCCCEECCCCCCEEEEEEEEECCCCEEEEEEEHHH-HHHHHHHCCCCCEEEEECCEEEE-CCCCCCCC T ss_conf 45999999715775035068997359999999899898999990699-99877334328799995659984-34876689 Q ss_pred CCCCEEEEEEEEEE Q ss_conf 77865999988999 Q gi|254780419|r 89 IITGQIELSAQKIE 102 (622) Q Consensus 89 ~~tG~~Ei~~~~~~ 102 (622) ..++||..+..+ T Consensus 89 --~~~yeI~f~~~t 100 (104) T cd04474 89 --KNDYEITFNRDT 100 (104) T ss_pred --CCCEEEEECCCC T ss_conf --987899978997 No 192 >PRK06752 single-stranded DNA-binding protein; Validated Probab=52.03 E-value=16 Score=16.07 Aligned_cols=86 Identities=15% Similarity=0.167 Sum_probs=49.9 Q ss_pred EEEEEEEECCC---C--CCC-CEEEEEEE------CCCC-----EEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCC Q ss_conf 89999798004---4--469-87999997------7982-----599997488157899852898858999999997586 Q gi|254780419|r 20 FVRLSGWVHRV---R--PHG-GIIFLDIR------DHYG-----ITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSA 82 (622) Q Consensus 20 ~V~i~GwV~~~---R--~~g-~l~F~~lr------D~sG-----~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~ 82 (622) .|.|.|++.+- | ..| .++-+.|. +..| .+.|++-.+ .+-...+.++.++-|.|+|.++.|.= T Consensus 4 ~V~LiGrl~~dpelr~t~~G~~~~~f~lAv~r~~~~~~g~~~tdf~~~~~w~k-~Ae~~~~yl~KG~~V~v~Grl~~~~y 82 (112) T PRK06752 4 RVVLIGRLTKEPELYYTKQGVAYARVCVAVNRGFRNSLGEQQVDFINCVVWRK-SAENVTEYCTKGSLVGITGRIHTSNY 82 (112) T ss_pred EEEEEEECCCCCEEEECCCCCEEEEEEEEECCCEECCCCCEEEEEEEEEEECH-HHHHHHHHHCCCCEEEEEEEEECCCE T ss_conf 89999967868669998999789999996136448788978789999999965-89899998479999999999984610 Q ss_pred CCCCCCCCCCEEEEEEEEEEEEECC Q ss_conf 5558877786599998899995021 Q gi|254780419|r 83 ETINANIITGQIELSAQKIEILSVA 107 (622) Q Consensus 83 ~~~n~~~~tG~~Ei~~~~~~il~~a 107 (622) ...+-+..+ -.||.|++++.|+.- T Consensus 83 ~~~~G~~~~-~~eVvv~~v~fL~~k 106 (112) T PRK06752 83 EDDQGKRIY-RTEVVIESITFLERR 106 (112) T ss_pred ECCCCCEEE-EEEEEEEEEEECCCC T ss_conf 999997999-999999999977578 No 193 >TIGR01358 DAHP_synth_II 3-deoxy-7-phosphoheptulonate synthase; InterPro: IPR002480 Members of the 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthetase family (2.5.1.54 from EC) catalyse the first step in aromatic amino acid biosynthesis from chorismate. Class I (see IPR006218 from INTERPRO) includes bacterial and yeast enzymes; class II includes higher plants and various microorganisms . The first step in the common pathway leading to the biosynthesis of aromatic compounds is the stereospecific condensation of phosphoenolpyruvate (PEP) and D-erythrose-4-phosphate (E4P) giving rise to 3-deoxy-D-arabino-heptulosonate-7-phosphate (DAHP). This reaction is catalyzed by DAHP synthase, a metal-activated enzyme, which in microorganisms is the target for negative-feedback regulation by pathway intermediates or by end products.; GO: 0003849 3-deoxy-7-phosphoheptulonate synthase activity, 0009073 aromatic amino acid family biosynthetic process. Probab=51.59 E-value=16 Score=16.02 Aligned_cols=134 Identities=18% Similarity=0.289 Sum_probs=72.4 Q ss_pred CCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHCCC--CCCCCCCCCCCCCCCC Q ss_conf 55380000356541026899999999999999999998087234577321218988751367--7764447640124431 Q gi|254780419|r 227 SDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFLRICYDDAIRRYGT--DKPDLRNPIIMHNVTE 304 (622) Q Consensus 227 ~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~rmtY~eAm~~YGs--DKPDLR~~lei~dit~ 304 (622) .+.|| .++++|+..| -..+ -+.|++||.|+-+ ||- -...|+|- T Consensus 217 ~~~Qp-~L~~vefytS------------------HEAL-----------LL~YE~ALtR~d~Ged~~-----~~wfD~Sa 261 (450) T TIGR01358 217 PAEQP-ALQTVEFYTS------------------HEAL-----------LLDYEEALTRVDSGEDRS-----GGWFDLSA 261 (450) T ss_pred CCCCC-CCCCHHHHHH------------------HHHC-----------CCCHHCCCEECCCCCCCC-----CCCEECCC T ss_conf 50000-1220134341------------------3431-----------500102120024775887-----87401245 Q ss_pred CCCCCCHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHH-HHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCHHH Q ss_conf 0178752456668740378479999627876652103567-799987642024623552047775333570255325278 Q gi|254780419|r 305 HFVHSEFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRM-NEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNIVN 383 (622) Q Consensus 305 ~~~~~~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l-~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~~~ 383 (622) ++. +. +. -..++|-- .||+..... +|-++ .|| ....++ T Consensus 262 H~l----------------WI--------Ge-RTRQlDGAHvEfl~gv~N----PiGiK---------vGP---~m~pd~ 300 (450) T TIGR01358 262 HLL----------------WI--------GE-RTRQLDGAHVEFLRGVRN----PIGIK---------VGP---SMTPDE 300 (450) T ss_pred CCC----------------CC--------CC-CCCCCCHHHHHHHHHCCC----CEEEE---------ECC---CCCHHH T ss_conf 531----------------13--------65-102134889999863368----50365---------388---666889 Q ss_pred HHHHHHHHC--CCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEE Q ss_conf 999999818--8755726998088541100134788999877300355751028896077053 Q gi|254780419|r 384 TEAIRVRLD--MQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYE 444 (622) Q Consensus 384 ~~~l~~~~~--~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe 444 (622) .-++++.++ .+||-+.|+.- . -.+++|.+|-.-+.-+....-..+||+| ||== T Consensus 301 ll~lie~LdP~~~PGRLtli~R---m----Ga~~~adkLP~L~~aV~~~G~~V~W~cD-PMHG 355 (450) T TIGR01358 301 LLRLIERLDPENEPGRLTLISR---M----GADKIADKLPPLLEAVKAAGRAVVWVCD-PMHG 355 (450) T ss_pred HHHHHHHCCCCCCCCCEEEEEE---C----CCCHHHHHCHHHHHHHHHCCCCEEEEEC-CCCC T ss_conf 9988654077998963688883---3----7501234053789999972792577547-8888 No 194 >KOG1894 consensus Probab=50.11 E-value=8.2 Score=18.13 Aligned_cols=58 Identities=26% Similarity=0.318 Sum_probs=34.0 Q ss_pred HHHHHHHHHHHHHCCCEEEEECCCCCCC---CCCC--CCCCCCCCCCCHHHHH-------HCCCCHHHHHHHHHCC Q ss_conf 8999999987520365146203321124---6456--6454322222103443-------1123854564433105 Q gi|254780419|r 145 RTRIINSMRCRMIAENFIECSTPILTAS---SPEG--ARDFLVPSRVNQGSFY-------ALPQAPQQYKQLLMAS 208 (622) Q Consensus 145 Rs~i~~~~R~~l~~~gF~EveTP~L~ks---TpeG--ArdFlVPsR~~~g~fY-------ALpQSPQlyKQlLmv~ 208 (622) |..--.+.++-+...+ |+.||.|... +.+| |.-|=| +--.|| ||||||=++||++-|+ T Consensus 106 ~~~~k~a~~el~r~kd--e~ntp~l~qr~s~s~~~~~~n~qsI----y~~~~~D~~~~~laL~i~~i~~k~~krv~ 175 (412) T KOG1894 106 ADRPKYALQELFRLKD--ESNTPVLQQRLSASAFGAIANIQSI----YNQGFYDQEDVVLALPISKIFFKDLKRVG 175 (412) T ss_pred HHHHHHHHHHHHHHHH--CCCCHHHHHCCCCCCHHHCCCCEEE----CCCCCCCCCCCCCCCCHHHHHHHHHHHHC T ss_conf 4415689999998752--2387277641143203221551122----15776678886510107566666656516 No 195 >KOG2653 consensus Probab=49.57 E-value=17 Score=15.81 Aligned_cols=22 Identities=18% Similarity=0.341 Sum_probs=12.6 Q ss_pred HHHCCCEEEEECCCCCCCCCCCCC Q ss_conf 520365146203321124645664 Q gi|254780419|r 155 RMIAENFIECSTPILTASSPEGAR 178 (622) Q Consensus 155 ~l~~~gF~EveTP~L~ksTpeGAr 178 (622) -+.++|++-|-.-+= .-.|||| T Consensus 118 el~k~GilfvG~GVS--GGEEGAR 139 (487) T KOG2653 118 ELAKKGILFVGSGVS--GGEEGAR 139 (487) T ss_pred HHHHCCCEEEECCCC--CCCCCCC T ss_conf 999668569844766--7642222 No 196 >TIGR02972 TMAO_torE trimethylamine N-oxide reductase system, TorE protein; InterPro: IPR014316 Members of this small, apparent transmembrane protein are designated TorE and occur in operons for the trimethylamine N-oxide (TMAO) reductase system. Members are closely related to the NapE protein of the related periplasmic nitrate reductase system. It may be that TorE is an integral membrane subunit of a complex with the reductase TorA.. Probab=49.26 E-value=9.4 Score=17.70 Aligned_cols=13 Identities=38% Similarity=0.702 Sum_probs=12.1 Q ss_pred HHHHHHHCCCCC-C Q ss_conf 899998608497-3 Q gi|254780419|r 524 GLYRAFQCGMPP-H 536 (622) Q Consensus 524 ~lL~Al~yG~PP-H 536 (622) |+++||-+|+|| | T Consensus 37 W~~Q~f~FG~~GlH 50 (50) T TIGR02972 37 WMIQAFGFGVPGLH 50 (50) T ss_pred HHHHHHHCCCCCCC T ss_conf 99999732899999 No 197 >COG0113 HemB Delta-aminolevulinic acid dehydratase [Coenzyme metabolism] Probab=49.19 E-value=4.4 Score=20.04 Aligned_cols=23 Identities=22% Similarity=0.400 Sum_probs=18.8 Q ss_pred HHHHHHHHHHCCCEEEEECCCCCCC Q ss_conf 9999987520365146203321124 Q gi|254780419|r 148 IINSMRCRMIAENFIECSTPILTAS 172 (622) Q Consensus 148 i~~~~R~~l~~~gF~EveTP~L~ks 172 (622) -+.+||+.|++.|| ++|||+.=| T Consensus 176 rV~aIR~aLd~ag~--~~v~IMsYs 198 (330) T COG0113 176 RVGAIREALDEAGF--IDVPIMSYS 198 (330) T ss_pred HHHHHHHHHHHCCC--CCCEEEEHH T ss_conf 39999999987698--764043206 No 198 >PRK08182 single-stranded DNA-binding protein; Provisional Probab=48.96 E-value=18 Score=15.74 Aligned_cols=62 Identities=15% Similarity=0.122 Sum_probs=38.0 Q ss_pred CCCCE-EEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEEEEC Q ss_conf 79825-99997488157899852898858999999997586555887778659999889999502 Q gi|254780419|r 43 DHYGI-TQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELSAQKIEILSV 106 (622) Q Consensus 43 D~sG~-~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~~~~~~il~~ 106 (622) |..+. +-|.+=. ..+-..+..|+.++-|.|+|.+..|.=.+.+-.-.+ .+||.++++.+|.. T Consensus 50 d~~~~W~~V~~Wg-~~AE~~a~~L~KG~~V~V~Grl~~~~W~dkdG~~R~-~~eV~Ad~v~~l~~ 112 (148) T PRK08182 50 DRGGFWAPVELWH-RDAEHWARLYQKGMRVLVEGRMERDEWTDNEDNERV-TFKVEARRVGILPY 112 (148) T ss_pred ECCCEEEEEEEEH-HHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCEEE-EEEEEEEEEEECCC T ss_conf 6664799999873-699999998738998999989860134779998788-99999868883467 No 199 >COG0095 LplA Lipoate-protein ligase A [Coenzyme metabolism] Probab=48.81 E-value=12 Score=17.07 Aligned_cols=66 Identities=14% Similarity=0.012 Sum_probs=32.7 Q ss_pred CCCCCEEEEEEEECCCEEECCCCCCCCHHHCCCCCCCCCHHHHHCCCHH-HHHHHCCCEEECCEEECCCCEECC Q ss_conf 3557510288960770535455470002110024766430655418877-741313677775677212512315 Q gi|254780419|r 427 INCNCFEFCWIMDFPFYEWNEEEKKIDFAHNPFTMPQGGMESLKGNDLL-SIKAFQYDLVCNGFEIASGGIRNH 499 (622) Q Consensus 427 i~~~~~~flWV~DFPLFe~dee~~r~~s~HHPFTaP~~~~~~l~~~dp~-~v~a~~YDLVlNG~EiggGSiRIH 499 (622) .+.+..+|.||+.-+.+ ...+...+--|+.|-.+....+. .. ....-.||||++|-.|+|=++|+. T Consensus 80 hd~g~l~~S~i~~~~~~-----~~~~~~~~~~~~~~~~~~l~~lg--v~~~~~~~~nDl~v~gkKisG~Aq~~~ 146 (248) T COG0095 80 HDLGNLNYSVITPDEGG-----LESYETLYKFLLQPVIDALRALG--VEGAECPGRNDLVVDGKKISGSAQRRT 146 (248) T ss_pred ECCCCEEEEEEECCCCC-----CCCHHHHHHHHHHHHHHHHHHHC--CCEECCCCCCCEEECCEEEEEEHHHHH T ss_conf 54897899999789864-----33499999999999999999848--750016898757884789988367660 No 200 >pfam08022 FAD_binding_8 FAD-binding domain. Probab=48.07 E-value=3.9 Score=20.47 Aligned_cols=23 Identities=22% Similarity=0.353 Sum_probs=15.1 Q ss_pred CCCCEEEEEEEECCCEEECCCCCCCCHHHCCCCC Q ss_conf 5575102889607705354554700021100247 Q gi|254780419|r 428 NCNCFEFCWIMDFPFYEWNEEEKKIDFAHNPFTM 461 (622) Q Consensus 428 ~~~~~~flWV~DFPLFe~dee~~r~~s~HHPFTa 461 (622) .+++|.|+|+ | .-+.|++ ||||- T Consensus 31 ~pGq~vyl~~---p------~is~~q~--HPFTI 53 (103) T pfam08022 31 KPGQYIFINF---P------PISFLQS--HPFTI 53 (103) T ss_pred CCCCEEEEEE---C------CCCCCCC--CCCEE T ss_conf 9983999997---9------6576534--88230 No 201 >TIGR01622 SF-CC1 splicing factor, CC1-like family; InterPro: IPR006509 These sequences represent a subfamily of RNA splicing factors including the Pad-1 protein (Neurospora crassa), CAPER (mouse) and CC1.3 (human). All are characterised by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 human paralogs, two) RNA recognition domains. These splicing factors are closely related to the U2AF splicing factor family (IPR006529 from INTERPRO). ; GO: 0003723 RNA binding, 0006397 mRNA processing, 0005634 nucleus. Probab=48.05 E-value=10 Score=17.52 Aligned_cols=15 Identities=47% Similarity=0.775 Sum_probs=6.9 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999999999999998 Q gi|254780419|r 250 LNTMENVLRGVFEEF 264 (622) Q Consensus 250 m~l~E~li~~i~~~v 264 (622) ++++|..|+.||.-+ T Consensus 245 fnitE~~L~~iFepF 259 (531) T TIGR01622 245 FNITEQELRQIFEPF 259 (531) T ss_pred HCCCHHHHHHHHHCC T ss_conf 003578874242104 No 202 >pfam10778 DehI Halocarboxylic acid dehydrogenase DehI. Haloacid dehalogenases catalyse the removal of halides from organic haloacids. DehI can process both L- and D-substrates. A crucial aspartate residue is predicted to activate a water molecule for nucleophilic attack of the substrate chiral centre resulting in an inversion of the configuration of either L- or D-substrates in contrast to D-only enzymes. Probab=47.74 E-value=13 Score=16.77 Aligned_cols=11 Identities=9% Similarity=0.298 Sum_probs=4.2 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999998 Q gi|254780419|r 254 ENVLRGVFEEF 264 (622) Q Consensus 254 E~li~~i~~~v 264 (622) +..+...|..+ T Consensus 13 p~f~~~~W~~l 23 (159) T pfam10778 13 PTFLPAAWRAL 23 (159) T ss_pred HHHHHHHHHHH T ss_conf 37899999998 No 203 >PRK08763 single-stranded DNA-binding protein; Provisional Probab=47.53 E-value=19 Score=15.59 Aligned_cols=85 Identities=16% Similarity=0.116 Sum_probs=52.4 Q ss_pred EEEEEEEECC---CC--CCC-CEEEEEE------ECCCCE-------EEEEECCCCHHHHHHHCCCCCCEEEEEEEEEEC Q ss_conf 8999979800---44--469-8799999------779825-------999974881578998528988589999999975 Q gi|254780419|r 20 FVRLSGWVHR---VR--PHG-GIIFLDI------RDHYGI-------TQVVANPDSACFEIAKSVRSEWVLCIDGLVMSR 80 (622) Q Consensus 20 ~V~i~GwV~~---~R--~~g-~l~F~~l------rD~sG~-------~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r 80 (622) +|+|.|+|.+ .| ..| .++-+.| +|..|. +.|++-.+ .+....+.|+.++-|.|+|.++.| T Consensus 7 kV~LiG~l~~Dpelr~t~sG~~v~~fslA~~~~~~~~~g~~~~~t~w~~vv~~gk-~AE~~~~yl~KG~~V~VeGrL~~~ 85 (160) T PRK08763 7 KVILVGNLGNDPDIKYTQSGMTITRISLATTSMRKDREGNTQERTEWHRVKFFGK-LGEIAGEYLRKGSQCYIEGEIRYD 85 (160) T ss_pred EEEEEEECCCCCEEEECCCCCEEEEEEEEECCCEECCCCCCEECEEEEEEEEEHH-HHHHHHHHHHCCCEEEEEEEEEEE T ss_conf 9999875475975878899987999999977873568888252105999998368-999999874179989998888885 Q ss_pred CCCCCCCCCCCCEEEEEEEEEEEEEC Q ss_conf 86555887778659999889999502 Q gi|254780419|r 81 SAETINANIITGQIELSAQKIEILSV 106 (622) Q Consensus 81 ~~~~~n~~~~tG~~Ei~~~~~~il~~ 106 (622) .=...+ ..+-=..||.+++++.|.. T Consensus 86 ~y~dkd-G~kr~~teIv~~~v~~l~~ 110 (160) T PRK08763 86 KFTGQD-GQERYVTEIVADEMQMLGG 110 (160) T ss_pred EEECCC-CCEEEEEEEEEEEEEECCC T ss_conf 517589-9889999999988898137 No 204 >KOG2298 consensus Probab=47.44 E-value=6.7 Score=18.74 Aligned_cols=91 Identities=27% Similarity=0.307 Sum_probs=43.6 Q ss_pred EEEHHHHCCCCC-CCC--CCCCCEEEEEEEECCCCH--------HHHHHHHHHHHHHH-------------HHHHHCCCC Q ss_conf 110011037776-655--380000356541026899--------99999999999999-------------999808723 Q gi|254780419|r 214 FQIAPCFRDEDP-RSD--RLPGEFYQLDMEMSFVEQ--------DDILNTMENVLRGV-------------FEEFSNGNT 269 (622) Q Consensus 214 fqiarcFRdEd~-R~d--Rqp~EFTqlD~EmsF~~~--------edvm~l~E~li~~i-------------~~~v~~~~~ 269 (622) =|||+.||||=| |.- |- -||||-++| -|+|- .+|-+.-=.|+..- -..|..+. T Consensus 213 AqiG~~fRNEISpRsGLlRv-rEF~maEIE-HFvdP~~K~h~kF~~V~~~~l~l~~~~~q~~g~~a~~~~lgEaV~kg~- 289 (599) T KOG2298 213 AQIGKSFRNEISPRSGLLRV-REFTMAEIE-HFVDPLLKSHPKFSLVAAEKLRLFPRDKQLSGQSAQKVALGEAVAKGT- 289 (599) T ss_pred HHHCHHHHHCCCCCCCCEEE-EEEEHHHHH-CCCCCCCCCCHHHHHHHHHHHHHCCHHHHHCCCHHHHHHHHHHHHHHC- T ss_conf 87465765133755574367-875467765-348977778736665543155423323443465056747989987604- Q ss_pred CCCCCCCCCHHHH-----HHHCCCCCCCCCCCCCCCCC-CCCCCCC Q ss_conf 4577321218988-----75136777644476401244-3101787 Q gi|254780419|r 270 VSNNFLRICYDDA-----IRRYGTDKPDLRNPIIMHNV-TEHFVHS 309 (622) Q Consensus 270 i~~pF~rmtY~eA-----m~~YGsDKPDLR~~lei~di-t~~~~~~ 309 (622) + .=.++-|.-| +.+-|+||--|||.-.+.|- ..+..+| T Consensus 290 V--~netlGyfi~Ri~~fL~~lGid~~rlRFRqH~~nEMAHYA~DC 333 (599) T KOG2298 290 V--NNETLGYFIGRIYLFLNKLGIDKERLRFRQHMANEMAHYAFDC 333 (599) T ss_pred C--CCCHHHHHHHHHHHHHHHHCCCHHHCCHHHHHHHHHHHHHCCC T ss_conf 4--5303678899999999984765532106777653333330153 No 205 >pfam08624 CRC_subunit Chromatin remodelling complex subunit. This family has been identified as a subunit of chromatin remodelling complexes. Saccharomyces cerevisiae RSC7 and its paralogue the SWI/SNF global transcription activator complex subunit SWP82 have been identified as subunits of the RSC chromatin remodelling complex, and SWI/SNF chromatin remodelling complex respectively. Probab=46.85 E-value=8.9 Score=17.86 Aligned_cols=35 Identities=23% Similarity=0.345 Sum_probs=17.4 Q ss_pred EEHHH--HCCCCCCCCCCCCCEEEEEEEECCCCHHHHHHHHHH Q ss_conf 10011--037776655380000356541026899999999999 Q gi|254780419|r 215 QIAPC--FRDEDPRSDRLPGEFYQLDMEMSFVEQDDILNTMEN 255 (622) Q Consensus 215 qiarc--FRdEd~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~ 255 (622) +.||| |||+.+=-.+|| ...- =-+++++-..++|. T Consensus 53 e~aR~lg~rDSy~ff~~~~-~L~k-----i~~~~~Ek~~Li~~ 89 (139) T pfam08624 53 EPARLLGFRDSYLFFQKHP-SLYK-----IITTQDEKEDLIDR 89 (139) T ss_pred HHHHHHCCCHHHHHHHHCH-HHHE-----EECCHHHHHHHHHC T ss_conf 7788836432899998580-7533-----52798999999975 No 206 >TIGR01398 FlhA flagellar biosynthesis protein FlhA; InterPro: IPR006301 These sequences describe the flagellar biosynthesis protein FlhA, one of a large number of genes associated with the biosynthesis of functional bacterial flagella. Homologs of many such proteins, including FlhA, function in type III protein secretion systems. ; GO: 0015031 protein transport, 0016021 integral to membrane, 0019861 flagellum. Probab=45.65 E-value=19 Score=15.52 Aligned_cols=94 Identities=14% Similarity=0.093 Sum_probs=47.4 Q ss_pred CC-CHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCHHHHHHHHHHHCC-CCCCCEEEEECCCCCCCCHHHH Q ss_conf 21-035677999876420246235520477753335702553252789999998188-7557269980885411001347 Q gi|254780419|r 338 RA-FCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNIVNTEAIRVRLDM-QDGDSCFFIAGDPLKFCSFAAD 415 (622) Q Consensus 338 rk-~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~~~~~~l~~~~~~-~~GD~i~f~a~~~~~~~~~lg~ 415 (622) .+ ..+.+.|.++.-=++-+.+... ++++.+... -++.+-.+.+.+.+.. ..|... =..=.|...++.+.. T Consensus 580 ~~~~~~~L~E~VR~rL~rqI~~~~~--~~~g~L~~~-----~L~~~~E~~l~~~l~~~~~~~~~-~~~L~P~~~~~l~~~ 651 (713) T TIGR01398 580 TKTDPDLLVEHVRQRLGRQITQQYL--DEDGVLKVI-----TLDPDLEAALAEALKEDGEGELL-ELALEPALLEELLRK 651 (713) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHC--CCCCEEEEE-----EECCHHHHHHHHHHHCCCCCCCC-CCCCCHHHHHHHHHH T ss_conf 0687878999999999999999644--889827999-----85838899999874058776421-257787788999999 Q ss_pred HHHHHHHHHHCCCCCCEEEEEEEE Q ss_conf 889998773003557510288960 Q gi|254780419|r 416 ARNHIAQELGIINCNCFEFCWIMD 439 (622) Q Consensus 416 lR~~ia~~l~li~~~~~~flWV~D 439 (622) ++.++.+-++.-....+.|+=||. T Consensus 652 ~~~~~~~~~~~g~~~~~~~vL~t~ 675 (713) T TIGR01398 652 VRKEVEKLAQNGEAEFVPPVLLTS 675 (713) T ss_pred HHHHHHHHHHHHHHHCCEEEEECC T ss_conf 999999998636771312799728 No 207 >PRK06958 single-stranded DNA-binding protein; Provisional Probab=45.64 E-value=20 Score=15.40 Aligned_cols=85 Identities=18% Similarity=0.177 Sum_probs=49.8 Q ss_pred EEEEEEEECC---CCC--CC-CEEEEEEE------CC-CCE-------EEEEECCCCHHHHHHHCCCCCCEEEEEEEEEE Q ss_conf 8999979800---444--69-87999997------79-825-------99997488157899852898858999999997 Q gi|254780419|r 20 FVRLSGWVHR---VRP--HG-GIIFLDIR------DH-YGI-------TQVVANPDSACFEIAKSVRSEWVLCIDGLVMS 79 (622) Q Consensus 20 ~V~i~GwV~~---~R~--~g-~l~F~~lr------D~-sG~-------~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~ 79 (622) +|+|.|+|.+ .|- .| .++-+.|- |. +|- +.|++-.+ .+....+.|+.++-|.|+|.++. T Consensus 6 kV~LiG~L~~DPelr~t~~G~~va~f~lA~~~~~~d~~~ge~~e~t~w~~v~~~gk-~AE~~~~yl~KG~~V~VeGrL~~ 84 (181) T PRK06958 6 KVILVGNLGADPEVRYLPSGDAVANIRLATTDRYKDKASGEFKEATEWHRVAFFGR-LAEIVGEYLKKGSSVYIEGRIRT 84 (181) T ss_pred EEEEEEECCCCCEEEECCCCCEEEEEEEEEECCEEECCCCCEEECCEEEEEECCCH-HHHHHHHHCCCCCEEEEEEEEEE T ss_conf 79998857869658787999779999999863333237883763163778503356-99999864479998999757770 Q ss_pred CCCCCCCCCCCCCEEEEEEEEEEEEEC Q ss_conf 586555887778659999889999502 Q gi|254780419|r 80 RSAETINANIITGQIELSAQKIEILSV 106 (622) Q Consensus 80 r~~~~~n~~~~tG~~Ei~~~~~~il~~ 106 (622) |.=...+ ..+-=-.||.|+++++|.. T Consensus 85 r~w~dkd-G~~ry~tEiv~d~~~~l~~ 110 (181) T PRK06958 85 RKWQGQD-GQDRYSTEIVADQMQMLGG 110 (181) T ss_pred CCCCCCC-CCEEEEEEEEEEEEEEECC T ss_conf 6459789-9989999999868898147 No 208 >COG2317 Zn-dependent carboxypeptidase [Amino acid transport and metabolism] Probab=45.59 E-value=5 Score=19.68 Aligned_cols=12 Identities=33% Similarity=0.750 Sum_probs=5.0 Q ss_pred CCCCCHHHCCCC Q ss_conf 470002110024 Q gi|254780419|r 449 EKKIDFAHNPFT 460 (622) Q Consensus 449 ~~r~~s~HHPFT 460 (622) -+|+.-+||||| T Consensus 226 ~GRlD~S~HPF~ 237 (497) T COG2317 226 RGRLDVSVHPFT 237 (497) T ss_pred CCCCCCCCCCCC T ss_conf 763037889865 No 209 >pfam03223 V-ATPase_C V-ATPase subunit C. Probab=45.46 E-value=20 Score=15.38 Aligned_cols=11 Identities=27% Similarity=0.688 Sum_probs=8.4 Q ss_pred HHHCCCCCCCC Q ss_conf 98608497342 Q gi|254780419|r 528 AFQCGMPPHGG 538 (622) Q Consensus 528 Al~yG~PPHgG 538 (622) .|+||.||+-= T Consensus 307 VLRYGLP~~F~ 317 (371) T pfam03223 307 VLRYGLPPNFQ 317 (371) T ss_pred HHHCCCCCCCE T ss_conf 66437995402 No 210 >PRK11057 ATP-dependent DNA helicase RecQ; Provisional Probab=44.78 E-value=7.8 Score=18.27 Aligned_cols=11 Identities=36% Similarity=0.785 Sum_probs=8.9 Q ss_pred CCCCCCCCCCC Q ss_conf 67776444764 Q gi|254780419|r 287 GTDKPDLRNPI 297 (622) Q Consensus 287 GsDKPDLR~~l 297 (622) |+||||.||-. T Consensus 299 GIdk~dVR~Vi 309 (607) T PRK11057 299 GINKPNVRFVV 309 (607) T ss_pred CCCCCCCCEEE T ss_conf 76777766799 No 211 >COG5608 LEA14-like dessication related protein [Defense mechanisms] Probab=44.24 E-value=8.4 Score=18.06 Aligned_cols=10 Identities=20% Similarity=0.298 Sum_probs=3.8 Q ss_pred EEEEEEEEEE Q ss_conf 9999889999 Q gi|254780419|r 94 IELSAQKIEI 103 (622) Q Consensus 94 ~Ei~~~~~~i 103 (622) +++...++.+ T Consensus 73 y~vymN~Iki 82 (161) T COG5608 73 YAVYMNDIKI 82 (161) T ss_pred EEEEECCEEE T ss_conf 9999865584 No 212 >cd04823 ALAD_PBGS_aspartate_rich Porphobilinogen synthase (PBGS), which is also called delta-aminolevulinic acid dehydratase (ALAD), catalyzes the condensation of two 5-aminolevulinic acid (ALA) molecules to form the pyrrole porphobilinogen (PBG), which is the second step in the biosynthesis of tetrapyrroles, such as heme, vitamin B12 and chlorophyll. This reaction involves the formation of a Schiff base link between the substrate and the enzyme. PBGSs are metalloenzymes, some of which have a second, allosteric metal binding site, beside the metal ion binding site in their active site. Although PBGS is a family of homologous enzymes, its metal ion utilization at catalytic site varies between zinc and magnesium and/or potassium. PBGS can be classified into two groups based on differences in their active site metal binding site. All of PBGS_aspartate_rich contain an aspartate rich metal binding site with the general sequence DXALDX(Y/F)X3G(H/Q)DG. They also contain an allosteric magnesiu Probab=43.71 E-value=5.5 Score=19.39 Aligned_cols=23 Identities=30% Similarity=0.410 Sum_probs=18.7 Q ss_pred HHHHHHHHHHCCCEEEEECCCCCCC Q ss_conf 9999987520365146203321124 Q gi|254780419|r 148 IINSMRCRMIAENFIECSTPILTAS 172 (622) Q Consensus 148 i~~~~R~~l~~~gF~EveTP~L~ks 172 (622) -+.+||+.|+++||. +||||.=| T Consensus 168 rV~aIR~~Ld~~gf~--~~~ImSYs 190 (320) T cd04823 168 RIGAIREALDAEGFT--NVSILSYA 190 (320) T ss_pred HHHHHHHHHHHCCCC--CCEEEEHH T ss_conf 899999999977998--74671245 No 213 >KOG3358 consensus Probab=43.14 E-value=7.7 Score=18.34 Aligned_cols=43 Identities=28% Similarity=0.342 Sum_probs=26.2 Q ss_pred CCCCCCHHHCCCCCCCCCH--HHHHCCCHHHHHHHCCCEEECCEEE Q ss_conf 5470002110024766430--6554188777413136777756772 Q gi|254780419|r 448 EEKKIDFAHNPFTMPQGGM--ESLKGNDLLSIKAFQYDLVCNGFEI 491 (622) Q Consensus 448 e~~r~~s~HHPFTaP~~~~--~~l~~~dp~~v~a~~YDLVlNG~Ei 491 (622) .++++--+|| ||+|-... ..-+-+|-+.=.++++-++|||-+- T Consensus 100 ~TgknLHSHh-f~sPlSgnqEVSafG~dgegDtgD~Wtvic~g~~W 144 (211) T KOG3358 100 KTGKNLHSHH-FTSPLSGNQEVSAFGEDGEGDTGDHWTVICNGKTW 144 (211) T ss_pred ECCCCHHHCC-CCCCCCCCEEEEEECCCCCCCCCCCEEEEECCCCC T ss_conf 3365301114-58877898157750455788866636999679441 No 214 >KOG4059 consensus Probab=43.14 E-value=11 Score=17.27 Aligned_cols=25 Identities=36% Similarity=0.848 Sum_probs=13.1 Q ss_pred CCCCCCCCCCCHHHHHHCC--CCHHHHHHHHHCC Q ss_conf 6454322222103443112--3854564433105 Q gi|254780419|r 177 ARDFLVPSRVNQGSFYALP--QAPQQYKQLLMAS 208 (622) Q Consensus 177 ArdFlVPsR~~~g~fYALp--QSPQlyKQlLmv~ 208 (622) +|-|+. ...|++| -||| |||.+.-| T Consensus 125 cR~Yl~------snl~~~P~~PSp~-Yk~~i~~G 151 (193) T KOG4059 125 CRAYLL------SNLYELPKQPSPT-YKQCIVKG 151 (193) T ss_pred HHHHHH------HHHHHCCCCCCCH-HHHHHHHC T ss_conf 157545------3154355799925-77552013 No 215 >pfam01340 MetJ Met Apo-repressor, MetJ. Probab=43.10 E-value=18 Score=15.68 Aligned_cols=42 Identities=19% Similarity=0.189 Sum_probs=28.0 Q ss_pred HHHCCCCC-CCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHHCCCCHHHH Q ss_conf 21100247-664306554188777413136777756772125123156999999999859998999 Q gi|254780419|r 454 FAHNPFTM-PQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSNVGISKEVV 518 (622) Q Consensus 454 s~HHPFTa-P~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~ 518 (622) |--|.||- |-+.|.++....|.. -|+.-+.+|+.+|+++++. T Consensus 60 AFLHAfTGQPLP~D~Dl~K~~~d~-----------------------iP~~ak~~m~~~Gi~~~~~ 102 (104) T pfam01340 60 AFLHAFTGQPLPDDADLRKERSDE-----------------------IPEAAKEIMRELGIDPETW 102 (104) T ss_pred HHHHHHCCCCCCCHHHHHHCCCCC-----------------------CCHHHHHHHHHHCCCHHHC T ss_conf 999987399899806550038651-----------------------6099999999919983321 No 216 >cd01674 Homoaconitase_Swivel Homoaconitase swivel domain. This family includes homoaconitase and other uncharacterized proteins of the Aconitase family. Homoaconitase is part of an unusual lysine biosynthesis pathway found only in filamentous fungi, in which lysine is synthesized via the alpha-aminoadipate pathway. In this pathway, homoaconitase catalyzes the conversion of cis-homoaconitic acid into homoisocitric acid. The reaction mechanism is believed to be similar to that of other aconitases. This is the swivel domain, which is believed to undergo swivelling conformational change in the enzyme mechanism. Probab=43.06 E-value=9.4 Score=17.70 Aligned_cols=69 Identities=16% Similarity=0.205 Sum_probs=41.5 Q ss_pred HCCCEEECCEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHH--HHHHHHHCCCCCCCCEECCHHHHHHHHCCCC Q ss_conf 13677775677212512315699999999985999899986677--8999986084973422246899999983739 Q gi|254780419|r 480 FQYDLVCNGFEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFG--GLYRAFQCGMPPHGGIAAGIDRIVMLLLGAK 554 (622) Q Consensus 480 ~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg--~lL~Al~yG~PPHgG~alGlDRlvmll~g~~ 554 (622) +..|+++-|---|.||-|.|-+-. ++..|+.---+ ..|+ |+=+++.-|.+|--= --=+++|.+...|.+ T Consensus 44 k~g~ILVaG~NFGcGSSREhA~~a----L~~~Gi~aVIA-~SFa~IF~rN~iNnGLlpI~l-P~~~~~l~~~~~~~~ 114 (129) T cd01674 44 KQGDILVSGFNFGTGSSREQAATA----LLAKGIPLVVS-GSFGNIFSRNSINNALLSIEL-PFLVQKLREAFANES 114 (129) T ss_pred CCCCEEEECCCCCCCCCHHHHHHH----HHHCCCCEEEE-CHHHHHHHHHHHHCCCCEEEC-HHHHHHHHHHHCCCC T ss_conf 788889966865767348899999----99849979998-529999997778718886876-499999999855888 No 217 >cd04491 SoSSB_OBF SoSSB_OBF: A subfamily of OB folds similar to the OB fold of the crenarchaeote Sulfolobus solfataricus single-stranded (ss) DNA-binding protein (SSoSSB). SSoSSB has a single OB fold, and it physically and functionally interacts with RNA polymerase. In vitro, SSoSSB can substitute for the basal transcription factor TBP, stimulating transcription from promoters under conditions in which TBP is limiting, and supporting transcription when TBP is absent. SSoSSB selectively melts the duplex DNA of promoter sequences. It also relieves transcriptional repression by the chromatin Alba. In addition, SSoSSB activates reverse gyrase activity, which involves DNA binding, DNA cleavage, strand passage and ligation. SSoSSB stimulates all these steps in the presence of the chromatin protein, Sul7d. SSoSSB antagonizes the inhibitory effect of Sul7d on reverse gyrase supercoiling activity. It also physically and functionally interacts with Mini-chromosome Maintenance (MCM), stimulating Probab=43.00 E-value=22 Score=15.12 Aligned_cols=59 Identities=20% Similarity=0.190 Sum_probs=41.6 Q ss_pred CCCCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEE Q ss_conf 469879999977982599997488157899852898858999999997586555887778659999889999 Q gi|254780419|r 32 PHGGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELSAQKIEI 103 (622) Q Consensus 32 ~~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~~~~~~i 103 (622) ..|.+.-+.+.|.||.+.+++=.+.. ...+..+++|.+.+.....- .|.+||.+..... T Consensus 20 ~~~~v~~~~i~D~TG~ir~t~W~~~~----~~~l~~Gd~v~i~~~~v~~~---------~g~~el~~~~~s~ 78 (82) T cd04491 20 SEGKVQSGLVGDETGTIRFTLWDEKA----ADDLEPGDVVRIENAYVREF---------NGRLELSVGKNSE 78 (82) T ss_pred CEEEEEEEEEECCCCEEEEEEECCCC----CCCCCCCCEEEEEEEEEEEE---------CCEEEEEECCCCE T ss_conf 76899999999899879999968744----45558999999968999888---------9979999899729 No 218 >cd06811 PLPDE_III_yhfX_like Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme yhfX. This subfamily is composed of the uncharacterized protein yhfX from Escherichia coli K-12 and similar bacterial proteins. These proteins are homologous to bacterial alanine racemases (AR), which are fold type III PLP-dependent enzymes containing an N-terminal PLP-binding TIM-barrel domain and a C-terminal beta-sandwich domain. AR exists as homodimers with active sites that lie at the interface between the TIM barrel domain of one subunit and the beta-sandwich domain of the other subunit. It catalyzes the interconversion between L- and D-alanine, which is an essential component of the peptidoglycan layer of bacterial cell walls. Members of this subfamily may act as PLP-dependent enzymes. Probab=42.61 E-value=22 Score=15.08 Aligned_cols=33 Identities=24% Similarity=0.408 Sum_probs=27.9 Q ss_pred CCEEEEEEEECCCEEECCCCCCCCHHHCCCCCC Q ss_conf 751028896077053545547000211002476 Q gi|254780419|r 430 NCFEFCWIMDFPFYEWNEEEKKIDFAHNPFTMP 462 (622) Q Consensus 430 ~~~~flWV~DFPLFe~dee~~r~~s~HHPFTaP 462 (622) ...++.=||.||-|.||++.+++.++|+-+|.- T Consensus 183 ~gi~i~GvT~FPc~L~d~~~~~~~pT~N~~Tl~ 215 (382) T cd06811 183 PGIRIAGLTSFPCFLYDEEQGDIAPTPNLFTLL 215 (382) T ss_pred CCCEEEEEECEEEEEECCCCCCCCCCCCHHHHH T ss_conf 996796643200687747779511585299999 No 219 >PRK05264 transcriptional repressor protein MetJ; Provisional Probab=42.38 E-value=22 Score=15.06 Aligned_cols=41 Identities=24% Similarity=0.255 Sum_probs=27.7 Q ss_pred HHHCCCCC-CCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHHCCCCHHH Q ss_conf 21100247-66430655418877741313677775677212512315699999999985999899 Q gi|254780419|r 454 FAHNPFTM-PQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSNVGISKEV 517 (622) Q Consensus 454 s~HHPFTa-P~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~lgl~~ee 517 (622) |--|.||- |-+.|.++....|..+ |+.-+++++.+|++.++ T Consensus 61 AFLHAfTGQPLP~D~Dl~K~~~~~i-----------------------P~~~k~~m~~~Gi~~~~ 102 (104) T PRK05264 61 AFLHAFTGQPLPDDEDLRKERSDEI-----------------------PEAAKEIMREMGIDPET 102 (104) T ss_pred HHHHHHCCCCCCCHHHHHHCCCCCC-----------------------HHHHHHHHHHHCCCHHH T ss_conf 9999864998998377754173024-----------------------39999999992988322 No 220 >PRK02801 primosomal replication protein N; Provisional Probab=42.37 E-value=22 Score=15.06 Aligned_cols=81 Identities=20% Similarity=0.293 Sum_probs=48.4 Q ss_pred CEEEEEEEECCC---CC--CC-CEEEEEE------------ECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEEC Q ss_conf 889999798004---44--69-8799999------------779825999974881578998528988589999999975 Q gi|254780419|r 19 SFVRLSGWVHRV---RP--HG-GIIFLDI------------RDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSR 80 (622) Q Consensus 19 ~~V~i~GwV~~~---R~--~g-~l~F~~l------------rD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r 80 (622) ..+.+.|-|... |. .| -.+=+.| |.....+|+|+..+ .....+.++..++.+.|+|-+..+ T Consensus 3 N~l~Lsg~v~~~~~lr~SPAGIp~~~f~LeH~S~Q~EAg~~Rqv~c~i~vv~~G~-~~~~~~~~l~~G~~v~v~GFLa~~ 81 (101) T PRK02801 3 NRLVLSGTVCRTPKRKVSPSGIPHCQFVLEHRSVQEEAGFLRQAWCQMPVIVSGH-QFQAITQSITVGSEITVQGFISCH 81 (101) T ss_pred CEEEEEEEEEECCCEEECCCCCEEEEEEEEEEEEEEECCCCEEEEEEEEEEEEHH-HHHHHHHCCCCCCEEEEEEEEEEC T ss_conf 4899999995067310589978228999987788542798278999999999616-789875045899889999998412 Q ss_pred CCCCCCCCCCCCEEEEEEEEEEEEE Q ss_conf 8655588777865999988999950 Q gi|254780419|r 81 SAETINANIITGQIELSAQKIEILS 105 (622) Q Consensus 81 ~~~~~n~~~~tG~~Ei~~~~~~il~ 105 (622) .. .|. .+.+-++++.|+.+. T Consensus 82 ~~--rng---~~~LVLHa~~Ie~i~ 101 (101) T PRK02801 82 KG--RNG---LSKLVLHAEQIELID 101 (101) T ss_pred CC--CCC---CCCEEEEEEEEEECC T ss_conf 56--578---743999902599819 No 221 >PRK13384 delta-aminolevulinic acid dehydratase; Provisional Probab=42.36 E-value=6.4 Score=18.90 Aligned_cols=22 Identities=27% Similarity=0.376 Sum_probs=16.0 Q ss_pred HHHHHHHHHCCCEEEEECCCCCCC Q ss_conf 999987520365146203321124 Q gi|254780419|r 149 INSMRCRMIAENFIECSTPILTAS 172 (622) Q Consensus 149 ~~~~R~~l~~~gF~EveTP~L~ks 172 (622) +.+||+.|+++|| .+||||.=| T Consensus 175 V~aIR~~Ld~~g~--~~v~ImSYs 196 (323) T PRK13384 175 VKAIRQGLDAAGF--EHVAILAHS 196 (323) T ss_pred HHHHHHHHHHCCC--CCCEEEEHH T ss_conf 9999999997799--874673366 No 222 >TIGR00441 gmhA phosphoheptose isomerase; InterPro: IPR004515 Phosphoheptose isomerase is involved in lipopolysaccharide biosynthesis, and more specifically in the synthesis of glyceromannoheptose 7-phosphate. It may also have a role in virulence in Haemophilus ducreyi.; GO: 0008968 phosphoheptose isomerase activity, 0009244 lipopolysaccharide core region biosynthetic process, 0005737 cytoplasm. Probab=42.05 E-value=17 Score=15.86 Aligned_cols=125 Identities=15% Similarity=0.128 Sum_probs=67.2 Q ss_pred CCCEEEEEECCCCCCCCCCHHHH---HHHHHHH--HHCCCCCEEEEECCCCCCCCCCCHHHHCCH-HHHHHHHHHHCCCC Q ss_conf 78479999627876652103567---7999876--420246235520477753335702553252-78999999818875 Q gi|254780419|r 322 PGYEVWAIPAKGGGGNRAFCDRM---NEWARSQ--GQPGLAYIFWRLDESGNVAGMGPVAKHLNI-VNTEAIRVRLDMQD 395 (622) Q Consensus 322 ~~~~v~~i~~~~~~~srk~~D~l---~e~ak~~--g~kgL~~i~~~~~~~~~~~~~spi~K~l~~-~~~~~l~~~~~~~~ 395 (622) .+.+++++=.. ++..|.. .|++-++ ..+||.=|.+..| ....+.|+..+.. +.-..-.+.++ ++ T Consensus 38 ~GgK~L~CGNG-----gSaadAqHFAaEl~GRf~~eR~glPAIAL~tD----~S~Ltai~NDygyd~vF~RqVEAlG-~~ 107 (186) T TIGR00441 38 NGGKILICGNG-----GSAADAQHFAAELVGRFKLERPGLPAIALNTD----ISILTAIANDYGYDEVFSRQVEALG-QE 107 (186) T ss_pred CCCCEEEECCC-----CCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHH----HHHHHHCCCHHHHHHHHHHHHHHCC-CC T ss_conf 08908985685-----04335888999871640002677646777622----3532320011025678777766328-89 Q ss_pred CCCEEEEE--CCCCCCCCHHHHHHHHHHHHHHCC--CCCCEEEEE-EEECCCEEECC-CCCCCCHHH Q ss_conf 57269980--885411001347889998773003--557510288-96077053545-547000211 Q gi|254780419|r 396 GDSCFFIA--GDPLKFCSFAADARNHIAQELGII--NCNCFEFCW-IMDFPFYEWNE-EEKKIDFAH 456 (622) Q Consensus 396 GD~i~f~a--~~~~~~~~~lg~lR~~ia~~l~li--~~~~~~flW-V~DFPLFe~de-e~~r~~s~H 456 (622) ||+++-++ |....+-++..+.+.+=-+-..|. +-++++.+- --|.||+=|+. .+.|++=+| T Consensus 108 GDVL~GiSTSGNS~NvlkA~~~Ak~~gm~~i~L~G~dGGk~~~l~GPqD~~l~VP~~~~t~RIQE~H 174 (186) T TIGR00441 108 GDVLLGISTSGNSKNVLKAIEAAKDKGMKTIALTGKDGGKLAGLAGPQDVELRVPSKRDTARIQEIH 174 (186) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCEEEECCCCCCCCCHHHHH T ss_conf 8688874247670889999998845796699972178631133578886056767887584032348 No 223 >cd00384 ALAD_PBGS Porphobilinogen synthase (PBGS), which is also called delta-aminolevulinic acid dehydratase (ALAD), catalyzes the condensation of two 5-aminolevulinic acid (ALA) molecules to form the pyrrole porphobilinogen (PBG), which is the second step in the biosynthesis of tetrapyrroles, such as heme, vitamin B12 and chlorophyll. This reaction involves the formation of a Schiff base link between the substrate and the enzyme. PBGSs are metalloenzymes, some of which have a second, allosteric metal binding site, beside the metal ion binding site in their active site. Although PBGS is a family of homologous enzymes, its metal ion utilization at catalytic site varies between zinc and magnesium and/or potassium. PBGS can be classified into two groups based on differences in their active site metal binding site. They either contain a cysteine-rich zinc binding site (consensus DXCXCX(Y/F)X3G(H/Q)CG) or an aspartate-rich magnesium binding site (consensus DXALDX(Y/F)X3G(H/Q)DG). The cyste Probab=41.89 E-value=6.3 Score=18.92 Aligned_cols=23 Identities=22% Similarity=0.379 Sum_probs=18.5 Q ss_pred HHHHHHHHHHCCCEEEEECCCCCCC Q ss_conf 9999987520365146203321124 Q gi|254780419|r 148 IINSMRCRMIAENFIECSTPILTAS 172 (622) Q Consensus 148 i~~~~R~~l~~~gF~EveTP~L~ks 172 (622) -+.+||+.|+++||. +||||.=| T Consensus 163 rV~aIR~~Ld~~gf~--~v~ImSYs 185 (314) T cd00384 163 RVAAIREALDEAGFS--DVPIMSYS 185 (314) T ss_pred HHHHHHHHHHHCCCC--CCEEEEHH T ss_conf 899999999977999--88053227 No 224 >pfam06934 CTI Fatty acid cis/trans isomerase (CTI). This family consists of several fatty acid cis/trans isomerase proteins which appear to be found exclusively in bacteria of the orders Vibrionales and Pseudomonadales. Cis/trans isomerase (CTI) catalyses the cis-trans isomerisation of esterified fatty acids in phospholipids, mainly cis-oleic acid (C(16:1,9)) and cis-vaccenic acid (C(18:1,11)), in response to solvents. The CTI protein has been shown to be involved in solvent resistance in Pseudomonas putida. Probab=41.84 E-value=6.5 Score=18.83 Aligned_cols=14 Identities=29% Similarity=0.878 Sum_probs=11.4 Q ss_pred EEEEEEEECCCEEE Q ss_conf 10288960770535 Q gi|254780419|r 432 FEFCWIMDFPFYEW 445 (622) Q Consensus 432 ~~flWV~DFPLFe~ 445 (622) -+=.||+||||||. T Consensus 409 PkTaWviDYpLlER 422 (688) T pfam06934 409 PKTAWVIDYPLLER 422 (688) T ss_pred CCEEEEECCHHHHH T ss_conf 87244424399999 No 225 >PRK07922 N-acetylglutamate synthase; Validated Probab=41.80 E-value=10 Score=17.48 Aligned_cols=37 Identities=27% Similarity=0.315 Sum_probs=23.4 Q ss_pred HHHHCCCEEEEECCCC--------CCCCCCCCCCCCCCCCCCHHH Q ss_conf 7520365146203321--------124645664543222221034 Q gi|254780419|r 154 CRMIAENFIECSTPIL--------TASSPEGARDFLVPSRVNQGS 190 (622) Q Consensus 154 ~~l~~~gF~EveTP~L--------~ksTpeGArdFlVPsR~~~g~ 190 (622) +||.+.||.||+--.| .+|--||--+||=-+|++|+. T Consensus 116 ~fF~k~GF~ev~k~~lp~kvw~ec~r~~~~~~~~~~~~~~~~~~~ 160 (170) T PRK07922 116 EFFARHGFVEIDGTPVTAEVYEEMLRSYDEGVAEFLDLSRVKPNT 160 (170) T ss_pred HHHHHCCCEECCCCCCCHHHHHHHHHCCCHHHHHHHCCCCCCCCC T ss_conf 899976998766884998999998768652268772532247776 No 226 >PRK06751 single-stranded DNA-binding protein; Provisional Probab=41.62 E-value=23 Score=14.98 Aligned_cols=85 Identities=18% Similarity=0.241 Sum_probs=51.6 Q ss_pred EEEEEEEECC---CCC--CC-CEEEEEEE------CCCC-----EEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCC Q ss_conf 8999979800---444--69-87999997------7982-----599997488157899852898858999999997586 Q gi|254780419|r 20 FVRLSGWVHR---VRP--HG-GIIFLDIR------DHYG-----ITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSA 82 (622) Q Consensus 20 ~V~i~GwV~~---~R~--~g-~l~F~~lr------D~sG-----~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~ 82 (622) +|.|.|+|.+ .|. .| .++-+.|. |..| .+-||+-.+ .+....+.|+.++.|.|+|.++.|.= T Consensus 3 kV~LiGrL~~DPElr~t~~G~~v~~f~lAv~r~~~~~~g~~~td~~~vv~wgk-~AE~~~~yl~KG~~V~VeGrl~~r~y 81 (172) T PRK06751 3 RVILVGRLTKDPDLRYTPNGVAVATFTLAVNRAFANQQGEREADFINCVIWRK-QAENVANYLKKGSLAGVDGRLQTRNY 81 (172) T ss_pred EEEEEEECCCCCEEEECCCCCEEEEEEEEECCCEECCCCCEEEEEEEEEEECH-HHHHHHHHHCCCCEEEEEEEEECCCC T ss_conf 79999876878758888999879999999667602678976778999997288-89999987578998999998624766 Q ss_pred CCCCCCCCCCEEEEEEEEEEEEEC Q ss_conf 555887778659999889999502 Q gi|254780419|r 83 ETINANIITGQIELSAQKIEILSV 106 (622) Q Consensus 83 ~~~n~~~~tG~~Ei~~~~~~il~~ 106 (622) ...+ ..+-=-.||.|++++.|.. T Consensus 82 edkd-G~~r~~teIv~~~~~~L~~ 104 (172) T PRK06751 82 EGQD-GKRVYVTEVLAESVQFLEP 104 (172) T ss_pred CCCC-CCEEEEEEEEEEEEEECCC T ss_conf 8789-9889999999966897146 No 227 >PRK06863 single-stranded DNA-binding protein; Provisional Probab=41.39 E-value=23 Score=14.95 Aligned_cols=85 Identities=15% Similarity=0.222 Sum_probs=48.8 Q ss_pred EEEEEEEECC---CCC--CCC-EEEEEE-E-----CC-CCE-------EEEEECCCCHHHHHHHCCCCCCEEEEEEEEEE Q ss_conf 8999979800---444--698-799999-7-----79-825-------99997488157899852898858999999997 Q gi|254780419|r 20 FVRLSGWVHR---VRP--HGG-IIFLDI-R-----DH-YGI-------TQVVANPDSACFEIAKSVRSEWVLCIDGLVMS 79 (622) Q Consensus 20 ~V~i~GwV~~---~R~--~g~-l~F~~l-r-----D~-sG~-------~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~ 79 (622) +|+|.|+|.+ .|. .|+ ++=+.| - |. +|- ..|++-. ..+......|+.++-|.|+|.++. T Consensus 6 kV~LiGnlg~DPElr~t~sG~~v~~fslA~~~~~~~~~~ge~~e~t~w~~v~~wg-k~AE~~~~yl~KG~~V~VeGrL~~ 84 (168) T PRK06863 6 KVIIVGHLGNDPEIRTMPNGEAVANISVATSESWTDKNTGERREVTEWHRIVFYR-RQAEVAGEYLRKGSQVYVEGRLKT 84 (168) T ss_pred EEEEEEECCCCCEEEECCCCCEEEEEEEEECCCCEECCCCEEEEECHHHHHHHCH-HHHHHHHHHHCCCCEEEEEEEEEE T ss_conf 8999874686854888799987999999961561205887377502032120056-899999987568987999888875 Q ss_pred CCCCCCCCCCCCCEEEEEEEEEEEEEC Q ss_conf 586555887778659999889999502 Q gi|254780419|r 80 RSAETINANIITGQIELSAQKIEILSV 106 (622) Q Consensus 80 r~~~~~n~~~~tG~~Ei~~~~~~il~~ 106 (622) |.=...+-.-.+ ..||.++.++.|+. T Consensus 85 r~w~dk~G~~r~-~teV~~d~~~~l~~ 110 (168) T PRK06863 85 RKWQDQNGQDRY-TTEIQGDVLQMLGG 110 (168) T ss_pred CCCCCCCCCEEE-EEEEEEEEEEECCC T ss_conf 511879999899-99999857998357 No 228 >PRK07459 single-stranded DNA-binding protein; Provisional Probab=41.38 E-value=23 Score=14.95 Aligned_cols=87 Identities=17% Similarity=0.129 Sum_probs=49.4 Q ss_pred EEEEEEEECCC---C--CCCC-EEEEEEE-------CCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCC Q ss_conf 89999798004---4--4698-7999997-------79825999974881578998528988589999999975865558 Q gi|254780419|r 20 FVRLSGWVHRV---R--PHGG-IIFLDIR-------DHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETIN 86 (622) Q Consensus 20 ~V~i~GwV~~~---R--~~g~-l~F~~lr-------D~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n 86 (622) .|+|.|++.+- | ..|+ ++=+.|. +.+.-+.|++-.. .+....+.|+.++-|.|+|.+..+.=...+ T Consensus 5 ~V~LiGrLg~DpElr~t~sG~~v~~fslAv~~~~k~~~t~w~~v~~wgk-~Ae~~~~yl~KG~~V~V~G~l~~~~w~dk~ 83 (121) T PRK07459 5 SVTLVGRAGRDPEVRYFESGSVVCNLTLAVNRRSRDDEPDWFNLEIWGK-TAQVAADYVKKGSLIGITGSLKFDRWTDRN 83 (121) T ss_pred EEEEEEECCCCCEEEECCCCCEEEEEEEEECCCCCCCCCCEEEEEEECH-HHHHHHHHHCCCCEEEEEEEEECCCCEECC T ss_conf 8999996676961889799978999999846753577874999999890-899998873689899999998654247578 Q ss_pred CCCCCCEEEEEEEEEEEEECC Q ss_conf 877786599998899995021 Q gi|254780419|r 87 ANIITGQIELSAQKIEILSVA 107 (622) Q Consensus 87 ~~~~tG~~Ei~~~~~~il~~a 107 (622) ...+-=..||.|++++.|+.- T Consensus 84 ~G~~r~~~~V~v~~i~fl~~k 104 (121) T PRK07459 84 TGEDRSKPVIRVDRLELLGSK 104 (121) T ss_pred CCEEEEEEEEEEEEEEECCCC T ss_conf 998999999999699983688 No 229 >TIGR00736 nifR3_rel_arch TIM-barrel protein, putative; InterPro: IPR005270 The function of this family is unknown, but it may include TIM-barrel proteins.. Probab=41.02 E-value=10 Score=17.36 Aligned_cols=30 Identities=30% Similarity=0.572 Sum_probs=17.9 Q ss_pred HHHHHHHHCCCHHHHEEE-HHHHCCCCCCCCCCCCCEEEEEE Q ss_conf 456443310570233110-01103777665538000035654 Q gi|254780419|r 199 QQYKQLLMASGFDRYFQI-APCFRDEDPRSDRLPGEFYQLDM 239 (622) Q Consensus 199 QlyKQlLmv~G~dryfqi-arcFRdEd~R~dRqp~EFTqlD~ 239 (622) .+-+.++ +.-+.-+-|| |+| ||| |||.|=+ T Consensus 84 ~~~v~~~-~ae~~diiEiNaHC---------RQP-EiteiG~ 114 (234) T TIGR00736 84 ELDVELL-VAEFADIIEINAHC---------RQP-EITEIGI 114 (234) T ss_pred HHHHHHH-HHHHCCEEEECCCC---------CCC-CEEEECC T ss_conf 5677665-42113737885758---------897-6155056 No 230 >KOG1637 consensus Probab=40.50 E-value=21 Score=15.17 Aligned_cols=109 Identities=21% Similarity=0.307 Sum_probs=59.7 Q ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCCCCCCC---CC-CCC-----CCCCCCCCHHHHHHCCC--CHHHHHHHHHCCCHH Q ss_conf 45899999998752036514620332112464---56-645-----43222221034431123--854564433105702 Q gi|254780419|r 143 VKRTRIINSMRCRMIAENFIECSTPILTASSP---EG-ARD-----FLVPSRVNQGSFYALPQ--APQQYKQLLMASGFD 211 (622) Q Consensus 143 ~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTp---eG-Ard-----FlVPsR~~~g~fYALpQ--SPQlyKQlLmv~G~d 211 (622) ++=.+++..||.-...+||.||-||.+.++.- .| ... |-|-. .+ .-|||-- .|+. -||-+--+ T Consensus 193 ~iyN~Lv~fir~ey~~rGf~EVitPniy~~~LWe~SGHwqnY~enmF~~e~--ek-e~~~LKPMNCPgH---cLmf~~r~ 266 (560) T KOG1637 193 RIYNTLVDFIRAEYRKRGFTEVITPNIYNKKLWETSGHWQNYSENMFKFEV--EK-EEFALKPMNCPGH---CLMFAHRD 266 (560) T ss_pred HHHHHHHHHHHHHHHHCCCCEECCCCHHHHHHHHHCCCHHHHHHHCEEEEE--CH-HHHCCCCCCCCCC---CCCCCCCC T ss_conf 278899999999987538743227221215666532525554431012310--10-3322586689753---01336677 Q ss_pred H--------HEEEHHHHCCCCCCC----CCCCCCEEEEEEEECCCCHHHHHHHHHHHHHH Q ss_conf 3--------311001103777665----53800003565410268999999999999999 Q gi|254780419|r 212 R--------YFQIAPCFRDEDPRS----DRLPGEFYQLDMEMSFVEQDDILNTMENVLRG 259 (622) Q Consensus 212 r--------yfqiarcFRdEd~R~----dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~ 259 (622) | .=-++..-|+|-|-+ .|- -+|.|=|.- =|.+.++|-+.+.+.+.. T Consensus 267 rS~reLPlR~aDFg~LHRnE~SGaLsGLTRv-RrFqQDDaH-IFCt~~Qi~~Eik~~l~f 324 (560) T KOG1637 267 RSYRELPLRFADFGVLHRNEASGALSGLTRV-RRFQQDDAH-IFCTPDQVKEEIKGCLDF 324 (560) T ss_pred CCHHHCCCCCCCCCEEEECCCCCCCCCCCEE-EEECCCCCE-EEECCCCHHHHHHHHHHH T ss_conf 4274478422675225305456423221011-011015715-885576379999879999 No 231 >PRK00439 leuD 3-isopropylmalate dehydratase small subunit; Reviewed Probab=40.18 E-value=16 Score=16.04 Aligned_cols=20 Identities=25% Similarity=0.478 Sum_probs=10.7 Q ss_pred CCEEECCEEECCCCEECCCH Q ss_conf 67777567721251231569 Q gi|254780419|r 482 YDLVCNGFEIASGGIRNHMS 501 (622) Q Consensus 482 YDLVlNG~EiggGSiRIHd~ 501 (622) .|+++-|---|-||-|.|-+ T Consensus 49 gdIiVaG~NFG~GSSRE~A~ 68 (163) T PRK00439 49 GDIIVAGKNFGCGSSREHAP 68 (163) T ss_pred CCEEEECCCCCCCCCHHHHH T ss_conf 75899688516888679999 No 232 >KOG0353 consensus Probab=40.04 E-value=9.5 Score=17.66 Aligned_cols=10 Identities=60% Similarity=0.926 Sum_probs=6.2 Q ss_pred CCCCCCCCCC Q ss_conf 6777644476 Q gi|254780419|r 287 GTDKPDLRNP 296 (622) Q Consensus 287 GsDKPDLR~~ 296 (622) |+||||.||- T Consensus 380 gidkpdvrfv 389 (695) T KOG0353 380 GIDKPDVRFV 389 (695) T ss_pred CCCCCCEEEE T ss_conf 6788871699 No 233 >PRK06461 single-stranded DNA-binding protein; Reviewed Probab=39.81 E-value=24 Score=14.79 Aligned_cols=86 Identities=19% Similarity=0.199 Sum_probs=57.9 Q ss_pred CHHHCCHHHCCCEEEEEEEECCC-------CCC--CCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEE Q ss_conf 04236979989889999798004-------446--987999997798259999748815789985289885899999999 Q gi|254780419|r 8 SCGDLGVSDVGSFVRLSGWVHRV-------RPH--GGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVM 78 (622) Q Consensus 8 ~cg~l~~~~~g~~V~i~GwV~~~-------R~~--g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~ 78 (622) ...+|.... +.|.|.|.|-.+ ++. ++++=+.+-|.||.+-+.+-.+ .+..+..+++|.++.--. T Consensus 6 KI~dL~pg~--~~V~v~~rVl~~~~~r~~~~~~g~~~V~~~~V~DeTG~I~~tlWde-----~~~~i~~GD~V~I~nayv 78 (130) T PRK06461 6 KIKDLKPGM--RRVNVTGRVLEVGEPKVIQTKGGPRTISEAVVGDETGRVKLTLWGD-----QAGTLKEGEVVKIENAWT 78 (130) T ss_pred EHHHCCCCC--CCEEEEEEEEECCCCCEEECCCCCEEEEEEEEECCCCEEEEEEECC-----CCCCCCCCCEEEEECCEE T ss_conf 898888998--9826999999768972786289966899999987998699999456-----456468999999944798 Q ss_pred ECCCCCCCCCCCCCEEEEEEEEEEEEECCCC Q ss_conf 7586555887778659999889999502117 Q gi|254780419|r 79 SRSAETINANIITGQIELSAQKIEILSVAEE 109 (622) Q Consensus 79 ~r~~~~~n~~~~tG~~Ei~~~~~~il~~a~~ 109 (622) . +--|.++|.+.+.--+.+... T Consensus 79 ~---------~~~g~l~L~vGk~g~I~~~~d 100 (130) T PRK06461 79 T---------LYRGKVQLNVGRYGSISESDD 100 (130) T ss_pred E---------EECCEEEEEECCCEEEEECCC T ss_conf 8---------888869998689626897687 No 234 >TIGR00787 dctP TRAP transporter solute receptor, DctP family; InterPro: IPR004682 TRAP-T family proteins generally consist of three components, and these systems have so far been found in Gram-negative bacteria, Gram-positive bacteria and archaea. Only one member of the family has been both sequenced and functionally characterised. This system is the DctPQM system of Rhodobacter capsulatus. DctP is a periplasmic dicarboxylate (malate, fumarate, succinate) binding receptor that is biochemically well-characterised.; GO: 0006810 transport, 0030288 outer membrane-bounded periplasmic space. Probab=39.58 E-value=24 Score=14.76 Aligned_cols=47 Identities=13% Similarity=0.306 Sum_probs=25.6 Q ss_pred CEEEEEEEECCCCHHHHHHHHHH-HHHH-HHHHHHCCCCCCCCCCCCCHHHH Q ss_conf 00356541026899999999999-9999-99998087234577321218988 Q gi|254780419|r 233 EFYQLDMEMSFVEQDDILNTMEN-VLRG-VFEEFSNGNTVSNNFLRICYDDA 282 (622) Q Consensus 233 EFTqlD~EmsF~~~edvm~l~E~-li~~-i~~~v~~~~~i~~pF~rmtY~eA 282 (622) |+-=+|+=--|-|++.+++++.+ -+-. +.+++. ... .-|.++.|+++ T Consensus 74 ~~~v~dlPflF~d~~~~~Kvl~G~~vG~~L~~~~e-~~~--LG~~~L~~w~n 122 (267) T TIGR00787 74 ELAVLDLPFLFKDTNHVHKVLDGSEVGKELLKSLE-KKN--LGLKGLAYWDN 122 (267) T ss_pred HHHHHCCCHHHHCHHHHHHHHCCCHHHHHHHHHHH-HHH--CCCCCHHHHHC T ss_conf 68771676243148999987366588999999888-763--24000033233 No 235 >pfam00490 ALAD Delta-aminolevulinic acid dehydratase. Probab=39.46 E-value=6.9 Score=18.68 Aligned_cols=23 Identities=22% Similarity=0.407 Sum_probs=18.4 Q ss_pred HHHHHHHHHHCCCEEEEECCCCCCC Q ss_conf 9999987520365146203321124 Q gi|254780419|r 148 IINSMRCRMIAENFIECSTPILTAS 172 (622) Q Consensus 148 i~~~~R~~l~~~gF~EveTP~L~ks 172 (622) -+.+||+.|+++||. +||||.=| T Consensus 171 rV~aIR~~Ld~~g~~--~~~ImSYs 193 (322) T pfam00490 171 RVGAIREALDEAGFT--DVPIMSYS 193 (322) T ss_pred HHHHHHHHHHHCCCC--CCEEEECH T ss_conf 899999999977999--87264306 No 236 >cd00490 Met_repressor_MetJ Met Repressor, MetJ. MetJ is a bacterial regulatory protein that uses S-adenosylmethionine (SAM) as a corepressor to regulate the production of Methionine. MetJ binds arrays of two to five adjacent copies of an eight base-pair 'metbox' sequence. MetJ forms sufficiently strong interactions with the sugar-phosphate backbone to accomodate sequence variation in natural operators. However, it is very sensitive to particular base changes in the operator. MetJ exists as a homodimer. Probab=39.19 E-value=23 Score=14.86 Aligned_cols=41 Identities=20% Similarity=0.208 Sum_probs=27.2 Q ss_pred HHHCCCCC-CCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHHCCCCHHH Q ss_conf 21100247-66430655418877741313677775677212512315699999999985999899 Q gi|254780419|r 454 FAHNPFTM-PQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSNVGISKEV 517 (622) Q Consensus 454 s~HHPFTa-P~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~lgl~~ee 517 (622) |--|.||- |-+.|.++....|.. -|+.-+.+|+.+|+++++ T Consensus 60 AFLHAfTGQPLP~D~Dl~K~~~d~-----------------------iP~~ak~~m~~~Gi~~~~ 101 (103) T cd00490 60 AFLHAFTGQPLPDDADLRKERSDE-----------------------IPEAAKELMREMGIEPET 101 (103) T ss_pred HHHHHHCCCCCCCHHHHHHCCCCC-----------------------CCHHHHHHHHHCCCCHHC T ss_conf 999987399899806661138641-----------------------609999999990998100 No 237 >KOG1559 consensus Probab=38.92 E-value=9.6 Score=17.63 Aligned_cols=19 Identities=21% Similarity=0.387 Sum_probs=11.0 Q ss_pred HHHHHCCCCCC-CC-EECCHH Q ss_conf 99986084973-42-224689 Q gi|254780419|r 526 YRAFQCGMPPH-GG-IAAGID 544 (622) Q Consensus 526 L~Al~yG~PPH-gG-~alGlD 544 (622) +.+|-|---|. .| -|.|.| T Consensus 314 lsnLIYnykPtY~Gy~gsg~d 334 (340) T KOG1559 314 LSNLIYNYKPTYCGYAGSGYD 334 (340) T ss_pred HHHHHHCCCCCEECCCCCCCC T ss_conf 987772067630234667664 No 238 >PTZ00182 3-methyl-2-oxobutanate dehydrogenase; Provisional Probab=38.89 E-value=16 Score=16.11 Aligned_cols=22 Identities=18% Similarity=0.084 Sum_probs=9.9 Q ss_pred CCCCCCEEEEEEEC---CCCEEEEE Q ss_conf 44469879999977---98259999 Q gi|254780419|r 30 VRPHGGIIFLDIRD---HYGITQVV 51 (622) Q Consensus 30 ~R~~g~l~F~~lrD---~sG~~Q~v 51 (622) +|.-....|.-.-+ .+|.+|=+ T Consensus 11 ~~~~~~~~~~~~~~~~~~~g~~~~~ 35 (355) T PTZ00182 11 RRQPSEFSFAPRFGSEESSGNTKKM 35 (355) T ss_pred CCCCCCEEECCCCCCCCCCCCCEEE T ss_conf 5154542426999987778882265 No 239 >cd06546 GH18_CTS3_chitinase GH18 domain of CTS3 (chitinase 3), an uncharacterized protein from the human fungal pathogen Coccidioides posadasii. CTS3 has a chitinase-like glycosyl hydrolase family 18 (GH18) domain; and has homologs in bacteria as well as fungi. Probab=38.88 E-value=19 Score=15.55 Aligned_cols=43 Identities=19% Similarity=0.373 Sum_probs=22.7 Q ss_pred CHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEE-CCCCHHHHHHHHHHHHHH Q ss_conf 702331100110377766553800003565410-268999999999999999 Q gi|254780419|r 209 GFDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEM-SFVEQDDILNTMENVLRG 259 (622) Q Consensus 209 G~dryfqiarcFRdEd~R~dRqp~EFTqlD~Em-sF~~~edvm~l~E~li~~ 259 (622) .|++||..-+-+=- ++ -|..||+-. ..++.++|..++..|-.. T Consensus 96 ~F~~yY~~L~~~i~-----~~---~ldGiDLDvEe~~sl~~i~~LI~~L~~D 139 (256) T cd06546 96 DFERYYGQLRDMIR-----RR---GLDGLDLDVEEPMSLDGIIRLIDRLRSD 139 (256) T ss_pred HHHHHHHHHHHHHH-----HC---CCCCEEEECCCCCCHHHHHHHHHHHHHH T ss_conf 28999999999999-----67---9986565044667989999999999985 No 240 >KOG0188 consensus Probab=38.52 E-value=20 Score=15.38 Aligned_cols=15 Identities=27% Similarity=0.585 Sum_probs=7.8 Q ss_pred CEEEEEE-ECCCCEEE Q ss_conf 8799999-77982599 Q gi|254780419|r 35 GIIFLDI-RDHYGITQ 49 (622) Q Consensus 35 ~l~F~~l-rD~sG~~Q 49 (622) +++|+.- |+..|.++ T Consensus 207 NlVFIq~NRe~dGsL~ 222 (895) T KOG0188 207 NIVFIQYNREADGSLK 222 (895) T ss_pred EEEEEEECCCCCCCCC T ss_conf 0125774135687656 No 241 >TIGR01237 D1pyr5carbox2 delta-1-pyrroline-5-carboxylate dehydrogenase, putative; InterPro: IPR005932 The delta(1)-pyrroline-5-carboxylate synthetase (1.5.1.12 from EC) a mitochondrial inner membrane, ATP- and NADPH-dependent, bifunctional enzyme, catalyzes the reduction of glutamate to delta1-pyrroline-5-carboxylate, a critical step in the de novo biosynthesis of proline and ornithine. It is the rate-limiting enzyme in proline biosynthesis and is subject to feedback inhibition by proline.1-pyrroline-5-carboxylate + NAD+ + H2O = L-glutamate + NADH This model represents one of several related branches of delta-1-pyrroline-5-carboxylate dehydrogenase.; GO: 0003842 1-pyrroline-5-carboxylate dehydrogenase activity, 0006561 proline biosynthetic process. Probab=38.36 E-value=14 Score=16.53 Aligned_cols=17 Identities=18% Similarity=0.360 Sum_probs=11.0 Q ss_pred HHCCCCCCEEEEEEEEE Q ss_conf 85289885899999999 Q gi|254780419|r 62 AKSVRSEWVLCIDGLVM 78 (622) Q Consensus 62 ~~~l~~esvv~v~G~V~ 78 (622) ...|...+.+.|.|.-+ T Consensus 27 ke~LGK~Ypl~InGe~V 43 (518) T TIGR01237 27 KELLGKDYPLVINGEEV 43 (518) T ss_pred HHHCCCCCCCEECCCEE T ss_conf 97447716840368433 No 242 >PRK13867 type IV secretion system chaperone VirE1; Provisional Probab=37.52 E-value=7.6 Score=18.36 Aligned_cols=14 Identities=36% Similarity=0.560 Sum_probs=10.6 Q ss_pred CCCCCCEEEEEEEE Q ss_conf 53800003565410 Q gi|254780419|r 228 DRLPGEFYQLDMEM 241 (622) Q Consensus 228 dRqp~EFTqlD~Em 241 (622) ..||.-||.||+|| T Consensus 31 ~~~~ngftsldl~m 44 (63) T PRK13867 31 SHPSNGFTSLDLEM 44 (63) T ss_pred CCCCCCCCCCCHHH T ss_conf 56889832113667 No 243 >PRK08402 replication factor A; Reviewed Probab=36.34 E-value=27 Score=14.42 Aligned_cols=78 Identities=23% Similarity=0.193 Sum_probs=52.2 Q ss_pred CEEEEEEEECC--------CCC--CCCEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEE-EEEECCCCCCCC Q ss_conf 88999979800--------444--6987999997798259999748815789985289885899999-999758655588 Q gi|254780419|r 19 SFVRLSGWVHR--------VRP--HGGIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDG-LVMSRSAETINA 87 (622) Q Consensus 19 ~~V~i~GwV~~--------~R~--~g~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G-~V~~r~~~~~n~ 87 (622) +.|-+.|+|-+ +++ .|+++=+.+-|.||.+-+|.= |....+....|..++||.|.- .|+.. T Consensus 75 rnVeivgrIi~ky~~REf~kkDGS~GkVASL~I~DeTG~aRLvLW-D~~v~~yy~~l~~GDVIKv~~~~Vres------- 146 (357) T PRK08402 75 REVNIVGRILKKYPPREYTKKDGSTGRVASLIIYDDTGRARVVLW-DAKVAEYYNKLEVGDVIKVLDAQVRES------- 146 (357) T ss_pred CCCEEEEEEEEECCCHHHHHCCCCCCEEEEEEEECCCCCEEEEEE-CHHHHHHHHHCCCCCEEEEECHHHHHH------- T ss_conf 675799987330580553003698753789999737883899983-387898897537686699833134431------- Q ss_pred CCCCCEEEEEEEEE--EEEEC Q ss_conf 77786599998899--99502 Q gi|254780419|r 88 NIITGQIELSAQKI--EILSV 106 (622) Q Consensus 88 ~~~tG~~Ei~~~~~--~il~~ 106 (622) + .|-.|+++.-- -|+|+ T Consensus 147 -~-~g~peLhv~fr~riI~np 165 (357) T PRK08402 147 -L-SGLPELHINFRARIILNP 165 (357) T ss_pred -C-CCCHHEEECCEEEEEECC T ss_conf -1-797120340135687178 No 244 >KOG4330 consensus Probab=36.19 E-value=13 Score=16.77 Aligned_cols=49 Identities=22% Similarity=0.412 Sum_probs=25.0 Q ss_pred CCCCEEEEEEEECCCCHHHHHHHHHHHHHH-------HHHHHHCCCCC--CCCCCCCCHHHHHHHCCCC Q ss_conf 800003565410268999999999999999-------99998087234--5773212189887513677 Q gi|254780419|r 230 LPGEFYQLDMEMSFVEQDDILNTMENVLRG-------VFEEFSNGNTV--SNNFLRICYDDAIRRYGTD 289 (622) Q Consensus 230 qp~EFTqlD~EmsF~~~edvm~l~E~li~~-------i~~~v~~~~~i--~~pF~rmtY~eAm~~YGsD 289 (622) +| +||. ..|.-+.|.|++. -+..|++...- -..|-..||++-|.|||+- T Consensus 143 k~-~FTl----------rqVqmICErllKerE~klReeyE~vLttKLaEQydafVkFt~dQi~rry~~~ 200 (206) T KOG4330 143 KP-LFTL----------RQVQMICERLLKEREIKLREEYEMVLTTKLAEQYDAFVKFTHDQIMRRYGST 200 (206) T ss_pred CC-HHHH----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 32-0009----------9999999999999999999999999998899999999999899999987368 No 245 >cd01579 AcnA_Bact_Swivel Bacterial Aconitase-like swivel domain. Aconitase (aconitate hydratase or citrate hydrolyase) catalyzes the reversible isomerization of citrate and isocitrate as part of the TCA cycle. Cis-aconitate is formed as an intermediate product during the course of the reaction. This is the aconitase-like swivel domain, which is believed to undergo swivelling conformational change in the enzyme mechanism. This distinct subfamily is found only in bacteria and archea. Its exact characteristics are not known. Probab=35.20 E-value=17 Score=15.89 Aligned_cols=48 Identities=15% Similarity=0.216 Sum_probs=29.9 Q ss_pred CEEECCEEECCCCEECCCHHHHHHHHHHCCCCH---HHHHHHHHHHHHHHHCCCCCC Q ss_conf 777756772125123156999999999859998---999866778999986084973 Q gi|254780419|r 483 DLVCNGFEIASGGIRNHMSEVMLQAFSNVGISK---EVVENRFGGLYRAFQCGMPPH 536 (622) Q Consensus 483 DLVlNG~EiggGSiRIHd~~iQ~~~f~~lgl~~---ee~~~~Fg~lL~Al~yG~PPH 536 (622) .+++-|-.-|.||-|.|-+-. ++.+|+.- +.... -|+=++...|.+|- T Consensus 50 ~IiVaG~NFG~GSSRE~A~~a----l~~~Gi~aVIA~SFar--If~rN~in~Gllpl 100 (121) T cd01579 50 GFIVGGENYGQGSSREHAALA----PMYLGVRAVLAKSFAR--IHRANLINFGILPL 100 (121) T ss_pred EEEEECCCCCCCCCHHHHHHH----HHHCCCEEEEEHHHHH--HHHHHHHHCCCCEE T ss_conf 199966756789728999999----9983945999758999--99987887786208 No 246 >KOG0230 consensus Probab=35.04 E-value=17 Score=15.82 Aligned_cols=20 Identities=25% Similarity=0.205 Sum_probs=13.1 Q ss_pred HHHCCCCCCCCEECCHHHHH Q ss_conf 98608497342224689999 Q gi|254780419|r 528 AFQCGMPPHGGIAAGIDRIV 547 (622) Q Consensus 528 Al~yG~PPHgG~alGlDRlv 547 (622) +-.-|+|+|-=+++--||-. T Consensus 989 ~~~~~~~s~d~l~l~~~rr~ 1008 (1598) T KOG0230 989 EVDDGQLSHDILALNRDRRA 1008 (1598) T ss_pred CCCCCCCHHHHHHHHHHHHH T ss_conf 02457520447877777999 No 247 >PRK06293 single-stranded DNA-binding protein; Provisional Probab=34.89 E-value=28 Score=14.26 Aligned_cols=61 Identities=7% Similarity=-0.009 Sum_probs=39.1 Q ss_pred CCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEEEE Q ss_conf 798259999748815789985289885899999999758655588777865999988999950 Q gi|254780419|r 43 DHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANIITGQIELSAQKIEILS 105 (622) Q Consensus 43 D~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~~~~~~il~ 105 (622) |.+-.+.||+=.+ .+...++-|+.++-|.|+|.++.|.=...+ ..+-=-.||.|++++.|. T Consensus 40 ~et~fi~~v~W~k-~AE~~~~yl~KGs~V~VeGrl~trsyedkd-G~kry~tEVvad~~~fl~ 100 (161) T PRK06293 40 DETVWCRCNIWHN-RYDKMLPYLKKGSGVIVAGDISVESYMSKD-GSPQSSLVVSVDTLKFSP 100 (161) T ss_pred CCCEEEEEEEHHH-HHHHHHHHHCCCCEEEEEEEEEECCCCCCC-CCEEEEEEEEEEEEEECC T ss_conf 6865999997347-899999986478989999877734768789-988989999996789657 No 248 >TIGR01993 Pyr-5-nucltdase pyrimidine 5'-nucleotidase; InterPro: IPR010237 This family of proteins includes the SDT1/SSM1 gene from yeast, which has been shown to code for a pyrimidine (UMP/CMP) 5'nucleotidase. The family spans plants, fungi and bacteria. These enzymes are members of the haloacid dehalogenase (HAD) superfamily of hydrolases, specifically the IA subfamily.. Probab=34.59 E-value=29 Score=14.23 Aligned_cols=34 Identities=18% Similarity=0.160 Sum_probs=16.9 Q ss_pred CHHHHHHHHHHHC--CCCCCCEEEEECCCCCCCCHH Q ss_conf 5278999999818--875572699808854110013 Q gi|254780419|r 380 NIVNTEAIRVRLD--MQDGDSCFFIAGDPLKFCSFA 413 (622) Q Consensus 380 ~~~~~~~l~~~~~--~~~GD~i~f~a~~~~~~~~~l 413 (622) ....+.++..++. -|.|--++|.+++..=+.++| T Consensus 87 pdp~L~~~L~~LpqsGK~~Rk~iFTN~~~~Ha~r~l 122 (205) T TIGR01993 87 PDPELRNLLLRLPQSGKKGRKIIFTNGDRAHARRAL 122 (205) T ss_pred CCHHHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHH T ss_conf 898899999973412655556776158789999999 No 249 >KOG4079 consensus Probab=34.47 E-value=29 Score=14.22 Aligned_cols=57 Identities=30% Similarity=0.564 Sum_probs=28.5 Q ss_pred CCCCCCCCCCCCCHHHHHHCCCCHHHHH----HHHHCCCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEECCCCHHHHH Q ss_conf 5664543222221034431123854564----433105702331100110377766553800003565410268999999 Q gi|254780419|r 175 EGARDFLVPSRVNQGSFYALPQAPQQYK----QLLMASGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEMSFVEQDDIL 250 (622) Q Consensus 175 eGArdFlVPsR~~~g~fYALpQSPQlyK----QlLmv~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~EmsF~~~edvm 250 (622) +|||||+ |+..||- ||| |++|-..+ -.--++|||=|.+ +=.-+|++- .+.++|+ T Consensus 43 ~GARdFV---------fwNipQi--QykNP~VQ~~~~knm-tpsPF~R~YlddG--------r~vL~Dld~--~~r~eI~ 100 (169) T KOG4079 43 SGARDFV---------FWNIPQI--QYKNPKVQLVKHKNM-TPSPFARAYLDDG--------REVLFDLDG--MKREEIE 100 (169) T ss_pred CCCCCEE---------EECCHHH--CCCCCCEEEEEECCC-CCCHHHHHEECCC--------CEEEEECCC--CCHHHHH T ss_conf 6742068---------8553464--045871577751367-8886787401476--------258997455--5579999 Q ss_pred HHH Q ss_conf 999 Q gi|254780419|r 251 NTM 253 (622) Q Consensus 251 ~l~ 253 (622) +.+ T Consensus 101 ~hl 103 (169) T KOG4079 101 KHL 103 (169) T ss_pred HHH T ss_conf 999 No 250 >pfam10105 DUF2344 Uncharacterized protein conserved in bacteria (DUF2344). This domain, found in various hypothetical bacterial proteins and Radical Sam domain proteins, has no known function. Probab=34.32 E-value=24 Score=14.79 Aligned_cols=42 Identities=26% Similarity=0.355 Sum_probs=29.8 Q ss_pred EECCCCHHHHHHHHHHHHHHHHHHH--HCCCCCCCCCCCCCHHHHHH Q ss_conf 1026899999999999999999998--08723457732121898875 Q gi|254780419|r 240 EMSFVEQDDILNTMENVLRGVFEEF--SNGNTVSNNFLRICYDDAIR 284 (622) Q Consensus 240 EmsF~~~edvm~l~E~li~~i~~~v--~~~~~i~~pF~rmtY~eAm~ 284 (622) +|.|+++-|+|.++|..++..=--+ ..| =.|=|+|||..|+. T Consensus 11 ~~rflsHLDl~r~~eRa~RRa~lP~a~S~G---F~Phpkisf~~aLp 54 (186) T pfam10105 11 RMRFLSHLDLMRLFERALRRAGLPIAFSEG---FHPHPRISFALALP 54 (186) T ss_pred CEEEECHHHHHHHHHHHHHHCCCCHHHHCC---CCCCCCEEECCCCC T ss_conf 237722688999999999981897687379---89997274446755 No 251 >cd05707 S1_Rrp5_repeat_sc11 S1_Rrp5_repeat_sc11: Rrp5 is a trans-acting factor important for biogenesis of both the 40S and 60S eukaryotic ribosomal subunits. Rrp5 has two distinct regions, an N-terminal region containing tandemly repeated S1 RNA-binding domains (12 S1 repeats in Saccharomyces cerevisiae Rrp5 and 14 S1 repeats in Homo sapiens Rrp5) and a C-terminal region containing tetratricopeptide repeat (TPR) motifs thought to be involved in protein-protein interactions. Mutational studies have shown that each region represents a specific functional domain. Deletions within the S1-containing region inhibit pre-rRNA processing at either site A3 or A2, whereas deletions within the TPR region confer an inability to support cleavage of A0-A2. This CD includes S. cerevisiae S1 repeat 11 (sc11). Rrp5 is found in eukaryotes but not in prokaryotes or archaea. Probab=34.07 E-value=29 Score=14.17 Aligned_cols=54 Identities=19% Similarity=0.314 Sum_probs=31.2 Q ss_pred EEEEEECCCCCCCCEEEEEEECC-CCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEE Q ss_conf 99979800444698799999779-82599997488157899852898858999999997 Q gi|254780419|r 22 RLSGWVHRVRPHGGIIFLDIRDH-YGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMS 79 (622) Q Consensus 22 ~i~GwV~~~R~~g~l~F~~lrD~-sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~ 79 (622) ++.|+|.++.+.| +|++|-.+ .|.+++--=.+.-.-+..+.++.+.. |+|+|.. T Consensus 3 ~v~G~V~~i~~~G--~FV~l~~~~~glVhiselsd~fVkd~~~~f~vGq~--V~~kV~~ 57 (68) T cd05707 3 VVRGFVKNIANNG--VFVTLGRGVDARVRVSELSDSYLKDWKKRFKVGQL--VKGKIVS 57 (68) T ss_pred EEEEEEEEECCCC--EEEEECCCCEEEEEHHHHCHHHHCCHHHHCCCCCE--EEEEEEE T ss_conf 9899999803640--79991799679999599172356499891799999--9999999 No 252 >PRK02929 L-arabinose isomerase; Provisional Probab=33.81 E-value=30 Score=14.14 Aligned_cols=53 Identities=25% Similarity=0.291 Sum_probs=27.1 Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCCH Q ss_conf 1121245899999998752036514620332112464566454322222103443112385456443310570 Q gi|254780419|r 138 LHKNIVKRTRIINSMRCRMIAENFIECSTPILTASSPEGARDFLVPSRVNQGSFYALPQAPQQYKQLLMASGF 210 (622) Q Consensus 138 ~~~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTpeGArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G~ 210 (622) ..+.++-..++--++|.||.+.||.-..| +..-.+.|.|=|-|--|.||.-|+ T Consensus 249 ~~~sl~~~AriElgl~~FL~~gg~~AFTt--------------------~FEdL~gl~QLPGLAvQrLMa~GY 301 (499) T PRK02929 249 KRQSVREAARIELGLKRFLEQGGFTAFTT--------------------NFEDLHGLKQLPGLAVQRLMAQGY 301 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCC--------------------CHHHCCCCCCCCCHHHHHHHHCCC T ss_conf 78999999999999999998189572116--------------------553355853486267888886077 No 253 >PRK09283 delta-aminolevulinic acid dehydratase; Validated Probab=33.80 E-value=9.3 Score=17.74 Aligned_cols=23 Identities=30% Similarity=0.508 Sum_probs=18.0 Q ss_pred HHHHHHHHHHCCCEEEEECCCCCCC Q ss_conf 9999987520365146203321124 Q gi|254780419|r 148 IINSMRCRMIAENFIECSTPILTAS 172 (622) Q Consensus 148 i~~~~R~~l~~~gF~EveTP~L~ks 172 (622) -+.+||+.|+++||. +||||.=| T Consensus 169 rV~aIR~~Ld~~g~~--~~~ImSYs 191 (321) T PRK09283 169 RVGAIREALDAAGFI--DVPIMSYS 191 (321) T ss_pred HHHHHHHHHHHCCCC--CCEEEEHH T ss_conf 899999999977998--87053236 No 254 >KOG0351 consensus Probab=33.76 E-value=14 Score=16.53 Aligned_cols=31 Identities=26% Similarity=0.276 Sum_probs=17.3 Q ss_pred HHHCCCCCEEE-C--CCC-------CCCCHHHHHHHHHHHH Q ss_conf 78211232000-1--211-------1121245899999998 Q gi|254780419|r 123 SIRFKYRFLDL-R--RDT-------LHKNIVKRTRIINSMR 153 (622) Q Consensus 123 ~~rl~~R~LdL-R--~~~-------~~~~l~~Rs~i~~~~R 153 (622) +.|-.||.|-+ | .+. ..++-++|..|+..++ T Consensus 403 dFRp~Yk~l~~l~~~~~~vP~iALTATAT~~v~~DIi~~L~ 443 (941) T KOG0351 403 DFRPSYKRLGLLRIRFPGVPFIALTATATERVREDVIRSLG 443 (941) T ss_pred CCCHHHHHHHHHHHHCCCCCEEEEEHHCCHHHHHHHHHHHC T ss_conf 33477899999985278997687530020889999999827 No 255 >cd04824 eu_ALAD_PBGS_cysteine_rich Porphobilinogen synthase (PBGS), which is also called delta-aminolevulinic acid dehydratase (ALAD), catalyzes the condensation of two 5-aminolevulinic acid (ALA) molecules to form the pyrrole porphobilinogen (PBG), which is the second step in the biosynthesis of tetrapyrroles, such as heme, vitamin B12 and chlorophyll. This reaction involves the formation of a Schiff base link between the substrate and the enzyme. PBGSs are metalloenzymes, some of which have a second, allosteric metal binding site, beside the metal ion binding site in their active site. Although PBGS is a family of homologous enzymes, its metal ion utilization at catalytic site varies between zinc and magnesium and/or potassium. PBGS can be classified into two groups based on differences in their active site metal binding site. The eukaryotic PBGSs represented by this model, which contain a cysteine-rich zinc binding motif (DXCXCX(Y/F)X3G(H/Q)CG), require zinc for their activity, they Probab=33.10 E-value=11 Score=17.18 Aligned_cols=24 Identities=8% Similarity=0.246 Sum_probs=17.8 Q ss_pred HHHHHHHHHHCCCEEEEECCCCCCC Q ss_conf 9999987520365146203321124 Q gi|254780419|r 148 IINSMRCRMIAENFIECSTPILTAS 172 (622) Q Consensus 148 i~~~~R~~l~~~gF~EveTP~L~ks 172 (622) -+.+||+.|+++||.. ++|||.=| T Consensus 167 rV~aIR~~LD~~g~~~-~v~ImSYs 190 (320) T cd04824 167 RVRAIKQALIQAGLGN-KVSVMSYS 190 (320) T ss_pred HHHHHHHHHHHCCCCC-CCCEEEHH T ss_conf 8999999999878955-56310177 No 256 >PRK11798 ClpXP protease specificity-enhancing factor; Provisional Probab=32.90 E-value=24 Score=14.79 Aligned_cols=56 Identities=21% Similarity=0.406 Sum_probs=34.1 Q ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCC-CCCCCCCCCCCCCCCC-CCHHHHHHCCCCHHHHHHHHH Q ss_conf 458999999987520365146203321-1246456645432222-210344311238545644331 Q gi|254780419|r 143 VKRTRIINSMRCRMIAENFIECSTPIL-TASSPEGARDFLVPSR-VNQGSFYALPQAPQQYKQLLM 206 (622) Q Consensus 143 ~~Rs~i~~~~R~~l~~~gF~EveTP~L-~ksTpeGArdFlVPsR-~~~g~fYALpQSPQlyKQlLm 206 (622) -.|.-+++|+.+|..++|+ ||.| ...+-+|- .||.. +..|+- -|.-||+--.+|.| T Consensus 6 s~rPYLiRA~yeW~~Dn~~----TPyi~V~a~~~~v---~VP~~~v~dg~I-vLNIsp~Av~~L~i 63 (140) T PRK11798 6 STRPYLLRALYEWLVDNGL----TPHLLVDATYPGV---QVPMEYVKDGQI-VLNISPRAVGNLQL 63 (140) T ss_pred CCCCHHHHHHHHHHHHCCC----CCEEEEEECCCCC---CCCHHHCCCCEE-EEECCHHHHHCEEE T ss_conf 8842689999999972899----8549998179996---689888019989-99779888603077 No 257 >CHL00181 cbbX CbbX; Provisional Probab=32.80 E-value=30 Score=14.07 Aligned_cols=57 Identities=19% Similarity=0.195 Sum_probs=37.2 Q ss_pred HCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEEE Q ss_conf 32527899999981887557269980885411001347889998773003557510288960770535 Q gi|254780419|r 378 HLNIVNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYEW 445 (622) Q Consensus 378 ~l~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe~ 445 (622) .+..+....|...+....++++++.||.+......+. .--+|- =+|.|..+||-|.. T Consensus 142 dfg~eaidtLl~~me~~~~~lvvI~AGY~~eM~~fl~-------~NpGL~----sRf~~~i~F~dYt~ 198 (287) T CHL00181 142 DYGAEAIEILLQVMENQRDDLVVIFAGYKDRMDKFYE-------SNPGLS----SRVANHVDFPDYTP 198 (287) T ss_pred CCHHHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHH-------HCCCHH----HHCCCEEECCCCCH T ss_conf 8379999999999870799889998467899999998-------590478----76887237798599 No 258 >cd01577 IPMI_Swivel Aconatase-like swivel domain of 3-isopropylmalate dehydratase and related uncharacterized proteins. 3-isopropylmalate dehydratase catalyzes the isomerization between 2-isopropylmalate and 3-isopropylmalate, via the formation of 2-isopropylmaleate 3-isopropylmalate. IPMI is involved in fungal and bacterial leucine biosynthesis and is also found in eukaryotes. This is the aconitase-like swivel domain, which is believed to undergo swivelling conformational change in the enzyme mechanism. Probab=32.73 E-value=18 Score=15.76 Aligned_cols=50 Identities=22% Similarity=0.397 Sum_probs=28.5 Q ss_pred CCEEECCEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHH--HHHHHHHCCCCCC Q ss_conf 677775677212512315699999999985999899986677--8999986084973 Q gi|254780419|r 482 YDLVCNGFEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFG--GLYRAFQCGMPPH 536 (622) Q Consensus 482 YDLVlNG~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg--~lL~Al~yG~PPH 536 (622) .++++-|---|.||-|.|-+--. +..|+.-=- -+.|+ |+=++++-|.+|- T Consensus 18 ~~Ilvag~NFG~GSSREhA~~al----~~~Gi~~VI-a~SFa~If~~N~~n~GlLpi 69 (91) T cd01577 18 GDIIVAGKNFGCGSSREHAPWAL----KDAGIRAVI-AESFARIFFRNAINNGLLPV 69 (91) T ss_pred CCEEEECCCCCCCCCHHHHHHHH----HHCCCEEEE-EHHHHHHHHHHHHHCCCCEE T ss_conf 46899568657785289999999----984954999-72399999998887785128 No 259 >cd05145 RIO1_like RIO kinase family; RIO1, RIO3 and similar proteins, catalytic domain. The RIO kinase catalytic domain family is part of a larger superfamily, that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase (PI3K). RIO kinases are atypical protein serine kinases containing a kinase catalytic signature, but otherwise show very little sequence similarity to typical PKs. Serine kinases catalyze the transfer of the gamma-phosphoryl group from ATP to serine residues in protein substrates. The RIO catalytic domain is truncated compared to the catalytic domains of typical PKs, with deletions of the loops responsible for substrate binding. RIO1 is present in archaea, bacteria and eukaryotes. In addition, RIO3 is present in multicellular eukaryotes. RIO1 is essential for survival and is required for 18S rRNA processing, proper cell cycle pro Probab=32.62 E-value=31 Score=14.01 Aligned_cols=28 Identities=18% Similarity=0.211 Sum_probs=17.1 Q ss_pred EEEEECCCEEECCCCCCCCHHHCCCCCCCCCHHHHHCCCHHHH Q ss_conf 8896077053545547000211002476643065541887774 Q gi|254780419|r 435 CWIMDFPFYEWNEEEKKIDFAHNPFTMPQGGMESLKGNDLLSI 477 (622) Q Consensus 435 lWV~DFPLFe~dee~~r~~s~HHPFTaP~~~~~~l~~~dp~~v 477 (622) +||+||| ++.++.||- -.+++.-|...+ T Consensus 156 ~~iID~p---------QaV~~~hp~------A~e~L~RDi~ni 183 (190) T cd05145 156 PYIIDVS---------QAVELDHPN------ALEFLRRDIRNI 183 (190) T ss_pred EEEEECC---------CCCCCCCCC------HHHHHHHHHHHH T ss_conf 7999557---------612578998------899999999999 No 260 >cd01938 ADPGK_ADPPFK ADP-dependent glucokinase (ADPGK) and phosphofructokinase (ADPPFK). ADPGK and ADPPFK are proteins that rely on ADP rather than ATP to donate a phosphoryl group. They are found in certain hyperthermophilic archaea and in higher eukaryotes. A functional ADPGK has been characterized in mouse and is assumed to be desirable during ischemia/hypoxia. ADPGK and ADPPFK contain a large and a small domain with the binding site located in a groove between the domains. Partial domain closing is seen when ADP is bound, and further domain closing is observed when glucose is also bound. The oligomerization state apparently varies depending on the species, with some existing as monomers, some as dimers, and some as tetramers. Probab=32.40 E-value=31 Score=13.99 Aligned_cols=14 Identities=14% Similarity=0.387 Sum_probs=9.1 Q ss_pred EEEEEEEEEEEECC Q ss_conf 99998899995021 Q gi|254780419|r 94 IELSAQKIEILSVA 107 (622) Q Consensus 94 ~Ei~~~~~~il~~a 107 (622) +.+.++.++.|+.- T Consensus 27 ~d~~vdai~~L~~~ 40 (445) T cd01938 27 YDLNIDAVKLLDRE 40 (445) T ss_pred EEEEEEHHHHHHHC T ss_conf 76446099886552 No 261 >cd05147 RIO1_euk RIO kinase family; eukaryotic RIO1, catalytic domain. The RIO kinase catalytic domain family is part of a larger superfamily, that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase (PI3K). RIO kinases are atypical protein serine kinases containing a kinase catalytic signature, but otherwise show very little sequence similarity to typical PKs. Serine kinases catalyze the transfer of the gamma-phosphoryl group from ATP to serine residues in protein substrates. The RIO catalytic domain is truncated compared to the catalytic domains of typical PKs, with deletions of the loops responsible for substrate binding. RIO1 is present in archaea, bacteria and eukaryotes. This subfamily is composed of RIO1 proteins from eukaryotes. RIO1 is essential for survival and is required for 18S rRNA processing, proper cell cycle progression and c Probab=32.34 E-value=31 Score=13.98 Aligned_cols=28 Identities=14% Similarity=0.215 Sum_probs=16.5 Q ss_pred EEEEECCCEEECCCCCCCCHHHCCCCCCCCCHHHHHCCCHHHH Q ss_conf 8896077053545547000211002476643065541887774 Q gi|254780419|r 435 CWIMDFPFYEWNEEEKKIDFAHNPFTMPQGGMESLKGNDLLSI 477 (622) Q Consensus 435 lWV~DFPLFe~dee~~r~~s~HHPFTaP~~~~~~l~~~dp~~v 477 (622) +||+||| ++.++.||- -..++.-|...| T Consensus 156 ~~iID~p---------QaV~~~hpn------A~~~L~RDv~ni 183 (190) T cd05147 156 LYIIDVS---------QSVEHDHPH------ALEFLRMDCVNV 183 (190) T ss_pred EEEEECC---------CCCCCCCCC------HHHHHHHHHHHH T ss_conf 8999666---------411478978------899999999899 No 262 >KOG0142 consensus Probab=32.31 E-value=18 Score=15.65 Aligned_cols=72 Identities=24% Similarity=0.273 Sum_probs=46.3 Q ss_pred CHHHHHHHHHHCCCCHHHHH-HHHHHHHHHHHCCCCCCCCEECCHHHHHHHHCCCCCCEEEECCCCCCCCCCCCCCCCCC Q ss_conf 69999999998599989998-66778999986084973422246899999983739840321378999871034688672 Q gi|254780419|r 500 MSEVMLQAFSNVGISKEVVE-NRFGGLYRAFQCGMPPHGGIAAGIDRIVMLLLGAKNVREVSLFPMSQNFCDLLMGSPST 578 (622) Q Consensus 500 d~~iQ~~~f~~lgl~~ee~~-~~Fg~lL~Al~yG~PPHgG~alGlDRlvmll~g~~sIRdVIaFPKt~~g~Dlm~~aPs~ 578 (622) +...|++.+-++||..+++. ++|. ||.-.-|-||+.|-|| +.==|-|.|=+. -..+.--|+. T Consensus 106 r~AAqRkL~~ELGIp~e~v~pee~~-~ltrihYkA~sdg~wG-------------EhEIDYiL~~~~---~~~lnPnpnE 168 (225) T KOG0142 106 RRAAQRKLKAELGIPLEEVPPEEFN-FLTRIHYKAPSDGIWG-------------EHEIDYILFLVK---DVTLNPNPNE 168 (225) T ss_pred HHHHHHHHHHHHCCCCCCCCHHHCE-EHEEEEEECCCCCCCC-------------CCEEEEEEEEEC---CCCCCCCHHH T ss_conf 9999998777619972306878810-0114576157888754-------------001347999843---6788978445 Q ss_pred ------CCHHHHHHCC Q ss_conf ------8988998708 Q gi|254780419|r 579 ------VSVEQLRELG 588 (622) Q Consensus 579 ------v~~~qL~e~~ 588 (622) |+.+.|+|+- T Consensus 169 v~e~ryvs~eelkel~ 184 (225) T KOG0142 169 VSEIRYVSREELKELV 184 (225) T ss_pred HHHHHEECHHHHHHHH T ss_conf 5572112699999998 No 263 >pfam02610 Arabinose_Isome L-arabinose isomerase. This is a family of L-arabinose isomerases, AraA, EC:5.3.1.4. These enzymes catalyse the reaction: L-arabinose <= L-ribulose. This reaction is the first step in the pathway of L-arabinose utilisation as a carbon source after entering the cell L-arabinose is converted into L-ribulose by the L-arabinose isomerases enzyme. Probab=32.00 E-value=32 Score=13.94 Aligned_cols=13 Identities=15% Similarity=0.082 Sum_probs=5.7 Q ss_pred HHHHHHHHHHHHC Q ss_conf 4788999877300 Q gi|254780419|r 414 ADARNHIAQELGI 426 (622) Q Consensus 414 g~lR~~ia~~l~l 426 (622) -.+|.+++-+-.| T Consensus 255 ~~AriElglr~FL 267 (359) T pfam02610 255 EQARIELGLRRFL 267 (359) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999999 No 264 >COG3655 Predicted transcriptional regulator [Transcription] Probab=31.16 E-value=32 Score=13.85 Aligned_cols=31 Identities=23% Similarity=0.320 Sum_probs=25.3 Q ss_pred HHHCCHHHHHHHHHHHCCCCCCCEEEEECCC Q ss_conf 5532527899999981887557269980885 Q gi|254780419|r 376 AKHLNIVNTEAIRVRLDMQDGDSCFFIAGDP 406 (622) Q Consensus 376 ~K~l~~~~~~~l~~~~~~~~GD~i~f~a~~~ 406 (622) +|.+......+|++.++-++||++-|.++.. T Consensus 40 ~k~I~~~tL~~iC~~LeCqpgDiley~~d~~ 70 (73) T COG3655 40 VKAIRLSTLEKICKALECQPGDILEYVPDSD 70 (73) T ss_pred CCEEEHHHHHHHHHHCCCCHHHEEEEECCCC T ss_conf 6604489999999983898405457716875 No 265 >KOG3567 consensus Probab=31.09 E-value=19 Score=15.49 Aligned_cols=57 Identities=21% Similarity=0.258 Sum_probs=34.7 Q ss_pred CCHHHHHH----CCCCHHHH---HHHHHCCCHHHHEEEHHHHCCCCCCCCCCC----CCEEEEEEEECCC Q ss_conf 21034431----12385456---443310570233110011037776655380----0003565410268 Q gi|254780419|r 186 VNQGSFYA----LPQAPQQY---KQLLMASGFDRYFQIAPCFRDEDPRSDRLP----GEFYQLDMEMSFV 244 (622) Q Consensus 186 ~~~g~fYA----LpQSPQly---KQlLmv~G~dryfqiarcFRdEd~R~dRqp----~EFTqlD~EmsF~ 244 (622) .+.|.+|- -||+||+| -+++||+ +|=-+=+|||=+-..|.+++. +|=-+-++.|.|. T Consensus 256 ~~~g~w~~ig~r~Pq~pqlf~~v~~~~~ia--dgD~~~vrC~~~s~~~~rtt~~g~~~~Demcnfymmy~ 323 (501) T KOG3567 256 KHEGHWTLIGRRDPQLPQLFEPVNHIVCVA--DGDNQRVRCFFQSLGRNRTTLIGGTPEDEMCNFYMMYY 323 (501) T ss_pred CCCCCEEECCCCCCCCHHHHCCCCCCEEEE--CCCCEEEEEEECCCCCCCEEECCCCCCCCCCCEEEEEE T ss_conf 157603353666887033204778620330--37731789997458867423148876320012589998 No 266 >TIGR02538 type_IV_pilB type IV-A pilus assembly ATPase PilB; InterPro: IPR013374 This model describes a protein involved in type IV pilus biogenesis designated PilB in Pseudomonas aeruginosa but PilF in Neisseria gonorrhoeae; the more common usage, reflected here, is PilB. This protein is an ATPase involved in protein export for pilin assembly, and is closely related to GspE (IPR013369 from INTERPRO) of type II secretion systems (also referred to as the main terminal branch of the general secretion pathway). Note that type IV pilus systems are often divided into type IV-A and IV-B, with the latter group including bundle-forming pilus, mannose-sensitive hemagglutinin, etc. Members of this family are found in type IV-A systems.; GO: 0005524 ATP binding, 0008565 protein transporter activity, 0009297 pilus biogenesis. Probab=30.57 E-value=32 Score=13.88 Aligned_cols=34 Identities=18% Similarity=0.182 Sum_probs=21.3 Q ss_pred EECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH--CCC Q ss_conf 23156999999999859998999866778999986--084 Q gi|254780419|r 496 IRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQ--CGM 533 (622) Q Consensus 496 iRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~--yG~ 533 (622) .||=|+.=-+-=++.||+.|++ +..||+|++ ||| T Consensus 293 LRiLDsS~a~Ldi~~LGFeP~Q----k~~fL~Ai~kPqGM 328 (577) T TIGR02538 293 LRILDSSAAKLDIDKLGFEPDQ----KALFLEAIHKPQGM 328 (577) T ss_pred EEEECHHHCCCCCHHHCCCHHH----HHHHHHHHHCCCCC T ss_conf 7765522122674220688889----99999997079972 No 267 >PRK13411 molecular chaperone DnaK; Provisional Probab=29.72 E-value=32 Score=13.92 Aligned_cols=31 Identities=10% Similarity=0.069 Sum_probs=18.1 Q ss_pred ECCEEECCCCEECCCHHHHHHHHHH-------CCCCHHHH Q ss_conf 7567721251231569999999998-------59998999 Q gi|254780419|r 486 CNGFEIASGGIRNHMSEVMLQAFSN-------VGISKEVV 518 (622) Q Consensus 486 lNG~EiggGSiRIHd~~iQ~~~f~~-------lgl~~ee~ 518 (622) ++.+.|-|||-|| |-+|+.+-+. .+++++++ T Consensus 328 Id~ViLVGGSTRI--P~Vq~~v~~~F~gk~~~~~iNPDEa 365 (655) T PRK13411 328 IDRIILVGGSTRI--PAVQEAIQQFFGGKQPDRSVNPDEA 365 (655) T ss_pred EEEEEEECCCCCC--HHHHHHHHHHHCCCCCCCCCCCHHH T ss_conf 7899998881465--7999999998289988889782168 No 268 >cd05146 RIO3_euk RIO kinase family; eukaryotic RIO3, catalytic domain. The RIO kinase catalytic domain family is part of a larger superfamily, that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase (PI3K). RIO kinases are atypical protein serine kinases containing a kinase catalytic signature, but otherwise show very little sequence similarity to typical PKs. Serine kinases catalyze the transfer of the gamma-phosphoryl group from ATP to serine residues in protein substrates. The RIO catalytic domain is truncated compared to the catalytic domains of typical PKs, with deletions of the loops responsible for substrate binding. RIO3 is present only in multicellular eukaryotes. Its function is still unknown. Probab=29.55 E-value=34 Score=13.66 Aligned_cols=29 Identities=14% Similarity=0.161 Sum_probs=16.1 Q ss_pred EEEEEECCCEEECCCCCCCCHHHCCCCCCCCCHHHHHCCCHHHH Q ss_conf 28896077053545547000211002476643065541887774 Q gi|254780419|r 434 FCWIMDFPFYEWNEEEKKIDFAHNPFTMPQGGMESLKGNDLLSI 477 (622) Q Consensus 434 flWV~DFPLFe~dee~~r~~s~HHPFTaP~~~~~~l~~~dp~~v 477 (622) =+||+||| +..++.|| .-.+++.-|...| T Consensus 162 ~~~iID~~---------QaV~~~hp------~a~e~L~RDi~ni 190 (197) T cd05146 162 KVWFIDVS---------QSVEPTHP------HGLEFLFRDCRNV 190 (197) T ss_pred CEEEEECC---------CCCCCCCC------CHHHHHHHHHHHH T ss_conf 87999677---------40157897------8899999999899 No 269 >cd04246 AAK_AK-DapG-like AAK_AK-DapG-like: Amino Acid Kinase Superfamily (AAK), AK-DapG-like; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the diaminopimelate-sensitive aspartokinase isoenzyme AKI (DapG), a monofunctional enzymes found in Bacilli (Bacillus subtilis 168), Clostridia, and Actinobacteria bacterial species, as well as, the catalytic AK domain of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis 168, the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, and related isoenzymes. In Bacillus subtilis, the regulation of the diaminopimelate-lysine biosynthetic pathway involves dual control by diaminopimelate and lysine, effected through separate diaminopimelate- and lysine-sensitive aspartokinase isoenzymes. The role of the AKI isoenzyme is most likely to provide a constant level of aspartyl-beta-phosphate for the biosynthesis of diaminopimelate for peptidoglycan synthesis and dipicolinate during sporulati Probab=29.24 E-value=28 Score=14.30 Aligned_cols=12 Identities=25% Similarity=0.517 Sum_probs=5.8 Q ss_pred CCCCCCHHHHHH Q ss_conf 732121898875 Q gi|254780419|r 273 NFLRICYDDAIR 284 (622) Q Consensus 273 pF~rmtY~eAm~ 284 (622) ++++|||.|||+ T Consensus 189 ~i~~lsy~EA~e 200 (239) T cd04246 189 KLDVISYDEMLE 200 (239) T ss_pred CCCCCCHHHHHH T ss_conf 727527999999 No 270 >COG1200 RecG RecG-like helicase [DNA replication, recombination, and repair / Transcription] Probab=29.18 E-value=35 Score=13.62 Aligned_cols=79 Identities=15% Similarity=0.145 Sum_probs=53.9 Q ss_pred HHCCCEEEEEEEECCC--C-CCC-CEEEEEEECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCC Q ss_conf 9989889999798004--4-469-87999997798259999748815789985289885899999999758655588777 Q gi|254780419|r 15 SDVGSFVRLSGWVHRV--R-PHG-GIIFLDIRDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSRSAETINANII 90 (622) Q Consensus 15 ~~~g~~V~i~GwV~~~--R-~~g-~l~F~~lrD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r~~~~~n~~~~ 90 (622) ...|+.|++.|-|.+. . ..+ +..=+.+.|++|.+-+++-.-.. -..+.++.+..|.|.|+|..-. T Consensus 57 ~~~g~~vti~g~V~~~~~~~~~~~~~l~v~~~d~~~~l~l~fFn~~~--~l~~~~~~G~~v~v~Gk~~~~~--------- 125 (677) T COG1200 57 ARPGEIVTIEGTVLSHEKFPFGKRKLLKVTLSDGTGVLTLVFFNFPA--YLKKKLKVGERVIVYGKVKRFK--------- 125 (677) T ss_pred CCCCCEEEEEEEEEEEECCCCCCCCEEEEEEECCCEEEEEEEECCCH--HHHHHCCCCCEEEEEEEEEECC--------- T ss_conf 37786699999997640257788734999996296899999978668--8884187998899999895036--------- Q ss_pred CCEEEEEEEEEEEEE Q ss_conf 865999988999950 Q gi|254780419|r 91 TGQIELSAQKIEILS 105 (622) Q Consensus 91 tG~~Ei~~~~~~il~ 105 (622) |..++.--++.++. T Consensus 126 -~~~~~~hpe~~~~~ 139 (677) T COG1200 126 -GGLQITHPEYIVND 139 (677) T ss_pred -CCEEEECCEEEECC T ss_conf -83588766678327 No 271 >TIGR02881 spore_V_K stage V sporulation protein K; InterPro: IPR014232 Proteins in this entry include the stage V sporulation protein K (SpoVK), a close homologue of the Rubisco expression protein CbbX (IPR000470 from INTERPRO), and are members of an ATPase family associated with various cellular activities. These proteins are strictly limited to bacterial endospore-forming species, but are not found universally among members of this group; they are missing from the Clostridium species.. Probab=28.63 E-value=36 Score=13.55 Aligned_cols=69 Identities=22% Similarity=0.289 Sum_probs=50.3 Q ss_pred ECCCCCCCCCCCHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCE-EEEEEEECC Q ss_conf 0477753335702553252789999998188755726998088541100134788999877300355751-028896077 Q gi|254780419|r 363 LDESGNVAGMGPVAKHLNIVNTEAIRVRLDMQDGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCF-EFCWIMDFP 441 (622) Q Consensus 363 ~~~~~~~~~~spi~K~l~~~~~~~l~~~~~~~~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~-~flWV~DFP 441 (622) +||..++.--| .|.|.-|....|...++-+.+++|++.||.+..-...|. .+|+.- +|-.-.||| T Consensus 111 iDEAYSLaRGG--EKDFGKEAIDtLVK~mEd~~~~lvlILAGY~~EM~yFL~------------~NPGL~SRFPi~i~FP 176 (261) T TIGR02881 111 IDEAYSLARGG--EKDFGKEAIDTLVKAMEDQRNELVLILAGYSDEMDYFLS------------LNPGLRSRFPISIDFP 176 (261) T ss_pred HHHHHHHHCCC--CCCCCCHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHH------------CCCCCCCCCCCEEECC T ss_conf 57777761488--887662088899998761569868997087689999862------------0779777665054188 Q ss_pred CEEE Q ss_conf 0535 Q gi|254780419|r 442 FYEW 445 (622) Q Consensus 442 LFe~ 445 (622) =|+- T Consensus 177 dY~~ 180 (261) T TIGR02881 177 DYTV 180 (261) T ss_pred CCCH T ss_conf 9988 No 272 >TIGR02095 glgA glycogen/starch synthases, ADP-glucose type; InterPro: IPR011835 This family consists of glycogen (or starch) synthases that use ADP-glucose (2.4.1.21 from EC), rather than UDP-glucose (2.4.1.11 from EC) as in animals, as the glucose donor. This enzyme is found in bacteria and plants. Whether the name given is glycogen synthase or starch synthase depends on context, and therefore on substrate.; GO: 0009011 starch synthase activity, 0009250 glucan biosynthetic process. Probab=28.41 E-value=16 Score=15.98 Aligned_cols=15 Identities=40% Similarity=0.736 Sum_probs=12.0 Q ss_pred HHHHHCCCCC----CCCEE Q ss_conf 9998608497----34222 Q gi|254780419|r 526 YRAFQCGMPP----HGGIA 540 (622) Q Consensus 526 L~Al~yG~PP----HgG~a 540 (622) |-|++|||+| .||++ T Consensus 422 l~amRYGt~PiVr~tGGL~ 440 (517) T TIGR02095 422 LYAMRYGTVPIVRRTGGLA 440 (517) T ss_pred HHHHHCCCCEEEECCCCCC T ss_conf 9897349953871588952 No 273 >TIGR00408 proS_fam_I prolyl-tRNA synthetase; InterPro: IPR004499 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Prolyl-tRNA synthetase is a class II tRNA synthetase and is recognized by IPR002314 from INTERPRO, which recognises tRNA synthetases for Gly, His, Ser, and Pro. The prolyl-tRNA synthetases are divided into two widely divergent families. This family includes the archaeal enzyme, the Pro-specific domain of a human multifunctional tRNA ligase, and the enzyme from the spirochete Borrelia burgdorferi (Lyme desease spirochete). The other family, IPR004500 from INTERPRO, includes enzymes from Escherichia coli, Bacillus subtilis, Synechocystis sp. (strain PCC6803), and one of the two prolyl-tRNA synthetases of Saccharomyces cerevisiae (Baker's yeast).; GO: 0000166 nucleotide binding, 0004827 proline-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006433 prolyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=27.94 E-value=37 Score=13.47 Aligned_cols=121 Identities=19% Similarity=0.243 Sum_probs=65.0 Q ss_pred CCHHHHHHHHHHHHHHHHCC-------------------CEEEEECCCCCCCCC---CCC-----C--CCCCCCCCCHHH Q ss_conf 21245899999998752036-------------------514620332112464---566-----4--543222221034 Q gi|254780419|r 140 KNIVKRTRIINSMRCRMIAE-------------------NFIECSTPILTASSP---EGA-----R--DFLVPSRVNQGS 190 (622) Q Consensus 140 ~~l~~Rs~i~~~~R~~l~~~-------------------gF~EveTP~L~ksTp---eGA-----r--dFlVPsR~~~g~ 190 (622) .-+.+|..+-+.+|+-+... |--||-=|+|..++- |+- . -|+| .+-|. T Consensus 37 ~g~~iw~~iq~~~~~~~~~~~~~~~~~L~~F~P~~~~~Y~h~~v~fP~L~p~~~L~KEk~HikGF~pEv~wi---~~~G~ 113 (533) T TIGR00408 37 YGFKIWENIQKILRDKLDEIKLKDKVILKKFIPIESLPYGHEEVYFPMLIPESELEKEKDHIKGFEPEVYWI---THGGL 113 (533) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHEECCCCCCHHHHHHHCCCCCCCCCHHHHE---ECCCC T ss_conf 877899999999999998764354676640567777777700100564452557863213568858400210---10377 Q ss_pred H-----HHC-CCCH----HHH-HHHHHCCCHH-HHEEEHHHHCCCCCCCC---CCCCCEE-EEEEEECCCCHHHHHHHHH Q ss_conf 4-----311-2385----456-4433105702-33110011037776655---3800003-5654102689999999999 Q gi|254780419|r 191 F-----YAL-PQAP----QQY-KQLLMASGFD-RYFQIAPCFRDEDPRSD---RLPGEFY-QLDMEMSFVEQDDILNTME 254 (622) Q Consensus 191 f-----YAL-pQSP----Qly-KQlLmv~G~d-ryfqiarcFRdEd~R~d---Rqp~EFT-qlD~EmsF~~~edvm~l~E 254 (622) - -|| |.|= -+| ||..--..+. ||||++-+||.|.=.+. |. .||+ ==+-=.+|+|.++--.-+. T Consensus 114 ~~L~~pLaLRPTSEt~~y~m~~kWv~sy~DLPlk~nQw~~vfR~E~kHTRPflR~-rEf~twqE~HT~H~t~~eAee~v~ 192 (533) T TIGR00408 114 KKLDEPLALRPTSETAMYNMYKKWVKSYRDLPLKINQWVNVFRYEKKHTRPFLRT-REFLTWQEAHTAHATAEEAEEEVL 192 (533) T ss_pred CCCCCCCEECCCCHHHHHHHHHHHHHHCCCCCEEEEEEEEEEECCCCCCCCCHHH-HHHHCCCCCCCCCCCHHHHHHHHH T ss_conf 7246750125861788876578785121588768984358898021787861023-332101000131437778899999 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999980 Q gi|254780419|r 255 NVLRGVFEEFS 265 (622) Q Consensus 255 ~li~~i~~~v~ 265 (622) ..+ .+.+++. T Consensus 193 ~~~-~~Yk~f~ 202 (533) T TIGR00408 193 KAL-DLYKEFI 202 (533) T ss_pred HHH-HHHHHHH T ss_conf 999-9889997 No 274 >pfam10451 Stn1 Telomere regulation protein Stn1. The budding yeast protein Stn1 is a DNA-binding protein which has specificity for telomeric DNA. Structural profiling has predicted an OB-fold. Probab=27.90 E-value=37 Score=13.47 Aligned_cols=73 Identities=18% Similarity=0.191 Sum_probs=46.7 Q ss_pred CEEEEEEEECCCC---CCC-CEEEEEEECCCC--EEEEEECCCCHHHHH---HHCCCCCCEEEEEEEEEECCCCCCCCCC Q ss_conf 8899997980044---469-879999977982--599997488157899---8528988589999999975865558877 Q gi|254780419|r 19 SFVRLSGWVHRVR---PHG-GIIFLDIRDHYG--ITQVVANPDSACFEI---AKSVRSEWVLCIDGLVMSRSAETINANI 89 (622) Q Consensus 19 ~~V~i~GwV~~~R---~~g-~l~F~~lrD~sG--~~Q~v~~~~~~~~~~---~~~l~~esvv~v~G~V~~r~~~~~n~~~ 89 (622) ..+.|.|-|--.| ..| .-+|+.|-|.|+ +++|-|+++... +. +..+ .++-+.|.|.+-. T Consensus 64 ~~IkviGcViG~kwk~Ig~~Dy~f~~lDDcS~~~~L~CKCsk~~ll-s~g~~i~~~-iG~TL~v~G~~n~---------- 131 (444) T pfam10451 64 RRIKVIGCVIGWKWKWIGGEDYAFIHIDDCSDSSILWCKCSKSQLL-SLGINITAG-IGRTLAVSGYMNL---------- 131 (444) T ss_pred EEEEEEEEEEEEEEEEECCCCEEEEEECCCCCCCEEEEEECHHHHH-HCCCCCCCC-CCCEEEEEEECCC---------- T ss_conf 3789999997668999758756999952678873389884688897-538860134-5746999860265---------- Q ss_pred CCCEEEEEEEEEEEEE Q ss_conf 7865999988999950 Q gi|254780419|r 90 ITGQIELSAQKIEILS 105 (622) Q Consensus 90 ~tG~~Ei~~~~~~il~ 105 (622) -..|+.|+.+++.. T Consensus 132 --~~~El~V~~ie~~~ 145 (444) T pfam10451 132 --EYLELVVTFIEYVV 145 (444) T ss_pred --EEEEEEEEEEEECC T ss_conf --14579999987144 No 275 >COG2844 GlnD UTP:GlnB (protein PII) uridylyltransferase [Posttranslational modification, protein turnover, chaperones] Probab=27.89 E-value=26 Score=14.57 Aligned_cols=83 Identities=25% Similarity=0.386 Sum_probs=36.7 Q ss_pred CCCCCCCCCCCCCC---CCCCHH--HHHHCCC----CHHHHHHHHHCCCHHHHEEEHHH------HCCCC-CCCCCCCCC Q ss_conf 12464566454322---222103--4431123----85456443310570233110011------03777-665538000 Q gi|254780419|r 170 TASSPEGARDFLVP---SRVNQG--SFYALPQ----APQQYKQLLMASGFDRYFQIAPC------FRDED-PRSDRLPGE 233 (622) Q Consensus 170 ~ksTpeGArdFlVP---sR~~~g--~fYALpQ----SPQlyKQlLmv~G~dryfqiarc------FRdEd-~R~dRqp~E 233 (622) .|++|||=||+-.= ++-|.| ..+.|-. |+|=||++. .+.+--+- .|| =|.|| +--|+|| | T Consensus 199 iK~gpGGLRDlhtL~WiA~~~~~~~~~~~l~~~g~lt~~E~~~l~--~~~~fL~~-VR~~LH~~~~R~edrLlFd~Q~-e 274 (867) T COG2844 199 IKSGPGGLRDLHTLLWIALRHFGATSLEELVKLGFLTRAEYKELD--EAYDFLWR-VRFALHLVAGRAEDRLLFDRQP-E 274 (867) T ss_pred CCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHCCCCCHHHHHHHH--HHHHHHHH-HHHHHHHHHCCHHHHCCHHHHH-H T ss_conf 677998502899999999986057888999743777589999999--99999999-9999999845514326766569-9 Q ss_pred EEEEEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 0356541026899999999999999999 Q gi|254780419|r 234 FYQLDMEMSFVEQDDILNTMENVLRGVF 261 (622) Q Consensus 234 FTqlD~EmsF~~~edvm~l~E~li~~i~ 261 (622) --+ -|.|.+.. -..+|.|+++-| T Consensus 275 VA~---~lGy~~~~--~~~vE~~Mk~yf 297 (867) T COG2844 275 VAE---ALGYEGRG--NRAVERMMKAYF 297 (867) T ss_pred HHH---HCCCCCCC--CHHHHHHHHHHH T ss_conf 999---82987887--439999999999 No 276 >pfam12109 CXCR4_N CXCR4 Chemokine receptor N terminal. CXCR4 and its ligand stromal cell-derived factor-1 (a.k.a. CXCL12) are essential for proper fetal development. CXCR4 is also the major coreceptor for T-tropic strains of human immunodeficiency virus 1 (HIV-1), and SDF-1 inhibits HIV-1 infection. Additionally, SDF-1 and CXCR4 mediate cancer cell migration and metastasis. The N terminal domain of most chemokine receptors is the ligand binding domain and so the N terminal domain of CXCR4 is the binding site for SDF-1. Probab=27.88 E-value=21 Score=15.19 Aligned_cols=11 Identities=45% Similarity=1.035 Sum_probs=7.0 Q ss_pred HHHHCCCCCCC Q ss_conf 01103777665 Q gi|254780419|r 217 APCFRDEDPRS 227 (622) Q Consensus 217 arcFRdEd~R~ 227 (622) -||||.|+..- T Consensus 21 EPCf~~eNa~F 31 (33) T pfam12109 21 EPCFRHENADF 31 (33) T ss_pred CCCCCCCCCCC T ss_conf 74101134543 No 277 >TIGR00388 glyQ glycyl-tRNA synthetase, alpha subunit; InterPro: IPR002310 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . In eubacteria, glycyl-tRNA synthetase (6.1.1.14 from EC) is an alpha2/beta2 tetramer composed of 2 different subunits , , . In some eubacteria, in archaea and eukaryota, glycyl-tRNA synthetase is an alpha2 dimer (see IPR002315 from INTERPRO). It belongs to class IIc and is one of the most complex synthetases. What is most interesting is the lack of similarity between the two types: divergence at the sequence level is so great that it is impossible to infer descent from common genes. The alpha and beta subunits (see IPR002311 from INTERPRO) also lack significant sequence similarity. However, they are translated from a single mRNA , and a single chain glycyl-tRNA synthetase from Chlamydia trachomatis has been found to have significant similarity with both domains, suggesting divergence from a single polypeptide chain .; GO: 0000166 nucleotide binding, 0004820 glycine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006426 glycyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=27.50 E-value=27 Score=14.45 Aligned_cols=24 Identities=0% Similarity=-0.058 Sum_probs=14.2 Q ss_pred HHHHHHHHHHHHCCCEEEEECCCC Q ss_conf 999999987520365146203321 Q gi|254780419|r 146 TRIINSMRCRMIAENFIECSTPIL 169 (622) Q Consensus 146 s~i~~~~R~~l~~~gF~EveTP~L 169 (622) ..++..+.+|...+|.+-++.--+ T Consensus 4 ~~~~~~~~~~w~~~gc~~~~p~d~ 27 (295) T TIGR00388 4 QGLLLKLQEYWANQGCLLVQPYDV 27 (295) T ss_pred HHHHHHHHHHHHCCCCEEEECCCC T ss_conf 889999888641067146411233 No 278 >PRK07080 hypothetical protein; Validated Probab=27.30 E-value=34 Score=13.71 Aligned_cols=41 Identities=20% Similarity=0.323 Sum_probs=24.9 Q ss_pred HHHHCCCCCC-CCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHH Q ss_conf 0110377766-553800003565410268999999999999999 Q gi|254780419|r 217 APCFRDEDPR-SDRLPGEFYQLDMEMSFVEQDDILNTMENVLRG 259 (622) Q Consensus 217 arcFRdEd~R-~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~ 259 (622) +-|||.|-|. -+|.. +|.|-+. .--.+.++++..-|.-+.. T Consensus 160 ~yCFRhEPS~dp~Rmq-~FRMrEy-V~iGt~e~v~a~R~~W~~r 201 (318) T PRK07080 160 SYCFRHEPSLDPARMQ-LFRMREY-VRIGTPEQILAFRQRWIER 201 (318) T ss_pred EEHHCCCCCCCHHHHH-HHHCEEE-EEECCHHHHHHHHHHHHHH T ss_conf 4210178988858888-6440445-7717999999999999999 No 279 >PRK05853 hypothetical protein; Validated Probab=27.04 E-value=38 Score=13.36 Aligned_cols=82 Identities=15% Similarity=0.157 Sum_probs=44.3 Q ss_pred CCEEEEEEEECC---CCCCCCEEEEEEE--------CCCCE--------EEEEECCCCHHHHHHHCCCCCCEEEEEEEEE Q ss_conf 988999979800---4446987999997--------79825--------9999748815789985289885899999999 Q gi|254780419|r 18 GSFVRLSGWVHR---VRPHGGIIFLDIR--------DHYGI--------TQVVANPDSACFEIAKSVRSEWVLCIDGLVM 78 (622) Q Consensus 18 g~~V~i~GwV~~---~R~~g~l~F~~lr--------D~sG~--------~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~ 78 (622) ...|+|.|+|.+ .|..+.-.++.+| +..|. +-|.+= ...+.....+|+.++.|.|.|.+. T Consensus 3 et~VTlvGnl~~DPe~R~t~~~~v~~FrvAsn~Rr~~~~G~W~dg~t~f~~V~~W-g~LAenv~~sL~KG~pViV~GRL~ 81 (168) T PRK05853 3 ETPITVVGHIVNDPQRRKVGDQEVIKFRVASNSRRRTADGGWEPGNSLFITVNCW-GRLVTGVGAALGKGAPVIVVGHVY 81 (168) T ss_pred CCEEEEEEECCCCCEEEECCCCCEEEEEEEECCCEECCCCCEEECCCEEEEEEEE-HHHHHHHHHHCCCCCEEEEEEEEE T ss_conf 8759999966879778870997489999986786787889888278369999982-899999986405999899999987 Q ss_pred ECCCCCCCCCCCCCEEEEEEEEE Q ss_conf 75865558877786599998899 Q gi|254780419|r 79 SRSAETINANIITGQIELSAQKI 101 (622) Q Consensus 79 ~r~~~~~n~~~~tG~~Ei~~~~~ 101 (622) .|.=...+-. +--.+||.+++| T Consensus 82 ~r~we~kdG~-kRs~~eI~A~~V 103 (168) T PRK05853 82 TSEYEDRDGN-RRSSLEMRATSV 103 (168) T ss_pred CCCCCCCCCC-EEEEEEEEEEEE T ss_conf 6512868998-888999999887 No 280 >TIGR01621 RluA-like pseudouridine synthase Rlu family protein, TIGR01621; InterPro: IPR006508 These sequences represent a clade within the pseudouridine synthase superfamily. The superfamily includes E. coli proteins: RluA, RluB, RluC, RluD, and RsuA. The sequences in this family are most closely related to RluA ; only Neisseria does not appear to have an RluA homolog. It is presumed that these sequences function as pseudouridine synthases, although perhaps with different specificity.. Probab=26.90 E-value=14 Score=16.51 Aligned_cols=22 Identities=23% Similarity=0.356 Sum_probs=18.0 Q ss_pred HHHHCCCCHHHH--HHHHHCCCHHH Q ss_conf 443112385456--44331057023 Q gi|254780419|r 190 SFYALPQAPQQY--KQLLMASGFDR 212 (622) Q Consensus 190 ~fYALpQSPQly--KQlLmv~G~dr 212 (622) ||| |+-|-|-. ||=|++|||+| T Consensus 81 KtY-LAls~~KPkKKQGlI~GdMek 104 (222) T TIGR01621 81 KTY-LALSDKKPKKKQGLIIGDMEK 104 (222) T ss_pred EEE-EECCCCCCCCCCCEEECCHHH T ss_conf 677-631368987444202557124 No 281 >PRK10606 btuE putative glutathione peroxidase; Provisional Probab=26.64 E-value=39 Score=13.32 Aligned_cols=39 Identities=5% Similarity=0.172 Sum_probs=24.1 Q ss_pred EEEEECCC-CCCCCCCHHHHHHHHHHHHHCCCCCEEEEECC Q ss_conf 99996278-76652103567799987642024623552047 Q gi|254780419|r 326 VWAIPAKG-GGGNRAFCDRMNEWARSQGQPGLAYIFWRLDE 365 (622) Q Consensus 326 v~~i~~~~-~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~ 365 (622) ++.+++.. .+.+ .+++.|+++-+.++.+||.-+-+..+. T Consensus 28 vLIVNvAS~CG~T-~qY~~L~~L~~~y~~~Gl~ILgFPcNq 67 (183) T PRK10606 28 LLIVNVASKCGLT-PQYEQLENIQKAWADQGFVVLGFPCNQ 67 (183) T ss_pred EEEEECCCCCCCC-HHHHHHHHHHHHHHCCCEEEEEEEHHH T ss_conf 9999777557880-779999999998411883899742066 No 282 >KOG0100 consensus Probab=26.33 E-value=39 Score=13.28 Aligned_cols=74 Identities=20% Similarity=0.197 Sum_probs=39.7 Q ss_pred EECCCCEECCCHHHHHH---HHH----HCCCCHHHHHHHHHHHHHHHHCCCCCCCCEE------------------CCHH Q ss_conf 72125123156999999---999----8599989998667789999860849734222------------------4689 Q gi|254780419|r 490 EIASGGIRNHMSEVMLQ---AFS----NVGISKEVVENRFGGLYRAFQCGMPPHGGIA------------------AGID 544 (622) Q Consensus 490 EiggGSiRIHd~~iQ~~---~f~----~lgl~~ee~~~~Fg~lL~Al~yG~PPHgG~a------------------lGlD 544 (622) =|-|||.||-. +|+- -|. .-|++++| |.-||+.-.||.- ||++ T Consensus 367 VLVGGsTrIPK--vQqllk~fF~GKepskGinPdE----------AVAYGAAVQaGvlsGee~t~divLLDv~pLtlGIE 434 (663) T KOG0100 367 VLVGGSTRIPK--VQQLLKDFFNGKEPSKGINPDE----------AVAYGAAVQAGVLSGEEDTGDIVLLDVNPLTLGIE 434 (663) T ss_pred EEECCCCCCHH--HHHHHHHHHCCCCCCCCCCHHH----------HHHHHHHHHHCCCCCCCCCCCEEEEEECCCCCEEE T ss_conf 99548656826--9999999828888667988678----------77765666532012566768579996045110036 Q ss_pred HHHHHHCCCCCCEEEECCCCCCCCCCCCCCC Q ss_conf 9999983739840321378999871034688 Q gi|254780419|r 545 RIVMLLLGAKNVREVSLFPMSQNFCDLLMGS 575 (622) Q Consensus 545 Rlvmll~g~~sIRdVIaFPKt~~g~Dlm~~a 575 (622) -.=-.||..--=--||+--|+|--+-.-.+- T Consensus 435 TvGGVMTklI~RNTviPTkKSQvFsTa~DnQ 465 (663) T KOG0100 435 TVGGVMTKLIPRNTVIPTKKSQVFSTAQDNQ 465 (663) T ss_pred EECCEEECCCCCCCCCCCCCCCEEEECCCCC T ss_conf 4143442024677626753020112100487 No 283 >PRK10841 hybrid sensory kinase in two-component regulatory system with RcsB and YojN; Provisional Probab=26.31 E-value=39 Score=13.27 Aligned_cols=13 Identities=0% Similarity=-0.110 Sum_probs=7.6 Q ss_pred CCHHHHHHHHHHH Q ss_conf 2527899999981 Q gi|254780419|r 379 LNIVNTEAIRVRL 391 (622) Q Consensus 379 l~~~~~~~l~~~~ 391 (622) ++++..+.|.+.+ T Consensus 628 Isee~~~~LFepF 640 (947) T PRK10841 628 IPAKEVVRLFDPF 640 (947) T ss_pred CCHHHHHHHHCHH T ss_conf 8999999884711 No 284 >pfam03197 FRD2 Bacteriophage FRD2 protein. Probab=26.22 E-value=39 Score=13.26 Aligned_cols=41 Identities=27% Similarity=0.597 Sum_probs=23.1 Q ss_pred CCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCC-CHHHHEEEH----HHHCCCCCCCC Q ss_conf 45664543222221034431123854564433105-702331100----11037776655 Q gi|254780419|r 174 PEGARDFLVPSRVNQGSFYALPQAPQQYKQLLMAS-GFDRYFQIA----PCFRDEDPRSD 228 (622) Q Consensus 174 peGArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~-G~dryfqia----rcFRdEd~R~d 228 (622) -.|+ .|+|---..-|+-| +++ +.|+||+|. .|||.=++-+| T Consensus 21 ~dg~-~~V~~I~canG~i~-------------~a~G~~e~YFei~edEF~cFrEyt~ekd 66 (102) T pfam03197 21 DDGD-KAVVHITCANGKIF-------------SAGGELETYFEIYEDEFKCFREYTSEKD 66 (102) T ss_pred CCCC-EEEEEEEECCCCEE-------------CCCCCHHHHHHHHHHHEEEEEEECCCCC T ss_conf 3672-15899994598387-------------1798457774350241466787036787 No 285 >TIGR00092 TIGR00092 GTP-binding protein YchF; InterPro: IPR004396 This is a family of conserved hypothetical proteins found in both prokaryotes and eukaryotes. While the function of these proteins is not known, the crystal structure of P44681 from SWISSPROT from Haemophilus influenzae has been determined . This protein consists of three domains: an N-terminal domain which has a mononucleotide binding fold typical for the P-loop NTPases, a central domain which forms an alpha-helical coiled coil, and a C-terminal domain composed of a six-stranded half-barrel curved around an alpha helix. The central and C-terminal domains are topologically similar to RNA-binding proteins, while the N-terminal region contains the features typical of GTP-dependent molecular switches. The purified protein was capable of binding both double-stranded nucleic acid and GTP. It was suggested, therefore, that this protein might be part of a nucleoprotein complex and could function as a GTP-dependent translation factor.; GO: 0005525 GTP binding. Probab=26.16 E-value=22 Score=15.10 Aligned_cols=134 Identities=15% Similarity=0.240 Sum_probs=67.3 Q ss_pred CHHHHEEEHHHHCCCCCCC---CCCCC-CEEEEEEEECCCCHHHHHHHHHHHHHHHHHHH---HCC-CCCCCCCCCCCHH Q ss_conf 7023311001103777665---53800-00356541026899999999999999999998---087-2345773212189 Q gi|254780419|r 209 GFDRYFQIAPCFRDEDPRS---DRLPG-EFYQLDMEMSFVEQDDILNTMENVLRGVFEEF---SNG-NTVSNNFLRICYD 280 (622) Q Consensus 209 G~dryfqiarcFRdEd~R~---dRqp~-EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v---~~~-~~i~~pF~rmtY~ 280 (622) -+|-.||+.|||-|+|.=| +=-|- -|-=|+-|..-+|-|-+-+..|++=+.+-+-- .++ ...+..+.-|+-- T Consensus 98 eVd~I~hVVRCF~d~~I~HV~G~VDPV~D~evI~~EL~LaDle~~~~~~E~~~~~i~~~~k~a~~~DK~~K~E~~lL~~~ 177 (390) T TIGR00092 98 EVDAICHVVRCFEDDDIIHVEGKVDPVRDLEVINNELILADLEFVEKILERLEKRIGKVKKKAKEGDKEAKEELALLEKI 177 (390) T ss_pred CCCCCEEEEEEEECCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH T ss_conf 03310478863307735885175176232688888888888999999999999999974042310037788889999999 Q ss_pred HHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHH--HHCCCCCEEEEEECCCCCC-CCCCHHHHHHHHHHH Q ss_conf 8875136777644476401244310178752456668--7403784799996278766-521035677999876 Q gi|254780419|r 281 DAIRRYGTDKPDLRNPIIMHNVTEHFVHSEFRVFSKI--LSVNPGYEVWAIPAKGGGG-NRAFCDRMNEWARSQ 351 (622) Q Consensus 281 eAm~~YGsDKPDLR~~lei~dit~~~~~~~f~vf~~~--~~~~~~~~v~~i~~~~~~~-srk~~D~l~e~ak~~ 351 (622) . +-|..+-.+..+.+++...+.+.+++. +..+|...+.-+....... ..+..+...+++..+ T Consensus 178 ~---------~~L~~g~~~~~~~dhl~~~E~~~iks~~lLT~KP~l~~~NVsE~D~~~~~N~~~~~~~e~~~~~ 242 (390) T TIGR00092 178 L---------EGLENGQMVRHVGDHLTEEELKIIKSLNLLTAKPILYLANVSEDDLVNLDNKYLLRVKEIAAKQ 242 (390) T ss_pred H---------HHHHCCCCEEECCCCCCHHHHHHHHHHCCHHHHHHHEEECCCHHCCCCCCCHHHHHHHHHHHHC T ss_conf 9---------9986388433102345888999998622121001405431681100367883789999999723 No 286 >pfam05873 Mt_ATP-synt_D ATP synthase D chain, mitochondrial (ATP5H). This family consists of several ATP synthase D chain, mitochondrial (ATP5H) proteins. Subunit d has no extensive hydrophobic sequences, and is not apparently related to any subunit described in the simpler ATP synthases in bacteria and chloroplasts. Probab=26.08 E-value=37 Score=13.44 Aligned_cols=22 Identities=9% Similarity=0.105 Sum_probs=12.4 Q ss_pred CCCCCCCCHHHHHHHCCCCCCC Q ss_conf 5773212189887513677764 Q gi|254780419|r 271 SNNFLRICYDDAIRRYGTDKPD 292 (622) Q Consensus 271 ~~pF~rmtY~eAm~~YGsDKPD 292 (622) -.||..||.+|--.-+=--+|| T Consensus 124 ~~PfeqMT~ee~~~a~Pe~~~D 145 (160) T pfam05873 124 MIPYDQMTMEDFNEAFPETALD 145 (160) T ss_pred CCCHHHCCHHHHHHHCCCCCCC T ss_conf 8995767999999759300258 No 287 >pfam06862 DUF1253 Protein of unknown function (DUF1253). This family represents the C-terminal portion (approximately 500 residues) of several hypothetical eukaryotic proteins of unknown function. Probab=25.95 E-value=24 Score=14.83 Aligned_cols=35 Identities=23% Similarity=0.400 Sum_probs=18.5 Q ss_pred CCCCCCEEEEEEEECCCCHHHHHHHHH-HHHHHHHHHH Q ss_conf 538000035654102689999999999-9999999998 Q gi|254780419|r 228 DRLPGEFYQLDMEMSFVEQDDILNTME-NVLRGVFEEF 264 (622) Q Consensus 228 dRqp~EFTqlD~EmsF~~~edvm~l~E-~li~~i~~~v 264 (622) +|+- -|-. -+|+--+|+-|++.+-- +-+.++++.+ T Consensus 147 ~rd~-DFLS-SIEvlIiDqad~l~MQNW~Hv~~v~~~l 182 (435) T pfam06862 147 KRDY-DFLS-SIEVLIVDQADVILMQNWEHVLTVFKHL 182 (435) T ss_pred CCCC-CCCC-EEEEEEECCHHHHHHHHHHHHHHHHHHH T ss_conf 4456-4000-0467874326689874199999999984 No 288 >KOG3706 consensus Probab=25.93 E-value=29 Score=14.15 Aligned_cols=67 Identities=7% Similarity=0.032 Sum_probs=32.2 Q ss_pred CHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHCCHHHHHHHHHHHCCCCCCCEEEEECCCCC Q ss_conf 035677999876420246235520477753335702553252789999998188755726998088541 Q gi|254780419|r 340 FCDRMNEWARSQGQPGLAYIFWRLDESGNVAGMGPVAKHLNIVNTEAIRVRLDMQDGDSCFFIAGDPLK 408 (622) Q Consensus 340 ~~D~l~e~ak~~g~kgL~~i~~~~~~~~~~~~~spi~K~l~~~~~~~l~~~~~~~~GD~i~f~a~~~~~ 408 (622) .+|+++.|+-+..+.. -|..+.-.. .......--+-+|+++...+-.-..-+++||++.|-.|.-.. T Consensus 334 HyDdIeaFvlQvEGrK-~Wrly~P~~-~~eel~l~sS~Nf~eedlgePV~e~vle~GDllYfPRG~IHQ 400 (629) T KOG3706 334 HYDDIEAFVLQVEGRK-HWRLYHPTV-PLEELALVSSDNFTEEDLGEPVHEFVLEPGDLLYFPRGTIHQ 400 (629) T ss_pred CHHHHHHHHHEECCCE-EEEEECCCC-CHHHHHHCCCCCCCHHHHCCCHHHHHCCCCCEEEECCCCEEE T ss_conf 6113565630432600-057616997-176653203788796684871677533777578831753010 No 289 >PRK11260 cystine transporter subunit; Provisional Probab=25.90 E-value=36 Score=13.54 Aligned_cols=48 Identities=19% Similarity=0.429 Sum_probs=29.9 Q ss_pred HHCCCCCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHCCCCCCC Q ss_conf 10377766553800003565410268999999999999999999980872345773212189887513677764 Q gi|254780419|r 219 CFRDEDPRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFLRICYDDAIRRYGTDKPD 292 (622) Q Consensus 219 cFRdEd~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~rmtY~eAm~~YGsDKPD 292 (622) +|+||+ .|.+.+|+|+ .+.+.+.. + ++..|..++|.+.+..--+.|=| T Consensus 53 ~~~d~~-------G~~~GfdvDl---------------~~~ia~~l--g--~~~e~~~~~~~~~i~~L~~g~~D 100 (264) T PRK11260 53 SFQGED-------GKLTGFEVEF---------------AEALAKHL--G--VKASLKPTKWDGMLASLDSKRID 100 (264) T ss_pred EEECCC-------CCEEEEEHHH---------------HHHHHHHH--C--CCEEEEECCHHHHHHHHHCCCCC T ss_conf 689999-------9988871999---------------99999984--9--95699978889999998669946 No 290 >KOG3182 consensus Probab=25.80 E-value=40 Score=13.21 Aligned_cols=82 Identities=15% Similarity=0.185 Sum_probs=48.7 Q ss_pred CHHHHHHHHHHHHHHHH---CCCEEEEECCCCCCCCCCCCCC--CCCCCCCCHHHHHHCC--CCHHHHHHHHHCCC---- Q ss_conf 12458999999987520---3651462033211246456645--4322222103443112--38545644331057---- Q gi|254780419|r 141 NIVKRTRIINSMRCRMI---AENFIECSTPILTASSPEGARD--FLVPSRVNQGSFYALP--QAPQQYKQLLMASG---- 209 (622) Q Consensus 141 ~l~~Rs~i~~~~R~~l~---~~gF~EveTP~L~ksTpeGArd--FlVPsR~~~g~fYALp--QSPQlyKQlLmv~G---- 209 (622) .+++|.+-....++||+ .+|+.-.++|.-..+++++... -+|-.-+..+..| |. +-+++-||+..+.| T Consensus 73 ay~V~g~~~~~~l~yl~~RE~nGY~~~~v~f~~e~~~~~p~v~~vlvyvaTp~N~~y-lGp~ple~iArqI~t~~GpsG~ 151 (212) T KOG3182 73 AYRVRGKQASEVLEYLNVRELNGYTTHEVEFYPEDAAELPEVLGVLVYVATPDNEYY-LGPAPLEEIARQIVTARGPSGP 151 (212) T ss_pred EEEECCHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCEEEEEEEEECCCCCCC-CCCCCHHHHHHHHHHCCCCCCC T ss_conf 999664267889999878762684012466651588777705789999705787465-6986689999999863499998 Q ss_pred -HHHHEEEHHHHCCC Q ss_conf -02331100110377 Q gi|254780419|r 210 -FDRYFQIAPCFRDE 223 (622) Q Consensus 210 -~dryfqiarcFRdE 223 (622) .|=.|.+|.|-|+. T Consensus 152 N~eYLf~La~am~~l 166 (212) T KOG3182 152 NREYLFNLAKAMRQL 166 (212) T ss_pred CHHHHHHHHHHHHHC T ss_conf 689999999999970 No 291 >COG1951 TtdA Tartrate dehydratase alpha subunit/Fumarate hydratase class I, N-terminal domain [Energy production and conversion] Probab=25.72 E-value=40 Score=13.20 Aligned_cols=44 Identities=16% Similarity=0.157 Sum_probs=22.8 Q ss_pred HHHHHHH-C---CCCC-CCCEECCHHHH-HHHHCC---CCCCEEEECCCCCCC Q ss_conf 8999986-0---8497-34222468999-999837---398403213789998 Q gi|254780419|r 524 GLYRAFQ-C---GMPP-HGGIAAGIDRI-VMLLLG---AKNVREVSLFPMSQN 567 (622) Q Consensus 524 ~lL~Al~-y---G~PP-HgG~alGlDRl-vmll~g---~~sIRdVIaFPKt~~ 567 (622) |.++.-+ + +||| |=|+|+|=..- .++|+. ...|-|.-+=|+..+ T Consensus 171 fVletv~~~G~~~CPP~~vgiGIGgt~~~A~~LaKkal~r~i~~~~~~~~~~~ 223 (297) T COG1951 171 FVLETVRTLGGNACPPYIVGIGIGGTAEKAALLAKKALLRPIDDRNPNEKNAK 223 (297) T ss_pred HHHHHHHHCCCCCCCCEEEEEEECCCHHHHHHHHHHHHCCCCCCCCCCCCCHH T ss_conf 99999985499899970688885775999999999996074555699842005 No 292 >TIGR02924 ICDH_alpha isocitrate dehydrogenase; InterPro: IPR014273 This entry represents a group of isocitrate dehydrogenases found mainly in the alphaproteobacteria. Many of the species containing these proteins appear to have a TCA cycle lacking only a determined isocitrate dehydrogenase. The precise identity of the cofactor (NADH -- 1.1.1.41 from EC vs. NADPH -- 1.1.1.42 from EC) is unclear.. Probab=25.62 E-value=28 Score=14.32 Aligned_cols=166 Identities=19% Similarity=0.321 Sum_probs=94.5 Q ss_pred CCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEEEECCCCC---C-CCCCCCCCCCHHHHCCCCCEEE---CCCCCC Q ss_conf 88589999999975865558877786599998899995021179---9-8645766789878211232000---121111 Q gi|254780419|r 67 SEWVLCIDGLVMSRSAETINANIITGQIELSAQKIEILSVAEEL---P-LPVFGEREYPESIRFKYRFLDL---RRDTLH 139 (622) Q Consensus 67 ~esvv~v~G~V~~r~~~~~n~~~~tG~~Ei~~~~~~il~~a~~~---P-~~~~~~~~~~e~~rl~~R~LdL---R~~~~~ 139 (622) .|+-|.|+-+ +=.++..++.-++| |..++|+-+-+++-| | -.+++...-+..+-||. -|.| -||-.. T Consensus 26 A~A~i~ieti--EiG~~~Y~k~w~~G---I~~SsWesI~RtKvLLKaPitTPQGgG~KSLNVtlRK-~LGLyANiRPCvs 99 (481) T TIGR02924 26 AEAEIQIETI--EIGEKVYKKEWSSG---ISPSSWESIRRTKVLLKAPITTPQGGGHKSLNVTLRK-TLGLYANIRPCVS 99 (481) T ss_pred HCCCCEEEEE--EECCEECCCCCCCC---CCCCCHHHHHHCCCEECCCCCCCCCCCEEEHHHHHHH-HCCCEEECCCCCC T ss_conf 1796256788--71310114788547---8756078897577044178778865853412446432-2475041266530 Q ss_pred CCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHH---CCCCHHHH--HHHHHCCCHHHH- Q ss_conf 212458999999987520365146203321124645664543222221034431---12385456--443310570233- Q gi|254780419|r 140 KNIVKRTRIINSMRCRMIAENFIECSTPILTASSPEGARDFLVPSRVNQGSFYA---LPQAPQQY--KQLLMASGFDRY- 213 (622) Q Consensus 140 ~~l~~Rs~i~~~~R~~l~~~gF~EveTP~L~ksTpeGArdFlVPsR~~~g~fYA---LpQSPQly--KQlLmv~G~dry- 213 (622) =+-|+|-.-|-|- +|=-|=+.--.|+ =.|++-.| -.+..=||-||. T Consensus 100 ------------------yhP~i~t~~p~ld----------iVivRENEEDlYtGiEYR~T~d~Y~c~K~ITrsgsEkIc 151 (481) T TIGR02924 100 ------------------YHPVIETKYPNLD----------IVIVRENEEDLYTGIEYRQTQDTYECTKVITRSGSEKIC 151 (481) T ss_pred ------------------CCCEECCCCCCCC----------EEEEEECCCCCCCCEEEEECCCCEEEEEEEECCCCHHHH T ss_conf ------------------0352416779834----------699983564232551143057810255534068625789 Q ss_pred ---EEEHHHHCCCCCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHCCCCC Q ss_conf ---11001103777665538000035654102689999999999999999999808723457732121898875136777 Q gi|254780419|r 214 ---FQIAPCFRDEDPRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFLRICYDDAIRRYGTDK 290 (622) Q Consensus 214 ---fqiarcFRdEd~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~rmtY~eAm~~YGsDK 290 (622) |+-||. ++|= -=|= ...+.||+++.++++.+|++++... T Consensus 152 rYAF~YA~~-------~nRK--kVTC-------l~KDNIMK~TDGiFHk~F~~IA~eY---------------------- 193 (481) T TIGR02924 152 RYAFEYARK-------HNRK--KVTC-------LIKDNIMKLTDGIFHKIFDKIAKEY---------------------- 193 (481) T ss_pred HHHHHHHHH-------CCCE--EEEE-------EECCCCCCCCCHHHHHHHHHHHHCC---------------------- T ss_conf 999898776-------4986--8999-------6203713012236787789974317---------------------- Q ss_pred CCCCCCCCCCCCCC Q ss_conf 64447640124431 Q gi|254780419|r 291 PDLRNPIIMHNVTE 304 (622) Q Consensus 291 PDLR~~lei~dit~ 304 (622) ||.--+--|+||.- T Consensus 194 PdI~~EhyIvDIG~ 207 (481) T TIGR02924 194 PDIESEHYIVDIGM 207 (481) T ss_pred CCCCCCCEEEECCH T ss_conf 88541640681033 No 293 >cd04261 AAK_AKii-LysC-BS AAK_AKii-LysC-BS: Amino Acid Kinase Superfamily (AAK), AKii; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis 168, and the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, and related sequences. In B. subtilis 168, the regulation of the diaminopimelate (Dap)-lysine biosynthetic pathway involves dual control by Dap and lysine, effected through separate Dap- and lysine-sensitive aspartokinase isoenzymes. The B. subtilis 168 AKII is induced by methionine, and repressed and inhibited by lysine. Although Corynebacterium glutamicum is known to contain a single aspartokinase isoenzyme type, both the succinylase and dehydrogenase variant pathways of DAP-lysine synthesis operate simultaneously in this organism. In this organism and other various Gram-positive bacteria, the DAP-lysine pathway is feedback regulated by the concerted action of lysine and Probab=25.37 E-value=39 Score=13.30 Aligned_cols=11 Identities=27% Similarity=0.625 Sum_probs=4.8 Q ss_pred CCCCCCHHHHH Q ss_conf 73212189887 Q gi|254780419|r 273 NFLRICYDDAI 283 (622) Q Consensus 273 pF~rmtY~eAm 283 (622) ++++|||+||| T Consensus 189 ~i~~lsy~EA~ 199 (239) T cd04261 189 KLDEISYDEML 199 (239) T ss_pred CCCEECHHHHH T ss_conf 73770799999 No 294 >TIGR02708 L_lactate_ox L-lactate oxidase; InterPro: IPR014080 Members of this entry oxidize L-lactate to pyruvate, reducing molecular oxygen to hydrogen peroxide. The enzyme is known in Aerococcus viridans, Streptococcus iniae, and some strains of Streptococcus pyogenes where it appears to contribute to virulence.. Probab=25.31 E-value=26 Score=14.50 Aligned_cols=19 Identities=26% Similarity=0.438 Sum_probs=16.5 Q ss_pred HHHHHHCCCCCCEEEECCC Q ss_conf 9999983739840321378 Q gi|254780419|r 545 RIVMLLLGAKNVREVSLFP 563 (622) Q Consensus 545 Rlvmll~g~~sIRdVIaFP 563 (622) ..||-|+|+++|.||=.|- T Consensus 340 ~~VMQL~G~Q~i~D~K~~~ 358 (368) T TIGR02708 340 KRVMQLTGTQTIEDVKGLD 358 (368) T ss_pred HHHHHHCCCCCHHHHHCCC T ss_conf 7776413875156532141 No 295 >COG1982 LdcC Arginine/lysine/ornithine decarboxylases [Amino acid transport and metabolism] Probab=25.03 E-value=28 Score=14.29 Aligned_cols=44 Identities=27% Similarity=0.213 Sum_probs=33.0 Q ss_pred CCCEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCHHHHHH Q ss_conf 3651462033211246456645432222210344311238545644 Q gi|254780419|r 158 AENFIECSTPILTASSPEGARDFLVPSRVNQGSFYALPQAPQQYKQ 203 (622) Q Consensus 158 ~~gF~EveTP~L~ksTpeGArdFlVPsR~~~g~fYALpQSPQlyKQ 203 (622) .+++...-.|.+..|..+++++|.++|- |+-- =||.|+.|+.+| T Consensus 203 ah~~~~~~~~~l~~~~~~~~~~~~tqS~-HK~l-~alSQaS~iHv~ 246 (557) T COG1982 203 AHPAHFDFSPMLPESALNGGADFVTQST-HKLL-AALSQASMIHVK 246 (557) T ss_pred CCCCCCCCCCCCCCHHHHCCCEEEEEEH-HHHH-HHHHHHHHHHHC T ss_conf 5774335555574114545743898105-4421-234366787508 No 296 >cd04461 S1_Rrp5_repeat_hs8_sc7 S1_Rrp5_repeat_hs8_sc7: Rrp5 Homo sapiens S1 repeat 8 (hs8) and Saccharomyces cerevisiae S1 repeat 7 (sc7)-like domains. Rrp5 is a trans-acting factor important for biogenesis of both the 40S and 60S eukaryotic ribosomal subunits. Rrp5 has two distinct regions, an N-terminal region containing tandemly repeated S1 RNA-binding domains (12 S1 repeats in S. cerevisiae Rrp5 and 14 S1 repeats in H. sapiens Rrp5) and a C-terminal region containing tetratricopeptide repeat (TPR) motifs thought to be involved in protein-protein interactions. Mutational studies have shown that each region represents a specific functional domain. Deletions within the S1-containing region inhibit pre-rRNA processing at either site A3 or A2, whereas deletions within the TPR region confer an inability to support cleavage of A0-A2. This CD includes H. sapiens S1 repeat 8 and S. cerevisiae S1 repeat 7. Rrp5 is found in eukaryotes but not in prokaryotes or archaea. Probab=24.91 E-value=41 Score=13.10 Aligned_cols=54 Identities=15% Similarity=0.271 Sum_probs=28.9 Q ss_pred CCCEEEEEEEECCCCCCCCEEEEEEECC-CCEEEEEECCCCHHHHHHHCCCCCCEEEEE Q ss_conf 8988999979800444698799999779-825999974881578998528988589999 Q gi|254780419|r 17 VGSFVRLSGWVHRVRPHGGIIFLDIRDH-YGITQVVANPDSACFEIAKSVRSEWVLCID 74 (622) Q Consensus 17 ~g~~V~i~GwV~~~R~~g~l~F~~lrD~-sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~ 74 (622) +|. .+.|.|.++.+.| +|+.|-++ +|.+..---.+....+.-+.++.+..|.+. T Consensus 14 ~G~--iv~G~V~~i~~~G--~FV~l~~~v~Glv~~s~ls~~~~~~~~~~~~vGq~V~~~ 68 (83) T cd04461 14 PGM--VVHGYVRNITPYG--VFVEFLGGLTGLAPKSYISDEFVTDPSFGFKKGQSVTAK 68 (83) T ss_pred CCC--EEEEEEEEEECCE--EEEEECCCCEEEEEHHHCCCCCCCCHHHHCCCCCEEEEE T ss_conf 999--9999999992781--999956994899899996864226989945599999999 No 297 >PRK10203 hypothetical protein; Provisional Probab=24.62 E-value=28 Score=14.27 Aligned_cols=11 Identities=27% Similarity=0.555 Sum_probs=3.6 Q ss_pred CCCCHHHHCCC Q ss_conf 67898782112 Q gi|254780419|r 118 REYPESIRFKY 128 (622) Q Consensus 118 ~~~~e~~rl~~ 128 (622) ..+|+|+|+-| T Consensus 37 s~vP~elR~ay 47 (122) T PRK10203 37 SHVPPELRAGY 47 (122) T ss_pred CCCCHHHHHHH T ss_conf 68999999999 No 298 >KOG4134 consensus Probab=24.56 E-value=37 Score=13.48 Aligned_cols=22 Identities=18% Similarity=0.440 Sum_probs=8.2 Q ss_pred HHHHCCCCCCCCCCCCCEEEEEEEE Q ss_conf 0110377766553800003565410 Q gi|254780419|r 217 APCFRDEDPRSDRLPGEFYQLDMEM 241 (622) Q Consensus 217 arcFRdEd~R~dRqp~EFTqlD~Em 241 (622) -+||-+-=-++. +| +=|+ +++| T Consensus 128 hg~FNASIpk~n-ip-~dw~-fI~m 149 (253) T KOG4134 128 HGVFNASIPKTN-IP-ADWE-FIAM 149 (253) T ss_pred HHHHHCCCCCCC-CC-CCEE-EECC T ss_conf 102314578888-86-3304-4448 No 299 >PRK05733 single-stranded DNA-binding protein; Provisional Probab=24.55 E-value=42 Score=13.05 Aligned_cols=83 Identities=22% Similarity=0.247 Sum_probs=48.4 Q ss_pred EEEEEEEECC---CCC--CC-CEEEEEEE------CC-CC-------EEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEE Q ss_conf 8999979800---444--69-87999997------79-82-------599997488157899852898858999999997 Q gi|254780419|r 20 FVRLSGWVHR---VRP--HG-GIIFLDIR------DH-YG-------ITQVVANPDSACFEIAKSVRSEWVLCIDGLVMS 79 (622) Q Consensus 20 ~V~i~GwV~~---~R~--~g-~l~F~~lr------D~-sG-------~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~ 79 (622) +|+|.|+|.+ .|- .| .++-+.|- |. +| -+.|++-.. .+....+.|+.++-|.|+|.++. T Consensus 7 kV~LiG~lg~DPElr~t~~G~~v~~fslAt~~~~~~~~~ge~~e~T~w~~v~~wgk-~AE~~~~yl~KG~~V~VeGrL~t 85 (172) T PRK05733 7 KVILVGTCGQDPEVRYLPNGNAVTNLSLATSEQWTDKQSGQKVERTEWHRVSLFGK-VAEIAGEYLRKGSQVYIEGKLQT 85 (172) T ss_pred EEEEEEECCCCCEEEECCCCCEEEEEEEEEECCCCCCCCCCEECCCCEEEEEEECH-HHHHHHHHCCCCCEEEEEEEEEE T ss_conf 89997555779638888999879999999725500167875831232289987345-78888865389998999988884 Q ss_pred CCCCCCCCCCCCCEEEEEEE---EEEEEE Q ss_conf 58655588777865999988---999950 Q gi|254780419|r 80 RSAETINANIITGQIELSAQ---KIEILS 105 (622) Q Consensus 80 r~~~~~n~~~~tG~~Ei~~~---~~~il~ 105 (622) |.=.. + ..+-=-.||.|+ .++.|. T Consensus 86 r~wek-d-G~kry~teiv~~~~~~~q~lg 112 (172) T PRK05733 86 REWEK-D-GIKRYTTEIVVDMQGTMQLLG 112 (172) T ss_pred ECHHH-C-CCEEEEEEEEEECCCCEEEEC T ss_conf 04755-8-987989999994577578832 No 300 >COG1415 Uncharacterized conserved protein [Function unknown] Probab=24.17 E-value=43 Score=13.00 Aligned_cols=28 Identities=25% Similarity=0.342 Sum_probs=10.8 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 999985999899986677899998608497 Q gi|254780419|r 506 QAFSNVGISKEVVENRFGGLYRAFQCGMPP 535 (622) Q Consensus 506 ~~f~~lgl~~ee~~~~Fg~lL~Al~yG~PP 535 (622) +.+-.-|+.+..+.+- .|+--+=||.|| T Consensus 279 elLl~~GiGpstvRAL--alVAEvIyg~~~ 306 (373) T COG1415 279 ELLLVPGIGPSTVRAL--ALVAEVIYGEPP 306 (373) T ss_pred HHHHCCCCCHHHHHHH--HHHHHHHHCCCC T ss_conf 9874068788999999--999999808998 No 301 >PRK06642 single-stranded DNA-binding protein; Provisional Probab=24.09 E-value=43 Score=12.99 Aligned_cols=81 Identities=25% Similarity=0.261 Sum_probs=43.3 Q ss_pred EEEEEEEECC---CCCC--C-CEEEEEEE------CC-CCE-------EEEEECCCCHHHHHH-HCCCCCCEEEEEEEEE Q ss_conf 8999979800---4446--9-87999997------79-825-------999974881578998-5289885899999999 Q gi|254780419|r 20 FVRLSGWVHR---VRPH--G-GIIFLDIR------DH-YGI-------TQVVANPDSACFEIA-KSVRSEWVLCIDGLVM 78 (622) Q Consensus 20 ~V~i~GwV~~---~R~~--g-~l~F~~lr------D~-sG~-------~Q~v~~~~~~~~~~~-~~l~~esvv~v~G~V~ 78 (622) .|+|.|++.+ +|.. | .++=+.|. |. +|- +.||+-.+. .-+.+ +-|+.++-|.|+|.++ T Consensus 7 kV~LiG~lg~DPElr~t~~G~~v~~fslA~~~~~k~~~~ge~~e~t~w~~vv~~~k~-~ae~~~~yl~KG~~V~VeG~L~ 85 (152) T PRK06642 7 KVILIGNVGRDPEIRTTGEGKKIINLSLATTETWKDRITSERKERTEWHRVVIFSEG-LVSVVERYVTKGSKLYIEGSLQ 85 (152) T ss_pred EEEEEEECCCCCEEEECCCCCEEEEEEEEEEECEEECCCCCCCCCHHHHHHHHHHHH-HHHHHHHHCCCCCEEEEEEEEE T ss_conf 899986357696588879997689999997301011257762431168887763389-9999998656888899966787 Q ss_pred ECCCCCCCCCCCCCEEEEEEEEEE Q ss_conf 758655588777865999988999 Q gi|254780419|r 79 SRSAETINANIITGQIELSAQKIE 102 (622) Q Consensus 79 ~r~~~~~n~~~~tG~~Ei~~~~~~ 102 (622) .|.=...+-. +-=-.||.|+++. T Consensus 86 ~r~y~dkdG~-~r~~teIv~~~~~ 108 (152) T PRK06642 86 TRKWNDNSGQ-EKYTTEVVLQNFN 108 (152) T ss_pred ECCCCCCCCC-EEEEEEEEEECCC T ss_conf 5355989999-8989999996787 No 302 >cd05119 RIO RIO kinase family, catalytic domain. The RIO kinase catalytic domain family is part of a larger superfamily, that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase (PI3K). RIO kinases are atypical protein serine kinases present in archaea, bacteria and eukaryotes. Serine kinases catalyze the transfer of the gamma-phosphoryl group from ATP to serine residues in protein substrates. RIO kinases contain a kinase catalytic signature, but otherwise show very little sequence similarity to typical PKs. The RIO catalytic domain is truncated compared to the catalytic domains of typical PKs, with deletions of the loops responsible for substrate binding. Most organisms contain at least two RIO kinases, RIO1 and RIO2. A third protein, RIO3, is present in multicellular eukaryotes. In yeast, RIO1 and RIO2 are essential for survival. They funct Probab=23.94 E-value=38 Score=13.34 Aligned_cols=29 Identities=24% Similarity=0.384 Sum_probs=16.4 Q ss_pred EEEEEECCCEEECCCCCCCCHHHCCCCCCCCCHHHHHCCCHHHH Q ss_conf 28896077053545547000211002476643065541887774 Q gi|254780419|r 434 FCWIMDFPFYEWNEEEKKIDFAHNPFTMPQGGMESLKGNDLLSI 477 (622) Q Consensus 434 flWV~DFPLFe~dee~~r~~s~HHPFTaP~~~~~~l~~~dp~~v 477 (622) =+||+||| ++.++.|| ...+++.-|...+ T Consensus 152 ~~~iID~p---------QaV~~~hp------~a~~~L~RDi~ni 180 (187) T cd05119 152 KVYIIDVP---------QAVEIDHP------NAEEFLRRDVENI 180 (187) T ss_pred EEEEEECC---------CEECCCCC------CHHHHHHHHHHHH T ss_conf 08999557---------21048998------8899999999899 No 303 >TIGR00348 hsdR type I site-specific deoxyribonuclease, HsdR family; InterPro: IPR004473 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements , , as summarised below: Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA. Type I restriction endonucleases are components of prokaryotic DNA restriction-modification mechanisms that protects the organism against invading foreign DNA. Type I enzymes have three different subunits subunits - M (modification), S (specificity) and R (restriction) - that form multifunctional enzymes with restriction (3.1.21.3 from EC), methylase (2.1.1.72 from EC) and ATPase activities , . The S subunit is required for both restriction and modification and is responsible for recognition of the DNA sequence specific for the system. The M subunit is necessary for modification, and the R subunit is required for restriction. These enzymes use S-Adenosyl-L-methionine (AdoMet) as the methyl group donor in the methylation reaction, and have a requirement for ATP. They recognize asymmetric DNA sequences split into two domains of specific sequence, one 3-4 bp long and another 4-5 bp long, separated by a nonspecific spacer 6-8 bp in length. Cleavage occurs a considerable distance from the recognition sites, rarely less than 400 bp away and up to 7000 bp away. Adenosyl residues are methylated, one on each strand of the recognition sequence. These enzymes are widespread in eubacteria and archaea. In enteric bacteria they have been subdivide into four families: types IA, IB, IC and ID. This entry represents the R subunit (HsdR) of type I restriction endonucleases such as EcoRI (3.1.21.3 from EC), which recognises the DNA sequence 5'-GAATTC; the R protein (HsdR) is required for both nuclease and ATPase activity , , , .; GO: 0003677 DNA binding, 0005524 ATP binding, 0009035 Type I site-specific deoxyribonuclease activity, 0009307 DNA restriction-modification system. Probab=23.86 E-value=18 Score=15.66 Aligned_cols=19 Identities=32% Similarity=0.574 Sum_probs=12.5 Q ss_pred CH--HHHH--HHHHCC-CHHHHEE Q ss_conf 85--4564--433105-7023311 Q gi|254780419|r 197 AP--QQYK--QLLMAS-GFDRYFQ 215 (622) Q Consensus 197 SP--QlyK--QlLmv~-G~dryfq 215 (622) +| +||| |++||| |+..||= T Consensus 182 ~pE~~LF~YvQ~~v~s~g~~~~y~ 205 (813) T TIGR00348 182 IPEDELFKYVQILVISNGTKLRYG 205 (813) T ss_pred CCHHHCCCEEEEEEEECCCCCCCC T ss_conf 761120013576787255001357 No 304 >KOG3100 consensus Probab=23.77 E-value=23 Score=14.91 Aligned_cols=12 Identities=33% Similarity=0.623 Sum_probs=5.3 Q ss_pred CHHHHEEEHHHH Q ss_conf 702331100110 Q gi|254780419|r 209 GFDRYFQIAPCF 220 (622) Q Consensus 209 G~dryfqiarcF 220 (622) ++.+|||||..- T Consensus 153 ~~PKYFQIGtIV 164 (206) T KOG3100 153 EVPKYFQIGTIV 164 (206) T ss_pred HCCHHHEEEEEC T ss_conf 133240300110 No 305 >pfam01411 tRNA-synt_2c tRNA synthetases class II (A). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only alanyl-tRNA synthetases. Probab=23.73 E-value=43 Score=12.95 Aligned_cols=96 Identities=17% Similarity=0.240 Sum_probs=51.1 Q ss_pred HHHHHHHHHHCCCEEEEECCCCCCCCCCC-CCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCCH-----HHHEEEHHHHC Q ss_conf 99999875203651462033211246456-6454322222103443112385456443310570-----23311001103 Q gi|254780419|r 148 IINSMRCRMIAENFIECSTPILTASSPEG-ARDFLVPSRVNQGSFYALPQAPQQYKQLLMASGF-----DRYFQIAPCFR 221 (622) Q Consensus 148 i~~~~R~~l~~~gF~EveTP~L~ksTpeG-ArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G~-----dryfqiarcFR 221 (622) |-..+-+||.++|-..|..--|. |.. -.-++|-+ |=++||... .||. .|.=..-||.| T Consensus 2 iR~~fl~fF~~~gH~~i~s~~lv---p~~d~~llft~A------------gm~~fkp~~-~g~~~~pp~~~l~~~Q~CiR 65 (545) T pfam01411 2 IRQTFLDFFEKKGHTRVPSSPVV---PRNDPTLLFTNA------------GMNQFKPIF-LGGEDKPPYNRAVNSQKCIR 65 (545) T ss_pred HHHHHHHHHHHCCCEEECCCCCC---CCCCCCEEEECC------------CCHHCHHHH-CCCCCCCCCCCCCCCCCCEE T ss_conf 68999999987898880786706---799998146404------------601173765-49875899887244333244 Q ss_pred CCCC----CCCCCCCCEEEEEEEECCCCH--HHHHHHHHHHHHHH Q ss_conf 7776----655380000356541026899--99999999999999 Q gi|254780419|r 222 DEDP----RSDRLPGEFYQLDMEMSFVEQ--DDILNTMENVLRGV 260 (622) Q Consensus 222 dEd~----R~dRqp~EFTqlD~EmsF~~~--edvm~l~E~li~~i 260 (622) .-|+ ++.||.+=|.||- -.||-+| ++-+...=+++... T Consensus 66 ~~DidnVG~t~rH~T~FEMlG-n~SFg~YfK~eai~~a~e~lt~~ 109 (545) T pfam01411 66 AGDLDNVGRTARHHTFFEMLG-NFSFGDYFKEEAIEFAWELLTKE 109 (545) T ss_pred CCCHHHCCCCCCCCHHHHCCC-CCCCCCCHHHHHHHHHHHHHHHH T ss_conf 376543367777532322033-45156243789999999997656 No 306 >pfam08918 PhoQ_Sensor PhoQ Sensor. The PhoQ Sensor is required for the virulence of various Gram-negative bacteria by allowing interaction of PhoPQ with the intracellular membrane, resulting in remodelling of the bacterial cell surface and subsequent bacterial resistance to host antimicrobial peptides. The domain contains a major flat acidic surface, which binds to at least 3 calcium ions, neutralising the domain's negative charge and allowing interaction with the negatively charged membrane. Probab=23.50 E-value=44 Score=12.92 Aligned_cols=35 Identities=26% Similarity=0.190 Sum_probs=24.9 Q ss_pred HHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHH Q ss_conf 987520365146203321124645664543222221034431123854564433 Q gi|254780419|r 152 MRCRMIAENFIECSTPILTASSPEGARDFLVPSRVNQGSFYALPQAPQQYKQLL 205 (622) Q Consensus 152 ~R~~l~~~gF~EveTP~L~ksTpeGArdFlVPsR~~~g~fYALpQSPQlyKQlL 205 (622) -++||.++||.|++|-+=+.+. -|.+.||.-.||- T Consensus 101 ~~eWL~k~g~~Eidtd~~~s~~-------------------~L~~~~~~q~~L~ 135 (180) T pfam08918 101 QPEWLKKTGFHELDTDTDTSSA-------------------LLGNNPEAQDQLK 135 (180) T ss_pred CHHHCCCCCCEEEECCCCHHHH-------------------HHCCCHHHHHHHH T ss_conf 8976057994687647654788-------------------7656989999987 No 307 >CHL00192 syfB phenylalanyl-tRNA synthetase beta chain; Provisional Probab=23.21 E-value=44 Score=12.88 Aligned_cols=81 Identities=14% Similarity=0.130 Sum_probs=48.9 Q ss_pred HHCCCEEECCEEECCCCEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCC-CCCCEECCHHHHHHHHCCCCC-- Q ss_conf 31367777567721251231569999999998599989998667789999860849-734222468999999837398-- Q gi|254780419|r 479 AFQYDLVCNGFEIASGGIRNHMSEVMLQAFSNVGISKEVVENRFGGLYRAFQCGMP-PHGGIAAGIDRIVMLLLGAKN-- 555 (622) Q Consensus 479 a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~lgl~~ee~~~~Fg~lL~Al~yG~P-PHgG~alGlDRlvmll~g~~s-- 555 (622) +++=++.+||..||- ==.|| |.+++ +||.+ |.-.+-|-+|.|......... T Consensus 548 g~sA~I~~~~~~iG~-iG~lh-P~v~~------------------------~~~l~~~v~~~Eidl~~l~~~~~~~~~~~ 601 (702) T CHL00192 548 GRSAEIIYNNENIGI-FGQLH-PLLAS------------------------NLGLNTSTYLFEFDLEKLQASIEQLNYLN 601 (702) T ss_pred CCEEEEEECCEEEEE-EEEEC-HHHHH------------------------HCCCCCCEEEEEEEHHHHHHHHHHCCCCC T ss_conf 754899999979999-99987-99998------------------------73999977999999799432344304466 Q ss_pred --CEEEECCCCCCCCCCCCCCCCCCCCHHHHHHC Q ss_conf --40321378999871034688672898899870 Q gi|254780419|r 556 --VREVSLFPMSQNFCDLLMGSPSTVSVEQLREL 587 (622) Q Consensus 556 --IRdVIaFPKt~~g~Dlm~~aPs~v~~~qL~e~ 587 (622) .+..--||.. -.|+=.-.|..|+-+++... T Consensus 602 ~~~~~~skfP~v--~RDlalvv~~~v~~~~I~~~ 633 (702) T CHL00192 602 YIIQPYSKYPSI--TRDLSLIIPKNMTISFLLKL 633 (702) T ss_pred CCCCCCCCCCCC--CEEEEEEECCCCCHHHHHHH T ss_conf 656778999970--06899998999979999999 No 308 >cd00144 PP2Ac Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, including PP1, PP2A and PP2B (calcineurin) family members. Probab=23.19 E-value=43 Score=12.98 Aligned_cols=23 Identities=22% Similarity=0.450 Sum_probs=13.0 Q ss_pred HHHHHHHHH-HCCCEEEEECCCCC Q ss_conf 999998752-03651462033211 Q gi|254780419|r 148 IINSMRCRM-IAENFIECSTPILT 170 (622) Q Consensus 148 i~~~~R~~l-~~~gF~EveTP~L~ 170 (622) ++....+-| .+...++|+.|+.. T Consensus 9 l~~~~~~il~~ep~ll~i~~pv~V 32 (269) T cd00144 9 LIRAVKEIFRQEPNLVEVSAPVTV 32 (269) T ss_pred HHHHHHHHHHHCCCEEEECCCEEE T ss_conf 999999999859985996699899 No 309 >COG2037 Ftr Formylmethanofuran:tetrahydromethanopterin formyltransferase [Energy production and conversion] Probab=23.11 E-value=27 Score=14.42 Aligned_cols=36 Identities=19% Similarity=0.276 Sum_probs=20.6 Q ss_pred CHHHCCCCCCCCCHHHHHCCCHHHHHHHCCCEEECCEE Q ss_conf 02110024766430655418877741313677775677 Q gi|254780419|r 453 DFAHNPFTMPQGGMESLKGNDLLSIKAFQYDLVCNGFE 490 (622) Q Consensus 453 ~s~HHPFTaP~~~~~~l~~~dp~~v~a~~YDLVlNG~E 490 (622) .++.|||. |.-.+.-....=|..|.+ -|.+|+||+- T Consensus 218 atTNh~~C-Ptlkd~veds~vPegV~~-iyEIVInGl~ 253 (297) T COG2037 218 ATTNHKYC-PTLKDVVEDSEVPEGVKA-IYEIVINGLD 253 (297) T ss_pred CCCCCCCC-CCHHHCCCCCCCCCCCCC-EEEEEECCCC T ss_conf 76788768-625410566658765675-0177763768 No 310 >COG1031 Uncharacterized Fe-S oxidoreductase [Energy production and conversion] Probab=23.11 E-value=45 Score=12.86 Aligned_cols=13 Identities=23% Similarity=0.583 Sum_probs=6.9 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9999999999999 Q gi|254780419|r 251 NTMENVLRGVFEE 263 (622) Q Consensus 251 ~l~E~li~~i~~~ 263 (622) +.+|+|++-++.. T Consensus 261 ealekL~~Gir~~ 273 (560) T COG1031 261 EALEKLFRGIRNV 273 (560) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999861 No 311 >pfam06953 ArsD Arsenical resistance operon trans-acting repressor ArsD. This family consists of several bacterial arsenical resistance operon trans-acting repressor ArsD proteins. ArsD is a trans-acting repressor of the arsRDABC operon that confers resistance to arsenicals and antimonials in Escherichia coli. It possesses two-pairs of vicinal cysteine residues, Cys(12)-Cys(13) and Cys(112)-Cys(113), that potentially form separate binding sites for the metalloids that trigger dissociation of ArsD from the operon. However, as a homodimer it has four vicinal cysteine pairs. Probab=22.82 E-value=45 Score=12.83 Aligned_cols=13 Identities=8% Similarity=-0.125 Sum_probs=4.5 Q ss_pred HCCHHHHHHHHHH Q ss_conf 3252789999998 Q gi|254780419|r 378 HLNIVNTEAIRVR 390 (622) Q Consensus 378 ~l~~~~~~~l~~~ 390 (622) |.......++.+. T Consensus 55 F~~N~~V~~~L~~ 67 (119) T pfam06953 55 FAENAVVKAFLET 67 (119) T ss_pred HHHCHHHHHHHHH T ss_conf 9727999999997 No 312 >COG4874 Uncharacterized protein conserved in bacteria containing a pentein-type domain [Function unknown] Probab=22.79 E-value=35 Score=13.58 Aligned_cols=14 Identities=29% Similarity=0.458 Sum_probs=9.8 Q ss_pred ECCEE-ECCCCEECC Q ss_conf 75677-212512315 Q gi|254780419|r 486 CNGFE-IASGGIRNH 499 (622) Q Consensus 486 lNG~E-iggGSiRIH 499 (622) +.-+| +||||.|-- T Consensus 295 VpTIE~~gGGSvRCM 309 (318) T COG4874 295 VPTIETIGGGSVRCM 309 (318) T ss_pred CCCEEEECCCCEEEE T ss_conf 662354458622554 No 313 >COG1093 SUI2 Translation initiation factor 2, alpha subunit (eIF-2alpha) [Translation, ribosomal structure and biogenesis] Probab=22.68 E-value=45 Score=12.81 Aligned_cols=36 Identities=19% Similarity=0.229 Sum_probs=16.5 Q ss_pred CHHHHHHHHHCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHH Q ss_conf 524566687403784799996278766521035677999876 Q gi|254780419|r 310 EFRVFSKILSVNPGYEVWAIPAKGGGGNRAFCDRMNEWARSQ 351 (622) Q Consensus 310 ~f~vf~~~~~~~~~~~v~~i~~~~~~~srk~~D~l~e~ak~~ 351 (622) -|..|..+...+ .+++. ....+...++.+.+.|+.+ T Consensus 136 ~y~aFE~aa~~g--~e~l~----~~~~~~~~~~~l~e~a~e~ 171 (269) T COG1093 136 LYDAFEAAAKEG--GEVLD----DEGVPEEWKEVLKEIAREN 171 (269) T ss_pred HHHHHHHHHHCC--CCCCC----CCCCCHHHHHHHHHHHHHC T ss_conf 989999998548--70026----6788777899999999962 No 314 >PRK08486 single-stranded DNA-binding protein; Provisional Probab=22.51 E-value=46 Score=12.78 Aligned_cols=85 Identities=16% Similarity=0.172 Sum_probs=49.6 Q ss_pred EEEEEEEECC---CCC--CC-CEEEEEE------ECCCC-------EEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEEC Q ss_conf 8999979800---444--69-8799999------77982-------5999974881578998528988589999999975 Q gi|254780419|r 20 FVRLSGWVHR---VRP--HG-GIIFLDI------RDHYG-------ITQVVANPDSACFEIAKSVRSEWVLCIDGLVMSR 80 (622) Q Consensus 20 ~V~i~GwV~~---~R~--~g-~l~F~~l------rD~sG-------~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~r 80 (622) .|.|.|+|.+ .|- .| .++=+.| ++..| .+-|++-.. .+.-....|+.++-|.|+|.++.| T Consensus 4 kViLvGrL~~DpElr~t~~G~~v~~f~lAv~r~~~~~~Ge~~e~t~f~~v~~wgk-~AE~~~~yl~KGs~V~VeGrL~~r 82 (180) T PRK08486 4 KVILVGNLTRDVELRYLPSGSAIATIGLATSRRFKKQDGEKGEEVCFIDIRLFGR-TAEIANQYLSKGSKVLIEGRLTFE 82 (180) T ss_pred EEEEEEECCCCCEEEECCCCCEEEEEEEEECCCEECCCCCCCCCCEEEEEEECCH-HHHHHHHHCCCCCEEEEEEEEECC T ss_conf 6999885686863888799988999999866866678887802236999988167-898887521899999999998846 Q ss_pred CCCCCCCCCCCCEEEEEEEEEEEEEC Q ss_conf 86555887778659999889999502 Q gi|254780419|r 81 SAETINANIITGQIELSAQKIEILSV 106 (622) Q Consensus 81 ~~~~~n~~~~tG~~Ei~~~~~~il~~ 106 (622) .=...+ ..+-=-.||.|++++.|.. T Consensus 83 ~w~dkd-G~~ry~teVv~d~~~fl~~ 107 (180) T PRK08486 83 SWMDQN-GQKRSKHTITAESMQMLDS 107 (180) T ss_pred CCCCCC-CCEEEEEEEEEEEEEEECC T ss_conf 451579-9889999999968898047 No 315 >KOG1491 consensus Probab=22.46 E-value=46 Score=12.78 Aligned_cols=19 Identities=26% Similarity=0.461 Sum_probs=15.4 Q ss_pred CCCHHHHEEEHHHHCCCCC Q ss_conf 0570233110011037776 Q gi|254780419|r 207 ASGFDRYFQIAPCFRDEDP 225 (622) Q Consensus 207 v~G~dryfqiarcFRdEd~ 225 (622) +.-.|-.||..|||-|.|. T Consensus 112 iR~vDaifhVVr~f~d~di 130 (391) T KOG1491 112 IRHVDAIFHVVRAFEDTDI 130 (391) T ss_pred HHHCCCEEEEEEECCCCCC T ss_conf 5520311688874176530 No 316 >pfam04930 FUN14 FUN14 family. This family of short proteins are found in eukaryotes and some archaea. Although the function of these proteins is not known they may contain transmembrane helices. Probab=22.43 E-value=46 Score=12.77 Aligned_cols=26 Identities=27% Similarity=0.346 Sum_probs=18.8 Q ss_pred HHHHHHHHHHHHHCCCCCCCCEECCH Q ss_conf 98667789999860849734222468 Q gi|254780419|r 518 VENRFGGLYRAFQCGMPPHGGIAAGI 543 (622) Q Consensus 518 ~~~~Fg~lL~Al~yG~PPHgG~alGl 543 (622) ...-+..+++.++...|..+||+-|+ T Consensus 67 ~~~~~~~~~~~l~~n~~~~~gF~aGf 92 (97) T pfam04930 67 VPTLGNKLLGFLKANIPFTAGFAAGF 92 (97) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 56899999999985675036999999 No 317 >COG5127 Vacuolar H+-ATPase V1 sector, subunit C [Energy production and conversion] Probab=22.25 E-value=34 Score=13.72 Aligned_cols=14 Identities=21% Similarity=0.413 Sum_probs=10.6 Q ss_pred HHHCCCCCCCCEEC Q ss_conf 98608497342224 Q gi|254780419|r 528 AFQCGMPPHGGIAA 541 (622) Q Consensus 528 Al~yG~PPHgG~al 541 (622) .|+||.|||--+-+ T Consensus 306 vlRYGLPp~f~~~I 319 (383) T COG5127 306 VLRYGLPPEFMFFI 319 (383) T ss_pred HHHHCCCCCEEEEE T ss_conf 99807994206898 No 318 >TIGR00414 serS seryl-tRNA synthetase; InterPro: IPR002317 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Seryl-tRNA synthetase (6.1.1.11 from EC) exists as monomer and belongs to class IIa .; GO: 0000166 nucleotide binding, 0004828 serine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006434 seryl-tRNA aminoacylation, 0005737 cytoplasm. Probab=22.11 E-value=47 Score=12.73 Aligned_cols=131 Identities=18% Similarity=0.350 Sum_probs=68.6 Q ss_pred HHHHHHHHHHHHHHH-HCCCEEEEECCCCCCCCC-CCCC-CCCCCC------CCCHHH---HHHCCCCHH----HH-HHH Q ss_conf 245899999998752-036514620332112464-5664-543222------221034---431123854----56-443 Q gi|254780419|r 142 IVKRTRIINSMRCRM-IAENFIECSTPILTASSP-EGAR-DFLVPS------RVNQGS---FYALPQAPQ----QY-KQL 204 (622) Q Consensus 142 l~~Rs~i~~~~R~~l-~~~gF~EveTP~L~ksTp-eGAr-dFlVPs------R~~~g~---fYALpQSPQ----ly-KQl 204 (622) .++...++..+-++. .++||.++.+|.+..... .|.. .|-.|. .+.... +|-.|.+-. +| ..+ T Consensus 202 ~~~~~al~~~~l~~~~~~~g~~~~~~p~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~l~~t~e~~~~~~~~~~~ 281 (460) T TIGR00414 202 AKLERALINFMLDLLREKNGYSELYPPYLVNEELLDGTGFNPQLPKFEEDLFKLEDTDDEKLYLIPTAEVPLTNLHRDEI 281 (460) T ss_pred HHHHHHHHHHHHHHHHHHCCCCEECCCHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHHH T ss_conf 78999999999986443226401014035423443102456556530234432126777504652010001222220001 Q ss_pred HHC--CCH-HHHEEEHHHHCCCCCCCC-------CCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 310--570-233110011037776655-------3800003565410268999999999999999999980872345773 Q gi|254780419|r 205 LMA--SGF-DRYFQIAPCFRDEDPRSD-------RLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGNTVSNNF 274 (622) Q Consensus 205 Lmv--~G~-dryfqiarcFRdEd~R~d-------Rqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF 274 (622) +-. +-+ -+|--..+|||.|-..+. |.- +|..+++ ..|+..++-....|.++...-+-+. .+..|+ T Consensus 282 ~~~~~~~lp~~~~~~~~cfr~eag~~g~~~~g~~r~h-~f~k~e~-~~~~~p~~~~~~~~~~~~~~~~~~~---~l~~p~ 356 (460) T TIGR00414 282 LDLPSEELPIKYTGYSPCFRSEAGSYGKDTKGLIRVH-QFNKVEL-VKFCKPEESWEELEELLSDAEKILQ---ELELPY 356 (460) T ss_pred CCCCCCCCCEEHCCCCHHHHHHCCCCCCCCHHHHHHH-HCCCEEE-EEEECCHHHHHHHHHHHHHHHHHHH---HHCCCH T ss_conf 0265210340000022023332134554100012111-0031024-5663540148999999878999999---743751 Q ss_pred CCC Q ss_conf 212 Q gi|254780419|r 275 LRI 277 (622) Q Consensus 275 ~rm 277 (622) ..+ T Consensus 357 ~~~ 359 (460) T TIGR00414 357 RVL 359 (460) T ss_pred HHH T ss_conf 110 No 319 >PRK10838 spr putative outer membrane lipoprotein; Provisional Probab=22.10 E-value=43 Score=12.95 Aligned_cols=14 Identities=36% Similarity=0.695 Sum_probs=10.6 Q ss_pred HCCCCCCCEEEEEC Q ss_conf 18875572699808 Q gi|254780419|r 391 LDMQDGDSCFFIAG 404 (622) Q Consensus 391 ~~~~~GD~i~f~a~ 404 (622) -++++||++||-.+ T Consensus 127 ~~l~pGDLVFF~~~ 140 (188) T PRK10838 127 SNLRTGDLVLFRAG 140 (188) T ss_pred HHCCCCCEEEECCC T ss_conf 87899888996699 No 320 >TIGR02289 M3_not_pepF oligoendopeptidase, M3 family; InterPro: IPR011976 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This family consists of probable oligoendopeptidases belonging to MEROPS peptidase family M3, subfamily M3B (clan MA(E)). The family is related to lactococcal PepF and group B streptococcal PepB (IPR004438 from INTERPRO) but in a distinct clade with considerable sequence differences not only to IPR004438 from INTERPRO but also to IPR011977 from INTERPRO. Likely substrates are small peptides and not whole proteins, as with PepF, but members are not characterised and the activity profile may differ. Several bacteria have both a member of this family and a member of the PepF family.. Probab=21.96 E-value=47 Score=12.71 Aligned_cols=39 Identities=23% Similarity=0.384 Sum_probs=28.3 Q ss_pred CHH--HHEEEHHHHCCCCCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 702--33110011037776655380000356541026899999999999999999 Q gi|254780419|r 209 GFD--RYFQIAPCFRDEDPRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVF 261 (622) Q Consensus 209 G~d--ryfqiarcFRdEd~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~ 261 (622) |++ |||--+.||+|-+.= .| +.+.+++|+-.+.|.+.+= T Consensus 249 gi~~l~pwD~s~~~~~gn~L---~P-----------~~~~~~~~~~~~~my~~l~ 289 (553) T TIGR02289 249 GIEKLRPWDESAVFLDGNVL---KP-----------FGNVDFLLEKAEKMYKELS 289 (553) T ss_pred CCHHCCCCCCCCCCCCCCCC---CC-----------CCCHHHHHHHHHHHHHHHC T ss_conf 61111420157678867866---88-----------6677899999999998408 No 321 >TIGR02151 IPP_isom_2 isopentenyl-diphosphate delta-isomerase, type 2; InterPro: IPR011179 This group represents the bacterial and archaeal isopentenyl-diphosphate delta-isomerase (IPP isomerase). IPP isomerase catalyses the interconversion of isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP), and is a key enzyme in the biosynthesis of isoprenoids via the mevalonate pathway. The bacterial and archaeal IPP isomerase (type 2 enzyme) differs from that found in eukaryotes (type 1 enzyme), and requires NADPH, magnesium, and FMN for activity , .; GO: 0004452 isopentenyl-diphosphate delta-isomerase activity, 0010181 FMN binding, 0008299 isoprenoid biosynthetic process, 0005737 cytoplasm. Probab=21.94 E-value=36 Score=13.49 Aligned_cols=56 Identities=29% Similarity=0.378 Sum_probs=33.2 Q ss_pred EEECCCCEECCCHHHHHH-------------HHHHCC-CCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHHHHHHHCCCC Q ss_conf 772125123156999999-------------999859-998999866778999986084973422246899999983739 Q gi|254780419|r 489 FEIASGGIRNHMSEVMLQ-------------AFSNVG-ISKEVVENRFGGLYRAFQCGMPPHGGIAAGIDRIVMLLLGAK 554 (622) Q Consensus 489 ~EiggGSiRIHd~~iQ~~-------------~f~~lg-l~~ee~~~~Fg~lL~Al~yG~PPHgG~alGlDRlvmll~g~~ 554 (622) .=|+||++| |=-|+=+- .++.+- -.+|...+.+.-+++=| |.+|.|+|+. T Consensus 272 ~~iASGG~r-~GlD~AKAlALGA~~~G~A~~~L~~~~~~g~e~~~~~~~~~~~eL---------------k~~mfl~G~~ 335 (349) T TIGR02151 272 PLIASGGLR-TGLDVAKALALGADAVGMARPLLKAALDEGEEAVIEEIELIIEEL---------------KVAMFLTGAK 335 (349) T ss_pred CEEEECCCC-CHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH---------------HHHHHHHCCC T ss_conf 368846777-888999999962118888899999885269889999999999999---------------9999871798 Q ss_pred CCEEEE Q ss_conf 840321 Q gi|254780419|r 555 NVREVS 560 (622) Q Consensus 555 sIRdVI 560 (622) ||.|.= T Consensus 336 ~i~EL~ 341 (349) T TIGR02151 336 NIKELK 341 (349) T ss_pred CHHHHC T ss_conf 879861 No 322 >pfam11736 DUF3299 Protein of unknown function (DUF3299). This is a family of bacterial proteins of unknown function. Probab=21.83 E-value=47 Score=12.69 Aligned_cols=88 Identities=17% Similarity=0.211 Sum_probs=43.0 Q ss_pred HHCCHHHCCCEEEEEEEECCCCC-CCCEEEEEEECCCCEE---------EEEECCCCHHHHHHHCCCCCCEEEEEEEEEE Q ss_conf 23697998988999979800444-6987999997798259---------9997488157899852898858999999997 Q gi|254780419|r 10 GDLGVSDVGSFVRLSGWVHRVRP-HGGIIFLDIRDHYGIT---------QVVANPDSACFEIAKSVRSEWVLCIDGLVMS 79 (622) Q Consensus 10 g~l~~~~~g~~V~i~GwV~~~R~-~g~l~F~~lrD~sG~~---------Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~ 79 (622) ..++.+..||.|+|.|+|.-.-. .+++.-+-|-=..|.+ |+|.=.-....+. . ..-+.|-|+|++.. T Consensus 45 ~~v~~~ldgk~vripGfvVPle~~~~~v~eFlLVPy~GACiHvPPPPpNQiV~V~~~~g~~~-~--~l~~pvwv~G~l~~ 121 (144) T pfam11736 45 APVVPELDGKKVRIPGFVVPLEGDEGRVTEFLLVPYFGACIHVPPPPPNQIVHVKYPKGIPL-D--DLYDPVWVEGTLRT 121 (144) T ss_pred CCCCHHHCCCEEEECCEEEEECCCCCCEEEEEECCCCCCEECCCCCCCCCEEEEECCCCCCC-H--HCCEEEEEEEEEEE T ss_conf 77517778988870457984346888367999748777465389469881899976998221-0--03110999999997 Q ss_pred CCCCCCCCCCCCCEEEEEEEEEEE Q ss_conf 586555887778659999889999 Q gi|254780419|r 80 RSAETINANIITGQIELSAQKIEI 103 (622) Q Consensus 80 r~~~~~n~~~~tG~~Ei~~~~~~i 103 (622) ... ...+.+-.+.+.+++++. T Consensus 122 ~~~---~~~~~~~~Y~~~a~~i~~ 142 (144) T pfam11736 122 ESS---SSELAEAGYTMEAEKIEP 142 (144) T ss_pred EEC---CCCCEEEEEEEEEEEEEE T ss_conf 640---330055567997307884 No 323 >KOG0352 consensus Probab=21.77 E-value=30 Score=14.08 Aligned_cols=11 Identities=64% Similarity=0.981 Sum_probs=9.6 Q ss_pred CCCCCCCCCCC Q ss_conf 67776444764 Q gi|254780419|r 287 GTDKPDLRNPI 297 (622) Q Consensus 287 GsDKPDLR~~l 297 (622) |.||||.||-. T Consensus 318 GVDKp~VRFVi 328 (641) T KOG0352 318 GVDKPDVRFVI 328 (641) T ss_pred CCCCCCEEEEE T ss_conf 66877615999 No 324 >COG4087 Soluble P-type ATPase [General function prediction only] Probab=21.68 E-value=47 Score=12.67 Aligned_cols=28 Identities=11% Similarity=0.203 Sum_probs=21.1 Q ss_pred CCCEEEEEEE--ECCCCHHHHHHHHHHHHH Q ss_conf 0000356541--026899999999999999 Q gi|254780419|r 231 PGEFYQLDME--MSFVEQDDILNTMENVLR 258 (622) Q Consensus 231 p~EFTqlD~E--msF~~~edvm~l~E~li~ 258 (622) ..||+.++.| |++++.+.=++++++|=+ T Consensus 62 lae~~gi~~~rv~a~a~~e~K~~ii~eLkk 91 (152) T COG4087 62 LAEFVGIPVERVFAGADPEMKAKIIRELKK 91 (152) T ss_pred HHHHCCCCEEEEECCCCHHHHHHHHHHHCC T ss_conf 998739960511034678789999998447 No 325 >COG4944 Uncharacterized protein conserved in bacteria [Function unknown] Probab=21.61 E-value=33 Score=13.85 Aligned_cols=22 Identities=36% Similarity=0.637 Sum_probs=16.2 Q ss_pred HHHHHHHHHCCCCCC---CCEECCH Q ss_conf 778999986084973---4222468 Q gi|254780419|r 522 FGGLYRAFQCGMPPH---GGIAAGI 543 (622) Q Consensus 522 Fg~lL~Al~yG~PPH---gG~alGl 543 (622) +--+|=|++||+|-| +|++-|+ T Consensus 142 lal~l~aLR~~AparP~lAGfaaGL 166 (213) T COG4944 142 LALLLLALRYGAPARPTLAGFAAGL 166 (213) T ss_pred HHHHHHHHHCCCCCCCHHHHHHHHH T ss_conf 9999999954899981588899988 No 326 >PRK05813 single-stranded DNA-binding protein; Provisional Probab=21.46 E-value=48 Score=12.64 Aligned_cols=83 Identities=17% Similarity=0.183 Sum_probs=41.1 Q ss_pred HCCCEEEEEEEECCCCC-----CC-CEEEEEE-----ECCCCEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEE--CCC Q ss_conf 98988999979800444-----69-8799999-----77982599997488157899852898858999999997--586 Q gi|254780419|r 16 DVGSFVRLSGWVHRVRP-----HG-GIIFLDI-----RDHYGITQVVANPDSACFEIAKSVRSEWVLCIDGLVMS--RSA 82 (622) Q Consensus 16 ~~g~~V~i~GwV~~~R~-----~g-~l~F~~l-----rD~sG~~Q~v~~~~~~~~~~~~~l~~esvv~v~G~V~~--r~~ 82 (622) ..+..|+|.|-|.+-=. .| +.--+.| -|..-.+-+++.... . ....+..+..|.|.|.++. |.+ T Consensus 6 ~enN~V~~~G~i~s~~~fSHE~~GEgFY~~~l~V~RLS~~~D~ip~~vSERL--i-~~~~~~~g~~i~v~GQ~RSYN~~~ 82 (217) T PRK05813 6 MLNNKVYLEGKVVSELEFSHEMYGEGFYTFKLEVPRLSDSKDILNITVSERL--L-ADMDLKVGTEIIVEGQLRSYNKFI 82 (217) T ss_pred CCCCEEEEEEEECCCCEEEEEEECCEEEEEEEEEECCCCCCCEEEEEEEHHH--H-HCCCCCCCCEEEEEEEEEECCCCC T ss_conf 0117899999983573573466334258999985114687433778977434--2-001135797899985244033667 Q ss_pred CCCCCCCCCCEEEEEEEEEEEEE Q ss_conf 55588777865999988999950 Q gi|254780419|r 83 ETINANIITGQIELSAQKIEILS 105 (622) Q Consensus 83 ~~~n~~~~tG~~Ei~~~~~~il~ 105 (622) +..|.- =+.|-|.+++.+. T Consensus 83 ~~~NrL----iLtvF~Rei~~~~ 101 (217) T PRK05813 83 DGSNRL----ILTVFARNIEYCD 101 (217) T ss_pred CCCCEE----EEEEEEEEEEECC T ss_conf 877449----9999876116214 No 327 >cd00340 GSH_Peroxidase Glutathione (GSH) peroxidase family; tetrameric selenoenzymes that catalyze the reduction of a variety of hydroperoxides including lipid peroxidases, using GSH as a specific electron donor substrate. GSH peroxidase contains one selenocysteine residue per subunit, which is involved in catalysis. Different isoenzymes are known in mammals,which are involved in protection against reactive oxygen species, redox regulation of many metabolic processes, peroxinitrite scavenging, and modulation of inflammatory processes. Probab=21.40 E-value=48 Score=12.63 Aligned_cols=40 Identities=8% Similarity=0.106 Sum_probs=24.3 Q ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECC Q ss_conf 9999627876652103567799987642024623552047 Q gi|254780419|r 326 VWAIPAKGGGGNRAFCDRMNEWARSQGQPGLAYIFWRLDE 365 (622) Q Consensus 326 v~~i~~~~~~~srk~~D~l~e~ak~~g~kgL~~i~~~~~~ 365 (622) ++.++.........++..|.++-.+++.+||.-+-+..+. T Consensus 25 vLiVNvAS~Cg~t~qy~~L~~L~~ky~~~gl~Il~fPcnq 64 (152) T cd00340 25 LLIVNVASKCGFTPQYEGLEALYEKYKDRGLVVLGFPCNQ 64 (152) T ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHCCCCEEEEEEEHHH T ss_conf 9999667788887248899999998412884999636688 No 328 >pfam07209 DUF1415 Protein of unknown function (DUF1415). This family consists of several hypothetical bacterial proteins of around 180 residues in length. The function of this family is unknown. Probab=21.38 E-value=48 Score=12.63 Aligned_cols=56 Identities=29% Similarity=0.547 Sum_probs=33.5 Q ss_pred HHHCCCEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCCHHHHEEEHH-----HHCCC Q ss_conf 5203651462033211246456645432222210344311238545644331057023311001-----10377 Q gi|254780419|r 155 RMIAENFIECSTPILTASSPEGARDFLVPSRVNQGSFYALPQAPQQYKQLLMASGFDRYFQIAP-----CFRDE 223 (622) Q Consensus 155 ~l~~~gF~EveTP~L~ksTpeGArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G~dryfqiar-----cFRdE 223 (622) .|....--|++|-.|. .|+|-.||. ....|-.+. -++|-..|++-+||||. ||-.| T Consensus 57 ~L~~~~~~eieTTLli--~p~~l~dF~-----dy~dfl~~a------~~ll~~~~l~G~~QlAsFHP~y~F~G~ 117 (174) T pfam07209 57 LLDQDDAEEIETTLLI--LPNGLEDFL-----DYNDFLDIA------DALLEELGLEGVYQLASFHPDYCFAGT 117 (174) T ss_pred HHHCCCCCCCCEEEEE--CCCHHHHHH-----HHHHHHHHH------HHHHHHCCCCEEEEEEECCCCCEECCC T ss_conf 9973983211237886--312351398-----999999999------999997488707998844866533899 No 329 >TIGR02995 ectoine_ehuB ectoine/hydroxyectoine ABC transporter solute-binding protein; InterPro: IPR014337 Members of this entry are the extracellular solute-binding proteins of ABC transporters and they closely resemble amino acid transporters. The member from Sinorhizobium meliloti (Rhizobium meliloti) is involved in ectoine uptake, both for osmoprotection and for catabolism. All other members of the seed alignment are found associated with ectoine catabolic genes.. Probab=21.35 E-value=27 Score=14.42 Aligned_cols=11 Identities=9% Similarity=0.292 Sum_probs=6.9 Q ss_pred HHHHCCCEEEE Q ss_conf 75203651462 Q gi|254780419|r 154 CRMIAENFIEC 164 (622) Q Consensus 154 ~~l~~~gF~Ev 164 (622) +-|.+|||+-| T Consensus 28 eelKeQG~~Ri 38 (285) T TIGR02995 28 EELKEQGFVRI 38 (285) T ss_pred HHHHCCCEEEE T ss_conf 75523530477 No 330 >smart00090 RIO RIO-like kinase. Probab=21.33 E-value=47 Score=12.73 Aligned_cols=28 Identities=18% Similarity=0.154 Sum_probs=15.5 Q ss_pred EEEEECCCEEECCCCCCCCHHHCCCCCCCCCHHHHHCCCHHHH Q ss_conf 8896077053545547000211002476643065541887774 Q gi|254780419|r 435 CWIMDFPFYEWNEEEKKIDFAHNPFTMPQGGMESLKGNDLLSI 477 (622) Q Consensus 435 lWV~DFPLFe~dee~~r~~s~HHPFTaP~~~~~~l~~~dp~~v 477 (622) .||+||| ++....|| ....++.-|...| T Consensus 187 ~~iID~~---------QaV~~~hp------~A~~~L~RDi~nI 214 (237) T smart00090 187 VVIIDVS---------QSVELDHP------MALEFLERDIRNI 214 (237) T ss_pred EEEEECC---------CCCCCCCC------CHHHHHHHHHHHH T ss_conf 7999467---------52368998------8899999999999 No 331 >TIGR01802 CM_pl-yst chorismate mutase; InterPro: IPR008238 Chorismate mutase (CM; 5.4.99.5 from EC) catalyses the reaction at the branch point of the biosynthetic pathway leading to the three aromatic amino acids, phenylalanine, tryptophan and tyrosine (chorismic acid is the last common intermediate, and CM leads to the L-phenylalanine/L-tyrosine branch). It is part of the shikimate pathway, which is present only in bacteria, fungi and plants. Members of this family, which are restricted to plants and fungi, contain a chorismate mutase domain of the AroQ class (eukaryotic type) and have an all-helical structure. The monomer consists of a catalytic and a regulatory domain covalently linked by a loop, which functions as a molecular hinge. They are monofunctional, allosteric enzymes and are subject to allosteric inhibition by tyrosine and activation by tryptophan. The three types of CM are AroQ class, prokaryotic type (e.g., IPR008239 from INTERPRO and others); AroQ class, eukaryotic type; and AroH class (IPR008243 from INTERPRO). They fall into two structural folds (AroQ class and AroH class) which are completely unrelated . The two types of the AroQ structural class (the Escherichia coli CM dimer and the yeast CM monomer) can be structurally superimposed, and the topology of the four-helix bundle forming the active site is conserved . For additional information please see , , , , , , , .; GO: 0004106 chorismate mutase activity, 0009073 aromatic amino acid family biosynthetic process. Probab=21.15 E-value=40 Score=13.22 Aligned_cols=16 Identities=6% Similarity=0.177 Sum_probs=8.3 Q ss_pred HCCCHHHHEEEHHHHC Q ss_conf 1057023311001103 Q gi|254780419|r 206 MASGFDRYFQIAPCFR 221 (622) Q Consensus 206 mv~G~dryfqiarcFR 221 (622) |+.+-|+.+--+|=|. T Consensus 55 ~L~e~E~~ha~vRRf~ 70 (273) T TIGR01802 55 LLKETEKLHAKVRRFK 70 (273) T ss_pred HHHHHHHHHHHCCCCC T ss_conf 8855553343100368 No 332 >pfam06218 NPR2 Nitrogen permease regulator 2. This family of regulators are involved in post-translational control of nitrogen permease. Probab=21.14 E-value=42 Score=13.04 Aligned_cols=26 Identities=12% Similarity=0.221 Sum_probs=16.5 Q ss_pred ECCHHHHHHHHCCCCCCEEEECCCCC Q ss_conf 24689999998373984032137899 Q gi|254780419|r 540 AAGIDRIVMLLLGAKNVREVSLFPMS 565 (622) Q Consensus 540 alGlDRlvmll~g~~sIRdVIaFPKt 565 (622) ++-.-|++..=.=..=||-|=-+|=. T Consensus 324 ~IDvRRfI~FGvikg~i~RvhkYPi~ 349 (400) T pfam06218 324 GIDVRRFITFGVIKGLIYRVHKYPIL 349 (400) T ss_pred CCCHHHHHHHHHHHHHHHHEEECCCC T ss_conf 87688876786776453020344523 No 333 >KOG2803 consensus Probab=21.11 E-value=44 Score=12.93 Aligned_cols=75 Identities=19% Similarity=0.247 Sum_probs=36.8 Q ss_pred CCHHHHHHCCCCHHHHHHHHHCCCHHHHEEEHHHHCCCCCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 21034431123854564433105702331100110377766553800003565410268999999999999999999980 Q gi|254780419|r 186 VNQGSFYALPQAPQQYKQLLMASGFDRYFQIAPCFRDEDPRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFS 265 (622) Q Consensus 186 ~~~g~fYALpQSPQlyKQlLmv~G~dryfqiarcFRdEd~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~ 265 (622) .|.|..=||-|+-|+=++| +|| .--||....+.-|+=||+ |+=+.+++.. +-+- T Consensus 20 ~HyGHanaLrQAkalGdkL-ivG----------VHsDeeI~~nKGpPV~t~----------eERy~~v~~i-----kWVD 73 (358) T KOG2803 20 VHYGHANALRQAKALGDKL-IVG----------VHSDEEITLNKGPPVFTD----------EERYEMVKAI-----KWVD 73 (358) T ss_pred HHHHHHHHHHHHHHHCCEE-EEE----------ECCHHHHHHCCCCCCCCH----------HHHHHHHHHC-----CHHH T ss_conf 2121258778999848848-999----------526698872589986668----------9999998602-----0166 Q ss_pred CCCCCCCCCCCCCHHHHHHHCCCC Q ss_conf 872345773212189887513677 Q gi|254780419|r 266 NGNTVSNNFLRICYDDAIRRYGTD 289 (622) Q Consensus 266 ~~~~i~~pF~rmtY~eAm~~YGsD 289 (622) + .-...| -.|=.+-|++||+| T Consensus 74 E-VV~~AP--yvtt~~~md~y~cd 94 (358) T KOG2803 74 E-VVEGAP--YVTTLEWMDKYGCD 94 (358) T ss_pred H-HHCCCC--EECCHHHHHHHCCE T ss_conf 6-525897--14149899872972 No 334 >TIGR00987 himA integration host factor, alpha subunit; InterPro: IPR005684 Integration host factor (IHF) is a heterodimeric protein consisting of alpha and beta (IPR005685 from INTERPRO) subunits. IHF binds the minor groove of DNA in a sequence-specific manner to induce a large bend. This bending stabilises distinct DNA conformations that are required during several bacterial processes, such as recombination, transposition, replication and transcription .; GO: 0003677 DNA binding, 0005694 chromosome. Probab=21.05 E-value=49 Score=12.58 Aligned_cols=16 Identities=13% Similarity=0.167 Sum_probs=5.6 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9999999999999999 Q gi|254780419|r 248 DILNTMENVLRGVFEE 263 (622) Q Consensus 248 dvm~l~E~li~~i~~~ 263 (622) |--.|+|-++.++-.. T Consensus 21 ea~~LVE~~~~~~~~A 36 (96) T TIGR00987 21 EAKELVELFLEEIRDA 36 (96) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 7789999999999999 No 335 >COG3060 MetJ Transcriptional regulator of met regulon [Transcription / Amino acid transport and metabolism] Probab=21.04 E-value=49 Score=12.58 Aligned_cols=41 Identities=24% Similarity=0.245 Sum_probs=26.1 Q ss_pred HHHCCCCC-CCCCHHHHHCCCHHHHHHHCCCEEECCEEECCCCEECCCHHHHHHHHHHCCCCHHH Q ss_conf 21100247-66430655418877741313677775677212512315699999999985999899 Q gi|254780419|r 454 FAHNPFTM-PQGGMESLKGNDLLSIKAFQYDLVCNGFEIASGGIRNHMSEVMLQAFSNVGISKEV 517 (622) Q Consensus 454 s~HHPFTa-P~~~~~~l~~~dp~~v~a~~YDLVlNG~EiggGSiRIHd~~iQ~~~f~~lgl~~ee 517 (622) |--|.||- |-+.|.++....+. || |+.-+.++..+|+++|. T Consensus 61 aflhaftgqplptd~dl~ker~d--------------ei---------pe~ak~~mrelgidpe~ 102 (105) T COG3060 61 AFLHAFTGQPLPTDADLRKERSD--------------EI---------PEAAKEIMRELGIDPET 102 (105) T ss_pred HHHHHHCCCCCCCCHHHHHHCCC--------------CC---------HHHHHHHHHHHCCCCCC T ss_conf 99998739989973788773654--------------04---------69999999994898001 No 336 >COG3122 Uncharacterized protein conserved in bacteria [Function unknown] Probab=21.02 E-value=48 Score=12.62 Aligned_cols=56 Identities=29% Similarity=0.422 Sum_probs=34.1 Q ss_pred HHHHHHHHCCCEEEEECCCCCCCCCCC-CCCCCCCCCCCHHHHHHCCCCHHHHHHHHHCCC Q ss_conf 999875203651462033211246456-645432222210344311238545644331057 Q gi|254780419|r 150 NSMRCRMIAENFIECSTPILTASSPEG-ARDFLVPSRVNQGSFYALPQAPQQYKQLLMASG 209 (622) Q Consensus 150 ~~~R~~l~~~gF~EveTP~L~ksTpeG-ArdFlVPsR~~~g~fYALpQSPQlyKQlLmv~G 209 (622) ..+|+-|...||--.+.|.- +|||+| +|. ||..+... .-+|.--||+.|-=|.-|| T Consensus 2 ~~~~~~lv~~g~k~~~~~~~-~s~~g~~~r~--~~a~l~~~-~~~~~l~~Qllkaglv~Sk 58 (215) T COG3122 2 NSFRDQLVGLGFKPSPKPES-KSTPGGKGRA--VPAKLGKS-MAKLTLQEQLLKAGLVTSK 58 (215) T ss_pred CCHHHHHHHCCCCCCCCCCC-CCCCCCCCCC--CCHHHHHH-HHHCCCCHHHHHCCCCCCC T ss_conf 51889998446889999987-6788854341--63565356-8760063546543665431 No 337 >pfam08415 NRPS Nonribosomal peptide synthase. This domain is found in bacterial nonribosomal peptide synthetases (NRPS). NRPS are megaenzymes organized as iterative modules, one for each amino acid to be built into the peptide product. NRPS modules are involved in epothilone biosynthesis (EpoB), myxothiazol biosynthesis (MtaC and MtaD), and other functions. The NRPS domain tends to be found together with the condensation domain (pfam00668) and the phosphopantetheine binding domain (pfam00550). Probab=20.94 E-value=49 Score=12.57 Aligned_cols=34 Identities=24% Similarity=0.284 Sum_probs=15.9 Q ss_pred EECCCCCCCCCCCC-CCCCCCCCCCHHHH-HHCCCCHH Q ss_conf 20332112464566-45432222210344-31123854 Q gi|254780419|r 164 CSTPILTASSPEGA-RDFLVPSRVNQGSF-YALPQAPQ 199 (622) Q Consensus 164 veTP~L~ksTpeGA-rdFlVPsR~~~g~f-YALpQSPQ 199 (622) .-.|+.+.|+-+.+ ++. +....-|+. |++.|.|| T Consensus 22 ~~~PVVFTS~L~~~~~~~--~~~~~~g~~~~~isQTPQ 57 (57) T pfam08415 22 ALMPVVFTSALGLGGRGG--SAADALGEPVYGISQTPQ 57 (57) T ss_pred CCCCEEEECCCCCCCCCC--CHHHHCCCCCEEECCCCC T ss_conf 756788873468876661--046652872333537999 No 338 >PRK11865 pyruvate ferredoxin oxidoreductase subunit beta; Provisional Probab=20.77 E-value=39 Score=13.26 Aligned_cols=20 Identities=20% Similarity=0.607 Sum_probs=14.7 Q ss_pred CCCEEECCCCCCCCHHHCCCCC Q ss_conf 7705354554700021100247 Q gi|254780419|r 440 FPFYEWNEEEKKIDFAHNPFTM 461 (622) Q Consensus 440 FPLFe~dee~~r~~s~HHPFTa 461 (622) |||||++ .+++..++.||.. T Consensus 234 wpLye~~--~g~~~~~~~p~~~ 253 (298) T PRK11865 234 WPLFEIE--NGKFNISYEPLHC 253 (298) T ss_pred CEEEEEE--CCCCCCCCCCCCC T ss_conf 0489950--8823556788664 No 339 >PRK11107 hybrid sensory histidine kinase BarA; Provisional Probab=20.73 E-value=45 Score=12.84 Aligned_cols=18 Identities=6% Similarity=0.206 Sum_probs=8.2 Q ss_pred CHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999 Q gi|254780419|r 245 EQDDILNTMENVLRGVFE 262 (622) Q Consensus 245 ~~edvm~l~E~li~~i~~ 262 (622) |..-+..++-+|+.+.+| T Consensus 405 D~~Rl~QIl~NLl~NAiK 422 (920) T PRK11107 405 DPLRLQQIITNLVGNAIK 422 (920) T ss_pred CHHHHHHHHHHHHHHHHH T ss_conf 899999999999999997 No 340 >COG0478 RIO-like serine/threonine protein kinase fused to N-terminal HTH domain [Signal transduction mechanisms] Probab=20.71 E-value=50 Score=12.54 Aligned_cols=30 Identities=23% Similarity=0.293 Sum_probs=17.9 Q ss_pred EEEEEECCCEEECCCCCCCCHHHCCCCCCCCCHHHHHCCCHHHHH Q ss_conf 288960770535455470002110024766430655418877741 Q gi|254780419|r 434 FCWIMDFPFYEWNEEEKKIDFAHNPFTMPQGGMESLKGNDLLSIK 478 (622) Q Consensus 434 flWV~DFPLFe~dee~~r~~s~HHPFTaP~~~~~~l~~~dp~~v~ 478 (622) =.||+||| +|.+.-|| +...++.-|...|. T Consensus 240 ~~~vIDwP---------Q~v~~~hp------~a~e~l~RDv~nii 269 (304) T COG0478 240 DIVVIDWP---------QAVPISHP------DAEELLERDVENII 269 (304) T ss_pred CEEEEECC---------CCCCCCCC------CHHHHHHHHHHHHH T ss_conf 88998676---------66457897------88999996899999 No 341 >cd01900 YchF YchF subfamily. YchF is a member of the Obg family, which includes four other subfamilies of GTPases: Obg, DRG, Ygr210, and NOG1. Obg is an essential gene that is involved in DNA replication in C. crescentus and Streptomyces griseus and is associated with the ribosome. Several members of the family, including YchF, possess the TGS domain related to the RNA-binding proteins. Experimental data and genomic analysis suggest that YchF may be part of a nucleoprotein complex and may function as a GTP-dependent translational factor. Probab=20.28 E-value=19 Score=15.55 Aligned_cols=19 Identities=26% Similarity=0.550 Sum_probs=15.2 Q ss_pred CCCHHHHEEEHHHHCCCCC Q ss_conf 0570233110011037776 Q gi|254780419|r 207 ASGFDRYFQIAPCFRDEDP 225 (622) Q Consensus 207 v~G~dryfqiarcFRdEd~ 225 (622) +--.|-.+++.|||.|++. T Consensus 90 Ir~vDai~hVVR~F~d~~I 108 (274) T cd01900 90 IREVDAIAHVVRCFEDDDI 108 (274) T ss_pred HHHCCEEEEEHHHCCCCCC T ss_conf 8734634653211157730 No 342 >pfam10343 DUF2419 Protein of unknown function (DUF2419). This is a family of conserved proteins found from plants to humans. The function is not known. A few members are annotated as being cobyrinic acid a,c-diamide synthetase but this could not be confirmed. Probab=20.23 E-value=23 Score=14.94 Aligned_cols=39 Identities=23% Similarity=0.438 Sum_probs=20.3 Q ss_pred HHHEEEHHHHCCCCCCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 2331100110377766553800003565410268999999999999999999980 Q gi|254780419|r 211 DRYFQIAPCFRDEDPRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFS 265 (622) Q Consensus 211 dryfqiarcFRdEd~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~ 265 (622) +++=+-=||||||-.-..|+- +|. +...=++..+|..+. T Consensus 130 ~ll~~~Fp~FrD~s~y~G~~V----------~fy------KRAQIlvaDlw~~f~ 168 (282) T pfam10343 130 NLVVENFPCFRDEATYKGRRV----------RFY------KRAQILVADLWACFD 168 (282) T ss_pred HHHHHHCCCCCCCCEECCCEE----------EEE------HHHHHHHHHHHHHHC T ss_conf 999976898776244389888----------765------899999999999838 No 343 >TIGR00175 mito_nad_idh isocitrate dehydrogenase, NAD-dependent; InterPro: IPR004434 Several NAD- or NADP-dependent dehydrogenases, including 3-isopropylmalate dehydrogenase, tartrate dehydrogenase, and the multimeric forms of isocitrate dehydrogenase, share a nucleotide binding domain unrelated to that of lactate dehydrogenase and its homologs. These enzymes dehydrogenate their substates at a H-C-OH site adjacent to a H-C-COOH site; the latter carbon, now adjacent to a carbonyl group, readily decarboxylates. Mitochondrial NAD-dependent isocitrate dehydrogenases (IDH) resemble prokaryotic NADP-dependent IDH and 3-isopropylmalate dehydrogenase (an NAD-dependent enzyme) more closely than they resemble eukaryotic NADP-dependent IDH. The mitochondrial NAD-dependent isocitrate dehydrogenase is believed to be an alpha(2)-beta-gamma heterotetramer. All subunits are homologous and belog to this group. The NADP-dependent IDH of Thermus thermophilus strain HB8 resembles these NAD-dependent IDH, except for the residues involved in cofactor specificity, much more closely than it resembles other prokaryotic NADP-dependent IDH, including that of Thermus aquaticus. ; GO: 0004449 isocitrate dehydrogenase (NAD+) activity, 0006099 tricarboxylic acid cycle, 0005739 mitochondrion. Probab=20.20 E-value=51 Score=12.46 Aligned_cols=24 Identities=17% Similarity=0.338 Sum_probs=20.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 899999999999999999998087 Q gi|254780419|r 244 VEQDDILNTMENVLRGVFEEFSNG 267 (622) Q Consensus 244 ~~~edvm~l~E~li~~i~~~v~~~ 267 (622) +..-.||+|-.+|+..++++|... T Consensus 177 VHKANIMKL~DGLFl~~~~eVa~~ 200 (348) T TIGR00175 177 VHKANIMKLADGLFLEVCREVAKK 200 (348) T ss_pred EECCCHHHHHHHHHHHHHHHHHHH T ss_conf 825101111004789988875553 No 344 >pfam07711 RabGGT_insert Rab geranylgeranyl transferase alpha-subunit, insert domain. Rab geranylgeranyl transferase (RabGGT) catalyses the addition of two geranylgeranyl groups to the C-terminal cysteine residues of Rab proteins, which is crucial for membrane association and function of these proteins in intracellular vesicular trafficking. This domain is inserted between pfam01239 repeats. This domain adopts an Ig-like fold and is thought to be involved in protein-protein interactions and might be involved in the recognition and binding of REP. Probab=20.14 E-value=51 Score=12.45 Aligned_cols=42 Identities=17% Similarity=0.165 Sum_probs=26.1 Q ss_pred CCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCEEECCC Q ss_conf 557269980885411001347889998773003557510288960770535455 Q gi|254780419|r 395 DGDSCFFIAGDPLKFCSFAADARNHIAQELGIINCNCFEFCWIMDFPFYEWNEE 448 (622) Q Consensus 395 ~GD~i~f~a~~~~~~~~~lg~lR~~ia~~l~li~~~~~~flWV~DFPLFe~dee 448 (622) ..+++++..|.|..+.+-.-. ..+.++++|+.|+|-=..++. T Consensus 27 ~~~L~Lv~Dg~p~~veWrt~~------------~r~r~s~vwlcdlP~~si~d~ 68 (102) T pfam07711 27 MVGLLLVLDGQPQAVEWRTAH------------PRLRHSPVWLCDLPAGSINDQ 68 (102) T ss_pred CCEEEEEECCCCEEEEEECCC------------CCCCCCCEEEEECCHHHCCCC T ss_conf 326899967980799976899------------976758279983881235764 No 345 >TIGR01387 cztR_silR_copR heavy metal response regulator; InterPro: IPR006291 Members of this family contain a response regulator receiver domain and an associated transcriptional regulatory region. This group is separated phylogenetically from related proteins with similar architecture and contains a number of proteins associated with heavy metal resistance efflux systems for copper, silver, cadmium, and/or zinc. Most members are encoded by genes adjacent to genes which code for a member of the heavy metal sensor histidine kinase family (IPR006290 from INTERPRO), its partner in the two-component response regulator system. ; GO: 0000156 two-component response regulator activity, 0003677 DNA binding, 0000160 two-component signal transduction system (phosphorelay), 0007600 sensory perception. Probab=20.08 E-value=39 Score=13.32 Aligned_cols=24 Identities=42% Similarity=0.556 Sum_probs=15.3 Q ss_pred EECCCC--C---------CCCCCCCCCCCCCCCCCHHHH Q ss_conf 203321--1---------246456645432222210344 Q gi|254780419|r 164 CSTPIL--T---------ASSPEGARDFLVPSRVNQGSF 191 (622) Q Consensus 164 veTP~L--~---------ksTpeGArdFlVPsR~~~g~f 191 (622) -|||+| | |.=.-||-|||| +|-.| T Consensus 69 ~~~PVl~LTA~D~v~DkvkGLd~GAdDYLv----KPF~F 103 (219) T TIGR01387 69 KETPVLFLTARDSVADKVKGLDLGADDYLV----KPFEF 103 (219) T ss_pred CCCCEEEEECCCCCCCEEEEECCCCCCCCC----CCCCC T ss_conf 987348883378845435301147865413----78776 No 346 >PRK00714 dinucleoside polyphosphate hydrolase; Reviewed Probab=20.05 E-value=51 Score=12.44 Aligned_cols=39 Identities=21% Similarity=0.298 Sum_probs=19.3 Q ss_pred CCCCCCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHC Q ss_conf 66553800003565410268999999999999999999980872345773212189887513 Q gi|254780419|r 225 PRSDRLPGEFYQLDMEMSFVEQDDILNTMENVLRGVFEEFSNGNTVSNNFLRICYDDAIRRY 286 (622) Q Consensus 225 ~R~dRqp~EFTqlD~EmsF~~~edvm~l~E~li~~i~~~v~~~~~i~~pF~rmtY~eAm~~Y 286 (622) +.++.+| ||. +.-|++.+++++.+ .||.|=-|..++..| T Consensus 113 l~~~~~p-EF~----~wkW~~~~~~~~~v------------------v~FKr~vY~~vl~ef 151 (156) T PRK00714 113 LNTSHHP-EFD----AWRWVSYWYPLRQV------------------VPFKRDVYRRVLKEF 151 (156) T ss_pred CCCCCCC-CEE----EEEECCHHHHHHHC------------------CCCCHHHHHHHHHHH T ss_conf 6899998-623----34758889977763------------------331899999999999 Done!