Query gi|254780757|ref|YP_003065170.1| glycyl-tRNA synthetase subunit alpha [Candidatus Liberibacter asiaticus str. psy62] Match_columns 307 No_of_seqs 141 out of 623 Neff 3.4 Searched_HMMs 39220 Date Sun May 29 19:01:10 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780757.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK09348 glyQ glycyl-tRNA synt 100.0 0 0 1236.8 27.7 283 10-305 1-283 (291) 2 cd00733 GlyRS_alpha_core Class 100.0 0 0 1205.6 26.7 279 12-304 1-279 (279) 3 pfam02091 tRNA-synt_2e Glycyl- 100.0 0 0 1205.8 25.8 280 13-306 1-281 (284) 4 COG0752 GlyQ Glycyl-tRNA synth 100.0 0 0 1197.7 26.2 288 8-305 2-289 (298) 5 TIGR00388 glyQ glycyl-tRNA syn 100.0 0 0 727.6 20.0 285 11-305 1-287 (295) 6 PRK13902 alaS lanyl-tRNA synth 98.1 1.2E-05 3.1E-10 62.1 7.7 176 8-190 55-258 (898) 7 PRK01584 alanyl-tRNA synthetas 97.8 0.00014 3.5E-09 54.5 7.5 167 11-188 1-249 (593) 8 PRK00252 alaS alanyl-tRNA synt 97.5 0.0072 1.8E-07 42.0 12.9 173 10-187 1-243 (864) 9 cd00768 class_II_aaRS-like_cor 97.1 0.0024 6.1E-08 45.4 7.4 136 14-154 3-179 (211) 10 COG0013 AlaS Alanyl-tRNA synth 97.0 0.0031 7.9E-08 44.6 6.5 176 9-189 4-249 (879) 11 cd00673 AlaRS_core Alanyl-tRNA 96.7 0.013 3.4E-07 40.1 7.8 164 15-181 3-232 (232) 12 PRK10748 flavin mononucleotide 94.4 0.025 6.5E-07 38.0 2.2 82 48-142 120-208 (238) 13 PRK09449 nucleotidase; Provisi 91.3 0.27 7E-06 30.5 3.8 58 81-143 133-197 (225) 14 PRK05159 aspC aspartyl-tRNA sy 89.2 0.41 1E-05 29.3 3.2 40 144-190 173-212 (434) 15 PRK00476 aspS aspartyl-tRNA sy 89.0 0.38 9.7E-06 29.5 2.9 19 15-33 147-165 (587) 16 PRK12820 bifunctional aspartyl 87.7 0.69 1.8E-05 27.6 3.6 111 130-240 484-653 (706) 17 COG1011 Predicted hydrolase (H 86.8 0.64 1.6E-05 27.8 3.0 87 47-142 105-199 (229) 18 COG0173 AspS Aspartyl-tRNA syn 86.7 0.67 1.7E-05 27.7 3.0 64 126-189 462-552 (585) 19 PRK00484 lysS lysyl-tRNA synth 85.9 0.63 1.6E-05 27.9 2.5 74 14-91 175-269 (491) 20 KOG3109 consensus 85.4 1.6 4E-05 25.0 4.4 110 7-148 99-210 (244) 21 KOG2411 consensus 81.1 1.8 4.5E-05 24.7 3.2 89 128-219 506-625 (628) 22 cd00775 LysRS_core Lys_tRNA sy 80.2 0.73 1.9E-05 27.4 1.0 22 14-35 12-33 (329) 23 TIGR00459 aspS_bact aspartyl-t 78.8 1.7 4.3E-05 24.8 2.5 59 131-189 530-616 (653) 24 PRK12445 lysyl-tRNA synthetase 77.9 1.6 4E-05 25.0 2.1 56 134-189 411-497 (505) 25 pfam04222 DUF416 Protein of un 74.8 7.7 0.0002 20.0 6.6 62 238-306 46-107 (110) 26 PRK02983 lysS lysyl-tRNA synth 73.0 4 0.0001 22.1 3.2 55 134-189 1007-1092(1099) 27 cd00776 AsxRS_core Asx tRNA sy 72.4 1.6 4E-05 25.1 1.0 30 15-44 29-63 (322) 28 PRK04172 pheS phenylalanyl-tRN 70.4 9.7 0.00025 19.3 5.3 130 47-189 349-491 (501) 29 PRK09350 lysyl-tRNA synthetase 70.2 1.5 3.8E-05 25.2 0.5 21 14-34 20-40 (325) 30 TIGR00470 sepS O-phosphoseryl- 69.1 1.9 4.8E-05 24.5 0.8 34 165-204 336-369 (558) 31 PRK12420 histidyl-tRNA synthet 68.5 11 0.00027 19.0 4.7 53 54-113 96-156 (421) 32 KOG3085 consensus 68.4 8.4 0.00021 19.7 4.0 118 32-159 101-229 (237) 33 PRK13222 phosphoglycolate phos 67.8 11 0.00028 18.9 5.3 87 47-141 99-192 (228) 34 cd00777 AspRS_core Asp tRNA sy 66.5 3 7.7E-05 23.0 1.4 21 15-35 6-26 (280) 35 COG0017 AsnS Aspartyl/asparagi 66.0 3.5 9E-05 22.5 1.7 14 223-236 246-259 (435) 36 pfam01411 tRNA-synt_2c tRNA sy 65.8 1.6 4.1E-05 25.0 -0.1 160 16-186 2-188 (545) 37 PRK06462 asparagine synthetase 64.4 3 7.7E-05 23.0 1.1 60 15-112 34-98 (332) 38 cd00669 Asp_Lys_Asn_RS_core As 64.2 2.3 5.8E-05 23.9 0.4 171 14-189 5-263 (269) 39 PRK03932 asnC asparaginyl-tRNA 64.0 3.2 8.1E-05 22.8 1.1 16 16-31 141-156 (462) 40 TIGR02572 LcrR type III secret 63.4 1.5 3.8E-05 25.2 -0.7 33 105-147 8-40 (142) 41 cd00496 PheRS_alpha_core Pheny 63.1 2.8 7.1E-05 23.2 0.7 125 49-188 73-213 (218) 42 pfam01409 tRNA-synt_2d tRNA sy 62.5 14 0.00035 18.3 6.0 169 12-190 18-235 (243) 43 pfam00152 tRNA-synt_2 tRNA syn 61.6 2.6 6.6E-05 23.4 0.3 20 15-34 27-46 (341) 44 PRK11587 putative phosphatase; 61.1 7.3 0.00019 20.2 2.5 79 47-136 89-175 (218) 45 KOG1576 consensus 60.9 7.1 0.00018 20.3 2.5 108 52-190 89-208 (342) 46 PRK10563 6-phosphogluconate ph 60.7 8.7 0.00022 19.6 2.9 43 92-141 142-184 (221) 47 COG0016 PheS Phenylalanyl-tRNA 60.0 3 7.6E-05 23.0 0.4 168 13-190 113-328 (335) 48 PRK06253 O-phosphoseryl-tRNA s 58.3 3.3 8.5E-05 22.7 0.4 127 54-190 203-342 (527) 49 PRK13288 pyrophosphatase PpaX; 57.1 11 0.00027 19.0 2.8 109 14-138 63-177 (214) 50 TIGR00469 pheS_mito phenylalan 56.9 6.2 0.00016 20.7 1.6 88 112-205 259-357 (460) 51 PRK13229 consensus 56.0 14 0.00037 18.0 3.3 81 48-136 101-187 (234) 52 pfam06046 Sec6 Exocyst complex 55.3 18 0.00045 17.4 5.5 88 207-294 443-541 (556) 53 PRK13227 consensus 53.8 15 0.00038 18.0 3.1 84 47-138 102-191 (228) 54 PRK13228 consensus 52.6 16 0.0004 17.8 3.1 80 49-136 104-189 (232) 55 COG3916 LasI N-acyl-L-homoseri 52.2 4.7 0.00012 21.6 0.3 17 135-151 32-49 (209) 56 PRK09456 phosphatase; Provisio 51.4 15 0.00038 17.9 2.8 50 85-141 133-183 (199) 57 pfam01808 AICARFT_IMPCHas AICA 51.0 9.2 0.00024 19.5 1.7 87 62-164 70-169 (315) 58 PTZ00326 phenylalanyl-tRNA syn 49.6 4.9 0.00012 21.5 0.1 124 57-191 368-502 (505) 59 TIGR01993 Pyr-5-nucltdase pyri 48.6 12 0.00031 18.5 2.0 110 11-138 89-201 (205) 60 PRK10725 fructose-1-phosphatas 47.4 20 0.00051 17.0 2.9 27 92-124 142-168 (188) 61 COG0138 PurH AICAR transformyl 47.2 20 0.00052 17.0 3.0 61 47-108 187-247 (515) 62 pfam10775 ATP_sub_h ATP syntha 46.2 12 0.00032 18.5 1.7 20 95-114 10-29 (67) 63 cd00418 GlxRS_core Glutamyl-tR 46.2 7.4 0.00019 20.1 0.6 49 225-287 76-131 (223) 64 pfam07805 HipA_N HipA-like N-t 45.5 25 0.00063 16.4 3.8 49 71-123 15-70 (81) 65 PRK10826 2-deoxyglucose-6-phos 44.4 21 0.00053 16.9 2.7 70 48-124 99-174 (222) 66 TIGR00457 asnS asparaginyl-tRN 44.2 14 0.00035 18.2 1.7 20 169-188 465-484 (495) 67 KOG2623 consensus 44.0 26 0.00066 16.2 7.8 137 142-289 202-368 (467) 68 COG2024 Phenylalanyl-tRNA synt 43.2 9.4 0.00024 19.4 0.7 121 55-190 205-351 (536) 69 pfam06159 DUF974 Protein of un 43.1 20 0.00051 17.1 2.4 27 57-93 82-108 (233) 70 PRK00881 purH bifunctional pho 42.8 18 0.00047 17.3 2.2 92 56-164 198-302 (514) 71 PRK07106 5-aminoimidazole-4-ca 42.0 9 0.00023 19.5 0.5 18 273-290 315-332 (391) 72 KOG2783 consensus 41.9 13 0.00032 18.5 1.2 77 146-226 280-358 (436) 73 PRK00488 pheS phenylalanyl-tRN 41.1 7.1 0.00018 20.3 -0.1 168 13-190 109-330 (338) 74 PRK05184 pyrroloquinoline quin 40.8 28 0.00071 16.0 2.8 14 89-102 52-65 (302) 75 smart00798 AICARFT_IMPCHas AIC 40.4 20 0.00051 17.0 2.0 84 64-164 71-166 (311) 76 TIGR01326 OAH_OAS_sulfhy O-ace 40.3 24 0.0006 16.5 2.4 66 172-237 311-377 (434) 77 TIGR02737 caa3_CtaG cytochrome 39.8 8.1 0.00021 19.9 -0.0 20 151-171 254-273 (286) 78 pfam09621 LcrR Type III secret 39.8 18 0.00045 17.4 1.7 35 103-147 6-40 (139) 79 COG0647 NagD Predicted sugar p 38.5 31 0.0008 15.6 3.5 54 88-147 186-239 (269) 80 pfam09139 Mmp37 Mitochondrial 37.9 21 0.00054 16.8 1.9 34 70-109 181-217 (324) 81 PRK12295 hisZ ATP phosphoribos 37.6 32 0.00082 15.5 5.3 74 28-112 57-136 (373) 82 TIGR02252 DREG-2 REG-2-like, H 37.5 32 0.00082 15.5 3.0 59 79-141 155-224 (224) 83 TIGR00113 queA S-adenosylmethi 37.4 11 0.00027 19.0 0.3 11 36-46 222-232 (364) 84 PRK12293 hisZ ATP phosphoribos 35.3 35 0.00089 15.3 5.1 50 54-113 92-143 (281) 85 TIGR01458 HAD-SF-IIA-hyp3 HAD- 34.5 36 0.00091 15.2 3.8 46 85-136 173-218 (258) 86 COG3259 FrhA Coenzyme F420-red 34.3 32 0.00081 15.6 2.3 107 148-268 292-427 (441) 87 COG0124 HisS Histidyl-tRNA syn 33.7 37 0.00094 15.1 5.4 58 52-113 96-158 (429) 88 COG3343 RpoE DNA-directed RNA 32.6 30 0.00075 15.8 1.9 48 5-60 25-78 (175) 89 TIGR00644 recJ single-stranded 31.7 40 0.001 14.9 3.4 91 26-119 55-147 (705) 90 PRK13834 putative autoinducer 31.3 10 0.00026 19.1 -0.6 16 135-150 33-49 (207) 91 COG4326 Spo0M Sporulation cont 30.8 29 0.00075 15.8 1.7 56 108-165 138-199 (270) 92 PRK13226 phosphoglycolate phos 30.8 41 0.001 14.8 3.0 69 47-124 93-169 (221) 93 pfam09454 Vps23_core Vps23 cor 30.4 42 0.0011 14.7 2.6 20 272-294 36-55 (65) 94 PRK00037 hisS histidyl-tRNA sy 29.6 43 0.0011 14.6 4.7 60 54-117 100-164 (417) 95 PRK13223 phosphoglycolate phos 29.4 43 0.0011 14.6 2.7 105 160-293 156-267 (272) 96 TIGR02254 YjjG/YfnB HAD superf 29.2 40 0.001 14.8 2.1 48 91-143 158-206 (233) 97 COG3772 Phage-related lysozyme 28.5 19 0.00048 17.2 0.3 21 223-243 61-81 (152) 98 KOG2227 consensus 28.3 45 0.0011 14.5 3.3 63 129-192 198-265 (529) 99 PRK13224 consensus 28.1 45 0.0012 14.5 3.0 80 47-136 92-177 (216) 100 TIGR02108 PQQ_syn_pqqB coenzym 27.9 22 0.00057 16.7 0.6 23 123-147 291-313 (314) 101 cd00770 SerRS_core Seryl-tRNA 27.8 46 0.0012 14.4 3.5 61 52-114 126-198 (297) 102 TIGR01380 glut_syn glutathione 27.7 46 0.0012 14.4 2.6 28 17-44 153-180 (322) 103 KOG0188 consensus 27.4 31 0.0008 15.6 1.3 217 12-245 49-303 (895) 104 TIGR01366 serC_3 phosphoserine 26.5 29 0.00075 15.8 1.0 79 68-158 1-81 (362) 105 pfam03800 Nuf2 Nuf2 family. Me 26.4 42 0.0011 14.7 1.8 24 89-112 30-54 (144) 106 PRK11007 trehalose(maltose)-sp 26.2 49 0.0012 14.2 3.4 66 85-171 62-127 (473) 107 KOG3040 consensus 25.7 50 0.0013 14.2 2.7 102 9-138 120-221 (262) 108 pfam01215 COX5B Cytochrome c o 24.8 46 0.0012 14.4 1.8 26 162-188 7-34 (99) 109 smart00802 UME Domain in UVSB 24.7 52 0.0013 14.0 4.0 36 265-301 16-51 (107) 110 pfam06920 Ded_cyto Dedicator o 24.6 52 0.0013 14.0 2.6 61 171-240 89-149 (178) 111 PRK05431 seryl-tRNA synthetase 24.4 52 0.0013 14.0 3.6 97 38-144 231-342 (422) 112 cd00924 Cyt_c_Oxidase_Vb Cytoc 24.1 49 0.0012 14.2 1.8 27 162-189 7-35 (97) 113 pfam12410 rpo30_N Poxvirus DNA 24.1 40 0.001 14.8 1.4 52 213-273 76-134 (136) 114 COG1923 Hfq Uncharacterized ho 23.9 46 0.0012 14.4 1.6 23 93-115 2-24 (77) 115 PRK09537 pylS pyrolysyl-tRNA s 23.9 38 0.00096 15.0 1.2 121 46-187 271-406 (420) 116 PTZ00047 cytochrome C oxidase 23.6 37 0.00093 15.1 1.1 62 66-131 31-103 (163) 117 COG1190 LysU Lysyl-tRNA synthe 23.5 35 0.00088 15.3 0.9 27 162-188 467-493 (502) 118 PRK00147 queA S-adenosylmethio 23.4 28 0.00072 15.9 0.5 27 46-73 79-105 (345) 119 pfam01563 Alpha_E3_glycop Alph 23.4 55 0.0014 13.9 2.9 28 229-266 20-47 (56) 120 TIGR03112 6_pyr_pter_rel 6-pyr 23.4 47 0.0012 14.3 1.6 37 91-130 71-108 (113) 121 CHL00201 syh histidine-tRNA sy 22.8 56 0.0014 13.8 4.4 79 26-112 72-158 (424) 122 pfam00765 Autoind_synth Autoin 22.6 18 0.00046 17.3 -0.6 17 134-150 24-41 (182) 123 TIGR02253 CTE7 HAD superfamily 22.5 57 0.0014 13.7 3.2 46 91-142 168-215 (244) 124 TIGR00458 aspS_arch aspartyl-t 22.5 40 0.001 14.8 1.1 38 143-187 190-227 (466) 125 pfam02975 Me-amine-dh_L Methyl 22.1 49 0.0012 14.2 1.5 21 53-73 40-60 (122) 126 PRK12292 hisZ ATP phosphoribos 21.8 59 0.0015 13.6 4.5 52 54-112 95-154 (388) 127 pfam05066 RNA_pol_delta DNA-di 21.7 59 0.0015 13.6 1.8 19 5-23 25-43 (94) 128 TIGR03351 PhnX-like phosphonat 21.5 59 0.0015 13.6 2.6 87 47-140 93-189 (220) 129 COG2502 AsnA Asparagine synthe 21.4 60 0.0015 13.6 4.0 169 15-188 68-314 (330) 130 PRK02363 DNA-directed RNA poly 21.4 60 0.0015 13.6 1.9 21 5-25 25-45 (185) 131 PRK10834 hypothetical protein; 21.2 60 0.0015 13.6 3.0 60 225-287 94-158 (239) 132 KOG2360 consensus 21.2 60 0.0015 13.6 2.6 39 257-297 222-260 (413) 133 pfam10182 Flo11 Flo11 domain. 21.1 49 0.0013 14.2 1.3 77 75-170 31-110 (152) 134 cd00672 CysRS_core This is the 21.1 60 0.0015 13.5 3.1 67 98-217 90-156 (213) 135 PRK01424 S-adenosylmethionine: 20.7 30 0.00077 15.7 0.2 25 48-73 74-98 (366) 136 TIGR01838 PHA_synth_I poly(R)- 20.5 27 0.0007 16.0 -0.1 20 111-130 39-59 (541) 137 TIGR01990 bPGM beta-phosphoglu 20.3 63 0.0016 13.4 1.8 20 92-115 146-165 (190) 138 COG3926 zliS Lysozyme family p 20.3 54 0.0014 13.9 1.4 58 10-76 53-118 (252) 139 PRK13534 7-cyano-7-deazaguanin 20.1 63 0.0016 13.4 3.8 113 124-244 79-214 (630) 140 cd07641 BAR_ASAP1 The Bin/Amph 20.1 48 0.0012 14.3 1.1 16 166-181 60-75 (215) No 1 >PRK09348 glyQ glycyl-tRNA synthetase subunit alpha; Validated Probab=100.00 E-value=0 Score=1236.77 Aligned_cols=283 Identities=71% Similarity=1.225 Sum_probs=279.5 Q ss_pred CCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEE Q ss_conf 77799999999999986695896116754333224857898744834401676415578876554667750011444455 Q gi|254780757|r 10 DLSFQNIILTLTQYWAQQGCTILQPYDMEVGAGTFHPSTTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQV 89 (307) Q Consensus 10 ~~~fq~ii~~L~~fW~~~GC~i~qpyd~e~GAgT~hp~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQV 89 (307) +|+||+||++||+||+++||+|+||||+|||||||||+||||+||||||++|||||||||+||||||||||||||||||| T Consensus 1 ~~~fq~~I~~Lq~~W~~~GC~i~qpyd~evGAgT~~p~Tflr~lgp~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQV 80 (291) T PRK09348 1 KMTFQDIILTLQDYWADQGCAILQPYDMEVGAGTFHPATFLRALGPEPWNAAYVQPSRRPTDGRYGENPNRLQHYYQFQV 80 (291) T ss_pred CCCHHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCCHHHHHHCCCCCCCCEEEECCCCCCCCCCCCCCCHHHHHEEEEEE T ss_conf 97589999999999976797886575443454658839888502887640542134578887754689124201055689 Q ss_pred EECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCEEEEECCEEEHHHHHHHHHCCCCCCCCHHHHHH Q ss_conf 75798335899999999981999621717988328887001101232176887874147778998668257751456787 Q gi|254780757|r 90 IIKPNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGWECWCDGMEISQFTYFQQVCGIECSPISGEITY 169 (307) Q Consensus 90 ilKPsp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGWEvwldGMEItQFTYFQQvGGi~c~pv~~EiTY 169 (307) ||||||+|+|+|||+||++||||+.+|||||||||||||||||||||||||||||||||||||||||||+|+|||+|||| T Consensus 81 ilKPsP~n~q~lyL~SL~~lgid~~~hDirFvED~WEsPtlGAwGlGWEVWldGMEitQFTYFQQvGGi~c~pvs~EiTY 160 (291) T PRK09348 81 ILKPSPDNIQELYLGSLEALGIDPLEHDIRFVEDNWESPTLGAWGLGWEVWLDGMEVTQFTYFQQVGGIECKPVTGEITY 160 (291) T ss_pred EECCCCCCHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCCCCCEEEEECCEEEEEEEHHHHCCCCCCCCCCEEEEH T ss_conf 97789713999999999980889753543786337778762342343499987667652232011078015664344213 Q ss_pred HHHHHHHHHCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 89999999869982113320688878831106244746997776350179899999999999999999850066532335 Q gi|254780757|r 170 GLERLAMYVQNVNSVYDIVFNAIEGQNVLYGDIFAQSEQEYSRYNFEYANPEILHNHFIDSEKECLDLLKKGIPNEHHRH 249 (307) Q Consensus 170 GLERiaMylQ~vdnvyDl~wn~~~~~~vtYGdif~q~E~E~S~YNFE~Ad~~~L~~~F~~yE~Ea~rLle~~LP~~~~~~ 249 (307) ||||||||||+|||||||+||+ .++||||||+|||+|||+||||+||+++|+++|++||+||++||+++|| T Consensus 161 GLERiAMylQ~vdnVyDl~wn~---~~vtYGDvf~q~E~E~S~YNFE~Ad~~~L~~~F~~~E~Ea~~Lle~~L~------ 231 (291) T PRK09348 161 GLERLAMYLQGVDNVYDLVWND---GGVTYGDVFLQNEVEQSTYNFEHADVEMLFKLFDDYEKEAKRLLEKGLP------ 231 (291) T ss_pred HHHHHHHHHHCCCEEEEEEECC---CCCCHHHHHHHHHHHHHHHCHHCCCHHHHHHHHHHHHHHHHHHHHCCCC------ T ss_conf 1999999982888048855258---9963999887779999884200089999999999999999999976998------ Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 53013828999999999878883875017889999999999999999999854516 Q gi|254780757|r 250 HLCVFPAYDQCIKASHIFNLLDARGVISTTERQRYILRIRSLAKACGEAFLMTPNG 305 (307) Q Consensus 250 ~~lvlPAYD~~LKaSH~FNLLDARGaISVTERq~YI~RIR~Lak~~A~ayl~~re~ 305 (307) +||||+||||||+||||||||||||||||+||+|||+|||+||++|+++|+. T Consensus 232 ----lPAYD~~LKaSH~FNLLDARGaISVTER~~YI~RIR~LAk~~A~~y~~~re~ 283 (291) T PRK09348 232 ----LPAYDYVLKASHTFNLLDARGAISVTERQRYILRIRNLARAVAEAYLESREA 283 (291) T ss_pred ----CCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf ----3089999987777653402677548999999999999999999999999985 No 2 >cd00733 GlyRS_alpha_core Class II Glycyl-tRNA synthetase (GlyRS) alpha subunit core catalytic domain. GlyRS functions as a homodimer in eukaryotes, archaea and some bacteria and as a heterotetramer in the remainder of prokaryotes and in arabidopsis. It is responsible for the attachment of glycine to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. This alignment contains only sequences from the GlyRS form which heterotetramerizes. The homodimer form of GlyRS is in a different family of class II aaRS. Class II assignment is based upon structure and the presence of three characteristic sequence motifs. Probab=100.00 E-value=0 Score=1205.59 Aligned_cols=279 Identities=68% Similarity=1.189 Sum_probs=277.0 Q ss_pred CHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEE Q ss_conf 79999999999998669589611675433322485789874483440167641557887655466775001144445575 Q gi|254780757|r 12 SFQNIILTLTQYWAQQGCTILQPYDMEVGAGTFHPSTTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVII 91 (307) Q Consensus 12 ~fq~ii~~L~~fW~~~GC~i~qpyd~e~GAgT~hp~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVil 91 (307) |||+||++||+||+++||+|+||||+|||||||||+|||||||||||++|||||||||+|||||+||||||||||||||| T Consensus 1 tfq~ii~~L~~fW~~~GC~i~qpyd~e~GAgT~hp~T~lr~lg~~pw~~aYvqPsrRP~DgRYG~NPNRlq~y~QfQVil 80 (279) T cd00733 1 TFQDLILKLQKFWASQGCLIIQPYDMEVGAGTFHPATFLRALGPEPWNVAYVEPSRRPTDGRYGENPNRLQHYYQFQVII 80 (279) T ss_pred CHHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCHHCEEEEEEEEE T ss_conf 97899999999998779688767654246465888998863199863155424677999987778932310003457998 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCEEEEECCEEEHHHHHHHHHCCCCCCCCHHHHHHHH Q ss_conf 79833589999999998199962171798832888700110123217688787414777899866825775145678789 Q gi|254780757|r 92 KPNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGWECWCDGMEISQFTYFQQVCGIECSPISGEITYGL 171 (307) Q Consensus 92 KPsp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGWEvwldGMEItQFTYFQQvGGi~c~pv~~EiTYGL 171 (307) ||||+|+|+|||+||++||||+.+||||||||||||||||||||||||||||||||||||||||||++|+||++|||||| T Consensus 81 KPsp~n~q~lYL~SL~~igid~~~hDIrFveDnWEsPtLGAwGlGWEVwldGMEItQFTYFQQvGGi~c~pv~~EiTYGL 160 (279) T cd00733 81 KPSPDNIQELYLESLEALGINPKEHDIRFVEDNWESPTLGAWGLGWEVWLDGMEVTQFTYFQQVGGIPCKPISVEITYGL 160 (279) T ss_pred CCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCCCCCCCCCEEEEECCEEEEEEEEEEEECCEECCCCCEEEEHHH T ss_conf 78956589999998998497954362478614787875310234349998775776435121137821445323310119 Q ss_pred HHHHHHHCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 99999986998211332068887883110624474699777635017989999999999999999985006653233553 Q gi|254780757|r 172 ERLAMYVQNVNSVYDIVFNAIEGQNVLYGDIFAQSEQEYSRYNFEYANPEILHNHFIDSEKECLDLLKKGIPNEHHRHHL 251 (307) Q Consensus 172 ERiaMylQ~vdnvyDl~wn~~~~~~vtYGdif~q~E~E~S~YNFE~Ad~~~L~~~F~~yE~Ea~rLle~~LP~~~~~~~~ 251 (307) ||||||||+|||||||+|| +++||||||+|+|+|||+||||+||+++|+++|++||+||++|++++|| T Consensus 161 ERiaMylQ~vd~v~dl~wn----~~vtYgdvf~q~E~e~S~YNFE~ad~~~L~~~F~~~E~Ea~~Ll~~~L~-------- 228 (279) T cd00733 161 ERIAMYLQGVDNVYDIEWN----KKITYGDVFLQNEIEQSVYNFEYANVDMLFQLFEDYEKEAKRLLELGLP-------- 228 (279) T ss_pred HHHHHHHHCCCHHEEEECC----CCCEEEHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCC-------- T ss_conf 9999998575423141237----8861012232779999986056469999999999999999999975999-------- Q ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 01382899999999987888387501788999999999999999999985451 Q gi|254780757|r 252 CVFPAYDQCIKASHIFNLLDARGVISTTERQRYILRIRSLAKACGEAFLMTPN 304 (307) Q Consensus 252 lvlPAYD~~LKaSH~FNLLDARGaISVTERq~YI~RIR~Lak~~A~ayl~~re 304 (307) +||||+||||||+||||||||||||||||+||+|||+|||+||++|+++|| T Consensus 229 --lPAYD~~LKaSH~FNLLDARGaISVtER~~YI~RIR~Lak~~a~~y~~~re 279 (279) T cd00733 229 --LPAYDYVLKCSHTFNLLDARGAISVTERQRYILRIRNLAREIAKLYVEQRE 279 (279) T ss_pred --CCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf --367999999988888773358865899999999999999999999998419 No 3 >pfam02091 tRNA-synt_2e Glycyl-tRNA synthetase alpha subunit. Probab=100.00 E-value=0 Score=1205.76 Aligned_cols=280 Identities=70% Similarity=1.224 Sum_probs=277.5 Q ss_pred HHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEEC Q ss_conf 99999999999986695896116754333224857898744834401676415578876554667750011444455757 Q gi|254780757|r 13 FQNIILTLTQYWAQQGCTILQPYDMEVGAGTFHPSTTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVIIK 92 (307) Q Consensus 13 fq~ii~~L~~fW~~~GC~i~qpyd~e~GAgT~hp~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVilK 92 (307) ||+||++||+||+++||+|+||||+|||||||||+|||||||||||++|||||||||+|||||+|||||||||||||||| T Consensus 1 fq~ii~~L~~fW~~~GC~i~qpyd~e~GAgT~~p~T~lr~lgp~pw~~aYvqPsrRP~DgRYGenPnRlq~y~QfQVi~K 80 (284) T pfam02091 1 FQSMILTLQEYWASQGCVIMQPYDMEVGAGTFNPATFLRALGPEPWNVAYVEPSRRPTDGRYGENPNRLQHYYQFQVILK 80 (284) T ss_pred CHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCHHHHHEEEEEEEC T ss_conf 95899999999987796887776542464658889988631998641554246779999877789204200223578977 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCEEEEECCEEEHHHHHHHHHCCCCCCCCHHHHHHHHH Q ss_conf 98335899999999981999621717988328887001101232176887874147778998668257751456787899 Q gi|254780757|r 93 PNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGWECWCDGMEISQFTYFQQVCGIECSPISGEITYGLE 172 (307) Q Consensus 93 Psp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGWEvwldGMEItQFTYFQQvGGi~c~pv~~EiTYGLE 172 (307) |||+|+|+|||+||++||||+.+||||||||||||||||||||||||||||||||||||||||||++|+||++||||||| T Consensus 81 Psp~n~q~lYL~SL~~lgid~~~hDIrFveDnWEsPtLGAwGlGWEVwldGMEITQFTYFQQvGGi~c~pv~~EiTYGLE 160 (284) T pfam02091 81 PSPDNIQELYLGSLKALGIDPLDHDIRFVEDNWESPTLGAWGLGWEVWLDGMEITQFTYFQQVGGLECKPVSGEITYGLE 160 (284) T ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCCCCCCCCCEEEEECCEEEEEEEEEEEECCEECCCCCEEEEHHHH T ss_conf 89466899999989983979543624786147878753102343399987757764351201378214453233101199 Q ss_pred HHHHHHCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 99999869982113320688878831106244746997776350179899999999999999999850066532335530 Q gi|254780757|r 173 RLAMYVQNVNSVYDIVFNAIEGQNVLYGDIFAQSEQEYSRYNFEYANPEILHNHFIDSEKECLDLLKKGIPNEHHRHHLC 252 (307) Q Consensus 173 RiaMylQ~vdnvyDl~wn~~~~~~vtYGdif~q~E~E~S~YNFE~Ad~~~L~~~F~~yE~Ea~rLle~~LP~~~~~~~~l 252 (307) |||||||+|||||||+|| +++||||||+|+|+|||+||||+||+++|+++|++||+||++||+++|| T Consensus 161 RiAMylQ~vdnv~Dl~Wn----~~vtYGDvf~q~E~E~S~yNFe~ad~~~L~~~F~~~E~E~~~Ll~~~L~--------- 227 (284) T pfam02091 161 RLAMYLQKVDSVYDLVWA----DGLTYGDIFQQNEVEQSTYNFETANVDMLFKHFDDYEEEALQLLENGLP--------- 227 (284) T ss_pred HHHHHHHCCCHHHEEECC----CCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCC--------- T ss_conf 999999576224130147----9971424443779999886056479999999999999999999976999--------- Q ss_pred CCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHC-C Q ss_conf 13828999999999878883875017889999999999999999999854516-8 Q gi|254780757|r 253 VFPAYDQCIKASHIFNLLDARGVISTTERQRYILRIRSLAKACGEAFLMTPNG-G 306 (307) Q Consensus 253 vlPAYD~~LKaSH~FNLLDARGaISVTERq~YI~RIR~Lak~~A~ayl~~re~-g 306 (307) +||||+||||||+||||||||||||||||+||+|||+|||+||++|+++|+. | T Consensus 228 -lPAYD~~LK~SH~FNLLDARGaISVtER~~YI~RVR~Lak~~a~~yi~~R~~lg 281 (284) T pfam02091 228 -LPAYDFVLKASHAFNLLDARGAISVTERTRYILRIRQLARAVADLYYEQRESLG 281 (284) T ss_pred -CCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf -367999999988988874258865999999999999999999999999999649 No 4 >COG0752 GlyQ Glycyl-tRNA synthetase, alpha subunit [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=1197.69 Aligned_cols=288 Identities=70% Similarity=1.209 Sum_probs=284.1 Q ss_pred CCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 67777999999999999866958961167543332248578987448344016764155788765546677500114444 Q gi|254780757|r 8 KNDLSFQNIILTLTQYWAQQGCTILQPYDMEVGAGTFHPSTTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQF 87 (307) Q Consensus 8 ~~~~~fq~ii~~L~~fW~~~GC~i~qpyd~e~GAgT~hp~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~Qf 87 (307) +.+|+||+||++||+||++|||+|+||||+|||||||||+||||+||||||++|||||||||+||||||||||||||||| T Consensus 2 ~~~~~fQ~~IltLq~yW~~qGC~i~QpyD~evGAGT~hPaTfLralGpePw~aAYVqPSRRP~DGRYGenPNRlq~yyQf 81 (298) T COG0752 2 NKKLTFQGLILTLQNYWAEQGCTILQPYDMEVGAGTFHPATFLRALGPEPWNAAYVQPSRRPTDGRYGENPNRLQHYYQF 81 (298) T ss_pred CCCCCHHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCCHHHHHHHCCCCCCCEEEECCCCCCCCCCCCCCCHHHHHHEEE T ss_conf 85433999999999999876978614553445667677688997609976410010567799888778892665233157 Q ss_pred EEEECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCEEEEECCEEEHHHHHHHHHCCCCCCCCHHHH Q ss_conf 55757983358999999999819996217179883288870011012321768878741477789986682577514567 Q gi|254780757|r 88 QVIIKPNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGWECWCDGMEISQFTYFQQVCGIECSPISGEI 167 (307) Q Consensus 88 QVilKPsp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGWEvwldGMEItQFTYFQQvGGi~c~pv~~Ei 167 (307) ||||||||+|||||||+||++|||||+.|||||||||||||||||||||||||||||||||||||||||||+|+||++|| T Consensus 82 QVilKPsP~NiQeLYL~SL~~lGid~~~HDIRFVEDnWE~PTlGawGlGWEVWldGMEvTQFTYFQQvGGiec~pV~~EI 161 (298) T COG0752 82 QVIIKPSPDNIQELYLGSLEALGIDPLEHDIRFVEDNWENPTLGAWGLGWEVWLDGMEVTQFTYFQQVGGLECKPVSGEI 161 (298) T ss_pred EEEECCCCCCHHHHHHHHHHHCCCCHHHCCEEEECCCCCCCCCCCCCCCEEEEECCEEEEEEEHHHHHCCEECCCEEEEE T ss_conf 89955897118999998899809990123613411688788632023440677766265300064650782045311454 Q ss_pred HHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 87899999998699821133206888788311062447469977763501798999999999999999998500665323 Q gi|254780757|r 168 TYGLERLAMYVQNVNSVYDIVFNAIEGQNVLYGDIFAQSEQEYSRYNFEYANPEILHNHFIDSEKECLDLLKKGIPNEHH 247 (307) Q Consensus 168 TYGLERiaMylQ~vdnvyDl~wn~~~~~~vtYGdif~q~E~E~S~YNFE~Ad~~~L~~~F~~yE~Ea~rLle~~LP~~~~ 247 (307) ||||||||||||+|||||||.||+.+++.+||||||+|+|+|+|+||||+||+++||++|++||+||++|++++|| T Consensus 162 TYGlERlAmYiQ~vdnVydl~W~~~~~~~~tYgdvf~q~E~e~S~ynFE~ad~~~L~~~F~~~e~Ea~~ll~~~L~---- 237 (298) T COG0752 162 TYGLERLAMYIQGVDNVYDLEWNDGPGGKVTYGDVFLQNEVEQSKYNFEYADVDMLFRHFDDYEKEAKRLLELGLV---- 237 (298) T ss_pred EHHHHHHHHHHHCCCCEEEEEECCCCCCCEEEHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCC---- T ss_conf 2229999999957661157650478887501013222327776440400178899999999999999999984898---- Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 3553013828999999999878883875017889999999999999999999854516 Q gi|254780757|r 248 RHHLCVFPAYDQCIKASHIFNLLDARGVISTTERQRYILRIRSLAKACGEAFLMTPNG 305 (307) Q Consensus 248 ~~~~lvlPAYD~~LKaSH~FNLLDARGaISVTERq~YI~RIR~Lak~~A~ayl~~re~ 305 (307) +||||+||||||+||||||||||||||||+||+|||+|||+||++|+++|+. T Consensus 238 ------lPAYd~vlKasH~FNlLDARgaISVtER~~YI~RiR~Lar~~a~~y~e~r~~ 289 (298) T COG0752 238 ------LPAYDYVLKASHTFNLLDARGAISVTERQRYILRIRNLARAVAEAYLESREA 289 (298) T ss_pred ------CCHHHHHHHHHHHHHHHCCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf ------6339999976655533301565441688889999999999999999999985 No 5 >TIGR00388 glyQ glycyl-tRNA synthetase, alpha subunit; InterPro: IPR002310 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . In eubacteria, glycyl-tRNA synthetase (6.1.1.14 from EC) is an alpha2/beta2 tetramer composed of 2 different subunits , , . In some eubacteria, in archaea and eukaryota, glycyl-tRNA synthetase is an alpha2 dimer (see IPR002315 from INTERPRO). It belongs to class IIc and is one of the most complex synthetases. What is most interesting is the lack of similarity between the two types: divergence at the sequence level is so great that it is impossible to infer descent from common genes. The alpha and beta subunits (see IPR002311 from INTERPRO) also lack significant sequence similarity. However, they are translated from a single mRNA , and a single chain glycyl-tRNA synthetase from Chlamydia trachomatis has been found to have significant similarity with both domains, suggesting divergence from a single polypeptide chain .; GO: 0000166 nucleotide binding, 0004820 glycine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006426 glycyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=727.61 Aligned_cols=285 Identities=61% Similarity=1.105 Sum_probs=279.9 Q ss_pred CCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEE Q ss_conf 77999999999999866958961167543332248578987448344016764155788765546677500114444557 Q gi|254780757|r 11 LSFQNIILTLTQYWAQQGCTILQPYDMEVGAGTFHPSTTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVI 90 (307) Q Consensus 11 ~~fq~ii~~L~~fW~~~GC~i~qpyd~e~GAgT~hp~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVi 90 (307) ++|+++|++|++||+.+||+++||||+++||||+||.|++|++||+||+++||+|||||+|||||+||||++|||||||+ T Consensus 1 ~~~~~~~~~~~~~w~~~gc~~~~p~d~~~g~g~~~p~~~~~~~gp~p~~~~~~~p~~~p~dg~~g~~p~~~~~~~~~~~~ 80 (295) T TIGR00388 1 LTFQGLLLKLQEYWANQGCLLVQPYDVEKGAGTLHPATFLRSLGPEPWSVAYVEPSRRPTDGRYGENPNRLGHYYQFQVV 80 (295) T ss_pred CCHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCHHHHHHEEEEEEE T ss_conf 93588999988864106714641123322455220467876507652101000344576666567770132110025678 Q ss_pred ECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCEEEEECCEEEHHHHHHHHHCCCCCCCCHHHHHHH Q ss_conf 57983358999999999819996217179883288870011012321768878741477789986682577514567878 Q gi|254780757|r 91 IKPNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGWECWCDGMEISQFTYFQQVCGIECSPISGEITYG 170 (307) Q Consensus 91 lKPsp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGWEvwldGMEItQFTYFQQvGGi~c~pv~~EiTYG 170 (307) +||+|+|+|++||+||+++|++|..|||||||||||+|||||||+|||||+|||||||||||||+||++|+|+++|+||| T Consensus 81 ~~p~p~~~~~~~~~~~~~~g~~~~~~d~~~~~d~w~~p~~g~wg~gw~~w~~g~~~~~~~~~~~~gg~~c~p~~~~~~~g 160 (295) T TIGR00388 81 IKPSPDNIQELYLDSLRALGIDPTEHDIRFVEDNWENPTLGAWGLGWEVWLDGLEVTQFTYFQQVGGLECKPVSVEITYG 160 (295) T ss_pred ECCCCCHHHHHHHHHHHHHCCCCCCCCCEEECCCCCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHCCCCCCCCCEEHHHH T ss_conf 61684125787765566633773201100000256775443122433222054112333334430663224321100221 Q ss_pred HHHHHHHHCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH--HHHCCCCCCCC Q ss_conf 9999999869982113320688878831106244746997776350179899999999999999999--85006653233 Q gi|254780757|r 171 LERLAMYVQNVNSVYDIVFNAIEGQNVLYGDIFAQSEQEYSRYNFEYANPEILHNHFIDSEKECLDL--LKKGIPNEHHR 248 (307) Q Consensus 171 LERiaMylQ~vdnvyDl~wn~~~~~~vtYGdif~q~E~E~S~YNFE~Ad~~~L~~~F~~yE~Ea~rL--le~~LP~~~~~ 248 (307) +||++||+|++++++|+.|++...+.++|||+|+++|+|+|.||||.+++++++..|+.+++|+..+ +++++| T Consensus 161 ~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 235 (295) T TIGR00388 161 LERLALYLQGVENVLDLEWSDGPLGKLTYGDVFLEGEYEQSTYNFETADVDLLFELFSLYEKEALKLLALENGLP----- 235 (295) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCC----- T ss_conf 788888762011233311035655531022333310122211001112378999999877777777765303566----- Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 553013828999999999878883875017889999999999999999999854516 Q gi|254780757|r 249 HHLCVFPAYDQCIKASHIFNLLDARGVISTTERQRYILRIRSLAKACGEAFLMTPNG 305 (307) Q Consensus 249 ~~~lvlPAYD~~LKaSH~FNLLDARGaISVTERq~YI~RIR~Lak~~A~ayl~~re~ 305 (307) +|+||+++||+|.||+|||||+|+|+||++||+|||++++.|+..|.++++. T Consensus 236 -----~p~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (295) T TIGR00388 236 -----LPAYDYVLKCSHSFNLLDARGAISVTERTRYILRIRNLAKGVAELYYEEREA 287 (295) T ss_pred -----CHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf -----4046777775433322210000212233457888888888889887754554 No 6 >PRK13902 alaS lanyl-tRNA synthetase; Provisional Probab=98.14 E-value=1.2e-05 Score=62.07 Aligned_cols=176 Identities=26% Similarity=0.409 Sum_probs=121.6 Q ss_pred CCCCCHHHHHHHHHHHHHHCCCEEEECCCCCC-----------CCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCCCCC Q ss_conf 67777999999999999866958961167543-----------3322485789874483440167641557887655466 Q gi|254780757|r 8 KNDLSFQNIILTLTQYWAQQGCTILQPYDMEV-----------GAGTFHPSTTLRALGPLSWKAAYVQPSRRPLDGRYAE 76 (307) Q Consensus 8 ~~~~~fq~ii~~L~~fW~~~GC~i~qpyd~e~-----------GAgT~hp~T~lr~lg~~pw~~aYvqPsrRP~DgRYGe 76 (307) +.+.+.-+|=.+--+|..++|+.++.+|.+=- |--.|-|.-++...-|..-+++-.|||.|-.|= | T Consensus 55 ~~~~~~~eiR~~Fl~fF~~~gH~~v~s~plvp~~~ddllftnAgm~~Fkp~~~~G~~~pp~~~~~~sQkCiR~nDl---e 131 (898) T PRK13902 55 SKKYSLDEMREKFLSFFEKRGHTRIKRYPVVARWRDDVYLTIASIYDFQPWVTSGIVPPPANPLVISQPCIRLNDI---D 131 (898) T ss_pred CCCCCHHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCEEEEECCCHHCHHHHCCCCCCCCCCCCCCCCCCEECCCH---H T ss_conf 6767899999999999976899891785637899998057603513240443388758999972565223004767---5 Q ss_pred CCCCC-CEEEEEEEEECCCCHH----------HHHHHHHHH-HHCCCCCCCCCEEEEECCCCCCHHHHCCCCEEEEECCE Q ss_conf 77500-1144445575798335----------899999999-98199962171798832888700110123217688787 Q gi|254780757|r 77 NPNRL-QHYYQFQVIIKPNPLN----------LQNLYIESL-KAVGIDPLIHDVRFVEDNWESPTLGAWGLGWECWCDGM 144 (307) Q Consensus 77 NPNRl-q~y~QfQVilKPsp~n----------iq~lYl~SL-~~iGid~~~hDIrFveDnWEsPtLGAwGlGWEvwldGM 144 (307) |--+- .||.=|.-+=-=|-.+ ..++--+=| +.+||++.. |-|.||=|+-- |--|=--|+--+|+ T Consensus 132 nVG~t~rHhTfFEMlGn~SFg~~d~~~YfK~eaI~~awe~lt~~lgi~~~~--i~~ken~W~~g--GpcGPcsEi~~~Gl 207 (898) T PRK13902 132 NVGRTGRHLTSFEMMAHHAFNYPDKEVYWKDETVELCFEFFTKELGIDPEE--ITFKESPWEGG--GNAGPCFEVLVRGL 207 (898) T ss_pred HCCCCCCCCHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHH--EEEECCCCCCC--CCCCCCEEEEECCE T ss_conf 438887743545237654148997564707899999999987012878799--45624574789--99988635651979 Q ss_pred EEHH--HHHHHH-HCC--CCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEEC Q ss_conf 4147--778998-668--257751456787899999998699821133206 Q gi|254780757|r 145 EISQ--FTYFQQ-VCG--IECSPISGEITYGLERLAMYVQNVNSVYDIVFN 190 (307) Q Consensus 145 EItQ--FTYFQQ-vGG--i~c~pv~~EiTYGLERiaMylQ~vdnvyDl~wn 190 (307) ||-- |+=|.+ -+| .++.--.+.--|||||||+.+||+.+.||..+. T Consensus 208 EiwNlVFmq~~~~~~g~~~~Lp~k~IDTGmGLERia~vlQg~~tnydt~F~ 258 (898) T PRK13902 208 ELATLVFMQYKKDKGGKYVEMPLKIVDTGYGLERIAWASQGTPTAYDAVFG 258 (898) T ss_pred EEEEEECCCCEECCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCHHHHHHH T ss_conf 744220234113699973308777634685699999999099953776428 No 7 >PRK01584 alanyl-tRNA synthetase; Provisional Probab=97.77 E-value=0.00014 Score=54.47 Aligned_cols=167 Identities=20% Similarity=0.264 Sum_probs=108.5 Q ss_pred CCHHHHHHHHHHHHHHCCCEEEECCCCC---------CCC--CCCCHHHHHHHCCCCCCEEEEECCCCCCCCC-CCCCCC Q ss_conf 7799999999999986695896116754---------333--2248578987448344016764155788765-546677 Q gi|254780757|r 11 LSFQNIILTLTQYWAQQGCTILQPYDME---------VGA--GTFHPSTTLRALGPLSWKAAYVQPSRRPLDG-RYAENP 78 (307) Q Consensus 11 ~~fq~ii~~L~~fW~~~GC~i~qpyd~e---------~GA--gT~hp~T~lr~lg~~pw~~aYvqPsrRP~Dg-RYGeNP 78 (307) |+-.+|=.+--+|..++|+.+..++.+= +.| -.|-|. |+...-|..=+++-+|+|-|-.|= --| . T Consensus 1 mt~~eiR~~Fl~fF~~kgH~~v~ssslvp~~dptllftnAGm~~Fk~~-f~G~~~p~~~r~~~~QkCiR~~DldnVG--~ 77 (593) T PRK01584 1 MTLDELRKKYIDFFKSKGHVEIAGKSLVPENDPTVLFNTAGMQPLVPY-LLGEPHPSGDMLVNVQKCLRTGDIDEVG--D 77 (593) T ss_pred CCHHHHHHHHHHHHHHCCCEECCCCCCCCCCCCCEEEEECCCCCCCHH-HCCCCCCCCCCCCCCCCEEECCCHHHCC--C T ss_conf 988999999999998789888378771779999835740451017123-2588899999745675313337563368--8 Q ss_pred CCCCEEEEEEEEE--------------------------CCCCH-----------------HHHHHHHHHHHHCCCCCCC Q ss_conf 5001144445575--------------------------79833-----------------5899999999981999621 Q gi|254780757|r 79 NRLQHYYQFQVII--------------------------KPNPL-----------------NLQNLYIESLKAVGIDPLI 115 (307) Q Consensus 79 NRlq~y~QfQVil--------------------------KPsp~-----------------niq~lYl~SL~~iGid~~~ 115 (307) .| ||.=|..+= +=.|+ ..-++. +.|||. . T Consensus 78 ~~--H~TfFEMLGn~SFgdYfK~eaI~~awe~lt~~~~l~l~~~rl~vtv~~~D~~~~~D~E~~~iW----~~~Gi~--~ 149 (593) T PRK01584 78 LS--HLTFFEMLGNWSLGAYFKEESVKYSFEFLTSPDYLNIPKDKLYVSVFEGDEDIPRDEETAKIW----ESLGIS--K 149 (593) T ss_pred CC--CCHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHEEEEEEECCCCCCCCHHHHHHH----HHCCCC--H T ss_conf 87--527775054452331148999999999806632048883247999960587665448899999----981998--9 Q ss_pred CCEEEE--ECCCCCC--HHHHCCCCEEEEECC------------------EEEHH--HHHH-HHHCC--CCCCCCHHHHH Q ss_conf 717988--3288870--011012321768878------------------74147--7789-98668--25775145678 Q gi|254780757|r 116 HDVRFV--EDNWESP--TLGAWGLGWECWCDG------------------MEISQ--FTYF-QQVCG--IECSPISGEIT 168 (307) Q Consensus 116 hDIrFv--eDnWEsP--tLGAwGlGWEvwldG------------------MEItQ--FTYF-QQvGG--i~c~pv~~EiT 168 (307) ..|-+. +|||=|| -.|.-|=-=|+-.|+ +||-- |+=| ++-.| .++.--.+.-- T Consensus 150 ~~I~~~~~~dNfW~~~G~~GPCGPcsEi~~d~g~~~~~~~~~~~~d~~r~lEIWNlVFmqy~r~~~G~~~pLp~k~VDTG 229 (593) T PRK01584 150 DRIFYLSREHNFWGPVGETGPCGPDTEIFVDTGKPKCSVNCRITCSCGKYFEIWNNVFMQYNKDEDGNYEELKRKCVDTG 229 (593) T ss_pred HHEEECCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEEECEEECCCCCEEECCCCCCCCC T ss_conf 99464576677100389988998757899746777667788987888876998654113521269997216887740368 Q ss_pred HHHHHHHHHHCCCCCEEEEE Q ss_conf 78999999986998211332 Q gi|254780757|r 169 YGLERLAMYVQNVNSVYDIV 188 (307) Q Consensus 169 YGLERiaMylQ~vdnvyDl~ 188 (307) +||||||+.+|++.|+||.. T Consensus 230 mGLERi~~vlQg~~snYdtD 249 (593) T PRK01584 230 MGIERTIAFLQGKSSVYDTD 249 (593) T ss_pred CCHHHHHHHHCCCCCCCCCC T ss_conf 68999999982999742031 No 8 >PRK00252 alaS alanyl-tRNA synthetase; Reviewed Probab=97.48 E-value=0.0072 Score=41.97 Aligned_cols=173 Identities=25% Similarity=0.304 Sum_probs=103.1 Q ss_pred CCCHHHHHHHHHHHHHHCCCEEEECCCCC-----------CCCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCC Q ss_conf 77799999999999986695896116754-----------3332248578987448344016764155788765546677 Q gi|254780757|r 10 DLSFQNIILTLTQYWAQQGCTILQPYDME-----------VGAGTFHPSTTLRALGPLSWKAAYVQPSRRPLDGRYAENP 78 (307) Q Consensus 10 ~~~fq~ii~~L~~fW~~~GC~i~qpyd~e-----------~GAgT~hp~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNP 78 (307) .|+-.+|=.+--+|..++|..+..+|.+= -|-..|-| -|+...-|..-+++-.|+|-|-.|= +|- T Consensus 1 ~m~~~eiR~~fl~fF~~k~H~~v~s~~lvp~~DptllftnAGm~~fk~-~f~G~~~p~~~r~~~~QkCiR~nDl---d~V 76 (864) T PRK00252 1 MMTTAEIRQAFLDFFESKGHTVVPSSSLVPKNDPTLLFTNAGMVQFKD-YFLGLEKPPYPRATTSQKCIRTNDL---ENV 76 (864) T ss_pred CCCHHHHHHHHHHHHHHCCCEEECCCCCCCCCCCCEEEEECCCCCCHH-HHCCCCCCCCCCCCCCCCCEECCCH---HHC T ss_conf 997799999999999878988807877255899804675036401622-3258878999973454034003648---763 Q ss_pred CC-CCEEEEEEEEECCCCHH------------------------H--------HHHHHHHHHHCCCCCCCCCEEEE-ECC Q ss_conf 50-01144445575798335------------------------8--------99999999981999621717988-328 Q gi|254780757|r 79 NR-LQHYYQFQVIIKPNPLN------------------------L--------QNLYIESLKAVGIDPLIHDVRFV-EDN 124 (307) Q Consensus 79 NR-lq~y~QfQVilKPsp~n------------------------i--------q~lYl~SL~~iGid~~~hDIrFv-eDn 124 (307) -| -.||.=|..+=-=|-.| + +|-|--=++.+||++. |=++|- +|| T Consensus 77 G~t~rH~TfFEMlGN~SfGdYfK~eai~~awe~lt~~~~l~~~~l~vTv~~~D~e~~~iW~~~~gi~~~-~I~~~~~~dN 155 (864) T PRK00252 77 GYTARHHTFFEMLGNFSFGDYFKKEAIEFAWELLTSVLGLPKEKLYVTVYEDDDEAYDIWKKEIGVPPE-RIIRIGASDN 155 (864) T ss_pred CCCCCCCHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHEEEEEECCCHHHHHHHHHCCCCCHH-HEEECCCCCC T ss_conf 888775254423765665510588999999999875418977995999937878999999870299889-9353675456 Q ss_pred -CCCCHHHHCCCCEEEEEC-------C------------EEEH--HHHHHHHHCCCCCCC---CHHHHHHHHHHHHHHHC Q ss_conf -887001101232176887-------8------------7414--777899866825775---14567878999999986 Q gi|254780757|r 125 -WESPTLGAWGLGWECWCD-------G------------MEIS--QFTYFQQVCGIECSP---ISGEITYGLERLAMYVQ 179 (307) Q Consensus 125 -WEsPtLGAwGlGWEvwld-------G------------MEIt--QFTYFQQvGGi~c~p---v~~EiTYGLERiaMylQ 179 (307) ||--..|.-|--=|+.-| | +||- =|+=|.+-..=...| -.+.--+||||||+.+| T Consensus 156 fW~mG~~GpcGPcsEI~yD~g~~~~~~~~~~~~~d~~r~lEiwNlVFmQynr~~~g~~~~Lp~k~IDTGmGLERl~~vlQ 235 (864) T PRK00252 156 FWSMGDTGPCGPCSEIFYDRGEEYWGGPPGSPEEDGDRYIEIWNLVFMQFNRDADGNLTPLPKKNIDTGMGLERIAAILQ 235 (864) T ss_pred CCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCEEEEEEEHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHC T ss_conf 31148986998765789850652378998877667764566521010154028899744488887446767999999984 Q ss_pred CCCCEEEE Q ss_conf 99821133 Q gi|254780757|r 180 NVNSVYDI 187 (307) Q Consensus 180 ~vdnvyDl 187 (307) |+.++||. T Consensus 236 g~~snYdt 243 (864) T PRK00252 236 GVHSNYET 243 (864) T ss_pred CCCCCCCH T ss_conf 99876020 No 9 >cd00768 class_II_aaRS-like_core Class II tRNA amino-acyl synthetase-like catalytic core domain. Class II amino acyl-tRNA synthetases (aaRS) share a common fold and generally attach an amino acid to the 3' OH of ribose of the appropriate tRNA. PheRS is an exception in that it attaches the amino acid at the 2'-OH group, like class I aaRSs. These enzymes are usually homodimers. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. The substrate specificity of this reaction is further determined by additional domains. Intererestingly, this domain is also found is asparagine synthase A (AsnA), in the accessory subunit of mitochondrial polymerase gamma and in the bacterial ATP phosphoribosyltransferase regulatory subunit HisZ. Probab=97.15 E-value=0.0024 Score=45.43 Aligned_cols=136 Identities=26% Similarity=0.316 Sum_probs=94.5 Q ss_pred HHHHHHHHHHHHHCCCEEEECCCCC-------------------------CC-CCCCCHH---HHHHHCCCCCCEEEEEC Q ss_conf 9999999999986695896116754-------------------------33-3224857---89874483440167641 Q gi|254780757|r 14 QNIILTLTQYWAQQGCTILQPYDME-------------------------VG-AGTFHPS---TTLRALGPLSWKAAYVQ 64 (307) Q Consensus 14 q~ii~~L~~fW~~~GC~i~qpyd~e-------------------------~G-AgT~hp~---T~lr~lg~~pw~~aYvq 64 (307) +.|...+.++..+.|+.-+++-.+. .. ++|.-|. .+-..+..-|.+.+.+. T Consensus 3 ~~ie~~~r~~~~~~G~~ev~~P~l~~~~~~~~sg~~~~~~~~~~~~~~~~~~L~pt~e~~~~~~~~~~~~dLP~r~~~~~ 82 (211) T cd00768 3 SKIEQKLRRFMAELGFQEVETPIVEREPLLEKAGHEPKDLLPVGAENEEDLYLRPTLEPGLVRLFVSHIRKLPLRLAEIG 82 (211) T ss_pred HHHHHHHHHHHHHCCCEEEECCEECCHHHHHHCCCCHHHHHHHCCCCCCEEEEECCCCHHHHHHHHHHHHHCCHHHHEEC T ss_conf 89999999999986998999983427999975067622230110467976999516849999997524765777864320 Q ss_pred CCCCCCCCCCCCCCCCCCEEEEEEEEECCCCH-------HHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCE Q ss_conf 55788765546677500114444557579833-------58999999999819996217179883288870011012321 Q gi|254780757|r 65 PSRRPLDGRYAENPNRLQHYYQFQVIIKPNPL-------NLQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGW 137 (307) Q Consensus 65 PsrRP~DgRYGeNPNRlq~y~QfQVilKPsp~-------niq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGW 137 (307) ||.|.-.|..| ..|+.++.|....+--.|+ ++.+++.+-++.+|++ ++.++|++.|.+-.-++.|..+ T Consensus 83 ~~fR~E~~~~G--l~RvreF~~~e~~~f~~~~~~~~~~~~~~~~~~~i~~~lgl~---~~~~~~~~~~~~~~~~~a~~~~ 157 (211) T cd00768 83 PAFRNEGGRRG--LRRVREFTQLEGEVFGEDGEEASEFEELIELTEELLRALGIK---LDIVFVEKTPGEFSPGGAGPGF 157 (211) T ss_pred CEECCCCCCCC--CCCCCCEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHCCCC---CCEEEEEECCCCCCCCCCEEEE T ss_conf 05605788888--446046889847999884888999999999999999974998---8869999778676664523545 Q ss_pred EEEEC-----CEEEHHHHHHHH Q ss_conf 76887-----874147778998 Q gi|254780757|r 138 ECWCD-----GMEISQFTYFQQ 154 (307) Q Consensus 138 Evwld-----GMEItQFTYFQQ 154 (307) ++... +.||+...++.+ T Consensus 158 Die~~~p~~~~~Ev~s~~~~~d 179 (211) T cd00768 158 EIEVDHPEGRGLEIGSGGYRQD 179 (211) T ss_pred EEEEECCCCCEEEEECEECCCC T ss_conf 5789837897999935461677 No 10 >COG0013 AlaS Alanyl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=96.96 E-value=0.0031 Score=44.61 Aligned_cols=176 Identities=22% Similarity=0.234 Sum_probs=116.1 Q ss_pred CCCCHHHHHHHHHHHHHHCCCEEEECCCCC---------C--CCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCC Q ss_conf 777799999999999986695896116754---------3--33224857898744834401676415578876554667 Q gi|254780757|r 9 NDLSFQNIILTLTQYWAQQGCTILQPYDME---------V--GAGTFHPSTTLRALGPLSWKAAYVQPSRRPLDGRYAEN 77 (307) Q Consensus 9 ~~~~fq~ii~~L~~fW~~~GC~i~qpyd~e---------~--GAgT~hp~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeN 77 (307) .+|+--+|=.+--+|++++|-.++.++.+= + |-.+|-|. |+....|.+-+++-.|||.|-.|= || T Consensus 4 ~~~t~~EiR~~FL~FF~~kgH~~v~s~slVP~nDptLLftnAGm~~FK~~-f~g~v~p~~~r~~~sQkcIR~NDi---eN 79 (879) T COG0013 4 MKLTTNEIRQKFLDFFEKKGHTVVPSSPLVPRNDPTLLFTNAGMVQFKPY-FTGGVTPPANRAVTSQKCIRTNDI---DN 79 (879) T ss_pred CCCCHHHHHHHHHHHHHHCCCEECCCCCCCCCCCCCEEEEECCCCCCHHH-HCCCCCCCCCCEECCCCCCCCCCH---HH T ss_conf 65779999999999998779734578886789999768960463014113-217888999970010212014766---54 Q ss_pred CCCCC-EEEEEEEEECCCCHH--------------------------------HHHHHHHHHHHCCCCCCCCCEEE-EEC Q ss_conf 75001-144445575798335--------------------------------89999999998199962171798-832 Q gi|254780757|r 78 PNRLQ-HYYQFQVIIKPNPLN--------------------------------LQNLYIESLKAVGIDPLIHDVRF-VED 123 (307) Q Consensus 78 PNRlq-~y~QfQVilKPsp~n--------------------------------iq~lYl~SL~~iGid~~~hDIrF-veD 123 (307) --+-- ||.=|..+=-=|-.+ =.|-|-.+.+.+||. .+|=||+ -+| T Consensus 80 VG~T~RHhTfFEMLGNfSFGdYFKeeAI~~AwEflT~~lgl~~ekL~vtvy~~Ddea~~~W~~~~gip-~~rIir~~~~d 158 (879) T COG0013 80 VGYTARHHTFFEMLGNFSFGDYFKEEAIEFAWEFLTKVLGLPKEKLYVTVYEDDDEAYNEWEKIIGIP-PERIIRIGASD 158 (879) T ss_pred CCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHEEEEEECCCHHHHHHHHHHCCCC-HHHEEECCCCC T ss_conf 37665410677765057066778899999999998745089878979999458667888887632998-89845357678 Q ss_pred CCCCCHHHHCCCCEEEEECC-------------------EEEHH--HHHHHHH-CCC---CCCCCHHHHHHHHHHHHHHH Q ss_conf 88870011012321768878-------------------74147--7789986-682---57751456787899999998 Q gi|254780757|r 124 NWESPTLGAWGLGWECWCDG-------------------MEISQ--FTYFQQV-CGI---ECSPISGEITYGLERLAMYV 178 (307) Q Consensus 124 nWEsPtLGAwGlGWEvwldG-------------------MEItQ--FTYFQQv-GGi---~c~pv~~EiTYGLERiaMyl 178 (307) ||=+.-.|.-|---||.-|+ +||-- |.=|-+. ++- ++.--++.--||||||+-.+ T Consensus 159 NfW~~G~GPcGPcsEI~yD~G~~~~~~~~~~~~~d~dR~lEiwNLVFmQfnr~~~~g~~~~Lp~k~IDTGmGLERi~~vl 238 (879) T COG0013 159 NFWEMGDGPCGPCSEIFYDRGEEIGGGPPGSPDEDGDRYLEIWNLVFMQYNRDPGDGNYTPLPKKNIDTGMGLERIAAVL 238 (879) T ss_pred CCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCEEEEEEEEEHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHH T ss_conf 87778886888855899767766688877886777882689756430110137999875518887715686699999998 Q ss_pred CCCCCEEEEEE Q ss_conf 69982113320 Q gi|254780757|r 179 QNVNSVYDIVF 189 (307) Q Consensus 179 Q~vdnvyDl~w 189 (307) |++.++||+.. T Consensus 239 Q~~~tnyd~dl 249 (879) T COG0013 239 QGVPTNYDTDL 249 (879) T ss_pred CCCCCCHHHHH T ss_conf 49996214440 No 11 >cd00673 AlaRS_core Alanyl-tRNA synthetase (AlaRS) class II core catalytic domain. AlaRS is a homodimer. It is responsible for the attachment of alanine to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its predicted structure and the presence of three characteristic sequence motifs. Probab=96.65 E-value=0.013 Score=40.07 Aligned_cols=164 Identities=20% Similarity=0.160 Sum_probs=95.2 Q ss_pred HHHHHHHHHHHHCCCEEEECCCCC---------CCCCCCCHHH-HHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEE Q ss_conf 999999999986695896116754---------3332248578-987448344016764155788765546677500114 Q gi|254780757|r 15 NIILTLTQYWAQQGCTILQPYDME---------VGAGTFHPST-TLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHY 84 (307) Q Consensus 15 ~ii~~L~~fW~~~GC~i~qpyd~e---------~GAgT~hp~T-~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y 84 (307) +|-.+--+|..++|..++.++.+= +.||--.-.- |+..--|.--+++-+|+|.|-.|=-=--.-+ .|| T Consensus 3 eiR~~Fl~fF~~~gH~~vpssslvp~~DptllFtnAGM~~Fkp~flG~~~p~~~r~~~~QkCiR~~di~nvG~t~--rH~ 80 (232) T cd00673 3 EIRETFLSFFEKKGHTRVPSSPVVPRDDPTLLFTNAGMNQFKPIFLGEVPPPANRLVNSQKCIRAGDIDNVGKTG--RHH 80 (232) T ss_pred HHHHHHHHHHHHCCCEECCCCCCCCCCCCCEEEEECCHHHHHHHHCCCCCCCCCCCEEEEECCCCCCCCCCCCCC--CCC T ss_conf 899999999987898882787722389995014411468889986778899987510000123657832467777--640 Q ss_pred EEEEEEECCCCHH--------------------------------HHHHHHHHHHHCCCCCCCCCEEEE-ECCCCCC-HH Q ss_conf 4445575798335--------------------------------899999999981999621717988-3288870-01 Q gi|254780757|r 85 YQFQVIIKPNPLN--------------------------------LQNLYIESLKAVGIDPLIHDVRFV-EDNWESP-TL 130 (307) Q Consensus 85 ~QfQVilKPsp~n--------------------------------iq~lYl~SL~~iGid~~~hDIrFv-eDnWEsP-tL 130 (307) .=|+.+=-=|-.+ -.|.+-.=++.+|| +..+-+||. +|||=+. -- T Consensus 81 TfFEMLGnfSFGdYfK~eaI~~awe~lt~~~~l~~~rlyvTv~~~Dde~~~~~~~~~gi-~~~~i~~~~~~dNfW~~G~~ 159 (232) T cd00673 81 TFFEMLGNFSFGDYFKEEAIAFAWELLTEVLGLPKDRLYVSVFEGDDEEEAIWWWKIGL-PGIRIERIGFKDNFWEMGGN 159 (232) T ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCHHHHHHHHHHCCC-CHHHEEECCCCCCCCCCCCC T ss_conf 23330444203145899999999998751237685624266746977999999974199-88983466776667869998 Q ss_pred HHCCCCEEEEECC-----------------EEEHHH--HHH-HHHCC--CCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 1012321768878-----------------741477--789-98668--257751456787899999998699 Q gi|254780757|r 131 GAWGLGWECWCDG-----------------MEISQF--TYF-QQVCG--IECSPISGEITYGLERLAMYVQNV 181 (307) Q Consensus 131 GAwGlGWEvwldG-----------------MEItQF--TYF-QQvGG--i~c~pv~~EiTYGLERiaMylQ~v 181 (307) |.-|=-=|+-.|+ +||--. +=| ++-.| .++.--.+.--+|||||||.+|+| T Consensus 160 GPcGPcsEI~yd~g~~~~~~~~~~~~~~r~lEiwNlVFmqy~r~~dg~l~~Lp~k~IDTGmGlERl~~vlQgv 232 (232) T cd00673 160 GPCGPCSEIFYDRGEERDAASLPNEDDDRYLEIWNLVFMQYNRDADGTYRPLPKKIVDTGMGLERLVWVLQGV 232 (232) T ss_pred CCCCCCEEEEECCCCCCCCCCCCCCCCCCEEEEEEEEEEEEEECCCCCCCCCCCCEECCCCCHHHHHHHHCCC T ss_conf 7998767789757777788657788899889986731375200699976508998141683799999998379 No 12 >PRK10748 flavin mononucleotide phosphatase; Provisional Probab=94.39 E-value=0.025 Score=38.01 Aligned_cols=82 Identities=22% Similarity=0.452 Sum_probs=59.6 Q ss_pred HHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEE------CCCCHHHHHHHHHHHHHCCCCCCCCCEEEE Q ss_conf 89874483440167641557887655466775001144445575------798335899999999981999621717988 Q gi|254780757|r 48 TTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVII------KPNPLNLQNLYIESLKAVGIDPLIHDVRFV 121 (307) Q Consensus 48 T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVil------KPsp~niq~lYl~SL~~iGid~~~hDIrFv 121 (307) ..|..|.. .++.+-|-- |.--..--.|.+|+.+-|+= ||.| ++|..-|+.+|+.|. .+-+| T Consensus 120 ~~L~~L~~-~y~L~iITN------G~~~~q~~gL~~~Fd~vi~See~G~~KP~~----~IF~~Al~~lg~~pe--e~l~V 186 (238) T PRK10748 120 DTLKQLAK-KWPLVAITN------GNAQPELFGLGDYFEFVLRAGPHGRSKPFS----DMYHLAAEKLNVPIG--EILHV 186 (238) T ss_pred HHHHHHHH-CCEEEEEEC------CCHHHHHCCHHHHHHHHCCCCCCCCCCCCH----HHHHHHHHHCCCCHH--HEEEE T ss_conf 99999973-471899957------942787638677516624412148899897----999999998298989--94543 Q ss_pred ECCCCCCHHHHCCCCEE-EEEC Q ss_conf 32888700110123217-6887 Q gi|254780757|r 122 EDNWESPTLGAWGLGWE-CWCD 142 (307) Q Consensus 122 eDnWEsPtLGAwGlGWE-vwld 142 (307) -|+|++--.||-..||. ||+| T Consensus 187 GD~~~~DI~GA~~aGm~~vW~N 208 (238) T PRK10748 187 GDDLTTDVAGAIRCGMQACWIN 208 (238) T ss_pred CCCCHHHHHHHHHCCCEEEEEC T ss_conf 5881887898998798699978 No 13 >PRK09449 nucleotidase; Provisional Probab=91.34 E-value=0.27 Score=30.52 Aligned_cols=58 Identities=31% Similarity=0.593 Sum_probs=45.6 Q ss_pred CCEEEEEEEE------ECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCEE-EEECC Q ss_conf 0114444557------579833589999999998199962171798832888700110123217-68878 Q gi|254780757|r 81 LQHYYQFQVI------IKPNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGWE-CWCDG 143 (307) Q Consensus 81 lq~y~QfQVi------lKPsp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGWE-vwldG 143 (307) |.+|+-.=|+ -||.|+ +|...|+.+|+.+ ...+-||-|+|++--+||-..||. ||+|- T Consensus 133 L~~~Fd~iv~Se~~g~~KPdp~----iF~~al~~l~~~~-~e~~l~VGDs~~~Di~gA~~aG~~tvw~N~ 197 (225) T PRK09449 133 LRDYFDLLVISEQVGVAKPDKK----IFDYALEQMGNPD-RSRVLMVGDNLHSDILGGINAGIDTCWLNA 197 (225) T ss_pred HHHHCCEEEEECCCCCCCCCHH----HHHHHHHHCCCCC-HHHEEEECCCCCHHHHHHHHCCCEEEEECC T ss_conf 6651466886334576788749----9999999849998-688587568730467989987995999899 No 14 >PRK05159 aspC aspartyl-tRNA synthetase; Provisional Probab=89.18 E-value=0.41 Score=29.26 Aligned_cols=40 Identities=18% Similarity=0.309 Sum_probs=18.3 Q ss_pred EEEHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEEC Q ss_conf 74147778998668257751456787899999998699821133206 Q gi|254780757|r 144 MEISQFTYFQQVCGIECSPISGEITYGLERLAMYVQNVNSVYDIVFN 190 (307) Q Consensus 144 MEItQFTYFQQvGGi~c~pv~~EiTYGLERiaMylQ~vdnvyDl~wn 190 (307) -+.-.++||.+-. --++|.|+ | ++|+. .-|.+-||.|-=+ T Consensus 173 a~~f~~~~~~~~~---yL~~Spql-y-~q~li--~~G~~rvfeI~~~ 212 (434) T PRK05159 173 TELFPVKYFEKEA---FLAQSPQL-Y-KQMMM--AAGFERVFEIGPA 212 (434) T ss_pred CCCCCEEECCCEE---EECCCCHH-H-HHHHH--HCCCCCEEEECCC T ss_conf 5658613127533---41468379-9-99987--6356855996601 No 15 >PRK00476 aspS aspartyl-tRNA synthetase; Validated Probab=89.01 E-value=0.38 Score=29.50 Aligned_cols=19 Identities=5% Similarity=0.302 Sum_probs=13.7 Q ss_pred HHHHHHHHHHHHCCCEEEE Q ss_conf 9999999999866958961 Q gi|254780757|r 15 NIILTLTQYWAQQGCTILQ 33 (307) Q Consensus 15 ~ii~~L~~fW~~~GC~i~q 33 (307) .++..+.+|..++|.+=.. T Consensus 147 ~i~~~iR~fl~~~gFiEVe 165 (587) T PRK00476 147 KVTSAIRNFLDDNGFLEIE 165 (587) T ss_pred HHHHHHHHHHHHCCCCEEC T ss_conf 9999999999975974004 No 16 >PRK12820 bifunctional aspartyl-tRNA synthetase/aspartyl/glutamyl-tRNA amidotransferase subunit C; Provisional Probab=87.74 E-value=0.69 Score=27.60 Aligned_cols=111 Identities=21% Similarity=0.290 Sum_probs=68.6 Q ss_pred HHHCCCCEEEEECCEEEH-----------HHHHHHHHCCCC----------------CCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 110123217688787414-----------777899866825----------------77514567878999999986998 Q gi|254780757|r 130 LGAWGLGWECWCDGMEIS-----------QFTYFQQVCGIE----------------CSPISGEITYGLERLAMYVQNVN 182 (307) Q Consensus 130 LGAwGlGWEvwldGMEIt-----------QFTYFQQvGGi~----------------c~pv~~EiTYGLERiaMylQ~vd 182 (307) +.+-+-.++..|||.||. |-.-|..+|=-+ =-|..|-|..||+||.|.|-|.+ T Consensus 484 ~~v~a~aYDLVlNG~ElggGSiRIHd~~iQ~~if~~lGl~~ee~~~kFgf~LeAf~YGaPPHGGialGlDRLvMlLtg~~ 563 (706) T PRK12820 484 LDLRSRAYDLVVNGEELGGGSIRINDKDIQLRIFAALGLSEEDIEDKFGFFLRAFDFAAPPHGGIALGLDRVVSMILQTP 563 (706) T ss_pred CHHHCCEEEEEECCEEEEEEEEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEHHHHHHHHHCCCC T ss_conf 20003646068999996140220389999999999759997898889999999986699986703103999999976998 Q ss_pred CEEEEEE------------CCCCC---C-----C-CCCC-------H----HHHHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 2113320------------68887---8-----8-3110-------6----24474699777635017989999999999 Q gi|254780757|r 183 SVYDIVF------------NAIEG---Q-----N-VLYG-------D----IFAQSEQEYSRYNFEYANPEILHNHFIDS 230 (307) Q Consensus 183 nvyDl~w------------n~~~~---~-----~-vtYG-------d----if~q~E~E~S~YNFE~Ad~~~L~~~F~~y 230 (307) ||-|.+= +.++. + + +.-| | .-..|--.-|+-.|..+.-...-+.|++. T Consensus 564 sIRDVIaFPKt~~g~dlm~~aPs~v~~~QL~El~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 643 (706) T PRK12820 564 SIREVIAFPKNRSAACPLTGAPSEVAQEQLAELGLLDLGDADQLPGDAEKEDLIDHLSWVSRIGFAEAERAAIESALADA 643 (706) T ss_pred CHHHEECCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 26505227899887860027988689999976377424775546665550345778888875237856778899999999 Q ss_pred HHHHHHHHHH Q ss_conf 9999999850 Q gi|254780757|r 231 EKECLDLLKK 240 (307) Q Consensus 231 E~Ea~rLle~ 240 (307) ++-+..|..- T Consensus 644 ~~~~~~~~~~ 653 (706) T PRK12820 644 EELAAQLEDI 653 (706) T ss_pred HHHHHHHHHC T ss_conf 9999998744 No 17 >COG1011 Predicted hydrolase (HAD superfamily) [General function prediction only] Probab=86.85 E-value=0.64 Score=27.85 Aligned_cols=87 Identities=26% Similarity=0.511 Sum_probs=61.5 Q ss_pred HHHHHHCCCCCCEEEEECCC-CCCCCCCCCCCCCCCCEEEEEEEE------ECCCCHHHHHHHHHHHHHCCCCCCCCCEE Q ss_conf 78987448344016764155-788765546677500114444557------57983358999999999819996217179 Q gi|254780757|r 47 STTLRALGPLSWKAAYVQPS-RRPLDGRYAENPNRLQHYYQFQVI------IKPNPLNLQNLYIESLKAVGIDPLIHDVR 119 (307) Q Consensus 47 ~T~lr~lg~~pw~~aYvqPs-rRP~DgRYGeNPNRlq~y~QfQVi------lKPsp~niq~lYl~SL~~iGid~~~hDIr 119 (307) ..+|+.++.+ ++++=+--. +.-...+-..-+ |-.|+..-++ .||.| ++|...++.+|++ ..++- T Consensus 105 ~~~L~~l~~~-~~l~ilTNg~~~~~~~~l~~~g--l~~~Fd~v~~s~~~g~~KP~~----~~f~~~~~~~g~~--p~~~l 175 (229) T COG1011 105 LEALKELGKK-YKLGILTNGARPHQERKLRQLG--LLDYFDAVFISEDVGVAKPDP----EIFEYALEKLGVP--PEEAL 175 (229) T ss_pred HHHHHHCCCC-CCEEEEECCCCHHHHHHHHHCC--CHHHCCEEEEECCCCCCCCCH----HHHHHHHHHCCCC--CHHEE T ss_conf 9999971545-7389987898088999999768--576677899976689889999----9999999983989--20189 Q ss_pred EEECCCCCCHHHHCCCCEE-EEEC Q ss_conf 8832888700110123217-6887 Q gi|254780757|r 120 FVEDNWESPTLGAWGLGWE-CWCD 142 (307) Q Consensus 120 FveDnWEsPtLGAwGlGWE-vwld 142 (307) ||.|+.++--+||--+||. ||++ T Consensus 176 ~VgD~~~~di~gA~~~G~~~vwi~ 199 (229) T COG1011 176 FVGDSLENDILGARALGMKTVWIN 199 (229) T ss_pred EECCCHHHHHHHHHHHCCEEEEEC T ss_conf 975987899999998198899863 No 18 >COG0173 AspS Aspartyl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=86.69 E-value=0.67 Score=27.71 Aligned_cols=64 Identities=23% Similarity=0.383 Sum_probs=51.4 Q ss_pred CCCHHHHCCCCEEEEECCEEEH-----------HHHHHHHHCCCCC----------------CCCHHHHHHHHHHHHHHH Q ss_conf 8700110123217688787414-----------7778998668257----------------751456787899999998 Q gi|254780757|r 126 ESPTLGAWGLGWECWCDGMEIS-----------QFTYFQQVCGIEC----------------SPISGEITYGLERLAMYV 178 (307) Q Consensus 126 EsPtLGAwGlGWEvwldGMEIt-----------QFTYFQQvGGi~c----------------~pv~~EiTYGLERiaMyl 178 (307) |+...-+-+-.+...|||-||. |=.-|.-+|=-+- .|.-|-|.+||.||.|-+ T Consensus 462 ~~~p~~~~a~aYDlVlNG~ElggGSiRIh~~eiQ~~vF~~lg~~~eea~ekFGFll~Af~yGaPPHgGiA~GlDRlvmll 541 (585) T COG0173 462 EADPESVRARAYDLVLNGYELGGGSIRIHDPEIQEKVFEILGISPEEAEEKFGFLLEAFKYGAPPHGGIAFGLDRLVMLL 541 (585) T ss_pred HCCHHHHHHHHCCEEECCEEECCCEEEECCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCEECCHHHHHHHH T ss_conf 16977764532567760275135436407889999999981999899998877899997658999762424599999998 Q ss_pred CCCCCEEEEEE Q ss_conf 69982113320 Q gi|254780757|r 179 QNVNSVYDIVF 189 (307) Q Consensus 179 Q~vdnvyDl~w 189 (307) -|.+|+-|++= T Consensus 542 ~g~~sIReVIA 552 (585) T COG0173 542 TGAESIRDVIA 552 (585) T ss_pred CCCCCHHHEEE T ss_conf 28974442131 No 19 >PRK00484 lysS lysyl-tRNA synthetase; Reviewed Probab=85.89 E-value=0.63 Score=27.91 Aligned_cols=74 Identities=14% Similarity=0.112 Sum_probs=38.6 Q ss_pred HHHHHHHHHHHHHCCCEEEECC---CCCCCCCCCCHHHHHHHC--------CCCCC----------EEEEECCCCCCCCC Q ss_conf 9999999999986695896116---754333224857898744--------83440----------16764155788765 Q gi|254780757|r 14 QNIILTLTQYWAQQGCTILQPY---DMEVGAGTFHPSTTLRAL--------GPLSW----------KAAYVQPSRRPLDG 72 (307) Q Consensus 14 q~ii~~L~~fW~~~GC~i~qpy---d~e~GAgT~hp~T~lr~l--------g~~pw----------~~aYvqPsrRP~Dg 72 (307) -.||..+-+|..++|++=...= .++=||.-.--.|..+++ .|+-+ ++-=+-||.|--|. T Consensus 175 s~ii~~iR~~l~~~gF~EVeTPiL~~~~gGA~ArpF~t~~n~l~~~~yL~~SPqLylk~l~vgG~ervfeI~r~FR~E~~ 254 (491) T PRK00484 175 SKIISAIRRFLDNRGFLEVETPMLQPIPGGAAARPFITHHNALDIDLYLRIAPELYLKRLIVGGFERVFEIGRNFRNEGI 254 (491) T ss_pred HHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCHHHHHHHHHHCCCHHHEEHHHHHHCCCC T ss_conf 99999999999867689986787766688756676556445668554420687898777876272022224876625755 Q ss_pred CCCCCCCCCCEEEEEEEEE Q ss_conf 5466775001144445575 Q gi|254780757|r 73 RYAENPNRLQHYYQFQVII 91 (307) Q Consensus 73 RYGeNPNRlq~y~QfQVil 91 (307) |- .++--+.|.-.-+ T Consensus 255 ~~----rH~pEFT~lE~e~ 269 (491) T PRK00484 255 DT----RHNPEFTMIEFYQ 269 (491) T ss_pred CC----CCCCCEEEEEEEE T ss_conf 43----4184010213677 No 20 >KOG3109 consensus Probab=85.40 E-value=1.6 Score=25.05 Aligned_cols=110 Identities=25% Similarity=0.278 Sum_probs=70.6 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHCCCCCCE--EEEECCCCCCCCCCCCCCCCCCCEE Q ss_conf 56777799999999999986695896116754333224857898744834401--6764155788765546677500114 Q gi|254780757|r 7 KKNDLSFQNIILTLTQYWAQQGCTILQPYDMEVGAGTFHPSTTLRALGPLSWK--AAYVQPSRRPLDGRYAENPNRLQHY 84 (307) Q Consensus 7 ~~~~~~fq~ii~~L~~fW~~~GC~i~qpyd~e~GAgT~hp~T~lr~lg~~pw~--~aYvqPsrRP~DgRYGeNPNRlq~y 84 (307) -|..+-..++++.|.+ ++|.+.- -|---|-.-.|+-||=+-+- +.|.+-. ||+ T Consensus 99 LkPD~~LRnlLL~l~~----r~k~~FT------Na~k~HA~r~Lk~LGieDcFegii~~e~~----------np~----- 153 (244) T KOG3109 99 LKPDPVLRNLLLSLKK----RRKWIFT------NAYKVHAIRILKKLGIEDCFEGIICFETL----------NPI----- 153 (244) T ss_pred CCCCHHHHHHHHHCCC----CCEEEEC------CCCHHHHHHHHHHHCHHHHCCCEEEEECC----------CCC----- T ss_conf 5998789999983741----3579841------77289999999985768760230675116----------877----- Q ss_pred EEEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCEEEEECCEEEHH Q ss_conf 4445575798335899999999981999621717988328887001101232176887874147 Q gi|254780757|r 85 YQFQVIIKPNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGWECWCDGMEISQ 148 (307) Q Consensus 85 ~QfQVilKPsp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGWEvwldGMEItQ 148 (307) ..+||+||+|+-.-. -.+..||+-..+-+ |+ ||=++-..+|--+||--|+-|-|--+ T Consensus 154 -~~~~vcKP~~~afE~----a~k~agi~~p~~t~-Ff-DDS~~NI~~ak~vGl~tvlv~~~~~~ 210 (244) T KOG3109 154 -EKTVVCKPSEEAFEK----AMKVAGIDSPRNTY-FF-DDSERNIQTAKEVGLKTVLVGREHKI 210 (244) T ss_pred -CCCEEECCCHHHHHH----HHHHHCCCCCCCEE-EE-CCCHHHHHHHHHCCCEEEEEEEEECC T ss_conf -785454578899999----99981878867448-97-17526689888536646999764055 No 21 >KOG2411 consensus Probab=81.14 E-value=1.8 Score=24.66 Aligned_cols=89 Identities=21% Similarity=0.366 Sum_probs=60.9 Q ss_pred CHHHHCCCCEEEEECCEEEH-----------HHHHHHHHCCCCCC---------------CCHHHHHHHHHHHHHHHCCC Q ss_conf 00110123217688787414-----------77789986682577---------------51456787899999998699 Q gi|254780757|r 128 PTLGAWGLGWECWCDGMEIS-----------QFTYFQQVCGIECS---------------PISGEITYGLERLAMYVQNV 181 (307) Q Consensus 128 PtLGAwGlGWEvwldGMEIt-----------QFTYFQQvGGi~c~---------------pv~~EiTYGLERiaMylQ~v 181 (307) .-.-+.|+-+...+||.|+- |=-.+.++++++-+ |.-|-|++||.|++|-|-+- T Consensus 506 ~p~~~rgqhYDlV~NGvElGGGSiRIhn~diQr~vLe~iLk~p~~~~s~gHLL~ALd~GaPPHGGiAlGlDRlvaml~~a 585 (628) T KOG2411 506 APEKIRGQHYDLVVNGVELGGGSIRIHNPDIQRYVLEDILKIPEDAESKGHLLNALDMGAPPHGGIALGLDRLVAMLTGA 585 (628) T ss_pred CCHHHHCCEEEEEECCEEECCCEEEECCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCEEECHHHHHHHHCCC T ss_conf 81665243001677147624734675288899999999855966666677899886269999874441278899998089 Q ss_pred CCEEEEEECCCCCCCCCCCHHHH-----HHHHHHHHHHHHHCC Q ss_conf 82113320688878831106244-----746997776350179 Q gi|254780757|r 182 NSVYDIVFNAIEGQNVLYGDIFA-----QSEQEYSRYNFEYAN 219 (307) Q Consensus 182 dnvyDl~wn~~~~~~vtYGdif~-----q~E~E~S~YNFE~Ad 219 (307) .||-|.+==- +..+=.|.-. -.|.|.--||.-+++ T Consensus 586 ~sIRDVIAFP---Kt~~G~Dlls~sPs~ip~e~L~~Y~I~~~~ 625 (628) T KOG2411 586 PSIRDVIAFP---KTTTGADLLSNSPSEIPEEQLEDYNIRVSN 625 (628) T ss_pred CCHHEEEECC---CCCCCCCCCCCCCCCCCHHHHHHCCEEECC T ss_conf 7522256446---657754313489888997883313055047 No 22 >cd00775 LysRS_core Lys_tRNA synthetase (LysRS) class II core domain. Class II LysRS is a dimer which attaches a lysine to the 3' OH group of ribose of the appropriate tRNA. Its assignment to class II aaRS is based upon its structure and the presence of three characteristic sequence motifs in the core domain. It is found in eukaryotes as well as some prokaryotes and archaea. However, LysRS belongs to class I aaRS's in some prokaryotes and archaea. The catalytic core domain is primarily responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Probab=80.20 E-value=0.73 Score=27.43 Aligned_cols=22 Identities=14% Similarity=0.273 Sum_probs=17.1 Q ss_pred HHHHHHHHHHHHHCCCEEEECC Q ss_conf 9999999999986695896116 Q gi|254780757|r 14 QNIILTLTQYWAQQGCTILQPY 35 (307) Q Consensus 14 q~ii~~L~~fW~~~GC~i~qpy 35 (307) ..|+..+.+|..++|.+=+++= T Consensus 12 s~i~~~iR~ff~~~gF~Ev~TP 33 (329) T cd00775 12 SKIISYIRKFLDDRGFLEVETP 33 (329) T ss_pred HHHHHHHHHHHHHCCCEEEECC T ss_conf 9999999999998898899798 No 23 >TIGR00459 aspS_bact aspartyl-tRNA synthetase; InterPro: IPR004524 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Aspartyl tRNA synthetase 6.1.1.12 from EC is an alpha2 dimer that belongs to class IIb. Structural analysis combined with mutagenesis and enzymology data on the yeast enzyme point to a tRNA binding process that starts by a recognition event between the tRNA anticodon loop and the synthetase anticodon binding module . This family represents aspartyl-tRNA synthetases from the bacteria and from mitochondria. In some species, this enzyme aminoacylates tRNA for both Asp and Asn; Asp-tRNA(asn) is subsequently transamidated to Asn-tRNA(asn). ; GO: 0000166 nucleotide binding, 0004815 aspartate-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006422 aspartyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=78.80 E-value=1.7 Score=24.84 Aligned_cols=59 Identities=25% Similarity=0.521 Sum_probs=46.7 Q ss_pred HHCCCCEEEEECCEEE-----------HHHHHHHHHCCCCC-----------------CCCHHHHHHHHHHHHHHHCCCC Q ss_conf 1012321768878741-----------47778998668257-----------------7514567878999999986998 Q gi|254780757|r 131 GAWGLGWECWCDGMEI-----------SQFTYFQQVCGIEC-----------------SPISGEITYGLERLAMYVQNVN 182 (307) Q Consensus 131 GAwGlGWEvwldGMEI-----------tQFTYFQQvGGi~c-----------------~pv~~EiTYGLERiaMylQ~vd 182 (307) -|-+--+-+.+||-|| -|=+-|+..=||+= .|+-+=+.+||-||.|-|-+.| T Consensus 530 ~a~a~aYDlVlNG~ElGGGSiRIh~~~~Q~~~F~d~Lgid~eea~EkFGFLLeA~~yGaPPHgG~A~GlDRL~Mll~~~~ 609 (653) T TIGR00459 530 EALAEAYDLVLNGVELGGGSIRIHDPEVQEKVFEDILGIDPEEAREKFGFLLEAFKYGAPPHGGLALGLDRLVMLLTGTD 609 (653) T ss_pred HHHHHCCCEEEEEEEECCCEEEECCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 54120287688306988710464287899999987628898998776356875873576851015555899999980899 Q ss_pred CEEEEEE Q ss_conf 2113320 Q gi|254780757|r 183 SVYDIVF 189 (307) Q Consensus 183 nvyDl~w 189 (307) ||-|++= T Consensus 610 nIRDVIA 616 (653) T TIGR00459 610 NIRDVIA 616 (653) T ss_pred CCCEECC T ss_conf 7220017 No 24 >PRK12445 lysyl-tRNA synthetase; Reviewed Probab=77.87 E-value=1.6 Score=25.02 Aligned_cols=56 Identities=27% Similarity=0.499 Sum_probs=39.7 Q ss_pred CCCEEEEECCEEEH-----------HHHHHHHH-----CC--------------CC-CCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 23217688787414-----------77789986-----68--------------25-77514567878999999986998 Q gi|254780757|r 134 GLGWECWCDGMEIS-----------QFTYFQQV-----CG--------------IE-CSPISGEITYGLERLAMYVQNVN 182 (307) Q Consensus 134 GlGWEvwldGMEIt-----------QFTYFQQv-----GG--------------i~-c~pv~~EiTYGLERiaMylQ~vd 182 (307) --.||+.++|+||. |---|+.. .| ++ =-|..+-+..|++||.|.|-|.+ T Consensus 411 t~rfdl~i~G~Ei~nG~~el~Dp~~Q~~rf~~q~~~k~~g~~e~~~~de~fl~Al~yG~PPhgG~glGiDRLvMlLtg~~ 490 (505) T PRK12445 411 TDRFEFFIGGREIGNGFSELNDAEDQAERFQEQVNAKAAGDDEAMFYDEDYVTALEYGLPPTAGLGIGIDRMIMLFTNSH 490 (505) T ss_pred HHEEEEEECCEEEEECEECCCCHHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHCCCCCHHCCHHHHHHHHHHHCCCC T ss_conf 01453511899987460226899999999999887775476600012288999965699850034379999999981999 Q ss_pred CEEEEEE Q ss_conf 2113320 Q gi|254780757|r 183 SVYDIVF 189 (307) Q Consensus 183 nvyDl~w 189 (307) ||-|.+- T Consensus 491 sIRdVI~ 497 (505) T PRK12445 491 TIRDVIL 497 (505) T ss_pred CHHEECC T ss_conf 5640716 No 25 >pfam04222 DUF416 Protein of unknown function, DUF. This is a bacterial family of uncharacterized proteins. Probab=74.82 E-value=7.7 Score=20.01 Aligned_cols=62 Identities=29% Similarity=0.438 Sum_probs=47.9 Q ss_pred HHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 850066532335530138289999999998788838750178899999999999999999998545168 Q gi|254780757|r 238 LKKGIPNEHHRHHLCVFPAYDQCIKASHIFNLLDARGVISTTERQRYILRIRSLAKACGEAFLMTPNGG 306 (307) Q Consensus 238 le~~LP~~~~~~~~lvlPAYD~~LKaSH~FNLLDARGaISVTERq~YI~RIR~Lak~~A~ayl~~re~g 306 (307) +|...|...+-.-.-|+||.|-|.--|.++|.+.++.. |. +..|-.++...-..|++.+++. T Consensus 46 lE~~iP~~~dfD~yGvyPA~DAc~aL~~lL~~~l~~~~----e~---a~~iS~lS~~TVa~~iE~q~~~ 107 (110) T pfam04222 46 LEEIIPDPDDFDSYGVYPAIDACVALSELLHALLAGDE----EE---AINISQLSLATVAAFIEAQAGE 107 (110) T ss_pred HHHHCCCCCCCCHHCHHHHHHHHHHHHHHHHHHHCCCH----HH---HHHHHHHHHHHHHHHHHHHHCC T ss_conf 98719993103311101899999999999998776788----89---9999999899999999986045 No 26 >PRK02983 lysS lysyl-tRNA synthetase; Provisional Probab=72.98 E-value=4 Score=22.11 Aligned_cols=55 Identities=22% Similarity=0.375 Sum_probs=34.2 Q ss_pred CCCEEEEECCEEEH-----------HHHHHH-----HHCCCC---------------CCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 23217688787414-----------777899-----866825---------------77514567878999999986998 Q gi|254780757|r 134 GLGWECWCDGMEIS-----------QFTYFQ-----QVCGIE---------------CSPISGEITYGLERLAMYVQNVN 182 (307) Q Consensus 134 GlGWEvwldGMEIt-----------QFTYFQ-----QvGGi~---------------c~pv~~EiTYGLERiaMylQ~vd 182 (307) .--||..|||||+. |=--|| ..||-+ =-|.++-+..|+.||.|.|-|. T Consensus 1007 AeRfDLvinG~ELgnGysELnDP~eQR~Rf~eQ~~~~~ggD~Eam~~DEdFL~ALeYGmPPhGGialGIDRLVMLLTG~- 1085 (1099) T PRK02983 1007 AERWDLVAWGVELGTAYSELTDPVEQRRRLTEQSLLAAGGDPEAMELDEDFLQALEYAMPPTGGLGMGVDRLVMLITGR- 1085 (1099) T ss_pred EEEEEEEECCEEEECCHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHCHHHHHHHHCCCCCCCCCEEEHHHHHHHHHCCC- T ss_conf 7877255558997021035169999999999999998679900233217899984318997364754499999996299- Q ss_pred CEEEEEE Q ss_conf 2113320 Q gi|254780757|r 183 SVYDIVF 189 (307) Q Consensus 183 nvyDl~w 189 (307) ||-|.+. T Consensus 1086 SIREVIa 1092 (1099) T PRK02983 1086 SIRETLP 1092 (1099) T ss_pred CHHHHCC T ss_conf 7598743 No 27 >cd00776 AsxRS_core Asx tRNA synthetase (AspRS/AsnRS) class II core domain. Assignment to class II aminoacyl-tRNA synthetases (aaRS) based upon its structure and the presence of three characteristic sequence motifs in the core domain. This family includes AsnRS as well as a subgroup of AspRS. AsnRS and AspRS are homodimers, which attach either asparagine or aspartate to the 3'OH group of ribose of the appropriate tRNA. While archaea lack asnRS, they possess a non-discriminating aspRS, which can mischarge Asp-tRNA with Asn. Subsequently, a tRNA-dependent aspartate amidotransferase converts the bound aspartate to asparagine. The catalytic core domain is primarily responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Probab=72.42 E-value=1.6 Score=25.05 Aligned_cols=30 Identities=23% Similarity=0.436 Sum_probs=20.3 Q ss_pred HHHHHHHHHHHHCCCEEE-----ECCCCCCCCCCC Q ss_conf 999999999986695896-----116754333224 Q gi|254780757|r 15 NIILTLTQYWAQQGCTIL-----QPYDMEVGAGTF 44 (307) Q Consensus 15 ~ii~~L~~fW~~~GC~i~-----qpyd~e~GAgT~ 44 (307) .|+..+-+|..++|.+=. ++.+.|-||-.| T Consensus 29 ~i~~~iR~fl~~~gF~EVeTP~L~~~~~eg~a~~f 63 (322) T cd00776 29 EVLRAFREFLRENGFTEVHTPKITSTDTEGGAELF 63 (322) T ss_pred HHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCC T ss_conf 99999999999889999979840078898654667 No 28 >PRK04172 pheS phenylalanyl-tRNA synthetase subunit alpha; Provisional Probab=70.37 E-value=9.7 Score=19.30 Aligned_cols=130 Identities=23% Similarity=0.339 Sum_probs=74.5 Q ss_pred HHHHHHC---CCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCHHHHHH---HHHHHHHCCCCCCCCCEEE Q ss_conf 7898744---834401676415578876554667750011444455757983358999---9999998199962171798 Q gi|254780757|r 47 STTLRAL---GPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVIIKPNPLNLQNL---YIESLKAVGIDPLIHDVRF 120 (307) Q Consensus 47 ~T~lr~l---g~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVilKPsp~niq~l---Yl~SL~~iGid~~~hDIrF 120 (307) +...|.| +..|.++-.+--|.|+..= .+..+-.+||.--++--.--++.+| .-.=++.+|+. +||| T Consensus 349 ~~~~r~l~~~~~~p~~~f~~~rvfR~e~~----d~tH~~~F~Q~eg~v~~~~~~~~~L~g~l~~f~~~~g~~----~~rf 420 (501) T PRK04172 349 ALSARYLAERPEPPGKYFSIGRVFRPETI----DATHLPEFYQLEGIVMGEDVSFRHLLGILKEFYKRLGFE----EVKF 420 (501) T ss_pred HHHHHHHHHCCCCCEEEECCCCEEECCCC----CCCCCCEEEEEEEEEECCCCCHHHHHHHHHHHHHHHCCC----CEEE T ss_conf 66799997468998357356505426888----865441332224899848887999999999999985886----0786 Q ss_pred EECCC--CCCHH----HHCCCCE-EEEECCEEEHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEE Q ss_conf 83288--87001----1012321-7688787414777899866825775145678789999999869982113320 Q gi|254780757|r 121 VEDNW--ESPTL----GAWGLGW-ECWCDGMEISQFTYFQQVCGIECSPISGEITYGLERLAMYVQNVNSVYDIVF 189 (307) Q Consensus 121 veDnW--EsPtL----GAwGlGW-EvwldGMEItQFTYFQQvGGi~c~pv~~EiTYGLERiaMylQ~vdnvyDl~w 189 (307) .-.=. --|++ --.|+|| ||-=-||==-. .+ .-.|+++.. .----|+||+||..-|+++|-||-= T Consensus 421 rp~yfPftEPs~E~~~~~~~~~WiE~~g~G~~~pe--vl-~~~g~~~~~--~a~G~g~eR~am~~~gi~diR~l~~ 491 (501) T PRK04172 421 RPAYFPFTEPSVEVEVYHPGLGWVELGGAGIFRPE--VT-EPLGIDVPV--LAWGIGIDRLAMLRLGLDDIRDLYS 491 (501) T ss_pred CCCCCCCCCCCEEEEEEECCCCEEEEECCCCCCHH--HH-HHCCCCCCE--EEEEECHHHHHHHHHCCHHHHHHHH T ss_conf 79989989982789999579986998267578889--99-755999874--7875159999999829808888850 No 29 >PRK09350 lysyl-tRNA synthetase; Provisional Probab=70.16 E-value=1.5 Score=25.16 Aligned_cols=21 Identities=19% Similarity=0.477 Sum_probs=16.9 Q ss_pred HHHHHHHHHHHHHCCCEEEEC Q ss_conf 999999999998669589611 Q gi|254780757|r 14 QNIILTLTQYWAQQGCTILQP 34 (307) Q Consensus 14 q~ii~~L~~fW~~~GC~i~qp 34 (307) ..|+..+.+|..++|-+=.+. T Consensus 20 s~i~~~iR~ff~~~gFlEVeT 40 (325) T PRK09350 20 AAIIAEIRRFFADRGVLEVET 40 (325) T ss_pred HHHHHHHHHHHHHCCCEEEEC T ss_conf 999999999999889689779 No 30 >TIGR00470 sepS O-phosphoseryl-tRNA(Cys) synthetase; InterPro: IPR005246 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . All known phenylalanyl-tRNA synthetases, excepting a monomeric mitochondrial form, have an alpha 2 beta 2 heterotetrameric structure. This family of archaeal proteins resembles known phenylalanyl-tRNA synthetase alpha chains but is longer and of unknown function; more generally, it resembles class 2 aminoacyl tRNA ligases. . Probab=69.08 E-value=1.9 Score=24.46 Aligned_cols=34 Identities=29% Similarity=0.511 Sum_probs=26.9 Q ss_pred HHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCHHHH Q ss_conf 5678789999999869982113320688878831106244 Q gi|254780757|r 165 GEITYGLERLAMYVQNVNSVYDIVFNAIEGQNVLYGDIFA 204 (307) Q Consensus 165 ~EiTYGLERiaMylQ~vdnvyDl~wn~~~~~~vtYGdif~ 204 (307) --+--|.||+||-|-+-|.|-..+|= .-||++.+ T Consensus 336 MNLGlGVERlAMIlygy~DVR~MVYP------QiyGE~rL 369 (558) T TIGR00470 336 MNLGLGVERLAMILYGYEDVRKMVYP------QIYGEYRL 369 (558) T ss_pred EECCCHHHHHHHHHHCHHHHHHCCCC------CCCCEEEC T ss_conf 22563187678778140334322688------42122312 No 31 >PRK12420 histidyl-tRNA synthetase; Provisional Probab=68.50 E-value=11 Score=19.03 Aligned_cols=53 Identities=19% Similarity=0.293 Sum_probs=38.3 Q ss_pred CCCCCEEEEECCCCC---CCCCCCCCCCCCCCEEEEEEEEE--CCCCH---HHHHHHHHHHHHCCCCC Q ss_conf 834401676415578---87655466775001144445575--79833---58999999999819996 Q gi|254780757|r 54 GPLSWKAAYVQPSRR---PLDGRYAENPNRLQHYYQFQVII--KPNPL---NLQNLYIESLKAVGIDP 113 (307) Q Consensus 54 g~~pw~~aYvqPsrR---P~DgRYGeNPNRlq~y~QfQVil--KPsp~---niq~lYl~SL~~iGid~ 113 (307) -+.|.+..|..|+.| |--||| -++||+-|=+ .++|. .+..|-.++|+++||+. T Consensus 96 ~~~P~k~~y~g~vfR~erpq~GR~-------ReF~Q~GvEiiG~~~~~aDaEiI~la~~~l~~lgl~~ 156 (421) T PRK12420 96 IRLPFKRYEIGKVFRDGPIKQGRF-------REFIQCDVDIVGVESVMAEAELMSMAFELFRTLNLEV 156 (421) T ss_pred CCCCEEEEEECCEEECCCCCCCCC-------CEEEECCEEEECCCCHHHHHHHHHHHHHHHHHCCCCE T ss_conf 778745888767785389878855-------5134357542078877889999999999999659865 No 32 >KOG3085 consensus Probab=68.39 E-value=8.4 Score=19.75 Aligned_cols=118 Identities=21% Similarity=0.356 Sum_probs=76.3 Q ss_pred EECCCCCCCC--CCCCHH-HHHHHCCCCCCEEEEECCCCCCCCCCCC--CCCCCCCEEEEEEE------EECCCCHHHHH Q ss_conf 6116754333--224857-8987448344016764155788765546--67750011444455------75798335899 Q gi|254780757|r 32 LQPYDMEVGA--GTFHPS-TTLRALGPLSWKAAYVQPSRRPLDGRYA--ENPNRLQHYYQFQV------IIKPNPLNLQN 100 (307) Q Consensus 32 ~qpyd~e~GA--gT~hp~-T~lr~lg~~pw~~aYvqPsrRP~DgRYG--eNPNRlq~y~QfQV------ilKPsp~niq~ 100 (307) ...|+.+.++ -+.+++ .+|+.|..+-..+.-+- -+|-|+- .-+-.|-+|+-|-| +.||.|. T Consensus 101 ~~~~s~~~~~~~~~~~~~~~~lq~lR~~g~~l~iis----N~d~r~~~~l~~~~l~~~fD~vv~S~e~g~~KPDp~---- 172 (237) T KOG3085 101 FRLFSTFAPSAWKYLDGMQELLQKLRKKGTILGIIS----NFDDRLRLLLLPLGLSAYFDFVVESCEVGLEKPDPR---- 172 (237) T ss_pred HHEECCCCCCCCEECCHHHHHHHHHHHCCEEEEEEC----CCCHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHH---- T ss_conf 200102355572116379999999986793999952----772777777640187876335311434246789868---- Q ss_pred HHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCEEEEECCEEEHHHHHHHHHCCCC Q ss_conf 99999998199962171798832888700110123217688787414777899866825 Q gi|254780757|r 101 LYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGWECWCDGMEISQFTYFQQVCGIE 159 (307) Q Consensus 101 lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGWEvwldGMEItQFTYFQQvGGi~ 159 (307) +|..-|+.+|+.| -|+-|+-|+-++--.||.-+||--++=.-.++-+-..++.-|++ T Consensus 173 If~~al~~l~v~P--ee~vhIgD~l~nD~~gA~~~G~~ailv~~~~~~~~~~~~~~~~~ 229 (237) T KOG3085 173 IFQLALERLGVKP--EECVHIGDLLENDYEGARNLGWHAILVDNSITALKELEYKLGID 229 (237) T ss_pred HHHHHHHHHCCCH--HHEEEECCCCCCCCHHHHHCCCEEEEECCCCCHHHHHHHCCCCC T ss_conf 9999998818996--88688468532122768974977999725652324332101432 No 33 >PRK13222 phosphoglycolate phosphatase; Provisional Probab=67.79 E-value=11 Score=18.93 Aligned_cols=87 Identities=23% Similarity=0.329 Sum_probs=52.9 Q ss_pred HHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEE------ECCCCHHHHHHHHHHHHHCCCCCCCCCEEE Q ss_conf 78987448344016764155788765546677500114444557------579833589999999998199962171798 Q gi|254780757|r 47 STTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVI------IKPNPLNLQNLYIESLKAVGIDPLIHDVRF 120 (307) Q Consensus 47 ~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVi------lKPsp~niq~lYl~SL~~iGid~~~hDIrF 120 (307) ..+|+.|...-++++-|--+.|..--+. -.-.-+.+|+.+-|- -||.| +.|+..++.+|++| .+.-+ T Consensus 99 ~e~L~~L~~~g~~l~I~Tn~~~~~~~~~-l~~~gl~~~F~~i~~~d~v~~~KP~P----~~~~~a~~~lg~~p--~e~l~ 171 (228) T PRK13222 99 KETLAALKAAGYPLAVVTNKPEPFVAPL-LEALGIADYFSVVIGGDSLPNKKPDP----APLLLACEKLGIDP--EQMLF 171 (228) T ss_pred HHHHHHHHHCCCCEEEECCCHHHHHHHH-HHHHCCCCCCCEEEECCCCCCCCCCH----HHHHHHHHHCCCCC--CCEEE T ss_conf 9999999987996477548749999999-88627653323466322247778786----99999999819793--43489 Q ss_pred EECCCCCCHHHHCCCCEE-EEE Q ss_conf 832888700110123217-688 Q gi|254780757|r 121 VEDNWESPTLGAWGLGWE-CWC 141 (307) Q Consensus 121 veDnWEsPtLGAwGlGWE-vwl 141 (307) |+|. ++--.+|.-.|.- ||+ T Consensus 172 VGDs-~~Di~aA~~aG~~~i~v 192 (228) T PRK13222 172 VGDS-RNDIQAAKAAGCPSVGV 192 (228) T ss_pred EECC-HHHHHHHHHCCCEEEEE T ss_conf 8068-88999999969949998 No 34 >cd00777 AspRS_core Asp tRNA synthetase (aspRS) class II core domain. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs. AspRS is a homodimer, which attaches a specific amino acid to the 3' OH group of ribose of the appropriate tRNA. The catalytic core domain is primarily responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. AspRS in this family differ from those found in the AsxRS family by a GAD insert in the core domain. Probab=66.49 E-value=3 Score=22.99 Aligned_cols=21 Identities=14% Similarity=0.355 Sum_probs=17.1 Q ss_pred HHHHHHHHHHHHCCCEEEECC Q ss_conf 999999999986695896116 Q gi|254780757|r 15 NIILTLTQYWAQQGCTILQPY 35 (307) Q Consensus 15 ~ii~~L~~fW~~~GC~i~qpy 35 (307) .||..+.+|..++|-+=+++= T Consensus 6 ~i~~~iR~ff~~~gflEV~TP 26 (280) T cd00777 6 RVIKAIRNFLDEQGFVEIETP 26 (280) T ss_pred HHHHHHHHHHHHCCCEEEECC T ss_conf 999999999998898998798 No 35 >COG0017 AsnS Aspartyl/asparaginyl-tRNA synthetases [Translation, ribosomal structure and biogenesis] Probab=66.02 E-value=3.5 Score=22.47 Aligned_cols=14 Identities=29% Similarity=0.513 Sum_probs=4.9 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 99999999999999 Q gi|254780757|r 223 LHNHFIDSEKECLD 236 (307) Q Consensus 223 L~~~F~~yE~Ea~r 236 (307) +...|+..-++|.. T Consensus 246 i~~i~~~v~e~~~~ 259 (435) T COG0017 246 IKYLFKKVLEECAD 259 (435) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999999997189 No 36 >pfam01411 tRNA-synt_2c tRNA synthetases class II (A). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only alanyl-tRNA synthetases. Probab=65.82 E-value=1.6 Score=24.99 Aligned_cols=160 Identities=24% Similarity=0.267 Sum_probs=82.1 Q ss_pred HHHHHHHHHHHCCCEEEECCCCC---------CCCCC--CCHHHHHHH-CCCCCCEEEEECCCCCCCCCCCCCCCCC-CC Q ss_conf 99999999986695896116754---------33322--485789874-4834401676415578876554667750-01 Q gi|254780757|r 16 IILTLTQYWAQQGCTILQPYDME---------VGAGT--FHPSTTLRA-LGPLSWKAAYVQPSRRPLDGRYAENPNR-LQ 82 (307) Q Consensus 16 ii~~L~~fW~~~GC~i~qpyd~e---------~GAgT--~hp~T~lr~-lg~~pw~~aYvqPsrRP~DgRYGeNPNR-lq 82 (307) |=.+--+|+.++|+.++.+|.+= +.||- |-|. |+.. ..|..=+++-+|||-|-.|= +|--+ -. T Consensus 2 iR~~fl~fF~~~gH~~i~s~~lvp~~d~~llft~Agm~~fkp~-~~g~~~~pp~~~l~~~Q~CiR~~Di---dnVG~t~r 77 (545) T pfam01411 2 IRQTFLDFFEKKGHTRVPSSPVVPRNDPTLLFTNAGMNQFKPI-FLGGEDKPPYNRAVNSQKCIRAGDL---DNVGRTAR 77 (545) T ss_pred HHHHHHHHHHHCCCEEECCCCCCCCCCCCEEEECCCCHHCHHH-HCCCCCCCCCCCCCCCCCCEECCCH---HHCCCCCC T ss_conf 6899999998789888078670679999814640460117376-5498758998872443332443765---43367777 Q ss_pred EEEEEEEEECCCC-----HHHHHHHHHHH-HHCCCCCCCCCEEEEECCCCCCHHHHCCCCEEEEECCEEEHHH-----HH Q ss_conf 1444455757983-----35899999999-9819996217179883288870011012321768878741477-----78 Q gi|254780757|r 83 HYYQFQVIIKPNP-----LNLQNLYIESL-KAVGIDPLIHDVRFVEDNWESPTLGAWGLGWECWCDGMEISQF-----TY 151 (307) Q Consensus 83 ~y~QfQVilKPsp-----~niq~lYl~SL-~~iGid~~~hDIrFveDnWEsPtLGAwGlGWEvwldGMEItQF-----TY 151 (307) ||.=|..+=-=|- +...+.-.+=| +.+||++. .|-|.++-|. |-.|-=||+-..|.|.--+ .= T Consensus 78 H~T~FEMlGn~SFg~YfK~eai~~a~e~lt~~l~i~~~--~l~~t~~~~d----~ea~~~w~~~~g~~~~~i~~~~~~~n 151 (545) T pfam01411 78 HHTFFEMLGNFSFGDYFKEEAIEFAWELLTKELGLDPE--RLYVTVYEWD----DEAGPIWEKIVGGPEERILRFGDKDN 151 (545) T ss_pred CCHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCHH--HEEEEECCCC----CCHHHHHHHHCCCCHHHCEECCCCCC T ss_conf 53232203345156243789999999997656178866--7699988886----33667688862999999062677776 Q ss_pred HHHHC-CCCCCCCHHHHH--HHHHHHHHHHCCCCCEEE Q ss_conf 99866-825775145678--789999999869982113 Q gi|254780757|r 152 FQQVC-GIECSPISGEIT--YGLERLAMYVQNVNSVYD 186 (307) Q Consensus 152 FQQvG-Gi~c~pv~~EiT--YGLERiaMylQ~vdnvyD 186 (307) |=++| +-.|-|.+ ||- ||+||.+-..++..+.+| T Consensus 152 fw~~g~~gpcgPcs-eI~~d~G~e~~~~~~~~~~~~~~ 188 (545) T pfam01411 152 FWEMGDTGPCGPCS-EIDYDRGEEIGGWPSGGTPTADD 188 (545) T ss_pred EEECCCCCCCCCCE-EEEECCCCCCCCCCCCCCCCCCC T ss_conf 05778988997768-99976672026888889988888 No 37 >PRK06462 asparagine synthetase A; Reviewed Probab=64.40 E-value=3 Score=22.98 Aligned_cols=60 Identities=7% Similarity=0.044 Sum_probs=35.0 Q ss_pred HHHHHHHHHHHHCCCEEEECC-----CCCCCCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEE Q ss_conf 999999999986695896116-----754333224857898744834401676415578876554667750011444455 Q gi|254780757|r 15 NIILTLTQYWAQQGCTILQPY-----DMEVGAGTFHPSTTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQV 89 (307) Q Consensus 15 ~ii~~L~~fW~~~GC~i~qpy-----d~e~GAgT~hp~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQV 89 (307) .||..+-+|..++|.+-.++= +.+-+|+.| +.++ .|+--.+ T Consensus 34 ~i~~~iR~f~~~~gF~EV~TPiL~~~~~~~~~~~~------------~~~~----------------------~~~~~~~ 79 (332) T PRK06462 34 SILRYTREFLDGRGFVEVLPPIISPSTDPLMGDAK------------PASI----------------------DFYGVEY 79 (332) T ss_pred HHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCC------------CCEE----------------------ECCCCCE T ss_conf 99999999998889999879723677887667760------------1058----------------------4489863 Q ss_pred EECCCCHHHHHHHHHHHHHCCCC Q ss_conf 75798335899999999981999 Q gi|254780757|r 90 IIKPNPLNLQNLYIESLKAVGID 112 (307) Q Consensus 90 ilKPsp~niq~lYl~SL~~iGid 112 (307) -|..|| +||++-|-+-|++ T Consensus 80 ~L~~Sp----ql~lk~li~~g~~ 98 (332) T PRK06462 80 YLADSM----IFHKQLMLRLLKG 98 (332) T ss_pred EECCCH----HHHHHHHHHCCCC T ss_conf 314586----9999999863899 No 38 >cd00669 Asp_Lys_Asn_RS_core Asp_Lys_Asn_tRNA synthetase class II core domain. This domain is the core catalytic domain of class II aminoacyl-tRNA synthetases of the subgroup containing aspartyl, lysyl, and asparaginyl tRNA synthetases. It is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs. Nearly all class II tRNA synthetases are dimers and enzymes in this subgroup are homodimers. These enzymes attach a specific amino acid to the 3' OH group of ribose of the appropriate tRNA. Probab=64.16 E-value=2.3 Score=23.86 Aligned_cols=171 Identities=18% Similarity=0.242 Sum_probs=80.2 Q ss_pred HHHHHHHHHHHHHCCCEE-----EECCCCCCCCCCC---------------CHHHHHHHC-CCCCCEEEEECCCCCCCCC Q ss_conf 999999999998669589-----6116754333224---------------857898744-8344016764155788765 Q gi|254780757|r 14 QNIILTLTQYWAQQGCTI-----LQPYDMEVGAGTF---------------HPSTTLRAL-GPLSWKAAYVQPSRRPLDG 72 (307) Q Consensus 14 q~ii~~L~~fW~~~GC~i-----~qpyd~e~GAgT~---------------hp~T~lr~l-g~~pw~~aYvqPsrRP~Dg 72 (307) ..|+..+.+|..++|-+= +...+.|-||-.| .|..++..| -.--=++--+-||.|-... T Consensus 5 s~i~~~iR~ff~~~~f~EV~TP~l~~~~~~~~~~~f~~~~~~~~~~~~L~~Spel~~k~ll~~g~~~if~i~~~FR~e~~ 84 (269) T cd00669 5 SKIIKAIRDFMDDRGFLEVETPMLQKITGGAGARPFLVKYNALGLDYYLRISPQLFKKRLMVGGLDRVFEINRNFRNEDL 84 (269) T ss_pred HHHHHHHHHHHHHCCCEEEECCEEECCCCCCCCCEEEEECCCCCCEEEECCCHHHHHHHHHHCCCCCEEEEECCCCCCCC T ss_conf 99999999999988989987985305789876742573137899407734388999999986588867998462107898 Q ss_pred CCCCCCCCCCEEEEEEEEECC-CCHH----HHHHHHHHHHH-CCCCCCCCC------------------------EEEEE Q ss_conf 546677500114444557579-8335----89999999998-199962171------------------------79883 Q gi|254780757|r 73 RYAENPNRLQHYYQFQVIIKP-NPLN----LQNLYIESLKA-VGIDPLIHD------------------------VRFVE 122 (307) Q Consensus 73 RYGeNPNRlq~y~QfQVilKP-sp~n----iq~lYl~SL~~-iGid~~~hD------------------------IrFve 122 (307) +--.|| -+.+.-.=+.= +-++ +.+|.-...+. ++......+ --||- T Consensus 85 ~~rH~~----EFtmlE~y~~~~d~~~~m~~~e~li~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ea~e~~~~p~fi~ 160 (269) T cd00669 85 RARHQP----EFTMMDLEMAFADYEDVIELTERLVRHLAREVLGVTAVTYGFELEDFGLPFPRLTYREALERYGQPLFLT 160 (269) T ss_pred CCCCCH----HHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHCCCCCCCCCCCCEEEHHHHHHHHCCCEEEE T ss_conf 655434----8775787514899999999999999999999965552453554444589987857999999849978998 Q ss_pred CCCC----------CCHHHHCCCCEEEEECCEEEH-----------HHHHHHHHCCCC----------------CCCCHH Q ss_conf 2888----------700110123217688787414-----------777899866825----------------775145 Q gi|254780757|r 123 DNWE----------SPTLGAWGLGWECWCDGMEIS-----------QFTYFQQVCGIE----------------CSPISG 165 (307) Q Consensus 123 DnWE----------sPtLGAwGlGWEvwldGMEIt-----------QFTYFQQvGGi~----------------c~pv~~ 165 (307) | |- +|.=......||+-++|+||. |-.-|+..|--. =-|.++ T Consensus 161 d-yP~~~~~~~~~~~~~~~~~a~rfdl~~~G~El~nG~~r~~d~~~~~~r~~~~~~~~~~~~~~~~~yl~a~~~G~pp~~ 239 (269) T cd00669 161 D-YPAEMHSPLASPHDVNPEIADAFDLFINGVEVGNGSSRLHDPDIQAEVFQEQGINKEAGMEYFEFYLKALEYGLPPHG 239 (269) T ss_pred C-CCHHCCCCCCCCCCCCCCHHHHCCCCCCCEEECCCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCE T ss_conf 9-850107802090799966766502467860670325204888999999998560744556658999999877999971 Q ss_pred HHHHHHHHHHHHHCCCCCEEEEEE Q ss_conf 678789999999869982113320 Q gi|254780757|r 166 EITYGLERLAMYVQNVNSVYDIVF 189 (307) Q Consensus 166 EiTYGLERiaMylQ~vdnvyDl~w 189 (307) -+.-|+|||.|++-|.+||-|... T Consensus 240 G~glG~dRL~m~~~g~~~Irdv~~ 263 (269) T cd00669 240 GLGIGIDRLIMLMTNSPTIREVIA 263 (269) T ss_pred EEEEHHHHHHHHHHCCCCHHHCCC T ss_conf 663479999999808996886247 No 39 >PRK03932 asnC asparaginyl-tRNA synthetase; Validated Probab=64.00 E-value=3.2 Score=22.79 Aligned_cols=16 Identities=6% Similarity=0.420 Sum_probs=9.1 Q ss_pred HHHHHHHHHHHCCCEE Q ss_conf 9999999998669589 Q gi|254780757|r 16 IILTLTQYWAQQGCTI 31 (307) Q Consensus 16 ii~~L~~fW~~~GC~i 31 (307) ||..+-+|..++|.+= T Consensus 141 i~~~iR~~l~~~~F~E 156 (462) T PRK03932 141 LAQAIHEFFQENGFVW 156 (462) T ss_pred HHHHHHHHHHHCCCEE T ss_conf 9999999997679679 No 40 >TIGR02572 LcrR type III secretion system regulator LcrR; InterPro: IPR013405 This family of proteins are encoded within type III secretion operons and have been characterised in Yersinia as a regulator of the Low-Calcium Response (LCR) .. Probab=63.36 E-value=1.5 Score=25.23 Aligned_cols=33 Identities=24% Similarity=0.509 Sum_probs=20.0 Q ss_pred HHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCEEEEECCEEEH Q ss_conf 9998199962171798832888700110123217688787414 Q gi|254780757|r 105 SLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGWECWCDGMEIS 147 (307) Q Consensus 105 SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGWEvwldGMEIt 147 (307) |++.-||...=|=.+ . -+=-|||.||.+|||+. T Consensus 8 wf~~~G~~v~Ph~l~------~----s~I~LG~~~~~~G~eLa 40 (142) T TIGR02572 8 WFEETGLEVTPHCLQ------K----SPIQLGWRVEVDGCELA 40 (142) T ss_pred HHHHCCCEEECEECC------C----CCCCHHHHHHHCCCEEE T ss_conf 456088525034526------8----86201058654475078 No 41 >cd00496 PheRS_alpha_core Phenylalanyl-tRNA synthetase (PheRS) alpha chain catalytic core domain. PheRS belongs to class II aminoacyl-tRNA synthetases (aaRS) based upon its structure and the presence of three characteristic sequence motifs. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. While class II aaRSs generally aminoacylate the 3'-OH ribose of the appropriate tRNA, PheRS is an exception in that it attaches the amino acid at the 2'-OH group, like class I aaRSs. PheRS is an alpha-2/ beta-2 tetramer. Probab=63.12 E-value=2.8 Score=23.22 Aligned_cols=125 Identities=22% Similarity=0.262 Sum_probs=67.6 Q ss_pred HHHHCCCCCCEEEEECCCCCCC--CCCCCCCCCCCCEEEEEEEEE-CC--CCHHHHHHHHHHHHHCCCCCCCCCEEEEEC Q ss_conf 9874483440167641557887--655466775001144445575-79--833589999999998199962171798832 Q gi|254780757|r 49 TLRALGPLSWKAAYVQPSRRPL--DGRYAENPNRLQHYYQFQVII-KP--NPLNLQNLYIESLKAVGIDPLIHDVRFVED 123 (307) Q Consensus 49 ~lr~lg~~pw~~aYvqPsrRP~--DgRYGeNPNRlq~y~QfQVil-KP--sp~niq~lYl~SL~~iGid~~~hDIrFveD 123 (307) .|+... .|.++..+-.|.|+. |. ..+-.+||.-.++ .. +-.|.....-.=++.++- ....+||..+ T Consensus 73 ~l~~~~-~p~~~~~~G~VyR~D~iDa------tH~~~FhQ~Eg~~v~~~~~~~~Lk~~l~~~~~~lfg--~~~~~r~rp~ 143 (218) T cd00496 73 ALAKLK-PPIRIFSIGRVYRNDEIDA------THLPEFHQIEGLVVDKGLTFADLKGTLEEFAKELFG--PITKVRFRPS 143 (218) T ss_pred HHHHCC-CCCEEEECCCEEECCCCCC------CCCCEEEEEEEEEEECCCCHHHHHHHHHHHHHHHCC--CCCEEEECCC T ss_conf 997458-9817994250465388984------315312576379995651199999999999999738--7664536778 Q ss_pred CC--CCCHHHHCCCCEEEEECC----EEEHHH-----HHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEE Q ss_conf 88--870011012321768878----741477-----789986682577514567878999999986998211332 Q gi|254780757|r 124 NW--ESPTLGAWGLGWECWCDG----MEISQF-----TYFQQVCGIECSPISGEITYGLERLAMYVQNVNSVYDIV 188 (307) Q Consensus 124 nW--EsPtLGAwGlGWEvwldG----MEItQF-----TYFQQvGGi~c~pv~~EiTYGLERiaMylQ~vdnvyDl~ 188 (307) =. -.|+ +--+|+|++ +||--- ... .-+|++-+-...-.--|||||||-.-+++++-++- T Consensus 144 yFPFTePS-----~Evdv~~~~~~~WlEv~G~G~v~p~vL-~~~G~d~~~~g~AfG~GlERlaMl~~gI~DIR~~~ 213 (218) T cd00496 144 YFPFTEPS-----FEVDVYCPGCLGWLEILGCGMVRPEVL-ENAGIDEEYSGFAFGIGLERLAMLKYGIPDIRLFY 213 (218) T ss_pred CCCCCCCC-----EEEEEEECCCCCCEEEECCCCCCHHHH-HHCCCCCCCEEEEEEECHHHHHHHHCCCCHHHHHH T ss_conf 89999982-----699999658787148850465488999-86697999889998525889999983997688875 No 42 >pfam01409 tRNA-synt_2d tRNA synthetases class II core domain (F). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only phenylalanyl-tRNA synthetases. This is the core catalytic domain. Probab=62.53 E-value=14 Score=18.25 Aligned_cols=169 Identities=20% Similarity=0.284 Sum_probs=93.7 Q ss_pred CHHHHHHHHHHHHHHCCCEEEECCCCCCC--------CCCCCH--------------------------HHHHHHC---C Q ss_conf 79999999999998669589611675433--------322485--------------------------7898744---8 Q gi|254780757|r 12 SFQNIILTLTQYWAQQGCTILQPYDMEVG--------AGTFHP--------------------------STTLRAL---G 54 (307) Q Consensus 12 ~fq~ii~~L~~fW~~~GC~i~qpyd~e~G--------AgT~hp--------------------------~T~lr~l---g 54 (307) -...++-.+.+++...|-.++..-.+|.- -..-|| +.-.|.| . T Consensus 18 Pi~~~~~~i~~~f~~~GF~~~~g~eves~~~NFDaLn~P~dHPaR~~~DTfy~~~~~~~~~~vLRtHTS~~q~r~l~~~~ 97 (243) T pfam01409 18 PLTRVLEEIRDIFLSMGFEEVEGPEVESDFYNFDALNIPQDHPARDMQDTFYLKKPLKARRLLLRTHTTPVQARTLAEKN 97 (243) T ss_pred HHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHCCCCCCCCCCCCCEEEECCCCCCCCEEECCCCCHHHHHHHHHCC T ss_conf 59999999999998779868769835642777775578988987676760343366667686576879889999997427 Q ss_pred CCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEECC---CCHHHHHHHHHHHHHC-CCCCCCCCEEEEECC--CCCC Q ss_conf 344016764155788765546677500114444557579---8335899999999981-999621717988328--8870 Q gi|254780757|r 55 PLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVIIKP---NPLNLQNLYIESLKAV-GIDPLIHDVRFVEDN--WESP 128 (307) Q Consensus 55 ~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVilKP---sp~niq~lYl~SL~~i-Gid~~~hDIrFveDn--WEsP 128 (307) ..|.++-.+--|.|+..- ..--+-.+||.--+.-- +-.|.....-.-++.+ |.+ -.+||.+.- +-.| T Consensus 98 ~~p~r~~~~grvyR~d~~----DatH~~~FhQ~Eg~vvd~~it~~~Lk~~l~~~~~~lfg~~---~~~R~rp~yFPFTeP 170 (243) T pfam01409 98 KPPIKIFSIGRVFRRDQV----DATHLPEFHQVEGLVVDENVSFADLKGVLEEFLRKFFGFE---VKVRFRPSYFPFTEP 170 (243) T ss_pred CCCEEEECCCEEEECCCC----CCCCCCEEEEEEEEEECCCCCHHHHHHHHHHHHHHHCCCC---CEEEECCCCCCCCCC T ss_conf 998489814313325889----8431531057557896577789999999999999973997---379867886999998 Q ss_pred HH--H---HCCCCE-EEEECCEEEHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEEC Q ss_conf 01--1---012321-76887874147778998668257751456787899999998699821133206 Q gi|254780757|r 129 TL--G---AWGLGW-ECWCDGMEISQFTYFQQVCGIECSPISGEITYGLERLAMYVQNVNSVYDIVFN 190 (307) Q Consensus 129 tL--G---AwGlGW-EvwldGMEItQFTYFQQvGGi~c~pv~~EiTYGLERiaMylQ~vdnvyDl~wn 190 (307) ++ - .-+-|| ||-=-||==-+-. .-.||+.+-...-.--|||||||-.-++++|-++--| T Consensus 171 S~Evdv~~~~~~~WlEi~G~Gmv~p~vl---~~~gid~~~~g~AfG~GlERlaMl~~gi~DIR~~~~~ 235 (243) T pfam01409 171 SAEVDVYCCKGGGWIEIGGAGMVHPNVL---EAVGIDEDYPGFAFGLGVERLAMLKYGIDDIRDLYEN 235 (243) T ss_pred CCEEEEEEECCCCEEEEEECCCCCHHHH---HHCCCCCCCEEEEEEECHHHHHHHHCCCCHHHHHHHC T ss_conf 6179999964997258750366678899---8669898976999853488999998299778987626 No 43 >pfam00152 tRNA-synt_2 tRNA synthetases class II (D, K and N). Probab=61.65 E-value=2.6 Score=23.45 Aligned_cols=20 Identities=15% Similarity=0.405 Sum_probs=15.7 Q ss_pred HHHHHHHHHHHHCCCEEEEC Q ss_conf 99999999998669589611 Q gi|254780757|r 15 NIILTLTQYWAQQGCTILQP 34 (307) Q Consensus 15 ~ii~~L~~fW~~~GC~i~qp 34 (307) .|+..+-+|..++|.+-+++ T Consensus 27 ~i~~~iR~ff~~~~f~EV~T 46 (341) T pfam00152 27 KIIRAIREFLDERGFLEVET 46 (341) T ss_pred HHHHHHHHHHHHCCCEEEEC T ss_conf 99999999999889899879 No 44 >PRK11587 putative phosphatase; Provisional Probab=61.08 E-value=7.3 Score=20.18 Aligned_cols=79 Identities=22% Similarity=0.329 Sum_probs=46.9 Q ss_pred HHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEE--------CCCCHHHHHHHHHHHHHCCCCCCCCCE Q ss_conf 789874483440167641557887655466775001144445575--------798335899999999981999621717 Q gi|254780757|r 47 STTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVII--------KPNPLNLQNLYIESLKAVGIDPLIHDV 118 (307) Q Consensus 47 ~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVil--------KPsp~niq~lYl~SL~~iGid~~~hDI 118 (307) ..+|..|...-.+.|-|--+.||.-- -- .++.....|.+++ ||+| |.||.-++.+|++|. +. T Consensus 89 ~elL~~L~~~gi~~avvTs~~~~~~~-~~---~~~~~l~~~~~~v~~ddv~~~KP~P----d~yl~A~~~Lg~~p~--~~ 158 (218) T PRK11587 89 IALLNHLNKAGIPWAIVTSGSMPVAS-AR---HKAAGLPAPEVFVTAERVKRGKPEP----DAYLLGAQLLGLAPQ--EC 158 (218) T ss_pred HHHHHHHHHCCCCEEEEECCCHHHHH-HH---HHHCCCCCCCEEEECCCCCCCCCCC----HHHHHHHHHCCCCHH--HE T ss_conf 99999998779977887268278999-99---9723899786999711055789986----899999998298977--78 Q ss_pred EEEECCCCCCHHHHCCCC Q ss_conf 988328887001101232 Q gi|254780757|r 119 RFVEDNWESPTLGAWGLG 136 (307) Q Consensus 119 rFveDnWEsPtLGAwGlG 136 (307) -.|||- .+-...|..-| T Consensus 159 lvieDS-~~gi~AA~~AG 175 (218) T PRK11587 159 VVVEDA-PAGVLSGLAAG 175 (218) T ss_pred EEEECC-HHHHHHHHHCC T ss_conf 999477-88899999869 No 45 >KOG1576 consensus Probab=60.91 E-value=7.1 Score=20.27 Aligned_cols=108 Identities=25% Similarity=0.210 Sum_probs=66.2 Q ss_pred HCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCHHHHHHHHHHHHHCCCCC----CCCCEEEE--ECCC Q ss_conf 44834401676415578876554667750011444455757983358999999999819996----21717988--3288 Q gi|254780757|r 52 ALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVIIKPNPLNLQNLYIESLKAVGIDP----LIHDVRFV--EDNW 125 (307) Q Consensus 52 ~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVilKPsp~niq~lYl~SL~~iGid~----~~hDIrFv--eDnW 125 (307) ++..-|-.+-|+- -.=|||+..|+|+.-| |.+-+.+-.-+||+.|+.|- ..|||-|+ +|-= T Consensus 89 al~~vPR~aYyIa----TKvgRy~ld~~~~Fdf---------sadkvreSv~rSlerLqldyvDilqiHDvefap~ld~v 155 (342) T KOG1576 89 ALKDVPREAYYIA----TKVGRYELDYANMFDF---------SADKVRESVKRSLERLQLDYVDILQIHDVEFAPNLDIV 155 (342) T ss_pred HHHHCCHHHEEEE----EEEEECCCCCCCCCCC---------HHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCCCCHH T ss_conf 8763882343654----4332124573101451---------18999999999999857762678886220236531178 Q ss_pred CCCHHHHC------CCCEEEEECCEEEHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEEC Q ss_conf 87001101------232176887874147778998668257751456787899999998699821133206 Q gi|254780757|r 126 ESPTLGAW------GLGWECWCDGMEISQFTYFQQVCGIECSPISGEITYGLERLAMYVQNVNSVYDIVFN 190 (307) Q Consensus 126 EsPtLGAw------GlGWEvwldGMEItQFTYFQQvGGi~c~pv~~EiTYGLERiaMylQ~vdnvyDl~wn 190 (307) =+-||-|- |.---+=+-|--|.- +||=+||=.=-++-|-|-....|| T Consensus 156 l~Etlp~Le~lk~~Gk~RfiGitgypldv------------------l~~~ae~~~G~~dvvlsY~ry~l~ 208 (342) T KOG1576 156 LNETLPALEELKQEGKIRFIGITGYPLDV------------------LTECAERGKGRLDVVLSYCRYTLN 208 (342) T ss_pred HHHHHHHHHHHHHCCCEEEEEECCCCHHH------------------HHHHHHCCCCCEEEEHHHHHHCCC T ss_conf 99889999999863853674202566389------------------999875588701325523221146 No 46 >PRK10563 6-phosphogluconate phosphatase; Provisional Probab=60.66 E-value=8.7 Score=19.64 Aligned_cols=43 Identities=21% Similarity=0.417 Sum_probs=32.9 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCEEEEE Q ss_conf 79833589999999998199962171798832888700110123217688 Q gi|254780757|r 92 KPNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGWECWC 141 (307) Q Consensus 92 KPsp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGWEvwl 141 (307) ||.|+ +||..++.+|++|. +.-.|||- ++-...|..-|=.|+- T Consensus 142 KP~Pd----iyl~A~~~lgv~p~--~clviEDS-~~Gi~AA~aAGm~vi~ 184 (221) T PRK10563 142 KPDPA----LMFHAAEAMNVNVE--NCILVDDS-SAGAQSGIAAGMEVFY 184 (221) T ss_pred CCCCH----HHHHHHHHCCCCHH--HEEEEECC-HHHHHHHHHCCCEEEE T ss_conf 98809----99999998299867--84878489-9999999985996999 No 47 >COG0016 PheS Phenylalanyl-tRNA synthetase alpha subunit [Translation, ribosomal structure and biogenesis] Probab=60.03 E-value=3 Score=23.03 Aligned_cols=168 Identities=23% Similarity=0.278 Sum_probs=90.0 Q ss_pred HHHHHHHHHHHHHHCCCEEEECCCCCCCCCC--------CCHH---------------------------HHHHHCCCCC Q ss_conf 9999999999998669589611675433322--------4857---------------------------8987448344 Q gi|254780757|r 13 FQNIILTLTQYWAQQGCTILQPYDMEVGAGT--------FHPS---------------------------TTLRALGPLS 57 (307) Q Consensus 13 fq~ii~~L~~fW~~~GC~i~qpyd~e~GAgT--------~hp~---------------------------T~lr~lg~~p 57 (307) +-.+|-.+.++..+.|..+...-.+|..-=- .||| -.+....+.| T Consensus 113 l~~~~e~i~~iF~~mGF~~~~gp~IE~d~~NFDaLn~P~dHPARdmqDTFy~~~~~~~~lLRTHTs~vq~R~l~~~~~~P 192 (335) T COG0016 113 LTQTIEEIEDIFLGMGFTEVEGPEIETDFYNFDALNIPQDHPARDMQDTFYLKDDREKLLLRTHTSPVQARTLAENAKIP 192 (335) T ss_pred HHHHHHHHHHHHHHCCCEECCCCCCCCCCCCHHHHCCCCCCCCCCCCCEEEECCCCCCEEECCCCCHHHHHHHHHCCCCC T ss_conf 89999999999997496664588213202252432499999841244437876887730023668276599998479999 Q ss_pred CEEEEECCCCCCCCCCCCCCCCCCCEEEEEE-EEECC--CCHHHHHHHHHHHHHCC-CCCCCCCEEEEECC--CCCCHHH Q ss_conf 0167641557887655466775001144445-57579--83358999999999819-99621717988328--8870011 Q gi|254780757|r 58 WKAAYVQPSRRPLDGRYAENPNRLQHYYQFQ-VIIKP--NPLNLQNLYIESLKAVG-IDPLIHDVRFVEDN--WESPTLG 131 (307) Q Consensus 58 w~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQ-VilKP--sp~niq~lYl~SL~~iG-id~~~hDIrFveDn--WEsPtLG 131 (307) .++.-+.-+.|+.+-. .-.+.++||.- +++.. +-.|+.-..-+-++.++ .+. .+||.-+= |--|+.- T Consensus 193 ~k~~~~grvyR~D~~D----aTHs~~FhQiEGlvvd~~~s~~~Lkg~L~~f~~~~fg~~~---~vRfrpsyFPFTEPS~E 265 (335) T COG0016 193 IKIFSPGRVYRNDTVD----ATHSPEFHQIEGLVVDKNISFADLKGTLEEFAKKFFGEDV---KVRFRPSYFPFTEPSAE 265 (335) T ss_pred CEEECCCCEECCCCCC----CCCCHHEEEEEEEEEECCCCHHHHHHHHHHHHHHHCCCCC---CEEEECCCCCCCCCEEE T ss_conf 1584566341178877----6426101336789995895699999999999999637776---35760687898997178 Q ss_pred --HC--C-CCE-EEEECCEEEHHHHHHHHHCCCCCCCCHH-HHHHHHHHHHHHHCCCCCEEEEEEC Q ss_conf --01--2-321-7688787414777899866825775145-6787899999998699821133206 Q gi|254780757|r 132 --AW--G-LGW-ECWCDGMEISQFTYFQQVCGIECSPISG-EITYGLERLAMYVQNVNSVYDIVFN 190 (307) Q Consensus 132 --Aw--G-lGW-EvwldGMEItQFTYFQQvGGi~c~pv~~-EiTYGLERiaMylQ~vdnvyDl~wn 190 (307) .| | -|| ||.=-||==-.-+ ...|++-++++| ----|||||||..-|++++-++-=| T Consensus 266 vdv~~~~~~~WlEi~G~Gmv~P~VL---~~~G~~~~~~~GfAfGlGlERlAMLkygI~DIR~l~~~ 328 (335) T COG0016 266 VDVYCPGCGGWLEILGCGMVHPNVL---EAVGIDPEEYSGFAFGLGLERLAMLKYGIPDIRDLYEN 328 (335) T ss_pred EEEEECCCCCEEEEECCCCCCHHHH---HHCCCCCCCCEEEEEEECHHHHHHHHHCCCHHHHHHHC T ss_conf 9999768786899835654578889---74599987605888740488999998298079998746 No 48 >PRK06253 O-phosphoseryl-tRNA synthetase; Reviewed Probab=58.25 E-value=3.3 Score=22.66 Aligned_cols=127 Identities=23% Similarity=0.361 Sum_probs=86.1 Q ss_pred CCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCHHHHHHH---HHHHHHCCCCCCCCCEEEEECCCCCCHH Q ss_conf 8344016764155788765546677500114444557579833589999---9999981999621717988328887001 Q gi|254780757|r 54 GPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVIIKPNPLNLQNLY---IESLKAVGIDPLIHDVRFVEDNWESPTL 130 (307) Q Consensus 54 g~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVilKPsp~niq~lY---l~SL~~iGid~~~hDIrFveDnWEsPtL 130 (307) -+.|.+.--|.-|.|--- .+...||--|||---|+--.--|++++- -+=|+.+|. -++||..|+=.|+-- T Consensus 203 ~~~P~klFSIDRcFRrEq---~eD~~hL~~yhsascvv~~edVn~d~gkav~~~lL~~fGF----~~frFrPdek~skYY 275 (527) T PRK06253 203 KPLPFKLFSIDRCFRREQ---KEDASRLMTYHSASCVVAGEDVTVDDGKAVAEGLLSQFGF----TNFRFRPDEKRSKYY 275 (527) T ss_pred CCCCEEEEEEEHEEECHH---HCCHHHHHHHHEEEEEEECCCCCHHHHHHHHHHHHHHHCC----CEEEECCCCCCCCCC T ss_conf 799836874101022000---0245563454432269975986677779999999997097----537843442356653 Q ss_pred HHCCCCEEEEE-----C-CEEEHHHHHHHHHC----CCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEEC Q ss_conf 10123217688-----7-87414777899866----8257751456787899999998699821133206 Q gi|254780757|r 131 GAWGLGWECWC-----D-GMEISQFTYFQQVC----GIECSPISGEITYGLERLAMYVQNVNSVYDIVFN 190 (307) Q Consensus 131 GAwGlGWEvwl-----d-GMEItQFTYFQQvG----Gi~c~pv~~EiTYGLERiaMylQ~vdnvyDl~wn 190 (307) -. +.-=||.. + =.||.-|--|--.. ||++ ...-+--|+|||||-+.+-+.|-+|.+- T Consensus 276 ~P-~TqtEVy~yhp~~~gWvEvatfGIysP~aL~~ygI~~--PVmnlGlGvERLAMI~~~~~DiR~l~yp 342 (527) T PRK06253 276 TP-DTQTEVYAYHPKLDGWVEVATFGIYSPTALAEYGIDV--PVMNLGLGVERLAMILHNAEDVREMVYP 342 (527) T ss_pred CC-CCCEEEEEECCCCCCEEEEECCCCCCHHHHHHCCCCC--CEEECCCCHHHHHHHHCCCHHHHHHHCC T ss_conf 78-8632799963667862897314256754454319997--4220011189999998182778876334 No 49 >PRK13288 pyrophosphatase PpaX; Provisional Probab=57.06 E-value=11 Score=19.02 Aligned_cols=109 Identities=20% Similarity=0.334 Sum_probs=60.3 Q ss_pred HHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEE--- Q ss_conf 99999999999866958961167543332248578987448344016764155788765546677500114444557--- Q gi|254780757|r 14 QNIILTLTQYWAQQGCTILQPYDMEVGAGTFHPSTTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVI--- 90 (307) Q Consensus 14 q~ii~~L~~fW~~~GC~i~qpyd~e~GAgT~hp~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVi--- 90 (307) .+++.....|-.......++||+ |. .-+|+.|..+-.+.+-|--+.|..=-+--+ -..+.+|+.+=|. T Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~-----gv---~e~L~~L~~~g~~l~ivTn~~~~~~~~~l~-~~gl~~~Fd~iv~~dd 133 (214) T PRK13288 63 EEMITTYREFNHDHHDELVEEYE-----TV---YETLQTLKKQGYKLGIVTTKARDTVEMGLK-LTGLDKFFDVVVTLDD 133 (214) T ss_pred HHHHHHHHHHHHHHHHHHCCCCC-----CH---HHHHHHHHHCCCCEEEEECCCHHHHHHHHH-HHCCCCCCCEEEECCC T ss_conf 99999999999998887386374-----79---999999997799267753575699999999-7222144315786133 Q ss_pred ---ECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCEE Q ss_conf ---579833589999999998199962171798832888700110123217 Q gi|254780757|r 91 ---IKPNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGWE 138 (307) Q Consensus 91 ---lKPsp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGWE 138 (307) -||+| +.|+..++.+|++| .+.-+|+|.+ +--.+|.-.|=- T Consensus 134 v~~~KP~P----~~~~~a~~~l~~~p--~e~l~VGDs~-~Di~aA~~aG~~ 177 (214) T PRK13288 134 VEHAKPDP----EPVQKALELLGAKP--EEALMVGDNY-HDILAGKNAGTK 177 (214) T ss_pred CCCCCCCH----HHHHHHHHHHCCCC--CCEEEEECCH-HHHHHHHHCCCE T ss_conf 44569879----99999999939597--8689996898-999999995995 No 50 >TIGR00469 pheS_mito phenylalanyl-tRNA synthetase; InterPro: IPR004530 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Phenylalanyl-tRNA synthetase (6.1.1.20 from EC) is an alpha2/beta2 tetramer composed of 2 subunits that belongs to class IIc. In eubacteria, a small subunit (pheS gene) can be designated as beta (E. coli) or alpha subunit (nomenclature adopted in InterPro). Reciprocally the large subunit (pheT gene) can be designated as alpha (E. coli) or beta (see IPR004531 from INTERPRO and IPR004532 from INTERPRO). In all other kingdoms the two subunits have equivalent length in eukaryota, and can be identified by specific signatures. The enzyme from Thermus thermophilus has an alpha 2 beta 2 type quaternary structure and is one of the most complicated members of the synthetase family. Identification of phenylalanyl-tRNA synthetase as a member of class II aaRSs was based only on sequence alignment of the small alpha-subunit with other synthetases . This family describes the mitochondrial phenylalanyl-tRNA synthetases. Unlike all other known phenylalanyl-tRNA synthetases, the mitochondrial form demonstrated from yeast is monomeric. It is similar to but longer than the alpha subunit (PheS) of the alpha 2 beta 2 form found in bacteria, Archaea, and eukaryotes, and shares the characteristic motifs of class II aminoacyl-tRNA ligases.; GO: 0000166 nucleotide binding, 0004826 phenylalanine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006432 phenylalanyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=56.85 E-value=6.2 Score=20.69 Aligned_cols=88 Identities=20% Similarity=0.379 Sum_probs=50.5 Q ss_pred CCCCCCEEEEECC--CCCCHHHHCCCCEEEEEC-------CEEEHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 9621717988328--887001101232176887-------8741477789986682577514567878999999986998 Q gi|254780757|r 112 DPLIHDVRFVEDN--WESPTLGAWGLGWECWCD-------GMEISQFTYFQQVCGIECSPISGEITYGLERLAMYVQNVN 182 (307) Q Consensus 112 d~~~hDIrFveDn--WEsPtLGAwGlGWEvwld-------GMEItQFTYFQQvGGi~c~pv~~EiTYGLERiaMylQ~vd 182 (307) -|++-.+|.|.-= |-.|+ |-| |||-. |--|-.---.|-.|=..-..|-=-.--|||||||-|-++- T Consensus 259 tPkelK~RW~~~YFP~T~PS---WE~--E~y~~~~WLE~~GCG~~R~~~L~R~G~~~SetIGyAFG~GL~R~AM~LF~IP 333 (460) T TIGR00469 259 TPKELKVRWVDAYFPFTAPS---WEL--EVYFKDEWLELLGCGLIRQDVLLRAGVKQSETIGYAFGLGLDRLAMLLFDIP 333 (460) T ss_pred CCCCCEEEEEECCCCCCCCC---CEE--EEEECCCEEEEECCCHHHHHHHHHCCCCCCCCCCEEECCCHHHHHHHHHCCC T ss_conf 74200155750338898884---058--9888376556515514677898845897333220000100889998873488 Q ss_pred CEEEEEECCCCC--CCCCCCHHHHH Q ss_conf 211332068887--88311062447 Q gi|254780757|r 183 SVYDIVFNAIEG--QNVLYGDIFAQ 205 (307) Q Consensus 183 nvyDl~wn~~~~--~~vtYGdif~q 205 (307) .|- |.|+.-++ ...+-||++.- T Consensus 334 DIR-LlWS~D~~F~~QFS~~dlh~~ 357 (460) T TIGR00469 334 DIR-LLWSRDERFLRQFSKKDLHLL 357 (460) T ss_pred CEE-EEEECCCCHHHHHCCHHHHCC T ss_conf 615-662045302333111023215 No 51 >PRK13229 consensus Probab=55.95 E-value=14 Score=18.04 Aligned_cols=81 Identities=17% Similarity=0.161 Sum_probs=42.1 Q ss_pred HHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEE------ECCCCHHHHHHHHHHHHHCCCCCCCCCEEEE Q ss_conf 8987448344016764155788765546677500114444557------5798335899999999981999621717988 Q gi|254780757|r 48 TTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVI------IKPNPLNLQNLYIESLKAVGIDPLIHDVRFV 121 (307) Q Consensus 48 T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVi------lKPsp~niq~lYl~SL~~iGid~~~hDIrFv 121 (307) .+|..|...-++.+-|--+.|+.--+.-+. ..+.+|+.+-|- -||.| +.|+..++.+|++|. +.-+| T Consensus 101 e~L~~L~~~G~~laI~Tn~~~~~~~~~l~~-~gl~~~F~~i~~~d~~~~~KP~P----~~~~~al~~l~~~p~--~~l~V 173 (234) T PRK13229 101 EAMDRFASAGYQLAVCTNKYEELSVKLLES-LGLAARFAAICGGDTFSWRKPDP----RHLTETIARAGGDRD--RALMV 173 (234) T ss_pred HHHHHHHHCCCCEEECCCCHHHHHHHHHHC-CCCHHHHCEEEECCCCCCCCCCH----HHHHHHHHHHCCCCC--CEEEE T ss_conf 999999977997452169707888877521-36212303676135577899889----999999998199963--67999 Q ss_pred ECCCCCCHHHHCCCC Q ss_conf 328887001101232 Q gi|254780757|r 122 EDNWESPTLGAWGLG 136 (307) Q Consensus 122 eDnWEsPtLGAwGlG 136 (307) +|-. +--.+|.--| T Consensus 174 GDs~-~Di~aA~~AG 187 (234) T PRK13229 174 GDSR-TDIDTAKAAG 187 (234) T ss_pred ECCH-HHHHHHHHCC T ss_conf 2689-8999999969 No 52 >pfam06046 Sec6 Exocyst complex component Sec6. Sec6 is a component of the multiprotein exocyst complex. Sec6 interacts with Sec8, Sec10 and Exo70.These exocyst proteins localize to regions of active exocytosis-at the growing ends of interphase cells and in the medial region of cells undergoing cytokinesis-in an F-actin-dependent and exocytosis- independent manner. Probab=55.32 E-value=18 Score=17.41 Aligned_cols=88 Identities=16% Similarity=0.184 Sum_probs=48.1 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHH-----HHHHHCCCCCCCCCCCCCCCHHHHHH------HHHHHHHHHHHCCC Q ss_conf 69977763501798999999999999999-----99850066532335530138289999------99999878883875 Q gi|254780757|r 207 EQEYSRYNFEYANPEILHNHFIDSEKECL-----DLLKKGIPNEHHRHHLCVFPAYDQCI------KASHIFNLLDARGV 275 (307) Q Consensus 207 E~E~S~YNFE~Ad~~~L~~~F~~yE~Ea~-----rLle~~LP~~~~~~~~lvlPAYD~~L------KaSH~FNLLDARGa 275 (307) +..-+.-.-=..|.+.++..|..+...+. +.++.-..-.+.+....+...|.-++ .-+|+=.+|.+||= T Consensus 443 ~~~~~~a~~i~~D~~~l~~~F~~~~~~~~~~~~~~~l~~l~~ll~~~d~~~i~~~~~~l~~~ypD~~~~~v~alL~~R~D 522 (556) T pfam06046 443 EERDQFAEKMKRDAEQLFDFFQKLGDAAYLLSKFRVLEKLLDLLELEDPDALVLEVETLLSSYPDISEDHVEAVLKARGD 522 (556) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCC T ss_conf 88899999999999999999986599887852789999999998589877899999999984899999999999986579 Q ss_pred CHHHHHHHHHHHHHHHHHH Q ss_conf 0178899999999999999 Q gi|254780757|r 276 ISTTERQRYILRIRSLAKA 294 (307) Q Consensus 276 ISVTERq~YI~RIR~Lak~ 294 (307) +|-++++.-+.+.|.+.+. T Consensus 523 ~~rs~~k~i~~~~~~~~~~ 541 (556) T pfam06046 523 LDKSMVKALLSRLREIVKS 541 (556) T ss_pred CCHHHHHHHHHHHHHHHHC T ss_conf 9999999999999998746 No 53 >PRK13227 consensus Probab=53.84 E-value=15 Score=17.97 Aligned_cols=84 Identities=24% Similarity=0.341 Sum_probs=50.3 Q ss_pred HHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEE------ECCCCHHHHHHHHHHHHHCCCCCCCCCEEE Q ss_conf 78987448344016764155788765546677500114444557------579833589999999998199962171798 Q gi|254780757|r 47 STTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVI------IKPNPLNLQNLYIESLKAVGIDPLIHDVRF 120 (307) Q Consensus 47 ~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVi------lKPsp~niq~lYl~SL~~iGid~~~hDIrF 120 (307) ..+|+.|-.+-.+++-|--+.|..=-..- +-.-|.+|+.+=|- -||.|+ .|+..++.+|++|. +.-+ T Consensus 102 ~~~L~~Lk~~g~~~~ivTn~~~~~~~~~l-~~~gl~~~Fd~iv~~ddv~~~KP~P~----~~~~~~~~l~~~p~--~~l~ 174 (228) T PRK13227 102 KETLEALKAQGYPLALVTNKPTPFVAPLL-EALGIADYFSVVIGGDSLPNKKPHPA----PLLLVCEKLGIAPE--QMLF 174 (228) T ss_pred HHHHHHHHHCCCCEEEEECCHHHHHHHHH-HHCCCHHHHHHHHCCCCCCCCCCCHH----HHHHHHHHHCCCCC--EEEE T ss_conf 99999999879965788457067666788-86093554344300333577799759----99999998196975--4999 Q ss_pred EECCCCCCHHHHCCCCEE Q ss_conf 832888700110123217 Q gi|254780757|r 121 VEDNWESPTLGAWGLGWE 138 (307) Q Consensus 121 veDnWEsPtLGAwGlGWE 138 (307) |+|. .+-..+|..-|.- T Consensus 175 VGDs-~~Di~aA~~AG~~ 191 (228) T PRK13227 175 VGDS-RNDILAAKAAGCP 191 (228) T ss_pred ECCC-HHHHHHHHHCCCC T ss_conf 7778-8899999996990 No 54 >PRK13228 consensus Probab=52.57 E-value=16 Score=17.78 Aligned_cols=80 Identities=23% Similarity=0.369 Sum_probs=36.2 Q ss_pred HHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEE------ECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEE Q ss_conf 987448344016764155788765546677500114444557------57983358999999999819996217179883 Q gi|254780757|r 49 TLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVI------IKPNPLNLQNLYIESLKAVGIDPLIHDVRFVE 122 (307) Q Consensus 49 ~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVi------lKPsp~niq~lYl~SL~~iGid~~~hDIrFve 122 (307) +|..|...-++.+-|--+.|+.--+- -.-..|.+|+.+-|- -||+| +.|+..++.+|++| ++.-+|+ T Consensus 104 ~L~~L~~~g~~l~ivT~~~~~~~~~~-l~~~gl~~~Fd~iv~~d~v~~~KP~P----~~~l~a~~~lg~~p--~~~v~Vg 176 (232) T PRK13228 104 GLALLKALGYRLACITNKPEILAVPL-LKQTGLADYFELILGGDSLPEKKPDP----LPLLHACEVLGIDP--ANMVLVG 176 (232) T ss_pred HHHHHHHCCCCEEEEECCCHHHHHHH-HHHCCCCCCCEEEEECCCCCCCCCCH----HHHHHHHHHHCCCC--CCEEEEC T ss_conf 99999977996899808968999998-75302333310553043367889879----99999999929793--4089884 Q ss_pred CCCCCCHHHHCCCC Q ss_conf 28887001101232 Q gi|254780757|r 123 DNWESPTLGAWGLG 136 (307) Q Consensus 123 DnWEsPtLGAwGlG 136 (307) |.- +--.+|..-| T Consensus 177 Ds~-~di~aA~~AG 189 (232) T PRK13228 177 DSA-NDVIAARAAG 189 (232) T ss_pred CCH-HHHHHHHHCC T ss_conf 888-7999999959 No 55 >COG3916 LasI N-acyl-L-homoserine lactone synthetase [Signal transduction mechanisms / Secondary metabolites biosynthesis, transport, and catabolism] Probab=52.20 E-value=4.7 Score=21.59 Aligned_cols=17 Identities=53% Similarity=1.249 Sum_probs=13.9 Q ss_pred CCEEEEE-CCEEEHHHHH Q ss_conf 3217688-7874147778 Q gi|254780757|r 135 LGWECWC-DGMEISQFTY 151 (307) Q Consensus 135 lGWEvwl-dGMEItQFTY 151 (307) |||+|-| +||||-||-= T Consensus 32 L~W~v~~~~g~E~DqyD~ 49 (209) T COG3916 32 LGWDVVCIDGFEIDQYDN 49 (209) T ss_pred CCCCEECCCCCCCCCCCC T ss_conf 477103168844014678 No 56 >PRK09456 phosphatase; Provisional Probab=51.37 E-value=15 Score=17.94 Aligned_cols=50 Identities=28% Similarity=0.412 Sum_probs=39.8 Q ss_pred EEEEE-EECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCEEEEE Q ss_conf 44455-7579833589999999998199962171798832888700110123217688 Q gi|254780757|r 85 YQFQV-IIKPNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGWECWC 141 (307) Q Consensus 85 ~QfQV-ilKPsp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGWEvwl 141 (307) +-+.| .+||.| ++|..-|+..|++|. ++-|+.|.=|| .-+|--+|+.-.+ T Consensus 133 ~S~~~g~~KPd~----~IY~~~l~~~~l~P~--e~lFiDD~~eN-v~aA~~lGi~~i~ 183 (199) T PRK09456 133 LSQDLGMRKPEA----RIYQHVLQAEGFSAA--DAVFFDDNADN-IEGANQLGITSIL 183 (199) T ss_pred EEEECCCCCCCH----HHHHHHHHHCCCCHH--HEEEECCCHHH-HHHHHHCCCEEEE T ss_conf 964648868979----999999998297966--86886599888-9999986997999 No 57 >pfam01808 AICARFT_IMPCHas AICARFT/IMPCHase bienzyme. This is a family of bifunctional enzymes catalysing the last two steps in de novo purine biosynthesis. The bifunctional enzyme is found in both prokaryotes and eukaryotes. The second last step is catalysed by 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase EC:2.1.2.3 (AICARFT), this enzyme catalyses the formylation of AICAR with 10-formyl-tetrahydrofolate to yield FAICAR and tetrahydrofolate. The last step is catalysed by IMP (Inosine monophosphate) cyclohydrolase EC:3.5.4.10 (IMPCHase), cyclizing FAICAR (5-formylaminoimidazole-4-carboxamide ribonucleotide) to IMP. Probab=51.05 E-value=9.2 Score=19.46 Aligned_cols=87 Identities=22% Similarity=0.197 Sum_probs=42.4 Q ss_pred EECCCCCCCCCCCCCCCCCCCEEEE-----------EEEE-EC-CCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCC Q ss_conf 6415578876554667750011444-----------4557-57-983358999999999819996217179883288870 Q gi|254780757|r 62 YVQPSRRPLDGRYAENPNRLQHYYQ-----------FQVI-IK-PNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESP 128 (307) Q Consensus 62 YvqPsrRP~DgRYGeNPNRlq~y~Q-----------fQVi-lK-Psp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsP 128 (307) ..-+-.+-.|=||||||.----+|. +.++ =| +|-.|+.|+.-. ++.+. ++.. +--.|==.--|| T Consensus 70 ~~~~~~k~~~LRYGENPHQ~Aa~Y~~~~~~~~~~~~~~qL~GKeLSYNNilD~daA-~~lv~-ef~~-~Pa~vIiKH~NP 146 (315) T pfam01808 70 FTLPGVRKQSLRYGENPHQQAAFYVDPTQKEGSIAPAEQLHGKEMSYNNLLDADAA-WRLVK-EFEN-QPAAAIVKHANP 146 (315) T ss_pred EEECCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCEEECCCCCCCCCCHHHHHHH-HHHHH-HCCC-CCEEEEEECCCC T ss_conf 10055223577478880451258456687766641236426887884537657899-72475-3589-975999943888 Q ss_pred HHHHCCCCEEEEECCEEEHHHHHHHHHCCCCCCCCH Q ss_conf 011012321768878741477789986682577514 Q gi|254780757|r 129 TLGAWGLGWECWCDGMEISQFTYFQQVCGIECSPIS 164 (307) Q Consensus 129 tLGAwGlGWEvwldGMEItQFTYFQQvGGi~c~pv~ 164 (307) ---|-| -+...=|+.. ++|+|+| T Consensus 147 CGvA~~-----------~~~~~Ay~~A--~~~Dp~S 169 (315) T pfam01808 147 CGVAVG-----------PSIAEAYARA--READPMS 169 (315) T ss_pred CEECCC-----------CCHHHHHHHH--HHCCCCC T ss_conf 621279-----------8999999999--8608734 No 58 >PTZ00326 phenylalanyl-tRNA synthetase; Provisional Probab=49.64 E-value=4.9 Score=21.45 Aligned_cols=124 Identities=18% Similarity=0.218 Sum_probs=65.7 Q ss_pred CCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCHHHHH---HHHHHHHHCCCCCCCCCEEEEECCC--CCCHHH Q ss_conf 40167641557887655466775001144445575798335899---9999999819996217179883288--870011 Q gi|254780757|r 57 SWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVIIKPNPLNLQN---LYIESLKAVGIDPLIHDVRFVEDNW--ESPTLG 131 (307) Q Consensus 57 pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVilKPsp~niq~---lYl~SL~~iGid~~~hDIrFveDnW--EsPtLG 131 (307) |-++-.+--|.|+..-. +..+-.+||.--++-----++-+ ..-.=.+.+|++ +|||.-.-. --|++- T Consensus 368 P~k~fsi~rv~R~e~~d----~th~~~f~Q~eg~~~~~~~~~~~l~~~l~~f~~~~g~~----~~rfrp~yfpftePs~e 439 (505) T PTZ00326 368 PKKYFSIDRVFRNETLD----ATHLAEFHQVEGVVIDRNLSLGDLMGTLREFFRRIGIS----KLRFKPAFNPYTEPSME 439 (505) T ss_pred CCEEEECCCEEECCCCC----CCCCCEEEEEEEEEECCCCCHHHHHHHHHHHHHHHCCC----CEEECCCCCCCCCCCEE T ss_conf 81475237264157777----65340455435789648888999999999999984987----57868888898998278 Q ss_pred --HC--CCC-E-EEEECCEEEHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECC Q ss_conf --01--232-1-768878741477789986682577514567878999999986998211332068 Q gi|254780757|r 132 --AW--GLG-W-ECWCDGMEISQFTYFQQVCGIECSPISGEITYGLERLAMYVQNVNSVYDIVFNA 191 (307) Q Consensus 132 --Aw--GlG-W-EvwldGMEItQFTYFQQvGGi~c~pv~~EiTYGLERiaMylQ~vdnvyDl~wn~ 191 (307) ++ |+| | ||-=-||==-. .+ .-+|++.+-...---.|+||+||-.-+++++-||-=++ T Consensus 440 ~~~~~~~~~~w~Ei~g~Gm~~p~--vl-~~~g~~~~~~~~a~G~g~eR~am~~~gi~dir~l~~~~ 502 (505) T PTZ00326 440 IFGYHPQLKKWVEVGNSGLFRPE--ML-RPMGFPEDVTVIAWGLSLERPTMIKYGISNIRDLFGPK 502 (505) T ss_pred EEEEECCCCCEEEEECCCCCCHH--HH-HHCCCCCCCEEEEEEECHHHHHHHHHCCCHHHHHHCCC T ss_conf 99970788847998266575878--87-32595999848998623889999983985788974567 No 59 >TIGR01993 Pyr-5-nucltdase pyrimidine 5'-nucleotidase; InterPro: IPR010237 This family of proteins includes the SDT1/SSM1 gene from yeast, which has been shown to code for a pyrimidine (UMP/CMP) 5'nucleotidase. The family spans plants, fungi and bacteria. These enzymes are members of the haloacid dehalogenase (HAD) superfamily of hydrolases, specifically the IA subfamily.. Probab=48.57 E-value=12 Score=18.54 Aligned_cols=110 Identities=25% Similarity=0.311 Sum_probs=61.0 Q ss_pred CCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHCCCCCCE--EEEECCCCCCCCCCCCCCCCCCCEEEEEE Q ss_conf 7799999999999986695896116754333224857898744834401--67641557887655466775001144445 Q gi|254780757|r 11 LSFQNIILTLTQYWAQQGCTILQPYDMEVGAGTFHPSTTLRALGPLSWK--AAYVQPSRRPLDGRYAENPNRLQHYYQFQ 88 (307) Q Consensus 11 ~~fq~ii~~L~~fW~~~GC~i~qpyd~e~GAgT~hp~T~lr~lg~~pw~--~aYvqPsrRP~DgRYGeNPNRlq~y~QfQ 88 (307) ...+++|.+|-+-=..+-|+|.-=-| -.|-.-.|+.||=+-.- +-|.+-.+--+ +||.|--- =-.+| T Consensus 89 p~L~~~L~~LpqsGK~~Rk~iFTN~~------~~Ha~r~l~~LGi~d~FD~i~~~~~~~~~l---fGeaP~ss--dd~~~ 157 (205) T TIGR01993 89 PELRNLLLRLPQSGKKGRKIIFTNGD------RAHARRALNRLGIEDCFDGIFDIDTANPDL---FGEAPYSS--DDETL 157 (205) T ss_pred HHHHHHHHHHHHCCCCCCEEEEECCC------HHHHHHHHHHCCHHHCCCCEEEHHHHCHHH---CCCCCCCC--CCCCC T ss_conf 88999999734126555567761587------899999998647212046423051204123---16888788--88763 Q ss_pred EEECCCCHHHHHHHHHHHHHCC-CCCCCCCEEEEECCCCCCHHHHCCCCEE Q ss_conf 5757983358999999999819-9962171798832888700110123217 Q gi|254780757|r 89 VIIKPNPLNLQNLYIESLKAVG-IDPLIHDVRFVEDNWESPTLGAWGLGWE 138 (307) Q Consensus 89 VilKPsp~niq~lYl~SL~~iG-id~~~hDIrFveDnWEsPtLGAwGlGWE 138 (307) +|.||+|+ -|..-++..| .||.. --|+||=-.|=+ .|--|||- T Consensus 158 ~~~KP~~~----ay~~~~~~~~~vd~~~--~~f~DDS~rNi~-~~kaLGmk 201 (205) T TIGR01993 158 LIPKPSPE----AYEKALREAGVVDPER--AIFFDDSARNIA-AAKALGMK 201 (205) T ss_pred CCCCCCHH----HHHHHHHHHCCCCCCC--EEEECCCHHHHH-HHHHCCCC T ss_conf 12088889----9999999855887200--175304687888-78870661 No 60 >PRK10725 fructose-1-phosphatase; Provisional Probab=47.42 E-value=20 Score=17.05 Aligned_cols=27 Identities=26% Similarity=0.612 Sum_probs=16.1 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECC Q ss_conf 798335899999999981999621717988328 Q gi|254780757|r 92 KPNPLNLQNLYIESLKAVGIDPLIHDVRFVEDN 124 (307) Q Consensus 92 KPsp~niq~lYl~SL~~iGid~~~hDIrFveDn 124 (307) ||.| +.||.-++.+|++|. +.-.|||- T Consensus 142 KP~P----d~yl~a~~~lg~~p~--~clvieDS 168 (188) T PRK10725 142 KPAP----DTFLLCAQRMGVQPT--QCVVFEDA 168 (188) T ss_pred CCCC----HHHHHHHHHHCCCHH--HEEEEECC T ss_conf 9886----999999999196977--96879389 No 61 >COG0138 PurH AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) [Nucleotide transport and metabolism] Probab=47.24 E-value=20 Score=16.98 Aligned_cols=61 Identities=18% Similarity=0.153 Sum_probs=33.3 Q ss_pred HHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCHHHHHHHHHHHHH Q ss_conf 78987448344016764155788765546677500114444557579833589999999998 Q gi|254780757|r 47 STTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVIIKPNPLNLQNLYIESLKA 108 (307) Q Consensus 47 ~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVilKPsp~niq~lYl~SL~~ 108 (307) +.+|+.+-+......+...-.|-.+=||||||.----+|+.--. +|+..+.+.|-=+-|.. T Consensus 187 ~~~~~~~~~~~fp~~~~~~~~~~~~lRYGENPHQ~aa~Y~~~~~-~~~va~a~qL~GK~lSY 247 (515) T COG0138 187 ANYFSELLGGEFPEQLNLSFTKKQDLRYGENPHQSAAFYRDGNA-KGGVATAKQLQGKELSY 247 (515) T ss_pred HHHHHHHCCCCCCHHEECCCCCCEEEECCCCCCCCCEEEECCCC-CCCHHHHHHHCCCCCCC T ss_conf 99997531133641110243200010057897778747841787-87603478755972542 No 62 >pfam10775 ATP_sub_h ATP synthase complex subunit h. Subunit h is a component of the yeast mitochondrial F1-F0 ATP synthase. It is essential for the correct assembly and functioning of this enzyme. Subunit h occupies a central place in the peripheral stalk between the F1 sector and the membrane. Probab=46.24 E-value=12 Score=18.52 Aligned_cols=20 Identities=40% Similarity=0.529 Sum_probs=16.0 Q ss_pred CHHHHHHHHHHHHHCCCCCC Q ss_conf 33589999999998199962 Q gi|254780757|r 95 PLNLQNLYIESLKAVGIDPL 114 (307) Q Consensus 95 p~niq~lYl~SL~~iGid~~ 114 (307) -+-||||||..|++--.-+. T Consensus 10 ~d~vQDLYlrELKayKp~p~ 29 (67) T pfam10775 10 ADLVQDLYLKELKAYKPPPL 29 (67) T ss_pred HHHHHHHHHHHHHHCCCCCC T ss_conf 78889999999986389999 No 63 >cd00418 GlxRS_core Glutamyl-tRNA synthetase(GluRS)/Glutaminyl-tRNA synthetase (GlnRS) cataytic core domain. These enzymes attach Glu or Gln, respectively, to the appropriate tRNA. Like other class I tRNA synthetases, they aminoacylate the 2'-OH of the nucleotide at the 3' end of the tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains the characteristic class I HIGH and KMSKS motifs, which are involved in ATP binding. These enzymes function as monomers. Archaea, cellular organelles, and some bacteria lack GlnRS. In these cases, the "non-descriminating" form of GluRS aminoacylates both tRNA(Glu) and tRNA(Gln) with Glu, which is converted to Gln when appropriate by a transamidation enzyme. The descriminating form of GluRS differs from GlnRS and the non-descriminating form of GluRS in their C-terminal anti-codon binding domains. Probab=46.21 E-value=7.4 Score=20.13 Aligned_cols=49 Identities=18% Similarity=0.358 Sum_probs=28.6 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHH-------HHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 999999999999985006653233553013828999-------999999878883875017889999999 Q gi|254780757|r 225 NHFIDSEKECLDLLKKGIPNEHHRHHLCVFPAYDQC-------IKASHIFNLLDARGVISTTERQRYILR 287 (307) Q Consensus 225 ~~F~~yE~Ea~rLle~~LP~~~~~~~~lvlPAYD~~-------LKaSH~FNLLDARGaISVTERq~YI~R 287 (307) +.|+.|.+-+.+|++++. +|.|+.+ ++-+|+.-=.| -++-|+||.+|.+ T Consensus 76 ~r~~~y~~~~~~Li~~G~-----------~PtY~fa~vVDD~~~gITHViRG~D---~~~~t~~q~~l~~ 131 (223) T cd00418 76 DRFDIYYEYAEKLIEEGG-----------YPTYDFAVVVDDHLMGITHVLRGEE---HLDNTPKQIWLYE 131 (223) T ss_pred HHHHHHHHHHHHHHHCCC-----------CCCEEEEEEECCCCCCCCEEECCHH---HHHCCHHHHHHHH T ss_conf 589999999999975699-----------8140102585376578875654627---7646799999999 No 64 >pfam07805 HipA_N HipA-like N-terminal domain. The members of this family are similar to a region close to the N-terminus of the HipA protein expressed by various bacterial species. This protein is known to be involved in high-frequency persistence to the lethal effects of inhibition of either DNA or peptidoglycan synthesis. When expressed alone, it is toxic to bacterial cells, but it is usually tightly associated with HipB, and the HipA-HipB complex may be involved in autoregulation of the hip operon. The hip proteins may be involved in cell division control and may interact with cell division genes or their products. Probab=45.50 E-value=25 Score=16.36 Aligned_cols=49 Identities=18% Similarity=0.142 Sum_probs=33.9 Q ss_pred CCCCCCCCCCCCEEEEEEEEECC-CCHH------HHHHHHHHHHHCCCCCCCCCEEEEEC Q ss_conf 65546677500114444557579-8335------89999999998199962171798832 Q gi|254780757|r 71 DGRYAENPNRLQHYYQFQVIIKP-NPLN------LQNLYIESLKAVGIDPLIHDVRFVED 123 (307) Q Consensus 71 DgRYGeNPNRlq~y~QfQVilKP-sp~n------iq~lYl~SL~~iGid~~~hDIrFveD 123 (307) ||++....+..... .|+|| ++.+ .-.+|+.-.+++||+...+.+.-.++ T Consensus 15 ~~~~~~~~~~~~~~----~IvK~~~~~~~~~~~~nE~~~m~lA~~~Gi~v~~~~l~~~~~ 70 (81) T pfam07805 15 DGRFALPADGGGST----WIVKFPSGDDLPDLVENEYLCMRLARAAGLEVPETRLIQSGG 70 (81) T ss_pred CCEEEECCCCCCCC----EEEECCCCCCCCCHHHHHHHHHHHHHHCCEEECCCEEEEECC T ss_conf 99087247999877----899899867650429999999999998994947523999799 No 65 >PRK10826 2-deoxyglucose-6-phosphatase; Provisional Probab=44.39 E-value=21 Score=16.90 Aligned_cols=70 Identities=21% Similarity=0.333 Sum_probs=40.8 Q ss_pred HHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEE------EECCCCHHHHHHHHHHHHHCCCCCCCCCEEEE Q ss_conf 898744834401676415578876554667750011444455------75798335899999999981999621717988 Q gi|254780757|r 48 TTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQV------IIKPNPLNLQNLYIESLKAVGIDPLIHDVRFV 121 (307) Q Consensus 48 T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQV------ilKPsp~niq~lYl~SL~~iGid~~~hDIrFv 121 (307) -+|..|....++.+-|--+.|..=-+-= .-..|.+|+.+-| .-||.| +.||.-++.+|++|. +.-.| T Consensus 99 e~L~~l~~~g~~l~i~Ts~~~~~~~~~L-~~~gl~~~Fd~iv~~ddv~~~KP~P----e~yl~A~~~lg~~p~--e~lvv 171 (222) T PRK10826 99 EALALCKAQGLKIGLASASPLHMLEAVL-TMLDLRDYFDALASAEKLPYSKPHP----EVYLNCAAKLGVDPL--TCVAL 171 (222) T ss_pred HHHHHHHHCCCCEEEECCCCHHHHHHHH-HHCCCCCCEEEEECCCCCCCCCCCH----HHHHHHHHHHCCCHH--HEEEE T ss_conf 9999999779975886288489999999-8749964110576353256779984----999999999598988--96878 Q ss_pred ECC Q ss_conf 328 Q gi|254780757|r 122 EDN 124 (307) Q Consensus 122 eDn 124 (307) ||- T Consensus 172 eDS 174 (222) T PRK10826 172 EDS 174 (222) T ss_pred CCC T ss_conf 389 No 66 >TIGR00457 asnS asparaginyl-tRNA synthetase; InterPro: IPR004522 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . AsparaginyltRNA synthetase (6.1.1.22 from EC) is an alpha2 dimer that belongs to class IIb. There is a striking similarity between asparaginyl-tRNA synthetases and archaeal/eukaryotic type aspartyl-tRNA synthetases (IPR004523 from INTERPRO) and a striking divergence of bacterial type aspartyl-tRNA synthetases (IPR004524 from INTERPRO). This family, AsnS, represents asparaginyl-tRNA synthetases from the three domains of life. Some species lack this enzyme and charge tRNA(asn) by misacylation with Asp, followed by transamidation of Asp to Asn.; GO: 0000166 nucleotide binding, 0004816 asparagine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006421 asparaginyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=44.25 E-value=14 Score=18.17 Aligned_cols=20 Identities=30% Similarity=0.725 Sum_probs=14.2 Q ss_pred HHHHHHHHHHCCCCCEEEEE Q ss_conf 78999999986998211332 Q gi|254780757|r 169 YGLERLAMYVQNVNSVYDIV 188 (307) Q Consensus 169 YGLERiaMylQ~vdnvyDl~ 188 (307) -|+|||--|+-|+.||-|.+ T Consensus 465 LGfERl~ay~~G~~niRD~I 484 (495) T TIGR00457 465 LGFERLVAYITGLENIRDAI 484 (495) T ss_pred HHHHHHHHHHHCCCCCCCCC T ss_conf 24899999981787444166 No 67 >KOG2623 consensus Probab=43.97 E-value=26 Score=16.20 Aligned_cols=137 Identities=17% Similarity=0.204 Sum_probs=93.3 Q ss_pred CCEEEHHHHH--HH---------------HHCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCC-CCCCCCCC--- Q ss_conf 7874147778--99---------------866825775145678789999999869982113320688-87883110--- Q gi|254780757|r 142 DGMEISQFTY--FQ---------------QVCGIECSPISGEITYGLERLAMYVQNVNSVYDIVFNAI-EGQNVLYG--- 200 (307) Q Consensus 142 dGMEItQFTY--FQ---------------QvGGi~c~pv~~EiTYGLERiaMylQ~vdnvyDl~wn~~-~~~~vtYG--- 200 (307) +||--|-||| +| |+||-|- -|-||-|.|=|---.+....||-|..-=. ++.|-+.| T Consensus 202 ~GlSftEFtYQ~lQAYDfy~L~~~~g~~~QlGGsDQ---wGNitaG~dlI~ki~~~~~~vfGlT~PLlTsstG~KlGKSa 278 (467) T KOG2623 202 NGLSFTEFTYQLLQAYDFYHLYENYGCRFQLGGSDQ---WGNITAGTDLIRKIMPIQAFVFGLTFPLLTSSTGAKLGKSA 278 (467) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHCCEEEEECCCCC---CCCCCHHHHHHHHHCCCCCCEEEEEEEEEECCCCHHHCCCC T ss_conf 887499999999998769999986485698516432---36622078999985203134003566557657511323477 Q ss_pred --HHHHHHH------HHHHHHHHHHCCHHHHHHHHHHHHHHHHH-HHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf --6244746------99777635017989999999999999999-98500665323355301382899999999987888 Q gi|254780757|r 201 --DIFAQSE------QEYSRYNFEYANPEILHNHFIDSEKECLD-LLKKGIPNEHHRHHLCVFPAYDQCIKASHIFNLLD 271 (307) Q Consensus 201 --dif~q~E------~E~S~YNFE~Ad~~~L~~~F~~yE~Ea~r-Lle~~LP~~~~~~~~lvlPAYD~~LKaSH~FNLLD 271 (307) -|++..- .=+=.++--.++++.++++|-.---|-.+ +++.... . .-+-+-|-+-|+-+--++- T Consensus 279 GnAvWLdp~~tspy~lYQfF~~~pDd~v~k~LklfTfl~l~eI~~I~~~H~k------~--P~~r~aQ~~LA~eVTr~VH 350 (467) T KOG2623 279 GNAVWLDPSKTSPYHLYQFFASLPDDDVEKFLKLFTFLPLEEIKQILEEHRK------E--PSQRIAQKLLAAEVTRMVH 350 (467) T ss_pred CCEEEECCCCCCCHHHHHHHHHCCHHHHHHHHHHHHCCCHHHHHHHHHHHHC------C--HHHHHHHHHHHHHHHHHHC T ss_conf 8628846766881799999985864679999999836878999999998740------8--6655689999999999970 Q ss_pred HCCCCHHHHHHHHHHHHH Q ss_conf 387501788999999999 Q gi|254780757|r 272 ARGVISTTERQRYILRIR 289 (307) Q Consensus 272 ARGaISVTERq~YI~RIR 289 (307) -++..+++||+.-+++=+ T Consensus 351 G~egL~~A~r~T~al~g~ 368 (467) T KOG2623 351 GKEGLEVAERCTKALFGA 368 (467) T ss_pred CCCHHHHHHHHHHHHHCC T ss_conf 500488999999975066 No 68 >COG2024 Phenylalanyl-tRNA synthetase alpha subunit (archaeal type) [Translation, ribosomal structure and biogenesis] Probab=43.25 E-value=9.4 Score=19.40 Aligned_cols=121 Identities=23% Similarity=0.389 Sum_probs=72.6 Q ss_pred CCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEECC---CCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCC---- Q ss_conf 344016764155788765546677500114444557579---8335899999999981999621717988328887---- Q gi|254780757|r 55 PLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVIIKP---NPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWES---- 127 (307) Q Consensus 55 ~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVilKP---sp~niq~lYl~SL~~iGid~~~hDIrFveDnWEs---- 127 (307) +-|.+.--+.-|.|-.. +|.-.||..||----++-- +-++-...--.-|+..|. .+.||+-|.=-| T Consensus 205 ~~PlklFSIDRCFRREQ---~ED~shLmtYhSASCVvvde~vtvD~GKaVAEglL~qfGF----e~F~FrpDEK~SKYYv 277 (536) T COG2024 205 DPPLKLFSIDRCFRREQ---REDASHLMTYHSASCVVVDEDVTVDDGKAVAEGLLRQFGF----EKFRFRPDEKKSKYYV 277 (536) T ss_pred CCCCEEEEHHHHHHHHH---HCCHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHHHHCC----CCEEECCCCCCCCCCC T ss_conf 89841642567766553---1015666432034079975864435438999999997194----0035466200355347 Q ss_pred -----------CHHH----HCCCCEEEEECCEEEHHHHHHHHH----CCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEE Q ss_conf -----------0011----012321768878741477789986----682577514567878999999986998211332 Q gi|254780757|r 128 -----------PTLG----AWGLGWECWCDGMEISQFTYFQQV----CGIECSPISGEITYGLERLAMYVQNVNSVYDIV 188 (307) Q Consensus 128 -----------PtLG----AwGlGWEvwldGMEItQFTYFQQv----GGi~c~pv~~EiTYGLERiaMylQ~vdnvyDl~ 188 (307) |-|+ -.+-|| .||.-|--+--+ =||+|-. --+--|.||+||.|-+.|.|-.+. T Consensus 278 P~TQTEVyAyHPkL~gs~~kysdgW------iEiATFGlYSP~ALaeY~Id~pV--MNLGlGVERlaMIl~g~~DVR~mv 349 (536) T COG2024 278 PGTQTEVYAYHPKLVGSIEKYSDGW------IEIATFGLYSPIALAEYGIDYPV--MNLGLGVERLAMILHGADDVRSMV 349 (536) T ss_pred CCCCCEEEEECCCCCCCCCCCCCCC------EEEEEECCCCHHHHHHCCCCCCE--EECCHHHHHHHHHHHCCHHHHHHH T ss_conf 9963047774540036521258870------89996035672779872999701--203220899999981744776651 Q ss_pred EC Q ss_conf 06 Q gi|254780757|r 189 FN 190 (307) Q Consensus 189 wn 190 (307) |- T Consensus 350 Yp 351 (536) T COG2024 350 YP 351 (536) T ss_pred CC T ss_conf 34 No 69 >pfam06159 DUF974 Protein of unknown function (DUF974). Family of uncharacterized eukaryotic proteins. Probab=43.05 E-value=20 Score=17.05 Aligned_cols=27 Identities=19% Similarity=0.572 Sum_probs=19.5 Q ss_pred CCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEECC Q ss_conf 4016764155788765546677500114444557579 Q gi|254780757|r 57 SWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVIIKP 93 (307) Q Consensus 57 pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVilKP 93 (307) -+.|.|..| ||| +.-+.++|||||. || T Consensus 82 vC~V~Y~~~-----~Ge----~~tfRKffkF~v~-~P 108 (233) T pfam06159 82 VCSVSYTEA-----SGE----TKYFRKFFKFIVK-NP 108 (233) T ss_pred EEEEEEECC-----CCC----CCCCEEEEEEECC-CC T ss_conf 999999889-----986----2300003558555-64 No 70 >PRK00881 purH bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase; Provisional Probab=42.80 E-value=18 Score=17.30 Aligned_cols=92 Identities=17% Similarity=0.186 Sum_probs=43.8 Q ss_pred CCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEECC-------------CCHHHHHHHHHHHHHCCCCCCCCCEEEEE Q ss_conf 44016764155788765546677500114444557579-------------83358999999999819996217179883 Q gi|254780757|r 56 LSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVIIKP-------------NPLNLQNLYIESLKAVGIDPLIHDVRFVE 122 (307) Q Consensus 56 ~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVilKP-------------sp~niq~lYl~SL~~iGid~~~hDIrFve 122 (307) +.+.-...-+-.+-.+=||||||.----.|....-.+| |-.|+-|+. .++-+--.-.+--.|= T Consensus 198 ~~~p~~~~~~~~~~~~LRYGENPHQ~aa~Y~~~~~~~~~~~~~~ql~GKeLSYNN~lD~d----aA~~lv~ef~~pa~vI 273 (514) T PRK00881 198 EEFPETLNLSFEKKQDLRYGENPHQKAAFYRDKDGPAGGVATAKQLQGKELSYNNIADAD----AALECVKEFDEPACVI 273 (514) T ss_pred CCCCCEEEECCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCHHHCCCCCCCCCCHHHHH----HHHHHHHHCCCCEEEE T ss_conf 458621787142034542688814131430577767766421110058766744476689----9999998366768999 Q ss_pred CCCCCCHHHHCCCCEEEEECCEEEHHHHHHHHHCCCCCCCCH Q ss_conf 288870011012321768878741477789986682577514 Q gi|254780757|r 123 DNWESPTLGAWGLGWECWCDGMEISQFTYFQQVCGIECSPIS 164 (307) Q Consensus 123 DnWEsPtLGAwGlGWEvwldGMEItQFTYFQQvGGi~c~pv~ 164 (307) =.--||.--|-| -++.+=|+.. ++|+|+| T Consensus 274 iKH~nPCGvA~~-----------~~~~~Ay~~A--~~~Dp~S 302 (514) T PRK00881 274 VKHANPCGVAVG-----------DTILEAYDKA--YACDPVS 302 (514) T ss_pred EECCCCCCHHCC-----------CCHHHHHHHH--HHCCCCC T ss_conf 951687402046-----------5799999998--7049745 No 71 >PRK07106 5-aminoimidazole-4-carboxamide ribonucleotide transformylase; Provisional Probab=42.01 E-value=9 Score=19.54 Aligned_cols=18 Identities=6% Similarity=0.303 Sum_probs=9.8 Q ss_pred CCCCHHHHHHHHHHHHHH Q ss_conf 875017889999999999 Q gi|254780757|r 273 RGVISTTERQRYILRIRS 290 (307) Q Consensus 273 RGaISVTERq~YI~RIR~ 290 (307) -..++-.||..|+.+.+. T Consensus 315 ~~~l~~~e~~~~~~~~~g 332 (391) T PRK07106 315 PEPLTREEKRAWLDTLKG 332 (391) T ss_pred CCCCHHHHHHHHHCCCCC T ss_conf 530007777766502588 No 72 >KOG2783 consensus Probab=41.94 E-value=13 Score=18.47 Aligned_cols=77 Identities=19% Similarity=0.220 Sum_probs=38.4 Q ss_pred EHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCCC--CCCCCCCHHHHHHHHHHHHHHHHHCCHHHH Q ss_conf 147778998668257751456787899999998699821133206888--788311062447469977763501798999 Q gi|254780757|r 146 ISQFTYFQQVCGIECSPISGEITYGLERLAMYVQNVNSVYDIVFNAIE--GQNVLYGDIFAQSEQEYSRYNFEYANPEIL 223 (307) Q Consensus 146 ItQFTYFQQvGGi~c~pv~~EiTYGLERiaMylQ~vdnvyDl~wn~~~--~~~vtYGdif~q~E~E~S~YNFE~Ad~~~L 223 (307) |.+=---++.|-+ +-|-.-.--|||||||-|-++-.+- +-|+.-+ .+...=|+|-- .=+-+|+|-=-+-|+.+. T Consensus 280 vi~~~il~~ag~~--~~igwafglgLerLAMll~~IpDiR-lfWs~DeRFlkqF~~g~I~~-~FKp~SkYP~c~~DiSFW 355 (436) T KOG2783 280 VMRQRLLKRAGLN--NYIGWAFGLGLERLAMLLFDIPDIR-LFWSFDERFLKQFSPGKIEP-KFKPYSKYPPCYKDISFW 355 (436) T ss_pred HHHHHHHHHCCCC--CEEEEEEECCHHHHHHHHHCCCCHH-EEECCCHHHHHHCCCCCCCC-CCCCCCCCCCCCCCEEEE T ss_conf 5677887520411--0034565046788888873676221-01122267787318554664-656545589752441686 Q ss_pred HHH Q ss_conf 999 Q gi|254780757|r 224 HNH 226 (307) Q Consensus 224 ~~~ 226 (307) +.. T Consensus 356 l~~ 358 (436) T KOG2783 356 LPQ 358 (436) T ss_pred ECC T ss_conf 066 No 73 >PRK00488 pheS phenylalanyl-tRNA synthetase subunit alpha; Validated Probab=41.12 E-value=7.1 Score=20.28 Aligned_cols=168 Identities=20% Similarity=0.240 Sum_probs=86.8 Q ss_pred HHHHHHHHHHHHHHCCCEEEECCCCCCCCC--------CCCHH------------HH---------HHHC--CCCCCEEE Q ss_conf 999999999999866958961167543332--------24857------------89---------8744--83440167 Q gi|254780757|r 13 FQNIILTLTQYWAQQGCTILQPYDMEVGAG--------TFHPS------------TT---------LRAL--GPLSWKAA 61 (307) Q Consensus 13 fq~ii~~L~~fW~~~GC~i~qpyd~e~GAg--------T~hp~------------T~---------lr~l--g~~pw~~a 61 (307) ...+|..+.+++..-|..+...-.+|-.-= --||| +. .|++ ++.|.++. T Consensus 109 it~~~~ei~~iF~~~Gf~v~~GpeiE~~~~NFdaLn~P~~HPAR~~~DTFy~~~~~lLRTHTS~~qir~m~~~~pp~~~i 188 (338) T PRK00488 109 LTRTIERIEDIFVGLGFEVAEGPEIEDDYYNFEALNIPKDHPARDMQDTFYIDDRLLLRTHTSPVQIRTMEKQKPPIRII 188 (338) T ss_pred HHHHHHHHHHHHHHCCCEEECCCCEEEHHHHHHHHCCCCCCCCCCCCCEEEECCCCEECCCCCHHHHHHHHHCCCCEEEE T ss_conf 99999999999998697696188310045468875699788543646628976871343568788899997269986999 Q ss_pred EECCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCHHHHHH---HHHHHHHC-CCCCCCCCEEEE-------ECCCCCC-- Q ss_conf 6415578876554667750011444455757983358999---99999981-999621717988-------3288870-- Q gi|254780757|r 62 YVQPSRRPLDGRYAENPNRLQHYYQFQVIIKPNPLNLQNL---YIESLKAV-GIDPLIHDVRFV-------EDNWESP-- 128 (307) Q Consensus 62 YvqPsrRP~DgRYGeNPNRlq~y~QfQVilKPsp~niq~l---Yl~SL~~i-Gid~~~hDIrFv-------eDnWEsP-- 128 (307) ..--+.|..+-. +...-.+||.--++--.--++.+| --.=++.+ | .+-.+||. |--.|-- T Consensus 189 ~~GrvyR~d~~D----atH~~~FhQ~Egl~vd~~it~~~Lk~~l~~f~~~~fg---~~~~~R~rpsyFPFtEPs~Evdi~ 261 (338) T PRK00488 189 APGRVYRNDSDD----ATHSPMFHQVEGLVVDKNISFADLKGTLEDFLKAFFG---EDLKIRFRPSYFPFTEPSAEVDVS 261 (338) T ss_pred ECCCCCCCCCCC----CCCCEEEEEEEEEEEECCCCHHHHHHHHHHHHHHHCC---CCCEEEECCCCCCCCCCCEEEEEE T ss_conf 526631589998----7535046640058971788799999999999999728---766477547889999985347887 Q ss_pred --HHH------HCCCCEEEEE-CCEEEHHHHHHHHHCCCCCCCCHH-HHHHHHHHHHHHHCCCCCEEEEEEC Q ss_conf --011------0123217688-787414777899866825775145-6787899999998699821133206 Q gi|254780757|r 129 --TLG------AWGLGWECWC-DGMEISQFTYFQQVCGIECSPISG-EITYGLERLAMYVQNVNSVYDIVFN 190 (307) Q Consensus 129 --tLG------AwGlGWEvwl-dGMEItQFTYFQQvGGi~c~pv~~-EiTYGLERiaMylQ~vdnvyDl~wn 190 (307) .-+ .-+-||==++ -||=--+ .+ ..+|+|-+-.+| -.--|+|||||-.-+++++-++-=| T Consensus 262 ~~~~~g~gc~~~~~~~WlEi~G~Gmv~p~--Vl-~~~gid~~~~~G~AfG~GieR~aMl~~gi~DiR~~~~~ 330 (338) T PRK00488 262 CFNCKGKGCRVCKNTGWLEVLGCGMVHPN--VL-RNVGIDPEEYSGFAFGMGIERLAMLRYGINDLRDFFEN 330 (338) T ss_pred ECCCCCCCCCCCCCCCEEEECCCCCCCHH--HH-HHCCCCCCCCEEEEEEECHHHHHHHHCCCCHHHHHHHC T ss_conf 31457766544789963897376667889--99-86699955275998751588999998099668988736 No 74 >PRK05184 pyrroloquinoline quinone biosynthesis protein PqqB; Provisional Probab=40.77 E-value=28 Score=15.98 Aligned_cols=14 Identities=21% Similarity=0.342 Sum_probs=9.4 Q ss_pred EEECCCCHHHHHHH Q ss_conf 57579833589999 Q gi|254780757|r 89 VIIKPNPLNLQNLY 102 (307) Q Consensus 89 VilKPsp~niq~lY 102 (307) +++--||+=-+.|. T Consensus 52 ~LiDasPDlr~QL~ 65 (302) T PRK05184 52 VLVNASPDIRQQIR 65 (302) T ss_pred EEEECCCCHHHHHH T ss_conf 99989938999986 No 75 >smart00798 AICARFT_IMPCHas AICARFT/IMPCHase bienzyme. This is a family of bifunctional enzymes catalysing the last two steps in de novo purine biosynthesis. The bifunctional enzyme is found in both prokaryotes and eukaryotes. The second last step is catalysed by 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase (AICARFT), this enzyme catalyses the formylation of AICAR with 10-formyl-tetrahydrofolate to yield FAICAR and tetrahydrofolate. The last step is catalysed by IMP (Inosine monophosphate) cyclohydrolase (IMPCHase), cyclizing FAICAR (5-formylaminoimidazole-4-carboxamide ribonucleotide) to IMP. Probab=40.41 E-value=20 Score=17.05 Aligned_cols=84 Identities=20% Similarity=0.182 Sum_probs=43.2 Q ss_pred CCCCCCCCCCCCCCCCCCCEEEE----------E-EEEECC-CCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHH Q ss_conf 15578876554667750011444----------4-557579-83358999999999819996217179883288870011 Q gi|254780757|r 64 QPSRRPLDGRYAENPNRLQHYYQ----------F-QVIIKP-NPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLG 131 (307) Q Consensus 64 qPsrRP~DgRYGeNPNRlq~y~Q----------f-QVilKP-sp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLG 131 (307) -+-.|-.+=||||||.----.|. | |+==|+ |-.|+.|+.-.- +.+. ++. +.-.|==.--||--- T Consensus 71 ~~~~~~~~LRYGENPHQ~Aa~Y~~~~~~~~~~~~~ql~GKeLSYNN~lD~daA~-~lv~-ef~--~pa~~IiKH~nPCGv 146 (311) T smart00798 71 LSFEKKQDLRYGENPHQKAAFYTDPDALGGIATAKQLQGKELSYNNILDADAAL-ELVK-EFD--EPACVIVKHANPCGV 146 (311) T ss_pred ECCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCEEECCCCCCCCCHHHHHHHH-HHHH-CCC--CCEEEEEECCCCCCC T ss_conf 734303477578882213143056886666544356058877855677689999-9986-268--874899824887641 Q ss_pred HCCCCEEEEECCEEEHHHHHHHHHCCCCCCCCH Q ss_conf 012321768878741477789986682577514 Q gi|254780757|r 132 AWGLGWECWCDGMEISQFTYFQQVCGIECSPIS 164 (307) Q Consensus 132 AwGlGWEvwldGMEItQFTYFQQvGGi~c~pv~ 164 (307) |.| -++..-|... ++|+|+| T Consensus 147 A~~-----------~~~~~A~~~A--~~~Dp~S 166 (311) T smart00798 147 AVG-----------DTLAEAYRKA--YAADPVS 166 (311) T ss_pred CCC-----------CHHHHHHHHH--HHCCCCC T ss_conf 245-----------0699999998--7029854 No 76 >TIGR01326 OAH_OAS_sulfhy O-acetylhomoserine aminocarboxypropyltransferase/cysteine synthase; InterPro: IPR006235 This family of sequences is a distinct clade of the Cys/Met metabolism pyridoxal phosphate-dependent enzyme superfamily. Members include examples of OAH/OAS sulfhydrylase, an enzyme with activity both as O-acetylhomoserine sulfhydrylase (OAH, 2.5.1.49 from EC) and O-acetylserine sulfhydrylase (OAS, 2.5.1.47 from EC). An alternate name for OAH sulfhydrylase is homocysteine synthase.; GO: 0016765 transferase activity transferring alkyl or aryl (other than methyl) groups, 0006520 amino acid metabolic process. Probab=40.27 E-value=24 Score=16.49 Aligned_cols=66 Identities=14% Similarity=0.133 Sum_probs=31.2 Q ss_pred HHHHHHHCCCCCEEEEEECCCCCC-CCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 999999869982113320688878-831106244746997776350179899999999999999999 Q gi|254780757|r 172 ERLAMYVQNVNSVYDIVFNAIEGQ-NVLYGDIFAQSEQEYSRYNFEYANPEILHNHFIDSEKECLDL 237 (307) Q Consensus 172 ERiaMylQ~vdnvyDl~wn~~~~~-~vtYGdif~q~E~E~S~YNFE~Ad~~~L~~~F~~yE~Ea~rL 237 (307) .++|-||++-+.|--..|-+.++. .=.--.=|+.+-.-=|...||..+=.---+-|=+.-+=|.+| T Consensus 311 ~kVA~fL~~Hp~V~wV~YPGL~s~~~h~lAkkYl~~G~~G~vLsFe~kGG~ea~~kfidalkL~s~l 377 (434) T TIGR01326 311 LKVAEFLEAHPKVAWVNYPGLASHPHHALAKKYLPKGLFGAVLSFEIKGGREAGKKFIDALKLASHL 377 (434) T ss_pred HHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHH T ss_conf 9999997278844513488878875378899874179426787413218288999998776778776 No 77 >TIGR02737 caa3_CtaG cytochrome c oxidase assembly factor CtaG; InterPro: IPR014108 Members of this family are the CtaG protein required for assembly of active cytochrome c oxidase of the caa3 type, as found in Bacillus subtilis.. Probab=39.81 E-value=8.1 Score=19.88 Aligned_cols=20 Identities=40% Similarity=0.434 Sum_probs=16.2 Q ss_pred HHHHHCCCCCCCCHHHHHHHH Q ss_conf 899866825775145678789 Q gi|254780757|r 151 YFQQVCGIECSPISGEITYGL 171 (307) Q Consensus 151 YFQQvGGi~c~pv~~EiTYGL 171 (307) +=||.||+=-| |-=|||||. T Consensus 254 ~DQQLGGvlMK-I~QEIvYG~ 273 (286) T TIGR02737 254 EDQQLGGVLMK-IIQEIVYGT 273 (286) T ss_pred HHHHHHHHHHH-HHHHHHHHH T ss_conf 34678889999-999999999 No 78 >pfam09621 LcrR Type III secretion system regulator (LcrR). This family of proteins are encoded within type III secretion operons and have been characterized in Yersinia as a regulator of the Low-Calcium Response (LCR). Probab=39.80 E-value=18 Score=17.44 Aligned_cols=35 Identities=31% Similarity=0.509 Sum_probs=24.2 Q ss_pred HHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCEEEEECCEEEH Q ss_conf 999998199962171798832888700110123217688787414 Q gi|254780757|r 103 IESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGWECWCDGMEIS 147 (307) Q Consensus 103 l~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGWEvwldGMEIt 147 (307) +.++++-|+...-|- |.+ -++-|||-+|++|||+. T Consensus 6 ~pw~~~~G~~v~P~~-------~~~---s~I~LG~~~~~~g~ela 40 (139) T pfam09621 6 IPWFEARGLEVTPHF-------LQK---SPIQLGWRFEYDGMELA 40 (139) T ss_pred HHHHHHCCEEEECEE-------ECC---CCCCCCCEEEECCEEEE T ss_conf 267986494761210-------058---84004317998587999 No 79 >COG0647 NagD Predicted sugar phosphatases of the HAD superfamily [Carbohydrate transport and metabolism] Probab=38.47 E-value=31 Score=15.62 Aligned_cols=54 Identities=31% Similarity=0.460 Sum_probs=44.5 Q ss_pred EEEECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCEEEEECCEEEH Q ss_conf 557579833589999999998199962171798832888700110123217688787414 Q gi|254780757|r 88 QVIIKPNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGWECWCDGMEIS 147 (307) Q Consensus 88 QVilKPsp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGWEvwldGMEIt 147 (307) .++=||+|. +|-.-|+.+|.+.. ++-+|-|+-++--+||--.||..+|=---|+ T Consensus 186 ~~~GKP~~~----i~~~al~~~~~~~~--~~~mVGD~~~TDI~~a~~~G~~t~LV~TGv~ 239 (269) T COG0647 186 TVIGKPSPA----IYEAALEKLGLDRS--EVLMVGDRLDTDILGAKAAGLDTLLVLTGVS 239 (269) T ss_pred CCCCCCCHH----HHHHHHHHHCCCCC--CEEEECCCCHHHHHHHHHCCCCEEEECCCCC T ss_conf 433798899----99999998179832--0899858824569999985997799822678 No 80 >pfam09139 Mmp37 Mitochondrial matrix Mmp37. MMp37 is a mitochondrial matrix protein that functions in the translocation of proteins across the mitochondrial inner membrane. Probab=37.92 E-value=21 Score=16.84 Aligned_cols=34 Identities=38% Similarity=0.535 Sum_probs=26.8 Q ss_pred CCCCC---CCCCCCCCEEEEEEEEECCCCHHHHHHHHHHHHHC Q ss_conf 76554---66775001144445575798335899999999981 Q gi|254780757|r 70 LDGRY---AENPNRLQHYYQFQVIIKPNPLNLQNLYIESLKAV 109 (307) Q Consensus 70 ~DgRY---GeNPNRlq~y~QfQVilKPsp~niq~lYl~SL~~i 109 (307) .|=|. |||||+.+. |.+|.-+++++||..-|+.+ T Consensus 181 GD~RM~i~gEn~~KV~N------IV~~q~~~Fr~LY~p~l~~l 217 (324) T pfam09139 181 GDFRMKLGGENPNKVNN------IVKGNFANFRRLYKPILEDL 217 (324) T ss_pred CCCCEECCCCCHHHHHH------HHCCHHHHHHHHHHHHHHHH T ss_conf 97640107888789998------85111899999999999863 No 81 >PRK12295 hisZ ATP phosphoribosyltransferase regulatory subunit; Provisional Probab=37.64 E-value=32 Score=15.53 Aligned_cols=74 Identities=19% Similarity=0.196 Sum_probs=47.9 Q ss_pred CCEEEECCCCCCCCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEEC--CCC----HHHHHH Q ss_conf 95896116754333224857898744834401676415578876554667750011444455757--983----358999 Q gi|254780757|r 28 GCTILQPYDMEVGAGTFHPSTTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVIIK--PNP----LNLQNL 101 (307) Q Consensus 28 GC~i~qpyd~e~GAgT~hp~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVilK--Psp----~niq~l 101 (307) |=.+.--+|+-+..+-++ ...-.+.|-+.+|..|+.|...||+ -.++|.-|=+= +++ ..+..+ T Consensus 57 g~~l~LRpDlT~piaR~~----~~~~~~~p~R~~Y~G~VfR~q~gr~-------rEf~Q~GvEiiG~~~~~~aDaEvi~l 125 (373) T PRK12295 57 GEELCLRPDFTIPVCRRH----LASNAGEPARYSYLGEVFRQRRDRA-------SEFLQAGIESFGRADPAAADAEVLAL 125 (373) T ss_pred CCEEEEECCCCHHHHHHH----HHHCCCCCEEEEEECCEEECCCCCC-------CCEEEEEEEEECCCCCCCCCHHHHHH T ss_conf 898998178889999999----9838999800787733224579998-------85167015785789821064999999 Q ss_pred HHHHHHHCCCC Q ss_conf 99999981999 Q gi|254780757|r 102 YIESLKAVGID 112 (307) Q Consensus 102 Yl~SL~~iGid 112 (307) -.++|+++|+. T Consensus 126 a~~~l~~lgl~ 136 (373) T PRK12295 126 ALEALAALGPA 136 (373) T ss_pred HHHHHHHCCCC T ss_conf 99999975997 No 82 >TIGR02252 DREG-2 REG-2-like, HAD superfamily (subfamily IA) hydrolase; InterPro: IPR011949 This family of proteins includes uncharacterised sequences from eukaryotes, cyanobacteria and Leptospira as well as the DREG-2 protein from Drosophila melanogaster which has been identified as a rhythmically (diurnally) regulated gene . This family is a member of the Haloacid Dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases. The superfamily is defined by the presence of three short catalytic motifs . The subfamilies are defined based on the location and the observed or predicted fold of a so-called, capping domain, , or the absence of such a domain. This family is a member of subfamily 1A in which the cap domain consists of a predicted alpha helical bundle found in between the first and second catalytic motifs. A distinctive feature of this family is a conserved tandem pair of tryptophan residues in the cap domain. The most divergent sequences included within the scope of this entry are from plants and have "FW" at this position instead. Most likely, these sequences, like the vast majority of HAD sequences, represent phosphatase enzymes.. Probab=37.55 E-value=32 Score=15.52 Aligned_cols=59 Identities=24% Similarity=0.452 Sum_probs=45.5 Q ss_pred CCC-CEEEEEEEE------ECCCCHHHHHHHHHHHHHCC----CCCCCCCEEEEECCCCCCHHHHCCCCEEEEE Q ss_conf 500-114444557------57983358999999999819----9962171798832888700110123217688 Q gi|254780757|r 79 NRL-QHYYQFQVI------IKPNPLNLQNLYIESLKAVG----IDPLIHDVRFVEDNWESPTLGAWGLGWECWC 141 (307) Q Consensus 79 NRl-q~y~QfQVi------lKPsp~niq~lYl~SL~~iG----id~~~hDIrFveDnWEsPtLGAwGlGWEvwl 141 (307) -.| -+|+.|-++ -||+|. ++..-|+.+| -+..-+.+-.|=|+++.--.||-.+||.-|| T Consensus 155 ~GL~~~~fd~~~~S~e~g~eKPDp~----IF~~Al~~~~~~~~~~~~p~~~~HiGD~~~~D~~gA~~~G~~~~L 224 (224) T TIGR02252 155 LGLFLEYFDFVVTSYEVGAEKPDPK----IFQKALERAGKKLTKEISPEEALHIGDSLRNDYEGARAAGWRALL 224 (224) T ss_pred CCCCHHHHHHEEEHHHHCCCCCCHH----HHHHHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHHCCCCCCC T ss_conf 0824354404301466257898867----899999998887503577233776268803333657873866569 No 83 >TIGR00113 queA S-adenosylmethionine:tRNA ribosyltransferase-isomerase; InterPro: IPR003699 Queuosine is a hypermodified nucleoside that usually occurs in the first position of the anticodon of tRNAs specifying the amino acids asparagine, aspartate, histidine, and tyrosine. The hypermodified nucleoside is found in bacteria and eukaryotes . Queuosine is synthesized de novo exclusively in bacteria; for eukaryotes the compound is a nutrient factor. Queuosine biosynthesis protein, or S-adenosylmethionine:tRNA -ribosyltransferase-isomerase, is required for the synthesis of the queuosine precursor (oQ). ; GO: 0003824 catalytic activity, 0008616 queuosine biosynthetic process. Probab=37.37 E-value=11 Score=19.03 Aligned_cols=11 Identities=64% Similarity=1.207 Sum_probs=5.2 Q ss_pred CCCCCCCCCCH Q ss_conf 75433322485 Q gi|254780757|r 36 DMEVGAGTFHP 46 (307) Q Consensus 36 d~e~GAgT~hp 46 (307) .+.||||||.| T Consensus 222 TLHVGaGTF~p 232 (364) T TIGR00113 222 TLHVGAGTFRP 232 (364) T ss_pred EEEEECCCCCC T ss_conf 64530544750 No 84 >PRK12293 hisZ ATP phosphoribosyltransferase regulatory subunit; Provisional Probab=35.26 E-value=35 Score=15.27 Aligned_cols=50 Identities=28% Similarity=0.417 Sum_probs=40.2 Q ss_pred CCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEE--EECCCCHHHHHHHHHHHHHCCCCC Q ss_conf 834401676415578876554667750011444455--757983358999999999819996 Q gi|254780757|r 54 GPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQV--IIKPNPLNLQNLYIESLKAVGIDP 113 (307) Q Consensus 54 g~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQV--ilKPsp~niq~lYl~SL~~iGid~ 113 (307) .+.|.+..|..|+.| | ++| ++||+-| |=.+||..+-.+-.++|+++|++. T Consensus 92 ~~~p~rl~Y~g~vFR-----y---e~r--Ef~Q~GvEliG~~s~~Evi~la~~~l~~lgl~~ 143 (281) T PRK12293 92 STEHKKWFYIQPVFR-----Y---PTT--EIYQIGAEIIGESDLSKVLNIAAEIFNELEIEP 143 (281) T ss_pred CCCCEEEEEECCEEE-----C---CCC--CCEEECEEEECCCCHHHHHHHHHHHHHHCCCCC T ss_conf 689757998756353-----4---677--746847488779999999999999999769962 No 85 >TIGR01458 HAD-SF-IIA-hyp3 HAD-superfamily subfamily IIA hydrolase, TIGR01458; InterPro: IPR006355 These sequences are all members of the IIA subfamily of the haloacid dehalogenase superfamily of aspartate-nucleophile hydrolases. . Probab=34.50 E-value=36 Score=15.19 Aligned_cols=46 Identities=26% Similarity=0.492 Sum_probs=38.2 Q ss_pred EEEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCC Q ss_conf 4445575798335899999999981999621717988328887001101232 Q gi|254780757|r 85 YQFQVIIKPNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLG 136 (307) Q Consensus 85 ~QfQVilKPsp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlG 136 (307) ..-.||=||||. -+.+.|++||+.| |.|-+|=||--+-.=||=-.| T Consensus 173 ~ka~vvGKPs~~----fF~~al~a~G~ep--eeaVMiGDD~~~DVGGAQ~cG 218 (258) T TIGR01458 173 IKAEVVGKPSKE----FFKEALRALGVEP--EEAVMIGDDLVDDVGGAQACG 218 (258) T ss_pred CEEEEECCCCHH----HHHHHHHHCCCCC--CEEEEECCCCCCCCCCHHHHC T ss_conf 778984188858----9988866258881--026886473454664032014 No 86 >COG3259 FrhA Coenzyme F420-reducing hydrogenase, alpha subunit [Energy production and conversion] Probab=34.34 E-value=32 Score=15.56 Aligned_cols=107 Identities=21% Similarity=0.315 Sum_probs=56.4 Q ss_pred HHHHHHHHCC---C-CCCCCHHHHHHHHHHHHHHHCCCCCEEEE----EECCCCCCCCCC-----CHHHHHHHHH----H Q ss_conf 7778998668---2-57751456787899999998699821133----206888788311-----0624474699----7 Q gi|254780757|r 148 QFTYFQQVCG---I-ECSPISGEITYGLERLAMYVQNVNSVYDI----VFNAIEGQNVLY-----GDIFAQSEQE----Y 210 (307) Q Consensus 148 QFTYFQQvGG---i-~c~pv~~EiTYGLERiaMylQ~vdnvyDl----~wn~~~~~~vtY-----Gdif~q~E~E----~ 210 (307) -|.||+-.|| + .---...||-|-+||+..-|-+.|.--+. .-+ .+.|+.. |-++|--+.+ - T Consensus 292 ~~e~~~~~g~~~~~~~h~ARaiEi~~~~e~a~~lL~~ld~~~k~r~~~e~~--~geGvg~vEAPRGtlvH~~~vdenG~I 369 (441) T COG3259 292 YEENFEELGGTHSFAYHLARAIEILYALERAIELLDELDITGKVRADVEPK--AGEGVGVVEAPRGTLVHHYRVDENGRI 369 (441) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC--CCCEEEEEECCCCEEEEEEEECCCCEE T ss_conf 997687517763799999999999999999999862323478754436677--773147786476148999997489807 Q ss_pred --------HHHHH---HHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHH-HHHH Q ss_conf --------77635---0179899999999999999999850066532335530138289999999-9987 Q gi|254780757|r 211 --------SRYNF---EYANPEILHNHFIDSEKECLDLLKKGIPNEHHRHHLCVFPAYDQCIKAS-HIFN 268 (307) Q Consensus 211 --------S~YNF---E~Ad~~~L~~~F~~yE~Ea~rLle~~LP~~~~~~~~lvlPAYD~~LKaS-H~FN 268 (307) +.+|+ |.+...+-.+.++.-...-..++| +|+-|||-|+-|| |+-+ T Consensus 370 ~~~niIvaT~~N~~ame~~v~~~a~~~~~~~~~~~~~~~E------------~viRAyDPC~sCatHvv~ 427 (441) T COG3259 370 KKANIIVATTFNVPAMEKAVRGVAERYIETVPEGLLNLVE------------MVIRAYDPCLSCATHVVG 427 (441) T ss_pred EEEEEEEECCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHH------------HHHHHCCCHHHHHHHHHH T ss_conf 7787884031456778899999999997302277777888------------777641726888998861 No 87 >COG0124 HisS Histidyl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=33.68 E-value=37 Score=15.10 Aligned_cols=58 Identities=31% Similarity=0.372 Sum_probs=41.4 Q ss_pred HCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEE--EECCCCH---HHHHHHHHHHHHCCCCC Q ss_conf 44834401676415578876554667750011444455--7579833---58999999999819996 Q gi|254780757|r 52 ALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQV--IIKPNPL---NLQNLYIESLKAVGIDP 113 (307) Q Consensus 52 ~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQV--ilKPsp~---niq~lYl~SL~~iGid~ 113 (307) .+.|.|.+..|..|+.|.-.- -+.|.-++|||-| |=.++|. .+..|--++|+++||+. T Consensus 96 ~~~~~p~k~yy~g~vfRyErP----Q~GR~RqF~Q~g~E~iG~~~~~~DAEvi~l~~~~l~~lGi~~ 158 (429) T COG0124 96 LDLPKPLKLYYFGPVFRYERP----QKGRYRQFYQFGVEVIGSDSPDADAEVIALAVEILEALGIGG 158 (429) T ss_pred CCCCCCEEEEEECCEECCCCC----CCCCCEEEEECCEEEECCCCCCCCHHHHHHHHHHHHHCCCCC T ss_conf 203687259983356237888----888750368767677679986538999999999999749986 No 88 >COG3343 RpoE DNA-directed RNA polymerase, delta subunit [Transcription] Probab=32.59 E-value=30 Score=15.80 Aligned_cols=48 Identities=19% Similarity=0.359 Sum_probs=26.5 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCC------CCCHHHHHHHCCCCCCEE Q ss_conf 33567777999999999999866958961167543332------248578987448344016 Q gi|254780757|r 5 HNKKNDLSFQNIILTLTQYWAQQGCTILQPYDMEVGAG------TFHPSTTLRALGPLSWKA 60 (307) Q Consensus 5 ~~~~~~~~fq~ii~~L~~fW~~~GC~i~qpyd~e~GAg------T~hp~T~lr~lg~~pw~~ 60 (307) ..++.+|+|++||...|+|-... +-++... ++|--+-|-+||...|.. T Consensus 25 e~~~~~~~F~dii~EI~~~~~~s--------~~ei~~~i~~FYTdln~DgrFi~LGdn~WgL 78 (175) T COG3343 25 EEKKKPFNFSDIINEIQKLLGVS--------KEEIRSRIGQFYTDLNIDGRFISLGDNKWGL 78 (175) T ss_pred HHCCCCCCHHHHHHHHHHHHCCC--------HHHHHHHHHHHHHHHCCCCCEEECCCCCCCH T ss_conf 97089865999999999983857--------8889999999999860477634645653121 No 89 >TIGR00644 recJ single-stranded-DNA-specific exonuclease RecJ; InterPro: IPR004610 All proteins in this family are 5'-3' single-strand DNA exonucleases. These proteins are used in some aspects of mismatch repair, recombination, and recombinational repair. RecJ is a single-stranded DNA-specific endonuclease which is required for many types of recombination events, although the stringency of the requirement for RecJ appears to vary with the type of recombinational event monitored, and the other recombination gene products which are available.; GO: 0008409 5'-3' exonuclease activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=31.74 E-value=40 Score=14.88 Aligned_cols=91 Identities=27% Similarity=0.353 Sum_probs=66.5 Q ss_pred HCCCEEEECCCCCCCCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEE--EEEECCCCHHHHHHHH Q ss_conf 66958961167543332248578987448344016764155788765546677500114444--5575798335899999 Q gi|254780757|r 26 QQGCTILQPYDMEVGAGTFHPSTTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQF--QVIIKPNPLNLQNLYI 103 (307) Q Consensus 26 ~~GC~i~qpyd~e~GAgT~hp~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~Qf--QVilKPsp~niq~lYl 103 (307) +.--+|.==||.-==++|-==.-||+.||| -+++-|.=|= |=+|| ||.||+=+....+- |-||-.-=.-..--=+ T Consensus 55 ~eki~i~GDyDvDGiTSt~~l~~~L~~lGp-~~~~~~~IPn-R~~eg-YG~~~~~~~~~~~~G~~LiiTVD~Gi~a~~e~ 131 (705) T TIGR00644 55 NEKILIFGDYDVDGITSTAILVEFLRELGP-YVQVDYYIPN-RFTEG-YGLSPEALRELKENGVSLIITVDNGISAHEEI 131 (705) T ss_pred CCCEEEEECCCCCHHHHHHHHHHHHHHCCC-CCEEEEEECC-CCCCC-CCCCHHHHHHHHHCCCEEEEECCCCHHHHHHH T ss_conf 893899961752268999999999995099-7324666266-35777-78898999999866983999826874269999 Q ss_pred HHHHHCCCCCCCCCEE Q ss_conf 9999819996217179 Q gi|254780757|r 104 ESLKAVGIDPLIHDVR 119 (307) Q Consensus 104 ~SL~~iGid~~~hDIr 119 (307) ++.+.+|+|.-.-|=- T Consensus 132 ~~a~~~G~dVIVtDHH 147 (705) T TIGR00644 132 EYAKELGIDVIVTDHH 147 (705) T ss_pred HHHHHCCCEEEEECCC T ss_conf 9998669819997256 No 90 >PRK13834 putative autoinducer synthesis protein; Provisional Probab=31.29 E-value=10 Score=19.12 Aligned_cols=16 Identities=50% Similarity=0.927 Sum_probs=9.9 Q ss_pred CCEEEEE-CCEEEHHHH Q ss_conf 3217688-787414777 Q gi|254780757|r 135 LGWECWC-DGMEISQFT 150 (307) Q Consensus 135 lGWEvwl-dGMEItQFT 150 (307) |||+|=+ ||||+-||- T Consensus 33 LgW~v~~~~g~E~D~fD 49 (207) T PRK13834 33 LGWDVEIIDGCESDQFD 49 (207) T ss_pred CCCCCCCCCCCCCCCCC T ss_conf 58687689998832689 No 91 >COG4326 Spo0M Sporulation control protein [General function prediction only] Probab=30.84 E-value=29 Score=15.82 Aligned_cols=56 Identities=30% Similarity=0.404 Sum_probs=32.1 Q ss_pred HCCCCCCCCCEEEEECC-CCCCHHHH-CCCCEEEEECCEEEHHHHHHHH----HCCCCCCCCHH Q ss_conf 81999621717988328-88700110-1232176887874147778998----66825775145 Q gi|254780757|r 108 AVGIDPLIHDVRFVEDN-WESPTLGA-WGLGWECWCDGMEISQFTYFQQ----VCGIECSPISG 165 (307) Q Consensus 108 ~iGid~~~hDIrFveDn-WEsPtLGA-wGlGWEvwldGMEItQFTYFQQ----vGGi~c~pv~~ 165 (307) +++|||.+|||-||+-. |-.-.|-| --||. -|+-.+--|-.|||- |--++-.|.+| T Consensus 138 ~~aidp~D~D~l~Vr~hP~m~~vl~AiE~lGf--rL~~vdCEqa~yF~~a~PFVQEfEFvPTtG 199 (270) T COG4326 138 ALAIDPTDKDILTVRPHPMMDGVLSAIEALGF--RLRQVDCEQAKYFGGALPFVQEFEFVPTTG 199 (270) T ss_pred HCCCCCCCCCEEEEECCHHHHHHHHHHHHHCC--EEEECCCCCCCCCCCCCCCEEEEEEECCCC T ss_conf 20679866654998366679889999985163--731013131543355564236788721587 No 92 >PRK13226 phosphoglycolate phosphatase; Provisional Probab=30.79 E-value=41 Score=14.77 Aligned_cols=69 Identities=20% Similarity=0.254 Sum_probs=34.4 Q ss_pred HHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEE--------CCCCHHHHHHHHHHHHHCCCCCCCCCE Q ss_conf 789874483440167641557887655466775001144445575--------798335899999999981999621717 Q gi|254780757|r 47 STTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVII--------KPNPLNLQNLYIESLKAVGIDPLIHDV 118 (307) Q Consensus 47 ~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVil--------KPsp~niq~lYl~SL~~iGid~~~hDI 118 (307) ..+|..|...-.+.+-|--+.|..--..-+. ..+.+|+ .+|+ ||.|+ .||.-++.+|++|.+ . T Consensus 93 ~e~L~~L~~~g~~~~VvTnk~~~~~~~~l~~-~gl~~~F--~~iv~~ddv~~~KP~Pd----~~l~a~~~lgv~p~e--~ 163 (221) T PRK13226 93 EMMLARLECAGCVWGIVTNKPEYLARLILPQ-LGWEQRC--AVLIGGDTLAERKPHPL----PLLTAAERIGVAPTQ--C 163 (221) T ss_pred HHHHHHHHHCCCEEEEECCCCHHHHHHHHHH-HCCCCCC--CEEECCCCCCCCCCCHH----HHHHHHHHHCCCHHH--E T ss_conf 9999999977990898559818889999998-3985223--30001444567788869----999999994989536--8 Q ss_pred EEEECC Q ss_conf 988328 Q gi|254780757|r 119 RFVEDN 124 (307) Q Consensus 119 rFveDn 124 (307) -+|+|. T Consensus 164 v~VGDs 169 (221) T PRK13226 164 VYVGDD 169 (221) T ss_pred EEECCC T ss_conf 888999 No 93 >pfam09454 Vps23_core Vps23 core domain. ESCRT complexes form the main machinery driving protein sorting from endosomes to lysosomes. The core domain of the Vps23 subunit of the heterotrimeric ESCRT-I complex is a helical hairpin sandwiched in a fan-like formation between two other helical hairpins from Vps28 (pfam03997) and Vps37. Vps23 gives ESCRT-I complex its stability. Probab=30.39 E-value=42 Score=14.72 Aligned_cols=20 Identities=35% Similarity=0.669 Sum_probs=17.2 Q ss_pred HCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 38750178899999999999999 Q gi|254780757|r 272 ARGVISTTERQRYILRIRSLAKA 294 (307) Q Consensus 272 ARGaISVTERq~YI~RIR~Lak~ 294 (307) -||.|++ ..|+..||.|||. T Consensus 36 ~~g~I~~---d~ylK~vR~LsRe 55 (65) T pfam09454 36 KRGKIDL---ETYLKHVRELARE 55 (65) T ss_pred HCCCCCH---HHHHHHHHHHHHH T ss_conf 8698679---9999999999999 No 94 >PRK00037 hisS histidyl-tRNA synthetase; Reviewed Probab=29.65 E-value=43 Score=14.64 Aligned_cols=60 Identities=27% Similarity=0.374 Sum_probs=40.1 Q ss_pred CCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEEE--CCCCH---HHHHHHHHHHHHCCCCCCCCC Q ss_conf 83440167641557887655466775001144445575--79833---589999999998199962171 Q gi|254780757|r 54 GPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVII--KPNPL---NLQNLYIESLKAVGIDPLIHD 117 (307) Q Consensus 54 g~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVil--KPsp~---niq~lYl~SL~~iGid~~~hD 117 (307) .|.|.+.+|++|+.|-....-| |.-.++|.-|=| -++|. .+..+-.++|+.+|+....-+ T Consensus 100 ~~~P~r~~y~g~vfR~e~p~~G----R~REf~Q~g~eiiG~~~~~aDaEvi~l~~~~l~~lGl~~~~i~ 164 (417) T PRK00037 100 LPKPFKLYYIGPMFRYERPQKG----RYRQFHQFGVEVIGSDSPAADAEVIALAADLLKALGLKGLVLL 164 (417) T ss_pred CCCCEEEEEECCEEECCCCCCC----CCCCEEECCHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 8876589987246604789887----6352456352774664178889999999999998399652001 No 95 >PRK13223 phosphoglycolate phosphatase; Provisional Probab=29.42 E-value=43 Score=14.61 Aligned_cols=105 Identities=13% Similarity=0.212 Sum_probs=57.4 Q ss_pred CCCCHHHHHHHHHHHHHHHCC----CCCEEEEEECCCC---CCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 775145678789999999869----9821133206888---788311062447469977763501798999999999999 Q gi|254780757|r 160 CSPISGEITYGLERLAMYVQN----VNSVYDIVFNAIE---GQNVLYGDIFAQSEQEYSRYNFEYANPEILHNHFIDSEK 232 (307) Q Consensus 160 c~pv~~EiTYGLERiaMylQ~----vdnvyDl~wn~~~---~~~vtYGdif~q~E~E~S~YNFE~Ad~~~L~~~F~~yE~ 232 (307) -||-+-=+..-+|++-.--.+ =||+.|+.=-..- .=+||||-=+.+ ....++.+..+.+|.+- T Consensus 156 ~KPdP~~ll~ale~lGv~P~e~l~VGDS~~DI~AAkaAGi~~VgVtyGYn~g~--------~i~~~~pd~v~d~~~el-- 225 (272) T PRK13223 156 KKPDPAALLFVMKMAGVPPSQSLFVGDSRSDVLAAKAAGVQCVALSYGYNHGR--------PIAEESPTLVIDDLRAL-- 225 (272) T ss_pred CCCCHHHHHHHHHHHCCCHHHEEEECCCHHHHHHHHHCCCCEEEECCCCCCCC--------CHHHCCCCEEECCHHHH-- T ss_conf 99998999999999598976888888988899999986995899747999986--------36674999897679987-- Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH Q ss_conf 9999985006653233553013828999999999878883875017889999999999999 Q gi|254780757|r 233 ECLDLLKKGIPNEHHRHHLCVFPAYDQCIKASHIFNLLDARGVISTTERQRYILRIRSLAK 293 (307) Q Consensus 233 Ea~rLle~~LP~~~~~~~~lvlPAYD~~LKaSH~FNLLDARGaISVTERq~YI~RIR~Lak 293 (307) ||-..++..-.++|- ++- -++|..|-|.-|.-.+.-|..||| T Consensus 226 ---------lp~~~~~~~~~~~~~----~~~------~~~~~~~~~~~~~~~~~~~~~~~~ 267 (272) T PRK13223 226 ---------LPGCADPAAEITLPD----LQP------SQSRESIVVVTRKLWMKVIKALAR 267 (272) T ss_pred ---------CCCCCCCCCCEECCC----CCC------CCCCCCEEEEEHHHHHHHHHHHHH T ss_conf ---------233378764414567----778------888985699846889999999987 No 96 >TIGR02254 YjjG/YfnB HAD superfamily (subfamily IA) hydrolase, TIGR02254; InterPro: IPR011951 This family consists of uncharacterised proteobacterial and Gram-positive bacterial sequences including YjjG from Escherichia coli and YfnB from Bacillus subtilis.. Probab=29.21 E-value=40 Score=14.84 Aligned_cols=48 Identities=25% Similarity=0.419 Sum_probs=37.2 Q ss_pred ECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCEE-EEECC Q ss_conf 579833589999999998199962171798832888700110123217-68878 Q gi|254780757|r 91 IKPNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGWE-CWCDG 143 (307) Q Consensus 91 lKPsp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGWE-vwldG 143 (307) -||.|. ++--.|+.+| .+++...-.|=|||+|--.||---|=+ ||+|- T Consensus 158 ~KP~~~----iF~~aLer~~-~~~k~~~L~vGD~~~aDi~Ga~naGl~~vw~n~ 206 (233) T TIGR02254 158 EKPDKK----IFDYALERMG-KLKKEEVLMVGDSLEADIKGARNAGLDTVWLNP 206 (233) T ss_pred CCCCHH----HHHHHHHHCC-CCCCCEEEEEECCCHHHHHHHHHHCEEEEEECC T ss_conf 588856----7789987178-887231678607804578766661400355177 No 97 >COG3772 Phage-related lysozyme (muraminidase) [General function prediction only] Probab=28.51 E-value=19 Score=17.21 Aligned_cols=21 Identities=14% Similarity=0.298 Sum_probs=11.5 Q ss_pred HHHHHHHHHHHHHHHHHHCCC Q ss_conf 999999999999999850066 Q gi|254780757|r 223 LHNHFIDSEKECLDLLKKGIP 243 (307) Q Consensus 223 L~~~F~~yE~Ea~rLle~~LP 243 (307) |.+.-..++....+.+.-.|| T Consensus 61 l~~d~~~~~~~v~~~~~~~l~ 81 (152) T COG3772 61 LERDLAEAERAVNRYIKVPLT 81 (152) T ss_pred HHHHHHHHHHHHHHHCCCCCC T ss_conf 887888878988874388987 No 98 >KOG2227 consensus Probab=28.28 E-value=45 Score=14.48 Aligned_cols=63 Identities=8% Similarity=0.145 Sum_probs=26.7 Q ss_pred HHHHCCCCEE-EEECCEEEHHH-HHHHHHCC-C--CCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCC Q ss_conf 0110123217-68878741477-78998668-2--5775145678789999999869982113320688 Q gi|254780757|r 129 TLGAWGLGWE-CWCDGMEISQF-TYFQQVCG-I--ECSPISGEITYGLERLAMYVQNVNSVYDIVFNAI 192 (307) Q Consensus 129 tLGAwGlGWE-vwldGMEItQF-TYFQQvGG-i--~c~pv~~EiTYGLERiaMylQ~vdnvyDl~wn~~ 192 (307) .++-|+-+-+ |.||-|+++-= --|+-..+ + +|.--..+ ---+|-+--+.+.....+=|+.+.+ T Consensus 198 ~~~~~~~~~~~v~inc~sl~~~~aiF~kI~~~~~q~~~s~~~~-~~~~~~~~~h~~q~k~~~llVlDEm 265 (529) T KOG2227 198 SLSKSSKSPVTVYINCTSLTEASAIFKKIFSSLLQDLVSPGTG-MQHLEKFEKHTKQSKFMLLLVLDEM 265 (529) T ss_pred HHHHHCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHCCCCHH-HHHHHHHHHHHHCCCCEEEEEECHH T ss_conf 4034316651699851235425889999988898874289504-7899999998752563389872125 No 99 >PRK13224 consensus Probab=28.07 E-value=45 Score=14.45 Aligned_cols=80 Identities=21% Similarity=0.289 Sum_probs=31.4 Q ss_pred HHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEE------ECCCCHHHHHHHHHHHHHCCCCCCCCCEEE Q ss_conf 78987448344016764155788765546677500114444557------579833589999999998199962171798 Q gi|254780757|r 47 STTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVI------IKPNPLNLQNLYIESLKAVGIDPLIHDVRF 120 (307) Q Consensus 47 ~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVi------lKPsp~niq~lYl~SL~~iGid~~~hDIrF 120 (307) .-+|+.|...-++.|-|--+.|..--+--++ -.|.+|+-+-|- -||.|+ .|+..++.+|.++ .-+ T Consensus 92 ~e~L~~L~~~g~~laivTn~~~~~~~~~L~~-~gl~~~Fd~iv~~d~v~~~KP~Pe----~~l~a~~~l~~~~----~l~ 162 (216) T PRK13224 92 EAALRHLKAEGWRLGLCTNKPVGPTRAILAH-FGLAELFDVVIGGDSLPQRKPDPA----PLRAAFAALGDEP----VLY 162 (216) T ss_pred HHHHHHHHHCCCCEEEEECCCHHHHHHHHHH-CCCHHHCCEEECCCCCCCCCCCHH----HHHHHHHHCCCCC----EEE T ss_conf 9999999977996687608934778889987-394432474654466899999999----9999997379999----899 Q ss_pred EECCCCCCHHHHCCCC Q ss_conf 8328887001101232 Q gi|254780757|r 121 VEDNWESPTLGAWGLG 136 (307) Q Consensus 121 veDnWEsPtLGAwGlG 136 (307) |+|. +.-..+|---| T Consensus 163 VGDs-~~D~~aA~~AG 177 (216) T PRK13224 163 VGDS-EVDAETAENAG 177 (216) T ss_pred ECCC-HHHHHHHHHHC T ss_conf 7598-87899999949 No 100 >TIGR02108 PQQ_syn_pqqB coenzyme PQQ biosynthesis protein B; InterPro: IPR011842 This entry describes coenzyme PQQ biosynthesis protein B, a gene required for the biosynthesis of pyrrolo-quinoline-quinone (coenzyme PQQ). PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases. Note that this gene appears to be required for PQQ in biosynthesis in Methylobacterium extorquens (under the name pqqG) and in Klebsiella pneumoniae but that the equivalent pqqV in Acinetobacter calcoaceticus is not necessary for heterologous expression of PQQ biosynthesis in Escherichia coli. Based on this latter finding, it is suggested that PqqB might be a transporter or a PQQ-dependent enzyme rather than a PQQ biosynthesis enzyme .; GO: 0006810 transport, 0018189 pyrroloquinoline quinone biosynthetic process. Probab=27.88 E-value=22 Score=16.67 Aligned_cols=23 Identities=48% Similarity=0.715 Sum_probs=14.9 Q ss_pred CCCCCCHHHHCCCCEEEEECCEEEH Q ss_conf 2888700110123217688787414 Q gi|254780757|r 123 DNWESPTLGAWGLGWECWCDGMEIS 147 (307) Q Consensus 123 DnWEsPtLGAwGlGWEvwldGMEIt 147 (307) |-=|-..|-| =||||=-|||||. T Consensus 291 ~S~Era~~~a--~Gw~VA~DGm~i~ 313 (314) T TIGR02108 291 DSPERAEVEA--AGWEVAHDGMEIE 313 (314) T ss_pred CCHHHHHHHH--CCCEECCCCCEEC T ss_conf 6744788864--6860215775642 No 101 >cd00770 SerRS_core Seryl-tRNA synthetase (SerRS) class II core catalytic domain. SerRS is responsible for the attachment of serine to the 3' OH group of ribose of the appropriate tRNA. This domain It is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its structure and the presence of three characteristic sequence motifs in the core domain. SerRS synthetase is a homodimer. Probab=27.77 E-value=46 Score=14.42 Aligned_cols=61 Identities=25% Similarity=0.200 Sum_probs=43.7 Q ss_pred HCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEE-----EEECCCCHHHHHHHHH-------HHHHCCCCCC Q ss_conf 4483440167641557887655466775001144445-----5757983358999999-------9998199962 Q gi|254780757|r 52 ALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQ-----VIIKPNPLNLQNLYIE-------SLKAVGIDPL 114 (307) Q Consensus 52 ~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQ-----VilKPsp~niq~lYl~-------SL~~iGid~~ 114 (307) +...-|.+.+=+-||.|.--|-+|....-|.+-+||. ++.+| +.+.+..-+ =|+.|||... T Consensus 126 ~~~dLPlr~~~~s~cfR~Eags~G~d~~GL~RvhqF~kvE~~~~c~p--e~s~~~~e~~~~~~e~~l~~L~lpyr 198 (297) T cd00770 126 EEEELPLKYAGYSPCFRKEAGSAGRDTRGLFRVHQFEKVEQFVFTKP--EESWEELEELISNAEEILQELGLPYR 198 (297) T ss_pred CHHHCCCCEEECCCHHHHCCCCCCCCCCCEEEEEECCEEEEEEECCH--HHHHHHHHHHHHHHHHHHHHHCCCEE T ss_conf 77558865022463413203777755554578872110037761687--88999999999999999998155014 No 102 >TIGR01380 glut_syn glutathione synthase; InterPro: IPR006284 These are the glutathione synthetases found in Gram-negative bacteria. This gene does not appear to be present in genomes of Gram-positive bacteria. Glutathione synthetase has an ATP-binding domain in the COOH terminus and catalyzes the second step in the glutathione biosynthesis pathway: ATP + gamma-L-glutamyl-L-cysteine + glycine = ADP + phosphate + glutathione. Glutathione is a tripeptide that functions as a reductant in many cellular reactions. ; GO: 0004363 glutathione synthase activity, 0006750 glutathione biosynthetic process. Probab=27.68 E-value=46 Score=14.40 Aligned_cols=28 Identities=29% Similarity=0.515 Sum_probs=24.9 Q ss_pred HHHHHHHHHHCCCEEEECCCCCCCCCCC Q ss_conf 9999999986695896116754333224 Q gi|254780757|r 17 ILTLTQYWAQQGCTILQPYDMEVGAGTF 44 (307) Q Consensus 17 i~~L~~fW~~~GC~i~qpyd~e~GAgT~ 44 (307) +..+..||+++|=+|+-|.|-.=|.|-| T Consensus 153 ~~~ir~F~~e~~divLKPL~g~GG~gi~ 180 (322) T TIGR01380 153 KAEIRAFLAEHGDIVLKPLDGMGGEGIF 180 (322) T ss_pred HHHHHHHHHHHHCEEECCCCCCCCCEEE T ss_conf 7999999987512587266677784367 No 103 >KOG0188 consensus Probab=27.39 E-value=31 Score=15.61 Aligned_cols=217 Identities=23% Similarity=0.315 Sum_probs=108.1 Q ss_pred CHHHHHHHHHHHHHHCCCEEEE--CCCCC-CCCCCCCHHHHHHHCCCCCCEEEEE--CCCC---CCCCCCCCCCCCCCCE Q ss_conf 7999999999999866958961--16754-3332248578987448344016764--1557---8876554667750011 Q gi|254780757|r 12 SFQNIILTLTQYWAQQGCTILQ--PYDME-VGAGTFHPSTTLRALGPLSWKAAYV--QPSR---RPLDGRYAENPNRLQH 83 (307) Q Consensus 12 ~fq~ii~~L~~fW~~~GC~i~q--pyd~e-~GAgT~hp~T~lr~lg~~pw~~aYv--qPsr---RP~DgRYGeNPNRlq~ 83 (307) .|-....+|.+--..|.|+-.- -.|++ ||--+ +.+||+--||.=... .|- +-|. ----..||-++-||+- T Consensus 49 ifLgt~a~lkRa~NsQKCIRAGGKHNDLdDVGkD~-yHHTfFEMLGNWSFG-dYFKeEac~~AwElLt~vygi~~dRLYV 126 (895) T KOG0188 49 IFLGTMAKLKRAANSQKCIRAGGKHNDLDDVGKDF-YHHTFFEMLGNWSFG-DYFKEEACAWAWELLTFVYGIPTDRLYV 126 (895) T ss_pred HHHHCHHHHHHHHHHHHHHHCCCCCCCHHHCCCCC-HHHHHHHHHCCCCHH-HHHHHHHHHHHHHHHHHHHCCCCCCEEE T ss_conf 65422145676650466776268767666626652-156799986376388-7889999999999987763798350799 Q ss_pred EEEE--EEEECCCCHHHHHHHHHHHHHCCCCCCCCCEEE-EECC-CCCCHHHHCCCCEEEEECCE--------------- Q ss_conf 4444--557579833589999999998199962171798-8328-88700110123217688787--------------- Q gi|254780757|r 84 YYQF--QVIIKPNPLNLQNLYIESLKAVGIDPLIHDVRF-VEDN-WESPTLGAWGLGWECWCDGM--------------- 144 (307) Q Consensus 84 y~Qf--QVilKPsp~niq~lYl~SL~~iGid~~~hDIrF-veDn-WEsPtLGAwGlGWEvwldGM--------------- 144 (307) -|== -+.=-|.--..-++.+ .||+++ .|=+-| ..|| ||----|.-|--=|+--|-. T Consensus 127 TYF~Gde~~Gle~DlE~r~iW~----~lGvp~-srILp~~~kDNFWEMGdtGPCGPCtEIHyDriGgr~a~~LVN~ddp~ 201 (895) T KOG0188 127 TYFGGDEKLGLEPDLECREIWN----ELGVPD-SRILPFGMKDNFWEMGDTGPCGPCTEIHYDRIGGRDAVKLVNHDDPD 201 (895) T ss_pred EEECCCCCCCCCCCHHHHHHHH----HCCCCH-HHCCCCCCHHHHHHCCCCCCCCCCHHHHHHHHCCCCHHHHHCCCCCC T ss_conf 9835841037995668999999----829978-88067861021565058878878402101430686658772688965 Q ss_pred --EEHHHHHHH---HHCCCCCCCC---HHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHH-HHHH- Q ss_conf --414777899---8668257751---45678789999999869982113320688878831106244746997-7763- Q gi|254780757|r 145 --EISQFTYFQ---QVCGIECSPI---SGEITYGLERLAMYVQNVNSVYDIVFNAIEGQNVLYGDIFAQSEQEY-SRYN- 214 (307) Q Consensus 145 --EItQFTYFQ---QvGGi~c~pv---~~EiTYGLERiaMylQ~vdnvyDl~wn~~~~~~vtYGdif~q~E~E~-S~YN- 214 (307) ||=-.-+.| ..+| -++|. .+.---|+|||---|||+.|-||..-= .-||-.-+.-- -.|+ T Consensus 202 v~EiWNlVFIq~NRe~dG-sL~pLp~khIDTGMGfERLvsVlQnk~SNYDTDlF---------tPif~aiq~~~~~~YtG 271 (895) T KOG0188 202 VLEIWNIVFIQYNREADG-SLKPLPKKHIDTGMGFERLVSVLQNKTSNYDTDLF---------TPIFEAIQEGTGARYTG 271 (895) T ss_pred HHHEEEEEEEEECCCCCC-CCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCHHH---------HHHHHHHHHHCCCCCCC T ss_conf 554000125774135687-65667411454465799999999565556641243---------79999999860887557 Q ss_pred HHHCCHHHHHHHHHHHHHHHHHHHHH-CCCCC Q ss_conf 50179899999999999999999850-06653 Q gi|254780757|r 215 FEYANPEILHNHFIDSEKECLDLLKK-GIPNE 245 (307) Q Consensus 215 FE~Ad~~~L~~~F~~yE~Ea~rLle~-~LP~~ 245 (307) =+..-+++-++.-.++-+..--.|.. +.|+. T Consensus 272 ~d~d~~D~AyRVvADH~RtlTvalsDG~~Pdn 303 (895) T KOG0188 272 EDADGIDTAYRVVADHARTLTVALSDGGVPDN 303 (895) T ss_pred CCCCCCHHHHHHHHHHHHHHHHEEECCCCCCC T ss_conf 75434213688988776562101004877788 No 104 >TIGR01366 serC_3 phosphoserine aminotransferase, putative; InterPro: IPR006272 These sequences represent a putative variant form of the serine biosynthesis enzyme phosphoserine aminotransferase, as found in Mycobacterium tuberculosis and related high-GC Gram-positive bacteria. ; GO: 0009058 biosynthetic process. Probab=26.48 E-value=29 Score=15.83 Aligned_cols=79 Identities=19% Similarity=0.201 Sum_probs=42.5 Q ss_pred CCCCCCCCCCCCCCCEEEEEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCC--HHHHCCCCEEEEECCEE Q ss_conf 8876554667750011444455757983358999999999819996217179883288870--01101232176887874 Q gi|254780757|r 68 RPLDGRYAENPNRLQHYYQFQVIIKPNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESP--TLGAWGLGWECWCDGME 145 (307) Q Consensus 68 RP~DgRYGeNPNRlq~y~QfQVilKPsp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsP--tLGAwGlGWEvwldGME 145 (307) .|.|||||--|.+.. ++.+|.|--.+-..+|...+..-++-+--.-.+- .|=.--=|+||-|.--- T Consensus 1 ~P~dGrfG~GPskvr------------~e~~~~~~~~~~~~~Gtshrq~~vkn~vG~~r~G~~~lf~lP~Gye~il~~GG 68 (362) T TIGR01366 1 KPADGRFGAGPSKVR------------LEQLQALVATAASLLGTSHRQAPVKNLVGKVREGLAELFSLPDGYEVILGNGG 68 (362) T ss_pred CCCCCCCCCCCCCCC------------HHHHHHHHHCCHHEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCC T ss_conf 989886458732001------------88899887211110002102014677776788878998408886078761787 Q ss_pred EHHHHHHHHHCCC Q ss_conf 1477789986682 Q gi|254780757|r 146 ISQFTYFQQVCGI 158 (307) Q Consensus 146 ItQFTYFQQvGGi 158 (307) -|-|==--..|=| T Consensus 69 ~tafWd~a~~Gli 81 (362) T TIGR01366 69 ATAFWDAATFGLI 81 (362) T ss_pred CHHHHHHHHHHHH T ss_conf 3145666665334 No 105 >pfam03800 Nuf2 Nuf2 family. Members of this family are components of the mitotic spindle. It has been shown that Nuf2 from yeast is part of a complex called the Ndc80p complex. This complex is thought to bind to the microtubules of the spindle. An arabidopsis protein has been included in this family that has previously not been identified as a member of this family. The match is not strong, but in common with other members of this family contains coiled-coil to the C terminus of this region. Probab=26.36 E-value=42 Score=14.71 Aligned_cols=24 Identities=38% Similarity=0.583 Sum_probs=18.4 Q ss_pred EEECCCCHHHHHHHHHHHHH-CCCC Q ss_conf 57579833589999999998-1999 Q gi|254780757|r 89 VIIKPNPLNLQNLYIESLKA-VGID 112 (307) Q Consensus 89 VilKPsp~niq~lYl~SL~~-iGid 112 (307) -|.||.|+.+|.+|-.-|+. .|+. T Consensus 30 ~l~kP~~~~v~~iy~~~l~~~~g~~ 54 (144) T pfam03800 30 DLLKPTPEFVQMLYERILELFMGIT 54 (144) T ss_pred HHCCCCHHHHHHHHHHHHHHHHCCC T ss_conf 8809998999999999999997939 No 106 >PRK11007 trehalose(maltose)-specific PTS system components IIBC; Provisional Probab=26.17 E-value=49 Score=14.22 Aligned_cols=66 Identities=20% Similarity=0.287 Sum_probs=38.2 Q ss_pred EEEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCEEEEECCEEEHHHHHHHHHCCCCCCCCH Q ss_conf 44455757983358999999999819996217179883288870011012321768878741477789986682577514 Q gi|254780757|r 85 YQFQVIIKPNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGWECWCDGMEISQFTYFQQVCGIECSPIS 164 (307) Q Consensus 85 ~QfQVilKPsp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGWEvwldGMEItQFTYFQQvGGi~c~pv~ 164 (307) -|||||+-| ++.+.|-+=++..|++-... ||..+.+ .--.+| .. .-+.-+.|+=---++ T Consensus 62 gQ~QVIIG~---~V~~vy~~~~~~~~~~~~~~-----~~~~~~~---~~~~~~--------~~--~~i~~is~if~PiIP 120 (473) T PRK11007 62 GQFQVVIGT---EVGDYYKALIAKTGQAEVDK-----EQAKKAA---RQNMKW--------HE--RLISHFAEIFFPLLP 120 (473) T ss_pred CEEEEEECC---CHHHHHHHHHHHHCCCCCCH-----HHHHHHH---HCCCCH--------HH--HHHHHHHHHHHHHHH T ss_conf 817999898---88999999998609876660-----2333443---214689--------99--999999998861499 Q ss_pred HHHHHHH Q ss_conf 5678789 Q gi|254780757|r 165 GEITYGL 171 (307) Q Consensus 165 ~EiTYGL 171 (307) +=++-|| T Consensus 121 ~l~aaGl 127 (473) T PRK11007 121 ALISGGL 127 (473) T ss_pred HHHHHHH T ss_conf 9999999 No 107 >KOG3040 consensus Probab=25.73 E-value=50 Score=14.16 Aligned_cols=102 Identities=25% Similarity=0.447 Sum_probs=65.2 Q ss_pred CCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEE Q ss_conf 77779999999999998669589611675433322485789874483440167641557887655466775001144445 Q gi|254780757|r 9 NDLSFQNIILTLTQYWAQQGCTILQPYDMEVGAGTFHPSTTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQ 88 (307) Q Consensus 9 ~~~~fq~ii~~L~~fW~~~GC~i~qpyd~e~GAgT~hp~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQ 88 (307) +..+||.+-.+.+-.-..+. |.=+-+|-|-++-.+-=-+|||-|.-+|-- |.-+ -.-- T Consensus 120 e~F~y~~ln~AFrvL~e~~k-----~~LIai~kgryykr~~Gl~lgpG~fv~aLe----------yatg-------~~a~ 177 (262) T KOG3040 120 EGFSYQRLNRAFRVLLEMKK-----PLLIAIGKGRYYKRVDGLCLGPGPFVAALE----------YATG-------CEAT 177 (262) T ss_pred CCCCHHHHHHHHHHHHCCCC-----CEEEEECCCEEEEECCCCCCCCHHHHHHHH----------HCCC-------CEEE T ss_conf 44668899999999970788-----707995275156413663237427777755----------5048-------6578 Q ss_pred EEECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCEE Q ss_conf 57579833589999999998199962171798832888700110123217 Q gi|254780757|r 89 VIIKPNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGWE 138 (307) Q Consensus 89 VilKPsp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGWE 138 (307) ||=||||. .+..-|.++|.+| |.+-++-||-.+-..||--.|-. T Consensus 178 vvGKP~~~----fFe~al~~~gv~p--~~aVMIGDD~~dDvgGAq~~GMr 221 (262) T KOG3040 178 VVGKPSPF----FFESALQALGVDP--EEAVMIGDDLNDDVGGAQACGMR 221 (262) T ss_pred EECCCCHH----HHHHHHHHCCCCH--HHHEEECCCCCCCHHHHHHHCCE T ss_conf 70699778----9999997638984--79167756223220557661622 No 108 >pfam01215 COX5B Cytochrome c oxidase subunit Vb. Probab=24.78 E-value=46 Score=14.37 Aligned_cols=26 Identities=31% Similarity=0.437 Sum_probs=18.7 Q ss_pred CCHHHHHHHHHHHHH--HHCCCCCEEEEE Q ss_conf 514567878999999--986998211332 Q gi|254780757|r 162 PISGEITYGLERLAM--YVQNVNSVYDIV 188 (307) Q Consensus 162 pv~~EiTYGLERiaM--ylQ~vdnvyDl~ 188 (307) |...|..-||||.-| -+||+| +||+. T Consensus 7 Ptd~EqATGlEr~Ell~~~~G~D-~fd~~ 34 (99) T pfam01215 7 PTDLDQETGLERRELLRELFGHD-VFDIK 34 (99) T ss_pred CCHHHHCCCHHHHHHHHHHCCCC-CCCCC T ss_conf 98668640578999999871999-45577 No 109 >smart00802 UME Domain in UVSB PI-3 kinase, MEI-41 and ESR-1. Characteristic domain in UVSP PI-3 kinase, MEI-41 and ESR-1. Found in nucleolar proteins. Associated with FAT, FATC, PI3_PI4_kinase modules. Probab=24.65 E-value=52 Score=14.03 Aligned_cols=36 Identities=25% Similarity=0.228 Sum_probs=27.7 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9987888387501788999999999999999999985 Q gi|254780757|r 265 HIFNLLDARGVISTTERQRYILRIRSLAKACGEAFLM 301 (307) Q Consensus 265 H~FNLLDARGaISVTERq~YI~RIR~Lak~~A~ayl~ 301 (307) =...+.|.+|.-++.||...|..|..|.+.+. .++- T Consensus 16 f~~~l~~~~~~~~~~eKk~~i~si~~lI~l~g-~~is 51 (107) T smart00802 16 FSNILHDSSGKKPYNEKKRALRSIGFLIKLMG-KHIS 51 (107) T ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHH-HHHH T ss_conf 99998362337998999999999999999868-8889 No 110 >pfam06920 Ded_cyto Dedicator of cytokinesis. This family represents a conserved region approximately 200 residues long within a number of eukaryotic dedicator of cytokinesis proteins. These are potential guanine nucleotide exchange factors, which activate some small GTPases by exchanging bound GDP for free GTP. Probab=24.58 E-value=52 Score=14.02 Aligned_cols=61 Identities=25% Similarity=0.383 Sum_probs=43.0 Q ss_pred HHHHHHHHCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999869982113320688878831106244746997776350179899999999999999999850 Q gi|254780757|r 171 LERLAMYVQNVNSVYDIVFNAIEGQNVLYGDIFAQSEQEYSRYNFEYANPEILHNHFIDSEKECLDLLKK 240 (307) Q Consensus 171 LERiaMylQ~vdnvyDl~wn~~~~~~vtYGdif~q~E~E~S~YNFE~Ad~~~L~~~F~~yE~Ea~rLle~ 240 (307) +..|.|-|||+=.+ .-| ++-..|-++|+..+.+ .+.....++.|...|.+.-.-|.+.|+. T Consensus 89 ~~~Lq~~L~G~v~a---~VN---gG~~~y~~~Fl~~~~~---~~~~~~~v~~Lk~~~~~~~~~~~~~L~l 149 (178) T pfam06920 89 IKTLQMLLQGSVDV---QVN---GGPLKYAEVFLSEDYA---NNYPAEHVQKLKKAFRDFLKICGEALEL 149 (178) T ss_pred HHHHHHHHCCCCCH---HHC---CCHHHHHHHHCCCCCC---CCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 78999985211335---741---6899999998397446---8535899999999999999999999999 No 111 >PRK05431 seryl-tRNA synthetase; Provisional Probab=24.41 E-value=52 Score=14.00 Aligned_cols=97 Identities=21% Similarity=0.203 Sum_probs=61.4 Q ss_pred CCCCCCCCHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEE-----EEECCCCHHHHHHH-------HHH Q ss_conf 433322485789874483440167641557887655466775001144445-----57579833589999-------999 Q gi|254780757|r 38 EVGAGTFHPSTTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQ-----VIIKPNPLNLQNLY-------IES 105 (307) Q Consensus 38 e~GAgT~hp~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQ-----VilKPsp~niq~lY-------l~S 105 (307) ||.-+-||--..| ....=|.+.+=+-||.|---|-||...--|.+-|||. ++.+| +++.+.. -+- T Consensus 231 EvpL~~~~~~eil-~~~~LPik~~~~S~CFR~EAGs~GkdtrGl~RvHQF~KVE~f~~~~p--e~S~~~~e~~~~~~e~i 307 (422) T PRK05431 231 EVPLTNLHRDEIL-DEEELPIKYTAYSPCFRSEAGSAGRDTRGLIRQHQFDKVELVKFTKP--EDSYAELEELTGNAEEI 307 (422) T ss_pred CCCHHHHHHCCCC-CHHHCCHHHEEECCHHHHCCCCCCCCCCCCEEEEEECEEEEEEECCH--HHHHHHHHHHHHHHHHH T ss_conf 3212334401133-65439844527883653031336755676124430000343353586--77999999999999999 Q ss_pred HHHCCCCCCCCCEEEEECCCCCCHHHHCC---CCEEEEECCE Q ss_conf 99819996217179883288870011012---3217688787 Q gi|254780757|r 106 LKAVGIDPLIHDVRFVEDNWESPTLGAWG---LGWECWCDGM 144 (307) Q Consensus 106 L~~iGid~~~hDIrFveDnWEsPtLGAwG---lGWEvwldGM 144 (307) |+.|||-.+.-++ -|--||+.. .--|||+-|- T Consensus 308 ~~~L~lpyrvv~~-------~sgdlg~~a~~~yDiE~W~P~~ 342 (422) T PRK05431 308 LQKLELPYRVVLL-------CTGDMGFSAAKTYDLEVWLPGQ 342 (422) T ss_pred HHHCCCCCCCCCC-------CCCCCCHHHHHHEEEEEEECCC T ss_conf 9875985023567-------8765576656545457871566 No 112 >cd00924 Cyt_c_Oxidase_Vb Cytochrome c oxidase subunit Vb. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit Vb is one of three mammalian subunits that lacks a transmembrane region. Subunit Vb is located on the matrix side of the membrane and binds the regulatory subunit of protein kinase A. The abnormally extended conformation is stable only in the CcO assembly. Probab=24.14 E-value=49 Score=14.23 Aligned_cols=27 Identities=26% Similarity=0.422 Sum_probs=16.7 Q ss_pred CCHHHHHHHHHHHHH--HHCCCCCEEEEEE Q ss_conf 514567878999999--9869982113320 Q gi|254780757|r 162 PISGEITYGLERLAM--YVQNVNSVYDIVF 189 (307) Q Consensus 162 pv~~EiTYGLERiaM--ylQ~vdnvyDl~w 189 (307) |...|..-||||.-| -+||+| +||+.- T Consensus 7 ptd~eqATGlEr~Ella~~~G~D-~fd~~~ 35 (97) T cd00924 7 PTDLEQATGLERKELLAKLEGID-DFDMKP 35 (97) T ss_pred CCHHHHCCCHHHHHHHHHHCCCC-CCCCCC T ss_conf 88678651688999999872998-223666 No 113 >pfam12410 rpo30_N Poxvirus DNA dependent RNA polymerase 30kDa subunit. This family of proteins is found in viruses. Proteins in this family are typically between 193 and 259 amino acids in length. The family is found in association with pfam01096. There are two conserved sequence motifs: GIEYSKD and LRY. This family is N terminal of the 30 kDa subunit of poxvirus DNA-d-RNA-pol. It has structural similarity to the eukaryotic transcriptional elongation factor SII. Probab=24.13 E-value=40 Score=14.83 Aligned_cols=52 Identities=35% Similarity=0.432 Sum_probs=39.0 Q ss_pred HHHHHCCHHHHHHHHHHHHHHHHHHH-------HHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHC Q ss_conf 63501798999999999999999998-------50066532335530138289999999998788838 Q gi|254780757|r 213 YNFEYANPEILHNHFIDSEKECLDLL-------KKGIPNEHHRHHLCVFPAYDQCIKASHIFNLLDAR 273 (307) Q Consensus 213 YNFE~Ad~~~L~~~F~~yE~Ea~rLl-------e~~LP~~~~~~~~lvlPAYD~~LKaSH~FNLLDAR 273 (307) -|.|++|.--+.+.-+-.|+|..|-+ .++.+ -++=-+|-||| +-.||+||-+ T Consensus 76 tn~eY~dlc~~Ir~TnG~eke~LRYlLfgiKCv~~gVe-----YdId~l~D~dY----~~YFnVLdeK 134 (136) T pfam12410 76 TNKEYADLCDMIRSTNGTEKEILRYLLFGIKCVKKGVE-----YDIDKLRDYDY----DEYFNVLDEK 134 (136) T ss_pred CCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCC-----CCHHHCCCCCH----HHHHHHHHHC T ss_conf 55059999999987068249999999999999961897-----77222345563----8774654111 No 114 >COG1923 Hfq Uncharacterized host factor I protein [General function prediction only] Probab=23.93 E-value=46 Score=14.39 Aligned_cols=23 Identities=26% Similarity=0.374 Sum_probs=18.4 Q ss_pred CCCHHHHHHHHHHHHHCCCCCCC Q ss_conf 98335899999999981999621 Q gi|254780757|r 93 PNPLNLQNLYIESLKAVGIDPLI 115 (307) Q Consensus 93 Psp~niq~lYl~SL~~iGid~~~ 115 (307) -.|.|+||.||++++.=+|-... T Consensus 2 ~~~~nlQD~fLn~~Rk~~~~VtI 24 (77) T COG1923 2 AKGQNLQDPFLNALRKEKIPVTI 24 (77) T ss_pred CCCCCCCHHHHHHHHHCCCEEEE T ss_conf 86423326899988863964899 No 115 >PRK09537 pylS pyrolysyl-tRNA synthetase; Reviewed Probab=23.88 E-value=38 Score=15.02 Aligned_cols=121 Identities=27% Similarity=0.294 Sum_probs=69.9 Q ss_pred HHHHHHHC---CCCCCEEEEECCCCC-CCCCCCCCCCCCCCEEEEEEEEECCCCHHHHHHHHHHHHHCCCCCCC-CCEEE Q ss_conf 57898744---834401676415578-87655466775001144445575798335899999999981999621-71798 Q gi|254780757|r 46 PSTTLRAL---GPLSWKAAYVQPSRR-PLDGRYAENPNRLQHYYQFQVIIKPNPLNLQNLYIESLKAVGIDPLI-HDVRF 120 (307) Q Consensus 46 p~T~lr~l---g~~pw~~aYvqPsrR-P~DgRYGeNPNRlq~y~QfQVilKPsp~niq~lYl~SL~~iGid~~~-hDIrF 120 (307) -.+.||-| =|+|.++--+-||.| .+||.---|---+-.+-|.---+ .-+|.+.+--+-|.++|||... +|--. T Consensus 271 LYnylrkL~R~lpdPIrIFEIGpCyRKESdG~~HLeEFTMlNfcqmgsgc--trenle~~i~efl~~lgIdfeiv~dscm 348 (420) T PRK09537 271 LYNYLRKLDRALPDPIKIFEIGPCYRKESDGKEHLEEFTMVNFCQMGSGC--TRENLEALITEFLNHLGIDFEIVGDSCM 348 (420) T ss_pred HHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCC--CHHHHHHHHHHHHHHCCCEEEEECCCEE T ss_conf 99999999861688715886204303444106788777777688644898--6788999999999861972798425206 Q ss_pred E--------ECCCC--CCHHHHCCCCEEEEECCEEEHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEE Q ss_conf 8--------32888--7001101232176887874147778998668257751456787899999998699821133 Q gi|254780757|r 121 V--------EDNWE--SPTLGAWGLGWECWCDGMEISQFTYFQQVCGIECSPISGEITYGLERLAMYVQNVNSVYDI 187 (307) Q Consensus 121 v--------eDnWE--sPtLGAwGlGWEvwldGMEItQFTYFQQvGGi~c~pv~~EiTYGLERiaMylQ~vdnvyDl 187 (307) | ..|-| |..+|.-- ||+- =||+ +| =+-+-+|||||.|-.+|-.||-.. T Consensus 349 VYGdT~DvMhgDlELsSavvGP~p------LDr~-----------Wgi~-kP-WiGaGFGLERLLkV~hg~~nikra 406 (420) T PRK09537 349 VYGDTLDIMHGDLELSSAVVGPIP------LDRE-----------WGID-KP-WIGAGFGLERLLKVMHGFKNIKRA 406 (420) T ss_pred EECCHHHHHCCCHHHCCCCCCCCC------CCCC-----------CCCC-CC-CCCCCHHHHHHHHHHHCCHHHHHH T ss_conf 853225654264011035447744------5655-----------5766-76-202333099999998353667777 No 116 >PTZ00047 cytochrome C oxidase subunit II; Provisional Probab=23.65 E-value=37 Score=15.12 Aligned_cols=62 Identities=31% Similarity=0.687 Sum_probs=40.9 Q ss_pred CCCCCCCCCCCCCCCCCEEEEEEEEECCCCHHHHHHHHHHHHHCCCC-----CCCCCEEEEE------CCCCCCHHH Q ss_conf 57887655466775001144445575798335899999999981999-----6217179883------288870011 Q gi|254780757|r 66 SRRPLDGRYAENPNRLQHYYQFQVIIKPNPLNLQNLYIESLKAVGID-----PLIHDVRFVE------DNWESPTLG 131 (307) Q Consensus 66 srRP~DgRYGeNPNRlq~y~QfQVilKPsp~niq~lYl~SL~~iGid-----~~~hDIrFve------DnWEsPtLG 131 (307) -+-|+--.|-+||.+.-.||+||--|-+.- ||=-+-++-+-+| |..--|||+- -.|--|.|| T Consensus 31 ~~~~~~~~~~~~~~~~~~~~~f~s~mv~d~----DL~~G~~R~LeVDNrLvlPvg~pVr~liTS~DVIHSF~VPslg 103 (163) T PTZ00047 31 ERYPIPEKYLKNPELIPKYYSFQSNLVTDE----DLQPGMLRQLEVDKRLTLPTRTHIRFLVTATDVIHSWSIPSLG 103 (163) T ss_pred CCCCCCHHHCCCCCCCCCEEEEECCCCCHH----CCCCCCCEEECCCCCEEEECCCEEEEEEEECCCHHHCCCCCCC T ss_conf 037993686439401664356531565320----2899970155234861702898179999834100110056544 No 117 >COG1190 LysU Lysyl-tRNA synthetase (class II) [Translation, ribosomal structure and biogenesis] Probab=23.52 E-value=35 Score=15.31 Aligned_cols=27 Identities=33% Similarity=0.699 Sum_probs=11.6 Q ss_pred CCHHHHHHHHHHHHHHHCCCCCEEEEE Q ss_conf 514567878999999986998211332 Q gi|254780757|r 162 PISGEITYGLERLAMYVQNVNSVYDIV 188 (307) Q Consensus 162 pv~~EiTYGLERiaMylQ~vdnvyDl~ 188 (307) |.+|-.--|+.||+|.|-+..||-|++ T Consensus 467 PPTgG~GiGIDRLvMllT~~~sIRdVi 493 (502) T COG1190 467 PPTGGLGIGIDRLVMLLTNSPSIRDVI 493 (502) T ss_pred CCCCCCCCCHHHHHHHHCCCCCHHHEE T ss_conf 998875125777777773998554222 No 118 >PRK00147 queA S-adenosylmethionine:tRNA ribosyltransferase-isomerase; Provisional Probab=23.43 E-value=28 Score=15.92 Aligned_cols=27 Identities=41% Similarity=0.710 Sum_probs=18.2 Q ss_pred HHHHHHHCCCCCCEEEEECCCCCCCCCC Q ss_conf 5789874483440167641557887655 Q gi|254780757|r 46 PSTTLRALGPLSWKAAYVQPSRRPLDGR 73 (307) Q Consensus 46 p~T~lr~lg~~pw~~aYvqPsrRP~DgR 73 (307) .-.+++.+++..|. |.+.|+||+.-|. T Consensus 79 Eilll~~~~~~~w~-~lvk~~kk~k~G~ 105 (345) T PRK00147 79 EVLLLRPLGEDRWL-ALVRPAKRPKPGT 105 (345) T ss_pred EEEEEEECCCCEEE-EEEECCCCCCCCC T ss_conf 99998774898799-9995488899999 No 119 >pfam01563 Alpha_E3_glycop Alphavirus E3 glycoprotein. This protein is found in some alphaviruses as a virion associated spike protein. Probab=23.41 E-value=55 Score=13.86 Aligned_cols=28 Identities=21% Similarity=0.450 Sum_probs=21.1 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 99999999985006653233553013828999999999 Q gi|254780757|r 229 DSEKECLDLLKKGIPNEHHRHHLCVFPAYDQCIKASHI 266 (307) Q Consensus 229 ~yE~Ea~rLle~~LP~~~~~~~~lvlPAYD~~LKaSH~ 266 (307) .--++..++|+.+.. -|+||..|.|+-. T Consensus 20 ~~P~~tL~mLe~Nvd----------~p~Yd~LL~a~l~ 47 (56) T pfam01563 20 KEPEKTLRMLEDNVD----------NPGYDELLEAVLT 47 (56) T ss_pred CCHHHHHHHHHHHCC----------CCCHHHHHHHHHH T ss_conf 798999999998078----------8349999999950 No 120 >TIGR03112 6_pyr_pter_rel 6-pyruvoyl tetrahydropterin synthase-related domain. Members of this family are small proteins, or small domains of larger proteins, that occur in certain Firmicutes in the same regions as members of families TIGR03110 and TIGR03111. Members of TIGR03110 resemble exosortase, a proposed protein sorting transpeptidase (see TIGR02602). TIGR03111 represents a small clade among the group 2 glycosyltransferases. Members of the current protein family resemble eukaryotic known and prokaryotic predicted 6-pyruvoyl tetrahydropterin synthases. Probab=23.39 E-value=47 Score=14.33 Aligned_cols=37 Identities=19% Similarity=0.321 Sum_probs=25.3 Q ss_pred ECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEE-CCCCCCHH Q ss_conf 57983358999999999819996217179883-28887001 Q gi|254780757|r 91 IKPNPLNLQNLYIESLKAVGIDPLIHDVRFVE-DNWESPTL 130 (307) Q Consensus 91 lKPsp~niq~lYl~SL~~iGid~~~hDIrFve-DnWEsPtL 130 (307) +-||-+||-.-.-+-|+.. .....+|+.+ .=|||||- T Consensus 71 ~NPT~ENia~yi~~~l~~~---L~~~~v~l~~v~v~EtPtr 108 (113) T TIGR03112 71 INPTLENIGDYFFDEIKKL---LKEKGWKLHSIEISETPTR 108 (113) T ss_pred CCCCHHHHHHHHHHHHHHH---HCCCCEEEEEEEEEECCCE T ss_conf 8987899999999999987---5607709999999989945 No 121 >CHL00201 syh histidine-tRNA synthetase; Provisional Probab=22.77 E-value=56 Score=13.78 Aligned_cols=79 Identities=14% Similarity=0.160 Sum_probs=46.6 Q ss_pred HCCCEEEECCCCCCCCCCCCHHHHHHHCCCCCCEEEEECCCCC---CCCCCCCCCCCCCCEEEEEEEEE--CCCCH---H Q ss_conf 6695896116754333224857898744834401676415578---87655466775001144445575--79833---5 Q gi|254780757|r 26 QQGCTILQPYDMEVGAGTFHPSTTLRALGPLSWKAAYVQPSRR---PLDGRYAENPNRLQHYYQFQVII--KPNPL---N 97 (307) Q Consensus 26 ~~GC~i~qpyd~e~GAgT~hp~T~lr~lg~~pw~~aYvqPsrR---P~DgRYGeNPNRlq~y~QfQVil--KPsp~---n 97 (307) +.|=.+.--+|+-+.-+-+--++- ....+.|.+..|..|+.| |.-||| -.++|+-|=+ -++|. . T Consensus 72 ~~g~~l~LRpD~T~piaR~~~~~~-~~~~~~P~r~~y~g~vfR~e~pq~GR~-------REF~Q~gvEiiG~~~~~aDaE 143 (424) T CHL00201 72 RGNRSITLRPEGTASIVRAFIENR-MAYHMRPQRLWYSGPMFRYERPQSGRQ-------RQFHQLGIEFIGSSDPRADME 143 (424) T ss_pred CCCCEEEECCCCHHHHHHHHHHCC-CCCCCCCEEEEEECCEEECCCCCCCCC-------CHHHHCCEEEECCCCHHHHHH T ss_conf 999888867877099999999747-655799878998754780589878864-------210414620006898287899 Q ss_pred HHHHHHHHHHHCCCC Q ss_conf 899999999981999 Q gi|254780757|r 98 LQNLYIESLKAVGID 112 (307) Q Consensus 98 iq~lYl~SL~~iGid 112 (307) +..|-.++|+++|+. T Consensus 144 vi~l~~~~l~~lgl~ 158 (424) T CHL00201 144 VIHLAMTLFNELPVH 158 (424) T ss_pred HHHHHHHHHHHCCCC T ss_conf 999999999865965 No 122 >pfam00765 Autoind_synth Autoinducer synthetase. Probab=22.59 E-value=18 Score=17.32 Aligned_cols=17 Identities=35% Similarity=1.036 Sum_probs=10.7 Q ss_pred CCCEEEEE-CCEEEHHHH Q ss_conf 23217688-787414777 Q gi|254780757|r 134 GLGWECWC-DGMEISQFT 150 (307) Q Consensus 134 GlGWEvwl-dGMEItQFT 150 (307) -|||+|=+ ||||+-||- T Consensus 24 rlgW~v~~~~g~E~DqyD 41 (182) T pfam00765 24 RLDWDVNCSNGMEFDEYD 41 (182) T ss_pred CCCCCCCCCCCCCCCCCC T ss_conf 269878789997755789 No 123 >TIGR02253 CTE7 HAD superfamily (subfamily IA) hydrolase, TIGR02253; InterPro: IPR011950 This family of sequences from archaea and metazoans includes the human uncharacterised protein CTE7. Pyrococcus species appear to have three different forms of this enzyme, so it is unclear whether all members of this family have the same function. This family is a member of the haloacid dehalogenase (HAD) superfamily of hydrolases which are characterised by three conserved sequence motifs. By virtue of an alpha helical domain in-between the first and second conserved motif, this family is a member of subfamily IA .. Probab=22.50 E-value=57 Score=13.74 Aligned_cols=46 Identities=28% Similarity=0.622 Sum_probs=38.4 Q ss_pred ECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCC--EEEEEC Q ss_conf 5798335899999999981999621717988328887001101232--176887 Q gi|254780757|r 91 IKPNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLG--WECWCD 142 (307) Q Consensus 91 lKPsp~niq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlG--WEvwld 142 (307) .||.| +++...|+.+|+.|. |-=+|=|+=++--.||--+| =-||++ T Consensus 168 ~KPhP----~IF~~Al~~~gV~p~--eaVmVGD~L~~Di~GAk~~Gm~tTvwi~ 215 (244) T TIGR02253 168 EKPHP----KIFYAALRRLGVKPE--EAVMVGDRLDKDIKGAKNLGMKTTVWIN 215 (244) T ss_pred CCCCH----HHHHHHHHHHCCCCC--CEEEECCCCCCCCHHHHHHHHHEEEEEC T ss_conf 69785----899999997088936--6667678500022235641201037634 No 124 >TIGR00458 aspS_arch aspartyl-tRNA synthetase; InterPro: IPR004523 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Aspartyl tRNA synthetase 6.1.1.12 from EC is an alpha2 dimer that belongs to class IIb. Structural analysis combined with mutagenesis and enzymology data on the yeast enzyme point to a tRNA binding process that starts by a recognition event between the tRNA anticodon loop and the synthetase anticodon binding module . This family represents aspartyl-tRNA synthetases from the eukaryotic cytosol and from the archaea. In some species, this enzyme aminoacylates tRNA for both Asp and Asn; Asp-tRNA(asn) is subsequently transamidated to Asn-tRNA(asn). ; GO: 0000166 nucleotide binding, 0004815 aspartate-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006422 aspartyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=22.46 E-value=40 Score=14.81 Aligned_cols=38 Identities=29% Similarity=0.506 Sum_probs=16.4 Q ss_pred CEEEHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEE Q ss_conf 874147778998668257751456787899999998699821133 Q gi|254780757|r 143 GMEISQFTYFQQVCGIECSPISGEITYGLERLAMYVQNVNSVYDI 187 (307) Q Consensus 143 GMEItQFTYFQQvGGi~c~pv~~EiTYGLERiaMylQ~vdnvyDl 187 (307) |-|+=..|||..-.=+.=.|= |-.=.|-.-+.|-||.| T Consensus 190 Gt~lF~i~YF~~~AfL~QSPQ-------lyKQ~l~~~g~~rVyEI 227 (466) T TIGR00458 190 GTELFPITYFEREAFLGQSPQ-------LYKQVLMAAGLERVYEI 227 (466) T ss_pred CCCCCEEEEECCCCCCCCCCH-------HHHHHHHHCCCCCEEEE T ss_conf 853320122123000135703-------56687765265203550 No 125 >pfam02975 Me-amine-dh_L Methylamine dehydrogenase, L chain. Probab=22.09 E-value=49 Score=14.21 Aligned_cols=21 Identities=29% Similarity=0.314 Sum_probs=18.3 Q ss_pred CCCCCCEEEEECCCCCCCCCC Q ss_conf 483440167641557887655 Q gi|254780757|r 53 LGPLSWKAAYVQPSRRPLDGR 73 (307) Q Consensus 53 lg~~pw~~aYvqPsrRP~DgR 73 (307) -|.||=.+..|--||-|.||| T Consensus 40 PGte~s~~sWvGtC~NP~Dgk 60 (122) T pfam02975 40 PGTELAPASWVASCFNPGDGQ 60 (122) T ss_pred CCCCCCCCEEEEECCCCCCCC T ss_conf 988557633678725899997 No 126 >PRK12292 hisZ ATP phosphoribosyltransferase regulatory subunit; Provisional Probab=21.78 E-value=59 Score=13.64 Aligned_cols=52 Identities=23% Similarity=0.310 Sum_probs=38.5 Q ss_pred CCCCCEEEEECCCCC---CCCCCCCCCCCCCCEEEEEEEEE--CCCCH---HHHHHHHHHHHHCCCC Q ss_conf 834401676415578---87655466775001144445575--79833---5899999999981999 Q gi|254780757|r 54 GPLSWKAAYVQPSRR---PLDGRYAENPNRLQHYYQFQVII--KPNPL---NLQNLYIESLKAVGID 112 (307) Q Consensus 54 g~~pw~~aYvqPsrR---P~DgRYGeNPNRlq~y~QfQVil--KPsp~---niq~lYl~SL~~iGid 112 (307) .+.|.+..|..|+.| |.-| |.-.++|+-|=+ .+||. .+..|-+++|+.+|+. T Consensus 95 ~~~p~r~~Y~g~vfR~e~pq~G-------r~REf~Q~G~EiiG~~~~~aDaEvI~l~~~~l~~lgl~ 154 (388) T PRK12292 95 RPGPLRLCYAGNVFRAQPRGLG-------RSREFLQSGVELIGDAGIEADAEVLSLLLDALKALGLP 154 (388) T ss_pred CCCCEEEEEEEEEEEECCCCCC-------CCCEEEECCEEEECCCCHHHHHHHHHHHHHHHHHCCCC T ss_conf 6787689985226870589899-------61023425755317888899999999999999976998 No 127 >pfam05066 RNA_pol_delta DNA-directed RNA polymerase delta subunit. The delta protein is a dispensable subunit of Bacillus subtilis RNA polymerase (RNAP) that has major effects on the biochemical properties of the purified enzyme. In the presence of delta, RNAP displays an increased specificity of transcription, a decreased affinity for nucleic acids, and an increased efficiency of RNA synthesis because of enhanced recycling. The delta protein, contains two distinct regions, an N-terminal domain and a glutamate and aspartate residue-rich carboxyl-terminal region. Probab=21.71 E-value=59 Score=13.63 Aligned_cols=19 Identities=11% Similarity=0.443 Sum_probs=11.2 Q ss_pred CCCCCCCCHHHHHHHHHHH Q ss_conf 3356777799999999999 Q gi|254780757|r 5 HNKKNDLSFQNIILTLTQY 23 (307) Q Consensus 5 ~~~~~~~~fq~ii~~L~~f 23 (307) ..++++|+|+||+...+++ T Consensus 25 ~ek~~~m~F~dL~~ev~~~ 43 (94) T pfam05066 25 KEKGKPMTFDDLVNEIQKL 43 (94) T ss_pred HHCCCCCCHHHHHHHHHHH T ss_conf 9849941399999999999 No 128 >TIGR03351 PhnX-like phosphonatase-like hydrolase. This clade of sequences are the closest homologs to the PhnX enzyme, phosphonoacetaldehyde (Pald) hydrolase (phosphonatase, TIGR01422). This phosphonatase-like enzyme and PhnX itself are members of the haloacid dehalogenase (HAD) superfamily (pfam00702) having a a number of distinctive features that set them apart from typical HAD enzymes. The typical HAD N-terminal motif DxDx(T/V) here is DxAGT and the usual conserved lysine prior to the C-terminal motif is instead an arginine. Also distinctive of phosphonatase, and particular to its bi-catalytic mechanism is a conserved lysine in the variable "cap" domain. This lysine forms a Schiff base with the aldehyde of phosphonoacetaldehyde, providing, through the resulting positive charge, a polarization of the C-P bond necesary for cleavage as well as a route to the initial product of cleavage, an ene-amine. The conservation of these elements in this phosphonatase-like enzyme suggests that the Probab=21.50 E-value=59 Score=13.60 Aligned_cols=87 Identities=17% Similarity=0.153 Sum_probs=45.5 Q ss_pred HHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEE--------ECCCCHHHHHHHHHHHHHCCCCCCCCCE Q ss_conf 78987448344016764155788765546677500114444557--------5798335899999999981999621717 Q gi|254780757|r 47 STTLRALGPLSWKAAYVQPSRRPLDGRYAENPNRLQHYYQFQVI--------IKPNPLNLQNLYIESLKAVGIDPLIHDV 118 (307) Q Consensus 47 ~T~lr~lg~~pw~~aYvqPsrRP~DgRYGeNPNRlq~y~QfQVi--------lKPsp~niq~lYl~SL~~iGid~~~hDI 118 (307) ..+|..|...-++++-+--+.|..=-+.-+.- -++.+..|.++ -||.|+ +|+..++.+|+.+.. .+ T Consensus 93 ~elL~~L~~~gi~~av~T~~~~~~~~~~l~~~-g~~~~~~fd~vv~~~dv~~~KP~Pd----~~l~a~~~lg~~~~~-~~ 166 (220) T TIGR03351 93 EEAFRSLRSSGIKVALTTGFDRDTAERLLEKL-GWTVGDDVDAVVCPSDVAAGRPAPD----LILRAMELTGVQDVQ-SV 166 (220) T ss_pred HHHHHHHHHCCCCEEEECCCCHHHHHHHHHHH-CCCHHHHCCCCCCCCCCCCCCCCHH----HHHHHHHHHCCCCCC-CE T ss_conf 99999999879977663599779999999982-7640432240456343566798789----999999995889643-31 Q ss_pred EEEECCCCCCHHHH--CCCCEEEE Q ss_conf 98832888700110--12321768 Q gi|254780757|r 119 RFVEDNWESPTLGA--WGLGWECW 140 (307) Q Consensus 119 rFveDnWEsPtLGA--wGlGWEvw 140 (307) -+|+|..- --..| .|.++=|. T Consensus 167 ~vigDs~~-di~aA~~AG~~~~ig 189 (220) T TIGR03351 167 AVAGDTPN-DLEAGINAGAGAVVG 189 (220) T ss_pred EECCCCHH-HHHHHHHCCCCEEEE T ss_conf 35378788-999999859987999 No 129 >COG2502 AsnA Asparagine synthetase A [Amino acid transport and metabolism] Probab=21.38 E-value=60 Score=13.59 Aligned_cols=169 Identities=21% Similarity=0.356 Sum_probs=85.5 Q ss_pred HHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHCCC--CCCEEEEECC-------------------CCC----- Q ss_conf 99999999998669589611675433322485789874483--4401676415-------------------578----- Q gi|254780757|r 15 NIILTLTQYWAQQGCTILQPYDMEVGAGTFHPSTTLRALGP--LSWKAAYVQP-------------------SRR----- 68 (307) Q Consensus 15 ~ii~~L~~fW~~~GC~i~qpyd~e~GAgT~hp~T~lr~lg~--~pw~~aYvqP-------------------srR----- 68 (307) +++-.|.+ |..+ -+--|+.+-|-|-|-.++.||.=.. .+....||.- ..| T Consensus 68 EvVhSLAK-WKR~---tL~r~~f~~~eGlythM~AlRpDeD~ld~~HSvYVDQWDWEkvi~~g~rNl~yLK~tV~kIY~~ 143 (330) T COG2502 68 EVVHSLAK-WKRH---TLARYGFSAGEGLYTHMKALRPDEDRLDPIHSVYVDQWDWEKVIPDGDRNLAYLKSTVEKIYAA 143 (330) T ss_pred HHHHHHHH-HHHH---HHHHCCCCCCCCEEEECCCCCCCCCCCCCHHEEEECCCCHHHHCCCCCCCHHHHHHHHHHHHHH T ss_conf 67688899-9888---8986387678724652110378733445302278521136442587540379999999999999 Q ss_pred ------CCCCCCCCCCCCCCEEEEEE---EEE--CC--CCHHHHHHHHHHHHH---CCC-----CCCCCCEEEEE-CCCC Q ss_conf ------87655466775001144445---575--79--833589999999998---199-----96217179883-2888 Q gi|254780757|r 69 ------PLDGRYAENPNRLQHYYQFQ---VII--KP--NPLNLQNLYIESLKA---VGI-----DPLIHDVRFVE-DNWE 126 (307) Q Consensus 69 ------P~DgRYGeNPNRlq~y~QfQ---Vil--KP--sp~niq~lYl~SL~~---iGi-----d~~~hDIrFve-DnWE 126 (307) --+-|||.- --|-..-+|- -+. -| +|.+--.-..+-+.| ||| |-+.||+|--+ |||. T Consensus 144 ir~te~av~~~~~~~-~~LP~~ItFihseeL~~ryP~L~~k~RE~ai~Ke~gAvFligIGg~LsdG~~hd~RaPdYDdWt 222 (330) T COG2502 144 IRETELAVSAEFGLA-PFLPDQITFIHSQELVARYPDLDPKGRERAIAKELGAVFLIGIGGKLSDGKPHDVRAPDYDDWT 222 (330) T ss_pred HHHHHHHHHHHCCCC-CCCCCCEEEEEHHHHHHHCCCCCCCHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCCCCC T ss_conf 999999998652876-4587615796369999768899930246789986383899841650699995778899865577 Q ss_pred CCHHH-HCCC-------------CEEEEECCEEEHHHHHHHHHCCCCCC----------------CCHHHHHHHHHHHHH Q ss_conf 70011-0123-------------21768878741477789986682577----------------514567878999999 Q gi|254780757|r 127 SPTLG-AWGL-------------GWECWCDGMEISQFTYFQQVCGIECS----------------PISGEITYGLERLAM 176 (307) Q Consensus 127 sPtLG-AwGl-------------GWEvwldGMEItQFTYFQQvGGi~c~----------------pv~~EiTYGLERiaM 176 (307) +|+-+ --|| +.|+---|+-|.-=+.-.|+.=-.|. |-+.-=--|--|||| T Consensus 223 t~se~~~~gLNGDilvwn~~l~~afElSSMGIRVde~~l~~Ql~ltgdeDrl~~~wHq~llng~lP~TIGGGIGQSRl~M 302 (330) T COG2502 223 TPSELGYKGLNGDILVWNPVLGDAFELSSMGIRVDEDALKRQLALTGDEDRLELEWHQMLLNGELPQTIGGGIGQSRLCM 302 (330) T ss_pred CCCCCCCCCCCCCEEEECHHCCCCEEEECCEEEECHHHHHHHHHCCCCHHHHCCHHHHHHHCCCCCCCCCCCCCHHHHHH T ss_conf 85534546667757985200141102212404665899999985268635430769999973977711047501889999 Q ss_pred HHCCCCCEEEEE Q ss_conf 986998211332 Q gi|254780757|r 177 YVQNVNSVYDIV 188 (307) Q Consensus 177 ylQ~vdnvyDl~ 188 (307) ++-.+.++-... T Consensus 303 ~lL~k~HIGeVQ 314 (330) T COG2502 303 LLLQKKHIGEVQ 314 (330) T ss_pred HHHCCCCCCEEE T ss_conf 996123224155 No 130 >PRK02363 DNA-directed RNA polymerase subunit delta; Reviewed Probab=21.36 E-value=60 Score=13.58 Aligned_cols=21 Identities=10% Similarity=0.412 Sum_probs=16.0 Q ss_pred CCCCCCCCHHHHHHHHHHHHH Q ss_conf 335677779999999999998 Q gi|254780757|r 5 HNKKNDLSFQNIILTLTQYWA 25 (307) Q Consensus 5 ~~~~~~~~fq~ii~~L~~fW~ 25 (307) ..++++|+|++|+..++++-. T Consensus 25 ~~~~~~~~F~dL~~ei~~~~g 45 (185) T PRK02363 25 EDKGKPMTFNDLVNEIQKLLG 45 (185) T ss_pred HHCCCCCCHHHHHHHHHHHHC T ss_conf 964993039999999999958 No 131 >PRK10834 hypothetical protein; Provisional Probab=21.19 E-value=60 Score=13.56 Aligned_cols=60 Identities=18% Similarity=0.295 Sum_probs=38.2 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHH-----HHHHHHHH Q ss_conf 9999999999999850066532335530138289999999998788838750178-----89999999 Q gi|254780757|r 225 NHFIDSEKECLDLLKKGIPNEHHRHHLCVFPAYDQCIKASHIFNLLDARGVISTT-----ERQRYILR 287 (307) Q Consensus 225 ~~F~~yE~Ea~rLle~~LP~~~~~~~~lvlPAYD~~LKaSH~FNLLDARGaISVT-----ERq~YI~R 287 (307) ..+|+-...-+.|++++.|...--...-=+--||-|..|-++|.+ +.+|=|| ||+-||.| T Consensus 94 ~~YnEp~~Mk~~Li~~GVP~e~I~~D~AGfrT~DS~vRAk~VF~~---~~~iIVTQ~FH~~RAlfiAr 158 (239) T PRK10834 94 QSYNEPMTMRKDLIAAGVDPADIVLDYAGFRTLDSIVRTRKVFDT---NDFIIITQRFHCERALFIAL 158 (239) T ss_pred CCCCCHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHCC---CCEEEEECCHHHHHHHHHHH T ss_conf 889828999999998599899950566765489999999998099---82899946013899999999 No 132 >KOG2360 consensus Probab=21.18 E-value=60 Score=13.56 Aligned_cols=39 Identities=26% Similarity=0.241 Sum_probs=30.4 Q ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89999999998788838750178899999999999999999 Q gi|254780757|r 257 YDQCIKASHIFNLLDARGVISTTERQRYILRIRSLAKACGE 297 (307) Q Consensus 257 YD~~LKaSH~FNLLDARGaISVTERq~YI~RIR~Lak~~A~ 297 (307) =-+.+|+||+=|++.+.|-|--.||.. -|.|.|.+.... T Consensus 222 aapg~KTsH~a~i~~n~gki~afe~d~--~r~~tl~~~l~~ 260 (413) T KOG2360 222 AAPGNKTSHLAAIMRNQGKIYAFERDA--KRAATLRKLLKI 260 (413) T ss_pred CCCCCCHHHHHHHHHCCCCCCHHHHHH--HHHHHHHHHHHH T ss_conf 466532112998862248721323416--889999999987 No 133 >pfam10182 Flo11 Flo11 domain. This presumed domain is found at the N-terminus of the S. cerevisiae Flo11 protein. Flo11 is required for diploid pseudohyphal formation and haploid invasive growth. It belongs to a family of proteins involved in invasive growth, cell-cell adhesion, and mating, many of which can substitute for each other under abnormal conditions. Probab=21.14 E-value=49 Score=14.18 Aligned_cols=77 Identities=21% Similarity=0.275 Sum_probs=47.1 Q ss_pred CCCCCCCCEEEEEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCC---CHHHHCCCCEEEEECCEEEHHHHH Q ss_conf 66775001144445575798335899999999981999621717988328887---001101232176887874147778 Q gi|254780757|r 75 AENPNRLQHYYQFQVIIKPNPLNLQNLYIESLKAVGIDPLIHDVRFVEDNWES---PTLGAWGLGWECWCDGMEISQFTY 151 (307) Q Consensus 75 GeNPNRlq~y~QfQVilKPsp~niq~lYl~SL~~iGid~~~hDIrFveDnWEs---PtLGAwGlGWEvwldGMEItQFTY 151 (307) |+|--+.--++|.++=| .--+|.||+-||++-.....-.+--|=.. -..+.|+.--.|-- T Consensus 31 ~dn~y~VTi~f~~~~s~-------~l~~L~~LkI~G~~~pd~t~vl~~~n~~~~lidnp~d~tat~~V~a---------- 93 (152) T pfam10182 31 GDNLYKVTIHFQTQVSI-------PLKSLKSLKIIGLGSPDSTVVLYGYNENTNLIDNPTDFTATFLVYA---------- 93 (152) T ss_pred CCCEEEEEEEEEECCCC-------CHHHHHHEEEECCCCCCCCEEEEECCCCEEEECCHHHCEEEEEEEE---------- T ss_conf 69849999998642534-------5544102235035899844899968887156358001359999971---------- Q ss_pred HHHHCCCCCCCCHHHHHHH Q ss_conf 9986682577514567878 Q gi|254780757|r 152 FQQVCGIECSPISGEITYG 170 (307) Q Consensus 152 FQQvGGi~c~pv~~EiTYG 170 (307) |..|+.-|-| ..+|.|- T Consensus 94 -~~~~~~~cmP-~FqIqyd 110 (152) T pfam10182 94 -SKHGGLVCLP-NFQIQYD 110 (152) T ss_pred -ECCCCEEECC-CEEEEEE T ss_conf -7589837557-7689999 No 134 >cd00672 CysRS_core This is the catalytic core domain of cysteinyl tRNA synthetase (CysRS). This class I enzyme is a monomer, which aminoacylates the 2'-OH of the nucleotide at the 3' of the appropriate tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains the characteristic class I HIGH and KMSKS motifs, which are involved in ATP binding. Probab=21.08 E-value=60 Score=13.54 Aligned_cols=67 Identities=27% Similarity=0.303 Sum_probs=44.4 Q ss_pred HHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHCCCCEEEEECCEEEHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 89999999998199962171798832888700110123217688787414777899866825775145678789999999 Q gi|254780757|r 98 LQNLYIESLKAVGIDPLIHDVRFVEDNWESPTLGAWGLGWECWCDGMEISQFTYFQQVCGIECSPISGEITYGLERLAMY 177 (307) Q Consensus 98 iq~lYl~SL~~iGid~~~hDIrFveDnWEsPtLGAwGlGWEvwldGMEItQFTYFQQvGGi~c~pv~~EiTYGLERiaMy 177 (307) -.+.|++.+++|||.+..+-.|- | -|+|. ||- T Consensus 90 y~~~f~~d~~~Lni~~p~~~Pra----------------W-------------------HiECs-------------aM~ 121 (213) T cd00672 90 YTAEFFEDMKALNVLPPDVYPRV----------------W-------------------HIECS-------------AMS 121 (213) T ss_pred HHHHHHHHHHHHCCCCCCCCCCE----------------E-------------------EEECH-------------HHH T ss_conf 99999999998289999813545----------------2-------------------01459-------------998 Q ss_pred HCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 8699821133206888788311062447469977763501 Q gi|254780757|r 178 VQNVNSVYDIVFNAIEGQNVLYGDIFAQSEQEYSRYNFEY 217 (307) Q Consensus 178 lQ~vdnvyDl~wn~~~~~~vtYGdif~q~E~E~S~YNFE~ 217 (307) ..-..+-+||--. |+---=-+++||..||.--... T Consensus 122 ~~~lg~~~DIH~G-----G~DL~FPHHeNE~AQs~a~~~~ 156 (213) T cd00672 122 MKYLGETFDIHGG-----GVDLIFPHHENEIAQSEAAFGK 156 (213) T ss_pred HHHHCCCEEEECC-----CCCCCCCCCCCHHHHHHHHHCC T ss_conf 9882896256347-----6356589970499998886589 No 135 >PRK01424 S-adenosylmethionine:tRNA ribosyltransferase-isomerase; Provisional Probab=20.74 E-value=30 Score=15.74 Aligned_cols=25 Identities=28% Similarity=0.505 Sum_probs=15.3 Q ss_pred HHHHHCCCCCCEEEEECCCCCCCCCC Q ss_conf 89874483440167641557887655 Q gi|254780757|r 48 TTLRALGPLSWKAAYVQPSRRPLDGR 73 (307) Q Consensus 48 T~lr~lg~~pw~~aYvqPsrRP~DgR 73 (307) .+++.+....|. |.+.|+||...|. T Consensus 74 ~l~~~l~~~~w~-~l~kp~kk~~~G~ 98 (366) T PRK01424 74 NLNQKLSDNSWS-AFAKPARKLNVGD 98 (366) T ss_pred EEEEECCCCEEE-EEEECCCCCCCCC T ss_conf 987641687489-9995267874699 No 136 >TIGR01838 PHA_synth_I poly(R)-hydroxyalkanoic acid synthase, class I; InterPro: IPR010963 This entry represents the class I subfamily of poly(R)-hydroxyalkanoate synthases, which polymerises hydroxyacyl-CoAs with three to five carbons in the hydroxyacyl backbone into aliphatic esters termed poly(R)-hydroxyalkanoic acids. These polymers accumulate as carbon and energy storage inclusions in many species and can amount to 90 percent of the dry weight of the cell.; GO: 0008415 acyltransferase activity, 0042619 poly-hydroxybutyrate biosynthetic process, 0005737 cytoplasm. Probab=20.55 E-value=27 Score=16.03 Aligned_cols=20 Identities=35% Similarity=0.660 Sum_probs=13.0 Q ss_pred CCCCCCCEEEEECCCCC-CHH Q ss_conf 99621717988328887-001 Q gi|254780757|r 111 IDPLIHDVRFVEDNWES-PTL 130 (307) Q Consensus 111 id~~~hDIrFveDnWEs-PtL 130 (307) ++|..+|=||-.++|+| |-- T Consensus 39 ~~P~~~DrRF~~p~W~~~P~F 59 (541) T TIGR01838 39 AKPEPGDRRFSDPAWSSNPFF 59 (541) T ss_pred ECCCCCCCCCCCCCCCCCCHH T ss_conf 358788888987444237515 No 137 >TIGR01990 bPGM beta-phosphoglucomutase; InterPro: IPR010972 This model represents the beta-phosphoglucomutase enzyme which catalyses the interconversion of beta-D-glucose-1-phosphate and beta-D-glucose-6-phosphate. The 6-phosphate is capable of non-enzymatic anomerization (alpha to beta or vice versa) while the 1-phosphate is not. A separate enzyme is responsible for the isomerisation of the alpha anomers. Beta-D-glucose-1-phosphate results from the phosphorolysis of maltose (2.4.1.8 from EC), trehalose (2.4.1.64 from EC) or trehalose-6-phosphate (2.4.1.216 from EC). Alternatively, these reactions can be run in the synthetic direction to create the disaccharides. All sequenced genomes which contain a member of this family also appear to contain at least one putative maltose or trehalose phosphorylase. Three species, Lactococcus, Enterococcus and Neisseria appear to contain a pair of paralogous beta-PGM's. Beta-phosphoglucomutase is a member of the haloacid dehalogenase superfamily of hydrolase enzymes. These enzymes are characterised by a series of three catalytic motifs positioned within an alpha-beta (Rossman) fold . Beta-PGM contains an inserted alpha helical domain in between the first and second conserved motifs and thus is a member of subfamily IA of the superfamily , . The third catalytic motif comes in three variants, the third of which, containing a conserved DD or ED, is the only one found here as well as in several other related enzymes. The enzyme from Lactococcus lactis has been extensively characterised including a remarkable crystal structure which traps the pentacoordinate transition state . . Probab=20.28 E-value=63 Score=13.43 Aligned_cols=20 Identities=25% Similarity=0.798 Sum_probs=15.3 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCCC Q ss_conf 798335899999999981999621 Q gi|254780757|r 92 KPNPLNLQNLYIESLKAVGIDPLI 115 (307) Q Consensus 92 KPsp~niq~lYl~SL~~iGid~~~ 115 (307) ||.| |++|.-.+.||+.|.+ T Consensus 146 KPdP----EIFL~AA~~LGv~P~~ 165 (190) T TIGR01990 146 KPDP----EIFLAAAEGLGVSPEE 165 (190) T ss_pred CCCH----HHHHHHHHHCCCCHHH T ss_conf 7786----7999999763897557 No 138 >COG3926 zliS Lysozyme family protein [General function prediction only] Probab=20.27 E-value=54 Score=13.88 Aligned_cols=58 Identities=24% Similarity=0.406 Sum_probs=42.1 Q ss_pred CCCHHHHHHHH-HHHHHHCCCEEEE------CCCCCCCCCCCCHHHHH-HHCCCCCCEEEEECCCCCCCCCCCCC Q ss_conf 77799999999-9999866958961------16754333224857898-74483440167641557887655466 Q gi|254780757|r 10 DLSFQNIILTL-TQYWAQQGCTILQ------PYDMEVGAGTFHPSTTL-RALGPLSWKAAYVQPSRRPLDGRYAE 76 (307) Q Consensus 10 ~~~fq~ii~~L-~~fW~~~GC~i~q------pyd~e~GAgT~hp~T~l-r~lg~~pw~~aYvqPsrRP~DgRYGe 76 (307) .||-++.+-.+ .+||.+.+|-.|- -+|.-|-.|--|++++| |+|||.+ --|-||--|. T Consensus 53 ~mT~aeA~~Iy~~~YW~k~~~D~lP~gVd~avfD~AVNsGp~rAa~~LQkal~~~~---------~v~~DGvIG~ 118 (252) T COG3926 53 AMTRAEATAIYREEYWNKVGGDDLPAGVDYAVFDAAVNSGPGRAAKWLQKALGPAY---------TVRVDGVIGA 118 (252) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHCCCC---------CCCCCCCCCH T ss_conf 98899999999998640047676885200456534541684169999999856577---------6665676248 No 139 >PRK13534 7-cyano-7-deazaguanine tRNA-ribosyltransferase; Provisional Probab=20.15 E-value=63 Score=13.41 Aligned_cols=113 Identities=19% Similarity=0.274 Sum_probs=74.3 Q ss_pred CCCCCHHHHCCCCEEEEECC-EEEH--HHHHHHHHCCCCCC-------C---------CHHHHHHHHHHHHHHHCCCCCE Q ss_conf 88870011012321768878-7414--77789986682577-------5---------1456787899999998699821 Q gi|254780757|r 124 NWESPTLGAWGLGWECWCDG-MEIS--QFTYFQQVCGIECS-------P---------ISGEITYGLERLAMYVQNVNSV 184 (307) Q Consensus 124 nWEsPtLGAwGlGWEvwldG-MEIt--QFTYFQQvGGi~c~-------p---------v~~EiTYGLERiaMylQ~vdnv 184 (307) ||..|.+--.| |..++--| .||| +.--||.-=|-|-- | -.+|.|+ +|.--. .+. T Consensus 79 ~~dg~ImTDSG-~fQ~~~yG~vev~~~eiv~fQ~~IGsDI~~~LD~pt~p~~~~e~a~~~v~~T~--~R~~ea----~~~ 151 (630) T PRK13534 79 DFDGPIMTDSG-SFQLSVYGDVEVTNREIIEFQEKIGVDIGTILDIPTPPDVSREKAEEDLEETL--ERAKEA----IEI 151 (630) T ss_pred CCCCCEEECCC-CEEEEEECCEEECHHHHHHHHHHHCCCEEEECCCCCCCCCCHHHHHHHHHHHH--HHHHHH----HHH T ss_conf 89985797788-63777737662069999999998579988875778998989999999999999--999986----520 Q ss_pred EEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH--HHHHHHHH--HCCCC Q ss_conf 13320688878831106244746997776350179899999999999--99999985--00665 Q gi|254780757|r 185 YDIVFNAIEGQNVLYGDIFAQSEQEYSRYNFEYANPEILHNHFIDSE--KECLDLLK--KGIPN 244 (307) Q Consensus 185 yDl~wn~~~~~~vtYGdif~q~E~E~S~YNFE~Ad~~~L~~~F~~yE--~Ea~rLle--~~LP~ 244 (307) -+..||.+- .|-+|-|+..++.++.|..+|+..-+.-.--+.+.|. .-+.-++. .+||. T Consensus 152 ~~~~l~gpv-QGg~y~DLr~~sa~~~~~~~f~~y~iG~~vplme~yry~~l~~~i~~~k~~lp~ 214 (630) T PRK13534 152 RELALNGTV-QGSTYPDLRRKSAELMSKMNFDIYPIGAVVPLMESYRYRDLVDIIINSKMHLPT 214 (630) T ss_pred HCCEEECCC-CCCCCHHHHHHHHHHHHHCCCCEEECCCCCHHHHHCCHHHHHHHHHHHHHCCCC T ss_conf 136463255-688728899999999874798776348850565542589999999999734999 No 140 >cd07641 BAR_ASAP1 The Bin/Amphiphysin/Rvs (BAR) domain of ArfGAP with SH3 domain, ANK repeat and PH domain containing protein 1. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. ASAP1 (ArfGAP with SH3 domain, ANK repeat and PH domain containing protein 1) is also known as DDEF1 (Development and Differentiation Enhancing Factor 1), AMAP1, centaurin beta-4, or PAG2. ASAP1 is an Arf GTPase activating protein (GAP) with activity towards Arf1 and Arf5 but not Arf6 However, it has been shown to bind GTP-Arf6 stably without GAP activity. It has been implicated in cell growth, migration, and survival, as well as in tumor invasion and malignancy. It binds paxillin and cortactin, two components of invadopodia which are essential for tumor invasiveness. It also binds focal adhesion kinase (FAK) and the SH2/SH3 adaptor CrkL. ASAP1 contains an N-terminal BAR domain, followed by a Pleckstrin homology (PH) domain, an Ar Probab=20.14 E-value=48 Score=14.28 Aligned_cols=16 Identities=0% Similarity=0.262 Sum_probs=6.2 Q ss_pred HHHHHHHHHHHHHCCC Q ss_conf 6787899999998699 Q gi|254780757|r 166 EITYGLERLAMYVQNV 181 (307) Q Consensus 166 EiTYGLERiaMylQ~v 181 (307) +|+=++.+.+-.+|.+ T Consensus 60 ~i~~~~~KFs~~~~El 75 (215) T cd07641 60 DLGTAFVKFSTLTKEL 75 (215) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 7889999999999999 Done!