Query gi|254781094|ref|YP_003065507.1| glycyl-tRNA synthetase subunit beta [Candidatus Liberibacter asiaticus str. psy62] Match_columns 702 No_of_seqs 174 out of 1458 Neff 6.3 Searched_HMMs 39220 Date Mon May 30 06:06:26 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781094.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK01233 glyS glycyl-tRNA synt 100.0 0 0 1716.8 71.8 664 1-702 1-686 (687) 2 COG0751 GlyS Glycyl-tRNA synth 100.0 0 0 1690.5 63.4 665 1-702 3-690 (691) 3 pfam02092 tRNA_synt_2f Glycyl- 100.0 0 0 1451.9 56.5 525 4-563 1-549 (549) 4 TIGR00211 glyS glycyl-tRNA syn 100.0 0 0 945.6 45.6 666 3-702 5-743 (743) 5 pfam05746 DALR_1 DALR anticodo 98.7 4.8E-07 1.2E-11 67.3 10.7 105 591-697 3-107 (117) 6 smart00836 DALR_1 DALR anticod 98.5 5.9E-06 1.5E-10 59.9 11.9 108 591-698 3-113 (122) 7 cd07956 Anticodon_Ia_Arg Antic 98.2 2.9E-05 7.4E-10 55.1 10.7 114 583-697 33-146 (156) 8 PRK12704 phosphodiesterase; Pr 98.1 5.2E-05 1.3E-09 53.4 10.1 111 361-483 250-361 (455) 9 COG0018 ArgS Arginyl-tRNA synt 98.1 9.7E-05 2.5E-09 51.6 11.1 117 579-697 450-567 (577) 10 PRK01611 argS arginyl-tRNA syn 98.0 0.0015 3.7E-08 43.5 16.3 114 582-697 446-560 (570) 11 PRK00106 hypothetical protein; 98.0 0.00012 3.1E-09 50.9 10.4 111 361-483 330-441 (535) 12 PRK12705 hypothetical protein; 98.0 0.00012 3E-09 51.0 10.2 111 361-483 279-390 (485) 13 TIGR00456 argS arginyl-tRNA sy 97.9 0.00047 1.2E-08 46.8 12.4 194 487-697 383-590 (600) 14 TIGR03319 YmdA_YtgF conserved 97.9 0.00015 3.8E-09 50.3 9.6 193 232-483 227-420 (514) 15 PRK12451 arginyl-tRNA syntheta 97.7 0.0055 1.4E-07 39.5 14.5 110 582-697 443-552 (562) 16 PRK10119 hypothetical protein; 96.9 0.037 9.4E-07 33.9 14.5 125 381-508 21-163 (231) 17 COG1418 Predicted HD superfami 96.9 0.015 3.7E-07 36.6 9.3 130 379-510 30-176 (222) 18 LOAD_HD consensus 96.9 0.005 1.3E-07 39.8 6.9 98 387-486 2-114 (114) 19 pfam01966 HD HD domain. HD dom 96.0 0.075 1.9E-06 31.8 8.6 99 387-486 2-108 (108) 20 PRK07152 nadD putative nicotin 96.0 0.11 2.7E-06 30.8 9.3 95 387-486 199-310 (343) 21 COG1713 Predicted HD superfami 95.7 0.097 2.5E-06 31.0 8.1 108 386-510 18-146 (187) 22 PRK13300 tRNA CCA-pyrophosphor 94.3 0.44 1.1E-05 26.5 11.0 37 364-405 153-189 (446) 23 KOG1195 consensus 94.0 0.49 1.2E-05 26.2 11.2 110 585-694 445-554 (567) 24 TIGR03671 cca_archaeal CCA-add 93.7 0.55 1.4E-05 25.9 10.2 35 365-404 149-183 (408) 25 KOG4426 consensus 93.3 0.65 1.7E-05 25.4 8.1 112 586-697 527-646 (656) 26 smart00471 HDc Metal dependent 93.2 0.67 1.7E-05 25.3 10.1 106 384-491 3-120 (124) 27 COG1746 CCA1 tRNA nucleotidylt 89.1 1.7 4.3E-05 22.6 8.4 54 367-425 155-215 (443) 28 PRK12703 tRNA 2'-O-methylase; 89.0 1.7 4.3E-05 22.5 9.3 95 385-483 187-292 (339) 29 KOG2796 consensus 87.1 2.2 5.6E-05 21.8 12.7 26 662-687 272-297 (366) 30 cd00077 HDc Metal dependent ph 86.8 2.3 5.9E-05 21.6 10.8 104 385-489 2-128 (145) 31 PRK11893 methionyl-tRNA synthe 86.7 2.3 5.9E-05 21.6 11.4 95 588-688 355-452 (512) 32 pfam11348 DUF3150 Protein of u 83.3 3.3 8.4E-05 20.6 7.8 73 408-487 122-213 (257) 33 cd07957 Anticodon_Ia_Met Antic 83.0 3.4 8.6E-05 20.5 10.6 83 588-676 8-94 (129) 34 PRK12268 methionyl-tRNA synthe 82.4 3.6 9.1E-05 20.3 12.4 75 589-669 390-464 (558) 35 pfam03029 ATP_bind_1 Conserved 75.6 5.3 0.00013 19.2 4.3 66 414-482 158-230 (234) 36 PRK00133 metG methionyl-tRNA s 73.7 6.3 0.00016 18.6 10.8 27 299-325 316-342 (666) 37 pfam08668 HDOD HDOD domain. 70.5 7.4 0.00019 18.2 9.3 81 381-461 90-196 (196) 38 PRK13902 alaS lanyl-tRNA synth 65.4 9.3 0.00024 17.5 9.2 50 14-65 52-105 (898) 39 pfam03651 consensus 65.4 9.3 0.00024 17.5 5.8 75 593-674 81-155 (193) 40 pfam01411 tRNA-synt_2c tRNA sy 62.5 11 0.00027 17.1 9.2 94 474-573 264-358 (545) 41 PRK12267 methionyl-tRNA synthe 61.0 11 0.00028 16.9 11.4 24 300-323 289-312 (644) 42 COG2206 c-di-GMP phosphodieste 60.9 11 0.00029 16.9 9.9 99 387-487 150-273 (344) 43 COG3437 Response regulator con 58.3 12 0.00032 16.6 7.4 107 380-488 180-311 (360) 44 COG1639 Predicted signal trans 55.7 14 0.00035 16.4 12.2 187 243-482 23-237 (289) 45 COG2316 Predicted hydrolase (H 54.4 14 0.00036 16.2 7.6 141 388-564 50-190 (212) 46 PRK03826 hypothetical protein; 54.3 14 0.00037 16.2 6.3 136 367-507 10-159 (195) 47 cd04801 CBS_pair_M50_like This 52.0 12 0.0003 16.8 2.3 31 277-308 14-45 (114) 48 TIGR00398 metG methionyl-tRNA 51.6 16 0.0004 15.9 12.4 72 621-692 443-520 (573) 49 KOG2531 consensus 48.3 13 0.00034 16.4 2.2 89 382-478 183-292 (545) 50 TIGR00425 CBF5 rRNA pseudourid 47.9 4.8 0.00012 19.4 -0.1 16 207-222 206-221 (326) 51 KOG1686 consensus 47.6 16 0.00042 15.8 2.5 76 129-209 53-131 (151) 52 TIGR02702 SufR_cyano iron-sulf 44.5 13 0.00032 16.6 1.6 22 549-570 85-107 (215) 53 pfam06275 DUF1031 Protein of u 43.6 21 0.00053 15.1 4.3 57 78-146 8-64 (80) 54 pfam00599 Flu_M2 Influenza Mat 42.8 21 0.00054 15.0 2.6 45 493-547 16-60 (97) 55 COG0317 SpoT Guanosine polypho 41.9 22 0.00056 14.9 4.2 75 415-520 377-451 (701) 56 pfam10490 Rb-bdg_C_Cenp-F Rb-b 41.7 5.1 0.00013 19.3 -0.8 18 494-511 27-44 (49) 57 pfam02728 Cu_amine_oxidN3 Copp 39.5 24 0.00061 14.7 2.7 28 154-183 69-96 (101) 58 PRK07568 aspartate aminotransf 38.5 25 0.00063 14.6 3.3 38 199-247 165-202 (396) 59 COG2838 Icd Monomeric isocitra 36.5 13 0.00034 16.4 0.7 85 5-93 8-109 (744) 60 KOG3066 consensus 34.5 28 0.00073 14.2 3.0 164 378-582 41-235 (271) 61 COG5514 Uncharacterized conser 34.4 29 0.00073 14.2 4.1 43 43-91 118-160 (203) 62 TIGR00277 HDIG uncharacterized 34.1 29 0.00074 14.1 7.2 76 385-460 4-103 (104) 63 COG0712 AtpH F0F1-type ATP syn 33.3 30 0.00076 14.0 4.7 15 506-520 9-23 (178) 64 TIGR02152 D_ribokin_bact ribok 33.2 30 0.00076 14.0 2.1 64 197-260 114-184 (303) 65 pfam10011 DUF2254 Predicted me 32.5 31 0.00078 13.9 2.5 25 483-507 253-278 (369) 66 PRK10905 hypothetical protein; 31.9 31 0.0008 13.9 3.9 33 79-113 253-285 (323) 67 PRK09775 hypothetical protein; 30.2 6.5 0.00017 18.5 -1.8 52 228-279 144-195 (443) 68 COG0013 AlaS Alanyl-tRNA synth 29.5 34 0.00087 13.6 7.4 48 20-69 6-57 (879) 69 pfam03971 IDH Monomeric isocit 29.2 35 0.00088 13.6 1.8 82 10-95 7-105 (735) 70 TIGR00718 sda_alpha L-serine d 29.1 14 0.00036 16.2 -0.2 16 507-522 3-18 (301) 71 TIGR01817 nifA Nif-specific re 28.7 35 0.0009 13.5 1.9 203 216-466 11-265 (574) 72 TIGR02068 cya_phycin_syn cyano 28.4 18 0.00047 15.4 0.3 102 408-509 410-547 (876) 73 pfam04842 DUF639 Plant protein 27.9 36 0.00093 13.4 2.0 34 411-444 280-313 (682) 74 COG5200 LUC7 U1 snRNP componen 26.8 38 0.00097 13.3 3.7 71 413-491 136-209 (258) 75 PRK10880 adenine DNA glycosyla 26.7 38 0.00097 13.3 2.1 90 279-405 10-101 (350) 76 pfam10508 Proteasom_PSMB Prote 26.0 35 0.00088 13.6 1.3 19 206-224 77-95 (503) 77 KOG0995 consensus 25.8 26 0.00067 14.4 0.7 60 424-493 335-400 (581) 78 TIGR00169 leuB 3-isopropylmala 25.6 40 0.001 13.1 2.8 98 388-499 168-273 (370) 79 PRK09692 integrase; Provisiona 25.1 41 0.001 13.1 4.2 61 540-600 341-403 (413) 80 pfam07075 DUF1343 Protein of u 24.6 23 0.00058 14.8 0.2 26 132-162 174-199 (359) 81 KOG2023 consensus 24.3 42 0.0011 13.0 5.8 66 156-233 169-241 (885) 82 COG1984 DUR1 Allophanate hydro 24.2 42 0.0011 13.0 1.5 22 46-67 63-84 (314) 83 pfam00310 GATase_2 Glutamine a 23.9 23 0.00059 14.8 0.1 19 287-306 193-211 (223) 84 pfam06425 consensus 23.8 43 0.0011 12.9 7.2 97 536-639 70-167 (170) 85 COG4245 TerY Uncharacterized p 23.6 15 0.00038 16.1 -0.9 82 38-126 44-126 (207) 86 COG4087 Soluble P-type ATPase 23.5 27 0.00069 14.3 0.4 21 496-516 100-120 (152) 87 PRK12293 hisZ ATP phosphoribos 23.3 44 0.0011 12.9 1.8 19 20-38 20-38 (281) 88 pfam08218 Citrate_ly_lig Citra 23.2 17 0.00043 15.7 -0.7 127 183-332 6-135 (182) 89 cd03237 ABC_RNaseL_inhibitor_d 23.0 45 0.0011 12.8 3.4 66 219-285 146-211 (246) 90 PRK00767 transcriptional regul 22.6 45 0.0012 12.8 2.7 25 218-242 4-28 (197) 91 pfam10095 DUF2333 Uncharacteri 22.6 37 0.00094 13.4 0.9 64 623-686 253-317 (337) 92 pfam10678 DUF2492 Protein of u 22.4 42 0.0011 13.0 1.2 45 511-555 8-73 (78) 93 cd00532 MGS-like MGS-like doma 22.4 22 0.00056 14.9 -0.2 30 482-511 68-97 (112) 94 KOG0139 consensus 22.0 46 0.0012 12.7 1.8 37 453-509 136-172 (398) 95 cd00365 HMG-CoA_reductase Hydr 21.9 47 0.0012 12.7 2.3 67 221-289 9-89 (376) 96 COG3263 NhaP-type Na+/H+ and K 21.6 34 0.00087 13.6 0.6 31 363-393 57-87 (574) 97 pfam08006 DUF1700 Protein of u 21.6 23 0.00059 14.8 -0.3 17 446-462 51-67 (181) 98 TIGR00721 tfx DNA-binding prot 21.4 48 0.0012 12.6 3.0 64 536-604 5-70 (142) 99 KOG1989 consensus 21.3 34 0.00086 13.7 0.5 49 219-267 141-190 (738) 100 pfam02572 CobA_CobO_BtuR ATP:c 21.2 26 0.00066 14.4 -0.1 29 261-290 132-160 (172) 101 pfam10660 MitoNEET_N Iron-cont 21.1 49 0.0012 12.6 2.5 24 119-142 3-26 (64) 102 cd07129 ALDH_KGSADH Alpha-Keto 20.9 49 0.0012 12.6 1.4 28 471-498 387-414 (454) 103 TIGR01283 nifE nitrogenase MoF 20.4 48 0.0012 12.6 1.1 25 171-195 156-180 (470) 104 PRK09440 avtA valine--pyruvate 20.3 50 0.0013 12.5 3.5 39 198-247 181-219 (416) 105 TIGR00004 TIGR00004 endoribonu 20.2 51 0.0013 12.4 3.9 44 10-59 38-82 (129) 106 pfam06090 Ins_P5_2-kin Inosito 20.2 51 0.0013 12.4 1.7 15 261-275 118-132 (301) 107 smart00437 TOP1Ac Bacterial DN 20.0 51 0.0013 12.4 2.8 54 431-484 16-72 (259) No 1 >PRK01233 glyS glycyl-tRNA synthetase subunit beta; Validated Probab=100.00 E-value=0 Score=1716.76 Aligned_cols=664 Identities=39% Similarity=0.671 Sum_probs=643.8 Q ss_pred CCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCEEEEEECCCCCCCCCEEEEECCCCC----- Q ss_conf 984016400201698999999999999999999976998033588630745899984954207510023027898----- Q gi|254781094|r 1 MPDFLLEIYSEEIPARMQHKAAEDLSIILIGLLKEEGVIYQNMRQYWTPHRLFLYLKGLSPHSPEKIEERLGPRV----- 75 (702) Q Consensus 1 M~~lLlEIgtEElPa~~~~~~~~~l~~~~~~~l~~~~l~~~~i~~~~TPRRLav~i~~l~~~q~d~~~e~kGP~~----- 75 (702) |+|||||||||||||+++..+.+||++.+.+.|++++|+|+++++|+|||||||+|+|++++|+|.++++||||+ T Consensus 1 m~~lLlEIgtEElPa~~~~~a~~~l~~~~~~~l~~~~l~~~~i~~~~TPRRLav~v~~l~~~q~d~~~e~kGP~v~~afd 80 (687) T PRK01233 1 MKDLLLELGTEELPAKALRKLAEQLAEKVTAELEEAGLSFEEIKVFATPRRLALLVEGLAEKQPDREEEKKGPAVKIAFD 80 (687) T ss_pred CCCCHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCEEEEEECCCCCCCCCHHHEEECCCHHHHCC T ss_conf 96413415342478899999999999999999998799878606998587878887586311866133413764777547 Q ss_pred --CCCCHHHHHHHHHCCCCHHHHHHEEEECCCEEEEEEEEECCCCCHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCCCE Q ss_conf --589878998999779982443320340698289999991588509999999999998628985131037765676741 Q gi|254781094|r 76 --GAGKKAIDGFLRSTGLQTISDCQIKKDPKKGDVYLAVLRKPKRLIEDVLKTIVPVAIQKVPWPKSMRWSTTHSPISAF 153 (702) Q Consensus 76 --g~pt~A~~GF~ks~gi~~~~~~~i~~~~~kg~~~~~~~~~~g~~~~eiL~~ii~~~i~~l~~pKsMrWg~~~~~~~~~ 153 (702) |+||+|++||+||+|++. +++. .++++||+|||+++..+|+++.++|+++++++|+++||||+||||+++ + T Consensus 81 ~~G~ptkA~~GF~k~~gi~~-~dL~-~~~~~kgey~~~~~~~kg~~~~elL~~ii~~~i~~l~~pKsMRWg~~~-----~ 153 (687) T PRK01233 81 ADGKPTKAAEGFARGQGITV-DDLE-RRETKKGEYLFYRKEVPGRPTKELLPEIVPEALAKLPIPKSMRWGNND-----L 153 (687) T ss_pred CCCCCCHHHHHHHHHCCCCH-HHCE-EEECCCEEEEEEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCC-----C T ss_conf 67895399999999879999-9917-787199729999985688429999999999999769998300279999-----8 Q ss_pred EEEEECCEEEEEECCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCEEEECHHHHHHHHCCCCEEECHHHHHHHHHHHHH Q ss_conf 66530320567863677787231024458221111022344577856631679999862027847169999999999999 Q gi|254781094|r 154 SWIRPLKSILCILVAEDAKEKIIDLDLKEIPCGNITYGHRFHAPHPIKVQSLDHYIRDLEEAMVLLDPECRRNAILNDAH 233 (702) Q Consensus 154 ~FvRPirwIvaLl~~~~~~~~vi~~~~~gi~s~n~T~GHRf~~~~~i~I~~a~~Y~~~L~~~~Vi~d~~~R~~~I~~~i~ 233 (702) +|+||||||+||| |++||||+++|++|||+||||||+++++|.|++|++|++.|++++||+|+++||++|+++++ T Consensus 154 ~F~RPIrwIvaLl-----d~~vi~~~~~gi~s~~~t~GHRfl~~~~i~I~~~~~Y~~~L~~~~Vivd~~eRk~~I~~~i~ 228 (687) T PRK01233 154 RFVRPVHWLVALL-----GDEVIPFEILGIKSGRVTRGHRFLGPGEFTIDNADDYEEKLREAFVIADPEERKAIILAQIE 228 (687) T ss_pred EEECHHHHHHHHC-----CCEEEEEEEEEEECCCEECCCCCCCCCCEEECCHHHHHHHHHHCCEECCHHHHHHHHHHHHH T ss_conf 5545279898870-----88167899951452541125534688885518999999998519898369999999999999 Q ss_pred HHHHHCCCEECCCHHHHHHHHHHCCCCEEEEEEECHHHHCCCHHHHHHHHHHHCCCEEEECCCCCCCCEEEEEECCCCCC Q ss_conf 98885398214666788788744089846888307777128989999999840011365536886132489850577655 Q gi|254781094|r 234 RLASAVGLELVEDKDLLEEIIGLVEWVQVFMGSFDKKYLCLPEELIRLTIKTNQKCFVTRTREGVLANCFILVSNIQASD 313 (702) Q Consensus 234 ~~a~~~~~~~~~d~~Ll~Ev~~LvE~P~~~~G~F~~~fL~lP~evl~~~m~~hQkyf~~~~~~g~l~~~Fi~v~N~~~~~ 313 (702) ++|+++|++++.|++|||||++|||||++++|+|+++||+||+|||+|+|++||||||++|++|+|+||||+|||+.+.+ T Consensus 229 ~~a~~~~~~~~~d~~Ll~Ev~~LvE~P~~~~g~f~~~fL~lP~evl~~~m~~hQkyf~~~~~~g~l~~~Fi~vsN~~~~~ 308 (687) T PRK01233 229 ELAAANGGEADIDEDLLEEVTNLVEWPVVLLGSFEEKFLEVPQEVLITTMKEHQKYFPVRDQAGKLLPNFITVSNIESKD 308 (687) T ss_pred HHHHHCCCEECCCHHHHHHHHHHCCCCEEEEECCCHHHHCCCHHHHHHHHHHHHHHEEEECCCCCEEEEEEEEECCCCCC T ss_conf 99998696786888999999863678857885338998429989999999973531257889998832478860688667 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCEEHHHHHHHHHHHHHHH Q ss_conf 43024567889978764225999850300011111111110011220137889988421002303134551389989899 Q gi|254781094|r 314 GGAAIVQGNSRVVAARLEDALHFWKRDQNNLPNLSSLKESALKFNLDLSKPLDQRMARLDMLDVVFHAKIGTQGERVSRI 393 (702) Q Consensus 314 ~~~~v~~GnerVl~ARL~DA~FF~~~D~~~~~~~~~~~~~~~~~~~dlk~~L~~~~~~L~~~~v~F~~~LGs~~dK~~Ri 393 (702) .+.|++||||||+|||+||+|||++| +|+||++|+++|+ +||||++||||+|||+|+ T Consensus 309 -~~~I~~GNErVL~ARL~DA~FF~~~D--------------------~k~~L~~~~~~L~--~v~F~~~LGsl~dK~~Ri 365 (687) T PRK01233 309 -PENIIKGNEKVLRARLSDAEFFFKED--------------------LKKPLEDRLPKLE--NVTFHEKLGTLADKVERI 365 (687) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHH--------------------CCCCHHHHHHHHH--HEEHHHCCCCHHHHHHHH T ss_conf -78889839999998862369999741--------------------5788788888753--033055137799999999 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCHH Q ss_conf 99999988652999889999999742367888984034578999999986236998899999997344776777886358 Q gi|254781094|r 394 RVLGKKIAQLIDADVALVDRAIVLSKADLCTEIVREFPELQGKIGKEYAVLQNENISCCDAIEEHLKPRGPLENVPTNKI 473 (702) Q Consensus 394 ~~la~~la~~l~~~~~~~~rAa~L~KaDL~t~mV~EFpeLQG~mG~~YA~~~ge~~~Va~aI~ehY~P~~~~d~lP~~~~ 473 (702) ++|+.+|+..++.+.+++.|||.||||||+|+||+|||||||+||+|||+++||+++||.||+|||+|+|++|.||+|++ T Consensus 366 ~~la~~ia~~l~~~~~~~~rAa~L~KaDL~T~mV~EFPELQGiMG~~YA~~~Ge~~~Va~AI~EhY~P~~~~d~lP~~~~ 445 (687) T PRK01233 366 RALAGYIAPQIGADVALAARAALLAKADLVTEMVGEFPELQGIMGEYYARLDGEDEAVALAIEEHYLPRFAGDELPSTPV 445 (687) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCHH T ss_conf 99999999874978999999999986121323450538776789999999869997899999998289999999999789 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH---------------HHHHHHH Q ss_conf 8999999749989876510567687769688887434468999837997887896303---------------6499999 Q gi|254781094|r 474 SITVSLADKLDTLINFWAINEKPSGSKDPYALRRATLGIIRIILENKIDIPLSQFIED---------------QNLILFF 538 (702) Q Consensus 474 g~ilsiaDKlDtl~g~f~ig~~PTGS~DPfaLRR~a~GiirIl~e~~l~l~l~~li~~---------------~~l~~Fi 538 (702) |++||||||+|||+|||++|++||||+|||||||+|+||||||++++|+++|.+++++ +++.+|| T Consensus 446 g~ilsiADKlDtL~g~f~ig~~PTGS~DPfaLRR~a~GiirIi~~~~~~l~l~~li~~~~~~~~~~~~~~~~~~~l~~F~ 525 (687) T PRK01233 446 GAAVALADKLDTLVGIFGIGLIPTGSKDPFALRRAALGVLRILLEKGLPLDLVELLEEAVALYGDKLTNANVADDVLDFF 525 (687) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 89999999998999999558865899981888999989999999638999899999999998644135445799999999 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHH Q ss_conf 99999987633888789988541689998999999999999871951388999999999985565312467777738568 Q gi|254781094|r 539 HDRLKLYLHDRDIRHDLIEAILRPENDNLLTIVDLIKHLNEFFSSAKGEKFLLSAKRIFQILAIEEKKNREISLEISPQY 618 (702) Q Consensus 539 ~~Rl~~~L~e~G~~~DiI~AVl~~~~~~~~~i~~r~~aL~~f~~~~~~~~l~~a~KRv~NIL~~~~k~~~~~~~~vd~~L 618 (702) .+|++++|+++||++|+|+||++.+.++++++..|+++|++|+++++|++++++|||+.||+++++. .....+|++| T Consensus 526 ~~Rl~~~l~~~g~~~dii~AVl~~~~~~~~~~~~r~~aL~~~~~~~~~~~l~~a~kRv~NIl~k~~~---~~~~~vd~~L 602 (687) T PRK01233 526 LDRLRALLQDEGIRVDVIDAVLARRPTDLLDFDARVEALQAFRTLPEAEALAAANKRVSNILKKAEG---KLSGAVDPSL 602 (687) T ss_pred HHHHHHHHHHCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCC---CCCCCCCHHH T ss_conf 9999999873799999999998079999999999999999987593289999999999999861666---5678759677 Q ss_pred HCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCH Q ss_conf 35989999999999999999998502538999999985327888751027036886899999999999999999860684 Q gi|254781094|r 619 LLLEAEKRLYAVISDFGTHIQESMDHKRYHQIGDLLHSICEPIEIFFDQVLVNVDDREVRDNRLSLLQYIKNIILIVINV 698 (702) Q Consensus 619 f~~~~Ek~L~~~~~~~~~~i~~~l~~~dy~~~L~~L~~L~~~Id~FFDnVmVm~eD~~IR~NRL~LL~~i~~~f~~iaDf 698 (702) |++++|++||+++.++.+++..+++.++|..++..|++|++|||+|||+|||||||++||+|||+||+.|+++|..+||| T Consensus 603 f~~~~E~~L~~~~~~~~~~~~~~~~~~~y~~~l~~L~~L~~~Id~FFD~VmVm~~d~~iR~NRL~LL~~i~~lf~~iaDf 682 (687) T PRK01233 603 FEEDAEKALYEALQALRPAVQPAFAAGDYQAALEALAALRPPVDAFFDNVMVMAEDEALRANRLALLSKLRELFLKVADI 682 (687) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEECCCCHHHHHHHHHHHHHHHHHHHHHCCH T ss_conf 37999999999999999999988754359999999997404899984889773898899999999999999999875275 Q ss_pred HHCC Q ss_conf 5219 Q gi|254781094|r 699 QKIV 702 (702) Q Consensus 699 s~Lv 702 (702) |+|- T Consensus 683 S~L~ 686 (687) T PRK01233 683 SLLQ 686 (687) T ss_pred HHHC T ss_conf 8406 No 2 >COG0751 GlyS Glycyl-tRNA synthetase, beta subunit [Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=1690.53 Aligned_cols=665 Identities=41% Similarity=0.680 Sum_probs=642.2 Q ss_pred CCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCEEEEEECCCCCCCCCEEEEECCCCC----- Q ss_conf 984016400201698999999999999999999976998033588630745899984954207510023027898----- Q gi|254781094|r 1 MPDFLLEIYSEEIPARMQHKAAEDLSIILIGLLKEEGVIYQNMRQYWTPHRLFLYLKGLSPHSPEKIEERLGPRV----- 75 (702) Q Consensus 1 M~~lLlEIgtEElPa~~~~~~~~~l~~~~~~~l~~~~l~~~~i~~~~TPRRLav~i~~l~~~q~d~~~e~kGP~~----- 75 (702) |++||+|||||||||+++.++.+||.+.+.+.|++++|+|+++++|+||||||++|+|++..|+|.++|+|||++ T Consensus 3 ~~~lLlEigtEELPA~~l~~~~~~l~~~~~~~L~~~~l~~~~v~~f~tPRRLal~v~~la~~q~d~~eE~kGP~~~~a~d 82 (691) T COG0751 3 MKDLLLEIGTEELPAKALRSAAEQLAKKFTAGLKEAGLSFEGVEVFATPRRLALLVKGLAEAQPDREEEKKGPPVKAAFD 82 (691) T ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEECCCCEEEEEECCCCCCCCCHHHHHCCCCCCCCCC T ss_conf 45579985412178798999999999999999998497711207831771455420476101645466612875014333 Q ss_pred --CCCCHHHHHHHHHCCCCHHHHHHEEEECCCEEEEEEEEE-CCCCCHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCCC Q ss_conf --589878998999779982443320340698289999991-58850999999999999862898513103776567674 Q gi|254781094|r 76 --GAGKKAIDGFLRSTGLQTISDCQIKKDPKKGDVYLAVLR-KPKRLIEDVLKTIVPVAIQKVPWPKSMRWSTTHSPISA 152 (702) Q Consensus 76 --g~pt~A~~GF~ks~gi~~~~~~~i~~~~~kg~~~~~~~~-~~g~~~~eiL~~ii~~~i~~l~~pKsMrWg~~~~~~~~ 152 (702) |+||+|++||+|+||++ .++..+.++++||++|++..+ .+|++++++||++++++|+++||||+||||+++ T Consensus 83 ~~g~~TkA~eGF~r~~Gvs-~e~~~~~~~~k~G~~y~~~~~~~~g~~t~~iL~~i~~~~i~~l~~pK~MrWg~~~----- 156 (691) T COG0751 83 ADGKPTKAAEGFARGQGVS-VEDLERRKDDKKGEEYVYRVKKEEGQPTEELLPEIVPEAIASLPFPKSMRWGSKD----- 156 (691) T ss_pred CCCCCCHHHHHHHHHCCCC-HHHHHHHCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCCCCEEECCCC----- T ss_conf 5899379999999877999-8994343027788589999850578718999999999999759898750457987----- Q ss_pred EEEEEECCEEEEEECCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCEEEECHHHHHHHHCCCCEEECHHHHHHHHHHHH Q ss_conf 16653032056786367778723102445822111102234457785663167999986202784716999999999999 Q gi|254781094|r 153 FSWIRPLKSILCILVAEDAKEKIIDLDLKEIPCGNITYGHRFHAPHPIKVQSLDHYIRDLEEAMVLLDPECRRNAILNDA 232 (702) Q Consensus 153 ~~FvRPirwIvaLl~~~~~~~~vi~~~~~gi~s~n~T~GHRf~~~~~i~I~~a~~Y~~~L~~~~Vi~d~~~R~~~I~~~i 232 (702) ++|+||||||+||| |++|+|+++.|++|||+||||||++++.++|++|++|.+.|++++||+|+++||++|..++ T Consensus 157 ~~fiRPi~~iv~L~-----g~evip~~i~gi~s~~~s~GHRfl~~~~i~I~~~~~Y~~~L~~~~VI~d~~eRk~~I~~~i 231 (691) T COG0751 157 VRFIRPIHWIVALL-----GDEVIPFEILGIKSGRITRGHRFLGPGEITIDSADDYVEKLREGKVIADPEERKAIILEQI 231 (691) T ss_pred EEEECCHHHHHHHH-----CCEEEEEEEEEEEECCCCCCEECCCCCCEEECCHHHHHHHHHHCCEECCHHHHHHHHHHHH T ss_conf 18853055666660-----8845200451166246332200157886361799999999974877477899999999999 Q ss_pred HHHHHHCCCEECCCHHHHHHHHHHCCCCEEEEEEECHHHHCCCHHHHHHHHHHHCCCEEEECCCCCCCCEEEEEECCCCC Q ss_conf 99888539821466678878874408984688830777712898999999984001136553688613248985057765 Q gi|254781094|r 233 HRLASAVGLELVEDKDLLEEIIGLVEWVQVFMGSFDKKYLCLPEELIRLTIKTNQKCFVTRTREGVLANCFILVSNIQAS 312 (702) Q Consensus 233 ~~~a~~~~~~~~~d~~Ll~Ev~~LvE~P~~~~G~F~~~fL~lP~evl~~~m~~hQkyf~~~~~~g~l~~~Fi~v~N~~~~ 312 (702) ..++.++|+.++.|++|||||++|||||+|++|+|+++||+||+|||||+|++||||||++|++|+|+|+||+|||+.+. T Consensus 232 ~~~a~~~~~~~~~de~LLeEV~~LvE~P~vl~g~Fee~FL~lP~EvlittMk~hQkYFpv~~~~g~L~p~FI~VsN~~~~ 311 (691) T COG0751 232 EELASKLGGVADIDEDLLEEVTALVEYPTVLLGKFEEKFLELPEEVLITTMKEHQKYFPVFDQGGKLLPHFIFVSNGEPK 311 (691) T ss_pred HHHHHHCCCEECCCHHHHHHHHHCCCCCHHEEECCCHHHHCCCHHHHHHHHHHCCEEEEEECCCCCCCCCEEEEECCCCC T ss_conf 99998629862577588999876025770003236699873889999999985553667888998705655999789988 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCEEHHHHHHHHHHHHHH Q ss_conf 54302456788997876422599985030001111111111001122013788998842100230313455138998989 Q gi|254781094|r 313 DGGAAIVQGNSRVVAARLEDALHFWKRDQNNLPNLSSLKESALKFNLDLSKPLDQRMARLDMLDVVFHAKIGTQGERVSR 392 (702) Q Consensus 313 ~~~~~v~~GnerVl~ARL~DA~FF~~~D~~~~~~~~~~~~~~~~~~~dlk~~L~~~~~~L~~~~v~F~~~LGs~~dK~~R 392 (702) | .+.|++||||||||||+||+|||++| +|+||++++++|+ +||||++|||++|||+| T Consensus 312 d-~~~Ii~GNEKVlraRLsDA~FF~k~D--------------------lk~~L~~~~~kL~--~VtFh~kLGTl~dKv~R 368 (691) T COG0751 312 D-PENIIEGNEKVLRARLSDAEFFFKED--------------------LKKPLESRLPKLK--TVTFHEKLGTLADKVER 368 (691) T ss_pred C-HHHHHCCCCEEEEECCCHHHHHHHHH--------------------HCCCHHHHHHHHC--EEEEHHHCCCHHHHHHH T ss_conf 8-66743166255410103388888755--------------------2056777777541--46513721658899999 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCH Q ss_conf 99999998865299988999999974236788898403457899999998623699889999999734477677788635 Q gi|254781094|r 393 IRVLGKKIAQLIDADVALVDRAIVLSKADLCTEIVREFPELQGKIGKEYAVLQNENISCCDAIEEHLKPRGPLENVPTNK 472 (702) Q Consensus 393 i~~la~~la~~l~~~~~~~~rAa~L~KaDL~t~mV~EFpeLQG~mG~~YA~~~ge~~~Va~aI~ehY~P~~~~d~lP~~~ 472 (702) ++.||.+||+.++.+.+++.|||.||||||+|+||+|||||||+||+|||+++||+++||.||+|||+|||+||+||+|+ T Consensus 369 i~~iA~~ia~~l~~d~~~~~rAa~l~KaDL~T~mV~EFpELQGiMG~~YA~~~Ge~~~VA~Ai~EhY~Pr~~gd~lP~t~ 448 (691) T COG0751 369 IKKIAAYIAPQLGADVEDADRAALLAKADLVTEMVGEFPELQGIMGEYYALHDGEDEEVALAIEEHYLPRFAGDELPSTP 448 (691) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCCCCH T ss_conf 99999999987285799999999997465667665588155377779999876998899999998438888898899853 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH---------------HHHHHH Q ss_conf 88999999749989876510567687769688887434468999837997887896303---------------649999 Q gi|254781094|r 473 ISITVSLADKLDTLINFWAINEKPSGSKDPYALRRATLGIIRIILENKIDIPLSQFIED---------------QNLILF 537 (702) Q Consensus 473 ~g~ilsiaDKlDtl~g~f~ig~~PTGS~DPfaLRR~a~GiirIl~e~~l~l~l~~li~~---------------~~l~~F 537 (702) +|++||||||||||+|||+||++||||+|||||||+|+||+|||++++|++++.++++. +++++| T Consensus 449 ~g~~valADKLDTLvg~F~iG~iPtGSkDPyALRRaAlGiirIi~~~~l~l~l~~ll~~~~~~~~~~~~~~~~~~~l~~F 528 (691) T COG0751 449 VGAVVALADKLDTLVGFFGIGLIPTGSKDPYALRRAALGILRIILEKNLDLDLEELLDKAVASFKSKLTNAKVLEELLDF 528 (691) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 77899999888788898854677589999588889986799999971799889999999999864430240039999999 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHH Q ss_conf 99999998763388878998854168999899999999999987195138899999999998556531246777773856 Q gi|254781094|r 538 FHDRLKLYLHDRDIRHDLIEAILRPENDNLLTIVDLIKHLNEFFSSAKGEKFLLSAKRIFQILAIEEKKNREISLEISPQ 617 (702) Q Consensus 538 i~~Rl~~~L~e~G~~~DiI~AVl~~~~~~~~~i~~r~~aL~~f~~~~~~~~l~~a~KRv~NIL~~~~k~~~~~~~~vd~~ 617 (702) |.+|++.||+++||++|+|+||++.+.++|+++..|++||++|+..++++.+++++||++||+++.++ ..+..||++ T Consensus 529 ~~~Rl~~~l~d~g~~~dii~AVL~~~~~~l~~i~~r~~al~~f~~~~~~~~l~~a~kRv~nIl~k~~~---~~~~~v~~~ 605 (691) T COG0751 529 FLGRLRTYLQDEGYRKDIIDAVLALNPTDLLDIIARAEALQEFLDLPEAKALAAANKRVSNILAKAEK---KLSGTVDPS 605 (691) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCC---CCCCCCCHH T ss_conf 99999999872599889999997479997799999999999986292158999999899999873455---567766967 Q ss_pred HHCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 83598999999999999999999850253899999998532788875102703688689999999999999999986068 Q gi|254781094|r 618 YLLLEAEKRLYAVISDFGTHIQESMDHKRYHQIGDLLHSICEPIEIFFDQVLVNVDDREVRDNRLSLLQYIKNIILIVIN 697 (702) Q Consensus 618 Lf~~~~Ek~L~~~~~~~~~~i~~~l~~~dy~~~L~~L~~L~~~Id~FFDnVmVm~eD~~IR~NRL~LL~~i~~~f~~iaD 697 (702) ||++++|+.||+++..+..+...++..++|..++..|++|++|||.|||+||||+||++||+|||+||..+++.|..+|| T Consensus 606 Lf~~~~E~~L~~a~~~~~~~~~~a~a~~~y~~al~~L~~L~~pId~FFd~VmVm~eD~~iR~NRLalL~~l~~~f~~vAd 685 (691) T COG0751 606 LFEEDAEKALFEALQALKPKVAEALAEKDYQDALAALAELRPPIDEFFDNVMVMAEDEALRNNRLALLSKLRELFLKVAD 685 (691) T ss_pred HHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEECCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 85288999999999997777888874235999999999854269998567266478789999999999999999998724 Q ss_pred HHHCC Q ss_conf 45219 Q gi|254781094|r 698 VQKIV 702 (702) Q Consensus 698 fs~Lv 702 (702) ||+|+ T Consensus 686 ~~~l~ 690 (691) T COG0751 686 FSLLV 690 (691) T ss_pred HHHHC T ss_conf 99851 No 3 >pfam02092 tRNA_synt_2f Glycyl-tRNA synthetase beta subunit. Probab=100.00 E-value=0 Score=1451.85 Aligned_cols=525 Identities=41% Similarity=0.698 Sum_probs=508.9 Q ss_pred EEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCEEEEEECCCCCCCCCEEEEECCCCC-------C Q ss_conf 016400201698999999999999999999976998033588630745899984954207510023027898-------5 Q gi|254781094|r 4 FLLEIYSEEIPARMQHKAAEDLSIILIGLLKEEGVIYQNMRQYWTPHRLFLYLKGLSPHSPEKIEERLGPRV-------G 76 (702) Q Consensus 4 lLlEIgtEElPa~~~~~~~~~l~~~~~~~l~~~~l~~~~i~~~~TPRRLav~i~~l~~~q~d~~~e~kGP~~-------g 76 (702) ||||||||||||+++..+.+||.+.+.+.|++++|+|+++++|+|||||+++|++++.+|+|+++|+||||+ | T Consensus 1 lLlEIgtEELPa~~~~~~~~~l~~~~~~~L~~~~l~~~~i~~f~TPRRLav~i~~l~~~q~d~~~e~kGP~~~~AfD~~G 80 (549) T pfam02092 1 LLLEIGTEELPAKFLKKALEQLAELITKGLKEARLSFGEVKVFATPRRLAVLVEGLAEKQPDKEEEKKGPPVKIAFDADG 80 (549) T ss_pred CEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEECCCEEEEEECCCCCCCCCHHHEEECCCHHHHCCCCC T ss_conf 92774003589899999999999999999998799868606987487888885376300866243213876576546579 Q ss_pred CCCHHHHHHHHHCCCCHHHHHHEEEECCCEEEEEEEEECCCCCHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCCCEEEE Q ss_conf 89878998999779982443320340698289999991588509999999999998628985131037765676741665 Q gi|254781094|r 77 AGKKAIDGFLRSTGLQTISDCQIKKDPKKGDVYLAVLRKPKRLIEDVLKTIVPVAIQKVPWPKSMRWSTTHSPISAFSWI 156 (702) Q Consensus 77 ~pt~A~~GF~ks~gi~~~~~~~i~~~~~kg~~~~~~~~~~g~~~~eiL~~ii~~~i~~l~~pKsMrWg~~~~~~~~~~Fv 156 (702) +||+|++||+||||++. +++.+ ++++||+|||+++.++|+++.++|+++++++|+++||||+||||+++ ++|+ T Consensus 81 ~ptkA~~GFaks~gv~~-~~L~~-~~t~kgey~~~~~~~~G~~~~elL~~ii~~~i~~l~~pKsMRWg~~~-----~~fv 153 (549) T pfam02092 81 NPTKAAEGFARSQGVSV-DDLEI-RETGKGEYLFAKKEEKGRPTAELLPEILPEAIKSLPFPKSMRWGDGD-----LRFV 153 (549) T ss_pred CCCHHHHHHHHHCCCCH-HHHEE-EECCCEEEEEEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCC-----CEEE T ss_conf 91799999999869998-99254-66288669999982088419999999999999679998631408999-----7676 Q ss_pred EECCEEEEEECCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCEEEECHHHHHHHHCCCCEEECHHHHHHHHHHHHHHHH Q ss_conf 30320567863677787231024458221111022344577856631679999862027847169999999999999988 Q gi|254781094|r 157 RPLKSILCILVAEDAKEKIIDLDLKEIPCGNITYGHRFHAPHPIKVQSLDHYIRDLEEAMVLLDPECRRNAILNDAHRLA 236 (702) Q Consensus 157 RPirwIvaLl~~~~~~~~vi~~~~~gi~s~n~T~GHRf~~~~~i~I~~a~~Y~~~L~~~~Vi~d~~~R~~~I~~~i~~~a 236 (702) ||||||+||| |++||||+++|++|||+||||||+++++|+|++|++|++.|++++||+|+++||+.|+++++++| T Consensus 154 RPirwi~aL~-----~~~vi~~~~~gi~s~n~T~GHRf~~~~~i~I~~~~~Y~~~L~~~~Vi~d~~eRk~~I~~~i~~~a 228 (549) T pfam02092 154 RPIRWLVALL-----GDEVVPFEILGITSGNTTRGHRFLGPGPITIASADDYEETLRKAGVIVDFEERKELIREQIEALA 228 (549) T ss_pred CHHHHHHHHC-----CCEEEEEEEEEEECCCEECCCCCCCCCCEEECCHHHHHHHHHHCCEEECHHHHHHHHHHHHHHHH T ss_conf 6189899870-----78066899940322543314312578986748999999998529899269999999999999999 Q ss_pred HHCCCEECCCHHHHHHHHHHCCCCEEEEEEECHHHHCCCHHHHHHHHHHHCCCEEEECCCCCCCCEEEEEECCCCCCCCH Q ss_conf 85398214666788788744089846888307777128989999999840011365536886132489850577655430 Q gi|254781094|r 237 SAVGLELVEDKDLLEEIIGLVEWVQVFMGSFDKKYLCLPEELIRLTIKTNQKCFVTRTREGVLANCFILVSNIQASDGGA 316 (702) Q Consensus 237 ~~~~~~~~~d~~Ll~Ev~~LvE~P~~~~G~F~~~fL~lP~evl~~~m~~hQkyf~~~~~~g~l~~~Fi~v~N~~~~~~~~ 316 (702) +++|++++.|++||||||+|||||++++|+||++||+||+|||+|+|++||||||++|++|+|+||||+|||+.+++ .+ T Consensus 229 ~~~~~~~~~d~~LLdEV~~LVE~P~~l~G~Fd~~fL~LP~EvLit~M~~hQkyF~v~d~~g~L~~~Fi~VsN~~~~~-~~ 307 (549) T pfam02092 229 AELGGKVDIDEDLLEEVTGLVEWPVALLGSFDEEFLELPPEVLITSMKEHQRYFPVYDKDGKLLPYFITVSNGESKD-PE 307 (549) T ss_pred HHCCCEECCCHHHHHHHHHCCCCCEEEEECCCHHHHCCCHHHHHHHHHHHHHEEEEECCCCCCCCEEEEEECCCCCC-HH T ss_conf 97695786888999999850678767874337997429999999999961300466899998845178861788677-78 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCEEHHHHHHHHHHHHHHHHHH Q ss_conf 24567889978764225999850300011111111110011220137889988421002303134551389989899999 Q gi|254781094|r 317 AIVQGNSRVVAARLEDALHFWKRDQNNLPNLSSLKESALKFNLDLSKPLDQRMARLDMLDVVFHAKIGTQGERVSRIRVL 396 (702) Q Consensus 317 ~v~~GnerVl~ARL~DA~FF~~~D~~~~~~~~~~~~~~~~~~~dlk~~L~~~~~~L~~~~v~F~~~LGs~~dK~~Ri~~l 396 (702) .|++||||||||||+||+|||++| +|++|++|+++|+ +||||++|||++|||+|+++| T Consensus 308 ~I~~GNErVl~ARL~DA~FF~~~D--------------------~k~~L~~~~~~L~--~v~F~~~LGs~~dK~~Ri~~l 365 (549) T pfam02092 308 NVIKGNERVLRARLADAEFFYEED--------------------LKTPLESRVEKLK--KVVFQEKLGSLYDKVERIEAL 365 (549) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCC--------------------CCCCHHHHHHHHH--CEEEHHCCCCHHHHHHHHHHH T ss_conf 889838999998861489999768--------------------8888799999753--445056059899999999999 Q ss_pred HHHHHHHCC-CCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHH Q ss_conf 999886529-9988999999974236788898403457899999998623699889999999734477677788635889 Q gi|254781094|r 397 GKKIAQLID-ADVALVDRAIVLSKADLCTEIVREFPELQGKIGKEYAVLQNENISCCDAIEEHLKPRGPLENVPTNKISI 475 (702) Q Consensus 397 a~~la~~l~-~~~~~~~rAa~L~KaDL~t~mV~EFpeLQG~mG~~YA~~~ge~~~Va~aI~ehY~P~~~~d~lP~~~~g~ 475 (702) +.+|++.++ .+.+.+.|||.||||||+|+||+|||||||+||+|||+++||+++||.||+|||+|++++|+||+|++|+ T Consensus 366 a~~ia~~~~~~~~~~~~rAa~L~K~DL~t~mV~EFpeLQGimG~~YA~~~ge~~~Va~AI~ehY~P~~~~d~lP~~~~g~ 445 (549) T pfam02092 366 AGKIAEQLGEADAEDAERAALLCKADLVTEMVGEFPELQGIMGRYYALHDGEDEEVALAIEEHYLPRFAGDALPSSPVGA 445 (549) T ss_pred HHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHH T ss_conf 99999974924099999999986200667876243777479999999986998799999999728999999999958989 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH----------------HHHHHHHH Q ss_conf 99999749989876510567687769688887434468999837997887896303----------------64999999 Q gi|254781094|r 476 TVSLADKLDTLINFWAINEKPSGSKDPYALRRATLGIIRIILENKIDIPLSQFIED----------------QNLILFFH 539 (702) Q Consensus 476 ilsiaDKlDtl~g~f~ig~~PTGS~DPfaLRR~a~GiirIl~e~~l~l~l~~li~~----------------~~l~~Fi~ 539 (702) +||||||+|||+|||++|++||||+|||||||+|+||||||++++|+++|.+++++ +++.+||. T Consensus 446 ~lsiADKlDtl~g~F~ig~~PTGS~DPfaLRR~a~Giirii~e~~~~l~L~~li~~a~~~~~~~~~~~~~~~~~l~~F~~ 525 (549) T pfam02092 446 AVALADKLDTLVGIFGIGEIPTGSKDPFALRRAALGILRILIEKGLPLDLRELLEKAASLYGDKLLDAEDVVEDLLEFFL 525 (549) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 99999999999999967998288999588999998999999964999989999999999765405885789999999999 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHCC Q ss_conf 999998763388878998854168 Q gi|254781094|r 540 DRLKLYLHDRDIRHDLIEAILRPE 563 (702) Q Consensus 540 ~Rl~~~L~e~G~~~DiI~AVl~~~ 563 (702) +|++++|+++||++|+|+||++.+ T Consensus 526 ~Rl~~~l~~~g~~~d~i~AVl~~~ 549 (549) T pfam02092 526 DRLRSLLEEQGIRYDVIDAVLALG 549 (549) T ss_pred HHHHHHHHHCCCCHHHHHHHHCCC T ss_conf 999999987799889999986189 No 4 >TIGR00211 glyS glycyl-tRNA synthetase, beta subunit; InterPro: IPR015944 The aminoacyl-tRNA synthetases (6.1.1 from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold and are mostly monomeric, while class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet formation, flanked by alpha-helices , and are mostly dimeric or multimeric. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic aci, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . The 10 class I synthetases are considered to have in common the catalytic domain structure based on the Rossmann fold, which is totally different from the class II catalytic domain structure. The class I synthetases are further divided into three subclasses, a, b and c, according to sequence homology. No conserved structural features for tRNA recognition by class I synthetases have been established. Class-II tRNA synthetases do not share a high degree of similarity, however at least three conserved regions are present , , . In most eubacteria, glycyl-tRNA synthetase (6.1.1.14 from EC) is an alpha2/beta2 tetramer composed of 2 different subunits , , while in archaea, eukaryota and some eubacteria, glycyl-tRNA synthetase is an alpha2 dimer (see IPR002315 from INTERPRO). This entry represents the beta subunit of the tetrameric enzyme. What is most interesting is the lack of similarity between the two types: divergence at the sequence level is so great that it is impossible to infer descent from common genes. The alpha (see IPR002310 from INTERPRO) and beta subunits also lack significant sequence similarity. However, they are translated from a single mRNA , and a single chain glycyl-tRNA synthetase from Chlamydia trachomatis has been found to have significant similarity with both domains, suggesting divergence from a single polypeptide chain .; GO: 0000166 nucleotide binding, 0004820 glycine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006426 glycyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=945.64 Aligned_cols=666 Identities=35% Similarity=0.562 Sum_probs=632.1 Q ss_pred CEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCEEEEEECCCCCCCCCEEEEECCCCC------- Q ss_conf 4016400201698999999999999999999976998033588630745899984954207510023027898------- Q gi|254781094|r 3 DFLLEIYSEEIPARMQHKAAEDLSIILIGLLKEEGVIYQNMRQYWTPHRLFLYLKGLSPHSPEKIEERLGPRV------- 75 (702) Q Consensus 3 ~lLlEIgtEElPa~~~~~~~~~l~~~~~~~l~~~~l~~~~i~~~~TPRRLav~i~~l~~~q~d~~~e~kGP~~------- 75 (702) +||+|||+||+|++++.....++...+...+...++.++++++|+|||||++.+.+++..+++..++.+||++ T Consensus 5 ~~l~~~g~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~l~~~~~~~~~~~~~~~~~~~gp~~~~~~~~~ 84 (743) T TIGR00211 5 TLLVEIGTEELPAKALRSLATELGDKLTAGLNEAGLEHGSVEWFGTPRRLALLVKDLPELQPDRKEEKKGPPVKAAFDAQ 84 (743) T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCHHHHHHCCCCCCEEEEECCHHHHHHHHHHHHCCCHHHHHHCCCHHHHHHHCC T ss_conf 32232311211378999998864431003455412552322122040356665530011030134542263134442012 Q ss_pred -CCCCHHHHHHHHHCCCCHHHHHHEEEECCCE---EEEEEEEECCCCCHHHHHHHHHHHHHHCCCCCEEECCCCCCCCCC Q ss_conf -5898789989997799824433203406982---899999915885099999999999986289851310377656767 Q gi|254781094|r 76 -GAGKKAIDGFLRSTGLQTISDCQIKKDPKKG---DVYLAVLRKPKRLIEDVLKTIVPVAIQKVPWPKSMRWSTTHSPIS 151 (702) Q Consensus 76 -g~pt~A~~GF~ks~gi~~~~~~~i~~~~~kg---~~~~~~~~~~g~~~~eiL~~ii~~~i~~l~~pKsMrWg~~~~~~~ 151 (702) |.|++++.||++++|+.. ++..+. .+.+| +|+++.+...|+++.++++++....+.++++||.||||+.+ T Consensus 85 ~g~~~~~~~g~~~~~g~~~-~~~~~~-~~~~~~~~~~~~~~~~~~g~~~~~l~~~~~~~~~~~l~~~~~~~wg~~~---- 158 (743) T TIGR00211 85 DGKPTKAALGFAKGQGLNV-EDAEIL-ETDKGNNKEWLFHRKFLEGEPTKDLLPPLVLELLAGLPFPKSMRWGDGD---- 158 (743) T ss_pred CCCCCHHHHHHHHCCCCCH-HHHHHH-HCCCCCCCCEEEEHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCC---- T ss_conf 5664301320010147761-233442-1146664100110224403203566678889998624775311235442---- Q ss_pred CEEEEEECCEEEEEECCCCCCCCEEEEEECC----CCCCCCCCCCCCCCCCCEEEECHHHHHHHHCCC-CEEECHHHHHH Q ss_conf 4166530320567863677787231024458----221111022344577856631679999862027-84716999999 Q gi|254781094|r 152 AFSWIRPLKSILCILVAEDAKEKIIDLDLKE----IPCGNITYGHRFHAPHPIKVQSLDHYIRDLEEA-MVLLDPECRRN 226 (702) Q Consensus 152 ~~~FvRPirwIvaLl~~~~~~~~vi~~~~~g----i~s~n~T~GHRf~~~~~i~I~~a~~Y~~~L~~~-~Vi~d~~~R~~ 226 (702) .+|+||+||++++++ |++++|+++.+ +.+++.++||||+++..+.+.+++.|.+.|+.+ +|++|+.+|+. T Consensus 159 -~~~~~p~~~~~~l~~----g~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~~~~~~~~~~y~~~l~~~g~~~~d~~~~~~ 233 (743) T TIGR00211 159 -LRFIRPIHWLVLLLN----GDDVLPLEILGGGTTLESDRTTRGHRFLGEDEVSIESPDAYPELLKENGHVIADPKERKA 233 (743) T ss_pred -CCCCCCCEEEEEEEC----CCCCCCEEECCCCCEECCCCHHHHHHHCCCCCEEECCHHHHHHHHHHCCCEEECHHHHHH T ss_conf -100020000234433----663033044055310013200101211163100102515679988526863215167789 Q ss_pred HHHHHHHHHHHHCCCEECCCHHHHHHHHHHCCCCEEEEEEECHHHHCCCHHHHHHHHHHHCCCEEEECCCCC-----CCC Q ss_conf 999999998885398214666788788744089846888307777128989999999840011365536886-----132 Q gi|254781094|r 227 AILNDAHRLASAVGLELVEDKDLLEEIIGLVEWVQVFMGSFDKKYLCLPEELIRLTIKTNQKCFVTRTREGV-----LAN 301 (702) Q Consensus 227 ~I~~~i~~~a~~~~~~~~~d~~Ll~Ev~~LvE~P~~~~G~F~~~fL~lP~evl~~~m~~hQkyf~~~~~~g~-----l~~ 301 (702) .|..++...+.++++.+..+++|++||..++|||+++.|+|+++||.+|.|+++++|+.||||||+++++|+ +.| T Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~~~~p~~~~~~~~~~~l~~p~~~~~~~~~~~~~y~p~~~~~g~~~~p~l~p 313 (743) T TIGR00211 234 KILAELETLAAKLGGEADLDEDLLDEVESLVEYPTALLGKFEEEFLSLPAEVLVTTMKGHQRYFPVYDKDGKEESPTLLP 313 (743) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCEEECEECCCCCCCCCCCCC T ss_conf 99988888887504531014678989988764224676555577751421453210002430101001577633531024 Q ss_pred EEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCEEHHH Q ss_conf 48985057765543024567889978764225999850300011111111110011220137889988421002303134 Q gi|254781094|r 302 CFILVSNIQASDGGAAIVQGNSRVVAARLEDALHFWKRDQNNLPNLSSLKESALKFNLDLSKPLDQRMARLDMLDVVFHA 381 (702) Q Consensus 302 ~Fi~v~N~~~~~~~~~v~~GnerVl~ARL~DA~FF~~~D~~~~~~~~~~~~~~~~~~~dlk~~L~~~~~~L~~~~v~F~~ 381 (702) +|++|+|+.+.+ .+.|+.||++|+++||+||.|||+.| ++.++++++++|+ .|+|++ T Consensus 314 ~~~~~~~~~~~~-~~~~~~g~~~~~~~~l~d~~~~~~~d--------------------~~~~~~~~~~~l~--~~~~~~ 370 (743) T TIGR00211 314 HFITVANGNPKD-PEKIALGNEKVLRARLSDAEFFYKTD--------------------LKKPLDDNLPRLE--TVLFQE 370 (743) T ss_pred CEEEEECCCCCC-CHHHHCCCCHHHHHHHHHHHHHHHHC--------------------CCCCHHHHHHHHH--HHHHHH T ss_conf 425773365411-02333054112222332345655310--------------------2110233212445--544343 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCH--------HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 551389989899999999886529998--------899999997423678889840345789999999862369988999 Q gi|254781094|r 382 KIGTQGERVSRIRVLGKKIAQLIDADV--------ALVDRAIVLSKADLCTEIVREFPELQGKIGKEYAVLQNENISCCD 453 (702) Q Consensus 382 ~LGs~~dK~~Ri~~la~~la~~l~~~~--------~~~~rAa~L~KaDL~t~mV~EFpeLQG~mG~~YA~~~ge~~~Va~ 453 (702) +|||+.||+.|+..++.+++..++.+. ..+.|++.+|||||+|+||+|||++||+||.+||+++||++.|+. T Consensus 371 ~lg~~~d~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~dl~~~~~~~~~~~~g~~g~~y~~~~g~~~~~~~ 450 (743) T TIGR00211 371 ELGTLKDKVDRISELAGYLADQLGADENTPLEKALGHVKRAGLLSKADLLTNLVYEFPELQGIMGEKYALHDGEDEAVAL 450 (743) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 32024567789999999999875100200268888766666666666667666653001222344455541360367888 Q ss_pred HHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH-- Q ss_conf 999973447767778863588999999749989876510567687769688887434468999837997887896303-- Q gi|254781094|r 454 AIEEHLKPRGPLENVPTNKISITVSLADKLDTLINFWAINEKPSGSKDPYALRRATLGIIRIILENKIDIPLSQFIED-- 531 (702) Q Consensus 454 aI~ehY~P~~~~d~lP~~~~g~ilsiaDKlDtl~g~f~ig~~PTGS~DPfaLRR~a~GiirIl~e~~l~l~l~~li~~-- 531 (702) ++.|||+|++++|++|++..|++++++||+||++|+|++|..|+||+|||||||++.|+++++++.+|.+++..++.. T Consensus 451 ~~~~~y~p~~~~~~~p~~~~~~~~~~~~~~d~~~~~~~~g~~p~g~~dp~~l~~~~~g~~~~~~~~~~~~~~~~~~~~gc 530 (743) T TIGR00211 451 ALNEHYLPRSAGDDLPSSLPGSVVALADKLDTLASLFGIGLLPTGSKDPFALRRAALGLLRILLEKNLELDLEELLTEGC 530 (743) T ss_pred HHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 77664222225765554124567776665556665542045678765512456778899999987057864688864101 Q ss_pred -----------------HHHHHHHHHHHHHHHHHC-CCCHHHHHHHHHCCCC---------------CHHHHHHHHHHHH Q ss_conf -----------------649999999999987633-8887899885416899---------------9899999999999 Q gi|254781094|r 532 -----------------QNLILFFHDRLKLYLHDR-DIRHDLIEAILRPEND---------------NLLTIVDLIKHLN 578 (702) Q Consensus 532 -----------------~~l~~Fi~~Rl~~~L~e~-G~~~DiI~AVl~~~~~---------------~~~~i~~r~~aL~ 578 (702) +++.+|+.+|++.++.+. |+++|+|++|++.+.. ++.++..+++++. T Consensus 531 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~cd~~~~~~~d~~~~~~~~~~~~~~~~~ 610 (743) T TIGR00211 531 RDFVLLQGDKLTNSNDLEDLLDFLLGRFRALLQDEQGIDPDVIDAVLGRGETNCDEAQSRLHDRLLADPADFDARAKALS 610 (743) T ss_pred HHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 23232113433304679999999999889886320154078899876046554035677666777643777888999999 Q ss_pred HHHCCH----HHHHHHHHHHHHHHHHHHHHHCCCCCCC--CCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 987195----1388999999999985565312467777--7385683598999999999999999999850253899999 Q gi|254781094|r 579 EFFSSA----KGEKFLLSAKRIFQILAIEEKKNREISL--EISPQYLLLEAEKRLYAVISDFGTHIQESMDHKRYHQIGD 652 (702) Q Consensus 579 ~f~~~~----~~~~l~~a~KRv~NIL~~~~k~~~~~~~--~vd~~Lf~~~~Ek~L~~~~~~~~~~i~~~l~~~dy~~~L~ 652 (702) .++..+ ..+.+..+++|+.||.++.......... .+++.+|.++.|+.+++++..+..+.......++|...+. T Consensus 611 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~d~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 690 (743) T TIGR00211 611 HFLDLPKNAQGAEALAAALKRVANILKKGGPPTDVLGPRPEINAPLLKEPKEKALFEALLALKPNLQELFSKGDYETALE 690 (743) T ss_pred HHHCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 86237731246788888765666643125777544566554450221154678999999987765555541003789999 Q ss_pred HHHHHHHHHHHHHC---CCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHCC Q ss_conf 99853278887510---270368868999999999999999998606845219 Q gi|254781094|r 653 LLHSICEPIEIFFD---QVLVNVDDREVRDNRLSLLQYIKNIILIVINVQKIV 702 (702) Q Consensus 653 ~L~~L~~~Id~FFD---nVmVm~eD~~IR~NRL~LL~~i~~~f~~iaDfs~Lv 702 (702) .+..+++|++.||| +|+||++|+.+|.||+++|..+++.+..++|++.++ T Consensus 691 ~~~~l~~p~~~~~~gpd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 743 (743) T TIGR00211 691 ALAELREPVDEFFDGPDSVLVLADDDELRKNRLNLLGGLRDLFLELGDIGALV 743 (743) T ss_pred HHHHHHHHHHHHCCCCCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 98642100687507866347640645667778999988899888763233139 No 5 >pfam05746 DALR_1 DALR anticodon binding domain. This all alpha helical domain is the anticodon binding domain in Arginyl and glycyl tRNA synthetase. This domain is known as the DALR domain after characteristic conserved amino acids. Probab=98.65 E-value=4.8e-07 Score=67.33 Aligned_cols=105 Identities=25% Similarity=0.411 Sum_probs=90.6 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 99999999855653124677777385683598999999999999999999850253899999998532788875102703 Q gi|254781094|r 591 LSAKRIFQILAIEEKKNREISLEISPQYLLLEAEKRLYAVISDFGTHIQESMDHKRYHQIGDLLHSICEPIEIFFDQVLV 670 (702) Q Consensus 591 ~a~KRv~NIL~~~~k~~~~~~~~vd~~Lf~~~~Ek~L~~~~~~~~~~i~~~l~~~dy~~~L~~L~~L~~~Id~FFDnVmV 670 (702) =++.|+.+|+.+..... .....+..++.+++|++|...+......+..+....+-......+.+|....+.|++++.| T Consensus 3 Ya~aRi~SIl~k~~~~~--~~~~~~~~~~~~~~E~~Li~~l~~f~~~l~~a~~~~~P~~l~~Yl~~La~~f~~fY~~~~I 80 (117) T pfam05746 3 YAHARICSILRKAGELG--INLDIDALLLLEEEEKELLKALLQFPEVVEEAAEELEPHRLANYLYDLASAFHSFYNNCRV 80 (117) T ss_pred HHHHHHHHHHHHCCCCC--CCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 88999999998467667--7655430012999999999988052999999998668699999999999999999971967 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 688689999999999999999986068 Q gi|254781094|r 671 NVDDREVRDNRLSLLQYIKNIILIVIN 697 (702) Q Consensus 671 m~eD~~IR~NRL~LL~~i~~~f~~iaD 697 (702) .++|..++++||.|+..++..+..-.+ T Consensus 81 l~~~~~~~~~RL~L~~~~~~vl~~gl~ 107 (117) T pfam05746 81 LDEDDEERNARLALLKAVRQVLKNGLD 107 (117) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 489889999999999999999999978 No 6 >smart00836 DALR_1 DALR anticodon binding domain. This all alpha helical domain is the anticodon binding domain of Arginyl tRNA synthetase. This domain is known as the DALR domain after characteristic conserved amino acids PUBMED:10447505. Probab=98.45 E-value=5.9e-06 Score=59.86 Aligned_cols=108 Identities=17% Similarity=0.299 Sum_probs=91.2 Q ss_pred HHHHHHHHHHHHHHHCCCCC--CCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 99999999855653124677--7773856835989999999999999999998502538999999985327888751027 Q gi|254781094|r 591 LSAKRIFQILAIEEKKNREI--SLEISPQYLLLEAEKRLYAVISDFGTHIQESMDHKRYHQIGDLLHSICEPIEIFFDQV 668 (702) Q Consensus 591 ~a~KRv~NIL~~~~k~~~~~--~~~vd~~Lf~~~~Ek~L~~~~~~~~~~i~~~l~~~dy~~~L~~L~~L~~~Id~FFDnV 668 (702) =++.|+.+|+.+........ ....|.+.+.++.|++|...+......+..+....+-......+.+|....+.|.+++ T Consensus 3 Ya~aRi~SIl~k~~~~~~~~~~~~~~~~~~l~~~~E~~Li~~l~~f~~vl~~a~~~~~P~~l~~Yl~~La~~f~~fY~~~ 82 (122) T smart00836 3 YAHARICSILRKAGEAGLTLPDIADADLSLLTEPEELALLRLLARFPEVLEAAAETLEPHRLANYLYDLASAFHSFYNKC 82 (122) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCC T ss_conf 88999999999612237786432313353259999999999999778999999997695899999999999999998438 Q ss_pred CCCCC-CHHHHHHHHHHHHHHHHHHHHHCCH Q ss_conf 03688-6899999999999999999860684 Q gi|254781094|r 669 LVNVD-DREVRDNRLSLLQYIKNIILIVINV 698 (702) Q Consensus 669 mVm~e-D~~IR~NRL~LL~~i~~~f~~iaDf 698 (702) .|.++ |++++.+||.|+..++..+....++ T Consensus 83 ~Il~~~~~~~~~~RL~L~~~~~~vl~~~l~l 113 (122) T smart00836 83 RVLGEENPELRAARLALLKAVRQVLANGLRL 113 (122) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 4236898999999999999999999999874 No 7 >cd07956 Anticodon_Ia_Arg Anticodon-binding domain of arginyl tRNA synthetases. This domain is found in arginyl tRNA synthetases (ArgRS), which belong to the class Ia aminoacyl tRNA synthetases. It lies C-terminal to the catalytic core domain, and recognizes and specifically binds to the tRNA anticodon. ArgRS catalyzes the transfer of arginine to the 3'-end of its tRNA. Probab=98.20 E-value=2.9e-05 Score=55.12 Aligned_cols=114 Identities=16% Similarity=0.275 Sum_probs=95.4 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH Q ss_conf 95138899999999998556531246777773856835989999999999999999998502538999999985327888 Q gi|254781094|r 583 SAKGEKFLLSAKRIFQILAIEEKKNREISLEISPQYLLLEAEKRLYAVISDFGTHIQESMDHKRYHQIGDLLHSICEPIE 662 (702) Q Consensus 583 ~~~~~~l~~a~KRv~NIL~~~~k~~~~~~~~vd~~Lf~~~~Ek~L~~~~~~~~~~i~~~l~~~dy~~~L~~L~~L~~~Id 662 (702) .+.+.=+.=++-|+..|+.+...... ....++.+++.++.|++|...+......+..+.....-......|.+|...-+ T Consensus 33 gnt~~YiQYa~aR~~SIl~k~~~~~~-~~~~~~~~~l~~~~e~~Li~~l~~fp~vl~~a~~~~~P~~l~~YL~~La~~F~ 111 (156) T cd07956 33 GDTGPYLQYAHARLCSILRKAGETIE-AEADADLSLLPEPDERDLILLLAKFPEVVKNAAETLEPHTIATYLFDLAHAFS 111 (156) T ss_pred CCCCEEHHHHHHHHHHHHHHCCCCCC-CCCCCCHHCCCCHHHHHHHHHHHCCHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 89851124899999999997245453-32225420049989999999996572899999997070699999999999999 Q ss_pred HHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 75102703688689999999999999999986068 Q gi|254781094|r 663 IFFDQVLVNVDDREVRDNRLSLLQYIKNIILIVIN 697 (702) Q Consensus 663 ~FFDnVmVm~eD~~IR~NRL~LL~~i~~~f~~iaD 697 (702) .|.++..|.++|+++++.||.|+..++..+..-.+ T Consensus 112 ~fY~~~~Ii~~~~~~~~~RL~L~~a~~~vL~~gL~ 146 (156) T cd07956 112 KFYNACPVLGAEEELRNARLALVAAARQVLANGLD 146 (156) T ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99974988789889999999999999999999865 No 8 >PRK12704 phosphodiesterase; Provisional Probab=98.09 E-value=5.2e-05 Score=53.38 Aligned_cols=111 Identities=24% Similarity=0.260 Sum_probs=92.4 Q ss_pred CCCCHHHHHHHHCCCCEEHHHHHH-HHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 137889988421002303134551-3899898999999998865299988999999974236788898403457899999 Q gi|254781094|r 361 LSKPLDQRMARLDMLDVVFHAKIG-TQGERVSRIRVLGKKIAQLIDADVALVDRAIVLSKADLCTEIVREFPELQGKIGK 439 (702) Q Consensus 361 lk~~L~~~~~~L~~~~v~F~~~LG-s~~dK~~Ri~~la~~la~~l~~~~~~~~rAa~L~KaDL~t~mV~EFpeLQG~mG~ 439 (702) +-..|-..+-+|+ |.-..| ++++-+.-+..|+..+|..+|.|...+.||+.|- |+--.+-.|..--+-.+|. T Consensus 250 l~peli~~lGrLk-----yRtSygQNvL~Hs~Eva~lag~~A~elg~d~~~AkraglLH--DIGKa~d~e~eg~H~~iG~ 322 (455) T PRK12704 250 LHPELIKLIGRLK-----YRTSYGQNVLQHSREVAHLAGLMAAELGLDEKLAKRAGLLH--DIGKALDHEVEGSHVELGV 322 (455) T ss_pred CCHHHHHHHHHEE-----CCCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH--HHCCCCCCCCCCCHHHHHH T ss_conf 9979999986310-----20204047999999999999999998299999999998765--2003456546897799979 Q ss_pred HHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 99862369988999999973447767778863588999999749 Q gi|254781094|r 440 EYAVLQNENISCCDAIEEHLKPRGPLENVPTNKISITVSLADKL 483 (702) Q Consensus 440 ~YA~~~ge~~~Va~aI~ehY~P~~~~d~lP~~~~g~ilsiaDKl 483 (702) ..|++.||+++|+.||..|- +|.-|+|+.+.+|+.||-+ T Consensus 323 ~~a~k~ge~~~vinai~~HH-----~d~~~~~~~a~lV~aaDai 361 (455) T PRK12704 323 ELAKKYKESPVVINAIAAHH-----GDEEPTSIEAALVQAADAL 361 (455) T ss_pred HHHHHCCCCHHHHHHHHHHC-----CCCCCCCHHHHHHHHHHHH T ss_conf 99998299889999999836-----8778666088999999998 No 9 >COG0018 ArgS Arginyl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=98.07 E-value=9.7e-05 Score=51.56 Aligned_cols=117 Identities=18% Similarity=0.287 Sum_probs=93.3 Q ss_pred HHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 98719513889999999999855653124677777385683598999999999999999999850253899999998532 Q gi|254781094|r 579 EFFSSAKGEKFLLSAKRIFQILAIEEKKNREISLEISPQYLLLEAEKRLYAVISDFGTHIQESMDHKRYHQIGDLLHSIC 658 (702) Q Consensus 579 ~f~~~~~~~~l~~a~KRv~NIL~~~~k~~~~~~~~vd~~Lf~~~~Ek~L~~~~~~~~~~i~~~l~~~dy~~~L~~L~~L~ 658 (702) .|-. +-+.=+.=++-|+.+|+.+............+ ..+.++.|.+|...+.....-+..+.....-...-..|.+|. T Consensus 450 sfeg-Nt~pYvQYA~ARi~SIlrka~e~~~~~~~~~~-~~l~~~~E~~L~~~L~~fp~vl~~aa~~~~Ph~la~YL~~LA 527 (577) T COG0018 450 SFEG-NTAPYVQYAHARICSILRKAGEDELDLSTEAD-ALLTELEERELVKKLLEFPEVLEEAAEELEPHRLANYLYDLA 527 (577) T ss_pred HCCC-CCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCC-HHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 4459-97215999999999999854443455332221-010682799999999984999999998569559999999999 Q ss_pred HHHHHHHCCCCCCCCC-HHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 7888751027036886-89999999999999999986068 Q gi|254781094|r 659 EPIEIFFDQVLVNVDD-REVRDNRLSLLQYIKNIILIVIN 697 (702) Q Consensus 659 ~~Id~FFDnVmVm~eD-~~IR~NRL~LL~~i~~~f~~iaD 697 (702) ..-+.|.+++.|.+++ +.+|..||+|...++..+..-.+ T Consensus 528 ~~Fn~fY~~~~Vl~~~~~~~~~aRL~L~~a~~~vL~ngL~ 567 (577) T COG0018 528 GSFNSFYNACPVLGAENEELRAARLALVKATRQVLKNGLD 567 (577) T ss_pred HHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999998508777899578999999999999999997787 No 10 >PRK01611 argS arginyl-tRNA synthetase; Reviewed Probab=98.02 E-value=0.0015 Score=43.51 Aligned_cols=114 Identities=17% Similarity=0.271 Sum_probs=92.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH Q ss_conf 19513889999999999855653124677777385683598999999999999999999850253899999998532788 Q gi|254781094|r 582 SSAKGEKFLLSAKRIFQILAIEEKKNREISLEISPQYLLLEAEKRLYAVISDFGTHIQESMDHKRYHQIGDLLHSICEPI 661 (702) Q Consensus 582 ~~~~~~~l~~a~KRv~NIL~~~~k~~~~~~~~vd~~Lf~~~~Ek~L~~~~~~~~~~i~~~l~~~dy~~~L~~L~~L~~~I 661 (702) +.+.+.=+.=++-|+.+|+.+..... .....+...+.+++|++|...+......+..+.....=......|.+|...- T Consensus 446 ~g~t~~YiQYa~AR~~SIlrK~~~~~--~~~~~~~~~l~~~~E~~Li~~l~~fp~vv~~a~~~~~Ph~l~~YL~~La~~F 523 (570) T PRK01611 446 EGNNPPYVQYAHARICSILRKAAEEG--LAAAADLSLLTEEEELELIKKLAEFPEVVESAAEELEPHRIANYLYELAGAF 523 (570) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCC--CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf 38982578899999999998631234--5443433346998999999999887999999999688189999999999999 Q ss_pred HHHHCCCCCC-CCCHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 8751027036-88689999999999999999986068 Q gi|254781094|r 662 EIFFDQVLVN-VDDREVRDNRLSLLQYIKNIILIVIN 697 (702) Q Consensus 662 d~FFDnVmVm-~eD~~IR~NRL~LL~~i~~~f~~iaD 697 (702) +.|.++..|- .+++.+|+.||.|...++..+..-.+ T Consensus 524 n~fY~~~~vl~~~~~~~~~~RL~L~~a~~~vL~~gL~ 560 (570) T PRK01611 524 HSFYNACPVLKEEEEEERNARLALVKATAQVLKNGLD 560 (570) T ss_pred HHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9998519835899989999999999999999999864 No 11 >PRK00106 hypothetical protein; Provisional Probab=97.99 E-value=0.00012 Score=50.93 Aligned_cols=111 Identities=29% Similarity=0.331 Sum_probs=91.9 Q ss_pred CCCCHHHHHHHHCCCCEEHHHHHH-HHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 137889988421002303134551-3899898999999998865299988999999974236788898403457899999 Q gi|254781094|r 361 LSKPLDQRMARLDMLDVVFHAKIG-TQGERVSRIRVLGKKIAQLIDADVALVDRAIVLSKADLCTEIVREFPELQGKIGK 439 (702) Q Consensus 361 lk~~L~~~~~~L~~~~v~F~~~LG-s~~dK~~Ri~~la~~la~~l~~~~~~~~rAa~L~KaDL~t~mV~EFpeLQG~mG~ 439 (702) +-..|-..+-+|+ |.-..| ++++-+.-+..|+..+|..+|.|...+.||+.|- |+--.+=.|..--+-.+|. T Consensus 330 lhpelik~lGrLk-----yRtSygQNvL~HS~Eva~lag~mA~ElG~d~~~AkRagLLH--DIGKa~d~e~egsH~~iG~ 402 (535) T PRK00106 330 LHPDLIKIMGRLQ-----FRTSYGQNVLRHSVEVGKLAGILAGELGENVALARRAGFLH--DMGKAIDREVEGSHVEIGM 402 (535) T ss_pred CCHHHHHHHHHHC-----CHHHCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH--HCCCCCCCCCCCCHHHHHH T ss_conf 9978999987600-----01024058999999999999999998399999999998765--1014566767898799999 Q ss_pred HHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 99862369988999999973447767778863588999999749 Q gi|254781094|r 440 EYAVLQNENISCCDAIEEHLKPRGPLENVPTNKISITVSLADKL 483 (702) Q Consensus 440 ~YA~~~ge~~~Va~aI~ehY~P~~~~d~lP~~~~g~ilsiaDKl 483 (702) ..|++.||+++|+.||..|- +|.-|+|+.+.+|+.||-+ T Consensus 403 ~lakky~E~~~VinaI~aHH-----~d~~~~s~~a~lV~aADai 441 (535) T PRK00106 403 EFARKYKEHPVVVNTIASHH-----GDVEPDSVIAVIVAAADAL 441 (535) T ss_pred HHHHHCCCCHHHHHHHHHHC-----CCCCCCCHHHHHHHHHHHH T ss_conf 99998399879999999827-----8888854888999999998 No 12 >PRK12705 hypothetical protein; Provisional Probab=97.98 E-value=0.00012 Score=50.97 Aligned_cols=111 Identities=31% Similarity=0.296 Sum_probs=91.3 Q ss_pred CCCCHHHHHHHHCCCCEEHHHHHH-HHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 137889988421002303134551-3899898999999998865299988999999974236788898403457899999 Q gi|254781094|r 361 LSKPLDQRMARLDMLDVVFHAKIG-TQGERVSRIRVLGKKIAQLIDADVALVDRAIVLSKADLCTEIVREFPELQGKIGK 439 (702) Q Consensus 361 lk~~L~~~~~~L~~~~v~F~~~LG-s~~dK~~Ri~~la~~la~~l~~~~~~~~rAa~L~KaDL~t~mV~EFpeLQG~mG~ 439 (702) +...|-..+-+|+ |.-..| ++++-+.-+..|+..+|..+|.|...+.||+.|- |+--.+=.|-..-+-.+|. T Consensus 279 l~pelikllGrLk-----yRtSyGQNvL~HSiEvA~lag~mA~ElG~d~~~AkRaGLLH--DIGKAvd~e~egsH~~iG~ 351 (485) T PRK12705 279 LSPGLVRLLGRLY-----FRTSYGQNVLSHSLEVAFLAAILAALIGLDPALAKKAGLLH--DIGKSVDFESDGNHVEIGA 351 (485) T ss_pred CCHHHHHHHHCEE-----CCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH--HCCCCCCCCCCCCHHHHHH T ss_conf 9978999986010-----00234057999999999999999998399999999998665--1014456657898799999 Q ss_pred HHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 99862369988999999973447767778863588999999749 Q gi|254781094|r 440 EYAVLQNENISCCDAIEEHLKPRGPLENVPTNKISITVSLADKL 483 (702) Q Consensus 440 ~YA~~~ge~~~Va~aI~ehY~P~~~~d~lP~~~~g~ilsiaDKl 483 (702) ..|++.||+++|..||..|- +|.-|+|+.+.+|+.||.+ T Consensus 352 ~lakky~e~~~VinaI~aHH-----~d~~~~s~~a~lV~aADai 390 (485) T PRK12705 352 ELAKKFNEPDYVVNAIASHH-----NKVNPTTVYSVLVQIADAL 390 (485) T ss_pred HHHHHCCCCHHHHHHHHHHC-----CCCCCCCHHHHHHHHHHHH T ss_conf 99998298889999999837-----9888745898999999998 No 13 >TIGR00456 argS arginyl-tRNA synthetase; InterPro: IPR001278 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Arginyl-tRNA synthetase (6.1.1.19 from EC) has been crystallized and preliminary X-ray crystallographic analysis of yeast arginyl-tRNA synthetase-yeast tRNAArg complexes is available . ; GO: 0000166 nucleotide binding, 0004814 arginine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006420 arginyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=97.92 E-value=0.00047 Score=46.85 Aligned_cols=194 Identities=18% Similarity=0.197 Sum_probs=132.5 Q ss_pred HHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH--HHHHHHHHHHHHHHHHHCCCCH-HHHHHH---- Q ss_conf 876510567687769688887434468999837997887896303--6499999999999876338887-899885---- Q gi|254781094|r 487 INFWAINEKPSGSKDPYALRRATLGIIRIILENKIDIPLSQFIED--QNLILFFHDRLKLYLHDRDIRH-DLIEAI---- 559 (702) Q Consensus 487 ~g~f~ig~~PTGS~DPfaLRR~a~GiirIl~e~~l~l~l~~li~~--~~l~~Fi~~Rl~~~L~e~G~~~-DiI~AV---- 559 (702) +-++..|..|=+|.-|+-=||- -=|.+++++++ +....++..|...+=. +-.. +++++| T Consensus 383 L~~~~f~~v~~~~GKkmkTR~G------------~~i~ld~ll~e~~~RA~~~~~~~~~~~~~--~e~~e~~a~~vG~gA 448 (600) T TIGR00456 383 LEHLEFGMVPLGSGKKMKTRRG------------NVISLDNLLDEASKRAGNVITKRNTDLEQ--DEAEEKVADAVGIGA 448 (600) T ss_pred EEEEEEEEEECCCCCCCCCCCC------------CCEEHHHHHHHHHHHHHHHHHHHHCCCCC--CCHHHHHHHHHHHHH T ss_conf 6899876432578885434556------------60223446899999999999611013522--203789999973679 Q ss_pred -----HHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH-HHHHCCCCCCCCCCHHHHCCHHHHHHHHHHHH Q ss_conf -----416899989999999999998719513889999999999855-65312467777738568359899999999999 Q gi|254781094|r 560 -----LRPENDNLLTIVDLIKHLNEFFSSAKGEKFLLSAKRIFQILA-IEEKKNREISLEISPQYLLLEAEKRLYAVISD 633 (702) Q Consensus 560 -----l~~~~~~~~~i~~r~~aL~~f~~~~~~~~l~~a~KRv~NIL~-~~~k~~~~~~~~vd~~Lf~~~~Ek~L~~~~~~ 633 (702) ++...+..+.+.--+.+|+- +.+-+.=+.=||-|+..|+. +.... .......+-.++.++.|+.|...+.. T Consensus 449 vry~~l~~~~~~~~~Fd~d~~~L~f--~~n~~pY~QYAhaR~~Silr~~~~~~-~~~~~~~~~~~~~~~~E~~L~~~L~~ 525 (600) T TIGR00456 449 VRYADLSKNRETHYVFDWDTAMLSF--EGNTAPYIQYAHARICSILRKKAEID-GEKLIADDFLLLKEEKEKELLKLLLQ 525 (600) T ss_pred HHHHHHHHCCCCCCEEECCCHHHCC--CCCCCCHHHHHHHHHHHHHHHHCCCC-HHHHHHHHCCCCCCHHHHHHHHHHHH T ss_conf 9999997278988155012011115--78874325888999999876310233-13455320003688889999999856 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCC-CHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9999999850253899999998532788875102703688-689999999999999999986068 Q gi|254781094|r 634 FGTHIQESMDHKRYHQIGDLLHSICEPIEIFFDQVLVNVD-DREVRDNRLSLLQYIKNIILIVIN 697 (702) Q Consensus 634 ~~~~i~~~l~~~dy~~~L~~L~~L~~~Id~FFDnVmVm~e-D~~IR~NRL~LL~~i~~~f~~iaD 697 (702) +..-+..+.....-....+.|.+|....+.|.+++.|.+. ++.++..||+|+..++..+....+ T Consensus 526 fp~~~~~~~~~~~p~~l~~Yly~LA~~F~sfY~~~~v~~~~~~~~~~~RL~L~~~~~~~L~n~L~ 590 (600) T TIGR00456 526 FPEVIEEAAEEREPHVLTNYLYELAKLFSSFYKACPVLDAENEELAKARLALLKATRQTLKNGLE 590 (600) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 48999999985471888999999977642366417788986266899999999999999999887 No 14 >TIGR03319 YmdA_YtgF conserved hypothetical protein YmdA/YtgF. Probab=97.90 E-value=0.00015 Score=50.27 Aligned_cols=193 Identities=22% Similarity=0.260 Sum_probs=123.4 Q ss_pred HHHHHHHCCCEECCCHHHHHHHHHHCCCCEEEEEEECHHHHCCCHHHHHHHHHHHCCCEEEECCCCCCCCEEEEEECCCC Q ss_conf 99988853982146667887887440898468883077771289899999998400113655368861324898505776 Q gi|254781094|r 232 AHRLASAVGLELVEDKDLLEEIIGLVEWVQVFMGSFDKKYLCLPEELIRLTIKTNQKCFVTRTREGVLANCFILVSNIQA 311 (702) Q Consensus 232 i~~~a~~~~~~~~~d~~Ll~Ev~~LvE~P~~~~G~F~~~fL~lP~evl~~~m~~hQkyf~~~~~~g~l~~~Fi~v~N~~~ 311 (702) |..++.-.|..+++|+. -..+++-+||+= -.||=..+|.. +. .+|++.|-.| T Consensus 227 IRa~E~~TGVDlIIDDt----------Pe~V~lS~Fdpv----RREiAr~aLe~------Li-~DGRIhP~rI------- 278 (514) T TIGR03319 227 IRALETLTGVDLIIDDT----------PEAVILSGFDPV----RREIARMALEK------LI-QDGRIHPARI------- 278 (514) T ss_pred HHHHHHHHCCEEEECCC----------CCEEEEECCCHH----HHHHHHHHHHH------HH-HCCCCCHHHH------- T ss_conf 99999986974898599----------985888678868----89999999999------98-7499488999------- Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCEEHHHHHH-HHHHHH Q ss_conf 5543024567889978764225999850300011111111110011220137889988421002303134551-389989 Q gi|254781094|r 312 SDGGAAIVQGNSRVVAARLEDALHFWKRDQNNLPNLSSLKESALKFNLDLSKPLDQRMARLDMLDVVFHAKIG-TQGERV 390 (702) Q Consensus 312 ~~~~~~v~~GnerVl~ARL~DA~FF~~~D~~~~~~~~~~~~~~~~~~~dlk~~L~~~~~~L~~~~v~F~~~LG-s~~dK~ 390 (702) .+.-+.+.+.-++.+.--=++|.|=-.-. ++-.-|-..+-+|+ |--..| ++++-+ T Consensus 279 Ee~v~k~~~e~~~~i~~~Ge~a~~elgi~-------------------~~~peli~~lGrlk-----yRtSygQNvL~Hs 334 (514) T TIGR03319 279 EEMVEKATKEVDNAIREEGEQAAFDLGVH-------------------GLHPELIKLLGRLK-----FRTSYGQNVLQHS 334 (514) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCC-------------------CCCHHHHHHHHHEE-----HHHHCCHHHHHHH T ss_conf 99999999999999999999999981999-------------------99978999987500-----0102405899999 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 89999999988652999889999999742367888984034578999999986236998899999997344776777886 Q gi|254781094|r 391 SRIRVLGKKIAQLIDADVALVDRAIVLSKADLCTEIVREFPELQGKIGKEYAVLQNENISCCDAIEEHLKPRGPLENVPT 470 (702) Q Consensus 391 ~Ri~~la~~la~~l~~~~~~~~rAa~L~KaDL~t~mV~EFpeLQG~mG~~YA~~~ge~~~Va~aI~ehY~P~~~~d~lP~ 470 (702) .-+..|+..+|..+|.|...+.||+.|- |+--.+=.|..--+-.+|...|++.||+++|+.||..|- +|.-|+ T Consensus 335 ~Eva~lag~~A~Elg~d~~~AkRaglLH--DIGKa~d~e~egsH~~iG~~~a~k~~e~~~vinai~~HH-----~~~~~~ 407 (514) T TIGR03319 335 IEVAHLAGIMAAELGEDVKLAKRAGLLH--DIGKAVDHEVEGSHVEIGAELAKKYKESPEVVNAIAAHH-----GDVEPT 407 (514) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHH--HCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHC-----CCCCCC T ss_conf 9999999999998399999999998765--201456765799779999999998498889999999817-----877866 Q ss_pred CHHHHHHHHHHHH Q ss_conf 3588999999749 Q gi|254781094|r 471 NKISITVSLADKL 483 (702) Q Consensus 471 ~~~g~ilsiaDKl 483 (702) |+.+.+|+.||-+ T Consensus 408 ~~~a~lv~aaDai 420 (514) T TIGR03319 408 SIEAVLVAAADAL 420 (514) T ss_pred CHHHHHHHHHHHH T ss_conf 6088999999998 No 15 >PRK12451 arginyl-tRNA synthetase; Reviewed Probab=97.67 E-value=0.0055 Score=39.55 Aligned_cols=110 Identities=15% Similarity=0.240 Sum_probs=85.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH Q ss_conf 19513889999999999855653124677777385683598999999999999999999850253899999998532788 Q gi|254781094|r 582 SSAKGEKFLLSAKRIFQILAIEEKKNREISLEISPQYLLLEAEKRLYAVISDFGTHIQESMDHKRYHQIGDLLHSICEPI 661 (702) Q Consensus 582 ~~~~~~~l~~a~KRv~NIL~~~~k~~~~~~~~vd~~Lf~~~~Ek~L~~~~~~~~~~i~~~l~~~dy~~~L~~L~~L~~~I 661 (702) +.+.+.=+.=++-|+.+|+.++... .+.....+.++.|.+|...+......+..+.....=......|.+|...- T Consensus 443 ~~~~~PYiQYAhARi~SIlrk~~~~-----~~~~~~~l~~~~e~~Li~~L~~fp~vv~~aa~~~~Ph~l~~Yl~~LA~~F 517 (562) T PRK12451 443 EGETGPYVQYTHARACSILRKESVE-----FETCTFALKDDHSWSVVKLLNKFPQVIEAAFNKNEPSVISKYVLDVAQSF 517 (562) T ss_pred CCCCCHHHHHHHHHHHHHHHHCCCC-----CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 5788736999999999999744577-----55433446997999999999988999999998499379999999999999 Q ss_pred HHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 875102703688689999999999999999986068 Q gi|254781094|r 662 EIFFDQVLVNVDDREVRDNRLSLLQYIKNIILIVIN 697 (702) Q Consensus 662 d~FFDnVmVm~eD~~IR~NRL~LL~~i~~~f~~iaD 697 (702) +.|.++..|..++.. |+.||.|+..++..+..-.+ T Consensus 518 h~fY~~~~Vl~~~~~-~~aRL~L~~a~~~vl~ngL~ 552 (562) T PRK12451 518 NKYYGNVRILEESAE-KDSRLALVYAVTVVLKEGLR 552 (562) T ss_pred HHHHCCCCCCCCCHH-HHHHHHHHHHHHHHHHHHHH T ss_conf 999806877899565-89999999999999999743 No 16 >PRK10119 hypothetical protein; Provisional Probab=96.94 E-value=0.037 Score=33.90 Aligned_cols=125 Identities=20% Similarity=0.205 Sum_probs=79.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH-HHHHHHHH--CCHHHHHHHHHHHHHHC--CCC----HHH Q ss_conf 4551389989899999999886529998899999997423-67888984--03457899999998623--699----889 Q gi|254781094|r 381 AKIGTQGERVSRIRVLGKKIAQLIDADVALVDRAIVLSKA-DLCTEIVR--EFPELQGKIGKEYAVLQ--NEN----ISC 451 (702) Q Consensus 381 ~~LGs~~dK~~Ri~~la~~la~~l~~~~~~~~rAa~L~Ka-DL~t~mV~--EFpeLQG~mG~~YA~~~--ge~----~~V 451 (702) ..=|-=++-+.|+.++|..|+..-+.|...+.-||+|--. |+..+.-. .-+.+..-+.+...... +.+ +.| T Consensus 21 ~d~sHD~~Hi~RV~~lA~~Ia~~E~~D~~vv~lAAlLHDi~d~~~~~~~~~~~~~~~a~~~~~~l~~~~~~~~~~~I~~I 100 (231) T PRK10119 21 QDAAHDICHFRRVWATAQKLAADDDVDMLVVLTACYFHDIVSLAKNHPQRHRSSILAAEETRRILREEFPDFPAEKIEAV 100 (231) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH T ss_conf 98863779999999999999987798899999999988503333578101033677899999999984478738999999 Q ss_pred HHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHH-----HHHCC----CCCCCCCCHHHHHHH Q ss_conf 9999997344776777886358899999974998987-----65105----676877696888874 Q gi|254781094|r 452 CDAIEEHLKPRGPLENVPTNKISITVSLADKLDTLIN-----FWAIN----EKPSGSKDPYALRRA 508 (702) Q Consensus 452 a~aI~ehY~P~~~~d~lP~~~~g~ilsiaDKlDtl~g-----~f~ig----~~PTGS~DPfaLRR~ 508 (702) +.+|..| +|++...|.|+.|.||-=|||||.|=. +|..| ..--...||++.+|. T Consensus 101 ~~~I~~~---sfs~~~~p~slE~kIVQDADRLDAiGAIGIARtF~ygG~~gr~lydp~~p~~~~~~ 163 (231) T PRK10119 101 CHAIEAH---SFSAQIAPLTTEAKIVQDADRLEALGAIGLARVFAVSGALGVALFDAEDPFAQHRP 163 (231) T ss_pred HHHHHHC---CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC T ss_conf 9999962---66579998786677752237787500799999999963048856698875200157 No 17 >COG1418 Predicted HD superfamily hydrolase [General function prediction only] Probab=96.85 E-value=0.015 Score=36.64 Aligned_cols=130 Identities=23% Similarity=0.165 Sum_probs=98.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCC---------- Q ss_conf 1345513899898999999998865299988999999974236788898403457899999998623699---------- Q gi|254781094|r 379 FHAKIGTQGERVSRIRVLGKKIAQLIDADVALVDRAIVLSKADLCTEIVREFPELQGKIGKEYAVLQNEN---------- 448 (702) Q Consensus 379 F~~~LGs~~dK~~Ri~~la~~la~~l~~~~~~~~rAa~L~KaDL~t~mV~EFpeLQG~mG~~YA~~~ge~---------- 448 (702) |..-=++..+-+.|+..||..||...+.|...+.+||+|= |..-....+--.=+-+.|..-|...+++ T Consensus 30 ~~~~~~~~l~H~~~Va~lA~~Ia~~~g~D~~l~~~aaLLH--DIg~~~~~~~~~~h~~~gae~a~~~~~~~~~~~~~~~~ 107 (222) T COG1418 30 FRTYGQHVLEHSLRVAYLAYRIAEEEGVDPDLALRAALLH--DIGKAIDHEPGGSHAEIGAEIARKFLEDPVVINAIEAH 107 (222) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH--HHCCCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHC T ss_conf 2766505999999999999999988299999999999777--22011126775212899999999857663100167652 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHH-------HHHCCCCCCCCCCHHHHHHHHH Q ss_conf 8899999997344776777886358899999974998987-------6510567687769688887434 Q gi|254781094|r 449 ISCCDAIEEHLKPRGPLENVPTNKISITVSLADKLDTLIN-------FWAINEKPSGSKDPYALRRATL 510 (702) Q Consensus 449 ~~Va~aI~ehY~P~~~~d~lP~~~~g~ilsiaDKlDtl~g-------~f~ig~~PTGS~DPfaLRR~a~ 510 (702) +.|+.+|..|=.-....|.+|.+..|.++.-+|++|-+.. .|+.+...=--+||...||.-- T Consensus 108 ~~~~~~i~~~s~~~~~~d~~~a~~~g~~~~dad~l~~~~~~~~~~~~~~~G~~~~y~~~~~~~~~~~v~ 176 (222) T COG1418 108 HGVEEIISRHSFLVAAADALSAARPGARLQDADRLDARGAIGEARVFLFEGVEKAYAIQAGRELRRIVE 176 (222) T ss_pred CCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHCCCCCHHHHHCCC T ss_conf 020100467998841023365467777600248999999999999998715401201235176521046 No 18 >LOAD_HD consensus Probab=96.85 E-value=0.005 Score=39.82 Aligned_cols=98 Identities=30% Similarity=0.262 Sum_probs=70.6 Q ss_pred HHHHHHHHHHHHHHHHHCCCCHH----HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCH-------HHHHHH Q ss_conf 99898999999998865299988----9999999742367888984034578999999986236998-------899999 Q gi|254781094|r 387 GERVSRIRVLGKKIAQLIDADVA----LVDRAIVLSKADLCTEIVREFPELQGKIGKEYAVLQNENI-------SCCDAI 455 (702) Q Consensus 387 ~dK~~Ri~~la~~la~~l~~~~~----~~~rAa~L~KaDL~t~mV~EFpeLQG~mG~~YA~~~ge~~-------~Va~aI 455 (702) ++-+.|+..++..++..++.+.. .+..||.|- |+....+.+.|..+...|.+++...++.. ++...+ T Consensus 2 ~~Hs~~v~~~a~~l~~~~~~~~~~~~~~~~~aallH--DiGk~~~~~~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~~i 79 (114) T LOAD_HD 2 YEHSLRVAKLARLLAEELGLDEEEILELLLLAALLH--DIGKATVLELPFSHEIVGAEIARELGEREDLEAEKAEEVAAI 79 (114) T ss_pred CCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH--HCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 605799999999999992899078999999999997--603544435567628998999998614224444369999999 Q ss_pred HHHHCCCCCCCCCCCCHH----HHHHHHHHHHHHH Q ss_conf 997344776777886358----8999999749989 Q gi|254781094|r 456 EEHLKPRGPLENVPTNKI----SITVSLADKLDTL 486 (702) Q Consensus 456 ~ehY~P~~~~d~lP~~~~----g~ilsiaDKlDtl 486 (702) ..|+.+...+..-|.... +.++++||++|.+ T Consensus 80 ~~~H~~~~~~~~~~~~~~~~~~~~iv~~AD~~d~~ 114 (114) T LOAD_HD 80 LRHHELRSDGEPEPKLLGEETIAKIVKLADRLDAL 114 (114) T ss_pred HHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHCC T ss_conf 99752437885999877873778898874577409 No 19 >pfam01966 HD HD domain. HD domains are metal dependent phosphohydrolases. Probab=95.98 E-value=0.075 Score=31.78 Aligned_cols=99 Identities=22% Similarity=0.197 Sum_probs=71.2 Q ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH-----CCH--HHHHHHHHHHHHHCCCCHHH-HHHHHHH Q ss_conf 998989999999988652999889999999742367888984-----034--57899999998623699889-9999997 Q gi|254781094|r 387 GERVSRIRVLGKKIAQLIDADVALVDRAIVLSKADLCTEIVR-----EFP--ELQGKIGKEYAVLQNENISC-CDAIEEH 458 (702) Q Consensus 387 ~dK~~Ri~~la~~la~~l~~~~~~~~rAa~L~KaDL~t~mV~-----EFp--eLQG~mG~~YA~~~ge~~~V-a~aI~eh 458 (702) ++-+.|+..++..++...+.+.+.+..||.|-=.--... .. ||. ......|...++........ +.+|.+| T Consensus 2 ~~Hs~~Va~l~~~ia~~~~~~~~~l~~aaLLHDiGk~~~-~~~~~~~~~~~~~~H~~~g~~~l~~~~~~~~~~~~~i~~H 80 (108) T pfam01966 2 LEHSLRVALLARELAEELGLDRELLLLAALLHDIGKPPF-GFLEKEEDFEIFLSHAVVGAEILRELEKPLDVDLKLILEH 80 (108) T ss_pred CHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCC-CCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 679999999999999984969999999999986156788-8768889986553229999999986762429999999981 Q ss_pred HCCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 3447767778863588999999749989 Q gi|254781094|r 459 LKPRGPLENVPTNKISITVSLADKLDTL 486 (702) Q Consensus 459 Y~P~~~~d~lP~~~~g~ilsiaDKlDtl 486 (702) =.+-.....-|.+..+.++++||++|.+ T Consensus 81 h~~~~~~~~~~~~~~~~iv~~aD~~da~ 108 (108) T pfam01966 81 HESWEGAGYEPISLEARIVKLADRLDAL 108 (108) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHCC T ss_conf 6677899999999788899986487229 No 20 >PRK07152 nadD putative nicotinate-nucleotide adenylyltransferase; Validated Probab=95.97 E-value=0.11 Score=30.79 Aligned_cols=95 Identities=17% Similarity=0.151 Sum_probs=67.9 Q ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH---HHH----HHHHHHH--------HCCHHHHHHHHHHHHHHC-C-CCH Q ss_conf 9989899999999886529998899999997---423----6788898--------403457899999998623-6-998 Q gi|254781094|r 387 GERVSRIRVLGKKIAQLIDADVALVDRAIVL---SKA----DLCTEIV--------REFPELQGKIGKEYAVLQ-N-ENI 449 (702) Q Consensus 387 ~dK~~Ri~~la~~la~~l~~~~~~~~rAa~L---~Ka----DL~t~mV--------~EFpeLQG~mG~~YA~~~-g-e~~ 449 (702) |.-+.|+.+.|..+|+.-+.|.+.+.-|+.| ||. ++..-+- .+.+-|+|..|.++|+.. | .++ T Consensus 199 y~H~l~Va~~A~~LA~~~~~d~~kA~~AgllHD~aK~~~~~~~~~~~~~~~~~~~~~~~~llH~~~ga~~~k~~~~i~D~ 278 (343) T PRK07152 199 YNHCLRVAQLAVELAKKNNLDPKKAYYAGLYHDITKEWDEEELLKYLKKYKIKLNNESWYVLHSYVGAYWLKHVYGIDDE 278 (343) T ss_pred HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCH T ss_conf 99999999999999999597999999999999998159999999999981997554562212799999999998099979 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 8999999973447767778863588999999749989 Q gi|254781094|r 450 SCCDAIEEHLKPRGPLENVPTNKISITVSLADKLDTL 486 (702) Q Consensus 450 ~Va~aI~ehY~P~~~~d~lP~~~~g~ilsiaDKlDtl 486 (702) +|-.||+-|-..+- --|....||=+||+++-- T Consensus 279 eIL~AI~~HTtG~~-----~ms~LdkIiy~AD~iEp~ 310 (343) T PRK07152 279 EILNAIRNHTVLYE-----EMSTLDKIVYVADKIEPG 310 (343) T ss_pred HHHHHHHHCCCCCC-----CCCHHHHHHHEECCCCCC T ss_conf 99999996283899-----968556344200024679 No 21 >COG1713 Predicted HD superfamily hydrolase involved in NAD metabolism [Coenzyme metabolism] Probab=95.66 E-value=0.097 Score=31.04 Aligned_cols=108 Identities=23% Similarity=0.217 Sum_probs=80.4 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH---HHHHHHHHHH-------------HCCHH-HHHHHHHHHHHH-CCC Q ss_conf 89989899999999886529998899999997---4236788898-------------40345-789999999862-369 Q gi|254781094|r 386 QGERVSRIRVLGKKIAQLIDADVALVDRAIVL---SKADLCTEIV-------------REFPE-LQGKIGKEYAVL-QNE 447 (702) Q Consensus 386 ~~dK~~Ri~~la~~la~~l~~~~~~~~rAa~L---~KaDL~t~mV-------------~EFpe-LQG~mG~~YA~~-~ge 447 (702) -++-+.|+.+.|..+|+..+.|.+.+.-|+.| ||..=-..|+ .++|+ |+|..|.|+++. -|- T Consensus 18 R~~H~l~V~~~A~~LA~~y~~d~~kA~~AgilHD~aK~~p~~~~~~~~~~~~~~~~~~~~~~~llH~~vgay~~~~~fGi 97 (187) T COG1713 18 RFEHCLGVAETAIELAEAYGLDPEKAYLAGILHDIAKELPEQKLLKIAKKYGLELDLERESPLLLHGKVGAYLLKEEFGI 97 (187) T ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCHHHCCCHHHHHHHHHHHHHHHHHCC T ss_conf 99999999999999999958989999999999999961888999999998488831102585999899999999999399 Q ss_pred -CHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCH--HHHHHHHH Q ss_conf -9889999999734477677788635889999997499898765105676877696--88887434 Q gi|254781094|r 448 -NISCCDAIEEHLKPRGPLENVPTNKISITVSLADKLDTLINFWAINEKPSGSKDP--YALRRATL 510 (702) Q Consensus 448 -~~~Va~aI~ehY~P~~~~d~lP~~~~g~ilsiaDKlDtl~g~f~ig~~PTGS~DP--faLRR~a~ 510 (702) +++|-.||+-|--|+. .-|..+.||=+|||+.- |-..| --+|+.|+ T Consensus 98 ~De~VL~AI~~HTtg~~-----~mt~ldkIiyiAD~iEp------------~R~~pgv~~~R~~a~ 146 (187) T COG1713 98 KDEEVLSAIEYHTTGRK-----QMTLLDKILYVADKIEP------------GRGFPGVDKIRQIAL 146 (187) T ss_pred CCHHHHHHHHHHCCCCC-----CCCHHHHEEEEECCCCC------------CCCCCCCHHHHHHHH T ss_conf 94999999998346887-----60313110102013567------------889998345899999 No 22 >PRK13300 tRNA CCA-pyrophosphorylase; Provisional Probab=94.25 E-value=0.44 Score=26.54 Aligned_cols=37 Identities=19% Similarity=0.259 Sum_probs=24.1 Q ss_pred CHHHHHHHHCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 889988421002303134551389989899999999886529 Q gi|254781094|r 364 PLDQRMARLDMLDVVFHAKIGTQGERVSRIRVLGKKIAQLID 405 (702) Q Consensus 364 ~L~~~~~~L~~~~v~F~~~LGs~~dK~~Ri~~la~~la~~l~ 405 (702) .+.+-+.-|| .|.++.|.++-- .+++-.|.|+++.|- T Consensus 153 ~~~deVrLlK----~FlK~igvYGaE-~~~~GFSGYl~ElLi 189 (446) T PRK13300 153 KLRDEVRLLK----QFLKGIGVYGAE-LKTGGFSGYLCELLI 189 (446) T ss_pred CHHHHHHHHH----HHHHHCCCCCHH-HHCCCCHHHHHHHHH T ss_conf 2156899999----998746835456-650773578999999 No 23 >KOG1195 consensus Probab=94.01 E-value=0.49 Score=26.22 Aligned_cols=110 Identities=14% Similarity=0.129 Sum_probs=88.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHH Q ss_conf 13889999999999855653124677777385683598999999999999999999850253899999998532788875 Q gi|254781094|r 585 KGEKFLLSAKRIFQILAIEEKKNREISLEISPQYLLLEAEKRLYAVISDFGTHIQESMDHKRYHQIGDLLHSICEPIEIF 664 (702) Q Consensus 585 ~~~~l~~a~KRv~NIL~~~~k~~~~~~~~vd~~Lf~~~~Ek~L~~~~~~~~~~i~~~l~~~dy~~~L~~L~~L~~~Id~F 664 (702) .|.-+.=+.-|.+.|..+........-..+|-+++++|.=..|+..+..+..-+..+++.-+=......|..|...+..+ T Consensus 445 tg~yLQY~HsRL~Slerk~~~~~~~~l~~~d~~~l~ep~~~~Lv~~L~~fp~vve~s~e~lE~~~lV~YL~~La~a~s~~ 524 (567) T KOG1195 445 TGPYLQYTHSRLRSLERKFSDVTLDDLDEIDFSLLTEPDALLLVRLLLQFPEVVEKSMEQLEPCTLVTYLFKLAHAVSSA 524 (567) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH T ss_conf 44788887878999998625667044410216651773699999998500899999997624343799999999999867 Q ss_pred HCCCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 102703688689999999999999999986 Q gi|254781094|r 665 FDQVLVNVDDREVRDNRLSLLQYIKNIILI 694 (702) Q Consensus 665 FDnVmVm~eD~~IR~NRL~LL~~i~~~f~~ 694 (702) .+...|-+...+|--.||.|...+++.... T Consensus 525 yk~l~V~d~~~eia~aRL~Lf~AvRqVL~n 554 (567) T KOG1195 525 YKILWVKDSPMEIALARLLLFSAVRQVLNN 554 (567) T ss_pred HHHEEECCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 761013688577799999999999999996 No 24 >TIGR03671 cca_archaeal CCA-adding enzyme. Probab=93.72 E-value=0.55 Score=25.87 Aligned_cols=35 Identities=23% Similarity=0.365 Sum_probs=20.3 Q ss_pred HHHHHHHHCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 8998842100230313455138998989999999988652 Q gi|254781094|r 365 LDQRMARLDMLDVVFHAKIGTQGERVSRIRVLGKKIAQLI 404 (702) Q Consensus 365 L~~~~~~L~~~~v~F~~~LGs~~dK~~Ri~~la~~la~~l 404 (702) +.+-+.-|| .|.+..|.++.- .+++-.|.|+++.| T Consensus 149 ~~deVrLlK----~Flk~~gvYGae-~~~~GFSGYl~ELL 183 (408) T TIGR03671 149 LRDDVRLLK----QFLKGIGVYGSE-LKTRGFSGYLCELL 183 (408) T ss_pred HHHHHHHHH----HHHHHCCCCCHH-HHCCCCHHHHHHHH T ss_conf 577899999----998747845356-66067347899999 No 25 >KOG4426 consensus Probab=93.27 E-value=0.65 Score=25.38 Aligned_cols=112 Identities=21% Similarity=0.234 Sum_probs=81.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCC---CCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH Q ss_conf 388999999999985565312467777---73856835989999999999999999998502538999999985327888 Q gi|254781094|r 586 GEKFLLSAKRIFQILAIEEKKNREISL---EISPQYLLLEAEKRLYAVISDFGTHIQESMDHKRYHQIGDLLHSICEPIE 662 (702) Q Consensus 586 ~~~l~~a~KRv~NIL~~~~k~~~~~~~---~vd~~Lf~~~~Ek~L~~~~~~~~~~i~~~l~~~dy~~~L~~L~~L~~~Id 662 (702) .--++-+|.|+..|..+..+.+..... +.-.-.+.++.|.+|-..+..+...+..+..+--.....+.+..|...-. T Consensus 527 AvYLLYa~tRIcSI~rksg~d~~~l~~~lkkT~ki~LDH~~E~kLg~~lLkf~e~v~~~~~~L~lh~lC~y~y~l~t~Ft 606 (656) T KOG4426 527 AVYLLYAYTRICSIARKSGKDNVDLIKELKKTGKIALDHEKEWKLGKHLLKFAEIVEKALDSLFLHVLCDYLYELATLFT 606 (656) T ss_pred EEHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 00358999999999987177425689999856862367777889999998589999999998879999999999999999 Q ss_pred HHHCCCCCCCCCHH-----HHHHHHHHHHHHHHHHHHHCC Q ss_conf 75102703688689-----999999999999999986068 Q gi|254781094|r 663 IFFDQVLVNVDDRE-----VRDNRLSLLQYIKNIILIVIN 697 (702) Q Consensus 663 ~FFDnVmVm~eD~~-----IR~NRL~LL~~i~~~f~~iaD 697 (702) +|.|+-.|+-.|++ +..-||.|.-.....+.+-.+ T Consensus 607 ~FY~~Cyv~e~~~eg~i~~vn~sRllLCeata~VmrkcF~ 646 (656) T KOG4426 607 EFYDSCYVIEKNKEGEIPFVNMSRLLLCEATAAVMRKCFH 646 (656) T ss_pred HHHHCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 9851207714654456543106789998999999997567 No 26 >smart00471 HDc Metal dependent phosphohydrolases with conserved 'HD' motif. Includes eukaryotic cyclic nucleotide phosphodiesterases (PDEc). This profile/HMM does not detect HD homologues in bacterial glycine aminoacyl-tRNA synthetases (beta subunit). Probab=93.18 E-value=0.67 Score=25.28 Aligned_cols=106 Identities=24% Similarity=0.165 Sum_probs=68.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCC-CHHHHHHHHHHHHHHHHHHHHH-----CC--HHHHHHHHHHHHHHCCCCHHHHH-- Q ss_conf 13899898999999998865299-9889999999742367888984-----03--45789999999862369988999-- Q gi|254781094|r 384 GTQGERVSRIRVLGKKIAQLIDA-DVALVDRAIVLSKADLCTEIVR-----EF--PELQGKIGKEYAVLQNENISCCD-- 453 (702) Q Consensus 384 Gs~~dK~~Ri~~la~~la~~l~~-~~~~~~rAa~L~KaDL~t~mV~-----EF--peLQG~mG~~YA~~~ge~~~Va~-- 453 (702) .+.++-+.|+..++..++..++. +...+..||.|- |+.--.+. ++ -.-...+|.......+..+.... T Consensus 3 ~~~~~Hs~~Va~la~~la~~~~~~~~~~l~~aaLlH--DiGk~~~~~~~~~~~~~~~~h~~~~~~~~~~~~~~~~~~~~~ 80 (124) T smart00471 3 YHVFEHSLRVAQLAAALAEELGLLDIELLLLAALLH--DIGKPGTPDSFLVKTSVLEDHHFIGAEILLEEEEPRILEEIL 80 (124) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH--HHHCCCCCHHHHHCHHHHHHHHHHHHHHHHCCCCCHHHHHHH T ss_conf 049999999999999999992988499999999998--752446999998360988786999999998187547899999 Q ss_pred --HHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf --99997344776777886358899999974998987651 Q gi|254781094|r 454 --AIEEHLKPRGPLENVPTNKISITVSLADKLDTLINFWA 491 (702) Q Consensus 454 --aI~ehY~P~~~~d~lP~~~~g~ilsiaDKlDtl~g~f~ 491 (702) +|..|-....-...-+.+..+.++++||++|.+++... T Consensus 81 ~~~i~~Hh~~~~g~~~~~i~~~~~i~~~aD~~da~~~~~~ 120 (124) T smart00471 81 ATAILSHHERPDGLRGEPITLEARIVKVADRLDALRRDRR 120 (124) T ss_pred HHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCC T ss_conf 9999963798899987875878788764039777607999 No 27 >COG1746 CCA1 tRNA nucleotidyltransferase (CCA-adding enzyme) [Translation, ribosomal structure and biogenesis] Probab=89.13 E-value=1.7 Score=22.58 Aligned_cols=54 Identities=20% Similarity=0.192 Sum_probs=34.4 Q ss_pred HHHHHHCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHCC---CCHHHHHHHHHH----HHHHHHHH Q ss_conf 988421002303134551389989899999999886529---998899999997----42367888 Q gi|254781094|r 367 QRMARLDMLDVVFHAKIGTQGERVSRIRVLGKKIAQLID---ADVALVDRAIVL----SKADLCTE 425 (702) Q Consensus 367 ~~~~~L~~~~v~F~~~LGs~~dK~~Ri~~la~~la~~l~---~~~~~~~rAa~L----~KaDL~t~ 425 (702) +-+.-|| .|.++.|.++.- -|++-.|.|+|+.|- .+=..+-++|.. -+-|+... T Consensus 155 deVrLLK----~FlK~iGvYGaE-~rt~GFSGYL~ELLII~yGsFe~vl~~a~~wrp~~~ID~~~~ 215 (443) T COG1746 155 DEVRLLK----QFLKGIGVYGAE-LRTQGFSGYLCELLIIHYGSFENVLKAASRWRPGKIIDLEGH 215 (443) T ss_pred HHHHHHH----HHHHCCCCCCEE-EEECCCHHHHHHHHHHHHCCHHHHHHHHHCCCCCEEEECCCH T ss_conf 4889999----997505755502-451553588888899641659999999864489769704512 No 28 >PRK12703 tRNA 2'-O-methylase; Reviewed Probab=89.04 E-value=1.7 Score=22.54 Aligned_cols=95 Identities=23% Similarity=0.210 Sum_probs=73.3 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHC---- Q ss_conf 3899898999999998865299988999999974236788898403457899999998623699889999999734---- Q gi|254781094|r 385 TQGERVSRIRVLGKKIAQLIDADVALVDRAIVLSKADLCTEIVREFPELQGKIGKEYAVLQNENISCCDAIEEHLK---- 460 (702) Q Consensus 385 s~~dK~~Ri~~la~~la~~l~~~~~~~~rAa~L~KaDL~t~mV~EFpeLQG~mG~~YA~~~ge~~~Va~aI~ehY~---- 460 (702) .+-+-|.-+..+|-.||+..++|.+++.--|.|- |+--.-.. -=.+|+.|..-++..|.+++|+..++-|.- T Consensus 187 ~~i~hv~~v~~~a~~~a~~~~~d~~lv~aGaLLH--DiGRs~Th--~i~H~v~Ga~i~r~~g~~e~v~~I~erHiGAGit 262 (339) T PRK12703 187 LLIRHVKTVYKLAMRIADCINADRRLVAAGALLH--DIGRTKTN--GIDHAVAGAEILRKENIDDRVVSIVERHIGAGIT 262 (339) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH--HHCCCCCC--CCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCC T ss_conf 9999999999999999864286589998415775--30334326--8407776899999759988999999987447989 Q ss_pred -------CCCCCCCCCCCHHHHHHHHHHHH Q ss_conf -------47767778863588999999749 Q gi|254781094|r 461 -------PRGPLENVPTNKISITVSLADKL 483 (702) Q Consensus 461 -------P~~~~d~lP~~~~g~ilsiaDKl 483 (702) -.-+.|-+|+|..-.||+-||.| T Consensus 263 ~eEA~~lgLp~~dy~P~TlEEKIVa~ADNL 292 (339) T PRK12703 263 SEEAQKLGLPVKDYVPETIEEMIVAHADNL 292 (339) T ss_pred HHHHHHCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 899998499941148045899999814542 No 29 >KOG2796 consensus Probab=87.13 E-value=2.2 Score=21.75 Aligned_cols=26 Identities=23% Similarity=0.284 Sum_probs=20.3 Q ss_pred HHHHCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 87510270368868999999999999 Q gi|254781094|r 662 EIFFDQVLVNVDDREVRDNRLSLLQY 687 (702) Q Consensus 662 d~FFDnVmVm~eD~~IR~NRL~LL~~ 687 (702) ..|||++..||+-..+-.|--+|+.. T Consensus 272 ~r~~~~i~~~D~~~~~a~NnKALcll 297 (366) T KOG2796 272 HRFFTEILRMDPRNAVANNNKALCLL 297 (366) T ss_pred HHHHHHCCCCCCCCHHHHCHHHHHHH T ss_conf 99885042248987022051999999 No 30 >cd00077 HDc Metal dependent phosphohydrolases with conserved 'HD' motif Probab=86.76 E-value=2.3 Score=21.62 Aligned_cols=104 Identities=20% Similarity=0.114 Sum_probs=64.6 Q ss_pred HHHHHHHHHHHHHHHHHHHCCC---CHHHHHHHHHHHHHHHHH------HHHHCCHHHHHHHHHHHHHH------CCCCH Q ss_conf 3899898999999998865299---988999999974236788------89840345789999999862------36998 Q gi|254781094|r 385 TQGERVSRIRVLGKKIAQLIDA---DVALVDRAIVLSKADLCT------EIVREFPELQGKIGKEYAVL------QNENI 449 (702) Q Consensus 385 s~~dK~~Ri~~la~~la~~l~~---~~~~~~rAa~L~KaDL~t------~mV~EFpeLQG~mG~~YA~~------~ge~~ 449 (702) +.++-+.|+..++..++..++. +...+.-||.|--..-+. ..-.....-....|..++.. .+..+ T Consensus 2 ~~~~Hs~~V~~~a~~i~~~~~~~~~~~~~l~~aaLlHDiGk~~~~~~~~~~~~~~~~~H~~~g~~~~~~~~~~~~~~~~~ 81 (145) T cd00077 2 HRFEHSLRVAQLARRLAEELGLSEEDIELLRLAALLHDIGKPGTPDAITEEESELEKDHAIVGAEILRELLLEEVIKLID 81 (145) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHH T ss_conf 35189999999999999992949999999999999986032459852468999997679999999998660000000168 Q ss_pred H-HHHHHHHHHCCCCCCCCCC-------CCHHHHHHHHHHHHHHHHHH Q ss_conf 8-9999999734477677788-------63588999999749989876 Q gi|254781094|r 450 S-CCDAIEEHLKPRGPLENVP-------TNKISITVSLADKLDTLINF 489 (702) Q Consensus 450 ~-Va~aI~ehY~P~~~~d~lP-------~~~~g~ilsiaDKlDtl~g~ 489 (702) + |..++ .++.-+..+..-| .+..+.++++||++|.+... T Consensus 82 ~~v~~~~-~~h~e~~~~~~~~~~~~~~~~~~~~~i~~~aD~~d~~~~~ 128 (145) T cd00077 82 ELILAVD-ASHHERLDGLGYPDGLKGEEITLEARIVKLADRLDALRRD 128 (145) T ss_pred HHHHHHH-HHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCC T ss_conf 9999999-9845387899999999800199998999997999987269 No 31 >PRK11893 methionyl-tRNA synthetase; Reviewed Probab=86.68 E-value=2.3 Score=21.59 Aligned_cols=95 Identities=15% Similarity=0.126 Sum_probs=62.5 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 89999999999855653124677777385683598999999999999999999850253899999998532788875102 Q gi|254781094|r 588 KFLLSAKRIFQILAIEEKKNREISLEISPQYLLLEAEKRLYAVISDFGTHIQESMDHKRYHQIGDLLHSICEPIEIFFDQ 667 (702) Q Consensus 588 ~l~~a~KRv~NIL~~~~k~~~~~~~~vd~~Lf~~~~Ek~L~~~~~~~~~~i~~~l~~~dy~~~L~~L~~L~~~Id~FFDn 667 (702) .+-....|+...+.+.. +.. +...--.++.++.+.+.+.++..++..++++-+|.+++..+.++....|.||+. T Consensus 355 ~~gNl~~R~~~~~~~~~--~~~----~p~~~~~~~~d~~~~~~~~~~~~~v~~~~e~~~~~~a~~~i~~~~~~~N~y~~~ 428 (512) T PRK11893 355 DLGNLLSRTLSMIAKYF--DGK----VPEPGEFSEADAALLEKADAALEQVREAMDNFAFHKALEAILELAAEANKYIDE 428 (512) T ss_pred HHHHHHHHHHHHHHHHC--CCC----CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999998751--787----888665887689999999999999999997125999999999999999999884 Q ss_pred CC---CCCCCHHHHHHHHHHHHHH Q ss_conf 70---3688689999999999999 Q gi|254781094|r 668 VL---VNVDDREVRDNRLSLLQYI 688 (702) Q Consensus 668 Vm---Vm~eD~~IR~NRL~LL~~i 688 (702) .. ...+|..-.++=+.++..+ T Consensus 429 ~~PW~l~k~d~~~~~~il~~~~~~ 452 (512) T PRK11893 429 QAPWSLAKTDPERLATVLYTLLES 452 (512) T ss_pred CCCHHHHCCCHHHHHHHHHHHHHH T ss_conf 787533158999999999999999 No 32 >pfam11348 DUF3150 Protein of unknown function (DUF3150). This bacterial family of proteins with unknown function appears to be restricted to Proteobacteria. Probab=83.34 E-value=3.3 Score=20.58 Aligned_cols=73 Identities=18% Similarity=0.171 Sum_probs=46.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCC------------------HHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCC-CCC Q ss_conf 88999999974236788898403------------------45789999999862369988999999973447767-778 Q gi|254781094|r 408 VALVDRAIVLSKADLCTEIVREF------------------PELQGKIGKEYAVLQNENISCCDAIEEHLKPRGPL-ENV 468 (702) Q Consensus 408 ~~~~~rAa~L~KaDL~t~mV~EF------------------peLQG~mG~~YA~~~ge~~~Va~aI~ehY~P~~~~-d~l 468 (702) -..+.|.+..++.+....|=..| .+..|.+|..| .|||...++-|.-...| +.+ T Consensus 122 wa~iIr~~a~~~e~v~~~l~F~~~~~~v~~~~~~~~~~gl~~~v~gL~~~L~-------~Eva~~a~~~~~~~~~gr~~v 194 (257) T pfam11348 122 WADIIRDAAPSKEDVDKRLSFDYQAIRVQPPEFSGADDGLDEEVNGLGGQLF-------QEIAKEARKSWEKSLAGRTEV 194 (257) T ss_pred HHHHHHHCCCCHHHHHHHCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHH-------HHHHHHHHHHHHHHCCCCCCC T ss_conf 8999984499989999741761489862076533200017999987889999-------999999999998741687631 Q ss_pred CCCHHHHHHHHHHHHHHHH Q ss_conf 8635889999997499898 Q gi|254781094|r 469 PTNKISITVSLADKLDTLI 487 (702) Q Consensus 469 P~~~~g~ilsiaDKlDtl~ 487 (702) .+...+.+-.|.+|++.+. T Consensus 195 t~k~L~~lr~i~~Kl~~l~ 213 (257) T pfam11348 195 TRKTLRPLRRIRNKLDGLS 213 (257) T ss_pred CHHHHHHHHHHHHHHHHCC T ss_conf 5877788999999976201 No 33 >cd07957 Anticodon_Ia_Met Anticodon-binding domain of methionyl tRNA synthetases. This domain is found in methionyl tRNA synthetases (MetRS), which belong to the class Ia aminoacyl tRNA synthetases. It lies C-terminal to the catalytic core domain, and recognizes and specifically binds to the tRNA anticodon (CAU). MetRS catalyzes the transfer of methionine to the 3'-end of its tRNA. Probab=83.01 E-value=3.4 Score=20.49 Aligned_cols=83 Identities=12% Similarity=0.207 Sum_probs=58.9 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 89999999999855653124677777385683598999999999999999999850253899999998532788875102 Q gi|254781094|r 588 KFLLSAKRIFQILAIEEKKNREISLEISPQYLLLEAEKRLYAVISDFGTHIQESMDHKRYHQIGDLLHSICEPIEIFFDQ 667 (702) Q Consensus 588 ~l~~a~KRv~NIL~~~~k~~~~~~~~vd~~Lf~~~~Ek~L~~~~~~~~~~i~~~l~~~dy~~~L~~L~~L~~~Id~FFDn 667 (702) .+-....|+.+.+.+.. +..+. ..+. ..+.++.+.+.+.+...++...+++.+|..+++.+-.+....|.||+. T Consensus 8 ~~GNLv~Rv~~~~~k~~--~~~i~-~~~~---~~~~d~~l~~~~~~~~~~~~~~~e~~~~~~al~~i~~l~~~~N~y~~~ 81 (129) T cd07957 8 NLGNLVNRTLNMASKYF--GGVVP-EFGG---LTEEDEELLEEAEELLEEVAEAMEELEFRKALEEIMELARAANKYIDE 81 (129) T ss_pred HHHHHHHHHHHHHHHHC--CCCCC-CCCC---CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99779999999999827--99799-9877---764139999999999999999998524999999999999999999996 Q ss_pred CC----CCCCCHH Q ss_conf 70----3688689 Q gi|254781094|r 668 VL----VNVDDRE 676 (702) Q Consensus 668 Vm----Vm~eD~~ 676 (702) .. +-++|++ T Consensus 82 ~~PW~l~K~~~~~ 94 (129) T cd07957 82 TAPWKLAKEEDPE 94 (129) T ss_pred CCCHHHHHCCHHH T ss_conf 5879888230299 No 34 >PRK12268 methionyl-tRNA synthetase; Reviewed Probab=82.36 E-value=3.6 Score=20.33 Aligned_cols=75 Identities=9% Similarity=0.197 Sum_probs=54.5 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 99999999998556531246777773856835989999999999999999998502538999999985327888751027 Q gi|254781094|r 589 FLLSAKRIFQILAIEEKKNREISLEISPQYLLLEAEKRLYAVISDFGTHIQESMDHKRYHQIGDLLHSICEPIEIFFDQV 668 (702) Q Consensus 589 l~~a~KRv~NIL~~~~k~~~~~~~~vd~~Lf~~~~Ek~L~~~~~~~~~~i~~~l~~~dy~~~L~~L~~L~~~Id~FFDnV 668 (702) +-....|+...+.+. .+..++ .+. -..+.++.+.+.+.++..++..++++-+|..+++.+.++....|.||+.. T Consensus 390 ~gNl~~R~~~~~~k~--~~g~~p---~~~-~~~~~d~~l~~~~~~~~~~v~~~~e~~~~~~Al~~i~~~~~~~N~y~~~~ 463 (558) T PRK12268 390 YGNLVNRVLSFIEKN--FGGEVP---EPG-ELGEEDEELLEKFQELFKEVGEHYEAIEFKKALEEIMELARLANKYLNEK 463 (558) T ss_pred HHHHHHHHHHHHHHH--CCCCCC---CCC-CCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 999999999999986--589789---876-79856599999999999999999975179999999999999999999837 Q ss_pred C Q ss_conf 0 Q gi|254781094|r 669 L 669 (702) Q Consensus 669 m 669 (702) . T Consensus 464 ~ 464 (558) T PRK12268 464 A 464 (558) T ss_pred C T ss_conf 9 No 35 >pfam03029 ATP_bind_1 Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesized ATP binding activity. Probab=75.60 E-value=5.3 Score=19.16 Aligned_cols=66 Identities=18% Similarity=0.135 Sum_probs=43.0 Q ss_pred HHHHHHHHHHH-----HHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCC-CCCCCC-CCCCHHHHHHHHHHH Q ss_conf 99974236788-----8984034578999999986236998899999997344-776777-886358899999974 Q gi|254781094|r 414 AIVLSKADLCT-----EIVREFPELQGKIGKEYAVLQNENISCCDAIEEHLKP-RGPLEN-VPTNKISITVSLADK 482 (702) Q Consensus 414 Aa~L~KaDL~t-----~mV~EFpeLQG~mG~~YA~~~ge~~~Va~aI~ehY~P-~~~~d~-lP~~~~g~ilsiaDK 482 (702) ..-|+|+||++ +.-.++..||-.+...|.+ .+.++|.+|.|.|+. ++.... -...-...+++..|+ T Consensus 158 vnvlsK~Dl~~~~~~l~~~~d~~~l~~~l~~~~~~---L~~~l~~lledf~lv~~f~~ls~~~~~s~~~ll~~ID~ 230 (234) T pfam03029 158 VVALNKFDLLSLEFALKWFTDPDDLQLLLELDYRY---LNEAIRLALDLFYLVPRFLCDARETGESMEDLLTLIDE 230 (234) T ss_pred EEEECCHHCCCCHHHHHHHCCHHHHHHHHHHHHHH---HHHHHHHHHHHHCCCCCCEEEECCCHHHHHHHHHHHHH T ss_conf 43100041354244577752878898887887999---99999999987257366146568860279999999887 No 36 >PRK00133 metG methionyl-tRNA synthetase; Reviewed Probab=73.72 E-value=6.3 Score=18.64 Aligned_cols=27 Identities=15% Similarity=0.286 Sum_probs=11.0 Q ss_pred CCCEEEEEECCCCCCCCHHHHHHHHHH Q ss_conf 132489850577655430245678899 Q gi|254781094|r 299 LANCFILVSNIQASDGGAAIVQGNSRV 325 (702) Q Consensus 299 l~~~Fi~v~N~~~~~~~~~v~~GnerV 325 (702) +.+.|+.+-+...+-...+++.+.+-+ T Consensus 316 ~~hg~l~~~G~KmSKS~Gnvv~p~~~l 342 (666) T PRK00133 316 FAHGFLTVEGAKMSKSRGTFIWARTYL 342 (666) T ss_pred CCCCCEEECCEEEECCCCEEEEHHHHH T ss_conf 114648778806001267176179999 No 37 >pfam08668 HDOD HDOD domain. Probab=70.53 E-value=7.4 Score=18.16 Aligned_cols=81 Identities=16% Similarity=0.056 Sum_probs=53.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCC-CHHHHHHHHHHHHHH---HH----------HHHH------------HCCHHHH Q ss_conf 45513899898999999998865299-988999999974236---78----------8898------------4034578 Q gi|254781094|r 381 AKIGTQGERVSRIRVLGKKIAQLIDA-DVALVDRAIVLSKAD---LC----------TEIV------------REFPELQ 434 (702) Q Consensus 381 ~~LGs~~dK~~Ri~~la~~la~~l~~-~~~~~~rAa~L~KaD---L~----------t~mV------------~EFpeLQ 434 (702) ..+..+.+.......++..++..++. +.+.+.-|+.|.-.- |. ...+ .-|-.-. T Consensus 90 ~~~~~~w~~s~~~A~~a~~la~~~~~~~~e~a~~aGLLhdiG~l~l~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~g~~h 169 (196) T pfam08668 90 FDLKGFWRHSLACALAARLLARRLGLDDPEEAFTAGLLHDIGKLILLSLLPDEYEEILERVAAEGISLLEAERELLGTDH 169 (196) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCCHHHHHHHHHCCCH T ss_conf 55999999999999999999987699999999998899988899999887788999999987279999999999888799 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHCC Q ss_conf 999999986236998899999997344 Q gi|254781094|r 435 GKIGKEYAVLQNENISCCDAIEEHLKP 461 (702) Q Consensus 435 G~mG~~YA~~~ge~~~Va~aI~ehY~P 461 (702) +.+|.+-+++=|.++++++||+.|+.| T Consensus 170 ~~ig~~l~~~W~lP~~i~~ai~~Hh~P 196 (196) T pfam08668 170 AEVGAALLERWNLPEELVEAIAYHHDP 196 (196) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHCCC T ss_conf 999999999869899999999972798 No 38 >PRK13902 alaS lanyl-tRNA synthetase; Provisional Probab=65.44 E-value=9.3 Score=17.47 Aligned_cols=50 Identities=10% Similarity=0.345 Sum_probs=29.6 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEE-CCC---EEEEEECCCCCCCCC Q ss_conf 9899999999999999999997699803358863-074---589998495420751 Q gi|254781094|r 14 PARMQHKAAEDLSIILIGLLKEEGVIYQNMRQYW-TPH---RLFLYLKGLSPHSPE 65 (702) Q Consensus 14 Pa~~~~~~~~~l~~~~~~~l~~~~l~~~~i~~~~-TPR---RLav~i~~l~~~q~d 65 (702) |+..-+....++.+.|.+.|++++ |.-+.+++ -|| -|-+...|+..-|+- T Consensus 52 ~~~~~~~~~~eiR~~Fl~fF~~~g--H~~v~s~plvp~~~ddllftnAgm~~Fkp~ 105 (898) T PRK13902 52 PPTSKKYSLDEMREKFLSFFEKRG--HTRIKRYPVVARWRDDVYLTIASIYDFQPW 105 (898) T ss_pred CCCCCCCCHHHHHHHHHHHHHHCC--CEEECCCCCCCCCCCCEEEEECCCHHCHHH T ss_conf 876676789999999999997689--989178563789999805760351324044 No 39 >pfam03651 consensus Probab=65.36 E-value=9.3 Score=17.47 Aligned_cols=75 Identities=15% Similarity=0.143 Sum_probs=46.3 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 99999985565312467777738568359899999999999999999985025389999999853278887510270368 Q gi|254781094|r 593 AKRIFQILAIEEKKNREISLEISPQYLLLEAEKRLYAVISDFGTHIQESMDHKRYHQIGDLLHSICEPIEIFFDQVLVNV 672 (702) Q Consensus 593 ~KRv~NIL~~~~k~~~~~~~~vd~~Lf~~~~Ek~L~~~~~~~~~~i~~~l~~~dy~~~L~~L~~L~~~Id~FFDnVmVm~ 672 (702) +.|+..|-+--.+.+...+..+..+ .+++|.+.|+.+.++-..........+ ++.-.++.||-+.|- .|.|.. T Consensus 81 ~~R~~ri~~lrW~~g~~lp~~~~~~--Ls~~E~e~f~~Y~~~l~~Y~~~~~d~~----ldLt~~l~PPk~~~I-~VRvl~ 153 (193) T pfam03651 81 YDRLLRIRSLRWEYGGLLPNSLRFH--LAAEEMEWFNEYKRSLATYMSSLGDEG----LDLTQDMKPPKSLFI-EVRCLK 153 (193) T ss_pred HHHHHHHHHHHHHCCCCCCHHHHHH--CCHHHHHHHHHHHHHHHHHHHHCCCCC----CCCCCCCCCCCCCEE-EEEEEE T ss_conf 9999999999987078897898842--899999999999999999987437887----244678689977658-999970 Q ss_pred CC Q ss_conf 86 Q gi|254781094|r 673 DD 674 (702) Q Consensus 673 eD 674 (702) |- T Consensus 154 D~ 155 (193) T pfam03651 154 DY 155 (193) T ss_pred CC T ss_conf 35 No 40 >pfam01411 tRNA-synt_2c tRNA synthetases class II (A). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only alanyl-tRNA synthetases. Probab=62.51 E-value=11 Score=17.12 Aligned_cols=94 Identities=19% Similarity=0.185 Sum_probs=63.7 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCC-HHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 899999974998987651056768776968888743446899983799788-7896303649999999999987633888 Q gi|254781094|r 474 SITVSLADKLDTLINFWAINEKPSGSKDPYALRRATLGIIRIILENKIDIP-LSQFIEDQNLILFFHDRLKLYLHDRDIR 552 (702) Q Consensus 474 g~ilsiaDKlDtl~g~f~ig~~PTGS~DPfaLRR~a~GiirIl~e~~l~l~-l~~li~~~~l~~Fi~~Rl~~~L~e~G~~ 552 (702) .++-.|||-+=+++=+.+=|..||-..--|=|||-.-=.+|.....+++.+ |.+++ +-..+.+...+-+-.-. T Consensus 264 ~a~riIaDH~Ra~~f~I~DGv~PSN~grGYvlRrllRRa~r~~~~lg~~~~~L~~Lv------~~vi~~m~~~YpEL~~~ 337 (545) T pfam01411 264 EAYRVIADHTRALTFAIADGVVPSNEGRGYVLRRILRRALRHAKKLGLKEPFLYDLV------PTVIEEMGDAYPELKEK 337 (545) T ss_pred CEEEEHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHH------HHHHHHHHHHCHHHHHH T ss_conf 226651777888999864675467777761478999999999987457642099999------99999875258688986 Q ss_pred HHHHHHHHHCCCCCHHHHHHH Q ss_conf 789988541689998999999 Q gi|254781094|r 553 HDLIEAILRPENDNLLTIVDL 573 (702) Q Consensus 553 ~DiI~AVl~~~~~~~~~i~~r 573 (702) .+.|..|+...........++ T Consensus 338 ~~~I~~ii~~EE~~F~~TL~~ 358 (545) T pfam01411 338 EDTIQEILELEEERFAKTLER 358 (545) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 899999999999999999998 No 41 >PRK12267 methionyl-tRNA synthetase; Reviewed Probab=61.04 E-value=11 Score=16.94 Aligned_cols=24 Identities=17% Similarity=0.234 Sum_probs=10.6 Q ss_pred CCEEEEEECCCCCCCCHHHHHHHH Q ss_conf 324898505776554302456788 Q gi|254781094|r 300 ANCFILVSNIQASDGGAAIVQGNS 323 (702) Q Consensus 300 ~~~Fi~v~N~~~~~~~~~v~~Gne 323 (702) .+-|+.+.+...+....+++.+.+ T Consensus 289 ~hg~~~~~g~KmSKS~Gn~i~~~~ 312 (644) T PRK12267 289 AHGWWLMKDGKMSKSKGNVVDPEE 312 (644) T ss_pred CCCEEECCCCEECCCCCCEECHHH T ss_conf 044187178600553684223799 No 42 >COG2206 c-di-GMP phosphodiesterase class II (HD-GYP domain) [Signal transduction mechanisms] Probab=60.93 E-value=11 Score=16.93 Aligned_cols=99 Identities=22% Similarity=0.134 Sum_probs=66.2 Q ss_pred HHHHHHHHHHHHHHHHHCCCCHHHH---HHHHHHHHH-------------HHHHHHHHCCHHHHHHHHHHHHHHCC-CCH Q ss_conf 9989899999999886529998899---999997423-------------67888984034578999999986236-998 Q gi|254781094|r 387 GERVSRIRVLGKKIAQLIDADVALV---DRAIVLSKA-------------DLCTEIVREFPELQGKIGKEYAVLQN-ENI 449 (702) Q Consensus 387 ~dK~~Ri~~la~~la~~l~~~~~~~---~rAa~L~Ka-------------DL~t~mV~EFpeLQG~mG~~YA~~~g-e~~ 449 (702) +--+.|+..++..|++.++.+...+ ..||.|-.. --.|.-=++--..+.++|..++..-+ .+. T Consensus 150 ~~Hs~~va~~a~~ia~~lgl~~~~i~~l~~aalLHDIGKi~ip~~IL~K~g~Lt~eE~~~ik~H~~~g~~iL~~~~~~~~ 229 (344) T COG2206 150 YGHSVRVAELAEAIAKKLGLSEEKIEELALAGLLHDIGKIGIPDSILNKPGKLTEEEFEIIKKHPIYGYDILKDLPEFLE 229 (344) T ss_pred HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCCHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCH T ss_conf 50138999999999998399999999999987775255413888986799889999999998647999999986631037 Q ss_pred HHH-HHHHHHH------CCCC-CCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 899-9999973------4477-677788635889999997499898 Q gi|254781094|r 450 SCC-DAIEEHL------KPRG-PLENVPTNKISITVSLADKLDTLI 487 (702) Q Consensus 450 ~Va-~aI~ehY------~P~~-~~d~lP~~~~g~ilsiaDKlDtl~ 487 (702) .|+ .|+.-|- .|+| .|+++| ..+.|+|+||=.|.++ T Consensus 230 ~~~~~~l~HHEr~DGtGYP~GL~GeeI~--l~aRIiAVADvydAlt 273 (344) T COG2206 230 SVRAVALRHHERWDGTGYPRGLKGEEIP--LEARIIAVADVYDALT 273 (344) T ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCC--HHHHHHHHHHHHHHHH T ss_conf 8999998721146888899987756488--8999999999999985 No 43 >COG3437 Response regulator containing a CheY-like receiver domain and an HD-GYP domain [Transcription / Signal transduction mechanisms] Probab=58.29 E-value=12 Score=16.63 Aligned_cols=107 Identities=20% Similarity=0.247 Sum_probs=67.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCH---HHHHHHHHH-------------HHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 34551389989899999999886529998---899999997-------------42367888984034578999999986 Q gi|254781094|r 380 HAKIGTQGERVSRIRVLGKKIAQLIDADV---ALVDRAIVL-------------SKADLCTEIVREFPELQGKIGKEYAV 443 (702) Q Consensus 380 ~~~LGs~~dK~~Ri~~la~~la~~l~~~~---~~~~rAa~L-------------~KaDL~t~mV~EFpeLQG~mG~~YA~ 443 (702) ...=+..++-++|+...+.-+|+.++.+. ..+.-||.| .|--..|.--.|.+...-+.|..--. T Consensus 180 E~R~~etg~H~~Rv~~~~~~lAe~lgLse~~v~~i~~AapLHDIGKvaiPD~ILlKpg~Lt~ee~~imk~H~~~G~~il~ 259 (360) T COG3437 180 EVRDYETGDHLERVAQYSELLAELLGLSEEEVDLIKKAAPLHDIGKVAIPDSILLKPGKLTSEEFEIMKGHPILGAEILK 259 (360) T ss_pred HHCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCHHHCCCCCCCHHHHCCCCCCCHHHHHHHHCCHHHHHHHHH T ss_conf 84155404678999999999999967889999999960404213344677688608788987899998436587999999 Q ss_pred HCC----CCHHHHHHHHHHH----CCCC-CCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 236----9988999999973----4477-6777886358899999974998987 Q gi|254781094|r 444 LQN----ENISCCDAIEEHL----KPRG-PLENVPTNKISITVSLADKLDTLIN 488 (702) Q Consensus 444 ~~g----e~~~Va~aI~ehY----~P~~-~~d~lP~~~~g~ilsiaDKlDtl~g 488 (702) ... --.++|.--.|+| .|.+ .||++|-+ |.|++|||-.|-|++ T Consensus 260 ~s~~~mq~a~eIa~~HHErwDGsGYPdgLkGd~IPl~--arI~aiADvfDAL~s 311 (360) T COG3437 260 SSERLMQVAAEIARHHHERWDGSGYPDGLKGDEIPLS--ARIVAIADVFDALVS 311 (360) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHH--HHHHHHHHHHHHHHC T ss_conf 8899999999999986551688999987776646516--779999999999964 No 44 >COG1639 Predicted signal transduction protein [Signal transduction mechanisms] Probab=55.72 E-value=14 Score=16.35 Aligned_cols=187 Identities=15% Similarity=0.035 Sum_probs=104.2 Q ss_pred ECCCHHHHHHHHHHCCCCEEEEEEECHHHHCCCHHHHHHHHHHHCCCEEEECCCCCCCCEEEEEECCCCCCCCHHHHHHH Q ss_conf 14666788788744089846888307777128989999999840011365536886132489850577655430245678 Q gi|254781094|r 243 LVEDKDLLEEIIGLVEWVQVFMGSFDKKYLCLPEELIRLTIKTNQKCFVTRTREGVLANCFILVSNIQASDGGAAIVQGN 322 (702) Q Consensus 243 ~~~d~~Ll~Ev~~LvE~P~~~~G~F~~~fL~lP~evl~~~m~~hQkyf~~~~~~g~l~~~Fi~v~N~~~~~~~~~v~~Gn 322 (702) +..-++.+-++--+++-|.+ ++-++-+.+..-.--+ -|-.-+. =+|||=+-.++. +.....++-|- T Consensus 23 lP~lP~vv~ki~~~~~~pn~-------s~~~ia~~I~~Dp~Lt-A~ll~lA-----NS~yfg~~~~i~-tl~~Ai~rLG~ 88 (289) T COG1639 23 LPPLPDVVLKLQKACSDPNV-------SLSDIAELISQDPALT-ARLLRLA-----NSPYFGFPREIT-TLNEAIVRLGI 88 (289) T ss_pred CCCCCHHHHHHHHHHHCCCC-------CHHHHHHHHHHCHHHH-HHHHHHH-----CCHHCCCCCCCC-CHHHHHHHHHH T ss_conf 69986799999999858999-------8999999981087899-9999995-----232307997447-29999999808 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCEEHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89978764225999850300011111111110011220137889988421002303134551389989899999999886 Q gi|254781094|r 323 SRVVAARLEDALHFWKRDQNNLPNLSSLKESALKFNLDLSKPLDQRMARLDMLDVVFHAKIGTQGERVSRIRVLGKKIAQ 402 (702) Q Consensus 323 erVl~ARL~DA~FF~~~D~~~~~~~~~~~~~~~~~~~dlk~~L~~~~~~L~~~~v~F~~~LGs~~dK~~Ri~~la~~la~ 402 (702) ..|.+==+++|+--...+ ....-.+.--+.+.+.+..-++..++. T Consensus 89 ~~v~NLv~a~a~~~~~~~-----------------------------------~~~~~~~~~~~w~~a~~~A~ia~~La~ 133 (289) T COG1639 89 GLVINLVLALAEQAIQSV-----------------------------------NSSSAEDRQLFWDTAIETAMIAEGLAR 133 (289) T ss_pred HHHHHHHHHHHHHHHCCC-----------------------------------CCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999862157-----------------------------------872677899999999999999999999 Q ss_pred HCC-CCHHHHHHHHHHHHH-------------------HHHH--------HHHHCCHHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 529-998899999997423-------------------6788--------898403457899999998623699889999 Q gi|254781094|r 403 LID-ADVALVDRAIVLSKA-------------------DLCT--------EIVREFPELQGKIGKEYAVLQNENISCCDA 454 (702) Q Consensus 403 ~l~-~~~~~~~rAa~L~Ka-------------------DL~t--------~mV~EFpeLQG~mG~~YA~~~ge~~~Va~a 454 (702) .++ .+.+...-|+.|-++ ...+ +.-+=|+ .-.+|.+-+++=+.|++++++ T Consensus 134 ~~g~~~~~~~y~~gLLh~lG~l~ll~~~~~~~~~~~~~~~~~~~~~~~~~e~~~i~~--h~~Iga~llr~W~fp~~l~e~ 211 (289) T COG1639 134 ALGRADSDEAYTAGLLHNLGILVLLTDFPDHCELLDYLLALNNDELLALDEELGIFG--HASIGAYLLRRWNFPDDLIEA 211 (289) T ss_pred HHCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCC--CHHHHHHHHHHCCCCHHHHHH T ss_conf 808531778999999997149999988379887799999716676511689861265--027999999984997799999 Q ss_pred HHHHHCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 9997344776777886358899999974 Q gi|254781094|r 455 IEEHLKPRGPLENVPTNKISITVSLADK 482 (702) Q Consensus 455 I~ehY~P~~~~d~lP~~~~g~ilsiaDK 482 (702) |+-|+.|...+..+ +....+..+|+- T Consensus 212 i~~~~~~~~~~~~~--~~~~li~~~a~~ 237 (289) T COG1639 212 IRFHHNPSTEGARL--KYLDLILVLANL 237 (289) T ss_pred HHHHHCCCCCCCCH--HHHHHHHHHHHH T ss_conf 99740532125426--789999999999 No 45 >COG2316 Predicted hydrolase (HD superfamily) [General function prediction only] Probab=54.35 E-value=14 Score=16.21 Aligned_cols=141 Identities=15% Similarity=0.127 Sum_probs=94.0 Q ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 98989999999988652999889999999742367888984034578999999986236998899999997344776777 Q gi|254781094|r 388 ERVSRIRVLGKKIAQLIDADVALVDRAIVLSKADLCTEIVREFPELQGKIGKEYAVLQNENISCCDAIEEHLKPRGPLEN 467 (702) Q Consensus 388 dK~~Ri~~la~~la~~l~~~~~~~~rAa~L~KaDL~t~mV~EFpeLQG~mG~~YA~~~ge~~~Va~aI~ehY~P~~~~d~ 467 (702) --.--......++|..+|.|.+.-.-+..|--.| -+.-.-=||=+|..|-.|.+..+-+++|++||.-|-.-.+.. T Consensus 50 kHcla~eavMr~lARe~gEDEEkw~~~GlLHD~D--Ye~tqgdpEeHgl~g~eiL~~edv~eeil~ai~~H~~~~~~~-- 125 (212) T COG2316 50 KHCLAVEAVMRWLAREWGEDEEKWAVTGLLHDFD--YELTQGDPEEHGLWGVEILREEDVSEEILDAIMGHAAYTGTP-- 125 (212) T ss_pred HHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCC--HHHHCCCHHHCCCCCEEHHHHCCCCHHHHHHHHHHHHHHCCC-- T ss_conf 9999999999999999585188888874242134--775049966668403000763399989999999866650699-- Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 88635889999997499898765105676877696888874344689998379978878963036499999999999876 Q gi|254781094|r 468 VPTNKISITVSLADKLDTLINFWAINEKPSGSKDPYALRRATLGIIRIILENKIDIPLSQFIEDQNLILFFHDRLKLYLH 547 (702) Q Consensus 468 lP~~~~g~ilsiaDKlDtl~g~f~ig~~PTGS~DPfaLRR~a~GiirIl~e~~l~l~l~~li~~~~l~~Fi~~Rl~~~L~ 547 (702) |+++....+--+ |.++|+...-. ++..+-++..-+ ++|+..|++.--- T Consensus 126 -~et~~e~aL~Av---D~~tGli~AaA---------------------Lirp~K~~~vv~-------~~~lkkrFKeK~F 173 (212) T COG2316 126 -RETRLEQALFAV---DELTGLIQAAA---------------------LIRPDKDVRVVE-------LSSLKKRFKEKGF 173 (212) T ss_pred -CCCHHHHHHHHH---CCCHHHHHHHH---------------------HHCCCCCCCCCC-------HHHHHHHHHHHHH T ss_conf -545899999861---43015899998---------------------717877431004-------7999999876555 Q ss_pred HCCCCHHHHHHHHHCCC Q ss_conf 33888789988541689 Q gi|254781094|r 548 DRDIRHDLIEAILRPEN 564 (702) Q Consensus 548 e~G~~~DiI~AVl~~~~ 564 (702) -.|.+.|-|+-+-..+- T Consensus 174 AaG~~Rd~vR~~E~lG~ 190 (212) T COG2316 174 AAGVNRDEVRQAEELGV 190 (212) T ss_pred HCCCCHHHHHHHHHHCC T ss_conf 41588999999999199 No 46 >PRK03826 hypothetical protein; Provisional Probab=54.27 E-value=14 Score=16.20 Aligned_cols=136 Identities=15% Similarity=0.220 Sum_probs=84.3 Q ss_pred HHHHHHCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHH----CCCCHHHHHHHHHHHH---------HHHHHHHHHCCHHH Q ss_conf 9884210023031345513899898999999998865----2999889999999742---------36788898403457 Q gi|254781094|r 367 QRMARLDMLDVVFHAKIGTQGERVSRIRVLGKKIAQL----IDADVALVDRAIVLSK---------ADLCTEIVREFPEL 433 (702) Q Consensus 367 ~~~~~L~~~~v~F~~~LGs~~dK~~Ri~~la~~la~~----l~~~~~~~~rAa~L~K---------aDL~t~mV~EFpeL 433 (702) .|++...|++-.|.-.=-++.+-+-++..+|..+|.. .|.+. +.+|++.++= -|+.|-+=+-=|++ T Consensus 10 ~Rmk~I~RW~lm~n~~~ENvaeHS~~VA~iAh~La~I~n~~fg~~i-n~~~~a~~Al~HD~sEi~TGDipTPVKY~n~~i 88 (195) T PRK03826 10 SRLKLINRWPLMRNVRTENVSEHSLQVAMVAHALAVIKNRKFGGNL-NAERIALLAMYHDASEVLTGDLPTPVKYFNPEI 88 (195) T ss_pred HHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCC-CHHHHHHHHHHCCHHHHHCCCCCCCCCCCCHHH T ss_conf 8663321063655788450999889999999999999999818989-999999999835219886278998858689899 Q ss_pred HHHHHHHHHHHCCC-CHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf 89999999862369-988999999973447767778863588999999749989876510567687769688887 Q gi|254781094|r 434 QGKIGKEYAVLQNE-NISCCDAIEEHLKPRGPLENVPTNKISITVSLADKLDTLINFWAINEKPSGSKDPYALRR 507 (702) Q Consensus 434 QG~mG~~YA~~~ge-~~~Va~aI~ehY~P~~~~d~lP~~~~g~ilsiaDKlDtl~g~f~ig~~PTGS~DPfaLRR 507 (702) .--.+.+=...+.. =.-+-.-..+.|.|+...+.. .+..|.+|..|||+|-++-|+. +.-.|..+ |..-+ T Consensus 89 ~~~yk~iE~~a~~~ll~~LP~elq~~y~~~l~~~~~-~~~e~~iVkaAD~l~AylKc~~--E~~~GN~E-F~~a~ 159 (195) T PRK03826 89 AQEYKKIEKIAEQKLLDMLPEELQEDFAPLLDEHAA-SEEEKAIVKQADALCAYLKCLE--ELSAGNNE-FNLAK 159 (195) T ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCC-CHHHHHHHHHHHHHHHHHHHHH--HHHHCCHH-HHHHH T ss_conf 999999999999999984989999999998726778-8799999999999999999999--99919966-89999 No 47 >cd04801 CBS_pair_M50_like This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in association with the metalloprotease peptidase M50. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=51.98 E-value=12 Score=16.81 Aligned_cols=31 Identities=29% Similarity=0.498 Sum_probs=21.0 Q ss_pred HHHHH-HHHHHCCCEEEECCCCCCCCEEEEEEC Q ss_conf 99999-998400113655368861324898505 Q gi|254781094|r 277 ELIRL-TIKTNQKCFVTRTREGVLANCFILVSN 308 (702) Q Consensus 277 evl~~-~m~~hQkyf~~~~~~g~l~~~Fi~v~N 308 (702) |.+-. ....||++||+.|.+|+|.- +++..+ T Consensus 14 e~~~~~~~~~~~~~~pVvd~~g~l~G-ivt~~d 45 (114) T cd04801 14 EFVREYVLGSNQRRFVVVDNEGRYVG-IISLAD 45 (114) T ss_pred HHHHHHHHHCCCEEEEEECCCCEEEE-EEEHHH T ss_conf 99999985079668999878997999-999999 No 48 >TIGR00398 metG methionyl-tRNA synthetase; InterPro: IPR002304 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Methionyl-tRNA synthetase (6.1.1.10 from EC) is an alpha 2 dimer that belongs to class Ia. In some species (archaea, eubacteria and eukaryota) a coding sequence, similar to the C-term end of MetRS, is present as an independent gene which is a tRNA binding domain as a dimer. In eubacteria, MetRS can also be splitted in two sub-classes corresponding to the presence of one or two CXXC domain specific to zinc binding. The crystal structures of a number of methionyl-tRNA synthases are known , , .; GO: 0000166 nucleotide binding, 0004825 methionine-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006431 methionyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=51.57 E-value=16 Score=15.91 Aligned_cols=72 Identities=13% Similarity=0.182 Sum_probs=55.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCC----CCCCCC--CHHHHHHHHHHHHHHHHHH Q ss_conf 98999999999999999999850253899999998532788875102----703688--6899999999999999999 Q gi|254781094|r 621 LEAEKRLYAVISDFGTHIQESMDHKRYHQIGDLLHSICEPIEIFFDQ----VLVNVD--DREVRDNRLSLLQYIKNII 692 (702) Q Consensus 621 ~~~Ek~L~~~~~~~~~~i~~~l~~~dy~~~L~~L~~L~~~Id~FFDn----VmVm~e--D~~IR~NRL~LL~~i~~~f 692 (702) .++-..|++.+.+...++.+++++..|..|+..+-+|..--|.|||+ +++..+ ..++...=+.++..++.++ T Consensus 443 ~~~~~~l~~~~~~~~~~~~~~~~~~~f~~Al~~~~~L~~~gNkyi~~~~PW~l~k~~~~~~~l~~~~~~~~e~~~~l~ 520 (573) T TIGR00398 443 TEEDKELLKKINETLEQIDKALESFEFRKALEEILKLADRGNKYIDDNEPWELAKQSEKLKELLAVCSLVAELIRVLS 520 (573) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 235668999999999999998862038999999999999861476523887132570679999999999999999999 No 49 >KOG2531 consensus Probab=48.35 E-value=13 Score=16.41 Aligned_cols=89 Identities=16% Similarity=0.229 Sum_probs=57.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHC-C------------CCH-----HH-HHHHHHHHHHHHHHHHHHCCHH--HHHHHHHH Q ss_conf 55138998989999999988652-9------------998-----89-9999997423678889840345--78999999 Q gi|254781094|r 382 KIGTQGERVSRIRVLGKKIAQLI-D------------ADV-----AL-VDRAIVLSKADLCTEIVREFPE--LQGKIGKE 440 (702) Q Consensus 382 ~LGs~~dK~~Ri~~la~~la~~l-~------------~~~-----~~-~~rAa~L~KaDL~t~mV~EFpe--LQG~mG~~ 440 (702) .-+-.|++++||.-++.++|-.+ + .+. +. -.++-..|--||..-.-...|. .+|-+|.| T Consensus 183 ~~pe~Ye~TerISLVSsFlaSlllG~~a~id~sDgsGMNL~dIr~k~ws~~~L~~~apdL~~KL~~pv~~~~~~G~I~~Y 262 (545) T KOG2531 183 QEPEAYEKTERISLVSSFLASLLLGSYAPIDESDGSGMNLLDIRKKKWSKALLDACAPDLEEKLGKPVPPMSIAGTISKY 262 (545) T ss_pred HCHHHHHCCCEEEHHHHHHHHHHHCCCCCEECCCCCCCHHHHHHHHHHHHHHHHHHCHHHHHHHCCCCCCCCCCCCHHHH T ss_conf 69276523332104899999998266132121466675088776655009998643730888827888850015650145 Q ss_pred HHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHH Q ss_conf 98623699889999999734477677788635889999 Q gi|254781094|r 441 YAVLQNENISCCDAIEEHLKPRGPLENVPTNKISITVS 478 (702) Q Consensus 441 YA~~~ge~~~Va~aI~ehY~P~~~~d~lP~~~~g~ils 478 (702) |-++-|.++..-.. -+.||. |++..|..|. T Consensus 263 fv~r~gF~p~C~Vv-------~~tGDN-psslagL~l~ 292 (545) T KOG2531 263 FVKRYGFPPDCKVV-------PSTGDN-PSSLAGLPLR 292 (545) T ss_pred HHHHCCCCCCCEEE-------ECCCCC-HHHHHCCCCC T ss_conf 67641899998797-------357899-6775176336 No 50 >TIGR00425 CBF5 rRNA pseudouridine synthase, putative; InterPro: IPR004802 This family contains the only archaeal proteins markedly similar to bacterial TruB, the tRNA pseudouridine 55 synthase. However, among two related yeast proteins, the archaeal set matches yeast YLR175w far better than YNL292w. The first, termed centromere/microtubule binding protein 5 (CBF5), is an apparent rRNA pseudouridine synthase, while the second is the exclusive tRNA pseudouridine 55 synthase for both cytosolic and mitochondrial compartments. It is unclear whether the archaeal proteins in this family modify tRNA, rRNA, or both. . Probab=47.94 E-value=4.8 Score=19.43 Aligned_cols=16 Identities=25% Similarity=0.370 Sum_probs=7.6 Q ss_pred HHHHHHCCCCEEECHH Q ss_conf 9998620278471699 Q gi|254781094|r 207 HYIRDLEEAMVLLDPE 222 (702) Q Consensus 207 ~Y~~~L~~~~Vi~d~~ 222 (702) =|.+-|.++++.-+.+ T Consensus 206 vtLhDllDAy~fw~e~ 221 (326) T TIGR00425 206 VTLHDLLDAYVFWKED 221 (326) T ss_pred EEHHHHHEEEEEEECC T ss_conf 6042031110222037 No 51 >KOG1686 consensus Probab=47.56 E-value=16 Score=15.80 Aligned_cols=76 Identities=14% Similarity=0.236 Sum_probs=54.2 Q ss_pred HHHHHCCCCCEEECCCCCCCCCCCEEEEEECC---EEEEEECCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCEEEECH Q ss_conf 99986289851310377656767416653032---056786367778723102445822111102234457785663167 Q gi|254781094|r 129 PVAIQKVPWPKSMRWSTTHSPISAFSWIRPLK---SILCILVAEDAKEKIIDLDLKEIPCGNITYGHRFHAPHPIKVQSL 205 (702) Q Consensus 129 ~~~i~~l~~pKsMrWg~~~~~~~~~~FvRPir---wIvaLl~~~~~~~~vi~~~~~gi~s~n~T~GHRf~~~~~i~I~~a 205 (702) .++..+++|||-.--|..+. -+|.|||- ..-|.-+....+++|+-|.--.-+--+.+.|||.... .+.|.+. T Consensus 53 ~nv~d~i~l~kVlLvg~~~~----T~~grpvv~~vtv~aVv~e~g~~~kvvhfk~k~~k~tr~~~~~~qe~t-~LrIt~i 127 (151) T KOG1686 53 KNVLDSIPLPKVLLVGPVEE----TRIGRPVVKKVTVHAVVEEYGLLFKVVHFKSKQWKVTRRDLGHRQEAT-ELRITCI 127 (151) T ss_pred CCCCCCCCCCEEEEECCCCE----EEECCCCCCCCEEEEEEEECCCCCCEEEEEECCHHHHHHHHCCCCCCC-EEEEEEE T ss_conf 66566656404787667410----475673346522787743004455079998612234335420257661-7988650 Q ss_pred HHHH Q ss_conf 9999 Q gi|254781094|r 206 DHYI 209 (702) Q Consensus 206 ~~Y~ 209 (702) ..|+ T Consensus 128 t~ie 131 (151) T KOG1686 128 TGIE 131 (151) T ss_pred EEEE T ss_conf 0356 No 52 >TIGR02702 SufR_cyano iron-sulfur cluster biosynthesis transcriptional regulator SufR; InterPro: IPR014075 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] . FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins , . It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly . The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA . SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA , acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins . Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen . This entry represents members of the SufR cyanobacterial protein family of transcriptional regulators that control the SUF system. In all cases, the sufR gene is encoded near SUF system genes but in the opposite direction. This DNA-binding protein belongs to the DeoR family of helix-loop-helix proteins. All members also have a probable metal-binding motif C-X(12)-C-X(13)-C-X(14)-C near the C-terminus.. Probab=44.52 E-value=13 Score=16.57 Aligned_cols=22 Identities=5% Similarity=0.274 Sum_probs=10.2 Q ss_pred CCCCHHHHHHHHHC-CCCCHHHH Q ss_conf 38887899885416-89998999 Q gi|254781094|r 549 RDIRHDLIEAILRP-ENDNLLTI 570 (702) Q Consensus 549 ~G~~~DiI~AVl~~-~~~~~~~i 570 (702) .+|.-|+++++-.+ +.+.+-++ T Consensus 85 ~~FAl~LLdSl~etlG~~q~~~v 107 (215) T TIGR02702 85 GEFALSLLDSLAETLGPEQFEAV 107 (215) T ss_pred CHHHHHHHHHHHHHCCHHHHHHH T ss_conf 31678899988963787789999 No 53 >pfam06275 DUF1031 Protein of unknown function (DUF1031). This family consists of several Lactococcus lactis bacteriophage and Lactococcus lactis proteins of unknown function. Probab=43.57 E-value=21 Score=15.10 Aligned_cols=57 Identities=16% Similarity=0.050 Sum_probs=36.1 Q ss_pred CCHHHHHHHHHCCCCHHHHHHEEEECCCEEEEEEEEECCCCCHHHHHHHHHHHHHHCCCCCEEECCCCC Q ss_conf 987899899977998244332034069828999999158850999999999999862898513103776 Q gi|254781094|r 78 GKKAIDGFLRSTGLQTISDCQIKKDPKKGDVYLAVLRKPKRLIEDVLKTIVPVAIQKVPWPKSMRWSTT 146 (702) Q Consensus 78 pt~A~~GF~ks~gi~~~~~~~i~~~~~kg~~~~~~~~~~g~~~~eiL~~ii~~~i~~l~~pKsMrWg~~ 146 (702) |++-+.|.|++++.+... |.--+....+++|. .-|.++-.++-..++-.|-|+||+. T Consensus 8 ~lkKL~gLA~~~n~~~~h---------~~lSVkIsgRTK~n---HELsqlYlDIcnk~Nh~K~MkW~~L 64 (80) T pfam06275 8 PLKKLFGLAIKANCDVGH---------GELSVKISGRTKGN---HELSEIYLDICNKFNHGKDMKWGEL 64 (80) T ss_pred HHHHHHHHHHHCCCCCCC---------CCEEEEEEEEECCC---HHHHHHHHHHHHHHCCCCCCCHHHH T ss_conf 399998888863677788---------61468997641564---6499999999986356322729999 No 54 >pfam00599 Flu_M2 Influenza Matrix protein (M2). This protein spans the viral membrane with an extracellular amino-terminus external and a cytoplasmic carboxy-terminus. Probab=42.79 E-value=21 Score=15.02 Aligned_cols=45 Identities=24% Similarity=0.238 Sum_probs=28.4 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5676877696888874344689998379978878963036499999999999876 Q gi|254781094|r 493 NEKPSGSKDPYALRRATLGIIRIILENKIDIPLSQFIEDQNLILFFHDRLKLYLH 547 (702) Q Consensus 493 g~~PTGS~DPfaLRR~a~GiirIl~e~~l~l~l~~li~~~~l~~Fi~~Rl~~~L~ 547 (702) ++.-++|+||...--+-+||+.+|+= |-+.-....+..|++.=|+ T Consensus 16 ecrc~dssdplv~aa~iigilhlilw----------ildrlffkciyrr~kyglk 60 (97) T pfam00599 16 ECRCNDSSDLLVAAASIIGILHLILW----------IFDRLFFKCAYRRFKHGLK 60 (97) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHCCC T ss_conf 11048988608899999999999999----------9999999999999874024 No 55 >COG0317 SpoT Guanosine polyphosphate pyrophosphohydrolases/synthetases [Signal transduction mechanisms / Transcription] Probab=41.91 E-value=22 Score=14.93 Aligned_cols=75 Identities=24% Similarity=0.337 Sum_probs=55.8 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 99742367888984034578999999986236998899999997344776777886358899999974998987651056 Q gi|254781094|r 415 IVLSKADLCTEIVREFPELQGKIGKEYAVLQNENISCCDAIEEHLKPRGPLENVPTNKISITVSLADKLDTLINFWAINE 494 (702) Q Consensus 415 a~L~KaDL~t~mV~EFpeLQG~mG~~YA~~~ge~~~Va~aI~ehY~P~~~~d~lP~~~~g~ilsiaDKlDtl~g~f~ig~ 494 (702) ..--|.||-...|+=|+ |.|--=+||.. +..|-.|=.+.|=+|.=.+|. T Consensus 377 ~e~~k~dlf~d~VyvfT-----------------------------PkG~vi~LP~G--atplDFAY~vHt~iG~~c~gA 425 (701) T COG0317 377 LEQLKSDLFPDRVYVFT-----------------------------PKGKVIDLPKG--ATPLDFAYAVHTDIGHRCIGA 425 (701) T ss_pred HHHHHHCCCCCEEEEEC-----------------------------CCCCEEECCCC--CCHHHHHHHHHCHHCCEEEEE T ss_conf 99976210475289988-----------------------------99977967999--940421243304002504677 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 76877696888874344689998379 Q gi|254781094|r 495 KPSGSKDPYALRRATLGIIRIILENK 520 (702) Q Consensus 495 ~PTGS~DPfaLRR~a~GiirIl~e~~ 520 (702) +-.|--=|+--+=..--+|.|+..++ T Consensus 426 kVnG~ivpl~~~Lk~Gd~VEIit~k~ 451 (701) T COG0317 426 KVNGRIVPLTTKLQTGDQVEIITSKH 451 (701) T ss_pred EECCEEECCCEECCCCCEEEEEECCC T ss_conf 88999814650347899899994899 No 56 >pfam10490 Rb-bdg_C_Cenp-F Rb-binding domain of kinetochore protein Cenp-F/LEK1. Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway. This domain is at the very C-terminus of the C-terminal coiled-coil, and is one of the key Rb-binding domains. Probab=41.67 E-value=5.1 Score=19.27 Aligned_cols=18 Identities=50% Similarity=0.979 Sum_probs=16.0 Q ss_pred CCCCCCCCHHHHHHHHHH Q ss_conf 676877696888874344 Q gi|254781094|r 494 EKPSGSKDPYALRRATLG 511 (702) Q Consensus 494 ~~PTGS~DPfaLRR~a~G 511 (702) -+|||-.-||-|||..++ T Consensus 27 DIPtg~~sPyilRRttl~ 44 (49) T pfam10490 27 DIPTGPSSPYILRRTTLA 44 (49) T ss_pred CCCCCCCCCEEEEEECCC T ss_conf 588888776066630234 No 57 >pfam02728 Cu_amine_oxidN3 Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrogen peroxide. The enzymes are dimers of identical 70-90 kDa subunits, each of which contains a single copper ion and a covalently bound cofactor formed by the post-translational modification of a tyrosine side chain to 2,4,5-trihydroxyphenylalanine quinone (TPQ). Probab=39.54 E-value=24 Score=14.68 Aligned_cols=28 Identities=11% Similarity=0.392 Sum_probs=23.8 Q ss_pred EEEEECCEEEEEECCCCCCCCEEEEEECCC Q ss_conf 665303205678636777872310244582 Q gi|254781094|r 154 SWIRPLKSILCILVAEDAKEKIIDLDLKEI 183 (702) Q Consensus 154 ~FvRPirwIvaLl~~~~~~~~vi~~~~~gi 183 (702) -|+|||-.+.++.|-+ ..+|+.++..|+ T Consensus 69 ~Ya~PieGl~~~vDl~--~~~Vi~i~D~g~ 96 (101) T pfam02728 69 FYAHPIEGLEILVDLD--AKKVIEITDRGV 96 (101) T ss_pred CCCCCCCCEEEEEECC--CCEEEEEEECCC T ss_conf 0412115339999898--898999984895 No 58 >PRK07568 aspartate aminotransferase; Provisional Probab=38.47 E-value=25 Score=14.57 Aligned_cols=38 Identities=21% Similarity=0.219 Sum_probs=24.6 Q ss_pred CEEEECHHHHHHHHCCCCEEECHHHHHHHHHHHHHHHHHHCCCEECCCH Q ss_conf 5663167999986202784716999999999999998885398214666 Q gi|254781094|r 199 PIKVQSLDHYIRDLEEAMVLLDPECRRNAILNDAHRLASAVGLELVEDK 247 (702) Q Consensus 199 ~i~I~~a~~Y~~~L~~~~Vi~d~~~R~~~I~~~i~~~a~~~~~~~~~d~ 247 (702) .+.+++|.+ --+.+.+.++|+++ .++|+++|+.++.|+ T Consensus 165 ~ii~~~P~N------PTG~v~s~~~~~~l-----~~la~~~~~~ii~De 202 (396) T PRK07568 165 AILISNPGN------PTGVVYTKEELRQL-----ADIAKEHDLFLISDE 202 (396) T ss_pred EEEECCCCC------CCCCCCCHHHHHHH-----HHHHHHHCEEEECCC T ss_conf 998589929------88754015789999-----999987106674245 No 59 >COG2838 Icd Monomeric isocitrate dehydrogenase [Energy production and conversion] Probab=36.48 E-value=13 Score=16.42 Aligned_cols=85 Identities=18% Similarity=0.109 Sum_probs=52.2 Q ss_pred EEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCEEEEEECCCCCCCCC--------------EEEEE Q ss_conf 1640020169899999999999999999997699803358863074589998495420751--------------00230 Q gi|254781094|r 5 LLEIYSEEIPARMQHKAAEDLSIILIGLLKEEGVIYQNMRQYWTPHRLFLYLKGLSPHSPE--------------KIEER 70 (702) Q Consensus 5 LlEIgtEElPa~~~~~~~~~l~~~~~~~l~~~~l~~~~i~~~~TPRRLav~i~~l~~~q~d--------------~~~e~ 70 (702) ++=-.|.|-|+- +..+|.-++.......+|.-+.-..-..-|=||-+-+-|.+.|.- ....+ T Consensus 8 iiyT~TDEaP~L----ATysllPIv~Af~~~AgI~VetsDISlagRILA~Fpe~Lte~Qrv~d~LaeLgel~k~pdaNiI 83 (744) T COG2838 8 IIYTLTDEAPAL----ATYSLLPIVKAFAAPAGIDVETSDISLAGRILAEFPEYLTEEQRVPDALAELGELTKLPDANII 83 (744) T ss_pred EEEEECCCCHHH----HHHHHHHHHHHHHCCCCCCEEECCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCEE T ss_conf 899834752677----7777789999961204775132222178889875356642541263689999887538887745 Q ss_pred CCCCCCC--C-CHHHHHHHHHCCCCH Q ss_conf 2789858--9-878998999779982 Q gi|254781094|r 71 LGPRVGA--G-KKAIDGFLRSTGLQT 93 (702) Q Consensus 71 kGP~~g~--p-t~A~~GF~ks~gi~~ 93 (702) |-|..++ | -+|+..=++.+|-.. T Consensus 84 KlPNISASvPQLkaAIkELQ~kGy~l 109 (744) T COG2838 84 KLPNISASVPQLKAAIKELQDKGYAL 109 (744) T ss_pred ECCCCCCCCHHHHHHHHHHHHCCCCC T ss_conf 36875444277999999998658768 No 60 >KOG3066 consensus Probab=34.51 E-value=28 Score=14.16 Aligned_cols=164 Identities=17% Similarity=0.213 Sum_probs=90.3 Q ss_pred EHHHHHHHHHHHHHHHHHHHHHHHHHC-------C-------CCHHHHHHHHHHHHHHHHHHHHHCCH----HHHHHHHH Q ss_conf 313455138998989999999988652-------9-------99889999999742367888984034----57899999 Q gi|254781094|r 378 VFHAKIGTQGERVSRIRVLGKKIAQLI-------D-------ADVALVDRAIVLSKADLCTEIVREFP----ELQGKIGK 439 (702) Q Consensus 378 ~F~~~LGs~~dK~~Ri~~la~~la~~l-------~-------~~~~~~~rAa~L~KaDL~t~mV~EFp----eLQG~mG~ 439 (702) +|+.-|-.-.||-+|+.+++..|.-.- + ....+.++...|-|. --.||- ||||.=-- T Consensus 41 sf~~~L~~~~dKrEriVklSRdITi~SKr~IFllHr~ss~~~~e~~l~~~~~~le~v-----r~k~f~~l~~EL~G~d~~ 115 (271) T KOG3066 41 SFKNFLQEDQDKRERIVKLSRDITIQSKRMIFLLHRTSSSGFPEPKLFDRTSILEKV-----RHKEFESLKRELAGLDAD 115 (271) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHEECCCEEEEEEEECCCCCCCCHHHHHHHHHHHHH-----HHHHHHHHHHHHCCCCHH T ss_conf 999999873788899995565403111003566653156898323445434588999-----999999999996077688 Q ss_pred HHHHH--CCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHH----------- Q ss_conf 99862--36998899999997344776777886358899999974998987651056768776968888----------- Q gi|254781094|r 440 EYAVL--QNENISCCDAIEEHLKPRGPLENVPTNKISITVSLADKLDTLINFWAINEKPSGSKDPYALR----------- 506 (702) Q Consensus 440 ~YA~~--~ge~~~Va~aI~ehY~P~~~~d~lP~~~~g~ilsiaDKlDtl~g~f~ig~~PTGS~DPfaLR----------- 506 (702) .+.++ .|.-+-|-.+-+.|+.-.+.- . -+|+|-.+ ++|-||. |-|| T Consensus 116 kf~rA~t~GlQEYVEAvtF~~f~lsgtL-----------c----~~dein~~----lvpl~~~--~rl~in~iDYvLGva 174 (271) T KOG3066 116 KFSRACTHGLQEYVEAVTFKFFLLSGTL-----------C----QTDEINSC----LVPLDSS--FRLSINFIDYVLGVA 174 (271) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHCCC-----------C----CHHHHHHE----ECCCCCC--CCEEEEHHHHHHHHH T ss_conf 8887604119999999999999972660-----------1----12326531----1336996--414422799898886 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHC Q ss_conf 7434468999837997887896303649999999999987633888789988541689998999999999999871 Q gi|254781094|r 507 RATLGIIRIILENKIDIPLSQFIEDQNLILFFHDRLKLYLHDRDIRHDLIEAILRPENDNLLTIVDLIKHLNEFFS 582 (702) Q Consensus 507 R~a~GiirIl~e~~l~l~l~~li~~~~l~~Fi~~Rl~~~L~e~G~~~DiI~AVl~~~~~~~~~i~~r~~aL~~f~~ 582 (702) -..--+.|+++.+.=+-++..+ ..+..|+.+-.++.. -+....++++++.++...+.+-+. T Consensus 175 DlTGElMRm~I~~~s~g~I~~~---~~~~qFlRq~h~~~s------------~i~~~~~~~ye~~~Kl~vm~qSi~ 235 (271) T KOG3066 175 DLTGELMRMLITNGSKGSIQQL---TQQVQFLRQLHKNCS------------EIEHLPSKKYELQQKLSVMEQSIS 235 (271) T ss_pred HHHHHHHHHHHHCCCCCCHHHH---HHHHHHHHHHHHHHH------------HHCCCCCCHHHHHHHHHHHHHHHH T ss_conf 6278999999816767326639---999999999973012------------332589965889999999999999 No 61 >COG5514 Uncharacterized conserved protein [Function unknown] Probab=34.42 E-value=29 Score=14.15 Aligned_cols=43 Identities=16% Similarity=0.254 Sum_probs=20.0 Q ss_pred EEEEECCCEEEEEECCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHCCC Q ss_conf 5886307458999849542075100230278985898789989997799 Q gi|254781094|r 43 MRQYWTPHRLFLYLKGLSPHSPEKIEERLGPRVGAGKKAIDGFLRSTGL 91 (702) Q Consensus 43 i~~~~TPRRLav~i~~l~~~q~d~~~e~kGP~~g~pt~A~~GF~ks~gi 91 (702) +.+|.-||-+|+..+|-+..|.-....-+|| -|...|+.+++- T Consensus 118 ~~sfcIPRgvc~~AeG~a~~t~~~~t~t~G~------iae~sfm~kn~k 160 (203) T COG5514 118 MQSFCIPRGVCLLAEGWANDTKFSVTATRGD------IAETSFMDKNFK 160 (203) T ss_pred EEEEECCCEEEEEEECCCCCCEEEEECCCCC------CHHHHHHHHCCC T ss_conf 8865213327998113567712455504677------115667774587 No 62 >TIGR00277 HDIG uncharacterized domain HDIG; InterPro: IPR006675 This domain is found in a few known nucleotidyltransferes and in a large number of uncharacterised proteins. It contains four widely separated His residues, the second of which is part of an invariant dipeptide His-Asp in a region matched approximately by the motif HDIG. . Probab=34.09 E-value=29 Score=14.12 Aligned_cols=76 Identities=22% Similarity=0.159 Sum_probs=58.6 Q ss_pred HHHHHHHHHHHHHHHHHH---HCC--CCHHHHHHHHHHHHHHHHHHHH-------------------HCCHHHHHHHHHH Q ss_conf 389989899999999886---529--9988999999974236788898-------------------4034578999999 Q gi|254781094|r 385 TQGERVSRIRVLGKKIAQ---LID--ADVALVDRAIVLSKADLCTEIV-------------------REFPELQGKIGKE 440 (702) Q Consensus 385 s~~dK~~Ri~~la~~la~---~l~--~~~~~~~rAa~L~KaDL~t~mV-------------------~EFpeLQG~mG~~ 440 (702) .+++.+.-+..++..++. .++ .+.+.+.+++.|--..-...-- ...-.-.+..|.. T Consensus 4 ~~~~h~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~hd~g~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~h~~~g~~ 83 (104) T TIGR00277 4 GVLEHSLEVAKLALALAELLRELGPEADVELARRGALLHDIGKPLTHEGFPEGQFGDNKHDLNPYLSKVLFLSHAVVGAE 83 (104) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 05678799999999999888861332358899987766531000013555566567751001301222221235678899 Q ss_pred HHHHCCCCHHHHHHHHHHHC Q ss_conf 98623699889999999734 Q gi|254781094|r 441 YAVLQNENISCCDAIEEHLK 460 (702) Q Consensus 441 YA~~~ge~~~Va~aI~ehY~ 460 (702) -++..|+++++...+.+|.- T Consensus 84 ~~~~~~~~~~~~~~~~~h~~ 103 (104) T TIGR00277 84 IARKYGEPPEVVDIIAEHHG 103 (104) T ss_pred HHHHHCCCHHHHHHHHHHHC T ss_conf 99871671578999987504 No 63 >COG0712 AtpH F0F1-type ATP synthase, delta subunit (mitochondrial oligomycin sensitivity protein) [Energy production and conversion] Probab=33.32 E-value=30 Score=14.03 Aligned_cols=15 Identities=20% Similarity=0.288 Sum_probs=7.1 Q ss_pred HHHHHHHHHHHHHCC Q ss_conf 874344689998379 Q gi|254781094|r 506 RRATLGIIRIILENK 520 (702) Q Consensus 506 RR~a~GiirIl~e~~ 520 (702) ||-|-.+.++-.+++ T Consensus 9 ~~YA~ALf~~A~e~~ 23 (178) T COG0712 9 RRYAKALFELAEEKG 23 (178) T ss_pred HHHHHHHHHHHHHHC T ss_conf 999999999999874 No 64 >TIGR02152 D_ribokin_bact ribokinase; InterPro: IPR011877 This entry describes ribokinase, an enzyme catalyzing the first step in ribose catabolism. The rbsK gene encoding ribokinase typically is found with ribose transport genes. Ribokinase belongs to the carbohydrate kinase pfkB family. ; GO: 0004747 ribokinase activity, 0006014 D-ribose metabolic process. Probab=33.22 E-value=30 Score=14.02 Aligned_cols=64 Identities=16% Similarity=0.218 Sum_probs=50.4 Q ss_pred CCCEEEECHHHHHHHHCCCCEEECHHHHHHHHHHHHHHHHHHCCCEECCCH-----HHH--HHHHHHCCCC Q ss_conf 785663167999986202784716999999999999998885398214666-----788--7887440898 Q gi|254781094|r 197 PHPIKVQSLDHYIRDLEEAMVLLDPECRRNAILNDAHRLASAVGLELVEDK-----DLL--EEIIGLVEWV 260 (702) Q Consensus 197 ~~~i~I~~a~~Y~~~L~~~~Vi~d~~~R~~~I~~~i~~~a~~~~~~~~~d~-----~Ll--~Ev~~LvE~P 260 (702) +..+.-...+.|++.+.+..+++-|-|=-..-...+.++|+++|.+++.|+ .|= +|...+|.|= T Consensus 114 N~~l~~~~~~~~~~~i~~s~~~~~QlEiP~etv~~a~~~a~~~G~~viLNPAPA~~~~~~m~e~l~~~D~i 184 (303) T TIGR02152 114 NAELTPEDIDAAEALIAESDIVLLQLEIPLETVLEALKIAKKHGVKVILNPAPAQKKLPKMDELLSLVDII 184 (303) T ss_pred CCCCCHHHHHHHHHHHHHCCEEEEECCCCHHHHHHHHHHHHHCCCCEEECCCCCCCCCCHHHHHHHHCCEE T ss_conf 64579889999998774688899723788899999999999658989866888887875489898642464 No 65 >pfam10011 DUF2254 Predicted membrane protein (DUF2254). Members of this family of bacterial proteins comprises various hypothetical and putative membrane proteins. Their exact function, has not, as yet, been defined. Probab=32.48 E-value=31 Score=13.94 Aligned_cols=25 Identities=24% Similarity=0.474 Sum_probs=12.8 Q ss_pred HHHHHHHHHCCCCCCCCCCH-HHHHH Q ss_conf 99898765105676877696-88887 Q gi|254781094|r 483 LDTLINFWAINEKPSGSKDP-YALRR 507 (702) Q Consensus 483 lDtl~g~f~ig~~PTGS~DP-faLRR 507 (702) .+.+..+|.+|..-|-.+|| ||+|. T Consensus 253 ~~~l~~~~~ig~~Rt~~QD~~Fgl~~ 278 (369) T pfam10011 253 REALRAAFTIGRERTFEQDPRFGLRQ 278 (369) T ss_pred HHHHHHHEEECCCCCHHHCHHHHHHH T ss_conf 99999884367565820094267999 No 66 >PRK10905 hypothetical protein; Validated Probab=31.90 E-value=31 Score=13.88 Aligned_cols=33 Identities=12% Similarity=0.225 Sum_probs=19.4 Q ss_pred CHHHHHHHHHCCCCHHHHHHEEEECCCEEEEEEEE Q ss_conf 87899899977998244332034069828999999 Q gi|254781094|r 79 KKAIDGFLRSTGLQTISDCQIKKDPKKGDVYLAVL 113 (702) Q Consensus 79 t~A~~GF~ks~gi~~~~~~~i~~~~~kg~~~~~~~ 113 (702) ...+.+|+|++++. +-..++..-++..||..+. T Consensus 253 ~~~l~a~ak~~~L~--ny~VY~T~RnGkpWYVLv~ 285 (323) T PRK10905 253 YDNLNGWAKKENLK--NYVVYETTRNGQPWYVLVS 285 (323) T ss_pred HHHHHHHHHHHCCC--CEEEEEEECCCCCEEEEEE T ss_conf 78899999983678--6278774017972599984 No 67 >PRK09775 hypothetical protein; Provisional Probab=30.19 E-value=6.5 Score=18.55 Aligned_cols=52 Identities=10% Similarity=0.088 Sum_probs=20.0 Q ss_pred HHHHHHHHHHHCCCEECCCHHHHHHHHHHCCCCEEEEEEECHHHHCCCHHHH Q ss_conf 9999999888539821466678878874408984688830777712898999 Q gi|254781094|r 228 ILNDAHRLASAVGLELVEDKDLLEEIIGLVEWVQVFMGSFDKKYLCLPEELI 279 (702) Q Consensus 228 I~~~i~~~a~~~~~~~~~d~~Ll~Ev~~LvE~P~~~~G~F~~~fL~lP~evl 279 (702) ++..+........+...+-+.-.+.-...-.-|-+-...=...|..|-++++ T Consensus 144 ~L~AL~~~G~D~~GNlliGe~a~~~~l~~~~~~~i~~~~k~~~Y~~LA~~al 195 (443) T PRK09775 144 VLYALTLFGGDYPGNWLVGEGAYQRWLAAPDPPAIPLDQKLTRYLQLAEDAL 195 (443) T ss_pred HHHHHHHHCCCCCCCEEECHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 9999998288888865437889999970888877580333778999999986 No 68 >COG0013 AlaS Alanyl-tRNA synthetase [Translation, ribosomal structure and biogenesis] Probab=29.53 E-value=34 Score=13.61 Aligned_cols=48 Identities=10% Similarity=0.198 Sum_probs=27.6 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEE-CCC-E--EEEEECCCCCCCCCEEEE Q ss_conf 9999999999999997699803358863-074-5--899984954207510023 Q gi|254781094|r 20 KAAEDLSIILIGLLKEEGVIYQNMRQYW-TPH-R--LFLYLKGLSPHSPEKIEE 69 (702) Q Consensus 20 ~~~~~l~~~~~~~l~~~~l~~~~i~~~~-TPR-R--Lav~i~~l~~~q~d~~~e 69 (702) .....+.+.|.+.|++.+ |.-+.+++ -|| | |-+...|+..-++-..-. T Consensus 6 ~t~~EiR~~FL~FF~~kg--H~~v~s~slVP~nDptLLftnAGm~~FK~~f~g~ 57 (879) T COG0013 6 LTTNEIRQKFLDFFEKKG--HTVVPSSPLVPRNDPTLLFTNAGMVQFKPYFTGG 57 (879) T ss_pred CCHHHHHHHHHHHHHHCC--CEECCCCCCCCCCCCCEEEEECCCCCCHHHHCCC T ss_conf 779999999999998779--7345788867899997689604630141132178 No 69 >pfam03971 IDH Monomeric isocitrate dehydrogenase. NADP(+)-dependent isocitrate dehydrogenase (ICD) is an important enzyme of the intermediary metabolism, as it controls the carbon flux within the citric acid cycle and supplies the cell with 2-oxoglutarate EC:1.1.1.42 and NADPH for biosynthetic purposes. Probab=29.25 E-value=35 Score=13.58 Aligned_cols=82 Identities=18% Similarity=0.114 Sum_probs=57.8 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCEEEEEECCCCCCCC--------------CEEEEECCCCC Q ss_conf 2016989999999999999999999769980335886307458999849542075--------------10023027898 Q gi|254781094|r 10 SEEIPARMQHKAAEDLSIILIGLLKEEGVIYQNMRQYWTPHRLFLYLKGLSPHSP--------------EKIEERLGPRV 75 (702) Q Consensus 10 tEElPa~~~~~~~~~l~~~~~~~l~~~~l~~~~i~~~~TPRRLav~i~~l~~~q~--------------d~~~e~kGP~~ 75 (702) |.|-|+- |.-+|.-++....+..+|..+.-..--.-|=|+.+-+.|.+.|. .....+|-|.+ T Consensus 7 tDEAPaL----AT~SlLPIv~aF~~~agi~vet~DISLAgRIlA~Fpe~L~~~Qrv~D~LaeLGeL~~~peANIIKLPNI 82 (735) T pfam03971 7 TDEAPAL----ATYSLLPIVKAFAASAGIDVETSDISLAGRILANFPDYLTEEQRVPDDLAELGELTKSPDANIIKLPNI 82 (735) T ss_pred CCCHHHH----HHHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHCCCCCCHHHCCCCHHHHHHHHHCCCCCCEEECCCC T ss_conf 7626888----777778999997313187477300459999998474438764258718999888737988777657866 Q ss_pred CCC---CHHHHHHHHHCCCCHHH Q ss_conf 589---87899899977998244 Q gi|254781094|r 76 GAG---KKAIDGFLRSTGLQTIS 95 (702) Q Consensus 76 g~p---t~A~~GF~ks~gi~~~~ 95 (702) ++- -+|+..=++++|.+..+ T Consensus 83 SASiPQL~aaI~ELQ~qGy~lPd 105 (735) T pfam03971 83 SASVPQLKAAIKELQDKGYAVPD 105 (735) T ss_pred CCCHHHHHHHHHHHHHCCCCCCC T ss_conf 56669999999999966999999 No 70 >TIGR00718 sda_alpha L-serine dehydratase, iron-sulfur-dependent, alpha subunit; InterPro: IPR004642 L-serine dehydratase converts serine into pyruvate in the gluconeogenesis pathway from serine. This model describes the alpha chain of an iron-sulphur-dependent L-serine dehydratase, found in Bacillus subtilis. A fairly deep split in a UPGMA tree separates members of this family of alpha chains from the homologous region of single chain forms such as found in Escherichia coli. This family of enzymes is not homologous to the pyridoxal phosphate-dependent threonine deaminases and eukaryotic serine deaminases. Synonym(s): Serine deaminase, L-hydroxyaminoacid dehydratase ; GO: 0003941 L-serine ammonia-lyase activity, 0006094 gluconeogenesis. Probab=29.13 E-value=14 Score=16.24 Aligned_cols=16 Identities=25% Similarity=0.412 Sum_probs=7.1 Q ss_pred HHHHHHHHHHHHCCCC Q ss_conf 7434468999837997 Q gi|254781094|r 507 RATLGIIRIILENKID 522 (702) Q Consensus 507 R~a~GiirIl~e~~l~ 522 (702) |.+--||.|.-|+++. T Consensus 3 ~~~~E~I~~~~E~~I~ 18 (301) T TIGR00718 3 RTVKELIDVTKEKQIK 18 (301) T ss_pred HHHHHHHHHHCCCCEE T ss_conf 1478899764147501 No 71 >TIGR01817 nifA Nif-specific regulatory protein; InterPro: IPR010113 This entry represents NifA, a DNA-binding regulatory protein for nitrogen fixation. Not included in this group are: the homologue in Aquifex aeolicus (which lacks nitrogenase), transcriptional activators of alternative nitrogenases (VFe or FeFe instead of MoFe), and truncated forms. In diazotrophic proteobacteria, the sigma54-dependent activator NifA activates transcription of the nif (nitrogen fixation) genes by a conserved mechanism common to members of the enhancer binding protein family. Although NifA proteins have similar domain structures, both transcriptional regulation of nifA expression and posttranslational regulation of NifA activity by oxygen and fixed nitrogen vary significantly from one organism to another. In Klebsiella pneumoniae and Azotobacter vinelandii, nifA is co-ordinately transcribed with a second gene, nifL, whose product inhibits NifA activity in response to oxygen and fixed nitrogen .; GO: 0003677 DNA binding, 0016563 transcription activator activity, 0009399 nitrogen fixation. Probab=28.66 E-value=35 Score=13.52 Aligned_cols=203 Identities=23% Similarity=0.235 Sum_probs=100.4 Q ss_pred CEEECHHHHHHHHHHHHHHHHHH-----CCCEECCC---HHHHHHHHHHCCCCEEEEEEECHHHHCCCHHHHHHHHHHHC Q ss_conf 84716999999999999998885-----39821466---67887887440898468883077771289899999998400 Q gi|254781094|r 216 MVLLDPECRRNAILNDAHRLASA-----VGLELVED---KDLLEEIIGLVEWVQVFMGSFDKKYLCLPEELIRLTIKTNQ 287 (702) Q Consensus 216 ~Vi~d~~~R~~~I~~~i~~~a~~-----~~~~~~~d---~~Ll~Ev~~LvE~P~~~~G~F~~~fL~lP~evl~~~m~~hQ 287 (702) ..|+....|-+..++++-..-.+ +|...+.| +.++-.-++..+-+..=+|+=+-+| ..|+-+|=.+|.+ + T Consensus 11 SkiL~~~~~L~~tL~~Vl~vL~~~l~m~~G~i~l~d~ege~~~~~a~~~~~e~~~~lg~~~~ry-~vgeg~ig~i~~~-G 88 (574) T TIGR01817 11 SKILSAPTRLEKTLANVLNVLSAMLQMRHGLISLSDSEGEPLLVAAIGWSEEEFAQLGSDDIRY-RVGEGAIGQIVAT-G 88 (574) T ss_pred HHHHCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCEEEEEEECCCCCCCCCCCCCCEE-CCCCCHHHHHHHC-C T ss_conf 9985170237899999999998852554017898678886011014303861011234688502-4734023566316-8 Q ss_pred CCEEEECC----------------CCCCCCEEEEEECCC-CCCCCHHHHHHHHHHHHH-HHHH--HHHHHHCCCCCCCCC Q ss_conf 11365536----------------886132489850577-655430245678899787-6422--599985030001111 Q gi|254781094|r 288 KCFVTRTR----------------EGVLANCFILVSNIQ-ASDGGAAIVQGNSRVVAA-RLED--ALHFWKRDQNNLPNL 347 (702) Q Consensus 288 kyf~~~~~----------------~g~l~~~Fi~v~N~~-~~~~~~~v~~GnerVl~A-RL~D--A~FF~~~D~~~~~~~ 347 (702) +-+.|.|- .|...=-||+|- +. ++ ..++ =||.+ |-.| +.+++++|. T Consensus 89 ~~lVv~~va~~~lF~d~~~~~l~~~~e~~~~FIgVP-i~~~d---~~~~----GvL~~~r~~~g~~~~~~~~~v------ 154 (574) T TIGR01817 89 NSLVVPDVAAEPLFLDRLSEVLSDPGEIPVPFIGVP-IKNAD---SETI----GVLAIDRIKEGASRERLEEEV------ 154 (574) T ss_pred CEEEECCCCCCHHHHHHHHHHHCCCCCEECCEEECC-CCCCC---CCCC----EEEEEEECCCCCCCCCCHHHH------ T ss_conf 706604667353457764554058874311134022-66777---7711----226772068888413461454------ Q ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH-----HHHH-----HHHH Q ss_conf 11111100112201378899884210023031345513899898999999998865299988-----9999-----9997 Q gi|254781094|r 348 SSLKESALKFNLDLSKPLDQRMARLDMLDVVFHAKIGTQGERVSRIRVLGKKIAQLIDADVA-----LVDR-----AIVL 417 (702) Q Consensus 348 ~~~~~~~~~~~~dlk~~L~~~~~~L~~~~v~F~~~LGs~~dK~~Ri~~la~~la~~l~~~~~-----~~~r-----Aa~L 417 (702) .|-+. .=++--++-|+-+.....-.....+.. ...+ ...- T Consensus 155 ----------------RFL~m--------------VANL~gQtvRL~r~~~~~~~~~~~e~~~~~~~~~~~~sGa~~~~~ 204 (574) T TIGR01817 155 ----------------RFLEM--------------VANLIGQTVRLHRLVAARRERLIAEAAQLSKQLRDKKSGAEPEAA 204 (574) T ss_pred ----------------HHHHH--------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHH T ss_conf ----------------37999--------------999998788874456542011055677788752140015655412 Q ss_pred HHHH-HHHHHHHCCHHHHHHHHHHHHHHC--------CC----CHHHHHHHHHHHC-CCCCCC Q ss_conf 4236-788898403457899999998623--------69----9889999999734-477677 Q gi|254781094|r 418 SKAD-LCTEIVREFPELQGKIGKEYAVLQ--------NE----NISCCDAIEEHLK-PRGPLE 466 (702) Q Consensus 418 ~KaD-L~t~mV~EFpeLQG~mG~~YA~~~--------ge----~~~Va~aI~ehY~-P~~~~d 466 (702) ...- ..-.+||+=|.|+-+|-..-=.+. || .+-||.|| ||+ ||-..= T Consensus 205 ~~~~~~~~~i~G~Spam~~v~~~~~~vA~~nSTVLlRGESGTGKEl~A~AI--H~~SpR~~~P 265 (574) T TIGR01817 205 RRRSGKEDGIVGKSPAMRQVVDQIKVVARSNSTVLLRGESGTGKELIAKAI--HELSPRAKRP 265 (574) T ss_pred HHHCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHHHHHH--CCCCCCCCCC T ss_conf 331234474012478999999886520131766785056574433444234--0466455788 No 72 >TIGR02068 cya_phycin_syn cyanophycin synthetase; InterPro: IPR011810 Cyanophycin is an insoluble storage polymer for carbon, nitrogen, and energy, found in most Cyanobacteria. The polymer has a backbone of L-aspartic acid, with most Asp side chain carboxyl groups attached to L-arginine. The polymer is made by this enzyme, cyanophycin synthetase, and degraded by cyanophycinase. Heterologously expressed cyanophycin synthetase in Escherichia coli produces a closely related, water-soluble polymer with some Arg replaced by Lys. It is unclear whether enzymes that produce soluble cyanophycin-like polymers in vivo in non-Cyanobacterial species should be designated as cyanophycin synthetase itself or as a related enzyme. Cyanophycin synthesis is analogous to polyhydroxyalkanoic acid (PHA) biosynthesis, except that PHA polymers lack nitrogen and may be made under nitrogen-limiting conditions .; GO: 0005524 ATP binding, 0016874 ligase activity, 0009059 macromolecule biosynthetic process. Probab=28.35 E-value=18 Score=15.44 Aligned_cols=102 Identities=19% Similarity=0.226 Sum_probs=68.1 Q ss_pred HHHHHHHHHHHHH--------HHHHHHHH-CCHHHHHH---------HHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCC Q ss_conf 8899999997423--------67888984-03457899---------999998623699889999999734477677788 Q gi|254781094|r 408 VALVDRAIVLSKA--------DLCTEIVR-EFPELQGK---------IGKEYAVLQNENISCCDAIEEHLKPRGPLENVP 469 (702) Q Consensus 408 ~~~~~rAa~L~Ka--------DL~t~mV~-EFpeLQG~---------mG~~YA~~~ge~~~Va~aI~ehY~P~~~~d~lP 469 (702) .+.+.+|..-+|. |+||+-|. =..++.|. +=||.+=.+|.+..||.||-+|-.|..+...+| T Consensus 410 P~~~~~a~~aAk~iGLDiaG~D~v~~di~~PL~~~~G~iVEVNAaPGlrMH~~PS~G~pR~Va~Ai~d~LFP~~~~grIP 489 (876) T TIGR02068 410 PENIAIAVRAAKIIGLDIAGVDVVTEDISKPLRDTDGAIVEVNAAPGLRMHLAPSQGKPRNVAKAIVDMLFPEEDDGRIP 489 (876) T ss_pred HHHHHHHHHHHHEEEEEECCEEECCCCCCCCHHHCCCEEEEEECCCCHHHCCCCCCCCCCCCCHHHHHHCCCCCCCCCEE T ss_conf 45899998752000001212220031000374545972999856766344467777469871026875228887898344 Q ss_pred ---------CCHHHHHHHHH-HHHHHHHHHHH-----C--CCCCCC-CCCHHHHHHHH Q ss_conf ---------63588999999-74998987651-----0--567687-76968888743 Q gi|254781094|r 470 ---------TNKISITVSLA-DKLDTLINFWA-----I--NEKPSG-SKDPYALRRAT 509 (702) Q Consensus 470 ---------~~~~g~ilsia-DKlDtl~g~f~-----i--g~~PTG-S~DPfaLRR~a 509 (702) +|.+..++|=- +--=.-||+-. | -++-+| ..-|.+=||-- T Consensus 490 iV~vTGTNGKTt~~RL~Ahil~~~G~~vG~T~tDG~Yi~~~~v~~GDntGP~SAr~~L 547 (876) T TIGR02068 490 IVAVTGTNGKTTTTRLVAHILKQTGKVVGMTTTDGVYIGKKLVEKGDNTGPKSARRIL 547 (876) T ss_pred EEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCEEECCEEEECCCCCCHHHHHHCC T ss_conf 8887268983557889999998569827642037677557666247898715730122 No 73 >pfam04842 DUF639 Plant protein of unknown function (DUF639). Plant protein of unknown function. Probab=27.95 E-value=36 Score=13.43 Aligned_cols=34 Identities=32% Similarity=0.533 Sum_probs=16.7 Q ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 9999997423678889840345789999999862 Q gi|254781094|r 411 VDRAIVLSKADLCTEIVREFPELQGKIGKEYAVL 444 (702) Q Consensus 411 ~~rAa~L~KaDL~t~mV~EFpeLQG~mG~~YA~~ 444 (702) .+.|....-..+.-..|.||||+-|-+-|-|.+. T Consensus 280 FDkAV~ykS~~~sEpvvlEFpe~~G~tRRDyWLA 313 (682) T pfam04842 280 FDKAVSYSSIPGLEPVVLEFPELKGETRRDYWLA 313 (682) T ss_pred CCEEEEEECCCCCCCEEEECCCCCCCCHHHHHHH T ss_conf 2201476358878755998136788625899999 No 74 >COG5200 LUC7 U1 snRNP component, mediates U1 snRNP association with cap-binding complex [RNA processing and modification] Probab=26.82 E-value=38 Score=13.30 Aligned_cols=71 Identities=17% Similarity=0.147 Sum_probs=44.6 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCC--CCC-CCHHHHHHHHHHHHHHHHHH Q ss_conf 999974236788898403457899999998623699889999999734477677--788-63588999999749989876 Q gi|254781094|r 413 RAIVLSKADLCTEIVREFPELQGKIGKEYAVLQNENISCCDAIEEHLKPRGPLE--NVP-TNKISITVSLADKLDTLINF 489 (702) Q Consensus 413 rAa~L~KaDL~t~mV~EFpeLQG~mG~~YA~~~ge~~~Va~aI~ehY~P~~~~d--~lP-~~~~g~ilsiaDKlDtl~g~ 489 (702) .--.|+|+|||+.-+.-|-|||-+.- +-++|+++++..-.-.+++- .+. -...|+-||..|-=--|.-. T Consensus 136 e~dalakrk~V~~a~~~f~el~rl~~--------~rkei~~~v~sm~en~gq~thqklqvC~iCgayLsrlDtdrrladH 207 (258) T COG5200 136 EGDALAKRKLVERACSAFNELERLRE--------ERKEIKEAVYSMVENNGQGTHQKLQVCGICGAYLSRLDTDRRLADH 207 (258) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHH T ss_conf 48988878899999999999999999--------9999999999886076545465654530436678750206689987 Q ss_pred HH Q ss_conf 51 Q gi|254781094|r 490 WA 491 (702) Q Consensus 490 f~ 491 (702) |. T Consensus 208 f~ 209 (258) T COG5200 208 FN 209 (258) T ss_pred HC T ss_conf 33 No 75 >PRK10880 adenine DNA glycosylase; Provisional Probab=26.68 E-value=38 Score=13.28 Aligned_cols=90 Identities=13% Similarity=0.145 Sum_probs=46.7 Q ss_pred HHHHHHHHCC-CEEEECCCCCCCCEEEEEECCCCCCC-CHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 9999984001-13655368861324898505776554-302456788997876422599985030001111111111001 Q gi|254781094|r 279 IRLTIKTNQK-CFVTRTREGVLANCFILVSNIQASDG-GAAIVQGNSRVVAARLEDALHFWKRDQNNLPNLSSLKESALK 356 (702) Q Consensus 279 l~~~m~~hQk-yf~~~~~~g~l~~~Fi~v~N~~~~~~-~~~v~~GnerVl~ARL~DA~FF~~~D~~~~~~~~~~~~~~~~ 356 (702) |+.=-..|.| -+|=+. .-.||-|-||-+---.. .++|+.=++|-+. |+-|-.=.=+. T Consensus 10 ll~Wy~~~~r~~lPWr~---~~~PY~vwvSEiMLQQTqv~tV~~yy~rf~~-~fP~~~~LA~A----------------- 68 (350) T PRK10880 10 VLDWYDKYGRKTLPWQI---DKTPYKVWLSEVMLQQTQVATVIPYFERFMA-RFPTVTDLANA----------------- 68 (350) T ss_pred HHHHHHHCCCCCCCCCC---CCCCHHHHHHHHHHCCCCHHHHHHHHHHHHH-HCCCHHHHHCC----------------- T ss_conf 99999873998899899---9982269999999834877899999999999-88399999779----------------- Q ss_pred CCCCCCCCHHHHHHHHCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 1220137889988421002303134551389989899999999886529 Q gi|254781094|r 357 FNLDLSKPLDQRMARLDMLDVVFHAKIGTQGERVSRIRVLGKKIAQLID 405 (702) Q Consensus 357 ~~~dlk~~L~~~~~~L~~~~v~F~~~LGs~~dK~~Ri~~la~~la~~l~ 405 (702) +.++-+ -+-++||= |-++.++.+.|..+....+ T Consensus 69 -------~~~~vl--------~~W~GLGY-Y~RArnLh~aA~~i~~~~~ 101 (350) T PRK10880 69 -------PLDEVL--------HLWTGLGY-YARARNLHKAAQQVATLHG 101 (350) T ss_pred -------CHHHHH--------HHHHHCCC-HHHHHHHHHHHHHHHHHCC T ss_conf -------999999--------98640693-8999999999999999758 No 76 >pfam10508 Proteasom_PSMB Proteasome non-ATPase 26S subunit. The 26S proteasome, a eukaryotic ATP-dependent, dumb-bell shaped, protease complex with a molecular mass of approx 20kDa consists of a central 20S proteasome,functioning as a catalytic machine, and two large V-shaped terminal modules, having possible regulatory roles,composed of multiple subunits of 25- 110 kDa attached to the central portion in opposite orientations. It is responsible for degradation of abnormal intracellular proteins, including oxidatively damaged proteins, and may play a role as a component of a cellular anti-oxidative system. Expression of catalytic core subunits including PSMB5 and peptidase activities of the proteasome were elevated following incubation with 3-methylcholanthrene. The 20S proteasome comprises a cylindrical stack of four rings, two outer rings formed by seven alpha-subunits (alpha1-alpha7) and two inner rings of seven beta-subunits (beta1-beta7). Two outer rings of alpha subunits maintain Probab=25.99 E-value=35 Score=13.57 Aligned_cols=19 Identities=26% Similarity=0.378 Sum_probs=6.7 Q ss_pred HHHHHHHCCCCEEECHHHH Q ss_conf 9999862027847169999 Q gi|254781094|r 206 DHYIRDLEEAMVLLDPECR 224 (702) Q Consensus 206 ~~Y~~~L~~~~Vi~d~~~R 224 (702) .+|...|..+--=.++.-| T Consensus 77 ~~y~~~L~~gL~h~~~~Vr 95 (503) T pfam10508 77 PQYQPVLQEGLPHPTPAVR 95 (503) T ss_pred HHHHHHHHHHCCCCCHHHH T ss_conf 9869999966689978999 No 77 >KOG0995 consensus Probab=25.83 E-value=26 Score=14.40 Aligned_cols=60 Identities=15% Similarity=0.055 Sum_probs=26.5 Q ss_pred HHHHHCCHHHHHHHHHHHHHHCCCCHHHH------HHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 88984034578999999986236998899------99999734477677788635889999997499898765105 Q gi|254781094|r 424 TEIVREFPELQGKIGKEYAVLQNENISCC------DAIEEHLKPRGPLENVPTNKISITVSLADKLDTLINFWAIN 493 (702) Q Consensus 424 t~mV~EFpeLQG~mG~~YA~~~ge~~~Va------~aI~ehY~P~~~~d~lP~~~~g~ilsiaDKlDtl~g~f~ig 493 (702) -.|+.|=-+|+-..-+.=-..++...+|- ++.++. + .+..--.-++++||-....-++.| T Consensus 335 e~mn~Er~~l~r~l~~i~~~~d~l~k~vw~~~l~~~~~f~~---------l-e~~~~~~~~l~~~i~l~~~~~~~n 400 (581) T KOG0995 335 ERMNLERNKLKRELNKIQSELDRLSKEVWELKLEIEDFFKE---------L-EKKFIDLNSLIRRIKLGIAENSKN 400 (581) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------H-HHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99999999999999998877899999998677899889999---------9-999999999999998999987126 No 78 >TIGR00169 leuB 3-isopropylmalate dehydrogenase; InterPro: IPR004429 Several NAD- or NADP-dependent dehydrogenases, including 3-isopropylmalate dehydrogenase, tartrate dehydrogenase, and the dimeric forms of isocitrate dehydrogenase, share a nucleotide binding domain unrelated to that of lactate dehydrogenase and its homologs. These enzymes dehydrogenate their substates at a H-C-OH site adjacent to a H-C-COOH site; the latter carbon, now adjacent to a carbonyl group, readily decarboxylates. Among these decarboxylating dehydrogenases of hydroxyacids, overall sequence homology indicates evolutionary history rather than actual substrate or cofactor specifity, which may be toggled experimentally by replacement of just a few amino acids. 3-isopropylmalate dehydrogenase is an NAD-dependent enzyme and should have a sequence resembling HGSAPDI around residue 340. The subtrate binding loop should include a sequence resembling E[KQR]X(0,1)LLXXR around residue 115.; GO: 0003862 3-isopropylmalate dehydrogenase activity, 0009098 leucine biosynthetic process, 0005737 cytoplasm. Probab=25.57 E-value=40 Score=13.15 Aligned_cols=98 Identities=23% Similarity=0.281 Sum_probs=65.4 Q ss_pred HHHHHHHHHHHHHHHHCCC--CHHHHHHHHHHHHH----HHHHHHHH-CCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHC Q ss_conf 9898999999998865299--98899999997423----67888984-03457899999998623699889999999734 Q gi|254781094|r 388 ERVSRIRVLGKKIAQLIDA--DVALVDRAIVLSKA----DLCTEIVR-EFPELQGKIGKEYAVLQNENISCCDAIEEHLK 460 (702) Q Consensus 388 dK~~Ri~~la~~la~~l~~--~~~~~~rAa~L~Ka----DL~t~mV~-EFpeLQG~mG~~YA~~~ge~~~Va~aI~ehY~ 460 (702) +-++||.+.|-++|.+=.. ..-.++.|=-|.=. +-|.+|.. |||++. +..-.=+.- |--==|. T Consensus 168 ~Ei~RIaR~AFe~A~kr~~P~~vTSvDKANVL~sS~LWR~~V~e~~~~eYPdv~--------L~H~yiDnA--AM~Lvk~ 237 (370) T TIGR00169 168 PEIERIARVAFEMARKRRKPLKVTSVDKANVLESSRLWRKTVEEIAKEEYPDVE--------LEHQYIDNA--AMQLVKS 237 (370) T ss_pred HHHHHHHHHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHHHHHHHHHCCCCCEE--------ECCCHHHHH--HHHHHCC T ss_conf 356789999999998548898532100120455437899999999846888057--------302158788--7675328 Q ss_pred CCCCC-CCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 47767-7788635889999997499898765105676877 Q gi|254781094|r 461 PRGPL-ENVPTNKISITVSLADKLDTLINFWAINEKPSGS 499 (702) Q Consensus 461 P~~~~-d~lP~~~~g~ilsiaDKlDtl~g~f~ig~~PTGS 499 (702) |+.-+ =-+-++..|-||| |=-=-|+| |+|+-||-| T Consensus 238 P~~fDGV~vT~N~FGDIlS--DeAsvi~G--SLGmLPSAS 273 (370) T TIGR00169 238 PRQFDGVVVTGNLFGDILS--DEASVITG--SLGMLPSAS 273 (370) T ss_pred CCCCCCEEEECCCCCHHHH--HHHHHHCC--HHHHHHHHH T ss_conf 6504866772653300688--88876301--223234554 No 79 >PRK09692 integrase; Provisional Probab=25.11 E-value=41 Score=13.09 Aligned_cols=61 Identities=18% Similarity=0.118 Sum_probs=43.7 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHCC-CCCHHHHHHHHHHHHHHHC-CHHHHHHHHHHHHHHHHH Q ss_conf 999998763388878998854168-9998999999999999871-951388999999999985 Q gi|254781094|r 540 DRLKLYLHDRDIRHDLIEAILRPE-NDNLLTIVDLIKHLNEFFS-SAKGEKFLLSAKRIFQIL 600 (702) Q Consensus 540 ~Rl~~~L~e~G~~~DiI~AVl~~~-~~~~~~i~~r~~aL~~f~~-~~~~~~l~~a~KRv~NIL 600 (702) -=...++.+.|++.++|+.+|+-. .+.+..+|.|...|.+=+. -+.-.+.+.+..+-++|. T Consensus 341 rT~~T~l~e~g~~~~viE~~L~H~~~~~v~~~Ynr~~yl~eRr~~mq~Wadyl~~~~~g~~~~ 403 (413) T PRK09692 341 SIASTALNEQGFPPDVIEAALAHVDKNEVRRAYNRSDYLEQRRPMMQWWADFVMAADRGSMIE 403 (413) T ss_pred HHHHHHHHHCCCCHHHHHHHHCCCCCCHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHCCCEEE T ss_conf 779998997699999999986899998289995677599999999999999999996789441 No 80 >pfam07075 DUF1343 Protein of unknown function (DUF1343). This family consists of several hypothetical bacterial proteins of around 400 residues in length. The function of this family is unknown. Probab=24.62 E-value=23 Score=14.84 Aligned_cols=26 Identities=27% Similarity=0.681 Sum_probs=18.6 Q ss_pred HHCCCCCEEECCCCCCCCCCCEEEEEECCEE Q ss_conf 8628985131037765676741665303205 Q gi|254781094|r 132 IQKVPWPKSMRWSTTHSPISAFSWIRPLKSI 162 (702) Q Consensus 132 i~~l~~pKsMrWg~~~~~~~~~~FvRPirwI 162 (702) |.=-.|.++|.|.+. ++.|+.|=-++ T Consensus 174 Ipm~gw~R~m~~~~t-----gl~Wv~PSPNl 199 (359) T pfam07075 174 IPMEGWKRSMFYDET-----GLPWVPPSPNM 199 (359) T ss_pred EECCCCCCCCCCCCC-----CCCCCCCCCCC T ss_conf 967886767865746-----89988999998 No 81 >KOG2023 consensus Probab=24.27 E-value=42 Score=12.99 Aligned_cols=66 Identities=20% Similarity=0.176 Sum_probs=35.0 Q ss_pred EEECCEEEEEECCCCCCCCEEEEEECCCCCCCCCCCCCCCC-CCCEE------EECHHHHHHHHCCCCEEECHHHHHHHH Q ss_conf 53032056786367778723102445822111102234457-78566------316799998620278471699999999 Q gi|254781094|r 156 IRPLKSILCILVAEDAKEKIIDLDLKEIPCGNITYGHRFHA-PHPIK------VQSLDHYIRDLEEAMVLLDPECRRNAI 228 (702) Q Consensus 156 vRPirwIvaLl~~~~~~~~vi~~~~~gi~s~n~T~GHRf~~-~~~i~------I~~a~~Y~~~L~~~~Vi~d~~~R~~~I 228 (702) -||++-++--| +.|+. ..+-..|.|-... +..|. ..+.|.|.+.|-.-..=.|++.||+.- T Consensus 169 ~rpl~~mipkf-----------l~f~~-h~spkiRs~A~~cvNq~i~~~~qal~~~iD~Fle~lFalanD~~~eVRk~vC 236 (885) T KOG2023 169 TRPLNIMIPKF-----------LQFFK-HPSPKIRSHAVGCVNQFIIIQTQALYVHIDKFLEILFALANDEDPEVRKNVC 236 (885) T ss_pred CCCHHHHHHHH-----------HHHHH-CCCHHHHHHHHHHHHHEEECCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 47667767999-----------99872-8983678998720222023372889999999999999980699888999999 Q ss_pred HHHHH Q ss_conf 99999 Q gi|254781094|r 229 LNDAH 233 (702) Q Consensus 229 ~~~i~ 233 (702) +.-.- T Consensus 237 ~alv~ 241 (885) T KOG2023 237 RALVF 241 (885) T ss_pred HHHHH T ss_conf 99999 No 82 >COG1984 DUR1 Allophanate hydrolase subunit 2 [Amino acid transport and metabolism] Probab=24.24 E-value=42 Score=12.98 Aligned_cols=22 Identities=5% Similarity=-0.130 Sum_probs=8.7 Q ss_pred EECCCEEEEEECCCCCCCCCEE Q ss_conf 6307458999849542075100 Q gi|254781094|r 46 YWTPHRLFLYLKGLSPHSPEKI 67 (702) Q Consensus 46 ~~TPRRLav~i~~l~~~q~d~~ 67 (702) |.++--+|+-=-++....++.+ T Consensus 63 f~~~~~ialTGad~~a~ld~~~ 84 (314) T COG1984 63 FTSDALIALTGADCEATLDGQE 84 (314) T ss_pred EECCCEEEEECCCCCCEECCEE T ss_conf 8258579995886510689988 No 83 >pfam00310 GATase_2 Glutamine amidotransferases class-II. Probab=23.90 E-value=23 Score=14.78 Aligned_cols=19 Identities=16% Similarity=0.044 Sum_probs=11.1 Q ss_pred CCCEEEECCCCCCCCEEEEE Q ss_conf 01136553688613248985 Q gi|254781094|r 287 QKCFVTRTREGVLANCFILV 306 (702) Q Consensus 287 Qkyf~~~~~~g~l~~~Fi~v 306 (702) .+.+.++|..|- -|-++.. T Consensus 193 ~~l~a~RD~~Gi-RPL~~G~ 211 (223) T pfam00310 193 RYVGATLDRNGL-RPARYGI 211 (223) T ss_pred CEEEEEECCCCC-CCEEEEE T ss_conf 999999857999-7639999 No 84 >pfam06425 consensus Probab=23.77 E-value=43 Score=12.93 Aligned_cols=97 Identities=12% Similarity=0.093 Sum_probs=44.0 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHH-HHHHHHHHHHHHHCCCCCCCCC Q ss_conf 999999999876338887899885416899989999999999998719513889999-9999998556531246777773 Q gi|254781094|r 536 LFFHDRLKLYLHDRDIRHDLIEAILRPENDNLLTIVDLIKHLNEFFSSAKGEKFLLS-AKRIFQILAIEEKKNREISLEI 614 (702) Q Consensus 536 ~Fi~~Rl~~~L~e~G~~~DiI~AVl~~~~~~~~~i~~r~~aL~~f~~~~~~~~l~~a-~KRv~NIL~~~~k~~~~~~~~v 614 (702) .++.++.++-++-. +..|+ |..-....+.+-.++-.+-.-.+.+-+..+..+ .+|...|+............++ T Consensus 70 ~~f~~~v~~~lkAd---p~~Vd--L~~~~p~fY~lG~~l~~l~~de~~~l~~~l~~~f~~R~~~i~d~a~n~~~~~~s~f 144 (170) T pfam06425 70 KIYDEVIREVLKAD---PNSVD--LRSLCPHFYEFGYQLCHLLSGENADISQSLLETLRSRVKTRLDFSLNTATEDNSKF 144 (170) T ss_pred HHHHHHHHHHHHCC---CCEEE--HHHCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH T ss_conf 65509999998169---87641--66427379999999998617300799999999999999999998823554532788 Q ss_pred CHHHHCCHHHHHHHHHHHHHHHHHH Q ss_conf 8568359899999999999999999 Q gi|254781094|r 615 SPQYLLLEAEKRLYAVISDFGTHIQ 639 (702) Q Consensus 615 d~~Lf~~~~Ek~L~~~~~~~~~~i~ 639 (702) -. -.++.|+.||.+-....+.+. T Consensus 145 ~~--~Lde~Er~Lf~~g~~s~k~~~ 167 (170) T pfam06425 145 LE--RLDNKERKLFLEGHRKERDFL 167 (170) T ss_pred HH--HCCHHHHHHHHHHHHHHHHHH T ss_conf 87--268999999999999999998 No 85 >COG4245 TerY Uncharacterized protein encoded in toxicity protection region of plasmid R478, contains von Willebrand factor (vWF) domain [General function prediction only] Probab=23.59 E-value=15 Score=16.08 Aligned_cols=82 Identities=12% Similarity=0.064 Sum_probs=39.8 Q ss_pred CCCCCEEEEECCCEEEEEECCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHCCCCHHHHHHEEEECCCEEEEEE-EEECC Q ss_conf 98033588630745899984954207510023027898589878998999779982443320340698289999-99158 Q gi|254781094|r 38 VIYQNMRQYWTPHRLFLYLKGLSPHSPEKIEERLGPRVGAGKKAIDGFLRSTGLQTISDCQIKKDPKKGDVYLA-VLRKP 116 (702) Q Consensus 38 l~~~~i~~~~TPRRLav~i~~l~~~q~d~~~e~kGP~~g~pt~A~~GF~ks~gi~~~~~~~i~~~~~kg~~~~~-~~~~~ 116 (702) -.+=+|-+|+.|-|..+-.-++..-+...-.. +=|.|.-|+.-.+..+ |+ +...--....||+|==+ -.-+- T Consensus 44 ~v~lsIVTF~~~a~~~~pf~~~~nF~~p~L~a----~GgT~lGaAl~~a~d~-Ie--~~~~~~~a~~kgdyrP~vfLiTD 116 (207) T COG4245 44 RVELSIVTFGGPARVIQPFTDAANFNPPILTA----QGGTPLGAALTLALDM-IE--ERKRKYDANGKGDYRPWVFLITD 116 (207) T ss_pred EEEEEEEEECCCCEEEECHHHHHHCCCCCEEC----CCCCCHHHHHHHHHHH-HH--HHHHHCCCCCCCCCCEEEEEECC T ss_conf 05789998268506873315575448870136----9998067999999999-98--77765056775554417999538 Q ss_pred CCCHHHHHHH Q ss_conf 8509999999 Q gi|254781094|r 117 KRLIEDVLKT 126 (702) Q Consensus 117 g~~~~eiL~~ 126 (702) |.++-+.-.. T Consensus 117 G~PtD~w~~~ 126 (207) T COG4245 117 GEPTDDWQAG 126 (207) T ss_pred CCCCHHHHHH T ss_conf 9966577767 No 86 >COG4087 Soluble P-type ATPase [General function prediction only] Probab=23.48 E-value=27 Score=14.29 Aligned_cols=21 Identities=48% Similarity=0.623 Sum_probs=11.4 Q ss_pred CCCCCCHHHHHHHHHHHHHHH Q ss_conf 687769688887434468999 Q gi|254781094|r 496 PSGSKDPYALRRATLGIIRII 516 (702) Q Consensus 496 PTGS~DPfaLRR~a~GiirIl 516 (702) -+|..|-.|||+|.+||..|= T Consensus 100 GnGaND~laLr~ADlGI~tiq 120 (152) T COG4087 100 GNGANDILALREADLGICTIQ 120 (152) T ss_pred CCCCCHHHHHHHCCCCEEEEC T ss_conf 477522777653123148963 No 87 >PRK12293 hisZ ATP phosphoribosyltransferase regulatory subunit; Provisional Probab=23.25 E-value=44 Score=12.86 Aligned_cols=19 Identities=26% Similarity=0.394 Sum_probs=12.1 Q ss_pred HHHHHHHHHHHHHHHHCCC Q ss_conf 9999999999999997699 Q gi|254781094|r 20 KAAEDLSIILIGLLKEEGV 38 (702) Q Consensus 20 ~~~~~l~~~~~~~l~~~~l 38 (702) .....+.+.+.+.+...+. T Consensus 20 ~~~~~Ie~~~~~~f~~~Gy 38 (281) T PRK12293 20 KLKRDIENVASEILYKEGF 38 (281) T ss_pred HHHHHHHHHHHHHHHHCCC T ss_conf 9999999999999998599 No 88 >pfam08218 Citrate_ly_lig Citrate lyase ligase C-terminal domain. This family is composed of the C-terminal domain of citrate lyase ligase EC:6.2.1.22. Probab=23.20 E-value=17 Score=15.70 Aligned_cols=127 Identities=10% Similarity=0.112 Sum_probs=73.3 Q ss_pred CCCCCCCCCCCCCCCCCEEEECHHH--HHHHHCCCCEEECHHHHHHHHHHHHHHHHHHCCCEECCCHHHHHHHHHHCCCC Q ss_conf 2211110223445778566316799--99862027847169999999999999988853982146667887887440898 Q gi|254781094|r 183 IPCGNITYGHRFHAPHPIKVQSLDH--YIRDLEEAMVLLDPECRRNAILNDAHRLASAVGLELVEDKDLLEEIIGLVEWV 260 (702) Q Consensus 183 i~s~n~T~GHRf~~~~~i~I~~a~~--Y~~~L~~~~Vi~d~~~R~~~I~~~i~~~a~~~~~~~~~d~~Ll~Ev~~LvE~P 260 (702) .+++-.|.|||++-.+.-. ..| |.=.+.+..=...+++|.+++.++.+.+. |.++..-. .- T Consensus 6 MNaNPFT~GH~yLvE~Aa~---~~d~l~vFVV~eD~S~F~~~~R~~LV~~G~~~l~---NV~v~~g~-----------~Y 68 (182) T pfam08218 6 MNANPFTLGHRYLVEQAAA---ENDWVHLFVVSEDASLFSYEERFALVKQGTKDLD---NVTVHSGG-----------DY 68 (182) T ss_pred ECCCCCCCHHHHHHHHHHH---HCCEEEEEEEECCCCCCCHHHHHHHHHHHHCCCC---CEEEECCC-----------CE T ss_conf 7079885368999999997---4998999998355673889999999999755588---86995599-----------87 Q ss_pred EEEEEEECHHHHCCCHHHHHHHHHHHCCCEEEEC-CCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 4688830777712898999999984001136553-68861324898505776554302456788997876422 Q gi|254781094|r 261 QVFMGSFDKKYLCLPEELIRLTIKTNQKCFVTRT-REGVLANCFILVSNIQASDGGAAIVQGNSRVVAARLED 332 (702) Q Consensus 261 ~~~~G~F~~~fL~lP~evl~~~m~~hQkyf~~~~-~~g~l~~~Fi~v~N~~~~~~~~~v~~GnerVl~ARL~D 332 (702) .+-..+|+.=||.=..+++..-+.---+-|.-+- ..=.++..||+- +.-..|..-+.+.++.-|.+ T Consensus 69 iIS~aTFP~YFlKd~~~v~~~~~~lD~~iF~~~IApaL~It~RfVG~------EP~~~vT~~YN~~m~~~Lp~ 135 (182) T pfam08218 69 IISRATFPSYFIKEQDVVIKSQTTIDLRIFREYIAPALGITHRYVGE------EPFSRVTAIYNQAMQSELPP 135 (182) T ss_pred EEEEECCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECC------CCCCHHHHHHHHHHHHHCCC T ss_conf 89710471555355689999999998999999622652876334058------98987799999999986684 No 89 >cd03237 ABC_RNaseL_inhibitor_domain2 The ATPase domain 2 of RNase L inhibitor. The ABC ATPase, RNase L inhibitor (RLI), is a key enzyme in ribosomal biogenesis, formation of translation preinitiation complexes, and assembly of HIV capsids. RLI's are not transport proteins and thus cluster with a group of soluble proteins that lack the transmembrane components commonly found in other members of the family. Structurally, RLI's have an N-terminal Fe-S domain and two nucleotide-binding domains which are arranged to form two composite active sites in their interface cleft. RLI is one of the most conserved enzymes between archaea and eukaryotes with a sequence identity of more than 48%. The high degree of evolutionary conservation suggests that RLI performs a central role in archaeal and eukaryotic physiology. Probab=22.97 E-value=45 Score=12.82 Aligned_cols=66 Identities=20% Similarity=0.099 Sum_probs=44.2 Q ss_pred ECHHHHHHHHHHHHHHHHHHCCCEECCCHHHHHHHHHHCCCCEEEEEEECHHHHCCCHHHHHHHHHH Q ss_conf 1699999999999999888539821466678878874408984688830777712898999999984 Q gi|254781094|r 219 LDPECRRNAILNDAHRLASAVGLELVEDKDLLEEIIGLVEWVQVFMGSFDKKYLCLPEELIRLTIKT 285 (702) Q Consensus 219 ~d~~~R~~~I~~~i~~~a~~~~~~~~~d~~Ll~Ev~~LvE~P~~~~G~F~~~fL~lP~evl~~~m~~ 285 (702) +|.+.| ..|++.+.+++++.|.+++.=.-=++++..+++.=.++-|.=.-.=..-|+||+...|-. T Consensus 146 LD~~~~-~~i~~~i~~l~~~~~~Tvi~VtHDl~~a~~~aDrIivl~G~g~i~~~g~p~ev~~~~~~~ 211 (246) T cd03237 146 LDVEQR-LMASKVIRRFAENNEKTAFVVEHDIIMIDYLADRLIVFEGEPSVNGVANPPQSLRSGMNR 211 (246) T ss_pred CCHHHH-HHHHHHHHHHHHHCCCEEEEECCCHHHHHHHCCEEEEECCCCEEEEEECCHHHHCHHHHH T ss_conf 899999-999999999998679899998378999998699999981886499998976872628889 No 90 >PRK00767 transcriptional regulator BetI; Validated Probab=22.59 E-value=45 Score=12.77 Aligned_cols=25 Identities=24% Similarity=0.292 Sum_probs=14.3 Q ss_pred EECHHHHHHHHHHHHHHHHHHCCCE Q ss_conf 7169999999999999988853982 Q gi|254781094|r 218 LLDPECRRNAILNDAHRLASAVGLE 242 (702) Q Consensus 218 i~d~~~R~~~I~~~i~~~a~~~~~~ 242 (702) ++|+++||+.|.+..-....+.|+. T Consensus 4 ~~~~~~RR~~i~~Aa~~~i~~~G~~ 28 (197) T PRK00767 4 VGMEPIRRQQLIDATLRSIGEVGLL 28 (197) T ss_pred CCCCHHHHHHHHHHHHHHHHHHCCC T ss_conf 9870789999999999999985935 No 91 >pfam10095 DUF2333 Uncharacterized protein conserved in bacteria (DUF2333). Members of this family of hypothetical bacterial proteins have no known function. Probab=22.56 E-value=37 Score=13.38 Aligned_cols=64 Identities=9% Similarity=0.116 Sum_probs=34.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHH-HHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHH Q ss_conf 99999999999999999985025389999999-85327888751027036886899999999999 Q gi|254781094|r 623 AEKRLYAVISDFGTHIQESMDHKRYHQIGDLL-HSICEPIEIFFDQVLVNVDDREVRDNRLSLLQ 686 (702) Q Consensus 623 ~Ek~L~~~~~~~~~~i~~~l~~~dy~~~L~~L-~~L~~~Id~FFDnVmVm~eD~~IR~NRL~LL~ 686 (702) +==+|+..+..++..+...+.+++-...+..+ -.|...-.....-+.+|-..--+=.|.+..+. T Consensus 253 ~~wAl~h~L~Aie~DFa~VL~~KNa~vsl~QiireLeatq~~~wsP~IlNG~gfGl~aNHslvMa 317 (337) T pfam10095 253 AAWALLHFLKAIEVDFADVLEKKNATVSLQQIIRELEATQETVWSPMILNGSGFGLLANHSLVMA 317 (337) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCEEECCCCCCCCHHHHHHHH T ss_conf 99999999999985499999863336389999999998876642987876887666263799999 No 92 >pfam10678 DUF2492 Protein of unknown function (DUF2492). This is a bacterial family of uncharacterized proteins. Probab=22.42 E-value=42 Score=12.97 Aligned_cols=45 Identities=13% Similarity=0.232 Sum_probs=30.0 Q ss_pred HHHHHHHHCCCCCCHHHHHH---------------------HHHHHHHHHHHHHHHHHHCCCCHHH Q ss_conf 46899983799788789630---------------------3649999999999987633888789 Q gi|254781094|r 511 GIIRIILENKIDIPLSQFIE---------------------DQNLILFFHDRLKLYLHDRDIRHDL 555 (702) Q Consensus 511 GiirIl~e~~l~l~l~~li~---------------------~~~l~~Fi~~Rl~~~L~e~G~~~Di 555 (702) -|+.++++.+-++.-.+|.+ ..+|++|+.+|=|..-.+.|+..+. T Consensus 8 ~VL~mml~~~~~~t~~~L~~~i~~~FG~~arFhTCSae~mta~eLi~FL~~kgKfi~~~~g~t~~~ 73 (78) T pfam10678 8 EVLNLMLASGEPYTREELKEAIAQEFGEEARFHTCSAEGLTADELIEFLLKKGKFIESEGGFTTNA 73 (78) T ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHCCCEEECCCCEEECH T ss_conf 999999985897599999999999858665676656145989999999997698765289778633 No 93 >cd00532 MGS-like MGS-like domain. This domain composes the whole protein of methylglyoxal synthetase, which catalyzes the enolization of dihydroxyacetone phosphate (DHAP) to produce methylglyoxal. The family also includes the C-terminal domain in carbamoyl phosphate synthetase (CPS) where it catalyzes the last phosphorylation of a coaboxyphosphate intermediate to form the product carbamoyl phosphate and may also play a regulatory role. This family also includes inosine monophosphate cyclohydrolase. The known structures in this family show a common phosphate binding site. Probab=22.40 E-value=22 Score=14.92 Aligned_cols=30 Identities=27% Similarity=0.234 Sum_probs=15.5 Q ss_pred HHHHHHHHHHCCCCCCCCCCHHHHHHHHHH Q ss_conf 499898765105676877696888874344 Q gi|254781094|r 482 KLDTLINFWAINEKPSGSKDPYALRRATLG 511 (702) Q Consensus 482 KlDtl~g~f~ig~~PTGS~DPfaLRR~a~G 511 (702) ++|-++-.-.-+..+....|-|.+||+|+- T Consensus 68 ~I~lVINt~~~~~~~~~~~Dg~~IRr~Av~ 97 (112) T cd00532 68 KFDVVINLRDPRRDRCTDEDGTALLRLARL 97 (112) T ss_pred CEEEEEECCCCCCCCCCCCCHHHHHHHHHH T ss_conf 679999888888773324548999999997 No 94 >KOG0139 consensus Probab=22.04 E-value=46 Score=12.70 Aligned_cols=37 Identities=24% Similarity=0.427 Sum_probs=23.8 Q ss_pred HHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHH Q ss_conf 999997344776777886358899999974998987651056768776968888743 Q gi|254781094|r 453 DAIEEHLKPRGPLENVPTNKISITVSLADKLDTLINFWAINEKPSGSKDPYALRRAT 509 (702) Q Consensus 453 ~aI~ehY~P~~~~d~lP~~~~g~ilsiaDKlDtl~g~f~ig~~PTGS~DPfaLRR~a 509 (702) +..++-|+|.-.+|. +|+|++-+.-.|| |-||++-.| T Consensus 136 eeqK~~~~P~~~~d~-------------------vgsfAlSEpgaGS-Da~A~~T~A 172 (398) T KOG0139 136 EEQKEKYLPKLTGDL-------------------VGSFALSEPGAGS-DAFALKTTA 172 (398) T ss_pred HHHHHHHCCHHHCCC-------------------CCEEEECCCCCCC-CHHHHHHHH T ss_conf 988745334021031-------------------1104543787876-357766467 No 95 >cd00365 HMG-CoA_reductase Hydroxymethylglutaryl-coenzyme A (HMG-CoA) reductase (HMGR). Hydroxymethylglutaryl-coenzyme A (HMG-CoA) reductase (HMGR) is a tightly regulated enzyme, which catalyzes the synthesis of coenzyme A and mevalonate in isoprenoid synthesis. In mammals, this is the rate limiting committed step in cholesterol biosynthesis. Bacteria, such as Pseudomonas mevalonii, which rely solely on mevalonate for their carbon source, catalyze the reverse reaction, using an NAD-dependent HMGR to deacetylate mevalonate into 3-hydroxy-3-methylglutaryl-CoA. There are two classes of HMGR: class I enzymes which are found predominantly in eukaryotes and contain N-terminal membrane regions and class II enzymes which are found primarily in prokaryotes and are soluble as they lack the membrane region. With the exception of Archaeoglobus fulgidus, most archeae are assigned to class I, based on sequence similarity of the active site, even though they lack membrane regions. Yeast and human HMGR Probab=21.94 E-value=47 Score=12.69 Aligned_cols=67 Identities=21% Similarity=0.270 Sum_probs=41.8 Q ss_pred HHHHHHHHHH-------HHHHHHHHCCCEECCCHHHHHHHHHHCCCCEEEEEEEC---HHHHCCC----HHHHHHHHHHH Q ss_conf 9999999999-------99998885398214666788788744089846888307---7771289----89999999840 Q gi|254781094|r 221 PECRRNAILN-------DAHRLASAVGLELVEDKDLLEEIIGLVEWVQVFMGSFD---KKYLCLP----EELIRLTIKTN 286 (702) Q Consensus 221 ~~~R~~~I~~-------~i~~~a~~~~~~~~~d~~Ll~Ev~~LvE~P~~~~G~F~---~~fL~lP----~evl~~~m~~h 286 (702) +++|+++|.+ +++.+ +..++....-+..+|-+.+-+..|..+.|.|. ++| -+| +--++.++-.- T Consensus 9 ~~~r~~~~~~~~~l~~~~~~~l-~~~~l~~~~ad~~IEN~IG~~~lPlGvAg~l~InG~~y-~VPMATeE~SvVAa~srg 86 (376) T cd00365 9 HAARLDHIGQLLGLSHDDVQLL-ANAALPMDIANGMIENVIGTFELPYAVASNFQIDGRDV-LVPLVTEEPSIVAAASYM 86 (376) T ss_pred HHHHHHHHHHHCCCCHHHHHHH-HCCCCCHHHHHCCCCCCCEEEECCEEEECCEEECCEEE-EEEEEECHHHHHHHHHHH T ss_conf 6999999998729999999998-46999978873154301115432232654779988477-874674378899999889 Q ss_pred CCC Q ss_conf 011 Q gi|254781094|r 287 QKC 289 (702) Q Consensus 287 Qky 289 (702) =|. T Consensus 87 aK~ 89 (376) T cd00365 87 AKL 89 (376) T ss_pred HHH T ss_conf 999 No 96 >COG3263 NhaP-type Na+/H+ and K+/H+ antiporters with a unique C-terminal domain [Inorganic ion transport and metabolism] Probab=21.63 E-value=34 Score=13.63 Aligned_cols=31 Identities=19% Similarity=0.269 Sum_probs=14.8 Q ss_pred CCHHHHHHHHCCCCEEHHHHHHHHHHHHHHH Q ss_conf 7889988421002303134551389989899 Q gi|254781094|r 363 KPLDQRMARLDMLDVVFHAKIGTQGERVSRI 393 (702) Q Consensus 363 ~~L~~~~~~L~~~~v~F~~~LGs~~dK~~Ri 393 (702) .|+...+..|.+.=|.|..++||-..-.+-. T Consensus 57 y~~Ay~vg~lALaiILfdgG~~T~lss~r~a 87 (574) T COG3263 57 YPFAYMVGNLALAIILFDGGFGTQLSSFRVA 87 (574) T ss_pred CHHHHHHHHHHHHHHHHCCCCCCCHHHHHHH T ss_conf 3899999888888874137667718888977 No 97 >pfam08006 DUF1700 Protein of unknown function (DUF1700). This family contains many hypothetical bacterial proteins and two putative membrane proteins. Probab=21.62 E-value=23 Score=14.77 Aligned_cols=17 Identities=0% Similarity=0.018 Sum_probs=7.0 Q ss_pred CCCHHHHHHHHHHHCCC Q ss_conf 69988999999973447 Q gi|254781094|r 446 NENISCCDAIEEHLKPR 462 (702) Q Consensus 446 ge~~~Va~aI~ehY~P~ 462 (702) |.|+++|..|...|..+ T Consensus 51 G~P~~iA~ei~a~~~i~ 67 (181) T pfam08006 51 GNPKTIAKELKADYAID 67 (181) T ss_pred CCHHHHHHHHHHHHHHH T ss_conf 99999999999874077 No 98 >TIGR00721 tfx DNA-binding protein, Tfx family; InterPro: IPR004645 This is a family of Tfx DNA-binding proteins, which is restricted to the archaea. Homology among the members is strongest in the helix-turn-helix-containing N-terminal region. Tfx from Methanobacterium thermoautotrophicum is associated with the operon for molybdenum formyl-methanofuran dehydrogenase and binds a DNA sequence near its promoter.; GO: 0003677 DNA binding. Probab=21.42 E-value=48 Score=12.62 Aligned_cols=64 Identities=23% Similarity=0.266 Sum_probs=45.6 Q ss_pred HHHHHHHHHHH--HHCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999987--6338887899885416899989999999999998719513889999999999855653 Q gi|254781094|r 536 LFFHDRLKLYL--HDRDIRHDLIEAILRPENDNLLTIVDLIKHLNEFFSSAKGEKFLLSAKRIFQILAIEE 604 (702) Q Consensus 536 ~Fi~~Rl~~~L--~e~G~~~DiI~AVl~~~~~~~~~i~~r~~aL~~f~~~~~~~~l~~a~KRv~NIL~~~~ 604 (702) .|+.+|-.-.| +++|....-|.=.|.+.-.|+..+-+|+.- +-+...|-+..|+|+..=+...- T Consensus 5 tFLTe~Q~kVL~lR~kGl~Q~eIAk~LkTtRaNvS~iEkrA~E-----nIeKa~NTl~~~~~i~spv~i~~ 70 (142) T TIGR00721 5 TFLTERQIKVLELREKGLKQKEIAKILKTTRANVSIIEKRALE-----NIEKAKNTLKLYKRIKSPVRIEV 70 (142) T ss_pred CCCHHHHHHHHHHHHCCCCHHHHHHHHCCCCHHHHHHHHHHHH-----HHHHHHHHHHHHHHHCCCCEEEE T ss_conf 4340656899998633888788987726720137898887443-----23776638999997449825886 No 99 >KOG1989 consensus Probab=21.29 E-value=34 Score=13.65 Aligned_cols=49 Identities=16% Similarity=0.051 Sum_probs=18.5 Q ss_pred ECHHHHHHHHHHHHHHHHHHCCCEE-CCCHHHHHHHHHHCCCCEEEEEEE Q ss_conf 1699999999999999888539821-466678878874408984688830 Q gi|254781094|r 219 LDPECRRNAILNDAHRLASAVGLEL-VEDKDLLEEIIGLVEWVQVFMGSF 267 (702) Q Consensus 219 ~d~~~R~~~I~~~i~~~a~~~~~~~-~~d~~Ll~Ev~~LvE~P~~~~G~F 267 (702) +...|=-.++-+-..+++.=|.+.+ ++..||=-|.+=|.--=+-.+|.| T Consensus 141 lte~eVLkIf~dv~~AVa~mH~~~pPiIHRDLKiENvLls~~g~~KLCDF 190 (738) T KOG1989 141 LTEDEVLKIFYDVCEAVAAMHYLKPPIIHRDLKIENVLLSADGNYKLCDF 190 (738) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHEEECCCCCEEECCC T ss_conf 77378999999999999998637986200000012047768997885756 No 100 >pfam02572 CobA_CobO_BtuR ATP:corrinoid adenosyltransferase BtuR/CobO/CobP. This family consists of the BtuR, CobO, CobP proteins all of which are Cob(I)alamin adenosyltransferase, EC:2.5.1.17, involved in cobalamin (vitamin B12) biosynthesis. These enzymes catalyse the adenosylation reaction: ATP + cob(I)alamin + H2O <= phosphate + diphosphate + adenosylcobalamin. Probab=21.24 E-value=26 Score=14.42 Aligned_cols=29 Identities=14% Similarity=0.095 Sum_probs=16.1 Q ss_pred EEEEEEECHHHHCCCHHHHHHHHHHHCCCE Q ss_conf 468883077771289899999998400113 Q gi|254781094|r 261 QVFMGSFDKKYLCLPEELIRLTIKTNQKCF 290 (702) Q Consensus 261 ~~~~G~F~~~fL~lP~evl~~~m~~hQkyf 290 (702) .++.|.-.++-| ++.-=++|.|+.+--|| T Consensus 132 vVlTGr~~p~~L-~e~AD~VTEm~~vKHp~ 160 (172) T pfam02572 132 VVLTGRGAPPEL-IELADLVTEMREVKHPF 160 (172) T ss_pred EEEECCCCCHHH-HHHHHHHHHHHHCCCHH T ss_conf 999899999999-99704776321304802 No 101 >pfam10660 MitoNEET_N Iron-containing outer mitochondrial membrane protein N-terminus. MitoNEET_N is the N-terminal region of the MitoNEET and Miner-type proteins that carry a zf-CDGSH, pfam09360, redox-active 2Fe-2S cluster. The whole protein regulates oxidative capacity. The domain is an anchor sequence that tethers the protein to the outer membrane. Probab=21.10 E-value=49 Score=12.57 Aligned_cols=24 Identities=25% Similarity=0.542 Sum_probs=20.2 Q ss_pred CHHHHHHHHHHHHHHCCCCCEEEC Q ss_conf 099999999999986289851310 Q gi|254781094|r 119 LIEDVLKTIVPVAIQKVPWPKSMR 142 (702) Q Consensus 119 ~~~eiL~~ii~~~i~~l~~pKsMr 142 (702) .++.+....+|+-++++|.|+|-. T Consensus 3 ~ls~~vk~~lP~YL~~LPiPdS~g 26 (64) T pfam10660 3 SLSRIVKSSLPNYLKSLPIPKSFG 26 (64) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 789999975579983699973312 No 102 >cd07129 ALDH_KGSADH Alpha-Ketoglutaric Semialdehyde Dehydrogenase. Alpha-Ketoglutaric Semialdehyde (KGSA) Dehydrogenase (KGSADH, EC 1.2.1.26) catalyzes the NAD(P)+-dependent conversion of KGSA to alpha-ketoglutarate. This CD contains such sequences as those seen in Azospirillum brasilense, KGSADH-II (D-glucarate/D-galactarate-inducible) and KGSADH-III (hydroxy-L-proline-inducible). Both show similar high substrate specificity for KGSA and different coenzyme specificity; KGSADH-II is NAD+-dependent and KGSADH-III is NADP+-dependent. Also included in this CD is the NADP(+)-dependent aldehyde dehydrogenase from Vibrio harveyi which catalyzes the oxidation of long-chain aliphatic aldehydes to acids. Probab=20.93 E-value=49 Score=12.55 Aligned_cols=28 Identities=18% Similarity=0.284 Sum_probs=25.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 3588999999749989876510567687 Q gi|254781094|r 471 NKISITVSLADKLDTLINFWAINEKPSG 498 (702) Q Consensus 471 ~~~g~ilsiaDKlDtl~g~f~ig~~PTG 498 (702) ...+....++++|++-+|.+.+|-.||| T Consensus 387 ~D~~~a~~~~~~l~~~aG~v~vN~~~tg 414 (454) T cd07129 387 DDLALARELLPVLERKAGRLLFNGWPTG 414 (454) T ss_pred CCHHHHHHHHHHHHHHCCEEEECCCCCC T ss_conf 4299999999999971898999998875 No 103 >TIGR01283 nifE nitrogenase MoFe cofactor biosynthesis protein NifE; InterPro: IPR005973 The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene. Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes. It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I. ; GO: 0005515 protein binding, 0006461 protein complex assembly, 0009399 nitrogen fixation. Probab=20.38 E-value=48 Score=12.60 Aligned_cols=25 Identities=16% Similarity=0.199 Sum_probs=15.8 Q ss_pred CCCCEEEEEECCCCCCCCCCCCCCC Q ss_conf 7872310244582211110223445 Q gi|254781094|r 171 AKEKIIDLDLKEIPCGNITYGHRFH 195 (702) Q Consensus 171 ~~~~vi~~~~~gi~s~n~T~GHRf~ 195 (702) .|-.|||++=.|..++|++-|||-- T Consensus 156 ~G~~ViPV~s~GF~G~NK~lGnk~A 180 (470) T TIGR01283 156 YGIPVIPVDSEGFYGSNKNLGNKLA 180 (470) T ss_pred CCCCEEEECCCCCCCCCCCCCHHHH T ss_conf 5860675157887777755002789 No 104 >PRK09440 avtA valine--pyruvate transaminase; Provisional Probab=20.31 E-value=50 Score=12.46 Aligned_cols=39 Identities=18% Similarity=0.123 Sum_probs=28.0 Q ss_pred CCEEEECHHHHHHHHCCCCEEECHHHHHHHHHHHHHHHHHHCCCEECCCH Q ss_conf 85663167999986202784716999999999999998885398214666 Q gi|254781094|r 198 HPIKVQSLDHYIRDLEEAMVLLDPECRRNAILNDAHRLASAVGLELVEDK 247 (702) Q Consensus 198 ~~i~I~~a~~Y~~~L~~~~Vi~d~~~R~~~I~~~i~~~a~~~~~~~~~d~ 247 (702) +.+.+++|. ---+++.+.++|++. .++|+++++-++.|+ T Consensus 181 ~~ii~~~P~------NPTG~v~s~e~~~~l-----~~la~~~~i~iI~De 219 (416) T PRK09440 181 GAICVSRPT------NPTGNVITDEELAKL-----DALARQHNIPLIIDN 219 (416) T ss_pred EEEEECCCC------CCCCCCCCHHHHHHH-----HHHHHHCCEEEEEEC T ss_conf 399977898------998877889999999-----999998795899978 No 105 >TIGR00004 TIGR00004 endoribonuclease L-PSP, putative; InterPro: IPR006056 This family includes proteins encoded by the yabJ gene from Bacillus subtilis that is required for adenine-mediated repression of purine biosynthetic genes in vivo and codes for an acid-soluble, 14-kDa protein. The structure of this protein has been solved and the protein is found to form a homotrimer. The trimer has a deep cleft that is a probably ligand binding site and possibly forms an enzymatic active site. However the ligand and reaction are unknown.. Probab=20.20 E-value=51 Score=12.45 Aligned_cols=44 Identities=18% Similarity=0.268 Sum_probs=20.8 Q ss_pred CCCCCHH-HHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCEEEEEECCC Q ss_conf 2016989-9999999999999999997699803358863074589998495 Q gi|254781094|r 10 SEEIPAR-MQHKAAEDLSIILIGLLKEEGVIYQNMRQYWTPHRLFLYLKGL 59 (702) Q Consensus 10 tEElPa~-~~~~~~~~l~~~~~~~l~~~~l~~~~i~~~~TPRRLav~i~~l 59 (702) |+|++.. -+..=.+|.-+++.+-|++++++++++- +.+|++.|| T Consensus 38 Tge~v~~g~~~eQa~Qvl~NlkAiL~aAG~~~~~vV------K~TvfL~D~ 82 (129) T TIGR00004 38 TGELVGGGDIAEQAEQVLENLKAILEAAGLSLDDVV------KTTVFLTDL 82 (129) T ss_pred CCEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCCEE------EEEEEECCH T ss_conf 875727987888898999989999998279846668------886630265 No 106 >pfam06090 Ins_P5_2-kin Inositol-pentakisphosphate 2-kinase. This is a family of inositol-pentakisphosphate 2-kinases (EC 2.7.1.158) (also known as inositol 1,3,4,5,6-pentakisphosphate 2-kinase, Ins(1,3,4,5,6)P5 2-kinase) and InsP5 2-kinase). This enzyme phosphorylates Ins(1,3,4,5,6)P5 to form Ins(1,2,3,4,5,6)P6 (also known as InsP6 or phytate). InsP6 is involved in many processes such as mRNA export, nonhomologous end-joining, endocytosis and ion channel regulation. Probab=20.18 E-value=51 Score=12.45 Aligned_cols=15 Identities=20% Similarity=0.308 Sum_probs=7.0 Q ss_pred EEEEEEECHHHHCCC Q ss_conf 468883077771289 Q gi|254781094|r 261 QVFMGSFDKKYLCLP 275 (702) Q Consensus 261 ~~~~G~F~~~fL~lP 275 (702) ..++=+|.+|.|-++ T Consensus 118 ~~i~vEiKPKwl~~s 132 (301) T pfam06090 118 NTITVEIKPKWLFLS 132 (301) T ss_pred CEEEEEECCCCCCCC T ss_conf 605786054400557 No 107 >smart00437 TOP1Ac Bacterial DNA topoisomerase I DNA-binding domain. Bacterial DNA topoisomerase I and III, Eukaryotic DNA topoisomeraes III, reverse gyrase alpha subunit Probab=20.00 E-value=51 Score=12.42 Aligned_cols=54 Identities=11% Similarity=0.120 Sum_probs=31.2 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHC---CCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 457899999998623699889999999734---477677788635889999997499 Q gi|254781094|r 431 PELQGKIGKEYAVLQNENISCCDAIEEHLK---PRGPLENVPTNKISITVSLADKLD 484 (702) Q Consensus 431 peLQG~mG~~YA~~~ge~~~Va~aI~ehY~---P~~~~d~lP~~~~g~ilsiaDKlD 484 (702) ++||..+.+.|-.-.-+.-.+|+.++|+=+ ||.....+|.+....+.+...++. T Consensus 16 s~Lq~~as~~~g~sa~~tl~iaQ~LYe~g~ITYpRTds~~l~~~~~~~~~~~i~~~~ 72 (259) T smart00437 16 STLQQEASRKLGFSAKKTMQIAQKLYEKGLITYPRTDSTRLSEEAVLEARNYISKHY 72 (259) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECCCCCCCCCHHHHHHHHHHHHHHC T ss_conf 999999998669999999999999986785533476874639999999999999863 Done!