Query gi|254780897|ref|YP_003065310.1| trigger factor [Candidatus Liberibacter asiaticus str. psy62] Match_columns 473 No_of_seqs 157 out of 2082 Neff 8.0 Searched_HMMs 39220 Date Mon May 30 01:38:58 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780897.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK01490 tig trigger factor; P 100.0 0 0 701.3 49.0 432 1-445 1-434 (435) 2 COG0544 Tig FKBP-type peptidyl 100.0 0 0 680.4 45.3 435 1-444 1-439 (441) 3 TIGR00115 tig trigger factor; 100.0 0 0 563.6 39.9 429 1-440 2-470 (475) 4 pfam05697 Trigger_N Bacterial 100.0 2.3E-28 5.8E-33 246.1 16.5 144 1-153 1-144 (144) 5 pfam05698 Trigger_C Bacterial 99.9 1.1E-20 2.8E-25 184.4 20.8 158 269-427 1-162 (162) 6 pfam00254 FKBP_C FKBP-type pep 98.4 5.5E-07 1.4E-11 74.6 5.3 83 165-247 5-95 (95) 7 PRK10737 FKBP-type peptidyl-pr 98.3 9E-07 2.3E-11 72.9 4.2 89 166-254 2-144 (196) 8 COG1047 SlpA FKBP-type peptidy 98.1 4.4E-06 1.1E-10 67.4 5.4 60 166-225 2-64 (174) 9 COG0545 FkpA FKBP-type peptidy 97.5 0.00017 4.4E-09 54.7 4.8 83 166-250 115-205 (205) 10 KOG0552 consensus 97.5 0.0002 5.2E-09 54.0 4.9 85 166-250 134-226 (226) 11 KOG0549 consensus 97.3 0.00073 1.9E-08 49.6 5.7 89 164-252 82-178 (188) 12 pfam09312 SurA_N SurA N-termin 97.1 0.034 8.8E-07 36.2 12.9 100 309-420 13-112 (118) 13 PRK10902 FKBP-type peptidyl-pr 97.0 0.0011 2.7E-08 48.3 4.6 84 167-250 161-249 (270) 14 TIGR03516 ppisom_GldI peptidyl 97.0 0.0014 3.7E-08 47.2 4.9 83 166-250 85-176 (177) 15 KOG0544 consensus 97.0 0.0019 4.9E-08 46.2 5.5 84 166-249 16-107 (108) 16 PRK00059 prsA peptidylprolyl i 96.9 0.051 1.3E-06 34.8 13.7 140 138-310 171-331 (336) 17 PRK10770 peptidyl-prolyl cis-t 96.8 0.059 1.5E-06 34.3 13.3 37 200-237 236-272 (428) 18 PRK11570 peptidyl-prolyl cis-t 96.8 0.0023 5.9E-08 45.6 4.6 83 167-249 117-205 (206) 19 PRK12450 foldase protein PrsA; 96.5 0.02 5.2E-07 38.0 8.1 98 194-314 193-291 (309) 20 PRK03095 prsA peptidylprolyl i 96.1 0.086 2.2E-06 33.0 9.3 88 199-314 183-270 (287) 21 PRK01326 prsA foldase protein 95.8 0.11 2.8E-06 32.2 9.0 97 195-314 191-288 (310) 22 PRK00059 prsA peptidylprolyl i 95.4 0.26 6.7E-06 29.1 11.7 106 280-396 81-195 (336) 23 PRK02998 prsA peptidylprolyl i 95.3 0.15 3.9E-06 31.0 8.1 90 199-315 185-274 (283) 24 PRK04405 prsA peptidylprolyl i 95.0 0.19 4.8E-06 30.3 7.9 93 195-315 193-285 (298) 25 PRK10788 peptidyl-prolyl cis-t 94.0 0.55 1.4E-05 26.6 12.0 76 199-300 319-394 (622) 26 KOG0543 consensus 92.5 0.88 2.2E-05 24.9 13.9 85 168-253 102-193 (397) 27 PRK01885 greB transcription el 92.1 0.55 1.4E-05 26.6 6.0 79 142-224 57-144 (159) 28 TIGR01462 greA transcription e 91.4 0.64 1.6E-05 26.0 5.8 93 140-249 48-154 (155) 29 PRK00226 greA transcription el 90.9 0.8 2E-05 25.2 5.9 79 142-224 55-143 (157) 30 cd06552 ASCH_yqfb_like ASC-1 h 89.4 0.51 1.3E-05 26.8 3.8 43 239-281 42-84 (100) 31 PRK03002 prsA peptidylprolyl i 87.5 2.1 5.4E-05 21.9 9.3 90 198-316 186-275 (285) 32 PRK05892 nucleoside diphosphat 84.0 3 7.7E-05 20.6 6.1 33 206-251 127-159 (161) 33 PRK12318 methionine aminopepti 77.5 3.5 9E-05 20.1 3.8 21 163-183 122-142 (291) 34 PRK06342 transcription elongat 76.7 5 0.00013 18.9 5.7 72 145-222 68-149 (160) 35 PRK12897 methionine aminopepti 75.7 4.3 0.00011 19.4 3.9 15 164-178 82-96 (248) 36 TIGR00967 3a0501s007 preprotei 74.5 4 0.0001 19.6 3.5 39 16-60 388-427 (489) 37 pfam01272 GreA_GreB Transcript 73.3 1.9 4.7E-05 22.3 1.5 24 203-228 42-65 (77) 38 PRK09204 secY preprotein trans 72.8 4.7 0.00012 19.1 3.5 11 35-45 30-40 (427) 39 COG2411 Uncharacterized conser 72.1 4.6 0.00012 19.2 3.3 59 211-281 32-90 (188) 40 TIGR00500 met_pdase_I methioni 67.1 4.9 0.00012 18.9 2.6 54 161-218 88-153 (265) 41 pfam00344 SecY eubacterial sec 66.4 5.3 0.00014 18.6 2.6 11 374-384 262-272 (340) 42 PRK06330 transcript cleavage f 66.1 8.2 0.00021 17.1 6.6 15 251-265 570-584 (906) 43 cd00986 PDZ_LON_protease PDZ d 66.0 4.4 0.00011 19.3 2.2 36 204-248 42-77 (79) 44 PRK05753 nucleoside diphosphat 63.0 9.3 0.00024 16.7 7.3 68 168-249 50-125 (135) 45 COG0201 SecY Preprotein transl 55.4 12 0.00031 15.8 2.9 14 323-336 338-351 (436) 46 PRK08568 preprotein translocas 54.6 12 0.00032 15.7 3.3 14 34-47 41-54 (466) 47 TIGR02093 P_ylase glycogen/sta 54.4 13 0.00032 15.6 3.3 92 219-322 454-558 (822) 48 CHL00161 secY preprotein trans 54.0 13 0.00032 15.6 3.1 12 34-45 25-36 (417) 49 TIGR01570 A_thal_3588 uncharac 52.1 4.1 0.0001 19.6 0.0 18 40-57 4-21 (172) 50 TIGR02932 vnfK_nitrog V-contai 49.5 13 0.00032 15.6 2.2 53 214-276 178-230 (460) 51 COG0782 Uncharacterized conser 47.9 16 0.0004 14.9 5.2 19 206-224 118-136 (151) 52 TIGR02058 lin0512_fam conserve 45.5 14 0.00036 15.3 1.9 40 18-57 12-51 (120) 53 PRK12907 secY preprotein trans 43.3 18 0.00046 14.4 3.0 10 35-44 30-39 (434) 54 PRK04980 hypothetical protein; 42.9 18 0.00046 14.3 3.1 42 239-281 47-88 (102) 55 cd01089 PA2G4-like Related to 42.6 18 0.00047 14.3 2.3 34 163-200 79-112 (228) 56 cd01088 MetAP2 Methionine Amin 38.0 21 0.00055 13.8 2.1 20 165-184 69-88 (291) 57 COG0199 RpsN Ribosomal protein 34.4 21 0.00054 13.8 1.4 18 30-47 42-59 (61) 58 TIGR03342 dsrC_tusE_dsvC sulfu 34.3 24 0.00062 13.4 4.0 34 363-396 26-61 (108) 59 PRK08671 methionine aminopepti 33.8 25 0.00063 13.3 2.3 21 164-184 69-89 (293) 60 PRK12414 putative aminotransfe 33.8 22 0.00055 13.8 1.4 23 141-164 179-201 (384) 61 KOG2738 consensus 33.5 25 0.00063 13.3 2.3 45 139-183 145-213 (369) 62 pfam04266 ASCH ASCH domain. Th 32.9 25 0.00064 13.2 3.3 45 239-284 44-88 (104) 63 COG4086 Predicted secreted pro 30.5 28 0.0007 12.9 8.7 104 208-319 122-240 (299) 64 PRK01546 hypothetical protein; 29.7 28 0.00072 12.8 3.8 60 323-383 11-70 (79) 65 PRK11508 sulfurtransferase Tus 29.4 29 0.00073 12.8 3.9 34 363-396 27-62 (109) 66 TIGR02168 SMC_prok_B chromosom 29.3 29 0.00073 12.8 2.0 21 142-162 615-635 (1191) 67 TIGR01309 L30P_arch ribosomal 29.1 29 0.00074 12.7 2.4 22 256-277 78-100 (160) 68 TIGR02023 BchP-ChlP geranylger 29.0 23 0.00058 13.6 0.8 136 141-283 93-246 (408) 69 COG4283 Uncharacterized conser 28.1 30 0.00076 12.6 2.8 35 251-285 82-116 (170) 70 PRK07281 methionine aminopepti 27.8 30 0.00077 12.6 3.3 21 226-246 246-267 (286) 71 TIGR01265 tyr_nico_aTase tyros 27.3 31 0.00079 12.5 2.5 35 126-160 150-187 (424) 72 TIGR00998 8a0101 efflux pump m 26.8 31 0.0008 12.4 2.0 43 208-259 289-331 (379) 73 cd01651 RT_G2_intron RT_G2_int 26.5 32 0.00081 12.4 4.5 52 3-54 3-58 (226) 74 COG5192 BMS1 GTP-binding prote 25.8 32 0.00081 12.4 1.1 22 157-178 326-347 (1077) 75 TIGR00963 secA preprotein tran 25.5 17 0.00044 14.5 -0.3 26 375-400 813-838 (904) 76 pfam03299 TF_AP-2 Transcriptio 25.3 33 0.00085 12.2 6.7 26 236-261 13-40 (207) 77 KOG0546 consensus 25.1 33 0.00085 12.2 2.5 67 166-246 74-146 (372) 78 pfam01390 SEA SEA domain. Doma 24.8 34 0.00086 12.2 1.2 12 143-154 33-44 (107) 79 PRK02769 histidine decarboxyla 24.7 34 0.00087 12.2 7.7 14 375-388 354-367 (380) 80 pfam06021 Gly_acyl_tr_N Aralky 24.7 34 0.00087 12.2 3.1 44 65-126 11-54 (205) 81 TIGR02713 allophanate_hyd allo 24.1 24 0.00061 13.4 0.2 47 297-343 211-262 (582) 82 PRK11557 putative DNA-binding 23.9 35 0.00089 12.1 2.7 21 261-281 54-74 (282) 83 TIGR02103 pullul_strch alpha-1 23.4 36 0.00091 12.0 2.9 62 156-217 182-257 (945) 84 TIGR00059 L17 ribosomal protei 23.3 36 0.00092 12.0 1.0 28 364-391 21-51 (125) 85 pfam05089 NAGLU Alpha-N-acetyl 23.1 36 0.00092 12.0 3.5 16 165-180 303-318 (672) 86 TIGR02656 cyanin_plasto plasto 23.0 36 0.00093 11.9 1.1 17 160-176 14-30 (102) 87 pfam09953 DUF2187 Uncharacteri 22.1 31 0.00079 12.5 0.5 36 206-241 11-49 (57) 88 PRK07777 aminotransferase; Val 21.7 38 0.00098 11.7 2.0 14 365-378 335-348 (386) 89 PRK09795 aminopeptidase; Provi 21.7 38 0.00098 11.7 2.4 12 75-86 7-18 (361) 90 TIGR01461 greB transcription e 21.7 38 0.00098 11.7 5.9 90 145-248 56-155 (157) 91 pfam04358 DsrC DsrC like prote 21.6 39 0.00098 11.7 3.9 34 363-396 27-62 (109) 92 cd01085 APP X-Prolyl Aminopept 21.6 39 0.00098 11.7 3.0 16 141-156 30-45 (224) 93 pfam12322 T4_baseplate T4 bact 21.5 36 0.00092 12.0 0.7 104 124-273 16-123 (204) 94 TIGR00495 crvDNA_42K DNA-bindi 21.2 37 0.00094 11.9 0.7 68 165-237 106-192 (407) 95 cd01807 GDX_N GDX contains an 21.1 39 0.001 11.7 3.4 27 1-29 1-27 (74) 96 pfam07273 DUF1439 Protein of u 21.1 40 0.001 11.7 2.2 12 140-151 25-36 (177) 97 pfam08349 DUF1722 Protein of u 21.0 37 0.00093 11.9 0.6 15 365-379 56-70 (117) 98 PRK08960 hypothetical protein; 21.0 40 0.001 11.6 1.2 13 366-378 339-351 (387) 99 PRK02929 L-arabinose isomerase 21.0 40 0.001 11.6 4.9 39 112-158 200-238 (499) 100 TIGR02104 pulA_typeI pullulana 20.7 30 0.00077 12.6 0.2 41 374-419 531-575 (655) 101 PRK02047 hypothetical protein; 20.7 24 0.0006 13.5 -0.4 43 142-184 25-71 (91) 102 KOG0256 consensus 20.5 24 0.00062 13.4 -0.4 43 27-73 53-96 (471) 103 PRK00907 hypothetical protein; 20.4 24 0.0006 13.5 -0.4 37 142-178 26-66 (92) 104 PRK07683 aminotransferase A; V 20.4 41 0.001 11.5 1.8 13 366-378 336-348 (387) 105 TIGR01083 nth endonuclease III 20.3 41 0.001 11.5 2.1 53 257-313 48-106 (192) No 1 >PRK01490 tig trigger factor; Provisional Probab=100.00 E-value=0 Score=701.33 Aligned_cols=432 Identities=28% Similarity=0.520 Sum_probs=415.8 Q ss_pred CCEEEEECCCCEEEEEEEECHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 91189972797699999854899999999999998750738985897048899998610899999999999999999998 Q gi|254780897|r 1 MQVIEKFSEGLKRELDVIIPSNRLTDSFNERIEDIRSKANIKGFRPGKVPLSHIKSLYGKSILSETIDEIIKEIVPEILS 80 (473) Q Consensus 1 M~v~~~~~~~~~~~l~v~v~~~~~~~~~~~~~~~~~~~~~ipGFRkGKvP~~ii~~~~g~~i~~e~~~~li~~~~~~~i~ 80 (473) |+|++++.++|+++|+|+||+++|...++++++.++++++||||||||||++||+++||++|+.++++.+++.++.++++ T Consensus 1 M~i~~~~~~~~~~~l~v~v~~~~~~~~~~~~~~~~~k~~~ipGFRkGKvP~~ii~k~yg~~i~~e~~~~li~~~~~~~i~ 80 (435) T PRK01490 1 MQVTVEKLEGLERRLTITVPAEEIEKAVDKALKKLAKTVRIPGFRKGKVPRKIVEQRYGESVRQEALNELLPEAYEEAIK 80 (435) T ss_pred CCEEEEECCCCEEEEEEEECHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 96069977886799999998999999999999998652888998999999999999975999999999999999999999 Q ss_pred HCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHC Q ss_conf 72995422874122222233321101378820578876227877621113564310012111157899999976431000 Q gi|254780897|r 81 KRDERAAMRPSITINEGESDITSGLIEGTVDLKLRLSYDILPQIEISSFDDLQVTQDICEVDEKEIDRQMAEIAKNNVAF 160 (473) Q Consensus 81 ~~~~~~~~~P~i~~~~~~~~~~~~~~~~~~~~~~~~~~ev~Pei~l~~y~~i~i~~~~~~vtd~~Id~~i~~l~~~~~~~ 160 (473) ++++.|+|+|.+.... ++.+++|+|+++|+++|+|++++|++++++++.++||+++|+++|+++|+++|+| T Consensus 81 ~~~l~~v~~P~i~~~~---------~~~~~~~~f~~~~ev~Pe~~l~~~k~i~v~~~~~~vtd~~Vd~~i~~l~~~~a~~ 151 (435) T PRK01490 81 EEGLRPAGQPEIEVVE---------FEKGKDLEFTAEVEVYPEVELGDYKGLEVEKPVVEVTDEDVDEMLENLRKQFATL 151 (435) T ss_pred HCCCCCCCCCCCCCCC---------CCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 6588756787445123---------5789977999999846877654433433345666789899999999999750765 Q ss_pred CCCCCCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEECCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 02343322111134445666426534666544203651786656310342068878875433234444444100038730 Q gi|254780897|r 161 EVKETESEIGDKVTVDYTVSVDNVILEDQSKKNVQFIVGSADLFSETTEILVGLKTGDQKEIERFFPEDHSIKDLAGKKV 240 (473) Q Consensus 161 ~~~~~~~~~gD~v~id~~~~~dg~~~~~~~~~~~~~~lg~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~laGk~v 240 (473) ++++|+++.||+|+|||+|++||++++|++++++.+.||++.++|||.++|+|||+||+++|+++||+||+.+++|||++ T Consensus 152 ~~~e~~~~~gD~v~id~~~~~dg~~~~~~~~~~~~~~lg~~~~i~gf~~~LiG~k~Gd~~~~~v~fp~d~~~~~laGk~~ 231 (435) T PRK01490 152 VPVERPAENGDRVTIDFVGSIDGEEFEGGKAEDFSLELGSGRMIPGFEEQLVGMKAGEEKTIDVTFPEDYHAEDLAGKEA 231 (435) T ss_pred CCCCHHHCCCCEEEEEEEEEECCCCCCCCCCCCEEEEECCCCCCHHHHHHHCCCCCCCEEEEEECCCCCCCCHHHCCCCE T ss_conf 56540212698799998987668204677765069990588775559998546877862112203764222024348852 Q ss_pred EEEEEHHHHHCCCCCCCCHHHHHHHC--CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 46761011002467776747887502--3655789999998878999977777669999999986400347999999997 Q gi|254780897|r 241 RLNFSIKEVFSPLPVVVNNDLAVRLG--FESESAMRGLCSQKIKQHSEFLVRQKVKRQILDYISNKYTFDVPESLVENEY 318 (473) Q Consensus 241 ~f~v~v~~I~~~~~pel~def~k~~~--~~t~~elk~~ik~~l~~~~~~~~~~~~~~~i~~~L~~~~~~~lPe~lv~~e~ 318 (473) .|+|+|++|+++.+|+|||+||+++| ++|+++||+.|+++|..++...+++.++++++++|++.++|++|++||++++ T Consensus 232 ~f~v~v~~I~~~~lPeldDefak~~~~~~~t~~elk~~ik~~l~~~~~~~~~~~~~~~i~~~L~~~~~~~lPe~lv~~e~ 311 (435) T PRK01490 232 TFKVTVKEVKEKELPELDDEFAKKLGEEFETLEELRADIRKNLERELKEAQRAKVKEAVLDALVENAEFDLPEALVEQEI 311 (435) T ss_pred EEEEEEEEEECCCCCCCCHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH T ss_conf 79998755302668987789999808753779999999999999999999999999999999998577779889999999 Q ss_pred HHHHHHHHHHHHHCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCHHHH Q ss_conf 76789999997534864001100013456778999999999999999998643874289999999999998639998999 Q gi|254780897|r 319 NGILQKVRFEMSSANQKSQNVDSIDEEDLQYYHMLAKRRVLTGIVLGTIGEKNNIEVTEEEMQSALYQQLGRFPGHEKKM 398 (473) Q Consensus 319 ~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~Aek~vk~~lil~~Ia~~e~I~vs~~Ei~~~i~~~~~~~~~~~~~~ 398 (473) +++++++..++.++|++... +.+++++.+++.|+++|||+||+++||++++|+||++|+.+++.+++++||..++.+ T Consensus 312 ~~~~~~~~~~~~~~g~~~~~---~~e~~~e~~~~~A~~~vk~~lil~~ia~~e~I~vt~eev~~~i~~~a~~~g~~~~~v 388 (435) T PRK01490 312 DRLLRQAAQRLGQQGIDLEK---TEEELREEFREQAERRVKLGLLLAEIAKAEELEVSDEEVKAEIEEMASQYEQPPEVI 388 (435) T ss_pred HHHHHHHHHHHHHCCCCCCC---CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCHHHH T ss_conf 99999999999854898010---799999999999999999999999999996898899999999999999869989999 Q ss_pred HHHHHHCHHHHHHHHHHHHHHHHHHHHHHHCEEEEEEECHHHHCCCC Q ss_conf 99987099989999999999999999988535665531488840676 Q gi|254780897|r 399 LDHFQKYPNALAELRAPIFEDKVIDHILKSVQIVDRKVTFDQLFDNS 445 (473) Q Consensus 399 ~~~~~~~~~~~~~i~~~ile~Kv~d~l~~~a~i~ek~~~~~el~~~~ 445 (473) ++++++++++.++++.++++|+++||+++|++++++++++++++.. T Consensus 389 -~~~~~~~~~~~~i~~~i~~~Kvld~l~~~a~v~ek~~s~~e~~k~~ 434 (435) T PRK01490 389 -EEYLKNPELLAALRADVLEEKVVDWLLEKAKVTEKEVSFEELMKEQ 434 (435) T ss_pred -HHHHHCHHHHHHHHHHHHHHHHHHHHHHHCEEEEEECCHHHHHCCC T ss_conf -9999697899999999999999999998595578864499983567 No 2 >COG0544 Tig FKBP-type peptidyl-prolyl cis-trans isomerase (trigger factor) [Posttranslational modification, protein turnover, chaperones] Probab=100.00 E-value=0 Score=680.40 Aligned_cols=435 Identities=26% Similarity=0.490 Sum_probs=419.6 Q ss_pred CCEEEEECCCCEEEEEEEECHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 91189972797699999854899999999999998750738985897048899998610899999999999999999998 Q gi|254780897|r 1 MQVIEKFSEGLKRELDVIIPSNRLTDSFNERIEDIRSKANIKGFRPGKVPLSHIKSLYGKSILSETIDEIIKEIVPEILS 80 (473) Q Consensus 1 M~v~~~~~~~~~~~l~v~v~~~~~~~~~~~~~~~~~~~~~ipGFRkGKvP~~ii~~~~g~~i~~e~~~~li~~~~~~~i~ 80 (473) |++++++++++.+.++|+||++++++.+++++++++++++||||||||||++||++|||.++++++++.+++.+|..+++ T Consensus 1 M~v~~e~~~~~~~~l~v~vp~~~~~~~~~~~~~~~~k~v~IpGFRkGKvP~~ii~~ryg~~v~~d~~~~ll~~~~~~ai~ 80 (441) T COG0544 1 MKVTVEKLEGLEVRLTVEVPAEEIKKALDKALKKLAKKVKIPGFRKGKVPRKVIEQRYGEAVRQDVLNELLPEAFEEAIK 80 (441) T ss_pred CCEEEEECCCCEEEEEEEECHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 97036545885799999988899889999999999964737897999888999999984999999999999999999999 Q ss_pred HCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHC Q ss_conf 72995422874122222233321101378820578876227877621113564310012111157899999976431000 Q gi|254780897|r 81 KRDERAAMRPSITINEGESDITSGLIEGTVDLKLRLSYDILPQIEISSFDDLQVTQDICEVDEKEIDRQMAEIAKNNVAF 160 (473) Q Consensus 81 ~~~~~~~~~P~i~~~~~~~~~~~~~~~~~~~~~~~~~~ev~Pei~l~~y~~i~i~~~~~~vtd~~Id~~i~~l~~~~~~~ 160 (473) +.++.|+|+|.+.+.+ ++.+.+|+|++.|+|+|+|++++|++|+++++.++++|++|++.|+++++++++| T Consensus 81 e~~~~~~~~p~~~~~~---------~e~~~~~~f~~~~ev~Pev~l~d~~~i~v~~~~~ev~d~dvd~~L~~l~~~~a~~ 151 (441) T COG0544 81 EEGLKPAGQPEIEITE---------FEKGEDFEFTAEVEVYPEVELGDYKGIEVEKPVVEVTDEDVDEELEKLRKRFATL 151 (441) T ss_pred HHCCCCCCCCCCCCCC---------CCCCCCEEEEEEEEEEECEECCCCCCCEEECCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 7086757888755421---------2578856999999984013147656652313776558899999999999961833 Q ss_pred CCCCCCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEECCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 02343322111134445666426534666544203651786656310342068878875433234444444100038730 Q gi|254780897|r 161 EVKETESEIGDKVTVDYTVSVDNVILEDQSKKNVQFIVGSADLFSETTEILVGLKTGDQKEIERFFPEDHSIKDLAGKKV 240 (473) Q Consensus 161 ~~~~~~~~~gD~v~id~~~~~dg~~~~~~~~~~~~~~lg~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~laGk~v 240 (473) ++++++++.||.|+|||.|++||++|+|++++|+.++||+++|||||+++|+|||+||+++|+++||++|+..+||||.+ T Consensus 152 ~~~e~~a~~gD~v~IDf~g~iDg~~fegg~ae~~~l~lGs~~fipgFe~~LvG~k~Ge~k~i~vtFP~~y~a~~LaGK~a 231 (441) T COG0544 152 EPVEGAAENGDRVTIDFEGSVDGEEFEGGKAENFSLELGSGRFIPGFEDQLVGMKAGEEKDIKVTFPEDYHAEELAGKEA 231 (441) T ss_pred CCCCCCCCCCCEEEEEEEEEECCEECCCCCCCCEEEEECCCCCHHHHHHHHCCCCCCCEEEEEEECCCCCCHHHHCCCCE T ss_conf 11443002599899999998768034675436707997078744428864136857975689887166530667389963 Q ss_pred EEEEEHHHHHCCCCCCCCHHHHHHHCCC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHH Q ss_conf 4676101100246777674788750236-557899999988789999777776699999999864003479999999977 Q gi|254780897|r 241 RLNFSIKEVFSPLPVVVNNDLAVRLGFE-SESAMRGLCSQKIKQHSEFLVRQKVKRQILDYISNKYTFDVPESLVENEYN 319 (473) Q Consensus 241 ~f~v~v~~I~~~~~pel~def~k~~~~~-t~~elk~~ik~~l~~~~~~~~~~~~~~~i~~~L~~~~~~~lPe~lv~~e~~ 319 (473) +|+|+|++|+.+.+|+||||||+++|.. |+++||+.+|++|+.+.........++++++.|.+.+.|++|++||++|++ T Consensus 232 ~F~V~vkeVk~~elpEldDEfAk~~~~~~tLe~Lk~~~r~~le~~~~~~~~~~~~~~~~~~L~e~~~~dlP~sli~~E~~ 311 (441) T COG0544 232 TFKVKVKEVKKRELPELDDEFAKKLGEEDTLEELKEKLRKNLERELKEATLEKRKEQLLDALVEANDFDLPESLVEAEID 311 (441) T ss_pred EEEEEEEEEEECCCCCCCHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH T ss_conf 89999999852578887879998658321399999999999999999999999999999998751577898999999999 Q ss_pred HHHHHHHHHHHHCCCC-CHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCC-CHHH Q ss_conf 6789999997534864-0011000134567789999999999999999986438742899999999999986399-9899 Q gi|254780897|r 320 GILQKVRFEMSSANQK-SQNVDSIDEEDLQYYHMLAKRRVLTGIVLGTIGEKNNIEVTEEEMQSALYQQLGRFPG-HEKK 397 (473) Q Consensus 320 ~~~~~~~~~l~~~~~~-~~~~~~~~e~~~~~~~~~Aek~vk~~lil~~Ia~~e~I~vs~~Ei~~~i~~~~~~~~~-~~~~ 397 (473) ++++++..+++++|.. ....+.+.+++++.++..|++|||++|++++||+.++|+||++++.+.+..++++|++ .+.+ T Consensus 312 ~l~~~~~~~l~~~~~~~~~~~~~~~~~~~e~~~~~A~krVk~~Lvl~~ia~~~~i~v~~eei~~~i~~~a~~y~~~~~~e 391 (441) T COG0544 312 NLLKQALQQLQQQGIDSLEASGESEEELREEFKEEAEKRVKLGLLLEEIAKEEKLEVTEEEIKAEIEELARQYGGEQPEE 391 (441) T ss_pred HHHHHHHHHHHHCCCCHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHH T ss_conf 99999999988626524442242089999999999999998888999999870795799999999999999817976799 Q ss_pred HHHHHHHCHHHHHHHHHHHHHHHHHHHHHH-HCEEEEEEECHHHHCCC Q ss_conf 999987099989999999999999999988-53566553148884067 Q gi|254780897|r 398 MLDHFQKYPNALAELRAPIFEDKVIDHILK-SVQIVDRKVTFDQLFDN 444 (473) Q Consensus 398 ~~~~~~~~~~~~~~i~~~ile~Kv~d~l~~-~a~i~ek~~~~~el~~~ 444 (473) +.++|+.+++.+..+++.+++.+++++++. .+++++++++++++++. T Consensus 392 ~~~~~~~~~~~~~~~k~~~~~~k~v~~~~~~~~~~~~~~~~~~e~~~~ 439 (441) T COG0544 392 VIKLYYNNQELLDALKADILEEKAVDLLLANKKKVTEKEVSFEELMNE 439 (441) T ss_pred HHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCC T ss_conf 999986275778877899999999999864410002566785564034 No 3 >TIGR00115 tig trigger factor; InterPro: IPR005215 The trigger factor is found in several prokaryotes, and is involved in protein export. Trigger factor is a ribosome-associated molecular chaperone and is the first chaperone to interact with nascent polypeptide. It acts as a chaperone by maintaining the newly synthesised protein in an open conformation. Trigger factor can bind at the same time as the signal recognition particle (SRP), but is excluded by the SRP receptor (FtsY). The central domain of trigger factor has peptidyl-prolyl cis/trans isomerase activity .; GO: 0015031 protein transport. Probab=100.00 E-value=0 Score=563.55 Aligned_cols=429 Identities=23% Similarity=0.403 Sum_probs=390.2 Q ss_pred CCEEEEECCCCEE--EEEEEECHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHH-----HHHHHHHHHHHHHH Q ss_conf 9118997279769--9999854899999999999998750738985897048899998610-----89999999999999 Q gi|254780897|r 1 MQVIEKFSEGLKR--ELDVIIPSNRLTDSFNERIEDIRSKANIKGFRPGKVPLSHIKSLYG-----KSILSETIDEIIKE 73 (473) Q Consensus 1 M~v~~~~~~~~~~--~l~v~v~~~~~~~~~~~~~~~~~~~~~ipGFRkGKvP~~ii~~~~g-----~~i~~e~~~~li~~ 73 (473) |++++++..+..+ .|+|+||++.++..|++.+++++++++|||||+||||+++|++||| .++.++|++.|++. T Consensus 2 m~~~~~~~~~~~~~L~l~i~i~~~~~~~~~~~~~~~~~k~v~I~GFR~GKVP~~l~~qr~~~AEL~~~~~q~vl~~l~~~ 81 (475) T TIGR00115 2 MSVKVEKLPGSEVGLRLEIEIDAEVIKKAYDKLLKKLAKKVKIDGFRKGKVPRNLVEQRYGKAELLASIKQDVLEELIQR 81 (475) T ss_pred CCEEEEEECCCCCEEEEEEEECHHHHHHHHHHHHHHHHHEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 63055420477743899997666889999999998876200358635788786999854116899999999999987367 Q ss_pred HHHHHHHHCCCC------CCCCCCCCCCCCCCCCCCCCCCCC--CCEEEEEEEECCCCCCCC--CCCCCCCCCCCCCCCH Q ss_conf 999999872995------422874122222233321101378--820578876227877621--1135643100121111 Q gi|254780897|r 74 IVPEILSKRDER------AAMRPSITINEGESDITSGLIEGT--VDLKLRLSYDILPQIEIS--SFDDLQVTQDICEVDE 143 (473) Q Consensus 74 ~~~~~i~~~~~~------~~~~P~i~~~~~~~~~~~~~~~~~--~~~~~~~~~ev~Pei~l~--~y~~i~i~~~~~~vtd 143 (473) .+.+++.+.++. +.+.|.+.+.. ++.| .+|+|++.|+|+|+|+++ +|++|++..|.+.++| T Consensus 82 ~~~~av~~~~~~~~Gnf~~~~~Pt~~~~~---------~~~~ld~~~~~~~~~~v~Pe~~l~~~~~k~i~~~~p~~~~~d 152 (475) T TIGR00115 82 SFKEAVKEEGINVKGNFSIADRPTIDIKT---------IKEGLDKDLVFTATFDVYPEVKLPSEDYKKIKVEKPEVEVTD 152 (475) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCHHH---------HCCCCCCCEEEEEEEEEEEEEEECCCCCCCCCCCCCCEEECH T ss_conf 89999997177653343436687520533---------123678871899998763026618755202110378455068 Q ss_pred HHHHHHHHHHHHHHHHCCCCC-C-CCCCCCEEEEE---EEEEECC---CCCCCCCCCCEEEEECCCCCCCCHHHHCCCCC Q ss_conf 578999999764310000234-3-32211113444---5666426---53466654420365178665631034206887 Q gi|254780897|r 144 KEIDRQMAEIAKNNVAFEVKE-T-ESEIGDKVTVD---YTVSVDN---VILEDQSKKNVQFIVGSADLFSETTEILVGLK 215 (473) Q Consensus 144 ~~Id~~i~~l~~~~~~~~~~~-~-~~~~gD~v~id---~~~~~dg---~~~~~~~~~~~~~~lg~~~~~~~f~~~liG~k 215 (473) ++||..|++|++++|+|++++ + +++.||.|+|| |.|.+|| ++|+|+++.||.+.||+++|++||+++|+||+ T Consensus 153 ~~~d~~lk~L~k~~a~~v~~~~~r~v~~~d~v~iDyaaF~g~vDg~aG~~~e~gka~~f~L~~G~~~~i~GFe~gl~Gm~ 232 (475) T TIGR00115 153 EDVDNELKNLQKQQATLVEVEDKRAVANDDIVTIDYAAFVGFVDGSAGEEFEGGKAENFELTLGSGRFIPGFEDGLVGMK 232 (475) T ss_pred HHHHHHHHHHHHHCCEEEECCCCCCCCCCCEEEEEEEEEEEEECCCCCHHHCCCCCCCEEEEEECCCCCCCHHHHHHHEE T ss_conf 99988999998612712122577000048858996200133311534401115410251789506771245542113200 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHHHHCCCCCCCCHHHHHHHC------CCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 887543323444444410003873046761011002467776747887502------36557899999988789999777 Q gi|254780897|r 216 TGDQKEIERFFPEDHSIKDLAGKKVRLNFSIKEVFSPLPVVVNNDLAVRLG------FESESAMRGLCSQKIKQHSEFLV 289 (473) Q Consensus 216 ~Gd~~~~~~~~P~d~~~~~laGk~v~f~v~v~~I~~~~~pel~def~k~~~------~~t~~elk~~ik~~l~~~~~~~~ 289 (473) +|+++.|++|||+|||.++|+||+|.|.|++|.|+.+.+|+|||+||+.++ +.|+++||+.++++|....+... T Consensus 233 ~ge~k~i~~tFP~dYhaE~LaGk~~~F~i~LK~Ik~relpelddef~k~~~~~~~q~~~tl~~Lk~~~~k~l~~~~k~~~ 312 (475) T TIGR00115 233 AGEEKDIEVTFPEDYHAEELAGKPAKFKIKLKEIKKRELPELDDEFAKDVLEEEYQEFNTLAELKAEVRKQLEEELKQEL 312 (475) T ss_pred CCCEEEEECCCCCCCCHHHHCCCCEEEEEEEHHHHHCCCCCCCHHHHHHCCHHHCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 47654430268752682542597324543200121115798786899740765401132289999999999999999999 Q ss_pred HHHHHHH-HHHHHHHHHCCC--CCHHHHHHHHHHHHHHHHHHHHHCCCCCHHH----HCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 7766999-999998640034--7999999997767899999975348640011----00013456778999999999999 Q gi|254780897|r 290 RQKVKRQ-ILDYISNKYTFD--VPESLVENEYNGILQKVRFEMSSANQKSQNV----DSIDEEDLQYYHMLAKRRVLTGI 362 (473) Q Consensus 290 ~~~~~~~-i~~~L~~~~~~~--lPe~lv~~e~~~~~~~~~~~l~~~~~~~~~~----~~~~e~~~~~~~~~Aek~vk~~l 362 (473) ....+++ +...|.+.+.++ ||++||+.|+++++++...++++.|+..+.+ ........+..+..|..+++..| T Consensus 313 ~~~~~~~k~i~~l~e~~~~DP~~P~~~I~~E~drll~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~ea~~~~k~~l 392 (475) T TIGR00115 313 DNQEKNQKLIEKLSEKNTIDPYLPESMIEQEIDRLLRNQKQRAQRYGMNLEQLLQLSRESLEELKDRNKKEALKNVKSAL 392 (475) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999899999898731578875078999999999999999999718888999731256899987632888987568998 Q ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCHHH--HHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHCEEEEEEECHHH Q ss_conf 99999864387428999999999999863999899--9999870999899999999999999999885356655314888 Q gi|254780897|r 363 VLGTIGEKNNIEVTEEEMQSALYQQLGRFPGHEKK--MLDHFQKYPNALAELRAPIFEDKVIDHILKSVQIVDRKVTFDQ 440 (473) Q Consensus 363 il~~Ia~~e~I~vs~~Ei~~~i~~~~~~~~~~~~~--~~~~~~~~~~~~~~i~~~ile~Kv~d~l~~~a~i~ek~~~~~e 440 (473) ++.+||+.|+|.|++.||.+.+..++..++..... .+..+..+ +++......+.++-+...++.. .++.+..++.+ T Consensus 393 ~~~~~Ad~Ekikv~~~ev~~~~~~~~~~~~~~~~n~~~~~~~~~~-~l~~~~~~~~~kd~v~~~~~~~-~~~~~~~e~~~ 470 (475) T TIGR00115 393 IMEAIADEEKIKVTEEEVKSELEEEAEAVEYPSENEELKKQIRES-SLLEAKNMALLKDAVEAELLKK-LVTEKIKEYSE 470 (475) T ss_pred HHHHHHCHHCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHC-CCHHHHEECCC T ss_conf 899871220378876899999999999828971248999987533-4678888998887999998603-23012101042 No 4 >pfam05697 Trigger_N Bacterial trigger factor protein (TF). In the E. coli cytosol, a fraction of the newly synthesized proteins requires the activity of molecular chaperones for folding to the native state. The major chaperones implicated in this folding process are the ribosome-associated Trigger Factor (TF), and the DnaK and GroEL chaperones with their respective co-chaperones. Trigger Factor is an ATP-independent chaperone and displays chaperone and peptidyl-prolyl-cis-trans-isomerase (PPIase) activities in vitro. It is composed of at least three domains, an N-terminal domain which mediates association with the large ribosomal subunit, a central substrate binding and PPIase domain with homology to FKBP proteins, and a C-terminal domain of unknown function. The positioning of TF at the peptide exit channel, together with its ability to interact with nascent chains as short as 57 residues renders TF a prime candidate for being the first chaperone that binds to the nascent polypeptide Probab=99.96 E-value=2.3e-28 Score=246.11 Aligned_cols=144 Identities=28% Similarity=0.576 Sum_probs=138.0 Q ss_pred CCEEEEECCCCEEEEEEEECHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 91189972797699999854899999999999998750738985897048899998610899999999999999999998 Q gi|254780897|r 1 MQVIEKFSEGLKRELDVIIPSNRLTDSFNERIEDIRSKANIKGFRPGKVPLSHIKSLYGKSILSETIDEIIKEIVPEILS 80 (473) Q Consensus 1 M~v~~~~~~~~~~~l~v~v~~~~~~~~~~~~~~~~~~~~~ipGFRkGKvP~~ii~~~~g~~i~~e~~~~li~~~~~~~i~ 80 (473) |++++++.++|+++++|+||+++|+..++++++.++++++||||||||||+++|+++||++|++++++.+++.++.++++ T Consensus 1 M~~~~~~~~~~~~~~~v~i~~~~~~~~~~~~~~~~~k~~~ipGFRkGKvP~~ii~k~~g~~i~~e~~~~li~~~~~~~~~ 80 (144) T pfam05697 1 MKVTVEKLEGLEVKLTVEVPAEEVEEAVDKALKELAKKVKIPGFRKGKVPRSVVKKRYGKEIYQEALEDLLQEAYEEALK 80 (144) T ss_pred CCEEEEECCCCEEEEEEEECHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 95389977897899999998999999999999999810889997899889999999987999999999999999999998 Q ss_pred HCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 7299542287412222223332110137882057887622787762111356431001211115789999997 Q gi|254780897|r 81 KRDERAAMRPSITINEGESDITSGLIEGTVDLKLRLSYDILPQIEISSFDDLQVTQDICEVDEKEIDRQMAEI 153 (473) Q Consensus 81 ~~~~~~~~~P~i~~~~~~~~~~~~~~~~~~~~~~~~~~ev~Pei~l~~y~~i~i~~~~~~vtd~~Id~~i~~l 153 (473) ++++.|+|+|.+.... +..+++|+|+++|+++|+|++++|++++++++..+|++++|+++|++| T Consensus 81 e~~~~~~~~P~i~~~~---------~~~~~~~~~~~~~ev~Pei~l~~~~~i~v~~~~~~v~d~dId~~i~~l 144 (144) T pfam05697 81 EEKLRPIGQPEIEEVE---------LEEGKDLEFTAEFEVYPEVELGDYKGLEVEKPEVEVTDEDVDEELERL 144 (144) T ss_pred HCCCCCCCCCCCCCCC---------CCCCCCEEEEEEEEECCCEECCCCCCCEEEECCCCCCHHHHHHHHHCC T ss_conf 4556736687311034---------567985799999996485003766764577567777999999999709 No 5 >pfam05698 Trigger_C Bacterial trigger factor protein (TF) C-terminus. In the E. coli cytosol, a fraction of the newly synthesized proteins requires the activity of molecular chaperones for folding to the native state. The major chaperones implicated in this folding process are the ribosome-associated Trigger Factor (TF), and the DnaK and GroEL chaperones with their respective co-chaperones. Trigger Factor is an ATP-independent chaperone and displays chaperone and peptidyl-prolyl-cis-trans-isomerase (PPIase) activities in vitro. It is composed of at least three domains, an N-terminal domain which mediates association with the large ribosomal subunit, a central substrate binding and PPIase domain with homology to FKBP proteins, and a C-terminal domain of unknown function. The positioning of TF at the peptide exit channel, together with its ability to interact with nascent chains as short as 57 residues renders TF a prime candidate for being the first chaperone that binds to the nascent p Probab=99.89 E-value=1.1e-20 Score=184.43 Aligned_cols=158 Identities=22% Similarity=0.399 Sum_probs=150.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHH----HCCHH Q ss_conf 55789999998878999977777669999999986400347999999997767899999975348640011----00013 Q gi|254780897|r 269 SESAMRGLCSQKIKQHSEFLVRQKVKRQILDYISNKYTFDVPESLVENEYNGILQKVRFEMSSANQKSQNV----DSIDE 344 (473) Q Consensus 269 t~~elk~~ik~~l~~~~~~~~~~~~~~~i~~~L~~~~~~~lPe~lv~~e~~~~~~~~~~~l~~~~~~~~~~----~~~~e 344 (473) |+++||+.|+++|+.++....+..++++++++|++.++|++|+++|+++++++++++..++.++|++.+.+ +...+ T Consensus 1 Tl~elk~~ik~~l~~~~~~~~~~~~~~~i~~~L~~~~~~evP~~~v~~e~~~~~~~~~~~~~~~g~~~e~~~~~~~~~~e 80 (162) T pfam05698 1 TLEELKEKLRKQLEEEKKEATAEELKEAILDKLVENAEIELPESLVEEEIDRLLRQFLQQLQGQGLDLEEYLSLSGSSEE 80 (162) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCHH T ss_conf 98999999999999999999999999999999998788999999999999999999999987669988888765256478 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHH Q ss_conf 45677899999999999999999864387428999999999999863999899999987099989999999999999999 Q gi|254780897|r 345 EDLQYYHMLAKRRVLTGIVLGTIGEKNNIEVTEEEMQSALYQQLGRFPGHEKKMLDHFQKYPNALAELRAPIFEDKVIDH 424 (473) Q Consensus 345 ~~~~~~~~~Aek~vk~~lil~~Ia~~e~I~vs~~Ei~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ile~Kv~d~ 424 (473) ++++.++..|+++||++||+.+||+.++|+||++|+++++.+++.+||.+++.+.++|.++ +.+.++++.++++||++| T Consensus 81 ~~~~~~~~~A~~~vk~~lil~~Ia~~e~i~vs~eev~~~i~~~a~~~~~~~~~i~~~~~~~-~~l~~l~~~ll~~K~~d~ 159 (162) T pfam05698 81 ELREEFKEEAEKRVKLGLILEEIAKEEKIEVTDEEIKEEIEELASQYGMEPEEVKEFYKKN-EQLSALKEDLLEEKAVDL 159 (162) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHCC-HHHHHHHHHHHHHHHHHH T ss_conf 8878889999999999999999998808998999999999999988699999999999755-569999999999999999 Q ss_pred HHH Q ss_conf 988 Q gi|254780897|r 425 ILK 427 (473) Q Consensus 425 l~~ 427 (473) |++ T Consensus 160 l~E 162 (162) T pfam05698 160 LLE 162 (162) T ss_pred HHC T ss_conf 739 No 6 >pfam00254 FKBP_C FKBP-type peptidyl-prolyl cis-trans isomerase. Probab=98.39 E-value=5.5e-07 Score=74.65 Aligned_cols=83 Identities=22% Similarity=0.365 Sum_probs=70.4 Q ss_pred CCCCCCCEEEEEEEEEE-CCCCCCCC--CCCCEEEEECCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCC----- Q ss_conf 33221111344456664-26534666--54420365178665631034206887887543323444444410003----- Q gi|254780897|r 165 TESEIGDKVTVDYTVSV-DNVILEDQ--SKKNVQFIVGSADLFSETTEILVGLKTGDQKEIERFFPEDHSIKDLA----- 236 (473) Q Consensus 165 ~~~~~gD~v~id~~~~~-dg~~~~~~--~~~~~~~~lg~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~la----- 236 (473) +.++.||.|+|+|.+.. ||..|++. ..+.+.|.+|++.+++||+++|.||++|+++.+.+..+..|...... T Consensus 5 ~~p~~gd~V~v~y~~~~~dG~~~dss~~~~~p~~f~lG~~~~i~Gle~al~~M~~Ge~~~v~ip~~~ayG~~g~~~~Ip~ 84 (95) T pfam00254 5 RKPKKGDTVTVHYTGKLEDGTVFDSSYDRGKPFEFTLGSGQVIPGWDEGLLGMKVGEKRKLTIPPELAYGEEGLGGVIPP 84 (95) T ss_pred CCCCCCCEEEEEEEEEECCCCEEECCCCCCCCEEEEECCCCHHHHHHHHHCCCCCCCEEEEEECHHHCCCCCCCCCCCCC T ss_conf 78999899999999997899799806668978799956864677699987487899999999991993488898998597 Q ss_pred CCCEEEEEEHH Q ss_conf 87304676101 Q gi|254780897|r 237 GKKVRLNFSIK 247 (473) Q Consensus 237 Gk~v~f~v~v~ 247 (473) +..+.|+|+|. T Consensus 85 ~~~l~f~Iel~ 95 (95) T pfam00254 85 NATLVFEVELL 95 (95) T ss_pred CCEEEEEEEEC T ss_conf 99799999989 No 7 >PRK10737 FKBP-type peptidyl-prolyl cis-trans isomerase; Provisional Probab=98.27 E-value=9e-07 Score=72.94 Aligned_cols=89 Identities=19% Similarity=0.331 Sum_probs=69.1 Q ss_pred CCCCCCEEEEEEEEEE-CCCCCCCCC-CCCEEEEECCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCC--------- Q ss_conf 3221111344456664-265346665-44203651786656310342068878875433234444444100--------- Q gi|254780897|r 166 ESEIGDKVTVDYTVSV-DNVILEDQS-KKNVQFIVGSADLFSETTEILVGLKTGDQKEIERFFPEDHSIKD--------- 234 (473) Q Consensus 166 ~~~~gD~v~id~~~~~-dg~~~~~~~-~~~~~~~lg~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~--------- 234 (473) .++++.+|.+.|..+. +|+.++... ..-+.|..|.++++|+|+++|+||++||+++|++.--+-|...+ T Consensus 2 kI~kn~vV~l~Y~l~~~dG~vvdss~~~~Pl~fl~G~g~lipglE~aL~G~~~Gd~~~vtl~peeAYG~~de~lv~~vp~ 81 (196) T PRK10737 2 KVAKDLVVSLAYQVRTEDGVLVDESPVSAPLDYLHGHGSLISGLETALEGHEVGDKFDVAVGANDAYGQYDENLVQRVPK 81 (196) T ss_pred EECCCCEEEEEEEEEECCCCEEEECCCCCCEEEEECCCCCCHHHHHHHCCCCCCCEEEEEECHHHHCCCCCHHHEEEECH T ss_conf 31689899999999918999987269998869995389856789998579999977899978799168889799488258 Q ss_pred -------------------------------------------CCCCCEEEEEEHHHHHCCCC Q ss_conf -------------------------------------------03873046761011002467 Q gi|254780897|r 235 -------------------------------------------LAGKKVRLNFSIKEVFSPLP 254 (473) Q Consensus 235 -------------------------------------------laGk~v~f~v~v~~I~~~~~ 254 (473) ||||+.+|.|+|.+|+..+. T Consensus 82 ~~f~~~~~le~Gm~~~~~~~~G~~~~~V~eV~~~~VtvD~NHPLAGk~L~F~VeVv~VReAT~ 144 (196) T PRK10737 82 DVFMGVDELQVGMRFLAETDQGPVPVEITAVEDDHVVVDGNHMLAGQNLKFNVEVVAIREATE 144 (196) T ss_pred HHCCCCCCCCCCCEEEECCCCCEEEEEEEEECCCEEEEECCCCCCCCEEEEEEEEEEEECCCH T ss_conf 896876434567099880799848899999779989995898778986799999998637999 No 8 >COG1047 SlpA FKBP-type peptidyl-prolyl cis-trans isomerases 2 [Posttranslational modification, protein turnover, chaperones] Probab=98.14 E-value=4.4e-06 Score=67.40 Aligned_cols=60 Identities=28% Similarity=0.497 Sum_probs=49.6 Q ss_pred CCCCCCEEEEEEEEEEC-CCCCCCCC--CCCEEEEECCCCCCCCHHHHCCCCCCCCCCCCCCC Q ss_conf 32211113444566642-65346665--44203651786656310342068878875433234 Q gi|254780897|r 166 ESEIGDKVTVDYTVSVD-NVILEDQS--KKNVQFIVGSADLFSETTEILVGLKTGDQKEIERF 225 (473) Q Consensus 166 ~~~~gD~v~id~~~~~d-g~~~~~~~--~~~~~~~lg~~~~~~~f~~~liG~k~Gd~~~~~~~ 225 (473) .++.||+|.++|++... |+.++... .....|.+|.++++|||+++|+||.+|+++++.+. T Consensus 2 ~i~k~~~V~i~Y~~~~~dg~v~Dtt~e~~~P~~~i~G~g~li~glE~al~g~~~Ge~~~V~Ip 64 (174) T COG1047 2 KIEKGDVVSLHYTLKVEDGEVVDTTDENYGPLTFIVGAGQLIPGLEEALLGKEVGEEFTVEIP 64 (174) T ss_pred CCCCCCEEEEEEEEEECCCCEEECCCCCCCCEEEEECCCCCCHHHHHHHHCCCCCCEEEEEEC T ss_conf 556798999999999569848773555579768996698720659998717778961689868 No 9 >COG0545 FkpA FKBP-type peptidyl-prolyl cis-trans isomerases 1 [Posttranslational modification, protein turnover, chaperones] Probab=97.50 E-value=0.00017 Score=54.66 Aligned_cols=83 Identities=20% Similarity=0.364 Sum_probs=65.0 Q ss_pred CCCCCCEEEEEEEEE-ECCCCCCCCCCCC--EEEEECCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC----- Q ss_conf 322111134445666-4265346665442--03651786656310342068878875433234444444100038----- Q gi|254780897|r 166 ESEIGDKVTVDYTVS-VDNVILEDQSKKN--VQFIVGSADLFSETTEILVGLKTGDQKEIERFFPEDHSIKDLAG----- 237 (473) Q Consensus 166 ~~~~gD~v~id~~~~-~dg~~~~~~~~~~--~~~~lg~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~laG----- 237 (473) .+..||.|.+.|.|. .||..|+..-..+ +.|.|| ..|+|..++|.||++|.++.+.+.-+--|......| T Consensus 115 ~~~~~~~V~vhY~G~l~~G~vFDsS~~rg~p~~f~l~--~vI~Gw~egl~~M~vG~k~~l~IP~~laYG~~g~~g~Ippn 192 (205) T COG0545 115 APKKGDTVTVHYTGTLIDGTVFDSSYDRGQPAEFPLG--GVIPGWDEGLQGMKVGGKRKLTIPPELAYGERGVPGVIPPN 192 (205) T ss_pred CCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCEEECC--CEEEHHHHHHHHCCCCCEEEEEECCHHCCCCCCCCCCCCCC T ss_conf 9999997999988864799853453236997060268--72001898873187786489984803236767778988999 Q ss_pred CCEEEEEEHHHHH Q ss_conf 7304676101100 Q gi|254780897|r 238 KKVRLNFSIKEVF 250 (473) Q Consensus 238 k~v~f~v~v~~I~ 250 (473) -+..|+|++.+|+ T Consensus 193 s~LvFeVeLl~v~ 205 (205) T COG0545 193 STLVFEVELLDVK 205 (205) T ss_pred CEEEEEEEEEECC T ss_conf 7079999987519 No 10 >KOG0552 consensus Probab=97.47 E-value=0.0002 Score=54.05 Aligned_cols=85 Identities=15% Similarity=0.261 Sum_probs=69.5 Q ss_pred CCCCCCEEEEEEEEEE--CCCCCCCC-CCCCEE-EEECCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCC----CC Q ss_conf 3221111344456664--26534666-544203-6517866563103420688788754332344444441000----38 Q gi|254780897|r 166 ESEIGDKVTVDYTVSV--DNVILEDQ-SKKNVQ-FIVGSADLFSETTEILVGLKTGDQKEIERFFPEDHSIKDL----AG 237 (473) Q Consensus 166 ~~~~gD~v~id~~~~~--dg~~~~~~-~~~~~~-~~lg~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~l----aG 237 (473) .+..|+.|.++|.|.. +|+.|... ..+.+. |.||++..|+|++-++-||++|.++.+.+.-|--|....+ .+ T Consensus 134 ~a~~G~rV~v~Y~Gkl~~~GkvFd~~~~~kp~~~f~lg~g~VIkG~d~gv~GMkvGGkRrviIPp~lgYg~~g~~~Ippn 213 (226) T KOG0552 134 SAKKGKRVSVRYIGKLKGNGKVFDSNFGGKPFKLFRLGSGEVIKGWDVGVEGMKVGGKRRVIIPPELGYGKKGVPEIPPN 213 (226) T ss_pred CCCCCCEEEEEEEEEECCCCEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCC T ss_conf 78888888898899966787174212478886310037888773078764451037745897584204464676768999 Q ss_pred CCEEEEEEHHHHH Q ss_conf 7304676101100 Q gi|254780897|r 238 KKVRLNFSIKEVF 250 (473) Q Consensus 238 k~v~f~v~v~~I~ 250 (473) .+..|.|++..|. T Consensus 214 stL~fdVEL~~v~ 226 (226) T KOG0552 214 STLVFDVELLSVK 226 (226) T ss_pred CCEEEEEEEEECC T ss_conf 7479999998609 No 11 >KOG0549 consensus Probab=97.26 E-value=0.00073 Score=49.61 Aligned_cols=89 Identities=13% Similarity=0.223 Sum_probs=67.8 Q ss_pred CCCCCCCCEEEEEEEEEE-CCCCCCCCCC--CCEEEEECCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC--- Q ss_conf 433221111344456664-2653466654--4203651786656310342068878875433234444444100038--- Q gi|254780897|r 164 ETESEIGDKVTVDYTVSV-DNVILEDQSK--KNVQFIVGSADLFSETTEILVGLKTGDQKEIERFFPEDHSIKDLAG--- 237 (473) Q Consensus 164 ~~~~~~gD~v~id~~~~~-dg~~~~~~~~--~~~~~~lg~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~laG--- 237 (473) ...++.||.+.+.|.++. ||..|...-. .-++|.||.++.|+|.+.+|.||=+|+.+.+.+...-.|...-..+ T Consensus 82 ~~kak~GD~l~~HY~g~leDGt~fdSS~~rg~P~~f~LG~gqVIkG~Dqgl~gMCvGEkRkl~IPp~LgYG~~G~~~~IP 161 (188) T KOG0549 82 PEKAKKGDTLHVHYTGSLEDGTKFDSSYSRGAPFTFTLGTGQVIKGWDQGLLGMCVGEKRKLIIPPHLGYGERGAPPKIP 161 (188) T ss_pred CCCCCCCCEEEEEEEEEECCCCEEEEECCCCCCEEEEECCCCEECCHHHHHHHHCCCCCEEEECCCCCCCCCCCCCCCCC T ss_conf 53246788799998989548988960056899889991788520247677665170662278658312576679899889 Q ss_pred --CCEEEEEEHHHHHCC Q ss_conf --730467610110024 Q gi|254780897|r 238 --KKVRLNFSIKEVFSP 252 (473) Q Consensus 238 --k~v~f~v~v~~I~~~ 252 (473) -...|+|.+.+|.+. T Consensus 162 ~~A~LiFdiELv~i~~~ 178 (188) T KOG0549 162 GDAVLIFDIELVKIERG 178 (188) T ss_pred CCEEEEEEEEEEEEECC T ss_conf 97169999999984057 No 12 >pfam09312 SurA_N SurA N-terminal domain. This domain is found at the N-terminus of the chaperone SurA. It is a helical domain of unknown function. The C-terminus of the SurA protein folds back and forms part of this domain also but is not included in the current alignment. Probab=97.09 E-value=0.034 Score=36.18 Aligned_cols=100 Identities=11% Similarity=0.121 Sum_probs=62.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH Q ss_conf 79999999977678999999753486400110001345677899999999999999999864387428999999999999 Q gi|254780897|r 309 VPESLVENEYNGILQKVRFEMSSANQKSQNVDSIDEEDLQYYHMLAKRRVLTGIVLGTIGEKNNIEVTEEEMQSALYQQL 388 (473) Q Consensus 309 lPe~lv~~e~~~~~~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~Aek~vk~~lil~~Ia~~e~I~vs~~Ei~~~i~~~~ 388 (473) +-.+-+++.+.. +..++..++..... ...++..+-.++-..-++..-|++.+|.||+++|+..+...+ T Consensus 13 It~~el~~~~~~----~~~~~~~~~~~~~~--------~~~l~~~vL~~LI~~~L~~q~A~~~gi~vsd~evd~~i~~ia 80 (118) T pfam09312 13 ILQSELDRRVKT----VKRQLAKQGTQLPP--------DAVLRRQVLERLILERIQLQMAERTGIRVDDEQLNQAIARIA 80 (118) T ss_pred CCHHHHHHHHHH----HHHHHHHCCCCCCC--------HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH T ss_conf 489999999999----99999865899992--------899999999999999999999998599998999999999999 Q ss_pred HHCCCCHHHHHHHHHHCHHHHHHHHHHHHHHH Q ss_conf 86399989999998709998999999999999 Q gi|254780897|r 389 GRFPGHEKKMLDHFQKYPNALAELRAPIFEDK 420 (473) Q Consensus 389 ~~~~~~~~~~~~~~~~~~~~~~~i~~~ile~K 420 (473) .+++.+..++...+.+.+-....++..|..+- T Consensus 81 ~~n~~s~~~~~~~L~~~G~~~~~~r~~ir~qi 112 (118) T pfam09312 81 QQNNMTLDQLRAALAADGLSYDEFREQIRKEM 112 (118) T ss_pred HHCCCCHHHHHHHHHHCCCCHHHHHHHHHHHH T ss_conf 88598999999999984999999999999999 No 13 >PRK10902 FKBP-type peptidyl-prolyl cis-trans isomerase; Provisional Probab=97.01 E-value=0.0011 Score=48.31 Aligned_cols=84 Identities=14% Similarity=0.135 Sum_probs=61.3 Q ss_pred CCCCCEEEEEEEEE-ECCCCCCCCCCCCEEEEECCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCC----CCCCCEE Q ss_conf 22111134445666-426534666544203651786656310342068878875433234444444100----0387304 Q gi|254780897|r 167 SEIGDKVTVDYTVS-VDNVILEDQSKKNVQFIVGSADLFSETTEILVGLKTGDQKEIERFFPEDHSIKD----LAGKKVR 241 (473) Q Consensus 167 ~~~gD~v~id~~~~-~dg~~~~~~~~~~~~~~lg~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~----laGk~v~ 241 (473) ...+|.|+|+|.|+ +||..|+..-..+.++...-+.+|||+.++|..|++|.++.+-+.----|...- -.+-... T Consensus 161 P~~~D~V~VhY~G~LiDGtvFDSS~~rg~P~~f~l~~VIpGw~EgL~~Mk~G~k~~l~IP~~LAYG~~g~~~IPpnstLi 240 (270) T PRK10902 161 PKDSDTVVVNYKGTLIDGKEFDNSYTRGEPLSFRLDGVIPGWTEGLKNIKKGGKIKLVIPPELAYGKAGVPGIPANSTLV 240 (270) T ss_pred CCCCCEEEEEEEEEECCCCEEECCCCCCCCEEEECCCHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCCCCCCCEEE T ss_conf 89999789999999789998616567799816677642369999984089997699998840151768899989898489 Q ss_pred EEEEHHHHH Q ss_conf 676101100 Q gi|254780897|r 242 LNFSIKEVF 250 (473) Q Consensus 242 f~v~v~~I~ 250 (473) |+|.+.+|+ T Consensus 241 FevELl~Ik 249 (270) T PRK10902 241 FDVELLDVK 249 (270) T ss_pred EEEEEEEEE T ss_conf 999999750 No 14 >TIGR03516 ppisom_GldI peptidyl-prolyl isomerase, gliding motility-associated. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldI is a FKBP-type peptidyl-prolyl cis-trans isomerase (pfam00254) linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Knockout of this gene abolishes the gliding phenotype. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. This family is only found in Bacteroidetes containing the suite of genes proposed to confer the gliding motility phenotype. Probab=96.97 E-value=0.0014 Score=47.23 Aligned_cols=83 Identities=20% Similarity=0.300 Sum_probs=62.4 Q ss_pred CCCCCCEEEEEEEEE-ECCCCCCCCCC-CCEEEEECCCCCCCCHHHHCCCCCCCCCCCCCCCCCCC--CCCC-----CCC Q ss_conf 322111134445666-42653466654-42036517866563103420688788754332344444--4410-----003 Q gi|254780897|r 166 ESEIGDKVTVDYTVS-VDNVILEDQSK-KNVQFIVGSADLFSETTEILVGLKTGDQKEIERFFPED--HSIK-----DLA 236 (473) Q Consensus 166 ~~~~gD~v~id~~~~-~dg~~~~~~~~-~~~~~~lg~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d--~~~~-----~la 236 (473) ..+.||.|+++|.+. .||..|..... ..++|.+|.+++++|+.++|.-|++|++..|- +|.. |... --. T Consensus 85 ~P~~gd~V~~~Y~~~~ldG~~~~ss~~~gp~~f~v~~~~vi~G~~egl~lm~~Ge~~~~~--iPs~LaYG~~G~~~~I~p 162 (177) T TIGR03516 85 TPEFGDLVTFEYDIRALDGDVIYSEEELGPQTYKVDQQDLFSGLRDGLKLMKEGETATFL--FPSHKAYGYYGDQNKIGP 162 (177) T ss_pred CCCCCCEEEEEEEEEECCCCEEEEECCCCCEEEEECCCCCCHHHHHHHHHCCCCCEEEEE--ECCHHCCCCCCCCCCCCC T ss_conf 999999899999999768988874034686569855656464899999855789789999--573004466788995496 Q ss_pred CCCEEEEEEHHHHH Q ss_conf 87304676101100 Q gi|254780897|r 237 GKKVRLNFSIKEVF 250 (473) Q Consensus 237 Gk~v~f~v~v~~I~ 250 (473) +.+..|+|++.+|+ T Consensus 163 n~~Lif~v~Ll~i~ 176 (177) T TIGR03516 163 NLPIISTVTLLNIK 176 (177) T ss_pred CCCEEEEEEEEEEC T ss_conf 98599999997331 No 15 >KOG0544 consensus Probab=96.95 E-value=0.0019 Score=46.21 Aligned_cols=84 Identities=15% Similarity=0.246 Sum_probs=65.5 Q ss_pred CCCCCCEEEEEEEEEE-CCCCCCCCCC--CCEEEEECCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCC-----C Q ss_conf 3221111344456664-2653466654--420365178665631034206887887543323444444410003-----8 Q gi|254780897|r 166 ESEIGDKVTVDYTVSV-DNVILEDQSK--KNVQFIVGSADLFSETTEILVGLKTGDQKEIERFFPEDHSIKDLA-----G 237 (473) Q Consensus 166 ~~~~gD~v~id~~~~~-dg~~~~~~~~--~~~~~~lg~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~la-----G 237 (473) -.+.||.|++.|+|+. ||..|..... +-|.|.+|.+..|.|+++++.+|.+|+.-.+++.-+-.|...-+. + T Consensus 16 fpK~Gqtvt~hYtg~L~dG~kfDSs~dr~kPfkf~IGkgeVIkGwdegv~qmsvGekakLti~pd~aYG~~G~p~~IppN 95 (108) T KOG0544 16 FPKKGQTVTVHYTGTLQDGKKFDSSRDRGKPFKFKIGKGEVIKGWDEGVAQMSVGEKAKLTISPDYAYGPRGHPGGIPPN 95 (108) T ss_pred CCCCCCEEEEEEEEEECCCCEECCCCCCCCCEEEEECCCCEEECHHHCCHHCCCCCCCEEEECCCCCCCCCCCCCCCCCC T ss_conf 58899899999986851885830335469983698547226603223210115364010165763122778888755998 Q ss_pred CCEEEEEEHHHH Q ss_conf 730467610110 Q gi|254780897|r 238 KKVRLNFSIKEV 249 (473) Q Consensus 238 k~v~f~v~v~~I 249 (473) -+..|+|.+.+| T Consensus 96 atL~FdVEll~v 107 (108) T KOG0544 96 ATLVFDVELLKV 107 (108) T ss_pred CEEEEEEEEEEC T ss_conf 679999999731 No 16 >PRK00059 prsA peptidylprolyl isomerase; Provisional Probab=96.91 E-value=0.051 Score=34.83 Aligned_cols=140 Identities=8% Similarity=-0.026 Sum_probs=71.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEE--------EEEECCCCCCCC----------C--CCCEEE- Q ss_conf 1211115789999997643100002343322111134445--------666426534666----------5--442036- Q gi|254780897|r 138 ICEVDEKEIDRQMAEIAKNNVAFEVKETESEIGDKVTVDY--------TVSVDNVILEDQ----------S--KKNVQF- 196 (473) Q Consensus 138 ~~~vtd~~Id~~i~~l~~~~~~~~~~~~~~~~gD~v~id~--------~~~~dg~~~~~~----------~--~~~~~~- 196 (473) .+.|||++|..+++.-...|..-.. . .... .+.+.. .-...|+.|... . +-+..+ T Consensus 171 ~v~Vsd~Ei~~yY~~nk~~f~~~p~-~--~~~~-~Il~~~~~~A~~i~~~l~~G~dF~~lAk~~S~D~~s~~~gGdLG~~ 246 (336) T PRK00059 171 DVKVTDKDAQKYYDENKSTFTEKPN-T--VHLS-HILVKTEEEAKKVKKRLDKGEDFAKVAKEVSQDPGSKDKGGDLGDV 246 (336) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCCC-E--EEEE-EEEECCHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCC T ss_conf 6865889999999997887337866-5--8899-8887899999999999877998899999848995310148866640 Q ss_pred EECCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHHHHCCCCCCCCHHHHHHHCCCCHHHHHHH Q ss_conf 51786656310342068878875433234444444100038730467610110024677767478875023655789999 Q gi|254780897|r 197 IVGSADLFSETTEILVGLKTGDQKEIERFFPEDHSIKDLAGKKVRLNFSIKEVFSPLPVVVNNDLAVRLGFESESAMRGL 276 (473) Q Consensus 197 ~lg~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~laGk~v~f~v~v~~I~~~~~pel~def~k~~~~~t~~elk~~ 276 (473) ..+.+.+.|.|.+++.++++|+.-. .+..+.. .+-|+|.++ .++. ..+.++.|+. T Consensus 247 ~~~~~~~~~~f~~a~f~Lk~GeiS~-pV~t~~G-----------yhIIkl~~~---~~~~----------~~~feevk~~ 301 (336) T PRK00059 247 PYSDSNYDKEFMDAAKALKEGEISA-PVKTQFG-----------YHIIKAIKK---KEYP----------VKPFDSVKEQ 301 (336) T ss_pred CCCCCCCCHHHHHHHHHCCCCCCCC-CEEECCE-----------EEEEEEEEE---CCCC----------CCCHHHHHHH T ss_conf 2676655668999997456888688-6066886-----------799999721---5788----------9999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 9988789999777776699999999864003479 Q gi|254780897|r 277 CSQKIKQHSEFLVRQKVKRQILDYISNKYTFDVP 310 (473) Q Consensus 277 ik~~l~~~~~~~~~~~~~~~i~~~L~~~~~~~lP 310 (473) |+..|..+.... ...+.++.|.+..++++- T Consensus 302 I~~~L~~qK~~~----~~~e~~~~l~~~a~I~i~ 331 (336) T PRK00059 302 IKKQLLQEKQSE----VFKKKIEEWKKDLKVKVY 331 (336) T ss_pred HHHHHHHHHHHH----HHHHHHHHHHHHCCEEEC T ss_conf 999999999999----999999999973780754 No 17 >PRK10770 peptidyl-prolyl cis-trans isomerase SurA; Provisional Probab=96.81 E-value=0.059 Score=34.29 Aligned_cols=37 Identities=16% Similarity=0.004 Sum_probs=26.4 Q ss_pred CCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 86656310342068878875433234444444100038 Q gi|254780897|r 200 SADLFSETTEILVGLKTGDQKEIERFFPEDHSIKDLAG 237 (473) Q Consensus 200 ~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~laG 237 (473) .+.+++.|.+++.++++|+-.. .+.-|..||.-.+.. T Consensus 236 ~~~lp~~f~~a~~~l~~Geis~-pi~s~~G~HIikl~d 272 (428) T PRK10770 236 IQELPGIFAQALSTAKKGDIVG-PIRSGVGFHILKVND 272 (428) T ss_pred CCCCHHHHHHHHHHCCCCCCCC-CEECCCCEEEEEEHH T ss_conf 2213078999997168987235-364178359997244 No 18 >PRK11570 peptidyl-prolyl cis-trans isomerase; Provisional Probab=96.77 E-value=0.0023 Score=45.61 Aligned_cols=83 Identities=13% Similarity=0.129 Sum_probs=61.0 Q ss_pred CCCCCEEEEEEEEE-ECCCCCCCCCCCCEEEEECCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCC-----CCCCE Q ss_conf 22111134445666-4265346665442036517866563103420688788754332344444441000-----38730 Q gi|254780897|r 167 SEIGDKVTVDYTVS-VDNVILEDQSKKNVQFIVGSADLFSETTEILVGLKTGDQKEIERFFPEDHSIKDL-----AGKKV 240 (473) Q Consensus 167 ~~~gD~v~id~~~~-~dg~~~~~~~~~~~~~~lg~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~l-----aGk~v 240 (473) ...+|.|.|+|.|+ +||..|+..-..+.++...-+.+|||+.++|..|++|.+..+-+.-.--|...-. .+-+. T Consensus 117 p~~~d~v~v~Y~g~l~dG~vfDss~~rg~p~~~~l~~vi~gw~e~L~~m~~G~k~~~~iP~~LaYG~~g~~~~Ippns~L 196 (206) T PRK11570 117 PARTDRVRVHYTGKLIDGTVFDSSVARGEPAEFPVNGVIPGWIEALTLMPVGSKWELTIPQELAYGERGAGASIPPFSTL 196 (206) T ss_pred CCCCCEEEEEEEEEECCCCEECCCCCCCCCCCEECCCCCHHHHHHHCCCCCCCEEEEEECCHHCCCCCCCCCCCCCCCEE T ss_conf 99999899999999789988513113599750035753678999851889998899998904263878899897959858 Q ss_pred EEEEEHHHH Q ss_conf 467610110 Q gi|254780897|r 241 RLNFSIKEV 249 (473) Q Consensus 241 ~f~v~v~~I 249 (473) .|+|.+.+| T Consensus 197 iFeveLl~i 205 (206) T PRK11570 197 VFEVELLEI 205 (206) T ss_pred EEEEEEEEE T ss_conf 999999976 No 19 >PRK12450 foldase protein PrsA; Reviewed Probab=96.53 E-value=0.02 Score=38.04 Aligned_cols=98 Identities=15% Similarity=0.152 Sum_probs=63.7 Q ss_pred EEEEECCCCCCCCHHHHCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCEEEEEEHHHHHCCCCCCCCHHHHHHHCCCCHHH Q ss_conf 03651786656310342068878875433-23444444410003873046761011002467776747887502365578 Q gi|254780897|r 194 VQFIVGSADLFSETTEILVGLKTGDQKEI-ERFFPEDHSIKDLAGKKVRLNFSIKEVFSPLPVVVNNDLAVRLGFESESA 272 (473) Q Consensus 194 ~~~~lg~~~~~~~f~~~liG~k~Gd~~~~-~~~~P~d~~~~~laGk~v~f~v~v~~I~~~~~pel~def~k~~~~~t~~e 272 (473) ..|.-|++.+.++|.++.-+++.|+--++ .++=|--+. --|+ | |+-...|+. ..+.++ T Consensus 193 ~~fd~~~~~~~~ef~~AAfkLk~GevS~~i~~~~Pvkt~--------~GYh--I--IK~t~k~eK---------~~~~~~ 251 (309) T PRK12450 193 YTFDSGETTLPAEVVRAASGLKEGNRSEIITALDPATSK--------RTYH--I--IKVTKKATK---------KADWKA 251 (309) T ss_pred CCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCC--------CCEE--E--EEEECCCCC---------CCCHHH T ss_conf 677778975628999998716688714422368872015--------7638--9--998157554---------566899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH Q ss_conf 999999887899997777766999999998640034799999 Q gi|254780897|r 273 MRGLCSQKIKQHSEFLVRQKVKRQILDYISNKYTFDVPESLV 314 (473) Q Consensus 273 lk~~ik~~l~~~~~~~~~~~~~~~i~~~L~~~~~~~lPe~lv 314 (473) .++.|++.|..++. .+..+.+.++..+++...+.+...-+ T Consensus 252 ~K~~lk~~i~~~k~--~d~~~~~~vi~k~lKkanVkIkDkdl 291 (309) T PRK12450 252 YQKRLKDIIVTGKL--KDPDFQNKVIAKALDKANVKIKDKAF 291 (309) T ss_pred HHHHHHHHHHHHHH--CCHHHHHHHHHHHHHHCCCCCCCHHH T ss_conf 99999999999871--28799999999999977987675889 No 20 >PRK03095 prsA peptidylprolyl isomerase; Reviewed Probab=96.08 E-value=0.086 Score=33.02 Aligned_cols=88 Identities=9% Similarity=0.021 Sum_probs=60.3 Q ss_pred CCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHHHHCCCCCCCCHHHHHHHCCCCHHHHHHHHH Q ss_conf 78665631034206887887543323444444410003873046761011002467776747887502365578999999 Q gi|254780897|r 199 GSADLFSETTEILVGLKTGDQKEIERFFPEDHSIKDLAGKKVRLNFSIKEVFSPLPVVVNNDLAVRLGFESESAMRGLCS 278 (473) Q Consensus 199 g~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~laGk~v~f~v~v~~I~~~~~pel~def~k~~~~~t~~elk~~ik 278 (473) +.+.+.|.|.+++-+++.|+. +-.+..+-.|+ -|++.+.+. +. .+.++.|..|+ T Consensus 183 ~~~~~~~~f~~aaf~Lk~gei-S~Pvkt~~Gyh-----------IIk~~~k~~---k~-----------~~~~~~K~~lk 236 (287) T PRK03095 183 GPGKMVKEFEDAAYKLKKDEV-SEPVKSQFGYH-----------IIKVTDIKE---PE-----------KSFEQSKADIK 236 (287) T ss_pred CCCCCHHHHHHHHHCCCCCCC-CCCEEECCEEE-----------EEEEEECCC---CC-----------CCHHHHHHHHH T ss_conf 987625999999972899985-88676088359-----------999841478---89-----------99899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH Q ss_conf 887899997777766999999998640034799999 Q gi|254780897|r 279 QKIKQHSEFLVRQKVKRQILDYISNKYTFDVPESLV 314 (473) Q Consensus 279 ~~l~~~~~~~~~~~~~~~i~~~L~~~~~~~lPe~lv 314 (473) ..|..+... +..+.+.++..+++...+.+...-+ T Consensus 237 ~~i~~~k~~--D~~~~~~vi~k~lkkanV~ikD~~l 270 (287) T PRK03095 237 KELVQKKAQ--DGEFMNDLMMKEIKKADVKVDDKDL 270 (287) T ss_pred HHHHHHHHH--CHHHHHHHHHHHHHHCCCCCCCHHH T ss_conf 999999864--7699999999999977986153899 No 21 >PRK01326 prsA foldase protein PrsA; Reviewed Probab=95.84 E-value=0.11 Score=32.17 Aligned_cols=97 Identities=7% Similarity=0.091 Sum_probs=60.3 Q ss_pred EEEECCCCCCCCHHHHCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCEEEEEEHHHHHCCCCCCCCHHHHHHHCCCCHHHH Q ss_conf 3651786656310342068878875433-234444444100038730467610110024677767478875023655789 Q gi|254780897|r 195 QFIVGSADLFSETTEILVGLKTGDQKEI-ERFFPEDHSIKDLAGKKVRLNFSIKEVFSPLPVVVNNDLAVRLGFESESAM 273 (473) Q Consensus 195 ~~~lg~~~~~~~f~~~liG~k~Gd~~~~-~~~~P~d~~~~~laGk~v~f~v~v~~I~~~~~pel~def~k~~~~~t~~el 273 (473) .|.-+...+++.|.++--+++.|+.-+. ..+-|-.|.. --+-|++.+ + |+- -.+.++. T Consensus 191 ~f~~~~~~~~~~f~~AAfkLk~gevS~~v~t~~p~~~q~-------gYhIIK~~~--k---~eK---------~~~~~~~ 249 (310) T PRK01326 191 KFDSGATNVPTDVKKAAFSLDEGGVSDVISVLDPTSYQK-------KYYIVKVTK--K---TEK---------KSDWQDY 249 (310) T ss_pred CCCCCCCCCCHHHHHHHHCCCCCCCCCCEECCCCCCCCC-------EEEEEEEEC--C---CCC---------CCCHHHH T ss_conf 547899807699999987066898466112478765554-------069999736--7---666---------7778999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH Q ss_conf 99999887899997777766999999998640034799999 Q gi|254780897|r 274 RGLCSQKIKQHSEFLVRQKVKRQILDYISNKYTFDVPESLV 314 (473) Q Consensus 274 k~~ik~~l~~~~~~~~~~~~~~~i~~~L~~~~~~~lPe~lv 314 (473) +..|++.|..+. ..+..+.+.++..+++...+.+...-+ T Consensus 250 K~~lk~~i~~~k--~~D~~~~~~vi~k~lKkanVkIkDkdl 288 (310) T PRK01326 250 KKRLKAIILAQK--SKDMNFQNKVIAKALDKANVKIKDKAF 288 (310) T ss_pred HHHHHHHHHHHH--CCCHHHHHHHHHHHHHHCCCCCCCHHH T ss_conf 999999999986--227799999999999976987361889 No 22 >PRK00059 prsA peptidylprolyl isomerase; Provisional Probab=95.40 E-value=0.26 Score=29.12 Aligned_cols=106 Identities=10% Similarity=0.204 Sum_probs=63.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHH-------HCCHHHHHHHHHH Q ss_conf 878999977777669999999986400347999999997767899999975348640011-------0001345677899 Q gi|254780897|r 280 KIKQHSEFLVRQKVKRQILDYISNKYTFDVPESLVENEYNGILQKVRFEMSSANQKSQNV-------DSIDEEDLQYYHM 352 (473) Q Consensus 280 ~l~~~~~~~~~~~~~~~i~~~L~~~~~~~lPe~lv~~e~~~~~~~~~~~l~~~~~~~~~~-------~~~~e~~~~~~~~ 352 (473) .+........++...+.++..-.+...+.+++..|.++++.....+... .+.+.+.+ +.+.+.++ . T Consensus 81 ~~~~~r~qvL~~LI~~~Ll~q~A~~~gl~vsd~ev~~~I~~~~~~~~~~---g~f~~~~y~~~L~~~g~t~~~~~----~ 153 (336) T PRK00059 81 QLKQQKKQILESLITEKVLLQKAKELKLIPSDEELNKEVDKQVNEIKKQ---YNNDEEKFEEALKQTGFTEETFK----E 153 (336) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHH---CCCCHHHHHHHHHHCCCCHHHHH----H T ss_conf 9999999999999999999999998699849999999999999999883---79999999999998699999999----9 Q ss_pred HHHHHHHHHHHHHHHHHH--CCCCCCHHHHHHHHHHHHHHCCCCHH Q ss_conf 999999999999999864--38742899999999999986399989 Q gi|254780897|r 353 LAKRRVLTGIVLGTIGEK--NNIEVTEEEMQSALYQQLGRFPGHEK 396 (473) Q Consensus 353 ~Aek~vk~~lil~~Ia~~--e~I~vs~~Ei~~~i~~~~~~~~~~~~ 396 (473) . +|.+|++.++.+. .++.||++|+..+...-...|...|. T Consensus 154 ~----~r~~l~~~~l~~~v~~~v~Vsd~Ei~~yY~~nk~~f~~~p~ 195 (336) T PRK00059 154 Y----LKNQIIIEKVINEVVKDVKVTDKDAQKYYDENKSTFTEKPN 195 (336) T ss_pred H----HHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCC T ss_conf 9----99999999999997456865889999999997887337866 No 23 >PRK02998 prsA peptidylprolyl isomerase; Reviewed Probab=95.30 E-value=0.15 Score=31.02 Aligned_cols=90 Identities=10% Similarity=0.062 Sum_probs=63.9 Q ss_pred CCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHHHHCCCCCCCCHHHHHHHCCCCHHHHHHHHH Q ss_conf 78665631034206887887543323444444410003873046761011002467776747887502365578999999 Q gi|254780897|r 199 GSADLFSETTEILVGLKTGDQKEIERFFPEDHSIKDLAGKKVRLNFSIKEVFSPLPVVVNNDLAVRLGFESESAMRGLCS 278 (473) Q Consensus 199 g~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~laGk~v~f~v~v~~I~~~~~pel~def~k~~~~~t~~elk~~ik 278 (473) +.+.+.|.|.+++-.+++|+. +-.+..+-.|+ -|+|.+ ++. ..+.++.|+.|+ T Consensus 185 ~~g~~~~~f~~Aaf~L~~G~v-S~PVkt~~GyH-----------IIkv~d--k~~-------------~~~~~~~K~~l~ 237 (283) T PRK02998 185 APGQTVKEFEEAAYKLDAGQV-SEPVKTTYGYH-----------IIKVTD--KKE-------------LKPFDEVKDSIR 237 (283) T ss_pred CCCCCCHHHHHHHHCCCCCCC-CCCEEECCEEE-----------EEEEEC--CCC-------------CCCHHHHHHHHH T ss_conf 999807899999975999994-88778788679-----------999801--688-------------999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 8878999977777669999999986400347999999 Q gi|254780897|r 279 QKIKQHSEFLVRQKVKRQILDYISNKYTFDVPESLVE 315 (473) Q Consensus 279 ~~l~~~~~~~~~~~~~~~i~~~L~~~~~~~lPe~lv~ 315 (473) +.|..+........+.+.++..+++...+.+...-+. T Consensus 238 ~~i~~~k~~d~~~~~~~~vi~~~lkkanV~ikD~~lk 274 (283) T PRK02998 238 KDLEQQRLQDTTGKWKQQVVNDLLKDADIKVNDKEFK 274 (283) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHH T ss_conf 9999999755478999999999998779876638889 No 24 >PRK04405 prsA peptidylprolyl isomerase; Provisional Probab=95.02 E-value=0.19 Score=30.26 Aligned_cols=93 Identities=10% Similarity=0.048 Sum_probs=62.5 Q ss_pred EEEECCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHHHHCCCCCCCCHHHHHHHCCCCHHHHH Q ss_conf 36517866563103420688788754332344444441000387304676101100246777674788750236557899 Q gi|254780897|r 195 QFIVGSADLFSETTEILVGLKTGDQKEIERFFPEDHSIKDLAGKKVRLNFSIKEVFSPLPVVVNNDLAVRLGFESESAMR 274 (473) Q Consensus 195 ~~~lg~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~laGk~v~f~v~v~~I~~~~~pel~def~k~~~~~t~~elk 274 (473) +|.-+.+.+.|+|.++.-.++.|+...-++...-.|+. |++.+ + ++. .+.++.+ T Consensus 193 ~f~~~~~~m~~~f~~aaf~Lk~geis~~PvkT~~GYhI-----------IK~~~--~---~~k----------~~~~~~K 246 (298) T PRK04405 193 AFDSTDTTLDSTFKTAAFKLKNGEYTTTPVKTTYGYEV-----------IKMIK--H---PAK----------GKFTDHK 246 (298) T ss_pred EECCCCCCCCHHHHHHHHCCCCCCCCCCCEECCCCEEE-----------EEEEC--C---CCC----------CCHHHHH T ss_conf 04079986176999999748899816884521673399-----------99834--5---777----------9889999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 99998878999977777669999999986400347999999 Q gi|254780897|r 275 GLCSQKIKQHSEFLVRQKVKRQILDYISNKYTFDVPESLVE 315 (473) Q Consensus 275 ~~ik~~l~~~~~~~~~~~~~~~i~~~L~~~~~~~lPe~lv~ 315 (473) ..|++.|..+... +....+.++..+++...+.+-..-+. T Consensus 247 ~~l~~~i~~~k~~--D~~~~~~vi~k~lk~anVkikD~~lk 285 (298) T PRK04405 247 KELTKQIYAKWAS--DSTVMQNVISKVLKKANVSIKDKDLK 285 (298) T ss_pred HHHHHHHHHHHHH--CHHHHHHHHHHHHHHCCCCCCCHHHH T ss_conf 9999999998730--87999999999999779871568999 No 25 >PRK10788 peptidyl-prolyl cis-trans isomerase (rotamase D); Provisional Probab=93.97 E-value=0.55 Score=26.57 Aligned_cols=76 Identities=13% Similarity=0.062 Sum_probs=40.1 Q ss_pred CCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHHHHCCCCCCCCHHHHHHHCCCCHHHHHHHHH Q ss_conf 78665631034206887887543323444444410003873046761011002467776747887502365578999999 Q gi|254780897|r 199 GSADLFSETTEILVGLKTGDQKEIERFFPEDHSIKDLAGKKVRLNFSIKEVFSPLPVVVNNDLAVRLGFESESAMRGLCS 278 (473) Q Consensus 199 g~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~laGk~v~f~v~v~~I~~~~~pel~def~k~~~~~t~~elk~~ik 278 (473) +.+.|.|+|.++..- .+|+. +-.+..+-.| .-|+|.+|+... ..+.++.|+.|+ T Consensus 319 ~~g~~~~~f~~a~~~-~~Gev-S~pV~t~fG~-----------hiikv~~i~~~~-------------~~~~eevk~eI~ 372 (622) T PRK10788 319 EDATTPDELKNAGLK-EKGQL-SGVIKSSVGF-----------LIVRLDDIQPAK-------------VKPLSEVRDDIA 372 (622) T ss_pred CCCCCHHHHHHHHHH-CCCCC-CCCEEECCCE-----------EEEEEECCCCCC-------------CCCHHHHHHHHH T ss_conf 666442899998874-67986-7847607823-----------899983246677-------------899899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8878999977777669999999 Q gi|254780897|r 279 QKIKQHSEFLVRQKVKRQILDY 300 (473) Q Consensus 279 ~~l~~~~~~~~~~~~~~~i~~~ 300 (473) ..|..+.....-..+.+.+.+. T Consensus 373 ~~l~~~~a~~~~~~~~~~~~d~ 394 (622) T PRK10788 373 AKVKQEKALDAYYALQQKVSDA 394 (622) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999887 No 26 >KOG0543 consensus Probab=92.49 E-value=0.88 Score=24.91 Aligned_cols=85 Identities=11% Similarity=0.078 Sum_probs=59.8 Q ss_pred CCCCEEEEEEEEEECCCCCCCCCCCCEEEEECC-CCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCC--C----CCCCCE Q ss_conf 211113444566642653466654420365178-665631034206887887543323444444410--0----038730 Q gi|254780897|r 168 EIGDKVTVDYTVSVDNVILEDQSKKNVQFIVGS-ADLFSETTEILVGLKTGDQKEIERFFPEDHSIK--D----LAGKKV 240 (473) Q Consensus 168 ~~gD~v~id~~~~~dg~~~~~~~~~~~~~~lg~-~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~--~----laGk~v 240 (473) -.|-.|.+.|.|...|..|.. +.-+|.|.+|. ...+.|++-+|-.|++|+.-.+.+..---|... . --+-++ T Consensus 102 ~~g~~V~v~~~G~~~~~~f~~-~~~~fe~~~Ge~~~vi~Gle~al~~M~~GE~a~v~i~~~YayG~~~~~~p~IPPnA~l 180 (397) T KOG0543 102 NKGAVVKVHLEGELEDGVFDQ-RELRFEFGEGEDIDVIEGLEIALRMMKVGEVALVTIDPKYAYGEEGGEPPLIPPNATL 180 (397) T ss_pred CCCCEEEEEEEEEECCCCEEC-CCCCEEEECCCCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCEE T ss_conf 877679999999988952132-5455478358765546799999976575625899957400237778889998989628 Q ss_pred EEEEEHHHHHCCC Q ss_conf 4676101100246 Q gi|254780897|r 241 RLNFSIKEVFSPL 253 (473) Q Consensus 241 ~f~v~v~~I~~~~ 253 (473) .|+|++++...+. T Consensus 181 ~yEVeL~~f~~~~ 193 (397) T KOG0543 181 LYEVELLDFELKE 193 (397) T ss_pred EEEEEEEEEECCC T ss_conf 9999998630476 No 27 >PRK01885 greB transcription elongation factor GreB; Reviewed Probab=92.07 E-value=0.55 Score=26.57 Aligned_cols=79 Identities=24% Similarity=0.330 Sum_probs=44.9 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEECCCC---------CCCCHHHHCC Q ss_conf 1157899999976431000023433221111344456664265346665442036517866---------5631034206 Q gi|254780897|r 142 DEKEIDRQMAEIAKNNVAFEVKETESEIGDKVTVDYTVSVDNVILEDQSKKNVQFIVGSAD---------LFSETTEILV 212 (473) Q Consensus 142 td~~Id~~i~~l~~~~~~~~~~~~~~~~gD~v~id~~~~~dg~~~~~~~~~~~~~~lg~~~---------~~~~f~~~li 212 (473) ....++..|..|..+.+....++.....+|.|.+--..++.+. .+...-|+ .+|... ..--+-.+|+ T Consensus 57 ~q~~le~rI~~Le~~L~~A~iid~~~~~~~~V~~Gs~V~l~~~---~g~~~~y~-iVG~~Ead~~~g~IS~~SPlg~ALl 132 (159) T PRK01885 57 RLREIDRRVRFLTKRLENLKVVDYSPQQEGKVFFGAWVEIENE---DGEEKRFR-IVGPDEIDGRKGYISIDSPMARALL 132 (159) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEECCCEEEEEEC---CCCEEEEE-EECHHHCCCCCCCCCCCCHHHHHHH T ss_conf 9999999999999998733035766677881562408999866---89769999-9482442666782224698999983 Q ss_pred CCCCCCCCCCCC Q ss_conf 887887543323 Q gi|254780897|r 213 GLKTGDQKEIER 224 (473) Q Consensus 213 G~k~Gd~~~~~~ 224 (473) |+++||++++.+ T Consensus 133 Gkk~Gd~V~v~~ 144 (159) T PRK01885 133 KKEVGDEVTVNT 144 (159) T ss_pred CCCCCCEEEEEC T ss_conf 899999899985 No 28 >TIGR01462 greA transcription elongation factor GreA; InterPro: IPR006359 Bacterial GreA and GreB IPR006358 from INTERPRO promote transcription elongation by stimulating an endogenous, endonucleolytic transcript cleavage activity of the RNA polymerase (RNAP) , allowing RNA transcription to continue past template-encoded arresting sites. GreA and GreB are sequence homologues and have homologues in every known bacterial genome. GreA and GreB stimulate transcript cleavage in different ways; GreA induces cleavage of 3'-RNA fragments 23 nt in length and can only prevent the formation of arrested complexes, whereas GreB induces cleavage of fragments up to 18 nt in length and can rescue pre-existing arrested complexes. The 2.2 Å resolution crystal structure of Escherichia coli GreA comprises an N-terminal antiparallel alpha-helical coiled coil, and a C-terminal globular domain. While the C-terminal domain binds RNAP, the N-terminal coiled coil interacts with the transcript 3' end and is responsible for stimulating the transcript cleavage reaction . Functional differences between GreA and GreB correlate with the distribution of positively charged residues on one face of the N-terminal coiled coil. Because members of the family outside the Proteobacteria resemble GreA more closely than GreB, the GreB clade (IPR006358 from INTERPRO) forms a plausible outgroup and the remainder of the GreA/B family, included in this family, is designated GreA. In the Chlamydias and some spirochetes, the conserved region of these proteins is found as the C-terminal region of a much larger protein. ; GO: 0003677 DNA binding, 0003711 transcription elongation regulator activity, 0006355 regulation of transcription DNA-dependent. Probab=91.45 E-value=0.64 Score=25.99 Aligned_cols=93 Identities=24% Similarity=0.366 Sum_probs=52.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCC--EEEEEEEEEECCCCCCCCCCCCEEE-EEC-------CC--C--CCC Q ss_conf 11115789999997643100002343322111--1344456664265346665442036-517-------86--6--563 Q gi|254780897|r 140 EVDEKEIDRQMAEIAKNNVAFEVKETESEIGD--KVTVDYTVSVDNVILEDQSKKNVQF-IVG-------SA--D--LFS 205 (473) Q Consensus 140 ~vtd~~Id~~i~~l~~~~~~~~~~~~~~~~gD--~v~id~~~~~dg~~~~~~~~~~~~~-~lg-------~~--~--~~~ 205 (473) .=....++.+|..|-..-+.-...+...-.+| .|.+--+.++-+.. +.+...+ +|| .+ . .-- T Consensus 48 Ke~Q~~~E~RI~~le~~l~~A~vi~~~~~~~~gd~V~~G~~V~~~~~~----~~e~~~y~IVG~~EaD~~~giekIS~~S 123 (155) T TIGR01462 48 KEEQGFIEGRIAELEDLLANAEVIDDSKLSTDGDKVSFGSTVTIKDED----TGEEETYTIVGSWEADPKEGIEKISIES 123 (155) T ss_pred HHHHHHHHHHHHHHHHHHHCEEECCHHHCCCCCCEEECCCEEEEEECC----CCCEEEEEEECCCCCCHHCCCEEEECCC T ss_conf 999999988999999887620623601004788377157589999838----9964899983452055330807822677 Q ss_pred CHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHHH Q ss_conf 10342068878875433234444444100038730467610110 Q gi|254780897|r 206 ETTEILVGLKTGDQKEIERFFPEDHSIKDLAGKKVRLNFSIKEV 249 (473) Q Consensus 206 ~f~~~liG~k~Gd~~~~~~~~P~d~~~~~laGk~v~f~v~v~~I 249 (473) -|-.+|+|+|+||.+++.+ |.. .+.++|.+| T Consensus 124 Plg~aL~Gkk~GD~~~v~~--P~g-----------~~~~~Il~i 154 (155) T TIGR01462 124 PLGKALIGKKVGDEVEVQV--PKG-----------EKEYEILKI 154 (155) T ss_pred HHHHHHCCCCCCCEEEEEE--CCC-----------EEEEEEEEE T ss_conf 0235333762587789985--892-----------379988873 No 29 >PRK00226 greA transcription elongation factor GreA; Reviewed Probab=90.92 E-value=0.8 Score=25.22 Aligned_cols=79 Identities=22% Similarity=0.368 Sum_probs=42.1 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEEEEECCCCCCCCCCCCEEE-EECCCC---------CCCCHHHHC Q ss_conf 1157899999976431000023433221111344456664265346665442036-517866---------563103420 Q gi|254780897|r 142 DEKEIDRQMAEIAKNNVAFEVKETESEIGDKVTVDYTVSVDNVILEDQSKKNVQF-IVGSAD---------LFSETTEIL 211 (473) Q Consensus 142 td~~Id~~i~~l~~~~~~~~~~~~~~~~gD~v~id~~~~~dg~~~~~~~~~~~~~-~lg~~~---------~~~~f~~~l 211 (473) ....++..|..|....+....++.+....|.|.+--..++.+. . ..+...| .+|... ..--+-.+| T Consensus 55 ~q~~~e~rI~~Le~~L~~A~iid~~~~~~~~V~~Gs~V~l~d~--~--~g~~~~y~iVG~~Ead~~~g~IS~~SPlg~AL 130 (157) T PRK00226 55 EQGFIEGRIRELEDKLSNAEVIDPTKPSSGKVKFGATVTIYDL--D--TDEEETYQIVGDDEADIKEGKISVNSPIARAL 130 (157) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEECCCEEEEEEC--C--CCCEEEEEEECCCCCCCCCCEECCCCHHHHHH T ss_conf 9999999999999999850145866577884206758999985--8--99679999913322456676524779899998 Q ss_pred CCCCCCCCCCCCC Q ss_conf 6887887543323 Q gi|254780897|r 212 VGLKTGDQKEIER 224 (473) Q Consensus 212 iG~k~Gd~~~~~~ 224 (473) +|+++||++++.. T Consensus 131 lGk~~Gd~V~v~~ 143 (157) T PRK00226 131 IGKKVGDTVTVKA 143 (157) T ss_pred HCCCCCCEEEEEC T ss_conf 2899999899984 No 30 >cd06552 ASCH_yqfb_like ASC-1 homology domain, subfamily similar to Escherichia coli Yqfb. The ASCH domain, a small beta-barrel domain found in all three kingdoms of life, resembles the RNA-binding PUA domain and may also interact with RNA. ASCH has been proposed to function as an RNA-binding domain during coactivation, RNA-processing and the regulation of prokaryotic translation. Probab=89.44 E-value=0.51 Score=26.78 Aligned_cols=43 Identities=14% Similarity=0.061 Sum_probs=39.4 Q ss_pred CEEEEEEHHHHHCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 3046761011002467776747887502365578999999887 Q gi|254780897|r 239 KVRLNFSIKEVFSPLPVVVNNDLAVRLGFESESAMRGLCSQKI 281 (473) Q Consensus 239 ~v~f~v~v~~I~~~~~pel~def~k~~~~~t~~elk~~ik~~l 281 (473) ...+.+.|.+|....+.+|||+-|.+=|+.|+++|++.|++.. T Consensus 42 ~~~~~~~I~sV~~~~l~eLt~edA~~egf~sl~eL~~~l~~iY 84 (100) T cd06552 42 RIFGEAEITSVEEKTLGELTDEDARQEGFPSLEELKEALKEIY 84 (100) T ss_pred EEEEEEEEEEEEEECHHHCCHHHHHHHCCCCHHHHHHHHHHHC T ss_conf 8989999999999854659989998828836999999999987 No 31 >PRK03002 prsA peptidylprolyl isomerase; Reviewed Probab=87.46 E-value=2.1 Score=21.86 Aligned_cols=90 Identities=11% Similarity=0.141 Sum_probs=60.6 Q ss_pred ECCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHHHHCCCCCCCCHHHHHHHCCCCHHHHHHHH Q ss_conf 17866563103420688788754332344444441000387304676101100246777674788750236557899999 Q gi|254780897|r 198 VGSADLFSETTEILVGLKTGDQKEIERFFPEDHSIKDLAGKKVRLNFSIKEVFSPLPVVVNNDLAVRLGFESESAMRGLC 277 (473) Q Consensus 198 lg~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~laGk~v~f~v~v~~I~~~~~pel~def~k~~~~~t~~elk~~i 277 (473) .+.+.+.|.|.+++-++++|+. +=.+..+..|+. |+|.+ ++. ..+.++.|..| T Consensus 186 ~~~~~~~~ef~~aaf~L~~Gei-S~pv~t~~GyhI-----------Ik~~d--k~~-------------~~~~~~~k~~i 238 (285) T PRK03002 186 FNSGRMAPEFETAAYKLKVGQI-SNPVKSPNGYHI-----------IKLTD--KKD-------------LKPYDEVKDSI 238 (285) T ss_pred CCCCCCCHHHHHHHHHCCCCCC-CCCEEECCEEEE-----------EEEEE--CCC-------------CCCHHHHHHHH T ss_conf 5898766799999982899886-878774781699-----------99841--688-------------99989999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 988789999777776699999999864003479999999 Q gi|254780897|r 278 SQKIKQHSEFLVRQKVKRQILDYISNKYTFDVPESLVEN 316 (473) Q Consensus 278 k~~l~~~~~~~~~~~~~~~i~~~L~~~~~~~lPe~lv~~ 316 (473) ++.|..+.. .+..+.+.++..+++...+.+.+.-+.. T Consensus 239 ~~~i~~~k~--~d~~~~~~~i~k~~kkanV~ikD~~lk~ 275 (285) T PRK03002 239 RKNLEEERI--ADPIFGKKLLQKELKKANIKINDSELKD 275 (285) T ss_pred HHHHHHHHH--HCHHHHHHHHHHHHHHCCCCCCCHHHHH T ss_conf 999999986--6779999999999987898436478888 No 32 >PRK05892 nucleoside diphosphate kinase regulator; Provisional Probab=84.00 E-value=3 Score=20.63 Aligned_cols=33 Identities=18% Similarity=0.228 Sum_probs=23.0 Q ss_pred CHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHHHHC Q ss_conf 1034206887887543323444444410003873046761011002 Q gi|254780897|r 206 ETTEILVGLKTGDQKEIERFFPEDHSIKDLAGKKVRLNFSIKEVFS 251 (473) Q Consensus 206 ~f~~~liG~k~Gd~~~~~~~~P~d~~~~~laGk~v~f~v~v~~I~~ 251 (473) -+-.+|+|+++||++++.. |.. .+.++|.+|+- T Consensus 127 PlgrALlGk~~GD~V~v~t--P~G-----------~~~~~Il~Ik~ 159 (161) T PRK05892 127 PLGQALAGHQPGDTVTYST--PQG-----------PNQVQLLAVKL 159 (161) T ss_pred HHHHHHHCCCCCCEEEEEC--CCC-----------CEEEEEEEEEC T ss_conf 7899984898999899975--997-----------57999999967 No 33 >PRK12318 methionine aminopeptidase; Provisional Probab=77.45 E-value=3.5 Score=20.08 Aligned_cols=21 Identities=24% Similarity=0.396 Sum_probs=9.7 Q ss_pred CCCCCCCCCEEEEEEEEEECC Q ss_conf 343322111134445666426 Q gi|254780897|r 163 KETESEIGDKVTVDYTVSVDN 183 (473) Q Consensus 163 ~~~~~~~gD~v~id~~~~~dg 183 (473) .+++.+.||.|.||+-+..+| T Consensus 122 ~~~~Lk~GDlV~iD~G~~y~G 142 (291) T PRK12318 122 NDIPLKNGDIMNIDVSCIVDG 142 (291) T ss_pred CCCCCCCCCEEEEECCEEECC T ss_conf 986167999899956786898 No 34 >PRK06342 transcription elongation factor regulatory protein; Validated Probab=76.72 E-value=5 Score=18.87 Aligned_cols=72 Identities=21% Similarity=0.269 Sum_probs=43.7 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEEEEECCCCCCCCCCCCEEE-EECCCC---------CCCCHHHHCCCC Q ss_conf 7899999976431000023433221111344456664265346665442036-517866---------563103420688 Q gi|254780897|r 145 EIDRQMAEIAKNNVAFEVKETESEIGDKVTVDYTVSVDNVILEDQSKKNVQF-IVGSAD---------LFSETTEILVGL 214 (473) Q Consensus 145 ~Id~~i~~l~~~~~~~~~~~~~~~~gD~v~id~~~~~dg~~~~~~~~~~~~~-~lg~~~---------~~~~f~~~liG~ 214 (473) .++..|..+.++..+...++ +....|.|.+--+.++.++ .+.. .+| ++|.+. ..--+-.+|+|+ T Consensus 68 ~i~R~iRyl~~Rl~~A~vv~-~~~~~d~V~FGatVt~~~e---~g~~--~t~~IVG~dEaD~~~g~IS~~SPlaraLlgk 141 (160) T PRK06342 68 RPLRDLRYLAARVRTAQLMP-DPPSTDVVAFGSTVTFSRD---DGRV--QTYRIVGEDEADPKAGSISYVSPVARLLMGK 141 (160) T ss_pred HHHHHHHHHHHHHHCCEEEC-CCCCCCEEEEEEEEEEEEC---CCCE--EEEEEECHHHCCCCCCEEECCCHHHHHHHCC T ss_conf 26688999999872178848-9998998998359999928---9999--9999968344473589675359899998389 Q ss_pred CCCCCCCC Q ss_conf 78875433 Q gi|254780897|r 215 KTGDQKEI 222 (473) Q Consensus 215 k~Gd~~~~ 222 (473) ++||.+++ T Consensus 142 ~vGD~V~v 149 (160) T PRK06342 142 AVGDVVGV 149 (160) T ss_pred CCCCEEEE T ss_conf 99998998 No 35 >PRK12897 methionine aminopeptidase; Reviewed Probab=75.73 E-value=4.3 Score=19.39 Aligned_cols=15 Identities=33% Similarity=0.446 Sum_probs=5.4 Q ss_pred CCCCCCCCEEEEEEE Q ss_conf 433221111344456 Q gi|254780897|r 164 ETESEIGDKVTVDYT 178 (473) Q Consensus 164 ~~~~~~gD~v~id~~ 178 (473) +++.+.||.|.||+- T Consensus 82 ~~~l~~GD~V~iD~G 96 (248) T PRK12897 82 DVPLTEGDIVTIDMV 96 (248) T ss_pred CCCCCCCCEEEEECC T ss_conf 851458999999574 No 36 >TIGR00967 3a0501s007 preprotein translocase, SecY subunit; InterPro: IPR002208 Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component. . From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins. The translocase protein subunits are encoded on the bacterial chromosome. The translocase itself comprises 7 proteins, including a chaperone protein (SecB), an ATPase (SecA), an integral membrane complex (SecCY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) . The chaperone protein SecB is a highly acidic homotetrameric protein that exists as a "dimer of dimers" in the bacterial cytoplasm. SecB maintains preproteins in an unfolded state after translation, and targets these to the peripheral membrane protein ATPase SecA for secretion . The structure of the Escherichia coli SecYEG assembly revealed a sandwich of two membranes interacting through the extensive cytoplasmic domains . Each membrane is composed of dimers of SecYEG. The monomeric complex contains 15 transmembrane helices. The eubacterial secY protein interacts with the signal sequences of secretory proteins as well as with two other components of the protein translocation system: secA and secE. SecY is an integral plasma membrane protein of 419 to 492 amino acid residues that apparently contains 10 transmembrane (TM), 6 cytoplasmic and 5 periplasmic regions. Cytoplasmic regions 2 and 3, and TM domains 1, 2, 4, 5, 7 and 10 are well conserved: the conserved cytoplasmic regions are believed to interact with cytoplasmic secretion factors, while the TM domains may participate in protein export . Homologs of secY are found in archaebacteria . SecY is also encoded in the chloroplast genome of some algae where it could be involved in a prokaryotic-like protein export system across the two membranes of the chloroplast endoplasmic reticulum (CER) which is present in chromophyte and cryptophyte algae.; GO: 0015450 P-P-bond-hydrolysis-driven protein transmembrane transporter activity, 0009306 protein secretion, 0016020 membrane. Probab=74.49 E-value=4 Score=19.64 Aligned_cols=39 Identities=21% Similarity=0.388 Sum_probs=28.3 Q ss_pred EEEECHHHHHHHHHHHHHHHHHCCCCCCCCCCCC-CHHHHHHHHHH Q ss_conf 9985489999999999999875073898589704-88999986108 Q gi|254780897|r 16 DVIIPSNRLTDSFNERIEDIRSKANIKGFRPGKV-PLSHIKSLYGK 60 (473) Q Consensus 16 ~v~v~~~~~~~~~~~~~~~~~~~~~ipGFRkGKv-P~~ii~~~~g~ 60 (473) ++.+.++++.+.+++ ...-|||+||||- =.+.+++...+ T Consensus 388 t~~~NP~~~A~Nlkk------~G~~IPG~RpG~~~T~~yl~~~l~R 427 (489) T TIGR00967 388 TLQLNPEDLAKNLKK------QGMFIPGIRPGKEVTEKYLKRVLNR 427 (489) T ss_pred EEECCHHHHHHHHHH------CCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 034488999999965------3864386458657999999766335 No 37 >pfam01272 GreA_GreB Transcription elongation factor, GreA/GreB, C-term. This domain has a FKBP-like fold. Probab=73.30 E-value=1.9 Score=22.31 Aligned_cols=24 Identities=33% Similarity=0.574 Sum_probs=18.1 Q ss_pred CCCCHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 56310342068878875433234444 Q gi|254780897|r 203 LFSETTEILVGLKTGDQKEIERFFPE 228 (473) Q Consensus 203 ~~~~f~~~liG~k~Gd~~~~~~~~P~ 228 (473) ..--+-.+|+|+++||++++++ |. T Consensus 42 ~~SPlg~ALlG~~~Gd~v~~~~--p~ 65 (77) T pfam01272 42 VLSPLGRALLGKKVGDEVEVTA--PS 65 (77) T ss_pred CCCHHHHHHHCCCCCCEEEEEC--CC T ss_conf 8898899973999999999996--99 No 38 >PRK09204 secY preprotein translocase subunit SecY; Reviewed Probab=72.83 E-value=4.7 Score=19.08 Aligned_cols=11 Identities=18% Similarity=0.304 Sum_probs=5.2 Q ss_pred HHHCCCCCCCC Q ss_conf 87507389858 Q gi|254780897|r 35 IRSKANIKGFR 45 (473) Q Consensus 35 ~~~~~~ipGFR 45 (473) +...+.+||-. T Consensus 30 lG~~IplPGid 40 (427) T PRK09204 30 LGSYIPVPGID 40 (427) T ss_pred HHCCCCCCCCC T ss_conf 84358889859 No 39 >COG2411 Uncharacterized conserved protein [Function unknown] Probab=72.14 E-value=4.6 Score=19.15 Aligned_cols=59 Identities=20% Similarity=0.130 Sum_probs=46.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHHHHCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 06887887543323444444410003873046761011002467776747887502365578999999887 Q gi|254780897|r 211 LVGLKTGDQKEIERFFPEDHSIKDLAGKKVRLNFSIKEVFSPLPVVVNNDLAVRLGFESESAMRGLCSQKI 281 (473) Q Consensus 211 liG~k~Gd~~~~~~~~P~d~~~~~laGk~v~f~v~v~~I~~~~~pel~def~k~~~~~t~~elk~~ik~~l 281 (473) .++.++|+.+-+.+ |-.+.-..+|+.|..+...|||||=|..=||.|.++|-+.++... T Consensus 32 ~~~~k~g~eVyIh~------------~g~i~gkAkIk~V~~KrV~ELTdEDAr~DGF~sreELi~~LkriY 90 (188) T COG2411 32 KIVLKPGSEVYIHS------------GGYIIGKAKIKKVKTKRVSELTDEDARLDGFRSREELIEELKRIY 90 (188) T ss_pred CCCCCCCCEEEEEE------------CCEEEEEEEEEEEEEEEHHHHHHHHHHHCCCCCHHHHHHHHHHHC T ss_conf 41379878899997------------898878989888787527651378887513225999999999982 No 40 >TIGR00500 met_pdase_I methionine aminopeptidase, type I; InterPro: IPR002467 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M24 (clan MG), subfamily M24A. Methionine aminopeptidase (3.4.11.18 from EC) (MAP) is responsible for the removal of the amino-terminal (initiator) methionine from nascent eukaryotic cytosolic and cytoplasmic prokaryotic proteins if the penultimate amino acid is small and uncharged. All MAP studied to date are monomeric proteins that require cobalt ions for activity. Two subfamilies of MAP enzymes are known to exist , . While being evolutionary related, they only share a limited amount of sequence similarity mostly clustered around the residues shown, in the Escherichia coli MAP , to be involved in cobalt-binding. The first family consists of enzymes from prokaryotes as well as eukaryotic MAP-1, while the second group (IPR002468 from INTERPRO) is made up of archaeal MAP and eukaryotic MAP-2. ; GO: 0004239 methionyl aminopeptidase activity, 0006508 proteolysis. Probab=67.05 E-value=4.9 Score=18.94 Aligned_cols=54 Identities=20% Similarity=0.302 Sum_probs=33.1 Q ss_pred CCC-CCCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEECC-CCC-----C----CCHHHHCC-CCCCCC Q ss_conf 023-4332211113444566642653466654420365178-665-----6----31034206-887887 Q gi|254780897|r 161 EVK-ETESEIGDKVTVDYTVSVDNVILEDQSKKNVQFIVGS-ADL-----F----SETTEILV-GLKTGD 218 (473) Q Consensus 161 ~~~-~~~~~~gD~v~id~~~~~dg~~~~~~~~~~~~~~lg~-~~~-----~----~~f~~~li-G~k~Gd 218 (473) .|. ++..++||.|.||+....+| +-+.+ ..+|.+|+ .+- + +.|..+|- -+|+|- T Consensus 88 iP~~~~vLKdGDivniD~g~~~~G--y~gD~--a~Tf~~G~n~~~~~~KLl~~t~esl~~aI~P~~kpG~ 153 (265) T TIGR00500 88 IPDKKKVLKDGDIVNIDVGVIYDG--YYGDT--AITFLVGKNVSPEAEKLLEATEESLYKAIEPEIKPGK 153 (265) T ss_pred CCCCCCEEECCCEEEEEEEEEECC--EEECC--CCEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 388472651367787557788603--02113--2313413877876788999999999852231014798 No 41 >pfam00344 SecY eubacterial secY protein. Probab=66.35 E-value=5.3 Score=18.62 Aligned_cols=11 Identities=9% Similarity=0.286 Sum_probs=4.2 Q ss_pred CCCHHHHHHHH Q ss_conf 42899999999 Q gi|254780897|r 374 EVTEEEMQSAL 384 (473) Q Consensus 374 ~vs~~Ei~~~i 384 (473) ..+++|+-+.+ T Consensus 262 ~~~p~~iAe~l 272 (340) T pfam00344 262 GFNPKDIAENL 272 (340) T ss_pred CCCHHHHHHHH T ss_conf 57889999999 No 42 >PRK06330 transcript cleavage factor/unknown domain fusion protein; Validated Probab=66.14 E-value=8.2 Score=17.13 Aligned_cols=15 Identities=0% Similarity=-0.356 Sum_probs=6.2 Q ss_pred CCCCCCCCHHHHHHH Q ss_conf 246777674788750 Q gi|254780897|r 251 SPLPVVVNNDLAVRL 265 (473) Q Consensus 251 ~~~~pel~def~k~~ 265 (473) +...|+.-+.|+.-+ T Consensus 570 k~~~p~w~~~f~~Lf 584 (906) T PRK06330 570 KKHLPNWEEYYKGLF 584 (906) T ss_pred HHHCCCHHHHHHHHH T ss_conf 984885999999984 No 43 >cd00986 PDZ_LON_protease PDZ domain of ATP-dependent LON serine proteases. Most PDZ domains bind C-terminal polypeptides, though binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this bacterial subfamily of protease-associated PDZ domains a C-terminal beta-strand is thought to form the peptide-binding groove base, a circular permutation with respect to PDZ domains found in Eumetazoan signaling proteins. Probab=66.05 E-value=4.4 Score=19.30 Aligned_cols=36 Identities=17% Similarity=0.193 Sum_probs=27.3 Q ss_pred CCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHH Q ss_conf 631034206887887543323444444410003873046761011 Q gi|254780897|r 204 FSETTEILVGLKTGDQKEIERFFPEDHSIKDLAGKKVRLNFSIKE 248 (473) Q Consensus 204 ~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~laGk~v~f~v~v~~ 248 (473) ..+|...|-++++||++++++.- .|+..+++|++.. T Consensus 42 ~~~l~~~i~~~~~Gd~V~l~v~R---------~gk~~~~~vtL~~ 77 (79) T cd00986 42 AEELIDYIQSKKEGDTVKLKVKR---------EEKELPEDLILKT 77 (79) T ss_pred HHHHHHHHHCCCCCCEEEEEEEE---------CCEEEEEEEEEEC T ss_conf 99999999659999989999999---------9999999999724 No 44 >PRK05753 nucleoside diphosphate kinase regulator; Provisional Probab=63.02 E-value=9.3 Score=16.71 Aligned_cols=68 Identities=22% Similarity=0.427 Sum_probs=34.8 Q ss_pred CCCCEEEEEEEEEECCCCCCCCCCCCEEEEE--------CCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2111134445666426534666544203651--------78665631034206887887543323444444410003873 Q gi|254780897|r 168 EIGDKVTVDYTVSVDNVILEDQSKKNVQFIV--------GSADLFSETTEILVGLKTGDQKEIERFFPEDHSIKDLAGKK 239 (473) Q Consensus 168 ~~gD~v~id~~~~~dg~~~~~~~~~~~~~~l--------g~~~~~~~f~~~liG~k~Gd~~~~~~~~P~d~~~~~laGk~ 239 (473) -..|.|.++-...+.+. ..+....++++- |.-+..--+-.+|+|+++||.+++. .|. |+. T Consensus 50 ip~dvV~~gS~V~~~d~--~~g~~~~~~LV~p~ead~~~gkISv~SPiG~ALlG~~~Gd~i~~~--~P~--------G~~ 117 (135) T PRK05753 50 MPADVVTMNSRVRFRDL--SSGEERVVTLVYPADADISNGQVSVLAPVGAALLGLSVGQSIDWP--LPG--------GKE 117 (135) T ss_pred CCCCEEEECCEEEEEEC--CCCCEEEEEEECHHHCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE--CCC--------CCE T ss_conf 79998998989999988--999899999949026494479201016999997389999999998--899--------977 Q ss_pred EEEEEEHHHH Q ss_conf 0467610110 Q gi|254780897|r 240 VRLNFSIKEV 249 (473) Q Consensus 240 v~f~v~v~~I 249 (473) .. ++|.+| T Consensus 118 ~~--l~I~~V 125 (135) T PRK05753 118 TH--LTVLEV 125 (135) T ss_pred EE--EEEEEE T ss_conf 89--999999 No 45 >COG0201 SecY Preprotein translocase subunit SecY [Intracellular trafficking and secretion] Probab=55.36 E-value=12 Score=15.76 Aligned_cols=14 Identities=0% Similarity=0.022 Sum_probs=6.4 Q ss_pred HHHHHHHHHCCCCC Q ss_conf 99999975348640 Q gi|254780897|r 323 QKVRFEMSSANQKS 336 (473) Q Consensus 323 ~~~~~~l~~~~~~~ 336 (473) +++.++++.+|..- T Consensus 338 ~~~A~~lkksG~~I 351 (436) T COG0201 338 EEIAENLKKSGGFI 351 (436) T ss_pred HHHHHHHHHCCCCC T ss_conf 99999999769928 No 46 >PRK08568 preprotein translocase subunit SecY; Reviewed Probab=54.62 E-value=12 Score=15.67 Aligned_cols=14 Identities=14% Similarity=0.152 Sum_probs=7.3 Q ss_pred HHHHCCCCCCCCCC Q ss_conf 98750738985897 Q gi|254780897|r 34 DIRSKANIKGFRPG 47 (473) Q Consensus 34 ~~~~~~~ipGFRkG 47 (473) .+...+-+||=.++ T Consensus 41 rigs~IP~pGi~~~ 54 (466) T PRK08568 41 FILTNIPLYGISQA 54 (466) T ss_pred HHHCCCCCCCCCCC T ss_conf 98057889997957 No 47 >TIGR02093 P_ylase glycogen/starch/alpha-glucan phosphorylases; InterPro: IPR011833 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described . This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site . The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Proteins in this entry are glycosyltransferases with phosphorylase activities. Members use phosphate to break alpha 1,4 linkages between pairs of glucose residues at the end of long glucose polymers, releasing alpha-D-glucose 1-phosphate. The nomenclature convention is to preface the name according to the natural substrate, as in glycogen phosphorylase, starch phosphorylase, maltodextrin phosphorylase, etc. The main role of glycogen phosphorylase (GPase) is to provide phosphorylated glucose molecules (G-1-P) . GPase is a highly regulated allosteric enzyme. The net effect of the regulatory site allows the enzyme to operate at a variety of rates; the enzyme is not simply regulated as "on" or "off", but rather it can be thought of being set to operate at an ideal rate based on changing conditions at in the cell. The most important allosteric effector is the phosphate molecule covalently attached to Ser14. This switches GPase from the b (inactive) state to the a (active) state. Upon phosphorylation, GPase attains about 800f its Vmax. When the enzyme is not phosphorylated, GPase activity is practically non-existent at low AMP levels . There is some apparent controversy as to the structure of GPase. All sources agree that the enzyme is multimeric, but there is apparent controversy as to the enzyme being a tetramer or a dimer. Apparently, GPase (in the a form) forms tetramers in the crystal form. The consensus seems to be that 'regardless of the a or b form, GPase functions as a dimer in vivo . The GPase monomer is best described as consisting of two domains, an N-terminal domain and a C-terminal domain . The C-terminal domain is often referred to as the catalytic domain. It consists of a beta-sheet core surrounded by layers of helical segments . The vitamin cofactor pyridoxal phosphate (PLP) is covalently attached to the amino acid backbone. The N-terminal domain also consists of a central beta-sheet core and is surrounded by layers of helical segments. The N-terminal domain contains different allosteric effector sites to regulate the enzyme. Bacterial phosphorylases follow the same catalytic mechanisms as their plant and animal counterparts, but differ considerably in terms of their substrate specificity and regulation. The catalytic domains are highly conserved while the regulatory sites are only poorly conserved. For maltodextrin phosphorylase from Escherichia coli the physiological role of the enzyme in the utilisation of maltidextrins is known in detail; that of all the other bacterial phosphorylases is still unclear. Roles in regulatuon of endogenous glycogen metabolism in periods of starvation, and sporulation, stress response or quick adaptation to changing environments are possible .; GO: 0004645 phosphorylase activity, 0030170 pyridoxal phosphate binding, 0005975 carbohydrate metabolic process. Probab=54.37 E-value=13 Score=15.64 Aligned_cols=92 Identities=15% Similarity=0.218 Sum_probs=57.4 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEEEEEHHHH-----HCCCCCCCCHHHHHHHC-C---CCHHHHH---HHHHHHHHHHHH Q ss_conf 5433234444444100038730467610110-----02467776747887502-3---6557899---999988789999 Q gi|254780897|r 219 QKEIERFFPEDHSIKDLAGKKVRLNFSIKEV-----FSPLPVVVNNDLAVRLG-F---ESESAMR---GLCSQKIKQHSE 286 (473) Q Consensus 219 ~~~~~~~~P~d~~~~~laGk~v~f~v~v~~I-----~~~~~pel~def~k~~~-~---~t~~elk---~~ik~~l~~~~~ 286 (473) =+.|-..||+=|.++. +.| -..-=|+|.+-+-+.+| - ++++.|+ ....+.--...= T Consensus 454 l~dF~~LyP~KF~NKT------------NGiTPRRWl~~aNP~LS~l~~e~iGt~~W~~~l~~L~~L~~ya~D~~f~~~~ 521 (822) T TIGR02093 454 LKDFYELYPEKFNNKT------------NGITPRRWLALANPGLSALLTETIGTDEWLTDLDLLKKLEPYADDSEFLEKF 521 (822) T ss_pred CHHHHHCCCCCCCCCC------------CCCHHHHHHHHHCHHHHHHHHHHHCCCHHHCCHHHHHHHHHCCCCHHHHHHH T ss_conf 4034521788613655------------8730678997518136899987624672221479885111004898999999 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCC-CHHHHHHHHHHHH Q ss_conf 77777669999999986400347-9999999977678 Q gi|254780897|r 287 FLVRQKVKRQILDYISNKYTFDV-PESLVENEYNGIL 322 (473) Q Consensus 287 ~~~~~~~~~~i~~~L~~~~~~~l-Pe~lv~~e~~~~~ 322 (473) ...+..-+..+.+.+.+.+.+.| |.+|.+=|+.||- T Consensus 522 ~~vK~~NK~~LA~yi~~~~GV~~dP~siFDVQvKRlH 558 (822) T TIGR02093 522 AQVKQANKQRLAAYIKEETGVEVDPNSIFDVQVKRLH 558 (822) T ss_pred HHHHHHHHHHHHHHHHHHCCCEECCCCCCCCCHHHHH T ss_conf 9999999999999999777927686655321123656 No 48 >CHL00161 secY preprotein translocase subunit SecY; Validated Probab=54.01 E-value=13 Score=15.60 Aligned_cols=12 Identities=8% Similarity=0.218 Sum_probs=6.4 Q ss_pred HHHHCCCCCCCC Q ss_conf 987507389858 Q gi|254780897|r 34 DIRSKANIKGFR 45 (473) Q Consensus 34 ~~~~~~~ipGFR 45 (473) .+...+.+||.. T Consensus 25 rlG~~IpiPGid 36 (417) T CHL00161 25 RLGTFIPVPGID 36 (417) T ss_pred HHHCCCCCCCCC T ss_conf 971518889839 No 49 >TIGR01570 A_thal_3588 uncharacterized plant-specific domain TIGR01570; InterPro: IPR006460 This family of hypothetical plant proteins are defined by a region of about 170 amino acids found at the C terminus. These proteins have highly divergent N-terminal regions rich in low complexity sequence. PSI-BLAST reveals no clear similarity to any characterised protein. At least 12 distinct members are found in Arabidopsis thaliana.. Probab=52.07 E-value=4.1 Score=19.57 Aligned_cols=18 Identities=33% Similarity=0.545 Sum_probs=12.5 Q ss_pred CCCCCCCCCCCHHHHHHH Q ss_conf 389858970488999986 Q gi|254780897|r 40 NIKGFRPGKVPLSHIKSL 57 (473) Q Consensus 40 ~ipGFRkGKvP~~ii~~~ 57 (473) +|=|||||||-..|=..+ T Consensus 4 TiFG~RKG~V~f~iQ~~~ 21 (172) T TIGR01570 4 TIFGYRKGRVTFSIQEDR 21 (172) T ss_pred EEEEEEECEEEEEEECCC T ss_conf 210002034889886798 No 50 >TIGR02932 vnfK_nitrog V-containing nitrogenase, beta subunit; InterPro: IPR014281 Nitrogenase, also called dinitrogenase, is the enzyme which catalyses the conversion of molecular nitrogen to ammonia (biological nitrogen fixation). The most widespread and most efficient nitrogenase contains a molybdenum cofactor. This protein family, VnfK, represents the beta subunit of the vanadium-containing V nitrogenase. It is homologous to NifK and AnfK, of the molybdenum-containing and the iron-only types, respectively.; GO: 0016163 nitrogenase activity, 0051536 iron-sulfur cluster binding, 0009399 nitrogen fixation, 0016613 vanadium-iron nitrogenase complex. Probab=49.51 E-value=13 Score=15.62 Aligned_cols=53 Identities=11% Similarity=0.134 Sum_probs=25.8 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHHHHCCCCCCCCHHHHHHHCCCCHHHHHHH Q ss_conf 878875433234444444100038730467610110024677767478875023655789999 Q gi|254780897|r 214 LKTGDQKEIERFFPEDHSIKDLAGKKVRLNFSIKEVFSPLPVVVNNDLAVRLGFESESAMRGL 276 (473) Q Consensus 214 ~k~Gd~~~~~~~~P~d~~~~~laGk~v~f~v~v~~I~~~~~pel~def~k~~~~~t~~elk~~ 276 (473) ..+||...++-.|.+ ..|.-.|-+ +-....-|-+-|.-....|-.|+++.... T Consensus 178 vnPGdv~~~k~~~~e---------m~v~~~i~~-dte~f~~P~~P~k~~~~~G~tt~edi~~~ 230 (460) T TIGR02932 178 VNPGDVVLLKHILSE---------MGVDANILL-DTEDFDSPILPDKSIFTYGRTTVEDIADS 230 (460) T ss_pred CCCCHHHHHHHHHHH---------CCCCCEEEE-ECCCCCCCCCCCCCHHHCCCHHHHHHHHH T ss_conf 476248999999976---------066604786-31246788787621223043048888877 No 51 >COG0782 Uncharacterized conserved protein, YhbC family [Function unknown] Probab=47.89 E-value=16 Score=14.90 Aligned_cols=19 Identities=32% Similarity=0.478 Sum_probs=15.5 Q ss_pred CHHHHCCCCCCCCCCCCCC Q ss_conf 1034206887887543323 Q gi|254780897|r 206 ETTEILVGLKTGDQKEIER 224 (473) Q Consensus 206 ~f~~~liG~k~Gd~~~~~~ 224 (473) -+-.+|+|+++||.+.+.. T Consensus 118 Pig~aLlGk~vGd~v~v~~ 136 (151) T COG0782 118 PLGRALLGKKVGDTVEVNT 136 (151) T ss_pred HHHHHHHCCCCCCEEEEEC T ss_conf 7789972899999899835 No 52 >TIGR02058 lin0512_fam conserved hypothetical protein; InterPro: IPR011719 This family consists of few members, broadly distributed. It occurs so far in several Firmicutes (twice in Oceanobacillus), one Cyanobacterium, one alpha Proteobacterium, and (with a long prefix) in plants. The function is unknown. The alignment includes a perfectly conserved motif GxGxDxHG near the N-terminus.. Probab=45.53 E-value=14 Score=15.28 Aligned_cols=40 Identities=15% Similarity=0.316 Sum_probs=28.5 Q ss_pred EECHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHH Q ss_conf 8548999999999999987507389858970488999986 Q gi|254780897|r 18 IIPSNRLTDSFNERIEDIRSKANIKGFRPGKVPLSHIKSL 57 (473) Q Consensus 18 ~v~~~~~~~~~~~~~~~~~~~~~ipGFRkGKvP~~ii~~~ 57 (473) .+-..++..+-.+|++..-.+-.+||||.|-+|.+-+..+ T Consensus 12 D~HGQ~~T~AA~rAv~NAI~~Ns~Pg~~~G~~p~~~~~~M 51 (120) T TIGR02058 12 DQHGQNITKAAARAVRNAIASNSLPGIREGSLPDSSLNEM 51 (120) T ss_pred EECCCCHHHHHHHHHHHHHHHCCCHHHCCCCCCCCCCCCC T ss_conf 4358886789999886445324730330255899870132 No 53 >PRK12907 secY preprotein translocase subunit SecY; Reviewed Probab=43.32 E-value=18 Score=14.39 Aligned_cols=10 Identities=10% Similarity=-0.051 Sum_probs=4.8 Q ss_pred HHHCCCCCCC Q ss_conf 8750738985 Q gi|254780897|r 35 IRSKANIKGF 44 (473) Q Consensus 35 ~~~~~~ipGF 44 (473) +...+.+||. T Consensus 30 lG~~IPiPgi 39 (434) T PRK12907 30 IGTFIPVPHT 39 (434) T ss_pred HCCCCCCCCC T ss_conf 6161898995 No 54 >PRK04980 hypothetical protein; Provisional Probab=42.90 E-value=18 Score=14.35 Aligned_cols=42 Identities=7% Similarity=-0.012 Sum_probs=34.4 Q ss_pred CEEEEEEHHHHHCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 3046761011002467776747887502365578999999887 Q gi|254780897|r 239 KVRLNFSIKEVFSPLPVVVNNDLAVRLGFESESAMRGLCSQKI 281 (473) Q Consensus 239 ~v~f~v~v~~I~~~~~pel~def~k~~~~~t~~elk~~ik~~l 281 (473) ..-+++.|.+|.-...-+||+.-|++=+.. +++||.-|++-. T Consensus 47 ~~fc~IeIlsV~pi~fd~Lne~HA~QEnmt-L~eLK~vI~~IY 88 (102) T PRK04980 47 RYFCTIEVLSVSPVTFDELNEKHAEQENMT-LPELKQVIAEIY 88 (102) T ss_pred CEEEEEEEEEEEEECHHHHHHHHHHHCCCC-HHHHHHHHHHHC T ss_conf 789999999987762767628889761898-899999999878 No 55 >cd01089 PA2G4-like Related to aminopepdidase M, this family contains proliferation-associated protein 2G4. Family members have been implicated in cell cycle control. Probab=42.59 E-value=18 Score=14.31 Aligned_cols=34 Identities=24% Similarity=0.313 Sum_probs=20.2 Q ss_pred CCCCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEECC Q ss_conf 34332211113444566642653466654420365178 Q gi|254780897|r 163 KETESEIGDKVTVDYTVSVDNVILEDQSKKNVQFIVGS 200 (473) Q Consensus 163 ~~~~~~~gD~v~id~~~~~dg~~~~~~~~~~~~~~lg~ 200 (473) .++..+.||.|.||+....||=.- .-.+++.+|. T Consensus 79 ~~~~L~~GDiV~iD~g~~~dGy~~----D~a~T~~VG~ 112 (228) T cd01089 79 ATYTLKDGDVVKIDLGCHIDGYIA----VVAHTIVVGA 112 (228) T ss_pred CCCCCCCCCEEEEEEEEEECCEEE----EEEEEEEECC T ss_conf 873546899899977688687798----7569999489 No 56 >cd01088 MetAP2 Methionine Aminopeptidase 2. E.C. 3.4.11.18. Also known as methionyl aminopeptidase and peptidase M. Catalyzes release of N-terminal amino acids, preferentially methionine, from peptides and arylamides. Probab=37.96 E-value=21 Score=13.79 Aligned_cols=20 Identities=35% Similarity=0.466 Sum_probs=11.8 Q ss_pred CCCCCCCEEEEEEEEEECCC Q ss_conf 33221111344456664265 Q gi|254780897|r 165 TESEIGDKVTVDYTVSVDNV 184 (473) Q Consensus 165 ~~~~~gD~v~id~~~~~dg~ 184 (473) +..+.||.|.||+...+||= T Consensus 69 ~~lk~GDiVkiD~G~~idG~ 88 (291) T cd01088 69 TVLKEGDVVKLDFGAHVDGY 88 (291) T ss_pred EEECCCCEEEEEEEEEECCE T ss_conf 46337998998665787788 No 57 >COG0199 RpsN Ribosomal protein S14 [Translation, ribosomal structure and biogenesis] Probab=34.44 E-value=21 Score=13.80 Aligned_cols=18 Identities=17% Similarity=0.344 Sum_probs=11.5 Q ss_pred HHHHHHHHCCCCCCCCCC Q ss_conf 999998750738985897 Q gi|254780897|r 30 ERIEDIRSKANIKGFRPG 47 (473) Q Consensus 30 ~~~~~~~~~~~ipGFRkG 47 (473) ..+++++..=.||||+|. T Consensus 42 ~cfRE~A~~g~ipGv~Ka 59 (61) T COG0199 42 ICFRELAHKGEIPGVKKA 59 (61) T ss_pred HHHHHHHHCCCCCCEEEC T ss_conf 999998610688874725 No 58 >TIGR03342 dsrC_tusE_dsvC sulfur relay protein, TusE/DsrC/DsvC family. Members of this protein family may be described as TusE, a partner to TusBCD in a sulfur relay system for 2-thiouridine biosynthesis, a tRNA base modification process. Other members are DsrC, a functionally similar protein in species where the sulfur relay system exists primarily for sulfur metabolism rather than tRNA base modification. Some members of this family are known explicitly as the gamma subunit of sulfite reductases. Probab=34.27 E-value=24 Score=13.37 Aligned_cols=34 Identities=24% Similarity=0.288 Sum_probs=26.5 Q ss_pred HHHHHHHHCCCCCCHHH--HHHHHHHHHHHCCCCHH Q ss_conf 99999864387428999--99999999986399989 Q gi|254780897|r 363 VLGTIGEKNNIEVTEEE--MQSALYQQLGRFPGHEK 396 (473) Q Consensus 363 il~~Ia~~e~I~vs~~E--i~~~i~~~~~~~~~~~~ 396 (473) +-..||+.++|+.|++- |-.++.+++..++..|. T Consensus 26 vA~~lA~~egI~Lt~~HW~vI~~lR~~y~~~~~~P~ 61 (108) T TIGR03342 26 VAEALAEEEGIELTEEHWEVINFLRDFYAEYNISPA 61 (108) T ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 999999883986689999999999999999789983 No 59 >PRK08671 methionine aminopeptidase; Provisional Probab=33.79 E-value=25 Score=13.31 Aligned_cols=21 Identities=38% Similarity=0.382 Sum_probs=12.9 Q ss_pred CCCCCCCCEEEEEEEEEECCC Q ss_conf 433221111344456664265 Q gi|254780897|r 164 ETESEIGDKVTVDYTVSVDNV 184 (473) Q Consensus 164 ~~~~~~gD~v~id~~~~~dg~ 184 (473) ++..+.||.|.||+...+||= T Consensus 69 ~~~lk~GDiVkiD~g~~idG~ 89 (293) T PRK08671 69 ETVFPEGDVVKLDLGAHVDGY 89 (293) T ss_pred CCCCCCCCEEEEEEEEEECCE T ss_conf 616258998999788898999 No 60 >PRK12414 putative aminotransferase; Provisional Probab=33.78 E-value=22 Score=13.76 Aligned_cols=23 Identities=13% Similarity=0.388 Sum_probs=8.8 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 111578999999764310000234 Q gi|254780897|r 141 VDEKEIDRQMAEIAKNNVAFEVKE 164 (473) Q Consensus 141 vtd~~Id~~i~~l~~~~~~~~~~~ 164 (473) .+.+++.+ |-.+.+++..|.-.| T Consensus 179 ~s~e~l~~-l~~la~~~~i~ii~D 201 (384) T PRK12414 179 FSEADLAR-LAQLTRGTDIVVLSD 201 (384) T ss_pred CCHHHHHH-HHHHHHHCCEEEEEC T ss_conf 75666999-999875158899966 No 61 >KOG2738 consensus Probab=33.47 E-value=25 Score=13.27 Aligned_cols=45 Identities=31% Similarity=0.323 Sum_probs=26.9 Q ss_pred CCCCHHHHHHHHHHHHHHHHH---------C---------------CCCCCCCCCCCEEEEEEEEEECC Q ss_conf 211115789999997643100---------0---------------02343322111134445666426 Q gi|254780897|r 139 CEVDEKEIDRQMAEIAKNNVA---------F---------------EVKETESEIGDKVTVDYTVSVDN 183 (473) Q Consensus 139 ~~vtd~~Id~~i~~l~~~~~~---------~---------------~~~~~~~~~gD~v~id~~~~~dg 183 (473) +-||-++||+++-+..-.++- | .|-.|+.+.||.|+||...+.+| T Consensus 145 PgvTTdEiD~~VH~a~Ierg~YPSPLnYy~FPKS~CTSVNEviCHGIPD~RpLedGDIvNiDVtvY~~G 213 (369) T KOG2738 145 PGVTTDEIDRAVHNAIIERGAYPSPLNYYGFPKSVCTSVNEVICHGIPDSRPLEDGDIVNIDVTVYLNG 213 (369) T ss_pred CCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHCCHHHEEECCCCCCCCCCCCCEEEEEEEEEECC T ss_conf 976388999999999986488588754578866662422214654788867277798786889999425 No 62 >pfam04266 ASCH ASCH domain. The ASCH domain adopts a beta-barrel fold similar to the pfam01472 domain. It is thought to function as an RNA-binding domain during coactivation, RNA-processing and possibly during prokaryotic translation regulation. Probab=32.93 E-value=25 Score=13.21 Aligned_cols=45 Identities=13% Similarity=-0.065 Sum_probs=35.6 Q ss_pred CEEEEEEHHHHHCCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 3046761011002467776747887502365578999999887899 Q gi|254780897|r 239 KVRLNFSIKEVFSPLPVVVNNDLAVRLGFESESAMRGLCSQKIKQH 284 (473) Q Consensus 239 ~v~f~v~v~~I~~~~~pel~def~k~~~~~t~~elk~~ik~~l~~~ 284 (473) .....+.+.+|.....-++++++|..=|. |+++++...++-.... T Consensus 44 ~~~~~i~v~~V~~~~f~~v~~e~A~~eGe-sl~~w~~~h~~~y~~~ 88 (104) T pfam04266 44 RPVGVIEVTDVEIIPFEEVTEEHAYLEGE-SLEEWRKVHREFYPEE 88 (104) T ss_pred CEEEEEEEEEEEEEEHHHCCHHHHHHHCC-CHHHHHHHHHHHHHHH T ss_conf 89999999899999844499999998488-9999999999970001 No 63 >COG4086 Predicted secreted protein [Function unknown] Probab=30.47 E-value=28 Score=12.91 Aligned_cols=104 Identities=17% Similarity=0.286 Sum_probs=54.9 Q ss_pred HHHCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHHHHCCCCCCCCH-----------HHHHHHCCCC--HHH Q ss_conf 34206--88788754332344444441000387304676101100246777674-----------7887502365--578 Q gi|254780897|r 208 TEILV--GLKTGDQKEIERFFPEDHSIKDLAGKKVRLNFSIKEVFSPLPVVVNN-----------DLAVRLGFES--ESA 272 (473) Q Consensus 208 ~~~li--G~k~Gd~~~~~~~~P~d~~~~~laGk~v~f~v~v~~I~~~~~pel~d-----------ef~k~~~~~t--~~e 272 (473) ..+|+ |+..+. +.+...||-. ....|+|----|++.+ --..|+-+. |+..+.|.+. ..+ T Consensus 122 anAL~TaGi~~a~-V~VtaP~pvS-GeAALaGv~KayE~a~----g~~Ipe~~KqvaneEL~~~sel~~k~G~d~~r~a~ 195 (299) T COG4086 122 ANALVTAGIEDAK-VTVTAPFPVS-GEAALAGVYKAYEAAV----GVQIPEANKQVANEELVATSELGDKIGDDPRRAAA 195 (299) T ss_pred HHHHHHCCCCCCE-EEEECCCCCC-CHHHHHHHHHHHHHHC----CCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHH T ss_conf 9999866998714-8993675675-1999999999999862----78786788999999999988764521887788999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHH Q ss_conf 99999988789999777776699999999864003479999999977 Q gi|254780897|r 273 MRGLCSQKIKQHSEFLVRQKVKRQILDYISNKYTFDVPESLVENEYN 319 (473) Q Consensus 273 lk~~ik~~l~~~~~~~~~~~~~~~i~~~L~~~~~~~lPe~lv~~e~~ 319 (473) |-..|++.+..+.. ....-.+.+++.+.+.+.+.||+.-|++-++ T Consensus 196 l~~~VK~~~a~~~~--~~~~dirkvv~dv~~~ynvnltd~qvn~i~~ 240 (299) T COG4086 196 LMAEVKEEVAKQKV--DDPADIRKVVDDVANNYNVNLTDTQVNQIVN 240 (299) T ss_pred HHHHHHHHHHHCCC--CCHHHHHHHHHHHHHHCCCCCCHHHHHHHHH T ss_conf 99999999974789--9989999999999987188997889999999 No 64 >PRK01546 hypothetical protein; Provisional Probab=29.70 E-value=28 Score=12.82 Aligned_cols=60 Identities=7% Similarity=0.049 Sum_probs=40.5 Q ss_pred HHHHHHHHHCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH Q ss_conf 9999997534864001100013456778999999999999999998643874289999999 Q gi|254780897|r 323 QKVRFEMSSANQKSQNVDSIDEEDLQYYHMLAKRRVLTGIVLGTIGEKNNIEVTEEEMQSA 383 (473) Q Consensus 323 ~~~~~~l~~~~~~~~~~~~~~e~~~~~~~~~Aek~vk~~lil~~Ia~~e~I~vs~~Ei~~~ 383 (473) +.....-+..|.+.++... ...+|+.|-..-+.+++.+|---+|...++-.||++-+.+. T Consensus 11 NeLA~K~K~~gLT~~E~~E-q~~LR~eYL~~fR~~~~~~l~~~kVvD~~GnDVTPeKlk~~ 70 (79) T PRK01546 11 NFLAKKAKAEGLTEEEQRE-RQSLREQYLKGFRQNMLNELKGIKVVNEQGTDVTPAKLKAL 70 (79) T ss_pred HHHHHHHCCCCCCHHHHHH-HHHHHHHHHHHHHHHHHHHHCCCEEECCCCCCCCHHHHHHH T ss_conf 9999864025899999999-99999999999999999884311378999897888999999 No 65 >PRK11508 sulfurtransferase TusE; Provisional Probab=29.42 E-value=29 Score=12.78 Aligned_cols=34 Identities=15% Similarity=0.142 Sum_probs=26.5 Q ss_pred HHHHHHHHCCCCCCHHH--HHHHHHHHHHHCCCCHH Q ss_conf 99999864387428999--99999999986399989 Q gi|254780897|r 363 VLGTIGEKNNIEVTEEE--MQSALYQQLGRFPGHEK 396 (473) Q Consensus 363 il~~Ia~~e~I~vs~~E--i~~~i~~~~~~~~~~~~ 396 (473) +-.+||+.++|+.|++- |-.++.+++..++.+|. T Consensus 27 vA~~lA~~egI~Lt~~HW~VI~~lR~~Y~~~~~~P~ 62 (109) T PRK11508 27 LAVVIAENEGISLSPEHWEVVRFVRDFYLEFNTSPA 62 (109) T ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 999999884986689999999999999999789972 No 66 >TIGR02168 SMC_prok_B chromosome segregation protein SMC; InterPro: IPR011890 The SMC (structural maintenance of chromosomes) family of proteins, exist in virtually all organisms including both bacteria and archaea. The SMC proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms and form three types of heterodimer (SMC1SMC3, SMC2SMC4, SMC5SMC6), which are core components of large multiprotein complexes. The best known complexes are cohesin, which is responsible for sister-chromatid cohesion, and condensin, which is required for full chromosome condensation in mitosis. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and share a five-domain structure, with globular N- and C-terminal (IPR003395 from INTERPRO) domains separated by a long (circa 100 nm or 900 residues) coiled coil segment in the centre of which is a globular ''hinge'' domain, characterised by a set of four highly conserved glycine residues that are typical of flexible regions in a protein. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif (XXXXD, where X is any hydrophobic residue), and a LSGG motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases . All SMC proteins appear to form dimers, either forming homodimers with themselves, as in the case of prokaryotic SMC proteins, or heterodimers between different but related SMC proteins. The dimers are arranged in an antiparallel alignment. This orientation brings the N- and C-terminal globular domains (from either different or identical protamers) together, which unites an ATP binding site (Walker A motif) within the N-terminal domain with a Walker B motif (DA box) within the C-terminal domain, to form a potentially functional ATPase. Protein interaction and microscopy data suggest that SMC dimers form a ring-like structure which might embrace DNA molecules. Non-SMC subunits associate with the SMC amino- and carboxy-terminal domains. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. SMCs share not only sequence similarity but also structural similarity with ABC proteins. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair and epigenetic silencing of gene expression . The smc gene is often associated with scpB (IPR005234 from INTERPRO) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle , , .; GO: 0005515 protein binding, 0005524 ATP binding, 0005694 chromosome. Probab=29.28 E-value=29 Score=12.77 Aligned_cols=21 Identities=0% Similarity=0.090 Sum_probs=12.9 Q ss_pred CHHHHHHHHHHHHHHHHHCCC Q ss_conf 115789999997643100002 Q gi|254780897|r 142 DEKEIDRQMAEIAKNNVAFEV 162 (473) Q Consensus 142 td~~Id~~i~~l~~~~~~~~~ 162 (473) .+..+...+..++........ T Consensus 615 ~~~~~~~~~~~lL~~~~vv~~ 635 (1191) T TIGR02168 615 FDPKLRKALSYLLGGVLVVDD 635 (1191) T ss_pred CCHHHHHHHHHHHCCEEEECC T ss_conf 606678999985587288717 No 67 >TIGR01309 L30P_arch ribosomal protein L30P; InterPro: IPR005997 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . Ribosomal protein L30 is one of the proteins from the large ribosomal subunit. L30 belongs to a family of ribosomal proteins which, on the basis of sequence similarities, groups bacteria and archaea L30, yeast mitochondrial L33, and Drosophila, slime mould, fungal and mammalian L7 ribosomal proteins This model describes archaeal 50S ribosomal protein L30. These proteins share similarity to the longer eukaryotic 60S ribosomal protein L7 and to the much shorter (~60 residue) bacterial 50S ribosomal protein L30. ; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015934 large ribosomal subunit. Probab=29.10 E-value=29 Score=12.75 Aligned_cols=22 Identities=18% Similarity=0.291 Sum_probs=13.9 Q ss_pred CCCHHHHHH-HCCCCHHHHHHHH Q ss_conf 767478875-0236557899999 Q gi|254780897|r 256 VVNNDLAVR-LGFESESAMRGLC 277 (473) Q Consensus 256 el~def~k~-~~~~t~~elk~~i 277 (473) .+||++++. ++++|++||=++| T Consensus 78 ~vtD~~v~E~t~~~~~~elA~al 100 (160) T TIGR01309 78 KVTDEYVKEVTGYESVDELAKAL 100 (160) T ss_pred CCCHHHHHHHCCCCCHHHHHHHH T ss_conf 67888876532888888999998 No 68 >TIGR02023 BchP-ChlP geranylgeranyl reductase; InterPro: IPR010253 This entry represents a group of geranylgeranyl reductases specific for the biosyntheses of bacteriochlorophyll and chlorophyll . It is unclear whether the processes of isoprenoid ligation to the chlorin ring and reduction of the geranylgeranyl chain to a phytyl chain are necessarily ordered the same way in all species (see introduction to ).; GO: 0045550 geranylgeranyl reductase activity, 0015979 photosynthesis, 0015995 chlorophyll biosynthetic process. Probab=28.96 E-value=23 Score=13.57 Aligned_cols=136 Identities=16% Similarity=0.146 Sum_probs=56.1 Q ss_pred CCHHHHHHHHHHH-HHHHHHCCC-----CCCCCC-CCCEEEEEEEEEECCCCCCCCCCC--CEEEEECCCCCCCCHHHHC Q ss_conf 1115789999997-643100002-----343322-111134445666426534666544--2036517866563103420 Q gi|254780897|r 141 VDEKEIDRQMAEI-AKNNVAFEV-----KETESE-IGDKVTVDYTVSVDNVILEDQSKK--NVQFIVGSADLFSETTEIL 211 (473) Q Consensus 141 vtd~~Id~~i~~l-~~~~~~~~~-----~~~~~~-~gD~v~id~~~~~dg~~~~~~~~~--~~~~~lg~~~~~~~f~~~l 211 (473) |.=+-.|..|-+- .+.-|++.. .+++.. -...++++|. +.+|+.--|+... ...+++|.+..--..--.- T Consensus 93 VrREvFD~~LReRA~kaGAe~~~g~f~~~~~d~~GWds~~~~~~~-~~~G~k~ag~~~~~v~~~~VIGADGA~S~vAR~~ 171 (408) T TIGR02023 93 VRREVFDEYLRERAQKAGAELIEGLFKKLERDEDGWDSRVTLQYR-EKDGKKEAGEEKKSVEADVVIGADGANSKVARKE 171 (408) T ss_pred EEHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEE-CCCCCCCCCCCCCCEEEEEEEECCCCCCHHHHHH T ss_conf 551441578999987621476656755201567777552589987-6777512567777378988760447886588971 Q ss_pred CCCCCCCCCCC----CCCCCCCCCCCCCCCCCEEEEEEHHHHHCCCCCCCCHHHHHH-----HCCCCHHHHHHHHHHHHH Q ss_conf 68878875433----234444444100038730467610110024677767478875-----023655789999998878 Q gi|254780897|r 212 VGLKTGDQKEI----ERFFPEDHSIKDLAGKKVRLNFSIKEVFSPLPVVVNNDLAVR-----LGFESESAMRGLCSQKIK 282 (473) Q Consensus 212 iG~k~Gd~~~~----~~~~P~d~~~~~laGk~v~f~v~v~~I~~~~~pel~def~k~-----~~~~t~~elk~~ik~~l~ 282 (473) ||.-.....-+ -+..|.+..-..+ +. ..+|=+ --.+-|.+==-.|=+ +|..|.-..+..+|+.+. T Consensus 172 ~~~~~~~~~viA~qEri~~p~~~~~~y~-e~--~~d~~~---~g~VSPDFYgW~FPk~dH~avG~Gt~P~h~~d~K~~~~ 245 (408) T TIGR02023 172 LGLPDNLPKVIAYQERIKLPDESKMKYY-EE--LADVYY---DGEVSPDFYGWVFPKGDHIAVGTGTEPTHGFDIKQLQA 245 (408) T ss_pred CCCCCCHHHEEEHHHEECCCCCCCCCCC-CC--EEEEEE---CCEECCCCCCCCCCCCCEEEEECCCCCCCHHHHHHHHH T ss_conf 8758860230201120048886565778-87--799998---88048642302068877788832788564278999999 Q ss_pred H Q ss_conf 9 Q gi|254780897|r 283 Q 283 (473) Q Consensus 283 ~ 283 (473) . T Consensus 246 ~ 246 (408) T TIGR02023 246 A 246 (408) T ss_pred H T ss_conf 9 No 69 >COG4283 Uncharacterized conserved protein [Function unknown] Probab=28.09 E-value=30 Score=12.62 Aligned_cols=35 Identities=6% Similarity=-0.163 Sum_probs=28.2 Q ss_pred CCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHH Q ss_conf 24677767478875023655789999998878999 Q gi|254780897|r 251 SPLPVVVNNDLAVRLGFESESAMRGLCSQKIKQHS 285 (473) Q Consensus 251 ~~~~pel~def~k~~~~~t~~elk~~ik~~l~~~~ 285 (473) =+.+++|+-.|-++.+..|+.+++..+.++..... T Consensus 82 Wn~~geln~~F~kkyq~~SL~e~~~~L~k~h~~v~ 116 (170) T COG4283 82 WNNLGELNQWFWKKYQHLSLKELKAKLNKNHNDVY 116 (170) T ss_pred CCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH T ss_conf 14237899999998505469999999998799999 No 70 >PRK07281 methionine aminopeptidase; Reviewed Probab=27.82 E-value=30 Score=12.58 Aligned_cols=21 Identities=14% Similarity=0.226 Sum_probs=8.1 Q ss_pred CCCCCCCCCCCC-CCEEEEEEH Q ss_conf 444444100038-730467610 Q gi|254780897|r 226 FPEDHSIKDLAG-KKVRLNFSI 246 (473) Q Consensus 226 ~P~d~~~~~laG-k~v~f~v~v 246 (473) .++.|.....-| -.+.|+=+| T Consensus 246 ~~dgWt~~t~Dg~~sa~~Ehtv 267 (286) T PRK07281 246 MKTGWAHKTLDGGLSCQYEHQF 267 (286) T ss_pred CCCCEEEEECCCCEEEEECEEE T ss_conf 7997489948998777305599 No 71 >TIGR01265 tyr_nico_aTase tyrosine/nicotianamine aminotransferases; InterPro: IPR005958 This entry describes tyrosine aminotransferase (2.6.1.5 from EC) as found in animals and Trypanosoma cruzi (Euglenozoa). It is the first enzyme of a pathway of tyrosine degradation via homogentisate.L-tyrosine + 2-oxoglutarate = 4-hydroxyphenylpyruvate + L-glutamate. Several plant enzyme designated as probable tyrosine aminotransferases are very closely related to an experimentally demonstrated nicotianamine aminotransferase, an enzyme in a siderophore (iron uptake chelator) biosynthesis pathway are included in this family. ; GO: 0008483 transaminase activity, 0030170 pyridoxal phosphate binding, 0006519 amino acid and derivative metabolic process. Probab=27.34 E-value=31 Score=12.52 Aligned_cols=35 Identities=17% Similarity=0.266 Sum_probs=21.1 Q ss_pred CCCCCCCCCCCCCCCCCHHHHH---HHHHHHHHHHHHC Q ss_conf 2111356431001211115789---9999976431000 Q gi|254780897|r 126 ISSFDDLQVTQDICEVDEKEID---RQMAEIAKNNVAF 160 (473) Q Consensus 126 l~~y~~i~i~~~~~~vtd~~Id---~~i~~l~~~~~~~ 160 (473) +..|.+|.+..+.....|++.+ +.|+.|...++.- T Consensus 150 ~A~~~~lEvr~ydLdlPe~~WeiDL~~le~l~DekT~A 187 (424) T TIGR01265 150 RAAFSGLEVRLYDLDLPEKDWEIDLDGLESLADEKTVA 187 (424) T ss_pred HHHHCCCCCCCCCCCCCCCCCEECHHHHHHHHCCCCEE T ss_conf 99771872000246788875236478998763268769 No 72 >TIGR00998 8a0101 efflux pump membrane protein; InterPro: IPR005694 emrA and emrB confer resistance to carbonylcyanide m-chlorophenylhydrazone, nalidixic acid, and a number of other toxic compounds. EmrB encodes a highly hydrophobic 56.2-kDa peptide, with 14 potential alpha-helices to span the inner membrane. EmrA encodes a putative 42.7-kDa peptide containing a single hydrophobic domain and a large C-terminal hydrophilic domain.; GO: 0015238 drug transporter activity, 0006855 multidrug transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=26.79 E-value=31 Score=12.45 Aligned_cols=43 Identities=14% Similarity=0.083 Sum_probs=34.5 Q ss_pred HHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHHHHCCCCCCCCH Q ss_conf 3420688788754332344444441000387304676101100246777674 Q gi|254780897|r 208 TEILVGLKTGDQKEIERFFPEDHSIKDLAGKKVRLNFSIKEVFSPLPVVVNN 259 (473) Q Consensus 208 ~~~liG~k~Gd~~~~~~~~P~d~~~~~laGk~v~f~v~v~~I~~~~~pel~d 259 (473) +-+|.++++|..+++... +-|..+.|+=+|.+|.--++|..+= T Consensus 289 ETqL~~vR~G~~v~i~~D---------lYg~dv~F~GkV~gi~~GvLPDDtG 331 (379) T TIGR00998 289 ETQLKNVRIGQPVTIRSD---------LYGSDVVFEGKVTGISMGVLPDDTG 331 (379) T ss_pred CCCCCCCCCCCCEEEEEE---------ECCCCCEEEEEEEECCCCCCCCCHH T ss_conf 132057457872589985---------4389807974884226633567404 No 73 >cd01651 RT_G2_intron RT_G2_intron: Reverse transcriptases (RTs) with group II intron origin. RT transcribes DNA using RNA as template. Proteins in this subfamily are found in bacterial and mitochondrial group II introns. Their most probable ancestor was a retrotransposable element with both gag-like and pol-like genes. This subfamily of proteins appears to have captured the RT sequences from transposable elements, which lack long terminal repeats (LTRs). Probab=26.55 E-value=32 Score=12.42 Aligned_cols=52 Identities=23% Similarity=0.350 Sum_probs=29.4 Q ss_pred EEEEECCCCEEEEEEE-ECHHHHHHHHHHHHHHHHHCC---CCCCCCCCCCCHHHH Q ss_conf 1899727976999998-548999999999999987507---389858970488999 Q gi|254780897|r 3 VIEKFSEGLKRELDVI-IPSNRLTDSFNERIEDIRSKA---NIKGFRPGKVPLSHI 54 (473) Q Consensus 3 v~~~~~~~~~~~l~v~-v~~~~~~~~~~~~~~~~~~~~---~ipGFRkGKvP~~ii 54 (473) |-+-|.+|-.|.+++. +...-++..+..++..+-... .-=|||||+-....+ T Consensus 3 v~IpK~~g~~RpIs~~~~~dki~~~~i~~~L~~~~~~~~~~~~~gfr~gr~~~~ai 58 (226) T cd01651 3 VYIPKPNGKKRPLGIPTVRDRIVQEALKLVLEPIYEPRFSDCSYGFRPGRSAHDAL 58 (226) T ss_pred EEEECCCCCCCEECCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHH T ss_conf 76588997932357899899999999999999985274173332587798589999 No 74 >COG5192 BMS1 GTP-binding protein required for 40S ribosome biogenesis [Translation, ribosomal structure and biogenesis] Probab=25.76 E-value=32 Score=12.40 Aligned_cols=22 Identities=9% Similarity=-0.028 Sum_probs=12.9 Q ss_pred HHHCCCCCCCCCCCCEEEEEEE Q ss_conf 1000023433221111344456 Q gi|254780897|r 157 NVAFEVKETESEIGDKVTVDYT 178 (473) Q Consensus 157 ~~~~~~~~~~~~~gD~v~id~~ 178 (473) |+.+.++.+.....|.|.|+.- T Consensus 326 Y~PMsdiGGil~dKD~vyiev~ 347 (1077) T COG5192 326 YSPMSDIGGILKDKDRVYIEVP 347 (1077) T ss_pred ECCCCCCCCEEECCCCEEEECC T ss_conf 2443455552012652256446 No 75 >TIGR00963 secA preprotein translocase, SecA subunit; InterPro: IPR000185 Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component. . From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins. The translocase protein subunits are encoded on the bacterial chromosome. The translocase itself comprises 7 proteins, including a chaperone protein (SecB), an ATPase (SecA), an integral membrane complex (SecCY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) . The chaperone protein SecB is a highly acidic homotetrameric protein that exists as a "dimer of dimers" in the bacterial cytoplasm. SecB maintains preproteins in an unfolded state after translation, and targets these to the peripheral membrane protein ATPase SecA for secretion . SecA is a cytoplasmic protein of 800 to 960 amino acid residues. Homologs of secA are also encoded in the chloroplast genome of some algae as well as in the nuclear genome of plants . It could be involved in the intraorganellar protein transport into thylakoids.; GO: 0005524 ATP binding, 0006605 protein targeting, 0006886 intracellular protein transport. Probab=25.51 E-value=17 Score=14.55 Aligned_cols=26 Identities=15% Similarity=0.368 Sum_probs=12.7 Q ss_pred CCHHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 28999999999999863999899999 Q gi|254780897|r 375 VTEEEMQSALYQQLGRFPGHEKKMLD 400 (473) Q Consensus 375 vs~~Ei~~~i~~~~~~~~~~~~~~~~ 400 (473) .+.+++...|...+..........+. T Consensus 813 ~~~~~~~~~l~~~~~~~Y~~k~~~~~ 838 (904) T TIGR00963 813 LTSEDLKEFLLEKIKAAYEEKEELLE 838 (904) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 78899999999999999998887657 No 76 >pfam03299 TF_AP-2 Transcription factor AP-2. Probab=25.27 E-value=33 Score=12.25 Aligned_cols=26 Identities=19% Similarity=0.258 Sum_probs=18.4 Q ss_pred CCCCEEEEEEHHHHHCCCCC-C-CCHHH Q ss_conf 38730467610110024677-7-67478 Q gi|254780897|r 236 AGKKVRLNFSIKEVFSPLPV-V-VNNDL 261 (473) Q Consensus 236 aGk~v~f~v~v~~I~~~~~p-e-l~def 261 (473) -+.+..|+|||-+|.|+.-| | ||-.+ T Consensus 13 lsstsKYkVTV~Ev~RRls~PE~LNaSl 40 (207) T pfam03299 13 LSSTSKYKVTVAEVQRRLSPPECLNASL 40 (207) T ss_pred CCCCCEEEEEHHHHHHCCCCHHHCCHHH T ss_conf 6787505655999987018811203888 No 77 >KOG0546 consensus Probab=25.13 E-value=33 Score=12.23 Aligned_cols=67 Identities=19% Similarity=0.145 Sum_probs=37.3 Q ss_pred CCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEECCCCCCCCHHHHCCCC-C-----CCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 3221111344456664265346665442036517866563103420688-7-----887543323444444410003873 Q gi|254780897|r 166 ESEIGDKVTVDYTVSVDNVILEDQSKKNVQFIVGSADLFSETTEILVGL-K-----TGDQKEIERFFPEDHSIKDLAGKK 239 (473) Q Consensus 166 ~~~~gD~v~id~~~~~dg~~~~~~~~~~~~~~lg~~~~~~~f~~~liG~-k-----~Gd~~~~~~~~P~d~~~~~laGk~ 239 (473) -++-||++.-|=+| |+.+.|..-+|-.|.|..+. .-|++| . -|-++-|+ ..+.++|-|+- T Consensus 74 MiQgGDfs~gnGtG---GeSIYG~~FdDEnF~lKHdr------pflLSMAN~GpNTNgSQFFIT-----T~p~PHLdGkH 139 (372) T KOG0546 74 MIQGGDFSEGNGTG---GESIYGEKFDDENFELKHDR------PFLLSMANRGPNTNGSQFFIT-----TVPTPHLDGKH 139 (372) T ss_pred EEECCCCCCCCCCC---CCCCCCCCCCCCCCEECCCC------CHHHHHHCCCCCCCCCCEEEE-----CCCCCCCCCCE T ss_conf 66136533478887---55000563046530214686------011012027999888616896-----77777768824 Q ss_pred EEEEEEH Q ss_conf 0467610 Q gi|254780897|r 240 VRLNFSI 246 (473) Q Consensus 240 v~f~v~v 246 (473) |.|=--| T Consensus 140 VVFGqVI 146 (372) T KOG0546 140 VVFGQVI 146 (372) T ss_pred EEEEEEE T ss_conf 6886676 No 78 >pfam01390 SEA SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains. Recently a proteolytic activity has been shown for a SEA domain. Probab=24.80 E-value=34 Score=12.19 Aligned_cols=12 Identities=17% Similarity=0.451 Sum_probs=5.1 Q ss_pred HHHHHHHHHHHH Q ss_conf 157899999976 Q gi|254780897|r 143 EKEIDRQMAEIA 154 (473) Q Consensus 143 d~~Id~~i~~l~ 154 (473) ...|...+..+. T Consensus 33 ~~~i~~~~~~~f 44 (107) T pfam01390 33 ARKIENLLNEVF 44 (107) T ss_pred HHHHHHHHHHHH T ss_conf 999999999997 No 79 >PRK02769 histidine decarboxylase; Provisional Probab=24.68 E-value=34 Score=12.17 Aligned_cols=14 Identities=7% Similarity=0.249 Sum_probs=5.7 Q ss_pred CCHHHHHHHHHHHH Q ss_conf 28999999999999 Q gi|254780897|r 375 VTEEEMQSALYQQL 388 (473) Q Consensus 375 vs~~Ei~~~i~~~~ 388 (473) |+.+.++.++..++ T Consensus 354 Vt~e~iD~fl~DL~ 367 (380) T PRK02769 354 HNKQQIDSLIDELI 367 (380) T ss_pred CCHHHHHHHHHHHH T ss_conf 99999999999999 No 80 >pfam06021 Gly_acyl_tr_N Aralkyl acyl-CoA:amino acid N-acyltransferase. This family consists of several mammalian specific aralkyl acyl-CoA:amino acid N-acyltransferase (glycine N-acyltransferase) proteins EC:2.3.1.13. Probab=24.66 E-value=34 Score=12.17 Aligned_cols=44 Identities=16% Similarity=0.205 Sum_probs=31.3 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCCC Q ss_conf 99999999999999987299542287412222223332110137882057887622787762 Q gi|254780897|r 65 ETIDEIIKEIVPEILSKRDERAAMRPSITINEGESDITSGLIEGTVDLKLRLSYDILPQIEI 126 (473) Q Consensus 65 e~~~~li~~~~~~~i~~~~~~~~~~P~i~~~~~~~~~~~~~~~~~~~~~~~~~~ev~Pei~l 126 (473) ..+++.+.+++++.++- .|. -..++.|.||..++-++-+|+|.. T Consensus 11 q~Le~sL~k~~PeSLKV-----YG~-------------v~~inrGNPf~levlVDsWPdF~t 54 (205) T pfam06021 11 QMLEKSLRKSLPESLKV-----YGT-------------VFHINHGNPFNLKALVDKWPDFNT 54 (205) T ss_pred HHHHHHHHHHCCHHHEE-----EEE-------------EEEECCCCCCCEEEEEECCCCCEE T ss_conf 99999999858166601-----115-------------787316998642787604888508 No 81 >TIGR02713 allophanate_hyd allophanate hydrolase; InterPro: IPR014085 Allophanate hydrolase catalyses the second reaction in an ATP-dependent two-step degradation of urea to ammonia and C02, following the action of the biotin-containing urea carboxylase. The yeast enzyme, a fusion of allophanate hydrolase to urea carboxylase, is designated urea amidolyase.. Probab=24.10 E-value=24 Score=13.38 Aligned_cols=47 Identities=13% Similarity=0.157 Sum_probs=30.1 Q ss_pred HHHHHHHHHCCCCCH-HHH---HHHHHHHHHHHHHHHHHCC-CCCHHHHCCH Q ss_conf 999998640034799-999---9997767899999975348-6400110001 Q gi|254780897|r 297 ILDYISNKYTFDVPE-SLV---ENEYNGILQKVRFEMSSAN-QKSQNVDSID 343 (473) Q Consensus 297 i~~~L~~~~~~~lPe-~lv---~~e~~~~~~~~~~~l~~~~-~~~~~~~~~~ 343 (473) ....+-...-|=||. ++- +.+...+++...++|...| .....++... T Consensus 211 ~~~~~p~~~rvGVP~~~~~fFgD~~~~~~f~~A~~~L~~~G~~~~v~iDf~P 262 (582) T TIGR02713 211 ALRRLPPKPRVGVPRAQLEFFGDSQAEAAFAAAVERLEALGLVEVVEIDFAP 262 (582) T ss_pred CCCCCCCCCEEECCCCCCHHCCHHHHHHHHHHHHHHHHHCCCEEEEECCCCH T ss_conf 0034789767722675700025678899999999999845981799728736 No 82 >PRK11557 putative DNA-binding transcriptional regulator; Provisional Probab=23.94 E-value=35 Score=12.07 Aligned_cols=21 Identities=24% Similarity=0.422 Sum_probs=8.6 Q ss_pred HHHHHCCCCHHHHHHHHHHHH Q ss_conf 887502365578999999887 Q gi|254780897|r 261 LAVRLGFESESAMRGLCSQKI 281 (473) Q Consensus 261 f~k~~~~~t~~elk~~ik~~l 281 (473) |++++||+...+||..+++.+ T Consensus 54 f~kkLGf~Gf~efK~~l~~~~ 74 (282) T PRK11557 54 FAQKLGYKGFPALKLALSEAL 74 (282) T ss_pred HHHHHCCCCHHHHHHHHHHHH T ss_conf 999938998999999999998 No 83 >TIGR02103 pullul_strch alpha-1,6-glucosidases, pullulanase-type; InterPro: IPR011839 Members of this protein family include secreted (or membrane-anchored) pullulanases of Gram-negative bacteria and pullulanase-type starch debranching enzymes of plants. Both enzymes hydrolyze alpha-1,6 glycosidic linkages. Pullulan is an unusual, industrially important polysaccharide in which short alpha-1,4 chains (maltotriose) are connected in alpha-1,6 linkages. Enzymes that cleave alpha-1,6 linkages in pullulan and release maltotriose are called pullulanases although pullulan itself may not be the natural substrate. This family is closely homologous to, but architecturally different from the pullulanases of Gram-positive bacteria (IPR011838 from INTERPRO).; GO: 0051060 pullulanase activity, 0005975 carbohydrate metabolic process. Probab=23.45 E-value=36 Score=12.00 Aligned_cols=62 Identities=16% Similarity=0.063 Sum_probs=31.9 Q ss_pred HHHHCCCCCCCCCCCCEEEEEEEEEE--CCC------------CCCCCCCCCEEEEECCCCCCCCHHHHCCCCCCC Q ss_conf 31000023433221111344456664--265------------346665442036517866563103420688788 Q gi|254780897|r 156 NNVAFEVKETESEIGDKVTVDYTVSV--DNV------------ILEDQSKKNVQFIVGSADLFSETTEILVGLKTG 217 (473) Q Consensus 156 ~~~~~~~~~~~~~~gD~v~id~~~~~--dg~------------~~~~~~~~~~~~~lg~~~~~~~f~~~liG~k~G 217 (473) -.+-|.-.......|.+=......+. -|+ .+--.+..-+.+.|.+..+-|.==++|--.|+- T Consensus 182 t~GvWs~~g~~~~~G~yYrY~V~VYhP~t~~vET~~VTDPYs~sLS~NS~~SqvvDL~d~~LkP~GWd~L~~pk~~ 257 (945) T TIGR02103 182 TSGVWSAEGEASWKGKYYRYEVTVYHPSTGKVETNLVTDPYSLSLSANSEYSQVVDLDDPSLKPEGWDALAAPKPQ 257 (945) T ss_pred CCCCEEECCCCCCCCCEEEEEEEEEEECCCCEEEEEECCHHHHHHCCCCCEEEEEECCCHHCCCCCCHHHCCCCCH T ss_conf 1750310044444574478898897406880576422663457315799568998669712076563346067731 No 84 >TIGR00059 L17 ribosomal protein L17; InterPro: IPR000456 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . Ribosomal protein L17 is one of the proteins from the large ribosomal subunit. Bacterial L17 is a protein of 120 to 130 amino-acid residues while yeast YmL8 is twice as large (238 residues). The N-terminal half of YmL8 is colinear with the sequence of L17 from Escherichia coli.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome. Probab=23.27 E-value=36 Score=11.98 Aligned_cols=28 Identities=14% Similarity=0.146 Sum_probs=14.6 Q ss_pred HHHHHHHCCCCCCHH---HHHHHHHHHHHHC Q ss_conf 999986438742899---9999999999863 Q gi|254780897|r 364 LGTIGEKNNIEVTEE---EMQSALYQQLGRF 391 (473) Q Consensus 364 l~~Ia~~e~I~vs~~---Ei~~~i~~~~~~~ 391 (473) ..++..++.|+=|.. |+...++.++.-- T Consensus 21 ~~~L~~he~I~TT~~KAKe~r~~~EklIT~A 51 (125) T TIGR00059 21 ASALIRHEKIETTLAKAKELRRVVEKLITLA 51 (125) T ss_pred HHHHHHCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 9999862886203578888888885776676 No 85 >pfam05089 NAGLU Alpha-N-acetylglucosaminidase (NAGLU). Alpha-N-acetylglucosaminidase, a lysosomal enzyme required for the stepwise degradation of heparan sulfate. Mutations on the alpha-N-acetylglucosaminidase (NAGLU) gene can lead to Mucopolysaccharidosis type IIIB (MPS IIIB; or Sanfilippo syndrome type B) characterized by neurological dysfunction but relatively mild somatic manifestations. Probab=23.12 E-value=36 Score=11.95 Aligned_cols=16 Identities=13% Similarity=0.156 Sum_probs=10.9 Q ss_pred CCCCCCCEEEEEEEEE Q ss_conf 3322111134445666 Q gi|254780897|r 165 TESEIGDKVTVDYTVS 180 (473) Q Consensus 165 ~~~~~gD~v~id~~~~ 180 (473) .++..|+.+++|+-+. T Consensus 303 ~~Vp~~~mlvLDL~se 318 (672) T pfam05089 303 SAVPKGRLLVLDLFSE 318 (672) T ss_pred HCCCCCCEEEEECCCC T ss_conf 3689887799863555 No 86 >TIGR02656 cyanin_plasto plastocyanin; InterPro: IPR002387 Blue or 'type-1' copper proteins are small proteins which bind a single copper atom and which are characterised by an intense electronic absorption band near 600 nm , . The most well known members of this class of proteins are the plant chloroplastic plastocyanins, and the distantly related bacterial azurins, which exchange electrons with cytochrome c551. Plastocyanin participates in electron transfer between the cytochrome b6f complex and photosystem I. Many cyanobacteria and eukaryotic algae can synthesise both plastocyanin and its functional analog cytochrome c6, depending on bioavailabilities of copper and iron, respectively . Plastocyanin participates in electron transfer between P700 and the cytochrome b/f complex in photosystem I. ; GO: 0005507 copper ion binding, 0009055 electron carrier activity, 0006118 electron transport. Probab=22.98 E-value=36 Score=11.93 Aligned_cols=17 Identities=29% Similarity=0.368 Sum_probs=10.4 Q ss_pred CCCCCCCCCCCCEEEEE Q ss_conf 00234332211113444 Q gi|254780897|r 160 FEVKETESEIGDKVTVD 176 (473) Q Consensus 160 ~~~~~~~~~~gD~v~id 176 (473) |+|.+=.++.||.|.+- T Consensus 14 F~P~~~~i~aGDtV~f~ 30 (102) T TIGR02656 14 FEPAKISIAAGDTVKFV 30 (102) T ss_pred EECCCEEECCCCEEEEE T ss_conf 34581046889817884 No 87 >pfam09953 DUF2187 Uncharacterized protein conserved in bacteria (DUF2187). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=22.13 E-value=31 Score=12.49 Aligned_cols=36 Identities=25% Similarity=0.263 Sum_probs=24.8 Q ss_pred CHHHHCCCC--CCCC-CCCCCCCCCCCCCCCCCCCCCEE Q ss_conf 103420688--7887-54332344444441000387304 Q gi|254780897|r 206 ETTEILVGL--KTGD-QKEIERFFPEDHSIKDLAGKKVR 241 (473) Q Consensus 206 ~f~~~liG~--k~Gd-~~~~~~~~P~d~~~~~laGk~v~ 241 (473) +|.++|-|. |+++ ++.+++++-++|.+-++.-++|. T Consensus 11 EFk~Gl~GiVeKvNeNSVIVdlT~Men~~~L~lee~TVv 49 (57) T pfam09953 11 EFKNGLQGIVEKVNENSVIVDLTYMENFRDLDLEEKTVV 49 (57) T ss_pred EECCCCEEEEEEECCCCEEEEEEEECCCHHCCCCCEEEE T ss_conf 972642010013047838999885336001487711467 No 88 >PRK07777 aminotransferase; Validated Probab=21.71 E-value=38 Score=11.75 Aligned_cols=14 Identities=7% Similarity=0.131 Sum_probs=5.9 Q ss_pred HHHHHHCCCCCCHH Q ss_conf 99986438742899 Q gi|254780897|r 365 GTIGEKNNIEVTEE 378 (473) Q Consensus 365 ~~Ia~~e~I~vs~~ 378 (473) ..++++.+|-+.+- T Consensus 335 ~~L~~~~gV~v~PG 348 (386) T PRK07777 335 RALPERVGVAAIPM 348 (386) T ss_pred HHHHHHCCEEEECC T ss_conf 99998689999887 No 89 >PRK09795 aminopeptidase; Provisional Probab=21.70 E-value=38 Score=11.75 Aligned_cols=12 Identities=17% Similarity=0.202 Sum_probs=5.4 Q ss_pred HHHHHHHCCCCC Q ss_conf 999998729954 Q gi|254780897|r 75 VPEILSKRDERA 86 (473) Q Consensus 75 ~~~~i~~~~~~~ 86 (473) +.+.+.++++.. T Consensus 7 lr~~l~e~~lDa 18 (361) T PRK09795 7 LRDWLKAQQLDA 18 (361) T ss_pred HHHHHHHCCCCE T ss_conf 999999879999 No 90 >TIGR01461 greB transcription elongation factor GreB; InterPro: IPR006358 Bacterial GreA IPR006359 from INTERPRO and GreB promote transcription elongation by stimulating an endogenous, endonucleolytic transcript cleavage activity of the RNA polymerase, allowing RNA transcription to continue past template-encoded arresting sites. GreA and GreB are sequence homologues and have homologues in every known bacterial genome. GreA and GreB stimulate transcript cleavage in different ways; GreA induces cleavage of 3'-RNA fragments 23 nt in length and can only prevent the formation of arrested complexes, whereas GreB induces cleavage of fragments up to 18 nt in length and can rescue preexisting arrested complexes . A 15 Å resolution helical reconstruction of the Escherichia coli core RNA polymerase (RNAP)/GreB complex that allows fitting of high-resolution RNAP and GreB structures. The model of the complex reveals a remarkable binding mode for GreB; the globular C-terminal domain binds RNAP at the edge of the active site channel, while the N-terminal coiled-coil domain extends 45 Å into a channel directly to the RNAP active site. The results point to a key role for conserved acidic residues at the tip of the Gre factor coiled coil in modifying the RNAP active site to catalyse the transcript cleavage reaction, and mutational studies confirm that these positions are critical for Gre factor function. Functional differences between GreA and GreB correlate with the distribution of positively charged residues on one face of the N-terminal coiled coil.; GO: 0003677 DNA binding, 0003711 transcription elongation regulator activity, 0006355 regulation of transcription DNA-dependent. Probab=21.70 E-value=38 Score=11.75 Aligned_cols=90 Identities=26% Similarity=0.346 Sum_probs=59.3 Q ss_pred HHHHHHHHHHHHHHHCCCCC-CCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEECCCC---------CCCCHHHHCCCC Q ss_conf 78999999764310000234-33221111344456664265346665442036517866---------563103420688 Q gi|254780897|r 145 EIDRQMAEIAKNNVAFEVKE-TESEIGDKVTVDYTVSVDNVILEDQSKKNVQFIVGSAD---------LFSETTEILVGL 214 (473) Q Consensus 145 ~Id~~i~~l~~~~~~~~~~~-~~~~~gD~v~id~~~~~dg~~~~~~~~~~~~~~lg~~~---------~~~~f~~~liG~ 214 (473) +||..+--|+++.=.+..|+ ++.+..|.|.+--...+.+. +|....+. ++|-+. .+--+--+|+|+ T Consensus 56 EIDrRvRfLtKrLEnlkvVd~~p~~~t~~VfFGAwV~le~~---DG~~~r~r-IVG~DEid~r~n~ISidSPlARaL~kk 131 (157) T TIGR01461 56 EIDRRVRFLTKRLENLKVVDEEPREATDKVFFGAWVELEDD---DGREQRYR-IVGIDEIDTRKNKISIDSPLARALIKK 131 (157) T ss_pred HHHHHHHHHHHHHHHCEECCCCCCCCCCCEEEEEEEEEECC---CCCCCCEE-EEECCCCCCCCCCEEEECHHHHHHHCC T ss_conf 22112446555443001117653554576772258886538---88320268-860552123577201426678886301 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEHHH Q ss_conf 7887543323444444410003873046761011 Q gi|254780897|r 215 KTGDQKEIERFFPEDHSIKDLAGKKVRLNFSIKE 248 (473) Q Consensus 215 k~Gd~~~~~~~~P~d~~~~~laGk~v~f~v~v~~ 248 (473) ..||.+-++. |+ |....|-+.|-- T Consensus 132 e~gdevvvnT--Pa--------Ge~~~~v~~iEY 155 (157) T TIGR01461 132 EVGDEVVVNT--PA--------GEEEWYVLSIEY 155 (157) T ss_pred CCCCEEEEEC--CC--------CCCEEEEEEEEE T ss_conf 2587778757--88--------850478887520 No 91 >pfam04358 DsrC DsrC like protein. A family member from Desulfovibrio vulgaris has been observed to co-purify with Desulfovibrio vulgaris dissimilatory sulfite reductase, and many members of this family are annotated as the third (gamma) subunit of dissimilatory sulphite reductase. However, this protein appears to be only loosely associated to the sulfite reductase, which suggests that DsrC may not be an integral part of the dissimilatory sulphite reductase. Members of this family are found in organisms such as E. coli and H. influenzae which do not contain dissimilatory sulphite reductases but can synthesize assimilatory sirohaem sulphite and nitrite reductases. It is speculated that DsrC may be involved in the assembly, folding or stabilisation of sirohaem proteins. The strictly conserved cysteine in the C terminus suggests that DsrC may have a catalytic function in the metabolism of sulphur compounds. Probab=21.64 E-value=39 Score=11.74 Aligned_cols=34 Identities=24% Similarity=0.257 Sum_probs=26.2 Q ss_pred HHHHHHHHCCCCCCHHH--HHHHHHHHHHHCCCCHH Q ss_conf 99999864387428999--99999999986399989 Q gi|254780897|r 363 VLGTIGEKNNIEVTEEE--MQSALYQQLGRFPGHEK 396 (473) Q Consensus 363 il~~Ia~~e~I~vs~~E--i~~~i~~~~~~~~~~~~ 396 (473) +-.+||+.++|+.|++- |-.++.+++..++..|. T Consensus 27 vA~~lA~~egI~Lt~~HW~vI~~lR~~Y~~~~~~P~ 62 (109) T pfam04358 27 VAEALAAEEGIELTEEHWEVIRFLRDYYAEFGIAPA 62 (109) T ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 999999884986679999999999999999689971 No 92 >cd01085 APP X-Prolyl Aminopeptidase 2. E.C. 3.4.11.9. Also known as X-Pro aminopeptidase, proline aminopeptidase, aminopeptidase P, and aminoacylproline aminopeptidase. Catalyses release of any N-terminal amino acid, including proline, that is linked with proline, even from a dipeptide or tripeptide. Probab=21.63 E-value=39 Score=11.74 Aligned_cols=16 Identities=13% Similarity=0.316 Sum_probs=6.7 Q ss_pred CCHHHHHHHHHHHHHH Q ss_conf 1115789999997643 Q gi|254780897|r 141 VDEKEIDRQMAEIAKN 156 (473) Q Consensus 141 vtd~~Id~~i~~l~~~ 156 (473) +|+-++...++.+|.+ T Consensus 30 ~TE~~~a~~le~~r~~ 45 (224) T cd01085 30 ITELSAADKLEEFRRQ 45 (224) T ss_pred CCHHHHHHHHHHHHHH T ss_conf 3799999999999987 No 93 >pfam12322 T4_baseplate T4 bacteriophage base plate protein. This protein is found in viruses. Proteins in this family are typically between 208 to 249 amino acids in length. This protein has a single completely conserved residue S that may be functionally important. This family includes the two base plate proteins in T4 bacteriophages. These are gp51 and gp26, encoded by late genes. Probab=21.52 E-value=36 Score=11.96 Aligned_cols=104 Identities=14% Similarity=0.266 Sum_probs=55.4 Q ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEECCCCC Q ss_conf 76211135643100121111578999999764310000234332211113444566642653466654420365178665 Q gi|254780897|r 124 IEISSFDDLQVTQDICEVDEKEIDRQMAEIAKNNVAFEVKETESEIGDKVTVDYTVSVDNVILEDQSKKNVQFIVGSADL 203 (473) Q Consensus 124 i~l~~y~~i~i~~~~~~vtd~~Id~~i~~l~~~~~~~~~~~~~~~~gD~v~id~~~~~dg~~~~~~~~~~~~~~lg~~~~ 203 (473) |-+++++.+-+-... =+.+++-.++.++...-. + +| ++ .|. T Consensus 16 FlVKEeK~LL~A~Es--~d~~~i~~ai~~iI~~C~-~---------~~--------------lp-------~fD------ 56 (204) T pfam12322 16 FTVKEYRDLLLVRES--KSPNEQEEIIDELIKECT-F---------PE--------------LP-------KFE------ 56 (204) T ss_pred CCHHHHHHHHHHHHC--CCHHHHHHHHHHHHHHCC-C---------CC--------------CC-------HHH------ T ss_conf 502999988888724--998999999999998604-4---------87--------------87-------388------ Q ss_pred CCCHHHHCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCEEEEEEHHHHHCCC---CCCCCHHHHHHHCCCCHHHH Q ss_conf 6310342068878875-4332344444441000387304676101100246---77767478875023655789 Q gi|254780897|r 204 FSETTEILVGLKTGDQ-KEIERFFPEDHSIKDLAGKKVRLNFSIKEVFSPL---PVVVNNDLAVRLGFESESAM 273 (473) Q Consensus 204 ~~~f~~~liG~k~Gd~-~~~~~~~P~d~~~~~laGk~v~f~v~v~~I~~~~---~pel~def~k~~~~~t~~el 273 (473) ++-+.=+|-++.+|++ +.+.++=|++ |..+.+.|.+.+|+-.. .-.++|.+-=.+.+.++.+. T Consensus 57 iEYlFL~lRakSVGE~~v~~~~~C~~c-------~~~~~~~idL~~i~v~~~~~~I~l~d~i~i~mkyP~~~~~ 123 (204) T pfam12322 57 REYLFLNLWAHSLGKVKVEVTFTCPDC-------GKEIKVPLNLDQIKLTDEKNPILLVGGIKIKFRYPKLFED 123 (204) T ss_pred HHHHHHHHHHHHCCCCEEEEEEECCCC-------CCEEEEEEEEEEEEECCCCCCEEECCCEEEEEECCCHHHH T ss_conf 999999999862576147788878999-------9588999996623603577726863976999967970577 No 94 >TIGR00495 crvDNA_42K DNA-binding protein, 42 kDa; InterPro: IPR004545 Proteins of this family have been identified in a number of species as a nuclear (but not nucleolar) protein with a cell cycle dependence. Various names given to members of this family have included cell cycle protein p38-2G4, DNA-binding protein GBP16, and proliferation-associated protein 1. The proteins are closely related to methionine aminopeptidase, a cobalt-binding protein. This group sequences are classified as non-peptidase homologues in MEROPS peptidase family M24 (clan MG). . Probab=21.25 E-value=37 Score=11.91 Aligned_cols=68 Identities=18% Similarity=0.252 Sum_probs=35.9 Q ss_pred CCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEECC--CCCCCC------------HHHHCCCCCCCCCCCCCCC----- Q ss_conf 332211113444566642653466654420365178--665631------------0342068878875433234----- Q gi|254780897|r 165 TESEIGDKVTVDYTVSVDNVILEDQSKKNVQFIVGS--ADLFSE------------TTEILVGLKTGDQKEIERF----- 225 (473) Q Consensus 165 ~~~~~gD~v~id~~~~~dg~~~~~~~~~~~~~~lg~--~~~~~~------------f~~~liG~k~Gd~~~~~~~----- 225 (473) +..+.||.|.||+-+-+||=. +.-.++++||. ..-+-| ++-+|==.|+|-+ ...+| T Consensus 106 ~~Lk~GD~VKIdLG~HiDGFi----A~vAhT~vig~~~~~~~tG~~ADvI~AA~~Al~aAlRlvkPG~~-n~~VT~~~~K 180 (407) T TIGR00495 106 LVLKEGDVVKIDLGCHIDGFI----ALVAHTIVIGVASEEPVTGRKADVIAAAHLALEAALRLVKPGKT-NTQVTEAIEK 180 (407) T ss_pred HHHHCCCEEEEECCCCEEEEE----ECCEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCC-CCHHHHHHHH T ss_conf 453338747981366440356----41507899851458765570678999999999988740488798-7147889999 Q ss_pred CCCCCCCCCCCC Q ss_conf 444444100038 Q gi|254780897|r 226 FPEDHSIKDLAG 237 (473) Q Consensus 226 ~P~d~~~~~laG 237 (473) ...-|..+-.+| T Consensus 181 vA~~Y~c~~v~G 192 (407) T TIGR00495 181 VADSYGCKPVEG 192 (407) T ss_pred HHHHCCCEEEEE T ss_conf 973249846532 No 95 >cd01807 GDX_N GDX contains an N-terminal ubiquitin-like domain as well as an uncharacterized c-terminal domain. The function of GDX is unknown. Probab=21.15 E-value=39 Score=11.66 Aligned_cols=27 Identities=19% Similarity=0.337 Sum_probs=12.2 Q ss_pred CCEEEEECCCCEEEEEEEECHHHHHHHHH Q ss_conf 91189972797699999854899999999 Q gi|254780897|r 1 MQVIEKFSEGLKRELDVIIPSNRLTDSFN 29 (473) Q Consensus 1 M~v~~~~~~~~~~~l~v~v~~~~~~~~~~ 29 (473) |+|+++.+.| +.+++.|+.++--..+. T Consensus 1 M~I~VKtL~G--k~~~~~v~~~~tV~~lK 27 (74) T cd01807 1 MFLTVKLLQG--RECSLQVSEKESVSTLK 27 (74) T ss_pred CEEEEEECCC--CEEEEEECCCCHHHHHH T ss_conf 9799990799--78999979969299999 No 96 >pfam07273 DUF1439 Protein of unknown function (DUF1439). This family consists of several hypothetical bacterial proteins of around 190 residues in length. Several members of this family are annotated as being putative lipoproteins and are often known as YceB. The function of this family is unknown. Probab=21.07 E-value=40 Score=11.65 Aligned_cols=12 Identities=25% Similarity=0.534 Sum_probs=6.7 Q ss_pred CCCHHHHHHHHH Q ss_conf 111157899999 Q gi|254780897|r 140 EVDEKEIDRQMA 151 (473) Q Consensus 140 ~vtd~~Id~~i~ 151 (473) .||+.+|+.++. T Consensus 25 sISE~Ein~yL~ 36 (177) T pfam07273 25 SISEAEINQYLQ 36 (177) T ss_pred EECHHHHHHHHH T ss_conf 266999999998 No 97 >pfam08349 DUF1722 Protein of unknown function (DUF1722). This domain of unknown function is found in bacteria and archaea and is homologous to the hypothetical protein ybgA from E. coli. Probab=21.01 E-value=37 Score=11.92 Aligned_cols=15 Identities=20% Similarity=0.153 Sum_probs=5.5 Q ss_pred HHHHHHCCCCCCHHH Q ss_conf 999864387428999 Q gi|254780897|r 365 GTIGEKNNIEVTEEE 379 (473) Q Consensus 365 ~~Ia~~e~I~vs~~E 379 (473) ..|..-.+=.+|.+| T Consensus 56 ~Hi~Gyfk~~ls~~E 70 (117) T pfam08349 56 QHIQGYFKKQLSSEE 70 (117) T ss_pred HHHHHHHHHHCCHHH T ss_conf 999999870189999 No 98 >PRK08960 hypothetical protein; Provisional Probab=21.00 E-value=40 Score=11.64 Aligned_cols=13 Identities=15% Similarity=0.169 Sum_probs=5.9 Q ss_pred HHHHHCCCCCCHH Q ss_conf 9986438742899 Q gi|254780897|r 366 TIGEKNNIEVTEE 378 (473) Q Consensus 366 ~Ia~~e~I~vs~~ 378 (473) .+.++.++-+.+- T Consensus 339 ~ll~~~gV~v~PG 351 (387) T PRK08960 339 HFLETEHVAFTPG 351 (387) T ss_pred HHHHHCCEEEECC T ss_conf 9998699999747 No 99 >PRK02929 L-arabinose isomerase; Provisional Probab=20.96 E-value=40 Score=11.64 Aligned_cols=39 Identities=13% Similarity=0.124 Sum_probs=20.9 Q ss_pred EEEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 05788762278776211135643100121111578999999764310 Q gi|254780897|r 112 LKLRLSYDILPQIEISSFDDLQVTQDICEVDEKEIDRQMAEIAKNNV 158 (473) Q Consensus 112 ~~~~~~~ev~Pei~l~~y~~i~i~~~~~~vtd~~Id~~i~~l~~~~~ 158 (473) +.|..+++.++-=+|. ...-.|+|++|+..++.....|. T Consensus 200 i~fG~sVn~~gvgdLv--------~~v~~V~d~~vd~l~~eY~~~Y~ 238 (499) T PRK02929 200 IKFGWSVNTWGVGDLV--------AVVNAVSDGDVDALVDEYESLYD 238 (499) T ss_pred EEEEEEEEEECHHHHH--------HHHHHCCHHHHHHHHHHHHHHCC T ss_conf 0101488116178999--------99984799999999999986153 No 100 >TIGR02104 pulA_typeI pullulanase, type I; InterPro: IPR011840 Pullulan is an unusual, industrially important polysaccharide in which short alpha-1,4 chains (maltotriose) are connected in alpha-1,6 linkages. Enzymes that cleave alpha-1,6 linkages in pullulan and release maltotriose are called pullulanases although pullulan itself may not be the natural substrate. This family consists of pullulanases related to the entries in IPR011838 from INTERPRO and IPR011839 from INTERPRO but having a different domain architecture with shorter sequences. Members are called type I pullulanases .. Probab=20.70 E-value=30 Score=12.60 Aligned_cols=41 Identities=12% Similarity=0.142 Sum_probs=17.9 Q ss_pred CCCHHHHHHHHHHHHHHCCCCHHHH--HHHHH--HCHHHHHHHHHHHHHH Q ss_conf 4289999999999998639998999--99987--0999899999999999 Q gi|254780897|r 374 EVTEEEMQSALYQQLGRFPGHEKKM--LDHFQ--KYPNALAELRAPIFED 419 (473) Q Consensus 374 ~vs~~Ei~~~i~~~~~~~~~~~~~~--~~~~~--~~~~~~~~i~~~ile~ 419 (473) ++|.+|= +.||.+..+|..+ +.|-. ...+..+++|.-|.-+ T Consensus 531 ~LT~~Dk-----ERAMNSYnSpD~iNQlDW~r~~~~~d~v~Y~k~LI~LR 575 (655) T TIGR02104 531 ELTHADK-----ERAMNSYNSPDSINQLDWDRKATFKDVVNYVKGLIALR 575 (655) T ss_pred CCCHHHH-----HHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 5474455-----76421035763000036566653256778899998886 No 101 >PRK02047 hypothetical protein; Provisional Probab=20.67 E-value=24 Score=13.45 Aligned_cols=43 Identities=16% Similarity=0.228 Sum_probs=25.0 Q ss_pred CHHHHHHHHHHHHHHHH-HCCCC---CCCCCCCCEEEEEEEEEECCC Q ss_conf 11578999999764310-00023---433221111344456664265 Q gi|254780897|r 142 DEKEIDRQMAEIAKNNV-AFEVK---ETESEIGDKVTVDYTVSVDNV 184 (473) Q Consensus 142 td~~Id~~i~~l~~~~~-~~~~~---~~~~~~gD~v~id~~~~~dg~ 184 (473) ...+..+.+-.+..+++ .+.+. .+++..|-++.+.+.....++ T Consensus 25 a~~~~~~~V~~iv~~h~p~~~~~~v~~r~S~~G~Y~Svtv~i~ats~ 71 (91) T PRK02047 25 AHPEFADTIFKVVSVHDPEFDLEKIEERPSSGGNYTGLTITVRATSR 71 (91) T ss_pred CCCCHHHHHHHHHHHHCCCCCCCCEEECCCCCCEEEEEEEEEEECCH T ss_conf 87107899999999977999803378714999868999999998999 No 102 >KOG0256 consensus Probab=20.48 E-value=24 Score=13.37 Aligned_cols=43 Identities=16% Similarity=0.219 Sum_probs=24.0 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHH-HHHHHHHHHHHHH Q ss_conf 9999999987507389858970488999986108-9999999999999 Q gi|254780897|r 27 SFNERIEDIRSKANIKGFRPGKVPLSHIKSLYGK-SILSETIDEIIKE 73 (473) Q Consensus 27 ~~~~~~~~~~~~~~ipGFRkGKvP~~ii~~~~g~-~i~~e~~~~li~~ 73 (473) .|-.-++.+.++ -|-+-|=|..||..-... +...+.+...+.+ T Consensus 53 ~yf~g~~~y~~n----py~~~~np~GiIQmGlaENqlc~DL~~~wl~k 96 (471) T KOG0256 53 SYFYGWKAYEKN----PYDPIKNPLGIIQMGLAENQLCFDLIESWLSK 96 (471) T ss_pred HHHHHHHHHCCC----CCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 578888860368----78877795068775056637889999999974 No 103 >PRK00907 hypothetical protein; Provisional Probab=20.39 E-value=24 Score=13.46 Aligned_cols=37 Identities=16% Similarity=0.151 Sum_probs=16.9 Q ss_pred CHHHHHHHHHHHHHHHHHCCC---C-CCCCCCCCEEEEEEE Q ss_conf 115789999997643100002---3-433221111344456 Q gi|254780897|r 142 DEKEIDRQMAEIAKNNVAFEV---K-ETESEIGDKVTVDYT 178 (473) Q Consensus 142 td~~Id~~i~~l~~~~~~~~~---~-~~~~~~gD~v~id~~ 178 (473) ....+..+|-.+..+++.--. + -|+...|-++-+... T Consensus 26 a~~~~~~~i~~iv~~~~~~~~~~~i~~r~SS~GkYisvTi~ 66 (92) T PRK00907 26 AERGLETELPRLLAATGVELVQERISWKHSSSGKYVSVRIG 66 (92) T ss_pred CCCCHHHHHHHHHHHHCCCCCHHHEEECCCCCCEEEEEEEE T ss_conf 86008999999999957754852567644789827999999 No 104 >PRK07683 aminotransferase A; Validated Probab=20.36 E-value=41 Score=11.54 Aligned_cols=13 Identities=15% Similarity=0.240 Sum_probs=6.4 Q ss_pred HHHHHCCCCCCHH Q ss_conf 9986438742899 Q gi|254780897|r 366 TIGEKNNIEVTEE 378 (473) Q Consensus 366 ~Ia~~e~I~vs~~ 378 (473) .++++.++-+.+- T Consensus 336 ~ll~~~gV~v~PG 348 (387) T PRK07683 336 DLVEEAGLAVVPG 348 (387) T ss_pred HHHHHCCEEEECC T ss_conf 9998399999878 No 105 >TIGR01083 nth endonuclease III; InterPro: IPR005759 The spectrum of DNA damage caused by reactive oxygen species includes a wide variety of modifications of purine and pyrimidine bases. Among these modified bases, 7,8-dihydro-8-oxoguanine (8-oxoG) is an important mutagenic lesion. Base excision repair is a critical mechanism for preventing mutations by removing the oxidative lesion from the DNA. Escherichia coli Nth protein (endonuclease III) has an 8-oxoG DNA glycosylase/AP lyase activity which removes 8-oxoG preferentially from 8-oxoG/G mispairs. Human hNTH1 protein, a homolog of E. coli Nth protein, has similar DNA glycosylase/AP lyase activity that removes 8-oxoG from 8-oxoG/G mispairs . ; GO: 0003906 DNA-(apurinic or apyrimidinic site) lyase activity, 0006284 base-excision repair, 0005622 intracellular. Probab=20.32 E-value=41 Score=11.54 Aligned_cols=53 Identities=11% Similarity=0.065 Sum_probs=34.8 Q ss_pred CCHHHHHHHC-C-----CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHH Q ss_conf 6747887502-3-----6557899999988789999777776699999999864003479999 Q gi|254780897|r 257 VNNDLAVRLG-F-----ESESAMRGLCSQKIKQHSEFLVRQKVKRQILDYISNKYTFDVPESL 313 (473) Q Consensus 257 l~def~k~~~-~-----~t~~elk~~ik~~l~~~~~~~~~~~~~~~i~~~L~~~~~~~lPe~l 313 (473) .|..||.... . -+.++|++.||. ---+..+.....++...|++.|..++|.++ T Consensus 48 aT~~LF~~Y~tp~~~a~a~~eel~~~Ik~----iGlYr~KAk~I~~~~~~LvE~y~GeVP~~~ 106 (192) T TIGR01083 48 ATKKLFEVYPTPQALAAAGLEELEEYIKS----IGLYRNKAKNIIALCRKLVERYGGEVPEDR 106 (192) T ss_pred CCHHHHHCCCCHHHHHCCCHHHHHHHHHC----CCCCHHHHHHHHHHHHHHHHHHCCCCCCCH T ss_conf 16786512778689960893134776422----586456899999999999998189877553 Done!