Query gi|254780868|ref|YP_003065281.1| birA bifunctional protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 252 No_of_seqs 159 out of 2775 Neff 8.1 Searched_HMMs 39220 Date Mon May 30 00:00:31 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780868.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK11886 biotin--protein ligas 100.0 0 0 432.5 23.2 234 12-250 76-313 (319) 2 PRK06955 biotin--protein ligas 100.0 0 0 413.5 23.9 238 11-250 30-291 (300) 3 COG0340 BirA Biotin-(acetyl-Co 100.0 0 0 405.9 21.4 229 19-251 1-234 (238) 4 PRK08330 biotin--protein ligas 100.0 0 0 396.8 21.6 224 11-251 8-232 (236) 5 PTZ00276 biotin/lipoate protei 100.0 0 0 383.8 20.7 230 11-251 4-245 (245) 6 PTZ00275 biotin-acetyl-CoA-car 100.0 0 0 368.6 22.6 237 11-251 13-286 (288) 7 PRK13325 bifunctional biotin-- 100.0 0 0 350.5 17.5 241 4-250 72-320 (592) 8 PRK08477 biotin--protein ligas 100.0 0 0 327.6 19.7 200 14-236 4-206 (214) 9 PRK05935 biotin--protein ligas 100.0 0 0 309.9 13.7 179 17-203 6-189 (190) 10 KOG1536 consensus 100.0 2.8E-37 7.1E-42 251.9 18.6 219 13-240 385-620 (649) 11 TIGR00121 birA_ligase biotin-[ 100.0 4.9E-37 1.2E-41 250.3 17.0 233 15-251 1-246 (248) 12 pfam03099 BPL_LplA_LipB Biotin 99.9 9.3E-26 2.4E-30 180.0 7.9 119 19-145 2-124 (124) 13 pfam02237 BPL_C Biotin protein 98.5 5.5E-07 1.4E-11 63.1 6.8 45 206-251 1-45 (47) 14 PRK03822 lplA lipoate-protein 95.6 0.14 3.5E-06 29.5 8.7 179 56-251 75-286 (338) 15 COG0095 LplA Lipoate-protein l 95.4 0.071 1.8E-06 31.2 6.7 134 42-185 66-215 (248) 16 PRK00008 lipB lipoyltransferas 95.2 0.097 2.5E-06 30.4 6.8 124 54-186 69-204 (212) 17 COG0321 LipB Lipoate-protein l 90.7 1.2 3E-05 23.6 6.5 127 53-186 76-212 (221) 18 pfam04017 DUF366 Domain of unk 90.6 0.63 1.6E-05 25.3 5.1 91 84-186 83-173 (183) 19 COG2029 Uncharacterized conser 87.7 1.6 4.1E-05 22.8 5.4 116 63-189 63-180 (189) 20 cd01728 LSm1 The eukaryotic Sm 86.4 2.5 6.3E-05 21.6 6.1 46 203-251 7-53 (74) 21 cd01717 Sm_B The eukaryotic Sm 75.0 6.2 0.00016 19.1 5.3 37 202-238 4-41 (79) 22 smart00651 Sm snRNP Sm protein 74.3 6.5 0.00017 19.0 5.0 35 204-238 4-39 (67) 23 cd01727 LSm8 The eukaryotic Sm 70.0 8.1 0.00021 18.4 5.7 41 204-247 5-46 (74) 24 cd01731 archaeal_Sm1 The archa 68.1 8.9 0.00023 18.1 4.9 34 205-238 7-41 (68) 25 cd01719 Sm_G The eukaryotic Sm 67.8 9 0.00023 18.1 5.1 35 204-238 6-41 (72) 26 cd01734 YlxS_C YxlS is a Bacil 62.6 11 0.00029 17.5 4.9 59 191-250 8-74 (83) 27 cd06168 LSm9 The eukaryotic Sm 62.2 12 0.00029 17.4 5.4 36 203-238 5-41 (75) 28 PRK09618 flgD flagellar basal 60.7 12 0.00031 17.3 8.9 42 201-242 88-132 (146) 29 pfam01423 LSM LSM domain. The 59.5 13 0.00033 17.2 5.2 34 205-238 5-39 (66) 30 cd01726 LSm6 The eukaryotic Sm 59.1 13 0.00033 17.1 4.8 46 205-251 7-57 (67) 31 cd01729 LSm7 The eukaryotic Sm 58.8 13 0.00034 17.1 4.8 37 202-238 6-43 (81) 32 PRK00092 hypothetical protein; 58.2 14 0.00035 17.0 5.4 60 190-250 77-142 (153) 33 PRK00737 small nuclear ribonuc 54.8 15 0.00039 16.7 4.9 34 205-238 11-45 (72) 34 TIGR01221 rmlC dTDP-4-dehydror 54.2 12 0.00032 17.2 2.8 54 194-251 24-82 (181) 35 cd01722 Sm_F The eukaryotic Sm 53.8 16 0.00041 16.6 5.1 33 205-237 8-41 (68) 36 cd00600 Sm_like The eukaryotic 53.4 16 0.00041 16.5 4.8 33 206-238 4-37 (63) 37 PRK02001 hypothetical protein; 48.8 19 0.00049 16.1 3.6 44 191-234 73-117 (154) 38 pfam10842 DUF2642 Protein of u 45.7 21 0.00055 15.8 5.2 39 200-239 13-51 (66) 39 PRK02627 acetylornithine amino 43.7 23 0.00059 15.6 4.8 19 15-34 98-117 (398) 40 pfam05899 Cupin_3 Protein of u 42.3 17 0.00042 16.5 1.9 22 230-251 33-54 (74) 41 cd01732 LSm5 The eukaryotic Sm 42.0 24 0.00062 15.4 4.3 34 205-238 10-44 (76) 42 pfam00202 Aminotran_3 Aminotra 41.6 25 0.00063 15.4 4.6 26 10-35 72-98 (338) 43 pfam01287 eIF-5a Eukaryotic el 39.9 26 0.00067 15.2 2.9 21 222-242 7-27 (69) 44 TIGR00484 EF-G translation elo 38.6 10 0.00026 17.8 0.4 15 47-61 508-522 (705) 45 PRK06009 flgD flagellar basal 38.3 28 0.00071 15.1 8.5 46 201-248 85-131 (135) 46 TIGR00963 secA preprotein tran 34.5 16 0.00042 16.5 0.9 14 84-97 373-386 (904) 47 COG4189 Predicted transcriptio 32.9 17 0.00044 16.4 0.8 49 106-154 158-216 (308) 48 PRK04950 putative solute/DNA c 32.5 35 0.00088 14.5 5.2 44 203-247 164-207 (212) 49 cd01730 LSm3 The eukaryotic Sm 31.6 36 0.00091 14.4 4.7 33 206-238 9-42 (82) 50 cd01723 LSm4 The eukaryotic Sm 31.6 36 0.00091 14.4 5.2 46 205-251 8-59 (76) 51 COG0779 Uncharacterized protei 31.2 36 0.00092 14.3 5.9 58 191-250 82-144 (153) 52 CHL00058 petD cytochrome b6/f 30.5 13 0.00032 17.2 -0.3 20 102-124 27-47 (160) 53 COG5414 TATA-binding protein-a 28.6 40 0.001 14.1 2.6 35 91-127 88-126 (392) 54 KOG1404 consensus 28.3 30 0.00075 14.9 1.3 45 4-49 99-144 (442) 55 PRK10595 SOS cell division inh 27.9 30 0.00076 14.9 1.3 65 51-118 56-125 (167) 56 cd05845 Ig2_L1-CAM_like Second 27.7 42 0.0011 14.0 2.7 26 54-79 48-73 (95) 57 cd01724 Sm_D1 The eukaryotic S 27.7 42 0.0011 14.0 4.8 32 206-237 9-41 (90) 58 pfam05614 DUF782 Circovirus pr 27.4 24 0.0006 15.5 0.7 10 108-117 31-41 (104) 59 TIGR01156 cytb6/f_IV cytb6/f c 26.3 11 0.00029 17.5 -1.1 12 102-116 27-38 (160) 60 COG1958 LSM1 Small nuclear rib 24.4 48 0.0012 13.6 5.8 34 205-238 14-48 (79) 61 KOG1486 consensus 23.6 50 0.0013 13.5 2.8 102 9-126 69-189 (364) 62 COG4466 Veg Uncharacterized pr 23.3 50 0.0013 13.5 6.2 56 196-252 7-71 (80) 63 TIGR03499 FlhF flagellar biosy 23.0 51 0.0013 13.4 3.0 38 23-60 8-45 (282) 64 pfam02576 DUF150 Uncharacteriz 22.9 51 0.0013 13.4 5.8 48 191-239 70-122 (141) 65 TIGR02295 HpaD 3,4-dihydroxyph 22.6 52 0.0013 13.4 2.3 28 225-252 249-276 (312) 66 PRK13476 cytochrome b6-f compl 22.1 21 0.00053 15.9 -0.4 13 102-117 27-39 (160) 67 PRK01194 V-type ATP synthase s 22.0 53 0.0014 13.3 5.8 75 169-243 78-159 (185) 68 PRK07481 hypothetical protein; 21.8 54 0.0014 13.3 3.3 31 4-34 94-125 (448) 69 KOG3159 consensus 21.4 55 0.0014 13.2 3.1 77 42-125 43-146 (336) 70 PRK08555 consensus 21.2 55 0.0014 13.2 3.1 30 10-39 108-139 (443) 71 TIGR01570 A_thal_3588 uncharac 21.2 32 0.00082 14.7 0.4 34 109-144 70-110 (172) 72 KOG3703 consensus 20.7 57 0.0014 13.1 2.3 81 20-100 444-545 (873) No 1 >PRK11886 biotin--protein ligase; Provisional Probab=100.00 E-value=0 Score=432.47 Aligned_cols=234 Identities=31% Similarity=0.441 Sum_probs=212.6 Q ss_pred CCEEEEEEECCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCHHCCCHHHHHHH Q ss_conf 97049998164856899999997399997899982446783679985316633-33413223532240002101377999 Q gi|254780868|r 12 YSFRYEFFDTISSTNDECMKRALSGDLGNLWIVASCQTAGRGRRDNKWISDKG-NLYASLLLIDSISKDSLTLLSFAIAV 90 (252) Q Consensus 12 ~~~~i~~~~~~~STn~~~~~~~~~~~~~~~~v~A~~Qt~GrGr~gr~W~s~~G-~l~~S~~~~~~~~~~~~~~l~~~~~~ 90 (252) .++++++|+++||||++|++++. +.++|++|+|++||+||||+||+|+||+| |||||++++|+.++..++.+++++|+ T Consensus 76 ~~~~i~~~~~idSTN~~a~~~~~-~~~~gtvviAe~QT~GRGR~GR~W~Sp~g~~Ly~Si~l~p~~~~~~~~~Ltl~~av 154 (319) T PRK11886 76 DLGRVFVLPVIDSTNTYLLDRIA-ELKSGDLCLAEYQTAGRGRRGRQWVSPFGGNLYLSLYWRLNQGMAQAMGLSLVVGI 154 (319) T ss_pred CCCCEEEEECCCCHHHHHHHHHH-CCCCCEEEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHH T ss_conf 79858998046858999998861-89998599988647688999790669899988999897679897887563699999 Q ss_pred HHHHHHHHHCCCCCEEEEEECCCEEEECCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCC--C-CCCCCCCCCCCCCCCC Q ss_conf 999999974454210246214650671022112222332322583267652154444464--2-2336532100236778 Q gi|254780868|r 91 AMRSVIASTLPVGTDVKIKWPNDILVFQRKIAGILIETLNLKNGLQAVVVGIGLNVKHCP--V-DTPYPVTSLQREGGCI 167 (252) Q Consensus 91 ai~~~l~~~~~~~~~~~iKWPNDi~~~~kKi~GIL~E~~~~~~~~~~~viGiGiNi~~~~--~-~~~~~~tsL~~~~~~~ 167 (252) |++++++.+. +.+++||||||||+++|||||||+|+....+...++|||||||++..+ . ..+.++|++.+.+..+ T Consensus 155 Av~~al~~~~--~~~~~IKWPNDIli~gkKi~GIL~E~~~e~~~~~~vViGIGINvn~~~~~~~~~~~a~s~l~~~g~~~ 232 (319) T PRK11886 155 AVAEALRRLG--AIDVGLKWPNDIYLNGRKLAGILVELTGETGDAAHVVIGIGINVAMPDFPEELIDQPWSDLQEAGPTI 232 (319) T ss_pred HHHHHHHHHC--CCCCEEECCCCCCCCCCEEEEEEEEEECCCCCCEEEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 9999999835--87621337963022796289980111026788417999977861588885334203233488717989 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCEEEEEEEEEECCCCCEEEECCCCCEEEEEE Q ss_conf 97899987779999999987303556488999998752389779999799389999998769874899969998999996 Q gi|254780868|r 168 DLKDIFSLLFQDVARVLDLWKKDTGREEIMNLWRCFACGIGDLITIKLSYGSILGRFVGVDDFGYLLLEEKKGCVRQIFT 247 (252) Q Consensus 168 ~~~~l~~~l~~~l~~~~~~~~~~~~~~~i~~~~~~~~~~~gk~V~i~~~~~~~~G~~~gId~~G~L~i~~~~G~~~~i~s 247 (252) +++++++.++.++...+..|.+ .+|++++++|++++.++||.|++..+++.++|++.|||++|+|+|++++| ++.|+| T Consensus 233 ~r~~ll~~ll~~l~~~~~~~~~-~g~~~~~~~w~~~~~~~Gk~V~v~~~~~~~~G~a~gId~~G~Liv~~~~g-~~~i~s 310 (319) T PRK11886 233 DRNQLAAELWKQLRAALELFEQ-EGLAPFLERWKKLDLFLGREVKLIIGQKEIFGIARGIDEQGALLLETDGG-EKPFNG 310 (319) T ss_pred CHHHHHHHHHHHHHHHHHHHHH-CCHHHHHHHHHHHHCCCCCEEEEEECCEEEEEEEEEECCCCEEEEEECCC-EEEEEE T ss_conf 9999999999999999999985-49899999999850716987999989989999998899998099998996-899998 Q ss_pred EEE Q ss_conf 311 Q gi|254780868|r 248 GDI 250 (252) Q Consensus 248 GeI 250 (252) ||| T Consensus 311 GEV 313 (319) T PRK11886 311 GEI 313 (319) T ss_pred EEE T ss_conf 889 No 2 >PRK06955 biotin--protein ligase; Provisional Probab=100.00 E-value=0 Score=413.47 Aligned_cols=238 Identities=30% Similarity=0.443 Sum_probs=206.8 Q ss_pred CCCEEEEEEECCCCHHHHHHHHHHC------CCCCCEEEEECCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCHHCCC Q ss_conf 9970499981648568999999973------99997899982446783679985316633-3341322353224000210 Q gi|254780868|r 11 PYSFRYEFFDTISSTNDECMKRALS------GDLGNLWIVASCQTAGRGRRDNKWISDKG-NLYASLLLIDSISKDSLTL 83 (252) Q Consensus 11 p~~~~i~~~~~~~STn~~~~~~~~~------~~~~~~~v~A~~Qt~GrGr~gr~W~s~~G-~l~~S~~~~~~~~~~~~~~ 83 (252) ..+|++++++++||||+++++.++. +.++|++|+|++||+||||+||.|+||+| +||||++++++.+...+++ T Consensus 30 ~~~~~~~v~~~~~STN~~l~~~~~~~~~~~~~~~~g~vvvAe~QTaGRGR~GR~W~Sp~g~~L~~Sl~l~~~~~~~~~~~ 109 (300) T PRK06955 30 ARAWPLEIVEETGSTNADLMARLKALPRSADALPAPIVRVAYEQTAGRGRQGRPWFAQPGNALLFSVACVLPRPVAALAG 109 (300) T ss_pred CCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCHHHHHH T ss_conf 68855999946587599999987633234458997659998854668899979054989998899998459989677522 Q ss_pred HHHHHHHHHHHHHHHHCC-CCCEEEEEECCCEEEECCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCC---C----- Q ss_conf 137799999999997445-4210246214650671022112222332322583267652154444464223---3----- Q gi|254780868|r 84 LSFAIAVAMRSVIASTLP-VGTDVKIKWPNDILVFQRKIAGILIETLNLKNGLQAVVVGIGLNVKHCPVDT---P----- 154 (252) Q Consensus 84 l~~~~~~ai~~~l~~~~~-~~~~~~iKWPNDi~~~~kKi~GIL~E~~~~~~~~~~~viGiGiNi~~~~~~~---~----- 154 (252) +|+++|+|++++++.+.. .+.+++||||||||+++||+||||+|+....++..++|||||||++..+... + T Consensus 110 Lsl~~glAv~~al~~~~~~~~~~v~LKWPNDIlv~gkKlaGIL~E~~~~~~~~~~vVIGIGINv~~~~~~~~~~~~~~~~ 189 (300) T PRK06955 110 LSLAVGVALAEALAALPAALGGRIALKWPNDLLIAGRKLAGILIETVWATPDATAVVIGIGLNVRRADALAAAVGALRAG 189 (300) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCEEEECCCHHEECCCEEEEEEEEEECCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCC T ss_conf 69999999999999865204861001177442466834368853421157886189999999617786421111321235 Q ss_pred -------CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEEEECC-EEEEEEEEE Q ss_conf -------653210023677897899987779999999987303556488999998752389779999799-389999998 Q gi|254780868|r 155 -------YPVTSLQREGGCIDLKDIFSLLFQDVARVLDLWKKDTGREEIMNLWRCFACGIGDLITIKLSY-GSILGRFVG 226 (252) Q Consensus 155 -------~~~tsL~~~~~~~~~~~l~~~l~~~l~~~~~~~~~~~~~~~i~~~~~~~~~~~gk~V~i~~~~-~~~~G~~~g 226 (252) .+.+++.......++++++..++.++...+..|.. .+|+.++++|++++.++||+|++..++ +.++|++.| T Consensus 190 ~~~~~~~~~~~~~~~~~~~~~~~~ll~~ll~~l~~~~~~~~~-~g~~~~~~~w~~~~~~~G~~V~v~~~g~~~~~G~a~G 268 (300) T PRK06955 190 VATLASGLPPAALSAACAGANLTDTLAAALNALAPALAAFGA-DGLAPFAARWHALHAYAGREVVLLEDGVELARGVARG 268 (300) T ss_pred CCHHCCCCCHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH-CCCHHHHHHHHHHCCCCCCEEEEEECCCEEEEEEEEE T ss_conf 400025677055553025489999999999999999999885-6819999999985032798699997898089999988 Q ss_pred ECCCCCEEEECCCCCEEEEEEEEE Q ss_conf 769874899969998999996311 Q gi|254780868|r 227 VDDFGYLLLEEKKGCVRQIFTGDI 250 (252) Q Consensus 227 Id~~G~L~i~~~~G~~~~i~sGeI 250 (252) ||++|+|+|++++| ++.|++||| T Consensus 269 Id~~G~L~v~t~~G-~~~i~sGEV 291 (300) T PRK06955 269 IDETGQLLLDTPAG-RQAIAAGDV 291 (300) T ss_pred ECCCCEEEEEECCC-EEEEEEEEE T ss_conf 99997699998996-899998999 No 3 >COG0340 BirA Biotin-(acetyl-CoA carboxylase) ligase [Coenzyme metabolism] Probab=100.00 E-value=0 Score=405.87 Aligned_cols=229 Identities=42% Similarity=0.667 Sum_probs=214.1 Q ss_pred EECCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCHHCCCHHHHHHHHHHHHHH Q ss_conf 8164856899999997399997899982446783679985316633-334132235322400021013779999999999 Q gi|254780868|r 19 FDTISSTNDECMKRALSGDLGNLWIVASCQTAGRGRRDNKWISDKG-NLYASLLLIDSISKDSLTLLSFAIAVAMRSVIA 97 (252) Q Consensus 19 ~~~~~STn~~~~~~~~~~~~~~~~v~A~~Qt~GrGr~gr~W~s~~G-~l~~S~~~~~~~~~~~~~~l~~~~~~ai~~~l~ 97 (252) +++++|||++|++++.++.+.+++|+|++||+||||+||.|+||+| +||||++++++.+...++.+++++|+|++++++ T Consensus 1 ~~~i~STn~~a~~~~~~~~~~~~vvvA~~QTaGRGR~GR~W~Sp~G~~l~~S~~l~~~~~~~~~~~lsl~~g~av~~al~ 80 (238) T COG0340 1 FDEIDSTNTEAKERAASGAPEGTVVVAEEQTAGRGRRGRKWSSPKGGGLYMSLLLRPDLPPAELPSLSLVAGLAVAEALR 80 (238) T ss_pred CCCCCHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 98254089999999982899885899851046758899955298998879999987888767635668999999999999 Q ss_pred HHCCCCCEEEEEECCCEEEECCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCC---CCCCCCCCCCCCCCCCHHHHHH Q ss_conf 7445421024621465067102211222233232258326765215444446422---3365321002367789789998 Q gi|254780868|r 98 STLPVGTDVKIKWPNDILVFQRKIAGILIETLNLKNGLQAVVVGIGLNVKHCPVD---TPYPVTSLQREGGCIDLKDIFS 174 (252) Q Consensus 98 ~~~~~~~~~~iKWPNDi~~~~kKi~GIL~E~~~~~~~~~~~viGiGiNi~~~~~~---~~~~~tsL~~~~~~~~~~~l~~ 174 (252) ++. .+++||||||||+++||+||||+|+....+...++|||||||+++.|++ +.+++|||+++...++++++++ T Consensus 81 ~~~---~~~~iKWPNDv~~~~kKl~GIL~E~~~~~~~~~~~viGIGINv~~~~~~~~~i~~~atsL~~~~~~~~r~~l~~ 157 (238) T COG0340 81 KFG---IDVRIKWPNDVLLNGKKLAGILVEAVGDENGLLAVVVGIGINVNNPPPDFEDIGQPATSLQEEGEEIDREELLA 157 (238) T ss_pred HHC---CCCCEECCCEEEECCCEEEEEEEEEEECCCCCEEEEEEEEEEECCCCCCHHHHCCHHHHHHHHCCCCCHHHHHH T ss_conf 838---67666088623688960799999874157885489999976006688521122403666776306799999999 Q ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCEEEEE-EEEEECCCCCEEEECCCCCEEEEEEEEEC Q ss_conf 777999999998730355648899999875238977999979938999-99987698748999699989999963110 Q gi|254780868|r 175 LLFQDVARVLDLWKKDTGREEIMNLWRCFACGIGDLITIKLSYGSILG-RFVGVDDFGYLLLEEKKGCVRQIFTGDIF 251 (252) Q Consensus 175 ~l~~~l~~~~~~~~~~~~~~~i~~~~~~~~~~~gk~V~i~~~~~~~~G-~~~gId~~G~L~i~~~~G~~~~i~sGeI~ 251 (252) .++.++..++..|.+. ++.+++..|++++.++|+.|++..+++.+.| ++.|||++|+|++++++|.++.+.+|||+ T Consensus 158 ~ll~~l~~~~~~~~~~-~~~~i~~~~~~~~~~~g~~V~~~~~~~~~~gg~a~~id~~G~L~l~~~~g~~~~~~~Gev~ 234 (238) T COG0340 158 KLLKELEKYLRQFLRE-GFAPILARWRARSLSLGKEVRLTLGGGVIFGGIAKGIDEDGALLLETDDGEVQTIYSGEVS 234 (238) T ss_pred HHHHHHHHHHHHHHHC-CHHHHHHHHHHHHCCCCCEEEEEECCCEEEEEEEEEECCCCEEEEEECCCEEEEEECEEEE T ss_conf 9999999999998644-7057999999975537986999958960875399988789619999589708997331899 No 4 >PRK08330 biotin--protein ligase; Provisional Probab=100.00 E-value=0 Score=396.77 Aligned_cols=224 Identities=31% Similarity=0.446 Sum_probs=198.1 Q ss_pred CCCEEEEEEECCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCHHHHHHH Q ss_conf 99704999816485689999999739999789998244678367998531663333413223532240002101377999 Q gi|254780868|r 11 PYSFRYEFFDTISSTNDECMKRALSGDLGNLWIVASCQTAGRGRRDNKWISDKGNLYASLLLIDSISKDSLTLLSFAIAV 90 (252) Q Consensus 11 p~~~~i~~~~~~~STn~~~~~~~~~~~~~~~~v~A~~Qt~GrGr~gr~W~s~~G~l~~S~~~~~~~~~~~~~~l~~~~~~ 90 (252) -.+.+++|||++||||++|++... .++++|+|++||+||||+||+|+||+||||||++++++.+...++.+++++++ T Consensus 8 ~~G~~i~~~d~v~STN~~ak~~~~---~~g~vv~A~~QT~GRGR~gr~W~Sp~G~l~~S~~l~~~~~~~~~~~l~~~~~l 84 (236) T PRK08330 8 IIGRNVIYFQEVDSTNEYAKLLNL---EEGTVIVADRQTMGHGRKGRKWESPEGGLWMSVVLKPKVSQKDLPKLVFLGAL 84 (236) T ss_pred HHCCEEEEECCCCCHHHHHHHHCC---CCCCEEEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHH T ss_conf 745739997867888999987478---89949998901469899969865999986899993368897888899999999 Q ss_pred HHHHHHHHHCCCCCEEEEEECCCEEEECCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCC-CCCCCCH Q ss_conf 999999974454210246214650671022112222332322583267652154444464223365321002-3677897 Q gi|254780868|r 91 AMRSVIASTLPVGTDVKIKWPNDILVFQRKIAGILIETLNLKNGLQAVVVGIGLNVKHCPVDTPYPVTSLQR-EGGCIDL 169 (252) Q Consensus 91 ai~~~l~~~~~~~~~~~iKWPNDi~~~~kKi~GIL~E~~~~~~~~~~~viGiGiNi~~~~~~~~~~~tsL~~-~~~~~~~ 169 (252) |++++++++ +.+++||||||||+++||+||||+|... ..+|||||||+++. .|..+||+.. .+..+++ T Consensus 85 av~~al~~~---~~~~~iKWPNDIl~~~kKl~GILiE~~~-----~~vviGIGiNvn~~---~p~~atsl~~~~~~~~~~ 153 (236) T PRK08330 85 AVVDTLREF---SIDARIKWPNDVLVNYRKIAGVLVEGKG-----DFVILGIGLNVNNE---APDGATSMKEELGSEVPL 153 (236) T ss_pred HHHHHHHHH---CCCCCEECCCCEEECCCEEEEEEEEEEC-----CEEEEEEEEECCCC---CCCHHHHHHHHHCCCCCH T ss_conf 999999983---9886376785031579779999784038-----97999998955987---994063799986899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCEEEEEEEEEECCCCCEEEECCCCCEEEEEEEE Q ss_conf 89998777999999998730355648899999875238977999979938999999876987489996999899999631 Q gi|254780868|r 170 KDIFSLLFQDVARVLDLWKKDTGREEIMNLWRCFACGIGDLITIKLSYGSILGRFVGVDDFGYLLLEEKKGCVRQIFTGD 249 (252) Q Consensus 170 ~~l~~~l~~~l~~~~~~~~~~~~~~~i~~~~~~~~~~~gk~V~i~~~~~~~~G~~~gId~~G~L~i~~~~G~~~~i~sGe 249 (252) .++++.++..+..++..|.++ + ..+.+|.+++.++|+.|++..+++.++|++.|||++|+|+|++++|+++.|++|| T Consensus 154 ~~l~~~l~~~l~~~~~~~~~~-~--~~~~~~~~~~~~~g~~v~v~~~~~~~~G~~~gId~~G~L~v~~~~G~~~~i~sGe 230 (236) T PRK08330 154 LEVFRALITNLDRLYKTFLEN-P--LVILELAKRSMILGVEVKILGDGEDLEGIAEDIDDDGRLILRLDDGRVKKVVYGD 230 (236) T ss_pred HHHHHHHHHHHHHHHHHHHCC-C--HHHHHHHHHHHHCCCEEEEEECCCEEEEEEEEECCCCEEEEEECCCCEEEEEEEE T ss_conf 999999999999999998729-4--6699999999766986999959959999999999998699997999999999887 Q ss_pred EC Q ss_conf 10 Q gi|254780868|r 250 IF 251 (252) Q Consensus 250 I~ 251 (252) |- T Consensus 231 Vs 232 (236) T PRK08330 231 VS 232 (236) T ss_pred EE T ss_conf 99 No 5 >PTZ00276 biotin/lipoate protein ligase; Provisional Probab=100.00 E-value=0 Score=383.83 Aligned_cols=230 Identities=23% Similarity=0.401 Sum_probs=180.3 Q ss_pred CCCEEEEEEECCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCHHCCCHHHHHH Q ss_conf 99704999816485689999999739999789998244678367998531663333413223532-24000210137799 Q gi|254780868|r 11 PYSFRYEFFDTISSTNDECMKRALSGDLGNLWIVASCQTAGRGRRDNKWISDKGNLYASLLLIDS-ISKDSLTLLSFAIA 89 (252) Q Consensus 11 p~~~~i~~~~~~~STn~~~~~~~~~~~~~~~~v~A~~Qt~GrGr~gr~W~s~~G~l~~S~~~~~~-~~~~~~~~l~~~~~ 89 (252) ..|++|+|||++||||++||+++.++.++|++|+|++||+||||+||+|+||+|++|||++++++ .+.+..+.++++++ T Consensus 4 ~~p~~i~~~d~v~STN~~ak~~a~~g~~~g~vvvAe~QTaGRGR~GR~W~SP~G~l~~Si~l~~~~~~~~~~~~ltl~aa 83 (245) T PTZ00276 4 DVPPNIHFVGEVTSTMDVARTMLAAAGGKPFAVLAESQTAGRGTGGRTWTSPKGNMYFTLCIPQKGVPPELVPVLPLITG 83 (245) T ss_pred CCCCCEEEEECCCCHHHHHHHHHHCCCCCCEEEEECHHCCCCCCCCCEEECCCCCCEEEEEECCCCCCHHHHHHHHHHHH T ss_conf 88872799923728999999999638999659998831168899989703899981999999878899779889999999 Q ss_pred HHHHHHHHHHCCCCCEEEEEECCCEEEECCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCC--CCCCCCCCCCCC---- Q ss_conf 99999999744542102462146506710221122223323225832676521544444642--233653210023---- Q gi|254780868|r 90 VAMRSVIASTLPVGTDVKIKWPNDILVFQRKIAGILIETLNLKNGLQAVVVGIGLNVKHCPV--DTPYPVTSLQRE---- 163 (252) Q Consensus 90 ~ai~~~l~~~~~~~~~~~iKWPNDi~~~~kKi~GIL~E~~~~~~~~~~~viGiGiNi~~~~~--~~~~~~tsL~~~---- 163 (252) +|+++++.+... +.+++||||||||+++||+||||+|+.. .++|||||||++..|+ +...+++++... T Consensus 84 lAv~~ai~~~~~-g~~~~IKWPNDil~~gkKv~GIL~E~~~-----~~vVvGIGINv~~~~~~~~~~~~~~~~~~~~~~~ 157 (245) T PTZ00276 84 LACRAAIMEVLH-GAAVHTKWPNDIIYAGKKIGGSLIESEG-----EYLIIGIGMNIEVAPPVTDAGRESTMVNEIAEDL 157 (245) T ss_pred HHHHHHHHHHCC-CCCCEEECCCCEEECCCEEEEEEEECCC-----CEEEEEEEEEECCCCCCCCCCCHHHHHHHHHHHH T ss_conf 999999998714-8750341885067879608999795259-----9799999887548998643332124578888872 Q ss_pred C-CCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEE----EECCEEEEEEEEEECCCCCEEEECC Q ss_conf 6-7789789998777999999998730355648899999875238977999----9799389999998769874899969 Q gi|254780868|r 164 G-GCIDLKDIFSLLFQDVARVLDLWKKDTGREEIMNLWRCFACGIGDLITI----KLSYGSILGRFVGVDDFGYLLLEEK 238 (252) Q Consensus 164 ~-~~~~~~~l~~~l~~~l~~~~~~~~~~~~~~~i~~~~~~~~~~~gk~V~i----~~~~~~~~G~~~gId~~G~L~i~~~ 238 (252) + ..+.+..+.+.+..++...+... ......+.+.|.. .+++.+.+ ....+..+|++.|||++|+|+|+++ T Consensus 158 ~~~~~~~~~l~~~l~~~~~~~~~~~--~~~~~~~~~~~~~---~~~~~~~~~~~~~~g~~~~~g~a~~Id~~G~LiVr~~ 232 (245) T PTZ00276 158 GVKSVTPQDLAEAVWKHFFDICSDP--ELTREILIESFDA---AMDKSLKLHKRTPTGRDPEELTALSLNEWGHLIVRRP 232 (245) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHCH--HHHHHHHHHHHHH---HHCCEEEEEEEECCCCCEEEEEEEEECCCCEEEEECC T ss_conf 7665779999999999999986120--1206789999999---8527578868982577517899999999943999949 Q ss_pred CCCEEEEEEEEEC Q ss_conf 9989999963110 Q gi|254780868|r 239 KGCVRQIFTGDIF 251 (252) Q Consensus 239 ~G~~~~i~sGeI~ 251 (252) ||+++.++|+||| T Consensus 233 dG~~~~v~a~~~f 245 (245) T PTZ00276 233 DGTEEDLMAEYLF 245 (245) T ss_pred CCCEEEEEECCCC T ss_conf 9989999721009 No 6 >PTZ00275 biotin-acetyl-CoA-carboxylase ligase; Provisional Probab=100.00 E-value=0 Score=368.61 Aligned_cols=237 Identities=25% Similarity=0.371 Sum_probs=201.0 Q ss_pred CCCEEEEEEECCCCHHHHHHHHHH----CC----CCCCEEEEECCCCCCCC------CCCCCCCCCCCCCCCCCC-CCCC Q ss_conf 997049998164856899999997----39----99978999824467836------799853166333341322-3532 Q gi|254780868|r 11 PYSFRYEFFDTISSTNDECMKRAL----SG----DLGNLWIVASCQTAGRG------RRDNKWISDKGNLYASLL-LIDS 75 (252) Q Consensus 11 p~~~~i~~~~~~~STn~~~~~~~~----~~----~~~~~~v~A~~Qt~GrG------r~gr~W~s~~G~l~~S~~-~~~~ 75 (252) ..+-++.|||++||||.||++.+. ++ +.++++|+|++||+||| |.||.|+||+|++++|++ ++|. T Consensus 13 ~~~~~~~~Fd~iDSTq~~~~~n~~~~~~~~~l~~~~~~tvI~a~~QT~GrG~r~~k~r~~R~W~S~~Gn~~~s~ill~p~ 92 (288) T PTZ00275 13 ELGALRIHFDVLDSTQLFCKRNKAFFIKNGKLKDDKHMIIISCNEQTNGIGTRDTKKKMDRMWLSDKGNLFISFIFLWKK 92 (288) T ss_pred HHCCEEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCEEECCCCCEEEEEEECCCC T ss_conf 42856999753264779999989988753766887873999987513788866655667976477999879999982588 Q ss_pred CCCHHCCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCCEEEECCCCCCCCCCCCCCCC--------CCEEEEEECCCCCC Q ss_conf 2400021013779999999999744542102462146506710221122223323225--------83267652154444 Q gi|254780868|r 76 ISKDSLTLLSFAIAVAMRSVIASTLPVGTDVKIKWPNDILVFQRKIAGILIETLNLKN--------GLQAVVVGIGLNVK 147 (252) Q Consensus 76 ~~~~~~~~l~~~~~~ai~~~l~~~~~~~~~~~iKWPNDi~~~~kKi~GIL~E~~~~~~--------~~~~~viGiGiNi~ 147 (252) .+....+.+++++++|++++|+.+ +.+++||||||||+++||+||||+|+....+ ...++|+|||||++ T Consensus 93 ~~~~~~~~lt~~~a~av~~~l~~~---~i~~~ikwpnd~~~~~kk~~gil~e~~~~~d~~~~~~~~~~~~viiGIGINv~ 169 (288) T PTZ00275 93 EDLEKIKCLAQTCTVAISKTIEHF---HLRTQIKWINDVLIDEKKIAGCLVHLHYLDDFHFTDILSNYACIICGIGINIE 169 (288) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHC---CCCCCCCCCCEEEECCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEEEC T ss_conf 876651357999999999999983---98533327840578684688998652023443322224653599999777314 Q ss_pred CCCCC--CCCCCCCCCCC----C-------CCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEEE Q ss_conf 46422--33653210023----6-------77897899987779999999987303556488999998752389779999 Q gi|254780868|r 148 HCPVD--TPYPVTSLQRE----G-------GCIDLKDIFSLLFQDVARVLDLWKKDTGREEIMNLWRCFACGIGDLITIK 214 (252) Q Consensus 148 ~~~~~--~~~~~tsL~~~----~-------~~~~~~~l~~~l~~~l~~~~~~~~~~~~~~~i~~~~~~~~~~~gk~V~i~ 214 (252) ..+.+ ++.+++++... . ..+++++++..++.+|..++..|.+ .+|++++++|..++.++||.|.+. T Consensus 170 ~~~~~~~l~~~~tsi~~~l~~~~~~~~~~~~ipsvedll~~Ll~nf~~~~~~l~~-~GFs~il~ew~~r~~~iGK~V~V~ 248 (288) T PTZ00275 170 LMDDDNLLGDNFTSIKKELINSQNDFGKFCSIPSVEDVMDKLILNFFATFNKLRE-EGFSSFLDYISIRLLYKDKKVLID 248 (288) T ss_pred CCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH-CCCHHHHHHHHHHHHHCCCEEEEE T ss_conf 6753222355454688885245322343235778999999999999999999986-690999999999985469859999 Q ss_pred ECCEEEEEEEEEECCCCCEEEE-CCCCCEEEEEEEEEC Q ss_conf 7993899999987698748999-699989999963110 Q gi|254780868|r 215 LSYGSILGRFVGVDDFGYLLLE-EKKGCVRQIFTGDIF 251 (252) Q Consensus 215 ~~~~~~~G~~~gId~~G~L~i~-~~~G~~~~i~sGeI~ 251 (252) ..++.+.|+++|||+||+|++. ++++++..|++|+++ T Consensus 249 ~~n~~i~G~a~gId~DGsLILl~d~~n~ii~i~~g~~~ 286 (288) T PTZ00275 249 QDGELIAGHLEDLLDDGSIILLEEETNEIVRINHGHLF 286 (288) T ss_pred ECCCEEEEEEEEECCCCCEEEEECCCCCEEEEEECEEE T ss_conf 58978999998688999899887799948999724596 No 7 >PRK13325 bifunctional biotin--[acetyl-CoA-carboxylase] ligase/pantothenate kinase; Reviewed Probab=100.00 E-value=0 Score=350.55 Aligned_cols=241 Identities=29% Similarity=0.397 Sum_probs=206.7 Q ss_pred HHHHCCCCCCEEEEEEECCCCHHHHHHHHHH-CCC-CCCEEEEECCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCHH Q ss_conf 2475179997049998164856899999997-399-997899982446783679985316633-3341322353224000 Q gi|254780868|r 4 NLSECQNPYSFRYEFFDTISSTNDECMKRAL-SGD-LGNLWIVASCQTAGRGRRDNKWISDKG-NLYASLLLIDSISKDS 80 (252) Q Consensus 4 ~l~~~~~p~~~~i~~~~~~~STn~~~~~~~~-~~~-~~~~~v~A~~Qt~GrGr~gr~W~s~~G-~l~~S~~~~~~~~~~~ 80 (252) .|.+.....+|+.....+|.|+||++.+.++ +++ .+-.+|+|+.||+||||+||+|+||+| .||||+.+.++.+... T Consensus 72 ~l~~~~~~~gf~~~l~he~~ssn~~~~~~ar~~~~~~h~~icVA~~QTAGRGRrGR~W~s~~G~~L~FS~~W~Fprp~a~ 151 (592) T PRK13325 72 GLRELGERSGFQTALKHECASSNDEILELARIAPDKAHKTICVTHLQSKGRGRQGRKWSHRLGECLMFSFGWVFDRPQYE 151 (592) T ss_pred HHHHHHHCCCHHHHHCCCCCCCHHHHHHHHHHCCCHHHHHHHEEEECCCCCCCCCCCCCCCCCCEEEEEEEEECCCCHHH T ss_conf 78876213431555344478850899998753722765554201201247888898365788774788635653745787 Q ss_pred CCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCCEEEECCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCC-CCCCCCCC Q ss_conf 21013779999999999744542102462146506710221122223323225832676521544444642-23365321 Q gi|254780868|r 81 LTLLSFAIAVAMRSVIASTLPVGTDVKIKWPNDILVFQRKIAGILIETLNLKNGLQAVVVGIGLNVKHCPV-DTPYPVTS 159 (252) Q Consensus 81 ~~~l~~~~~~ai~~~l~~~~~~~~~~~iKWPNDi~~~~kKi~GIL~E~~~~~~~~~~~viGiGiNi~~~~~-~~~~~~ts 159 (252) +.+||+++|++++++|+++ +.++++|||||||++++|+||||||+.... ....+|||||||+..+.+ +......+ T Consensus 152 L~GLSLaVa~Al~~aL~sl---GV~v~lKWPNDlll~~dKLAGILIETv~a~-~~t~aVIGIGINl~~p~~~e~Aa~v~~ 227 (592) T PRK13325 152 LGSLSPVAAVACRRALSRL---GLETQIKWPNDLVVGRDKLGGILIETVRAG-GKTVAVVGIGINFVLPKEVENAASVQS 227 (592) T ss_pred HCCCCHHHHHHHHHHHHHC---CCCCCCCCCCEEEECCCCEEEEEEEEEECC-CCCEEEEEEEECCCCCHHHHHHCCCCC T ss_conf 7484199999999999964---986234588504655873005999864158-871699996332357355531002334 Q ss_pred CCC---CCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCE-EEEEEEEEECCCCCEEE Q ss_conf 002---36778978999877799999999873035564889999987523897799997993-89999998769874899 Q gi|254780868|r 160 LQR---EGGCIDLKDIFSLLFQDVARVLDLWKKDTGREEIMNLWRCFACGIGDLITIKLSYG-SILGRFVGVDDFGYLLL 235 (252) Q Consensus 160 L~~---~~~~~~~~~l~~~l~~~l~~~~~~~~~~~~~~~i~~~~~~~~~~~gk~V~i~~~~~-~~~G~~~gId~~G~L~i 235 (252) +.. .-...+...+++.++.++...++.|.+ .||.++..+|++...+.||.|.+..++. ..+|++.|||++|.|++ T Consensus 228 ~~~~a~~~~~~dat~Ll~aLL~eL~~~L~~F~~-eGfAPF~arW~Aadaf~GqpVrLL~dGq~~~eGva~GVDEqGaLLL 306 (592) T PRK13325 228 LFQTASRRGNADAAVLLETLLAELDAVLLQYAR-DGFAPFVAEYQAANRDHGKAVLLLRDGETVFEGTVKGVDGQGVLHL 306 (592) T ss_pred CHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH-CCCHHHHHHHHHHHHCCCCCEEEEECCCEEECCEEEEECCCCEEEE T ss_conf 033542336898789999999999999999987-1745668889876641798479850785574026850568870898 Q ss_pred ECCCCCEEEEEEEEE Q ss_conf 969998999996311 Q gi|254780868|r 236 EEKKGCVRQIFTGDI 250 (252) Q Consensus 236 ~~~~G~~~~i~sGeI 250 (252) +|++|. +.|.+||| T Consensus 307 eTa~G~-Q~i~sGEI 320 (592) T PRK13325 307 ETAEGK-QTVVSGEI 320 (592) T ss_pred ECCCCC-EEEEEEEE T ss_conf 637885-66774115 No 8 >PRK08477 biotin--protein ligase; Provisional Probab=100.00 E-value=0 Score=327.65 Aligned_cols=200 Identities=26% Similarity=0.449 Sum_probs=166.0 Q ss_pred EEEEEEECCCCHHHHHHHHHHCCC-CCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCC--CHHHHHHH Q ss_conf 049998164856899999997399-997899982446783679985316633334132235322400021--01377999 Q gi|254780868|r 14 FRYEFFDTISSTNDECMKRALSGD-LGNLWIVASCQTAGRGRRDNKWISDKGNLYASLLLIDSISKDSLT--LLSFAIAV 90 (252) Q Consensus 14 ~~i~~~~~~~STn~~~~~~~~~~~-~~~~~v~A~~Qt~GrGr~gr~W~s~~G~l~~S~~~~~~~~~~~~~--~l~~~~~~ 90 (252) ..++|||+++|||++|++.+.++. .++++|+|++||+||||+||.|+||+|||+||+++++...+..++ .+++.+++ T Consensus 4 m~I~~~~~v~STN~~l~e~~~~~~~~~~~~v~A~~QT~GrGr~gr~W~S~~GnL~~S~~l~~~~~~~~~~~~~~~i~~~~ 83 (214) T PRK08477 4 MEIRFFESLPSTQTYLLEKIKNGKLKAPFAIVAKEQTAGIGSRGNSWEGAKGNLFFSFALKESDLPKDLPLQSSSIYFGY 83 (214) T ss_pred EEEEEECCCCCHHHHHHHHHHCCCCCCCEEEEECHHCCCCCCCCCEEECCCCCEEEEEEECCCCCCCCCCHHHHHHHHHH T ss_conf 59999698788899999999759998986999882015889899816869998599999777656210337999999999 Q ss_pred HHHHHHHHHCCCCCEEEEEECCCEEEECCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCHH Q ss_conf 99999997445421024621465067102211222233232258326765215444446422336532100236778978 Q gi|254780868|r 91 AMRSVIASTLPVGTDVKIKWPNDILVFQRKIAGILIETLNLKNGLQAVVVGIGLNVKHCPVDTPYPVTSLQREGGCIDLK 170 (252) Q Consensus 91 ai~~~l~~~~~~~~~~~iKWPNDi~~~~kKi~GIL~E~~~~~~~~~~~viGiGiNi~~~~~~~~~~~tsL~~~~~~~~~~ 170 (252) ++.++++.. +.+++||||||||+++||+||||+|... .++|+|||||++.+|++. +.|. ..++++ T Consensus 84 ~~~~~l~~~---g~~~~iKWPNDI~~~~KKi~GIL~e~~~-----~~iVvGIGiNv~~~p~~~----~~ld---i~i~~~ 148 (214) T PRK08477 84 LLKEVLKEL---GSKVWLKWPNDLYLGDKKIGGVITNKIK-----NVIVCGIGLNLKSSPENY----ACLD---IEISDD 148 (214) T ss_pred HHHHHHHHC---CCCCEEECCCHHEECCCEEEEEEEECCC-----CEEEEEEEECCCCCCCHH----HHCC---CCCCHH T ss_conf 999999970---9884586872414879558999798059-----949996655167595035----5438---345899 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCEEEEEEEEEECCCCCEEEE Q ss_conf 999877799999999873035564889999987523897799997993899999987698748999 Q gi|254780868|r 171 DIFSLLFQDVARVLDLWKKDTGREEIMNLWRCFACGIGDLITIKLSYGSILGRFVGVDDFGYLLLE 236 (252) Q Consensus 171 ~l~~~l~~~l~~~~~~~~~~~~~~~i~~~~~~~~~~~gk~V~i~~~~~~~~G~~~gId~~G~L~i~ 236 (252) +++..++..+++.+. | ..+.++| +..++.++.++++.+++.++|++.+|++||+|+|+ T Consensus 149 ~ll~~~~~~~e~~~~-w------k~i~~~y-~~~f~k~~~~~~~~~g~~~~~~~~~l~~DG~L~i~ 206 (214) T PRK08477 149 DLLEGFLQKIEKKIL-W------KQIFSKY-KLEFHKSKSFSFHIEGEAVSLEDAELCEDGSILIN 206 (214) T ss_pred HHHHHHHHHHHHHHH-H------HHHHHHH-HHHHHCCCEEEEEECCEEEEEEEEEECCCCCEEEC T ss_conf 999999999998877-9------9999999-98864387189998998999998068698878999 No 9 >PRK05935 biotin--protein ligase; Provisional Probab=100.00 E-value=0 Score=309.93 Aligned_cols=179 Identities=27% Similarity=0.298 Sum_probs=149.3 Q ss_pred EEEECCCCHHHHHHHHHHCCCCC-CEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCHHHHHHHHHHHH Q ss_conf 99816485689999999739999-78999824467836799853166333341322353224000210137799999999 Q gi|254780868|r 17 EFFDTISSTNDECMKRALSGDLG-NLWIVASCQTAGRGRRDNKWISDKGNLYASLLLIDSISKDSLTLLSFAIAVAMRSV 95 (252) Q Consensus 17 ~~~~~~~STn~~~~~~~~~~~~~-~~~v~A~~Qt~GrGr~gr~W~s~~G~l~~S~~~~~~~~~~~~~~l~~~~~~ai~~~ 95 (252) ++|+++||||++|++++..+.+. +++|+|++||+||||+||.|+||+|++++|+++.++........+..+++.++.++ T Consensus 6 ~~~~eidSTN~~ak~~~~~~~~~~~~vv~A~~QT~GRGR~gr~W~S~~g~l~~S~~~~~~~~~~~~~~l~~l~~~av~~~ 85 (190) T PRK05935 6 YEIAETPSTNTTAKQGMHLWDPYALTVISTREQTAGKGKFGKSWHSSDKDLLASFCFFITVLDIDVSLLFRLGTEAVLRL 85 (190) T ss_pred EECCEECCHHHHHHHHHHCCCCCCCEEEEECHHHCCCCCCCCEEECCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHH T ss_conf 96571677899999999737998989999881104898799865669998089999964876438789999999999999 Q ss_pred HHHHCCCCCEEEEEECCCEEEECCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCC---CCCCCCCCCCCC-CCCCCHHH Q ss_conf 99744542102462146506710221122223323225832676521544444642---233653210023-67789789 Q gi|254780868|r 96 IASTLPVGTDVKIKWPNDILVFQRKIAGILIETLNLKNGLQAVVVGIGLNVKHCPV---DTPYPVTSLQRE-GGCIDLKD 171 (252) Q Consensus 96 l~~~~~~~~~~~iKWPNDi~~~~kKi~GIL~E~~~~~~~~~~~viGiGiNi~~~~~---~~~~~~tsL~~~-~~~~~~~~ 171 (252) ++.+. ..++.||||||||+++||+||||+|+... +...++|||||||+|..++ +...+||||+.+ |..+++++ T Consensus 86 ~~~~~--~~~~~IKWPNDIl~~~kKi~GIL~E~~~~-~~~~~~iiGIGINvN~~~~~~~~i~~~ATSL~~~~G~~vd~~~ 162 (190) T PRK05935 86 GQDLG--ITEAVIKWPNDVLVHGEKLCGVLCETIPV-KGGLGVILGIGVNGNTTKDELLSIDQPATSLQELLGHPIDLEE 162 (190) T ss_pred HHHHC--CCCCCEECCCHHCCCCCEEEEEEEEEECC-CCCEEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHCCCCCHHH T ss_conf 99846--87743408665008791599997875115-8946999999987378820033679882229998789789999 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 99877799999999873035564889999987 Q gi|254780868|r 172 IFSLLFQDVARVLDLWKKDTGREEIMNLWRCF 203 (252) Q Consensus 172 l~~~l~~~l~~~~~~~~~~~~~~~i~~~~~~~ 203 (252) +++.++.+++++| + .+|.++++.|..+ T Consensus 163 ll~~ll~~~e~~~----~-e~f~~~~~~~~~~ 189 (190) T PRK05935 163 QRERLIKHIKHLI----Q-ENFLPLLARESNH 189 (190) T ss_pred HHHHHHHHHHHHH----H-CCCHHHHHHHHCC T ss_conf 9999999999998----6-0880665576508 No 10 >KOG1536 consensus Probab=100.00 E-value=2.8e-37 Score=251.86 Aligned_cols=219 Identities=24% Similarity=0.338 Sum_probs=171.4 Q ss_pred CEEEEEEECCCCHHHHHHHHHHCCC-CCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHC---CCHHHHH Q ss_conf 7049998164856899999997399-99789998244678367998531663333413223532240002---1013779 Q gi|254780868|r 13 SFRYEFFDTISSTNDECMKRALSGD-LGNLWIVASCQTAGRGRRDNKWISDKGNLYASLLLIDSISKDSL---TLLSFAI 88 (252) Q Consensus 13 ~~~i~~~~~~~STn~~~~~~~~~~~-~~~~~v~A~~Qt~GrGr~gr~W~s~~G~l~~S~~~~~~~~~~~~---~~l~~~~ 88 (252) |--+.+-+.+.||||.+-......+ ..+.+|+|+.||+||||.||.|+||+|.+++|+.+..+...... +...+++ T Consensus 385 G~~ll~a~V~tST~~ild~n~s~lpse~g~vvv~~~Q~sGRGRGgN~WlsP~G~~~~sf~ism~~ksr~~~~i~~~~~l~ 464 (649) T KOG1536 385 GRVLLWAPVLTSTMDILDHNFSELPSEVGLVVVANIQTSGRGRGGNVWLSPKGCAMSSFTISMPLKSRVVPLIPFVQHLA 464 (649) T ss_pred CCEEEEEEECCHHHHHHHHHHHCCCCCCCEEEEEEEEEECCCCCCCEEECCCCEEEEEEEEEEECCCCCCCCHHHHHHHH T ss_conf 71699985121488987501303875452599986775045677876656733476888998205555344018999999 Q ss_pred HHHHHHHHHHHCC-CCCEEEEEECCCEEEEC-CCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCC--- Q ss_conf 9999999997445-42102462146506710-221122223323225832676521544444642233653210023--- Q gi|254780868|r 89 AVAMRSVIASTLP-VGTDVKIKWPNDILVFQ-RKIAGILIETLNLKNGLQAVVVGIGLNVKHCPVDTPYPVTSLQRE--- 163 (252) Q Consensus 89 ~~ai~~~l~~~~~-~~~~~~iKWPNDi~~~~-kKi~GIL~E~~~~~~~~~~~viGiGiNi~~~~~~~~~~~tsL~~~--- 163 (252) .+++.++++.... ..++++||||||||+++ .||||||+.+....+.+ ++.||+|||+.+. .|.+||... T Consensus 465 ~~~~v~~~~~~pGy~dIpvrIKWPNDlY~~~~lKvgGiLv~st~r~n~f-~v~iGCGiNVtN~-----~PT~cLN~ll~~ 538 (649) T KOG1536 465 LVAVVEAVRYAPGYPDIPVRIKWPNDLYVNGYLKVGGILVTSTYRSNKF-NVSIGCGINVTND-----GPTTCLNDLLDE 538 (649) T ss_pred HHHHHHHHHCCCCCCCCCEEEECCCCCEEEECCCCCEEEEEEEECCCEE-EEEEEEEEEECCC-----CCCEEHHHHHHH T ss_conf 9999999860889998854642685003531010132998756527658-9998502671599-----970359887776 Q ss_pred -C-------CCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEEEECCEEEEEEEEEECCCCCEEE Q ss_conf -6-------77897899987779999999987303556488999998752389779999799389999998769874899 Q gi|254780868|r 164 -G-------GCIDLKDIFSLLFQDVARVLDLWKKDTGREEIMNLWRCFACGIGDLITIKLSYGSILGRFVGVDDFGYLLL 235 (252) Q Consensus 164 -~-------~~~~~~~l~~~l~~~l~~~~~~~~~~~~~~~i~~~~~~~~~~~gk~V~i~~~~~~~~G~~~gId~~G~L~i 235 (252) . ..+..+++++.++..|++++..|. +.|++.++..|.+++.|-+|.|.+.+.++.+ +..++++.|.|++ T Consensus 539 ~n~e~~~~~l~~k~EkLiA~~~n~fe~~~k~f~-d~G~~silp~YYeyWLHS~Q~V~L~d~~e~v--ii~~~dD~G~L~~ 615 (649) T KOG1536 539 MNEERNLLHLAIKAEKLIAAVFNKFEKYFKLFM-DQGPKSILPLYYEYWLHSNQRVILEDKGEAV--IIGGTDDIGFLQV 615 (649) T ss_pred HCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHH-HHCHHHHHHHHHHHHHCCCCEEECCCCCCCE--EEEEEECCCCEEE T ss_conf 442101200001499999999999999999998-6287767689999875268589625788505--8965604311121 Q ss_pred ECCCC Q ss_conf 96999 Q gi|254780868|r 236 EEKKG 240 (252) Q Consensus 236 ~~~~G 240 (252) +...+ T Consensus 616 ~~~~~ 620 (649) T KOG1536 616 SGLLL 620 (649) T ss_pred EHHCC T ss_conf 01203 No 11 >TIGR00121 birA_ligase biotin-[acetyl-CoA-carboxylase] ligase; InterPro: IPR004408 The biotin operon of Escherichia coli contains 5 structural genes involved in the synthesis of biotin. Transcription of the operon is regulated via one of these proteins, the biotin ligase BirA. BirA is an asymetric protein with 3 specific domains - an N-terminal DNA-binding domain, a central catalytic domain and a C-terminal of unknown function. The ligase reaction intermediate, biotinyl-5'-AMP, is the co-repressor that triggers DNA binding by BirA. The alpha-helical N-terminal domain of the BirA protein has the helix-turn-helix structure of DNA-binding proteins with a central DNA recognition helix. BirA undergoes several conformational changes related to repressor function and the N-terminal DNA-binding function is connected to the rest of the molecule through a hinge which will allow relocation of the domains during the reaction . Biotin-binding causes a large structural change thought to facilitate ATP-binding. Two repressor molecules form the operator-repressor complex, with dimer formation occuring simultaneously with DNA binding. DNA-binding may also cause a conformational change which allows this co-operative interaction. In the dimer structure, the beta-sheets in the central domain of each monomer are arranged side-by-side to form a single, seamless beta-sheet. The apparent orthologs among the eukaryotes are larger proteins that contain a domain with high sequence homology to BirA.; GO: 0004077 biotin-[acetyl-CoA-carboxylase] ligase activity, 0006464 protein modification process. Probab=100.00 E-value=4.9e-37 Score=250.35 Aligned_cols=233 Identities=33% Similarity=0.530 Sum_probs=206.6 Q ss_pred EEEEEECCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCC-CCCCCCCCCC-CCCHHCCCHHHHHHHHH Q ss_conf 499981648568999999973999978999824467836799853166333-3413223532-24000210137799999 Q gi|254780868|r 15 RYEFFDTISSTNDECMKRALSGDLGNLWIVASCQTAGRGRRDNKWISDKGN-LYASLLLIDS-ISKDSLTLLSFAIAVAM 92 (252) Q Consensus 15 ~i~~~~~~~STn~~~~~~~~~~~~~~~~v~A~~Qt~GrGr~gr~W~s~~G~-l~~S~~~~~~-~~~~~~~~l~~~~~~ai 92 (252) .+++++.++|||+++++....+.+.+.+++|+.|++||||+||.|.||+|+ +|+|+.+.++ .+....+.+++.+++++ T Consensus 1 ~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~g~g~~g~~W~~~~g~~~~~~~~~~~~~~p~~~~~~~~l~~~~~~ 80 (248) T TIGR00121 1 EVIYLDVLDSTNEYLLELAKEGKPKGDLVVADKQTAGRGRRGRVWLSPEGGGLYFSLILRPDDLPKSPAPGLTLLAGLAV 80 (248) T ss_pred CEEECCCCCCHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHH T ss_conf 91103455534689997520456655078850323577645750137667735899975353233000002455677899 Q ss_pred HHHHHHHCCCCCEEEEEECCCEEEEC----CCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCC----CCCCC-CCC Q ss_conf 99999744542102462146506710----2211222233232258326765215444446422336----53210-023 Q gi|254780868|r 93 RSVIASTLPVGTDVKIKWPNDILVFQ----RKIAGILIETLNLKNGLQAVVVGIGLNVKHCPVDTPY----PVTSL-QRE 163 (252) Q Consensus 93 ~~~l~~~~~~~~~~~iKWPNDi~~~~----kKi~GIL~E~~~~~~~~~~~viGiGiNi~~~~~~~~~----~~tsL-~~~ 163 (252) ++++++.. .++.+|||||+++++ +|+||||+|.....+....+++|+|+|++..+...+. .+++. ... T Consensus 81 ~~~~~~~~---~~~~~kwpnd~~~~~~d~~~k~~g~l~~~~~~~~~~~~~~~g~g~~~n~~~~~~~~~~~~~~~~~~~~~ 157 (248) T TIGR00121 81 AEVLKELG---LQVGLKWPNDLLLKGNDNYKKLGGILTELTGKENRLDYVVIGIGINVNVQLPKPPDGLAEGATSLKEEL 157 (248) T ss_pred HHHHHHHC---CCCCCCCCCEEEEECCCCCHHHHHHHHHHCCCCCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 98887510---001122675036507754012322110000133550256774021135555420123433334445554 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHH-HHHHHHCCCCEEEEEECCEEEE-EEEEEECCCCCEEEECCCCC Q ss_conf 677897899987779999999987303556488999-9987523897799997993899-99998769874899969998 Q gi|254780868|r 164 GGCIDLKDIFSLLFQDVARVLDLWKKDTGREEIMNL-WRCFACGIGDLITIKLSYGSIL-GRFVGVDDFGYLLLEEKKGC 241 (252) Q Consensus 164 ~~~~~~~~l~~~l~~~l~~~~~~~~~~~~~~~i~~~-~~~~~~~~gk~V~i~~~~~~~~-G~~~gId~~G~L~i~~~~G~ 241 (252) +...++.+++..++..+...+..+.+ .+..+++.. |.+++.+.|+.+.+....+.+. |...+++.+|.++++..++. T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~~~~~~~g~~~~~~~d~~ 236 (248) T TIGR00121 158 GIDLDRGELLEGFLRNFEENLELFED-GGIDEILSLKWEKLSAHIGREVKLLTGNGEIEHGIALGIDEDGALLLELGDGG 236 (248) T ss_pred HCCCCHHHHHHHHHHHHHHHHHHHHH-CCCCHHHHHHHHHHHHHCCCEEEEEECCCEEEECHHHCCCCCCEEEEECCCCC T ss_conf 11210467899999888887776640-46336788888887530464037751456022011111344561677417760 Q ss_pred EEEEEEEEEC Q ss_conf 9999963110 Q gi|254780868|r 242 VRQIFTGDIF 251 (252) Q Consensus 242 ~~~i~sGeI~ 251 (252) ++.+.+|++. T Consensus 237 ~~~~~~g~~~ 246 (248) T TIGR00121 237 IEKILSGECS 246 (248) T ss_pred EEEEECCEEE T ss_conf 2343065033 No 12 >pfam03099 BPL_LplA_LipB Biotin/lipoate A/B protein ligase family. This family includes biotin protein ligase, lipoate-protein ligase A and B. Biotin is covalently attached at the active site of certain enzymes that transfer carbon dioxide from bicarbonate to organic acids to form cellular metabolites. Biotin protein ligase (BPL) is the enzyme responsible for attaching biotin to a specific lysine at the active site of biotin enzymes. Each organism probably has only one BPL. Biotin attachment is a two step reaction that results in the formation of an amide linkage between the carboxyl group of biotin and the epsilon-amino group of the modified lysine. Lipoate-protein ligase A (LPLA) catalyses the formation of an amide linkage between lipoic acid and a specific lysine residue in lipoate dependent enzymes. The unusual biosynthesis pathway of lipoic acid is mechanistically intertwined with attachment of the cofactor. Probab=99.93 E-value=9.3e-26 Score=180.05 Aligned_cols=119 Identities=36% Similarity=0.454 Sum_probs=92.6 Q ss_pred EECCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC---CHHCCCHHHHHHHHHHHH Q ss_conf 81648568999999973999978999824467836799853166333341322353224---000210137799999999 Q gi|254780868|r 19 FDTISSTNDECMKRALSGDLGNLWIVASCQTAGRGRRDNKWISDKGNLYASLLLIDSIS---KDSLTLLSFAIAVAMRSV 95 (252) Q Consensus 19 ~~~~~STn~~~~~~~~~~~~~~~~v~A~~Qt~GrGr~gr~W~s~~G~l~~S~~~~~~~~---~~~~~~l~~~~~~ai~~~ 95 (252) ...++|| ..+...++.+++.+++|++||+|| |.|+||+|+|+||++++++.. ......+++.++++++++ T Consensus 2 g~~~~st---~~~~~~~~~~~g~v~va~~QT~Gr----~~W~sp~g~l~~Si~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (124) T pfam03099 2 GRRQKST---KEEVNLEGLEEGGVVVARRQTGGR----RVWHSPGGNLAYSLLLLPELKSFKPSDLPAYVLLLVLAVLEA 74 (124) T ss_pred CCCCCCH---HHHHHHHCCCCCCEEEEEEECCCC----CCEECCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 7221378---999727257659899997746899----721689998899999756556401778899999999999999 Q ss_pred HHHHC-CCCCEEEEEECCCEEEECCCCCCCCCCCCCCCCCCEEEEEECCCC Q ss_conf 99744-542102462146506710221122223323225832676521544 Q gi|254780868|r 96 IASTL-PVGTDVKIKWPNDILVFQRKIAGILIETLNLKNGLQAVVVGIGLN 145 (252) Q Consensus 96 l~~~~-~~~~~~~iKWPNDi~~~~kKi~GIL~E~~~~~~~~~~~viGiGiN 145 (252) +.... ..+.++++|||||||+++||+||||+|... .....+++||||+| T Consensus 75 l~~~~~~~~~~~~iKWPNDi~~~~kKi~GiL~e~~~-~~~~~~~viGiGiN 124 (124) T pfam03099 75 LGPKAGIPGIWVRIKWPNDVLVNGKKIAGILQEIRR-GGTLHHGTLGIGIN 124 (124) T ss_pred HHHHHCCCCCEEEECCCCCCEECCCEEEEEEEEECC-CCCCCEEEEEEEEC T ss_conf 998631369627653865326889089999878204-89645799999619 No 13 >pfam02237 BPL_C Biotin protein ligase C terminal domain. The function of this structural domain is unknown. It is found to the C terminus of the biotin protein ligase catalytic domain pfam01317. Probab=98.46 E-value=5.5e-07 Score=63.06 Aligned_cols=45 Identities=40% Similarity=0.644 Sum_probs=42.4 Q ss_pred CCCCEEEEEECCEEEEEEEEEECCCCCEEEECCCCCEEEEEEEEEC Q ss_conf 3897799997993899999987698748999699989999963110 Q gi|254780868|r 206 GIGDLITIKLSYGSILGRFVGVDDFGYLLLEEKKGCVRQIFTGDIF 251 (252) Q Consensus 206 ~~gk~V~i~~~~~~~~G~~~gId~~G~L~i~~~~G~~~~i~sGeI~ 251 (252) ++||.|++..+++.++|+++|||++|+|++++++| ++.+++|||+ T Consensus 1 ~lG~~V~~~~~~~~v~G~a~gId~~G~Lll~~~~g-~~~i~~Gdvs 45 (47) T pfam02237 1 HLGKEVKVTTGGGKVEGIAVGIDDDGRLLLETDDG-IRIIISGDIS 45 (47) T ss_pred CCCCEEEEEECCCEEEEEEECCCCCCEEEEECCCC-EEEEEEEEEE T ss_conf 98978999938977999996718898499984896-4999832489 No 14 >PRK03822 lplA lipoate-protein ligase A; Provisional Probab=95.60 E-value=0.14 Score=29.46 Aligned_cols=179 Identities=14% Similarity=0.161 Sum_probs=88.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCCEEEEC----CCCCCCCCCCCCC Q ss_conf 985316633334132235322400021013779999999999744542102462146506710----2211222233232 Q gi|254780868|r 56 DNKWISDKGNLYASLLLIDSISKDSLTLLSFAIAVAMRSVIASTLPVGTDVKIKWPNDILVFQ----RKIAGILIETLNL 131 (252) Q Consensus 56 gr~W~s~~G~l~~S~~~~~~~~~~~~~~l~~~~~~ai~~~l~~~~~~~~~~~iKWPNDi~~~~----kKi~GIL~E~~~~ 131 (252) |=.=+-..||+-||++...+. .+. .. ..--+.++|+++ |+++.+.=-|||.+++ ||++|--- .... T Consensus 75 GGaVYhDlGN~nfsfi~~~~~-~~~----~~-~~~~ii~aL~~l---Gi~a~~sgRNDl~i~g~~g~kKiSG~A~-~~~~ 144 (338) T PRK03822 75 GGAVFHDLGNTCFTFMAGKPE-YDK----TI-STSIVLNALNSL---GVSAEASGRNDLVVKTAEGDRKVSGSAY-RETK 144 (338) T ss_pred CCEEEECCCCCEEEEEECCCH-HCH----HH-HHHHHHHHHHHC---CCCCEECCCCCEEEECCCCCCEECCHHH-EECC T ss_conf 887998299857999837620-035----55-389999999973---9875476866779615877726605132-0208 Q ss_pred CCCCEE--EEEECCCC-----CCCCCCCCC--------CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCC----- Q ss_conf 258326--76521544-----444642233--------6532100236778978999877799999999873035----- Q gi|254780868|r 132 KNGLQA--VVVGIGLN-----VKHCPVDTP--------YPVTSLQREGGCIDLKDIFSLLFQDVARVLDLWKKDT----- 191 (252) Q Consensus 132 ~~~~~~--~viGiGiN-----i~~~~~~~~--------~~~tsL~~~~~~~~~~~l~~~l~~~l~~~~~~~~~~~----- 191 (252) +...++ +.+-.-++ ++..+..+. ...|.|++..+.++.+++...++..+...+....... T Consensus 145 ~~~lhHGTlL~d~dl~~l~~~L~~~~~Kl~sKgikSVrsRVtNl~e~~~~i~~~~~~~~l~~~f~~~~~~~~~~~~l~~~ 224 (338) T PRK03822 145 DRGFHHGTLLLNADLSRLANYLNPDKKKLAAKGITSVRSRVTNLTELLPGITHEQVCEAITEAFFAHYGERVEAEIISPD 224 (338) T ss_pred CEEEEEEEEEECCCHHHHHHHCCCCHHHHHHCCCCCHHHCCCCHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECHH T ss_conf 82788568886288899986548983454322734215241166460899989999999999999983765543883776 Q ss_pred ---CCHHHHHH---HHHHHHCCCCEEEEEECCEEEEEEEEEECCCCCE--EEECCCCCEEEE-EEEEEC Q ss_conf ---56488999---9987523897799997993899999987698748--999699989999-963110 Q gi|254780868|r 192 ---GREEIMNL---WRCFACGIGDLITIKLSYGSILGRFVGVDDFGYL--LLEEKKGCVRQI-FTGDIF 251 (252) Q Consensus 192 ---~~~~i~~~---~~~~~~~~gk~V~i~~~~~~~~G~~~gId~~G~L--~i~~~~G~~~~i-~sGeI~ 251 (252) ....+.+. +....+..|+.=.+..... .+| +.|.+ .+.-++|.++.+ ..||=| T Consensus 225 ~~~~i~~~~e~~~~~~sWeW~YG~sP~f~~~~~---~rf----~~G~ve~~l~V~~G~I~~~ki~gDff 286 (338) T PRK03822 225 KTPDLPNFAETFARQSSWEWNFGQAPAFSHLLD---ERF----TWGGVELHFDVEKGHITRAQIFTDSL 286 (338) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEE---EEC----CCCEEEEEEEEECCEEEEEEEECCCC T ss_conf 765458999987550760131789987534533---553----67369999998289898999986689 No 15 >COG0095 LplA Lipoate-protein ligase A [Coenzyme metabolism] Probab=95.42 E-value=0.071 Score=31.24 Aligned_cols=134 Identities=19% Similarity=0.170 Sum_probs=73.8 Q ss_pred EEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCE-EEEEECCCEEEECCC Q ss_conf 9998244678367998531663333413223532240002101377999999999974454210-246214650671022 Q gi|254780868|r 42 WIVASCQTAGRGRRDNKWISDKGNLYASLLLIDSISKDSLTLLSFAIAVAMRSVIASTLPVGTD-VKIKWPNDILVFQRK 120 (252) Q Consensus 42 ~v~A~~Qt~GrGr~gr~W~s~~G~l~~S~~~~~~~~~~~~~~l~~~~~~ai~~~l~~~~~~~~~-~~iKWPNDi~~~~kK 120 (252) +-+++++|-|. .=+-..|++.+|++.+.+..............-.+.++++.++ .+ +...=+|||.+++|| T Consensus 66 i~vvRR~sGGG-----aV~hd~g~l~~S~i~~~~~~~~~~~~~~~~~~~~~~~~l~~lg---v~~~~~~~~nDl~v~gkK 137 (248) T COG0095 66 IPVVRRPSGGG-----AVFHDLGNLNYSVITPDEGGLESYETLYKFLLQPVIDALRALG---VEGAECPGRNDLVVDGKK 137 (248) T ss_pred CCEEEECCCCC-----EEEECCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHC---CCEECCCCCCCEEECCEE T ss_conf 84799637997-----4895489789999978986433499999999999999999848---750016898757884789 Q ss_pred CCCCCCCCCCCCCCCEE--EEEECCCC-----CCCCCCC--------CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 11222233232258326--76521544-----4446422--------336532100236778978999877799999999 Q gi|254780868|r 121 IAGILIETLNLKNGLQA--VVVGIGLN-----VKHCPVD--------TPYPVTSLQREGGCIDLKDIFSLLFQDVARVLD 185 (252) Q Consensus 121 i~GIL~E~~~~~~~~~~--~viGiGiN-----i~~~~~~--------~~~~~tsL~~~~~~~~~~~l~~~l~~~l~~~~~ 185 (252) ++|.=-- ...+.-..+ +..-+=+. ++.+++. .+...+++.+. ..++.+++...++..|...+. T Consensus 138 isG~Aq~-~~~~~~l~hgtll~~~d~~~l~~~l~~~~~k~~~k~~~s~~~rv~~l~~~-~~~~~~e~~~~l~~~f~~~~~ 215 (248) T COG0095 138 ISGSAQR-RTKGRILHHGTLLLDIDLELLARVLRVPKEKIKSKGIKSVRERVANLEEL-LKISVEEFLEALLEAFFKVLG 215 (248) T ss_pred EEEEHHH-HHCCCEEEEEEEEECCCHHHHHHHCCCCHHHCCCCCCCCHHHHCCHHHHC-CCCCHHHHHHHHHHHHHHHCC T ss_conf 9883676-60890688889997388899998528870321343346678861015541-698999999999999987637 No 16 >PRK00008 lipB lipoyltransferase; Reviewed Probab=95.18 E-value=0.097 Score=30.39 Aligned_cols=124 Identities=24% Similarity=0.190 Sum_probs=69.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCEEEEEECC--CEEEE-CCCCCCCCCCCCC Q ss_conf 79985316633334132235322400021013779999999999744542102462146--50671-0221122223323 Q gi|254780868|r 54 RRDNKWISDKGNLYASLLLIDSISKDSLTLLSFAIAVAMRSVIASTLPVGTDVKIKWPN--DILVF-QRKIAGILIETLN 130 (252) Q Consensus 54 r~gr~W~s~~G~l~~S~~~~~~~~~~~~~~l~~~~~~ai~~~l~~~~~~~~~~~iKWPN--Di~~~-~kKi~GIL~E~~~ 130 (252) |.|..=+-.||.|-.=.++...........+--..--++.++++.+ +++... -|. -|+++ ++|||-|=+... T Consensus 69 RGG~iTyHGPGQlV~Ypi~~L~~~~~~v~~~v~~lE~~iI~~l~~~---gi~a~~-~~~~~GVwv~~~~KIasIGi~v~- 143 (212) T PRK00008 69 RGGQVTYHGPGQLVAYPLLDLKRRKLDVRDYVRRLEEAVINTLAEF---GIKAER-REGAPGVWVGPERKIAAIGLRVR- 143 (212) T ss_pred CCCCEEEECCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHC---CCCCCC-CCCCCCEECCCCCEEEEEEEHCC- T ss_conf 9973576489878999998346556309999999999999999975---974133-68984121267866665321101- Q ss_pred CCCCCEEEEEECCCCCCCCC--------CCC-CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 22583267652154444464--------223-365321002367789789998777999999998 Q gi|254780868|r 131 LKNGLQAVVVGIGLNVKHCP--------VDT-PYPVTSLQREGGCIDLKDIFSLLFQDVARVLDL 186 (252) Q Consensus 131 ~~~~~~~~viGiGiNi~~~~--------~~~-~~~~tsL~~~~~~~~~~~l~~~l~~~l~~~~~~ 186 (252) ..+ .-=|+-||++.+- ... +.+.|||++++..++.+++...+..+|.+.+.. T Consensus 144 --~~v--t~HG~alNv~~dL~~F~~I~pCG~~~~~vTSl~~~g~~i~~~ev~~~l~~~f~~~F~~ 204 (212) T PRK00008 144 --RGV--SFHGLALNVDMDLSPFSRINPCGIAGLGVTSLSDLGPPVTVEEVDPALAAAFAAVLGY 204 (212) T ss_pred --CCE--EECCEEEEECCCHHHCCCEECCCCCCCCCCCHHHHCCCCCHHHHHHHHHHHHHHHHCC T ss_conf --882--7077589846981432668747889987242999789999999999999999998097 No 17 >COG0321 LipB Lipoate-protein ligase B [Coenzyme metabolism] Probab=90.66 E-value=1.2 Score=23.60 Aligned_cols=127 Identities=24% Similarity=0.199 Sum_probs=69.2 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCCEEEEC-CCCCCCCCCCCCC Q ss_conf 679985316633334132235322400021013779999999999744542102462146506710-2211222233232 Q gi|254780868|r 53 GRRDNKWISDKGNLYASLLLIDSISKDSLTLLSFAIAVAMRSVIASTLPVGTDVKIKWPNDILVFQ-RKIAGILIETLNL 131 (252) Q Consensus 53 Gr~gr~W~s~~G~l~~S~~~~~~~~~~~~~~l~~~~~~ai~~~l~~~~~~~~~~~iKWPNDi~~~~-kKi~GIL~E~~~~ 131 (252) +|.|..=+-.||.+-.=.++....+....-.+--..--++.++|.++.-.+ ...=+=| -+|+++ +|||-|=+... T Consensus 76 ~RGGqvTyHGPGQ~V~Y~ildLkr~~~~vr~~V~~LEqavI~tLa~~~i~~-~~~~~~~-GVwV~~~~KIAaiGirir-- 151 (221) T COG0321 76 DRGGQVTYHGPGQLVAYPILDLKRPKLDVREYVRALEQAVINTLAEYGIEA-ERRPDRP-GVWVEEERKIAAIGIRIR-- 151 (221) T ss_pred CCCCEEEEECCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCCC-CCCCCCC-EEEECCCCEEEEEEEEEE-- T ss_conf 688336775898289999971465553199999999999999999749764-4569997-587568864888877871-- Q ss_pred CCCCEEEEEECCCCCCCCCC--------C-CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 25832676521544444642--------2-3365321002367789789998777999999998 Q gi|254780868|r 132 KNGLQAVVVGIGLNVKHCPV--------D-TPYPVTSLQREGGCIDLKDIFSLLFQDVARVLDL 186 (252) Q Consensus 132 ~~~~~~~viGiGiNi~~~~~--------~-~~~~~tsL~~~~~~~~~~~l~~~l~~~l~~~~~~ 186 (252) .. ...=|+.||++.+-. . ...+.|||..++..++.+++...+.++|.+.+.. T Consensus 152 -r~--vt~HGlALNv~~DL~~F~~I~PCG~~~~~~tsl~d~~~~v~~~~V~~~l~~~~~~~l~~ 212 (221) T COG0321 152 -RG--VTFHGLALNVNMDLSPFNRIVPCGYAGMEVTSLSDLGPPVTVDEVAKALVAAFAKLLGP 212 (221) T ss_pred -CC--CCEEEEEEEECCCCHHCCCEECCCCCCCCEEEHHHHCCCCCHHHHHHHHHHHHHHHHCC T ss_conf -45--10101278505782120624036668972547989669875999999999999998487 No 18 >pfam04017 DUF366 Domain of unknown function (DUF366). Archaeal domain of unknown function. Probab=90.58 E-value=0.63 Score=25.32 Aligned_cols=91 Identities=16% Similarity=0.263 Sum_probs=54.6 Q ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEEECCCEEEECCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 13779999999999744542102462146506710221122223323225832676521544444642233653210023 Q gi|254780868|r 84 LSFAIAVAMRSVIASTLPVGTDVKIKWPNDILVFQRKIAGILIETLNLKNGLQAVVVGIGLNVKHCPVDTPYPVTSLQRE 163 (252) Q Consensus 84 l~~~~~~ai~~~l~~~~~~~~~~~iKWPNDi~~~~kKi~GIL~E~~~~~~~~~~~viGiGiNi~~~~~~~~~~~tsL~~~ 163 (252) .+-+....+.+.|++. +.++. +==-|||++++|++=-..- .+.....|=+|||+.. ...|-++.+|.+. T Consensus 83 rQRLl~~i~~evl~~~---g~~~~-r~GDDlyv~~~KlSVSIA~-----~s~vs~kiH~GINV~~--~g~PV~aigL~dl 151 (183) T pfam04017 83 RQRLLVAIAKEVLEDY---GVKLT-RDGDDLYVNGKKLSVSIAT-----ASPVSMKIHFGINVTS--EGTPVPAIGLEDL 151 (183) T ss_pred HHHHHHHHHHHHHHHC---CCCCE-ECCCEEEECCCEEEEEEEE-----CCCCCEEEEEEEEECC--CCCCCCCCCHHHC T ss_conf 9999999999999972---98606-5578168759678999996-----4887158998787604--6656541028773 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 67789789998777999999998 Q gi|254780868|r 164 GGCIDLKDIFSLLFQDVARVLDL 186 (252) Q Consensus 164 ~~~~~~~~l~~~l~~~l~~~~~~ 186 (252) |.. +..++.+.+...+..-++. T Consensus 152 g~~-~~~~~m~~i~~~y~~E~~~ 173 (183) T pfam04017 152 GID-DIKELMEDIASAYVEEIEK 173 (183) T ss_pred CCC-CHHHHHHHHHHHHHHHHHH T ss_conf 875-5999999999999999999 No 19 >COG2029 Uncharacterized conserved protein [Function unknown] Probab=87.72 E-value=1.6 Score=22.80 Aligned_cols=116 Identities=17% Similarity=0.174 Sum_probs=60.8 Q ss_pred CCCCCCCCCCCCCCCCHH--CCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCCEEEECCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 333341322353224000--210137799999999997445421024621465067102211222233232258326765 Q gi|254780868|r 63 KGNLYASLLLIDSISKDS--LTLLSFAIAVAMRSVIASTLPVGTDVKIKWPNDILVFQRKIAGILIETLNLKNGLQAVVV 140 (252) Q Consensus 63 ~G~l~~S~~~~~~~~~~~--~~~l~~~~~~ai~~~l~~~~~~~~~~~iKWPNDi~~~~kKi~GIL~E~~~~~~~~~~~vi 140 (252) +|+=++++++.....++. ...++-+.-..+.+.+.++ +.+.. .=-.|+|++++|+.=-..- . +....-| T Consensus 63 kseD~lhfivEhfd~~dl~lay~rQrLlv~i~~E~l~~~---gv~~~-ReGDDly~~grKLtVsIat----~-s~vs~ki 133 (189) T COG2029 63 KSEDMLHFIVEHFDHPDLLLAYLRQRLLVAILKEVLEKL---GVDLS-REGDDLYVEGRKLTVSIAT----V-SPVSSKI 133 (189) T ss_pred CCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHH---CCCCC-CCCCCEEECCCEEEEEEEE----C-CCCCEEE T ss_conf 604567878861688538899999999999999999973---87201-3577036638578999973----4-7763068 Q ss_pred ECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2154444464223365321002367789789998777999999998730 Q gi|254780868|r 141 GIGLNVKHCPVDTPYPVTSLQREGGCIDLKDIFSLLFQDVARVLDLWKK 189 (252) Q Consensus 141 GiGiNi~~~~~~~~~~~tsL~~~~~~~~~~~l~~~l~~~l~~~~~~~~~ 189 (252) -+|||+...-.. .-++..|.+.|. .++.++...+..++.+-++..+. T Consensus 134 HlGiNV~t~g~p-~V~~igL~dlg~-~Di~~~m~~va~~yv~Eie~ie~ 180 (189) T COG2029 134 HLGINVKTEGVP-DVDAIGLEDLGY-GDILEFMERVAVAYVREIEKIEK 180 (189) T ss_pred EEEEEEECCCCC-CCCCCCCCCCCC-CCHHHHHHHHHHHHHHHHHHHHH T ss_conf 985765325578-877224310487-78899999999999999999998 No 20 >cd01728 LSm1 The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. LSm1 is one of at least seven subunits that assemble onto U6 snRNA to form a seven-membered ring structure. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes. Probab=86.42 E-value=2.5 Score=21.62 Aligned_cols=46 Identities=24% Similarity=0.555 Sum_probs=35.0 Q ss_pred HHHCCCCEEEEEECCE-EEEEEEEEECCCCCEEEECCCCCEEEEEEEEEC Q ss_conf 7523897799997993-899999987698748999699989999963110 Q gi|254780868|r 203 FACGIGDLITIKLSYG-SILGRFVGVDDFGYLLLEEKKGCVRQIFTGDIF 251 (252) Q Consensus 203 ~~~~~gk~V~i~~~~~-~~~G~~~gId~~G~L~i~~~~G~~~~i~sGeI~ 251 (252) ..-.++|+|.+...++ .+.|++.+.|..+.|++++ ..+.+++++-| T Consensus 7 L~~~ldkkv~V~l~dgR~~~G~Lr~fDq~~NlvL~~---~~Eri~~~~~~ 53 (74) T cd01728 7 LVDDLDKKVVVLLRDGRKLIGILRSFDQFANLVLQD---TVERIYVGDKY 53 (74) T ss_pred HHHHHCCEEEEEECCCCEEEEEEEEECCCCEEEEEE---EEEEEECCCCC T ss_conf 788629899999889989999999874654199320---58999748853 No 21 >cd01717 Sm_B The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm subunit B heterodimerizes with subunit D3 and three such heterodimers form a hexameric ring structure with alternating B and D3 subunits. The D3 - B heterodimer also assembles into a heptameric ring containing D1, D2, E, F, and G subunits. Sm-like proteins exist in archaea as well as prokaryotes which form heptameric and hexameric ring structures similar to those found in eukaryotes. Probab=75.00 E-value=6.2 Score=19.12 Aligned_cols=37 Identities=22% Similarity=0.313 Sum_probs=30.8 Q ss_pred HHHHCCCCEEEEEECCE-EEEEEEEEECCCCCEEEECC Q ss_conf 87523897799997993-89999998769874899969 Q gi|254780868|r 202 CFACGIGDLITIKLSYG-SILGRFVGVDDFGYLLLEEK 238 (252) Q Consensus 202 ~~~~~~gk~V~i~~~~~-~~~G~~~gId~~G~L~i~~~ 238 (252) +...++++.|++...++ .+.|++.+.|....|++.+. T Consensus 4 kL~~~ln~~vrv~~~DGR~~vG~l~~~D~~~NlVL~~~ 41 (79) T cd01717 4 KMLQLINYRLRVTLQDGRQFVGQFLAFDKHMNLVLSDC 41 (79) T ss_pred HHHHHCCCEEEEEEECCCEEEEEEEEECCCCCEEEECC T ss_conf 56886598799999689599999999747663898374 No 22 >smart00651 Sm snRNP Sm proteins. small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing Probab=74.31 E-value=6.5 Score=19.01 Aligned_cols=35 Identities=31% Similarity=0.472 Sum_probs=29.4 Q ss_pred HHCCCCEEEEEECCE-EEEEEEEEECCCCCEEEECC Q ss_conf 523897799997993-89999998769874899969 Q gi|254780868|r 204 ACGIGDLITIKLSYG-SILGRFVGVDDFGYLLLEEK 238 (252) Q Consensus 204 ~~~~gk~V~i~~~~~-~~~G~~~gId~~G~L~i~~~ 238 (252) ..++|+.|.+...++ .+.|++.+.|+.-.|.+++. T Consensus 4 ~~~~~~~V~V~l~~g~~~~G~L~~~D~~mNlvL~~~ 39 (67) T smart00651 4 KKLIGKRVLVELKNGREYRGTLKGFDQFMNLVLEDV 39 (67) T ss_pred HHCCCCEEEEEECCCCEEEEEEEEECCCCCEEECEE T ss_conf 971998799999899699999999889997298729 No 23 >cd01727 LSm8 The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. LSm8 is one of at least seven subunits that assemble onto U6 snRNA to form a seven-membered ring structure. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes. Probab=70.05 E-value=8.1 Score=18.40 Aligned_cols=41 Identities=17% Similarity=0.385 Sum_probs=30.9 Q ss_pred HHCCCCEEEEEECCE-EEEEEEEEECCCCCEEEECCCCCEEEEEE Q ss_conf 523897799997993-89999998769874899969998999996 Q gi|254780868|r 204 ACGIGDLITIKLSYG-SILGRFVGVDDFGYLLLEEKKGCVRQIFT 247 (252) Q Consensus 204 ~~~~gk~V~i~~~~~-~~~G~~~gId~~G~L~i~~~~G~~~~i~s 247 (252) .-+++|+|.+...++ .+.|++.|-|....|++.+. .+.+++ T Consensus 5 ~~~ldk~V~Vi~~DGR~~vG~L~gfDq~~NlvL~~~---~Er~~~ 46 (74) T cd01727 5 EDYLNKTVSVITVDGRVIVGTLKGFDQATNLILDDS---HERVYS 46 (74) T ss_pred HHHHCCEEEEEECCCCEEEEEEEECCCCCEEEEEEE---EEEEEE T ss_conf 676397899998589599999984267324986425---999980 No 24 >cd01731 archaeal_Sm1 The archaeal sm1 proteins: The Sm proteins are conserved in all three domains of life and are always associated with U-rich RNA sequences. They function to mediate RNA-RNA interactions and RNA biogenesis. All Sm proteins contain a common sequence motif in two segments, Sm1 and Sm2, separated by a short variable linker. Eukaryotic Sm proteins form part of specific small nuclear ribonucleoproteins (snRNPs) that are involved in the processing of pre-mRNAs to mature mRNAs, and are a major component of the eukaryotic spliceosome. Most snRNPs consist of seven Sm proteins (B/B', D1, D2, D3, E, F and G) arranged in a ring on a uridine-rich sequence (Sm site), plus a small nuclear RNA (snRNA) (either U1, U2, U5 or U4/6). Since archaebacteria do not have any splicing apparatus, Sm proteins of archaebacteria may play a more general role. Archaeal Lsm proteins are likely to represent the ancestral Sm domain. Probab=68.09 E-value=8.9 Score=18.14 Aligned_cols=34 Identities=26% Similarity=0.410 Sum_probs=28.9 Q ss_pred HCCCCEEEEEECCE-EEEEEEEEECCCCCEEEECC Q ss_conf 23897799997993-89999998769874899969 Q gi|254780868|r 205 CGIGDLITIKLSYG-SILGRFVGVDDFGYLLLEEK 238 (252) Q Consensus 205 ~~~gk~V~i~~~~~-~~~G~~~gId~~G~L~i~~~ 238 (252) ..+|+.|.++..++ ++.|+..+.|+.-.+.+++. T Consensus 7 ~~~~k~V~V~Lk~g~~~~G~L~~~D~~mNlvL~da 41 (68) T cd01731 7 DSLNKPVLVKLKGGKEVRGRLKSYDQHMNLVLEDA 41 (68) T ss_pred HHCCCEEEEEECCCCEEEEEEEEECCCCCEEECCE T ss_conf 85498599999899899999999947531898246 No 25 >cd01719 Sm_G The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm subunit G binds subunits E and F to form a trimer which then assembles onto snRNA along with the D1/D2 and D3/B heterodimers forming a seven-membered ring structure. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes. Probab=67.82 E-value=9 Score=18.11 Aligned_cols=35 Identities=23% Similarity=0.366 Sum_probs=29.6 Q ss_pred HHCCCCEEEEEECCE-EEEEEEEEECCCCCEEEECC Q ss_conf 523897799997993-89999998769874899969 Q gi|254780868|r 204 ACGIGDLITIKLSYG-SILGRFVGVDDFGYLLLEEK 238 (252) Q Consensus 204 ~~~~gk~V~i~~~~~-~~~G~~~gId~~G~L~i~~~ 238 (252) ..+++|+|.++..++ .+.|+..|.|..-.|.+++. T Consensus 6 ~~~ldk~v~Vkl~ggR~i~G~L~GfD~~mNLVLdda 41 (72) T cd01719 6 KKYMDKKLSLKLNGNRKVSGILRGFDPFMNLVLDDA 41 (72) T ss_pred HHHCCCEEEEEECCCCEEEEEEEEECCCCEEEEEEE T ss_conf 775498899998899699999997074202772305 No 26 >cd01734 YlxS_C YxlS is a Bacillus subtilis gene of unknown function with two domains that each have an alpha/beta fold. The N-terminal domain is composed of two alpha-helices and a three-stranded beta-sheet, while the C-terminal domain is composed of one alpha-helix and a five-stranded beta-sheet. This CD represents the C-terminal domain which has a fold similar to the Sm fold of proteins like Sm-D3. Probab=62.63 E-value=11 Score=17.49 Aligned_cols=59 Identities=27% Similarity=0.334 Sum_probs=37.2 Q ss_pred CCCHHHHHHHHHHHHCCCCEEEEEEC-----CEEEEEEEEEECCCCCEEEECCC---CCEEEEEEEEE Q ss_conf 55648899999875238977999979-----93899999987698748999699---98999996311 Q gi|254780868|r 191 TGREEIMNLWRCFACGIGDLITIKLS-----YGSILGRFVGVDDFGYLLLEEKK---GCVRQIFTGDI 250 (252) Q Consensus 191 ~~~~~i~~~~~~~~~~~gk~V~i~~~-----~~~~~G~~~gId~~G~L~i~~~~---G~~~~i~sGeI 250 (252) .|.+.-+....++..+.|+.|.+... ...+.|+..+++++. +.++.++ |+...|...+| T Consensus 8 PGldRpL~~~~~f~~~~G~~v~v~l~~~~~g~k~f~G~L~~v~~~~-i~l~~~~~~~~~~~~i~~~~I 74 (83) T cd01734 8 PGAERPLKKEADFERAVGKYVHVKLYQPIDGQKEFEGTLLGVDDDT-VTLEVDIKTRGKTVEIPLDKI 74 (83) T ss_pred CCCCCCCCCHHHHHHHCCCEEEEEEECCCCCCEEEEEEEEEEECCE-EEEEEECCCCCEEEEEEHHHH T ss_conf 9989867899999985798799999150189189999999883999-999995277878999746982 No 27 >cd06168 LSm9 The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. LSm9 proteins have a single Sm-like domain structure. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes. Probab=62.23 E-value=12 Score=17.45 Aligned_cols=36 Identities=19% Similarity=0.395 Sum_probs=29.9 Q ss_pred HHHCCCCEEEEEECCE-EEEEEEEEECCCCCEEEECC Q ss_conf 7523897799997993-89999998769874899969 Q gi|254780868|r 203 FACGIGDLITIKLSYG-SILGRFVGVDDFGYLLLEEK 238 (252) Q Consensus 203 ~~~~~gk~V~i~~~~~-~~~G~~~gId~~G~L~i~~~ 238 (252) ...++|+.+++...++ .+.|+|.-.|.++.+++-+. T Consensus 5 l~~ll~~~lrV~l~DGR~~vG~f~c~Dk~~NiIL~~~ 41 (75) T cd06168 5 LRSLLGRTMRIHMTDGRTLVGVFLCTDRDCNIILGSA 41 (75) T ss_pred HHHHCCCEEEEEEECCCEEEEEEEEECCCCCEEEECC T ss_conf 8986298799999679999999999737675998087 No 28 >PRK09618 flgD flagellar basal body rod modification protein; Provisional Probab=60.68 E-value=12 Score=17.28 Aligned_cols=42 Identities=19% Similarity=0.176 Sum_probs=32.3 Q ss_pred HHHHHCCCCEEEEEECCE-EEEEEEEEE--CCCCCEEEECCCCCE Q ss_conf 987523897799997993-899999987--698748999699989 Q gi|254780868|r 201 RCFACGIGDLITIKLSYG-SILGRFVGV--DDFGYLLLEEKKGCV 242 (252) Q Consensus 201 ~~~~~~~gk~V~i~~~~~-~~~G~~~gI--d~~G~L~i~~~~G~~ 242 (252) .....++||.|.+...++ .++|++..+ ..+|.+.+.-+||+. T Consensus 88 ~~a~slIGk~V~~~~~dg~~~sG~V~sv~~~~~g~v~~~ldnG~~ 132 (146) T PRK09618 88 LKYLEWIGKEVEWEHGDGEIVKGTVTSVKTGINGQVMVELDDGKW 132 (146) T ss_pred HHHHHHCCCEEEEECCCCCEEEEEEEEEEECCCCEEEEEECCCCE T ss_conf 999855598789973899888778999998659849999769949 No 29 >pfam01423 LSM LSM domain. The LSM domain contains Sm proteins as well as other related LSM (Like Sm) proteins. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spliceosomal small nuclear RNAs. The U6 snRNP binds to the LSM (Like Sm) proteins. Sm proteins are also found in archaebacteria, which do not have any splicing apparatus suggesting a more general role for Sm proteins. All Sm proteins contain a common sequence motif in two segments, Sm1 and Sm2, separated by a short variable linker. This family also includes the bacterial Hfq (host factor Q) proteins. Hfq are also RNA-binding proteins, that form hexameric rings. Probab=59.52 E-value=13 Score=17.15 Aligned_cols=34 Identities=29% Similarity=0.541 Sum_probs=28.9 Q ss_pred HCCCCEEEEEECCE-EEEEEEEEECCCCCEEEECC Q ss_conf 23897799997993-89999998769874899969 Q gi|254780868|r 205 CGIGDLITIKLSYG-SILGRFVGVDDFGYLLLEEK 238 (252) Q Consensus 205 ~~~gk~V~i~~~~~-~~~G~~~gId~~G~L~i~~~ 238 (252) ..+|++|.+...++ .+.|+..+.|+.-.+++++. T Consensus 5 ~~~~~~V~V~l~~g~~~~G~L~~~D~~mNlvL~~~ 39 (66) T pfam01423 5 KLLGKRVTVELKNGRELRGTLKGFDQFMNLVLDDV 39 (66) T ss_pred HCCCCEEEEEECCCCEEEEEEEEECCCCCEEEEEE T ss_conf 74998799999899299999999889995099169 No 30 >cd01726 LSm6 The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. LSm6 is one of at least seven subunits that assemble onto U6 snRNA to form a seven-membered ring structure. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes. Probab=59.14 E-value=13 Score=17.11 Aligned_cols=46 Identities=28% Similarity=0.439 Sum_probs=33.1 Q ss_pred HCCCCEEEEEECCE-EEEEEEEEECCCCCEEEECC----CCCEEEEEEEEEC Q ss_conf 23897799997993-89999998769874899969----9989999963110 Q gi|254780868|r 205 CGIGDLITIKLSYG-SILGRFVGVDDFGYLLLEEK----KGCVRQIFTGDIF 251 (252) Q Consensus 205 ~~~gk~V~i~~~~~-~~~G~~~gId~~G~L~i~~~----~G~~~~i~sGeI~ 251 (252) ..+||.|.++..++ ++.|+.+.+|..=.|.+++. +|+. .-.-|||| T Consensus 7 ~~~gk~V~VkLk~G~ey~G~L~s~D~~MNl~L~~aeE~~~g~~-~~~~G~v~ 57 (67) T cd01726 7 AIIGRPVVVKLNSGVDYRGILACLDGYMNIALEQTEEYVNGQL-KNKYGDAF 57 (67) T ss_pred HCCCCEEEEEECCCCEEEEEEEEECCEEEEEECCEEEEECCEE-CCCCCCEE T ss_conf 6059909999889989899999885608468713499987889-35346399 No 31 >cd01729 LSm7 The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. LSm7 is one of at least seven subunits that assemble onto U6 snRNA to form a seven-membered ring structure. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes. Probab=58.77 E-value=13 Score=17.07 Aligned_cols=37 Identities=22% Similarity=0.350 Sum_probs=30.1 Q ss_pred HHHHCCCCEEEEEECCE-EEEEEEEEECCCCCEEEECC Q ss_conf 87523897799997993-89999998769874899969 Q gi|254780868|r 202 CFACGIGDLITIKLSYG-SILGRFVGVDDFGYLLLEEK 238 (252) Q Consensus 202 ~~~~~~gk~V~i~~~~~-~~~G~~~gId~~G~L~i~~~ 238 (252) +...+++++|.+...++ .+.|++.|-|..=.|++.+. T Consensus 6 dL~k~ldk~V~Vkl~~gR~v~G~L~gfD~~mNLVL~d~ 43 (81) T cd01729 6 DLSKYVDKKIRVKFQGGREVTGILKGYDQLLNLVLDDT 43 (81) T ss_pred HHHHHCCCEEEEEECCCCEEEEEEECCCCCCEEEEEEE T ss_conf 47885596899998799399999970466201776635 No 32 >PRK00092 hypothetical protein; Reviewed Probab=58.17 E-value=14 Score=17.01 Aligned_cols=60 Identities=23% Similarity=0.270 Sum_probs=40.1 Q ss_pred CCCCHHHHHHHHHHHHCCCCEEEEEEC-----CEEEEEEEEEECCCCCEEEECCCC-CEEEEEEEEE Q ss_conf 355648899999875238977999979-----938999999876987489996999-8999996311 Q gi|254780868|r 190 DTGREEIMNLWRCFACGIGDLITIKLS-----YGSILGRFVGVDDFGYLLLEEKKG-CVRQIFTGDI 250 (252) Q Consensus 190 ~~~~~~i~~~~~~~~~~~gk~V~i~~~-----~~~~~G~~~gId~~G~L~i~~~~G-~~~~i~sGeI 250 (252) ..|.+.-+...+.+-.++|+.|+++.. .+.+.|+..++++++ +.+..+++ ....|...+| T Consensus 77 SPGi~RpL~~~~~f~~~~G~~v~v~l~~~~~~~k~~~G~L~~~~~~~-i~l~~~~~~~~~~i~~~~I 142 (153) T PRK00092 77 SPGLDRPLKTAEHFRRFVGREVKVKLREPIDGRKKFQGRLLAVDGET-VTLEVEGKPKVVEIPLDNI 142 (153) T ss_pred CCCCCCCCCCHHHHHHHCCCEEEEEEECCCCCCEEEEEEEEEEECCE-EEEEECCCCEEEEEEHHHH T ss_conf 79999732698999986693899999446689649999999884998-9999889706999736981 No 33 >PRK00737 small nuclear ribonucleoprotein; Provisional Probab=54.83 E-value=15 Score=16.67 Aligned_cols=34 Identities=21% Similarity=0.320 Sum_probs=28.9 Q ss_pred HCCCCEEEEEECCE-EEEEEEEEECCCCCEEEECC Q ss_conf 23897799997993-89999998769874899969 Q gi|254780868|r 205 CGIGDLITIKLSYG-SILGRFVGVDDFGYLLLEEK 238 (252) Q Consensus 205 ~~~gk~V~i~~~~~-~~~G~~~gId~~G~L~i~~~ 238 (252) ..+|+.|.++..++ ++.|+..+.|..=.|.+++. T Consensus 11 ~~~~k~V~V~Lk~gr~~~G~L~~~D~~mNlVL~da 45 (72) T PRK00737 11 NSLNSPVLVRLKGGREFRGELQGYDIHMNLVLANA 45 (72) T ss_pred HHCCCEEEEEECCCCEEEEEEEEECCCCCEEECCE T ss_conf 87498499999899899999999853111798255 No 34 >TIGR01221 rmlC dTDP-4-dehydrorhamnose 3,5-epimerase; InterPro: IPR000888 Deoxythymidine diphosphate (dTDP)-4-keto-6-deoxy-d-hexulose 3, 5-epimerase (RmlC, 5.1.3.13 from EC) is involved in the biosynthesis of dTDP-l-rhamnose, which is an essential component of the bacterial cell wall, converting dTDP-4-keto-6-deoxy-D-glucose to dTDP-4-keto-L-rhamnose. The crystal structure of RmlC from Methanobacterium thermoautotrophicum was determined in the presence and absence of a substrate analogue. RmlC is a homodimer comprising a central jelly roll motif, which extends in two directions into longer beta-sheets. Binding of dTDP is stabilized by ionic interactions to the phosphate group and by a combination of ionic and hydrophobic interactions with the base. The active site, which is located in the centre of the jelly roll, is formed by residues that are conserved in all known RmlC sequence homologues. The active site is lined with a number of charged residues and a number of residues with hydrogen-bonding potentials, which together comprise a potential network for substrate binding and catalysis. The active site is also lined with aromatic residues which provide favorable environments for the base moiety of dTDP and potentially for the sugar moiety of the substrate .; GO: 0008830 dTDP-4-dehydrorhamnose 35-epimerase activity, 0009103 lipopolysaccharide biosynthetic process. Probab=54.23 E-value=12 Score=17.24 Aligned_cols=54 Identities=17% Similarity=0.235 Sum_probs=39.1 Q ss_pred HHHHHHHHHHHH-C-CCCEEEEEECCEEEE--EEEEEECCCCCEEEE-CCCCCEEEEEEEEEC Q ss_conf 488999998752-3-897799997993899--999987698748999-699989999963110 Q gi|254780868|r 194 EEIMNLWRCFAC-G-IGDLITIKLSYGSIL--GRFVGVDDFGYLLLE-EKKGCVRQIFTGDIF 251 (252) Q Consensus 194 ~~i~~~~~~~~~-~-~gk~V~i~~~~~~~~--G~~~gId~~G~L~i~-~~~G~~~~i~sGeI~ 251 (252) ..+.+.|++.++ . .|-++++-..|...+ |+..|| +-+++ ...|++=.+..|||| T Consensus 24 GFF~E~yn~~~F~~~~G~~~~FVQdN~S~S~~GVLRGL----HyQ~~~~~QGKLVRv~~Gevf 82 (181) T TIGR01221 24 GFFFETYNQEAFQEAVGIPVRFVQDNHSKSKKGVLRGL----HYQREEHAQGKLVRVLRGEVF 82 (181) T ss_pred CCHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCE----ECCCCCCCCCCEEEEEEEEEE T ss_conf 61356642467887428876306755226567840110----101888888726789854588 No 35 >cd01722 Sm_F The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm subunit F is capable of forming both homo- and hetero-heptamer ring structures. To form the hetero-heptamer, Sm subunit F initially binds subunits E and G to form a trimer which then assembles onto snRNA along with the D3/B and D1/D2 heterodimers. Probab=53.78 E-value=16 Score=16.56 Aligned_cols=33 Identities=30% Similarity=0.465 Sum_probs=28.0 Q ss_pred HCCCCEEEEEECCE-EEEEEEEEECCCCCEEEEC Q ss_conf 23897799997993-8999999876987489996 Q gi|254780868|r 205 CGIGDLITIKLSYG-SILGRFVGVDDFGYLLLEE 237 (252) Q Consensus 205 ~~~gk~V~i~~~~~-~~~G~~~gId~~G~L~i~~ 237 (252) ...||.|.++..++ ++.|+..++|..=.|.+++ T Consensus 8 ~~~gk~V~V~LK~G~~y~G~L~s~D~~MNl~L~~ 41 (68) T cd01722 8 DLTGKPVIVKLKWGMEYKGTLVSVDSYMNLQLAN 41 (68) T ss_pred HCCCCEEEEEECCCCEEEEEEEEECCCEEEEECC T ss_conf 6079829999889989999999972426559803 No 36 >cd00600 Sm_like The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes. Probab=53.44 E-value=16 Score=16.53 Aligned_cols=33 Identities=24% Similarity=0.487 Sum_probs=28.1 Q ss_pred CCCCEEEEEECCE-EEEEEEEEECCCCCEEEECC Q ss_conf 3897799997993-89999998769874899969 Q gi|254780868|r 206 GIGDLITIKLSYG-SILGRFVGVDDFGYLLLEEK 238 (252) Q Consensus 206 ~~gk~V~i~~~~~-~~~G~~~gId~~G~L~i~~~ 238 (252) .+|++|.+...++ .+.|++.+.|..-.+.+.+. T Consensus 4 ~ig~~V~V~l~~g~~~~G~L~~~D~~mNlvL~~~ 37 (63) T cd00600 4 LVGKTVRVELKDGRVLEGVLVAFDKYMNLVLDDV 37 (63) T ss_pred HCCCEEEEEECCCCEEEEEEEEECCCCCEEECCE T ss_conf 8698599999899599999999888654098679 No 37 >PRK02001 hypothetical protein; Validated Probab=48.77 E-value=19 Score=16.07 Aligned_cols=44 Identities=16% Similarity=0.250 Sum_probs=33.9 Q ss_pred CCCHHHHHHHHHHHHCCCCEEEEEECCE-EEEEEEEEECCCCCEE Q ss_conf 5564889999987523897799997993-8999999876987489 Q gi|254780868|r 191 TGREEIMNLWRCFACGIGDLITIKLSYG-SILGRFVGVDDFGYLL 234 (252) Q Consensus 191 ~~~~~i~~~~~~~~~~~gk~V~i~~~~~-~~~G~~~gId~~G~L~ 234 (252) .|.+.-+...+++-.++|+.|.+...++ .++|+...++++|--+ T Consensus 73 PGl~rPL~~~rqy~kniGr~v~V~~~~g~~~~G~L~~v~~~~i~L 117 (154) T PRK02001 73 AGIGSPLKVLRQYKKNIGRELEVLTKNGRKLEGVLKDADEEKIKV 117 (154) T ss_pred CCCCCCCCCHHHHHHHCCCEEEEEECCCCEEEEEEEEECCCEEEE T ss_conf 999987789899998559889999789989999999963982999 No 38 >pfam10842 DUF2642 Protein of unknown function (DUF2642). This family of proteins with unknown function appear to be restricted to Bacillus spp. Probab=45.75 E-value=21 Score=15.78 Aligned_cols=39 Identities=23% Similarity=0.373 Sum_probs=33.1 Q ss_pred HHHHHHCCCCEEEEEECCEEEEEEEEEECCCCCEEEECCC Q ss_conf 9987523897799997993899999987698748999699 Q gi|254780868|r 200 WRCFACGIGDLITIKLSYGSILGRFVGVDDFGYLLLEEKK 239 (252) Q Consensus 200 ~~~~~~~~gk~V~i~~~~~~~~G~~~gId~~G~L~i~~~~ 239 (252) |...-..+|+.|-+..-.+.+.|+..++-+| +|.++..+ T Consensus 13 yqtlqs~iG~~vvVeT~rgsvrG~L~dVkPD-Hivle~~~ 51 (66) T pfam10842 13 YQTLQSLIGRRVVVQTVRGSVRGRLRDVKPD-HLVIEAGD 51 (66) T ss_pred HHHHHHHHCCEEEEEEECCCEEEEEEEECCC-EEEEECCC T ss_conf 9999987483799998325168789610798-89997189 No 39 >PRK02627 acetylornithine aminotransferase; Provisional Probab=43.71 E-value=23 Score=15.58 Aligned_cols=19 Identities=21% Similarity=0.230 Sum_probs=8.0 Q ss_pred EEEEEECCCC-HHHHHHHHHH Q ss_conf 4999816485-6899999997 Q gi|254780868|r 15 RYEFFDTISS-TNDECMKRAL 34 (252) Q Consensus 15 ~i~~~~~~~S-Tn~~~~~~~~ 34 (252) ++. |-.-+| -++.|.+++. T Consensus 98 ~v~-f~~sGseAvE~AiKlAr 117 (398) T PRK02627 98 KVF-FCNSGAEANEAAIKLAR 117 (398) T ss_pred EEE-EECCHHHHHHHHHHHHH T ss_conf 799-82877999999999999 No 40 >pfam05899 Cupin_3 Protein of unknown function (DUF861). This family consists of several proteins which seem to be specific to plants and bacteria. The function of this family is unknown. Probab=42.28 E-value=17 Score=16.47 Aligned_cols=22 Identities=18% Similarity=0.314 Sum_probs=17.5 Q ss_pred CCCEEEECCCCCEEEEEEEEEC Q ss_conf 8748999699989999963110 Q gi|254780868|r 230 FGYLLLEEKKGCVRQIFTGDIF 251 (252) Q Consensus 230 ~G~L~i~~~~G~~~~i~sGeI~ 251 (252) +|+..|.+++|+.+.+.+||.+ T Consensus 33 eG~v~~t~~~G~~~~~~aGD~~ 54 (74) T pfam05899 33 SGEVTVTPEGGETVELRAGDLV 54 (74) T ss_pred EEEEEEECCCCCEEEECCCCEE T ss_conf 9699999399848998699899 No 41 >cd01732 LSm5 The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. LSm4 is one of at least seven subunits that assemble onto U6 snRNA to form a seven-membered ring structure. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes. Probab=42.05 E-value=24 Score=15.42 Aligned_cols=34 Identities=29% Similarity=0.488 Sum_probs=28.2 Q ss_pred HCCCCEEEEEECC-EEEEEEEEEECCCCCEEEECC Q ss_conf 2389779999799-389999998769874899969 Q gi|254780868|r 205 CGIGDLITIKLSY-GSILGRFVGVDDFGYLLLEEK 238 (252) Q Consensus 205 ~~~gk~V~i~~~~-~~~~G~~~gId~~G~L~i~~~ 238 (252) ..+|++|.+.+.+ .+++|+..|-|+.=.+++++. T Consensus 10 k~igs~Iwi~mk~drE~~GtL~GFDdyvNmVLeDv 44 (76) T cd01732 10 KCIGSRIWIVMKSDKEFVGTLLGFDDYVNMVLEDV 44 (76) T ss_pred HHCCCEEEEEECCCCEEEEEEECCCCEEEEEEEEE T ss_conf 53698799999899199999971000068898306 No 42 >pfam00202 Aminotran_3 Aminotransferase class-III. Probab=41.59 E-value=25 Score=15.38 Aligned_cols=26 Identities=27% Similarity=0.275 Sum_probs=14.5 Q ss_pred CCCC-EEEEEEECCCCHHHHHHHHHHC Q ss_conf 9997-0499981648568999999973 Q gi|254780868|r 10 NPYS-FRYEFFDTISSTNDECMKRALS 35 (252) Q Consensus 10 ~p~~-~~i~~~~~~~STn~~~~~~~~~ 35 (252) .|.+ .++.|..+=..-++.|.+++.. T Consensus 72 ~p~~~~~v~f~~sGseA~e~AiklAr~ 98 (338) T pfam00202 72 TPGGLDRVFFMNSGSEANETAIKLARQ 98 (338) T ss_pred CCCCCCEEEEECCHHHHHHHHHHHHHH T ss_conf 788868799828768999999999999 No 43 >pfam01287 eIF-5a Eukaryotic elongation factor 5A hypusine, DNA-binding OB fold. eIF5A, previously thought to be an initiation factor, has been shown to be required for peptide chain elongation in yeast. Probab=39.90 E-value=26 Score=15.22 Aligned_cols=21 Identities=19% Similarity=0.406 Sum_probs=9.4 Q ss_pred EEEEEECCCCCEEEECCCCCE Q ss_conf 999987698748999699989 Q gi|254780868|r 222 GRFVGVDDFGYLLLEEKKGCV 242 (252) Q Consensus 222 G~~~gId~~G~L~i~~~~G~~ 242 (252) +.+++|++||.+.+-+++|.+ T Consensus 7 yQli~I~~Dg~lsLMde~g~t 27 (69) T pfam01287 7 YQLIDISGDGFLSLMDEDGET 27 (69) T ss_pred EEEEEECCCCEEEEECCCCCC T ss_conf 489866688637766488988 No 44 >TIGR00484 EF-G translation elongation factor G; InterPro: IPR004540 Translation elongation factors are responsible for two main processes during protein synthesis on the ribosome , , . EF1A (or EF-Tu) is responsible for the selection and binding of the cognate aminoacyl-tRNA to the A-site (acceptor site) of the ribosome. EF2 (or EF-G) is responsible for the translocation of the peptidyl-tRNA from the A-site to the P-site (peptidyl-tRNA site) of the ribosome, thereby freeing the A-site for the next aminoacyl-tRNA to bind. Elongation factors are responsible for achieving accuracy of translation and both EF1A and EF2 are remarkably conserved throughout evolution. EF-G is a large, five-domain GTPase that promotes the directional movement of mRNA and tRNAs on the ribosome in a GTP-dependent manner. Unlike other GTPases, but by analogy to the myosin motor, EF-G performs its function of powering translocation in the GDP-bound form; that is, in a kinetically stable ribosome-EF-G(GDP) complex formed by GTP hydrolysis on the ribosome. The complex undergoes an extensive structural rearrangement, in particular affecting the small ribosomal subunit, which leads to mRNA-tRNA movement. Domain 4, which extends from the 'body' of the EF-G molecule much like a lever arm, appears to be essential for the structural transition to take place. In a hypothetical model, GTP hydrolysis induces a conformational change in the G domain of EF-G, which affects the interactions with neighbouring domains within EF-G. The resulting rearrangement of the domains relative to each other generates conformational strain in the ribosome to which EF-G is fixed. Because of structural features of the tRNA-ribosome complex, this conformational strain results in directional tRNA-mRNA movement. The functional parallels between EF-G and motor proteins suggest that EF-G differs from classical G-proteins in that it functions as a force-generating mechanochemical device rather than a conformational switch . Every completed bacterial genome has at least one copy, but some species have additional EF-G-like proteins. The closest homolog to canonical (e.g. Escherichia coli) EF-G in the spirochetes clusters as if it is derived from mitochondrial forms, while a more distant second copy is also present. Synechocystis sp. (strain PCC 6803) has a few proteins more closely related to EF-G than to any other characterised protein. Two of these resemble E. coli EF-G more closely than does the best match from the spirochetes; it may be that both function as authentic EF-G. More information about these proteins can be found at Protein of the Month: Elongation Factors . ; GO: 0003746 translation elongation factor activity, 0005525 GTP binding, 0006414 translational elongation, 0005622 intracellular. Probab=38.61 E-value=10 Score=17.76 Aligned_cols=15 Identities=33% Similarity=0.461 Sum_probs=7.1 Q ss_pred CCCCCCCCCCCCCCC Q ss_conf 446783679985316 Q gi|254780868|r 47 CQTAGRGRRDNKWIS 61 (252) Q Consensus 47 ~Qt~GrGr~gr~W~s 61 (252) +|+-||||.||.|.. T Consensus 508 kQSGGrGQyG~V~i~ 522 (705) T TIGR00484 508 KQSGGRGQYGHVVIE 522 (705) T ss_pred ECCCCCCCCEEEEEE T ss_conf 306898730168998 No 45 >PRK06009 flgD flagellar basal body rod modification protein; Reviewed Probab=38.34 E-value=28 Score=15.06 Aligned_cols=46 Identities=22% Similarity=0.099 Sum_probs=31.2 Q ss_pred HHHHHCCCCEEEEEECCEEEEEEEEEECC-CCCEEEECCCCCEEEEEEE Q ss_conf 98752389779999799389999998769-8748999699989999963 Q gi|254780868|r 201 RCFACGIGDLITIKLSYGSILGRFVGVDD-FGYLLLEEKKGCVRQIFTG 248 (252) Q Consensus 201 ~~~~~~~gk~V~i~~~~~~~~G~~~gId~-~G~L~i~~~~G~~~~i~sG 248 (252) .....++||.|. ..++.++|++..|.- ++..+...+||....+-.| T Consensus 85 ~qA~~lIGktVt--~~Dg~vtG~V~sV~~~~dg~~a~l~dG~~v~~~~g 131 (135) T PRK06009 85 SQAEGLIGRTVT--SADGSITGVVKSVTIYSDGIIATLEDGKKVLLGPG 131 (135) T ss_pred HHHHHHHCEEEE--CCCCEEEEEEEEEEEECCCEEEEEECCCEEEECCC T ss_conf 878876280797--07974899999999938944999838988983787 No 46 >TIGR00963 secA preprotein translocase, SecA subunit; InterPro: IPR000185 Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component. . From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins. The translocase protein subunits are encoded on the bacterial chromosome. The translocase itself comprises 7 proteins, including a chaperone protein (SecB), an ATPase (SecA), an integral membrane complex (SecCY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) . The chaperone protein SecB is a highly acidic homotetrameric protein that exists as a "dimer of dimers" in the bacterial cytoplasm. SecB maintains preproteins in an unfolded state after translation, and targets these to the peripheral membrane protein ATPase SecA for secretion . SecA is a cytoplasmic protein of 800 to 960 amino acid residues. Homologs of secA are also encoded in the chloroplast genome of some algae as well as in the nuclear genome of plants . It could be involved in the intraorganellar protein transport into thylakoids.; GO: 0005524 ATP binding, 0006605 protein targeting, 0006886 intracellular protein transport. Probab=34.52 E-value=16 Score=16.50 Aligned_cols=14 Identities=21% Similarity=-0.078 Sum_probs=5.9 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 13779999999999 Q gi|254780868|r 84 LSFAIAVAMRSVIA 97 (252) Q Consensus 84 l~~~~~~ai~~~l~ 97 (252) ||=.+|.|-.++.+ T Consensus 373 LsGMTGTA~TE~~E 386 (904) T TIGR00963 373 LSGMTGTAKTEEEE 386 (904) T ss_pred HCCCCCCHHHHHHH T ss_conf 44687756899998 No 47 >COG4189 Predicted transcriptional regulator [Transcription] Probab=32.90 E-value=17 Score=16.35 Aligned_cols=49 Identities=16% Similarity=0.253 Sum_probs=28.0 Q ss_pred EEEEECCCEEEECCCCCCCCCCC--CCC--------CCCCEEEEEECCCCCCCCCCCCC Q ss_conf 24621465067102211222233--232--------25832676521544444642233 Q gi|254780868|r 106 VKIKWPNDILVFQRKIAGILIET--LNL--------KNGLQAVVVGIGLNVKHCPVDTP 154 (252) Q Consensus 106 ~~iKWPNDi~~~~kKi~GIL~E~--~~~--------~~~~~~~viGiGiNi~~~~~~~~ 154 (252) +.+|.||-.-+.++++.++=.-+ .+. +..+..-+-|+-|-+..+|.|+. T Consensus 158 vEYkfPNna~~~g~nv~~lE~smElSSE~P~Tn~nWPSDIs~~iNg~kig~WTsPgDfG 216 (308) T COG4189 158 VEYKFPNNAKHSGRNVKELEFSMELSSEVPGTNDNWPSDISFYINGVKIGIWTSPGDFG 216 (308) T ss_pred EEEECCCCHHHCCCCCCCEEEEEEECCCCCCCCCCCCCCEEEEECCEEEEEECCCCCCC T ss_conf 79988872110467742169998842568987789986449998788974552876545 No 48 >PRK04950 putative solute/DNA competence effector; Provisional Probab=32.51 E-value=35 Score=14.48 Aligned_cols=44 Identities=7% Similarity=0.138 Sum_probs=36.8 Q ss_pred HHHCCCCEEEEEECCEEEEEEEEEECCCCCEEEECCCCCEEEEEE Q ss_conf 752389779999799389999998769874899969998999996 Q gi|254780868|r 203 FACGIGDLITIKLSYGSILGRFVGVDDFGYLLLEEKKGCVRQIFT 247 (252) Q Consensus 203 ~~~~~gk~V~i~~~~~~~~G~~~gId~~G~L~i~~~~G~~~~i~s 247 (252) .....|+.|.+..+.....|++..|+.|| ..|+.++|-.-.+.+ T Consensus 164 ~~L~vgq~VkVk~Gk~~~~Ati~ei~Kd~-V~VqL~sGl~~kV~a 207 (212) T PRK04950 164 SELTVGQAVKVKAGKSPMDATVLEINKDD-VRVQLDSGLSMKVRA 207 (212) T ss_pred HHHCCCCEEEEEECCCCCCEEEEEEECCC-EEEEECCCCEEEEEH T ss_conf 99325988999606887775899982587-799927996899856 No 49 >cd01730 LSm3 The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. LSm3 is one of at least seven subunits that assemble onto U6 snRNA to form a seven-membered ring structure. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes. Probab=31.61 E-value=36 Score=14.39 Aligned_cols=33 Identities=18% Similarity=0.327 Sum_probs=27.5 Q ss_pred CCCCEEEEEECCE-EEEEEEEEECCCCCEEEECC Q ss_conf 3897799997993-89999998769874899969 Q gi|254780868|r 206 GIGDLITIKLSYG-SILGRFVGVDDFGYLLLEEK 238 (252) Q Consensus 206 ~~gk~V~i~~~~~-~~~G~~~gId~~G~L~i~~~ 238 (252) .+++.|.+...++ ++.|++.+-|..-.|++.+. T Consensus 9 sl~~~V~VklrggRel~G~L~afD~h~NlVL~d~ 42 (82) T cd01730 9 SLDERVYVKLRGDRELRGRLHAYDQHLNMILGDV 42 (82) T ss_pred HCCCEEEEEECCCCEEEEEEEEECCEEEEEEECC T ss_conf 2898699998799799999997340226885163 No 50 >cd01723 LSm4 The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. LSm4 is one of at least seven subunits that assemble onto U6 snRNA to form a seven-membered ring structure. Sm-like proteins exist in archaea as well as prokaryotes that form heptameric and hexameric ring structures similar to those found in eukaryotes. Probab=31.57 E-value=36 Score=14.39 Aligned_cols=46 Identities=20% Similarity=0.315 Sum_probs=32.6 Q ss_pred HCCCCEEEEEECCE-EEEEEEEEECCCCCEEEEC-----CCCCEEEEEEEEEC Q ss_conf 23897799997993-8999999876987489996-----99989999963110 Q gi|254780868|r 205 CGIGDLITIKLSYG-SILGRFVGVDDFGYLLLEE-----KKGCVRQIFTGDIF 251 (252) Q Consensus 205 ~~~gk~V~i~~~~~-~~~G~~~gId~~G~L~i~~-----~~G~~~~i~sGeI~ 251 (252) ...|+.|.++..++ .+.|+...+|+.=.+.+++ .+|. +....+++| T Consensus 8 ~a~g~~V~VELKng~~~~G~L~~~D~~MN~~L~~v~~t~~~G~-~~~~~~~v~ 59 (76) T cd01723 8 TAQNHPMLVELKNGETYNGHLVNCDNWMNIHLREVICTSKDGD-KFWKMPECY 59 (76) T ss_pred HCCCCEEEEEECCCCEEEEEEEEEECCCCCEEEEEEEECCCCC-CCCCCCEEE T ss_conf 5899899999889979999999973435819989999988998-625487299 No 51 >COG0779 Uncharacterized protein conserved in bacteria [Function unknown] Probab=31.23 E-value=36 Score=14.35 Aligned_cols=58 Identities=29% Similarity=0.336 Sum_probs=40.0 Q ss_pred CCCHHHHHHHHHHHHCCCCEEEEEE-----CCEEEEEEEEEECCCCCEEEECCCCCEEEEEEEEE Q ss_conf 5564889999987523897799997-----99389999998769874899969998999996311 Q gi|254780868|r 191 TGREEIMNLWRCFACGIGDLITIKL-----SYGSILGRFVGVDDFGYLLLEEKKGCVRQIFTGDI 250 (252) Q Consensus 191 ~~~~~i~~~~~~~~~~~gk~V~i~~-----~~~~~~G~~~gId~~G~L~i~~~~G~~~~i~sGeI 250 (252) .|.+.-+..-..+..+.|+.|.+.. +.+.++|+++.++.+- +.++ -+|+.-.|...+| T Consensus 82 PGldRpL~~~~~f~r~~G~~Vkv~l~~~~~~~k~~~G~i~~~d~~~-v~~~-~~~k~v~Ip~~~i 144 (153) T COG0779 82 PGLDRPLKTAEHFARFIGEKVKVKLRLPIEGRKKFEGKIVAVDGET-VTLE-VDGKEVEIPFSDI 144 (153) T ss_pred CCCCCCCCCHHHHHHHCCCEEEEEEECCCCCCEEEEEEEEEECCCE-EEEE-ECCEEEEEECCCC T ss_conf 9988775798999984695899999654488407889999972986-9999-7784899875631 No 52 >CHL00058 petD cytochrome b6/f complex subunit IV Probab=30.49 E-value=13 Score=17.21 Aligned_cols=20 Identities=45% Similarity=0.870 Sum_probs=11.3 Q ss_pred CCCEEEEEECCCEEE-ECCCCCCC Q ss_conf 421024621465067-10221122 Q gi|254780868|r 102 VGTDVKIKWPNDILV-FQRKIAGI 124 (252) Q Consensus 102 ~~~~~~iKWPNDi~~-~~kKi~GI 124 (252) +|.+ -||||+++ ----+-|. T Consensus 27 ygep---awpnd~ly~fpvvi~g~ 47 (160) T CHL00058 27 YGEP---AWPNDLLYIFPVVILGT 47 (160) T ss_pred CCCC---CCCCCHHHHHHHHHHHH T ss_conf 6887---88651355525687557 No 53 >COG5414 TATA-binding protein-associated factor [Transcription] Probab=28.60 E-value=40 Score=14.07 Aligned_cols=35 Identities=20% Similarity=0.225 Sum_probs=25.5 Q ss_pred HHHHHHHHHCCCCCEEEEEECCC----EEEECCCCCCCCCC Q ss_conf 99999997445421024621465----06710221122223 Q gi|254780868|r 91 AMRSVIASTLPVGTDVKIKWPND----ILVFQRKIAGILIE 127 (252) Q Consensus 91 ai~~~l~~~~~~~~~~~iKWPND----i~~~~kKi~GIL~E 127 (252) -+.+++++ +. +..+.+||-|| +-+|++|-++||+. T Consensus 88 ~V~~~~es-G~-~s~I~ik~kd~R~aVvt~N~~~Y~ailVd 126 (392) T COG5414 88 HVVDLGES-GD-YSGITIKIKDDRSAVVTHNNKKYPAILVD 126 (392) T ss_pred HHHHHHHC-CC-CCCCEEEECCCCEEEEEECCCCCCEEEEE T ss_conf 99977645-77-66715886057347998878064516750 No 54 >KOG1404 consensus Probab=28.32 E-value=30 Score=14.90 Aligned_cols=45 Identities=27% Similarity=0.240 Sum_probs=27.0 Q ss_pred HHHHCCCCCCEEEEEEECCCCH-HHHHHHHHHCCCCCCEEEEECCCC Q ss_conf 2475179997049998164856-899999997399997899982446 Q gi|254780868|r 4 NLSECQNPYSFRYEFFDTISST-NDECMKRALSGDLGNLWIVASCQT 49 (252) Q Consensus 4 ~l~~~~~p~~~~i~~~~~~~ST-n~~~~~~~~~~~~~~~~v~A~~Qt 49 (252) +..-...|.+.++.||-.-+|- |+.|..++... ....=|+|.++. T Consensus 99 e~L~s~~P~~l~~vfF~nsGsEANelal~mar~Y-t~~~diIa~r~~ 144 (442) T KOG1404 99 EALVSKLPGDLKVVFFVNSGSEANELALKMARLY-TGNLDIIARRNS 144 (442) T ss_pred HHHHHHCCCCCEEEEEECCCCHHHHHHHHHHHHH-CCCCEEEEEECC T ss_conf 9999868877128999558745889999999986-297158875055 No 55 >PRK10595 SOS cell division inhibitor; Provisional Probab=27.85 E-value=30 Score=14.89 Aligned_cols=65 Identities=15% Similarity=0.309 Sum_probs=37.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCHH----HHHHHHHHHHHHHHCCCCCEEEEEE-CCCEEEEC Q ss_conf 83679985316633334132235322400021013----7799999999997445421024621-46506710 Q gi|254780868|r 51 GRGRRDNKWISDKGNLYASLLLIDSISKDSLTLLS----FAIAVAMRSVIASTLPVGTDVKIKW-PNDILVFQ 118 (252) Q Consensus 51 GrGr~gr~W~s~~G~l~~S~~~~~~~~~~~~~~l~----~~~~~ai~~~l~~~~~~~~~~~iKW-PNDi~~~~ 118 (252) |.-.|=--|.+|..-|.=.++....+|.++...++ ..+.-|+.+||+. -+..+=|-| |+++=-.+ T Consensus 56 g~qSRWlLWLtPqqKLSR~Wl~~sGLPl~KvmQl~~~~p~~Tv~aMekAL~s---GnySvVL~WLp~eL~~~~ 125 (167) T PRK10595 56 GQQSRWQLWLTPQQKLSREWVQSSGLPLTKVMQISQLSPCHTVESMERALRT---GNYSVVIGWLPDELTEEE 125 (167) T ss_pred CCHHHEEEEECHHHHHHHHHHHHCCCCHHHHHHHCCCCCHHHHHHHHHHHHC---CCCEEEEEECCCCCCHHH T ss_conf 3102200021600133289999739972574201146805279999999854---983489972673369899 No 56 >cd05845 Ig2_L1-CAM_like Second immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM) and similar proteins. Ig2_L1-CAM_like: domain similar to the second immunoglobulin (Ig)-like domain of the L1 cell adhesion molecule (CAM). L1 belongs to the L1 subfamily of cell adhesion molecules (CAMs) and is comprised of an extracellular region having six Ig-like domains, five fibronectin type III domains, a transmembrane region and an intracellular domain. L1 is primarily expressed in the nervous system and is involved in its development and function. L1 is associated with an X-linked recessive disorder, X-linked hydrocephalus, MASA syndrome, or spastic paraplegia type 1, that involves abnormalities of axonal growth. Probab=27.74 E-value=42 Score=13.98 Aligned_cols=26 Identities=19% Similarity=0.278 Sum_probs=18.2 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 79985316633334132235322400 Q gi|254780868|r 54 RRDNKWISDKGNLYASLLLIDSISKD 79 (252) Q Consensus 54 r~gr~W~s~~G~l~~S~~~~~~~~~~ 79 (252) +..|.-.++.||||||=+.+.+...+ T Consensus 48 ~d~Rvt~~~~GnLyFsnV~~~D~~~d 73 (95) T cd05845 48 QDERVSMGQNGNLYFANVEEQDSHPD 73 (95) T ss_pred CCCEEEECCCCCEEEEEEEECCCCCC T ss_conf 56507877899889886632567898 No 57 >cd01724 Sm_D1 The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm subunit D1 heterodimerizes with subunit D2 and three such heterodimers form a hexameric ring structure with alternating D1 and D2 subunits. The D1 - D2 heterodimer also assembles into a heptameric ring containing DB, D3, E, F, and G subunits. Sm-like proteins exist in archaea as well as prokaryotes which form heptameric and hexameric ring structures similar to those found in eukaryotes. Probab=27.70 E-value=42 Score=13.97 Aligned_cols=32 Identities=28% Similarity=0.344 Sum_probs=26.6 Q ss_pred CCCCEEEEEECCE-EEEEEEEEECCCCCEEEEC Q ss_conf 3897799997993-8999999876987489996 Q gi|254780868|r 206 GIGDLITIKLSYG-SILGRFVGVDDFGYLLLEE 237 (252) Q Consensus 206 ~~gk~V~i~~~~~-~~~G~~~gId~~G~L~i~~ 237 (252) ..|+.|+++..++ .+.|+..++|+.-.+.+++ T Consensus 9 l~g~~VtVELKng~~~~G~L~~vd~~MN~~L~~ 41 (90) T cd01724 9 LTNETVTIELKNGTIVHGTITGVDPSMNTHLKN 41 (90) T ss_pred CCCCEEEEEECCCCEEEEEEEEECCCCEEEEEE T ss_conf 689879999879979999998813782018988 No 58 >pfam05614 DUF782 Circovirus protein of unknown function (DUF782). This family consists of porcine and bovine circovirus proteins of unknown function. Probab=27.36 E-value=24 Score=15.52 Aligned_cols=10 Identities=40% Similarity=1.046 Sum_probs=6.0 Q ss_pred EEEC-CCEEEE Q ss_conf 6214-650671 Q gi|254780868|r 108 IKWP-NDILVF 117 (252) Q Consensus 108 iKWP-NDi~~~ 117 (252) -.|| ||+|++ T Consensus 31 prwphndvyig 41 (104) T pfam05614 31 PRWPHNDVYIG 41 (104) T ss_pred CCCCCCCEEEC T ss_conf 88888854774 No 59 >TIGR01156 cytb6/f_IV cytb6/f complex subunit IV; InterPro: IPR005870 This entry describes subunit IV of the cytochrome b6/f complex. The cyt b6/f complex is central to the functions of oxygenic photosynthetic electron transport in cyanobacteria and its equivalents in algae and higher plants, mediating electron transfer between photosystem II (PSII) and photosystem I (PSI) reaction centre complexes. The cyt b6f complex consists of a cytochrome b core, in addition to cytochrome f, Rieske 2Fe-2S, and subunits IV, V, VI, VII. The complex has a central quinone exchange cavity, which is defined by two monomers that resemble the respiratory cytochrome bc1 complex . Cyt b6f has three prosthetic groups not found in cytbc1: chlorophyll-a, beta-carotene and a structurally unique covalently bound haem. The haem is thought to function in cyclic electron transport via anionic ferredoxin. Each monomer of the complex contains a molecule of chlorophyll-a and beta-carotene. Subunit IV is one of the principal subunits for the binding of the redox prosthetic groups. ; GO: 0045156 electron transporter transferring electrons within the cyclic electron transport pathway of photosynthesis activity, 0009775 photosynthetic electron transport in cytochrome b6/f. Probab=26.35 E-value=11 Score=17.52 Aligned_cols=12 Identities=50% Similarity=1.118 Sum_probs=8.6 Q ss_pred CCCEEEEEECCCEEE Q ss_conf 421024621465067 Q gi|254780868|r 102 VGTDVKIKWPNDILV 116 (252) Q Consensus 102 ~~~~~~iKWPNDi~~ 116 (252) +|.+ -||||+++ T Consensus 27 yGeP---aWPndlly 38 (160) T TIGR01156 27 YGEP---AWPNDLLY 38 (160) T ss_pred CCCC---CCHHHHHH T ss_conf 7888---76146899 No 60 >COG1958 LSM1 Small nuclear ribonucleoprotein (snRNP) homolog [Transcription] Probab=24.45 E-value=48 Score=13.60 Aligned_cols=34 Identities=26% Similarity=0.460 Sum_probs=28.1 Q ss_pred HCCCCEEEEEECCE-EEEEEEEEECCCCCEEEECC Q ss_conf 23897799997993-89999998769874899969 Q gi|254780868|r 205 CGIGDLITIKLSYG-SILGRFVGVDDFGYLLLEEK 238 (252) Q Consensus 205 ~~~gk~V~i~~~~~-~~~G~~~gId~~G~L~i~~~ 238 (252) ..+++.|.+...++ ++.|+..+.|+.=.|.+++. T Consensus 14 ~~~~~~V~V~lk~g~~~~G~L~~~D~~mNlvL~d~ 48 (79) T COG1958 14 KLLNKRVLVKLKNGREYRGTLVGFDQYMNLVLDDV 48 (79) T ss_pred HHCCCEEEEEECCCCEEEEEEEEECCCCCEEEECE T ss_conf 86298899998799599999999847541899142 No 61 >KOG1486 consensus Probab=23.59 E-value=50 Score=13.50 Aligned_cols=102 Identities=22% Similarity=0.251 Sum_probs=48.0 Q ss_pred CCCCCEEEEEEECCCCHHHHHHHHHHCCC--CCC---------------EEEEECCCCCCCCCCCCCCCCCCC--CCCCC Q ss_conf 79997049998164856899999997399--997---------------899982446783679985316633--33413 Q gi|254780868|r 9 QNPYSFRYEFFDTISSTNDECMKRALSGD--LGN---------------LWIVASCQTAGRGRRDNKWISDKG--NLYAS 69 (252) Q Consensus 9 ~~p~~~~i~~~~~~~STn~~~~~~~~~~~--~~~---------------~~v~A~~Qt~GrGr~gr~W~s~~G--~l~~S 69 (252) +.|.--+-..+..++||..++..+..... .+| -++--..|-+|||| .=+|... .|.+- T Consensus 69 GfPSVGKStlLs~iT~T~SeaA~yeFTTLtcIpGvi~y~ga~IQllDLPGIieGAsqgkGRGR---QviavArtaDlilM 145 (364) T KOG1486 69 GFPSVGKSTLLSKITSTHSEAASYEFTTLTCIPGVIHYNGANIQLLDLPGIIEGASQGKGRGR---QVIAVARTADLILM 145 (364) T ss_pred CCCCCCHHHHHHHHHCCHHHHHCCEEEEEEEECCEEEECCCEEEEECCCCCCCCCCCCCCCCC---EEEEEEECCCEEEE T ss_conf 488744787888764110222110246787303168766834799627530002113788772---68887403658999 Q ss_pred CCCCCCCCCHHCCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCCEEEECCCCCCCCC Q ss_conf 223532240002101377999999999974454210246214650671022112222 Q gi|254780868|r 70 LLLIDSISKDSLTLLSFAIAVAMRSVIASTLPVGTDVKIKWPNDILVFQRKIAGILI 126 (252) Q Consensus 70 ~~~~~~~~~~~~~~l~~~~~~ai~~~l~~~~~~~~~~~iKWPNDi~~~~kKi~GIL~ 126 (252) ++-...+..+- ..+-+-|++. |+... |=|-+||+..||-|||=. T Consensus 146 -vLDatk~e~qr--------~~le~ELe~v---GiRLN-k~~Pniy~k~kk~gGi~f 189 (364) T KOG1486 146 -VLDATKSEDQR--------EILEKELEAV---GIRLN-KRKPNIYFKKKKTGGISF 189 (364) T ss_pred -EECCCCCHHHH--------HHHHHHHHHH---CEECC-CCCCCEEEEEECCCCEEE T ss_conf -95177615678--------9999999974---13414-789975888743687677 No 62 >COG4466 Veg Uncharacterized protein conserved in bacteria [Function unknown] Probab=23.28 E-value=50 Score=13.46 Aligned_cols=56 Identities=20% Similarity=0.307 Sum_probs=34.8 Q ss_pred HHHHHHHHHHCCCCEEEEEECCEEE-----EEEEEEECCCCCEEEE--CCCC--CEEEEEEEEECC Q ss_conf 8999998752389779999799389-----9999987698748999--6999--899999631109 Q gi|254780868|r 196 IMNLWRCFACGIGDLITIKLSYGSI-----LGRFVGVDDFGYLLLE--EKKG--CVRQIFTGDIFT 252 (252) Q Consensus 196 i~~~~~~~~~~~gk~V~i~~~~~~~-----~G~~~gId~~G~L~i~--~~~G--~~~~i~sGeI~~ 252 (252) +.+-=.+...+.|+.|.+..+++.. .|+....=+.=. +|+ ++.| ....++..||+| T Consensus 7 i~~IK~~i~ah~G~~v~lk~ngGRKk~~~r~G~L~EtYpSvF-Iiel~~d~~~~~~vSYsYsDILT 71 (80) T COG4466 7 IVDIKESIDAHLGERVTLKANGGRKKTIERSGILIETYPSVF-IIELDQDEGNFERVSYSYSDILT 71 (80) T ss_pred HHHHHHHHHHCCCCEEEEEECCCCEEEEHHCEEEEEECCCEE-EEEECCCCCCCEEEEEEEHHHEE T ss_conf 889999987355838999935981002021407866437289-99943667983699887214020 No 63 >TIGR03499 FlhF flagellar biosynthetic protein FlhF. Probab=23.03 E-value=51 Score=13.43 Aligned_cols=38 Identities=13% Similarity=0.121 Sum_probs=28.3 Q ss_pred CCHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCC Q ss_conf 85689999999739999789998244678367998531 Q gi|254780868|r 23 SSTNDECMKRALSGDLGNLWIVASCQTAGRGRRDNKWI 60 (252) Q Consensus 23 ~STn~~~~~~~~~~~~~~~~v~A~~Qt~GrGr~gr~W~ 60 (252) .+++.+|.+.++...-+..+|++.++..++|..|+.++ T Consensus 8 a~~~~eAl~~ir~eLG~DAvILstr~v~~~g~~gk~~v 45 (282) T TIGR03499 8 APTMREALAKVKEELGPDAVILSTRKVRKGGLFGKKFV 45 (282) T ss_pred CCCHHHHHHHHHHHHCCCEEEEECEEECCCCCCCCCEE T ss_conf 29999999999999789949996938789986788729 No 64 >pfam02576 DUF150 Uncharacterized BCR, YhbC family COG0779. Probab=22.92 E-value=51 Score=13.42 Aligned_cols=48 Identities=27% Similarity=0.367 Sum_probs=34.4 Q ss_pred CCCHHHHHHHHHHHHCCCCEEEEEECC-----EEEEEEEEEECCCCCEEEECCC Q ss_conf 556488999998752389779999799-----3899999987698748999699 Q gi|254780868|r 191 TGREEIMNLWRCFACGIGDLITIKLSY-----GSILGRFVGVDDFGYLLLEEKK 239 (252) Q Consensus 191 ~~~~~i~~~~~~~~~~~gk~V~i~~~~-----~~~~G~~~gId~~G~L~i~~~~ 239 (252) .|.+.-+...+++..++|+.|++.... ..++|+..++++++ +.++.++ T Consensus 70 PGi~RpL~~~~~f~~~~G~~v~v~l~~~~~~~k~~~G~L~~~~~~~-i~l~~~~ 122 (141) T pfam02576 70 PGLERPLKTERHFARFIGKLVKVSLKEPIEGRKNFTGKLLEVDGDT-VTIEVDD 122 (141) T ss_pred CCCCCCCCCHHHHHHHCCCEEEEEEECCCCCEEEEEEEEEEEECCE-EEEEECC T ss_conf 9999834888999986594899999246699389999999886999-9999858 No 65 >TIGR02295 HpaD 3,4-dihydroxyphenylacetate 2,3-dioxygenase; InterPro: IPR011981 This enzyme catalyses the second step in the degradation of 4-hydroxyphenylacetate to succinate and pyruvate. 4-hydroxyphenylacetate arises from the degradation of tyrosine. The substrate, 3,4-dihydroxyphenylacetate (homoprotocatechuate) arises from the action of a hydroxylase on 4-hydroxyphenylacetate. The aromatic ring is opened by this dioxygenase exo to the 3,4-diol resulting in 2-hydroxy-5-carboxymethylmuconate semialdehyde.. Probab=22.65 E-value=52 Score=13.38 Aligned_cols=28 Identities=32% Similarity=0.565 Sum_probs=24.5 Q ss_pred EEECCCCCEEEECCCCCEEEEEEEEECC Q ss_conf 9876987489996999899999631109 Q gi|254780868|r 225 VGVDDFGYLLLEEKKGCVRQIFTGDIFT 252 (252) Q Consensus 225 ~gId~~G~L~i~~~~G~~~~i~sGeI~~ 252 (252) .||++--.|=|+|.||....|+++|=+| T Consensus 249 HG~SNAffLYlrDPDghRIElYt~DY~t 276 (312) T TIGR02295 249 HGVSNAFFLYLRDPDGHRIELYTGDYLT 276 (312) T ss_pred CCCCCCEEEEEECCCCCEEEEECCCCEE T ss_conf 6522200433346899789998288225 No 66 >PRK13476 cytochrome b6-f complex subunit IV; Provisional Probab=22.05 E-value=21 Score=15.87 Aligned_cols=13 Identities=46% Similarity=1.058 Sum_probs=9.0 Q ss_pred CCCEEEEEECCCEEEE Q ss_conf 4210246214650671 Q gi|254780868|r 102 VGTDVKIKWPNDILVF 117 (252) Q Consensus 102 ~~~~~~iKWPNDi~~~ 117 (252) +|.+ -||||++.. T Consensus 27 ygep---awpndlly~ 39 (160) T PRK13476 27 YGEP---AWPNDLLYI 39 (160) T ss_pred CCCC---CCCCHHHHH T ss_conf 7888---886227998 No 67 >PRK01194 V-type ATP synthase subunit E; Provisional Probab=21.98 E-value=53 Score=13.30 Aligned_cols=75 Identities=20% Similarity=0.197 Sum_probs=42.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH-HCCCCEEEEEECCE------EEEEEEEEECCCCCEEEECCCCC Q ss_conf 789998777999999998730355648899999875-23897799997993------89999998769874899969998 Q gi|254780868|r 169 LKDIFSLLFQDVARVLDLWKKDTGREEIMNLWRCFA-CGIGDLITIKLSYG------SILGRFVGVDDFGYLLLEEKKGC 241 (252) Q Consensus 169 ~~~l~~~l~~~l~~~~~~~~~~~~~~~i~~~~~~~~-~~~gk~V~i~~~~~------~~~G~~~gId~~G~L~i~~~~G~ 241 (252) +.+++...+..+...+..+......+.++......+ -.+|+.+.+..... .....+..+|..|+.+++..||. T Consensus 78 ~~e~ld~~~~~~~e~i~~~~~~k~Y~~lL~kLi~~A~k~lg~d~iI~~~~~D~~~~~~~~~~~~~id~~GGviaeSkDGs 157 (185) T PRK01194 78 RREILKDYLDIAYEHLMNITKSKEYDSILNKMIEVAIKTLGEDCIIKVSESDKKKINNAKIKFADIDPYGGILAYSRDGK 157 (185) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHHHCCCCCCCCCCEEEEECCCC T ss_conf 99999999999999986687775589999999999998607864998689889998755540223555676689925885 Q ss_pred EE Q ss_conf 99 Q gi|254780868|r 242 VR 243 (252) Q Consensus 242 ~~ 243 (252) +. T Consensus 158 v~ 159 (185) T PRK01194 158 RE 159 (185) T ss_pred EE T ss_conf 68 No 68 >PRK07481 hypothetical protein; Provisional Probab=21.76 E-value=54 Score=13.27 Aligned_cols=31 Identities=26% Similarity=0.340 Sum_probs=15.1 Q ss_pred HHHHCCCCCCEEEEEEECCCC-HHHHHHHHHH Q ss_conf 247517999704999816485-6899999997 Q gi|254780868|r 4 NLSECQNPYSFRYEFFDTISS-TNDECMKRAL 34 (252) Q Consensus 4 ~l~~~~~p~~~~i~~~~~~~S-Tn~~~~~~~~ 34 (252) .|.++..|.+..-.+|-.-+| .|+.|.+.+. T Consensus 94 ~L~~~~~p~~~~rv~f~~SGseA~E~AiKlAr 125 (448) T PRK07481 94 KLIDMFEPEGMRRVFFSSGGSDSVETALKLAR 125 (448) T ss_pred HHHHHCCCCCCCEEEECCCHHHHHHHHHHHHH T ss_conf 99985187787889974977999999999999 No 69 >KOG3159 consensus Probab=21.41 E-value=55 Score=13.23 Aligned_cols=77 Identities=27% Similarity=0.304 Sum_probs=41.1 Q ss_pred EEEECCCCCCCCCCCCCCC-------------------------CCCCCCCCCCCCCCCCCCHHCCCHHHHHHHHHHHHH Q ss_conf 9998244678367998531-------------------------663333413223532240002101377999999999 Q gi|254780868|r 42 WIVASCQTAGRGRRDNKWI-------------------------SDKGNLYASLLLIDSISKDSLTLLSFAIAVAMRSVI 96 (252) Q Consensus 42 ~v~A~~Qt~GrGr~gr~W~-------------------------s~~G~l~~S~~~~~~~~~~~~~~l~~~~~~ai~~~l 96 (252) ++.+..-..=.||+.+.|+ -..|||.+|++..-. ..+...-+++ +.+++ T Consensus 43 l~w~N~p~vVIGRhQNpw~E~nv~~~~e~~I~liRR~SGGGTVyHDlGNLN~S~lt~re-~~~r~~nlk~-----iv~AL 116 (336) T KOG3159 43 LLWINDPCVVIGRHQNPWQEANVALLRENNIPLIRRFSGGGTVYHDLGNLNYSLLTNRE-KFDRKENLKI-----IVRAL 116 (336) T ss_pred EEEECCCEEEECCCCCCCEECCHHHHHHCCCEEEEEECCCCEEEEECCCEEEEEECCHH-HCCCCCCHHH-----HHHHH T ss_conf 99808954898258996544148889756973799845996599834752489971578-7676444378-----99986 Q ss_pred HHHCCCCCEEEEEECCCEEEE--CCCCCCCC Q ss_conf 974454210246214650671--02211222 Q gi|254780868|r 97 ASTLPVGTDVKIKWPNDILVF--QRKIAGIL 125 (252) Q Consensus 97 ~~~~~~~~~~~iKWPNDi~~~--~kKi~GIL 125 (252) ... ....++.+-=--|+.+. ++|++|-- T Consensus 117 n~~-e~~v~v~~nqR~Di~l~~g~rKiSGtA 146 (336) T KOG3159 117 NGD-EPFVKVNLNQRDDIVLDFGQRKISGTA 146 (336) T ss_pred CCC-CCEEEECCCCCCCCEECCCCCEECCCH T ss_conf 067-832761444234512305673334415 No 70 >PRK08555 consensus Probab=21.21 E-value=55 Score=13.20 Aligned_cols=30 Identities=20% Similarity=0.140 Sum_probs=16.0 Q ss_pred CCCCE--EEEEEECCCCHHHHHHHHHHCCCCC Q ss_conf 99970--4999816485689999999739999 Q gi|254780868|r 10 NPYSF--RYEFFDTISSTNDECMKRALSGDLG 39 (252) Q Consensus 10 ~p~~~--~i~~~~~~~STn~~~~~~~~~~~~~ 39 (252) .|.++ ++.|-.+=..-++.|.+.++..... T Consensus 108 ~p~~~~~~v~f~~SGSEAvE~AiklAr~~tgr 139 (443) T PRK08555 108 APGDFEKKVFFSNSGTEANEAALKIAKWYTGR 139 (443) T ss_pred CCCCCCCEEEEECCHHHHHHHHHHHHHHHCCC T ss_conf 77565655998087788999999998873799 No 71 >TIGR01570 A_thal_3588 uncharacterized plant-specific domain TIGR01570; InterPro: IPR006460 This family of hypothetical plant proteins are defined by a region of about 170 amino acids found at the C terminus. These proteins have highly divergent N-terminal regions rich in low complexity sequence. PSI-BLAST reveals no clear similarity to any characterised protein. At least 12 distinct members are found in Arabidopsis thaliana.. Probab=21.19 E-value=32 Score=14.69 Aligned_cols=34 Identities=24% Similarity=0.428 Sum_probs=20.5 Q ss_pred EECCCEEEECCCCCCCCCC--CCCCC-----CCCEEEEEECCC Q ss_conf 2146506710221122223--32322-----583267652154 Q gi|254780868|r 109 KWPNDILVFQRKIAGILIE--TLNLK-----NGLQAVVVGIGL 144 (252) Q Consensus 109 KWPNDi~~~~kKi~GIL~E--~~~~~-----~~~~~~viGiGi 144 (252) +| -+||||||+|==.=. +..+. +....+.+|=|+ T Consensus 70 ~W--tmyCNGrK~GYav~R~G~~~eed~~vl~~L~~V~vGAGV 110 (172) T TIGR01570 70 VW--TMYCNGRKVGYAVKRKGSASEEDMTVLEALSKVTVGAGV 110 (172) T ss_pred CC--CCEECCEEEEEEEECCCCCCHHHHHHHHHHCCCEEEEEE T ss_conf 51--400478261023551999898999999863374764567 No 72 >KOG3703 consensus Probab=20.72 E-value=57 Score=13.14 Aligned_cols=81 Identities=20% Similarity=0.074 Sum_probs=44.2 Q ss_pred ECCCCHHHHHH---HHHHCCCCCCEEEEECCCCCCCCCC---------C--CCCCCCCC-CCCCCCCCCCCCCC-HHC-- Q ss_conf 16485689999---9997399997899982446783679---------9--85316633-33413223532240-002-- Q gi|254780868|r 20 DTISSTNDECM---KRALSGDLGNLWIVASCQTAGRGRR---------D--NKWISDKG-NLYASLLLIDSISK-DSL-- 81 (252) Q Consensus 20 ~~~~STn~~~~---~~~~~~~~~~~~v~A~~Qt~GrGr~---------g--r~W~s~~G-~l~~S~~~~~~~~~-~~~-- 81 (252) -+|+||.+|-. ..++.|..+.-+++--+||+|-=.+ | .-=.|-.| .|++++++.|=.-. .++ T Consensus 444 i~VTsTEEYPHlkPAr~RrGFIH~~ImVLPRQTCGLfTHTify~eYPgG~~eldksI~GGeLF~TillNPIsIFMTHlsN 523 (873) T KOG3703 444 IKVTSTEEYPHLKPARYRRGFIHNGIMVLPRQTCGLFTHTIFYDEYPGGSDELDKSIQGGELFLTILLNPISIFMTHLSN 523 (873) T ss_pred EEEEECCCCCCCCHHHHHCCCHHCCEEEECCCCCCCEEEEEEECCCCCCHHHHHHHCCCCEEEEEEECCCHHHHHHHHHH T ss_conf 37512344776673655413100564782142235413455400389974888643466605876630608999997653 Q ss_pred ---CCHHHHHHHHHHHHHHHHC Q ss_conf ---1013779999999999744 Q gi|254780868|r 82 ---TLLSFAIAVAMRSVIASTL 100 (252) Q Consensus 82 ---~~l~~~~~~ai~~~l~~~~ 100 (252) -.+.+.+=..+.+.++..- T Consensus 524 YgnDRLgLYTF~~l~rFlqcWT 545 (873) T KOG3703 524 YGNDRLGLYTFENLVRFLQCWT 545 (873) T ss_pred CCCCCHHHHHHHHHHHHHHHHH T ss_conf 2654034565999999999876 Done!