Query gi|254780755|ref|YP_003065168.1| octaprenyl-diphosphate synthase protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 322 No_of_seqs 173 out of 4641 Neff 7.9 Searched_HMMs 39220 Date Sun May 29 19:20:46 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780755.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK10888 octaprenyl diphosphat 100.0 0 0 596.2 27.6 320 1-321 3-322 (323) 2 TIGR02749 prenyl_cyano solanes 100.0 0 0 594.1 21.1 316 2-322 4-325 (325) 3 TIGR02748 GerC3_HepT heptapren 100.0 0 0 582.7 20.6 320 1-321 2-324 (325) 4 cd00685 Trans_IPPS_HT Trans-Is 100.0 0 0 546.2 24.8 294 25-320 1-298 (298) 5 COG0142 IspA Geranylgeranyl py 100.0 0 0 530.1 26.4 314 3-322 2-322 (322) 6 CHL00151 preA prenyl transfera 100.0 0 0 515.0 24.7 280 3-322 6-288 (288) 7 PRK10581 geranyltranstransfera 100.0 0 0 482.1 24.1 279 4-321 3-298 (299) 8 pfam00348 polyprenyl_synt Poly 100.0 0 0 461.4 18.0 247 30-279 1-254 (260) 9 KOG0776 consensus 100.0 0 0 439.0 23.5 309 4-320 67-384 (384) 10 cd00867 Trans_IPPS Trans-Isopr 100.0 0 0 374.2 18.5 235 45-320 1-236 (236) 11 KOG0777 consensus 100.0 4.3E-43 0 284.5 16.9 280 27-312 20-300 (322) 12 KOG0711 consensus 100.0 2.3E-40 5.6E-45 267.8 18.7 316 2-321 2-346 (347) 13 cd00385 Isoprenoid_Biosyn_C1 I 100.0 1.6E-36 4.1E-41 244.1 11.6 225 65-318 13-242 (243) 14 cd00683 Trans_IPPS_HH Trans-Is 97.6 0.0029 7.3E-08 39.8 12.4 135 134-303 92-226 (265) 15 pfam00494 SQS_PSY Squalene/phy 97.4 0.002 5.1E-08 40.8 9.7 141 134-308 88-228 (262) 16 TIGR03464 HpnC squalene syntha 97.2 0.011 2.8E-07 36.2 11.5 133 135-303 86-218 (266) 17 pfam07307 HEPPP_synt_1 Heptapr 97.0 0.018 4.7E-07 34.9 10.6 109 58-174 29-137 (212) 18 TIGR03465 HpnD squalene syntha 96.8 0.022 5.5E-07 34.4 9.5 134 135-304 85-218 (266) 19 PRK13594 consensus 90.1 1.5 3.7E-05 23.2 11.0 164 58-244 31-198 (278) 20 KOG1166 consensus 86.5 0.13 3.4E-06 29.6 -0.7 87 65-185 801-899 (974) 21 PRK12871 ubiA prenyltransferas 85.8 2.7 6.9E-05 21.5 9.9 173 50-242 29-217 (297) 22 PRK11092 bifunctional (p)ppGpp 75.2 6.3 0.00016 19.3 12.2 31 76-109 67-97 (702) 23 cd00687 Terpene_cyclase_nonpla 74.4 6.6 0.00017 19.1 13.0 164 47-215 41-219 (303) 24 cd00868 Terpene_cyclase_C1 Ter 73.7 6.9 0.00018 19.0 15.2 171 44-216 26-214 (284) 25 cd02432 Nodulin-21_like_1 Nodu 66.2 10 0.00026 18.0 5.6 67 242-318 80-149 (218) 26 PRK12883 ubiA prenyltransferas 63.7 11 0.00029 17.7 10.0 159 62-244 37-198 (275) 27 PHA02092 hypothetical protein 61.2 9.4 0.00024 18.2 3.1 88 232-320 2-94 (108) 28 cd02434 Nodulin-21_like_3 Nodu 60.6 13 0.00032 17.4 6.0 34 243-276 76-109 (225) 29 PRK10872 relA GDP/GTP pyrophos 57.2 15 0.00037 17.1 7.6 31 76-109 77-107 (743) 30 TIGR00691 spoT_relA RelA/SpoT 56.0 11 0.00029 17.7 2.8 51 45-109 22-72 (741) 31 cd02433 Nodulin-21_like_2 Nodu 54.0 16 0.00042 16.7 5.9 34 243-276 93-126 (234) 32 PRK13593 consensus 51.9 18 0.00045 16.5 9.5 159 63-244 43-205 (283) 33 pfam00598 Flu_M1 Influenza Mat 49.1 20 0.0005 16.3 3.8 78 222-299 47-127 (156) 34 TIGR02562 cas3_yersinia CRISPR 41.8 25 0.00065 15.5 3.9 53 27-79 575-634 (1153) 35 PRK02936 argD acetylornithine 41.6 26 0.00065 15.5 4.1 135 160-302 159-311 (377) 36 pfam03936 Terpene_synth_C Terp 39.7 27 0.0007 15.4 13.6 167 44-215 40-228 (270) 37 pfam10776 DUF2600 Protein of u 39.2 28 0.00071 15.3 5.5 30 58-90 48-77 (330) 38 pfam06783 UPF0239 Uncharacteri 38.8 21 0.00052 16.1 1.9 24 186-209 13-36 (85) 39 pfam06603 DUF1141 Protein of u 37.6 30 0.00076 15.1 6.4 68 246-313 31-103 (106) 40 KOG2802 consensus 35.3 32 0.00082 14.9 6.1 107 168-299 370-478 (503) 41 PRK12872 ubiA prenyltransferas 34.5 28 0.00072 15.3 2.1 40 193-243 174-213 (297) 42 pfam06304 DUF1048 Protein of u 33.2 26 0.00065 15.5 1.7 34 192-225 18-56 (92) 43 KOG1345 consensus 28.6 26 0.00066 15.5 1.0 20 65-84 128-147 (378) 44 TIGR01957 nuoB_fam NADH-quinon 27.8 35 0.00089 14.7 1.6 28 66-93 26-53 (146) 45 COG1562 ERG9 Phytoene/squalene 25.7 47 0.0012 13.9 13.5 154 112-305 85-238 (288) 46 PRK04260 acetylornithine amino 25.6 47 0.0012 13.9 5.0 129 160-302 157-309 (375) 47 pfam08684 ocr DNA mimic ocr. T 23.7 40 0.001 14.3 1.3 44 66-113 8-51 (100) 48 PRK12884 ubiA prenyltransferas 22.2 55 0.0014 13.5 9.3 160 61-243 38-201 (281) 49 KOG0018 consensus 21.7 56 0.0014 13.4 2.4 44 63-112 486-530 (1141) 50 TIGR01283 nifE nitrogenase MoF 20.1 34 0.00087 14.8 0.3 49 230-278 344-407 (470) No 1 >PRK10888 octaprenyl diphosphate synthase; Provisional Probab=100.00 E-value=0 Score=596.22 Aligned_cols=320 Identities=42% Similarity=0.683 Sum_probs=309.6 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 94799999999999999999860778758999999999679822799999999997089857866665578999999987 Q gi|254780755|r 1 MKMLEDLTLKDMEKVNFLILKRICSDVEMIPDVVKYLIFSGGKRLRPMLTLATALMLEYRGDNHVLLACAIEFIHTATLL 80 (322) Q Consensus 1 ~~~l~~~~~~~l~~v~~~i~~~~~~~~~~l~~~~~~~~~~gGKr~Rp~l~~l~~~~~~~~~~~~~~~a~~iEliH~asLi 80 (322) +..+.++++++|++|++.|.+.+.+++|.+.++.+|++.+||||+||+||+++++++|++.++.+++|+++|++|+|||| T Consensus 3 l~~i~~~v~~dl~~v~~~l~~~~~~~~~~i~~~~~~~~~~GGKRlRP~l~ll~~~~~g~~~~~~~~~AaaiEliH~asLi 82 (323) T PRK10888 3 LEKINELTAQDMAGVNAAILEQLNSDVQLINQLGYYIISGGGKRIRPMIAVLAARAVGYQGNAHVTIAALIEFIHTATLL 82 (323) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 89999999999999999999986688188999999988269860859999999998299838899999999999999998 Q ss_pred HHHHHCHHHHHHHCCCCHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHH Q ss_conf 52110112200110121010011023431245788887642102431100012345677777754565542132210357 Q gi|254780755|r 81 HDDVVDDSALRRGKVAARLVWGNQTSVLVGDFLLSQAFCMVIETKSQEALEALSLVACTLAEGELRQLSLSKNLDVTEED 160 (322) Q Consensus 81 HDDI~D~~~~RRg~~t~~~~~G~~~Ail~GD~l~~~a~~~l~~~~~~~~~~~~~~~~~~l~~Gq~~d~~~~~~~~~s~~~ 160 (322) ||||||+|++|||+||+|.+||++.||++|||||+.+|+++++.++.++++.++.++..+++||++|+.+..+.++++++ T Consensus 83 HDDiiD~s~~RRG~pt~h~~~G~~~AIl~GD~l~~~a~~~l~~~~~~~~~~~~~~~~~~~~eGe~~q~~~~~~~~~~~~~ 162 (323) T PRK10888 83 HDDVVDESDMRRGKATANAAFGNAASVLVGDFIYTRAFQMMTSLGSLKVLEVMSEAVNVIAEGEVLQLMNVNDPDITEEN 162 (323) T ss_pred HCCCCCCCCCCCCCCCCCHHHCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHH T ss_conf 74656799888998540133044216762029998888866512766899999999999996889999854278999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCH Q ss_conf 89899864335555443210001122221015799999731116898876411236700038401234542136532100 Q gi|254780755|r 161 YLHVIKSKTAVLFSAALEVSSLIAGVQNSVRQALKSYGMNLGIAFQLVDDVLDYRGEVNEMGKNIGEDFRNGKPTLPVIL 240 (322) Q Consensus 161 y~~ii~~KTa~Lf~~~~~~ga~lag~~~~~~~~l~~fG~~lG~afQI~DD~ld~~g~~~~~GK~~g~Dl~egk~Tlpii~ 240 (322) |++|++.|||+||++||.+||+++|++++..+.+++||+++|+||||+||+||++|+++.+|||+|+||++||+|+|+++ T Consensus 163 yl~~i~~KTa~Lf~~~~~~ga~lag~~~~~~~~l~~fG~~lG~AFQI~DDlLD~~gd~~~~GK~~g~Dl~eGk~TlPvi~ 242 (323) T PRK10888 163 YMRVIYSKTARLFEAAAQCSGILAGCTPEQEKGLQDYGRYLGTAFQLIDDLLDYSADGEQLGKNVGDDLNEGKPTLPLLH 242 (323) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHCCCCCCCHHCCCHHHHHHH T ss_conf 99999999999999999999987099999999999999999999999997520247878868996513425864799999 Q ss_pred HHHCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCC Q ss_conf 11014311000134320368999899999999999749699999999999999999842089998999999999999603 Q gi|254780755|r 241 AFQRGTKREKNFWKSTINDGKISVENLKKAFIIMRENRALVDTEYRAYSYGQKAKDSLRCLPDSSWKKSLMEVVDFCLQR 320 (322) Q Consensus 241 al~~~~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~A~~~L~~lp~~~~k~~L~~L~~~ii~R 320 (322) |+++++++++..+..++.+.. +.+.+++++++++++|+++++++++.+|.++|++.|+.||+++.|+.|.+|++|+++| T Consensus 243 Al~~~~~~~~~~l~~~~~~~~-~~~~~~~v~~~l~~~gai~~a~~~a~~~~~~A~~~L~~lp~s~~r~~L~~La~~~~~R 321 (323) T PRK10888 243 AMHHGTPEQAAMIRTAIEQGN-GRHLLEPVLEAMNACGSLEWTRQRAEEEADKAIAALQVLPDTPWREALIGLAHIAVQR 321 (323) T ss_pred HHHHCCHHHHHHHHHHHHCCC-CHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHC T ss_conf 998499989999999874788-4999999999999869599999999999999999987089978999999999999817 Q ss_pred C Q ss_conf 7 Q gi|254780755|r 321 L 321 (322) Q Consensus 321 ~ 321 (322) - T Consensus 322 ~ 322 (323) T PRK10888 322 D 322 (323) T ss_pred C T ss_conf 8 No 2 >TIGR02749 prenyl_cyano solanesyl diphosphate synthase; InterPro: IPR014120 This entry contains solanesyl diphosphate synthases from cyanobacteria or plastid-containing eukaryotes. A member from Arabidopsis (where both plastoquinone and ubiquinone contain the C(45) prenyl moiety) was characterised by heterologous expression as a solanesyl diphosphate synthase.. Probab=100.00 E-value=0 Score=594.10 Aligned_cols=316 Identities=33% Similarity=0.476 Sum_probs=303.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH---C---CCHHHHHHHHHHHHHHH Q ss_conf 47999999999999999998607787589999999996798227999999999970---8---98578666655789999 Q gi|254780755|r 2 KMLEDLTLKDMEKVNFLILKRICSDVEMIPDVVKYLIFSGGKRLRPMLTLATALML---E---YRGDNHVLLACAIEFIH 75 (322) Q Consensus 2 ~~l~~~~~~~l~~v~~~i~~~~~~~~~~l~~~~~~~~~~gGKr~Rp~l~~l~~~~~---~---~~~~~~~~~a~~iEliH 75 (322) ..|-.+|.+||..++.-+++.+++.||.+.-+++|+|..||||+||.+|+|++++. . .-+.+..++|-++|||| T Consensus 4 ~~l~~pVe~dL~~l~~NLk~lvGa~HPiL~AAAEHLF~AGGKR~RPaiVLLvSrAT~~~~GlkElt~~HRRLAEITEmIH 83 (325) T TIGR02749 4 TSLFAPVEDDLELLTDNLKSLVGARHPILYAAAEHLFSAGGKRLRPAIVLLVSRATAEEQGLKELTARHRRLAEITEMIH 83 (325) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHH T ss_conf 78889999999999987787314568678999999871589654068999999998873666357610353133686775 Q ss_pred HHHHHHHHHHCHHHHHHHCCCCHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 99987521101122001101210100110234312457888876421024311000123456777777545655421322 Q gi|254780755|r 76 TATLLHDDVVDDSALRRGKVAARLVWGNQTSVLVGDFLLSQAFCMVIETKSQEALEALSLVACTLAEGELRQLSLSKNLD 155 (322) Q Consensus 76 ~asLiHDDI~D~~~~RRg~~t~~~~~G~~~Ail~GD~l~~~a~~~l~~~~~~~~~~~~~~~~~~l~~Gq~~d~~~~~~~~ 155 (322) +||||||||+|+|++|||.||||..|||++|||+|||||+.|.=+|++++|.++++.+++++.+.+|||+.|-..+++.+ T Consensus 84 TASLVHDDV~DEsd~RRG~~TVhs~F~trvAVLAGDFLFAQaSWYLANLenLEVVKLls~VI~DFAEGEI~Qg~~~FD~d 163 (325) T TIGR02749 84 TASLVHDDVIDESDVRRGVETVHSLFGTRVAVLAGDFLFAQASWYLANLENLEVVKLLSKVITDFAEGEIKQGLNRFDSD 163 (325) T ss_pred HHHHHCCCEECCCCCCCCCCCCCCCCCCEEEEECCCHHHHHHHHHHHCCCCCCEEHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 23222022003543448885534568481676204067899999986167721001454787564117888655525788 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCC Q ss_conf 10357898998643355554432100011222210157999997311168988764112367000384012345421365 Q gi|254780755|r 156 VTEEDYLHVIKSKTAVLFSAALEVSSLIAGVQNSVRQALKSYGMNLGIAFQLVDDVLDYRGEVNEMGKNIGEDFRNGKPT 235 (322) Q Consensus 156 ~s~~~y~~ii~~KTa~Lf~~~~~~ga~lag~~~~~~~~l~~fG~~lG~afQI~DD~ld~~g~~~~~GK~~g~Dl~egk~T 235 (322) ++.|+|++-..+|||+|++++|+.+|+++..+.+..+.++.||+|||+||||.||+|||+|..+.+|||.|+||..|-.| T Consensus 164 ~~le~YleKSyYKTASL~AaSskaAAvLS~~~~~v~n~LY~yGkhLGLAFQvvDDiLDFTgsTe~LGKPagsDL~~GNlT 243 (325) T TIGR02749 164 LSLEDYLEKSYYKTASLVAASSKAAAVLSDVDSKVANDLYEYGKHLGLAFQVVDDILDFTGSTEQLGKPAGSDLKKGNLT 243 (325) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCCCCCCCHHHHCCCCCC T ss_conf 78887665644678999999989988851143899999877656616335445545157788511567514343057851 Q ss_pred CCCCHHHHCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 32100110143110001343203689998999999999997496999999999999999998420899989999999999 Q gi|254780755|r 236 LPVILAFQRGTKREKNFWKSTINDGKISVENLKKAFIIMRENRALVDTEYRAYSYGQKAKDSLRCLPDSSWKKSLMEVVD 315 (322) Q Consensus 236 lpii~al~~~~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~A~~~L~~lp~~~~k~~L~~L~~ 315 (322) .|++||||+.|. |..+++.+-..+.++++.+++++++++|.+++++++++.+.|+.+|+.||+|++|++|.+|++ T Consensus 244 APvLfALE~~p~-----L~~LIer~F~~~~DLe~AL~lv~~~~aI~k~rELA~~~A~~A~~~L~~Lp~S~~r~aL~eLv~ 318 (325) T TIGR02749 244 APVLFALEEEPK-----LSELIEREFSQKGDLEQALELVRKSGAIKKARELAKEQAQLALQSLSVLPPSASREALKELVE 318 (325) T ss_pred HHHHHHHCCCCC-----CCHHHCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH T ss_conf 579986237887-----201201113781457999999982372157899999999999986422789712578887331 Q ss_pred HHHCCCC Q ss_conf 9960379 Q gi|254780755|r 316 FCLQRLN 322 (322) Q Consensus 316 ~ii~R~~ 322 (322) |+++|++ T Consensus 319 f~l~RL~ 325 (325) T TIGR02749 319 FVLERLY 325 (325) T ss_pred HEECCCC T ss_conf 1001369 No 3 >TIGR02748 GerC3_HepT heptaprenyl diphosphate synthase component II; InterPro: IPR014119 The proteins in this entry are component II of the heterodimeric heptaprenyl diphosphate synthase. They are found proximate to the gene for component I (IPR009920 from INTERPRO). This enzyme acts in menaquinone-7 isoprenoid side chain biosynthesis.. Probab=100.00 E-value=0 Score=582.71 Aligned_cols=320 Identities=29% Similarity=0.485 Sum_probs=309.1 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH-CCCHHHHHHHHHHHHHHHHHH Q ss_conf 947999999999999999998607-787589999999996798227999999999970-898578666655789999999 Q gi|254780755|r 1 MKMLEDLTLKDMEKVNFLILKRIC-SDVEMIPDVVKYLIFSGGKRLRPMLTLATALML-EYRGDNHVLLACAIEFIHTAT 78 (322) Q Consensus 1 ~~~l~~~~~~~l~~v~~~i~~~~~-~~~~~l~~~~~~~~~~gGKr~Rp~l~~l~~~~~-~~~~~~~~~~a~~iEliH~as 78 (322) |+.+...++++++.||+.|.+.+. +++|++.+++-|++..||||+||.||+|+++++ +++-+++..+|+++||||+|| T Consensus 2 L~~~Y~~l~~Di~~iE~EL~~~v~Ga~~~~l~eA~l~LL~AGGKRIRPVFVLLag~fGP~Ydl~~~K~vAV~LELIHMAS 81 (325) T TIGR02748 2 LKDIYSFLQKDIESIEKELEKAVQGAEHPVLSEAALHLLKAGGKRIRPVFVLLAGKFGPDYDLDKIKHVAVALELIHMAS 81 (325) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCEEHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCC T ss_conf 47889998876999999999998520687357899999981894001799986763077533554656765576773532 Q ss_pred HHHHHHHCHHHHHHHCCCCHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCH Q ss_conf 87521101122001101210100110234312457888876421024311000123456777777545655421322103 Q gi|254780755|r 79 LLHDDVVDDSALRRGKVAARLVWGNQTSVLVGDFLLSQAFCMVIETKSQEALEALSLVACTLAEGELRQLSLSKNLDVTE 158 (322) Q Consensus 79 LiHDDI~D~~~~RRg~~t~~~~~G~~~Ail~GD~l~~~a~~~l~~~~~~~~~~~~~~~~~~l~~Gq~~d~~~~~~~~~s~ 158 (322) ||||||+|++++|||+||+..+|||.+|...||||||+|.+.+.+.++|++-+++|+++.++|.||+-|+.-.+|++.++ T Consensus 82 LVHDDVID~A~LRRG~~Ti~skW~NRiAMYTGDYlfA~slE~~t~i~dp~~H~~LS~tivEvc~GEIEQIkDkYN~dQ~l 161 (325) T TIGR02748 82 LVHDDVIDDADLRRGKPTIKSKWDNRIAMYTGDYLFAKSLETMTEIKDPRAHQILSKTIVEVCLGEIEQIKDKYNFDQNL 161 (325) T ss_pred CCCCCCCCCCCCCCCCHHHCCCCCCEEEEHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHCCCCCCCCH T ss_conf 02565205876778842012102773220033589999999985169827889999999988622101001013788860 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCC Q ss_conf 57898998643355554432100011222210157999997311168988764112367000384012345421365321 Q gi|254780755|r 159 EDYLHVIKSKTAVLFSAALEVSSLIAGVQNSVRQALKSYGMNLGIAFQLVDDVLDYRGEVNEMGKNIGEDFRNGKPTLPV 238 (322) Q Consensus 159 ~~y~~ii~~KTa~Lf~~~~~~ga~lag~~~~~~~~l~~fG~~lG~afQI~DD~ld~~g~~~~~GK~~g~Dl~egk~Tlpi 238 (322) ..|++.|+.|||-|+++||++||+.||++++..+.++.||+++||+|||.||+|||.|+++++|||+|+||.+|..|||+ T Consensus 162 R~YLRRIKRKTALLIA~SCqLGAia~Ga~~~~~~~LY~FGYYvGMSyQI~DDiLDFvgTee~LGKPAG~DLlQGNvTLPv 241 (325) T TIGR02748 162 RTYLRRIKRKTALLIAASCQLGAIASGADEAIVKKLYWFGYYVGMSYQIIDDILDFVGTEEELGKPAGGDLLQGNVTLPV 241 (325) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHH T ss_conf 03665666889999998776520005889878773655335420145444321366466311688757653552016788 Q ss_pred CHHHHCCCCCCCHHHHHHCCCC-CCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH Q ss_conf 0011014311000134320368-999899999999999749699999999999999999842089998999999999999 Q gi|254780755|r 239 ILAFQRGTKREKNFWKSTINDG-KISVENLKKAFIIMRENRALVDTEYRAYSYGQKAKDSLRCLPDSSWKKSLMEVVDFC 317 (322) Q Consensus 239 i~al~~~~~~~~~~l~~~~~~~-~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~A~~~L~~lp~~~~k~~L~~L~~~i 317 (322) +||+++ +..-|+.|.+...+. +.+.++.+.+++-++++++++++...+.-|.++|.+.|+.||.++.|..|.+++.|+ T Consensus 242 LYAl~d-~~~~K~~i~~v~~~~g~v~~~em~~~i~~vk~~~aI~~a~a~Sd~YL~KA~~~L~~LP~~~ak~~L~~iA~YI 320 (325) T TIGR02748 242 LYALED-PIGLKKRIEQVLEETGSVTAEEMEELIEEVKKSDAIERAYAVSDRYLKKALELLDGLPDGEAKKLLQEIAKYI 320 (325) T ss_pred HHHHCC-CHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 885159-7888999973034456768889999999985056799999999999999999986278813568899999871 Q ss_pred HCCC Q ss_conf 6037 Q gi|254780755|r 318 LQRL 321 (322) Q Consensus 318 i~R~ 321 (322) -+|. T Consensus 321 GkR~ 324 (325) T TIGR02748 321 GKRK 324 (325) T ss_pred CCCC T ss_conf 2678 No 4 >cd00685 Trans_IPPS_HT Trans-Isoprenyl Diphosphate Synthases (Trans_IPPS), head-to-tail (HT) (1'-4) condensation reactions. This CD includes all-trans (E)-isoprenyl diphosphate synthases which synthesis various chain length (C10, C15, C20, C25, C30, C35, C40, C45, and C50) linear isoprenyl diphosphates from precursors, isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP). They catalyze the successive 1'-4 condensation of the 5-carbon IPP to allylic substrates geranyl-, farnesyl-, or geranylgeranyl-diphosphate. Isoprenoid chain elongation reactions proceed via electrophilic alkylations in which a new carbon-carbon single bond is generated through interaction between a highly reactive electron-deficient allylic carbocation and an electron-rich carbon-carbon double bond. The catalytic site consists of a large central cavity formed by mostly antiparallel alpha helices with two aspartate-rich regions (DDXX(XX)D) located on opposite walls. These residues mediate binding of pre Probab=100.00 E-value=0 Score=546.20 Aligned_cols=294 Identities=40% Similarity=0.655 Sum_probs=281.2 Q ss_pred CCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCH-HHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHCCCCHHHCCC Q ss_conf 7875899999999967982279999999999708985-786666557899999998752110112200110121010011 Q gi|254780755|r 25 SDVEMIPDVVKYLIFSGGKRLRPMLTLATALMLEYRG-DNHVLLACAIEFIHTATLLHDDVVDDSALRRGKVAARLVWGN 103 (322) Q Consensus 25 ~~~~~l~~~~~~~~~~gGKr~Rp~l~~l~~~~~~~~~-~~~~~~a~~iEliH~asLiHDDI~D~~~~RRg~~t~~~~~G~ 103 (322) +..+.+.++++|++.+||||+||.|++++++++|++. +.++.+|+++|+||+||||||||||+|++|||+||+|.+||+ T Consensus 1 ~~~~~l~~~~~y~~~~gGKrlRp~l~ll~~~~~~~~~~~~~~~~A~aiEliH~asLiHDDiiD~s~~RRG~pt~h~~~g~ 80 (298) T cd00685 1 SEVELLREALRYLLLAGGKRLRPLLVLLAARALGGPELEAALRLAAAIELLHTASLVHDDVMDNSDLRRGKPTVHKVFGN 80 (298) T ss_pred CCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHCCC T ss_conf 96389999999886389743879999999998199956889999999999999999982410586545899878887299 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCC---CCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 02343124578888764210243---110001234567777775456554213221035789899864335555443210 Q gi|254780755|r 104 QTSVLVGDFLLSQAFCMVIETKS---QEALEALSLVACTLAEGELRQLSLSKNLDVTEEDYLHVIKSKTAVLFSAALEVS 180 (322) Q Consensus 104 ~~Ail~GD~l~~~a~~~l~~~~~---~~~~~~~~~~~~~l~~Gq~~d~~~~~~~~~s~~~y~~ii~~KTa~Lf~~~~~~g 180 (322) +.|||+||||++.+|+++++.++ +++++.+++++..+++||++|+.+..+.+.++++|++|+.+|||+||+++|.+| T Consensus 81 ~~AIl~GD~l~~~a~~~l~~~~~~~~~~~~~~~~~~~~~~~~Gq~~d~~~~~~~~~~~~~y~~~~~~KTa~l~~~~~~~g 160 (298) T cd00685 81 ATAILAGDYLLARAFELLARLGNPYYPRALELFSEAILELVEGQLLDLLSEYDTDVTEEEYLRIIRLKTAALFAAAPLLG 160 (298) T ss_pred CCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 64414889999999999986799220999999999999998888999863789999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHCCCCCCCHHHHHHCCCC Q ss_conf 00112222101579999973111689887641123670003840123454213653210011014311000134320368 Q gi|254780755|r 181 SLIAGVQNSVRQALKSYGMNLGIAFQLVDDVLDYRGEVNEMGKNIGEDFRNGKPTLPVILAFQRGTKREKNFWKSTINDG 260 (322) Q Consensus 181 a~lag~~~~~~~~l~~fG~~lG~afQI~DD~ld~~g~~~~~GK~~g~Dl~egk~Tlpii~al~~~~~~~~~~l~~~~~~~ 260 (322) ++++|++++..+.+.+||+++|+||||+||++|++|+++.+|||+|+||++||+|||+++|++.+++.++..+..++++ T Consensus 161 a~~~~~~~~~~~~l~~~g~~lG~aFQi~DD~ld~~g~~~~~GK~~g~Dl~eGk~Tlp~i~al~~~~~~~~~~~~~~~~~- 239 (298) T cd00685 161 ALLAGADEEEAEALKRFGRNLGLAFQIQDDILDLFGDPETLGKPVGSDLREGKCTLPVLLALERAPPEQRLLLELLLGK- 239 (298) T ss_pred HHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCHHHCCCCHHHHHHHHHHCCHHHHHHHHHHHCC- T ss_conf 9983999999999999988768999999998742289788599843166569840999999987998779999986168- Q ss_pred CCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCC Q ss_conf 999899999999999749699999999999999999842089998999999999999603 Q gi|254780755|r 261 KISVENLKKAFIIMRENRALVDTEYRAYSYGQKAKDSLRCLPDSSWKKSLMEVVDFCLQR 320 (322) Q Consensus 261 ~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~A~~~L~~lp~~~~k~~L~~L~~~ii~R 320 (322) .+.+++.+++++++++|+++++++.+.+|.++|++.|+.+|+++.|+.|..|++|+++| T Consensus 240 -~~~~~~~~v~~~i~~~g~~~~a~~~~~~~~~~A~~~L~~l~~~~~k~~L~~l~~~~~~R 298 (298) T cd00685 240 -GDEEDVEEARALVKKSGGIEYARELAREYEEKALEALKALPESPAREALRALADFILER 298 (298) T ss_pred -CCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCC T ss_conf -99999999999999869699999999999999999987399978999999999998629 No 5 >COG0142 IspA Geranylgeranyl pyrophosphate synthase [Coenzyme metabolism] Probab=100.00 E-value=0 Score=530.11 Aligned_cols=314 Identities=39% Similarity=0.649 Sum_probs=297.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHH----HHHHHHHHHHHHHHH Q ss_conf 7999999999999999998607-78758999999999679822799999999997089857----866665578999999 Q gi|254780755|r 3 MLEDLTLKDMEKVNFLILKRIC-SDVEMIPDVVKYLIFSGGKRLRPMLTLATALMLEYRGD----NHVLLACAIEFIHTA 77 (322) Q Consensus 3 ~l~~~~~~~l~~v~~~i~~~~~-~~~~~l~~~~~~~~~~gGKr~Rp~l~~l~~~~~~~~~~----~~~~~a~~iEliH~a 77 (322) .+...+.+.++.|++.|.+.+. +.++.+.++..|.+..||||+||++++++++++|.+.+ +++.+|+++|+||++ T Consensus 2 ~~~~~~~~~~~~i~~~L~~~l~~~~~~~l~~a~~~~~~aGGKrlRP~l~l~~~~~~~~~~~~~~~~~~~~a~avEliH~~ 81 (322) T COG0142 2 DLLALLLKRLARIEELLSELLSGSDPELLLEAMRYLLLAGGKRLRPLLVLLAAEALGIDLETGGNDALDLAAAIELIHTA 81 (322) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 17899999999999999987175663889999997540687008899999999876986432307899999999999999 Q ss_pred HHHHHHHHCHHHHHHHCCCCHHHCCCHHHHHHHHHHHHHHHHHHHCCCC--CCCCHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9875211011220011012101001102343124578888764210243--11000123456777777545655421322 Q gi|254780755|r 78 TLLHDDVVDDSALRRGKVAARLVWGNQTSVLVGDFLLSQAFCMVIETKS--QEALEALSLVACTLAEGELRQLSLSKNLD 155 (322) Q Consensus 78 sLiHDDI~D~~~~RRg~~t~~~~~G~~~Ail~GD~l~~~a~~~l~~~~~--~~~~~~~~~~~~~l~~Gq~~d~~~~~~~~ 155 (322) |||||||||+|++|||+||+|.+||++.||++||+|++.||.++++.++ +.+.+.++.++..+|+||++|+.+..+. T Consensus 82 SLiHDDvmD~s~~RRG~pt~~~~~g~~~AIlaGD~L~~~Af~~l~~~~~~~~~~~~~~~~~~~~~~~GQ~lDl~~~~~~- 160 (322) T COG0142 82 SLIHDDLMDDDDLRRGKPTVHAKFGEATAILAGDALLAAAFELLSKLGSEALEAIKALAEAINGLCGGQALDLAFENKP- 160 (322) T ss_pred HHHHHHHCCCCCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCC- T ss_conf 9998651148864378975666736237999989999999999987585227999999999999977289998667889- Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCC Q ss_conf 10357898998643355554432100011222210157999997311168988764112367000384012345421365 Q gi|254780755|r 156 VTEEDYLHVIKSKTAVLFSAALEVSSLIAGVQNSVRQALKSYGMNLGIAFQLVDDVLDYRGEVNEMGKNIGEDFRNGKPT 235 (322) Q Consensus 156 ~s~~~y~~ii~~KTa~Lf~~~~~~ga~lag~~~~~~~~l~~fG~~lG~afQI~DD~ld~~g~~~~~GK~~g~Dl~egk~T 235 (322) +|+++|++|+++|||+||+++|.+|++++|++++..+.+..||+++|+||||+||+||++|+++.+|||+|+|+++||+| T Consensus 161 ~t~e~y~~~i~~KTa~L~~~a~~~ga~la~~~~~~~~~l~~~g~~lGlaFQi~DDiLD~~~d~~~lGK~~g~Dl~~gK~T 240 (322) T COG0142 161 VTLEEYLRVIELKTAALFAAAAVLGAILAGADEELLEALEDYGRNLGLAFQIQDDILDITGDEEELGKPVGSDLKEGKPT 240 (322) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCCHHHHHHCCCC T ss_conf 99999999999989999999999999981998999999999999877888867747750587276289743568818852 Q ss_pred CCCCHHHHCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 32100110143110001343203689998999999999997496999999999999999998420899989999999999 Q gi|254780755|r 236 LPVILAFQRGTKREKNFWKSTINDGKISVENLKKAFIIMRENRALVDTEYRAYSYGQKAKDSLRCLPDSSWKKSLMEVVD 315 (322) Q Consensus 236 lpii~al~~~~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~A~~~L~~lp~~~~k~~L~~L~~ 315 (322) ||++++++.+++.++ .+...+.+.. ++++++++++++|+++++..++..|.++|++.|+.+|+++.++.|..+++ T Consensus 241 ~p~l~~l~~~~~~~~-~~~~~~~~~~----~~~~~~~~~~~~g~~~~~~~~a~~~~~~a~~~L~~l~~~~~~~~L~~la~ 315 (322) T COG0142 241 LPVLLALEKANEDQK-LLRILLEGGG----EVEEALELLRKSGAIEYAKNLAKTYVEKAKEALEKLPDSEAKEALLELAD 315 (322) T ss_pred HHHHHHHHHCCHHHH-HHHHHHHHHH----HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 999999984756567-8988751014----69999999997053899999999999999999884798368999999999 Q ss_pred HHHCCCC Q ss_conf 9960379 Q gi|254780755|r 316 FCLQRLN 322 (322) Q Consensus 316 ~ii~R~~ 322 (322) |+++|.+ T Consensus 316 ~~~~R~~ 322 (322) T COG0142 316 FIIKRKY 322 (322) T ss_pred HHHHCCC T ss_conf 9986129 No 6 >CHL00151 preA prenyl transferase; Reviewed Probab=100.00 E-value=0 Score=515.04 Aligned_cols=280 Identities=32% Similarity=0.482 Sum_probs=260.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHH---HHHHHHHHHHHHHHHHH Q ss_conf 799999999999999999860778758999999999679822799999999997089857---86666557899999998 Q gi|254780755|r 3 MLEDLTLKDMEKVNFLILKRICSDVEMIPDVVKYLIFSGGKRLRPMLTLATALMLEYRGD---NHVLLACAIEFIHTATL 79 (322) Q Consensus 3 ~l~~~~~~~l~~v~~~i~~~~~~~~~~l~~~~~~~~~~gGKr~Rp~l~~l~~~~~~~~~~---~~~~~a~~iEliH~asL 79 (322) .+..++.++|++|++.|.+.+.+++|.+.++++|++.+||||+||+||+++++++|++.+ ..+.+|+++|++|+||| T Consensus 6 ~i~~~i~~~l~~ve~~l~~~l~s~~~~l~~a~~y~~~~gGKRlRP~L~ll~~~~~g~~~~~~~~~~~~A~avEliH~aSL 85 (288) T CHL00151 6 KLLAPVEEDLESVEKNLKKLIGTRHPILSAAARHLFSAGGKRIRPAIVLLVAKATGGNQEIAPPQRRLAEITEMIHTASL 85 (288) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 88999999999999999988279966999999999865997286999999999819980010668999999999999999 Q ss_pred HHHHHHCHHHHHHHCCCCHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHH Q ss_conf 75211011220011012101001102343124578888764210243110001234567777775456554213221035 Q gi|254780755|r 80 LHDDVVDDSALRRGKVAARLVWGNQTSVLVGDFLLSQAFCMVIETKSQEALEALSLVACTLAEGELRQLSLSKNLDVTEE 159 (322) Q Consensus 80 iHDDI~D~~~~RRg~~t~~~~~G~~~Ail~GD~l~~~a~~~l~~~~~~~~~~~~~~~~~~l~~Gq~~d~~~~~~~~~s~~ 159 (322) |||||||+|++|||+||+|++||++.||++||||++.+|+.+++.++..+.+.+++++.++++||..|..+..+.+.+++ T Consensus 86 IHDDi~D~s~~RRG~pt~h~~~G~~~AIL~GD~L~~~a~~~la~~~~~~~~~~~~~~i~~l~~g~~~~~~~~~~~~~~~~ 165 (288) T CHL00151 86 VHDDVVDEDSTRRGVPTVHKRFGTKIAVLAGDFLFAQSSWYLANLNNLEVVKLISKVITDLAEGEIRQGLVQFDTTLSTL 165 (288) T ss_pred HHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHH T ss_conf 84677899866689865324414389999988999999999760463256899999999999888987764258887899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCC Q ss_conf 78989986433555544321000112222101579999973111689887641123670003840123454213653210 Q gi|254780755|r 160 DYLHVIKSKTAVLFSAALEVSSLIAGVQNSVRQALKSYGMNLGIAFQLVDDVLDYRGEVNEMGKNIGEDFRNGKPTLPVI 239 (322) Q Consensus 160 ~y~~ii~~KTa~Lf~~~~~~ga~lag~~~~~~~~l~~fG~~lG~afQI~DD~ld~~g~~~~~GK~~g~Dl~egk~Tlpii 239 (322) +|++|+++|||+||++||++||+++|++++.++.+.+||+++|+||||+||+||++|+++.+|||+|+|+++||+|||++ T Consensus 166 ~y~~~~~~KTa~Lf~~a~~~gailaga~~~~~~~l~~~G~~lG~AFQI~DDlLD~~gd~~~~GK~~g~Dl~eGK~Tlp~l 245 (288) T CHL00151 166 EYIEKSFYKTASLLAASCKAAALLSDADEELLNDLYLYGKHLGLAFQIIDDILDITGSTESLGKPAGSDLKNGNLTAPVL 245 (288) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCHHHHCCCCCHHHHCCCCCHHHH T ss_conf 99999985207999999999999809999999999998641311455349754035898997899114777598419999 Q ss_pred HHHHCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHC Q ss_conf 01101431100013432036899989999999999974969999999999999999984208999899999999999960 Q gi|254780755|r 240 LAFQRGTKREKNFWKSTINDGKISVENLKKAFIIMRENRALVDTEYRAYSYGQKAKDSLRCLPDSSWKKSLMEVVDFCLQ 319 (322) Q Consensus 240 ~al~~~~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~A~~~L~~lp~~~~k~~L~~L~~~ii~ 319 (322) ++++++ .+.++.+.++|++.|. |.++.++.|.+|++|+++ T Consensus 246 ~ale~~--------------------------------------~~~~~~l~~~a~~~l~--~~~~~~~~L~~l~~~i~~ 285 (288) T CHL00151 246 FALEQS--------------------------------------SKLAELIEREFCEGIL--PPSEAKDALLALADFIIN 285 (288) T ss_pred HHHHHH--------------------------------------HHHHHHHHHHHHHHHH--CCCHHHHHHHHHHHHHHH T ss_conf 999985--------------------------------------9999999999999765--147306999999999985 Q ss_pred CCC Q ss_conf 379 Q gi|254780755|r 320 RLN 322 (322) Q Consensus 320 R~~ 322 (322) |.| T Consensus 286 R~~ 288 (288) T CHL00151 286 RLS 288 (288) T ss_pred CCC T ss_conf 679 No 7 >PRK10581 geranyltranstransferase; Provisional Probab=100.00 E-value=0 Score=482.08 Aligned_cols=279 Identities=28% Similarity=0.432 Sum_probs=248.5 Q ss_pred HHHHHHHHHHHHHHHHHHHCCC---CCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999986077---8758999999999679822799999999997089857866665578999999987 Q gi|254780755|r 4 LEDLTLKDMEKVNFLILKRICS---DVEMIPDVVKYLIFSGGKRLRPMLTLATALMLEYRGDNHVLLACAIEFIHTATLL 80 (322) Q Consensus 4 l~~~~~~~l~~v~~~i~~~~~~---~~~~l~~~~~~~~~~gGKr~Rp~l~~l~~~~~~~~~~~~~~~a~~iEliH~asLi 80 (322) +.+.+.+-..+|++.+.+.+.+ .++.+.++++|.+..||||+||.||+++++++|++.+.++++|+++||+|+|||| T Consensus 3 f~~~~~~~~~~~~~~l~~~~~~~~~~~~~l~~a~~y~~~~GGKRlRP~L~ll~~~~~g~~~~~~~~~A~AiEliH~aSLI 82 (299) T PRK10581 3 FPQQLQACVKQANQALSRFIAPLPFQNTPVVEAMQYGALLGGKRLRPFLVYATGQMFGVSTNTLDAPAAAVECIHAYSLI 82 (299) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 79999999999999999863535789726999999898479660879999999998399867789999999999998989 Q ss_pred HHHH--HCHHHHHHHCCCCHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCC-----CHHH---H--HHHHHHHHHHHHHH Q ss_conf 5211--011220011012101001102343124578888764210243110-----0012---3--45677777754565 Q gi|254780755|r 81 HDDV--VDDSALRRGKVAARLVWGNQTSVLVGDFLLSQAFCMVIETKSQEA-----LEAL---S--LVACTLAEGELRQL 148 (322) Q Consensus 81 HDDI--~D~~~~RRg~~t~~~~~G~~~Ail~GD~l~~~a~~~l~~~~~~~~-----~~~~---~--~~~~~l~~Gq~~d~ 148 (322) |||| ||+|++|||+||+|.+||+++||++||+|++.+|.++++.+.+.+ +..+ + .....+++||.+|+ T Consensus 83 HDDIp~iD~s~~RRG~pt~h~~~G~~~AIlaGD~L~~~a~~ll~~~~~~~~~~~~~~~~i~~l~~~~~~~~~~~GQ~lDl 162 (299) T PRK10581 83 HDDLPAMDDDDLRRGLPTCHVKFGEANAILAGDALQTLAFSILSDAPMPEVADRDRISMISELASASGIAGMCGGQALDL 162 (299) T ss_pred HCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 71840125886548987724334852467730289999999997189965326889999999999988988620189999 Q ss_pred HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHH Q ss_conf 542132210357898998643355554432100011222-2101579999973111689887641123670003840123 Q gi|254780755|r 149 SLSKNLDVTEEDYLHVIKSKTAVLFSAALEVSSLIAGVQ-NSVRQALKSYGMNLGIAFQLVDDVLDYRGEVNEMGKNIGE 227 (322) Q Consensus 149 ~~~~~~~~s~~~y~~ii~~KTa~Lf~~~~~~ga~lag~~-~~~~~~l~~fG~~lG~afQI~DD~ld~~g~~~~~GK~~g~ 227 (322) .+.. ..++.+.|.+|+++|||+||++||.+|++++|.. .+..+.+.+||+++|+||||+||+||++|+++.+|||+|+ T Consensus 163 ~~~~-~~~~~~~~~~i~~~KTa~Lf~~a~~~gai~ag~~~~~~~~~l~~~g~~lGiAFQI~DDlLD~~gd~~~~GK~~g~ 241 (299) T PRK10581 163 EAEG-KHVPLDALERIHRHKTGALIRAAVRLGALSAGDKGRRALPVLDRYAESIGLAFQVQDDILDVVGDTATLGKRQGA 241 (299) T ss_pred HHHC-CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHCCCCCH T ss_conf 8523-799999999999999999999999999998499989999999999999889999864577703898997999261 Q ss_pred HHHHCCCCCCCCHHHHCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCC-CCHH Q ss_conf 454213653210011014311000134320368999899999999999749699999999999999999842089-9989 Q gi|254780755|r 228 DFRNGKPTLPVILAFQRGTKREKNFWKSTINDGKISVENLKKAFIIMRENRALVDTEYRAYSYGQKAKDSLRCLP-DSSW 306 (322) Q Consensus 228 Dl~egk~Tlpii~al~~~~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~A~~~L~~lp-~~~~ 306 (322) |+++||+|||+++++ +++++.+.++.++|++.|+.|+ .++. T Consensus 242 Dl~eGK~Tlp~i~gl--------------------------------------e~A~~~a~~l~~~a~~~L~~l~~~~~~ 283 (299) T PRK10581 242 DQQLGKSTYPALLGL--------------------------------------EQARKKARDLIDDARQSLKQLAAQSLD 283 (299) T ss_pred HHHCCCCCHHHHHHH--------------------------------------HHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 466396539999999--------------------------------------999999999999999999874626625 Q ss_pred HHHHHHHHHHHHCCC Q ss_conf 999999999996037 Q gi|254780755|r 307 KKSLMEVVDFCLQRL 321 (322) Q Consensus 307 k~~L~~L~~~ii~R~ 321 (322) ++.|..|++|+++|- T Consensus 284 ~~~L~~la~yii~R~ 298 (299) T PRK10581 284 TSALEALADYIIQRN 298 (299) T ss_pred HHHHHHHHHHHHHCC T ss_conf 699999999998378 No 8 >pfam00348 polyprenyl_synt Polyprenyl synthetase. Probab=100.00 E-value=0 Score=461.37 Aligned_cols=247 Identities=34% Similarity=0.603 Sum_probs=219.2 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHCCCCHHHCCCHHHHHH Q ss_conf 99999999967982279999999999708985786666557899999998752110112200110121010011023431 Q gi|254780755|r 30 IPDVVKYLIFSGGKRLRPMLTLATALMLEYRGDNHVLLACAIEFIHTATLLHDDVVDDSALRRGKVAARLVWGNQTSVLV 109 (322) Q Consensus 30 l~~~~~~~~~~gGKr~Rp~l~~l~~~~~~~~~~~~~~~a~~iEliH~asLiHDDI~D~~~~RRg~~t~~~~~G~~~Ail~ 109 (322) +.++++|++.+||||+||.|++++++++|++.+.++++|+++|++|+||||||||||+|++|||+||+|.+||++.||++ T Consensus 1 l~~am~y~~~~gGKrlRp~L~l~~~~~~g~~~~~~~~~A~avEllH~aSLIHDDI~D~s~~RRG~pt~h~~~G~~~Ail~ 80 (260) T pfam00348 1 LLAAMLYYLLAGGKRIRPLLVVLAARALGVEPETLLYLACAIEMIHTASLVHDDLMDNSDLRRGKPTCHKKFGEAGAILA 80 (260) T ss_pred CHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHCCHHHHHHH T ss_conf 97889988718934089999999999829988899999999999999999953324699877999860223111789997 Q ss_pred HHHHHHHHHHHHHCCCCC----CCCHH---HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 245788887642102431----10001---23456777777545655421322103578989986433555544321000 Q gi|254780755|r 110 GDFLLSQAFCMVIETKSQ----EALEA---LSLVACTLAEGELRQLSLSKNLDVTEEDYLHVIKSKTAVLFSAALEVSSL 182 (322) Q Consensus 110 GD~l~~~a~~~l~~~~~~----~~~~~---~~~~~~~l~~Gq~~d~~~~~~~~~s~~~y~~ii~~KTa~Lf~~~~~~ga~ 182 (322) ||+|++.++.++++.++. .+..+ +.....+...||.+|+... +.+.++++|++|++.|||+||++||.+|++ T Consensus 81 GD~L~~~a~~~l~~~~~~~~~~~~~~i~~~~~~~~~~g~~gQ~~d~~~~-~~~~s~~~y~~~~~~KTa~Lf~~~~~~gai 159 (260) T pfam00348 81 GDALLSRAFQLLALLGHVRPEPKYILISELANAVGAQGEVGQLMDLETE-GKDITLEEYLRIVSYKTAALFYASVQLGAI 159 (260) T ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 5899999999998657998399999999999999999999999987567-889999999999999759999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHCCCCCCCHHHHHHCCCCCC Q ss_conf 11222210157999997311168988764112367000384012345421365321001101431100013432036899 Q gi|254780755|r 183 IAGVQNSVRQALKSYGMNLGIAFQLVDDVLDYRGEVNEMGKNIGEDFRNGKPTLPVILAFQRGTKREKNFWKSTINDGKI 262 (322) Q Consensus 183 lag~~~~~~~~l~~fG~~lG~afQI~DD~ld~~g~~~~~GK~~g~Dl~egk~Tlpii~al~~~~~~~~~~l~~~~~~~~~ 262 (322) ++|.+++.++.+.+||+++|+||||+||++|++|+++.+|||+|+||++||+|||++++++++++++|+.+.++++++ T Consensus 160 lag~~~~~~~~l~~~g~~lGiaFQi~DDiLD~~gd~~~~GK~~g~Dl~egK~T~p~i~ale~a~~~~r~~l~~~~~~~-- 237 (260) T pfam00348 160 VAGADEEDEKDLYDFGRDLGLAFQIQDDILDLTGDTEELGKPVGTDLQEGKSTLPVLLALEGAREEQRKLLREAYEQK-- 237 (260) T ss_pred HCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHCCCCCCCHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHCCC-- T ss_conf 819999999999999987368999899764144898770999554675690369999999869899999999998387-- Q ss_pred CHHHHHHHHHHHHHCCH Q ss_conf 98999999999997496 Q gi|254780755|r 263 SVENLKKAFIIMRENRA 279 (322) Q Consensus 263 ~~~~~~~i~~ll~~~ga 279 (322) .++..+++.++++...+ T Consensus 238 ~~~~~~~~~~~~~~~~~ 254 (260) T pfam00348 238 LSEAAAQGVALTKALEA 254 (260) T ss_pred CHHHHHHHHHHHHHHHH T ss_conf 78999999999999888 No 9 >KOG0776 consensus Probab=100.00 E-value=0 Score=438.96 Aligned_cols=309 Identities=30% Similarity=0.428 Sum_probs=278.5 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCC-C-HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHC-CCHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999860778-7-5899999999967982279999999999708-9857866665578999999987 Q gi|254780755|r 4 LEDLTLKDMEKVNFLILKRICSD-V-EMIPDVVKYLIFSGGKRLRPMLTLATALMLE-YRGDNHVLLACAIEFIHTATLL 80 (322) Q Consensus 4 l~~~~~~~l~~v~~~i~~~~~~~-~-~~l~~~~~~~~~~gGKr~Rp~l~~l~~~~~~-~~~~~~~~~a~~iEliH~asLi 80 (322) ...++..++..++..+...+... + +.++...+|.+..+|||+||.+|++++++++ |+......+|+++||||+|||| T Consensus 67 ~~~~~~~~~~~ln~~l~~~~~~~~~~~~i~~a~ry~~la~gKr~rP~l~~~~~e~~~~g~~~~q~~~A~i~EMIHtaSLI 146 (384) T KOG0776 67 ELSYMARKARSLNGALHYAVPLANEPLLISEAMRYLLLAGGKRVRPLLCLAACELVGSGDESSQRSLAEIVEMIHTASLI 146 (384) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 89999999999864033312345665025788888887315505745654577760566507788899999999999998 Q ss_pred HHHH--HCHHHHHHHCCCCHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHC---CCC Q ss_conf 5211--01122001101210100110234312457888876421024311000123456777777545655421---322 Q gi|254780755|r 81 HDDV--VDDSALRRGKVAARLVWGNQTSVLVGDFLLSQAFCMVIETKSQEALEALSLVACTLAEGELRQLSLSK---NLD 155 (322) Q Consensus 81 HDDI--~D~~~~RRg~~t~~~~~G~~~Ail~GD~l~~~a~~~l~~~~~~~~~~~~~~~~~~l~~Gq~~d~~~~~---~~~ 155 (322) |||| ||++++|||+||.|+.||++.|||+||||++.|+..++.+.|+.+++.+++++.++++|++.+....+ +.+ T Consensus 147 HDDv~~mD~~d~RRGkpt~h~vfG~k~AvLaGD~LLa~A~~~la~l~n~~v~elm~~aI~dLv~ge~~~~~~~~~~~d~~ 226 (384) T KOG0776 147 HDDVPCMDDADLRRGKPTNHKVFGNKMAVLAGDALLALASEHLASLENPVVVELMASAIADLVRGEFTQGLVAGEGLDLD 226 (384) T ss_pred HCCCCCCCCCCCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 43754446431126887631120302554433789999999998616861899999999999976620244555444567 Q ss_pred -CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCC Q ss_conf -1035789899864335555443210001122221015799999731116898876411236700038401234542136 Q gi|254780755|r 156 -VTEEDYLHVIKSKTAVLFSAALEVSSLIAGVQNSVRQALKSYGMNLGIAFQLVDDVLDYRGEVNEMGKNIGEDFRNGKP 234 (322) Q Consensus 156 -~s~~~y~~ii~~KTa~Lf~~~~~~ga~lag~~~~~~~~l~~fG~~lG~afQI~DD~ld~~g~~~~~GK~~g~Dl~egk~ 234 (322) +.++.|..+..+|||+|++.+|++|++++|.+++.++.+++||+++|++||+.||++|+....+.+|||.|.|+..|+. T Consensus 227 ~~~~e~~e~~~~~KTAsLla~Sc~~~aILgg~s~ev~e~~~~yGR~lGL~fQvvDDildftkss~elGK~ag~Dl~~g~l 306 (384) T KOG0776 227 DVGLEYLEFKTLLKTASLLAKSCVAAAILGGGSEEVIEAAFEYGRCLGLAFQVVDDILDFTKSSEELGKTAGKDLKAGKL 306 (384) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHCCCCHHHHHHHCCC T ss_conf 75368899999998998999999999987289999999999998888989998621447543324307623345651334 Q ss_pred CCCCCHHHHCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 53210011014311000134320368999899999999999749699999999999999999842089998999999999 Q gi|254780755|r 235 TLPVILAFQRGTKREKNFWKSTINDGKISVENLKKAFIIMRENRALVDTEYRAYSYGQKAKDSLRCLPDSSWKKSLMEVV 314 (322) Q Consensus 235 Tlpii~al~~~~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~A~~~L~~lp~~~~k~~L~~L~ 314 (322) |+|+++++++.+. +.+.+.++...+.+..+..+.+. ++..+..+++.|.++|++.|+.+|.++.|++|++|+ T Consensus 307 T~P~Lf~~e~~pe-----~~e~l~~~~~e~~~~~~~~k~v~---~v~~a~~la~~~~~~Al~~l~~~p~s~ar~aL~~l~ 378 (384) T KOG0776 307 TAPVLFALEKSPE-----LREKLEREFSEPLDGFDADKAVP---GVALAKYLARRHNNKALEALQSLPRSEARSALENLV 378 (384) T ss_pred CCCCHHHHHHCHH-----HHHHHHHHCCCCCHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH T ss_conf 4640334342758-----99999874125200467899989---999999999998899999984799956899999999 Q ss_pred HHHHCC Q ss_conf 999603 Q gi|254780755|r 315 DFCLQR 320 (322) Q Consensus 315 ~~ii~R 320 (322) .+++.| T Consensus 379 ~~~~~r 384 (384) T KOG0776 379 LAVLTR 384 (384) T ss_pred HHHHCC T ss_conf 987339 No 10 >cd00867 Trans_IPPS Trans-Isoprenyl Diphosphate Synthases (Trans_IPPS) of class 1 isoprenoid biosynthesis enzymes which either synthesis geranyl/farnesyl diphosphates (GPP/FPP) or longer chained products from isoprene precursors, isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP), or use geranyl (C10)-, farnesyl (C15)-, or geranylgeranyl (C20)-diphosphate as substrate. These enzymes produce a myriad of precursors for such end products as steroids, cholesterol, sesquiterpenes, heme, carotenoids, retinoids, diterpenes, ubiquinone, and archaeal ether linked lipids; and are widely distributed among archaea, bacteria, and eukareya. The enzymes in this family share the same 'isoprenoid synthase fold' and include the head-to-tail (HT) IPPS which catalyze the successive 1'-4 condensation of the 5-carbon IPP to the growing isoprene chain to form linear, all-trans, C10-, C15-, C20- C25-, C30-, C35-, C40-, C45-, or C50-isoprenoid diphosphates. The head-to-head (HH) IPPS catalyze t Probab=100.00 E-value=0 Score=374.21 Aligned_cols=235 Identities=37% Similarity=0.553 Sum_probs=217.0 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHCCCCHHH-CCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 79999999999708985786666557899999998752110112200110121010-01102343124578888764210 Q gi|254780755|r 45 LRPMLTLATALMLEYRGDNHVLLACAIEFIHTATLLHDDVVDDSALRRGKVAARLV-WGNQTSVLVGDFLLSQAFCMVIE 123 (322) Q Consensus 45 ~Rp~l~~l~~~~~~~~~~~~~~~a~~iEliH~asLiHDDI~D~~~~RRg~~t~~~~-~G~~~Ail~GD~l~~~a~~~l~~ 123 (322) +||++++++++++|++.+.++.+|+++|++|++|||||||||+++.|||+||+|.+ ||++.||++||++++.+|..+++ T Consensus 1 ~Rp~l~~~~~~~~g~~~~~~~~~a~avElih~~slihDDi~D~~~~RRg~~t~~~~~~g~~~ail~gd~l~~~a~~~l~~ 80 (236) T cd00867 1 SRPLLVLLLARALGGDLEAALRLAAAVELLHAASLVHDDIVDDSDLRRGKPTAHLRRFGNALAILAGDYLLARAFQLLAR 80 (236) T ss_pred CCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHCHHHHHHHHHHHHHHHHHHHHH T ss_conf 96599999999839888899999999999999999977533699777898656888605578999720999999999862 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 24311000123456777777545655421322103578989986433555544321000112222101579999973111 Q gi|254780755|r 124 TKSQEALEALSLVACTLAEGELRQLSLSKNLDVTEEDYLHVIKSKTAVLFSAALEVSSLIAGVQNSVRQALKSYGMNLGI 203 (322) Q Consensus 124 ~~~~~~~~~~~~~~~~l~~Gq~~d~~~~~~~~~s~~~y~~ii~~KTa~Lf~~~~~~ga~lag~~~~~~~~l~~fG~~lG~ 203 (322) .++....+.+++++..+++||.+|+.+..+...|+++|++++++|||+||+++|.+|+++++.+++..+.+.+||+++|+ T Consensus 81 ~~~~~~~~~~~~~~~~l~~Gq~~Dl~~~~~~~~t~~~~~~~~~~KTa~l~~~~~~~~a~~~~~~~~~~~~l~~~g~~lG~ 160 (236) T cd00867 81 LGYPRALELFAEALRELLEGQALDLEFERDTYETLDEYLEYCRYKTAGLVGLLCLLGAGLSGADDEQAEALKDYGRALGL 160 (236) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH T ss_conf 38789999999999999998899976047888999999999998869999999999999819899999999999999999 Q ss_pred HHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHH Q ss_conf 68988764112367000384012345421365321001101431100013432036899989999999999974969999 Q gi|254780755|r 204 AFQLVDDVLDYRGEVNEMGKNIGEDFRNGKPTLPVILAFQRGTKREKNFWKSTINDGKISVENLKKAFIIMRENRALVDT 283 (322) Q Consensus 204 afQI~DD~ld~~g~~~~~GK~~g~Dl~egk~Tlpii~al~~~~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~~gai~~~ 283 (322) ||||+||++|++|+++.+|| +|+|+++||+|||++++ .+.+ T Consensus 161 AfQi~DDllD~~~d~~~~gk-~~~D~~~gk~tlp~~~~--------------------------------------~~~~ 201 (236) T cd00867 161 AFQLTDDLLDVFGDAEELGK-VGSDLREGRITLPVILA--------------------------------------RERA 201 (236) T ss_pred HHHHHHHHHHCCCCHHHHCC-CHHHHHCCCCHHHHHHH--------------------------------------HHHH T ss_conf 99999999884089677695-62568769720999999--------------------------------------9999 Q ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCC Q ss_conf 9999999999999842089998999999999999603 Q gi|254780755|r 284 EYRAYSYGQKAKDSLRCLPDSSWKKSLMEVVDFCLQR 320 (322) Q Consensus 284 ~~~~~~~~~~A~~~L~~lp~~~~k~~L~~L~~~ii~R 320 (322) ...++...+.+...+..+| ..++.|..+++++++| T Consensus 202 ~~~a~~~~~~~~~~~~~~~--~~~~~l~~l~~~i~~r 236 (236) T cd00867 202 AEYAEEAYAALEALPPSLP--RARRALIALADFLYRR 236 (236) T ss_pred HHHHHHHHHHHHHHHHHCC--HHHHHHHHHHHHHHCC T ss_conf 9999999999998666267--8689999999998629 No 11 >KOG0777 consensus Probab=100.00 E-value=4.3e-43 Score=284.51 Aligned_cols=280 Identities=22% Similarity=0.320 Sum_probs=246.7 Q ss_pred CHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHCCCCHHHCCCHHH Q ss_conf 75899999999967982279999999999708985786666557899999998752110112200110121010011023 Q gi|254780755|r 27 VEMIPDVVKYLIFSGGKRLRPMLTLATALMLEYRGDNHVLLACAIEFIHTATLLHDDVVDDSALRRGKVAARLVWGNQTS 106 (322) Q Consensus 27 ~~~l~~~~~~~~~~gGKr~Rp~l~~l~~~~~~~~~~~~~~~a~~iEliH~asLiHDDI~D~~~~RRg~~t~~~~~G~~~A 106 (322) ...+.++..|++.-+||.+|.-|......+..-+.++...+.-++|++|++||..|||.|+|++|||.|++|.+||.... T Consensus 20 ~~ill~Py~yilq~PGKqfR~~L~~afNhwl~~P~dkLaii~~ivemLHNsSLLIDDIEDNs~LRRG~pvaHsIyGvpSt 99 (322) T KOG0777 20 ESILLKPYNYILQKPGKQFRLNLIVAFNHWLNLPKDKLAIISQIVEMLHNSSLLIDDIEDNSPLRRGQPVAHSIYGVPST 99 (322) T ss_pred HHHHHCHHHHHHHCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCEEECCCCCCCHHHCCCCCHHHHCCCCCH T ss_conf 89985528899847408899999999999982979899999999999822430211123364221687221120167421 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCC-CCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4312457888876421024311000123456777777545655421322-103578989986433555544321000112 Q gi|254780755|r 107 VLVGDFLLSQAFCMVIETKSQEALEALSLVACTLAEGELRQLSLSKNLD-VTEEDYLHVIKSKTAVLFSAALEVSSLIAG 185 (322) Q Consensus 107 il~GD~l~~~a~~~l~~~~~~~~~~~~~~~~~~l~~Gq~~d~~~~~~~~-~s~~~y~~ii~~KTa~Lf~~~~~~ga~lag 185 (322) ||..+|+++.|.+.+..++.|..+++|.+-..++..||.+|+.|+.... .|++.|..|+-.|||.||.++..+--.++. T Consensus 100 INtANY~yFlalekV~qLdhP~a~kifteqLleLHrGQGldIYWRD~~tcPtee~Yk~Mv~~KTGGLF~La~rLMqlfS~ 179 (322) T KOG0777 100 INTANYMYFLALEKVSQLDHPNAIKIFTEQLLELHRGQGLDIYWRDFLTCPTEEMYKNMVMNKTGGLFRLALRLMQLFSH 179 (322) T ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 10357999999999972378118899999999985278850562113758878999999887415679999999999876 Q ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHCCCCCCCHHHHHHCCCCCCCHH Q ss_conf 22210157999997311168988764112367000384012345421365321001101431100013432036899989 Q gi|254780755|r 186 VQNSVRQALKSYGMNLGIAFQLVDDVLDYRGEVNEMGKNIGEDFRNGKPTLPVILAFQRGTKREKNFWKSTINDGKISVE 265 (322) Q Consensus 186 ~~~~~~~~l~~fG~~lG~afQI~DD~ld~~g~~~~~GK~~g~Dl~egk~Tlpii~al~~~~~~~~~~l~~~~~~~~~~~~ 265 (322) .-+.+..+-.-+|+.|||+||++++...+....|.+..|+.|||.++|+++|+.. ..+.+.+..++..+..+-+ T Consensus 180 ----~kedl~pl~n~LGl~fQIRDDY~NL~~keysenKsFaEDlTEGKfsFP~iHA~~t--~~q~~Qvl~ILrqRT~did 253 (322) T KOG0777 180 ----HKEDLVPLINLLGLIFQIRDDYLNLKDKEYSENKSFAEDLTEGKFSFPIIHALKT--KGQTEQVLRILRQRTSDID 253 (322) T ss_pred ----CCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCCCHHHHHCCCCCCCCCHHHHHC--CCCHHHHHHHHHHHHCCCH T ss_conf ----3424788999876863004444031055411122066662047667711044650--7740999999998612403 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 99999999997496999999999999999998420899989999999 Q gi|254780755|r 266 NLKKAFIIMRENRALVDTEYRAYSYGQKAKDSLRCLPDSSWKKSLME 312 (322) Q Consensus 266 ~~~~i~~ll~~~gai~~~~~~~~~~~~~A~~~L~~lp~~~~k~~L~~ 312 (322) -.+..+.++++.|+++|++...++...+|+..++....+++-..|.. T Consensus 254 iKkyci~~LEd~gSf~YTrn~l~~L~a~a~~~i~~~g~Npyl~~L~~ 300 (322) T KOG0777 254 IKKYCIQILEDTGSFAYTRNFLNQLVAEARSMIKNDGENPYLPDLAS 300 (322) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHH T ss_conf 99999999997341798999999999999999982699955599998 No 12 >KOG0711 consensus Probab=100.00 E-value=2.3e-40 Score=267.81 Aligned_cols=316 Identities=20% Similarity=0.208 Sum_probs=245.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCC----C------HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCH-------HHH Q ss_conf 4799999999999999999860778----7------5899999999967982279999999999708985-------786 Q gi|254780755|r 2 KMLEDLTLKDMEKVNFLILKRICSD----V------EMIPDVVKYLIFSGGKRLRPMLTLATALMLEYRG-------DNH 64 (322) Q Consensus 2 ~~l~~~~~~~l~~v~~~i~~~~~~~----~------~~l~~~~~~~~~~gGKr~Rp~l~~l~~~~~~~~~-------~~~ 64 (322) ...+...+..+.++-+.+.+.+... . +.+.....|- ..|||-.|.+.++-+.++..++. ..+ T Consensus 2 ~~~~~~~k~~f~~~f~~~vr~i~~~~~~~~~~~da~~~~~~~L~yN-~~GGK~nRgl~vv~s~~~L~~~~~l~~~~~~~a 80 (347) T KOG0711 2 SDVYAQEKQDFLQVFPVLVRVLTEDLMAHGESGDATEWLKEVLDYN-VIGGKLNRGLSVVDSFKALVEPRKLDEEELQLA 80 (347) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCC-CCCCCCCCCHHHHHHHHHHCCCCCCCHHHHHHH T ss_conf 3277889999999879999999987520689767999999987602-767522454048999998657767788999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCHHHHHHHCCCCHHHCCCH-HHHHHHHHHHHHHHHHHHC-CC----CCCCCHHHHHHHH Q ss_conf 6665578999999987521101122001101210100110-2343124578888764210-24----3110001234567 Q gi|254780755|r 65 VLLACAIEFIHTATLLHDDVVDDSALRRGKVAARLVWGNQ-TSVLVGDFLLSQAFCMVIE-TK----SQEALEALSLVAC 138 (322) Q Consensus 65 ~~~a~~iEliH~asLiHDDI~D~~~~RRg~~t~~~~~G~~-~Ail~GD~l~~~a~~~l~~-~~----~~~~~~~~~~~~~ 138 (322) ..+++++|+++++.||-|||||+|.+|||+||++++-|.+ .|||-+-.|.+....++.+ .+ -.++++.+.+... T Consensus 81 ~~lGw~vElLQaffLiaDDIMDnS~tRRGqpCWy~~~gVG~~AINDA~lLea~Iy~lLkk~fr~~~~y~~l~elf~ev~f 160 (347) T KOG0711 81 LILGWCVELLQAFFLVADDIMDNSKTRRGQPCWYQKPGVGLDAINDAFLLEAAIYKLLKKHFRNIYCYVDLVELFHEVTF 160 (347) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH T ss_conf 99989999999889875453201002378843466688636665089999999999999862688607889998878888 Q ss_pred HHHHHHHHHHHHHC--CCCCCHHHHHHHHHHHHHHH-HHHHHHHHHHHHH-HHHHHHHHHHHHHHHCCCHHHHHHHHHCC Q ss_conf 77777545655421--32210357898998643355-5544321000112-22210157999997311168988764112 Q gi|254780755|r 139 TLAEGELRQLSLSK--NLDVTEEDYLHVIKSKTAVL-FSAALEVSSLIAG-VQNSVRQALKSYGMNLGIAFQLVDDVLDY 214 (322) Q Consensus 139 ~l~~Gq~~d~~~~~--~~~~s~~~y~~ii~~KTa~L-f~~~~~~ga~lag-~~~~~~~~l~~fG~~lG~afQI~DD~ld~ 214 (322) .++.||.++..... -...|++.|-.|+++|||.+ |-+|..+|.++|| ...+.......+...+|..||+|||+||+ T Consensus 161 ~T~lGdllt~~~~~~~ls~fsl~~y~~Iv~~KTa~YsFYLPialAl~~ag~~~~k~~~~~k~v~~~lg~~FQvQDDYLd~ 240 (347) T KOG0711 161 QTELGDLLTTPEGNKDLSKFSLEKYVFIVEYKTAYYSFYLPVALALLLAGIANLKEHACEKKVLLLLGEYFQVQDDYLDC 240 (347) T ss_pred HHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHH T ss_conf 87630101475433037664177777886313120233018999999842305887756889999998898310677774 Q ss_pred CCCCCCCCCHHHHHHHHCCCCCCCCHHHHCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 36700038401234542136532100110143110001343203689998999999999997496999999999999999 Q gi|254780755|r 215 RGEVNEMGKNIGEDFRNGKPTLPVILAFQRGTKREKNFWKSTINDGKISVENLKKAFIIMRENRALVDTEYRAYSYGQKA 294 (322) Q Consensus 215 ~g~~~~~GK~~g~Dl~egk~Tlpii~al~~~~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~A 294 (322) +||++.+|| +|+||.++|+||.++.|++.+++++.+.+...+++ ..++.+..+..+.++.+.-..-...-.....+- T Consensus 241 fgDp~vtgk-iGtDIqDnKCsWlv~~al~~~~~eq~~~l~~~yg~--~~~~~v~~vk~ly~el~l~~~f~~yE~~~~~~I 317 (347) T KOG0711 241 FGDPEVTGK-IGTDIQDNKCSWLVVKALQRASAEQYKILFENYGK--PEAEAVAKVKALYKELHLPALFIEYEEGSYKKI 317 (347) T ss_pred CCCHHHCCC-CCCCCCCCCEEEEHHHHHHHCCHHHHHHHHHHCCC--CCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf 288212177-77745567302531887764279999999986388--557999999888887466999999602478999 Q ss_pred HHHHHCCCC--CHHHHHHHHHHHHHHCCC Q ss_conf 998420899--989999999999996037 Q gi|254780755|r 295 KDSLRCLPD--SSWKKSLMEVVDFCLQRL 321 (322) Q Consensus 295 ~~~L~~lp~--~~~k~~L~~L~~~ii~R~ 321 (322) ...|+.++. ++.+..+..++.-+.+|. T Consensus 318 k~~I~~~~~~~~~~~~v~t~fl~kiykr~ 346 (347) T KOG0711 318 KKLISQVDEDTGVKVKVGTSFLNKIYKRS 346 (347) T ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHHC T ss_conf 99998635777631436999999998502 No 13 >cd00385 Isoprenoid_Biosyn_C1 Isoprenoid Biosynthesis enzymes, Class 1; Superfamily of trans-isoprenyl diphosphate synthases (IPPS) and class I terpene cyclases which either synthesis geranyl/farnesyl diphosphates (GPP/FPP) or longer chained products from isoprene precursors, isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP), or use geranyl (C10)-, farnesyl (C15)-, or geranylgeranyl (C20)-diphosphate as substrate. These enzymes produce a myriad of precursors for such end products as steroids, cholesterol, sesquiterpenes, heme, carotenoids, retinoids, and diterpenes; and are widely distributed among archaea, bacteria, and eukaryota.The enzymes in this superfamily share the same 'isoprenoid synthase fold' and include several subgroups. The head-to-tail (HT) IPPS catalyze the successive 1'-4 condensation of the 5-carbon IPP to the growing isoprene chain to form linear, all-trans, C10-, C15-, C20- C25-, C30-, C35-, C40-, C45-, or C50-isoprenoid diphosphates. Cyclic monoter Probab=100.00 E-value=1.6e-36 Score=244.15 Aligned_cols=225 Identities=34% Similarity=0.454 Sum_probs=198.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCHHHHHHHCCCCHHH---CCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHH Q ss_conf 666557899999998752110112200110121010---01102343124578888764210243110001234567777 Q gi|254780755|r 65 VLLACAIEFIHTATLLHDDVVDDSALRRGKVAARLV---WGNQTSVLVGDFLLSQAFCMVIETKSQEALEALSLVACTLA 141 (322) Q Consensus 65 ~~~a~~iEliH~asLiHDDI~D~~~~RRg~~t~~~~---~G~~~Ail~GD~l~~~a~~~l~~~~~~~~~~~~~~~~~~l~ 141 (322) ...++++|++|++++|||||+|++..|||+|+.|.. ||...|++.||++++.++..+....++.....+.+++.+++ T Consensus 13 ~~~~~~~~~~~~~~~i~DDi~D~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 92 (243) T cd00385 13 SRLRAAVEKLHAASLVHDDIVDDSGTRRGLPTAHLAVAIDGLPEAILAGDLLLADAFEELAREGSPEALEILAEALLDLL 92 (243) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 99999999999999999715669999886385999987048399999999999999999986478999999999999999 Q ss_pred HHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCC Q ss_conf 77545655421322103578989986433555544321000112222101579999973111689887641123670003 Q gi|254780755|r 142 EGELRQLSLSKNLDVTEEDYLHVIKSKTAVLFSAALEVSSLIAGVQNSVRQALKSYGMNLGIAFQLVDDVLDYRGEVNEM 221 (322) Q Consensus 142 ~Gq~~d~~~~~~~~~s~~~y~~ii~~KTa~Lf~~~~~~ga~lag~~~~~~~~l~~fG~~lG~afQI~DD~ld~~g~~~~~ 221 (322) .||..|+.+..+...|.++|+++++.|||.++..++..|+..++.+.+..+.+.++|.++|++||+.||++|++++.+.+ T Consensus 93 ~g~~~d~~~~~~~~~s~~~y~~~~~~kta~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~a~ql~nDl~d~~~d~~~~ 172 (243) T cd00385 93 EGQLLDLKWRREYVPTLEEYLEYCRYKTAGLVGALCLLGAGLSGGEAELLEALRKLGRALGLAFQLTNDLLDYEGDAERG 172 (243) T ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 99999998657999999999997422899999999999999809999999999999999999999999562468480248 Q ss_pred CCHHHHHHHHCCCCCCCCHHHHCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 84012345421365321001101431100013432036899989999999999974969999999999999999984208 Q gi|254780755|r 222 GKNIGEDFRNGKPTLPVILAFQRGTKREKNFWKSTINDGKISVENLKKAFIIMRENRALVDTEYRAYSYGQKAKDSLRCL 301 (322) Q Consensus 222 GK~~g~Dl~egk~Tlpii~al~~~~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~A~~~L~~l 301 (322) +||.|+|.+++.+.....++.. .+.+++..+.+...+..+.+++++.++.+ T Consensus 173 ---------~g~~t~p~~~~~~~~~~~~~~~--------------------~~~~~~~~~~a~~~~~~~~~~~~~~l~~~ 223 (243) T cd00385 173 ---------EGKCTLPVLYALEYGVPAEDLL--------------------LVEKSGSLEEALEELAKLAEEALKELNEL 223 (243) T ss_pred ---------CCCCCHHHHHHHHHCCHHHHHH--------------------HHCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf ---------9986699999988488376655--------------------40135869999999999999999999999 Q ss_pred CCC--HHHHHHHHHHHHHH Q ss_conf 999--89999999999996 Q gi|254780755|r 302 PDS--SWKKSLMEVVDFCL 318 (322) Q Consensus 302 p~~--~~k~~L~~L~~~ii 318 (322) +.+ ..+..+..++.+++ T Consensus 224 ~~~~~~~~~~~~~~~~~~~ 242 (243) T cd00385 224 ILSLPDVPRALLALALNLY 242 (243) T ss_pred HCCCCCCCHHHHHHHHCCC T ss_conf 7788889999999986025 No 14 >cd00683 Trans_IPPS_HH Trans-Isoprenyl Diphosphate Synthases (Trans_IPPS), head-to-head (HH) (1'-1) condensation reaction. This CD includes squalene and phytoene synthases which catalyze the 1'-1 condensation of two 15-carbon (farnesyl) and 20-carbon (geranylgeranyl) isoprenyl diphosphates, respectively. The catalytic site consists of a large central cavity formed by mostly antiparallel alpha helices with two aspartate-rich regions (DXXXD) located on opposite walls. These residues mediate binding of prenyl phosphates. A two-step reaction has been proposed for squalene synthase (farnesyl-diphosphate farnesyltransferase) in which, two molecules of FPP react to form a stable cyclopropylcarbinyl diphosphate intermediate, and then the intermediate undergoes heterolysis, isomerization, and reduction with NADPH to form squalene, a precursor of cholestrol. The carotenoid biosynthesis enzyme, phytoene synthase (CrtB), catalyzes the condensation reaction of two molecules of geranylgeranyl diphosp Probab=97.61 E-value=0.0029 Score=39.83 Aligned_cols=135 Identities=23% Similarity=0.161 Sum_probs=82.0 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHC Q ss_conf 34567777775456554213221035789899864335555443210001122221015799999731116898876411 Q gi|254780755|r 134 SLVACTLAEGELRQLSLSKNLDVTEEDYLHVIKSKTAVLFSAALEVSSLIAGVQNSVRQALKSYGMNLGIAFQLVDDVLD 213 (322) Q Consensus 134 ~~~~~~l~~Gq~~d~~~~~~~~~s~~~y~~ii~~KTa~Lf~~~~~~ga~lag~~~~~~~~l~~fG~~lG~afQI~DD~ld 213 (322) .....++.+|...|+.... -.|+++...-.++-.|+...+++.+ ++ . ...+....++.++|+|+|+.|=+.| T Consensus 92 ~~~~~~li~g~~~Dl~~~~--~~t~~eL~~Yc~~vAg~VG~l~~~i---~g-~--~~~~~~~~~A~~lG~AlQltNilRD 163 (265) T cd00683 92 REPFRDLLAGMAMDLDKRR--YETLDELDEYCYYVAGVVGLMLLRV---FG-A--SSDEAALERARALGLALQLTNILRD 163 (265) T ss_pred HHHHHHHHHHHHHHCCCCC--CCCHHHHHHHHHHHHHHHHHHHHHH---HC-C--CCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8999999999998756789--9999999999997117999999999---58-7--8818899999998899999989987 Q ss_pred CCCCCCCCCCHHHHHHHHCCCCCCCCHHHHCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 23670003840123454213653210011014311000134320368999899999999999749699999999999999 Q gi|254780755|r 214 YRGEVNEMGKNIGEDFRNGKPTLPVILAFQRGTKREKNFWKSTINDGKISVENLKKAFIIMRENRALVDTEYRAYSYGQK 293 (322) Q Consensus 214 ~~g~~~~~GK~~g~Dl~egk~Tlpii~al~~~~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~ 293 (322) + +.|...|++-+|.-..-+..-. . ..+. .+..+ ..+..+ +++...+++.|.+. T Consensus 164 i-----------~eD~~~gRiYlP~~~l~~~gv~-~----~dl~-~~~~~-~~~~~~---------~~~~~~~A~~~~~~ 216 (265) T cd00683 164 V-----------GEDARRGRIYLPREELARFGVT-L----EDLL-APENS-PAFRAL---------LRRLIARARAHYRE 216 (265) T ss_pred H-----------HHHHHCCCEECCHHHHHHCCCC-H----HHHH-CCCCC-HHHHHH---------HHHHHHHHHHHHHH T ss_conf 9-----------9999589157889999884999-9----9984-76688-999999---------99999999999999 Q ss_pred HHHHHHCCCC Q ss_conf 9998420899 Q gi|254780755|r 294 AKDSLRCLPD 303 (322) Q Consensus 294 A~~~L~~lp~ 303 (322) |...+..+|. T Consensus 217 a~~~~~~lp~ 226 (265) T cd00683 217 ALAGLAALPR 226 (265) T ss_pred HHHHHHHCCH T ss_conf 9998986999 No 15 >pfam00494 SQS_PSY Squalene/phytoene synthase. Probab=97.44 E-value=0.002 Score=40.77 Aligned_cols=141 Identities=23% Similarity=0.228 Sum_probs=86.4 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHC Q ss_conf 34567777775456554213221035789899864335555443210001122221015799999731116898876411 Q gi|254780755|r 134 SLVACTLAEGELRQLSLSKNLDVTEEDYLHVIKSKTAVLFSAALEVSSLIAGVQNSVRQALKSYGMNLGIAFQLVDDVLD 213 (322) Q Consensus 134 ~~~~~~l~~Gq~~d~~~~~~~~~s~~~y~~ii~~KTa~Lf~~~~~~ga~lag~~~~~~~~l~~fG~~lG~afQI~DD~ld 213 (322) .....++++|...|+.... -.|+++...-.++-.|+...+.+.+- |..+ ..+.+..++.++|+|+|+.|=+.| T Consensus 88 ~~~~~~li~g~~~Dl~~~~--~~t~~dL~~Y~~~vAg~Vg~l~~~i~----g~~~-~~~~~~~~A~~lG~AlQltNilRD 160 (262) T pfam00494 88 REPFLELIDGMEMDLEKDR--YETLAELEEYCYRVAGVVGLLLLRLL----GVRD-DAEAADEAARHLGLALQLTNILRD 160 (262) T ss_pred HHHHHHHHHHHHHHHCCCC--CCCHHHHHHHHHHHHHHHHHHHHHHH----CCCC-CHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8999999999998736799--99999999999881689999999995----7789-458899999999999999999974 Q ss_pred CCCCCCCCCCHHHHHHHHCCCCCCCCHHHHCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 23670003840123454213653210011014311000134320368999899999999999749699999999999999 Q gi|254780755|r 214 YRGEVNEMGKNIGEDFRNGKPTLPVILAFQRGTKREKNFWKSTINDGKISVENLKKAFIIMRENRALVDTEYRAYSYGQK 293 (322) Q Consensus 214 ~~g~~~~~GK~~g~Dl~egk~Tlpii~al~~~~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~ 293 (322) + +.|...|++-+|.-..-+..-. . .++.. +..++ .+..+ +.+....++.+.+. T Consensus 161 i-----------~eD~~~gR~YLP~~~l~~~g~~-~----~~l~~-~~~~~-~~~~~---------~~~~~~~a~~~~~~ 213 (262) T pfam00494 161 V-----------GEDARRGRVYLPADWLARFGVS-P----EDLLR-GRASP-ELRAL---------IAELAARARAHLAE 213 (262) T ss_pred H-----------HHHHCCCCEECCHHHHHHCCCC-H----HHHHC-CCCCH-HHHHH---------HHHHHHHHHHHHHH T ss_conf 6-----------9998399277888999885999-9----99736-67999-99999---------99999999999999 Q ss_pred HHHHHHCCCCCHHHH Q ss_conf 999842089998999 Q gi|254780755|r 294 AKDSLRCLPDSSWKK 308 (322) Q Consensus 294 A~~~L~~lp~~~~k~ 308 (322) |...+..+|+...+. T Consensus 214 a~~~~~~l~p~~~r~ 228 (262) T pfam00494 214 ARALLALLPPRRARL 228 (262) T ss_pred HHHHHHHCCCHHHHH T ss_conf 999885389876789 No 16 >TIGR03464 HpnC squalene synthase HpnC. This family of genes are members of a superfamily (pfam00494) of phytoene and squalene synthases which catalyze the head-t0-head condensation of polyisoprene pyrophosphates. The genes of this family are often found in the same genetic locus with squalene-hopene cyclase genes, and are never associated with genes for the metabolism of phytoene. In the organisms Zymomonas mobilis and Bradyrhizobium japonicum these genes have been characterized as squalene synthases (farnesyl-pyrophosphate ligases). Often, these genes appear in tandem with the HpnD gene which appears to have resulted from an ancient gene duplication event. Presumably these proteins form a heteromeric complex, but this has not yet been experimentally demonstrated. Probab=97.22 E-value=0.011 Score=36.22 Aligned_cols=133 Identities=16% Similarity=0.172 Sum_probs=78.9 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCC Q ss_conf 45677777754565542132210357898998643355554432100011222210157999997311168988764112 Q gi|254780755|r 135 LVACTLAEGELRQLSLSKNLDVTEEDYLHVIKSKTAVLFSAALEVSSLIAGVQNSVRQALKSYGMNLGIAFQLVDDVLDY 214 (322) Q Consensus 135 ~~~~~l~~Gq~~d~~~~~~~~~s~~~y~~ii~~KTa~Lf~~~~~~ga~lag~~~~~~~~l~~fG~~lG~afQI~DD~ld~ 214 (322) ....++..|...|+.... -.|+++...-.+.-.|+...+++. ++ |.+. +....++.++|+|+|+.|=+.|+ T Consensus 86 ~~~~~li~g~~~Dl~~~~--~~t~~dL~~Yc~~vAg~VG~~~~~---i~-g~~~---~~~~~~a~~lG~AlQltNilRDi 156 (266) T TIGR03464 86 EPFLDLLDAFRQDVVVTR--YATWAELLDYCRRSANPVGRLVLD---LY-GASD---PENVALSDAICTALQLINFWQDV 156 (266) T ss_pred HHHHHHHHHHHHCCCCCC--CCCHHHHHHHHHHHHHHHHHHHHH---HH-CCCC---HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999884146799--999999999999963899999999---82-7898---88999999997999999999985 Q ss_pred CCCCCCCCCHHHHHHHHCCCCCCCCHHHHCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 36700038401234542136532100110143110001343203689998999999999997496999999999999999 Q gi|254780755|r 215 RGEVNEMGKNIGEDFRNGKPTLPVILAFQRGTKREKNFWKSTINDGKISVENLKKAFIIMRENRALVDTEYRAYSYGQKA 294 (322) Q Consensus 215 ~g~~~~~GK~~g~Dl~egk~Tlpii~al~~~~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~A 294 (322) +.|+..|++-+|.-..-+..-.. + .+. .+..+ ..+..+ ++....+++.|.++| T Consensus 157 -----------~eD~~~gRiYLP~e~L~~~gv~~--~---~l~-~~~~~-~~~~~~---------~~~~~~~A~~~y~~a 209 (266) T TIGR03464 157 -----------GKDLRKGRVYLPRDDLARFGVSE--E---DLA-AGRAT-PALREL---------MAFEVSRTRALLDRG 209 (266) T ss_pred -----------HHHHHCCCEECCHHHHHHCCCCH--H---HHH-CCCCC-HHHHHH---------HHHHHHHHHHHHHHH T ss_conf -----------87998798568699998859999--9---983-67899-899999---------999999999999989 Q ss_pred HHHHHCCCC Q ss_conf 998420899 Q gi|254780755|r 295 KDSLRCLPD 303 (322) Q Consensus 295 ~~~L~~lp~ 303 (322) ...+..+|. T Consensus 210 ~~~~~~lp~ 218 (266) T TIGR03464 210 APLAARIPG 218 (266) T ss_pred HHHHHHCCH T ss_conf 884875998 No 17 >pfam07307 HEPPP_synt_1 Heptaprenyl diphosphate synthase (HEPPP synthase) subunit 1. This family contains subunit 1 of bacterial heptaprenyl diphosphate synthase (HEPPP synthase) (EC:2.5.1.30) (approximately 230 residues long). The enzyme consists of two subunits, both of which are required for catalysis of heptaprenyl diphosphate synthesis. Probab=96.98 E-value=0.018 Score=34.87 Aligned_cols=109 Identities=18% Similarity=0.238 Sum_probs=74.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHCCCCHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHH Q ss_conf 89857866665578999999987521101122001101210100110234312457888876421024311000123456 Q gi|254780755|r 58 EYRGDNHVLLACAIEFIHTATLLHDDVVDDSALRRGKVAARLVWGNQTSVLVGDFLLSQAFCMVIETKSQEALEALSLVA 137 (322) Q Consensus 58 ~~~~~~~~~~a~~iEliH~asLiHDDI~D~~~~RRg~~t~~~~~G~~~Ail~GD~l~~~a~~~l~~~~~~~~~~~~~~~~ 137 (322) +-.++++.....++=|++.|-..||-|-.....+.+ .++. ....||+|||.=+..+.++++.++...++.+++++ T Consensus 29 ~~~~~~~~~~i~t~mLvq~aLDTHd~V~~~~~~~~~---~~k~--RQLtVLAGDyyS~lYY~lLa~~~~i~lIr~la~~I 103 (212) T pfam07307 29 ELTPEQAERYILTAMLVQIALDTHDKVSNANAETQE---TKKN--RQLTVLAGDYYSGLYYYLLSESGDIALIRLLAEAI 103 (212) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCH---HHHH--HHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 999899999999999999998778764542312103---5563--10320110233189999998379869999999999 Q ss_pred HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 7777775456554213221035789899864335555 Q gi|254780755|r 138 CTLAEGELRQLSLSKNLDVTEEDYLHVIKSKTAVLFS 174 (322) Q Consensus 138 ~~l~~Gq~~d~~~~~~~~~s~~~y~~ii~~KTa~Lf~ 174 (322) .++.+-.+.=- .....+.+++++-+..=-++|+. T Consensus 104 ~eiNe~Ki~lY---q~~~~~~e~~~~sv~~iESaL~~ 137 (212) T pfam07307 104 KEINEHKIRLY---QKKVETFEELMESVKTIESALFA 137 (212) T ss_pred HHHHHHHHHHH---HHHCCCHHHHHHHHHHHHHHHHH T ss_conf 99999999999---84038899999999999999999 No 18 >TIGR03465 HpnD squalene synthase HpnD. The genes of this family are often found in the same genetic locus with squalene-hopene cyclase genes, and are never associated with genes for the metabolism of phytoene. In the organisms Zymomonas mobilis and Bradyrhizobium japonicum these genes have been characterized as squalene synthases (farnesyl-pyrophosphate ligases). Often, these genes appear in tandem with the HpnC gene which appears to have resulted from an ancient gene duplication event. Presumably these proteins form a heteromeric complex, but this has not yet been experimentally demonstrated. Probab=96.76 E-value=0.022 Score=34.43 Aligned_cols=134 Identities=19% Similarity=0.157 Sum_probs=80.4 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCC Q ss_conf 45677777754565542132210357898998643355554432100011222210157999997311168988764112 Q gi|254780755|r 135 LVACTLAEGELRQLSLSKNLDVTEEDYLHVIKSKTAVLFSAALEVSSLIAGVQNSVRQALKSYGMNLGIAFQLVDDVLDY 214 (322) Q Consensus 135 ~~~~~l~~Gq~~d~~~~~~~~~s~~~y~~ii~~KTa~Lf~~~~~~ga~lag~~~~~~~~l~~fG~~lG~afQI~DD~ld~ 214 (322) +...++..|...|+.... -.|+++...-.+.-.|+...+++.+ + |..+ +....++.++|+|+|+.|=+.|+ T Consensus 85 ~~~~~li~g~~~Dl~~~~--~~t~~~L~~Y~~~vA~~vg~m~~~i---l-g~~~---~~~~~~A~~lG~A~QltNilRDv 155 (266) T TIGR03465 85 EDFLEVIDGMEMDLEQTR--YPDFAELDLYCDRVAGAVGRLSARI---F-GATD---ARTLEYAHHLGRALQLTNILRDV 155 (266) T ss_pred HHHHHHHHHHHCCCCCCC--CCCHHHHHHHHHHHHHHHHHHHHHH---C-CCCC---HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999985578999--9999999999998636999999988---0-8997---57899999999999999999971 Q ss_pred CCCCCCCCCHHHHHHHHCCCCCCCCHHHHCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 36700038401234542136532100110143110001343203689998999999999997496999999999999999 Q gi|254780755|r 215 RGEVNEMGKNIGEDFRNGKPTLPVILAFQRGTKREKNFWKSTINDGKISVENLKKAFIIMRENRALVDTEYRAYSYGQKA 294 (322) Q Consensus 215 ~g~~~~~GK~~g~Dl~egk~Tlpii~al~~~~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~A 294 (322) +.|...|++-+|.-..-+..-. . + ++. .+..++ .+.. + +.+...+++.|.+.| T Consensus 156 -----------~eD~~~gR~YLP~d~l~~~gv~-~-~---~l~-~~~~~~-~~~~---~------~~~~~~~A~~~~~~a 208 (266) T TIGR03465 156 -----------GEDARRGRIYLPAEELQRFGVP-A-A---DIL-EGRYSP-ALAA---L------CRFQAERARAHYAEA 208 (266) T ss_pred -----------HHHHHCCCEECCHHHHHHCCCC-H-H---HHH-CCCCCH-HHHH---H------HHHHHHHHHHHHHHH T ss_conf -----------8899789975779999886999-8-8---873-478887-8999---9------999999999999999 Q ss_pred HHHHHCCCCC Q ss_conf 9984208999 Q gi|254780755|r 295 KDSLRCLPDS 304 (322) Q Consensus 295 ~~~L~~lp~~ 304 (322) ...+..+|.. T Consensus 209 ~~~~~~lp~~ 218 (266) T TIGR03465 209 DALLPACDRR 218 (266) T ss_pred HHHHHHCCHH T ss_conf 9758648998 No 19 >PRK13594 consensus Probab=90.06 E-value=1.5 Score=23.17 Aligned_cols=164 Identities=16% Similarity=0.119 Sum_probs=76.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHCC---CCHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHH Q ss_conf 89857866665578999999987521101122001101---210100110234312457888876421024311000123 Q gi|254780755|r 58 EYRGDNHVLLACAIEFIHTATLLHDDVVDDSALRRGKV---AARLVWGNQTSVLVGDFLLSQAFCMVIETKSQEALEALS 134 (322) Q Consensus 58 ~~~~~~~~~~a~~iEliH~asLiHDDI~D~~~~RRg~~---t~~~~~G~~~Ail~GD~l~~~a~~~l~~~~~~~~~~~~~ 134 (322) |..+....-+.+++=++-.+.-+..|+.|-..=|-++| =...+...+.|....-.++..+.-... .-++...-. T Consensus 31 ~~~~~~~~l~~~~v~~~~a~g~~iND~~D~eiD~iN~P~RPipsg~is~~~Al~~~~~l~~~g~~~~~-~~~~~~~~i-- 107 (278) T PRK13594 31 GTLPAESLLLFCIVLLITGAGNVINDYFDKEIDAINRPDRPIPSGQISPGGALAYAILLFLIGNAIAI-LTTPLCLGI-- 107 (278) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH-- T ss_conf 88865889999999999999999997878604534599998888862999999999999999999999-980999999-- Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHCCCHHHHHHHHHC Q ss_conf 45677777754565542132210357898998643355554432100011222-21015799999731116898876411 Q gi|254780755|r 135 LVACTLAEGELRQLSLSKNLDVTEEDYLHVIKSKTAVLFSAALEVSSLIAGVQ-NSVRQALKSYGMNLGIAFQLVDDVLD 213 (322) Q Consensus 135 ~~~~~l~~Gq~~d~~~~~~~~~s~~~y~~ii~~KTa~Lf~~~~~~ga~lag~~-~~~~~~l~~fG~~lG~afQI~DD~ld 213 (322) +.....-+-......++.. . .- .-++..=+| .+...|+...|.. ....-.+.-+..-+.++--|.-|+.| T Consensus 108 -~l~~~~ll~~Ys~~LK~~~-l-iG--Ni~Va~l~g----~~~~fg~~~~~~~~~~~~~~l~~~aFl~~l~REIvKDieD 178 (278) T PRK13594 108 -ALVNSVILILYAARLKKTP-F-LG--NISVAYLAA----SIFLFGGAIAGTEGLLSMLPIAAITFLAMLARELLKDAED 178 (278) T ss_pred -HHHHHHHHHHHHHHHHHCC-C-HH--HHHHHHHHH----HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf -9999999999999997567-2-68--899999988----9999999997456418999999999999999999945530 Q ss_pred CCCCCCCCCCHHHHHHHHCCCCCCCCHHHHC Q ss_conf 2367000384012345421365321001101 Q gi|254780755|r 214 YRGEVNEMGKNIGEDFRNGKPTLPVILAFQR 244 (322) Q Consensus 214 ~~g~~~~~GK~~g~Dl~egk~Tlpii~al~~ 244 (322) +.| |.++|..|+|+.+-.++ T Consensus 179 ieG-----------D~~~g~~TLPi~~G~kk 198 (278) T PRK13594 179 IEG-----------DRAHGARTLPMLIGVRK 198 (278) T ss_pred CCC-----------HHHCCCCCCCHHHHHHH T ss_conf 123-----------88779974077971898 No 20 >KOG1166 consensus Probab=86.45 E-value=0.13 Score=29.56 Aligned_cols=87 Identities=18% Similarity=0.250 Sum_probs=54.8 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HCHHHHHHHCCCCHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH Q ss_conf 66655789999999875211-01122001101210100110234312457888876421024311000123456777777 Q gi|254780755|r 65 VLLACAIEFIHTATLLHDDV-VDDSALRRGKVAARLVWGNQTSVLVGDFLLSQAFCMVIETKSQEALEALSLVACTLAEG 143 (322) Q Consensus 65 ~~~a~~iEliH~asLiHDDI-~D~~~~RRg~~t~~~~~G~~~Ail~GD~l~~~a~~~l~~~~~~~~~~~~~~~~~~l~~G 143 (322) ...+-+||.+|..-.||-|| -|+--+||+. +..|+ +.|=+|+ --| T Consensus 801 ~qml~ive~lH~~~IIHgDiKPDNfll~~~~---~~~~~-----~~~l~lI--------------------------DfG 846 (974) T KOG1166 801 CQMLRIVEHLHAMGIIHGDIKPDNFLLRREI---CADSD-----SKGLYLI--------------------------DFG 846 (974) T ss_pred HHHHHHHHHHHHCCEECCCCCCCEEEEECCC---CCCCC-----CCCEEEE--------------------------ECC T ss_conf 9999999998751631046786514751466---78886-----6546999--------------------------626 Q ss_pred HHHHH-------HHHCCCCCCHHHHHHHHHHHH----HHHHHHHHHHHHHHHH Q ss_conf 54565-------542132210357898998643----3555544321000112 Q gi|254780755|r 144 ELRQL-------SLSKNLDVTEEDYLHVIKSKT----AVLFSAALEVSSLIAG 185 (322) Q Consensus 144 q~~d~-------~~~~~~~~s~~~y~~ii~~KT----a~Lf~~~~~~ga~lag 185 (322) +..|| .+...+....-++.+|..+|. .=||+.++.+-.+|.| T Consensus 847 ~siDm~lfp~~~~F~~~~~td~f~C~EM~~grpWtYq~DyfGlAa~~h~mLFG 899 (974) T KOG1166 847 RSIDMKLFPDGTKFKAVWHTDLFDCIEMREGRPWTYQIDYFGLAATVHVMLFG 899 (974) T ss_pred CCEEEEECCCCCEEEEEECCCCCHHHHHHCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 40223676888677666335310038886699973245467899999999998 No 21 >PRK12871 ubiA prenyltransferase; Reviewed Probab=85.85 E-value=2.7 Score=21.54 Aligned_cols=173 Identities=14% Similarity=0.099 Sum_probs=88.1 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCHHH---------H---H--HHCCCCHHHCCCHHHHHHHHHHHH Q ss_conf 9999997089857866665578999999987521101122---------0---0--110121010011023431245788 Q gi|254780755|r 50 TLATALMLEYRGDNHVLLACAIEFIHTATLLHDDVVDDSA---------L---R--RGKVAARLVWGNQTSVLVGDFLLS 115 (322) Q Consensus 50 ~~l~~~~~~~~~~~~~~~a~~iEliH~asLiHDDI~D~~~---------~---R--Rg~~t~~~~~G~~~Ail~GD~l~~ 115 (322) ++..+...+.+-+-.+.....+=+.-.|-.|..|+-|..- + | +++|-..-+-..+.|+...=.|.+ T Consensus 29 al~~a~~g~~~~~~l~if~Lg~vlmrsAGCviNDyaDR~~D~~~ve~~v~r~~r~~k~RPLpsG~vs~keAl~lf~~L~~ 108 (297) T PRK12871 29 VLAFANYGGFSWELTIKAALIGLFGFEAGFVLNDYVDRKRDRLDVENTLTRYWRPFKERPIPSGKLSSKNAFALFILLAA 108 (297) T ss_pred HHHHHCCCCCCHHHHHHHHHHHHHHHHCCCEEHHHHHHCCCHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 99986469997679999999999997057511003431022010244210025665689799888688999999999999 Q ss_pred HHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 887642102431100012345677777754565542132210357898998643355-5544321000112222101579 Q gi|254780755|r 116 QAFCMVIETKSQEALEALSLVACTLAEGELRQLSLSKNLDVTEEDYLHVIKSKTAVL-FSAALEVSSLIAGVQNSVRQAL 194 (322) Q Consensus 116 ~a~~~l~~~~~~~~~~~~~~~~~~l~~Gq~~d~~~~~~~~~s~~~y~~ii~~KTa~L-f~~~~~~ga~lag~~~~~~~~l 194 (322) .++.++.-++.+...-.+.-....+..--......+. +++-.+.. .+.. |..+...|+...|..+-. --+ T Consensus 109 ~~~~LvL~Lp~~~~~~~~~~~l~A~~la~~YPfmKR~-------t~~Pqa~L-~g~~~f~~~p~a~aa~~G~~~~~-~~L 179 (297) T PRK12871 109 VTSALILTLPYPNSLYVFVIMLYSYGIEAFYQVKKRN-------QKYPVAQL-LGRTDFTLFPAAGYLCYGQPDMT-ALL 179 (297) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCC-------CCCCHHHH-HHHHHHHHHHHHHHHHCCCCCHH-HHH T ss_conf 9999999832530799999999999999998988536-------60329999-99998757899999981998789-999 Q ss_pred HHHHHH-CCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCHHH Q ss_conf 999973-111689887641123670003840123454213653210011 Q gi|254780755|r 195 KSYGMN-LGIAFQLVDDVLDYRGEVNEMGKNIGEDFRNGKPTLPVILAF 242 (322) Q Consensus 195 ~~fG~~-lG~afQI~DD~ld~~g~~~~~GK~~g~Dl~egk~Tlpii~al 242 (322) --.+.- --+||--+.|+.|.. +|++-|.+|.|+++-. T Consensus 180 l~~~~l~WTvAYDT~Yam~Dre-----------dDlkiGvKStAILFG~ 217 (297) T PRK12871 180 YMVFFYPWTMAHLGLNDFIDLE-----------NDRARGMKSIAVLYGM 217 (297) T ss_pred HHHHHHHHHHHHHHHHHHHCCC-----------CHHHCCCHHHHHHHCC T ss_conf 9999999999999999871421-----------5485283032212167 No 22 >PRK11092 bifunctional (p)ppGpp synthetase II/ guanosine-3',5'-bis pyrophosphate 3'-pyrophosphohydrolase; Provisional Probab=75.23 E-value=6.3 Score=19.27 Aligned_cols=31 Identities=32% Similarity=0.425 Sum_probs=17.3 Q ss_pred HHHHHHHHHHCHHHHHHHCCCCHHHCCCHHHHHH Q ss_conf 9998752110112200110121010011023431 Q gi|254780755|r 76 TATLLHDDVVDDSALRRGKVAARLVWGNQTSVLV 109 (322) Q Consensus 76 ~asLiHDDI~D~~~~RRg~~t~~~~~G~~~Ail~ 109 (322) .|.|.||-|.|-..+ ...+-..||..+|-++ T Consensus 67 ~AaLLHDvvEDt~~t---~e~i~~~FG~~Va~lV 97 (702) T PRK11092 67 MAALLHDVIEDTPAT---YQDMEQLFGKSVAELV 97 (702) T ss_pred HHHHHHCCCCCCCCC---HHHHHHHHCHHHHHHH T ss_conf 999743311578999---9999998787899998 No 23 >cd00687 Terpene_cyclase_nonplant_C1 NonPlant Terpene Cyclases, Class 1 (C1). This CD includes terpenoid cyclases such as pentalenene synthase and aristolochene synthase which, using an all-trans pathway, catalyze the ionization of farnesyl diphosphate, followed by the formation of a macrocyclic intermediate by bond formation between C1 with either C10 (aristolochene synthase) or C11 (pentalenene synthase), resulting in production of tricyclic hydrocarbon pentalenene or bicyclic hydrocarbon aristolochene. As with other enzymes with the 'terpenoid synthase fold', they have two conserved metal binding motifs, proposed to coordinate Mg2+ ion-bridged binding of the diphosphate moiety of FPP to the enzymes. Metal-triggered substrate ionization initiates catalysis, and the alpha-barrel active site serves as a template to channel and stabilize the conformations of reactive carbocation intermediates through a complex cyclization cascade. These enzymes function in the monomeric form and are foun Probab=74.41 E-value=6.6 Score=19.14 Aligned_cols=164 Identities=15% Similarity=0.137 Sum_probs=83.9 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHCCCCH-HH-----CCCHHHHHHHHHHHHHHHHH Q ss_conf 9999999997089857866665578999999987521101122001101210-10-----01102343124578888764 Q gi|254780755|r 47 PMLTLATALMLEYRGDNHVLLACAIEFIHTATLLHDDVVDDSALRRGKVAAR-LV-----WGNQTSVLVGDFLLSQAFCM 120 (322) Q Consensus 47 p~l~~l~~~~~~~~~~~~~~~a~~iEliH~asLiHDDI~D~~~~RRg~~t~~-~~-----~G~~~Ail~GD~l~~~a~~~ 120 (322) .-+..+++..+..-....+. .+..++ ...-+.||..|.....-+.+..- .. .|.......++--+..++.- T Consensus 41 ~~~~~~~a~~~P~a~~e~L~--l~~~~~-~w~F~~DD~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pl~~~l~d 117 (303) T cd00687 41 ADFGDLAALFYPDADDERLM--LAADLM-AWLFVFDDLLDRDQKSPEDGEAGVTRLLDILRGDGLDSPDDATPLEFGLAD 117 (303) T ss_pred CCHHHHHHHHCCCCCHHHHH--HHHHHH-HHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHH T ss_conf 99998898868999989999--999999-999985265456677888899999999998626788888999879999999 Q ss_pred ----HHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCC-CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHH--HH Q ss_conf ----210243110001234567777775456554213-22103578989986433555544321000112--2221--01 Q gi|254780755|r 121 ----VIETKSQEALEALSLVACTLAEGELRQLSLSKN-LDVTEEDYLHVIKSKTAVLFSAALEVSSLIAG--VQNS--VR 191 (322) Q Consensus 121 ----l~~~~~~~~~~~~~~~~~~l~~Gq~~d~~~~~~-~~~s~~~y~~ii~~KTa~Lf~~~~~~ga~lag--~~~~--~~ 191 (322) +....++.-...+.+.+.+...|+..+..++.. ...+.++|+.+=+.-.|..+..... -...| .++. .. T Consensus 118 l~~r~~~~~~~~~~~rf~~~~~~~~~a~~~E~~~r~~~~~ps~~eY~~~R~~~~g~~~~~~l~--e~~~g~~lp~~v~~~ 195 (303) T cd00687 118 LWRRTLARMSAEWFNRFAHYTEDYFDAYIWEGKNRLNGHVPDVAEYLEMRRFNIGADPCLGLS--EFIGGPEVPAAVRLD 195 (303) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCHHHHHHHH--HHHCCCCCCHHHHHC T ss_conf 999986569999999999999999999999999987489989999999976445570699999--998488899999849 Q ss_pred HHHHHHHHHCCCHHHHHHHHHCCC Q ss_conf 579999973111689887641123 Q gi|254780755|r 192 QALKSYGMNLGIAFQLVDDVLDYR 215 (322) Q Consensus 192 ~~l~~fG~~lG~afQI~DD~ld~~ 215 (322) ..++..-...+..-=+.||+..+. T Consensus 196 p~~~~l~~~~~~~v~~~NDi~S~~ 219 (303) T cd00687 196 PVMRALEALASDAIALVNDIYSYE 219 (303) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 799999999999999987898888 No 24 >cd00868 Terpene_cyclase_C1 Terpene cyclases, Class 1 (C1) of the class 1 family of isoprenoid biosynthesis enzymes, which share the 'isoprenoid synthase fold' and convert linear, all-trans, isoprenoids, geranyl (C10)-, farnesyl (C15)-, or geranylgeranyl (C20)-diphosphate into numerous cyclic forms of monoterpenes, diterpenes, and sesquiterpenes. Also included in this CD are the cis-trans terpene cyclases such as trichodiene synthase. The class I terpene cyclization reactions proceed via electrophilic alkylations in which a new carbon-carbon single bond is generated through interaction between a highly reactive electron-deficient allylic carbocation and an electron-rich carbon-carbon double bond. The catalytic site consists of a large central cavity formed by mostly antiparallel alpha helices with two aspartate-rich regions located on opposite walls. These residues mediate binding of prenyl phosphates via bridging Mg2+ ions, inducing proposed conformational changes that close the active Probab=73.75 E-value=6.9 Score=19.04 Aligned_cols=171 Identities=14% Similarity=0.055 Sum_probs=86.0 Q ss_pred HHHHHHHHHHHHHHCCCHHHHH-HHHHHHHHHHHHHHHHHHHHCHHHHHHHCCC---CHHHCCCHHHHHHHHHHH--HHH Q ss_conf 2799999999997089857866-6655789999999875211011220011012---101001102343124578--888 Q gi|254780755|r 44 RLRPMLTLATALMLEYRGDNHV-LLACAIEFIHTATLLHDDVVDDSALRRGKVA---ARLVWGNQTSVLVGDFLL--SQA 117 (322) Q Consensus 44 r~Rp~l~~l~~~~~~~~~~~~~-~~a~~iEliH~asLiHDDI~D~~~~RRg~~t---~~~~~G~~~Ail~GD~l~--~~a 117 (322) |-|+.-++..+-+.-..++... +.++ ..++ +..-+.||+-|..-+.-.... +-.+|..+..--.-+++- ..+ T Consensus 26 R~r~~e~y~~~~~~~~ep~~s~~R~~~-aK~~-~l~~~iDD~yD~~gt~~El~~~t~ai~rwd~~~~~~lp~~mk~~~~~ 103 (284) T cd00868 26 RDRLVECYFWAAGSYFEPQYSEARIAL-AKTI-ALLTVIDDTYDDYGTLEELELFTEAVERWDISAIDELPEYMKPVFKA 103 (284) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHH-HHHH-HHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHCCCCCHHHHHHHHH T ss_conf 688999999999865580149999999-9999-99999988740478899999999999827940001086599999999 Q ss_pred -HHH-------HHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCC-CCCCHHHHHHHHHHHHHHH-HHHHHHHHH--HHHH Q ss_conf -764-------210243110001234567777775456554213-2210357898998643355-554432100--0112 Q gi|254780755|r 118 -FCM-------VIETKSQEALEALSLVACTLAEGELRQLSLSKN-LDVTEEDYLHVIKSKTAVL-FSAALEVSS--LIAG 185 (322) Q Consensus 118 -~~~-------l~~~~~~~~~~~~~~~~~~l~~Gq~~d~~~~~~-~~~s~~~y~~ii~~KTa~L-f~~~~~~ga--~lag 185 (322) +.. +...........+.+...+++.+-+.+..|... .-.|.++|+++...-+|.- ..+.+..|. .... T Consensus 104 l~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~l~Ea~w~~~~~~Ps~~EYl~~~~~s~~~~~~~~~~~~~~g~~~~~ 183 (284) T cd00868 104 LYDLVNEIEEELAKEGGSESLPYLKEAWKDLLRAYLVEAKWANEGYVPSFEEYLENRRVSIGYPPLLALSFLGMGDILPE 183 (284) T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCH T ss_conf 99999999999998359068999999999999999999999964989988999875042302899999999957888769 Q ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHCCCC Q ss_conf 2221015799999731116898876411236 Q gi|254780755|r 186 VQNSVRQALKSYGMNLGIAFQLVDDVLDYRG 216 (322) Q Consensus 186 ~~~~~~~~l~~fG~~lG~afQI~DD~ld~~g 216 (322) ..-+......++-...++.+-+.||+-.+.- T Consensus 184 ~~~~~~~~~~~l~~~~~~i~rL~NDi~s~~~ 214 (284) T cd00868 184 EAFEWLPSYPKLVRASSTIGRLLNDIASYEK 214 (284) T ss_pred HHHHHHHCCHHHHHHHHHHHHHHHCHHHHHC T ss_conf 9997520264999999989999600045424 No 25 >cd02432 Nodulin-21_like_1 Nodulin-21 and CCC1-related protein family. Nodulin-21_like_1: This is a family of proteins closely related to nodulin-21, a plant nodule-specific protein that may be involved in symbiotic nitrogen fixation. This family is also related to CCC1, a yeast vacuole transmembrane protein that functions as an iron and manganese transporter. Probab=66.23 E-value=10 Score=18.04 Aligned_cols=67 Identities=16% Similarity=0.234 Sum_probs=36.0 Q ss_pred HHCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCC---CCCHHHHHHHHHHHHHH Q ss_conf 101431100013432036899989999999999974969999999999999999984208---99989999999999996 Q gi|254780755|r 242 FQRGTKREKNFWKSTINDGKISVENLKKAFIIMRENRALVDTEYRAYSYGQKAKDSLRCL---PDSSWKKSLMEVVDFCL 318 (322) Q Consensus 242 l~~~~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~A~~~L~~l---p~~~~k~~L~~L~~~ii 318 (322) +++.++.+++.+.+++.++-.++++.+++.+.+.+++.++. ...+++.-- +.+|.+..+..++.|++ T Consensus 80 i~~~pe~E~~El~~iy~~~G~~~~~a~~ia~~l~~~~~l~~----------~~~~Elg~~~~~~~~P~~aal~sf~sF~~ 149 (218) T cd02432 80 LAEDPEAELEELADIYEERGLSPELARQVADELMAKDALEA----------HARDELGITEGEYANPWQAALASAISFSV 149 (218) T ss_pred HHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHH----------HHHHHCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 75191779999999998679998899999999973775877----------66774299987657889999999999999 No 26 >PRK12883 ubiA prenyltransferase UbiA-like protein; Reviewed Probab=63.65 E-value=11 Score=17.74 Aligned_cols=159 Identities=14% Similarity=0.134 Sum_probs=73.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHCC---CCHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHH Q ss_conf 7866665578999999987521101122001101---2101001102343124578888764210243110001234567 Q gi|254780755|r 62 DNHVLLACAIEFIHTATLLHDDVVDDSALRRGKV---AARLVWGNQTSVLVGDFLLSQAFCMVIETKSQEALEALSLVAC 138 (322) Q Consensus 62 ~~~~~~a~~iEliH~asLiHDDI~D~~~~RRg~~---t~~~~~G~~~Ail~GD~l~~~a~~~l~~~~~~~~~~~~~~~~~ 138 (322) ....-...++=+.-.++-+..|+.|-.-=|-++| -...+-..+.|...+-.++..++-+..-.+ +... .++ .. T Consensus 37 ~~~~l~~l~~~~~~aag~~iND~~D~eiD~iN~p~RPipsG~IS~~~A~~~~~~l~~~g~~l~~~ln-~~~~-~l~--~~ 112 (275) T PRK12883 37 KTLILIFLVVYLGCSGGNTINDYFDYEIDKINRPNRPLPRGAMSRKAALYYSVLQYAIGLALAYFLN-IWAF-LFA--LI 112 (275) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH-HHHH-HHH--HH T ss_conf 9999999999999999999985888840545589998877766899999999999999999999973-9999-999--99 Q ss_pred HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCC Q ss_conf 77777545655421322103578989986433555544321000112222101579999973111689887641123670 Q gi|254780755|r 139 TLAEGELRQLSLSKNLDVTEEDYLHVIKSKTAVLFSAALEVSSLIAGVQNSVRQALKSYGMNLGIAFQLVDDVLDYRGEV 218 (322) Q Consensus 139 ~l~~Gq~~d~~~~~~~~~s~~~y~~ii~~KTa~Lf~~~~~~ga~lag~~~~~~~~l~~fG~~lG~afQI~DD~ld~~g~~ 218 (322) ...-.-......++ .+.-- .-++ |.+.+.+...|+...|.. ...-.+.-++.-..+++-|.-|+.|+.| T Consensus 113 ~~~l~~~Ys~~lK~---~~l~g-Ni~v----a~~~g~~~l~g~~a~g~~-~~~~~l~~~afl~~l~reivkdieD~eg-- 181 (275) T PRK12883 113 AYVLTFIYAWKLKP---LPFIG-NVVV----ALLTAATPIYGAVGVGRI-GLAGYLAICAFLVNVSREIMKDIEDIEG-- 181 (275) T ss_pred HHHHHHHHHHHHCC---CCCHH-HHHH----HHHHHHHHHHHHHHHCCC-CHHHHHHHHHHHHHHHHHHHHHHHHHHH-- T ss_conf 99999998998665---88559-9999----999999999999995754-0999999999999999999999998610-- Q ss_pred CCCCCHHHHHHHHCCCCCCCCHHHHC Q ss_conf 00384012345421365321001101 Q gi|254780755|r 219 NEMGKNIGEDFRNGKPTLPVILAFQR 244 (322) Q Consensus 219 ~~~GK~~g~Dl~egk~Tlpii~al~~ 244 (322) |.++|.+|+|+.+-.++ T Consensus 182 ---------D~~~G~~TlPi~~G~~~ 198 (275) T PRK12883 182 ---------DMAKGAKTLPIIIGKKR 198 (275) T ss_pred ---------HHHCCCCCCCHHHHHHH T ss_conf ---------98769957378946797 No 27 >PHA02092 hypothetical protein Probab=61.19 E-value=9.4 Score=18.20 Aligned_cols=88 Identities=19% Similarity=0.339 Sum_probs=50.2 Q ss_pred CCCCCCCCHHHHCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHH-HCCCCCHH-HHH Q ss_conf 1365321001101431100013432036899989999999999974969999999999999999984-20899989-999 Q gi|254780755|r 232 GKPTLPVILAFQRGTKREKNFWKSTINDGKISVENLKKAFIIMRENRALVDTEYRAYSYGQKAKDSL-RCLPDSSW-KKS 309 (322) Q Consensus 232 gk~Tlpii~al~~~~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~A~~~L-~~lp~~~~-k~~ 309 (322) |.+.+-++++-+.+..-+++...-+..-+.+.... +.-...+.++++...++..+.+|.++.+..+ .+||.+.. ... T Consensus 2 ~~pn~rlll~eekshgvekely~l~~~mn~igann-qdt~gklfktkay~~~qt~~aey~ekirkii~~nfp~s~~y~~e 80 (108) T PHA02092 2 CRPNYRLLLAEEKSHGVEKELYELYTSMNAIGANN-QDTFGKLFKTKAYKLAQTEIAEYNEKIRKVILKNFPTSTRYQSE 80 (108) T ss_pred CCCCCEEEEEEHHCCCHHHHHHHHHHHHHHHCCCC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 98763123430120167799999999985514667-20689988888765888899999999999999629722578998 Q ss_pred ---HHHHHHHHHCC Q ss_conf ---99999999603 Q gi|254780755|r 310 ---LMEVVDFCLQR 320 (322) Q Consensus 310 ---L~~L~~~ii~R 320 (322) |+.++|-.++| T Consensus 81 liglrgivdvsmkr 94 (108) T PHA02092 81 LIGLRGIVDVSMKR 94 (108) T ss_pred HHCHHHEEEEHHHC T ss_conf 85511114213306 No 28 >cd02434 Nodulin-21_like_3 Nodulin-21 and CCC1-related protein family. Nodulin-21_like_3: This is a family of proteins closely related to nodulin-21, a plant nodule-specific protein that may be involved in symbiotic nitrogen fixation. This family is also related to CCC1, a yeast vacuole transmembrane protein that functions as an iron and manganese transporter. Probab=60.59 E-value=13 Score=17.40 Aligned_cols=34 Identities=12% Similarity=0.026 Sum_probs=13.8 Q ss_pred HCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHH Q ss_conf 0143110001343203689998999999999997 Q gi|254780755|r 243 QRGTKREKNFWKSTINDGKISVENLKKAFIIMRE 276 (322) Q Consensus 243 ~~~~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~ 276 (322) ++.++.+++.+.+++.++-.++++.+++.+.+.+ T Consensus 76 ~~~pe~E~~El~~iy~~~G~~~~~A~~v~~~l~~ 109 (225) T cd02434 76 ENYPEGEKSEMVEIYSLKGLSEEVADQVVELLSK 109 (225) T ss_pred HHCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 8594989999999998779998999999999983 No 29 >PRK10872 relA GDP/GTP pyrophosphokinase; Provisional Probab=57.25 E-value=15 Score=17.05 Aligned_cols=31 Identities=19% Similarity=0.094 Sum_probs=22.0 Q ss_pred HHHHHHHHHHCHHHHHHHCCCCHHHCCCHHHHHH Q ss_conf 9998752110112200110121010011023431 Q gi|254780755|r 76 TATLLHDDVVDDSALRRGKVAARLVWGNQTSVLV 109 (322) Q Consensus 76 ~asLiHDDI~D~~~~RRg~~t~~~~~G~~~Ail~ 109 (322) .|.|.||-|.|...+ ...+...||..+|-++ T Consensus 77 ~AalLhd~vedt~~t---~~~i~~~FG~~va~lV 107 (743) T PRK10872 77 RAALLFPLADANVVS---EDVLRESVGKSIVNLI 107 (743) T ss_pred HHHHCCCHHHCCCCC---HHHHHHHHCHHHHHHH T ss_conf 999650125247989---9999988687899985 No 30 >TIGR00691 spoT_relA RelA/SpoT family protein; InterPro: IPR004811 The functions of Escherichia coli RelA and SpoT differ somewhat. RelA (2.7.6.5 from EC) produces pppGpp (or ppGpp) from ATP and GTP (or GDP). SpoT (3.1.7.2 from EC) degrades ppGpp, but may also act as a secondary ppGpp synthetase. The two proteins are strongly similar. In many species, a single homolog to SpoT and RelA appears responsible for both ppGpp synthesis and ppGpp degradation. (p)ppGpp is a regulatory metabolite of the stringent response, but appears also to be involved in antibiotic biosynthesis in some species. ; GO: 0015969 guanosine tetraphosphate metabolic process. Probab=55.99 E-value=11 Score=17.72 Aligned_cols=51 Identities=22% Similarity=0.298 Sum_probs=37.5 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHCCCCHHHCCCHHHHHH Q ss_conf 79999999999708985786666557899999998752110112200110121010011023431 Q gi|254780755|r 45 LRPMLTLATALMLEYRGDNHVLLACAIEFIHTATLLHDDVVDDSALRRGKVAARLVWGNQTSVLV 109 (322) Q Consensus 45 ~Rp~l~~l~~~~~~~~~~~~~~~a~~iEliH~asLiHDDI~D~~~~RRg~~t~~~~~G~~~Ail~ 109 (322) +.|+.|-+.=--+|+|...+ .|.|.||=|+|=+.+ ...+-..||..+|=|+ T Consensus 22 ~HP~~VA~~La~~~~D~~t~-----------~AaLLHDv~EDT~~t---~e~i~~~FG~eva~LV 72 (741) T TIGR00691 22 IHPLAVALILAELGMDEETV-----------CAALLHDVIEDTDVT---EEEIEEEFGEEVAELV 72 (741) T ss_pred HHHHHHHHHHHHCCCCHHHH-----------HHHHCCCEEECCCCC---HHHHHHHCCHHHHHHH T ss_conf 36789988987708998999-----------997407844138656---8999975070256453 No 31 >cd02433 Nodulin-21_like_2 Nodulin-21 and CCC1-related protein family. Nodulin-21_like_2: This is a family of proteins closely related to nodulin-21, a plant nodule-specific protein that may be involved in symbiotic nitrogen fixation. This family is also related to CCC1, a yeast vacuole transmembrane protein that functions as an iron and manganese transporter. Probab=53.96 E-value=16 Score=16.72 Aligned_cols=34 Identities=9% Similarity=0.005 Sum_probs=16.9 Q ss_pred HCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHH Q ss_conf 0143110001343203689998999999999997 Q gi|254780755|r 243 QRGTKREKNFWKSTINDGKISVENLKKAFIIMRE 276 (322) Q Consensus 243 ~~~~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~ 276 (322) ++.++.+++.+.+++.+...++++.+++.+.+.+ T Consensus 93 ~~~p~~E~~El~~iy~~kG~~~~~A~~va~~l~~ 126 (234) T cd02433 93 RKHPLEEAAELALIYRAKGLDEEEAKRVASQLMN 126 (234) T ss_pred HHCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 5095889999999998769998999999999985 No 32 >PRK13593 consensus Probab=51.93 E-value=18 Score=16.52 Aligned_cols=159 Identities=15% Similarity=0.078 Sum_probs=70.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCHHHHHHH---CCCCHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHH Q ss_conf 8666655789999999875211011220011---0121010011023431245788887642102431100012345677 Q gi|254780755|r 63 NHVLLACAIEFIHTATLLHDDVVDDSALRRG---KVAARLVWGNQTSVLVGDFLLSQAFCMVIETKSQEALEALSLVACT 139 (322) Q Consensus 63 ~~~~~a~~iEliH~asLiHDDI~D~~~~RRg---~~t~~~~~G~~~Ail~GD~l~~~a~~~l~~~~~~~~~~~~~~~~~~ 139 (322) ...-...++=+.-.++-+..|+.|-+-=|-+ +|-...+-..+.|...+-.++..++-+..-.+... -.++ ... T Consensus 43 ~~~~~~~~~~l~~aagn~iND~~D~eID~iN~P~RPIPsG~IS~~~Al~~~i~l~~~~~~l~~~l~~~~--~~i~--~~~ 118 (283) T PRK13593 43 HVTGAVAATILATAAGNAINDYFDRAIDKINRPDRPIPRGAVSPRGALVYSVFLFVAAVVLTSVLPLLA--IVIA--AMN 118 (283) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHH--HHH T ss_conf 999999999999999999974876514310389998888863999999999999999999999988999--9999--999 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHCCCHHHHHHHHHCCCCCC Q ss_conf 777754565542132210357898998643355554432100011222-2101579999973111689887641123670 Q gi|254780755|r 140 LAEGELRQLSLSKNLDVTEEDYLHVIKSKTAVLFSAALEVSSLIAGVQ-NSVRQALKSYGMNLGIAFQLVDDVLDYRGEV 218 (322) Q Consensus 140 l~~Gq~~d~~~~~~~~~s~~~y~~ii~~KTa~Lf~~~~~~ga~lag~~-~~~~~~l~~fG~~lG~afQI~DD~ld~~g~~ 218 (322) +.-.-.....+++.. ... .-++. .+.+.+...|+...+.+ ....-.+.-+..-...+.-+..|+.|+.| T Consensus 119 ~~~~~~Y~~~lK~~~--~~g--Ni~v~----~~~g~~~l~G~~av~~~~~~~~~~l~~~~fl~t~~reiikdi~DieG-- 188 (283) T PRK13593 119 LLALVAYTELFKGLP--GVG--NALVA----YLTGSTFLFGAAAVGRILAFGVVVLFALAALATAAREIIKDVEDLDG-- 188 (283) T ss_pred HHHHHHHHHHHHCCC--CCC--HHHHH----HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-- T ss_conf 999999999995487--500--19999----99999999999997588873899999999999999999999986421-- Q ss_pred CCCCCHHHHHHHHCCCCCCCCHHHHC Q ss_conf 00384012345421365321001101 Q gi|254780755|r 219 NEMGKNIGEDFRNGKPTLPVILAFQR 244 (322) Q Consensus 219 ~~~GK~~g~Dl~egk~Tlpii~al~~ 244 (322) |-++|.+|+|+.+-.++ T Consensus 189 ---------D~~~G~~TLPv~~G~~~ 205 (283) T PRK13593 189 ---------DRAEGLRTLPIVIGVTR 205 (283) T ss_pred ---------HHHCCCCCCCHHHCHHH T ss_conf ---------98769975881626899 No 33 >pfam00598 Flu_M1 Influenza Matrix protein (M1). This protein forms a continuous shell on the inner side of the lipid bilayer, but its function is unclear. Probab=49.13 E-value=20 Score=16.25 Aligned_cols=78 Identities=14% Similarity=0.128 Sum_probs=42.7 Q ss_pred CCHHHHHHHHCCCCCCCCHHHHCCCCCCCHHHHHHCCCCCCCHHHHHHH---HHHHHHCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 8401234542136532100110143110001343203689998999999---9999974969999999999999999984 Q gi|254780755|r 222 GKNIGEDFRNGKPTLPVILAFQRGTKREKNFWKSTINDGKISVENLKKA---FIIMRENRALVDTEYRAYSYGQKAKDSL 298 (322) Q Consensus 222 GK~~g~Dl~egk~Tlpii~al~~~~~~~~~~l~~~~~~~~~~~~~~~~i---~~ll~~~gai~~~~~~~~~~~~~A~~~L 298 (322) .||+.+||..|..-+.+.+..-.--..+++++...--+++-++...... ...+++.-.+..+++.+..|..-|.... T Consensus 47 tr~~Ls~ltK~ilG~~~~~~~Pserg~qrRRFv~naL~Gngdpnnm~~aVkl~rKlkReitFH~AkEvalsystgALasC 126 (156) T pfam00598 47 TRPILSPLTKGILGFVFTLTVPSERGLQRRRFVQNALNGNGDPNNMDGAVKLYRKLKREITFHGAKEIALSYSAGALASC 126 (156) T ss_pred CCCCCCHHHHHHHHEEEEEECCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCHHHHHHH T ss_conf 58664567865541478984430666778787743255899923577889999877545061017898753021489999 Q ss_pred H Q ss_conf 2 Q gi|254780755|r 299 R 299 (322) Q Consensus 299 ~ 299 (322) . T Consensus 127 m 127 (156) T pfam00598 127 M 127 (156) T ss_pred H T ss_conf 9 No 34 >TIGR02562 cas3_yersinia CRISPR-associated helicase Cas3, Yersinia-type; InterPro: IPR013395 The helicase encoded in many CRISPR-associated (cas) gene clusters is designated Cas3. The Yersinia-type helicase Cas3 proteins differ from the more common Cas3 proteins by being considerably larger, though they still share a number of motifs, and replace Cas3 in some CRISPR/cas loci in a number of Proteobacteria, including Yersinia pestis, Chromobacterium violaceum, Erwinia carotovora subsp. atroseptica SCRI1043, Photorhabdus luminescens subsp. laumondii TTO1, Legionella pneumophila, etc.. Probab=41.76 E-value=25 Score=15.55 Aligned_cols=53 Identities=23% Similarity=0.365 Sum_probs=40.9 Q ss_pred CHHHHHHHHHHH-----HCCCCHHHHHHHHHHHHHH--CCCHHHHHHHHHHHHHHHHHHH Q ss_conf 758999999999-----6798227999999999970--8985786666557899999998 Q gi|254780755|r 27 VEMIPDVVKYLI-----FSGGKRLRPMLTLATALML--EYRGDNHVLLACAIEFIHTATL 79 (322) Q Consensus 27 ~~~l~~~~~~~~-----~~gGKr~Rp~l~~l~~~~~--~~~~~~~~~~a~~iEliH~asL 79 (322) .|.|.-.+-|++ .+|||.+-|+|=+++|.+. +.|.-..-.+-|.+=|+|.|.| T Consensus 575 aP~lvcTiDhLi~A~Es~rgG~~i~p~LRllsSDLiLDE~Ddy~~~dLpAl~RLV~lAgL 634 (1153) T TIGR02562 575 APVLVCTIDHLIPATESVRGGHHIAPMLRLLSSDLILDEPDDYEREDLPALLRLVQLAGL 634 (1153) T ss_pred CCHHHHHHHHCCCCCCCCCCCEEHHHHHHHHHHHHEECCCCCCCHHHHHHHHHHHHHHHH T ss_conf 512665543200467754456107889987532320048888871234899999999998 No 35 >PRK02936 argD acetylornithine aminotransferase; Provisional Probab=41.57 E-value=26 Score=15.53 Aligned_cols=135 Identities=15% Similarity=0.242 Sum_probs=64.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH---------------CCCCCCCCCCCH Q ss_conf 78989986433555544321000112222101579999973111689887641---------------123670003840 Q gi|254780755|r 160 DYLHVIKSKTAVLFSAALEVSSLIAGVQNSVRQALKSYGMNLGIAFQLVDDVL---------------DYRGEVNEMGKN 224 (322) Q Consensus 160 ~y~~ii~~KTa~Lf~~~~~~ga~lag~~~~~~~~l~~fG~~lG~afQI~DD~l---------------d~~g~~~~~GK~ 224 (322) .+.++...+.|+.+.=|.+..+=.--.++++.+.++++...-|+.+ |.|++. ++..|--.+||. T Consensus 159 ~l~~~~~~~iAavi~EPiqg~gG~~~~~~~yl~~lr~lc~~~gill-I~DEV~tGfGRtG~~~a~e~~gv~PDii~~gK~ 237 (377) T PRK02936 159 ALEEVMNEEVAAVMLEVVQGEGGVIPADLSFLQEVQTLCKKFGALL-IIDEVQTGIGRTGTLFAYEQFGLEPDIVTVAKG 237 (377) T ss_pred HHHHCCCCCEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCE-ECCCCCCCCCCCCCCCHHHHCCCCCCEEEEEHH T ss_conf 9985168967899997876788976899999999999999829845-712211277655672036444988767886201 Q ss_pred HHHHHHHCCCCCCCCHHHHC--C-CCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 12345421365321001101--4-31100013432036899989999999999974969999999999999999984208 Q gi|254780755|r 225 IGEDFRNGKPTLPVILAFQR--G-TKREKNFWKSTINDGKISVENLKKAFIIMRENRALVDTEYRAYSYGQKAKDSLRCL 301 (322) Q Consensus 225 ~g~Dl~egk~Tlpii~al~~--~-~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~A~~~L~~l 301 (322) .+ .| +|+--.+-+ . +.-.....-+.+..++..-..-...++.+++.+.++.++++-....+...+..... T Consensus 238 l~----~G---~P~sav~~~~~i~~~~~~~~~~~T~~gnpl~~aaa~a~L~~i~~~~l~~~~~~~g~~l~~~L~~~~~~~ 310 (377) T PRK02936 238 LG----NG---IPVGAMIGKKELGSAFTAGSHGSTFGGNYLAMAAAKEVLQVIKQPSFLEEVQEKGEYFLQKLQEELQHV 310 (377) T ss_pred HC----CC---CCCCCEECCHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 02----78---987333032877630378998888985999999999999998511199999999999999999984369 Q ss_pred C Q ss_conf 9 Q gi|254780755|r 302 P 302 (322) Q Consensus 302 p 302 (322) | T Consensus 311 ~ 311 (377) T PRK02936 311 E 311 (377) T ss_pred C T ss_conf 9 No 36 >pfam03936 Terpene_synth_C Terpene synthase family, metal binding domain. It has been suggested that this gene family be designated tps (for terpene synthase). It has been split into six subgroups on the basis of phylogeny, called tpsa-tpsf. tpsa includes vetispiridiene synthase, 5-epi- aristolochene synthase, and (+)-delta-cadinene synthase. tpsb includes (-)-limonene synthase. tpsc includes kaurene synthase A. tpsd includes taxadiene synthase, pinene synthase, and myrcene synthase. tpse includes kaurene synthase B. tpsf includes linalool synthase. Probab=39.74 E-value=27 Score=15.35 Aligned_cols=167 Identities=14% Similarity=0.047 Sum_probs=85.8 Q ss_pred HHHHHHHHHHHHHHCCCHH--HHHHHHHHHHHHHHHHHHHHHHHCHHHHH---HHCCCCHHHCCCHHHHHHHHHHHH--H Q ss_conf 2799999999997089857--86666557899999998752110112200---110121010011023431245788--8 Q gi|254780755|r 44 RLRPMLTLATALMLEYRGD--NHVLLACAIEFIHTATLLHDDVVDDSALR---RGKVAARLVWGNQTSVLVGDFLLS--Q 116 (322) Q Consensus 44 r~Rp~l~~l~~~~~~~~~~--~~~~~a~~iEliH~asLiHDDI~D~~~~R---Rg~~t~~~~~G~~~Ail~GD~l~~--~ 116 (322) |=|+.-++..+-+....|+ ....++ .-++ ...-+.||+-|.-.+. +.-.-+..+|..+..-...||+-. . T Consensus 40 Rdr~~e~yfw~~~~~feP~~s~~R~~~--aK~~-~l~t~iDD~~D~~gt~eEl~~ft~av~rWd~~~~~~lp~~~k~~~~ 116 (270) T pfam03936 40 RDRLVECYFWAAASYFEPQYSDARIAA--AKTA-ALLTVIDDTYDRYGTLEELELFTDAVERWDISAIDELPDYMKIVFK 116 (270) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHH--HHHH-HHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHCCHHHHHHHH T ss_conf 899899999999866482248899999--9999-9999743111046889999999999981693200008788999999 Q ss_pred H-HHHHHC--------CCCCCCCHHHHHHHHHHHHHHHHHHHHHCCC-CCCHHHHHHHHHHHHHHH-HHHHHHHHHHHHH Q ss_conf 8-764210--------2431100012345677777754565542132-210357898998643355-5544321000112 Q gi|254780755|r 117 A-FCMVIE--------TKSQEALEALSLVACTLAEGELRQLSLSKNL-DVTEEDYLHVIKSKTAVL-FSAALEVSSLIAG 185 (322) Q Consensus 117 a-~~~l~~--------~~~~~~~~~~~~~~~~l~~Gq~~d~~~~~~~-~~s~~~y~~ii~~KTa~L-f~~~~~~ga~lag 185 (322) + +....+ .+...+..-+.+...+++.+=+.+..|.... -.|.++|+++..--+|.- +.+.+.+|. ++ T Consensus 117 ~l~~~~~e~~~~~~~~~g~~~~~~~l~~~w~~l~~~~l~Ea~W~~~~~vPt~~EYl~~~~~s~~~~~~~~~~~~~m--~~ 194 (270) T pfam03936 117 ALLDTVEEIEKELSKEQGRSYIVPYLKEAWKDLLEAYLEEAKWAKGGHVPSFEEYLENGRVSIGAPLLLLLSFLGM--GD 194 (270) T ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHC--CC T ss_conf 9999999999999997494218999999999999999999999955979989999872067632599999999927--88 Q ss_pred HH-HHH---HHHHHHHHHHCCCHHHHHHHHHCCC Q ss_conf 22-210---1579999973111689887641123 Q gi|254780755|r 186 VQ-NSV---RQALKSYGMNLGIAFQLVDDVLDYR 215 (322) Q Consensus 186 ~~-~~~---~~~l~~fG~~lG~afQI~DD~ld~~ 215 (322) .. .+. ...-..+-...++...+.||+-.+. T Consensus 195 ~~~~e~~~~~~~~p~~~~~~~~i~RL~nDi~s~~ 228 (270) T pfam03936 195 SSGEEAFEWLKSRPKLVQLSSLILRLLNDIASYE 228 (270) T ss_pred CCCHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHH T ss_conf 8789999988635289999999999702415778 No 37 >pfam10776 DUF2600 Protein of unknown function (DUF2600). This is a bacterial family of proteins. Some members in the family are annotated as YtpB however currently no function is known. Probab=39.24 E-value=28 Score=15.31 Aligned_cols=30 Identities=20% Similarity=0.163 Sum_probs=14.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHCHHHH Q ss_conf 898578666655789999999875211011220 Q gi|254780755|r 58 EYRGDNHVLLACAIEFIHTATLLHDDVVDDSAL 90 (322) Q Consensus 58 ~~~~~~~~~~a~~iEliH~asLiHDDI~D~~~~ 90 (322) |...+..+++-.+.+.| |--.|-++|.+.. T Consensus 48 ~~~~~~~i~fIVAyQtI---sDYLDnLCDrs~~ 77 (330) T pfam10776 48 GENVDECIRFIVAYQTI---SDYLDNLCDRSTS 77 (330) T ss_pred CCHHHHHHHHHHHHHHH---HHHHHHHHHCCCC T ss_conf 51045477899989999---9998766525577 No 38 >pfam06783 UPF0239 Uncharacterized protein family (UPF0239). Probab=38.83 E-value=21 Score=16.12 Aligned_cols=24 Identities=29% Similarity=0.456 Sum_probs=19.2 Q ss_pred HHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 222101579999973111689887 Q gi|254780755|r 186 VQNSVRQALKSYGMNLGIAFQLVD 209 (322) Q Consensus 186 ~~~~~~~~l~~fG~~lG~afQI~D 209 (322) .++...+.+-+||..+|-.||+.= T Consensus 13 geED~~~~~~rygl~~gaifq~~c 36 (85) T pfam06783 13 GDEDFFELLIRYGLFLGAIFQFIC 36 (85) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 178999999999999999999999 No 39 >pfam06603 DUF1141 Protein of unknown function (DUF1141). This family consists of several hypothetical proteins of unknown function and seems to be specific to Bacteroides species. Probab=37.58 E-value=30 Score=15.15 Aligned_cols=68 Identities=16% Similarity=0.105 Sum_probs=43.7 Q ss_pred CCCCCHHHHHHCCCCCCCHHH-HHHHHHHHHHCCHHHHH----HHHHHHHHHHHHHHHHCCCCCHHHHHHHHH Q ss_conf 311000134320368999899-99999999974969999----999999999999984208999899999999 Q gi|254780755|r 246 TKREKNFWKSTINDGKISVEN-LKKAFIIMRENRALVDT----EYRAYSYGQKAKDSLRCLPDSSWKKSLMEV 313 (322) Q Consensus 246 ~~~~~~~l~~~~~~~~~~~~~-~~~i~~ll~~~gai~~~----~~~~~~~~~~A~~~L~~lp~~~~k~~L~~L 313 (322) +.+-.+....++..+-.++++ -.-.+.++..+++.-|. ++.++...+++++.|+.+|+|..|-.|... T Consensus 31 N~eV~~~~~~Ly~~~g~t~EeEA~lCLaLL~gYnatiydng~ke~~~Q~vLdRs~~vL~~Lp~SLLK~qLlt~ 103 (106) T pfam06603 31 NKEVLELSNDLYSQKGETPEEEANLCLALLMGYNATIYNNGEKEEKKQVVLDRSWDVLDKLPASLLKCQLLTY 103 (106) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 4999999999987049998999999999997433046407428999999999999998647718999999988 No 40 >KOG2802 consensus Probab=35.29 E-value=32 Score=14.92 Aligned_cols=107 Identities=21% Similarity=0.226 Sum_probs=61.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHCC Q ss_conf 4335555443210001122221015799--99973111689887641123670003840123454213653210011014 Q gi|254780755|r 168 KTAVLFSAALEVSSLIAGVQNSVRQALK--SYGMNLGIAFQLVDDVLDYRGEVNEMGKNIGEDFRNGKPTLPVILAFQRG 245 (322) Q Consensus 168 KTa~Lf~~~~~~ga~lag~~~~~~~~l~--~fG~~lG~afQI~DD~ld~~g~~~~~GK~~g~Dl~egk~Tlpii~al~~~ 245 (322) -||..++++|.......|.+- -+.+. -.|.-||+. .||+-|+-+ .||-.|+.-+-. .. T Consensus 370 VtGv~IAaa~m~lss~tgnPI--yD~~GSivvGaLLGmV---e~diyDvK~----------~diG~g~vRfKA-----E~ 429 (503) T KOG2802 370 VTGVIIAAACMGLSSITGNPI--YDSLGSIVVGALLGMV---ENDIYDVKA----------TDIGLGKVRFKA-----EV 429 (503) T ss_pred HHHHHHHHHHHHHHHHCCCCC--CCCCCHHHHHHHHHHH---HHHHHHCCC----------EEECCCEEEEEE-----EE T ss_conf 888999999888887218978--6542037898888777---776541322----------340464168887-----85 Q ss_pred CCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 311000134320368999899999999999749699999999999999999842 Q gi|254780755|r 246 TKREKNFWKSTINDGKISVENLKKAFIIMRENRALVDTEYRAYSYGQKAKDSLR 299 (322) Q Consensus 246 ~~~~~~~l~~~~~~~~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~A~~~L~ 299 (322) +-..+...++.+. .+++.+..+.+.+.+-.+.....+.+|-+...+.|. T Consensus 430 DFdGr~vtrsYL~-----kqd~akml~e~~~~~~~e~le~fml~~GEnivd~lG 478 (503) T KOG2802 430 DFDGRVVTRSYLE-----KQDFAKMLQEIQEVKTPEELETFMLKHGENIVDTLG 478 (503) T ss_pred CCCCHHHHHHHHH-----HHHHHHHHHHHHHCCCHHHHHHHHHHCCHHHHHHHH T ss_conf 2672216897872-----889999999998209989999999973448998763 No 41 >PRK12872 ubiA prenyltransferase; Reviewed Probab=34.48 E-value=28 Score=15.28 Aligned_cols=40 Identities=28% Similarity=0.424 Sum_probs=29.8 Q ss_pred HHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCHHHH Q ss_conf 799999731116898876411236700038401234542136532100110 Q gi|254780755|r 193 ALKSYGMNLGIAFQLVDDVLDYRGEVNEMGKNIGEDFRNGKPTLPVILAFQ 243 (322) Q Consensus 193 ~l~~fG~~lG~afQI~DD~ld~~g~~~~~GK~~g~Dl~egk~Tlpii~al~ 243 (322) ...-|..-+...-.+.-|+.|+.| |..+|..|+|+++-.+ T Consensus 174 ~~a~F~fl~~lireivkDi~di~G-----------D~~~g~~TlPi~~G~~ 213 (297) T PRK12872 174 LVAAFAFLKNFVNTIVFDIKDIEG-----------DRNSGLKTLPIILGKR 213 (297) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCC-----------HHHCCCCEECHHHHHH T ss_conf 999999999999999715751315-----------4866983148352358 No 42 >pfam06304 DUF1048 Protein of unknown function (DUF1048). This family consists of several hypothetical bacterial proteins of unknown function. Probab=33.23 E-value=26 Score=15.53 Aligned_cols=34 Identities=26% Similarity=0.462 Sum_probs=22.3 Q ss_pred HHHHHHHHHCCC-----HHHHHHHHHCCCCCCCCCCCHH Q ss_conf 579999973111-----6898876411236700038401 Q gi|254780755|r 192 QALKSYGMNLGI-----AFQLVDDVLDYRGEVNEMGKNI 225 (322) Q Consensus 192 ~~l~~fG~~lG~-----afQI~DD~ld~~g~~~~~GK~~ 225 (322) +.+.+|-.+.|. .++|.+|++|+|-....-||++ T Consensus 18 ~~iqkYl~~~g~~~g~~~~~il~~lldLfEe~AadG~~V 56 (92) T pfam06304 18 KEIQKYLWNFGATDGADMKPILTDILDLFEEAAADGKSV 56 (92) T ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCH T ss_conf 999997542289866888999999999999988669972 No 43 >KOG1345 consensus Probab=28.58 E-value=26 Score=15.52 Aligned_cols=20 Identities=45% Similarity=0.775 Sum_probs=18.1 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 66655789999999875211 Q gi|254780755|r 65 VLLACAIEFIHTATLLHDDV 84 (322) Q Consensus 65 ~~~a~~iEliH~asLiHDDI 84 (322) -.+++|+|.+|.-.|||-|+ T Consensus 128 ~ql~SAi~fMHsknlVHRdl 147 (378) T KOG1345 128 AQLLSAIEFMHSKNLVHRDL 147 (378) T ss_pred HHHHHHHHHHHCCCHHHCCC T ss_conf 99998987763001220212 No 44 >TIGR01957 nuoB_fam NADH-quinone oxidoreductase, B subunit; InterPro: IPR006138 Respiratory-chain NADH dehydrogenase (1.6.5.3 from EC) (also known as complex I or NADH-ubiquinone oxidoreductase) is an oligomeric enzymatic complex located in the inner mitochondrial membrane which also seems to exist in the chloroplast and in cyanobacteria (as a NADH-plastoquinone oxidoreductase). Among the 25 to 30 polypeptide subunits of this bioenergetic enzyme complex there is one with a molecular weight of 20 kDa (in mammals) , which is a component of the iron-sulphur (IP) fragment of the enzyme. It seems to bind a 4Fe-4S iron-sulphur cluster. The 20 kDa subunit has been found to be nuclear encoded, as a precursor form with a transit peptide in mammals, and in Neurospora crassa. It is mitochondrial encoded in Paramecium (gene psbG) and chloroplast encoded in various higher plants (gene ndhK or psbG).; GO: 0008137 NADH dehydrogenase (ubiquinone) activity, 0006120 mitochondrial electron transport NADH to ubiquinone. Probab=27.83 E-value=35 Score=14.70 Aligned_cols=28 Identities=32% Similarity=0.272 Sum_probs=17.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHCHHHHHHH Q ss_conf 6655789999999875211011220011 Q gi|254780755|r 66 LLACAIEFIHTATLLHDDVVDDSALRRG 93 (322) Q Consensus 66 ~~a~~iEliH~asLiHDDI~D~~~~RRg 93 (322) .++++||++|+++=-||==-=++..+|. T Consensus 26 lACCaIEMm~t~~s~yDldRFG~~~fR~ 53 (146) T TIGR01957 26 LACCAIEMMATGASRYDLDRFGSVVFRA 53 (146) T ss_pred HHHHHHHHHHHHHHHCCHHHCCCCCCCC T ss_conf 6558999999755412631337215877 No 45 >COG1562 ERG9 Phytoene/squalene synthetase [Lipid metabolism] Probab=25.66 E-value=47 Score=13.91 Aligned_cols=154 Identities=19% Similarity=0.203 Sum_probs=80.1 Q ss_pred HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 57888876421024311000123456777777545655421322103578989986433555544321000112222101 Q gi|254780755|r 112 FLLSQAFCMVIETKSQEALEALSLVACTLAEGELRQLSLSKNLDVTEEDYLHVIKSKTAVLFSAALEVSSLIAGVQNSVR 191 (322) Q Consensus 112 ~l~~~a~~~l~~~~~~~~~~~~~~~~~~l~~Gq~~d~~~~~~~~~s~~~y~~ii~~KTa~Lf~~~~~~ga~lag~~~~~~ 191 (322) --+..+|..++.- ...-.+.+.+++.|...|+....-. +++++..-.++--|....+ +..+++ .+. . T Consensus 85 ~pv~~al~~~~~~-----~~~~~~~~~~~~da~~~Dl~~~~y~--~~~eL~~Yc~~vAg~vG~l---~~~Il~-~~~--~ 151 (288) T COG1562 85 HPVLAALVEVARR-----FGLPREAFPALIDAMRMDLDRTRYL--DFEELEEYCYGVAGAVGLL---LARILG-PDK--D 151 (288) T ss_pred CHHHHHHHHHHHH-----CCCCHHHHHHHHHHHHHHHHHCCCC--CHHHHHHHHHHHHHHHHHH---HHHHHC-CCC--C T ss_conf 8799988999987-----0897789999999999776534146--9899999998869999999---999858-462--0 Q ss_pred HHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCHHHHCCCCCCCHHHHHHCCCCCCCHHHHHHHH Q ss_conf 57999997311168988764112367000384012345421365321001101431100013432036899989999999 Q gi|254780755|r 192 QALKSYGMNLGIAFQLVDDVLDYRGEVNEMGKNIGEDFRNGKPTLPVILAFQRGTKREKNFWKSTINDGKISVENLKKAF 271 (322) Q Consensus 192 ~~l~~fG~~lG~afQI~DD~ld~~g~~~~~GK~~g~Dl~egk~Tlpii~al~~~~~~~~~~l~~~~~~~~~~~~~~~~i~ 271 (322) .........+|.|+|+.+=+.|+ +.|...|+.=+|.=-..+...+ .+. +... . ..+.+ . T Consensus 152 ~~~~~~a~~lG~A~QlvNilRdv-----------~eD~~~GrvylP~e~l~~~g~~-~~d----~~~~-~-~~~~~---~ 210 (288) T COG1562 152 AATRAYARGLGLALQLVNILRDV-----------GEDRRRGRVYLPAEELARFGVS-EAD----LLAG-R-VDDAF---R 210 (288) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH-----------HHHHHCCCCCCCHHHHHHHCCC-HHH----HHCC-C-CHHHH---H T ss_conf 01389999988999999999972-----------8899679042877999880999-889----8712-3-21689---9 Q ss_pred HHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCH Q ss_conf 9999749699999999999999999842089998 Q gi|254780755|r 272 IIMRENRALVDTEYRAYSYGQKAKDSLRCLPDSS 305 (322) Q Consensus 272 ~ll~~~gai~~~~~~~~~~~~~A~~~L~~lp~~~ 305 (322) ++ +.+--..++.+...|...+..+|... T Consensus 211 ~~------~~~~~~~ar~~~~~a~~~~~~lp~~~ 238 (288) T COG1562 211 EL------MRFEADRARDHLAEARRGLPALPGRA 238 (288) T ss_pred HH------HHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 99------99999999999999998665178311 No 46 >PRK04260 acetylornithine aminotransferase; Provisional Probab=25.60 E-value=47 Score=13.90 Aligned_cols=129 Identities=20% Similarity=0.262 Sum_probs=64.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHC---------------CCCCCCCCCCH Q ss_conf 789899864335555443210001122221015799999731116898876411---------------23670003840 Q gi|254780755|r 160 DYLHVIKSKTAVLFSAALEVSSLIAGVQNSVRQALKSYGMNLGIAFQLVDDVLD---------------YRGEVNEMGKN 224 (322) Q Consensus 160 ~y~~ii~~KTa~Lf~~~~~~ga~lag~~~~~~~~l~~fG~~lG~afQI~DD~ld---------------~~g~~~~~GK~ 224 (322) ...+....+.|+.+.=|.+...=.--.++++.+.++++...-|+.+ |.|++.- +..|--.+||. T Consensus 157 ~~~~~~~~~iAAiiiEpiqg~~G~~~~~~~yl~~lr~lc~~~gill-I~DEV~tGfGRtG~~fa~e~~gv~PDiv~~gKg 235 (375) T PRK04260 157 SVKALVNKNTAAVMLELVQGESGVLPADKAFVKALADYCQETGILL-IVDEVQTGIGRTGKLYAFEHYGIEPDIFTLAKG 235 (375) T ss_pred HHHHHHCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCE-EEECCCCCCCCCCCCCHHHCCCCCCCEEEECHH T ss_conf 9998637783599998105889875789899999999999847625-730121178765686554316988878987101 Q ss_pred HHHHHHHCCCCCCCCHHHHCCCCCCCHHHHH---------HCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 1234542136532100110143110001343---------2036899989999999999974969999999999999999 Q gi|254780755|r 225 IGEDFRNGKPTLPVILAFQRGTKREKNFWKS---------TINDGKISVENLKKAFIIMRENRALVDTEYRAYSYGQKAK 295 (322) Q Consensus 225 ~g~Dl~egk~Tlpii~al~~~~~~~~~~l~~---------~~~~~~~~~~~~~~i~~ll~~~gai~~~~~~~~~~~~~A~ 295 (322) .+ .| +|+--.+- ++.+.. -+..++.+-..-..+++.+++.+.++.+++.-....+... T Consensus 236 lg----~G---~P~sav~~------~~~i~~~~~~~~h~~T~~gnpla~aaa~a~L~~i~~~~l~~~~~~~g~~l~~~L~ 302 (375) T PRK04260 236 LA----NG---LPVGAMLG------KSSLGGAFGYGSHGSTFGGNKLSMAAAKATLEIMLTAGFLETVLSKGNKLQEQLQ 302 (375) T ss_pred CC----CC---CCEEEEEE------CHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH T ss_conf 04----88---64399998------6787446678785777998989999999999999864699999999999999999 Q ss_pred HHHHCCC Q ss_conf 9842089 Q gi|254780755|r 296 DSLRCLP 302 (322) Q Consensus 296 ~~L~~lp 302 (322) +.+...| T Consensus 303 ~~~~~~~ 309 (375) T PRK04260 303 EALQDKE 309 (375) T ss_pred HHHHCCC T ss_conf 9985399 No 47 >pfam08684 ocr DNA mimic ocr. The structure of an ocr protein from bacteriophage T7 has shown that this protein mimics the size and shape of a bent DNA molecule. ocr has also been shown to be an inhibitor of the complex type I DNA restriction enzymes. Probab=23.70 E-value=40 Score=14.34 Aligned_cols=44 Identities=20% Similarity=0.208 Sum_probs=31.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHCHHHHHHHCCCCHHHCCCHHHHHHHHHH Q ss_conf 665578999999987521101122001101210100110234312457 Q gi|254780755|r 66 LLACAIEFIHTATLLHDDVVDDSALRRGKVAARLVWGNQTSVLVGDFL 113 (322) Q Consensus 66 ~~a~~iEliH~asLiHDDI~D~~~~RRg~~t~~~~~G~~~Ail~GD~l 113 (322) ..+++.|.+...- =||||.|++..|- .+|..-++.+-+.-||.+ T Consensus 8 l~~~a~e~l~e~I-ryd~i~d~DD~~D---~iHe~ad~~VP~yy~diF 51 (100) T pfam08684 8 VFDHAYEALKENI-RYDDIRDTDDLSD---AIHEAADNAVPHYYADIF 51 (100) T ss_pred HHHHHHHHHHHHH-CCCCCCCCCHHHH---HHHHHHCCCCCEEEHHHH T ss_conf 9999999999874-1243356642889---999984168861023343 No 48 >PRK12884 ubiA prenyltransferase; Reviewed Probab=22.22 E-value=55 Score=13.50 Aligned_cols=160 Identities=16% Similarity=0.123 Sum_probs=73.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCHHHHH---HHCCCCHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHH Q ss_conf 5786666557899999998752110112200---1101210100110234312457888876421024311000123456 Q gi|254780755|r 61 GDNHVLLACAIEFIHTATLLHDDVVDDSALR---RGKVAARLVWGNQTSVLVGDFLLSQAFCMVIETKSQEALEALSLVA 137 (322) Q Consensus 61 ~~~~~~~a~~iEliH~asLiHDDI~D~~~~R---Rg~~t~~~~~G~~~Ail~GD~l~~~a~~~l~~~~~~~~~~~~~~~~ 137 (322) ....+-...++=+.+.|+-+..|+.|-+-=| +.+|-+-.+-..+.|...+=.+...++-...-.+ +... .+ .. T Consensus 38 ~~~~~l~~l~~~l~~~a~~~iNd~~D~diD~~n~~~RPlpsG~is~~~A~~~~~~l~~~~~~~~~~l~-~~~~-~l--~~ 113 (281) T PRK12884 38 LTKALLGFLSAFFASASAFALNDYFDVEIDRINRPDRPIPSGRVSLREALLLAIALFILGIVAAYLIS-PLAF-AV--VV 113 (281) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEECHHHHHHHHHHHHHHHHHHHHHHH-HHHH-HH--HH T ss_conf 89999999999999999988536240842500689998888627899999999999999999999999-9999-99--99 Q ss_pred HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHCCCHHHHHHHHHCCCC Q ss_conf 777777545655421322103578989986433555544321000112222101-5799999731116898876411236 Q gi|254780755|r 138 CTLAEGELRQLSLSKNLDVTEEDYLHVIKSKTAVLFSAALEVSSLIAGVQNSVR-QALKSYGMNLGIAFQLVDDVLDYRG 216 (322) Q Consensus 138 ~~l~~Gq~~d~~~~~~~~~s~~~y~~ii~~KTa~Lf~~~~~~ga~lag~~~~~~-~~l~~fG~~lG~afQI~DD~ld~~g 216 (322) .-..-+-......++.. ... +++ -|..++.+...|....+.+.... =.+.-+..-...++-+..|+.|.. T Consensus 114 ~~~~~~~~Y~~~lK~~~--~~~---~~~---~~~~~g~~~l~g~~a~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~D~e- 184 (281) T PRK12884 114 LVLVLGILYDYKLKEYG--LIG---NLY---VAFLTGMTFIFGGIAAGELITEKSWLLAAMAFLMDLGREIAKDIEDVE- 184 (281) T ss_pred HHHHHHHHHHHHHHHCC--CHH---HHH---HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH- T ss_conf 99999999999999576--535---999---999999999999999738975999999999999999999999805254- Q ss_pred CCCCCCCHHHHHHHHCCCCCCCCHHHH Q ss_conf 700038401234542136532100110 Q gi|254780755|r 217 EVNEMGKNIGEDFRNGKPTLPVILAFQ 243 (322) Q Consensus 217 ~~~~~GK~~g~Dl~egk~Tlpii~al~ 243 (322) .|-+.|.+|+|+.+-.+ T Consensus 185 ----------~D~~~Gi~tlpv~~G~~ 201 (281) T PRK12884 185 ----------GDRKRGARTLAILYGEK 201 (281) T ss_pred ----------HHHHCCCCCCCEEECHH T ss_conf ----------19876992677327789 No 49 >KOG0018 consensus Probab=21.73 E-value=56 Score=13.44 Aligned_cols=44 Identities=14% Similarity=0.110 Sum_probs=22.3 Q ss_pred HHHHHHHHHHHHHHHHH-HHHHHHCHHHHHHHCCCCHHHCCCHHHHHHHHH Q ss_conf 86666557899999998-752110112200110121010011023431245 Q gi|254780755|r 63 NHVLLACAIEFIHTATL-LHDDVVDDSALRRGKVAARLVWGNQTSVLVGDF 112 (322) Q Consensus 63 ~~~~~a~~iEliH~asL-iHDDI~D~~~~RRg~~t~~~~~G~~~Ail~GD~ 112 (322) +-.....++|.||-..= |++-|+|-- +|| |.+|+.++++..|-. T Consensus 486 R~~~~~eave~lKr~fPgv~GrviDLc-----~pt-~kkYeiAvt~~Lgk~ 530 (1141) T KOG0018 486 RRSRKQEAVEALKRLFPGVYGRVIDLC-----QPT-QKKYEIAVTVVLGKN 530 (1141) T ss_pred HHHHHHHHHHHHHHHCCCCCCHHHHCC-----CCC-HHHHHHHHHHHHHCC T ss_conf 889999999999875887210112034-----632-888899999997504 No 50 >TIGR01283 nifE nitrogenase MoFe cofactor biosynthesis protein NifE; InterPro: IPR005973 The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene. Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes. It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I. ; GO: 0005515 protein binding, 0006461 protein complex assembly, 0009399 nitrogen fixation. Probab=20.08 E-value=34 Score=14.78 Aligned_cols=49 Identities=16% Similarity=0.338 Sum_probs=28.3 Q ss_pred HHC-CCCCCCCHHHHC------------CCCCCCHHHHHHCCCCCC--CHHHHHHHHHHHHHCC Q ss_conf 421-365321001101------------431100013432036899--9899999999999749 Q gi|254780755|r 230 RNG-KPTLPVILAFQR------------GTKREKNFWKSTINDGKI--SVENLKKAFIIMRENR 278 (322) Q Consensus 230 ~eg-k~Tlpii~al~~------------~~~~~~~~l~~~~~~~~~--~~~~~~~i~~ll~~~g 278 (322) .-| .+||=++.||+. ++++|+++++++.+.+.. ++.+-.++.+++.+++ T Consensus 344 ~GGa~KswSlv~Al~dLGMeVV~~GTqkg~~EDy~~I~e~~~~~~~m~Dd~~p~~L~~~~~~~~ 407 (470) T TIGR01283 344 TGGAVKSWSLVSALQDLGMEVVATGTQKGTEEDYERIRELMGEGTVMLDDANPRELLKLLLEYK 407 (470) T ss_pred CCCHHHHHHHHHHHHHCCCEEEEEEECCCCHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHC T ss_conf 5864789999988845791799983007988899999997079967862588789999998728 Done!