Query gi|254780780|ref|YP_003065193.1| histidine triad (HIT) protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 155 No_of_seqs 113 out of 2377 Neff 7.8 Searched_HMMs 39220 Date Sun May 29 18:31:10 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780780.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG0537 Hit Diadenosine tetrap 100.0 6.5E-38 1.7E-42 257.7 14.0 137 11-148 1-137 (138) 2 cd01275 FHIT FHIT (fragile his 100.0 7.9E-36 2E-40 244.7 10.8 111 13-123 1-112 (126) 3 PRK10687 purine nucleoside pho 100.0 6.1E-35 1.5E-39 239.2 12.3 112 9-120 1-113 (119) 4 cd01277 HINT_subgroup HINT (hi 100.0 3.8E-35 9.7E-40 240.4 11.2 103 12-114 1-103 (103) 5 cd01276 PKCI_related Protein K 100.0 1.9E-32 4.8E-37 223.6 10.7 102 12-113 1-103 (104) 6 pfam01230 HIT HIT domain. 100.0 9.3E-31 2.4E-35 213.1 11.1 98 20-117 1-98 (98) 7 pfam11969 DcpS_C Scavenger mRN 100.0 7.2E-29 1.8E-33 201.3 10.2 109 12-121 1-109 (113) 8 cd01278 aprataxin_related apra 99.9 2.3E-27 5.8E-32 191.9 9.9 100 13-112 2-103 (104) 9 KOG3275 consensus 99.9 2E-27 5.2E-32 192.2 9.3 104 10-116 15-121 (127) 10 cd00468 HIT_like HIT family: H 99.9 1.2E-26 3.1E-31 187.4 8.6 86 28-113 1-86 (86) 11 KOG3379 consensus 99.9 5.7E-24 1.5E-28 170.7 11.5 125 24-148 16-150 (150) 12 cd00608 GalT Galactose-1-phosp 99.7 2.9E-16 7.4E-21 122.7 10.0 137 10-146 183-328 (329) 13 PRK11720 galactose-1-phosphate 99.7 7.4E-16 1.9E-20 120.1 10.2 136 10-146 193-337 (339) 14 pfam02744 GalP_UDP_tr_C Galact 99.7 8E-16 2E-20 119.9 9.6 143 11-153 14-164 (168) 15 COG1085 GalT Galactose-1-phosp 99.5 1.1E-13 2.9E-18 106.5 9.3 144 9-152 183-334 (338) 16 KOG4359 consensus 99.5 2E-13 5E-18 105.0 9.5 114 5-121 25-144 (166) 17 pfam04677 CwfJ_C_1 Protein sim 99.1 8.4E-10 2.1E-14 82.4 8.6 102 9-117 10-112 (122) 18 KOG2958 consensus 99.0 2.1E-09 5.4E-14 79.8 7.3 142 5-147 190-345 (354) 19 KOG2477 consensus 98.0 2.9E-05 7.4E-10 54.0 7.6 103 10-117 406-510 (628) 20 KOG2476 consensus 98.0 4.1E-05 1.1E-09 53.1 8.0 103 8-117 316-419 (528) 21 COG1085 GalT Galactose-1-phosp 97.8 7.7E-05 2E-09 51.4 6.8 105 8-112 37-161 (338) 22 cd00608 GalT Galactose-1-phosp 97.7 0.0002 5.2E-09 48.7 8.0 107 6-113 33-161 (329) 23 PRK11720 galactose-1-phosphate 97.6 0.00047 1.2E-08 46.5 8.0 106 4-112 43-170 (339) 24 KOG0562 consensus 97.3 0.00066 1.7E-08 45.6 5.8 85 24-113 14-104 (184) 25 TIGR00209 galT_1 galactose-1-p 97.1 0.00068 1.7E-08 45.5 4.2 134 13-146 212-355 (367) 26 PRK05471 CDP-diacylglycerol py 97.1 0.0022 5.5E-08 42.3 6.6 99 10-112 37-142 (252) 27 pfam02611 CDH CDP-diacylglycer 96.9 0.0041 1E-07 40.6 6.9 99 10-112 37-142 (249) 28 COG4360 APA2 ATP adenylyltrans 96.7 0.0023 6E-08 42.1 4.2 74 32-116 92-165 (298) 29 KOG3969 consensus 96.0 0.16 4.1E-06 30.7 10.4 93 21-117 156-259 (310) 30 KOG2720 consensus 95.2 0.0098 2.5E-07 38.3 1.8 74 29-112 164-237 (431) 31 COG2134 Cdh CDP-diacylglycerol 95.1 0.13 3.4E-06 31.2 7.2 98 11-112 40-144 (252) 32 KOG2958 consensus 90.9 0.9 2.3E-05 26.0 6.1 108 4-111 45-174 (354) 33 pfam01076 Mob_Pre Plasmid reco 88.3 2.4 6.1E-05 23.3 8.3 81 57-149 90-171 (196) 34 COG5075 Uncharacterized conser 87.4 2.1 5.2E-05 23.8 5.8 90 21-114 151-251 (305) 35 TIGR02705 nudix_YtkD nucleosid 70.7 4.3 0.00011 21.8 2.9 23 38-60 15-40 (158) 36 TIGR00672 cdh CDP-diacylglycer 70.2 4.8 0.00012 21.5 3.1 87 23-113 53-145 (251) 37 smart00537 DCX Domain in the D 66.8 11 0.00028 19.2 4.3 56 33-92 7-63 (89) 38 cd01617 DCX DCX The ubiquiti 63.9 8.6 0.00022 19.9 3.3 58 33-94 2-59 (80) 39 pfam03432 Relaxase Relaxase/Mo 58.5 17 0.00043 18.1 4.8 39 76-118 75-114 (240) 40 TIGR03279 cyano_FeS_chp putati 54.4 12 0.00031 18.9 2.8 29 10-38 85-116 (433) 41 COG1655 Uncharacterized protei 54.1 6.4 0.00016 20.7 1.3 48 10-58 19-72 (267) 42 PRK02770 S-adenosylmethionine 46.2 26 0.00068 16.8 3.5 54 84-149 61-114 (139) 43 TIGR01838 PHA_synth_I poly(R)- 41.7 31 0.00079 16.4 3.5 91 23-118 163-290 (541) 44 TIGR01734 D-ala-DACP-lig D-ala 34.3 38 0.00097 15.8 2.7 42 111-152 434-479 (513) 45 TIGR03330 SAM_DCase_Bsu S-aden 33.6 42 0.0011 15.6 3.5 53 85-149 48-100 (112) 46 TIGR03478 DMSO_red_II_bet DMSO 33.2 23 0.00058 17.2 1.4 22 5-26 183-206 (321) 47 TIGR02778 ligD_pol DNA polymer 33.1 28 0.0007 16.7 1.8 80 25-111 83-168 (251) 48 TIGR02182 GRXB Glutaredoxin, G 30.1 19 0.00049 17.7 0.6 15 111-125 102-116 (212) 49 TIGR01660 narH nitrate reducta 27.1 21 0.00054 17.4 0.4 17 6-22 236-254 (495) 50 PRK11637 hypothetical protein; 26.8 21 0.00053 17.5 0.3 81 30-125 321-401 (404) 51 KOG3816 consensus 25.2 27 0.00068 16.8 0.6 44 38-83 393-436 (526) 52 PRK01706 S-adenosylmethionine 23.6 62 0.0016 14.5 3.5 54 84-149 50-103 (123) 53 TIGR01684 viral_ppase viral ph 22.9 60 0.0015 14.6 2.0 21 33-53 1-21 (323) 54 pfam05632 DUF792 Borrelia burg 21.3 69 0.0018 14.2 4.8 61 53-114 16-77 (213) 55 pfam01868 UPF0086 Domain of un 21.3 69 0.0018 14.2 2.5 36 25-64 32-67 (89) 56 pfam10926 DUF2800 Protein of u 21.3 20 0.00052 17.6 -0.6 46 62-115 246-291 (363) 57 COG1140 NarY Nitrate reductase 20.5 32 0.00083 16.3 0.3 18 6-23 236-255 (513) 58 pfam01777 Ribosomal_L27e Ribos 20.3 73 0.0019 14.1 2.8 12 110-121 28-40 (85) No 1 >COG0537 Hit Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases [Nucleotide transport and metabolism / Carbohydrate transport and metabolism / General function prediction only] Probab=100.00 E-value=6.5e-38 Score=257.70 Aligned_cols=137 Identities=47% Similarity=0.640 Sum_probs=128.5 Q ss_pred CCCHHHHHHCCCCCCCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCE Q ss_conf 97654544468887317877890999941788898229999415542134402788898999999730445531352666 Q gi|254780780|r 11 NQNIFIKIIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADGI 90 (155) Q Consensus 11 d~CiFC~Ii~ge~p~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~~~ 90 (155) .+|+||+|++||+|+.+||||++++||+|++|.+|||+|||||+|+.+++||+++++++++..+++++++++++++++|+ T Consensus 1 ~~ciFc~ii~~e~~~~~Vye~~~~~afld~~P~~~gH~LviPk~h~~~l~~l~~~~~~~l~~~~~~ia~al~~~~~~~g~ 80 (138) T COG0537 1 MMCIFCKIIRGEIPANKVYEDEHVLAFLDIYPAAPGHTLVIPKRHVSDLEDLDPEELAELFLLAQKIAKALKEAFGADGY 80 (138) T ss_pred CCCCCCCCCCCCCCCEEEEECCCEEEEECCCCCCCCEEEEEECHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCE T ss_conf 98731210037777418987798899977788999869999555455676099999999999999999999987197765 Q ss_pred EEEEECCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 9998257566863101069999523799765345577666889899999999999999 Q gi|254780780|r 91 QILQFNGHAAGQTVPHLHFHVIPCKNGDNASHTNIHPTQKIENFAKLEINAQKIRKEL 148 (155) Q Consensus 91 ni~~~~g~~~gq~V~HlH~HiiPR~~~d~~~~~~~~~~~~~~~~~~l~~~a~~lr~~l 148 (155) |+++|+|..+||+|+|+|+||||||++|..+....|+...... +.+++.+++|+++| T Consensus 81 ni~~N~g~~agq~V~HlH~HvIPr~~~d~~~~~~~~~~~~~~~-~~l~~~~~~i~~~l 137 (138) T COG0537 81 NIGINNGKAAGQEVFHLHIHIIPRYKGDDNFPGPGWGTKVEPN-EELEELAEKIRKAL 137 (138) T ss_pred EEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCH-HHHHHHHHHHHHHH T ss_conf 8998647346988662799976897788777676556667867-88999999999865 No 2 >cd01275 FHIT FHIT (fragile histidine family): FHIT proteins, related to the HIT family carry a motif HxHxH/Qxx (x, is a hydrophobic amino acid), On the basis of sequence, substrate specificity, structure, evolution and mechanism, HIT proteins are classified into three branches: the Hint branch, which consists of adenosine 5' -monophosphoramide hydrolases, the Fhit branch, that consists of diadenosine polyphosphate hydrolases, and the GalT branch consisting of specific nucloside monophosphate transferases. Fhit plays a very important role in the development of tumours. Infact, Fhit deletions are among the earliest and most frequent genetic alterations in the development of tumours. Probab=100.00 E-value=7.9e-36 Score=244.70 Aligned_cols=111 Identities=29% Similarity=0.536 Sum_probs=105.9 Q ss_pred CHHHHHHCC-CCCCCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEE Q ss_conf 654544468-8873178778909999417888982299994155421344027888989999997304455313526669 Q gi|254780780|r 13 NIFIKIIRN-ETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADGIQ 91 (155) Q Consensus 13 CiFC~Ii~g-e~p~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~~~n 91 (155) |+||+|+.+ +.|+.+||||+.++||+|++|.+|||+|||||+|+.++.||+++++.+++.++++++++++++++++|+| T Consensus 1 CiFC~I~~~~~~~~~ivyede~~~aflD~~P~~~GH~LViPK~H~~~l~dl~~~e~~~l~~~~~~~~~~l~~~~~~~g~n 80 (126) T cd01275 1 CVFCDIPIKPDEDNLVFYRTKHSFAVVNLYPYNPGHVLVVPYRHVPRLEDLTPEEIADLFKLVQLAMKALKVVYKPDGFN 80 (126) T ss_pred CCCCCCCCCCCCCCEEEEECCCEEEEECCCCCCCCEEEEECCHHCCHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCE T ss_conf 95452616999865189979969999768889996099975533051530999999999999999999999962767716 Q ss_pred EEEECCCCCCCEEEEEEEEEEECCCCCCCCCC Q ss_conf 99825756686310106999952379976534 Q gi|254780780|r 92 ILQFNGHAAGQTVPHLHFHVIPCKNGDNASHT 123 (155) Q Consensus 92 i~~~~g~~~gq~V~HlH~HiiPR~~~d~~~~~ 123 (155) +++|+|+.+||+|||+|+||||||++|..+.. T Consensus 81 i~~n~G~~aGQ~V~H~H~HiIPR~~~D~~f~~ 112 (126) T cd01275 81 IGINDGKAGGGIVPHVHIHIVPRWNGDTNFMP 112 (126) T ss_pred EEEECCCCCCCCCCEEEEEEECCCCCCCCCCE T ss_conf 99815712168636499998088489988655 No 3 >PRK10687 purine nucleoside phosphoramidase; Provisional Probab=100.00 E-value=6.1e-35 Score=239.18 Aligned_cols=112 Identities=29% Similarity=0.452 Sum_probs=99.9 Q ss_pred CCCCCHHHHHHCCCCCCCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHH-HHHHCCC Q ss_conf 589765454446888731787789099994178889822999941554213440278889899999973044-5531352 Q gi|254780780|r 9 YDNQNIFIKIIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIA-CKSAFQA 87 (155) Q Consensus 9 ~dd~CiFC~Ii~ge~p~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~a-l~~~~~~ 87 (155) +.|+|+||+|++||+|+.+|||||.++||+|++|.+|||+|||||+|+.++.|+++++...++.+++.+.+. .++.+.. T Consensus 1 M~~~cIFckIi~geipa~~VyEdd~~~AFlDi~P~apgH~LVIPK~h~~~~~dl~~~~~~~l~~~~~~~~~ia~~~~~~~ 80 (119) T PRK10687 1 MAEETIFSKIIRREIPSDIVYQDELVTAFRDISPQAPTHILIIPNILIPTVNDVTAEHEQALGRMITVAAKIAEQEGIAE 80 (119) T ss_pred CCCCCHHHHHHCCCCCCCEEEECCCEEEEECCCCCCCCEEEEEECHHHCCHHHCCHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 98657879987488776579747988998679989996399984355433756887689999999999999999828897 Q ss_pred CCEEEEEECCCCCCCEEEEEEEEEEECCCCCCC Q ss_conf 666999825756686310106999952379976 Q gi|254780780|r 88 DGIQILQFNGHAAGQTVPHLHFHVIPCKNGDNA 120 (155) Q Consensus 88 ~~~ni~~~~g~~~gq~V~HlH~HiiPR~~~d~~ 120 (155) +|+|+++|+|+.+||+|||+|+||||+..-... T Consensus 81 ~G~~i~~N~G~~aGQ~V~H~H~HvI~g~~lg~m 113 (119) T PRK10687 81 DGYRLIMNTNRHGGQEVYHIHMHLLGGRPLGPM 113 (119) T ss_pred CCEEEEEECCCCCCEECCEEEEEEECCCCCCCC T ss_conf 750799877886777617738997389857775 No 4 >cd01277 HINT_subgroup HINT (histidine triad nucleotide-binding protein) subgroup: Members of this CD belong to the superfamily of histidine triad hydrolases that act on alpha-phosphate of ribonucleotides. This subgroup includes members from all three forms of cellular life. Although the biochemical function has not been characterised for many of the members of this subgroup, the proteins from Yeast have been shown to be involved in secretion, peroxisome formation and gene expression. Probab=100.00 E-value=3.8e-35 Score=240.43 Aligned_cols=103 Identities=49% Similarity=0.760 Sum_probs=102.2 Q ss_pred CCHHHHHHCCCCCCCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEE Q ss_conf 76545444688873178778909999417888982299994155421344027888989999997304455313526669 Q gi|254780780|r 12 QNIFIKIIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADGIQ 91 (155) Q Consensus 12 ~CiFC~Ii~ge~p~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~~~n 91 (155) ||+||+|++||+|+.+||||+.++||+|++|.+|||+||+||+|++++.||+++++.+++.+++++++++++.++++|+| T Consensus 1 dCiFc~I~~~e~~~~iv~E~e~~~af~d~~P~~pgH~LViPK~H~~~l~dl~~~~~~~l~~~~~~~~~~l~~~~~~~g~~ 80 (103) T cd01277 1 DCIFCKIIAGEIPSYKVYEDDHVLAFLDINPASKGHTLVIPKKHYENLLDLDPEELAELILAAKKVARALKKALKADGLN 80 (103) T ss_pred CCCCCCCCCCCCCCCEEEECCCEEEEECCCCCCCCEEEEEECCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCEE T ss_conf 98034561588876779979979999778889994299996257798113999999999999999999887445778779 Q ss_pred EEEECCCCCCCEEEEEEEEEEEC Q ss_conf 99825756686310106999952 Q gi|254780780|r 92 ILQFNGHAAGQTVPHLHFHVIPC 114 (155) Q Consensus 92 i~~~~g~~~gq~V~HlH~HiiPR 114 (155) +++|+|+.+||+|+|+|+||||| T Consensus 81 v~~n~g~~agq~v~H~H~HiiPR 103 (103) T cd01277 81 ILQNNGRAAGQVVFHVHVHVIPR 103 (103) T ss_pred EEECCCCCCCEEEEEEEEEEECC T ss_conf 98438762685755899999577 No 5 >cd01276 PKCI_related Protein Kinase C Interacting protein related (PKCI): PKCI and related proteins belong to the ubiquitous HIT family of hydrolases that act on alpha-phosphates of ribonucleotides. The members of this subgroup have a conserved HxHxHxx motif (x is a hydrophobic residue) that is a signature for this family. No enzymatic activity has been reported however, for PKCI and its related members. Probab=99.98 E-value=1.9e-32 Score=223.64 Aligned_cols=102 Identities=39% Similarity=0.597 Sum_probs=95.3 Q ss_pred CCHHHHHHCCCCCCCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH-HCCCCCE Q ss_conf 765454446888731787789099994178889822999941554213440278889899999973044553-1352666 Q gi|254780780|r 12 QNIFIKIIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKS-AFQADGI 90 (155) Q Consensus 12 ~CiFC~Ii~ge~p~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~-~~~~~~~ 90 (155) ||+||+|++||+|+.+|||||.++||+|++|.+|||+|||||+|++++.+|++++...++.+++++.+.+++ .++++|+ T Consensus 1 dCiFC~I~~ge~p~~ivyede~~~aflD~~P~~~GH~LViPK~H~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 80 (104) T cd01276 1 DCIFCKIIRGEIPAKKVYEDDEVLAFHDINPQAPVHILVIPKKHIASLSDATEEDEELLGHLLSAAAKVAKDLGIAEDGY 80 (104) T ss_pred CCCCCCCCCCCCCCCEEEECCCEEEEECCCCCCCCEEEEEEEEEECCHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 99544461688787889979979999888999983299985422298588998589999999999999999858888886 Q ss_pred EEEEECCCCCCCEEEEEEEEEEE Q ss_conf 99982575668631010699995 Q gi|254780780|r 91 QILQFNGHAAGQTVPHLHFHVIP 113 (155) Q Consensus 91 ni~~~~g~~~gq~V~HlH~HiiP 113 (155) |+++|+|+.+||+|+|+|+|||+ T Consensus 81 ~v~~n~G~~agq~V~H~H~HvI~ 103 (104) T cd01276 81 RLVINCGKDGGQEVFHLHLHLLG 103 (104) T ss_pred EEEECCCCCCCEEECEEEEEEEC T ss_conf 79875887866550587998866 No 6 >pfam01230 HIT HIT domain. Probab=99.97 E-value=9.3e-31 Score=213.06 Aligned_cols=98 Identities=49% Similarity=0.833 Sum_probs=96.4 Q ss_pred CCCCCCCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCC Q ss_conf 68887317877890999941788898229999415542134402788898999999730445531352666999825756 Q gi|254780780|r 20 RNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADGIQILQFNGHA 99 (155) Q Consensus 20 ~ge~p~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~~~ni~~~~g~~ 99 (155) +||+|+.+||||+.++||+|++|.+|||+|||||+|+++++||+++++.+++.+++++++++++.++++|+|+++|+|+. T Consensus 1 ~ge~p~~ivyed~~~~aflD~~P~~~gH~LViPk~H~~~l~dl~~~~~~~l~~~~~~~~~~l~~~~~~~g~~i~~n~G~~ 80 (98) T pfam01230 1 RGEIPSTVVYEDDLVLAFLDIDPQAPGHVLVIPKKHIRELHDLTPEELGDLMSEAKKVARALGKVFKADGYRVVINNGAA 80 (98) T ss_pred CCCCCCCEEEECCCEEEEECCCCCCCCEEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCC T ss_conf 99867618997897999987888999719999610528514299999999999999999999875156756899980312 Q ss_pred CCCEEEEEEEEEEECCCC Q ss_conf 686310106999952379 Q gi|254780780|r 100 AGQTVPHLHFHVIPCKNG 117 (155) Q Consensus 100 ~gq~V~HlH~HiiPR~~~ 117 (155) +||+|+|+|+|||||++| T Consensus 81 agq~v~H~H~HiiPr~~g 98 (98) T pfam01230 81 AGQSVPHLHIHIIPRREG 98 (98) T ss_pred CCEEEEEEEEEEECCCCC T ss_conf 570114799997077483 No 7 >pfam11969 DcpS_C Scavenger mRNA decapping enzyme C-term binding. This family consists of several scavenger mRNA decapping enzymes (DcpS) and is the C-terminal region. DcpS is a scavenger pyrophosphatase that hydrolyses the residual cap structure following 3' to 5' decay of an mRNA. The association of DcpS with 3' to 5' exonuclease exosome components suggests that these two activities are linked and there is a coupled exonucleolytic decay-dependent decapping pathway. The C-terminal domain contains a histidine triad (HIT) sequence with three histidines separated by hydrophobic residues. The central histidine within the DcpS HIT motif is critical for decapping activity and defines the HIT motif as a new mRNA decapping domain, making DcpS the first member of the HIT family of proteins with a defined biological function. Probab=99.96 E-value=7.2e-29 Score=201.29 Aligned_cols=109 Identities=30% Similarity=0.410 Sum_probs=96.9 Q ss_pred CCHHHHHHCCCCCCCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEE Q ss_conf 76545444688873178778909999417888982299994155421344027888989999997304455313526669 Q gi|254780780|r 12 QNIFIKIIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADGIQ 91 (155) Q Consensus 12 ~CiFC~Ii~ge~p~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~~~n 91 (155) +|+||+|++|++|+++|||||.++||+|++|.+|||+|||||+|++++.||++++...+..+.+.+.+.+++ ....+++ T Consensus 1 ~ciFc~I~~~~~p~~ivyedd~~vaf~D~~P~~~gH~LVIPk~h~~~~~dl~~~~~~~l~~~~~~~~~~~~~-~~~~~~~ 79 (113) T pfam11969 1 KWVFCIIINGEEPERIVYEDDGFVAFKDIKPKAPLHLLVIPKKHIKSLRDLTKEHLPLLGHMREVAKKIAKE-KYNEGYR 79 (113) T ss_pred CCCCCEEECCCCCCCEEEECCCEEEEECCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHH-HCCCCCC T ss_conf 946336536897778899799999986899899975999999740443548987999999999999999998-6897786 Q ss_pred EEEECCCCCCCEEEEEEEEEEECCCCCCCC Q ss_conf 998257566863101069999523799765 Q gi|254780780|r 92 ILQFNGHAAGQTVPHLHFHVIPCKNGDNAS 121 (155) Q Consensus 92 i~~~~g~~~gq~V~HlH~HiiPR~~~d~~~ 121 (155) ..+|+|..+||+|+|+|+|||||...++.. T Consensus 80 ~~~n~G~~a~qsv~HlH~HvI~~~~~~~~~ 109 (113) T pfam11969 80 DVLRLGFHYGPSVYHLHLHVIAPDFEPGLG 109 (113) T ss_pred EEEECCCCCCCCCCEEEEEEECCCCCCCCC T ss_conf 699726443636788899984677888777 No 8 >cd01278 aprataxin_related aprataxin related: Aprataxin, a HINT family hydrolase is mutated in ataxia oculomotor apraxia syndrome. All the members of this subgroup have the conserved HxHxHxx (where x is a hydrophobic residue) signature motif. Members of this subgroup are predominantly eukaryotic in origin. Probab=99.95 E-value=2.3e-27 Score=191.93 Aligned_cols=100 Identities=25% Similarity=0.285 Sum_probs=83.6 Q ss_pred CHHHHHHCCCCCCC--EEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCE Q ss_conf 65454446888731--7877890999941788898229999415542134402788898999999730445531352666 Q gi|254780780|r 13 NIFIKIIRNETNAC--RVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADGI 90 (155) Q Consensus 13 CiFC~Ii~ge~p~~--~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~~~ 90 (155) |+||+|++|++|+. +|||||.++||+|++|.+|||+|||||+|++++.+|++++...+..+.......+++..+.++. T Consensus 2 CiFC~Ii~g~~p~~~~ivyEdd~~~af~Di~P~a~~H~LVIPK~Hi~~~~~l~~e~~~ll~~m~~~~~~~~~~~~~~~~~ 81 (104) T cd01278 2 CHFCDIAKRRDPDPEDQVYEDDRVVVFKDIYPKARHHYLVIPKEHIASLKALTKEDVPLLEHMETVGREKLLRSDNTDPS 81 (104) T ss_pred CCCCEEECCCCCCCCCEEEECCCEEEEECCCCCCCCEEEEEEHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 64877336888986567975797999967998998449997156542387799989999999999999999983799877 Q ss_pred EEEEECCCCCCCEEEEEEEEEE Q ss_conf 9998257566863101069999 Q gi|254780780|r 91 QILQFNGHAAGQTVPHLHFHVI 112 (155) Q Consensus 91 ni~~~~g~~~gq~V~HlH~Hii 112 (155) ++.++.+..+||+|+|+|+||| T Consensus 82 ~~~~~~~~~~gqsV~HlH~HvI 103 (104) T cd01278 82 EFRFGFHAPPFTSVSHLHLHVI 103 (104) T ss_pred CEEEECCCCCCEEEEEEEEEEE T ss_conf 7899688789873647499986 No 9 >KOG3275 consensus Probab=99.95 E-value=2e-27 Score=192.22 Aligned_cols=104 Identities=38% Similarity=0.606 Sum_probs=95.5 Q ss_pred CCCCHHHHHHCCCCCCCEEEECCEEEEEECCCCCCCCEEEEEECCC---CCHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 8976545444688873178778909999417888982299994155---4213440278889899999973044553135 Q gi|254780780|r 10 DNQNIFIKIIRNETNACRVYEDDILLAIMDIMPRNPGHVLIIPKSR---IRDIFEAPPEILSQIAFLIKKIAIACKSAFQ 86 (155) Q Consensus 10 dd~CiFC~Ii~ge~p~~~v~E~d~~~afld~~P~~~GH~lViPk~H---~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~ 86 (155) +.+|+||+|+++|+|+.+|||||.++||.|+.|..|||.|||||+| .+...|.+++.+++++.+++++++++. + T Consensus 15 ~~~tIF~kIi~keIPa~ii~Edd~~lAF~Di~Pqap~HfLvIPK~hi~~~s~aed~~~e~Lg~ll~~~k~vak~~G--l- 91 (127) T KOG3275 15 AAPTIFCKIIRKEIPAKIIFEDDRCLAFHDIAPQAPGHFLVIPKKHITQLSKAEDRDDELLGHLLPVAKKVAKALG--L- 91 (127) T ss_pred CCCCEEEEEECCCCCCCEEEECCCEEEEEECCCCCCCEEEEEECCCCCCHHHCCCCCHHHHHHHHHHHHHHHHHHC--C- T ss_conf 7880753321046773068623546898723888872399955443310121134898998778999999999848--4- Q ss_pred CCCEEEEEECCCCCCCEEEEEEEEEEECCC Q ss_conf 266699982575668631010699995237 Q gi|254780780|r 87 ADGIQILQFNGHAAGQTVPHLHFHVIPCKN 116 (155) Q Consensus 87 ~~~~ni~~~~g~~~gq~V~HlH~HiiPR~~ 116 (155) .+|||+++|||..++|+|+|+|+|++|++. T Consensus 92 ~~gYrvv~NnG~~g~QsV~HvH~HvlgGrq 121 (127) T KOG3275 92 EDGYRVVQNNGKDGHQSVYHVHLHVLGGRQ 121 (127) T ss_pred CCCEEEEECCCCCCCEEEEEEEEEEECCCC T ss_conf 342268882885453589999999817720 No 10 >cd00468 HIT_like HIT family: HIT (Histidine triad) proteins, named for a motif related to the sequence HxHxH/Qxx (x, a hydrophobic amino acid), are a superfamily of nucleotide hydrolases and transferases, which act on the alpha-phosphate of ribonucleotides. On the basis of sequence, substrate specificity, structure, evolution and mechanism, HIT proteins are classified in the literacture into three major branches: the Hint branch, which consists of adenosine 5' -monophosphoramide hydrolases, the Fhit branch, that consists of diadenosine polyphosphate hydrolases, and the GalT branch consisting of specific nucloside monophosphate transferases. Further sequence analysis reveals several new closely related, yet uncharacterized subgroups. Probab=99.94 E-value=1.2e-26 Score=187.38 Aligned_cols=86 Identities=29% Similarity=0.599 Sum_probs=85.1 Q ss_pred EEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCEEEEE Q ss_conf 87789099994178889822999941554213440278889899999973044553135266699982575668631010 Q gi|254780780|r 28 VYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHL 107 (155) Q Consensus 28 v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~~~ni~~~~g~~~gq~V~Hl 107 (155) |||||.++||+|++|.+|||+|||||+|+.++.||+++++.+++.+++++++++++.++++|+|+++|+|+++||+|||+ T Consensus 1 VyEde~~~af~Di~P~~~gH~LViPk~h~~~l~dl~~~~~~~l~~~~~~~~~~l~~~~~~~g~~i~~n~G~~aGQ~V~H~ 80 (86) T cd00468 1 VPDDEHSFAFVNLKPAAPGHVLVCPKRHVETLPDLDEALLADLVITAQRVAAELEKHGNVPSLTVFVNDGAAAGQSVPHV 80 (86) T ss_pred CCCCCCEEEEECCCCCCCCCEEEECCHHHCCHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCEECCEE T ss_conf 94278589997798899972999702241993469999999999999999999997359980799995577546633789 Q ss_pred EEEEEE Q ss_conf 699995 Q gi|254780780|r 108 HFHVIP 113 (155) Q Consensus 108 H~HiiP 113 (155) |+|||| T Consensus 81 H~HiiP 86 (86) T cd00468 81 HLHVLP 86 (86) T ss_pred EEEECC T ss_conf 998678 No 11 >KOG3379 consensus Probab=99.91 E-value=5.7e-24 Score=170.73 Aligned_cols=125 Identities=24% Similarity=0.418 Sum_probs=109.7 Q ss_pred CCCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCE Q ss_conf 73178778909999417888982299994155421344027888989999997304455313526669998257566863 Q gi|254780780|r 24 NACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADGIQILQFNGHAAGQT 103 (155) Q Consensus 24 p~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~~~ni~~~~g~~~gq~ 103 (155) |..++|++.+++||.+.+|+.|||+||+|+|-++.|.||+.+|.++|+..++++.+.+++.++....|+.+..|+.|||+ T Consensus 16 ~~~VFykT~~sfafvNlkPvvpgHVLv~P~R~vpRl~dLt~~E~aDlF~t~~~v~~~lek~~~~ts~ti~iQDG~~AGQT 95 (150) T KOG3379 16 PDHVFYKTKHSFAFVNLKPVVPGHVLVSPLRVVPRLTDLTAAETADLFTTVQKVQRVLEKHYNATSLTIAIQDGPEAGQT 95 (150) T ss_pred CCEEEEECCCEEEEEECCCCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCC T ss_conf 00289843511798702146566289820533454135981889999999999999999982666158984256001765 Q ss_pred EEEEEEEEEECCCCCCCCCCC----------CCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 101069999523799765345----------577666889899999999999999 Q gi|254780780|r 104 VPHLHFHVIPCKNGDNASHTN----------IHPTQKIENFAKLEINAQKIRKEL 148 (155) Q Consensus 104 V~HlH~HiiPR~~~d~~~~~~----------~~~~~~~~~~~~l~~~a~~lr~~l 148 (155) |||+|+||+||+.+|....-. ....++....++|++.|+.+|+.+ T Consensus 96 VpHvHvHIlPR~~gDf~~Nd~IY~~L~~~~~e~~~r~~Rs~eEM~eEA~~lr~~~ 150 (150) T KOG3379 96 VPHVHVHILPRKAGDFGDNDLIYDELDKHEKELEDRKPRSLEEMAEEAQRLREYF 150 (150) T ss_pred CCEEEEEECCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHC T ss_conf 6526898744202566664599998875245533378324899999999998529 No 12 >cd00608 GalT Galactose-1-phosphate uridyl transferase (GalT): This enzyme plays a key role in galactose metabolism by catalysing the transfer of a uridine 5'-phosphoryl group from UDP-galactose 1-phosphate. The structure of E.coli GalT reveals that the enzyme contains two identical subunits. It also demonstrates that the active site is formed by amino acid residues from both subunits of the dimer. Probab=99.68 E-value=2.9e-16 Score=122.65 Aligned_cols=137 Identities=17% Similarity=0.223 Sum_probs=108.5 Q ss_pred CCCCHHHHHHCCCC--CCCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCC- Q ss_conf 89765454446888--731787789099994178889822999941554213440278889899999973044553135- Q gi|254780780|r 10 DNQNIFIKIIRNET--NACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQ- 86 (155) Q Consensus 10 dd~CiFC~Ii~ge~--p~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~- 86 (155) ...|+||+|++.|. ..++|+||++++||....+..|..++|+||+|+..|.+|++++..+|..+++.+.+.+.+.++ T Consensus 183 ~g~~l~~dii~~E~~~~~RiV~en~~fva~vP~aar~P~E~~I~Pk~h~~~f~~l~~~e~~~LA~~lk~~l~rl~~~~~~ 262 (329) T cd00608 183 HGRCLLCDYLKLELESKERIVVENEHFVAVVPFWARWPFEVHILPKRHVSRFTDLTDEEREDLAEILKRLLARYDNLFNC 262 (329) T ss_pred HCCCHHHHHHHHHHHCCCEEEEECCCEEEEECCCCCCCEEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 49971899899875148659997798899701546897079996056668802389999999999999999999986099 Q ss_pred CCCEEEEEECCCCCCCE----EEEEEEEEEECCCCCCCCC--CCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 26669998257566863----1010699995237997653--455776668898999999999999 Q gi|254780780|r 87 ADGIQILQFNGHAAGQT----VPHLHFHVIPCKNGDNASH--TNIHPTQKIENFAKLEINAQKIRK 146 (155) Q Consensus 87 ~~~~ni~~~~g~~~gq~----V~HlH~HiiPR~~~d~~~~--~~~~~~~~~~~~~~l~~~a~~lr~ 146 (155) .-.||++++..+..+.. -.|+|+.++||...+.... +.-+.++.+.+....|+.|++||+ T Consensus 263 ~~pYn~~~h~aP~~~~~~~~~~~H~hi~i~Pr~~~~~~k~lAGfE~gtG~~in~v~PE~aA~~LRe 328 (329) T cd00608 263 SFPYSMGWHQAPTGGKELENWYYHWHFEIPPRRSATVLKFMAGFELGAGEFINDVTPEQAAARLRE 328 (329) T ss_pred CCCCEEHHHHCCCCCCCCCCCCEEEEEEECCCCCCCCCCEEEEEHHHCCCCCCCCCHHHHHHHHHC T ss_conf 998172153189988778775389999966666788752137874435980379999999999855 No 13 >PRK11720 galactose-1-phosphate uridylyltransferase; Provisional Probab=99.66 E-value=7.4e-16 Score=120.12 Aligned_cols=136 Identities=16% Similarity=0.206 Sum_probs=104.4 Q ss_pred CCCCHHHHHHCCCCC--CCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 897654544468887--317877890999941788898229999415542134402788898999999730445531352 Q gi|254780780|r 10 DNQNIFIKIIRNETN--ACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQA 87 (155) Q Consensus 10 dd~CiFC~Ii~ge~p--~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~ 87 (155) ...|+||+|++.|.. .++|+||++++||....+..|..++|+||+|+.+|.+|++++..+|..+.+++.+.+.+.++. T Consensus 193 ~g~cl~cd~i~~E~~~~~RiV~e~~~fva~vPyasr~PfE~~I~Pk~h~~~f~~l~~~e~~~LA~~lk~vl~r~d~lf~~ 272 (339) T PRK11720 193 HGSPLLVDYVQRELADGERTVVETEHWLAVVPYWAAWPFETLLLPKRHVLRLTDLTDAQRDDLALALKKLTSRYDNLFQC 272 (339) T ss_pred HCCCCHHHHHHHHHCCCCEEEEECCCEEEEEECCCCCCCEEEECCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 19961788775541269769997798799831457897479974578877700299999999999999999999974399 Q ss_pred C-CEEEEEECCCCCCCEE--EEEEEEEEECC---CCCCCCCCCC-CCCCCCCCHHHHHHHHHHHHH Q ss_conf 6-6699982575668631--01069999523---7997653455-776668898999999999999 Q gi|254780780|r 88 D-GIQILQFNGHAAGQTV--PHLHFHVIPCK---NGDNASHTNI-HPTQKIENFAKLEINAQKIRK 146 (155) Q Consensus 88 ~-~~ni~~~~g~~~gq~V--~HlH~HiiPR~---~~d~~~~~~~-~~~~~~~~~~~l~~~a~~lr~ 146 (155) . -||++++..+..+..- .|+|+|++|+. .+-.....++ +......| -..|+.|++||+ T Consensus 273 p~PY~m~~h~aP~~~~~~~~~h~hi~~~Ppl~R~~~~~kflaG~E~~~~~~~d-~~PE~aA~~LRe 337 (339) T PRK11720 273 SFPYSMGWHGAPFNGEDNDHWQLHAHFYPPLLRSATVRKFMVGYEMLAETQRD-LTPEQAAERLRA 337 (339) T ss_pred CCCCCHHHHCCCCCCCCCCEEEEEEEECCCCCCCCCCCCEEEEECCCCCCCCC-CCHHHHHHHHHC T ss_conf 98833000118889988864899999878534467753023674120364478-999999999870 No 14 >pfam02744 GalP_UDP_tr_C Galactose-1-phosphate uridyl transferase, C-terminal domain. SCOP reports fold duplication with N-terminal domain. Both involved in Zn and Fe binding. Probab=99.65 E-value=8e-16 Score=119.90 Aligned_cols=143 Identities=13% Similarity=0.158 Sum_probs=110.8 Q ss_pred CCCHHHHHHCCCCC--CCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 97654544468887--3178778909999417888982299994155421344027888989999997304455313526 Q gi|254780780|r 11 NQNIFIKIIRNETN--ACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQAD 88 (155) Q Consensus 11 d~CiFC~Ii~ge~p--~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~ 88 (155) ..|+||+|++.|.. .++|+||+.+++|..-....|..++|+||+|+.++.+|++++..+|..+++.+.+.+.+.++.+ T Consensus 14 G~~L~~D~~~~E~~~~~RvV~en~~f~a~~P~aarwP~Ev~i~Pk~h~~~l~~l~d~e~~dLA~il~~il~r~d~lf~~p 93 (168) T pfam02744 14 GSILLHDYVQMELAEKERVVVENESWPVVVPYWAKWPFETLRLPKRHVLSLTELTDKEREDLAAIVKPLTRRYDNLFETS 93 (168) T ss_pred CCCHHHHHHHHHHCCCCEEEEECCCEEEEEECCCCCCEEEEEECCCCHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 97359999999767697799846980899866026960899953866434523488999999999999999620356999 Q ss_pred -CEEEEEECCCCCCCEEE--EEEEEEEEC---CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf -66999825756686310--106999952---379976534557766688989999999999999999884 Q gi|254780780|r 89 -GIQILQFNGHAAGQTVP--HLHFHVIPC---KNGDNASHTNIHPTQKIENFAKLEINAQKIRKELQNFLK 153 (155) Q Consensus 89 -~~ni~~~~g~~~gq~V~--HlH~HiiPR---~~~d~~~~~~~~~~~~~~~~~~l~~~a~~lr~~l~~~l~ 153 (155) -+|++++..+..+...+ |+|+|+.|. ..+-.....+.-..+...+....|+.|++||+||..+=+ T Consensus 94 ~PY~~~~HqaP~~~~~~~~~~~H~~~~p~~lr~~~~~k~~aG~e~~~~~~~d~~PE~aA~~LReA~~~~h~ 164 (168) T pfam02744 94 FPYSMGIHQAPLNAEELNHWQFHPHFYPPLLRSATVRKFEVGFEILPERQRDLTAEQAAERLREALAAVHK 164 (168) T ss_pred CCCCHHHHCCCCCCCCCCHHHEEEEEEEHHEECCCCCEEEEEHHHCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 98541553178986655333534574112134278646765621035513667989999999888889888 No 15 >COG1085 GalT Galactose-1-phosphate uridylyltransferase [Energy production and conversion] Probab=99.51 E-value=1.1e-13 Score=106.47 Aligned_cols=144 Identities=18% Similarity=0.203 Sum_probs=111.9 Q ss_pred CCCCCHHHHHHCCCCC--CCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 5897654544468887--31787789099994178889822999941554213440278889899999973044553135 Q gi|254780780|r 9 YDNQNIFIKIIRNETN--ACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQ 86 (155) Q Consensus 9 ~dd~CiFC~Ii~ge~p--~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~ 86 (155) .+..|+||+|+..|.- .++|+|++.++||.......|.+++++||+|+..+.+|++++..+|..+++.+.....++++ T Consensus 183 ~~~~~~~~~~ve~E~~~~~R~v~e~~~~~a~~Pf~a~~pfEv~i~pk~hv~~l~~~sdee~~~lA~ilk~~~~~y~~~~~ 262 (338) T COG1085 183 ENGSCMYCDLVEREKGDGERIVVENDHFLAFVPFWARWPFEVLIYPKEHVSFLTDLSDEELKDLAEILKKLLARYDNLFG 262 (338) T ss_pred HCCCCHHHHHHHHHHCCCCEEEECCCEEEEECCCCCCCCEEEEECCHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 55986588999998415732870376069960235667437986567776445417888899999999999999850358 Q ss_pred CC-CEEEEEECCCC-CCCEEEEEEEEEEE---CCCCCCCCCCCCC-CCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 26-66999825756-68631010699995---2379976534557-76668898999999999999999988 Q gi|254780780|r 87 AD-GIQILQFNGHA-AGQTVPHLHFHVIP---CKNGDNASHTNIH-PTQKIENFAKLEINAQKIRKELQNFL 152 (155) Q Consensus 87 ~~-~~ni~~~~g~~-~gq~V~HlH~HiiP---R~~~d~~~~~~~~-~~~~~~~~~~l~~~a~~lr~~l~~~l 152 (155) .. -++++.+.-+. ..+.-.|+|+|++| |..+-......+. ..+.+...-..|+.|++||+...... T Consensus 263 ~~fpY~m~~h~ap~~~~~~~~~~h~~~~p~~~R~~t~~k~~~g~e~~~~e~~~~~~pEeaA~~LR~~~~~~~ 334 (338) T COG1085 263 NSFPYSMGFHQAPFNEVNEHYHLHAEIYPPLLRSATKLKFLAGYEMGAGEFIRDVTPEEAAERLRERSAEIH 334 (338) T ss_pred CCCCEEEEEECCCCCCCCCCCEEEEEECCCCCCCCCCCCEEEEEECCCCEEECCCCHHHHHHHHHHHHHCCC T ss_conf 887603565206877666453489998355211135300032330344245326898999999987441003 No 16 >KOG4359 consensus Probab=99.49 E-value=2e-13 Score=104.97 Aligned_cols=114 Identities=21% Similarity=0.237 Sum_probs=79.9 Q ss_pred CCCCCCCCCHHHHHHCCCCCCC--EEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHH---HHHHHHHHHHH Q ss_conf 6566589765454446888731--787789099994178889822999941554213440278889---89999997304 Q gi|254780780|r 5 SSTHYDNQNIFIKIIRNETNAC--RVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILS---QIAFLIKKIAI 79 (155) Q Consensus 5 s~~~~dd~CiFC~Ii~ge~p~~--~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~---~l~~~~~~v~~ 79 (155) -++...+.|.||+|+.++.+.. ...||+.+++|.|++|.+..|+||+||+|+.+..+|..+... .++...+.+.+ T Consensus 25 ~~~~~~~~C~FCDia~r~~~~~ell~~En~~~V~fkDikPaA~~HYLvipK~Hi~~~~~L~k~~V~Lve~m~~~G~~~l~ 104 (166) T KOG4359 25 KSPEPKSTCVFCDIAGRQDPGTELLHCENEDLVCFKDIKPAATHHYLVVPKKHIGNCRTLRKDQVELVENMVTVGKTILE 104 (166) T ss_pred CCCCCCCCEEEEEEECCCCCCCCEEEECCCCEEEEECCCCCCCCEEEEECHHHCCCHHHCCHHHHHHHHHHHHHHHHHHH T ss_conf 23376787078760125689985067627747998047813023389706787177554444455899999998899999 Q ss_pred HHHHHCCCCCEEEEEECCCCCCCEEEEEEEEEE-ECCCCCCCC Q ss_conf 455313526669998257566863101069999-523799765 Q gi|254780780|r 80 ACKSAFQADGIQILQFNGHAAGQTVPHLHFHVI-PCKNGDNAS 121 (155) Q Consensus 80 al~~~~~~~~~ni~~~~g~~~gq~V~HlH~Hii-PR~~~d~~~ 121 (155) .-.. ..++...++++-.| .-+|.|||+|+| |--...-.+ T Consensus 105 r~~~-td~~~~r~GFHLPP--f~SV~HLHlH~I~P~~DMgf~s 144 (166) T KOG4359 105 RNNF-TDFTNVRMGFHLPP--FCSVSHLHLHVIAPVDDMGFLS 144 (166) T ss_pred HHHC-CCCHHEEEECCCCC--CCEEEEEEEEEEECHHHHCHHH T ss_conf 8606-77145067535797--3203466676650635514123 No 17 >pfam04677 CwfJ_C_1 Protein similar to CwfJ C-terminus 1. This region is found in the N terminus of Schizosaccharomyces pombe protein CwfJ. CwfJ is part of the Cdc5p complex involved in mRNA splicing. Probab=99.07 E-value=8.4e-10 Score=82.35 Aligned_cols=102 Identities=23% Similarity=0.321 Sum_probs=76.5 Q ss_pred CCCCCHHHHHHCCCCCCC-EEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 589765454446888731-7877890999941788898229999415542134402788898999999730445531352 Q gi|254780780|r 9 YDNQNIFIKIIRNETNAC-RVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQA 87 (155) Q Consensus 9 ~dd~CiFC~Ii~ge~p~~-~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~ 87 (155) -=|+|-||- .+-+.+.+ +|.-++.++.-++..|..+||+||||.+|++++.+++++++.++.+..+.+. +.+.. T Consensus 10 ~~d~C~fCl-snp~~~khlIisiG~~~YLalpkgpL~~gH~lIvPi~H~~s~~~lded~~~Ei~~f~~~L~----~mf~~ 84 (122) T pfam04677 10 LPDSCWFCL-SNPNLEKHLIVSIGNKAYLALPKGPLVSGHCLIIPIQHIPSTLSLDEEVWDEIRNFRKALT----LMYKS 84 (122) T ss_pred CCCCCCCCC-CCCCCCCCEEEEECCEEEEEECCCCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHH----HHHHH T ss_conf 757687701-6998553279998361699955887788848996333215403499899999999999999----99997 Q ss_pred CCEEEEEECCCCCCCEEEEEEEEEEECCCC Q ss_conf 666999825756686310106999952379 Q gi|254780780|r 88 DGIQILQFNGHAAGQTVPHLHFHVIPCKNG 117 (155) Q Consensus 88 ~~~ni~~~~g~~~gq~V~HlH~HiiPR~~~ 117 (155) .|-.+++.. .+-+..+|.|+++||-... T Consensus 85 ~~~~~vf~E--~~~~~~~H~~iq~vPvP~~ 112 (122) T pfam04677 85 QGKDAVFFE--IASQRRPHLHIQCIPVPKS 112 (122) T ss_pred CCCCEEEEE--ECCCCCCEEEEEEEECCHH T ss_conf 699689999--7478981589999973989 No 18 >KOG2958 consensus Probab=98.97 E-value=2.1e-09 Score=79.83 Aligned_cols=142 Identities=16% Similarity=0.213 Sum_probs=100.8 Q ss_pred CCCCC---CCCCHHHHHHCCCC--CCCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 65665---89765454446888--73178778909999417888982299994155421344027888989999997304 Q gi|254780780|r 5 SSTHY---DNQNIFIKIIRNET--NACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAI 79 (155) Q Consensus 5 s~~~~---dd~CiFC~Ii~ge~--p~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~ 79 (155) |.+.| -..|++-+.++.|. -.++|.|+++|+++....-..|..+|+|||+|+..|.+|++.+..+|..+.+.+.. T Consensus 190 s~kkyfe~hgk~ll~dy~~~E~l~Kervv~enehfivvvPywA~wPfEtllipk~h~~~~~~l~~~~k~dLasiLK~ll~ 269 (354) T KOG2958 190 SQKKYFEEHGKCLLMDYVKQEALEKERVVVENEHFIVVVPYWATWPFETLLIPKRHVSRFHELDEVEKVDLASILKLLLI 269 (354) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHCEEEEECCCEEEEEEHHHCCCCEEEEECHHHHHHHCCCCHHHHHHHHHHHHHHHH T ss_conf 27889987097367888888875402588514735998453323764046503666521000126777669999999999 Q ss_pred HHHHHCCC-CCEEEEEECCCCCC--C--EEEEEEEEEEE---CCCCCCCCCCCCCCCC-CCCCHHHHHHHHHHHHHH Q ss_conf 45531352-66699982575668--6--31010699995---2379976534557766-688989999999999999 Q gi|254780780|r 80 ACKSAFQA-DGIQILQFNGHAAG--Q--TVPHLHFHVIP---CKNGDNASHTNIHPTQ-KIENFAKLEINAQKIRKE 147 (155) Q Consensus 80 al~~~~~~-~~~ni~~~~g~~~g--q--~V~HlH~HiiP---R~~~d~~~~~~~~~~~-~~~~~~~l~~~a~~lr~~ 147 (155) ...+.|+. --++++++..+..+ | .-.++|+|..| |..+-..+..++.-.. +..| -..|..|++||+- T Consensus 270 KydnlfetsfPYsmg~h~aPl~~t~~e~~n~W~h~hFyppllrsatV~kF~vG~e~l~epqrd-ltpEqaAk~lrel 345 (354) T KOG2958 270 KYDNLFETSFPYSMGIHGAPLGSTEQENYNHWLHMHFYPPLLRSATVRKFLVGYEMLAEPQRD-LTPEQAAKRLREL 345 (354) T ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCHHHCCCCCEEECHHHHCCCCCC-CCHHHHHHHHHHC T ss_conf 998751367861112145876653456521233331166501213635225414231375345-7989999999854 No 19 >KOG2477 consensus Probab=98.02 E-value=2.9e-05 Score=54.02 Aligned_cols=103 Identities=13% Similarity=0.135 Sum_probs=70.3 Q ss_pred CCCCHHHHHHCCCCCC-CEEEECCEEEEEE-CCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 8976545444688873-1787789099994-1788898229999415542134402788898999999730445531352 Q gi|254780780|r 10 DNQNIFIKIIRNETNA-CRVYEDDILLAIM-DIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQA 87 (155) Q Consensus 10 dd~CiFC~Ii~ge~p~-~~v~E~d~~~afl-d~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~ 87 (155) =|+|.||= -+-..|- .+|-=.+..+.-| ...|...|||+|+|-.|+..-..|++++|.++-+..+-++ ..|.. T Consensus 406 lD~C~rCf-ds~klpkhlviSlg~~tYLsLp~~~gL~~gHciIvptqH~~~t~slDEdvWDEIrnfrKcL~----~Mfas 480 (628) T KOG2477 406 LDTCPRCF-DSEKLPKHLVISLGHRTYLSLPTQPGLAKGHCIIVPTQHRINTLSLDEDVWDEIRNFRKCLA----LMFAS 480 (628) T ss_pred HHHCHHHH-CCCCCCCCEEEEECCCEEEECCCCCCCCCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHH----HHHHH T ss_conf 51035654-64446666057731440674355676556745884155533424156578999999999999----99986 Q ss_pred CCEEEEEECCCCCCCEEEEEEEEEEECCCC Q ss_conf 666999825756686310106999952379 Q gi|254780780|r 88 DGIQILQFNGHAAGQTVPHLHFHVIPCKNG 117 (155) Q Consensus 88 ~~~ni~~~~g~~~gq~V~HlH~HiiPR~~~ 117 (155) .+-.+++......=|.-||+-+|.||-... T Consensus 481 ~n~dviFyE~a~~l~rrpH~~IeCIPvpqe 510 (628) T KOG2477 481 MNLDVIFYENAPSLQRRPHTAIECIPVPQE 510 (628) T ss_pred CCCCEEEEECCCCCCCCCCEEEEEEECHHH T ss_conf 279858983367634577413787525077 No 20 >KOG2476 consensus Probab=97.99 E-value=4.1e-05 Score=53.06 Aligned_cols=103 Identities=19% Similarity=0.256 Sum_probs=70.2 Q ss_pred CCCCCCHHHHHHCCCCCCC-EEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 6589765454446888731-787789099994178889822999941554213440278889899999973044553135 Q gi|254780780|r 8 HYDNQNIFIKIIRNETNAC-RVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQ 86 (155) Q Consensus 8 ~~dd~CiFC~Ii~ge~p~~-~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~ 86 (155) ..-+.|-||- .+-+...+ +|-=++.+++-|...|.+.||+||+|..|++.+..|+++.+.++-..- .++++-+. T Consensus 316 ~~pg~CwFCL-SnP~vEkHLIVsIG~~~YlAlaKGpLs~~HvlIipi~H~p~~~~ls~ev~~Ei~kyk----aal~~myk 390 (528) T KOG2476 316 IPPGSCWFCL-SNPNVEKHLIVSIGNHFYLALAKGPLSSDHVLIIPIEHIPSLVPLSAEVTQEINKYK----AALRKMYK 390 (528) T ss_pred CCCCCEEEEE-CCCCHHHHEEEEECCEEEEEECCCCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHH----HHHHHHHH T ss_conf 8998617981-697755616788534447860578887781899871114444558989999999999----99999998 Q ss_pred CCCEEEEEECCCCCCCEEEEEEEEEEECCCC Q ss_conf 2666999825756686310106999952379 Q gi|254780780|r 87 ADGIQILQFNGHAAGQTVPHLHFHVIPCKNG 117 (155) Q Consensus 87 ~~~~ni~~~~g~~~gq~V~HlH~HiiPR~~~ 117 (155) ..|--.+... ....-.-|+|+-+||-.+. T Consensus 391 ~~g~~~vvfE--~~~~rs~Hlq~Qvipvpks 419 (528) T KOG2476 391 KQGKDAVVFE--RQSYRSVHLQLQVIPVPKS 419 (528) T ss_pred HCCCEEEEEE--EECCCCEEEEEEEEECCCH T ss_conf 5597289998--4024400157788753602 No 21 >COG1085 GalT Galactose-1-phosphate uridylyltransferase [Energy production and conversion] Probab=97.81 E-value=7.7e-05 Score=51.39 Aligned_cols=105 Identities=20% Similarity=0.220 Sum_probs=79.4 Q ss_pred CCCCCCHHHH------HHCCCCCCCEEEECCEEEEEECCCC------------CCCCEEEEE--ECCCCCHHHHHHHHHH Q ss_conf 6589765454------4468887317877890999941788------------898229999--4155421344027888 Q gi|254780780|r 8 HYDNQNIFIK------IIRNETNACRVYEDDILLAIMDIMP------------RNPGHVLII--PKSRIRDIFEAPPEIL 67 (155) Q Consensus 8 ~~dd~CiFC~------Ii~ge~p~~~v~E~d~~~afld~~P------------~~~GH~lVi--Pk~H~~~l~dL~~~~~ 67 (155) .||..|.||- +.+.+-....+++|++-.+-.|--. -..|++.|| ...|...+.+|+.++. T Consensus 37 ~~~~~CpfC~gn~~~t~~~~~~~~~~~~~N~fp~v~~d~p~~~~~~~~~~~~~~~~g~~~VIvesp~H~~~l~~~~~~~~ 116 (338) T COG1085 37 DHDPTCPFCPGNERTTEENPRYWHVRVFPNDFPAVSEDPPDAPGSEDPLFKIQEARGKSRVIVESPDHSKTLPELPVEEI 116 (338) T ss_pred CCCCCCCCCCCCCEECCCCCCCCCEEECCCCCHHCCCCCCCCCCCCCCHHCCCCCCCCEEEEEECCCCCCCCCCCCHHHH T ss_conf 25886770689641025679876426537745200688877766555101023557636999976765675555898999 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCEEEEEEEEEE Q ss_conf 989999997304455313526669998257566863101069999 Q gi|254780780|r 68 SQIAFLIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHVI 112 (155) Q Consensus 68 ~~l~~~~~~v~~al~~~~~~~~~ni~~~~g~~~gq~V~HlH~Hii 112 (155) .+++.+.+...+-|.+.-...-+.+..|.|..+|-+.+|.|.-|+ T Consensus 117 ~~vv~~~~e~~~~L~~~~~~~yV~iF~N~Gk~~G~S~~HPH~Qi~ 161 (338) T COG1085 117 EEVVKLWQERVRELYEREKYKYVQIFENKGKAAGASLPHPHGQIV 161 (338) T ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCCCEE T ss_conf 999999999999986526864586320058424766889875436 No 22 >cd00608 GalT Galactose-1-phosphate uridyl transferase (GalT): This enzyme plays a key role in galactose metabolism by catalysing the transfer of a uridine 5'-phosphoryl group from UDP-galactose 1-phosphate. The structure of E.coli GalT reveals that the enzyme contains two identical subunits. It also demonstrates that the active site is formed by amino acid residues from both subunits of the dimer. Probab=97.73 E-value=0.0002 Score=48.74 Aligned_cols=107 Identities=13% Similarity=0.113 Sum_probs=71.3 Q ss_pred CCCCCCCCHHHHHHCCCCCCCEEEECCEEEEEECCCCCC--------------------CCE--EEEEECCCCCHHHHHH Q ss_conf 566589765454446888731787789099994178889--------------------822--9999415542134402 Q gi|254780780|r 6 STHYDNQNIFIKIIRNETNACRVYEDDILLAIMDIMPRN--------------------PGH--VLIIPKSRIRDIFEAP 63 (155) Q Consensus 6 ~~~~dd~CiFC~Ii~ge~p~~~v~E~d~~~afld~~P~~--------------------~GH--~lViPk~H~~~l~dL~ 63 (155) .+.+|..|.||.=-.+ .++..--.+=.+.+|-+.+|.. -|. ++|-.-+|..++.+|+ T Consensus 33 ~~~~~~~CPfCpG~e~-~~~~~~~~~w~vrV~~N~fPal~~~~~~~~~~~~~l~~~~~a~G~hEVIi~sp~H~~~l~~l~ 111 (329) T cd00608 33 LPEYDPDCPLCPGNER-ADTGEQNPDYDVRVFENDFPALKPDAPAPEDSDDGLFRTAPARGRCEVICFSPDHNLTLAEMS 111 (329) T ss_pred CCCCCCCCCCCCCCCC-CCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCHHCCCCCCEEEEEEEECCCCCCCHHHCC T ss_conf 8777899997999877-788988999738998379965457888877677730004566603799997864236834589 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCEEEEEEEEEEE Q ss_conf 78889899999973044553135266699982575668631010699995 Q gi|254780780|r 64 PEILSQIAFLIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHVIP 113 (155) Q Consensus 64 ~~~~~~l~~~~~~v~~al~~~~~~~~~ni~~~~g~~~gq~V~HlH~HiiP 113 (155) .+++..++.+.+.-.+.+.+--+..-+.+.-|.|..+|-+..|.|--|+. T Consensus 112 ~~~i~~vl~~~~~R~~~L~~~~~i~yv~iFkN~G~~aGaSl~HpH~QI~A 161 (329) T cd00608 112 VAEIREVVEAWAERTRELGKNPRIKYVQIFENKGAEMGASLPHPHGQIWA 161 (329) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCCCCCHHEEE T ss_conf 99999999999999999862888518999830573456356687433143 No 23 >PRK11720 galactose-1-phosphate uridylyltransferase; Provisional Probab=97.58 E-value=0.00047 Score=46.47 Aligned_cols=106 Identities=16% Similarity=0.216 Sum_probs=69.4 Q ss_pred CCCCCCCCCCHHHHH---HCCCC-C---CCEEEECCEEEEEECCCCC-------------CCCE--EEEEECCCCCHHHH Q ss_conf 665665897654544---46888-7---3178778909999417888-------------9822--99994155421344 Q gi|254780780|r 4 KSSTHYDNQNIFIKI---IRNET-N---ACRVYEDDILLAIMDIMPR-------------NPGH--VLIIPKSRIRDIFE 61 (155) Q Consensus 4 ~s~~~~dd~CiFC~I---i~ge~-p---~~~v~E~d~~~afld~~P~-------------~~GH--~lViPk~H~~~l~d 61 (155) ++.+.+|.+|.||.= +.|+. | ...|++|++-.. ....|. .-|. ++|-.-+|..++.+ T Consensus 43 ~~~p~~d~~CPfCpGne~~~g~~~pd~~~~~Vf~N~fPAl-~~~~~~~~~~~~~l~~~~~a~G~hEVIiesp~H~~~l~~ 121 (339) T PRK11720 43 QVLPAYDPDCFLCPGNTRVTGDVNPDYTGTYVFTNDFAAL-MPDTPDAPESDDPLFRCQSARGTSRVICFSPDHSKTLPE 121 (339) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCC-CCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCHHH T ss_conf 6677778999889998566888799877349973898402-357877676788420146667227999979863479256 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCEEEEEEEEEE Q ss_conf 027888989999997304455313526669998257566863101069999 Q gi|254780780|r 62 APPEILSQIAFLIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHVI 112 (155) Q Consensus 62 L~~~~~~~l~~~~~~v~~al~~~~~~~~~ni~~~~g~~~gq~V~HlH~Hii 112 (155) |+.+++..++.+-+.-.+.+.+ ...-+.+.-|.|..+|-+.+|.|--|+ T Consensus 122 ls~~~i~~vi~~~~~R~~~L~~--~i~yv~iFkN~G~~aGaSl~HpHsQI~ 170 (339) T PRK11720 122 LSVAALREVVDTWQEQTAELGK--TYPWVQVFENKGAAMGCSNPHPHGQIW 170 (339) T ss_pred CCHHHHHHHHHHHHHHHHHHHC--CCCEEEEEECCCCCCCCCCCCCCCCEE T ss_conf 9999999999999999999844--366799874166335635778752023 No 24 >KOG0562 consensus Probab=97.30 E-value=0.00066 Score=45.57 Aligned_cols=85 Identities=21% Similarity=0.300 Sum_probs=57.2 Q ss_pred CCCEEEEC-CEEEEEECCCCCCCCEEEEEECC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC----CEEEEEECC Q ss_conf 73178778-90999941788898229999415-5421344027888989999997304455313526----669998257 Q gi|254780780|r 24 NACRVYED-DILLAIMDIMPRNPGHVLIIPKS-RIRDIFEAPPEILSQIAFLIKKIAIACKSAFQAD----GIQILQFNG 97 (155) Q Consensus 24 p~~~v~E~-d~~~afld~~P~~~GH~lViPk~-H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~----~~ni~~~~g 97 (155) |-+++-|. |.++++.|.+|.+..|+||.|++ -++.++++-.+.+. +....-.++......++.+ .|+++++.+ T Consensus 14 ~e~V~~es~d~vvvIrD~fPKa~~H~LvLpr~s~i~~l~~~~qe~l~-ll~~~h~~~~~~v~~~~~~~~~~~f~vG~Hav 92 (184) T KOG0562 14 PENVYIESPDDVVVIRDKFPKARMHLLVLPRRSSIDSLFSVVQEHLS-LLKEDHAVGPCWVDQLTNEALCNYFRVGFHAV 92 (184) T ss_pred CCEEECCCCCCEEEECCCCCCCEEEEEEECCCCHHHHHHHHHHHHHH-HHHHHHHCCCHHHHHHCCHHHHHHEEEEECCC T ss_conf 30133067661899825686210478984231044789999987766-76777632812888850366666423320157 Q ss_pred CCCCCEEEEEEEEEEE Q ss_conf 5668631010699995 Q gi|254780780|r 98 HAAGQTVPHLHFHVIP 113 (155) Q Consensus 98 ~~~gq~V~HlH~HiiP 113 (155) | +-.++|+|||- T Consensus 93 P----SM~~LHLHVIS 104 (184) T KOG0562 93 P----SMNNLHLHVIS 104 (184) T ss_pred C----CHHHEEEEEEE T ss_conf 5----42023678864 No 25 >TIGR00209 galT_1 galactose-1-phosphate uridylyltransferase; InterPro: IPR001937 Galactose-1-phosphate uridyl transferase 2.7.7.12 from EC (galT) catalyzes the transfer of an uridyldiphosphate group on galactose (or glucose) 1-phosphate. During the reaction, the uridyl moiety links to a histidine residue. In the Escherichia coli enzyme, it has been shown that two histidine residues separated by a single proline residue are essential for enzyme activity. On the basis of sequence similarities, two apparently unrelated families seem to exist. Class-I enzymes are found in eukaryotes as well as some bacteria such as E. coli or Streptomyces lividans, while class-II enzymes have been found so far only in bacteria such as Bacillus subtilis or Lactobacillus helveticus .; GO: 0008108 UDP-glucose:hexose-1-phosphate uridylyltransferase activity, 0006012 galactose metabolic process. Probab=97.09 E-value=0.00068 Score=45.46 Aligned_cols=134 Identities=16% Similarity=0.186 Sum_probs=91.8 Q ss_pred CHHHHHHCCCC--CCCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC- Q ss_conf 65454446888--731787789099994178889822999941554213440278889899999973044553135266- Q gi|254780780|r 13 NIFIKIIRNET--NACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADG- 89 (155) Q Consensus 13 CiFC~Ii~ge~--p~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~~- 89 (155) -+|...++.+. .+.++.|++.++++.......|..++++||.|+..+.++++....++..+.+++.......+...- T Consensus 212 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~p~w~~wp~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~d~~~~~~~p 291 (367) T TIGR00209 212 PLLVDYVKRELAEKSRVVVETEHWLAVVPYWAIWPFETLLLPKKHLLRLTELNDAQKEDLALILKKLTTKYDNLFETSFP 291 (367) T ss_pred HHHHHHHHHHHHCCCCEEEECCCEEEEECHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 03456676653102312331353256501012200234430135666666655666667999998865432001210243 Q ss_pred EEEEEECCCCC--CCEE--EEEEEEEEECCCCCC---CCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 69998257566--8631--010699995237997---653455776668898999999999999 Q gi|254780780|r 90 IQILQFNGHAA--GQTV--PHLHFHVIPCKNGDN---ASHTNIHPTQKIENFAKLEINAQKIRK 146 (155) Q Consensus 90 ~ni~~~~g~~~--gq~V--~HlH~HiiPR~~~d~---~~~~~~~~~~~~~~~~~l~~~a~~lr~ 146 (155) +.++.+..+.. |... ..+|.|..|-.-... ....++...+.....-..+..++++|. T Consensus 292 ~~~g~~~~p~~~~g~~~~~~~~~~~~~pp~~~~~~~~~~~~g~~~~~~~~~d~~~~~~~~~~~~ 355 (367) T TIGR00209 292 YSLGWHGAPFNATGEENEHWWLHAHFYPPLLRSATVRKFLVGYELLGEPQRDLTAEQAAERLRA 355 (367) T ss_pred CCCCCCCCCCCCCCCCCCCCEEHHHHCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 0024333555555542100000011023234444566676534553253012236678888764 No 26 >PRK05471 CDP-diacylglycerol pyrophosphatase; Provisional Probab=97.07 E-value=0.0022 Score=42.33 Aligned_cols=99 Identities=14% Similarity=0.156 Sum_probs=73.8 Q ss_pred CCCCHHHHHHCCC-CCCCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHH--H----HHHHHHHHHHHHHHHHHHHH Q ss_conf 8976545444688-873178778909999417888982299994155421344--0----27888989999997304455 Q gi|254780780|r 10 DNQNIFIKIIRNE-TNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFE--A----PPEILSQIAFLIKKIAIACK 82 (155) Q Consensus 10 dd~CiFC~Ii~ge-~p~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~d--L----~~~~~~~l~~~~~~v~~al~ 82 (155) .+.|+.-+-..|. .|...|-+++.+++|.|+ .-|.|.|++|-.-++.+++ | ++..+..-+..-..+++.+. T Consensus 37 ~~qCvp~~~~~~~p~PC~~Vd~~~GyvvlKD~--~Gp~qyLLiPt~rIsGIEsP~Ll~~~tpnyf~~AW~aR~~vs~~~g 114 (252) T PRK05471 37 LEQCLPNQQQNQNPAPCAEVDPQAGYVVLKDR--NGPLQYLLMPTYRISGIESPLLLEPSTPNYFALAWQARDFMSKKYG 114 (252) T ss_pred HHHCCCCCCCCCCCCCCEEECCCCCEEEEECC--CCCCEEEEEECCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHC T ss_conf 87527740047999997278067885999669--9983389962465467538523288874189999997589999818 Q ss_pred HHCCCCCEEEEEECCCCCCCEEEEEEEEEE Q ss_conf 313526669998257566863101069999 Q gi|254780780|r 83 SAFQADGIQILQFNGHAAGQTVPHLHFHVI 112 (155) Q Consensus 83 ~~~~~~~~ni~~~~g~~~gq~V~HlH~Hii 112 (155) +.+..+.+.+.+|. ..|-+-.|||+||= T Consensus 115 ~~ipd~~lsLAINS--~~gRSQnQLHIHIs 142 (252) T PRK05471 115 QPIPDSAVSLAINS--RYGRSQDQLHIHIS 142 (252) T ss_pred CCCCHHHEEEEECC--CCCCCCCCEEEEHH T ss_conf 99986783676416--76757033276501 No 27 >pfam02611 CDH CDP-diacylglycerol pyrophosphatase. This is a family of CDP-diacylglycerol pyrophosphatases, EC:3.6.1.26. This enzyme catalyses the reaction CDP-diacylglycerol + H2O <= CMP + phosphatidate. Probab=96.92 E-value=0.0041 Score=40.63 Aligned_cols=99 Identities=13% Similarity=0.173 Sum_probs=72.2 Q ss_pred CCCCHHHHHHCCC-CCCCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHH--H----HHHHHHHHHHHHHHHHHHHH Q ss_conf 8976545444688-873178778909999417888982299994155421344--0----27888989999997304455 Q gi|254780780|r 10 DNQNIFIKIIRNE-TNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFE--A----PPEILSQIAFLIKKIAIACK 82 (155) Q Consensus 10 dd~CiFC~Ii~ge-~p~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~d--L----~~~~~~~l~~~~~~v~~al~ 82 (155) ++.|+.-+-..+. .|...|-.+..+++|.|+ .-|-|.|++|-..++.+++ | ++..+..-+..-..++..+. T Consensus 37 ~~qCvp~~~~~~~p~pC~~Vd~~~gyvv~KD~--~Gp~qyLLmPT~rIsGIEsP~Ll~~~tpnyf~~AW~aR~~v~~~~g 114 (249) T pfam02611 37 HQRCLPNQQATGAPAPCAEVDLQAGYVLLKDR--NGPLQYLLIPTDRVTGIESPLLLEPATPNYFAAAWAARGYVSKKLG 114 (249) T ss_pred HHHCCCCHHCCCCCCCCEEECCCCCEEEEECC--CCCCEEEEEECCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHC T ss_conf 76428803235999997278267885999669--9983279852465577548524288985289999987279999708 Q ss_pred HHCCCCCEEEEEECCCCCCCEEEEEEEEEE Q ss_conf 313526669998257566863101069999 Q gi|254780780|r 83 SAFQADGIQILQFNGHAAGQTVPHLHFHVI 112 (155) Q Consensus 83 ~~~~~~~~ni~~~~g~~~gq~V~HlH~Hii 112 (155) +.+..+.+.+.+|. ..|-+-.|||+||= T Consensus 115 ~~ipd~~lsLaINS--~~gRSQnQLHIHIs 142 (249) T pfam02611 115 QPLPDNALSLAINS--RYGRSQDQLHIHIS 142 (249) T ss_pred CCCCHHHEEEEECC--CCCCCCCCEEEEHH T ss_conf 98982463677516--77766033164522 No 28 >COG4360 APA2 ATP adenylyltransferase (5',5'''-P-1,P-4-tetraphosphate phosphorylase II) [Nucleotide transport and metabolism] Probab=96.69 E-value=0.0023 Score=42.12 Aligned_cols=74 Identities=18% Similarity=0.262 Sum_probs=52.1 Q ss_pred CEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCEEEEEEEEE Q ss_conf 90999941788898229999415542134402788898999999730445531352666999825756686310106999 Q gi|254780780|r 32 DILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHV 111 (155) Q Consensus 32 d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~~~ni~~~~g~~~gq~V~HlH~Hi 111 (155) +....+++..|+.+.|+|+|.++--..-.-|+..++...+.+. .+-+| -+.+|.|+.+|-+-+|=|+-+ T Consensus 92 ~th~~llNKF~VVdeHlLiVTrefedQ~s~LTl~Df~ta~~vL----------~~ldg-lvFYNsGp~aGaSq~HkHLQi 160 (298) T COG4360 92 DTHKLLLNKFPVVDEHLLIVTREFEDQESALTLADFTTAYAVL----------CGLDG-LVFYNSGPIAGASQDHKHLQI 160 (298) T ss_pred HHHHHHHHCCCCCCCEEEEEEHHHHHCCCCCCHHHHHHHHHHH----------HCCCC-EEEECCCCCCCCCCCCCCEEE T ss_conf 6576565168766643578640322012657888999999998----------61466-378657877676777662067 Q ss_pred EECCC Q ss_conf 95237 Q gi|254780780|r 112 IPCKN 116 (155) Q Consensus 112 iPR~~ 116 (155) +|..- T Consensus 161 ~pmPf 165 (298) T COG4360 161 VPMPF 165 (298) T ss_pred EECCC T ss_conf 60366 No 29 >KOG3969 consensus Probab=95.98 E-value=0.16 Score=30.66 Aligned_cols=93 Identities=22% Similarity=0.309 Sum_probs=61.8 Q ss_pred CCCCCCEEEECCE----EEEEECCC--CC--CCCEEEEEECCC-CCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-- Q ss_conf 8887317877890----99994178--88--982299994155-4213440278889899999973044553135266-- Q gi|254780780|r 21 NETNACRVYEDDI----LLAIMDIM--PR--NPGHVLIIPKSR-IRDIFEAPPEILSQIAFLIKKIAIACKSAFQADG-- 89 (155) Q Consensus 21 ge~p~~~v~E~d~----~~afld~~--P~--~~GH~lViPk~H-~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~~-- 89 (155) +...-++||||.. |+.+-|.. +. -.=++|+|.+++ +.++-||..+.+.-|.++-.+...++...++.+. T Consensus 156 ~aE~driV~ed~d~~nGFillPDlKWdgqtld~LyllaIvhr~dikSiRDL~~~h~~lL~n~r~k~~~~i~~~y~v~~dq 235 (310) T KOG3969 156 KAEDDRIVYEDPDPENGFILLPDLKWDGQTLDSLYLLAIVHRRDIKSIRDLRPSHLQLLRNIRNKSREAIPQRYGVDPDQ 235 (310) T ss_pred CCCCCCEEEECCCCCCCEEECCCCCCCCCCCCCEEEEEEEECCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCHH T ss_conf 23213158756887677497534563842201026899994278634640887788999999999999878872888006 Q ss_pred EEEEEECCCCCCCEEEEEEEEEEECCCC Q ss_conf 6999825756686310106999952379 Q gi|254780780|r 90 IQILQFNGHAAGQTVPHLHFHVIPCKNG 117 (155) Q Consensus 90 ~ni~~~~g~~~gq~V~HlH~HiiPR~~~ 117 (155) ..+.++=. -+..|+|+|+++-... T Consensus 236 lrmf~HYq----PSyYHlHVHi~nik~~ 259 (310) T KOG3969 236 LRMFFHYQ----PSYYHLHVHIVNIKHD 259 (310) T ss_pred EEEEEEEC----CCEEEEEEEEEECCCC T ss_conf 79999825----7247999998732577 No 30 >KOG2720 consensus Probab=95.23 E-value=0.0098 Score=38.26 Aligned_cols=74 Identities=22% Similarity=0.242 Sum_probs=40.8 Q ss_pred EECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCEEEEEE Q ss_conf 77890999941788898229999415542134402788898999999730445531352666999825756686310106 Q gi|254780780|r 29 YEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLH 108 (155) Q Consensus 29 ~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~~~ni~~~~g~~~gq~V~HlH 108 (155) -|+.. +|..+..|+..||+|+||+----.-.-++ ..++.-++. +..-.+.+-+.+++|. .-+..+|.||| T Consensus 164 pe~~~-vvaIN~sPie~~H~LiiP~V~kc~pQrit---~~al~lav~-----~m~~~dd~~frlgyNS-lga~AsVNHLH 233 (431) T KOG2720 164 PENSP-VVAINVSPIEYGHVLIIPRVLKCLPQRIT---HKALLLAVT-----MMAEADDPYFRLGYNS-LGAFASVNHLH 233 (431) T ss_pred CCCCC-EEEEECCCCCCCCEEEECCHHCCCCCEEE---HHHHHHHHH-----HHHHCCCCHHHEECCC-CHHHHHHHHHH T ss_conf 10082-37886575445717980502036720010---899999999-----9986288234000356-10221020454 Q ss_pred EEEE Q ss_conf 9999 Q gi|254780780|r 109 FHVI 112 (155) Q Consensus 109 ~Hii 112 (155) +|.. T Consensus 234 fha~ 237 (431) T KOG2720 234 FHAY 237 (431) T ss_pred HHHH T ss_conf 5555 No 31 >COG2134 Cdh CDP-diacylglycerol pyrophosphatase [Lipid metabolism] Probab=95.07 E-value=0.13 Score=31.18 Aligned_cols=98 Identities=14% Similarity=0.142 Sum_probs=61.5 Q ss_pred CCCHHHHHHCCC-CCCCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHH--HH----HHHHHHHHHHHHHHHHHHHH Q ss_conf 976545444688-873178778909999417888982299994155421344--02----78889899999973044553 Q gi|254780780|r 11 NQNIFIKIIRNE-TNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFE--AP----PEILSQIAFLIKKIAIACKS 83 (155) Q Consensus 11 d~CiFC~Ii~ge-~p~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~d--L~----~~~~~~l~~~~~~v~~al~~ 83 (155) |.|++-.--.+. .|+..|-+.....+|.|.+ -|-.+|++|-.++..+.+ |- +..+-.-|..--.+++.+.. T Consensus 40 e~C~p~~~~~~~paPCaeV~~~AG~av~Kd~~--gPlQyLLmPt~rItGiE~P~L~e~atpNyf~~AWqAR~fms~kyg~ 117 (252) T COG2134 40 ERCLPNYQQNQNPAPCAEVKPQAGYAVLKDRN--GPLQYLLMPTARITGIESPLLLEPATPNYFYLAWQARDFMSKKYGN 117 (252) T ss_pred HHCCCCHHCCCCCCCCEEECCCCCEEEEECCC--CCCEEEEEEEECCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHCC T ss_conf 87076610159999863645877648881368--9701577300114677673440778864899999888899997189 Q ss_pred HCCCCCEEEEEECCCCCCCEEEEEEEEEE Q ss_conf 13526669998257566863101069999 Q gi|254780780|r 84 AFQADGIQILQFNGHAAGQTVPHLHFHVI 112 (155) Q Consensus 84 ~~~~~~~ni~~~~g~~~gq~V~HlH~Hii 112 (155) -+.-..+.+.+|. .-|-+-.|+|+||= T Consensus 118 ~ipd~dvsLaINs--k~gRtQdqlHIHIS 144 (252) T COG2134 118 PIPDSDVSLAINS--KNGRTQDQLHIHIS 144 (252) T ss_pred CCCCCCEEEEECC--CCCCCCCCEEEEEE T ss_conf 9996012888647--66865431589998 No 32 >KOG2958 consensus Probab=90.94 E-value=0.9 Score=26.00 Aligned_cols=108 Identities=17% Similarity=0.092 Sum_probs=68.3 Q ss_pred CCCCCCCCCCHHHH--H-HCCC----CCCCEEEECCEEEEEEC--CCC-----------CCCCEEEEEEC--CCCCHHHH Q ss_conf 66566589765454--4-4688----87317877890999941--788-----------89822999941--55421344 Q gi|254780780|r 4 KSSTHYDNQNIFIK--I-IRNE----TNACRVYEDDILLAIMD--IMP-----------RNPGHVLIIPK--SRIRDIFE 61 (155) Q Consensus 4 ~s~~~~dd~CiFC~--I-i~ge----~p~~~v~E~d~~~afld--~~P-----------~~~GH~lViPk--~H~~~l~d 61 (155) .+.+-||..|.||- | +.|. -.++-|++||+-..=.| ..+ ...|-+-||.- -|--++.+ T Consensus 45 ~~~p~~dp~cplcpG~~ra~g~~np~ydstyvf~NdypA~~~d~p~~~~~~~~~lfk~~~v~G~c~Vicf~Pnh~ltLp~ 124 (354) T KOG2958 45 NTTPSYDPLCPLCPGNIRATGFRNPDYDSTYVFDNDYPALRRDQPTQGQDESTGLFKTISVKGVCKVICFSPNHNLTLPL 124 (354) T ss_pred CCCCCCCCCCCCCCCCCHHCCCCCCCCCCCEECCCCCHHHCCCCCCCCCCCCCCCHHHEEECCEEEEEEECCCCCCCCCC T ss_conf 99876788777788850201443788766324458762205677777999875301102312206999967765551024 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCEEEEEEEEE Q ss_conf 02788898999999730445531352666999825756686310106999 Q gi|254780780|r 62 APPEILSQIAFLIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHV 111 (155) Q Consensus 62 L~~~~~~~l~~~~~~v~~al~~~~~~~~~ni~~~~g~~~gq~V~HlH~Hi 111 (155) ++..+..++...=+++..-|.+--...-+.|.-|.|.++|-+-+|-|=-+ T Consensus 125 m~~~~i~~vv~aw~~~~~~l~~h~~y~yvQIFeNkGa~mGcSn~HpHgQ~ 174 (354) T KOG2958 125 MDVVEIRDVVDAWKKLYNELGQHDSYKYVQIFENKGAAMGCSNPHPHGQA 174 (354) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCCE T ss_conf 87899999999999999974246884405654047741356899963001 No 33 >pfam01076 Mob_Pre Plasmid recombination enzyme. With some plasmids, recombination can occur in a site specific manner that is independent of RecA. In such cases, the recombination event requires another protein called Pre. Pre is a plasmid recombination enzyme. This protein is also known as Mob (conjugative mobilisation). Probab=88.35 E-value=2.4 Score=23.34 Aligned_cols=81 Identities=12% Similarity=0.101 Sum_probs=47.1 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCE-EEEEECCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCHH Q ss_conf 2134402788898999999730445531352666-999825756686310106999952379976534557766688989 Q gi|254780780|r 57 RDIFEAPPEILSQIAFLIKKIAIACKSAFQADGI-QILQFNGHAAGQTVPHLHFHVIPCKNGDNASHTNIHPTQKIENFA 135 (155) Q Consensus 57 ~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~~~-ni~~~~g~~~gq~V~HlH~HiiPR~~~d~~~~~~~~~~~~~~~~~ 135 (155) +.+.+++++...+++. ...+.+.+.++.+.+ +...+. .-+.||+|+-++|..++.-+..-. -..... T Consensus 90 e~f~~~~~e~~~~~~~---~~~~~l~~~~G~~nv~~a~~H~----DE~tPH~H~~~vP~~~~~rl~ak~-----~~~~r~ 157 (196) T pfam01076 90 DFFKNLTYEEQKEFFE---TAFKFFEERYGKENILYAVVHL----DEATPHMHMGVVPLTEDGRLSAKA-----IFGNRK 157 (196) T ss_pred HHHHHCCHHHHHHHHH---HHHHHHHHHCCHHHEEEEEEEC----CCCCCCEEEEEEEECCCCCCCHHH-----HHCCHH T ss_conf 9982389999999999---9999999865811289999854----789983379999864776400777-----727799 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 99999999999999 Q gi|254780780|r 136 KLEINAQKIRKELQ 149 (155) Q Consensus 136 ~l~~~a~~lr~~l~ 149 (155) +|........+.+. T Consensus 158 ~L~~~Qd~~~~~~~ 171 (196) T pfam01076 158 ELTQLQDRFPEYVG 171 (196) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999999999999 No 34 >COG5075 Uncharacterized conserved protein [Function unknown] Probab=87.37 E-value=2.1 Score=23.76 Aligned_cols=90 Identities=20% Similarity=0.244 Sum_probs=53.6 Q ss_pred CCCCCCEEEECCEEEE----EECCC--CCC--CCEEEEEECC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC--CC Q ss_conf 8887317877890999----94178--889--8229999415-542134402788898999999730445531352--66 Q gi|254780780|r 21 NETNACRVYEDDILLA----IMDIM--PRN--PGHVLIIPKS-RIRDIFEAPPEILSQIAFLIKKIAIACKSAFQA--DG 89 (155) Q Consensus 21 ge~p~~~v~E~d~~~a----fld~~--P~~--~GH~lViPk~-H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~--~~ 89 (155) |.+.-++||||....- +-|.. |.+ .=|.++|-++ -+.++.||......-+.++-.++...+..-|+. +. T Consensus 151 ~ae~erivyed~~~~ngfiiiPD~KWd~qt~dsL~l~aIv~~~diktiRDlr~~~i~~l~rl~~kiltevp~~f~vd~n~ 230 (305) T COG5075 151 GAENERIVYEDESVINGFIIIPDMKWDGQTVDSLYLVAIVYRTDIKTIRDLRYYHILWLIRLNNKILTEVPYQFGVDPNE 230 (305) T ss_pred CCCCCEEEECCCCCCCCCEECCCCCCCCCCEEEEEEEEEEECCCCHHHHHCCHHHHHHHHHHCCCCEEECCHHCCCCHHH T ss_conf 34564148517440157165566554862110046899995388504542765540587763044107442230748448 Q ss_pred EEEEEECCCCCCCEEEEEEEEEEEC Q ss_conf 6999825756686310106999952 Q gi|254780780|r 90 IQILQFNGHAAGQTVPHLHFHVIPC 114 (155) Q Consensus 90 ~ni~~~~g~~~gq~V~HlH~HiiPR 114 (155) ..+..+-. -+..|+|+||+-- T Consensus 231 l~mfvHY~----PsYyhlHvHI~nI 251 (305) T COG5075 231 LRMFVHYQ----PSYYHLHVHIVNI 251 (305) T ss_pred EEEEEEEC----CCEEEEEEEEEEE T ss_conf 47999844----6247999999861 No 35 >TIGR02705 nudix_YtkD nucleoside triphosphatase YtkD; InterPro: IPR014078 The proteins in this entry belong to the nudix family and share some sequence identity with Escherichia coli MutT but appear not to be functionally interchangeable with it. The functional assignment of the proteins in this family is contentious. Reference challenges the findings of reference , both in interpretation and in enzyme assay results.. Probab=70.71 E-value=4.3 Score=21.76 Aligned_cols=23 Identities=35% Similarity=0.690 Sum_probs=12.7 Q ss_pred ECCCC---CCCCEEEEEECCCCCHHH Q ss_conf 41788---898229999415542134 Q gi|254780780|r 38 MDIMP---RNPGHVLIIPKSRIRDIF 60 (155) Q Consensus 38 ld~~P---~~~GH~lViPk~H~~~l~ 60 (155) ....| ..|+||||||+---..|. T Consensus 15 F~~~~rf~~~p~HVlvip~~~~qWLl 40 (158) T TIGR02705 15 FEKEPRFSPNPNHVLVIPRYKDQWLL 40 (158) T ss_pred ECCCCCCCCCCCCEEEEEEECCEEEE T ss_conf 65788748896956897556893344 No 36 >TIGR00672 cdh CDP-diacylglycerol pyrophosphatase; InterPro: IPR015993 The CDP-diacylglycerol pyrophosphatases 3.6.1.26 from EC play a role in the regulation of phospholipid metabolism by inositol, as well as regulating the cellular levels of phosphatidylinositol . This entry is specific for the proteobacterial enzymes.. Probab=70.19 E-value=4.8 Score=21.47 Aligned_cols=87 Identities=14% Similarity=0.199 Sum_probs=60.4 Q ss_pred CCCCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHH--HHHHHHHHHHHHHHHHHHHHHHHCCCC----CEEEEEEC Q ss_conf 873178778909999417888982299994155421344--027888989999997304455313526----66999825 Q gi|254780780|r 23 TNACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFE--APPEILSQIAFLIKKIAIACKSAFQAD----GIQILQFN 96 (155) Q Consensus 23 ~p~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~d--L~~~~~~~l~~~~~~v~~al~~~~~~~----~~ni~~~~ 96 (155) .|+.-|--+...+++.|.+ .|-.+|+.|..++..... |.+...-.++.+++....-+-+.++-+ .+.+.+|. T Consensus 53 ~PC~evkP~aGyvvlkd~~--GPlqyllmPt~~i~G~esPll~~P~tPnff~l~Wqar~fmskkyG~~~Pd~a~sl~ins 130 (251) T TIGR00672 53 SPCAEVKPNAGYVVLKDLN--GPLQYLLMPTYRINGTESPLLLDPSTPNFFWLAWQARDFMSKKYGQPVPDRAVSLAINS 130 (251) T ss_pred CCCEEECCCCCEEEEECCC--CCHHHHCCCCCEECCCCCCCCCCCCCCCCEEHHHHHHHHHHHHCCCCCCCCCEEEEEEC T ss_conf 8750234787758984077--85111012431114544641027898621001345655667651886773002455412 Q ss_pred CCCCCCEEEEEEEEEEE Q ss_conf 75668631010699995 Q gi|254780780|r 97 GHAAGQTVPHLHFHVIP 113 (155) Q Consensus 97 g~~~gq~V~HlH~HiiP 113 (155) ..|.+-.|+|+||-- T Consensus 131 --~~Gr~q~h~hihisC 145 (251) T TIGR00672 131 --RTGRSQNHFHIHISC 145 (251) T ss_pred --CCCCCCCCEEEEEEE T ss_conf --468743403788774 No 37 >smart00537 DCX Domain in the Doublecortin (DCX) gene product. Tandemly-repeated domain in doublin, the Doublecortin gene product. Proposed to bind tubulin. Doublecortin (DCX) is mutated in human X-linked neuronal migration defects. Probab=66.78 E-value=11 Score=19.23 Aligned_cols=56 Identities=11% Similarity=0.166 Sum_probs=36.7 Q ss_pred EEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHH-HHHHHHHHHCCCCCEEE Q ss_conf 0999941788898229999415542134402788898999999-73044553135266699 Q gi|254780780|r 33 ILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIK-KIAIACKSAFQADGIQI 92 (155) Q Consensus 33 ~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~-~v~~al~~~~~~~~~ni 92 (155) .+.+|-+-.|..+|..++|.++.+.++..|= .++...+. ....++++.+.++|--| T Consensus 7 ~I~vyrNGD~~~~g~~~~v~~r~~~sfd~lL----~~lT~~~~l~~~~~Vr~lyt~~G~~V 63 (89) T smart00537 7 RIRFYRNGDRFFKGVRLVVNRKRFKSFEALL----QDLTEVVKLDLPHGVRKLYTLDGKKV 63 (89) T ss_pred EEEEEECCCCCCCCEEEEECCCCCCCHHHHH----HHHHHHHCCCCCCCEEEEECCCCCEE T ss_conf 9999979998999889998863217799999----99887541658876248997999891 No 38 >cd01617 DCX DCX The ubiquitin-like DCX domain is present in tandem within the N-terminal half of the doublecortin protein. Doublecortin is expressed in migrating neurons. Mutations in the gene encoding doublecortin cause lissencephaly in males and 'double-cortex syndrome' in females. Probab=63.86 E-value=8.6 Score=19.90 Aligned_cols=58 Identities=14% Similarity=0.148 Sum_probs=39.9 Q ss_pred EEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEE Q ss_conf 09999417888982299994155421344027888989999997304455313526669998 Q gi|254780780|r 33 ILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADGIQILQ 94 (155) Q Consensus 33 ~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~~~ni~~ 94 (155) .+.+|-+-+|..+|-.++|.++.+.++..| +.++...++.-..++++.+.++|-..+. T Consensus 2 ~i~~~rNGD~~~~g~~~~v~~~~~~tfd~l----L~~lT~~l~l~~g~VR~lyt~dG~~~v~ 59 (80) T cd01617 2 RVVVYRNGDPFFKGVRLLVNRRRFKSFDAL----LDDLTEKVQLDPGAVRKLYTLDGGHRVS 59 (80) T ss_pred EEEEEECCCCCCCCEEEEECHHHHCCHHHH----HHHHHHHCCCCCCCCEEEEECCCCEECC T ss_conf 799997899899989999890464779999----9999853268898626999289981737 No 39 >pfam03432 Relaxase Relaxase/Mobilisation nuclease domain. Relaxases/mobilisation proteins are required for the horizontal transfer of genetic information contained on plasmids that occurs during bacterial conjugation. The relaxase, in conjunction with several auxiliary proteins, forms the relaxation complex or relaxosome. Relaxases nick duplex DNA in a specific manner by catalysing trans-esterification. Probab=58.52 E-value=17 Score=18.07 Aligned_cols=39 Identities=15% Similarity=0.139 Sum_probs=20.4 Q ss_pred HHHHHHHHHCCCCCEEEEEECCCCCCCEEEEEEEEEE-ECCCCC Q ss_conf 7304455313526669998257566863101069999-523799 Q gi|254780780|r 76 KIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHVI-PCKNGD 118 (155) Q Consensus 76 ~v~~al~~~~~~~~~ni~~~~g~~~gq~V~HlH~Hii-PR~~~d 118 (155) .++..+.+.++..++.++.- ....-.|+|+||| .|-.-+ T Consensus 75 ~i~~~~~~~~~~~~~q~~~~----~H~D~~h~H~Hiv~N~v~~~ 114 (240) T pfam03432 75 EIGREFAKKLGPGNHQYAVA----THTDKDHLHNHIVINRVDLE 114 (240) T ss_pred HHHHHHHHHHCCCCCEEEEE----EECCCCCCEEEEEEEECCCC T ss_conf 99999999857888759999----98789996048998401526 No 40 >TIGR03279 cyano_FeS_chp putative FeS-containing Cyanobacterial-specific oxidoreductase. Members of this protein family are predicted FeS-containing oxidoreductases of unknown function, apparently restricted to and universal across the Cyanobacteria. The high trusted cutoff score for this model, 700 bits, excludes homologs from other lineages. This exclusion seems justified because a significant number of sequence positions are simultaneously unique to and invariant across the Cyanobacteria, suggesting a specialized, conserved function, perhaps related to photosynthesis. A distantly related protein family, TIGR03278, in universal in and restricted to archaeal methanogens, and may be linked to methanogenesis. Probab=54.44 E-value=12 Score=18.94 Aligned_cols=29 Identities=21% Similarity=0.279 Sum_probs=18.1 Q ss_pred CCCCHHHHH---HCCCCCCCEEEECCEEEEEE Q ss_conf 897654544---46888731787789099994 Q gi|254780780|r 10 DNQNIFIKI---IRNETNACRVYEDDILLAIM 38 (155) Q Consensus 10 dd~CiFC~I---i~ge~p~~~v~E~d~~~afl 38 (155) ..+|+||=| =.|-..+--+--||+-+-|| T Consensus 85 ~N~C~FCFidQlP~GmR~sLY~KDDDyRLSFL 116 (433) T TIGR03279 85 NNRCPFCFIDQQPPGKRESLYLKDDDYRLSFL 116 (433) T ss_pred CCCCCEEEECCCCCCCCCCEEEECCCEEEEEE T ss_conf 77785686466885554641462486345320 No 41 >COG1655 Uncharacterized protein conserved in bacteria [Function unknown] Probab=54.07 E-value=6.4 Score=20.71 Aligned_cols=48 Identities=15% Similarity=0.088 Sum_probs=26.5 Q ss_pred CCCCHHHHH--HCCCC--C-CCEEE-ECCEEEEEECCCCCCCCEEEEEECCCCCH Q ss_conf 897654544--46888--7-31787-78909999417888982299994155421 Q gi|254780780|r 10 DNQNIFIKI--IRNET--N-ACRVY-EDDILLAIMDIMPRNPGHVLIIPKSRIRD 58 (155) Q Consensus 10 dd~CiFC~I--i~ge~--p-~~~v~-E~d~~~afld~~P~~~GH~lViPk~H~~~ 58 (155) .=+|++|.- -+.+. . ++++. +.+....+-+++|. -.|++|+|+=-++. T Consensus 19 ~ieCPvC~tkFkkeev~tgsiRiiagDld~~lkygninP~-fY~VvvCP~C~yAa 72 (267) T COG1655 19 TIECPVCNTKFKKEEVKTGSIRIIAGDLDFFLKYGNINPY-FYDVVVCPICYYAA 72 (267) T ss_pred EECCCCCCCHHHHHHEECCCEEEECCCCCCEEECCCCCCC-EEEEEECCHHHHHH T ss_conf 1146765530112114526504761656522201542775-35788764166887 No 42 >PRK02770 S-adenosylmethionine decarboxylase proenzyme; Provisional Probab=46.19 E-value=26 Score=16.84 Aligned_cols=54 Identities=19% Similarity=0.209 Sum_probs=27.3 Q ss_pred HCCCCCEEEEEECCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 135266699982575668631010699995237997653455776668898999999999999999 Q gi|254780780|r 84 AFQADGIQILQFNGHAAGQTVPHLHFHVIPCKNGDNASHTNIHPTQKIENFAKLEINAQKIRKELQ 149 (155) Q Consensus 84 ~~~~~~~ni~~~~g~~~gq~V~HlH~HiiPR~~~d~~~~~~~~~~~~~~~~~~l~~~a~~lr~~l~ 149 (155) .|.+.|++.+..-. -.|+-+|--|-+.--..+.. -=+...+ .+..++-|+++|+ T Consensus 61 kF~P~GvT~v~lLa------ESHiSiHTWPE~gyaavDif---tCG~~~~---P~~a~~~L~~~l~ 114 (139) T PRK02770 61 RFEPQGVTALALLA------ESHISIHTWPESGYAAVDVF---TCGDHTM---PEKACQYLIEELM 114 (139) T ss_pred EECCCCEEEEEEEC------CCEEEEEECCCCCEEEEEEE---ECCCCCC---HHHHHHHHHHHHC T ss_conf 80799689999930------42789996676884999998---0599999---9999999999859 No 43 >TIGR01838 PHA_synth_I poly(R)-hydroxyalkanoic acid synthase, class I; InterPro: IPR010963 This entry represents the class I subfamily of poly(R)-hydroxyalkanoate synthases, which polymerises hydroxyacyl-CoAs with three to five carbons in the hydroxyacyl backbone into aliphatic esters termed poly(R)-hydroxyalkanoic acids. These polymers accumulate as carbon and energy storage inclusions in many species and can amount to 90 percent of the dry weight of the cell.; GO: 0008415 acyltransferase activity, 0042619 poly-hydroxybutyrate biosynthetic process, 0005737 cytoplasm. Probab=41.67 E-value=31 Score=16.41 Aligned_cols=91 Identities=11% Similarity=0.161 Sum_probs=57.2 Q ss_pred CCCCEEEECCEE------------------EE--------EECCCC--------CCCCEEEEE--ECCCCCHHHHHHHHH Q ss_conf 873178778909------------------99--------941788--------898229999--415542134402788 Q gi|254780780|r 23 TNACRVYEDDIL------------------LA--------IMDIMP--------RNPGHVLII--PKSRIRDIFEAPPEI 66 (155) Q Consensus 23 ~p~~~v~E~d~~------------------~a--------fld~~P--------~~~GH~lVi--Pk~H~~~l~dL~~~~ 66 (155) .|+.+||||+.+ ++ +||.+| +-.||++.+ .|---....+++-++ T Consensus 163 tpG~VVf~N~lfqLiQY~P~Te~V~~tPlLiVPP~INKyYILDLrP~NSlvrWlv~QG~TVF~~SWrNPd~~~A~~tfdD 242 (541) T TIGR01838 163 TPGKVVFENELFQLIQYEPTTETVHKTPLLIVPPFINKYYILDLRPENSLVRWLVEQGHTVFLISWRNPDAEQADLTFDD 242 (541) T ss_pred CCCEEEEECHHHHHHCCCCCCCCCCCCEEEEECCCCCCCEECCCCCCHHHHHHHHHCCCCEEEEEECCCCHHHHHCCHHH T ss_conf 99706874004366415798724043025873787331120167865358999995499389986006275645367478 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCE-EEEEEEEEEECCCCC Q ss_conf 8989999997304455313526669998257566863-101069999523799 Q gi|254780780|r 67 LSQIAFLIKKIAIACKSAFQADGIQILQFNGHAAGQT-VPHLHFHVIPCKNGD 118 (155) Q Consensus 67 ~~~l~~~~~~v~~al~~~~~~~~~ni~~~~g~~~gq~-V~HlH~HiiPR~~~d 118 (155) + +..-+-+..++++++.|-+.+|.+-. +.|-| +.=.=-++..|..++ T Consensus 243 Y--~~~G~~~Al~~v~~itGe~~~N~~GY---CIGGT~Ls~alA~lAArg~~~ 290 (541) T TIGR01838 243 Y--VRDGVIAALEVVEEITGEKQVNAVGY---CIGGTLLSTALAYLAARGEKK 290 (541) T ss_pred H--HHHHHHHHHHHHHHHCCCEEEEEEEE---EHHHHHHHHHHHHHHHCCCCC T ss_conf 9--99999999999986155406504541---044789999999999617896 No 44 >TIGR01734 D-ala-DACP-lig D-alanine-activating enzyme; InterPro: IPR010072 This entry represents the enzyme (also called D-alanine-D-alanyl carrier protein ligase) which activates D-alanine as an adenylate via the reaction D-ala + ATP to D-ala-AMP + PPi, and further catalyses the condensation of the amino acid adenylate with the D-alanyl carrier protein (D-ala-ACP). The D-alanine is then further transferred to teichoic acid in the biosynthesis of lipoteichoic acid (LTA) and wall teichoic acid (WTA) in Gram-positive bacteria, both polysacchatides .; GO: 0016208 AMP binding, 0047473 D-alanine-poly(phosphoribitol) ligase activity, 0019350 teichoic acid biosynthetic process. Probab=34.26 E-value=38 Score=15.85 Aligned_cols=42 Identities=29% Similarity=0.313 Sum_probs=19.8 Q ss_pred EEECCCCC-CCCCCCCCCCC---CCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 99523799-76534557766---68898999999999999999988 Q gi|254780780|r 111 VIPCKNGD-NASHTNIHPTQ---KIENFAKLEINAQKIRKELQNFL 152 (155) Q Consensus 111 iiPR~~~d-~~~~~~~~~~~---~~~~~~~l~~~a~~lr~~l~~~l 152 (155) +|||++.| ......-.--. -..+++.-..+.+.||++|++.+ T Consensus 434 vvP~~nkd~KV~~LiA~~v~~~G~~e~fe~e~~lT~aIK~eL~~~~ 479 (513) T TIGR01734 434 VVPKYNKDHKVEALIAYVVPETGVFEDFEKEIQLTKAIKKELKKIL 479 (513) T ss_pred EEECCCCCHHHHHHHHEEECCCCCCCHHHHHHHHHHHHHHHHHHHC T ss_conf 4301088148887441100168840035899999999999887511 No 45 >TIGR03330 SAM_DCase_Bsu S-adenosylmethionine decarboxylase proenzyme, Bacillus form. Members of this protein family are the single chain precursor of the two chains of the mature S-adenosylmethionine decarboxylase as found in Methanocaldococcus jannaschii, Bacillus subtilis, and a wide range of other species. It differs substantially in architecture from the form as found in Escherichia coli, and lacks any extended homology to the eukaryotic form (TIGR00535). Probab=33.61 E-value=42 Score=15.61 Aligned_cols=53 Identities=13% Similarity=0.162 Sum_probs=27.5 Q ss_pred CCCCCEEEEEECCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 35266699982575668631010699995237997653455776668898999999999999999 Q gi|254780780|r 85 FQADGIQILQFNGHAAGQTVPHLHFHVIPCKNGDNASHTNIHPTQKIENFAKLEINAQKIRKELQ 149 (155) Q Consensus 85 ~~~~~~ni~~~~g~~~gq~V~HlH~HiiPR~~~d~~~~~~~~~~~~~~~~~~l~~~a~~lr~~l~ 149 (155) |.|.|++.+.--.+ .|+-+|--|-+.--..+... =++..+ .+..++.|+++|+ T Consensus 48 f~p~GvT~v~ll~E------SHisiHTwPE~g~aaiDift---Cg~~~~---p~~a~~~l~~~f~ 100 (112) T TIGR03330 48 FSPGGVSGVVLLAE------SHISIHTWPEYGYAAVDVFT---CGDHSD---PEKAFEYLVEALK 100 (112) T ss_pred CCCCCEEEEEEECC------CEEEEEECCCCCEEEEEEEE---CCCCCC---HHHHHHHHHHHHC T ss_conf 49997999999324------36999986788849999986---389999---9999999999729 No 46 >TIGR03478 DMSO_red_II_bet DMSO reductase family type II enzyme, iron-sulfur subunit. This model represents the iron-sulfur subunit, typically called the beta subunit, of various proteins that also contain a molybdopterin subunit and a heme b subunit. The group includes two distinct but very closely related periplasmic proteins of anaerobic respiration, selenate reductase and chlorate reductase. Other members of this family include dimethyl sulphide dehydrogenase and ethylbenzene dehydrogenase. Probab=33.21 E-value=23 Score=17.24 Aligned_cols=22 Identities=18% Similarity=0.274 Sum_probs=17.5 Q ss_pred CCCCCCCCCHHH--HHHCCCCCCC Q ss_conf 656658976545--4446888731 Q gi|254780780|r 5 SSTHYDNQNIFI--KIIRNETNAC 26 (155) Q Consensus 5 s~~~~dd~CiFC--~Ii~ge~p~~ 26 (155) ..++.-++|.|| ||-+|..|+- T Consensus 183 ~~~~k~eKC~fC~~Rie~G~~PaC 206 (321) T TIGR03478 183 PQSQKSEKCIGCYPRIEKGIAPAC 206 (321) T ss_pred CCCCCCCCCCCCHHHHHCCCCCCC T ss_conf 766865658889889878999841 No 47 >TIGR02778 ligD_pol DNA polymerase LigD, polymerase domain; InterPro: IPR014145 DNA repair of double-stranded breaks by non-homologous end joining (NHEJ) is accomplished by a two-protein system that is present in a minority of prokaryotes. One component is the Ku protein (see IPR009187 from INTERPRO), which binds DNA ends. The other is a DNA ligase, a protein that is a multidomain polypeptide in most of those bacteria that have NheJ, a permuted polypeptide in Mycobacterium tuberculosis and a few other species, and the product of tandem genes in some other bacteria. This region represents the C-terminal polymerase domain.. Probab=33.09 E-value=28 Score=16.73 Aligned_cols=80 Identities=19% Similarity=0.159 Sum_probs=46.2 Q ss_pred CCEEEECCEEEEEECCCCCCCCEEEEEECCCC-C----HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCE-EEEEECCC Q ss_conf 31787789099994178889822999941554-2----134402788898999999730445531352666-99982575 Q gi|254780780|r 25 ACRVYEDDILLAIMDIMPRNPGHVLIIPKSRI-R----DIFEAPPEILSQIAFLIKKIAIACKSAFQADGI-QILQFNGH 98 (155) Q Consensus 25 ~~~v~E~d~~~afld~~P~~~GH~lViPk~H~-~----~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~~~-ni~~~~g~ 98 (155) .+++-+|-..++.|--.-..+-|++=..-.|. . -++|||+.+-..+..++ ..|+.++..++.-|. +++-..|. T Consensus 83 ~yl~~~d~~gL~~LvQ~g~lEfH~w~~~~~~~P~~PDriVFDLDPgp~~~~~~vv-eAA~~~r~~L~~lgL~sFvKTSGG 161 (251) T TIGR02778 83 TYLVIEDLEGLLWLVQMGALEFHIWGARIDAPPEKPDRIVFDLDPGPGVAWKLVV-EAALLIRELLDELGLESFVKTSGG 161 (251) T ss_pred CCEEEECHHHHHHHHHCCCEEEECCCCCCCCCCCCCCEEEEECCCCCHHHHHHHH-HHHHHHHHHHHHHCCCCCCEECCC T ss_conf 4227716899999987280577178887888724886688815888865799999-999999999987123564411589 Q ss_pred CCCCEEEEEEEEE Q ss_conf 6686310106999 Q gi|254780780|r 99 AAGQTVPHLHFHV 111 (155) Q Consensus 99 ~~gq~V~HlH~Hi 111 (155) . | ||++| T Consensus 162 K-G-----LHv~v 168 (251) T TIGR02778 162 K-G-----LHVYV 168 (251) T ss_pred C-E-----EEEEE T ss_conf 7-0-----38987 No 48 >TIGR02182 GRXB Glutaredoxin, GrxB family; InterPro: IPR011901 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This family includes the highly abundant Escherichia coli GrxB (Grx2) glutaredoxin which is notably longer than either GrxA or GrxC. Unlike the other two E. coli glutaredoxins, GrxB appears to be unable to reduce ribonucleotide reductase , and may have more to do with resistance to redox stress .; GO: 0009055 electron carrier activity, 0015035 protein disulfide oxidoreductase activity. Probab=30.07 E-value=19 Score=17.71 Aligned_cols=15 Identities=7% Similarity=0.002 Sum_probs=10.4 Q ss_pred EEECCCCCCCCCCCC Q ss_conf 995237997653455 Q gi|254780780|r 111 VIPCKNGDNASHTNI 125 (155) Q Consensus 111 iiPR~~~d~~~~~~~ 125 (155) ++||+..-++..+.. T Consensus 102 ~~PR~~k~~l~EFAT 116 (212) T TIGR02182 102 LLPRFAKLDLPEFAT 116 (212) T ss_pred CCCCHHCCCCCCCCC T ss_conf 155110168865676 No 49 >TIGR01660 narH nitrate reductase, beta subunit; InterPro: IPR006547 The nitrate reductase enzyme complex allows bacteria to use nitrate as an electron acceptor during anaerobic growth. The enzyme complex consists of a tetramer that has an alpha, beta and 2 gamma subunits. The alpha and beta subunits have catalytic activity and the gamma subunits attach the enzyme to the membrane and are b-type cytochromes that receive electrons from the quinone pool and transfers them to the beta subunit. The sequences in this family are the beta subunit for nitrate reductase I (narH) and nitrate reductase II (narY) for Gram-positive and Gram-negative bacteria. A few thermophiles and archaea also match the model. A number of the sequences in this set are experimentally characterised, these include: E.Coli NarH (P11349 from SWISSPROT) and NarY (P19318 from SWISSPROT) , , P42176 from SWISSPROT from Bacillus subtilis, and related proteins from Psuedomonas fluorescens, Paracoccus denitrificans, and Halomonas halodenitrificans.; GO: 0008940 nitrate reductase activity, 0042126 nitrate metabolic process, 0009325 nitrate reductase complex. Probab=27.12 E-value=21 Score=17.43 Aligned_cols=17 Identities=12% Similarity=0.186 Sum_probs=13.6 Q ss_pred CCCCCCCCHHH--HHHCCC Q ss_conf 56658976545--444688 Q gi|254780780|r 6 STHYDNQNIFI--KIIRNE 22 (155) Q Consensus 6 ~~~~dd~CiFC--~Ii~ge 22 (155) .+++.|+|+|| ||=.|. T Consensus 236 ~~gKsEKCi~CyPR~EaG~ 254 (495) T TIGR01660 236 KTGKSEKCILCYPRLEAGQ 254 (495) T ss_pred CCCCCCEEEECCCCHHCCC T ss_conf 6787130242266210476 No 50 >PRK11637 hypothetical protein; Provisional Probab=26.78 E-value=21 Score=17.51 Aligned_cols=81 Identities=15% Similarity=0.225 Sum_probs=40.0 Q ss_pred ECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCEEEEEEE Q ss_conf 78909999417888982299994155421344027888989999997304455313526669998257566863101069 Q gi|254780780|r 30 EDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLHF 109 (155) Q Consensus 30 E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~~~ni~~~~g~~~gq~V~HlH~ 109 (155) =-+..++|.+-. +|.-.+|=-.|-..++.| |+++-.+...++..+.. -+-+..+ |..++.+-||||| T Consensus 321 v~~G~Vv~a~~~---~gyG~~ViIdHG~g~~Tl----Yah~s~l~v~~Gq~V~~---Gq~Ig~v---G~tG~s~g~~LhF 387 (404) T PRK11637 321 IADGRVILADWL---QGYGLVVVVEHGKGDMSL----YGYNQSALVSVGAQVRA---GQPIALV---GSSGGQGRPSLYF 387 (404) T ss_pred ECCEEEEEEEEC---CCCCCEEEEECCCCCEEE----CCCCCCCCCCCCCEECC---CCEEEEE---ECCCCCCCCEEEE T ss_conf 017699991140---888857999869946571----52889588899799899---9969873---2689999965899 Q ss_pred EEEECCCCCCCCCCCC Q ss_conf 9995237997653455 Q gi|254780780|r 110 HVIPCKNGDNASHTNI 125 (155) Q Consensus 110 HiiPR~~~d~~~~~~~ 125 (155) -| |..+.+.++..| T Consensus 388 Ei--r~~g~~vnP~~w 401 (404) T PRK11637 388 EI--RRQGQAVNPQPW 401 (404) T ss_pred EE--EECCEEECHHHH T ss_conf 99--989998475886 No 51 >KOG3816 consensus Probab=25.17 E-value=27 Score=16.80 Aligned_cols=44 Identities=18% Similarity=0.152 Sum_probs=31.4 Q ss_pred ECCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4178889822999941554213440278889899999973044553 Q gi|254780780|r 38 MDIMPRNPGHVLIIPKSRIRDIFEAPPEILSQIAFLIKKIAIACKS 83 (155) Q Consensus 38 ld~~P~~~GH~lViPk~H~~~l~dL~~~~~~~l~~~~~~v~~al~~ 83 (155) .|.+|...|-..+-|++|..--+|++ .-.+.|.+....++.|.- T Consensus 393 fdkyPLVDG~fflSP~~h~~~~~eV~--l~GR~~yL~~vC~~CL~g 436 (526) T KOG3816 393 FDKYPLVDGVFFLSPVSHFGPKTEVS--LDGRRFYLQQVCARCLWG 436 (526) T ss_pred HCCCCCCCCEEEECCCCCCCCCEEEE--ECCEEHHHHHHHHHHHCC T ss_conf 01464445117855000468645643--256121189999998646 No 52 >PRK01706 S-adenosylmethionine decarboxylase proenzyme; Validated Probab=23.58 E-value=62 Score=14.52 Aligned_cols=54 Identities=22% Similarity=0.283 Sum_probs=26.8 Q ss_pred HCCCCCEEEEEECCCCCCCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 135266699982575668631010699995237997653455776668898999999999999999 Q gi|254780780|r 84 AFQADGIQILQFNGHAAGQTVPHLHFHVIPCKNGDNASHTNIHPTQKIENFAKLEINAQKIRKELQ 149 (155) Q Consensus 84 ~~~~~~~ni~~~~g~~~gq~V~HlH~HiiPR~~~d~~~~~~~~~~~~~~~~~~l~~~a~~lr~~l~ 149 (155) .|.|.|++.+.--.+ .|+-+|--|-+.--..+.. -=+...+ .+..++.|+++|+ T Consensus 50 ~F~p~GvT~v~lLaE------SHiSiHTwPE~g~aaiDiF---TCG~~~~---p~~a~~~L~~~l~ 103 (123) T PRK01706 50 EFDPHGVTVLVLLSE------SHLSIHTYPEKNFAAIDCY---TCGTTVE---PQIAIDYIVSILK 103 (123) T ss_pred ECCCCCEEEEEEEEC------CEEEEEECCCCCEEEEEEE---CCCCCCC---HHHHHHHHHHHHC T ss_conf 779998899999750------2689997565887999998---5799899---9999999999749 No 53 >TIGR01684 viral_ppase viral phosphatase; InterPro: IPR007827 This family contains uncharacterised baculoviral proteins.. Probab=22.93 E-value=60 Score=14.65 Aligned_cols=21 Identities=19% Similarity=0.178 Sum_probs=15.9 Q ss_pred EEEEEECCCCCCCCEEEEEEC Q ss_conf 099994178889822999941 Q gi|254780780|r 33 ILLAIMDIMPRNPGHVLIIPK 53 (155) Q Consensus 33 ~~~afld~~P~~~GH~lViPk 53 (155) .|+|+-..+|...||+|++.+ T Consensus 1 ~W~~lr~r~~~~krHvLfla~ 21 (323) T TIGR01684 1 SWVVLRLRNPLVKRHVLFLAS 21 (323) T ss_pred CCEEEECCCHHHCCCEEEEEC T ss_conf 907773266106380788960 No 54 >pfam05632 DUF792 Borrelia burgdorferi protein of unknown function (DUF792). This family consists of several hypothetical proteins from the Lyme disease spirochete Borrelia burgdorferi. Probab=21.30 E-value=69 Score=14.23 Aligned_cols=61 Identities=28% Similarity=0.317 Sum_probs=43.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCE-EEEEEEEEEEC Q ss_conf 155421344027888989999997304455313526669998257566863-10106999952 Q gi|254780780|r 53 KSRIRDIFEAPPEILSQIAFLIKKIAIACKSAFQADGIQILQFNGHAAGQT-VPHLHFHVIPC 114 (155) Q Consensus 53 k~H~~~l~dL~~~~~~~l~~~~~~v~~al~~~~~~~~~ni~~~~g~~~gq~-V~HlH~HiiPR 114 (155) |+......|....+..++..+++.+..-+-..++++.+-++..-...-|+. ||.+.+ |.|. T Consensus 16 ~k~~~~~~~~~~~~~~ei~~iikdv~~Qifalf~~dNFlvLfPR~DfKGfGYVPQLFF-IkpK 77 (213) T pfam05632 16 KKEEEKICDFEKKEFREIFQIIKDIATQIFALFGADNFLLLFPRMDLKGFGYVPQLFF-IKPK 77 (213) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCEEEE-EECC T ss_conf 2466631244444389999999999999999856465699803777676513432799-8204 No 55 >pfam01868 UPF0086 Domain of unknown function UPF0086. This family consists of several archaeal and eukaryotic proteins. The archaeal proteins are found to be expressed within ribosomal operons and several of the sequences are described as ribonuclease P protein subunit p29 proteins. Probab=21.29 E-value=69 Score=14.23 Aligned_cols=36 Identities=31% Similarity=0.241 Sum_probs=28.5 Q ss_pred CCEEEECCEEEEEECCCCCCCCEEEEEECCCCCHHHHHHH Q ss_conf 3178778909999417888982299994155421344027 Q gi|254780780|r 25 ACRVYEDDILLAIMDIMPRNPGHVLIIPKSRIRDIFEAPP 64 (155) Q Consensus 25 ~~~v~E~d~~~afld~~P~~~GH~lViPk~H~~~l~dL~~ 64 (155) +.+|.|+...+.+.+. .|.+.+|||.+...-+.+++ T Consensus 32 GiVv~ETknt~~i~t~----~~~~~~IpK~~~~F~f~l~~ 67 (89) T pfam01868 32 GIVVDETRNTLVIETE----KGKEKVIPKKGSVFRFELPD 67 (89) T ss_pred EEEEEEECCEEEEECC----CCCEEEEECCCEEEEEEECC T ss_conf 9998620135999938----99799985798699999699 No 56 >pfam10926 DUF2800 Protein of unknown function (DUF2800). This is a family of uncharacterized proteins found in bacteria and viruses. Some members of this family are annotated as being Phi APSE P51-like proteins. Probab=21.26 E-value=20 Score=17.57 Aligned_cols=46 Identities=13% Similarity=0.161 Sum_probs=26.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCEEEEEEEEEEECC Q ss_conf 027888989999997304455313526669998257566863101069999523 Q gi|254780780|r 62 APPEILSQIAFLIKKIAIACKSAFQADGIQILQFNGHAAGQTVPHLHFHVIPCK 115 (155) Q Consensus 62 L~~~~~~~l~~~~~~v~~al~~~~~~~~~ni~~~~g~~~gq~V~HlH~HiiPR~ 115 (155) |++++++++..-+..+..+++.+-.. -.+-+ ..|..+|. +-+++.. T Consensus 246 Ls~~ei~~il~~~~~l~~W~~dv~~~-a~~~~-----~~G~~~pG--~KlV~GR 291 (363) T pfam10926 246 LTDEELAELLEKADLLEKWAKDVEAY-ALDEA-----RNGKEVPG--YKLVEGR 291 (363) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH-----HCCCCCCC--EEEECCC T ss_conf 99999999997148999999999999-99999-----77998787--2661476 No 57 >COG1140 NarY Nitrate reductase beta subunit [Energy production and conversion] Probab=20.54 E-value=32 Score=16.30 Aligned_cols=18 Identities=17% Similarity=0.274 Sum_probs=14.2 Q ss_pred CCCCCCCCHHH--HHHCCCC Q ss_conf 56658976545--4446888 Q gi|254780780|r 6 STHYDNQNIFI--KIIRNET 23 (155) Q Consensus 6 ~~~~dd~CiFC--~Ii~ge~ 23 (155) .+++.++|+|| ||-.|+. T Consensus 236 ksgKsEKCifCyPRiEaGqP 255 (513) T COG1140 236 KSGKSEKCIFCYPRIEAGQP 255 (513) T ss_pred CCCCCCEEEEECCCCCCCCC T ss_conf 57874303775242226999 No 58 >pfam01777 Ribosomal_L27e Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family. Probab=20.34 E-value=73 Score=14.11 Aligned_cols=12 Identities=25% Similarity=0.396 Sum_probs=7.4 Q ss_pred EEEE-CCCCCCCC Q ss_conf 9995-23799765 Q gi|254780780|r 110 HVIP-CKNGDNAS 121 (155) Q Consensus 110 HiiP-R~~~d~~~ 121 (155) ||+| ||.-|... T Consensus 28 HlMPTRY~vd~~~ 40 (85) T pfam01777 28 HLMPTRYSVDVED 40 (85) T ss_pred CCCCCCEEECCCC T ss_conf 4455523641502 Done!