Query gi|254781025|ref|YP_003065438.1| deoxyuridine 5 27-triphosphate nucleotidohydrolase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 154 No_of_seqs 108 out of 2328 Neff 6.1 Searched_HMMs 39220 Date Mon May 30 05:10:42 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781025.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR00576 dut dUTP diphosphata 100.0 0 0 377.7 14.1 146 6-153 2-151 (151) 2 PRK00601 dut deoxyuridine 5'-t 100.0 0 0 334.0 15.0 149 1-154 1-149 (149) 3 COG0756 Dut dUTPase [Nucleotid 100.0 0 0 320.1 14.2 147 1-154 1-148 (148) 4 PRK13956 dut deoxyuridine 5'-t 100.0 0 0 293.4 12.9 133 8-153 9-147 (147) 5 PTZ00143 deoxyuridine 5'-triph 100.0 0 0 284.0 11.8 141 4-153 1-154 (155) 6 pfam00692 dUTPase dUTPase. dUT 100.0 5.6E-45 0 276.0 13.4 128 16-152 2-129 (129) 7 KOG3370 consensus 100.0 4.3E-40 1.1E-44 248.1 5.8 138 6-153 3-140 (140) 8 cd07557 trimeric_dUTPase Trime 99.9 2E-26 5.1E-31 170.4 9.4 92 28-122 1-92 (92) 9 TIGR02274 dCTP_deam deoxycytid 99.9 5.8E-24 1.5E-28 156.4 10.4 112 36-153 82-193 (198) 10 PRK00416 dcd deoxycytidine tri 99.8 4.8E-18 1.2E-22 122.7 11.9 105 39-153 65-169 (174) 11 PRK02253 deoxyuridine 5'-triph 99.8 2.8E-18 7.2E-23 124.1 8.9 87 38-128 58-144 (159) 12 PHA01707 dut 2'-deoxyuridine 5 99.8 6E-18 1.5E-22 122.2 9.5 100 25-128 27-137 (158) 13 COG0717 Dcd Deoxycytidine deam 99.7 8.2E-17 2.1E-21 115.7 11.2 83 41-126 74-156 (183) 14 pfam06559 DCD 2'-deoxycytidine 98.1 1.4E-05 3.6E-10 51.9 7.4 84 40-125 81-170 (366) 15 PRK07559 2'-deoxycytidine 5'-t 98.0 2.7E-05 6.9E-10 50.2 7.2 84 39-124 79-168 (366) 16 pfam06559 DCD 2'-deoxycytidine 96.8 0.01 2.6E-07 35.6 7.9 85 40-127 248-339 (366) 17 PRK07559 2'-deoxycytidine 5'-t 96.7 0.012 3.2E-07 35.1 7.7 86 39-127 247-339 (366) 18 pfam05784 Herpes_UL82_83 Betah 95.4 0.21 5.4E-06 28.1 9.3 93 20-125 249-341 (346) 19 pfam04797 Herpes_ORF11 Herpesv 95.3 0.23 5.8E-06 27.9 10.6 81 29-123 260-346 (384) 20 TIGR00266 TIGR00266 conserved 70.2 5.6 0.00014 20.0 3.4 85 30-119 8-108 (230) 21 COG1124 DppF ABC-type dipeptid 62.9 3.5 8.9E-05 21.2 1.2 67 54-125 24-91 (252) 22 TIGR02868 CydC ABC transporter 50.5 5.8 0.00015 19.9 0.6 83 40-130 363-446 (566) 23 cd03262 ABC_HisP_GlnQ_permease 48.3 3.5 9E-05 21.2 -0.8 70 53-125 16-85 (213) 24 TIGR01322 scrB_fam sucrose-6-p 47.1 9.6 0.00024 18.7 1.3 26 23-52 272-297 (475) 25 pfam06030 DUF916 Bacterial pro 46.7 18 0.00046 17.1 2.6 27 18-53 6-32 (121) 26 TIGR03608 L_ocin_972_ABC putat 45.9 7 0.00018 19.5 0.4 52 52-106 13-64 (206) 27 KOG0729 consensus 44.2 8.3 0.00021 19.0 0.6 86 50-138 68-171 (435) 28 COG4988 CydD ABC-type transpor 43.6 12 0.0003 18.2 1.3 63 40-105 324-389 (559) 29 COG1136 SalX ABC-type antimicr 42.1 10 0.00027 18.5 0.8 70 52-125 20-92 (226) 30 cd03229 ABC_Class3 This class 42.0 11 0.00028 18.3 0.9 69 53-125 16-85 (178) 31 TIGR03415 ABC_choXWV_ATP choli 36.5 6.7 0.00017 19.6 -0.9 48 47-99 35-83 (382) 32 COG2430 Uncharacterized conser 34.5 37 0.00095 15.3 6.4 74 39-113 72-164 (236) 33 COG4525 TauB ABC-type taurine 34.1 18 0.00045 17.2 1.0 42 55-100 23-65 (259) 34 PRK10418 nikD nickel transport 33.8 20 0.00051 16.9 1.2 65 54-124 20-87 (254) 35 PRK13637 cbiO cobalt transport 33.7 12 0.00031 18.1 0.1 67 54-123 24-90 (287) 36 KOG0125 consensus 33.1 15 0.00039 17.5 0.5 37 106-149 103-139 (376) 37 TIGR02857 CydD ABC transporter 32.6 20 0.00051 16.8 1.1 90 29-124 345-439 (570) 38 pfam07070 Spo0M SpoOM protein. 31.5 42 0.0011 15.0 4.6 57 38-100 76-132 (218) 39 pfam01238 PMI_typeI Phosphoman 30.5 43 0.0011 14.9 4.0 57 23-83 237-293 (373) 40 COG3638 ABC-type phosphate/pho 28.4 17 0.00043 17.3 0.1 59 44-106 12-70 (258) 41 PHA02118 hypothetical protein 25.1 31 0.00078 15.8 0.9 11 144-154 187-197 (202) 42 KOG2230 consensus 24.4 56 0.0014 14.3 2.5 88 5-97 688-779 (867) 43 PRK00513 minC septum formation 23.4 59 0.0015 14.2 3.8 40 80-119 140-180 (214) 44 pfam01393 Chromo_shadow Chromo 22.7 28 0.00072 16.0 0.4 43 79-123 2-47 (56) 45 TIGR03258 PhnT 2-aminoethylpho 22.2 25 0.00064 16.3 0.0 15 60-74 28-42 (362) 46 cd03290 ABCC_SUR1_N The SUR do 20.9 40 0.001 15.2 0.8 52 54-108 18-69 (218) 47 PRK13642 cbiO cobalt transport 20.4 25 0.00065 16.3 -0.3 65 55-124 25-89 (277) 48 TIGR02769 nickel_nikE nickel i 20.3 52 0.0013 14.5 1.3 78 46-126 21-99 (267) 49 TIGR00074 hypC_hupF hydrogenas 20.1 50 0.0013 14.6 1.2 53 5-60 6-61 (88) No 1 >TIGR00576 dut dUTP diphosphatase; InterPro: IPR008181 Synonym(s): dUTP diphosphatase, Deoxyuridine-triphosphatase. The essential enzyme dUTP pyrophosphatase (3.6.1.23 from EC) is specific for dUTP and is critical for the fidelity of DNA replication and repair. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate, simultaneously reducing dUTP levels and providing the dUMP for dTTP biosynthesis. dUTPase decreases the intracellular concentration of dUPT so that uracil cannot be incorporated into DNA . The crystal structure of human dUTPase reveals that each subunit of the dUTPase trimer folds into an eight-stranded jelly-roll beta barrel, with the C-terminal beta strands interchanged among the subunits. The structure is similar to that of the Escherichia coli enzyme, despite low sequence homology between the two enzymes . Other enzymes like deoxycytidine triphosphate deaminase (dCTP) (3.5.4.13 from EC) that specifically bind uridine also belong to this group suggesting that the signature may recognise a putative uridine-binding motif. Some retroviruses encode dUTPases. Retroviral dUTPase is synthesised as part of POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, dUTPase and RNase H. ; GO: 0016787 hydrolase activity, 0046080 dUTP metabolic process. Probab=100.00 E-value=0 Score=377.65 Aligned_cols=146 Identities=49% Similarity=0.838 Sum_probs=134.0 Q ss_pred EEEEECCCCCCCCCCCCCCCCCCEEEEEEECCCCCCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCC Q ss_conf 89997675479703743578864087891037787389379973999658602258761467875266313322202355 Q gi|254781025|r 6 IPFLQLPHAHGIPLPEYKTSGSSGMDLFAALPEDEPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNS 85 (154) Q Consensus 6 iki~~~~~~~~~~lP~~~t~~~AG~Dl~a~~~~~e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~ 85 (154) ++++|+ ++++++|+|||++||||||||+.+.+++++|+|++++||||||++++|.||+++|.||||||+||+|++.|+ T Consensus 2 ~~~~~l--~~~a~~P~ya~~~aAG~DL~a~~~~~~P~~i~P~~~~Lv~TG~~~~lP~g~~g~v~PRSGLA~K~gv~i~n~ 79 (151) T TIGR00576 2 LKFVKL--SENAPLPTYATEGAAGYDLRAAEDVADPVTIPPGERALVPTGIAIELPEGYYGRVAPRSGLALKHGVTIDNS 79 (151) T ss_pred EEEEEC--CCCCCCCCCCCCCCCCCEEEEECCCCCCEEECCCCEEEECCCEEEECCCCCEEEEEECCCCCEEEEEEEECC T ss_conf 388860--889867862787656623778417998758789984875244299848994899730441330246899667 Q ss_pred CEEECCCCCCCEEEEEEECC-CCEEEEECCCCEEEEEEEECCCCEEEEE---ECCCCCCCCCCCCCCCCCCC Q ss_conf 03665687887699999807-9449995595035899975043406998---71357754268888735467 Q gi|254781025|r 86 PGTIDSDYRGEIKILLINLG-QENFLIMRGMRIAQLIIANSVRAHPSLI---SAMPMGKNERNEKGFGSTGL 153 (154) Q Consensus 86 ~GvID~~YrGei~v~l~N~~-~~~~~i~~G~RIaQlv~~~~~~~~~~~v---~~~~l~~t~RG~~GFGSTG~ 153 (154) +|+||+||||||+|.|+|++ +++|+|++||||||||++|+.+++|.++ ++++|++|+||+|||||||. T Consensus 80 ~GvID~DYRGE~kV~L~N~gp~~~f~v~~GdRIAQlv~~~~~~v~~v~~~~~~v~~L~~T~RG~GGFGSTG~ 151 (151) T TIGR00576 80 PGVIDADYRGEIKVILINLGPKEDFTVKKGDRIAQLVVEKIVQVEFVEVEFEEVEELDETERGEGGFGSTGV 151 (151) T ss_pred CCEEECCCCCCEEEEEEECCCCCCEEECCCCEEEEEEEEEEEEEEEEEEEEEEEECCCCCCCCCCCCCCCCC T ss_conf 817817872026899985899986788579757768873156899862004564046666775887465799 No 2 >PRK00601 dut deoxyuridine 5'-triphosphate nucleotidohydrolase; Provisional Probab=100.00 E-value=0 Score=334.03 Aligned_cols=149 Identities=50% Similarity=0.733 Sum_probs=135.8 Q ss_pred CCEEEEEEEECCCCCCCCCCCCCCCCCCEEEEEEECCCCCCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEE Q ss_conf 96357899976754797037435788640878910377873893799739996586022587614678752663133222 Q gi|254781025|r 1 MQSIDIPFLQLPHAHGIPLPEYKTSGSSGMDLFAALPEDEPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGV 80 (154) Q Consensus 1 M~~~~iki~~~~~~~~~~lP~~~t~~~AG~Dl~a~~~~~e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl 80 (154) |.+++||++.. +..++.||+|+|++||||||||+. +++++|+|+++++|+||+++++|+||+++|+||||+++|+|| T Consensus 1 m~~~~ik~l~~-~~~~avlP~r~t~~dAG~DL~a~~--~~~~vi~p~~~~~I~TGi~i~iP~g~~~~I~~RSsla~K~gl 77 (149) T PRK00601 1 MKKIDVKILDP-RLGKEFLPAYATEGSAGLDLRACL--DEPVTLAPGETALVPTGLAIHIPDGYAAQILPRSGLAHKHGI 77 (149) T ss_pred CCCEEEEECCC-CCCCCCCCCCCCCCCEEEEEEECC--CCCEEECCCCEEEECCCEEEECCCCCEEEEEECCCCCCCCCE T ss_conf 96188998798-877776667789984847574247--897888999889971897998799939999856883300434 Q ss_pred ECCCCCEEECCCCCCCEEEEEEECCCCEEEEECCCCEEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCC Q ss_conf 02355036656878876999998079449995595035899975043406998713577542688887354679 Q gi|254781025|r 81 ACLNSPGTIDSDYRGEIKILLINLGQENFLIMRGMRIAQLIIANSVRAHPSLISAMPMGKNERNEKGFGSTGLY 154 (154) Q Consensus 81 ~~~n~~GvID~~YrGei~v~l~N~~~~~~~i~~G~RIaQlv~~~~~~~~~~~v~~~~l~~t~RG~~GFGSTG~~ 154 (154) .+.|++||||+||||||+|.|+|++++++.|++||||||+||+|+.+++|++++ +|++|+||++||||||-+ T Consensus 78 ~v~N~~GvID~dYrGei~v~l~N~s~~~~~i~~GdRIaQlv~~~~~~~~~~~v~--~l~~t~RG~gGFGSTGk~ 149 (149) T PRK00601 78 VLGNLPGTIDSDYRGELMVSLWNRGQEPFTIEPGERIAQLVIVPVVQAEFEEVE--EFDETERGEGGFGSTGRH 149 (149) T ss_pred ECCCCCCCCCCCCCCEEEEEEEECCCCCEEECCCCEEEEEEEEECCCCEEEECC--CCCCCCCCCCCCCCCCCC T ss_conf 105865342788885489999989998778579998999998324662699906--478752469977778899 No 3 >COG0756 Dut dUTPase [Nucleotide transport and metabolism] Probab=100.00 E-value=0 Score=320.12 Aligned_cols=147 Identities=52% Similarity=0.858 Sum_probs=136.9 Q ss_pred CCEEEEEEEECCCCCCCCCCCCCCCCCCEEEEEEECCCCCC-EEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEE Q ss_conf 96357899976754797037435788640878910377873-89379973999658602258761467875266313322 Q gi|254781025|r 1 MQSIDIPFLQLPHAHGIPLPEYKTSGSSGMDLFAALPEDEP-VELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYG 79 (154) Q Consensus 1 M~~~~iki~~~~~~~~~~lP~~~t~~~AG~Dl~a~~~~~e~-~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~g 79 (154) |+..+++ ++.+.++.++||+|+|+++|||||+|+ ++ ++|+|++++|||||+++++|+||+++|+|||||++||| T Consensus 1 ~~~~~~~-~~~~~~~~~~lP~y~t~gsAG~DLrA~----~~~~~i~pg~~~LVpTGl~~~ip~g~~~~v~PRSgla~k~G 75 (148) T COG0756 1 MRGFEMK-LDKRLNEGAPLPKYATEGSAGYDLRAA----EDEVTIAPGERKLVPTGLAIELPEGYEAQVRPRSGLALKHG 75 (148) T ss_pred CCCEEEE-EEEECCCCCCCCCCCCCCCCCCCEECC----CCEEEECCCCEEEECCCEEEECCCCCEEEEECCCCCCEECE T ss_conf 9624789-986138888677101688653003203----33168889973886178789858980899953665660032 Q ss_pred EECCCCCEEECCCCCCCEEEEEEECCCCEEEEECCCCEEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCCC Q ss_conf 202355036656878876999998079449995595035899975043406998713577542688887354679 Q gi|254781025|r 80 VACLNSPGTIDSDYRGEIKILLINLGQENFLIMRGMRIAQLIIANSVRAHPSLISAMPMGKNERNEKGFGSTGLY 154 (154) Q Consensus 80 l~~~n~~GvID~~YrGei~v~l~N~~~~~~~i~~G~RIaQlv~~~~~~~~~~~v~~~~l~~t~RG~~GFGSTG~~ 154 (154) |++.|++|+||+|||||++|.|+|+++++++|++||||||+||+|+.++++.+++ .+++|+||+|||||||+. T Consensus 76 i~~~Ns~G~IDsDYrGei~V~l~N~~~~~f~ie~GdRIaQ~V~~~v~~~~~~~ve--~~~~t~rg~GGFGSTG~~ 148 (148) T COG0756 76 ITLGNSPGTIDSDYRGEIKVLLINLGDEDFVIEKGDRIAQLVFVKVLQAEFDEVE--NLDETERGTGGFGSTGVK 148 (148) T ss_pred EEEECCCCEECCCCCCEEEEEEEECCCCCEEECCCCEEEEEEEEEEEECCEEEEE--CCCCCCCCCCCCCCCCCC T ss_conf 8993678657777875589999937999878627888989999988754546651--013423455887877769 No 4 >PRK13956 dut deoxyuridine 5'-triphosphate nucleotidohydrolase; Provisional Probab=100.00 E-value=0 Score=293.42 Aligned_cols=133 Identities=30% Similarity=0.398 Sum_probs=120.9 Q ss_pred EEECCCCCCCCCCCCCCCCCCEEEEEEECCCCCCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCE Q ss_conf 99767547970374357886408789103778738937997399965860225876146787526631332220235503 Q gi|254781025|r 8 FLQLPHAHGIPLPEYKTSGSSGMDLFAALPEDEPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPG 87 (154) Q Consensus 8 i~~~~~~~~~~lP~~~t~~~AG~Dl~a~~~~~e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~G 87 (154) |+|+ ++++.||+|+|++||||||||+ |+++|+|+++++||||+++++|+||+++|+||||++.|+||.+.|++| T Consensus 9 ikkl--~~~a~lP~rat~~~AG~DL~a~----e~~~i~pge~~lI~TGi~~~iP~G~~~~I~pRSgl~~K~gl~l~N~~G 82 (147) T PRK13956 9 VSSF--TNEDLLPKRETAHAAGYDLKVA----EAVTIAPGEIKLVPTGVKAYMQPGEVLYLYDRSSNPRKKGLVLINSVG 82 (147) T ss_pred EEEC--CCCCCCCCCCCCCCEEEEEECC----CCEEECCCCEEEEECCEEEECCCCCEEEEECCCHHHHHCCEEECCCCC T ss_conf 8983--7785265889998330137469----896999999999848979987999889998874453316778637246 Q ss_pred EECCCC------CCCEEEEEEECCCCEEEEECCCCEEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 665687------887699999807944999559503589997504340699871357754268888735467 Q gi|254781025|r 88 TIDSDY------RGEIKILLINLGQENFLIMRGMRIAQLIIANSVRAHPSLISAMPMGKNERNEKGFGSTGL 153 (154) Q Consensus 88 vID~~Y------rGei~v~l~N~~~~~~~i~~G~RIaQlv~~~~~~~~~~~v~~~~l~~t~RG~~GFGSTG~ 153 (154) |||+|| ||||++.|+|++++++.|++||||||+||+|+..+ + ....++|++|||||||- T Consensus 83 vIDsdY~g~~~~~Gei~v~l~N~~~~~~~I~~GdRIAQlv~~p~~~~-----~--~~~~~~~~~GGFGSTGk 147 (147) T PRK13956 83 VIDGDYYGNPANEGHIFAQMQNITDQPVTLEVGERIVQGVFMPFLIA-----D--GDQADGERTGGFGSTGQ 147 (147) T ss_pred CCCCCCCCCCCCCEEEEEEEEECCCCCEEECCCCEEEEEEEEEEEEE-----C--CCCCCCCCCCCCCCCCC T ss_conf 68176547888730699999947997679789998998998689993-----2--55773527897788989 No 5 >PTZ00143 deoxyuridine 5'-triphosphate nucleotidohydrolase; Provisional Probab=100.00 E-value=0 Score=283.99 Aligned_cols=141 Identities=26% Similarity=0.345 Sum_probs=122.8 Q ss_pred EEEEEEECCCCCCC----CCCCCCCCCCCEEEEEEECCCCCCEEECCCCEEEEECCCHHHC----CC-----CEEEEEEE Q ss_conf 57899976754797----0374357886408789103778738937997399965860225----87-----61467875 Q gi|254781025|r 4 IDIPFLQLPHAHGI----PLPEYKTSGSSGMDLFAALPEDEPVELLPGMRSLIPGGYIIAI----PP-----GYEGQVRS 70 (154) Q Consensus 4 ~~iki~~~~~~~~~----~lP~~~t~~~AG~Dl~a~~~~~e~~~i~pg~~~li~Tgi~i~i----p~-----g~~~~i~~ 70 (154) |+++|+.+ ++.+ .-+++.|++|||||||++ ++++|+|+++++|+||+++++ ++ ++..+|+| T Consensus 1 m~l~i~~~--~~~~~~~y~~~~~~~~GDaG~DL~~~----ed~~i~p~~~~lI~tGIk~~~~~~~~~~~~g~~~~~~i~P 74 (155) T PTZ00143 1 MHLLLVPL--NDEVRELYKNHKTFHEGDSGLDLFIV----EDQVLKAKSTTFIKLGIKAIALQYKSDGSDGKNVSYLLFP 74 (155) T ss_pred CCEEEEEC--CHHHHHHHHCCCCCCCCCEEEEEECC----CCEEECCCCEEEEEEEEEEEEEEECCCCCCCCCCEEEEEC T ss_conf 95578838--88898887665878998600238034----6638899975999620289886413777567751379856 Q ss_pred CCCCEEEEEEECCCCCEEECCCCCCCEEEEEEECCCCEEEEECCCCEEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCC Q ss_conf 26631332220235503665687887699999807944999559503589997504340699871357754268888735 Q gi|254781025|r 71 RSGLALNYGVACLNSPGTIDSDYRGEIKILLINLGQENFLIMRGMRIAQLIIANSVRAHPSLISAMPMGKNERNEKGFGS 150 (154) Q Consensus 71 RSsl~~K~gl~~~n~~GvID~~YrGei~v~l~N~~~~~~~i~~G~RIaQlv~~~~~~~~~~~v~~~~l~~t~RG~~GFGS 150 (154) |||++ |++|+++|++||||+||||||+|.++|++++++.|++|+||||+|+.++..++|++|+ +|++|+||++|||| T Consensus 75 RSsls-K~~l~l~N~vGiIDsgYRGeI~v~l~N~~~~~~~i~~GdRiaQlv~~~~~~~~~~~v~--el~~T~RG~gGFGS 151 (155) T PTZ00143 75 RSSIS-KTPLRLANSVGLIDAGYRGELIAALDNIKDFEYTVKKGDRLVQLVSFTGEPITFELVD--ELDETSRGEGGFGS 151 (155) T ss_pred CCHHC-CCCEEECCCCCEECCCCCEEEEEEEEECCCCCEEECCCCEEEEEEECCCCCEEEEECC--CCCCCCCCCCCCCC T ss_conf 76320-4578972764178589966899999959998889747967999983125733799933--57877567997775 Q ss_pred CCC Q ss_conf 467 Q gi|254781025|r 151 TGL 153 (154) Q Consensus 151 TG~ 153 (154) ||- T Consensus 152 TGk 154 (155) T PTZ00143 152 TSN 154 (155) T ss_pred CCC T ss_conf 889 No 6 >pfam00692 dUTPase dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate. Probab=100.00 E-value=5.6e-45 Score=276.00 Aligned_cols=128 Identities=42% Similarity=0.683 Sum_probs=119.0 Q ss_pred CCCCCCCCCCCCCEEEEEEECCCCCCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCC Q ss_conf 97037435788640878910377873893799739996586022587614678752663133222023550366568788 Q gi|254781025|r 16 GIPLPEYKTSGSSGMDLFAALPEDEPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRG 95 (154) Q Consensus 16 ~~~lP~~~t~~~AG~Dl~a~~~~~e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrG 95 (154) ++.||+|++++|||||||++ ++.+|+|++.++|+||+++++|+||+++|+||||++. +++.+ .+||||+|||| T Consensus 2 ~a~lP~r~~~~dAG~Dl~~~----~~~~i~p~~~~~i~tgi~~~ip~g~~~~i~~RSs~~~-kgl~~--~~GiID~dyrG 74 (129) T pfam00692 2 EGEIPTRKRPGDAGYDLCAP----YDFTLPPGGTVLVPTDIRIPLPPGTFGLILGRSSLAA-KGLIV--LPGIIDSDYRG 74 (129) T ss_pred CCCCCCCCCCCCEEEEEECC----CCEEECCCCEEEEECCEEEECCCCEEEEEECCCCCHH-CCCCC--CCCEECCCCCC T ss_conf 74437779998588728628----9979999998999899899889994999961661000-26644--58788878985 Q ss_pred CEEEEEEECCCCEEEEECCCCEEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCC Q ss_conf 769999980794499955950358999750434069987135775426888873546 Q gi|254781025|r 96 EIKILLINLGQENFLIMRGMRIAQLIIANSVRAHPSLISAMPMGKNERNEKGFGSTG 152 (154) Q Consensus 96 ei~v~l~N~~~~~~~i~~G~RIaQlv~~~~~~~~~~~v~~~~l~~t~RG~~GFGSTG 152 (154) ||++.|+|++++++.|++|+|||||||+|+..++++++ +++++|+||++|||||| T Consensus 75 ei~v~l~N~~~~~~~i~~G~RiaQli~~~~~~~~~~~~--~~l~~t~RG~~GfGSTG 129 (129) T pfam00692 75 EIKVVLFNSGKKDFTIKKGDRIAQLILLPCVHPELEPV--ENLERTERGDSGFGSSG 129 (129) T ss_pred EEEEEEEECCCCCEEEECCCEEEEEEEEECCCCEEEEC--CCCCCCCCCCCCCCCCC T ss_conf 18999999989999991798599999842176558997--87887636799888898 No 7 >KOG3370 consensus Probab=100.00 E-value=4.3e-40 Score=248.12 Aligned_cols=138 Identities=36% Similarity=0.614 Sum_probs=127.2 Q ss_pred EEEEECCCCCCCCCCCCCCCCCCEEEEEEECCCCCCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCC Q ss_conf 89997675479703743578864087891037787389379973999658602258761467875266313322202355 Q gi|254781025|r 6 IPFLQLPHAHGIPLPEYKTSGSSGMDLFAALPEDEPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNS 85 (154) Q Consensus 6 iki~~~~~~~~~~lP~~~t~~~AG~Dl~a~~~~~e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~ 85 (154) +++.++ +..|..|+|++.++|||||||+ ++..++|....+|+|++.+++|.++|+++.||||||+||+|.+. T Consensus 3 l~~~~~--s~~a~~p~Rgs~~aaGydl~sa----~~~~vpa~gk~~V~td~q~~vP~g~ygRvaprsglA~k~~I~~g-- 74 (140) T KOG3370 3 LRFAKL--SESATIPTRGSAGAAGYDLYSA----QDGTVPARGKAVVDTDLQIAVPSGYYGRVAPRSGLAWKHFIDVG-- 74 (140) T ss_pred CCHHHC--CHHHCCCCCCCCCCCCCCHHHH----CCCCCCCCCCEECCCCCEEECCCCEEEEECCCCCCHHHCCCCCC-- T ss_conf 552224--4322456557755356240221----04522766543404662363472000013454331022056546-- Q ss_pred CEEECCCCCCCEEEEEEECCCCEEEEECCCCEEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 03665687887699999807944999559503589997504340699871357754268888735467 Q gi|254781025|r 86 PGTIDSDYRGEIKILLINLGQENFLIMRGMRIAQLIIANSVRAHPSLISAMPMGKNERNEKGFGSTGL 153 (154) Q Consensus 86 ~GvID~~YrGei~v~l~N~~~~~~~i~~G~RIaQlv~~~~~~~~~~~v~~~~l~~t~RG~~GFGSTG~ 153 (154) .|+||+|||||+.+.|+|+++..|.+++|||||||++.++..++..+++ +|..|+||++||||||+ T Consensus 75 agvVd~dyrgeV~v~LfN~~~~~F~~k~Gdriaqli~~~i~~~~i~~v~--sLe~t~Rg~~Gfgstg~ 140 (140) T KOG3370 75 AGVVDPDYRGEVGVLLFNHSDRDFEYKKGDRIAQLIVEKIVTPEIVLVS--SLEATERGAGGFGSTGV 140 (140) T ss_pred CCEECCCCCCEEEEEEECCCCCCEEEECCCCCEEEEEEECCCCCEEHHH--HHHHHHHHCCCCCCCCC T ss_conf 7335545442267998658986345512882101599964797611266--67777632157687789 No 8 >cd07557 trimeric_dUTPase Trimeric dUTP diphosphatases. Trimeric dUTP diphosphatases, or dUTPases, are the most common family of dUTPase, found in bacteria, eukaryotes, and archaea. They catalyze the hydrolysis of the dUTP-Mg complex (dUTP-Mg) into dUMP and pyrophosphate. This reaction is crucial for the preservation of chromosomal integrity as it removes dUTP and therefore reduces the cellular dUTP/dTTP ratio, and prevents dUTP from being incorporated into DNA. It also provides dUMP as the precursor for dTTP synthesis via the thymidylate synthase pathway. dUTPases are homotrimeric, except some monomeric viral dUTPases, which have been shown to mimic a trimer. Active sites are located at the subunit interface. Probab=99.94 E-value=2e-26 Score=170.41 Aligned_cols=92 Identities=46% Similarity=0.684 Sum_probs=84.5 Q ss_pred CEEEEEEECCCCCCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEEEEEECCCC Q ss_conf 40878910377873893799739996586022587614678752663133222023550366568788769999980794 Q gi|254781025|r 28 SGMDLFAALPEDEPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKILLINLGQE 107 (154) Q Consensus 28 AG~Dl~a~~~~~e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v~l~N~~~~ 107 (154) |||||++..+ .+.++|+|++.++++|++++++|+++++++.+|||++ |+|+.+.| +|+||+||+|++++.++|.+++ T Consensus 1 ag~Dl~~~~~-~~~~~i~p~~~~li~t~~~i~~p~~~~~~i~~RSsl~-~~Gl~v~~-~g~iD~gy~G~i~~~l~N~~~~ 77 (92) T cd07557 1 AGYDLRLGED-FEGIVLPPGETVLVPTGEAIELPEGYVGLVFPRSSLA-RKGITVHN-AGVIDPGYRGEITLELYNLGPE 77 (92) T ss_pred CCEEEEECCC-CCCEEECCCCEEEEEEEEEEECCCCCEEEEEECCHHE-ECCEEEEC-CCEECCCCCCEEEEEEEECCCC T ss_conf 9564675446-8858999998999980399998999489999346330-72669734-4148689876799999999899 Q ss_pred EEEEECCCCEEEEEE Q ss_conf 499955950358999 Q gi|254781025|r 108 NFLIMRGMRIAQLII 122 (154) Q Consensus 108 ~~~i~~G~RIaQlv~ 122 (154) ++.|++|+||||++| T Consensus 78 ~~~i~~G~riaQlvf 92 (92) T cd07557 78 PVVIKKGDRIAQLVF 92 (92) T ss_pred CEEECCCCEEEEEEC T ss_conf 889879997999979 No 9 >TIGR02274 dCTP_deam deoxycytidine triphosphate deaminase; InterPro: IPR011962 Members of this family include the Escherichia coli monofunctional deoxycytidine triphosphate deaminase (dCTP deaminase) and a Methanocaldococcus jannaschii bifunctional dCTP deaminase (3.5.4.13 from EC)/dUTP diphosphatase (3.6.1.23 from EC). For additional information please see .; GO: 0008829 dCTP deaminase activity, 0006229 dUTP biosynthetic process. Probab=99.91 E-value=5.8e-24 Score=156.41 Aligned_cols=112 Identities=31% Similarity=0.421 Sum_probs=101.5 Q ss_pred CCCCCCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEEEEEECCCCEEEEECCC Q ss_conf 37787389379973999658602258761467875266313322202355036656878876999998079449995595 Q gi|254781025|r 36 LPEDEPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKILLINLGQENFLIMRGM 115 (154) Q Consensus 36 ~~~~e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v~l~N~~~~~~~i~~G~ 115 (154) +.++|.++|+|++.+|+.|-..++||++..|++..||||| |.||.+.+++|.|||||+|.|++.|+|.++.|+.+.+|+ T Consensus 82 v~~~~~f~i~P~~f~La~T~E~v~lP~dvva~~~GRSSla-RlGl~~~~TAg~iDPGf~G~ITLEl~N~~~~Pv~L~~Gm 160 (198) T TIGR02274 82 VEEDEEFVIPPGEFVLATTLEYVKLPDDVVAFVEGRSSLA-RLGLFIHVTAGWIDPGFEGNITLELFNAGKLPVKLRPGM 160 (198) T ss_pred CCCCCEEEECCCCEEEEEEEEEEEECCCEEEEEECCCCEE-EEEEECCCCCCEECCCCCCEEEEEEECCCCCEEEECCCC T ss_conf 0489879969987688888999950885078886230113-320140662117868987649999876878707865898 Q ss_pred CEEEEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 03589997504340699871357754268888735467 Q gi|254781025|r 116 RIAQLIIANSVRAHPSLISAMPMGKNERNEKGFGSTGL 153 (154) Q Consensus 116 RIaQlv~~~~~~~~~~~v~~~~l~~t~RG~~GFGSTG~ 153 (154) ||||+||.+...|. ..+...+.|..+=.|++|+ T Consensus 161 Ri~QlvF~~l~~P~-----~~pY~~p~~~gKYqgQ~G~ 193 (198) T TIGR02274 161 RIAQLVFARLSSPA-----ERPYGGPRKSGKYQGQTGV 193 (198) T ss_pred EEEEEEEEECCCCH-----HHCCCCCCCCCCCCCCCCC T ss_conf 58999986417850-----1148864314340378776 No 10 >PRK00416 dcd deoxycytidine triphosphate deaminase; Reviewed Probab=99.77 E-value=4.8e-18 Score=122.74 Aligned_cols=105 Identities=25% Similarity=0.322 Sum_probs=86.7 Q ss_pred CCCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEEEEEECCCCEEEEECCCCEE Q ss_conf 87389379973999658602258761467875266313322202355036656878876999998079449995595035 Q gi|254781025|r 39 DEPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKILLINLGQENFLIMRGMRIA 118 (154) Q Consensus 39 ~e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v~l~N~~~~~~~i~~G~RIa 118 (154) ++.+.|+||+..|+.|-..+.+|+++.+++.+|||++ |.||.+. ++.||+||+|.+.+.++|.++.++.+.+|+||| T Consensus 65 ~e~~iL~Pg~~~L~~T~E~i~lP~~~~~~~~~rSs~a-R~Gl~v~--~t~~dpG~~G~ltlel~N~~~~pv~l~~g~~i~ 141 (174) T PRK00416 65 GEVFILPPGEFALARTLEYFKLPDDVVGILEGRSSLA-RLGLIVH--VTAIDPGWEGHITLEFSNTGPLPVKLYPGEGIG 141 (174) T ss_pred CCEEEECCCCEEEEEEEEECCCCCCEEEEEEECCHHH-CCCEEEE--CCEECCCCCEEEEEEEEECCCCCEEECCCCEEE T ss_conf 9858977997999999883059878999999635021-4735987--111089841279999996888866574799199 Q ss_pred EEEEEECCCCEEEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 89997504340699871357754268888735467 Q gi|254781025|r 119 QLIIANSVRAHPSLISAMPMGKNERNEKGFGSTGL 153 (154) Q Consensus 119 Qlv~~~~~~~~~~~v~~~~l~~t~RG~~GFGSTG~ 153 (154) |++|.+...+. ..+.++ .+| +=.|++|+ T Consensus 142 Ql~F~~~~~~~-----~~~Y~~-~~G-kYqgq~g~ 169 (174) T PRK00416 142 QLLFFELSGPA-----ERPYAD-RKG-KYQGQRGV 169 (174) T ss_pred EEEEEECCCCC-----CCCCCC-CCC-CCCCCCCC T ss_conf 99999668865-----557554-586-45798998 No 11 >PRK02253 deoxyuridine 5'-triphosphate nucleotidohydrolase; Provisional Probab=99.76 E-value=2.8e-18 Score=124.08 Aligned_cols=87 Identities=26% Similarity=0.344 Sum_probs=77.1 Q ss_pred CCCCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEEEEEECCCCEEEEECCCCE Q ss_conf 78738937997399965860225876146787526631332220235503665687887699999807944999559503 Q gi|254781025|r 38 EDEPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKILLINLGQENFLIMRGMRI 117 (154) Q Consensus 38 ~~e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v~l~N~~~~~~~i~~G~RI 117 (154) .++.+.++||. .++.|...++||++++|++.|||||+ |.|+.+.+ +++|+||+|.+.+.|.|.+..++.+++|+|| T Consensus 58 ~d~~~~L~pg~-yll~t~E~v~iP~d~~a~~~~RSSl~-R~G~~i~~--av~DpGf~G~~~~~l~~~n~~~i~l~~g~Ri 133 (159) T PRK02253 58 DDGWIHLPPGI-YKVVFNEIVNIPKDHVAFAYPRSSLL-RCGCTLET--AVWDAGYEGRSESLLVVHNPHGIRLERGARI 133 (159) T ss_pred CCCEEEECCCE-EEEEEEEEEECCCCCEEEEECHHHHE-EEEEEECC--CCCCCCCCCEEEEEEEECCCCCEEEECCCEE T ss_conf 99549967988-99998669973806199996111430-31078647--6328983150699999868877798689999 Q ss_pred EEEEEEECCCC Q ss_conf 58999750434 Q gi|254781025|r 118 AQLIIANSVRA 128 (154) Q Consensus 118 aQlv~~~~~~~ 128 (154) |||||.+...+ T Consensus 134 ~Qlvf~~~~~~ 144 (159) T PRK02253 134 AQLVFIRLEAE 144 (159) T ss_pred EEEEEEECCCC T ss_conf 99999983687 No 12 >PHA01707 dut 2'-deoxyuridine 5'-triphosphatase Probab=99.75 E-value=6e-18 Score=122.22 Aligned_cols=100 Identities=21% Similarity=0.317 Sum_probs=85.5 Q ss_pred CCCCEEEEEEE-----------CCCCCCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCC Q ss_conf 88640878910-----------3778738937997399965860225876146787526631332220235503665687 Q gi|254781025|r 25 SGSSGMDLFAA-----------LPEDEPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDY 93 (154) Q Consensus 25 ~~~AG~Dl~a~-----------~~~~e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~Y 93 (154) -+.++|||+-- .+..+...++|++.+|+.|-..+.+|.++.+++.+|||++ |.||.+. +++||+|| T Consensus 27 v~p~s~DLrLg~e~~~~~~~~~~~~~~~fil~P~~~~L~~T~E~i~lP~d~~~~~~~RSS~a-R~Gl~i~--~t~iDpGf 103 (158) T PHA01707 27 IRENGVDLKIGNEIVRIKENMEKEVGDEFIIYPHEHVLLTTKEYIKLPNDIIAFCNLRSTFA-RKGLLIP--PTIVDAGF 103 (158) T ss_pred CCCCEEEEEECCEEEECCCCCCCCCCCEEEECCCCEEEEEEEEEEECCCCEEEEEEEECHHH-HCCEEEC--CCEECCCE T ss_conf 27743898847898981145412359859988998899988899986856899998222476-1766634--84008962 Q ss_pred CCCEEEEEEECCCCEEEEECCCCEEEEEEEECCCC Q ss_conf 88769999980794499955950358999750434 Q gi|254781025|r 94 RGEIKILLINLGQENFLIMRGMRIAQLIIANSVRA 128 (154) Q Consensus 94 rGei~v~l~N~~~~~~~i~~G~RIaQlv~~~~~~~ 128 (154) +|.+.+.+.| +..|+.+.+|+||||++|.+...+ T Consensus 104 ~G~~TLEl~n-~~~pi~l~~G~rI~Ql~F~~~~~~ 137 (158) T PHA01707 104 EGQLTIELVG-SSIPVKLKSGERFLHLIFARTLTP 137 (158) T ss_pred EEEEEEEEEC-CCCCEEECCCCEEEEEEEEECCCC T ss_conf 5279999963-888889868996979999973786 No 13 >COG0717 Dcd Deoxycytidine deaminase [Nucleotide transport and metabolism] Probab=99.72 E-value=8.2e-17 Score=115.74 Aligned_cols=83 Identities=30% Similarity=0.375 Sum_probs=78.6 Q ss_pred CEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEEEEEECCCCEEEEECCCCEEEE Q ss_conf 38937997399965860225876146787526631332220235503665687887699999807944999559503589 Q gi|254781025|r 41 PVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKILLINLGQENFLIMRGMRIAQL 120 (154) Q Consensus 41 ~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v~l~N~~~~~~~i~~G~RIaQl 120 (154) .+.|+|++..|..|-..++||+++.+++.+|||++ |.|+.+. +|++|+||+|.+.+.+.|.++.|+.+.+|+||||+ T Consensus 74 ~~il~P~~~~L~~t~E~i~iP~~v~~~~~gRSsla-R~G~~~~--~~~~DpGf~G~~tle~~n~~~~p~~L~~g~rI~QL 150 (183) T COG0717 74 EFILPPGEFYLAVTLEYVEIPEDVAAFCTGRSSLA-RLGLIVH--VGVIDPGFEGRITLELVNSGPLPIRLYPGERIAQL 150 (183) T ss_pred CEEECCCCEEEEEEEEEEECCCCEEEEEECCCCHH-HCCEEEC--CCCCCCCCCCEEEEEEEECCCCCEEECCCCEEEEE T ss_conf 38978995899998889982806289997347065-1837864--73488984865899999678888487589999999 Q ss_pred EEEECC Q ss_conf 997504 Q gi|254781025|r 121 IIANSV 126 (154) Q Consensus 121 v~~~~~ 126 (154) ||.+.. T Consensus 151 vF~~l~ 156 (183) T COG0717 151 VFLELD 156 (183) T ss_pred EEEECC T ss_conf 999745 No 14 >pfam06559 DCD 2'-deoxycytidine 5'-triphosphate deaminase (DCD). This family consists of several bacterial 2'-deoxycytidine 5'-triphosphate deaminase proteins (EC:3.5.4.13). Probab=98.10 E-value=1.4e-05 Score=51.87 Aligned_cols=84 Identities=18% Similarity=0.209 Sum_probs=67.1 Q ss_pred CCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEEC----CCC--CEEECCCCCCCEEEEEEECCCCEEEEEC Q ss_conf 7389379973999658602258761467875266313322202----355--0366568788769999980794499955 Q gi|254781025|r 40 EPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVAC----LNS--PGTIDSDYRGEIKILLINLGQENFLIMR 113 (154) Q Consensus 40 e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~----~n~--~GvID~~YrGei~v~l~N~~~~~~~i~~ 113 (154) ...+++.|...+|++...+.+|++..|...|+||.+ |..+.+ -++ ---|.+||.|+|.+.+.-. .-++.+.+ T Consensus 81 ~gavlEkgcvYiv~L~E~L~Lp~~i~a~aNPKSSTG-RLDiFtR~ItD~~~efD~I~~GY~GpLy~EI~Pr-sFsI~vr~ 158 (366) T pfam06559 81 DGAVLETGCVYLVPLQEGLALPADISAAANPKSSTG-RLDVFTRVITDGGREFDRIPAGYDGPLYAEISPR-TFSVLVRP 158 (366) T ss_pred CCCEEECCCEEEEEEHHHCCCCCCCEEEECCCCCCC-CEEEEEEEECCCCCCCCCCCCCCCCCEEEEECCC-EEEEEEEC T ss_conf 983663697899981531379978658865887664-0378999942588301025778865258997455-23079807 Q ss_pred CCCEEEEEEEEC Q ss_conf 950358999750 Q gi|254781025|r 114 GMRIAQLIIANS 125 (154) Q Consensus 114 G~RIaQlv~~~~ 125 (154) |+|+.|+-|..- T Consensus 159 G~rLnQiRfr~g 170 (366) T pfam06559 159 GSRLSQIRFRRG 170 (366) T ss_pred CCEEEEEEECCC T ss_conf 980443665179 No 15 >PRK07559 2'-deoxycytidine 5'-triphosphate deaminase; Provisional Probab=98.00 E-value=2.7e-05 Score=50.24 Aligned_cols=84 Identities=17% Similarity=0.207 Sum_probs=67.1 Q ss_pred CCCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEEC----CCC--CEEECCCCCCCEEEEEEECCCCEEEEE Q ss_conf 87389379973999658602258761467875266313322202----355--036656878876999998079449995 Q gi|254781025|r 39 DEPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVAC----LNS--PGTIDSDYRGEIKILLINLGQENFLIM 112 (154) Q Consensus 39 ~e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~----~n~--~GvID~~YrGei~v~l~N~~~~~~~i~ 112 (154) ....+++.|...++++...+.+|++..|...|+||.+ |-.+.. -++ .--|.+||.|++.+.+.-. .-++.+. T Consensus 79 ~~gavlekgcvYlv~L~E~L~LP~~isa~aNPKSSTG-RLDvFtR~ItD~~~eFD~Ip~GY~GpLY~EI~Pr-TFsI~Vr 156 (366) T PRK07559 79 TDGAVLETGCVYIVPLLESLALPADLSASANPKSSTG-RLDVFTRVITDGAREFDKIPAGYHGPLYAEISPR-TFPVLVR 156 (366) T ss_pred CCCEEEECCCEEEEEEHHHCCCCCCCEEEECCCCCCE-EEEEEEEEECCCCCHHCCCCCCCCCCEEEEECCC-EEEEEEE T ss_conf 8982786797899983431269988769865777640-5788999950588301026888864158997576-4508984 Q ss_pred CCCCEEEEEEEE Q ss_conf 595035899975 Q gi|254781025|r 113 RGMRIAQLIIAN 124 (154) Q Consensus 113 ~G~RIaQlv~~~ 124 (154) +|+|..|+-|.. T Consensus 157 ~G~rLnQiRfr~ 168 (366) T PRK07559 157 TGSRLSQIRFRR 168 (366) T ss_pred CCCCHHHEEECC T ss_conf 798200144227 No 16 >pfam06559 DCD 2'-deoxycytidine 5'-triphosphate deaminase (DCD). This family consists of several bacterial 2'-deoxycytidine 5'-triphosphate deaminase proteins (EC:3.5.4.13). Probab=96.77 E-value=0.01 Score=35.62 Aligned_cols=85 Identities=16% Similarity=0.266 Sum_probs=63.7 Q ss_pred CCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCC-------EEEEEEECCCCEEEEE Q ss_conf 738937997399965860225876146787526631332220235503665687887-------6999998079449995 Q gi|254781025|r 40 EPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGE-------IKILLINLGQENFLIM 112 (154) Q Consensus 40 e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGe-------i~v~l~N~~~~~~~i~ 112 (154) ..++|.|++..+...-.++.+|+++.|...|=.-..- -..+. =+|-.|+||-.. =.|.=.-.-+.|+.++ T Consensus 248 ~~liLdp~eFYIL~SkE~V~IPp~~AAEM~pyd~~vG--eFRvH-YAGFFDPGFG~~~~gg~GsraVLEVRs~evPF~le 324 (366) T pfam06559 248 RELILDPGEFYILVSREAVHIPPDYAAEMAPFDPLVG--EFRVH-YAGFFDPGFGHAAAGGAGSRAVLEVRSHEVPFILE 324 (366) T ss_pred CEEEECCCCEEEEEECCEEEECCCHHEECCCCCCCCC--CEEEE-ECCCCCCCCCCCCCCCCCCCEEEEEECCCCCEEEE T ss_conf 8599778867999754336747331012524323566--41213-42312588787666777740799983487876766 Q ss_pred CCCCEEEEEEEECCC Q ss_conf 595035899975043 Q gi|254781025|r 113 RGMRIAQLIIANSVR 127 (154) Q Consensus 113 ~G~RIaQlv~~~~~~ 127 (154) .|+.||.|++.+... T Consensus 325 dgQ~v~rL~yE~m~~ 339 (366) T pfam06559 325 HGQIVGRLVYERMAA 339 (366) T ss_pred CCCEEEEEEEEECCC T ss_conf 797889999877004 No 17 >PRK07559 2'-deoxycytidine 5'-triphosphate deaminase; Provisional Probab=96.66 E-value=0.012 Score=35.09 Aligned_cols=86 Identities=16% Similarity=0.286 Sum_probs=64.6 Q ss_pred CCCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCC-------EEEEEEECCCCEEEE Q ss_conf 8738937997399965860225876146787526631332220235503665687887-------699999807944999 Q gi|254781025|r 39 DEPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGE-------IKILLINLGQENFLI 111 (154) Q Consensus 39 ~e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGe-------i~v~l~N~~~~~~~i 111 (154) +..++|.|++..+...-.++.+|+++.|...|-.-..- -..+. =+|-.|+||-.. =-|.=.-.-+.|+.+ T Consensus 247 ~~~LILdp~eFYIL~SkE~V~iP~~~AaEM~pyd~~~G--efRvH-YAGFFDPGFG~~~~g~~gsraVLEVRs~evPF~l 323 (366) T PRK07559 247 DGELILDPGEFYILASREAVHVPPDYAAEMVPFDPLVG--EFRVH-YAGFFDPGFGHAEAGGTGSRAVLEVRSHEVPFIL 323 (366) T ss_pred CCEEEECCCCEEEEEECCEEEECCCHHHHCCCCCCCCC--CEEEE-ECCCCCCCCCCCCCCCCCCEEEEEEECCCCCEEE T ss_conf 98799737867999634436657741431335444666--34444-5021258778776778773289998348787587 Q ss_pred ECCCCEEEEEEEECCC Q ss_conf 5595035899975043 Q gi|254781025|r 112 MRGMRIAQLIIANSVR 127 (154) Q Consensus 112 ~~G~RIaQlv~~~~~~ 127 (154) +.|+-||+|++.+... T Consensus 324 e~gQ~v~rl~ye~m~~ 339 (366) T PRK07559 324 EHGQIVGRLVYERMLE 339 (366) T ss_pred ECCCEEEEEEEEECCC T ss_conf 6897889999887014 No 18 >pfam05784 Herpes_UL82_83 Betaherpesvirus UL82/83 protein N terminus. This family represents the N terminal region of the Betaherpesvirus UL82 and UL83 proteins. As viruses are reliant upon their host cell to serve as proper environments for their replication, many have evolved mechanisms to alter intracellular conditions to suit their own needs. Human cytomegalovirus induces quiescent cells to enter the cell cycle and then arrests them in late G(1), before they enter the S phase, a cell cycle compartment that is presumably favourable for viral replication. The protein product of the human cytomegalovirus UL82 gene, pp71, can accelerate the movement of cells through the G(1) phase of the cell cycle. This activity would help infected cells reach the late G(1) arrest point sooner and thus may stimulate the infectious cycle. pp71 also induces DNA synthesis in quiescent cells, but a pp71 mutant protein that is unable to induce quiescent cells to enter the cell cycle still retains the abilit Probab=95.37 E-value=0.21 Score=28.08 Aligned_cols=93 Identities=16% Similarity=0.125 Sum_probs=61.6 Q ss_pred CCCCCCCCCEEEEEEECCCCCCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEE Q ss_conf 74357886408789103778738937997399965860225876146787526631332220235503665687887699 Q gi|254781025|r 20 PEYKTSGSSGMDLFAALPEDEPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKI 99 (154) Q Consensus 20 P~~~t~~~AG~Dl~a~~~~~e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v 99 (154) |.-....+-||-+.+ +.++.++||+...+-...+++-...|.|+++|+.-- ||.+ ++|... =+-.|.+ T Consensus 249 P~L~~~~~NGftV~~----Pr~~~l~~g~~~~v~i~~~feS~~~~~~lF~P~~ip----Glsi--~~~~w~--~~~~L~i 316 (346) T pfam05784 249 PFLRPHPRNGFTVLC----PKNLHLKPGQTSHVMIDNAFESDQTHAGLFFPKDIP----GLSI--SCGPWM--ERQPLFI 316 (346) T ss_pred CCCCCCCCCCEEEEC----CCCEECCCCCEEEEEEEEEEECCCCEEEEEECCCCC----CCEE--EECCCC--CCCEEEE T ss_conf 732117999789978----875660899869999974895188169999265579----9776--732158--9974899 Q ss_pred EEEECCCCEEEEECCCCEEEEEEEEC Q ss_conf 99980794499955950358999750 Q gi|254781025|r 100 LLINLGQENFLIMRGMRIAQLIIANS 125 (154) Q Consensus 100 ~l~N~~~~~~~i~~G~RIaQlv~~~~ 125 (154) .+.+.++ ++.|+.++.++-+-|.+. T Consensus 317 ~i~a~~~-~v~i~~~q~LG~lhFf~r 341 (346) T pfam05784 317 ELRATKE-SVRIEHMQPLGELHFFPR 341 (346) T ss_pred EEEECCC-CEECCCCCEEEEEEEEEC T ss_conf 9984355-410237875348998515 No 19 >pfam04797 Herpes_ORF11 Herpesvirus dUTPase protein. This family of proteins are found in Herpesvirus proteins. This family includes proteins called ORF10 and ORF11 amongst others. However, these proteins seem to be related to other dUTPases pfam00692 suggesting that these proteins are also dUTPases (Bateman A pers. obs.). Probab=95.29 E-value=0.23 Score=27.94 Aligned_cols=81 Identities=12% Similarity=0.159 Sum_probs=52.3 Q ss_pred EEEEEEECCCCCCEEECCCCEEEEECCCHHHCCCCE---EEEEEECCCCEEEEEEECCCCCEEECC-CC--CCCEEEEEE Q ss_conf 087891037787389379973999658602258761---467875266313322202355036656-87--887699999 Q gi|254781025|r 29 GMDLFAALPEDEPVELLPGMRSLIPGGYIIAIPPGY---EGQVRSRSGLALNYGVACLNSPGTIDS-DY--RGEIKILLI 102 (154) Q Consensus 29 G~Dl~a~~~~~e~~~i~pg~~~li~Tgi~i~ip~g~---~~~i~~RSsl~~K~gl~~~n~~GvID~-~Y--rGei~v~l~ 102 (154) -.=+|. +..++|+|++.+.|+-....+.+..- .+.|..-+... .. +|++ =| -..+.+.++ T Consensus 260 v~~iY~----~~~~~IpP~~~~~v~~~~~y~~~~~~~~~~~~I~~~~~~~---~~-------~i~p~~W~P~~~~~l~v~ 325 (384) T pfam04797 260 VPPVYP----GPEVTIPPGESTKVKYNNTYESGLPSKITAAFITGLASNP---DF-------VIEPCEWLPGSTLKLTVY 325 (384) T ss_pred EEEECC----CCCEEECCCCEEEEEECCEEEECCCCCCCEEEEECCCCCC---EE-------EEEEEEECCCCEEEEEEE T ss_conf 555917----9946889997799998529983588765469998589997---39-------997468779981799999 Q ss_pred ECCCCEEEEECCCCEEEEEEE Q ss_conf 807944999559503589997 Q gi|254781025|r 103 NLGQENFLIMRGMRIAQLIIA 123 (154) Q Consensus 103 N~~~~~~~i~~G~RIaQlv~~ 123 (154) |.++.+.+|.+|+++||.+|. T Consensus 326 N~s~~pitI~~gt~la~A~fi 346 (384) T pfam04797 326 NPSNFPLTISRGTPLALALFI 346 (384) T ss_pred CCCCCCEEECCCCEEEEEEEE T ss_conf 689960896189999999998 No 20 >TIGR00266 TIGR00266 conserved hypothetical protein TIGR00266; InterPro: IPR002838 The proteins in this family have no known function.. Probab=70.20 E-value=5.6 Score=20.00 Aligned_cols=85 Identities=21% Similarity=0.331 Sum_probs=51.4 Q ss_pred EEEEEECCCCCCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCE--EEE------EEEC----C--CCCEE--ECCCC Q ss_conf 8789103778738937997399965860225876146787526631--332------2202----3--55036--65687 Q gi|254781025|r 30 MDLFAALPEDEPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLA--LNY------GVAC----L--NSPGT--IDSDY 93 (154) Q Consensus 30 ~Dl~a~~~~~e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~--~K~------gl~~----~--n~~Gv--ID~~Y 93 (154) +=+||. -.++++|+|..+|.||..+.|..+....=..|-||= +|+ .|.+ + ...|. .=+-| T Consensus 8 ~p~ys~----l~~~LkPge~v~~~~g~~~~M~~~v~l~~~~~GGLL~aL~r~Llg~e~LflN~f~~~s~~~g~l~LApAl 83 (230) T TIGR00266 8 RPSYSM----LELTLKPGERVLAETGAMVSMDRSVELSTELRGGLLGALKRSLLGKESLFLNAFTAESQGEGSLQLAPAL 83 (230) T ss_pred CCCEEE----EEEEECCCCEEEECCCCEEEECCCEEEEEEECCCHHHHHHHHHCCCCEEEEEEEECCCCCCCEEEECCCC T ss_conf 440148----9988088858997277079851877999740686479889873477414775444676871104545778 Q ss_pred CCCEEEEEEECCCCEEEEECCCCEEE Q ss_conf 88769999980794499955950358 Q gi|254781025|r 94 RGEIKILLINLGQENFLIMRGMRIAQ 119 (154) Q Consensus 94 rGei~v~l~N~~~~~~~i~~G~RIaQ 119 (154) .|+|..-+. ..++.+.+++|.-+|. T Consensus 84 PGdi~~yl~-~~~~gI~~~~g~FlA~ 108 (230) T TIGR00266 84 PGDITHYLL-LTKKGIYAQSGAFLAS 108 (230) T ss_pred CCCEEEEEE-CCCCCEEECCCCEEEC T ss_conf 876799977-4788489667855604 No 21 >COG1124 DppF ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism] Probab=62.89 E-value=3.5 Score=21.17 Aligned_cols=67 Identities=18% Similarity=0.121 Sum_probs=39.8 Q ss_pred CCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEEEEEECCC-CEEEEECCCCEEEEEEEEC Q ss_conf 58602258761467875266313322202355036656878876999998079-4499955950358999750 Q gi|254781025|r 54 GGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKILLINLGQ-ENFLIMRGMRIAQLIIANS 125 (154) Q Consensus 54 Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v~l~N~~~-~~~~i~~G~RIaQlv~~~~ 125 (154) -++.+++.+|-..=|.+.||.+ |.-|.-+ .+|.-.+ +.|+|.+.-.-... ..-. .-.|..||||+-. T Consensus 24 ~~VS~~i~~Ge~lgivGeSGsG-KSTL~r~-l~Gl~~p-~~G~I~~~G~~~~~~~~~~--~~~~~VQmVFQDp 91 (252) T COG1124 24 NNVSLEIERGETLGIVGESGSG-KSTLARL-LAGLEKP-SSGSILLDGKPLAPKKRAK--AFYRPVQMVFQDP 91 (252) T ss_pred CCEEEEECCCCEEEEECCCCCC-HHHHHHH-HHCCCCC-CCCEEEECCCCCCCCCCCH--HHCCCEEEEECCC T ss_conf 4325996489789998489898-8899999-9565678-8862898884057665303--3304506995187 No 22 >TIGR02868 CydC ABC transporter, CydDC cysteine exporter (CydDC-E) family, permease/ATP-binding protein CydC; InterPro: IPR014223 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents CydC, a member of a heterodimeric ATP-binding cassette-type transporter (ABC transporter). It is involved in the export of glutathione from the cytoplasm to the periplasm and is required for the assembly of both cytochrome c and cytochrome bd , , .; GO: 0042626 ATPase activity coupled to transmembrane movement of substances, 0006810 transport, 0016021 integral to membrane. Probab=50.47 E-value=5.8 Score=19.94 Aligned_cols=83 Identities=17% Similarity=0.260 Sum_probs=54.4 Q ss_pred CCEEE-CCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEEEEEECCCCEEEEECCCCEE Q ss_conf 73893-79973999658602258761467875266313322202355036656878876999998079449995595035 Q gi|254781025|r 40 EPVEL-LPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKILLINLGQENFLIMRGMRIA 118 (154) Q Consensus 40 e~~~i-~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v~l~N~~~~~~~i~~G~RIa 118 (154) +++.. .|+...-+=.|+..++++|-.--|.+|||.+ |+=|-.. -.|.+||. +|++.+.=. +..=-.++-+. T Consensus 363 ~~ls~~~pg~~~~vl~~V~L~l~~G~r~Ai~G~SG~G-KsTLL~~-L~G~l~P~-~G~vtl~G~-----~~~~~~~~evr 434 (566) T TIGR02868 363 RDLSVGYPGAPPNVLDGVSLDLPPGERVAILGPSGSG-KSTLLAT-LAGLLDPL-QGEVTLDGV-----PVSSLSEDEVR 434 (566) T ss_pred EEEEEECCCCCHHHHCCCCCCCCCCCCEEEECCCCCC-HHHHHHH-HHHHCCCC-CCCEEECCC-----CHHHCCCCHHE T ss_conf 8776526987346542786411388608986688765-7899999-98402899-991787773-----24325731100 Q ss_pred EEEEEECCCCEE Q ss_conf 899975043406 Q gi|254781025|r 119 QLIIANSVRAHP 130 (154) Q Consensus 119 Qlv~~~~~~~~~ 130 (154) ..|..--+++|. T Consensus 435 r~v~~~aQ~aHl 446 (566) T TIGR02868 435 RRVSVFAQDAHL 446 (566) T ss_pred HHEEECCCCCCC T ss_conf 000312788621 No 23 >cd03262 ABC_HisP_GlnQ_permeases HisP and GlnQ are the ATP-binding components of the bacterial periplasmic histidine and glutamine permeases, repectively. Histidine permease is a multisubunit complex containing the HisQ and HisM integral membrane subunits and two copies of HisP. HisP has properties intermediate between those of integral and peripheral membrane proteins and is accessible from both sides of the membrane, presumably by its interaction with HisQ and HisM. The two HisP subunits form a homodimer within the complex. The domain structure of the amino acid uptake systems is typical for prokaryote extracellular solute binding protein-dependent uptake systems. All of the amino acid uptake systems also have at least one, and in a few cases, two extracellular solute binding proteins located in the periplasm of Gram-negative bacteria, or attached to the cell membrane of Gram-positive bacteria. The best-studied member of the PAAT (polar amino acid transport) family is the HisJQM Probab=48.30 E-value=3.5 Score=21.16 Aligned_cols=70 Identities=14% Similarity=0.077 Sum_probs=44.1 Q ss_pred ECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEEEEEECCCCEEEEECCCCEEEEEEEEC Q ss_conf 6586022587614678752663133222023550366568788769999980794499955950358999750 Q gi|254781025|r 53 PGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKILLINLGQENFLIMRGMRIAQLIIANS 125 (154) Q Consensus 53 ~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v~l~N~~~~~~~i~~G~RIaQlv~~~~ 125 (154) =-++.+++++|-..-|.+.||-+ |+= +.++.+-.|.-..|++.+.=.+.+.....+.+=.|=..+||+.+ T Consensus 16 L~~vsl~i~~Ge~~~ivGpSGsG-KST--LL~~i~gL~~p~~G~i~i~g~~i~~~~~~~~~~rr~iG~VFQ~~ 85 (213) T cd03262 16 LKGIDLTVKKGEVVVIIGPSGSG-KST--LLRCINLLEEPDSGTIIIDGLKLTDDKKNINELRQKVGMVFQQF 85 (213) T ss_pred ECCEEEEECCCCEEEEECCCCCH-HHH--HHHHHHCCCCCCCEEEEECCEECCCCHHHHHHHHCCCEEEECCC T ss_conf 86707598899899999999844-999--99999819999864999999999998156999867827996798 No 24 >TIGR01322 scrB_fam sucrose-6-phosphate hydrolase; InterPro: IPR006232 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families , , . This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site . Because the fold of proteins is better conserved than their sequences, some of the families can be grouped in 'clans'. Glycoside hydrolase family 32 comprises enzymes with several known activities; invertase (3.2.1.26 from EC); inulinase; levanase (3.2.1.7 from EC); exo-inulinase (3.2.1.7 from EC); sucrose:sucrose 1-fructosyltransferase (3.2.1.7 from EC); and fructan:fructan 1-fructosyltransferase (2.4.1.100 from EC). This subfamily represents sucrose-6-phosphate hydrolase.; GO: 0004564 beta-fructofuranosidase activity, 0005975 carbohydrate metabolic process, 0005737 cytoplasm. Probab=47.11 E-value=9.6 Score=18.68 Aligned_cols=26 Identities=23% Similarity=0.254 Sum_probs=18.5 Q ss_pred CCCCCCEEEEEEECCCCCCEEECCCCEEEE Q ss_conf 578864087891037787389379973999 Q gi|254781025|r 23 KTSGSSGMDLFAALPEDEPVELLPGMRSLI 52 (154) Q Consensus 23 ~t~~~AG~Dl~a~~~~~e~~~i~pg~~~li 52 (154) =++.|.|||+||. +.+.=+-|.+.|| T Consensus 272 f~eLD~GFdFYAp----Q~~~d~dGR~lL~ 297 (475) T TIGR01322 272 FHELDYGFDFYAP----QTFLDEDGRRLLV 297 (475) T ss_pred HCCCCCCCCCCCC----CCCCCCCCCEEEE T ss_conf 3157887001221----0114713045431 No 25 >pfam06030 DUF916 Bacterial protein of unknown function (DUF916). This family consists of several hypothetical bacterial proteins of unknown function. Probab=46.69 E-value=18 Score=17.11 Aligned_cols=27 Identities=19% Similarity=0.343 Sum_probs=15.1 Q ss_pred CCCCCCCCCCCEEEEEEECCCCCCEEECCCCEEEEE Q ss_conf 037435788640878910377873893799739996 Q gi|254781025|r 18 PLPEYKTSGSSGMDLFAALPEDEPVELLPGMRSLIP 53 (154) Q Consensus 18 ~lP~~~t~~~AG~Dl~a~~~~~e~~~i~pg~~~li~ 53 (154) .+|....+..++| -++.+.|++..-+. T Consensus 6 ~lp~nQ~dk~~~Y---------FdL~~~p~q~q~l~ 32 (121) T pfam06030 6 VLPENQVDKNASY---------FDLKVKPGQTQTLQ 32 (121) T ss_pred ECCCCCCCCCCCE---------EEEEECCCCEEEEE T ss_conf 0884234788660---------89993899989999 No 26 >TIGR03608 L_ocin_972_ABC putative bacteriocin export ABC transporter, lactococcin 972 group. A gene pair with a fairly wide distribution consists of a polypeptide related to the lactococcin 972 (see TIGR01653) and multiple-membrane-spanning putative immunity protein (see TIGR01654). This model represents a small clade within the ABC transporters that regularly are found adjacent to these bacteriocin system gene pairs and are likely serve as export proteins. Probab=45.87 E-value=7 Score=19.47 Aligned_cols=52 Identities=17% Similarity=0.148 Sum_probs=38.6 Q ss_pred EECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEEEEEECCC Q ss_conf 9658602258761467875266313322202355036656878876999998079 Q gi|254781025|r 52 IPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKILLINLGQ 106 (154) Q Consensus 52 i~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v~l~N~~~ 106 (154) +=-++.+++++|-..-|.+.||-+ |+ ++.+..+-+|.-..|++.+.=.+... T Consensus 13 vL~~vsl~i~~Ge~~~i~GpSGsG-KS--TLL~~i~gl~~p~sG~i~~~g~~~~~ 64 (206) T TIGR03608 13 ILDDLNLTIEKGKMVAIVGESGSG-KS--TLLNIIGLLEKPDSGQVYLNGQETPP 64 (206) T ss_pred EECCEEEEECCCCEEEEECCCCCC-HH--HHHHHHHCCCCCCCEEEEECCEECCC T ss_conf 975807798699899998799970-99--99999975999897599999999998 No 27 >KOG0729 consensus Probab=44.16 E-value=8.3 Score=19.04 Aligned_cols=86 Identities=13% Similarity=0.218 Sum_probs=56.2 Q ss_pred EEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEEE-E-------EECCC--CEEEEECCCCEE- Q ss_conf 999658602258761467875266313322202355036656878876999-9-------98079--449995595035- Q gi|254781025|r 50 SLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKIL-L-------INLGQ--ENFLIMRGMRIA- 118 (154) Q Consensus 50 ~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v~-l-------~N~~~--~~~~i~~G~RIa- 118 (154) +-.+||++ |+.+|-+...|--+.--+.|.++.|.-||++++...-.++ + .-+++ .|..|+.|.|+. T Consensus 68 kEsdTGLA---Pp~~WDl~~Dkq~mq~EqpLqVArctkii~~~~~d~~Yvin~KqiaKfvV~lg~~vsptdIeeGMRVGv 144 (435) T KOG0729 68 KESDTGLA---PPALWDLAADKQRMQEEQPLQVARCTKIISGNSEDPKYVINVKQIAKFVVGLGDRVSPTDIEEGMRVGV 144 (435) T ss_pred CCCCCCCC---CHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCCCCEEEEHHHHHHHHHCCCCCCCCHHHHHHHEECC T ss_conf 53346889---747776761387752347722002222037987786425529988888645565568023554302042 Q ss_pred -----EE--EEEECCCCEEEEEECCCC Q ss_conf -----89--997504340699871357 Q gi|254781025|r 119 -----QL--IIANSVRAHPSLISAMPM 138 (154) Q Consensus 119 -----Ql--v~~~~~~~~~~~v~~~~l 138 (154) |+ -+-|-..+....+.+++- T Consensus 145 DrnKYqI~lPLPpkIDpsVTmM~VEek 171 (435) T KOG0729 145 DRNKYQIQLPLPPKIDPSVTMMQVEEK 171 (435) T ss_pred CCCCEEEECCCCCCCCCCCEEEEEECC T ss_conf 454114753689888975047875127 No 28 >COG4988 CydD ABC-type transport system involved in cytochrome bd biosynthesis, ATPase and permease components [Energy production and conversion / Posttranslational modification, protein turnover, chaperones] Probab=43.59 E-value=12 Score=18.20 Aligned_cols=63 Identities=19% Similarity=0.173 Sum_probs=42.0 Q ss_pred CCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEEE---EEECC Q ss_conf 7389379973999658602258761467875266313322202355036656878876999---99807 Q gi|254781025|r 40 EPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKIL---LINLG 105 (154) Q Consensus 40 e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v~---l~N~~ 105 (154) |++.+.-.+......++.+.+++|-.--|.++||-+ |.-+.-. -.|.- +.|.|||.++ +.|.+ T Consensus 324 ~~l~~~y~~g~~~l~~l~~t~~~g~~talvG~SGaG-KSTLl~l-L~G~~-~~~~G~I~vng~~l~~l~ 389 (559) T COG4988 324 ENLSFRYPDGKPALSDLNLTIKAGQLTALVGASGAG-KSTLLNL-LLGFL-APTQGEIRVNGIDLRDLS 389 (559) T ss_pred CCEEEECCCCCCCCCCCEEEECCCCEEEEECCCCCC-HHHHHHH-HHCCC-CCCCCEEEECCCCCCCCC T ss_conf 214785589985566710675489679998899997-8999999-84757-777844888993100068 No 29 >COG1136 SalX ABC-type antimicrobial peptide transport system, ATPase component [Defense mechanisms] Probab=42.07 E-value=10 Score=18.46 Aligned_cols=70 Identities=21% Similarity=0.203 Sum_probs=47.8 Q ss_pred EECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEEEEEECCC---CEEEEECCCCEEEEEEEEC Q ss_conf 9658602258761467875266313322202355036656878876999998079---4499955950358999750 Q gi|254781025|r 52 IPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKILLINLGQ---ENFLIMRGMRIAQLIIANS 125 (154) Q Consensus 52 i~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v~l~N~~~---~~~~i~~G~RIaQlv~~~~ 125 (154) +=-++.+++++|-+.-|.+.||-+ |+ ++.|..|-+|.-..|++.+.-.+.+. +...--+.++|++ ||+.+ T Consensus 20 ~L~~v~l~i~~Ge~vaI~GpSGSG-KS--TLLniig~ld~pt~G~v~i~g~~~~~l~~~~~~~~R~~~iGf-vFQ~~ 92 (226) T COG1136 20 ALKDVNLEIEAGEFVAIVGPSGSG-KS--TLLNLLGGLDKPTSGEVLINGKDLTKLSEKELAKLRRKKIGF-VFQNF 92 (226) T ss_pred ECCCCEEEECCCCEEEEECCCCCC-HH--HHHHHHHCCCCCCCCEEEECCEECCCCCHHHHHHHHHHHEEE-ECCCC T ss_conf 524625887499899998999998-99--999999646678884699998886758988999997774899-87567 No 30 >cd03229 ABC_Class3 This class is comprised of all BPD (Binding Protein Dependent) systems that are largely represented in archaea and eubacteria and are primarily involved in scavenging solutes from the environment. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=42.01 E-value=11 Score=18.33 Aligned_cols=69 Identities=16% Similarity=0.056 Sum_probs=46.0 Q ss_pred ECCCHHHCCCCEEEEEEECCCCEEEEEEECCCC-CEEECCCCCCCEEEEEEECCCCEEEEECCCCEEEEEEEEC Q ss_conf 658602258761467875266313322202355-0366568788769999980794499955950358999750 Q gi|254781025|r 53 PGGYIIAIPPGYEGQVRSRSGLALNYGVACLNS-PGTIDSDYRGEIKILLINLGQENFLIMRGMRIAQLIIANS 125 (154) Q Consensus 53 ~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~-~GvID~~YrGei~v~l~N~~~~~~~i~~G~RIaQlv~~~~ 125 (154) =-++.+++++|-..-|.+.||-+ |+=+ .++ .|...+ ..|++.+.=.+..+......+-.|=.-+||+.+ T Consensus 16 l~~vs~~i~~Ge~~~ivGpSG~G-KSTl--lr~i~Gl~~p-~~G~I~~~g~~i~~~~~~~~~~rr~ig~vFQ~~ 85 (178) T cd03229 16 LNDVSLNIEAGEIVALLGPSGSG-KSTL--LRCIAGLEEP-DSGSILIDGEDLTDLEDELPPLRRRIGMVFQDF 85 (178) T ss_pred ECCCEEEECCCCEEEEECCCCCH-HHHH--HHHHHCCCCC-CCEEEEECCEECCCCCCHHHHHCCCEEEEECCC T ss_conf 83707698899899999999983-9999--9999859999-963999999999888610245417759992699 No 31 >TIGR03415 ABC_choXWV_ATP choline ABC transporter, ATP-binding protein. Members of this protein family are the ATP-binding subunit of a three-protein transporter. This family belongs, more broadly, to the family of proline and glycine-betaine transporters, but members have been identified by direct characterization and by bioinformatic means as choline transporters. Many species have several closely-related members of this family, probably with variable abilities to act additionally on related quaternary amines. Probab=36.47 E-value=6.7 Score=19.57 Aligned_cols=48 Identities=17% Similarity=0.190 Sum_probs=32.4 Q ss_pred CCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEE-ECCCCCEEECCCCCCCEEE Q ss_conf 9739996586022587614678752663133222-0235503665687887699 Q gi|254781025|r 47 GMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGV-ACLNSPGTIDSDYRGEIKI 99 (154) Q Consensus 47 g~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl-~~~n~~GvID~~YrGei~v 99 (154) |...-+ -++.+++.+|-..-|.+-||.+ |+=| .+. .|.+++ -+|++.+ T Consensus 35 g~~vAV-~dvsl~I~~GEi~~lvGpSGsG-KSTLLr~i--~GL~~p-t~G~I~i 83 (382) T TIGR03415 35 GLVVGV-ANASLDIEEGEICVLMGLSGSG-KSSLLRAV--NGLNPV-SRGSVLV 83 (382) T ss_pred CCEEEE-ECCEEEECCCCEEEEECCCCCH-HHHHHHHH--HCCCCC-CCEEEEE T ss_conf 999989-6517488799899999999734-99999999--759998-8529999 No 32 >COG2430 Uncharacterized conserved protein [Function unknown] Probab=34.52 E-value=37 Score=15.34 Aligned_cols=74 Identities=19% Similarity=0.240 Sum_probs=39.9 Q ss_pred CCCEEECCCCEE----EEECCCHHHCCCCEEEEEEECCCCEEEEEEECC--CCCEEE-----C-------CCC-CCCEEE Q ss_conf 873893799739----996586022587614678752663133222023--550366-----5-------687-887699 Q gi|254781025|r 39 DEPVELLPGMRS----LIPGGYIIAIPPGYEGQVRSRSGLALNYGVACL--NSPGTI-----D-------SDY-RGEIKI 99 (154) Q Consensus 39 ~e~~~i~pg~~~----li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~--n~~GvI-----D-------~~Y-rGei~v 99 (154) ...+.++||+.. ++|..+++...++---.+..+-+++ |....+- +..|+| + +++ +|-+++ T Consensus 72 ~~pivv~Pgs~~~iyl~vPVdiaV~~~~~g~~~lvDvF~~~-~~kyaLYG~p~~GvvcRy~~s~l~~~P~~~~~egv~kl 150 (236) T COG2430 72 ESPIVVAPGSSRTIYLKVPVDIAVYASAKGEPTLVDVFGPG-KEKYALYGPPENGVVCRYWISELYERPENPPGEGVAKL 150 (236) T ss_pred CCCEEECCCCCEEEEEEEEEEEEEEECCCCCCEEEEEECCC-CEEEEEECCCCCCEEEEEEEEEEEECCCCCCCCCEEEE T ss_conf 17678689872799998247899998279982478885799-71388877877756999984014655789864324899 Q ss_pred EEEECCCCEEEEEC Q ss_conf 99980794499955 Q gi|254781025|r 100 LLINLGQENFLIMR 113 (154) Q Consensus 100 ~l~N~~~~~~~i~~ 113 (154) .+.|.+++..+|.+ T Consensus 151 ~v~N~sd~wv~Isr 164 (236) T COG2430 151 TVVNTSDEWVTISR 164 (236) T ss_pred EEEECCCCEEEEEE T ss_conf 99966997399998 No 33 >COG4525 TauB ABC-type taurine transport system, ATPase component [Inorganic ion transport and metabolism] Probab=34.13 E-value=18 Score=17.17 Aligned_cols=42 Identities=26% Similarity=0.328 Sum_probs=27.2 Q ss_pred CCHHHCCCCEEEEEEECCCCEEEEEEECCC-CCEEECCCCCCCEEEE Q ss_conf 860225876146787526631332220235-5036656878876999 Q gi|254781025|r 55 GYIIAIPPGYEGQVRSRSGLALNYGVACLN-SPGTIDSDYRGEIKIL 100 (154) Q Consensus 55 gi~i~ip~g~~~~i~~RSsl~~K~gl~~~n-~~GvID~~YrGei~v~ 100 (154) ++.+++..|-...+.++||.+ |+-| .| .+|.++++ +|++++. T Consensus 23 ~vsL~ia~ge~vv~lGpSGcG-KTTL--Lnl~AGf~~P~-~G~i~l~ 65 (259) T COG4525 23 DVSLTIASGELVVVLGPSGCG-KTTL--LNLIAGFVTPS-RGSIQLN 65 (259) T ss_pred CCCEEECCCCEEEEECCCCCC-HHHH--HHHHHCCCCCC-CCEEEEC T ss_conf 355023589789997688865-7889--99986275856-6648889 No 34 >PRK10418 nikD nickel transporter ATP-binding protein; Provisional Probab=33.76 E-value=20 Score=16.86 Aligned_cols=65 Identities=18% Similarity=0.244 Sum_probs=40.5 Q ss_pred CCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCC---CCCCEEEEEEECCCCEEEEECCCCEEEEEEEE Q ss_conf 586022587614678752663133222023550366568---78876999998079449995595035899975 Q gi|254781025|r 54 GGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSD---YRGEIKILLINLGQENFLIMRGMRIAQLIIAN 124 (154) Q Consensus 54 Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~---YrGei~v~l~N~~~~~~~i~~G~RIaQlv~~~ 124 (154) -++.+++.+|-..-|.+.||-+ |+ ..+....|++.++ +.|++.+.-.+...... ++ |-.++||+. T Consensus 20 ~~Isl~v~~Ge~~aiiG~SGsG-KS-tl~k~llgll~~~~~~~~G~i~~dg~~~~~~~~---r~-r~i~~v~Q~ 87 (254) T PRK10418 20 HGVSLTLQRGRVLALVGGSGSG-KS-LTCAAALGILPAGVRQTAGRVLADGKPVAPCAL---RG-RKIATIMQN 87 (254) T ss_pred ECEEEEECCCCEEEEECCCCCH-HH-HHHHHHHCCCCCCCCEECCEEEECCEECCHHHH---CC-CCEEEEEEC T ss_conf 6607289899999999999878-99-999999579988984157899999999960343---05-508999837 No 35 >PRK13637 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=33.71 E-value=12 Score=18.13 Aligned_cols=67 Identities=13% Similarity=0.106 Sum_probs=41.4 Q ss_pred CCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEEEEEECCCCEEEEECCCCEEEEEEE Q ss_conf 5860225876146787526631332220235503665687887699999807944999559503589997 Q gi|254781025|r 54 GGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKILLINLGQENFLIMRGMRIAQLIIA 123 (154) Q Consensus 54 Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v~l~N~~~~~~~i~~G~RIaQlv~~ 123 (154) -++.+++.+|-+.-|.++||-+ |+=|.-. -.|++.+ ..|++.+.=.+..+....+..=.|-..+||+ T Consensus 24 ~~isl~i~~GE~v~iiG~nGsG-KSTLl~~-l~GLl~p-~~G~V~i~G~~i~~~~~~~~~~r~~iG~VfQ 90 (287) T PRK13637 24 DNVNIEIEDGEFVALIGHTGSG-KSTLIQH-LNGLLKP-TSGKIIIDGVDITDKKVKLSDIRKKVGLVFQ 90 (287) T ss_pred ECEEEEECCCCEEEEECCCCCH-HHHHHHH-HHCCCCC-CCCEEEECCEECCCCCCCHHHHHHCEEEEEE T ss_conf 3207698799899999999939-9999999-9739988-8726999999987888677888741789961 No 36 >KOG0125 consensus Probab=33.13 E-value=15 Score=17.50 Aligned_cols=37 Identities=22% Similarity=0.261 Sum_probs=17.7 Q ss_pred CCEEEEECCCCEEEEEEEECCCCEEEEEECCCCCCCCCCCCCCC Q ss_conf 94499955950358999750434069987135775426888873 Q gi|254781025|r 106 QENFLIMRGMRIAQLIIANSVRAHPSLISAMPMGKNERNEKGFG 149 (154) Q Consensus 106 ~~~~~i~~G~RIaQlv~~~~~~~~~~~v~~~~l~~t~RG~~GFG 149 (154) +-||.+..-| +.| +|.++=++ .+.|+--.+||++||| T Consensus 103 NIPFrFRdpD-L~a-MF~kfG~V-----ldVEIIfNERGSKGFG 139 (376) T KOG0125 103 NIPFRFRDPD-LRA-MFEKFGKV-----LDVEIIFNERGSKGFG 139 (376) T ss_pred CCCCCCCCCC-HHH-HHHHHCCE-----EEEEEEECCCCCCCCC T ss_conf 5875246800-999-99851750-----4689986367777523 No 37 >TIGR02857 CydD ABC transporter, CydDC cysteine exporter (CydDC-E) family, permease/ATP-binding protein CydD; InterPro: IPR014216 The gene pair cydCD encodes an ABC-family transporter in which each gene contains an N-terminal membrane-spanning domain and a C-terminal ATP-binding domain. In Escherichia coli these genes were discovered as mutants which caused the terminal heme-copper oxidase complex cytochrome bd to fail to assemble. Recent work has shown that the transporter is involved in the export of redox-active thiol compounds such as cysteine and glutathione , . The linkage to assembly of the cytochrome bd complex is further supported by the conserved operon structure found outside the gammaproteobacteria (cydABCD) containing both the transporter and oxidase genes components. The genes used as the seed members for this entry are all either found in the gammaproteobacterial context, or the CydABCD context. All members of this entry scoring above the trusted cut off at the time of its creation were from genomes which encode a cytochrome bd complex. Unfortunately, the gene symbol nomenclature, based on this operon in Bacillus subtilis, assigns cydC to the third gene in the operon, where this gene is actually homologous to the E. coli cydD gene. We have chosen to name all homologues in this family in accordance with the precedence of publication of the E. coli name, CydD.. Probab=32.56 E-value=20 Score=16.85 Aligned_cols=90 Identities=17% Similarity=0.076 Sum_probs=56.5 Q ss_pred EEEEEEECCCCCCEEEC-CCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCC-CEEECCCCCCCEEEEEE--EC Q ss_conf 08789103778738937-9973999658602258761467875266313322202355-03665687887699999--80 Q gi|254781025|r 29 GMDLFAALPEDEPVELL-PGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNS-PGTIDSDYRGEIKILLI--NL 104 (154) Q Consensus 29 G~Dl~a~~~~~e~~~i~-pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~-~GvID~~YrGei~v~l~--N~ 104 (154) |-| -.. ..++--+.. ||...=+..++.+.+++|-..=|.+-||=+ |+=| .|. .|.+|++ +|+|.|.-. .. T Consensus 345 G~~-P~l-~~~~v~~~y~~G~~~pa~~~~sf~~~pG~~vAl~G~SGaG-KSTL--L~lLLGf~~P~-~G~i~v~~~Gg~~ 418 (570) T TIGR02857 345 GAD-PSL-EFEGVSVAYAPGRDEPALRPVSFTVEPGERVALVGPSGAG-KSTL--LNLLLGFVEPT-EGAIVVERDGGIN 418 (570) T ss_pred CCC-CEE-EEEEEEEECCCCCHHHCCCCCCEEECCCCEEEEEECCCCC-HHHH--HHHHHCCCCCC-CCEEEEECCCCCC T ss_conf 998-407-8875168627888010378854166387048886279997-8899--99997157644-6468874488853 Q ss_pred CCCEEEEECC-CCEEEEEEEE Q ss_conf 7944999559-5035899975 Q gi|254781025|r 105 GQENFLIMRG-MRIAQLIIAN 124 (154) Q Consensus 105 ~~~~~~i~~G-~RIaQlv~~~ 124 (154) +.+=-.+..- +.--|+-.++ T Consensus 419 G~~L~~~~~~W~W~~~iaWv~ 439 (570) T TIGR02857 419 GVPLAEADADWSWRDQIAWVP 439 (570) T ss_pred CCCCCCCCHHHHHHHCEEEEC T ss_conf 976421131022554134305 No 38 >pfam07070 Spo0M SpoOM protein. This family consists of several bacterial SpoOM proteins which are thought to control sporulation in Bacillus subtilis.Spo0M exerts certain negative effects on sporulation and its gene expression is controlled by sigmaH. Probab=31.45 E-value=42 Score=15.05 Aligned_cols=57 Identities=12% Similarity=0.183 Sum_probs=38.5 Q ss_pred CCCCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEEE Q ss_conf 787389379973999658602258761467875266313322202355036656878876999 Q gi|254781025|r 38 EDEPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKIL 100 (154) Q Consensus 38 ~~e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v~ 100 (154) ..+..+|+||+..-||..+.+ |...-.-+ .+..+.++++|.+.+ =||++=+..|.|. T Consensus 76 v~~~F~i~~ge~~~ipF~~~l--P~eTPiT~-~~~~v~v~T~LdI~~---avDp~D~D~l~V~ 132 (218) T pfam07070 76 LGGAFTIQPGEERVFPFTLTL--PWETPITI-GDAKVWVETGLDIAG---AIDPGDKDILTVR 132 (218) T ss_pred CCCCEEECCCCEEEEEEEEEC--CCCCCCCC-CCCEEEEEEEEEECC---CCCCCCCCCEEEE T ss_conf 188789899996997688338--98889515-898799998888078---8899886647982 No 39 >pfam01238 PMI_typeI Phosphomannose isomerase type I. This is a family of Phosphomannose isomerase type I enzymes (EC 5.3.1.8). Probab=30.49 E-value=43 Score=14.95 Aligned_cols=57 Identities=19% Similarity=0.168 Sum_probs=42.5 Q ss_pred CCCCCCEEEEEEECCCCCCEEECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECC Q ss_conf 5788640878910377873893799739996586022587614678752663133222023 Q gi|254781025|r 23 KTSGSSGMDLFAALPEDEPVELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACL 83 (154) Q Consensus 23 ~t~~~AG~Dl~a~~~~~e~~~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~ 83 (154) ..++|.| +.+.+ . .-+++.|||...++-|.-=.--.|....++..|-.=+|-||+-. T Consensus 237 ~yP~D~G--l~all-L-N~v~L~PGeAlfl~Ag~~HAYL~G~~vE~MA~SDNVlRaGLTpK 293 (373) T pfam01238 237 QYPDDIG--LFRLF-L-NHVKLNPGEAMFLKANDPHAYLSGDIVECMADSDNVVRAGLTPK 293 (373) T ss_pred HCCCCCE--EEEEE-C-CEEEECCCCEEEECCCCCEEEECCCCEEEECCCCCEEECCCCCC T ss_conf 6899864--41543-0-13760698778736999628754756652035654121257854 No 40 >COG3638 ABC-type phosphate/phosphonate transport system, ATPase component [Inorganic ion transport and metabolism] Probab=28.38 E-value=17 Score=17.27 Aligned_cols=59 Identities=22% Similarity=0.228 Sum_probs=36.5 Q ss_pred ECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEEEEEECCC Q ss_conf 379973999658602258761467875266313322202355036656878876999998079 Q gi|254781025|r 44 LLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKILLINLGQ 106 (154) Q Consensus 44 i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v~l~N~~~ 106 (154) ..|+...- =.++.+++++|-+.-|.++||-+.-+-|.+.| |.+|+.+ |++.+.-.+... T Consensus 12 ~yp~~~~a-L~~Vnl~I~~GE~VaiIG~SGaGKSTLLR~ln--gl~d~t~-G~i~~~g~~i~~ 70 (258) T COG3638 12 TYPGGHQA-LKDVNLEINQGEMVAIIGPSGAGKSTLLRSLN--GLVDPTS-GEILFNGVQITK 70 (258) T ss_pred ECCCCCEE-EEEEEEEECCCCEEEEECCCCCCHHHHHHHHH--CCCCCCC-CEEEECCCCHHC T ss_conf 54799566-63576775798689998788886899999986--6658886-159965622102 No 41 >PHA02118 hypothetical protein Probab=25.06 E-value=31 Score=15.81 Aligned_cols=11 Identities=36% Similarity=0.981 Sum_probs=7.9 Q ss_pred CCCCCCCCCCC Q ss_conf 88887354679 Q gi|254781025|r 144 NEKGFGSTGLY 154 (154) Q Consensus 144 G~~GFGSTG~~ 154 (154) ..+||||||-| T Consensus 187 SrgGFGssg~f 197 (202) T PHA02118 187 SKGGFGKSGSF 197 (202) T ss_pred CCCCCCCCCCC T ss_conf 35666665545 No 42 >KOG2230 consensus Probab=24.38 E-value=56 Score=14.31 Aligned_cols=88 Identities=18% Similarity=0.193 Sum_probs=43.1 Q ss_pred EEEEEEC---CCCCCCCCCCCCCCCCCEEEEEEECCCCCCE-EECCCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEE Q ss_conf 7899976---7547970374357886408789103778738-93799739996586022587614678752663133222 Q gi|254781025|r 5 DIPFLQL---PHAHGIPLPEYKTSGSSGMDLFAALPEDEPV-ELLPGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGV 80 (154) Q Consensus 5 ~iki~~~---~~~~~~~lP~~~t~~~AG~Dl~a~~~~~e~~-~i~pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl 80 (154) +.|+..+ .+..+..||.+.+.+|.++|=-.. .+..+ ....|+...-.|+.-++.----..-+..|+-. .-.+| T Consensus 688 ~f~v~~~~d~~~l~s~tl~vq~~sW~~~L~P~~~--~~~~i~~v~~g~~ev~~t~~~~s~~lrr~~~~~~~t~~-~ss~i 764 (867) T KOG2230 688 DFKVFLLNDPVHLWSQTLPVQSTSWDVTLRPNGV--QDKVIAIVDLGQGEVTRTNAFLSLQLRRGQDVISRTHF-FSSSI 764 (867) T ss_pred CEEEEECCCHHHHHHEEEEEEEEECCCCCCCCCC--CCEEEEEECCCCCEEEECCCHHHHHHHHCCCEEEEEEE-CCCCC T ss_conf 1478832763432331421377513454366654--30356640057751576055299998645562455775-02212 Q ss_pred ECCCCCEEECCCCCCCE Q ss_conf 02355036656878876 Q gi|254781025|r 81 ACLNSPGTIDSDYRGEI 97 (154) Q Consensus 81 ~~~n~~GvID~~YrGei 97 (154) +...|+.|+||.=|+ T Consensus 765 --~~~~~il~P~~~Fei 779 (867) T KOG2230 765 --AGAVGILDPGLEFEI 779 (867) T ss_pred --CCCCCCCCCCCEEEE T ss_conf --110232479837884 No 43 >PRK00513 minC septum formation inhibitor; Reviewed Probab=23.43 E-value=59 Score=14.20 Aligned_cols=40 Identities=15% Similarity=0.151 Sum_probs=29.9 Q ss_pred EECC-CCCEEECCCCCCCEEEEEEECCCCEEEEECCCCEEE Q ss_conf 2023-550366568788769999980794499955950358 Q gi|254781025|r 80 VACL-NSPGTIDSDYRGEIKILLINLGQENFLIMRGMRIAQ 119 (154) Q Consensus 80 l~~~-n~~GvID~~YrGei~v~l~N~~~~~~~i~~G~RIaQ 119 (154) +.+. ..-|++.+||.|.-...+....-+|..+.-|+.|+. T Consensus 140 I~V~G~LrG~vhAG~~Gn~~A~I~A~~l~P~qlrIad~iar 180 (214) T PRK00513 140 IFVLGSLRGIAHAGATGNKEAVIAALQLEPTQLRIADVVAR 180 (214) T ss_pred EEEEEECCCEEEECCCCCCCCEEEECCCCCCEEEECCEEEE T ss_conf 99997626769856899987399985478748899636504 No 44 >pfam01393 Chromo_shadow Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain. Probab=22.75 E-value=28 Score=16.00 Aligned_cols=43 Identities=28% Similarity=0.324 Sum_probs=25.6 Q ss_pred EEECCCCCEEECCCCCCCEEEEEEECCCCEEEEEC---CCCEEEEEEE Q ss_conf 22023550366568788769999980794499955---9503589997 Q gi|254781025|r 79 GVACLNSPGTIDSDYRGEIKILLINLGQENFLIMR---GMRIAQLIIA 123 (154) Q Consensus 79 gl~~~n~~GvID~~YrGei~v~l~N~~~~~~~i~~---G~RIaQlv~~ 123 (154) |+....-.|..|+ .||+++.+..-+++.-.+.. ..|+-|+|+. T Consensus 2 Gl~~e~IiGaTd~--~Gel~FL~kwk~de~dlvpa~~an~~~Pq~VI~ 47 (56) T pfam01393 2 GLELEKIVGATDS--PGELHFLCKFKDDEADLVPAKEANVKCPQKVIA 47 (56) T ss_pred CCCHHHEEEEECC--CCEEEEEEEECCCCCCEEEHHHHHHHCCHHHHH T ss_conf 9882554776777--996999999749853167779976359599999 No 45 >TIGR03258 PhnT 2-aminoethylphosphonate ABC transport system, ATP-binding component PhnT. This ATP-binding component of an ABC transport system is found in Salmonella and Burkholderia lineages in the vicinity of enzymes for the breakdown of 2-aminoethylphosphonate. Probab=22.19 E-value=25 Score=16.30 Aligned_cols=15 Identities=27% Similarity=0.226 Sum_probs=4.8 Q ss_pred CCCCEEEEEEECCCC Q ss_conf 587614678752663 Q gi|254781025|r 60 IPPGYEGQVRSRSGL 74 (154) Q Consensus 60 ip~g~~~~i~~RSsl 74 (154) +++|-+.-|.+-||. T Consensus 28 i~~GE~~~llGpSG~ 42 (362) T TIGR03258 28 IEAGELLALIGKSGC 42 (362) T ss_pred ECCCCEEEEECCCCC T ss_conf 999989999999974 No 46 >cd03290 ABCC_SUR1_N The SUR domain 1. The sulfonylurea receptor SUR is an ATP transporter of the ABCC/MRP family with tandem ATPase binding domains. Unlike other ABC proteins, it has no intrinsic transport function, neither active nor passive, but associates with the potassium channel proteins Kir6.1 or Kir6.2 to form the ATP-sensitive potassium (K(ATP)) channel. Within the channel complex, SUR serves as a regulatory subunit that fine-tunes the gating of Kir6.x in response to alterations in cellular metabolism. It constitutes a major pharmaceutical target as it binds numerous drugs, K(ATP) channel openers and blockers, capable of up- or down-regulating channel activity. Probab=20.93 E-value=40 Score=15.15 Aligned_cols=52 Identities=15% Similarity=0.200 Sum_probs=32.5 Q ss_pred CCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEEEEEECCCCE Q ss_conf 5860225876146787526631332220235503665687887699999807944 Q gi|254781025|r 54 GGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKILLINLGQEN 108 (154) Q Consensus 54 Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v~l~N~~~~~ 108 (154) .++.+++|+|-..-|.++||-+ |.=|.-+ -.|...+ +.|++.+.-.+.+... T Consensus 18 ~~inl~i~~Ge~~~IvG~sGsG-KSTLl~~-l~g~~~~-~~G~I~~~~~~~~~~~ 69 (218) T cd03290 18 SNINIRIPTGQLTMIVGQVGCG-KSSLLLA-ILGEMQT-LEGKVHWSNKNESEPS 69 (218) T ss_pred ECEEEEECCCCEEEEECCCCCC-HHHHHHH-HHCCCCC-CCCEEEECCEECCCCC T ss_conf 4769998699999999999980-9999999-8555656-7764998986668646 No 47 >PRK13642 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=20.45 E-value=25 Score=16.28 Aligned_cols=65 Identities=23% Similarity=0.242 Sum_probs=37.9 Q ss_pred CCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEEEEEECCCCEEEEECCCCEEEEEEEE Q ss_conf 8602258761467875266313322202355036656878876999998079449995595035899975 Q gi|254781025|r 55 GYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKILLINLGQENFLIMRGMRIAQLIIAN 124 (154) Q Consensus 55 gi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v~l~N~~~~~~~i~~G~RIaQlv~~~ 124 (154) ++.+++++|-+.-|.+.||-+ |+=+.-. -.|+..+ +.|++.+.=.+.+..+.. .=.|-.++||+. T Consensus 25 ~is~~i~~Ge~~aiiG~sGsG-KSTL~~~-l~Gl~~~-~~G~I~~~G~~i~~~~~~--~~r~~ig~VfQ~ 89 (277) T PRK13642 25 GVSFSITKGEWVSIIGQNGSG-KSTTARL-IDGLFEE-FEGIVKIDGERLTAENVW--NLRRKIGMVFQN 89 (277) T ss_pred CEEEEECCCCEEEEECCCCCH-HHHHHHH-HHCCCCC-CCCEEEECCEECCCCCHH--HHHCCCEEEEEC T ss_conf 307998899899999999968-9999999-9638998-884899999999857888--885176899989 No 48 >TIGR02769 nickel_nikE nickel import ATP-binding protein NikE; InterPro: IPR014137 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family represents the NikE subunit of a multisubunit nickel import ABC transporter complex. Nickel, once imported, may be used in urease and in certain classes of hydrogenase and superoxide dismutase.. Probab=20.25 E-value=52 Score=14.49 Aligned_cols=78 Identities=19% Similarity=0.141 Sum_probs=43.4 Q ss_pred CCCEEEEECCCHHHCCCCEEEEEEECCCCEEEEEEECCCCCEEECCCCCCCEEEEEEECCC-CEEEEECCCCEEEEEEEE Q ss_conf 9973999658602258761467875266313322202355036656878876999998079-449995595035899975 Q gi|254781025|r 46 PGMRSLIPGGYIIAIPPGYEGQVRSRSGLALNYGVACLNSPGTIDSDYRGEIKILLINLGQ-ENFLIMRGMRIAQLIIAN 124 (154) Q Consensus 46 pg~~~li~Tgi~i~ip~g~~~~i~~RSsl~~K~gl~~~n~~GvID~~YrGei~v~l~N~~~-~~~~i~~G~RIaQlv~~~ 124 (154) ..++.-|=+++.+.+-.|...=+.+|||-+ |.=|.=. -.|.= .=-.|++.+.=.-.++ +.-.-++=.|=-|+||+- T Consensus 21 ~~~~~~VL~~vsL~l~~Ge~~gLLG~SG~G-KSTLArl-LlGLe-~P~~G~v~F~G~dl~~L~~~~~rafrR~vQlvFQD 97 (267) T TIGR02769 21 KKQRKAVLDNVSLSLEEGETVGLLGRSGCG-KSTLARL-LLGLE-KPAQGEVSFRGQDLSKLDRKQRRAFRRDVQLVFQD 97 (267) T ss_pred HHHHHHHCCCCCEEECCCCEEEECCCCCCC-HHHHHHH-HHHCC-CCCCCEEEECCCCHHHCCHHHCCEEEEEEEEEEEC T ss_conf 454431036743231377505523678873-7789999-87507-88874047625645550811021112136789874 Q ss_pred CC Q ss_conf 04 Q gi|254781025|r 125 SV 126 (154) Q Consensus 125 ~~ 126 (154) .+ T Consensus 98 ~~ 99 (267) T TIGR02769 98 SP 99 (267) T ss_pred CC T ss_conf 74 No 49 >TIGR00074 hypC_hupF hydrogenase assembly chaperone HypC/HupF; InterPro: IPR001109 The large subunit of [NiFe]-hydrogenase, as well as other nickel metalloenzymes, is synthesised as a precursor devoid of the metalloenzyme active site. This precursor then undergoes a complex post-translational maturation process that requires a number of accessory proteins. The hydrogenase expression/formation proteins (HUPF/HYPC) form a family of small proteins that are hydrogenase precursor-specific chaperones required for this maturation process . They are believed to keep the hydrogenase precursor in a conformation accessible for metal incorporation , . . Probab=20.07 E-value=50 Score=14.59 Aligned_cols=53 Identities=17% Similarity=0.164 Sum_probs=32.9 Q ss_pred EEEEEECCCCC-CCCCCCCCCCCCCEEEEEE--ECCCCCCEEECCCCEEEEECCCHHHC Q ss_conf 78999767547-9703743578864087891--03778738937997399965860225 Q gi|254781025|r 5 DIPFLQLPHAH-GIPLPEYKTSGSSGMDLFA--ALPEDEPVELLPGMRSLIPGGYIIAI 60 (154) Q Consensus 5 ~iki~~~~~~~-~~~lP~~~t~~~AG~Dl~a--~~~~~e~~~i~pg~~~li~Tgi~i~i 60 (154) .-||..+..+. .|..-..+-..++-+||-- | ++.-...+|+..||.+|.++++ T Consensus 6 P~qV~~i~~~~~~A~v~~~G~~~~v~l~Lv~ksC---~~N~~~~~GdyvLvH~G~A~~~ 61 (88) T TIGR00074 6 PGQVVEIDENIDLALVEFKGVKREVSLDLVGKSC---DENEEVKVGDYVLVHVGFAISV 61 (88) T ss_pred CCEEEEECCCCCEEEECCCCEEEEEEEEECCCCC---CCCCCCCCCCEEEEEEEHHHHH T ss_conf 7168885499987886015224667641235544---6785999987763200046632 Done!