Query gi|254780480|ref|YP_003064893.1| 2'-deoxycytidine 5'-triphosphate deaminase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 367 No_of_seqs 242 out of 475 Neff 5.3 Searched_HMMs 39220 Date Sun May 29 17:56:42 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780480.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam06559 DCD 2'-deoxycytidine 100.0 0 0 1019.3 32.3 358 2-365 8-366 (366) 2 PRK07559 2'-deoxycytidine 5'-t 100.0 0 0 1016.5 31.6 359 2-365 7-366 (366) 3 TIGR02274 dCTP_deam deoxycytid 100.0 0 0 336.9 15.9 154 5-165 1-171 (198) 4 PHA01707 dut 2'-deoxyuridine 5 100.0 2.2E-40 5.6E-45 283.4 15.7 135 4-165 1-135 (158) 5 PRK00416 dcd deoxycytidine tri 100.0 6.7E-39 1.7E-43 273.8 17.2 148 5-165 1-149 (174) 6 TIGR02274 dCTP_deam deoxycytid 100.0 1.2E-39 3.1E-44 278.7 13.1 180 167-361 1-197 (198) 7 COG0717 Dcd Deoxycytidine deam 100.0 7.5E-35 1.9E-39 247.3 16.0 152 4-165 1-156 (183) 8 COG0717 Dcd Deoxycytidine deam 100.0 1.2E-34 2.9E-39 246.1 15.0 180 166-363 1-182 (183) 9 PRK02253 deoxyuridine 5'-triph 100.0 1.1E-31 2.8E-36 226.6 13.7 139 16-165 3-142 (159) 10 PHA01707 dut 2'-deoxyuridine 5 100.0 1.4E-28 3.6E-33 206.4 14.1 153 166-362 1-157 (158) 11 PRK00416 dcd deoxycytidine tri 100.0 5.6E-27 1.4E-31 195.9 15.4 168 167-361 1-173 (174) 12 PRK02253 deoxyuridine 5'-triph 99.9 1.5E-25 3.7E-30 186.7 11.0 154 179-358 4-159 (159) 13 cd07557 trimeric_dUTPase Trime 99.8 1.8E-18 4.6E-23 140.4 10.4 91 34-161 1-92 (92) 14 pfam00692 dUTPase dUTPase. dUT 99.4 2.6E-12 6.7E-17 100.2 10.1 85 73-165 20-105 (129) 15 cd07557 trimeric_dUTPase Trime 99.3 5.4E-12 1.4E-16 98.1 8.8 79 248-333 12-92 (92) 16 PRK00601 dut deoxyuridine 5'-t 99.2 9.9E-11 2.5E-15 89.9 9.5 85 74-164 36-122 (149) 17 PRK13956 dut deoxyuridine 5'-t 98.9 1.5E-08 3.9E-13 75.6 9.3 84 74-163 34-125 (147) 18 TIGR00576 dut dUTP diphosphata 98.9 1.4E-08 3.6E-13 75.8 8.9 83 75-163 34-119 (151) 19 pfam00692 dUTPase dUTPase. dUT 98.9 4.1E-08 1E-12 72.8 10.8 84 249-338 23-106 (129) 20 PTZ00143 deoxyuridine 5'-triph 98.8 6.6E-08 1.7E-12 71.4 9.4 85 75-165 35-129 (155) 21 COG0756 Dut dUTPase [Nucleotid 98.8 6.5E-08 1.6E-12 71.5 9.3 83 77-164 38-121 (148) 22 PRK00601 dut deoxyuridine 5'-t 98.7 3.2E-07 8.2E-12 66.9 9.8 84 248-338 37-124 (149) 23 TIGR00576 dut dUTP diphosphata 98.6 5.7E-07 1.5E-11 65.3 9.8 85 248-338 34-122 (151) 24 pfam06559 DCD 2'-deoxycytidine 98.4 7.9E-06 2E-10 57.9 12.4 152 3-164 171-337 (366) 25 COG0756 Dut dUTPase [Nucleotid 98.4 4.1E-06 1.1E-10 59.7 9.9 85 250-340 38-125 (148) 26 PRK13956 dut deoxyuridine 5'-t 98.4 4.2E-06 1.1E-10 59.7 9.4 88 249-337 36-127 (147) 27 PRK07559 2'-deoxycytidine 5'-t 98.3 1.9E-05 4.9E-10 55.3 12.1 150 4-164 171-337 (366) 28 PTZ00143 deoxyuridine 5'-triph 97.5 0.0018 4.6E-08 42.5 9.8 54 277-337 75-129 (155) 29 KOG3370 consensus 94.4 0.15 3.8E-06 29.9 6.2 79 78-163 34-113 (140) 30 KOG3370 consensus 93.6 0.16 4.1E-06 29.7 5.1 78 258-342 25-119 (140) 31 TIGR02772 Ku_bact Ku protein; 65.6 3.9 9.9E-05 20.7 1.8 32 130-163 2-34 (271) 32 pfam08758 Cadherin_pro Cadheri 58.8 7.8 0.0002 18.7 2.4 28 317-344 14-41 (90) 33 cd00789 KU_like Ku-core domain 52.7 12 0.0003 17.5 2.5 32 131-164 2-33 (256) 34 TIGR02883 spore_cwlD N-acetylm 45.9 9.6 0.00024 18.2 1.1 67 129-202 12-93 (192) 35 PRK11637 hypothetical protein; 44.9 22 0.00057 15.8 3.2 57 283-340 321-383 (404) 36 pfam05784 Herpes_UL82_83 Betah 44.0 23 0.00059 15.7 10.4 90 232-337 246-342 (346) 37 pfam01834 XRCC1_N XRCC1 N term 41.1 12 0.00031 17.5 1.0 86 90-196 54-146 (152) 38 KOG2936 consensus 39.1 11 0.00029 17.7 0.6 78 249-332 142-228 (301) 39 pfam04797 Herpes_ORF11 Herpesv 37.8 29 0.00073 15.1 7.0 251 74-356 98-365 (384) 40 KOG3293 consensus 37.7 29 0.00073 15.0 3.6 23 249-271 52-74 (134) 41 TIGR02061 aprA adenylylsulfate 35.9 26 0.00066 15.3 2.1 97 249-349 192-291 (651) 42 TIGR01163 rpe ribulose-phospha 35.8 14 0.00036 17.0 0.7 89 193-302 57-145 (216) 43 pfam03843 Slp Outer membrane l 34.4 30 0.00076 15.0 2.2 39 260-298 52-90 (160) 44 COG1273 Ku-homolog [Replicatio 32.9 30 0.00078 14.9 2.0 34 129-164 3-36 (278) 45 KOG4048 consensus 30.6 37 0.00095 14.3 5.1 86 208-302 54-165 (201) 46 pfam11850 DUF3370 Protein of u 28.9 40 0.001 14.1 4.6 31 17-47 53-83 (439) 47 PRK09455 rseB periplasmic nega 28.9 40 0.001 14.1 3.4 10 153-162 160-169 (302) 48 pfam00583 Acetyltransf_1 Acety 28.5 38 0.00097 14.2 1.9 35 323-357 3-39 (80) 49 COG5266 CbiK ABC-type Co2+ tra 28.0 41 0.0011 14.0 2.2 32 136-167 154-185 (264) 50 PRK05799 coproporphyrinogen II 27.0 22 0.00057 15.7 0.5 66 135-202 86-157 (374) 51 PRK10319 N-acetylmuramoyl-l-al 26.1 30 0.00075 15.0 1.0 49 119-172 60-116 (289) 52 TIGR02472 sucr_P_syn_N sucrose 25.6 34 0.00087 14.5 1.2 63 105-182 41-114 (445) 53 KOG1408 consensus 24.4 48 0.0012 13.6 6.0 65 98-165 572-637 (1080) 54 COG0860 AmiC N-acetylmuramoyl- 24.0 40 0.001 14.1 1.4 15 119-135 46-60 (231) 55 PRK08221 anaerobic sulfite red 23.2 37 0.00094 14.3 1.0 46 70-120 24-69 (263) 56 TIGR01967 DEAH_box_HrpA ATP-de 22.7 40 0.001 14.1 1.1 20 132-152 237-256 (1320) 57 COG2147 RPL19A Ribosomal prote 21.7 54 0.0014 13.2 1.7 21 4-24 34-54 (150) 58 KOG3021 consensus 21.6 38 0.00096 14.3 0.8 26 234-259 215-240 (313) 59 COG1124 DppF ABC-type dipeptid 21.5 54 0.0014 13.2 2.1 85 76-184 27-111 (252) 60 cd04331 RNAP_E_N RNAP_E_N: Rpo 21.5 55 0.0014 13.2 3.3 69 257-333 1-71 (80) 61 TIGR02992 ectoine_eutC ectoine 21.2 55 0.0014 13.2 1.9 55 250-304 212-277 (326) 62 TIGR01081 mpl UDP-N-acetylmura 20.9 47 0.0012 13.7 1.2 34 13-46 39-73 (459) 63 COG3919 Predicted ATP-grasp en 20.7 57 0.0014 13.1 2.0 38 142-179 144-184 (415) 64 PRK10809 ribosomal-protein-S5- 20.5 57 0.0015 13.1 2.4 32 324-357 85-120 (194) 65 TIGR02466 TIGR02466 conserved 20.3 39 0.00099 14.2 0.7 99 167-289 51-149 (210) 66 PRK05904 coproporphyrinogen II 20.1 27 0.00068 15.2 -0.2 138 129-298 86-242 (353) No 1 >pfam06559 DCD 2'-deoxycytidine 5'-triphosphate deaminase (DCD). This family consists of several bacterial 2'-deoxycytidine 5'-triphosphate deaminase proteins (EC:3.5.4.13). Probab=100.00 E-value=0 Score=1019.34 Aligned_cols=358 Identities=57% Similarity=0.973 Sum_probs=347.9 Q ss_pred CCCCCCHHHHHHHHHCCCEEECCCCCHHHCCCCEEEEEECCEEEEEECCCCCCCCCCHHHHCCCCEEEEEECCCCEEECC Q ss_conf 87615868999999879849817898656571225887589899981231578886211000240478873688648668 Q gi|254780480|r 2 NKGVLPDKAIATLAERGDIVSEHPLEKNQIQPASIDLRLSSKAYRVRASFLPNIDGLVSDKIKHFKLREIDLSGGTVLDA 81 (367) Q Consensus 2 ~~GILsd~~I~~li~~g~I~i~~p~~~~QIQPaSiDLRLG~~~~rv~asflp~~~~~v~~~l~~~~~~eidl~~g~vLep 81 (367) .+||||+++|+.||++|.|.++.|++++||||||+|||||++|||+++||||+++.+|.++|+++.||+||+++|+|||+ T Consensus 8 ~~GiLp~q~I~~li~~g~I~~~~~~~~~QIQPASlDLrLg~~ayrvraSFLpg~~~~V~~~L~~l~~~~idL~~gavlEk 87 (366) T pfam06559 8 ANGILPDRAIAALIAAGAITADAPLDPDQIQPASLDLRLGTRAYRVRASFLPGPGRTVAERLDDFALHEVDLTDGAVLET 87 (366) T ss_pred CCCCCCHHHHHHHHHCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCHHHHHHHHHEEEEECCCCCEEEC T ss_conf 67569489999999869753489886013586340645787179998001479959399999654118982799836636 Q ss_pred CCEEEEEEEEEEECCCCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCCCCEEEECCCEEEEEEE Q ss_conf 73299998889984998279986013431125034232078775542366744389999516865279833980356898 Q gi|254780480|r 82 NCVYIVPLMESLNLKNGISAYANPKSSIGRIDVFARVIGDRSQEFDSISPNYSGPLYLEILPRTFPIVVRVGSRLSQIRF 161 (367) Q Consensus 82 g~~yLv~t~E~i~LP~di~a~~~pKSS~GRLgif~r~i~~~~~~f~~id~Gy~G~l~lEI~~~sfpI~v~~G~rl~Qi~f 161 (367) ||+|||+++|+++||++|+|.|||||||||||||||+|||++.+||.|++||+|+||+||+|+||||+|++|+||||||| T Consensus 88 gcvYiv~L~E~L~Lp~~i~a~aNPKSSTGRLDiFtR~ItD~~~efD~I~~GY~GpLy~EI~PrsFsI~vr~G~rLnQiRf 167 (366) T pfam06559 88 GCVYLVPLQEGLALPADISAAANPKSSTGRLDVFTRVITDGGREFDRIPAGYDGPLYAEISPRTFSVLVRPGSRLSQIRF 167 (366) T ss_pred CCEEEEEEHHHCCCCCCCEEEECCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCEEEEECCCEEEEEEECCCEEEEEEE T ss_conf 97899981531379978658865887664037899994258830102577886525899745523079807980443665 Q ss_pred EECCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCEEEEECCCCCEEECCCCCCCCHHHHHHHHC Q ss_conf 50887689999999875278603665544337316788814554566663045512677513015655558778664311 Q gi|254780480|r 162 VKERKFCSQEELSALHELSPLVQDGLRDFSGKGVALSVDLKGNEIGKKSIIGYRAKRHTAAIDVDSENKYDESDFWDPLF 241 (367) Q Consensus 162 ~~g~~~LsD~eL~~lh~~~~LV~~~~~~~~~~Gl~~sVdL~~~~~~~~~iiGyrAkkht~~IDl~k~~~y~~~dfwe~i~ 241 (367) |+|++.+||++|+++|++++||.++ ..+++|+++||||+++ ..+++|||||||||++||++|+++|+++||||||+ T Consensus 168 r~g~~~lsd~~L~~lh~~~~lv~~~--~~i~~Gl~~SVDL~~~--~~~~iiGYRAKkht~~IDldki~~y~~~dfWepi~ 243 (366) T pfam06559 168 RRGEATLSDAELAALHAREPLVDGA--PVIDGGLALSVDLAGE--GGDGLVGYRAKRHTGVIDLDRIGAYDPADFWEPLH 243 (366) T ss_pred CCCCCCCCHHHHHHHHHHCCCCCCC--CCCCCCEEEEEECCCC--CCCCEEEEEECCCCCEEEECCCCCCCHHHHHCEEE T ss_conf 1798767999999988548775787--2136881799985578--88864899825899748744746587556010465 Q ss_pred CCCCCCCEEEECCCCEEEEEEEEEEECCCCCEEEEEEECCCCCCEEEECCEEECCCCCCCCCCCCCCEEEEEECC-CCCE Q ss_conf 355777539988997899986334442834016445304333421342160456886654345667504788617-8888 Q gi|254780480|r 242 SWDAPEGQFILDPNEFYIFASREFIQIPPSLVAEMIPYDHSIGEFRVHYAGFFDPGFGCVLPQEVGAKAVLEVRS-SVPF 320 (367) Q Consensus 242 ~~~~~~~~~~l~p~~F~l~~s~E~v~iP~~~aae~~~~~~~~g~~r~h~AGf~DpGf~~~~~~~~~~~~vLevr~-~~p~ 320 (367) +. +++++||+|||||||+|+|+|+||+|+||||+|||+.+||||+|||||||||||+..+++.|+|||||||| |+|| T Consensus 244 ~~--~~~~liLdp~eFYIL~SkE~V~IPp~~AAEM~pyd~~vGeFRvHYAGFFDPGFG~~~~gg~GsraVLEVRs~evPF 321 (366) T pfam06559 244 AR--DGRELILDPGEFYILVSREAVHIPPDYAAEMAPFDPLVGEFRVHYAGFFDPGFGHAAAGGAGSRAVLEVRSHEVPF 321 (366) T ss_pred CC--CCCEEEECCCCEEEEEECCEEEECCCHHEECCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCEEEEEECCCCCE T ss_conf 28--9985997788679997543367473310125243235664121342312588787666777740799983487876 Q ss_pred EEECCCEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 994798889999976376642223754347232666773443430 Q gi|254780480|r 321 VLEHGQIIGRLKYESMDGEPERLYGEARGSHYQSQALKLSKHFHQ 365 (367) Q Consensus 321 ~l~~g~~i~~l~~~~~~~~p~~~Yg~~~~s~Yq~Qgl~lsk~f~~ 365 (367) +|+|||+||||+||+|++.|+++||++++||||+||||||||||. T Consensus 322 ~ledgQ~v~rL~yE~m~~~P~~lYG~~i~SNYq~QgLkLsKhFk~ 366 (366) T pfam06559 322 ILEHGQIVGRLVYERMAARPATLYGAGLGSNYQGQGLKLSKHFKA 366 (366) T ss_pred EEECCCEEEEEEEEECCCCCHHHCCCCCCCCCCCCCCCHHHHCCC T ss_conf 766797889999877004861210787665421237643542379 No 2 >PRK07559 2'-deoxycytidine 5'-triphosphate deaminase; Provisional Probab=100.00 E-value=0 Score=1016.49 Aligned_cols=359 Identities=59% Similarity=1.005 Sum_probs=352.2 Q ss_pred CCCCCCHHHHHHHHHCCCEEECCCCCHHHCCCCEEEEEECCEEEEEECCCCCCCCCCHHHHCCCCEEEEEECCCCEEECC Q ss_conf 87615868999999879849817898656571225887589899981231578886211000240478873688648668 Q gi|254780480|r 2 NKGVLPDKAIATLAERGDIVSEHPLEKNQIQPASIDLRLSSKAYRVRASFLPNIDGLVSDKIKHFKLREIDLSGGTVLDA 81 (367) Q Consensus 2 ~~GILsd~~I~~li~~g~I~i~~p~~~~QIQPaSiDLRLG~~~~rv~asflp~~~~~v~~~l~~~~~~eidl~~g~vLep 81 (367) ..||||+++|++||++|.|.++.|++++||||||+|||||+++||+++||||+++.+|.++|+++.||+||+++|++||+ T Consensus 7 ~~Gilp~q~i~~li~~g~I~~~~~~~~~QiQPaSlDLrLg~~ayrvraSFlPg~~~~V~~~l~~l~~~~idl~~gavlek 86 (366) T PRK07559 7 ATGILPDQAIAALIESGAIASERPLDADQIQPASLDLRLGAKAYRVRASFLPGPGRTVAERLDELKLHEIDLTDGAVLET 86 (366) T ss_pred CCCCCCHHHHHHHHHCCCEECCCCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCHHHHHHHHHEEEECCCCCEEEEC T ss_conf 77049489999999869764489885024486451645787079998001479969189998654048972789827867 Q ss_pred CCEEEEEEEEEEECCCCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCCCCEEEECCCEEEEEEE Q ss_conf 73299998889984998279986013431125034232078775542366744389999516865279833980356898 Q gi|254780480|r 82 NCVYIVPLMESLNLKNGISAYANPKSSIGRIDVFARVIGDRSQEFDSISPNYSGPLYLEILPRTFPIVVRVGSRLSQIRF 161 (367) Q Consensus 82 g~~yLv~t~E~i~LP~di~a~~~pKSS~GRLgif~r~i~~~~~~f~~id~Gy~G~l~lEI~~~sfpI~v~~G~rl~Qi~f 161 (367) ||+||++++|+++||++|+|.|||||||||||||||+|||++.+||+|++||+|+||+||+|+||||+|++|+||||||| T Consensus 87 gcvYlv~L~E~L~LP~~isa~aNPKSSTGRLDvFtR~ItD~~~eFD~Ip~GY~GpLY~EI~PrTFsI~Vr~G~rLnQiRf 166 (366) T PRK07559 87 GCVYIVPLLESLALPADLSASANPKSSTGRLDVFTRVITDGAREFDKIPAGYHGPLYAEISPRTFPVLVRTGSRLSQIRF 166 (366) T ss_pred CCEEEEEEHHHCCCCCCCEEEECCCCCCEEEEEEEEEECCCCCHHCCCCCCCCCCEEEEECCCEEEEEEECCCCHHHEEE T ss_conf 97899983431269988769865777640578899995058830102688886415899757645089847982001442 Q ss_pred EECCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCEEEEECCCCCEEECCCCCCCCHHHHHHHHC Q ss_conf 50887689999999875278603665544337316788814554566663045512677513015655558778664311 Q gi|254780480|r 162 VKERKFCSQEELSALHELSPLVQDGLRDFSGKGVALSVDLKGNEIGKKSIIGYRAKRHTAAIDVDSENKYDESDFWDPLF 241 (367) Q Consensus 162 ~~g~~~LsD~eL~~lh~~~~LV~~~~~~~~~~Gl~~sVdL~~~~~~~~~iiGyrAkkht~~IDl~k~~~y~~~dfwe~i~ 241 (367) ++|++++||++|+++|++++||++..+.++++|+++||||+++ .+++|||||||||++||++|+++|+++||||||+ T Consensus 167 r~g~~~lsd~eL~~lh~~~~Lv~~~~~~~i~~Gl~lSVDLsg~---~~~iiGYRAKkht~~IDldKi~~y~~~dFWEpI~ 243 (366) T PRK07559 167 RRGQAVLDEAELAALHAAEPLVDGDDPNISGGGIALSVDLSGG---EDGLVGYRAKRHTGVVDVDRRGGHDVRDFWEPLY 243 (366) T ss_pred CCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEECCC---CCCEEEEEECCCCCEEEECCCCCCCHHHHHCEEE T ss_conf 2798766999999876437764576554346865999993268---8964678823799637755867788688210101 Q ss_pred CCCCCCCEEEECCCCEEEEEEEEEEECCCCCEEEEEEECCCCCCEEEECCEEECCCCCCCCCCCCCCEEEEEECC-CCCE Q ss_conf 355777539988997899986334442834016445304333421342160456886654345667504788617-8888 Q gi|254780480|r 242 SWDAPEGQFILDPNEFYIFASREFIQIPPSLVAEMIPYDHSIGEFRVHYAGFFDPGFGCVLPQEVGAKAVLEVRS-SVPF 320 (367) Q Consensus 242 ~~~~~~~~~~l~p~~F~l~~s~E~v~iP~~~aae~~~~~~~~g~~r~h~AGf~DpGf~~~~~~~~~~~~vLevr~-~~p~ 320 (367) ++ .++++||+|||||||+|+|.|+||+|+||||+|||+++||||+|||||||||||+..+++.|+|||||||| |+|| T Consensus 244 ~~--~~~~LILdp~eFYIL~SkE~V~iP~~~AaEM~pyd~~~GefRvHYAGFFDPGFG~~~~g~~gsraVLEVRs~evPF 321 (366) T PRK07559 244 AR--GDGELILDPGEFYILASREAVHVPPDYAAEMVPFDPLVGEFRVHYAGFFDPGFGHAEAGGTGSRAVLEVRSHEVPF 321 (366) T ss_pred EC--CCCEEEECCCCEEEEEECCEEEECCCHHHHCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCEEEEEEECCCCCE T ss_conf 37--8987997378679996344366577414313354446663444450212587787767787732899983487875 Q ss_pred EEECCCEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 994798889999976376642223754347232666773443430 Q gi|254780480|r 321 VLEHGQIIGRLKYESMDGEPERLYGEARGSHYQSQALKLSKHFHQ 365 (367) Q Consensus 321 ~l~~g~~i~~l~~~~~~~~p~~~Yg~~~~s~Yq~Qgl~lsk~f~~ 365 (367) +|+|||+||||+||+|++.|+.+||++++||||+||||||||||. T Consensus 322 ~le~gQ~v~rl~ye~m~~~p~~lYG~~i~SnYq~QgLkLsKhFk~ 366 (366) T PRK07559 322 ILEHGQIVGRLVYERMLERPDALYGAGIGSNYQAQGLKLSKHFRA 366 (366) T ss_pred EEECCCEEEEEEEEECCCCCCCCCCCCCCCCCHHCCCCHHHHCCC T ss_conf 876897889999887014871111677575411026642643269 No 3 >TIGR02274 dCTP_deam deoxycytidine triphosphate deaminase; InterPro: IPR011962 Members of this family include the Escherichia coli monofunctional deoxycytidine triphosphate deaminase (dCTP deaminase) and a Methanocaldococcus jannaschii bifunctional dCTP deaminase (3.5.4.13 from EC)/dUTP diphosphatase (3.6.1.23 from EC). For additional information please see .; GO: 0008829 dCTP deaminase activity, 0006229 dUTP biosynthetic process. Probab=100.00 E-value=0 Score=336.86 Aligned_cols=154 Identities=27% Similarity=0.461 Sum_probs=135.3 Q ss_pred CCCHHHHHHHHHCCCEEECCCCC---------HHHCCCCEEEEEECCEEEEEECCCCCCCCCCHHHHCCC-----CEEEE Q ss_conf 15868999999879849817898---------65657122588758989998123157888621100024-----04788 Q gi|254780480|r 5 VLPDKAIATLAERGDIVSEHPLE---------KNQIQPASIDLRLSSKAYRVRASFLPNIDGLVSDKIKH-----FKLRE 70 (367) Q Consensus 5 ILsd~~I~~li~~g~I~i~~p~~---------~~QIQPaSiDLRLG~~~~rv~asflp~~~~~v~~~l~~-----~~~~e 70 (367) ||||++|+++|++|.|.|| |++ ++||||||||||||++|..++.+.-+..|..-.+...+ ++..+ T Consensus 1 iLsD~dI~~~l~~g~l~Ie-P~~~~~~GkGvv~~~~~p~s~DlRlG~eF~~F~~~~~~~IDp~~~~~~~~~~~~~vMs~~ 79 (198) T TIGR02274 1 ILSDRDIKRKLEEGELKIE-PLEEANIGKGVVEEQIQPASVDLRLGNEFAVFENHTEAVIDPKNPKEAREYLFEKVMSDE 79 (198) T ss_pred CCCCHHHHHHHHCCCEEEC-CCCCCCCCCCEECCCCCCCEEEEEECCEEEEEECCCCCEECCCCCEEEEEEEEEEECCCC T ss_conf 9871788889847985671-888733556301024254125897377799981344513526886220035665432540 Q ss_pred --EECCCCEEECCCCEEEEEEEEEEECCCCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCC-CC Q ss_conf --736886486687329999888998499827998601343112503423207877554236674438999951686-52 Q gi|254780480|r 71 --IDLSGGTVLDANCVYIVPLMESLNLKNGISAYANPKSSIGRIDVFARVIGDRSQEFDSISPNYSGPLYLEILPRT-FP 147 (367) Q Consensus 71 --idl~~g~vLepg~~yLv~t~E~i~LP~di~a~~~pKSS~GRLgif~r~i~~~~~~f~~id~Gy~G~l~lEI~~~s-fp 147 (367) ++.++.|+|+||+|+|++|.|+|+||+|++|++++|||+||||||+|.||+ +|||||+|+||||++|.+ +| T Consensus 80 i~v~~~~~f~i~P~~f~La~T~E~v~lP~dvva~~~GRSSlaRlGl~~~~TAg------~iDPGf~G~ITLEl~N~~~~P 153 (198) T TIGR02274 80 IIVEEDEEFVIPPGEFVLATTLEYVKLPDDVVAFVEGRSSLARLGLFIHVTAG------WIDPGFEGNITLELFNAGKLP 153 (198) T ss_pred CCCCCCCEEEECCCCEEEEEEEEEEEECCCEEEEEECCCCEEEEEEECCCCCC------EECCCCCCEEEEEEECCCCCE T ss_conf 06048987996998768888899995088507888623011332014066211------786898764999987687870 Q ss_pred EEEECCCEEEEEEEEECC Q ss_conf 798339803568985088 Q gi|254780480|r 148 IVVRVGSRLSQIRFVKER 165 (367) Q Consensus 148 I~v~~G~rl~Qi~f~~g~ 165 (367) |+||||||||||+|.+-. T Consensus 154 v~L~~GmRi~QlvF~~l~ 171 (198) T TIGR02274 154 VKLRPGMRIAQLVFARLS 171 (198) T ss_pred EEECCCCEEEEEEEEECC T ss_conf 786589858999986417 No 4 >PHA01707 dut 2'-deoxyuridine 5'-triphosphatase Probab=100.00 E-value=2.2e-40 Score=283.42 Aligned_cols=135 Identities=21% Similarity=0.437 Sum_probs=122.0 Q ss_pred CCCCHHHHHHHHHCCCEEECCCCCHHHCCCCEEEEEECCEEEEEECCCCCCCCCCHHHHCCCCEEEEEECCCCEEECCCC Q ss_conf 61586899999987984981789865657122588758989998123157888621100024047887368864866873 Q gi|254780480|r 4 GVLPDKAIATLAERGDIVSEHPLEKNQIQPASIDLRLSSKAYRVRASFLPNIDGLVSDKIKHFKLREIDLSGGTVLDANC 83 (367) Q Consensus 4 GILsd~~I~~li~~g~I~i~~p~~~~QIQPaSiDLRLG~~~~rv~asflp~~~~~v~~~l~~~~~~eidl~~g~vLepg~ 83 (367) |||||++|++++++|.|.|+ |++++||||||+|||||+++++++.. + +.+.++.|+|+||+ T Consensus 1 MILSd~~I~~~i~~g~i~I~-Pf~~~~v~p~s~DLrLg~e~~~~~~~------------~------~~~~~~~fil~P~~ 61 (158) T PHA01707 1 MILSDRDIKYYINKGWLVIE-PLSEDTIRENGVDLKIGNEIVRIKEN------------M------EKEVGDEFIIYPHE 61 (158) T ss_pred CCCCHHHHHHHHHCCCEEEC-CCCCCCCCCCEEEEEECCEEEECCCC------------C------CCCCCCEEEECCCC T ss_conf 92678999999875998976-99910027743898847898981145------------4------12359859988998 Q ss_pred EEEEEEEEEEECCCCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCCCCEEEECCCEEEEEEEEE Q ss_conf 29999888998499827998601343112503423207877554236674438999951686527983398035689850 Q gi|254780480|r 84 VYIVPLMESLNLKNGISAYANPKSSIGRIDVFARVIGDRSQEFDSISPNYSGPLYLEILPRTFPIVVRVGSRLSQIRFVK 163 (367) Q Consensus 84 ~yLv~t~E~i~LP~di~a~~~pKSS~GRLgif~r~i~~~~~~f~~id~Gy~G~l~lEI~~~sfpI~v~~G~rl~Qi~f~~ 163 (367) ++|++|+|+++||+|++|++++|||+||+|++++.+ .|||||+|+||||++|.+.||++|+|||||||.|++ T Consensus 62 ~~L~~T~E~i~lP~d~~~~~~~RSS~aR~Gl~i~~t--------~iDpGf~G~~TLEl~n~~~pi~l~~G~rI~Ql~F~~ 133 (158) T PHA01707 62 HVLLTTKEYIKLPNDIIAFCNLRSTFARKGLLIPPT--------IVDAGFEGQLTIELVGSSIPVKLKSGERFLHLIFAR 133 (158) T ss_pred EEEEEEEEEEECCCCEEEEEEEECHHHHCCEEECCC--------EECCCEEEEEEEEEECCCCCEEECCCCEEEEEEEEE T ss_conf 899988899986856899998222476176663484--------008962527999996388888986899697999997 Q ss_pred CC Q ss_conf 88 Q gi|254780480|r 164 ER 165 (367) Q Consensus 164 g~ 165 (367) -. T Consensus 134 ~~ 135 (158) T PHA01707 134 TL 135 (158) T ss_pred CC T ss_conf 37 No 5 >PRK00416 dcd deoxycytidine triphosphate deaminase; Reviewed Probab=100.00 E-value=6.7e-39 Score=273.76 Aligned_cols=148 Identities=21% Similarity=0.293 Sum_probs=125.9 Q ss_pred CCCHHHHHHHHHCCCEEECCCCCHHHCCCCEEEEEECCEEEEEECCCCCCCCCCHHHHCCCCEEEEEECCCCEEECCCCE Q ss_conf 15868999999879849817898656571225887589899981231578886211000240478873688648668732 Q gi|254780480|r 5 VLPDKAIATLAERGDIVSEHPLEKNQIQPASIDLRLSSKAYRVRASFLPNIDGLVSDKIKHFKLREIDLSGGTVLDANCV 84 (367) Q Consensus 5 ILsd~~I~~li~~g~I~i~~p~~~~QIQPaSiDLRLG~~~~rv~asflp~~~~~v~~~l~~~~~~eidl~~g~vLepg~~ 84 (367) ||||++|++++++|.|.++ ||+++||||||+|||||+++.+++....+..|. +-.....+....+++|+|+||++ T Consensus 1 ILsd~~I~~~~~~g~i~I~-Pf~~~~i~p~syDlrLg~~~~v~~~~~~~~iD~----~~~~~~~~~~~~~e~~iL~Pg~~ 75 (174) T PRK00416 1 ILSDDDIIRRLAEGRLMIE-PFDPEQVRPNGYDLRLGNEFKVFTNHNSTVIDP----KNFDEDEFVDIEGEVFILPPGEF 75 (174) T ss_pred CCCHHHHHHHHHCCCEEEC-CCCHHHCCCCEEEEEECCEEEEEECCCCCEECC----CCCCCCEEEECCCCEEEECCCCE T ss_conf 9887999999974982786-999788387437378567889985688757747----77897548974798589779979 Q ss_pred EEEEEEEEEECCCCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCC-CCEEEECCCEEEEEEEEE Q ss_conf 9999888998499827998601343112503423207877554236674438999951686-527983398035689850 Q gi|254780480|r 85 YIVPLMESLNLKNGISAYANPKSSIGRIDVFARVIGDRSQEFDSISPNYSGPLYLEILPRT-FPIVVRVGSRLSQIRFVK 163 (367) Q Consensus 85 yLv~t~E~i~LP~di~a~~~pKSS~GRLgif~r~i~~~~~~f~~id~Gy~G~l~lEI~~~s-fpI~v~~G~rl~Qi~f~~ 163 (367) ||++|+|+++||+|++|++++|||+||+||++|.++ +||||+|+||||++|.+ .||++|||||+|||.|++ T Consensus 76 ~L~~T~E~i~lP~~~~~~~~~rSs~aR~Gl~v~~t~--------~dpG~~G~ltlel~N~~~~pv~l~~g~~i~Ql~F~~ 147 (174) T PRK00416 76 ALARTLEYFKLPDDVVGILEGRSSLARLGLIVHVTA--------IDPGWEGHITLEFSNTGPLPVKLYPGEGIGQLLFFE 147 (174) T ss_pred EEEEEEEECCCCCCEEEEEEECCHHHCCCEEEECCE--------ECCCCCEEEEEEEEECCCCCEEECCCCEEEEEEEEE T ss_conf 999998830598789999996350214735987111--------089841279999996888866574799199999996 Q ss_pred CC Q ss_conf 88 Q gi|254780480|r 164 ER 165 (367) Q Consensus 164 g~ 165 (367) -. T Consensus 148 ~~ 149 (174) T PRK00416 148 LS 149 (174) T ss_pred CC T ss_conf 68 No 6 >TIGR02274 dCTP_deam deoxycytidine triphosphate deaminase; InterPro: IPR011962 Members of this family include the Escherichia coli monofunctional deoxycytidine triphosphate deaminase (dCTP deaminase) and a Methanocaldococcus jannaschii bifunctional dCTP deaminase (3.5.4.13 from EC)/dUTP diphosphatase (3.6.1.23 from EC). For additional information please see .; GO: 0008829 dCTP deaminase activity, 0006229 dUTP biosynthetic process. Probab=100.00 E-value=1.2e-39 Score=278.69 Aligned_cols=180 Identities=26% Similarity=0.403 Sum_probs=152.4 Q ss_pred CCCHHHHHHHHHHCCCCCCCCCCC-------CCCCEEEEEEECCCCCCCCCCEEEEECCCCCEEECCCCCCCCHHHHH-- Q ss_conf 689999999875278603665544-------33731678881455456666304551267751301565555877866-- Q gi|254780480|r 167 FCSQEELSALHELSPLVQDGLRDF-------SGKGVALSVDLKGNEIGKKSIIGYRAKRHTAAIDVDSENKYDESDFW-- 237 (367) Q Consensus 167 ~LsD~eL~~lh~~~~LV~~~~~~~-------~~~Gl~~sVdL~~~~~~~~~iiGyrAkkht~~IDl~k~~~y~~~dfw-- 237 (367) +|||.+++++.+...|..+|..+. ...-.+.||||+-. +....|+ +++..+||+++........+- T Consensus 1 iLsD~dI~~~l~~g~l~IeP~~~~~~GkGvv~~~~~p~s~DlRlG----~eF~~F~-~~~~~~IDp~~~~~~~~~~~~~v 75 (198) T TIGR02274 1 ILSDRDIKRKLEEGELKIEPLEEANIGKGVVEEQIQPASVDLRLG----NEFAVFE-NHTEAVIDPKNPKEAREYLFEKV 75 (198) T ss_pred CCCCHHHHHHHHCCCEEECCCCCCCCCCCEECCCCCCCEEEEEEC----CEEEEEE-CCCCCEECCCCCEEEEEEEEEEE T ss_conf 987178888984798567188873355630102425412589737----7799981-34451352688622003566543 Q ss_pred --HHHCCCCCCCCEEEECCCCEEEEEEEEEEECCCCCEE--EEEEECCCCCCEEEECCEEECCCCCCCCCCCCCCEEEEE Q ss_conf --4311355777539988997899986334442834016--445304333421342160456886654345667504788 Q gi|254780480|r 238 --DPLFSWDAPEGQFILDPNEFYIFASREFIQIPPSLVA--EMIPYDHSIGEFRVHYAGFFDPGFGCVLPQEVGAKAVLE 313 (367) Q Consensus 238 --e~i~~~~~~~~~~~l~p~~F~l~~s~E~v~iP~~~aa--e~~~~~~~~g~~r~h~AGf~DpGf~~~~~~~~~~~~vLe 313 (367) +++.. .+++.|+|+||+|+|++|.|+|++|+|++| ++||+.+|+|.|.+.+||||||||.|++| || T Consensus 76 Ms~~i~v--~~~~~f~i~P~~f~La~T~E~v~lP~dvva~~~GRSSlaRlGl~~~~TAg~iDPGf~G~IT--------LE 145 (198) T TIGR02274 76 MSDEIIV--EEDEEFVIPPGEFVLATTLEYVKLPDDVVAFVEGRSSLARLGLFIHVTAGWIDPGFEGNIT--------LE 145 (198) T ss_pred CCCCCCC--CCCCEEEECCCCEEEEEEEEEEEECCCEEEEEECCCCEEEEEEECCCCCCEECCCCCCEEE--------EE T ss_conf 2540060--4898799699876888889999508850788862301133201406621178689876499--------99 Q ss_pred EC--CCCCEEEECCCEEEEEEEEECCCCCCCCCCC-CCCCCCCCC-CCCHHH Q ss_conf 61--7888899479888999997637664222375-434723266-677344 Q gi|254780480|r 314 VR--SSVPFVLEHGQIIGRLKYESMDGEPERLYGE-ARGSHYQSQ-ALKLSK 361 (367) Q Consensus 314 vr--~~~p~~l~~g~~i~~l~~~~~~~~p~~~Yg~-~~~s~Yq~Q-gl~lsk 361 (367) +- +..|+.|++|||||||||++|+++++++|+. ..+||||+| |+++|| T Consensus 146 l~N~~~~Pv~L~~GmRi~QlvF~~l~~P~~~pY~~p~~~gKYqgQ~G~~~Sr 197 (198) T TIGR02274 146 LFNAGKLPVKLRPGMRIAQLVFARLSSPAERPYGGPRKSGKYQGQTGVTPSR 197 (198) T ss_pred EECCCCCEEEECCCCEEEEEEEEECCCCHHHCCCCCCCCCCCCCCCCCCCCC T ss_conf 8768787078658985899998641785011488643143403787767888 No 7 >COG0717 Dcd Deoxycytidine deaminase [Nucleotide transport and metabolism] Probab=100.00 E-value=7.5e-35 Score=247.33 Aligned_cols=152 Identities=28% Similarity=0.450 Sum_probs=114.6 Q ss_pred CCCCHHHHHHHHHCCCEEECCCCCHH--HCCCCEEEEEECCEEEEEECCCCCCCCCCHHHHCCCCEEEEEECCC-CEEEC Q ss_conf 61586899999987984981789865--6571225887589899981231578886211000240478873688-64866 Q gi|254780480|r 4 GVLPDKAIATLAERGDIVSEHPLEKN--QIQPASIDLRLSSKAYRVRASFLPNIDGLVSDKIKHFKLREIDLSG-GTVLD 80 (367) Q Consensus 4 GILsd~~I~~li~~g~I~i~~p~~~~--QIQPaSiDLRLG~~~~rv~asflp~~~~~v~~~l~~~~~~eidl~~-g~vLe 80 (367) +||||++|++++.+|.+.++ |+.+. ||||+|+|||||++|..++...-+..|..-.+.. ......+..++ .|+|+ T Consensus 1 milsd~~I~~~i~~~~~~i~-p~~~~~~~i~p~g~Dlrl~~ef~vf~~~~~~~iD~~~~~~~-~~~~~~~~~e~~~~il~ 78 (183) T COG0717 1 MILSDRDIRKMVEEGRLLIE-PFEDKEYQIQPAGYDLRLGNEFRVFRNEGAGVIDPDNPDEE-DPLVEEEELEDGEFILP 78 (183) T ss_pred CCCCHHHHHHHHHCCCCEEC-CCCCCCCCCCCCCEEEEECCEEEEEECCCCCEECCCCCCCC-CCCCEEEECCCCCEEEC T ss_conf 93578899999750763230-47774444366547788311789996368857867666656-66410341467638978 Q ss_pred CCCEEEEEEEEEEECCCCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCC-CCEEEECCCEEEEE Q ss_conf 87329999888998499827998601343112503423207877554236674438999951686-52798339803568 Q gi|254780480|r 81 ANCVYIVPLMESLNLKNGISAYANPKSSIGRIDVFARVIGDRSQEFDSISPNYSGPLYLEILPRT-FPIVVRVGSRLSQI 159 (367) Q Consensus 81 pg~~yLv~t~E~i~LP~di~a~~~pKSS~GRLgif~r~i~~~~~~f~~id~Gy~G~l~lEI~~~s-fpI~v~~G~rl~Qi 159 (367) |+++||++|+|+++||+|++|+++||||++|+|++++++ ++||||+|++|||++|.+ .|+++|+|||++|| T Consensus 79 P~~~~L~~t~E~i~iP~~v~~~~~gRSslaR~G~~~~~~--------~~DpGf~G~~tle~~n~~~~p~~L~~g~rI~QL 150 (183) T COG0717 79 PGEFYLAVTLEYVEIPEDVAAFCTGRSSLARLGLIVHVG--------VIDPGFEGRITLELVNSGPLPIRLYPGERIAQL 150 (183) T ss_pred CCCEEEEEEEEEEECCCCEEEEEECCCCHHHCCEEECCC--------CCCCCCCCEEEEEEEECCCCCEEECCCCEEEEE T ss_conf 995899998889982806289997347065183786473--------488984865899999678888487589999999 Q ss_pred EEEECC Q ss_conf 985088 Q gi|254780480|r 160 RFVKER 165 (367) Q Consensus 160 ~f~~g~ 165 (367) .|.+.. T Consensus 151 vF~~l~ 156 (183) T COG0717 151 VFLELD 156 (183) T ss_pred EEEECC T ss_conf 999745 No 8 >COG0717 Dcd Deoxycytidine deaminase [Nucleotide transport and metabolism] Probab=100.00 E-value=1.2e-34 Score=246.13 Aligned_cols=180 Identities=25% Similarity=0.337 Sum_probs=155.8 Q ss_pred CCCCHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCEEEEECCCCCEEECCCCCCCCHHHHHHHHCCCCC Q ss_conf 76899999998752786036655443373167888145545666630455126775130156555587786643113557 Q gi|254780480|r 166 KFCSQEELSALHELSPLVQDGLRDFSGKGVALSVDLKGNEIGKKSIIGYRAKRHTAAIDVDSENKYDESDFWDPLFSWDA 245 (367) Q Consensus 166 ~~LsD~eL~~lh~~~~LV~~~~~~~~~~Gl~~sVdL~~~~~~~~~iiGyrAkkht~~IDl~k~~~y~~~dfwe~i~~~~~ 245 (367) ++|||.++.++-.+..++..+..+...+-.+.++||+.. +-.+...+.++++||.++.+..++...|++..- T Consensus 1 milsd~~I~~~i~~~~~~i~p~~~~~~~i~p~g~Dlrl~-----~ef~vf~~~~~~~iD~~~~~~~~~~~~~~~~e~--- 72 (183) T COG0717 1 MILSDRDIRKMVEEGRLLIEPFEDKEYQIQPAGYDLRLG-----NEFRVFRNEGAGVIDPDNPDEEDPLVEEEELED--- 72 (183) T ss_pred CCCCHHHHHHHHHCCCCEECCCCCCCCCCCCCCEEEEEC-----CEEEEEECCCCCEECCCCCCCCCCCCEEEECCC--- T ss_conf 935788999997507632304777444436654778831-----178999636885786766665666410341467--- Q ss_pred CCCEEEECCCCEEEEEEEEEEECCCCCEEEEEEECCCCCCEEEECCEEECCCCCCCCCCCCCCEEEEEECC-CCCEEEEC Q ss_conf 77539988997899986334442834016445304333421342160456886654345667504788617-88889947 Q gi|254780480|r 246 PEGQFILDPNEFYIFASREFIQIPPSLVAEMIPYDHSIGEFRVHYAGFFDPGFGCVLPQEVGAKAVLEVRS-SVPFVLEH 324 (367) Q Consensus 246 ~~~~~~l~p~~F~l~~s~E~v~iP~~~aae~~~~~~~~g~~r~h~AGf~DpGf~~~~~~~~~~~~vLevr~-~~p~~l~~ 324 (367) +.++|+|++|||++|.|+|+||+|+||.|.+.++.+++++.+++|||||||.|.+|++ .+.+ +.|+.|++ T Consensus 73 --~~~il~P~~~~L~~t~E~i~iP~~v~~~~~gRSslaR~G~~~~~~~~DpGf~G~~tle-------~~n~~~~p~~L~~ 143 (183) T COG0717 73 --GEFILPPGEFYLAVTLEYVEIPEDVAAFCTGRSSLARLGLIVHVGVIDPGFEGRITLE-------LVNSGPLPIRLYP 143 (183) T ss_pred --CCEEECCCCEEEEEEEEEEECCCCEEEEEECCCCHHHCCEEECCCCCCCCCCCEEEEE-------EEECCCCCEEECC T ss_conf --6389789958999988899828062899973470651837864734889848658999-------9967888848758 Q ss_pred CCEEEEEEEEECCCCCCCCCCCCCCCCCCCC-CCCHHHHH Q ss_conf 9888999997637664222375434723266-67734434 Q gi|254780480|r 325 GQIIGRLKYESMDGEPERLYGEARGSHYQSQ-ALKLSKHF 363 (367) Q Consensus 325 g~~i~~l~~~~~~~~p~~~Yg~~~~s~Yq~Q-gl~lsk~f 363 (367) ||+||||+|++|+.+++++|+.. +||||+| |+++||-. T Consensus 144 g~rI~QLvF~~l~~~~~~~Y~~~-~gkYq~q~gv~~~~~~ 182 (183) T COG0717 144 GERIAQLVFLELDSPAERPYSGR-GGKYQGQRGVTPSRIY 182 (183) T ss_pred CCEEEEEEEEECCCCCCCCCCCC-CCCCCCCCCCCCCCCC T ss_conf 99999999997456553464677-8742467676753024 No 9 >PRK02253 deoxyuridine 5'-triphosphate nucleotidohydrolase; Provisional Probab=99.98 E-value=1.1e-31 Score=226.64 Aligned_cols=139 Identities=22% Similarity=0.256 Sum_probs=111.9 Q ss_pred HCCCEEECCCCCHHHCCCCEEEEEECCEEEEEECCCCCCCCCCHHHHCCCCEEEEEECCCCEEECCCCEEEEEEEEEEEC Q ss_conf 87984981789865657122588758989998123157888621100024047887368864866873299998889984 Q gi|254780480|r 16 ERGDIVSEHPLEKNQIQPASIDLRLSSKAYRVRASFLPNIDGLVSDKIKHFKLREIDLSGGTVLDANCVYIVPLMESLNL 95 (367) Q Consensus 16 ~~g~I~i~~p~~~~QIQPaSiDLRLG~~~~rv~asflp~~~~~v~~~l~~~~~~eidl~~g~vLepg~~yLv~t~E~i~L 95 (367) ++|.++.+.+..++||||||+|||||+.+.....+.++-.+. .+.+.+.+..+++.++.++|+||+ |+++|+|+|+| T Consensus 3 ~~~~~~~~~~d~e~QiQP~gvDLrLg~~~~~~~~g~i~~~~~--~~~~~~~~~ve~~~d~~~~L~pg~-yll~t~E~v~i 79 (159) T PRK02253 3 RAGPLLEDAVDLDTQIQPNGVDLTLKEIERQEGPGRIDFDNS--RRKLPEREPLEFDDDGWIHLPPGI-YKVVFNEIVNI 79 (159) T ss_pred CCCHHHCCCCCHHHCCCCCCEEEECCCEEEEECCCCCCCCCH--HHCCCCEEEEEECCCCEEEECCCE-EEEEEEEEEEC T ss_conf 566133168875675783638998987999823875068864--651674179640799549967988-99998669973 Q ss_pred CCCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCC-CCEEEECCCEEEEEEEEECC Q ss_conf 99827998601343112503423207877554236674438999951686-52798339803568985088 Q gi|254780480|r 96 KNGISAYANPKSSIGRIDVFARVIGDRSQEFDSISPNYSGPLYLEILPRT-FPIVVRVGSRLSQIRFVKER 165 (367) Q Consensus 96 P~di~a~~~pKSS~GRLgif~r~i~~~~~~f~~id~Gy~G~l~lEI~~~s-fpI~v~~G~rl~Qi~f~~g~ 165 (367) |+|++|+++||||++|+|++++... +||||+|++++++.+.+ .||++++|||++||.|++-. T Consensus 80 P~d~~a~~~~RSSl~R~G~~i~~av--------~DpGf~G~~~~~l~~~n~~~i~l~~g~Ri~Qlvf~~~~ 142 (159) T PRK02253 80 PKDHVAFAYPRSSLLRCGCTLETAV--------WDAGYEGRSESLLVVHNPHGIRLERGARIAQLVFIRLE 142 (159) T ss_pred CCCCEEEEECHHHHEEEEEEECCCC--------CCCCCCCEEEEEEEECCCCCEEEECCCEEEEEEEEECC T ss_conf 8061999961114303107864763--------28983150699999868877798689999999999836 No 10 >PHA01707 dut 2'-deoxyuridine 5'-triphosphatase Probab=99.96 E-value=1.4e-28 Score=206.39 Aligned_cols=153 Identities=24% Similarity=0.356 Sum_probs=126.5 Q ss_pred CCCCHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCEEEEECCCCCEEECCCCCCCCHHHHHHHHCCCCC Q ss_conf 76899999998752786036655443373167888145545666630455126775130156555587786643113557 Q gi|254780480|r 166 KFCSQEELSALHELSPLVQDGLRDFSGKGVALSVDLKGNEIGKKSIIGYRAKRHTAAIDVDSENKYDESDFWDPLFSWDA 245 (367) Q Consensus 166 ~~LsD~eL~~lh~~~~LV~~~~~~~~~~Gl~~sVdL~~~~~~~~~iiGyrAkkht~~IDl~k~~~y~~~dfwe~i~~~~~ 245 (367) ++|||.+++++.++..|+.++.+. .+-.+.|+||+.. +.++-+ +++ +..+ T Consensus 1 MILSd~~I~~~i~~g~i~I~Pf~~--~~v~p~s~DLrLg----~e~~~~--~~~--------------------~~~~-- 50 (158) T PHA01707 1 MILSDRDIKYYINKGWLVIEPLSE--DTIRENGVDLKIG----NEIVRI--KEN--------------------MEKE-- 50 (158) T ss_pred CCCCHHHHHHHHHCCCEEECCCCC--CCCCCCEEEEEEC----CEEEEC--CCC--------------------CCCC-- T ss_conf 926789999998759989769991--0027743898847----898981--145--------------------4123-- Q ss_pred CCCEEEECCCCEEEEEEEEEEECCCCCEE--EEEEECCCCCCEEEECCEEECCCCCCCCCCCCCCEEEEEECC-CCCEEE Q ss_conf 77539988997899986334442834016--445304333421342160456886654345667504788617-888899 Q gi|254780480|r 246 PEGQFILDPNEFYIFASREFIQIPPSLVA--EMIPYDHSIGEFRVHYAGFFDPGFGCVLPQEVGAKAVLEVRS-SVPFVL 322 (367) Q Consensus 246 ~~~~~~l~p~~F~l~~s~E~v~iP~~~aa--e~~~~~~~~g~~r~h~AGf~DpGf~~~~~~~~~~~~vLevr~-~~p~~l 322 (367) ..+.++|+||+|+|++|.|+|++|+|++| ++|++.+|.|.+ +| ++|+||||.|++| ||+.+ ..|+.| T Consensus 51 ~~~~fil~P~~~~L~~T~E~i~lP~d~~~~~~~RSS~aR~Gl~-i~-~t~iDpGf~G~~T--------LEl~n~~~pi~l 120 (158) T PHA01707 51 VGDEFIIYPHEHVLLTTKEYIKLPNDIIAFCNLRSTFARKGLL-IP-PTIVDAGFEGQLT--------IELVGSSIPVKL 120 (158) T ss_pred CCCEEEECCCCEEEEEEEEEEECCCCEEEEEEEECHHHHCCEE-EC-CCEECCCEEEEEE--------EEEECCCCCEEE T ss_conf 5985998899889998889998685689999822247617666-34-8400896252799--------999638888898 Q ss_pred ECCCEEEEEEEEECCCCCCCCCCCCCCCCCCCC-CCCHHHH Q ss_conf 479888999997637664222375434723266-6773443 Q gi|254780480|r 323 EHGQIIGRLKYESMDGEPERLYGEARGSHYQSQ-ALKLSKH 362 (367) Q Consensus 323 ~~g~~i~~l~~~~~~~~p~~~Yg~~~~s~Yq~Q-gl~lsk~ 362 (367) ++||+||||+|++++++++++| .+|||+| |.++||. T Consensus 121 ~~G~rI~Ql~F~~~~~~~~~~Y----~GkYq~q~g~t~sk~ 157 (158) T PHA01707 121 KSGERFLHLIFARTLTPVEKPY----NGKYQKQKGVTLAKE 157 (158) T ss_pred CCCCEEEEEEEEECCCCCCCCC----CCCCCCCCCCCCCCC T ss_conf 6899697999997378666782----888589989888888 No 11 >PRK00416 dcd deoxycytidine triphosphate deaminase; Reviewed Probab=99.95 E-value=5.6e-27 Score=195.94 Aligned_cols=168 Identities=24% Similarity=0.387 Sum_probs=132.2 Q ss_pred CCCHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCEEEEECCCCCEEECCCCCCCCHHHHHHHHCCCCCC Q ss_conf 68999999987527860366554433731678881455456666304551267751301565555877866431135577 Q gi|254780480|r 167 FCSQEELSALHELSPLVQDGLRDFSGKGVALSVDLKGNEIGKKSIIGYRAKRHTAAIDVDSENKYDESDFWDPLFSWDAP 246 (367) Q Consensus 167 ~LsD~eL~~lh~~~~LV~~~~~~~~~~Gl~~sVdL~~~~~~~~~iiGyrAkkht~~IDl~k~~~y~~~dfwe~i~~~~~~ 246 (367) +|||.+++++.+...++.++.+. .+-.+.|+||+.. +.+.-|. ..+..+||..+.... +.+ ..+ T Consensus 1 ILsd~~I~~~~~~g~i~I~Pf~~--~~i~p~syDlrLg----~~~~v~~-~~~~~~iD~~~~~~~------~~~---~~~ 64 (174) T PRK00416 1 ILSDDDIIRRLAEGRLMIEPFDP--EQVRPNGYDLRLG----NEFKVFT-NHNSTVIDPKNFDED------EFV---DIE 64 (174) T ss_pred CCCHHHHHHHHHCCCEEECCCCH--HHCCCCEEEEEEC----CEEEEEE-CCCCCEECCCCCCCC------EEE---ECC T ss_conf 98879999999749827869997--8838743737856----7889985-688757747778975------489---747 Q ss_pred CCEEEECCCCEEEEEEEEEEECCCCCEE--EEEEECCCCCCEEEECCEEECCCCCCCCCCCCCCEEEEEECC--CCCEEE Q ss_conf 7539988997899986334442834016--445304333421342160456886654345667504788617--888899 Q gi|254780480|r 247 EGQFILDPNEFYIFASREFIQIPPSLVA--EMIPYDHSIGEFRVHYAGFFDPGFGCVLPQEVGAKAVLEVRS--SVPFVL 322 (367) Q Consensus 247 ~~~~~l~p~~F~l~~s~E~v~iP~~~aa--e~~~~~~~~g~~r~h~AGf~DpGf~~~~~~~~~~~~vLevr~--~~p~~l 322 (367) ++.++|+||+|||+.|.|+|++|+|++| +.+++.+|.|.+ +| ++++||||.|.+| ||+.| ..|+.| T Consensus 65 ~e~~iL~Pg~~~L~~T~E~i~lP~~~~~~~~~rSs~aR~Gl~-v~-~t~~dpG~~G~lt--------lel~N~~~~pv~l 134 (174) T PRK00416 65 GEVFILPPGEFALARTLEYFKLPDDVVGILEGRSSLARLGLI-VH-VTAIDPGWEGHIT--------LEFSNTGPLPVKL 134 (174) T ss_pred CCEEEECCCCEEEEEEEEECCCCCCEEEEEEECCHHHCCCEE-EE-CCEECCCCCEEEE--------EEEEECCCCCEEE T ss_conf 985897799799999988305987899999963502147359-87-1110898412799--------9999688886657 Q ss_pred ECCCEEEEEEEEECCCCCCCCCCCCCCCCCCCC-CCCHHH Q ss_conf 479888999997637664222375434723266-677344 Q gi|254780480|r 323 EHGQIIGRLKYESMDGEPERLYGEARGSHYQSQ-ALKLSK 361 (367) Q Consensus 323 ~~g~~i~~l~~~~~~~~p~~~Yg~~~~s~Yq~Q-gl~lsk 361 (367) ++||+||||+|++|+++|+++|+. ..+|||+| |.++|| T Consensus 135 ~~g~~i~Ql~F~~~~~~~~~~Y~~-~~GkYqgq~g~~~sk 173 (174) T PRK00416 135 YPGEGIGQLLFFELSGPAERPYAD-RKGKYQGQRGVTLSR 173 (174) T ss_pred CCCCEEEEEEEEECCCCCCCCCCC-CCCCCCCCCCCCCCC T ss_conf 479919999999668865557554-586457989987778 No 12 >PRK02253 deoxyuridine 5'-triphosphate nucleotidohydrolase; Provisional Probab=99.93 E-value=1.5e-25 Score=186.71 Aligned_cols=154 Identities=25% Similarity=0.324 Sum_probs=116.3 Q ss_pred HCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCEEEEECCCCCEEECCCCCCCCHHHHHHHHCCCCCCCCEEEECCCCEE Q ss_conf 27860366554433731678881455456666304551267751301565555877866431135577753998899789 Q gi|254780480|r 179 LSPLVQDGLRDFSGKGVALSVDLKGNEIGKKSIIGYRAKRHTAAIDVDSENKYDESDFWDPLFSWDAPEGQFILDPNEFY 258 (367) Q Consensus 179 ~~~LV~~~~~~~~~~Gl~~sVdL~~~~~~~~~iiGyrAkkht~~IDl~k~~~y~~~dfwe~i~~~~~~~~~~~l~p~~F~ 258 (367) ...|+.+.. +...+.++.||||+.+ .+ ||- ...+-||++.... ....++++.. ..++.++|+||+ | T Consensus 4 ~~~~~~~~~-d~e~QiQP~gvDLrLg-----~~--~~~-~~~g~i~~~~~~~--~~~~~~~ve~--~~d~~~~L~pg~-y 69 (159) T PRK02253 4 AGPLLEDAV-DLDTQIQPNGVDLTLK-----EI--ERQ-EGPGRIDFDNSRR--KLPEREPLEF--DDDGWIHLPPGI-Y 69 (159) T ss_pred CCHHHCCCC-CHHHCCCCCCEEEECC-----CE--EEE-ECCCCCCCCCHHH--CCCCEEEEEE--CCCCEEEECCCE-E T ss_conf 661331688-7567578363899898-----79--998-2387506886465--1674179640--799549967988-9 Q ss_pred EEEEEEEEECCCCCEE--EEEEECCCCCCEEEECCEEECCCCCCCCCCCCCCEEEEEECCCCCEEEECCCEEEEEEEEEC Q ss_conf 9986334442834016--44530433342134216045688665434566750478861788889947988899999763 Q gi|254780480|r 259 IFASREFIQIPPSLVA--EMIPYDHSIGEFRVHYAGFFDPGFGCVLPQEVGAKAVLEVRSSVPFVLEHGQIIGRLKYESM 336 (367) Q Consensus 259 l~~s~E~v~iP~~~aa--e~~~~~~~~g~~r~h~AGf~DpGf~~~~~~~~~~~~vLevr~~~p~~l~~g~~i~~l~~~~~ 336 (367) +++|.|+|+||+|+|| +.||+..|.|.+ +|+|+| ||||.|+.+. +|++.+..|+.|+.|||||||+|++| T Consensus 70 ll~t~E~v~iP~d~~a~~~~RSSl~R~G~~-i~~av~-DpGf~G~~~~------~l~~~n~~~i~l~~g~Ri~Qlvf~~~ 141 (159) T PRK02253 70 KVVFNEIVNIPKDHVAFAYPRSSLLRCGCT-LETAVW-DAGYEGRSES------LLVVHNPHGIRLERGARIAQLVFIRL 141 (159) T ss_pred EEEEEEEEECCCCCEEEEECHHHHEEEEEE-ECCCCC-CCCCCCEEEE------EEEECCCCCEEEECCCEEEEEEEEEC T ss_conf 999866997380619999611143031078-647632-8983150699------99986887779868999999999983 Q ss_pred CCCCCCCCCCCCCCCCCCCCCC Q ss_conf 7664222375434723266677 Q gi|254780480|r 337 DGEPERLYGEARGSHYQSQALK 358 (367) Q Consensus 337 ~~~p~~~Yg~~~~s~Yq~Qgl~ 358 (367) +++++.+| +++||||.++ T Consensus 142 ~~~~e~~Y----~G~YQgqr~~ 159 (159) T PRK02253 142 EAEHESGY----SGSYQGENID 159 (159) T ss_pred CCCCCCCC----CCCCCCCCCC T ss_conf 68777881----7786776279 No 13 >cd07557 trimeric_dUTPase Trimeric dUTP diphosphatases. Trimeric dUTP diphosphatases, or dUTPases, are the most common family of dUTPase, found in bacteria, eukaryotes, and archaea. They catalyze the hydrolysis of the dUTP-Mg complex (dUTP-Mg) into dUMP and pyrophosphate. This reaction is crucial for the preservation of chromosomal integrity as it removes dUTP and therefore reduces the cellular dUTP/dTTP ratio, and prevents dUTP from being incorporated into DNA. It also provides dUMP as the precursor for dTTP synthesis via the thymidylate synthase pathway. dUTPases are homotrimeric, except some monomeric viral dUTPases, which have been shown to mimic a trimer. Active sites are located at the subunit interface. Probab=99.78 E-value=1.8e-18 Score=140.43 Aligned_cols=91 Identities=32% Similarity=0.530 Sum_probs=83.4 Q ss_pred CEEEEEECCEEEEEECCCCCCCCCCHHHHCCCCEEEEEECCCCEEECCCCEEEEEEEEEEECCCCCEEEEECCCCCCCCC Q ss_conf 22588758989998123157888621100024047887368864866873299998889984998279986013431125 Q gi|254780480|r 34 ASIDLRLSSKAYRVRASFLPNIDGLVSDKIKHFKLREIDLSGGTVLDANCVYIVPLMESLNLKNGISAYANPKSSIGRID 113 (367) Q Consensus 34 aSiDLRLG~~~~rv~asflp~~~~~v~~~l~~~~~~eidl~~g~vLepg~~yLv~t~E~i~LP~di~a~~~pKSS~GRLg 113 (367) ||+||++++.+ +.++|+|++.++++|.|++.+|++++|.+.||||++|.| T Consensus 1 ag~Dl~~~~~~------------------------------~~~~i~p~~~~li~t~~~i~~p~~~~~~i~~RSsl~~~G 50 (92) T cd07557 1 AGYDLRLGEDF------------------------------EGIVLPPGETVLVPTGEAIELPEGYVGLVFPRSSLARKG 50 (92) T ss_pred CCEEEEECCCC------------------------------CCEEECCCCEEEEEEEEEEECCCCCEEEEEECCHHEECC T ss_conf 95646754468------------------------------858999998999980399998999489999346330726 Q ss_pred CEEEEEECCCCCCCCCCCCCCEEEEEEECCCC-CCEEEECCCEEEEEEE Q ss_conf 03423207877554236674438999951686-5279833980356898 Q gi|254780480|r 114 VFARVIGDRSQEFDSISPNYSGPLYLEILPRT-FPIVVRVGSRLSQIRF 161 (367) Q Consensus 114 if~r~i~~~~~~f~~id~Gy~G~l~lEI~~~s-fpI~v~~G~rl~Qi~f 161 (367) +.++. ++ .|||||+|+|++++.|.+ .|+.+++|+|++|+.| T Consensus 51 l~v~~-~g------~iD~gy~G~i~~~l~N~~~~~~~i~~G~riaQlvf 92 (92) T cd07557 51 ITVHN-AG------VIDPGYRGEITLELYNLGPEPVVIKKGDRIAQLVF 92 (92) T ss_pred EEEEC-CC------EECCCCCCEEEEEEEECCCCCEEECCCCEEEEEEC T ss_conf 69734-41------48689876799999999899889879997999979 No 14 >pfam00692 dUTPase dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate. Probab=99.40 E-value=2.6e-12 Score=100.16 Aligned_cols=85 Identities=15% Similarity=0.199 Sum_probs=76.6 Q ss_pred CCCCEEECCCCEEEEEEEEEEECCCCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCC-CCEEEE Q ss_conf 6886486687329999888998499827998601343112503423207877554236674438999951686-527983 Q gi|254780480|r 73 LSGGTVLDANCVYIVPLMESLNLKNGISAYANPKSSIGRIDVFARVIGDRSQEFDSISPNYSGPLYLEILPRT-FPIVVR 151 (367) Q Consensus 73 l~~g~vLepg~~yLv~t~E~i~LP~di~a~~~pKSS~GRLgif~r~i~~~~~~f~~id~Gy~G~l~lEI~~~s-fpI~v~ 151 (367) ..+.++++|++..+++|...+.+|++.+|++.||||+++.++.+.. ..||+||.|+|.+.+.|.+ .|+.+. T Consensus 20 ~~~~~~i~p~~~~~i~tgi~~~ip~g~~~~i~~RSs~~~kgl~~~~--------GiID~dyrGei~v~l~N~~~~~~~i~ 91 (129) T pfam00692 20 APYDFTLPPGGTVLVPTDIRIPLPPGTFGLILGRSSLAAKGLIVLP--------GIIDSDYRGEIKVVLFNSGKKDFTIK 91 (129) T ss_pred CCCCEEECCCCEEEEECCEEEECCCCEEEEEECCCCCHHCCCCCCC--------CEECCCCCCEEEEEEEECCCCCEEEE T ss_conf 2899799999989998998998899949999616610002664458--------78887898518999999989999991 Q ss_pred CCCEEEEEEEEECC Q ss_conf 39803568985088 Q gi|254780480|r 152 VGSRLSQIRFVKER 165 (367) Q Consensus 152 ~G~rl~Qi~f~~g~ 165 (367) +|+||+||.|.+-. T Consensus 92 ~G~RiaQli~~~~~ 105 (129) T pfam00692 92 KGDRIAQLILLPCV 105 (129) T ss_pred CCCEEEEEEEEECC T ss_conf 79859999984217 No 15 >cd07557 trimeric_dUTPase Trimeric dUTP diphosphatases. Trimeric dUTP diphosphatases, or dUTPases, are the most common family of dUTPase, found in bacteria, eukaryotes, and archaea. They catalyze the hydrolysis of the dUTP-Mg complex (dUTP-Mg) into dUMP and pyrophosphate. This reaction is crucial for the preservation of chromosomal integrity as it removes dUTP and therefore reduces the cellular dUTP/dTTP ratio, and prevents dUTP from being incorporated into DNA. It also provides dUMP as the precursor for dTTP synthesis via the thymidylate synthase pathway. dUTPases are homotrimeric, except some monomeric viral dUTPases, which have been shown to mimic a trimer. Active sites are located at the subunit interface. Probab=99.35 E-value=5.4e-12 Score=98.13 Aligned_cols=79 Identities=29% Similarity=0.515 Sum_probs=65.2 Q ss_pred CEEEECCCCEEEEEEEEEEECCCCCEEEEEEECC--CCCCEEEECCEEECCCCCCCCCCCCCCEEEEEECCCCCEEEECC Q ss_conf 5399889978999863344428340164453043--33421342160456886654345667504788617888899479 Q gi|254780480|r 248 GQFILDPNEFYIFASREFIQIPPSLVAEMIPYDH--SIGEFRVHYAGFFDPGFGCVLPQEVGAKAVLEVRSSVPFVLEHG 325 (367) Q Consensus 248 ~~~~l~p~~F~l~~s~E~v~iP~~~aae~~~~~~--~~g~~r~h~AGf~DpGf~~~~~~~~~~~~vLevr~~~p~~l~~g 325 (367) +.++|+|+++++..|.|++.+|++++|.+.+.++ +.| ..+|++|++||||.|.++.+ |.+.++.|+.|+.| T Consensus 12 ~~~~i~p~~~~li~t~~~i~~p~~~~~~i~~RSsl~~~G-l~v~~~g~iD~gy~G~i~~~------l~N~~~~~~~i~~G 84 (92) T cd07557 12 EGIVLPPGETVLVPTGEAIELPEGYVGLVFPRSSLARKG-ITVHNAGVIDPGYRGEITLE------LYNLGPEPVVIKKG 84 (92) T ss_pred CCEEECCCCEEEEEEEEEEECCCCCEEEEEECCHHEECC-EEEECCCEECCCCCCEEEEE------EEECCCCCEEECCC T ss_conf 858999998999980399998999489999346330726-69734414868987679999------99998998898799 Q ss_pred CEEEEEEE Q ss_conf 88899999 Q gi|254780480|r 326 QIIGRLKY 333 (367) Q Consensus 326 ~~i~~l~~ 333 (367) ++||||+| T Consensus 85 ~riaQlvf 92 (92) T cd07557 85 DRIAQLVF 92 (92) T ss_pred CEEEEEEC T ss_conf 97999979 No 16 >PRK00601 dut deoxyuridine 5'-triphosphate nucleotidohydrolase; Provisional Probab=99.22 E-value=9.9e-11 Score=89.87 Aligned_cols=85 Identities=20% Similarity=0.369 Sum_probs=74.5 Q ss_pred CCCEEECCCCEEEEEEEEEEECCCCCEEEEECCCCCC-CCCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCC-CCEEEE Q ss_conf 8864866873299998889984998279986013431-12503423207877554236674438999951686-527983 Q gi|254780480|r 74 SGGTVLDANCVYIVPLMESLNLKNGISAYANPKSSIG-RIDVFARVIGDRSQEFDSISPNYSGPLYLEILPRT-FPIVVR 151 (367) Q Consensus 74 ~~g~vLepg~~yLv~t~E~i~LP~di~a~~~pKSS~G-RLgif~r~i~~~~~~f~~id~Gy~G~l~lEI~~~s-fpI~v~ 151 (367) ++..+++|++..+++|-=.+.+|++..|++.||||++ +.||.+.-.++ .||+||.|+|.+-+.|.+ -|+.+. T Consensus 36 ~~~~vi~p~~~~~I~TGi~i~iP~g~~~~I~~RSsla~K~gl~v~N~~G------vID~dYrGei~v~l~N~s~~~~~i~ 109 (149) T PRK00601 36 DEPVTLAPGETALVPTGLAIHIPDGYAAQILPRSGLAHKHGIVLGNLPG------TIDSDYRGELMVSLWNRGQEPFTIE 109 (149) T ss_pred CCCEEECCCCEEEECCCEEEECCCCCEEEEEECCCCCCCCCEECCCCCC------CCCCCCCCEEEEEEEECCCCCEEEC T ss_conf 8978889998899718979987999399998568833004341058653------4278888548999998999877857 Q ss_pred CCCEEEEEEEEEC Q ss_conf 3980356898508 Q gi|254780480|r 152 VGSRLSQIRFVKE 164 (367) Q Consensus 152 ~G~rl~Qi~f~~g 164 (367) +|+||||+.|..- T Consensus 110 ~GdRIaQlv~~~~ 122 (149) T PRK00601 110 PGERIAQLVIVPV 122 (149) T ss_pred CCCEEEEEEEEEC T ss_conf 9998999998324 No 17 >PRK13956 dut deoxyuridine 5'-triphosphate nucleotidohydrolase; Provisional Probab=98.89 E-value=1.5e-08 Score=75.56 Aligned_cols=84 Identities=19% Similarity=0.271 Sum_probs=72.3 Q ss_pred CCCEEECCCCEEEEEEEEEEECCCCCEEEEECCCCCC-CCCCEEEEEECCCCCCCCCCCCC------CEEEEEEECCCC- Q ss_conf 8864866873299998889984998279986013431-12503423207877554236674------438999951686- Q gi|254780480|r 74 SGGTVLDANCVYIVPLMESLNLKNGISAYANPKSSIG-RIDVFARVIGDRSQEFDSISPNY------SGPLYLEILPRT- 145 (367) Q Consensus 74 ~~g~vLepg~~yLv~t~E~i~LP~di~a~~~pKSS~G-RLgif~r~i~~~~~~f~~id~Gy------~G~l~lEI~~~s- 145 (367) .+..++.|++..+++|-=.+.+|++.++.+.||||++ +-||.+--.+ ..||++| .|+|.+-+.|.+ T Consensus 34 ~e~~~i~pge~~lI~TGi~~~iP~G~~~~I~pRSgl~~K~gl~l~N~~------GvIDsdY~g~~~~~Gei~v~l~N~~~ 107 (147) T PRK13956 34 AEAVTIAPGEIKLVPTGVKAYMQPGEVLYLYDRSSNPRKKGLVLINSV------GVIDGDYYGNPANEGHIFAQMQNITD 107 (147) T ss_pred CCCEEECCCCEEEEECCEEEECCCCCEEEEECCCHHHHHCCEEECCCC------CCCCCCCCCCCCCCEEEEEEEEECCC T ss_conf 989699999999984897998799988999887445331677863724------66817654788873069999994799 Q ss_pred CCEEEECCCEEEEEEEEE Q ss_conf 527983398035689850 Q gi|254780480|r 146 FPIVVRVGSRLSQIRFVK 163 (367) Q Consensus 146 fpI~v~~G~rl~Qi~f~~ 163 (367) -++.+.+|+||+|++|.. T Consensus 108 ~~~~I~~GdRIAQlv~~p 125 (147) T PRK13956 108 QPVTLEVGERIVQGVFMP 125 (147) T ss_pred CCEEECCCCEEEEEEEEE T ss_conf 767978999899899868 No 18 >TIGR00576 dut dUTP diphosphatase; InterPro: IPR008181 Synonym(s): dUTP diphosphatase, Deoxyuridine-triphosphatase. The essential enzyme dUTP pyrophosphatase (3.6.1.23 from EC) is specific for dUTP and is critical for the fidelity of DNA replication and repair. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate, simultaneously reducing dUTP levels and providing the dUMP for dTTP biosynthesis. dUTPase decreases the intracellular concentration of dUPT so that uracil cannot be incorporated into DNA . The crystal structure of human dUTPase reveals that each subunit of the dUTPase trimer folds into an eight-stranded jelly-roll beta barrel, with the C-terminal beta strands interchanged among the subunits. The structure is similar to that of the Escherichia coli enzyme, despite low sequence homology between the two enzymes . Other enzymes like deoxycytidine triphosphate deaminase (dCTP) (3.5.4.13 from EC) that specifically bind uridine also belong to this group suggesting that the signature may recognise a putative uridine-binding motif. Some retroviruses encode dUTPases. Retroviral dUTPase is synthesised as part of POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, dUTPase and RNase H. ; GO: 0016787 hydrolase activity, 0046080 dUTP metabolic process. Probab=98.88 E-value=1.4e-08 Score=75.78 Aligned_cols=83 Identities=20% Similarity=0.314 Sum_probs=72.4 Q ss_pred CCEEECCCCEEEEEEEEEEECCCCCEEEEECCCCCC-CCCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCC--CCEEEE Q ss_conf 864866873299998889984998279986013431-12503423207877554236674438999951686--527983 Q gi|254780480|r 75 GGTVLDANCVYIVPLMESLNLKNGISAYANPKSSIG-RIDVFARVIGDRSQEFDSISPNYSGPLYLEILPRT--FPIVVR 151 (367) Q Consensus 75 ~g~vLepg~~yLv~t~E~i~LP~di~a~~~pKSS~G-RLgif~r~i~~~~~~f~~id~Gy~G~l~lEI~~~s--fpI~v~ 151 (367) +...+.|++-.|++|==.|.||..+.|++.|||++| |-+|.+ +.+++ .||+.|.|+|=.=+.|++ -+..+. T Consensus 34 ~P~~i~P~~~~Lv~TG~~~~lP~g~~g~v~PRSGLA~K~gv~i----~n~~G--vID~DYRGE~kV~L~N~gp~~~f~v~ 107 (151) T TIGR00576 34 DPVTIPPGERALVPTGIAIELPEGYYGRVAPRSGLALKHGVTI----DNSPG--VIDADYRGEIKVILINLGPKEDFTVK 107 (151) T ss_pred CCEEECCCCEEEECCCEEEECCCCCEEEEEECCCCCEEEEEEE----ECCCC--EEECCCCCCEEEEEEECCCCCCEEEC T ss_conf 8758789984875244299848994899730441330246899----66781--78178720268999858999867885 Q ss_pred CCCEEEEEEEEE Q ss_conf 398035689850 Q gi|254780480|r 152 VGSRLSQIRFVK 163 (367) Q Consensus 152 ~G~rl~Qi~f~~ 163 (367) +|+||+||.+-+ T Consensus 108 ~GdRIAQlv~~~ 119 (151) T TIGR00576 108 KGDRIAQLVVEK 119 (151) T ss_pred CCCEEEEEEEEE T ss_conf 797577688731 No 19 >pfam00692 dUTPase dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate. Probab=98.86 E-value=4.1e-08 Score=72.82 Aligned_cols=84 Identities=17% Similarity=0.237 Sum_probs=67.7 Q ss_pred EEEECCCCEEEEEEEEEEECCCCCEEEEEEECCCCCCEEEECCEEECCCCCCCCCCCCCCEEEEEECCCCCEEEECCCEE Q ss_conf 39988997899986334442834016445304333421342160456886654345667504788617888899479888 Q gi|254780480|r 249 QFILDPNEFYIFASREFIQIPPSLVAEMIPYDHSIGEFRVHYAGFFDPGFGCVLPQEVGAKAVLEVRSSVPFVLEHGQII 328 (367) Q Consensus 249 ~~~l~p~~F~l~~s~E~v~iP~~~aae~~~~~~~~g~~r~h~AGf~DpGf~~~~~~~~~~~~vLevr~~~p~~l~~g~~i 328 (367) .++|+|++..+..|-.++.+|+++.+.+.+.+...--+...-+|.+|+||.|.+. ..|.+.++.|+.++.|+|| T Consensus 23 ~~~i~p~~~~~i~tgi~~~ip~g~~~~i~~RSs~~~kgl~~~~GiID~dyrGei~------v~l~N~~~~~~~i~~G~Ri 96 (129) T pfam00692 23 DFTLPPGGTVLVPTDIRIPLPPGTFGLILGRSSLAAKGLIVLPGIIDSDYRGEIK------VVLFNSGKKDFTIKKGDRI 96 (129) T ss_pred CEEECCCCEEEEECCEEEECCCCEEEEEECCCCCHHCCCCCCCCEECCCCCCEEE------EEEEECCCCCEEEECCCEE T ss_conf 9799999989998998998899949999616610002664458788878985189------9999998999999179859 Q ss_pred EEEEEEECCC Q ss_conf 9999976376 Q gi|254780480|r 329 GRLKYESMDG 338 (367) Q Consensus 329 ~~l~~~~~~~ 338 (367) |||+|.+... T Consensus 97 aQli~~~~~~ 106 (129) T pfam00692 97 AQLILLPCVH 106 (129) T ss_pred EEEEEEECCC T ss_conf 9999842176 No 20 >PTZ00143 deoxyuridine 5'-triphosphate nucleotidohydrolase; Provisional Probab=98.78 E-value=6.6e-08 Score=71.42 Aligned_cols=85 Identities=24% Similarity=0.245 Sum_probs=65.2 Q ss_pred CCEEECCCCEEEEEEEEEEEC---------CCCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCC Q ss_conf 864866873299998889984---------99827998601343112503423207877554236674438999951686 Q gi|254780480|r 75 GGTVLDANCVYIVPLMESLNL---------KNGISAYANPKSSIGRIDVFARVIGDRSQEFDSISPNYSGPLYLEILPRT 145 (367) Q Consensus 75 ~g~vLepg~~yLv~t~E~i~L---------P~di~a~~~pKSS~GRLgif~r~i~~~~~~f~~id~Gy~G~l~lEI~~~s 145 (367) +..++.|++..+++|-=++.. ..++..++.||||++.-++... ++. ..||+||.|+|.+-+.|.+ T Consensus 35 ed~~i~p~~~~lI~tGIk~~~~~~~~~~~~g~~~~~~i~PRSslsK~~l~l~----N~v--GiIDsgYRGeI~v~l~N~~ 108 (155) T PTZ00143 35 EDQVLKAKSTTFIKLGIKAIALQYKSDGSDGKNVSYLLFPRSSISKTPLRLA----NSV--GLIDAGYRGELIAALDNIK 108 (155) T ss_pred CCEEECCCCEEEEEEEEEEEEEEECCCCCCCCCCEEEEECCCHHCCCCEEEC----CCC--CEECCCCCEEEEEEEEECC T ss_conf 6638899975999620289886413777567751379856763204578972----764--1785899668999999599 Q ss_pred -CCEEEECCCEEEEEEEEECC Q ss_conf -52798339803568985088 Q gi|254780480|r 146 -FPIVVRVGSRLSQIRFVKER 165 (367) Q Consensus 146 -fpI~v~~G~rl~Qi~f~~g~ 165 (367) -+..+.+|+||+||.+.... T Consensus 109 ~~~~~i~~GdRiaQlv~~~~~ 129 (155) T PTZ00143 109 DFEYTVKKGDRLVQLVSFTGE 129 (155) T ss_pred CCCEEECCCCEEEEEEECCCC T ss_conf 988897479679999831257 No 21 >COG0756 Dut dUTPase [Nucleotide transport and metabolism] Probab=98.77 E-value=6.5e-08 Score=71.50 Aligned_cols=83 Identities=22% Similarity=0.336 Sum_probs=71.5 Q ss_pred EEECCCCEEEEEEEEEEECCCCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCC-CCEEEECCCE Q ss_conf 486687329999888998499827998601343112503423207877554236674438999951686-5279833980 Q gi|254780480|r 77 TVLDANCVYIVPLMESLNLKNGISAYANPKSSIGRIDVFARVIGDRSQEFDSISPNYSGPLYLEILPRT-FPIVVRVGSR 155 (367) Q Consensus 77 ~vLepg~~yLv~t~E~i~LP~di~a~~~pKSS~GRLgif~r~i~~~~~~f~~id~Gy~G~l~lEI~~~s-fpI~v~~G~r 155 (367) ..+.||+..|++|-=.+.||+...|.+.|||++|.--= ++..++.+ .||+.|.|+|..-+.|.+ -|..+.+||| T Consensus 38 ~~i~pg~~~LVpTGl~~~ip~g~~~~v~PRSgla~k~G---i~~~Ns~G--~IDsDYrGei~V~l~N~~~~~f~ie~GdR 112 (148) T COG0756 38 VTIAPGERKLVPTGLAIELPEGYEAQVRPRSGLALKHG---ITLGNSPG--TIDSDYRGEIKVLLINLGDEDFVIEKGDR 112 (148) T ss_pred EEECCCCEEEECCCEEEECCCCCEEEEECCCCCCEECE---EEEECCCC--EECCCCCCEEEEEEEECCCCCEEECCCCE T ss_conf 68889973886178789858980899953665660032---89936786--57777875589999937999878627888 Q ss_pred EEEEEEEEC Q ss_conf 356898508 Q gi|254780480|r 156 LSQIRFVKE 164 (367) Q Consensus 156 l~Qi~f~~g 164 (367) |+|++|-.- T Consensus 113 IaQ~V~~~v 121 (148) T COG0756 113 IAQLVFVKV 121 (148) T ss_pred EEEEEEEEE T ss_conf 989999988 No 22 >PRK00601 dut deoxyuridine 5'-triphosphate nucleotidohydrolase; Provisional Probab=98.65 E-value=3.2e-07 Score=66.94 Aligned_cols=84 Identities=24% Similarity=0.340 Sum_probs=70.0 Q ss_pred CEEEECCCCEEEEEEEEEEECCCCCEEEEEEECCCC---CCEEEE-CCEEECCCCCCCCCCCCCCEEEEEECCCCCEEEE Q ss_conf 539988997899986334442834016445304333---421342-1604568866543456675047886178888994 Q gi|254780480|r 248 GQFILDPNEFYIFASREFIQIPPSLVAEMIPYDHSI---GEFRVH-YAGFFDPGFGCVLPQEVGAKAVLEVRSSVPFVLE 323 (367) Q Consensus 248 ~~~~l~p~~F~l~~s~E~v~iP~~~aae~~~~~~~~---g~~r~h-~AGf~DpGf~~~~~~~~~~~~vLevr~~~p~~l~ 323 (367) ..++|+|++-.+..|-=.+.+|+++.+...|.++.. | ..++ .+|-+|+||.|.+ ...|...++.||.++ T Consensus 37 ~~~vi~p~~~~~I~TGi~i~iP~g~~~~I~~RSsla~K~g-l~v~N~~GvID~dYrGei------~v~l~N~s~~~~~i~ 109 (149) T PRK00601 37 EPVTLAPGETALVPTGLAIHIPDGYAAQILPRSGLAHKHG-IVLGNLPGTIDSDYRGEL------MVSLWNRGQEPFTIE 109 (149) T ss_pred CCEEECCCCEEEECCCEEEECCCCCEEEEEECCCCCCCCC-EECCCCCCCCCCCCCCEE------EEEEEECCCCCEEEC T ss_conf 9788899988997189799879993999985688330043-410586534278888548------999998999877857 Q ss_pred CCCEEEEEEEEECCC Q ss_conf 798889999976376 Q gi|254780480|r 324 HGQIIGRLKYESMDG 338 (367) Q Consensus 324 ~g~~i~~l~~~~~~~ 338 (367) .|++||||+|.+... T Consensus 110 ~GdRIaQlv~~~~~~ 124 (149) T PRK00601 110 PGERIAQLVIVPVVQ 124 (149) T ss_pred CCCEEEEEEEEECCC T ss_conf 999899999832466 No 23 >TIGR00576 dut dUTP diphosphatase; InterPro: IPR008181 Synonym(s): dUTP diphosphatase, Deoxyuridine-triphosphatase. The essential enzyme dUTP pyrophosphatase (3.6.1.23 from EC) is specific for dUTP and is critical for the fidelity of DNA replication and repair. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate, simultaneously reducing dUTP levels and providing the dUMP for dTTP biosynthesis. dUTPase decreases the intracellular concentration of dUPT so that uracil cannot be incorporated into DNA . The crystal structure of human dUTPase reveals that each subunit of the dUTPase trimer folds into an eight-stranded jelly-roll beta barrel, with the C-terminal beta strands interchanged among the subunits. The structure is similar to that of the Escherichia coli enzyme, despite low sequence homology between the two enzymes . Other enzymes like deoxycytidine triphosphate deaminase (dCTP) (3.5.4.13 from EC) that specifically bind uridine also belong to this group suggesting that the signature may recognise a putative uridine-binding motif. Some retroviruses encode dUTPases. Retroviral dUTPase is synthesised as part of POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, dUTPase and RNase H. ; GO: 0016787 hydrolase activity, 0046080 dUTP metabolic process. Probab=98.59 E-value=5.7e-07 Score=65.31 Aligned_cols=85 Identities=16% Similarity=0.235 Sum_probs=76.8 Q ss_pred CEEEECCCCEEEEEEEEEEECCCCCEEEEEEECCCCCCEE---EECCEEECCCCCCCCCCCCCCEEEEEECC-CCCEEEE Q ss_conf 5399889978999863344428340164453043334213---42160456886654345667504788617-8888994 Q gi|254780480|r 248 GQFILDPNEFYIFASREFIQIPPSLVAEMIPYDHSIGEFR---VHYAGFFDPGFGCVLPQEVGAKAVLEVRS-SVPFVLE 323 (367) Q Consensus 248 ~~~~l~p~~F~l~~s~E~v~iP~~~aae~~~~~~~~g~~r---~h~AGf~DpGf~~~~~~~~~~~~vLevr~-~~p~~l~ 323 (367) +.+.|+|++=.|..|==.|.||..+.|-++|.++..--.- .--||-||.++.|.+ +.+|-+.+ +.+|.++ T Consensus 34 ~P~~i~P~~~~Lv~TG~~~~lP~g~~g~v~PRSGLA~K~gv~i~n~~GvID~DYRGE~------kV~L~N~gp~~~f~v~ 107 (151) T TIGR00576 34 DPVTIPPGERALVPTGIAIELPEGYYGRVAPRSGLALKHGVTIDNSPGVIDADYRGEI------KVILINLGPKEDFTVK 107 (151) T ss_pred CCEEECCCCEEEECCCEEEECCCCCEEEEEECCCCCEEEEEEEECCCCEEECCCCCCE------EEEEEECCCCCCEEEC T ss_conf 8758789984875244299848994899730441330246899667817817872026------8999858999867885 Q ss_pred CCCEEEEEEEEECCC Q ss_conf 798889999976376 Q gi|254780480|r 324 HGQIIGRLKYESMDG 338 (367) Q Consensus 324 ~g~~i~~l~~~~~~~ 338 (367) .|+|||||+++|... T Consensus 108 ~GdRIAQlv~~~~~~ 122 (151) T TIGR00576 108 KGDRIAQLVVEKIVQ 122 (151) T ss_pred CCCEEEEEEEEEEEE T ss_conf 797577688731568 No 24 >pfam06559 DCD 2'-deoxycytidine 5'-triphosphate deaminase (DCD). This family consists of several bacterial 2'-deoxycytidine 5'-triphosphate deaminase proteins (EC:3.5.4.13). Probab=98.45 E-value=7.9e-06 Score=57.89 Aligned_cols=152 Identities=18% Similarity=0.274 Sum_probs=99.7 Q ss_pred CCCCCHHHHHHHHHCCCEEECCCCCHHHCCCCEEEEEEC--C--EEEEEECCCCCCCCC-CHH-HHCCC-CEEEEEECCC Q ss_conf 761586899999987984981789865657122588758--9--899981231578886-211-00024-0478873688 Q gi|254780480|r 3 KGVLPDKAIATLAERGDIVSEHPLEKNQIQPASIDLRLS--S--KAYRVRASFLPNIDG-LVS-DKIKH-FKLREIDLSG 75 (367) Q Consensus 3 ~GILsd~~I~~li~~g~I~i~~p~~~~QIQPaSiDLRLG--~--~~~rv~asflp~~~~-~v~-~~l~~-~~~~eidl~~ 75 (367) +-.|+|.+++++-++..|+...+.-++- =+-|+||.-. + ..||-+.+. +-.|- ++. -+..+ .+......+. T Consensus 171 ~~~lsd~~L~~lh~~~~lv~~~~~i~~G-l~~SVDL~~~~~~~iiGYRAKkht-~~IDldki~~y~~~dfWepi~~~~~~ 248 (366) T pfam06559 171 EATLSDAELAALHAREPLVDGAPVIDGG-LALSVDLAGEGGDGLVGYRAKRHT-GVIDLDRIGAYDPADFWEPLHARDGR 248 (366) T ss_pred CCCCCHHHHHHHHHHCCCCCCCCCCCCC-EEEEEECCCCCCCCEEEEEECCCC-CEEEECCCCCCCHHHHHCEEECCCCC T ss_conf 8767999999988548775787213688-179998557888864899825899-74874474658755601046528998 Q ss_pred CEEECCCCEEEEEEEEEEECCCCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCEE--------EEEEECCCCCC Q ss_conf 6486687329999888998499827998601343112503423207877554236674438--------99995168652 Q gi|254780480|r 76 GTVLDANCVYIVPLMESLNLKNGISAYANPKSSIGRIDVFARVIGDRSQEFDSISPNYSGP--------LYLEILPRTFP 147 (367) Q Consensus 76 g~vLepg~~yLv~t~E~i~LP~di~a~~~pKSS~GRLgif~r~i~~~~~~f~~id~Gy~G~--------l~lEI~~~sfp 147 (367) ..+|+|++||+-...|.|.+|+|.+|-..|=.. +.|=| =++| ++| .||||-.. -.||+-++..| T Consensus 249 ~liLdp~eFYIL~SkE~V~IPp~~AAEM~pyd~--~vGeF---RvHY-AGF--FDPGFG~~~~gg~GsraVLEVRs~evP 320 (366) T pfam06559 249 ELILDPGEFYILVSREAVHIPPDYAAEMAPFDP--LVGEF---RVHY-AGF--FDPGFGHAAAGGAGSRAVLEVRSHEVP 320 (366) T ss_pred EEEECCCCEEEEEECCEEEECCCHHEECCCCCC--CCCCE---EEEE-CCC--CCCCCCCCCCCCCCCCEEEEEECCCCC T ss_conf 599778867999754336747331012524323--56641---2134-231--258878766677774079998348787 Q ss_pred EEEECCCEEEEEEEEEC Q ss_conf 79833980356898508 Q gi|254780480|r 148 IVVRVGSRLSQIRFVKE 164 (367) Q Consensus 148 I~v~~G~rl~Qi~f~~g 164 (367) -.+.-|+.+|.|++-+= T Consensus 321 F~ledgQ~v~rL~yE~m 337 (366) T pfam06559 321 FILEHGQIVGRLVYERM 337 (366) T ss_pred EEEECCCEEEEEEEEEC T ss_conf 67667978899998770 No 25 >COG0756 Dut dUTPase [Nucleotide transport and metabolism] Probab=98.40 E-value=4.1e-06 Score=59.70 Aligned_cols=85 Identities=19% Similarity=0.266 Sum_probs=71.4 Q ss_pred EEECCCCEEEEEEEEEEECCCCCEEEEEEECCCCCCEEEE---CCEEECCCCCCCCCCCCCCEEEEEECCCCCEEEECCC Q ss_conf 9988997899986334442834016445304333421342---1604568866543456675047886178888994798 Q gi|254780480|r 250 FILDPNEFYIFASREFIQIPPSLVAEMIPYDHSIGEFRVH---YAGFFDPGFGCVLPQEVGAKAVLEVRSSVPFVLEHGQ 326 (367) Q Consensus 250 ~~l~p~~F~l~~s~E~v~iP~~~aae~~~~~~~~g~~r~h---~AGf~DpGf~~~~~~~~~~~~vLevr~~~p~~l~~g~ 326 (367) +.|.|++..|..|-=.+.||..+++-++|.++..-...+. -.|-+|+.+.|.+ ...|-++++.||.++.|+ T Consensus 38 ~~i~pg~~~LVpTGl~~~ip~g~~~~v~PRSgla~k~Gi~~~Ns~G~IDsDYrGei------~V~l~N~~~~~f~ie~Gd 111 (148) T COG0756 38 VTIAPGERKLVPTGLAIELPEGYEAQVRPRSGLALKHGITLGNSPGTIDSDYRGEI------KVLLINLGDEDFVIEKGD 111 (148) T ss_pred EEECCCCEEEECCCEEEECCCCCEEEEECCCCCCEECEEEEECCCCEECCCCCCEE------EEEEEECCCCCEEECCCC T ss_conf 68889973886178789858980899953665660032899367865777787558------999993799987862788 Q ss_pred EEEEEEEEECCCCC Q ss_conf 88999997637664 Q gi|254780480|r 327 IIGRLKYESMDGEP 340 (367) Q Consensus 327 ~i~~l~~~~~~~~p 340 (367) |||||+|.|..... T Consensus 112 RIaQ~V~~~v~~~~ 125 (148) T COG0756 112 RIAQLVFVKVLQAE 125 (148) T ss_pred EEEEEEEEEEEECC T ss_conf 89899999887545 No 26 >PRK13956 dut deoxyuridine 5'-triphosphate nucleotidohydrolase; Provisional Probab=98.37 E-value=4.2e-06 Score=59.69 Aligned_cols=88 Identities=16% Similarity=0.164 Sum_probs=68.9 Q ss_pred EEEECCCCEEEEEEEEEEECCCCCEEEEEEECCCC---CCEEEECCEEECCCCCCCCCCCCCC-EEEEEECCCCCEEEEC Q ss_conf 39988997899986334442834016445304333---4213421604568866543456675-0478861788889947 Q gi|254780480|r 249 QFILDPNEFYIFASREFIQIPPSLVAEMIPYDHSI---GEFRVHYAGFFDPGFGCVLPQEVGA-KAVLEVRSSVPFVLEH 324 (367) Q Consensus 249 ~~~l~p~~F~l~~s~E~v~iP~~~aae~~~~~~~~---g~~r~h~AGf~DpGf~~~~~~~~~~-~~vLevr~~~p~~l~~ 324 (367) ..+|.|++-.+..|-=.+.||+.+++.+.|.++.. |......+|-+|.+|.+... ..|- ...|...++.||.++. T Consensus 36 ~~~i~pge~~lI~TGi~~~iP~G~~~~I~pRSgl~~K~gl~l~N~~GvIDsdY~g~~~-~~Gei~v~l~N~~~~~~~I~~ 114 (147) T PRK13956 36 AVTIAPGEIKLVPTGVKAYMQPGEVLYLYDRSSNPRKKGLVLINSVGVIDGDYYGNPA-NEGHIFAQMQNITDQPVTLEV 114 (147) T ss_pred CEEECCCCEEEEECCEEEECCCCCEEEEECCCHHHHHCCEEECCCCCCCCCCCCCCCC-CCEEEEEEEEECCCCCEEECC T ss_conf 9699999999984897998799988999887445331677863724668176547888-730699999947997679789 Q ss_pred CCEEEEEEEEECC Q ss_conf 9888999997637 Q gi|254780480|r 325 GQIIGRLKYESMD 337 (367) Q Consensus 325 g~~i~~l~~~~~~ 337 (367) |+|||||+|.+.. T Consensus 115 GdRIAQlv~~p~~ 127 (147) T PRK13956 115 GERIVQGVFMPFL 127 (147) T ss_pred CCEEEEEEEEEEE T ss_conf 9989989986899 No 27 >PRK07559 2'-deoxycytidine 5'-triphosphate deaminase; Provisional Probab=98.33 E-value=1.9e-05 Score=55.35 Aligned_cols=150 Identities=18% Similarity=0.288 Sum_probs=98.0 Q ss_pred CCCCHHHHHHHHHCCCEEECCCCCHHHC---CCCEEEEEECC---EEEEEECCCCCCCC-CCHH-HHCCC-CEEEEEECC Q ss_conf 6158689999998798498178986565---71225887589---89998123157888-6211-00024-047887368 Q gi|254780480|r 4 GVLPDKAIATLAERGDIVSEHPLEKNQI---QPASIDLRLSS---KAYRVRASFLPNID-GLVS-DKIKH-FKLREIDLS 74 (367) Q Consensus 4 GILsd~~I~~li~~g~I~i~~p~~~~QI---QPaSiDLRLG~---~~~rv~asflp~~~-~~v~-~~l~~-~~~~eidl~ 74 (367) -.|+|.+++++-++..|+... ....| =.-||||.-.. ..||-|.+. +-.| .++. -+..+ .+...-..+ T Consensus 171 ~~lsd~eL~~lh~~~~Lv~~~--~~~~i~~Gl~lSVDLsg~~~~iiGYRAKkht-~~IDldKi~~y~~~dFWEpI~~~~~ 247 (366) T PRK07559 171 AVLDEAELAALHAAEPLVDGD--DPNISGGGIALSVDLSGGEDGLVGYRAKRHT-GVVDVDRRGGHDVRDFWEPLYARGD 247 (366) T ss_pred CCCCHHHHHHHHHHCCCCCCC--CCCCCCCCEEEEEEECCCCCCEEEEEECCCC-CEEEECCCCCCCHHHHHCEEEECCC T ss_conf 766999999876437764576--5543468659999932688964678823799-6377558677886882101013789 Q ss_pred CCEEECCCCEEEEEEEEEEECCCCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCEE--------EEEEECCCCC Q ss_conf 86486687329999888998499827998601343112503423207877554236674438--------9999516865 Q gi|254780480|r 75 GGTVLDANCVYIVPLMESLNLKNGISAYANPKSSIGRIDVFARVIGDRSQEFDSISPNYSGP--------LYLEILPRTF 146 (367) Q Consensus 75 ~g~vLepg~~yLv~t~E~i~LP~di~a~~~pKSS~GRLgif~r~i~~~~~~f~~id~Gy~G~--------l~lEI~~~sf 146 (367) ...+|+|++||+-...|.|.+|++.+|-..|=.. +.|=| -++| ++| .||||-.. -.||+-++.. T Consensus 248 ~~LILdp~eFYIL~SkE~V~iP~~~AaEM~pyd~--~~Gef---RvHY-AGF--FDPGFG~~~~g~~gsraVLEVRs~ev 319 (366) T PRK07559 248 GELILDPGEFYILASREAVHVPPDYAAEMVPFDP--LVGEF---RVHY-AGF--FDPGFGHAEAGGTGSRAVLEVRSHEV 319 (366) T ss_pred CEEEECCCCEEEEEECCEEEECCCHHHHCCCCCC--CCCCE---EEEE-CCC--CCCCCCCCCCCCCCCEEEEEEECCCC T ss_conf 8799737867999634436657741431335444--66634---4445-021--25877877677877328999834878 Q ss_pred CEEEECCCEEEEEEEEEC Q ss_conf 279833980356898508 Q gi|254780480|r 147 PIVVRVGSRLSQIRFVKE 164 (367) Q Consensus 147 pI~v~~G~rl~Qi~f~~g 164 (367) |-.+.-|+.+|.|++.+= T Consensus 320 PF~le~gQ~v~rl~ye~m 337 (366) T PRK07559 320 PFILEHGQIVGRLVYERM 337 (366) T ss_pred CEEEECCCEEEEEEEEEC T ss_conf 758768978899998870 No 28 >PTZ00143 deoxyuridine 5'-triphosphate nucleotidohydrolase; Provisional Probab=97.47 E-value=0.0018 Score=42.48 Aligned_cols=54 Identities=17% Similarity=0.224 Sum_probs=38.5 Q ss_pred EEECCCCCCEEEE-CCEEECCCCCCCCCCCCCCEEEEEECCCCCEEEECCCEEEEEEEEECC Q ss_conf 5304333421342-160456886654345667504788617888899479888999997637 Q gi|254780480|r 277 IPYDHSIGEFRVH-YAGFFDPGFGCVLPQEVGAKAVLEVRSSVPFVLEHGQIIGRLKYESMD 337 (367) Q Consensus 277 ~~~~~~~g~~r~h-~AGf~DpGf~~~~~~~~~~~~vLevr~~~p~~l~~g~~i~~l~~~~~~ 337 (367) ||+...-| ++.. ..|-+|+|+.|.+ .+.|-..++.+|.++.|+|||||++.+.. T Consensus 75 RSslsK~~-l~l~N~vGiIDsgYRGeI------~v~l~N~~~~~~~i~~GdRiaQlv~~~~~ 129 (155) T PTZ00143 75 RSSISKTP-LRLANSVGLIDAGYRGEL------IAALDNIKDFEYTVKKGDRLVQLVSFTGE 129 (155) T ss_pred CCHHCCCC-EEECCCCCEECCCCCEEE------EEEEEECCCCCEEECCCCEEEEEEECCCC T ss_conf 76320457-897276417858996689------99999599988897479679999831257 No 29 >KOG3370 consensus Probab=94.43 E-value=0.15 Score=29.95 Aligned_cols=79 Identities=18% Similarity=0.265 Sum_probs=51.0 Q ss_pred EECCCCEEEEEEEEEEECCCCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCC-CCEEEECCCEE Q ss_conf 86687329999888998499827998601343112503423207877554236674438999951686-52798339803 Q gi|254780480|r 78 VLDANCVYIVPLMESLNLKNGISAYANPKSSIGRIDVFARVIGDRSQEFDSISPNYSGPLYLEILPRT-FPIVVRVGSRL 156 (367) Q Consensus 78 vLepg~~yLv~t~E~i~LP~di~a~~~pKSS~GRLgif~r~i~~~~~~f~~id~Gy~G~l~lEI~~~s-fpI~v~~G~rl 156 (367) ...|.+-.++.|-=.+.+|....+++-|||-++-- -++.+- .+ -||+.|.|++-.-+.|.| -.-.+..|+|+ T Consensus 34 ~vpa~gk~~V~td~q~~vP~g~ygRvaprsglA~k-~~I~~g----ag--vVd~dyrgeV~v~LfN~~~~~F~~k~Gdri 106 (140) T KOG3370 34 TVPARGKAVVDTDLQIAVPSGYYGRVAPRSGLAWK-HFIDVG----AG--VVDPDYRGEVGVLLFNHSDRDFEYKKGDRI 106 (140) T ss_pred CCCCCCCEECCCCCEEECCCCEEEEECCCCCCHHH-CCCCCC----CC--EECCCCCCEEEEEEECCCCCCEEEECCCCC T ss_conf 22766543404662363472000013454331022-056546----73--355454422679986589863455128821 Q ss_pred EEEEEEE Q ss_conf 5689850 Q gi|254780480|r 157 SQIRFVK 163 (367) Q Consensus 157 ~Qi~f~~ 163 (367) +||.+-+ T Consensus 107 aqli~~~ 113 (140) T KOG3370 107 AQLIVEK 113 (140) T ss_pred EEEEEEE T ss_conf 0159996 No 30 >KOG3370 consensus Probab=93.64 E-value=0.16 Score=29.71 Aligned_cols=78 Identities=26% Similarity=0.279 Sum_probs=50.2 Q ss_pred EEEEEEEEEECCCCCEEEE-------EEECC------CCC----CEEEECCEEECCCCCCCCCCCCCCEEEEEECCCCCE Q ss_conf 9998633444283401644-------53043------334----213421604568866543456675047886178888 Q gi|254780480|r 258 YIFASREFIQIPPSLVAEM-------IPYDH------SIG----EFRVHYAGFFDPGFGCVLPQEVGAKAVLEVRSSVPF 320 (367) Q Consensus 258 ~l~~s~E~v~iP~~~aae~-------~~~~~------~~g----~~r~h~AGf~DpGf~~~~~~~~~~~~vLevr~~~p~ 320 (367) |.+.|.+..++|+...|-. +|... |.| .|.-.-||-+||.+.+.. ..+|-.-++..| T Consensus 25 ydl~sa~~~~vpa~gk~~V~td~q~~vP~g~ygRvaprsglA~k~~I~~gagvVd~dyrgeV------~v~LfN~~~~~F 98 (140) T KOG3370 25 YDLYSAQDGTVPARGKAVVDTDLQIAVPSGYYGRVAPRSGLAWKHFIDVGAGVVDPDYRGEV------GVLLFNHSDRDF 98 (140) T ss_pred CCHHHHCCCCCCCCCCEECCCCCEEECCCCEEEEECCCCCCHHHCCCCCCCCEECCCCCCEE------EEEEECCCCCCE T ss_conf 24022104522766543404662363472000013454331022056546733554544226------799865898634 Q ss_pred EEECCCEEEEEEEEECCCCCCC Q ss_conf 9947988899999763766422 Q gi|254780480|r 321 VLEHGQIIGRLKYESMDGEPER 342 (367) Q Consensus 321 ~l~~g~~i~~l~~~~~~~~p~~ 342 (367) ..+.|.+|+||++|+... |+. T Consensus 99 ~~k~Gdriaqli~~~i~~-~~i 119 (140) T KOG3370 99 EYKKGDRIAQLIVEKIVT-PEI 119 (140) T ss_pred EEECCCCCEEEEEEECCC-CCE T ss_conf 551288210159996479-761 No 31 >TIGR02772 Ku_bact Ku protein; InterPro: IPR009187 This superfamily consists of prokaryotic Ku domain containing proteins. In the eukaryotes it has been shown that the Ku protein is involved in repairing DNA double-strand breaks by non-homologous end-joining , . The Ku protein is a heterodimer of approximately 70 kDa and 80 kDa subunits . Both these subunits have strong sequence similarity and it has been suggested that they may have evolved by gene duplication from a homodimeric ancestor in eukaryotes . The prokaryotic Ku members are homodimers and they have been predicted to be involved in the DNA repair system, which is mechanistically similar to eukaryotic non-homologous end joining , . Recent findings have implicated yeast Ku in telomeric structure maintenance in addition to on-homologous end-joining. Some of the phenotypes of the Ku-knockout mice may indicate a similar role for Ku at mammalian telomeres . Evolutionary notes: With current available phyletic information it is difficult to determine the correct evolutionary trajectory of the Ku domain. It is possible that the core Ku domain was present in bacteria and archaea even before the presence of the eukaryotes. Eukaryotes might have vertically inherited the Ku-core protein, from a common ancestor shared with a certain archaeal lineage or through horizontal transfer from bacteria. Alternatively, the core Ku domain could have evolved in the eukaryotic lineage and then horizontally transferred to the prokaryotes. Sequencing of additional archaeal genomes and those of early-branching eukaryotes may help resolving the evolutionary history of the Ku domain. Structure notes: The eukaryotic Ku heterodimer comprises an alpha/beta N-terminal, a central beta-barrel domain and a helical C-terminal arm . Structural analysis of the Ku70/80 heterodimer bound to DNA indicate that subunit contacts lead to the formation of a highly charged channel through which the DNA passes without making any contacts with the DNA bases . For additional information please see .. Probab=65.57 E-value=3.9 Score=20.73 Aligned_cols=32 Identities=13% Similarity=0.334 Sum_probs=28.5 Q ss_pred CCCCCEEEEEEECCCCCCEEEECCCE-EEEEEEEE Q ss_conf 66744389999516865279833980-35689850 Q gi|254780480|r 130 SPNYSGPLYLEILPRTFPIVVRVGSR-LSQIRFVK 163 (367) Q Consensus 130 d~Gy~G~l~lEI~~~sfpI~v~~G~r-l~Qi~f~~ 163 (367) -|=|+|.|=+=++ |+||+||+.++ =-.|+|+. T Consensus 2 Ra~WKG~isfgLV--~~PV~Ly~AT~~~~~i~F~~ 34 (271) T TIGR02772 2 RAIWKGAISFGLV--NIPVKLYPATESSEDISFHL 34 (271) T ss_pred CCCEEEEEEEEEE--EEEEEEEECCCCCCCCCCCC T ss_conf 9831344454445--64157620512234555342 No 32 >pfam08758 Cadherin_pro Cadherin prodomain like. Cadherins are a family of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This domain corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions. Probab=58.85 E-value=7.8 Score=18.73 Aligned_cols=28 Identities=39% Similarity=0.642 Sum_probs=14.6 Q ss_pred CCCEEEECCCEEEEEEEEECCCCCCCCC Q ss_conf 8888994798889999976376642223 Q gi|254780480|r 317 SVPFVLEHGQIIGRLKYESMDGEPERLY 344 (367) Q Consensus 317 ~~p~~l~~g~~i~~l~~~~~~~~p~~~Y 344 (367) .||-.|+.||.+++..|+--...+...| T Consensus 14 ~Vp~~l~~g~~lg~V~F~~C~g~~~i~y 41 (90) T pfam08758 14 LVPRRLEEGQPLGRVKFNDCKGRRRIQY 41 (90) T ss_pred EECCCCCCCCEEEEEEEEECCCCCCEEE T ss_conf 9577513897977999310689864478 No 33 >cd00789 KU_like Ku-core domain, Ku-like subfamily; composed of prokaryotic homologs of the eukaryotic DNA binding protein Ku. The alignment includes the core domain shared by the prokaryotic YkoV-like proteins and the eukaryotic Ku70 and Ku80. The prokaryotic Ku homologs are predicted to form homodimers. It is proposed that the Ku homologs are functionally associated with ATP-dependent DNA ligase and the eukaryotic-type primase, probably as components of a double-strand break repair system. Probab=52.72 E-value=12 Score=17.53 Aligned_cols=32 Identities=13% Similarity=0.352 Sum_probs=26.3 Q ss_pred CCCCEEEEEEECCCCCCEEEECCCEEEEEEEEEC Q ss_conf 6744389999516865279833980356898508 Q gi|254780480|r 131 PNYSGPLYLEILPRTFPIVVRVGSRLSQIRFVKE 164 (367) Q Consensus 131 ~Gy~G~l~lEI~~~sfpI~v~~G~rl~Qi~f~~g 164 (367) +-|+|.|..-++ |.||++|+...=.-|+|+.= T Consensus 2 aiWkG~isfGLV--~iPV~Ly~At~~~~i~F~~l 33 (256) T cd00789 2 AIWKGAISFGLV--NIPVKLYSATESEDISFHQL 33 (256) T ss_pred CCCEEEEEEEEE--EEEEEEEECCCCCCCCCEEE T ss_conf 853256788668--43069784444787651574 No 34 >TIGR02883 spore_cwlD N-acetylmuramoyl-L-alanine amidase CwlD; InterPro: IPR014234 This entry represents the germination-specific N-acetylmuramoyl-L-alanine amidase CwlD. CwlD from Bacillus subtilis is required, along with the putative deactylase PdaA, to make muramic delta-lactam, a novel peptidoglycan constituent found only in bacterial endospores. CwlD mutants show a germination defect.. Probab=45.91 E-value=9.6 Score=18.16 Aligned_cols=67 Identities=19% Similarity=0.249 Sum_probs=28.2 Q ss_pred CCCCCCEE-------EEEEECCCCCCEEEECCCEEEEEEEEECC--------CCCCHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 36674438-------99995168652798339803568985088--------7689999999875278603665544337 Q gi|254780480|r 129 ISPNYSGP-------LYLEILPRTFPIVVRVGSRLSQIRFVKER--------KFCSQEELSALHELSPLVQDGLRDFSGK 193 (367) Q Consensus 129 id~Gy~G~-------l~lEI~~~sfpI~v~~G~rl~Qi~f~~g~--------~~LsD~eL~~lh~~~~LV~~~~~~~~~~ 193 (367) +|+|=.|. |+|+|+-.--..+=..|-++-=. |... ..++..-.+-|.++..++.....+ T Consensus 12 ~DGGA~~~~G~~EKDi~L~ia~~lk~~L~~~Ga~V~mT--Re~D~dLa~~~t~~~~~rK~eDL~~Rv~~~~~~~aD---- 85 (192) T TIGR02883 12 IDGGAVGKDGTLEKDITLEIALKLKDYLQEQGALVVMT--REDDKDLASEGTKGYSRRKIEDLRKRVKLINESEAD---- 85 (192) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEE--ECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCC---- T ss_conf 87287367785113664799999999998569789885--227711145001245678999999999985078986---- Q ss_pred CEEEEEEEC Q ss_conf 316788814 Q gi|254780480|r 194 GVALSVDLK 202 (367) Q Consensus 194 Gl~~sVdL~ 202 (367) +.+|++|- T Consensus 86 -~FiSIHlN 93 (192) T TIGR02883 86 -LFISIHLN 93 (192) T ss_pred -EEEECCCC T ss_conf -88652201 No 35 >PRK11637 hypothetical protein; Provisional Probab=44.86 E-value=22 Score=15.75 Aligned_cols=57 Identities=30% Similarity=0.344 Sum_probs=29.8 Q ss_pred CCCEEEECCEEECCCCCCCCCCCCCCE-EEEEECC-----CCCEEEECCCEEEEEEEEECCCCC Q ss_conf 342134216045688665434566750-4788617-----888899479888999997637664 Q gi|254780480|r 283 IGEFRVHYAGFFDPGFGCVLPQEVGAK-AVLEVRS-----SVPFVLEHGQIIGRLKYESMDGEP 340 (367) Q Consensus 283 ~g~~r~h~AGf~DpGf~~~~~~~~~~~-~vLevr~-----~~p~~l~~g~~i~~l~~~~~~~~p 340 (367) +..++|.||||+ +|||..+--.-|+- .+|.--. .+-=.+..||.||.+=.--.++.| T Consensus 321 v~~G~Vv~a~~~-~gyG~~ViIdHG~g~~TlYah~s~l~v~~Gq~V~~Gq~Ig~vG~tG~s~g~ 383 (404) T PRK11637 321 IADGRVILADWL-QGYGLVVVVEHGKGDMSLYGYNQSALVSVGAQVRAGQPIALVGSSGGQGRP 383 (404) T ss_pred ECCEEEEEEEEC-CCCCCEEEEECCCCCEEECCCCCCCCCCCCCEECCCCEEEEEECCCCCCCC T ss_conf 017699991140-888857999869946571528895888997998999969873268999996 No 36 >pfam05784 Herpes_UL82_83 Betaherpesvirus UL82/83 protein N terminus. This family represents the N terminal region of the Betaherpesvirus UL82 and UL83 proteins. As viruses are reliant upon their host cell to serve as proper environments for their replication, many have evolved mechanisms to alter intracellular conditions to suit their own needs. Human cytomegalovirus induces quiescent cells to enter the cell cycle and then arrests them in late G(1), before they enter the S phase, a cell cycle compartment that is presumably favourable for viral replication. The protein product of the human cytomegalovirus UL82 gene, pp71, can accelerate the movement of cells through the G(1) phase of the cell cycle. This activity would help infected cells reach the late G(1) arrest point sooner and thus may stimulate the infectious cycle. pp71 also induces DNA synthesis in quiescent cells, but a pp71 mutant protein that is unable to induce quiescent cells to enter the cell cycle still retains the abilit Probab=44.00 E-value=23 Score=15.67 Aligned_cols=90 Identities=21% Similarity=0.257 Sum_probs=46.3 Q ss_pred CHHHHHHHHCCCCCCCCEEEECCCCEEEEEEEEEE-ECCCCCEEEEEEECCCCCCEEEECCEEECCCC----CCCC-CCC Q ss_conf 87786643113557775399889978999863344-42834016445304333421342160456886----6543-456 Q gi|254780480|r 232 DESDFWDPLFSWDAPEGQFILDPNEFYIFASREFI-QIPPSLVAEMIPYDHSIGEFRVHYAGFFDPGF----GCVL-PQE 305 (367) Q Consensus 232 ~~~dfwe~i~~~~~~~~~~~l~p~~F~l~~s~E~v-~iP~~~aae~~~~~~~~g~~r~h~AGf~DpGf----~~~~-~~~ 305 (367) .++.|..+-. .+|--|+-|..+.|-..+++. .+.+-+.+ .-.|+|+|=|-- ..+. ... T Consensus 246 nP~P~L~~~~----~NGftV~~Pr~~~l~~g~~~~v~i~~~feS------------~~~~~~lF~P~~ipGlsi~~~~w~ 309 (346) T pfam05784 246 NPEPFLRPHP----RNGFTVLCPKNLHLKPGQTSHVMIDNAFES------------DQTHAGLFFPKDIPGLSISCGPWM 309 (346) T ss_pred CCCCCCCCCC----CCCEEEECCCCEECCCCCEEEEEEEEEEEC------------CCCEEEEEECCCCCCCEEEECCCC T ss_conf 9997321179----997899788756608998699999748951------------881699992655799776732158 Q ss_pred CCCEEEEEECC-CCCEEEECCCEEEEEEEEECC Q ss_conf 67504788617-888899479888999997637 Q gi|254780480|r 306 VGAKAVLEVRS-SVPFVLEHGQIIGRLKYESMD 337 (367) Q Consensus 306 ~~~~~vLevr~-~~p~~l~~g~~i~~l~~~~~~ 337 (367) .+..--+|+|+ .....|+.+|.+|.+-|++=+ T Consensus 310 ~~~~L~i~i~a~~~~v~i~~~q~LG~lhFf~r~ 342 (346) T pfam05784 310 ERQPLFIELRATKESVRIEHMQPLGELHFFPRD 342 (346) T ss_pred CCCEEEEEEEECCCCEECCCCCEEEEEEEEECC T ss_conf 997489999843554102378753489985155 No 37 >pfam01834 XRCC1_N XRCC1 N terminal domain. Probab=41.11 E-value=12 Score=17.47 Aligned_cols=86 Identities=21% Similarity=0.246 Sum_probs=48.4 Q ss_pred EEEEECCCCCEEEEE---CCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCC-C--CEEEECCCEEEEEEEEE Q ss_conf 889984998279986---01343112503423207877554236674438999951686-5--27983398035689850 Q gi|254780480|r 90 MESLNLKNGISAYAN---PKSSIGRIDVFARVIGDRSQEFDSISPNYSGPLYLEILPRT-F--PIVVRVGSRLSQIRFVK 163 (367) Q Consensus 90 ~E~i~LP~di~a~~~---pKSS~GRLgif~r~i~~~~~~f~~id~Gy~G~l~lEI~~~s-f--pI~v~~G~rl~Qi~f~~ 163 (367) ..+|.+-|.=+|+++ +||+.|+-- -+. =|.++.+.| | |+--+-|.-.++.|+|. T Consensus 54 I~~IDIGN~gSAfIEVlVgrS~~~~~~---------------~~~-----~f~~Ll~~ssfMsp~eSkn~~n~nrVR~F~ 113 (152) T pfam01834 54 IHSVDIGNDGSAFVEVLVGSSAGGGTA---------------GEQ-----DYEVLLVTSSFMSPSESRSGSNPNRVRMFG 113 (152) T ss_pred EEEEEECCCCEEEEEEEEEECCCCCCC---------------CCC-----CEEEEEEEECCCCHHHHCCCCCCCCEEEEC T ss_conf 646973588708999997404688876---------------777-----706897430244866841678876058867 Q ss_pred CCCCCCHHHHHHHHHHCCCC-CCCCCCCCCCCEE Q ss_conf 88768999999987527860-3665544337316 Q gi|254780480|r 164 ERKFCSQEELSALHELSPLV-QDGLRDFSGKGVA 196 (367) Q Consensus 164 g~~~LsD~eL~~lh~~~~LV-~~~~~~~~~~Gl~ 196 (367) +....+. ...+.=++.++| .-+.+....-||. T Consensus 114 ~~~l~~~-~~~ekWDrVKivCtQPfnk~~~yGLs 146 (152) T pfam01834 114 PDKLVRA-AAEKRWDRVKIVCSQPYSKDSPYGLS 146 (152) T ss_pred HHHCCHH-HHHHCCCEEEEEECCCCCCCCCCCEE T ss_conf 5772866-76502107899983877788852249 No 38 >KOG2936 consensus Probab=39.07 E-value=11 Score=17.70 Aligned_cols=78 Identities=22% Similarity=0.328 Sum_probs=50.6 Q ss_pred EEEECCCCEEEEEEEEE--EECCCCCE----E--EE-EEECCCCCCEEEECCEEECCCCCCCCCCCCCCEEEEEECCCCC Q ss_conf 39988997899986334--44283401----6--44-5304333421342160456886654345667504788617888 Q gi|254780480|r 249 QFILDPNEFYIFASREF--IQIPPSLV----A--EM-IPYDHSIGEFRVHYAGFFDPGFGCVLPQEVGAKAVLEVRSSVP 319 (367) Q Consensus 249 ~~~l~p~~F~l~~s~E~--v~iP~~~a----a--e~-~~~~~~~g~~r~h~AGf~DpGf~~~~~~~~~~~~vLevr~~~p 319 (367) .+...+.++.++.+.|. |..++--+ + .| ..++.++-+. |+-|.||-+=-.- ..+.|.||.+..-+ T Consensus 142 ~L~~~~~~~~~l~t~~v~~v~~~~~~~~~~t~di~l~~tfn~~~~eL---y~~fld~~rv~~w---t~S~a~l~~~~~g~ 215 (301) T KOG2936 142 ELKREHSKDIILPTDEVKPVKQGPVGVVIPTADISLSATFNCRVDEL---YEIFLDPERVKAW---TRSPAELEADPGGK 215 (301) T ss_pred HHHCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCEEHHHCCCCHHHH---HHHHHCHHHHHHH---CCCHHHCCCCCCCC T ss_conf 87404677745321105753447654433434321243218988999---9987369999875---17825400589983 Q ss_pred EEEECCCEEEEEE Q ss_conf 8994798889999 Q gi|254780480|r 320 FVLEHGQIIGRLK 332 (367) Q Consensus 320 ~~l~~g~~i~~l~ 332 (367) |.|-+|-.++.+. T Consensus 216 f~lf~GnVtg~~~ 228 (301) T KOG2936 216 FSLFDGNVTGEFL 228 (301) T ss_pred EEEECCCCEEEEE T ss_conf 5896242002344 No 39 >pfam04797 Herpes_ORF11 Herpesvirus dUTPase protein. This family of proteins are found in Herpesvirus proteins. This family includes proteins called ORF10 and ORF11 amongst others. However, these proteins seem to be related to other dUTPases pfam00692 suggesting that these proteins are also dUTPases (Bateman A pers. obs.). Probab=37.83 E-value=29 Score=15.05 Aligned_cols=251 Identities=14% Similarity=0.097 Sum_probs=104.0 Q ss_pred CCCEEECCCCEEEEEEEEEEECCCCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCCCCEEEECC Q ss_conf 88648668732999988899849982799860134311250342320787755423667443899995168652798339 Q gi|254780480|r 74 SGGTVLDANCVYIVPLMESLNLKNGISAYANPKSSIGRIDVFARVIGDRSQEFDSISPNYSGPLYLEILPRTFPIVVRVG 153 (367) Q Consensus 74 ~~g~vLepg~~yLv~t~E~i~LP~di~a~~~pKSS~GRLgif~r~i~~~~~~f~~id~Gy~G~l~lEI~~~sfpI~v~~G 153 (367) +....+.+|++.+.-+-=..--|..+.=++.+=++ +... +.... ... ..-.+-.|++. ++. .++.-.++ T Consensus 98 ~~~~~i~~g~L~~~I~~v~~v~~~~l~l~i~~~~~-~~~~--~~~c~---q~~-~~~~~~~~~l~--lsG--~~~~~~~~ 166 (384) T pfam04797 98 TPLRVIPPGELKFYILFLTPVKLDGLYLYILPDKT-SIPM--TPSCS---QLG-SVLNSEEPQLF--LSG--QPVRTSPE 166 (384) T ss_pred CCCCCCCCCCEEEEEEEEECCCCCCEEEEECCCCC-CCCC--CCCCC---CCC-CCCCCCCCEEE--EEE--EEECCCCC T ss_conf 79864389972999999604677612899607888-8657--88511---000-10368997389--985--47637898 Q ss_pred CEEEEEEEEECCCCC--CHHHHHHHHHHCCCCCCCCC-CCCCCCEEEEEEECCCCCCCCCCEEEEECCCCCEEEC-CCCC Q ss_conf 803568985088768--99999998752786036655-4433731678881455456666304551267751301-5655 Q gi|254780480|r 154 SRLSQIRFVKERKFC--SQEELSALHELSPLVQDGLR-DFSGKGVALSVDLKGNEIGKKSIIGYRAKRHTAAIDV-DSEN 229 (367) Q Consensus 154 ~rl~Qi~f~~g~~~L--sD~eL~~lh~~~~LV~~~~~-~~~~~Gl~~sVdL~~~~~~~~~iiGyrAkkht~~IDl-~k~~ 229 (367) ....|+++.... -+..--++|.....+-.-.. ..-.+-+.++|++.++. +...+-.+..=+.+.-.. +.+= T Consensus 167 ---~y~~~f~~~~~~~f~~~~~~~~~~~~~~v~~v~~~~~g~~~~rv~v~~~~~~--~~~~v~~~~~v~~~~s~~~~~~L 241 (384) T pfam04797 167 ---TYTPYFLAQKTKPFLKRGFARLHAEEGEVCPVNAVRLGKNYVRVSVTLPSEA--KSETVLVKVPVRVGVSLTNCAEL 241 (384) T ss_pred ---CEEEEEECCCCCCHHCCCEEEECCCCCCEEEEEEEECCCCEEEEEEECCCCC--CCCEEEEEEEEEEECCCCCCCEE T ss_conf ---3667883588814203501331455462788999994898899999777889--87268887889761146766369 Q ss_pred CCCHHHHHHHHCCCCCCCCEEEECCCCEEEEEEEEEEECCCCCEE--E--------EEEECCCCCCEEEECCEEECCCCC Q ss_conf 558778664311355777539988997899986334442834016--4--------453043334213421604568866 Q gi|254780480|r 230 KYDESDFWDPLFSWDAPEGQFILDPNEFYIFASREFIQIPPSLVA--E--------MIPYDHSIGEFRVHYAGFFDPGFG 299 (367) Q Consensus 230 ~y~~~dfwe~i~~~~~~~~~~~l~p~~F~l~~s~E~v~iP~~~aa--e--------~~~~~~~~g~~r~h~AGf~DpGf~ 299 (367) ..+-.-+|...-.... .--+++.||.=.+..-.|.++|+-+..= - .+..+..-..|++--.=| T Consensus 242 ~F~~~~~~~~~~~~~~-~~v~~iY~~~~~~IpP~~~~~v~~~~~y~~~~~~~~~~~~I~~~~~~~~~~i~p~~W------ 314 (384) T pfam04797 242 AFKHNPYPGSVWSWGR-ATVPPVYPGPEVTIPPGESTKVKYNNTYESGLPSKITAAFITGLASNPDFVIEPCEW------ 314 (384) T ss_pred EECCCCCCCCCCCCCC-CEEEEECCCCCEEECCCCEEEEEECCEEEECCCCCCCEEEEECCCCCCEEEEEEEEE------ T ss_conf 9744689766788887-475559179946889997799998529983588765469998589997399974687------ Q ss_pred CCCCCCCCCEEEEEECC--CCCEEEECCCEEEEEEEEECCCCCCCCCC-CCCCCCCCCCC Q ss_conf 54345667504788617--88889947988899999763766422237-54347232666 Q gi|254780480|r 300 CVLPQEVGAKAVLEVRS--SVPFVLEHGQIIGRLKYESMDGEPERLYG-EARGSHYQSQA 356 (367) Q Consensus 300 ~~~~~~~~~~~vLevr~--~~p~~l~~g~~i~~l~~~~~~~~p~~~Yg-~~~~s~Yq~Qg 356 (367) -.++.+.|-|.| +.|..|..|+++|+=+|. .+...=. .....-|+.+. T Consensus 315 -----~P~~~~~l~v~N~s~~pitI~~gt~la~A~fi----~~~~~~~~~~~~~~~~~~s 365 (384) T pfam04797 315 -----LPGSTLKLTVYNPSNFPLTISRGTPLALALFI----APKDSASDAAPRIYYDRLS 365 (384) T ss_pred -----CCCCEEEEEEECCCCCCEEECCCCEEEEEEEE----ECCCCCCCCCHHHHHHHCC T ss_conf -----79981799999689960896189999999998----0366666521001232157 No 40 >KOG3293 consensus Probab=37.74 E-value=29 Score=15.04 Aligned_cols=23 Identities=17% Similarity=0.407 Sum_probs=11.0 Q ss_pred EEEECCCCEEEEEEEEEEECCCC Q ss_conf 39988997899986334442834 Q gi|254780480|r 249 QFILDPNEFYIFASREFIQIPPS 271 (367) Q Consensus 249 ~~~l~p~~F~l~~s~E~v~iP~~ 271 (367) .|...|.-+.=.+|.-+++||.. T Consensus 52 kf~r~pEcYirGttIkylri~d~ 74 (134) T KOG3293 52 KFFRMPECYIRGTTIKYLRIPDE 74 (134) T ss_pred CEEECCEEEEECCEEEEEECCHH T ss_conf 02664313881463589966589 No 41 >TIGR02061 aprA adenylylsulfate reductase, alpha subunit; InterPro: IPR011803 During dissimilatory sulphate reduction or sulphur oxidation, adenylylsulphate (APS) reductase catalyzes reversibly the two-electron reduction of APS to sulphite and AMP. Found in several bacterial lineages and in Archaeoglobales, APS reductase is a heterodimer composed of an alpha subunit containing a noncovalently bound FAD, and a beta subunit containing two [4Fe-4S] clusters. This entry describes the alpha subunit of APS reductase, which shares a common evolutionary origin with fumarate reductase/succinate dehydrogenase flavoproteins.. Probab=35.92 E-value=26 Score=15.32 Aligned_cols=97 Identities=24% Similarity=0.347 Sum_probs=68.5 Q ss_pred EEEECCCCEEEEEEEEEEECCCCCEE-EEEEECCCCCCEEEECCEEECCCCCCCCCCCCCCEEE-EEECCCCCEEEECCC Q ss_conf 39988997899986334442834016-4453043334213421604568866543456675047-886178888994798 Q gi|254780480|r 249 QFILDPNEFYIFASREFIQIPPSLVA-EMIPYDHSIGEFRVHYAGFFDPGFGCVLPQEVGAKAV-LEVRSSVPFVLEHGQ 326 (367) Q Consensus 249 ~~~l~p~~F~l~~s~E~v~iP~~~aa-e~~~~~~~~g~~r~h~AGf~DpGf~~~~~~~~~~~~v-Levr~~~p~~l~~g~ 326 (367) .|=+--++||+..++-.| |-..=|. --+|.++.=|-+|+=||-| |.|=+.....+.|+.-+ +|.| -||++-.||= T Consensus 192 GFnvR~~~~hvFKaktvi-vAaGGA~n~yrPRStGEgaGR~WYa~w-~~GS~Y~m~~~aGA~mT~mEnR-fvpaRFKDGY 268 (651) T TIGR02061 192 GFNVREGEYHVFKAKTVI-VAAGGASNVYRPRSTGEGAGRVWYAVW-DAGSAYTMLLKAGAKMTQMENR-FVPARFKDGY 268 (651) T ss_pred EEEECCCCEEEEECCEEE-EECCCCCCCCCCCCCCCCCCCCCCCCC-CCCHHHHHHHHHHHHHHHHHCC-CEEEEECCCC T ss_conf 001017865698712456-630661001357624436775003645-6635789999863324322034-0220331788 Q ss_pred -EEEEEEEEECCCCCCCCCCCCCC Q ss_conf -88999997637664222375434 Q gi|254780480|r 327 -IIGRLKYESMDGEPERLYGEARG 349 (367) Q Consensus 327 -~i~~l~~~~~~~~p~~~Yg~~~~ 349 (367) +||. -|--+...+...||++.- T Consensus 269 GPVGA-wFL~fK~ka~N~~GEeY~ 291 (651) T TIGR02061 269 GPVGA-WFLFFKAKAKNALGEEYI 291 (651) T ss_pred CCHHH-HHHHHHHHCCCCCCCCHH T ss_conf 87689-998888630578861235 No 42 >TIGR01163 rpe ribulose-phosphate 3-epimerase; InterPro: IPR000056 Ribulose-phosphate 3-epimerase (5.1.3.1 from EC) (also known as pentose-5-phosphate 3-epimerase or PPE) is the enzyme that converts D-ribulose 5-phosphate into D-xylulose 5-phosphate in Calvin's reductive pentose phosphate cycle. In Alcaligenes eutrophus two copies of the gene coding for PPE are known , one is chromosomally encoded P40117 from SWISSPROT, the other one is on a plasmid Q04539 from SWISSPROT. PPE has been found in a wide range of bacteria, archaebacteria, fungi and plants. All the proteins have from 209 to 241 amino acid residues. The enzyme has a TIM barrel structure.; GO: 0004750 ribulose-phosphate 3-epimerase activity, 0005975 carbohydrate metabolic process. Probab=35.76 E-value=14 Score=17.02 Aligned_cols=89 Identities=16% Similarity=0.197 Sum_probs=50.9 Q ss_pred CCEEEEEEECCCCCCCCCCEEEEECCCCCEEECCCCCCCCHHHHHHHHCCCCCCCCEEEECCCCEEEEEEEEEEECCCCC Q ss_conf 73167888145545666630455126775130156555587786643113557775399889978999863344428340 Q gi|254780480|r 193 KGVALSVDLKGNEIGKKSIIGYRAKRHTAAIDVDSENKYDESDFWDPLFSWDAPEGQFILDPNEFYIFASREFIQIPPSL 272 (367) Q Consensus 193 ~Gl~~sVdL~~~~~~~~~iiGyrAkkht~~IDl~k~~~y~~~dfwe~i~~~~~~~~~~~l~p~~F~l~~s~E~v~iP~~~ 272 (367) --+++.|+|--+. .+..+--=|+--|..|-+.--+...+..--+-|+.. +.+-.+.|+|+- |-+. T Consensus 57 ~~~P~DVHLMv~~--pd~~~~~Fa~aGA~~I~vH~Ea~~h~~R~l~~Ik~~-G~~AG~v~NP~T------------Pl~~ 121 (216) T TIGR01163 57 TKLPIDVHLMVEN--PDRYIEDFAEAGADIITVHAEATEHIHRLLQLIKEL-GAKAGIVLNPAT------------PLEA 121 (216) T ss_pred CCCCEEEEECCCC--HHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHC-CCCEEEEECCCC------------CHHH T ss_conf 9521266303578--577788999708998998437762679999999971-897068867999------------9878 Q ss_pred EEEEEEECCCCCCEEEECCEEECCCCCCCC Q ss_conf 164453043334213421604568866543 Q gi|254780480|r 273 VAEMIPYDHSIGEFRVHYAGFFDPGFGCVL 302 (367) Q Consensus 273 aae~~~~~~~~g~~r~h~AGf~DpGf~~~~ 302 (367) --++-++-..+=.+ - ..||||||- T Consensus 122 ~~~~L~~~D~VLlM-S-----VnPGFgGQk 145 (216) T TIGR01163 122 LEYVLEDVDLVLLM-S-----VNPGFGGQK 145 (216) T ss_pred HHHHHHHCCEEEEE-E-----EECCCCCCC T ss_conf 99898762989988-7-----607998841 No 43 >pfam03843 Slp Outer membrane lipoprotein Slp family. Probab=34.41 E-value=30 Score=14.95 Aligned_cols=39 Identities=26% Similarity=0.351 Sum_probs=24.1 Q ss_pred EEEEEEEECCCCCEEEEEEECCCCCCEEEECCEEECCCC Q ss_conf 986334442834016445304333421342160456886 Q gi|254780480|r 260 FASREFIQIPPSLVAEMIPYDHSIGEFRVHYAGFFDPGF 298 (367) Q Consensus 260 ~~s~E~v~iP~~~aae~~~~~~~~g~~r~h~AGf~DpGf 298 (367) -+-.|.+..|-|-.+.=.-.+...|-|.+-..||.||-= T Consensus 52 ~T~lEV~~~PLd~~grP~~~~~s~GRFla~~~gFlDP~~ 90 (160) T pfam03843 52 QTRLEIVSLPLDSSGRPDLDDESEGRFLAYVNGFLDPAI 90 (160) T ss_pred CEEEEEEECCCCCCCCCCCCCCCCCEEEEECCCCCCHHH T ss_conf 629999981468789728789999779997588778768 No 44 >COG1273 Ku-homolog [Replication, recombination, and repair] Probab=32.85 E-value=30 Score=14.88 Aligned_cols=34 Identities=18% Similarity=0.342 Sum_probs=27.9 Q ss_pred CCCCCCEEEEEEECCCCCCEEEECCCEEEEEEEEEC Q ss_conf 366744389999516865279833980356898508 Q gi|254780480|r 129 ISPNYSGPLYLEILPRTFPIVVRVGSRLSQIRFVKE 164 (367) Q Consensus 129 id~Gy~G~l~lEI~~~sfpI~v~~G~rl~Qi~f~~g 164 (367) ..+-|+|-|-+-++ +.||++|+-..=.-|+|+.= T Consensus 3 ~Ra~WKG~ikfgLV--~~PV~ly~ATs~seI~F~~L 36 (278) T COG1273 3 MRAIWKGYIKFGLV--NCPVKLYPATSESEIRFHTL 36 (278) T ss_pred CCCCCCCEEEEEEE--ECCEEEEECCCCCCCCHHHH T ss_conf 54401103675257--21047540356553103001 No 45 >KOG4048 consensus Probab=30.62 E-value=37 Score=14.30 Aligned_cols=86 Identities=14% Similarity=0.119 Sum_probs=53.1 Q ss_pred CCCCEEEEECCCCCEEECCCCCCCCHHHHHHHHCCCCCCCCEEEECCCCEEEEEEEEEEECCCC---------------- Q ss_conf 6663045512677513015655558778664311355777539988997899986334442834---------------- Q gi|254780480|r 208 KKSIIGYRAKRHTAAIDVDSENKYDESDFWDPLFSWDAPEGQFILDPNEFYIFASREFIQIPPS---------------- 271 (367) Q Consensus 208 ~~~iiGyrAkkht~~IDl~k~~~y~~~dfwe~i~~~~~~~~~~~l~p~~F~l~~s~E~v~iP~~---------------- 271 (367) ++.++-++||.-+|.-++..- ++|=.|.-+-. -++=--|.|.|-.-+....+++| T Consensus 54 pg~Vv~~kAk~RtGe~Nf~~C----iR~gLe~HyG~-----KivglgG~FiIqkG~a~~hvm~dFs~cP~nsdeEVnkWL 124 (201) T KOG4048 54 PGSVVHFKAKVRTGEKNFTNC----IRDGLEEHYGK-----KIVGLGGQFIIQKGKARLHVMADFSSCPLNSDEEVNKWL 124 (201) T ss_pred CCCEEEEEEEEECCCHHHHHH----HHHHHHHHCCC-----CCCCCCCEEEEECCCCEEECCCCCCCCCCCCHHHHHHHH T ss_conf 974799986433073238999----99889987187-----621334079982174013525557899977778998888 Q ss_pred ----------CEEEEEEECCCCCCEEEECCEEECCCCCCCC Q ss_conf ----------0164453043334213421604568866543 Q gi|254780480|r 272 ----------LVAEMIPYDHSIGEFRVHYAGFFDPGFGCVL 302 (367) Q Consensus 272 ----------~aae~~~~~~~~g~~r~h~AGf~DpGf~~~~ 302 (367) ++..|+++++..-.-.-|+--|-+-||||.- T Consensus 125 ~fyEM~ApL~c~~v~vs~dpg~dLRLeHtH~fS~hg~gGHY 165 (201) T KOG4048 125 HFYEMKAPLVCLPVFVSRDPGFDLRLEHTHFFSRHGEGGHY 165 (201) T ss_pred HHEECCCCEEEEEEEEECCCCCCEEEEEEEEEECCCCCCCC T ss_conf 66015676055667873289975356566764015777630 No 46 >pfam11850 DUF3370 Protein of unknown function (DUF3370). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 452 to 532 amino acids in length. Probab=28.90 E-value=40 Score=14.12 Aligned_cols=31 Identities=13% Similarity=0.159 Sum_probs=15.8 Q ss_pred CCCEEECCCCCHHHCCCCEEEEEECCEEEEE Q ss_conf 7984981789865657122588758989998 Q gi|254780480|r 17 RGDIVSEHPLEKNQIQPASIDLRLSSKAYRV 47 (367) Q Consensus 17 ~g~I~i~~p~~~~QIQPaSiDLRLG~~~~rv 47 (367) +|-+.++-|-.+.+...|.++..+.-+|=.+ T Consensus 53 eGILlSTfP~~gk~~p~ahLn~~f~GrFdlF 83 (439) T pfam11850 53 EGILLSTFPGSGKNQPLAHLNYAFNGRFDLF 83 (439) T ss_pred CCEEEECCCCCCCCCCCCCCCCCCCCCCCCE T ss_conf 8879862798887786645664535632336 No 47 >PRK09455 rseB periplasmic negative regulator of sigmaE; Provisional Probab=28.90 E-value=40 Score=14.11 Aligned_cols=10 Identities=40% Similarity=0.747 Sum_probs=4.4 Q ss_pred CCEEEEEEEE Q ss_conf 9803568985 Q gi|254780480|r 153 GSRLSQIRFV 162 (367) Q Consensus 153 G~rl~Qi~f~ 162 (367) |.-|-|++|. T Consensus 160 g~vLEQf~f~ 169 (302) T PRK09455 160 GETLEQFRVI 169 (302) T ss_pred CCEEEEEEEE T ss_conf 9789999999 No 48 >pfam00583 Acetyltransf_1 Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions such as Elp3-related proteins. Probab=28.49 E-value=38 Score=14.24 Aligned_cols=35 Identities=23% Similarity=0.400 Sum_probs=20.6 Q ss_pred ECCCEEEEEEEEECCCCCCCCC--CCCCCCCCCCCCC Q ss_conf 4798889999976376642223--7543472326667 Q gi|254780480|r 323 EHGQIIGRLKYESMDGEPERLY--GEARGSHYQSQAL 357 (367) Q Consensus 323 ~~g~~i~~l~~~~~~~~p~~~Y--g~~~~s~Yq~Qgl 357 (367) ++|+.||-+.+......+...| +-.+...||+||+ T Consensus 3 ~~g~ivG~~~~~~~~~~~~~~~i~~l~V~p~~rg~Gi 39 (80) T pfam00583 3 EDGELVGFASLSIIDEEGNVAEIEGLAVDPEYRGKGI 39 (80) T ss_pred CCCEEEEEEEEEEECCCCCEEEEEEEEECHHHCCCCH T ss_conf 7898999999999648999999999999989959987 No 49 >COG5266 CbiK ABC-type Co2+ transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=28.02 E-value=41 Score=14.02 Aligned_cols=32 Identities=22% Similarity=0.206 Sum_probs=23.8 Q ss_pred EEEEEECCCCCCEEEECCCEEEEEEEEECCCC Q ss_conf 89999516865279833980356898508876 Q gi|254780480|r 136 PLYLEILPRTFPIVVRVGSRLSQIRFVKERKF 167 (367) Q Consensus 136 ~l~lEI~~~sfpI~v~~G~rl~Qi~f~~g~~~ 167 (367) +..+||.|.+-|...+.|-.+-=..+++|+.. T Consensus 154 g~~~EIiPL~nP~~a~~ge~f~~~vl~~GkPv 185 (264) T COG5266 154 GQPLEIIPLENPANAWVGEVFRGKVLDNGKPV 185 (264) T ss_pred CCCCEEEECCCCCCCCCCCEEEEEEEECCCCC T ss_conf 89616675568000224775789997789648 No 50 >PRK05799 coproporphyrinogen III oxidase; Provisional Probab=27.03 E-value=22 Score=15.75 Aligned_cols=66 Identities=17% Similarity=0.200 Sum_probs=43.4 Q ss_pred EEEEEEECCCCCC---EEEECCCEEEEEEEEECCCCCCHHHHHHHHHHCCCCC--CCCCCCCCCCEE-EEEEEC Q ss_conf 3899995168652---7983398035689850887689999999875278603--665544337316-788814 Q gi|254780480|r 135 GPLYLEILPRTFP---IVVRVGSRLSQIRFVKERKFCSQEELSALHELSPLVQ--DGLRDFSGKGVA-LSVDLK 202 (367) Q Consensus 135 G~l~lEI~~~sfp---I~v~~G~rl~Qi~f~~g~~~LsD~eL~~lh~~~~LV~--~~~~~~~~~Gl~-~sVdL~ 202 (367) -.+|+|..|.++. +......-+|-+.+ |-..++|+.|+.++..++.-. ......-.-|.. +|+||- T Consensus 86 ~E~TiE~nP~~~~~~~l~~l~~~GvNRiSi--GVQSf~~~~l~~lgR~h~~~~~~~ai~~~~~~gf~niniDLI 157 (374) T PRK05799 86 LEFTVEGNPGTFTEEKLKILKSMGVNRISI--GLQAWQNSLLKTLGRIHTFEEFLENYKLARKLGFNNINVDLM 157 (374) T ss_pred CEEEEEECCCCCCHHHHHHHHHCCCCEEEE--ECCCCCHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCEEEEEE T ss_conf 458998566778999999999719975888--533588999998479998999999999999759974668854 No 51 >PRK10319 N-acetylmuramoyl-l-alanine amidase I; Provisional Probab=26.06 E-value=30 Score=14.96 Aligned_cols=49 Identities=18% Similarity=0.166 Sum_probs=22.6 Q ss_pred EECCCCCCCCCCCCCCEE-------EEEEECCCCCCEEEECCCEEEEEEE-EECCCCCCHHH Q ss_conf 207877554236674438-------9999516865279833980356898-50887689999 Q gi|254780480|r 119 IGDRSQEFDSISPNYSGP-------LYLEILPRTFPIVVRVGSRLSQIRF-VKERKFCSQEE 172 (367) Q Consensus 119 i~~~~~~f~~id~Gy~G~-------l~lEI~~~sfpI~v~~G~rl~Qi~f-~~g~~~LsD~e 172 (367) +.|-|.+- .|||-.|. |+|+|+-+--..+-..|. +..+ |..+.+++=.+ T Consensus 60 VIDpGHGG--~DpGAig~~G~~EKdi~L~IA~~L~~~L~~~G~---~VvmTR~~D~~v~L~~ 116 (289) T PRK10319 60 MLDPGHGG--IDTGAIGRNGSKEKHVVLAIAKNVRSILRNHGI---DARLTRSGDTFIPLYD 116 (289) T ss_pred EECCCCCC--CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCC---EEEEECCCCCCCCHHH T ss_conf 98699999--899888999973799999999999999986899---7999468988689999 No 52 >TIGR02472 sucr_P_syn_N sucrose-phosphate synthase, putative, glycosyltransferase domain; InterPro: IPR012822 This family consists of the N-terminal regions, or in some cases the entirety, of bacterial proteins closely related to plant sucrose-phosphate synthases (SPS). The C-terminal domain (IPR012821 from INTERPRO), found with most members of this family, resembles both bona fide plant sucrose-phosphate phosphatases (SPP) and the SPP-like domain of plant sucrose phosphate synthase (SPS). At least two members of this family lack the SPP-like domain, which may have binding or regulatory rather than enzymatic activity by analogy to plant SPS. This enzyme produces sucrose 6-phosphate and UDP from UDP-glucose and D-fructose 6-phosphate, and may be encoded near the gene for fructokinase.. Probab=25.63 E-value=34 Score=14.55 Aligned_cols=63 Identities=25% Similarity=0.537 Sum_probs=40.9 Q ss_pred CCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCCCCEEEECCCEEEEEEE-----EECCCC------CCHHHH Q ss_conf 013431125034232078775542366744389999516865279833980356898-----508876------899999 Q gi|254780480|r 105 PKSSIGRIDVFARVIGDRSQEFDSISPNYSGPLYLEILPRTFPIVVRVGSRLSQIRF-----VKERKF------CSQEEL 173 (367) Q Consensus 105 pKSS~GRLgif~r~i~~~~~~f~~id~Gy~G~l~lEI~~~sfpI~v~~G~rl~Qi~f-----~~g~~~------LsD~eL 173 (367) -||-.++.||.||.|.|. +|+|-|--++ |.. -|+-||.=|-| -|++.. +.|.-| T Consensus 41 ~~~ev~qVdlvTR~I~D~-----~vspDYA~~~--E~i--------~p~ArIvRl~fGP~rYLRKE~LWPyLD~~~D~~l 105 (445) T TIGR02472 41 RRSEVEQVDLVTRLIKDA-----KVSPDYAQPI--ERI--------APKARIVRLPFGPRRYLRKELLWPYLDELADQLL 105 (445) T ss_pred CCCCCCEEEEEEEEEECC-----CCCCHHHHHH--HHC--------CCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 589751476775034317-----7680233477--640--------8995599745788855740156600789999999 Q ss_pred HHHHHHCCC Q ss_conf 998752786 Q gi|254780480|r 174 SALHELSPL 182 (367) Q Consensus 174 ~~lh~~~~L 182 (367) .=+++.+.+ T Consensus 106 ~ylr~~g~l 114 (445) T TIGR02472 106 SYLRQQGRL 114 (445) T ss_pred HHHHHCCCC T ss_conf 999735888 No 53 >KOG1408 consensus Probab=24.42 E-value=48 Score=13.59 Aligned_cols=65 Identities=18% Similarity=0.120 Sum_probs=38.4 Q ss_pred CCEEEEECCCCCCCCCCEEEEEECCCCCCCC-CCCCCCEEEEEEECCCCCCEEEECCCEEEEEEEEECC Q ss_conf 8279986013431125034232078775542-3667443899995168652798339803568985088 Q gi|254780480|r 98 GISAYANPKSSIGRIDVFARVIGDRSQEFDS-ISPNYSGPLYLEILPRTFPIVVRVGSRLSQIRFVKER 165 (367) Q Consensus 98 di~a~~~pKSS~GRLgif~r~i~~~~~~f~~-id~Gy~G~l~lEI~~~sfpI~v~~G~rl~Qi~f~~g~ 165 (367) .|+-+++-|||-||+.--.+-+...++-||. |||.|+ +++ +..+...|.++-=..=-|.+.|+|. T Consensus 572 simFr~~qk~~~g~~f~r~t~t~~ktTlYDm~Vdp~~k--~v~-t~cQDrnirif~i~sgKq~k~FKgs 637 (1080) T KOG1408 572 SIMFRVNQKASSGRLFPRHTQTLSKTTLYDMAVDPTSK--LVV-TVCQDRNIRIFDIESGKQVKSFKGS 637 (1080) T ss_pred HHHEEHHCCCCCCEECCCCCCCCCCCEEEEEEECCCCC--EEE-EEECCCCEEEEECCCCCEEEEECCC T ss_conf 43200110056722413542113551477763078865--379-9852463677752564044200366 No 54 >COG0860 AmiC N-acetylmuramoyl-L-alanine amidase [Cell envelope biogenesis, outer membrane] Probab=24.00 E-value=40 Score=14.07 Aligned_cols=15 Identities=20% Similarity=0.158 Sum_probs=6.1 Q ss_pred EECCCCCCCCCCCCCCE Q ss_conf 20787755423667443 Q gi|254780480|r 119 IGDRSQEFDSISPNYSG 135 (367) Q Consensus 119 i~~~~~~f~~id~Gy~G 135 (367) +.|-|.+- .|+|=-| T Consensus 46 ~IDpGHGG--~D~GAig 60 (231) T COG0860 46 VIDPGHGG--KDPGAIG 60 (231) T ss_pred EECCCCCC--CCCCCCC T ss_conf 98689997--7876638 No 55 >PRK08221 anaerobic sulfite reductase subunit B; Provisional Probab=23.18 E-value=37 Score=14.34 Aligned_cols=46 Identities=17% Similarity=0.175 Sum_probs=26.7 Q ss_pred EEECCCCEEECCCCEEEEEEEEEEECCCCCEEEEECCCCCCCCCCEEEEEE Q ss_conf 873688648668732999988899849982799860134311250342320 Q gi|254780480|r 70 EIDLSGGTVLDANCVYIVPLMESLNLKNGISAYANPKSSIGRIDVFARVIG 120 (367) Q Consensus 70 eidl~~g~vLepg~~yLv~t~E~i~LP~di~a~~~pKSS~GRLgif~r~i~ 120 (367) .+-++-.+..+||.|+.+...+.-+.|=.|+. .+-|.+.+..++.- T Consensus 24 t~~~~~~~~~~PGQFvmv~v~~~ge~PiSi~~-----~~~g~l~l~~~~vG 69 (263) T PRK08221 24 TFRMAVDGDVKPGQFFEVSLPKVGEAPISVSD-----IGEGYIDLTIRRVG 69 (263) T ss_pred EEEECCCCCCCCCCEEEEEECCCCCCEEEEEC-----CCCCEEEEEEEEEC T ss_conf 99981589979998799995899951058861-----79996999999488 No 56 >TIGR01967 DEAH_box_HrpA ATP-dependent helicase HrpA; InterPro: IPR010222 This entry represents HrpA, one of two related but uncharacterised DEAH-box ATP-dependent helicases in many Proteobacteria and a few high-GC Gram-positive bacteria. HrpA is about 1300 amino acids long, while its paralog HrpB, also uncharacterised, is about 800 amino acids long. Related characterised eukaryotic proteins are RNA helicases associated with pre-mRNA processing.; GO: 0005524 ATP binding, 0008026 ATP-dependent helicase activity. Probab=22.74 E-value=40 Score=14.13 Aligned_cols=20 Identities=35% Similarity=0.725 Sum_probs=14.7 Q ss_pred CCCEEEEEEECCCCCCEEEEC Q ss_conf 744389999516865279833 Q gi|254780480|r 132 NYSGPLYLEILPRTFPIVVRV 152 (367) Q Consensus 132 Gy~G~l~lEI~~~sfpI~v~~ 152 (367) |--.|| +|+|.|||||-+|- T Consensus 237 GkPaPI-IEVSGRTyPVe~RY 256 (1320) T TIGR01967 237 GKPAPI-IEVSGRTYPVEVRY 256 (1320) T ss_pred CCCCCE-EEECCCCCCCCEEE T ss_conf 898875-55278546651111 No 57 >COG2147 RPL19A Ribosomal protein L19E [Translation, ribosomal structure and biogenesis] Probab=21.66 E-value=54 Score=13.25 Aligned_cols=21 Identities=19% Similarity=0.308 Sum_probs=17.0 Q ss_pred CCCCHHHHHHHHHCCCEEECC Q ss_conf 615868999999879849817 Q gi|254780480|r 4 GVLPDKAIATLAERGDIVSEH 24 (367) Q Consensus 4 GILsd~~I~~li~~g~I~i~~ 24 (367) +..+-++|++||++|.|...+ T Consensus 34 ~A~TR~dIr~LIk~g~I~~k~ 54 (150) T COG2147 34 SAITREDIRALIKDGVIKAKP 54 (150) T ss_pred HHHHHHHHHHHHHCCCEEECC T ss_conf 865199999999878766233 No 58 >KOG3021 consensus Probab=21.63 E-value=38 Score=14.28 Aligned_cols=26 Identities=31% Similarity=0.607 Sum_probs=16.9 Q ss_pred HHHHHHHCCCCCCCCEEEECCCCEEE Q ss_conf 78664311355777539988997899 Q gi|254780480|r 234 SDFWDPLFSWDAPEGQFILDPNEFYI 259 (367) Q Consensus 234 ~dfwe~i~~~~~~~~~~~l~p~~F~l 259 (367) -|.|.-..+.+...+++|++|-+||= T Consensus 215 GDLW~GNva~D~~~ePiIfDPasfYg 240 (313) T KOG3021 215 GDLWSGNVAEDGAGEPIIFDPASFYG 240 (313) T ss_pred CCCCCCCCCCCCCCCCEEECCHHHCC T ss_conf 66336753446799965767533038 No 59 >COG1124 DppF ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism] Probab=21.53 E-value=54 Score=13.23 Aligned_cols=85 Identities=14% Similarity=0.137 Sum_probs=43.3 Q ss_pred CEEECCCCEEEEEEEEEEECCCCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCCCCEEEECCCE Q ss_conf 64866873299998889984998279986013431125034232078775542366744389999516865279833980 Q gi|254780480|r 76 GTVLDANCVYIVPLMESLNLKNGISAYANPKSSIGRIDVFARVIGDRSQEFDSISPNYSGPLYLEILPRTFPIVVRVGSR 155 (367) Q Consensus 76 g~vLepg~~yLv~t~E~i~LP~di~a~~~pKSS~GRLgif~r~i~~~~~~f~~id~Gy~G~l~lEI~~~sfpI~v~~G~r 155 (367) .|-+.+|+.+ .| +-..--||||++|+ +++ ...-+.|.+++.=.+..---.-+.=-| T Consensus 27 S~~i~~Ge~l--------gi---vGeSGsGKSTL~r~------l~G-------l~~p~~G~I~~~G~~~~~~~~~~~~~~ 82 (252) T COG1124 27 SLEIERGETL--------GI---VGESGSGKSTLARL------LAG-------LEKPSSGSILLDGKPLAPKKRAKAFYR 82 (252) T ss_pred EEEECCCCEE--------EE---ECCCCCCHHHHHHH------HHC-------CCCCCCCEEEECCCCCCCCCCCHHHCC T ss_conf 5996489789--------99---84898988899999------956-------567888628988840576653033304 Q ss_pred EEEEEEEECCCCCCHHHHHHHHHHCCCCC Q ss_conf 35689850887689999999875278603 Q gi|254780480|r 156 LSQIRFVKERKFCSQEELSALHELSPLVQ 184 (367) Q Consensus 156 l~Qi~f~~g~~~LsD~eL~~lh~~~~LV~ 184 (367) ..|++|-.=.+.|+-..-....-.+++-. T Consensus 83 ~VQmVFQDp~~SLnP~~tv~~~l~Epl~~ 111 (252) T COG1124 83 PVQMVFQDPYSSLNPRRTVGRILSEPLRP 111 (252) T ss_pred CEEEEECCCCCCCCCCHHHHHHHHHHHCC T ss_conf 50699518722468410198997424303 No 60 >cd04331 RNAP_E_N RNAP_E_N: RpoE, N-terminal ribonucleoprotein (RNP) domain. RpoE (subunit E) is a subunit of the archaeal RNA polymerase (RNAP) that is homologous to Rpb7 of eukaryotic RNAP II, Rpc25 of eukaryotic RNAP III, and Rpa43 of eukaryotic RNAP I. RpoE heterodimerizes with RpoF, another RNA polymerase subunit. RpoE has an elongated two-domain structure that includes an N-terminal RNP domain and a C-terminal oligonucleotide-binding (OB) domain. Both domains of RpoE bind single-stranded RNA. Probab=21.48 E-value=55 Score=13.22 Aligned_cols=69 Identities=17% Similarity=0.457 Sum_probs=40.1 Q ss_pred EEEEEEEEEEECCCCCEEEEEEECCCCCCEEEECCEEECCCCCCCCCCCCCCEEEEEECC-CCCEEE-ECCCEEEEEEE Q ss_conf 899986334442834016445304333421342160456886654345667504788617-888899-47988899999 Q gi|254780480|r 257 FYIFASREFIQIPPSLVAEMIPYDHSIGEFRVHYAGFFDPGFGCVLPQEVGAKAVLEVRS-SVPFVL-EHGQIIGRLKY 333 (367) Q Consensus 257 F~l~~s~E~v~iP~~~aae~~~~~~~~g~~r~h~AGf~DpGf~~~~~~~~~~~~vLevr~-~~p~~l-~~g~~i~~l~~ 333 (367) |++..-...|+|||++-++ -...+..-+.+..|+|-+++..|.-+ +|.++.. ....++ -+|..--...| T Consensus 1 f~l~~l~d~IrIpP~~fg~-~~~~~i~~~l~~~yegkv~~~~Gliv-------~v~di~~i~eG~I~~GdG~~y~~V~F 71 (80) T cd04331 1 YKLVELEDVVRVPPELFGE-DLEEAVLEILKEKYEGRLDKDLGKIV-------SVLDVKDVGEGKIVHGDGAVYHEVRF 71 (80) T ss_pred CEEEEEEEEEEECHHHHCC-CHHHHHHHHHHHHHCCEEECCCCEEE-------EEEEEEEECCCEEECCCCCEEEEEEE T ss_conf 9999998999999899096-78999999998884891947768999-------99994895599898299989999999 No 61 >TIGR02992 ectoine_eutC ectoine utilization protein EutC; InterPro: IPR014334 Members of this entry are EutA, a predicted arylmalonate decarboxylase found in a conserved ectoine utilization operon of species that include Mesorhizobium loti, Silicibacter pomeroyi, Agrobacterium tumefaciens, Pseudomonas putida and Sinorhizobium meliloti 1021, where it is known to be induced by ectoine). Members belong to the ornithine cyclodeaminase/mu-crystallin family.. Probab=21.20 E-value=55 Score=13.19 Aligned_cols=55 Identities=18% Similarity=0.206 Sum_probs=37.6 Q ss_pred EEECCCCEEEEEEEE---EEECCCC-------CEEEEEEECCCCCCEEEE-CCEEECCCCCCCCCC Q ss_conf 998899789998633---4442834-------016445304333421342-160456886654345 Q gi|254780480|r 250 FILDPNEFYIFASRE---FIQIPPS-------LVAEMIPYDHSIGEFRVH-YAGFFDPGFGCVLPQ 304 (367) Q Consensus 250 ~~l~p~~F~l~~s~E---~v~iP~~-------~aae~~~~~~~~g~~r~h-~AGf~DpGf~~~~~~ 304 (367) =.|+||.+++++-.. +=.|-|. |+|.-.+--...||+++. .||.+-++=...+-| T Consensus 212 ~wL~pGqh~tAMGsD~EHKnE~~P~~~a~a~~YVaD~~~Qt~~lGELhhAI~Ag~~~~~~~faELG 277 (326) T TIGR02992 212 EWLEPGQHVTAMGSDAEHKNEIDPAVIAKADLYVADRLSQTALLGELHHAIAAGLVAAEQSFAELG 277 (326) T ss_pred HHCCCCCEEEEECCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 336788789860677012234364788541115666568875412268899971105687874211 No 62 >TIGR01081 mpl UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-meso-diaminopimelate ligase; InterPro: IPR005757 The tripeptide L-ananyl-gamma-D-glutamyl-meso-diaminopimelic acid is efficiently re-cycled during the formation of bacterial cell wall murein. UDP-N-acetylmuramate:L-alanyl-gamma-D-glutamyl-meso-diaminopimelate ligase is an essential enzyme in this process which links the tripeptide to UDP-N-acetylmuramate.; GO: 0016874 ligase activity, 0007047 cell wall organization and biogenesis. Probab=20.85 E-value=47 Score=13.66 Aligned_cols=34 Identities=26% Similarity=0.287 Sum_probs=22.3 Q ss_pred HHHHCCCEEECCCCCHHHCCCCEEEEEE-CCEEEE Q ss_conf 9998798498178986565712258875-898999 Q gi|254780480|r 13 TLAERGDIVSEHPLEKNQIQPASIDLRL-SSKAYR 46 (367) Q Consensus 13 ~li~~g~I~i~~p~~~~QIQPaSiDLRL-G~~~~r 46 (367) .++++--|.+-..||..|++|-.=||-+ ||...| T Consensus 39 T~LE~~GIe~~~gfD~~QL~~~~pD~~viGNa~~R 73 (459) T TIGR01081 39 TLLEAQGIEIIEGFDAAQLEPFKPDLVVIGNAMKR 73 (459) T ss_pred HHHHHCCCEEECCCCHHHCCCCCCCEEEECCCCCC T ss_conf 88875573032277844400348887998463316 No 63 >COG3919 Predicted ATP-grasp enzyme [General function prediction only] Probab=20.66 E-value=57 Score=13.11 Aligned_cols=38 Identities=26% Similarity=0.373 Sum_probs=24.8 Q ss_pred CCCCCCEEEECCCEEEEEEEEECCCC--CCHHHHHH-HHHH Q ss_conf 16865279833980356898508876--89999999-8752 Q gi|254780480|r 142 LPRTFPIVVRVGSRLSQIRFVKERKF--CSQEELSA-LHEL 179 (367) Q Consensus 142 ~~~sfpI~v~~G~rl~Qi~f~~g~~~--LsD~eL~~-lh~~ 179 (367) .-.+|||.++|||..+--++-|.+.+ -+.++|+. +|.. T Consensus 144 ~el~FPvILKP~mgg~~~~~araKa~~a~d~ee~k~a~~~a 184 (415) T COG3919 144 DELTFPVILKPGMGGSVHFEARAKAFTAADNEEMKLALHRA 184 (415) T ss_pred HHEEEEEEECCCCCCCCEEEHHHHEEECCCHHHHHHHHHHH T ss_conf 31440388617888751001010014316779999999998 No 64 >PRK10809 ribosomal-protein-S5-alanine N-acetyltransferase; Provisional Probab=20.49 E-value=57 Score=13.09 Aligned_cols=32 Identities=25% Similarity=0.373 Sum_probs=15.8 Q ss_pred CCCEEEEEEEEECCCCC----CCCCCCCCCCCCCCCCC Q ss_conf 79888999997637664----22237543472326667 Q gi|254780480|r 324 HGQIIGRLKYESMDGEP----ERLYGEARGSHYQSQAL 357 (367) Q Consensus 324 ~g~~i~~l~~~~~~~~p----~~~Yg~~~~s~Yq~Qgl 357 (367) .|..||.+.+......+ +.-| -++..|||||+ T Consensus 85 ~~~liG~i~l~~i~~~~~~~~eIGY--~l~~~~~GkGy 120 (194) T PRK10809 85 EKEIIGVANFSNVVRGSFHACYLGY--SIGQKWQGQGL 120 (194) T ss_pred CCEEEEEEEEEECCCCCCCEEEEEE--EECHHHHCCCH T ss_conf 9979999998742577651688999--99888948888 No 65 >TIGR02466 TIGR02466 conserved hypothetical protein; InterPro: IPR012668 This family consists of uncharacterised proteins in Caulobacter crescentus CB15, Bdellovibrio bacteriovorus HD100, Synechococcus sp. WH 8102 (2), Silicibacter pomeroyi DSS-3 (2), and Hyphomonas neptunium ATCC15444. The context of nearby genes differs substantially between members and does point to any specific biological role.. Probab=20.34 E-value=39 Score=14.19 Aligned_cols=99 Identities=15% Similarity=0.219 Sum_probs=58.0 Q ss_pred CCCHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCEEEEECCCCCEEECCCCCCCCHHHHHHHHCCCCCC Q ss_conf 68999999987527860366554433731678881455456666304551267751301565555877866431135577 Q gi|254780480|r 167 FCSQEELSALHELSPLVQDGLRDFSGKGVALSVDLKGNEIGKKSIIGYRAKRHTAAIDVDSENKYDESDFWDPLFSWDAP 246 (367) Q Consensus 167 ~LsD~eL~~lh~~~~LV~~~~~~~~~~Gl~~sVdL~~~~~~~~~iiGyrAkkht~~IDl~k~~~y~~~dfwe~i~~~~~~ 246 (367) ++|-..|..++...+.-.+=...+..---+++-||..|..++ + -.+++.|--|+++.+- T Consensus 51 ~~Sy~sl~d~~~r~~~F~~L~~~l~~hv~kF~~dL~~d~d~~------------------~---l~~~~~W~~I~pR~Gg 109 (210) T TIGR02466 51 YTSYASLLDLPERFPEFAELVKQLDKHVAKFARDLEYDADGL------------------E---LRIQKLWVNILPRQGG 109 (210) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC------------------E---EEECCCCEEEECCCCC T ss_conf 212545656777627888999999999999999825898743------------------5---7601214365278886 Q ss_pred CCEEEECCCCEEEEEEEEEEECCCCCEEEEEEECCCCCCEEEE Q ss_conf 7539988997899986334442834016445304333421342 Q gi|254780480|r 247 EGQFILDPNEFYIFASREFIQIPPSLVAEMIPYDHSIGEFRVH 289 (367) Q Consensus 247 ~~~~~l~p~~F~l~~s~E~v~iP~~~aae~~~~~~~~g~~r~h 289 (367) -.+-.+||+. +++..=||.+|+|+.+ ..--++|.+.+..| T Consensus 110 ~~~~H~HP~s--~iSGtyYV~~Pe~~~~-ikfedPR~~~~ma~ 149 (210) T TIGR02466 110 VHSAHLHPGS--VISGTYYVQTPENCGA-IKFEDPRLDDMMAA 149 (210) T ss_pred EECCCCCCCC--EEEEEEEEECCCCCCC-CCCCCCHHHHHHCC T ss_conf 4267627431--0210446505898697-54213057775224 No 66 >PRK05904 coproporphyrinogen III oxidase; Provisional Probab=20.10 E-value=27 Score=15.24 Aligned_cols=138 Identities=11% Similarity=0.123 Sum_probs=75.2 Q ss_pred CCCCCCEEEEEEECCCCC---CEEEECCCEEEEEEEEECCCCCCHHHHHHHHHHCCCCCCC--CCCCCCCCE-EEEEEEC Q ss_conf 366744389999516865---2798339803568985088768999999987527860366--554433731-6788814 Q gi|254780480|r 129 ISPNYSGPLYLEILPRTF---PIVVRVGSRLSQIRFVKERKFCSQEELSALHELSPLVQDG--LRDFSGKGV-ALSVDLK 202 (367) Q Consensus 129 id~Gy~G~l~lEI~~~sf---pI~v~~G~rl~Qi~f~~g~~~LsD~eL~~lh~~~~LV~~~--~~~~~~~Gl-~~sVdL~ 202 (367) ++++. ++|+|..|.++ -+..+...-+| |+--|-..++|+.|+.+...+..-... .......|. .+|+||- T Consensus 86 ~~~~~--EiTiEaNP~~~~~ekL~~lk~~GVN--RiSlGVQSf~d~~Lk~LGR~H~~~~~~~ai~~~r~~Gf~nIsiDLI 161 (353) T PRK05904 86 VDNNC--EFTIECNPELITQSQINLLKKNKVN--RISLGVQSMNNNILKQLNRTHTIQDSKEAINLLHKNGIYNISCDFL 161 (353) T ss_pred CCCCC--EEEEEECCCCCCHHHHHHHHHCCCC--EEEEECCCCCHHHHHHHCCCCCHHHHHHHHHHHHHHCCCEEEECEE T ss_conf 38783--5999865144878999999964987--6888745599899998389998999999999999819973600426 Q ss_pred CCCCCCCCCEEEEECCCCCEEECCCCCCCCHHHHHHHHC------CCCCCCCEEEECCCCEEEEEEEEEEECCCCCEEEE Q ss_conf 554566663045512677513015655558778664311------35577753998899789998633444283401644 Q gi|254780480|r 203 GNEIGKKSIIGYRAKRHTAAIDVDSENKYDESDFWDPLF------SWDAPEGQFILDPNEFYIFASREFIQIPPSLVAEM 276 (367) Q Consensus 203 ~~~~~~~~iiGyrAkkht~~IDl~k~~~y~~~dfwe~i~------~~~~~~~~~~l~p~~F~l~~s~E~v~iP~~~aae~ 276 (367) = ++.|- ..+++-+.+. +....-=++.++||... .++...++.|..++| T Consensus 162 y------GlPgQ-----------------T~~~~~~~L~~~l~l~p~HiS~Y~LtiE~gT~~---~~~~~~~~~d~~~~~ 215 (353) T PRK05904 162 Y------CLPIL-----------------KLKDLDEVFNFILKHKINHISFYSLEIKEGSIL---KKYHYTIDEDKEAEQ 215 (353) T ss_pred E------CCCCC-----------------CHHHHHHHHHHHHHCCCCCEEEEEEEECCCCHH---CCCCCCCCCHHHHHH T ss_conf 3------59999-----------------999999999999965999178888898579732---047889992799999 Q ss_pred E----EECCCCCCEEEEC--CEEEC-CCC Q ss_conf 5----3043334213421--60456-886 Q gi|254780480|r 277 I----PYDHSIGEFRVHY--AGFFD-PGF 298 (367) Q Consensus 277 ~----~~~~~~g~~r~h~--AGf~D-pGf 298 (367) - ......| | .|| .-|-- ||| T Consensus 216 ~~~~~~~L~~~G-y-~~YEISNfAk~pg~ 242 (353) T PRK05904 216 LNYIKAKFNKLN-Y-KRYEVSNWTNNFKY 242 (353) T ss_pred HHHHHHHHHHCC-C-CCEEECCCCCCCCC T ss_conf 999999998669-9-70663512379698 Done!