Query gi|254780743|ref|YP_003065156.1| putative uracil-DNA glycosylase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 261 No_of_seqs 186 out of 1458 Neff 6.3 Searched_HMMs 23785 Date Mon May 30 13:56:00 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780743.hhm -d /home/congqian_1/database/pdb/pdb70.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 1ui0_A Uracil-DNA glycosylase; 100.0 0 0 372.8 15.3 181 76-261 2-188 (205) 2 1vk2_A Uracil-DNA glycosylase 100.0 0 0 358.6 15.7 185 73-260 16-202 (204) 3 2d3y_A Uracil-DNA glycosylase; 100.0 0 0 314.2 10.2 183 75-259 1-214 (219) 4 3ikb_A Uncharacterized conserv 100.0 2.8E-45 0 277.6 6.9 176 73-261 3-194 (198) 5 2c2q_A G/U mismatch-specific D 100.0 3.6E-30 1.5E-34 190.8 9.6 163 91-259 9-184 (199) 6 1mug_A Protein (G:T/U specific 99.9 3.7E-27 1.6E-31 173.6 12.6 142 104-253 6-161 (168) 7 1wyw_A TDG, G/T mismatch-speci 99.9 9.6E-25 4.1E-29 159.9 12.2 164 96-261 11-193 (230) 8 1oe4_A SMUG1, single-strand se 98.8 1.5E-08 6.2E-13 67.6 9.2 130 104-250 50-230 (247) 9 2l3f_A Uncharacterized protein 96.1 0.00094 4E-08 40.2 0.6 133 101-253 7-157 (166) 10 3fci_A UDG, uracil-DNA glycosy 73.9 4.3 0.00018 19.3 8.0 114 106-238 54-192 (223) 11 2ux6_A Pseudoazurin; type-1 co 72.7 1.7 7.1E-05 21.7 2.3 36 109-144 86-121 (122) 12 2j8x_A Uracil-DNA glycosylase; 64.5 7 0.00029 18.1 9.4 118 106-240 57-196 (231) 13 2boo_A UDG, uracil-DNA glycosy 55.0 10 0.00043 17.2 7.0 115 106-239 73-212 (247) 14 1yc5_A NAD-dependent deacetyla 54.3 11 0.00045 17.1 3.9 10 186-195 180-189 (246) 15 1em8_B DNA polymerase III PSI 51.4 12 0.0005 16.8 4.3 90 105-252 9-101 (112) 16 1udg_A Uracil-DNA glycosylase; 51.0 12 0.00051 16.8 9.4 116 106-240 78-217 (244) 17 4eug_A UDG, UNG, protein (glyc 44.1 16 0.00065 16.2 9.6 115 106-238 54-192 (229) 18 2owr_A UDG, uracil-DNA glycosy 36.5 21 0.00086 15.5 11.0 111 106-237 58-185 (218) 19 3a7n_A UDG, uracil-DNA glycosy 35.0 22 0.00091 15.3 6.9 114 106-238 69-207 (238) 20 1nrp_R Receptor based peptide 33.0 3.4 0.00014 19.9 -1.7 16 235-250 5-21 (26) 21 1mkz_A Molybdenum cofactor bio 32.1 24 0.001 15.0 5.1 68 107-196 12-79 (172) 22 3cxm_A Uracil-DNA glycosylase; 30.6 26 0.0011 14.9 11.1 115 106-238 91-231 (268) 23 1oj7_A Hypothetical oxidoreduc 30.5 26 0.0011 14.9 6.5 27 165-195 89-115 (408) 24 3nrb_A Formyltetrahydrofolate 22.3 37 0.0016 14.0 2.3 20 176-195 155-174 (287) 25 2jsx_A Protein NAPD; TAT, proo 21.0 39 0.0017 13.8 2.1 63 73-158 15-77 (95) No 1 >1ui0_A Uracil-DNA glycosylase; base excision repair, iron/sulfer cluster, thermophIle, riken structural genomics/proteomics initiative, RSGI; 1.50A {Thermus thermophilus} SCOP: c.18.1.2 PDB: 1ui1_A Probab=100.00 E-value=0 Score=372.81 Aligned_cols=181 Identities=29% Similarity=0.552 Sum_probs=174.4 Q ss_pred CHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCEEEE Q ss_conf 98999999960038852002877223678888848998578983001120012070048999999860986222100000 Q gi|254780743|r 76 SLHELKSLLRSFHDCHLCSTSLSTICATQTEGQDLMIIGYTPSDSDNISGKPFSGKTGNMLDKMLQSIEIMRTQIHISMI 155 (261) Q Consensus 76 ~l~~L~~~~~~~~~C~L~~~~~~~v~g~G~~~a~imiVgeaPg~~Ed~~G~pF~G~aG~lL~~mL~aigl~r~~vyitn~ 155 (261) ++++|+..+.+|+.|+|+++++++|+|.|+++|+||||||+||.+|+++|+||+|++|++|++||+++|+++++||+||+ T Consensus 2 ~l~~L~~~i~~C~~C~L~~~~~~~V~g~G~~~a~imiVg~aPg~~e~~~g~pf~G~aG~lL~~~L~~~gl~~~~vyitn~ 81 (205) T 1ui0_A 2 TLELLQAQAQNCTACRLMEGRTRVVFGEGNPDAKLMIVGEGPGEEEDKTGRPFVGKAGQLLNRILEAAGIPREEVYITNI 81 (205) T ss_dssp CHHHHHHHHTTCCSSGGGTTCSSCCCCEECTTCSEEEEESCCCHHHHHHSCSSCSHHHHHHHHHHHHHTCCGGGSEEEES T ss_pred CHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCHHHHHCCCCCCCCHHHHHHHHHHHCCCCHHHEEEECC T ss_conf 88999999867888999889978554789999999999588760776208864675788999999983999899886600 Q ss_pred EECCCCCCCCCCHHHHHHHH-HHHHHHHHHCCCCEEEEECHHHHHHHHCCCCCCCHHCCCEEEEEECCCCCEEEEEEECH Q ss_conf 00122566667788999999-99999863229767995255999998446533101106505766428973789997080 Q gi|254780743|r 156 SPWHPPGNRNLSNIEMEICR-PIIMRQIELISPKILLFIGNKTKNFFFNNDAQKTYQNLGKWSNLCTPHSIIPTLATVHP 234 (261) Q Consensus 156 vkcrPp~nr~p~~~ei~~c~-p~L~~ei~li~Pk~ii~lG~~a~~~ll~~~~~~i~~~rG~~~~~~~~~~~i~~~pt~HP 234 (261) |||+|++++.|+..|++.|+ +||.+||++++|++||+||++|++++++.. ..+.+.||+||++ .++++|||||| T Consensus 82 vkc~p~~~~~~~~~ei~~C~~p~L~~ei~li~Pkiii~lG~~a~~~~l~~~-~~~~~~rG~~~~~----~~~~~~~tyHP 156 (205) T 1ui0_A 82 VKCRPPQNRAPLPDEAKICTDKWLLKQIELIAPQIIVPLGAVAAEFFLGEK-VSITKVRGKWYEW----HGIKVFPMFHP 156 (205) T ss_dssp CSSCCGGGCCCCHHHHHHHHHHTHHHHHHHHCCSEEEEBSHHHHHHHHTSC-CCHHHHTTCCEEE----TTEEECCBCCH T ss_pred EEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHHHCCC-CCCCCCCCCEEEE----CCCEEEEECCC T ss_conf 401377678998699999999999999985488367330899999986125-4412224713873----89669998597 Q ss_pred HHHHCCHHH-----HHHHHHHHHHHHHHHHCC Q ss_conf 388336598-----999999999999998609 Q gi|254780743|r 235 QELIQYPLI-----KKNTWHALITLKKALKNL 261 (261) Q Consensus 235 s~LLr~P~~-----K~~aW~DL~~l~~~L~~l 261 (261) |||+|||+. |+.+|+||++++++|++| T Consensus 157 s~LlrnP~~~~~~~K~~~w~Dl~~lk~~L~~l 188 (205) T 1ui0_A 157 AYLLRNPSRAPGSPKHLTWLDIQEVKRALDAL 188 (205) T ss_dssp HHHHHSCCCSTTCHHHHHHHHHHHHHHHHHHC T ss_pred HHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCC T ss_conf 68865935343046889999999999999827 No 2 >1vk2_A Uracil-DNA glycosylase TM0511; structural genomics, JCSG, PSI, protein structure initiative, joint center for structural genomics, hydrolase; 1.90A {Thermotoga maritima} SCOP: c.18.1.2 PDB: 1l9g_A Probab=100.00 E-value=0 Score=358.57 Aligned_cols=185 Identities=19% Similarity=0.387 Sum_probs=174.7 Q ss_pred CCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCE Q ss_conf 46998999999960038852002877223678888848998578983001120012070048999999860986222100 Q gi|254780743|r 73 KACSLHELKSLLRSFHDCHLCSTSLSTICATQTEGQDLMIIGYTPSDSDNISGKPFSGKTGNMLDKMLQSIEIMRTQIHI 152 (261) Q Consensus 73 ~~~~l~~L~~~~~~~~~C~L~~~~~~~v~g~G~~~a~imiVgeaPg~~Ed~~G~pF~G~aG~lL~~mL~aigl~r~~vyi 152 (261) ....++.|++.+.+|+.|+|+.+++++|+|.|+++|+||||||+||.+|+++|+||+|+||++|++||.++|+++++||+ T Consensus 16 ~~~~l~~L~~~i~~C~~C~L~~~~~~~V~g~G~~~a~imiVgeaPg~~e~~~g~pf~g~sG~ll~~~l~~~gl~~~~vyi 95 (204) T 1vk2_A 16 REELMEIVSERVKKCTACPLHLNRTNVVVGEGNLDTRIVFVGEGPGEEEDKTGRPFVGRAGMLLTELLRESGIRREDVYI 95 (204) T ss_dssp HHHHHHHHHHHHHHCCSSGGGGTCSSCCCCEECTTCSEEEEESCCCHHHHHHTCSSCSHHHHHHHHHHHHTTCCGGGSEE T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCEEE T ss_conf 99999999999861878979889988256889999999999899745560529854565387899887762457023579 Q ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECHHHHHHHHCCCCCCCHHCCCEEEEEECCCCCEEEEEEE Q ss_conf 00000122566667788999999999998632297679952559999984465331011065057664289737899970 Q gi|254780743|r 153 SMISPWHPPGNRNLSNIEMEICRPIIMRQIELISPKILLFIGNKTKNFFFNNDAQKTYQNLGKWSNLCTPHSIIPTLATV 232 (261) Q Consensus 153 tn~vkcrPp~nr~p~~~ei~~c~p~L~~ei~li~Pk~ii~lG~~a~~~ll~~~~~~i~~~rG~~~~~~~~~~~i~~~pt~ 232 (261) ||++||+||+||.|+..|++.|++||.+||++++|++||+||+.|.++++.....++.+.||+|+++. .+++++||| T Consensus 96 tn~vkc~pp~n~~~~~~e~~~c~~~l~~eI~li~PkiIv~LG~~a~~~ll~~~~~~i~~~rG~~~~~~---~~~~~~pt~ 172 (204) T 1vk2_A 96 CNVVKCRPPNNRTPTPEEQAACGHFLLAQIEIINPDVIVALGATALSFFVDGKKVSITKVRGNPIDWL---GGKKVIPTF 172 (204) T ss_dssp EESCSSCCGGGCCCCHHHHHHHHHHHHHHHHHHCCSEEEEESHHHHHHTTTTCCCCHHHHTTCCEECG---GGCEEEEEC T ss_pred EEEEEECCCCCCCCCHHHHHHCCCHHHHHHHCCCCCEEEEECHHHHHHHHCCCCCCCCCCCCCEEECC---CCCEEEEEE T ss_conf 76787608867899989998847467777631486479715899999997267766420369558867---998799981 Q ss_pred CHHHHHCCH--HHHHHHHHHHHHHHHHHHC Q ss_conf 803883365--9899999999999999860 Q gi|254780743|r 233 HPQELIQYP--LIKKNTWHALITLKKALKN 260 (261) Q Consensus 233 HPs~LLr~P--~~K~~aW~DL~~l~~~L~~ 260 (261) |||||+||| ..|+.+|+||+++++.|++ T Consensus 173 HPs~llrnp~~~~k~~~~~Dl~~ik~~l~~ 202 (204) T 1vk2_A 173 HPSYLLRNRSNELRRIVLEDIEKAKSFIKK 202 (204) T ss_dssp CHHHHHHCCCHHHHHHHHHHHHHHHTTSCC T ss_pred CCHHHHHCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 965987793888899999999999999985 No 3 >2d3y_A Uracil-DNA glycosylase; base excision repair, iron/sulfer cluster, thermophIle, structural genomics, NPPSFA; HET: DU; 1.55A {Thermus thermophilus HB8} PDB: 2ddg_A* 2dem_A* 2dp6_A* Probab=100.00 E-value=0 Score=314.21 Aligned_cols=183 Identities=13% Similarity=0.142 Sum_probs=145.9 Q ss_pred CCHHHHHHHHHCCCCCC-CCCCCCC----------------CCCCCCCCCCCEEEEECCCCCCCCC-CCCCCCC-CCHHH Q ss_conf 99899999996003885-2002877----------------2236788888489985789830011-2001207-00489 Q gi|254780743|r 75 CSLHELKSLLRSFHDCH-LCSTSLS----------------TICATQTEGQDLMIIGYTPSDSDNI-SGKPFSG-KTGNM 135 (261) Q Consensus 75 ~~l~~L~~~~~~~~~C~-L~~~~~~----------------~v~g~G~~~a~imiVgeaPg~~Ed~-~G~pF~G-~aG~l 135 (261) .++++|...+.+|+.|+ |+.++++ .|||+|+.+|+||||||+||++||. .|+||+| .+|++ T Consensus 1 ~d~~~l~~~i~~C~~Cp~L~~~r~~i~~~~~~~~~~~~~~~pV~G~G~~~A~imiIGeaPg~~ed~~~g~pf~g~~sG~l 80 (219) T 2d3y_A 1 MDREAFVQTLTACRLCPRLVAWREEVVGRKRAFRGEPYWARPVPGFGDPEARILLFGLAPGAHGSNRTGRPFTGDASGAF 80 (219) T ss_dssp CCHHHHHHHHTTCCCCHHHHHHHHHTTTTSGGGTTSCCCCSCCCCEECTTCSEEEEESCCCTTTHHHHSSTTTTSHHHHH T ss_pred CCHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCEEECHHHH T ss_conf 98899988764275767899999999851244446434788428999999998999689873202326898433244989 Q ss_pred HHHHHHHCCCCCC------------CCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECHHHHHHHHC Q ss_conf 9999986098622------------2100000001225666677889999999999986322976799525599999844 Q gi|254780743|r 136 LDKMLQSIEIMRT------------QIHISMISPWHPPGNRNLSNIEMEICRPIIMRQIELISPKILLFIGNKTKNFFFN 203 (261) Q Consensus 136 L~~mL~aigl~r~------------~vyitn~vkcrPp~nr~p~~~ei~~c~p~L~~ei~li~Pk~ii~lG~~a~~~ll~ 203 (261) |++||.++|++++ ++|+||++|||||+||.|+.+|+..|.+|+.+++.+++|++||+||++|.+++++ T Consensus 81 L~~~l~~~gl~~~~~~~~~~~~~~~~~~~tn~vk~rpP~nrp~~~e~~~~~~~ll~e~~~lv~PkvIV~LG~~A~kall~ 160 (219) T 2d3y_A 81 LYPLLHEAGLSSKPESLPGDDLRLYGVYLTAAVRCAPPKNKPTPEELRACARWTEVELGLLPEVRVYVALGRIALEALLA 160 (219) T ss_dssp HHHHHHHTTCBSCSCCCTTSCCCBSSEEEEESCCSCCGGGCCCHHHHHHHHHHHHHHHHTCTTCCEEEEESHHHHHHHHH T ss_pred HHHHHHHHHCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECHHHHHHHHH T ss_conf 99999998644642010133232101476165133598889985999985289999999861862132010999999987 Q ss_pred CCCCCCHHCCCEEEEEECCCCCEEEEEEECHHHHHCCHHHHHHHHHHHHHHHHHHH Q ss_conf 65331011065057664289737899970803883365989999999999999986 Q gi|254780743|r 204 NDAQKTYQNLGKWSNLCTPHSIIPTLATVHPQELIQYPLIKKNTWHALITLKKALK 259 (261) Q Consensus 204 ~~~~~i~~~rG~~~~~~~~~~~i~~~pt~HPs~LLr~P~~K~~aW~DL~~l~~~L~ 259 (261) .........+++++.......++++||||||||++||+ |+.+|+|+..+.++++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~g~~v~ptyHPS~llr~~--~r~~~~~~~~~l~~a~ 214 (219) T 2d3y_A 161 HFGLRKSAHPFRHGAHYPLPGGRHLLASYHVSRQNTQT--GRLTREMFLEVLMEAK 214 (219) T ss_dssp HHTCCTTTSCCCTTCEEEETTTEEEEEECCCSHHHHHH--TSSCHHHHHHHHHHHH T ss_pred HCCCCCCCCCCCCCCEEECCCCCEEEEEECCCCCCCCC--CHHHHHHHHHHHHHHH T ss_conf 60776444433245267379991899996887545598--8023999999999999 No 4 >3ikb_A Uncharacterized conserved protein; APC63946, streptococcus mutans UA159, structural genomics, PSI-2, protein structure initiative; HET: FLC; 1.67A {Streptococcus mutans} Probab=100.00 E-value=2.8e-45 Score=277.58 Aligned_cols=176 Identities=14% Similarity=0.276 Sum_probs=139.6 Q ss_pred CCCCHHHHHHHHHCCCCCCCC-CCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCC--- Q ss_conf 469989999999600388520-0287722367888884899857898300112001207004899999986098622--- Q gi|254780743|r 73 KACSLHELKSLLRSFHDCHLC-STSLSTICATQTEGQDLMIIGYTPSDSDNISGKPFSGKTGNMLDKMLQSIEIMRT--- 148 (261) Q Consensus 73 ~~~~l~~L~~~~~~~~~C~L~-~~~~~~v~g~G~~~a~imiVgeaPg~~Ed~~G~pF~G~aG~lL~~mL~aigl~r~--- 148 (261) ...++++|+..+.+|..|... ....++||+ ||++|+|||||||||++||++|+||+|+||++|.+ ++|++++ T Consensus 3 ~mt~l~~l~~~i~~~~~~~~~~~~~~~Pvf~-g~~~A~imiIGeAPG~~ed~~G~PF~g~sG~~L~~---~lgl~r~~~~ 78 (198) T 3ikb_A 3 AMTSLEEITKAIMADSQNKVFTEKNIEPLFA-APKTARINIVGQAPGIKAQESRLYWNDKSGDRLRE---WMGVDYDTFY 78 (198) T ss_dssp CCCSHHHHHHHHHHSGGGHHHHTTTCCCCCB-CCTTCCEEEECSCCCHHHHHHCCSSCSHHHHHHHH---HHTCCHHHHH T ss_pred HHHHHHHHHHHHHHCHHHHHHHHCCCCCEEE-CCCCCCEEEEECCCCHHHHHCCCCCCCCHHHHHHH---HHCCCHHHCC T ss_conf 7779999999998582213456249975222-69999889986898466521599887825889999---9597553346 Q ss_pred ---CCCEEEEEECCCCCCCC----CCHHHHHHHHHHHHHHHHHCCCCEEEEECHHHHHHHHCCCC-CCCHHCCCEEEEEE Q ss_conf ---21000000012256666----77889999999999986322976799525599999844653-31011065057664 Q gi|254780743|r 149 ---QIHISMISPWHPPGNRN----LSNIEMEICRPIIMRQIELISPKILLFIGNKTKNFFFNNDA-QKTYQNLGKWSNLC 220 (261) Q Consensus 149 ---~vyitn~vkcrPp~nr~----p~~~ei~~c~p~L~~ei~li~Pk~ii~lG~~a~~~ll~~~~-~~i~~~rG~~~~~~ 220 (261) +|||||+++|+||+||. |+.+++..|++++..+ +++|++||+||++|++++|+... ..+.+.+|+|+++. T Consensus 79 ~~~~v~i~n~v~c~Pp~nr~~~~~p~~~~~~~~~~~l~~~--l~~pklIv~LG~~A~~~~L~~~~~~~~~~~~~~~~~~~ 156 (198) T 3ikb_A 79 HSGYFAVIPMDFYYPGKGKSGDLPPRKGFAQKWHQPILDL--LPDIQLTILIGNYAQKYYLHQKSSVKLTDTVAHYKKYL 156 (198) T ss_dssp HSSCEEECCSCSSCCCBCSSSBCCCCTTHHHHHHHHHHTT--CTTCCEEEEESHHHHHHHTTCCTTSCHHHHHHTGGGGT T ss_pred CCCCEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH--CCCCEEEEEECHHHHHHHHCCCCCCCHHHHHHCCHHHC T ss_conf 5578899855667899998999997199999999999986--68970999878999998828677887043421115528 Q ss_pred CCCCCEEEEEEECHHHHHCCHH-HHHHHH---HHHHHHHHHHHCC Q ss_conf 2897378999708038833659-899999---9999999998609 Q gi|254780743|r 221 TPHSIIPTLATVHPQELIQYPL-IKKNTW---HALITLKKALKNL 261 (261) Q Consensus 221 ~~~~~i~~~pt~HPs~LLr~P~-~K~~aW---~DL~~l~~~L~~l 261 (261) . .+||+||||+ ||.. .||+.| ++|..|++++++| T Consensus 157 ~-----~~~pl~HPS~--rN~~wlkrnpwfe~e~l~~Lr~~v~~l 194 (198) T 3ikb_A 157 P-----DYFPLVHPSP--RNQIWMSRHPWFEAQVVPDLKKIIQQI 194 (198) T ss_dssp T-----TEEEECCCCG--GGHHHHHHCTHHHHTHHHHHHHHHHHH T ss_pred C-----CEEEECCCCC--CCHHHHHHCCHHHHHHHHHHHHHHHHH T ss_conf 8-----9788548984--661788629316687699999999999 No 5 >2c2q_A G/U mismatch-specific DNA glycosylase; radiation resistance, DNA repair enzymes, uracil-DNA glycosylase, MUG, hydrolase; 1.7A {Deinococcus radiodurans} PDB: 2c2p_A Probab=99.96 E-value=3.6e-30 Score=190.81 Aligned_cols=163 Identities=13% Similarity=0.162 Sum_probs=132.5 Q ss_pred CCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCC-------------EEEEEE Q ss_conf 5200287722367888884899857898300112001207004899999986098622210-------------000000 Q gi|254780743|r 91 HLCSTSLSTICATQTEGQDLMIIGYTPSDSDNISGKPFSGKTGNMLDKMLQSIEIMRTQIH-------------ISMISP 157 (261) Q Consensus 91 ~L~~~~~~~v~g~G~~~a~imiVgeaPg~~Ed~~G~pF~G~aG~lL~~mL~aigl~r~~vy-------------itn~vk 157 (261) .|..++++.|.+.++++++|||||++||.+|+++|+||+|+ |.+|+++|.++|++++++| +||+++ T Consensus 9 ~~~~~~~~~~P~~~~p~~rilivG~aPG~~s~~~G~~f~~p-~n~fw~~L~~~Gl~~~~~~~~~~~~l~~~~i~~~~~v~ 87 (199) T 2c2q_A 9 DLTGSGEYLVPDVLQPGLTLVLVGTAPSGISARARAYYANP-ENKFWRTLHAVGLTPRQLVPQEYATLPQYGLGLTDVAK 87 (199) T ss_dssp CTTTTTTCSSCCBCCTTCSEEEEESCCCHHHHHHTSSSCST-TCCHHHHHHHTTSSSSCCCGGGGGGGGGGTEEEEESCC T ss_pred CCCCCCCCCCCCCCCCCCCEEEEECCCCHHHHHCCCCCCCH-HHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCEECCCCC T ss_conf 64456777689987999978998269798899669853680-78899999991898666788887676640542225668 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECHHHHHHHHCCCCCCCHHCCCEEEEEECCCCCEEEEEEECHHHH Q ss_conf 12256666778899999999999863229767995255999998446533101106505766428973789997080388 Q gi|254780743|r 158 WHPPGNRNLSNIEMEICRPIIMRQIELISPKILLFIGNKTKNFFFNNDAQKTYQNLGKWSNLCTPHSIIPTLATVHPQEL 237 (261) Q Consensus 158 crPp~nr~p~~~ei~~c~p~L~~ei~li~Pk~ii~lG~~a~~~ll~~~~~~i~~~rG~~~~~~~~~~~i~~~pt~HPs~L 237 (261) |++++|+.++..|+ |.+||.++|++++|++||++|+.+++++++... ..... |..........+++++++||||++ T Consensus 88 ~~~~~~~~~~~~e~--~~~~L~~~i~~~~PrvIv~lG~~a~~~~l~~~~-~~~~~-g~~~~~~~~~~~~~v~~lphpS~l 163 (199) T 2c2q_A 88 RHSGVAAALPGEAW--RPDELRRKVEHYRPRIVAFTSKRGASETLGVPT-GKLPY-GPQPQPLDWPAETELWVLPSTSPL 163 (199) T ss_dssp SCCSCGGGSCGGGC--CHHHHHHHHHHHCCSEEEESSHHHHHHHHCSCG-GGCCS-EECCCCSSCCTTCEEEECCCCSGG T ss_pred CCCCCCCCCCHHHH--HHHHHHHHHHHCCCCEEEECCHHHHHHHHCCCC-CCCCC-CCCCEECCCCCCCEEEEECCCCHH T ss_conf 88988887556565--118799999867986999849899999853455-42332-454300134577207985697588 Q ss_pred HCCHHHHHHHHHHHHHHHHHHH Q ss_conf 3365989999999999999986 Q gi|254780743|r 238 IQYPLIKKNTWHALITLKKALK 259 (261) Q Consensus 238 Lr~P~~K~~aW~DL~~l~~~L~ 259 (261) .++.. |+..|++|..+.+.|+ T Consensus 164 ~~~~~-r~~~~~~l~~~l~~lr 184 (199) T 2c2q_A 164 GHNHF-RLEPWQALGDRVRELR 184 (199) T ss_dssp GTTTC-CSHHHHHHHHHHHHHH T ss_pred HHCCC-CHHHHHHHHHHHHHHH T ss_conf 71734-2679999999999999 No 6 >1mug_A Protein (G:T/U specific DNA glycosylase); hydrolase; 1.80A {Escherichia coli} SCOP: c.18.1.2 PDB: 1mtl_A 1mwi_A* 1mwj_A Probab=99.95 E-value=3.7e-27 Score=173.63 Aligned_cols=142 Identities=15% Similarity=0.178 Sum_probs=116.8 Q ss_pred CCCCCCEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCC-------------CCCCCEEEEEECCCCCCCCCCHHH Q ss_conf 8888848998578983001120012070048999999860986-------------222100000001225666677889 Q gi|254780743|r 104 QTEGQDLMIIGYTPSDSDNISGKPFSGKTGNMLDKMLQSIEIM-------------RTQIHISMISPWHPPGNRNLSNIE 170 (261) Q Consensus 104 G~~~a~imiVgeaPg~~Ed~~G~pF~G~aG~lL~~mL~aigl~-------------r~~vyitn~vkcrPp~nr~p~~~e 170 (261) =.++++|||||++||..++++|+||+|+.+ .|.++|.++|+. +..||+||+++|+++.++.|+..| T Consensus 6 i~p~~~vlivG~~Pg~~s~~~g~~y~~p~n-~fw~~l~~~~~~~~~~~~~~~~~l~~~~i~ltd~v~~~~~~~~~~~~~E 84 (168) T 1mug_A 6 LAPGLRVVFCGINPGLSSAGTGFPFAHPAN-RFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLVDRPTVQANEVSKQE 84 (168) T ss_dssp CCTTCSEEEEESSCCHHHHHHSSTTCSTTC-CHHHHHHHTTSSSSCCCGGGGGGGGGGTEEEEESCCCCCSSGGGSCHHH T ss_pred CCCCCCEEEEECCCCHHHHHCCCCCCCCHH-HHHHHHHHHCCCCCCCCHHHHHHCCCCCEEEEECCCCCCCCCCCCCHHH T ss_conf 699998899847989778855998679557-9999999826877788877764123366798513245577889973899 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECHHHHHHHHCCCCCCCHHCCCEEEEEECCCCCEEEEEEECHHHHHCCH-HHHHHHHH Q ss_conf 99999999998632297679952559999984465331011065057664289737899970803883365-98999999 Q gi|254780743|r 171 MEICRPIIMRQIELISPKILLFIGNKTKNFFFNNDAQKTYQNLGKWSNLCTPHSIIPTLATVHPQELIQYP-LIKKNTWH 249 (261) Q Consensus 171 i~~c~p~L~~ei~li~Pk~ii~lG~~a~~~ll~~~~~~i~~~rG~~~~~~~~~~~i~~~pt~HPs~LLr~P-~~K~~aW~ 249 (261) +..|.+||.++|+.++|++|+++|+.+.+.+++..... .|.... ...++++++++|||++.|.+ +.|..+|. T Consensus 85 ~~~~~~~L~~~l~~~~P~iI~~~G~~a~~~~~~~~~~~----~g~q~~---~~~~~~~~vlPspS~~~~~~~e~k~~~w~ 157 (168) T 1mug_A 85 LHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQ----WGKQTL---TIGSTQIWVLPNPSGLSRVSLEKLVEAYR 157 (168) T ss_dssp HHHHHHHHHHHHHHHCCSEEEEECHHHHHHHHTCCSCC----SEEEEE---EETTEEEEEECCCSTTSCCCHHHHHHHHH T ss_pred HHHHHHHHHHHHHHHCCCHHHHCCHHHHHHHHCCCCCC----CCCCCC---CCCCCCEEEECCCCHHHCCCHHHHHHHHH T ss_conf 98721889999998391801426999999984778776----553245---67997779944899677289999999999 Q ss_pred HHHH Q ss_conf 9999 Q gi|254780743|r 250 ALIT 253 (261) Q Consensus 250 DL~~ 253 (261) +|+. T Consensus 158 ~L~~ 161 (168) T 1mug_A 158 ELDQ 161 (168) T ss_dssp HHHH T ss_pred HHHH T ss_conf 9999 No 7 >1wyw_A TDG, G/T mismatch-specific thymine DNA glycosylase; hydrolase; 2.10A {Homo sapiens} SCOP: c.18.1.2 PDB: 2d07_A 2rba_A* Probab=99.92 E-value=9.6e-25 Score=159.86 Aligned_cols=164 Identities=17% Similarity=0.195 Sum_probs=135.4 Q ss_pred CCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCC--------------CCCCEEEEEECCCC Q ss_conf 8772236788888489985789830011200120700489999998609862--------------22100000001225 Q gi|254780743|r 96 SLSTICATQTEGQDLMIIGYTPSDSDNISGKPFSGKTGNMLDKMLQSIEIMR--------------TQIHISMISPWHPP 161 (261) Q Consensus 96 ~~~~v~g~G~~~a~imiVgeaPg~~Ed~~G~pF~G~aG~lL~~mL~aigl~r--------------~~vyitn~vkcrPp 161 (261) -++.+.....++++|||||.+||..++++|++|+|+++.+.. +|...|+.. ..+++||+|+|+++ T Consensus 11 ~~~~lPd~~~p~~~vLivG~~PG~~S~~~G~~ya~p~N~FW~-~L~~~g~~~~~~~~~e~~~~l~~~gigltdvv~r~t~ 89 (230) T 1wyw_A 11 LTKTLPDILTFNLDIVIIGINPGLMAAYKGHHYPGPGNHFWK-CLFMSGLSEVQLNHMDDHTLPGKYGIGFTNMVERTTP 89 (230) T ss_dssp GGCCCCCBCCSSCSEEEEESSCCHHHHHHTSSSCSSSBCHHH-HHHHTTSSSSCCCGGGGGGHHHHHSEEEEESCCSCCS T ss_pred HHCCCCCCCCCCCCEEEEECCCCHHHHHCCCCCCCCCHHHHH-HHHHHCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 727799877999978998428896678438888786134999-9999748886678778766665314430212215687 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECHHHHHHHHCCCCCCCHHCCCEEEEE--ECCCCCEEEEEEECHHHHHC Q ss_conf 6666778899999999999863229767995255999998446533101106505766--42897378999708038833 Q gi|254780743|r 162 GNRNLSNIEMEICRPIIMRQIELISPKILLFIGNKTKNFFFNNDAQKTYQNLGKWSNL--CTPHSIIPTLATVHPQELIQ 239 (261) Q Consensus 162 ~nr~p~~~ei~~c~p~L~~ei~li~Pk~ii~lG~~a~~~ll~~~~~~i~~~rG~~~~~--~~~~~~i~~~pt~HPs~LLr 239 (261) +++.|+..|+..|.++|.++|+.++|++|+++|..|.+.+++.. .++...+++|... ..++..+.+++++|||.+-+ T Consensus 90 ~~s~ls~~E~~~~~~~L~~~I~~~~P~~iv~~G~~A~~~f~~~~-~g~~~~~~~~G~q~~~~~~~~~~v~vlPS~S~~na 168 (230) T 1wyw_A 90 GSKDLSSKEFREGGRILVQKLQKYQPRIAVFNGKCIYEIFSKEV-FGVKVKNLEFGLQPHKIPDTETLCYVMPSSSARCA 168 (230) T ss_dssp CGGGCCHHHHHHHHHHHHHHHHHHCCSEEEEESHHHHHHHHHHH-TCCCCSSCCSEECSCCSTTSCCEEEEECCCBTTCS T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECHHHHHHHHHHH-CCCCCCCCEECCCCCCCCCCCCEEEECCCCCHHHH T ss_conf 77898778998758999999998399689870789999998642-16775331121355456887856998589818860 Q ss_pred C---HHHHHHHHHHHHHHHHHHHCC Q ss_conf 6---598999999999999998609 Q gi|254780743|r 240 Y---PLIKKNTWHALITLKKALKNL 261 (261) Q Consensus 240 ~---P~~K~~aW~DL~~l~~~L~~l 261 (261) + .+.|-..|..|+.+++.|+.+ T Consensus 169 ~~~r~~~k~~~~~~L~~~~d~l~~~ 193 (230) T 1wyw_A 169 QFPRAQDKVHYYIKLKDLRDQLKGI 193 (230) T ss_dssp SCCSGGGTHHHHHHHHHHHHHHTTC T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 8774899999999999999998767 No 8 >1oe4_A SMUG1, single-strand selective monofunctional uracil DNA glycosylase; hydrolase, hydrolase/complex, SMUG1; HET: 3DR; 2.0A {Xenopus laevis} SCOP: c.18.1.3 PDB: 1oe5_A* 1oe6_A* Probab=98.85 E-value=1.5e-08 Score=67.60 Aligned_cols=130 Identities=15% Similarity=0.284 Sum_probs=90.4 Q ss_pred CCCCCCEEEEECCCCCCC-CCCCCCCCC-----------------------------------CCHHHHHHHHHHCCCCC Q ss_conf 888884899857898300-112001207-----------------------------------00489999998609862 Q gi|254780743|r 104 QTEGQDLMIIGYTPSDSD-NISGKPFSG-----------------------------------KTGNMLDKMLQSIEIMR 147 (261) Q Consensus 104 G~~~a~imiVgeaPg~~E-d~~G~pF~G-----------------------------------~aG~lL~~mL~aigl~r 147 (261) ++...+++|+|..||... -.+|.||.. .||..|-.+++..--+. T Consensus 50 ~~~~~~vl~lGmNPGp~Gm~qTGVPFgd~~~vr~~l~i~~~v~~P~~~HPKRpi~G~~c~r~EvSg~rlW~l~~~~~g~p 129 (247) T 1oe4_A 50 CKSKKEVLFLGMNPGPFGMAQTGVPFGEVNHVRDWLQIEGPVSKPEVEHPKRRIRGFECPQSEVSGARFWSLFKSLCGQP 129 (247) T ss_dssp CSSCCSEEEEESSCCTTTHHHHSSTTCCHHHHHHTTCCCCCCCCCSSCCTTSCCCGGGCCSCCHHHHHHHHHHHHHHSSH T ss_pred CCCCCCEEEEECCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCH T ss_conf 18999889983289877786418685887999878516674678854476776546576655653988999999883999 Q ss_pred ----CCCCEEEEEECCC--CC--CCCCC---HHH----HHHHHHHHHHHHHHCCCCEEEEECHHHHHHHHCCCCCCCHHC Q ss_conf ----2210000000122--56--66677---889----999999999986322976799525599999844653310110 Q gi|254780743|r 148 ----TQIHISMISPWHP--PG--NRNLS---NIE----MEICRPIIMRQIELISPKILLFIGNKTKNFFFNNDAQKTYQN 212 (261) Q Consensus 148 ----~~vyitn~vkcrP--p~--nr~p~---~~e----i~~c~p~L~~ei~li~Pk~ii~lG~~a~~~ll~~~~~~i~~~ 212 (261) +++|+.|.++--+ .. |++|. ..| ...|-.+|.+.|++.+|++||.+|+.|.+.+-. .+ .. T Consensus 130 e~ff~~~fv~N~CPL~f~~~~GkN~TP~~l~~~e~~~l~~~Cd~~L~~~v~~L~p~~VVgVG~~Ae~ra~~----al-~~ 204 (247) T 1oe4_A 130 ETFFKHCFVHNHCPLIFMNHSGKNLTPTDLPKAQRDTLLEICDEALCQAVRVLGVKLVIGVGRFSEQRARK----AL-MA 204 (247) T ss_dssp HHHHTTEEEEESCCCEEECTTCCEECGGGSCHHHHHHHHHHHHHHHHHHHHHHTCSEEEEESHHHHHHHHH----HH-HH T ss_pred HHHHHHHEEEECCCCEECCCCCCCCCCHHCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECHHHHHHHHH----HH-CC T ss_conf 89865342572377333188998688000588899999999999999999861972899977899999999----86-04 Q ss_pred CCEEEEEECCCCCEEEEEEECHHHHHCCHHHHHHHHHH Q ss_conf 65057664289737899970803883365989999999 Q gi|254780743|r 213 LGKWSNLCTPHSIIPTLATVHPQELIQYPLIKKNTWHA 250 (261) Q Consensus 213 rG~~~~~~~~~~~i~~~pt~HPs~LLr~P~~K~~aW~D 250 (261) . +..+++.-..|||- ++|...+. |.. T Consensus 205 ~---------g~~V~v~~i~HPSP--asp~aN~~-W~~ 230 (247) T 1oe4_A 205 E---------GIDVTVKGIMHPSP--RNPQANKG-WEG 230 (247) T ss_dssp T---------TCSCEEEEECCCCT--TCGGGTTT-HHH T ss_pred C---------CCCCEEEECCCCCC--CCCHHCCC-CHH T ss_conf 6---------99855864579999--88000058-489 No 9 >2l3f_A Uncharacterized protein; structural genomics, northeast structural genomics consortiu PSI-2, protein structure initiative; NMR {Methanosarcina acetivorans} Probab=96.15 E-value=0.00094 Score=40.19 Aligned_cols=133 Identities=13% Similarity=0.156 Sum_probs=77.3 Q ss_pred CCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHC-CCCC--------------CCCCEEEEEE-CCCCCCC Q ss_conf 3678888848998578983001120012070048999999860-9862--------------2210000000-1225666 Q gi|254780743|r 101 CATQTEGQDLMIIGYTPSDSDNISGKPFSGKTGNMLDKMLQSI-EIMR--------------TQIHISMISP-WHPPGNR 164 (261) Q Consensus 101 ~g~G~~~a~imiVgeaPg~~Ed~~G~pF~G~aG~lL~~mL~ai-gl~r--------------~~vyitn~vk-crPp~nr 164 (261) ...-+++++|+|+|-.||....+.|..|.++..++ -++|..+ +.+. ..+-+|++|+ |.-.+.. T Consensus 7 pp~~~~~~~VLilGs~Pg~~S~~~g~yYa~p~N~F-W~il~~~~~~~~~~~~~~~~~~~L~~~gIgl~Dvv~~~~r~gs~ 85 (166) T 2l3f_A 7 PAVLDENTEILILGSLPSDESIRKQQYYGNPGNDF-WRLVGHAIGENLQDMAYEKKLKTLKHNRIGLWDVFKAGSREGSQ 85 (166) T ss_dssp CCBCCSSCSEEEEESSCCHHHHHHTSGGGSTTCTH-HHHHHHHHTSCSTTSCHHHHHHHHHHTTEEEEESEEEECCCSSS T ss_pred CCCCCCCCCEEEEECCCCHHHHHCCCCCCCCCCCH-HHHHHHHCCCCCCCCCCHHHHHHHHHCCCCEEEHHHHHCCCCCC T ss_conf 99669998689985288977886688757998841-77899873898755571889999998595301001242106767 Q ss_pred CCCHHHHHHHHHHHHHHH-HHC-CCCEEEEECHHHHHHHHCCCCCCCHHCCCEEEEEECCCCCEEEEEEECHHHHHCCHH Q ss_conf 677889999999999986-322-976799525599999844653310110650576642897378999708038833659 Q gi|254780743|r 165 NLSNIEMEICRPIIMRQI-ELI-SPKILLFIGNKTKNFFFNNDAQKTYQNLGKWSNLCTPHSIIPTLATVHPQELIQYPL 242 (261) Q Consensus 165 ~p~~~ei~~c~p~L~~ei-~li-~Pk~ii~lG~~a~~~ll~~~~~~i~~~rG~~~~~~~~~~~i~~~pt~HPs~LLr~P~ 242 (261) ..+|..-.+-....| +.. +.+.|++.|..|.+.+.... . .| -.+-++|+-.|+.-.... T Consensus 86 ---d~~i~~~~~nd~~~l~~~~p~i~~i~fnG~~A~~~~~~~~--~----~~---------~~~~vLPSTSpana~~~~- 146 (166) T 2l3f_A 86 ---DSKIGDEEINDFSGLKEMVPKLRLICFNGRKAGEYEPLLR--G----MG---------YETKVLPSSSGANRRFSK- 146 (166) T ss_dssp ---SCCCCCCEECCCTTHHHHCSSCCEEEEESHHHHTTHHHHG--G----GT---------CEEEEECCCSHHHHTTCS- T ss_pred ---HHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHCCCCCC--C----CC---------CEEEECCCCCHHHHCCCH- T ss_conf ---7787776666899999858881199978999985650125--7----89---------738985898766566298- Q ss_pred HHHHHHHHHHH Q ss_conf 89999999999 Q gi|254780743|r 243 IKKNTWHALIT 253 (261) Q Consensus 243 ~K~~aW~DL~~ 253 (261) .|...|.++.. T Consensus 147 ekl~~W~~l~~ 157 (166) T 2l3f_A 147 NRESEWEAVFR 157 (166) T ss_dssp STHHHHHHHHG T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 10 >3fci_A UDG, uracil-DNA glycosylase; DNA repair, alternative splicing, disease mutation, DNA damage, glycosidase, HOST- virus interaction, hydrolase; HET: 3FI; 1.27A {Homo sapiens} PDB: 1emh_A* 1emj_A* 1q3f_A* 1ssp_E* 1ugh_E 1akz_A* 2oxm_A* 2oyt_A* 3fcf_A* 2hxm_A* 3fck_B* 3fcl_A* 2ssp_E* 1yuo_A 4skn_E* 1okb_A Probab=73.94 E-value=4.3 Score=19.30 Aligned_cols=114 Identities=12% Similarity=0.133 Sum_probs=62.8 Q ss_pred CCCCEEEEECCCCCC-CCCCCCCCCCCCH----HHHHHHHHHCC-----CC-----------CCCCCEEEEEECCCCCCC Q ss_conf 888489985789830-0112001207004----89999998609-----86-----------222100000001225666 Q gi|254780743|r 106 EGQDLMIIGYTPSDS-DNISGKPFSGKTG----NMLDKMLQSIE-----IM-----------RTQIHISMISPWHPPGNR 164 (261) Q Consensus 106 ~~a~imiVgeaPg~~-Ed~~G~pF~G~aG----~lL~~mL~aig-----l~-----------r~~vyitn~vkcrPp~nr 164 (261) .+-||+|||+-|--. ++..|..|.-..| .-|.++...+. .. ++.|.+-|.+----.|. T Consensus 54 ~~vKVVIlGQDPY~~~~~A~GLaFsv~~~~k~ppSL~NI~kel~~d~~~~~~~~~g~L~~Wa~QGVLLLNt~LTv~~~~- 132 (223) T 3fci_A 54 KDVKVVILGQDPYHGPNQAHGLCFSVQRPVPPPPSLENIYKELSTDIEDFVHPGHGDLSGWAKQGVLLLNAVLTVRAHQ- 132 (223) T ss_dssp GGCCEEEEECSCCCSTTTCCSSTTCCCTTSCCCHHHHHHHHHHHHHSTTCCCCSSCCTHHHHTTTEEEEESSCCEETTC- T ss_pred HHEEEEEECCCCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCHHHHHHCCEEEEECEEEEECCC- T ss_conf 6847999616899999852544431679988793099999999861488888998757778749989860647766778- Q ss_pred CCCHHHHHHHHHHHHHHHH----HCCCCEEEEECHHHHHHHHCCCCCCCHHCCCEEEEEECCCCCEEEEEEECHHHHH Q ss_conf 6778899999999999863----2297679952559999984465331011065057664289737899970803883 Q gi|254780743|r 165 NLSNIEMEICRPIIMRQIE----LISPKILLFIGNKTKNFFFNNDAQKTYQNLGKWSNLCTPHSIIPTLATVHPQELI 238 (261) Q Consensus 165 ~p~~~ei~~c~p~L~~ei~----li~Pk~ii~lG~~a~~~ll~~~~~~i~~~rG~~~~~~~~~~~i~~~pt~HPs~LL 238 (261) |..-......+|...-|+ --++-+.++.|+.|.+. .. .|...+ ..++-+.|||.|- T Consensus 133 -~~SH~~~gW~~ft~~ii~~l~~~~~~vVfiLWG~~A~~~-~~----~i~~~~------------h~il~~~HPSPls 192 (223) T 3fci_A 133 -ANSHKERGWEQFTDAVVSWLNQNSNGLVFLLWGSYAQKK-GS----AIDRKR------------HHVLQTAHPSPLS 192 (223) T ss_dssp -TTTTTTSSHHHHHHHHHHHHHHHCCSCEEEEESHHHHHH-TT----TCCTTT------------SEEEEECCSSTTT T ss_pred -CCCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECHHHHHH-HH----HCCCCC------------CEEEEECCCCCCC T ss_conf -899777888999999999874147886999956689998-85----279878------------7599707998301 No 11 >2ux6_A Pseudoazurin; type-1 copper, metal-binding, redox potential, copper, transport, cupredoxin, periplasmic, electron transport; 1.3A {Achromobacter cycloclastes} PDB: 2ux7_A 2uxf_A 2uxg_A 1bqk_A 1bqr_A 1zia_A 1zib_A 2jkw_A Probab=72.67 E-value=1.7 Score=21.66 Aligned_cols=36 Identities=19% Similarity=0.388 Sum_probs=31.4 Q ss_pred CEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHCC Q ss_conf 489985789830011200120700489999998609 Q gi|254780743|r 109 DLMIIGYTPSDSDNISGKPFSGKTGNMLDKMLQSIE 144 (261) Q Consensus 109 ~imiVgeaPg~~Ed~~G~pF~G~aG~lL~~mL~aig 144 (261) -.++||+.|.-.|+.+...+.|++.+.|+++++++| T Consensus 86 G~IvVgd~~~n~~~~~~~~~~~~s~~~~~~~i~~~~ 121 (122) T 2ux6_A 86 GVVQVGDAPANLEAVKGAKNPKKAQERLDAALAALG 121 (122) T ss_dssp EEEEESSSCTTHHHHHTCCCCHHHHHHHHHHHHHTT T ss_pred EEEEECCCCCCHHHHHHCCCCHHHHHHHHHHHHHHC T ss_conf 999999999898998645473778999999999852 No 12 >2j8x_A Uracil-DNA glycosylase; hydrolase/inhibitor, hydrolase/inhibitor complex, EBV, DNA repair, lytic protein, epstein-BARR virus; 2.3A {Epstein-barr virus} SCOP: c.18.1.1 Probab=64.49 E-value=7 Score=18.13 Aligned_cols=118 Identities=15% Similarity=0.184 Sum_probs=66.9 Q ss_pred CCCCEEEEECCCCCCCCCCCCCCCCCCH----HHHHHHHHHC-----CCC-----------CCCCCEEEEEECCCCCCCC Q ss_conf 8884899857898300112001207004----8999999860-----986-----------2221000000012256666 Q gi|254780743|r 106 EGQDLMIIGYTPSDSDNISGKPFSGKTG----NMLDKMLQSI-----EIM-----------RTQIHISMISPWHPPGNRN 165 (261) Q Consensus 106 ~~a~imiVgeaPg~~Ed~~G~pF~G~aG----~lL~~mL~ai-----gl~-----------r~~vyitn~vkcrPp~nr~ 165 (261) .+-||+|+|+-|--.++..|..|+=+.| -=|.+|++.+ +.. ++.|.+-|.+---..|... T Consensus 57 ~~vKVVIlGQDPY~~~~A~GLaFSv~~~~~~ppSL~NI~kel~~d~~~~~~~~~g~L~~Wa~QGVLLLNt~LTv~~g~~~ 136 (231) T 2j8x_A 57 SDIKVVILGQDPYHGGQANGLAFSVAYGFPVPPSLRNIYAELHRSLPEFSPPDHGCLDAWASQGVLLLNTILTVQKGKPG 136 (231) T ss_dssp GGCCEEEEECSCCSSSCCCSSSSCCCTTSCCCHHHHHHHHHHHHHCTTCCCCSSCCCHHHHTTTEEEEESSCCEETTBTT T ss_pred HHEEEEEEECCCCCCCCEEEEEEECCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHCCEEEECCEEEEECCCCC T ss_conf 89079999568998884201532135899799349999999986458788899898788874876887143897489988 Q ss_pred -CCHHHHHHHHHHHHHHHHHC-CCCEEEEECHHHHHHHHCCCCCCCHHCCCEEEEEECCCCCEEEEEEECHHHHHCC Q ss_conf -77889999999999986322-9767995255999998446533101106505766428973789997080388336 Q gi|254780743|r 166 -LSNIEMEICRPIIMRQIELI-SPKILLFIGNKTKNFFFNNDAQKTYQNLGKWSNLCTPHSIIPTLATVHPQELIQY 240 (261) Q Consensus 166 -p~~~ei~~c~p~L~~ei~li-~Pk~ii~lG~~a~~~ll~~~~~~i~~~rG~~~~~~~~~~~i~~~pt~HPs~LLr~ 240 (261) .....-+.+...+.+.|.-. ++-+.++.|+.|.+. . ..+.. ....++-+.|||.|-.+ T Consensus 137 SH~~~gWe~ft~~vi~~l~~~~~~vVfiLWG~~A~~~-~----~~i~~------------~~h~Il~s~HPSPl~~~ 196 (231) T 2j8x_A 137 SHADIGWAWFTDHVISLLSERLKACVFMLWGAKAGDK-A----SLINS------------KKHLVLTSQHPSPLAQN 196 (231) T ss_dssp TTTTSSHHHHHHHHHHHHHHHCSSCEEEEESHHHHGG-G----GGSCT------------TTSEEEEECCSSGGGGG T ss_pred CCCCCCCHHHHHHHHHHHHHCCCCEEEEEECHHHHHH-H----HCCCC------------CCCEEEECCCCCCCCCC T ss_conf 8667981646999999997166785999958789975-3----03588------------78779975799830136 No 13 >2boo_A UDG, uracil-DNA glycosylase; UNG, base excision repair, radiation resistance, DNA damage, DNA repair, glycosidase, hydrolase; 1.8A {Deinococcus radiodurans} Probab=55.01 E-value=10 Score=17.16 Aligned_cols=115 Identities=12% Similarity=0.174 Sum_probs=62.9 Q ss_pred CCCCEEEEECCCCCC-CCCCCCCCCCCCH----HHHHHHHHHC-----CCC-----------CCCCCEEEEEECCCCCCC Q ss_conf 888489985789830-0112001207004----8999999860-----986-----------222100000001225666 Q gi|254780743|r 106 EGQDLMIIGYTPSDS-DNISGKPFSGKTG----NMLDKMLQSI-----EIM-----------RTQIHISMISPWHPPGNR 164 (261) Q Consensus 106 ~~a~imiVgeaPg~~-Ed~~G~pF~G~aG----~lL~~mL~ai-----gl~-----------r~~vyitn~vkcrPp~nr 164 (261) .+-||+|||+-|--. +...|..|+-..+ .-|.+|+..+ |+. ++.|.+-|.+----.|. T Consensus 73 ~~vKVVIlGQDPY~~~g~A~GLaFSv~~~~k~ppSL~NI~kel~~d~~~~~~p~~g~L~~Wa~QGVLLLNt~LTv~~g~- 151 (247) T 2boo_A 73 GEVKVLILGQDPYHGPNQAHGLSFSVRPGVRVPPSLRNIYKELTEDIPGFVAPKHGYLRSWAEQGVLLLNAVLTVRAGQ- 151 (247) T ss_dssp GGCCEEEEESSCCCSTTCCSSSSSCCCTTSCCCHHHHHHHHHHHHHSTTCCCCSSCCCHHHHTTTEEEEESSCCEETTB- T ss_pred HHEEEEEECCCCCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHCCEEEEECEEEEECCC- T ss_conf 5726999606899798952724665478898886689999999875079877887627779859889960647753899- Q ss_pred CCCHHHHHHHHHHHHHHHHHC----CCCEEEEECHHHHHHHHCCCCCCCHHCCCEEEEEECCCCCEEEEEEECHHHHHC Q ss_conf 677889999999999986322----976799525599999844653310110650576642897378999708038833 Q gi|254780743|r 165 NLSNIEMEICRPIIMRQIELI----SPKILLFIGNKTKNFFFNNDAQKTYQNLGKWSNLCTPHSIIPTLATVHPQELIQ 239 (261) Q Consensus 165 ~p~~~ei~~c~p~L~~ei~li----~Pk~ii~lG~~a~~~ll~~~~~~i~~~rG~~~~~~~~~~~i~~~pt~HPs~LLr 239 (261) |..-.-....+|...-|+.+ ++-+.++.|+.|.+ +. ..|.. ....++-+.|||-|-+ T Consensus 152 -~~SH~~~gWe~ft~~ii~~i~~~~~~vVf~LWG~~A~~-~~----~~i~~------------~~h~il~~~HPSPls~ 212 (247) T 2boo_A 152 -ANSHQGKGWEHFTDAVIKAVNAKEERVVFILWGSYARK-KK----KLITG------------KNHVVIESGHPSPLSE 212 (247) T ss_dssp -TTTTTTSSHHHHHHHHHHHHHTCSSCCEEEEESHHHHG-GG----GGCCC------------TTCEEEEECCSSTTTG T ss_pred -CCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEECHHHHH-HH----CCCCC------------CCCEEEECCCCCHHHC T ss_conf -99854588799999999999864788699996878998-65----03588------------8867996589881224 No 14 >1yc5_A NAD-dependent deacetylase; SIR2, sirtuin, SIR2TM, SIRT1, nicotinamide, hydrolase; HET: ALY; 1.40A {Thermotoga maritima} SCOP: c.31.1.5 PDB: 2h2d_A* 2h2f_A 2h2g_A* 2h2h_A* 2h2i_A* 2h4f_A* 2h4j_A* 3d4b_A* 3d81_A* 2h4h_A* 3jr3_A* 2h59_A* Probab=54.33 E-value=11 Score=17.10 Aligned_cols=10 Identities=10% Similarity=0.634 Sum_probs=7.6 Q ss_pred CCCEEEEECH Q ss_conf 9767995255 Q gi|254780743|r 186 SPKILLFIGN 195 (261) Q Consensus 186 ~Pk~ii~lG~ 195 (261) +..++|.+|. T Consensus 180 ~~Dl~lvvGT 189 (246) T 1yc5_A 180 RASLMIVLGS 189 (246) T ss_dssp HCSEEEEESC T ss_pred HCCEEEEECC T ss_conf 2999999699 No 15 >1em8_B DNA polymerase III PSI subunit; DNA POL III, heterodimer, clamp-loader, alpha-beta fold, gene regulation; HET: DNA; 2.10A {Escherichia coli} SCOP: c.126.1.1 Probab=51.37 E-value=12 Score=16.82 Aligned_cols=90 Identities=12% Similarity=0.197 Sum_probs=54.7 Q ss_pred CCCCCEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 88884899857898300112001207004899999986098622210000000122566667788999999999998632 Q gi|254780743|r 105 TEGQDLMIIGYTPSDSDNISGKPFSGKTGNMLDKMLQSIEIMRTQIHISMISPWHPPGNRNLSNIEMEICRPIIMRQIEL 184 (261) Q Consensus 105 ~~~a~imiVgeaPg~~Ed~~G~pF~G~aG~lL~~mL~aigl~r~~vyitn~vkcrPp~nr~p~~~ei~~c~p~L~~ei~l 184 (261) +.+-|++||.+.+-... . -||..+|++++|+.+++|..+ |. ++.. T Consensus 9 p~~~RLliVa~~~p~~~----------~-~L~~DVLrsl~L~~~q~~~lt-----------pe-------------ql~~ 53 (112) T 1em8_B 9 PAHVRLVMVANDLPALT----------D-PLVSDVLRALTVSPDQVLQLT-----------PE-------------KIAM 53 (112) T ss_dssp CTTCCEEEECSSCCCTT----------S-HHHHHHHHHTTCCGGGEEEEC-----------TT-------------GGGG T ss_pred CHHCEEEEEECCCCCCC----------C-HHHHHHHHHCCCCHHHEEEEC-----------HH-------------HHHH T ss_conf 53066999838997645----------7-189999998799999953458-----------99-------------9952 Q ss_pred C---CCCEEEEECHHHHHHHHCCCCCCCHHCCCEEEEEECCCCCEEEEEEECHHHHHCCHHHHHHHHHHHH Q ss_conf 2---9767995255999998446533101106505766428973789997080388336598999999999 Q gi|254780743|r 185 I---SPKILLFIGNKTKNFFFNNDAQKTYQNLGKWSNLCTPHSIIPTLATVHPQELIQYPLIKKNTWHALI 252 (261) Q Consensus 185 i---~Pk~ii~lG~~a~~~ll~~~~~~i~~~rG~~~~~~~~~~~i~~~pt~HPs~LLr~P~~K~~aW~DL~ 252 (261) + .+..+=.+|..+ .. ..-|+ .+-+-.-+-|-.|+..||.-|+-.. T Consensus 54 L~~~~~~~~W~lG~~~----------~~-~~~~~------------~l~Sp~L~~L~~n~~aKRaLWQQIc 101 (112) T 1em8_B 54 LPQGSHCNSWRLGTDE----------PL-SLEGA------------QVASPALTDLRANPTARAALWQQIC 101 (112) T ss_dssp SCTTCBCEEEEESCCS----------CC-SSBSC------------EEEECCHHHHHHCHHHHHHHHHHHH T ss_pred CCCCCCCEEEECCCCC----------CC-CCCCC------------EEECCCHHHHHCCHHHHHHHHHHHH T ss_conf 8778876268655667----------76-55686------------7818489989449799999999998 No 16 >1udg_A Uracil-DNA glycosylase; hydrolase; 1.75A {Herpes simplex virus} SCOP: c.18.1.1 PDB: 1lau_E 1udh_A 1udi_E 2c53_A* 2c56_A* Probab=50.95 E-value=12 Score=16.78 Aligned_cols=116 Identities=16% Similarity=0.223 Sum_probs=65.4 Q ss_pred CCCCEEEEECCCCC-CCCCCCCCCCCCCHH----HHHHHHHHCC-----CC-----------CCCCCEEEEEECCCCCCC Q ss_conf 88848998578983-001120012070048----9999998609-----86-----------222100000001225666 Q gi|254780743|r 106 EGQDLMIIGYTPSD-SDNISGKPFSGKTGN----MLDKMLQSIE-----IM-----------RTQIHISMISPWHPPGNR 164 (261) Q Consensus 106 ~~a~imiVgeaPg~-~Ed~~G~pF~G~aG~----lL~~mL~aig-----l~-----------r~~vyitn~vkcrPp~nr 164 (261) .+-||+|||+-|-- .+...|..|.-..|. =|.+|+.++. .. ++.|.+-|.+-.--.|. T Consensus 78 ~~vKVVIlGQDPY~~~~~A~GLaFsv~~~~~~ppSLrNI~kel~~d~~~~~~~~~g~L~~Wa~QGVLLLNt~LTv~~g~- 156 (244) T 1udg_A 78 DEVRVVIIGQDPYHHPGQAHGLAFSVRANVPPPPSLRNVLAAVKNCYPEARMSGHGCLEKWARDGVLLLNTTLTVKRGA- 156 (244) T ss_dssp GGCCEEEEECSCCCSTTTCSSSSSCCCTTSCCCHHHHHHHHHHHHHCTTCCCCSCCCCHHHHHTTEEEEESSCCEETTB- T ss_pred HHCEEEEEECCCCCCCCCCCCEEECCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHCCEEEEECEEEEECCC- T ss_conf 6867999851799999860765513689999985699999999864378888887777778649899970658861577- Q ss_pred CCCHHH--HHHHHHHHHHHHHH-CCCCEEEEECHHHHHHHHCCCCCCCHHCCCEEEEEECCCCCEEEEEEECHHHHHCC Q ss_conf 677889--99999999998632-29767995255999998446533101106505766428973789997080388336 Q gi|254780743|r 165 NLSNIE--MEICRPIIMRQIEL-ISPKILLFIGNKTKNFFFNNDAQKTYQNLGKWSNLCTPHSIIPTLATVHPQELIQY 240 (261) Q Consensus 165 ~p~~~e--i~~c~p~L~~ei~l-i~Pk~ii~lG~~a~~~ll~~~~~~i~~~rG~~~~~~~~~~~i~~~pt~HPs~LLr~ 240 (261) .-+... -+.+...+.+.|.. -++-+.++.|+.|.+. .. +...+ -.++-+.|||-|-+. T Consensus 157 ~~SH~~~gWe~ft~~ii~~l~~~~~~iVfiLWG~~Aq~~-~~-----~~~~~------------h~Il~~~HPSPls~~ 217 (244) T 1udg_A 157 AASHSRIGWDRFVGGVIRRLAARRPGLVFMLWGTHAQNA-IR-----PDPRV------------HCVLKFSHPSPLSKV 217 (244) T ss_dssp TTTTTTSSHHHHHHHHHHHHHHHCSSCEEEEESHHHHHH-CC-----CCTTT------------SEEEEECCSSTTSSS T ss_pred CCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEECHHHHHH-CC-----CCCCC------------CEEEEECCCCCCCCC T ss_conf 888788987999999999998368986999967898853-42-----47888------------869986898821136 No 17 >4eug_A UDG, UNG, protein (glycosylase); hydrolase; 1.40A {Escherichia coli B} SCOP: c.18.1.1 PDB: 5eug_A 1eug_A 2uug_A 3eug_A 2eug_A 1uug_A 1lqj_A 1lqm_A 1lqg_A 1flz_A 1eui_A 2jhq_A Probab=44.06 E-value=16 Score=16.15 Aligned_cols=115 Identities=17% Similarity=0.195 Sum_probs=63.6 Q ss_pred CCCCEEEEECCCCCC-CCCCCCCCCCCCH----HHHHHHHHHC-----CCC-----------CCCCCEEEEEECCCCCCC Q ss_conf 888489985789830-0112001207004----8999999860-----986-----------222100000001225666 Q gi|254780743|r 106 EGQDLMIIGYTPSDS-DNISGKPFSGKTG----NMLDKMLQSI-----EIM-----------RTQIHISMISPWHPPGNR 164 (261) Q Consensus 106 ~~a~imiVgeaPg~~-Ed~~G~pF~G~aG----~lL~~mL~ai-----gl~-----------r~~vyitn~vkcrPp~nr 164 (261) .+-||+|+|+-|--. ....|..|.-..| .=|.+++..+ |.. .+.|.+-|.+----.|.. T Consensus 54 ~~vKVVIlGQDPY~~~~~A~GLaFsv~~~~k~PpSL~NI~kel~~d~~~~~~~~~g~L~~Wa~QGVLLLNt~LTv~~g~~ 133 (229) T 4eug_A 54 GDVKVVILGQDPYHGPGQAHGLAFSVRPGIAIPPSLLNMYKELENTIPGFTRPNHGYLESWARQGVLLLNTVLTVRAGQA 133 (229) T ss_dssp GGCCEEEEESSCCCSTTTCSSSTTCCCTTSCCCHHHHHHHHHHHHHSTTCCCCSCCCCHHHHTTTEEEEESSCCEETTBT T ss_pred HHEEEEEEECCCCCCCCCCCCEEEECCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHCCEEEEECEEEEECCCC T ss_conf 37489997247999888768765445899888888999999999853898778988664375398799723388877788 Q ss_pred CCCHHH--HHHHHHHHHHHHHHC-CCCEEEEECHHHHHHHHCCCCCCCHHCCCEEEEEECCCCCEEEEEEECHHHHH Q ss_conf 677889--999999999986322-97679952559999984465331011065057664289737899970803883 Q gi|254780743|r 165 NLSNIE--MEICRPIIMRQIELI-SPKILLFIGNKTKNFFFNNDAQKTYQNLGKWSNLCTPHSIIPTLATVHPQELI 238 (261) Q Consensus 165 ~p~~~e--i~~c~p~L~~ei~li-~Pk~ii~lG~~a~~~ll~~~~~~i~~~rG~~~~~~~~~~~i~~~pt~HPs~LL 238 (261) . +... -+.+...+.+.|.-- ++-+.++.|+.|.+ +. . .|...+ ..++-+.|||.|- T Consensus 134 ~-SH~~~gWe~ft~~ii~~l~~~~~~vVf~LWG~~A~~-~~-~---~i~~~~------------~~Il~s~HPSPls 192 (229) T 4eug_A 134 H-SHASLGWETFTDKVISLINQHREGVVFLLWGSHAQK-KG-A---IIDKQR------------HHVLKAPQPSPLS 192 (229) T ss_dssp T-TTTTSSHHHHHHHHHHHHHHHCSSCEEEEESHHHHH-HT-T---TSCTTT------------SEEEEECCSSTTT T ss_pred C-CCCCCCHHHHHHHHHHHHHHCCCCEEEEEECHHHHH-HH-H---CCCCCC------------CEEEEECCCCCCC T ss_conf 8-877878799999999999846898499996768898-77-4---179888------------6699727998000 No 18 >2owr_A UDG, uracil-DNA glycosylase; uracil-DNA glycosylase fold in the core: 3 layers (A/B/A); HET: EPE; 2.30A {Vaccinia virus} PDB: 2owq_A Probab=36.50 E-value=21 Score=15.46 Aligned_cols=111 Identities=14% Similarity=0.164 Sum_probs=64.1 Q ss_pred CCCCEEEEECCCCCCCCCCCCCCCCCC--HHHHHHHHHHC----CCC---------CCCCCEEEEEECCCCCCCCCCHHH Q ss_conf 888489985789830011200120700--48999999860----986---------222100000001225666677889 Q gi|254780743|r 106 EGQDLMIIGYTPSDSDNISGKPFSGKT--GNMLDKMLQSI----EIM---------RTQIHISMISPWHPPGNRNLSNIE 170 (261) Q Consensus 106 ~~a~imiVgeaPg~~Ed~~G~pF~G~a--G~lL~~mL~ai----gl~---------r~~vyitn~vkcrPp~nr~p~~~e 170 (261) .+-||+|+|+-|-- ....|..|.-+. -.-|.+|...+ +.. ++.|.+-|.+----.|. |.. + T Consensus 58 ~~vKVVIlGQDPYp-~~a~GLaFSv~~~~ppSL~NI~kel~~~~~~~~~~~~~~wa~QGVLLLNt~LTv~~g~--~~s-H 133 (218) T 2owr_A 58 RNKRVCVCGIDPYP-KDGTGVPFESPNFTKKSIKEIASSISRLTGVIDYKGYNLNIIDGVIPWNYYLSCKLGE--TKS-H 133 (218) T ss_dssp TTCCEEEEESSCCS-SCCCSSTTCCTTSCCHHHHHHHHHHHHHHCCCCCSEECGGGSTTEEEEESSSCEETTC--TTT-T T ss_pred HHCEEEEEECCCCC-CCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCHHHCCCEEEEECEEEEECCC--CCC-C T ss_conf 89379998457999-8676722467898776999999998862298888987513219889875558875698--787-7 Q ss_pred HHHHHHHHHHHHHHC--CCCEEEEECHHHHHHHHCCCCCCCHHCCCEEEEEECCCCCEEEEEEECHHHH Q ss_conf 999999999986322--9767995255999998446533101106505766428973789997080388 Q gi|254780743|r 171 MEICRPIIMRQIELI--SPKILLFIGNKTKNFFFNNDAQKTYQNLGKWSNLCTPHSIIPTLATVHPQEL 237 (261) Q Consensus 171 i~~c~p~L~~ei~li--~Pk~ii~lG~~a~~~ll~~~~~~i~~~rG~~~~~~~~~~~i~~~pt~HPs~L 237 (261) .....+|...-|+.+ +..+++++|..+++..... +. .+ ..++.+.||+.. T Consensus 134 ~~~W~~ft~~ii~~l~~~~~vv~ll~g~~a~~~~~~----i~---~~----------~~~i~~~HPss~ 185 (218) T 2owr_A 134 AIYWDKISKLLLQHITKHVSVLYCLGKTDFSNIRAK----LE---SP----------VTTIVGYHPAAR 185 (218) T ss_dssp HHHHHHHHHHHHHHHTTTCSEEEEECHHHHHHHHHH----CS---CC----------CEEEEECCTTC- T ss_pred CCCHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHC----CC---CC----------CEEEECCCCCCC T ss_conf 758499999999999736872899840898867634----78---89----------589979887444 No 19 >3a7n_A UDG, uracil-DNA glycosylase; UNG-UGI interactions, UNG-DNA complex, citrate AS protein LI ligand binding, inhibitor design; HET: FLC; 1.95A {Mycobacterium tuberculosis H37RV} PDB: 2zhx_A* Probab=34.97 E-value=22 Score=15.32 Aligned_cols=114 Identities=15% Similarity=0.214 Sum_probs=63.5 Q ss_pred CCCCEEEEECCCCCC-CCCCCCCCCCCCHH-----HHHHHHHHC----CCC-----------CCCCCEEEEEECCCCCCC Q ss_conf 888489985789830-01120012070048-----999999860----986-----------222100000001225666 Q gi|254780743|r 106 EGQDLMIIGYTPSDS-DNISGKPFSGKTGN-----MLDKMLQSI----EIM-----------RTQIHISMISPWHPPGNR 164 (261) Q Consensus 106 ~~a~imiVgeaPg~~-Ed~~G~pF~G~aG~-----lL~~mL~ai----gl~-----------r~~vyitn~vkcrPp~nr 164 (261) .+-||+|+|+-|--. ....|..|+-+.|. -|.+|++++ |.. ++.|.+.|.+----.|. T Consensus 69 ~~vKVVIlGQDPYh~~gqA~GLaFSv~~~~~~iPpSLrNI~Kel~~d~g~~~~~~gdL~~Wa~QGVLLLNt~LTv~~g~- 147 (238) T 3a7n_A 69 DNVRVLIVGQDPYPTPGHAVGLSFSVAPDVRPWPRSLANIFDEYTADLGYPLPSNGDLTPWAQRGVLLLNRVLTVRPSN- 147 (238) T ss_dssp GGCCEEEEESSCCSSTTTCCSSTTCCCTTCSSCCHHHHHHHHHHHHHHCCCCCSSCCCHHHHHTTEEEEESSCCBCTTC- T ss_pred HHCEEEEEECCCCCCCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHCCEEEEEEEEEECCCC- T ss_conf 7917998842799999961517143489987788899999999998529999999888899869889976345512888- Q ss_pred CCCHHH---HHHHHHHHHHHHHHC-CCCEEEEECHHHHHHHHCCCCCCCHHCCCEEEEEECCCCCEEEEEEECHHHHH Q ss_conf 677889---999999999986322-97679952559999984465331011065057664289737899970803883 Q gi|254780743|r 165 NLSNIE---MEICRPIIMRQIELI-SPKILLFIGNKTKNFFFNNDAQKTYQNLGKWSNLCTPHSIIPTLATVHPQELI 238 (261) Q Consensus 165 ~p~~~e---i~~c~p~L~~ei~li-~Pk~ii~lG~~a~~~ll~~~~~~i~~~rG~~~~~~~~~~~i~~~pt~HPs~LL 238 (261) |..-. -+.+...+.+.|..- ++-+.++.|+.|.+. . ..|.. + ..-++-+.|||-|- T Consensus 148 -~~SH~~~gWe~fT~~vi~~l~~~~~~vVfiLWG~~Aq~~-~----~~I~~--~----------~h~il~s~HPSPlS 207 (238) T 3a7n_A 148 -PASHRGKGWEAVTECAIRALAARAAPLVAILWGRDASTL-K----PMLAA--G----------NCVAIESPHPSPLS 207 (238) T ss_dssp -TTTTTTSSHHHHHHHHHHHHHHSSSCEEEEEESHHHHTT-H----HHHTT--S----------SEEEEEECCSSTTT T ss_pred -CCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEECHHHHHH-H----CCCCC--C----------CCEEEECCCCCCCC T ss_conf -776547886999999999998157995899867589861-0----34587--8----------85799788999502 No 20 >1nrp_R Receptor based peptide NR'S; serine proteinase/receptor; 3.00A {Homo sapiens} PDB: 1nrn_R 1nrq_R* Probab=33.01 E-value=3.4 Score=19.88 Aligned_cols=16 Identities=25% Similarity=0.341 Sum_probs=12.7 Q ss_pred HHHHCCHHHHH-HHHHH Q ss_conf 38833659899-99999 Q gi|254780743|r 235 QELIQYPLIKK-NTWHA 250 (261) Q Consensus 235 s~LLr~P~~K~-~aW~D 250 (261) |+|||||..|- -+|+| T Consensus 5 sfllrnpndkyepfwed 21 (26) T 1nrp_R 5 SFLLRNPNDKYEPFWED 21 (26) T ss_pred HHHHCCCCCCCCCCCCC T ss_conf 13221877666756343 No 21 >1mkz_A Molybdenum cofactor biosynthesis protein B; MAD, WEAK anomalous signal, molybdopterin synthesis, structural genomics, PSI; HET: MSE; 1.60A {Escherichia coli} SCOP: c.57.1.1 PDB: 1r2k_B Probab=32.13 E-value=24 Score=15.04 Aligned_cols=68 Identities=12% Similarity=0.163 Sum_probs=40.9 Q ss_pred CCCEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 88489985789830011200120700489999998609862221000000012256666778899999999999863229 Q gi|254780743|r 107 GQDLMIIGYTPSDSDNISGKPFSGKTGNMLDKMLQSIEIMRTQIHISMISPWHPPGNRNLSNIEMEICRPIIMRQIELIS 186 (261) Q Consensus 107 ~a~imiVgeaPg~~Ed~~G~pF~G~aG~lL~~mL~aigl~r~~vyitn~vkcrPp~nr~p~~~ei~~c~p~L~~ei~li~ 186 (261) ..-|+.||+.-|..+|. +|.+|..+|.+.|..-.+..+ + ..+.+++. .-+.+.+.--. T Consensus 12 rvavitvsD~rg~~~D~--------nGp~L~~~l~~~G~~v~~~~i---v--------~Dd~~~~~---~~l~~~~~~~~ 69 (172) T 1mkz_A 12 RIAILTVSNRRGEEDDT--------SGHYLRDSAQEAGHHVVDKAI---V--------KENRYAIR---AQVSAWIASDD 69 (172) T ss_dssp EEEEEEECSSCCGGGCH--------HHHHHHHHHHHTTCEEEEEEE---E--------CSCHHHHH---HHHHHHHHSSS T ss_pred EEEEEEEECCCCCCCCC--------HHHHHHHHHHHCCCEEEEEEE---C--------CCCHHHHH---HHHHHHHHCCC T ss_conf 89999984898867667--------399999999985995776310---5--------88579999---99999876256 Q ss_pred CCEEEEECHH Q ss_conf 7679952559 Q gi|254780743|r 187 PKILLFIGNK 196 (261) Q Consensus 187 Pk~ii~lG~~ 196 (261) ..+|++-|.+ T Consensus 70 ~dlIiTtGGt 79 (172) T 1mkz_A 70 VQVVLITGGT 79 (172) T ss_dssp CCEEEEESCC T ss_pred CCEEEECCCE T ss_conf 7615852531 No 22 >3cxm_A Uracil-DNA glycosylase; base excision repair, BER, DNA damage repair, MSGPP, SGPP, hydrolase, 5-bromouracil, structural genomics; 1.50A {Leishmania naiffi} Probab=30.55 E-value=26 Score=14.89 Aligned_cols=115 Identities=15% Similarity=0.150 Sum_probs=62.3 Q ss_pred CCCCEEEEECCCCCC-CCCCCCCCCCCCHH----HHHHHHHHC-----CCC-----------CCCCCEEEEEECCCCCCC Q ss_conf 888489985789830-01120012070048----999999860-----986-----------222100000001225666 Q gi|254780743|r 106 EGQDLMIIGYTPSDS-DNISGKPFSGKTGN----MLDKMLQSI-----EIM-----------RTQIHISMISPWHPPGNR 164 (261) Q Consensus 106 ~~a~imiVgeaPg~~-Ed~~G~pF~G~aG~----lL~~mL~ai-----gl~-----------r~~vyitn~vkcrPp~nr 164 (261) .+-||+|||+-|--. +...|..|.-..|. =|.+|++++ |.. ++.|.+.|.+----.|. T Consensus 91 ~~vKVVIlGQDPYh~~gqA~GLaFSv~~~~k~PpSLrNI~KeL~~d~~~~~~p~~g~L~~Wa~QGVLLLNt~LTv~~g~- 169 (268) T 3cxm_A 91 RGLKVVLLGQDPYHDLHQAHGLCFSVLPEVPLPPSLRNIYKELTTDIAGFQAPKHGYLQSWSEQGMLMLNATLTVEAHK- 169 (268) T ss_dssp TTCCEEEEESSCCCSTTTCCSSTTCCCTTSCCCHHHHHHHHHHHHHSTTCCCCSSCCCHHHHTTTEEEEESSCCEETTC- T ss_pred HHCEEEEEECCCCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHCCEEEEEEEEEEECCC- T ss_conf 7937999742799899971068654899999981599999999975288889998526889849858873248887787- Q ss_pred CCCHHHHHHHHHHHHH---HHHHCCC--CEEEEECHHHHHHHHCCCCCCCHHCCCEEEEEECCCCCEEEEEEECHHHHH Q ss_conf 6778899999999999---8632297--679952559999984465331011065057664289737899970803883 Q gi|254780743|r 165 NLSNIEMEICRPIIMR---QIELISP--KILLFIGNKTKNFFFNNDAQKTYQNLGKWSNLCTPHSIIPTLATVHPQELI 238 (261) Q Consensus 165 ~p~~~ei~~c~p~L~~---ei~li~P--k~ii~lG~~a~~~ll~~~~~~i~~~rG~~~~~~~~~~~i~~~pt~HPs~LL 238 (261) .-+...+....+|... .|..-++ -+.++.|+.|.+. . ..|... ...++-+.|||.|- T Consensus 170 ~~SH~~~~GWe~fT~~vI~~i~~~~~~~vVflLWG~~Aq~~-~----~~i~~~------------~h~Il~s~HPSPLS 231 (268) T 3cxm_A 170 ANSHSKTSGWAAFTDAVIQHLSQHHPNRLVFLLWGGYAQQK-K----RLIDAN------------RHVVLENVHPSPLS 231 (268) T ss_dssp TTTTTTTSCHHHHHHHHHHHHHHHCSSCCEEEEESHHHHGG-G----GGSCTT------------TSEEEEECCSSHHH T ss_pred CCCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEEECHHHHHH-H----HCCCCC------------CCEEEEECCCCCCC T ss_conf 34357887789999999999985277887999717589998-8----428987------------86699757999734 No 23 >1oj7_A Hypothetical oxidoreductase YQHD; structural genomics; HET: NZQ; 2.0A {Escherichia coli} SCOP: e.22.1.2 Probab=30.51 E-value=26 Score=14.88 Aligned_cols=27 Identities=19% Similarity=0.209 Sum_probs=13.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHCCCCEEEEECH Q ss_conf 6778899999999999863229767995255 Q gi|254780743|r 165 NLSNIEMEICRPIIMRQIELISPKILLFIGN 195 (261) Q Consensus 165 ~p~~~ei~~c~p~L~~ei~li~Pk~ii~lG~ 195 (261) .|+.+.+..+...+.++ ++..||.+|. T Consensus 89 ~P~~~~v~~~~~~~r~~----~~D~IIavGG 115 (408) T 1oj7_A 89 NPAYETLMNAVKLVREQ----KVTFLLAVGG 115 (408) T ss_dssp SCBHHHHHHHHHHHHHH----TCCEEEEEES T ss_pred CCCHHHHHHHHHHHHHC----CCCEEEECCC T ss_conf 98999999999999755----9988998089 No 24 >3nrb_A Formyltetrahydrofolate deformylase; N-terminal ACT domain, structural genomics, joint center for structural genomics, JCSG; HET: MSE FLC; 2.05A {Pseudomonas putida} Probab=22.33 E-value=37 Score=14.00 Aligned_cols=20 Identities=5% Similarity=0.107 Sum_probs=14.7 Q ss_pred HHHHHHHHHCCCCEEEEECH Q ss_conf 99999863229767995255 Q gi|254780743|r 176 PIIMRQIELISPKILLFIGN 195 (261) Q Consensus 176 p~L~~ei~li~Pk~ii~lG~ 195 (261) ..+...++-.+|.+||+.|- T Consensus 155 ~~~~~~l~~~~~Dlivlag~ 174 (287) T 3nrb_A 155 SQIKNIVTQSQADLIVLARY 174 (287) T ss_dssp HHHHHHHHHHTCSEEEESSC T ss_pred HHHHHHHHCCCCEEEEEECC T ss_conf 99987875268769998700 No 25 >2jsx_A Protein NAPD; TAT, proofreading, cytoplasm, chaperone; NMR {Escherichia coli K12} PDB: 2pq4_A Probab=20.97 E-value=39 Score=13.84 Aligned_cols=63 Identities=10% Similarity=0.298 Sum_probs=40.3 Q ss_pred CCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCCE Q ss_conf 46998999999960038852002877223678888848998578983001120012070048999999860986222100 Q gi|254780743|r 73 KACSLHELKSLLRSFHDCHLCSTSLSTICATQTEGQDLMIIGYTPSDSDNISGKPFSGKTGNMLDKMLQSIEIMRTQIHI 152 (261) Q Consensus 73 ~~~~l~~L~~~~~~~~~C~L~~~~~~~v~g~G~~~a~imiVgeaPg~~Ed~~G~pF~G~aG~lL~~mL~aigl~r~~vyi 152 (261) .+..++++++.+..+.+|..+ +......||.++-|+++.++ ++ +.+.+|..- ..|+= T Consensus 15 ~Pe~~~~V~~~l~~~~g~EIh--------~~d~~~GKiVVtiE~~~~~~-------------~~-~~i~~I~~l-~GVls 71 (95) T 2jsx_A 15 KSERISDISTQLNAFPGCEVA--------VSDAPSGQLIVVVEAEDSET-------------LI-QTIESVRNV-EGVLA 71 (95) T ss_dssp CTTSHHHHHHHHTTSTTEEEE--------EEETTTTEEEEEEEESSHHH-------------HH-HHHHHHTTS-TTEEE T ss_pred CHHHHHHHHHHHHCCCCCEEE--------EECCCCCCEEEEEEECCHHH-------------HH-HHHHHHHCC-CCCEE T ss_conf 878899999999759997896--------20388970999997398679-------------99-999998769-98327 Q ss_pred EEEEEC Q ss_conf 000001 Q gi|254780743|r 153 SMISPW 158 (261) Q Consensus 153 tn~vkc 158 (261) ++++.- T Consensus 72 a~lVY~ 77 (95) T 2jsx_A 72 VSLVYH 77 (95) T ss_dssp EEESSC T ss_pred EEEEEE T ss_conf 867788 Done!