Query 537021.9.peg.753_1 Match_columns 1033 No_of_seqs 206 out of 3477 Neff 6.8 Searched_HMMs 23785 Date Wed May 25 07:46:53 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_753.hhm -d /home/congqian_1/database/pdb/pdb70.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 2hpi_A DNA polymerase III alph 100.0 0 0 2408.0 86.9 1014 5-1033 1-1083(1220) 2 2hnh_A DNA polymerase III alph 100.0 0 0 2098.6 73.8 887 8-911 3-908 (910) 3 3f2b_A DNA-directed DNA polyme 100.0 0 0 1692.6 49.6 807 10-904 115-1031(1041) 4 3e38_A Two-domain protein cont 99.8 5.6E-27 2.3E-31 244.8 -31.7 295 11-358 20-337 (343) 5 3o0f_A Putative metal-dependen 99.7 6.1E-19 2.6E-23 176.8 3.6 73 8-82 9-83 (301) 6 1m65_A Hypothetical protein YC 99.4 1.3E-14 5.5E-19 140.2 1.0 181 10-222 2-198 (245) 7 3dcp_A Histidinol-phosphatase; 99.4 1.9E-13 8.1E-18 130.3 6.3 207 13-221 4-263 (283) 8 2yxo_A Histidinol phosphatase; 99.4 1.1E-13 4.8E-18 132.3 2.1 200 12-222 2-230 (267) 9 2wje_A CPS4B, tyrosine-protein 99.1 3.5E-11 1.5E-15 111.2 5.5 221 10-277 4-247 (247) 10 2anu_A Hypothetical protein TM 98.8 2.7E-08 1.1E-12 86.9 9.3 166 5-221 13-210 (255) 11 2w9m_A Polymerase X; SAXS, DNA 98.7 1.9E-07 8.1E-12 79.6 11.3 20 803-825 473-492 (578) 12 2edu_A Kinesin-like protein KI 96.5 0.0016 6.6E-08 46.5 3.2 51 838-891 41-92 (98) 13 2i5h_A Hypothetical protein AF 95.6 0.0078 3.3E-07 40.6 3.4 36 838-873 133-168 (205) 14 2k50_A Replication factor A re 95.1 0.043 1.8E-06 34.4 6.0 50 983-1032 7-61 (115) 15 1s5l_U Photosystem II 12 kDa e 93.6 0.064 2.7E-06 32.9 4.1 53 819-887 55-107 (134) 16 3i7f_A Aspartyl-tRNA synthetas 93.4 0.16 6.9E-06 29.4 6.0 13 537-549 311-323 (548) 17 3e0e_A Replication protein A; 93.4 0.24 1E-05 28.1 6.8 47 986-1032 5-56 (97) 18 3bz1_U Photosystem II 12 kDa e 93.3 0.066 2.8E-06 32.8 3.8 54 818-887 24-77 (104) 19 3dm3_A Replication factor A; p 93.3 0.17 7.3E-06 29.3 5.9 48 985-1032 7-59 (105) 20 3m4p_A Ehasnrs, asparaginyl-tR 92.9 0.32 1.3E-05 27.0 6.7 14 249-262 201-214 (456) 21 1e1o_A Lysyl-tRNA synthetase; 92.8 0.35 1.5E-05 26.7 6.8 29 537-565 252-281 (504) 22 3a74_A Lysyl-tRNA synthetase; 91.5 0.54 2.3E-05 25.1 6.5 12 537-548 243-254 (493) 23 2duy_A Competence protein come 91.2 0.1 4.2E-06 31.3 2.5 55 816-886 16-70 (75) 24 3bju_A Lysyl-tRNA synthetase; 91.0 0.77 3.2E-05 23.8 7.6 12 537-548 245-256 (521) 25 3kf6_A Protein STN1; OB fold, 90.9 0.63 2.7E-05 24.5 6.4 36 992-1032 47-83 (159) 26 3ebe_A Protein MCM10 homolog; 89.0 1.1 4.4E-05 22.6 6.4 41 992-1032 56-99 (200) 27 2zxd_A Alpha-L-fucosidase, put 87.6 1.2 5.1E-05 22.1 5.9 44 27-70 105-169 (455) 28 1o7i_A SSB, SSO2364, single st 87.1 1.3 5.6E-05 21.7 7.0 48 986-1033 5-57 (119) 29 1gm5_A RECG; helicase, replica 86.9 0.37 1.5E-05 26.5 2.9 30 787-817 614-643 (780) 30 3kf8_A Protein STN1; OB fold; 86.2 1.5 6.2E-05 21.4 6.2 43 989-1031 82-130 (220) 31 3gza_A Putative alpha-L-fucosi 84.3 1.8 7.5E-05 20.7 5.6 50 25-78 57-130 (443) 32 3bzc_A TEX; helix-turn-helix, 84.1 0.73 3.1E-05 24.0 3.3 73 800-885 481-554 (785) 33 2pi2_A Replication protein A 3 82.5 2.1 8.7E-05 20.2 6.1 32 527-558 74-107 (270) 34 3kdf_D Replication protein A 3 82.3 2.1 8.7E-05 20.1 6.3 37 992-1033 31-68 (132) 35 2k75_A Uncharacterized protein 82.2 2.1 8.8E-05 20.1 6.5 41 992-1032 13-56 (106) 36 3eyp_A Putative alpha-L-fucosi 80.2 2.4 0.0001 19.6 5.9 50 26-79 53-126 (469) 37 3dx5_A Uncharacterized protein 80.0 2.3 9.5E-05 19.8 4.6 52 18-69 6-64 (286) 38 2wvv_A Alpha-L-fucosidase; alp 78.9 2.6 0.00011 19.3 5.7 45 26-70 77-142 (450) 39 3mo4_A Alpha-1,3/4-fucosidase; 76.1 3.1 0.00013 18.7 6.3 45 27-71 64-132 (480) 40 1qzg_A Protection of telomeres 75.5 3.2 0.00013 18.6 6.9 43 990-1032 40-90 (187) 41 1hjs_A Beta-1,4-galactanase; 4 73.5 3.5 0.00015 18.2 4.7 65 10-74 10-81 (332) 42 2pgf_A Adenosine deaminase; me 72.2 1.5 6.4E-05 21.3 2.0 20 353-372 110-129 (371) 43 3l6a_A Eukaryotic translation 71.6 3 0.00013 18.8 3.4 119 305-449 5-125 (364) 44 2zkq_k 40S ribosomal protein S 71.4 1.8 7.4E-05 20.7 2.2 27 375-401 109-138 (151) 45 1ynx_A Replication factor-A pr 68.6 4.4 0.00018 17.4 5.3 52 980-1033 4-59 (114) 46 1wjj_A Hypothetical protein F2 68.3 4.4 0.00019 17.3 5.0 47 986-1032 19-80 (145) 47 1ug3_A EIF4GI, eukaryotic prot 67.9 4.3 0.00018 17.4 3.5 120 308-453 7-127 (339) 48 2kbn_A Conserved protein; nucl 66.4 4.8 0.0002 17.1 6.4 44 987-1032 9-53 (109) 49 1s1h_K RP59A, 40S ribosomal pr 66.3 2.1 8.8E-05 20.1 1.7 27 375-401 94-123 (136) 50 2ztd_A Holliday junction ATP-d 65.9 4.9 0.00021 17.0 6.1 62 839-908 125-188 (212) 51 1qwg_A PSL synthase;, (2R)-pho 65.2 5 0.00021 16.9 5.8 12 430-441 211-222 (251) 52 1k19_A Chemosensory protein CS 63.0 5.5 0.00023 16.6 3.6 40 776-816 70-109 (112) 53 2ekc_A AQ_1548, tryptophan syn 62.6 5.6 0.00023 16.5 9.9 41 27-68 2-46 (262) 54 3nav_A Tryptophan synthase alp 61.8 5.7 0.00024 16.4 4.6 46 24-70 2-51 (271) 55 2zkq_m 40S ribosomal protein S 59.9 4.1 0.00017 17.7 2.2 57 814-887 13-69 (152) 56 3ofo_B 30S ribosomal protein S 59.2 5.7 0.00024 16.4 2.8 67 401-469 16-83 (218) 57 3iar_A Adenosine deaminase; pu 59.0 3.5 0.00015 18.2 1.7 26 12-42 9-34 (367) 58 1e8g_A Vanillyl-alcohol oxidas 58.9 3.1 0.00013 18.6 1.5 21 360-380 83-104 (560) 59 1vjz_A Endoglucanase; TM1752, 58.8 6.3 0.00027 16.0 4.9 69 6-74 15-98 (341) 60 3bbn_B Ribosomal protein S2; s 58.6 5.9 0.00025 16.3 2.8 19 98-116 39-57 (231) 61 2vqe_K 30S ribosomal protein S 57.7 4.1 0.00017 17.6 1.9 27 375-401 88-117 (129) 62 3glk_A Acetyl-COA carboxylase 57.5 6.6 0.00028 15.9 4.3 99 13-121 6-121 (540) 63 1bif_A 6-phosphofructo-2-kinas 57.4 6.2 0.00026 16.1 2.8 31 374-407 227-257 (469) 64 3ou8_A Adenosine deaminase; st 57.2 2.8 0.00012 19.0 1.0 26 12-42 13-38 (326) 65 3f1z_A Putative nucleic acid-b 56.7 6.8 0.00029 15.8 5.9 39 988-1026 45-83 (133) 66 3ci0_K Pseudopilin GSPK; gener 56.4 5.3 0.00022 16.7 2.3 39 839-877 201-241 (298) 67 1rh9_A Endo-beta-mannanase; en 55.6 7.1 0.0003 15.6 5.8 22 197-218 87-108 (373) 68 1xrs_B D-lysine 5,6-aminomutas 55.3 7.1 0.0003 15.6 6.6 20 198-217 170-189 (262) 69 2q02_A Putative cytoplasmic pr 53.3 7.6 0.00032 15.4 5.4 49 26-75 18-72 (272) 70 3jrx_A Acetyl-COA carboxylase 53.1 7.7 0.00032 15.3 3.6 86 25-121 39-137 (587) 71 3bbn_K Ribosomal protein S11; 52.6 1.6 6.9E-05 21.0 -0.8 27 375-401 101-130 (140) 72 1tg7_A Beta-galactosidase; TIM 52.0 7.9 0.00033 15.2 4.3 50 29-78 38-100 (971) 73 2zu6_B Programmed cell death p 51.9 8 0.00034 15.2 4.1 79 342-427 183-262 (307) 74 1fob_A Beta-1,4-galactanase; B 51.0 8.2 0.00034 15.1 3.6 45 28-72 28-79 (334) 75 1jmc_A Protein (replication pr 50.4 8.4 0.00035 15.0 6.2 54 979-1032 125-185 (246) 76 1z00_A DNA excision repair pro 49.9 7.8 0.00033 15.3 2.3 51 835-902 17-67 (89) 77 2f6h_X Myosin-2, type V myosin 49.0 8.7 0.00037 14.9 3.0 86 677-762 146-245 (419) 78 3mmt_A Fructose-bisphosphate a 48.7 8.8 0.00037 14.8 4.5 25 517-541 55-79 (347) 79 1cuk_A RUVA protein; DNA repai 47.9 9.1 0.00038 14.7 4.3 16 839-854 110-125 (203) 80 3n4f_A Mandelate racemase/muco 47.4 9.2 0.00039 14.7 3.8 76 360-441 294-384 (392) 81 3d3a_A Beta-galactosidase; pro 46.6 9.4 0.0004 14.6 5.4 50 29-78 39-101 (612) 82 3kws_A Putative sugar isomeras 46.0 9.6 0.0004 14.5 4.7 45 26-70 37-81 (287) 83 1uuq_A Mannosyl-oligosaccharid 45.8 9.7 0.00041 14.5 5.7 24 196-219 112-135 (440) 84 3mqt_A Mandelate racemase/muco 45.6 9.7 0.00041 14.5 3.9 86 353-441 285-390 (394) 85 3no1_A Isomerase, mandelate ra 45.2 9.9 0.00041 14.4 2.3 31 360-390 303-333 (398) 86 1jb7_A Telomere-binding protei 44.9 10 0.00042 14.4 6.6 12 95-106 96-107 (495) 87 1x2i_A HEF helicase/nuclease; 44.5 8.5 0.00036 14.9 1.8 30 836-870 13-42 (75) 88 1qop_A Tryptophan synthase alp 44.5 10 0.00042 14.3 4.3 44 26-70 1-48 (268) 89 2a1j_B DNA excision repair pro 44.3 7.9 0.00033 15.2 1.6 30 835-869 30-59 (91) 90 2vpq_A Acetyl-COA carboxylase; 43.9 10 0.00043 14.3 4.3 27 43-71 3-29 (451) 91 2dh2_A 4F2 cell-surface antige 43.6 10 0.00044 14.2 5.8 49 23-71 29-100 (424) 92 3obe_A Sugar phosphate isomera 43.1 11 0.00044 14.2 4.2 58 12-70 21-93 (305) 93 2vqe_M 30S ribosomal protein S 43.1 3.9 0.00017 17.8 -0.1 48 831-887 11-58 (126) 94 3kwm_A Ribose-5-phosphate isom 43.0 8.4 0.00035 15.0 1.6 50 806-856 63-114 (224) 95 3ofo_K 30S ribosomal protein S 43.0 8.8 0.00037 14.8 1.7 27 375-401 78-107 (117) 96 3ipw_A Hydrolase TATD family p 42.3 11 0.00045 14.1 4.0 62 4-66 21-90 (325) 97 1dly_A Hemoglobin; oxygen stor 42.0 11 0.00046 14.0 6.2 70 694-763 88-161 (164) 98 2w70_A Biotin carboxylase; lig 41.4 11 0.00047 14.0 4.0 68 42-121 3-72 (449) 99 3n6r_A Propionyl-COA carboxyla 40.9 11 0.00047 13.9 2.8 15 199-213 16-30 (681) 100 1yz4_A DUSP15, dual specificit 40.7 11 0.00048 13.9 5.3 44 353-396 66-113 (160) 101 1bf2_A Isoamylase; hydrolase, 40.2 12 0.00048 13.8 3.5 14 361-374 276-289 (750) 102 3p52_A NH(3)-dependent NAD(+) 40.0 12 0.00049 13.8 5.9 58 141-212 26-84 (249) 103 3cuq_A Vacuolar-sorting protei 39.6 9.9 0.00042 14.4 1.5 41 849-889 155-196 (234) 104 2bcq_A DNA polymerase lambda; 39.4 12 0.0005 13.8 3.2 13 387-399 112-124 (335) 105 1o8b_A Ribose 5-phosphate isom 39.1 12 0.0005 13.7 2.3 51 806-857 57-109 (219) 106 2qf7_A Pyruvate carboxylase pr 39.1 12 0.0005 13.7 2.8 44 24-68 77-120 (1165) 107 1i60_A IOLI protein; beta barr 39.1 12 0.0005 13.7 13.4 49 26-75 13-67 (278) 108 1b22_A DNA repair protein RAD5 38.7 12 0.00051 13.7 2.0 10 844-853 32-41 (114) 109 3icg_A Endoglucanase D; cellul 38.6 12 0.00051 13.7 4.8 44 29-72 47-105 (515) 110 1kft_A UVRC, excinuclease ABC 38.5 8.2 0.00034 15.1 0.9 27 838-869 25-51 (78) 111 2bvf_A 6-hydroxy-D-nicotine ox 38.3 12 0.00051 13.6 2.0 20 360-380 51-70 (459) 112 1lk5_A D-ribose-5-phosphate is 37.8 12 0.00052 13.6 2.6 51 805-856 60-112 (229) 113 2f8m_A Ribose 5-phosphate isom 37.8 12 0.00052 13.6 2.4 53 804-857 66-120 (244) 114 2g0w_A LMO2234 protein; putati 37.6 13 0.00053 13.5 5.4 54 19-75 29-89 (296) 115 1c6v_A Protein (SIV integrase) 37.3 13 0.00053 13.5 2.0 28 196-223 48-76 (164) 116 3kp1_A D-ornithine aminomutase 37.1 13 0.00053 13.5 6.6 24 97-120 76-99 (763) 117 2a1j_A DNA repair endonuclease 36.9 13 0.00054 13.5 2.4 36 839-885 6-41 (63) 118 2zvr_A Uncharacterized protein 36.7 13 0.00054 13.4 3.4 15 200-214 73-87 (290) 119 1exn_A 5'-exonuclease, 5'-nucl 36.7 13 0.00054 13.4 2.8 27 842-873 208-234 (290) 120 2zds_A Putative DNA-binding pr 36.2 13 0.00055 13.4 4.9 53 19-71 7-69 (340) 121 3ngf_A AP endonuclease, family 36.1 13 0.00055 13.4 4.8 50 20-70 16-65 (269) 122 2uyg_A 3-dehydroquinate dehydr 35.9 8 0.00034 15.2 0.6 18 195-212 28-45 (149) 123 3i9v_2 NADH-quinone oxidoreduc 35.9 13 0.00055 13.3 4.1 61 421-482 2-70 (181) 124 1c0m_A Protein (integrase); HI 35.9 13 0.00055 13.3 4.0 15 199-213 82-96 (238) 125 2jg6_A DNA-3-methyladenine gly 35.6 13 0.00056 13.3 4.2 131 670-825 5-165 (186) 126 1wza_A Alpha-amylase A; hydrol 35.1 14 0.00057 13.2 4.6 43 5-47 1-55 (488) 127 1ulz_A Pyruvate carboxylase N- 35.0 14 0.00057 13.2 3.4 68 42-121 3-71 (451) 128 1tvk_A Tubulin alpha chain; ep 34.9 9.3 0.00039 14.6 0.8 46 334-393 115-160 (440) 129 2h9d_A Salicylate biosynthesis 34.8 14 0.00058 13.2 6.0 56 694-756 30-85 (101) 130 1qya_A ORFB, hypothetical prot 34.3 8.5 0.00036 14.9 0.5 29 366-394 223-251 (307) 131 3hbl_A Pyruvate carboxylase; T 34.2 14 0.00059 13.1 4.1 46 23-69 60-105 (1150) 132 2g8y_A Malate/L-lactate dehydr 34.2 14 0.00058 13.2 1.6 34 197-230 120-153 (385) 133 3edn_A Phenazine biosynthesis 34.0 12 0.00051 13.7 1.2 30 367-396 218-247 (299) 134 1v8d_A Hypothetical protein (T 34.0 11 0.00045 14.1 0.9 73 141-217 61-139 (235) 135 1ixr_A Holliday junction DNA h 33.6 14 0.0006 13.1 2.4 19 839-857 109-128 (191) 136 2x6h_A GH13170P, VPS34, phosph 33.6 13 0.00053 13.5 1.3 45 569-624 450-494 (696) 137 3bch_A 40S ribosomal protein S 33.5 14 0.0006 13.0 3.3 13 805-817 166-178 (253) 138 2ihm_A POL MU, DNA polymerase 33.2 14 0.00061 13.0 4.2 16 382-399 115-130 (360) 139 2vcc_A Alpha-N-acetylglucosami 33.1 14 0.00061 13.0 2.8 102 425-545 288-399 (891) 140 2ww4_A 4-diphosphocytidyl-2C-m 32.8 15 0.00061 13.0 1.7 11 428-438 255-265 (283) 141 1ecm_A Endo-oxabicyclic transi 32.6 15 0.00062 12.9 8.3 67 694-782 27-95 (109) 142 3nqb_A Adenine deaminase 2; PS 32.0 15 0.00063 12.9 1.5 32 9-44 89-120 (608) 143 2dpl_A GMP synthetase, GMP syn 31.9 15 0.00063 12.9 3.5 22 454-475 165-186 (308) 144 1ym5_A YHI9, hypothetical 32.6 31.8 11 0.00045 14.1 0.7 32 366-397 215-246 (300) 145 1f0x_A DLDH, D-lactate dehydro 31.6 13 0.00055 13.4 1.1 20 360-380 58-77 (571) 146 3l23_A Sugar phosphate isomera 31.5 15 0.00064 12.8 6.0 53 18-70 18-76 (303) 147 2vrq_A Alpha-L-arabinofuranosi 31.4 15 0.00064 12.8 4.8 45 30-74 54-129 (496) 148 1uqr_A 3-dehydroquinate dehydr 31.4 11 0.00045 14.1 0.6 18 195-212 30-47 (154) 149 1qus_A Lytic murein transglyco 31.4 15 0.00065 12.8 5.1 48 727-774 97-144 (322) 150 1chm_A Creatine amidinohydrola 31.3 15 0.00063 12.9 1.3 44 9-52 3-50 (401) 151 2fmp_A DNA polymerase beta; nu 31.2 15 0.00065 12.8 4.2 13 388-400 115-127 (335) 152 1xub_A Phenazine biosynthesis 30.6 16 0.00066 12.7 1.6 31 366-396 219-249 (298) 153 1v9n_A Malate dehydrogenase; r 30.4 13 0.00056 13.3 1.0 21 200-220 108-128 (360) 154 3ls8_A Phosphatidylinositol 3- 30.4 15 0.00065 12.8 1.3 44 570-624 377-420 (614) 155 3dr0_A Cytochrome C6; photosyn 30.4 16 0.00067 12.7 4.1 36 406-441 56-91 (93) 156 1bqc_A Protein (beta-mannanase 30.1 16 0.00067 12.6 6.1 43 31-73 36-85 (302) 157 2uuu_A Alkyldihydroxyacetoneph 30.0 13 0.00054 13.5 0.8 20 360-380 150-169 (584) 158 1qnr_A Endo-1,4-B-D-mannanase; 30.0 16 0.00068 12.6 5.3 21 196-216 91-111 (344) 159 3lwz_A 3-dehydroquinate dehydr 29.9 10 0.00043 14.3 0.3 16 196-211 37-52 (153) 160 2gjv_A Putative cytoplasmic pr 29.8 8.1 0.00034 15.1 -0.2 36 577-613 26-61 (175) 161 1nxu_A Hypothetical oxidoreduc 29.7 15 0.00065 12.8 1.2 25 198-222 95-119 (333) 162 3k6g_A Telomeric repeat-bindin 29.7 5.6 0.00023 16.5 -1.1 29 270-298 35-64 (111) 163 1xng_A NH(3)-dependent NAD(+) 29.5 16 0.00069 12.6 6.5 59 140-212 24-83 (268) 164 3cm3_A Late protein H1, dual s 29.2 17 0.00069 12.5 3.3 41 352-392 89-133 (176) 165 2c0h_A Mannan endo-1,4-beta-ma 29.2 17 0.0007 12.5 6.8 44 29-72 47-110 (353) 166 1o57_A PUR operon repressor; p 29.1 17 0.0007 12.5 4.0 70 402-473 86-160 (291) 167 1jae_A Alpha-amylase; glycosid 28.6 14 0.00058 13.2 0.8 17 55-71 77-93 (471) 168 2e9l_A Cytosolic beta-glucosid 28.6 17 0.00071 12.4 3.8 50 23-72 53-117 (469) 169 1v77_A PH1877P, hypothetical p 28.3 17 0.00072 12.4 3.6 41 180-220 123-172 (212) 170 2h56_A DNA-3-methyladenine gly 28.2 17 0.00072 12.4 6.4 98 736-855 59-156 (233) 171 3mmw_A Endoglucanase; TIM barr 28.1 17 0.00072 12.4 4.8 45 28-72 34-93 (317) 172 1uek_A 4-(cytidine 5'-diphosph 28.0 15 0.00064 12.8 0.9 23 361-385 214-236 (275) 173 3bbn_M Ribosomal protein S13; 27.9 2.7 0.00011 19.2 -2.9 40 808-855 41-80 (145) 174 3jug_A Beta-mannanase; TIM-bar 27.9 17 0.00073 12.3 6.0 17 197-213 89-105 (345) 175 1z2i_A Malate dehydrogenase; s 27.7 17 0.00073 12.3 2.1 18 200-217 106-123 (358) 176 2jep_A Xyloglucanase; family 5 27.6 17 0.00074 12.3 5.3 19 196-214 111-129 (395) 177 2vqe_B 30S ribosomal protein S 27.6 18 0.00074 12.3 2.0 21 98-118 40-60 (256) 178 3kts_A Glycerol uptake operon 27.5 18 0.00074 12.3 8.9 157 28-238 17-173 (192) 179 2ziu_A MUS81 protein; helix-ha 27.2 18 0.00075 12.3 2.5 29 839-872 239-267 (311) 180 3ce9_A Glycerol dehydrogenase; 27.0 18 0.00075 12.2 1.2 48 21-69 14-64 (354) 181 2whl_A Beta-mannanase, baman5; 26.6 18 0.00076 12.2 6.0 46 27-72 31-83 (294) 182 1xtz_A Ribose-5-phosphate isom 26.4 18 0.00077 12.2 2.3 49 806-855 83-133 (264) 183 1e4i_A Beta-glucosidase; glyco 26.4 18 0.00077 12.1 2.5 17 198-214 101-117 (447) 184 1d8j_A General transcription f 26.3 13 0.00053 13.5 0.2 36 648-686 6-42 (81) 185 3ofp_M 30S ribosomal protein S 26.3 3.8 0.00016 18.0 -2.5 26 830-855 9-34 (113) 186 1s7j_A Phenazine biosynthesis 26.1 17 0.00073 12.3 0.9 24 368-391 193-216 (262) 187 1now_A Beta-hexosaminidase bet 26.1 18 0.00078 12.1 1.7 24 360-383 222-250 (507) 188 1qam_A ERMC' methyltransferase 26.0 15 0.00063 12.9 0.6 41 372-412 35-77 (244) 189 1rd5_A Tryptophan synthase alp 25.9 19 0.00078 12.1 3.5 39 28-68 4-47 (262) 190 1ls9_A Cytochrome C6; omega lo 25.8 19 0.00078 12.1 3.2 34 408-441 56-89 (91) 191 2p5v_A Transcriptional regulat 25.8 19 0.00079 12.1 3.0 47 414-479 2-49 (162) 192 1rfm_A L-sulfolactate dehydrog 25.6 19 0.00079 12.0 1.7 21 200-220 97-117 (344) 193 3eef_A N-carbamoylsarcosine am 25.6 19 0.00079 12.0 3.4 27 198-224 32-58 (182) 194 1vi6_A 30S ribosomal protein S 25.5 19 0.00079 12.0 3.3 15 27-41 11-25 (208) 195 3geb_A EYES absent homolog 2; 25.5 15 0.00064 12.8 0.5 107 336-472 144-256 (274) 196 2gtv_X CM, chorismate mutase; 25.3 19 0.0008 12.0 8.0 49 722-784 55-103 (104) 197 1hx0_A Alpha amylase (PPA); in 25.0 16 0.00066 12.7 0.5 17 197-213 79-95 (496) 198 2bgw_A XPF endonuclease; hydro 24.8 19 0.00081 11.9 1.9 27 838-869 163-189 (219) 199 3idw_A Actin cytoskeleton-regu 24.7 19 0.00082 11.9 2.6 16 428-443 55-70 (72) 200 2af7_A Gamma-carboxymuconolact 24.6 20 0.00082 11.9 3.6 33 731-765 75-107 (125) 201 1nnx_A Protein YGIW; structura 24.5 20 0.00082 11.9 6.3 46 978-1031 23-68 (109) 202 1jeo_A MJ1247, hypothetical pr 24.5 20 0.00082 11.9 2.9 34 180-215 83-116 (180) 203 1wbh_A KHG/KDPG aldolase; lyas 24.3 20 0.00083 11.9 2.9 38 124-161 90-128 (214) 204 1z00_B DNA repair endonuclease 24.2 20 0.00083 11.8 2.6 26 839-869 20-45 (84) 205 2osx_A Endoglycoceramidase II; 24.2 20 0.00083 11.8 5.4 20 355-374 102-121 (481) 206 3kip_A 3-dehydroquinase, type 24.2 13 0.00056 13.3 0.0 29 205-234 80-108 (167) 207 2bkm_A Truncated hemoglobin fr 24.0 20 0.00084 11.8 4.3 43 724-766 84-126 (128) 208 2qde_A Mandelate racemase/muco 23.9 20 0.00084 11.8 5.1 18 360-377 280-297 (397) 209 1jms_A Terminal deoxynucleotid 23.8 20 0.00085 11.8 4.1 28 366-400 123-150 (381) 210 1g94_A Alpha-amylase; beta-alp 23.7 19 0.00078 12.1 0.7 46 30-75 18-87 (448) 211 1kko_A 3-methylaspartate ammon 23.7 20 0.00085 11.8 4.2 23 573-595 332-354 (413) 212 3e2v_A 3'-5'-exonuclease; stru 23.6 20 0.00085 11.8 4.8 65 4-69 7-80 (401) 213 1i7w_B E-cadherin, epithelial- 23.5 13 0.00053 13.5 -0.2 13 376-388 105-117 (151) 214 1wr1_B Ubiquitin-like protein 23.5 20 0.00086 11.7 1.7 22 337-358 13-34 (58) 215 1o5k_A DHDPS, dihydrodipicolin 23.3 21 0.00087 11.7 9.7 88 141-237 47-136 (306) 216 2wxf_A Phosphatidylinositol-4, 23.2 20 0.00082 11.9 0.7 44 570-624 681-724 (940) 217 1dc1_A Bsobi restriction endon 23.1 21 0.00087 11.7 3.5 36 602-643 109-145 (323) 218 3ouz_A Biotin carboxylase; str 23.1 21 0.00087 11.7 3.7 32 39-72 4-35 (446) 219 1e7u_A Phosphatidylinositol 3- 23.0 21 0.00087 11.7 1.2 45 569-624 697-741 (961) 220 1vim_A Hypothetical protein AF 22.9 19 0.00078 12.1 0.6 18 199-216 107-124 (200) 221 1m0s_A Ribose-5-phosphate isom 22.9 21 0.00088 11.7 4.5 52 805-857 56-109 (219) 222 1wvf_A 4-cresol dehydrogenase 22.9 19 0.0008 12.0 0.6 21 360-380 69-90 (520) 223 3ooo_A Proline dipeptidase; st 22.8 21 0.00088 11.6 1.8 30 26-55 2-31 (132) 224 2h31_A Multifunctional protein 22.7 21 0.00089 11.6 2.0 26 343-372 160-185 (425) 225 2c7f_A Alpha-L-arabinofuranosi 22.6 21 0.00089 11.6 3.9 45 30-74 62-136 (513) 226 1ux8_A YJBI protein; oxygen st 22.4 21 0.0009 11.6 4.0 73 693-765 48-126 (132) 227 3fmt_A Protein SEQA; protein-D 22.3 21 0.0009 11.6 3.5 21 455-475 138-158 (162) 228 2abk_A Endonuclease III; DNA-r 22.2 22 0.00091 11.5 3.4 17 375-392 116-132 (211) 229 1ug6_A Beta-glycosidase; gluco 22.1 22 0.00091 11.5 2.9 17 198-214 100-116 (431) 230 1cxq_A Avian sarcoma virus int 21.9 22 0.00091 11.5 3.2 19 197-215 81-99 (162) 231 1yqd_A Sinapyl alcohol dehydro 21.8 22 0.00092 11.5 4.1 46 578-623 309-355 (366) 232 1uuf_A YAHK, zinc-type alcohol 21.7 22 0.00092 11.5 4.1 40 580-619 318-358 (369) 233 3k8k_A Alpha-amylase, SUSG; al 21.6 22 0.00092 11.5 4.0 13 376-388 173-185 (669) 234 3ahy_A Beta-glucosidase; cellu 21.6 22 0.00092 11.5 5.3 48 24-72 59-123 (473) 235 1yub_A Ermam, rRNA methyltrans 21.6 22 0.00093 11.5 1.1 54 360-413 18-77 (245) 236 3g8r_A Probable spore coat pol 21.6 22 0.00093 11.5 2.7 43 381-435 208-250 (350) 237 3ces_A MNMG, tRNA uridine 5-ca 21.5 16 0.00067 12.6 0.0 11 611-621 387-397 (651) 238 1sg6_A Pentafunctional AROM po 21.4 22 0.00094 11.4 4.5 19 803-821 314-332 (393) 239 1kg2_A A/G-specific adenine gl 21.4 22 0.00094 11.4 3.0 24 839-862 111-134 (225) 240 1ccw_A Protein (glutamate muta 21.2 22 0.00094 11.4 5.3 15 200-214 46-60 (137) 241 1k77_A EC1530, hypothetical pr 21.2 22 0.00094 11.4 3.8 51 21-72 9-59 (260) 242 1fad_A Protein (FADD protein); 21.1 22 0.00094 11.4 4.0 37 426-478 12-48 (99) 243 1dd3_A 50S ribosomal protein L 20.8 23 0.00096 11.3 2.9 46 694-758 72-123 (128) 244 2dzd_A Pyruvate carboxylase; b 20.8 23 0.00096 11.3 4.2 31 40-72 5-35 (461) 245 3ccf_A Cyclopropane-fatty-acyl 20.8 14 0.00058 13.2 -0.5 18 143-160 59-76 (279) 246 2jy5_A Ubiquilin-1; UBA, alter 20.7 22 0.00091 11.5 0.5 24 339-362 10-33 (52) 247 2aam_A Hypothetical protein TM 20.5 23 0.00097 11.3 3.1 20 361-380 170-191 (309) 248 2knz_A Ubiquilin-4; cytoplasm, 20.5 23 0.00097 11.3 1.7 22 339-360 9-30 (53) 249 1wxp_A THO complex subunit 1; 20.4 23 0.00097 11.3 5.1 49 412-478 4-52 (110) 250 1qtw_A Endonuclease IV; DNA re 20.4 23 0.00098 11.3 4.5 51 14-69 4-63 (285) 251 3cb2_A Gamma-1-tubulin, tubuli 20.3 17 0.0007 12.5 -0.1 55 334-404 115-169 (475) 252 1wyz_A Putative S-adenosylmeth 20.1 23 0.00099 11.2 4.1 103 347-466 88-190 (242) 253 2kng_A Protein LSR2; DNA-bindi 20.1 23 0.00099 11.2 1.8 21 362-383 17-37 (55) 254 1vkc_A Putative acetyl transfe 20.1 23 0.00099 11.2 3.8 16 196-211 112-127 (158) 255 1tvk_B Tubulin beta chain; epo 20.0 14 0.00057 13.2 -0.6 47 333-393 112-158 (427) No 1 >2hpi_A DNA polymerase III alpha subunit; POL-beta-like nucleotidyltransferase fold; HET: DNA; 3.00A {Thermus aquaticus} PDB: 2hpm_A* 3e0d_A* Probab=100.00 E-value=0 Score=2408.03 Aligned_cols=1014 Identities=42% Similarity=0.681 Sum_probs=955.6 Q ss_pred CCCCCCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCC--CEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCC Q ss_conf 6689881334331444764432998999999997899--88998116517749999999997699638999999854766 Q 537021.9.peg.7 5 LEKSPSFVHLQVHSSYSLLEGALSLNNILDKIAADQQ--PAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDN 82 (1033) Q Consensus 5 ~~~~~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~--~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~ 82 (1033) |.+..+||||||||+||+|||+++|++||++|+++|| |||||||||||||+++|+++|+++|||||+|||+|+.++.. T Consensus 1 m~~~m~FvhLhvhS~YSlLdg~~~i~~Lv~~Ake~G~~~pAvAiTDh~nl~Gav~F~~~akk~gIKPIiG~E~~v~~~~~ 80 (1220) T 2hpi_A 1 MGSKLKFAHLHQHTQFSLLDGAAKLQDLLKWVKETTPEDPALAMTDHGNLFGAVEFYKKATAMGVKPIIGYEAYVAAESR 80 (1220) T ss_dssp ----CCCCCCSBCCTTSTTTCCCCHHHHHHHHHHHCCSSCEEEEEEETCCTTHHHHHHHHHTTTCEEEEEEEEEEESSCT T ss_pred CCCCCCCHHHCCCCCCCHHCCCCCHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCC T ss_conf 99988524114477471314046999999999976988988999608888779999999997699848878999658641 Q ss_pred CCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCC Q ss_conf 43344565666677539999569689999999998997324877874007899986348988998197679678888779 Q 537021.9.peg.7 83 LKMTNQGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFN 162 (1033) Q Consensus 83 ~~~~~~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~ 162 (1033) .++.. +.......+|++|||||++||+|||+|+|.+|++ +++++||+++++|.+ |++||||+|||..|+++++++.+ T Consensus 81 ~~~~~-~~~~~~~~~hlvLLAkN~~GY~NL~kL~S~a~~~-g~~~~Pri~~~~L~~-~~egLI~lsgc~~g~i~~~l~~~ 157 (1220) T 2hpi_A 81 FDRKR-GKGLDGGYFHLTLLAKDFTGYQNLVRLASRAYLE-GFYEKPRIDREILRE-HAQGLIALSGCLGAEIPQFILQD 157 (1220) T ss_dssp TCCCC-C-----CEEEEEEEECSHHHHHHHHHHHHHHHHT-CBSSSBEECHHHHHH-TCTTEEEECCSTTBHHHHHHHTT T ss_pred CCCCC-CCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHH-CCCCCCCCCHHHHHH-CCCCCEEECCCCCCHHHHHHHCC T ss_conf 23334-5667787201899978999999999999999970-787888778999985-07991996487436688998679 Q ss_pred CHHHHHHHHHHHHHHCCCCEEEEEECCCHHHH-HHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCC Q ss_conf 97899999999987659868999943991158-99999999999818989980883007878899998766884698576 Q 537021.9.peg.7 163 SSQVAEKRLLTFKKLFGDRLYVNLQRHRGYDR-YRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVS 241 (1033) Q Consensus 163 ~~~~a~~~l~~l~~~Fgd~~ylEl~~~~~~e~-~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~ 241 (1033) +.+.|++.+.+|+++||+|||+|||+|+..++ .++..++++|+++++|+||||||||++++|+.+|++|+||++|++++ T Consensus 158 ~~~~a~~~l~~~~~iFgd~~YlEiq~~~~~~~~~~n~~li~lA~~~~iplVATndvhYl~~eD~~~hdvL~~I~~g~~l~ 237 (1220) T 2hpi_A 158 RLDLAEARLNEDLSIFGDRFFIEIQNHGLPEQKKVNQVLKEFARKYGLGMVATNDGHYVRKEDARAHEVLLAIQSKTTLD 237 (1220) T ss_dssp CHHHHHHHHHHHHHHHGGGEEEEECCSSCHHHHHHHHHHHHHHHHTTCCEEECCCBSBSSGGGHHHHHHHHHHHTTCCSS T ss_pred CHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCC T ss_conf 98999999999998648968999951898689999999999999849999997987407761699999999985799326 Q ss_pred CCCCCCC-CCCCCCCCHHHHHHHCCHHH---HHHHHHHHHHHHHHHHHHHH---HCCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 5310024-53432343478875122035---89987886411100013322---10234555756887768988999876 Q 537021.9.peg.7 242 QKDRPRV-TPDHYLKNRSEMVSIFSDLP---EALENTVEIARRCSFILQVH---APILPRFIEKECDDIQKEEENELRNK 314 (1033) Q Consensus 242 ~~~~~~~-~~~~yLks~eEM~~~f~~~p---eAi~NT~~IAe~c~~~l~~~---~~~lP~f~~p~~~~~~~~~~~~Lr~l 314 (1033) ++++.++ +.++||||++||.++|+++| +|++||.+||++|+++++.. .++||+||+|++ .++++||+++ T Consensus 238 d~~r~~~~~~e~ylKS~eEM~~lf~~~p~~~eal~NT~~IA~~c~~~l~~~~~~~~~lP~f~~p~g----~t~~~yLr~l 313 (1220) T 2hpi_A 238 DPERWRFPCDEFYVKTPEEMRAMLPEAEWGDEPFDNTVEIARMCDVDLPIGDKMVYRIPRFPLPEG----RTEAQYLREL 313 (1220) T ss_dssp CSSCCCCSSSCCBCCCHHHHHHHSCHHHHCSHHHHHHHHHHHTCCCCCCCTTSCCCBCCCCCC--C----CCHHHHHHHH T ss_pred CCCCCCCCCHHHCCCCHHHHHHHCCCCCHHHHHHHHHHHHHHHHHEEECCCCCCCCCCCCCCCCCC----CCHHHHHHHH T ss_conf 463334786452379899997643345014678899999987531231136544455898889999----8989999999 Q ss_pred HHHHHHHCCCCCCC-----------------------------------------------CCCCCHHHHHHHHHHHHHH Q ss_conf 54103420124643-----------------------------------------------2457838999998877898 Q 537021.9.peg.7 315 AVAGLEVRLAQGAI-----------------------------------------------AKGYHAQDYRQRLDFELNV 347 (1033) Q Consensus 315 ~~~Gl~~R~~~~~~-----------------------------------------------~~~~~~~~y~~RLe~EL~v 347 (1033) |++||++||++... .++...++|.+||+|||+| T Consensus 314 ~~~Gl~~Ry~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~Rl~~EL~i 393 (1220) T 2hpi_A 314 TFLGLLRRYPDRITEAFYREVLRLLGTMPPHGDERALAEALARVEEKAWEELRKRLPPLEGVREWTAEAILHRALYELSV 393 (1220) T ss_dssp HHHHHHHHCTTTSCHHHHHHHHTTC-------CHHHHHHHHHHCCHHHHTTTTTC--------CCCHHHHHHHHHHHHHH T ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 99689986476651667777766423566542135567788888777788876430332243321048899999999999 Q ss_pred HHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHHHH Q ss_conf 98606305578889999985104764334444204667765544037563332310567621001457875333656878 Q 537021.9.peg.7 348 IARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFCQDRR 427 (1033) Q Consensus 348 I~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~~R 427 (1033) |.+|||++|||||||+|+|||++||+||||||||||||||||||||+||||+|||||||||||+|.+||||||||++++| T Consensus 394 I~~~gf~~YFLiv~d~v~~ak~~~i~vGpGRGSaagSLVaY~LgIT~vDPi~~~LlFERFLnpeR~~~PDIDiDf~~~~R 473 (1220) T 2hpi_A 394 IERMGFPGYFLIVQDYINWARGHGVSVGPGRGSAAGSLVAYAVGITNIDPLRFGLLFERFLNPERVSMPDIDTDFSDRER 473 (1220) T ss_dssp HHHHTCHHHHHHHHHHHHHHHTTTCCBCSCCGGGGGCHHHHHTTSCCSCTTTTTCCHHHHSCTTSCCCCCCEEEEETTTH T ss_pred HHHCCCCCEEEEHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCHHHH T ss_conf 98179983687613788998728916689999877882340257877684335866788556013666674557774658 Q ss_pred HHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCH-HHHHCCHHHHHHHHCCC Q ss_conf 99999998303802155422077877889999999752799889886543101458982443-44522988788643074 Q 537021.9.peg.7 428 DEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDPAHPVSL-QAIADDSRFREARLADP 506 (1033) Q Consensus 428 ~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 506 (1033) ++||+||.+|||.++||||+||+|+++|+|+|||||++|+|+.++|+++|+||...+...++ ++++..++|++....++ T Consensus 474 ~~vi~Yv~~kyG~~~Va~I~Tf~t~~~k~airdv~r~~g~~~~~~~~i~k~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~ 553 (1220) T 2hpi_A 474 DRVIQYVRERYGEDKVAQIGTFGSLASKAALKDVARVYGIPHKKAEELAKLIPVQFGKPKPLQEAIQVVPELRAEMEKDE 553 (1220) T ss_dssp HHHHHHHHHHHCTTTEEEEEEEEECCHHHHHHHHHHHTTCCHHHHHHHHTTSCBSSSSBCCTTTC-----CCTTHHHHCH T ss_pred HHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHCHHHHHHHHCCH T ss_conf 99999999871753112200000067889999999983733888888764078666775315667653789997764399 Q ss_pred CCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEECCCCCCCCCEEEEEECHHHHHHCCCHHHHHHHCCHHHHHHHH Q ss_conf 20268898720112210024474403352566666200123676886688510213343112321356541789999999 Q 537021.9.peg.7 507 SVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLPVTQFNMKWIEKAGLIKFDFLGLKTLTLLQKSL 586 (1033) Q Consensus 507 ~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~~ 586 (1033) .+++++++|.+|||++||+|+||||||||++||++++|++++. ++.++|||||+++|++||+|||||||+|||+|++++ T Consensus 554 ~~~~l~~~A~~leG~~r~~g~Ha~Gvvi~~~~l~~~~P~~~~~-~~~~~tq~d~~~~e~~gLlK~D~LGl~~Ltii~~~~ 632 (1220) T 2hpi_A 554 RIRQVIEVAMRLEGLNRHASVHAAGVVIAAEPLTDLVPLMRDQ-EGRPVTQYDMGAVEALGLLKMDFLGLRTLTFLDEAR 632 (1220) T ss_dssp HHHHHHHHHHHSTTCEEEEEEEEEEEEECSSCGGGTSCEEECT-TCCEEESSCHHHHHTTTBEEEEEEEEHHHHHHHHHH T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCEEEEECCCCCCEEEEEECC-CCCCEEEECHHHHHHCCCHHHHHHCCCCHHHHHHHH T ss_conf 9999999999824976546767887998568865434047868-998247620002754687454665146478999999 Q ss_pred HHHHH-HCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 98874-02543433234435889987620141012211453478888771740488887665411411342003688850 Q 537021.9.peg.7 587 DFLAQ-RGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLESSGMRQALEGMQPDCIEDIIALVSLYRPGPIDNIVVYNNRK 665 (1033) Q Consensus 587 ~~i~~-~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l~~~~p~~~~dl~~~~al~RPGp~~~~~~yi~rk 665 (1033) ++|++ +|.++|+++||+||++||+||++|+|.||||+||+||++++++++|++|+||++++||||||||++++.|+.|| T Consensus 633 ~~i~~~~~~~~~~~~ip~dD~~~~~l~~~g~t~GifQ~es~g~~~~l~~~~p~~~~dl~a~~al~RPGp~~~i~~~~~~k 712 (1220) T 2hpi_A 633 RIVKESKGVELDYDRLPLDDPKTFELLSRGETKGVFQLESGGMTATVRGLKPRRLEDIIALVSLYRPGPMEHIPTYIRRH 712 (1220) T ss_dssp HHHHHHHCCCCCGGGCCTTCHHHHHHHHTTCCTTCTTCCSHHHHHHHHHHCCCSHHHHHHHHHHCSTTGGGHHHHHHHHH T ss_pred HHHHHHHCCCCCHHCCCCCCHHHHHHHHCCCCCCCCCCCCCCHHHHHHCCCCCCHHHHHHHHHHCCCCCCCCCHHHHHHH T ss_conf 99987038976631189876999999847521786755464066687518998788999998651887742024788885 Q ss_pred CCCCCCCC-----CCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 46578502-----6966733410335530012568988788632684130788976201211016778789999986249 Q 537021.9.peg.7 666 NGKEKIVS-----IHPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEADVLRRAMGKKIKEEMDKQKERFISGASKNG 740 (1033) Q Consensus 666 ~g~e~~~~-----~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ead~lRra~~kk~~~~~~~~~~~f~~g~~~~g 740 (1033) ||+++++| +||.|++||++||||||||||||+|++.+||||++|||.||||||||+.++|++++++|++||.++| T Consensus 713 ~g~~~~~y~~~~~~~~~l~~il~~TyGvivyQEqvM~i~~~~agfs~~eaD~lRram~KK~~~~m~~~~~~f~~g~~~~g 792 (1220) T 2hpi_A 713 HGQEPVSYAEFPHAEKYLRPILDETYGIPVYQEQIMQIASQVAGYSLGEADLLRRAMGKKRVEEMQKHRERFVRGAKERG 792 (1220) T ss_dssp TTCSCCCCTTSTTTHHHHHHHHGGGTTCCCBHHHHHHHHHHHSCCCHHHHHHHHHHHHHTCTTTHHHHHHHHHHHHHHHS T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCC T ss_conf 48777655567753588998874226830129999989999738986778899872066878999999877505651248 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCCCC Q ss_conf 99999999999999750001331014788899999999997302666777510023542025899988886368503552 Q 537021.9.peg.7 741 ISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVEKIKKFCQDARQFNIQIMPPS 820 (1033) Q Consensus 741 ~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~~~~~~i~e~~~~gi~vl~Pd 820 (1033) ++++.|++||++|++||+||||||||+|||++||||||||+|||+|||||+||++++++++++.|++|||++||+|+||| T Consensus 793 ~~~~~a~~i~~~i~~fa~YgFnKSHa~aYa~iayqtAylK~~yP~eF~aa~l~~~~~~~~~~~~~i~e~~~~gi~v~~Pd 872 (1220) T 2hpi_A 793 VPEEEANRLFDMLEAFANYGFNKSHAAAYSLLSYQTAYVKAHYPVEFMAALLSVERHDSDKVAEYIRDARALGIPVLPPD 872 (1220) T ss_dssp CCSHHHHHHHHHHHHHTTSCBCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHTTTCHHHHHHHHHHHHTTTCCEECCC T ss_pred CCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCCCE T ss_conf 77999999999999987416684069999999999999988499999999812788984478999999987324656982 Q ss_pred CCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHHHH-HCCCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCHH Q ss_conf 3342100353198698502685896989999999998-379999978999734824799899998886767234790389 Q 537021.9.peg.7 821 VNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIMEAS-ADKPFDSLEDFCSRVDSKKLNRRVLESLVFAGALDCFGYSRM 899 (1033) Q Consensus 821 IN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r-~~g~f~sl~df~~rv~~~~~nk~~le~Li~aGAfD~~~~~R~ 899 (1033) ||+|.++|++++++||+||++|||||+++++.|+++| ++|||+|++||+.|++.+.+||+++|+||+|||||+|| ||+ T Consensus 873 iN~S~~~f~~~~~~i~~gL~aIKgvg~~~~~~Ii~eR~~~G~f~sl~Df~~Rvd~~~~nk~~lE~LIkaGAFD~~g-~R~ 951 (1220) T 2hpi_A 873 VNRSGFDFKVVGEEILFGLSAVKNVGEMAARAILEERERGGPFKSLGDFLKRLPEQVVNKRALESLVKAGALDAFG-DRA 951 (1220) T ss_dssp TTTCCSSCEEETTEEECCSTTSSSCCHHHHHHHHHHHHHSCSCCSHHHHHHHSCTTTSCHHHHHHHHHHTTTGGGS-CHH T ss_pred ECCCCCCCEECCCCCCCCHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCC-CHH T ss_conf 0124564442486345777761585067799999999856998747789988520469999999999838755534-499 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCEEECCCCHHHHHHHHHHC- Q ss_conf 9999899999999988654312777866665555655776668999988899985788726211638558999999874- Q 537021.9.peg.7 900 QLLQSLDNIQKYAQWVEKNRTNKHENIFAHEKGTFSDKITLEKFSVENSSVRFENEQRVLGFYFSGHPLDVYKPILRKM- 978 (1033) Q Consensus 900 ~ll~~i~~~~~~~~~~~~~~~~~q~~LF~~~~~~~~~~~~~~~~~~~~~~e~l~~E~e~LGf~lS~HPl~~~~~~l~~~- 978 (1033) +||++++.+++++....+++..+|.+||+... .+.+++.++|+..|++++|+|+||||+|+|||+.|+. +... T Consensus 952 ~ll~~l~~~l~~~~~~~~~~~~~q~~LF~~~~-----~~~~~~~~~~~~~e~L~~Eke~LGfylS~HPL~~y~~-l~~~~ 1025 (1220) T 2hpi_A 952 RLLASLEPLLRWAAETRERGRSGLVGLFAEVE-----EPPLVEASPLDEITMLRYEKEALGIYVSGHPVLRYPG-LREVA 1025 (1220) T ss_dssp HHHHHHHHHHHHHHHHHHHHTTCCCSSSSCCC-----CCCCCCCCCCCHHHHHHHHHHHHSSCCSCCGGGGCTH-HHHHC T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC-----CCCCCCCCCCCHHHHHHHHHHHEEEECCCCCCCHHHH-HHHHH T ss_conf 99981999999999986521036878667777-----7667789998999999999973334426683100898-99773 Q ss_pred --CCCCHHHH-HCCCCCCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEECC Q ss_conf --89768898-50558988999999961121015888769999998497049999539 Q 537021.9.peg.7 979 --GIKGYEES-VATIRGNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 979 --~~~~~~~~-~~~~~g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iFP 1033 (1033) .+..+.+. .+...+++|+++|+|+++|+++||+|++|+|+|+||++|.+|++||| T Consensus 1026 ~~~~~~l~~~~~~~~~~~~v~vaG~I~~v~~~~TKkG~~maf~tleD~tG~iev~vF~ 1083 (1220) T 2hpi_A 1026 SCTIEELSEFVRELPGKPKVLLSGMVEEVVRKPTRSGGMMARFTLSDETGALEVVVFG 1083 (1220) T ss_dssp SSCTTTHHHHHSSSSSSCEEEEEEEECCC------------CEEEEETTEEEEEC--- T ss_pred CCCHHHHHHHHHHCCCCCEEEEEEEEEEEEECCCCCCCEEEEEEEEECCCCEEEEECH T ss_conf 4887889999972789988999999978897327999969999999899988999837 No 2 >2hnh_A DNA polymerase III alpha subunit; DNA replication, nucleotidyltransferase, POL beta, PHP; HET: DNA; 2.30A {Escherichia coli} PDB: 2hqa_A* Probab=100.00 E-value=0 Score=2098.63 Aligned_cols=887 Identities=45% Similarity=0.784 Sum_probs=842.6 Q ss_pred CCCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCC Q ss_conf 98813343314447644329989999999978998899811651774999999999769963899999985476643344 Q 537021.9.peg.7 8 SPSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTN 87 (1033) Q Consensus 8 ~~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~ 87 (1033) .|+||||||||+||+|+|+++|++||++|+++||+||||||||||+|+++|+++|+++|||||+|||+++..+.. T Consensus 3 ~p~FvhLh~hS~yS~l~g~~~~eelv~~A~~~g~~alAiTD~~nl~G~v~F~~a~~k~gIKpIiG~e~~v~~~~~----- 77 (910) T 2hnh_A 3 EPRFVHLRVHSDYSMIDGLAKTAPLVKKAAALGMPALAITDFTNLCGLVKFYGAGHGAGIKPIVGADFNVQCDLL----- 77 (910) T ss_dssp CCCCCCCCBCCGGGSSSCCSCHHHHHHHHHHTTCSEEEECCBTCCTTHHHHHHHHHHHTCEEEEEEEEEEECTTT----- T ss_pred CCCCHHHCCCCCCCHHCCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCC----- T ss_conf 899202207775803131369999999999859999999528859789999999997699827788999833877----- Q ss_pred CCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHH Q ss_conf 56566667753999956968999999999899732487787400789998634898899819767967888877997899 Q 537021.9.peg.7 88 QGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVA 167 (1033) Q Consensus 88 ~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a 167 (1033) .+..+|++|||||++||+|||+|+|.+++++.++..|++++++|.+ +++||||+|||..|++.+++..++.+.+ T Consensus 78 -----~~~~~~l~lLAKN~~GykNL~kLiS~a~~~~~~~~~p~i~~~~L~~-~~egLIvlsg~~~g~i~~~l~~~~~~~a 151 (910) T 2hnh_A 78 -----GDELTHLTVLAANNTGYQNLTLLISKAYQRGYGAAGPIIDRDWLIE-LNEGLILLSGGRMGDVGRSLLRGNSALV 151 (910) T ss_dssp -----TTCCEEEEEEECSHHHHHHHHHHHHHHHHHCCCTTCCCEETTHHHH-SCTTEEEECCGGGSHHHHHHHHTCHHHH T ss_pred -----CCCCCCEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHH-CCCCEEEEECCCCCHHHHHHHCCCHHHH T ss_conf -----7870688999799999999999999998704578888688999986-7999799858855789999876999999 Q ss_pred HHHHHHHHHHCCCCEEEEEECCCHH-HHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 9999999876598689999439911-589999999999981898998088300787889999876688469857653100 Q 537021.9.peg.7 168 EKRLLTFKKLFGDRLYVNLQRHRGY-DRYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTVVSQKDRP 246 (1033) Q Consensus 168 ~~~l~~l~~~Fgd~~ylEl~~~~~~-e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~l~~~~~~ 246 (1033) ++++.+|+++||++||+|+|+|+.. ++..+..++++|+++++|+||||||||++++|+++|++|.||+.++++++.++. T Consensus 152 ~~~i~~l~~iF~~~~yiEl~~~g~~~~~~~n~~li~la~~~~iplVATndv~Y~~~eD~~~~dvl~~I~~~~~l~d~~~~ 231 (910) T 2hnh_A 152 DECVAFYEEHFPDRYFLELIRTGRPDEESYLHAAVELAEARGLPVVATNDVRFIDSSDFDAHEIRVAIHDGFTLDDPKRP 231 (910) T ss_dssp HHHHHHHHHHCTTSEEEEECCSSCTTHHHHHHHHHHHHHHHTCCEEECCCBCBSSGGGHHHHHHHHHHHHTCCTTCTTSC T ss_pred HHHHHHHHHHCCCCEEEEHHCCCCHHHHHHHHHHHHHHHHCCCCCHHCCCEEEEEHHHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 99999999867850896112179855999999999999980996110044124305028899999997329745032221 Q ss_pred C-CCCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 2-453432343478875122035899878864111000133221023455575688776898899987654103420124 Q 537021.9.peg.7 247 R-VTPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRNKAVAGLEVRLAQ 325 (1033) Q Consensus 247 ~-~~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~l~~~Gl~~R~~~ 325 (1033) + .++++||||++||.++|+++|+|++||..||++|++.++...+.+|+|++|++ +++++|++++.+|+.+||+. T Consensus 232 ~~~~~~~ylks~eEM~~lF~d~peal~Nt~~Ia~~c~~~~~~~~~~lP~~~~~~~-----~~~~~l~~l~~~Gl~~r~~~ 306 (910) T 2hnh_A 232 RNYSPQQYMRSEEEMCELFADIPEALANTVEIAKRCNVTVRLGEYFLPQFPTGDM-----STEDYLVKRAKEGLEERLAF 306 (910) T ss_dssp CCCCTTCBCCCHHHHHHHTTTCHHHHHHHHHHHHHCCCCCCCSSCCCCCCCCTTS-----CHHHHHHHHHHHHHHHHHHH T ss_pred CCCCHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC-----CCHHHHHHHHHHHHHHHHHH T ss_conf 2378643489899999997451999997999988520244305566898876678-----84899999999889998641 Q ss_pred CCCC---CCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCC Q ss_conf 6432---4578389999988778989860630557888999998510476433444420466776554403756333231 Q 537021.9.peg.7 326 GAIA---KGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSL 402 (1033) Q Consensus 326 ~~~~---~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~L 402 (1033) .... .....++|++||++||+||.+|||++|||||||+|+|||++||+||||||||||||||||||||+||||+||| T Consensus 307 ~~~~~~~~~~~~~~y~~RL~~EL~iI~~~gf~~YFLiv~diV~~Ak~~gi~vGpgRGSaagSLVaylLgIT~VDPi~~~L 386 (910) T 2hnh_A 307 LFPDEEERLKRRPEYDERLETELQVINQMGFPGYFLIVMEFIQWSKDNGVPVGPGRGSGAGSLVAYALKITDLDPLEFDL 386 (910) T ss_dssp HCCSHHHHHHHSHHHHHHHHHHHHHHHHTTCHHHHHHHHHHHHHHHHTTCCBCSCCTTGGGCHHHHHTTSSCSCTTTTTC T ss_pred CCCCHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCEEECCCCCCHHHHHHHHHHCCCCCCCHHCCC T ss_conf 26411220103499999999999999868980999999999999997894524688814889999997176779020388 Q ss_pred HHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCC Q ss_conf 05676210014578753336568789999999830380215542207787788999999975279988988654310145 Q 537021.9.peg.7 403 LFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPND 482 (1033) Q Consensus 403 lFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~ 482 (1033) +|||||||+|++|||||+||++++|++|++|+++|||.++||+|+||+|+++|+|+|||||++|+|+.++|+++|+||.. T Consensus 387 lFERFLn~~R~~~PDID~Df~~~~r~~v~~y~~~~yG~~~va~i~Tf~t~~~ksA~rdv~r~~g~~~~~~~~i~~~i~~~ 466 (910) T 2hnh_A 387 LFERFLNPERVSMPDFDVDFCMEKRDQVIEHVADMYGRDAVSQIITFGTMAAKAVIRDVGRVLGHPYGFVDRISKLIPPD 466 (910) T ss_dssp CGGGTSCTTSCCCCCCCCEECGGGHHHHHHHHHHHHCGGGEEEECCEEECCHHHHHHHHHHHTTCCHHHHHHHHTTSCCC T ss_pred CHHHHCCCCCCCCCCCCCCCCHHCHHHHHHHHHHHHCCCCEEEEEEECCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCC T ss_conf 79997487667778888662211319999999999485514885310023688999999998589999999998526766 Q ss_pred CCCCCCHHHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHEECCCCCCCCCEEEEEECHH Q ss_conf 89824434452298878864307420268898720112210024474403352566666200123676886688510213 Q 537021.9.peg.7 483 PAHPVSLQAIADDSRFREARLADPSVDRLLEISQKLEGLYRHASTHAAGIVIGDRPLSQLVPMYRDVRSDLPVTQFNMKW 562 (1033) Q Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~leg~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~~~ 562 (1033) ++.... .+...+++++.....++.++.++++|++|||++||+|+||||||||++||.+++|++++.+++.++|||||++ T Consensus 467 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~a~~leG~~r~~~~Ha~Gvvi~~~~l~~~~p~~~~~~~~~~~~q~d~~~ 545 (910) T 2hnh_A 467 PGMTLA-KAFEAEPQLPEIYEADEEVKALIDMARKLEGVTRNAGKHAGGVVIAPTKITDFAPLYCDEEGKHPVTQFDKSD 545 (910) T ss_dssp TTCCHH-HHHHHCTHHHHHHHHCHHHHHHHHHHHHHTTCEEEECSSCCEEEECSSCGGGTCCEECCTTSCSCEESSCHHH T ss_pred CCCCHH-HHHHHCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCEEEEEECCCC T ss_conf 686456-6666465668877632589999999988258744434567879983887431355444788870587304753 Q ss_pred HHHHCCCHHHHHHHCCHHHHHHHHHHHHHH-C----CCCCCCCCCCCHHHHHHHHHCCCCCHHHHCCCHHHHHHHHHHCC Q ss_conf 343112321356541789999999988740-2----54343323443588998762014101221145347888877174 Q 537021.9.peg.7 563 IEKAGLIKFDFLGLKTLTLLQKSLDFLAQR-G----VKVDLSLIPFDDHETYHLLTTKGTLGIFQLESSGMRQALEGMQP 637 (1033) Q Consensus 563 ~e~~gl~K~D~Lgl~~L~~i~~~~~~i~~~-~----~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~es~~~~~~l~~~~p 637 (1033) +|++||+|||||||+|||+|++++++|+++ + ..+|+..||+||++||+||++|+|.||||+||+|||++|++++| T Consensus 546 ~e~~gllK~D~Lgl~~lt~i~~~~~~i~~~~~~~~~~~~~~~~ip~~D~~~~~l~~~g~t~Gifq~es~~~~~~l~~~~P 625 (910) T 2hnh_A 546 VEYAGLVKFDFLGLRTLTIINWALEMINKRRAKNGEPPLDIAAIPLDDKKSFDMLQRSETTAVFQLESRGMKDLIKRLQP 625 (910) T ss_dssp HHHTTCEEEEECCCCHHHHHHHHHHHHHHTTTTTTCCCCCGGGCCSCCHHHHHHHHTTCCTTSTTCCSHHHHHHHHHHCC T ss_pred HHHCCCCCHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCHHHCCCCCHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCC T ss_conf 31168731236514469999999999986317667886787548986589999760425578676788799999986599 Q ss_pred CHHHHHHHHHHHCCCHHHHHHH--HHHHHHCCCCCCCCC-----CHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHH Q ss_conf 0488887665411411342003--688850465785026-----966733410335530012568988788632684130 Q 537021.9.peg.7 638 DCIEDIIALVSLYRPGPIDNIV--VYNNRKNGKEKIVSI-----HPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEA 710 (1033) Q Consensus 638 ~~~~dl~~~~al~RPGp~~~~~--~yi~rk~g~e~~~~~-----hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ea 710 (1033) ++|+||++++||+||||+++++ .|++|++|+++++|+ ||.|++||++||||||||||||+|++.+||||+++| T Consensus 626 ~~f~dl~~~~al~Rpgp~~~~~~~~~i~r~~~~~~~~y~~~~~~h~~l~~il~~T~Gv~vyqEqim~i~~~~ag~~~~~a 705 (910) T 2hnh_A 626 DCFEDMIALVALFRPGPLQSGMVDNFIDRKHGREEISYPDVQWQHESLKPVLEPTYGIILYQEQVMQIAQVLSGYTLGGA 705 (910) T ss_dssp CSHHHHHHHHHHSSHHHHTTTHHHHHHHHHTTSSCCCSBCSSSBCGGGHHHHGGGTTCCCBHHHHHHHHHHHHCCCHHHH T ss_pred CCHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCEEEEHHHHHHHHHHHCCCCHHHH T ss_conf 87999999998127776334640378876058877677643325688999971457835539999867787669986789 Q ss_pred HHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 78897620121101677878999998624999999999999999750001331014788899999999997302666777 Q 537021.9.peg.7 711 DVLRRAMGKKIKEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAA 790 (1033) Q Consensus 711 d~lRra~~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa 790 (1033) |.+|||||||+.++|++.++.|++||.++|++++.|++||++|++||+||||||||+|||++|||+||||+|||+||||| T Consensus 706 d~~Rka~~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~fa~Y~FnksHa~aYa~~ay~~AylK~~yP~eF~~a 785 (910) T 2hnh_A 706 DMLRRAMGKKKPEEMAKQRSVFAEGAEKNGINAELAMKIFDLVEKFAGYGFNKSHSAAYALVSYQTLWLKAHYPAEFMAA 785 (910) T ss_dssp HHHHHHHHHCCHHHHHHHHHHHHHHHHHTTCCHHHHHHHHHHHHHHHTTCBCHHHHHHHHHHHHHHHHHHHHCHHHHHHH T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHH T ss_conf 99998734467177899999999779984999999999999998874518778999999999999999987078999999 Q ss_pred HHCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECC-CEEEEEHHHHCCCCHHHHHHHHHHH-HCCCCCCHHHH Q ss_conf 510023542025899988886368503552334210035319-8698502685896989999999998-37999997899 Q 537021.9.peg.7 791 SMTLEMDNVEKIKKFCQDARQFNIQIMPPSVNTPCVDFKVGD-NRIYYSLAAIKGVGTTTARHIMEAS-ADKPFDSLEDF 868 (1033) Q Consensus 791 ~L~~~~~~~~~~~~~i~e~~~~gi~vl~PdIN~S~~~f~~~~-~~Ir~Gl~~Ikgvg~~~~~~Ive~r-~~g~f~sl~df 868 (1033) +||+++++++++..++.||+++||+|+|||||+|.++|++++ +.||+||++|||||+++++.|+++| ++|||+|+.|| T Consensus 786 ~l~~~~~~~~k~~~~~~e~~~~gi~i~~pdin~S~~~~~~~~~~~i~~gl~~ikgig~~~a~~Iv~~R~~~~~f~s~~Df 865 (910) T 2hnh_A 786 VMTADMDNTEKVVGLVDECWRMGLKILPPDINSGLYHFHVNDDGEIVYGIGAIKGVGEGPIEAIIEARNKGGYFRELFDL 865 (910) T ss_dssp HHHHTTTCHHHHHHHHHHHHHTTCCEECCCTTTCBSSCEECTTSCEECBGGGSTTCCHHHHHHHHHHHHTTCCCSSHHHH T ss_pred HHCCCCCCCHHHHHHHHHHHHCCCEEECCCCCCCCCCEEECCCCEEEECHHHCCCCCHHHHHHHHHHHHHCCCCCCHHHH T ss_conf 81588999389999999999879979898203678845876899598325435998999999999989747998999999 Q ss_pred HHHCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 9734824799899998886767234790389999989999999 Q 537021.9.peg.7 869 CSRVDSKKLNRRVLESLVFAGALDCFGYSRMQLLQSLDNIQKY 911 (1033) Q Consensus 869 ~~rv~~~~~nk~~le~Li~aGAfD~~~~~R~~ll~~i~~~~~~ 911 (1033) +.|++.+.+|++++|.||+|||||+||.||++|++++++++++ T Consensus 866 ~~r~~~~~~~~~~~~~Li~~gafd~~~~~R~~ll~~l~~~~~~ 908 (910) T 2hnh_A 866 CARTDTKKLNRRVLEKLIMSGAFDRLGPHRAALMNSLGDALKA 908 (910) T ss_dssp TTSSCSSSSCHHHHHHHHHTTTTTTTSSCHHHHHTTSSSSGGG T ss_pred HHHHCCCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 9870646799999999987799746067899999989999984 No 3 >3f2b_A DNA-directed DNA polymerase III alpha chain; DNA polymerase C; HET: DGT; 2.39A {Geobacillus kaustophilus} PDB: 3f2c_A* 3f2d_A* Probab=100.00 E-value=0 Score=1692.59 Aligned_cols=807 Identities=23% Similarity=0.302 Sum_probs=706.0 Q ss_pred CCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCC- Q ss_conf 8133433144476443299899999999789988998116517749999999997699638999999854766433445- Q 537021.9.peg.7 10 SFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQ- 88 (1033) Q Consensus 10 ~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~- 88 (1033) -.|||||||.||+|||+++|++||++|+++||+||||||||||+|+++|+++|+++|||||+|||+|+.++......+. T Consensus 115 krVeLH~HT~yS~ldg~~~~~elv~~A~e~G~~AiAITDhgnl~G~~ef~~aakk~GIKpIiG~E~~l~dd~~~i~~~~~ 194 (1041) T 3f2b_A 115 KRVELHLHTPMSQMDAVTSVTKLIEQAKKWGHPAIAVTDHAVVQSFPEAYSAAKKHGMKVIYGLEANIVDDGVPIAYNET 194 (1041) T ss_dssp CCCBCCBCCTTSTTTCCSCHHHHHHHHHHTTCSCEEECCBSCCTTHHHHHHHHHHHTCCEEEEEEEEEECC--------- T ss_pred CEEECCCCCCCHHHHHCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCC T ss_conf 70562478843454002789999999998599999883038587899999999986998699889998369753334431 Q ss_pred --CCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHH Q ss_conf --656666775399995696899999999989973248778740078999863489889981976796788887799789 Q 537021.9.peg.7 89 --GKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQV 166 (1033) Q Consensus 89 --~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~ 166 (1033) ........+|+++||||++||+|||+|+|.+|++ ++++.|++++++|.+ +++||||+|||..|++.++++.+..++ T Consensus 195 ~~~~~~~~~~~hlilLAKN~~GykNL~kLvS~s~~~-~f~~~PrI~~~~L~~-~~eGLIilSgc~~Gel~~~ll~~~~e~ 272 (1041) T 3f2b_A 195 HRRLGSGSGPFHVTLLAQNETGLKNLFKLVSLSHIQ-YFHRVPRIPRSVLVK-HRDGLLVGSGCDKGELFDNLIQKAPEE 272 (1041) T ss_dssp ---------CEEEEEEECSHHHHHHHHHHHHHHHTT-TCSSSCCEEHHHHHH-TCTTEEEECCSSSSSSTTC-------- T ss_pred CCCCCCCCCCCCEEEEECCHHHHHHHHHHHHHHHHH-CCCCCCCCCHHHHHH-CCCCEEEECCCCCCHHHHHHHCCCHHH T ss_conf 123456788753799939999999999999999984-699986577999985-799869973887677999998089999 Q ss_pred HHHHHHH--HHHHCCCCEEEEEEC-CCHH-HH---HHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 9999999--987659868999943-9911-58---999999999998189899808830078788999987668846985 Q 537021.9.peg.7 167 AEKRLLT--FKKLFGDRLYVNLQR-HRGY-DR---YRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHSTV 239 (1033) Q Consensus 167 a~~~l~~--l~~~Fgd~~ylEl~~-~~~~-e~---~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~~ 239 (1033) +++.+.+ |.++|+.++|.+++. +... ++ ..+..++++|+++++|+||||||||++++|+.+|++|.||+.+.. T Consensus 273 ~~~~~~~y~~lei~~~~~y~~li~~~~~~~~~~~~~~~~~li~lAkk~~iplVATnDvhYl~~eD~~~~~iL~~i~~~~~ 352 (1041) T 3f2b_A 273 VEDIARFYDFLEVHPPDVYKPLIEMDYVKDEEMIKNIIRSIVALGEKLDIPVVATGNVHYLNPEDKIYRKILIHSQGGAN 352 (1041) T ss_dssp CCTTGGGCSBEEECCGGGGCCC----CCSCHHHHHHHHHHHHHHHHHTTCCEEECCCBSBSSGGGHHHHHHHHHTTGGGS T ss_pred HHHHHHHHHHHHHCCCCCEEEHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHCCCCC T ss_conf 99999999999857862112011015754089999999999999998099989957866788889999999986047998 Q ss_pred CCCCCCCCCCCCCCCCCHHHHHHHCCHHHH------HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 765310024534323434788751220358------99878864111000133221023455575688776898899987 Q 537021.9.peg.7 240 VSQKDRPRVTPDHYLKNRSEMVSIFSDLPE------ALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRN 313 (1033) Q Consensus 240 l~~~~~~~~~~~~yLks~eEM~~~f~~~pe------Ai~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~ 313 (1033) ..... ..+++||+|++||.++|+++|+ +++||..||++|+. ..|.+|++.+|.+ .+++++|++ T Consensus 353 ~~~~~---~~~e~ylks~eEM~~~F~~l~~e~~~e~~i~Nt~~Ia~~~~~----~~p~~~~~~~P~~----~~~~e~L~~ 421 (1041) T 3f2b_A 353 PLNRH---ELPDVYFRTTNEMLDCFSFLGPEKAKEIVVDNTQKIASLIGD----VKPIKDELYTPRI----EGADEEIRE 421 (1041) T ss_dssp SSTTS---CCCCCBCCCHHHHHHHHGGGHHHHHHHHHTHHHHHHHHTCCC----CCSSCSSCBCCCC----TTHHHHHHH T ss_pred HHHHH---HHHHHCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCC----CCCCCCCCCCCCC----CCHHHHHHH T ss_conf 36653---323430478999999864332778888999989999988501----3556864457655----567899999 Q ss_pred HHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCC Q ss_conf 65410342012464324578389999988778989860630557888999998510476433444420466776554403 Q 537021.9.peg.7 314 KAVAGLEVRLAQGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTIT 393 (1033) Q Consensus 314 l~~~Gl~~R~~~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT 393 (1033) +|++|+++||++. .++.|++||++||++|.+|||++|||||||+|+|||++||+|| ||||||||||||||||| T Consensus 422 l~~~~~~~~yg~~------l~~~~~~RL~~EL~vI~~~Gf~~YFLIv~dlV~~a~~~gi~vg-gRGSaagSlVay~LgIT 494 (1041) T 3f2b_A 422 MSYRRAKEIYGDP------LPKLVEERLEKELKSIIGHGFAVIYLISHKLVKKSLDDGYLVG-SRGSVGSSFVATMTEIT 494 (1041) T ss_dssp HHHHHHHHHHCSS------CCHHHHHHHHHHHHHHHHTTCHHHHHHHHHHHHHHHHTTCCCE-ECGGGGGCHHHHHTTSC T ss_pred HHHHHHHHHCCCC------CCHHHHHHHHHHHHHHHHCCCCCEEHHHHHHHHHHHHCCCCCC-CCCHHHHHHHHHHCCCC T ss_conf 9987777622657------7288999999999999850568775999999999986798625-66314567999861765 Q ss_pred CCCHHH-----------------------------------------CCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 756333-----------------------------------------231056762100145787533365687899999 Q 537021.9.peg.7 394 DIDPLR-----------------------------------------FSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIR 432 (1033) Q Consensus 394 ~VDPi~-----------------------------------------~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~ 432 (1033) +||||+ |||+|||||||+|.+|||||+||++++|++|++ T Consensus 495 eVDPl~~hy~c~~c~~~ef~~~~~~~~g~dlpd~~cp~c~~~~~~dg~~L~FErFLn~~r~~~PDIDlDF~~~~r~~v~~ 574 (1041) T 3f2b_A 495 EVNPLPPHYVCPNCKHSEFFNDGSVGSGFDLPDKNCPRCGTKYKKDGHDIPFETFLGFKGDKVPDIDLNFSGEYQPRAHN 574 (1041) T ss_dssp SCCCSCSEEECTTTCCEEECCSSCCSCGGGSCCCBCTTTCCBCEEECCCCCTHHHHTTTSCSCCCEEEEEETTTHHHHHH T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCEEEEHHHCCCCCCCCCHHHCCCCHHHHHHHHH T ss_conf 64898632246521112114555445677776544521101331457444142215833023453211253788999999 Q ss_pred HHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCHHHHHCCHHHHHHHHCCCCCHHHH Q ss_conf 99830380215542207787788999999975279988988654310145898244344522988788643074202688 Q 537021.9.peg.7 433 YVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKLIPNDPAHPVSLQAIADDSRFREARLADPSVDRLL 512 (1033) Q Consensus 433 y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 512 (1033) |++++||+++||+|+||+|+++|+|++++.+. +...+.. ....++. T Consensus 575 yv~~~yG~~~va~i~T~~t~~~k~A~~~vk~~---------------~~~~~~~-------------------~r~aei~ 620 (1041) T 3f2b_A 575 YTKVLFGEDNVYRAGTIGTVADKTAYGFVKAY---------------ASDHNLE-------------------LRGAEID 620 (1041) T ss_dssp HHHHHHCTTSEEEEEEEEECCHHHHHHHHHHH---------------HHHTTCC-------------------CCHHHHH T ss_pred HHHHHHCCCCEEEEEECCCHHHHHHHHHHHHC---------------CCCCCHH-------------------HHHHHHH T ss_conf 88876360037898631105766564131001---------------0124324-------------------6799999 Q ss_pred HHHHHHHHHHHCCCCCCCEEEECCC--CHHHHEECCCCC---CCCCEEEEEECHHHHHHCCCHHHHHHHCCHHHHHHHHH Q ss_conf 9872011221002447440335256--666620012367---68866885102133431123213565417899999999 Q 537021.9.peg.7 513 EISQKLEGLYRHASTHAAGIVIGDR--PLSQLVPMYRDV---RSDLPVTQFNMKWIEKAGLIKFDFLGLKTLTLLQKSLD 587 (1033) Q Consensus 513 ~~a~~leg~~r~~~~Ha~Gvvi~~~--~l~~~~p~~~~~---~~~~~~~q~d~~~~e~~gl~K~D~Lgl~~L~~i~~~~~ 587 (1033) ++|++|||+|||+|+|||||||+++ ||+++||++.++ .+++++|||||+++|+ ||||||||||+|||+|+.+.+ T Consensus 621 ~la~~leG~~R~~g~HagGvVI~p~~~~l~d~~Pl~~~~~~~~~~~~~tqfd~~~ie~-gLlK~D~LGl~~lt~I~~~~~ 699 (1041) T 3f2b_A 621 RLAAGCTGVKRTTGQHPGGIIVVPDYMEIYDFTPIQYPADDTSSEWRTTHFDFHSIHD-NLLKLDILGHDDPTVIRMLQD 699 (1041) T ss_dssp HHHHHHTTCEEEEEEEEEEEEECCTTSCGGGTCCEECGGGCTTCSSCEEEEEHHHHTT-TBCEEEEEEEHHHHHHHHHHH T ss_pred HHHHCCCCCCCHHHCCCCCEEEECCCCCCCCCCCCEECCCCCCCCCCCCCCHHHHHHC-CCEEEEECCCCCHHHHHHHHH T ss_conf 9861357751144427771798057531221256034477776564244201222222-866686537885799999987 Q ss_pred HHHHHCCCCCCCCCCCCHHHHHHHHHCCC--------------CCHHHHCCCHHHHHHHHHHCCCHHHHHHHHHHHCCCH Q ss_conf 88740254343323443588998762014--------------1012211453478888771740488887665411411 Q 537021.9.peg.7 588 FLAQRGVKVDLSLIPFDDHETYHLLTTKG--------------TLGIFQLESSGMRQALEGMQPDCIEDIIALVSLYRPG 653 (1033) Q Consensus 588 ~i~~~~~~~d~~~ip~~D~~~~~l~~~g~--------------t~GvFQ~es~~~~~~l~~~~p~~~~dl~~~~al~RPG 653 (1033) ++ .+|+.+||+||++||++|++++ |.|||||||++||++|++++|++|+|||+++||+||| T Consensus 700 l~-----~id~~~Ip~dD~~t~~lf~~~~~~~~~~~~~~~~~~T~GifQ~eS~~~r~~L~~~kP~~f~DLv~i~aL~rp~ 774 (1041) T 3f2b_A 700 LS-----GIDPKTIPTDDPDVMGIFSSTEPLGVTPEQIMCNVGTIGIPEFGTRFVRQMLEETRPKTFSELVQISGLSHGT 774 (1041) T ss_dssp HH-----CCCGGGCCSCCHHHHHTTTCSGGGTSCHHHHTCSSSCTTSTTTTSHHHHHHHHHHCCCSHHHHHHHHHHTSST T ss_pred HH-----CCCHHHCCCCCHHHHHHHHCCCCCCCCHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHCCCC T ss_conf 60-----2787657765677888764024112216888742375556741116799999970898667678787510364 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHH Q ss_conf 34200368885046578502696673341033553001256898878863268413078897620121101677878999 Q 537021.9.peg.7 654 PIDNIVVYNNRKNGKEKIVSIHPLIDPMLKETQGVIIYQEQVMQIAQLLSGYSLSEADVLRRAMGKKIKEEMDKQKERFI 733 (1033) Q Consensus 654 p~~~~~~yi~rk~g~e~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia~~~ag~~~~ead~lRra~~kk~~~~~~~~~~~f~ 733 (1033) |+.. +|+++.+.+.||.|+++|+.|||||||| | ++|++.++|+.+|+++.|++. ++++|. T Consensus 775 ~v~~-------~n~~e~i~~~~~~l~~vl~~t~gimvy~--i------~~g~~~~~a~~i~e~~rk~k~-----~~~~~~ 834 (1041) T 3f2b_A 775 DVWL-------GNAQELIQNGTCTLSEVIGCRDDIMVYL--I------YRGLEPSLAFKIMESVRKGKG-----LTPEFE 834 (1041) T ss_dssp TSTT-------TTHHHHHHTTSSCGGGSCCSHHHHHHHH--H------HTTCCHHHHHHHHHHHTTTCC-----CCHHHH T ss_pred CHHH-------CCCCHHCCCCCCCHHHHHHHHHHHHHHH--H------HHHHHHHHHHHHHHHHHHHCC-----CCHHHH T ss_conf 1121-------1440110368757998850274799999--9------997779999999999997479-----989999 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCH---------HHHH Q ss_conf 9986249999999999999997500013310147888999999999973026667775100235420---------2589 Q 537021.9.peg.7 734 SGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVE---------KIKK 804 (1033) Q Consensus 734 ~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~---------~~~~ 804 (1033) +++.++|++++. ..+.+|++||||||||+|||++|||+||||+|||+|||||+||+++++++ ++.. T Consensus 835 ~~~~~~~v~~~~-----~~~~~~~~YgFnKSHAaAYa~iayq~AylK~hYP~eF~aa~ls~~~~~~d~~~~~~~~~~i~~ 909 (1041) T 3f2b_A 835 AEMRKHDVPEWY-----IDSCKKIKYMFPKAHAAAYVLMAVRIAYFKVHHPLLYYASYFTVRAEDFDLDAMIKGSAAIRK 909 (1041) T ss_dssp HHHHHTTCCHHH-----HHHHHHCSCCCCHHHHHHHHHHHHHHHHHHHHSHHHHHHHHHHHTCCCCCHHHHHHCHHHHHH T ss_pred HHHHHCCCHHHH-----HHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHCCCCCCCCHHHHHCCHHHHHH T ss_conf 988877868999-----999861527987899999999999999998605399999972568567755766321788999 Q ss_pred HHHHHHH------------------------CCCCCCCCCCCCCCE-EEEECCCEEEEEHHHHCCCCHHHHHHHHHHHHC Q ss_conf 9988886------------------------368503552334210-035319869850268589698999999999837 Q 537021.9.peg.7 805 FCQDARQ------------------------FNIQIMPPSVNTPCV-DFKVGDNRIYYSLAAIKGVGTTTARHIMEASAD 859 (1033) Q Consensus 805 ~i~e~~~------------------------~gi~vl~PdIN~S~~-~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~ 859 (1033) ++.||++ +|++++|||||+|.+ .|.+++++||+||++|||||++++++|+++|++ T Consensus 910 ~i~e~~~~g~~~~~~~~~~~~~l~~~~e~~~rg~~~~~~di~~S~a~~f~i~~~~I~~gl~aIkGlG~~~a~~Iv~~R~~ 989 (1041) T 3f2b_A 910 RIEEINAKGIQATAKEKSLLTVLEVALEMCERGFSFKNIDLYRSQATEFVIDGNSLIPPFNAIPGLGTNVAQAIVRAREE 989 (1041) T ss_dssp HHHHHHHTGGGCCHHHHHHHHHHHHHHHHHHTTCEECCCCTTTCCSSCCEEETTEEECCGGGSTTCCHHHHHHHHHHHHT T ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCCCCCCEEECCCCEEEECHHHCCCCCHHHHHHHHHHHCC T ss_conf 99999973985465322333223220678756884178863335662121059949951564389799999999998667 Q ss_pred CCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHH Q ss_conf 999997899973482479989999888676723479038999998 Q 537021.9.peg.7 860 KPFDSLEDFCSRVDSKKLNRRVLESLVFAGALDCFGYSRMQLLQS 904 (1033) Q Consensus 860 g~f~sl~df~~rv~~~~~nk~~le~Li~aGAfD~~~~~R~~ll~~ 904 (1033) |||+|++||+.|+ .+||+++|.||++||||+|+.+|+..|++ T Consensus 990 g~F~s~~Df~~R~---~~~k~~le~Li~~Gafd~l~~~~q~~Lf~ 1031 (1041) T 3f2b_A 990 GEFLSKEDLQQRG---KLSKTLLEYLESRGCLDSLPDHNQLSLFS 1031 (1041) T ss_dssp SCCCSHHHHHHHH---TCCHHHHHHHHHTTTTTTSCSSCSCCCC- T ss_pred CCCCCHHHHHHHH---CCCHHHHHHHHHCCCCCCCCHHHHHHHHH T ss_conf 9999999999732---89999999999779977898328999982 No 4 >3e38_A Two-domain protein containing predicted PHP-like dependent phosphoesterase; two-domain protein containing predicted PHP-like metal-depen phosphoesterase; 2.20A {Bacteroides vulgatus atcc 8482} Probab=99.78 E-value=5.6e-27 Score=244.83 Aligned_cols=295 Identities=11% Similarity=-0.009 Sum_probs=189.3 Q ss_pred CHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCC------------CCEEEEEEEEEE Q ss_conf 13343314447644329989999999978998899811651774999999999769------------963899999985 Q 537021.9.peg.7 11 FVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAG------------IQPIIGCQLDID 78 (1033) Q Consensus 11 FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~g------------IKPIiG~E~~v~ 78 (1033) -+-||+||.|| ||..+|+++|++|++.||++||||||+|++|+++|+++|++.| +..+.|.|.... T Consensus 20 ~~DLH~HT~~S--DG~~~p~~lv~~a~~~Gl~~iAiTDH~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~ 97 (343) T 3e38_A 20 KCDFHMHSVFS--DGLVWPTVRVDEAYRDGLDAISLTEHIEYRPHKQDVVSDHNRSFDLCREQAEKLGILLIKGSEITRA 97 (343) T ss_dssp EEECCBCCTTT--TCSBCHHHHHHHHHHTTCSEECCEEESSCCTTTTTBCCCTTHHHHHHHHHHHHHTCEECCEEEEECS T ss_pred EEECCCCCCCC--CCCCCHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCC T ss_conf 87778583732--7878799999999981999999887986537499999999864798744554058358861011364 Q ss_pred CCCCCCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHH Q ss_conf 47664334456566667753999956968999999999899732487787400789998634898899819767967888 Q 537021.9.peg.7 79 MQDNLKMTNQGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRA 158 (1033) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~ 158 (1033) . ...|+..+-... +. ......+....+.... ..++.....+... . .. T Consensus 98 ~---------------~~~~~~~~~~~~-------------~~-~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~-~-~~ 144 (343) T 3e38_A 98 M---------------APGHFNAIFLSD-------------SN-PLEQKDYKDAFREAKK--QGAFMFWNHPGWD-S-QQ 144 (343) T ss_dssp T---------------TTCEEEEESCSC-------------SG-GGCCSSHHHHHHHHHH--TTCEEEECCTTCT-T-TS T ss_pred C---------------HHCCCCCCCCHH-------------HH-CCCCCCHHHHHHHHHH--CCCEEEECCCCCC-C-CC T ss_conf 1---------------211441025624-------------30-4442139999999876--2770474242002-4-64 Q ss_pred HHCCCHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCC Q ss_conf 87799789999999998765986899994399115899999999999818989980883007878899998766884698 Q 537021.9.peg.7 159 FFFNSSQVAEKRLLTFKKLFGDRLYVNLQRHRGYDRYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHST 238 (1033) Q Consensus 159 l~~~~~~~a~~~l~~l~~~Fgd~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~ 238 (1033) ......... ... ...++..+++|++++.... ..++++|+++++|+|+++|+||..++|+..|+.+.|+.+.. T Consensus 145 ~~~~~~~~~--~~~--~~~~~~~~~IEv~n~~~~~----~~~~~la~~~~l~~va~sD~H~~~~~~~~~~~~~~~~~t~~ 216 (343) T 3e38_A 145 PDTTKWWPE--HTA--LYQEGCMHGIEVANGHLYM----PEAIQWCLDKNLTMIGTSDIHQPIQTDYDFEKGEHRTMTFV 216 (343) T ss_dssp SSCCCCCHH--HHH--HHHTTCCSEEEEEETTEEC----THHHHHHHHHTCEEEEECCBCSCHHHHCCGGGTCCCCEEEE T ss_pred HHHHHHHHH--HHH--HHHHCCCCEEEEECCCCCH----HHHHHHHHHCCCCEECCCCCEECCHHHHHHHCCCCCCEEEE T ss_conf 023344389--999--9750887578753466236----99999998679816626751205723233310468866999 Q ss_pred CCCCCCCCC-----CCCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 576531002-----453432343478875122035899878864111000133221023455575688776898899987 Q 537021.9.peg.7 239 VVSQKDRPR-----VTPDHYLKNRSEMVSIFSDLPEALENTVEIARRCSFILQVHAPILPRFIEKECDDIQKEEENELRN 313 (1033) Q Consensus 239 ~l~~~~~~~-----~~~~~yLks~eEM~~~f~~~peAi~NT~~IAe~c~~~l~~~~~~lP~f~~p~~~~~~~~~~~~Lr~ 313 (1033) ..++..+.. .+...|+++.++|..+|.+.|++++||.+|+++|+.......+.+|.++.|.. |+. T Consensus 217 ~~~~~s~~~i~~Al~~g~~y~~~~~~m~~~~~~~~e~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~----------l~~ 286 (343) T 3e38_A 217 FAKERSLQGIREALDNRRTAAYFHELLIGREDLLRPFFEKCVKIEEVSRNEQGVTLSITNVTDLVLK----------LKK 286 (343) T ss_dssp EESSSSHHHHHHHHHTTCEEEEETTEEECCHHHHHHHHHHHEEEEEEEEETTEEEEEEEECSSSCEE----------EEE T ss_pred ECCCCCHHHHHHHHHCCCEEEECHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCE----------EEC T ss_conf 8577885459999864776897645654334654999973034654214453135234788874011----------551 Q ss_pred HHHHHHHHCCC------CCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHH Q ss_conf 65410342012------4643245783899999887789898606305578 Q 537021.9.peg.7 314 KAVAGLEVRLA------QGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFL 358 (1033) Q Consensus 314 l~~~Gl~~R~~------~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFL 358 (1033) .+..++.++.. ++.........++.+|++.|++|++..-.+++|| T Consensus 287 ~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~fevlN~~i~p~~~~ 337 (343) T 3e38_A 287 TAHDTLLVYFRDMTLKPHTRYTVRIGFKQGIKGGDVNFEVTNFIVAPDKGL 337 (343) T ss_dssp CSCCTTEECCSEEEECTTEEEEEEEEECTTCCCCEEEEEEEEEEEETTEEE T ss_pred CCCCCHHHCCCEEEECCCCCEEEEEEECCCCCCCCCEEEEEEEEECCCCCC T ss_conf 575353224760587478735898850334443431489999657699776 No 5 >3o0f_A Putative metal-dependent phosphoesterase; structural genomics, joint center for structural genomics, J protein structure initiative; HET: AMP; 1.94A {Bifidobacterium adolescentis} PDB: 3e0f_A* Probab=99.74 E-value=6.1e-19 Score=176.83 Aligned_cols=73 Identities=15% Similarity=0.258 Sum_probs=68.4 Q ss_pred CCCCHH--HHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCC Q ss_conf 988133--433144476443299899999999789988998116517749999999997699638999999854766 Q 537021.9.peg.7 8 SPSFVH--LQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDN 82 (1033) Q Consensus 8 ~~~Fvh--Lh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~ 82 (1033) .|.|.| ||+||.|| ||..+|+++|++|+++||++|||||||||+|+++|+++|++.|||+|+|+|+++..... T Consensus 9 ~~~~~~~DLH~HT~yS--DG~~~~eelv~~A~e~G~~~iaITDH~~l~G~~~~~~~a~~~gi~~i~g~E~~~~~~~~ 83 (301) T 3o0f_A 9 EPPAQGWDIHCHTVFS--DGTETPRTLVEQARKLGLHGVAIADHDTTAGWDEATEASEEIGLPLLLGTEITAVDEDV 83 (301) T ss_dssp CCCSSSEEEEECCTTT--TCSSCHHHHHHHHHHTTCSEEEECCBTCCTTHHHHHHHHHHHTCCEEEEEEEEEEETTE T ss_pred CCCCCCEECCCCCCCC--CCCCCHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHCCCCCCCCCEEEECCCCC T ss_conf 8998776478185743--89998999999999889999999789985029999999987499811561333034443 No 6 >1m65_A Hypothetical protein YCDX; structural genomics, beta-alpha-barrel, zinc, metallo- enzyme, structure 2 function project, S2F, unknown function; 1.57A {Escherichia coli} SCOP: c.6.3.1 PDB: 1m68_A 1pb0_A Probab=99.43 E-value=1.3e-14 Score=140.24 Aligned_cols=181 Identities=10% Similarity=0.007 Sum_probs=109.6 Q ss_pred CCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCC--------CCEEEEEEEEEECCC Q ss_conf 813343314447644329989999999978998899811651774999999999769--------963899999985476 Q 537021.9.peg.7 10 SFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAG--------IQPIIGCQLDIDMQD 81 (1033) Q Consensus 10 ~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~g--------IKPIiG~E~~v~~~~ 81 (1033) .++.|||||.|| .||..+|+++|++|++.|++++|||||++++|..++..++++.+ ++.+.|+|+.+.... T Consensus 2 ~~iDlH~HT~~S-~dg~~~~~e~v~~A~~~G~~~iaiTDH~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~E~~~~~~~ 80 (245) T 1m65_A 2 YPVDLHMHTVAS-THAYSTLSDYIAQAKQKGIKLFAITDHGPDMEDAPHHWHFINMRIWPRVVDGVGILRGIEANIKNVD 80 (245) T ss_dssp CCEECCBCCTTS-TTCCCCHHHHHHHHHHHTCCEEEEEEECTTSTTCCCTHHHHGGGGSCSEETTEEEEEEEEEEBCSTT T ss_pred CCCCCCCCCCCC-CCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCH T ss_conf 861523387988-9866859999999998799999980899853402679999999999997289789743412324642 Q ss_pred CCCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCC---CHHHHHHHCCCCEEEECCCCCCHHHHH Q ss_conf 64334456566667753999956968999999999899732487787400---789998634898899819767967888 Q 537021.9.peg.7 82 NLKMTNQGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRI---CLSWLQEIGTEGLIMLTGGSSGPIDRA 158 (1033) Q Consensus 82 ~~~~~~~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i---~~~~L~e~~~egLIvlsg~~~g~i~~~ 158 (1033) ... ..+...+.+...++...+ .....|+. ..+.+.+....|.+...+.+.... T Consensus 81 ~~~------------------~~~~~~~~~~~~~i~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~ah~~~~~--- 136 (245) T 1m65_A 81 GEI------------------DCSGKMFDSLDLIIAGFH---EPVFAPHDKATNTQAMIATIASGNVHIISHPGNPK--- 136 (245) T ss_dssp CCB------------------SCCHHHHHHCSEEEEECC---TTTSCCCCHHHHHHHHHHHHHTSCCSEECCTTCTT--- T ss_pred HHH------------------HHHHHHHCCCCEEEEEEE---CCCCCCCCHHHHHHHHHHHHHCCCCCEECCCCHHH--- T ss_conf 566------------------656664225550788750---13568745999999999999679986313645054--- Q ss_pred HHCCCHHHHHHHHHHHHHHCCCCEEEEEECCCHH-----HHHHHHHHHHHHHHHCCCEEEECCCCCCCH Q ss_conf 8779978999999999876598689999439911-----589999999999981898998088300787 Q 537021.9.peg.7 159 FFFNSSQVAEKRLLTFKKLFGDRLYVNLQRHRGY-----DRYRESQVVQLAYTHELPLVATNNSLFLSE 222 (1033) Q Consensus 159 l~~~~~~~a~~~l~~l~~~Fgd~~ylEl~~~~~~-----e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~ 222 (1033) .... ....+..+.+. +.++|+...... +...+..++++++++|+|+|+++|+|++.. T Consensus 137 -~~~~---~~~i~~~~~~~---~~~~e~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~SDaH~~~~ 198 (245) T 1m65_A 137 -YEID---VKAVAEAAAKH---QVALEINNSSFLHSRKGSEDNCREVAAAVRDAGGWVALGSDSHTAFT 198 (245) T ss_dssp -SCCC---HHHHHHHHHHH---TCEEEEETTC----------CHHHHHHHHHHHTCCEEEECCBSSGGG T ss_pred -HHHH---HHHHHHHHHHH---CCEEEEEEHHHHCCCCCCCCHHHHHHHHHHHCCCCEEEECCCCCHHH T ss_conf -4545---79999999861---12544300344137777666399999999985994999889899445 No 7 >3dcp_A Histidinol-phosphatase; HISK, histidine biosynthesis, NESG, LMR141, structural genomics, PSI-2, protein structure initiative; 2.10A {Listeria monocytogenes str} Probab=99.42 E-value=1.9e-13 Score=130.33 Aligned_cols=207 Identities=14% Similarity=0.035 Sum_probs=107.4 Q ss_pred HHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH----------------------------HHHHH Q ss_conf 34331444764432998999999997899889981165177499999----------------------------99997 Q 537021.9.peg.7 13 HLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFS----------------------------QKACA 64 (1033) Q Consensus 13 hLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~----------------------------~ackk 64 (1033) =||+||.||-++++.+|+++|++|++.|++++|||||++++|..++. ..+.. T Consensus 4 DlH~HT~~Sdg~~~~s~eelv~~A~~~Gl~~iaITDH~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ 83 (283) T 3dcp_A 4 DGHTHTEFCPHGTHDDVEEMVLKAIELDFDEYSIVEHAPLSSEFMKNTAGDKEAVTTASMAMSDLPYYFKKMNHIKKKYA 83 (283) T ss_dssp EEEECCTTCTTSCCCCHHHHHHHHHHTTCCEEEEEEECCCCHHHHHTBCSCTHHHHTTSCCGGGHHHHHHHHHHHHHHTT T ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHC T ss_conf 25367598588983879999999998799999984899976604667777762221036658889999999999998636 Q ss_pred CCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEEECC--HHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCC Q ss_conf 69963899999985476643344565666677539999569--6899999999989973248778740078999863489 Q 537021.9.peg.7 65 AGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLLVST--AEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTE 142 (1033) Q Consensus 65 ~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlLAkN--~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~e 142 (1033) .+++++.|+|+.+.....................+++.+.. ..++......++..+.. ..+..++...+.+..+ .+ T Consensus 84 ~~~~~~~g~e~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-~~ 161 (283) T 3dcp_A 84 SDLLIHIGFEVDYLIGYEDFTRDFLNEYGPQTDDGVLSLHFLEGQGGFRSIDFSAEDYNE-GIVQFYGGFEQAQLAY-LE 161 (283) T ss_dssp TTCEEEEEEEEECCTTCHHHHHHHHHHHGGGCSEEEEECCEEEETTEEEESSSCHHHHHH-HTHHHHTSHHHHHHHH-HH T ss_pred CCCCHHHHHHHHHHHCCHHHHHHHHHHHHCCCCEEEEEHHHHCCCCCCCCHHHCCHHHHH-HHHHHCCCHHHHHHHH-HH T ss_conf 896224515774531006678999998623456254102431465443521014399999-9987347438999999-99 Q ss_pred CE------------EEECCC-----CCCHHHHHHHCCCHHHHHHHHHHHHHHC-CCCEEEEEECC----CH-HHHHHHHH Q ss_conf 88------------998197-----6796788887799789999999998765-98689999439----91-15899999 Q 537021.9.peg.7 143 GL------------IMLTGG-----SSGPIDRAFFFNSSQVAEKRLLTFKKLF-GDRLYVNLQRH----RG-YDRYRESQ 199 (1033) Q Consensus 143 gL------------Ivlsg~-----~~g~i~~~l~~~~~~~a~~~l~~l~~~F-gd~~ylEl~~~----~~-~e~~~~~~ 199 (1033) ++ ..+.+. ..+...........+.+...+.++.+.+ .+++++|++.. .. .+...+.. T Consensus 162 ~l~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iEvn~~~~~~~~~~e~~~~~~ 241 (283) T 3dcp_A 162 GVKQSIEADLGLFKPRRMGHISLCQKFQQFFGEDTSDFSEEVMEKFRVILALVKKRDYELDFNTAGLFKPLCGETYPPKK 241 (283) T ss_dssp HHHHHHHCCCCTTCCSEECCTTGGGTTGGGGTCCGGGCCHHHHHHHHHHHHHHHHHTCEEEEECGGGGSTTCCSCBSCHH T ss_pred HHHHHHHHHHCCCCCCEECHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCEEEEEECCCCCCCCCCCCCHHHH T ss_conf 99999998646788754145689986287521135442355899999999999977948999576566886444453699 Q ss_pred HHHHHHHHCCCEEEECCCCCCC Q ss_conf 9999998189899808830078 Q 537021.9.peg.7 200 VVQLAYTHELPLVATNNSLFLS 221 (1033) Q Consensus 200 l~~lA~~~~iPlVaTndv~Y~~ 221 (1033) ++++|+++|+|+++++|+|.++ T Consensus 242 ll~~a~e~gl~~~~gSDaH~~~ 263 (283) T 3dcp_A 242 IVTLASELQIPFVYGSDSHGVQ 263 (283) T ss_dssp HHHHHHHTTCCEEEECCBSSGG T ss_pred HHHHHHHCCCEEEECCCCCCHH T ss_conf 9999998499899807878865 No 8 >2yxo_A Histidinol phosphatase; metal-dependent, hydrolase; 1.60A {Thermus thermophilus} PDB: 2yz5_A 2z4g_A Probab=99.36 E-value=1.1e-13 Score=132.29 Aligned_cols=200 Identities=12% Similarity=0.029 Sum_probs=109.7 Q ss_pred HHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHH---------------HH--HHHHCCCCEEEEEE Q ss_conf 33433144476443299899999999789988998116517749999---------------99--99976996389999 Q 537021.9.peg.7 12 VHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEF---------------SQ--KACAAGIQPIIGCQ 74 (1033) Q Consensus 12 vhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F---------------~~--ackk~gIKPIiG~E 74 (1033) +-|||||.|| .||..+|+++|++|++.|++++|||||+++.+.++. .. ..+..++++++|+| T Consensus 2 iDlH~HT~~S-~DG~~~~ee~v~~A~~~Gl~~iaiTDH~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~e 80 (267) T 2yxo_A 2 VDSHVHTPLC-GHAEGHPEAYLEEARAKGLKGVVFTDHSPMPPWYDPESRMRLEALPFYLLALERVRERAQDLYVGIGLE 80 (267) T ss_dssp EEEEECCGGG-SSCCSCHHHHHHHHHHTTCSEEEEEEECCCCTTSSGGGSCCGGGHHHHHHHHHHHHHHCTTSEEEEEEE T ss_pred EEECCCCCCC-CCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEE T ss_conf 7433478988-985765999999999859999999679886565437665103437999999999998726825873026 Q ss_pred EEEECCCCCCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCH-------HHHHHHCCCCEEEE Q ss_conf 99854766433445656666775399995696899999999989973248778740078-------99986348988998 Q 537021.9.peg.7 75 LDIDMQDNLKMTNQGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICL-------SWLQEIGTEGLIML 147 (1033) Q Consensus 75 ~~v~~~~~~~~~~~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~-------~~L~e~~~egLIvl 147 (1033) +.+.++.......... .......+.......++...... ...+..++... +.+......+.+.+ T Consensus 81 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~v 151 (267) T 2yxo_A 81 ADFHPGTEGFLAQLLR--RYPFDYVIGSVHYLGAWPLDHPD-------HQEEYAWRDLKEVFRAYFQEVEKAARSGLFHA 151 (267) T ss_dssp EECCTTCHHHHHHHHH--SSCCSCEEEECCCBTTBCTTCGG-------GGGGGGGSCHHHHHHHHHHHHHHHHTSSCCSE T ss_pred EEECCCCCHHHHHHHH--HCCHHHHHHHCCCCCCCCCCCHH-------HHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCE T ss_conf 5303331021488886--32446787303543677753146-------77764368899999999999999997799658 Q ss_pred CCCCCCHHHHHHHCCCHHHHHHHHHHHHH-HCCCCEEEEEECCCHH----HHHHHHHHHHHHHHHCCCEEEECCCCCCCH Q ss_conf 19767967888877997899999999987-6598689999439911----589999999999981898998088300787 Q 537021.9.peg.7 148 TGGSSGPIDRAFFFNSSQVAEKRLLTFKK-LFGDRLYVNLQRHRGY----DRYRESQVVQLAYTHELPLVATNNSLFLSE 222 (1033) Q Consensus 148 sg~~~g~i~~~l~~~~~~~a~~~l~~l~~-~Fgd~~ylEl~~~~~~----e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~ 222 (1033) .++.... .+.......+.......++.+ .+..++++|++.++.. +...+..++++++++|+|+++++|+|++.. T Consensus 152 ~ah~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ein~~~~~~~~~~~~~~~~~~~~a~~~gl~~~~gSDaH~~~~ 230 (267) T 2yxo_A 152 IGHLDLP-KKFGHRLPEEALLELAEPALRAVAEAGLFLDVNTAGLRRPAKEVYPAPALLRRARELGIGLVLGSDAHRPEE 230 (267) T ss_dssp ESCTTGG-GTTSCCCCHHHHHHHHHHHHHHHHHHTCEEEEEGGGGGSTTCSCBSCHHHHHHHHHHTCCEEEECCBSSGGG T ss_pred EECCCHH-HHCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHCCCEEEEECCCCCHHH T ss_conf 5265631-321666777889999999999999539779996640247876555589999999985992999789898789 No 9 >2wje_A CPS4B, tyrosine-protein phosphatase CPSB; capsule biogenesis/degradation, manganese, hydrolase, exopolysaccharide synthesis; 1.90A {Streptococcus pneumoniae} PDB: 2wjd_A 2wjf_A Probab=99.15 E-value=3.5e-11 Score=111.23 Aligned_cols=221 Identities=15% Similarity=0.110 Sum_probs=123.5 Q ss_pred CCHHHHHHCHHHHHHHCCCHHH---HHHHHHHCCCCEEEEECCCHHHH-----------HHHHHHHHHHCCCCEEE--EE Q ss_conf 8133433144476443299899---99999978998899811651774-----------99999999976996389--99 Q 537021.9.peg.7 10 SFVHLQVHSSYSLLEGALSLNN---ILDKIAADQQPAIAITDTNNLFS-----------ALEFSQKACAAGIQPII--GC 73 (1033) Q Consensus 10 ~FvhLh~hS~ySlldg~~~~e~---LV~~A~~~G~~AiAITD~~nl~G-----------av~F~~ackk~gIKPIi--G~ 73 (1033) +|+.|||||.||+.||..++++ +|++|++.|++++|||||....+ ..+.....++.+.+..+ |. T Consensus 4 ~miDlH~Ht~~s~~DG~~s~ee~~~~~~~A~~~G~~~i~iTDH~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (247) T 2wje_A 4 GMIDIHSHIVFDVDDGPKSREESKALLAESYRQGVRTIVSTSHRRKGMFETPEEKIAENFLQVREIAKEVASDLVIAYGA 83 (247) T ss_dssp CEEECCBCCSTTSSSSCSSHHHHHHHHHHHHHTTEEEEECCCEEBTTTBCCCHHHHHHHHHHHHHHHHHHCTTCEEECCC T ss_pred CEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEECC T ss_conf 05887676778877898999999999999998799999989899877777868999999999999999737870673030 Q ss_pred EEEEECCCCCCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCC Q ss_conf 99985476643344565666677539999569689999999998997324877874007899986348988998197679 Q 537021.9.peg.7 74 QLDIDMQDNLKMTNQGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSG 153 (1033) Q Consensus 74 E~~v~~~~~~~~~~~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g 153 (1033) +..+....... ...........+...++-. .+...+....+.+......|.+...+... T Consensus 84 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~g~~~~laHp~- 142 (247) T 2wje_A 84 EIYYTPDVLDK-LEKKRIPTLNDSRYALIEF-------------------SMNTPYRDIHSALSKILMLGITPVIAHIE- 142 (247) T ss_dssp EEECCTHHHHH-HHTTCSCCGGGSSEEEEEC-------------------CTTCCHHHHHHHHHHHHTTTCEEEETTGG- T ss_pred CEEECCHHHHH-HHHHHHHCCCCEEEEEEEC-------------------CCCCCHHHHHHHHHHHHHCCCCCCCCCCC- T ss_conf 00103112443-0035550579839999967-------------------88776567899999999689966415663- Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHHCCCCEEEEEECCC-------HHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHH Q ss_conf 6788887799789999999998765986899994399-------115899999999999818989980883007878899 Q 537021.9.peg.7 154 PIDRAFFFNSSQVAEKRLLTFKKLFGDRLYVNLQRHR-------GYDRYRESQVVQLAYTHELPLVATNNSLFLSEEDYE 226 (1033) Q Consensus 154 ~i~~~l~~~~~~~a~~~l~~l~~~Fgd~~ylEl~~~~-------~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~ 226 (1033) +...... ..+.+.. +...+.++|+.... +.-......+++.+.+.|+|+++++|+|.++..++. T Consensus 143 ---r~~~~~~---~~~~~~~---~~~~~~~~ein~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~i~igSDAH~~~~~~~~ 213 (247) T 2wje_A 143 ---RYDALEN---NEKRVRE---LIDMGCYTQVNSSHVLKPKLFGERYKFMKKRAQYFLEQDLVHVIASDMHNLDGRPPH 213 (247) T ss_dssp ---GCGGGTT---CHHHHHH---HHHTTCEEEEEHHHHSCCCSSCCSCHHHHHHHHHHHHTTCCSEEECCBCCSSSSCCC T ss_pred ---CHHHHHH---CHHHHHH---HHHCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCCCCCCCHH T ss_conf ---0666652---6888999---986897588642443334567774599999999999789909998589984346816 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCHHHHHHHHHHHH Q ss_conf 998766884698576531002453432343478875122035899878864 Q 537021.9.peg.7 227 AHDVLMAVAHSTVVSQKDRPRVTPDHYLKNRSEMVSIFSDLPEALENTVEI 277 (1033) Q Consensus 227 ~h~iL~aI~~~~~l~~~~~~~~~~~~yLks~eEM~~~f~~~peAi~NT~~I 277 (1033) ..+++.++... ...+++.++|.+-|++|-|+..| T Consensus 214 ~~~a~~~l~~~-----------------~g~~~~~~~~~~np~~i~~~~~i 247 (247) T 2wje_A 214 MAEAYDLVTQK-----------------YGEAKAQELFIDNPRKIVMDQLI 247 (247) T ss_dssp HHHHHHHHHHH-----------------HCHHHHHHHHTHHHHHHHTTCCC T ss_pred HHHHHHHHHHH-----------------CCHHHHHHHHHHHHHHHHCCCCC T ss_conf 99999999987-----------------39999999999849999776919 No 10 >2anu_A Hypothetical protein TM0559; predicted metal-dependent phosphoesterase (PHP family), structural genomics; 2.40A {Thermotoga maritima MSB8} SCOP: c.6.3.1 Probab=98.81 E-value=2.7e-08 Score=86.86 Aligned_cols=166 Identities=14% Similarity=0.168 Sum_probs=93.4 Q ss_pred CCCCCC--CHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHH--------------------- Q ss_conf 668988--133433144476443299899999999789988998116517749999999--------------------- Q 537021.9.peg.7 5 LEKSPS--FVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQK--------------------- 61 (1033) Q Consensus 5 ~~~~~~--FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~a--------------------- 61 (1033) |..... ++-||+||.|| ||..+|+++|++|++.|++++|||||+++.+..++... T Consensus 13 ~~~~~~w~~aDLH~Hs~~S--Dg~~~~~e~v~~A~~~Gld~iaiTDH~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 90 (255) T 2anu_A 13 MKTDTEWLLCDFHVHTNMS--DGHLPLGEVVDLFGKHGVDVVSITDHIVDRRTLEQRKRNGEPLGAITEDKFQDYLKRLW 90 (255) T ss_dssp ----CEEEEEEEEECCTTT--TCSSCHHHHHHHHHHTTCSEEEEEEEEECHHHHHHHHHTTCCCCSCCSTTHHHHHHHHH T ss_pred CCCCCCEEEEEEEECCCCC--CCCCCHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 1048987857703786864--89899999999999859978999347861326888752121133212104668888899 Q ss_pred ------HHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHH Q ss_conf ------99769963899999985476643344565666677539999569689999999998997324877874007899 Q 537021.9.peg.7 62 ------ACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLLVSTAEGYQRLIELVSRMYLLDQGKQSVRICLSW 135 (1033) Q Consensus 62 ------ckk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlLAkN~~Gy~nL~kL~S~a~~~~~~~~~p~i~~~~ 135 (1033) ....++..+.|.|...... ...+++.+..++. .+..+... T Consensus 91 ~~~~~~~~~~~~~~~~g~e~~~~~~-----------------~~~~~~~~~~~~~-----------------~~~~~~~~ 136 (255) T 2anu_A 91 REQKRAWEEYGMILIPGVEITNNTD-----------------LYHIVAVDVKEYV-----------------DPSLPVEE 136 (255) T ss_dssp HHHHHHHHHHSCEEEEEEEEEETTT-----------------TEEEEEESCCSCC-----------------CTTSCHHH T ss_pred HHHHHHHHHCCCCCCCCCEECCCCC-----------------CEEEEECCCCCCC-----------------CCCCCHHH T ss_conf 9999998654743147703114456-----------------3157404763457-----------------87666568 Q ss_pred H-HHH-CCCCEEEECCCCCCHHHHHHHCCCHHHHHHHHHHHHHHCCCC-EEEEEECCCHHHHHHHHHHHHHHHHHCCCEE Q ss_conf 9-863-489889981976796788887799789999999998765986-8999943991158999999999998189899 Q 537021.9.peg.7 136 L-QEI-GTEGLIMLTGGSSGPIDRAFFFNSSQVAEKRLLTFKKLFGDR-LYVNLQRHRGYDRYRESQVVQLAYTHELPLV 212 (1033) Q Consensus 136 L-~e~-~~egLIvlsg~~~g~i~~~l~~~~~~~a~~~l~~l~~~Fgd~-~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlV 212 (1033) + ... ..++++++-. +............. .......+... --+|+.... .+..+++..++|+| T Consensus 137 ~~~~~~~~g~~~i~aH-p~~~~~~~~~~~~~------~~~~~~~~~~~~d~iE~~n~~--------~~~~~~~~~~~~~v 201 (255) T 2anu_A 137 IVEKLKEQNALVIAAH-PDRKKQDEEHLSWY------LWANMERFKDTFDAWEIANRD--------DLFNSVGVKKYRYV 201 (255) T ss_dssp HHHHHHHTTCEEEECC-CCTTC-----CCCH------HHHSTTTTTTTCSEEEEEETT--------EECHHHHHTTCCEE T ss_pred HHHHHHHCCCEEEECC-HHHCCCCCHHHHHH------HHHHHHHHHCCCEEEEECCHH--------HHHHHHHHCCCCEE T ss_conf 9999997799788658-10002660112343------677787664275099971779--------99999886099879 Q ss_pred EECCCCCCC Q ss_conf 808830078 Q 537021.9.peg.7 213 ATNNSLFLS 221 (1033) Q Consensus 213 aTndv~Y~~ 221 (1033) +++|.|++. T Consensus 202 ~gSDaH~~~ 210 (255) T 2anu_A 202 ANSDFHELW 210 (255) T ss_dssp EECCBCSGG T ss_pred EECCCCCCC T ss_conf 768988841 No 11 >2w9m_A Polymerase X; SAXS, DNA repair, DNA polymerase, DNA replication; 2.46A {Deinococcus radiodurans} Probab=98.71 E-value=1.9e-07 Score=79.61 Aligned_cols=20 Identities=5% Similarity=0.064 Sum_probs=9.4 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 89998888636850355233421 Q 537021.9.peg.7 803 KKFCQDARQFNIQIMPPSVNTPC 825 (1033) Q Consensus 803 ~~~i~e~~~~gi~vl~PdIN~S~ 825 (1033) ..+++.|++.|+.+ .||.|. T Consensus 473 e~v~~~~~~~g~~l---EINts~ 492 (578) T 2w9m_A 473 DAVLGACEANGTVV---EINANA 492 (578) T ss_dssp HHHHHHHHHHTCEE---EEECST T ss_pred HHHHHHHHHHCCEE---EEECCC T ss_conf 99999999819999---994999 No 12 >2edu_A Kinesin-like protein KIF22; kinesin-like DNA binding domain, helix turn helix motif, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: a.60.2.7 Probab=96.46 E-value=0.0016 Score=46.51 Aligned_cols=51 Identities=25% Similarity=0.430 Sum_probs=43.6 Q ss_pred EHHHHCCCCHHHHHHHHHHH-HCCCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCC Q ss_conf 02685896989999999998-3799999789997348247998999988867672 Q 537021.9.peg.7 838 SLAAIKGVGTTTARHIMEAS-ADKPFDSLEDFCSRVDSKKLNRRVLESLVFAGAL 891 (1033) Q Consensus 838 Gl~~Ikgvg~~~~~~Ive~r-~~g~f~sl~df~~rv~~~~~nk~~le~Li~aGAf 891 (1033) =|..++|||.+.++.||+.| ++|||.|++|+.. | ++++.+.++.+.+++.+ T Consensus 41 eL~~lpgIg~~~A~~Iv~~R~~~G~f~sledL~~-v--~Gi~~k~~eki~k~~~L 92 (98) T 2edu_A 41 DLRSLQRIGPKKAQLIVGWRELHGPFSQVEDLER-V--EGITGKQMESFLKANIL 92 (98) T ss_dssp HHHHSTTCCHHHHHHHHHHHHHHCCCSSGGGGGG-S--TTCCHHHHHHHHHHHHH T ss_pred HHHHCCCCCHHHHHHHHHHHHHCCCCCCHHHHHC-C--CCCCHHHHHHHHHCCCE T ss_conf 9964799899999999999998599288999844-8--99899999999983875 No 13 >2i5h_A Hypothetical protein AF1531; PFAM:DUF655, PSI-2, structural genomics, protein structure initiative; 1.74A {Archaeoglobus fulgidus} SCOP: e.71.1.1 Probab=95.57 E-value=0.0078 Score=40.62 Aligned_cols=36 Identities=36% Similarity=0.659 Sum_probs=33.3 Q ss_pred EHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCC Q ss_conf 026858969899999999983799999789997348 Q 537021.9.peg.7 838 SLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVD 873 (1033) Q Consensus 838 Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~ 873 (1033) .|.-+.|||.+.+..|+++|+.+||+||+|+-.||. T Consensus 133 ~leLLPGIGKK~~~~IleeR~k~~FeSFedi~~Rv~ 168 (205) T 2i5h_A 133 QLELLPGVGKKMMWAIIEERKKRPFESFEDIAQRVK 168 (205) T ss_dssp GGGGSTTCCHHHHHHHHHHHHHSCCCSHHHHHHHST T ss_pred HHHHCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHC T ss_conf 887523505899999999965589889999998844 No 14 >2k50_A Replication factor A related protein; uncharacterized protein, structural genomics, PSI-2; NMR {Methanothermobacterthermautotrophicus str} Probab=95.13 E-value=0.043 Score=34.40 Aligned_cols=50 Identities=20% Similarity=0.296 Sum_probs=40.5 Q ss_pred HHHHHCCCCCCEEEEEEEEEEE---EEECCCCCC--EEEEEEEECCCCEEEEEEC Q ss_conf 8898505589889999999611---210158887--6999999849704999953 Q 537021.9.peg.7 983 YEESVATIRGNNIRLAAMVVSK---QQKKTRKGS--RIGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 983 ~~~~~~~~~g~~v~iaG~V~~i---~~~~TKkG~--~mafltleD~~g~~e~~iF 1032 (1033) +..+.++..|..+.|.|.|+++ ++..+|+|+ .+.-++|.|.||.+.+++| T Consensus 7 i~~I~~l~~g~~v~v~g~V~~~~~~r~~~~k~G~~~~~~~i~l~D~TG~I~vt~W 61 (115) T 2k50_A 7 MDTISKLEEGAETPVTGRVMKISSPRTFTTRKGREGKLANVIIADDTGELRAVFW 61 (115) T ss_dssp CCCTTTCCTTCEEEEEEEEEEECCCEECCCTTSSCCEEEEEEEEETTEEEEEEEE T ss_pred CCCHHHCCCCCEEEEEEEEEECCCCEEEECCCCCEEEEEEEEEECCCCEEEEEEE T ss_conf 4668677899876799999985697689917998968999999849997999995 No 15 >1s5l_U Photosystem II 12 kDa extrinsic protein; photosynthesis, oxygen-evolving, tetra- manganese, membrane; HET: CL1 PHO HEM PL9 LMT BCR; 3.50A {Thermosynechococcus elongatus} Probab=93.59 E-value=0.064 Score=32.91 Aligned_cols=53 Identities=23% Similarity=0.332 Sum_probs=41.3 Q ss_pred CCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHH Q ss_conf 523342100353198698502685896989999999998379999978999734824799899998886 Q 537021.9.peg.7 819 PSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLVF 887 (1033) Q Consensus 819 PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li~ 887 (1033) .|||.... --|..++|||...++.||+ +|||+|++|+.. .++++.+++|.|-. T Consensus 55 idlN~A~~----------~eL~~lpGig~~~A~~Iv~---~gpf~svedl~~---v~Gig~~~~e~l~~ 107 (134) T 1s5l_U 55 IDLNNTNI----------AAFIQYRGLYPTLAKLIVK---NAPYESVEDVLN---IPGLTERQKQILRE 107 (134) T ss_dssp EETTTSCG----------GGGGGSTTCTHHHHHHHHH---TCCCSSGGGGGG---CTTCCHHHHHHHHH T ss_pred EECCCCCH----------HHHHHHHHHHHHHHHHHHH---CCCCCCHHHHHC---CCCCCHHHHHHHHH T ss_conf 41640789----------9997710346999999998---278487999961---77579999999998 No 16 >3i7f_A Aspartyl-tRNA synthetase; tRNA ligase, APO, ATP-binding, aminoacyl-tRNA synthetase, ligase, nucleotide-binding, protein biosynthesis; 2.80A {Entamoeba histolytica} Probab=93.44 E-value=0.16 Score=29.44 Aligned_cols=13 Identities=23% Similarity=0.516 Sum_probs=9.0 Q ss_pred CCHHHHEECCCCC Q ss_conf 6666620012367 Q 537021.9.peg.7 537 RPLSQLVPMYRDV 549 (1033) Q Consensus 537 ~~l~~~~p~~~~~ 549 (1033) +.+.++.|..++. T Consensus 311 ~rVfeI~~~FR~E 323 (548) T 3i7f_A 311 RKVFEVGPVFRAE 323 (548) T ss_dssp CEEEEEEEECCCS T ss_pred CCCCCCCEEEECC T ss_conf 3234434056645 No 17 >3e0e_A Replication protein A; structural genomics, PSI-2, protein structure initiative, northeast structural genomics consortium, NESG; 1.60A {Methanococcus maripaludis} PDB: 2k5v_A Probab=93.44 E-value=0.24 Score=28.06 Aligned_cols=47 Identities=9% Similarity=0.135 Sum_probs=37.2 Q ss_pred HHCCCCCCEEEEEEEEEEEEE---ECCCCCCE--EEEEEEECCCCEEEEEEC Q ss_conf 850558988999999961121---01588876--999999849704999953 Q 537021.9.peg.7 986 SVATIRGNNIRLAAMVVSKQQ---KKTRKGSR--IGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 986 ~~~~~~g~~v~iaG~V~~i~~---~~TKkG~~--mafltleD~~g~~e~~iF 1032 (1033) +.++..|..+.+.|.|+++.. ..+|.|.+ +.-+.+.|.||.+.+++| T Consensus 5 I~~L~~g~~v~i~g~V~~~~~~r~~~~k~g~~~~v~~~~i~D~TG~i~~~~W 56 (97) T 3e0e_A 5 ISELMPNLSGTINAEVVTAYPKKEFSRKDGTKGQLKSLFLKDDTGSIRGTLW 56 (97) T ss_dssp GGGCCTTEEEEEEEEEEEECCCEEEC----CCEEEEEEEEEETTEEEEEEEE T ss_pred HHHCCCCCEEEEEEEEEECCCCEEEECCCCCEEEEEEEEEECCCCCEEEEEE T ss_conf 8998999838899999981475798847997308999999849980999996 No 18 >3bz1_U Photosystem II 12 kDa extrinsic protein; electron transport photosystem, membrane complex, transmembrane alpha-helix; HET: CLA PHO HEM PL9 BCR DGD LHG SQD LMG LMT; 2.90A {Thermosynechococcus elongatus} PDB: 2axt_U* 3bz2_U* 3kzi_U* 3a0b_U* 3a0h_U* Probab=93.32 E-value=0.066 Score=32.82 Aligned_cols=54 Identities=22% Similarity=0.295 Sum_probs=41.7 Q ss_pred CCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHH Q ss_conf 5523342100353198698502685896989999999998379999978999734824799899998886 Q 537021.9.peg.7 818 PPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLVF 887 (1033) Q Consensus 818 ~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li~ 887 (1033) +.|||... ..-|..|+|||...+++||+ +|||+|++|+... +++..+.++.+-. T Consensus 24 kidlN~As----------~~eL~~lpGig~~~A~~Iv~---~gpf~s~~dL~~V---~Gig~~~~e~ik~ 77 (104) T 3bz1_U 24 KIDLNNTN----------IAAFIQYRGLYPTLAKLIVK---NAPYESVEDVLNI---PGLTERQKQILRE 77 (104) T ss_dssp BEETTSSC----------GGGGGGSTTTTHHHHHHHHH---SCCCSSGGGGGGC---TTCCHHHHHHHHH T ss_pred CEECCCCC----------HHHHHHCCCCCHHHHHHHHH---CCCCCCHHHHHCC---CCCCHHHHHHHHH T ss_conf 35784089----------99996589979999999997---4997989999638---9989999999998 No 19 >3dm3_A Replication factor A; probably plays AN essential for replication of the chromosome, DNA recombination and repair; 2.40A {Methanocaldococcus jannaschii} Probab=93.29 E-value=0.17 Score=29.26 Aligned_cols=48 Identities=13% Similarity=0.102 Sum_probs=38.3 Q ss_pred HHHCCCCCCEEEEEEEEEEEEE---ECCCCCCE--EEEEEEECCCCEEEEEEC Q ss_conf 9850558988999999961121---01588876--999999849704999953 Q 537021.9.peg.7 985 ESVATIRGNNIRLAAMVVSKQQ---KKTRKGSR--IGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 985 ~~~~~~~g~~v~iaG~V~~i~~---~~TKkG~~--mafltleD~~g~~e~~iF 1032 (1033) ++.++..|..+.+.|.|+++.. ..+++|.. ++-+.+.|.||.+.+++| T Consensus 7 ~I~~L~~g~~v~v~g~V~~~~~~r~~~~~~G~~~~~~~~~i~D~tg~i~~t~W 59 (105) T 3dm3_A 7 NIGELSPGMTATFEGEVISALPIKEFKRADGSIGKLKSFIVRDETGSIRVTLW 59 (105) T ss_dssp CGGGCCSSEEEEEEEEEEEEEEEEEEECTTSCEEEEEEEEEEETTEEEEEEEE T ss_pred CHHHCCCCCEEEEEEEEEECCCCEEEECCCCCEEEEEEEEEECCCCCEEEEEE T ss_conf 08798999878999999992587798858998889999999929993999997 No 20 >3m4p_A Ehasnrs, asparaginyl-tRNA synthetase, putative; aminoacyl-tRNA synthetase, tRNA ligase, AARS, translation, ATP-binding, nucleotide-binding; HET: 4AD; 2.83A {Entamoeba histolytica} PDB: 3m4q_A Probab=92.86 E-value=0.32 Score=27.02 Aligned_cols=14 Identities=21% Similarity=0.344 Sum_probs=6.4 Q ss_pred CCCCCCCCHHHHHH Q ss_conf 53432343478875 Q 537021.9.peg.7 249 TPDHYLKNRSEMVS 262 (1033) Q Consensus 249 ~~~~yLks~eEM~~ 262 (1033) +...||.++.|+.. T Consensus 201 ~~~~~L~~spql~k 214 (456) T 3m4p_A 201 NEPAYLTQSSQLYL 214 (456) T ss_dssp TEEEEECSCCHHHH T ss_pred CHHHHCCCCHHHHH T ss_conf 11132038788751 No 21 >1e1o_A Lysyl-tRNA synthetase; ligase, aminoacyl-tRNA synthetase, protein biosynthesis; HET: LYS; 2.12A {Escherichia coli} SCOP: b.40.4.1 d.104.1.1 PDB: 1e1t_A* 1e22_A* 1e24_A* 1lyl_A 1bbu_A* 1bbw_A 1krs_A 1krt_A Probab=92.75 E-value=0.35 Score=26.67 Aligned_cols=29 Identities=10% Similarity=0.078 Sum_probs=13.7 Q ss_pred CCHHHHEECCCCCCCC-CEEEEEECHHHHH Q ss_conf 6666620012367688-6688510213343 Q 537021.9.peg.7 537 RPLSQLVPMYRDVRSD-LPVTQFNMKWIEK 565 (1033) Q Consensus 537 ~~l~~~~p~~~~~~~~-~~~~q~d~~~~e~ 565 (1033) +.+.++.|..++-..+ .-.+.|.|=++|. T Consensus 252 ~rvfei~~~FR~E~~~trH~~EFt~le~e~ 281 (504) T 1e1o_A 252 ERVFEINRNFRNEGISVRHNPEFTMMELYM 281 (504) T ss_dssp CEEEEEEEEECCCCCCC-CCSEEEEEEEEE T ss_pred CHHEEECCEECCCCCCCCCCCCEEEEEEEE T ss_conf 122255301547767766896355533100 No 22 >3a74_A Lysyl-tRNA synthetase; aminoacyl tRNA synthetase, ligase, protein biosynthesis, AMI tRNA synthetase, ATP-binding, magnesium; HET: B4P LYN; 1.80A {Geobacillus stearothermophilus} PDB: 3e9h_A* 3e9i_A* Probab=91.48 E-value=0.54 Score=25.08 Aligned_cols=12 Identities=8% Similarity=0.379 Sum_probs=6.5 Q ss_pred CCHHHHEECCCC Q ss_conf 666662001236 Q 537021.9.peg.7 537 RPLSQLVPMYRD 548 (1033) Q Consensus 537 ~~l~~~~p~~~~ 548 (1033) +.+.++.|..++ T Consensus 243 ~rvfqI~~~FR~ 254 (493) T 3a74_A 243 EKVYEIGRVFRN 254 (493) T ss_dssp CEEEEEEEEECC T ss_pred CCEEEEECEECC T ss_conf 547897000357 No 23 >2duy_A Competence protein comea-related protein; helix-hairpin-helix, structural genomics, NPPSFA; 1.75A {Thermus thermophilus HB8} SCOP: a.60.2.7 Probab=91.16 E-value=0.1 Score=31.26 Aligned_cols=55 Identities=33% Similarity=0.549 Sum_probs=42.1 Q ss_pred CCCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 03552334210035319869850268589698999999999837999997899973482479989999888 Q 537021.9.peg.7 816 IMPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLV 886 (1033) Q Consensus 816 vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li 886 (1033) .-|-|||..... -|..++|||...++.|++.| ||+|++||... ++++.+.++.+- T Consensus 16 ~~~idiN~As~~----------eL~~lpGig~~~A~~Iv~~R---~f~s~~dL~~v---~gi~~~~~~~i~ 70 (75) T 2duy_A 16 QTPVSLNEASLE----------ELMALPGIGPVLARRIVEGR---PYARVEDLLKV---KGIGPATLERLR 70 (75) T ss_dssp GCSEETTTCCHH----------HHTTSTTCCHHHHHHHHHTC---CCSSGGGGGGS---TTCCHHHHHHHG T ss_pred CCCEECCCCCHH----------HHHHCCCCCHHHHHHHHHCC---CCCCHHHHHHC---CCCCHHHHHHHH T ss_conf 787007128799----------99777898999999999858---98989999757---898999999999 No 24 >3bju_A Lysyl-tRNA synthetase; aminoacyl-tRNA synthetase, ATP- binding, cytoplasm, ligase, nucleotide-binding, phosphoprotein, polymorphism; HET: LYS ATP; 2.31A {Homo sapiens} Probab=90.97 E-value=0.77 Score=23.80 Aligned_cols=12 Identities=8% Similarity=0.279 Sum_probs=6.8 Q ss_pred CCHHHHEECCCC Q ss_conf 666662001236 Q 537021.9.peg.7 537 RPLSQLVPMYRD 548 (1033) Q Consensus 537 ~~l~~~~p~~~~ 548 (1033) +.+.++.|..++ T Consensus 245 ~rVfeIg~~FRa 256 (521) T 3bju_A 245 DRVYEIGRQFRN 256 (521) T ss_dssp CEEEEEEEEECC T ss_pred CCCEEEHHHHCC T ss_conf 302011334137 No 25 >3kf6_A Protein STN1; OB fold, chromosomal protein, DNA-binding, nucleus, telomere; 1.65A {Schizosaccharomyces pombe} Probab=90.88 E-value=0.63 Score=24.51 Aligned_cols=36 Identities=8% Similarity=0.029 Sum_probs=30.5 Q ss_pred CCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCC-EEEEEEC Q ss_conf 9889999999611210158887699999984970-4999953 Q 537021.9.peg.7 992 GNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAK-EYEANFF 1032 (1033) Q Consensus 992 g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g-~~e~~iF 1032 (1033) -+.|++.|.|++++++.+ ..+++++|.|| .+++.+| T Consensus 47 I~~V~ivG~Vvsv~~~~~-----~~~y~IDDgTG~~I~~~~w 83 (159) T 3kf6_A 47 IRWIQIVGYIAAIDIYEG-----KHVLTVDDCSGMVLRVVFI 83 (159) T ss_dssp ECEEEEEEEEEEEEEETT-----EEEEEEECSSSCEEEEEEE T ss_pred EEEEEEEEEEEEEEEECC-----EEEEEEECCCCCCEEEEEE T ss_conf 899999999999999456-----6999998899991799998 No 26 >3ebe_A Protein MCM10 homolog; OB-fold, zinc finger, CCCH, DNA replication, metal-binding, nucleus, zinc, zinc-finger; 2.30A {Xenopus laevis} PDB: 3h15_A Probab=88.99 E-value=1.1 Score=22.61 Aligned_cols=41 Identities=7% Similarity=0.146 Sum_probs=20.9 Q ss_pred CCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCC---EEEEEEC Q ss_conf 9889999999611210158887699999984970---4999953 Q 537021.9.peg.7 992 GNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAK---EYEANFF 1032 (1033) Q Consensus 992 g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g---~~e~~iF 1032 (1033) +.+|.++.++.....+.+++|+++.+.++.|..+ .+++.+| T Consensus 56 ~dwv~~Gvv~~Ks~pk~s~~gk~y~~w~l~dL~~~~~~v~lfLF 99 (200) T 3ebe_A 56 EDWVTFGVIVKKITPQSSNNGKTFSIWRLNDLKDLDKYISLFLF 99 (200) T ss_dssp SCEEEEEEEEEEECC-------CCEEEEEECSSSTTCCEEEEEC T ss_pred CCEEEEEEEEECCCCEECCCCCCEEEEEEECCCCCCEEEEEEEE T ss_conf 98899999996377207469970899998617898618999984 No 27 >2zxd_A Alpha-L-fucosidase, putative; TIM barrel, hydrolase; HET: ZXD; 2.15A {Thermotoga maritima} PDB: 2zwy_A* 2zx5_A* 2zx6_A* 2zx7_A* 2zwz_A* 2zx9_A* 2zxa_A* 2zxb_A* 2zx8_A* 1hl9_A* 1hl8_A* 1odu_A* Probab=87.64 E-value=1.2 Score=22.09 Aligned_cols=44 Identities=14% Similarity=0.061 Sum_probs=32.2 Q ss_pred CCHHHHHHHHHHCCCCEEEEE---------------CCCHHHH------HHHHHHHHHHCCCCEE Q ss_conf 998999999997899889981---------------1651774------9999999997699638 Q 537021.9.peg.7 27 LSLNNILDKIAADQQPAIAIT---------------DTNNLFS------ALEFSQKACAAGIQPI 70 (1033) Q Consensus 27 ~~~e~LV~~A~~~G~~AiAIT---------------D~~nl~G------av~F~~ackk~gIKPI 70 (1033) ..+++.|+.|++.||+.+.|| |+|.+.. +-++..||+++|||.- T Consensus 105 fda~~W~~~ak~AGaky~vlTaKHHDGF~lwdSk~t~~n~~~~~~krDiv~el~~A~rk~Glk~G 169 (455) T 2zxd_A 105 WDPQEWADLFKKAGAKYVIPTTKHHDGFCLWGTKYTDFNSVKRGPKRDLVGDLAKAVREAGLRFG 169 (455) T ss_dssp CCHHHHHHHHHHTTCSEEEEEEECTTCCBSSCCSSCSCBTTTSTTCSCHHHHHHHHHHHTTCEEE T ss_pred CCHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCEEE T ss_conf 89999999999849976997677337742368999998776889897379999999986698489 No 28 >1o7i_A SSB, SSO2364, single stranded DNA binding protein; OB fold; 1.2A {Sulfolobus solfataricus} SCOP: b.40.4.3 Probab=87.14 E-value=1.3 Score=21.75 Aligned_cols=48 Identities=10% Similarity=0.132 Sum_probs=35.3 Q ss_pred HHCCCCCC-EEEEEEEEEEEE---EECCCCCC-EEEEEEEECCCCEEEEEECC Q ss_conf 85055898-899999996112---10158887-69999998497049999539 Q 537021.9.peg.7 986 SVATIRGN-NIRLAAMVVSKQ---QKKTRKGS-RIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 986 ~~~~~~g~-~v~iaG~V~~i~---~~~TKkG~-~mafltleD~~g~~e~~iFP 1033 (1033) +.++..|. .+.+.|.|+++. +..+|.|. .++.+.+.|+||.+.+++|. T Consensus 5 I~dL~p~~~~v~i~~~V~~~~~~r~~~~k~g~~~v~~~~i~D~TG~i~~tlW~ 57 (119) T 1o7i_A 5 VGNLKPNMESVNVTVRVLEASEARQIQTKNGVRTISEAIVGDETGRVKLTLWG 57 (119) T ss_dssp GGGCCTTCSSEEEEEEEEEECCCEEECCTTCCEEEEEEEEEETTEEEEEEEEG T ss_pred HHHCCCCCCCEEEEEEEEECCCCCEEECCCCCEEEEEEEEECCCCEEEEEEEC T ss_conf 77689999878999999988787168738998899999998699879998805 No 29 >1gm5_A RECG; helicase, replication restart; HET: DNA ADP; 3.24A {Thermotoga maritima} SCOP: a.24.21.1 b.40.4.9 c.37.1.19 c.37.1.19 Probab=86.92 E-value=0.37 Score=26.49 Aligned_cols=30 Identities=13% Similarity=0.149 Sum_probs=15.8 Q ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHCCCCCC Q ss_conf 6777510023542025899988886368503 Q 537021.9.peg.7 787 FLAASMTLEMDNVEKIKKFCQDARQFNIQIM 817 (1033) Q Consensus 787 F~aa~L~~~~~~~~~~~~~i~e~~~~gi~vl 817 (1033) +-.+++...+.. +.....+++-++-.+.|| T Consensus 614 ~~V~~lHG~m~~-~eR~~im~~Fr~Gk~~IL 643 (780) T 1gm5_A 614 FKLGLMHGRLSQ-EEKDRVMLEFAEGRYDIL 643 (780) T ss_dssp -CBCCCCSSSCC-SCSHHHHHHHTTTSSSBC T ss_pred CEEEEEECCCCH-HHHHHHHHHHHCCCCCEE T ss_conf 559998278999-999999999985999999 No 30 >3kf8_A Protein STN1; OB fold; 2.40A {Candida tropicalis mya-3404} Probab=86.16 E-value=1.5 Score=21.36 Aligned_cols=43 Identities=14% Similarity=0.101 Sum_probs=30.2 Q ss_pred CCCCCEEEEEEEEEEEEEECCC---CCCEEEEEEEECCCC---EEEEEE Q ss_conf 5589889999999611210158---887699999984970---499995 Q 537021.9.peg.7 989 TIRGNNIRLAAMVVSKQQKKTR---KGSRIGWVTFSEPAK---EYEANF 1031 (1033) Q Consensus 989 ~~~g~~v~iaG~V~~i~~~~TK---kG~~mafltleD~~g---~~e~~i 1031 (1033) ...-++|+|.|.|+.++.+.-. .++.+.|+|+.|.+| .+.+-+ T Consensus 82 N~PI~~VrIvG~VVg~~yk~~~~~~~~~~y~iltIDDsSG~~s~i~vk~ 130 (220) T 3kf8_A 82 NYPVNQINIFGKIVYEQYKEKEFNGVEESYVILVISDFIGIDSKIRVRL 130 (220) T ss_dssp TEEECEEEEEEEEEEEEEECCBCSSCBCCEEEEEEECCCSSSCEEEEEE T ss_pred CCCEEEEEEEEEEEEEEEECCCCCCCCCCEEEEEEECCCCCCEEEEEEE T ss_conf 7651899999999999984265566665349999956999946899995 No 31 >3gza_A Putative alpha-L-fucosidase; NP_812709.1, structural genomics, joint center for structural genomics, JCSG; HET: MSE EPE; 1.60A {Bacteroides thetaiotaomicron vpi-5482} Probab=84.27 E-value=1.8 Score=20.71 Aligned_cols=50 Identities=18% Similarity=0.184 Sum_probs=38.7 Q ss_pred HCCCHHHHHHHHHHCCCCEEEEE---------------CCCHHH----H-----HHHHHHHHHHCCCCEEEEEEEEEE Q ss_conf 32998999999997899889981---------------165177----4-----999999999769963899999985 Q 537021.9.peg.7 25 GALSLNNILDKIAADQQPAIAIT---------------DTNNLF----S-----ALEFSQKACAAGIQPIIGCQLDID 78 (1033) Q Consensus 25 g~~~~e~LV~~A~~~G~~AiAIT---------------D~~nl~----G-----av~F~~ackk~gIKPIiG~E~~v~ 78 (1033) ....+++.|+.|++.|++.+.+| |++... | +-++.+||+++|||. | +|+. T Consensus 57 ~~fD~~qWv~~~k~aGakyvvltaKHHDGF~lW~S~~t~~~v~~s~~~~~krDiv~el~~A~rk~Glk~--G--~Y~S 130 (443) T 3gza_A 57 TELNTDQWVQAAKAAGCKFAVLTATHETGFGLWQSDVNPYCLKAVKWRDGKGDIVRDFVNSCRKYGLQP--G--IYIG 130 (443) T ss_dssp TTCCHHHHHHHHHTTTCSEEEEESCCSSCCBSSCCSSCSSBGGGSSGGGGTCCHHHHHHHHHHHHTCEE--E--EEEC T ss_pred CCCCHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCE--E--EEEE T ss_conf 428999999999986996899854536886673799999765556564888687999999998709846--9--9984 No 32 >3bzc_A TEX; helix-turn-helix, helix-hairpin-helix, S1 domain, YQGF domain, transcription, RNA binding protein; 2.27A {Pseudomonas aeruginosa} SCOP: a.60.2.6 a.60.2.6 a.294.1.1 b.40.4.5 c.55.3.13 PDB: 3bzk_A 2oce_A Probab=84.12 E-value=0.73 Score=23.96 Aligned_cols=73 Identities=22% Similarity=0.307 Sum_probs=51.1 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHHHH-HCCCCCCHHHHHHHCCCCCCC Q ss_conf 0258999888863685035523342100353198698502685896989999999998-379999978999734824799 Q 537021.9.peg.7 800 EKIKKFCQDARQFNIQIMPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIMEAS-ADKPFDSLEDFCSRVDSKKLN 878 (1033) Q Consensus 800 ~~~~~~i~e~~~~gi~vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r-~~g~f~sl~df~~rv~~~~~n 878 (1033) +.+..-+..+-..-+.-.+.|||.... --|..|-|+|...+..|++.| +||+|.|-.++.. + ..++ T Consensus 481 ~~L~~~L~~vv~~~VN~VGVDiN~As~----------~lL~~VsGLgprkA~~iv~~r~~~g~f~~R~~L~~-v--~~lG 547 (785) T 3bzc_A 481 LKLARSLDAVVEDCVNAVGVDVNTASA----------ALLARISGLNSTLAQNIVAHRDANGAFRTRDELKK-V--SRLG 547 (785) T ss_dssp HHHHHHHHHHHHHHHHHHCEETTTCCH----------HHHHTSTTCCHHHHHHHHHHHHHHCCCSSGGGGGG-S--TTCC T ss_pred HHHHHHHHHHHHHHHHCCCEEHHHHHH----------HHHHCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHH-C--CCCC T ss_conf 999999999999784045561886147----------77532578788999999999995799577999973-6--7866 Q ss_pred HHHHHHH Q ss_conf 8999988 Q 537021.9.peg.7 879 RRVLESL 885 (1033) Q Consensus 879 k~~le~L 885 (1033) .+.++.. T Consensus 548 ~k~F~q~ 554 (785) T 3bzc_A 548 EKTFEQA 554 (785) T ss_dssp HHHHHHH T ss_pred HHHHHHC T ss_conf 0217652 No 33 >2pi2_A Replication protein A 32 kDa subunit; FULL-length RPA14/32, ssDNA binding protein, OB-fold, dioxane, DNA binding protein; 2.00A {Homo sapiens} SCOP: b.40.4.3 PDB: 2z6k_A 1dpu_A 1z1d_A Probab=82.46 E-value=2.1 Score=20.17 Aligned_cols=32 Identities=16% Similarity=0.149 Sum_probs=12.5 Q ss_pred CCCCEEEECCCCHHHHEECCCCCCCC--CEEEEE Q ss_conf 47440335256666620012367688--668851 Q 537021.9.peg.7 527 THAAGIVIGDRPLSQLVPMYRDVRSD--LPVTQF 558 (1033) Q Consensus 527 ~Ha~Gvvi~~~~l~~~~p~~~~~~~~--~~~~q~ 558 (1033) +.-.|+|.+-+.-...+-.......| +.+.+| T Consensus 74 V~iVG~V~~v~~~~t~~~y~idDgTG~~i~~~~w 107 (270) T 2pi2_A 74 VTIVGIIRHAEKAPTNIVYKIDDMTAAPMDVRQW 107 (270) T ss_dssp EEEEEEEEEEEECSSEEEEEEECSSSSCEEEEEE T ss_pred EEEEEEEEEEEECCCEEEEEEECCCCCCEEEEEE T ss_conf 9999999688856867899998799882799996 No 34 >3kdf_D Replication protein A 32 kDa subunit; wheat GERM cell free, protein complex, center for eukaryotic structural genomics, PSI; HET: MSE; 1.98A {Homo sapiens} PDB: 2pqa_A 1quq_A 1l1o_B Probab=82.32 E-value=2.1 Score=20.13 Aligned_cols=37 Identities=3% Similarity=0.037 Sum_probs=29.2 Q ss_pred CCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCC-EEEEEECC Q ss_conf 9889999999611210158887699999984970-49999539 Q 537021.9.peg.7 992 GNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAK-EYEANFFP 1033 (1033) Q Consensus 992 g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g-~~e~~iFP 1033 (1033) -..|++.|.|.++....| ....+++|.|| .+++..|. T Consensus 31 v~~V~iVG~V~~v~~~~t-----~~~y~IdDgTG~~i~v~~w~ 68 (132) T 3kdf_D 31 ISQVTIVGIIRHAEKAPT-----NIVYKIDDMTAAPMDVRQWV 68 (132) T ss_dssp CCEEEEEEEEEEEEECSS-----EEEEEEECSSSSCEEEEEEC T ss_pred EEEEEEEEEEEEEEECCC-----EEEEEEECCCCCCEEEEEEC T ss_conf 999999999999998797-----89999988999948999977 No 35 >2k75_A Uncharacterized protein TA0387; closed beta barrel, OB fold, structural genomics, PSI-2, protein structure initiative; NMR {Thermoplasma acidophilum} Probab=82.17 E-value=2.1 Score=20.09 Aligned_cols=41 Identities=7% Similarity=0.126 Sum_probs=32.6 Q ss_pred CCEEEEEEEEEEEEEEC-CCCCCE--EEEEEEECCCCEEEEEEC Q ss_conf 98899999996112101-588876--999999849704999953 Q 537021.9.peg.7 992 GNNIRLAAMVVSKQQKK-TRKGSR--IGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 992 g~~v~iaG~V~~i~~~~-TKkG~~--mafltleD~~g~~e~~iF 1032 (1033) ...|.+.|.|+++..+. +|+|.. ++-+.+.|+||.+.+++| T Consensus 13 ~~~V~i~gkV~~i~~r~~~kdG~~~~v~~~~i~D~TG~ir~t~W 56 (106) T 2k75_A 13 TPYVSVIGKITGIHKKEYESDGTTKSVYQGYIEDDTARIRISSF 56 (106) T ss_dssp CSEEEEEEEEEEEEEEEEEETTEEEEEEEEEEECSSCEEEEEEE T ss_pred CCCEEEEEEEEEECCCEEEECCCEEEEEEEEEECCCCEEEEEEE T ss_conf 99579999995953678996997689999999889978999996 No 36 >3eyp_A Putative alpha-L-fucosidase; structural genomics, hydrolase, lipoprotein, PSI-2, protein structure initiative; 1.90A {Bacteroides thetaiotaomicron} Probab=80.18 E-value=2.4 Score=19.57 Aligned_cols=50 Identities=20% Similarity=0.211 Sum_probs=38.7 Q ss_pred CCCHHHHHHHHHHCCCCEEEEE---------------CCCHHH----H-----HHHHHHHHHHCCCCEEEEEEEEEEC Q ss_conf 2998999999997899889981---------------165177----4-----9999999997699638999999854 Q 537021.9.peg.7 26 ALSLNNILDKIAADQQPAIAIT---------------DTNNLF----S-----ALEFSQKACAAGIQPIIGCQLDIDM 79 (1033) Q Consensus 26 ~~~~e~LV~~A~~~G~~AiAIT---------------D~~nl~----G-----av~F~~ackk~gIKPIiG~E~~v~~ 79 (1033) ...|++.|+.|++.||+-+.|| |++... | +-+|.+||+++|||.- +|+.. T Consensus 53 ~fd~~~W~~~~k~aGaky~vltakHHDGF~lw~S~~t~~~~~~s~~~g~krDlv~el~~A~rk~Glk~G----~Y~S~ 126 (469) T 3eyp_A 53 ALDCRQWMQTLKAAGIPAAILTAKHADGFCLWPSKYTDYSVKNAAWKNGKGDVVREFVDACEEYGLKAG----IYLGP 126 (469) T ss_dssp SCCHHHHHHHHHHTTCCEEEEEEECTTCCBSSCCTTCSSBGGGSSGGGGTCCHHHHHHHHHHHHTCEEE----EEECS T ss_pred CCCHHHHHHHHHHCCCCEEEECCEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCEEE----EEECC T ss_conf 089999999999869968996521057566778999987654578778864499999999986197369----98152 No 37 >3dx5_A Uncharacterized protein ASBF; beta-alpha barrel, petrobactin synthesis, ASB locus, structural genomics, PSI-2, protein structure initiative; HET: MSE DHB TRS; 2.12A {Bacillus anthracis} Probab=79.96 E-value=2.3 Score=19.81 Aligned_cols=52 Identities=8% Similarity=-0.008 Sum_probs=39.0 Q ss_pred CHHHHHHHCCCHHHHHHHHHHCCCCEEEEEC-C-C-----HHHHHHHHHHHHHHCCCCE Q ss_conf 4447644329989999999978998899811-6-5-----1774999999999769963 Q 537021.9.peg.7 18 SSYSLLEGALSLNNILDKIAADQQPAIAITD-T-N-----NLFSALEFSQKACAAGIQP 69 (1033) Q Consensus 18 S~ySlldg~~~~e~LV~~A~~~G~~AiAITD-~-~-----nl~Gav~F~~ackk~gIKP 69 (1033) +.+||.....+++++++.|++.|+.+|-|.- + . .....-++..+++++||+. T Consensus 6 ~t~sf~~~~~sl~e~l~~a~~~G~~gIEl~~~~~~~~~~~~~~~~~~~~~~l~~~gl~i 64 (286) T 3dx5_A 6 CTISFRHQLISFTDIVQFAYENGFEGIELWGTHAQNLYMQEYETTERELNCLKDKTLEI 64 (286) T ss_dssp EGGGGTTSCCCHHHHHHHHHHTTCCEEEEEHHHHHHHHHHCHHHHHHHHHHTGGGTCCE T ss_pred EEHHHCCCCCCHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEE T ss_conf 62874368989999999999819999998255665345458788999999999659979 No 38 >2wvv_A Alpha-L-fucosidase; alpha-L-fucose, hydrolase, glycoside hydrolase family 29; 1.73A {Bacteroides thetaiotaomicron} PDB: 2xii_A* 2xib_A* 2wvv_B 2wvt_A* 2wvu_A* 2wvs_A* Probab=78.91 E-value=2.6 Score=19.28 Aligned_cols=45 Identities=11% Similarity=-0.018 Sum_probs=35.6 Q ss_pred CCCHHHHHHHHHHCCCCEEEEE---------------CCCHHHH------HHHHHHHHHHCCCCEE Q ss_conf 2998999999997899889981---------------1651774------9999999997699638 Q 537021.9.peg.7 26 ALSLNNILDKIAADQQPAIAIT---------------DTNNLFS------ALEFSQKACAAGIQPI 70 (1033) Q Consensus 26 ~~~~e~LV~~A~~~G~~AiAIT---------------D~~nl~G------av~F~~ackk~gIKPI 70 (1033) ...+++.|+.|++.|++-+.+| |+|.+.. +-+|.+||+++|||.- T Consensus 77 ~fd~~~W~~l~k~aGakY~v~takHHDGF~lwdS~~t~~n~~~~~~krDiv~el~~A~r~~Glk~g 142 (450) T 2wvv_A 77 KFDAKKWAKMAKEMGTKYVKITTKHHEGFCLWPSKYTKYTVANTPYKRDILGELVKAYNDEGIDVH 142 (450) T ss_dssp TCCHHHHHHHHHHHTCSEEEEEEECTTCCBSSCCTTCSCBGGGSTTCSCHHHHHHHHHHHTTCEEE T ss_pred CCCHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEE T ss_conf 599999999999859967997677337753446889997333688777708999999984387404 No 39 >3mo4_A Alpha-1,3/4-fucosidase; structural genomics, PSI-2, protein structure initiative, MI center for structural genomics, MCSG; HET: TYR; 1.90A {Bifidobacterium longum subsp} Probab=76.05 E-value=3.1 Score=18.67 Aligned_cols=45 Identities=18% Similarity=0.225 Sum_probs=35.1 Q ss_pred CCHHHHHHHHHHCCCCEEEEE---------------CCCHHHH---------HHHHHHHHHHCCCCEEE Q ss_conf 998999999997899889981---------------1651774---------99999999976996389 Q 537021.9.peg.7 27 LSLNNILDKIAADQQPAIAIT---------------DTNNLFS---------ALEFSQKACAAGIQPII 71 (1033) Q Consensus 27 ~~~e~LV~~A~~~G~~AiAIT---------------D~~nl~G---------av~F~~ackk~gIKPIi 71 (1033) ..+++.|+.|++.||+.+.|| |++...+ +-+|.+||+++|||..+ T Consensus 64 fDad~W~~~~k~AGakY~vltaKHHDGF~lW~S~~t~~~v~~sp~~~~krDiv~el~~A~rk~Glk~G~ 132 (480) T 3mo4_A 64 VDVDQWMDALVAGGMAGVILTCKHHDGFCLWPSRLTRHTVASSPWREGKGDLVREVSESARRHGLKFGV 132 (480) T ss_dssp CCHHHHHHHHHHTTCSEEEEEEECTTCCBSSCCTTCSCBGGGSSGGGGTCCHHHHHHHHHHHTTCEEEE T ss_pred CCHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEEE T ss_conf 899999999998599489981572786557379999986435776688867899999999871971478 No 40 >1qzg_A Protection of telomeres protein 1; protrein-DNA complex, single-stranded telomeric DNA, DNA binding protein/DNA complex; HET: TMP; 1.90A {Schizosaccharomyces pombe} SCOP: b.40.4.3 PDB: 1qzh_A Probab=75.48 E-value=3.2 Score=18.56 Aligned_cols=43 Identities=14% Similarity=0.240 Sum_probs=32.1 Q ss_pred CCCCEEEEEEEEEEEEEEC-CCCC--CEEEEEEEECCCC-----EEEEEEC Q ss_conf 5898899999996112101-5888--7699999984970-----4999953 Q 537021.9.peg.7 990 IRGNNIRLAAMVVSKQQKK-TRKG--SRIGWVTFSEPAK-----EYEANFF 1032 (1033) Q Consensus 990 ~~g~~v~iaG~V~~i~~~~-TKkG--~~mafltleD~~g-----~~e~~iF 1032 (1033) +.++.|.++|+|+....++ ||+| +-+..++|-|.+- .+.+.+| T Consensus 40 k~n~~VNviGVV~~~~~P~~Sk~G~kDy~~tl~I~D~S~~~~~~gL~v~iF 90 (187) T 1qzg_A 40 KKNTIVNLFGIVKDFTPSRQSLHGTKDWVTTVYLWDPTCDTSSIGLQIHLF 90 (187) T ss_dssp SSCEEEEEEEEEEEEEEEEECSSTTCCEEEEEEEECTTSCTTSCCEEEEEE T ss_pred HCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEECCCCCCCCCCEEEEEE T ss_conf 159888899998003697027789986799999985998888886799998 No 41 >1hjs_A Beta-1,4-galactanase; 4-galactanases, family 53 glycoside hydrolase, thermostability, PH optimum, CLAN GH-A, thermophIle, alkalophIle; HET: NAG EPE; 1.87A {Thielavia heterothallica} SCOP: c.1.8.3 PDB: 1hju_A* 1hjq_A* Probab=73.50 E-value=3.5 Score=18.19 Aligned_cols=65 Identities=18% Similarity=0.159 Sum_probs=48.2 Q ss_pred CCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEE------E-CCCHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 8133433144476443299899999999789988998------1-165177499999999976996389999 Q 537021.9.peg.7 10 SFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAI------T-DTNNLFSALEFSQKACAAGIQPIIGCQ 74 (1033) Q Consensus 10 ~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAI------T-D~~nl~Gav~F~~ackk~gIKPIiG~E 74 (1033) +++.+-..+.+.+.+.-...+++.+.-++.|+.+|=| + +.+++.-+-+..+.|+++|+|.|+.+. T Consensus 10 ~~~~~~~~~g~~~~~~~g~~~d~~~~l~~aG~n~VRi~vW~~p~~~~~~~~~v~~~v~~A~~~gl~vil~lh 81 (332) T 1hjs_A 10 SSVVVEERAGVSYKNTNGNAQPLENILAANGVNTVRQRVWVNPADGNYNLDYNIAIAKRAKAAGLGVYIDFH 81 (332) T ss_dssp TTHHHHHHTTCCCBCTTSCBCCHHHHHHHTTCCEEEEEECSSCTTCTTSHHHHHHHHHHHHHTTCEEEEEEC T ss_pred HHHHHHHHCCCEEECCCCCCCCHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEEC T ss_conf 678999968966788999814399999977999799501107998867899999999999988997999715 No 42 >2pgf_A Adenosine deaminase; metallo-dependent hydrolase, structural genomics, medical structural genomics of pathogenic protozoa consortium, MSGPP; HET: MSE ADN; 1.89A {Plasmodium vivax} PDB: 2pgr_A* 2qvn_A* 3ewc_A* 3ewd_A* 2amx_A Probab=72.25 E-value=1.5 Score=21.29 Aligned_cols=20 Identities=5% Similarity=-0.064 Sum_probs=9.1 Q ss_pred CHHHHHHHHHHHHHHHCCCC Q ss_conf 30557888999998510476 Q 537021.9.peg.7 353 FSGYFLIVADFIQWAKKNNI 372 (1033) Q Consensus 353 f~~YFLiV~Div~~ak~~gI 372 (1033) ++++.-++++++.-+..+|+ T Consensus 110 ~e~~~~~a~~~~~~~~~dgV 129 (371) T 2pgf_A 110 YEVIEDLAKHAVFNKYKEGV 129 (371) T ss_dssp HHHHHHHHHHHHHHHHHHTE T ss_pred HHHHHHHHHHHHHHHHHCCC T ss_conf 99999999999999998597 No 43 >3l6a_A Eukaryotic translation initiation factor 4 gamma; C-terminal region, MA2 domain, W2 domain, EIF4G2, EIF family translation; HET: MES PG4; 2.00A {Homo sapiens} Probab=71.59 E-value=3 Score=18.81 Aligned_cols=119 Identities=16% Similarity=0.202 Sum_probs=72.7 Q ss_pred CCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHHHH--HHHHHHHHHHCCCCCCCCCCCCHH Q ss_conf 898899987654103420124643245783899999887789898606305578--889999985104764334444204 Q 537021.9.peg.7 305 KEEENELRNKAVAGLEVRLAQGAIAKGYHAQDYRQRLDFELNVIARMKFSGYFL--IVADFIQWAKKNNIPVGPGRGSGA 382 (1033) Q Consensus 305 ~~~~~~Lr~l~~~Gl~~R~~~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~YFL--iV~Div~~ak~~gI~vGpGRGSAa 382 (1033) +.+++.+++.+..=+.+-+..+ ..++. ...+.+++-+..|+ +|..++..+-+++= -.=..+ T Consensus 5 ~~S~Ee~~kk~~~il~Ey~~~~------D~~Ea-------~~~l~el~~p~~~~~~~V~~~i~~aler~~----~~Re~~ 67 (364) T 3l6a_A 5 HHSKEELLKLTETVVTEYLNSG------NANEA-------VNGVREMRAPKHFLPEMLSKVIILSLDRSD----EDKEKA 67 (364) T ss_dssp CCCHHHHHHHHHHHHHHHHHHC------CHHHH-------HHHHHHHTCCGGGHHHHHHHHHHHHHTSCH----HHHHHH T ss_pred CCCHHHHHHHHHHHHHHHHHCC------CHHHH-------HHHHHHHCCCHHHHHHHHHHHHHHHHCCCH----HHHHHH T ss_conf 6999999999999999998589------99999-------999998488275799999999999972889----999999 Q ss_pred HHHHHHHHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCH Q ss_conf 6677655440375633323105676210014578753336568789999999830380215542207 Q 537021.9.peg.7 383 GSVTAYALTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITF 449 (1033) Q Consensus 383 gSLVaY~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~ 449 (1033) +-|++++..=.-|.+=..-.=|+||+ .++||+-+|+|. +-.|+.+-.|+--+.-+.+. T Consensus 68 ~~LL~~L~~~~~is~~~~~~Gf~~~l----~~l~Dl~iDiP~-----a~~~la~~l~~~i~~~~l~l 125 (364) T 3l6a_A 68 SSLISLLKQEGIATSDNFMQAFLNVL----DQCPKLEVDIPL-----VKSYLAQFAARAIISELVSI 125 (364) T ss_dssp HHHHHHHHHTTSSCHHHHHHHHHHHH----HTHHHHHHHCTT-----HHHHHHHHHHHHHHTTSSCH T ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHH----HHHHHHHHCCCC-----HHHHHHHHHHHHHHCCCCCH T ss_conf 99999998779999999999999999----764675324663-----69999999999999289689 No 44 >2zkq_k 40S ribosomal protein S14E; protein-RNA complex, 40S ribosomal subunit, ribosomal protein/RNA complex; 8.70A {Canis familiaris} PDB: 3jyv_K* Probab=71.42 E-value=1.8 Score=20.73 Aligned_cols=27 Identities=26% Similarity=0.423 Sum_probs=18.2 Q ss_pred CCCCCCHHHHHHHH---HHHCCCCCHHHCC Q ss_conf 34444204667765---5440375633323 Q 537021.9.peg.7 375 GPGRGSGAGSVTAY---ALTITDIDPLRFS 401 (1033) Q Consensus 375 GpGRGSAagSLVaY---~LgIT~VDPi~~~ 401 (1033) ||||-+|--+|-.- ..-|+|+-||.|| T Consensus 109 GpGr~~air~l~~~glkI~~I~D~TpiphN 138 (151) T 2zkq_k 109 GPGAQSALRALARSGMKIGRIEDVTPIPSD 138 (151) T ss_dssp CTTHHHHHHHHHHHTCCCSCEECCCCCCCS T ss_pred CCCHHHHHHHHHHCCCEEEEEEECCCCCCC T ss_conf 875788998897689389999981989989 No 45 >1ynx_A Replication factor-A protein 1; canonical OB fold, DNA binding protein; NMR {Saccharomyces cerevisiae} Probab=68.57 E-value=4.4 Score=17.38 Aligned_cols=52 Identities=23% Similarity=0.222 Sum_probs=32.6 Q ss_pred CCCHHHHHCCCCCCEEEEEEEEEEE---EEECCCCCC-EEEEEEEECCCCEEEEEECC Q ss_conf 9768898505589889999999611---210158887-69999998497049999539 Q 537021.9.peg.7 980 IKGYEESVATIRGNNIRLAAMVVSK---QQKKTRKGS-RIGWVTFSEPAKEYEANFFP 1033 (1033) Q Consensus 980 ~~~~~~~~~~~~g~~v~iaG~V~~i---~~~~TKkG~-~mafltleD~~g~~e~~iFP 1033 (1033) +++++++.... +..+|.|.|+.. +....++|. ++.-+.|.|++|.+.+++|. T Consensus 4 ~~pI~~L~p~~--~~w~I~~RV~~k~~~~~~~~~~~~g~v~~~~l~De~G~I~~~~~~ 59 (114) T 1ynx_A 4 IFAIEQLSPYQ--NVWTIKARVSYKGEIKTWHNQRGDGKLFNVNFLDTSGEIRATAFN 59 (114) T ss_dssp BCCGGGCCTTT--CCCEEEEEEEEEEEEEEEECSSCEEEEEEEEEEETTEEEEEEECH T ss_pred CEEHHHCCCCC--CCEEEEEEEEEECCCEEEECCCCCCEEEEEEEECCCCCEEEEEEC T ss_conf 33189968999--967999999882156036558996449999999189989999912 No 46 >1wjj_A Hypothetical protein F20O9.120; DNA-binding protein-related, structural genomics, riken structural genomics/proteomics initiative, RSGI; NMR {Arabidopsis thaliana} SCOP: b.40.4.3 Probab=68.29 E-value=4.4 Score=17.34 Aligned_cols=47 Identities=13% Similarity=-0.050 Sum_probs=32.4 Q ss_pred HHCCCCC-CEEEEEEEEEEEEEECCCC------------C--CEEEEEEEECCCCEEEEEEC Q ss_conf 8505589-8899999996112101588------------8--76999999849704999953 Q 537021.9.peg.7 986 SVATIRG-NNIRLAAMVVSKQQKKTRK------------G--SRIGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 986 ~~~~~~g-~~v~iaG~V~~i~~~~TKk------------G--~~mafltleD~~g~~e~~iF 1032 (1033) +.++..+ ..+.+.+.|+++..+...+ | -.++-+.+.|+||.+.+++| T Consensus 19 I~dL~p~~~~vnl~~kVl~~~~~~~~~~~~~~~~~~~~~g~~~~v~~~~v~DeTG~i~~tlW 80 (145) T 1wjj_A 19 VEQLKPGTTGHTLTVKVIEANIVVPVTRKTRPASSLSRPSQPSRIVECLIGDETGCILFTAR 80 (145) T ss_dssp STTCCTTCCCEEEEEEEEEEEECCSSSSCCSCSSCCSSCCCCCCCEEEEEECSSCEEEEEEC T ss_pred HHHCCCCCCCEEEEEEEEECCCCEEECCCCCCCCCCCCCCCCCEEEEEEEECCCCEEEEEEE T ss_conf 99988999980799999873143674145642221013588747999999758877999996 No 47 >1ug3_A EIF4GI, eukaryotic protein synthesis initiation factor 4G; heat repeat, translation; 2.24A {Homo sapiens} SCOP: a.118.1.14 a.118.1.14 Probab=67.90 E-value=4.3 Score=17.45 Aligned_cols=120 Identities=17% Similarity=0.178 Sum_probs=75.3 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCHHH-HHHHHHHHHHHHCCCCCCCCCCCCHHHHHH Q ss_conf 8999876541034201246432457838999998877898986063055-788899999851047643344442046677 Q 537021.9.peg.7 308 ENELRNKAVAGLEVRLAQGAIAKGYHAQDYRQRLDFELNVIARMKFSGY-FLIVADFIQWAKKNNIPVGPGRGSGAGSVT 386 (1033) Q Consensus 308 ~~~Lr~l~~~Gl~~R~~~~~~~~~~~~~~y~~RLe~EL~vI~~~gf~~Y-FLiV~Div~~ak~~gI~vGpGRGSAagSLV 386 (1033) ++.+++.+..=+++-+..+ =++.-...+.+++-++| .-+|.-+|..+-+++ ...|- .++-|+ T Consensus 7 ~Eel~kk~~~il~EY~~~~-------------D~~Ea~~~v~el~~p~~~~~~V~~~i~~~ler~---~~~Re-~~~~LL 69 (339) T 1ug3_A 7 EEELEKKSKAIIEEYLHLN-------------DMKEAVQCVQELASPSLLFIFVRHGVESTLERS---AIARE-HMGQLL 69 (339) T ss_dssp HHHHHHHHHHHHHHHHHHC-------------CHHHHHHHHHTTCCGGGHHHHHHHHHHHHTTTC---HHHHH-HHHHHH T ss_pred HHHHHHHHHHHHHHHHCCC-------------CHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCC---HHHHH-HHHHHH T ss_conf 9999999999999997389-------------899999999973996017999999999998388---89999-999999 Q ss_pred HHHHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHH Q ss_conf 6554403756333231056762100145787533365687899999998303802155422077877 Q 537021.9.peg.7 387 AYALTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQ 453 (1033) Q Consensus 387 aY~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~ 453 (1033) +++..=--+.+=..--=|+||+ .++||+-+|+|. +..|+.+-.|+--+..+.+++.+. T Consensus 70 ~~L~~~~~ls~~~i~~Gf~~~l----~~l~Dl~iDiP~-----a~~~la~~~~~~i~~~~l~l~~l~ 127 (339) T 1ug3_A 70 HQLLCAGHLSTAQYYQGLYEIL----ELAEDMEIDIPH-----VWLYLAELVTPILQEGGVPMGELF 127 (339) T ss_dssp HHHHHTTSSCHHHHHHHHHHHH----HHHHHHTTTCTT-----HHHHHHHHHGGGGSTTSCCHHHHH T ss_pred HHHHHCCCCCHHHHHHHHHHHH----HHHHHHHHCCCH-----HHHHHHHHHHHHHHCCCCCHHHHH T ss_conf 9998779999999999999999----762686540311-----799999999999984880499999 No 48 >2kbn_A Conserved protein; nucleic acid binding protein, beta barrel, structural genomics, PSI-2, protein structure initiative; NMR {Methanosarcina mazei} PDB: 2ken_A Probab=66.41 E-value=4.8 Score=17.06 Aligned_cols=44 Identities=14% Similarity=0.100 Sum_probs=33.8 Q ss_pred HCCCC-CCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEEC Q ss_conf 50558-98899999996112101588876999999849704999953 Q 537021.9.peg.7 987 VATIR-GNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANFF 1032 (1033) Q Consensus 987 ~~~~~-g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~iF 1032 (1033) .++.. ++.|.+.|-|+++-.+++. .......+.|+||.+.+++| T Consensus 9 ~Dl~~~~~~V~i~~kV~~i~e~~~~--~~~~~g~l~DeTG~Ir~t~W 53 (109) T 2kbn_A 9 VDIVENGQWANLKAKVIQLWENTHE--SISQVGLLGDETGIIKFTIW 53 (109) T ss_dssp TTCCSTTCEEEEEEEEEEEEECCCS--SEEEEEEEECTTCCEEEEEE T ss_pred ECCCCCCCCEEEEEEEEEECCCCCC--CEEEEEEEECCCCEEEEEEE T ss_conf 1468899838999999996089996--19999999888886999996 No 49 >1s1h_K RP59A, 40S ribosomal protein S14-A; 80S ribosome, 40S ribosomal subunit, EEF2, tRNA translocation, sordarin, cryo-EM; 11.70A {Saccharomyces cerevisiae} SCOP: i.1.1.1 Probab=66.30 E-value=2.1 Score=20.11 Aligned_cols=27 Identities=22% Similarity=0.393 Sum_probs=14.6 Q ss_pred CCCCCCHHHHHHHH---HHHCCCCCHHHCC Q ss_conf 34444204667765---5440375633323 Q 537021.9.peg.7 375 GPGRGSGAGSVTAY---ALTITDIDPLRFS 401 (1033) Q Consensus 375 GpGRGSAagSLVaY---~LgIT~VDPi~~~ 401 (1033) ||||-||--.|..- ..-|+||-||.|| T Consensus 94 G~Gr~saik~l~~~glkI~~I~DvTpiphN 123 (136) T 1s1h_K 94 GPGGQAALRALARSGLRIGRIEDVTPVPSD 123 (136) T ss_dssp CCSHHHHHHHHHHHSSEEEEEEECCCCCSS T ss_pred CCCHHHHHHHHHHCCCEEEEEEECCCCCCC T ss_conf 873899999998789889999983989989 No 50 >2ztd_A Holliday junction ATP-dependent DNA helicase RUVA; recombination, branch migration, DNA binding, oligomerization, acidic PIN; 2.40A {Mycobacterium tuberculosis} PDB: 2ztc_A 2zte_A 2h5x_A 1bvs_A Probab=65.86 E-value=4.9 Score=16.98 Aligned_cols=62 Identities=18% Similarity=0.297 Sum_probs=28.3 Q ss_pred HHHHCCCCHHHHHHHH-HHHHCCCCCCHHHHHHH-CCCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHH Q ss_conf 2685896989999999-99837999997899973-4824799899998886767234790389999989999 Q 537021.9.peg.7 839 LAAIKGVGTTTARHIM-EASADKPFDSLEDFCSR-VDSKKLNRRVLESLVFAGALDCFGYSRMQLLQSLDNI 908 (1033) Q Consensus 839 l~~Ikgvg~~~~~~Iv-e~r~~g~f~sl~df~~r-v~~~~~nk~~le~Li~aGAfD~~~~~R~~ll~~i~~~ 908 (1033) |..|+|||.++|+.|+ |-+ +++..+..-... ..........+++ +-++|+++.+.-..+..+ T Consensus 125 L~~vpGIGkKtAeRIilELk--dKl~~~~~~~~~~~~~~~~~~e~~~A------L~~LGy~~~ea~~av~~i 188 (212) T 2ztd_A 125 LTRVPGIGKRGAERMVLELR--DKVGVAATGGALSTNGHAVRSPVVEA------LVGLGFAAKQAEEATDTV 188 (212) T ss_dssp HHTSTTCCHHHHHHHHHHHT--TTCC-------------CCHHHHHHH------HHHTTCCHHHHHHHHHHH T ss_pred HHHCCCCCHHHHHHHHHHHH--HHHCCCCCCCCCCCCCCCCHHHHHHH------HHHCCCCHHHHHHHHHHH T ss_conf 85177824888999999997--54313334554345664317899999------998599989999999999 No 51 >1qwg_A PSL synthase;, (2R)-phospho-3-sulfolactate synthase; beta-alpha-barrel, lyase; 1.60A {Methanocaldococcus jannaschii} SCOP: c.1.27.1 Probab=65.17 E-value=5 Score=16.88 Aligned_cols=12 Identities=17% Similarity=0.072 Sum_probs=4.8 Q ss_pred HHHHHHHHCCCC Q ss_conf 999998303802 Q 537021.9.peg.7 430 VIRYVQNKYGHE 441 (1033) Q Consensus 430 vi~y~~~kyG~~ 441 (1033) +..|..++||.+ T Consensus 211 ~q~~~I~~fG~~ 222 (251) T 1qwg_A 211 QQVAFILKFGSS 222 (251) T ss_dssp HHHHHHHHHCTT T ss_pred HHHHHHHHHCCC T ss_conf 999999997989 No 52 >1k19_A Chemosensory protein CSP2; pheromone, lipid transport; NMR {Mamestra brassicae} SCOP: a.118.21.1 PDB: 1kx8_A 1kx9_A 1n8u_A* 1n8v_A* 2jnt_A Probab=63.01 E-value=5.5 Score=16.58 Aligned_cols=40 Identities=10% Similarity=0.019 Sum_probs=32.1 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCC Q ss_conf 99999730266677751002354202589998888636850 Q 537021.9.peg.7 776 TAWMKTHYPVEFLAASMTLEMDNVEKIKKFCQDARQFNIQI 816 (1033) Q Consensus 776 ~aylK~~yP~eF~aa~L~~~~~~~~~~~~~i~e~~~~gi~v 816 (1033) +.||..|||.+|-.-+=-.+|.+.+.. .|-..++..||.| T Consensus 70 i~~l~~~~P~~W~~l~~KYDP~g~y~~-k~~~~~~~~gi~~ 109 (112) T 1k19_A 70 IEHLIKNEIEIWRELTAKYDPTGNWRK-KYEDRAKAAGIVI 109 (112) T ss_dssp HHHHHHHHSCHHHHHHHSSTTTCSSHH-HHHHHHHHHCCCC T ss_pred HHHHHHCCHHHHHHHHHHHCCCCHHHH-HHHHHHHHCCCCC T ss_conf 999999091999999998698956999-9999999848999 No 53 >2ekc_A AQ_1548, tryptophan synthase alpha chain; structural genomics, lyase, NPPSFA; 2.00A {Aquifex aeolicus VF5} Probab=62.56 E-value=5.6 Score=16.52 Aligned_cols=41 Identities=7% Similarity=0.137 Sum_probs=16.6 Q ss_pred CCHHHHHHHHHHCCCCEEEE--E--CCCHHHHHHHHHHHHHHCCCC Q ss_conf 99899999999789988998--1--165177499999999976996 Q 537021.9.peg.7 27 LSLNNILDKIAADQQPAIAI--T--DTNNLFSALEFSQKACAAGIQ 68 (1033) Q Consensus 27 ~~~e~LV~~A~~~G~~AiAI--T--D~~nl~Gav~F~~ackk~gIK 68 (1033) +++++.++..++.+-+++.. | |. ++--..++.+++.+.|.- T Consensus 2 ~ri~~~f~~~k~~~~~~li~y~t~G~P-~~~~~~~~~~~l~~~G~d 46 (262) T 2ekc_A 2 GRISDKFTELKEKREKALVSYLMVGYP-DYETSLKAFKEVLKNGTD 46 (262) T ss_dssp CHHHHHHHHHHHHTBCEEEEEEETTSS-CHHHHHHHHHHHHHTTCS T ss_pred CHHHHHHHHHHHCCCCEEEEEEECCCC-CHHHHHHHHHHHHHCCCC T ss_conf 649999999997599569988828379-868999999999976999 No 54 >3nav_A Tryptophan synthase alpha chain; alpha subunit, structural genomics, CSG center for structural genomics of infectious diseases; 2.10A {Vibrio cholerae o1 biovar el tor} Probab=61.82 E-value=5.7 Score=16.42 Aligned_cols=46 Identities=24% Similarity=0.311 Sum_probs=28.9 Q ss_pred HHCCCHHHHHHHHHHCCCCEE----EEECCCHHHHHHHHHHHHHHCCCCEE Q ss_conf 432998999999997899889----98116517749999999997699638 Q 537021.9.peg.7 24 EGALSLNNILDKIAADQQPAI----AITDTNNLFSALEFSQKACAAGIQPI 70 (1033) Q Consensus 24 dg~~~~e~LV~~A~~~G~~Ai----AITD~~nl~Gav~F~~ackk~gIKPI 70 (1033) .++++++++.+++++.+-+|+ -.-|. +.-...++.++..+.|.--| T Consensus 2 ~~m~ri~~~f~~~k~~~~~ali~yltaG~P-~~e~s~~~l~~l~~~G~D~i 51 (271) T 3nav_A 2 NAMNRYQALFQRLSAAQQGAFVPFVTIGDP-NPEQSLAIMQTLIDAGADAL 51 (271) T ss_dssp -CCCHHHHHHHHHHHTTBCEEEEEEETTSS-CHHHHHHHHHHHHHTTCSSE T ss_pred CCHHHHHHHHHHHHHCCCCEEEEEEECCCC-CHHHHHHHHHHHHHCCCCEE T ss_conf 503589999999997499369989837189-87899999999997699999 No 55 >2zkq_m 40S ribosomal protein S18E; protein-RNA complex, 40S ribosomal subunit, ribosomal protein/RNA complex; 8.70A {Canis familiaris} PDB: 1s1h_M 3jyv_M* Probab=59.88 E-value=4.1 Score=17.67 Aligned_cols=57 Identities=21% Similarity=0.444 Sum_probs=37.1 Q ss_pred CCCCCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHH Q ss_conf 85035523342100353198698502685896989999999998379999978999734824799899998886 Q 537021.9.peg.7 814 IQIMPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLVF 887 (1033) Q Consensus 814 i~vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li~ 887 (1033) |.|+.-||+ ++..|.++|..|+|||...+..|++.-.-.|- .++ +.++..+++.|.. T Consensus 13 vRI~g~dI~--------g~K~v~~aLt~I~GIG~~~A~~Ic~~lgI~~~-------~kv--g~Lt~~ql~kl~~ 69 (152) T 2zkq_m 13 LRVLNTNID--------GRRKIAFAITAIKGVGRRYAHVVLRKADIDLT-------KRA--GELTEDEVERVIT 69 (152) T ss_dssp CCCSTTCCC--------CSSBHHHHGGGSTTCCHHHHHHHHHHHTCCTT-------SBT--TTSCHHHHHHHHH T ss_pred HHEECCCCC--------CCCEEEEEEECCCCCCHHHHHHHHHHHCCCCC-------CCC--CCCCHHHHHHHHH T ss_conf 413285279--------99788765002106589999999999389954-------301--5199999999999 No 56 >3ofo_B 30S ribosomal protein S2; protein biosynthesis, ribosomes, RNA, tRNA, transfer, eryThr ketolide, macrolide, antibiotic, EXIT, peptidyl; 3.10A {Escherichia coli} PDB: 3fih_B* 2wwl_B 3ofp_B 3i1m_B 1vs7_B* 3e1a_B 3e1c_B 1vs5_B 3i1o_B 3i1q_B 3i1s_B 3i1z_B 3i21_B 3kc4_B 3or9_B 3ora_B 2qal_B* 1p87_B 2aw7_B 2avy_B ... Probab=59.24 E-value=5.7 Score=16.45 Aligned_cols=67 Identities=10% Similarity=0.086 Sum_probs=28.3 Q ss_pred CCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCC-CCEEECCCHHHHHHHHHHHHHHHHCCCCH Q ss_conf 3105676210014578753336568789999999830380-21554220778778899999997527998 Q 537021.9.peg.7 401 SLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGH-ERVAQIITFGSLQAKAALRDVGRALQMPY 469 (1033) Q Consensus 401 ~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~-~~va~i~t~~t~~~k~aird~~r~~g~~~ 469 (1033) |--++.|+-..|.+..=||++.....=.++..++.+--.. ..+--|+|-- +++.+|+..++..|.++ T Consensus 16 np~m~~yI~g~rngi~IIdl~~T~~~L~~A~~~i~~~~~~~~~iLfVgt~~--~~~~~i~~~a~~~~~~y 83 (218) T 3ofo_B 16 NPKMKPFIFGARNKVHIINLEKTVPMFNEALAELNKIASRKGKILFVGTKR--AASEAVKDAALSCDQFF 83 (218) T ss_dssp CGGGCTTCCCCCSSSCCCCHHHHTTGGGTHHHHHHHHTTTCCCCCEECCSS--SSGGGTSSSTTTSSCCE T ss_pred CCCCCCCCCCCCCCCEEECHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCH--HHHHHHHHHHHHHCCCC T ss_conf 986301002564996797299999999999999999973799799997768--78999999999809973 No 57 >3iar_A Adenosine deaminase; purine metabolism, structural genomics, structural genomics consortium, SGC, acetylation, disease mutation; HET: 3D1; 1.52A {Homo sapiens} PDB: 2bgn_E* 1w1i_E* 1qxl_A* 1krm_A* 1vfl_A 1ndv_A* 1ndy_A* 1ndz_A* 1o5r_A* 1uml_A* 1v79_A* 1v7a_A* 1ndw_A 1wxy_A* 1wxz_A* 2e1w_A* 2z7g_A* 2ada_A* 1a4m_A* 1a4l_A* ... Probab=58.99 E-value=3.5 Score=18.19 Aligned_cols=26 Identities=31% Similarity=0.377 Sum_probs=23.0 Q ss_pred HHHHHHCHHHHHHHCCCHHHHHHHHHHCCCC Q ss_conf 3343314447644329989999999978998 Q 537021.9.peg.7 12 VHLQVHSSYSLLEGALSLNNILDKIAADQQP 42 (1033) Q Consensus 12 vhLh~hS~ySlldg~~~~e~LV~~A~~~G~~ 42 (1033) ++|||| |+|+++++-|.+.|++.|.+ T Consensus 9 ~eLH~H-----L~Gs~~~~~l~ela~~~g~~ 34 (367) T 3iar_A 9 VELHVH-----LDGSIKPETILYYGRRRGIA 34 (367) T ss_dssp EECCBB-----GGGSCCHHHHHHHHHHHTCC T ss_pred HHHHCC-----CCCCCCHHHHHHHHHHCCCC T ss_conf 999678-----82788999999999982999 No 58 >1e8g_A Vanillyl-alcohol oxidase; oxidoreductase, flavoenzyme, specificity; HET: FAD FCR; 2.1A {Penicillium simplicissimum} SCOP: d.58.32.1 d.145.1.1 PDB: 1e8f_A* 1e8h_A* 1qlt_A* 1qlu_A* 1vao_A* 1ahv_A* 1ahz_A* 1ahu_A* 2vao_A* 1w1j_A* 1dzn_A* 1w1l_A* 1e0y_A* 1w1k_A* 1w1m_A* Probab=58.91 E-value=3.1 Score=18.62 Aligned_cols=21 Identities=29% Similarity=0.457 Sum_probs=17.1 Q ss_pred HHHHHHHHHCCCCCCCC-CCCC Q ss_conf 89999985104764334-4442 Q 537021.9.peg.7 360 VADFIQWAKKNNIPVGP-GRGS 380 (1033) Q Consensus 360 V~Div~~ak~~gI~vGp-GRGS 380 (1033) |+.+|++|++++++|-| |+|+ T Consensus 83 V~~iv~~a~~~~vpv~p~g~G~ 104 (560) T 1e8g_A 83 VQSIVGLANKFSFPLWPISIGR 104 (560) T ss_dssp HHHHHHHHHHHTCCEEEESSCC T ss_pred HHHHHHHHHHCCCEEEEECCCC T ss_conf 9999999998798089966799 No 59 >1vjz_A Endoglucanase; TM1752, structural genomics, JCSG, PSI, protein structure initiative, joint center for structural genomics; 2.05A {Thermotoga maritima} SCOP: c.1.8.3 Probab=58.81 E-value=6.3 Score=16.03 Aligned_cols=69 Identities=12% Similarity=0.013 Sum_probs=44.2 Q ss_pred CCCCCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEE-------ECC--------CHHHHHHHHHHHHHHCCCCEE Q ss_conf 68988133433144476443299899999999789988998-------116--------517749999999997699638 Q 537021.9.peg.7 6 EKSPSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAIAI-------TDT--------NNLFSALEFSQKACAAGIQPI 70 (1033) Q Consensus 6 ~~~~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAI-------TD~--------~nl~Gav~F~~ackk~gIKPI 70 (1033) .+-|.|---++.+.++....-.-.++-++..++.|+.+|=| .+. +.+...=++.++|+++||++| T Consensus 15 ~~~~~~~g~n~~~~~~~~~~~~~te~D~~~ia~~GfN~VRlpv~~~~~~~~~~~~~~~~~~l~~ld~~v~~a~~~Gi~vi 94 (341) T 1vjz_A 15 NTIPRWRGFNLLEAFSIKSTGNFKEEDFLWMAQWDFNFVRIPMCHLLWSDRGNPFIIREDFFEKIDRVIFWGEKYGIHIC 94 (341) T ss_dssp --CCCCEEEECCTTSSTTCCCCCCHHHHHHHHHTTCCEEEEEEEGGGTSCSSCTTCCCGGGHHHHHHHHHHHHHHTCEEE T ss_pred CCCCCCCCEECCCCCCCCCCCCCCHHHHHHHHHCCCCEEEEECCHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEE T ss_conf 67876025576210113556788799999999869988993066999158888776598999999999999998799899 Q ss_pred EEEE Q ss_conf 9999 Q 537021.9.peg.7 71 IGCQ 74 (1033) Q Consensus 71 iG~E 74 (1033) +-.- T Consensus 95 ldlH 98 (341) T 1vjz_A 95 ISLH 98 (341) T ss_dssp EEEE T ss_pred EEEC T ss_conf 9726 No 60 >3bbn_B Ribosomal protein S2; small ribosomal subunit, spinach chloroplast ribosome, ribonucleoprotein particle, macromolecular complex; 9.40A {Spinacea oleracea} SCOP: i.1.1.1 Probab=58.62 E-value=5.9 Score=16.32 Aligned_cols=19 Identities=32% Similarity=0.286 Sum_probs=6.9 Q ss_pred EEEEEECCHHHHHHHHHHH Q ss_conf 3999956968999999999 Q 537021.9.peg.7 98 SIVLLVSTAEGYQRLIELV 116 (1033) Q Consensus 98 ~lvlLAkN~~Gy~nL~kL~ 116 (1033) |++=|-++...++..++++ T Consensus 39 ~IIdl~~T~~~L~~A~~~l 57 (231) T 3bbn_B 39 HIINLTRTARFLSEACDLV 57 (231) T ss_dssp EEECHHHHHHHTHHHHHHS T ss_pred EEECHHHHHHHHHHHHHHH T ss_conf 4885999999999999999 No 61 >2vqe_K 30S ribosomal protein S11, 30S ribosomal protein S6; tRNA-binding, rRNA-binding, metal-binding, zinc-finger, translation; HET: TM2 PAR; 2.5A {Thermus thermophilus} SCOP: c.55.4.1 PDB: 1gix_N* 1hnw_K* 1hnx_K* 1hnz_K* 1hr0_K 1ibk_K* 1ibl_K* 1ibm_K 1j5e_K 1jgo_N* 1jgp_N* 1jgq_N* 1ml5_N* 1n32_K* 1n33_K* 1n34_K 1n36_K 1xmo_K* 1xmq_K* 1xnq_K* ... Probab=57.71 E-value=4.1 Score=17.59 Aligned_cols=27 Identities=26% Similarity=0.414 Sum_probs=10.4 Q ss_pred CCCCCCHHHHHHHH---HHHCCCCCHHHCC Q ss_conf 34444204667765---5440375633323 Q 537021.9.peg.7 375 GPGRGSGAGSVTAY---ALTITDIDPLRFS 401 (1033) Q Consensus 375 GpGRGSAagSLVaY---~LgIT~VDPi~~~ 401 (1033) ||||-+|--.|..- .+-|+|+-|+.|| T Consensus 88 G~GR~~aik~l~~~glkI~~I~D~TpiphN 117 (129) T 2vqe_K 88 GAGREQAIRALQASGLQVKSIVDDTPVPHN 117 (129) T ss_dssp CTTHHHHHHHHHTSSSEEEECEECCCCCSS T ss_pred CCCCHHHHHHHHHCCCEEEEEEECCCCCCC T ss_conf 997299999998789879999989999979 No 62 >3glk_A Acetyl-COA carboxylase 2; ATP binding, alternative splicing, ATP-binding, biotin, fatty acid biosynthesis, ligase, lipid synthesis, manganese; 2.10A {Homo sapiens} PDB: 3gid_A 2hjw_A Probab=57.45 E-value=6.6 Score=15.86 Aligned_cols=99 Identities=11% Similarity=0.077 Sum_probs=52.2 Q ss_pred HHHHHCHHHH--HH---HCCCHHHHHHHH-HHCCCCEEEEECCCHHHHHHHHHHHHHHC---------CCCEEEEEEEEE Q ss_conf 3433144476--44---329989999999-97899889981165177499999999976---------996389999998 Q 537021.9.peg.7 13 HLQVHSSYSL--LE---GALSLNNILDKI-AADQQPAIAITDTNNLFSALEFSQKACAA---------GIQPIIGCQLDI 77 (1033) Q Consensus 13 hLh~hS~ySl--ld---g~~~~e~LV~~A-~~~G~~AiAITD~~nl~Gav~F~~ackk~---------gIKPIiG~E~~v 77 (1033) |.|-|+.-++ |. ...+|.+++... ...-++-|-|+.+|- -|+|..++|++- ||+.+. +|. T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~kvLiANRGE--iA~Ri~r~~r~l~~~~~~~e~gi~tva---v~s 80 (540) T 3glk_A 6 HHHHHENLYFQGLHRDFTVASPAEFVTRFGGDRVIEKVLIANNGI--AAVKCMRSIRRWAYEMFRNERAIRFVV---MVT 80 (540) T ss_dssp ---------------CCCCSSHHHHHHHTTCSCCCCEEEECCCHH--HHHHHHHHHHHHHHHHHSCTTSSEEEE---EEC T ss_pred CCCCCCCHHHHCCCCCCCCCCHHHHHHHCCCCCCCCEEEEECCCH--HHHHHHHHHHHHHHCCCCCCCCCCEEE---EEC T ss_conf 321234213312157775579789998538984324698976859--999999998775431342104974899---984 Q ss_pred ECCCCCCCCCCCCCCCCCCCEEEEE--ECCHHHHHHHHHHHHHHHH Q ss_conf 5476643344565666677539999--5696899999999989973 Q 537021.9.peg.7 78 DMQDNLKMTNQGKTPFPTVPSIVLL--VSTAEGYQRLIELVSRMYL 121 (1033) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~lvlL--AkN~~Gy~nL~kL~S~a~~ 121 (1033) ..+.. ......+ ..-+.+.+ +.+.+-|-|.-+++..+.. T Consensus 81 ~~D~~-~~~~~v~----~ad~~~~i~~~~~~~sYL~~~~Ii~~A~~ 121 (540) T 3glk_A 81 PEDLK-ANAEYIK----MADHYVPVPGGPNNNNYANVELIVDIAKR 121 (540) T ss_dssp HHHHH-TTCHHHH----HSSEEEECCCSSGGGTTTCHHHHHHHHHH T ss_pred CCCCC-CCCHHHH----HCCEEEECCCCCHHHCCCCHHHHHHHHHH T ss_conf 40113-7974477----39789871899735300489999999999 No 63 >1bif_A 6-phosphofructo-2-kinase/ fructose-2,6- bisphosphatase; transferase (phospho), hydrolase (phospho), glycolysis, bifunctional enzyme; HET: ATG; 2.00A {Rattus norvegicus} SCOP: c.37.1.7 c.60.1.4 PDB: 3bif_A* 2bif_A* 1k6m_A* 1c80_A* 1c7z_A* 1c81_A* 1tip_A* 1fbt_A Probab=57.43 E-value=6.2 Score=16.12 Aligned_cols=31 Identities=19% Similarity=0.206 Sum_probs=19.0 Q ss_pred CCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHH Q ss_conf 3344442046677655440375633323105676 Q 537021.9.peg.7 374 VGPGRGSGAGSVTAYALTITDIDPLRFSLLFERF 407 (1033) Q Consensus 374 vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERF 407 (1033) +.--+|--.+.+|-|++.+.. -| ..|+|=|- T Consensus 227 ~~~~~g~l~~~iv~~l~n~~~-~~--~~IyLvRH 257 (469) T 1bif_A 227 VNRVADHIQSRIVYYLMNIHV-TP--RSIYLCRH 257 (469) T ss_dssp EECCCSHHHHHHHHHHTTCCC-CC--CCEEEEEC T ss_pred EECCCCCCCHHHHHHHHCCCC-CC--CCCCCCCC T ss_conf 944667640334665223678-87--84551059 No 64 >3ou8_A Adenosine deaminase; structural genomics, PSI-2, protein structure initiative, NE SGX research center for structural genomics, nysgxrc; 2.51A {Pseudomonas aeruginosa} Probab=57.21 E-value=2.8 Score=19.01 Aligned_cols=26 Identities=23% Similarity=0.401 Sum_probs=22.8 Q ss_pred HHHHHHCHHHHHHHCCCHHHHHHHHHHCCCC Q ss_conf 3343314447644329989999999978998 Q 537021.9.peg.7 12 VHLQVHSSYSLLEGALSLNNILDKIAADQQP 42 (1033) Q Consensus 12 vhLh~hS~ySlldg~~~~e~LV~~A~~~G~~ 42 (1033) ++||+| |+|+.+++.+.+.|++.|.+ T Consensus 13 ~eLH~H-----l~Gs~~~~~l~ela~~~~~~ 38 (326) T 3ou8_A 13 AELHLH-----LEGTLEPELLFALAERNRIA 38 (326) T ss_dssp EECSBB-----GGGGCCHHHHHHHHHHTTCC T ss_pred HHHHCC-----CCCCCCHHHHHHHHHHCCCC T ss_conf 887468-----76787999999999974999 No 65 >3f1z_A Putative nucleic acid-binding lipoprotein; YP_001337197.1, structural genomics, joint center for structural genomics, JCSG; 2.46A {Klebsiella pneumoniae subsp} Probab=56.72 E-value=6.8 Score=15.77 Aligned_cols=39 Identities=18% Similarity=0.250 Sum_probs=33.1 Q ss_pred CCCCCCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCE Q ss_conf 055898899999996112101588876999999849704 Q 537021.9.peg.7 988 ATIRGNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKE 1026 (1033) Q Consensus 988 ~~~~g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~ 1026 (1033) ...-|++|.+-|+|+.++++..-.|..+.++|..|+... T Consensus 45 sdylgkwvkvrgvivdirrksgiagsyyfivtmrdeqnk 83 (133) T 3f1z_A 45 SDYLGKWVKVRGVIVDIRRKSGIAGSYYFIVTMRDEQNK 83 (133) T ss_dssp HHHTTCEEEEEEEEEEEEEEECSTTCEEEEEEEECTTCS T ss_pred HHHHCCEEEEEEEEEEEEECCCCCCEEEEEEEEECCCCC T ss_conf 244204089878999865236766137999996222367 No 66 >3ci0_K Pseudopilin GSPK; general secretory pathway, pseudopilus, type 4 pilin biogenesis, methylation, protein transport; 2.20A {Escherichia coli} SCOP: a.60.16.1 a.60.16.1 d.24.1.6 Probab=56.42 E-value=5.3 Score=16.70 Aligned_cols=39 Identities=15% Similarity=0.309 Sum_probs=28.7 Q ss_pred HHHH--CCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCC Q ss_conf 2685--89698999999999837999997899973482479 Q 537021.9.peg.7 839 LAAI--KGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKL 877 (1033) Q Consensus 839 l~~I--kgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~ 877 (1033) |.++ .+++...++.|++.|..++|+|+.||...-....+ T Consensus 201 L~Al~~~~ls~~~A~~li~~Rp~~g~~~~~df~~~~~~~~~ 241 (298) T 3ci0_K 201 LEALFDPWLSPVQARALLQQRPAKGWEDVDQFLAQPLLADV 241 (298) T ss_dssp HHHHTC-------CCHHHHTCCTTCCSCHHHHHTSGGGSSS T ss_pred HHHHHCCCCCHHHHHHHHHHCCCCCCCCHHHHHHCHHHCCC T ss_conf 98760688999999999983600357789999716000146 No 67 >1rh9_A Endo-beta-mannanase; endo-beta-mannase, retaining, glycoside hydrolase family 5; 1.50A {Solanum lycopersicum} SCOP: c.1.8.3 Probab=55.62 E-value=7.1 Score=15.64 Aligned_cols=22 Identities=14% Similarity=-0.017 Sum_probs=15.8 Q ss_pred HHHHHHHHHHHCCCEEEECCCC Q ss_conf 9999999998189899808830 Q 537021.9.peg.7 197 ESQVVQLAYTHELPLVATNNSL 218 (1033) Q Consensus 197 ~~~l~~lA~~~~iPlVaTndv~ 218 (1033) ..++++.|+++|+.++.+...+ T Consensus 87 ~d~~v~~a~~~Gi~v~~~~~~~ 108 (373) T 1rh9_A 87 LDFVISEAKKYGIHLIMSLVNN 108 (373) T ss_dssp HHHHHHHHHHTTCEEEEECCBS T ss_pred HHHHHHHHHHCCCEEEEECCCC T ss_conf 9999999987799899953655 No 68 >1xrs_B D-lysine 5,6-aminomutase beta subunit; TIM barrel, rossmann domain, PLP, cobalamin, 5'- deoxyadenosine, radical, adenosylcobalamin; HET: B12 PLP 5AD; 2.80A {Clostridium sticklandii} SCOP: c.23.6.1 d.230.4.1 Probab=55.28 E-value=7.1 Score=15.60 Aligned_cols=20 Identities=10% Similarity=0.172 Sum_probs=12.6 Q ss_pred HHHHHHHHHHCCCEEEECCC Q ss_conf 99999999818989980883 Q 537021.9.peg.7 198 SQVVQLAYTHELPLVATNNS 217 (1033) Q Consensus 198 ~~l~~lA~~~~iPlVaTndv 217 (1033) +.+++.+.+.+.-+|+.+-. T Consensus 170 ee~v~~a~e~~aD~VgvS~l 189 (262) T 1xrs_B 170 EDFIKKAVELEADVLLVSQT 189 (262) T ss_dssp HHHHHHHHHTTCSEEEEECC T ss_pred HHHHHHHHHCCCCEEEEEEE T ss_conf 99999998649999999752 No 69 >2q02_A Putative cytoplasmic protein; NP_463296.1, structural genomics, joint center for structural genomics, JCSG; 2.40A {Salmonella typhimurium LT2} SCOP: c.1.15.4 Probab=53.33 E-value=7.6 Score=15.36 Aligned_cols=49 Identities=8% Similarity=0.156 Sum_probs=36.4 Q ss_pred CCCHHHHHHHHHHCCCCEEEEEC---CCHH---HHHHHHHHHHHHCCCCEEEEEEE Q ss_conf 29989999999978998899811---6517---74999999999769963899999 Q 537021.9.peg.7 26 ALSLNNILDKIAADQQPAIAITD---TNNL---FSALEFSQKACAAGIQPIIGCQL 75 (1033) Q Consensus 26 ~~~~e~LV~~A~~~G~~AiAITD---~~nl---~Gav~F~~ackk~gIKPIiG~E~ 75 (1033) -.++++.++.|++.|+.+|-|.. ...+ ..+-++-+.++++||+. .++.. T Consensus 18 ~l~l~e~i~~a~e~Gfd~IEl~~~~~~~~~~~~~~~~~~~~~l~~~gl~i-~~l~~ 72 (272) T 2q02_A 18 GLSIEAFFRLVKRLEFNKVELRNDMPSGSVTDDLNYNQVRNLAEKYGLEI-VTINA 72 (272) T ss_dssp TSCHHHHHHHHHHTTCCEEEEETTSTTSSTTTTCCHHHHHHHHHHTTCEE-EEEEE T ss_pred CCCHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCE-EEECC T ss_conf 98999999999981999899824666533347660999999999859947-88336 No 70 >3jrx_A Acetyl-COA carboxylase 2; BC domain, soraphen A, alternative splicing, ATP-binding, biotin, fatty acid biosynthesis, ligase, lipid synthesis; HET: S1A; 2.50A {Homo sapiens} PDB: 3jrw_A* Probab=53.12 E-value=7.7 Score=15.34 Aligned_cols=86 Identities=9% Similarity=0.039 Sum_probs=43.7 Q ss_pred HCCCHHHHHHHHHH-CCCCEEEEECCCHHHHHHHHHHHHHHC---------CCCEEEEEEEEEECC-CCCCCCCCCCCCC Q ss_conf 32998999999997-899889981165177499999999976---------996389999998547-6643344565666 Q 537021.9.peg.7 25 GALSLNNILDKIAA-DQQPAIAITDTNNLFSALEFSQKACAA---------GIQPIIGCQLDIDMQ-DNLKMTNQGKTPF 93 (1033) Q Consensus 25 g~~~~e~LV~~A~~-~G~~AiAITD~~nl~Gav~F~~ackk~---------gIKPIiG~E~~v~~~-~~~~~~~~~~~~~ 93 (1033) ...+|++.+.+.-. .-++-|-|+..| =-|++..++|++. ||+.+ -+|...+ +... ...+ T Consensus 39 ~~~~~~~~~~~~~g~~~~~kvLIANRG--EiA~Ri~Ra~r~~g~e~~~~e~gi~tv---avys~~D~~~~a--~~v~--- 108 (587) T 3jrx_A 39 TVASPAEFVTRFGGDRVIEKVLIANNG--IAAVKCMRSIRRWAYEMFRNERAIRFV---VMVTPEDLKANA--EYIK--- 108 (587) T ss_dssp CCSSHHHHHHHTTCCCCCCEEEECCCH--HHHHHHHHHHHHHHHHHHSCTTSSEEE---EEECHHHHHTTC--HHHH--- T ss_pred CCCCHHHHHHHCCCCCCEEEEEEECCC--HHHHHHHHHHHHHHHCCCCCCCCCEEE---EEECCCCCCCCC--HHHH--- T ss_conf 437989998743898433269997784--999999999777532234201596089---998752223687--3476--- Q ss_pred CCCCEEEEE--ECCHHHHHHHHHHHHHHHH Q ss_conf 677539999--5696899999999989973 Q 537021.9.peg.7 94 PTVPSIVLL--VSTAEGYQRLIELVSRMYL 121 (1033) Q Consensus 94 ~~~~~lvlL--AkN~~Gy~nL~kL~S~a~~ 121 (1033) ..-+.+.+ +.+.+-|-|.-.++..+.. T Consensus 109 -~AD~~v~i~~~~~~~sYlni~~Ii~iA~~ 137 (587) T 3jrx_A 109 -MADHYVPVPGGPNNNNYANVELIVDIAKR 137 (587) T ss_dssp -HSSEEEECCCSSGGGTTTCHHHHHHHHHH T ss_pred -HCCEEEECCCCCHHHCCCCHHHHHHHHHH T ss_conf -39788872898745220499999999999 No 71 >3bbn_K Ribosomal protein S11; small ribosomal subunit, spinach chloroplast ribosome, ribonucleoprotein particle, macromolecular complex; 9.40A {Spinacea oleracea} SCOP: i.1.1.1 Probab=52.65 E-value=1.6 Score=21.00 Aligned_cols=27 Identities=19% Similarity=0.397 Sum_probs=20.0 Q ss_pred CCCCCCHHHHHHHH---HHHCCCCCHHHCC Q ss_conf 34444204667765---5440375633323 Q 537021.9.peg.7 375 GPGRGSGAGSVTAY---ALTITDIDPLRFS 401 (1033) Q Consensus 375 GpGRGSAagSLVaY---~LgIT~VDPi~~~ 401 (1033) ||||-+|--+|..- ..-|+|+-|+.|| T Consensus 101 G~GR~~air~L~~~gl~I~~I~D~TpiphN 130 (140) T 3bbn_K 101 GLGRDAALRAIRRSGILLSFVRDVTPMPHN 130 (140) T ss_dssp STTSSHHHHHHHTTTCEEEEEEECCCCCCS T ss_pred CCCHHHHHHHHHHCCCEEEEEEECCCCCCC T ss_conf 864699999998689889999983999989 No 72 >1tg7_A Beta-galactosidase; TIM barrel domain, glycoside hydrolase, family GH35, glycoprotein; HET: NAG BMA MAN; 1.90A {Penicillium SP} SCOP: b.149.1.1 b.18.1.27 b.18.1.27 b.71.1.5 c.1.8.14 PDB: 1xc6_A* Probab=51.99 E-value=7.9 Score=15.20 Aligned_cols=50 Identities=20% Similarity=0.218 Sum_probs=38.7 Q ss_pred HHHHHHHHHHCCCCEEEE-------------ECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEE Q ss_conf 899999999789988998-------------11651774999999999769963899999985 Q 537021.9.peg.7 29 LNNILDKIAADQQPAIAI-------------TDTNNLFSALEFSQKACAAGIQPIIGCQLDID 78 (1033) Q Consensus 29 ~e~LV~~A~~~G~~AiAI-------------TD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~ 78 (1033) =++.++++|..|+.+|+. -|-....-..+|.+.|+++|+.-|+=.-=|++ T Consensus 38 W~d~l~K~KA~G~N~VstYv~Wn~HEp~~G~~df~G~~Dl~~F~~la~e~GLyvilRpGPYi~ 100 (971) T 1tg7_A 38 YIDIFEKVKALGFNCVSFYVDWALLEGNPGHYSAEGIFDLQPFFDAAKEAGIYLLARPGPYIN 100 (971) T ss_dssp HHHHHHHHHTTTCCEEEEECCHHHHCSBTTBCCCCGGGCSHHHHHHHHHHTCEEEEECCSCCC T ss_pred HHHHHHHHHHCCCCEEEEECCCCCCCCCCCEECCCCHHHHHHHHHHHHHCCCEEEECCCCEEE T ss_conf 899999999709988998467645579688851254688999999999859989987688846 No 73 >2zu6_B Programmed cell death protein 4; protein-protein complex, ATP-binding, helicase, hydrolase, initiation factor, nucleotide-binding; 2.80A {Homo sapiens} PDB: 3eij_A Probab=51.90 E-value=8 Score=15.19 Aligned_cols=79 Identities=11% Similarity=0.119 Sum_probs=54.4 Q ss_pred HHHHHHHHHHCCHHH-HHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCC Q ss_conf 877898986063055-7888999998510476433444420466776554403756333231056762100145787533 Q 537021.9.peg.7 342 DFELNVIARMKFSGY-FLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERFLNPDRMSMPDFDI 420 (1033) Q Consensus 342 e~EL~vI~~~gf~~Y-FLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDi 420 (1033) +.-...|.+++.+.| .-+|...|..+-++. +.-|--+.+.|..++..=.-|.+=+.--=|+|.+ .++|||=+ T Consensus 183 ~EA~~cl~eL~~p~fhhe~V~~~i~~~lE~~---~~~~~~~v~~LL~~L~~~~~is~~q~~~Gf~~v~----~~l~Di~l 255 (307) T 2zu6_B 183 SEAEHCLKELEVPHFHHELVYEAIIMVLEST---GESTFKMILDLLKSLWKSSTITVDQMKRGYERIY----NEIPDINL 255 (307) T ss_dssp HHHHHHHHHHCCGGGHHHHHHHHHHHHHTCC---SSHHHHHHHHHHHHHHHHCSSCHHHHHHHHHHHH----HHHHHHHH T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHHCC---CHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHH----HHCCHHHH T ss_conf 9999999975886013999999999999458---5768999999999998879978999999999998----64728634 Q ss_pred CCCHHHH Q ss_conf 3656878 Q 537021.9.peg.7 421 DFCQDRR 427 (1033) Q Consensus 421 Df~~~~R 427 (1033) |+|.... T Consensus 256 DiP~A~~ 262 (307) T 2zu6_B 256 DVPHSYS 262 (307) T ss_dssp HCTTHHH T ss_pred CCCCHHH T ss_conf 5665299 No 74 >1fob_A Beta-1,4-galactanase; B/A barrel, glycosyl hydrolase, family 53, CLAN GH-A; 1.80A {Aspergillus aculeatus} SCOP: c.1.8.3 PDB: 1fhl_A Probab=51.03 E-value=8.2 Score=15.09 Aligned_cols=45 Identities=18% Similarity=0.209 Sum_probs=20.1 Q ss_pred CHHHHHHHHHHCCCCEEEE------E-CCCHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 9899999999789988998------1-1651774999999999769963899 Q 537021.9.peg.7 28 SLNNILDKIAADQQPAIAI------T-DTNNLFSALEFSQKACAAGIQPIIG 72 (1033) Q Consensus 28 ~~e~LV~~A~~~G~~AiAI------T-D~~nl~Gav~F~~ackk~gIKPIiG 72 (1033) +.+++.+.-++.|+.+|=| . +++++.-+.++.+.|+++|+|.++- T Consensus 28 ~~~d~~~~lk~~G~N~VRi~vW~~P~~g~~~~~~~~~~~~~a~~~Gm~v~ld 79 (334) T 1fob_A 28 QTQALETILADAGINSIRQRVWVNPSDGSYDLDYNLELAKRVKAAGMSLYLD 79 (334) T ss_dssp CBCCHHHHHHHHTCCEEEEEECSCCTTCTTCHHHHHHHHHHHHHTTCEEEEE T ss_pred CCCCHHHHHHHCCCCEEEEEEEECCCCCCCCHHHHHHHHHHHHHCCCEEEEE T ss_conf 7345999999849997998541279888679999999999999889989996 No 75 >1jmc_A Protein (replication protein A (RPA)); human ssDNA binding replication protein A(RPA), single stranded DNA-binding protein, protein-ssDNA complex; HET: DNA; 2.40A {Homo sapiens} SCOP: b.40.4.3 b.40.4.3 PDB: 1fgu_A Probab=50.42 E-value=8.4 Score=15.02 Aligned_cols=54 Identities=7% Similarity=0.170 Sum_probs=29.1 Q ss_pred CCCCHHHHHCCCCCCEEEEEEEEEEEEEEC---CC-CCC--EEEEEEEECCCCE-EEEEEC Q ss_conf 897688985055898899999996112101---58-887--6999999849704-999953 Q 537021.9.peg.7 979 GIKGYEESVATIRGNNIRLAAMVVSKQQKK---TR-KGS--RIGWVTFSEPAKE-YEANFF 1032 (1033) Q Consensus 979 ~~~~~~~~~~~~~g~~v~iaG~V~~i~~~~---TK-kG~--~mafltleD~~g~-~e~~iF 1032 (1033) ...++.++.+...+..+.+.|+|+.+.... +| .|+ ...-+.|.|.++. +.+++| T Consensus 125 ~f~~i~di~~~~~~~~~DviG~v~~v~~~~~~~~k~~~~~~~~~~i~l~D~s~~~i~~tlW 185 (246) T 1jmc_A 125 DFTGIDDLENKSKDSLVDIIGICKSYEDATKITVRSNNREVAKRNIYLMDTSGKVVTATLW 185 (246) T ss_dssp CBCCGGGGGGSCTTCEEEEEEEEEEECCCEEEEETTTTEEEEEEEEEEECTTCCEEEEEEE T ss_pred CCCCHHHHHCCCCCCCCCEEEEEEEEECCEEEEECCCCCCCEEEEEEEEECCCCEEEEEEE T ss_conf 0145999750057886418999989746399997468975279999997079999999998 No 76 >1z00_A DNA excision repair protein ERCC-1; helix-hairpin-helix, hydrolase; HET: DNA; NMR {Homo sapiens} SCOP: a.60.2.5 Probab=49.92 E-value=7.8 Score=15.28 Aligned_cols=51 Identities=24% Similarity=0.372 Sum_probs=30.6 Q ss_pred EEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCHHHHH Q ss_conf 98502685896989999999998379999978999734824799899998886767234790389999 Q 537021.9.peg.7 835 IYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLVFAGALDCFGYSRMQLL 902 (1033) Q Consensus 835 Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li~aGAfD~~~~~R~~ll 902 (1033) +...|..|+|||...+..|++ .|.|+.+++.- +...|. +.+.+|....+-+ T Consensus 17 ~~~~L~~I~gIG~~~a~~L~~-----~Fgsl~~i~~A------s~eeL~------~i~GiG~~~A~~I 67 (89) T 1z00_A 17 VTECLTTVKSVNKTDSQTLLT-----TFGSLEQLIAA------SREDLA------LCPGLGPQKARRL 67 (89) T ss_dssp HHHHHTTSSSCCHHHHHHHHH-----HTCBHHHHHHC------CHHHHH------TSTTCCHHHHHHH T ss_pred HHHHHCCCCCCCHHHHHHHHH-----HHCCHHHHHHH------HHHHHH------CCCCCCHHHHHHH T ss_conf 999875899759999999999-----94888999998------787772------6899699999999 No 77 >2f6h_X Myosin-2, type V myosin; mysoin V, cargo binding, cargo transport, vacuole binding, secreatory vescIle binding, structural protein; 2.25A {Saccharomyces cerevisiae} Probab=49.03 E-value=8.7 Score=14.86 Aligned_cols=86 Identities=12% Similarity=0.120 Sum_probs=43.4 Q ss_pred HHHHHHCCCCCCCCHHHH---------HHHHHHHHHCCCHHHHHHHHHHHCCC----CCCHHHHHHHHHHHHHHHCCCCH Q ss_conf 673341033553001256---------89887886326841307889762012----11016778789999986249999 Q 537021.9.peg.7 677 LIDPMLKETQGVIIYQEQ---------VMQIAQLLSGYSLSEADVLRRAMGKK----IKEEMDKQKERFISGASKNGISK 743 (1033) Q Consensus 677 ~l~~iL~~T~Gv~vyQEQ---------vm~ia~~~ag~~~~ead~lRra~~kk----~~~~~~~~~~~f~~g~~~~g~~~ 743 (1033) .|+++...+|+.+|+.=+ .|.-...+.|++.++....|.-.... ..+.+-..-..+..-+..+++++ T Consensus 146 ~L~~l~~~iy~~lv~~l~k~L~~~l~pail~~~~~~g~~~~~~~~~~~~~~~~~~~~s~~~Il~~L~~~~~~L~~~~v~~ 225 (419) T 2f6h_X 146 DFEALSYNIYNIWLKKLQKQLQKKAINAVVISESLPGFSAGETSGFLNKIFANTEEYTMDDILTFFNSIYWCMKSFHIEN 225 (419) T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTSCSSTTTTC---------------CCCHHHHHHHHHHHHHHHHHTTCCH T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCH T ss_conf 99999999999999999999899889998366555677666555655666667775689999999999999999859999 Q ss_pred HHHHHHHHHHHHH-HHCCCC Q ss_conf 9999999999975-000133 Q 537021.9.peg.7 744 TIAVNIFELLAKF-ADYGFN 762 (1033) Q Consensus 744 ~~a~~i~~~i~~f-~~Y~Fn 762 (1033) ..+..+|.++-.| ..-.|| T Consensus 226 ~l~~qif~qlf~~in~~lFN 245 (419) T 2f6h_X 226 EVFHAVVTTLLNYVDAICFN 245 (419) T ss_dssp HHHHHHHHHHHHHHHHHHHH T ss_pred HHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999899999 No 78 >3mmt_A Fructose-bisphosphate aldolase; ssgcid, structural genomics, seattle structural GE center for infectious disease, hydrolase; HET: 2FP; 2.35A {Bartonella henselae} Probab=48.67 E-value=8.8 Score=14.82 Aligned_cols=25 Identities=20% Similarity=0.222 Sum_probs=16.5 Q ss_pred HHHHHHHCCCCCCCEEEECCCCHHH Q ss_conf 0112210024474403352566666 Q 537021.9.peg.7 517 KLEGLYRHASTHAAGIVIGDRPLSQ 541 (1033) Q Consensus 517 ~leg~~r~~~~Ha~Gvvi~~~~l~~ 541 (1033) .|---|..++.|-+|+|+..+.+.. T Consensus 55 ~l~tsp~~~~~~IsGvILfeeTl~q 79 (347) T 3mmt_A 55 MLFTAKEAMESAISGVILFDETLRQ 79 (347) T ss_dssp HHHTCHHHHHHTEEEEEECHHHHTC T ss_pred HHCCCCHHHHCCEEEEECCCHHHHC T ss_conf 7716854765566489755235422 No 79 >1cuk_A RUVA protein; DNA repair, SOS response, DNA-binding, DNA recombination, helicase; 1.90A {Escherichia coli} SCOP: a.5.1.1 a.60.2.1 b.40.4.2 PDB: 1hjp_A 1bdx_A* 1c7y_A 1d8l_A Probab=47.86 E-value=9.1 Score=14.72 Aligned_cols=16 Identities=31% Similarity=0.700 Sum_probs=8.4 Q ss_pred HHHHCCCCHHHHHHHH Q ss_conf 2685896989999999 Q 537021.9.peg.7 839 LAAIKGVGTTTARHIM 854 (1033) Q Consensus 839 l~~Ikgvg~~~~~~Iv 854 (1033) |+.|+|||.++|++|+ T Consensus 110 L~~vpGIGkKtA~rIi 125 (203) T 1cuk_A 110 LVKLPGIGKKTAERLI 125 (203) T ss_dssp HHTSTTCCHHHHHHHH T ss_pred HHCCCCCCHHHHHHHH T ss_conf 6069987799999999 No 80 >3n4f_A Mandelate racemase/muconate lactonizing protein; structural genomics, protein structure INI NEW YORK structural genomix research consortium; HET: MSE; 1.88A {Geobacillus SP} Probab=47.39 E-value=9.2 Score=14.67 Aligned_cols=76 Identities=14% Similarity=0.330 Sum_probs=40.3 Q ss_pred HHHHHHHHHCCCCCCCCCC-CCHHHHHHH--HHHHCCCCCHHHC-CCHHHHHH------HHCCCCCCC-----CCCCCCH Q ss_conf 8999998510476433444-420466776--5544037563332-31056762------100145787-----5333656 Q 537021.9.peg.7 360 VADFIQWAKKNNIPVGPGR-GSGAGSVTA--YALTITDIDPLRF-SLLFERFL------NPDRMSMPD-----FDIDFCQ 424 (1033) Q Consensus 360 V~Div~~ak~~gI~vGpGR-GSAagSLVa--Y~LgIT~VDPi~~-~LlFERFL------np~R~~~PD-----IDiDf~~ 424 (1033) ...+...|+..|+.+.|+= ||+-|...+ .+..+.....++| .+++|.++ .-++..+|| |++| T Consensus 294 ~~~ia~~A~~~gi~v~~h~~~~~i~~aa~~hl~aa~~n~~~~E~~~~~~~~~~~~~~~i~~G~~~~p~~PGLGve~d--- 370 (392) T 3n4f_A 294 WMELGEKLDAHGLRSAPHCYGNAYGIYASGHLSAAVRNFEFVEYDDITIEGMDVSGYRIENGEIHVPATPGFGIVFD--- 370 (392) T ss_dssp HHHHHHHHHHTTCEECCBCCSCHHHHHHHHHHGGGCTTBCCEEECCEEETTEECTTCEEETTEEEECCSSBTSCEEC--- T ss_pred HHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHCCCCCEEECCCHHHHHCCCCCCEEECCEEECCCCCCCCEEEC--- T ss_conf 99999999987991875079868999999999996899970332632465544689876799998999999763589--- Q ss_pred HHHHHHHHHHHHHCCCC Q ss_conf 87899999998303802 Q 537021.9.peg.7 425 DRRDEVIRYVQNKYGHE 441 (1033) Q Consensus 425 ~~R~~vi~y~~~kyG~~ 441 (1033) +++++|+.+|++.. T Consensus 371 ---ee~l~~~~~~~~w~ 384 (392) T 3n4f_A 371 ---DELVTYLINRSGWS 384 (392) T ss_dssp ---HHHHHHHHHHHSEE T ss_pred ---HHHHHHHHCCCCCC T ss_conf ---99998662752453 No 81 >3d3a_A Beta-galactosidase; protein structure initiative II, PSI II, NYSGXRC, 11092F, structural genomics; 2.15A {Bacteroides thetaiotaomicron vpi-5482} Probab=46.59 E-value=9.4 Score=14.58 Aligned_cols=50 Identities=14% Similarity=0.038 Sum_probs=37.6 Q ss_pred HHHHHHHHHHCCCCEEEE-----E--------CCCHHHHHHHHHHHHHHCCCCEEEEEEEEEE Q ss_conf 899999999789988998-----1--------1651774999999999769963899999985 Q 537021.9.peg.7 29 LNNILDKIAADQQPAIAI-----T--------DTNNLFSALEFSQKACAAGIQPIIGCQLDID 78 (1033) Q Consensus 29 ~e~LV~~A~~~G~~AiAI-----T--------D~~nl~Gav~F~~ackk~gIKPIiG~E~~v~ 78 (1033) -++.++++|++|+.+|-+ . |-.....+.+|+++|.+.|+.-|.-.--|++ T Consensus 39 w~~~l~~~K~~G~N~Vrtyv~Wn~HeP~~g~~df~g~~dl~~fl~la~e~GL~Vilr~gpyi~ 101 (612) T 3d3a_A 39 WEHRIKMCKALGMNTICLYVFWNFHEPEEGRYDFAGQKDIAAFCRLAQENGMYVIVRPGPYVC 101 (612) T ss_dssp HHHHHHHHHHHTCCEEEEECCHHHHCSSTTCCCCSGGGCHHHHHHHHHHTTCEEEEECCSCCC T ss_pred HHHHHHHHHHCCCCEEEECCCHHHCCCCCCCCCCCCCCCHHHHHHHHHHCCCEEEEECCCCCC T ss_conf 999999999729989996063521388788157544015999999999859999995587456 No 82 >3kws_A Putative sugar isomerase; structural genomics, joint center for structural genomics, JCSG, protein structure initiative; HET: MSE; 1.68A {Parabacteroides distasonis atcc 8503} Probab=46.03 E-value=9.6 Score=14.51 Aligned_cols=45 Identities=18% Similarity=-0.023 Sum_probs=37.0 Q ss_pred CCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEE Q ss_conf 299899999999789988998116517749999999997699638 Q 537021.9.peg.7 26 ALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPI 70 (1033) Q Consensus 26 ~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPI 70 (1033) ..++++.++.++++|+.+|=|.....-.-.-++.++++++|++.. T Consensus 37 ~~~l~e~l~~~~~~GfdgiEl~~~~~~~~~~~~k~~l~~~gl~v~ 81 (287) T 3kws_A 37 GESLNEKLDFMEKLGVVGFEPGGGGLAGRVNEIKQALNGRNIKVS 81 (287) T ss_dssp CSSHHHHHHHHHHTTCCEEECBSTTCGGGHHHHHHHHTTSSCEEC T ss_pred CCCHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHHCCEEE T ss_conf 999999999999809998997897436459999999998399799 No 83 >1uuq_A Mannosyl-oligosaccharide glucosidase; hydrolase, mannosidase, mannan, glycoside hydrolase, family 5; 1.5A {Cellvibrio mixtus} SCOP: c.1.8.3 PDB: 1uz4_A* Probab=45.79 E-value=9.7 Score=14.48 Aligned_cols=24 Identities=8% Similarity=0.092 Sum_probs=18.1 Q ss_pred HHHHHHHHHHHHCCCEEEECCCCC Q ss_conf 999999999981898998088300 Q 537021.9.peg.7 196 RESQVVQLAYTHELPLVATNNSLF 219 (1033) Q Consensus 196 ~~~~l~~lA~~~~iPlVaTndv~Y 219 (1033) ....++..|++.||.++.+-..+. T Consensus 112 ~lD~~i~~a~~~Gi~vil~l~~~w 135 (440) T 1uuq_A 112 GLDYLLVELAKRDMTVVLYFNNFW 135 (440) T ss_dssp HHHHHHHHHHHTTCEEEEECCBSS T ss_pred HHHHHHHHHHHCCCCCEECCCCCC T ss_conf 999999999976993045034344 No 84 >3mqt_A Mandelate racemase/muconate lactonizing protein; PSI-II, NYSGXRC, muconate lactonizing EN structural genomics, protein structure initiative; 2.10A {Shewanella pealeana} Probab=45.64 E-value=9.7 Score=14.47 Aligned_cols=86 Identities=19% Similarity=0.281 Sum_probs=43.2 Q ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCC-CCH----HHHHHHHHHHCCC----CCHHH-CCCHHHHH-HHH-----CCCCCC Q ss_conf 30557888999998510476433444-420----4667765544037----56333-23105676-210-----014578 Q 537021.9.peg.7 353 FSGYFLIVADFIQWAKKNNIPVGPGR-GSG----AGSVTAYALTITD----IDPLR-FSLLFERF-LNP-----DRMSMP 416 (1033) Q Consensus 353 f~~YFLiV~Div~~ak~~gI~vGpGR-GSA----agSLVaY~LgIT~----VDPi~-~~LlFERF-Lnp-----~R~~~P 416 (1033) ..+=+.=...+.+.|++.|+.+.|+- ||+ |+.-++-++.... .+|.- ++.+.+.. .+| ++..+| T Consensus 285 ~~GGit~~~~i~~~A~~~gi~v~~h~~~~~i~~aa~~hl~aa~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~G~i~~p 364 (394) T 3mqt_A 285 RCGGVTELLRIMDICEHHNAQLMPHNWKTGITAAAARHFGIVCHISEYVEYLHPDFWNGTLTQQLTLNEPKIIDGAIEVS 364 (394) T ss_dssp TSSCHHHHHHHHHHHHHHTCEECCCCCSCHHHHHHHHHHHHHCTTCSCCEECCTTTCCCHHHHHTBSSCCCCBTTBBCCC T ss_pred CCCCCCHHHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHCCCCHHHHHCCCCCCCCCHHHHHCCCCCEEECCEEECC T ss_conf 07882589999999998698697428885799999999999767411466537411233024553368981568589899 Q ss_pred C---CCCCCCHHHHHHHHHHHHHHC-CCC Q ss_conf 7---533365687899999998303-802 Q 537021.9.peg.7 417 D---FDIDFCQDRRDEVIRYVQNKY-GHE 441 (1033) Q Consensus 417 D---IDiDf~~~~R~~vi~y~~~ky-G~~ 441 (1033) | +-+++. .+.+-+|..+|| |.. T Consensus 365 ~~PGLGieid---~d~l~~~~~~~~~~~~ 390 (394) T 3mqt_A 365 DKPGLGIELN---IEFVEQVTGHKFEGHH 390 (394) T ss_dssp CCSBTCCCBC---HHHHHHHHCSCC---- T ss_pred CCCCCCEEEC---HHHHHHHCCCCCCCCC T ss_conf 9899976479---9999752289778867 No 85 >3no1_A Isomerase, mandelate racemase/muconate lactonizing enzyme; enolase,metal-binding,PSI-II, NYSGXRC, structural genomics; 2.16A {Marine actinobacterium PHSC20C1} PDB: 3msy_A Probab=45.17 E-value=9.9 Score=14.41 Aligned_cols=31 Identities=6% Similarity=-0.033 Sum_probs=14.8 Q ss_pred HHHHHHHHHCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 8999998510476433444420466776554 Q 537021.9.peg.7 360 VADFIQWAKKNNIPVGPGRGSGAGSVTAYAL 390 (1033) Q Consensus 360 V~Div~~ak~~gI~vGpGRGSAagSLVaY~L 390 (1033) ...+...|+..|+.++|+=+++++.-++-.+ T Consensus 303 ~~~i~~~A~~~gi~v~~h~~~~~~~h~~aa~ 333 (398) T 3no1_A 303 WLRTAAIATSYDVQMGHHEEPQVSTHLLASQ 333 (398) T ss_dssp HHHHHHHHHHTTCEECCCSCHHHHHHHHHHS T ss_pred HHHHHHHHHHCCCEEECCCHHHHHHHHHHHC T ss_conf 9999999988898897475899999999974 No 86 >1jb7_A Telomere-binding protein alpha subunit; DNA-protein interactions, DNA hydration, sodium ION, quadruplex DNA, DNA-binding protein/DNA complex; 1.86A {Sterkiella nova} SCOP: b.40.4.3 b.40.4.3 b.40.4.3 PDB: 1kix_A 1otc_A* 2i0q_A 1ph6_A 1ph1_A 1phj_A* 1ph3_A 1pa6_A 1ph4_A 1ph7_A 1ph8_A 1ph9_A 1ph5_A* 1ph2_A* 1k8g_A Probab=44.85 E-value=10 Score=14.38 Aligned_cols=12 Identities=17% Similarity=0.240 Sum_probs=6.3 Q ss_pred CCCEEEEEECCH Q ss_conf 775399995696 Q 537021.9.peg.7 95 TVPSIVLLVSTA 106 (1033) Q Consensus 95 ~~~~lvlLAkN~ 106 (1033) .+-.++++|+|. T Consensus 96 ~~l~v~lFa~~~ 107 (495) T 1jb7_A 96 DYATLVLYAKRF 107 (495) T ss_dssp CCEEEEEEESSG T ss_pred CCEEEEEECCCH T ss_conf 870899974897 No 87 >1x2i_A HEF helicase/nuclease; alpha helix, helix-hairpin-helix DNA binding domain, homodimer, hydrolase; 1.45A {Pyrococcus furiosus} SCOP: a.60.2.5 Probab=44.50 E-value=8.5 Score=14.95 Aligned_cols=30 Identities=23% Similarity=0.379 Sum_probs=21.9 Q ss_pred EEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHH Q ss_conf 85026858969899999999983799999789997 Q 537021.9.peg.7 836 YYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCS 870 (1033) Q Consensus 836 r~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~ 870 (1033) ++-|..|+|||...+..|++. |.|+.++.. T Consensus 13 ~~~L~~IpgIG~~~a~~L~~~-----F~s~~~i~~ 42 (75) T 1x2i_A 13 RLIVEGLPHVSATLARRLLKH-----FGSVERVFT 42 (75) T ss_dssp HHHHTTSTTCCHHHHHHHHHH-----HCSHHHHHH T ss_pred HHHHCCCCCCCHHHHHHHHHH-----HCCHHHHHH T ss_conf 999848999429999999999-----688898999 No 88 >1qop_A Tryptophan synthase alpha chain; lyase, carbon-oxygen lyase, tryptophan biosynthesis, pyridoxal phosphate; HET: IPL PLP; 1.4A {Salmonella typhimurium} SCOP: c.1.2.4 PDB: 1k8x_A* 1wbj_A* 2clk_A* 2j9z_A* 3cep_A* 1k8y_A* 1a5s_A* 1a50_A* 1c29_A* 1c8v_A* 1c9d_A* 1bks_A* 1cx9_A* 1fuy_A* 1cw2_A* 1k7e_A* 1k7f_A* 1k7x_A* 1k3u_A* 1k8z_A* ... Probab=44.48 E-value=10 Score=14.33 Aligned_cols=44 Identities=14% Similarity=0.279 Sum_probs=26.7 Q ss_pred CCCHHHHHHHHHHCCCCEE----EEECCCHHHHHHHHHHHHHHCCCCEE Q ss_conf 2998999999997899889----98116517749999999997699638 Q 537021.9.peg.7 26 ALSLNNILDKIAADQQPAI----AITDTNNLFSALEFSQKACAAGIQPI 70 (1033) Q Consensus 26 ~~~~e~LV~~A~~~G~~Ai----AITD~~nl~Gav~F~~ackk~gIKPI 70 (1033) ++++++..+..++.|-+|+ -.-|. ++-...++..++.++|.--| T Consensus 1 M~ri~~~f~~~k~~~~~ali~y~taG~P-~~~~~~~~l~~l~~~GaD~i 48 (268) T 1qop_A 1 MERYENLFAQLNDRREGAFVPFVTLGDP-GIEQSLKIIDTLIDAGADAL 48 (268) T ss_dssp CCHHHHHHHHHHHTTCCEEEEEEETTSS-CHHHHHHHHHHHHHTTCSSE T ss_pred CCHHHHHHHHHHHCCCCEEEEEEECCCC-CHHHHHHHHHHHHHCCCCEE T ss_conf 9738999999987499368988817079-87999999999997799999 No 89 >2a1j_B DNA excision repair protein ERCC-1; XPF, xeroderma pigmentosum, DNA repair, endonuclease, helix-hairpin-helix, DNA binding protein; HET: DNA; 2.70A {Homo sapiens} SCOP: a.60.2.5 Probab=44.31 E-value=7.9 Score=15.22 Aligned_cols=30 Identities=27% Similarity=0.484 Sum_probs=21.2 Q ss_pred EEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHH Q ss_conf 98502685896989999999998379999978999 Q 537021.9.peg.7 835 IYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFC 869 (1033) Q Consensus 835 Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~ 869 (1033) +...|+.|+|||...+..|++ .|.|+.+++ T Consensus 30 ~~~~L~~I~gIGk~~A~~L~~-----~F~Si~~l~ 59 (91) T 2a1j_B 30 VTECLTTVKSVNKTDSQTLLT-----TFGSLEQLI 59 (91) T ss_dssp HHHHHTTSTTCCHHHHHHHHH-----HHSSHHHHH T ss_pred HHHHHCCCCCCCHHHHHHHHH-----HHCCCHHHH T ss_conf 999864688859999999999-----958829899 No 90 >2vpq_A Acetyl-COA carboxylase; bacteria, ATP-grAsp domain, biotin carboxylase, ligase; HET: ANP; 2.1A {Staphylococcus aureus} Probab=43.86 E-value=10 Score=14.26 Aligned_cols=27 Identities=19% Similarity=0.251 Sum_probs=18.0 Q ss_pred EEEEECCCHHHHHHHHHHHHHHCCCCEEE Q ss_conf 89981165177499999999976996389 Q 537021.9.peg.7 43 AIAITDTNNLFSALEFSQKACAAGIQPII 71 (1033) Q Consensus 43 AiAITD~~nl~Gav~F~~ackk~gIKPIi 71 (1033) -|-|...| ==+++..++|++.||++|. T Consensus 3 kiLIanrG--eiA~riira~relGi~tVa 29 (451) T 2vpq_A 3 KVLIANRG--EIAVRIIRACRDLGIQTVA 29 (451) T ss_dssp EEEECCCH--HHHHHHHHHHHHTTCEEEE T ss_pred EEEEECCC--HHHHHHHHHHHHCCCEEEE T ss_conf 89998970--9999999999987995999 No 91 >2dh2_A 4F2 cell-surface antigen heavy chain; TIM-barrel, glycosidase like, antiparallel beta-sheet, greek KEY, C-terminal domain, extracellular domain; 2.10A {Homo sapiens} PDB: 2dh3_A Probab=43.56 E-value=10 Score=14.23 Aligned_cols=49 Identities=16% Similarity=0.095 Sum_probs=32.8 Q ss_pred HHHCCCHHHHH---HHHHHCCCCEEEEEC--------------------CCHHHHHHHHHHHHHHCCCCEEE Q ss_conf 44329989999---999978998899811--------------------65177499999999976996389 Q 537021.9.peg.7 23 LEGALSLNNIL---DKIAADQQPAIAITD--------------------TNNLFSALEFSQKACAAGIQPII 71 (1033) Q Consensus 23 ldg~~~~e~LV---~~A~~~G~~AiAITD--------------------~~nl~Gav~F~~ackk~gIKPIi 71 (1033) .||+...+.++ ..-+++|+.||=|+= .|++-..-+|.++|.+.|||-|+ T Consensus 29 ~dg~Gd~~g~~~kLdyl~~LGv~~i~l~Pi~~~~~~~y~~~dy~~vd~~~Gt~~dfk~lV~~aH~~gi~Vil 100 (424) T 2dh2_A 29 GHGAGNLAGLKGRLDYLSSLKVKGLVLGPIHKNQKDDVAQTDLLQIDPNFGSKEDFDSLLQSAKKKSIRVIL 100 (424) T ss_dssp CTTCCSHHHHHTTHHHHHHTTCSEEEECCCEEECTTCSTTEEEEEECGGGCCHHHHHHHHHHHHHTTCEEEE T ss_pred CCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEE T ss_conf 998649999998517998759998998998728889988768742185569999999999999862888999 No 92 >3obe_A Sugar phosphate isomerase/epimerase; structural genomics, joint center for structural genomics, J protein structure initiative, PSI; HET: MSE; 1.70A {Parabacteroides distasonis} Probab=43.15 E-value=11 Score=14.18 Aligned_cols=58 Identities=17% Similarity=0.309 Sum_probs=42.0 Q ss_pred HHHHHHCHHHHHHHCC-CHHHHHHHHHHCCCCEEEEEC--------------CCHHHHHHHHHHHHHHCCCCEE Q ss_conf 3343314447644329-989999999978998899811--------------6517749999999997699638 Q 537021.9.peg.7 12 VHLQVHSSYSLLEGAL-SLNNILDKIAADQQPAIAITD--------------TNNLFSALEFSQKACAAGIQPI 70 (1033) Q Consensus 12 vhLh~hS~ySlldg~~-~~e~LV~~A~~~G~~AiAITD--------------~~nl~Gav~F~~ackk~gIKPI 70 (1033) ..+-++| ||+.+.+. .+++.++.+++.|+.+|=|.. .......-++.+.+.++|+++. T Consensus 21 ~~ig~~~-ys~~~~~~~dl~~~L~~~a~~G~~~VEl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~Gl~i~ 93 (305) T 3obe_A 21 KKMGLQT-YSLGQELLQDMPNGLNRLAKAGYTDLEIFGYREDTGKFGDYNPKNTTFIASKDYKKMVDDAGLRIS 93 (305) T ss_dssp CCCEEEG-GGGTHHHHTTHHHHHHHHHHHTCCEEEECCBCTTTCCBCCC----CCCBCHHHHHHHHHHTTCEEE T ss_pred CEEEEEE-EECCCCHHCCHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCEEE T ss_conf 6389998-453767113999999999984999799877666421123447411123209999999998498685 No 93 >2vqe_M 30S ribosomal protein S13, 30S ribosomal protein S6; tRNA-binding, rRNA-binding, metal-binding, zinc-finger, translation; HET: TM2 PAR; 2.5A {Thermus thermophilus} SCOP: a.156.1.1 PDB: 1gix_P* 1hnw_M* 1hnx_M* 1hnz_M* 1hr0_M 1ibk_M* 1ibl_M* 1ibm_M 1j5e_M 1jgo_P* 1jgp_P* 1jgq_P* 1mj1_P* 1ml5_P* 1n32_M* 1n33_M* 1n34_M 1n36_M 1xmo_M* 1xmq_M* ... Probab=43.12 E-value=3.9 Score=17.78 Aligned_cols=48 Identities=27% Similarity=0.394 Sum_probs=32.6 Q ss_pred CCCEEEEEHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHHHH Q ss_conf 198698502685896989999999998379999978999734824799899998886 Q 537021.9.peg.7 831 GDNRIYYSLAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESLVF 887 (1033) Q Consensus 831 ~~~~Ir~Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~Li~ 887 (1033) ++..|.++|..|+|||...+..|.+.-.=.|. .++ ..++..+++.|.. T Consensus 11 ~~K~v~~aLt~I~GIG~~~A~~Ic~~lGId~~-------~kv--g~Lt~~qi~~l~~ 58 (126) T 2vqe_M 11 RNKRVDVALTYIYGIGKARAKEALEKTGINPA-------TRV--KDLTEAEVVRLRE 58 (126) T ss_dssp CSSBHHHHHTTSSSCCSHHHHHHTTTTTCCTT-------SBG--GGCCHHHHHHHHH T ss_pred CCCEEEEEECCEECCCHHHHHHHHHHCCCCCC-------CCC--CCCCHHHHHHHHH T ss_conf 99787876204348389999999998597966-------301--6399999999999 No 94 >3kwm_A Ribose-5-phosphate isomerase A; structural genomics, IDP02119, center for structural genomics of infectious diseases, csgid; 2.32A {Francisella tularensis subsp} Probab=42.97 E-value=8.4 Score=15.01 Aligned_cols=50 Identities=14% Similarity=0.220 Sum_probs=23.5 Q ss_pred HHHHHHCCCCCCCCCCCCCCEEEEECC-CEEEEEHHHHCCCCHHHH-HHHHHH Q ss_conf 988886368503552334210035319-869850268589698999-999999 Q 537021.9.peg.7 806 CQDARQFNIQIMPPSVNTPCVDFKVGD-NRIYYSLAAIKGVGTTTA-RHIMEA 856 (1033) Q Consensus 806 i~e~~~~gi~vl~PdIN~S~~~f~~~~-~~Ir~Gl~~Ikgvg~~~~-~~Ive~ 856 (1033) ...|++.||++..++ ..+..+.++++ +.|-..|..|||=|.... |.|+.. T Consensus 63 ~~~~~~~Gi~l~~l~-~~~~iDl~iDGADeVd~~l~lIKGGGgallrEKiva~ 114 (224) T 3kwm_A 63 TRKLKALGFDVVDLN-YAGEIDLYIDGADECNNHKELIKGGGAALTREKICVA 114 (224) T ss_dssp HHHHHHTTCCBCCHH-HHCSEEEEEECCSEECTTSCEECCSSSCHHHHHHHHH T ss_pred HHHHHHCCCCCCCCC-CCCEEEEEEECCHHHCCCCCEEECCCHHHHHHHHHHH T ss_conf 999986289753613-3886699997703228775588316178999899999 No 95 >3ofo_K 30S ribosomal protein S11; protein biosynthesis, ribosomes, RNA, tRNA, transfer, eryThr ketolide, macrolide, antibiotic, EXIT, peptidyl; 3.10A {Escherichia coli} PDB: 3fih_K* 3iy8_K 2wwl_K 3ofp_K 3i1m_K 1vs7_K* 3e1a_C 3e1c_C 1vs5_K 3i1o_K 3i1q_K 3i1s_K 3i1z_K 3i21_K 3kc4_K 3or9_K 3ora_K 2qal_K* 1p6g_K 1p87_K ... Probab=42.96 E-value=8.8 Score=14.85 Aligned_cols=27 Identities=37% Similarity=0.641 Sum_probs=14.9 Q ss_pred CCCCCCHHHHHHHHH---HHCCCCCHHHCC Q ss_conf 344442046677655---440375633323 Q 537021.9.peg.7 375 GPGRGSGAGSVTAYA---LTITDIDPLRFS 401 (1033) Q Consensus 375 GpGRGSAagSLVaY~---LgIT~VDPi~~~ 401 (1033) ||||-+|--.|..-= .-|+|+-|+.|| T Consensus 78 G~gr~~~ik~l~~~gikI~~I~D~T~~phn 107 (117) T 3ofo_K 78 GPGRESTIRALNAAGFRITNITDVTPIPHN 107 (117) T ss_dssp STTTTHHHHHHHTTTCCCCCEEECCCCCSS T ss_pred CCCHHHHHHHHHHCCCEEEEEEECCCCCCC T ss_conf 887799999997589879999989999989 No 96 >3ipw_A Hydrolase TATD family protein; niaid, ssgcid, seattle structural genomics center for infectious disease, dysentery, liver abcess; 1.95A {Entamoeba histolytica hm-1} Probab=42.26 E-value=11 Score=14.08 Aligned_cols=62 Identities=11% Similarity=0.089 Sum_probs=48.7 Q ss_pred CCCCCCCCHHHHHHCHHHHHHHC--------CCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCC Q ss_conf 56689881334331444764432--------9989999999978998899811651774999999999769 Q 537021.9.peg.7 4 YLEKSPSFVHLQVHSSYSLLEGA--------LSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAG 66 (1033) Q Consensus 4 ~~~~~~~FvhLh~hS~ySlldg~--------~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~g 66 (1033) -|+-.+.|+-.|||=..+...|. ...++++++|++.|...+-++- .++...-+..+.|+++. T Consensus 21 ~~~~~~~~iDi~~nlt~~~f~g~y~g~~~h~~Dl~~Vl~rA~~~gV~~ii~~g-~~~~~~~~a~~La~~~~ 90 (325) T 3ipw_A 21 SMSMAQQFIDIGANLTDDNYFGNYHGKHYHEEDIDVVLQRAERNGLSHIIITS-GCLNDFKKAIEIINKYQ 90 (325) T ss_dssp --CCCCCEEEEEECTTSGGGGTEETTEECSCCCHHHHHHHHHHTTEEEEEECC-CSHHHHHHHHHHHHHHG T ss_pred CCCCCCCEEEEEECCCCHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEC-CCHHHHHHHHHHHHHCC T ss_conf 66566886899878899676786577647702699999999986999899923-99999999999998677 No 97 >1dly_A Hemoglobin; oxygen storage/transport complex; HET: HEM; 1.80A {Chlamydomonas eugametos} SCOP: a.1.1.1 PDB: 1uvx_A* Probab=41.96 E-value=11 Score=14.04 Aligned_cols=70 Identities=14% Similarity=0.103 Sum_probs=44.0 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHH---HCCC-CCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCC Q ss_conf 68988788632684130788976---2012-1101677878999998624999999999999999750001331 Q 537021.9.peg.7 694 QVMQIAQLLSGYSLSEADVLRRA---MGKK-IKEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNK 763 (1033) Q Consensus 694 Qvm~ia~~~ag~~~~ead~lRra---~~kk-~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~Fnk 763 (1033) |...++..++|=.......+|.+ |..+ .....+.+...|.+-+.+.|++++..++++..++.+...-.|+ T Consensus 88 ~~~fl~~~~GGp~~Y~g~~m~~~H~~l~~~It~~~fd~~l~~l~~al~e~gv~~~~~~ei~~~l~~lr~~Ivn~ 161 (164) T 1dly_A 88 QFAFLAYALGGASEWKGKDMRTAHKDLVPHLSDVHFQAVARHLSDTLTELGVPPEDITDAMAVVASTRTEVLNM 161 (164) T ss_dssp HHHHHHHHTTSSSCCCSCCHHHHHTTCSSCCCHHHHHHHHHHHHHHHHHTTCCHHHHHHHHHHHHTTHHHHTTC T ss_pred HHHHHHHHHCCCCCCCCCCHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCC T ss_conf 99999999289988899888999763557989999999999999999983999999999999999999998189 No 98 >2w70_A Biotin carboxylase; ligase, inhibitor, ATP-binding, fatty acid biosynthesis, nucleotide-binding, lipid synthesis, ATP-grAsp domain; HET: L22; 1.77A {Escherichia coli} PDB: 1bnc_A 2j9g_A* 2v58_A* 2v59_A* 2v5a_A* 2vr1_A* 2w6m_A* 1dv1_A* 2w6o_A* 2w6n_A* 2w6q_A* 2w6z_A* 2w6p_A* 2w71_A* 3jzf_A* 3jzi_A* 1dv2_A* 2gps_A 2gpw_A 3g8c_A* ... Probab=41.36 E-value=11 Score=13.98 Aligned_cols=68 Identities=16% Similarity=0.238 Sum_probs=34.4 Q ss_pred CEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEE--ECCHHHHHHHHHHHHHH Q ss_conf 8899811651774999999999769963899999985476643344565666677539999--56968999999999899 Q 537021.9.peg.7 42 PAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLL--VSTAEGYQRLIELVSRM 119 (1033) Q Consensus 42 ~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlL--AkN~~Gy~nL~kL~S~a 119 (1033) +-|-|.+.|- -|++..++|++-||+.|.- |-..+...... ...-+.+.+ +.+.+-|.|.-+++..+ T Consensus 3 ~kvLianrGe--iA~riira~relGi~tVav---~s~~D~~s~~v-------~~ADe~~~i~~~~~~~sYld~~~Ii~~a 70 (449) T 2w70_A 3 DKIVIANRGE--IALRILRACKELGIKTVAV---HSSADRDLKHV-------LLADETVCIGPAPSVKSYLNIPAIISAA 70 (449) T ss_dssp SEEEECCCHH--HHHHHHHHHHHHTCEEEEE---EEGGGTTCHHH-------HHSSEEEEEECSSGGGTTTCHHHHHHHH T ss_pred CEEEEECCCH--HHHHHHHHHHHCCCCEEEE---CCHHHHCCCCH-------HHCCEEEECCCCCHHHHHCCHHHHHHHH T ss_conf 4199989739--9999999999869939998---37044158897-------7798999848986242004899999999 Q ss_pred HH Q ss_conf 73 Q 537021.9.peg.7 120 YL 121 (1033) Q Consensus 120 ~~ 121 (1033) .. T Consensus 71 ~~ 72 (449) T 2w70_A 71 EI 72 (449) T ss_dssp HH T ss_pred HH T ss_conf 88 No 99 >3n6r_A Propionyl-COA carboxylase, alpha subunit; protein complex, biotin-dependent carboxylase, ligase; HET: BTI; 3.20A {Ruegeria pomeroyi} Probab=40.93 E-value=11 Score=13.93 Aligned_cols=15 Identities=27% Similarity=0.324 Sum_probs=5.9 Q ss_pred HHHHHHHHHCCCEEE Q ss_conf 999999981898998 Q 537021.9.peg.7 199 QVVQLAYTHELPLVA 213 (1033) Q Consensus 199 ~l~~lA~~~~iPlVa 213 (1033) ++++-+++.|+..|+ T Consensus 16 r~~r~~~~~gi~tv~ 30 (681) T 3n6r_A 16 RVIKTARKMGISTVA 30 (681) T ss_dssp HHHHHHGGGSCCBCC T ss_pred HHHHHHHHCCCEEEE T ss_conf 999999981993999 No 100 >1yz4_A DUSP15, dual specificity phosphatase-like 15 isoform A; hydrolase; HET: BOG; 2.40A {Homo sapiens} Probab=40.67 E-value=11 Score=13.90 Aligned_cols=44 Identities=27% Similarity=0.347 Sum_probs=30.3 Q ss_pred CHHHHHHHHHHHHHHHCCC--CCC--CCCCCCHHHHHHHHHHHCCCCC Q ss_conf 3055788899999851047--643--3444420466776554403756 Q 537021.9.peg.7 353 FSGYFLIVADFIQWAKKNN--IPV--GPGRGSGAGSVTAYALTITDID 396 (1033) Q Consensus 353 f~~YFLiV~Div~~ak~~g--I~v--GpGRGSAagSLVaY~LgIT~VD 396 (1033) +..||--+.++|.-++++| ++| -.|++=+|.=++||++.--..+ T Consensus 66 i~~~~~~~~~fI~~~~~~~~~VLVHC~~G~sRS~~vviaYLm~~~~~s 113 (160) T 1yz4_A 66 IKKHFKECINFIHCCRLNGGNCLVHSFAGISRSTTIVTAYVMTVTGLG 113 (160) T ss_dssp GGGGHHHHHHHHHHHHHTTCCEEEEETTSSSHHHHHHHHHHHHHHCCC T ss_pred HHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHHHHCCCC T ss_conf 999999999999998743773676846678754999999999983999 No 101 >1bf2_A Isoamylase; hydrolase, glycosidase, debranching enzyme; 2.00A {Pseudomonas amyloderamosa} SCOP: b.1.18.2 b.71.1.1 c.1.8.1 Probab=40.22 E-value=12 Score=13.84 Aligned_cols=14 Identities=21% Similarity=0.223 Sum_probs=6.4 Q ss_pred HHHHHHHHCCCCCC Q ss_conf 99999851047643 Q 537021.9.peg.7 361 ADFIQWAKKNNIPV 374 (1033) Q Consensus 361 ~Div~~ak~~gI~v 374 (1033) ..+|+-|+++||.| T Consensus 276 k~LV~~aH~~GI~V 289 (750) T 1bf2_A 276 QAMVQAFHNAGIKV 289 (750) T ss_dssp HHHHHHHHHTTCEE T ss_pred HHHHHHHHHCCCEE T ss_conf 99999998779989 No 102 >3p52_A NH(3)-dependent NAD(+) synthetase; structural genomics, center for structural genomics of infec diseases, NADE, CSGI; 2.74A {Campylobacter jejuni} Probab=39.95 E-value=12 Score=13.81 Aligned_cols=58 Identities=14% Similarity=0.274 Sum_probs=30.1 Q ss_pred CCC-EEEECCCCCCHHHHHHHCCCHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEE Q ss_conf 898-899819767967888877997899999999987659868999943991158999999999998189899 Q 537021.9.peg.7 141 TEG-LIMLTGGSSGPIDRAFFFNSSQVAEKRLLTFKKLFGDRLYVNLQRHRGYDRYRESQVVQLAYTHELPLV 212 (1033) Q Consensus 141 ~eg-LIvlsg~~~g~i~~~l~~~~~~~a~~~l~~l~~~Fgd~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlV 212 (1033) .++ ++.+|||.+|-+.-++ .++.+|++++--.+.++...+.-.....++|+.++++.. T Consensus 26 ~~~vvlglSGGVDSsv~a~L--------------a~~al~~~v~~v~~~~~~~~~~~~~~a~~la~~lgi~~~ 84 (249) T 3p52_A 26 SQGVVLGLSGGIDSALVATL--------------CKRALKENVFALLMPTQISNKANLEDALRLCADLNLEYK 84 (249) T ss_dssp CSEEEEECCSSHHHHHHHHH--------------HHHHHTTSEEEEECCSCCSSCHHHHHHHHHHHHHTCEEE T ss_pred CCEEEEECCCCHHHHHHHHH--------------HHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCCC T ss_conf 98299978898899999999--------------998448717998578411000689999999998177600 No 103 >3cuq_A Vacuolar-sorting protein SNF8; ESCRT, MBV, VPS, alternative splicing, coiled coil, cytoplasm, nucleus, protein transport; 2.61A {Homo sapiens} PDB: 2zme_A Probab=39.55 E-value=9.9 Score=14.40 Aligned_cols=41 Identities=15% Similarity=0.300 Sum_probs=19.5 Q ss_pred HHHHHHHHHHCCCCCCHHHHHHHCCCC-CCCHHHHHHHHHCC Q ss_conf 999999998379999978999734824-79989999888676 Q 537021.9.peg.7 849 TARHIMEASADKPFDSLEDFCSRVDSK-KLNRRVLESLVFAG 889 (1033) Q Consensus 849 ~~~~Ive~r~~g~f~sl~df~~rv~~~-~~nk~~le~Li~aG 889 (1033) --..|.+.-+...|.+..++..+.++. ..-+..|+.|++.| T Consensus 155 D~~~il~~~~~~g~vT~~~L~~~lgWs~~ra~~~Le~lv~~G 196 (234) T 3cuq_A 155 DHTVVLQLAEKNGYVTVSEIKASLKWETERARQVLEHLLKEG 196 (234) T ss_dssp HHHHHHHHHTTTSEECHHHHHHHHTCCHHHHHHHHHHHHHHT T ss_pred HHHHHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHHHHHHCC T ss_conf 899999999846991699999997969999999999999689 No 104 >2bcq_A DNA polymerase lambda; misalignment, extrahelical, mutagenesis, mutation, deletion, streisinger, slippage, transferase, lyase/DNA complex; HET: DNA; 1.65A {Homo sapiens} SCOP: a.60.6.1 a.60.12.1 d.218.1.2 PDB: 1xsl_A* 2bcr_A* 2bcs_A* 2bcu_A* 2bcv_A* 2gws_A* 3c5g_A* 3c5f_A* 2pfn_A* 1xsp_A* 1xsn_A* 2pfo_A* 2pfp_A* 2pfq_A* 3hw8_A* 3hwt_A* 1rzt_A* 3hx0_A* 3mdc_A* 3mda_A* ... Probab=39.43 E-value=12 Score=13.75 Aligned_cols=13 Identities=15% Similarity=0.383 Sum_probs=8.4 Q ss_pred HHHHHCCCCCHHH Q ss_conf 6554403756333 Q 537021.9.peg.7 387 AYALTITDIDPLR 399 (1033) Q Consensus 387 aY~LgIT~VDPi~ 399 (1033) -|-+||+.+|=++ T Consensus 112 l~~~Gi~si~dL~ 124 (335) T 2bcq_A 112 WYQQGFRSLEDIR 124 (335) T ss_dssp HHHTTCCSHHHHH T ss_pred HHHCCCCCHHHHH T ss_conf 9981925699999 No 105 >1o8b_A Ribose 5-phosphate isomerase; RPIA, PSI, protein structure initiative, MCSG, midwest center for structural genomics; HET: ABF; 1.25A {Escherichia coli} SCOP: c.124.1.4 d.58.40.1 PDB: 1lkz_A 1ks2_A* 3enq_A 3env_A* 3enw_A* Probab=39.08 E-value=12 Score=13.71 Aligned_cols=51 Identities=16% Similarity=0.305 Sum_probs=32.9 Q ss_pred HHHHHHCCCCCCCCCCCCCCEEEEECC-CEEEEEHHHHCCCCHHHH-HHHHHHH Q ss_conf 988886368503552334210035319-869850268589698999-9999998 Q 537021.9.peg.7 806 CQDARQFNIQIMPPSVNTPCVDFKVGD-NRIYYSLAAIKGVGTTTA-RHIMEAS 857 (1033) Q Consensus 806 i~e~~~~gi~vl~PdIN~S~~~f~~~~-~~Ir~Gl~~Ikgvg~~~~-~~Ive~r 857 (1033) ...|++.||++..++ ..+..++.+++ +.|--.|..|||=|.... |.|+... T Consensus 57 ~~~a~~~Gi~l~~~~-~~~~iDiaiDGaDevd~~l~liKGgGgal~rEKiva~~ 109 (219) T 1o8b_A 57 TEKLKSLGIHVFDLN-EVDSLGIYVDGADEINGHMQMIKGGGAALTREKIIASV 109 (219) T ss_dssp -----------CCGG-GCSCEEEEEECCSEECTTSCEECCCCC-HHHHHHHHHH T ss_pred HHHHHHHCCCCCCHH-HCCCCCEEEECCCHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 999987166656865-67876767517302102243564268999988899983 No 106 >2qf7_A Pyruvate carboxylase protein; multi-domain, multi-functional, biotin-dependent, ligase; HET: KCX COA SAP; 2.00A {Rhizobium etli cfn 42} Probab=39.07 E-value=12 Score=13.71 Aligned_cols=44 Identities=25% Similarity=0.237 Sum_probs=31.7 Q ss_pred HHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCC Q ss_conf 432998999999997899889981165177499999999976996 Q 537021.9.peg.7 24 EGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQ 68 (1033) Q Consensus 24 dg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIK 68 (1033) ++...++.+++.|++.|..||--- +|=|+=-.+|.++|.++||. T Consensus 77 ~~yl~~~~i~~~a~~~~~~ai~pg-ygflse~~~fa~~~~~~g~~ 120 (1165) T 2qf7_A 77 ESYLSIDEVIRVAKLSGADAIHPG-YGLLSESPEFVDACNKAGII 120 (1165) T ss_dssp HHHHCHHHHHHHHHHHTCSEEECC-SSTTTTCHHHHHHHHHTTCE T ss_pred CCCCCHHHHHHHHHHHCCCEEECC-CCHHHCCHHHHHHHHHCCCE T ss_conf 011599999999999691999888-34200599999999987998 No 107 >1i60_A IOLI protein; beta barrel, structural genomics, PSI, protein structure initiative, midwest center for structural genomics, MCSG; 1.60A {Bacillus subtilis} SCOP: c.1.15.4 PDB: 1i6n_A Probab=39.05 E-value=12 Score=13.71 Aligned_cols=49 Identities=16% Similarity=0.106 Sum_probs=37.3 Q ss_pred CCCHHHHHHHHHHCCCCEEEEECCCHHH------HHHHHHHHHHHCCCCEEEEEEE Q ss_conf 2998999999997899889981165177------4999999999769963899999 Q 537021.9.peg.7 26 ALSLNNILDKIAADQQPAIAITDTNNLF------SALEFSQKACAAGIQPIIGCQL 75 (1033) Q Consensus 26 ~~~~e~LV~~A~~~G~~AiAITD~~nl~------Gav~F~~ackk~gIKPIiG~E~ 75 (1033) ..++++-++.|++.|+.+|-|.+.+... ..-++.+.++++||+++ +... T Consensus 13 ~~~l~~~l~~aa~~G~~~iEl~~~~~~~~~~~~~~~~~l~~~~~~~gl~i~-~~~~ 67 (278) T 1i60_A 13 NSNLKLDLELCEKHGYDYIEIRTMDKLPEYLKDHSLDDLAEYFQTHHIKPL-ALNA 67 (278) T ss_dssp TCCHHHHHHHHHHTTCSEEEEETTTHHHHHTTSSCHHHHHHHHHTSSCEEE-EEEE T ss_pred CCCHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCHHHHHHHHHHCCCEEE-EEEC T ss_conf 999999999999959799996767666001486539999999998099399-9836 No 108 >1b22_A DNA repair protein RAD51; DNA binding, riken structural genomics/proteomics initiative, RSGI, structural genomics, DNA binding protein; HET: DNA; NMR {Homo sapiens} SCOP: a.60.4.1 Probab=38.68 E-value=12 Score=13.67 Aligned_cols=10 Identities=10% Similarity=0.581 Sum_probs=3.6 Q ss_pred CCCHHHHHHH Q ss_conf 9698999999 Q 537021.9.peg.7 844 GVGTTTARHI 853 (1033) Q Consensus 844 gvg~~~~~~I 853 (1033) |||...++.+ T Consensus 32 Gvg~~~i~KL 41 (114) T 1b22_A 32 GINANDVKKL 41 (114) T ss_dssp TCSHHHHHHH T ss_pred CCCHHHHHHH T ss_conf 9999999999 No 109 >3icg_A Endoglucanase D; cellulase, xylanase, carbohydrate binding DOM glucanase, carbohydrate metabolism, cellulose degradation, glycosidase; HET: BTB; 2.10A {Clostridium cellulovorans} Probab=38.61 E-value=12 Score=13.66 Aligned_cols=44 Identities=14% Similarity=0.162 Sum_probs=25.5 Q ss_pred HHHHHHHHHHCCCCEEEE-------ECC--------CHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 899999999789988998-------116--------51774999999999769963899 Q 537021.9.peg.7 29 LNNILDKIAADQQPAIAI-------TDT--------NNLFSALEFSQKACAAGIQPIIG 72 (1033) Q Consensus 29 ~e~LV~~A~~~G~~AiAI-------TD~--------~nl~Gav~F~~ackk~gIKPIiG 72 (1033) .++.++..|++|+.++=| .+. ..|.-.-+....|+++||+||+= T Consensus 47 tkedI~~ik~~GfN~vRipisW~~~~~~~~~~~in~~~l~~~d~vId~a~~~Gi~viLd 105 (515) T 3icg_A 47 THAMINKIKEAGFNTLRLPVTWDGHMGAAPEYTIDQTWMKRVEEIANYAFDNDMYVIIN 105 (515) T ss_dssp CHHHHHHHHHHTCCEEEECCCCTTSBCCTTTCCBCHHHHHHHHHHHHHHHTTTCEEEEE T ss_pred CHHHHHHHHHCCCCEEEEEEEHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 69999999987998899701289807888887449999999999999999789999996 No 110 >1kft_A UVRC, excinuclease ABC subunit C; helix-hairpin-helix, HHH domain, DNA-binding domain, DNA binding protein; NMR {Escherichia coli} SCOP: a.60.2.3 Probab=38.53 E-value=8.2 Score=15.10 Aligned_cols=27 Identities=26% Similarity=0.501 Sum_probs=20.5 Q ss_pred EHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHH Q ss_conf 02685896989999999998379999978999 Q 537021.9.peg.7 838 SLAAIKGVGTTTARHIMEASADKPFDSLEDFC 869 (1033) Q Consensus 838 Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~ 869 (1033) .|..|+|||+..+..|++. |.|+.++. T Consensus 25 ~L~~IpGIG~k~ak~Ll~~-----F~si~~i~ 51 (78) T 1kft_A 25 SLETIEGVGPKRRQMLLKY-----MGGLQGLR 51 (78) T ss_dssp GGGGCTTCSSSHHHHHHHH-----HSCHHHHH T ss_pred CCCCCCCCCHHHHHHHHHH-----HCCHHHHH T ss_conf 2656999329999999999-----29949998 No 111 >2bvf_A 6-hydroxy-D-nicotine oxidase; autoflavinylation, enantiomeric substrates, flavoenzymes, nicotine degradation; HET: FAD; 1.92A {Arthrobacter nicotinovorans} PDB: 2bvg_A* 2bvh_A* Probab=38.28 E-value=12 Score=13.62 Aligned_cols=20 Identities=25% Similarity=0.410 Sum_probs=12.4 Q ss_pred HHHHHHHHHCCCCCCCCCCCC Q ss_conf 899999851047643344442 Q 537021.9.peg.7 360 VADFIQWAKKNNIPVGPGRGS 380 (1033) Q Consensus 360 V~Div~~ak~~gI~vGpGRGS 380 (1033) |.++|++|+++|++|- -||. T Consensus 51 V~~~v~~A~~~~~~v~-~rgg 70 (459) T 2bvf_A 51 VAKSVRYACDNGLEIS-VRSG 70 (459) T ss_dssp HHHHHHHHHHHTCCEE-EESS T ss_pred HHHHHHHHHHCCCEEE-EECC T ss_conf 9999999998798299-9889 No 112 >1lk5_A D-ribose-5-phosphate isomerase; alpha/beta structure; 1.75A {Pyrococcus horikoshii} SCOP: c.124.1.4 d.58.40.1 PDB: 1lk7_A* Probab=37.83 E-value=12 Score=13.57 Aligned_cols=51 Identities=24% Similarity=0.272 Sum_probs=29.8 Q ss_pred HHHHHHHCCCCCCCCCCCCCCEEEEECC-CEEEEEHHHHCCCCHHHH-HHHHHH Q ss_conf 9988886368503552334210035319-869850268589698999-999999 Q 537021.9.peg.7 805 FCQDARQFNIQIMPPSVNTPCVDFKVGD-NRIYYSLAAIKGVGTTTA-RHIMEA 856 (1033) Q Consensus 805 ~i~e~~~~gi~vl~PdIN~S~~~f~~~~-~~Ir~Gl~~Ikgvg~~~~-~~Ive~ 856 (1033) --..|+++||++.+.+ .....++.+++ +.|-..|..|||-|.... |.|+.. T Consensus 60 t~~~a~~~gi~~~~~~-~~~~iDv~iDGaDevd~~l~liKGGGGal~rEKivA~ 112 (229) T 1lk5_A 60 AKLLAIEHDIPIASLD-QVDAIDVAVDGADEVDPNLNLIKGRGAALTMEKIIEY 112 (229) T ss_dssp HHHHHHHTTCCBCCGG-GCSCEEEEEECCSEECTTCCEECCTTSCHHHHHHHHH T ss_pred HHHHHHHCCCCCCCCC-CCCCEEEEECCHHHHCCCHHHHHHHHHHHHHHHHHHH T ss_conf 9999997799454412-2564117841624434005767620067888999998 No 113 >2f8m_A Ribose 5-phosphate isomerase; structural genomics, PSI, protein structure initiative; 2.09A {Plasmodium falciparum 3D7} Probab=37.82 E-value=12 Score=13.57 Aligned_cols=53 Identities=21% Similarity=0.335 Sum_probs=32.9 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCEEEEECC-CEEEEEHHHHCCCCHHHH-HHHHHHH Q ss_conf 99988886368503552334210035319-869850268589698999-9999998 Q 537021.9.peg.7 804 KFCQDARQFNIQIMPPSVNTPCVDFKVGD-NRIYYSLAAIKGVGTTTA-RHIMEAS 857 (1033) Q Consensus 804 ~~i~e~~~~gi~vl~PdIN~S~~~f~~~~-~~Ir~Gl~~Ikgvg~~~~-~~Ive~r 857 (1033) ..-.+|+++||++..++ .....++.+++ +.|-..|..|||=|.... |.|+... T Consensus 66 ~T~~~a~~~Gi~v~~l~-~~~~iDv~iDGADEvd~~l~lIKGGGgal~rEKivA~~ 120 (244) T 2f8m_A 66 DTELKARKLGIPLTTLE-KHSNIDITIDGTDEIDLNLNLIKGRGGALVREKLVASS 120 (244) T ss_dssp HHHHHHHHHTCCBCCCC-SSCCBSEEEECCSEECTTCCEECCTTSCHHHHHHHHHT T ss_pred HHHHHHHHCCCCCCCCH-HCCCCCEEEECCCCCCCCCCEEECCCCCHHHHHHHHHH T ss_conf 99999988599711702-35731168725630035824120786201678899960 No 114 >2g0w_A LMO2234 protein; putative sugar isomerase, structural genomics, joint center structural genomics, JCSG, protein structure initiative; HET: MSE PG4; 1.70A {Listeria monocytogenes} SCOP: c.1.15.4 Probab=37.57 E-value=13 Score=13.54 Aligned_cols=54 Identities=13% Similarity=0.134 Sum_probs=38.5 Q ss_pred HHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHH-------HHHHHHHHHHCCCCEEEEEEE Q ss_conf 447644329989999999978998899811651774-------999999999769963899999 Q 537021.9.peg.7 19 SYSLLEGALSLNNILDKIAADQQPAIAITDTNNLFS-------ALEFSQKACAAGIQPIIGCQL 75 (1033) Q Consensus 19 ~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~G-------av~F~~ackk~gIKPIiG~E~ 75 (1033) .||+. =-.+.++.++.|++.|+.+|-|... .+.. .-++.+++.+.||+. .++.. T Consensus 29 ~~tl~-~~~~lee~l~~aae~GfdgVEl~~~-~~~~~~~~~~~~~~l~~~l~~~GL~i-~~i~~ 89 (296) T 2g0w_A 29 SYTLG-TEVSFPKRVKVAAENGFDGIGLRAE-NYVDALAAGLTDEDMLRILDEHNMKV-TEVEY 89 (296) T ss_dssp GGGGT-TTSCHHHHHHHHHHTTCSEEEEEHH-HHHHHHHTTCCHHHHHHHHHHTTCEE-EEEEC T ss_pred EEECC-CCCCHHHHHHHHHHHCCCEEEECCC-CCCCCCCCCCCHHHHHHHHHHCCCEE-EEEEC T ss_conf 88878-9999999999999969799997453-23000446899999999999719979-99865 No 115 >1c6v_A Protein (SIV integrase); DNA integration, DNA binding protein; 3.00A {Simian immunodeficiency virus} SCOP: c.55.3.2 Probab=37.26 E-value=13 Score=13.50 Aligned_cols=28 Identities=25% Similarity=0.254 Sum_probs=13.1 Q ss_pred HHHHHHHHHHHHCCCEEEE-CCCCCCCHH Q ss_conf 9999999999818989980-883007878 Q 537021.9.peg.7 196 RESQVVQLAYTHELPLVAT-NNSLFLSEE 223 (1033) Q Consensus 196 ~~~~l~~lA~~~~iPlVaT-ndv~Y~~~~ 223 (1033) ....+..+...+++|.+.| |+..|...+ T Consensus 48 v~~~l~~~~~~~g~p~i~sDnG~~f~s~~ 76 (164) T 1c6v_A 48 TALFLLKLAGRWPITHLHTDNGANFASQE 76 (164) T ss_dssp HHHHHHHHHTTSCCCEEECCCCSSTTSSH T ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCCHH T ss_conf 99999999986498289994997500188 No 116 >3kp1_A D-ornithine aminomutase E component; 5 aminomutase (OAM), metal binding protein; HET: PLP B12 5AD; 2.01A {Clostridium sticklandii} PDB: 3kow_A* 3koy_A* 3koz_A* 3kp0_A* 3kox_A* Probab=37.09 E-value=13 Score=13.48 Aligned_cols=24 Identities=8% Similarity=0.124 Sum_probs=12.0 Q ss_pred CEEEEEECCHHHHHHHHHHHHHHH Q ss_conf 539999569689999999998997 Q 537021.9.peg.7 97 PSIVLLVSTAEGYQRLIELVSRMY 120 (1033) Q Consensus 97 ~~lvlLAkN~~Gy~nL~kL~S~a~ 120 (1033) +.+++..-+-.=|..+.+.-..++ T Consensus 76 ~~~~~~vatg~~~~d~~~~~~aa~ 99 (763) T 3kp1_A 76 PVITTEIASGRFEDDIRRMRMAAW 99 (763) T ss_dssp SEEEEECCCSCHHHHHHHHHHHHH T ss_pred CEEEEEEECCCHHHHHHHHHHHHH T ss_conf 769999833762878999999987 No 117 >2a1j_A DNA repair endonuclease XPF; XPF, xeroderma pigmentosum, DNA repair, endonuclease, helix-hairpin-helix, DNA binding protein; HET: DNA; 2.70A {Homo sapiens} SCOP: a.60.2.5 PDB: 2kn7_A* Probab=36.92 E-value=13 Score=13.46 Aligned_cols=36 Identities=22% Similarity=0.473 Sum_probs=24.8 Q ss_pred HHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCCCCCCCHHHHHHH Q ss_conf 26858969899999999983799999789997348247998999988 Q 537021.9.peg.7 839 LAAIKGVGTTTARHIMEASADKPFDSLEDFCSRVDSKKLNRRVLESL 885 (1033) Q Consensus 839 l~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~~~~~nk~~le~L 885 (1033) |..|+|||.+.+..|+. .|.|+.+++. .....|..+ T Consensus 6 L~~IPGIg~~~~~~Ll~-----~fgSi~~l~~------as~eeL~~v 41 (63) T 2a1j_A 6 LLKMPGVNAKNCRSLMH-----HVKNIAELAA------LSQDELTSI 41 (63) T ss_dssp HHTSTTCCHHHHHHHHH-----HCSSHHHHHT------CCHHHHHHH T ss_pred HHCCCCCCHHHHHHHHH-----HCCCHHHHHH------CCHHHHHHC T ss_conf 85299988999999999-----8679999987------999999878 No 118 >2zvr_A Uncharacterized protein TM_0416; hyperthermophIle, ketohexose 3- epimerase, D-tagatose 3-epimerase, isomerase; 2.20A {Thermotoga maritima} Probab=36.75 E-value=13 Score=13.44 Aligned_cols=15 Identities=33% Similarity=0.386 Sum_probs=6.5 Q ss_pred HHHHHHHHCCCEEEE Q ss_conf 999999818989980 Q 537021.9.peg.7 200 VVQLAYTHELPLVAT 214 (1033) Q Consensus 200 l~~lA~~~~iPlVaT 214 (1033) +.++.+++++++.+. T Consensus 73 lk~~l~~~gL~v~s~ 87 (290) T 2zvr_A 73 VKILSEELNLPICAI 87 (290) T ss_dssp HHHHHHHHTCCEEEE T ss_pred HHHHHHHCCCEEEEE T ss_conf 999999839908998 No 119 >1exn_A 5'-exonuclease, 5'-nuclease; hydrolase; 2.50A {Enterobacteria phage T5} SCOP: a.60.7.1 c.120.1.2 PDB: 1ut5_A 1ut8_A 1xo1_A Probab=36.68 E-value=13 Score=13.43 Aligned_cols=27 Identities=19% Similarity=0.527 Sum_probs=12.8 Q ss_pred HCCCCHHHHHHHHHHHHCCCCCCHHHHHHHCC Q ss_conf 58969899999999983799999789997348 Q 537021.9.peg.7 842 IKGVGTTTARHIMEASADKPFDSLEDFCSRVD 873 (1033) Q Consensus 842 Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv~ 873 (1033) |+|||++++..++++ |.|+++.+..++ T Consensus 208 V~GiG~KtA~kLl~~-----ygsle~i~~~~~ 234 (290) T 1exn_A 208 VEGIGAKRGYNIIRE-----FGNVLDIIDQLP 234 (290) T ss_dssp CTTCCHHHHHHHHHH-----HCSHHHHHHHCS T ss_pred CCCCCHHHHHHHHHH-----HCCHHHHHHHHH T ss_conf 998589999999987-----389999998777 No 120 >2zds_A Putative DNA-binding protein; TIM-barrel fold, structural genomics, NPPSFA; 2.30A {Streptomyces coelicolor} Probab=36.22 E-value=13 Score=13.38 Aligned_cols=53 Identities=9% Similarity=-0.049 Sum_probs=38.3 Q ss_pred HHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHH----------HHHHHHHHHHHCCCCEEE Q ss_conf 44764432998999999997899889981165177----------499999999976996389 Q 537021.9.peg.7 19 SYSLLEGALSLNNILDKIAADQQPAIAITDTNNLF----------SALEFSQKACAAGIQPII 71 (1033) Q Consensus 19 ~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~----------Gav~F~~ackk~gIKPIi 71 (1033) -||-..+-.++++.++.|+++|+.+|-|+-.+.-+ .+-++.+.++++|++... T Consensus 7 ~~~~~~~dlple~~l~~aa~~G~~gIEl~~~~~~~~~~~~~~~~~~~~~~~~~l~~~gl~i~~ 69 (340) T 2zds_A 7 LFTGQWADLPLEEVCRLARDFGYDGLELACWGDHFEVDKALADPSYVDSRHQLLDKYGLKCWA 69 (340) T ss_dssp EESGGGTTSCHHHHHHHHHHHTCSEEEEESSTTTCCHHHHHHCTTHHHHHHHHHHHTTCEEEE T ss_pred ECCCCCCCCCHHHHHHHHHHHCCCEEEEECCCCCCCCCHHCCCHHHHHHHHHHHHHCCCEEEE T ss_conf 121567899899999999982999899835777766000007978999999999982996899 No 121 >3ngf_A AP endonuclease, family 2; structural genomics, seattle structural genomics center for infectious disease; 1.80A {Brucella melitensis biovar abortus 230ORGANISM_TAXID} Probab=36.06 E-value=13 Score=13.36 Aligned_cols=50 Identities=8% Similarity=0.037 Sum_probs=36.3 Q ss_pred HHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEE Q ss_conf 476443299899999999789988998116517749999999997699638 Q 537021.9.peg.7 20 YSLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPI 70 (1033) Q Consensus 20 ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPI 70 (1033) +|++..-.++++.++.|++.|+.+|-|-.... .-.-+..+.+++.|++.. T Consensus 16 ~~~~~~~~pl~e~l~~aae~GfdgVEl~~~~~-~~~~~l~~~l~~~gl~~~ 65 (269) T 3ngf_A 16 LSTMFNEVPFLERFRLAAEAGFGGVEFLFPYD-FDADVIARELKQHNLTQV 65 (269) T ss_dssp TTTSCTTSCHHHHHHHHHHTTCSEEECSCCTT-SCHHHHHHHHHHTTCEEE T ss_pred CCCCCCCCCHHHHHHHHHHHCCCEEEECCCCC-CCHHHHHHHHHHCCCCEE T ss_conf 14334789999999999996999999898887-899999999998399669 No 122 >2uyg_A 3-dehydroquinate dehydratase; typeii 3-dehydroquinase, lyase; 2.2A {Thermus thermophilus} Probab=35.95 E-value=8 Score=15.18 Aligned_cols=18 Identities=17% Similarity=-0.025 Sum_probs=10.4 Q ss_pred HHHHHHHHHHHHHCCCEE Q ss_conf 999999999998189899 Q 537021.9.peg.7 195 YRESQVVQLAYTHELPLV 212 (1033) Q Consensus 195 ~~~~~l~~lA~~~~iPlV 212 (1033) .+++.+.+.+++.++.+. T Consensus 28 ~i~~~~~~~a~~~g~~l~ 45 (149) T 2uyg_A 28 ELEALCEAWGAELGLGVV 45 (149) T ss_dssp HHHHHHHHHHHHTTCCEE T ss_pred HHHHHHHHHHHHCCCCEE T ss_conf 999999999997599478 No 123 >3i9v_2 NADH-quinone oxidoreductase subunit 2; electron transport, respiratory chain, 4Fe- 4S, cell membrane, flavoprotein, FMN, iron; HET: FMN; 3.10A {Thermus thermophilus HB8} PDB: 2fug_2* 3iam_2* 3ias_2* 3m9s_2* Probab=35.93 E-value=13 Score=13.34 Aligned_cols=61 Identities=16% Similarity=0.267 Sum_probs=40.7 Q ss_pred CCCHHHHHHHHHHHHHHCCCC--CEEECCCHHHHH------HHHHHHHHHHHCCCCHHHHHHHHHHCCCC Q ss_conf 365687899999998303802--155422077877------88999999975279988988654310145 Q 537021.9.peg.7 421 DFCQDRRDEVIRYVQNKYGHE--RVAQIITFGSLQ------AKAALRDVGRALQMPYSQVDRLCKLIPND 482 (1033) Q Consensus 421 Df~~~~R~~vi~y~~~kyG~~--~va~i~t~~t~~------~k~aird~~r~~g~~~~~~~~~~~~i~~~ 482 (1033) -|+++.+ +.++-+..||..+ +-|.|.-..-.+ ...|++.+|+.+|+|..++..++..-+.. T Consensus 2 ~~~~~~~-~~i~~ii~~Yp~~~~r~all~~L~~iQ~~~GyIp~~al~~iA~~l~v~~a~V~~vatFY~~f 70 (181) T 3i9v_2 2 GFFDDKQ-DFLEETFAKYPPEGRRAAIMPLLRRVQQEEGWIRPERIEEIARLVGTTPTEVMGVASFYSYY 70 (181) T ss_dssp -CTTTCH-HHHHHHHTTSCTTCCGGGHHHHHHHHHHHHSSCCHHHHHHHHHHHTSCHHHHHHHHTTCSSC T ss_pred CCCHHHH-HHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCHHHHHHHHHHCCCC T ss_conf 9888899-99999998788864776999999999998299099999999999892999999999762502 No 124 >1c0m_A Protein (integrase); HIV, X-RAY crystallography, protein structure, transferase; 2.53A {Rous sarcoma virus} SCOP: b.34.7.1 c.55.3.2 PDB: 1c1a_A Probab=35.92 E-value=13 Score=13.34 Aligned_cols=15 Identities=0% Similarity=-0.090 Sum_probs=6.4 Q ss_pred HHHHHHHHHCCCEEE Q ss_conf 999999981898998 Q 537021.9.peg.7 199 QVVQLAYTHELPLVA 213 (1033) Q Consensus 199 ~l~~lA~~~~iPlVa 213 (1033) .+.++...+||.... T Consensus 82 ~~~~~~~~~gI~~~~ 96 (238) T 1c0m_A 82 STREWLARWGIAHTT 96 (238) T ss_dssp HHHHHHHHHTCEEEE T ss_pred HHHHHHHHCEEEEEE T ss_conf 787765540068887 No 125 >2jg6_A DNA-3-methyladenine glycosidase; 3-methyladenine-DNA-glycosylase-I, hydrolase; 1.70A {Staphylococcus aureus} Probab=35.64 E-value=13 Score=13.31 Aligned_cols=131 Identities=12% Similarity=0.161 Sum_probs=71.1 Q ss_pred CCCCCCHHHHHHHCCCCCCCCHHHHHHHHH----HHHHCCCHHHH----HHHHHHHCCCCCCHHHHHHHHHHHHHHHC-C Q ss_conf 850269667334103355300125689887----88632684130----78897620121101677878999998624-9 Q 537021.9.peg.7 670 KIVSIHPLIDPMLKETQGVIIYQEQVMQIA----QLLSGYSLSEA----DVLRRAMGKKIKEEMDKQKERFISGASKN-G 740 (1033) Q Consensus 670 ~~~~~hp~l~~iL~~T~Gv~vyQEQvm~ia----~~~ag~~~~ea----d~lRra~~kk~~~~~~~~~~~f~~g~~~~-g 740 (1033) ++..-+|.+...=.+-.|++++-+|.+-=. .--||+|+.-. +.+|+|+.-=+++.+..+-+.=++..+.+ | T Consensus 5 ~W~~~~~~~~~YHD~eWG~p~~dD~~LFE~L~Le~fQaGLsW~~Il~Kr~~fr~aF~~Fd~~~vA~~~e~die~Ll~d~~ 84 (186) T 2jg6_A 5 AFGTKDPVYLNYHDHVWGQPLYDSKALFKLLALESQHAGLSWLTILKKKEAYEEAFYDFEPEKVAQMTAQDIDRLMTFPN 84 (186) T ss_dssp TTTCCCHHHHHHHHHTTTSCCCCHHHHHHHHHHHHTCTTSCHHHHHHHHHHHHHHTGGGCHHHHTTCCHHHHHHHTTCTT T ss_pred CCCCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHCCCHHHHHHHHCCCC T ss_conf 89999838998885268986888799999999999853178999999999999999769999997789999998852720 Q ss_pred CC--HHH-------HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC------------CCCCC Q ss_conf 99--999-------9999999997500013310147888999999999973026667775100------------23542 Q 537021.9.peg.7 741 IS--KTI-------AVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTL------------EMDNV 799 (1033) Q Consensus 741 ~~--~~~-------a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~------------~~~~~ 799 (1033) += +.. |..+-++-+.|+ +| ..|..+..+. .+... T Consensus 85 IIRnr~KI~Avi~NA~~~~~i~~e~g--SF-----------------------~~ylw~fv~~~pi~~~~~~~~~~p~~t 139 (186) T 2jg6_A 85 IVHHRKKLEAIVNQAQGYLKIEQAYG--SF-----------------------SKFLWSYVNGKPKDLQYEHASDRITVD 139 (186) T ss_dssp SCCCHHHHHHHHHHHHHHHHHHHHHS--CH-----------------------HHHHHGGGTTSCEECCCCSGGGCCSCC T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCC--CH-----------------------HHHHHHHCCCCCCCCCCCCCCCCCCCC T ss_conf 56619999999999999999888309--99-----------------------999998547888655656603567899 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 02589998888636850355233421 Q 537021.9.peg.7 800 EKIKKFCQDARQFNIQIMPPSVNTPC 825 (1033) Q Consensus 800 ~~~~~~i~e~~~~gi~vl~PdIN~S~ 825 (1033) ..-..+.+++|++|.+..+|-+=.|. T Consensus 140 ~~s~~isk~LKk~GFkfvGpt~~ysf 165 (186) T 2jg6_A 140 DTATQLSKDLKQYGFKFLGPVTVFSF 165 (186) T ss_dssp HHHHHHHHHHHTTTCCSCCHHHHHHH T ss_pred HHHHHHHHHHHHCCCEECCCHHHHHH T ss_conf 89999999998678744385799999 No 126 >1wza_A Alpha-amylase A; hydrolase, halophilic, thermophilic; 1.60A {Halothermothrix orenii} SCOP: b.71.1.1 c.1.8.1 Probab=35.14 E-value=14 Score=13.25 Aligned_cols=43 Identities=28% Similarity=0.321 Sum_probs=24.0 Q ss_pred CCCCCCCHHHHHHCHH-HHHHHCCCHHHHHHHH-----------HHCCCCEEEEE Q ss_conf 6689881334331444-7644329989999999-----------97899889981 Q 537021.9.peg.7 5 LEKSPSFVHLQVHSSY-SLLEGALSLNNILDKI-----------AADQQPAIAIT 47 (1033) Q Consensus 5 ~~~~~~FvhLh~hS~y-Slldg~~~~e~LV~~A-----------~~~G~~AiAIT 47 (1033) |+|+..+=+||+.+.- |-.+|......++++- +++|+.+|=|+ T Consensus 1 ~~~~~~~Ye~~~~~f~~~~~~~~G~~~g~~~~ldyl~~~~~~~i~~LGv~~I~L~ 55 (488) T 1wza_A 1 FEKHGTYYEIFVRSFYDSDGDGIGDLKGIIEKLDYLNDGDPETIADLGVNGIWLM 55 (488) T ss_dssp CCCCCCEEEECGGGSCCSSSSSCCCHHHHHHTHHHHCCSCTTCCSSCCCSEEEEC T ss_pred CCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCHHHHCCCCEEEEC T ss_conf 9877179999674104899998649899997464431013336760785679979 No 127 >1ulz_A Pyruvate carboxylase N-terminal domain; biotin carboxylase, ligase; 2.20A {Aquifex aeolicus VF5} SCOP: b.84.2.1 c.30.1.1 d.142.1.2 Probab=35.01 E-value=14 Score=13.23 Aligned_cols=68 Identities=6% Similarity=0.130 Sum_probs=33.4 Q ss_pred CEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEE-ECCHHHHHHHHHHHHHHH Q ss_conf 8899811651774999999999769963899999985476643344565666677539999-569689999999998997 Q 537021.9.peg.7 42 PAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLL-VSTAEGYQRLIELVSRMY 120 (1033) Q Consensus 42 ~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlL-AkN~~Gy~nL~kL~S~a~ 120 (1033) +-|-|...|-. +++..++|++-||+.|.= |-..+.... .... .-..+++ ....+-|-|.-+++..+. T Consensus 3 ~kiLianrGei--A~riira~~elGi~tVaV---~s~~D~~a~----~~~~---ADe~~~ig~~~~~sYl~~~~ii~ia~ 70 (451) T 1ulz_A 3 NKVLVANRGEI--AVRIIRACKELGIPTVAI---YNEVESTAR----HVKL---ADEAYMIGTDPLDTYLNKQRIINLAL 70 (451) T ss_dssp SSEEECCCHHH--HHHHHHHHHHHTCCEEEE---ECGGGTTCH----HHHH---SSEEEECCSSTTHHHHCHHHHHHHHH T ss_pred CEEEEECCCHH--HHHHHHHHHHCCCEEEEE---CCCHHHCCC----CHHH---CCEEEECCCCHHHHHHHHHHHHHHHH T ss_conf 33999897199--999999999859969998---684564588----9778---88989879977887664899999999 Q ss_pred H Q ss_conf 3 Q 537021.9.peg.7 121 L 121 (1033) Q Consensus 121 ~ 121 (1033) . T Consensus 71 ~ 71 (451) T 1ulz_A 71 E 71 (451) T ss_dssp H T ss_pred H T ss_conf 8 No 128 >1tvk_A Tubulin alpha chain; epothilone, taxol, ligand interactions, cell cycle, structural protein; HET: GDP GTP EP; 2.89A {Bos taurus} SCOP: c.32.1.1 d.79.2.1 PDB: 1tub_A* 1jff_A* 2wbe_A* 3dco_A* 3edl_A* 1ffx_A* 1ia0_A* 2hxf_A* 2hxh_A* 2p4n_A* 1sa0_A* 1sa1_A* 3edl_F* 3hke_A* 3hkc_A* 3hkd_A* 3hkb_A* 3n2g_A* 3n2k_A* 1z2b_A* ... Probab=34.93 E-value=9.3 Score=14.63 Aligned_cols=46 Identities=20% Similarity=0.148 Sum_probs=32.4 Q ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCC Q ss_conf 389999988778989860630557888999998510476433444420466776554403 Q 537021.9.peg.7 334 AQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTIT 393 (1033) Q Consensus 334 ~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT 393 (1033) .+++.+|+.+|++- -+--+.|++.+ . +|=|=||+-||+++.+|.-. T Consensus 115 ~d~i~e~iRkevE~---cD~l~gf~~~h---------S--l~GGTGSGlGs~i~e~L~d~ 160 (440) T 1tvk_A 115 IDLVLDRIRKLADQ---CTGLQGFSVFH---------S--FGGGTGSGFTSLLMERLSVD 160 (440) T ss_dssp HHHHHHHHHHHHHS---SSSEEEEEEEC---------C--TTSSTTTHHHHHHHHHHHTS T ss_pred HHHHHHHHHHHHHH---CCCCCCEEEEE---------C--CCCCCCCCHHHHHHHHHHHH T ss_conf 89999999999995---88867667884---------2--57877662899999998876 No 129 >2h9d_A Salicylate biosynthesis protein PCHB; intertwined dimer, lyase; 1.95A {Pseudomonas aeruginosa PAO1} SCOP: a.130.1.1 PDB: 3hgw_C 3hgx_A* 2h9c_A Probab=34.80 E-value=14 Score=13.21 Aligned_cols=56 Identities=18% Similarity=0.253 Sum_probs=36.6 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 689887886326841307889762012110167787899999862499999999999999975 Q 537021.9.peg.7 694 QVMQIAQLLSGYSLSEADVLRRAMGKKIKEEMDKQKERFISGASKNGISKTIAVNIFELLAKF 756 (1033) Q Consensus 694 Qvm~ia~~~ag~~~~ead~lRra~~kk~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f 756 (1033) +=|.++..+|-+-.... .+ -+++-.+++-+....-|.+.|++++.++.||..|-.. T Consensus 30 ~R~~l~~~I~~~K~~~~-~v------~d~~Re~~vl~~~~~~a~~~~l~~~~i~~i~~~ii~~ 85 (101) T 2h9d_A 30 RRMDYVKAASRFKASEA-AI------PAPERVAAMLPERARWAEENGLDAPFVEGLFAQIIHW 85 (101) T ss_dssp HHHHHHHHTTTC-----------------CHHHHHHHHHHHHHHHTTSCHHHHHHHHHHHHHH T ss_pred HHHHHHHHHHHHHHHCC-CC------CCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 99999999999876678-98------8869999999999998775796999999999999999 No 130 >1qya_A ORFB, hypothetical protein YDDE; putative phenazine biosynthesis protein, epimerase, antibiotic biosynthesis protein, structural genomics; 2.00A {Escherichia coli} SCOP: d.21.1.2 d.21.1.2 PDB: 1sdj_A 1qy9_A Probab=34.32 E-value=8.5 Score=14.95 Aligned_cols=29 Identities=17% Similarity=0.268 Sum_probs=22.5 Q ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHHCCC Q ss_conf 85104764334444204667765544037 Q 537021.9.peg.7 366 WAKKNNIPVGPGRGSGAGSVTAYALTITD 394 (1033) Q Consensus 366 ~ak~~gI~vGpGRGSAagSLVaY~LgIT~ 394 (1033) |+...||.=-|+=|||+++|.+|+..--. T Consensus 223 f~p~~Gi~EDpaTGSa~~aLa~yl~~~~~ 251 (307) T 1qya_A 223 FSPAIGIVEDPVTGNANGPMGAWLVHHNV 251 (307) T ss_dssp EEGGGTEEEESSCHHHHHHHHHHHHHTTC T ss_pred ECCCCCCCCCCCCHHHHHHHHHHHHHCCC T ss_conf 42668973646246879999999997187 No 131 >3hbl_A Pyruvate carboxylase; TIM barrel, ligase; HET: BTI ADP; 2.71A {Staphylococcus aureus subsp} PDB: 3bg5_A* 3ho8_A* 3hb9_A* Probab=34.21 E-value=14 Score=13.14 Aligned_cols=46 Identities=22% Similarity=0.255 Sum_probs=32.7 Q ss_pred HHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCE Q ss_conf 44329989999999978998899811651774999999999769963 Q 537021.9.peg.7 23 LEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQP 69 (1033) Q Consensus 23 ldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKP 69 (1033) .++...++.+++.|++.|..||--+ .|-|+=-.+|.++|.++||+. T Consensus 60 ~~~yl~~~~i~~~a~~~~~~ai~pg-ygflsen~~~a~~~~~~gi~~ 105 (1150) T 3hbl_A 60 AESYLNIERIIDVAKQANVDAIHPG-YGFLSENEQFARRCAEEGIKF 105 (1150) T ss_dssp TGGGTCHHHHHHHHHHTTCSEEECT-TTTSTTCHHHHHHHHHTTCEE T ss_pred HHCCCCHHHHHHHHHHHCCCEEECC-CCCCCCCHHHHHHHHHCCCEE T ss_conf 3211599999999999792999658-440324999999999879979 No 132 >2g8y_A Malate/L-lactate dehydrogenases; NAD, E.coli, structural genomics, PSI, protein structure initiative, midwest center for structural genomics; HET: NAD 1PE; 2.15A {Escherichia coli K12} Probab=34.19 E-value=14 Score=13.16 Aligned_cols=34 Identities=24% Similarity=0.219 Sum_probs=21.9 Q ss_pred HHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHH Q ss_conf 9999999998189899808830078788999987 Q 537021.9.peg.7 197 ESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDV 230 (1033) Q Consensus 197 ~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~i 230 (1033) .+..++.|+++|+-+|+..+++..-.--+.+..+ T Consensus 120 ~~~aiekAk~~Gi~~v~vrns~H~G~~g~y~~~a 153 (385) T 2g8y_A 120 MALGIEKAHQHGIAAVALHNSHHIGRIGYWAEQC 153 (385) T ss_dssp HHHHHHHHHHHSEEEEEEEEEECCCCHHHHHHHH T ss_pred HHHHHHHHHHHCCCEEEECCCCCCCCCHHHHHHH T ss_conf 9999999997298379756878755427999999 No 133 >3edn_A Phenazine biosynthesis protein, PHZF family; diaminopimelate epimerase-like fold, alpha and beta protein class, structural genomics; HET: MSE; 1.50A {Bacillus anthracis} Probab=34.05 E-value=12 Score=13.69 Aligned_cols=30 Identities=23% Similarity=0.307 Sum_probs=22.7 Q ss_pred HHCCCCCCCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 510476433444420466776554403756 Q 537021.9.peg.7 367 AKKNNIPVGPGRGSGAGSVTAYALTITDID 396 (1033) Q Consensus 367 ak~~gI~vGpGRGSAagSLVaY~LgIT~VD 396 (1033) ....|+.-.|+=|||+++|.+|+......+ T Consensus 218 ~~~~G~~Ed~atGsa~~~la~~l~~~~~~~ 247 (299) T 3edn_A 218 SAYAGTIEDPVTGTASGVMGAYYATYVEKD 247 (299) T ss_dssp CTTSSCSEESSCHHHHHHHHHHHHHHTCCC T ss_pred ECCCCCCCCCCCHHHHHHHHHHHHHHHCCC T ss_conf 013775367475488999999999870767 No 134 >1v8d_A Hypothetical protein (TT1679); X-RAY craytallography, structural genomics, riken structural genomics/proteomics initiative, RSGI; 2.16A {Thermus thermophilus} SCOP: c.140.1.1 Probab=34.05 E-value=11 Score=14.14 Aligned_cols=73 Identities=16% Similarity=0.044 Sum_probs=49.6 Q ss_pred CCCEEEECCCCCCHHHHHH--HCCCHHHHHHHHHHHHHHCC-CCEEEEEECCCHHHHHHHHHH---HHHHHHHCCCEEEE Q ss_conf 8988998197679678888--77997899999999987659-868999943991158999999---99999818989980 Q 537021.9.peg.7 141 TEGLIMLTGGSSGPIDRAF--FFNSSQVAEKRLLTFKKLFG-DRLYVNLQRHRGYDRYRESQV---VQLAYTHELPLVAT 214 (1033) Q Consensus 141 ~egLIvlsg~~~g~i~~~l--~~~~~~~a~~~l~~l~~~Fg-d~~ylEl~~~~~~e~~~~~~l---~~lA~~~~iPlVaT 214 (1033) ..|-|+..||..|++.-.. ..+..+.++..+..+.++.. ..+|+-+|.- +.+|..| .+.|+++++..|.. T Consensus 61 ~~G~i~VvGcSTSEV~G~~IGt~ss~eva~~I~~~l~~~~~~~gi~LA~QcC----EHLNRALvVEr~~a~~~~lE~V~V 136 (235) T 1v8d_A 61 APGSLFVLGGSTSEVLGERVGTRPSLEAAHAVLEGLLPPLLERGVHVAVQAC----EHLNRALVVERETARAFGKEEVAV 136 (235) T ss_dssp CTTCEEEEEECHHHHHC----CCCCHHHHHHHHHHHHHHHHTTTCEEEEECC----GGGTTCEEEEHHHHHHTTCCEEEC T ss_pred CCCCEEEEECCHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEECH----HHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 9998899960367636763677682999999999999998652957998455----665678886297697749717998 Q ss_pred CCC Q ss_conf 883 Q 537021.9.peg.7 215 NNS 217 (1033) Q Consensus 215 ndv 217 (1033) -.+ T Consensus 137 vP~ 139 (235) T 1v8d_A 137 FPH 139 (235) T ss_dssp BCB T ss_pred ECC T ss_conf 647 No 135 >1ixr_A Holliday junction DNA helicase RUVA; heterooligomeric complex, octameric RUVA, AAA-ATPase domain, complex with nucleotide, hydrolase; HET: ANP; 3.30A {Thermus thermophilus} SCOP: a.60.2.1 b.40.4.2 Probab=33.65 E-value=14 Score=13.07 Aligned_cols=19 Identities=37% Similarity=0.511 Sum_probs=14.0 Q ss_pred HHHHCCCCHHHHHHHH-HHH Q ss_conf 2685896989999999-998 Q 537021.9.peg.7 839 LAAIKGVGTTTARHIM-EAS 857 (1033) Q Consensus 839 l~~Ikgvg~~~~~~Iv-e~r 857 (1033) |+.|+|||.++|+.|+ |-+ T Consensus 109 L~~vpGIGkKtAerIilELk 128 (191) T 1ixr_A 109 LTSASGVGRRLAERIALELK 128 (191) T ss_dssp HTTSTTCCHHHHHHHHHHHT T ss_pred HHCCCCCCHHHHHHHHHHHH T ss_conf 50188846889999999998 No 136 >2x6h_A GH13170P, VPS34, phosphotidylinositol 3 kinase 59F; phosphoinositide 3-kinase, PI3K, class III PI3K, autophagy, transferase; 2.90A {Drosophila melanogaster} PDB: 2x6f_A 2x6i_A* 2x6j_A* 2x6k_A* Probab=33.58 E-value=13 Score=13.49 Aligned_cols=45 Identities=33% Similarity=0.523 Sum_probs=26.5 Q ss_pred CHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCC Q ss_conf 32135654178999999998874025434332344358899876201410122114 Q 537021.9.peg.7 569 IKFDFLGLKTLTLLQKSLDFLAQRGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLE 624 (1033) Q Consensus 569 ~K~D~Lgl~~L~~i~~~~~~i~~~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~e 624 (1033) +.-|-|-+-.+++++..++ +.|. |+.- .+|..+..|...|+-|+- T Consensus 450 lRqD~l~~q~~~~m~~~~~---~~~~--~l~~------~~Y~v~~~~~~~G~ie~v 494 (696) T 2x6h_A 450 LRQDQLILQMITLMDKLLR---RENL--DLKL------TPYKVLATSSKHGFLQYV 494 (696) T ss_dssp THHHHHHHHHHHHHHHHHH---HTTC--CCCC------CCCCEEESSTTCEEEECC T ss_pred CCHHHHHHHHHHHHHHHHH---HCCC--CCCC------CCCEEEECCCCCEEEEEC T ss_conf 1078999999999999998---6587--7645------563377548984169971 No 137 >3bch_A 40S ribosomal protein SA; laminin receptor, P40 ribosomal protein, acetylation, cytoplasm, phosphorylation, polymorphism; 2.15A {Homo sapiens} Probab=33.45 E-value=14 Score=13.05 Aligned_cols=13 Identities=15% Similarity=0.163 Sum_probs=5.4 Q ss_pred HHHHHHHCCCCCC Q ss_conf 9988886368503 Q 537021.9.peg.7 805 FCQDARQFNIQIM 817 (1033) Q Consensus 805 ~i~e~~~~gi~vl 817 (1033) .+.||+++||++. T Consensus 166 AI~EA~kl~IPvI 178 (253) T 3bch_A 166 PLTEASYVNLPTI 178 (253) T ss_dssp HHHHHHHTTCCEE T ss_pred HHHHHHHCCCCEE T ss_conf 7677876499779 No 138 >2ihm_A POL MU, DNA polymerase MU; helix-turn-helix, transferase/DNA complex; HET: DNA D3T; 2.40A {Mus musculus} Probab=33.24 E-value=14 Score=13.02 Aligned_cols=16 Identities=31% Similarity=0.459 Sum_probs=9.9 Q ss_pred HHHHHHHHHHCCCCCHHH Q ss_conf 466776554403756333 Q 537021.9.peg.7 382 AGSVTAYALTITDIDPLR 399 (1033) Q Consensus 382 agSLVaY~LgIT~VDPi~ 399 (1033) |-.| |-.||+.++=|+ T Consensus 115 a~~l--~~~Gi~si~dL~ 130 (360) T 2ihm_A 115 ANRW--YQEGLRTLDELR 130 (360) T ss_dssp HHHH--HHTTCCSHHHHH T ss_pred HHHH--HHCCCCCHHHHH T ss_conf 9999--983988799998 No 139 >2vcc_A Alpha-N-acetylglucosaminidase; GH89, naGlu, sanfilippo disease, hydrolase, mucopolysaccharidosis, family 89 glycoside hydrolase; 2.0A {Clostridium perfringens} PDB: 2vc9_A* 2vcb_A* 2vca_A Probab=33.13 E-value=14 Score=13.01 Aligned_cols=102 Identities=13% Similarity=0.154 Sum_probs=45.5 Q ss_pred HHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH---CCCCCCCCCCHHHHHCCHHHHHH Q ss_conf 878999999983038021554220778778899999997527998898865431---01458982443445229887886 Q 537021.9.peg.7 425 DRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKL---IPNDPAHPVSLQAIADDSRFREA 501 (1033) Q Consensus 425 ~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~---i~~~~~~~~~~~~~~~~~~~~~~ 501 (1033) +|=++-|+++- --|-+.+- .+ .+--+.-++|=+-||++.++++..=-. .||..=.. +..+ T Consensus 288 ~rWEreIDWMA-L~GINlpL---A~--~GqEaVW~~v~~~~Glt~~Ei~~ff~GPAflaW~rMgN-----------l~gw 350 (891) T 2vcc_A 288 DQYEEFLDWCA-MNGVNLVL---DI--IGQEEVLRRTLNEFGYSDEEVKEFISGPAYFAWFYMQN-----------MTGF 350 (891) T ss_dssp HHHHHHHHHHH-HTTCCEEE---CC--TTHHHHHHHHHGGGTCCHHHHHHHSCCTTCHHHHHTTS-----------CCST T ss_pred HHHHHHHHHHH-HHCCCCCC---CC--CCHHHHHHHHHHHCCCCHHHHHHHCCCCCHHHHHHHCC-----------CCCC T ss_conf 99999999997-70787114---02--02499999999983999899998657941167786177-----------4547 Q ss_pred H--HCCCCCHHHHHHHHHHHHHHHCCCCCC-----CEEEECCCCHHHHEEC Q ss_conf 4--307420268898720112210024474-----4033525666662001 Q 537021.9.peg.7 502 R--LADPSVDRLLEISQKLEGLYRHASTHA-----AGIVIGDRPLSQLVPM 545 (1033) Q Consensus 502 ~--~~~~~~~~~~~~a~~leg~~r~~~~Ha-----~Gvvi~~~~l~~~~p~ 545 (1033) . ....-...=.++.++|....|.+|++| +|.| |..+.+.-|- T Consensus 351 gGPLp~~W~~~q~~LQkkIl~RmReLGM~PVLPaFaG~V--P~~~~~~~P~ 399 (891) T 2vcc_A 351 GGPLPNDWFEQRAELGRKMHDRMQSFGINPVLQGYSGMV--PRDFKEKNQE 399 (891) T ss_dssp TCCCCTTHHHHHHHHHHHHHHHHHHHTCEEEEECCCCEE--CTTHHHHSTT T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC--CHHHHHHCCC T ss_conf 899999999999999999999999849964165446878--3778830999 No 140 >2ww4_A 4-diphosphocytidyl-2C-methyl-D-erythritol kinase; non-mevalonate pathway, isoprenoid biosynthesis, transferase; HET: ADP; 2.00A {Escherichia coli} PDB: 1oj4_A* Probab=32.78 E-value=15 Score=12.96 Aligned_cols=11 Identities=9% Similarity=0.111 Sum_probs=4.0 Q ss_pred HHHHHHHHHHC Q ss_conf 99999998303 Q 537021.9.peg.7 428 DEVIRYVQNKY 438 (1033) Q Consensus 428 ~~vi~y~~~ky 438 (1033) +++.+.+.+.| T Consensus 255 ~~~~~~~~~~~ 265 (283) T 2ww4_A 255 RQVLEQAPEWL 265 (283) T ss_dssp HHHHHHCCC-C T ss_pred HHHHHHCCCCC T ss_conf 99998640407 No 141 >1ecm_A Endo-oxabicyclic transition state analogue; P-protein, chorismate mutase domain; HET: TSA; 2.20A {Escherichia coli} SCOP: a.130.1.1 Probab=32.56 E-value=15 Score=12.94 Aligned_cols=67 Identities=13% Similarity=0.181 Sum_probs=41.2 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHCC--CCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHH Q ss_conf 68988788632684130788976201--2110167787899999862499999999999999975000133101478889 Q 537021.9.peg.7 694 QVMQIAQLLSGYSLSEADVLRRAMGK--KIKEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAV 771 (1033) Q Consensus 694 Qvm~ia~~~ag~~~~ead~lRra~~k--k~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~ 771 (1033) |=|.++..+|. +.+.-|. .+++-.+++.+....-+...|++++.++.||..|-..+ T Consensus 27 ~R~~l~~~I~~--------~K~~~~~~i~dp~RE~~vl~~~~~~a~~~gl~~~~i~~if~~ii~~s-------------- 84 (109) T 1ecm_A 27 ERRELAVEVGK--------AKLLSHRPVRDIDRERDLLERLITLGKAHHLDAHYITRLFQLIIEDS-------------- 84 (109) T ss_dssp HHHHHHHHHHH--------HHHHTTCCSCCHHHHHHHHHHHHHHHHHHTCCHHHHHHHHHHHHHHH-------------- T ss_pred HHHHHHHHHHH--------HHHHCCCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH-------------- T ss_conf 99999999999--------88875998522999999999999871158959999999999999999-------------- Q ss_pred HHHHHHHHHHH Q ss_conf 99999999973 Q 537021.9.peg.7 772 ISYQTAWMKTH 782 (1033) Q Consensus 772 ~~y~~aylK~~ 782 (1033) ..+|..+++.| T Consensus 85 ~~~Q~~~~~~~ 95 (109) T 1ecm_A 85 VLTQQALLQQH 95 (109) T ss_dssp HHHHHHHHHTC T ss_pred HHHHHHHHHHH T ss_conf 99999999987 No 142 >3nqb_A Adenine deaminase 2; PSI-II, structural genomics, protein structure initiative, N SGX research center for structural genomics, nysgxrc; 2.21A {Agrobacterium tumefaciens} Probab=31.95 E-value=15 Score=12.86 Aligned_cols=32 Identities=16% Similarity=0.265 Sum_probs=19.2 Q ss_pred CCCHHHHHHCHHHHHHHCCCHHHHHHHHHHCCCCEE Q ss_conf 881334331444764432998999999997899889 Q 537021.9.peg.7 9 PSFVHLQVHSSYSLLEGALSLNNILDKIAADQQPAI 44 (1033) Q Consensus 9 ~~FvhLh~hS~ySlldg~~~~e~LV~~A~~~G~~Ai 44 (1033) |-|+-.|+|-+.| ..+|+++.+.+...|..++ T Consensus 89 PGlID~H~Hie~s----~~~p~~~a~~al~~GvTTv 120 (608) T 3nqb_A 89 PGLIDTHMHIESS----MITPAAYAAAVVARGVTTI 120 (608) T ss_dssp ECEEEEEECGGGG----TSCHHHHHHHHHTTTEEEE T ss_pred CCCEEEEECCCCC----CCCHHHHHHHHHHCCEEEE T ss_conf 0711301587753----6799999999981893899 No 143 >2dpl_A GMP synthetase, GMP synthase [glutamine-hydrolyzing] subunit B; structural genomics, NPPSFA; 1.43A {Pyrococcus horikoshii OT3} PDB: 2z0c_A 3a4i_A Probab=31.92 E-value=15 Score=12.86 Aligned_cols=22 Identities=23% Similarity=0.457 Sum_probs=18.0 Q ss_pred HHHHHHHHHHHCCCCHHHHHHH Q ss_conf 8899999997527998898865 Q 537021.9.peg.7 454 AKAALRDVGRALQMPYSQVDRL 475 (1033) Q Consensus 454 ~k~aird~~r~~g~~~~~~~~~ 475 (1033) .|.=||.+||.||+|...+++- T Consensus 165 ~K~EVR~la~~LGlP~~i~~K~ 186 (308) T 2dpl_A 165 YKDEVRELAKFLGLPEKIYNRM 186 (308) T ss_dssp CHHHHHHHHHHTTCCHHHHTCC T ss_pred CHHHHHHHHHHHCCCHHHHCCC T ss_conf 5699999999809998992889 No 144 >1ym5_A YHI9, hypothetical 32.6 kDa protein in DAP2-SLT2 intergenic region; PHZF enzyme superfamily, double hot-DOG, structural genomics; 2.05A {Saccharomyces cerevisiae} Probab=31.82 E-value=11 Score=14.12 Aligned_cols=32 Identities=22% Similarity=0.322 Sum_probs=24.1 Q ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCH Q ss_conf 85104764334444204667765544037563 Q 537021.9.peg.7 366 WAKKNNIPVGPGRGSGAGSVTAYALTITDIDP 397 (1033) Q Consensus 366 ~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDP 397 (1033) |+...||.--|+=|||+++|.+|+......|. T Consensus 215 F~p~~Gi~EDpaTGSaa~ala~yl~~~~~~~~ 246 (300) T 1ym5_A 215 FAPVINVYEDPVCGSGSVALARYLQEVYKFEK 246 (300) T ss_dssp EEGGGTEEEESSCHHHHHHHHHHHHHHHCCSS T ss_pred ECCCCCCCCCCCCCHHHHHHHHHHHHHCCCCC T ss_conf 41578863376632668999999997358997 No 145 >1f0x_A DLDH, D-lactate dehydrogenase; oxidoreductase; HET: FAD; 1.90A {Escherichia coli} SCOP: d.58.32.2 d.145.1.1 Probab=31.56 E-value=13 Score=13.37 Aligned_cols=20 Identities=0% Similarity=-0.094 Sum_probs=13.1 Q ss_pred HHHHHHHHHCCCCCCCCCCCC Q ss_conf 899999851047643344442 Q 537021.9.peg.7 360 VADFIQWAKKNNIPVGPGRGS 380 (1033) Q Consensus 360 V~Div~~ak~~gI~vGpGRGS 380 (1033) |..+|++|++++|+|=| ||+ T Consensus 58 v~~iv~~a~~~~ipv~p-rGg 77 (571) T 1f0x_A 58 LWRVLKACVTADKIILM-QAA 77 (571) T ss_dssp HHHHHHHHHHTTCEEEE-ESS T ss_pred HHHHHHHHHHCCCEEEE-ECC T ss_conf 99999999987986999-789 No 146 >3l23_A Sugar phosphate isomerase/epimerase; structural genomics, joint center for structural genomics, JCSG, protein structure initiative; HET: MSE; 1.70A {Parabacteroides distasonis} Probab=31.53 E-value=15 Score=12.81 Aligned_cols=53 Identities=19% Similarity=0.271 Sum_probs=40.1 Q ss_pred CHHHHHHHCC--CHHHHHHHHHHCCCCEEEEEC--CCHHHH--HHHHHHHHHHCCCCEE Q ss_conf 4447644329--989999999978998899811--651774--9999999997699638 Q 537021.9.peg.7 18 SSYSLLEGAL--SLNNILDKIAADQQPAIAITD--TNNLFS--ALEFSQKACAAGIQPI 70 (1033) Q Consensus 18 S~ySlldg~~--~~e~LV~~A~~~G~~AiAITD--~~nl~G--av~F~~ackk~gIKPI 70 (1033) +-||+.+.+. .+++.++++++.|+.+|=|.- .....+ +-++-+...++|++.. T Consensus 18 ~lys~~~~~~~~d~~~~L~~~~e~G~~~IEl~~~~~~~~~~~~~~elk~~l~~~gL~i~ 76 (303) T 3l23_A 18 QIYSLSQELYKGDVAANLRKVKDMGYSKLELAGYGKGAIGGVPMMDFKKMAEDAGLKII 76 (303) T ss_dssp EGGGGGGGGGSSCHHHHHHHHHHTTCCEEEECCEETTEETTEEHHHHHHHHHHTTCEEE T ss_pred EEEECCCCCCCCCHHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHHHHCCCCEE T ss_conf 71422310114999999999999599989988877766689999999999998799755 No 147 >2vrq_A Alpha-L-arabinofuranosidase; hydrolase, glycosidase, inactive mutant, pentasaccharide; HET: XYP; 2.0A {Thermobacillus xylanilyticus} PDB: 2vrk_A Probab=31.44 E-value=15 Score=12.80 Aligned_cols=45 Identities=27% Similarity=0.429 Sum_probs=33.4 Q ss_pred HHHHHHHHHCCCCEEEE-----EC--------------------------CCHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 99999999789988998-----11--------------------------65177499999999976996389999 Q 537021.9.peg.7 30 NNILDKIAADQQPAIAI-----TD--------------------------TNNLFSALEFSQKACAAGIQPIIGCQ 74 (1033) Q Consensus 30 e~LV~~A~~~G~~AiAI-----TD--------------------------~~nl~Gav~F~~ackk~gIKPIiG~E 74 (1033) .++++..+++|.+.+=. +| ..|-+|.-+|.+-|++.|-+|++++- T Consensus 54 ~d~~~~lk~l~~~~lR~PGGn~~~~y~W~~~igp~~~R~~~~~~~w~~~~~~~~~g~def~~~~~~~ga~~~~~vn 129 (496) T 2vrq_A 54 NDVLEALKQMKIPVLRWPGGCFADEYHWKDGVGPREKRKRMVNTHWGGVIENNHFGTHEFMMLCELLGCEPYISGN 129 (496) T ss_dssp HHHHHHHHHHTCCEEEESCSGGGGTCCGGGGCSCGGGCCCCEETTTTSEECCCCSCHHHHHHHHHHHTCEEEEEEC T ss_pred HHHHHHHHHCCCCEEECCCCCCCCCCCCCCCCCCHHHCCCCCCCCCCCEECCCCCCHHHHHHHHHHCCCEEEEEEE T ss_conf 9999999974999797189874673603578698677847657887761368876799999999972997999998 No 148 >1uqr_A 3-dehydroquinate dehydratase; shikimate pathway, aromatic amino acid biosynthesis, lyase; 1.7A {Actinobacillus pleuropneumoniae} SCOP: c.23.13.1 Probab=31.41 E-value=11 Score=14.11 Aligned_cols=18 Identities=22% Similarity=0.058 Sum_probs=9.9 Q ss_pred HHHHHHHHHHHHHCCCEE Q ss_conf 999999999998189899 Q 537021.9.peg.7 195 YRESQVVQLAYTHELPLV 212 (1033) Q Consensus 195 ~~~~~l~~lA~~~~iPlV 212 (1033) .+++.+.+.+++.++.+. T Consensus 30 ~i~~~l~~~a~~~~i~l~ 47 (154) T 1uqr_A 30 DIEQHLQQSAQAQGYELD 47 (154) T ss_dssp HHHHHHHHHHHHTTCEEE T ss_pred HHHHHHHHHHHHCCCEEE T ss_conf 999999999997398354 No 149 >1qus_A Lytic murein transglycosylase B; alpha-helical protein with AN five-stranded antiparallel beta-sheet, hydrolase; HET: BCN; 1.70A {Escherichia coli} SCOP: d.2.1.6 PDB: 1d0k_A* 1d0m_A* 1qdr_A* 1qdt_A 1d0l_A* 1qut_A* 1ltm_A* Probab=31.35 E-value=15 Score=12.79 Aligned_cols=48 Identities=15% Similarity=0.175 Sum_probs=27.5 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH Q ss_conf 787899999862499999999999999975000133101478889999 Q 537021.9.peg.7 727 KQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISY 774 (1033) Q Consensus 727 ~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y 774 (1033) +.+..|-.=..+=|+++++.-.||.+=-.|+.|.=+-+---|.|.+|| T Consensus 97 ~~~~~l~~ie~~yGVp~~ii~AiwGvET~yG~~~G~~~v~~aLaTLA~ 144 (322) T 1qus_A 97 QYEDALNRAWQVYGVPPEIIVGIIGVETRWGRVMGKTRILDALATLSF 144 (322) T ss_dssp HTHHHHHHHHHHHCCCHHHHHHHHHHHHTTTTCCCCEEHHHHHHHHHH T ss_pred HHHHHHHHHHHHHCCCHHHHHHHEEEECCCCCCCCCCCHHHHHHHHHH T ss_conf 999999999999795988875620355310124588618999999997 No 150 >1chm_A Creatine amidinohydrolase; creatinase; 1.90A {Pseudomonas putida} SCOP: c.55.2.1 d.127.1.1 Probab=31.28 E-value=15 Score=12.88 Aligned_cols=44 Identities=23% Similarity=0.259 Sum_probs=33.5 Q ss_pred CCCHHHHHH----CHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCHH Q ss_conf 881334331----44476443299899999999789988998116517 Q 537021.9.peg.7 9 PSFVHLQVH----SSYSLLEGALSLNNILDKIAADQQPAIAITDTNNL 52 (1033) Q Consensus 9 ~~FvhLh~h----S~ySlldg~~~~e~LV~~A~~~G~~AiAITD~~nl 52 (1033) |++.|||.- +.||--+=..+.+.+-+.-+++|..|+-|++-.|+ T Consensus 3 ~~~~~~~~~~~~~~~fs~~E~~~R~~rl~~~m~~~~lDalvi~~~~ni 50 (401) T 1chm_A 3 PKTLRIRNGDKVRSTFSAQEYANRQARLRAHLAAENIDAAIFTSYHNI 50 (401) T ss_dssp CSEECCCCSCCCCCSSCHHHHHHHHHHHHHHHHHTTCSEEEECSHHHH T ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCC T ss_conf 865667887707899799999999999999999879989998998551 No 151 >2fmp_A DNA polymerase beta; nucleotidyl transferase, transferase/DNA complex; HET: DNA DOC DCT; 1.65A {Homo sapiens} SCOP: a.60.6.1 a.60.12.1 d.218.1.2 PDB: 1bpx_A* 1bpz_A* 1mq2_A* 1mq3_A* 1bpy_A* 1tva_A* 1zjm_A* 1zjn_A* 1zqa_A* 1zqb_A* 1zqc_A* 1zqd_A* 1zqe_A* 1zqf_A* 1zqg_A* 1zqh_A* 1zqi_A* 1zqj_A* 1zqk_A* 1zql_A* ... Probab=31.25 E-value=15 Score=12.77 Aligned_cols=13 Identities=23% Similarity=0.233 Sum_probs=9.5 Q ss_pred HHHHCCCCCHHHC Q ss_conf 5544037563332 Q 537021.9.peg.7 388 YALTITDIDPLRF 400 (1033) Q Consensus 388 Y~LgIT~VDPi~~ 400 (1033) |-+||+.+|=++. T Consensus 115 ~~~Gi~ti~dL~~ 127 (335) T 2fmp_A 115 VDEGIKTLEDLRK 127 (335) T ss_dssp HHTTCCSHHHHHT T ss_pred HHCCCCCHHHHHH T ss_conf 9818875999987 No 152 >1xub_A Phenazine biosynthesis protein PHZF; biosynthetic protein; 1.30A {Pseudomonas fluorescens} SCOP: d.21.1.2 d.21.1.2 PDB: 1u1w_A* 1u1v_A* 1u1x_A* 1xua_A* 1t6k_A Probab=30.58 E-value=16 Score=12.69 Aligned_cols=31 Identities=16% Similarity=0.245 Sum_probs=23.3 Q ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 8510476433444420466776554403756 Q 537021.9.peg.7 366 WAKKNNIPVGPGRGSGAGSVTAYALTITDID 396 (1033) Q Consensus 366 ~ak~~gI~vGpGRGSAagSLVaY~LgIT~VD 396 (1033) |+...||.==|+=|||++.|.+|+..--..| T Consensus 219 F~p~~Gi~EDpaTGSaa~aLa~yl~~~g~~~ 249 (298) T 1xub_A 219 FSPAYGVVEDAATGSAAGPLAIHLARHGQIE 249 (298) T ss_dssp EBGGGTBSSCSCCHHHHHHHHHHHHHTTSSC T ss_pred ECCCCCCCCCCCCCHHHHHHHHHHHHHCCCC T ss_conf 2146688888764231699999999848988 No 153 >1v9n_A Malate dehydrogenase; riken structural genomics/proteomics initiative, RSGI, structural genomics, oxidoreductase; HET: NDP; 2.10A {Pyrococcus horikoshii OT3} Probab=30.42 E-value=13 Score=13.28 Aligned_cols=21 Identities=19% Similarity=0.195 Sum_probs=9.0 Q ss_pred HHHHHHHHCCCEEEECCCCCC Q ss_conf 999999818989980883007 Q 537021.9.peg.7 200 VVQLAYTHELPLVATNNSLFL 220 (1033) Q Consensus 200 l~~lA~~~~iPlVaTndv~Y~ 220 (1033) .++.|+++|+-+|+..+++.. T Consensus 108 ai~~A~~~Gi~~v~v~ns~H~ 128 (360) T 1v9n_A 108 AIKKAKDTGIGIVIARNSNHY 128 (360) T ss_dssp HHHHHHHHSEEEEEEEEECCC T ss_pred HHHHHHHCCCEEEEECCCCCC T ss_conf 999998639559973046877 No 154 >3ls8_A Phosphatidylinositol 3-kinase catalytic subunit type 3; alpha/beta protein, PIK3C3, compound 15E, structural genomics, SGC stockholm; HET: AJZ; 2.25A {Homo sapiens} PDB: 3ihy_A Probab=30.40 E-value=15 Score=12.77 Aligned_cols=44 Identities=32% Similarity=0.461 Sum_probs=28.9 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCC Q ss_conf 2135654178999999998874025434332344358899876201410122114 Q 537021.9.peg.7 570 KFDFLGLKTLTLLQKSLDFLAQRGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLE 624 (1033) Q Consensus 570 K~D~Lgl~~L~~i~~~~~~i~~~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~e 624 (1033) .-|-|-+-.+++++..++. .| +|+.-. +|..+..|...|+-|+- T Consensus 377 RqD~l~~q~~~~m~~i~~~---~~--~dl~l~------~Y~vi~~~~~~GlIE~V 420 (614) T 3ls8_A 377 RQDQLILQIISLMDKLLRK---EN--LDLKLT------PYKVLATSTKHGFMQFI 420 (614) T ss_dssp HHHHHHHHHHHHHHHHHHH---TT--CCCCCC------CCCEEESSSSEEEEECC T ss_pred HHHHHHHHHHHHHHHHHHH---CC--CCCCCE------EEEEEECCCCCCEEEEE T ss_conf 3889999999999999986---68--776330------14786058987648886 No 155 >3dr0_A Cytochrome C6; photosynthesis, cyanobacteria, electron transfer, electron transport, heme, iron, metal-binding, thylakoid, transport; HET: HEM; 1.23A {Synechococcus SP} Probab=30.35 E-value=16 Score=12.66 Aligned_cols=36 Identities=14% Similarity=0.244 Sum_probs=27.1 Q ss_pred HHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 762100145787533365687899999998303802 Q 537021.9.peg.7 406 RFLNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHE 441 (1033) Q Consensus 406 RFLnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~ 441 (1033) ......+..||-.+--++++.+..|+.||.+....+ T Consensus 56 ~~~~~~~~~Mp~~~~~Lsd~ei~~l~aYi~~~a~~~ 91 (93) T 3dr0_A 56 YQVTNGQGAMPAFGGRLSDADIANVAAYIADQAENN 91 (93) T ss_dssp HHHHHCBTTBCCCBTTBCHHHHHHHHHHHHHHHHTT T ss_pred HHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 997258799987667899999999999999875079 No 156 >1bqc_A Protein (beta-mannanase); glycosyl hydrolase, family 5, thermomonospora fusca; 1.50A {Thermobifida fusca} SCOP: c.1.8.3 PDB: 2man_A* 3man_A* Probab=30.12 E-value=16 Score=12.63 Aligned_cols=43 Identities=5% Similarity=0.023 Sum_probs=29.9 Q ss_pred HHHHHHHHCCCCEEEE-------ECCCHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 9999999789988998-------116517749999999997699638999 Q 537021.9.peg.7 31 NILDKIAADQQPAIAI-------TDTNNLFSALEFSQKACAAGIQPIIGC 73 (1033) Q Consensus 31 ~LV~~A~~~G~~AiAI-------TD~~nl~Gav~F~~ackk~gIKPIiG~ 73 (1033) +-++..++.|+.+|=| -+.+.+.-+-++...|.++||+.|+.. T Consensus 36 ~~~~~~k~~G~N~vRl~~~~~~~~~~~~~~~ld~~v~~a~~~Gi~Vildl 85 (302) T 1bqc_A 36 QAFADIKSHGANTVRVVLSNGVRWSKNGPSDVANVISLCKQNRLICMLEV 85 (302) T ss_dssp THHHHHHHTTCSEEEEEECCSSSSCCCCHHHHHHHHHHHHHTTCEEEEEE T ss_pred HHHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 99999997699889970316555787709999999999998799899984 No 157 >2uuu_A Alkyldihydroxyacetonephosphate synthase; rhizomelic chondrodysplasia punctata, biosynthesis of phospholipids, flavoprotein; HET: FAD PL3; 1.95A {Dictyostelium discoideum} PDB: 2uuv_A* Probab=30.01 E-value=13 Score=13.47 Aligned_cols=20 Identities=35% Similarity=0.527 Sum_probs=16.0 Q ss_pred HHHHHHHHHCCCCCCCCCCCC Q ss_conf 899999851047643344442 Q 537021.9.peg.7 360 VADFIQWAKKNNIPVGPGRGS 380 (1033) Q Consensus 360 V~Div~~ak~~gI~vGpGRGS 380 (1033) |.++|++|.+++|+|-| ||+ T Consensus 150 V~~iv~~A~~~~vpv~p-rGg 169 (584) T 2uuu_A 150 VERLVQLAHKYNVVIIP-MGG 169 (584) T ss_dssp HHHHHHHHHHHTCEEEE-ESS T ss_pred HHHHHHHHHHCCCEEEE-ECC T ss_conf 99999999987982999-889 No 158 >1qnr_A Endo-1,4-B-D-mannanase; hydrolase, anomalous scattering; HET: NAG MAB; 1.4A {Trichoderma reesei} SCOP: c.1.8.3 PDB: 1qno_A* 1qnq_A* 1qnp_A* 1qns_A* Probab=29.97 E-value=16 Score=12.61 Aligned_cols=21 Identities=33% Similarity=0.229 Sum_probs=12.5 Q ss_pred HHHHHHHHHHHHCCCEEEECC Q ss_conf 999999999981898998088 Q 537021.9.peg.7 196 RESQVVQLAYTHELPLVATNN 216 (1033) Q Consensus 196 ~~~~l~~lA~~~~iPlVaTnd 216 (1033) ....++..|++.++.++.+.. T Consensus 91 ~~d~~v~~a~~~gi~v~l~~~ 111 (344) T 1qnr_A 91 TLDYVVQSAEQHNLKLIIPFV 111 (344) T ss_dssp HHHHHHHHHHHHTCEEEEESC T ss_pred HHHHHHHHHHHCCCEEEEECC T ss_conf 889999999975997876014 No 159 >3lwz_A 3-dehydroquinate dehydratase; AROQ, IDP90771, amino- acid biosynthesis, aromatic amino acid biosynthesis, lyase, structural genomics; 1.65A {Yersinia pestis} Probab=29.93 E-value=10 Score=14.29 Aligned_cols=16 Identities=25% Similarity=0.088 Sum_probs=7.4 Q ss_pred HHHHHHHHHHHHCCCE Q ss_conf 9999999999818989 Q 537021.9.peg.7 196 RESQVVQLAYTHELPL 211 (1033) Q Consensus 196 ~~~~l~~lA~~~~iPl 211 (1033) +++.+.+.|++.++.+ T Consensus 37 i~~~~~~~a~~~~i~~ 52 (153) T 3lwz_A 37 IVSQLEIQAQGMDVAL 52 (153) T ss_dssp HHHHHHHHHHHTTEEE T ss_pred HHHHHHHHHHHCCCCE T ss_conf 9999999998759817 No 160 >2gjv_A Putative cytoplasmic protein; structural genomics, unknown function, PSI, protein structur initiative; 2.39A {Salmonella typhimurium} SCOP: d.323.1.2 Probab=29.80 E-value=8.1 Score=15.13 Aligned_cols=36 Identities=17% Similarity=0.281 Sum_probs=21.6 Q ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHH Q ss_conf 1789999999988740254343323443588998762 Q 537021.9.peg.7 577 KTLTLLQKSLDFLAQRGVKVDLSLIPFDDHETYHLLT 613 (1033) Q Consensus 577 ~~L~~i~~~~~~i~~~~~~~d~~~ip~~D~~~~~l~~ 613 (1033) .+|.+|+..+.-+++...++|+.-.| +||+.|.+=. T Consensus 26 ~~l~ii~~v~~rLre~~P~~~ve~fp-~~p~~Y~l~h 61 (175) T 2gjv_A 26 ETLSVIHTVANRLRELNPDMDIHISS-TDAKVYIPTG 61 (175) T ss_dssp CHHHHHHHHHHHHHHHCTTEEEECCS-SCTTTCCCCS T ss_pred HHHHHHHHHHHHHHHHCCCCEEEECC-CCHHHCCCCC T ss_conf 57789999999999769785169667-9887736578 No 161 >1nxu_A Hypothetical oxidoreductase YIAK; hypothetical protein, structural genomics, PSI, protein structure initiative; 1.80A {Escherichia coli} SCOP: c.122.1.1 PDB: 1s20_A* Probab=29.73 E-value=15 Score=12.75 Aligned_cols=25 Identities=28% Similarity=0.353 Sum_probs=10.9 Q ss_pred HHHHHHHHHHCCCEEEECCCCCCCH Q ss_conf 9999999981898998088300787 Q 537021.9.peg.7 198 SQVVQLAYTHELPLVATNNSLFLSE 222 (1033) Q Consensus 198 ~~l~~lA~~~~iPlVaTndv~Y~~~ 222 (1033) +.+++.|+++|+-+|+..+++..-. T Consensus 95 ~~ai~kA~~~Gi~~v~v~ns~H~G~ 119 (333) T 1nxu_A 95 DRAIELAADHGIGLVALRNANHWMR 119 (333) T ss_dssp HHHHHHHHHHSEEEEEEEEECCCSC T ss_pred HHHHHHHHHHCCEEEEECCCCCCCC T ss_conf 9999999971947997267786520 No 162 >3k6g_A Telomeric repeat-binding factor 2-interacting Pro; helix, chromosomal protein, nucleus, phosphoprotein, telomer cycle, DNA-binding, protein binding; 1.95A {Homo sapiens} Probab=29.71 E-value=5.6 Score=16.52 Aligned_cols=29 Identities=17% Similarity=0.260 Sum_probs=15.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHC-CCCCCCCC Q ss_conf 998788641110001332210-23455575 Q 537021.9.peg.7 270 ALENTVEIARRCSFILQVHAP-ILPRFIEK 298 (1033) Q Consensus 270 Ai~NT~~IAe~c~~~l~~~~~-~lP~f~~p 298 (1033) .++|+.+++.-|.|-..-..+ -+|-+... T Consensus 35 lLKnSGel~at~~fL~tG~r~dgyPiW~R~ 64 (111) T 3k6g_A 35 FLKNSGELEATSAFLASGQRADGYPIWSRQ 64 (111) T ss_dssp HHHTTTCHHHHHHHHHHSSCTTSCCCCCHH T ss_pred HHHCCCCHHHHHHHHHCCCCCCCCCEEEEC T ss_conf 997253589999999808789998512326 No 163 >1xng_A NH(3)-dependent NAD(+) synthetase; amidotransferase, ligase; HET: DND ATP; 1.70A {Helicobacter pylori} SCOP: c.26.2.1 PDB: 1xnh_A Probab=29.48 E-value=16 Score=12.55 Aligned_cols=59 Identities=15% Similarity=0.218 Sum_probs=30.5 Q ss_pred CCCC-EEEECCCCCCHHHHHHHCCCHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEE Q ss_conf 4898-899819767967888877997899999999987659868999943991158999999999998189899 Q 537021.9.peg.7 140 GTEG-LIMLTGGSSGPIDRAFFFNSSQVAEKRLLTFKKLFGDRLYVNLQRHRGYDRYRESQVVQLAYTHELPLV 212 (1033) Q Consensus 140 ~~eg-LIvlsg~~~g~i~~~l~~~~~~~a~~~l~~l~~~Fgd~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlV 212 (1033) ..++ ++.+|||.+|-+.-++. ++.+++|++--.+...............+|+.++++.. T Consensus 24 g~k~vvvglSGGVDSav~A~La--------------~~a~~~~v~~v~mp~~~~~~~~~~~A~~~a~~Lgi~~~ 83 (268) T 1xng_A 24 GFKKVVYGLSGGLDSAVVGVLC--------------QKVFKENAHALLMPSSVSMPENKTDALNLCEKFSIPYT 83 (268) T ss_dssp TCCCEEEECCSSHHHHHHHHHH--------------HHHHGGGEEEEECCCSSSCHHHHHHHHHHHHHHTCCEE T ss_pred CCCEEEEECCCCHHHHHHHHHH--------------HHHHHHCCEEEECCCHHCCHHHHHHHHHHHHHHHHCCC T ss_conf 9981999786889999999999--------------98503226787135010571557999999998330230 No 164 >3cm3_A Late protein H1, dual specificity protein phosphatase; dual-specificity phosphatase, VH1, hydrolase; 1.32A {Vaccinia virus} PDB: 2rf6_A 2p4d_A Probab=29.20 E-value=17 Score=12.52 Aligned_cols=41 Identities=24% Similarity=0.401 Sum_probs=28.2 Q ss_pred CCHHHHHHHHHHHHHHHCCC--CCC--CCCCCCHHHHHHHHHHHC Q ss_conf 63055788899999851047--643--344442046677655440 Q 537021.9.peg.7 352 KFSGYFLIVADFIQWAKKNN--IPV--GPGRGSGAGSVTAYALTI 392 (1033) Q Consensus 352 gf~~YFLiV~Div~~ak~~g--I~v--GpGRGSAagSLVaY~LgI 392 (1033) ....||--+.++|+-|.++| ++| ..|++=+|..++||++-= T Consensus 89 ~i~~~~~~~~~fI~~~~~~g~~VLVHC~~G~sRS~tvv~aYLm~~ 133 (176) T 3cm3_A 89 DISKYFDDVTAFLSKCDQRNEPVLVHSAAGVNRSGAMILAYLMSK 133 (176) T ss_dssp CCGGGHHHHHHHHHHHHHHTCCEEEECSSSSSHHHHHHHHHHHHH T ss_pred CHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHHH T ss_conf 699999999999999985599899999987741499999999997 No 165 >2c0h_A Mannan endo-1,4-beta-mannosidase; hydrolase, signal, TIM alpha/beta barrel; 1.6A {Mytilus edulis} SCOP: c.1.8.3 Probab=29.17 E-value=17 Score=12.51 Aligned_cols=44 Identities=9% Similarity=0.146 Sum_probs=25.5 Q ss_pred HHHHHHHHHHCCCCEEEE--------------------ECCCHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 899999999789988998--------------------11651774999999999769963899 Q 537021.9.peg.7 29 LNNILDKIAADQQPAIAI--------------------TDTNNLFSALEFSQKACAAGIQPIIG 72 (1033) Q Consensus 29 ~e~LV~~A~~~G~~AiAI--------------------TD~~nl~Gav~F~~ackk~gIKPIiG 72 (1033) .++.++..+++|+.+|=+ -|...+.-+-+|+++|.++||+.|+- T Consensus 47 ~~~~l~~l~~~G~N~vRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~a~~~Gi~vil~ 110 (353) T 2c0h_A 47 FESTLSDMQSHGGNSVRVWLHIEGESTPEFDNNGYVTGIDNTLISDMRAYLHAAQRHNILIFFT 110 (353) T ss_dssp HHHHHHHHHHTTCCEEEEEEEETTSSSSEECTTSCEEECCTTHHHHHHHHHHHHHHTTCEEEEE T ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 9999999998599199988461765575434566667678667787999999999789989998 No 166 >1o57_A PUR operon repressor; purine operon repressor, helix-turn-helix domain, phosphoribosyltranseferases, domain recombination, DNA binding; HET: EPE P6G 2PE PG4 1PE; 2.20A {Bacillus subtilis} SCOP: a.4.5.40 c.61.1.1 PDB: 1p4a_A* Probab=29.15 E-value=17 Score=12.51 Aligned_cols=70 Identities=16% Similarity=0.230 Sum_probs=43.7 Q ss_pred CHHHHHHHHCCCCCCCCCCCCC-----HHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 1056762100145787533365-----68789999999830380215542207787788999999975279988988 Q 537021.9.peg.7 402 LLFERFLNPDRMSMPDFDIDFC-----QDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVD 473 (1033) Q Consensus 402 LlFERFLnp~R~~~PDIDiDf~-----~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~ 473 (1033) -+.|||-+|+|+ +|..=+|.. .+-+.++-+.+.++|-...+-.|+|..+=+--.| .-+|+.||.|+..+. T Consensus 86 ~l~~~l~~~~ri-lpG~~vy~s~ll~dP~~l~~lG~~lA~~~~~~~iD~Vvgv~~~GiplA-~~vA~~LgvP~v~~r 160 (291) T 1o57_A 86 TLGQSLANPERI-LPGGYVYLTDILGKPSVLSKVGKLFASVFAEREIDVVMTVATKGIPLA-YAAASYLNVPVVIVR 160 (291) T ss_dssp HHHHHHTCGGGE-ETTTEECCTTTTTCHHHHHHHHHHHHHHTTTSCCSEEEEETTTTHHHH-HHHHHHHTCCEEEEB T ss_pred HHHHHHCCCCCC-CCCCEEEHHHHCCCHHHHHHHHHHHHHHCCCCCCCEEEECCCCCHHHH-HHHHHHHCCCEEEEE T ss_conf 999996599952-588568756640699999999999999737789979993175669999-999999699979999 No 167 >1jae_A Alpha-amylase; glycosidase, carbohydrate metabolism, 4-glucan-4-glucanohydrolase, hydrolase; 1.65A {Tenebrio molitor} SCOP: b.71.1.1 c.1.8.1 PDB: 1clv_A 1tmq_A 1viw_A* Probab=28.61 E-value=14 Score=13.16 Aligned_cols=17 Identities=12% Similarity=0.358 Sum_probs=7.4 Q ss_pred HHHHHHHHHHCCCCEEE Q ss_conf 99999999976996389 Q 537021.9.peg.7 55 ALEFSQKACAAGIQPII 71 (1033) Q Consensus 55 av~F~~ackk~gIKPIi 71 (1033) .-++.++|.+.|||.|+ T Consensus 77 fk~LV~~aH~~GI~VIl 93 (471) T 1jae_A 77 FTDMTRRCNDAGVRIYV 93 (471) T ss_dssp HHHHHHHHHHTTCEEEE T ss_pred HHHHHHHHHHCCCEEEE T ss_conf 99999999987998999 No 168 >2e9l_A Cytosolic beta-glucosidase; novel cytosolic neutral beta-glycosylceramidase, hydrolase; HET: BGC PLM OLA; 1.60A {Homo sapiens} PDB: 2e9m_A* 2zox_A* 2jfe_X* Probab=28.61 E-value=17 Score=12.44 Aligned_cols=50 Identities=14% Similarity=0.159 Sum_probs=30.3 Q ss_pred HHHCCCHHHHHHHHHHCCCCEEEEE--------C--CCHHH-HHHHHH----HHHHHCCCCEEEE Q ss_conf 4432998999999997899889981--------1--65177-499999----9999769963899 Q 537021.9.peg.7 23 LEGALSLNNILDKIAADQQPAIAIT--------D--TNNLF-SALEFS----QKACAAGIQPIIG 72 (1033) Q Consensus 23 ldg~~~~e~LV~~A~~~G~~AiAIT--------D--~~nl~-Gav~F~----~ackk~gIKPIiG 72 (1033) .|...+-++=++.++++|++|.=++ | .|... .+++|| .+|+++||+|++- T Consensus 53 ~d~y~~y~eDi~l~~~lG~~~yRfSi~WsRI~P~g~~g~~n~~gl~~Y~~~id~l~~~GI~P~vT 117 (469) T 2e9l_A 53 CGSYTLWEEDLKCIKQLGLTHYRFSLSWSRLLPDGTTGFINQKGIDYYNKIIDDLLKNGVTPIVT 117 (469) T ss_dssp TCTTTCHHHHHHHHHHHTCSEEEEECCHHHHSTTSSTTSCCHHHHHHHHHHHHHHHHTTCEEEEE T ss_pred CCHHHHHHHHHHHHHHHCCCEEECCCCHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEECCC T ss_conf 66456379999999982999897447699853278778647999999999999999829972103 No 169 >1v77_A PH1877P, hypothetical protein PH1877; RNAse P protein, TIM-barrel, RNA binding protein; 1.80A {Pyrococcus horikoshii} SCOP: c.6.3.2 PDB: 2czv_A* Probab=28.26 E-value=17 Score=12.39 Aligned_cols=41 Identities=10% Similarity=0.031 Sum_probs=28.6 Q ss_pred CCEEEEEECCC-----HHHHH----HHHHHHHHHHHHCCCEEEECCCCCC Q ss_conf 86899994399-----11589----9999999999818989980883007 Q 537021.9.peg.7 180 DRLYVNLQRHR-----GYDRY----RESQVVQLAYTHELPLVATNNSLFL 220 (1033) Q Consensus 180 d~~ylEl~~~~-----~~e~~----~~~~l~~lA~~~~iPlVaTndv~Y~ 220 (1033) .+..+|+.... +..+. ....+++++++++.|+|.++|.|.+ T Consensus 123 ~gValEIn~s~ll~~~~~~R~~~l~n~~~ll~L~kky~~piVvsSdAhs~ 172 (212) T 1v77_A 123 KNVALGFSLRPLLYSNPYERANLLRFMMKAWKLVEKYKVRRFLTSSAQEK 172 (212) T ss_dssp HTCEEEEESHHHHHSCHHHHHHHHHHHHHHHHHHHHHTCCEEEECCCSSG T ss_pred CCEEEEEECHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEECCCCCH T ss_conf 89189997627644798647899999999999999819988997699982 No 170 >2h56_A DNA-3-methyladenine glycosidase; 10174367, EC 3.2.2.-, structural genomics, PSI-2, protein structure initiative; 2.55A {Bacillus halodurans} Probab=28.19 E-value=17 Score=12.38 Aligned_cols=98 Identities=13% Similarity=0.093 Sum_probs=43.1 Q ss_pred HHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCC Q ss_conf 86249999999999999997500013310147888999999999973026667775100235420258999888863685 Q 537021.9.peg.7 736 ASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYPVEFLAASMTLEMDNVEKIKKFCQDARQFNIQ 815 (1033) Q Consensus 736 ~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP~eF~aa~L~~~~~~~~~~~~~i~e~~~~gi~ 815 (1033) -+...++-+.|..||+.+..--|-.|+....++ .--+.++ .. .|-...-..++.++.+.-. T Consensus 59 Ii~QQiS~~aA~ti~~Rl~~~~G~~~p~p~~l~------------~~~~~~L-----r~-~Gls~~K~~~i~~la~~~~- 119 (233) T 2h56_A 59 IVEQQLSIKAASAIYGRVEQLVGGALEKPEQLY------------RVSDEAL-----RQ-AGVSKRKIEYIRHVCEHVE- 119 (233) T ss_dssp HHHTTSCHHHHHHHHHHHHHHHTSCCCCTHHHH------------TSCHHHH-----HH-TTCCHHHHHHHHHHHHHHH- T ss_pred HHCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHH------------HCCHHHH-----HH-CCCCHHHHHHHHHHHHHHH- T ss_conf 981831499999999999999489999968886------------2999999-----77-7997878999999999998- Q ss_pred CCCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHH Q ss_conf 0355233421003531986985026858969899999999 Q 537021.9.peg.7 816 IMPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIME 855 (1033) Q Consensus 816 vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive 855 (1033) .=.++... -...+...++--|.+|||||.-+++-+.= T Consensus 120 --~g~~~~~~-l~~~~~~e~~~~L~~ikGIGpWTA~~ill 156 (233) T 2h56_A 120 --SGRLDFTE-LEGAEATTVIEKLTAIKGIGQWTAEMFMM 156 (233) T ss_dssp --TTSSCHHH-HTTSCHHHHHHHHHTSTTCCHHHHHHHHH T ss_pred --HCCCCCCC-CCCCCHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf --17752001-01268889999988068848789999999 No 171 >3mmw_A Endoglucanase; TIM barrel fold, hydrolase; 1.85A {Thermotoga maritima} PDB: 3mmu_A Probab=28.09 E-value=17 Score=12.37 Aligned_cols=45 Identities=16% Similarity=0.124 Sum_probs=27.3 Q ss_pred CHHHHHHHHHHCCCCEEEEE----------CCC-----HHHHHHHHHHHHHHCCCCEEEE Q ss_conf 98999999997899889981----------165-----1774999999999769963899 Q 537021.9.peg.7 28 SLNNILDKIAADQQPAIAIT----------DTN-----NLFSALEFSQKACAAGIQPIIG 72 (1033) Q Consensus 28 ~~e~LV~~A~~~G~~AiAIT----------D~~-----nl~Gav~F~~ackk~gIKPIiG 72 (1033) -.++.++..++.|+.+|=|- +.. .+...-++...|+++||++|+- T Consensus 34 ~t~~~~~~i~~~G~N~VRipi~w~~~~~~~~~~~~~~~~l~~ld~~i~~a~~~gi~vild 93 (317) T 3mmw_A 34 IKDEFFDIIKEAGFSHVRIPIRWSTHAYAFPPYKIMDRFFKRVDEVINGALKRGLAVVIN 93 (317) T ss_dssp CCTHHHHHHHHTTCCEEEECCCCGGGBCSSTTCCBCHHHHHHHHHHHHHHHTTTCEEEEE T ss_pred CHHHHHHHHHHCCCCEEECCCCHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 629999999983999899477789836888887408899999999999999779989997 No 172 >1uek_A 4-(cytidine 5'-diphospho)-2C-methyl-D-erythritol kinase; non-mevalonate pathway, GHMP superfamily; 1.70A {Thermus thermophilus} SCOP: d.14.1.5 d.58.26.5 Probab=28.04 E-value=15 Score=12.84 Aligned_cols=23 Identities=22% Similarity=0.215 Sum_probs=12.0 Q ss_pred HHHHHHHHCCCCCCCCCCCCHHHHH Q ss_conf 9999985104764334444204667 Q 537021.9.peg.7 361 ADFIQWAKKNNIPVGPGRGSGAGSV 385 (1033) Q Consensus 361 ~Div~~ak~~gI~vGpGRGSAagSL 385 (1033) .|+.+++++.|-. + .+=|++||- T Consensus 214 ~~i~~~l~~~ga~-~-~~lSGSGst 236 (275) T 1uek_A 214 KEVRGRMRALGLR-G-VLMSGSGSA 236 (275) T ss_dssp HHHHHHHHHTTCE-E-EEECTTSSC T ss_pred HHHHHHHHHCCCC-E-EEEECHHHC T ss_conf 9999999967998-6-999884124 No 173 >3bbn_M Ribosomal protein S13; small ribosomal subunit, spinach chloroplast ribosome, ribonucleoprotein particle, macromolecular complex; 9.40A {Oryza sativa} Probab=27.91 E-value=2.7 Score=19.15 Aligned_cols=40 Identities=28% Similarity=0.403 Sum_probs=30.4 Q ss_pred HHHHCCCCCCCCCCCCCCEEEEECCCEEEEEHHHHCCCCHHHHHHHHH Q ss_conf 888636850355233421003531986985026858969899999999 Q 537021.9.peg.7 808 DARQFNIQIMPPSVNTPCVDFKVGDNRIYYSLAAIKGVGTTTARHIME 855 (1033) Q Consensus 808 e~~~~gi~vl~PdIN~S~~~f~~~~~~Ir~Gl~~Ikgvg~~~~~~Ive 855 (1033) ....+.+.|...|| .+++.|.++|..|.|||...+..|++ T Consensus 41 ~~~~~~vRI~GvnI--------p~nK~V~~ALT~IyGIG~~~A~~Ic~ 80 (145) T 3bbn_M 41 GLQIECIRIGGVEI--------PNHKRVEYSLQYIHGIGRSRSRQILL 80 (145) T ss_dssp ------CCCSSSCC--------CCSSBTTTGGGGSTTCCSSTTTGGGT T ss_pred CEEEEEEEEECCCC--------CCCCEEEEHHHHHHCCCHHHHHHHHH T ss_conf 54789899848748--------99977212355674638899999999 No 174 >3jug_A Beta-mannanase; TIM-barrel, glycosidase, hydrolase; 1.60A {Bacillus} Probab=27.90 E-value=17 Score=12.35 Aligned_cols=17 Identities=24% Similarity=0.339 Sum_probs=7.9 Q ss_pred HHHHHHHHHHHCCCEEE Q ss_conf 99999999981898998 Q 537021.9.peg.7 197 ESQVVQLAYTHELPLVA 213 (1033) Q Consensus 197 ~~~l~~lA~~~~iPlVa 213 (1033) ..+++++|+++|+-+|. T Consensus 89 ld~~i~~a~~~Gi~VIl 105 (345) T 3jug_A 89 VREVIELAEQNKMVAVV 105 (345) T ss_dssp HHHHHHHHHTTTCEEEE T ss_pred HHHHHHHHHHCCCEEEE T ss_conf 99999999988999999 No 175 >1z2i_A Malate dehydrogenase; structural genomics, PSI, protein structure initiative, NEW YORK SGX research center for structural genomics; HET: NAD; 2.20A {Agrobacterium tumefaciens} Probab=27.73 E-value=17 Score=12.32 Aligned_cols=18 Identities=33% Similarity=0.296 Sum_probs=7.6 Q ss_pred HHHHHHHHCCCEEEECCC Q ss_conf 999999818989980883 Q 537021.9.peg.7 200 VVQLAYTHELPLVATNNS 217 (1033) Q Consensus 200 l~~lA~~~~iPlVaTndv 217 (1033) +++.|+++|+-+|+..++ T Consensus 106 aie~Ak~~Gi~~v~v~ns 123 (358) T 1z2i_A 106 AMALAEKFGIGAVAIRNS 123 (358) T ss_dssp HHHHHHHHSEEEEEEEEE T ss_pred HHHHHHHCCCCEEEEECC T ss_conf 999998548627997347 No 176 >2jep_A Xyloglucanase; family 5, plant cell WALL, hydrolase; 1.4A {Paenibacillus pabuli} PDB: 2jeq_A* Probab=27.60 E-value=17 Score=12.31 Aligned_cols=19 Identities=37% Similarity=0.499 Sum_probs=11.9 Q ss_pred HHHHHHHHHHHHCCCEEEE Q ss_conf 9999999999818989980 Q 537021.9.peg.7 196 RESQVVQLAYTHELPLVAT 214 (1033) Q Consensus 196 ~~~~l~~lA~~~~iPlVaT 214 (1033) +.+++++.+.+.|+.+|.+ T Consensus 111 ~~~~vv~~a~~~Gl~vild 129 (395) T 2jep_A 111 RIQQVVDYAYNEGLYVIIN 129 (395) T ss_dssp HHHHHHHHHHTTTCEEEEC T ss_pred HHHHHHHHHHHCCCEEEEE T ss_conf 9999999998589979999 No 177 >2vqe_B 30S ribosomal protein S2; tRNA-binding, rRNA-binding, metal-binding, zinc-finger, translation; HET: TM2 PAR; 2.5A {Thermus thermophilus} SCOP: c.23.15.1 PDB: 1gix_E* 1hnw_B* 1hnx_B* 1hnz_B* 1hr0_B 1ibk_B* 1ibl_B* 1ibm_B 1j5e_B 1jgo_E* 1jgp_E* 1jgq_E* 1ml5_E* 1n32_B* 1n33_B* 1n34_B 1n36_B 1xmo_B* 1xmq_B* 1xnq_B* ... Probab=27.58 E-value=18 Score=12.30 Aligned_cols=21 Identities=24% Similarity=0.344 Sum_probs=7.7 Q ss_pred EEEEEECCHHHHHHHHHHHHH Q ss_conf 399995696899999999989 Q 537021.9.peg.7 98 SIVLLVSTAEGYQRLIELVSR 118 (1033) Q Consensus 98 ~lvlLAkN~~Gy~nL~kL~S~ 118 (1033) |++=|-++-..++..++.++. T Consensus 40 ~IIdL~kT~~~L~~A~~~i~~ 60 (256) T 2vqe_B 40 HIIDLQKTMEELERTFRFIED 60 (256) T ss_dssp EEECHHHHHHHHHHHHHHHHH T ss_pred EEECHHHHHHHHHHHHHHHHH T ss_conf 688699999999999999998 No 178 >3kts_A Glycerol uptake operon antiterminator regulatory protein; structural genomics, PSI-2, protein structure initiative; HET: UNL; 2.75A {Listeria monocytogenes str} Probab=27.54 E-value=18 Score=12.30 Aligned_cols=157 Identities=14% Similarity=0.127 Sum_probs=94.5 Q ss_pred CHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEEECCHH Q ss_conf 98999999997899889981165177499999999976996389999998547664334456566667753999956968 Q 537021.9.peg.7 28 SLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIGCQLDIDMQDNLKMTNQGKTPFPTVPSIVLLVSTAE 107 (1033) Q Consensus 28 ~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG~E~~v~~~~~~~~~~~~~~~~~~~~~lvlLAkN~~ 107 (1033) +=.+-.++|.+-..+.+-|.. +++.-..+..+.|+++|-+.++=+++ +-=|..++. T Consensus 17 r~~~~le~al~s~~~~iflL~-g~I~~l~~iv~~~k~~gK~vfVHiDL-----------------------I~GL~~d~~ 72 (192) T 3kts_A 17 HNQKDMEKILELDLTYMVMLE-THVAQLKALVKYAQAGGKKVLLHADL-----------------------VNGLKNDDY 72 (192) T ss_dssp SSSHHHHHHTTSSCCEEEECS-EETTTHHHHHHHHHHTTCEEEEEGGG-----------------------EETCCCSHH T ss_pred CCHHHHHHHHCCCCCEEEEEC-CCHHHHHHHHHHHHHCCCEEEEEEEC-----------------------CCCCCCCHH T ss_conf 788999999758998999953-71877999999999879989998531-----------------------677788889 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHCCCHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 99999999989973248778740078999863489889981976796788887799789999999998765986899994 Q 537021.9.peg.7 108 GYQRLIELVSRMYLLDQGKQSVRICLSWLQEIGTEGLIMLTGGSSGPIDRAFFFNSSQVAEKRLLTFKKLFGDRLYVNLQ 187 (1033) Q Consensus 108 Gy~nL~kL~S~a~~~~~~~~~p~i~~~~L~e~~~egLIvlsg~~~g~i~~~l~~~~~~~a~~~l~~l~~~Fgd~~ylEl~ 187 (1033) |.+=|++.. +..+--..-...+..-...||..+- |.++- +....++.+......-+| ++|+. T Consensus 73 av~fL~~~~-------~~dGIISTk~~~I~~Ak~~Gl~tIq--------R~Fli-DS~al~~~~~~i~~~~PD--~VEiL 134 (192) T 3kts_A 73 AIDFLCTEI-------CPDGIISTRGNAIMKAKQHKMLAIQ--------RLFMI-DSSAYNKGVALIQKVQPD--CIELL 134 (192) T ss_dssp HHHHHHHTT-------CCSEEEESCHHHHHHHHHTTCEEEE--------EEECC-SHHHHHHHHHHHHHHCCS--EEEEE T ss_pred HHHHHHHHC-------CCCEEEECCHHHHHHHHHCCCEEEE--------EEEEE-EHHHHHHHHHHHHHCCCC--EEEEC T ss_conf 999999842-------8998997889999999977990797--------66344-276899999998547989--99988 Q ss_pred CCCHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCC Q ss_conf 399115899999999999818989980883007878899998766884698 Q 537021.9.peg.7 188 RHRGYDRYRESQVVQLAYTHELPLVATNNSLFLSEEDYEAHDVLMAVAHST 238 (1033) Q Consensus 188 ~~~~~e~~~~~~l~~lA~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI~~~~ 238 (1033) ..-.+ ..+.++.++.++|+||..=+ .+++| +..|++.|. T Consensus 135 PG~~p-----~~I~~i~~~~~~PiIAGGLI--~~ked-----V~~aL~aGA 173 (192) T 3kts_A 135 PGIIP-----EQVQKMTQKLHIPVIAGGLI--ETSEQ-----VNQVIASGA 173 (192) T ss_dssp CTTCH-----HHHHHHHHHHCCCEEEESSC--CSHHH-----HHHHHTTTE T ss_pred CHHHH-----HHHHHHHHHCCCCEEEECCC--CCHHH-----HHHHHHCCC T ss_conf 66789-----99999997469999976672--88999-----999998599 No 179 >2ziu_A MUS81 protein; helix-hairpin-helix, alternative splicing, DNA damage, DNA recombination, DNA repair, hydrolase, magnesium; 2.70A {Danio rerio} PDB: 2ziv_A 2ziw_A Probab=27.19 E-value=18 Score=12.25 Aligned_cols=29 Identities=21% Similarity=0.399 Sum_probs=18.9 Q ss_pred HHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHC Q ss_conf 2685896989999999998379999978999734 Q 537021.9.peg.7 839 LAAIKGVGTTTARHIMEASADKPFDSLEDFCSRV 872 (1033) Q Consensus 839 l~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~~rv 872 (1033) |.+|+|||.+.|.+|++ .|.++.+++.-. T Consensus 239 L~qIpGIs~~~A~~I~~-----~y~T~~~L~~a~ 267 (311) T 2ziu_A 239 LMQISGVSGDKAAAVLE-----HYSTVSSLLQAY 267 (311) T ss_dssp HTTBTTCCHHHHHHHHH-----HCSSHHHHHHHH T ss_pred HHHCCCCCHHHHHHHHH-----HCCCHHHHHHHH T ss_conf 98279999999999999-----769999999999 No 180 >3ce9_A Glycerol dehydrogenase; NP_348253.1, 3-dehydroquinate synthase, structural genomics, joint center for structural genomics; HET: MSE; 2.37A {Clostridium acetobutylicum atcc 824} Probab=27.02 E-value=18 Score=12.23 Aligned_cols=48 Identities=13% Similarity=0.023 Sum_probs=32.7 Q ss_pred HHHHHCCCHHHHHHHHHHCCC-CEEEEECCCHHHHH--HHHHHHHHHCCCCE Q ss_conf 764432998999999997899-88998116517749--99999999769963 Q 537021.9.peg.7 21 SLLEGALSLNNILDKIAADQQ-PAIAITDTNNLFSA--LEFSQKACAAGIQP 69 (1033) Q Consensus 21 Slldg~~~~e~LV~~A~~~G~-~AiAITD~~nl~Ga--v~F~~ackk~gIKP 69 (1033) .+.-|....+++-+.++++|. .++-+||.+. +-. -+..+..+++|+.- T Consensus 14 ~i~~G~g~l~~l~~~l~~~g~~k~liVtd~~~-~~~~~~~v~~~L~~~~i~~ 64 (354) T 3ce9_A 14 ILEVGNNKIYNIGQIIKKGNFKRVSLYFGEGI-YELFGETIEKSIKSSNIEI 64 (354) T ss_dssp EEEEESSCGGGHHHHHGGGTCSEEEEEEETTH-HHHHHHHHHHHHHTTTCEE T ss_pred EEEECCCHHHHHHHHHHHHCCCEEEEEECCCH-HHHHHHHHHHHHHHCCCEE T ss_conf 68989598999999999719895999989678-8999999999998779989 No 181 >2whl_A Beta-mannanase, baman5; glycoside hydrolase, hydrolase; HET: MAN BMA; 1.40A {Bacillus agaradhaerens} PDB: 2whj_A Probab=26.56 E-value=18 Score=12.17 Aligned_cols=46 Identities=13% Similarity=0.135 Sum_probs=34.1 Q ss_pred CCHHHHHHHHHHCCCCEEEE-------ECCCHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 99899999999789988998-------11651774999999999769963899 Q 537021.9.peg.7 27 LSLNNILDKIAADQQPAIAI-------TDTNNLFSALEFSQKACAAGIQPIIG 72 (1033) Q Consensus 27 ~~~e~LV~~A~~~G~~AiAI-------TD~~nl~Gav~F~~ackk~gIKPIiG 72 (1033) ...++-++..|++|+.+|=| .|...+.-+-+++.+|.++||..|+- T Consensus 31 ~~~~~di~~~k~~G~N~VRl~~~~~~~~~~~~l~~ld~~v~~a~~~Gi~Vi~d 83 (294) T 2whl_A 31 DTASTAIPAIAEQGANTIRIVLSDGGQWEKDDIDTIREVIELAEQNKMVAVVE 83 (294) T ss_dssp GGHHHHHHHHHHTTCSEEEEEECCSSSSCCCCHHHHHHHHHHHHTTTCEEEEE T ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 67999999999769987997466887558438999999999999789999971 No 182 >1xtz_A Ribose-5-phosphate isomerase; yeast; 2.10A {Saccharomyces cerevisiae} Probab=26.45 E-value=18 Score=12.15 Aligned_cols=49 Identities=16% Similarity=0.197 Sum_probs=23.9 Q ss_pred HHHHHHCCCCCCCCCCCCCCEEEEECC-CEEEEEHHHHCCCCHHHH-HHHHH Q ss_conf 988886368503552334210035319-869850268589698999-99999 Q 537021.9.peg.7 806 CQDARQFNIQIMPPSVNTPCVDFKVGD-NRIYYSLAAIKGVGTTTA-RHIME 855 (1033) Q Consensus 806 i~e~~~~gi~vl~PdIN~S~~~f~~~~-~~Ir~Gl~~Ikgvg~~~~-~~Ive 855 (1033) -..|++.||++.+++ .....+..+++ +.|---|..|||-|.... |.|+. T Consensus 83 ~~~a~~~gi~l~~l~-~~~~iDi~iDGADEVd~~l~lIKGGGGAllrEKivA 133 (264) T 1xtz_A 83 RNLILDNKLQLGSIE-QYPRIDIAFDGADEVDENLQLIKGGGACLFQEKLVS 133 (264) T ss_dssp HHHHHHTTCEECCTT-TCCSEEEEEECCSEECTTSCEECCTTSCHHHHHHHH T ss_pred HHHHHHCCCCEECHH-HCCCCCEEEECHHHHCCCCCEEEECCHHHHHHHHHH T ss_conf 999998699560443-268520687061221557537876618877899999 No 183 >1e4i_A Beta-glucosidase; glycosyl hydrolase, family 1 glycosyl hydrolase, covalent enzyme-glycoside intermediate, alpha/beta barrel; HET: G2F NFG; 2.00A {Bacillus polymyxa} SCOP: c.1.8.4 PDB: 1tr1_A 1bgg_A* 1bga_A 1uyq_A* Probab=26.37 E-value=18 Score=12.14 Aligned_cols=17 Identities=24% Similarity=0.274 Sum_probs=8.2 Q ss_pred HHHHHHHHHHCCCEEEE Q ss_conf 99999999818989980 Q 537021.9.peg.7 198 SQVVQLAYTHELPLVAT 214 (1033) Q Consensus 198 ~~l~~lA~~~~iPlVaT 214 (1033) .+++.-..+.||..++| T Consensus 101 ~~~i~~l~~~GI~P~VT 117 (447) T 1e4i_A 101 HRVVDLLNDNGIEPFCT 117 (447) T ss_dssp HHHHHHHHHTTCEEEEE T ss_pred HHHHHHHHHCCCEEEEE T ss_conf 99999999749867886 No 184 >1d8j_A General transcription factor TFIIE-beta; winged helix-turn-helix, riken structural genomics/proteomics initiative, RSGI, structural genomics; NMR {Homo sapiens} SCOP: a.4.5.18 PDB: 1d8k_A Probab=26.34 E-value=13 Score=13.51 Aligned_cols=36 Identities=39% Similarity=0.640 Sum_probs=19.0 Q ss_pred HHCCCHHHHHHHHHHHHHCCCCCCCCCCHH-HHHHHCCCC Q ss_conf 411411342003688850465785026966-733410335 Q 537021.9.peg.7 648 SLYRPGPIDNIVVYNNRKNGKEKIVSIHPL-IDPMLKETQ 686 (1033) Q Consensus 648 al~RPGp~~~~~~yi~rk~g~e~~~~~hp~-l~~iL~~T~ 686 (1033) +-+|=|.+.-|..|.+.||-.- ..||. |++||.+|. T Consensus 6 s~~kFgvLaKIV~~MK~RHq~g---~~hPLtLeEiLDEtn 42 (81) T 1d8j_A 6 SGYKFGVLAKIVNYMKTRHQRG---DTHPLTLDEILDETQ 42 (81) T ss_dssp SCTTHHHHHHHHHHHHHHHHHT---CCSCBCHHHHHHHHT T ss_pred CCCCCHHHHHHHHHHHHHHHCC---CCCCCCHHHHHHHHH T ss_conf 6630169999999999887158---988745999998886 No 185 >3ofp_M 30S ribosomal protein S13; protein biosynthesis, ribosomes, RNA, tRNA, transfer, eryThr ketolide, macrolide, antibiotic, EXIT, peptidyl; 3.10A {Escherichia coli} PDB: 3fih_M* 2wwl_M 2qal_M* 1p6g_M 1p87_M 2aw7_M 2avy_M 2i2u_M 2i2p_M* 2qan_M* 2qb9_M* 2qbb_M* 2qbd_M 2qbf_M 2qbh_M* 2qbj_M* 2qou_M* 2qow_M* 2qoy_M* 2qp0_M* ... Probab=26.26 E-value=3.8 Score=17.96 Aligned_cols=26 Identities=27% Similarity=0.450 Sum_probs=22.2 Q ss_pred ECCCEEEEEHHHHCCCCHHHHHHHHH Q ss_conf 31986985026858969899999999 Q 537021.9.peg.7 830 VGDNRIYYSLAAIKGVGTTTARHIME 855 (1033) Q Consensus 830 ~~~~~Ir~Gl~~Ikgvg~~~~~~Ive 855 (1033) +++..|.++|..|.|||...+..|.+ T Consensus 9 p~~K~v~~ALt~I~GIG~~~A~~Ic~ 34 (113) T 3ofp_M 9 PDHKHAVIALTSIYGVGKTRSKAILA 34 (113) T ss_dssp CCCCCHHHHHTSSSSCCSSHHHHGGG T ss_pred CCCCEEEEEEECEECCCHHHHHHHHH T ss_conf 99978665501434748999999999 No 186 >1s7j_A Phenazine biosynthesis protein PHZF family; bacteria, structural genomics, PSI, protein structure initiative; 2.30A {Enterococcus faecalis V583} SCOP: d.21.1.2 Probab=26.11 E-value=17 Score=12.33 Aligned_cols=24 Identities=25% Similarity=0.205 Sum_probs=17.9 Q ss_pred HCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 104764334444204667765544 Q 537021.9.peg.7 368 KKNNIPVGPGRGSGAGSVTAYALT 391 (1033) Q Consensus 368 k~~gI~vGpGRGSAagSLVaY~Lg 391 (1033) -..||.==|+=|||+|+|.+|+-. T Consensus 193 P~~Gi~EDpaTGSa~~~La~yl~~ 216 (262) T 1s7j_A 193 PKLRINEDPVCGSAHANLIPYWGK 216 (262) T ss_dssp GGGTEEEESCCSTTHHHHHHHHHH T ss_pred CCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 457998856523667879999986 No 187 >1now_A Beta-hexosaminidase beta chain; (beta/alpha)8-barrel, homodimer, family 20 glycosidase, hydrolase; HET: NAG IFG; 2.20A {Homo sapiens} SCOP: c.1.8.6 d.92.2.1 PDB: 1nou_A* 1np0_A* 2gjx_B* 1o7a_A* 2gk1_B* Probab=26.07 E-value=18 Score=12.10 Aligned_cols=24 Identities=33% Similarity=0.470 Sum_probs=18.2 Q ss_pred HHHHHHHHHCCCCCC-----CCCCCCHHH Q ss_conf 899999851047643-----344442046 Q 537021.9.peg.7 360 VADFIQWAKKNNIPV-----GPGRGSGAG 383 (1033) Q Consensus 360 V~Div~~ak~~gI~v-----GpGRGSAag 383 (1033) +.|||+||+++||.| .||--.|+. T Consensus 222 i~eiv~yA~~rgI~vIPEiD~PGH~~a~~ 250 (507) T 1now_A 222 VRMVIEYARLRGIRVLPEFDTPGHTLSWG 250 (507) T ss_dssp HHHHHHHHHHTTCEEEEEEEESSSCTTHH T ss_pred HHHHHHHHHHCCCEEEECCCCHHHHHHHH T ss_conf 99999999974967974465412899999 No 188 >1qam_A ERMC' methyltransferase; rRNA methyltransferase ERMC', cofactor analogs; 2.20A {Bacillus subtilis} SCOP: c.66.1.24 PDB: 1qan_A* 1qao_A* 1qaq_A* 2erc_A Probab=25.97 E-value=15 Score=12.89 Aligned_cols=41 Identities=17% Similarity=0.225 Sum_probs=21.0 Q ss_pred CCCCCCCCCHHHHHHHHHHHCCC--CCHHHCCCHHHHHHHHCC Q ss_conf 64334444204667765544037--563332310567621001 Q 537021.9.peg.7 372 IPVGPGRGSGAGSVTAYALTITD--IDPLRFSLLFERFLNPDR 412 (1033) Q Consensus 372 I~vGpGRGSAagSLVaY~LgIT~--VDPi~~~LlFERFLnp~R 412 (1033) +-+|||.|+--..|...+-.+|. +||-=...|=+||.+.++ T Consensus 35 lEIGpG~G~LT~~L~~~~~~v~avEiD~~l~~~L~~~~~~~~n 77 (244) T 1qam_A 35 FEIGSGKGHFTLELVQRCNFVTAIEIDHKLCKTTENKLVDHDN 77 (244) T ss_dssp EEECCTTSHHHHHHHHHSSEEEEECSCHHHHHHHHHHTTTCCS T ss_pred EEECCCCCHHHHHHHHCCCCEEEEEECHHHHHHHHHHHCCCCC T ss_conf 9973887199999985558306998530177999974144776 No 189 >1rd5_A Tryptophan synthase alpha chain, chloroplast; hydroxamic acid, diboa, dimboa, indole, indole-glycerol- phosphate, lyase; 2.02A {Zea mays} SCOP: c.1.2.4 PDB: 1tjr_A Probab=25.88 E-value=19 Score=12.07 Aligned_cols=39 Identities=18% Similarity=0.270 Sum_probs=19.7 Q ss_pred CHHHHHHHHHHCCCCEEEEE-----CCCHHHHHHHHHHHHHHCCCC Q ss_conf 98999999997899889981-----165177499999999976996 Q 537021.9.peg.7 28 SLNNILDKIAADQQPAIAIT-----DTNNLFSALEFSQKACAAGIQ 68 (1033) Q Consensus 28 ~~e~LV~~A~~~G~~AiAIT-----D~~nl~Gav~F~~ackk~gIK 68 (1033) ++++..+.+++.|-+|+ ++ |. +.-...++.++..+.|.- T Consensus 4 ~i~~~f~~~k~~~~~al-i~yi~aG~P-~~~~~~~~l~~l~~~G~D 47 (262) T 1rd5_A 4 PVSDTMAALMAKGKTAF-IPYITAGDP-DLATTAEALRLLDGCGAD 47 (262) T ss_dssp CHHHHHHHHHHTTCCEE-EEEEETTSS-CHHHHHHHHHHHHHTTCS T ss_pred HHHHHHHHHHHCCCCEE-EEEEECCCC-CHHHHHHHHHHHHHCCCC T ss_conf 89999999997599679-999727789-978999999999975999 No 190 >1ls9_A Cytochrome C6; omega loop, antiparallel beta-sheet, protoporphyrin IX containing Fe, heme, HAEM, electron transport; HET: HEM; 1.30A {Cladophora glomerata} SCOP: a.3.1.1 Probab=25.83 E-value=19 Score=12.07 Aligned_cols=34 Identities=21% Similarity=0.316 Sum_probs=25.7 Q ss_pred HHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 2100145787533365687899999998303802 Q 537021.9.peg.7 408 LNPDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHE 441 (1033) Q Consensus 408 Lnp~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~ 441 (1033) +...+..||-..--++++.+..|+.||.++++.+ T Consensus 56 ~~~g~~~Mp~f~~~Lsd~ei~~l~~Yi~~~~~~~ 89 (91) T 1ls9_A 56 VNNGKGAMPAWADRLDEDDIEAVSNYVYDQAVNS 89 (91) T ss_dssp HHHCBTTBCCCTTTSCHHHHHHHHHHHHHHHHTT T ss_pred HCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 8068689965334799999999999999985699 No 191 >2p5v_A Transcriptional regulator, LRP/ASNC family; NMB0573, structural genomics; 1.99A {Neisseria meningitidis MC58} PDB: 2p6s_A 2p6t_A Probab=25.78 E-value=19 Score=12.06 Aligned_cols=47 Identities=17% Similarity=0.314 Sum_probs=32.1 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHH-HHHHHHC Q ss_conf 57875333656878999999983038021554220778778899999997527998898-8654310 Q 537021.9.peg.7 414 SMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQV-DRLCKLI 479 (1033) Q Consensus 414 ~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~-~~~~~~i 479 (1033) -||.+.+|=-| .++++.|.+ -+|..++++|+.+|++...+ +++.++. T Consensus 2 pm~~~~LD~~D---~~Il~~L~~----------------d~R~s~~eiA~~~gls~~tv~~Ri~rLe 49 (162) T 2p5v_A 2 PMPQLTLDKTD---IKILQVLQE----------------NGRLTNVELSERVALSPSPCLRRLKQLE 49 (162) T ss_dssp ---CCCCCHHH---HHHHHHHHH----------------CTTCCHHHHHHHHTSCHHHHHHHHHHHH T ss_pred CCCCCCCCHHH---HHHHHHHHH----------------CCCCCHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 68989808999---999999998----------------3799999999998929999999999998 No 192 >1rfm_A L-sulfolactate dehydrogenase; methanogens, coenzyme M, hyperthermostable, Pro-S hydrogen transfer; HET: NAD; 2.50A {Methanocaldococcus jannaschii} SCOP: c.122.1.1 Probab=25.60 E-value=19 Score=12.04 Aligned_cols=21 Identities=24% Similarity=0.240 Sum_probs=8.8 Q ss_pred HHHHHHHHCCCEEEECCCCCC Q ss_conf 999999818989980883007 Q 537021.9.peg.7 200 VVQLAYTHELPLVATNNSLFL 220 (1033) Q Consensus 200 l~~lA~~~~iPlVaTndv~Y~ 220 (1033) +++.|+++|+-+|+..+++.. T Consensus 97 aiekA~~~Gi~~v~v~ns~H~ 117 (344) T 1rfm_A 97 AIKKAKNVGVGVVATRNANHF 117 (344) T ss_dssp HHHHHHHHSEEEEEEEEECCC T ss_pred HHHHHHHCCCEEEEECCCCCC T ss_conf 999997447559985456777 No 193 >3eef_A N-carbamoylsarcosine amidase related protein; structural genomics, PSI-2, protein structure initiative; 2.35A {Thermoplasma acidophilum} Probab=25.57 E-value=19 Score=12.03 Aligned_cols=27 Identities=22% Similarity=0.396 Sum_probs=14.8 Q ss_pred HHHHHHHHHHCCCEEEECCCCCCCHHH Q ss_conf 999999998189899808830078788 Q 537021.9.peg.7 198 SQVVQLAYTHELPLVATNNSLFLSEED 224 (1033) Q Consensus 198 ~~l~~lA~~~~iPlVaTndv~Y~~~~D 224 (1033) +++++.+++.++|+|.|.+.|+.+..+ T Consensus 32 ~~li~~ar~~g~pVi~~~~~~~~~~~~ 58 (182) T 3eef_A 32 RKVIETFRRSGLPVVYVNDSHYPDDPE 58 (182) T ss_dssp HHHHHHHHHTTCCEEEEEECBCTTSTT T ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCC T ss_conf 999999998299689982465677864 No 194 >1vi6_A 30S ribosomal protein S2P; structural genomics, ribosome; 1.95A {Archaeoglobus fulgidus} SCOP: c.23.15.1 PDB: 1vi5_A Probab=25.51 E-value=19 Score=12.02 Aligned_cols=15 Identities=7% Similarity=-0.066 Sum_probs=6.9 Q ss_pred CCHHHHHHHHHHCCC Q ss_conf 998999999997899 Q 537021.9.peg.7 27 LSLNNILDKIAADQQ 41 (1033) Q Consensus 27 ~~~e~LV~~A~~~G~ 41 (1033) +++++|++..+..|. T Consensus 11 v~~~~ll~agvH~G~ 25 (208) T 1vi6_A 11 VPPDDYLAAGVHIGT 25 (208) T ss_dssp SCHHHHHHHTTTBCC T ss_pred CCHHHHHHHCEEECC T ss_conf 249999973924787 No 195 >3geb_A EYES absent homolog 2; hydrolase, activator, alternative splicing, cytoplasm, developmental protein, magnesium, nucleus, polymorphism; 2.40A {Homo sapiens} PDB: 3hb0_A 3hb1_A Probab=25.51 E-value=15 Score=12.80 Aligned_cols=107 Identities=12% Similarity=0.210 Sum_probs=55.5 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHH-----HHHHHHHHCCCCCCCCCCCCHHHHHH-HHHHHCCCCCHHHCCCHHHHHHH Q ss_conf 9999988778989860630557888-----99999851047643344442046677-65544037563332310567621 Q 537021.9.peg.7 336 DYRQRLDFELNVIARMKFSGYFLIV-----ADFIQWAKKNNIPVGPGRGSGAGSVT-AYALTITDIDPLRFSLLFERFLN 409 (1033) Q Consensus 336 ~y~~RLe~EL~vI~~~gf~~YFLiV-----~Div~~ak~~gI~vGpGRGSAagSLV-aY~LgIT~VDPi~~~LlFERFLn 409 (1033) +.+..|.-|++...+. .|.. ..+..-.+.-+++| --|+-.-||+ |-+-|.-.+=||++ -.+ T Consensus 144 ~~w~~L~~e~e~~Td~-----Wls~A~k~L~~i~~rs~~vNVLV--TstqLVpaLaK~LLygL~~~fpiEN------IYS 210 (274) T 3geb_A 144 ETWLQLRAELEALTDL-----WLTHSLKALNLINSRPNCVNVLV--TTTQLIPALAKVLLYGLGSVFPIEN------IYS 210 (274) T ss_dssp HHHHHHHHHHHHHTTS-----HHHHHHHHHHHHHHSTTEEEEEE--ESSCHHHHHHHHHHTTCTTTSCGGG------EEE T ss_pred HHHHHHHHHHHHHCCC-----HHHHHHHHHHHHHCCCCEEEEEE--ECCCCHHHHHHHHHHCCCCCCCCCC------CCC T ss_conf 9999999989865451-----79999999997743467579998--6784489999999862567565134------335 Q ss_pred HCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHH Q ss_conf 001457875333656878999999983038021554220778778899999997527998898 Q 537021.9.peg.7 410 PDRMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQV 472 (1033) Q Consensus 410 p~R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~ 472 (1033) .. .-+.+..++++.+|||+.....++.=+ .-=.++||.+++|+..+ T Consensus 211 a~------------kiGKesCFerI~~RFG~k~tyvvIGDG-----~dEe~AAk~l~~PFwrI 256 (274) T 3geb_A 211 AT------------KTGKESCFERIMQRFGRKAVYVVIGDG-----VEEEQGAKKHNMPFWRI 256 (274) T ss_dssp TT------------TTCHHHHHHHHHHHHCTTSEEEEEESS-----HHHHHHHHHTTCCEEEC T ss_pred HH------------HCCHHHHHHHHHHHHCCCCEEEEECCC-----HHHHHHHHHCCCCEEEE T ss_conf 31------------056899999999986898569997897-----67999999629985880 No 196 >2gtv_X CM, chorismate mutase; four-helix bundle, isomerase; HET: TSA; NMR {Methanocaldococcus jannaschii} SCOP: a.130.1.3 Probab=25.33 E-value=19 Score=12.00 Aligned_cols=49 Identities=14% Similarity=0.259 Sum_probs=33.7 Q ss_pred CCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 101677878999998624999999999999999750001331014788899999999997302 Q 537021.9.peg.7 722 KEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHAAAYAVISYQTAWMKTHYP 784 (1033) Q Consensus 722 ~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa~aya~~~y~~aylK~~yP 784 (1033) ++--+++-++..+-|.+.|++++.++.||..|-..+ ..+|..+++.|++ T Consensus 55 ~~RE~~vl~~~~~~a~~~~ld~~~i~~if~~Ii~~s--------------~~~Q~~~l~~~~~ 103 (104) T 2gtv_X 55 PEREKYIYDRIRKLCKEHNVDENIGIKIFQRLIEHN--------------KALQKQYLEETLE 103 (104) T ss_dssp HHHHHHHHHHHHHHHHHHTSCSHHHHHHHHHHHHHH--------------HHHHHHHHHHHHH T ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH--------------HHHHHHHHHCCCC T ss_conf 899999999999986657969999999999999999--------------9999998857157 No 197 >1hx0_A Alpha amylase (PPA); inhibitor, carbohydrate, pancreas, hydrolase; HET: GLC AC1 BGC MAL; 1.38A {Sus scrofa} SCOP: b.71.1.1 c.1.8.1 PDB: 1wo2_A* 1ua3_A* 1kxq_A 1kxt_A 1kxv_A 1jfh_A* 1vah_A* 1ppi_A* 3l2m_A* 3l2l_A* 1dhk_A* 1ose_A* 1pig_A* 1pif_A* 1bvn_P 3ij8_A* 1cpu_A* 1bsi_A 1u2y_A* 1u30_A* ... Probab=25.04 E-value=16 Score=12.68 Aligned_cols=17 Identities=6% Similarity=-0.015 Sum_probs=11.3 Q ss_pred HHHHHHHHHHHCCCEEE Q ss_conf 99999999981898998 Q 537021.9.peg.7 197 ESQVVQLAYTHELPLVA 213 (1033) Q Consensus 197 ~~~l~~lA~~~~iPlVa 213 (1033) .++|++-|.+.||.+|. T Consensus 79 fk~LV~~aH~~GI~VIl 95 (496) T 1hx0_A 79 FRDMVTRCNNVGVRIYV 95 (496) T ss_dssp HHHHHHHHHHTTCEEEE T ss_pred HHHHHHHHHHCCCEEEE T ss_conf 99999999987998998 No 198 >2bgw_A XPF endonuclease; hydrolase, structure specific endonuclease, nucleotide excision repair; 2.8A {Aeropyrum pernix} SCOP: a.60.2.5 c.52.1.20 PDB: 2bhn_A Probab=24.84 E-value=19 Score=11.93 Aligned_cols=27 Identities=44% Similarity=0.715 Sum_probs=18.5 Q ss_pred EHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHH Q ss_conf 02685896989999999998379999978999 Q 537021.9.peg.7 838 SLAAIKGVGTTTARHIMEASADKPFDSLEDFC 869 (1033) Q Consensus 838 Gl~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~ 869 (1033) =|.+|+|||...+..|++. |.|+.+++ T Consensus 163 ~L~~Ipgi~~~~A~~Ll~~-----f~Sl~~l~ 189 (219) T 2bgw_A 163 ILQSFPGIGRRTAERILER-----FGSLERFF 189 (219) T ss_dssp HHHTSTTCCHHHHHHHHHH-----HSSHHHHT T ss_pred HHHCCCCCCHHHHHHHHHH-----CCCHHHHH T ss_conf 9840799899999999998-----69999998 No 199 >3idw_A Actin cytoskeleton-regulatory complex protein SLA1; clathrin adaptor, endocytosis, SAM domain, yeast, actin- binding, cell membrane; 1.85A {Saccharomyces cerevisiae} Probab=24.68 E-value=19 Score=11.91 Aligned_cols=16 Identities=25% Similarity=0.623 Sum_probs=9.2 Q ss_pred HHHHHHHHHHCCCCCE Q ss_conf 9999999830380215 Q 537021.9.peg.7 428 DEVIRYVQNKYGHERV 443 (1033) Q Consensus 428 ~~vi~y~~~kyG~~~v 443 (1033) -.|..|+.++||++.+ T Consensus 55 I~V~k~l~~k~~r~~~ 70 (72) T 3idw_A 55 VRVMKHLDKKFGRENI 70 (72) T ss_dssp HHHHHHHHHHTTC--- T ss_pred HHHHHHHHHHHCCCCC T ss_conf 9999999998643203 No 200 >2af7_A Gamma-carboxymuconolactone decarboxylase; O26336_metth, NESG, TT747, structural genomics, PSI, protein structure initiative; 2.81A {Methanothermobacterthermautotrophicus} SCOP: a.152.1.2 Probab=24.60 E-value=20 Score=11.90 Aligned_cols=33 Identities=24% Similarity=0.380 Sum_probs=26.0 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCH Q ss_conf 99999862499999999999999975000133101 Q 537021.9.peg.7 731 RFISGASKNGISKTIAVNIFELLAKFADYGFNKSH 765 (1033) Q Consensus 731 ~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksH 765 (1033) .-+.+|.++|.+++...++.-++..++| |+++- T Consensus 75 ~H~~~al~~G~T~eEi~E~i~~~~~y~G--~p~~~ 107 (125) T 2af7_A 75 SHVRGALNAGCSKDEIIEVMIQMAVYAG--FPAAI 107 (125) T ss_dssp HHHHHHHHTTCCHHHHHHHHHHHHHHTC--HHHHH T ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHCC--HHHHH T ss_conf 9999998779999999999999999856--99999 No 201 >1nnx_A Protein YGIW; structural genomics, hypothetical protein, OB-fold, structure 2 function project, S2F, unknown function; 1.45A {Escherichia coli} SCOP: b.40.10.1 Probab=24.53 E-value=20 Score=11.89 Aligned_cols=46 Identities=9% Similarity=0.137 Sum_probs=36.4 Q ss_pred CCCCCHHHHHCCCCCCEEEEEEEEEEEEEECCCCCCEEEEEEEECCCCEEEEEE Q ss_conf 489768898505589889999999611210158887699999984970499995 Q 537021.9.peg.7 978 MGIKGYEESVATIRGNNIRLAAMVVSKQQKKTRKGSRIGWVTFSEPAKEYEANF 1031 (1033) Q Consensus 978 ~~~~~~~~~~~~~~g~~v~iaG~V~~i~~~~TKkG~~mafltleD~~g~~e~~i 1031 (1033) ....+..+..+..+++.|.+-|.|++.- .++.| .|.|.||.+.+-| T Consensus 23 ~~~~TV~~a~~~~Dd~~V~L~G~Iv~~l-----~~d~Y---~F~D~TG~I~VeI 68 (109) T 1nnx_A 23 GSVTTVESAKSLRDDTWVTLRGNIVERI-----SDDLY---VFKDASGTINVDI 68 (109) T ss_dssp CCCCCHHHHTTSCSSEEEEEEEEEEEEE-----ETTEE---EEEETTEEEEEEC T ss_pred CCEEEHHHHHHCCCCCEEEEEEEEEEEE-----CCCEE---EEECCCCCEEEEE T ss_conf 6579399775373898599988999784-----89438---9988997599997 No 202 >1jeo_A MJ1247, hypothetical protein MJ1247; RUMP pathway, phosphosugar, 3-hexulose-6-phosphate isomerase, PHI, structural genomics; HET: CME CIT; 2.00A {Methanocaldococcus jannaschii} SCOP: c.80.1.3 Probab=24.52 E-value=20 Score=11.89 Aligned_cols=34 Identities=15% Similarity=0.089 Sum_probs=16.1 Q ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEEC Q ss_conf 868999943991158999999999998189899808 Q 537021.9.peg.7 180 DRLYVNLQRHRGYDRYRESQVVQLAYTHELPLVATN 215 (1033) Q Consensus 180 d~~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTn 215 (1033) +++.+-+...+... ..-.+++.|++.|.|+|+.. T Consensus 83 ~Dv~i~iS~sG~t~--~~i~~~~~ak~~g~~vI~IT 116 (180) T 1jeo_A 83 DDLLILISGSGRTE--SVLTVAKKAKNINNNIIAIV 116 (180) T ss_dssp TCEEEEEESSSCCH--HHHHHHHHHHTTCSCEEEEE T ss_pred CCEEEEECCCCCCH--HHHHHHHHHHHCCCCEEEEE T ss_conf 99899975999808--99999999997599799996 No 203 >1wbh_A KHG/KDPG aldolase; lyase; 1.55A {Escherichia coli} SCOP: c.1.10.1 PDB: 2c0a_A 1wau_A 1eua_A 1eun_A 1fq0_A* 1fwr_A* Probab=24.32 E-value=20 Score=11.86 Aligned_cols=38 Identities=13% Similarity=0.066 Sum_probs=17.1 Q ss_pred CCCCCCCCCHHHHHHHCCCCEEEECCCCC-CHHHHHHHC Q ss_conf 87787400789998634898899819767-967888877 Q 537021.9.peg.7 124 QGKQSVRICLSWLQEIGTEGLIMLTGGSS-GPIDRAFFF 161 (1033) Q Consensus 124 ~~~~~p~i~~~~L~e~~~egLIvlsg~~~-g~i~~~l~~ 161 (1033) .|--.|..+.+.+..-+..|+.++.||.. +++.+++.. T Consensus 90 ~FivsP~~~~~v~~~a~~~~i~~iPGv~TpsEi~~A~~~ 128 (214) T 1wbh_A 90 QFAISPGLTEPLLKAATEGTIPLIPGISTVSELMLGMDY 128 (214) T ss_dssp SCEEESSCCHHHHHHHHHSSSCEEEEESSHHHHHHHHHT T ss_pred CEEECCCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHC T ss_conf 899858999999999985499753784988999999985 No 204 >1z00_B DNA repair endonuclease XPF; helix-hairpin-helix, hydrolase; HET: DNA; NMR {Homo sapiens} SCOP: a.60.2.5 PDB: 2aq0_A* Probab=24.21 E-value=20 Score=11.84 Aligned_cols=26 Identities=19% Similarity=0.473 Sum_probs=17.2 Q ss_pred HHHHCCCCHHHHHHHHHHHHCCCCCCHHHHH Q ss_conf 2685896989999999998379999978999 Q 537021.9.peg.7 839 LAAIKGVGTTTARHIMEASADKPFDSLEDFC 869 (1033) Q Consensus 839 l~~Ikgvg~~~~~~Ive~r~~g~f~sl~df~ 869 (1033) |..|+|||.+.+..|+. .|.|+.++. T Consensus 20 L~~iPGIg~k~~~~Ll~-----~f~sl~~i~ 45 (84) T 1z00_B 20 LLKMPGVNAKNCRSLMH-----HVKNIAELA 45 (84) T ss_dssp HHTCSSCCHHHHHHHHH-----HSSCHHHHH T ss_pred HHCCCCCCHHHHHHHHH-----HCCCHHHHH T ss_conf 98389999999999999-----966999998 No 205 >2osx_A Endoglycoceramidase II; (alpha/beta)8 (TIM) barrel, hydrolase; HET: SIA GAL BGC 16C; 1.10A {Rhodococcus SP} PDB: 2oyk_A* 2osw_A* 2oyl_A* 2oym_A* 2osy_A* Probab=24.21 E-value=20 Score=11.84 Aligned_cols=20 Identities=25% Similarity=0.418 Sum_probs=13.1 Q ss_pred HHHHHHHHHHHHHHCCCCCC Q ss_conf 55788899999851047643 Q 537021.9.peg.7 355 GYFLIVADFIQWAKKNNIPV 374 (1033) Q Consensus 355 ~YFLiV~Div~~ak~~gI~v 374 (1033) .|+=-+.++|++|+++||.| T Consensus 102 ~~l~~l~~~v~~a~~~Gi~v 121 (481) T 2osx_A 102 QYLDRVEDRVGWYAERGYKV 121 (481) T ss_dssp HHHHHHHHHHHHHHHTTCEE T ss_pred HHHHHHHHHHHHHHHCCCEE T ss_conf 99999999999999779989 No 206 >3kip_A 3-dehydroquinase, type II; lyase; 2.95A {Candida albicans} Probab=24.19 E-value=13 Score=13.30 Aligned_cols=29 Identities=14% Similarity=0.227 Sum_probs=10.9 Q ss_pred HHHCCCEEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 981898998088300787889999876688 Q 537021.9.peg.7 205 YTHELPLVATNNSLFLSEEDYEAHDVLMAV 234 (1033) Q Consensus 205 ~~~~iPlVaTndv~Y~~~~D~~~h~iL~aI 234 (1033) ......-|..|..-|.+-+ ..++|+|.++ T Consensus 80 ~~~~~~gIIiNpga~THtS-iai~DAL~~~ 108 (167) T 3kip_A 80 KRQGVGFVVINAGAYTHTS-VGIRDALLGT 108 (167) T ss_dssp HHTTCCEEEEECGGGGGTC-HHHHHHHHHT T ss_pred HHCCCEEEEECCCHHEEEH-HHHHHHHHHC T ss_conf 3458248983541011201-1169899854 No 207 >2bkm_A Truncated hemoglobin from geobacillus stearothermophilus; hypothetical protein, oxygen transport, transport, oxygen storage; HET: HEM; 1.5A {Geobacillus stearothermophilus} Probab=24.04 E-value=20 Score=11.82 Aligned_cols=43 Identities=12% Similarity=0.058 Sum_probs=32.3 Q ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHH Q ss_conf 1677878999998624999999999999999750001331014 Q 537021.9.peg.7 724 EMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSHA 766 (1033) Q Consensus 724 ~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksHa 766 (1033) ..+.+-..|.+-..+.|++++.++++++.++..|...=|++-. T Consensus 84 ~f~~~l~~~~~al~e~~~~~e~~~~~~~~~~~~A~~m~N~~d~ 126 (128) T 2bkm_A 84 RAEAWLACMRAAMDEIGLSGPAREQFYHRLVLTAHHMVNTPDH 126 (128) T ss_dssp HHHHHHHHHHHHHHHTTCCSHHHHHHHHHHHHHHHHHCCSCCC T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 9999999999999982999999999999999999986289888 No 208 >2qde_A Mandelate racemase/muconate lactonizing enzyme family protein; PSI-II, NYSGXRC, enolase, structural genomics, protein structure initiative, PSI-2; 1.93A {Azoarcus SP} Probab=23.93 E-value=20 Score=11.80 Aligned_cols=18 Identities=28% Similarity=0.457 Sum_probs=11.7 Q ss_pred HHHHHHHHHCCCCCCCCC Q ss_conf 899999851047643344 Q 537021.9.peg.7 360 VADFIQWAKKNNIPVGPG 377 (1033) Q Consensus 360 V~Div~~ak~~gI~vGpG 377 (1033) ...++.+|++.|+.+-+| T Consensus 280 ~~~i~~~A~~~gi~~~~~ 297 (397) T 2qde_A 280 AQRWLTLARLANLPVICG 297 (397) T ss_dssp HHHHHHHHHHHTCCEEEC T ss_pred HHHHHHHHHHCCCCEEEC T ss_conf 999999999869988865 No 209 >1jms_A Terminal deoxynucleotidyltransferase; polymerase; 2.36A {Mus musculus} SCOP: a.60.6.1 a.60.12.1 d.218.1.2 PDB: 1kdh_A* 1kej_A* Probab=23.82 E-value=20 Score=11.79 Aligned_cols=28 Identities=7% Similarity=0.196 Sum_probs=15.4 Q ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHC Q ss_conf 85104764334444204667765544037563332 Q 537021.9.peg.7 366 WAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRF 400 (1033) Q Consensus 366 ~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~ 400 (1033) +++=.|| ||-+ |-.+ |-+||+.++=|+. T Consensus 123 l~~I~Gi--Gpk~---a~~l--~~~Gi~tl~dL~~ 150 (381) T 1jms_A 123 FTSVFGV--GLKT---AEKW--FRMGFRTLSKIQS 150 (381) T ss_dssp HHTSTTC--CHHH---HHHH--HHTTCCSHHHHHH T ss_pred HHHCCCC--CHHH---HHHH--HHCCCCCHHHHHH T ss_conf 8737787--8999---9999--9869777999854 No 210 >1g94_A Alpha-amylase; beta-alpha-8-barrel, 3 domain structure, hydrolase; HET: DAF GLC; 1.74A {Pseudoalteromonas haloplanktis} SCOP: b.71.1.1 c.1.8.1 PDB: 1g9h_A* 1l0p_A 1aqm_A* 1aqh_A* 1b0i_A 1jd7_A 1jd9_A 1kxh_A* Probab=23.75 E-value=19 Score=12.09 Aligned_cols=46 Identities=11% Similarity=0.225 Sum_probs=27.3 Q ss_pred HHHHHHHHHCCCCEEEEEC---C---------------------CHHHHHHHHHHHHHHCCCCEEEEEEE Q ss_conf 9999999978998899811---6---------------------51774999999999769963899999 Q 537021.9.peg.7 30 NNILDKIAADQQPAIAITD---T---------------------NNLFSALEFSQKACAAGIQPIIGCQL 75 (1033) Q Consensus 30 e~LV~~A~~~G~~AiAITD---~---------------------~nl~Gav~F~~ackk~gIKPIiG~E~ 75 (1033) ++|++..+++|+.||=|+= | |+.-..-+|.++|.+.||+.|+=+-+ T Consensus 18 ~~~~~yl~~~G~t~I~l~Pv~e~~~~~~wy~gY~~~~Y~i~~r~Gt~~dlk~LV~~aH~~GI~VilD~V~ 87 (448) T 1g94_A 18 QECEQYLGPKGYAAVQVSPPNEHITGSQWWTRYQPVSYELQSRGGNRAQFIDMVNRCSAAGVDIYVDTLI 87 (448) T ss_dssp HHHHHTHHHHTCCEEEECCCSCBBCSSSGGGGGSBSCSCSCBTTBCHHHHHHHHHHHHHTTCEEEEEEEC T ss_pred HHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCEECCCCCCHHHHHHHHHHHHHCCCEEEEEECC T ss_conf 9999999970998899694836899998744688877436899999999999999999879999999857 No 211 >1kko_A 3-methylaspartate ammonia-lyase; enolase superfamily, TIM barrel; 1.33A {Citrobacter amalonaticus} SCOP: c.1.11.2 d.54.1.1 PDB: 1kkr_A* Probab=23.71 E-value=20 Score=11.77 Aligned_cols=23 Identities=9% Similarity=0.182 Sum_probs=8.5 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCC Q ss_conf 56541789999999988740254 Q 537021.9.peg.7 573 FLGLKTLTLLQKSLDFLAQRGVK 595 (1033) Q Consensus 573 ~Lgl~~L~~i~~~~~~i~~~~~~ 595 (1033) +--+.+++-..++.++-++.|.. T Consensus 332 ~~~~GGitea~~~~~~a~~~Gi~ 354 (413) T 1kko_A 332 TPDLGGIHNIVDAVLYCNKHGME 354 (413) T ss_dssp GGGGSSTHHHHHHHHHHHHHTCE T ss_pred HHHCCCHHHHHHHHHHHHHCCCC T ss_conf 03238999999999999986993 No 212 >3e2v_A 3'-5'-exonuclease; structural genomics, hydrolase, PSI-2, protein structure initiative; 1.50A {Saccharomyces cerevisiae YJM789} Probab=23.62 E-value=20 Score=11.76 Aligned_cols=65 Identities=8% Similarity=0.206 Sum_probs=48.4 Q ss_pred CCCCCCCCHHHHHHCHHHHHHH--------CCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHC-CCCE Q ss_conf 5668988133433144476443--------2998999999997899889981165177499999999976-9963 Q 537021.9.peg.7 4 YLEKSPSFVHLQVHSSYSLLEG--------ALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAA-GIQP 69 (1033) Q Consensus 4 ~~~~~~~FvhLh~hS~ySlldg--------~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~-gIKP 69 (1033) -+.+...|+--|||=..+.-+| -...++++++|++.|...+-++-. ++...-+..+.|+++ ++.| T Consensus 7 ~~~~~~~~iD~~~~L~d~~F~~~y~~~~~hd~Dl~~Vl~rA~~~GV~~ii~~g~-~~~~~~~a~~La~~~~~~~p 80 (401) T 3e2v_A 7 ATDSPLKYYDIGLNLTDPMFHGIYNGKQYHPADYVKLLERAAQRHVKNALVTGS-SIAESQSAIELVSSVKDLSP 80 (401) T ss_dssp ----CCCEEEEEECTTSGGGGTEETTEECSCCCHHHHHHHHHHTTEEEEEECCC-SHHHHHHHHHHHHHHTTTCS T ss_pred CCCCCCCEEEEEECCCCHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECC-CHHHHHHHHHHHHHCCCCCC T ss_conf 888986138973478995766844676677137999999999879998999159-99999999999996623577 No 213 >1i7w_B E-cadherin, epithelial-cadherin; cell adhesion, beta-catenin, protein-protein complex, extended interface, armadillo repeat, phosphoserin; HET: SEP; 2.00A {Mus musculus} SCOP: j.71.1.1 PDB: 1i7x_B 3ifq_C* Probab=23.53 E-value=13 Score=13.51 Aligned_cols=13 Identities=38% Similarity=0.601 Sum_probs=8.1 Q ss_pred CCCCCHHHHHHHH Q ss_conf 4444204667765 Q 537021.9.peg.7 376 PGRGSGAGSVTAY 388 (1033) Q Consensus 376 pGRGSAagSLVaY 388 (1033) .|.||.||||-|- T Consensus 105 EG~gS~agSLSsL 117 (151) T 1i7w_B 105 EGSGSEAASLSSL 117 (151) T ss_dssp CCCCCCCTTCCCC T ss_pred CCCCCCCCCCCCC T ss_conf 7888746771001 No 214 >1wr1_B Ubiquitin-like protein DSK2; UBA domain, UBA-ubiquitin complex, signaling protein; NMR {Saccharomyces cerevisiae} SCOP: a.5.2.1 Probab=23.50 E-value=20 Score=11.74 Aligned_cols=22 Identities=18% Similarity=0.288 Sum_probs=12.7 Q ss_pred HHHHHHHHHHHHHHHCCHHHHH Q ss_conf 9999887789898606305578 Q 537021.9.peg.7 337 YRQRLDFELNVIARMKFSGYFL 358 (1033) Q Consensus 337 y~~RLe~EL~vI~~~gf~~YFL 358 (1033) -.+|.+.+|....+|||.+.=. T Consensus 13 pee~y~~qL~~L~~MGF~d~~~ 34 (58) T 1wr1_B 13 PEERYEHQLRQLNDMGFFDFDR 34 (58) T ss_dssp HHHHTHHHHHHHHHHTCCCHHH T ss_pred HHHHHHHHHHHHHHCCCCCHHH T ss_conf 8999999999999959996899 No 215 >1o5k_A DHDPS, dihydrodipicolinate synthase; TM1521, structural genomics, JCSG, PSI, protein structure initiative; HET: MCL; 1.80A {Thermotoga maritima} SCOP: c.1.10.1 Probab=23.28 E-value=21 Score=11.71 Aligned_cols=88 Identities=11% Similarity=0.068 Sum_probs=37.4 Q ss_pred CCCEEEECCCCCCHHHHHHHCCCHHHHHHHHHHHHHHCCCC--EEEEEECCCHHHHHHHHHHHHHHHHHCCCEEEECCCC Q ss_conf 89889981976796788887799789999999998765986--8999943991158999999999998189899808830 Q 537021.9.peg.7 141 TEGLIMLTGGSSGPIDRAFFFNSSQVAEKRLLTFKKLFGDR--LYVNLQRHRGYDRYRESQVVQLAYTHELPLVATNNSL 218 (1033) Q Consensus 141 ~egLIvlsg~~~g~i~~~l~~~~~~~a~~~l~~l~~~Fgd~--~ylEl~~~~~~e~~~~~~l~~lA~~~~iPlVaTndv~ 218 (1033) -+|+++ +|..|+...+ ..+|-++.++...+.-+++ ++.++......+ .-.+.+.|++.|.--|.....+ T Consensus 47 v~gi~~--~G~tGE~~~L----s~~Er~~l~~~~~~~~~~~~~vi~gv~~~~~~~---ai~~a~~a~~~Gad~v~v~pP~ 117 (306) T 1o5k_A 47 VNALIV--LGTTGESPTV----NEDEREKLVSRTLEIVDGKIPVIVGAGTNSTEK---TLKLVKQAEKLGANGVLVVTPY 117 (306) T ss_dssp CCEEEE--SSGGGTGGGC----CHHHHHHHHHHHHHHHTTSSCEEEECCCSCHHH---HHHHHHHHHHHTCSEEEEECCC T ss_pred CCEEEE--CEEHHHHHHC----CHHHHHHHHHHHHHHCCCCCCEEEECCCCCHHH---HHHHHHHHHHCCCCEEEECCCC T ss_conf 998996--8213425558----999999999999985488888898468654999---9999999995699999985999 Q ss_pred CCCHHHHHHHHHHHHHHCC Q ss_conf 0787889999876688469 Q 537021.9.peg.7 219 FLSEEDYEAHDVLMAVAHS 237 (1033) Q Consensus 219 Y~~~~D~~~h~iL~aI~~~ 237 (1033) |..+.+..+.+-...|... T Consensus 118 ~~~~~~~~i~~~~~~ia~a 136 (306) T 1o5k_A 118 YNKPTQEGLYQHYKYISER 136 (306) T ss_dssp SSCCCHHHHHHHHHHHHTT T ss_pred CCCCCHHHHHHHHHHHHHH T ss_conf 9997999999999999982 No 216 >2wxf_A Phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit delta isoform; transferase, phosphoprotein, isoform-specific inhibitors, cancer; HET: 039; 1.90A {Mus musculus} PDB: 2wxg_A* 2wxh_A* 2wxi_A* 2wxj_A* 2wxk_A* 2wxl_A* 2wxm_A* 2wxn_A* 2wxo_A* 2wxp_A* 2wxq_A* 2wxr_A 2x38_A* Probab=23.15 E-value=20 Score=11.89 Aligned_cols=44 Identities=27% Similarity=0.486 Sum_probs=24.7 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCC Q ss_conf 2135654178999999998874025434332344358899876201410122114 Q 537021.9.peg.7 570 KFDFLGLKTLTLLQKSLDFLAQRGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLE 624 (1033) Q Consensus 570 K~D~Lgl~~L~~i~~~~~~i~~~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~e 624 (1033) .-|.|-+-.+.+++...+ +.| +|+.-. +|..+..|...|+-++= T Consensus 681 RqD~l~lQ~i~~m~~i~~---~~~--ldl~l~------~Y~vl~t~~~~G~IE~V 724 (940) T 2wxf_A 681 RQDMLTLQMIQLMDVLWK---QEG--LDLRMT------PYGCLPTGDRTGLIEVV 724 (940) T ss_dssp HHHHHHHHHHHHHHHHHH---HTT--CCCCCC------CCCEEEEETTEEEEECC T ss_pred CHHHHHHHHHHHHHHHHH---HCC--CCCCCC------CEEEEECCCCCEEEEEE T ss_conf 099999999999999998---669--885330------11899737996457884 No 217 >1dc1_A Bsobi restriction endonuclease; protein-DNA complex, thermophilic enzyme, degenerate DNA recognition, hydrolase/DNA complex; HET: DNA; 1.70A {Geobacillus stearothermophilus} SCOP: c.52.1.11 Probab=23.11 E-value=21 Score=11.68 Aligned_cols=36 Identities=22% Similarity=0.333 Sum_probs=25.1 Q ss_pred CCCHHHHHH-HHHCCCCCHHHHCCCHHHHHHHHHHCCCHHHHH Q ss_conf 443588998-762014101221145347888877174048888 Q 537021.9.peg.7 602 PFDDHETYH-LLTTKGTLGIFQLESSGMRQALEGMQPDCIEDI 643 (1033) Q Consensus 602 p~~D~~~~~-l~~~g~t~GvFQ~es~~~~~~l~~~~p~~~~dl 643 (1033) .+=|+-||+ ||.+|||.| ..||.+-.-+.-+.|.-. T Consensus 109 ~fieElv~RfLLtrGD~LG------G~MRNigG~LAqrK~TR~ 145 (323) T 1dc1_A 109 EFIDELIFRYLLFQGDSLG------GTMRNIAGALAQQKLTRA 145 (323) T ss_dssp GHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHH T ss_pred HHHHHHHHHHHHHCCCCCC------CHHHHHHHHHHHHHHHHH T ss_conf 7899999999996354235------035533678889999999 No 218 >3ouz_A Biotin carboxylase; structural genomics, center for structural genomics of infec diseases, csgid, alpha-beta fold, cytosol, LIG; HET: MSE ADP SRT TLA; 1.90A {Campylobacter jejuni subsp} Probab=23.07 E-value=21 Score=11.68 Aligned_cols=32 Identities=19% Similarity=0.173 Sum_probs=24.1 Q ss_pred CCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 8998899811651774999999999769963899 Q 537021.9.peg.7 39 DQQPAIAITDTNNLFSALEFSQKACAAGIQPIIG 72 (1033) Q Consensus 39 ~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG 72 (1033) +.++-|-|.+.|-+ +++..++|++.|++.|.= T Consensus 4 ~~~kkiLianrGei--A~riiraarelG~~tV~V 35 (446) T 3ouz_A 4 MEIKSILIANRGEI--ALRALRTIKEMGKKAICV 35 (446) T ss_dssp TCCCEEEECCCHHH--HHHHHHHHHHTTCEEEEE T ss_pred CCCCEEEEECCCHH--HHHHHHHHHHCCCCEEEE T ss_conf 54538999898699--999999999869919999 No 219 >1e7u_A Phosphatidylinositol 3-kinase catalytic subunit; phosphoinositide 3-kinase gamma, secondary messenger generation, PI3K, PI 3K, wortmannin; HET: KWT; 2.0A {Sus scrofa} SCOP: a.118.1.6 b.7.1.1 d.15.1.5 d.144.1.4 PDB: 1e8x_A* 1e8w_A* 1e7v_A* 1e90_A* 3l54_A* 3dbs_A* 3l13_A* 3l08_A* 3cst_A* 3csf_A* 3l16_A* 3l17_A* 3ene_A* 1e8y_A 1e8z_A* 2a4z_A* 2a5u_A* 2chw_A* 2chx_A* 2chz_A* ... Probab=23.02 E-value=21 Score=11.67 Aligned_cols=45 Identities=22% Similarity=0.455 Sum_probs=25.3 Q ss_pred CHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCCCCCHHHHCC Q ss_conf 32135654178999999998874025434332344358899876201410122114 Q 537021.9.peg.7 569 IKFDFLGLKTLTLLQKSLDFLAQRGVKVDLSLIPFDDHETYHLLTTKGTLGIFQLE 624 (1033) Q Consensus 569 ~K~D~Lgl~~L~~i~~~~~~i~~~~~~~d~~~ip~~D~~~~~l~~~g~t~GvFQ~e 624 (1033) +.-|.|-+-++.+++... ++.|. |+.- .+|..+..|...|+-++= T Consensus 697 LRqD~l~lQ~i~~m~~i~---~~~~l--dl~l------~~Y~vi~t~~~~GlIE~V 741 (961) T 1e7u_A 697 LRQDMLILQILRIMESIW---ETESL--DLCL------LPYGCISTGDKIGMIEIV 741 (961) T ss_dssp CHHHHHHHHHHHHHHHHH---HHTTC--CCCC------CCCCEEEEETTEEEEECC T ss_pred CCHHHHHHHHHHHHHHHH---HHCCC--CCCC------EEEEEEEECCCCEEEEEC T ss_conf 128879999999999999---87688--7301------025899705883479988 No 220 >1vim_A Hypothetical protein AF1796; structural genomics, unknown function; 1.36A {Archaeoglobus fulgidus} SCOP: c.80.1.3 Probab=22.92 E-value=19 Score=12.10 Aligned_cols=18 Identities=22% Similarity=0.141 Sum_probs=8.5 Q ss_pred HHHHHHHHHCCCEEEECC Q ss_conf 999999981898998088 Q 537021.9.peg.7 199 QVVQLAYTHELPLVATNN 216 (1033) Q Consensus 199 ~l~~lA~~~~iPlVaTnd 216 (1033) .+++.|++.|+|+|+..+ T Consensus 107 ~~~~~ak~~g~~vI~IT~ 124 (200) T 1vim_A 107 NISKKAKDIGSKLVAVTG 124 (200) T ss_dssp HHHHHHHHHTCEEEEEES T ss_pred HHHHHHHHCCCEEEEEEC T ss_conf 999999987996999979 No 221 >1m0s_A Ribose-5-phosphate isomerase A; D-ribose 5-phosphate isomerase, northeast structural genomics consortium, IR21, structural genomics, PSI; HET: CIT; 1.90A {Haemophilus influenzae} SCOP: c.124.1.4 d.58.40.1 Probab=22.92 E-value=21 Score=11.66 Aligned_cols=52 Identities=21% Similarity=0.261 Sum_probs=30.7 Q ss_pred HHHHHHHCCCCCCCCCCCCCCEEEEECC-CEEEEEHHHHCCCCHHHH-HHHHHHH Q ss_conf 9988886368503552334210035319-869850268589698999-9999998 Q 537021.9.peg.7 805 FCQDARQFNIQIMPPSVNTPCVDFKVGD-NRIYYSLAAIKGVGTTTA-RHIMEAS 857 (1033) Q Consensus 805 ~i~e~~~~gi~vl~PdIN~S~~~f~~~~-~~Ir~Gl~~Ikgvg~~~~-~~Ive~r 857 (1033) -...|++.||++..++ .....++.+++ +.|--.|..|||=|.... |.|+... T Consensus 56 t~~~~~~~Gi~~~~l~-~~~~iDiaiDGaDevd~~l~lIKGgGgallrEKiva~~ 109 (219) T 1m0s_A 56 SEELLRKQGIEVFNAN-DVSSLDIYVDGADEINPQKMMIKGGGAALTREKIVAAL 109 (219) T ss_dssp HHHHHHHTTCCBCCGG-GCSCEEEEEECCSEECTTSCEECCTTSCHHHHHHHHHH T ss_pred HHHHHHHCCCCCCCCC-CCCCCCEECCCHHHCCCCCCCHHCCCHHHHHHHHHHHH T ss_conf 9999998799733444-35654532364553084758621583888898999985 No 222 >1wvf_A 4-cresol dehydrogenase [hydroxylating] flavoprotein subunit; electron-transfer, FAD, oxidoreductase; HET: FAD; 1.30A {Pseudomonas putida} SCOP: d.58.32.1 d.145.1.1 PDB: 1wve_A* 1dii_A* 1diq_A* Probab=22.86 E-value=19 Score=12.00 Aligned_cols=21 Identities=19% Similarity=0.447 Sum_probs=15.9 Q ss_pred HHHHHHHHHCCCCCCCC-CCCC Q ss_conf 89999985104764334-4442 Q 537021.9.peg.7 360 VADFIQWAKKNNIPVGP-GRGS 380 (1033) Q Consensus 360 V~Div~~ak~~gI~vGp-GRGS 380 (1033) |..+|++|+++++++-| |.|+ T Consensus 69 V~~~v~~a~~~~i~v~~rggG~ 90 (520) T 1wvf_A 69 VQGVVKICNEHKIPIWTISTGR 90 (520) T ss_dssp HHHHHHHHHHHTCCEEEESSCC T ss_pred HHHHHHHHHHCCCEEEEECCCC T ss_conf 9999999998798099989998 No 223 >3ooo_A Proline dipeptidase; structural genomics, PSI-2, protein structure initiative, MI center for structural genomics, MCSG, hydrolase; HET: MSE; 1.57A {Streptococcus agalactiae} Probab=22.85 E-value=21 Score=11.65 Aligned_cols=30 Identities=17% Similarity=0.110 Sum_probs=20.7 Q ss_pred CCCHHHHHHHHHHCCCCEEEEECCCHHHHH Q ss_conf 299899999999789988998116517749 Q 537021.9.peg.7 26 ALSLNNILDKIAADQQPAIAITDTNNLFSA 55 (1033) Q Consensus 26 ~~~~e~LV~~A~~~G~~AiAITD~~nl~Ga 55 (1033) +.+++.+-+..++.|..++-||+..|++=. T Consensus 2 m~Rl~klr~~m~~~~ld~~lit~~~ni~Yl 31 (132) T 3ooo_A 2 MSKLNRIRHHLHSVQAELAVFSDPVTVNYL 31 (132) T ss_dssp CHHHHHHHHHHHHTTCSEEEECCHHHHHHH T ss_pred CHHHHHHHHHHHHCCCCEEEEECCCHHHHH T ss_conf 739999999999869999999253132474 No 224 >2h31_A Multifunctional protein ADE2; alpha-beta-alpha, ligase, lyase; 2.80A {Homo sapiens} Probab=22.70 E-value=21 Score=11.62 Aligned_cols=26 Identities=8% Similarity=-0.022 Sum_probs=11.0 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHCCCC Q ss_conf 778989860630557888999998510476 Q 537021.9.peg.7 343 FELNVIARMKFSGYFLIVADFIQWAKKNNI 372 (1033) Q Consensus 343 ~EL~vI~~~gf~~YFLiV~Div~~ak~~gI 372 (1033) .|++.|.++-..-|= -+..++.+.|+ T Consensus 160 ~e~e~i~~~a~~v~~----~l~~~~~~~Gl 185 (425) T 2h31_A 160 TEVDIMSHATQAIFE----ILEKSWLPQNC 185 (425) T ss_dssp HHHHHHHHHHHHHHH----HHHHHHGGGSC T ss_pred HHHHHHHHHHHHHHH----HHHHHHHHCCC T ss_conf 899999999999999----99999997898 No 225 >2c7f_A Alpha-L-arabinofuranosidase; glycosidase, xylan, arabinan, hydrolase; HET: AHR; 2.7A {Clostridium thermocellum} SCOP: b.71.1.2 c.1.8.3 PDB: 2c8n_A Probab=22.60 E-value=21 Score=11.61 Aligned_cols=45 Identities=11% Similarity=0.204 Sum_probs=31.0 Q ss_pred HHHHHHHHHCCCCEEEE-----EC-------------------------CCHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 99999999789988998-----11-------------------------65177499999999976996389999 Q 537021.9.peg.7 30 NNILDKIAADQQPAIAI-----TD-------------------------TNNLFSALEFSQKACAAGIQPIIGCQ 74 (1033) Q Consensus 30 e~LV~~A~~~G~~AiAI-----TD-------------------------~~nl~Gav~F~~ackk~gIKPIiG~E 74 (1033) .++++..+++|.+.|=. +| .-|-+|.-+|..-|++.|..|++.+- T Consensus 62 ~dv~~~l~~l~~~~lR~PGG~~~~~Y~W~~~iGp~~~R~~~~~~~w~~~~~n~~g~~ef~~~~~~~gaep~i~vn 136 (513) T 2c7f_A 62 KDVIELVKELNVPIIRYPGGNFVSNYFWEDGVGPVEDRPRRLDLAWKSIEPNQVGINEFAKWCKKVNAEIMMAVN 136 (513) T ss_dssp HHHHHHHHHHCCSEEEESCSTTGGGCCGGGGSSCGGGCCCEEETTTTEEECCSSCTHHHHHHHHHTTCEEEEECC T ss_pred HHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCEEEEEEE T ss_conf 999999996299979839975157442058869968887655787674046864789999999980997999997 No 226 >1ux8_A YJBI protein; oxygen storage/transport, truncated hemoglobin, oxygen transport; HET: HEM; 2.15A {Bacillus subtilis} SCOP: a.1.1.1 Probab=22.41 E-value=21 Score=11.58 Aligned_cols=73 Identities=21% Similarity=0.169 Sum_probs=41.0 Q ss_pred HHHHHHHHHHHCCCHHHHH----HHHHHHCC-C-CCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCH Q ss_conf 5689887886326841307----88976201-2-110167787899999862499999999999999975000133101 Q 537021.9.peg.7 693 EQVMQIAQLLSGYSLSEAD----VLRRAMGK-K-IKEEMDKQKERFISGASKNGISKTIAVNIFELLAKFADYGFNKSH 765 (1033) Q Consensus 693 EQvm~ia~~~ag~~~~ead----~lRra~~k-k-~~~~~~~~~~~f~~g~~~~g~~~~~a~~i~~~i~~f~~Y~FnksH 765 (1033) .|...++..++|-..+... .+|.+-.. . ..+..+.+-..|.+-..+.|++++.++++|..++..|...=|++- T Consensus 48 ~~~~fl~~~~GGp~~~~~~y~g~~~~~~H~~~~It~~~fd~wl~~~~~al~~~~~~~~~~~~~~~~~~~~a~~m~N~~~ 126 (132) T 1ux8_A 48 KQKQFLTQYLGGPPLYTEEHGHPMLRARHLPFPITNERADAWLSCMKDAMDHVGLEGEIREFLFGRLELTARHMVNQTE 126 (132) T ss_dssp HHHHHHHHHTTSCCHHHHHHCCCCHHHHHTTSCCCHHHHHHHHHHHHHHHHHHTCCSHHHHHHHHHHHHHHHHHCC--- T ss_pred HHHHHHHHHCCCCCCCCCCCCCCCHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 9999999980697656533477789988627999999999999999999998299999999999999999999608998 No 227 >3fmt_A Protein SEQA; protein-DNA complex, hemimethylated GATC, DNA replication; HET: 6MA; 2.98A {Escherichia coli} Probab=22.33 E-value=21 Score=11.57 Aligned_cols=21 Identities=10% Similarity=0.395 Sum_probs=6.6 Q ss_pred HHHHHHHHHHCCCCHHHHHHH Q ss_conf 899999997527998898865 Q 537021.9.peg.7 455 KAALRDVGRALQMPYSQVDRL 475 (1033) Q Consensus 455 k~aird~~r~~g~~~~~~~~~ 475 (1033) |+-+..+...+|+|.+.++++ T Consensus 138 r~ml~~vm~~mg~~~~~ie~v 158 (162) T 3fmt_A 138 CSMIEHIMQSMQFPAELIEKV 158 (162) T ss_dssp HHHHHHHHHHTTCCHHHHHHH T ss_pred HHHHHHHHHHCCCCHHHHHHH T ss_conf 999999999909999999998 No 228 >2abk_A Endonuclease III; DNA-repair, DNA glycosylase; 1.85A {Escherichia coli} SCOP: a.96.1.1 Probab=22.16 E-value=22 Score=11.54 Aligned_cols=17 Identities=29% Similarity=0.351 Sum_probs=8.2 Q ss_pred CCCCCCHHHHHHHHHHHC Q ss_conf 344442046677655440 Q 537021.9.peg.7 375 GPGRGSGAGSVTAYALTI 392 (1033) Q Consensus 375 GpGRGSAagSLVaY~LgI 392 (1033) |-||=.| +++.+|+.+. T Consensus 116 GVG~kTA-~~il~~a~~~ 132 (211) T 2abk_A 116 GVGRKTA-NVVLNTAFGW 132 (211) T ss_dssp TCCHHHH-HHHHHHHHCC T ss_pred CCCHHHH-HHHHHHHHCC T ss_conf 8736999-9999987358 No 229 >1ug6_A Beta-glycosidase; glucosidase, atomic resolution, riken structural genomics/proteomics initiative, RSGI, structural genomics, hydrolase; 0.99A {Thermus thermophilus} SCOP: c.1.8.4 PDB: 1np2_A Probab=22.09 E-value=22 Score=11.53 Aligned_cols=17 Identities=12% Similarity=0.126 Sum_probs=7.5 Q ss_pred HHHHHHHHHHCCCEEEE Q ss_conf 99999999818989980 Q 537021.9.peg.7 198 SQVVQLAYTHELPLVAT 214 (1033) Q Consensus 198 ~~l~~lA~~~~iPlVaT 214 (1033) +.+++-..+.||..++| T Consensus 100 ~~~i~~l~~~gI~P~VT 116 (431) T 1ug6_A 100 DRLVDRLLASGITPFLT 116 (431) T ss_dssp HHHHHHHHHTTCEEEEE T ss_pred HHHHHHHHHCCCEEEEE T ss_conf 99999999749978651 No 230 >1cxq_A Avian sarcoma virus integrase; mixed beta-sheet surrounded by alpha-helices, transferase; HET: EPE; 1.02A {Avian sarcoma virus} SCOP: c.55.3.2 PDB: 1cxu_A* 1czb_A* 1cz9_A* 1asu_A* 1asw_A* 1asv_A 1a5v_A* 1a5w_A* 1a5x_A* 1vsf_A* 1vse_A* 1vsm_A 1vsd_A* 1vsh_A* 1vsi_A* 1vsj_A* 1vsk_A 1vsl_A Probab=21.91 E-value=22 Score=11.51 Aligned_cols=19 Identities=0% Similarity=-0.134 Sum_probs=8.9 Q ss_pred HHHHHHHHHHHCCCEEEEC Q ss_conf 9999999998189899808 Q 537021.9.peg.7 197 ESQVVQLAYTHELPLVATN 215 (1033) Q Consensus 197 ~~~l~~lA~~~~iPlVaTn 215 (1033) ...+.++..++||....+. T Consensus 81 s~~~~~~l~~~gI~~~~~~ 99 (162) T 1cxq_A 81 SKSTREWLARWGIAHTTGI 99 (162) T ss_dssp SHHHHHHHHHHTCEEECCC T ss_pred CHHHHHHHHHCCEEEEECC T ss_conf 6789998871690870078 No 231 >1yqd_A Sinapyl alcohol dehydrogenase; lignin, monolignol, oxidoreductase, zinc-dependent, plant defense, biosynthesis, substrate inhibition; HET: NAP; 1.65A {Populus tremuloides} PDB: 1yqx_A* Probab=21.77 E-value=22 Score=11.48 Aligned_cols=46 Identities=20% Similarity=0.294 Sum_probs=32.2 Q ss_pred CHHHHHHHHHHHHHHCCCCCCCCCCCCH-HHHHHHHHCCCCCHHHHC Q ss_conf 7899999999887402543433234435-889987620141012211 Q 537021.9.peg.7 578 TLTLLQKSLDFLAQRGVKVDLSLIPFDD-HETYHLLTTKGTLGIFQL 623 (1033) Q Consensus 578 ~L~~i~~~~~~i~~~~~~~d~~~ip~~D-~~~~~l~~~g~t~GvFQ~ 623 (1033) +..-++++++++.+...+.+++.+|++| ++.|+.+.+|+..|=.=+ T Consensus 309 ~~~d~~~~~~li~~g~i~~~i~~~~l~~i~eA~~~l~~g~~~GKiVl 355 (366) T 1yqd_A 309 GMKETQEMIDFAAKHNITADIEVISTDYLNTAMERLAKNDVRYRFVI 355 (366) T ss_dssp CHHHHHHHHHHHHHTTCCCCEEEECGGGHHHHHHHHHTTCCSSEEEE T ss_pred CHHHHHHHHHHHHCCCCCCCEEEEEHHHHHHHHHHHHCCCCCEEEEE T ss_conf 99999999999985999555799859999999999976998408999 No 232 >1uuf_A YAHK, zinc-type alcohol dehydrogenase-like protein YAHK; oxidoreductase, zinc binding, oxydoreductase, metal-binding; 1.76A {Escherichia coli} SCOP: b.35.1.2 c.2.1.1 Probab=21.65 E-value=22 Score=11.47 Aligned_cols=40 Identities=28% Similarity=0.401 Sum_probs=29.3 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCH-HHHHHHHHCCCCCH Q ss_conf 99999999887402543433234435-88998762014101 Q 537021.9.peg.7 580 TLLQKSLDFLAQRGVKVDLSLIPFDD-HETYHLLTTKGTLG 619 (1033) Q Consensus 580 ~~i~~~~~~i~~~~~~~d~~~ip~~D-~~~~~l~~~g~t~G 619 (1033) .-++++++++.+...+.+.+.+|++| ++.|+.+.+|++.| T Consensus 318 ~~~~~~~~li~~g~i~~~~~~~~le~i~~A~~~l~~g~~~G 358 (369) T 1uuf_A 318 PETQEMLDFCAEHGIVADIEMIRADQINEAYERMLRGDVKY 358 (369) T ss_dssp HHHHHHHHHHHHHTCCCCEEEECGGGHHHHHHHHHTTCSSS T ss_pred HHHHHHHHHHHHCCCCCCEEEEEHHHHHHHHHHHHCCCCCE T ss_conf 99999999998298988769983999999999997699816 No 233 >3k8k_A Alpha-amylase, SUSG; alpha8/BETA8 barrel, CBM, beta-sandwich, membrane protein; 2.20A {Bacteroides thetaiotaomicron} PDB: 3k8m_A* 3k8l_A* Probab=21.64 E-value=22 Score=11.47 Aligned_cols=13 Identities=8% Similarity=0.256 Sum_probs=4.7 Q ss_pred CCCCCHHHHHHHH Q ss_conf 4444204667765 Q 537021.9.peg.7 376 PGRGSGAGSVTAY 388 (1033) Q Consensus 376 pGRGSAagSLVaY 388 (1033) +..|...++...+ T Consensus 173 ~~~~~~~~~~~~~ 185 (669) T 3k8k_A 173 TQEGAAGYNAAEW 185 (669) T ss_dssp CSSGGGGCCGGGE T ss_pred CCCCCCCCCCCCC T ss_conf 6667788997644 No 234 >3ahy_A Beta-glucosidase; cellulases, glycosyl hydrolase, manganese enhancement, hydro; 1.63A {Trichoderma reesei} Probab=21.64 E-value=22 Score=11.46 Aligned_cols=48 Identities=13% Similarity=0.152 Sum_probs=25.8 Q ss_pred HHCCCHHHHHHHHHHCCCCEEEEE--------C--C---CHHHHHHHH----HHHHHHCCCCEEEE Q ss_conf 432998999999997899889981--------1--6---517749999----99999769963899 Q 537021.9.peg.7 24 EGALSLNNILDKIAADQQPAIAIT--------D--T---NNLFSALEF----SQKACAAGIQPIIG 72 (1033) Q Consensus 24 dg~~~~e~LV~~A~~~G~~AiAIT--------D--~---~nl~Gav~F----~~ackk~gIKPIiG 72 (1033) |..-+-++=++.++++|++|.=.. + . =|-. .++| ..+|+++||+||+- T Consensus 59 d~y~ry~eDi~l~~~lG~~~yRfSI~WsRI~P~g~~~~~~n~~-gl~~Y~~~id~l~~~GI~P~vT 123 (473) T 3ahy_A 59 DSYNRTAEDIALLKSLGAKSYRFSISWSRIIPEGGRGDAVNQA-GIDHYVKFVDDLLDAGITPFIT 123 (473) T ss_dssp CGGGCHHHHHHHHHHHTCSEEEEECCHHHHSSSCSTTSCCCHH-HHHHHHHHHHHHHHTTCEEEEE T ss_pred CHHHHHHHHHHHHHHHCCCEEECCCCHHHCCCCCCCCCCCCHH-HHHHHHHHHHHHHHCCCEEEEE T ss_conf 6255219999999985989898336498705579878877989-9999999999998579767888 No 235 >1yub_A Ermam, rRNA methyltransferase; MLS antibiotics; NMR {Streptococcus pneumoniae} SCOP: c.66.1.24 Probab=21.63 E-value=22 Score=11.46 Aligned_cols=54 Identities=19% Similarity=0.307 Sum_probs=34.2 Q ss_pred HHHHHHHHHC----CCCCCCCCCCCHHHHHHHHHHHCC--CCCHHHCCCHHHHHHHHCCC Q ss_conf 8999998510----476433444420466776554403--75633323105676210014 Q 537021.9.peg.7 360 VADFIQWAKK----NNIPVGPGRGSGAGSVTAYALTIT--DIDPLRFSLLFERFLNPDRM 413 (1033) Q Consensus 360 V~Div~~ak~----~gI~vGpGRGSAagSLVaY~LgIT--~VDPi~~~LlFERFLnp~R~ 413 (1033) +..||+++.- .=+-+|||.|.--..|...+-.+| ++||--+..+=+||-+..+. T Consensus 18 ~~~Iv~~~~~~~~d~VlEIGpG~G~LT~~L~~~~~~v~aiE~D~~l~~~l~~~~~~~~n~ 77 (245) T 1yub_A 18 LNQIIKQLNLKETDTVYEIGTGKGHLTTKLAKISKQVTSIELDSHLFNLSSEKLKLNTRV 77 (245) T ss_dssp HHHHHHHCCCCSSEEEEECSCCCSSCSHHHHHHSSEEEESSSSCSSSSSSSCTTTTCSEE T ss_pred HHHHHHHCCCCCCCEEEEECCCHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHCCCCC T ss_conf 999999458999996899799735999999961650040340446689887554113400 No 236 >3g8r_A Probable spore coat polysaccharide biosynthesis protein E; structural genomics, protein structure initiative; 2.49A {Chromobacterium violaceum atcc 12472} Probab=21.62 E-value=22 Score=11.46 Aligned_cols=43 Identities=16% Similarity=0.206 Sum_probs=22.9 Q ss_pred HHHHHHHHHHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 0466776554403756333231056762100145787533365687899999998 Q 537021.9.peg.7 381 GAGSVTAYALTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFCQDRRDEVIRYVQ 435 (1033) Q Consensus 381 AagSLVaY~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~~R~~vi~y~~ 435 (1033) ++-++.|+.+|.+ .+||-.-.+|...+| | +++. --++..+++. T Consensus 208 ~~~~~~Av~~GA~---------iIEKHfTldk~~~g~-d-~~S~-~~~e~~~~v~ 250 (350) T 3g8r_A 208 MEPIMLAVAQGAT---------VFEKHVGLPTDQYGI-N-NYSA-NPEQVRRWLA 250 (350) T ss_dssp CHHHHHHHHTTCC---------EEEEEBCCCBTTBCC-C-TTCB-CHHHHHHHHH T ss_pred CCCHHHHHHCCCC---------CEEEEEECCCCCCCC-C-HHHC-CCHHHHHHHH T ss_conf 2126878858974---------120678447667897-2-3202-7278999999 No 237 >3ces_A MNMG, tRNA uridine 5-carboxymethylaminomethyl modification enzyme GIDA, GIDA; tRNA modification, FAD binding domain, structural genomics; 2.41A {Escherichia coli} PDB: 3cp2_A 3g05_A Probab=21.48 E-value=16 Score=12.63 Aligned_cols=11 Identities=18% Similarity=0.078 Sum_probs=5.2 Q ss_pred HHHCCCCCHHH Q ss_conf 76201410122 Q 537021.9.peg.7 611 LLTTKGTLGIF 621 (1033) Q Consensus 611 l~~~g~t~GvF 621 (1033) +|-.|.-.|+- T Consensus 387 Lf~AGQI~Gte 397 (651) T 3ces_A 387 LFFAGQINGTT 397 (651) T ss_dssp EEECSGGGTCC T ss_pred EEEECCCCCCH T ss_conf 78731203754 No 238 >1sg6_A Pentafunctional AROM polypeptide; shikimate pathway, aromatic amino acid biosynthesis, DHQS, open form, form J, domain movement, cyclase; HET: NAD; 1.70A {Emericella nidulans} SCOP: e.22.1.1 PDB: 1nr5_A* 1nrx_A* 1nua_A 1nva_A* 1nvb_A* 1nvd_A* 1nve_A* 1nvf_A* 1dqs_A* Probab=21.37 E-value=22 Score=11.42 Aligned_cols=19 Identities=0% Similarity=0.071 Sum_probs=7.5 Q ss_pred HHHHHHHHHCCCCCCCCCC Q ss_conf 8999888863685035523 Q 537021.9.peg.7 803 KKFCQDARQFNIQIMPPSV 821 (1033) Q Consensus 803 ~~~i~e~~~~gi~vl~PdI 821 (1033) ..+.+=+++.|.++..||+ T Consensus 314 ~~i~~ll~~lgLPt~l~~~ 332 (393) T 1sg6_A 314 SRIVKCLAAYGLPTSLKDA 332 (393) T ss_dssp HHHHHHHHHTTCCCSTTCH T ss_pred HHHHHHHHHCCCCCCCHHH T ss_conf 9999999986999644243 No 239 >1kg2_A A/G-specific adenine glycosylase; DNA repair, hydrolase; 1.20A {Escherichia coli} SCOP: a.96.1.2 PDB: 1kg3_A 1muy_A 1kg6_A 1kg5_A 1mun_A 1mud_A 1kg4_A 1weg_A 1wei_A* 1wef_A* 1kg7_A 1kqj_A Probab=21.36 E-value=22 Score=11.42 Aligned_cols=24 Identities=46% Similarity=0.637 Sum_probs=13.1 Q ss_pred HHHHCCCCHHHHHHHHHHHHCCCC Q ss_conf 268589698999999999837999 Q 537021.9.peg.7 839 LAAIKGVGTTTARHIMEASADKPF 862 (1033) Q Consensus 839 l~~Ikgvg~~~~~~Ive~r~~g~f 862 (1033) |.+++|||.+++..|+..--+.|. T Consensus 111 L~~LpGVG~kTA~~il~~a~~~p~ 134 (225) T 1kg2_A 111 VAALPGVGRSTAGAILSLSLGKHF 134 (225) T ss_dssp HHTSTTCCHHHHHHHHHHHHCCSC T ss_pred HHHCCCCCHHHHHHHHHHHCCCCC T ss_conf 975898747899999998529861 No 240 >1ccw_A Protein (glutamate mutase); coenzyme B12, radical reaction, TIM- barrel, rossman-fold, isomerase; HET: CNC TAR; 1.60A {Clostridium cochlearium} SCOP: c.23.6.1 PDB: 1cb7_A* 1b1a_A 1i9c_A* 1be1_A 1fmf_A 1id8_A* Probab=21.20 E-value=22 Score=11.40 Aligned_cols=15 Identities=7% Similarity=0.109 Sum_probs=6.3 Q ss_pred HHHHHHHHCCCEEEE Q ss_conf 999999818989980 Q 537021.9.peg.7 200 VVQLAYTHELPLVAT 214 (1033) Q Consensus 200 l~~lA~~~~iPlVaT 214 (1033) +++.+++.+.-+|+. T Consensus 46 iv~~~~~~~~d~V~i 60 (137) T 1ccw_A 46 FIKAAIETKADAILV 60 (137) T ss_dssp HHHHHHHHTCSEEEE T ss_pred HHHHHHHHCCCEEEE T ss_conf 999999839987887 No 241 >1k77_A EC1530, hypothetical protein YGBM; TIM barrel, structural genomics, PSI, protein structure initiative, midwest center for structural genomics; 1.63A {Escherichia coli} SCOP: c.1.15.5 Probab=21.16 E-value=22 Score=11.39 Aligned_cols=51 Identities=10% Similarity=0.066 Sum_probs=36.7 Q ss_pred HHHHHCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 7644329989999999978998899811651774999999999769963899 Q 537021.9.peg.7 21 SLLEGALSLNNILDKIAADQQPAIAITDTNNLFSALEFSQKACAAGIQPIIG 72 (1033) Q Consensus 21 Slldg~~~~e~LV~~A~~~G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG 72 (1033) |++..=.++++.++.|++.|+.+|-|-.... ..+-+.-+.+++.|++...- T Consensus 9 ~~~~~~~pl~e~i~~a~~~GfdgIEl~~~~~-~~~~~~~~~l~~~gl~~~~~ 59 (260) T 1k77_A 9 SMMFTEVPFIERFAAARKAGFDAVEFLFPYN-YSTLQIQKQLEQNHLTLALF 59 (260) T ss_dssp TTSSTTSCGGGHHHHHHHHTCSEEECSCCTT-SCHHHHHHHHHHTTCEEEEE T ss_pred CCCCCCCCHHHHHHHHHHHCCCEEEECCCCC-CCHHHHHHHHHHCCCCEEEE T ss_conf 1027899999999999994999899788788-99999999999869918996 No 242 >1fad_A Protein (FADD protein); apoptosis, death domain; NMR {Mus musculus} SCOP: a.77.1.2 Probab=21.13 E-value=22 Score=11.39 Aligned_cols=37 Identities=14% Similarity=0.256 Sum_probs=23.9 Q ss_pred HHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 78999999983038021554220778778899999997527998898865431 Q 537021.9.peg.7 426 RRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKL 478 (1033) Q Consensus 426 ~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~ 478 (1033) ..+.++++|.++-|.+ -+.+||.||++..+++.|... T Consensus 12 dl~~~f~~ia~~lg~~----------------W~~lar~Lgls~~~I~~I~~~ 48 (99) T 1fad_A 12 YLQVAFDIVCDNVGRD----------------WKRLARELKVSEAKMDGIEEK 48 (99) T ss_dssp HHHHHHHHHHHHHHHH----------------HHHHHHHTTCCHHHHHHHHHH T ss_pred HHHHHHHHHHHHHHHH----------------HHHHHHHHCCCHHHHHHHHHH T ss_conf 7999999999988784----------------999999939199999999987 No 243 >1dd3_A 50S ribosomal protein L7/L12; dimer formation, flexibility, hinge region, four-helix- bundle, five-helix- bundle, alpha-beta structure; 2.00A {Thermotoga maritima} SCOP: a.108.1.1 d.45.1.1 PDB: 1dd4_A* 1giy_I 1yl3_I Probab=20.83 E-value=23 Score=11.34 Aligned_cols=46 Identities=30% Similarity=0.429 Sum_probs=32.6 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHH------HHCCCCHHHHHHHHHHHHHHHH Q ss_conf 6898878863268413078897620121101677878999998------6249999999999999997500 Q 537021.9.peg.7 694 QVMQIAQLLSGYSLSEADVLRRAMGKKIKEEMDKQKERFISGA------SKNGISKTIAVNIFELLAKFAD 758 (1033) Q Consensus 694 Qvm~ia~~~ag~~~~ead~lRra~~kk~~~~~~~~~~~f~~g~------~~~g~~~~~a~~i~~~i~~f~~ 758 (1033) .|++..+.+.|++|.||- +|+++| ++.|++.+.|+++-..++..|+ T Consensus 72 ~vIK~VR~it~LgLkEAK-------------------~lVE~aP~~~~viKe~vsKeeAE~iK~kLe~aGA 123 (128) T 1dd3_A 72 QVIKVVREITGLGLKEAK-------------------DLVEKAGSPDAVIKSGVSKEEAEEIKKKLEEAGA 123 (128) T ss_dssp HHHHHHHHHHCCCHHHHH-------------------HHHTTTTSTTCEEEEEECHHHHHHHHHHHHHTTC T ss_pred HHHHHHHHHCCCCHHHHH-------------------HHHHCCCCCHHHHHCCCCHHHHHHHHHHHHHCCC T ss_conf 123457753699889999-------------------9997179840678538899999999999998598 No 244 >2dzd_A Pyruvate carboxylase; biotin carboxylase, ligase; 2.40A {Geobacillus thermodenitrificans} Probab=20.80 E-value=23 Score=11.34 Aligned_cols=31 Identities=10% Similarity=0.247 Sum_probs=22.4 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 998899811651774999999999769963899 Q 537021.9.peg.7 40 QQPAIAITDTNNLFSALEFSQKACAAGIQPIIG 72 (1033) Q Consensus 40 G~~AiAITD~~nl~Gav~F~~ackk~gIKPIiG 72 (1033) .++-|-|..+|-. +++..++|++.||++|.= T Consensus 5 ~~kkvLianrGei--a~ri~ra~relGi~tVav 35 (461) T 2dzd_A 5 RIRKVLVANRGEI--AIRVFRACTELGIRTVAI 35 (461) T ss_dssp CCSEEEECSCHHH--HHHHHHHHHHHTCEEEEE T ss_pred CCCEEEEECCCHH--HHHHHHHHHHCCCEEEEE T ss_conf 6668999897299--999999999879969999 No 245 >3ccf_A Cyclopropane-fatty-acyl-phospholipid synthase; YP_321342.1, putative methyltransferase, structural genomics; 1.90A {Anabaena variabilis atcc 29413} Probab=20.77 E-value=14 Score=13.17 Aligned_cols=18 Identities=17% Similarity=0.318 Sum_probs=10.0 Q ss_pred CEEEECCCCCCHHHHHHH Q ss_conf 889981976796788887 Q 537021.9.peg.7 143 GLIMLTGGSSGPIDRAFF 160 (1033) Q Consensus 143 gLIvlsg~~~g~i~~~l~ 160 (1033) .-|+=.||-.|.+...+. T Consensus 59 ~~VLDiGcG~G~~~~~la 76 (279) T 3ccf_A 59 EFILDLGCGTGQLTEKIA 76 (279) T ss_dssp CEEEEETCTTSHHHHHHH T ss_pred CEEEEECCCCCHHHHHHH T ss_conf 999993585859999999 No 246 >2jy5_A Ubiquilin-1; UBA, alternative splicing, cytoplasm, nucleus, phosphoprotein, proteasome, signaling protein; NMR {Homo sapiens} PDB: 2jy6_B Probab=20.65 E-value=22 Score=11.51 Aligned_cols=24 Identities=17% Similarity=0.169 Sum_probs=15.2 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 998877898986063055788899 Q 537021.9.peg.7 339 QRLDFELNVIARMKFSGYFLIVAD 362 (1033) Q Consensus 339 ~RLe~EL~vI~~~gf~~YFLiV~D 362 (1033) +|.+.+|....+|||.|.=..+.- T Consensus 10 ~~y~~qL~~L~eMGF~d~~~n~~A 33 (52) T 2jy5_A 10 VRFQQQLEQLSAMGFLNREANLQA 33 (52) T ss_dssp TTTHHHHHHHHHTTCCCHHHHHHH T ss_pred HHHHHHHHHHHHCCCCCHHHHHHH T ss_conf 999999999999499966999999 No 247 >2aam_A Hypothetical protein TM1410; structural genomics, joint center for structural genomics, JCSG, protein structure initiative, PSI; HET: MSE UNL; 2.20A {Thermotoga maritima} SCOP: c.1.8.15 Probab=20.48 E-value=23 Score=11.28 Aligned_cols=20 Identities=10% Similarity=0.443 Sum_probs=8.5 Q ss_pred HHHHHHHHCC--CCCCCCCCCC Q ss_conf 9999985104--7643344442 Q 537021.9.peg.7 361 ADFIQWAKKN--NIPVGPGRGS 380 (1033) Q Consensus 361 ~Div~~ak~~--gI~vGpGRGS 380 (1033) .+|++.+|++ +..+.+-||- T Consensus 170 ~~L~~~~r~r~P~~~ii~NnG~ 191 (309) T 2aam_A 170 LEIAEYVRERKPDMLIIPQNGE 191 (309) T ss_dssp HHHHHHHHHHCTTCEEEEBSCG T ss_pred HHHHHHHHHHCCCCEEEECCCH T ss_conf 9999999975899589964757 No 248 >2knz_A Ubiquilin-4; cytoplasm, endoplasmic reticulum, nucleus, phosphoprotein, protein binding; NMR {Mus musculus} Probab=20.47 E-value=23 Score=11.28 Aligned_cols=22 Identities=18% Similarity=0.181 Sum_probs=13.9 Q ss_pred HHHHHHHHHHHHHCCHHHHHHH Q ss_conf 9988778989860630557888 Q 537021.9.peg.7 339 QRLDFELNVIARMKFSGYFLIV 360 (1033) Q Consensus 339 ~RLe~EL~vI~~~gf~~YFLiV 360 (1033) +|.+.+|....+|||.|-=..+ T Consensus 9 ~~y~~qL~~L~eMGF~D~~~Nl 30 (53) T 2knz_A 9 VRFQQQLEQLNSMGFINREANL 30 (53) T ss_dssp HHHHHHHHHHHTTTCCCHHHHH T ss_pred HHHHHHHHHHHHCCCCCHHHHH T ss_conf 8999999999994999669999 No 249 >1wxp_A THO complex subunit 1; death domain, structural genomics, nuclear matrix, riken structural genomics/proteomics initiative, RSGI, transport protein; NMR {Homo sapiens} Probab=20.43 E-value=23 Score=11.28 Aligned_cols=49 Identities=24% Similarity=0.198 Sum_probs=30.5 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 1457875333656878999999983038021554220778778899999997527998898865431 Q 537021.9.peg.7 412 RMSMPDFDIDFCQDRRDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQMPYSQVDRLCKL 478 (1033) Q Consensus 412 R~~~PDIDiDf~~~~R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g~~~~~~~~~~~~ 478 (1033) ..+=||+--|=+- .++.+.+|.++-|.+ -+.+||.||++..+++.|... T Consensus 4 ~~~~p~~~~~~~l--~~~~l~~ia~~lg~~----------------W~~la~~Lgl~~~~I~~I~~~ 52 (110) T 1wxp_A 4 GSSGPDVRRDKPV--TGEQIEVFANKLGEQ----------------WKILAPYLEMKDSEIRQIECD 52 (110) T ss_dssp CCCCCCCCSSSCC--CHHHHHHHHHHHTTT----------------HHHHTTTTTCCHHHHHHHHHH T ss_pred CCCCCCCCCCCCC--HHHHHHHHHHHHHHH----------------HHHHHHHHCCCHHHHHHHHHH T ss_conf 8889853236545--099999999999776----------------999999909399999999985 No 250 >1qtw_A Endonuclease IV; DNA repair enzyme, TIM barrel, trinuclear Zn cluster, hydrolase; 1.02A {Escherichia coli} SCOP: c.1.15.1 PDB: 1qum_A* 2nqh_A 2nqj_A* 2nq9_A* Probab=20.36 E-value=23 Score=11.27 Aligned_cols=51 Identities=18% Similarity=0.049 Sum_probs=32.2 Q ss_pred HHHHCHHHHHHHCCCHHHHHHHHHHCCCCEEEEECC---------CHHHHHHHHHHHHHHCCCCE Q ss_conf 433144476443299899999999789988998116---------51774999999999769963 Q 537021.9.peg.7 14 LQVHSSYSLLEGALSLNNILDKIAADQQPAIAITDT---------NNLFSALEFSQKACAAGIQP 69 (1033) Q Consensus 14 Lh~hS~ySlldg~~~~e~LV~~A~~~G~~AiAITD~---------~nl~Gav~F~~ackk~gIKP 69 (1033) |-+|...| ..+++.+++|++.|+.++-|=-. -+--.+-+|-+.++++|++. T Consensus 4 ~g~h~~~~-----~gl~~a~~~a~~~G~~~~qif~~~p~~w~~~~~~~~~~~~~r~~~~~~~l~~ 63 (285) T 1qtw_A 4 IGAHVSAA-----GGLANAAIRAAEIDATAFALFTKNQRQWRAAPLTTQTIDEFKAACEKYHYTS 63 (285) T ss_dssp EEEECCCT-----TCHHHHHHHHHHTTCSEEECCSSCSSCSSCCCCCHHHHHHHHHHHHHTTCCG T ss_pred EEEEECCC-----CCHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCCCC T ss_conf 78877775-----4799999999974999999978997777889999999999999999829997 No 251 >3cb2_A Gamma-1-tubulin, tubulin gamma-1 chain; lattice, microtubule, nucleation, GTPase, lateral interaction, structural protein, hydrolase; HET: GDP; 2.30A {Homo sapiens} PDB: 1z5v_A* 1z5w_A* Probab=20.32 E-value=17 Score=12.49 Aligned_cols=55 Identities=22% Similarity=0.244 Sum_probs=34.8 Q ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHH Q ss_conf 38999998877898986063055788899999851047643344442046677655440375633323105 Q 537021.9.peg.7 334 AQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLF 404 (1033) Q Consensus 334 ~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlF 404 (1033) .+.+.+|+.+|++- -+--..|++.+ -+|=|=||+-||+++..|.-.= |-++-+-| T Consensus 115 ~d~i~d~iRk~~E~---cD~l~gf~~~~-----------Sl~GGTGSGlGs~i~e~L~d~y--pk~~~~~~ 169 (475) T 3cb2_A 115 HEDIFDIIDREADG---SDSLEGFVLCH-----------SIAGGTGSGLGSYLLERLNDRY--PKKLVQTY 169 (475) T ss_dssp HHHHHHHHHHHHHT---CSSCCEEEEEE-----------ESSSSHHHHHHHHHHHHHHHHS--TTSEEEEE T ss_pred HHHHHHHHHHHHHC---CCCCCCEEEEE-----------CCCCCCCCCHHHHHHHHHHHHC--CCCEEEEE T ss_conf 99999999877741---89855631231-----------0588877748899999987648--76247878 No 252 >1wyz_A Putative S-adenosylmethionine-dependent methytransferase; northeast structural genomics consortium, BTR28; 2.50A {Bacteroides thetaiotaomicron vpi-5482} SCOP: c.90.1.1 Probab=20.12 E-value=23 Score=11.23 Aligned_cols=103 Identities=16% Similarity=0.212 Sum_probs=52.9 Q ss_pred HHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHCCCCCCCCCCCCCHHH Q ss_conf 89860630557888999998510476433444420466776554403756333231056762100145787533365687 Q 537021.9.peg.7 347 VIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTITDIDPLRFSLLFERFLNPDRMSMPDFDIDFCQDR 426 (1033) Q Consensus 347 vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT~VDPi~~~LlFERFLnp~R~~~PDIDiDf~~~~ 426 (1033) +|.+-|.+.-.===+.+|++|+++||.|-|==|-. |+++ +|-+...+ ...-.|.-||-.. ... T Consensus 88 lvSDAGtP~ISDPG~~LV~~a~~~~i~V~~iPGpS--A~i~-AL~~SGl~--~~~F~F~GFLP~k------------~~~ 150 (242) T 1wyz_A 88 VISEAGCPAVADPGADVVAIAQRQKLKVIPLVGPS--SIIL-SVMASGFN--GQSFAFHGYLPIE------------PGE 150 (242) T ss_dssp EECC-------CHHHHHHHHHHHTTCCEEECCCCC--HHHH-HHHHHTSC--SSSEEEEEECCSS------------TTH T ss_pred EEECCCCCCCCCCCHHHHHHHHHCCCEEEEECCCC--HHHH-HHHHCCCC--CCCEEEEEECCCC------------HHH T ss_conf 99268998784962999999995896799854750--5878-98863788--8853899712686------------788 Q ss_pred HHHHHHHHHHHCCCCCEEECCCHHHHHHHHHHHHHHHHCC Q ss_conf 8999999983038021554220778778899999997527 Q 537021.9.peg.7 427 RDEVIRYVQNKYGHERVAQIITFGSLQAKAALRDVGRALQ 466 (1033) Q Consensus 427 R~~vi~y~~~kyG~~~va~i~t~~t~~~k~aird~~r~~g 466 (1033) |.+.++-+.++.-.....+|.--+..+-...+.|+..++| T Consensus 151 r~k~l~~l~~~~~~~~~t~if~Esp~Ri~~~l~~l~~~~~ 190 (242) T 1wyz_A 151 RAKKLKTLEQRVYAESQTQLFIETPYRNHKMIEDILQNCR 190 (242) T ss_dssp HHHHHHHHHHHHHHHTCEEEEEECGGGHHHHHHHHHHHSC T ss_pred HHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHC T ss_conf 9999999987653046628999722789999999999757 No 253 >2kng_A Protein LSR2; DNA-binding domain, immune response, DNA binding protein; NMR {Mycobacterium tuberculosis} Probab=20.11 E-value=23 Score=11.23 Aligned_cols=21 Identities=29% Similarity=0.528 Sum_probs=10.9 Q ss_pred HHHHHHHCCCCCCCCCCCCHHH Q ss_conf 9999851047643344442046 Q 537021.9.peg.7 362 DFIQWAKKNNIPVGPGRGSGAG 383 (1033) Q Consensus 362 Div~~ak~~gI~vGpGRGSAag 383 (1033) .|=.||+++|+.|. -||=-.- T Consensus 17 ~IR~WAr~~G~~Vs-~RGRIp~ 37 (55) T 2kng_A 17 AIREWARRNGHNVS-TRGRIPA 37 (55) T ss_dssp HHHHHHHHTTCCCC-SSSCCCH T ss_pred HHHHHHHHCCCCCC-CCCCCCH T ss_conf 99999999388378-7898889 No 254 >1vkc_A Putative acetyl transferase; structural genomics, southeast collaboratory for structural genomics, secsg, protein structure initiative, PSI; 1.89A {Pyrococcus furiosus} SCOP: d.108.1.1 Probab=20.10 E-value=23 Score=11.22 Aligned_cols=16 Identities=6% Similarity=-0.172 Sum_probs=6.2 Q ss_pred HHHHHHHHHHHHCCCE Q ss_conf 9999999999818989 Q 537021.9.peg.7 196 RESQVVQLAYTHELPL 211 (1033) Q Consensus 196 ~~~~l~~lA~~~~iPl 211 (1033) +...+.+.|++.|+.- T Consensus 112 Ll~~~~~~a~~~G~~~ 127 (158) T 1vkc_A 112 LLRKAEEWAKERGAKK 127 (158) T ss_dssp HHHHHHHHHHHTTCSC T ss_pred HHHHHHHHHHHCCCCE T ss_conf 9999999999869988 No 255 >1tvk_B Tubulin beta chain; epothilone, taxol, ligand interactions, cell cycle, structural protein; HET: GDP GTP EP; 2.89A {Bos taurus} SCOP: c.32.1.1 d.79.2.1 PDB: 1tub_B* 1jff_B* 1ia0_B* 1ffx_B* 1sa0_B* 1sa1_B* 2hxf_B* 2hxh_B* 2p4n_B* 2wbe_B* 3dco_B* 3edl_B* 1z2b_B* 3hke_B* 3du7_B* 3e22_B* 3hkc_B* 3hkd_B* 3hkb_B* 3n2g_B* ... Probab=20.02 E-value=14 Score=13.23 Aligned_cols=47 Identities=17% Similarity=0.194 Sum_probs=33.5 Q ss_pred CHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCC Q ss_conf 8389999988778989860630557888999998510476433444420466776554403 Q 537021.9.peg.7 333 HAQDYRQRLDFELNVIARMKFSGYFLIVADFIQWAKKNNIPVGPGRGSGAGSVTAYALTIT 393 (1033) Q Consensus 333 ~~~~y~~RLe~EL~vI~~~gf~~YFLiV~Div~~ak~~gI~vGpGRGSAagSLVaY~LgIT 393 (1033) ..+.+.+|+.+|++-- +--+.|++.+ .+ -| |=||+-||.++..|.-. T Consensus 112 ~~~~i~d~irk~~E~c---D~l~gf~~~~---------Sl-~G-GTGSGlgs~l~e~l~d~ 158 (427) T 1tvk_B 112 LVDSVLDVVRKESESC---DCLQGFQLTH---------SL-GG-GTGSGMGTLLISKIREE 158 (427) T ss_dssp TTTTHHHHHHHHHTSS---SCEEEEECEE---------ES-SS-STTTHHHHHHHHHHTTT T ss_pred HHHHHHHHHHHHHHHC---CCCCCEEEEE---------CC-CC-CCCCCHHHHHHHHHHHH T ss_conf 9999999999998717---8755734340---------04-77-67775888899999876 Done!