Query gi|254780799|ref|YP_003065212.1| DNA translocase FtsK [Candidatus Liberibacter asiaticus str. psy62] Match_columns 806 No_of_seqs 311 out of 1864 Neff 8.3 Searched_HMMs 39220 Date Sun May 29 18:35:45 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780799.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK10263 DNA translocase FtsK; 100.0 0 0 1947.3 61.0 488 306-801 863-1353(1355) 2 COG1674 FtsK DNA segregation A 100.0 0 0 1254.6 41.2 471 302-801 376-856 (858) 3 pfam01580 FtsK_SpoIIIE FtsK/Sp 100.0 0 0 397.2 17.9 197 418-637 2-202 (202) 4 smart00843 Ftsk_gamma This dom 99.9 5.6E-24 1.4E-28 199.2 5.6 63 737-799 1-63 (63) 5 pfam09397 Ftsk_gamma Ftsk gamm 99.9 6.7E-24 1.7E-28 198.6 5.5 66 736-801 2-67 (67) 6 COG1674 FtsK DNA segregation A 99.1 1E-10 2.5E-15 100.0 4.7 200 454-698 286-487 (858) 7 cd01127 TrwB Bacterial conjuga 98.0 6.8E-06 1.7E-10 63.8 4.5 53 441-499 30-82 (410) 8 COG0433 HerA helicase [Replica 98.0 0.00039 1E-08 50.6 12.6 71 592-667 393-467 (520) 9 pfam10412 TrwB_AAD_bind Type I 97.8 3.7E-05 9.5E-10 58.2 4.6 61 592-658 245-311 (386) 10 smart00382 AAA ATPases associa 97.6 0.00021 5.3E-09 52.7 6.4 141 453-654 1-144 (148) 11 TIGR02759 TraD_Ftype type IV c 97.5 7.4E-05 1.9E-09 56.0 2.8 23 456-478 210-232 (613) 12 PRK13700 conjugal transfer pro 97.4 0.00026 6.7E-09 51.9 4.4 211 442-667 174-495 (732) 13 pfam01935 DUF87 Domain of unkn 97.3 0.00048 1.2E-08 50.0 5.1 55 438-495 3-61 (218) 14 COG3451 VirB4 Type IV secretor 97.3 0.00068 1.7E-08 48.8 5.5 98 427-528 406-521 (796) 15 cd01122 GP4d_helicase GP4d_hel 97.0 0.0039 9.9E-08 43.1 7.2 142 457-635 33-193 (271) 16 cd00984 DnaB_C DnaB helicase C 97.0 0.013 3.2E-07 39.3 9.6 142 457-635 16-173 (242) 17 cd00009 AAA The AAA+ (ATPases 96.9 0.01 2.7E-07 39.9 8.8 58 589-654 83-145 (151) 18 PRK13898 type IV secretion sys 96.9 0.0024 6.1E-08 44.7 5.5 213 435-655 425-708 (800) 19 cd01126 TraG_VirD4 The TraG/Tr 96.9 0.0011 2.8E-08 47.2 3.7 40 591-636 264-303 (384) 20 PRK13873 conjugal transfer ATP 96.6 0.0024 6.2E-08 44.7 4.0 305 433-782 418-808 (815) 21 PRK13853 type IV secretion sys 96.5 0.0054 1.4E-07 42.0 5.1 213 434-655 404-693 (789) 22 PRK10263 DNA translocase FtsK; 96.5 0.058 1.5E-06 34.3 10.3 173 16-193 14-196 (1355) 23 PRK00411 cdc6 cell division co 96.5 0.015 3.8E-07 38.7 7.2 41 453-495 54-94 (394) 24 TIGR02746 TraC-F-type type-IV 96.4 0.0027 7E-08 44.3 2.9 333 328-700 375-845 (900) 25 cd01131 PilT Pilus retraction 96.4 0.0088 2.3E-07 40.5 5.4 38 457-499 4-42 (198) 26 PRK13891 conjugal transfer pro 96.3 0.0053 1.4E-07 42.1 4.1 303 327-654 347-752 (852) 27 PRK13833 conjugal transfer pro 96.3 0.0057 1.5E-07 41.9 4.1 90 397-503 101-190 (323) 28 PRK00440 rfc replication facto 96.2 0.016 4E-07 38.6 5.8 31 451-481 34-64 (318) 29 PRK13830 conjugal transfer pro 96.1 0.0085 2.2E-07 40.6 4.1 212 436-656 435-719 (818) 30 pfam02534 TraG TraG/TraD famil 96.0 0.0065 1.7E-07 41.4 3.2 64 588-683 299-366 (468) 31 pfam03796 DnaB_C DnaB-like hel 95.9 0.14 3.7E-06 31.4 9.9 41 593-634 132-180 (186) 32 PRK13851 type IV secretion sys 95.7 0.0078 2E-07 40.9 2.6 40 456-501 164-203 (343) 33 cd03271 ABC_UvrA_II The excisi 95.7 0.21 5.4E-06 30.1 9.9 31 451-481 18-48 (261) 34 cd01130 VirB11-like_ATPase Typ 95.6 0.01 2.6E-07 40.0 3.0 51 455-511 26-77 (186) 35 PRK13900 type IV secretion sys 95.6 0.0093 2.4E-07 40.3 2.8 50 456-511 162-213 (332) 36 PRK06871 DNA polymerase III su 95.5 0.056 1.4E-06 34.4 6.4 180 452-710 20-216 (324) 37 TIGR01420 pilT_fam twitching m 95.4 0.011 2.8E-07 39.7 2.5 32 447-478 118-151 (350) 38 PRK13721 conjugal transfer ATP 95.2 0.024 6E-07 37.2 3.8 184 438-636 454-730 (864) 39 smart00487 DEXDc DEAD-like hel 95.2 0.19 4.8E-06 30.5 8.2 38 455-494 25-62 (201) 40 TIGR00929 VirB4_CagE type IV s 95.0 0.032 8.2E-07 36.2 3.9 314 356-707 385-849 (931) 41 pfam06745 KaiC KaiC. This fami 94.9 0.26 6.7E-06 29.4 8.4 133 456-636 21-166 (231) 42 pfam00735 Septin Septin. Membe 94.8 0.011 2.9E-07 39.7 1.2 181 456-659 6-208 (280) 43 PRK04328 hypothetical protein; 94.7 0.41 1E-05 28.0 9.8 134 456-638 26-177 (250) 44 pfam00437 GSPII_E Type II/IV s 94.7 0.026 6.5E-07 37.0 2.7 90 393-501 90-181 (283) 45 cd01850 CDC_Septin CDC/Septin. 94.6 0.021 5.3E-07 37.7 2.2 154 456-628 6-179 (276) 46 pfam04157 EAP30 EAP30/Vps36 fa 94.6 0.16 4E-06 31.1 6.6 87 695-785 129-216 (219) 47 cd04104 p47_IIGP_like p47 (47- 94.5 0.026 6.6E-07 37.0 2.4 20 456-475 3-22 (197) 48 pfam06814 Lung_7-TM_R Lung sev 94.5 0.27 7E-06 29.3 7.7 79 47-126 11-89 (287) 49 pfam05707 Zot Zonular occluden 94.4 0.025 6.4E-07 37.1 2.2 53 592-647 72-126 (183) 50 pfam05872 DUF853 Bacterial pro 94.4 0.03 7.7E-07 36.5 2.6 266 438-721 4-410 (504) 51 TIGR02788 VirB11 P-type DNA tr 94.3 0.046 1.2E-06 35.1 3.3 62 432-513 151-213 (328) 52 COG3596 Predicted GTPase [Gene 94.2 0.028 7.1E-07 36.7 2.1 73 456-535 41-119 (296) 53 COG2804 PulE Type II secretory 94.2 0.043 1.1E-06 35.3 2.9 112 374-496 177-297 (500) 54 cd00046 DEXDc DEAD-like helica 94.1 0.075 1.9E-06 33.5 4.2 27 455-481 1-27 (144) 55 PRK08187 pyruvate kinase; Vali 94.0 0.16 4.1E-06 31.0 5.7 280 328-653 171-550 (606) 56 pfam06414 Zeta_toxin Zeta toxi 93.9 0.091 2.3E-06 32.9 4.2 92 456-559 14-116 (191) 57 cd03270 ABC_UvrA_I The excisio 93.9 0.47 1.2E-05 27.6 7.8 42 591-636 158-200 (226) 58 pfam05621 TniB Bacterial TniB 93.7 0.55 1.4E-05 27.0 7.9 153 589-796 144-301 (302) 59 COG2805 PilT Tfp pilus assembl 93.7 0.048 1.2E-06 34.9 2.4 114 447-635 116-232 (353) 60 PRK13894 conjugal transfer ATP 93.5 0.082 2.1E-06 33.2 3.4 82 402-501 111-193 (320) 61 COG4185 Uncharacterized protei 93.2 0.45 1.1E-05 27.7 6.8 120 454-632 1-128 (187) 62 TIGR02533 type_II_gspE general 93.2 0.034 8.5E-07 36.1 1.1 70 390-474 194-265 (495) 63 KOG2655 consensus 93.2 0.062 1.6E-06 34.1 2.3 193 456-677 23-239 (366) 64 TIGR00099 Cof-subfamily Cof-li 93.1 0.63 1.6E-05 26.6 7.4 88 606-713 14-104 (270) 65 pfam09726 Macoilin Transmembra 92.9 0.56 1.4E-05 26.9 6.9 30 73-105 41-71 (680) 66 cd01129 PulE-GspE PulE/GspE Th 92.8 0.15 3.8E-06 31.2 3.9 47 448-500 72-121 (264) 67 PRK13897 type IV secretion sys 92.7 0.035 9E-07 35.9 0.6 65 589-659 403-474 (628) 68 PTZ00301 uridine kinase; Provi 92.7 0.15 3.8E-06 31.3 3.8 41 454-494 1-43 (210) 69 TIGR01666 YCCS hypothetical me 92.7 0.9 2.3E-05 25.4 11.9 91 76-172 22-112 (713) 70 TIGR01667 YCCS_YHJK integral m 92.0 1.1 2.7E-05 24.8 13.2 69 72-140 18-86 (724) 71 TIGR02163 napH_ ferredoxin-typ 91.3 1.1 2.7E-05 24.8 6.8 80 22-104 3-101 (263) 72 PRK04195 replication factor C 91.3 1.3 3.2E-05 24.3 8.7 20 455-474 41-60 (403) 73 COG5019 CDC3 Septin family pro 91.3 0.15 3.9E-06 31.2 2.4 26 457-482 26-51 (373) 74 KOG2927 consensus 91.2 1.2 3.2E-05 24.4 7.0 18 134-151 217-234 (372) 75 pfam00270 DEAD DEAD/DEAH box h 91.1 1.3 3.4E-05 24.2 7.4 15 456-470 16-30 (167) 76 KOG0735 consensus 90.9 0.19 4.7E-06 30.5 2.6 61 451-514 428-488 (952) 77 PRK00091 miaA tRNA delta(2)-is 90.8 0.12 3.1E-06 31.9 1.6 68 452-527 2-80 (304) 78 COG5008 PilU Tfp pilus assembl 90.8 0.16 4.1E-06 31.0 2.2 100 338-490 59-160 (375) 79 TIGR02782 TrbB_P P-type conjug 90.6 0.36 9.2E-06 28.4 3.8 35 457-492 142-176 (315) 80 PRK09112 DNA polymerase III su 90.6 0.36 9.2E-06 28.4 3.8 133 452-646 42-190 (352) 81 KOG2568 consensus 90.5 1.5 3.7E-05 23.8 9.7 73 46-119 171-243 (518) 82 pfam07693 KAP_NTPase KAP famil 90.5 1.5 3.7E-05 23.8 7.7 54 591-650 161-214 (301) 83 TIGR01271 CFTR_protein cystic 90.5 0.14 3.7E-06 31.4 1.7 29 443-471 437-469 (1534) 84 cd03225 ABC_cobalt_CbiO_domain 90.5 0.93 2.4E-05 25.3 5.9 151 449-636 22-195 (211) 85 PRK08233 hypothetical protein; 90.3 0.23 5.8E-06 29.9 2.6 22 457-478 6-27 (182) 86 cd02025 PanK Pantothenate kina 90.2 0.34 8.6E-06 28.6 3.4 35 458-493 3-38 (220) 87 PRK01077 cobyrinic acid a,c-di 90.0 0.68 1.7E-05 26.3 4.8 55 661-718 262-328 (451) 88 PRK10436 hypothetical protein; 89.9 0.49 1.2E-05 27.4 4.0 112 374-495 134-253 (461) 89 COG2401 ABC-type ATPase fused 89.8 0.46 1.2E-05 27.6 3.8 170 436-654 387-581 (593) 90 PRK07270 DNA polymerase III su 89.7 0.47 1.2E-05 27.6 3.8 46 692-751 503-549 (557) 91 TIGR02524 dot_icm_DotB Dot/Icm 89.5 0.34 8.8E-06 28.5 3.0 124 344-506 56-183 (358) 92 TIGR02525 plasmid_TraJ plasmid 89.4 0.21 5.4E-06 30.1 1.9 74 377-494 111-188 (374) 93 COG1474 CDC6 Cdc6-related prot 89.0 1.7 4.4E-05 23.3 6.3 164 456-679 44-211 (366) 94 COG4962 CpaF Flp pilus assembl 88.9 0.47 1.2E-05 27.5 3.3 54 455-517 174-227 (355) 95 cd03291 ABCC_CFTR1 The CFTR su 88.7 0.58 1.5E-05 26.8 3.7 59 443-510 48-111 (282) 96 TIGR03499 FlhF flagellar biosy 88.4 0.81 2.1E-05 25.7 4.3 68 458-525 198-269 (282) 97 pfam04851 ResIII Type III rest 88.4 0.92 2.3E-05 25.3 4.5 40 453-496 17-56 (103) 98 pfam05049 IIGP Interferon-indu 88.3 0.28 7.2E-06 29.2 1.9 19 456-474 37-55 (375) 99 COG0630 VirB11 Type IV secreto 88.0 2.1 5.3E-05 22.7 6.2 148 355-510 33-194 (312) 100 pfam05802 EspB Enterobacterial 88.0 1.4 3.6E-05 23.9 5.3 65 524-625 253-317 (317) 101 PRK05416 hypothetical protein; 87.9 0.42 1.1E-05 27.9 2.5 32 452-493 4-35 (292) 102 PRK06995 flhF flagellar biosyn 87.8 1 2.7E-05 24.9 4.6 69 456-525 179-251 (404) 103 TIGR03015 pepcterm_ATPase puta 87.6 2.3 5.9E-05 22.4 9.0 37 455-495 44-81 (269) 104 pfam02689 Herpes_Helicase Heli 87.5 0.35 8.9E-06 28.5 2.0 24 408-431 454-485 (801) 105 COG1660 Predicted P-loop-conta 87.5 0.45 1.1E-05 27.7 2.5 68 454-533 1-79 (286) 106 pfam04548 AIG1 AIG1 family. Ar 87.3 0.33 8.3E-06 28.7 1.7 25 456-480 2-26 (200) 107 COG0419 SbcC ATPase involved i 87.1 0.69 1.8E-05 26.3 3.3 11 402-412 461-471 (908) 108 pfam02456 Adeno_IVa2 Adenoviru 86.9 0.42 1.1E-05 27.9 2.1 33 458-490 91-123 (370) 109 cd03249 ABC_MTABC3_MDL1_MDL2 M 86.5 2.6 6.7E-05 21.9 6.1 179 449-678 24-228 (238) 110 KOG0991 consensus 86.5 0.63 1.6E-05 26.6 2.8 44 440-483 27-77 (333) 111 PRK04132 replication factor C 86.5 0.26 6.5E-06 29.5 0.8 78 471-550 411-493 (863) 112 PRK07667 uridine kinase; Provi 86.4 0.59 1.5E-05 26.8 2.6 20 459-478 19-38 (190) 113 pfam05673 DUF815 Protein of un 86.1 0.77 2E-05 25.9 3.1 71 411-497 20-92 (248) 114 PRK04220 2-phosphoglycerate ki 86.0 0.4 1E-05 28.0 1.6 125 456-657 94-221 (306) 115 cd03251 ABCC_MsbA MsbA is an e 86.0 0.99 2.5E-05 25.1 3.6 177 449-678 23-227 (234) 116 TIGR03269 met_CoM_red_A2 methy 86.0 2.8 7.1E-05 21.8 7.5 41 308-353 185-225 (520) 117 pfam03668 ATP_bind_2 P-loop AT 85.9 0.62 1.6E-05 26.6 2.5 30 454-493 1-30 (284) 118 PRK09435 arginine/ornithine tr 85.6 1.3 3.4E-05 24.1 4.1 43 456-504 51-93 (325) 119 pfam04602 Arabinose_trans Myco 85.6 2.9 7.4E-05 21.6 9.2 94 51-149 461-575 (1075) 120 PRK11664 ATP-dependent RNA hel 85.5 0.97 2.5E-05 25.2 3.3 96 613-717 471-578 (812) 121 cd03279 ABC_sbcCD SbcCD and ot 85.0 1.5 3.7E-05 23.8 4.1 40 446-486 17-59 (213) 122 CHL00081 chlI Mg-protoporyphyr 85.0 0.69 1.8E-05 26.3 2.4 46 739-789 264-309 (347) 123 cd03114 ArgK-like The function 84.8 1.2 3E-05 24.5 3.5 35 459-495 4-38 (148) 124 PRK12402 replication factor C 84.7 0.7 1.8E-05 26.2 2.3 30 452-481 34-63 (337) 125 KOG1384 consensus 84.7 0.41 1E-05 28.0 1.1 50 456-514 9-71 (348) 126 cd01120 RecA-like_NTPases RecA 84.6 3.2 8.1E-05 21.3 11.0 47 592-638 87-140 (165) 127 cd01124 KaiC KaiC is a circadi 84.6 3.2 8.1E-05 21.3 8.5 43 593-635 98-140 (187) 128 COG0324 MiaA tRNA delta(2)-iso 84.5 0.5 1.3E-05 27.3 1.5 74 453-532 2-87 (308) 129 KOG2035 consensus 84.5 3.2 8.2E-05 21.3 6.4 166 452-713 32-203 (351) 130 COG4901 Ribosomal protein S25 84.3 1.3 3.4E-05 24.1 3.6 63 738-800 42-104 (107) 131 PRK13652 cbiO cobalt transport 84.0 1.8 4.6E-05 23.2 4.2 100 590-716 156-256 (277) 132 pfam07787 DUF1625 Protein of u 84.0 2.7 7E-05 21.8 5.1 24 107-130 181-204 (247) 133 TIGR00618 sbcc exonuclease Sbc 84.0 0.79 2E-05 25.9 2.3 12 61-72 30-41 (1063) 134 PRK10334 mechanosensitive chan 84.0 3.4 8.6E-05 21.1 11.1 54 108-163 67-120 (285) 135 pfam03308 ArgK ArgK protein. T 84.0 1.4 3.7E-05 23.9 3.7 38 456-495 31-68 (267) 136 cd03115 SRP The signal recogni 83.8 3.4 8.7E-05 21.1 5.7 74 457-532 3-80 (173) 137 pfam08282 Hydrolase_3 haloacid 83.6 2.2 5.6E-05 22.5 4.5 82 609-714 15-96 (254) 138 cd03243 ABC_MutS_homologs The 83.6 1.7 4.3E-05 23.4 3.9 143 427-637 2-153 (202) 139 PRK10649 hypothetical protein; 83.6 3.5 8.9E-05 21.0 11.2 13 589-601 482-494 (564) 140 PRK05707 DNA polymerase III su 83.4 1.6 4.2E-05 23.5 3.8 143 451-666 18-174 (328) 141 PRK12339 2-phosphoglycerate ki 83.3 0.59 1.5E-05 26.8 1.5 110 457-647 6-115 (197) 142 PRK05703 flhF flagellar biosyn 83.3 2.6 6.7E-05 21.9 4.8 67 458-524 214-284 (412) 143 cd03282 ABC_MSH4_euk MutS4 hom 83.3 2.3 5.8E-05 22.4 4.5 101 428-537 3-103 (204) 144 PRK06067 flagellar accessory p 83.2 3.6 9.2E-05 20.9 9.2 130 456-636 34-172 (241) 145 PTZ00209 retrotransposon hot s 83.2 1 2.6E-05 25.0 2.6 38 452-490 171-208 (693) 146 PRK11131 ATP-dependent RNA hel 83.1 1 2.6E-05 25.0 2.6 13 542-554 1124-1136(1295) 147 PRK10744 phosphate transporter 83.0 3.6 9.3E-05 20.9 10.2 154 445-636 27-213 (257) 148 PRK13768 GTPase; Provisional 82.8 3.5 9E-05 21.0 5.3 76 456-534 4-85 (253) 149 PRK12363 phosphoglycerol trans 82.4 3.8 9.8E-05 20.7 7.9 52 394-450 374-427 (703) 150 PRK01172 ski2-like helicase; P 82.4 3.9 9.8E-05 20.7 10.0 18 328-345 245-262 (674) 151 PRK09586 murP N-acetylmuramic 82.3 3.9 9.9E-05 20.7 7.4 25 79-103 162-187 (474) 152 TIGR02538 type_IV_pilB type IV 82.2 0.68 1.7E-05 26.3 1.4 19 456-474 328-346 (577) 153 PRK12438 hypothetical protein; 82.2 3.9 1E-04 20.6 9.0 163 486-685 654-835 (979) 154 TIGR01064 pyruv_kin pyruvate k 82.0 0.97 2.5E-05 25.2 2.2 66 597-685 259-338 (513) 155 PRK13631 cbiO cobalt transport 82.0 1.1 2.7E-05 24.9 2.4 27 449-475 47-73 (320) 156 TIGR00174 miaA tRNA delta(2)-i 81.9 0.64 1.6E-05 26.5 1.2 10 459-468 4-13 (307) 157 COG0470 HolB ATPase involved i 81.7 1.2 3.1E-05 24.4 2.6 31 453-483 22-53 (325) 158 PRK13850 type IV secretion sys 81.6 0.39 9.9E-06 28.1 0.0 110 591-708 390-542 (670) 159 COG4129 Predicted membrane pro 81.5 4.1 0.00011 20.5 12.0 34 67-100 63-96 (332) 160 PRK10525 cytochrome o ubiquino 81.5 4.1 0.00011 20.5 9.6 40 65-104 31-70 (305) 161 PRK09270 frcK putative fructos 81.4 1.9 4.8E-05 23.0 3.4 135 458-626 38-189 (230) 162 PRK12337 2-phosphoglycerate ki 81.1 0.82 2.1E-05 25.7 1.5 282 369-781 198-489 (492) 163 PRK10869 recombination and rep 81.0 0.92 2.3E-05 25.3 1.8 33 679-711 413-445 (553) 164 cd03286 ABC_MSH6_euk MutS6 hom 81.0 3.1 7.9E-05 21.4 4.4 103 427-538 2-105 (218) 165 PRK11465 hypothetical protein; 80.9 4.3 0.00011 20.3 13.8 13 774-786 703-715 (741) 166 cd03241 ABC_RecN RecN ATPase i 80.8 0.94 2.4E-05 25.3 1.8 21 613-633 228-248 (276) 167 pfam04466 Terminase_3 Phage te 80.5 2.6 6.7E-05 21.9 4.0 129 458-635 6-138 (387) 168 PRK10530 phosphotransferase; P 80.5 4.4 0.00011 20.2 6.2 86 607-715 18-103 (272) 169 PTZ00207 hypothetical protein; 80.5 4.5 0.00011 20.2 8.8 110 62-172 69-193 (591) 170 KOG0979 consensus 80.4 1.2 3.1E-05 24.4 2.2 25 392-416 500-527 (1072) 171 pfam03137 OATP Organic Anion T 80.3 4.5 0.00011 20.2 15.2 50 357-411 359-409 (582) 172 pfam00485 PRK Phosphoribulokin 80.1 1.7 4.4E-05 23.3 2.9 22 457-478 2-23 (196) 173 COG1643 HrpA HrpA-like helicas 80.0 2 5E-05 22.9 3.2 48 430-483 450-497 (845) 174 cd02024 NRK1 Nicotinamide ribo 80.0 1.2 3.2E-05 24.4 2.1 39 458-508 3-41 (187) 175 cd03245 ABCC_bacteriocin_expor 80.0 1.7 4.4E-05 23.3 2.9 151 449-637 25-201 (220) 176 PRK09302 circadian clock prote 79.9 4.6 0.00012 20.1 7.6 262 329-668 142-427 (501) 177 cd03260 ABC_PstB_phosphate_tra 79.4 4.8 0.00012 20.0 7.3 145 453-636 25-201 (227) 178 PRK08118 topology modulation p 79.4 1 2.6E-05 25.0 1.6 18 454-471 1-18 (167) 179 PRK11512 DNA-binding transcrip 79.2 3.7 9.5E-05 20.8 4.4 56 746-801 45-101 (144) 180 PRK06749 replicative DNA helic 79.2 4.9 0.00012 19.9 10.0 141 456-635 188-350 (428) 181 pfam00350 Dynamin_N Dynamin fa 79.1 1.5 3.7E-05 23.8 2.3 21 457-477 1-21 (168) 182 PRK13767 ATP-dependent helicas 78.9 3.8 9.6E-05 20.7 4.3 18 750-767 734-751 (878) 183 cd02026 PRK Phosphoribulokinas 78.8 2.4 6.1E-05 22.2 3.3 33 458-494 3-35 (273) 184 KOG0926 consensus 78.7 0.46 1.2E-05 27.6 -0.4 242 409-717 231-509 (1172) 185 PRK05917 DNA polymerase III su 78.6 2.1 5.4E-05 22.6 3.0 150 452-712 16-174 (290) 186 pfam05116 S6PP Sucrose-6F-phos 78.6 1.7 4.3E-05 23.4 2.4 59 590-649 1-59 (247) 187 cd03280 ABC_MutS2 MutS2 homolo 78.4 3.8 9.8E-05 20.7 4.2 98 427-537 2-103 (200) 188 cd01852 AIG1 AIG1 (avrRpt2-ind 78.1 0.68 1.7E-05 26.3 0.3 21 456-476 2-22 (196) 189 cd03224 ABC_TM1139_LivF_branch 78.1 5.2 0.00013 19.7 7.8 148 449-636 21-193 (222) 190 cd03227 ABC_Class2 ABC-type Cl 78.0 3.1 8E-05 21.4 3.7 31 455-485 22-52 (162) 191 TIGR03600 phage_DnaB phage rep 77.9 5.3 0.00013 19.7 8.2 140 456-635 196-354 (421) 192 PRK11007 trehalose(maltose)-sp 77.9 5.3 0.00013 19.7 9.9 17 84-100 228-244 (473) 193 TIGR00235 udk uridine kinase; 77.8 2.2 5.7E-05 22.4 2.9 47 458-509 14-63 (220) 194 pfam00488 MutS_V MutS domain V 77.8 5.3 0.00014 19.6 5.1 100 427-538 14-117 (234) 195 KOG3347 consensus 77.6 1 2.6E-05 25.1 1.1 19 453-471 6-24 (176) 196 PRK00192 mannosyl-3-phosphogly 77.5 2.3 5.8E-05 22.4 2.9 50 588-639 3-52 (275) 197 PRK07429 phosphoribulokinase; 77.3 1.6 4.2E-05 23.5 2.1 220 457-719 11-284 (331) 198 COG0497 RecN ATPase involved i 77.2 1.5 3.9E-05 23.7 1.9 26 336-361 471-496 (557) 199 PRK11192 ATP-dependent RNA hel 76.9 4 0.0001 20.5 4.0 15 456-470 42-56 (417) 200 COG1122 CbiO ABC-type cobalt t 76.4 2.8 7E-05 21.8 3.0 161 447-635 23-199 (235) 201 cd03289 ABCC_CFTR2 The CFTR su 76.4 2.3 5.9E-05 22.4 2.6 73 593-682 159-231 (275) 202 TIGR02198 rfaE_dom_I rfaE bifu 76.1 1.7 4.2E-05 23.4 1.8 51 469-524 158-208 (321) 203 PRK10261 glutathione transport 75.9 3.3 8.5E-05 21.1 3.4 28 448-475 344-371 (623) 204 pfam03408 Foamy_virus_ENV Foam 75.9 4.6 0.00012 20.1 4.1 32 597-629 572-609 (985) 205 cd03247 ABCC_cytochrome_bd The 75.8 2.5 6.3E-05 22.1 2.7 28 449-476 23-50 (178) 206 pfam00224 PK Pyruvate kinase, 75.7 1.6 4E-05 23.6 1.7 234 328-652 39-296 (348) 207 PRK06739 pyruvate kinase; Vali 75.5 1.6 4.2E-05 23.5 1.7 167 510-719 160-341 (352) 208 COG3378 Phage associated DNA p 75.5 2.5 6.3E-05 22.1 2.6 43 434-476 210-252 (517) 209 PRK07399 DNA polymerase III su 75.5 2.6 6.5E-05 22.0 2.7 36 452-487 23-59 (314) 210 TIGR02324 CP_lyasePhnL phospho 75.4 2 5.1E-05 22.8 2.1 11 311-321 169-179 (224) 211 TIGR00678 holB DNA polymerase 75.4 2.8 7.1E-05 21.8 2.8 30 452-481 11-41 (216) 212 pfam01047 MarR MarR family. Th 75.3 4.2 0.00011 20.4 3.7 52 745-796 7-59 (59) 213 pfam07999 RHSP Retrotransposon 75.3 0.41 1.1E-05 27.9 -1.4 69 413-491 174-242 (597) 214 PRK04214 rbn ribonuclease BN/u 75.2 6.2 0.00016 19.1 17.4 40 16-55 21-66 (411) 215 PRK11773 uvrD DNA-dependent he 75.1 6.2 0.00016 19.1 5.4 19 336-354 360-378 (722) 216 cd03284 ABC_MutS1 MutS1 homolo 75.0 4.8 0.00012 20.0 4.0 119 457-637 33-155 (216) 217 cd03288 ABCC_SUR2 The SUR doma 75.0 2.6 6.7E-05 21.9 2.6 180 451-682 44-249 (257) 218 PRK13646 cbiO cobalt transport 74.8 2.4 6.1E-05 22.3 2.4 104 593-711 166-284 (286) 219 TIGR01967 DEAH_box_HrpA ATP-de 74.7 1.3 3.2E-05 24.3 1.0 16 331-346 1171-1186(1320) 220 COG0178 UvrA Excinuclease ATPa 74.6 2.3 5.8E-05 22.4 2.2 122 341-482 527-655 (935) 221 cd02023 UMPK Uridine monophosp 74.6 3 7.6E-05 21.5 2.8 33 457-493 2-34 (198) 222 cd03244 ABCC_MRP_domain2 Domai 74.5 2.6 6.7E-05 21.9 2.5 149 449-636 25-199 (221) 223 KOG0060 consensus 74.3 4 0.0001 20.5 3.4 155 444-637 447-629 (659) 224 pfam10236 DAP3 Mitochondrial r 74.2 2.9 7.3E-05 21.7 2.7 142 456-605 25-171 (274) 225 TIGR03167 tRNA_sel_U_synt tRNA 74.1 2.8 7.1E-05 21.8 2.6 13 458-470 131-143 (311) 226 PRK08769 DNA polymerase III su 74.0 3.5 9E-05 21.0 3.1 155 451-667 22-182 (319) 227 cd07029 RNAP_I_III_AC19 AC19 s 73.9 2.2 5.7E-05 22.5 2.0 46 491-538 31-82 (85) 228 PRK13876 conjugal transfer cou 73.7 0.86 2.2E-05 25.6 -0.1 104 591-700 401-547 (662) 229 PRK00090 bioD dithiobiotin syn 73.6 3.7 9.4E-05 20.8 3.1 17 457-473 2-19 (223) 230 cd01125 repA Hexameric Replica 73.5 6.7 0.00017 18.9 4.4 44 590-635 111-159 (239) 231 PRK00984 truD tRNA pseudouridi 73.4 6.8 0.00017 18.9 5.2 23 328-350 55-77 (344) 232 PRK11054 helD DNA helicase IV; 73.3 6.8 0.00017 18.8 4.4 40 326-389 248-287 (684) 233 cd03228 ABCC_MRP_Like The MRP 73.2 2.8 7E-05 21.8 2.4 40 449-493 23-62 (171) 234 PRK06696 uridine kinase; Valid 73.2 3.1 7.8E-05 21.4 2.6 25 455-479 26-51 (227) 235 pfam11543 UN_NPL4 Nuclear pore 73.1 1.9 4.7E-05 23.1 1.5 13 674-686 68-80 (80) 236 pfam12252 SidE Dot/Icm substra 73.1 6.9 0.00018 18.8 4.7 48 508-555 1090-1137(1443) 237 cd07027 RNAP_RPB11_like RPB11 73.0 2 5E-05 22.9 1.6 44 492-537 32-81 (83) 238 pfam07447 VP40 Matrix protein 72.8 2.5 6.4E-05 22.1 2.1 30 373-411 236-266 (295) 239 PRK13651 cobalt transporter AT 72.7 2.8 7.3E-05 21.7 2.4 97 593-716 182-279 (304) 240 smart00344 HTH_ASNC helix_turn 72.5 6.7 0.00017 18.9 4.2 47 740-786 2-48 (108) 241 smart00347 HTH_MARR helix_turn 72.3 4.8 0.00012 20.0 3.4 56 745-800 14-70 (101) 242 pfam08007 Cupin_4 Cupin superf 72.2 1.9 4.9E-05 22.9 1.4 45 668-712 174-229 (320) 243 KOG0922 consensus 72.1 1.6 4E-05 23.6 0.9 20 328-347 267-286 (674) 244 cd02575 PseudoU_synth_EcTruD P 72.1 3.2 8.3E-05 21.2 2.5 140 616-784 41-200 (253) 245 PRK01637 rbn ribonuclease BN; 72.0 7.3 0.00019 18.6 15.5 10 41-50 45-54 (284) 246 PRK09824 beta-glucoside-specif 71.9 7.3 0.00019 18.6 13.5 47 83-129 212-260 (625) 247 COG1204 Superfamily II helicas 71.9 7.3 0.00019 18.6 5.7 46 738-783 687-733 (766) 248 PRK06090 DNA polymerase III su 71.8 3.3 8.4E-05 21.2 2.5 147 451-670 21-180 (319) 249 cd03239 ABC_SMC_head The struc 71.7 5.9 0.00015 19.3 3.8 33 447-480 16-48 (178) 250 cd03250 ABCC_MRP_domain1 Domai 71.3 5.3 0.00014 19.6 3.5 147 449-636 26-189 (204) 251 PRK05826 pyruvate kinase; Prov 71.3 2.4 6.2E-05 22.2 1.7 233 328-652 41-294 (461) 252 PRK13645 cbiO cobalt transport 71.3 3.2 8.2E-05 21.3 2.4 99 590-715 169-268 (289) 253 cd03254 ABCC_Glucan_exporter_l 71.2 3.7 9.4E-05 20.8 2.6 153 444-636 15-199 (229) 254 PRK05480 uridine kinase; Provi 71.2 3.9 9.9E-05 20.7 2.8 33 457-493 9-41 (209) 255 cd03285 ABC_MSH2_euk MutS2 hom 71.0 7.6 0.00019 18.5 4.6 72 456-536 32-103 (222) 256 PRK06217 hypothetical protein; 70.9 2.3 5.8E-05 22.4 1.5 21 454-474 1-21 (185) 257 cd01855 YqeH YqeH. YqeH is an 70.8 3 7.7E-05 21.5 2.1 27 456-482 129-155 (190) 258 PRK13634 cbiO cobalt transport 70.8 3.3 8.5E-05 21.1 2.4 93 593-711 153-246 (276) 259 PRK13632 cbiO cobalt transport 70.5 3.7 9.4E-05 20.8 2.5 76 590-682 162-238 (273) 260 pfam05976 consensus 70.4 7.9 0.0002 18.4 11.4 26 15-40 2-27 (706) 261 PRK02983 lysS lysyl-tRNA synth 70.4 7.9 0.0002 18.4 11.0 48 81-128 116-163 (1099) 262 TIGR01631 Trypano_RHS trypanos 70.3 4.8 0.00012 20.0 3.1 30 461-490 314-343 (814) 263 PRK13407 bchI magnesium chelat 70.2 3.8 9.6E-05 20.8 2.5 47 738-789 252-298 (334) 264 PRK07993 DNA polymerase III su 70.1 4.7 0.00012 20.1 3.0 145 451-668 20-178 (334) 265 PTZ00066 pyruvate kinase; Prov 70.1 2.1 5.5E-05 22.6 1.2 116 511-652 205-330 (513) 266 cd02028 UMPK_like Uridine mono 70.0 4 0.0001 20.6 2.6 23 457-479 2-24 (179) 267 PTZ00300 pyruvate kinase; Prov 70.0 2.4 6.2E-05 22.2 1.5 238 328-652 12-267 (454) 268 PRK06893 DNA replication initi 69.9 6.5 0.00016 19.0 3.7 31 454-484 39-69 (229) 269 cd03248 ABCC_TAP TAP, the Tran 69.8 3.9 1E-04 20.6 2.5 150 449-636 35-210 (226) 270 cd03246 ABCC_Protease_Secretio 69.8 3.3 8.3E-05 21.2 2.1 40 593-636 117-157 (173) 271 PRK11264 putative amino-acid A 69.7 5.6 0.00014 19.5 3.3 39 449-489 22-60 (248) 272 COG1855 ATPase (PilT family) [ 69.7 3.4 8.7E-05 21.1 2.2 120 303-479 167-288 (604) 273 COG0444 DppD ABC-type dipeptid 69.6 6 0.00015 19.3 3.4 182 447-669 24-245 (316) 274 PRK10590 ATP-dependent RNA hel 69.6 8.2 0.00021 18.2 4.2 45 606-650 322-370 (457) 275 PRK05654 acetyl-CoA carboxylas 69.5 8.2 0.00021 18.2 5.0 88 541-643 90-177 (288) 276 cd03290 ABCC_SUR1_N The SUR do 69.4 4.2 0.00011 20.4 2.6 31 444-474 13-47 (218) 277 PRK13637 cbiO cobalt transport 69.3 4 0.0001 20.6 2.5 99 590-715 163-262 (287) 278 pfam05729 NACHT NACHT domain. 69.3 7.9 0.0002 18.4 4.0 55 456-510 2-57 (165) 279 PRK11784 tRNA 2-selenouridine 69.3 3.7 9.5E-05 20.8 2.3 20 455-474 138-157 (333) 280 PTZ00112 origin recognition co 69.2 4.4 0.00011 20.3 2.7 226 340-696 214-461 (650) 281 PTZ00011 alpha tubulin I; Prov 69.2 4.4 0.00011 20.2 2.7 15 607-621 325-339 (453) 282 PRK13636 cbiO cobalt transport 69.0 4.2 0.00011 20.4 2.5 205 447-712 25-256 (285) 283 PRK08694 consensus 68.8 8.5 0.00022 18.1 9.8 140 457-635 221-380 (468) 284 TIGR00328 flhB flagellar biosy 68.7 8.5 0.00022 18.1 15.3 91 26-116 24-138 (352) 285 TIGR01995 PTS-II-ABC-beta PTS 68.5 8.6 0.00022 18.1 9.0 14 67-80 283-296 (660) 286 cd01897 NOG NOG1 is a nucleola 68.4 4.3 0.00011 20.3 2.5 21 455-475 1-21 (168) 287 PRK10247 putative ABC transpor 68.4 8.6 0.00022 18.1 7.6 37 448-488 27-63 (225) 288 cd03253 ABCC_ATM1_transporter 68.4 4.3 0.00011 20.3 2.5 25 450-474 23-47 (236) 289 COG1846 MarR Transcriptional r 68.3 7.9 0.0002 18.4 3.8 57 746-802 27-84 (126) 290 TIGR02040 PpsR-CrtJ transcript 68.1 2.8 7E-05 21.8 1.5 156 516-698 122-293 (453) 291 PRK06354 pyruvate kinase; Prov 68.1 2.8 7.1E-05 21.7 1.5 119 507-652 170-298 (589) 292 cd03369 ABCC_NFT1 Domain 2 of 68.0 7.1 0.00018 18.7 3.5 27 449-475 29-55 (207) 293 PRK13764 ATPase; Provisional 68.0 4.1 0.0001 20.5 2.3 108 325-479 175-284 (605) 294 cd03223 ABCD_peroxisomal_ALDP 67.9 4.5 0.00011 20.2 2.5 31 444-474 13-47 (166) 295 cd01853 Toc34_like Toc34-like 67.8 3.6 9.1E-05 20.9 2.0 47 450-496 27-76 (249) 296 PHA00520 packaging NTPase P4 67.8 2.8 7.2E-05 21.7 1.5 59 591-650 181-254 (326) 297 cd00288 Pyruvate_Kinase Pyruva 67.7 3.2 8.3E-05 21.2 1.8 253 328-671 39-307 (480) 298 cd03258 ABC_MetN_methionine_tr 67.7 5.7 0.00015 19.4 3.0 175 447-675 24-229 (233) 299 cd04163 Era Era subfamily. Er 67.6 5 0.00013 19.8 2.7 23 453-475 2-24 (168) 300 PRK06247 pyruvate kinase; Prov 67.5 3.5 8.9E-05 21.0 1.9 227 328-652 42-290 (477) 301 PRK08156 surface presentation 67.5 8.9 0.00023 17.9 15.5 24 25-48 24-47 (367) 302 pfam09012 FeoC FeoC like trans 67.4 9 0.00023 17.9 4.1 49 743-791 2-50 (68) 303 PRK13641 cbiO cobalt transport 67.3 4.3 0.00011 20.3 2.3 206 449-715 28-262 (286) 304 COG3432 Predicted transcriptio 67.3 9 0.00023 17.9 4.1 64 739-803 16-79 (95) 305 cd01871 Rac1_like Rac1-like su 67.3 6.2 0.00016 19.1 3.1 43 457-503 4-55 (174) 306 PRK07261 topology modulation p 67.2 2.8 7.1E-05 21.7 1.3 14 457-470 3-16 (171) 307 COG1703 ArgK Putative periplas 67.1 6.5 0.00017 19.0 3.2 42 457-504 54-95 (323) 308 PRK04213 GTP-binding protein; 67.0 5.2 0.00013 19.7 2.7 23 454-476 1-23 (195) 309 smart00489 DEXDc3 DEAD-like he 66.9 3.6 9.1E-05 20.9 1.8 24 455-478 28-51 (289) 310 smart00488 DEXDc2 DEAD-like he 66.9 3.6 9.1E-05 20.9 1.8 24 455-478 28-51 (289) 311 PRK13549 xylose transporter AT 66.7 5.7 0.00015 19.4 2.8 43 589-636 423-466 (513) 312 PRK12468 flhB flagellar biosyn 66.6 9.3 0.00024 17.8 17.0 23 26-48 30-52 (383) 313 COG1205 Distinct helicase fami 66.5 9.3 0.00024 17.8 4.1 39 741-780 742-789 (851) 314 cd00267 ABC_ATPase ABC (ATP-bi 66.5 6.1 0.00016 19.2 3.0 40 593-636 101-141 (157) 315 pfam01978 TrmB Sugar-specific 66.4 9.4 0.00024 17.8 4.3 50 745-795 12-61 (68) 316 pfam03193 DUF258 Protein of un 66.4 3.7 9.5E-05 20.8 1.8 89 458-560 39-147 (161) 317 PRK09361 radB DNA repair and r 66.3 9.4 0.00024 17.8 8.1 62 593-656 110-181 (224) 318 PRK13635 cbiO cobalt transport 66.2 5.1 0.00013 19.8 2.5 99 590-716 159-258 (279) 319 COG1136 SalX ABC-type antimicr 65.9 5.2 0.00013 19.7 2.5 150 449-636 26-204 (226) 320 COG0469 PykF Pyruvate kinase [ 65.9 4.3 0.00011 20.3 2.1 116 508-654 168-298 (477) 321 PRK13633 cobalt transporter AT 65.6 5 0.00013 19.8 2.4 215 447-714 30-261 (281) 322 cd03229 ABC_Class3 This class 65.5 5 0.00013 19.8 2.4 27 449-475 21-47 (178) 323 COG0593 DnaA ATPase involved i 65.4 9.8 0.00025 17.7 3.8 49 430-482 93-141 (408) 324 cd00286 Tubulin_FtsZ Tubulin/F 65.3 5.1 0.00013 19.8 2.4 58 459-531 98-155 (328) 325 KOG2802 consensus 65.3 7.3 0.00019 18.6 3.2 229 416-720 234-464 (503) 326 COG4618 ArpD ABC-type protease 65.2 7.3 0.00019 18.6 3.2 146 453-636 361-533 (580) 327 pfam00580 UvrD-helicase UvrD/R 65.1 9.9 0.00025 17.6 7.0 20 696-715 384-403 (494) 328 cd03277 ABC_SMC5_euk Eukaryoti 65.1 5.9 0.00015 19.3 2.7 27 455-481 23-50 (213) 329 PRK09493 glnQ glutamine ABC tr 65.0 6.3 0.00016 19.1 2.8 45 445-494 18-62 (240) 330 cd03252 ABCC_Hemolysin The ABC 64.9 7.7 0.0002 18.4 3.2 176 449-678 23-227 (237) 331 PRK10513 sugar phosphatase; Pr 64.6 6.4 0.00016 19.0 2.8 34 606-639 17-50 (270) 332 COG4172 ABC-type uncharacteriz 64.6 6.8 0.00017 18.8 2.9 89 447-546 29-148 (534) 333 PRK02412 aroD 3-dehydroquinate 64.5 3.9 9.8E-05 20.7 1.6 171 503-696 13-218 (253) 334 PRK13663 hypothetical protein; 64.5 4.9 0.00013 19.9 2.2 102 324-481 87-192 (493) 335 PRK11022 dppD dipeptide transp 64.5 5.7 0.00014 19.4 2.5 183 447-668 26-244 (327) 336 PRK09334 30S ribosomal protein 64.4 10 0.00026 17.5 4.0 63 737-799 44-106 (107) 337 smart00534 MUTSac ATPase domai 64.4 8.2 0.00021 18.2 3.3 118 457-636 2-123 (185) 338 KOG0057 consensus 64.3 4.9 0.00012 19.9 2.1 42 455-502 379-423 (591) 339 PRK09206 pyruvate kinase; Prov 64.3 3.6 9.2E-05 20.9 1.5 236 328-652 39-293 (470) 340 PRK10619 histidine/lysine/argi 64.2 6.4 0.00016 19.1 2.7 39 447-489 24-62 (257) 341 pfam00910 RNA_helicase RNA hel 64.2 7.1 0.00018 18.7 2.9 25 457-481 1-25 (105) 342 PRK09580 sufC cysteine desulfu 64.1 4.6 0.00012 20.1 2.0 53 440-495 9-65 (248) 343 PRK03573 transcriptional regul 64.0 10 0.00026 17.6 3.7 31 756-786 47-77 (144) 344 cd03296 ABC_CysA_sulfate_impor 63.9 5.6 0.00014 19.5 2.4 169 449-654 23-209 (239) 345 cd03297 ABC_ModC_molybdenum_tr 63.9 4.7 0.00012 20.1 2.0 28 457-488 26-53 (214) 346 PRK10253 iron-enterobactin tra 63.7 7 0.00018 18.7 2.8 57 433-494 12-68 (265) 347 PRK13639 cbiO cobalt transport 63.6 5.7 0.00015 19.4 2.4 94 593-713 158-252 (275) 348 cd03283 ABC_MutS-like MutS-lik 63.4 11 0.00027 17.4 4.5 144 426-635 1-148 (199) 349 PRK04841 transcriptional regul 63.4 11 0.00027 17.4 8.1 26 5-30 5-30 (903) 350 COG1123 ATPase components of v 63.2 8.6 0.00022 18.1 3.2 27 446-472 309-335 (539) 351 TIGR03598 GTPase_YsxC ribosome 62.7 6.8 0.00017 18.8 2.6 24 453-476 17-40 (179) 352 pfam10643 Cytochrome-c551 Phot 62.6 11 0.00028 17.3 9.1 73 9-106 1-75 (213) 353 PRK05599 hypothetical protein; 62.6 11 0.00028 17.3 4.5 73 325-401 150-225 (246) 354 pfam10165 Ric8 Guanine nucleot 62.6 11 0.00028 17.3 3.7 122 586-789 306-428 (438) 355 PRK13796 GTP-binding protein Y 62.5 6.6 0.00017 19.0 2.5 82 360-477 104-185 (367) 356 COG2261 Predicted membrane pro 62.4 11 0.00028 17.3 3.6 18 66-83 34-51 (82) 357 cd02430 PTH2 Peptidyl-tRNA hyd 62.3 11 0.00028 17.3 3.7 47 328-379 58-110 (115) 358 PRK10584 putative ABC transpor 62.3 6.6 0.00017 18.9 2.5 28 449-476 31-58 (228) 359 COG2850 Uncharacterized conser 62.2 4.1 0.00011 20.4 1.4 22 610-631 103-124 (383) 360 PRK06835 DNA replication prote 62.2 9.9 0.00025 17.6 3.4 28 456-483 185-212 (330) 361 cd03292 ABC_FtsE_transporter F 61.9 6.4 0.00016 19.1 2.4 150 447-636 20-197 (214) 362 PRK10419 nikE nickel transport 61.8 9.4 0.00024 17.8 3.2 164 447-636 31-213 (266) 363 PRK11169 leucine-responsive tr 61.7 11 0.00029 17.2 4.9 31 597-627 32-62 (164) 364 TIGR00634 recN DNA repair prot 61.7 4.5 0.00012 20.2 1.6 26 336-361 507-532 (605) 365 TIGR02773 addB_Gpos ATP-depend 61.7 5.2 0.00013 19.7 1.9 14 335-348 358-371 (1192) 366 PRK13649 cbiO cobalt transport 61.7 6.6 0.00017 19.0 2.4 205 449-716 28-263 (280) 367 PRK11598 putative cell divisio 61.4 11 0.00029 17.2 11.1 24 335-358 304-327 (545) 368 PRK13538 cytochrome c biogenes 61.3 9.4 0.00024 17.8 3.1 45 440-488 9-57 (204) 369 cd00268 DEADc DEAD-box helicas 61.3 11 0.00028 17.3 3.4 39 456-494 38-77 (203) 370 cd03231 ABC_CcmA_heme_exporter 61.2 9.4 0.00024 17.8 3.1 38 438-475 6-47 (201) 371 cd03300 ABC_PotA_N PotA is an 61.1 5.7 0.00015 19.4 2.0 158 448-654 20-203 (232) 372 PRK12288 ribosome-associated G 61.0 4.4 0.00011 20.2 1.4 90 458-561 211-320 (344) 373 PRK13880 conjugal transfer cou 61.0 2.4 6.1E-05 22.2 0.1 62 592-661 445-515 (636) 374 PRK13650 cbiO cobalt transport 61.0 7.3 0.00019 18.6 2.5 98 593-717 158-257 (276) 375 PRK10762 D-ribose transporter 60.9 7.3 0.00019 18.6 2.5 29 447-475 271-299 (501) 376 PRK10976 putative sugar phosph 60.8 8.6 0.00022 18.1 2.9 81 608-712 18-98 (266) 377 PRK13642 cbiO cobalt transport 60.8 11 0.00027 17.4 3.3 196 447-714 26-256 (277) 378 COG3264 Small-conductance mech 60.7 12 0.0003 17.1 8.3 11 485-495 742-752 (835) 379 pfam07724 AAA_2 AAA domain (Cd 60.6 12 0.00029 17.1 3.5 100 456-623 5-104 (168) 380 cd03262 ABC_HisP_GlnQ_permease 60.6 6.9 0.00018 18.8 2.3 145 449-636 21-196 (213) 381 PRK10982 galactose/methyl gala 60.6 7.4 0.00019 18.6 2.5 125 308-475 151-295 (491) 382 PRK12289 ribosome-associated G 60.5 5.1 0.00013 19.8 1.7 35 458-492 175-221 (351) 383 pfam07088 GvpD GvpD gas vesicl 60.4 8.7 0.00022 18.0 2.8 18 587-604 387-404 (484) 384 cd03256 ABC_PhnC_transporter A 60.4 5.9 0.00015 19.3 2.0 66 593-672 165-231 (241) 385 PRK09473 oppD oligopeptide tra 60.2 6.4 0.00016 19.0 2.1 171 448-669 36-253 (330) 386 PRK05580 primosome assembly pr 60.2 12 0.0003 17.0 4.8 101 591-712 523-630 (699) 387 PRK13643 cbiO cobalt transport 60.2 7.1 0.00018 18.7 2.4 98 593-717 165-263 (288) 388 KOG2670 consensus 60.1 12 0.0003 17.1 3.4 60 589-651 312-402 (433) 389 PRK11629 lolD lipoprotein tran 60.0 11 0.00028 17.3 3.3 38 447-488 28-65 (233) 390 cd03299 ABC_ModC_like Archeal 60.0 7.6 0.00019 18.5 2.5 163 447-654 18-202 (235) 391 CHL00131 ycf16 sulfate ABC tra 60.0 7.6 0.00019 18.5 2.5 28 449-476 27-54 (252) 392 PRK11440 hypothetical protein; 59.8 8.5 0.00022 18.1 2.7 42 592-633 10-56 (188) 393 PRK13647 cbiO cobalt transport 59.5 8 0.0002 18.3 2.5 151 449-636 26-199 (273) 394 cd03301 ABC_MalK_N The N-termi 59.5 11 0.00029 17.2 3.3 147 448-636 20-192 (213) 395 COG0572 Udk Uridine kinase [Nu 59.5 9.3 0.00024 17.8 2.8 103 456-597 10-114 (218) 396 KOG1547 consensus 59.4 10 0.00026 17.5 3.1 45 456-501 48-92 (336) 397 COG4598 HisP ABC-type histidin 59.3 6 0.00015 19.2 1.9 82 457-544 35-144 (256) 398 KOG0760 consensus 59.1 11 0.00029 17.2 3.2 51 745-797 224-275 (302) 399 TIGR01073 pcrA ATP-dependent D 59.0 7.1 0.00018 18.7 2.2 20 335-354 415-434 (811) 400 PRK12727 flagellar biosynthesi 58.9 12 0.00032 16.9 3.6 64 461-524 355-422 (557) 401 TIGR03597 GTPase_YqeH ribosome 58.8 8.1 0.00021 18.3 2.5 81 360-476 96-176 (360) 402 KOG3030 consensus 58.8 13 0.00032 16.8 6.8 46 83-128 68-126 (317) 403 TIGR01821 5aminolev_synth 5-am 58.8 3.8 9.7E-05 20.7 0.8 44 312-360 314-357 (427) 404 pfam02233 PNTB NAD(P) transhyd 58.8 13 0.00032 16.8 7.2 19 329-351 325-343 (464) 405 cd00878 Arf_Arl Arf (ADP-ribos 58.7 3.9 9.9E-05 20.6 0.8 21 457-477 2-22 (158) 406 PRK10919 ATP-dependent DNA hel 58.6 13 0.00032 16.8 3.9 21 335-355 354-374 (672) 407 cd03238 ABC_UvrA The excision 58.6 8.9 0.00023 17.9 2.6 41 592-636 109-150 (176) 408 TIGR00655 PurU formyltetrahydr 58.4 3.6 9.3E-05 20.9 0.6 12 603-614 260-271 (294) 409 PRK05419 putative sulfite oxid 58.3 13 0.00033 16.8 12.5 61 68-128 33-94 (203) 410 cd04153 Arl5_Arl8 Arl5/Arl8 su 58.3 7.1 0.00018 18.7 2.1 29 456-484 17-45 (174) 411 PRK06921 hypothetical protein; 58.0 12 0.0003 17.1 3.2 25 457-481 119-143 (265) 412 cd03298 ABC_ThiQ_thiamine_tran 57.8 7 0.00018 18.7 2.0 140 452-636 22-190 (211) 413 COG2607 Predicted ATPase (AAA+ 57.8 10 0.00026 17.5 2.8 71 409-495 50-122 (287) 414 KOG0346 consensus 57.7 8.6 0.00022 18.1 2.4 19 84-102 70-88 (569) 415 PTZ00012 alpha-tubulin II; Pro 57.7 8.4 0.00021 18.2 2.4 81 460-555 142-234 (450) 416 pfam01078 Mg_chelatase Magnesi 57.7 7.4 0.00019 18.6 2.1 23 452-474 20-42 (207) 417 cd02576 PseudoU_synth_ScPUS7 P 57.6 10 0.00026 17.5 2.8 22 329-350 43-64 (371) 418 PRK00215 LexA repressor; Valid 57.5 13 0.00034 16.7 4.7 38 596-633 26-71 (204) 419 KOG0350 consensus 57.5 6.6 0.00017 18.9 1.8 52 457-508 186-238 (620) 420 PRK09279 pyruvate phosphate di 57.4 3.6 9.2E-05 20.9 0.5 40 60-104 17-58 (875) 421 PRK01158 phosphoglycolate phos 57.3 11 0.00027 17.4 2.8 29 608-636 19-47 (226) 422 pfam03205 MobB Molybdopterin g 57.2 13 0.00034 16.7 8.3 46 455-501 1-46 (122) 423 pfam01080 Presenilin Presenili 57.2 13 0.00034 16.7 14.9 17 41-57 29-46 (403) 424 cd03261 ABC_Org_Solvent_Resist 57.1 9.2 0.00023 17.9 2.5 149 449-636 21-198 (235) 425 cd00876 Ras Ras family. The R 57.1 8.5 0.00022 18.1 2.3 47 457-503 2-53 (160) 426 KOG1292 consensus 57.1 13 0.00034 16.6 8.6 25 21-45 109-133 (510) 427 TIGR01858 tag_bisphos_ald clas 57.1 11 0.00027 17.3 2.8 51 702-757 115-169 (282) 428 cd01128 rho_factor Transcripti 57.1 13 0.00034 16.6 5.6 73 456-530 18-90 (249) 429 PRK10908 cell division protein 57.0 8.6 0.00022 18.1 2.3 42 590-636 156-198 (222) 430 cd01854 YjeQ_engC YjeQ/EngC. 56.9 6.8 0.00017 18.8 1.8 89 458-560 165-273 (287) 431 pfam09848 DUF2075 Uncharacteri 56.9 13 0.00034 16.6 3.8 21 457-477 4-24 (348) 432 cd03281 ABC_MSH5_euk MutS5 hom 56.8 13 0.00034 16.6 4.6 94 428-532 3-98 (213) 433 cd03294 ABC_Pro_Gly_Bertaine T 56.7 8.8 0.00022 18.0 2.3 165 433-636 29-222 (269) 434 cd01893 Miro1 Miro1 subfamily. 56.7 11 0.00027 17.4 2.8 43 457-503 3-53 (166) 435 PRK11701 phnK phosphonates tra 56.4 8.9 0.00023 18.0 2.3 29 446-474 24-52 (258) 436 cd03214 ABC_Iron-Siderophores_ 56.4 11 0.00028 17.3 2.8 42 444-489 11-56 (180) 437 PRK13644 cbiO cobalt transport 56.3 14 0.00035 16.5 8.4 94 593-715 157-251 (274) 438 PRK09544 znuC high-affinity zi 56.2 11 0.00027 17.4 2.7 44 447-495 23-66 (251) 439 PRK10870 transcriptional repre 56.1 11 0.00027 17.3 2.7 45 755-799 71-116 (176) 440 cd04146 RERG_RasL11_like RERG/ 56.0 13 0.00032 16.8 3.1 47 457-503 2-53 (165) 441 cd06927 RNAP_L L subunit of Ar 56.0 6.9 0.00018 18.8 1.7 43 493-537 33-81 (83) 442 PRK04837 ATP-dependent RNA hel 56.0 14 0.00035 16.5 3.6 17 695-711 358-377 (423) 443 cd00157 Rho Rho (Ras homology) 55.9 11 0.00029 17.2 2.8 43 457-503 3-54 (171) 444 PRK10938 putative molybdenum t 55.8 14 0.00036 16.5 7.8 28 448-475 280-307 (490) 445 COG3840 ThiQ ABC-type thiamine 55.8 7.6 0.00019 18.5 1.9 185 433-662 13-210 (231) 446 PRK12338 hypothetical protein; 55.8 6 0.00015 19.2 1.4 122 585-714 129-272 (320) 447 COG5635 Predicted NTPase (NACH 55.7 14 0.00036 16.5 4.0 62 456-532 224-287 (824) 448 pfam04317 DUF463 YcjX-like fam 55.6 8.7 0.00022 18.0 2.2 13 742-754 430-442 (443) 449 smart00763 AAA_PrkA PrkA AAA d 55.6 13 0.00032 16.8 3.0 99 332-491 17-116 (361) 450 PRK00098 ribosome-associated G 55.4 7.7 0.0002 18.4 1.9 94 458-561 168-276 (298) 451 PRK04322 peptidyl-tRNA hydrola 55.3 14 0.00036 16.4 3.6 47 328-379 59-111 (116) 452 KOG3054 consensus 55.3 8.5 0.00022 18.1 2.1 14 418-431 259-272 (299) 453 cd03255 ABC_MJ0796_Lo1CDE_FtsE 55.2 14 0.00036 16.4 3.4 156 447-654 23-212 (218) 454 cd04154 Arl2 Arl2 subfamily. 55.1 10 0.00026 17.5 2.5 23 454-476 14-36 (173) 455 PRK09598 lipid A phosphoethano 54.8 14 0.00037 16.4 11.1 20 339-358 296-315 (522) 456 cd03233 ABC_PDR_domain1 The pl 54.5 10 0.00026 17.6 2.3 20 456-475 35-54 (202) 457 PRK12724 flagellar biosynthesi 54.4 15 0.00037 16.3 4.1 66 457-523 226-295 (432) 458 PRK11147 ABC transporter ATPas 54.4 15 0.00037 16.3 3.3 62 444-511 331-396 (632) 459 PRK10875 recD exonuclease V su 54.4 15 0.00037 16.3 7.4 58 415-496 235-298 (607) 460 PRK08533 flagellar accessory p 54.4 15 0.00037 16.3 3.3 27 456-482 26-52 (230) 461 pfam03348 Serinc Serine incorp 54.3 15 0.00038 16.3 7.4 69 25-99 24-97 (426) 462 pfam03215 Rad17 Rad17 cell cyc 54.3 10 0.00026 17.5 2.3 13 512-524 362-374 (490) 463 PRK11308 dppF dipeptide transp 54.2 10 0.00026 17.5 2.3 67 593-669 175-246 (327) 464 PRK11248 tauB taurine transpor 54.2 10 0.00026 17.5 2.3 152 444-634 13-188 (255) 465 pfam03029 ATP_bind_1 Conserved 54.1 15 0.00038 16.3 3.5 49 459-510 1-52 (234) 466 COG0210 UvrD Superfamily I DNA 54.0 15 0.00038 16.3 5.5 35 452-486 372-406 (655) 467 PRK09765 2-O-a-mannosyl-D-glyc 53.9 15 0.00038 16.3 10.1 15 87-101 345-359 (638) 468 KOG3817 consensus 53.9 15 0.00038 16.3 13.5 13 172-184 289-301 (452) 469 cd01849 YlqF_related_GTPase Yl 53.9 10 0.00026 17.5 2.3 20 456-475 102-121 (155) 470 pfam03297 Ribosomal_S25 S25 ri 53.9 15 0.00037 16.3 3.1 58 740-797 43-100 (104) 471 COG1201 Lhr Lhr-like helicases 53.9 15 0.00038 16.3 9.3 66 488-554 511-591 (814) 472 TIGR03415 ABC_choXWV_ATP choli 53.8 13 0.00033 16.7 2.8 107 438-550 34-159 (382) 473 cd03234 ABCG_White The White s 53.8 11 0.00027 17.4 2.4 54 590-654 162-216 (226) 474 PRK11247 ssuB aliphatic sulfon 53.8 11 0.00029 17.2 2.5 54 590-654 152-206 (257) 475 cd03219 ABC_Mj1267_LivG_branch 53.7 9 0.00023 17.9 2.0 40 593-636 164-204 (236) 476 cd03293 ABC_NrtD_SsuB_transpor 53.7 11 0.00027 17.4 2.4 143 449-636 25-193 (220) 477 cd04155 Arl3 Arl3 subfamily. 53.6 12 0.0003 17.0 2.6 25 453-477 13-37 (173) 478 cd02552 PseudoU_synth_TruD_lik 53.6 15 0.00038 16.2 3.1 110 589-720 30-153 (232) 479 KOG3354 consensus 53.5 6.7 0.00017 18.9 1.3 17 456-472 14-30 (191) 480 smart00173 RAS Ras subfamily o 53.5 14 0.00036 16.5 2.9 43 457-503 3-54 (164) 481 CHL00174 accD acetyl-CoA carbo 53.4 15 0.00039 16.2 3.9 84 544-642 112-195 (305) 482 cd01371 KISc_KIF3 Kinesin moto 53.3 5.3 0.00014 19.6 0.8 18 456-473 84-101 (333) 483 TIGR01884 cas_HTH CRISPR locus 53.3 15 0.00039 16.2 4.0 57 744-800 172-229 (231) 484 cd01367 KISc_KIF2_like Kinesin 53.2 5.6 0.00014 19.5 0.9 16 456-471 87-102 (322) 485 pfam11737 DUF3300 Protein of u 53.2 14 0.00035 16.6 2.8 29 598-626 87-115 (230) 486 TIGR02868 CydC ABC transporter 53.1 11 0.00027 17.4 2.3 21 25-45 12-32 (566) 487 KOG0245 consensus 52.8 4.3 0.00011 20.3 0.2 23 661-683 875-897 (1221) 488 COG1735 Php Predicted metal-de 52.8 8.6 0.00022 18.1 1.8 38 595-632 50-95 (316) 489 cd01376 KISc_KID_like Kinesin 52.7 5.8 0.00015 19.4 0.9 51 664-721 224-277 (319) 490 PRK07471 DNA polymerase III su 52.6 13 0.00033 16.8 2.6 35 453-487 37-72 (363) 491 TIGR01162 purE phosphoribosyla 52.5 14 0.00035 16.6 2.8 26 334-359 12-39 (159) 492 pfam04123 DUF373 Domain of unk 52.5 16 0.0004 16.1 12.1 133 51-190 154-317 (344) 493 cd03276 ABC_SMC6_euk Eukaryoti 52.5 12 0.00031 17.0 2.5 21 458-478 25-45 (198) 494 PRK10895 putative ABC transpor 52.5 9.7 0.00025 17.7 2.0 16 459-474 34-49 (241) 495 COG2717 Predicted membrane pro 52.4 16 0.0004 16.1 9.7 91 25-130 75-165 (209) 496 PRK05313 hypothetical protein; 52.4 16 0.0004 16.1 3.1 34 370-403 370-408 (450) 497 PRK09700 D-allose transporter 52.4 11 0.00029 17.2 2.3 18 457-474 34-51 (510) 498 PRK09862 putative ATP-dependen 52.3 10 0.00027 17.4 2.1 19 456-474 212-230 (506) 499 PRK13547 hmuV hemin importer A 52.2 13 0.00034 16.7 2.7 43 432-474 5-47 (273) 500 PRK11124 artP arginine transpo 52.1 14 0.00037 16.4 2.8 41 434-474 8-48 (242) No 1 >PRK10263 DNA translocase FtsK; Provisional Probab=100.00 E-value=0 Score=1947.34 Aligned_cols=488 Identities=49% Similarity=0.759 Sum_probs=464.8 Q ss_pred CCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEEECCCCCHHHHHHHHHHHH Q ss_conf 62036613533200124555632566545789999997424863289984104424444432147863999997889999 Q gi|254780799|r 306 TFVLPSKEILSTSQSPVNQMTFSPKVMQNNACTLKSVLSDFGIQGEIVNVRPGPVITLYELEPAPGIKSSRIIGLSDDIA 385 (806) Q Consensus 306 ~YkLPsLdLL~~~~~~~~~~~~s~eeL~~nA~lLE~tL~dFGVe~~Vv~v~pGPVVTrYEi~PApGVKvSKI~nLadDIA 385 (806) .-.||+||||+.|.. +...+|+++|+++|++||.+|+||||+++|++++||||||||||+||||||||||+||+|||| T Consensus 863 ~~~lp~l~ll~~~~~--~~~~~~~~~l~~~arl~E~~L~df~V~a~Vv~i~pGPvVTryEl~papGVKvsrI~~L~dDlA 940 (1355) T PRK10263 863 TTPLPSLDLLTPPPS--EVEPVDTFALEQMARLVEARLADFRIKADVVNYSPGPVITRFELNLAPGVKAARISNLSRDLA 940 (1355) T ss_pred CCCCCCHHHCCCCCC--CCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEEEECCCEEEEEEEECCCCCCHHHHHHHHHHHH T ss_conf 999986542469841--358788999999999999987633942899987239867876540589850655310899999 Q ss_pred HHHHHCCCCC-CCCCCCCEEEEEECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCC Q ss_conf 8863001210-003886125553024566238734230721343022100021045666666785410020235530477 Q gi|254780799|r 386 RSMSAISARV-AVIPRRNAIGIELPNDIRETVMLRDLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTG 464 (806) Q Consensus 386 ~aLsa~svRI-apIPGK~~VGIEiPN~~r~~V~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTG 464 (806) |+|+|.|||| +|||||++|||||||++|++|+|||||+|++|++++++|+++|||||+|+|+++||+|||||||||||| T Consensus 941 ~aLsa~sVRI~apIPGK~~VGIEvPN~~r~~V~lrevl~s~~f~~~~s~L~~aLGKDI~G~pvv~DLaKMPHLLIAGtTG 1020 (1355) T PRK10263 941 RSLSTVAVRVVEVIPGKPYVGLELPNKKRQTVYLREVLDNAKFRDNPSPLTVVLGKDIAGEPVVADLAKMPHLLVAGTTG 1020 (1355) T ss_pred HHHCCCCEEEEECCCCCCEEEEECCCCCCCEEEHHHHHCCHHHHCCCCCCEEEECCCCCCCEEEEEHHHCCCEEEECCCC T ss_conf 97226760698116998755688998988747558755584232279974265136667988774742197113404678 Q ss_pred CHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 40679999999999982995784788852310011102770343122334304566899999999999999998708996 Q gi|254780799|r 465 SGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRN 544 (806) Q Consensus 465 SGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRn 544 (806) ||||||||+||+|||||++|||||||||||||||||+||||||||+|||||||||++||+|||+||||||++|+++|||| T Consensus 1021 SGKSV~iNsmI~SlLyk~~PdevrlImIDPKmvELs~Y~gIPHLL~PVVTDpkkA~~AL~W~V~EMErRY~l~a~~gVRN 1100 (1355) T PRK10263 1021 SGKSVGVNAMILSMLYKAQPEDVRFIMIDPKMLELSVYEGIPHLLTEVVTDMKDAANALRWCVNEMERRYKLMSALGVRN 1100 (1355) T ss_pred CCCHHHHHHHHHHHHHCCCHHHEEEEEECCCEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 88304399999999843888794799866874531467999735776737879999999999999999999999819988 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHC Q ss_conf 89999999988744786677544677654543222332232346986877634468888732100588999999866414 Q gi|254780799|r 545 IDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKDIESAVQRLAQMARAS 624 (806) Q Consensus 545 i~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~ve~~i~rlaq~ara~ 624 (806) |+|||+|+.++.+.|+|+.++.|...|.. +.+++.+++|||||||||||||||||+|||||++|+||||||||| T Consensus 1101 I~gYN~kv~~a~~~g~pi~dp~~~~~d~~------~~~~p~leklPyIVViIDElADLMM~agkeVE~~I~RLAQkARAa 1174 (1355) T PRK10263 1101 LAGYNEKIAEADRMMRPIPDPYWKPGDSM------DAQHPVLKKEPYIVVLVDEFADLMMTVGKKVEELIARLAQKARAA 1174 (1355) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCC------CCCCCCCCCCCEEEEEEECCHHHHCCCCHHHHHHHHHHHHHHHHH T ss_conf 79999999987750687776334676543------346665578986999973362753626287999999999999987 Q ss_pred CEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCHHHHCCCCCEEEECCC-CCEEEEEECCCCHHHHH Q ss_conf 23799996577753554355411025158764586642123388645786588754773689-83258883348988999 Q gi|254780799|r 625 GIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGAEQLLGQGDMLYMTGG-GRVQRIHGPFVSDIEVE 703 (806) Q Consensus 625 GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~gae~Llg~gdml~~~~~-~~~~r~~g~~v~~~ev~ 703 (806) ||||||||||||||||||+|||||||||||+||||||||||||++|||+|||+|||||+|+| ++|+|||||||||+||+ T Consensus 1175 GIHLIlATQRPSVDVITGlIKANiPtRIAF~VsSkiDSRTILD~~GAE~LLG~GDMLf~~pg~~~p~RvqGafVsD~EV~ 1254 (1355) T PRK10263 1175 GIHLVLATQRPSVDVITGLIKANIPTRIAFTVSSKIDSRTILDQAGAESLLGMGDMLYSGPNSTLPVRVHGAFVRDQEVH 1254 (1355) T ss_pred HCEEEEECCCCCCCEECCCEECCCHHHHEECCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCCCCCEEEECCCCCHHHHH T ss_conf 42378724899867004624334323310003445663510787588885589764423799998417777866889999 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCHHHHHHHHHHHHCCCEEEEHHHHHHCCCHHHHHHHHHHHHHC Q ss_conf 999999712897421100124556667788-8887777038999999996598500014222001177899999999977 Q gi|254780799|r 704 KVVSHLKTQGEAKYIDIKDKILLNEEMRFS-ENSSVADDLYKQAVDIVLRDNKASISYIQRRLGIGYNRAASIIENMEEK 782 (806) Q Consensus 704 ~v~~~~~~q~~~~y~~~~~~~~~~~~~~~~-~~~~~~d~l~~~a~~~v~~~~~~s~s~lqr~~~igy~raar~~~~~e~~ 782 (806) +||+|||+|++|+|++++......++++++ +++++.|+||+||+++|+++||+|||+|||||||||||||||||+||++ T Consensus 1255 ~VV~~~k~qg~p~Y~~~i~~~~~~~~~~~~~~~~~e~D~L~~qAv~~V~~~~~aS~S~lQRrlrIGYnRAARiid~mE~~ 1334 (1355) T PRK10263 1255 AVVQDWKARGRPQYVDGITSDSESEGGAGGFDGAEELDPLFDQAVQFVTEKRKASISGVQRQFRIGYNRAARIIEQMEAQ 1334 (1355) T ss_pred HHHHHHHHCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHCCCCCHHHHHHHHHHHHC T ss_conf 99999986289763110145544456777777776566799999999986294328998763134335899999999977 Q ss_pred CCCCHHHCCCCCEEEECHH Q ss_conf 9868022788726731712 Q gi|254780799|r 783 GVIGPASSTGKREILISSM 801 (806) Q Consensus 783 giv~~~~g~~~r~vl~~~~ 801 (806) |||||++|+|+||||.++- T Consensus 1335 GIVs~~~~~g~REVLap~p 1353 (1355) T PRK10263 1335 GIVSEQGHNGNREVLAPPP 1353 (1355) T ss_pred CCCCCCCCCCCCCCCCCCC T ss_conf 8846888998863458999 No 2 >COG1674 FtsK DNA segregation ATPase FtsK/SpoIIIE and related proteins [Cell division and chromosome partitioning] Probab=100.00 E-value=0 Score=1254.64 Aligned_cols=471 Identities=46% Similarity=0.643 Sum_probs=441.0 Q ss_pred CCCCCCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEEECCCCCHHHHHHHH Q ss_conf 23576203661353320012455563256654578999999742486328998410442444443214786399999788 Q gi|254780799|r 302 HGTGTFVLPSKEILSTSQSPVNQMTFSPKVMQNNACTLKSVLSDFGIQGEIVNVRPGPVITLYELEPAPGIKSSRIIGLS 381 (806) Q Consensus 302 ~~~~~YkLPsLdLL~~~~~~~~~~~~s~eeL~~nA~lLE~tL~dFGVe~~Vv~v~pGPVVTrYEi~PApGVKvSKI~nLa 381 (806) .....|.+|+++||.++.........+..++..++..|+.+|.+|+|.++++.+.+||+||+||++|++|||+|||++|+ T Consensus 376 ~~~~~~~lP~l~ll~~~~~~~~~~~~~~~~l~~~~~~l~~sl~~f~i~a~~~~~~~gp~vt~yei~l~~gvk~~~i~~L~ 455 (858) T COG1674 376 RGKDTPPLPHLFLIADEFAELKSEHPDFAELLVSIARLGRSLGIHLILATQKPDGVGPSNTRFEIALKVGVKVDSIEILG 455 (858) T ss_pred CCCCCCCCCCHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHEEEEEEEECCCCCCCEEEEEEECCCCHHHHHHHCC T ss_conf 56887889822120554322345774778899999999887886443578960567885126998645662166776047 Q ss_pred HHHHHHHHHCCCCC-CCCCCCCEEEEEECCCC-CCEEEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEE Q ss_conf 99998863001210-00388612555302456-62387342307213430221000210456666667854100202355 Q gi|254780799|r 382 DDIARSMSAISARV-AVIPRRNAIGIELPNDI-RETVMLRDLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPHLLI 459 (806) Q Consensus 382 dDIA~aLsa~svRI-apIPGK~~VGIEiPN~~-r~~V~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLI 459 (806) +|+|++|++.++|| +|||||++||||+||.. |++|+|+|++++++|.++...|++++||||.|+|+++||+||||||| T Consensus 456 ~d~A~~L~~~~~ri~~~ipgk~~igie~pn~~~~~~~~L~e~~~~~~~~~~~~~l~i~lg~~~~~~~~~~dlak~~hlli 535 (858) T COG1674 456 NDDAAALVAKSRRYLAPIPGKSYIGFQSPNSGRPWLPPLRELLDLDEFLERFPLGVIDLPKDIRQEPIVIDLAKAGHLLI 535 (858) T ss_pred CHHHHHCCCCCEEEEEEECCCCEEEEECCCCCCCCCCCHHHCCCCHHHHCCCCCEEECCCHHCCCCCEEEECCCCCCEEE T ss_conf 37788501446058987479753789746889867787121035234532465147525100368813855356888788 Q ss_pred EECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHH-HHHHHHHHHHHHHHHHHHHHH Q ss_conf 3047740679999999999982995784788852310011102770343122334304-566899999999999999998 Q gi|254780799|r 460 AGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQ-KAVTVLKWLVCEMEERYQKMS 538 (806) Q Consensus 460 AGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~-kA~~aL~w~V~EMe~RY~l~a 538 (806) ||+||||||||+||||+|+||+++|+||||+||||||+||+.|++||||++||+||++ ||.++|+|||+|||+||++|+ T Consensus 536 ~G~tgsGKSv~lnt~i~Sll~~~~P~ev~~~~iD~k~~~L~~~~~iPHl~~~v~td~~~k~~~al~~~~~eme~R~~l~~ 615 (858) T COG1674 536 AGATGSGKSVALNTMILSLLYTHSPEEVRFYIIDPKMLELAAYDGLPHLGDPVVTDEKEKAEKALAELVAEMERRYKLFS 615 (858) T ss_pred ECCCCCCHHHHHHHHHHHHHHCCCCHHEEEEEECCCCCCCHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 24888651558999999987518906849999747875433330698557723247477899999999999999999888 Q ss_pred HCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 70899689999999988744786677544677654543222332232346986877634468888732100588999999 Q gi|254780799|r 539 KIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKDIESAVQRLA 618 (806) Q Consensus 539 ~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~ve~~i~rla 618 (806) ..|||||++||+|+.++ ...++||||||||||+|||||+++||||++|+||| T Consensus 616 ~~~vr~i~~yn~k~~~~----------------------------~~~~~lP~iviiiDe~adlm~~~~k~ve~~i~rLa 667 (858) T COG1674 616 EKGVRNIEGYNEKIAGA----------------------------IPDEELPYIVIIIDEYADLMMVAGKELEELIARLA 667 (858) T ss_pred HHCCCCHHHHHHHHCCC----------------------------CCCCCCCCEEEEECCHHHHHCCCCHHHHHHHHHHH T ss_conf 85666277776542024----------------------------55567980899944478886123176999999999 Q ss_pred HHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCHHHHCCCCCEEEECCC-CCEEEEEECCC Q ss_conf 86641423799996577753554355411025158764586642123388645786588754773689-83258883348 Q gi|254780799|r 619 QMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGAEQLLGQGDMLYMTGG-GRVQRIHGPFV 697 (806) Q Consensus 619 q~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~gae~Llg~gdml~~~~~-~~~~r~~g~~v 697 (806) |||||+|||||||||||||||||| |||||||||||+|+|++|||||||+.|||+|+|+|||||++++ ++|+|+||||| T Consensus 668 ~~ara~GIHlilatqRps~dVit~-ikan~psrIaf~v~s~~dsr~il~~~gae~l~GrGd~l~~~~~~~~pvr~~~~fv 746 (858) T COG1674 668 QKGRAAGIHLILATQRPSVDVITG-IKANIPTRIALRLSSKIDSRLILGQDGAEKLLGRGDMLFLGPGDAKPVRVLPAFV 746 (858) T ss_pred HHHHHCCCEEEEECCCCCCCHHHH-HHHCCCCCEEEEECCCCCEEEEECCCHHHHCCCCCEEECCCCCCCCCCCCCCCEE T ss_conf 978765826999748999504088-8853875256651687652566245506308885424215788988602665400 Q ss_pred CHHHHHHHHHHHHHCCCCCCCCCCCCCCC---CC-CCCC--CCCCCCCCHHHHHHHHHHHHCCCEEEEHHHHHHCCCHHH Q ss_conf 98899999999971289742110012455---66-6778--888877770389999999965985000142220011778 Q gi|254780799|r 698 SDIEVEKVVSHLKTQGEAKYIDIKDKILL---NE-EMRF--SENSSVADDLYKQAVDIVLRDNKASISYIQRRLGIGYNR 771 (806) Q Consensus 698 ~~~ev~~v~~~~~~q~~~~y~~~~~~~~~---~~-~~~~--~~~~~~~d~l~~~a~~~v~~~~~~s~s~lqr~~~igy~r 771 (806) ||+||+++++++|.|++|.|.+....... ++ .... .....+.|+||+||+++|++++++|+|++||+++||||| T Consensus 747 sd~ev~~v~~~~k~~~~p~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~~a~~~v~~~~~~S~s~lqr~~~iG~~r 826 (858) T COG1674 747 SDEEVEEVVEHLKSQVEPVYLDLATNPHLLISGETTVLRAIDREEEAMDELFDEAKELVLPPPKASTSLLQRALDLGLNR 826 (858) T ss_pred CHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCCEEE T ss_conf 48888886300231377324423567542013531002221011011038999999862472113789872062678177 Q ss_pred HHHHHHHHHHCCCCCHHHCCCCCEEEECHH Q ss_conf 999999999779868022788726731712 Q gi|254780799|r 772 AASIIENMEEKGVIGPASSTGKREILISSM 801 (806) Q Consensus 772 aar~~~~~e~~giv~~~~g~~~r~vl~~~~ 801 (806) |||+||+||..|+|||++|+|+||||+... T Consensus 827 aa~~~~~~e~~giv~~~~~~~~reil~~~~ 856 (858) T COG1674 827 AATLIESMELLGIVGPPNGPKGREILVVED 856 (858) T ss_pred EECCCCCCHHHCEECCCCCCCCCEEEECCC T ss_conf 604222121313434645799852762677 No 3 >pfam01580 FtsK_SpoIIIE FtsK/SpoIIIE family. FtsK has extensive sequence similarity to wide variety of proteins from prokaryotes and plasmids, termed the FtsK/SpoIIIE family. This domain contains a putative ATP binding P-loop motif. It is found in the FtsK cell division protein from E. coli and the stage III sporulation protein E SpoIIIE which has roles in regulation of prespore specific gene expression in B. subtilis. A mutation in FtsK causes a temperature sensitive block in cell division and it is involved in peptidoglycan synthesis or modification. The SpoIIIE protein is implicated in intercellular chromosomal DNA transfer. Probab=100.00 E-value=0 Score=397.20 Aligned_cols=197 Identities=48% Similarity=0.791 Sum_probs=182.4 Q ss_pred EHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHH Q ss_conf 34230721343022100021045666666785410020235530477406799999999999829957847888523100 Q gi|254780799|r 418 LRDLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKML 497 (806) Q Consensus 418 lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~v 497 (806) +.+......|+.+.+.||+.+|.|..|+|++.||.++||+||+|+||||||++++++|+|++++++|++|+|++||||.. T Consensus 2 ~~~~~~~~~~~~~~~~l~vp~G~~~~G~pv~~dl~~~pH~Lv~G~tGsGKS~~l~~li~sl~~~~~p~~v~l~liD~K~~ 81 (202) T pfam01580 2 LKELLDGKPFRGSRSRLTIALGKDISGNPVVADLVKMPHLLIAGATGSGKSTFLNTLILSLAARHSPEEVRLYLIDPKGG 81 (202) T ss_pred HHHHCCCCCCCCCCCCCEEEEEECCCCCEEEEECCCCCCEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEECCCCC T ss_conf 67861692001689961487776799998998635688689965899980099999999998737962069999748961 Q ss_pred HHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC Q ss_conf 11102770343122334304566899999999999999998708996899999999887447866775446776545432 Q gi|254780799|r 498 ELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEA 577 (806) Q Consensus 498 Els~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~ 577 (806) +|+.|+++||+...+++|+.++...|+|++.||+|||++|++.||+||..||++.......+ T Consensus 82 ~~~~~~~~~h~~~~~~~d~e~~~~~l~~l~~em~rR~~ll~~~g~~~i~~~~~~~~~~~~~~------------------ 143 (202) T pfam01580 82 ELAALEDLPHLLSAVATDPEDALSALRALVAEMERRYALLKQLGVRSIEEYNGEIAEDILGG------------------ 143 (202) T ss_pred CHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCC------------------ T ss_conf 26767635654433768999999999999999999999999838876899999866432124------------------ Q ss_pred CCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCH----HHHHHHHHHHHHHHCCEEEEEEECCCCC Q ss_conf 22332232346986877634468888732100----5889999998664142379999657775 Q gi|254780799|r 578 IYETEHFDFQHMPYIVVVIDEMADLMMVARKD----IESAVQRLAQMARASGIHVIMATQRPSV 637 (806) Q Consensus 578 ~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~----ve~~i~rlaq~ara~GiHli~aTqrPsv 637 (806) .....+||+|||||||+|+||+.++++ +++.+.||||++||+|||||+||||||+ T Consensus 144 -----~~~~~~~p~ivvviDE~~~l~~~~~~~~~~~~~~~l~~iar~GRa~GihlilatQrP~~ 202 (202) T pfam01580 144 -----AGWLEELPPIVVIVDERAELMLAAPKDSEMRVEGALARLARMGRAAGIHLLLATQRPGV 202 (202) T ss_pred -----CCCCCCCCCEEEEEHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCEEEEEEECCCCC T ss_conf -----55433478189864459999865550468999999999999887338299998189999 No 4 >smart00843 Ftsk_gamma This domain directs oriented DNA translocation and forms a winged helix structure. Mutated proteins with substitutions in the FtsK gamma DNA-recognition helix are impaired in DNA binding. Probab=99.89 E-value=5.6e-24 Score=199.20 Aligned_cols=63 Identities=49% Similarity=0.894 Sum_probs=61.0 Q ss_pred CCCCHHHHHHHHHHHHCCCEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHCCCCCEEEEC Q ss_conf 777703899999999659850001422200117789999999997798680227887267317 Q gi|254780799|r 737 SVADDLYKQAVDIVLRDNKASISYIQRRLGIGYNRAASIIENMEEKGVIGPASSTGKREILIS 799 (806) Q Consensus 737 ~~~d~l~~~a~~~v~~~~~~s~s~lqr~~~igy~raar~~~~~e~~giv~~~~g~~~r~vl~~ 799 (806) ++.||||+||+++|+++|++|||+|||||+|||||||||||+||++|||||++|+||||||++ T Consensus 1 ~~~D~l~~~a~~~V~~~~~~S~S~lQR~l~IGynRAariid~lE~~GiVsp~~g~~~ReVL~~ 63 (63) T smart00843 1 EEEDELYDEAVELVIETQKASTSLLQRRLRIGYNRAARLIDQLEEEGIVGPANGSKPREVLVT 63 (63) T ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCCEEECC T ss_conf 963289999999999808624899999972050699999999999858788779999834249 No 5 >pfam09397 Ftsk_gamma Ftsk gamma domain. This domain directs oriented DNA translocation and forms a winged helix structure. Mutated proteins with substitutions in the FtsK gamma DNA-recognition helix are impaired in DNA binding. Probab=99.89 E-value=6.7e-24 Score=198.61 Aligned_cols=66 Identities=50% Similarity=0.844 Sum_probs=63.1 Q ss_pred CCCCCHHHHHHHHHHHHCCCEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHCCCCCEEEECHH Q ss_conf 877770389999999965985000142220011778999999999779868022788726731712 Q gi|254780799|r 736 SSVADDLYKQAVDIVLRDNKASISYIQRRLGIGYNRAASIIENMEEKGVIGPASSTGKREILISSM 801 (806) Q Consensus 736 ~~~~d~l~~~a~~~v~~~~~~s~s~lqr~~~igy~raar~~~~~e~~giv~~~~g~~~r~vl~~~~ 801 (806) .++.||||++|+++|++++++|||+|||||+|||||||||||+||++|||||++|+||||||++.+ T Consensus 2 ~~~~D~l~~~a~~~V~~~~~~S~S~lQR~~~IGynRAariid~LE~~GiVsp~~g~~~ReVL~~~~ 67 (67) T pfam09397 2 DEEEDELYDEAVEFVIETQKASTSLLQRRLRIGYNRAARLIEQMEEEGIVGPANGSKPREVLVPKE 67 (67) T ss_pred CCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCCCCEECCCCC T ss_conf 742038999999999981863489999997105069999999999984888877898850508999 No 6 >COG1674 FtsK DNA segregation ATPase FtsK/SpoIIIE and related proteins [Cell division and chromosome partitioning] Probab=99.09 E-value=1e-10 Score=99.96 Aligned_cols=200 Identities=28% Similarity=0.417 Sum_probs=158.1 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHH-CCCCHHHHCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 202355304774067999999999998299578478885231001110-2770343122334304566899999999999 Q gi|254780799|r 454 MPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSV-YDGIPNLLTPVVTNPQKAVTVLKWLVCEMEE 532 (806) Q Consensus 454 MPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~-Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~ 532 (806) .||=+++||||||||--. +.+.++-..++|+++.++.+|.|.-.... .++. |+...+..... ..++.-.=.||.+ T Consensus 286 ~~~~~~~~~t~~~~~e~~-~~~~~~~~~~~~~e~~~~~~~~k~~~~~~~~~~~-~~~~~~~~~~~--~r~~~~~~~~~~~ 361 (858) T COG1674 286 GPHGLVAGTTGSGKSELL-TYILSLAINHSPEELNFLLIDYKGGSMANPFKGL-HVVAVITNLAE--ERALAAIKGELAR 361 (858) T ss_pred CCCCCCCCCCCCCCCCCC-CHHCCCCCCCCCHHHHHHHCCCCCCCCCCCCCCC-CCCCCCCCCCC--CCCCCCCCHHHHH T ss_conf 421012467655732100-0102333358814420000120012321222221-10001014541--0000013103443 Q ss_pred HHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCHHHH Q ss_conf 99999870899689999999988744786677544677654543222332232346986877634468888732100588 Q gi|254780799|r 533 RYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKDIES 612 (806) Q Consensus 533 RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~ve~ 612 (806) |.+ +++..|.+... . .....+||-+-++.||++++.-.. .+-++ T Consensus 362 r~~-------~~~~~~~~~~~----~------------------------~~~~~~lP~l~ll~~~~~~~~~~~-~~~~~ 405 (858) T COG1674 362 RQR-------NHINAYTKSYK----R------------------------GKDTPPLPHLFLIADEFAELKSEH-PDFAE 405 (858) T ss_pred HHC-------CCCCCCCHHHC----C------------------------CCCCCCCCCHHHCCCCCCCCCCCC-CCHHH T ss_conf 100-------23234201102----5------------------------688788982212055432234577-47788 Q ss_pred HHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCHHHHCCCCCEEEEC-CCCCEEE Q ss_conf 999999866414237999965777535543554110251587645866421233886457865887547736-8983258 Q gi|254780799|r 613 AVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGAEQLLGQGDMLYMT-GGGRVQR 691 (806) Q Consensus 613 ~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~gae~Llg~gdml~~~-~~~~~~r 691 (806) .++..+...|.-++|+|.|||.|.... ..|-==||+.++.++.||+++|...=|-.|-=.+---..| ||-..+. T Consensus 406 l~~~~~~l~~sl~~f~i~a~~~~~~~g-----p~vt~yei~l~~gvk~~~i~~L~~d~A~~L~~~~~ri~~~ipgk~~ig 480 (858) T COG1674 406 LLVSIARLGRSLGIHLILATQKPDGVG-----PSNTRFEIALKVGVKVDSIEILGNDDAAALVAKSRRYLAPIPGKSYIG 480 (858) T ss_pred HHHHHHHHHHHHHHHEEEEEEEECCCC-----CCCEEEEEEECCCCHHHHHHHCCCHHHHHCCCCCEEEEEEECCCCEEE T ss_conf 999999998878864435789605678-----851269986456621667760473778850144605898747975378 Q ss_pred EEECCCC Q ss_conf 8833489 Q gi|254780799|r 692 IHGPFVS 698 (806) Q Consensus 692 ~~g~~v~ 698 (806) +|-++.. T Consensus 481 ie~pn~~ 487 (858) T COG1674 481 FQSPNSG 487 (858) T ss_pred EECCCCC T ss_conf 9746889 No 7 >cd01127 TrwB Bacterial conjugation protein TrwB, ATP binding domain. TrwB is a homohexamer encoded by conjugative plasmids in Gram-negative bacteria. TrwB also has an all alpha domain which has been hypothesized to be responsible for DNA binding. TrwB is a component of Type IV secretion and is responsible for the horizontal transfer of DNA between bacteria. Probab=98.03 E-value=6.8e-06 Score=63.78 Aligned_cols=53 Identities=36% Similarity=0.565 Sum_probs=43.4 Q ss_pred CCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHH Q ss_conf 66666678541002023553047740679999999999982995784788852310011 Q gi|254780799|r 441 SIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLEL 499 (806) Q Consensus 441 dI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEl 499 (806) .|.|-|+.-+. ..-|.||.|+||||||++|+-+|-+++-+-. ++|..|||= |+ T Consensus 30 ~i~~~p~p~~~-e~~H~lv~G~tGsGKT~~i~~li~~~~~rg~----~~II~DpkG-e~ 82 (410) T cd01127 30 TIAGLPFPKDA-EEAHTMIIGTTGTGKTTQIRELLASIRARGD----RAIIYDPNG-GF 82 (410) T ss_pred EECCCCCCCCH-HHCCEEEECCCCCCHHHHHHHHHHHHHHCCC----EEEEEECCC-HH T ss_conf 68986278652-0274799889999889999999999998699----099995885-49 No 8 >COG0433 HerA helicase [Replication, recombination, and repair] Probab=97.96 E-value=0.00039 Score=50.59 Aligned_cols=71 Identities=25% Similarity=0.465 Sum_probs=55.1 Q ss_pred EEEEHHHHHHHHHHCCC----HHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCC Q ss_conf 87763446888873210----05889999998664142379999657775355435541102515876458664212338 Q gi|254780799|r 592 IVVVIDEMADLMMVARK----DIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILG 667 (806) Q Consensus 592 ivviiDElaDlmm~~~~----~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild 667 (806) ++++|||--.. +++ .+...+.|+|.-.|.-|+.++++||||| -+--.+.+|.-+++-|+..+..|-+-|.- T Consensus 393 ~~~iieea~~~---~p~~~~~~~~~~~~~iar~gRk~~v~l~~~tQrps--~l~~~v~~~~~tkii~r~~~~~d~~~~~~ 467 (520) T COG0433 393 LLIVIEEAHNF---APSSRKAPVKDIIERIAREGRKFGVGLVLATQRPS--DLDDLVLSQANTKIILRLVEPTDQKYISR 467 (520) T ss_pred CCEEHHHHHHH---CCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEECHH--HCCHHHHHCCCHHHHHCCCCCHHHHHHHH T ss_conf 41140777763---87554214689999999860723541357886767--63078986103497740578223666565 No 9 >pfam10412 TrwB_AAD_bind Type IV secretion-system coupling protein DNA-binding domain. The plasmid conjugative coupling protein TrwB forms hexamers from six structurally very similar protomers. This hexamer contains a central channel running from the cytosolic pole (made up by the AADs) to the membrane pole ending at the transmembrane pore shaped by 12 transmembrane helices, rendering an overall mushroom-like structure. The TrwB_AAD (all-alpha domain) domain appears to be the DNA-binding domain of the structure. TrwB, a basic integral inner-membrane nucleoside-triphosphate-binding protein, is the structural prototype for the type IV secretion system coupling proteins, a family of proteins essential for macromolecular transport between cells and export. Probab=97.76 E-value=3.7e-05 Score=58.22 Aligned_cols=61 Identities=21% Similarity=0.297 Sum_probs=36.5 Q ss_pred EEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCC-CCCCH-----HHHHCCCCEEEEEECC Q ss_conf 8776344688887321005889999998664142379999657775-35543-----5541102515876458 Q gi|254780799|r 592 IVVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSV-DVITG-----TIKANFPTRISFQVSS 658 (806) Q Consensus 592 ivviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsv-dvitg-----~ikan~p~riaf~v~s 658 (806) +.+++|||+-|- .=..+|+.+. .+|..||++++++|-.+= +-+=| .|.+|.-++|.|++.. T Consensus 245 v~~~lDEf~~lg--~i~~l~~~l~----~~Rs~gi~~~l~~Qs~~QL~~~YG~~~a~~il~n~~~~i~~~~~d 311 (386) T pfam10412 245 LWFFLDELPSLH--KLPSLADALA----EGRKFGGCVVLGIQSFAQLEEVYGEDGAQTLLGLCNTKLILRVSD 311 (386) T ss_pred EEEEEECHHHCC--CCHHHHHHHH----HHHCCCCEEEEEEECHHHHHHHHCHHHHHHHHHHCCCEEEEECCC T ss_conf 499998711027--6688999999----864078499999838999999879889999996256599982799 No 10 >smart00382 AAA ATPases associated with a variety of cellular activities. AAA - ATPases associated with a variety of cellular activities. This profile/alignment only detects a fraction of this vast family. The poorly conserved N-terminal helix is missing from the alignment. Probab=97.60 E-value=0.00021 Score=52.67 Aligned_cols=141 Identities=20% Similarity=0.199 Sum_probs=78.2 Q ss_pred CCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 02023553047740679999999999982995784788852310011102770343122334304566899999999999 Q gi|254780799|r 453 RMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEE 532 (806) Q Consensus 453 kMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~ 532 (806) +.+|+++.|.+|||||..+..++..+.... -..+.+++....-..+.... .-.. T Consensus 1 ~~~~ill~G~~GsGKTtl~~~la~~~~~~~----~~v~~~~~~~~~~~~~~~~~----------------------~~~~ 54 (148) T smart00382 1 PGEVILIVGPPGSGKTTLARALARELGPPG----GGVIYIDGEDILEEVLDQLL----------------------LIIV 54 (148) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHHCCCC----CCEEEEEHHHHHHHHHHHHH----------------------HHHH T ss_conf 997899999997029999999998726689----96899875998988898765----------------------3000 Q ss_pred HHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCHHHH Q ss_conf 99999870899689999999988744786677544677654543222332232346986877634468888732100588 Q gi|254780799|r 533 RYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKDIES 612 (806) Q Consensus 533 RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~ve~ 612 (806) .+......+.+.+...++. .+..++.|++|||+-.+.......... T Consensus 55 ~~~~~~~~~~~~~~~~~~~----------------------------------~~~~~~~viiiDei~~~~~~~~~~~~~ 100 (148) T smart00382 55 GGKKASGSGELRLRLALAL----------------------------------ARKLKPDVLILDEITSLLDAEQEALLL 100 (148) T ss_pred CCCCCCCCHHHHHHHHHHH----------------------------------HHHCCCCEEEEECCHHHCCCCCHHHHH T ss_conf 1122105199999999999----------------------------------984499899982750214762079999 Q ss_pred H---HHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEE Q ss_conf 9---99999866414237999965777535543554110251587 Q gi|254780799|r 613 A---VQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISF 654 (806) Q Consensus 613 ~---i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf 654 (806) . -....++.....+++|.+|+. ..+.+.+.++--+..++-+ T Consensus 101 ~~~~~~~~~~~~~~~~~~vi~~~n~-~~~~~~~~~~~~~~~~~~~ 144 (148) T smart00382 101 LLEELRLLLLLKSEKNLTVILTTND-EKDLGPALLRRRFDRRIVL 144 (148) T ss_pred HHHHHHHHCCCCCCCCEEEEEEECC-CCCCCHHHHCCCCCEEEEE T ss_conf 9999998517657899899995699-5224987707447879998 No 11 >TIGR02759 TraD_Ftype type IV conjugative transfer system coupling protein TraD; InterPro: IPR014128 The TraD protein performs an essential coupling function in conjugative type IV secretion systems. This protein sits at the inner membrane in contact with the assembled pilus and its scaffold as well as the relaxosome-plasmid DNA complex (through TraM) , .; GO: 0000746 conjugation. Probab=97.48 E-value=7.4e-05 Score=56.01 Aligned_cols=23 Identities=57% Similarity=0.873 Sum_probs=9.9 Q ss_pred EEEEEECCCCHHHHHHHHHHHHH Q ss_conf 23553047740679999999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSL 478 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~Sl 478 (806) |+||-|||||||||||.-++.++ T Consensus 210 h~L~~GTtG~GKs~~lr~LL~~i 232 (613) T TIGR02759 210 HILIHGTTGSGKSVALRKLLRWI 232 (613) T ss_pred CEEEECCCCCHHHHHHHHHHHHH T ss_conf 52664541743899999999999 No 12 >PRK13700 conjugal transfer protein TraD; Provisional Probab=97.36 E-value=0.00026 Score=51.89 Aligned_cols=211 Identities=21% Similarity=0.219 Sum_probs=112.4 Q ss_pred CCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHH------------H---------- Q ss_conf 666667854100202355304774067999999999998299578478885231001------------1---------- Q gi|254780799|r 442 IEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLE------------L---------- 499 (806) Q Consensus 442 I~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vE------------l---------- 499 (806) |+|-|++-+= .-=|+||.||||||||+.||.|+-.+-=|-. |-|..||.--= | T Consensus 174 i~glpl~k~s-E~qH~li~GTtGtGKS~~ir~LL~qIR~RGd----rAIIyD~~G~Fv~~FY~p~~DiILNPfDaRc~~W 248 (732) T PRK13700 174 IGDLPIIRDS-EIQNFCLHGTVGAGKSEVIRRLANYARQRGD----MVVIYDRSGEFVKSYYDPSIDKILNPLDARCAAW 248 (732) T ss_pred ECCCCCCCCC-HHHEEEEECCCCCCHHHHHHHHHHHHHHCCC----CEEEECCCCCCHHHHCCCCCCEEECCHHHCCCCC T ss_conf 6786667652-0312677468888899999999999997299----5899939985339763988886548201058987 Q ss_pred HHCCCC---------HHHHCCCCCCHH-HHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH-------------------- Q ss_conf 102770---------343122334304-56689999999999999999870899689999-------------------- Q gi|254780799|r 500 SVYDGI---------PNLLTPVVTNPQ-KAVTVLKWLVCEMEERYQKMSKIGVRNIDGFN-------------------- 549 (806) Q Consensus 500 s~Y~~i---------PHLl~pVvTd~~-kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN-------------------- 549 (806) ++++++ -+=|.|.-++.. -=+.+=+-+..|+-+++ ...+-|+++..= T Consensus 249 s~w~E~~~~~Dfd~iA~sLIP~~~~~DpFW~~sARtIfae~~~~L---~~~~~~t~~~Ll~~ll~~~le~L~~~L~gT~A 325 (732) T PRK13700 249 DLWKECLTQPDFDNTANTLIPMGTKEDPFWQGSGRTIFAEAAYLM---RNDPNRSYSKLVDTLLSIKIEKLRTYLRNSPA 325 (732) T ss_pred CHHHHHCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHH---HHCCCCCHHHHHHHHHHCCHHHHHHHHCCCCC T ss_conf 557660787649999997489999998068888999999999999---86589999999999983799999999769950 Q ss_pred --------HHH------------HH--H----HHCCCCCCCCCCCC---CCCCCCCCCC----CCCC------------- Q ss_conf --------999------------98--8----74478667754467---7654543222----3322------------- Q gi|254780799|r 550 --------LKV------------AQ--Y----HNTGKKFNRTVQTG---FDRKTGEAIY----ETEH------------- 583 (806) Q Consensus 550 --------~k~------------~~--~----~~~~~~~~~~~~~~---~~~~~~~~~~----~~~~------------- 583 (806) +|. +. . ...|++++=+-|.. .|...+ -+| +++. T Consensus 326 a~lv~~~aeKta~SIrsvL~~yi~~LRyL~~l~~~g~~FSIRdW~~~~~ed~~~g-wLFITs~~~q~~aLrPLIS~Wldi 404 (732) T PRK13700 326 ANLVEEKIEKTAISIRAVLTNYVKAIRYLQGIEHNGEPFTIRDWMRGVREDQKNG-WLFISSNADTHASLKPVISMWLSI 404 (732) T ss_pred HHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHCCCCCCCC-EEEEECCHHHHHHHHHHHHHHHHH T ss_conf 5301876088999999999987888763034788899841889755345688887-699815788998888899999999 Q ss_pred -------CCCCCCCEEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC-CCCCCH-----HHHHCCCC Q ss_conf -------32346986877634468888732100588999999866414237999965777-535543-----55411025 Q gi|254780799|r 584 -------FDFQHMPYIVVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS-VDVITG-----TIKANFPT 650 (806) Q Consensus 584 -------~~~~~lp~ivviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs-vdvitg-----~ikan~p~ 650 (806) .....=..|=+|+|||+.| + .+ ..+....+.+|--|=..|+.-|--+ .+-|=| .|=..+-| T Consensus 405 A~~aLLsl~~~~~RRiWfiiDELpSL----~-KL-p~L~~~Lae~RKfGGc~VlG~Qs~aQL~~iYG~~~A~ti~dl~nT 478 (732) T PRK13700 405 AIRGLLAMGENRNRRVWFFCDELPTL----H-KL-PDLVEILPEARKFGGCYVFGIQSYAQLEDIYGEKAAATLFDVMNT 478 (732) T ss_pred HHHHHHCCCCCCCCCEEEEEECCCCC----C-CC-HHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99998479998886159997544343----3-40-237899987520377589960649999998778789999986416 Q ss_pred EEEEEECCCCCCHHHCC Q ss_conf 15876458664212338 Q gi|254780799|r 651 RISFQVSSKIDSRTILG 667 (806) Q Consensus 651 riaf~v~s~~dSrtild 667 (806) |+-|++.|.---+..=+ T Consensus 479 kl~fR~~d~~tA~~~s~ 495 (732) T PRK13700 479 RAFFRSPSHKIAEFAAG 495 (732) T ss_pred EEEEECCCHHHHHHHHH T ss_conf 68986798789999998 No 13 >pfam01935 DUF87 Domain of unknown function DUF87. The function of this prokaryotic domain is unknown. It contains several conserved aspartates and histidines that could be metal ligands. Probab=97.29 E-value=0.00048 Score=49.95 Aligned_cols=55 Identities=31% Similarity=0.654 Sum_probs=44.7 Q ss_pred CCCCCC--CCEEEEECCCC--CEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCC Q ss_conf 045666--66678541002--02355304774067999999999998299578478885231 Q gi|254780799|r 438 LGKSIE--GKPIIADLARM--PHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPK 495 (806) Q Consensus 438 LGKdI~--G~pvv~DLakM--PHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK 495 (806) +|+=++ .-|+..|+.|+ -|+.|=|+||||||..+-.|+-+|+=+ .....|.+||- T Consensus 3 IG~l~~~~~v~v~ld~~~lv~rH~aIlg~TGsGKS~tv~vLl~~l~~~---~~~~vlVfDpH 61 (218) T pfam01935 3 IGRLLDGSEVPVYLDLNKLVSRHFAILGSTGSGKSNTVAVLLEELLEK---KGATVLIFDPH 61 (218) T ss_pred ECCCCCCCCCEEEEEHHHHHHHHEEEECCCCCCHHHHHHHHHHHHHHC---CCCCEEEECCC T ss_conf 023168898138963899634214787269997699999999999854---79978998288 No 14 >COG3451 VirB4 Type IV secretory pathway, VirB4 components [Intracellular trafficking and secretion] Probab=97.25 E-value=0.00068 Score=48.79 Aligned_cols=98 Identities=23% Similarity=0.393 Sum_probs=67.4 Q ss_pred HHHCCCCCHHHCCCCCCCCEEEEECCCC---CEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHH--- Q ss_conf 4302210002104566666678541002---0235530477406799999999999829957847888523100111--- Q gi|254780799|r 427 FEKNQCDLAINLGKSIEGKPIIADLARM---PHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELS--- 500 (806) Q Consensus 427 f~~s~~~L~iaLGKdI~G~pvv~DLakM---PHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls--- 500 (806) +......=|+.++|..+|.|+..++-.= .|-+|=|+|||||||.+|=++++.+-... -+.+.+|+=. ++- T Consensus 406 ~~~n~wg~~l~~~kt~~~sPf~~nfH~~~d~ghT~I~G~tGaGKTvLl~~lla~~~k~~~---~~iv~fDk~~-g~~~~~ 481 (796) T COG3451 406 FRGNPWGEPLTVLKTEDGSPFYFNFHVGEDVGHTLIIGPTGAGKTVLLSFLLAQALKYGN---PQIVAFDKDN-GAYIFI 481 (796) T ss_pred CCCCCCCCCCEEEECCCCCCEEEECCCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHCC---CCEEEECCCC-CHHHHH T ss_conf 668889975054204899962564355667697499889888789999999999987459---8189984897-357788 Q ss_pred -----HC----CCCHHHHCC---CCCCHHHHHHHHHHHHH Q ss_conf -----02----770343122---33430456689999999 Q gi|254780799|r 501 -----VY----DGIPNLLTP---VVTNPQKAVTVLKWLVC 528 (806) Q Consensus 501 -----~Y----~~iPHLl~p---VvTd~~kA~~aL~w~V~ 528 (806) .| .|.|-=+.| +......+.-..+|.+. T Consensus 482 ~a~gG~y~~l~~~~~~~~NPf~~l~~t~~n~~fl~~~~~~ 521 (796) T COG3451 482 EALGGEYLDLEPGVPSGFNPFEILEDTDENREFLAEWLVK 521 (796) T ss_pred HHHCCEEECCCCCCCCCCCCHHHCCCCHHHHHHHHHHHHH T ss_conf 8749887535678774608411037865668999999999 No 15 >cd01122 GP4d_helicase GP4d_helicase is a homohexameric 5'-3' helicases. Helicases couple NTP hydrolysis to the unwinding of nucleic acid duplexes into their component strands. Probab=96.98 E-value=0.0039 Score=43.13 Aligned_cols=142 Identities=21% Similarity=0.334 Sum_probs=74.1 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHH-------HHHCCCCH-HHHCCCCCCHHHHHHHHHHHHH Q ss_conf 355304774067999999999998299578478885231001-------11027703-4312233430456689999999 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLE-------LSVYDGIP-NLLTPVVTNPQKAVTVLKWLVC 528 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vE-------ls~Y~~iP-HLl~pVvTd~~kA~~aL~w~V~ 528 (806) ..|||-||+|||..+..++..+.-++. .+..++-.-|-. ++...++| |....-..+..+ -++-++. T Consensus 33 ~viaarpg~GKT~f~~~~a~~~~~~~g---~~vl~~SlEm~~~~~~~Rlls~~~g~~~~~~~~~~~~~~e---~~~~~~~ 106 (271) T cd01122 33 IILTAGTGVGKTTFLREYALDLITQHG---VRVGTISLEEPVVRTARRLLGQYAGKRLHLPDTVFIYTLE---EFDAAFD 106 (271) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHHCC---CEEEEEECCCCHHHHHHHHHHHHHCCCHHHCCCCCCCHHH---HHHHHHH T ss_conf 999968998699999999999999769---9089997049999999999999829971103446778099---9999999 Q ss_pred HHHH--HHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHC Q ss_conf 9999--99999870899689999999988744786677544677654543222332232346986877634468888732 Q gi|254780799|r 529 EMEE--RYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVA 606 (806) Q Consensus 529 EMe~--RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~ 606 (806) +++. ++-+.-..+.-+++...++++.+.+... . =+||||=+ .+|+.. T Consensus 107 ~~~~~~~l~i~d~~~~~~~~~i~~~ir~~~~~~~----------------------------~--~~vvIDyl-qll~~~ 155 (271) T cd01122 107 EFEGTGRLFMYDSFGEYSMDSVLEKVRYMAVSHG----------------------------I--QHIIIDNL-SIMVSD 155 (271) T ss_pred HHCCCCCEEEECCCCCCCHHHHHHHHHHHHHHCC----------------------------C--CEEEEEHH-HHCCCC T ss_conf 9707998088789999889999999999998289----------------------------9--88998317-850367 Q ss_pred CC-------HHHHHHHHHHHHHHHCCEEEEEEEC--CC Q ss_conf 10-------0588999999866414237999965--77 Q gi|254780799|r 607 RK-------DIESAVQRLAQMARASGIHVIMATQ--RP 635 (806) Q Consensus 607 ~~-------~ve~~i~rlaq~ara~GiHli~aTq--rP 635 (806) +. ++.....+|-.+|+--+||.|+..| |+ T Consensus 156 ~~~~~d~~~~i~~i~~~Lk~lAke~~v~Vi~lsQlnR~ 193 (271) T cd01122 156 ERASGDERKALDEIMTKLRGFATEHGIHITLVSHLRRP 193 (271) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCC T ss_conf 86773189999999999999999979977999526765 No 16 >cd00984 DnaB_C DnaB helicase C terminal domain. The hexameric helicase DnaB unwinds the DNA duplex at the chromosome replication fork. Although the mechanism by which DnaB both couples ATP hydrolysis to translocation along DNA and denatures the duplex is unknown, a change in the quaternary structure of the protein involving dimerization of the N-terminal domain has been observed and may occur during the enzymatic cycle. This C-terminal domain contains an ATP-binding site and is therefore probably the site of ATP hydrolysis. Probab=96.95 E-value=0.013 Score=39.26 Aligned_cols=142 Identities=17% Similarity=0.205 Sum_probs=70.8 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHH-------HHHHCCCCH--HHHCCCCCCHHHHHHHHHHHH Q ss_conf 35530477406799999999999829957847888523100-------111027703--431223343045668999999 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKML-------ELSVYDGIP--NLLTPVVTNPQKAVTVLKWLV 527 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~v-------Els~Y~~iP--HLl~pVvTd~~kA~~aL~w~V 527 (806) .+|||.||.|||-..-.+...+.-+. ..+..++=.-|= -++.+.++| ++..+-.++- -...+..+. T Consensus 16 ~vi~a~~g~GKS~~~~~la~~~a~~~---g~~V~~~SlEm~~~~~~~R~~s~~~~i~~~~i~~~~~~~~--~~~~~~~~~ 90 (242) T cd00984 16 IIIAARPSMGKTAFALNIAENIAKKQ---GKPVLFFSLEMSKEQLLQRLLASESGISLSKLRTGSLSDE--DWERLAEAI 90 (242) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHHC---CCCEEEEECCCCHHHHHHHHHHHHHCCCHHHHHCCCCCHH--HHHHHHHHH T ss_conf 99996899999999999999999977---9959999333538899999999982977455302652279--999999999 Q ss_pred HHHHHHHHHH-HHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHC Q ss_conf 9999999999-870899689999999988744786677544677654543222332232346986877634468888732 Q gi|254780799|r 528 CEMEERYQKM-SKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVA 606 (806) Q Consensus 528 ~EMe~RY~l~-a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~ 606 (806) .++.+. .++ -.....+++.-..+++....+. . | =+|||| ..+||... T Consensus 91 ~~~~~~-~l~i~d~~~~t~~~i~~~ir~~~~~~----------------------------~-~-~~vvvD-ylql~~~~ 138 (242) T cd00984 91 GELKEL-PIYIDDSSSLTVSDIRSRARRLKKEH----------------------------G-L-GLIVID-YLQLMSGS 138 (242) T ss_pred HHHCCC-CEEEECCCCCCHHHHHHHHHHHHHHC----------------------------C-C-CEEEEE-HHHHCCCC T ss_conf 986169-88996699999999999999998836----------------------------9-9-899982-69854677 Q ss_pred CC------HHHHHHHHHHHHHHHCCEEEEEEECCC Q ss_conf 10------058899999986641423799996577 Q gi|254780799|r 607 RK------DIESAVQRLAQMARASGIHVIMATQRP 635 (806) Q Consensus 607 ~~------~ve~~i~rlaq~ara~GiHli~aTqrP 635 (806) ++ ++.....+|-++|+..+|+.|++.|=. T Consensus 139 ~~~~~~~~~i~~i~~~Lk~lA~e~~v~Vi~~sQln 173 (242) T cd00984 139 KKKGNRQQEVAEISRSLKLLAKELNVPVIALSQLS 173 (242) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHCCEEEEEECCC T ss_conf 76657999999999999999999799399984678 No 17 >cd00009 AAA The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases. Probab=96.90 E-value=0.01 Score=39.90 Aligned_cols=58 Identities=26% Similarity=0.400 Sum_probs=38.2 Q ss_pred CCEEEEEHHHHHHHHHHCCCHHHHHHHHHHHHH-----HHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEE Q ss_conf 986877634468888732100588999999866-----414237999965777535543554110251587 Q gi|254780799|r 589 MPYIVVVIDEMADLMMVARKDIESAVQRLAQMA-----RASGIHVIMATQRPSVDVITGTIKANFPTRISF 654 (806) Q Consensus 589 lp~ivviiDElaDlmm~~~~~ve~~i~rlaq~a-----ra~GiHli~aTqrPsvdvitg~ikan~p~riaf 654 (806) ...-|++|||.-.| +.+......++-+.. +-..+++|+||++|.- +.+.-++++|+-. T Consensus 83 ~~~~vl~iDEi~~l----~~~~~~~~~~~l~~~~~~~~~~~~~~vI~~tn~~~~----~~~~~~~~~R~~~ 145 (151) T cd00009 83 AKPGVLFIDEIDSL----SRGAQNALLRVLETLNDLRIDRENVRVIGATNRPLL----GDLDRALYDRLDI 145 (151) T ss_pred CCCCEEEEECHHHC----CHHHHHHHHHHHHHHCCCCCCCCCEEEEEEECCCCC----CCHHHHHHCCCCE T ss_conf 69986982016655----999999999999871575406788899995289988----6837764255986 No 18 >PRK13898 type IV secretion system ATPase VirB4; Provisional Probab=96.88 E-value=0.0024 Score=44.71 Aligned_cols=213 Identities=21% Similarity=0.375 Sum_probs=109.4 Q ss_pred HHHCCCCCCCCEEEEECC--CCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCH-HHHH------HCC-- Q ss_conf 021045666666785410--02023553047740679999999999982995784788852310-0111------027-- Q gi|254780799|r 435 AINLGKSIEGKPIIADLA--RMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKM-LELS------VYD-- 503 (806) Q Consensus 435 ~iaLGKdI~G~pvv~DLa--kMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~-vEls------~Y~-- 503 (806) ++++=++.+|.|+..++- ..-|-||-|.|||||||.+|.+++. +.|+.| |.+-+|=.+ .|.- .|. T Consensus 425 av~~~~T~~gtpy~fNfH~~d~GHtlI~G~TGsGKTtl~~fL~aq-~~ky~~---~~f~fDkd~~~~i~~~a~GG~Y~~l 500 (800) T PRK13898 425 AVTVFDTTSGTPFYFNFHVRDVGHTLIIGPTGAGKTVLMNFLCAQ-AMKFSP---RMFFFDKDRGAEIFIRALNGVYTVI 500 (800) T ss_pred CCEECCCCCCCCEEEECCCCCCCCEEEECCCCCCHHHHHHHHHHH-HHHHCC---EEEEEECCCCHHHHHHHHCCEEEEC T ss_conf 665030389987798674598775699899999899999999999-875488---7999979998699999829888743 Q ss_pred --CCHHHHCCCCCC--HHHHH----------------------HHHHHHHH---HH---HHHHHHHHH-CCCCCHHHHHH Q ss_conf --703431223343--04566----------------------89999999---99---999999987-08996899999 Q gi|254780799|r 504 --GIPNLLTPVVTN--PQKAV----------------------TVLKWLVC---EM---EERYQKMSK-IGVRNIDGFNL 550 (806) Q Consensus 504 --~iPHLl~pVvTd--~~kA~----------------------~aL~w~V~---EM---e~RY~l~a~-~~vRni~~yN~ 550 (806) |-|-=+.|--.+ +..-. ..+.-+|+ .| +||...+.. .++.+-.+--+ T Consensus 501 ~~g~~tg~nP~~l~dt~~n~~fl~~~l~~l~~~~g~~~t~~~~~~I~~av~~~~~l~~~~r~ls~l~~~l~~~~~~~L~~ 580 (800) T PRK13898 501 EPRLKCNFNPLQLDDTPENRTFLMEWLKVLVTSNGESLTAQDIKRINDAVEGNFKLKKEDRRLSNLVAFLGIDGPNTLAG 580 (800) T ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHCCCCCHHHHH T ss_conf 79985686877799986788999999999997379999999999999999998608921132657998735488468999 Q ss_pred HHHHHHHCCCC---CCCCCCCCCCCCCCCC-CCC------CC--------------CCCCCCCCEEEEEHHHHHHHHHHC Q ss_conf 99988744786---6775446776545432-223------32--------------232346986877634468888732 Q gi|254780799|r 551 KVAQYHNTGKK---FNRTVQTGFDRKTGEA-IYE------TE--------------HFDFQHMPYIVVVIDEMADLMMVA 606 (806) Q Consensus 551 k~~~~~~~~~~---~~~~~~~~~~~~~~~~-~~~------~~--------------~~~~~~lp~ivviiDElaDlmm~~ 606 (806) +.+.-...|+- +..+ .+.+|.....- .++ ++ +.-++-=|. ++||||+-.+.. T Consensus 581 rL~~w~~~G~~g~~fDn~-~D~l~~~~~~~~gfd~t~ll~~~~~~~pvl~Ylf~rie~~ldG~p~-ii~iDE~W~~l~-- 656 (800) T PRK13898 581 RIAMWHGKGSHAAIFDNE-EDLLDFQKARVFGFEMTELLKDPVSLAPVLLYLFHRISISLDGTPS-MIVLDEAWALID-- 656 (800) T ss_pred HHHHHHCCCCEEECCCCC-CCCCCCCCCCEEEEECHHHCCCCHHHHHHHHHHHHHHHHHCCCCCE-EEEEECHHHHHC-- T ss_conf 999986699805224897-5566866680899984675288204899999999999975289826-999603366637-- Q ss_pred CCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCC---CCHHHHHCCCCEEEEE Q ss_conf 100588999999866414237999965777535---5435541102515876 Q gi|254780799|r 607 RKDIESAVQRLAQMARASGIHVIMATQRPSVDV---ITGTIKANFPTRISFQ 655 (806) Q Consensus 607 ~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdv---itg~ikan~p~riaf~ 655 (806) -.-...-|.---.-.|-.+-.+|+|||.|+-=+ |...|.-|.||+|=+. T Consensus 657 ~~~f~~~i~~~lkt~RK~N~~vv~aTQs~~d~~~s~i~~~iieq~~T~I~LP 708 (800) T PRK13898 657 NPVFAPKIKDWLKVLRKLNTFVIFATQSVEDASKSAISDTLVQQTATQIFLP 708 (800) T ss_pred CHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHHCCCHHHHHHHHCCEEEECC T ss_conf 9999999999999998739779998088999862846899998688579887 No 19 >cd01126 TraG_VirD4 The TraG/TraD/VirD4 family are bacterial conjugation proteins involved in type IV secretion. These proteins aid the transfer of DNA from the plasmid into the host bacterial chromosome. They contain an ATP binding domain. VirD4 is involved in DNA transfer to plant cells and is required for virulence. Probab=96.88 E-value=0.0011 Score=47.19 Aligned_cols=40 Identities=18% Similarity=0.364 Sum_probs=28.2 Q ss_pred EEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 6877634468888732100588999999866414237999965777 Q gi|254780799|r 591 YIVVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 591 ~ivviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) -+.++.|||+-| | -+. .+..+.-.+|-.|||+++++|-.+ T Consensus 264 ~v~~~lDEf~nl----~-~ip-~l~~~ls~~Rs~Gi~~~~~~Qs~~ 303 (384) T cd01126 264 RVLFLLDEFPLL----G-KLE-TFEKAIAIMAGYGIRLLLIFQSLA 303 (384) T ss_pred CEEEEEECHHHC----C-CHH-HHHHHHHHHHCCCEEEEEEEECHH T ss_conf 559998661101----7-738-899999987428919999995499 No 20 >PRK13873 conjugal transfer ATPase TrbE; Provisional Probab=96.64 E-value=0.0024 Score=44.67 Aligned_cols=305 Identities=19% Similarity=0.317 Sum_probs=148.8 Q ss_pred CCHHHCCCCCCCCEEEEECC--CCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCH-HHH------HHCC Q ss_conf 00021045666666785410--02023553047740679999999999982995784788852310-011------1027 Q gi|254780799|r 433 DLAINLGKSIEGKPIIADLA--RMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKM-LEL------SVYD 503 (806) Q Consensus 433 ~L~iaLGKdI~G~pvv~DLa--kMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~-vEl------s~Y~ 503 (806) .=|+.+.++-+|.|+..|+- .--|-||-|.|||||||.+|.|.+. ..|+.+. |.+..|=.+ .|. ..|- T Consensus 418 ~p~l~~~~T~g~tPf~fN~H~gdvGHtlI~GpTGsGKTvll~~l~~q-~~ry~~~--~vf~FDkd~s~~i~~~a~GG~y~ 494 (815) T PRK13873 418 GPPLLYGRTEGSTPFRLSLHVGDVGHTLVVGPTGAGKSVLLALMALQ-FRRYPGS--QVFAFDFGGSIRAAALAMGGDWH 494 (815) T ss_pred CCEEEEEECCCCCCEEEECCCCCCCCEEEECCCCCCHHHHHHHHHHH-HHHCCCC--CEEEEECCCCHHHHHHHHCCEEE T ss_conf 88058863489996687664688764389788999899999999999-8644898--48999789878999998299876 Q ss_pred CC----------HHHHCCC--CCCHHHHHHHHHHHHHHHHHH---------------HHHHHH--CCCCCHHHHHHHH-- Q ss_conf 70----------3431223--343045668999999999999---------------999987--0899689999999-- Q gi|254780799|r 504 GI----------PNLLTPV--VTNPQKAVTVLKWLVCEMEER---------------YQKMSK--IGVRNIDGFNLKV-- 552 (806) Q Consensus 504 ~i----------PHLl~pV--vTd~~kA~~aL~w~V~EMe~R---------------Y~l~a~--~~vRni~~yN~k~-- 552 (806) +| |-=+.|- +.++.+-+-+..|+..=+++. -+-++. ..-|.+.++-.-+ T Consensus 495 ~lg~~~~~~~~~~~~f~Pl~~~d~~~~r~~~~~wi~~ll~~~g~~~tp~~~~~i~~Av~~l~~~~~~~Rtls~l~~~l~~ 574 (815) T PRK13873 495 DLGGGLSDGADGSVALQPLARIDDPAERAWAAEWIAAILAREGVAVTPEVKEHIWSALTSLASAPVEERTLTGLSVLLQS 574 (815) T ss_pred ECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCHHHCCHHHHHHHHCC T ss_conf 03565556789987779876889978889999999999961798889899999999999985198521739999988188 Q ss_pred ---HHHH---HCCCCCCCCCCCCCCCCCCCC------CCCCCCC-------------------CCCCCCEEEEEHHHHHH Q ss_conf ---9887---447866775446776545432------2233223-------------------23469868776344688 Q gi|254780799|r 553 ---AQYH---NTGKKFNRTVQTGFDRKTGEA------IYETEHF-------------------DFQHMPYIVVVIDEMAD 601 (806) Q Consensus 553 ---~~~~---~~~~~~~~~~~~~~~~~~~~~------~~~~~~~-------------------~~~~lp~ivviiDElaD 601 (806) +.+. ..|.+.. --+|.+.+.- .+|.+.. -++-=| .+++|||.-- T Consensus 575 ~~L~~~L~~w~~~G~~G----~lfD~~~D~l~~~~~~~Fe~~~l~~~~~~~pvl~YLf~rie~~lDGrp-tli~iDEaW~ 649 (815) T PRK13873 575 NALKQALRPYCLGGPYG----RLLDAEHERLGSADVQAFETEGLMGTAAAPAVLSYLFHRIEDRFDGRP-TLLILDEGWL 649 (815) T ss_pred CHHHHHHHHHHCCCCCE----ECCCCCCCCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCC-EEEEHHHHHH T ss_conf 14999999983789845----426886567776773699873037865178999999999999758996-6997586587 Q ss_pred HHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCC----CCHHHHHCCCCEEEEEECCCC--CCHHHCCCCCHHHHC Q ss_conf 88732100588999999866414237999965777535----543554110251587645866--421233886457865 Q gi|254780799|r 602 LMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDV----ITGTIKANFPTRISFQVSSKI--DSRTILGEQGAEQLL 675 (806) Q Consensus 602 lmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdv----itg~ikan~p~riaf~v~s~~--dSrtild~~gae~Ll 675 (806) +. .-.-...-|..-----|-..-.+|+|||.|+ |+ |-..|.-|.|+||=+--.... .+|.+- T Consensus 650 ~L--~~~~F~~~i~~wLkT~RK~N~~vv~aTQS~~-di~~S~I~~aiie~c~T~IfLPNp~A~~~~~~~~Y--------- 717 (815) T PRK13873 650 AL--DDPVFAAQLREWLKTLRKKNASVIFATQSLA-DIDDSAIAPAIIESCPTRIFLPNERAIEPQIAAIY--------- 717 (815) T ss_pred HH--CCHHHHHHHHHHHHHHHHCCCEEEEECCCHH-HHHCCCHHHHHHHHCCEEEECCCCCCCCCHHHHHH--------- T ss_conf 72--8989999999999999870877999778889-98649668999986885698778222770068999--------- Q ss_pred CCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHCCCCCCC-CCCCC------CCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 8875477368983258883348988999999999712897421-10012------4556667788888777703899999 Q gi|254780799|r 676 GQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKTQGEAKYI-DIKDK------ILLNEEMRFSENSSVADDLYKQAVD 748 (806) Q Consensus 676 g~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q~~~~y~-~~~~~------~~~~~~~~~~~~~~~~d~l~~~a~~ 748 (806) ++-=.+|.||+-|.+- +.+-+|- ..... ....-.-...+ .++.++. ...-+ T Consensus 718 -----------------~~fGLn~rqieiIa~a---~pKRdYy~~s~~g~Rlf~L~L~~~~la~~~-~s~~~d~-~~~~~ 775 (815) T PRK13873 718 -----------------RRFGLNDRQIEILARA---TPKRDYYCQSRRGNRLFELGLGPVALAFCA-ASSKADQ-ALIDR 775 (815) T ss_pred -----------------HHCCCCHHHHHHHHHC---CCCCEEEEECCCCCEEEECCCCHHHHHEEE-CCCHHHH-HHHHH T ss_conf -----------------8749999999999733---746635898799877897389704210000-3888899-99999 Q ss_pred HHHHCCCE--EEEHHHHHHCCCHHHHHHHHHHHHHC Q ss_conf 99965985--00014222001177899999999977 Q gi|254780799|r 749 IVLRDNKA--SISYIQRRLGIGYNRAASIIENMEEK 782 (806) Q Consensus 749 ~v~~~~~~--s~s~lqr~~~igy~raar~~~~~e~~ 782 (806) ++.+.++. ...+|..| |-.=||-|++.++.. T Consensus 776 l~~~~g~~~~~~~wl~~~---g~~~a~~~~~~~~~~ 808 (815) T PRK13873 776 VLAEHGRDGFAAAWLRHR---GLDWAADLLPDLTNL 808 (815) T ss_pred HHHHHCCCHHHHHHHHHC---CCCHHHHHHHHHHHC T ss_conf 999819101599999965---983688876543111 No 21 >PRK13853 type IV secretion system protein VirB4; Provisional Probab=96.52 E-value=0.0054 Score=42.04 Aligned_cols=213 Identities=19% Similarity=0.296 Sum_probs=107.3 Q ss_pred CHHHCCCCCCCCEEEEECC--CCCEEEEEECCCCHHHHHHHHHHHHHHHHCCH-HHEEEEEECCCHH-HH------HHC- Q ss_conf 0021045666666785410--02023553047740679999999999982995-7847888523100-11------102- Q gi|254780799|r 434 LAINLGKSIEGKPIIADLA--RMPHLLIAGTTGSGKSVAINTMILSLLYRMTP-AQCRLIMIDPKML-EL------SVY- 502 (806) Q Consensus 434 L~iaLGKdI~G~pvv~DLa--kMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P-~evkliliDPK~v-El------s~Y- 502 (806) =++++-++.+|.|+..++- .--|-||-|.|||||||.+|.+++. +.|+-. ..-+.+..|=.+= |. ..| T Consensus 404 ~al~~~~T~~gtPf~fn~H~~d~GHt~I~G~TGsGKTtll~fL~aq-~~ky~~~~~~~~~~fDkd~s~~i~~~a~GG~y~ 482 (789) T PRK13853 404 NAIARFRTNGGTPFDYIPHEHDVGMTAIFGPIGRGKTTLMTFILAM-LEQSMVDRAGAVVFFDKDRGGELLVRATGGTYL 482 (789) T ss_pred CCCEEEECCCCCCEEEECCCCCCCCEEEECCCCCCHHHHHHHHHHH-HHHHHCCCCCEEEEEECCCCHHHHHHHHCCEEE T ss_conf 8753425499982586378788774488789999889999999999-997422357708999588638999998298776 Q ss_pred ---CCCHHHHCCCC--CC-HHHHHHHHHHHHHHHHHH---------H--------HHHH-HCCCCCHHHHHH-------- Q ss_conf ---77034312233--43-045668999999999999---------9--------9998-708996899999-------- Q gi|254780799|r 503 ---DGIPNLLTPVV--TN-PQKAVTVLKWLVCEMEER---------Y--------QKMS-KIGVRNIDGFNL-------- 550 (806) Q Consensus 503 ---~~iPHLl~pVv--Td-~~kA~~aL~w~V~EMe~R---------Y--------~l~a-~~~vRni~~yN~-------- 550 (806) +|-|-=+.|-- -| +..-+-+..|+..=.++. - .+|+ ...-|.+....+ T Consensus 483 ~i~~g~~tg~nPl~~l~~t~~~~~fl~~wl~~l~~~~g~~~lt~~~~~~i~~av~~~~~~~~~~r~ls~l~~~l~~~~~~ 562 (789) T PRK13853 483 ALRRGVPSGLAPLRGLENTAASRDFLREWIVALIESDGRGGISPEENRRLVRGIHRQLSFDPDMRSLAGLREFLLHGPAE 562 (789) T ss_pred ECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCHHHCCHHHHHHHHCCCCCC T ss_conf 12699956848765689986889999999999997259999999999999999999860883107898999984558821 Q ss_pred ----HHHHHHHCCCCCCCCCCCCCCCCCCCC-------CCCCC--------------------CCCCCCCCEEEEEHHHH Q ss_conf ----999887447866775446776545432-------22332--------------------23234698687763446 Q gi|254780799|r 551 ----KVAQYHNTGKKFNRTVQTGFDRKTGEA-------IYETE--------------------HFDFQHMPYIVVVIDEM 599 (806) Q Consensus 551 ----k~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~--------------------~~~~~~lp~ivviiDEl 599 (806) +.+.--..|+- .--+|.+++.- .++-. +.-++-=|. ++++||+ T Consensus 563 ~~~~rL~~w~~~g~~-----gwlfD~~~D~l~~~~~~~gfd~~~ll~~~~~~~pvl~YLf~rie~~ldGrp~-ii~lDE~ 636 (789) T PRK13853 563 GAGARLQRWCRGNAL-----GWAFDGEVDEVKLDPSITGFDMTHLLEYEEVCAPAAAYLLHRIGAMVDGRRF-VMSCDEF 636 (789) T ss_pred HHHHHHHHHHCCCCE-----EEECCCCCCCCCCCCCEEEEECHHHCCCCCHHHHHHHHHHHHHHHHHCCCCE-EEEEHHH T ss_conf 699999998549970-----4403686546588787799861785278512899999999999986069966-9994566 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC---CCCCCHHHHHCCCCEEEEE Q ss_conf 8888732100588999999866414237999965777---5355435541102515876 Q gi|254780799|r 600 ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS---VDVITGTIKANFPTRISFQ 655 (806) Q Consensus 600 aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs---vdvitg~ikan~p~riaf~ 655 (806) -.+.. -.-...-|.---.-.|-.+-.+|+|||.|+ --=|--.|.-|.||+|=+- T Consensus 637 w~~l~--~~~f~~~i~~wlkt~RK~N~~vv~aTQs~~d~~~s~i~~~i~e~~~T~I~LP 693 (789) T PRK13853 637 RAYLL--NPKFAAVVDKFLLTVRKNNGMLILATQQPEHVLESPLGASLVAQCMTKIFYP 693 (789) T ss_pred HHHHC--CHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHCCCHHHHHHHHCCEEEECC T ss_conf 67646--9999999999999998749089996489999863827899998688679888 No 22 >PRK10263 DNA translocase FtsK; Provisional Probab=96.50 E-value=0.058 Score=34.35 Aligned_cols=173 Identities=21% Similarity=0.248 Sum_probs=151.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC-CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 3112789999999999999999999999984427068963136886-450001346789999999999999999999999 Q gi|254780799|r 16 FLLSDWSKKKMKIVAGLILLCTVFAITLALGTWDVYDPSFSYITLR-SPKNFLGYGGAIFADVAIQFFGIASVFFLPPPT 94 (806) Q Consensus 16 ~~l~~f~~rrl~Ei~Gl~Li~~al~l~iSL~SYsp~DPs~~~~s~~-~v~N~~G~lGA~iAd~L~~lFG~~Ayllpl~Ll 94 (806) ..|++ .||+.|.+-|+..++++||.+||+||||+||||+.+.+. +|||++|.+|||+||.||+.||..||.+|++++ T Consensus 14 kkLS~--grRLlEa~LIvv~L~Ai~LmvALlSFnPSDPSWSQTAWhepIhNlGG~~GAWlADTLFf~FGvLAY~iP~im~ 91 (1355) T PRK10263 14 TKLSS--GRRLLEALLILIVLFAVWLMAALLSFNPSDPSWSQTAWHEPIHNLGGMPGAWLADTLFFIFGVMAYTIPVIIV 91 (1355) T ss_pred EECCC--CCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 74378--5039999999999999999999952688998843347786200157635189998999998999988999999 Q ss_pred HHHHHHHHCCC------HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999982765------236789999999999999999998504443223334324889999999999864689999999 Q gi|254780799|r 95 MWALSLLFDKK------IYCFSKRATAWLINILVSATFFASFSPSQSWPIQNGFGGIIGDLIIRLPFLFFESYPRKLGIL 168 (806) Q Consensus 95 ~~g~~ll~~k~------~~~~~~Rl~~~il~ll~~s~ll~~~~~~~~~~l~~g~GGiiG~~l~~~l~~~lg~~g~~l~ll 168 (806) ..+|..||++. ...+..|++|.+.+++.+|.+ +.+++...|++. +||++|.++++.+..+|+.+|+++.++ T Consensus 92 ~~cW~~fR~r~~~~~iDyf~~slRLiG~LaLiLTSCGL-AalN~DDi~YFa--SGGVIGsllSsa~lP~lN~lGaTL~LL 168 (1355) T PRK10263 92 GGCWFAWRHQSSDEYIDYFAVSLRIIGVLALILTSCGL-AAINADDIWYFA--SGGVIGSLLSTTLQPLLHSSGGTIALL 168 (1355) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHH-HHCCCCCCCCCC--CCCEEHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999998615676600899999999999999970046-541741111025--778700877776887663063899999 Q ss_pred HH---HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99---99999999999999998886454 Q gi|254780799|r 169 FF---QMILFLAMSWLLIYSSSAIFQGK 193 (806) Q Consensus 169 i~---~iil~l~is~l~i~~~~~~~~~~ 193 (806) |+ |++||+++||+++.++++..... T Consensus 169 ~~WaaGlTLFTGwSWLtIaEkiG~~vL~ 196 (1355) T PRK10263 169 CVWAAGLTLFTGWSWVTIAEKLGGWILN 196 (1355) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 9998754653055199999998189999 No 23 >PRK00411 cdc6 cell division control protein 6; Reviewed Probab=96.48 E-value=0.015 Score=38.73 Aligned_cols=41 Identities=22% Similarity=0.427 Sum_probs=31.0 Q ss_pred CCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCC Q ss_conf 0202355304774067999999999998299578478885231 Q gi|254780799|r 453 RMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPK 495 (806) Q Consensus 453 kMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK 495 (806) .-++++|-|.||+||+++++.+.-.+ +.....++++-|+-+ T Consensus 54 ~~~n~~I~G~pGTGKT~~vk~v~~~l--~~~~~~~~~vyINc~ 94 (394) T PRK00411 54 RPSNVLILGPPGTGKTTTVKKVFEEL--EEAALKVVYVYINCQ 94 (394) T ss_pred CCCCEEEECCCCCCHHHHHHHHHHHH--HHHCCCCEEEEEECC T ss_conf 99847998899998999999999999--974689659999696 No 24 >TIGR02746 TraC-F-type type-IV secretion system protein TraC; InterPro: IPR014117 The proteins in this entry are found in the F, P and I-like type IV secretion systems. Gene symbols include TraC (F-type), TrbE/VirB4 (P-type) and TraU (I-type). The proteins contain the Walker A and B motifs and so are putative nucleotide triphosphatases.. Probab=96.38 E-value=0.0027 Score=44.26 Aligned_cols=333 Identities=18% Similarity=0.190 Sum_probs=189.5 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCC---CCC---CCC--- Q ss_conf 25665457899999974248632899841044244444321478639999978899998863001---210---003--- Q gi|254780799|r 328 SPKVMQNNACTLKSVLSDFGIQGEIVNVRPGPVITLYELEPAPGIKSSRIIGLSDDIARSMSAIS---ARV---AVI--- 398 (806) Q Consensus 328 s~eeL~~nA~lLE~tL~dFGVe~~Vv~v~pGPVVTrYEi~PApGVKvSKI~nLadDIA~aLsa~s---vRI---apI--- 398 (806) +++.+.+..+.+.+.++.=|.+=.-......|.|.+==+=..+. .+|=+|+...-.... .-+ .|| T Consensus 375 ~~~~~~~~~~~a~~~Fr~~gF~L~~~~~~~~~~vfL~~LPf~~~------~glf~~~~~~~~~~~~~~~N~~~l~Pl~ad 448 (900) T TIGR02746 375 NPDNLRRDVEAAKSSFRKNGFDLAEDKRIQLQAVFLAALPFALT------EGLFKDLKKAGRVRTLSSANAVNLLPLIAD 448 (900) T ss_pred CHHHHHHHHHHHHHHHHHCCCEECCHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHCCCCCHHHHHHHCCCEEEE T ss_conf 88899999999999998588200305421237899973035631------478898765221343045538661742140 Q ss_pred -CC-------C-CEEEEEECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHH Q ss_conf -88-------6-12555302456623873423072134302210002104566666678541002023553047740679 Q gi|254780799|r 399 -PR-------R-NAIGIELPNDIRETVMLRDLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSV 469 (806) Q Consensus 399 -PG-------K-~~VGIEiPN~~r~~V~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV 469 (806) +| . ..=|+=+|-.+-|.-+| +++.+.- . .|.|-.+ .|.|+||||||| T Consensus 449 w~Gnnmedrl~~~~~g~l~ptrR~Ql~~~-d~F~~~~-----~--------~i~NYN~----------~~~g~sG~GKSf 504 (900) T TIGR02746 449 WKGNNMEDRLTLKGSGLLLPTRRGQLSAF-DPFDSDL-----L--------NITNYNI----------AVVGGSGAGKSF 504 (900) T ss_pred CCCCCHHHHHCCCCCEEEEEECCCEEEEE-EECCCCC-----C--------CCCCCEE----------EEECCCCCCHHH T ss_conf 26885012204787727752135358778-3033786-----6--------7677148----------997288987168 Q ss_pred HHHHHHHHHH--------------HHCCHHHEEEEEECCC---HHHHHHCC---------CCHHHHCCCCCCHH----HH Q ss_conf 9999999999--------------8299578478885231---00111027---------70343122334304----56 Q gi|254780799|r 470 AINTMILSLL--------------YRMTPAQCRLIMIDPK---MLELSVYD---------GIPNLLTPVVTNPQ----KA 519 (806) Q Consensus 470 ~iN~iI~SlL--------------yk~~P~evkliliDPK---~vEls~Y~---------~iPHLl~pVvTd~~----kA 519 (806) .+|-+|.+.| ||.-=+-+.=-+||=. -+=|.++- ++--.=.-.||+.+ +. T Consensus 505 ~~Q~~~~~~L~~G~kv~viDvG~Sy~KLC~~lgG~Yie~~~e~~i~LNPFTHL~i~~A~~~~~~~~~~~v~~i~Geqv~~ 584 (900) T TIGR02746 505 FMQELIVSVLSRGGKVWVIDVGRSYKKLCEMLGGTYIEFSPESTICLNPFTHLDIEEARFDIIDFDGNNVTEIDGEQVDE 584 (900) T ss_pred HHHHHHHHHHHCCCEEEEEEECCCHHHHHHHHCCCEECCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCHHHHH T ss_conf 79899999851798489996078848887651871210466776323874446600033322673634677567402889 Q ss_pred HHHHHHHHHHHH----HHHHHHHHCCCCCHHHHHHHHHHH-----HHCCCC---------------------CC---CCC Q ss_conf 689999999999----999999870899689999999988-----744786---------------------67---754 Q gi|254780799|r 520 VTVLKWLVCEME----ERYQKMSKIGVRNIDGFNLKVAQY-----HNTGKK---------------------FN---RTV 566 (806) Q Consensus 520 ~~aL~w~V~EMe----~RY~l~a~~~vRni~~yN~k~~~~-----~~~~~~---------------------~~---~~~ 566 (806) ..-|+-.+.-|= .=+..|++..- =++-|-.-+.+| +..|.. -. .+. T Consensus 585 ~~~i~~~ia~MAGlDl~P~~~l~~~q~-lmaryCs~l~~AI~~~W~~~g~~a~~tdV~~~L~~~~~e~~~~~~DGir~r~ 663 (900) T TIGR02746 585 IDQITALIALMAGLDLSPEGGLSDEQE-LMARYCSLLEEAILAAWKEKGNEATITDVADALEQLRLEEEDSENDGIRPRI 663 (900) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCHHHH-HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCHHHH T ss_conf 999999999884466684556788889-9999999999999999997122488489999998521331345688601689 Q ss_pred CC----------------CCCCCC------CCCCCC----CCC-CCC---------------------CCCCEEEEEHHH Q ss_conf 46----------------776545------432223----322-323---------------------469868776344 Q gi|254780799|r 567 QT----------------GFDRKT------GEAIYE----TEH-FDF---------------------QHMPYIVVVIDE 598 (806) Q Consensus 567 ~~----------------~~~~~~------~~~~~~----~~~-~~~---------------------~~lp~ivviiDE 598 (806) +. -|++++ .--..| +.. |.+ .+.|- +||||| T Consensus 664 ~~l~~~L~~Yt~~G~YG~~Fn~~~~l~~~~~~vvLEL~eL~~~Dp~l~~~Vl~~L~~~I~~~my~~g~R~~rK-~~~iDE 742 (900) T TIGR02746 664 KDLGKLLNPYTSNGVYGRYFNGPNNLDFSNRFVVLELEELEDRDPELQAVVLFSLMVRITGEMYLTGDRKRRK-ILIIDE 742 (900) T ss_pred HHHHHHHHHHHCCCCCEEECCCCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCE-EEEEHH T ss_conf 9999873264248712011287775678987688512257577856779999999999987763058897551-676442 Q ss_pred HHHHHH-HCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHH--HHCCCCEEEEEECCCCCCHHHCCCC--CHHH Q ss_conf 688887-3210058899999986641423799996577753554355--4110251587645866421233886--4578 Q gi|254780799|r 599 MADLMM-VARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTI--KANFPTRISFQVSSKIDSRTILGEQ--GAEQ 673 (806) Q Consensus 599 laDlmm-~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~i--kan~p~riaf~v~s~~dSrtild~~--gae~ 673 (806) -=+||- -+-...-.-|=++|=.||==|==+|+.||-= -=.- |||--+|-||.=| |=-.||-|. +=++ T Consensus 743 AW~Ll~~g~~~~~~~FIE~gyRr~RK~~Ga~~tiTQ~~-----~D~~~dka~~~arA~yaNS---~~~iiL~Q~~~~~~~ 814 (900) T TIGR02746 743 AWSLLDDGANPQAANFIETGYRRARKYGGAFITITQGI-----EDFYSDKASPEARAAYANS---DWKIILRQSAESIEK 814 (900) T ss_pred HHHHHHCCCCCHHHHHHHHHHHCCHHHCCCEEEEECCE-----EHHHCCCCCHHHHHHHHCC---CEEEEECCCHHHHHH T ss_conf 48987427851358988876320111164368996440-----0100034798899988453---300412788647899 Q ss_pred HCCCCCEEEECCCC----CEEEEEECCCCHH Q ss_conf 65887547736898----3258883348988 Q gi|254780799|r 674 LLGQGDMLYMTGGG----RVQRIHGPFVSDI 700 (806) Q Consensus 674 Llg~gdml~~~~~~----~~~r~~g~~v~~~ 700 (806) |.-.+--=|-|.+- +....+|++-|+- T Consensus 815 ~~~~~~~~~~~~~~~~i~sl~~~~~~GfSe~ 845 (900) T TIGR02746 815 LKAENPLQFSPFEKRLIKSLRTAKGAGFSEL 845 (900) T ss_pred HHHCCCCCCCHHHHHHHHHCCCCCCCHHHHH T ss_conf 9963898889889999972253566306766 No 25 >cd01131 PilT Pilus retraction ATPase PilT. PilT is a nucleotide binding protein responsible for the retraction of type IV pili, likely by pili disassembly. This retraction provides the force required for travel of bacteria in low water environments by a mechanism known as twitching motility. Probab=96.36 E-value=0.0088 Score=40.45 Aligned_cols=38 Identities=34% Similarity=0.591 Sum_probs=26.6 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEE-ECCCHHHH Q ss_conf 35530477406799999999999829957847888-52310011 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIM-IDPKMLEL 499 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evklil-iDPK~vEl 499 (806) .||+|.||||||--+++++--+ ......+.+- =|| +|+ T Consensus 4 iLitG~TGSGKTTtl~all~~i---~~~~~~~IiTiEDP--iE~ 42 (198) T cd01131 4 VLVTGPTGSGKSTTLAAMIDYI---NKNKTHHILTIEDP--IEF 42 (198) T ss_pred EEEECCCCCCHHHHHHHHHHHH---CCCCCCEEEEECCC--HHH T ss_conf 9998999997999999999853---63788369996473--775 No 26 >PRK13891 conjugal transfer protein TrbE; Provisional Probab=96.30 E-value=0.0053 Score=42.09 Aligned_cols=303 Identities=15% Similarity=0.226 Sum_probs=143.9 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHC-------CCC----- Q ss_conf 32566545789999997424863289984104424444432147863999997889999886300-------121----- Q gi|254780799|r 327 FSPKVMQNNACTLKSVLSDFGIQGEIVNVRPGPVITLYELEPAPGIKSSRIIGLSDDIARSMSAI-------SAR----- 394 (806) Q Consensus 327 ~s~eeL~~nA~lLE~tL~dFGVe~~Vv~v~pGPVVTrYEi~PApGVKvSKI~nLadDIA~aLsa~-------svR----- 394 (806) .|++.+.+ +.-++..|.+-+=+ .|.--..-++|+.|.=.|+ +...-++.+..++... .+- T Consensus 347 ~d~dA~~~-~~d~d~Al~el~sg-~v~~Gy~t~tv~v~~~~~~------~l~~~~~~v~~~l~~~GF~~~~Et~n~~~A~ 418 (852) T PRK13891 347 VNQDAASM-VGDAEAAIAEVNSG-LVAAGYYTSVVVLMDEDRE------RLEASALLVEKAINRLAFAARIETINTMDAY 418 (852) T ss_pred CCHHHHHH-HHHHHHHHHHHCCC-CEEEEEEEEEEEEECCCHH------HHHHHHHHHHHHHHHCCCEEEEEHHCCHHHH T ss_conf 46489999-99999999986089-7526888889999789999------9999999999999868967986110239999 Q ss_pred CCCCCCCCEEEEEECCCCCCEE---------EEHHHCCCHHHHH----CCCCCHHHCCCCCCCCEEEEECC--CCCEEEE Q ss_conf 0003886125553024566238---------7342307213430----22100021045666666785410--0202355 Q gi|254780799|r 395 VAVIPRRNAIGIELPNDIRETV---------MLRDLIVSRVFEK----NQCDLAINLGKSIEGKPIIADLA--RMPHLLI 459 (806) Q Consensus 395 IapIPGK~~VGIEiPN~~r~~V---------~lreil~s~~f~~----s~~~L~iaLGKdI~G~pvv~DLa--kMPHLLI 459 (806) -+.+||-.+- |..+-+| .|+.+..-+.... ....=|+...|+-+|.|+..++- .--|-|| T Consensus 419 lgsLPGn~~~-----nvR~~~isT~NlA~l~pl~~vw~G~~~~p~~~~~~~~pal~~~~T~g~tPf~fN~Hv~DvGHTlI 493 (852) T PRK13891 419 LGSLPGHGVE-----NVRRPLINTMNLADLLPTSSIWTGSATAPCPMYPPLSPALMHCVTVGATPFRLNLHVRDLGHTFM 493 (852) T ss_pred HHCCCCCCCC-----CCCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCEEEECCCCCCCCEEE T ss_conf 8619997667-----75636666477775641034567866798866899987418874489984478545688664078 Q ss_pred EECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHH--------HH------HHCCCC-----HHHHCC--CCCCHHH Q ss_conf 30477406799999999999829957847888523100--------11------102770-----343122--3343045 Q gi|254780799|r 460 AGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKML--------EL------SVYDGI-----PNLLTP--VVTNPQK 518 (806) Q Consensus 460 AGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~v--------El------s~Y~~i-----PHLl~p--VvTd~~k 518 (806) -|.|||||||.+|.|.+... |+.+.+ .+-.|=.+- |. ..|-+| |-=+.| .+-++.. T Consensus 494 iGpTGaGKTvll~fL~aQ~~-rY~~~~--vf~FDK~~S~~~~~~g~~~~~~a~GG~~~~l~~~~~~~gfnPL~~~dt~~~ 570 (852) T PRK13891 494 FGPTGAGKSTHLGIIAAQLR-RYAGMS--IYAFDKGMSMYPLAAGIRAATKGKSGLHFTVAADDDRLAFCPLQFLETKGD 570 (852) T ss_pred ECCCCCCHHHHHHHHHHHHH-HCCCCC--EEEECCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCHHH T ss_conf 78999988999999999974-418981--898789852343441246889853981763068999878687868997145 Q ss_pred HHHHHHHHHH------------HHHHHH---HHHHHCCCCCHHHHHHHH-----HHHHH---CCCCCCCCCCCCCCCCCC Q ss_conf 6689999999------------999999---999870899689999999-----98874---478667754467765454 Q gi|254780799|r 519 AVTVLKWLVC------------EMEERY---QKMSKIGVRNIDGFNLKV-----AQYHN---TGKKFNRTVQTGFDRKTG 575 (806) Q Consensus 519 A~~aL~w~V~------------EMe~RY---~l~a~~~vRni~~yN~k~-----~~~~~---~~~~~~~~~~~~~~~~~~ 575 (806) -+-+..|+.. +.+.-+ .-|++.+-|.+.+|..-+ +++.. .|.+. .--+|.+++ T Consensus 571 r~~~~~wl~~ll~~~g~~~tp~~~~~I~~Av~sl~~~~~rtLs~f~~~lq~~~l~~aL~~w~~~G~~----G~lfDn~~D 646 (852) T PRK13891 571 RAWAMEWIDTILALNGVNTTPAQRNEIGNAIMSMHASGARTLSEFSVTIQDEAIREAIKQYTVDGSM----GHLLDAEED 646 (852) T ss_pred HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCHHHHHHHHHHHCCCCC----EECCCCCCC T ss_conf 7999999999997289889989999999999987616460799998763988899999998469980----603788766 Q ss_pred CC------CCCCCCC---------------------CCCCCCEEEEEHHHHHHHHHHCCCH-HHHHHHHHHHHHHHCCEE Q ss_conf 32------2233223---------------------2346986877634468888732100-588999999866414237 Q gi|254780799|r 576 EA------IYETEHF---------------------DFQHMPYIVVVIDEMADLMMVARKD-IESAVQRLAQMARASGIH 627 (806) Q Consensus 576 ~~------~~~~~~~---------------------~~~~lp~ivviiDElaDlmm~~~~~-ve~~i~rlaq~ara~GiH 627 (806) .- .+|.+.. -++-=| -++++||.=-+. +.+ ...-|-------|-.--. T Consensus 647 ~l~~s~~~~Fe~~~l~~~~~~~~~Pvl~YLFhRIe~~lDGrp-tli~lDEaW~~L---~dp~F~~~i~~wLKTlRKkN~~ 722 (852) T PRK13891 647 GLSLSDFTVFEIEELMNLGEKFALPVLLYLFRRIERALKGQP-SVIILDEAWLML---GHPAFRAKIREWLKVLRKANCL 722 (852) T ss_pred CCCCCCCEEEECHHHCCCCCHHHHHHHHHHHHHHHHHHCCCC-EEEEEHHHHHHH---CCHHHHHHHHHHHHHHHHCCCE T ss_conf 767676458825013026815589999999999998736992-699952367761---8999999999999999871847 Q ss_pred EEEEECCCCCCCC----CHHHHHCCCCEEEE Q ss_conf 9999657775355----43554110251587 Q gi|254780799|r 628 VIMATQRPSVDVI----TGTIKANFPTRISF 654 (806) Q Consensus 628 li~aTqrPsvdvi----tg~ikan~p~riaf 654 (806) +|+|||.|+ |+. --.|.-+.||||=+ T Consensus 723 VvfATQSl~-Di~~S~I~~aiiEqc~T~IfL 752 (852) T PRK13891 723 VLMATQSLS-DAANSGILDVIVESTATKIFL 752 (852) T ss_pred EEEECCCHH-HHHCCCHHHHHHHHCCEEEEC T ss_conf 999768888-985385689999858856876 No 27 >PRK13833 conjugal transfer protein TrbB; Provisional Probab=96.27 E-value=0.0057 Score=41.87 Aligned_cols=90 Identities=24% Similarity=0.397 Sum_probs=54.4 Q ss_pred CCCCCCEEEEEECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHH Q ss_conf 03886125553024566238734230721343022100021045666666785410020235530477406799999999 Q gi|254780799|r 397 VIPRRNAIGIELPNDIRETVMLRDLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMIL 476 (806) Q Consensus 397 pIPGK~~VGIEiPN~~r~~V~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~ 476 (806) |+.+.+++-|--|...+ ..|-|++++..+......+ |-.-|.+ + =-.||+|.|||||.-.+|+++. T Consensus 101 P~~~g~sitIRk~~~~~--~tl~dlv~~g~~t~~~~~~---L~~aV~~--------r-~nilVsGgTGSGKTTllnaL~~ 166 (323) T PRK13833 101 PVVSGPAFTIRRRASRL--IPLDDYVRSKVMTEAQAST---IRSAISS--------R-LNIVISGGTGSGKTTLANAVIA 166 (323) T ss_pred CCCCCCEEEEECCCCCC--CCHHHHHHCCCCCHHHHHH---HHHHHHC--------C-CCEEEECCCCCCHHHHHHHHHH T ss_conf 86799529996878999--9999998769999999999---9999981--------8-9689991777756899999998 Q ss_pred HHHHHCCHHHEEEEEECCCHHHHHHCC Q ss_conf 999829957847888523100111027 Q gi|254780799|r 477 SLLYRMTPAQCRLIMIDPKMLELSVYD 503 (806) Q Consensus 477 SlLyk~~P~evkliliDPK~vEls~Y~ 503 (806) .+ .++.|+| |++.|-=- -||..+. T Consensus 167 ~i-~~~~p~e-RivtIEDt-~EL~~~~ 190 (323) T PRK13833 167 EI-VASAPED-RLVILEDT-AEIQCAA 190 (323) T ss_pred HH-CCCCCHH-EEEEECCC-CCCCCCC T ss_conf 64-0289323-39994575-0114678 No 28 >PRK00440 rfc replication factor C small subunit; Reviewed Probab=96.16 E-value=0.016 Score=38.57 Aligned_cols=31 Identities=39% Similarity=0.562 Sum_probs=27.2 Q ss_pred CCCCCEEEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 1002023553047740679999999999982 Q gi|254780799|r 451 LARMPHLLIAGTTGSGKSVAINTMILSLLYR 481 (806) Q Consensus 451 LakMPHLLIAGtTGSGKSV~iN~iI~SlLyk 481 (806) -.++||||+.|..|+||..+..++.-.+... T Consensus 34 ~~~~phlLf~GppG~GKTt~a~~la~~l~~~ 64 (318) T PRK00440 34 EKNMPHLLFAGPPGTGKTTAALALARELYGE 64 (318) T ss_pred CCCCCEEEEECCCCCCHHHHHHHHHHHHCCC T ss_conf 7998669888959988999999999997698 No 29 >PRK13830 conjugal transfer protein TrbE; Provisional Probab=96.06 E-value=0.0085 Score=40.58 Aligned_cols=212 Identities=19% Similarity=0.282 Sum_probs=108.9 Q ss_pred HHCCCCCCCC-EEEEECC--CCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCH-HHH------HHCCCC Q ss_conf 2104566666-6785410--02023553047740679999999999982995784788852310-011------102770 Q gi|254780799|r 436 INLGKSIEGK-PIIADLA--RMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKM-LEL------SVYDGI 505 (806) Q Consensus 436 iaLGKdI~G~-pvv~DLa--kMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~-vEl------s~Y~~i 505 (806) .+|-+..+|. |+..|+- .--|-||-|.|||||||.+|.| ++-..|+.+ .|.+..|=.+ .|. ..|-+| T Consensus 435 p~l~~t~sG~TPf~fN~H~~dvGHtlIiGpTGsGKTvll~fl-~aq~~ky~~--~~vf~FDKd~s~~i~~~a~GG~y~~l 511 (818) T PRK13830 435 PPLMQVASGSTPFRLNLHVDDVGHTLIFGPTGSGKSTLLALI-AAQFRRYAG--AQIFAFDKGRSMLPLTLAAGGDHYEI 511 (818) T ss_pred CCEEEECCCCCCEEEECCCCCCCEEEEECCCCCCHHHHHHHH-HHHHHHCCC--CEEEEECCCCCHHHHHHHHCCEEEEE T ss_conf 614861489975356278898650589899999889999999-999864279--83899748876899999809925873 Q ss_pred ---------HHHHCC--CCCCHHHHHHHHHHHHHH---------------HHHHHHHHHHCCCCCHHHHHH-----HHHH Q ss_conf ---------343122--334304566899999999---------------999999998708996899999-----9998 Q gi|254780799|r 506 ---------PNLLTP--VVTNPQKAVTVLKWLVCE---------------MEERYQKMSKIGVRNIDGFNL-----KVAQ 554 (806) Q Consensus 506 ---------PHLl~p--VvTd~~kA~~aL~w~V~E---------------Me~RY~l~a~~~vRni~~yN~-----k~~~ 554 (806) |-=+.| -+.++..-+-+-.|+..= ..+--..+++..-|.+..+-. .+++ T Consensus 512 ~~~~~~~~~~~~f~Pl~~~~t~~~~~fl~~wl~~l~~~~g~~~tp~~~~~i~~a~~~~~~~~~r~l~~~~~~~~~~~l~~ 591 (818) T PRK13830 512 GGDDAEEGRALAFCPLSELSTDADRAWAAEWIETLVALQGVTITPDHRNAISRQIGLMASARGRSLSDFVSGVQMREIKD 591 (818) T ss_pred CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCHHHHH T ss_conf 36777788877879888899857779999999999984798789999999999998510365580988864357578999 Q ss_pred HHHCCCCCCCCCCCCCCCCCCCC------CCCCCC----------C-----------CCCCCCEEEEEHHHHHHHHHHCC Q ss_conf 87447866775446776545432------223322----------3-----------23469868776344688887321 Q gi|254780799|r 555 YHNTGKKFNRTVQTGFDRKTGEA------IYETEH----------F-----------DFQHMPYIVVVIDEMADLMMVAR 607 (806) Q Consensus 555 ~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~----------~-----------~~~~lp~ivviiDElaDlmm~~~ 607 (806) +... -....+..--+|.+.+.- .++-+. | -++-=| .++++||+--+. . T Consensus 592 ~L~~-w~~~G~~g~~fD~~~D~l~~~~~~~fe~~~ll~~~~~~~~Pvl~Ylf~rie~~lDGrp-~li~lDE~W~~L---~ 666 (818) T PRK13830 592 ALHH-YTVDGPMGQLLDAEEDGLTLGAFQTFEIEQLMNMGERNLVPVLTYLFRRIEKRLTGAP-SLIILDEAWLML---G 666 (818) T ss_pred HHHH-HHCCCCCEECCCCCCCCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC-EEEEHHHHHHHH---C T ss_conf 9998-7069972504478766757566417855300447742079999999999997507991-799767648761---8 Q ss_pred CHH-HHHHHHHHHHHHHCCEEEEEEECCCCCCC----CCHHHHHCCCCEEEEEE Q ss_conf 005-88999999866414237999965777535----54355411025158764 Q gi|254780799|r 608 KDI-ESAVQRLAQMARASGIHVIMATQRPSVDV----ITGTIKANFPTRISFQV 656 (806) Q Consensus 608 ~~v-e~~i~rlaq~ara~GiHli~aTqrPsvdv----itg~ikan~p~riaf~v 656 (806) .++ .+-|.------|-..-.+|+|||.|+ |+ |...|.-|.||+|=+-- T Consensus 667 ~~~F~~~i~~wlkt~RK~N~~vv~aTQS~~-d~~~S~I~~~iveq~~T~IfLPN 719 (818) T PRK13830 667 HPVFRDKIREWLKVLRKANCAVVLATQSIS-DAERSGIIDVLKESCPTKICLPN 719 (818) T ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECCCHH-HHHCCCHHHHHHHHCCEEEECCC T ss_conf 989999999999999863977999778889-98659648999986886698668 No 30 >pfam02534 TraG TraG/TraD family. These proteins contain a P-loop and walker-B site for nucleotide binding. TraG is essential for DNA transfer in bacterial conjugation. These proteins are thought to mediate interactions between the DNA-processing (Dtr) and the mating pair formation (Mpf) systems. Probab=95.99 E-value=0.0065 Score=41.44 Aligned_cols=64 Identities=23% Similarity=0.455 Sum_probs=42.2 Q ss_pred CCCE-EEEEHHHHHHH--HHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHH Q ss_conf 6986-87763446888--87321005889999998664142379999657775355435541102515876458664212 Q gi|254780799|r 588 HMPY-IVVVIDEMADL--MMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRT 664 (806) Q Consensus 588 ~lp~-ivviiDElaDl--mm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrt 664 (806) ++|+ +..+.||||-| | .+++..+. ..|--||++.+-.|--+ |. +- T Consensus 299 ~~~~~vlflLDEF~nlg~i----~~l~~~l~----~~rg~gi~~~~i~Qs~~----------------------QL--~~ 346 (468) T pfam02534 299 GRPKEVLFLLDEFPSLGKL----EILEEALA----TIRGYGIRLTTICQSIS----------------------QL--KE 346 (468) T ss_pred CCCEEEEEEEECHHCCCCH----HHHHHHHH----HHHHCCCEEEEEEECHH----------------------HH--HH T ss_conf 9751479999821114877----89999999----88617969999996199----------------------99--98 Q ss_pred HCCCCCHHHHCCC-CCEEEE Q ss_conf 3388645786588-754773 Q gi|254780799|r 665 ILGEQGAEQLLGQ-GDMLYM 683 (806) Q Consensus 665 ild~~gae~Llg~-gdml~~ 683 (806) +-++.||+.++|+ +.-+|+ T Consensus 347 ~YG~~~a~til~n~~~~~~f 366 (468) T pfam02534 347 TYGGEGAKSILSNHAVVICF 366 (468) T ss_pred HHCCCCHHHHHHCCCEEEEE T ss_conf 85841278898548738999 No 31 >pfam03796 DnaB_C DnaB-like helicase C terminal domain. The hexameric helicase DnaB unwinds the DNA duplex at the Escherichia coli chromosome replication fork. Although the mechanism by which DnaB both couples ATP hydrolysis to translocation along DNA and denatures the duplex is unknown, a change in the quaternary structure of the protein involving dimerization of the N-terminal domain has been observed and may occur during the enzymatic cycle. This C-terminal domain contains an ATP-binding site and is therefore probably the site of ATP hydrolysis. Probab=95.94 E-value=0.14 Score=31.37 Aligned_cols=41 Identities=29% Similarity=0.424 Sum_probs=30.5 Q ss_pred EEEHHHHHHHHHHCCC--------HHHHHHHHHHHHHHHCCEEEEEEECC Q ss_conf 7763446888873210--------05889999998664142379999657 Q gi|254780799|r 593 VVVIDEMADLMMVARK--------DIESAVQRLAQMARASGIHVIMATQR 634 (806) Q Consensus 593 vviiDElaDlmm~~~~--------~ve~~i~rlaq~ara~GiHli~aTqr 634 (806) +||||=+ +||...+. +|..-..+|.++|+--+|..|.++|= T Consensus 132 ~vvvDyl-~l~~~~~~~~~~~r~~~v~~i~~~Lk~lA~e~~i~ii~~sQl 180 (186) T pfam03796 132 LIVIDYL-QLMQGSKASKSENRQQEISEISRSLKALAKELNIPVIALSQL 180 (186) T ss_pred EEEEEHH-HHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECC T ss_conf 8997489-863677888775599999999999999999979918997225 No 32 >PRK13851 type IV secretion system protein VirB11; Provisional Probab=95.68 E-value=0.0078 Score=40.86 Aligned_cols=40 Identities=45% Similarity=0.728 Sum_probs=33.0 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHH Q ss_conf 2355304774067999999999998299578478885231001110 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSV 501 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~ 501 (806) -.||+|.|||||.-.+|+++..+ |.+-|+|-|-= --||.+ T Consensus 164 NIlIsGgTGSGKTTllnALl~~I-----P~~eRIvtIED-t~EL~l 203 (343) T PRK13851 164 TMLLCGPTGSGKTTMSKTLISAI-----PPQERLITIED-TLELVI 203 (343) T ss_pred CEEEECCCCCCHHHHHHHHHHCC-----CCCCCEEEECC-CCCCCC T ss_conf 89998889861999999999628-----96552799611-500258 No 33 >cd03271 ABC_UvrA_II The excision repair protein UvrA domain II; Nucleotide excision repair in eubacteria is a process that repairs DNA damage by the removal of a 12-13-mer oligonucleotide containing the lesion. Recognition and cleavage of the damaged DNA is a multistep ATP-dependent reaction that requires the UvrA, UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases, with UvrA having two ATP binding sites, which have the characteristic signature of the family of ABC proteins and UvrB having one ATP binding site that is structurally related to that of helicases. Probab=95.66 E-value=0.21 Score=30.12 Aligned_cols=31 Identities=32% Similarity=0.399 Sum_probs=22.0 Q ss_pred CCCCCEEEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 1002023553047740679999999999982 Q gi|254780799|r 451 LARMPHLLIAGTTGSGKSVAINTMILSLLYR 481 (806) Q Consensus 451 LakMPHLLIAGtTGSGKSV~iN~iI~SlLyk 481 (806) +.+==...|-|..|||||--+|+++...+.+ T Consensus 18 i~~G~~~aIiG~sGsGKSTLl~~~L~~~~~~ 48 (261) T cd03271 18 IPLGVLTCVTGVSGSGKSSLINDTLYPALAR 48 (261) T ss_pred ECCCCEEEEECCCCCCHHHHHHHHHHHHHHH T ss_conf 8899999998799986999999998885410 No 34 >cd01130 VirB11-like_ATPase Type IV secretory pathway component VirB11, and related ATPases. The homohexamer, VirB11 is one of eleven Vir proteins, which are required for T-pilus biogenesis and virulence in the transfer of T-DNA from the Ti (tumor-inducing) plasmid of bacterial to plant cells. The pilus is a fibrous cell surface organelle, which mediates adhesion between bacteria during conjugative transfer or between bacteria and host eukaryotic cells during infection. VirB11- related ATPases include the archaeal flagella biosynthesis protein and the pilus assembly proteins CpaF/TadA and TrbB. This alignment contains the C-terminal domain, which is the ATPase. Probab=95.62 E-value=0.01 Score=40.04 Aligned_cols=51 Identities=33% Similarity=0.544 Sum_probs=35.2 Q ss_pred CEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHH-CCCCHHHHCC Q ss_conf 02355304774067999999999998299578478885231001110-2770343122 Q gi|254780799|r 455 PHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSV-YDGIPNLLTP 511 (806) Q Consensus 455 PHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~-Y~~iPHLl~p 511 (806) =.+||+|.|||||+-++|+++-.+ |.+.|.+.|.- -.||.. +.+..||.+. T Consensus 26 ~nIlIsG~tGSGKTTll~al~~~i-----~~~~rivtiEd-~~El~l~~~~~v~l~~~ 77 (186) T cd01130 26 KNILISGGTGSGKTTLLNALLAFI-----PPDERIITIED-TAELQLPHPNWVRLVTR 77 (186) T ss_pred CCEEEECCCCCCHHHHHHHHHHHC-----CCCCCEEEECC-CHHHCCCCCCEEEEEEE T ss_conf 989998999998999999999613-----34564598415-35404777756888860 No 35 >PRK13900 type IV secretion system ATPase VirB11; Provisional Probab=95.62 E-value=0.0093 Score=40.27 Aligned_cols=50 Identities=28% Similarity=0.424 Sum_probs=38.1 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCC--CCHHHHCC Q ss_conf 235530477406799999999999829957847888523100111027--70343122 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYD--GIPNLLTP 511 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~--~iPHLl~p 511 (806) -+||||.|||||.-.+|+++. +-|.+.|+|.|-- -.||.+.+ ..-||+++ T Consensus 162 NilI~G~TgSGKTTll~aL~~-----~ip~~eRiitIED-t~EL~l~~~pn~v~l~~~ 213 (332) T PRK13900 162 NIIISGGTSTGKTTFTNAALR-----EIPAIERLITVED-AREIVLSSHPNRVHLLAS 213 (332) T ss_pred CEEEECCCCCCHHHHHHHHHH-----CCCCCCCEEEECC-CHHHCCCCCCCEEEEEEC T ss_conf 199988889889999999983-----5895353566314-066356668888999971 No 36 >PRK06871 DNA polymerase III subunit delta'; Validated Probab=95.51 E-value=0.056 Score=34.44 Aligned_cols=180 Identities=21% Similarity=0.286 Sum_probs=96.4 Q ss_pred CCCCE-EEEEECCCCHHHHHHHHHHHHHHHHCCHHH--------EEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHH Q ss_conf 00202-355304774067999999999998299578--------478885231001110277034312233430456689 Q gi|254780799|r 452 ARMPH-LLIAGTTGSGKSVAINTMILSLLYRMTPAQ--------CRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTV 522 (806) Q Consensus 452 akMPH-LLIAGtTGSGKSV~iN~iI~SlLyk~~P~e--------vkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~a 522 (806) .++|| +|+.|..|.||+-....+.-.||-...-++ |+++ +-...-|+ |.+.|. |.| T Consensus 20 ~r~~HA~L~~G~~G~Gk~~la~~~a~~llC~~~~~~~~Cg~C~sC~l~-------~~g~HPD~-~~i~~~--~~k----- 84 (324) T PRK06871 20 GRGHHALLFKADSGLGTEQLIRALAQWLMCQAPGDEQPCGQCHSCHLF-------QAGNHPDF-HILEPI--DGK----- 84 (324) T ss_pred CCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHH-------HCCCCCCE-EEEECC--CCC----- T ss_conf 995437876899997899999999999828999999988889899999-------73899987-998467--888----- Q ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHH Q ss_conf 99999999999999987089968999999998874478667754467765454322233223234698687763446888 Q gi|254780799|r 523 LKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADL 602 (806) Q Consensus 523 L~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDl 602 (806) ..||-.|-.-++++...- ..=.|=|+|||+ ||. T Consensus 85 ----------------~I~vd~IR~l~~~~~~~~------------------------------~~g~~KV~iI~~-ae~ 117 (324) T PRK06871 85 ----------------DIGVDQVREINEKVSQFA------------------------------QQGGNKVVYIQG-AER 117 (324) T ss_pred ----------------CCCHHHHHHHHHHHHHCC------------------------------CCCCCEEEEECC-HHH T ss_conf ----------------788999999999986462------------------------------205966999758-888 Q ss_pred HHHCC-----CHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCHHHHCCC Q ss_conf 87321-----0058899999986641423799996577753554355411025158764586642123388645786588 Q gi|254780799|r 603 MMVAR-----KDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGAEQLLGQ 677 (806) Q Consensus 603 mm~~~-----~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~gae~Llg~ 677 (806) |-.++ |-.|++ ..+...||.|.+|+- +-.+|++-- -++.|..-+.-+...-|-+++ -+. T Consensus 118 m~~~AaNALLKtLEEP---------p~~~~fiL~t~~~~~--ll~TI~SRC-q~~~~~~p~~~~~~~wL~~~~----~~~ 181 (324) T PRK06871 118 LTEAAANALLKTLEEP---------RPNTYFLLQADLSAS--LLATIYSRC-QTWLIHVPEEQIALDWLQAQS----SGE 181 (324) T ss_pred HHHHHHHHHHHHHCCC---------CCCEEEEEEECCCCC--CCCHHHHCC-CCEECCCCCHHHHHHHHHHHC----CCC T ss_conf 5799999999983389---------878389998787010--324086266-120089949999999999746----887 Q ss_pred -CCEE--EECCCCCEEEEEECCCCHHHHHHHHHHHH Q ss_conf -7547--73689832588833489889999999997 Q gi|254780799|r 678 -GDML--YMTGGGRVQRIHGPFVSDIEVEKVVSHLK 710 (806) Q Consensus 678 -gdml--~~~~~~~~~r~~g~~v~~~ev~~v~~~~~ 710 (806) .|.. ..-.++.|.+.. .|+.++..+.--++++ T Consensus 182 ~~~~~~al~~~~g~pl~A~-~~~~~~~~~~r~~~~~ 216 (324) T PRK06871 182 IQEIQTALRINYGRPLLAL-TFLEQGLLEQRKTFLR 216 (324) T ss_pred HHHHHHHHHHCCCCHHHHH-HHHCCCHHHHHHHHHH T ss_conf 2999999997699879999-9868779999999999 No 37 >TIGR01420 pilT_fam twitching motility protein; InterPro: IPR006321 These represent the PilT subfamily of proteins related to GspE, a protein involved in type II secretion (also called the General Secretion Pathway). PilT is an apparent cytosolic ATPase associated with type IV pilus systems. It is not required for pilin biogenesis, but is required for twitching motility and social gliding behaviors, shown in some species, powered by pilus retraction . Members of this family may be found in some species that do not have type IV pili but have related structures for DNA uptake and natural transformation. ; GO: 0005524 ATP binding, 0006810 transport. Probab=95.37 E-value=0.011 Score=39.70 Aligned_cols=32 Identities=41% Similarity=0.720 Sum_probs=23.5 Q ss_pred EEEECCCCC--EEEEEECCCCHHHHHHHHHHHHH Q ss_conf 785410020--23553047740679999999999 Q gi|254780799|r 447 IIADLARMP--HLLIAGTTGSGKSVAINTMILSL 478 (806) Q Consensus 447 vv~DLakMP--HLLIAGtTGSGKSV~iN~iI~Sl 478 (806) |+-++++-| =.||.|.||||||=-+=|||=-| T Consensus 118 v~~~~a~~~~GLiLVTGPTGSGKSTTlAsmIDyI 151 (350) T TIGR01420 118 VLRELAERPRGLILVTGPTGSGKSTTLASMIDYI 151 (350) T ss_pred HHHHHHHCCCCCEEEECCCCCCHHHHHHHHHHHH T ss_conf 9999983669938987688986789999999787 No 38 >PRK13721 conjugal transfer ATP-binding protein TraC; Provisional Probab=95.24 E-value=0.024 Score=37.24 Aligned_cols=184 Identities=17% Similarity=0.191 Sum_probs=96.6 Q ss_pred CCCCCCCCEEEEECC--CCC----EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECC-----CHHH-------- Q ss_conf 045666666785410--020----235530477406799999999999829957847888523-----1001-------- Q gi|254780799|r 438 LGKSIEGKPIIADLA--RMP----HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDP-----KMLE-------- 498 (806) Q Consensus 438 LGKdI~G~pvv~DLa--kMP----HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDP-----K~vE-------- 498 (806) +|. =|++...|+= .++ -.-|+|++||||||.+|-|+.+.+-. --|.+.||+ |..| T Consensus 454 ~~r--RgQ~~~~d~F~~~~~~tN~N~~I~g~SGsGKSf~~q~l~~~~~~~----G~~V~IID~G~SY~~lc~~~gG~yi~ 527 (864) T PRK13721 454 PTY--RNQLAFIDIFFRGMNNTNYNMAVCGTSGAGKTGLIQPLIRSVLDS----GGFAVVFDMGDGYKSLCENMGGVYLD 527 (864) T ss_pred EEC--CCCCCEECCCCCCCCCCCCCEEEECCCCCCHHHHHHHHHHHHHHC----CCEEEEEECCCCHHHHHHHCCCEEEC T ss_conf 211--686103455678999987006998289986899999999999857----99799996997899999973998866 Q ss_pred -----HHHCCCCH----H------HH----CC--CCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH------- Q ss_conf -----11027703----4------31----22--334304566899999999999999998708996899999------- Q gi|254780799|r 499 -----LSVYDGIP----N------LL----TP--VVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNL------- 550 (806) Q Consensus 499 -----ls~Y~~iP----H------Ll----~p--VvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~------- 550 (806) |.++..|. + |+ .| -+||-. -..|.-+|.+.=+++. ....+.++..|=+ T Consensus 528 ~~~i~~NPF~~i~d~~~~~e~i~~ll~~masp~~~l~~~e--~~~l~~Av~~~w~~~g--~~~~iddv~~~L~~~~~~~~ 603 (864) T PRK13721 528 GETLRFNPFANITDIDQSAERIRDQLSVMASPNGNLDEVH--EGLLLQAVRASWLAKK--NRARIDDVVDFLKNARDSEQ 603 (864) T ss_pred CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHH--HHHHHHHHHHHHHHHC--CCCCCCHHHHHHHHCCHHHH T ss_conf 9965468875666631046899999999838899999799--9999999999999828--98883089999873310144 Q ss_pred ---------HHHHH-------HHCCCCCCCCCCCC----CCCCCCCCCCCCCC----CCC-------------------- Q ss_conf ---------99988-------74478667754467----76545432223322----323-------------------- Q gi|254780799|r 551 ---------KVAQY-------HNTGKKFNRTVQTG----FDRKTGEAIYETEH----FDF-------------------- 586 (806) Q Consensus 551 ---------k~~~~-------~~~~~~~~~~~~~~----~~~~~~~~~~~~~~----~~~-------------------- 586 (806) ++++. -..|. ......| .+.+..--.+|-+. ++| T Consensus 604 ~~~~~~~~~r~~~l~~~L~~y~~~G~--Yg~~Fng~~~~l~~~~~fvVlEL~~Lk~~~~L~~vvl~~lm~~i~~~my~~~ 681 (864) T PRK13721 604 YADSPTIRSRLDEMIVLLDQYTANGT--YGEYFNSDEPSLRDDAKMVVLELGGLEDRPSLLVAVMFSLIIYIENRMYRTP 681 (864) T ss_pred HHCCCHHHHHHHHHHHHHHHHCCCCC--CCCEECCCCCCCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 41341178999999999986525898--8642059998666889889998301488767999999999999999997577 Q ss_pred CCCCEEEEEHHHHHHHHHHCCC--HHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 4698687763446888873210--0588999999866414237999965777 Q gi|254780799|r 587 QHMPYIVVVIDEMADLMMVARK--DIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 587 ~~lp~ivviiDElaDlmm~~~~--~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) ...|. +|||||--.|+ +++ .+-+.|-.....+|==|=-+|++||--. T Consensus 682 R~~~k-~~iiDEAW~lL--~~~n~~~~~FIe~~yRr~RKy~Gs~i~iTQ~i~ 730 (864) T PRK13721 682 RNLKK-LNVIDEGWRLL--DFKNHKVGEFIEKGYRTARRHTGAYITITQNIV 730 (864) T ss_pred CCCEE-EEEEEHHHHHH--CCCCHHHHHHHHHHHHHHHHHCCEEEEEECCHH T ss_conf 99717-99970187764--589878999999999998861866999922667 No 39 >smart00487 DEXDc DEAD-like helicases superfamily. Probab=95.15 E-value=0.19 Score=30.51 Aligned_cols=38 Identities=32% Similarity=0.433 Sum_probs=25.1 Q ss_pred CEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECC Q ss_conf 0235530477406799999999999829957847888523 Q gi|254780799|r 455 PHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDP 494 (806) Q Consensus 455 PHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDP 494 (806) -+++|++.|||||+.+.=..+...+.+.. ..|.+.+=| T Consensus 25 ~~~~i~~~tGsGKT~~~~~~~~~~~~~~~--~~~~li~~P 62 (201) T smart00487 25 RDVILAAPTGSGKTLAALLPALEALKRGK--GKRVLVLVP 62 (201) T ss_pred CCEEEECCCCCCHHHHHHHHHHHHHHCCC--CCEEEEEEC T ss_conf 98899899996099999999999863389--975999908 No 40 >TIGR00929 VirB4_CagE type IV secretion/conjugal transfer ATPase, VirB4 family; InterPro: IPR004346 This family includes the Helicobacter pylori protein CagE (see examples), which together with other proteins from the cag pathogenicity island (PAI), encodes a type IV transporter secretion system. The precise role of CagE is not known, but studies in animal models have shown that it is essential for pathogenesis in Helicobacter pylori induced gastritis and peptic ulceration . Indeed, the expression of the cag PAI has been shown to be essential for stimulating human gastric epithelial cell apoptosis in vitro . Similar type IV transport systems are also found in other bacteria. This family includes proteins from the trb and Vir conjugal transfer systems in Agrobacterium tumefaciens and homologues of VirB proteins from other species.; GO: 0005524 ATP binding. Probab=94.97 E-value=0.032 Score=36.24 Aligned_cols=314 Identities=20% Similarity=0.257 Sum_probs=182.6 Q ss_pred EECCEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEE------ECCCCCCEEEEHHHCCCHHHHH Q ss_conf 1044244444321478639999978899998863001210003886125553------0245662387342307213430 Q gi|254780799|r 356 RPGPVITLYELEPAPGIKSSRIIGLSDDIARSMSAISARVAVIPRRNAIGIE------LPNDIRETVMLRDLIVSRVFEK 429 (806) Q Consensus 356 ~pGPVVTrYEi~PApGVKvSKI~nLadDIA~aLsa~svRIapIPGK~~VGIE------iPN~~r~~V~lreil~s~~f~~ 429 (806) ....+|..|+=. | -.+-...|.+.++++.+...-. .-++.+-.+|.| .|+.-.-.++- ..|.|..|.. T Consensus 385 ~~~~~~~v~aGv---G-aL~~~~~L~~~~~~a~~~l~~~-G~~~~~E~~~~e~~f~s~LP~n~~~~~R~-~~its~nfA~ 458 (931) T TIGR00929 385 EHHLSLVVYAGV---G-ALEDKEKLDAALSEARSLLNAK-GIVAVKETLGLEAAFWSQLPGNFKYNPRK-SLITSRNFAS 458 (931) T ss_pred ECCEEEEEEECC---C-CCCCHHHHHHHHHHHHHHHHHC-CCEEEECCCCHHHHHHHHCCCCCCCCCCC-HHHHHHHHHH T ss_conf 302278998052---5-6788889999999999998726-84243201001157897263544100350-1111799998 Q ss_pred -------------C-----CCC-CHHHCCCCCCCCEEEEECC------------CCC--EEEEEECCCCHHHHHHHHHHH Q ss_conf -------------2-----210-0021045666666785410------------020--235530477406799999999 Q gi|254780799|r 430 -------------N-----QCD-LAINLGKSIEGKPIIADLA------------RMP--HLLIAGTTGSGKSVAINTMIL 476 (806) Q Consensus 430 -------------s-----~~~-L~iaLGKdI~G~pvv~DLa------------kMP--HLLIAGtTGSGKSV~iN~iI~ 476 (806) . ..- -++.+=|+.+|.|+..+.= +|+ |=+|=|.|||||||-||=+++ T Consensus 459 l~~~~~~~~G~~~~NCP~~~WG~~al~~l~t~~gtPfyfNFH~~~~~~~~A~~~~r~~GhT~IfG~~G~GKTtLl~fL~a 538 (931) T TIGR00929 459 LIPFHNFNLGKSQNNCPEYPWGDPALTLLKTQSGTPFYFNFHVRDAKEKNADEGKRVLGHTLIFGPTGSGKTTLLNFLLA 538 (931) T ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCEECCCCCCCCCCCCCCCCHHHCCCEEEECCCCCCHHHHHHHHHH T ss_conf 87765146888888488886668706898717898602245457212465000311038777888889846999999999 Q ss_pred HHHHHCCH-HHEEEEEECC---------------CHHH-------HHHCCCC-H-H------HHCC-------------- Q ss_conf 99982995-7847888523---------------1001-------1102770-3-4------3122-------------- Q gi|254780799|r 477 SLLYRMTP-AQCRLIMIDP---------------KMLE-------LSVYDGI-P-N------LLTP-------------- 511 (806) Q Consensus 477 SlLyk~~P-~evkliliDP---------------K~vE-------ls~Y~~i-P-H------Ll~p-------------- 511 (806) .+. |+.| +-.+++.+|= +.++ .+-=-+| | | |..- T Consensus 539 ~~~-ky~~~~a~~~~~fDkd~g~~~~~~a~gG~Y~~i~G~~T~~~~g~~~~~NPl~~~~g~~l~~t~~n~~F~~~~~~~l 617 (931) T TIGR00929 539 QLQ-KYKPNFALTIFAFDKDRGMEIFIRALGGKYLEIKGDATVAADGEPFGFNPLQGIDGEALEDTERNRAFLKEWLAML 617 (931) T ss_pred HHH-CCCCCCCEEEEEEECCCCCHHHHEEECCEEEEECCCEECCCCCCCCCCCCHHHHCCCCCCCCCCCCHHHHHHHHHH T ss_conf 974-2488987069998878982104111174565303301016788866568020123367777724407999999999 Q ss_pred ----CCC---CHHHHHHHHHHHHHHHHHHH----HHH-------HHCCCCCHH------HHHHHHHHHHHCCCCC-CCCC Q ss_conf ----334---30456689999999999999----999-------870899689------9999999887447866-7754 Q gi|254780799|r 512 ----VVT---NPQKAVTVLKWLVCEMEERY----QKM-------SKIGVRNID------GFNLKVAQYHNTGKKF-NRTV 566 (806) Q Consensus 512 ----VvT---d~~kA~~aL~w~V~EMe~RY----~l~-------a~~~vRni~------~yN~k~~~~~~~~~~~-~~~~ 566 (806) -.+ .+.+-.++|.-+|.-|.+-+ ..+ ..+....++ +--.+.+..-+.|.-. .-.+ T Consensus 618 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~r~l~~l~~~l~~~~~~~~~~~~~l~~~L~~w~~~~~~G~~Gef 697 (931) T TIGR00929 618 VTLDGETDEEISAEDRNALSSAIDSLYEEDQKEHPAIRSLSDLLNFLPKDSIEVVEARNSLADRLERWLKGGDYGGDGEF 697 (931) T ss_pred HHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCEE T ss_conf 85127665436789999999998888731441125522089999972577530004502389998875178987555203 Q ss_pred CCCCC-CCCCCCCCCC-----------------C--------------CCCCCCC-CEEEEEHHHHHHHHHHCCCHHHHH Q ss_conf 46776-5454322233-----------------2--------------2323469-868776344688887321005889 Q gi|254780799|r 567 QTGFD-RKTGEAIYET-----------------E--------------HFDFQHM-PYIVVVIDEMADLMMVARKDIESA 613 (806) Q Consensus 567 ~~~~~-~~~~~~~~~~-----------------~--------------~~~~~~l-p~ivviiDElaDlmm~~~~~ve~~ 613 (806) |--|| .+++.-.+.. + ...++.- --.|++|||+=-+. ..--.+.- T Consensus 698 ~wLFD~~~~D~Ldl~~~~~~gfd~~~ll~~~e~~~~~~p~~~YLf~Ri~~~~dg~~r~~~~~~DEaw~~l--~~p~~~~~ 775 (931) T TIGR00929 698 QWLFDGNETDSLDLSKADIIGFDLTELLDNPELEKVATPVLLYLFHRIEEALDGSNRRFLIVIDEAWKLL--GDPVFAAK 775 (931) T ss_pred EEECCCCCCCCCCCCCCCEEEEEHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCHHHH--CCHHHHHH T ss_conf 2110689754123789714787554642476866789999999999999972413586799851305332--69078999 Q ss_pred HHHHHHHHHHCCEEEEEEECCC-CCCC---CCHHHHHCCCCEEEEEECCCCCCHHHCCCCCHHHHCCCCCEEEEC-CCCC Q ss_conf 9999986641423799996577-7535---543554110251587645866421233886457865887547736-8983 Q gi|254780799|r 614 VQRLAQMARASGIHVIMATQRP-SVDV---ITGTIKANFPTRISFQVSSKIDSRTILGEQGAEQLLGQGDMLYMT-GGGR 688 (806) Q Consensus 614 i~rlaq~ara~GiHli~aTqrP-svdv---itg~ikan~p~riaf~v~s~~dSrtild~~gae~Llg~gdml~~~-~~~~ 688 (806) |.....-+|=..=++|+|||-| +-=- |-=.|..|.||-|.| | +... T Consensus 776 i~~~l~t~RK~NG~~v~ATQ~~Y~d~~~s~i~~~~~~~~~T~I~l-----------------------------Pn~~a~ 826 (931) T TIGR00929 776 IKDWLKTLRKANGIVVLATQSIYNDALESRIADTLIEQCATKIFL-----------------------------PNPKAD 826 (931) T ss_pred HHHHHHHHHHHCCEEEEECCHHHHHHHCCCHHHHHHHHCCCEEEC-----------------------------CCCCCC T ss_conf 999987576609779863001477763154246889615834324-----------------------------884568 Q ss_pred EEEEEECC----CCHHHHHHHHH Q ss_conf 25888334----89889999999 Q gi|254780799|r 689 VQRIHGPF----VSDIEVEKVVS 707 (806) Q Consensus 689 ~~r~~g~~----v~~~ev~~v~~ 707 (806) -.-..-.| .||.|.+-|-. T Consensus 827 ~~dy~~~f~~~glte~E~~~l~~ 849 (931) T TIGR00929 827 REDYAEGFGIKGLTEREFELLKS 849 (931) T ss_pred HHHHHHHCCCCCCCHHHHHHHHH T ss_conf 58999855888979899999985 No 41 >pfam06745 KaiC KaiC. This family represents a conserved region within bacterial and archaeal proteins, most of which are hypothetical. More than one copy is sometimes found in each protein. This family includes KaiC, which is one of the Kai proteins among which direct protein-protein association may be a critical process in the generation of circadian rhythms in cyanobacteria. Probab=94.89 E-value=0.26 Score=29.40 Aligned_cols=133 Identities=17% Similarity=0.258 Sum_probs=71.1 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 23553047740679999999999982995784788852310011102770343122334304566899999999999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQ 535 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~ 535 (806) =.||.|.+|||||+.--.++..=+.+ ..+.|=+| |..+...+++ +....+.|-..+++..=+ T Consensus 21 ~~LI~G~pGsGKT~la~qfl~~ga~~-~ge~~lYi---------s~ee~~~~l~--------~~~~~~g~~~~~~~~~g~ 82 (231) T pfam06745 21 VVLITGGPGTGKTIFGLQFLYNGALE-YGEPGVYV---------TLEEPPEDLR--------ENAKSFGWDLEKLEEEGK 82 (231) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHHH-CCCCEEEE---------EECCCHHHHH--------HHHHHCCCCHHHHHHCCC T ss_conf 99998589725999999999999986-58968999---------8137999999--------999982998589864696 Q ss_pred ------------HHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHH Q ss_conf ------------99870899689999999988744786677544677654543222332232346986877634468888 Q gi|254780799|r 536 ------------KMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLM 603 (806) Q Consensus 536 ------------l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlm 603 (806) ........+++...+++.++.++ .-|. .||||-+..|. T Consensus 83 l~i~d~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~-----------------------------~~~~-~vVIDsit~l~ 132 (231) T pfam06745 83 LAIIDASTSGIGIAEVKGRFDLEELIERLREAIRE-----------------------------IGAK-RVVIDSITTLF 132 (231) T ss_pred EEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHH-----------------------------HCCC-EEEEECCHHHH T ss_conf 78986254422210011227999999999999997-----------------------------1998-89997641640 Q ss_pred HHCC-CHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 7321-00588999999866414237999965777 Q gi|254780799|r 604 MVAR-KDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 604 m~~~-~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) +... .++-..+.+|.+..|..|+..++-.+-++ T Consensus 133 ~~~~~~~~r~~l~~l~~~lk~~g~t~l~t~e~~~ 166 (231) T pfam06745 133 YLLKPAMAREILRRLKRVLKKLGVTAIFTSEKPS 166 (231) T ss_pred HCCCHHHHHHHHHHHHHHHHHCCCEEEEEEEECC T ss_conf 0588999999999999999976991999982125 No 42 >pfam00735 Septin Septin. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP. As regards the septins, these are polypeptides of 30-65kDa with three characteristic GTPase motifs (G-1, G-3 and G-4) that are similar to those of the Ras family. The G-4 motif is strictly conserved with a unique septin consensus of AKAD. Most septins are thought to have at least one coiled-coil region, which in some cases is necessary for intermolecular interactions that allow septins to polymerize to form rod-shaped complexes. In turn, these are arranged into tandem arrays to form filaments. They are multifunctional proteins, with roles in cytokinesis, sporulation, germ cell development, exocytosis and apoptosis. Probab=94.81 E-value=0.011 Score=39.66 Aligned_cols=181 Identities=19% Similarity=0.311 Sum_probs=84.4 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHH------EEEEEECCCHHHHHHCCCCHHHHCCCCCCH---HHHH--HHHH Q ss_conf 2355304774067999999999998299578------478885231001110277034312233430---4566--8999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQ------CRLIMIDPKMLELSVYDGIPNLLTPVVTNP---QKAV--TVLK 524 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~e------vkliliDPK~vEls~Y~~iPHLl~pVvTd~---~kA~--~aL~ 524 (806) -+||+|++|+|||-+|||+..+-++..+... -+-.-|.....|+- -+|+.--|+ ||.-| ++.. ...+ T Consensus 6 nimvvG~sGlGKTTfiNtL~~~~~~~~~~~~~~~~~~~~t~~i~~~~~~ie-e~g~~l~Lt-iiDTpGfGd~i~n~~~~~ 83 (280) T pfam00735 6 TLMVVGESGLGKTTLINTLFLTDLYPERGIPGPSEKIKKTVEIKATTVEIE-EDGVKLNLT-VIDTPGFGDAIDNSNCWK 83 (280) T ss_pred EEEEECCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEE-ECCEEEEEE-EEECCCCCCCCCHHHHHH T ss_conf 999977999978999999857857786678884335567753488999998-899899999-998998423345166699 Q ss_pred HHHHHHHHHHHHH--HHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCC--CCCCCCCCCCCCCCCCC----CCCCEEEEEH Q ss_conf 9999999999999--87089968999999998874478667754467--76545432223322323----4698687763 Q gi|254780799|r 525 WLVCEMEERYQKM--SKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTG--FDRKTGEAIYETEHFDF----QHMPYIVVVI 596 (806) Q Consensus 525 w~V~EMe~RY~l~--a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~----~~lp~ivvii 596 (806) -+++-.|.+|+.. .+.+++- ..+ ...+++.- |-+.++..+.+-+-..+ ++-.-|=||- T Consensus 84 ~I~~yI~~qf~~yl~eE~~i~R-~~~-------------~D~RVH~cLYFI~Ptgh~L~~lDi~~mk~Ls~~vNvIPVIa 149 (280) T pfam00735 84 PIVEYIDEQFEQYLRDESRLNR-TSI-------------VDNRVHCCLYFISPTGHGLKPLDVEFMKKLHEKVNIIPVIA 149 (280) T ss_pred HHHHHHHHHHHHHHHHHHCCCC-CCC-------------CCCCEEEEEEEECCCCCCCCHHHHHHHHHHHCCCCEEEEEE T ss_conf 9999999999999998647566-777-------------88843799997568888886889999998714465366662 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCH---HHHHCCCCEEEEEECCC Q ss_conf 4468888732100588999999866414237999965777535543---55411025158764586 Q gi|254780799|r 597 DEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITG---TIKANFPTRISFQVSSK 659 (806) Q Consensus 597 DElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg---~ikan~p~riaf~v~s~ 659 (806) =||.| ...|++..=.++-+--.+.||-+-=. |+.+.-.. ---..+.+.+-|+|.+. T Consensus 150 --KADtL--T~~El~~~K~~I~~~l~~~~I~i~~f---~~~~~d~~e~~~~~~~~~~~~PFaIigS 208 (280) T pfam00735 150 --KADTL--TPDELQRFKKRIRAEIERQNIKIYHF---PDEESDEDEEKEQTEQLKSSIPFAIVGS 208 (280) T ss_pred --CCCCC--CHHHHHHHHHHHHHHHHHCCCCEECC---CCCCCCCHHHHHHHHHHCCCCCCEEECC T ss_conf --14558--99999999999999999859725468---8854355778999987405799779888 No 43 >PRK04328 hypothetical protein; Provisional Probab=94.75 E-value=0.41 Score=28.01 Aligned_cols=134 Identities=17% Similarity=0.184 Sum_probs=75.2 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 23553047740679999999999982995784788852310011102770343122334304566899999999999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQ 535 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~ 535 (806) -.||+|.+|||||..-.-.+..=+-+- +.|=++-.+-..-++ + .-+..+.|-+.+.++..+ T Consensus 26 ~~Lv~G~pGtGKT~la~qFl~~g~~~G--E~~lyis~eE~~~~l---------~--------~~~~~~G~d~~~~~~~g~ 86 (250) T PRK04328 26 VVLLSGGPGTGKSIFSQQFLWNGLQMG--EPGIYVALEEHPVQV---------R--------RNMAQFGWDVRKYEEEGK 86 (250) T ss_pred EEEEEECCCCCHHHHHHHHHHHHHHCC--CCEEEEEEECCHHHH---------H--------HHHHHCCCCHHHHHHCCC T ss_conf 999982899998999999999998769--977999972799999---------9--------999980998689865697 Q ss_pred H------------------HHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHH Q ss_conf 9------------------9870899689999999988744786677544677654543222332232346986877634 Q gi|254780799|r 536 K------------------MSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVID 597 (806) Q Consensus 536 l------------------~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiD 597 (806) + +......|+..+-+.++++.+. .=|. .|||| T Consensus 87 l~iid~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~-----------------------------~~~~-rvVID 136 (250) T PRK04328 87 FAIVDAFTGGIGEAAEREKYVVKDPDDVRELIDVLRQAIRD-----------------------------IGAK-RVVID 136 (250) T ss_pred EEEEECCCCCCCCCCHHHHHCCCCCCCHHHHHHHHHHHHHH-----------------------------HCCC-EEEEE T ss_conf 79985123333420000010136853599999999999985-----------------------------1898-89993 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCC Q ss_conf 46888873210058899999986641423799996577753 Q gi|254780799|r 598 EMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVD 638 (806) Q Consensus 598 ElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvd 638 (806) -+.-|.+..+......+.+|.+.-+..|+-.++-+|-+..+ T Consensus 137 Sit~l~~~~~~~~r~~l~~l~~~l~~~g~Ttll~~e~~~~~ 177 (250) T PRK04328 137 SVSTLYLTKPAVARSIVMQLKRVLAGLGCTSIFVSQVSVGE 177 (250) T ss_pred CCHHHHHCCHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCC T ss_conf 70787745858899999999999986898699997100365 No 44 >pfam00437 GSPII_E Type II/IV secretion system protein. This family contains both type II and type IV pathway secretion proteins from bacteria. VirB11 ATPase is a subunit of the Agrobacterium tumefaciens transfer DNA (T-DNA) transfer system, a type IV secretion pathway required for delivery of T-DNA and effector proteins to plant cells during infection. Probab=94.66 E-value=0.026 Score=37.00 Aligned_cols=90 Identities=24% Similarity=0.481 Sum_probs=49.6 Q ss_pred CCCC--CCCCCCEEEEEECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHH Q ss_conf 2100--03886125553024566238734230721343022100021045666666785410020235530477406799 Q gi|254780799|r 393 ARVA--VIPRRNAIGIELPNDIRETVMLRDLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVA 470 (806) Q Consensus 393 vRIa--pIPGK~~VGIEiPN~~r~~V~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~ 470 (806) +|++ |.-+...+-|-.|+. ....+.++..+..|...-..+ |-.-+. ++ =.+||+|.|||||+-. T Consensus 90 ~~i~~~p~~~g~si~IRk~~~--~~~tl~~L~~~g~~~~~~~~~---L~~~v~--------~~-~~ilIsG~TGSGKTT~ 155 (283) T pfam00437 90 FRVSTPPVVAGEAFVIRKPSS--RSLTLDDLGMTGAFDADIAEF---LRQAVQ--------AR-GNILVSGGTGSGKTTL 155 (283) T ss_pred EEEEECCCCCCCEEEEECCCC--CCCCHHHHCCCCCCHHHHHHH---HHHHHH--------CC-CEEEEECCCCCCHHHH T ss_conf 999987523674589850667--789999973897785999999---999998--------19-7599988999988999 Q ss_pred HHHHHHHHHHHCCHHHEEEEEECCCHHHHHH Q ss_conf 9999999998299578478885231001110 Q gi|254780799|r 471 INTMILSLLYRMTPAQCRLIMIDPKMLELSV 501 (806) Q Consensus 471 iN~iI~SlLyk~~P~evkliliDPK~vEls~ 501 (806) +|+++..+ .+.+-|.+.|-- -+|+.. T Consensus 156 l~all~~i----~~~~~riitiED-~~El~l 181 (283) T pfam00437 156 LYALLNEI----NTDDERIVTIED-PVEIQL 181 (283) T ss_pred HHHHHHHC----CCCCCCEEEECC-CCEECC T ss_conf 99999840----877762787337-852317 No 45 >cd01850 CDC_Septin CDC/Septin. Septins are a conserved family of GTP-binding proteins associated with diverse processes in dividing and non-dividing cells. They were first discovered in the budding yeast S. cerevisiae as a set of genes (CDC3, CDC10, CDC11 and CDC12) required for normal bud morphology. Septins are also present in metazoan cells, where they are required for cytokinesis in some systems, and implicated in a variety of other processes involving organization of the cell cortex and exocytosis. In humans, 12 septin genes generate dozens of polypeptides, many of which comprise heterooligomeric complexes. Since septin mutants are commonly defective in cytokinesis and formation of the neck formation of the neck filaments/septin rings, septins have been considered to be the primary constituents of the neck filaments. Septins belong to the GTPase superfamily for their conserved GTPase motifs and enzymatic activities. Probab=94.64 E-value=0.021 Score=37.65 Aligned_cols=154 Identities=19% Similarity=0.291 Sum_probs=78.7 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHH-------EEEEEECCCHHHHHHCCCCHHHHCCCCCCH---HHH--HHHH Q ss_conf 2355304774067999999999998299578-------478885231001110277034312233430---456--6899 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQ-------CRLIMIDPKMLELSVYDGIPNLLTPVVTNP---QKA--VTVL 523 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~e-------vkliliDPK~vEls~Y~~iPHLl~pVvTd~---~kA--~~aL 523 (806) -+||+|.+|+|||-+||++..+-++...... -+-.-|...++|+.- +|++--|+ ||.-| .+. .... T Consensus 6 nimVvG~sGlGKsTfiNtLf~~~~~~~~~~~~~~~~~~~~t~~i~~~~~~i~e-~g~~l~Lt-viDTpGfGd~i~n~~~~ 83 (276) T cd01850 6 NIMVVGESGLGKSTFINTLFNTKLIPSDYPPDPAEEHIDKTVEIKSSKAEIEE-NGVKLKLT-VIDTPGFGDNINNSDCW 83 (276) T ss_pred EEEEECCCCCCHHHHHHHHHCCCCCCCCCCCCCHHHCCCCCCEEEEEEEEEEE-CCEEEEEE-EEECCCCCCCCCHHHHH T ss_conf 99997689998899999974785778777788132056888258999999963-99899999-99768841223603409 Q ss_pred HHHHHHHHHHHHHH--HHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCC--CCCCCCCCCCCCCCCCC----CCCCEEEEE Q ss_conf 99999999999999--87089968999999998874478667754467--76545432223322323----469868776 Q gi|254780799|r 524 KWLVCEMEERYQKM--SKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTG--FDRKTGEAIYETEHFDF----QHMPYIVVV 595 (806) Q Consensus 524 ~w~V~EMe~RY~l~--a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~----~~lp~ivvi 595 (806) +-+++-.|.+|+.. .+.+++--..+ ...+++.- |-+.++..+.+-+-..+ ++-.-|=|| T Consensus 84 ~~I~~yI~~qf~~yl~eE~~i~R~~~~-------------~D~RVH~cLYFI~Ptgh~L~~lDi~~mk~Ls~~vNvIPVI 150 (276) T cd01850 84 KPIVDYIDDQFDQYLREESRIKRNPRI-------------PDTRVHACLYFIEPTGHGLKPLDIEFMKRLSKRVNIIPVI 150 (276) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCC-------------CCCCEEEEEEEECCCCCCCCHHHHHHHHHHCCCCCEEEEE T ss_conf 999999999999999998531457888-------------7774489999967888888799999999962756456678 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEE Q ss_conf 344688887321005889999998664142379 Q gi|254780799|r 596 IDEMADLMMVARKDIESAVQRLAQMARASGIHV 628 (806) Q Consensus 596 iDElaDlmm~~~~~ve~~i~rlaq~ara~GiHl 628 (806) - =||.| ...|.+..=.++-+--.+.||.+ T Consensus 151 a--KADtl--T~~El~~~K~~I~~~l~~~~I~i 179 (276) T cd01850 151 A--KADTL--TPEELKEFKQRIMEDIEEHNIKI 179 (276) T ss_pred E--CCCCC--CHHHHHHHHHHHHHHHHHCCCCE T ss_conf 5--16669--99999999999999999859635 No 46 >pfam04157 EAP30 EAP30/Vps36 family. This family includes EAP30 as well as the Vps36 protein. Vps36 is involved in Golgi to endosome trafficking. EAP30 is a subunit of the ELL complex. The ELL is an 80-kDa RNA polymerase II transcription factor. ELL interacts with three other proteins to form the complex known as ELL complex. The ELL complex is capable of increasing that catalytic rate of transcription elongation, but is unable to repress initiation of transcription by RNA polymerase II as is the case of ELL. EAP30 is thought to lead to the derepression of ELL's transcriptional inhibitory activity. Probab=94.60 E-value=0.16 Score=31.09 Aligned_cols=87 Identities=18% Similarity=0.197 Sum_probs=62.3 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEHHHHHHCCCHHHHH Q ss_conf 348988999999999712897421-1001245566677888887777038999999996598500014222001177899 Q gi|254780799|r 695 PFVSDIEVEKVVSHLKTQGEAKYI-DIKDKILLNEEMRFSENSSVADDLYKQAVDIVLRDNKASISYIQRRLGIGYNRAA 773 (806) Q Consensus 695 ~~v~~~ev~~v~~~~~~q~~~~y~-~~~~~~~~~~~~~~~~~~~~~d~l~~~a~~~v~~~~~~s~s~lqr~~~igy~raa 773 (806) ..||.+++.+-++.++..+.| |. -.+.....- -......+.+..-...++++-..+-+|.+.|+++|.+-+.||- T Consensus 129 ~lvSp~Dl~~A~~~l~~Lg~~-~~l~~~~sg~~~---v~s~~~~e~~~~~~~il~~~~~~~~vt~~~l~~~l~ws~~~a~ 204 (219) T pfam04157 129 ELVSPDDLLKACKKLEKLGLG-FKLVKVGSGVLV---VQSVPDSELSTDQTKILELAEELGGVTASELAEKLGWSIGRAK 204 (219) T ss_pred CCCCHHHHHHHHHHHHHCCCC-EEEEEECCCCEE---EEECCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCHHHHH T ss_conf 878999999999999975999-699997797289---9856841046899999999972799489999999798899999 Q ss_pred HHHHHHHHCCCC Q ss_conf 999999977986 Q gi|254780799|r 774 SIIENMEEKGVI 785 (806) Q Consensus 774 r~~~~~e~~giv 785 (806) ..++.||.+|++ T Consensus 205 e~L~~~~~~G~l 216 (219) T pfam04157 205 EVLEKAEKEGLL 216 (219) T ss_pred HHHHHHHHCCCE T ss_conf 999999974997 No 47 >cd04104 p47_IIGP_like p47 (47-kDa) family. The p47 GTPase family consists of several highly homologous proteins, including IGTP, TGTP/Mg21, IRG-47, GTPI, LRG-47, and IIGP1. They are found in higher eukaryotes where they play a role in immune resistance against intracellular pathogens. p47 proteins exist at low resting levels in mouse cells, but are strongly induced by Type II interferon (IFN-gamma). ITGP is critical for resistance to Toxoplasma gondii infection and in involved in inhibition of Coxsackievirus-B3-induced apoptosis. TGTP was shown to limit vesicular stomatitis virus (VSV) infection of fibroblasts in vitro. IRG-47 is involved in resistance to T. gondii infection. LRG-47 has been implicated in resistance to T. gondii, Listeria monocytogenes, Leishmania, and mycobacterial infections. IIGP1 has been shown to localize to the ER and to the Golgi membranes in IFN-induced cells and inflamed tissues. In macrophages, IIGP1 interacts with hook3, a microtubule binding protei Probab=94.53 E-value=0.026 Score=36.98 Aligned_cols=20 Identities=35% Similarity=0.662 Sum_probs=18.6 Q ss_pred EEEEEECCCCHHHHHHHHHH Q ss_conf 23553047740679999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMI 475 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI 475 (806) |.-|+|.+|||||..||++- T Consensus 3 ~iaVtGesGaGKSSfINAlR 22 (197) T cd04104 3 NIAVTGESGAGKSSFINALR 22 (197) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 79995589986899999986 No 48 >pfam06814 Lung_7-TM_R Lung seven transmembrane receptor. This family represents a conserved region with eukaryotic lung seven transmembrane receptors and related proteins. Probab=94.51 E-value=0.27 Score=29.28 Aligned_cols=79 Identities=19% Similarity=0.095 Sum_probs=50.6 Q ss_pred HCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 42706896313688645000134678999999999999999999999999999982765236789999999999999999 Q gi|254780799|r 47 TWDVYDPSFSYITLRSPKNFLGYGGAIFADVAIQFFGIASVFFLPPPTMWALSLLFDKKIYCFSKRATAWLINILVSATF 126 (806) Q Consensus 47 SYsp~DPs~~~~s~~~v~N~~G~lGA~iAd~L~~lFG~~Ayllpl~Ll~~g~~ll~~k~~~~~~~Rl~~~il~ll~~s~l 126 (806) .++++|.+......-..+|..|.+.|.-...+ .++|..+....++.++|++.++++++.........+.++.+..+..+ T Consensus 11 ~~~~~~~~~~~~g~v~~~N~~GyL~a~~~pl~-~~Y~~m~i~y~~~~~~W~~~~~~~~~~ll~iq~~i~~~i~l~~le~~ 89 (287) T pfam06814 11 FHSGSKSSTALKGKVNWKNPYGYLSASEAPLM-PFYGIMSLAYVLLGALWFFILFKYWHDILPLQKYIAAVIALGMVELA 89 (287) T ss_pred EECCCCCCEEEEEEEEEECCCCCCCHHHCCHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 96488875079999999999998794763516-59999999999999999999998399999999999999999999999 No 49 >pfam05707 Zot Zonular occludens toxin (Zot). This family consists of bacterial and viral proteins which are very similar to the Zonular occludens toxin (Zot). Zot is elaborated by bacteriophages present in toxigenic strains of Vibrio cholerae. Zot is a single polypeptide chain of 44.8 kDa, with the ability to reversibly alter intestinal epithelial tight junctions, allowing the passage of macromolecules through mucosal barriers Probab=94.43 E-value=0.025 Score=37.07 Aligned_cols=53 Identities=26% Similarity=0.390 Sum_probs=38.2 Q ss_pred EEEEHHHHHHHHHH--CCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHC Q ss_conf 87763446888873--210058899999986641423799996577753554355411 Q gi|254780799|r 592 IVVVIDEMADLMMV--ARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKAN 647 (806) Q Consensus 592 ivviiDElaDlmm~--~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan 647 (806) =++||||.-...=. .++.+-+.|.-|++ .|=-|+-+|+-||.|+ -|---|++- T Consensus 72 ~liViDE~~~~~~~r~~~~~~~~~i~~l~~-HRH~G~DiiliTQ~~~--~id~~ir~l 126 (183) T pfam05707 72 ALLVIDEAQTWFPSRRGGDKVPPVLDAFST-HRHLGWDIILITQNPS--KIDKQIRAL 126 (183) T ss_pred CEEEEECCHHHCCCCCCCCCCCHHHHHHHH-CCCCCCEEEEEECCHH--HHHHHHHHH T ss_conf 799998976554887778888389999998-0778820899918979--972999986 No 50 >pfam05872 DUF853 Bacterial protein of unknown function (DUF853). This family consists of several bacterial proteins of unknown function. One member from Brucella melitensis is thought to be an ATPase. Probab=94.41 E-value=0.03 Score=36.46 Aligned_cols=266 Identities=25% Similarity=0.365 Sum_probs=137.1 Q ss_pred CCCCCCCCEEEEECCCCC-EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCC---------------HHH--- Q ss_conf 045666666785410020-2355304774067999999999998299578478885231---------------001--- Q gi|254780799|r 438 LGKSIEGKPIIADLARMP-HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPK---------------MLE--- 498 (806) Q Consensus 438 LGKdI~G~pvv~DLakMP-HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK---------------~vE--- 498 (806) ||-.-.+.|+...|..+- |=||||+||.||.|-+..|-=++= -.-|--.|-|=| ..| T Consensus 4 ~g~~~~~~~~~l~l~~~NRHGLiaGATGTGKTvTLq~lAE~~S----~~GVPvf~aDvKGDLsGia~~G~~~~kl~~R~~ 79 (504) T pfam05872 4 LGAGTPGQPLGLALGMANRHGLIAGATGTGKTVTLQVLAESFS----DAGVPVFLADVKGDLSGIAAPGEPQDKLAARAA 79 (504) T ss_pred CCCCCCCCCEECCHHHCCCCCEECCCCCCCHHHHHHHHHHHHH----HCCCCEEEECCCCCHHHCCCCCCCCHHHHHHHH T ss_conf 1468999754357546052166226577861789999999874----549938871056614321688999757999999 Q ss_pred -H--HHC---------------CCC----------HHHHCC------------------------CCCCHHHHHHHHHHH Q ss_conf -1--102---------------770----------343122------------------------334304566899999 Q gi|254780799|r 499 -L--SVY---------------DGI----------PNLLTP------------------------VVTNPQKAVTVLKWL 526 (806) Q Consensus 499 -l--s~Y---------------~~i----------PHLl~p------------------------VvTd~~kA~~aL~w~ 526 (806) + ..| +|+ |-||+- -.-|.|+=-..|+|+ T Consensus 80 ~~g~~~~~~~~~Pv~fwdl~g~~G~pvR~Tise~GPlLLsr~L~LndtQ~gvL~~~F~~ADd~gLlLlDLkDLra~l~~v 159 (504) T pfam05872 80 SIGLDDYQPQAFPVIFWDLFGEKGHPVRTTISEMGPLLLSRLLNLNDTQEGVLNIAFRIADDKGLLLLDLKDLRAILKYV 159 (504) T ss_pred HCCCCCCCCCCCCEEEEECCCCCCCEEEEEHHHCCHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHH T ss_conf 71998888677987888657888973786454306899998717853889999988875455998023489999999998 Q ss_pred HH---HHHHHHHHHHHCCCCCHHHHHHHHHHHHHC------CCCCC---CCCCCCCC-----------CCCCCC------ Q ss_conf 99---999999999870899689999999988744------78667---75446776-----------545432------ Q gi|254780799|r 527 VC---EMEERYQKMSKIGVRNIDGFNLKVAQYHNT------GKKFN---RTVQTGFD-----------RKTGEA------ 577 (806) Q Consensus 527 V~---EMe~RY~l~a~~~vRni~~yN~k~~~~~~~------~~~~~---~~~~~~~~-----------~~~~~~------ 577 (806) -+ |...+|--.|.+.+--| -.++-..+.. |||-. +=+++..| .-...| T Consensus 160 ~d~~k~~~~~yG~vS~aSvgaI---~R~Ll~LE~qG~d~FFGEPaldi~Dlmrt~~dGrGvIniL~a~~l~~~P~LysTF 236 (504) T pfam05872 160 GDNAKELSNQYGNVSKASVGAI---QRALLTLEQQGAEHFFGEPALDIEDLMRTDADGRGVISLLAADKLIQSPKLYSTF 236 (504) T ss_pred HHHHHHHHHHCCCCCHHHHHHH---HHHHHHHHHCCHHHHCCCCCCCHHHHHCCCCCCCEEEEEEEHHHHHHCCHHHHHH T ss_conf 7638999987298558889999---9999999965436307985567699853088998799986515354271799999 Q ss_pred -------CCC--CCCCCCCCCCEEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCC Q ss_conf -------223--32232346986877634468888732100588999999866414237999965777535543554110 Q gi|254780799|r 578 -------IYE--TEHFDFQHMPYIVVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANF 648 (806) Q Consensus 578 -------~~~--~~~~~~~~lp~ivviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~ 648 (806) .|| .|--|++ -|-+|-..||=-=|---|+|..-+.|-..--.=|--|.-..-.||-|. ||=..+ =+.+ T Consensus 237 LLWLLsELfe~LPEvGD~d-KPKLVFFFDEAHLLF~dApkall~kIEqvVRLIRSKGVGVyFvTQnP~-DiPd~V-L~QL 313 (504) T pfam05872 237 LLWLLSELFEQLPEVGDPD-KPKLVFFFDEAHLLFNDAPKVLLEKVEQVVRLIRSKGVGVYFVTQNPL-DLPDTV-LAQL 313 (504) T ss_pred HHHHHHHHHHHCCCCCCCC-CCEEEEEECHHHHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCC-CCCHHH-HHHH T ss_conf 9999999997585668888-853899940166652478599999999999875306735999727876-470889-9988 Q ss_pred CCEE--------------------EEEECCCCCCHHHCCCCCHHHHCCCCCEEEE--CCCCCEEEEEECC---------- Q ss_conf 2515--------------------8764586642123388645786588754773--6898325888334---------- Q gi|254780799|r 649 PTRI--------------------SFQVSSKIDSRTILGEQGAEQLLGQGDMLYM--TGGGRVQRIHGPF---------- 696 (806) Q Consensus 649 p~ri--------------------af~v~s~~dSrtild~~gae~Llg~gdml~~--~~~~~~~r~~g~~---------- 696 (806) =.|| .|+.+...|-...| .=||-|.-|.. ..-+.|.=|+-++ T Consensus 314 GnRVQHALRAfTP~DqKAvk~aa~tfr~np~~d~~~~i------t~LgvGEAlVS~LdekG~Pt~v~rt~i~pP~S~mGp 387 (504) T pfam05872 314 GNRVQHALRAFTPRDQKAVKAAADTFRPNPDFDTEEAI------TELGTGEALVSTLDEKGAPSIVERTLVRPPASRIGP 387 (504) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCHHHHH------HHCCCCCEEEEECCCCCCCCCEEEEEEECCHHHCCC T ss_conf 77889998638854699999999857999766899998------762765024331057999651467888786330588 Q ss_pred CCHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 8988999999999712897421100 Q gi|254780799|r 697 VSDIEVEKVVSHLKTQGEAKYIDIK 721 (806) Q Consensus 697 v~~~ev~~v~~~~~~q~~~~y~~~~ 721 (806) ++++|...++.- +--...|.+.+ T Consensus 388 ~t~~er~~~i~~--Spl~~kY~~~v 410 (504) T pfam05872 388 LTDEERAALVQA--SPVAGVYDQPI 410 (504) T ss_pred CCHHHHHHHHHC--CCCHHHHCCCC T ss_conf 999999999974--96521207754 No 51 >TIGR02788 VirB11 P-type DNA transfer ATPase VirB11; InterPro: IPR014155 This entry contains VirB11, a protein that is found in the vir locus of Agrobacterium Ti plasmids where it is involved in the type IV secretion system for DNA transfer . VirB11 is believed to be an ATPase and is a homologue of the P-like conjugation system TrbB protein and the Flp pilus system protein TadA .. Probab=94.28 E-value=0.046 Score=35.05 Aligned_cols=62 Identities=31% Similarity=0.418 Sum_probs=47.4 Q ss_pred CCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEEC-CCHHHHHHCCCCHHHHC Q ss_conf 10002104566666678541002023553047740679999999999982995784788852-31001110277034312 Q gi|254780799|r 432 CDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMID-PKMLELSVYDGIPNLLT 510 (806) Q Consensus 432 ~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliD-PK~vEls~Y~~iPHLl~ 510 (806) .++.|.=+|+ +.|+|.|||||-=.+|+++ +.=|.+=|+|-|- -.=+-|+.-...=||+. T Consensus 151 l~~Ai~~~kn---------------Iii~GGTgSGKTTf~kal~-----~~IP~~ER~iTIED~~E~~~~hhpN~V~L~y 210 (328) T TIGR02788 151 LRLAIASRKN---------------IIISGGTGSGKTTFLKALV-----KEIPKDERLITIEDTRELFLPHHPNKVHLFY 210 (328) T ss_pred HHHHHHCCCE---------------EEEEECCCCHHHHHHHHHH-----HCCCCCCCEEEEEEEECCCCCCCCCEEEEEE T ss_conf 9999873891---------------9999068971899999997-----3276225278885201147888986456553 Q ss_pred CCC Q ss_conf 233 Q gi|254780799|r 511 PVV 513 (806) Q Consensus 511 pVv 513 (806) +=. T Consensus 211 sk~ 213 (328) T TIGR02788 211 SKG 213 (328) T ss_pred CCC T ss_conf 464 No 52 >COG3596 Predicted GTPase [General function prediction only] Probab=94.23 E-value=0.028 Score=36.72 Aligned_cols=73 Identities=23% Similarity=0.282 Sum_probs=40.8 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCC--HHHEEEEEECCCHHHHHHCCCCHHHH----CCCCCCHHHHHHHHHHHHHH Q ss_conf 2355304774067999999999998299--57847888523100111027703431----22334304566899999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMT--PAQCRLIMIDPKMLELSVYDGIPNLL----TPVVTNPQKAVTVLKWLVCE 529 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~--P~evkliliDPK~vEls~Y~~iPHLl----~pVvTd~~kA~~aL~w~V~E 529 (806) -.|+.|+||+|||.-||++--+.+...+ |- =.||-.-.--.|++ |-| +|=+-|-+++-..++-.+.+ T Consensus 41 nvLi~G~TG~GKSSliNALF~~~~~~v~~vg~-----~t~~~~~~~~~~~~--~~l~lwDtPG~gdg~~~D~~~r~~~~d 113 (296) T COG3596 41 NVLLMGATGAGKSSLINALFQGEVKEVSKVGV-----GTDITTRLRLSYDG--ENLVLWDTPGLGDGKDKDAEHRQLYRD 113 (296) T ss_pred EEEEECCCCCCHHHHHHHHHHCCCCEEEECCC-----CCCCHHHHHHHCCC--CCEEEECCCCCCCCHHHHHHHHHHHHH T ss_conf 58974377776889999997026734210466-----88701567741266--524884378855320221899999998 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|254780799|r 530 MEERYQ 535 (806) Q Consensus 530 Me~RY~ 535 (806) --.|-+ T Consensus 114 ~l~~~D 119 (296) T COG3596 114 YLPKLD 119 (296) T ss_pred HHHHCC T ss_conf 863224 No 53 >COG2804 PulE Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB [Cell motility and secretion / Intracellular trafficking and secretion] Probab=94.17 E-value=0.043 Score=35.34 Aligned_cols=112 Identities=29% Similarity=0.393 Sum_probs=60.2 Q ss_pred HHHHHHHHH-HHHHHHHHCCCCCC-CCCCCCEEEEE---ECCCCCCEEEEHHHCCCHHHHHCCCCCHH-HCCCCCCCCEE Q ss_conf 999997889-99988630012100-03886125553---02456623873423072134302210002-10456666667 Q gi|254780799|r 374 SSRIIGLSD-DIARSMSAISARVA-VIPRRNAIGIE---LPNDIRETVMLRDLIVSRVFEKNQCDLAI-NLGKSIEGKPI 447 (806) Q Consensus 374 vSKI~nLad-DIA~aLsa~svRIa-pIPGK~~VGIE---iPN~~r~~V~lreil~s~~f~~s~~~L~i-aLGKdI~G~pv 447 (806) +|||.=+|+ |||=-=.-.+=|+. .+.|+. |-.- +|+..-|-|.+|=+-.+. ..|.+ -||-+=.-.-. T Consensus 177 vsriK~ma~LDIaErR~PQdGr~~~~~~~~~-vd~RvStlP~~~GEkvVlRil~~~~------~~l~l~~Lg~~~~~~~~ 249 (500) T COG2804 177 VSRLKVMAGLDIAERRLPQDGRIRLKLNGRK-VDFRVSTLPTFYGEKVVLRILDKDQ------VILDLEKLGMSPFQLAR 249 (500) T ss_pred HHHHHHHHCCCHHHCCCCCCCEEEEEECCEE-EEEEEECCCCCCCCEEEEEEECCCC------CCCCHHHHCCCHHHHHH T ss_conf 9999988437525201787762899808835-8899955788778578999833312------46888783899889999 Q ss_pred EEECCCCCE--EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEE-ECCCH Q ss_conf 854100202--35530477406799999999999829957847888-52310 Q gi|254780799|r 448 IADLARMPH--LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIM-IDPKM 496 (806) Q Consensus 448 v~DLakMPH--LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evklil-iDPK~ 496 (806) +..+.+.|| +||.|.||||||.-+-+++..+ ++|+.- .+- =||=- T Consensus 250 ~~~~~~~p~GliLvTGPTGSGKTTTLY~~L~~l---n~~~~n-I~TiEDPVE 297 (500) T COG2804 250 LLRLLNRPQGLILVTGPTGSGKTTTLYAALSEL---NTPERN-IITIEDPVE 297 (500) T ss_pred HHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHH---CCCCCE-EEEEECCEE T ss_conf 999972897089996899998899999999986---278850-898407804 No 54 >cd00046 DEXDc DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region. Probab=94.15 E-value=0.075 Score=33.48 Aligned_cols=27 Identities=44% Similarity=0.539 Sum_probs=22.3 Q ss_pred CEEEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 023553047740679999999999982 Q gi|254780799|r 455 PHLLIAGTTGSGKSVAINTMILSLLYR 481 (806) Q Consensus 455 PHLLIAGtTGSGKSV~iN~iI~SlLyk 481 (806) .++||++-|||||+.+.--.+...+.+ T Consensus 1 ~~~lv~~ptGsGKT~~~~~~~~~~~~~ 27 (144) T cd00046 1 RDVLLAAPTGSGKTLAALLPILELLDS 27 (144) T ss_pred CCEEEECCCCCHHHHHHHHHHHHHHHH T ss_conf 999998899717999999999999975 No 55 >PRK08187 pyruvate kinase; Validated Probab=94.02 E-value=0.16 Score=31.00 Aligned_cols=280 Identities=19% Similarity=0.254 Sum_probs=148.0 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHC-----CCCC-CCCCCC Q ss_conf 2566545789999997424863289984104424444432147863999997889999886300-----1210-003886 Q gi|254780799|r 328 SPKVMQNNACTLKSVLSDFGIQGEIVNVRPGPVITLYELEPAPGIKSSRIIGLSDDIARSMSAI-----SARV-AVIPRR 401 (806) Q Consensus 328 s~eeL~~nA~lLE~tL~dFGVe~~Vv~v~pGPVVTrYEi~PApGVKvSKI~nLadDIA~aLsa~-----svRI-apIPGK 401 (806) |.++-+++.+.+-..=++.|-...|--=.+||-+-.=++.|+|+|.=-|.. -|+.-+-+.-. .... +|.|++ T Consensus 171 ~pe~w~~mI~~VR~Aa~~~Gr~~~IlmDL~GPKIRtG~l~~gp~v~k~~p~--rd~~G~v~~Pa~~wl~~~~~~~~~~~~ 248 (606) T PRK08187 171 DPEAWRAMIEHVRTAEEETGRRCRVLMDLAGPKIRTGPIAPGPGVVKLRPT--RDRYGRVVTPARLWITASPAPESLPGD 248 (606) T ss_pred CHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCEECCCCCCCCCCCCCCC--CCCCCCCCCCHHHHCCCCCCCCCCCCC T ss_conf 999999999999999997299417998779983113147888754312444--344343345202101345666666666 Q ss_pred CEEEEEECCCCCCEEEEHHHCCCHHHHHCCCCCH---------------------------------------------- Q ss_conf 1255530245662387342307213430221000---------------------------------------------- Q gi|254780799|r 402 NAIGIELPNDIRETVMLRDLIVSRVFEKNQCDLA---------------------------------------------- 435 (806) Q Consensus 402 ~~VGIEiPN~~r~~V~lreil~s~~f~~s~~~L~---------------------------------------------- 435 (806) +.+=+.++-.--..+.-+|.+.-..-..++-+|. T Consensus 249 ~~~~~~v~~~~~~~l~~Gd~i~~~DaRg~~R~l~V~~~~~~~~~a~~~~~ay~~~G~~l~~~~~~~~v~~~p~~e~~i~L 328 (606) T PRK08187 249 PAFCLEVTAEWLAKLEQGDEILLTDARGKKRKLVVVRVDDGGVLAEGDKTAYIQNGTKLRRGHDRTEVGGVPAVEQRLTL 328 (606) T ss_pred CCCCCCCCHHHHHHCCCCCEEEECCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEE T ss_conf 53222344033331155750110022344210000013554300002332112456422234443333566654442130 Q ss_pred -------------------------------HHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCH Q ss_conf -------------------------------2104566666678541002023553047740679999999999982995 Q gi|254780799|r 436 -------------------------------INLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTP 484 (806) Q Consensus 436 -------------------------------iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P 484 (806) =.+.-=--|++|..|=.|.= +-|-. .++ T Consensus 329 ~~Gd~l~Lt~d~~~~~~~~~~~~~~i~ctlp~l~~~v~~G~~Il~DDGkI~-~~V~~--------------------v~~ 387 (606) T PRK08187 329 KVGDRLLLTRDDAPGDPAAGGGGPRISCTLPEVVDAVRVGDRVWFDDGKIG-AVVVK--------------------VTE 387 (606) T ss_pred CCCCEEEEECCCCCCCCCCCCCCCEEECCHHHHHHHCCCCCEEEEECCCEE-EEEEE--------------------EEC T ss_conf 358879995056677732267774564040656864789997999578079-99999--------------------858 Q ss_pred HH--EEEEEECCCHHHHHHCCCC--H--HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC Q ss_conf 78--4788852310011102770--3--4312233430456689999999999999999870899689999999988744 Q gi|254780799|r 485 AQ--CRLIMIDPKMLELSVYDGI--P--NLLTPVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNT 558 (806) Q Consensus 485 ~e--vkliliDPK~vEls~Y~~i--P--HLl~pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~ 558 (806) ++ |+-.---|+-.-|+.+.|| | +|-.|.+|+-. ..-|.|+++.. +..+-.=||+-+....--+...+. T Consensus 388 ~~v~~~V~~a~~~G~kL~s~KGiN~P~~~l~lp~lT~kD--~~dl~f~~~~~----D~va~SFVr~~~Dv~~l~~~l~~~ 461 (606) T PRK08187 388 DALEVRVTHARPGGSKLKSGKGINLPDTELPLPALTDKD--LEDLAFVAKHA----DIVGLSFVQSPEDVEYLQEALAQL 461 (606) T ss_pred CEEEEEEEECCCCCCEECCCCCEECCCCCCCCCCCCHHH--HHHHHHHHHCC----CEEEEECCCCHHHHHHHHHHHHHC T ss_conf 868999997278886664898645899988998898779--99999998569----989985158899999999999965 Q ss_pred CC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEH-HHHHHHHHHCCCH--HHHHHH-------HHHHHHHHCCEE Q ss_conf 78-667754467765454322233223234698687763-4468888732100--588999-------999866414237 Q gi|254780799|r 559 GK-KFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVI-DEMADLMMVARKD--IESAVQ-------RLAQMARASGIH 627 (806) Q Consensus 559 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivvii-DElaDlmm~~~~~--ve~~i~-------rlaq~ara~GiH 627 (806) |- .+. + -...| ....++.||-|+..- .+-.|=.|+|.-| ||-.+. +|-++++|+|+- T Consensus 462 ~~~~~~--i---------iaKIE-~~~a~~nl~~Il~~Am~~~~dgvMVARGDLgVE~~~e~vp~~Qk~ii~~c~~~~~P 529 (606) T PRK08187 462 GADHLG--V---------VLKIE-TRPAFENLPRILLTAMRRPPFGVMIARGDLAVEVGFERLAEVQEEILWLCEAAHVP 529 (606) T ss_pred CCCCCC--E---------EEEEC-CHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCC T ss_conf 898874--8---------99856-98998689999985053678469997875546669777289999999999972997 Q ss_pred EEEEECCCCCCCCCHHHHHCCCCEEE Q ss_conf 99996577753554355411025158 Q gi|254780799|r 628 VIMATQRPSVDVITGTIKANFPTRIS 653 (806) Q Consensus 628 li~aTqrPsvdvitg~ikan~p~ria 653 (806) .|.||| |.-.+||.-.|||-- T Consensus 530 VI~ATQ-----mLeSM~~~~~PTRAE 550 (606) T PRK08187 530 VIWATQ-----VLESLAKKGLPSRAE 550 (606) T ss_pred EEEECC-----HHHHHCCCCCCCCHH T ss_conf 799500-----387635188987254 No 56 >pfam06414 Zeta_toxin Zeta toxin. This family consists of several bacterial zeta toxin proteins. Zeta toxin is thought to be part of a postregulational killing system in bacteria. It relies on antitoxin/toxin systems that secure stable inheritance of low and medium copy number plasmids during cell division and kill cells that have lost the plasmid. Probab=93.93 E-value=0.091 Score=32.88 Aligned_cols=92 Identities=22% Similarity=0.367 Sum_probs=48.2 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHH--HHHHCCCCHHHHCCCCCCHHHHHHH-----HHHHHH Q ss_conf 235530477406799999999999829957847888523100--1110277034312233430456689-----999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKML--ELSVYDGIPNLLTPVVTNPQKAVTV-----LKWLVC 528 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~v--Els~Y~~iPHLl~pVvTd~~kA~~a-----L~w~V~ 528 (806) =.++||..|||||...+.++-.+- ...++.|||=-+ .+--|+++ .-.+|+.++.. =+|+-. T Consensus 14 ai~laG~pGAGKS~~~~~~~~~~~------~~~~v~In~D~~r~~~P~y~~l------~~~~~~~~~~~~~~~a~~~~~~ 81 (191) T pfam06414 14 AVLLGGQPGAGKTELARALLEELG------GGNVVRIDPDELRTYHPDYDEL------QKADPKDASELTQPDASRWVEK 81 (191) T ss_pred EEEEECCCCCCHHHHHHHHHHHCC------CCCCEEECCHHHHHHHHHHHHH------HHCCHHHHHHHHHHHHHHHHHH T ss_conf 999957998888999999987537------8993897135878877747865------5407677899989999999999 Q ss_pred ----HHHHHHHHHHHCCCCCHHHHHHHHHHHHHCC Q ss_conf ----9999999998708996899999999887447 Q gi|254780799|r 529 ----EMEERYQKMSKIGVRNIDGFNLKVAQYHNTG 559 (806) Q Consensus 529 ----EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~ 559 (806) =|++||.+.=+---||.+.+-+.++.+++.| T Consensus 82 ~~~~a~~~r~n~iiegT~~~~~~~~~~~~~lk~~G 116 (191) T pfam06414 82 LIDYAIERGYNIILEGTLRSPDVARKLARKLKAAG 116 (191) T ss_pred HHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHCC T ss_conf 99999975999898577789799999999999789 No 57 >cd03270 ABC_UvrA_I The excision repair protein UvrA domain I; Nucleotide excision repair in eubacteria is a process that repairs DNA damage by the removal of a 12-13-mer oligonucleotide containing the lesion. Recognition and cleavage of the damaged DNA is a multistep ATP-dependent reaction that requires the UvrA, UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases, with UvrA having two ATP binding sites, which have the characteristic signature of the family of ABC proteins, and UvrB having one ATP binding site that is structurally related to that of helicases. Probab=93.91 E-value=0.47 Score=27.56 Aligned_cols=42 Identities=21% Similarity=0.296 Sum_probs=26.9 Q ss_pred EEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 687763446-8888732100588999999866414237999965777 Q gi|254780799|r 591 YIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 591 ~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) .+++|-||= +=|=....+++.+.|.+| |..|+-+|+.|.+++ T Consensus 158 ~~lliLDEPTsgLD~~~~~~i~~~i~~l----~~~G~Tvi~VsHd~~ 200 (226) T cd03270 158 GVLYVLDEPSIGLHPRDNDRLIETLKRL----RDLGNTVLVVEHDED 200 (226) T ss_pred CCEEEECCCCCCCCHHHHHHHHHHHHHH----HHCCCEEEEEEECHH T ss_conf 7168832873337989999999999999----976998999972578 No 58 >pfam05621 TniB Bacterial TniB protein. This family consists of several bacterial TniB NTP-binding proteins. TniB is a probable ATP-binding protein which is involved in Tn5053 mercury resistance transposition. Probab=93.67 E-value=0.55 Score=26.99 Aligned_cols=153 Identities=14% Similarity=0.187 Sum_probs=85.5 Q ss_pred CCEEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCC Q ss_conf 98687763446888873210058899999986641423799996577753554355411025158764586642123388 Q gi|254780799|r 589 MPYIVVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGE 668 (806) Q Consensus 589 lp~ivviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~ 668 (806) +--=|+||||+-.++--.+++..+...=|--.+...-|-+|.+ |+-.| +-||+.-.|.-|| T Consensus 144 ~~vrmLIIDEiHnlL~Gs~~~qr~~ln~LK~L~Nel~IpiV~v----------Gt~eA----~~ai~tD~QlasR----- 204 (302) T pfam05621 144 VGVRMLVIDELHNVLAGNSVNRREFLNLLRFLGNELRIPLVGV----------GTRDA----YLAIRSDDQLENR----- 204 (302) T ss_pred CCCCEEEEECHHHHCCCCHHHHHHHHHHHHHHHHCCCCCEEEE----------CCHHH----HHHHHCCHHHHHH----- T ss_conf 4987899854365604868899999999999863658786995----------31999----9997068888850----- Q ss_pred CCHHHHCCCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHC---CCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHH Q ss_conf 64578658875477368983258883348988999999999712---897421100124556667788888777703899 Q gi|254780799|r 669 QGAEQLLGQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKTQ---GEAKYIDIKDKILLNEEMRFSENSSVADDLYKQ 745 (806) Q Consensus 669 ~gae~Llg~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q---~~~~y~~~~~~~~~~~~~~~~~~~~~~d~l~~~ 745 (806) =+...+|. |=-|+|-.+++.-...- .+|+......-...-- .-.+|..++--.|..+ T Consensus 205 ---------F~~~~Lp~----------W~~d~ef~~LL~sfe~~LPL~~~S~L~~~~~a~~I~-~~SeG~iGei~~Ll~~ 264 (302) T pfam05621 205 ---------FEPMLLPP----------WEANDDCCSLLASFAASLPLRRPSPIATLDMARYLL-TRSEGTIGELAHLLMA 264 (302) T ss_pred ---------CCCCCCCC----------CCCCCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHH-HHCCCCHHHHHHHHHH T ss_conf ---------58611688----------889808999999999868887776888899999999-9859928799999999 Q ss_pred HHHHHHHCCCEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHCCCC--CEE Q ss_conf 999999659850001422200117789999999997798680227887--267 Q gi|254780799|r 746 AVDIVLRDNKASISYIQRRLGIGYNRAASIIENMEEKGVIGPASSTGK--REI 796 (806) Q Consensus 746 a~~~v~~~~~~s~s~lqr~~~igy~raar~~~~~e~~giv~~~~g~~~--r~v 796 (806) |+...+++|+--++ .+.+++.+.|+|.++-+. ||+ T Consensus 265 aA~~AI~sG~E~It----------------~~~l~~~~~i~ps~r~r~~~r~~ 301 (302) T pfam05621 265 AAIAAVESGEEAIN----------------HRTLSMADYIGPSERRRQFEREL 301 (302) T ss_pred HHHHHHHCCCCCCC----------------HHHHHHCCCCCCCCCCCHHHHHC T ss_conf 99999847871008----------------99996679868333430656522 No 59 >COG2805 PilT Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion] Probab=93.66 E-value=0.048 Score=34.91 Aligned_cols=114 Identities=32% Similarity=0.451 Sum_probs=65.8 Q ss_pred EEEECCCCC--EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHH- Q ss_conf 785410020--23553047740679999999999982995784788852310011102770343122334304566899- Q gi|254780799|r 447 IIADLARMP--HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVL- 523 (806) Q Consensus 447 vv~DLakMP--HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL- 523 (806) ++-+++.-| -.||.|.||||||--+-+||--+= ++.|. |.|| |-||-+-+..= T Consensus 116 i~~~~~~~~~GLILVTGpTGSGKSTTlAamId~iN-~~~~~---------------------HIlT--IEDPIE~vh~sk 171 (353) T COG2805 116 IVRELAESPRGLILVTGPTGSGKSTTLAAMIDYIN-KHKAK---------------------HILT--IEDPIEYVHESK 171 (353) T ss_pred HHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHH-CCCCC---------------------CEEE--ECCCHHHHHCCH T ss_conf 99999828796699867999967879999999984-14775---------------------1687--237468650432 Q ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHH Q ss_conf 99999999999999870899689999999988744786677544677654543222332232346986877634468888 Q gi|254780799|r 524 KWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLM 603 (806) Q Consensus 524 ~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlm 603 (806) +..++.=|- =++-.+|..-.+.|... | | =||.|-|+-|+ T Consensus 172 kslI~QREv---------G~dT~sF~~aLraALRe------------D------------P-------DVIlvGEmRD~- 210 (353) T COG2805 172 KSLINQREV---------GRDTLSFANALRAALRE------------D------------P-------DVILVGEMRDL- 210 (353) T ss_pred HHHHHHHHH---------CCCHHHHHHHHHHHHHC------------C------------C-------CEEEEECCCCH- T ss_conf 766668774---------54278899999998602------------9------------9-------97998213469- Q ss_pred HHCCCHHHHHHHHHHHHHHHCCEEEEEEECCC Q ss_conf 73210058899999986641423799996577 Q gi|254780799|r 604 MVARKDIESAVQRLAQMARASGIHVIMATQRP 635 (806) Q Consensus 604 m~~~~~ve~~i~rlaq~ara~GiHli~aTqrP 635 (806) | -.|+|=.|--.| |||++|=-- T Consensus 211 -------E--Ti~~ALtAAETG-HLV~~TLHT 232 (353) T COG2805 211 -------E--TIRLALTAAETG-HLVFGTLHT 232 (353) T ss_pred -------H--HHHHHHHHHHCC-CEEEEECCC T ss_conf -------9--999999898608-877774023 No 60 >PRK13894 conjugal transfer ATPase TrbB; Provisional Probab=93.50 E-value=0.082 Score=33.22 Aligned_cols=82 Identities=22% Similarity=0.380 Sum_probs=47.2 Q ss_pred CEEEEEECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 12555302456623873423072134302210002104566666678541002023553047740679999999999982 Q gi|254780799|r 402 NAIGIELPNDIRETVMLRDLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYR 481 (806) Q Consensus 402 ~~VGIEiPN~~r~~V~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk 481 (806) +++-|--|...+ ..|.|++++..+...-..+ |-.-|. ++ --+||+|.|||||.-++|+++-.+.+ T Consensus 111 ~~~sIRk~~~~~--~tL~dlv~~G~~~~~~a~~---L~~~V~--------~r-~nilI~G~TgsGKTTll~all~~i~~- 175 (320) T PRK13894 111 PTFAIRKKAVAI--FTLDQYVERGIMTAEQREA---IIAAVR--------AH-RNILVIGGTGSGKTTLVNAIINEMVI- 175 (320) T ss_pred CEEEEECCCCCC--CCHHHHHHCCCCCHHHHHH---HHHHHH--------CC-CCEEEECCCCCCHHHHHHHHHHHHCC- T ss_conf 659996868999--9999998769999999999---999997--------28-75899858886568999999863202- Q ss_pred CCHHHEEEEEE-CCCHHHHHH Q ss_conf 99578478885-231001110 Q gi|254780799|r 482 MTPAQCRLIMI-DPKMLELSV 501 (806) Q Consensus 482 ~~P~evklili-DPK~vEls~ 501 (806) ..|. .|++.| |.- ||.. T Consensus 176 ~~p~-eRivtIED~~--EL~~ 193 (320) T PRK13894 176 QDPT-ERVFIIEDTG--EIQC 193 (320) T ss_pred CCCC-CCEEEECCHH--HHCC T ss_conf 6952-0177525878--8078 No 61 >COG4185 Uncharacterized protein conserved in bacteria [Function unknown] Probab=93.25 E-value=0.45 Score=27.68 Aligned_cols=120 Identities=26% Similarity=0.359 Sum_probs=73.9 Q ss_pred CCE-EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 202-3553047740679999999999982995784788852310011102770343122334304566899999999999 Q gi|254780799|r 454 MPH-LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEE 532 (806) Q Consensus 454 MPH-LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~ 532 (806) ||- .+|||+-||||| |+-.|+++..-| -++-|+|--.-.. +.|+ +|+ .+=--|++-|++ T Consensus 1 m~~l~IvaG~NGsGKs----tv~~~~~~~~~~---~~~~VN~D~iA~~--------i~p~--~p~---~~~i~A~r~ai~ 60 (187) T COG4185 1 MKRLDIVAGPNGSGKS----TVYASTLAPLLP---GIVFVNADEIAAQ--------ISPD--NPT---SAAIQAARVAID 60 (187) T ss_pred CCEEEEEECCCCCCCE----EEEECCCHHHCC---CEEEECHHHHHHH--------CCCC--CCH---HHHHHHHHHHHH T ss_conf 9658999668888732----543102322148---7599877998553--------3899--964---778999999999 Q ss_pred HHHHHHHCCCCCH-------HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHH Q ss_conf 9999987089968-------999999998874478667754467765454322233223234698687763446888873 Q gi|254780799|r 533 RYQKMSKIGVRNI-------DGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMV 605 (806) Q Consensus 533 RY~l~a~~~vRni-------~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~ 605 (806) |...+.+.|+-=+ .+|-+.+++|+.+|-- -.|-|||+ | T Consensus 61 ~i~~~I~~~~~F~~ETtLS~~s~~~~ik~Ak~~Gf~-------------------------I~L~y~~i--~-------- 105 (187) T COG4185 61 RIARLIDLGRPFIAETTLSGPSILELIKTAKAAGFY-------------------------IVLNYIVI--D-------- 105 (187) T ss_pred HHHHHHHCCCCCCEEEEECCCHHHHHHHHHHHCCEE-------------------------EEEEEEEE--C-------- T ss_conf 999999749984157751361189999999867769-------------------------99999996--7-------- Q ss_pred CCCHHHHHHHHHHHHHHHCCEEEEEEE Q ss_conf 210058899999986641423799996 Q gi|254780799|r 606 ARKDIESAVQRLAQMARASGIHVIMAT 632 (806) Q Consensus 606 ~~~~ve~~i~rlaq~ara~GiHli~aT 632 (806) .+|-++.|+.|-- |.|=|=|=++ T Consensus 106 ---~~elavERVk~RV-a~GGH~IpED 128 (187) T COG4185 106 ---SVELAVERVKLRV-AKGGHDIPED 128 (187) T ss_pred ---CHHHHHHHHHHHH-HCCCCCCCHH T ss_conf ---2888999999998-6178888188 No 62 >TIGR02533 type_II_gspE general secretory pathway protein E; InterPro: IPR013369 GspE, the E protein of the type II secretion system, is also referred to as the main terminal branch of the general secretion pathway. ; GO: 0005524 ATP binding, 0008565 protein transporter activity, 0015628 protein secretion by the type II secretion system, 0015627 type II protein secretion system complex. Probab=93.24 E-value=0.034 Score=36.12 Aligned_cols=70 Identities=34% Similarity=0.554 Sum_probs=47.3 Q ss_pred HCCCCCCCCCCCCEEEEEECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCE--EEEEECCCCHH Q ss_conf 0012100038861255530245662387342307213430221000210456666667854100202--35530477406 Q gi|254780799|r 390 AISARVAVIPRRNAIGIELPNDIRETVMLRDLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPH--LLIAGTTGSGK 467 (806) Q Consensus 390 a~svRIapIPGK~~VGIEiPN~~r~~V~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPH--LLIAGtTGSGK 467 (806) ..+|||+.|| -..=|=|-|| +|+-+ .....|. .||-+=..---.--|=++|| +||-|=||||| T Consensus 194 ~~DiRVSt~P----------t~fGERvVMR-LLDK~---~~~l~L~-~LGm~~~~l~~~~~li~rpHGIiLVTGPTGSGK 258 (495) T TIGR02533 194 DIDIRVSTVP----------TSFGERVVMR-LLDKD---AVRLDLE-ALGMSPELLSSLERLIKRPHGIILVTGPTGSGK 258 (495) T ss_pred EEEEEEEECC----------CCCCCEEEHH-HHCHH---HHHHHHH-HCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCH T ss_conf 4667885305----------8997100000-11204---7777588-648888899999999718896188417789852 Q ss_pred HHHHHHH Q ss_conf 7999999 Q gi|254780799|r 468 SVAINTM 474 (806) Q Consensus 468 SV~iN~i 474 (806) |--+=+= T Consensus 259 tTTLYaa 265 (495) T TIGR02533 259 TTTLYAA 265 (495) T ss_pred HHHHHHH T ss_conf 5889999 No 63 >KOG2655 consensus Probab=93.15 E-value=0.062 Score=34.14 Aligned_cols=193 Identities=20% Similarity=0.328 Sum_probs=95.4 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHC--------CHHHEEEEEECCCHHHHHHCCCCHHHHC----CCCCCHHHHHHHH Q ss_conf 235530477406799999999999829--------9578478885231001110277034312----2334304566899 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRM--------TPAQCRLIMIDPKMLELSVYDGIPNLLT----PVVTNPQKAVTVL 523 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~--------~P~evkliliDPK~vEls~Y~~iPHLl~----pVvTd~~kA~~aL 523 (806) -|+++|.-|+|||-+|||+...-||.. +|.+ -+-|+..++++--= |..--|| |=.-|- .-+.. T Consensus 23 tlmvvG~sGlGKsTfiNsLf~~~l~~~~~~~~~~~~~~~--t~~i~~~~~~iee~-g~~l~LtvidtPGfGD~--vdns~ 97 (366) T KOG2655 23 TLMVVGESGLGKSTFINSLFLTDLSGNREVPGASERIKE--TVEIESTKVEIEEN-GVKLNLTVIDTPGFGDA--VDNSN 97 (366) T ss_pred EEEEECCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCC--CCEEEEEEEEECCC-CEEEEEEEECCCCCCCC--CCCCC T ss_conf 899855887638899988886532577666775567655--31132232376379-74887678436998654--46512 Q ss_pred HH--HHHHHHHHHHHHH--HCCCCCHHHHHHHHHHHHHCCCC-----CCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 99--9999999999998--70899689999999988744786-----677544677654543222332232346986877 Q gi|254780799|r 524 KW--LVCEMEERYQKMS--KIGVRNIDGFNLKVAQYHNTGKK-----FNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVV 594 (806) Q Consensus 524 ~w--~V~EMe~RY~l~a--~~~vRni~~yN~k~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivv 594 (806) -| +++--|.+|+..= +.+.+- ..+ ..+.. +-.|...|.+|-.-+-+ ..-.++..-|=| T Consensus 98 ~w~pi~~yi~~q~~~yl~~E~~~~R-~~~--------~D~RVH~cLYFI~P~ghgL~p~Di~~M----k~l~~~vNiIPV 164 (366) T KOG2655 98 CWRPIVNYIDSQFDQYLDEESRLNR-SKI--------KDNRVHCCLYFISPTGHGLKPLDIEFM----KKLSKKVNLIPV 164 (366) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCCC-CCC--------CCCCEEEEEEEECCCCCCCCHHHHHHH----HHHHCCCCCCCE T ss_conf 4323567899999999860236886-556--------688448999983788888868669999----977325661031 Q ss_pred EHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHH---HCCCCEEEEEECCCCCCHHHCCCCCH Q ss_conf 634468888732100588999999866414237999965777535543554---11025158764586642123388645 Q gi|254780799|r 595 VIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIK---ANFPTRISFQVSSKIDSRTILGEQGA 671 (806) Q Consensus 595 iiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ik---an~p~riaf~v~s~~dSrtild~~ga 671 (806) |.- ||.+ ...|+...=.|+-|--....|+.- +=| .|-=-.-.+ ..+=+++-|+|.+ |-|+.++.|- T Consensus 165 I~K--aD~l--T~~El~~~K~~I~~~i~~~nI~vf---~fp-~~~~d~~~~~~~~~l~~~~PFAIig---s~~~~e~~G~ 233 (366) T KOG2655 165 IAK--ADTL--TKDELNQFKKRIRQDIEEHNIKVF---DFP-TDESDEELKEEEQDLKSSIPFAIIG---SNTEIEEKGK 233 (366) T ss_pred EEC--CCCC--CHHHHHHHHHHHHHHHHHCCCCEE---CCC-CCCCHHHHHHHHHHHHHCCCEEEEE---CCCEEECCCC T ss_conf 640--5668--989999999999999887380335---699-8743566689999875429868982---4835644892 Q ss_pred HHHCCC Q ss_conf 786588 Q gi|254780799|r 672 EQLLGQ 677 (806) Q Consensus 672 e~Llg~ 677 (806) ..-.|+ T Consensus 234 ~~vrgR 239 (366) T KOG2655 234 KRVRGR 239 (366) T ss_pred EEEECE T ss_conf 465010 No 64 >TIGR00099 Cof-subfamily Cof-like hydrolase; InterPro: IPR000150 The Haloacid Dehydrogenase (HAD) superfamily includes phosphatases, phosphonatases, P-type ATPases, beta-phosphoglucomutases, phosphomannomutases, and dehalogenases, which are involved in a variety of cellular processes ranging from amino acid biosynthesis to detoxification . Proteins in this entry are mostly uncharacterised, though they form a distinct subgroup within the HAD superfamily. Members are found almost exclusively in bacteria and many species contain several paralogs, for example Escherichia coli contains a total of six proteins from this entry. Sequence similarities suggest that these enzymes are phosphatases which work on phosphorylated sugars. The YbiV (P75792 from SWISSPROT) protein from E. coli has been experimentally characterised . This enzyme catalyzes the hydrolysis of sugar phosphate to sugar and inorganic phosphate. It has a wide substrate specificity, catalyzing the hydrolysis of ribose-5-phosphate and glucose-6-phosphate most efficiently, but it is not known if these are the real substrates in vivo. The protein appears to be a monomer that contains two domains, an alpha-beta hydrolase domain that forms a Rossman fold, and an alpha-beta domain. The active site is found in a negatively charged cavity found at the interface between the two domains. ; GO: 0016787 hydrolase activity, 0008152 metabolic process. Probab=93.08 E-value=0.63 Score=26.56 Aligned_cols=88 Identities=24% Similarity=0.339 Sum_probs=63.4 Q ss_pred CCCHHHHHHHHHHHHHHHCCEEEEEEECCC--CCCCCCHHHHHCCCCEE-EEEECCCCCCHHHCCCCCHHHHCCCCCEEE Q ss_conf 210058899999986641423799996577--75355435541102515-876458664212338864578658875477 Q gi|254780799|r 606 ARKDIESAVQRLAQMARASGIHVIMATQRP--SVDVITGTIKANFPTRI-SFQVSSKIDSRTILGEQGAEQLLGQGDMLY 682 (806) Q Consensus 606 ~~~~ve~~i~rlaq~ara~GiHli~aTqrP--svdvitg~ikan~p~ri-af~v~s~~dSrtild~~gae~Llg~gdml~ 682 (806) -.++|......-=++||+.|+|++|||=|| ++..-..-|.-++.-.+ .+ +-.+|| |.+ T Consensus 14 ~~~~i~~~~~~al~~l~~~G~~~~laTGR~~~g~~~~~~~~~~~~~~~~~~~-----------i~~nGg--------~~~ 74 (270) T TIGR00099 14 DDKEISPSTKEALAKLREKGIKVVLATGRPLAGVYKELKEILKELGLDTDPY-----------ISANGG--------AVI 74 (270) T ss_pred CCCCCCHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHHCCCCCEE-----------EEECCC--------EEE T ss_conf 9875188899999999966988999818864347899999888506785568-----------761784--------899 Q ss_pred ECCCCCEEEEEECCCCHHHHHHHHHHHHHCC Q ss_conf 3689832588833489889999999997128 Q gi|254780799|r 683 MTGGGRVQRIHGPFVSDIEVEKVVSHLKTQG 713 (806) Q Consensus 683 ~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q~ 713 (806) ...++ +.=++---++.++++.+.+++++++ T Consensus 75 ~~~~~-~~~~~~~~l~~~~~~~~~~~~~~~~ 104 (270) T TIGR00099 75 DEGGG-NEILYKKPLDLDLVEELLNFLKKHG 104 (270) T ss_pred EECCC-CEEEEECCCCHHHHHHHHHHHHHCC T ss_conf 84687-0056302478889999999997348 No 65 >pfam09726 Macoilin Transmembrane protein. This entry is a highly conserved protein present in eukaryotes. Probab=92.88 E-value=0.56 Score=26.94 Aligned_cols=30 Identities=17% Similarity=0.244 Sum_probs=20.6 Q ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 999999999-999999999999999999982765 Q gi|254780799|r 73 IFADVAIQF-FGIASVFFLPPPTMWALSLLFDKK 105 (806) Q Consensus 73 ~iAd~L~~l-FG~~Ayllpl~Ll~~g~~ll~~k~ 105 (806) .++||++.| |=| +||++|++-+++=-|+.. T Consensus 41 llaDFlleFRFEf---LWPfWLllRSvyDSFKyq 71 (680) T pfam09726 41 LLADFILEFRFEY---LWPFWLLIRSVYDSFKYQ 71 (680) T ss_pred HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHC T ss_conf 9999997778988---759999999898777522 No 66 >cd01129 PulE-GspE PulE/GspE The type II secretory pathway is the main terminal branch of the general secretory pathway (GSP). It is responsible for the export the majority of Gram-negative bacterial exoenzymes and toxins. PulE is a cytoplasmic protein of the GSP, which contains an ATP binding site and a tetracysteine motif. This subgroup also includes PillB and HofB. Probab=92.79 E-value=0.15 Score=31.24 Aligned_cols=47 Identities=36% Similarity=0.684 Sum_probs=30.8 Q ss_pred EEECCCCCE--EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEE-CCCHHHHH Q ss_conf 854100202--355304774067999999999998299578478885-23100111 Q gi|254780799|r 448 IADLARMPH--LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMI-DPKMLELS 500 (806) Q Consensus 448 v~DLakMPH--LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evklili-DPK~vEls 500 (806) +-.+.+.|| +||+|.||||||--+.+++-.+ .+++ .+.+.| || ||.. T Consensus 72 l~~~~~~~~GlilitGptGSGKtTtl~a~l~~~---~~~~-~~i~tiEdP--vE~~ 121 (264) T cd01129 72 FRKLLEKPHGIILVTGPTGSGKTTTLYSALSEL---NTPE-KNIITVEDP--VEYQ 121 (264) T ss_pred HHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHH---CCCC-CEEEEEECC--CCCC T ss_conf 999970899889997899997799999999864---3688-508998676--3145 No 67 >PRK13897 type IV secretion system component VirD4; Provisional Probab=92.73 E-value=0.035 Score=35.95 Aligned_cols=65 Identities=20% Similarity=0.387 Sum_probs=44.7 Q ss_pred CCE-EEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCC--CCC--C--HHHHHCCCCEEEEEECCC Q ss_conf 986-8776344688887321005889999998664142379999657775--355--4--355411025158764586 Q gi|254780799|r 589 MPY-IVVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSV--DVI--T--GTIKANFPTRISFQVSSK 659 (806) Q Consensus 589 lp~-ivviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsv--dvi--t--g~ikan~p~riaf~v~s~ 659 (806) +|| ..++.|||+-| | .++..-.-||.| |--||++.+-.|-.+= +.= + -.|..|-..||.|..... T Consensus 403 ~p~~VLfLLDEFanL----G-kIp~fekaiA~m-rGYGIrl~lIiQslsQLk~~YG~~~a~tIl~Nc~~~I~f~~nD~ 474 (628) T PRK13897 403 EPYGVLFLMDEFPTL----G-KMEQFKTGIAYF-RGYRVRLFLIIQDTEQLKGIYEEAGMNSFLSNSTYRITFAANNI 474 (628) T ss_pred CCEEEEEEECCCCCC----C-CHHHHHHHHHHH-HCCCCEEEEEEECHHHHHHHHCCCHHHHHHHCCCEEEEECCCCH T ss_conf 874589985100214----8-748899999987-03687899999769999988573447789855863798668975 No 68 >PTZ00301 uridine kinase; Provisional Probab=92.72 E-value=0.15 Score=31.27 Aligned_cols=41 Identities=32% Similarity=0.459 Sum_probs=34.8 Q ss_pred CCE--EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECC Q ss_conf 202--35530477406799999999999829957847888523 Q gi|254780799|r 454 MPH--LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDP 494 (806) Q Consensus 454 MPH--LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDP 494 (806) ||- .-|||-+|||||-.-+.|+-.|+.+..|+.+-++.-|= T Consensus 1 m~~~iIgIaGgSgSGKTT~a~~i~~~l~~~~~~~~v~ii~~D~ 43 (210) T PTZ00301 1 MPCTVIGISGASGSGKSSLSTNIVSELMAHCGPVSIGVICEDF 43 (210) T ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCC T ss_conf 9988999968876789999999999987614998079983676 No 69 >TIGR01666 YCCS hypothetical membrane protein, TIGR01666; InterPro: IPR010019 This entry identifies a clade of sequences from gamma and beta proteobacteria. These proteins are more than 700 amino acids long and many have been annotated as putative membrane proteins. The gene from Salmonella has been annotated as a putative efflux transporter. The gene from Escherichia coli has been annotated as yccS.. Probab=92.66 E-value=0.9 Score=25.40 Aligned_cols=91 Identities=13% Similarity=0.179 Sum_probs=49.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 99999999999999999999999998276523678999999999999999999850444322333432488999999999 Q gi|254780799|r 76 DVAIQFFGIASVFFLPPPTMWALSLLFDKKIYCFSKRATAWLINILVSATFFASFSPSQSWPIQNGFGGIIGDLIIRLPF 155 (806) Q Consensus 76 d~L~~lFG~~Ayllpl~Ll~~g~~ll~~k~~~~~~~Rl~~~il~ll~~s~ll~~~~~~~~~~l~~g~GGiiG~~l~~~l~ 155 (806) -...|+||...+.+|++|.+.+..+.=--+...-..|-+.+.+.-++.+++.-.+.....|.+ ++|-.++...+ T Consensus 22 ~~~~W~~~~~~~~~PLiLG~iAaaL~DlDdRl~GRLkn~i~TLicF~Iasfsvellf~~Pwlf------a~glt~stfgF 95 (713) T TIGR01666 22 AVGIWFFDLSKLSIPLILGVIAAALVDLDDRLAGRLKNVILTLICFAIASFSVELLFGKPWLF------AVGLTVSTFGF 95 (713) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHH------HHHHHHHHHHH T ss_conf 986557620013456799999987513010121148999999999999999999984464999------99999999999 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 98646899999999999 Q gi|254780799|r 156 LFFESYPRKLGILFFQM 172 (806) Q Consensus 156 ~~lg~~g~~l~lli~~i 172 (806) ..+|..|.--..+-|+. T Consensus 96 imlGa~GqRYatIAFgs 112 (713) T TIGR01666 96 IMLGALGQRYATIAFGS 112 (713) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 99845435799999999 No 70 >TIGR01667 YCCS_YHJK integral membrane protein, YccS/YhfK family; InterPro: IPR010020 This entry identifies a clade of sequences from gamma and beta proteobacteria. These proteins are more than 700 amino acids long and many have been annotated as putative membrane proteins. The gene from Salmonella has been annotated as a putative efflux transporter. The gene from Escherichia coli has been annotated as yccS. The YccS hypothetical equivalog is found in beta and gamma proteobacteria, while the smaller YhfK group is only found in E. coli, Salmonella and Yersinia. . Probab=92.01 E-value=1.1 Score=24.84 Aligned_cols=69 Identities=12% Similarity=0.036 Sum_probs=41.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 899999999999999999999999999998276523678999999999999999999850444322333 Q gi|254780799|r 72 AIFADVAIQFFGIASVFFLPPPTMWALSLLFDKKIYCFSKRATAWLINILVSATFFASFSPSQSWPIQN 140 (806) Q Consensus 72 A~iAd~L~~lFG~~Ayllpl~Ll~~g~~ll~~k~~~~~~~Rl~~~il~ll~~s~ll~~~~~~~~~~l~~ 140 (806) -.++.++.++||...-.+|+++.+.+--+---.+...-..|-+.+.++.++.|++.-.+..+..|++.. T Consensus 18 l~~a~~~~Ww~d~~~~~~Pl~lgiiAagL~~~d~rl~GRLk~l~itl~~F~~aS~~v~Ll~~~p~~~~~ 86 (724) T TIGR01667 18 LAGAELLIWWFDLVKLLIPLILGIIAAGLDDLDDRLLGRLKNLIITLLCFSIASFLVQLLFPKPWLFVV 86 (724) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHH T ss_conf 998888754641588877899987886403600256537999999999999999999875366068999 No 71 >TIGR02163 napH_ ferredoxin-type protein, NapH/MauN family; InterPro: IPR011886 Most members of this family are the NapH protein, found next to NapG,in operons that encode the periplasmic nitrate reductase. Some species with this reductase lack NapC but accomplish electron transfer to NapAB in some other manner, likely to involve NapH, NapG, and/or some other protein. A few members of this protein are designated MauN and are found in methylamine utilization operons in species that appear to lack a periplasmic nitrate reductase.; GO: 0005506 iron ion binding, 0006118 electron transport. Probab=91.34 E-value=1.1 Score=24.84 Aligned_cols=80 Identities=16% Similarity=-0.016 Sum_probs=58.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCC--C---------CCCCCCC-----CCCCCCCCHHHHHHHHHHHHHHHHHH-- Q ss_conf 9999999999999999999999984427--0---------6896313-----68864500013467899999999999-- Q gi|254780799|r 22 SKKKMKIVAGLILLCTVFAITLALGTWD--V---------YDPSFSY-----ITLRSPKNFLGYGGAIFADVAIQFFG-- 83 (806) Q Consensus 22 ~~rrl~Ei~Gl~Li~~al~l~iSL~SYs--p---------~DPs~~~-----~s~~~v~N~~G~lGA~iAd~L~~lFG-- 83 (806) .-||+--+.-++|++++=|.-+-++.=| . +|| |-+ +...+..|.+ +||.+=-.||-+|| T Consensus 3 ~lRRl~Ql~~L~LF~~~p~ag~~iL~GNlSsS~ll~tiPLsDP-li~LQ~l~A~~~p~~~aL--iGA~iI~~FYal~ggR 79 (263) T TIGR02163 3 LLRRLVQLSILGLFLLGPYAGVWILKGNLSSSRLLGTIPLSDP-LITLQILLAGHLPDLNAL--IGALIIVAFYALFGGR 79 (263) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHEECCHHHHHHCCCCCCCCH-HHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHH T ss_conf 8999999999999987786432020201688887077586436-899999986107899999--9999999999998520 Q ss_pred -HHHHHHHHHHHHHHHHHHHCC Q ss_conf -999999999999999998276 Q gi|254780799|r 84 -IASVFFLPPPTMWALSLLFDK 104 (806) Q Consensus 84 -~~Ayllpl~Ll~~g~~ll~~k 104 (806) +-||+-|+-++.=-...+++| T Consensus 80 aFCsWVCP~N~vTD~A~wLr~K 101 (263) T TIGR02163 80 AFCSWVCPLNLVTDLAAWLRRK 101 (263) T ss_pred HEEEEECCCCCHHHHHHHHHHH T ss_conf 0245454886411258999986 No 72 >PRK04195 replication factor C large subunit; Provisional Probab=91.30 E-value=1.3 Score=24.32 Aligned_cols=20 Identities=30% Similarity=0.542 Sum_probs=9.4 Q ss_pred CEEEEEECCCCHHHHHHHHH Q ss_conf 02355304774067999999 Q gi|254780799|r 455 PHLLIAGTTGSGKSVAINTM 474 (806) Q Consensus 455 PHLLIAGtTGSGKSV~iN~i 474 (806) ++||..|-.|.||+-++.++ T Consensus 41 k~lLL~GPpGvGKTT~a~~l 60 (403) T PRK04195 41 KALLLYGPPGVGKTSLAHAL 60 (403) T ss_pred CEEEEECCCCCCHHHHHHHH T ss_conf 46998893998799999999 No 73 >COG5019 CDC3 Septin family protein [Cell division and chromosome partitioning / Cytoskeleton] Probab=91.26 E-value=0.15 Score=31.19 Aligned_cols=26 Identities=31% Similarity=0.514 Sum_probs=14.0 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHC Q ss_conf 35530477406799999999999829 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRM 482 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~ 482 (806) ++++|.-|+|||.||||+.-+-|-.. T Consensus 26 im~~G~sG~GKttfiNtL~~~~l~~~ 51 (373) T COG5019 26 IMVVGESGLGKTTFINTLFGTSLVDE 51 (373) T ss_pred EEEECCCCCCHHHHHHHHHHHHCCCC T ss_conf 99962788755578876567652577 No 74 >KOG2927 consensus Probab=91.16 E-value=1.2 Score=24.37 Aligned_cols=18 Identities=17% Similarity=0.467 Sum_probs=10.0 Q ss_pred CCCCCCCCCCHHHHHHHH Q ss_conf 432233343248899999 Q gi|254780799|r 134 QSWPIQNGFGGIIGDLII 151 (806) Q Consensus 134 ~~~~l~~g~GGiiG~~l~ 151 (806) ..|+++.|.||+++.++. T Consensus 217 gvyY~sig~~gfl~~Ilv 234 (372) T KOG2927 217 GVYYLSIGAGGFLAFILV 234 (372) T ss_pred CEEEEECCHHHHHHHHHH T ss_conf 211344031599999999 No 75 >pfam00270 DEAD DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression. Probab=91.09 E-value=1.3 Score=24.17 Aligned_cols=15 Identities=47% Similarity=0.811 Sum_probs=12.9 Q ss_pred EEEEEECCCCHHHHH Q ss_conf 235530477406799 Q gi|254780799|r 456 HLLIAGTTGSGKSVA 470 (806) Q Consensus 456 HLLIAGtTGSGKSV~ 470 (806) .++|-+.||||||.+ T Consensus 16 ~~iv~~pTGsGKT~~ 30 (167) T pfam00270 16 DVLVQAPTGSGKTLA 30 (167) T ss_pred CEEEECCCCCCHHHH T ss_conf 789988999758999 No 76 >KOG0735 consensus Probab=90.85 E-value=0.19 Score=30.54 Aligned_cols=61 Identities=20% Similarity=0.305 Sum_probs=52.1 Q ss_pred CCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCC Q ss_conf 1002023553047740679999999999982995784788852310011102770343122334 Q gi|254780799|r 451 LARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVT 514 (806) Q Consensus 451 LakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvT 514 (806) .-.-||+|++|.-|||||+-+.+++--. .++..+-++.+|=+.+-.+-.+.|-|+|.-|.+ T Consensus 428 v~~~~~Ill~G~~GsGKT~L~kal~~~~---~k~~~~hv~~v~Cs~l~~~~~e~iQk~l~~vfs 488 (952) T KOG0735 428 VFRHGNILLNGPKGSGKTNLVKALFDYY---SKDLIAHVEIVSCSTLDGSSLEKIQKFLNNVFS 488 (952) T ss_pred CCCCCCEEEECCCCCCHHHHHHHHHHHH---CCCCCEEEEEEECHHCCCHHHHHHHHHHHHHHH T ss_conf 3346618986799877769999999875---156506999975221042048999999999999 No 77 >PRK00091 miaA tRNA delta(2)-isopentenylpyrophosphate transferase; Reviewed Probab=90.82 E-value=0.12 Score=31.90 Aligned_cols=68 Identities=16% Similarity=0.339 Sum_probs=41.1 Q ss_pred CCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHH--HH---------HHCCCCHHHHCCCCCCHHHHH Q ss_conf 0020235530477406799999999999829957847888523100--11---------102770343122334304566 Q gi|254780799|r 452 ARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKML--EL---------SVYDGIPNLLTPVVTNPQKAV 520 (806) Q Consensus 452 akMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~v--El---------s~Y~~iPHLl~pVvTd~~kA~ 520 (806) -|.|=+.|+|.||||||-.-..+=- -+ ..-.|=.|--+| || .--++|||-|.=++ ||.+-- T Consensus 2 ~~~~ii~i~GpTasGKs~la~~la~--~~-----~~eIIsaDS~QvYk~l~IgTakps~~e~~~i~Hhli~~~-~~~e~~ 73 (304) T PRK00091 2 MKPKLIVLVGPTASGKTALAIELAK--RL-----NGEIISVDSMQVYRGMDIGTAKPTAEELAGVPHHLIDIL-DPTESY 73 (304) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHH--HC-----CCEEEEECHHHHHCCCCEEECCCCHHHHCCCCCEEEEEE-CCCCCE T ss_conf 9997799989886589999999999--87-----998994126887499986889999999818981243456-588754 Q ss_pred HHHHHHH Q ss_conf 8999999 Q gi|254780799|r 521 TVLKWLV 527 (806) Q Consensus 521 ~aL~w~V 527 (806) ++=.|.- T Consensus 74 sv~~f~~ 80 (304) T PRK00091 74 SAADFQR 80 (304) T ss_pred EHHHHHH T ss_conf 4999999 No 78 >COG5008 PilU Tfp pilus assembly protein, ATPase PilU [Cell motility and secretion / Intracellular trafficking and secretion] Probab=90.82 E-value=0.16 Score=31.01 Aligned_cols=100 Identities=27% Similarity=0.382 Sum_probs=59.3 Q ss_pred HHHHHHHHCCCCEEEEEEEECCEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCC-C-CCCCCCCEEEEEECCCCCCE Q ss_conf 999997424863289984104424444432147863999997889999886300121-0-00388612555302456623 Q gi|254780799|r 338 TLKSVLSDFGIQGEIVNVRPGPVITLYELEPAPGIKSSRIIGLSDDIARSMSAISAR-V-AVIPRRNAIGIELPNDIRET 415 (806) Q Consensus 338 lLE~tL~dFGVe~~Vv~v~pGPVVTrYEi~PApGVKvSKI~nLadDIA~aLsa~svR-I-apIPGK~~VGIEiPN~~r~~ 415 (806) +=++-..+|.=..+..=+..-|..+||-+. -. ..|.+...-.| | -.||- T Consensus 59 M~~~q~~eF~~~~EcNfai~~~~~gRfRvn--------Af------~qr~~~g~VlRrI~~~IPt--------------- 109 (375) T COG5008 59 MSAKQREEFEETHECNFAISARDIGRFRVN--------AF------YQRGLAGLVLRRIETKIPT--------------- 109 (375) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCCEEEEE--------HH------HHCCCCEEEEEEECCCCCC--------------- T ss_conf 649999998752652279974888607740--------66------6477740156440115884--------------- Q ss_pred EEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEE Q ss_conf 873423072134302210002104566666678541002023553047740679999999999982995784788 Q gi|254780799|r 416 VMLRDLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLI 490 (806) Q Consensus 416 V~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkli 490 (806) |+ +-+||=.| ||. -++|=--++|-|+||||||-.+-+||- ||++-.-=-.| T Consensus 110 -----------~e--eL~LPevl-k~l-------a~~kRGLviiVGaTGSGKSTtmAaMi~---yRN~~s~gHIi 160 (375) T COG5008 110 -----------FE--ELKLPEVL-KDL-------ALAKRGLVIIVGATGSGKSTTMAAMIG---YRNKNSTGHII 160 (375) T ss_pred -----------HH--HCCCCHHH-HHH-------HCCCCCEEEEECCCCCCCHHHHHHHHC---CCCCCCCCCEE T ss_conf -----------77--61984889-975-------204374599987788884016899860---13468877358 No 79 >TIGR02782 TrbB_P P-type conjugative transfer ATPase TrbB; InterPro: IPR014149 This entry represents TrbB, a protein, which is encoded in the trb locus of Agrobacterium Ti plasmids where it is involved in the type IV secretion system for plasmid conjugative transfer . TrbB is a homologue of the vir system VirB11 ATPase , and the Flp pilus system ATPase TadA .. Probab=90.62 E-value=0.36 Score=28.39 Aligned_cols=35 Identities=31% Similarity=0.615 Sum_probs=28.5 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEE Q ss_conf 355304774067999999999998299578478885 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMI 492 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evklili 492 (806) .||+|=|||||.--.|++|.-|=--+.|+| |+|+| T Consensus 142 Ilv~GGTGSGKTTLaNAlla~I~~l~~P~d-R~vIi 176 (315) T TIGR02782 142 ILVVGGTGSGKTTLANALLAEIAKLNDPDD-RVVII 176 (315) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHCCCCCC-EEEEE T ss_conf 899814588579999999998852169996-18998 No 80 >PRK09112 DNA polymerase III subunit delta'; Validated Probab=90.58 E-value=0.36 Score=28.38 Aligned_cols=133 Identities=21% Similarity=0.326 Sum_probs=70.3 Q ss_pred CCCCE-EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEE--ECCC-----HHHHHHCCCCHHHHCCCCCCHHHHHHHH Q ss_conf 00202-35530477406799999999999829957847888--5231-----0011102770343122334304566899 Q gi|254780799|r 452 ARMPH-LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIM--IDPK-----MLELSVYDGIPNLLTPVVTNPQKAVTVL 523 (806) Q Consensus 452 akMPH-LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evklil--iDPK-----~vEls~Y~~iPHLl~pVvTd~~kA~~aL 523 (806) .|||| +|..|.-|.||.-.-..+.-.||-...|.+..--. +||. .++-..+-++=|+--| T Consensus 42 gRl~HA~Lf~GP~GiGKaTlA~~~A~~Ll~~~~~~~~~~~~~~pd~~~~~~r~i~~g~hpdl~~i~r~------------ 109 (352) T PRK09112 42 GRLHHALLFEGPEGIGKATLAFHLANHILSHPDPNEAPETLADPDPASPLWRQIAQGAHPNLLHLTRP------------ 109 (352) T ss_pred CCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECC------------ T ss_conf 99652465358998089999999999986699866686556788878778999974899995655343------------ Q ss_pred HHHHHHHHHHHHHHHHCC---CCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHH Q ss_conf 999999999999998708---99689999999988744786677544677654543222332232346986877634468 Q gi|254780799|r 524 KWLVCEMEERYQKMSKIG---VRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMA 600 (806) Q Consensus 524 ~w~V~EMe~RY~l~a~~~---vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDEla 600 (806) ++|.+.+.+ +..+ ||++..|-.. -+. .=.|=|+||||- T Consensus 110 ---~d~k~~~~~--~~I~vd~iR~l~~~~~~-------------------~~~--------------~~~~kv~Iid~a- 150 (352) T PRK09112 110 ---FDEKTGKFK--TAITVDEIRRVTHFLSQ-------------------TSG--------------DGNWRIVIIDPA- 150 (352) T ss_pred ---CCHHHHHHH--CCCCHHHHHHHHHHHCC-------------------CCC--------------CCCEEEEEECCH- T ss_conf ---220214543--35777999999998454-------------------886--------------688069998187- Q ss_pred HHHHHCC-----CHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHH Q ss_conf 8887321-----005889999998664142379999657775355435541 Q gi|254780799|r 601 DLMMVAR-----KDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKA 646 (806) Q Consensus 601 Dlmm~~~-----~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ika 646 (806) |.|-... |-.|++= .....||.|..|+ -|..+|++ T Consensus 151 d~m~~~aaNALLK~LEEPp---------~~~~fiLit~~~~--~ll~TI~S 190 (352) T PRK09112 151 DDMNRNAANAILKTLEEPP---------ARALFILISHSSG--RLLPTIRS 190 (352) T ss_pred HHHHHHHHHHHHHHHHCCC---------CCEEEEEEECCHH--HCHHHHHH T ss_conf 8746999999999853489---------8748998869977--77689997 No 81 >KOG2568 consensus Probab=90.54 E-value=1.5 Score=23.83 Aligned_cols=73 Identities=16% Similarity=0.066 Sum_probs=43.6 Q ss_pred HHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 44270689631368864500013467899999999999999999999999999998276523678999999999 Q gi|254780799|r 46 GTWDVYDPSFSYITLRSPKNFLGYGGAIFADVAIQFFGIASVFFLPPPTMWALSLLFDKKIYCFSKRATAWLIN 119 (806) Q Consensus 46 ~SYsp~DPs~~~~s~~~v~N~~G~lGA~iAd~L~~lFG~~Ayllpl~Ll~~g~~ll~~k~~~~~~~Rl~~~il~ 119 (806) .+.+++.+++...+...-.|..|.+-|.-+ .++.++|+.+..-.++.++|.++.+++++.....-+.+..++. T Consensus 171 ~~~~~s~~~~~~~~~v~wkNpyGyL~a~~~-Plm~fy~~m~laYvllgllW~~~~~~y~~diL~lQ~~I~~Vi~ 243 (518) T KOG2568 171 ISCDSSLESYKATGSVNWKNPYGYLPASDA-PLMPFYGFMCLAYVLLGLLWFFQCAQYWHDILPLQKYITAVIA 243 (518) T ss_pred EEECCCCCCCCCCCEEEEECCCCCCCHHHC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 850676555531114898889898683435-2104999999999999999999999999888999999999999 No 82 >pfam07693 KAP_NTPase KAP family P-loop domain. The KAP (after Kidins220/ARMS and PifA) family of predicted NTPases are sporadically distributed across a wide phylogenetic range in bacteria and in animals. Many of the prokaryotic KAP NTPases are encoded in plasmids and tend to undergo disruption to form pseudogenes. A unique feature of all eukaryotic and certain bacterial KAP NTPases is the presence of two or four transmembrane helices inserted into the P-loop NTPase domain. These transmembrane helices anchor KAP NTPases in the membrane such that the P-loop domain is located on the intracellular side. Probab=90.54 E-value=1.5 Score=23.82 Aligned_cols=54 Identities=19% Similarity=0.338 Sum_probs=42.4 Q ss_pred EEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCC Q ss_conf 687763446888873210058899999986641423799996577753554355411025 Q gi|254780799|r 591 YIVVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPT 650 (806) Q Consensus 591 ~ivviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~ 650 (806) -|||+||+|==+ .+.++-+..-.+=-.+.--+++-|||+.+ ++|..-++.++++ T Consensus 161 ~iVviIDDLDRc---~p~~~v~~Le~Ik~~~d~~n~vfVLa~D~---~~v~~al~~~~~~ 214 (301) T pfam07693 161 RIVVIIDDLDRC---EPEEAVELLEAVRLLFDFPNVVFILAYDE---EIIKKAIEHNYGG 214 (301) T ss_pred CEEEEECCCCCC---CHHHHHHHHHHHHHHHCCCCEEEEEECCH---HHHHHHHHHHCCC T ss_conf 789997365548---87899999999999726798189997589---9999999987387 No 83 >TIGR01271 CFTR_protein cystic fibrosis transmembrane conductor regulator (CFTR); InterPro: IPR005291 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). These proteins are integral membrane proteins and they are involved in the transport of chloride ions. Many of these proteins are the cystis fibrosis transmembrane conductor regulators (CFTR) in eukaryotes. The principal role of this protein is chloride ion conductance. The protein is predicted to consist of 12 transmembrane domains. Mutations or lesions in the genetic loci have been linked to the aetiology of asthma, bronchiectasis, chronic obstructive pulmonary disease etc. Disease-causing mutations have been studied by 36Cl efflux assays in vitro cell cultures and electrophysiology, all of which point to the impairment of chloride channel stability and not the biosynthetic processing per se.; GO: 0005254 chloride channel activity, 0006811 ion transport, 0016020 membrane. Probab=90.51 E-value=0.14 Score=31.39 Aligned_cols=29 Identities=34% Similarity=0.571 Sum_probs=19.0 Q ss_pred CCCEEEEEC----CCCCEEEEEECCCCHHHHHH Q ss_conf 666678541----00202355304774067999 Q gi|254780799|r 443 EGKPIIADL----ARMPHLLIAGTTGSGKSVAI 471 (806) Q Consensus 443 ~G~pvv~DL----akMPHLLIAGtTGSGKSV~i 471 (806) .+.||.-|. .|=-=|=|||+||||||.-+ T Consensus 437 ~vTPVLK~I~~~lEkG~lLAvAGSTGsGKsSLL 469 (1534) T TIGR01271 437 YVTPVLKNISFKLEKGQLLAVAGSTGSGKSSLL 469 (1534) T ss_pred HCCCEEEECCEEEECCCEEEEECCCCCHHHHHH T ss_conf 116433401014304415787436775078999 No 84 >cd03225 ABC_cobalt_CbiO_domain1 Domain I of the ABC component of a cobalt transport family found in bacteria, archaea, and eukaryota. The transition metal cobalt is an essential component of many enzymes and must be transported into cells in appropriate amounts when needed. This ABC transport system of the CbiMNQO family is involved in cobalt transport in association with the cobalamin (vitamin B12) biosynthetic pathways. Most of cobalt (Cbi) transport systems possess a separate CbiN component, the cobalt-binding periplasmic protein, and they are encoded by the conserved gene cluster cbiMNQO. Both the CbiM and CbiQ proteins are integral cytoplasmic membrane proteins, and the CbiO protein has the linker peptide and the Walker A and B motifs commonly found in the ATPase components of the ABC-type transport systems. Probab=90.50 E-value=0.93 Score=25.32 Aligned_cols=151 Identities=18% Similarity=0.211 Sum_probs=74.4 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCH-------------------HHEEEEEECCC--HHHHHHCCCCHH Q ss_conf 541002023553047740679999999999982995-------------------78478885231--001110277034 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTP-------------------AQCRLIMIDPK--MLELSVYDGIPN 507 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P-------------------~evkliliDPK--~vEls~Y~~iPH 507 (806) .++.+==.+-|.|..|||||-.++.|.- + ++.+- ..+-++.=||. -..+++++.|-. T Consensus 22 l~i~~Gei~~iiG~nGaGKSTLlk~i~G-l-~~p~~G~I~~~g~~i~~~~~~~~~~~ig~v~Q~p~~~~~~~tv~e~i~~ 99 (211) T cd03225 22 LTIKKGEFVLIVGPNGSGKSTLLRLLNG-L-LGPTSGEVLVDGKDLTKLSLKELRRKVGLVFQNPDDQFFGPTVEEEVAF 99 (211) T ss_pred EEECCCCEEEEECCCCCCHHHHHHHHHC-C-CCCCCCCEEECCEECCCCCHHHHHCCEEEEEECCHHHCCCCCHHHHHHH T ss_conf 8884997999988999989999999964-6-7798887789999999799899840389997783253055869999999 Q ss_pred HHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH-HHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 3122334304566899999999999999998708996899999-999887447866775446776545432223322323 Q gi|254780799|r 508 LLTPVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNL-KVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDF 586 (806) Q Consensus 508 Ll~pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~-k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 586 (806) -+..-=.+.+++.....++..++.- .-+++-.++++.|=++ |+.-|++- T Consensus 100 ~~~~~~~~~~~~~~~~~~~l~~~~l--~~~~~~~~~~LSGGqkQrv~iAral---------------------------- 149 (211) T cd03225 100 GLENLGLPEEEIEERVEEALELVGL--EGLRDRSPFTLSGGQKQRVAIAGVL---------------------------- 149 (211) T ss_pred HHHHCCCCHHHHHHHHHHHHHHCCC--HHHHHCCHHHCCHHHHHHHHHHHHH---------------------------- T ss_conf 9998699999999999999998699--4666389545999899999999999---------------------------- Q ss_pred CCCCEEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 4698687763446-8888732100588999999866414237999965777 Q gi|254780799|r 587 QHMPYIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 587 ~~lp~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) -.=|- |++.||= +-|=..+..++...|.+|++ -|+-.|+.|.++. T Consensus 150 ~~~P~-ililDEPTsgLD~~~~~~i~~~l~~l~~----~g~tii~itHdl~ 195 (211) T cd03225 150 AMDPD-ILLLDEPTAGLDPAGRRELLELLKKLKA----EGKTIIIVTHDLD 195 (211) T ss_pred HCCCC-EEEEECCCCCCCHHHHHHHHHHHHHHHH----CCCEEEEEECCHH T ss_conf 75999-9999798555899999999999999997----8999999925999 No 85 >PRK08233 hypothetical protein; Provisional Probab=90.29 E-value=0.23 Score=29.90 Aligned_cols=22 Identities=27% Similarity=0.478 Sum_probs=19.3 Q ss_pred EEEEECCCCHHHHHHHHHHHHH Q ss_conf 3553047740679999999999 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSL 478 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~Sl 478 (806) +-|||.||||||--.|++.-.+ T Consensus 6 IgIaGgSgSGKTtla~~l~~~l 27 (182) T PRK08233 6 ITIAAVSGGGKTTLTERLTHKL 27 (182) T ss_pred EEEECCCCCCHHHHHHHHHHHC T ss_conf 9996888678999999999974 No 86 >cd02025 PanK Pantothenate kinase (PanK) catalyzes the phosphorylation of pantothenic acid to form 4'-phosphopantothenic, which is the first of five steps in coenzyme A (CoA) biosynthetic pathway. The reaction carried out by this enzyme is a key regulatory point in CoA biosynthesis. Probab=90.25 E-value=0.34 Score=28.60 Aligned_cols=35 Identities=34% Similarity=0.455 Sum_probs=22.0 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHCCH-HHEEEEEEC Q ss_conf 553047740679999999999982995-784788852 Q gi|254780799|r 458 LIAGTTGSGKSVAINTMILSLLYRMTP-AQCRLIMID 493 (806) Q Consensus 458 LIAGtTGSGKSV~iN~iI~SlLyk~~P-~evkliliD 493 (806) =|||.||||||-.-+ .|..+|-+..+ ..|-+|=-| T Consensus 3 GIaG~sgSGKST~a~-~l~~~l~~~~~~~~v~ii~~D 38 (220) T cd02025 3 GIAGSVAVGKSTTAR-VLQALLSRWPDHPNVELITTD 38 (220) T ss_pred EEECCCCCCHHHHHH-HHHHHHHHCCCCCCEEEEECC T ss_conf 978899877999999-999986002699948999787 No 87 >PRK01077 cobyrinic acid a,c-diamide synthase; Validated Probab=90.02 E-value=0.68 Score=26.33 Aligned_cols=55 Identities=24% Similarity=0.376 Sum_probs=27.5 Q ss_pred CCHHHCCCCCHH----------HHCCCCCEEEECCCCCEEEEEECCCCHHH--HHHHHHHHHHCCCCCCC Q ss_conf 421233886457----------86588754773689832588833489889--99999999712897421 Q gi|254780799|r 661 DSRTILGEQGAE----------QLLGQGDMLYMTGGGRVQRIHGPFVSDIE--VEKVVSHLKTQGEAKYI 718 (806) Q Consensus 661 dSrtild~~gae----------~Llg~gdml~~~~~~~~~r~~g~~v~~~e--v~~v~~~~~~q~~~~y~ 718 (806) |.-.-|.+.||| .|-..=|.||+|+| .|. +|..=.++.. -+.|.++. .+|.|-|- T Consensus 262 enl~~L~~~Gael~~fSPl~d~~lP~~~D~l~lpGG-yPE-l~a~~L~~n~~l~~~I~~~~-~~G~pi~a 328 (451) T PRK01077 262 DNLELLRAAGAELVFFSPLADEALPDDCDALYLGGG-YPE-LFAAELAANTSMRASIRAAA-AAGKPIYA 328 (451) T ss_pred HHHHHHHHCCCEEEEECCCCCCCCCCCCCEEEECCC-CHH-HHHHHHHHHHHHHHHHHHHH-HCCCCEEE T ss_conf 679999978988999577678889866788995798-502-43999987089999999999-86997899 No 88 >PRK10436 hypothetical protein; Provisional Probab=89.89 E-value=0.49 Score=27.41 Aligned_cols=112 Identities=27% Similarity=0.377 Sum_probs=62.2 Q ss_pred HHHHHHHHH-HHHHHHHHCCCCCC-CCCCCCEEEEE---ECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEE Q ss_conf 999997889-99988630012100-03886125553---02456623873423072134302210002104566666678 Q gi|254780799|r 374 SSRIIGLSD-DIARSMSAISARVA-VIPRRNAIGIE---LPNDIRETVMLRDLIVSRVFEKNQCDLAINLGKSIEGKPII 448 (806) Q Consensus 374 vSKI~nLad-DIA~aLsa~svRIa-pIPGK~~VGIE---iPN~~r~~V~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv 448 (806) +|||..+|+ |||-.-...+=|+. ...|+. +-+- +|-..-|.|.+|=+-.+.. ...|. -||-+=.-.-.+ T Consensus 134 ~sriK~la~ldiae~r~PQdGr~~~~~~~~~-id~Rvst~Pt~~GE~ivlRlL~~~~~----~~~L~-~LG~~~~~~~~~ 207 (461) T PRK10436 134 TARLKVLGNLDIAEHRLPQDGQFTVELAGNA-YSFRIATLPCRGGEKVVLRLLQQVQQ----TLDLE-TLGMTPAQLAQF 207 (461) T ss_pred HHHHHHHCCCCHHHCCCCCCCCEEEEECCEE-EEEEEEECCCCCCCEEEEEEECCCCC----CCCHH-HHCCCHHHHHHH T ss_conf 9999997499812055687782799889968-99999933467887789995246656----68887-848899999999 Q ss_pred EECCCCCE--EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEE-CCC Q ss_conf 54100202--355304774067999999999998299578478885-231 Q gi|254780799|r 449 ADLARMPH--LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMI-DPK 495 (806) Q Consensus 449 ~DLakMPH--LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evklili-DPK 495 (806) -.+.+-|| +||.|.||||||.-+-+++-.+ .+| +.+.+-| ||= T Consensus 208 ~~~~~~p~GliLvtGPTGSGKTTTLya~L~~l---~~~-~~~I~TiEDPV 253 (461) T PRK10436 208 RQALQQPQGLVLVTGPTGSGKTVTLYSALQTL---NTA-QINICSVEDPV 253 (461) T ss_pred HHHHHCCCCEEEEECCCCCCHHHHHHHHHHHH---CCC-CCEEEEEECCC T ss_conf 99983899779997899995699999999743---467-71699960774 No 89 >COG2401 ABC-type ATPase fused to a predicted acetyltransferase domain [General function prediction only] Probab=89.76 E-value=0.46 Score=27.62 Aligned_cols=170 Identities=24% Similarity=0.345 Sum_probs=85.8 Q ss_pred HHCCCCCCCCEEEEE--CCCCCEEE--EEECCCCHHHHHHHHHHHHHHHH-----CCHHHEEEEE-------ECCCHHHH Q ss_conf 210456666667854--10020235--53047740679999999999982-----9957847888-------52310011 Q gi|254780799|r 436 INLGKSIEGKPIIAD--LARMPHLL--IAGTTGSGKSVAINTMILSLLYR-----MTPAQCRLIM-------IDPKMLEL 499 (806) Q Consensus 436 iaLGKdI~G~pvv~D--LakMPHLL--IAGtTGSGKSV~iN~iI~SlLyk-----~~P~evklil-------iDPK~vEl 499 (806) ++++.+.--.++.-| |.-=||=+ |-|+.||||++.+- ||+.--.. ..||-=+.=+ -=|+-+|. T Consensus 387 FGv~~r~ieryvlr~vNL~ikpGdvvaVvGqSGaGKttllR-mi~G~~~~~~ee~y~p~sg~v~~p~nt~~a~iPge~Ep 465 (593) T COG2401 387 FGVRQRVIERYVLRNLNLEIKPGDVVAVVGQSGAGKTTLLR-MILGAQKGRGEEKYRPDSGKVEVPKNTVSALIPGEYEP 465 (593) T ss_pred HCCHHEEEEEEEEECEEEEECCCCEEEEEECCCCCCCHHHH-HHHHHHHCCCCCCCCCCCCCEECCCCCHHHCCCCCCCC T ss_conf 43012145566640203686478768999248877311999-99877643562024787772103443132106765554 Q ss_pred HHC--CCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH-HHHHCCCCCHHHH--HHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 102--770343122334304566899999999999999-9987089968999--99999887447866775446776545 Q gi|254780799|r 500 SVY--DGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQ-KMSKIGVRNIDGF--NLKVAQYHNTGKKFNRTVQTGFDRKT 574 (806) Q Consensus 500 s~Y--~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~-l~a~~~vRni~~y--N~k~~~~~~~~~~~~~~~~~~~~~~~ 574 (806) +.= +=|-||- -+.-|+.-|+..|+-+=--=+-+|+ -|+++.. || |.|+.++.+++ T Consensus 466 ~f~~~tilehl~-s~tGD~~~AveILnraGlsDAvlyRr~f~ELSt----GQKeR~KLAkllaer--------------- 525 (593) T COG2401 466 EFGEVTILEHLR-SKTGDLNAAVEILNRAGLSDAVLYRRKFSELST----GQKERAKLAKLLAER--------------- 525 (593) T ss_pred CCCCHHHHHHHH-HCCCCHHHHHHHHHHHCCCHHHHHHCCHHHCCC----CHHHHHHHHHHHHCC--------------- T ss_conf 457311899875-236863678999976045305430046755385----457778999997348--------------- Q ss_pred CCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCHHHHHHHHHH----HHHHHCCEEEEEEECCCCCCCCCHHHHHCCCC Q ss_conf 43222332232346986877634468888732100588999999----86641423799996577753554355411025 Q gi|254780799|r 575 GEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKDIESAVQRLA----QMARASGIHVIMATQRPSVDVITGTIKANFPT 650 (806) Q Consensus 575 ~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~ve~~i~rla----q~ara~GiHli~aTqrPsvdvitg~ikan~p~ 650 (806) | =|.+||||+-+.- |..++|+| -+||-+||-||++|.||. +++|==|- T Consensus 526 ---------------p-n~~~iDEF~AhLD------~~TA~rVArkiselARe~giTlivvThrpE------v~~AL~PD 577 (593) T COG2401 526 ---------------P-NVLLIDEFAAHLD------ELTAVRVARKISELAREAGITLIVVTHRPE------VGNALRPD 577 (593) T ss_pred ---------------C-CCEEHHHHHHHCC------HHHHHHHHHHHHHHHHHHCCEEEEEECCHH------HHHCCCCC T ss_conf ---------------9-8177356664317------799999999999999970973999964877------77444875 Q ss_pred EEEE Q ss_conf 1587 Q gi|254780799|r 651 RISF 654 (806) Q Consensus 651 riaf 654 (806) ++-+ T Consensus 578 ~li~ 581 (593) T COG2401 578 TLIL 581 (593) T ss_pred EEEE T ss_conf 0688 No 90 >PRK07270 DNA polymerase III subunits gamma and tau; Validated Probab=89.67 E-value=0.47 Score=27.56 Aligned_cols=46 Identities=4% Similarity=0.026 Sum_probs=23.1 Q ss_pred EEECCCCHHHHHHH-HHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 88334898899999-9999712897421100124556667788888777703899999999 Q gi|254780799|r 692 IHGPFVSDIEVEKV-VSHLKTQGEAKYIDIKDKILLNEEMRFSENSSVADDLYKQAVDIVL 751 (806) Q Consensus 692 ~~g~~v~~~ev~~v-~~~~~~q~~~~y~~~~~~~~~~~~~~~~~~~~~~d~l~~~a~~~v~ 751 (806) .|=.+|.+++=..| .+|+.+..... .......+.|++-.||.++.- T Consensus 503 ~~iv~v~~~~W~~ir~ef~~~~~~~~--------------~~~~~~~~e~~~v~~a~~lfg 549 (557) T PRK07270 503 PNILAVPREDFQEIRAEFAAKLKSQK--------------DSEQEEEEEEEDIPEGFEFLA 549 (557) T ss_pred CCEEEEEHHHHHHHHHHHHHHHHCCC--------------CCCCCCCCCCCHHHHHHHHHC T ss_conf 70898308989999999998733367--------------543355566602899999842 No 91 >TIGR02524 dot_icm_DotB Dot/Icm secretion system ATPase DotB; InterPro: IPR013363 Proteins in this entry are the DotB component of Dot/Icm secretion systems, as found in the obligate intracellular pathogens Legionella pneumophila and Coxiella burnetii. While this system resembles type IV secretion systems and has been called a form of type IV, the literature now seems to favor calling this the Dot/Icm system. This family is most closely related to the TraJ proteins of plasmid transfer (IPR013364 from INTERPRO), rather than to proteins of other type IV secretion systems.. Probab=89.53 E-value=0.34 Score=28.55 Aligned_cols=124 Identities=24% Similarity=0.360 Sum_probs=87.5 Q ss_pred HHCCCCEEEEEEEECCEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCC---C-CCCCCCEEEEEECCCCCCEEEEH Q ss_conf 4248632899841044244444321478639999978899998863001210---0-03886125553024566238734 Q gi|254780799|r 344 SDFGIQGEIVNVRPGPVITLYELEPAPGIKSSRIIGLSDDIARSMSAISARV---A-VIPRRNAIGIELPNDIRETVMLR 419 (806) Q Consensus 344 ~dFGVe~~Vv~v~pGPVVTrYEi~PApGVKvSKI~nLadDIA~aLsa~svRI---a-pIPGK~~VGIEiPN~~r~~V~lr 419 (806) +=||=++.-.=..---+=|-||+.|..|+|-. -|| + -+-|...|- +.+| T Consensus 56 ~iyGPn~ttq~~~G~didthye~rPnr~~r~r-----------------yr~n~t~C~v~Gh~~iq----------it~r 108 (358) T TIGR02524 56 AIYGPNATTQLLRGKDIDTHYEIRPNRNVRYR-----------------YRVNATACLVEGHEAIQ----------ITLR 108 (358) T ss_pred HHCCCCCCEEEEECCCCCCCCCCCCCCCCEEE-----------------EEEECEEEEEECCCCEE----------EEEE T ss_conf 52187521123204522210000467775046-----------------78704044520555047----------7876 Q ss_pred HHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHH Q ss_conf 23072134302210002104566666678541002023553047740679999999999982995784788852310011 Q gi|254780799|r 420 DLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLEL 499 (806) Q Consensus 420 eil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEl 499 (806) -|=..+. .-++..||-++=..|+-+-= =.-|.|+||||||..+-+||--|. -+||--|-||-=---+|+ T Consensus 109 ~iP~~PP-~ls~~~lP~~i~~aiaPqeG--------~v~~tGatGsGkstlla~iirel~--e~~ds~rk~ltye~Pie~ 177 (358) T TIGR02524 109 AIPAEPP-KLSKLDLPDAIVEAIAPQEG--------VVYVTGATGSGKSTLLAAIIRELI--EASDSNRKVLTYEAPIEY 177 (358) T ss_pred ECCCCCC-CCCCCCCHHHHHHHHCCCCC--------EEEEECCCCCCHHHHHHHHHHHHH--HCCCCCCEEEEECCCHHH T ss_conf 4279887-44311341799985178886--------699862677636889999999986--067656414642263233 Q ss_pred HHCCCCH Q ss_conf 1027703 Q gi|254780799|r 500 SVYDGIP 506 (806) Q Consensus 500 s~Y~~iP 506 (806) +|+.|- T Consensus 178 -vyd~~~ 183 (358) T TIGR02524 178 -VYDELE 183 (358) T ss_pred -HHHCCC T ss_conf -221101 No 92 >TIGR02525 plasmid_TraJ plasmid transfer ATPase TraJ; InterPro: IPR013364 Proteins in this entry are predicted ATPases associated with plasmid transfer loci in bacteria. This family is most similar to the DotB ATPase (IPR013363 from INTERPRO) of a type-IV secretion-like system of the obligate intracellular pathogens Legionella pneumophila and Coxiella burnetii.. Probab=89.44 E-value=0.21 Score=30.12 Aligned_cols=74 Identities=36% Similarity=0.525 Sum_probs=45.7 Q ss_pred HHHHHHHHHHHHHHCCCCC--CCCCCCCEEEEEECCCCCCEEEEHHHCCCHHHHHCCCCCHH-HCCCCCCCCEEEEECCC Q ss_conf 9978899998863001210--00388612555302456623873423072134302210002-10456666667854100 Q gi|254780799|r 377 IIGLSDDIARSMSAISARV--AVIPRRNAIGIELPNDIRETVMLRDLIVSRVFEKNQCDLAI-NLGKSIEGKPIIADLAR 453 (806) Q Consensus 377 I~nLadDIA~aLsa~svRI--apIPGK~~VGIEiPN~~r~~V~lreil~s~~f~~s~~~L~i-aLGKdI~G~pvv~DLak 453 (806) |-.+..=||+-| || ..||-=..+||| .|+++| -||. +|| T Consensus 111 ~~~~~~AislTl-----Rvips~IP~L~~MgiE-----------~DLf~a--------lLP~~GLG-------------- 152 (374) T TIGR02525 111 IGKLETAISLTL-----RVIPSDIPDLKKMGIE-----------PDLFEA--------LLPKAGLG-------------- 152 (374) T ss_pred HHHHHHHHHHHH-----HHCCCCCCCHHHCCCC-----------HHHHHH--------HCCCCCCE-------------- T ss_conf 555444443442-----0055666513234787-----------689987--------05003780-------------- Q ss_pred CCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEE-CC Q ss_conf 202355304774067999999999998299578478885-23 Q gi|254780799|r 454 MPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMI-DP 494 (806) Q Consensus 454 MPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evklili-DP 494 (806) ||.|-||||||--+=+|==-. -+..|| =|+|=+ || T Consensus 153 ----LiCG~TGSGKSTl~AaiY~~~-l~t~pd-RKivT~EDP 188 (374) T TIGR02525 153 ----LICGETGSGKSTLAAAIYRHC-LETYPD-RKIVTYEDP 188 (374) T ss_pred ----EECCCCCCCHHHHHHHHHHHH-CCCCCC-CEEEEEECC T ss_conf ----221778972899999999985-074889-707986577 No 93 >COG1474 CDC6 Cdc6-related protein, AAA superfamily ATPase [DNA replication, recombination, and repair / Posttranslational modification, protein turnover, chaperones] Probab=89.01 E-value=1.7 Score=23.30 Aligned_cols=164 Identities=16% Similarity=0.218 Sum_probs=77.7 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 23553047740679999999999982995784788852310011102770343122334304566899999999999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQ 535 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~ 535 (806) +++|-|.||+||.+-++-+ +.-|-..+|... .+|= =+.....|.++...+--.+. T Consensus 44 n~~iyG~~GTGKT~~~~~v-~~~l~~~~~~~~------------~~yI-----Nc~~~~t~~~i~~~i~~~~~------- 98 (366) T COG1474 44 NIIIYGPTGTGKTATVKFV-MEELEESSANVE------------VVYI-----NCLELRTPYQVLSKILNKLG------- 98 (366) T ss_pred CEEEECCCCCCHHHHHHHH-HHHHHHHCCCCC------------EEEE-----EEEECCCHHHHHHHHHHHHC------- T ss_conf 0799889998732899999-999973315675------------7999-----51307878799999999826------- Q ss_pred HHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCHHHHHHH Q ss_conf 99870899689999999988744786677544677654543222332232346986877634468888732100588999 Q gi|254780799|r 536 KMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKDIESAVQ 615 (806) Q Consensus 536 l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~ve~~i~ 615 (806) -.-..|.+..+-|+...+...+. =.+.|||.||+.=|-=-.+ ++=-.+. T Consensus 99 ~~p~~g~~~~~~~~~l~~~~~~~------------------------------~~~~IvvLDEid~L~~~~~-~~LY~L~ 147 (366) T COG1474 99 KVPLTGDSSLEILKRLYDNLSKK------------------------------GKTVIVILDEVDALVDKDG-EVLYSLL 147 (366) T ss_pred CCCCCCCCHHHHHHHHHHHHHHC------------------------------CCEEEEEECCHHHHHCCCC-CHHHHEE T ss_conf 89976763268999999977741------------------------------8759999764765415464-1455111 Q ss_pred HHHHHHHHCCEEEEEEECCCCCCC---CCHHHHHC-CCCEEEEEECCCCCCHHHCCCCCHHHHCCCCC Q ss_conf 999866414237999965777535---54355411-02515876458664212338864578658875 Q gi|254780799|r 616 RLAQMARASGIHVIMATQRPSVDV---ITGTIKAN-FPTRISFQVSSKIDSRTILGEQGAEQLLGQGD 679 (806) Q Consensus 616 rlaq~ara~GiHli~aTqrPsvdv---itg~ikan-~p~riaf~v~s~~dSrtild~~gae~Llg~gd 679 (806) |+.+-. ..++++-.=--+.+. +-.=||.. -|.+|-|.-=+.--=++||.+ .|+..+..|. T Consensus 148 r~~~~~---~~~v~vi~i~n~~~~~~~ld~rv~s~l~~~~I~F~pY~a~el~~Il~~-R~~~~~~~~~ 211 (366) T COG1474 148 RAPGEN---KVKVSIIAVSNDDKFLDYLDPRVKSSLGPSEIVFPPYTAEELYDILRE-RVEEGFSAGV 211 (366) T ss_pred CCCCCC---CEEEEEEEEECCHHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHH-HHHHHHCCCC T ss_conf 247767---537999997354889998756676506876355289898999999999-9985404687 No 94 >COG4962 CpaF Flp pilus assembly protein, ATPase CpaF [Intracellular trafficking and secretion] Probab=88.85 E-value=0.47 Score=27.49 Aligned_cols=54 Identities=31% Similarity=0.533 Sum_probs=35.9 Q ss_pred CEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHH Q ss_conf 023553047740679999999999982995784788852310011102770343122334304 Q gi|254780799|r 455 PHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQ 517 (806) Q Consensus 455 PHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~ 517 (806) --.||||-|||||.--+|++ +..-.|+| |.|-|-=- -||.. ..||.. .-.|.|. T Consensus 174 ~NILisGGTGSGKTTlLNal----~~~i~~~e-RvItiEDt-aELql--~~ph~v-rL~TR~~ 227 (355) T COG4962 174 CNILISGGTGSGKTTLLNAL----SGFIDSDE-RVITIEDT-AELQL--AHPHVV-RLETRPP 227 (355) T ss_pred EEEEEECCCCCCHHHHHHHH----HHCCCCCC-EEEEEEEH-HHHCC--CCCCEE-EEEECCC T ss_conf 15999678788799999999----71579765-08998123-66446--998557-8863488 No 95 >cd03291 ABCC_CFTR1 The CFTR subfamily domain 1. The cystic fibrosis transmembrane regulator (CFTR), the product of the gene mutated in patients with cystic fibrosis, has adapted the ABC transporter structural motif to form a tightly regulated anion channel at the apical surface of many epithelia. Use of the term assembly of a functional ion channel implies the coming together of subunits, or at least smaller not-yet functional components of the active whole. In fact, on the basis of current knowledge only the CFTR polypeptide itself is required to form an ATP- and protein kinase A-dependent low-conductance chloride channel of the type present in the apical membrane of many epithelial cells. CFTR displays the typical organization (IM-ABC)2 and carries a characteristic hydrophilic R-domain that separates IM1-ABC1 from IM2-ABC2. Probab=88.67 E-value=0.58 Score=26.82 Aligned_cols=59 Identities=22% Similarity=0.443 Sum_probs=35.2 Q ss_pred CCCEEEEECC----CCCEEEEEECCCCHHHHHHHHHHHHHHHH-CCHHHEEEEEECCCHHHHHHCCCCHHHHC Q ss_conf 6666785410----02023553047740679999999999982-99578478885231001110277034312 Q gi|254780799|r 443 EGKPIIADLA----RMPHLLIAGTTGSGKSVAINTMILSLLYR-MTPAQCRLIMIDPKMLELSVYDGIPNLLT 510 (806) Q Consensus 443 ~G~pvv~DLa----kMPHLLIAGtTGSGKSV~iN~iI~SlLyk-~~P~evkliliDPK~vEls~Y~~iPHLl~ 510 (806) +|+||.-|+. +==.+-|-|.||||||--++.| ++ ..|++=+ |.+|-+ .+.-...|+|.. T Consensus 48 ~~~pVLk~Isf~I~~Ge~vaIVG~sGSGKSTLl~lL-----~gl~~p~~G~-I~~~g~---i~~v~Q~~~lf~ 111 (282) T cd03291 48 VGAPVLKNINLKIEKGEMLAITGSTGSGKTSLLMLI-----LGELEPSEGK-IKHSGR---ISFSSQFSWIMP 111 (282) T ss_pred CCCCEEECEEEEECCCCEEEEECCCCCHHHHHHHHH-----HCCCCCCCCE-EEECCE---EEEECCCCCCCC T ss_conf 989614164899849999999999998199999999-----5787278658-999999---998657442267 No 96 >TIGR03499 FlhF flagellar biosynthetic protein FlhF. Probab=88.39 E-value=0.81 Score=25.74 Aligned_cols=68 Identities=21% Similarity=0.372 Sum_probs=54.5 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHH----HHHHCCCCHHHHCCCCCCHHHHHHHHHH Q ss_conf 5530477406799999999999829957847888523100----1110277034312233430456689999 Q gi|254780799|r 458 LIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKML----ELSVYDGIPNLLTPVVTNPQKAVTVLKW 525 (806) Q Consensus 458 LIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~v----Els~Y~~iPHLl~pVvTd~~kA~~aL~w 525 (806) ...|.||+||.-.|==|=....++.....|-||=+|-=|+ -|..|-+|=-+=.-||++|++-..+|.- T Consensus 198 ~lvGPTGVGKTTTiAKLAa~~~l~~~~~~V~lIT~DtyRigA~eQLk~ya~il~vp~~vv~~~~~l~~~l~~ 269 (282) T TIGR03499 198 ALVGPTGVGKTTTLAKLAARFVLEHGKKKVALITTDTYRIGAVEQLKTYAKILGVPVKVARDPKELAKALER 269 (282) T ss_pred EEECCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHHHHHHCCEEEEECCHHHHHHHHHH T ss_conf 997788875788999999999997389967999807776789999999999959748993999999999986 No 97 >pfam04851 ResIII Type III restriction enzyme, res subunit. Probab=88.37 E-value=0.92 Score=25.34 Aligned_cols=40 Identities=28% Similarity=0.422 Sum_probs=28.1 Q ss_pred CCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCH Q ss_conf 02023553047740679999999999982995784788852310 Q gi|254780799|r 453 RMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKM 496 (806) Q Consensus 453 kMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~ 496 (806) +-.+-+|..+|||||+++.=.+|..++.+.+ |.+.+=|.. T Consensus 17 ~~~~~~i~~pTGsGKT~~~~~~i~~~~~~~~----~~lvlvp~~ 56 (103) T pfam04851 17 EKKRGLIVMATGSGKTLTAAKLIARLLKGKK----KVLFLVPRK 56 (103) T ss_pred HCCCEEEEECCCCCHHHHHHHHHHHHHHCCC----CEEEEECCH T ss_conf 6398699958999879999999999984699----299990829 No 98 >pfam05049 IIGP Interferon-inducible GTPase (IIGP). Interferon-inducible GTPase (IIGP) is thought to play a role in in intracellular defence. IIGP is predominantly associated with the Golgi apparatus and also localizes to the endoplasmic reticulum and exerts a distinct role in IFN-induced intracellular membrane trafficking or processing. Probab=88.31 E-value=0.28 Score=29.18 Aligned_cols=19 Identities=37% Similarity=0.726 Sum_probs=11.6 Q ss_pred EEEEEECCCCHHHHHHHHH Q ss_conf 2355304774067999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTM 474 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~i 474 (806) +.=|.|-||+|||.+||++ T Consensus 37 nIavtGesG~GkSsfINal 55 (375) T pfam05049 37 KIAVTGDSGNGKSSFINAL 55 (375) T ss_pred EEEEECCCCCCHHHHHHHH T ss_conf 7998548998678999987 No 99 >COG0630 VirB11 Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis [Cell motility and secretion / Intracellular trafficking and secretion] Probab=88.05 E-value=2.1 Score=22.72 Aligned_cols=148 Identities=18% Similarity=0.234 Sum_probs=71.2 Q ss_pred EEECCEEEEEEEEE-CCCCCHHHHHHHHHHHHHHHHHCCCC-CCCCCCCCEEEEEECCCCCCEEEEHHHCCCHH------ Q ss_conf 41044244444321-47863999997889999886300121-00038861255530245662387342307213------ Q gi|254780799|r 355 VRPGPVITLYELEP-APGIKSSRIIGLSDDIARSMSAISAR-VAVIPRRNAIGIELPNDIRETVMLRDLIVSRV------ 426 (806) Q Consensus 355 v~pGPVVTrYEi~P-ApGVKvSKI~nLadDIA~aLsa~svR-IapIPGK~~VGIEiPN~~r~~V~lreil~s~~------ 426 (806) ..+|||...+.... ......-..-.-.+.+|+.++..+-+ |. ..++.+=.-+||-.|-.+.+.+...... T Consensus 33 n~~~~v~v~~~~~~~~~~~~~~~~~~~l~~~~~~~a~~~gk~is--~~~Pi~~~~lp~g~Ri~i~~~~~v~~~~~s~~IR 110 (312) T COG0630 33 NGPGPVYVEHKGGGSYVTNIPFLTEEELDSLAIRLAQRSGKPIS--EANPILDATLPDGSRIQIVLGPEVSPNGSSFTIR 110 (312) T ss_pred CCCCCEEEEEECCCEEECCCCCCCHHHHHHHHHHHHHHCCCCCC--CCCCCEEEECCCCEEEEEECCCCCCCCCCCEEEE T ss_conf 28962899970675034255456788999999999986299667--8887168774886489884078656898866886 Q ss_pred -HHHCCCCCHHHCC-CCCCCC---EEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHH Q ss_conf -4302210002104-566666---67854100202355304774067999999999998299578478885231001110 Q gi|254780799|r 427 -FEKNQCDLAINLG-KSIEGK---PIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSV 501 (806) Q Consensus 427 -f~~s~~~L~iaLG-KdI~G~---pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~ 501 (806) |......+.-.+. ++++-. ....-+.+.-+.+|+|.|||||---+|+++ .+=|.+.|.+-|+=. =|+.. T Consensus 111 k~~~~~~t~~~l~~~gt~~~~~~ayL~~~ie~~~siii~G~t~sGKTt~lnall-----~~Ip~~~rivtIEdt-~E~~~ 184 (312) T COG0630 111 KFSDEPITPEDLIEYGTISPEQAAYLWLAIEARKSIIICGGTASGKTTLLNALL-----DFIPPEERIVTIEDT-PELKL 184 (312) T ss_pred CCCCCCCCHHHHHHCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHH-----HHCCCCCCEEEECCC-CEECC T ss_conf 266888998998543788888999999999769949998888886495999998-----637852218995255-54147 Q ss_pred C-CCCHHHHC Q ss_conf 2-77034312 Q gi|254780799|r 502 Y-DGIPNLLT 510 (806) Q Consensus 502 Y-~~iPHLl~ 510 (806) + +..-+|.+ T Consensus 185 ~~~n~~~l~~ 194 (312) T COG0630 185 PHENWVQLVT 194 (312) T ss_pred CCCCEEEEEE T ss_conf 8876589983 No 100 >pfam05802 EspB Enterobacterial EspB protein. EspB is a type-III-secreted pore-forming protein of enteropathogenic Escherichia coli (EPEC) which is essential for EPEC pathogenesis. EspB is also found in Citrobacter rodentium. Probab=87.97 E-value=1.4 Score=23.94 Aligned_cols=65 Identities=22% Similarity=0.310 Sum_probs=50.7 Q ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHH Q ss_conf 99999999999999870899689999999988744786677544677654543222332232346986877634468888 Q gi|254780799|r 524 KWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLM 603 (806) Q Consensus 524 ~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlm 603 (806) +.-..--|.||+-+..+..-|++-|.+-|+.++.. |---|-| | T Consensus 253 KSnHeinEQRaKSvEn~Qq~nle~ykqdvrr~qdd------------------------------------i~srlrd-m 295 (317) T pfam05802 253 KSNHEINEQRAKSVENFQQGNLELYKQDVRRSQDD------------------------------------IASRLRD-M 295 (317) T ss_pred CCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH------------------------------------HHHHHHH-H T ss_conf 24322438888789988730488899998774677------------------------------------8999999-9 Q ss_pred HHCCCHHHHHHHHHHHHHHHCC Q ss_conf 7321005889999998664142 Q gi|254780799|r 604 MVARKDIESAVQRLAQMARASG 625 (806) Q Consensus 604 m~~~~~ve~~i~rlaq~ara~G 625 (806) -++.+|.-+.+-|.+|-+|-|| T Consensus 296 t~aardl~dl~nrmgqs~rlag 317 (317) T pfam05802 296 TAAARDLNDLQNRMGQSARLAG 317 (317) T ss_pred HHHHHHHHHHHHHHHHHHCCCC T ss_conf 9988889999998655430379 No 101 >PRK05416 hypothetical protein; Provisional Probab=87.87 E-value=0.42 Score=27.90 Aligned_cols=32 Identities=31% Similarity=0.570 Sum_probs=26.3 Q ss_pred CCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEEC Q ss_conf 002023553047740679999999999982995784788852 Q gi|254780799|r 452 ARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMID 493 (806) Q Consensus 452 akMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliD 493 (806) ..|.=++|.|-.|||||+++|++ |+.-+--|| T Consensus 4 ~~m~lviVTGlSGAGKStAl~~L----------EDlGy~ciD 35 (292) T PRK05416 4 APMRLVIVTGLSGAGKSVALRAL----------EDLGYYCVD 35 (292) T ss_pred CCEEEEEEECCCCCCHHHHHHHH----------HHCCCEEEC T ss_conf 98689999689978799999999----------817944786 No 102 >PRK06995 flhF flagellar biosynthesis regulator FlhF; Validated Probab=87.85 E-value=1 Score=24.92 Aligned_cols=69 Identities=20% Similarity=0.310 Sum_probs=57.9 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHH----HHHHCCCCHHHHCCCCCCHHHHHHHHHH Q ss_conf 235530477406799999999999829957847888523100----1110277034312233430456689999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKML----ELSVYDGIPNLLTPVVTNPQKAVTVLKW 525 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~v----Els~Y~~iPHLl~pVvTd~~kA~~aL~w 525 (806) +.|| |-||.||.-.|==|=....+++..+.|-||=.|.=|+ -|..|-+|-.+=.-||.+|.+-..+|+- T Consensus 179 ~alV-GPTGVGKTTTiAKLAAr~~l~~g~~kVaLIT~DTYRIgAvEQLktYa~IlgvPv~vv~~~~eL~~aL~~ 251 (404) T PRK06995 179 FALV-GPTGVGKTTTTAKLAARCVMRHGASKVALLTTDSYRIGGHEQLRIYGKILGVPVHAVKDAADLRLALAE 251 (404) T ss_pred EEEE-CCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHH T ss_conf 8986-688876375899999999998389837999768754789999999998759559995999999999997 No 103 >TIGR03015 pepcterm_ATPase putative secretion ATPase, PEP-CTERM locus subfamily. Members of this protein are marked as probable ATPases by the nucleotide binding P-loop motif GXXGXGKTT, a motif DEAQ similar to the DEAD/H box of helicases, and extensive homology to ATPases of MSHA-type pilus systems and to GspA proteins associated with type II protein secretion systems. Probab=87.57 E-value=2.3 Score=22.36 Aligned_cols=37 Identities=19% Similarity=0.439 Sum_probs=25.9 Q ss_pred CEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEE-CCC Q ss_conf 02355304774067999999999998299578478885-231 Q gi|254780799|r 455 PHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMI-DPK 495 (806) Q Consensus 455 PHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evklili-DPK 495 (806) .=.+|-|--|||||.-+..++-+ ..|+.+..+.| +|. T Consensus 44 g~~lltGe~GtGKTtllr~l~~~----l~~~~~~~~~i~~~~ 81 (269) T TIGR03015 44 GFILITGEVGAGKTTLIRNLLKR----LDQERVVAAKLVNTR 81 (269) T ss_pred CEEEEECCCCCCHHHHHHHHHHH----CCCCCEEEEEECCCC T ss_conf 65999729989889999999984----593454899976999 No 104 >pfam02689 Herpes_Helicase Helicase. This family consists of Helicases from the Herpes viruses. Helicases are responsible for the unwinding of DNA and are essential for replication and completion of the viral life cycle. Probab=87.55 E-value=0.35 Score=28.48 Aligned_cols=24 Identities=13% Similarity=0.293 Sum_probs=14.8 Q ss_pred ECCCCCCEE--------EEHHHCCCHHHHHCC Q ss_conf 024566238--------734230721343022 Q gi|254780799|r 408 LPNDIRETV--------MLRDLIVSRVFEKNQ 431 (806) Q Consensus 408 iPN~~r~~V--------~lreil~s~~f~~s~ 431 (806) |-.+.|..| .|.+||+++.|-++. T Consensus 454 Vt~k~kk~v~Gf~GTf~~F~~vL~~D~Fi~~~ 485 (801) T pfam02689 454 VTGKTKKCVCGFSGTFEKFKAVLDSDSFIEAH 485 (801) T ss_pred EECCEEEEEEEEEECHHHHHHHHHHCCHHHCC T ss_conf 85133479985112289999986013223337 No 105 >COG1660 Predicted P-loop-containing kinase [General function prediction only] Probab=87.47 E-value=0.45 Score=27.67 Aligned_cols=68 Identities=25% Similarity=0.441 Sum_probs=43.1 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEEC-------CCHHHHHH--CCCCHHHHCCCCCCHHH--HHHH Q ss_conf 2023553047740679999999999982995784788852-------31001110--27703431223343045--6689 Q gi|254780799|r 454 MPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMID-------PKMLELSV--YDGIPNLLTPVVTNPQK--AVTV 522 (806) Q Consensus 454 MPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliD-------PK~vEls~--Y~~iPHLl~pVvTd~~k--A~~a 522 (806) |.-.+|-|--||||||++|++ ++.-.--|| ||+++|-- =..++++ .|++|.+- -... T Consensus 1 m~lvIVTGlSGAGKsvAl~~l----------EDlGyycvDNLPp~Llp~~~~~~~~~~~~~~kv--Av~iDiRs~~~~~~ 68 (286) T COG1660 1 MRLVIVTGLSGAGKSVALRVL----------EDLGYYCVDNLPPQLLPKLADLMLTLESRITKV--AVVIDVRSREFFGD 68 (286) T ss_pred CCEEEEECCCCCCHHHHHHHH----------HHCCEEEECCCCHHHHHHHHHHHHHCCCCCCEE--EEEEECCCCHHHHH T ss_conf 946999568887688999999----------745804535899889999999976305677527--99996145031788 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999999 Q gi|254780799|r 523 LKWLVCEMEER 533 (806) Q Consensus 523 L~w~V~EMe~R 533 (806) |.-++.|+..+ T Consensus 69 l~~~l~~l~~~ 79 (286) T COG1660 69 LEEVLDELKDN 79 (286) T ss_pred HHHHHHHHHHC T ss_conf 99999999855 No 106 >pfam04548 AIG1 AIG1 family. Arabidopsis protein AIG1 appears to be involved in plant resistance to bacteria. Probab=87.30 E-value=0.33 Score=28.70 Aligned_cols=25 Identities=28% Similarity=0.545 Sum_probs=20.0 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHH Q ss_conf 2355304774067999999999998 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLY 480 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLy 480 (806) .++.-|-||||||-..|||+=.=-+ T Consensus 2 rivLlGktG~GKSstgNtILG~~~F 26 (200) T pfam04548 2 RIVLVGKTGNGKSATGNSILGRKAF 26 (200) T ss_pred EEEEECCCCCCHHHHHHHHCCCCCC T ss_conf 8999799998436557661797533 No 107 >COG0419 SbcC ATPase involved in DNA repair [DNA replication, recombination, and repair] Probab=87.15 E-value=0.69 Score=26.29 Aligned_cols=11 Identities=36% Similarity=0.589 Sum_probs=6.3 Q ss_pred CEEEEEECCCC Q ss_conf 12555302456 Q gi|254780799|r 402 NAIGIELPNDI 412 (806) Q Consensus 402 ~~VGIEiPN~~ 412 (806) ++.|-++|+.. T Consensus 461 PvCg~~l~~~~ 471 (908) T COG0419 461 PVCGQELPEEH 471 (908) T ss_pred CCCCCCCCCHH T ss_conf 98889786077 No 108 >pfam02456 Adeno_IVa2 Adenovirus IVa2 protein. IVa2 protein can interact with the adenoviral packaging signal and that this interaction involves DNA sequences that have previously been demonstrated to be required for packaging. During the course of lytic infection, the adenovirus major late promoter (MLP) is induced to high levels after replication of viral DNA has started. IVa2 is a transcriptional activator of the major late promoter. Probab=86.91 E-value=0.42 Score=27.86 Aligned_cols=33 Identities=27% Similarity=0.462 Sum_probs=31.2 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEE Q ss_conf 553047740679999999999982995784788 Q gi|254780799|r 458 LIAGTTGSGKSVAINTMILSLLYRMTPAQCRLI 490 (806) Q Consensus 458 LIAGtTGSGKSV~iN~iI~SlLyk~~P~evkli 490 (806) .|=|-||||||-.+-.+|.+=|-.-.|+.|=|| T Consensus 91 vVYGPTG~GKSQLlRNlis~~lI~P~PETVfFI 123 (370) T pfam02456 91 VVYGPTGCGKSQLLRNLLSCQLIQPIPETVFFI 123 (370) T ss_pred EEECCCCCCHHHHHHHHHHCCCCCCCCCEEEEE T ss_conf 998899877899999987346677999728997 No 109 >cd03249 ABC_MTABC3_MDL1_MDL2 MTABC3 (also known as ABCB6) is a mitochondrial ATP-binding cassette protein involved in iron homeostasis and one of four ABC transporters expressed in the mitochondrial inner membrane, the other three being MDL1(ABC7), MDL2, and ATM1. In fact, the yeast MDL1 (multidrug resistance-like protein 1) and MDL2 (multidrug resistance-like protein 2) transporters are also included in this CD. MDL1 is an ATP-dependent permease that acts as a high-copy suppressor of ATM1 and is thought to have a role in resistance to oxidative stress. Interestingly, subfamily B is more closely related to the carboxyl-terminal component of subfamily C than the two halves of ABCC molecules are with one another. Probab=86.51 E-value=2.6 Score=21.95 Aligned_cols=179 Identities=18% Similarity=0.290 Sum_probs=82.8 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCC---------------H----HHEEEEEECCCHHHHHHCCCCHHHH Q ss_conf 54100202355304774067999999999998299---------------5----7847888523100111027703431 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMT---------------P----AQCRLIMIDPKMLELSVYDGIPNLL 509 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~---------------P----~evkliliDPK~vEls~Y~~iPHLl 509 (806) .++.+==++-|-|.||||||-.++.|. =||+.+ + ..+-++-=||-...-|+++.|- +- T Consensus 24 l~i~~G~~iaIvG~sGsGKSTLl~ll~--gl~~p~~G~I~idg~~i~~~~~~~~r~~i~~v~Q~~~lf~~Ti~eNi~-~g 100 (238) T cd03249 24 LTIPPGKTVALVGSSGCGKSTVVSLLE--RFYDPTSGEILLDGVDIRDLNLRWLRSQIGLVSQEPVLFDGTIAENIR-YG 100 (238) T ss_pred EEECCCCEEEEECCCCCCHHHHHHHHH--HCCCCCCCEEEECCEECCCCCHHHHHHCEEEECCCCEECCCCHHHHHH-CC T ss_conf 997699999999999998999999982--386188518999999923189999974069991589672752999963-36 Q ss_pred CCCCCCHHHHHHHHHHH-----HHHHHHHHHHHHHCCCCCHHHH-HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 22334304566899999-----9999999999987089968999-99999887447866775446776545432223322 Q gi|254780799|r 510 TPVVTNPQKAVTVLKWL-----VCEMEERYQKMSKIGVRNIDGF-NLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEH 583 (806) Q Consensus 510 ~pVvTd~~kA~~aL~w~-----V~EMe~RY~l~a~~~vRni~~y-N~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 583 (806) .|-.+| ++...|++-+ +..+...|+-.-..+-.|+.|= -+|+.-|++- T Consensus 101 ~~~~~~-~~~~~a~~~~~l~~~i~~lp~gl~t~ige~G~~LSgGQ~QRialARAl------------------------- 154 (238) T cd03249 101 KPDATD-EEVEEAAKKANIHDFIMSLPDGYDTLVGERGSQLSGGQKQRIAIARAL------------------------- 154 (238) T ss_pred CCCCCH-HHHHHHHHHHCCCHHHHHCCCCCCEEECCCCCCCCHHHHHHHHHHHHH------------------------- T ss_conf 878999-999999999785243551868772250786785899999999999999------------------------- Q ss_pred CCCCCCCEEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCC Q ss_conf 3234698687763446-888873210058899999986641423799996577753554355411025158764586642 Q gi|254780799|r 584 FDFQHMPYIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDS 662 (806) Q Consensus 584 ~~~~~lp~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dS 662 (806) =+=|-| +|.||- +-|=-.+.+++.+.|.++. .+--+|+.|.|++. + ..--||-+ +|. T Consensus 155 ---~~~~~i-lilDE~tsaLD~~te~~i~~~l~~~~-----~~~TvI~ItHrl~~------l--~~~D~Iiv-----l~~ 212 (238) T cd03249 155 ---LRNPKI-LLLDEATSALDAESEKLVQEALDRAM-----KGRTTIVIAHRLST------I--RNADLIAV-----LQN 212 (238) T ss_pred ---HCCCCE-EEEECCCCCCCHHHHHHHHHHHHHHH-----CCCEEEEECCCHHH------H--HHCCEEEE-----EEC T ss_conf ---659999-99978766789999999999999980-----99989998488889------9--85999999-----989 Q ss_pred HHHCCCCCHHHHCCCC Q ss_conf 1233886457865887 Q gi|254780799|r 663 RTILGEQGAEQLLGQG 678 (806) Q Consensus 663 rtild~~gae~Llg~g 678 (806) =.|..++--|.|+-++ T Consensus 213 G~Ive~Gt~~eLl~~~ 228 (238) T cd03249 213 GQVVEQGTHDELMAQK 228 (238) T ss_pred CEEEEECCHHHHHHCC T ss_conf 9999988889998788 No 110 >KOG0991 consensus Probab=86.50 E-value=0.63 Score=26.59 Aligned_cols=44 Identities=34% Similarity=0.496 Sum_probs=31.6 Q ss_pred CCCCCCEE-------EEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCC Q ss_conf 56666667-------854100202355304774067999999999998299 Q gi|254780799|r 440 KSIEGKPI-------IADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMT 483 (806) Q Consensus 440 KdI~G~pv-------v~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~ 483 (806) .||-|+-- ++-=..||||+++|--|-||.-||-++---||=.+- T Consensus 27 ~dIVGNe~tv~rl~via~~gnmP~liisGpPG~GKTTsi~~LAr~LLG~~~ 77 (333) T KOG0991 27 QDIVGNEDTVERLSVIAKEGNMPNLIISGPPGTGKTTSILCLARELLGDSY 77 (333) T ss_pred HHHHCCHHHHHHHHHHHHCCCCCCEEEECCCCCCHHHHHHHHHHHHHCHHH T ss_conf 882177989999999997289986675279998616489999999838066 No 111 >PRK04132 replication factor C small subunit; Provisional Probab=86.47 E-value=0.26 Score=29.49 Aligned_cols=78 Identities=17% Similarity=0.026 Sum_probs=40.7 Q ss_pred HHHHHHHHHHHCCHHHE--EEEEECCCHH-HHHHCCCCHHHHC--CCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH Q ss_conf 99999999982995784--7888523100-1110277034312--23343045668999999999999999987089968 Q gi|254780799|r 471 INTMILSLLYRMTPAQC--RLIMIDPKML-ELSVYDGIPNLLT--PVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNI 545 (806) Q Consensus 471 iN~iI~SlLyk~~P~ev--kliliDPK~v-Els~Y~~iPHLl~--pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni 545 (806) ++-.+-|-+. .+|-+- .++-||-|.. .|-.|.+|+-..- -..++|.-+..+-+|+=++-.|+ -..+.-+.-++ T Consensus 411 iekei~~kik-er~i~~k~~liyi~~ka~~~l~~~~~ieig~ktk~~~~ipEwiKk~~k~vkr~flR~-l~g~dg~kp~~ 488 (863) T PRK04132 411 IEKDITNKIG-GRTVKGITSSIYIDNRAFYLLLMFWGVEIGNKTKIGYKVPEWIKEGNKFVKREFLRG-LFGADGTKPNI 488 (863) T ss_pred HHHHHHHHHC-CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHCCHHHHHHCCHHHHHHHHHH-HHCCCCCCCCH T ss_conf 7877899861-443100567776400456778788776642201133204999974602677888877-64257777737 Q ss_pred HHHHH Q ss_conf 99999 Q gi|254780799|r 546 DGFNL 550 (806) Q Consensus 546 ~~yN~ 550 (806) ..||. T Consensus 489 k~~~~ 493 (863) T PRK04132 489 KKYNA 493 (863) T ss_pred HHHHH T ss_conf 76545 No 112 >PRK07667 uridine kinase; Provisional Probab=86.36 E-value=0.59 Score=26.79 Aligned_cols=20 Identities=25% Similarity=0.428 Sum_probs=18.0 Q ss_pred EEECCCCHHHHHHHHHHHHH Q ss_conf 53047740679999999999 Q gi|254780799|r 459 IAGTTGSGKSVAINTMILSL 478 (806) Q Consensus 459 IAGtTGSGKSV~iN~iI~Sl 478 (806) |||.||||||-.-|.+.-.| T Consensus 19 IaG~sgSGKTTla~~L~~~l 38 (190) T PRK07667 19 IDGLSRSGKTTFVANLKENM 38 (190) T ss_pred EECCCCCCHHHHHHHHHHHH T ss_conf 77989788999999999998 No 113 >pfam05673 DUF815 Protein of unknown function (DUF815). This family consists of several bacterial proteins of unknown function. Probab=86.08 E-value=0.77 Score=25.91 Aligned_cols=71 Identities=25% Similarity=0.324 Sum_probs=45.4 Q ss_pred CCCCEEEEHHHCCCHHHHHC--CCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEE Q ss_conf 56623873423072134302--2100021045666666785410020235530477406799999999999829957847 Q gi|254780799|r 411 DIRETVMLRDLIVSRVFEKN--QCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCR 488 (806) Q Consensus 411 ~~r~~V~lreil~s~~f~~s--~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evk 488 (806) .+.+.|.|.+++.-+.-++. +...-+.=|+ | --|.|.=|+-|.|||..|.++ |.+..++-+| T Consensus 20 ~~~d~v~l~~L~Gie~Qk~~l~~NT~~F~~G~-----p-------AnnvLLwG~RGtGKSSlVKal----l~~~~~~gLr 83 (248) T pfam05673 20 PHPDPVDLDDLVGIDRQKEALLRNTEQFLAGL-----P-------ANNVLLWGARGTGKSSLVKAL----LNEYADQGLR 83 (248) T ss_pred CCCCCCCHHHHCCHHHHHHHHHHHHHHHHCCC-----C-------CCCEEEECCCCCCHHHHHHHH----HHHHHHCCCE T ss_conf 78898998893493999999999999998089-----8-------613676768989888999999----9986314956 Q ss_pred EEEECCCHH Q ss_conf 888523100 Q gi|254780799|r 489 LIMIDPKML 497 (806) Q Consensus 489 liliDPK~v 497 (806) ||=|+..-+ T Consensus 84 lIEv~k~~L 92 (248) T pfam05673 84 LIEVDKDDL 92 (248) T ss_pred EEEECHHHH T ss_conf 999878887 No 114 >PRK04220 2-phosphoglycerate kinase; Provisional Probab=86.02 E-value=0.4 Score=28.02 Aligned_cols=125 Identities=22% Similarity=0.342 Sum_probs=73.9 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 23553047740679999999999982995784788852310011102770343122334304566899999999999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQ 535 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~ 535 (806) +.||+||+|.|||.- . -+|+-.=||+|++. || .++|+-|++- T Consensus 94 iILigGtsGvGKSTl-----A--------------------~~LA~rLgI~~vis---TD----------~IREVmR~~~ 135 (306) T PRK04220 94 IILIGGASGVGTSTI-----A--------------------FELASRLGIRSVIG---TD----------SIREVMRKII 135 (306) T ss_pred EEEEECCCCCCHHHH-----H--------------------HHHHHHHCCCEEEC---CH----------HHHHHHHHCC T ss_conf 999858998878999-----9--------------------99999709883422---21----------6999998524 Q ss_pred HHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCHHHHHHH Q ss_conf 99870899689999999988744786677544677654543222332232346986877634468888732100588999 Q gi|254780799|r 536 KMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKDIESAVQ 615 (806) Q Consensus 536 l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~ve~~i~ 615 (806) --...-.=+-.+|+.-- .+..+ .. .. |- +| +..-..+ .-|+..|- T Consensus 136 ~~el~P~Lh~SSy~Awk--------~l~~~-------~~-------~~------~~---~I---~Gf~~Q~-~~V~~gI~ 180 (306) T PRK04220 136 SKELLPTLHESSYTAWK--------SLRRP-------PW-------EE------PD---HI---LGFERHV-EPVLVGVE 180 (306) T ss_pred CCCCCCHHHCCCHHHHH--------CCCCC-------CC-------CC------HH---HH---HHHHHHH-HHHHHHHH T ss_conf 83017513227513100--------23678-------77-------86------57---99---9999999-99999999 Q ss_pred HHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCC---CCEEEEEEC Q ss_conf 999866414237999965777535543554110---251587645 Q gi|254780799|r 616 RLAQMARASGIHVIMATQRPSVDVITGTIKANF---PTRISFQVS 657 (806) Q Consensus 616 rlaq~ara~GiHli~aTqrPsvdvitg~ikan~---p~riaf~v~ 657 (806) ++-..|---|..+|+- -|-++.|+|+... |.=+-|-++ T Consensus 181 aiI~Ra~~eg~slIIE----GVHlvP~~i~~~~~~~~~vi~fll~ 221 (306) T PRK04220 181 AVIERALKEGISVIIE----GVHIVPGFIKEKYLNMPNVFMFVLT 221 (306) T ss_pred HHHHHHHHCCCCEEEE----EECCCHHHHHHHHHCCCCEEEEEEE T ss_conf 9999999729968998----4303778877776438838999999 No 115 >cd03251 ABCC_MsbA MsbA is an essential ABC transporter, closely related to eukaryotic MDR proteins. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=85.99 E-value=0.99 Score=25.11 Aligned_cols=177 Identities=21% Similarity=0.283 Sum_probs=86.1 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHH---------------------HEEEEEECCCHHHHHHCCCCHH Q ss_conf 5410020235530477406799999999999829957---------------------8478885231001110277034 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPA---------------------QCRLIMIDPKMLELSVYDGIPN 507 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~---------------------evkliliDPK~vEls~Y~~iPH 507 (806) .++.+==+.-|-|.+|||||--++.| +=-..|. .+-++.=||....-++.+.|- T Consensus 23 l~i~~Ge~i~IvG~sGsGKSTLl~ll----~gl~~p~~G~I~i~g~~i~~~~~~~~r~~i~~v~Q~~~lf~~Ti~eNI~- 97 (234) T cd03251 23 LDIPAGETVALVGPSGSGKSTLVNLI----PRFYDVDSGRILIDGHDVRDYTLASLRRQIGLVSQDVFLFNDTVAENIA- 97 (234) T ss_pred EEECCCCEEEEECCCCCHHHHHHHHH----HCCCCCCCCEEEECCEECCCCCHHHHHHCEEEECCCCEECCCCHHHHHH- T ss_conf 99879999999989998299999999----6676678868999999966089999973179993689471645999961- Q ss_pred HHCCCCCCHHHHHHHHHHH-----HHHHHHHHHHHHHCCCCCHHHHH-HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 3122334304566899999-----99999999999870899689999-99998874478667754467765454322233 Q gi|254780799|r 508 LLTPVVTNPQKAVTVLKWL-----VCEMEERYQKMSKIGVRNIDGFN-LKVAQYHNTGKKFNRTVQTGFDRKTGEAIYET 581 (806) Q Consensus 508 Ll~pVvTd~~kA~~aL~w~-----V~EMe~RY~l~a~~~vRni~~yN-~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 581 (806) +-.|-++| .+...+|+-| +..|+..|+-.-.-+-+|+.|=- +|+.-|++- T Consensus 98 ~g~~~~~~-~~i~~~l~~~~l~~~i~~l~~gl~t~i~~~g~~LSgGQ~QRi~lARal----------------------- 153 (234) T cd03251 98 YGRPGATR-EEVEEAARAANAHEFIMELPEGYDTVIGERGVKLSGGQRQRIAIARAL----------------------- 153 (234) T ss_pred CCCCCCCH-HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHH----------------------- T ss_conf 47989999-999999998634554202778875563887895899999999999999----------------------- Q ss_pred CCCCCCCCCEEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCC Q ss_conf 223234698687763446-8888732100588999999866414237999965777535543554110251587645866 Q gi|254780799|r 582 EHFDFQHMPYIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKI 660 (806) Q Consensus 582 ~~~~~~~lp~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~ 660 (806) -.=|- |+|.||- +-|=..+.+.+.+.|.++. .+--+|+-|+|++. + ..-.||-+ + T Consensus 154 -----~~~~~-iliLDEpts~LD~~~e~~i~~~l~~~~-----~~~TvI~itH~l~~------l--~~~D~Iiv-----l 209 (234) T cd03251 154 -----LKDPP-ILILDEATSALDTESERLVQAALERLM-----KNRTTFVIAHRLST------I--ENADRIVV-----L 209 (234) T ss_pred -----HCCCC-EEEEECCCCCCCHHHHHHHHHHHHHHH-----CCCEEEEEECCHHH------H--HHCCEEEE-----E T ss_conf -----64999-899968766899899999999999981-----99989999278889------9--85999999-----9 Q ss_pred CCHHHCCCCCHHHHCCCC Q ss_conf 421233886457865887 Q gi|254780799|r 661 DSRTILGEQGAEQLLGQG 678 (806) Q Consensus 661 dSrtild~~gae~Llg~g 678 (806) |.=.|..++--|.||-++ T Consensus 210 ~~G~ive~G~~~eLl~~~ 227 (234) T cd03251 210 EDGKIVERGTHEELLAQG 227 (234) T ss_pred ECCEEEEECCHHHHHHCC T ss_conf 899999988889998788 No 116 >TIGR03269 met_CoM_red_A2 methyl coenzyme M reductase system, component A2. The enzyme that catalyzes the final step in methanogenesis, methyl coenzyme M reductase, contains alpha, beta, and gamma chains. In older literature, the complex of alpha, beta, and gamma chains was termed component C, while this single chain protein was termed methyl coenzyme M reductase system component A2. Probab=85.96 E-value=2.8 Score=21.75 Aligned_cols=41 Identities=15% Similarity=0.143 Sum_probs=29.5 Q ss_pred CCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 0366135332001245556325665457899999974248632899 Q gi|254780799|r 308 VLPSKEILSTSQSPVNQMTFSPKVMQNNACTLKSVLSDFGIQGEIV 353 (806) Q Consensus 308 kLPsLdLL~~~~~~~~~~~~s~eeL~~nA~lLE~tL~dFGVe~~Vv 353 (806) .=|.+=+||+|.+ ..|..+-++..+.|.+.++++|+-.=++ T Consensus 185 ~~P~lLilDEPTa-----~LD~~~~~~l~~~l~~l~~~~g~tii~i 225 (520) T TIGR03269 185 KEPFLFLADEPTG-----TLDPQTAKLVHNALEEAVKASGISMVLT 225 (520) T ss_pred CCCCEEEEECCCC-----CCCHHHHHHHHHHHHHHHHHHCCEEEEE T ss_conf 5998999707633-----5799999999999999999839659997 No 117 >pfam03668 ATP_bind_2 P-loop ATPase protein family. This family contains an ATP-binding site and could be an ATPase (personal obs:C Yeats). Probab=85.93 E-value=0.62 Score=26.64 Aligned_cols=30 Identities=30% Similarity=0.596 Sum_probs=23.7 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEEC Q ss_conf 2023553047740679999999999982995784788852 Q gi|254780799|r 454 MPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMID 493 (806) Q Consensus 454 MPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliD 493 (806) |-=++|.|-.|||||+++|++ |+.=+--|| T Consensus 1 m~lviVTGlSGAGKStAl~~L----------ED~Gy~cvD 30 (284) T pfam03668 1 IDLVIITGRSGAGKSVALRAL----------EDLGYYCVD 30 (284) T ss_pred CEEEEEECCCCCCHHHHHHHH----------HCCCCEEEC T ss_conf 969999579977799999999----------818924775 No 118 >PRK09435 arginine/ornithine transport system ATPase; Provisional Probab=85.64 E-value=1.3 Score=24.12 Aligned_cols=43 Identities=28% Similarity=0.485 Sum_probs=34.2 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCC Q ss_conf 2355304774067999999999998299578478885231001110277 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDG 504 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~ 504 (806) -+=|.|.-|.|||-.|+.++..++-+ -..|-.+-|||- |.|.| T Consensus 51 ~iGiTG~pG~GKStli~~l~~~~~~~--g~~v~vlavDPs----S~~sg 93 (325) T PRK09435 51 RIGITGVPGVGKSTFIEALGMHLIEQ--GHKVAVLAVDPS----STRTG 93 (325) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHHC--CCEEEEEEECCC----CCCCC T ss_conf 99742799986889999999999967--985899997899----99888 No 119 >pfam04602 Arabinose_trans Mycobacterial cell wall arabinan synthesis protein. Arabinosyltransferase is involved in arabinogalactan (AG) biosynthesis pathway in mycobacteria. AG is a component of the macromolecular assembly of the mycolyl-AG-peptidoglycan complex of the cell wall. This enzyme has important clinical applications as it is believed to be the target of the antimycobacterial drug Ethambutol. Probab=85.58 E-value=2.9 Score=21.62 Aligned_cols=94 Identities=18% Similarity=0.145 Sum_probs=50.9 Q ss_pred CCCCCCCCC-CCCCCCHHHHHHHHHHHHHHH--HHHHH----------HHHHHHHHHHHHHHHHHCCCHHH----HHHHH Q ss_conf 689631368-864500013467899999999--99999----------99999999999999982765236----78999 Q gi|254780799|r 51 YDPSFSYIT-LRSPKNFLGYGGAIFADVAIQ--FFGIA----------SVFFLPPPTMWALSLLFDKKIYC----FSKRA 113 (806) Q Consensus 51 ~DPs~~~~s-~~~v~N~~G~lGA~iAd~L~~--lFG~~----------Ayllpl~Ll~~g~~ll~~k~~~~----~~~Rl 113 (806) .|+.|.+.- ...++...|+--.|..+++.| +|.-. -+++.+.++...+.++|++++.- ..+|+ T Consensus 461 ~DQTLa~V~Eatrvr~~vGPsl~Wy~E~~RY~~L~~~t~DGS~~RRf~vLl~llcL~~~~~~lLRrgripG~A~gPs~RL 540 (1075) T pfam04602 461 RDQTLATVLEATRIKYAVGPSIAWYQEHLRYYFLTVESPDGSIARRFAVLVLLLCLFGSLAMLLRRGRIPGVASGPAWRL 540 (1075) T ss_pred CCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHH T ss_conf 27428999999998875588807777899999986678775078889999999999999999801678787667825889 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCC----CCHHHHHH Q ss_conf 9999999999999998504443223334----32488999 Q gi|254780799|r 114 TAWLINILVSATFFASFSPSQSWPIQNG----FGGIIGDL 149 (806) Q Consensus 114 ~~~il~ll~~s~ll~~~~~~~~~~l~~g----~GGiiG~~ 149 (806) ++. .+.+.++..|.+ .-|.+.+| .||.++.+ T Consensus 541 ~gv----~~~~l~~l~fTP-TKWTHHFG~fAGla~alaAl 575 (1075) T pfam04602 541 IGI----TAISLLLLMFTP-TKWTHQFGAFAGLAGALGAL 575 (1075) T ss_pred HHH----HHHHHHHHHCCC-CHHHHHHHHHHHHHHHHHHH T ss_conf 999----999999994086-22543420788899999999 No 120 >PRK11664 ATP-dependent RNA helicase HrpB; Provisional Probab=85.48 E-value=0.97 Score=25.19 Aligned_cols=96 Identities=20% Similarity=0.250 Sum_probs=49.4 Q ss_pred HHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCC-CH----HHHCCCCCEEEEC--- Q ss_conf 999999866414237999965777535543554110251587645866421233886-45----7865887547736--- Q gi|254780799|r 613 AVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQ-GA----EQLLGQGDMLYMT--- 684 (806) Q Consensus 613 ~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~-ga----e~Llg~gdml~~~--- 684 (806) -..+++|+++.- .+.+.-+-+--++=.-||-|||-+-. -++|-.|-.+ || ..-|.+-++|... T Consensus 471 ~~~~l~~~~~~~-------~~~~~~~~~g~lla~AfPDRIArrRg--~~gry~LanGrga~L~~~d~L~~~ewLvaadl~ 541 (812) T PRK11664 471 RAQQLLKRLNVR-------GGEVDSSLIAPLLALAFPDRIARRRG--QDGRYQLANGMGAMLDADDALSRHEWLVAPLLL 541 (812) T ss_pred HHHHHHHHHHHC-------CCCCCHHHHHHHHHHHCHHHHHHHHC--CCCCEEEECCCEEEECCCCCCCCCCCEEEEEEC T ss_conf 999999986540-------48998678999999978599988618--998679726875887888777788858999813 Q ss_pred -CCCC-EEEE-EECCCCHHHHHHHHHHH-HHCCCCCC Q ss_conf -8983-2588-83348988999999999-71289742 Q gi|254780799|r 685 -GGGR-VQRI-HGPFVSDIEVEKVVSHL-KTQGEAKY 717 (806) Q Consensus 685 -~~~~-~~r~-~g~~v~~~ev~~v~~~~-~~q~~~~y 717 (806) ++.. -.|| +++=++.++++..-... ..+-...| T Consensus 542 ~~~~~~~~rI~lAa~i~~~~l~~~~~~~i~~~~~~~w 578 (812) T PRK11664 542 QGSASPDARILLALPLDIDELVQRCPQLVQQSDTVEW 578 (812) T ss_pred CCCCCCHHEEEEECCCCHHHHHHHHHHHCEEEEEEEE T ss_conf 6787752218862688999999886756288799997 No 121 >cd03279 ABC_sbcCD SbcCD and other Mre11/Rad50 (MR) complexes are implicated in the metabolism of DNA ends. They cleave ends sealed by hairpin structures and are thought to play a role in removing protein bound to DNA termini. Probab=84.98 E-value=1.5 Score=23.84 Aligned_cols=40 Identities=33% Similarity=0.590 Sum_probs=30.1 Q ss_pred EEEEECCCCC---EEEEEECCCCHHHHHHHHHHHHHHHHCCHHH Q ss_conf 6785410020---2355304774067999999999998299578 Q gi|254780799|r 446 PIIADLARMP---HLLIAGTTGSGKSVAINTMILSLLYRMTPAQ 486 (806) Q Consensus 446 pvv~DLakMP---HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~e 486 (806) ....|.+++. --||-|-||||||--+++|.-. ||-.+|.. T Consensus 17 ~~~IDF~~~~~~~lflI~G~nGsGKSTIlDAI~~a-LYGk~~r~ 59 (213) T cd03279 17 EQVIDFTGLDNNGLFLICGPTGAGKSTILDAITYA-LYGKTPRY 59 (213) T ss_pred CEEEECCCCCCCCEEEEECCCCCCHHHHHHHHHHH-HCCCCCCC T ss_conf 71784876787888999889999788999999999-83888233 No 122 >CHL00081 chlI Mg-protoporyphyrin IX chelatase Probab=84.97 E-value=0.69 Score=26.26 Aligned_cols=46 Identities=22% Similarity=0.233 Sum_probs=31.9 Q ss_pred CCHHHHHHHHHHHHCCCEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCCHHH Q ss_conf 770389999999965985000142220011778999999999779868022 Q gi|254780799|r 739 ADDLYKQAVDIVLRDNKASISYIQRRLGIGYNRAASIIENMEEKGVIGPAS 789 (806) Q Consensus 739 ~d~l~~~a~~~v~~~~~~s~s~lqr~~~igy~raar~~~~~e~~giv~~~~ 789 (806) .|.+...+++++.+.+-.|- |=.|.-.||||-.=-|+-+-.|.|.+ T Consensus 264 ~~~~~~~i~~~~~~~~v~g~-----RA~I~l~raARA~AAL~GR~~V~~eD 309 (347) T CHL00081 264 DYDLRVKISQICSELDVDGL-----RGDIVTNRAAKALAAFNGRDEVTPGD 309 (347) T ss_pred CHHHHHHHHHHHHHHCCCCC-----CHHHHHHHHHHHHHHHCCCCCCCHHH T ss_conf 99999999999998489987-----18999999999999986998368999 No 123 >cd03114 ArgK-like The function of this protein family is unkown. The protein sequences are similar to the ArgK protein in E. coli. ArgK protein is a membrane ATPase which is required for transporting arginine, ornithine and lysine into the cells by the arginine and ornithine (AO system) and lysine, arginine and ornithine (LAO) transport systems. Probab=84.76 E-value=1.2 Score=24.50 Aligned_cols=35 Identities=37% Similarity=0.636 Sum_probs=30.4 Q ss_pred EEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCC Q ss_conf 5304774067999999999998299578478885231 Q gi|254780799|r 459 IAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPK 495 (806) Q Consensus 459 IAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK 495 (806) |.|+-|+|||-.|+.++.-+.-+ -..|=.|-+||- T Consensus 4 itG~pGaGKStLi~~l~~~~~~~--g~~VaVlavDPs 38 (148) T cd03114 4 ITGVPGAGKSTLIDALITALRAR--GKRVAVLAIDPS 38 (148) T ss_pred ECCCCCCCHHHHHHHHHHHHHHC--CCEEEEEEECCC T ss_conf 25899787899999999999978--983799996888 No 124 >PRK12402 replication factor C small subunit 2; Reviewed Probab=84.70 E-value=0.7 Score=26.25 Aligned_cols=30 Identities=33% Similarity=0.649 Sum_probs=26.3 Q ss_pred CCCCEEEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 002023553047740679999999999982 Q gi|254780799|r 452 ARMPHLLIAGTTGSGKSVAINTMILSLLYR 481 (806) Q Consensus 452 akMPHLLIAGtTGSGKSV~iN~iI~SlLyk 481 (806) .++||||.-|.-|+||.-+...+.-.|... T Consensus 34 ~~~phlLf~GPpG~GKTt~A~~lA~~l~~~ 63 (337) T PRK12402 34 GNLPHLVVYGPSGSGKTAAVRALARELYGD 63 (337) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHHHHCCC T ss_conf 998769888929848999999999996799 No 125 >KOG1384 consensus Probab=84.67 E-value=0.41 Score=27.96 Aligned_cols=50 Identities=32% Similarity=0.495 Sum_probs=27.9 Q ss_pred EEEEEECCCCHHHH-HHHHHHHHHHHHCCHHHEEEEEECCCH------------HHHHHCCCCHHHHCCCCC Q ss_conf 23553047740679-999999999982995784788852310------------011102770343122334 Q gi|254780799|r 456 HLLIAGTTGSGKSV-AINTMILSLLYRMTPAQCRLIMIDPKM------------LELSVYDGIPNLLTPVVT 514 (806) Q Consensus 456 HLLIAGtTGSGKSV-~iN~iI~SlLyk~~P~evkliliDPK~------------vEls~Y~~iPHLl~pVvT 514 (806) -..|.||||||||= .|+. =+.-|-| |..+-|| +-+..=.|+||-|.+++- T Consensus 9 VvvI~G~TGsGKSrLaVdL------A~rf~~E---IINsDkmQvYkGldivTnK~t~~e~~gVPHHLlg~l~ 71 (348) T KOG1384 9 VVVIMGATGAGKSRLAVDL------ATRFPGE---IINSDKMQVYKGLDIVTNKITLQERKGVPHHLLGHLH 71 (348) T ss_pred EEEEECCCCCCHHHHHHHH------HHHCCCE---EECCCCEEEECCCCCCCCCCCHHHCCCCCHHHHCCCC T ss_conf 9999557777704667888------9757864---6515633563276620166875540798767707688 No 126 >cd01120 RecA-like_NTPases RecA-like NTPases. This family includes the NTP binding domain of F1 and V1 H+ATPases, DnaB and related helicases as well as bacterial RecA and related eukaryotic and archaeal recombinases. This group also includes bacterial conjugation proteins and related DNA transfer proteins involved in type II and type IV secretion. Probab=84.63 E-value=3.2 Score=21.32 Aligned_cols=47 Identities=28% Similarity=0.503 Sum_probs=34.9 Q ss_pred EEEEHHHHHHHHHHC-------CCHHHHHHHHHHHHHHHCCEEEEEEECCCCCC Q ss_conf 877634468888732-------10058899999986641423799996577753 Q gi|254780799|r 592 IVVVIDEMADLMMVA-------RKDIESAVQRLAQMARASGIHVIMATQRPSVD 638 (806) Q Consensus 592 ivviiDElaDlmm~~-------~~~ve~~i~rlaq~ara~GiHli~aTqrPsvd 638 (806) -++|||.+-=+-... ...|-+...+|.+.|+--+|-.|+.-|=+.-| T Consensus 87 vliiiDSit~~~~a~~e~~~g~~~~v~~~~~~L~~~Ak~~~itvi~i~~v~~d~ 140 (165) T cd01120 87 DLIILDELTRLVRALREIREGYPGELDEELRELLERARKGGVTVIFTLQVPSGD 140 (165) T ss_pred EEEEEECHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCC T ss_conf 799992889988774001588678999999999999977982899999843377 No 127 >cd01124 KaiC KaiC is a circadian clock protein primarily found in cyanobacteria KaiC is a RecA-like ATPase, having both Walker A and Walker B motifs. A related protein is found in archaea. Probab=84.60 E-value=3.2 Score=21.31 Aligned_cols=43 Identities=19% Similarity=0.423 Sum_probs=30.2 Q ss_pred EEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCC Q ss_conf 7763446888873210058899999986641423799996577 Q gi|254780799|r 593 VVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRP 635 (806) Q Consensus 593 vviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrP 635 (806) +||||=+.-+.+....+.-..+.++.+..|..|+--++-++.. T Consensus 98 ~vVIDSi~~l~~~~~~~~~~~~~~l~~~l~~~~~t~ll~~e~~ 140 (187) T cd01124 98 RVVIDSVSGLLLMEQSTARLEIRRLLFALKRFGVTTLLTSEQS 140 (187) T ss_pred EEEECCHHHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEC T ss_conf 9999486887525666689999999999997699689999742 No 128 >COG0324 MiaA tRNA delta(2)-isopentenylpyrophosphate transferase [Translation, ribosomal structure and biogenesis] Probab=84.54 E-value=0.5 Score=27.32 Aligned_cols=74 Identities=20% Similarity=0.356 Sum_probs=41.5 Q ss_pred CCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHE---------EEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHH Q ss_conf 02023553047740679999999999982995784---------788852310011102770343122334304566899 Q gi|254780799|r 453 RMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQC---------RLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVL 523 (806) Q Consensus 453 kMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~ev---------kliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL 523 (806) +|+=++|+|.|+||||--=.. |=+..+-|+ |=+=|=-=+.-++-..++||.|.-+ .||.+.-+|- T Consensus 2 ~~~~i~I~GPTAsGKT~lai~-----LAk~~~~eIIs~DSmQvYr~mdIGTAKps~~e~~~vpHhliDi-~~p~e~ysa~ 75 (308) T COG0324 2 KPKLIVIAGPTASGKTALAIA-----LAKRLGGEIISLDSMQVYRGLDIGTAKPSLEELAGVPHHLIDI-RDPTESYSAA 75 (308) T ss_pred CCCEEEEECCCCCCHHHHHHH-----HHHHCCCCEEECCHHHHCCCCCCCCCCCCHHHHCCCCEEEECC-CCCCCCCCHH T ss_conf 963799989887577899999-----9998299289302355318886307999999985899787545-6832255499 Q ss_pred HH---HHHHHHH Q ss_conf 99---9999999 Q gi|254780799|r 524 KW---LVCEMEE 532 (806) Q Consensus 524 ~w---~V~EMe~ 532 (806) +| |.++|++ T Consensus 76 ~f~~~a~~~i~~ 87 (308) T COG0324 76 EFQRDALAAIDD 87 (308) T ss_pred HHHHHHHHHHHH T ss_conf 999999999999 No 129 >KOG2035 consensus Probab=84.46 E-value=3.2 Score=21.27 Aligned_cols=166 Identities=27% Similarity=0.434 Sum_probs=98.8 Q ss_pred CCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEE----EEECCCH-HHHHHCCCCHHHHCCCCCCHHHHHHHHHHH Q ss_conf 00202355304774067999999999998299578478----8852310-011102770343122334304566899999 Q gi|254780799|r 452 ARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRL----IMIDPKM-LELSVYDGIPNLLTPVVTNPQKAVTVLKWL 526 (806) Q Consensus 452 akMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkl----iliDPK~-vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~ 526 (806) ..+||||+-|-.|+||---|=+++- =||-..-+++|. +..+-|+ +|++.-..--||=. .|.+|-+-=+-+ T Consensus 32 ~d~PHll~yGPSGaGKKTrimclL~-elYG~gveklki~~~t~~tpS~kklEistvsS~yHlEi----tPSDaG~~DRvV 106 (351) T KOG2035 32 GDFPHLLVYGPSGAGKKTRIMCLLR-ELYGVGVEKLKIETRTFTTPSKKKLEISTVSSNYHLEI----TPSDAGNYDRVV 106 (351) T ss_pred CCCCEEEEECCCCCCCHHHHHHHHH-HHHCCCCHHEEEEEEEEECCCCCEEEEEEECCCCEEEE----CHHHCCCCCHHH T ss_conf 7787078888898872111899999-88578724505666788648886379999425651774----734337511799 Q ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHC Q ss_conf 99999999999870899689999999988744786677544677654543222332232346986877634468888732 Q gi|254780799|r 527 VCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVA 606 (806) Q Consensus 527 V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~ 606 (806) |.||=+ -||. .+-|+.- ..-||=||||-|--.| T Consensus 107 iQellK---evAQ--t~qie~~--------------------------------------~qr~fKvvvi~ead~L---- 139 (351) T KOG2035 107 IQELLK---EVAQ--TQQIETQ--------------------------------------GQRPFKVVVINEADEL---- 139 (351) T ss_pred HHHHHH---HHHH--HCCHHHC--------------------------------------CCCCEEEEEEECHHHH---- T ss_conf 999999---9874--1413332--------------------------------------6665489998035765---- Q ss_pred CCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCHHHHCCCCCEEEECCC Q ss_conf 10058899999986641423799996577753554355411025158764586642123388645786588754773689 Q gi|254780799|r 607 RKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGAEQLLGQGDMLYMTGG 686 (806) Q Consensus 607 ~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~gae~Llg~gdml~~~~~ 686 (806) -||-..+.-|---| .+.-+.|||.--.- ||| .--|-||.+ T Consensus 140 T~dAQ~aLRRTMEk-Ys~~~RlIl~cns~--------------Sri----IepIrSRCl--------------------- 179 (351) T KOG2035 140 TRDAQHALRRTMEK-YSSNCRLILVCNST--------------SRI----IEPIRSRCL--------------------- 179 (351) T ss_pred HHHHHHHHHHHHHH-HHCCCEEEEEECCC--------------CCC----HHHHHHHEE--------------------- T ss_conf 08899999999999-86071699992674--------------302----267762205--------------------- Q ss_pred CCEEEEEECCCCHHHHHHHHHHH-HHCC Q ss_conf 83258883348988999999999-7128 Q gi|254780799|r 687 GRVQRIHGPFVSDIEVEKVVSHL-KTQG 713 (806) Q Consensus 687 ~~~~r~~g~~v~~~ev~~v~~~~-~~q~ 713 (806) .+|+-+ -||+||-.|-..+ +.++ T Consensus 180 --~iRvpa--ps~eeI~~vl~~v~~kE~ 203 (351) T KOG2035 180 --FIRVPA--PSDEEITSVLSKVLKKEG 203 (351) T ss_pred --EEECCC--CCHHHHHHHHHHHHHHHC T ss_conf --876789--987899999999998733 No 130 >COG4901 Ribosomal protein S25 [Translation, ribosomal structure and biogenesis] Probab=84.28 E-value=1.3 Score=24.15 Aligned_cols=63 Identities=22% Similarity=0.343 Sum_probs=56.5 Q ss_pred CCCHHHHHHHHHHHHCCCEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHCCCCCEEEECH Q ss_conf 777038999999996598500014222001177899999999977986802278872673171 Q gi|254780799|r 738 VADDLYKQAVDIVLRDNKASISYIQRRLGIGYNRAASIIENMEEKGVIGPASSTGKREILISS 800 (806) Q Consensus 738 ~~d~l~~~a~~~v~~~~~~s~s~lqr~~~igy~raar~~~~~e~~giv~~~~g~~~r~vl~~~ 800 (806) -+.++|+++..=|-..+..+.+.|--|+.|+-.=|-+.+-.||++|+|-+..++..-+++++. T Consensus 42 vdee~~~ki~KEV~~~r~VTpy~la~r~gI~~SvAr~vLR~LeeeGvv~lvsknrR~~IY~~a 104 (107) T COG4901 42 VDEELLDKIRKEVPRERVVTPYVLASRYGINGSVARIVLRHLEEEGVVQLVSKNRRQAIYTRA 104 (107) T ss_pred CCHHHHHHHHHHCCCCEEECHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECCCCCCEEEECC T ss_conf 059999999986254515549998888555258999999998767744400167642244215 No 131 >PRK13652 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=84.03 E-value=1.8 Score=23.19 Aligned_cols=100 Identities=22% Similarity=0.270 Sum_probs=56.9 Q ss_pred CEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCC Q ss_conf 868776344-6888873210058899999986641423799996577753554355411025158764586642123388 Q gi|254780799|r 590 PYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGE 668 (806) Q Consensus 590 p~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~ 668 (806) |- |++-|| .|-|=-.+.+++-+.|.+|.| --|+-.|++|..-+. =+.+-.||.. --+-|.|.+. T Consensus 156 P~-lLlLDEPtagLDp~~~~~i~~~l~~l~~---~~g~Tii~vtHdl~~-------v~~~aDri~v----l~~G~ii~~G 220 (277) T PRK13652 156 PQ-VLVLDEPTAGLDPQGVKELFDFLNALPE---TYGMTVIFSTHQVEL-------VAEMADYIYV----MEKGEIVGYG 220 (277) T ss_pred CC-EEEECCCCCCCCHHHHHHHHHHHHHHHH---HCCCEEEEECCCHHH-------HHHHCCEEEE----EECCEEEEEC T ss_conf 99-9998397454899999999999999998---509899999148999-------9997999999----9899999987 Q ss_pred CCHHHHCCCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHCCCCC Q ss_conf 645786588754773689832588833489889999999997128974 Q gi|254780799|r 669 QGAEQLLGQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKTQGEAK 716 (806) Q Consensus 669 ~gae~Llg~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q~~~~ 716 (806) -.|.++.+=|- +.-+.+...++-++...+|++|-+- T Consensus 221 -tp~ev~~~p~~-----------l~~~~l~~p~~~~l~~~L~~~g~~i 256 (277) T PRK13652 221 -TVQEIFLQPDL-----------LARARLDLPSLPKLIQSLRAQGIAI 256 (277) T ss_pred -CHHHHHHCHHH-----------HHHCCCCCCHHHHHHHHHHHCCCCC T ss_conf -89999749899-----------9977999983999999999759999 No 132 >pfam07787 DUF1625 Protein of unknown function (DUF1625). Sequences making up this family are derived from hypothetical proteins expressed by both prokaryotic and eukaryotic species. The region in question is approximately 250 residues long. Probab=84.00 E-value=2.7 Score=21.79 Aligned_cols=24 Identities=8% Similarity=0.158 Sum_probs=13.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 367899999999999999999985 Q gi|254780799|r 107 YCFSKRATAWLINILVSATFFASF 130 (806) Q Consensus 107 ~~~~~Rl~~~il~ll~~s~ll~~~ 130 (806) ..+..|+++|+++.+.+.+++.-+ T Consensus 181 ~tW~~R~~G~~lmf~G~~~~~~pl 204 (247) T pfam07787 181 LTWVLRFAGLILMFLGLILLLSPL 204 (247) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999999 No 133 >TIGR00618 sbcc exonuclease SbcC; InterPro: IPR004592 All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. SbcC may have nuclease activity that is functionally related to one of the nuclease activities of the RecBCD enzyme (IPR004586 from INTERPRO).; GO: 0004527 exonuclease activity, 0006259 DNA metabolic process. Probab=83.99 E-value=0.79 Score=25.86 Aligned_cols=12 Identities=25% Similarity=0.058 Sum_probs=6.4 Q ss_pred CCCCCHHHHHHH Q ss_conf 645000134678 Q gi|254780799|r 61 RSPKNFLGYGGA 72 (806) Q Consensus 61 ~~v~N~~G~lGA 72 (806) .++-=++|.-|| T Consensus 30 l~~f~i~G~tGA 41 (1063) T TIGR00618 30 LKLFVICGKTGA 41 (1063) T ss_pred CCEEEEECCCCC T ss_conf 573677788998 No 134 >PRK10334 mechanosensitive channel MscS; Provisional Probab=83.98 E-value=3.4 Score=21.12 Aligned_cols=54 Identities=7% Similarity=-0.063 Sum_probs=23.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 67899999999999999999985044432233343248899999999998646899 Q gi|254780799|r 108 CFSKRATAWLINILVSATFFASFSPSQSWPIQNGFGGIIGDLIIRLPFLFFESYPR 163 (806) Q Consensus 108 ~~~~Rl~~~il~ll~~s~ll~~~~~~~~~~l~~g~GGiiG~~l~~~l~~~lg~~g~ 163 (806) .+..+++.+.+..+.....+..+..+.. .+- ...|..|-.++-.+...++++.+ T Consensus 67 ~fl~~~v~~~i~~~~ii~~L~~lGi~~t-sl~-A~~G~~GlaIGlAlQ~~lsN~~s 120 (285) T PRK10334 67 DFLSALVRYGIIAFTLIAALGRVGVQTA-SVI-AVLGAAGLAVGLALQGSLSNLAA 120 (285) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCHH-HHH-HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999998298779-999-99999999999999999999998 No 135 >pfam03308 ArgK ArgK protein. The ArgK protein acts as an ATPase enzyme and as a kinase, and phosphorylates periplasmic binding proteins involved in the LAO (lysine, arginine, ornithine)/AO transport systems. Probab=83.96 E-value=1.4 Score=23.87 Aligned_cols=38 Identities=29% Similarity=0.503 Sum_probs=32.3 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCC Q ss_conf 2355304774067999999999998299578478885231 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPK 495 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK 495 (806) -+=|.|+-|+|||-.|+.++.-++-+ -..|=-+-|||- T Consensus 31 ~iGiTG~PGaGKStli~~l~~~~~~~--g~~vaVlAvDPS 68 (267) T pfam03308 31 RVGITGVPGAGKSTLIEALGMELRRR--GHRVAVLAVDPS 68 (267) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHHC--CCEEEEEEECCC T ss_conf 99876899887999999999999968--986899997899 No 136 >cd03115 SRP The signal recognition particle (SRP) mediates the transport to or across the plasma membrane in bacteria and the endoplasmic reticulum in eukaryotes. SRP recognizes N-terminal sighnal sequences of newly synthesized polypeptides at the ribosome. The SRP-polypeptide complex is then targeted to the membrane by an interaction between SRP and its cognated receptor (SR). In mammals, SRP consists of six protein subunits and a 7SL RNA. One of these subunits is a 54 kd protein (SRP54), which is a GTP-binding protein that interacts with the signal sequence when it emerges from the ribosome. SRP54 is a multidomain protein that consists of an N-terminal domain, followed by a central G (GTPase) domain and a C-terminal M domain. Probab=83.79 E-value=3.4 Score=21.07 Aligned_cols=74 Identities=19% Similarity=0.209 Sum_probs=53.0 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHH----HHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 35530477406799999999999829957847888523100----11102770343122334304566899999999999 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKML----ELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEE 532 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~v----Els~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~ 532 (806) .+..|-||+||.-.+==+-.-++ .....|-+|=.|-.|. -|..|-.|--.-.-++-++......+++.+.++.. T Consensus 3 i~lvGptGvGKTTTiaKLA~~~~--~~~~kV~lit~Dt~R~gA~eQL~~~a~~l~v~~~~~~~~~~~~~~~~~~~~~~~~ 80 (173) T cd03115 3 ILLVGLQGVGKTTTAAKLALYLK--KKGKKVLLVAADTYRPAAIEQLRVLGEQVGVPVFEEGEGKDPVSIAKRAIEHARE 80 (173) T ss_pred EEEECCCCCCHHHHHHHHHHHHH--HCCCEEEEEECCCCCHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHHHHH T ss_conf 99989999988999999999999--7699289997488757799999999997498599227755879999999999875 No 137 >pfam08282 Hydrolase_3 haloacid dehalogenase-like hydrolase. This family contains haloacid dehalogenase-like hydrolase enzymes. Probab=83.64 E-value=2.2 Score=22.49 Aligned_cols=82 Identities=20% Similarity=0.298 Sum_probs=48.0 Q ss_pred HHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCHHHHCCCCCEEEECCCCC Q ss_conf 05889999998664142379999657775355435541102515876458664212338864578658875477368983 Q gi|254780799|r 609 DIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGAEQLLGQGDMLYMTGGGR 688 (806) Q Consensus 609 ~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~gae~Llg~gdml~~~~~~~ 688 (806) .+-......-+++|+.|||+++||=||-..+-.-+-+-+++ .-++-.+|| +.|.+. ++ T Consensus 15 ~i~~~~~~ai~~l~~~G~~~~iaTGR~~~~~~~~~~~l~~~-------------~~~I~~NGa--------~i~~~~-~~ 72 (254) T pfam08282 15 KISERTKEAIKKLQEKGIKVVIATGRPYRGALPVLEELGLD-------------LPVICFNGA--------YIYDEN-GK 72 (254) T ss_pred CCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHCCCC-------------CCEEECCCC--------EEEECC-CE T ss_conf 38999999999999889999999498799999999980999-------------759977971--------999689-86 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHCCC Q ss_conf 25888334898899999999971289 Q gi|254780799|r 689 VQRIHGPFVSDIEVEKVVSHLKTQGE 714 (806) Q Consensus 689 ~~r~~g~~v~~~ev~~v~~~~~~q~~ 714 (806) .++-.+++.+++.++++++++.+- T Consensus 73 --~l~~~~i~~~~~~~i~~~~~~~~~ 96 (254) T pfam08282 73 --ILYKNPISKEDVKEIIEYLKENNL 96 (254) T ss_pred --EEEEECCCHHHHHHHHHHHHHCCC T ss_conf --999934999999999999998698 No 138 >cd03243 ABC_MutS_homologs The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family also possess a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, and recognition of specific DNA stru Probab=83.63 E-value=1.7 Score=23.40 Aligned_cols=143 Identities=19% Similarity=0.253 Sum_probs=82.0 Q ss_pred HHHCCCCCHHHCC--CCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCC Q ss_conf 4302210002104--56666667854100202355304774067999999999998299578478885231001110277 Q gi|254780799|r 427 FEKNQCDLAINLG--KSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDG 504 (806) Q Consensus 427 f~~s~~~L~iaLG--KdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~ 504 (806) .++..|||-..+. ++.-.+.+. |.+=.-++|-|.-.||||+++-++-+..++-..= .-|==+..++++|+. T Consensus 2 i~~~rHPlle~~~~~~~~VpNdi~--l~~~~~~iiTGpN~sGKSt~lkti~l~~~laq~G-----~~vpa~~~~~~~~~~ 74 (202) T cd03243 2 IKGGRHPVLLALTKGETFVPNDIN--LGSGRLLLITGPNMGGKSTYLRSIGLAVLLAQIG-----CFVPAESASIPLVDR 74 (202) T ss_pred CCCCCCCEEEEECCCCCEECCEEE--ECCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHC-----CEEEECCCEEECCCE T ss_conf 775748668534378977878288--6798289998998875399999999999999838-----737204468944666 Q ss_pred CHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 03431223343045668999999999999999987089968999999998874478667754467765454322233223 Q gi|254780799|r 505 IPNLLTPVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHF 584 (806) Q Consensus 505 iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 584 (806) | ++- +.+.......+-.--.||.|=-.++..+. T Consensus 75 i---~~~-~~~~d~~~~~~S~F~~e~~~~~~i~~~~~------------------------------------------- 107 (202) T cd03243 75 I---FTR-IGAEDSISDGRSTFMAELLELKEILSLAT------------------------------------------- 107 (202) T ss_pred E---EEE-ECCCHHHHCCCCHHHHHHHHHHHHHHHCC------------------------------------------- T ss_conf 9---998-46602444353549999999999998677------------------------------------------- Q ss_pred CCCCCCEEEEEHHHHHHHHHHCCCHHHH-------HHHHHHHHHHHCCEEEEEEECCCCC Q ss_conf 2346986877634468888732100588-------9999998664142379999657775 Q gi|254780799|r 585 DFQHMPYIVVVIDEMADLMMVARKDIES-------AVQRLAQMARASGIHVIMATQRPSV 637 (806) Q Consensus 585 ~~~~lp~ivviiDElaDlmm~~~~~ve~-------~i~rlaq~ara~GiHli~aTqrPsv 637 (806) +.-.|++||+.- |-+-++ .+-.|++ .+.+.|+||-=+.. T Consensus 108 -----~~slvliDE~~~-----gT~~~eg~~la~a~l~~l~~----~~~~~i~tTH~~~L 153 (202) T cd03243 108 -----PRSLVLIDELGR-----GTSTAEGLAIAYAVLEHLLE----KGCRTLFATHFHEL 153 (202) T ss_pred -----CCCEEEECCCCC-----CCCHHHHHHHHHHHHHHHHH----CCCEEEEEECCHHH T ss_conf -----777242052347-----99867879999999999985----36849998253888 No 139 >PRK10649 hypothetical protein; Provisional Probab=83.62 E-value=3.5 Score=21.02 Aligned_cols=13 Identities=8% Similarity=0.457 Sum_probs=9.9 Q ss_pred CCEEEEEHHHHHH Q ss_conf 9868776344688 Q gi|254780799|r 589 MPYIVVVIDEMAD 601 (806) Q Consensus 589 lp~ivviiDElaD 601 (806) .|++|=.=|++.. T Consensus 482 IPf~iW~S~~~~~ 494 (564) T PRK10649 482 IPFLLWTSEKWQA 494 (564) T ss_pred EEEEEEECHHHHH T ss_conf 0289998799998 No 140 >PRK05707 DNA polymerase III subunit delta'; Validated Probab=83.41 E-value=1.6 Score=23.48 Aligned_cols=143 Identities=25% Similarity=0.310 Sum_probs=82.4 Q ss_pred CCCCCE-EEEEECCCCHHHHHHHHHHHHHHHHCCHH--------HEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHH Q ss_conf 100202-35530477406799999999999829957--------847888523100111027703431223343045668 Q gi|254780799|r 451 LARMPH-LLIAGTTGSGKSVAINTMILSLLYRMTPA--------QCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVT 521 (806) Q Consensus 451 LakMPH-LLIAGtTGSGKSV~iN~iI~SlLyk~~P~--------evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~ 521 (806) -.++|| +|+.|--|.||+-..-.+-.+||-....+ .||+ ++=...-|+ |.+.| .++.. T Consensus 18 ~~rl~HA~Lf~G~~G~GK~~lA~~~A~~LlC~~~~~~~~Cg~C~sC~~-------~~~~~HPD~-~~i~p-----e~~~~ 84 (328) T PRK05707 18 RGRHAHAYLLHGPAGIGKRALAERLAAFLLCEAPQGGGACGSCKGCQL-------LAAGSHPDN-FVLEP-----EEADK 84 (328) T ss_pred CCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHH-------HHCCCCCCE-EEEEC-----CCCCC T ss_conf 798220464479998679999999999984899999899988889999-------875899987-99842-----66677 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHH Q ss_conf 99999999999999998708996899999999887447866775446776545432223322323469868776344688 Q gi|254780799|r 522 VLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMAD 601 (806) Q Consensus 522 aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaD 601 (806) ..+|-.|-+-++++.. .. ..=.|=|+|||+ || T Consensus 85 -----------------~I~IdqIR~l~~~~~~---~~---------------------------~~g~~KV~iI~~-Ae 116 (328) T PRK05707 85 -----------------PIKVDQVRELVSFVVQ---TA---------------------------QLGGRKVVLIEP-AE 116 (328) T ss_pred -----------------CCCHHHHHHHHHHHHH---CC---------------------------CCCCCEEEEEEH-HH T ss_conf -----------------6979999999999831---76---------------------------678957999502-87 Q ss_pred HHHHCC-----CHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHC Q ss_conf 887321-----00588999999866414237999965777535543554110251587645866421233 Q gi|254780799|r 602 LMMVAR-----KDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTIL 666 (806) Q Consensus 602 lmm~~~-----~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtil 666 (806) .|-+++ |-.|++ ..+-..||.|..|+ =|-.+|++--= ++.|..-+.-+.+.-| T Consensus 117 ~m~~~AaNALLKtLEEP---------p~~t~fiL~t~~~~--~lLpTI~SRCq-~~~~~~p~~e~~~~~L 174 (328) T PRK05707 117 AMNRNAANALLKSLEEP---------SGQTVLLLISHQPS--RLLPTIKSRCQ-QLACPLPSNEPSLQWL 174 (328) T ss_pred HHCHHHHHHHHHHHHCC---------CCCEEEEEEECCHH--HCHHHHHHCCE-EEECCCCCHHHHHHHH T ss_conf 73899999999985078---------98759998609934--48258874141-3348998999999999 No 141 >PRK12339 2-phosphoglycerate kinase; Provisional Probab=83.33 E-value=0.59 Score=26.80 Aligned_cols=110 Identities=18% Similarity=0.224 Sum_probs=69.1 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 35530477406799999999999829957847888523100111027703431223343045668999999999999999 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQK 536 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~l 536 (806) +||+||||.|||.. . -||+-.=||+|.+. || .++||-|+|- T Consensus 6 iligG~sGvGKStl-----a--------------------~~lA~rlgi~~vis---TD----------~IRevlR~~i- 46 (197) T PRK12339 6 HFIGGIPGVGKTSI-----S--------------------GYIARHRAIDIVLS---GD----------YLREFLRPYV- 46 (197) T ss_pred EEEECCCCCCHHHH-----H--------------------HHHHHHCCCCEEEC---CH----------HHHHHHHHHC- T ss_conf 99857998878999-----9--------------------99999749975534---34----------7999999866- Q ss_pred HHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCHHHHHHHH Q ss_conf 98708996899999999887447866775446776545432223322323469868776344688887321005889999 Q gi|254780799|r 537 MSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKDIESAVQR 616 (806) Q Consensus 537 ~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~ve~~i~r 616 (806) .+.-+=+-..|+.- +.-|.| +. ++ ||.-|-| . ..-|+..|.+ T Consensus 47 -~~eP~L~~Ssy~A~----~~~~~~------------~~-----------~~------ii~Gf~~---q-~~~V~~gi~a 88 (197) T PRK12339 47 -DDEPVLAKSVYDAW----EFYGSM------------TD-----------EN------IVKGYLD---Q-ARAIMPGINR 88 (197) T ss_pred -CCCCCHHHHHHHHH----HHCCCC------------CH-----------HH------HHHHHHH---H-HHHHHHHHHH T ss_conf -88740033046798----870896------------52-----------78------9999999---9-9999999999 Q ss_pred HHHHHHHCCEEEEEEECCCCCCCCCHHHHHC Q ss_conf 9986641423799996577753554355411 Q gi|254780799|r 617 LAQMARASGIHVIMATQRPSVDVITGTIKAN 647 (806) Q Consensus 617 laq~ara~GiHli~aTqrPsvdvitg~ikan 647 (806) +-..|-.-|.-+|+- .|-++.|+|+.+ T Consensus 89 vi~Ra~~eg~slIIE----GVHlvP~~i~~~ 115 (197) T PRK12339 89 VIRRALLNGEDLVIE----SLYFHPPMIDEN 115 (197) T ss_pred HHHHHHHCCCCEEEE----EEEECHHHHHHH T ss_conf 999999739977998----521177887788 No 142 >PRK05703 flhF flagellar biosynthesis regulator FlhF; Validated Probab=83.30 E-value=2.6 Score=21.94 Aligned_cols=67 Identities=21% Similarity=0.343 Sum_probs=52.8 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCH----HHHHHCCCCHHHHCCCCCCHHHHHHHHH Q ss_conf 553047740679999999999982995784788852310----0111027703431223343045668999 Q gi|254780799|r 458 LIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKM----LELSVYDGIPNLLTPVVTNPQKAVTVLK 524 (806) Q Consensus 458 LIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~----vEls~Y~~iPHLl~pVvTd~~kA~~aL~ 524 (806) ...|.||+||.-.|==|=....++.....|-||=+|-=| =-|-.|-+|=.+=.-||.+|.+-..+|+ T Consensus 214 alVGPTGVGKTTTiAKLAA~~~l~~~~~kV~lIT~DtyRigA~eQLk~Ya~ilgvp~~v~~~~~~l~~al~ 284 (412) T PRK05703 214 ALVGPTGVGKTTTLAKLAARYALEEGKDKVALITLDTYRIGAVEQLKTYAKIMGIPVKVAYDPKELAKALE 284 (412) T ss_pred EEECCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHH T ss_conf 99888887567699999999999729981799983767777999999999971973798479999999998 No 143 >cd03282 ABC_MSH4_euk MutS4 homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding c Probab=83.28 E-value=2.3 Score=22.39 Aligned_cols=101 Identities=16% Similarity=0.185 Sum_probs=55.2 Q ss_pred HHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHH Q ss_conf 30221000210456666667854100202355304774067999999999998299578478885231001110277034 Q gi|254780799|r 428 EKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPN 507 (806) Q Consensus 428 ~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPH 507 (806) ++..|||---..++.-.+.+..+=.+---++|-|.-.||||+++-++-+..++-..= ..|=-+..++.+|+.| T Consensus 3 k~~rHPlle~~~~~~VpNdi~l~~~~~~~~iITGpN~gGKSt~Lktigl~~ilAq~G-----~~vpA~~a~~~~~d~I-- 75 (204) T cd03282 3 RDSRHPILDRDKKNFIPNDIYLTRGSSRFHIITGPNMSGKSTYLKQIALLAIMAQIG-----CFVPAEYATLPIFNRL-- 75 (204) T ss_pred CCCCCCEEECCCCCEECCEEEECCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHC-----CEEEHHHCCCCCCCEE-- T ss_conf 577086385688967876688669972599998999887199999999999999968-----9167132103366778-- Q ss_pred HHCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 312233430456689999999999999999 Q gi|254780799|r 508 LLTPVVTNPQKAVTVLKWLVCEMEERYQKM 537 (806) Q Consensus 508 Ll~pVvTd~~kA~~aL~w~V~EMe~RY~l~ 537 (806) ++ -+-+...-...+.--..||.|=-+++ T Consensus 76 -~t-~i~~~d~i~~~~StF~~E~~~~~~il 103 (204) T cd03282 76 -LS-RLSNDDSMERNLSTFASEMSETAYIL 103 (204) T ss_pred -EE-EECCCCCCCCCCCHHHHHHHHHHHHH T ss_conf -99-98766610066458999999999999 No 144 >PRK06067 flagellar accessory protein FlaH; Validated Probab=83.20 E-value=3.6 Score=20.90 Aligned_cols=130 Identities=16% Similarity=0.167 Sum_probs=63.0 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 23553047740679999999999982995784788852310011102770343122334304566899999999999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQ 535 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~ 535 (806) -.||+|.+|+|||+.-.-++..-+-+- +.|=++-.+.-.-+ ++ +-...+.|=+.+++.+.+ T Consensus 34 ~~li~G~~G~GKt~~~~~f~~~~~~~g--~~~~~~~~ee~~~~---------~~--------~~~~~~g~dl~~~~~~G~ 94 (241) T PRK06067 34 LILIEGENDTGKSVLSQQFVWGALNQG--KRGLAITTENTSKS---------YL--------KQMESLKLDISDFFIWGY 94 (241) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHHCC--CEEEEEEECCCHHH---------HH--------HHHHHCCCCHHHHHHCCC T ss_conf 899980799887999999999998679--82999994289999---------99--------999983998599986697 Q ss_pred HH--------HHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHC- Q ss_conf 99--------870899689999999988744786677544677654543222332232346986877634468888732- Q gi|254780799|r 536 KM--------SKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVA- 606 (806) Q Consensus 536 l~--------a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~- 606 (806) +. ...+-..+++.-.++.++.++. =+. |||||=+.-+.... T Consensus 95 L~i~~~~~~~~~~~~~~~~~ll~~l~~~v~~~-----------------------------~~~-~vVIDSls~l~~~~~ 144 (241) T PRK06067 95 LRIFPLNTEGFEWNSELAEKLLDLIIEFIKRR-----------------------------REE-VIIIDSLTIFATYAS 144 (241) T ss_pred CEEEECCCCCCCCCHHHHHHHHHHHHHHHHHH-----------------------------CCC-EEEECCHHHHHCCCC T ss_conf 05783241113421556899999999999971-----------------------------998-999928017541388 Q ss_pred CCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 100588999999866414237999965777 Q gi|254780799|r 607 RKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 607 ~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) .+++-..+.+|.... .-|.-.+ -|-.|. T Consensus 145 ~~~~~~~l~~l~~l~-~~g~tvl-lt~~~~ 172 (241) T PRK06067 145 EDDVLNFFTECKNLC-DNGKTIL-ITLHPY 172 (241) T ss_pred HHHHHHHHHHHHHHH-HCCCEEE-EEECCC T ss_conf 899999999999999-6898899-990567 No 145 >PTZ00209 retrotransposon hot spot protein; Provisional Probab=83.19 E-value=1 Score=25.00 Aligned_cols=38 Identities=24% Similarity=0.491 Sum_probs=22.4 Q ss_pred CCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEE Q ss_conf 002023553047740679999999999982995784788 Q gi|254780799|r 452 ARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLI 490 (806) Q Consensus 452 akMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkli 490 (806) .-+||+|| ||-|=|||.+.-+-++-=|..+.-+++.+| T Consensus 171 ~P~~~vLI-GTPGIGKSm~aGSyLLYqLLHyDae~L~vV 208 (693) T PTZ00209 171 TPPTHIVI-GISGIGKSCGVGSFLLHSLLHFHEGMLDVV 208 (693) T ss_pred CCCCEEEE-CCCCCCCCCCCHHHHHHHHHHCCHHHCCEE T ss_conf 99852897-899755333428999999871357458589 No 146 >PRK11131 ATP-dependent RNA helicase HrpA; Provisional Probab=83.15 E-value=1 Score=25.00 Aligned_cols=13 Identities=23% Similarity=0.304 Sum_probs=7.8 Q ss_pred CCCHHHHHHHHHH Q ss_conf 9968999999998 Q gi|254780799|r 542 VRNIDGFNLKVAQ 554 (806) Q Consensus 542 vRni~~yN~k~~~ 554 (806) +|+=++|++.... T Consensus 1124 vr~~~~F~~~~~~ 1136 (1295) T PRK11131 1124 VWTEEGFAALHEK 1136 (1295) T ss_pred CCCHHHHHHHHHH T ss_conf 9999999999999 No 147 >PRK10744 phosphate transporter subunit; Provisional Probab=83.05 E-value=3.6 Score=20.86 Aligned_cols=154 Identities=21% Similarity=0.323 Sum_probs=78.0 Q ss_pred CEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCH--------------------------HHEEEEEECCCHHH Q ss_conf 6678541002023553047740679999999999982995--------------------------78478885231001 Q gi|254780799|r 445 KPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTP--------------------------AQCRLIMIDPKMLE 498 (806) Q Consensus 445 ~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P--------------------------~evkliliDPK~vE 498 (806) +-+-.++.+==-+-+-|-.|||||..+++|- -||...| ..+-++.=||-... T Consensus 27 ~~vsl~i~~Ge~~~liG~nGaGKSTLlk~i~--gl~~l~p~~~~~G~I~~~G~~i~~~~~~~~~~r~~ig~v~Q~~~~f~ 104 (257) T PRK10744 27 KNINLDIAKNQVTAFIGPSGCGKSTLLRTFN--KMYELYPEQRAEGEILLDGDNILTNSQDIALLRAKVGMVFQKPTPFP 104 (257) T ss_pred CCCEEEECCCCEEEEECCCCCCHHHHHHHHH--HHHCCCCCCCCCCEEEECCEECCCCHHHHHHHHHCEEEEEECCCCCC T ss_conf 1428998899899999999981999999998--76512788875405968992655761549998720689943675577 Q ss_pred HHHCCCCHHHHCCCCC--CHHHHHHHHHHHHHHHHH---HHHHHHHCCCCCHHHHH-HHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 1102770343122334--304566899999999999---99999870899689999-99998874478667754467765 Q gi|254780799|r 499 LSVYDGIPNLLTPVVT--NPQKAVTVLKWLVCEMEE---RYQKMSKIGVRNIDGFN-LKVAQYHNTGKKFNRTVQTGFDR 572 (806) Q Consensus 499 ls~Y~~iPHLl~pVvT--d~~kA~~aL~w~V~EMe~---RY~l~a~~~vRni~~yN-~k~~~~~~~~~~~~~~~~~~~~~ 572 (806) +++|+.|-.-+. ... ...+.....+|+..+..- ..+. ..-...++.|=- +|+.-|++- T Consensus 105 ~tv~~nv~~~~~-~~~~~~~~~~~~rv~~~l~~~~l~~~~~~~-~~~~~~~LSGGqkQRvaiArAL-------------- 168 (257) T PRK10744 105 MSIYDNIAFGVR-LFEKLSRADMDERVQWALTKAALWNETKDK-LHQSGYSLSGGQQQRLCIARGI-------------- 168 (257) T ss_pred CCHHHHHHHHHH-HHCCCCHHHHHHHHHHHHHHHCCCCHHHHH-CCCCCCCCCHHHHHHHHHHHHH-------------- T ss_conf 609999877577-633588889999999999983253101342-1577455898898999998777-------------- Q ss_pred CCCCCCCCCCCCCCCCCCEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 45432223322323469868776344-68888732100588999999866414237999965777 Q gi|254780799|r 573 KTGEAIYETEHFDFQHMPYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 573 ~~~~~~~~~~~~~~~~lp~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) -.=|- |++-|| .+-|=....++|.+.|..|.+ ++-.|+.|.+.+ T Consensus 169 --------------~~~P~-vLllDEPts~LD~~~~~~i~~ll~~l~~-----~~Tvi~itHdl~ 213 (257) T PRK10744 169 --------------AIRPE-VLLLDEPCSALDPISTGRIEELITELKQ-----DYTVVIVTHNMQ 213 (257) T ss_pred --------------HCCCC-EEEECCCCCCCCHHHHHHHHHHHHHHHH-----CCEEEEEEECHH T ss_conf --------------51999-8887788553699999999999999971-----996999961999 No 148 >PRK13768 GTPase; Provisional Probab=82.84 E-value=3.5 Score=20.98 Aligned_cols=76 Identities=17% Similarity=0.213 Sum_probs=45.6 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCC---CCHHHHC--CCCCCHHHH-HHHHHHHHHH Q ss_conf 235530477406799999999999829957847888523100111027---7034312--233430456-6899999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYD---GIPNLLT--PVVTNPQKA-VTVLKWLVCE 529 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~---~iPHLl~--pVvTd~~kA-~~aL~w~V~E 529 (806) -.+|-|.-|||||-..++|---+- ..-..|..|=-||- +|--.|. ||-.+.+ -|.-+-+-- -.||-.|.+- T Consensus 4 ~~~ViGpaGSGKsT~~~~l~~~l~--~~~r~~~vvNLDPA-~e~~pY~~~iDIRd~i~~~dVM~~~~LGPNGali~~~e~ 80 (253) T PRK13768 4 IVFFLGTAGSGKTTLVGALSDWLE--EQGYDVAIVNLDPA-VEYLPYKPDIDVREYVSAREIMRKYGLGPNGALIASVDL 80 (253) T ss_pred EEEEECCCCCCHHHHHHHHHHHHH--HCCCCEEEEECCCC-CCCCCCCCCCCHHHHCCHHHHHHHHCCCCCHHHHHHHHH T ss_conf 899989999988999999999999--76997599978986-658999988637861789999988198964689999999 Q ss_pred HHHHH Q ss_conf 99999 Q gi|254780799|r 530 MEERY 534 (806) Q Consensus 530 Me~RY 534 (806) ...+. T Consensus 81 l~~~~ 85 (253) T PRK13768 81 LLTKA 85 (253) T ss_pred HHHHH T ss_conf 99989 No 149 >PRK12363 phosphoglycerol transferase I; Provisional Probab=82.45 E-value=3.8 Score=20.70 Aligned_cols=52 Identities=23% Similarity=0.360 Sum_probs=34.3 Q ss_pred CC-CCCCCCCEEEEEECCCCCCE-EEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEE Q ss_conf 10-00388612555302456623-87342307213430221000210456666667854 Q gi|254780799|r 394 RV-AVIPRRNAIGIELPNDIRET-VMLRDLIVSRVFEKNQCDLAINLGKSIEGKPIIAD 450 (806) Q Consensus 394 RI-apIPGK~~VGIEiPN~~r~~-V~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~D 450 (806) || +.=-+|++| |=|-+.+=.| =.+.|+|. +...-.|-+.+|||+.-+-++.+ T Consensus 374 rIr~Sp~~kNTi-IVv~SDHLAM~NtatD~L~----k~kR~Nlll~~g~dl~pqq~~~~ 427 (703) T PRK12363 374 RIRNSRYGKNTI-IVIASDHLAMPNDLSDVLT----KQKRENLLLFLGKDIAPQQVVTR 427 (703) T ss_pred HHHCCCCCCCEE-EEEECCCCCCCCCHHHHHH----HHHHHCEEEEECCCCCHHHHHHC T ss_conf 996396658649-9996454457761677764----30012037886588766776531 No 150 >PRK01172 ski2-like helicase; Provisional Probab=82.36 E-value=3.9 Score=20.68 Aligned_cols=18 Identities=22% Similarity=0.152 Sum_probs=9.2 Q ss_pred CHHHHHHHHHHHHHHHHH Q ss_conf 256654578999999742 Q gi|254780799|r 328 SPKVMQNNACTLKSVLSD 345 (806) Q Consensus 328 s~eeL~~nA~lLE~tL~d 345 (806) ++..-+..|+.|-+.+.. T Consensus 245 sR~~~e~~A~~l~~~~~~ 262 (674) T PRK01172 245 SRKNAEDYAEMLIQHFPE 262 (674) T ss_pred CHHHHHHHHHHHHHHHHH T ss_conf 758899999999985321 No 151 >PRK09586 murP N-acetylmuramic acid phosphotransfer permease; Reviewed Probab=82.30 E-value=3.9 Score=20.66 Aligned_cols=25 Identities=12% Similarity=0.048 Sum_probs=13.9 Q ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHC Q ss_conf 999999999-9999999999999827 Q gi|254780799|r 79 IQFFGIASV-FFLPPPTMWALSLLFD 103 (806) Q Consensus 79 ~~lFG~~Ay-llpl~Ll~~g~~ll~~ 103 (806) ++.+|.+.| ++|+++.|-+.+.|.. T Consensus 162 l~~~~~a~FyFLPi~va~saAK~f~~ 187 (474) T PRK09586 162 MKVFSKGLFTFLVILIGYNAAQAFGG 187 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99999989999999999999998099 No 152 >TIGR02538 type_IV_pilB type IV-A pilus assembly ATPase PilB; InterPro: IPR013374 This model describes a protein involved in type IV pilus biogenesis designated PilB in Pseudomonas aeruginosa but PilF in Neisseria gonorrhoeae; the more common usage, reflected here, is PilB. This protein is an ATPase involved in protein export for pilin assembly, and is closely related to GspE (IPR013369 from INTERPRO) of type II secretion systems (also referred to as the main terminal branch of the general secretion pathway). Note that type IV pilus systems are often divided into type IV-A and IV-B, with the latter group including bundle-forming pilus, mannose-sensitive hemagglutinin, etc. Members of this family are found in type IV-A systems.; GO: 0005524 ATP binding, 0008565 protein transporter activity, 0009297 pilus biogenesis. Probab=82.21 E-value=0.68 Score=26.32 Aligned_cols=19 Identities=47% Similarity=0.793 Sum_probs=12.8 Q ss_pred EEEEEECCCCHHHHHHHHH Q ss_conf 2355304774067999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTM 474 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~i 474 (806) =+||-|=|||||.|-+=|= T Consensus 328 MvLVTGPTGSGKTVSLYTa 346 (577) T TIGR02538 328 MVLVTGPTGSGKTVSLYTA 346 (577) T ss_pred CEEEECCCCCCHHHHHHHH T ss_conf 2886266598416878763 No 153 >PRK12438 hypothetical protein; Provisional Probab=82.17 E-value=3.9 Score=20.63 Aligned_cols=163 Identities=17% Similarity=0.197 Sum_probs=81.1 Q ss_pred HEEEEEECCCHHHHHHCCCC-HHHHCCCCCCHHHHHHH------HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC Q ss_conf 84788852310011102770-34312233430456689------999999999999999870899689999999988744 Q gi|254780799|r 486 QCRLIMIDPKMLELSVYDGI-PNLLTPVVTNPQKAVTV------LKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNT 558 (806) Q Consensus 486 evkliliDPK~vEls~Y~~i-PHLl~pVvTd~~kA~~a------L~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~ 558 (806) .|+|=-+||+=-=|..|..| |.|..|.=-=|..--.- |-.+=.||=.|| .|-|=..|=.+ T Consensus 654 tV~~Y~~D~~DPil~tw~kiFP~lfkp~semp~~L~~H~RYPeDLFkvQ~~~l~~Y------Hvtdp~~FYn~------- 720 (979) T PRK12438 654 TVTLFQFDRDDPVLRTWMRAFPGTVKSEDQIPDELRAHFRYPEDLFEVQRSLLAKY------HVDEPREFFTT------- 720 (979) T ss_pred CEEEEECCCCCHHHHHHHHHCCCCCCCHHHCCHHHHHHCCCCHHHHHHHHHHHHHC------CCCCHHHHCCC------- T ss_conf 07899708988599999986963269947789999966079689999999999852------57982664177------- Q ss_pred CCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCEEEEEHH------HHHHHHH-HCCCHHHHHHHHHHHHHHHCCEEEEE Q ss_conf 7866775446-77654543222332232346986877634------4688887-32100588999999866414237999 Q gi|254780799|r 559 GKKFNRTVQT-GFDRKTGEAIYETEHFDFQHMPYIVVVID------EMADLMM-VARKDIESAVQRLAQMARASGIHVIM 630 (806) Q Consensus 559 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~lp~ivviiD------ElaDlmm-~~~~~ve~~i~rlaq~ara~GiHli~ 630 (806) |. .|. ..|+..++.. .-=||-++.=+ ||. ||. -.+..=+.+++-||--+-+-.--=++ T Consensus 721 -eD----~W~vP~dp~~~~~~--------~~pPYY~~~~lPg~~~~eF~-L~~pftp~~R~nl~a~laarsD~~~YGkl~ 786 (979) T PRK12438 721 -NA----FWSVPSDPTNDANA--------TQPPFYVLVGDQQSAQPSFR-LASAMVGYNREFLSAYISAHSDPANYGKLT 786 (979) T ss_pred -CC----CCCCCCCCCCCCCC--------CCCCEEEEEECCCCCCCCEE-EECCCCCCCCHHHHEEEEEECCCCCCCEEE T ss_conf -76----32288877788886--------37980799877899984169-750778787336350225624856786699 Q ss_pred EECCCCCCCCCHH--HHHCCC--CEEEEEECCCCCCHHHCCCCCHHHHCCCCCEEEECC Q ss_conf 9657775355435--541102--515876458664212338864578658875477368 Q gi|254780799|r 631 ATQRPSVDVITGT--IKANFP--TRISFQVSSKIDSRTILGEQGAEQLLGQGDMLYMTG 685 (806) Q Consensus 631 aTqrPsvdvitg~--ikan~p--~riaf~v~s~~dSrtild~~gae~Llg~gdml~~~~ 685 (806) .=|=|+-.-|-|= |.|+|- +.||= -+|.+ ++|-+ +=+|+||-+|- T Consensus 787 ly~~Pk~~~v~GP~Qi~~~i~qd~~IS~-------qltLw-~~gS~--Vi~GnLLvlPi 835 (979) T PRK12438 787 VLELPTDTLTQGPQQIQNSMISDTRVAS-------ERTLL-ERSNR--IHYGNLLSLPI 835 (979) T ss_pred EEECCCCCCCCCHHHHHHHHHCCHHHHH-------HHHHH-HCCCE--EEECCEEEEEC T ss_conf 9988999968886999987527848888-------87776-35883--68664489964 No 154 >TIGR01064 pyruv_kin pyruvate kinase; InterPro: IPR001697 Pyruvate kinase (2.7.1.40 from EC) (PK) catalyses the final step in glycolysis , the conversion of phosphoenolpyruvate to pyruvate with concomitant phosphorylation of ADP to ATP:ADP + phosphoenolpyruvate = ATP + pyruvate The enzyme, which is found in all living organisms, requires both magnesium and potassium ions for its activity. In vertebrates, there are four tissue-specific isozymes: L (liver), R (red cells), M1 (muscle, heart and brain), and M2 (early foetal tissue). In plants, PK exists as cytoplasmic and plastid isozymes, while most bacteria and lower eukaryotes have one form, except in certain bacteria, such as Escherichia coli, that have two isozymes. All isozymes appear to be tetramers of identical subunits of ~500 residues. PK helps control the rate of glycolysis, along with phosphofructokinase (IPR000023 from INTERPRO) and hexokinase (IPR001312 from INTERPRO). PK possesses allosteric sites for numerous effectors, yet the isozymes respond differently, in keeping with their different tissue distributions . The activity of L-type (liver) PK is increased by fructose-1,6-bisphosphate (F1,6BP) and lowered by ATP and alanine (gluconeogenic precursor), therefore when glucose levels are high, glycolysis is promoted, and when levels are low, gluconeogenesis is promoted. L-type PK is also hormonally regulated, being activated by insulin and inhibited by glucagon, which covalently modifies the PK enzyme. M1-type (muscle, brain) PK is inhibited by ATP, but F1,6BP and alanine have no effect, which correlates with the function of muscle and brain, as opposed to the liver. The structure of several pyruvate kinases from various organisms have been determined , . The protein comprises three-four domains: a small N-terminal helical domain (absent in bacterial PK), a beta/alpha-barrel domain, a beta-barrel domain (inserted within the beta/alpha-barrel domain), and a 3-layer alpha/beta/alpha sandwich domain. ; GO: 0000287 magnesium ion binding, 0004743 pyruvate kinase activity, 0030955 potassium ion binding, 0006096 glycolysis. Probab=82.03 E-value=0.97 Score=25.19 Aligned_cols=66 Identities=36% Similarity=0.543 Sum_probs=51.0 Q ss_pred HHHHHHHHHCCCH--HHH-------HHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCE-----EEEEECCCCCC Q ss_conf 4468888732100--588-------999999866414237999965777535543554110251-----58764586642 Q gi|254780799|r 597 DEMADLMMVARKD--IES-------AVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTR-----ISFQVSSKIDS 662 (806) Q Consensus 597 DElaDlmm~~~~~--ve~-------~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~r-----iaf~v~s~~dS 662 (806) =|-+|=.|||.-| ||- .=-+|.|+||..|+-.|.||| -.--+|+-=.||| ||++| T Consensus 259 ~~aSDGiMVARGDLGvEip~eeVp~~QK~~I~~cn~~gk~VItATQ-----MLdSMi~Np~PTRAEVsDVANAi------ 327 (513) T TIGR01064 259 AEASDGIMVARGDLGVEIPAEEVPILQKKLIRKCNRAGKPVITATQ-----MLDSMIKNPRPTRAEVSDVANAI------ 327 (513) T ss_pred HHHHCCEEEEECCCCEECCHHHHHHHHHHHHHHHHHHCCEEEEEEC-----CHHHCCCCCCCCCCEEEEEEEEE------ T ss_conf 9872952897656702547247999999999999850992799833-----24550068889411132235553------ Q ss_pred HHHCCCCCHHHHCCCCCEEEECC Q ss_conf 12338864578658875477368 Q gi|254780799|r 663 RTILGEQGAEQLLGQGDMLYMTG 685 (806) Q Consensus 663 rtild~~gae~Llg~gdml~~~~ 685 (806) || || |-+.|+| T Consensus 328 ---LD--Gt-------DAvMLSG 338 (513) T TIGR01064 328 ---LD--GT-------DAVMLSG 338 (513) T ss_pred ---EC--CC-------CEEECCC T ss_conf ---06--77-------6131010 No 155 >PRK13631 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=81.98 E-value=1.1 Score=24.86 Aligned_cols=27 Identities=26% Similarity=0.409 Sum_probs=21.2 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHH Q ss_conf 541002023553047740679999999 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMI 475 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI 475 (806) .++.+==-+-|.|.+|||||..++++- T Consensus 47 ~~i~~Ge~vaIIG~nGsGKSTL~~~l~ 73 (320) T PRK13631 47 YTFEKNKIYFIIGNSGSGKSTLVTHFN 73 (320) T ss_pred EEECCCCEEEEECCCCCCHHHHHHHHH T ss_conf 588599899999499984999999997 No 156 >TIGR00174 miaA tRNA delta(2)-isopentenylpyrophosphate transferase; InterPro: IPR002627 tRNA isopentenyltransferases 2.5.1.8 from EC also known as tRNA delta(2)-isopentenylpyrophosphate transferases or IPP transferases. These enzymes modify both cytoplasmic and mitochondrial tRNAs at A(37) to give isopentenyl A(37) .; GO: 0004811 tRNA isopentenyltransferase activity, 0005524 ATP binding, 0008033 tRNA processing. Probab=81.90 E-value=0.64 Score=26.50 Aligned_cols=10 Identities=60% Similarity=0.800 Sum_probs=4.3 Q ss_pred EEECCCCHHH Q ss_conf 5304774067 Q gi|254780799|r 459 IAGTTGSGKS 468 (806) Q Consensus 459 IAGtTGSGKS 468 (806) |+|+|+|||| T Consensus 4 i~GpTAvGKs 13 (307) T TIGR00174 4 IMGPTAVGKS 13 (307) T ss_pred EEECCCCCHH T ss_conf 7408855477 No 157 >COG0470 HolB ATPase involved in DNA replication [DNA replication, recombination, and repair] Probab=81.72 E-value=1.2 Score=24.39 Aligned_cols=31 Identities=39% Similarity=0.610 Sum_probs=27.3 Q ss_pred CCCE-EEEEECCCCHHHHHHHHHHHHHHHHCC Q ss_conf 0202-355304774067999999999998299 Q gi|254780799|r 453 RMPH-LLIAGTTGSGKSVAINTMILSLLYRMT 483 (806) Q Consensus 453 kMPH-LLIAGtTGSGKSV~iN~iI~SlLyk~~ 483 (806) +||| ||+.|-.|+||+.....+--+|+.... T Consensus 22 ~~~halL~~Gp~G~Gktt~a~~lA~~l~~~~~ 53 (325) T COG0470 22 RLPHALLFYGPPGVGKTTAALALAKELLCENP 53 (325) T ss_pred CCCCCEEEECCCCCCHHHHHHHHHHHHCCCCC T ss_conf 88761003799999789999999999658664 No 158 >PRK13850 type IV secretion system protein VirD4; Provisional Probab=81.58 E-value=0.39 Score=28.14 Aligned_cols=110 Identities=17% Similarity=0.224 Sum_probs=70.5 Q ss_pred EEEEEHHHHHHHHHHCCC--HHHHHHHHHHHHHHHCCEEEEEEECCCCC-CCCC-----HHHHHCCCCEEEEEECCCCCC Q ss_conf 687763446888873210--05889999998664142379999657775-3554-----355411025158764586642 Q gi|254780799|r 591 YIVVVIDEMADLMMVARK--DIESAVQRLAQMARASGIHVIMATQRPSV-DVIT-----GTIKANFPTRISFQVSSKIDS 662 (806) Q Consensus 591 ~ivviiDElaDlmm~~~~--~ve~~i~rlaq~ara~GiHli~aTqrPsv-dvit-----g~ikan~p~riaf~v~s~~dS 662 (806) .+....|||+-| || -+|.++.- || .-||.+.+-+|-.+= +=+= ..|-.|---||+|++...-+- T Consensus 390 ~vLlLLDEF~aL----GkL~iie~Ala~---mA-GYGiRl~lI~Qsl~QL~~~YGe~~a~tilsN~~vrI~fapnD~eTA 461 (670) T PRK13850 390 EVLFLLDEFKHL----GKLEAIETAITT---IA-GYKGRFMFIIQSLSALTGTYDEAGKQNFLSNTGVQVFMATADDETP 461 (670) T ss_pred EEEEEECCCCCC----CCHHHHHHHHHH---HC-CCCCEEEEEEECHHHHHHHHCHHHHHHHHHCCCEEEEECCCCHHHH T ss_conf 079986064105----874899999998---62-5687899998479999988584156788725726999657977669 Q ss_pred HHHCCCCC----------------------------------HHHHCCCCCEEEECCCCCEEEEEE-CCCCHHHHHHHHH Q ss_conf 12338864----------------------------------578658875477368983258883-3489889999999 Q gi|254780799|r 663 RTILGEQG----------------------------------AEQLLGQGDMLYMTGGGRVQRIHG-PFVSDIEVEKVVS 707 (806) Q Consensus 663 rtild~~g----------------------------------ae~Llg~gdml~~~~~~~~~r~~g-~~v~~~ev~~v~~ 707 (806) +-|=|.-| -=..|+.-|.|-+-.|..|+|..- .|-.|.+-.++-+ T Consensus 462 k~iS~~LG~~T~~~~s~S~~~~r~~~~s~s~seq~RpLLtP~Evr~Lp~d~eIvlv~G~~PI~akKirYY~D~~fk~i~~ 541 (670) T PRK13850 462 TYISKAIGDYTFKARSTSYSQARMFDHNIQISDQGAPLLRPEQVRLLDDDYEIVLIKGQPPLKLRKVRYYSDRELKRIFE 541 (670) T ss_pred HHHHHHHCHHHEEEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHCCCCCCEEEEECCCCCEEECEEEECCCHHHHHHHH T ss_conf 99999848213453311036787778874400266768997899539987669980699970333315415888888887 Q ss_pred H Q ss_conf 9 Q gi|254780799|r 708 H 708 (806) Q Consensus 708 ~ 708 (806) - T Consensus 542 ~ 542 (670) T PRK13850 542 A 542 (670) T ss_pred H T ss_conf 4 No 159 >COG4129 Predicted membrane protein [Function unknown] Probab=81.48 E-value=4.1 Score=20.46 Aligned_cols=34 Identities=24% Similarity=0.123 Sum_probs=23.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 1346789999999999999999999999999999 Q gi|254780799|r 67 LGYGGAIFADVAIQFFGIASVFFLPPPTMWALSL 100 (806) Q Consensus 67 ~G~lGA~iAd~L~~lFG~~Ayllpl~Ll~~g~~l 100 (806) +=.+|+.+|.+++.+||+..+.+-+.+++.-..+ T Consensus 63 g~~iG~~~a~l~~~l~g~~~~~~~v~~~i~i~~~ 96 (332) T COG4129 63 GNALGAILAVLFFLLFGQNPIAFGVVLLIIIPLL 96 (332) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH T ss_conf 9999999999999980756899999999999999 No 160 >PRK10525 cytochrome o ubiquinol oxidase subunit II; Provisional Probab=81.47 E-value=4.1 Score=20.45 Aligned_cols=40 Identities=15% Similarity=0.087 Sum_probs=28.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 0013467899999999999999999999999999998276 Q gi|254780799|r 65 NFLGYGGAIFADVAIQFFGIASVFFLPPPTMWALSLLFDK 104 (806) Q Consensus 65 N~~G~lGA~iAd~L~~lFG~~Ayllpl~Ll~~g~~ll~~k 104 (806) |--|.+++-..++++..+++..+++.+.++..-+..++.| T Consensus 31 dPkGpva~~q~~Li~~s~~lMliVvvpVi~l~~~~~~ryR 70 (305) T PRK10525 31 DPKGQIGLEQRSLILTAFGLMLIVVIPAILMAVGFAWKYR 70 (305) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEE T ss_conf 9998778999989999999998689999999999884111 No 161 >PRK09270 frcK putative fructose transport system kinase; Reviewed Probab=81.38 E-value=1.9 Score=23.05 Aligned_cols=135 Identities=20% Similarity=0.254 Sum_probs=63.8 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 55304774067999999999998299578478885231001110277034312233430456689999999999999999 Q gi|254780799|r 458 LIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQKM 537 (806) Q Consensus 458 LIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~l~ 537 (806) =|||.+|||||-.-..+...|--+..+..|-++=.| |- |+---+. .+|. ++ T Consensus 38 gIaG~pGSGKSTlA~~l~~~L~~~~~~~~~~~vpmD----------GF-H~~~~~L--------------~~~~----~~ 88 (230) T PRK09270 38 GIAGPPGAGKSTLAETLWEALSQQGPELPAIQVPMD----------GF-HLDNAVL--------------DARG----LR 88 (230) T ss_pred EEECCCCCCHHHHHHHHHHHHHHCCCCCCEEEECCC----------CC-CCCHHHH--------------HHCC----CC T ss_conf 998999889999999999998623799857997365----------33-4572555--------------4354----74 Q ss_pred HHCCC---CCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEE------HH-----HHHHHH Q ss_conf 87089---96899999999887447866775446776545432223322323469868776------34-----468888 Q gi|254780799|r 538 SKIGV---RNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVV------ID-----EMADLM 603 (806) Q Consensus 538 a~~~v---Rni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivvi------iD-----ElaDlm 603 (806) +.-|. =|..+|-+.++..+.....+.-| .||....++.-.. ....+-+-|||| -| +++|++ T Consensus 89 ~rkGaP~TFD~~~l~~~L~~Lk~~~~~v~~P---~yD~~~~d~~~~~--~~i~~~~~IVIvEGnyLLld~~~W~~l~~~~ 163 (230) T PRK09270 89 ARKGAPETFDVAGLAELLRRLREGDCEVYWP---VFDRQLEDPVADA--IVVGPTARLVIVEGNYLLLDDEPWRRLAGFF 163 (230) T ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCCEEEC---CCCCCCCCCCCCC--EEECCCCCEEEEECEEEECCCCCHHHHHHHH T ss_conf 3379910216988999999985689717521---3432245778895--3666998689993447613783289999863 Q ss_pred HH---CCCHHHHHHHHHHHHHHHCCE Q ss_conf 73---210058899999986641423 Q gi|254780799|r 604 MV---ARKDIESAVQRLAQMARASGI 626 (806) Q Consensus 604 m~---~~~~ve~~i~rlaq~ara~Gi 626 (806) -. ..-+.|....|+.+-=.+.|. T Consensus 164 D~~ifvd~~~~~~~~Rli~R~~~~G~ 189 (230) T PRK09270 164 DFSIFLDAPAEVLRERLVARKLAGGL 189 (230) T ss_pred CEEEEEECCHHHHHHHHHHHHHHCCC T ss_conf 76799848999999999999987299 No 162 >PRK12337 2-phosphoglycerate kinase; Provisional Probab=81.05 E-value=0.82 Score=25.71 Aligned_cols=282 Identities=17% Similarity=0.238 Sum_probs=125.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEE Q ss_conf 47863999997889999886300121000388612555302456623873423072134302210002104566666678 Q gi|254780799|r 369 APGIKSSRIIGLSDDIARSMSAISARVAVIPRRNAIGIELPNDIRETVMLRDLIVSRVFEKNQCDLAINLGKSIEGKPII 448 (806) Q Consensus 369 ApGVKvSKI~nLadDIA~aLsa~svRIapIPGK~~VGIEiPN~~r~~V~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv 448 (806) |.|+|.+.=-.+|-++-..|.-..+++.. -..-|+.|+- .|-=--|+|++..... T Consensus 198 aaG~~P~~Ay~iA~eie~~L~~~~~~~i~-----------~~elr~~v~~--------------~L~~~~~~~~A~rY~l 252 (492) T PRK12337 198 AAGVAPDVARKVARVTQRDLRGSGDRVVR-----------RDEIREKVEA--------------LLRDEVGPDVSARYRL 252 (492) T ss_pred HCCCCHHHHHHHHHHHHHHHHHCCCEEEE-----------HHHHHHHHHH--------------HHHHHCCHHHHHHHHH T ss_conf 80588889999999999999865887970-----------9999999999--------------9987303889999999 Q ss_pred EE-CCCC--C-EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHH Q ss_conf 54-1002--0-235530477406799999999999829957847888523100111027703431223343045668999 Q gi|254780799|r 449 AD-LARM--P-HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLK 524 (806) Q Consensus 449 ~D-LakM--P-HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~ 524 (806) .- +-++ | |+||+||||-|||+.= | ||.-==||+|.+. || T Consensus 253 wR~ir~~~~PiiILIGGaSGvGKSTlA-----s--------------------eLA~RLGI~~VIs---TD--------- 295 (492) T PRK12337 253 LRVLRKPPRPLHVLLGGVSGTGKSVLA-----A--------------------ELAYRLGITRVVP---TD--------- 295 (492) T ss_pred HHHHHCCCCCEEEEEECCCCCCHHHHH-----H--------------------HHHHHHCCCCCCC---CH--------- T ss_conf 999735688769996078886688899-----9--------------------9999609881025---44--------- Q ss_pred HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCEEEEEHHHHHHHH Q ss_conf 9999999999999870899689999999988744786677544677654-543222332232346986877634468888 Q gi|254780799|r 525 WLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRK-TGEAIYETEHFDFQHMPYIVVVIDEMADLM 603 (806) Q Consensus 525 w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~lp~ivviiDElaDlm 603 (806) .++||-|.+--=..+=.=+-.+||.- ...++. .+. +.-|+=--++.=|-|=- T Consensus 296 -sIREVMR~~is~el~P~Lh~SSy~Aw----------------k~L~~~~~~~----------~~~~~~~~vi~GF~~Qv 348 (492) T PRK12337 296 -AIREVMRAMVSKDLLPTLHASTFNAW----------------EALVPPGLGL----------PGEPTRVELLAGFRDQV 348 (492) T ss_pred -HHHHHHHHHCCHHHCCHHHHHHHHHH----------------HHCCCCCCCC----------CCCCCHHHHHHHHHHHH T ss_conf -79999998459764845777556888----------------8608734577----------77860768998999999 Q ss_pred HHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHH---HCCCCEEEEEECCCCCCHHHCCCCCHHHHCCC-CC Q ss_conf 732100588999999866414237999965777535543554---11025158764586642123388645786588-75 Q gi|254780799|r 604 MVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIK---ANFPTRISFQVSSKIDSRTILGEQGAEQLLGQ-GD 679 (806) Q Consensus 604 m~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ik---an~p~riaf~v~s~~dSrtild~~gae~Llg~-gd 679 (806) ..|...|-++...+---|..+|+- -|-++-|.|+ ..-|.=|-|-|+..=. . .--+.+.=+ -. T Consensus 349 ----~~V~vGl~aVieRa~~EG~SvVIE----GVHLvPg~i~~~~~e~~~vIp~mV~i~dE-e-----~Hr~RF~~R~r~ 414 (492) T PRK12337 349 ----QQVSVGLKAVVRRSILEGTSLVLE----GVHLVPGYLQHAYQDGALTVPMLVALPDE-E-----EHRRHFELRDRE 414 (492) T ss_pred ----HHHHHHHHHHHHHHHHCCCCEEEE----EEEECHHHHHHHHCCCCEEEEEEEEECCH-H-----HHHHHHHHHHHH T ss_conf ----999999999999999728867998----33307066666641587389999984767-9-----999999987514 Q ss_pred EEEECCCCCEEEEEECCCC-HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCEEE Q ss_conf 4773689832588833489-889999999997128974211001245566677888887777038999999996598500 Q gi|254780799|r 680 MLYMTGGGRVQRIHGPFVS-DIEVEKVVSHLKTQGEAKYIDIKDKILLNEEMRFSENSSVADDLYKQAVDIVLRDNKASI 758 (806) Q Consensus 680 ml~~~~~~~~~r~~g~~v~-~~ev~~v~~~~~~q~~~~y~~~~~~~~~~~~~~~~~~~~~~d~l~~~a~~~v~~~~~~s~ 758 (806) | ...|=-+-|+. =+||-.+-||+....+-. +----+..+.|+--++|+++|++.=.+-. T Consensus 415 t-------~~~Rp~ekYLk~F~eIR~IQdyLv~rAre~-------------gVPVI~n~~ldesvd~~~evi~~r~~~a~ 474 (492) T PRK12337 415 T-------AASRPLHRYMRHFEEIRLMQDHLLRLAREE-------------DVPVLDGETLDESADKAVEVVLRYVMVAL 474 (492) T ss_pred H-------CCCCCHHHHHHHHHHHHHHHHHHHHHHHHC-------------CCCCCCCCCHHHHHHHHHHHHHHHHHHHC T ss_conf 1-------036860179997999999999999999874-------------99820787667799999999999999845 Q ss_pred EHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 01422200117789999999997 Q gi|254780799|r 759 SYIQRRLGIGYNRAASIIENMEE 781 (806) Q Consensus 759 s~lqr~~~igy~raar~~~~~e~ 781 (806) .- -.|+++|=|.++. T Consensus 475 ~~--------e~~~~~l~~~~~~ 489 (492) T PRK12337 475 TP--------EERGQRLGEAHAA 489 (492) T ss_pred CH--------HHHHHHHHHHHHH T ss_conf 98--------9999887677776 No 163 >PRK10869 recombination and repair protein; Provisional Probab=81.03 E-value=0.92 Score=25.35 Aligned_cols=33 Identities=12% Similarity=0.143 Sum_probs=13.5 Q ss_pred CEEEECCCCCEEEEEECCCCHHHHHHHHHHHHH Q ss_conf 547736898325888334898899999999971 Q gi|254780799|r 679 DMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKT 711 (806) Q Consensus 679 dml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~ 711 (806) ++||.++-+.|.|==.--.|-.|..|+.=-+|. T Consensus 413 efl~s~N~G~~~~pL~kiASGGElSRimLAlk~ 445 (553) T PRK10869 413 EFRVTTNPGQPLQPIAKVASGGELSRIALAIQV 445 (553) T ss_pred EEEEECCCCCCCCCHHHHCCHHHHHHHHHHHHH T ss_conf 999952899997617776366089999999999 No 164 >cd03286 ABC_MSH6_euk MutS6 homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding c Probab=80.96 E-value=3.1 Score=21.41 Aligned_cols=103 Identities=17% Similarity=0.210 Sum_probs=60.0 Q ss_pred HHHCCCCCH-HHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCC Q ss_conf 430221000-2104566666678541002023553047740679999999999982995784788852310011102770 Q gi|254780799|r 427 FEKNQCDLA-INLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGI 505 (806) Q Consensus 427 f~~s~~~L~-iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~i 505 (806) |++..|||- ...+.+.-.+.+..|=.+=-.++|-|.-+||||+.+-++-+..++-..- ..|==+..++.+|+.| T Consensus 2 lk~~RHPll~~~~~~~~VpNdi~l~~~~~~~~iiTGpN~sGKSt~lk~i~l~~ilaq~G-----~~vpA~~~~~~~~d~i 76 (218) T cd03286 2 FEELRHPCLNASTASSFVPNDVDLGATSPRILVLTGPNMGGKSTLLRTVCLAVIMAQMG-----MDVPAKSMRLSLVDRI 76 (218) T ss_pred CCCCCCCEEECCCCCCEECCEEEECCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCC-----CCEECCCCEEECCCEE T ss_conf 76675877972689975875688679974089998999887389999999999999828-----8430146477346648 Q ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 343122334304566899999999999999998 Q gi|254780799|r 506 PNLLTPVVTNPQKAVTVLKWLVCEMEERYQKMS 538 (806) Q Consensus 506 PHLl~pVvTd~~kA~~aL~w~V~EMe~RY~l~a 538 (806) ++- +-+...-...+..-..||.+==+.+. T Consensus 77 ---~~~-i~~~d~~~~~~StF~~e~~~~~~il~ 105 (218) T cd03286 77 ---FTR-IGARDDIMKGESTFMVELSETANILR 105 (218) T ss_pred ---EEE-ECCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf ---997-45866143115069999999999998 No 165 >PRK11465 hypothetical protein; Provisional Probab=80.90 E-value=4.3 Score=20.32 Aligned_cols=13 Identities=15% Similarity=0.330 Sum_probs=5.5 Q ss_pred HHHHHHHHCCCCC Q ss_conf 9999999779868 Q gi|254780799|r 774 SIIENMEEKGVIG 786 (806) Q Consensus 774 r~~~~~e~~giv~ 786 (806) |+-+.++++||== T Consensus 703 ~lK~~FD~~GIei 715 (741) T PRK11465 703 QVKKHFDLAGVRA 715 (741) T ss_pred HHHHHHHHCCCCC T ss_conf 9999998779937 No 166 >cd03241 ABC_RecN RecN ATPase involved in DNA repair; ABC (ATP-binding cassette) transporter nucleotide-binding domain; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds including sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=80.85 E-value=0.94 Score=25.28 Aligned_cols=21 Identities=19% Similarity=0.392 Sum_probs=17.9 Q ss_pred HHHHHHHHHHHCCEEEEEEEC Q ss_conf 999999866414237999965 Q gi|254780799|r 613 AVQRLAQMARASGIHVIMATQ 633 (806) Q Consensus 613 ~i~rlaq~ara~GiHli~aTq 633 (806) .|+-|+|.|..|--|+.+.-+ T Consensus 228 ~ITHlpQvAa~ad~H~~V~K~ 248 (276) T cd03241 228 CITHLPQVAAMADNHFLVEKE 248 (276) T ss_pred EEECHHHHHHCCCCEEEEEEE T ss_conf 981779988413767999998 No 167 >pfam04466 Terminase_3 Phage terminase large subunit. Initiation of packaging of double-stranded viral DNA involves the specific interaction of the prohead with viral DNA in a process mediated by a phage-encoded terminase protein. The terminase enzymes are usually hetero-oligomers composed of a small and a large subunit. This region is found on the large subunit and possess an endonuclease and ATPase activity that require Mg2+ and a neutral or slightly basic reaction. This region is also found in bacterial sequences. Probab=80.54 E-value=2.6 Score=21.91 Aligned_cols=129 Identities=19% Similarity=0.273 Sum_probs=65.5 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHH-HHHHHHHHHHHHHHHH Q ss_conf 553047740679999999999982995784788852310011102770343122334304566-8999999999999999 Q gi|254780799|r 458 LIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAV-TVLKWLVCEMEERYQK 536 (806) Q Consensus 458 LIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~-~aL~w~V~EMe~RY~l 536 (806) ..-|..|||||+++-..++-..++. |. +.+++=+-+- .-+..+ .-|+|++ T Consensus 6 v~~GGrgsgKS~~~a~~~i~~~~~~-~~--~~l~~r~~~~-----------------slr~sv~~~~~~~i--------- 56 (387) T pfam04466 6 VAKGGRGSGKSYHIALKLVLKLLMH-PR--TNLVIREVKN-----------------TIEDSVFTQLQEAL--------- 56 (387) T ss_pred EEEECCCCHHHHHHHHHHHHHHHHC-CC--EEEEEECCHH-----------------HHHHHHHHHHHHHH--------- T ss_conf 9990888679999999999999878-98--6999975568-----------------89999999999999--------- Q ss_pred HHHCCCCCHHHHHHHHHHH--HHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-EEEEEHHHHHHHHHHCCCHHHHH Q ss_conf 9870899689999999988--74478667754467765454322233223234698-68776344688887321005889 Q gi|254780799|r 537 MSKIGVRNIDGFNLKVAQY--HNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMP-YIVVVIDEMADLMMVARKDIESA 613 (806) Q Consensus 537 ~a~~~vRni~~yN~k~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp-~ivviiDElaDlmm~~~~~ve~~ 613 (806) ..+|+.+.-..|+.--+. ..+| ..-...|.|... .+..+- .-.+.+||.+.+ ...+.+.. T Consensus 57 -~~~~~~~~~~~~~s~~~i~~~~~g---s~i~f~G~d~~~----------~iks~~~i~~~~~eEa~~~---~~~~~~~l 119 (387) T pfam04466 57 -SMLGLDHEFKISKSPIEITVKING---SKFLFYGMDDPA----------KIKSIKDVSDAWIEEAAEF---KTEDFDQL 119 (387) T ss_pred -HHCCCCCEEEECCCCEEEEECCCC---CEEEEEECCCHH----------HHHCCCCCEEEEEECHHHC---CHHHHHHH T ss_conf -976997348973761489987899---699998578968----------8416366149999412447---99899999 Q ss_pred HHHHHHHHHHCCEEEEEEECCC Q ss_conf 9999986641423799996577 Q gi|254780799|r 614 VQRLAQMARASGIHVIMATQRP 635 (806) Q Consensus 614 i~rlaq~ara~GiHli~aTqrP 635 (806) +.|+-.. -.|.++++.|- | T Consensus 120 ~~~~r~~--~~~~~i~~~~N-P 138 (387) T pfam04466 120 IPTIRRP--KPGSEIFMSFN-P 138 (387) T ss_pred HHHHCCC--CCCEEEEEECC-C T ss_conf 9885317--88719999828-9 No 168 >PRK10530 phosphotransferase; Provisional Probab=80.48 E-value=4.4 Score=20.22 Aligned_cols=86 Identities=19% Similarity=0.237 Sum_probs=48.9 Q ss_pred CCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCHHHHCCCCCEEEECCC Q ss_conf 10058899999986641423799996577753554355411025158764586642123388645786588754773689 Q gi|254780799|r 607 RKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGAEQLLGQGDMLYMTGG 686 (806) Q Consensus 607 ~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~gae~Llg~gdml~~~~~ 686 (806) .+.+-..-.+--+++|+.||+.++||=||-..+- .+.+. +. .|.- ++-.+|| ..|-..+ T Consensus 18 ~~~i~~~~~~ai~~l~~~Gi~~~iaTGR~~~~~~-~~~~~-----l~------~~~~-~I~~NGa--------~i~d~~~ 76 (272) T PRK10530 18 KKTILPSSLEALARAREAGYQLIIVTGRHHVAIH-PFYQA-----LA------LDTP-AICCNGT--------YLYDYQA 76 (272) T ss_pred CCCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHH-HHHHH-----HC------CCCC-EEEECCE--------EEEECCC T ss_conf 8959999999999999789999999599878889-99998-----09------9986-8976885--------9997799 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHHHCCCC Q ss_conf 83258883348988999999999712897 Q gi|254780799|r 687 GRVQRIHGPFVSDIEVEKVVSHLKTQGEA 715 (806) Q Consensus 687 ~~~~r~~g~~v~~~ev~~v~~~~~~q~~~ 715 (806) .+. ++.-.++-+++.++++++++.+-+ T Consensus 77 ~~~--l~~~~l~~~~~~~i~~~~~~~~~~ 103 (272) T PRK10530 77 KTV--LEADPMPVQQALQLIELLNEHQIH 103 (272) T ss_pred CEE--EEECCCCHHHHHHHHHHHHHCCCC T ss_conf 879--995479999999999999984995 No 169 >PTZ00207 hypothetical protein; Provisional Probab=80.46 E-value=4.5 Score=20.22 Aligned_cols=110 Identities=12% Similarity=0.162 Sum_probs=47.0 Q ss_pred CCCCHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH---HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHH---HHCCC Q ss_conf 4500013467899999999999-99999999999999---99982765236789999999999999999998---50444 Q gi|254780799|r 62 SPKNFLGYGGAIFADVAIQFFG-IASVFFLPPPTMWA---LSLLFDKKIYCFSKRATAWLINILVSATFFAS---FSPSQ 134 (806) Q Consensus 62 ~v~N~~G~lGA~iAd~L~~lFG-~~Ayllpl~Ll~~g---~~ll~~k~~~~~~~Rl~~~il~ll~~s~ll~~---~~~~~ 134 (806) .+-|.+|.++ .-..+++-.+| ..-+++..++...| +.+.++..+....+++-.+..++-+++.++.. ...-. T Consensus 69 TvG~v~gyf~-lP~g~lyD~~GP~~~~~i~~~~~~lG~ll~~L~F~g~I~gsv~~~~vf~ai~~~g~s~fDta~lvT~l~ 147 (591) T PTZ00207 69 TVGIAVGYFL-LPYSFIYDYLGPRPIFVLSMTVFCLGTLLFALTFQEVIEGSVVRLSVYNGLMTLGCMLFDLGAVVTVLS 147 (591) T ss_pred EHHHHHHHHH-HCHHHHHCCCCCHHHHHHHHHHHHHHHHHHHEEECCEECCCHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 0434312564-256977502486078999999999999998800477107851345799999973618877889999998 Q ss_pred CCCCC--------CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 32233--------343248899999999998646899999999999 Q gi|254780799|r 135 SWPIQ--------NGFGGIIGDLIIRLPFLFFESYPRKLGILFFQM 172 (806) Q Consensus 135 ~~~l~--------~g~GGiiG~~l~~~l~~~lg~~g~~l~lli~~i 172 (806) .+|.. .++-|+=+..+......+|.+-.....+++..+ T Consensus 148 ~FP~~RG~VV~ilKtf~GLGsAIi~~i~~afF~~~~s~f~yFL~~~ 193 (591) T PTZ00207 148 VFPSNRGAVVAIMKTFTGLGSAILGSIQLAFFSDNTSAYFFFLMSF 193 (591) T ss_pred HCCCCCCCEEEEEHHHCCCCHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 5754568568753121255199999999999569916899999999 No 170 >KOG0979 consensus Probab=80.41 E-value=1.2 Score=24.40 Aligned_cols=25 Identities=40% Similarity=0.586 Sum_probs=19.4 Q ss_pred CCCC-C--CCCCCCEEEEEECCCCCCEE Q ss_conf 1210-0--03886125553024566238 Q gi|254780799|r 392 SARV-A--VIPRRNAIGIELPNDIRETV 416 (806) Q Consensus 392 svRI-a--pIPGK~~VGIEiPN~~r~~V 416 (806) -.|| + .||++.-.+.-+|-.-|+++ T Consensus 500 k~~vn~~~~~~~~~k~~d~~p~~sre~l 527 (1072) T KOG0979 500 KWRVNASEVIPREKKYADKIPAQSREEL 527 (1072) T ss_pred CEEEECEECCCCCCCCCCCCCCCCHHHH T ss_conf 4101000025522345667885478999 No 171 >pfam03137 OATP Organic Anion Transporter Polypeptide (OATP) family. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution and substrate specificity. Since the numbering of different OATPs in particular species was based originally on the order of discovery, similarly numbered OATPs in humans and rats did not necessarily correspond in function, tissue distribution and substrate specificity (in spite of the name, some OATPs also transport organic cations and neutral molecules). Thus, Tamai et al. initiated the current scheme of using digits for rat OATPs and letters for human ones. Prostaglandin transporter (PGT) proteins are also considered to be OATP family members. In addition, the methotrexate transporter OATK is closely related to OATPs. This family also includes several predicted proteins from Caenorhabditis elegans and Drosophila melanogaster. Thi Probab=80.29 E-value=4.5 Score=20.18 Aligned_cols=50 Identities=14% Similarity=0.094 Sum_probs=23.2 Q ss_pred ECCEEEEEEEEECCCCCHHHHHHHHHHHH-HHHHHCCCCCCCCCCCCEEEEEECCC Q ss_conf 04424444432147863999997889999-88630012100038861255530245 Q gi|254780799|r 357 PGPVITLYELEPAPGIKSSRIIGLSDDIA-RSMSAISARVAVIPRRNAIGIELPND 411 (806) Q Consensus 357 pGPVVTrYEi~PApGVKvSKI~nLadDIA-~aLsa~svRIapIPGK~~VGIEiPN~ 411 (806) =|=+|.||.+.+..=.|..=|+++---+. .++-..+ =|-.+..||-+|-. T Consensus 359 GG~ivkk~kl~~~~~~~~~~i~~~v~~~~~~~~~~i~-----C~~~~~aGvt~~y~ 409 (582) T pfam03137 359 GGYIIKKFKLTARGMAKFAIVVILVSLAFYLPLFLIG-----CSSSPVAGVTPSYS 409 (582) T ss_pred HHHEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHEEC-----CCCCCCCEEECCCC T ss_conf 4321322076789999999999999999999996754-----88987252735788 No 172 >pfam00485 PRK Phosphoribulokinase / Uridine kinase family. In Arabidopsis the region carries two binding domains, a phosphoribosylpyrophosphate-binding domain and, at the very C-terminus, a uracil-binding domain. Probab=80.14 E-value=1.7 Score=23.29 Aligned_cols=22 Identities=27% Similarity=0.567 Sum_probs=18.2 Q ss_pred EEEEECCCCHHHHHHHHHHHHH Q ss_conf 3553047740679999999999 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSL 478 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~Sl 478 (806) ..|||.+|||||-.-|.+.-.| T Consensus 2 IgIaG~SgSGKTT~a~~L~~~l 23 (196) T pfam00485 2 IGVAGSSGAGKTTVARTFVSIF 23 (196) T ss_pred EEEECCCCCCHHHHHHHHHHHH T ss_conf 8998998571999999999996 No 173 >COG1643 HrpA HrpA-like helicases [DNA replication, recombination, and repair] Probab=79.97 E-value=2 Score=22.86 Aligned_cols=48 Identities=33% Similarity=0.412 Sum_probs=29.5 Q ss_pred CCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCC Q ss_conf 221000210456666667854100202355304774067999999999998299 Q gi|254780799|r 430 NQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMT 483 (806) Q Consensus 430 s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~ 483 (806) .+.+|+ .+||-.+--|+---||+| |+.|+. +..+.-=.+|+|+|--.. T Consensus 450 ~~g~LT-~lG~~ms~lpldprLA~m--Ll~a~~---~g~~~e~~~Ias~Ls~~~ 497 (845) T COG1643 450 DSGKLT-PLGKQMSLLPLDPRLARM--LLTAPE---GGCLGEAATIASMLSEQD 497 (845) T ss_pred CCCCCC-HHHHHHHHCCCCHHHHHH--HHHCCC---CCCHHHHHHHHHHHCCCC T ss_conf 888777-889999868998388899--885124---671889999997650688 No 174 >cd02024 NRK1 Nicotinamide riboside kinase (NRK) is an enzyme involved in the metabolism of nicotinamide adenine dinucleotide (NAD+). This enzyme catalyzes the phosphorylation of nicotinamide riboside (NR) to form nicotinamide mononucleotide (NMN). It defines the NR salvage pathway of NAD+ biosynthesis in addition to the pathways through nicotinic acid mononucleotide (NaMN). This enzyme can also phosphorylate the anticancer drug tiazofurin, which is an analog of nicotinamide riboside. Probab=79.96 E-value=1.2 Score=24.36 Aligned_cols=39 Identities=26% Similarity=0.362 Sum_probs=24.4 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHH Q ss_conf 553047740679999999999982995784788852310011102770343 Q gi|254780799|r 458 LIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNL 508 (806) Q Consensus 458 LIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHL 508 (806) =|||.||||||---|.+.-.+ |+ |.+|-- =+-|.+.-|+ T Consensus 3 gIaG~S~SGKTTla~~L~~~l-----~~-~~iI~q------DdyYk~~~~~ 41 (187) T cd02024 3 GISGVTNSGKTTLAKLLQRIL-----PN-CCVIHQ------DDFFKPEDEI 41 (187) T ss_pred EEECCCCCCHHHHHHHHHHHC-----CC-CEEECC------CCCCCCCCCC T ss_conf 996888875999999999987-----99-889715------4467884327 No 175 >cd03245 ABCC_bacteriocin_exporters ABC-type bacteriocin exporters. Many non-lantibiotic bacteriocins of lactic acid bacteria are produced as precursors which have N-terminal leader peptides that share similarities in amino acid sequence and contain a conserved processing site of two glycine residues in positions -1 and -2. A dedicated ATP-binding cassette (ABC) transporter is responsible for the proteolytic cleavage of the leader peptides and subsequent translocation of the bacteriocins across the cytoplasmic membrane. Probab=79.95 E-value=1.7 Score=23.30 Aligned_cols=151 Identities=22% Similarity=0.294 Sum_probs=68.2 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCC---------------HH----HEEEEEECCCHHHHHHCCCCHHHH Q ss_conf 54100202355304774067999999999998299---------------57----847888523100111027703431 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMT---------------PA----QCRLIMIDPKMLELSVYDGIPNLL 509 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~---------------P~----evkliliDPK~vEls~Y~~iPHLl 509 (806) .++.+==+.-|-|.+|||||-.+|.+.- ||..+ |+ .+-++.=||-..+-++++.|- +- T Consensus 25 l~i~~G~~v~ivG~sGsGKSTLl~ll~g--l~~p~~G~I~i~g~~~~~~~~~~~r~~i~~v~Q~~~lf~~Ti~eNi~-~~ 101 (220) T cd03245 25 LTIRAGEKVAIIGRVGSGKSTLLKLLAG--LYKPTSGSVLLDGTDIRQLDPADLRRNIGYVPQDVTLFYGTLRDNIT-LG 101 (220) T ss_pred EEECCCCEEEEECCCCCHHHHHHHHHHC--CCCCCCCEEEECCEEHHHHCHHHHHHCEEEECCCCEEECCCHHHHHH-CC T ss_conf 9987999999999999859999999967--25478658999999957725999973269991689676675999853-57 Q ss_pred CCCCCCHHHHHHHHHHH-----HHHHHHHHHHHHHCCCCCHHHH-HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 22334304566899999-----9999999999987089968999-99999887447866775446776545432223322 Q gi|254780799|r 510 TPVVTNPQKAVTVLKWL-----VCEMEERYQKMSKIGVRNIDGF-NLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEH 583 (806) Q Consensus 510 ~pVvTd~~kA~~aL~w~-----V~EMe~RY~l~a~~~vRni~~y-N~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 583 (806) .|-+++ .+...+++.+ +..+...|+..-.-+-+|+.+= -+|+.-|++- T Consensus 102 ~~~~~~-~~i~~~~~~~~l~~~i~~~~~g~~t~i~~~g~~LSgGqkQri~lARal------------------------- 155 (220) T cd03245 102 APLADD-ERILRAAELAGVTDFVNKHPNGLDLQIGERGRGLSGGQRQAVALARAL------------------------- 155 (220) T ss_pred CCCCCH-HHHHHHHHHHCCHHHHHHCCCCCCCCCCCCCCEECHHHHHHHHHHHHH------------------------- T ss_conf 977897-999999999597899973755434535899972189999999999999------------------------- Q ss_pred CCCCCCCEEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCC Q ss_conf 3234698687763446-88887321005889999998664142379999657775 Q gi|254780799|r 584 FDFQHMPYIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSV 637 (806) Q Consensus 584 ~~~~~lp~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsv 637 (806) -.=|-| +|.||- +-|=....+.+-+.|.++ ..|--.|+.|+|+++ T Consensus 156 ---~~~~~i-lilDEpts~LD~~~~~~i~~~l~~~-----~~~~Tvi~itH~~~~ 201 (220) T cd03245 156 ---LNDPPI-LLLDEPTSAMDMNSEERLKERLRQL-----LGDKTLIIITHRPSL 201 (220) T ss_pred ---HCCCCE-EEEECCCCCCCHHHHHHHHHHHHHH-----CCCCEEEEEECCHHH T ss_conf ---559999-9996875688989999999999987-----699989999359889 No 176 >PRK09302 circadian clock protein KaiC; Reviewed Probab=79.89 E-value=4.6 Score=20.09 Aligned_cols=262 Identities=18% Similarity=0.231 Sum_probs=129.3 Q ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEEEEEC--CEEEEEEE--EECCCCCHHHHHHHHHH-----HHHHHHHCCCC-CCCC Q ss_conf 566545789999997424863289984104--42444443--21478639999978899-----99886300121-0003 Q gi|254780799|r 329 PKVMQNNACTLKSVLSDFGIQGEIVNVRPG--PVITLYEL--EPAPGIKSSRIIGLSDD-----IARSMSAISAR-VAVI 398 (806) Q Consensus 329 ~eeL~~nA~lLE~tL~dFGVe~~Vv~v~pG--PVVTrYEi--~PApGVKvSKI~nLadD-----IA~aLsa~svR-IapI 398 (806) +..+++.-..|-..|+++|+-+=.+.-... ...++|-+ -.+-|| +.|..- .-|.|...--| ..-. T Consensus 142 ~~~~R~~l~~L~~~l~~~g~T~llt~E~~~~~~~~~~~g~Ee~vaDGV-----I~L~~~~~~~~~~R~L~V~KmRGs~~~ 216 (501) T PRK09302 142 PAVVRRELFRLFAWLKQKGVTAVMTGERGDEYGPIARYGVEEFVSDCV-----VILRNVLEGEKRTRTLEILKYRGTTHG 216 (501) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCHHHHHCCEE-----EEEEEECCCCCCEEEEEEEECCCCCCC T ss_conf 789999999999999877977999875666678754345244201307-----887654148852589999983588666 Q ss_pred CCCCEEEEEECCCCCCEEEEHHHCCCHHHHHC--CCCCHHHCCCCCCCCEEEEECCCCCE--------EEEEECCCCHHH Q ss_conf 88612555302456623873423072134302--21000210456666667854100202--------355304774067 Q gi|254780799|r 399 PRRNAIGIELPNDIRETVMLRDLIVSRVFEKN--QCDLAINLGKSIEGKPIIADLARMPH--------LLIAGTTGSGKS 468 (806) Q Consensus 399 PGK~~VGIEiPN~~r~~V~lreil~s~~f~~s--~~~L~iaLGKdI~G~pvv~DLakMPH--------LLIAGtTGSGKS 468 (806) +|+- ..+|-+.=-+. |- +.+. +.... ...++ +| +..|-+|=+ -||.|.+|+||| T Consensus 217 ~g~h--~f~It~~Gi~V--~P-~~~~-~~~~~~~~~rv~-------tG---i~~LD~llgGGl~~GsstLi~Gp~GtGKT 280 (501) T PRK09302 217 KGEY--PFTITEDGISV--LP-LTAM-RLTQRSSNERIS-------SG---VPDLDEMCGGGFFRGSIILVSGATGTGKT 280 (501) T ss_pred CCCE--EEEECCCCEEE--EE-CCHH-HCCCCCCCCCCC-------CC---CHHHHHHHCCCCCCCCEEEEECCCCCCHH T ss_conf 8723--37987998899--80-3624-335666520004-------79---73278872599758946999889998889 Q ss_pred HHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHH---HHCCCCCH Q ss_conf 999999999998299578478885231001110277034312233430456689999999999999999---87089968 Q gi|254780799|r 469 VAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQKM---SKIGVRNI 545 (806) Q Consensus 469 V~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~l~---a~~~vRni 545 (806) ..--..+..=+-+- +.|=+ ++.+|....|+ .-+.++.|-+.+++..-++- ..----++ T Consensus 281 tla~qFl~~~a~~G--E~~l~---------~~FeE~~~~l~--------~~a~~~G~dl~~~~~~G~l~i~~~~p~~~~~ 341 (501) T PRK09302 281 LLVSKFAEAACRRG--ERCLL---------FAFEESRAQLV--------RNATSWGIDLEEMERKGLLKIICARPESTGL 341 (501) T ss_pred HHHHHHHHHHHHCC--CCEEE---------EEEECCHHHHH--------HHHHHCCCCHHHHHHCCCEEEEEECCCCCCH T ss_conf 99999999998659--90899---------99967999999--------9999739984888748947999837000598 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCC-CHHHHHHHHHHHHHHHC Q ss_conf 99999999887447866775446776545432223322323469868776344688887321-00588999999866414 Q gi|254780799|r 546 DGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVAR-KDIESAVQRLAQMARAS 624 (806) Q Consensus 546 ~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~-~~ve~~i~rlaq~ara~ 624 (806) +.+=.++++.-++- =| =+||||=+.-+..... .++-..+.+|.+.-+.. T Consensus 342 ~e~~~~i~~~v~~~-----------------------------~~-~rVvIDsls~~~~~~~~~~~r~~l~~L~~~Lk~~ 391 (501) T PRK09302 342 EDHLQIIKREIEEF-----------------------------KP-SRVAVDPLSALARGGSLNEFRQFVIRLTDYLKQE 391 (501) T ss_pred HHHHHHHHHHHHHC-----------------------------CC-CEEEECCHHHHHHHCCHHHHHHHHHHHHHHHHHC T ss_conf 99999999999972-----------------------------99-8999958068765268599999999999999768 Q ss_pred CEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCC Q ss_conf 23799996577753554355411025158764586642123388 Q gi|254780799|r 625 GIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGE 668 (806) Q Consensus 625 GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~ 668 (806) |+-.++.-..+. ++|. .-++. ..+|+-+|+=..|.. T Consensus 392 gvT~l~t~~~~~---~~g~---~~~t~--~~iS~l~D~ii~Lry 427 (501) T PRK09302 392 EITGLFTNLTPD---FMGS---HSITE--SHISSLTDTWILLQY 427 (501) T ss_pred CCEEEEEEECCC---CCCC---CCCCC--CCHHHHCCEEEEEEE T ss_conf 978999761235---5676---64476--660011124789998 No 177 >cd03260 ABC_PstB_phosphate_transporter Phosphate uptake is of fundamental importance in the cell physiology of bacteria because phosphate is required as a nutrient. The Pst system of E. coli comprises four distinct subunits encoded by the pstS, pstA, pstB, and pstC genes. The PstS protein is a phosphate-binding protein located in the periplasmic space. P stA and PstC are hydrophobic and they form the transmembrane portion of the Pst system. PstB is the catalytic subunit, which couples the energy of ATP hydrolysis to the import of phosphate across cellular membranes through the Pst system, often referred as ABC-protein. PstB belongs to one of the largest superfamilies of proteins characterized by a highly conserved adenosine triphosphate (ATP) binding cassette (ABC), which is also a nucleotide binding domain (NBD). Probab=79.45 E-value=4.8 Score=19.99 Aligned_cols=145 Identities=21% Similarity=0.332 Sum_probs=69.0 Q ss_pred CCCEEEEEECCCCHHHHHHHHHHHHHHH--HCCHHHEEEEEECCCHH--------HHH-----HCCCCHHHHCCCCCCHH Q ss_conf 0202355304774067999999999998--29957847888523100--------111-----02770343122334304 Q gi|254780799|r 453 RMPHLLIAGTTGSGKSVAINTMILSLLY--RMTPAQCRLIMIDPKML--------ELS-----VYDGIPNLLTPVVTNPQ 517 (806) Q Consensus 453 kMPHLLIAGtTGSGKSV~iN~iI~SlLy--k~~P~evkliliDPK~v--------Els-----~Y~~iPHLl~pVvTd~~ 517 (806) +==-+-|=|..|||||..++ +|.-|+. ...|++=... +|-+-+ ++- ++. =|+|. | -|=-. T Consensus 25 ~Ge~~~iiG~SGsGKSTll~-~i~gL~~~~~~~p~~G~I~-~~g~~i~~~~~~~~~~R~~ig~VfQ-~~~lf-~-~TV~e 99 (227) T cd03260 25 KGEITALIGPSGCGKSTLLR-LLNRLNDLIPGAPDEGEVL-LDGKDIYDLDVDVLELRRRVGMVFQ-KPNPF-P-GSIYD 99 (227) T ss_pred CCCEEEEECCCCCCHHHHHH-HHHHHHHCCCCCCCCEEEE-ECCEECCCCCCCHHHHHHCCEEECC-CCCCC-C-CCHHH T ss_conf 99899999999981999999-9974450268998146999-9999998899587889628268764-77667-7-80999 Q ss_pred HHHHHHHH----HHHHHHHH-HHHHHHCCCC----------CHHHHH-HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 56689999----99999999-9999870899----------689999-99998874478667754467765454322233 Q gi|254780799|r 518 KAVTVLKW----LVCEMEER-YQKMSKIGVR----------NIDGFN-LKVAQYHNTGKKFNRTVQTGFDRKTGEAIYET 581 (806) Q Consensus 518 kA~~aL~w----~V~EMe~R-Y~l~a~~~vR----------ni~~yN-~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 581 (806) ..+.+|+. --.|+++| .++|...|.- ++.|-- +|+.-|++- T Consensus 100 Ni~~~l~~~~~~~~~~~~~~v~~~L~~vgL~~~~~~~~~p~~LSGGq~QRvaIArAL----------------------- 156 (227) T cd03260 100 NVAYGLRLHGIKLKEELDERVEEALRKAALWDEVKDRLHALGLSGGQQQRLCLARAL----------------------- 156 (227) T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCHHHHHHHHHHHHH----------------------- T ss_conf 999999983899999999999999987799758844568022899999999999998----------------------- Q ss_pred CCCCCCCCCEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 22323469868776344-68888732100588999999866414237999965777 Q gi|254780799|r 582 EHFDFQHMPYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 582 ~~~~~~~lp~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) -.=|- |++-|| .+-|=....++|++.|.+|.+. +-.|+.|-+.. T Consensus 157 -----~~~P~-iLllDEPTs~LD~~~~~~i~~li~~l~~~-----~Tii~vTHdl~ 201 (227) T cd03260 157 -----ANEPE-VLLLDEPTSALDPISTAKIEELIAELKKE-----YTIVIVTHNMQ 201 (227) T ss_pred -----HCCCC-EEEECCCCCCCCHHHHHHHHHHHHHHHHC-----CEEEEEECCHH T ss_conf -----35999-99968987657989999999999999668-----88999936999 No 178 >PRK08118 topology modulation protein; Reviewed Probab=79.44 E-value=1 Score=25.02 Aligned_cols=18 Identities=39% Similarity=0.698 Sum_probs=14.6 Q ss_pred CCEEEEEECCCCHHHHHH Q ss_conf 202355304774067999 Q gi|254780799|r 454 MPHLLIAGTTGSGKSVAI 471 (806) Q Consensus 454 MPHLLIAGtTGSGKSV~i 471 (806) |--.+|-|+.|||||-.= T Consensus 1 M~rI~IiG~~GsGKSTlA 18 (167) T PRK08118 1 MKKIILIGSGGSGKSTLA 18 (167) T ss_pred CCEEEEECCCCCCHHHHH T ss_conf 967999889998799999 No 179 >PRK11512 DNA-binding transcriptional repressor MarR; Provisional Probab=79.22 E-value=3.7 Score=20.79 Aligned_cols=56 Identities=11% Similarity=0.171 Sum_probs=32.1 Q ss_pred HHHHHHHCCCEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHC-CCCCEEEECHH Q ss_conf 999999659850001422200117789999999997798680227-88726731712 Q gi|254780799|r 746 AVDIVLRDNKASISYIQRRLGIGYNRAASIIENMEEKGVIGPASS-TGKREILISSM 801 (806) Q Consensus 746 a~~~v~~~~~~s~s~lqr~~~igy~raar~~~~~e~~giv~~~~g-~~~r~vl~~~~ 801 (806) ++..+.+.+-.|.+-|-+++.+-=+-..|++|.||++|.|--.-. .-.|.++|.-. T Consensus 45 vL~~L~~~~~~t~~eLa~~l~id~~tvt~~ldrLe~~GlI~R~~~p~DrR~~~v~LT 101 (144) T PRK11512 45 VLCSIRCAACITPVELKKVLSVDLGALTRMLDRLVCKGWVERLPNPNDKRGVLVKLT 101 (144) T ss_pred HHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCEEECCCCCCCCEEEEEEC T ss_conf 999999869979999999978888789999999997879663579877624787889 No 180 >PRK06749 replicative DNA helicase; Provisional Probab=79.20 E-value=4.9 Score=19.93 Aligned_cols=141 Identities=13% Similarity=0.170 Sum_probs=75.0 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHH----------CCCCH--HHHCCC--CCCHHHHHH Q ss_conf 2355304774067999999999998299578478885231001110----------27703--431223--343045668 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSV----------YDGIP--NLLTPV--VTNPQKAVT 521 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~----------Y~~iP--HLl~pV--vTd~~kA~~ 521 (806) =..|||-+|.|||..--.|.....-...| | .+. -+|+|. ..+|| ++-.|- .++ +--. T Consensus 188 LiviaaRPsmGKTa~alnia~~~a~~g~~--v--~~f---SlEMs~~~l~~R~ls~~s~v~~~~i~~~~~~~~~--~~~~ 258 (428) T PRK06749 188 FVVLGARPSMGKTAFALNVGLHAAKSGAA--V--GLF---SLEMSSKQLLKRMASCVGEVSGGRLKNPKHRFAM--EDWE 258 (428) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHHCCCC--E--EEE---ECCCCHHHHHHHHHHHHCCCCHHHHHCCCCCCCH--HHHH T ss_conf 89996279897689999999999964992--7--998---3789999999999997549988886277677999--9999 Q ss_pred HHHHHHHHHHHHHHH-HHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHH Q ss_conf 999999999999999-9870899689999999988744786677544677654543222332232346986877634468 Q gi|254780799|r 522 VLKWLVCEMEERYQK-MSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMA 600 (806) Q Consensus 522 aL~w~V~EMe~RY~l-~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDEla 600 (806) .+..++.+|.+ ..+ +-.-....|..--.+.+..+.+. .....+||||=| T Consensus 259 ~~~~a~~~l~~-~~l~i~d~~~~ti~~i~~~~r~~~~~~----------------------------g~~~~livIDYl- 308 (428) T PRK06749 259 KVSKAFAEIGE-LPLEIYDNAGVTVQDIWMQTRKLKRKH----------------------------GDKKVLIIVDYL- 308 (428) T ss_pred HHHHHHHHHHC-CCEEEECCCCCCHHHHHHHHHHHHHHC----------------------------CCCCEEEEEEHH- T ss_conf 99999999855-965997589976799999999999974----------------------------998769997677- Q ss_pred HHHHHCCC-------HHHHHHHHHHHHHHHCCEEEEEEECCC Q ss_conf 88873210-------058899999986641423799996577 Q gi|254780799|r 601 DLMMVARK-------DIESAVQRLAQMARASGIHVIMATQRP 635 (806) Q Consensus 601 Dlmm~~~~-------~ve~~i~rlaq~ara~GiHli~aTqrP 635 (806) -||...++ +|..--.+|-++|+--+|+.|+..|=- T Consensus 309 qLi~~~~~~~~~r~~ev~~isr~lK~lAkel~vpvi~lsQLn 350 (428) T PRK06749 309 QLITGDPKHKGNRFQEISEISRKLKLLARELNVCVVALSQLS 350 (428) T ss_pred HHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEECCCC T ss_conf 650578777778999999999999999999699899971378 No 181 >pfam00350 Dynamin_N Dynamin family. Probab=79.13 E-value=1.5 Score=23.85 Aligned_cols=21 Identities=24% Similarity=0.512 Sum_probs=18.1 Q ss_pred EEEEECCCCHHHHHHHHHHHH Q ss_conf 355304774067999999999 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILS 477 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~S 477 (806) +.|-|.++||||-.||+++=- T Consensus 1 ivvvG~~ssGKSSliNALlG~ 21 (168) T pfam00350 1 IAVVGDQSAGKSSVLNALLGR 21 (168) T ss_pred CEEECCCCCCHHHHHHHHHCC T ss_conf 989917889899999999788 No 182 >PRK13767 ATP-dependent helicase; Provisional Probab=78.87 E-value=3.8 Score=20.75 Aligned_cols=18 Identities=17% Similarity=0.348 Sum_probs=8.3 Q ss_pred HHHCCCEEEEHHHHHHCC Q ss_conf 996598500014222001 Q gi|254780799|r 750 VLRDNKASISYIQRRLGI 767 (806) Q Consensus 750 v~~~~~~s~s~lqr~~~i 767 (806) .++..-..+.|.+||||- T Consensus 734 ~L~~~L~~S~l~krrFR~ 751 (878) T PRK13767 734 ILKRALDRTELLKRRFRH 751 (878) T ss_pred HHHHHHCCCHHHHHHHHH T ss_conf 999997458999999999 No 183 >cd02026 PRK Phosphoribulokinase (PRK) is an enzyme involved in the Benson-Calvin cycle in chloroplasts or photosynthetic prokaryotes. This enzyme catalyzes the phosphorylation of D-ribulose 5-phosphate to form D-ribulose 1, 5-biphosphate, using ATP and NADPH produced by the primary reactions of photosynthesis. Probab=78.76 E-value=2.4 Score=22.22 Aligned_cols=33 Identities=24% Similarity=0.379 Sum_probs=23.4 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECC Q ss_conf 5530477406799999999999829957847888523 Q gi|254780799|r 458 LIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDP 494 (806) Q Consensus 458 LIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDP 494 (806) =|||.+|||||-..|++.-- + ..+.|-.|-.|= T Consensus 3 gVaG~SGSGKTTv~~~i~~i-f---g~~~v~vI~~D~ 35 (273) T cd02026 3 GVAGDSGCGKSTFLRRLTSL-F---GSDLVTVICLDD 35 (273) T ss_pred EEECCCCCCHHHHHHHHHHH-H---CCCCEEEEECCC T ss_conf 99788878699999999998-5---848769996577 No 184 >KOG0926 consensus Probab=78.66 E-value=0.46 Score=27.57 Aligned_cols=242 Identities=23% Similarity=0.335 Sum_probs=123.1 Q ss_pred CCCCCCEEEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEE Q ss_conf 24566238734230721343022100021045666666785410020235530477406799999999999829957847 Q gi|254780799|r 409 PNDIRETVMLRDLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCR 488 (806) Q Consensus 409 PN~~r~~V~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evk 488 (806) ||..|+++|- .|=.+.+-+++...||+.--- .-|+--+..=|-..|+|.|||||.-- +--.||-+-=..=. T Consensus 231 P~~~~~a~yV-~V~R~~EIQ~sR~~LPI~aeE----q~IMEaIn~n~vvIIcGeTGsGKTTQ----vPQFLYEAGf~s~~ 301 (1172) T KOG0926 231 PACRRKAFYV-IVSRPAEIQESRLDLPIVAEE----QRIMEAINENPVVIICGETGSGKTTQ----VPQFLYEAGFASEQ 301 (1172) T ss_pred CCCCCCCEEE-EECCCHHHHHHHHCCCHHHHH----HHHHHHHHCCCEEEEECCCCCCCCCC----CHHHHHHCCCCCCC T ss_conf 7545530799-844857888877517636789----99999862287499954888886443----41899871347766 Q ss_pred EEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHH-HHHH-HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 888523100111027703431223343045668-9999-99999999999987089968999999998874478667754 Q gi|254780799|r 489 LIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVT-VLKW-LVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTV 566 (806) Q Consensus 489 liliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~-aL~w-~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~ 566 (806) ..+ |-++ =||.|+..+. |... +-.||-. + -+=-+| T Consensus 302 -------------~~~-~gmI--GITqPRRVAaiamAkRVa~EL~~-------~--~~eVsY------------------ 338 (1172) T KOG0926 302 -------------SSS-PGMI--GITQPRRVAAIAMAKRVAFELGV-------L--GSEVSY------------------ 338 (1172) T ss_pred -------------CCC-CCEE--EECCCHHHHHHHHHHHHHHHHCC-------C--CCCEEE------------------ T ss_conf -------------799-8705--40572278999999999998525-------7--641148------------------ Q ss_pred CCCCCCCCCCC----------CCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCH--HHHHHHHHHHHHHH-----CCEEEE Q ss_conf 46776545432----------22332232346986877634468888732100--58899999986641-----423799 Q gi|254780799|r 567 QTGFDRKTGEA----------IYETEHFDFQHMPYIVVVIDEMADLMMVARKD--IESAVQRLAQMARA-----SGIHVI 629 (806) Q Consensus 567 ~~~~~~~~~~~----------~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~--ve~~i~rlaq~ara-----~GiHli 629 (806) |.-||...++. ...+-.-+.----|=|||+||--.==|-..== .=.-|+||-|+--- --..|| T Consensus 339 qIRfd~ti~e~T~IkFMTDGVLLrEi~~DflL~kYSvIIlDEAHERSvnTDILiGmLSRiV~LR~k~~ke~~~~kpLKLI 418 (1172) T KOG0926 339 QIRFDGTIGEDTSIKFMTDGVLLREIENDFLLTKYSVIILDEAHERSVNTDILIGMLSRIVPLRQKYYKEQCQIKPLKLI 418 (1172) T ss_pred EEEECCCCCCCCEEEEECCHHHHHHHHHHHHHHHCEEEEECHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEE T ss_conf 99853656887404774023889998876755420157851254303127899999988778889976641356761699 Q ss_pred E--EECCCCC--------CCCCHHHHH---CCCCEEEEEECCCCCCHHHCCCCCHHH-----HCCCCCEEEECCCCCEEE Q ss_conf 9--9657775--------355435541---102515876458664212338864578-----658875477368983258 Q gi|254780799|r 630 M--ATQRPSV--------DVITGTIKA---NFPTRISFQVSSKIDSRTILGEQGAEQ-----LLGQGDMLYMTGGGRVQR 691 (806) Q Consensus 630 ~--aTqrPsv--------dvitg~ika---n~p~riaf~v~s~~dSrtild~~gae~-----Llg~gdml~~~~~~~~~r 691 (806) + ||-|=|= -+++.+||- .||--|-|+-....| - =||. .+-+ =+|+|+=++= T Consensus 419 IMSATLRVsDFtenk~LFpi~pPlikVdARQfPVsIHF~krT~~D------Y-i~eAfrKtc~IH~----kLP~G~ILVF 487 (1172) T KOG0926 419 IMSATLRVSDFTENKRLFPIPPPLIKVDARQFPVSIHFNKRTPDD------Y-IAEAFRKTCKIHK----KLPPGGILVF 487 (1172) T ss_pred EEEEEEEECCCCCCCEECCCCCCEEEEECCCCCEEEEECCCCCCH------H-HHHHHHHHHHHHH----CCCCCCEEEE T ss_conf 974147711024675424789960553045285688732679841------7-8999999998861----0899827999 Q ss_pred EEECCCCHHHHHHHHHHHHHCCCCCC Q ss_conf 88334898899999999971289742 Q gi|254780799|r 692 IHGPFVSDIEVEKVVSHLKTQGEAKY 717 (806) Q Consensus 692 ~~g~~v~~~ev~~v~~~~~~q~~~~y 717 (806) +-| -.||...|.-+|....-.| T Consensus 488 vTG----QqEV~qL~~kLRK~~p~~f 509 (1172) T KOG0926 488 VTG----QQEVDQLCEKLRKRFPESF 509 (1172) T ss_pred EEC----HHHHHHHHHHHHHHCCCCC T ss_conf 807----5899999999986484102 No 185 >PRK05917 DNA polymerase III subunit delta'; Validated Probab=78.63 E-value=2.1 Score=22.62 Aligned_cols=150 Identities=17% Similarity=0.276 Sum_probs=85.3 Q ss_pred CCCCE-EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCC-H--HHHCCCCCCHHHHHHHHHHHH Q ss_conf 00202-3553047740679999999999982995784788852310011102770-3--431223343045668999999 Q gi|254780799|r 452 ARMPH-LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGI-P--NLLTPVVTNPQKAVTVLKWLV 527 (806) Q Consensus 452 akMPH-LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~i-P--HLl~pVvTd~~kA~~aL~w~V 527 (806) .|+|| +|+.|..|+||+-+--.+-..||....|+.+.. +.+++ | |.+.| +.+... T Consensus 16 ~Rl~HAyLf~Gp~G~GK~~~A~~~A~~LLc~~~p~~~~~-----------i~~~~HPD~~~i~p---e~k~~~------- 74 (290) T PRK05917 16 QKVPSAILLHGQDLSNLSQYAYELASLILLESSPEAQYK-----------ISQKIHPDIHEFFP---EGKGRL------- 74 (290) T ss_pred CCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCCCCHHHH-----------HHHCCCCCEEEECC---CCCCCC------- T ss_conf 996606876899986599999999999857899616889-----------87468998599615---777887------- Q ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCC Q ss_conf 99999999998708996899999999887447866775446776545432223322323469868776344688887321 Q gi|254780799|r 528 CEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVAR 607 (806) Q Consensus 528 ~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~ 607 (806) .++..+.+..+++. . -|. .=.|=|+|||| ||.|-.++ T Consensus 75 ------------~~Id~iR~l~~~i~---~-------------~p~--------------~g~~KV~IId~-Ad~Mn~~A 111 (290) T PRK05917 75 ------------HSIETPRAIKKQIW---I-------------HPY--------------EANYKIYIIHE-ADRMTLDA 111 (290) T ss_pred ------------CCHHHHHHHHHHHC---C-------------CCC--------------CCCCEEEEECC-HHHHCHHH T ss_conf ------------86789999999964---1-------------864--------------68826999756-77638999 Q ss_pred -----CHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCHHHHCCCCCEEE Q ss_conf -----005889999998664142379999657775355435541102515876458664212338864578658875477 Q gi|254780799|r 608 -----KDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGAEQLLGQGDMLY 682 (806) Q Consensus 608 -----~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~gae~Llg~gdml~ 682 (806) |-.|++ ..+.+.||.|.+|. -+-.+|++ |.- -.+ T Consensus 112 aNALLKtLEEP---------P~~tvfILit~~~~--~lLpTI~S----------------RCQ--------------~I~ 150 (290) T PRK05917 112 ISAFLKVLEDP---------PKHSVIILTSAKPQ--RLPPTIRS----------------RSL--------------SIH 150 (290) T ss_pred HHHHHHHHCCC---------CCCEEEEEEECCHH--HCCHHHHH----------------CCC--------------EEE T ss_conf 99999973479---------87859999869925--48237763----------------351--------------167 Q ss_pred ECCCCCEEEEEECCCCHHHHHHHHHHHHHC Q ss_conf 368983258883348988999999999712 Q gi|254780799|r 683 MTGGGRVQRIHGPFVSDIEVEKVVSHLKTQ 712 (806) Q Consensus 683 ~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q 712 (806) .|... --.+++++++.++.|...+ T Consensus 151 i~~~e------~~~i~~e~~~~l~~~a~g~ 174 (290) T PRK05917 151 IPGEE------KTLPSKEDIAYLIRYAQGK 174 (290) T ss_pred CCCCC------CCCCCHHHHHHHHHHHCCC T ss_conf 77620------1347889999999984697 No 186 >pfam05116 S6PP Sucrose-6F-phosphate phosphohydrolase. This family consists of Sucrose-6F-phosphate phosphohydrolase proteins found in plants and cyanobacteria. Sucrose-6(F)-phosphate phosphohydrolase catalyses the final step in the pathway of sucrose biosynthesis. Probab=78.59 E-value=1.7 Score=23.41 Aligned_cols=59 Identities=15% Similarity=0.162 Sum_probs=33.8 Q ss_pred CEEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCC Q ss_conf 868776344688887321005889999998664142379999657775355435541102 Q gi|254780799|r 590 PYIVVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFP 649 (806) Q Consensus 590 p~ivviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p 649 (806) |+.+|+.|==--| +...+.+-....+.-+.+++.||++++||=||-..+..=+=+-+++ T Consensus 1 P~~LIa~DlDGTL-L~~~~~i~~~~~~al~~l~~~Gi~vviaTGR~~~~~~~~~~~l~l~ 59 (247) T pfam05116 1 PRLLLVSDLDNTL-VDGDNEALARLNQLLEAQYRPDSLLVFATGRSLDSAKELLKEKPLP 59 (247) T ss_pred CCEEEEEECCCCC-CCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHCCCC T ss_conf 9889999387878-0934958899999999999589889998899999999999975989 No 187 >cd03280 ABC_MutS2 MutS2 homologs in bacteria and eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family also possess a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, a Probab=78.38 E-value=3.8 Score=20.70 Aligned_cols=98 Identities=20% Similarity=0.313 Sum_probs=58.0 Q ss_pred HHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHH-HHHHHHHC-C--HHHEEEEEECCCHHHHHHC Q ss_conf 430221000210456666667854100202355304774067999999-99999829-9--5784788852310011102 Q gi|254780799|r 427 FEKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTM-ILSLLYRM-T--PAQCRLIMIDPKMLELSVY 502 (806) Q Consensus 427 f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~i-I~SlLyk~-~--P~evkliliDPK~vEls~Y 502 (806) +++..|||-...+++.-.+.+..|=.+ .-++|-|.-.||||..+-++ +..+|-.. . |.. +.-++.+| T Consensus 2 i~~~rHPll~~~~~~~V~Ndi~l~~~~-~~~iiTGpN~sGKSt~lk~i~l~~iLAq~G~~vpa~--------~~~~~~~~ 72 (200) T cd03280 2 LREARHPLLPLQGEKVVPLDIQLGENK-RVLVITGPNAGGKTVTLKTLGLLTLMAQSGLPIPAA--------EGSSLPVF 72 (200) T ss_pred CCCCCCCEEECCCCCEECCEEEECCCC-EEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEC--------CCEECCCC T ss_conf 877728859757895476258977993-399998898775099999999999999977780011--------00472665 Q ss_pred CCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 77034312233430456689999999999999999 Q gi|254780799|r 503 DGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQKM 537 (806) Q Consensus 503 ~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~l~ 537 (806) +.| ++- +-|...-...+.--..||.|=-+++ T Consensus 73 d~i---~~~-i~~~d~~~~~~S~F~~E~~~~~~il 103 (200) T cd03280 73 ENI---FAD-IGDEQSIEQSLSTFSSHMKNIARIL 103 (200) T ss_pred CEE---EEE-ECCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 678---888-4673366677879999999999999 No 188 >cd01852 AIG1 AIG1 (avrRpt2-induced gene 1). This represents Arabidoposis protein AIG1 that appears to be involved in plant resistance to bacteria. The Arabidopsis disease resistance gene RPS2 is involved in recognition of bacterial pathogens carrying the avirulence gene avrRpt2. AIG1 exhibits RPS2- and avrRpt1-dependent induction early after infection with Pseudomonas syringae carrying avrRpt2. This subfamily also includes IAN-4 protein, which has GTP-binding activity and shares sequence homology with a novel family of putative GTP-binding proteins: the immuno-associated nucleotide (IAN) family. The evolutionary conservation of the IAN family provides a unique example of a plant pathogen response gene conserved in animals. The IAN/IMAP subfamily has been proposed to regulate apoptosis in vertebrates and angiosperm plants, particularly in relation to cancer, diabetes, and infections. The human IAN genes were renamed GIMAP (GTPase of the immunity associated proteins). Probab=78.15 E-value=0.68 Score=26.32 Aligned_cols=21 Identities=43% Similarity=0.661 Sum_probs=18.6 Q ss_pred EEEEEECCCCHHHHHHHHHHH Q ss_conf 235530477406799999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMIL 476 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~ 476 (806) +++.-|-||||||..-|||+- T Consensus 2 rIvLlGktGsGKSstgNtILG 22 (196) T cd01852 2 RLVLVGKTGAGKSATGNTILG 22 (196) T ss_pred EEEEECCCCCCCHHHHEEECC T ss_conf 899978999861534032568 No 189 >cd03224 ABC_TM1139_LivF_branched LivF (TM1139) is part of the LIV-I bacterial ABC-type two-component transport system that imports neutral, branched-chain amino acids. The E. coli branched-chain amino acid transporter comprises a heterodimer of ABC transporters (LivF and LivG), a heterodimer of six-helix TM domains (LivM and LivH), and one of two alternative soluble periplasmic substrate binding proteins (LivK or LivJ). ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. Probab=78.09 E-value=5.2 Score=19.70 Aligned_cols=148 Identities=18% Similarity=0.251 Sum_probs=73.6 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHH--------------HCCHHH-----EEEEEECCCHH-HHHHCCCCHHH Q ss_conf 54100202355304774067999999999998--------------299578-----47888523100-11102770343 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLY--------------RMTPAQ-----CRLIMIDPKML-ELSVYDGIPNL 508 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLy--------------k~~P~e-----vkliliDPK~v-Els~Y~~iPHL 508 (806) ..+.+=-.+-+.|..|||||-.+|+|.- ++- ..+|.+ +-++.=||-.. .||+++.+- T Consensus 21 ~~v~~Gei~~liG~nGaGKSTLl~~i~G-l~~p~~G~I~~~G~~i~~~~~~~~~~~gi~~v~Q~~~l~~~ltv~enl~-- 97 (222) T cd03224 21 LTVPEGEIVALLGRNGAGKTTLLKTIMG-LLPPRSGSIRFDGRDITGLPPHERARAGIGYVPEGRRIFPELTVEENLL-- 97 (222) T ss_pred EEECCCCEEEEECCCCCCHHHHHHHHHC-CCCCCCEEEEECCEECCCCCHHHHHHCCEEEECCCCCCCCCCCHHHHHH-- T ss_conf 9988998999999999859999999977-9889960999999999999999999759389635665688990999999-- Q ss_pred HCCCCCCHHHHHHHHHHHHHHHHHHH---HHHHHCCCCCHHHHHH-HHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 12233430456689999999999999---9998708996899999-9998874478667754467765454322233223 Q gi|254780799|r 509 LTPVVTNPQKAVTVLKWLVCEMEERY---QKMSKIGVRNIDGFNL-KVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHF 584 (806) Q Consensus 509 l~pVvTd~~kA~~aL~w~V~EMe~RY---~l~a~~~vRni~~yN~-k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 584 (806) +.--.... ...+|.++++-.++ +-+.+..++++.+=++ |+.-|++-- T Consensus 98 ~~~~~~~~----~~~~~~~~~~l~~~~~l~~~~~~~~~~LSgG~~Qrv~iAraL~------------------------- 148 (222) T cd03224 98 LGAYARRR----AKRKARLERVYELFPRLKERRKQLAGTLSGGEQQMLAIARALM------------------------- 148 (222) T ss_pred HHHHHCCC----HHHHHHHHHHHHHHHHHHHHHHCCHHHCCHHHHHHHHHHHHHH------------------------- T ss_conf 98763581----3599999999988663799874845448999999999999996------------------------- Q ss_pred CCCCCCEEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 234698687763446-8888732100588999999866414237999965777 Q gi|254780799|r 585 DFQHMPYIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 585 ~~~~lp~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) .=|- |++.||= +-|=..+.+++-+.|.+| ++.|+-.|+.|...+ T Consensus 149 ---~~P~-lllLDEPt~gLD~~~~~~i~~~l~~l----~~~g~tii~vtHdl~ 193 (222) T cd03224 149 ---SRPK-LLLLDEPSEGLAPKIVEEIFEAIREL----RDEGVTILLVEQNAR 193 (222) T ss_pred ---CCCC-EEEEECCCCCCCHHHHHHHHHHHHHH----HHCCCEEEEECCCHH T ss_conf ---4999-99993865479999999999999999----956999999908589 No 190 >cd03227 ABC_Class2 ABC-type Class 2 contains systems involved in cellular processes other than transport. These families are characterised by the fact that the ABC subunit is made up of duplicated, fused ABC modules (ABC2). No known transmembrane proteins or domains are associated with these proteins. Probab=77.97 E-value=3.1 Score=21.37 Aligned_cols=31 Identities=29% Similarity=0.398 Sum_probs=25.4 Q ss_pred CEEEEEECCCCHHHHHHHHHHHHHHHHCCHH Q ss_conf 0235530477406799999999999829957 Q gi|254780799|r 455 PHLLIAGTTGSGKSVAINTMILSLLYRMTPA 485 (806) Q Consensus 455 PHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~ 485 (806) -.++|-|.-|||||+.++++-+-+++...+. T Consensus 22 ~~~iItGpN~sGKSt~Lr~i~l~~~~a~~~~ 52 (162) T cd03227 22 SLTIITGPNGSGKSTILDAIGLALGGAQSAT 52 (162) T ss_pred CEEEEECCCCCCHHHHHHHHHHHHHHHHCCC T ss_conf 6899989987757999999999999863267 No 191 >TIGR03600 phage_DnaB phage replicative helicase, DnaB family, HK022 subfamily. Members of this family are phage (or prophage-region) homologs of the bacterial homohexameric replicative helicase DnaB. Some phage may rely on host DnaB, while others encode their own verions. This model describes the largest phage-specific clade among the close homologs of DnaB, but there are, or course, other DnaB homologs from phage that fall outside the scope of this model. Probab=77.94 E-value=5.3 Score=19.67 Aligned_cols=140 Identities=18% Similarity=0.246 Sum_probs=71.4 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHH-CCHHHEEEEEECCCHHHHHH----------CCCCH--HHHCCCCCCHHHHHHH Q ss_conf 23553047740679999999999982-99578478885231001110----------27703--4312233430456689 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYR-MTPAQCRLIMIDPKMLELSV----------YDGIP--NLLTPVVTNPQKAVTV 522 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk-~~P~evkliliDPK~vEls~----------Y~~iP--HLl~pVvTd~~kA~~a 522 (806) =..|||-+|.|||...-.|...+-.+ ..| -.++ -+|+|- -.+|| ++-.+-.|+- --.. T Consensus 196 LiIiaARPsmGKTafalnia~n~A~~~g~~----Vl~f---SLEMs~eql~~R~la~~s~i~~~~i~~g~l~~~--~~~~ 266 (421) T TIGR03600 196 LIVIGARPSMGKTTLALNIAENVALREGKP----VLFF---SLEMSAEQLGERLLASKSGINTGNIRTGRFNDS--DFNR 266 (421) T ss_pred EEEEEEECCCCHHHHHHHHHHHHHHHCCCE----EEEE---ECCCCHHHHHHHHHHHHCCCCHHHHHCCCCCHH--HHHH T ss_conf 899985467874599999999999866983----8999---257999999999999854897766652899989--9999 Q ss_pred HHHHHHHHHHHHHHH-HHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHH Q ss_conf 999999999999999-8708996899999999887447866775446776545432223322323469868776344688 Q gi|254780799|r 523 LKWLVCEMEERYQKM-SKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMAD 601 (806) Q Consensus 523 L~w~V~EMe~RY~l~-a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaD 601 (806) +..++.+|.+ ..++ -...-.++...-.+++..+.+. .++ =+||||=| . T Consensus 267 ~~~a~~~l~~-~~l~i~d~~~~ti~~ir~~~r~~~~~~---------------------------~~l--~livIDYL-q 315 (421) T TIGR03600 267 LLNAVDRLSE-KDLYIDDTGGLTVAQIRSIARRIKRKK---------------------------GGL--DLIVVDYI-Q 315 (421) T ss_pred HHHHHHHHHC-CCEEEECCCCCCHHHHHHHHHHHHHHC---------------------------CCC--CEEEECHH-H T ss_conf 9999998616-878996699887678999999999862---------------------------898--69997578-6 Q ss_pred HHHHCCC-----HHHHHHHHHHHHHHHCCEEEEEEECCC Q ss_conf 8873210-----058899999986641423799996577 Q gi|254780799|r 602 LMMVARK-----DIESAVQRLAQMARASGIHVIMATQRP 635 (806) Q Consensus 602 lmm~~~~-----~ve~~i~rlaq~ara~GiHli~aTqrP 635 (806) ||-...+ +|..---.|-++|+--+|..|+..|=- T Consensus 316 Li~~~~~~~r~~ei~~Isr~LK~lAkel~ipVi~lsQLn 354 (421) T TIGR03600 316 LMAPTRGRDRNEELGGISRGLKALAKELDVPVVLLAQLN 354 (421) T ss_pred HCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEECCCC T ss_conf 537888888899999999999999999799789970578 No 192 >PRK11007 trehalose(maltose)-specific PTS system components IIBC; Provisional Probab=77.89 E-value=5.3 Score=19.66 Aligned_cols=17 Identities=12% Similarity=-0.029 Sum_probs=7.5 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999 Q gi|254780799|r 84 IASVFFLPPPTMWALSL 100 (806) Q Consensus 84 ~~Ayllpl~Ll~~g~~l 100 (806) |.+=++|.++..|-... T Consensus 228 Y~ssViPiIl~v~~~s~ 244 (473) T PRK11007 228 YQAQVIPALLAGLALGF 244 (473) T ss_pred CCCCHHHHHHHHHHHHH T ss_conf 75308999999999999 No 193 >TIGR00235 udk uridine kinase; InterPro: IPR000764 Uridine kinase (pyrimidine ribonucleoside kinase) is the rate-limiting enzyme in the pyrimidine salvage pathway. It catalyzes the following reaction:ATP + Uridine = ADP + UMP A cDNA for uridine kinase from mouse brain was found which encodes a protein of 277 amino acids. A truncated form of the cDNA was expressed in Escherichia coli, and shown to display uridine kinase activity and to readily form a tetramer, the most active form of the wild-type enzyme. Sequence analysis has identified three ATP-binding site consensus motifs. The predicted secondary structure, and sequence comparison with kinases of known structure, is consistent with uridine kinase having the alpha/beta core nucleotide-binding fold common to many kinases .; GO: 0005524 ATP binding, 0016773 phosphotransferase activity alcohol group as acceptor. Probab=77.79 E-value=2.2 Score=22.44 Aligned_cols=47 Identities=28% Similarity=0.434 Sum_probs=32.0 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHCCHHH-EEEEEECCCHHHHHHCCCC--HHHH Q ss_conf 55304774067999999999998299578-4788852310011102770--3431 Q gi|254780799|r 458 LIAGTTGSGKSVAINTMILSLLYRMTPAQ-CRLIMIDPKMLELSVYDGI--PNLL 509 (806) Q Consensus 458 LIAGtTGSGKSV~iN~iI~SlLyk~~P~e-vkliliDPK~vEls~Y~~i--PHLl 509 (806) =|||+-|||||. |-|=+.+-+|..=++. .++.|| -+=.=|++. -|+= T Consensus 14 gI~GgSgSGKs~-iATtV~~~i~~~~~~kv~~~~~I----~~D~YYK~~GP~~~~ 63 (220) T TIGR00235 14 GISGGSGSGKSL-IATTVARKIYEQLGKKVLEIVLI----SQDNYYKDQGPSDLE 63 (220) T ss_pred EECCCCCHHHHH-HHHHHHHHHHHHHCCCEEEEEEE----CCCCCCCCCCCCCHH T ss_conf 701766101567-89999999999831400145775----032445889887312 No 194 >pfam00488 MutS_V MutS domain V. This domain is found in proteins of the MutS family (DNA mismatch repair proteins) and is found associated with pfam01624, pfam05188, pfam05192 and pfam05190. The mutS family of proteins is named after the Salmonella typhimurium MutS protein involved in mismatch repair; other members of the family included the eukaryotic MSH 1,2,3, 4,5 and 6 proteins. These have various roles in DNA repair and recombination. Human MSH has been implicated in non-polyposis colorectal carcinoma (HNPCC) and is a mismatch binding protein. The aligned region corresponds with domain V of Thermus aquaticus MutS, which contains a Walker A motif, and is structurally similar to the ATPase domain of ABC transporters. Probab=77.79 E-value=5.3 Score=19.64 Aligned_cols=100 Identities=16% Similarity=0.201 Sum_probs=57.3 Q ss_pred HHHCCCCCHHHCCCCCCCCEEEEECCCC---CE-EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHC Q ss_conf 4302210002104566666678541002---02-3553047740679999999999982995784788852310011102 Q gi|254780799|r 427 FEKNQCDLAINLGKSIEGKPIIADLARM---PH-LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVY 502 (806) Q Consensus 427 f~~s~~~L~iaLGKdI~G~pvv~DLakM---PH-LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y 502 (806) .++..|||- -+....++|--|+.=- ++ ++|-|.-.||||+.+-++-+..++-.. -..|--+..++++| T Consensus 14 i~~~rHPll---~~~~~~~~VpNdi~l~~~~~~~~iiTGpN~sGKSt~Lk~igl~~~lAq~-----G~~VPA~~a~~~~~ 85 (234) T pfam00488 14 IEGGRHPVV---EAVLAENFVPNDVSLGKERSRILLITGPNMGGKSTYLRQVALIVIMAQI-----GSFVPAESARIGIV 85 (234) T ss_pred EEECCCCEE---ECCCCCCEECCEEEECCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHH-----CCCEEECCEEEECC T ss_conf 997668769---7257997687658977996169999788777619999999999999983-----68742220599636 Q ss_pred CCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 770343122334304566899999999999999998 Q gi|254780799|r 503 DGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQKMS 538 (806) Q Consensus 503 ~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~l~a 538 (806) +.| ++- +-+.......+..-..||.+==+++. T Consensus 86 d~I---~~~-i~~~dsl~~~~StF~~e~~~~~~il~ 117 (234) T pfam00488 86 DRI---FTR-IGASDDLASGRSTFMVEMLETANILH 117 (234) T ss_pred CEE---EEE-ECCCCHHHCCCCHHHHHHHHHHHHHH T ss_conf 559---998-56753344661179999999999997 No 195 >KOG3347 consensus Probab=77.58 E-value=1 Score=25.06 Aligned_cols=19 Identities=42% Similarity=0.795 Sum_probs=16.0 Q ss_pred CCCEEEEEECCCCHHHHHH Q ss_conf 0202355304774067999 Q gi|254780799|r 453 RMPHLLIAGTTGSGKSVAI 471 (806) Q Consensus 453 kMPHLLIAGtTGSGKSV~i 471 (806) .-|.+||+||-|.|||-.= T Consensus 6 ~~PNILvtGTPG~GKstl~ 24 (176) T KOG3347 6 ERPNILVTGTPGTGKSTLA 24 (176) T ss_pred CCCCEEEECCCCCCCHHHH T ss_conf 3788798679998802599 No 196 >PRK00192 mannosyl-3-phosphoglycerate phosphatase; Reviewed Probab=77.46 E-value=2.3 Score=22.41 Aligned_cols=50 Identities=26% Similarity=0.307 Sum_probs=31.3 Q ss_pred CCCEEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCC Q ss_conf 6986877634468888732100588999999866414237999965777535 Q gi|254780799|r 588 HMPYIVVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDV 639 (806) Q Consensus 588 ~lp~ivviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdv 639 (806) .|.+.+|+.|==--|+ ...+.++..+. -=+++|+.|||+++||=||-..+ T Consensus 3 ~M~~~LI~tDlDGTLL-~~~~~~~~~~~-ai~~l~~~Gi~vviaTGR~~~~~ 52 (275) T PRK00192 3 DMSKLLVFTDLDGTLL-DHTYDWEPAKP-TLKALKEKGIPVIPCTSKTAAEV 52 (275) T ss_pred CCCCEEEEEECCCCCC-CCCCCCCHHHH-HHHHHHHCCCEEEEECCCCHHHH T ss_conf 8884699994857784-89998368999-99999978999999689889999 No 197 >PRK07429 phosphoribulokinase; Provisional Probab=77.33 E-value=1.6 Score=23.47 Aligned_cols=220 Identities=19% Similarity=0.276 Sum_probs=100.8 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 35530477406799999999999829957847888523100111027703431223343045668999999999999999 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQK 536 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~l 536 (806) +=|||-+|||||-..|.|.- .+.++.|-.|--|= | |.+. -.| |.-.- T Consensus 11 IGIAGgSGSGKTTv~r~I~~----~fg~~~VtvI~~Dd-------Y----hk~d----------------r~~--r~~~~ 57 (331) T PRK07429 11 IGVAGDSGCGKSTFLRRLAD----LFGEELVTVICLDD-------Y----HSLD----------------RKQ--RKEIG 57 (331) T ss_pred EEEECCCCCCHHHHHHHHHH----HHCCCCEEEEECCC-------C----CCCC----------------HHH--HHHCC T ss_conf 99857887789999999999----83888779994786-------7----7788----------------788--98718 Q ss_pred HH--HCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEE------EHHHHHHHHHHC-- Q ss_conf 98--70899689999999988744786677544677654543222332232346986877------634468888732-- Q gi|254780799|r 537 MS--KIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVV------VIDEMADLMMVA-- 606 (806) Q Consensus 537 ~a--~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivv------iiDElaDlmm~~-- 606 (806) +. +-..-|.+-+-+-+ ++.+.|+++..|+ ||-.++.. .+.+ ..++=|.|+| --.++.|||-.. T Consensus 58 ~t~lhP~And~dLl~e~L-~~Lk~Gk~I~~Pv---Ydh~tg~~-~~~~--~I~P~~vIIvEGLh~L~~~~lR~l~DlKIF 130 (331) T PRK07429 58 ITALDPRANNFDLMYEQL-KALKTGQPILKPI---YNHETGKI-DPPE--YIKPNKIVVVEGLHPLYDERVRDLYDFKVY 130 (331) T ss_pred CCCCCCCHHCHHHHHHHH-HHHHCCCCCCCCC---CCCCCCCC-CCCE--EECCCCEEEEECCHHCCCHHHHHHCCEEEE T ss_conf 987896400599999999-9998599725652---35647877-8866--606886799916121287989975493799 Q ss_pred --------------------CCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHC----------CCCEEEEEE Q ss_conf --------------------10058899999986641423799996577753554355411----------025158764 Q gi|254780799|r 607 --------------------RKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKAN----------FPTRISFQV 656 (806) Q Consensus 607 --------------------~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan----------~p~riaf~v 656 (806) |...|.-+.-+- +|-.--.--+..||--.|||--+.-.+ +-.|+-.+- T Consensus 131 VD~d~diR~~rRI~RDv~ERG~s~E~Vl~qi~--~RkpD~~~yI~PQk~~ADiVI~~~p~~~~~~~~~~~~l~vrli~~~ 208 (331) T PRK07429 131 LDPPDEVKIAWKIKRDMAERGHTYEDVLAEIE--KREPDFEAYIDPQRQYADVVIQVLPTQLIDNDKENKVLRVRLVQRP 208 (331) T ss_pred ECCCHHHHHHHHHHHHHHHHCCCHHHHHHHHH--HCCHHHHHHCCCCHHCCCEEEEECCCCCCCCCCCCCEEEEEEEECC T ss_conf 64878899999888778661899999999998--5117899766804112728999627667888888855889999507 Q ss_pred CCC-CCCHHHCCCCCH-----HHHC--CCCCEEEECCC---CC---EEEEEECCCCHHHHHHHHHHHHHCCCCCCCC Q ss_conf 586-642123388645-----7865--88754773689---83---2588833489889999999997128974211 Q gi|254780799|r 657 SSK-IDSRTILGEQGA-----EQLL--GQGDMLYMTGG---GR---VQRIHGPFVSDIEVEKVVSHLKTQGEAKYID 719 (806) Q Consensus 657 ~s~-~dSrtild~~ga-----e~Ll--g~gdml~~~~~---~~---~~r~~g~~v~~~ev~~v~~~~~~q~~~~y~~ 719 (806) +-. .|.--++|+.-- .+|- -.|-++++-+. ++ ..-+-|-| +.+|+..|-.++..-+.--|-+ T Consensus 209 ~~~~~~~~~~~d~~s~i~w~p~~l~~~~pg~~~~~~~d~~~g~~v~vle~Dg~~-~~~~~~~~E~~l~~~~~k~~g~ 284 (331) T PRK07429 209 GIPHPDPVYLFDEGSTIDWRPRKLTCSYPGIALRYGPDRYMGKPVSVLEVDGQF-DNEEVIYLEDHLSNTSTKYYGE 284 (331) T ss_pred CCCCCCCCEEECCCCCCCCCCCCCCCCCCCCEEEECCHHHCCCEEEEEEECCCC-CHHHHHHHHHHHHCCCCCCCCH T ss_conf 988998324405776312454335789977468964624459640499855876-9899999999975357721465 No 198 >COG0497 RecN ATPase involved in DNA repair [DNA replication, recombination, and repair] Probab=77.19 E-value=1.5 Score=23.68 Aligned_cols=26 Identities=15% Similarity=0.187 Sum_probs=14.3 Q ss_pred HHHHHHHHHHCCCCEEEEEEEECCEE Q ss_conf 89999997424863289984104424 Q gi|254780799|r 336 ACTLKSVLSDFGIQGEIVNVRPGPVI 361 (806) Q Consensus 336 A~lLE~tL~dFGVe~~Vv~v~pGPVV 361 (806) |+.+=++|..-+-..+|..|+.=|-| T Consensus 471 A~aVg~~L~~Ls~~~QVl~VTHlPQV 496 (557) T COG0497 471 AQAVGKKLRRLSEHHQVLCVTHLPQV 496 (557) T ss_pred HHHHHHHHHHHHCCCEEEEEECHHHH T ss_conf 99999999998369649999357878 No 199 >PRK11192 ATP-dependent RNA helicase SrmB; Provisional Probab=76.87 E-value=4 Score=20.54 Aligned_cols=15 Identities=40% Similarity=0.618 Sum_probs=6.8 Q ss_pred EEEEEECCCCHHHHH Q ss_conf 235530477406799 Q gi|254780799|r 456 HLLIAGTTGSGKSVA 470 (806) Q Consensus 456 HLLIAGtTGSGKSV~ 470 (806) ++++.--|||||+.| T Consensus 42 dvl~~A~TGSGKTla 56 (417) T PRK11192 42 DVLGSAPTGTGKTAA 56 (417) T ss_pred CEEEECCCCCHHHHH T ss_conf 889989998679999 No 200 >COG1122 CbiO ABC-type cobalt transport system, ATPase component [Inorganic ion transport and metabolism] Probab=76.42 E-value=2.8 Score=21.78 Aligned_cols=161 Identities=24% Similarity=0.339 Sum_probs=83.0 Q ss_pred EEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCC-------HHHHHHCCCC----H--HHHCCCC Q ss_conf 7854100202355304774067999999999998299578478885231-------0011102770----3--4312233 Q gi|254780799|r 447 IIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPK-------MLELSVYDGI----P--NLLTPVV 513 (806) Q Consensus 447 vv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK-------~vEls~Y~~i----P--HLl~pVv 513 (806) +-..+.+=-+++|-|.||||||--++. +.-|++.+.-+| .+|-+ ..|+.-+-|+ | .+..|.| T Consensus 23 v~~~i~~Ge~~~i~G~nGsGKSTL~~~--l~GLl~p~~G~v---~~~g~~~~~~~~~~~~~~~vG~VfQnpd~q~~~~tV 97 (235) T COG1122 23 VSLEIEKGERVLLIGPNGSGKSTLLKL--LNGLLKPTSGEV---LVDGLDTSSEKSLLELRQKVGLVFQNPDDQLFGPTV 97 (235) T ss_pred CEEEECCCCEEEEECCCCCCHHHHHHH--HCCCCCCCCCEE---EECCCCCCCHHHHHHHHCCEEEEEECHHHCEECCCH T ss_conf 338987898999988999889999999--537676889848---877813310021887631216999971126104758 Q ss_pred CCH-HHHHHHHHHHHHHHHHHHH-HHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 430-4566899999999999999-99870899689999999988744786677544677654543222332232346986 Q gi|254780799|r 514 TNP-QKAVTVLKWLVCEMEERYQ-KMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPY 591 (806) Q Consensus 514 Td~-~kA~~aL~w~V~EMe~RY~-l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ 591 (806) -|- -=....+.....||++|-+ .+...|- ..|=++-......||+. ++ .-+--+--=|. T Consensus 98 ~~evafg~~n~g~~~~e~~~rv~~~l~~vgl---~~~~~r~p~~LSGGqkq--Rv--------------aIA~vLa~~P~ 158 (235) T COG1122 98 EDEVAFGLENLGLPREEIEERVAEALELVGL---EELLDRPPFNLSGGQKQ--RV--------------AIAGVLAMGPE 158 (235) T ss_pred HHHHHCCHHHCCCCHHHHHHHHHHHHHHHCC---HHHCCCCCCCCCCCCEE--EH--------------HHHHHHHCCCC T ss_conf 8887535744499989999999999998186---11123881106973166--58--------------86688871898 Q ss_pred EEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCC Q ss_conf 87763446-888873210058899999986641423799996577 Q gi|254780799|r 592 IVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRP 635 (806) Q Consensus 592 ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrP 635 (806) | +|-||= |=| -++-.+..+.++-+....-|+++|++|-+- T Consensus 159 i-liLDEPta~L---D~~~~~~l~~~l~~L~~~~~~tiii~tHd~ 199 (235) T COG1122 159 I-LLLDEPTAGL---DPKGRRELLELLKKLKEEGGKTIIIVTHDL 199 (235) T ss_pred E-EEECCCCCCC---CHHHHHHHHHHHHHHHHCCCCEEEEEECCH T ss_conf 9-9974998898---978999999999998860797699994747 No 201 >cd03289 ABCC_CFTR2 The CFTR subfamily domain 2. The cystic fibrosis transmembrane regulator (CFTR), the product of the gene mutated in patients with cystic fibrosis, has adapted the ABC transporter structural motif to form a tightly regulated anion channel at the apical surface of many epithelia. Use of the term assembly of a functional ion channel implies the coming together of subunits or at least smaller not-yet functional components of the active whole. In fact, on the basis of current knowledge only the CFTR polypeptide itself is required to form an ATP- and protein kinase A-dependent low-conductance chloride channel of the type present in the apical membrane of many epithelial cells. CFTR displays the typical organization (IM-ABC)2 and carries a characteristic hydrophilic R-domain that separates IM1-ABC1 from IM2-ABC2. Probab=76.35 E-value=2.3 Score=22.36 Aligned_cols=73 Identities=12% Similarity=0.191 Sum_probs=40.3 Q ss_pred EEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCHH Q ss_conf 77634468888732100588999999866414237999965777535543554110251587645866421233886457 Q gi|254780799|r 593 VVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGAE 672 (806) Q Consensus 593 vviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~gae 672 (806) |++.||----+- .+-|..|.+.-.. ...|--.|+.|+|++ .++. --||-+ +|.=.|..+.--| T Consensus 159 IllLDEpTs~LD---~~te~~i~~~l~~-~~~~~TvI~itHrl~------~~~~--~DrIlv-----ld~G~Ive~Gt~e 221 (275) T cd03289 159 ILLLDEPSAHLD---PITYQVIRKTLKQ-AFADCTVILSEHRIE------AMLE--CQRFLV-----IEENKVRQYDSIQ 221 (275) T ss_pred EEEEECCCCCCC---HHHHHHHHHHHHH-HCCCCEEEEEECCHH------HHHH--CCEEEE-----EECCEEEEECCHH T ss_conf 899979766899---9999999999999-729998999943888------8986--999999-----9899999978989 Q ss_pred HHCCCCCEEE Q ss_conf 8658875477 Q gi|254780799|r 673 QLLGQGDMLY 682 (806) Q Consensus 673 ~Llg~gdml~ 682 (806) .||-++-..+ T Consensus 222 eLl~~~~~y~ 231 (275) T cd03289 222 KLLNEKSHFK 231 (275) T ss_pred HHHHCCCCHH T ss_conf 9986789159 No 202 >TIGR02198 rfaE_dom_I rfaE bifunctional protein, domain I; InterPro: IPR011913 RfaE is a protein involved in the biosynthesis of ADP-L-glycero-D-manno-heptose, a precursor for LPS inner core biosynthesis. RfaE is a bifunctional protein in Escherichia coli, and separate proteins in some other genome. The longer, N-terminal domain I (this family) is suggested to act in D-glycero-D-manno-heptose 1-phosphate biosynthesis, while domain II (IPR011914 from INTERPRO) adds ADP to yield ADP-D-glycero-D-manno-heptose .; GO: 0016773 phosphotransferase activity alcohol group as acceptor, 0016779 nucleotidyltransferase activity, 0005975 carbohydrate metabolic process. Probab=76.12 E-value=1.7 Score=23.44 Aligned_cols=51 Identities=25% Similarity=0.450 Sum_probs=27.8 Q ss_pred HHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHH Q ss_conf 99999999999829957847888523100111027703431223343045668999 Q gi|254780799|r 469 VAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLK 524 (806) Q Consensus 469 V~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~ 524 (806) |--|.++-.+.=.|+-. =|.+|||||-.-+|-|.|= -|||| |.+++..++. T Consensus 158 vLt~~v~~~~I~~Ar~~-~~pVlVDPKg~df~~Y~GA-tl~TP---N~~E~~~avG 208 (321) T TIGR02198 158 VLTPSVVQEVIAAAREA-GKPVLVDPKGKDFSRYRGA-TLITP---NRKEAEAAVG 208 (321) T ss_pred CCCHHHHHHHHHHHHHC-CCCEEEECCCCCHHHHCCC-CCCCC---CHHHHHHHHC T ss_conf 35857899999999966-8919980787623451466-42366---8799999858 No 203 >PRK10261 glutathione transporter ATP-binding protein; Provisional Probab=75.92 E-value=3.3 Score=21.15 Aligned_cols=28 Identities=32% Similarity=0.382 Sum_probs=20.6 Q ss_pred EEECCCCCEEEEEECCCCHHHHHHHHHH Q ss_conf 8541002023553047740679999999 Q gi|254780799|r 448 IADLARMPHLLIAGTTGSGKSVAINTMI 475 (806) Q Consensus 448 v~DLakMPHLLIAGtTGSGKSV~iN~iI 475 (806) -.++.+==-|-|.|-.|||||.-..++. T Consensus 344 sf~i~~GE~l~lvG~sGsGKSTl~r~l~ 371 (623) T PRK10261 344 SFDLWPGETLSLVGESGSGKSTTGRALL 371 (623) T ss_pred CCCCCCCCEEEEECCCCCCHHHHHHHHH T ss_conf 0035899589997678766899999985 No 204 >pfam03408 Foamy_virus_ENV Foamy virus envelope protein. Expression of the envelope (Env) glycoprotein is essential for viral particle egress. This feature is unique to the Spumavirinae, a subclass of the Retroviridae. Probab=75.90 E-value=4.6 Score=20.12 Aligned_cols=32 Identities=22% Similarity=0.420 Sum_probs=20.3 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHH------HHCCEEEE Q ss_conf 4468888732100588999999866------41423799 Q gi|254780799|r 597 DEMADLMMVARKDIESAVQRLAQMA------RASGIHVI 629 (806) Q Consensus 597 DElaDlmm~~~~~ve~~i~rlaq~a------ra~GiHli 629 (806) +-+.- |..+|--.-.+|+-|||.. =|+||||. T Consensus 572 nN~~k-lq~~G~~lt~Av~tlskISDlNDe~L~~Gi~LL 609 (985) T pfam03408 572 NNYRK-LRSMGYSLTGAVQTLSKISDINDENLQQGIHLL 609 (985) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 06999-999757778889999875035678887568899 No 205 >cd03247 ABCC_cytochrome_bd The CYD subfamily implicated in cytochrome bd biogenesis. The CydC and CydD proteins are important for the formation of cytochrome bd terminal oxidase of E. coli and it has been proposed that they were necessary for biosynthesis of the cytochrome bd quinol oxidase and for periplasmic c-type cytochromes. CydCD were proposed to determine a heterooligomeric complex important for heme export into the periplasm or to be involved in the maintenance of the proper redox state of the periplasmic space. In Bacillus subtilius, the absence of CydCD does not affect the presence of halo-cytochrome c in the membrane and this observation suggests that CydCD proteins are not involved in the export of heme in this organism. Probab=75.76 E-value=2.5 Score=22.12 Aligned_cols=28 Identities=25% Similarity=0.405 Sum_probs=21.5 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHH Q ss_conf 5410020235530477406799999999 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMIL 476 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~ 476 (806) .++.+==++.|-|.+|||||-.++.|.= T Consensus 23 l~i~~Ge~~aivG~sGsGKSTLl~~l~G 50 (178) T cd03247 23 LELKQGEKIALLGRSGSGKSTLLQLLTG 50 (178) T ss_pred EEECCCCEEEEECCCCCHHHHHHHHHHH T ss_conf 9986999999999998759999999986 No 206 >pfam00224 PK Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains. Probab=75.73 E-value=1.6 Score=23.57 Aligned_cols=234 Identities=22% Similarity=0.219 Sum_probs=120.0 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEE-------EEECCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 25665457899999974248632899841044244444-------32147863999997889999886300121000388 Q gi|254780799|r 328 SPKVMQNNACTLKSVLSDFGIQGEIVNVRPGPVITLYE-------LEPAPGIKSSRIIGLSDDIARSMSAISARVAVIPR 400 (806) Q Consensus 328 s~eeL~~nA~lLE~tL~dFGVe~~Vv~v~pGPVVTrYE-------i~PApGVKvSKI~nLadDIA~aLsa~svRIapIPG 400 (806) +.++.++.-+.+.+.-+++|-..-|--=.+||-+-.=+ ++..+|-++. +..|- ....+| T Consensus 39 ~~e~~~~~i~~iR~~~~~~~~~v~Il~Dl~GpkIR~g~~~~~~~~i~l~~G~~v~----~~~~~----------~~~~~~ 104 (348) T pfam00224 39 SHEYHQSRIDNVREAEEKLGGLVAIALDTKGPEIRTGLFKDGKKDIELKAGDKFL----VSTDP----------AYKGAG 104 (348) T ss_pred CHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCEEEEECCCCCEEEEECCCCEEE----EECCC----------CCCCCC T ss_conf 9999999999999999981998079983689953555226885038966998999----96473----------025789 Q ss_pred -CCEEEEEECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCCC------CCCCCE-EEEECCCCCEEEEEECCCCHHHHHHH Q ss_conf -6125553024566238734230721343022100021045------666666-78541002023553047740679999 Q gi|254780799|r 401 -RNAIGIELPNDIRETVMLRDLIVSRVFEKNQCDLAINLGK------SIEGKP-IIADLARMPHLLIAGTTGSGKSVAIN 472 (806) Q Consensus 401 -K~~VGIEiPN~~r~~V~lreil~s~~f~~s~~~L~iaLGK------dI~G~p-vv~DLakMPHLLIAGtTGSGKSV~iN 472 (806) +..|.+--| ++++. -+....+-+.=|+ ++.++- +. -+.+-+|.=+|.|+|-+ T Consensus 105 ~~~~i~v~~~----------~l~~~---vk~Gd~I~idDG~i~l~V~~~~~~~~i~------~~v~~gG~L~s~Kgvn~- 164 (348) T pfam00224 105 DKEKIYVDYK----------NLTKD---VSPGGIILVDDGVLSLKVLEKDDDKTLV------TEVLNGGVLGSRKGVNL- 164 (348) T ss_pred CCCEEECCCH----------HHHHH---CCCCCEEEECCCEEEEEEEEEECCCEEE------EEEEECEEEECCCCCCC- T ss_conf 8688973717----------76987---5899889843881899999980897399------99977969978766578- Q ss_pred HHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 99999998299578478885231001110277034312233430456689999999999999999870899689999999 Q gi|254780799|r 473 TMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKV 552 (806) Q Consensus 473 ~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~ 552 (806) |+- .+-.|..|+ |-...|+|+++ .--+..|-.=||+-+.-.+-- T Consensus 165 -----------p~~--------------------~~~lp~lTe--kD~~di~~a~~---~~vD~ialSFVrsa~Di~~lr 208 (348) T pfam00224 165 -----------PGT--------------------DVDLPALSE--KDKADLRFGVK---QGVDMIFASFVRTASDVLEVR 208 (348) T ss_pred -----------CCC--------------------CCCCCCCCH--HHHHHHHHHHH---CCCCEEEECCCCCHHHHHHHH T ss_conf -----------998--------------------567566887--67999999997---799999987889999999999 Q ss_pred HHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCH--HHHHH-------HHHHHHHHH Q ss_conf 988744786677544677654543222332232346986877634468888732100--58899-------999986641 Q gi|254780799|r 553 AQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKD--IESAV-------QRLAQMARA 623 (806) Q Consensus 553 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~--ve~~i-------~rlaq~ara 623 (806) .-....|... ++.. ..| ....++.|+-|+ +.+|=+|+|.-| +|-++ -++-+++|+ T Consensus 209 ~~l~~~~~~~--~IIa---------KIE-~~~al~Nl~eIi----~~sDgImIARGDLg~ei~~e~vp~~Qk~ii~~~~~ 272 (348) T pfam00224 209 EVLGEAGKDI--KIIA---------KIE-NQEGVNNFDEIL----EASDGIMVARGDLGIEIPAEEVFLAQKMLIAKCNL 272 (348) T ss_pred HHHHHCCCCC--EEEE---------EEC-CHHHHHHHHHHH----HHCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 9999738876--0599---------844-789886788899----85888998447644558988989999999999998 Q ss_pred CCEEEEEEECCCCCCCCCHHHHHCCCCEE Q ss_conf 42379999657775355435541102515 Q gi|254780799|r 624 SGIHVIMATQRPSVDVITGTIKANFPTRI 652 (806) Q Consensus 624 ~GiHli~aTqrPsvdvitg~ikan~p~ri 652 (806) +|.-.|+||| +.--+|+...|||- T Consensus 273 ~~kpvivATq-----mLeSM~~~~~PTRA 296 (348) T pfam00224 273 AGKPVITATQ-----MLESMIYNPRPTRA 296 (348) T ss_pred CCCCEEEECC-----CHHHHCCCCCCCHH T ss_conf 5992999827-----57765149998569 No 207 >PRK06739 pyruvate kinase; Validated Probab=75.48 E-value=1.6 Score=23.48 Aligned_cols=167 Identities=20% Similarity=0.255 Sum_probs=88.2 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH-HHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 223343045668999999999999999987089968999999998-8744786677544677654543222332232346 Q gi|254780799|r 510 TPVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQ-YHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQH 588 (806) Q Consensus 510 ~pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 588 (806) .|..|+-. ...|+|+++. =-+.+|-.=||+-+.-.+ +++ ..+.+.+-.. +. +..| ....++. T Consensus 160 lp~LTekD--~~dI~~a~~~---~vD~valSFVrsa~DI~~-lr~~l~~~~~~~~~-II---------aKIE-~~~al~N 222 (352) T PRK06739 160 LPAITEKD--KKDIQFLLEE---DVDFIACSFVRKPSHIKE-IRDFIQQYKETSPN-LI---------AKIE-TMEAIEN 222 (352) T ss_pred CCCCCCCC--HHHHHHHHHC---CCCEEEECCCCCHHHHHH-HHHHHHHHCCCCCC-EE---------EEEC-CHHHHHH T ss_conf 76688026--9999999875---999999857799999999-99999972688862-79---------9864-9989976 Q ss_pred CCEEEEEHHHHHHHHHHCCCH--HHHHH-------HHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCC Q ss_conf 986877634468888732100--58899-------999986641423799996577753554355411025158764586 Q gi|254780799|r 589 MPYIVVVIDEMADLMMVARKD--IESAV-------QRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSK 659 (806) Q Consensus 589 lp~ivviiDElaDlmm~~~~~--ve~~i-------~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~ 659 (806) +.-|+ +.+|-.|+|.-| +|.++ -++-+++|++|.-.|+||| +.--+|+...|||-=- T Consensus 223 ldeIi----~~sDgIMIARGDLgvei~~e~vp~~Qk~Ii~~c~~~gkpvivATq-----mLeSM~~~p~PTRAEv----- 288 (352) T PRK06739 223 FQDIC----KEADGIMIARGDLGVELPYQFIPLLQKMMIQECNRTNTYVITATQ-----MLQSMVDHSIPTRAEV----- 288 (352) T ss_pred HHHHH----HHCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECC-----CHHHHCCCCCCCHHHH----- T ss_conf 99999----855946996586446369889899999999999982997899636-----2887634998876999----- Q ss_pred CC-CHHHCCCCCHHHHCCCCCEEEECC----CCCEEEEEECCCCHHHHHHHHHHHHHCCCCCCCC Q ss_conf 64-212338864578658875477368----9832588833489889999999997128974211 Q gi|254780799|r 660 ID-SRTILGEQGAEQLLGQGDMLYMTG----GGRVQRIHGPFVSDIEVEKVVSHLKTQGEAKYID 719 (806) Q Consensus 660 ~d-Srtild~~gae~Llg~gdml~~~~----~~~~~r~~g~~v~~~ev~~v~~~~~~q~~~~y~~ 719 (806) .| +-.|+| || |-+.+.+ |.-|... --.=..|.+-+++++.++.-.|.+ T Consensus 289 ~DVanav~d--G~-------D~vmLs~ETA~G~yPv~~---V~~m~~i~~~aE~~~~~~~~~~~~ 341 (352) T PRK06739 289 TDVFQAVLD--GT-------NAVMLSAESASGEHPVES---VSTLRLVSEFAEHVKKDGPFVMKD 341 (352) T ss_pred HHHHHHHHH--CC-------CEEEECCHHCCCCCHHHH---HHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 999999984--79-------889972201179799999---999999999998766128417666 No 208 >COG3378 Phage associated DNA primase [General function prediction only] Probab=75.47 E-value=2.5 Score=22.12 Aligned_cols=43 Identities=21% Similarity=0.271 Sum_probs=32.9 Q ss_pred CHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHH Q ss_conf 0021045666666785410020235530477406799999999 Q gi|254780799|r 434 LAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMIL 476 (806) Q Consensus 434 L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~ 476 (806) .-..|..-|.|-.+...-..=+-+.+.|.+|+||||++|.|.. T Consensus 210 el~~ll~~i~g~~l~g~~~~~k~~~l~G~G~nGKstf~~li~~ 252 (517) T COG3378 210 ELRNLLQRIIGASLTGRVSEQKLFWLYGPGGNGKSTFVDLISN 252 (517) T ss_pred HHHHHHHHHHHHEECCCCCCEEEEEEECCCCCCHHHHHHHHHH T ss_conf 8999999997660067534004799976899873899999999 No 209 >PRK07399 DNA polymerase III subunit delta'; Validated Probab=75.46 E-value=2.6 Score=22.03 Aligned_cols=36 Identities=31% Similarity=0.487 Sum_probs=30.6 Q ss_pred CCCCE-EEEEECCCCHHHHHHHHHHHHHHHHCCHHHE Q ss_conf 00202-3553047740679999999999982995784 Q gi|254780799|r 452 ARMPH-LLIAGTTGSGKSVAINTMILSLLYRMTPAQC 487 (806) Q Consensus 452 akMPH-LLIAGtTGSGKSV~iN~iI~SlLyk~~P~ev 487 (806) .|+|| .|..|..|+||.-+-..+..+|+....+.+| T Consensus 23 ~rl~hAyLF~Gp~G~GK~~~A~~fa~~Ll~~~~~~~~ 59 (314) T PRK07399 23 NRIAPAYLFAGPEGVGRKLAALRFIEGLLSQGSPSKN 59 (314) T ss_pred CCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCC T ss_conf 9967448778999832999999999998578999976 No 210 >TIGR02324 CP_lyasePhnL phosphonate C-P lyase system protein PhnL; InterPro: IPR012701 Members of this family are the PhnL protein of C-P lyase systems for utilization of phosphonates. These systems resemble phosphonatase-based systems in having a three-component ABC transporter, where IPR005769 from INTERPRO is the permease, IPR005770 from INTERPRO is the phosphonates binding protein, and IPR012693 from INTERPRO is the ATP-binding cassette (ABC) protein. They differ, however, in having, typically, ten or more additional genes, many of which are believed to form a membrane-associated C-P lyase complex. This protein (PhnL) and the adjacent-encoded PhnK (IPR012700 from INTERPRO) resemble transporter ATP-binding proteins but are suggested, based on mutagenesis studies, to be part of this C-P lyase complex rather than part of a transporter per se .. Probab=75.44 E-value=2 Score=22.80 Aligned_cols=11 Identities=9% Similarity=0.142 Sum_probs=5.8 Q ss_pred CHHHHHHHCCC Q ss_conf 61353320012 Q gi|254780799|r 311 SKEILSTSQSP 321 (806) Q Consensus 311 sLdLL~~~~~~ 321 (806) |+=|||+|+.+ T Consensus 169 PiLLLdEPTAS 179 (224) T TIGR02324 169 PILLLDEPTAS 179 (224) T ss_pred CCCCCCCCCCC T ss_conf 40011355402 No 211 >TIGR00678 holB DNA polymerase III, delta' subunit; InterPro: IPR004622 DNA-directed DNA polymerase (2.7.7.7 from EC) catalyzes DNA-template-directed extension of the 3'-end of an RNA strand by one nucleotide at a time. DNA polymerase III is a complex, multi-chain enzyme responsible for most of the replicative synthesis in bacteria. The enzyme also has 3' to 5' exonuclease activity. It has a core composed of alpha, epsilon and theta chains, that associate with a tau subunit which allows the core dimerisation to form the PolIII' complex. PolIII' associates with the gamma complex (gamma, delta, delta', psi and chi chains) and with the beta chain. This domain is the N-terminal half of the delta' subunit of DNA polymerase III. Delta' is homologous to the gamma and tau subunits, which form an outgroup for phylogenetic comparison. The gamma/tau branch of the tree is much more tightly conserved than the delta' branch, and some members of that branch score more highly against this model than some proteins classified as delta'. The noise cut-off is set to detect weakly scoring delta' subunits rather than to exclude gamma/tau subunits.; GO: 0003887 DNA-directed DNA polymerase activity, 0008408 3'-5' exonuclease activity, 0006260 DNA replication. Probab=75.38 E-value=2.8 Score=21.75 Aligned_cols=30 Identities=30% Similarity=0.538 Sum_probs=26.4 Q ss_pred CCCCE-EEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 00202-3553047740679999999999982 Q gi|254780799|r 452 ARMPH-LLIAGTTGSGKSVAINTMILSLLYR 481 (806) Q Consensus 452 akMPH-LLIAGtTGSGKSV~iN~iI~SlLyk 481 (806) .+||| +|.+|.-|.||......+.-+||-. T Consensus 11 ~r~~HA~LF~G~~G~Gk~~~A~~~A~~l~C~ 41 (216) T TIGR00678 11 GRLAHAYLFTGPEGVGKELLALALAKALLCE 41 (216) T ss_pred CCCCCCEEEECCCCCCHHHHHHHHHHHHCCC T ss_conf 6788612544488874899999999998077 No 212 >pfam01047 MarR MarR family. The Mar proteins are involved in the multiple antibiotic resistance, a non-specific resistance system. The expression of the mar operon is controlled by a repressor, MarR. A large number of compounds induce transcription of the mar operon. This is thought to be due to the compound binding to MarR, and the resulting complex stops MarR binding to the DNA. With the MarR repression lost, transcription of the operon proceeds. The structure of MarR is known and shows MarR as a dimer with each subunit containing a winged-helix DNA binding motif. Probab=75.29 E-value=4.2 Score=20.40 Aligned_cols=52 Identities=21% Similarity=0.499 Sum_probs=42.6 Q ss_pred HHHHHHHHCCCEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHC-CCCCEE Q ss_conf 9999999659850001422200117789999999997798680227-887267 Q gi|254780799|r 745 QAVDIVLRDNKASISYIQRRLGIGYNRAASIIENMEEKGVIGPASS-TGKREI 796 (806) Q Consensus 745 ~a~~~v~~~~~~s~s~lqr~~~igy~raar~~~~~e~~giv~~~~g-~~~r~v 796 (806) .++..+-+.+..|.+-|-+++.+-=+-..|+++.||++|+|--... .-.|+| T Consensus 7 ~vL~~l~~~~~~s~~~la~~~~~~~~~vs~~i~~Le~~glv~R~~~~~DrR~V 59 (59) T pfam01047 7 HILRILYEHGPLTVSELAEKLGVDRSTVTRVLDRLEKKGLIERSRSPEDRREV 59 (59) T ss_pred HHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCEEECCCCCCCCCC T ss_conf 99999994699299999999885865499999999889798862799998819 No 213 >pfam07999 RHSP Retrotransposon hot spot protein. Members of this family are retrotransposon hot spot proteins. They are associated with polymorphic subtelomeric regions in Trypanosoma. Probab=75.28 E-value=0.41 Score=27.94 Aligned_cols=69 Identities=23% Similarity=0.334 Sum_probs=44.1 Q ss_pred CCEEEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEE Q ss_conf 6238734230721343022100021045666666785410020235530477406799999999999829957847888 Q gi|254780799|r 413 RETVMLRDLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIM 491 (806) Q Consensus 413 r~~V~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evklil 491 (806) +...-=+|+.. .|+--+..|+--+...- .-..+||+|| ||+|=|||.+.-+-++-=|..+.-+++++|- T Consensus 174 ~D~yVncEV~R--VWqIVk~dLt~Wfs~~~-------~~~~~p~VLI-GTPGIGKSm~aGSyLLYqLLHyDaekLqvVa 242 (597) T pfam07999 174 NDVYVRREVAR--VWYMVEDRLGNWLVSGL-------KKYKIPGVLD-GTPGIGKSLTVGSYLLYYLLHYDAELLRIIA 242 (597) T ss_pred CEEEEEHHHHH--HHHHHHHHHHHHHCCCC-------CCCCCCCEEE-CCCCCCCCCCCCCEEEEEECCCCHHHCCEEE T ss_conf 60798658988--69999867777624688-------8789973798-8897563446762257640016753562899 No 214 >PRK04214 rbn ribonuclease BN/unknown domain fusion protein; Reviewed Probab=75.15 E-value=6.2 Score=19.15 Aligned_cols=40 Identities=10% Similarity=0.093 Sum_probs=19.0 Q ss_pred CCCHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHCCCCCCCC Q ss_conf 3112789999999999999------999999999984427068963 Q gi|254780799|r 16 FLLSDWSKKKMKIVAGLIL------LCTVFAITLALGTWDVYDPSF 55 (806) Q Consensus 16 ~~l~~f~~rrl~Ei~Gl~L------i~~al~l~iSL~SYsp~DPs~ 55 (806) |+.+.|...|+....+-+- ++=.+.+++|++|+.|....+ T Consensus 21 ~~~rrF~~dr~~~~AAsLtf~tlLALvPlL~v~~sl~s~fp~f~~~ 66 (411) T PRK04214 21 FLWRRFLDDRLFQAAASLTFTTLLALVPLATVVFGVLSAFPVFDRW 66 (411) T ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHH T ss_conf 9999975467988999999999999999999999999875461568 No 215 >PRK11773 uvrD DNA-dependent helicase II; Provisional Probab=75.15 E-value=6.2 Score=19.15 Aligned_cols=19 Identities=26% Similarity=0.265 Sum_probs=12.0 Q ss_pred HHHHHHHHHHCCCCEEEEE Q ss_conf 8999999742486328998 Q gi|254780799|r 336 ACTLKSVLSDFGIQGEIVN 354 (806) Q Consensus 336 A~lLE~tL~dFGVe~~Vv~ 354 (806) ++.||+.|...||...+.+ T Consensus 360 ~~~le~aL~~~gIPy~i~g 378 (722) T PRK11773 360 SRVLEEALLQAGIPYRIYG 378 (722) T ss_pred HHHHHHHHHHCCCCEEEEC T ss_conf 6789999986599848746 No 216 >cd03284 ABC_MutS1 MutS1 homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clam Probab=75.04 E-value=4.8 Score=19.98 Aligned_cols=119 Identities=19% Similarity=0.252 Sum_probs=72.1 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 35530477406799999999999829957847888523100111027703431223343045668999999999999999 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQK 536 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~l 536 (806) ++|-|.-.||||+.+-++-+..++-..-- .|-=+.-++++|+.| ++- +.+...-...+.--..||.+=-+. T Consensus 33 ~iiTGpN~sGKSt~Lk~igl~~ilaq~G~-----~vpA~~a~i~~~d~I---~t~-i~~~d~i~~~~StF~~e~~~~~~i 103 (216) T cd03284 33 LLITGPNMAGKSTYLRQVALIALLAQIGS-----FVPASKAEIGVVDRI---FTR-IGASDDLAGGRSTFMVEMVETANI 103 (216) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHHCC-----EEEECCEEEEECCEE---EEE-ECCCCHHHCCCCHHHHHHHHHHHH T ss_conf 99989987745999999999999998687-----587661599704469---995-068621333726589999999999 Q ss_pred HHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCHHHHHH-- Q ss_conf 987089968999999998874478667754467765454322233223234698687763446888873210058899-- Q gi|254780799|r 537 MSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKDIESAV-- 614 (806) Q Consensus 537 ~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~ve~~i-- 614 (806) +..+. +.=.|++||+.- |-+.++.+ T Consensus 104 l~~a~------------------------------------------------~~sLvliDEl~~-----gT~~~eg~al 130 (216) T cd03284 104 LNNAT------------------------------------------------ERSLVLLDEIGR-----GTSTYDGLSI 130 (216) T ss_pred HHHCC------------------------------------------------CCCEEEECCCCC-----CCCHHHHHHH T ss_conf 98487------------------------------------------------761563344568-----9985788999 Q ss_pred --HHHHHHHHHCCEEEEEEECCCCC Q ss_conf --99998664142379999657775 Q gi|254780799|r 615 --QRLAQMARASGIHVIMATQRPSV 637 (806) Q Consensus 615 --~rlaq~ara~GiHli~aTqrPsv 637 (806) +=|...+.-.+...++||--... T Consensus 131 a~aile~L~~~~~~~~i~tTH~~~L 155 (216) T cd03284 131 AWAIVEYLHEKIGAKTLFATHYHEL 155 (216) T ss_pred HHHHHHHHHHCCCCEEEEEECHHHH T ss_conf 9999999997079728975050889 No 217 >cd03288 ABCC_SUR2 The SUR domain 2. The sulfonylurea receptor SUR is an ATP binding cassette (ABC) protein of the ABCC/MRP family. Unlike other ABC proteins, it has no intrinsic transport function, neither active nor passive, but associates with the potassium channel proteins Kir6.1 or Kir6.2 to form the ATP-sensitive potassium (K(ATP)) channel. Within the channel complex, SUR serves as a regulatory subunit that fine-tunes the gating of Kir6.x in response to alterations in cellular metabolism. It constitutes a major pharmaceutical target as it binds numerous drugs, K(ATP) channel openers and blockers, capable of up- or down-regulating channel activity. Probab=74.96 E-value=2.6 Score=21.94 Aligned_cols=180 Identities=16% Similarity=0.165 Sum_probs=84.4 Q ss_pred CCCCCEEEEEECCCCHHHHHHHHHHHHHHHHC---------------CH----HHEEEEEECCCHHHHHHCCCCHHHHCC Q ss_conf 10020235530477406799999999999829---------------95----784788852310011102770343122 Q gi|254780799|r 451 LARMPHLLIAGTTGSGKSVAINTMILSLLYRM---------------TP----AQCRLIMIDPKMLELSVYDGIPNLLTP 511 (806) Q Consensus 451 LakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~---------------~P----~evkliliDPK~vEls~Y~~iPHLl~p 511 (806) +.+==..-|-|.+|||||..++.|.- ||+. ++ ..+-++.=||...+-|+.+.|- ..+ T Consensus 44 I~~Ge~vaIvG~sGsGKSTL~~ll~g--l~~p~~G~I~idg~di~~~~~~~lr~~i~~v~Q~~~lf~~Ti~~Ni~--~~~ 119 (257) T cd03288 44 IKPGQKVGICGRTGSGKSSLSLAFFR--MVDIFDGKIVIDGIDISKLPLHTLRSRLSIILQDPILFSGSIRFNLD--PEC 119 (257) T ss_pred ECCCCEEEEECCCCCCHHHHHHHHHH--CCCCCCCEEEECCEEHHHCCHHHHHHHEEEEECCCCCCCCHHHHCCC--CCC T ss_conf 87999999999999819999999960--56678888999989968799999975057994567343613554137--566 Q ss_pred CCCCHHHHHHHHHHHH-----HHHHHHHHHHHHCCCCCHHH-HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 3343045668999999-----99999999998708996899-99999988744786677544677654543222332232 Q gi|254780799|r 512 VVTNPQKAVTVLKWLV-----CEMEERYQKMSKIGVRNIDG-FNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFD 585 (806) Q Consensus 512 VvTd~~kA~~aL~w~V-----~EMe~RY~l~a~~~vRni~~-yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 585 (806) -++|- +...+|+-+- +.+...|+-.-..+-.|+.+ =-+|+.-|++- T Consensus 120 ~~~~~-~i~~al~~~~l~~~i~~lp~gl~t~l~~~g~~LSgGQrQri~lARAl--------------------------- 171 (257) T cd03288 120 KCTDD-RLWEALEIAQLKNMVKSLPGGLDAVVTEGGENFSVGQRQLFCLARAF--------------------------- 171 (257) T ss_pred CCCHH-HHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCHHHHHHHHHHHHH--------------------------- T ss_conf 76899-99999999623799973758985174687683699999999999999--------------------------- Q ss_pred CCCCCEEEEEHHHHH-HHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHH Q ss_conf 346986877634468-8887321005889999998664142379999657775355435541102515876458664212 Q gi|254780799|r 586 FQHMPYIVVVIDEMA-DLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRT 664 (806) Q Consensus 586 ~~~lp~ivviiDEla-Dlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrt 664 (806) -.=|- |+|.||-- -|=..+.+.+...+.+ ...+.-+|+.|.|++. ++ .--||.+ +|.=. T Consensus 172 -l~~~~-iliLDEpts~LD~~te~~i~~~l~~-----~~~~~TvI~ItHrl~~------~~--~~D~Iiv-----l~~G~ 231 (257) T cd03288 172 -VRKSS-ILIMDEATASIDMATENILQKVVMT-----AFADRTVVTIAHRVST------IL--DADLVLV-----LSRGI 231 (257) T ss_pred -HCCCC-EEEEECCCCCCCHHHHHHHHHHHHH-----HCCCCEEEEEECCHHH------HH--HCCEEEE-----EECCE T ss_conf -55999-9999587667899999999999999-----7499999999238989------98--5999999-----98999 Q ss_pred HCCCCCHHHHCCCCCEEE Q ss_conf 338864578658875477 Q gi|254780799|r 665 ILGEQGAEQLLGQGDMLY 682 (806) Q Consensus 665 ild~~gae~Llg~gdml~ 682 (806) |..+..-|.||.+-|-.| T Consensus 232 Ive~Gt~~eLl~~~~g~y 249 (257) T cd03288 232 LVECDTPENLLAQEDGVF 249 (257) T ss_pred EEEECCHHHHHHCCCCHH T ss_conf 999889999971789778 No 218 >PRK13646 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=74.76 E-value=2.4 Score=22.25 Aligned_cols=104 Identities=11% Similarity=0.177 Sum_probs=54.5 Q ss_pred EEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCH Q ss_conf 7763446-888873210058899999986641423799996577753554355411025158764586642123388645 Q gi|254780799|r 593 VVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGA 671 (806) Q Consensus 593 vviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~ga 671 (806) |+|-||= |-|=....++|-+.|.+|- +--|+-.|++|..-.. =+.+-.||.. --|-| |+.++-. T Consensus 166 iLilDEPTagLDp~~~~~i~~ll~~l~---~~~g~TiI~iTHdm~~-------v~~~adrv~v----m~~G~-Iv~~G~p 230 (286) T PRK13646 166 IIVVDEPTAGLDPQSKRQVMRLLKSLQ---TDENKAIILISHDMNE-------VARYADEVIV----MKEGS-IVSQTSP 230 (286) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHH---HHCCCEEEEECCCHHH-------HHHHCCEEEE----EECCE-EEEECCH T ss_conf 999838744389899999999999999---9539899999138999-------9996999999----98989-9997789 Q ss_pred HHHCCCCCEEEEC----CC-CC---------EEEEEECCCCHHHHHHHHHHHHH Q ss_conf 7865887547736----89-83---------25888334898899999999971 Q gi|254780799|r 672 EQLLGQGDMLYMT----GG-GR---------VQRIHGPFVSDIEVEKVVSHLKT 711 (806) Q Consensus 672 e~Llg~gdml~~~----~~-~~---------~~r~~g~~v~~~ev~~v~~~~~~ 711 (806) +.++-+-|.|--. |- .+ ..-....+.+++|...+++.|+. T Consensus 231 ~evf~~~~~l~~~~l~~P~~~~l~~~l~~~~~~~~~~~~~t~e~~~~~~k~~~~ 284 (286) T PRK13646 231 KELFKDKKKLADWHIGLPEIVQLQYDFEQKYQTKLKDIALTEEAFVSLYKEWQH 284 (286) T ss_pred HHHHCCHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 999779657987899998499999999984799867666429999999999754 No 219 >TIGR01967 DEAH_box_HrpA ATP-dependent helicase HrpA; InterPro: IPR010222 This entry represents HrpA, one of two related but uncharacterised DEAH-box ATP-dependent helicases in many Proteobacteria and a few high-GC Gram-positive bacteria. HrpA is about 1300 amino acids long, while its paralog HrpB, also uncharacterised, is about 800 amino acids long. Related characterised eukaryotic proteins are RNA helicases associated with pre-mRNA processing.; GO: 0005524 ATP binding, 0008026 ATP-dependent helicase activity. Probab=74.70 E-value=1.3 Score=24.29 Aligned_cols=16 Identities=25% Similarity=0.272 Sum_probs=7.6 Q ss_pred HHHHHHHHHHHHHHHC Q ss_conf 6545789999997424 Q gi|254780799|r 331 VMQNNACTLKSVLSDF 346 (806) Q Consensus 331 eL~~nA~lLE~tL~dF 346 (806) +.++.+..++++|..| T Consensus 1171 ~v~~~a~~~~~~L~~~ 1186 (1320) T TIGR01967 1171 EVERIAKQLEEILVLF 1186 (1320) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 9999999999999998 No 220 >COG0178 UvrA Excinuclease ATPase subunit [DNA replication, recombination, and repair] Probab=74.65 E-value=2.3 Score=22.41 Aligned_cols=122 Identities=23% Similarity=0.330 Sum_probs=58.1 Q ss_pred HHHHHCCCCEEEEEEEECC---EEEEEEEEECCCCCHHHHHH--HHHHHHHHHHHC-CCCCCCCCCCCEEEEEECCCCCC Q ss_conf 9974248632899841044---24444432147863999997--889999886300-12100038861255530245662 Q gi|254780799|r 341 SVLSDFGIQGEIVNVRPGP---VITLYELEPAPGIKSSRIIG--LSDDIARSMSAI-SARVAVIPRRNAIGIELPNDIRE 414 (806) Q Consensus 341 ~tL~dFGVe~~Vv~v~pGP---VVTrYEi~PApGVKvSKI~n--LadDIA~aLsa~-svRIapIPGK~~VGIEiPN~~r~ 414 (806) +.|+|-|=-.-||+=-.-- .=-.-.|=|.+|+.--.|+. =-+|+ |+.. |+-=.=.-|+-. |+||-..|+ T Consensus 527 ~~LRDlGNTviVVEHDedti~~AD~iIDiGPgAG~~GGeIv~~Gtp~~i---~~~~~SlTG~YLsg~~~--i~vP~~rr~ 601 (935) T COG0178 527 KRLRDLGNTVIVVEHDEDTIRAADHIIDIGPGAGEHGGEIVAEGTPEEL---LANPESLTGQYLSGKKT--IEVPKKRRA 601 (935) T ss_pred HHHHHCCCEEEEEECCHHHHHHCCEEEEECCCCCCCCCEEEECCCHHHH---HHCCCCHHHHHHCCCCC--CCCCCCCCC T ss_conf 9998659859999569899964688996079877678789970589999---70986535686468865--689655558 Q ss_pred EEE-EHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHC Q ss_conf 387-34230721343022100021045666666785410020235530477406799999999999829 Q gi|254780799|r 415 TVM-LRDLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRM 482 (806) Q Consensus 415 ~V~-lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~ 482 (806) .=. .=++-......=.+-.+.+.||+= --|.|--|||||--||-.+.-.|.++ T Consensus 602 ~~~~~L~l~gA~~nNLkni~v~iPLg~~---------------t~VTGVSGSGKSTLIn~tL~~a~~~~ 655 (935) T COG0178 602 PNGKWLELKGARRNNLKNIDVEIPLGVF---------------TCVTGVSGSGKSTLINDTLVPALARH 655 (935) T ss_pred CCCCEEEEECCCCCCCCCCEEECCCCCE---------------EEEEECCCCCHHHHHHHHHHHHHHHH T ss_conf 9998699804641474663055116637---------------99983687877786999999999998 No 221 >cd02023 UMPK Uridine monophosphate kinase (UMPK, EC 2.7.1.48), also known as uridine kinase or uridine-cytidine kinase (UCK), catalyzes the reversible phosphoryl transfer from ATP to uridine or cytidine to yield UMP or CMP. In the primidine nucleotide-salvage pathway, this enzyme combined with nucleoside diphosphate kinases further phosphorylates UMP and CMP to form UTP and CTP. This kinase also catalyzes the phosphorylation of several cytotoxic ribonucleoside analogs such as 5-flurrouridine and cyclopentenyl-cytidine. Probab=74.57 E-value=3 Score=21.51 Aligned_cols=33 Identities=33% Similarity=0.465 Sum_probs=21.1 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEEC Q ss_conf 3553047740679999999999982995784788852 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMID 493 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliD 493 (806) +.|||.+|||||-.-+.+.-.| ....|-.|=.| T Consensus 2 IgI~G~sgsGKTT~a~~L~~~l----~~~~v~~i~~D 34 (198) T cd02023 2 IGIAGGSGSGKTTVAEEIIEQL----GNPKVVIISQD 34 (198) T ss_pred EEEECCCCCCHHHHHHHHHHHH----CCCCEEEEECC T ss_conf 8988999885999999999980----99985899788 No 222 >cd03244 ABCC_MRP_domain2 Domain 2 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminus, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resistance lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate. Probab=74.45 E-value=2.6 Score=21.95 Aligned_cols=149 Identities=15% Similarity=0.176 Sum_probs=70.5 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCC---------------HH----HEEEEEECCCHHHHHHCCCCHHHH Q ss_conf 54100202355304774067999999999998299---------------57----847888523100111027703431 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMT---------------PA----QCRLIMIDPKMLELSVYDGIPNLL 509 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~---------------P~----evkliliDPK~vEls~Y~~iPHLl 509 (806) .++.+==++.|-|.+|||||-.++.|.= ||+.+ ++ .+-++.=||-..+-|+++.|-- . T Consensus 25 l~i~~Ge~v~ivG~sGsGKSTLl~ll~g--l~~p~~G~I~i~g~~i~~~~~~~~r~~i~~v~Q~~~lf~~Tv~eNi~~-~ 101 (221) T cd03244 25 FSIKPGEKVGIVGRTGSGKSSLLLALFR--LVELSSGSILIDGVDISKIGLHDLRSRISIIPQDPVLFSGTIRSNLDP-F 101 (221) T ss_pred EEECCCCEEEEECCCCCCHHHHHHHHHC--CCCCCCCEEEECCEECCCCCHHHHHHHEEEEECCCCCCCCCHHHHHCC-C T ss_conf 9986998999999999989999999967--971898489999999661999999740799930352356008875071-0 Q ss_pred CCCCCCHHHHHHHHHH-----HHHHHHHHHHHHHHCCCCCHHHHH-HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2233430456689999-----999999999999870899689999-9999887447866775446776545432223322 Q gi|254780799|r 510 TPVVTNPQKAVTVLKW-----LVCEMEERYQKMSKIGVRNIDGFN-LKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEH 583 (806) Q Consensus 510 ~pVvTd~~kA~~aL~w-----~V~EMe~RY~l~a~~~vRni~~yN-~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 583 (806) .-.+|- +...+|+- .+.++...++..-..+..|+.+=. +|+.-|++- T Consensus 102 -~~~~~~-~i~~~l~~~~l~~~~~~~~~~l~~~~~~~~~~LSgGqkQrv~lARal------------------------- 154 (221) T cd03244 102 -GEYSDE-ELWQALERVGLKEFVESLPGGLDTVVEEGGENLSVGQRQLLCLARAL------------------------- 154 (221) T ss_pred -CCCCHH-HHHHHHHHHCHHHHHHHCCCCCCCEECCCCCCCCHHHHHHHHHHHHH------------------------- T ss_conf -237999-99998875160899974104445470366677999999999999999------------------------- Q ss_pred CCCCCCCEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 323469868776344-68888732100588999999866414237999965777 Q gi|254780799|r 584 FDFQHMPYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 584 ~~~~~lp~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) -.=| =|++.|| .+-|=.. -|..|..+-.. ...|.-+|+.|+|++ T Consensus 155 ---~~~p-~ililDEpts~LD~~----~~~~i~~~l~~-~~~~~Tvi~itH~l~ 199 (221) T cd03244 155 ---LRKS-KILVLDEATASVDPE----TDALIQKTIRE-AFKDCTVLTIAHRLD 199 (221) T ss_pred ---HCCC-CEEEEECCCCCCCHH----HHHHHHHHHHH-HCCCCEEEEEECCHH T ss_conf ---6599-999997975679999----99999999998-779989999915889 No 223 >KOG0060 consensus Probab=74.27 E-value=4 Score=20.54 Aligned_cols=155 Identities=23% Similarity=0.350 Sum_probs=74.8 Q ss_pred CCEEEEECC-CCC---EEEEEECCCCHHHHHHHHHH---------HHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHC Q ss_conf 666785410-020---23553047740679999999---------99998299578478885231001110277034312 Q gi|254780799|r 444 GKPIIADLA-RMP---HLLIAGTTGSGKSVAINTMI---------LSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLT 510 (806) Q Consensus 444 G~pvv~DLa-kMP---HLLIAGtTGSGKSV~iN~iI---------~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~ 510 (806) |..++-||. +-| ||||-|-+|+|||.-.-.|= +--..+..|.++=|+==-|=|.-=+.=+.+=.=++ T Consensus 447 g~~lie~Ls~~V~~g~~LLItG~sG~GKtSLlRvlggLWp~~~G~l~k~~~~~~~~lfflPQrPYmt~GTLRdQvIYP~~ 526 (659) T KOG0060 447 GDLLIENLSLEVPSGQNLLITGPSGCGKTSLLRVLGGLWPSTGGKLTKPTDGGPKDLFFLPQRPYMTLGTLRDQVIYPLK 526 (659) T ss_pred CCEEEEEEEEEECCCCEEEEECCCCCCHHHHHHHHHCCCCCCCCEEEECCCCCCCCEEEECCCCCCCCCCHHHHEECCCC T ss_conf 86563210057058975999789987636899998532516787276056788775588368877666544550332575 Q ss_pred CCCCCHHHH-----HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC------CC Q ss_conf 233430456-----6899999999999999998708996899999999887447866775446776545432------22 Q gi|254780799|r 511 PVVTNPQKA-----VTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEA------IY 579 (806) Q Consensus 511 pVvTd~~kA-----~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~------~~ 579 (806) ++=+|.|-| ..-|.|+. --||..=- -|.|+..... .= T Consensus 527 ~~~~~~~~~~d~~i~r~Le~v~--------------L~hl~~r~------------------ggld~~~~~dW~dvLS~G 574 (659) T KOG0060 527 AEDMDSKSASDEDILRILENVQ--------------LGHLLERE------------------GGLDQQVDWDWMDVLSPG 574 (659) T ss_pred CCCCCCCCCCHHHHHHHHHHHH--------------HHHHHHHH------------------CCCCCHHHCCHHHHCCHH T ss_conf 3221013776789999999854--------------55589985------------------788822220577636988 Q ss_pred CCCCCCCCCC----CEEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCC Q ss_conf 3322323469----868776344688887321005889999998664142379999657775 Q gi|254780799|r 580 ETEHFDFQHM----PYIVVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSV 637 (806) Q Consensus 580 ~~~~~~~~~l----p~ivviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsv 637 (806) |.+...+..| |. +.|-||.--= ..-|||.. +.+|-|+.||-+|---.|+|- T Consensus 575 EqQRLa~ARLfy~kPk-~AiLDE~TSA---v~~dvE~~---~Yr~~r~~giT~iSVgHRkSL 629 (659) T KOG0060 575 EQQRLAFARLFYHKPK-FAILDECTSA---VTEDVEGA---LYRKCREMGITFISVGHRKSL 629 (659) T ss_pred HHHHHHHHHHHHCCCC-EEEEECCHHH---CCHHHHHH---HHHHHHHCCCEEEEECCHHHH T ss_conf 8889999999860883-6876030222---35767999---999999809769996357889 No 224 >pfam10236 DAP3 Mitochondrial ribosomal death-associated protein 3. This is a family of conserved proteins which were originally described as death-associated-protein-3 (DAP-3). The proteins carry a P-loop DNA-binding motif, and induce apoptosis. DAP3 has been shown to be a pro-apoptotic factor in the mitochondrial matrix and to be crucial for mitochondrial biogenesis and so has also been designated as MRP-S29 (mitochondrial ribosomal protein subunit 29). Probab=74.21 E-value=2.9 Score=21.66 Aligned_cols=142 Identities=16% Similarity=0.215 Sum_probs=69.8 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHH-HCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 23553047740679999999999982995784788852310011102770343-12233430456689999999999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNL-LTPVVTNPQKAVTVLKWLVCEMEERY 534 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHL-l~pVvTd~~kA~~aL~w~V~EMe~RY 534 (806) -.++-|-.|||||++++-++.-=..+ +-+=+=.=|++..=-..++=.|.- -.-...-|-.|..-|+....-= - T Consensus 25 r~vL~G~~GsGKS~~L~q~v~~A~~~---~wiVl~vP~~~~~~~~~~~~~~s~~~~~~ydqP~~a~~~L~~~~~~N---~ 98 (274) T pfam10236 25 RFVLTGERGSGKSVLLAQAMAYALTQ---GWVVLHVPEAEDLVNGTTDYAPDPGNPGLYDQPMYAAALLQRILKAN---E 98 (274) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHHC---CEEEEECCCHHHHHCCCCEEEECCCCCCEEECHHHHHHHHHHHHHHH---H T ss_conf 89988979977999999999999859---98999849889983086427688999971357899999999999986---9 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC-CCCC---CCCCCCCCCCEEEEEHHHHHHHHHH Q ss_conf 999870899689999999988744786677544677654543-2223---3223234698687763446888873 Q gi|254780799|r 535 QKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGE-AIYE---TEHFDFQHMPYIVVVIDEMADLMMV 605 (806) Q Consensus 535 ~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~~~~~~~~lp~ivviiDElaDlmm~ 605 (806) ++|+++.+..=-.+.+ .+....|.++.+=+..|.+....- +.+. +|--...+-|-+.+.||.+.-+|-. T Consensus 99 ~~L~~l~~s~~~~~~k--~e~~~~g~tL~dL~~~gi~~~~~a~~~~~al~~EL~~~s~~~pvLvavDg~n~~~~~ 171 (274) T pfam10236 99 EVLKKLKLSKDYKWLK--RESTPAGSTLLDLASLGINDAKSAWDVFQALWKELTAQSKAPPVLVAVDGFNALFGP 171 (274) T ss_pred HHHHHCCCCCCCCCCC--CCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEECCHHHHHCC T ss_conf 9996482556443665--664799997999998623470246899999999998347999889996585675576 No 225 >TIGR03167 tRNA_sel_U_synt tRNA 2-selenouridine synthase. The Escherichia coli YbbB protein was shown to encode a selenophosphate-dependent tRNA 2-selenouridine synthase, essential for modification of some tRNAs to replace a sulfur atom with selenium. This enzyme works with SelD, the selenium donor protein, which also acts in selenocysteine incorporation. Although the members of this protein family show a fairly deep split, sequences from both sides of the split are supported by co-occurence with, and often proximity to, the selD gene. Probab=74.07 E-value=2.8 Score=21.76 Aligned_cols=13 Identities=46% Similarity=0.785 Sum_probs=4.3 Q ss_pred EEEECCCCHHHHH Q ss_conf 5530477406799 Q gi|254780799|r 458 LIAGTTGSGKSVA 470 (806) Q Consensus 458 LIAGtTGSGKSV~ 470 (806) .++|-|||||+-- T Consensus 131 vl~G~TG~GKT~i 143 (311) T TIGR03167 131 VLGGMTGSGKTEL 143 (311) T ss_pred EEECCCCCCHHHH T ss_conf 9878888778999 No 226 >PRK08769 DNA polymerase III subunit delta'; Validated Probab=74.03 E-value=3.5 Score=20.97 Aligned_cols=155 Identities=22% Similarity=0.302 Sum_probs=77.6 Q ss_pred CCCCCE-EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHH Q ss_conf 100202-3553047740679999999999982995784788852310011102770343122334304566899999999 Q gi|254780799|r 451 LARMPH-LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCE 529 (806) Q Consensus 451 LakMPH-LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~E 529 (806) -.+.|| +|+-|.-|+||+..-..+.-.||.. .|+.+-- --..+.++-...-|+ |++.|+-. +...-++- T Consensus 22 ~~rl~HA~Lf~Gp~G~GK~~~A~~~A~~llc~-~~~~~~~-~~~~~~i~~g~HPD~-~~i~~~~~---~~~~k~k~---- 91 (319) T PRK08769 22 AGRLGHGLLICGPEGLGKRAVALALAEHVLAS-GPDPALA-QRTRQLIAAGTHPDL-QLVSFIPN---RTGDKLRT---- 91 (319) T ss_pred CCCCCCEEEEECCCCCCHHHHHHHHHHHHHCC-CCCCCCC-CHHHHHHHCCCCCCE-EEEECCCC---CCCCCCCC---- T ss_conf 69942068758999878999999999998379-9797654-338899966899896-87753444---45431123---- Q ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCC-- Q ss_conf 999999998708996899999999887447866775446776545432223322323469868776344688887321-- Q gi|254780799|r 530 MEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVAR-- 607 (806) Q Consensus 530 Me~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~-- 607 (806) -..--.+|++ .+++.. . | ..=+|=|+|||| ||.|-... T Consensus 92 ------~I~IdqiR~l---~~~~~~---~-------------p--------------~~g~~KV~IId~-Ad~mn~~AaN 131 (319) T PRK08769 92 ------EIVIEQVREI---SQKLAL---T-------------P--------------QYGIAQVVIVDP-ADAINRSACN 131 (319) T ss_pred ------CCCHHHHHHH---HHHHHH---C-------------C--------------CCCCCEEEEECC-HHHCCHHHHH T ss_conf ------4869999999---999613---7-------------2--------------027956999806-6752899999 Q ss_pred ---CHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCC Q ss_conf ---005889999998664142379999657775355435541102515876458664212338 Q gi|254780799|r 608 ---KDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILG 667 (806) Q Consensus 608 ---~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild 667 (806) |-+|++ ..+.+.||.|.+|+ =+-.+|++-- -||.|++-+.-+..--|- T Consensus 132 alLK~LEEP---------p~~~~~iL~~~~~~--~ll~TI~SRC-q~~~~~~p~~~~~~~~L~ 182 (319) T PRK08769 132 ALLKTLEEP---------SPGRYLWLISAQPA--RLPATIRSRC-QRLEFKLPPAHEALAWLL 182 (319) T ss_pred HHHHHHCCC---------CCCEEEEEEECCHH--HCCHHHHHCC-EEECCCCCCHHHHHHHHH T ss_conf 999982279---------98848999869936--5824776485-011189969999999999 No 227 >cd07029 RNAP_I_III_AC19 AC19 subunit of Eukaryotic RNA polymerase (RNAP) I and RNAP III. The eukaryotic AC19 subunit of RNA polymerase (RNAP) I and RNAP III is involved in the assembly of RNAP subunits. RNAP is a large multi-subunit complex responsible for the synthesis of RNA. It is the principal enzyme of the transcription process, and is a final target in many regulatory pathways that control gene expression in all living cells. At least three distinct RNAP complexes are found in eukaryotic nuclei: RNAP I, RNAP II, and RNAP III. RNAP I is responsible for the synthesis of ribosomal RNA precursor, while RNAP III functions in the synthesis of 5S and tRNA. The AC19 subunit is the equivalent of the RPB11 subunit of RNAP II. The RPB11 subunit heterodimerizes with the RPB3 subunit, and together with RPB10 and RPB12, anchors the two largest subunits, RPB1 and RPB2, and stabilizes their association. The homology of AC19 to RPB11 suggests a similar function. The AC19 subunit is likely to ass Probab=73.93 E-value=2.2 Score=22.48 Aligned_cols=46 Identities=28% Similarity=0.396 Sum_probs=35.2 Q ss_pred EECCCHHHHHHCCCCHHHHCC-----CCCCH-HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 852310011102770343122-----33430-4566899999999999999998 Q gi|254780799|r 491 MIDPKMLELSVYDGIPNLLTP-----VVTNP-QKAVTVLKWLVCEMEERYQKMS 538 (806) Q Consensus 491 liDPK~vEls~Y~~iPHLl~p-----VvTd~-~kA~~aL~w~V~EMe~RY~l~a 538 (806) |-||+ |++..| .|||=+.+ |-|+- ..|..||+|+++.+..=.+.+. T Consensus 31 ~~~~~-V~fagY-~ipHPl~~~i~irIqT~~~~~~~~al~~al~dL~~~~~~i~ 82 (85) T cd07029 31 MKNPE-VEFCGY-SIPHPSENKINLRIQTKGGEPAVDVLKKGLEDLEQICDHIL 82 (85) T ss_pred HHCCC-CEEEEC-CCCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 77999-548532-68999987137999958999989999999999999999998 No 228 >PRK13876 conjugal transfer coupling protein TraG; Provisional Probab=73.67 E-value=0.86 Score=25.58 Aligned_cols=104 Identities=20% Similarity=0.268 Sum_probs=62.0 Q ss_pred EEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCC-CC---CCHHHHHCCCCEEEEEECCCCCCHHHC Q ss_conf 68776344688887321005889999998664142379999657775-35---543554110251587645866421233 Q gi|254780799|r 591 YIVVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSV-DV---ITGTIKANFPTRISFQVSSKIDSRTIL 666 (806) Q Consensus 591 ~ivviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsv-dv---itg~ikan~p~riaf~v~s~~dSrtil 666 (806) ......|||+-| || ++..-.-+|-| |.-||.+.+-+|-.+= += -.+.|-.|.--||+|+..-.-+-+-|= T Consensus 401 ~vLlmLDEF~aL----Gk-l~iie~Ala~m-AGYGirl~lI~Qsl~QL~~~YG~~~sil~Nc~vri~fapnd~~TAe~lS 474 (662) T PRK13876 401 RLLLMLDEFPAL----GR-LDFFESALAFM-AGYGIKSFLIAQSLNQIEKAYGQNNSILDNCHVRVSFATNDERTAKRVS 474 (662) T ss_pred EEEEEEECCCCC----CC-HHHHHHHHHHH-CCCCCEEEEEECCHHHHHHHHCCCCHHHCCCEEEEEECCCCHHHHHHHH T ss_conf 179986064105----87-28999999987-0368778888706999998748774021175169996789753499999 Q ss_pred CCC--------------------------------------CHHHHCCCCCEEEECCCCCEEEEEECC-CCHH Q ss_conf 886--------------------------------------457865887547736898325888334-8988 Q gi|254780799|r 667 GEQ--------------------------------------GAEQLLGQGDMLYMTGGGRVQRIHGPF-VSDI 700 (806) Q Consensus 667 d~~--------------------------------------gae~Llg~gdml~~~~~~~~~r~~g~~-v~~~ 700 (806) +.- +-=.-|..+||+-+-.|.+|+|.+-+. -.|. T Consensus 475 ~~lG~~T~~~~~~s~sg~r~s~~lg~~~~S~se~~RpLltPdEv~~Lp~d~~II~v~G~~PI~akKi~Yy~Dp 547 (662) T PRK13876 475 DALGTATELRAMKNYAGHRLSPWLGHLMVSRQETARPLLTPGEVMQLPPDDEIVMVSGVPPIRAKKARYYEDA 547 (662) T ss_pred HHHCCEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHCCCCCCEEEECCCCCCEEECEEEECCCH T ss_conf 9839313777787305776663337774321112465789889940899867998079997343521451685 No 229 >PRK00090 bioD dithiobiotin synthetase; Reviewed Probab=73.57 E-value=3.7 Score=20.82 Aligned_cols=17 Identities=47% Similarity=0.812 Sum_probs=8.8 Q ss_pred EEEEEC-CCCHHHHHHHH Q ss_conf 355304-77406799999 Q gi|254780799|r 457 LLIAGT-TGSGKSVAINT 473 (806) Q Consensus 457 LLIAGt-TGSGKSV~iN~ 473 (806) +.|+|| ||+|||+.--. T Consensus 2 ifI~GT~T~vGKT~vt~~ 19 (223) T PRK00090 2 LFVTGTDTGVGKTVVTAA 19 (223) T ss_pred EEEEECCCCCCHHHHHHH T ss_conf 899868999769999999 No 230 >cd01125 repA Hexameric Replicative Helicase RepA. RepA is encoded by a plasmid, which is found in most Gram negative bacteria. RepA is a 5'-3' DNA helicase which can utilize ATP, GTP and CTP to a lesser extent. Probab=73.54 E-value=6.7 Score=18.87 Aligned_cols=44 Identities=16% Similarity=0.345 Sum_probs=34.1 Q ss_pred CEEEEEHHHHHHHHHHCC-----CHHHHHHHHHHHHHHHCCEEEEEEECCC Q ss_conf 868776344688887321-----0058899999986641423799996577 Q gi|254780799|r 590 PYIVVVIDEMADLMMVAR-----KDIESAVQRLAQMARASGIHVIMATQRP 635 (806) Q Consensus 590 p~ivviiDElaDlmm~~~-----~~ve~~i~rlaq~ara~GiHli~aTqrP 635 (806) ..=+||||=|+... .+ .++-..|+++.+.|+..|.+.++.---- T Consensus 111 ~~~LVViDpL~~~~--~~dENd~~~m~~~i~~l~~ia~~tg~aVl~vHHt~ 159 (239) T cd01125 111 RIDLVVIDPLVSFH--GVSENDNGAMDAVIKALRRIAAQTGAAILLVHHVR 159 (239) T ss_pred CCCEEEECCCHHHC--CCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECCC T ss_conf 99899983817748--99978899999999999999997199999970688 No 231 >PRK00984 truD tRNA pseudouridine synthase D; Reviewed Probab=73.44 E-value=6.8 Score=18.85 Aligned_cols=23 Identities=13% Similarity=-0.016 Sum_probs=14.1 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCE Q ss_conf 25665457899999974248632 Q gi|254780799|r 328 SPKVMQNNACTLKSVLSDFGIQG 350 (806) Q Consensus 328 s~eeL~~nA~lLE~tL~dFGVe~ 350 (806) +.+.++..|+.+--..+++|.-| T Consensus 55 T~~~~~~lA~~l~i~~~~igyaG 77 (344) T PRK00984 55 TLFAARELAKFLGISRREVSYAG 77 (344) T ss_pred HHHHHHHHHHHCCCCHHHCCCCC T ss_conf 99999999998099888824256 No 232 >PRK11054 helD DNA helicase IV; Provisional Probab=73.26 E-value=6.8 Score=18.82 Aligned_cols=40 Identities=20% Similarity=0.254 Sum_probs=26.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEEECCCCCHHHHHHHHHHHHHHHH Q ss_conf 6325665457899999974248632899841044244444321478639999978899998863 Q gi|254780799|r 326 TFSPKVMQNNACTLKSVLSDFGIQGEIVNVRPGPVITLYELEPAPGIKSSRIIGLSDDIARSMS 389 (806) Q Consensus 326 ~~s~eeL~~nA~lLE~tL~dFGVe~~Vv~v~pGPVVTrYEi~PApGVKvSKI~nLadDIA~aLs 389 (806) +.+++.-+++.+.|.+.| | +.+|+++.+-+|+-.|-+... T Consensus 248 TFT~kAA~EMreRl~~~l---g---------------------~~~v~~~TFHSlal~ILr~~~ 287 (684) T PRK11054 248 AFGRKAAEEMDERIRERL---G---------------------TEDITARTFHALALHIIQQGS 287 (684) T ss_pred ECCHHHHHHHHHHHHHHC---C---------------------CCCEEEEEHHHHHHHHHHHHC T ss_conf 634999999999999754---9---------------------998378659999999999844 No 233 >cd03228 ABCC_MRP_Like The MRP (Mutidrug Resistance Protein)-like transporters are involved in drug, peptide, and lipid export. They belong to the subfamily C of the ATP-binding cassette (ABC) superfamily of transport proteins. The ABCC subfamily contains transporters with a diverse functional spectrum that includes ion transport, cell surface receptor, and toxin secretion activities. The MRP-like family, simlar to all ABC proteins, have a common four-domain core structure constituted by two membrane-spanning domains, each composed of six transmembrane (TM) helices, and two nucleotide-binding domains (NBD). ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=73.18 E-value=2.8 Score=21.78 Aligned_cols=40 Identities=30% Similarity=0.511 Sum_probs=24.4 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEEC Q ss_conf 541002023553047740679999999999982995784788852 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMID 493 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliD 493 (806) .++.+==.+.|.|.+|||||-.++.|.= + .+|++=+ |.+| T Consensus 23 l~i~~Ge~i~ivG~sGsGKSTLl~ll~g--l--~~p~~G~-I~i~ 62 (171) T cd03228 23 LTIKPGEKVAIVGPSGSGKSTLLKLLLR--L--YDPTSGE-ILID 62 (171) T ss_pred EEECCCCEEEEECCCCCHHHHHHHHHHC--C--CCCCCCE-EEEC T ss_conf 9985998999999999839999999976--7--7589748-9999 No 234 >PRK06696 uridine kinase; Validated Probab=73.15 E-value=3.1 Score=21.43 Aligned_cols=25 Identities=32% Similarity=0.444 Sum_probs=19.8 Q ss_pred CE-EEEEECCCCHHHHHHHHHHHHHH Q ss_conf 02-35530477406799999999999 Q gi|254780799|r 455 PH-LLIAGTTGSGKSVAINTMILSLL 479 (806) Q Consensus 455 PH-LLIAGtTGSGKSV~iN~iI~SlL 479 (806) |. +-|+|.+|||||-.-|.+.-.|= T Consensus 26 pl~VgIdG~~gSGKTTlA~~La~~L~ 51 (227) T PRK06696 26 PLRVAIDGITASGKTTFANELAEEIK 51 (227) T ss_pred CEEEEEECCCCCCHHHHHHHHHHHHH T ss_conf 68999778998787999999999997 No 235 >pfam11543 UN_NPL4 Nuclear pore localisation protein NPL4. Npl4 is part of the heterodimer UN along with Ufd1 which is involved in the recruitment of p97, an AAA ATPase, for tasks involving the ubiquitin pathway. Npl4 has a ubiquitin-like domain which has within its structure a beta-grasp fold with a helical insert. Probab=73.14 E-value=1.9 Score=23.05 Aligned_cols=13 Identities=54% Similarity=0.838 Sum_probs=9.4 Q ss_pred HCCCCCEEEECCC Q ss_conf 6588754773689 Q gi|254780799|r 674 LLGQGDMLYMTGG 686 (806) Q Consensus 674 Llg~gdml~~~~~ 686 (806) =|.+|||||+-|+ T Consensus 68 ~lkHGDmlyL~~~ 80 (80) T pfam11543 68 GLKHGDMLYLFPG 80 (80) T ss_pred CCCCCCEEEECCC T ss_conf 8787768985589 No 236 >pfam12252 SidE Dot/Icm substrate protein. This family of proteins is found in bacteria. Proteins in this family are typically between 397 and 1543 amino acids in length. This family is the SidE protein in the Dot/Icm pathway of Legionella pneumophila bacteria. There is little literature describing the family. Probab=73.09 E-value=6.9 Score=18.79 Aligned_cols=48 Identities=23% Similarity=0.351 Sum_probs=43.4 Q ss_pred HHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 312233430456689999999999999999870899689999999988 Q gi|254780799|r 508 LLTPVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQY 555 (806) Q Consensus 508 Ll~pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~ 555 (806) ..||||||++|+-.--.-.+..+-+|-.-|..+.--|+++|++.+... T Consensus 1090 v~tpvvtd~ekvr~RYd~Li~~~tkrIt~le~~kl~~l~~ikK~i~~l 1137 (1443) T pfam12252 1090 VVTPVVTDEEKVRERYDALIENTSKRITELEKAKLANLDVVKKAISNL 1137 (1443) T ss_pred HCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH T ss_conf 057531578999999999986056776664316765478899987447 No 237 >cd07027 RNAP_RPB11_like RPB11 subunit of RNA polymerase. The eukaryotic RPB11 subunit of RNA polymerase (RNAP), as well as its archaeal (L subunit) and bacterial (alpha subunit) counterparts, is involved in the assembly of RNAP subunits. RNAP is a large multi-subunit complex responsible for the synthesis of RNA. It is the principal enzyme of the transcription process, and is a final target in many regulatory pathways that control gene expression in all living cells. At least three distinct RNAP complexes are found in eukaryotic nuclei: RNAP I, RNAP II, and RNAP III, for the synthesis of ribosomal RNA precursor, mRNA precursor, and 5S and tRNA, respectively. A single distinct RNAP complex is found in prokaryotes and archaea, which may be responsible for the synthesis of all RNAs. The assembly of the two largest eukaryotic RNAP subunits that provide most of the enzyme's catalytic functions depends on the presence of RPB3/RPB11 heterodimer subunits. This is also true for the archaeal (D/ Probab=73.03 E-value=2 Score=22.86 Aligned_cols=44 Identities=20% Similarity=0.336 Sum_probs=33.9 Q ss_pred ECCCHHHHHHCCCCHHHHCC-----CCCCH-HHHHHHHHHHHHHHHHHHHHH Q ss_conf 52310011102770343122-----33430-456689999999999999999 Q gi|254780799|r 492 IDPKMLELSVYDGIPNLLTP-----VVTNP-QKAVTVLKWLVCEMEERYQKM 537 (806) Q Consensus 492 iDPK~vEls~Y~~iPHLl~p-----VvTd~-~kA~~aL~w~V~EMe~RY~l~ 537 (806) =||+ |+++.|. |||=+.+ |-|+. .++..||+++++...+-|+-+ T Consensus 32 ~~~~-V~fA~Y~-ipHPl~~~~~lrIqT~~~~~p~~al~~a~~~L~~~~~~l 81 (83) T cd07027 32 KDDQ-VDFARYY-IKHPVIDKIQIRIQTKSGIKPKDALKRAVNKLSKLYEHL 81 (83) T ss_pred CCCC-EEEEEEE-CCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 7998-2798410-889988940899998999998999999999999998750 No 238 >pfam07447 VP40 Matrix protein VP40. This family contains viral VP40 matrix proteins that seem to be restricted to the Filoviridae. These play an important role in the assembly process of virus particles by interacting with cellular factors, cellular membranes, and the ribonuclearprotein particle complex. It has been shown that the N-terminal region of VP40 folds into a mixture of hexameric and octameric states - these may have distinct roles. Probab=72.85 E-value=2.5 Score=22.09 Aligned_cols=30 Identities=40% Similarity=0.654 Sum_probs=24.0 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCC-CCCCEEEEEECCC Q ss_conf 39999978899998863001210003-8861255530245 Q gi|254780799|r 373 KSSRIIGLSDDIARSMSAISARVAVI-PRRNAIGIELPND 411 (806) Q Consensus 373 KvSKI~nLadDIA~aLsa~svRIapI-PGK~~VGIEiPN~ 411 (806) |+.-|.|.-.|+ ||.|| |-|+-+|||+|-. T Consensus 236 kiqaIm~~l~dl---------ki~~id~~kni~gIevPe~ 266 (295) T pfam07447 236 KILAIMGAIHDL---------KIEKIDEIKNIFGIELPAD 266 (295) T ss_pred HHHHHHHHHHHC---------EEEECCCCCCEEEEECHHH T ss_conf 589999767545---------1455586554345423799 No 239 >PRK13651 cobalt transporter ATP-binding subunit; Provisional Probab=72.67 E-value=2.8 Score=21.67 Aligned_cols=97 Identities=19% Similarity=0.192 Sum_probs=54.0 Q ss_pred EEEHHHHH-HHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCH Q ss_conf 77634468-88873210058899999986641423799996577753554355411025158764586642123388645 Q gi|254780799|r 593 VVVIDEMA-DLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGA 671 (806) Q Consensus 593 vviiDEla-Dlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~ga 671 (806) |++-||=. -|=-...++|-+.|.+|.+ .|+-.|+.|..-. . =+.+-.|+.+ --+-|.+.|. -. T Consensus 182 iLlLDEPTagLDp~~~~~i~~~l~~L~~----~G~TVI~vTHdm~--~-----v~~~adRviv----l~~G~Iv~~G-~p 245 (304) T PRK13651 182 FLVFDEPTAGLDPQGVKEILEIFDTLNK----KGKTIILVTHDLD--N-----VLEWTKRTIF----FKDGKIIKDG-DT 245 (304) T ss_pred EEEECCCCCCCCHHHHHHHHHHHHHHHH----CCCEEEEECCCHH--H-----HHHHCCEEEE----EECCEEEEEC-CH T ss_conf 9997298665898999999999999997----7999999867899--9-----9997999999----9899899986-88 Q ss_pred HHHCCCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHCCCCC Q ss_conf 786588754773689832588833489889999999997128974 Q gi|254780799|r 672 EQLLGQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKTQGEAK 716 (806) Q Consensus 672 e~Llg~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q~~~~ 716 (806) +.++ +.+..++-+.+.-..+-++...++++|-|- T Consensus 246 ~evf-----------~~~~~l~~~~l~~P~~~~l~~~L~~~g~~~ 279 (304) T PRK13651 246 YEIL-----------NDEKFLIENKMLPPKLLNFVNKLIKKGIPI 279 (304) T ss_pred HHHH-----------CCHHHHHHCCCCCCHHHHHHHHHHHCCCCC T ss_conf 9986-----------798899877999981999999999769999 No 240 >smart00344 HTH_ASNC helix_turn_helix ASNC type. AsnC: an autogenously regulated activator of asparagine synthetase A transcription in Escherichia coli Probab=72.54 E-value=6.7 Score=18.87 Aligned_cols=47 Identities=19% Similarity=0.490 Sum_probs=35.4 Q ss_pred CHHHHHHHHHHHHCCCEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCC Q ss_conf 70389999999965985000142220011778999999999779868 Q gi|254780799|r 740 DDLYKQAVDIVLRDNKASISYIQRRLGIGYNRAASIIENMEEKGVIG 786 (806) Q Consensus 740 d~l~~~a~~~v~~~~~~s~s~lqr~~~igy~raar~~~~~e~~giv~ 786 (806) |++=.+-++...+.++.|.+-|-|++.+-=....+=|+.||+.|+|- T Consensus 2 D~~D~~Il~~L~~n~R~s~~~iA~~lg~S~~tv~~Ri~~L~~~GiI~ 48 (108) T smart00344 2 DEIDRKILEELQKDARISLAELAKKVGLSPSTVHNRVKRLEEEGVIK 48 (108) T ss_pred CHHHHHHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCCC T ss_conf 88999999999982899999999998939999999999996089732 No 241 >smart00347 HTH_MARR helix_turn_helix multiple antibiotic resistance protein. Probab=72.27 E-value=4.8 Score=19.96 Aligned_cols=56 Identities=25% Similarity=0.501 Sum_probs=44.9 Q ss_pred HHHHHHHHCCCEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHC-CCCCEEEECH Q ss_conf 9999999659850001422200117789999999997798680227-8872673171 Q gi|254780799|r 745 QAVDIVLRDNKASISYIQRRLGIGYNRAASIIENMEEKGVIGPASS-TGKREILISS 800 (806) Q Consensus 745 ~a~~~v~~~~~~s~s~lqr~~~igy~raar~~~~~e~~giv~~~~g-~~~r~vl~~~ 800 (806) .++.++...+-.+.+-|-+++.+--+...|+++.||++|+|--... .-.|.+.+.- T Consensus 14 ~vL~~l~~~~~~t~~~la~~l~~~~~~vs~~v~~L~~~glv~r~~~~~D~R~~~i~L 70 (101) T smart00347 14 LVLRILYEEGPLSVSELAKRLGVSPSTVTRVLDRLEKKGLIRRLPSPEDRRSVLVSL 70 (101) T ss_pred HHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHHHHHCCCCEEEEECCCCCCEEEEEE T ss_conf 999999976997999999998968879999999994589389821899988589899 No 242 >pfam08007 Cupin_4 Cupin superfamily protein. This family contains many hypothetical proteins that belong to the cupin superfamily. Probab=72.17 E-value=1.9 Score=22.92 Aligned_cols=45 Identities=22% Similarity=0.259 Sum_probs=24.4 Q ss_pred CCCHHHHCCCCCEEEECCC-CCEEEEEEC-------CC--CH-HHHHHHHHHHHHC Q ss_conf 8645786588754773689-832588833-------48--98-8999999999712 Q gi|254780799|r 668 EQGAEQLLGQGDMLYMTGG-GRVQRIHGP-------FV--SD-IEVEKVVSHLKTQ 712 (806) Q Consensus 668 ~~gae~Llg~gdml~~~~~-~~~~r~~g~-------~v--~~-~ev~~v~~~~~~q 712 (806) ++=-|-.|--|||||+|+| .+=-+-++. |= +- +=+..+.+++-++ T Consensus 174 ep~~d~vLepGD~LYiPpG~~H~g~a~~~s~~~SvGfrap~~~el~~~~~d~l~~~ 229 (320) T pfam08007 174 EPVHDFVLEPGDMLYLPRGFIHQGVALDDSLHYSVGFRAPTAAELLSSFLDYLLDK 229 (320) T ss_pred CCCEEEEECCCCEEEECCCCCCCCCCCCCCEEEEECEECCCHHHHHHHHHHHHHHH T ss_conf 85058997688879858998644302888358881212798899999999999862 No 243 >KOG0922 consensus Probab=72.11 E-value=1.6 Score=23.59 Aligned_cols=20 Identities=5% Similarity=0.069 Sum_probs=10.5 Q ss_pred CHHHHHHHHHHHHHHHHHCC Q ss_conf 25665457899999974248 Q gi|254780799|r 328 SPKVMQNNACTLKSVLSDFG 347 (806) Q Consensus 328 s~eeL~~nA~lLE~tL~dFG 347 (806) .+||++.-.++|.+-.++-+ T Consensus 267 GqeEIe~~~~~l~e~~~~~~ 286 (674) T KOG0922 267 GQEEIEAACELLRERAKSLP 286 (674) T ss_pred CHHHHHHHHHHHHHHHHHCC T ss_conf 88899999999998765312 No 244 >cd02575 PseudoU_synth_EcTruD PseudoU_synth_EcTruD: Pseudouridine synthase, TruD family. This group consists of bacterial pseudouridine synthases similar to Escherichia coli TruD. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). E. coli TruD makes the highly phylogenetically conserved psi13 in tRNAs. Probab=72.11 E-value=3.2 Score=21.24 Aligned_cols=140 Identities=19% Similarity=0.225 Sum_probs=83.7 Q ss_pred HHHHHHHHCCEEE------------EEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCHHHHCCCCCEEEE Q ss_conf 9998664142379------------9996577753554355411025158764586642123388645786588754773 Q gi|254780799|r 616 RLAQMARASGIHV------------IMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGAEQLLGQGDMLYM 683 (806) Q Consensus 616 rlaq~ara~GiHl------------i~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~gae~Llg~gdml~~ 683 (806) =+.+.||++||+. =+.||.=||... +..-|...+| ..+.=+|++..-..+=|..||.. T Consensus 41 ~~~~lA~~lgi~~~~igyAGlKDk~AvT~Q~~Sv~~~----~~~~~~~~~~----~~~~~~il~~~~~~~kL~~G~l~-- 110 (253) T cd02575 41 VAKELAKALGVKERDVGYAGLKDRHAVTTQWFSVHLP----GKEAPDLSAL----QLEGVKILEVTRHNRKLRRGHLK-- 110 (253) T ss_pred HHHHHHHHHCCCHHHEEECCCCCCCCCEEEEEEEECC----CCCCCCHHHC----CCCCEEEEEEEECCCCCCCCCCC-- T ss_conf 9999999819986771033036676336999999778----6456444445----78986999987158986677655-- Q ss_pred CCCCC-EEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEHHH Q ss_conf 68983-25888334898899999999971289742110012455666778888877770389999999965985000142 Q gi|254780799|r 684 TGGGR-VQRIHGPFVSDIEVEKVVSHLKTQGEAKYIDIKDKILLNEEMRFSENSSVADDLYKQAVDIVLRDNKASISYIQ 762 (806) Q Consensus 684 ~~~~~-~~r~~g~~v~~~ev~~v~~~~~~q~~~~y~~~~~~~~~~~~~~~~~~~~~~d~l~~~a~~~v~~~~~~s~s~lq 762 (806) |++ -+++.+.=-+.++++..++.++++|-|+|-...--+. +.+. +.+|.++.....+. .. T Consensus 111 --GN~F~I~lR~v~~~~~~~~~~l~~i~~~G~PNYFG~QRFG~------------~g~N-~~~a~~~~~~~~~~----~~ 171 (253) T cd02575 111 --GNRFVIRLRGVSGNADKLEQRLETIAQKGVPNYFGPQRFGR------------DGGN-LLQGLRLLAGERKV----RD 171 (253) T ss_pred --CCEEEEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCCCCCC------------CCCC-HHHHHHHHCCCCCC----CC T ss_conf --88799999636789899999999998639833333303689------------8877-99999986178445----79 Q ss_pred HHHCCCHHHHHH-------HHHHHHHCCC Q ss_conf 220011778999-------9999997798 Q gi|254780799|r 763 RRLGIGYNRAAS-------IIENMEEKGV 784 (806) Q Consensus 763 r~~~igy~raar-------~~~~~e~~gi 784 (806) |++|-=|=-|+| |-+.+|+... T Consensus 172 r~~r~~~lsA~qS~LFN~~Ls~Rv~~~~~ 200 (253) T cd02575 172 RNKRSFLLSAARSYLFNRVLSERLEQGSW 200 (253) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 87877689999999999999999976897 No 245 >PRK01637 rbn ribonuclease BN; Reviewed Probab=72.01 E-value=7.3 Score=18.62 Aligned_cols=10 Identities=0% Similarity=-0.151 Sum_probs=4.0 Q ss_pred HHHHHHHCCC Q ss_conf 9999844270 Q gi|254780799|r 41 ITLALGTWDV 50 (806) Q Consensus 41 l~iSL~SYsp 50 (806) +++++++|.| T Consensus 45 ~~~sv~~~~~ 54 (284) T PRK01637 45 VVFSIFSAFP 54 (284) T ss_pred HHHHHHHHHC T ss_conf 9999999811 No 246 >PRK09824 beta-glucoside-specific PTS system components IIABC; Provisional Probab=71.95 E-value=7.3 Score=18.61 Aligned_cols=47 Identities=13% Similarity=-0.055 Sum_probs=20.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHC--CCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999827--65236789999999999999999998 Q gi|254780799|r 83 GIASVFFLPPPTMWALSLLFD--KKIYCFSKRATAWLINILVSATFFAS 129 (806) Q Consensus 83 G~~Ayllpl~Ll~~g~~ll~~--k~~~~~~~Rl~~~il~ll~~s~ll~~ 129 (806) +|.+=++|+++..|-...+-+ ++...-..+.++..++.++....+.. T Consensus 212 ~Y~~sViPiIl~v~~~s~vEk~l~K~iP~~l~~i~~P~ltllI~~pl~l 260 (625) T PRK09824 212 NYSSSVIPIIFSAWLCSILERRLNAWLPSAIKNFFTPLLCLMVITPVTF 260 (625) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8874259999999999999999987473988886988999999999999 No 247 >COG1204 Superfamily II helicase [General function prediction only] Probab=71.91 E-value=7.3 Score=18.60 Aligned_cols=46 Identities=20% Similarity=0.177 Sum_probs=32.2 Q ss_pred CCCHHHHHHHHHHHHCC-CEEEEHHHHHHCCCHHHHHHHHHHHHHCC Q ss_conf 77703899999999659-85000142220011778999999999779 Q gi|254780799|r 738 VADDLYKQAVDIVLRDN-KASISYIQRRLGIGYNRAASIIENMEEKG 783 (806) Q Consensus 738 ~~d~l~~~a~~~v~~~~-~~s~s~lqr~~~igy~raar~~~~~e~~g 783 (806) ....+|.....-+-+-. .+...-+++.-++|+.=+.++.++.-... T Consensus 687 rar~ly~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ 733 (766) T COG1204 687 RARKLYNAGYKSLEDLRLIADPAELLPLTGIGERLVEAILESLGRDV 733 (766) T ss_pred HHHHHHHHHHCCHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 78998873200088887406746540131157989999999740444 No 248 >PRK06090 DNA polymerase III subunit delta'; Validated Probab=71.82 E-value=3.3 Score=21.20 Aligned_cols=147 Identities=22% Similarity=0.287 Sum_probs=83.7 Q ss_pred CCCCCE-EEEEECCCCHHHHHHHHHHHHHHHHCCHHH-------EEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHH Q ss_conf 100202-355304774067999999999998299578-------478885231001110277034312233430456689 Q gi|254780799|r 451 LARMPH-LLIAGTTGSGKSVAINTMILSLLYRMTPAQ-------CRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTV 522 (806) Q Consensus 451 LakMPH-LLIAGtTGSGKSV~iN~iI~SlLyk~~P~e-------vkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~a 522 (806) -.|+|| +|+.|..|.||......+.-.||.+..+++ |+++ .=...-|+ |.+.| .|.. T Consensus 21 ~~rl~HA~L~~g~~G~Gk~~la~~la~~LlC~~~~~~~Cg~C~sC~l~-------~~g~HPD~-~~i~p-----e~~~-- 85 (319) T PRK06090 21 AERIPGALLLQSDEGLGVESLVELFSHALLCQNYQSEACGFCHSCELM-------KSGNHPDL-HVIKP-----EKEG-- 85 (319) T ss_pred CCCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHH-------HCCCCCCC-EEEEC-----CCCC-- T ss_conf 699630676679998579999999999980899999988778779998-------75899982-36612-----3356-- Q ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHH Q ss_conf 99999999999999987089968999999998874478667754467765454322233223234698687763446888 Q gi|254780799|r 523 LKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADL 602 (806) Q Consensus 523 L~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDl 602 (806) ...+|..|-.-|+++...-.. =.|=|||||+ ||. T Consensus 86 ---------------k~I~vd~IR~l~~~~~~~~~~------------------------------g~~KV~iI~~-ae~ 119 (319) T PRK06090 86 ---------------KSITVEQIRQCNRLAQESSQL------------------------------GGYRLFVIEP-ADA 119 (319) T ss_pred ---------------CCCCHHHHHHHHHHHHHCCCC------------------------------CCCEEEEECC-HHH T ss_conf ---------------768799999999997545210------------------------------6936999814-443 Q ss_pred HHHCC-----CHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCC Q ss_conf 87321-----005889999998664142379999657775355435541102515876458664212338864 Q gi|254780799|r 603 MMVAR-----KDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQG 670 (806) Q Consensus 603 mm~~~-----~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~g 670 (806) |-.+. |-.|++ ..+...||.|.+|+ -|-.+|++--= |+.|..-+.-++-.=|-++| T Consensus 120 m~~~AaNALLKtLEEP---------p~~t~fiL~t~~~~--~ll~TI~SRCq-~~~l~~p~~~~~~~WL~~q~ 180 (319) T PRK06090 120 MNESASNALLKTLEEP---------APNCLFLLVTHNQK--RLLPTIVSRCQ-QWVVTPPSTDQAMQWLKGQG 180 (319) T ss_pred CCHHHHHHHHHHHCCC---------CCCEEEEEEECCHH--HCCCCHHHCCC-CCCCCCCCHHHHHHHHHHHC T ss_conf 4999999999984289---------98838998768512--08641876144-50289959999999998848 No 249 >cd03239 ABC_SMC_head The structural maintenance of chromosomes (SMC) proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and each has five distinct domains: amino- and carboxy-terminal globular domains, which contain sequences characteristic of ATPases, two coiled-coil regions separating the terminal domains , and a central flexible hinge. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair, and epigenetic silencing of gene expression. Probab=71.69 E-value=5.9 Score=19.28 Aligned_cols=33 Identities=24% Similarity=0.292 Sum_probs=24.8 Q ss_pred EEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHH Q ss_conf 7854100202355304774067999999999998 Q gi|254780799|r 447 IIADLARMPHLLIAGTTGSGKSVAINTMILSLLY 480 (806) Q Consensus 447 vv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLy 480 (806) .+.+..+ ++-.|-|..|||||-.+.+|...|=. T Consensus 16 ~~~~~~~-~~~~ivG~nGsGKSni~~ai~~~~g~ 48 (178) T cd03239 16 TVVGGSN-SFNAIVGPNGSGKSNIVDAICFVLGG 48 (178) T ss_pred EEECCCC-CEEEEECCCCCCHHHHHHHHHHHHCC T ss_conf 8834899-81799899988778999999999866 No 250 >cd03250 ABCC_MRP_domain1 Domain 1 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminas, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resisting lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate. Probab=71.33 E-value=5.3 Score=19.63 Aligned_cols=147 Identities=14% Similarity=0.168 Sum_probs=70.5 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCH------HHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHH Q ss_conf 541002023553047740679999999999982995------78478885231001110277034312233430456689 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTP------AQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTV 522 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P------~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~a 522 (806) .++.+==..-|-|.+|||||--++.|.- +|+.+- ..+-++.=+|.....|+++.|- +-.| . +..+.-.+ T Consensus 26 l~i~~Ge~~~IvG~sGsGKSTLl~~i~G--~~~~~~G~I~~~g~i~~v~Q~~~l~~~Tv~eNi~-~~~~-~-~~~~~~~~ 100 (204) T cd03250 26 LEVPKGELVAIVGPVGSGKSSLLSALLG--ELEKLSGSVSVPGSIAYVSQEPWIQNGTIRENIL-FGKP-F-DEERYEKV 100 (204) T ss_pred EEECCCCEEEEECCCCCCHHHHHHHHCC--CCCCCCCCCCCCCCEEEEECCCCCCCCCHHHHHH-CCCC-C-CHHHHHHH T ss_conf 9976998999999999858999999818--9525689522589889995877567750999974-4686-6-88999999 Q ss_pred HHHHHHHHHHHHHHHHH-----C--CCCCHHHHH-HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 99999999999999987-----0--899689999-999988744786677544677654543222332232346986877 Q gi|254780799|r 523 LKWLVCEMEERYQKMSK-----I--GVRNIDGFN-LKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVV 594 (806) Q Consensus 523 L~w~V~EMe~RY~l~a~-----~--~vRni~~yN-~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivv 594 (806) ++-|. ++.-.+.|.. . +-.|+.|=- +|+.-|++- -.=|- ++ T Consensus 101 ~~~~~--l~~~~~~~~~~~~~~i~~~~~~LSgGQkQRvalARal----------------------------~~~~~-il 149 (204) T cd03250 101 IKACA--LEPDLEILPDGDLTEIGEKGINLSGGQKQRISLARAV----------------------------YSDAD-IY 149 (204) T ss_pred HHHHC--CHHHHHHHHCCCCCEECCCCCCCCHHHHHHHHHHHHH----------------------------HCCCC-EE T ss_conf 99976--1988985135352452378875799999999999999----------------------------53999-99 Q ss_pred EHHHHHHHHHHCCCHH--HHHHH-HHHHHHHHCCEEEEEEECCCC Q ss_conf 6344688887321005--88999-999866414237999965777 Q gi|254780799|r 595 VIDEMADLMMVARKDI--ESAVQ-RLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 595 iiDElaDlmm~~~~~v--e~~i~-rlaq~ara~GiHli~aTqrPs 636 (806) +.||= +++=|. +..|. ++-.+-...|--.|+.|.|++ T Consensus 150 llDEP-----ts~LD~~~~~~i~~~l~~~~~~~~~Tvi~itH~~~ 189 (204) T cd03250 150 LLDDP-----LSAVDAHVGRHIFENCILGLLLNNKTRILVTHQLQ 189 (204) T ss_pred EEECC-----CCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCHH T ss_conf 99698-----76669999999999999998608989999807999 No 251 >PRK05826 pyruvate kinase; Provisional Probab=71.28 E-value=2.4 Score=22.19 Aligned_cols=233 Identities=22% Similarity=0.268 Sum_probs=128.3 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCC---CCCCCCEE Q ss_conf 256654578999999742486328998410442444443214786399999788999988630012100---03886125 Q gi|254780799|r 328 SPKVMQNNACTLKSVLSDFGIQGEIVNVRPGPVITLYELEPAPGIKSSRIIGLSDDIARSMSAISARVA---VIPRRNAI 404 (806) Q Consensus 328 s~eeL~~nA~lLE~tL~dFGVe~~Vv~v~pGPVVTrYEi~PApGVKvSKI~nLadDIA~aLsa~svRIa---pIPGK~~V 404 (806) +.++.++.-+.+.+.-+++|...-+--=.+||-+=.-++.. +.+++.+ -..+++. .+.| T Consensus 41 ~~e~~~~~i~~iR~~~~~~~~~i~Il~Dl~GpkIRtg~~~~-~~i~l~~-------------G~~v~i~~~~~~~g---- 102 (461) T PRK05826 41 SHEEHGKRAALVREIAAKLGRPVAILLDLKGPKIRVGKFKD-GKVELKI-------------GDKFTLDTDQKEEG---- 102 (461) T ss_pred CHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEEECCC-CCEEEEC-------------CCEEEEEECCCCCC---- T ss_conf 99999999999999999739970699988898368986479-8389705-------------99899982787789---- Q ss_pred EEEECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCCC------CCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHH Q ss_conf 553024566238734230721343022100021045------66666678541002023553047740679999999999 Q gi|254780799|r 405 GIELPNDIRETVMLRDLIVSRVFEKNQCDLAINLGK------SIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSL 478 (806) Q Consensus 405 GIEiPN~~r~~V~lreil~s~~f~~s~~~L~iaLGK------dI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~Sl 478 (806) |.++=.|...++++. -+....+-+.=|+ ++.++-+.+ ..+..|.=+|+|+|- T Consensus 103 -----~~~~i~v~~~~l~~~---v~~Gd~I~idDG~i~l~V~~~~~~~i~~------~V~~~G~L~s~Kgvn-------- 160 (461) T PRK05826 103 -----DQERVGLDYKDLPKD---VKPGDILLLDDGKLQLKVVEVDGDEVET------EVLNGGPLSNNKGIN-------- 160 (461) T ss_pred -----CCCEEEECCHHHHHH---CCCCCEEEEECCEEEEEEEECCCCEEEE------EECCCEEEECCCCCC-------- T ss_conf -----768898430786865---7889989982785999999616986999------974680984687754-------- Q ss_pred HHHCCHHHEEEEEECCCHHHHHHCCCCH--HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH-H Q ss_conf 9829957847888523100111027703--431223343045668999999999999999987089968999999998-8 Q gi|254780799|r 479 LYRMTPAQCRLIMIDPKMLELSVYDGIP--NLLTPVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQ-Y 555 (806) Q Consensus 479 Lyk~~P~evkliliDPK~vEls~Y~~iP--HLl~pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~-~ 555 (806) +| ++-.|..|+-. ...|+|+++ .--+..+-.=||+-+.-.+ +++ . T Consensus 161 --------------------------lp~~~~~lp~lTekD--~~di~fa~~---~~vD~ialSFVrsa~DV~~-lr~~l 208 (461) T PRK05826 161 --------------------------IPGGGLSLPALTEKD--KADIKFAAE---QGVDYIAVSFVRNAEDVEE-ARRLL 208 (461) T ss_pred --------------------------CCCCCCCCCCCCHHH--HHHHHHHHH---CCCCEEEECCCCCHHHHHH-HHHHH T ss_conf --------------------------234435767798668--999999997---6999997468898678999-99999 Q ss_pred HHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCH--HHHH-------HHHHHHHHHHCCE Q ss_conf 744786677544677654543222332232346986877634468888732100--5889-------9999986641423 Q gi|254780799|r 556 HNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKD--IESA-------VQRLAQMARASGI 626 (806) Q Consensus 556 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~--ve~~-------i~rlaq~ara~Gi 626 (806) .+.|.+-.. +. +..| ....++.++-|+ +.+|=.|+|.-| +|-+ =-++-+++|++|. T Consensus 209 ~~~~~~~~~-II---------aKIE-~~~al~NldeIi----~~sDgIMIARGDLgvEi~~e~vp~~Qk~Ii~~c~~~gk 273 (461) T PRK05826 209 REAGNCDAK-II---------AKIE-RAEAVDNLDEII----EASDGIMVARGDLGVEIPDAEVPGLQKKIIRKAREAGK 273 (461) T ss_pred HHCCCCCCE-EE---------EEEC-CHHHHHHHHHHH----HHCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCC T ss_conf 972999844-89---------9836-889987399999----86589999778530326876749999999998875499 Q ss_pred EEEEEECCCCCCCCCHHHHHCCCCEE Q ss_conf 79999657775355435541102515 Q gi|254780799|r 627 HVIMATQRPSVDVITGTIKANFPTRI 652 (806) Q Consensus 627 Hli~aTqrPsvdvitg~ikan~p~ri 652 (806) -.|+||| +.--+|....|||- T Consensus 274 pvivATq-----mLeSM~~~p~PTRA 294 (461) T PRK05826 274 PVITATQ-----MLESMIENPRPTRA 294 (461) T ss_pred EEEEECC-----CHHHHCCCCCCCHH T ss_conf 8999765-----68976079999727 No 252 >PRK13645 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=71.27 E-value=3.2 Score=21.27 Aligned_cols=99 Identities=13% Similarity=0.171 Sum_probs=49.3 Q ss_pred CEEEEEHHHHH-HHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCC Q ss_conf 86877634468-88873210058899999986641423799996577753554355411025158764586642123388 Q gi|254780799|r 590 PYIVVVIDEMA-DLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGE 668 (806) Q Consensus 590 p~ivviiDEla-Dlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~ 668 (806) |-| +|-||=- -|=....+++-.. |.+..|.-|+-.|+.|..-.. =+++-.||.. --+-|+|. + T Consensus 169 P~i-LilDEPTagLDp~~~~~i~~l---l~~L~~~~g~Tvi~vtHdm~~-------v~~~aDrviV----m~~G~iv~-~ 232 (289) T PRK13645 169 GNT-LVLDEPTGGLDPKGEEDFINL---FERLNKEYKKRIIMVTHNMDQ-------VLRIADEVIV----MHEGKVIS-I 232 (289) T ss_pred CCE-EEEECCCCCCCHHHHHHHHHH---HHHHHHHCCCEEEEECCCHHH-------HHHHCCEEEE----EECCEEEE-E T ss_conf 999-999588764898999999999---999999569999999159999-------9997999999----98998999-8 Q ss_pred CCHHHHCCCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHCCCC Q ss_conf 64578658875477368983258883348988999999999712897 Q gi|254780799|r 669 QGAEQLLGQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKTQGEA 715 (806) Q Consensus 669 ~gae~Llg~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q~~~ 715 (806) +-.+.++.+-|+|- .+-+.-..+.++...+|.+|-+ T Consensus 233 G~p~evf~~~~~l~-----------~~~l~~P~~~~l~~~L~~~g~~ 268 (289) T PRK13645 233 GSPFEIFSNQELLT-----------KIEIDPPKLYQLMYKLKNKGID 268 (289) T ss_pred CCHHHHHCCHHHHH-----------HCCCCCCHHHHHHHHHHHCCCC T ss_conf 78899867999999-----------7799998599999999976998 No 253 >cd03254 ABCC_Glucan_exporter_like Glucan exporter ATP-binding protein. In A. tumefaciens cyclic beta-1, 2-glucan must be transported into the periplasmic space to exert its action as a virluence factor. This subfamily belongs to the MRP-like family and is involved in drug, peptide, and lipid export. The MRP-like family, similar to all ABC proteins, have a common four-domain core structure constituted by two membrane-spanning domains each composed of six transmembrane (TM) helices and two nucleotide-binding domains (NBD). ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=71.18 E-value=3.7 Score=20.83 Aligned_cols=153 Identities=22% Similarity=0.355 Sum_probs=71.6 Q ss_pred CCEEEEEC----CCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHH---------------------EEEEEECCCHHH Q ss_conf 66678541----00202355304774067999999999998299578---------------------478885231001 Q gi|254780799|r 444 GKPIIADL----ARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQ---------------------CRLIMIDPKMLE 498 (806) Q Consensus 444 G~pvv~DL----akMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~e---------------------vkliliDPK~vE 498 (806) ++|+.-|+ .+==.+-|-|.+|||||-.++.|.- || .|++ +-++-=||.... T Consensus 15 ~~~vL~~inl~i~~Ge~vaivG~sGsGKSTLl~ll~g--l~--~p~~G~I~idg~~i~~~~~~~~r~~i~~v~Q~~~lf~ 90 (229) T cd03254 15 KKPVLKDINFSIKPGETVAIVGPTGAGKTTLINLLMR--FY--DPQKGQILIDGIDIRDISRKSLRSMIGVVLQDTFLFS 90 (229) T ss_pred CCEEEECEEEEECCCCEEEEECCCCCHHHHHHHHHHC--CC--CCCCCEEEECCEECCCCCHHHHHHCEEEECCCCEECC T ss_conf 9908746299987999999999999809999999966--86--6787389999999541899999632899903898757 Q ss_pred HHHCCCCHHHHCCCCCCHHHHHHHHHHH-----HHHHHHHHHHHHHCCCCCHHHH-HHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 1102770343122334304566899999-----9999999999987089968999-999998874478667754467765 Q gi|254780799|r 499 LSVYDGIPNLLTPVVTNPQKAVTVLKWL-----VCEMEERYQKMSKIGVRNIDGF-NLKVAQYHNTGKKFNRTVQTGFDR 572 (806) Q Consensus 499 ls~Y~~iPHLl~pVvTd~~kA~~aL~w~-----V~EMe~RY~l~a~~~vRni~~y-N~k~~~~~~~~~~~~~~~~~~~~~ 572 (806) -|+++.|- +-.|-++| .+...+|+.+ ++.+...|+..-..+-.|+.+= -+|+.-|++- T Consensus 91 ~Ti~~Ni~-~~~~~~~~-~~~~~~l~~~~l~~~i~~l~~gl~t~i~~~g~~LSgGq~Qri~lARal-------------- 154 (229) T cd03254 91 GTIMENIR-LGRPNATD-EEVIEAAKEAGAHDFIMKLPNGYDTVLGENGGNLSQGERQLLAIARAM-------------- 154 (229) T ss_pred CCHHHHHH-CCCCCCCH-HHHHHHHHHHCCHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHH-------------- T ss_conf 45999840-58999999-999999998400458875846457861578644999999999999999-------------- Q ss_pred CCCCCCCCCCCCCCCCCCEEEEEHHHHH-HHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 4543222332232346986877634468-888732100588999999866414237999965777 Q gi|254780799|r 573 KTGEAIYETEHFDFQHMPYIVVVIDEMA-DLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 573 ~~~~~~~~~~~~~~~~lp~ivviiDEla-Dlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) -.=|- ++|.||-. -|=....+.|.+.|.++. -+--+|+.|+|++ T Consensus 155 --------------~~~~~-illlDEpts~LD~~t~~~i~~~l~~~~-----~~~TvI~itH~~~ 199 (229) T cd03254 155 --------------LRDPK-ILILDEATSNIDTETEKLIQEALEKLM-----KGRTSIIIAHRLS 199 (229) T ss_pred --------------HCCCC-EEEEECCCCCCCHHHHHHHHHHHHHHC-----CCCEEEEEECCHH T ss_conf --------------51899-899989777899899999999999980-----9988999926988 No 254 >PRK05480 uridine kinase; Provisional Probab=71.18 E-value=3.9 Score=20.67 Aligned_cols=33 Identities=33% Similarity=0.422 Sum_probs=22.1 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEEC Q ss_conf 3553047740679999999999982995784788852 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMID 493 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliD 493 (806) +.|||..|||||-.-+.+.-.| ..+.+-.|=.| T Consensus 9 IgIaG~SgSGKTT~a~~L~~~l----~~~~v~vi~~D 41 (209) T PRK05480 9 IGIAGGSGSGKTTVASTIYEEL----GDESIAVISQD 41 (209) T ss_pred EEEECCCCCCHHHHHHHHHHHC----CCCCEEEEECC T ss_conf 9998999778999999999980----86875999554 No 255 >cd03285 ABC_MSH2_euk MutS2 homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding c Probab=71.01 E-value=7.6 Score=18.46 Aligned_cols=72 Identities=21% Similarity=0.245 Sum_probs=47.2 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 23553047740679999999999982995784788852310011102770343122334304566899999999999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQ 535 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~ 535 (806) =++|-|.-.||||+++-++-+..++-..- ..|--+..++++|+.| ++ -+-+.......+..--.||.+==+ T Consensus 32 i~iiTGpN~sGKSt~lk~i~l~~~laq~G-----~~vpa~~~~~~~~d~i---~t-~i~~~d~~~~~~StF~~e~~~~~~ 102 (222) T cd03285 32 FLIITGPNMGGKSTYIRQIGVIVLMAQIG-----CFVPCDSADIPIVDCI---LA-RVGASDSQLKGVSTFMAEMLETAA 102 (222) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHHC-----CCEEEEEEEEECCCEE---EE-EECCCCCCCCCHHHHHHHHHHHHH T ss_conf 99998999887189999999999999868-----7546323899527649---99-988997100335289999999999 Q ss_pred H Q ss_conf 9 Q gi|254780799|r 536 K 536 (806) Q Consensus 536 l 536 (806) . T Consensus 103 i 103 (222) T cd03285 103 I 103 (222) T ss_pred H T ss_conf 9 No 256 >PRK06217 hypothetical protein; Validated Probab=70.90 E-value=2.3 Score=22.38 Aligned_cols=21 Identities=29% Similarity=0.494 Sum_probs=17.3 Q ss_pred CCEEEEEECCCCHHHHHHHHH Q ss_conf 202355304774067999999 Q gi|254780799|r 454 MPHLLIAGTTGSGKSVAINTM 474 (806) Q Consensus 454 MPHLLIAGtTGSGKSV~iN~i 474 (806) |=+.+|-|+.|||||-.=..+ T Consensus 1 m~rI~i~G~sGsGkSTla~~L 21 (185) T PRK06217 1 MMRIHITGASGSGTTTLGAAL 21 (185) T ss_pred CCEEEEECCCCCCHHHHHHHH T ss_conf 967999789988789999999 No 257 >cd01855 YqeH YqeH. YqeH is an essential GTP-binding protein. Depletion of YqeH induces an excess initiation of DNA replication, suggesting that it negatively controls initiation of chromosome replication. The YqeH subfamily is common in eukaryotes and sporadically present in bacteria with probable acquisition by plants from chloroplasts. Proteins of the YqeH family contain all sequence motifs typical of the vast class of P-loop-containing GTPases, but show a circular permutation, with a G4-G1-G3 pattern of motifs as opposed to the regular G1-G3-G4 pattern seen in most GTPases. Probab=70.83 E-value=3 Score=21.48 Aligned_cols=27 Identities=26% Similarity=0.342 Sum_probs=22.2 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHC Q ss_conf 235530477406799999999999829 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRM 482 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~ 482 (806) +..+-|.|.+|||--||+++-+.-++. T Consensus 129 ~vyvvG~~NvGKSTLiN~Ll~~~~~~~ 155 (190) T cd01855 129 DVYVVGATNVGKSTLINALLKKDNGKK 155 (190) T ss_pred CEEEECCCCCCHHHHHHHHHHHCCCCC T ss_conf 579980587546799999863023443 No 258 >PRK13634 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=70.77 E-value=3.3 Score=21.14 Aligned_cols=93 Identities=22% Similarity=0.258 Sum_probs=46.8 Q ss_pred EEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCH Q ss_conf 776344-6888873210058899999986641423799996577753554355411025158764586642123388645 Q gi|254780799|r 593 VVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGA 671 (806) Q Consensus 593 vviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~ga 671 (806) |++-|| .+-|=....+++-+.|.+| .+.-|+-.|+.|..-.. =+.+-.||.+ --+-|.|- ++-. T Consensus 153 iLllDEPTs~LD~~~~~~i~~ll~~L---~~e~g~Tii~vTHdl~~-------~~~~aDrviv----m~~G~Iv~-~G~p 217 (276) T PRK13634 153 VLVLDEPTAGLDPKGRKEIMEMFYKL---HKEKGLTTVLVTHSMED-------AARYADQIVV----MHKGTVFL-QGTP 217 (276) T ss_pred EEEECCCCCCCCHHHHHHHHHHHHHH---HHHCCCEEEEECCCHHH-------HHHHCCEEEE----EECCEEEE-ECCH T ss_conf 89976985427999999999999999---99619999998679999-------9997999999----98999999-8789 Q ss_pred HHHCCCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHH Q ss_conf 7865887547736898325888334898899999999971 Q gi|254780799|r 672 EQLLGQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKT 711 (806) Q Consensus 672 e~Llg~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~ 711 (806) +.++.+-|.|.. +-+.-.++-++...+++ T Consensus 218 ~evf~~p~~l~~-----------~~l~~P~~~~l~~~L~~ 246 (276) T PRK13634 218 REIFSHPDELEA-----------IGLDLPETVKFKRALEE 246 (276) T ss_pred HHHHCCHHHHHH-----------CCCCCCHHHHHHHHHHH T ss_conf 999729999997-----------79999969999999989 No 259 >PRK13632 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=70.54 E-value=3.7 Score=20.83 Aligned_cols=76 Identities=14% Similarity=0.242 Sum_probs=39.9 Q ss_pred CEEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCC Q ss_conf 8687763446-888873210058899999986641423799996577753554355411025158764586642123388 Q gi|254780799|r 590 PYIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGE 668 (806) Q Consensus 590 p~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~ 668 (806) |-| +|-||= +.|=-.+.+++...|.+|.+ .-|.-.|+.|.+-. .+ ++ --||.+-- |-|+ +.+ T Consensus 162 P~i-liLDEPTs~LD~~~~~~l~~~l~~l~~---~~g~TvI~iTHd~~--~~-----~~-aDrv~vm~----~G~i-v~~ 224 (273) T PRK13632 162 PEI-IIFDESTSMLDPKGKREIKKIMVDLRK---DRKKTLISITHDMD--EA-----IL-ADKVIVFS----NGKL-IAQ 224 (273) T ss_pred CCE-EEEECCCCCCCHHHHHHHHHHHHHHHH---HCCCEEEEEEECHH--HH-----HC-CCEEEEEE----CCEE-EEE T ss_conf 999-998077556998999999999999998---46989999942888--99-----71-99999998----9999-997 Q ss_pred CCHHHHCCCCCEEE Q ss_conf 64578658875477 Q gi|254780799|r 669 QGAEQLLGQGDMLY 682 (806) Q Consensus 669 ~gae~Llg~gdml~ 682 (806) +-.+.++.+-+-|. T Consensus 225 G~p~el~~~~e~l~ 238 (273) T PRK13632 225 GKPKEILNDKEILE 238 (273) T ss_pred CCHHHHHCCHHHHH T ss_conf 69999877999999 No 260 >pfam05976 consensus Probab=70.38 E-value=7.9 Score=18.37 Aligned_cols=26 Identities=4% Similarity=-0.091 Sum_probs=12.4 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 43112789999999999999999999 Q gi|254780799|r 15 NFLLSDWSKKKMKIVAGLILLCTVFA 40 (806) Q Consensus 15 ~~~l~~f~~rrl~Ei~Gl~Li~~al~ 40 (806) +|...+....-+|-.+++.+.++..+ T Consensus 2 r~~~~~~~~~~LR~~ial~~~~~~~~ 27 (706) T pfam05976 2 RLWANAKVSYGIRVFIALNGATLLCW 27 (706) T ss_pred CEECCHHHHHHHHHHHHHHHHHHHHH T ss_conf 23104788999999999999999999 No 261 >PRK02983 lysS lysyl-tRNA synthetase; Provisional Probab=70.36 E-value=7.9 Score=18.36 Aligned_cols=48 Identities=15% Similarity=0.017 Sum_probs=27.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999998276523678999999999999999999 Q gi|254780799|r 81 FFGIASVFFLPPPTMWALSLLFDKKIYCFSKRATAWLINILVSATFFA 128 (806) Q Consensus 81 lFG~~Ayllpl~Ll~~g~~ll~~k~~~~~~~Rl~~~il~ll~~s~ll~ 128 (806) .+|....+..+.+++|+.+-|.-+......++.+..++..+..+.++. T Consensus 116 ~~~~~~~~~~~~~l~~~r~~F~a~~~~~~~~~A~~v~~~g~~~~~~~g 163 (1099) T PRK02983 116 NLGLAVHVAAIVLLVLARREFWARVRRGALFKAAAVLVAGLAVGILLG 163 (1099) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 788999999999999617645467788729999999999999999998 No 262 >TIGR01631 Trypano_RHS trypanosome RHS family; InterPro: IPR006518 These sequences are full-length and part-length members of the RHS (retrotransposon hot spot) family in Trypanosoma brucei and Trypanosoma cruzi. Members of this family are frequently interrupted by non-LTR retrotransposons inserted at exactly the same relative position. . Probab=70.31 E-value=4.8 Score=20.00 Aligned_cols=30 Identities=27% Similarity=0.524 Sum_probs=11.3 Q ss_pred ECCCCHHHHHHHHHHHHHHHHCCHHHEEEE Q ss_conf 047740679999999999982995784788 Q gi|254780799|r 461 GTTGSGKSVAINTMILSLLYRMTPAQCRLI 490 (806) Q Consensus 461 GtTGSGKSV~iN~iI~SlLyk~~P~evkli 490 (806) ||=|=|||-++-+.||-=|..+.-++++.| T Consensus 314 GTPGIGKS~~vGSfLLy~LLHyd~e~L~~v 343 (814) T TIGR01631 314 GTPGIGKSFGVGSFLLYKLLHYDAEKLQVV 343 (814) T ss_pred ECCCCCHHHHHHHHHHHHHHHCHHCCCCEE T ss_conf 178852556689999755421100037558 No 263 >PRK13407 bchI magnesium chelatase subunit I; Provisional Probab=70.22 E-value=3.8 Score=20.76 Aligned_cols=47 Identities=15% Similarity=0.217 Sum_probs=32.2 Q ss_pred CCCHHHHHHHHHHHHCCCEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCCHHH Q ss_conf 7770389999999965985000142220011778999999999779868022 Q gi|254780799|r 738 VADDLYKQAVDIVLRDNKASISYIQRRLGIGYNRAASIIENMEEKGVIGPAS 789 (806) Q Consensus 738 ~~d~l~~~a~~~v~~~~~~s~s~lqr~~~igy~raar~~~~~e~~giv~~~~ 789 (806) -.|+....++++.+..+..|. |=-|.--|+||-+=-++-+-.|.+.+ T Consensus 252 ~~d~~~~~~~~~~~~~~~~g~-----Ra~i~l~r~ARa~AaL~Gr~~V~~~d 298 (334) T PRK13407 252 TPDTVLHDCAALCIALGSDGL-----RGELTLLRAARAQAAFEGAETVGRSH 298 (334) T ss_pred CCHHHHHHHHHHHHHHCCCCC-----HHHHHHHHHHHHHHHHCCCCCCCHHH T ss_conf 999999999999998589871-----09999999999999974999789999 No 264 >PRK07993 DNA polymerase III subunit delta'; Validated Probab=70.12 E-value=4.7 Score=20.06 Aligned_cols=145 Identities=22% Similarity=0.276 Sum_probs=87.5 Q ss_pred CCCCCE-EEEEECCCCHHHHHHHHHHHHHHHHCCH--------HHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHH Q ss_conf 100202-3553047740679999999999982995--------7847888523100111027703431223343045668 Q gi|254780799|r 451 LARMPH-LLIAGTTGSGKSVAINTMILSLLYRMTP--------AQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVT 521 (806) Q Consensus 451 LakMPH-LLIAGtTGSGKSV~iN~iI~SlLyk~~P--------~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~ 521 (806) -.++|| +|+.|..|.||+.-...+.-.||-.... ..||++ +=...=|+ |.+.| .|+. T Consensus 20 ~~rl~HA~L~~G~~G~Gk~~la~~~a~~llC~~~~~~~~Cg~C~~C~l~-------~~~~HPD~-~~i~p-----e~~~- 85 (334) T PRK07993 20 AGRGHHALLIQALPGMGDDALIYALSRYLLCQQPQGHKSCGHCRGCQLM-------QAGTHPDY-YTLTP-----EKGK- 85 (334) T ss_pred CCCCCEEEEEECCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHH-------HCCCCCCE-EEECC-----CCCC- T ss_conf 5981046754799998899999999999818999999999999789998-------66899984-77534-----2234- Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHH Q ss_conf 99999999999999998708996899999999887447866775446776545432223322323469868776344688 Q gi|254780799|r 522 VLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMAD 601 (806) Q Consensus 522 aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaD 601 (806) ...+|--|-..++++.. . + .-=.|=|||||+ || T Consensus 86 ----------------~~I~IdqIR~l~~~~~~---~-------------~--------------~~g~~kV~iI~~-Ae 118 (334) T PRK07993 86 ----------------SSLGVDAVREVTEKLYE---H-------------A--------------RLGGAKVVWLPD-AA 118 (334) T ss_pred ----------------CCCCHHHHHHHHHHHHH---C-------------C--------------CCCCCEEEEECC-HH T ss_conf ----------------55999999999999843---6-------------6--------------569947999766-77 Q ss_pred HHHHCC-----CHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCC Q ss_conf 887321-----0058899999986641423799996577753554355411025158764586642123388 Q gi|254780799|r 602 LMMVAR-----KDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGE 668 (806) Q Consensus 602 lmm~~~-----~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~ 668 (806) -|-.+. |-.|++ ..+-++||-|.+|+ -+-.+|.+-- -++.|..-+.-+...-|-+ T Consensus 119 ~mn~~AaNaLLKtLEEP---------p~~t~~iL~t~~~~--~lLpTI~SRC-q~~~~~~~~~~~~~~wL~~ 178 (334) T PRK07993 119 LLTDAAANALLKTLEEP---------PEKTWFFLACREPA--RLLATLRSRC-RLHYLAPPPEQYALTWLSR 178 (334) T ss_pred HHCHHHHHHHHHHHCCC---------CCCEEEEEECCCHH--HCCCHHHHHC-CCCCCCCCCHHHHHHHHHH T ss_conf 75999999999861279---------98849998669856--5723887523-0415899799999999987 No 265 >PTZ00066 pyruvate kinase; Provisional Probab=70.06 E-value=2.1 Score=22.60 Aligned_cols=116 Identities=23% Similarity=0.307 Sum_probs=68.4 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH-HHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 233430456689999999999999999870899689999999988-7447866775446776545432223322323469 Q gi|254780799|r 511 PVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQY-HNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHM 589 (806) Q Consensus 511 pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 589 (806) |..|+-.+. ..|+|+++ .--+..|-.-||+=+.-.+ +++. ..+|... ++.. ..| ....++.| T Consensus 205 p~lTekD~~-dil~fa~~---~~vD~IalSFVrs~~DV~~-~r~~l~~~g~~~--~IIa---------KIE-~~~av~Nl 267 (513) T PTZ00066 205 PVIGEKDKN-DILNFAIP---MNCDFIALSFIQSADDVRL-CRNLLGERGKHI--KIIP---------KIE-NIEGLINF 267 (513) T ss_pred CCCCCCCHH-HHHHHHHH---CCCCEEEECCCCCHHHHHH-HHHHHHHCCCCC--EEEE---------EEC-CHHHHHCH T ss_conf 547636889-99999987---5999999867799899999-999999759864--5899---------843-76556589 Q ss_pred CEEEEEHHHHHHHHHHCCCH--HHHHH-------HHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEE Q ss_conf 86877634468888732100--58899-------99998664142379999657775355435541102515 Q gi|254780799|r 590 PYIVVVIDEMADLMMVARKD--IESAV-------QRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRI 652 (806) Q Consensus 590 p~ivviiDElaDlmm~~~~~--ve~~i-------~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~ri 652 (806) .-|+ +.+|-.|+|.-| ||-++ -++-+++|.+|.-.|+||| ..--+|....|||- T Consensus 268 deIi----~~sDgIMVARGDLgvEip~e~vp~~QK~II~~c~~~gKPVIvATq-----mLeSMi~np~PTRA 330 (513) T PTZ00066 268 DKIL----AESDGIMVARGDLGMEIPPEKVFLAQKLMISKCNLQGKPIITATQ-----MLESMIKNPRPTRA 330 (513) T ss_pred HHHH----HHCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECH-----HHHHHCCCCCCCHH T ss_conf 9999----858989995685424269888689999999999975996999742-----57774049998716 No 266 >cd02028 UMPK_like Uridine monophosphate kinase_like (UMPK_like) is a family of proteins highly similar to the uridine monophosphate kinase (UMPK, EC 2.7.1.48), also known as uridine kinase or uridine-cytidine kinase (UCK). Probab=69.98 E-value=4 Score=20.55 Aligned_cols=23 Identities=35% Similarity=0.434 Sum_probs=18.8 Q ss_pred EEEEECCCCHHHHHHHHHHHHHH Q ss_conf 35530477406799999999999 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLL 479 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlL 479 (806) .-|||-+|||||-.-|.|.--|- T Consensus 2 IgIaG~SgSGKTT~a~~L~~~l~ 24 (179) T cd02028 2 VGIAGPSGSGKTTFAKKLSNQLR 24 (179) T ss_pred EEEECCCCCCHHHHHHHHHHHHH T ss_conf 89989897789999999999984 No 267 >PTZ00300 pyruvate kinase; Provisional Probab=69.98 E-value=2.4 Score=22.19 Aligned_cols=238 Identities=20% Similarity=0.193 Sum_probs=122.8 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCC--CCCCCCEEE Q ss_conf 256654578999999742486328998410442444443214786399999788999988630012100--038861255 Q gi|254780799|r 328 SPKVMQNNACTLKSVLSDFGIQGEIVNVRPGPVITLYELEPAPGIKSSRIIGLSDDIARSMSAISARVA--VIPRRNAIG 405 (806) Q Consensus 328 s~eeL~~nA~lLE~tL~dFGVe~~Vv~v~pGPVVTrYEi~PApGVKvSKI~nLadDIA~aLsa~svRIa--pIPGK~~VG 405 (806) +.++.++.-+.+.+.=++.|..+-|--=.+||-+=.=++... .++..+ -..+.+. +..+ T Consensus 12 ~~e~h~~~i~~iR~~~~~~~~~v~Il~Dl~GPkIRtg~~~~~-~~~l~~-------------G~~v~l~~~~~~~----- 72 (454) T PTZ00300 12 SHEYHQTTINNVRQAAAELGVNIAIALDTKGPEIRTGLFVGG-EAVMER-------------GATCYVTTDPAFA----- 72 (454) T ss_pred CHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCEEEEEECCCC-CEEECC-------------CCEEEEECCCCCC----- T ss_conf 999999999999999998399859999688983388554799-679658-------------9999997776335----- Q ss_pred EEECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCCC------CCCCC-EEEEECCCCCEEEEEECCCCHHHHHHHHHHHHH Q ss_conf 53024566238734230721343022100021045------66666-678541002023553047740679999999999 Q gi|254780799|r 406 IELPNDIRETVMLRDLIVSRVFEKNQCDLAINLGK------SIEGK-PIIADLARMPHLLIAGTTGSGKSVAINTMILSL 478 (806) Q Consensus 406 IEiPN~~r~~V~lreil~s~~f~~s~~~L~iaLGK------dI~G~-pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~Sl 478 (806) +..+.++-.|.+.++.+. -+....+-+.=|+ .+.++ .+. --.+-+|..+|.|.|-+ T Consensus 73 -~~g~~~~i~v~y~~l~~~---v~~Gd~IlidDG~i~l~V~~~~~~~~v~------~~V~~gG~L~s~Kgvnl------- 135 (454) T PTZ00300 73 -DKGTKDKFYIDYQNLSKV---VRPGGYIYIDDGILILHVQSHEDEQTLK------CTVTNAHTISDRRGVNL------- 135 (454) T ss_pred -CCCCCCEEECCHHHHHHH---CCCCCEEEECCCEEEEEEEEECCCCEEE------EEEEECCEECCCCCEEC------- T ss_conf -688766783376898976---5899989982996999999972897699------99965869638971657------- Q ss_pred HHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC Q ss_conf 98299578478885231001110277034312233430456689999999999999999870899689999999988744 Q gi|254780799|r 479 LYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNT 558 (806) Q Consensus 479 Lyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~ 558 (806) | |. .+-.|.+|+-.+ ..|+|++ +.--+..|-.=||+=+.-.+--+-..++ T Consensus 136 -----P-------------------~~-~l~lp~lTekD~--~di~fa~---~~~vD~IalSFVr~a~DV~~~r~~l~~~ 185 (454) T PTZ00300 136 -----P-------------------GC-DVDLPAVSAKDC--ADLQFGV---EQGVDMIFASFIRSAEQVGEVRKALGAK 185 (454) T ss_pred -----C-------------------CC-CCCCCCCCHHHH--HHHHHHH---HCCCCEEEEECCCCHHHHHHHHHHHHHH T ss_conf -----8-------------------99-788776998789--9999999---7499989990679889999999999973 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCH--HHHHH-------HHHHHHHHHCCEEEE Q ss_conf 786677544677654543222332232346986877634468888732100--58899-------999986641423799 Q gi|254780799|r 559 GKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKD--IESAV-------QRLAQMARASGIHVI 629 (806) Q Consensus 559 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~--ve~~i-------~rlaq~ara~GiHli 629 (806) |... ++. +..| ....++.|+-|+ |.+|=.|+|.-| ||-++ -++-++||.+|.-.| T Consensus 186 g~~~--~Ii---------aKIE-~~~av~NldeIi----~~sDgIMVARGDLgvEi~~e~vp~~Qk~Ii~~c~~~gkpvI 249 (454) T PTZ00300 186 GGDI--MII---------CKIE-NHQGVQNIDSII----EESDGIMVARGDLGVEIPAEKVVVAQKILISKCNVAGKPVI 249 (454) T ss_pred CCCC--EEE---------EEEC-CHHHHHHHHHHH----HHCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEE T ss_conf 8976--699---------8632-466666469899----85879999457430317988978999999999998399299 Q ss_pred EEECCCCCCCCCHHHHHCCCCEE Q ss_conf 99657775355435541102515 Q gi|254780799|r 630 MATQRPSVDVITGTIKANFPTRI 652 (806) Q Consensus 630 ~aTqrPsvdvitg~ikan~p~ri 652 (806) +||| ..--+|....|||- T Consensus 250 ~ATq-----mLeSMi~~p~PTRA 267 (454) T PTZ00300 250 CATQ-----MLESMTYNPRPTRA 267 (454) T ss_pred EECC-----HHHHHCCCCCCCHH T ss_conf 9732-----37764329998638 No 268 >PRK06893 DNA replication initiation factor; Validated Probab=69.89 E-value=6.5 Score=19.00 Aligned_cols=31 Identities=26% Similarity=0.323 Sum_probs=24.7 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHHHHHHCCH Q ss_conf 2023553047740679999999999982995 Q gi|254780799|r 454 MPHLLIAGTTGSGKSVAINTMILSLLYRMTP 484 (806) Q Consensus 454 MPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P 484 (806) =|++.|-|.+|||||=-++++.-..+-+.++ T Consensus 39 ~~~l~i~G~~gsGKTHLLqa~~~~~~~~~~~ 69 (229) T PRK06893 39 QPFFYIWGGKSSGKSHLLKAVSNHYLLNQRT 69 (229) T ss_pred CCEEEEECCCCCCHHHHHHHHHHHHHHHCCC T ss_conf 9879998999998899999999999971898 No 269 >cd03248 ABCC_TAP TAP, the Transporter Associated with Antigen Processing; TAP is essential for peptide delivery from the cytosol into the lumen of the endoplasmic reticulum (ER), where these peptides are loaded on major histocompatibility complex (MHC) I molecules. Loaded MHC I leave the ER and display their antigenic cargo on the cell surface to cytotoxic T cells. Subsequently, virus-infected or malignantly transformed cells can be eliminated. TAP belongs to the large family of ATP-binding cassette (ABC) transporters, which translocate a vast variety of solutes across membranes. Probab=69.80 E-value=3.9 Score=20.63 Aligned_cols=150 Identities=18% Similarity=0.230 Sum_probs=71.4 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHC---------------C----HHHEEEEEECCCHHHHHHCCCCHHHH Q ss_conf 5410020235530477406799999999999829---------------9----57847888523100111027703431 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLYRM---------------T----PAQCRLIMIDPKMLELSVYDGIPNLL 509 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~---------------~----P~evkliliDPK~vEls~Y~~iPHLl 509 (806) .++.+==.+-|-|.+|||||-.++.|. -+ |.. . -+.+-++-=||-...-++.+.|- +- T Consensus 35 ~~i~~Ge~vaIvG~sGsGKSTL~~ll~-gl-~~p~~G~I~idg~~i~~~~~~~lr~~i~~v~Q~~~lf~~ti~eNi~-~g 111 (226) T cd03248 35 FTLHPGEVTALVGPSGSGKSTVVALLE-NF-YQPQGGQVLLDGKPISQYEHKYLHSKVSLVGQEPVLFARSLQDNIA-YG 111 (226) T ss_pred EEECCCCEEEEECCCCCHHHHHHHHHH-CC-CCCCCCEEEECCEEHHHCCHHHHHHCEEEEECCCEECCCCHHHHHH-CC T ss_conf 998299999999999984999999996-45-4678878999999934489999973269992479576773566663-27 Q ss_pred CCCCCCHHHHHHHHH-----HHHHHHHHHHHHHHHCCCCCHHH-HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 223343045668999-----99999999999998708996899-999999887447866775446776545432223322 Q gi|254780799|r 510 TPVVTNPQKAVTVLK-----WLVCEMEERYQKMSKIGVRNIDG-FNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEH 583 (806) Q Consensus 510 ~pVvTd~~kA~~aL~-----w~V~EMe~RY~l~a~~~vRni~~-yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 583 (806) .|-.++. +...+++ -.+.+|...|+-.-..+-.|+.+ --+|+.-|++-- T Consensus 112 ~~~~~~~-~i~~~~~~~~~~~~i~~l~~gl~t~i~~~g~~LSgGqkQRialARal~------------------------ 166 (226) T cd03248 112 LQSCSFE-CVKEAAQKAHAHSFISELASGYDTEVGEKGSQLSGGQKQRVAIARALI------------------------ 166 (226) T ss_pred CCCCCHH-HHHHHHHHHCHHHHHHHCCCCCCCEECCCCCCCCHHHHHHHHHHHHHH------------------------ T ss_conf 8999999-999999996614677626366640616848876999999999999997------------------------ Q ss_pred CCCCCCCEEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 3234698687763446-8888732100588999999866414237999965777 Q gi|254780799|r 584 FDFQHMPYIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 584 ~~~~~lp~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) .=|- |+|.||- +-|=....+++.+.|.++.| +--+|+.|+|++ T Consensus 167 ----~~p~-ililDEptSaLD~~te~~i~~~l~~~~~-----~~Tvi~ItH~l~ 210 (226) T cd03248 167 ----RNPQ-VLILDEATSALDAESEQQVQQALYDWPE-----RRTVLVIAHRLS 210 (226) T ss_pred ----CCCC-EEEEECCCCCCCHHHHHHHHHHHHHHCC-----CCEEEEEECCHH T ss_conf ----5999-9999797668899999999999998669-----999999937999 No 270 >cd03246 ABCC_Protease_Secretion This family represents the ABC component of the protease secretion system PrtD, a 60-kDa integral membrane protein sharing 37% identity with HlyB, the ABC component of the alpha-hemolysin secretion pathway, in the C-terminal domain. They export degradative enzymes by using a type I protein secretion system and lack an N-terminal signal peptide, but contain a C-terminal secretion signal. The Type I secretion apparatus is made up of three components, an ABC transporter, a membrane fusion protein (MFP), and an outer membrane protein (OMP). For the HlyA transporter complex, HlyB (ABC transporter) and HlyD (MFP) reside in the inner membrane of E. coli. The OMP component is TolC, which is thought to interact with the MFP to form a continuous channel across the periplasm from the cytoplasm to the exterior. HlyB belongs to the family of ABC transporters, which are ubiquitous, ATP-dependent transmembrane pumps or channels. The spectrum of transport substra Probab=69.78 E-value=3.3 Score=21.22 Aligned_cols=40 Identities=28% Similarity=0.444 Sum_probs=25.8 Q ss_pred EEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 7763446-8888732100588999999866414237999965777 Q gi|254780799|r 593 VVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 593 vviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) |++-||- +-| -.+-|..|.+.-...+..|.-.|+.|.|++ T Consensus 117 ililDEpts~L----D~~~e~~i~~~l~~l~~~~~Tvi~vtH~~~ 157 (173) T cd03246 117 ILVLDEPNSHL----DVEGERALNQAIAALKAAGATRIVIAHRPE 157 (173) T ss_pred EEEEECCCCCC----CHHHHHHHHHHHHHHHHCCCEEEEECCCHH T ss_conf 99996876689----989999999999978648989999847999 No 271 >PRK11264 putative amino-acid ABC transporter ATP-binding protein YecC; Provisional Probab=69.68 E-value=5.6 Score=19.46 Aligned_cols=39 Identities=23% Similarity=0.372 Sum_probs=27.0 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEE Q ss_conf 54100202355304774067999999999998299578478 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRL 489 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkl 489 (806) .++.+==.+-|.|..|||||..+++| +.|++-+--++++ T Consensus 22 l~i~~Ge~~aliG~sGsGKSTLl~~l--~gl~~p~~G~i~~ 60 (248) T PRK11264 22 LEVKPGEVVAIIGPSGSGKTTLLRCI--NLLEQPEAGTIRV 60 (248) T ss_pred EEECCCCEEEEECCCCCCHHHHHHHH--HCCCCCCCCEEEE T ss_conf 79879989999999998099999999--7589999867999 No 272 >COG1855 ATPase (PilT family) [General function prediction only] Probab=69.67 E-value=3.4 Score=21.07 Aligned_cols=120 Identities=21% Similarity=0.239 Sum_probs=66.7 Q ss_pred CCCCCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEEECCCCCHHHHHHHHH Q ss_conf 35762036613533200124555632566545789999997424863289984104424444432147863999997889 Q gi|254780799|r 303 GTGTFVLPSKEILSTSQSPVNQMTFSPKVMQNNACTLKSVLSDFGIQGEIVNVRPGPVITLYELEPAPGIKSSRIIGLSD 382 (806) Q Consensus 303 ~~~~YkLPsLdLL~~~~~~~~~~~~s~eeL~~nA~lLE~tL~dFGVe~~Vv~v~pGPVVTrYEi~PApGVKvSKI~nLad 382 (806) ..++|++=+++ +...+.++|+..|+-+.+.=+.+. +.-+.==.+|-+|-+.- T Consensus 167 kpG~~k~v~l~----------d~pl~~~ele~ia~eIi~~a~~~~-~sfIEi~r~GatVvQlr----------------- 218 (604) T COG1855 167 KPGEWKLVRLS----------DKPLTREELEEIAREIIERAKRDP-DSFIEIDRPGATVVQLR----------------- 218 (604) T ss_pred CCCCEEEEECC----------CCCCCHHHHHHHHHHHHHHHHHCC-CCEEEEECCCCEEEEEC----------------- T ss_conf 99847999857----------766889999999999999875286-76489806996699843----------------- Q ss_pred HHHHHHHHCCCCC--CCCCCCCEEEEEECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEE Q ss_conf 9998863001210--00388612555302456623873423072134302210002104566666678541002023553 Q gi|254780799|r 383 DIARSMSAISARV--AVIPRRNAIGIELPNDIRETVMLRDLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPHLLIA 460 (806) Q Consensus 383 DIA~aLsa~svRI--apIPGK~~VGIEiPN~~r~~V~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIA 460 (806) ..|| |-=|= +=++||-- .|-.|.++ | +.|.-+. +|-=-|= .+---.||| T Consensus 219 ---------n~RIvIarPPf--Sd~~EITa-vRPvvk~~--l--edY~L~d-kl~eRL~------------eraeGILIA 269 (604) T COG1855 219 ---------NYRIVIARPPF--SDRWEITA-VRPVVKLS--L--EDYGLSD-KLKERLE------------ERAEGILIA 269 (604) T ss_pred ---------CEEEEEECCCC--CCCEEEEE-EEEEEEEE--H--HHCCCCH-HHHHHHH------------HHHCCEEEE T ss_conf ---------57999945998--77628999-71369960--5--5428798-9999988------------641646995 Q ss_pred ECCCCHHHHHHHHHHHHHH Q ss_conf 0477406799999999999 Q gi|254780799|r 461 GTTGSGKSVAINTMILSLL 479 (806) Q Consensus 461 GtTGSGKSV~iN~iI~SlL 479 (806) |+-|+|||-+..++=-=++ T Consensus 270 G~PGaGKsTFaqAlAefy~ 288 (604) T COG1855 270 GAPGAGKSTFAQALAEFYA 288 (604) T ss_pred CCCCCCHHHHHHHHHHHHH T ss_conf 6999974689999999998 No 273 >COG0444 DppD ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component [Amino acid transport and metabolism / Inorganic ion transport and metabolism] Probab=69.65 E-value=6 Score=19.25 Aligned_cols=182 Identities=25% Similarity=0.340 Sum_probs=98.0 Q ss_pred EEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEE--EEEECCC-HHHHHH-----CC--CC------H-HHH Q ss_conf 785410020235530477406799999999999829957847--8885231-001110-----27--70------3-431 Q gi|254780799|r 447 IIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCR--LIMIDPK-MLELSV-----YD--GI------P-NLL 509 (806) Q Consensus 447 vv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evk--liliDPK-~vEls~-----Y~--~i------P-HLl 509 (806) +-.++.+==-|=|.|-.||||||--.+|+- ||-+ .|..+. =|..|-+ .+.|+- |- .| | .=| T Consensus 24 vs~~i~~GE~lgiVGESGsGKS~~~~aim~-llp~-~~~~i~~G~i~f~g~~l~~l~~~~~~~iRG~~I~mIfQ~p~~sL 101 (316) T COG0444 24 VSFELKKGEILGIVGESGSGKSVLAKAIMG-LLPK-PNARIVGGEILFDGKDLLSLSEKELRKIRGKEIAMIFQDPMTSL 101 (316) T ss_pred EEEEECCCCEEEEECCCCCCHHHHHHHHHH-CCCC-CCCEEEEEEEEECCCCCCCCCHHHHHHHCCCEEEEEECCCHHHC T ss_conf 058875896899983897889999999984-6688-89748611899889646669999998631756899974815644 Q ss_pred CCCCCCHHHHHHHHHHHH----HH--HHHHHHHHHHCCCCCHH----HH--------HHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 223343045668999999----99--99999999870899689----99--------99999887447866775446776 Q gi|254780799|r 510 TPVVTNPQKAVTVLKWLV----CE--MEERYQKMSKIGVRNID----GF--------NLKVAQYHNTGKKFNRTVQTGFD 571 (806) Q Consensus 510 ~pVvTd~~kA~~aL~w~V----~E--Me~RY~l~a~~~vRni~----~y--------N~k~~~~~~~~~~~~~~~~~~~~ 571 (806) .||-|=-+.-..+|+.-- ++ +++=.++|...|..|-+ .| .++|--|.+- T Consensus 102 nPv~~Ig~Qi~E~l~~h~~~~~~~ea~~~a~~~L~~Vgi~~~~~~~~~YPhelSGGMrQRV~IAmal------------- 168 (316) T COG0444 102 NPVMTIGDQIAEVLRLHGKGLSKKEAKERAIELLELVGIPDPERRLKSYPHELSGGMRQRVMIAMAL------------- 168 (316) T ss_pred CCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCHHHHHHHHHHH------------- T ss_conf 9703499999999998514113689999999999976999878998619835587189999999998------------- Q ss_pred CCCCCCCCCCCCCCCCCCCEEEEEHHHHH-HHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCC Q ss_conf 54543222332232346986877634468-88873210058899999986641423799996577753554355411025 Q gi|254780799|r 572 RKTGEAIYETEHFDFQHMPYIVVVIDEMA-DLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPT 650 (806) Q Consensus 572 ~~~~~~~~~~~~~~~~~lp~ivviiDEla-Dlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~ 650 (806) --=|- ++|-||=- -|=-+...+|=+.+..|. +--|.-+|+=|---+| =|.+-- T Consensus 169 ---------------~~~P~-LlIADEPTTALDvtvQaqIl~ll~~l~---~e~~~siilITHDl~v-------va~~aD 222 (316) T COG0444 169 ---------------ALNPK-LLIADEPTTALDVTVQAQILDLLKELQ---REKGTALILITHDLGV-------VAEIAD 222 (316) T ss_pred ---------------HCCCC-EEEECCCCCHHHHHHHHHHHHHHHHHH---HHCCCEEEEEECCHHH-------HHHHCC T ss_conf ---------------58998-899679860451999999999999999---8549789999488899-------997456 Q ss_pred EEEEEECCCC----CCHHHCCCC Q ss_conf 1587645866----421233886 Q gi|254780799|r 651 RISFQVSSKI----DSRTILGEQ 669 (806) Q Consensus 651 riaf~v~s~~----dSrtild~~ 669 (806) ||+---+-++ +.+.|+..+ T Consensus 223 rv~VMYaG~iVE~g~~~~i~~~P 245 (316) T COG0444 223 RVAVMYAGRIVEEGPVEEIFKNP 245 (316) T ss_pred EEEEEECCEEEEECCHHHHHCCC T ss_conf 68998775899867888874389 No 274 >PRK10590 ATP-dependent RNA helicase RhlE; Provisional Probab=69.58 E-value=8.2 Score=18.25 Aligned_cols=45 Identities=20% Similarity=0.282 Sum_probs=31.2 Q ss_pred CCCHHHHHHHHHHHHHHH--CCEEEEEEE--CCCCCCCCCHHHHHCCCC Q ss_conf 210058899999986641--423799996--577753554355411025 Q gi|254780799|r 606 ARKDIESAVQRLAQMARA--SGIHVIMAT--QRPSVDVITGTIKANFPT 650 (806) Q Consensus 606 ~~~~ve~~i~rlaq~ara--~GiHli~aT--qrPsvdvitg~ikan~p~ 650 (806) .+++.|++|-|+.--||| .|.-.-+.| +++...-|--+||..+|- T Consensus 322 ~P~~~e~YvHRiGRTGRaG~~G~ait~v~~~e~~~~~~ie~~~~~~~~~ 370 (457) T PRK10590 322 LPNVPEDYVHRIGRTGRAAATGEALSLVCVDEHKLLRDIEKLLKKEIPR 370 (457) T ss_pred CCCCHHHEECCCCCCCCCCCCEEEEEEECHHHHHHHHHHHHHHCCCCCC T ss_conf 9997445002267060589953699986689999999999997798874 No 275 >PRK05654 acetyl-CoA carboxylase subunit beta; Validated Probab=69.50 E-value=8.2 Score=18.24 Aligned_cols=88 Identities=11% Similarity=0.156 Sum_probs=57.4 Q ss_pred CCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 89968999999998874478667754467765454322233223234698687763446888873210058899999986 Q gi|254780799|r 541 GVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKDIESAVQRLAQM 620 (806) Q Consensus 541 ~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~ve~~i~rlaq~ 620 (806) +.++-+.|.+|+++++++-. +.+-+.+|.- ...-+|-.|.+ =+|.=++=.-|..|-+.|+|.+|. T Consensus 90 ~F~d~k~Y~drl~~a~kkTg-~~dav~~g~G-------------~I~g~~v~~~~-~df~F~GGSmG~~~GEki~~a~e~ 154 (288) T PRK05654 90 KFRDSKKYKDRLKAAQKKTG-LKDAVVTGKG-------------TIEGMPVVLAV-MDFSFMGGSMGSVVGEKIVRAVER 154 (288) T ss_pred CCCCCCHHHHHHHHHHHHCC-CCCEEEEEEE-------------EECCEEEEEEE-ECHHHHCCCCCHHHHHHHHHHHHH T ss_conf 88885317899999998709-7507999899-------------99999999999-521454266457899999999999 Q ss_pred HHHCCEEEEEEECCCCCCCCCHH Q ss_conf 64142379999657775355435 Q gi|254780799|r 621 ARASGIHVIMATQRPSVDVITGT 643 (806) Q Consensus 621 ara~GiHli~aTqrPsvdvitg~ 643 (806) |..-++-||+-++.=-.-+=.|+ T Consensus 155 A~~~~~PlI~~~~SGGaRMQEGi 177 (288) T PRK05654 155 ALEEKCPLVIFSASGGARMQEGL 177 (288) T ss_pred HHHCCCCEEEEECCCCHHHCCCH T ss_conf 99749978999678763332746 No 276 >cd03290 ABCC_SUR1_N The SUR domain 1. The sulfonylurea receptor SUR is an ATP transporter of the ABCC/MRP family with tandem ATPase binding domains. Unlike other ABC proteins, it has no intrinsic transport function, neither active nor passive, but associates with the potassium channel proteins Kir6.1 or Kir6.2 to form the ATP-sensitive potassium (K(ATP)) channel. Within the channel complex, SUR serves as a regulatory subunit that fine-tunes the gating of Kir6.x in response to alterations in cellular metabolism. It constitutes a major pharmaceutical target as it binds numerous drugs, K(ATP) channel openers and blockers, capable of up- or down-regulating channel activity. Probab=69.42 E-value=4.2 Score=20.44 Aligned_cols=31 Identities=23% Similarity=0.416 Sum_probs=22.0 Q ss_pred CCEEEEECC----CCCEEEEEECCCCHHHHHHHHH Q ss_conf 666785410----0202355304774067999999 Q gi|254780799|r 444 GKPIIADLA----RMPHLLIAGTTGSGKSVAINTM 474 (806) Q Consensus 444 G~pvv~DLa----kMPHLLIAGtTGSGKSV~iN~i 474 (806) |.|+.-|+. +==.+-|-|.+|||||--+++| T Consensus 13 ~~~vL~~inl~i~~Ge~~~IvG~sGsGKSTLl~~l 47 (218) T cd03290 13 GLATLSNINIRIPTGQLTMIVGQVGCGKSSLLLAI 47 (218) T ss_pred CCEEEECEEEEECCCCEEEEECCCCCCHHHHHHHH T ss_conf 99056476999869999999999998099999998 No 277 >PRK13637 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=69.31 E-value=4 Score=20.56 Aligned_cols=99 Identities=20% Similarity=0.276 Sum_probs=52.9 Q ss_pred CEEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCC Q ss_conf 8687763446-888873210058899999986641423799996577753554355411025158764586642123388 Q gi|254780799|r 590 PYIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGE 668 (806) Q Consensus 590 p~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~ 668 (806) |- +++-||= +-|=..+.+++-+.|.+ ..+.-|+-+|++|..... =+.+-.||.+- -+-|.|-| T Consensus 163 P~-iLllDEPTs~LDp~~~~~i~~~l~~---L~~e~g~Tvi~vTHdl~~-------v~~~aDRvivl----~~G~Iv~~- 226 (287) T PRK13637 163 PK-VLILDEPTAGLDPKGRDDILEKIKA---LHKEYNMTIILVSHSMED-------VAKIADRIIVM----NKGRCELQ- 226 (287) T ss_pred CC-EEEEECCCCCCCHHHHHHHHHHHHH---HHHHCCCEEEEECCCHHH-------HHHHCCEEEEE----ECCEEEEE- T ss_conf 99-9998388664899999999999999---998509899999579999-------99969999999----89999998- Q ss_pred CCHHHHCCCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHCCCC Q ss_conf 64578658875477368983258883348988999999999712897 Q gi|254780799|r 669 QGAEQLLGQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKTQGEA 715 (806) Q Consensus 669 ~gae~Llg~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q~~~ 715 (806) +-.+.++- .|..++.+-+.-..+.++...++++|-. T Consensus 227 Gtp~evf~-----------~~~~l~~~~l~~P~~~~l~~~L~~~g~~ 262 (287) T PRK13637 227 GTPREVFK-----------EVDTLESIGLAVPQVTYLVRKLRKKGFN 262 (287) T ss_pred CCHHHHHC-----------CHHHHHHCCCCCCHHHHHHHHHHHCCCC T ss_conf 78899876-----------9889987699999199999999975999 No 278 >pfam05729 NACHT NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931. Probab=69.30 E-value=7.9 Score=18.36 Aligned_cols=55 Identities=20% Similarity=0.329 Sum_probs=35.1 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEE-EEECCCHHHHHHCCCCHHHHC Q ss_conf 2355304774067999999999998299578478-885231001110277034312 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRL-IMIDPKMLELSVYDGIPNLLT 510 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkl-iliDPK~vEls~Y~~iPHLl~ 510 (806) .++|-|.-|+|||..+.-|..-..-...+.+.++ +.+.-+.+..+....+..||. T Consensus 2 ~i~i~G~aG~GKTtll~kl~~~wa~g~~~~~~~~vf~~~~r~~~~~~~~sl~~ll~ 57 (165) T pfam05729 2 TVILQGEAGSGKTTLLQKLALLWAQGKLPQDFDFVFFLPCRELSRSGEASLADLLF 57 (165) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEHHHHCCCCCCCHHHHHH T ss_conf 89998279898999999999999869843697289999956707776689999999 No 279 >PRK11784 tRNA 2-selenouridine synthase; Provisional Probab=69.26 E-value=3.7 Score=20.81 Aligned_cols=20 Identities=35% Similarity=0.748 Sum_probs=17.3 Q ss_pred CEEEEEECCCCHHHHHHHHH Q ss_conf 02355304774067999999 Q gi|254780799|r 455 PHLLIAGTTGSGKSVAINTM 474 (806) Q Consensus 455 PHLLIAGtTGSGKSV~iN~i 474 (806) |-+++.|-|||||+--+|.+ T Consensus 138 ~~~vl~G~TG~GKT~lL~~L 157 (333) T PRK11784 138 PLVVLGGMTGSGKTRLLQAL 157 (333) T ss_pred CCEEEECCCCCCHHHHHHHH T ss_conf 85998678887789999999 No 280 >PTZ00112 origin recognition complex 1 protein; Provisional Probab=69.18 E-value=4.4 Score=20.27 Aligned_cols=226 Identities=16% Similarity=0.266 Sum_probs=113.2 Q ss_pred HHHHHHCCCCEEEEEEEECCEEEEEEEEECCCCCHHH---HHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCCCEE Q ss_conf 9997424863289984104424444432147863999---9978899998863001210003886125553024566238 Q gi|254780799|r 340 KSVLSDFGIQGEIVNVRPGPVITLYELEPAPGIKSSR---IIGLSDDIARSMSAISARVAVIPRRNAIGIELPNDIRETV 416 (806) Q Consensus 340 E~tL~dFGVe~~Vv~v~pGPVVTrYEi~PApGVKvSK---I~nLadDIA~aLsa~svRIapIPGK~~VGIEiPN~~r~~V 416 (806) +..+..|.++-.=.++..|-.=--|=. .+|-|..| +.-+.|-+-+.+ + +| =++|-+..| T Consensus 214 e~~i~~~~~~~hWenlvl~SS~~~~~y--~~~~~~~~~~~~~~~~~~lk~~~------~--vp------e~lpcRe~E-- 275 (650) T PTZ00112 214 EHRICPFNIKTHWDNLVLESSDYYYIY--YNKKKYNKNKSLKPIIDKLKLQL------V--VP------EYILGREKE-- 275 (650) T ss_pred CCCCCCCCCCCCHHHHHHCCCCCEEEE--CCCCCCCCCCCEEECCCHHEECC------C--CC------CCCCCCHHH-- T ss_conf 340104664112767541455432565--26422455751120352011012------5--77------447770789-- Q ss_pred EEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEE-EEEECCC Q ss_conf 734230721343022100021045666666785410020235530477406799999999999829957847-8885231 Q gi|254780799|r 417 MLRDLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCR-LIMIDPK 495 (806) Q Consensus 417 ~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evk-liliDPK 495 (806) |++-..-+.=++..+=+|.. |-|+|.-|.||.-++...|-.|--+..-.++. |-. T Consensus 276 ----------~~~I~~Fie~~i~q~GtG~c----------LYISGVPGTGKTATV~eVIr~L~~~~~~~~lp~F~f---- 331 (650) T PTZ00112 276 ----------AKEIRTFLETGIKQGGTGQI----------LYISGVPGTGKTATVYMVIKELQNKKLKGQLPWFNL---- 331 (650) T ss_pred ----------HHHHHHHHHHHCCCCCCCCE----------EEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCCCEE---- T ss_conf ----------99999999864116886656----------999789999800369999999999997089998159---- Q ss_pred HHHHHHCCCCHHHHCCCCCCHHHHHHHHHH------------HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCC Q ss_conf 001110277034312233430456689999------------99999999999987089968999999998874478667 Q gi|254780799|r 496 MLELSVYDGIPNLLTPVVTNPQKAVTVLKW------------LVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFN 563 (806) Q Consensus 496 ~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w------------~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~ 563 (806) ||. ||. =+|+|+.|-..|-- |....|+|+. T Consensus 332 -VEI---NGM------kLt~P~qaY~~L~e~Ltg~k~~~~~~A~~lL~k~F~---------------------------- 373 (650) T PTZ00112 332 -FEI---NGV------NLVHPNAAYRVFYKKLFNKKPPNALNSYKELDKLFN---------------------------- 373 (650) T ss_pred -EEE---CCE------ECCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHC---------------------------- T ss_conf -997---363------779878899999999848988867899999999826---------------------------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHH Q ss_conf 75446776545432223322323469868776344688887321005889999998664142379999657775355435 Q gi|254780799|r 564 RTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGT 643 (806) Q Consensus 564 ~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ 643 (806) + .-...|+++||| |++.|-..+|=-.+---..+.-| |||+-+= --+ T Consensus 374 --------~---------------~r~p~VlLvDEL-D~LvTkkQ~VlYNLFdWPT~~~S---kLIVIaI-------ANT 419 (650) T PTZ00112 374 --------N---------------NRTPSILIVDEA-DYIVTKTQKVLFTLFDWPTKKNS---KLILIII-------SNT 419 (650) T ss_pred --------C---------------CCCCEEEEECHH-HHHHHCCCCEEEECCCCCCCCCC---EEEEEEE-------EEC T ss_conf --------8---------------997189997157-77763677457773668898887---0799998-------506 Q ss_pred HHHCCCCEEEEEECCCCCCHHHCCCCCHHHHCCCCCEEEECCC-CC-----EEEEEECC Q ss_conf 5411025158764586642123388645786588754773689-83-----25888334 Q gi|254780799|r 644 IKANFPTRISFQVSSKIDSRTILGEQGAEQLLGQGDMLYMTGG-GR-----VQRIHGPF 696 (806) Q Consensus 644 ikan~p~riaf~v~s~~dSrtild~~gae~Llg~gdml~~~~~-~~-----~~r~~g~~ 696 (806) . ++|-|+. .++-|| ||.+-|-|.|=. .. -.|++|.= T Consensus 420 M--DLPERL~----~RVsSR-----------LGltRltF~PYt~~QL~eII~sRL~~~~ 461 (650) T PTZ00112 420 M--DLPYKMK----PSCRSR-----------LAFGRLVFSPYKYQQIEKVIKERLENCE 461 (650) T ss_pred C--CCCHHHH----HHHHHH-----------CCCCEECCCCCCHHHHHHHHHHHHCCCC T ss_conf 7--8606566----665552-----------2885004399899999999999862677 No 281 >PTZ00011 alpha tubulin I; Provisional Probab=69.18 E-value=4.4 Score=20.23 Aligned_cols=15 Identities=27% Similarity=0.399 Sum_probs=8.7 Q ss_pred CCHHHHHHHHHHHHH Q ss_conf 100588999999866 Q gi|254780799|r 607 RKDIESAVQRLAQMA 621 (806) Q Consensus 607 ~~~ve~~i~rlaq~a 621 (806) .++|...+.++-++. T Consensus 325 ~~di~~~i~~ik~~~ 339 (453) T PTZ00011 325 PKDVNAAVATIKTKR 339 (453) T ss_pred CHHHHHHHHHHHHHC T ss_conf 305999999998546 No 282 >PRK13636 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=68.97 E-value=4.2 Score=20.44 Aligned_cols=205 Identities=21% Similarity=0.272 Sum_probs=95.6 Q ss_pred EEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCC--------CC----H-HHHCCCC Q ss_conf 785410020235530477406799999999999829957847888523100111027--------70----3-4312233 Q gi|254780799|r 447 IIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYD--------GI----P-NLLTPVV 513 (806) Q Consensus 447 vv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~--------~i----P-HLl~pVv 513 (806) +-.++.+==.+-|-|.+|||||..++.|-- + + +|++=+ |++|=+-++.+-.+ |+ | |.+.+. T Consensus 25 isl~I~~Ge~~aiiG~NGaGKSTLl~~i~G-l-l--~p~~G~-I~~~G~~i~~~~~~~~~~r~~ig~vfQ~p~~~l~~~- 98 (285) T PRK13636 25 ININIKKGEVTAILGGNGAGKSTLFQNLNG-I-L--KPSSGR-ILFDGKPIDYSRKGLMKLRESVGMVFQDPDNQLFSA- 98 (285) T ss_pred EEEEECCCCEEEEECCCCCHHHHHHHHHHC-C-C--CCCCCE-EEECCEECCCCHHHHHHHHHHEEEEEECCCCCCCCC- T ss_conf 378987998999999999809999999965-9-8--888608-999999987443449999874069970764244757- Q ss_pred CCHHHHHHH---HHHHHHHHHHHHH-HHHHCCCC--------CHHHHH-HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 430456689---9999999999999-99870899--------689999-9999887447866775446776545432223 Q gi|254780799|r 514 TNPQKAVTV---LKWLVCEMEERYQ-KMSKIGVR--------NIDGFN-LKVAQYHNTGKKFNRTVQTGFDRKTGEAIYE 580 (806) Q Consensus 514 Td~~kA~~a---L~w~V~EMe~RY~-l~a~~~vR--------ni~~yN-~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 580 (806) |=-+..+.. +..--+|+++|-+ .+...|.. ++.|=. +|+.-|.. T Consensus 99 tV~e~v~~g~~~~g~~~~e~~~rv~~~L~~~gl~~~~~~~~~~LSGGqkqRvaIA~a----------------------- 155 (285) T PRK13636 99 SVYQDVSFGAVNLKLPEDEVRKRVDNALKRTGIEHLKDKPTHCLSFGQKKRVAIAGV----------------------- 155 (285) T ss_pred CHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCCCCCHHHHHHHHHHHH----------------------- T ss_conf 499999999998599999999999999987598866528800199999999999999----------------------- Q ss_pred CCCCCCCCCCEEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCC Q ss_conf 3223234698687763446-888873210058899999986641423799996577753554355411025158764586 Q gi|254780799|r 581 TEHFDFQHMPYIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSK 659 (806) Q Consensus 581 ~~~~~~~~lp~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~ 659 (806) +-.=|- |++-||= |-|=-.+.+++-+.|.+|. +-.|+-.|++|.... . =+.+-.||.+ - T Consensus 156 -----La~~P~-iLlLDEPTagLDp~~~~~i~~ll~~l~---~e~g~TiilvtHd~~--~-----v~~~aDrviv----l 215 (285) T PRK13636 156 -----LVMEPK-VLVLDEPTAGLDPMGVSEIMKLLVEMQ---KELGLTIIIATHDID--I-----VPLYCDNVFV----M 215 (285) T ss_pred -----HHCCCC-EEEEECCCCCCCHHHHHHHHHHHHHHH---HHCCCEEEEECCCHH--H-----HHHHCCEEEE----E T ss_conf -----974998-999978755599999999999999999---844989999948899--9-----9996999999----9 Q ss_pred CCCHHHCCCCCHHHHCCCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHC Q ss_conf 64212338864578658875477368983258883348988999999999712 Q gi|254780799|r 660 IDSRTILGEQGAEQLLGQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKTQ 712 (806) Q Consensus 660 ~dSrtild~~gae~Llg~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q 712 (806) -+-|.|.+ +-.+.++-+-|+|.. +.+.-..|.+++..++.+ T Consensus 216 ~~G~iv~~-G~p~evf~~~~~l~~-----------~~l~~P~~~~l~~~L~~~ 256 (285) T PRK13636 216 KEGRVILQ-GNPKEVFAEKEMLRK-----------VNLRLPRIGHLMEILKEK 256 (285) T ss_pred ECCEEEEE-CCHHHHHCCHHHHHH-----------CCCCCCCHHHHHHHHHHH T ss_conf 89989998-699999669999997-----------799999699999999883 No 283 >PRK08694 consensus Probab=68.76 E-value=8.5 Score=18.13 Aligned_cols=140 Identities=14% Similarity=0.161 Sum_probs=76.9 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCC-HHHEEEEEECCCHHHHHHC----------CCCH--HHHCCCCCCHHHHHHHH Q ss_conf 355304774067999999999998299-5784788852310011102----------7703--43122334304566899 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMT-PAQCRLIMIDPKMLELSVY----------DGIP--NLLTPVVTNPQKAVTVL 523 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~-P~evkliliDPK~vEls~Y----------~~iP--HLl~pVvTd~~kA~~aL 523 (806) ..|||-+|.|||.+.=.|......+.+ | | .+.- +|++.- .+|| ++-+.-+|+-. -..+ T Consensus 221 iVIaaRPsmGKTalalnia~~~a~~~~~~--V--~~fS---LEMs~~~l~~Rlla~~s~v~~~~i~~g~l~~~e--~~~~ 291 (468) T PRK08694 221 IIVAGRPSMGKTAFSINIAEHVAVEGKLP--V--AVFS---MEMGGAQLVMRMLGSVGRLDQSVLKTGRLEDEH--WGRL 291 (468) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHHCCCC--E--EEEC---CCCCHHHHHHHHHHHHCCCCHHHCCCCCCCHHH--HHHH T ss_conf 99961786537899999999999847984--7--9977---889999999999997259863211048999999--9999 Q ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHH Q ss_conf 99999999999999870899689999999988744-78667754467765454322233223234698687763446888 Q gi|254780799|r 524 KWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNT-GKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADL 602 (806) Q Consensus 524 ~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDl 602 (806) .+++.+|.+.==..-.....++.....+.+..+.+ | -+.=+||||=|- | T Consensus 292 ~~a~~~l~~~pl~idd~~~~t~~~i~a~~r~~~~~~~-----------------------------~kl~~vvIDYLq-L 341 (468) T PRK08694 292 NEAVVKLSDAPVYIDETPGLTALELRARARRLARQFN-----------------------------NKLGLIVIDYLQ-L 341 (468) T ss_pred HHHHHHHHCCCEEEECCCCCCHHHHHHHHHHHHHHHC-----------------------------CCCCEEEEEHHH-H T ss_conf 9999998629968976999988799999999999838-----------------------------987389973675-4 Q ss_pred HHHCCC------HHHHHHHHHHHHHHHCCEEEEEEECCC Q ss_conf 873210------058899999986641423799996577 Q gi|254780799|r 603 MMVARK------DIESAVQRLAQMARASGIHVIMATQRP 635 (806) Q Consensus 603 mm~~~~------~ve~~i~rlaq~ara~GiHli~aTqrP 635 (806) |-..++ +|..-=-+|-++|+--+|..|+..|=- T Consensus 342 i~~~~~~~~r~~~i~~isr~LK~lAkel~ipvi~LsQLn 380 (468) T PRK08694 342 MAGSGRSDNRASELGEISRSLKALAKELQVPIIALSQLS 380 (468) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEECCCC T ss_conf 168887655999999999999999999799899963268 No 284 >TIGR00328 flhB flagellar biosynthetic protein FlhB; InterPro: IPR006136 The flhB and flhA genes constitute an operon called flhB operon on the Salmonella typhimurium chromosome. They, together with FliI and FliH, may constitute the export apparatus of flagellin, the component protein of flagellar filament. The flhB gene encodes a highly hydrophobic polypeptide with several potential membrane-spanning segments, suggesting that it may be an integral membrane protein .; GO: 0009306 protein secretion, 0016020 membrane. Probab=68.69 E-value=8.5 Score=18.12 Aligned_cols=91 Identities=11% Similarity=0.007 Sum_probs=45.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCC---C------CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-- Q ss_conf 99999999999999999998442706896---3------136886450001346789999999999999999999999-- Q gi|254780799|r 26 MKIVAGLILLCTVFAITLALGTWDVYDPS---F------SYITLRSPKNFLGYGGAIFADVAIQFFGIASVFFLPPPT-- 94 (806) Q Consensus 26 l~Ei~Gl~Li~~al~l~iSL~SYsp~DPs---~------~~~s~~~v~N~~G~lGA~iAd~L~~lFG~~Ayllpl~Ll-- 94 (806) -+|+.+++.++++.+.+.-+.++.-.|-- + .+....+..+..+.++..+-+.++.++=+...++.+.++ T Consensus 24 S~E~~~~~~ll~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llp~~~~~l~~g~~~~ 103 (352) T TIGR00328 24 SREVNTAAVLLAGVLVLLFFGSEGLADLLEALMSFFLSLETSDLLELESLVEIIIELIREVLLLLLPILVLLLVVGVLSN 103 (352) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 31199999999999999998899999999999998753233553147999999999999999999999999999999998 Q ss_pred --HHHHHHHH--------CCCH--HHHHHH-HHHH Q ss_conf --99999982--------7652--367899-9999 Q gi|254780799|r 95 --MWALSLLF--------DKKI--YCFSKR-ATAW 116 (806) Q Consensus 95 --~~g~~ll~--------~k~~--~~~~~R-l~~~ 116 (806) -.||.+-. .-++ ....+| +|+. T Consensus 104 ~~Q~G~~ft~k~l~Pk~~KiNP~~~~G~KRS~FS~ 138 (352) T TIGR00328 104 ILQTGFLFTTKPLKPKLSKINPDVIKGLKRSLFSL 138 (352) T ss_pred HHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 88730233435466676758876886676877778 No 285 >TIGR01995 PTS-II-ABC-beta PTS system, beta-glucoside-specific IIABC component; InterPro: IPR011297 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) , is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC . These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose , , , . This entry represents a family of PTS enzyme II proteins in which all three domains are found in the same polypeptide chain and which appear to have a broad specificity for beta-glucosides including salicin (beta-D-glucose-1-salicylate) and arbutin (hydroquinone-O-beta-D-glucopyranoside) . These are distinct from the closely related sucrose-specific and trehalose-specific PTS transporters.; GO: 0008982 protein-N(PI)-phosphohistidine-sugar phosphotransferase activity, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system. Probab=68.45 E-value=8.6 Score=18.08 Aligned_cols=14 Identities=21% Similarity=0.256 Sum_probs=8.8 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 13467899999999 Q gi|254780799|r 67 LGYGGAIFADVAIQ 80 (806) Q Consensus 67 ~G~lGA~iAd~L~~ 80 (806) +|++|-+++..+-. T Consensus 283 ~GP~g~~~g~gi~~ 296 (660) T TIGR01995 283 LGPVGVYLGEGIAS 296 (660) T ss_pred HHHHHHHHHHHHHH T ss_conf 43589999999999 No 286 >cd01897 NOG NOG1 is a nucleolar GTP-binding protein present in eukaryotes ranging from trypanosomes to humans. NOG1 is functionally linked to ribosome biogenesis and found in association with the nuclear pore complexes and identified in many preribosomal complexes. Thus, defects in NOG1 can lead to defects in 60S biogenesis. The S. cerevisiae NOG1 gene is essential for cell viability, and mutations in the predicted G motifs abrogate function. It is a member of the ODN family of GTP-binding proteins that also includes the bacterial Obg and DRG proteins. Probab=68.38 E-value=4.3 Score=20.30 Aligned_cols=21 Identities=43% Similarity=0.678 Sum_probs=19.4 Q ss_pred CEEEEEECCCCHHHHHHHHHH Q ss_conf 023553047740679999999 Q gi|254780799|r 455 PHLLIAGTTGSGKSVAINTMI 475 (806) Q Consensus 455 PHLLIAGtTGSGKSV~iN~iI 475 (806) |+..|+|-|-+|||.-+|++. T Consensus 1 P~VaivG~pNvGKStL~N~L~ 21 (168) T cd01897 1 PTLVIAGYPNVGKSSLVNKLT 21 (168) T ss_pred CEEEEECCCCCCHHHHHHHHH T ss_conf 979998899988999999995 No 287 >PRK10247 putative ABC transporter ATP-binding protein YbbL; Provisional Probab=68.37 E-value=8.6 Score=18.07 Aligned_cols=37 Identities=30% Similarity=0.419 Sum_probs=24.3 Q ss_pred EEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEE Q ss_conf 85410020235530477406799999999999829957847 Q gi|254780799|r 448 IADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCR 488 (806) Q Consensus 448 v~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evk 488 (806) -.++.+=-.+.|-|-+|||||--++.| +.++ +|+.=+ T Consensus 27 s~~i~~Ge~~~i~G~sGsGKSTLlk~i-~gl~---~p~~G~ 63 (225) T PRK10247 27 NFSLRAGEFKLITGPSGCGKSTLLKIV-ASLI---SPTSGT 63 (225) T ss_pred EEEECCCCEEEEECCCCCCHHHHHHHH-HCCC---CCCCCE T ss_conf 799859969999999999999999999-6466---888765 No 288 >cd03253 ABCC_ATM1_transporter ATM1 is an ABC transporter that is expressed in the mitochondria. Although the specific function of ATM1 is unknown, its disruption results in the accumulation of excess mitochondrial iron, loss of mitochondrial cytochromes, oxidative damage to mitochondrial DNA, and decreased levels of cytosolic heme proteins. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=68.35 E-value=4.3 Score=20.34 Aligned_cols=25 Identities=28% Similarity=0.488 Sum_probs=18.0 Q ss_pred ECCCCCEEEEEECCCCHHHHHHHHH Q ss_conf 4100202355304774067999999 Q gi|254780799|r 450 DLARMPHLLIAGTTGSGKSVAINTM 474 (806) Q Consensus 450 DLakMPHLLIAGtTGSGKSV~iN~i 474 (806) ++.+==++-|-|.+|||||--++.| T Consensus 23 ~i~~Ge~v~ivG~sGsGKSTLl~ll 47 (236) T cd03253 23 TIPAGKKVAIVGPSGSGKSTILRLL 47 (236) T ss_pred EECCCCEEEEECCCCCCHHHHHHHH T ss_conf 9869999999999999899999997 No 289 >COG1846 MarR Transcriptional regulators [Transcription] Probab=68.34 E-value=7.9 Score=18.36 Aligned_cols=57 Identities=18% Similarity=0.392 Sum_probs=40.5 Q ss_pred HHHHHHHCCCEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHCCC-CCEEEECHHH Q ss_conf 99999965985000142220011778999999999779868022788-7267317125 Q gi|254780799|r 746 AVDIVLRDNKASISYIQRRLGIGYNRAASIIENMEEKGVIGPASSTG-KREILISSME 802 (806) Q Consensus 746 a~~~v~~~~~~s~s~lqr~~~igy~raar~~~~~e~~giv~~~~g~~-~r~vl~~~~~ 802 (806) ...++...+..+.+-|..++.+-..-..|+++.||++|.|--..+.. .|.+++.-.+ T Consensus 27 ~L~~l~~~~~~~~~~la~~l~~~~~~vt~~l~~Le~~glv~r~~~~~DrR~~~l~lT~ 84 (126) T COG1846 27 VLLALYEAGGITVKELAERLGLDRSTVTRLLKRLEDKGLIERLRDPEDRRAVLVRLTE 84 (126) T ss_pred HHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECCCCCCCEEEEEECH T ss_conf 9999998489999999999787888899999999988993674388886312435562 No 290 >TIGR02040 PpsR-CrtJ transcriptional regulator PpsR; InterPro: IPR011785 This entry represents the transcriptional regulator PpsR which is strictly associated with photosynthetic proteobacteria and found in photosynthetic operons. PpsR has been reported to be a repressor . These proteins contain a Helix-Turn-Helix motif of the "fis" type.. Probab=68.13 E-value=2.8 Score=21.77 Aligned_cols=156 Identities=19% Similarity=0.301 Sum_probs=89.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH----------HHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 0456689999999999999999----------870899689999999988744786677544677654543222332232 Q gi|254780799|r 516 PQKAVTVLKWLVCEMEERYQKM----------SKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFD 585 (806) Q Consensus 516 ~~kA~~aL~w~V~EMe~RY~l~----------a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 585 (806) ...|..==-|.-+|||.||+.+ -+...--|-+-|..-..+-..+. .+-|.-.|.+ T Consensus 122 AQ~AmErDYW~lR~~EtRYR~ll~~s~davllv~~st~rileaN~~A~~lLg~~~-------~~rd~LvG~~-------- 186 (453) T TIGR02040 122 AQQAMERDYWKLREMETRYRLLLEVSSDAVLLVDVSTGRILEANSAAAALLGAEG-------QRRDSLVGRE-------- 186 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHHHHCCCC-------CCCHHHHCCH-------- T ss_conf 9878888799986675444335411488679985365347786799999850267-------7501231113-------- Q ss_pred CCCCCEEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHH Q ss_conf 34698687763446888873210058899999986641423799996577753554355411025158764586642123 Q gi|254780799|r 586 FQHMPYIVVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTI 665 (806) Q Consensus 586 ~~~lp~ivviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrti 665 (806) +|. .+=-.-..+++....+++...+|.++...++-.+-.++|.--+..+|--++-=.|.+.-=.+--| T Consensus 187 ---~~~---------e~~~~~~~~l~~~l~~~~atg~A~~~~i~l~~~~~~~~v~~s~~r~~~~~~fL~~l~~~~~~~~~ 254 (453) T TIGR02040 187 ---FPQ---------ELEEREREELELLLREVRATGKAAEVRILLAESRKELLVVVSLFRQDSESLFLVRLSPAGASAAV 254 (453) T ss_pred ---HHC---------CCCCCCHHHHHHHHHHHHHCCCCCCCEEEECCCCCCEEEEEEEEECCCCCHHHEEECCCCCCCCC T ss_conf ---431---------46645436899999864301577871787457887317887764023664101130311244677 Q ss_pred CCCC--C--HHHHCCCC-CEEEEC-CCCCEEEEEECCCC Q ss_conf 3886--4--57865887-547736-89832588833489 Q gi|254780799|r 666 LGEQ--G--AEQLLGQG-DMLYMT-GGGRVQRIHGPFVS 698 (806) Q Consensus 666 ld~~--g--ae~Llg~g-dml~~~-~~~~~~r~~g~~v~ 698 (806) =+.. | =++|.-.+ |-+..- ..+...++--||+. T Consensus 255 ~~~~s~~sml~~l~~~~pDaiv~~D~~G~i~~aN~aFl~ 293 (453) T TIGR02040 255 GDVLSENSMLAKLVEEAPDAIVVVDADGRIKRANEAFLE 293 (453) T ss_pred CCCCCCHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHH T ss_conf 776674278999997429879997788732888899998 No 291 >PRK06354 pyruvate kinase; Provisional Probab=68.09 E-value=2.8 Score=21.73 Aligned_cols=119 Identities=29% Similarity=0.354 Sum_probs=61.1 Q ss_pred HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH-HHCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 4312233430456689999999999999999870899689999999988-744786677544677654543222332232 Q gi|254780799|r 507 NLLTPVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQY-HNTGKKFNRTVQTGFDRKTGEAIYETEHFD 585 (806) Q Consensus 507 HLl~pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 585 (806) +|-.|-+|+-.+ ..|+|++ +.--+..|..=||+=+.-.+ +++. ...|... ++.. ..| .+.. T Consensus 170 ~l~lp~lTeKD~--~dl~f~~---~~~vD~VAlSFVrsa~DV~~-ir~~l~~~g~~~--~IIA---------KIE-~~ea 231 (589) T PRK06354 170 SLSLPALTEKDR--EDLIFGL---EQGVDWIALSFVRNPSDVLE-IKELIEHNGKSI--PVIA---------KIE-KQEA 231 (589) T ss_pred CCCCCCCCCCHH--HHHHHHH---HCCCCEEEECCCCCHHHHHH-HHHHHHHCCCCC--EEEE---------EEC-CHHH T ss_conf 778778982028--9999999---74999899827899899999-999998669863--4999---------745-7999 Q ss_pred CCCCCEEEEEHHHHHHHHHHCCCH--HHHHH-------HHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEE Q ss_conf 346986877634468888732100--58899-------99998664142379999657775355435541102515 Q gi|254780799|r 586 FQHMPYIVVVIDEMADLMMVARKD--IESAV-------QRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRI 652 (806) Q Consensus 586 ~~~lp~ivviiDElaDlmm~~~~~--ve~~i-------~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~ri 652 (806) ++.|.-|+ |.+|=.|+|.-| ||-++ -|+-+++|.+|.-.|+||| ..--+|+.-.|||- T Consensus 232 v~NldeIi----~~sDgIMVARGDLGvEip~e~VP~~QK~II~~c~~~gKPVI~ATQ-----MLeSMi~np~PTRA 298 (589) T PRK06354 232 IDNIDAIL----ELCDGVMVARGDLGVEIPAEEVPLLQKRLIKKANSLGIPVITATQ-----MLDSMQRNPRPTRA 298 (589) T ss_pred HHHHHHHH----HHCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEC-----CHHHHCCCCCCCHH T ss_conf 98699998----747699995486320258888479999999999973997899815-----57875379998715 No 292 >cd03369 ABCC_NFT1 Domain 2 of NFT1 (New full-length MRP-type transporter 1). NFT1 belongs to the MRP (mulrtidrug resisitance-associated protein) family of ABC transporters. Some of the MRP members have five additional transmembrane segments in their N-terminas, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resisting lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions such as glutathione, glucuronate, and sulfate. Probab=68.01 E-value=7.1 Score=18.70 Aligned_cols=27 Identities=30% Similarity=0.383 Sum_probs=20.8 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHH Q ss_conf 541002023553047740679999999 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMI 475 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI 475 (806) .++.+=-.+-|-|.+|||||-.++.|. T Consensus 29 l~i~~Ge~v~ivG~sGsGKSTLl~ll~ 55 (207) T cd03369 29 FKVKAGEKIGIVGRTGAGKSTLILALF 55 (207) T ss_pred EEECCCCEEEEECCCCCCHHHHHHHHH T ss_conf 898699999999999987999999999 No 293 >PRK13764 ATPase; Provisional Probab=68.00 E-value=4.1 Score=20.50 Aligned_cols=108 Identities=23% Similarity=0.267 Sum_probs=63.2 Q ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCC-C-CCCCC Q ss_conf 563256654578999999742486328998410442444443214786399999788999988630012100-0-38861 Q gi|254780799|r 325 MTFSPKVMQNNACTLKSVLSDFGIQGEIVNVRPGPVITLYELEPAPGIKSSRIIGLSDDIARSMSAISARVA-V-IPRRN 402 (806) Q Consensus 325 ~~~s~eeL~~nA~lLE~tL~dFGVe~~Vv~v~pGPVVTrYEi~PApGVKvSKI~nLadDIA~aLsa~svRIa-p-IPGK~ 402 (806) ...++.+|+..++-+.+.= ...=++-+.==.+|-+|.++- +-||+ . =|= T Consensus 175 ~plt~~el~~ia~eIle~a-~~~~~~fiEi~~~G~tvvQ~~--------------------------~~RI~i~rPPf-- 225 (605) T PRK13764 175 EPLTEYELEEIAREILERA-KRDPDGFIEIERPGATVVQLG--------------------------NYRIVIARPPF-- 225 (605) T ss_pred CCCCHHHHHHHHHHHHHHH-HCCCCCEEEEECCCCEEEEEC--------------------------CEEEEEECCCC-- T ss_conf 5599999999999999987-148995599814996799866--------------------------78999845998-- Q ss_pred EEEEEECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHH Q ss_conf 25553024566238734230721343022100021045666666785410020235530477406799999999999 Q gi|254780799|r 403 AIGIELPNDIRETVMLRDLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLL 479 (806) Q Consensus 403 ~VGIEiPN~~r~~V~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlL 479 (806) +=|+||--. |-.|.++ =++|.-+. +|-=-|-+ +-.-.||||+-|||||-+--++=.-+. T Consensus 226 Sd~~EITaV-RPv~~~s----ledY~l~~-~l~~Rl~~------------~a~GilIaG~PGaGKsTfaqalA~~~~ 284 (605) T PRK13764 226 SDGIEITAV-RPIVKLS----LEDYNLSE-KLKERLEE------------RAEGILIAGAPGAGKSTFAQALAEFYA 284 (605) T ss_pred CCCEEEEEE-EEEEECC----HHHCCCCH-HHHHHHHH------------CCCEEEEECCCCCCHHHHHHHHHHHHH T ss_conf 776379997-1017804----77716789-99998873------------366499977999977899999999998 No 294 >cd03223 ABCD_peroxisomal_ALDP Peroxisomal ATP-binding cassette transporter (Pat) is involved in the import of very long-chain fatty acids (VLCFA) into the peroxisome. The peroxisomal membrane forms a permeability barrier for a wide variety of metabolites required for and formed during fatty acid beta-oxidation. To communicate with the cytoplasm and mitochondria, peroxisomes need dedicated proteins to transport such hydrophilic molecules across their membranes. X-linked adrenoleukodystrophy (X-ALD) is caused by mutations in the ALD gene, which encodes ALDP (adrenoleukodystrophy protein ), a peroxisomal integral membrane protein that is a member of the ATP-binding cassette (ABC) transporter protein family. The disease is characterized by a striking and unpredictable variation in phenotypic expression. Phenotypes include the rapidly progressive childhood cerebral form (CCALD), the milder adult form, adrenomyeloneuropathy (AMN), and variants without neurologic involvement (i.e. asympt Probab=67.85 E-value=4.5 Score=20.20 Aligned_cols=31 Identities=35% Similarity=0.644 Sum_probs=22.6 Q ss_pred CCEEEEEC----CCCCEEEEEECCCCHHHHHHHHH Q ss_conf 66678541----00202355304774067999999 Q gi|254780799|r 444 GKPIIADL----ARMPHLLIAGTTGSGKSVAINTM 474 (806) Q Consensus 444 G~pvv~DL----akMPHLLIAGtTGSGKSV~iN~i 474 (806) |+++.-|+ .+==++.|-|.+|||||--++.| T Consensus 13 ~~~il~~isl~i~~Ge~v~i~G~sGsGKSTLl~~l 47 (166) T cd03223 13 GRVLLKDLSFEIKPGDRLLITGPSGTGKSSLFRAL 47 (166) T ss_pred CCEEEEEEEEEECCCCEEEEECCCCCCHHHHHHHH T ss_conf 99889445889889999999958999889999998 No 295 >cd01853 Toc34_like Toc34-like (Translocon at the Outer-envelope membrane of Chloroplasts). This family contains several Toc proteins, including Toc34, Toc33, Toc120, Toc159, Toc86, Toc125, and Toc90. The Toc complex at the outer envelope membrane of chloroplasts is a molecular machine of ~500 kDa that contains a single Toc159 protein, four Toc75 molecules, and four or five copies of Toc34. Toc64 and Toc12 are associated with the translocon, but do not appear to be part of the core complex. The Toc translocon initiates the import of nuclear-encoded preproteins from the cytosol into the organelle. Toc34 and Toc159 are both GTPases, while Toc75 is a beta-barrel integral membrane protein. Toc159 is equally distributed between a soluble cytoplasmic form and a membrane-inserted form, suggesting that assembly of the Toc complex is dynamic. Toc34 and Toc75 act sequentially to mediate docking and insertion of Toc159 resulting in assembly of the functional translocon. Probab=67.85 E-value=3.6 Score=20.94 Aligned_cols=47 Identities=23% Similarity=0.315 Sum_probs=30.8 Q ss_pred ECCCCCEEEEEECCCCHHHHHHHHHHHH---HHHHCCHHHEEEEEECCCH Q ss_conf 4100202355304774067999999999---9982995784788852310 Q gi|254780799|r 450 DLARMPHLLIAGTTGSGKSVAINTMILS---LLYRMTPAQCRLIMIDPKM 496 (806) Q Consensus 450 DLakMPHLLIAGtTGSGKSV~iN~iI~S---lLyk~~P~evkliliDPK~ 496 (806) ||.----.|+=|-||-|||--||+|+=- --=-+.|+..|..||-=.+ T Consensus 27 ~~~~sltILvlGKtGVGKSsTINSifgE~~~~~~aF~~~t~r~~~v~~tv 76 (249) T cd01853 27 ELDFSLTILVLGKTGVGKSSTINSIFGERKAATSAFQSETLRVREVSGTV 76 (249) T ss_pred CCCCEEEEEEEECCCCCHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEE T ss_conf 45643699998068764577677650854134477677886508998753 No 296 >PHA00520 packaging NTPase P4 Probab=67.82 E-value=2.8 Score=21.69 Aligned_cols=59 Identities=20% Similarity=0.336 Sum_probs=42.1 Q ss_pred EEEEEHHHHHHHHHHCC---------CHHHHHHHHHHHHHHHCCEEEEEEECCCCCC------CCCHHHHHCCCC Q ss_conf 68776344688887321---------0058899999986641423799996577753------554355411025 Q gi|254780799|r 591 YIVVVIDEMADLMMVAR---------KDIESAVQRLAQMARASGIHVIMATQRPSVD------VITGTIKANFPT 650 (806) Q Consensus 591 ~ivviiDElaDlmm~~~---------~~ve~~i~rlaq~ara~GiHli~aTqrPsvd------vitg~ikan~p~ 650 (806) -=||+||-|-++.--++ +-+-+...-|-|+|++.|.- |+|+--|++| |.--+-|.|--+ T Consensus 181 ~~vvvvDSlR~v~~~l~Gnat~GGISr~~y~~LTdl~n~aas~gc~-vV~~lNPmSdDeKI~~v~~~v~~S~s~~ 254 (326) T PHA00520 181 HRVVVVDSLRNVIFALGGNATSGGISRGAYDLLTDIGNMAASRGCV-VVASLNPMSDDEKIVEVVKEVSASNSGS 254 (326) T ss_pred CCEEEEECHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCE-EEEECCCCCCCHHHHHHHHHHHHHCCCE T ss_conf 7179952146666441378787744688999998888888763957-9997388886077999999987613641 No 297 >cd00288 Pyruvate_Kinase Pyruvate kinase (PK): Large allosteric enzyme that regulates glycolysis through binding of the substrate, phosphoenolpyruvate, and one or more allosteric effectors. Like other allosteric enzymes, PK has a high substrate affinity R state and a low affinity T state. PK exists as several different isozymes, depending on organism and tissue type. In mammals, there are four PK isozymes: R, found in red blood cells, L, found in liver, M1, found in skeletal muscle, and M2, found in kidney, adipose tissue, and lung. PK forms a homotetramer, with each subunit containing three domains. The T state to R state transition of PK is more complex than in most allosteric enzymes, involving a concerted rotation of all 3 domains of each monomer in the homotetramer. Probab=67.68 E-value=3.2 Score=21.25 Aligned_cols=253 Identities=21% Similarity=0.225 Sum_probs=124.8 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHC-CCCCCCCCCCCEEEE Q ss_conf 2566545789999997424863289984104424444432147863999997889999886300-121000388612555 Q gi|254780799|r 328 SPKVMQNNACTLKSVLSDFGIQGEIVNVRPGPVITLYELEPAPGIKSSRIIGLSDDIARSMSAI-SARVAVIPRRNAIGI 406 (806) Q Consensus 328 s~eeL~~nA~lLE~tL~dFGVe~~Vv~v~pGPVVTrYEi~PApGVKvSKI~nLadDIA~aLsa~-svRIapIPGK~~VGI 406 (806) +.++.++.-+.+.+.-+++|...-|--=.+||-+=.=++.....++ +... .+++..=+++ T Consensus 39 ~~e~~~~~i~~iR~~~~~~~~~i~Il~Dl~GpkiR~g~~~~~~~i~--------------l~~G~~v~l~~~~~~----- 99 (480) T cd00288 39 SHEYHQSRIDNVREAAEKTGGPVAIALDTKGPEIRTGLFKGGKDIS--------------LKAGDKFLVTTDPAA----- 99 (480) T ss_pred CHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEEECCCCCEEE--------------ECCCCEEEEEECCCC----- T ss_conf 9999999999999999971998079997889835898548996379--------------548999999856876----- Q ss_pred EECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCCC------CCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHH Q ss_conf 3024566238734230721343022100021045------6666667854100202355304774067999999999998 Q gi|254780799|r 407 ELPNDIRETVMLRDLIVSRVFEKNQCDLAINLGK------SIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLY 480 (806) Q Consensus 407 EiPN~~r~~V~lreil~s~~f~~s~~~L~iaLGK------dI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLy 480 (806) ..-|.++=.|...++++. -+....+-+.=|+ ++.++-.+ --+.+-.|.=+|+|.|-+ T Consensus 100 ~~~~~~~i~v~~~~l~~~---v~~Gd~I~idDG~i~l~V~~~~~~~~i-----~~~v~~gG~l~s~Kgvn~--------- 162 (480) T cd00288 100 KKGTKEKIYVDYKNLTKD---VSPGNTILVDDGLLSLKVLSKDDDKTL-----VCEVLNGGVLGSRKGVNL--------- 162 (480) T ss_pred CCCCCCEEECCHHHHHHH---CCCCCEEEEECCEEEEEEEEECCCCEE-----EEEEEECCEECCCCCCCC--------- T ss_conf 789888896572888976---589988999478389999998589769-----999975839738863236--------- Q ss_pred HCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCC Q ss_conf 29957847888523100111027703431223343045668999999999999999987089968999999998874478 Q gi|254780799|r 481 RMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGK 560 (806) Q Consensus 481 k~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~ 560 (806) |+- ++-.|.+|+-.+ ..|+|+++ .--+..+-.=||+-+.-.+--+-.++.|. T Consensus 163 ---p~~--------------------~~~lp~lTekD~--~di~~a~~---~~vD~valSFVr~~~Dv~~lr~~l~~~g~ 214 (480) T cd00288 163 ---PGT--------------------DVDLPALSEKDK--ADLRFGVE---QGVDMIFASFVRKASDVLEIREVLGEKGK 214 (480) T ss_pred ---CCC--------------------CCCCCCCCHHHH--HHHHHHHH---CCCCEEEECCCCCHHHHHHHHHHHHHCCC T ss_conf ---888--------------------678667987789--99986887---59999998888989999999999997488 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCH--HHHHH-------HHHHHHHHHCCEEEEEE Q ss_conf 6677544677654543222332232346986877634468888732100--58899-------99998664142379999 Q gi|254780799|r 561 KFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKD--IESAV-------QRLAQMARASGIHVIMA 631 (806) Q Consensus 561 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~--ve~~i-------~rlaq~ara~GiHli~a 631 (806) .. ++.. ..| ....++.|+-|+ +.+|=.|+|.-| ||-++ -++-++||++|.-.|+| T Consensus 215 ~~--~Iia---------KIE-~~~al~nl~eIi----~~sDgIMIARGDLgvEi~~e~vp~~Qk~Ii~~c~~~gKPvIvA 278 (480) T cd00288 215 DI--KIIA---------KIE-NQEGVNNFDEIL----EASDGIMVARGDLGVEIPAEEVFLAQKMLIAKCNLAGKPVITA 278 (480) T ss_pred CC--EEEE---------EEC-CHHHHHCHHHHH----HHCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 66--6999---------852-776664799999----8538899977865564798898999999999999839959996 Q ss_pred ECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCH Q ss_conf 6577753554355411025158764586642123388645 Q gi|254780799|r 632 TQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGA 671 (806) Q Consensus 632 TqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~ga 671 (806) || +.--+|....|||-=- + -+ +-.|+| || T Consensus 279 Tq-----mLeSMi~~p~PTRAEv--~-DV-anAv~d--G~ 307 (480) T cd00288 279 TQ-----MLESMIYNPRPTRAEV--S-DV-ANAVLD--GT 307 (480) T ss_pred CC-----HHHHHHCCCCCCEEEH--H-HH-HHHHHH--CC T ss_conf 73-----6887623899852102--4-58-889874--58 No 298 >cd03258 ABC_MetN_methionine_transporter MetN (also known as YusC) is an ABC-type transporter encoded by metN of the metNPQ operon in Bacillus subtilis that is involved in methionine transport. Other members of this system include the MetP permease and the MetQ substrate binding protein. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=67.67 E-value=5.7 Score=19.39 Aligned_cols=175 Identities=19% Similarity=0.286 Sum_probs=84.8 Q ss_pred EEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHH-HH----------------HHCCCCHHHH Q ss_conf 785410020235530477406799999999999829957847888523100-11----------------1027703431 Q gi|254780799|r 447 IIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKML-EL----------------SVYDGIPNLL 509 (806) Q Consensus 447 vv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~v-El----------------s~Y~~iPHLl 509 (806) +-.++.+==.+-|-|..|||||+.++ +|..| . +|+.=.. ++|=|-+ .+ .-|+=+||+ T Consensus 24 vsl~i~~Ge~~~ivG~SGsGKSTllr-~i~gL-~--~p~sG~I-~~~g~~i~~~~~~~~~~~Rr~ig~VFQ~~~L~~~~- 97 (233) T cd03258 24 VSLSVPKGEIFGIIGRSGAGKSTLIR-CINGL-E--RPTSGSV-LVDGTDLTLLSGKELRKARRRIGMIFQHFNLLSSR- 97 (233) T ss_pred CEEEECCCCEEEEECCCCCHHHHHHH-HHHCC-C--CCCCCEE-EECCEECCCCCHHHHHHHHCCCCEEECCCCCCCCC- T ss_conf 28899999999998898058999999-99679-9--9998089-99999989799999999862587794377889988- Q ss_pred CCCCCCHHHHHHHH---HHHHHHHHHH-HHHHHHCCCCCHHH---------HHHHHHHHHHCCCCCCCCCCCCCCCCCCC Q ss_conf 22334304566899---9999999999-99998708996899---------99999988744786677544677654543 Q gi|254780799|r 510 TPVVTNPQKAVTVL---KWLVCEMEER-YQKMSKIGVRNIDG---------FNLKVAQYHNTGKKFNRTVQTGFDRKTGE 576 (806) Q Consensus 510 ~pVvTd~~kA~~aL---~w~V~EMe~R-Y~l~a~~~vRni~~---------yN~k~~~~~~~~~~~~~~~~~~~~~~~~~ 576 (806) |=-...+..| +|--+|.++| .+++...|..+... =-+||.-|++ T Consensus 98 ----tv~~nv~~~l~~~~~~~~~~~~r~~~lL~~vgL~~~~~~yP~eLSGGq~QRVaIARA------------------- 154 (233) T cd03258 98 ----TVFENVALPLEIAGVPKAEIEERVLELLELVGLEDKADAYPAQLSGGQKQRVGIARA------------------- 154 (233) T ss_pred ----CHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCCCCCCHHHHHHHHHHHH------------------- T ss_conf ----399999999997499999999999999986799167626965267788899999999------------------- Q ss_pred CCCCCCCCCCCCCCEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEE Q ss_conf 2223322323469868776344-688887321005889999998664142379999657775355435541102515876 Q gi|254780799|r 577 AIYETEHFDFQHMPYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQ 655 (806) Q Consensus 577 ~~~~~~~~~~~~lp~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~ 655 (806) +-.=|- +++-|| .+.|=.+..++|-+.|.+| .+.-|+-+|+.|--+.+ =+.+-.||++- T Consensus 155 ---------L~~~P~-lllaDEPTs~LD~~~~~~il~ll~~l---~~e~g~t~i~vTHDl~~-------~~~~adrv~vm 214 (233) T cd03258 155 ---------LANNPK-VLLCDEATSALDPETTQSILALLRDI---NRELGLTIVLITHEMEV-------VKRICDRVAVM 214 (233) T ss_pred ---------HHCCCC-EEEECCCCCCCCHHHHHHHHHHHHHH---HHHHCCEEEEECCCHHH-------HHHHCCEEEEE T ss_conf ---------833998-99965976646988999999999999---99729899998989999-------99869979999 Q ss_pred ECCCCCCHHHCCCCCHHHHC Q ss_conf 45866421233886457865 Q gi|254780799|r 656 VSSKIDSRTILGEQGAEQLL 675 (806) Q Consensus 656 v~s~~dSrtild~~gae~Ll 675 (806) +---|..++-+++++ T Consensus 215 -----~~G~Ive~G~~~~v~ 229 (233) T cd03258 215 -----EKGEVVEEGTVEEVF 229 (233) T ss_pred -----ECCEEEEECCHHHHH T ss_conf -----798999988989996 No 299 >cd04163 Era Era subfamily. Era (E. coli Ras-like protein) is a multifunctional GTPase found in all bacteria except some eubacteria. It binds to the 16S ribosomal RNA (rRNA) of the 30S subunit and appears to play a role in the assembly of the 30S subunit, possibly by chaperoning the 16S rRNA. It also contacts several assembly elements of the 30S subunit. Era couples cell growth with cytokinesis and plays a role in cell division and energy metabolism. Homologs have also been found in eukaryotes. Era contains two domains: the N-terminal GTPase domain and a C-terminal domain KH domain that is critical for RNA binding. Both domains are important for Era function. Era is functionally able to compensate for deletion of RbfA, a cold-shock adaptation protein that is required for efficient processing of the 16S rRNA. Probab=67.59 E-value=5 Score=19.83 Aligned_cols=23 Identities=26% Similarity=0.495 Sum_probs=20.4 Q ss_pred CCCEEEEEECCCCHHHHHHHHHH Q ss_conf 02023553047740679999999 Q gi|254780799|r 453 RMPHLLIAGTTGSGKSVAINTMI 475 (806) Q Consensus 453 kMPHLLIAGtTGSGKSV~iN~iI 475 (806) |-++.-++|-+.+|||-.+|++. T Consensus 2 ~~~~V~ivG~pN~GKSsL~N~L~ 24 (168) T cd04163 2 KSGFVAIVGRPNVGKSTLLNALV 24 (168) T ss_pred CCCEEEEECCCCCCHHHHHHHHH T ss_conf 98689999999999999999995 No 300 >PRK06247 pyruvate kinase; Provisional Probab=67.54 E-value=3.5 Score=21.02 Aligned_cols=227 Identities=21% Similarity=0.284 Sum_probs=121.3 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEE-----EEECCCCCHHHHHHHHHHHHHHHHHCCCCC--CCCCC Q ss_conf 25665457899999974248632899841044244444-----321478639999978899998863001210--00388 Q gi|254780799|r 328 SPKVMQNNACTLKSVLSDFGIQGEIVNVRPGPVITLYE-----LEPAPGIKSSRIIGLSDDIARSMSAISARV--AVIPR 400 (806) Q Consensus 328 s~eeL~~nA~lLE~tL~dFGVe~~Vv~v~pGPVVTrYE-----i~PApGVKvSKI~nLadDIA~aLsa~svRI--apIPG 400 (806) +.++.++.-+.+.+.=+++|...-|--=.+||-+=-=+ ++...|-+ +++ ...+| T Consensus 42 ~~e~~~~~i~~iR~~~~~~~~~i~Il~Dl~GpkIR~g~~~~~~i~l~~G~~-------------------v~l~~~~~~~ 102 (477) T PRK06247 42 DHDDHRELYKRIREVEDELGRPIGILADLQGPKLRLGRFADGKVQLANGQT-------------------FRLDVDDAPG 102 (477) T ss_pred CHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEEECCCCCEEECCCCE-------------------EEEECCCCCC T ss_conf 999999999999999997199707999788983688853798589648999-------------------9996677789 Q ss_pred CCEEEEEECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCC------CCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHH Q ss_conf 612555302456623873423072134302210002104------56666667854100202355304774067999999 Q gi|254780799|r 401 RNAIGIELPNDIRETVMLRDLIVSRVFEKNQCDLAINLG------KSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTM 474 (806) Q Consensus 401 K~~VGIEiPN~~r~~V~lreil~s~~f~~s~~~L~iaLG------KdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~i 474 (806) . .++-.|...++++.- +....+-+.=| ..+.++-+.+ ..+.+|.=+|.|.|-+ T Consensus 103 ~---------~~~i~v~~~~l~~~v---~~Gd~I~idDG~i~l~V~~~~~~~i~~------~V~~gG~L~s~Kgvnl--- 161 (477) T PRK06247 103 D---------HDRVSLPHPEILAAL---KPGDRLLVDDGKVRLVVEACDGDDVVC------RVVEGGPVSDRKGVSL--- 161 (477) T ss_pred C---------CCEEEECCHHHHHHC---CCCCEEEEECCCEEEEEEECCCCEEEE------EEEECCEECCCCCCCC--- T ss_conf 7---------666774357779756---899889992795599999716986999------9962859738871204--- Q ss_pred HHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 99999829957847888523100111027703431223343045668999999999999999987089968999999998 Q gi|254780799|r 475 ILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQ 554 (806) Q Consensus 475 I~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~ 554 (806) |+. ++-.|..|+-.+ ..|+|+++. --+..+-.=||+-+.-.+ +++ T Consensus 162 ---------P~~--------------------~l~lp~lTekD~--~di~~a~~~---~vD~ialSFVrsa~DV~~-vr~ 206 (477) T PRK06247 162 ---------PGT--------------------VLPVSALTEKDR--ADLEFALEL---GVDWVALSFVQRPEDVEE-VRK 206 (477) T ss_pred ---------CCC--------------------CCCCCCCCHHHH--HHHHHHHHC---CCCEEEECCCCCHHHHHH-HHH T ss_conf ---------687--------------------557557897799--999989975---999999757698789999-998 Q ss_pred HHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCH--HHHHH-------HHHHHHHHHCC Q ss_conf 8744786677544677654543222332232346986877634468888732100--58899-------99998664142 Q gi|254780799|r 555 YHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKD--IESAV-------QRLAQMARASG 625 (806) Q Consensus 555 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~--ve~~i-------~rlaq~ara~G 625 (806) ...+.-++-- ..| ....++.+.-|+ +.+|=.|+|.-| ||-++ -++-.+||++| T Consensus 207 ~l~~~~~IIa-------------KIE-~~~av~NldeIi----~~sDgIMVARGDLgvEi~~e~vp~~Qk~ii~~~~~~g 268 (477) T PRK06247 207 VIGGRVPVMA-------------KIE-KPQAVDRLEAIV----EASDAIMVARGDLGVEVPLESVPLIQKRIIRMAREAG 268 (477) T ss_pred HCCCCCCEEE-------------EEC-CHHHHHHHHHHH----HHHCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 6587650899-------------963-887877489999----8617789965865455798897999999999998629 Q ss_pred EEEEEEECCCCCCCCCHHHHHCCCCEE Q ss_conf 379999657775355435541102515 Q gi|254780799|r 626 IHVIMATQRPSVDVITGTIKANFPTRI 652 (806) Q Consensus 626 iHli~aTqrPsvdvitg~ikan~p~ri 652 (806) .-.|+||| +.--+|+...|||- T Consensus 269 KpvivATq-----mLeSM~~~p~PTRA 290 (477) T PRK06247 269 KPVVVATQ-----MLESMIESPVPTRA 290 (477) T ss_pred CEEEEECC-----HHHHHCCCCCCCHH T ss_conf 96999715-----08874379998658 No 301 >PRK08156 surface presentation of antigens protein SpaS; Validated Probab=67.49 E-value=8.9 Score=17.95 Aligned_cols=24 Identities=17% Similarity=0.235 Sum_probs=13.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 999999999999999999998442 Q gi|254780799|r 25 KMKIVAGLILLCTVFAITLALGTW 48 (806) Q Consensus 25 rl~Ei~Gl~Li~~al~l~iSL~SY 48 (806) |-+|+...++++.+++++.++.++ T Consensus 24 rSkEl~~a~~ll~g~l~l~~~~~~ 47 (367) T PRK08156 24 KSKDLTTAVVILGGIAYLVSFGSL 47 (367) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 748799999999999999999999 No 302 >pfam09012 FeoC FeoC like transcriptional regulator. This family contains several transcriptional regulators, including FeoC, which contain a HTH motif. FeoC acts as a [Fe-S] dependant transcriptional repressor. Probab=67.36 E-value=9 Score=17.93 Aligned_cols=49 Identities=12% Similarity=0.213 Sum_probs=41.6 Q ss_pred HHHHHHHHHHCCCEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHCC Q ss_conf 8999999996598500014222001177899999999977986802278 Q gi|254780799|r 743 YKQAVDIVLRDNKASISYIQRRLGIGYNRAASIIENMEEKGVIGPASST 791 (806) Q Consensus 743 ~~~a~~~v~~~~~~s~s~lqr~~~igy~raar~~~~~e~~giv~~~~g~ 791 (806) +.|--+++...+++|.+-|.++|++-=.----++|+++.+|-|..-+.+ T Consensus 2 L~~lr~~l~~~g~~s~~~Ls~~f~~~~~~v~~ML~~wi~kGkv~k~~~~ 50 (68) T pfam09012 2 LTELKQYLQERGRASLAELARHFKMSPDAVEAMLEVWIRKGKVRKLEDG 50 (68) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECCC T ss_conf 2899999999198479999989793999999999999987975872489 No 303 >PRK13641 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=67.29 E-value=4.3 Score=20.30 Aligned_cols=206 Identities=19% Similarity=0.286 Sum_probs=98.9 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHH----------HHHHCCCC----H--HHHCC- Q ss_conf 5410020235530477406799999999999829957847888523100----------11102770----3--43122- Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKML----------ELSVYDGI----P--NLLTP- 511 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~v----------Els~Y~~i----P--HLl~p- 511 (806) .++.+==-+-|-|.+|||||..++.| ..|+ +|..=+ |.||=+-+ ++.-+=|+ | .|... T Consensus 28 l~I~~Ge~~~iiG~nGsGKSTLl~~l-~Gll---~P~sG~-V~i~G~~i~~~~~~~~~~~~r~~vg~vfQ~p~~ql~~~t 102 (286) T PRK13641 28 FELEDGSFVALIGHTGSGKSTLMQHF-NALL---KPSSGK-ITIAGYHITPETSNKNLKDLRKKVGLVFQFPEAQLFENT 102 (286) T ss_pred EEECCCCEEEEECCCCCHHHHHHHHH-HCCC---CCCCEE-EEECCEECCCCCCHHHHHHHHHHEEEEEECCCCCCCCCH T ss_conf 79869999999999983999999999-6598---988549-999989997666555799998515489766510126112 Q ss_pred CCCCHHHHHHHHHHHHHHHHHH-HHHHHHCCCC-CHHH--------H-HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 3343045668999999999999-9999870899-6899--------9-99999887447866775446776545432223 Q gi|254780799|r 512 VVTNPQKAVTVLKWLVCEMEER-YQKMSKIGVR-NIDG--------F-NLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYE 580 (806) Q Consensus 512 VvTd~~kA~~aL~w~V~EMe~R-Y~l~a~~~vR-ni~~--------y-N~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 580 (806) |..|-.-+...+.|--+|+++| -+.+...|-. ++.. = -+|+.-|.. T Consensus 103 V~eev~~g~~~~g~~~~e~~~~~~~~l~~vgl~~~~~~~~p~~LSGGqkqRvaiA~a----------------------- 159 (286) T PRK13641 103 VLKDVEFGPKNFGFSEQEAKEKALKWLKKVGLSEDLISKSPFELSGGQMRRVAIAGV----------------------- 159 (286) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHCCHHHCCHHHHHHHHHHHH----------------------- T ss_conf 999999999986999899999999999976996455542913299999999999999----------------------- Q ss_pred CCCCCCCCCCEEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCC Q ss_conf 3223234698687763446-888873210058899999986641423799996577753554355411025158764586 Q gi|254780799|r 581 TEHFDFQHMPYIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSK 659 (806) Q Consensus 581 ~~~~~~~~lp~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~ 659 (806) +-.=|- |++-||= +.|=-...+++-+.|.+|.+ .|.-.|++|..-.. =+.+-.|+.. - T Consensus 160 -----La~~P~-iLlLDEPTsgLDp~~~~~i~~ll~~l~~----~G~Tii~vtHd~~~-------v~~~adrv~v----m 218 (286) T PRK13641 160 -----MAYEPE-ILCLDEPAAGLDPEGRKEMMQIFKDYQK----AGHTVILVTHNMDD-------VAEYADDVLV----L 218 (286) T ss_pred -----HHCCCC-EEEECCCCCCCCHHHHHHHHHHHHHHHH----CCCEEEEECCCHHH-------HHHHCCEEEE----E T ss_conf -----974999-9997397343899999999999999996----39999999159999-------9997999999----9 Q ss_pred CCCHHHCCCCCHHHHCCCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHCCCC Q ss_conf 64212338864578658875477368983258883348988999999999712897 Q gi|254780799|r 660 IDSRTILGEQGAEQLLGQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKTQGEA 715 (806) Q Consensus 660 ~dSrtild~~gae~Llg~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q~~~ 715 (806) -+-|.|.| .-.+.++.+ |..++.+++.-.++.+++..+++.|-+ T Consensus 219 ~~G~iv~~-G~p~evf~~-----------~~~l~~~~l~~P~~~~~~~~L~~~g~~ 262 (286) T PRK13641 219 EHGKLIKH-ASPKEIFSD-----------SEWLKKHYLDEPATSRFASKLEKGGFK 262 (286) T ss_pred ECCEEEEE-CCHHHHHCC-----------HHHHHHCCCCCCHHHHHHHHHHHCCCC T ss_conf 89999997-799999659-----------999997799999399999999975999 No 304 >COG3432 Predicted transcriptional regulator [Transcription] Probab=67.27 E-value=9 Score=17.92 Aligned_cols=64 Identities=17% Similarity=0.241 Sum_probs=48.9 Q ss_pred CCHHHHHHHHHHHHCCCEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHCCCCCEEEECHHHH Q ss_conf 77038999999996598500014222001177899999999977986802278872673171253 Q gi|254780799|r 739 ADDLYKQAVDIVLRDNKASISYIQRRLGIGYNRAASIIENMEEKGVIGPASSTGKREILISSMEE 803 (806) Q Consensus 739 ~d~l~~~a~~~v~~~~~~s~s~lqr~~~igy~raar~~~~~e~~giv~~~~g~~~r~vl~~~~~~ 803 (806) ..+..-+.+++ ...+..-+.-|=..-.+-|.+|.++||.|++.|.+=..++.+.+.+-+++... T Consensus 16 R~eIi~dIL~~-~~~~~~~~Tri~y~aNlny~~~~~yi~~L~~~Gli~~~~~~~~~~y~lT~KG~ 79 (95) T COG3432 16 RLEIIFDILKA-ISEGGIGITRIIYGANLNYKRAQKYIEMLVEKGLIIKQDNGRRKVYELTEKGK 79 (95) T ss_pred HHHHHHHHHHH-HCCCCCCCEEEEEECCCCHHHHHHHHHHHHHCCCEEECCCCCCCEEEECHHHH T ss_conf 99999999998-41799986246643276779999999999867877861488644589885689 No 305 >cd01871 Rac1_like Rac1-like subfamily. The Rac1-like subfamily consists of Rac1, Rac2, and Rac3 proteins, plus the splice variant Rac1b that contains a 19-residue insertion near switch II relative to Rac1. While Rac1 is ubiquitously expressed, Rac2 and Rac3 are largely restricted to hematopoietic and neural tissues respectively. Rac1 stimulates the formation of actin lamellipodia and membrane ruffles. It also plays a role in cell-matrix adhesion and cell anoikis. In intestinal epithelial cells, Rac1 is an important regulator of migration and mediates apoptosis. Rac1 is also essential for RhoA-regulated actin stress fiber and focal adhesion complex formation. In leukocytes, Rac1 and Rac2 have distinct roles in regulating cell morphology, migration, and invasion, but are not essential for macrophage migration or chemotaxis. Rac3 has biochemical properties that are closely related to Rac1, such as effector interaction, nucleotide binding, and hydrolysis; Rac2 has a slower nucleoti Probab=67.27 E-value=6.2 Score=19.14 Aligned_cols=43 Identities=23% Similarity=0.458 Sum_probs=31.2 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHH---------EEEEEECCCHHHHHHCC Q ss_conf 355304774067999999999998299578---------47888523100111027 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQ---------CRLIMIDPKMLELSVYD 503 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~e---------vkliliDPK~vEls~Y~ 503 (806) +++-|..|.|||-.+ .+++.+.-|++ .+-+.+|.|.+.|.+++ T Consensus 4 ivlvGd~~VGKTsli----~r~~~~~f~~~~~~Ti~~~~~~~i~~~~~~v~l~iwD 55 (174) T cd01871 4 CVVVGDGAVGKTCLL----ISYTTNAFPGEYIPTVFDNYSANVMVDGKPVNLGLWD 55 (174) T ss_pred EEEECCCCCCHHHHH----HHHHHCCCCCCCCCEEEEEEEEEEEECCEEEEEEEEC T ss_conf 999899998699999----9997399999868837887679999999999999986 No 306 >PRK07261 topology modulation protein; Provisional Probab=67.19 E-value=2.8 Score=21.75 Aligned_cols=14 Identities=50% Similarity=0.729 Sum_probs=10.7 Q ss_pred EEEEECCCCHHHHH Q ss_conf 35530477406799 Q gi|254780799|r 457 LLIAGTTGSGKSVA 470 (806) Q Consensus 457 LLIAGtTGSGKSV~ 470 (806) .+|-|+.|||||-. T Consensus 3 I~IiG~sGsGKSTl 16 (171) T PRK07261 3 IAIIGYSGSGKSTL 16 (171) T ss_pred EEEECCCCCCHHHH T ss_conf 99988999868999 No 307 >COG1703 ArgK Putative periplasmic protein kinase ArgK and related GTPases of G3E family [Amino acid transport and metabolism] Probab=67.09 E-value=6.5 Score=18.98 Aligned_cols=42 Identities=31% Similarity=0.529 Sum_probs=32.1 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCC Q ss_conf 355304774067999999999998299578478885231001110277 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDG 504 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~ 504 (806) +=|-|+-|+|||-.|-.+++-+ +..=.-|--|=|||- |.|.| T Consensus 54 iGITG~PGaGKSTli~~L~~~l--~~~G~rVaVlAVDPS----Sp~TG 95 (323) T COG1703 54 IGITGVPGAGKSTLIEALGREL--RERGHRVAVLAVDPS----SPFTG 95 (323) T ss_pred EEECCCCCCCHHHHHHHHHHHH--HHCCCEEEEEEECCC----CCCCC T ss_conf 8731799886688999999999--977967899998899----99878 No 308 >PRK04213 GTP-binding protein; Provisional Probab=66.99 E-value=5.2 Score=19.71 Aligned_cols=23 Identities=30% Similarity=0.577 Sum_probs=21.6 Q ss_pred CCEEEEEECCCCHHHHHHHHHHH Q ss_conf 20235530477406799999999 Q gi|254780799|r 454 MPHLLIAGTTGSGKSVAINTMIL 476 (806) Q Consensus 454 MPHLLIAGtTGSGKSV~iN~iI~ 476 (806) ||..-+.|-+-.|||--+|+|+- T Consensus 1 ~P~VaivGRpNVGKSTL~N~L~g 23 (195) T PRK04213 1 MPEIIFVGRSNVGKSTLIRALTG 23 (195) T ss_pred CCEEEEECCCCCCHHHHHHHHHC T ss_conf 98799976999889999999968 No 309 >smart00489 DEXDc3 DEAD-like helicases superfamily. Probab=66.90 E-value=3.6 Score=20.92 Aligned_cols=24 Identities=21% Similarity=0.418 Sum_probs=17.2 Q ss_pred CEEEEEECCCCHHHHHHHHHHHHH Q ss_conf 023553047740679999999999 Q gi|254780799|r 455 PHLLIAGTTGSGKSVAINTMILSL 478 (806) Q Consensus 455 PHLLIAGtTGSGKSV~iN~iI~Sl 478 (806) -|+|+-.-||+|||.++=+-.++- T Consensus 28 ~~~llEaPTGtGKTlalL~~al~~ 51 (289) T smart00489 28 KIGILESPTGTGKTLSLLCLTLTW 51 (289) T ss_pred CEEEEECCCCHHHHHHHHHHHHHH T ss_conf 979998999651899999999999 No 310 >smart00488 DEXDc2 DEAD-like helicases superfamily. Probab=66.90 E-value=3.6 Score=20.92 Aligned_cols=24 Identities=21% Similarity=0.418 Sum_probs=17.2 Q ss_pred CEEEEEECCCCHHHHHHHHHHHHH Q ss_conf 023553047740679999999999 Q gi|254780799|r 455 PHLLIAGTTGSGKSVAINTMILSL 478 (806) Q Consensus 455 PHLLIAGtTGSGKSV~iN~iI~Sl 478 (806) -|+|+-.-||+|||.++=+-.++- T Consensus 28 ~~~llEaPTGtGKTlalL~~al~~ 51 (289) T smart00488 28 KIGILESPTGTGKTLSLLCLTLTW 51 (289) T ss_pred CEEEEECCCCHHHHHHHHHHHHHH T ss_conf 979998999651899999999999 No 311 >PRK13549 xylose transporter ATP-binding subunit; Provisional Probab=66.73 E-value=5.7 Score=19.42 Aligned_cols=43 Identities=21% Similarity=0.427 Sum_probs=25.8 Q ss_pred CCEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 9868776344-68888732100588999999866414237999965777 Q gi|254780799|r 589 MPYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 589 lp~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) =|- |+|.|| .+-|=..+..+|-+.|.+|++ .|+-.|+.|.--. T Consensus 423 ~P~-iLilDEPT~GLD~~~~~~i~~ll~~l~~----~G~tvl~iSHDl~ 466 (513) T PRK13549 423 NPR-ILILDEPTRGIDVGAKYEIYKLINQLVQ----QGVAIIVISSELP 466 (513) T ss_pred CCC-EEEEECCCCCCCHHHHHHHHHHHHHHHH----CCCEEEEEECCHH T ss_conf 998-9999798668999999999999999995----7999999918689 No 312 >PRK12468 flhB flagellar biosynthesis protein FlhB; Reviewed Probab=66.61 E-value=9.3 Score=17.83 Aligned_cols=23 Identities=13% Similarity=0.206 Sum_probs=11.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 99999999999999999998442 Q gi|254780799|r 26 MKIVAGLILLCTVFAITLALGTW 48 (806) Q Consensus 26 l~Ei~Gl~Li~~al~l~iSL~SY 48 (806) -+|+...+.++.+++++..+.++ T Consensus 30 S~El~~~~~L~~~~~~l~~~~~~ 52 (383) T PRK12468 30 SRELTSMLMLGAGLTILWMSGES 52 (383) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 77599999999999999999999 No 313 >COG1205 Distinct helicase family with a unique C-terminal domain including a metal-binding cysteine cluster [General function prediction only] Probab=66.52 E-value=9.3 Score=17.81 Aligned_cols=39 Identities=18% Similarity=0.243 Sum_probs=17.3 Q ss_pred HHHHHHHHHHHHCC----CEEEEHHHHHHCCCHH-----HHHHHHHHHH Q ss_conf 03899999999659----8500014222001177-----8999999999 Q gi|254780799|r 741 DLYKQAVDIVLRDN----KASISYIQRRLGIGYN-----RAASIIENME 780 (806) Q Consensus 741 ~l~~~a~~~v~~~~----~~s~s~lqr~~~igy~-----raar~~~~~e 780 (806) +++..+.++|...- --| +.-++|...|++ -|.++.+.+= T Consensus 742 ~~~~~s~~~i~~c~c~~GcP~-cv~s~~cg~~n~~Ldk~~a~~ll~~ll 789 (851) T COG1205 742 DLLEASLELIESCPCPDGCPS-CVTSPKCGNGNLPLDKSAALSLLSELL 789 (851) T ss_pred HHHHHHHHHHHCCCCCCCCCC-CCCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 999999999864998789953-105641268975656999999999986 No 314 >cd00267 ABC_ATPase ABC (ATP-binding cassette) transporter nucleotide-binding domain; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide-binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=66.49 E-value=6.1 Score=19.17 Aligned_cols=40 Identities=28% Similarity=0.456 Sum_probs=27.5 Q ss_pred EEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 7763446-8888732100588999999866414237999965777 Q gi|254780799|r 593 VVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 593 vviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) +++.||- +-|=..+..++.+.|.++++. |.-+|+.|..+. T Consensus 101 ililDEPtsgLD~~~~~~l~~~i~~l~~~----g~tii~vtH~~~ 141 (157) T cd00267 101 LLLLDEPTSGLDPASRERLLELLRELAEE----GRTVIIVTHDPE 141 (157) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHHHC----CCEEEEEECCHH T ss_conf 99996987668999999999999999968----999999908999 No 315 >pfam01978 TrmB Sugar-specific transcriptional regulator TrmB. One member of this family, TrmB, has been shown to be a sugar-specific transcriptional regulator of the trehalose/maltose ABC transporter in Thermococcus litoralis. Probab=66.37 E-value=9.4 Score=17.80 Aligned_cols=50 Identities=14% Similarity=0.272 Sum_probs=41.7 Q ss_pred HHHHHHHHCCCEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHCCCCCE Q ss_conf 999999965985000142220011778999999999779868022788726 Q gi|254780799|r 745 QAVDIVLRDNKASISYIQRRLGIGYNRAASIIENMEEKGVIGPASSTGKRE 795 (806) Q Consensus 745 ~a~~~v~~~~~~s~s~lqr~~~igy~raar~~~~~e~~giv~~~~g~~~r~ 795 (806) ++-...++.+.++.+-|-++++|-..++-+.++.|+++|+|--.. ++|+. T Consensus 12 ~vY~~Ll~~g~~t~~eia~~~~i~r~~vY~~L~~L~~kGlv~~~~-~~p~~ 61 (68) T pfam01978 12 KVYLALLKLGPATADEIAEESGVPRSKVYEVLRSLEKKGLVEREK-GRPKK 61 (68) T ss_pred HHHHHHHHCCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEC-CCCCE T ss_conf 999999981998899999995988989999999999889889867-97447 No 316 >pfam03193 DUF258 Protein of unknown function, DUF258. Probab=66.37 E-value=3.7 Score=20.81 Aligned_cols=89 Identities=21% Similarity=0.243 Sum_probs=44.5 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHH------------CCHHHEEEEEEC--------CCHHHHHHCCCCHHHHCCCCCCHH Q ss_conf 553047740679999999999982------------995784788852--------310011102770343122334304 Q gi|254780799|r 458 LIAGTTGSGKSVAINTMILSLLYR------------MTPAQCRLIMID--------PKMLELSVYDGIPNLLTPVVTNPQ 517 (806) Q Consensus 458 LIAGtTGSGKSV~iN~iI~SlLyk------------~~P~evkliliD--------PK~vEls~Y~~iPHLl~pVvTd~~ 517 (806) .++|..|.|||--||+++-.+-.+ +|--..+|+.++ |=+=||.+.+- T Consensus 39 v~~G~SGVGKSTLiN~L~~~~~~~t~~iS~~~~rGkHTTt~~~l~~l~~gg~iiDTPG~r~f~l~~~------------- 105 (161) T pfam03193 39 VLAGQSGVGKSTLLNALLPELDLRTGEISEKLGRGRHTTTHVELFPLPGGGLLIDTPGFRELGLWHL------------- 105 (161) T ss_pred EEECCCCCCHHHHHHHHCCHHHHHHHHHHHHHCCCCCEEEEEEEEEECCCCEEEECCCCCCCCCCCC------------- T ss_conf 9988999889999988563443577753588638851121137998079958985898762577778------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCC Q ss_conf 5668999999999999999987089968999999998874478 Q gi|254780799|r 518 KAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGK 560 (806) Q Consensus 518 kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~ 560 (806) ....|..+-.||+.--.--.-..++++.+=+=.|++|-++|+ T Consensus 106 -~~~~l~~~F~e~~~~~~~CkF~dC~H~~EPgCaV~~ave~g~ 147 (161) T pfam03193 106 -DPEELAEYFPEFRELAGQCKFRDCLHTNEPGCAVKAAVENGE 147 (161) T ss_pred -CHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHCCC T ss_conf -989999970999998196988699589998858999998599 No 317 >PRK09361 radB DNA repair and recombination protein RadB; Provisional Probab=66.31 E-value=9.4 Score=17.79 Aligned_cols=62 Identities=16% Similarity=0.321 Sum_probs=34.4 Q ss_pred EEEHHHHHHHH--HHC-------CCHHHHHHHHHHHHHHHCCEEEEEEEC-CCCCCCCCHHHHHCCCCEEEEEE Q ss_conf 77634468888--732-------100588999999866414237999965-77753554355411025158764 Q gi|254780799|r 593 VVVIDEMADLM--MVA-------RKDIESAVQRLAQMARASGIHVIMATQ-RPSVDVITGTIKANFPTRISFQV 656 (806) Q Consensus 593 vviiDElaDlm--m~~-------~~~ve~~i~rlaq~ara~GiHli~aTq-rPsvdvitg~ikan~p~riaf~v 656 (806) +||||-++.|. -.. .+.+-..+..|.+.|+.-++-+|+--| |-.+++ +-++.=-|.-++|-| T Consensus 110 lvVIDSi~~~~~~e~~~~~~~~~~~~l~~~~~~L~~~a~~~~~~vvl~nqV~~~~~~--~~~~paGg~~l~h~~ 181 (224) T PRK09361 110 LIVLDSATSLYRLELDDNDNSKLNRELGKQISHLLSLARKHNIAVVITNQVYSDIDS--DGLRPLGGHTLEHWS 181 (224) T ss_pred EEEEECCHHHEEHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCC--CCCCCCCCCCEEEEE T ss_conf 899962301000001457658999999999999999999719869999668852676--544446541222105 No 318 >PRK13635 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=66.15 E-value=5.1 Score=19.76 Aligned_cols=99 Identities=25% Similarity=0.303 Sum_probs=60.1 Q ss_pred CEEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCC Q ss_conf 8687763446-888873210058899999986641423799996577753554355411025158764586642123388 Q gi|254780799|r 590 PYIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGE 668 (806) Q Consensus 590 p~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~ 668 (806) |-| +|-||= +-|=-.+..+|-+.|.+| .+.-|+-.|+.|.+-+. + ++ -.||.+ .|---|..+ T Consensus 159 P~i-LilDEPTs~LD~~~~~~i~~~l~~L---~~~~g~TvI~itHdl~~------~-~~-aDRiiv-----m~~G~Iv~~ 221 (279) T PRK13635 159 PDI-LILDEATSMLDPQGRREVLETVRQL---KEQKGITVLSITHDLDE------A-AQ-ADRVIV-----MNKGEILEE 221 (279) T ss_pred CCE-EEECCCCCCCCHHHHHHHHHHHHHH---HHCCCCEEEEEEECHHH------H-HC-CCEEEE-----EECCEEEEE T ss_conf 998-9973874548989999999999999---98379899999767899------9-63-998999-----989999998 Q ss_pred CCHHHHCCCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHCCCCC Q ss_conf 645786588754773689832588833489889999999997128974 Q gi|254780799|r 669 QGAEQLLGQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKTQGEAK 716 (806) Q Consensus 669 ~gae~Llg~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q~~~~ 716 (806) +-.|.++.+.+.|.. +.+.-..+-++.+.++.+|-+. T Consensus 222 Gtp~elf~~~~~l~~-----------~~l~~P~~~~l~~~l~~~g~~~ 258 (279) T PRK13635 222 GTPEEIFKSGHMLQE-----------IGLDVPFSVKLKELLKRNGILL 258 (279) T ss_pred CCHHHHHCCHHHHHH-----------CCCCCCHHHHHHHHHHHCCCCC T ss_conf 699999779889997-----------7999994999999999759999 No 319 >COG1136 SalX ABC-type antimicrobial peptide transport system, ATPase component [Defense mechanisms] Probab=65.92 E-value=5.2 Score=19.72 Aligned_cols=150 Identities=21% Similarity=0.307 Sum_probs=78.1 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHH---HHH------------HCCCCHHHHCCCC Q ss_conf 5410020235530477406799999999999829957847888523100---111------------0277034312233 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKML---ELS------------VYDGIPNLLTPVV 513 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~v---Els------------~Y~~iPHLl~pVv 513 (806) ..+.+==.+-|-|..|||||-.+| |++.|-+.+.-+|.+-=.|-..+ +++ -||=||+| T Consensus 26 l~i~~Ge~vaI~GpSGSGKSTLLn--iig~ld~pt~G~v~i~g~~~~~l~~~~~~~~R~~~iGfvFQ~~nLl~~l----- 98 (226) T COG1136 26 LEIEAGEFVAIVGPSGSGKSTLLN--LLGGLDKPTSGEVLINGKDLTKLSEKELAKLRRKKIGFVFQNFNLLPDL----- 98 (226) T ss_pred EEECCCCEEEEECCCCCCHHHHHH--HHHCCCCCCCCEEEECCEECCCCCHHHHHHHHHHHEEEECCCCCCCCCC----- T ss_conf 887499899998999998999999--9964667888469999888675898899999777489987567778988----- Q ss_pred CCHHHHHHHHHHH--HH-HHHHHHHHHHHC-CCC---------CHHHH-HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 4304566899999--99-999999999870-899---------68999-9999988744786677544677654543222 Q gi|254780799|r 514 TNPQKAVTVLKWL--VC-EMEERYQKMSKI-GVR---------NIDGF-NLKVAQYHNTGKKFNRTVQTGFDRKTGEAIY 579 (806) Q Consensus 514 Td~~kA~~aL~w~--V~-EMe~RY~l~a~~-~vR---------ni~~y-N~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 579 (806) |=-....-.|... -. ++++|.+.+.+. |-- .+.|= -++|.-|++- T Consensus 99 tv~ENv~lpl~~~~~~~~~~~~~~~~l~~~lgl~~~~~~~~p~~LSGGqqQRVAIARAL--------------------- 157 (226) T COG1136 99 TVLENVELPLLIAGKSAGRRKRAAEELLEVLGLEDRLLKKKPSELSGGQQQRVAIARAL--------------------- 157 (226) T ss_pred CHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHCCCCCHHCCHHHHHHHHHHHHH--------------------- T ss_conf 89999986998747873679999999998669812323588122697999999999998--------------------- Q ss_pred CCCCCCCCCCCEEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 332232346986877634468888732100588999999866414237999965777 Q gi|254780799|r 580 ETEHFDFQHMPYIVVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 580 ~~~~~~~~~lp~ivviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) -.=|-|++--..-+-|=.-.+++|.+.+ .+.++-.|.-+|++|--|. T Consensus 158 -------~~~P~iilADEPTgnLD~~t~~~V~~ll---~~~~~~~g~tii~VTHD~~ 204 (226) T COG1136 158 -------INNPKIILADEPTGNLDSKTAKEVLELL---RELNKERGKTIIMVTHDPE 204 (226) T ss_pred -------HCCCCEEEEECCCCCCCHHHHHHHHHHH---HHHHHHCCCEEEEECCCHH T ss_conf -------2499869960766658867899999999---9998746989999908989 No 320 >COG0469 PykF Pyruvate kinase [Carbohydrate transport and metabolism] Probab=65.91 E-value=4.3 Score=20.30 Aligned_cols=116 Identities=28% Similarity=0.308 Sum_probs=71.6 Q ss_pred HHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH-HHCCCC--CCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 312233430456689999999999999999870899689999999988-744786--67754467765454322233223 Q gi|254780799|r 508 LLTPVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQY-HNTGKK--FNRTVQTGFDRKTGEAIYETEHF 584 (806) Q Consensus 508 Ll~pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~-~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 584 (806) |-.|-+|+..+ ..|++.++. --+.+|..=|||-+.-++--+.. ...|.. +--. .| .+. T Consensus 168 l~~palteKD~--~dl~f~~~~---gvD~vA~SFVr~~~Dv~~~R~~l~~~~~~~~~iiaK-------------IE-~~e 228 (477) T COG0469 168 LSLPALTEKDK--EDLKFGLEQ---GVDFVALSFVRNAEDVEEVREILAETGGRDVKIIAK-------------IE-NQE 228 (477) T ss_pred CCCCCCCCCCH--HHHHHHHHC---CCCEEEEECCCCHHHHHHHHHHHHHHCCCCCEEEEE-------------EC-CHH T ss_conf 79887883478--889979865---998899966688889999999999837987249995-------------05-877 Q ss_pred CCCCCCEEEEEHHH---HHHHHHHCCCH--HHH-------HHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEE Q ss_conf 23469868776344---68888732100--588-------9999998664142379999657775355435541102515 Q gi|254780799|r 585 DFQHMPYIVVVIDE---MADLMMVARKD--IES-------AVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRI 652 (806) Q Consensus 585 ~~~~lp~ivviiDE---laDlmm~~~~~--ve~-------~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~ri 652 (806) ..+. +|| .+|=.|+|.-| ||- .=-|+-+++|-+|.-.|.|||=+ .-+|+.-.|+|- T Consensus 229 av~N-------ldeIi~~SDGIMVARGDLGVEip~e~Vp~~QK~iI~~~~~~gkpVItATQML-----eSMi~np~PTRA 296 (477) T COG0469 229 AVDN-------LDEIIEASDGIMVARGDLGVEIPLEEVPIIQKRIIRKARRAGKPVITATQML-----ESMIENPRPTRA 296 (477) T ss_pred HHHH-------HHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCH-----HHHHHCCCCCCH T ss_conf 7757-------9999986383299865650534778736999999999987599609970638-----888408998710 Q ss_pred EE Q ss_conf 87 Q gi|254780799|r 653 SF 654 (806) Q Consensus 653 af 654 (806) =+ T Consensus 297 Ev 298 (477) T COG0469 297 EV 298 (477) T ss_pred HH T ss_conf 05 No 321 >PRK13633 cobalt transporter ATP-binding subunit; Provisional Probab=65.59 E-value=5 Score=19.85 Aligned_cols=215 Identities=19% Similarity=0.294 Sum_probs=97.2 Q ss_pred EEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEE--ECCCHH----HHHHCCCC----H--HHHCCCCC Q ss_conf 785410020235530477406799999999999829957847888--523100----11102770----3--43122334 Q gi|254780799|r 447 IIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIM--IDPKML----ELSVYDGI----P--NLLTPVVT 514 (806) Q Consensus 447 vv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evklil--iDPK~v----Els~Y~~i----P--HLl~pVvT 514 (806) +-.++.+==-+-|-|..|||||..++.|- - |+ .|.+=+... .|.+.. ++.-.-|+ | +|....|- T Consensus 30 Isl~i~~GE~v~iiG~nGsGKSTL~r~l~-g-l~--~P~~G~I~i~G~~~~~~~~~~~~r~~ig~vfQ~P~~~l~~~tV~ 105 (281) T PRK13633 30 VNLEVKKGEFLVILGHNGSGKSTIAKHMN-A-LL--LPSEGKVYVDGLDTSDEENLWDIRNKAGMVFQNPDNQIVATIVE 105 (281) T ss_pred EEEEECCCCEEEEECCCCCHHHHHHHHHH-C-CC--CCCCCEEEECCEECCCHHHHHHHHHCEEEEEECCCCCCCHHHHH T ss_conf 07688799899999999984999999997-5-88--78885699999987885669998736089866886420288999 Q ss_pred CHHHHHHHHH-HH--HHHHHHHH-HHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 3045668999-99--99999999-99987089968999999998874478667754467765454322233223234698 Q gi|254780799|r 515 NPQKAVTVLK-WL--VCEMEERY-QKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMP 590 (806) Q Consensus 515 d~~kA~~aL~-w~--V~EMe~RY-~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp 590 (806) ...+.+++ +- -.|+++|- +.+...|..+....+ ...-..|++. ++ .-.-.+-.=| T Consensus 106 --e~i~fg~~~~g~~~~e~~~rv~~~l~~~gL~~~~~~~---p~~LSGGqkQ--Rv--------------aiA~aLa~~P 164 (281) T PRK13633 106 --EDVAFGPENLGIPPEEIRERVDESLKKVGMYEYRRHA---PHLLSGGQKQ--RV--------------AIAGILAMRP 164 (281) T ss_pred --HHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCC---CHHCCHHHHH--HH--------------HHHHHHHCCC T ss_conf --9999889886999999999999999867948766389---1008985999--99--------------9999998599 Q ss_pred EEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCC Q ss_conf 687763446-8888732100588999999866414237999965777535543554110251587645866421233886 Q gi|254780799|r 591 YIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQ 669 (806) Q Consensus 591 ~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~ 669 (806) - |++-||= +-|=..+.+++-..+.+|. +--|+-.|+.|...+. . +. -.||.+- -|-| |.-++ T Consensus 165 ~-iLilDEPTs~LDp~~~~~i~~~l~~l~---~e~g~Tii~vTHdl~~------~-~~-aDrv~vm----~~G~-Iv~~G 227 (281) T PRK13633 165 E-CIIFDEPTAMLDPSGRREVVNTIKELN---KKYGITIILITHYMEE------A-VE-ADRIIVM----DKGK-VVMEG 227 (281) T ss_pred C-EEEEECCCCCCCHHHHHHHHHHHHHHH---HHCCCEEEEECCCHHH------H-HC-CCEEEEE----ECCE-EEEEC T ss_conf 9-999818734389899999999999999---8409899998678899------9-73-9989999----8999-99977 Q ss_pred CHHHHCCCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHCCC Q ss_conf 457865887547736898325888334898899999999971289 Q gi|254780799|r 670 GAEQLLGQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKTQGE 714 (806) Q Consensus 670 gae~Llg~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q~~ 714 (806) -.+.++.+-+.| +-+-+.-.++-++..-++..|- T Consensus 228 ~p~evf~~~~~L-----------~~~~l~~P~~~~l~~~L~~~g~ 261 (281) T PRK13633 228 TPKEIFKEVEMM-----------KKIGLDVPQVTELAYELRKEGV 261 (281) T ss_pred CHHHHHCCHHHH-----------HHCCCCCCCHHHHHHHHHHCCC T ss_conf 999997698899-----------9779999919999999997499 No 322 >cd03229 ABC_Class3 This class is comprised of all BPD (Binding Protein Dependent) systems that are largely represented in archaea and eubacteria and are primarily involved in scavenging solutes from the environment. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=65.52 E-value=5 Score=19.83 Aligned_cols=27 Identities=22% Similarity=0.388 Sum_probs=19.9 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHH Q ss_conf 541002023553047740679999999 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMI 475 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI 475 (806) .++.+==-+-|-|..|||||..++.|- T Consensus 21 ~~i~~Ge~~~ivGpSG~GKSTllr~i~ 47 (178) T cd03229 21 LNIEAGEIVALLGPSGSGKSTLLRCIA 47 (178) T ss_pred EEECCCCEEEEECCCCCHHHHHHHHHH T ss_conf 698899899999999983999999998 No 323 >COG0593 DnaA ATPase involved in DNA replication initiation [DNA replication, recombination, and repair] Probab=65.38 E-value=9.8 Score=17.66 Aligned_cols=49 Identities=22% Similarity=0.190 Sum_probs=36.8 Q ss_pred CCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHC Q ss_conf 22100021045666666785410020235530477406799999999999829 Q gi|254780799|r 430 NQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRM 482 (806) Q Consensus 430 s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~ 482 (806) +...+..++.+.++-.|-- .-+-|+|.|-+|+|||=-++++---.+-.. T Consensus 93 ~~N~~A~aa~~~va~~~g~----~~nplfi~G~~GlGKTHLl~Aign~~~~~~ 141 (408) T COG0593 93 PSNRLAYAAAKAVAENPGG----AYNPLFIYGGVGLGKTHLLQAIGNEALANG 141 (408) T ss_pred CCHHHHHHHHHHHHHCCCC----CCCCEEEECCCCCCHHHHHHHHHHHHHHHC T ss_conf 7449999999998756688----689579987999978999999999998629 No 324 >cd00286 Tubulin_FtsZ Tubulin/FtsZ: Family includes tubulin alpha-, beta-, gamma-, delta-, and epsilon-tubulins as well as FtsZ, all of which are involved in polymer formation. Tubulin is the major component of microtubules, but also exists as a heterodimer and as a curved oligomer. Microtubules exist in all eukaryotic cells and are responsible for many functions, including cellular transport, cell motility, and mitosis. FtsZ forms a ring-shaped septum at the site of bacterial cell division, which is required for constriction of cell membrane and cell envelope to yield two daughter cells. FtsZ can polymerize into tubes, sheets, and rings in vitro and is ubiquitous in eubacteria, archaea, and chloroplasts. Probab=65.34 E-value=5.1 Score=19.76 Aligned_cols=58 Identities=22% Similarity=0.398 Sum_probs=27.9 Q ss_pred EEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 5304774067999999999998299578478885231001110277034312233430456689999999999 Q gi|254780799|r 459 IAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEME 531 (806) Q Consensus 459 IAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe 531 (806) ++|-|||| +-+.|+..|-..-|+..++ ..++| |+--.-||++|-.|+-+|++..+.+| T Consensus 98 lgGGTGSG----~gs~l~e~L~~~y~~~~~~--------~~~v~---Ps~~~~~vvepYN~~Lsl~~L~e~~d 155 (328) T cd00286 98 LGGGTGSG----LGPVLAERLKDEYPKRLKI--------TFSIL---PGPDEGVIVRPYNSILTLHTLTEHSD 155 (328) T ss_pred CCCCCCCC----HHHHHHHHHHHHCCCCEEE--------EEEEC---CCCCCCCEEEECHHHHHHHHHHHHCC T ss_conf 48877675----8899999998762674053--------13751---69988623433187776899997486 No 325 >KOG2802 consensus Probab=65.33 E-value=7.3 Score=18.60 Aligned_cols=229 Identities=17% Similarity=0.219 Sum_probs=119.0 Q ss_pred EEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCC Q ss_conf 87342307213430221000210456666667854100202355304774067999999999998299578478885231 Q gi|254780799|r 416 VMLRDLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPK 495 (806) Q Consensus 416 V~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK 495 (806) +.|-|++.|-. +.-..+-+|||-.=| ++-|.+|- --|=+|==.+-++|.- |-..+.- T Consensus 234 smfAEaIHS~a--D~~NQ~lLa~Gis~S--------~q~PD~lh--PYGYsnmRyVsSLISg---------vGIfc~G-- 290 (503) T KOG2802 234 SMFAEAIHSLA--DTCNQLLLALGISKS--------VQTPDPLH--PYGYSNMRYVSSLISG---------VGIFCMG-- 290 (503) T ss_pred HHHHHHHHHHH--HHHHHHHHHHHHHHC--------CCCCCCCC--CCCCCCHHHHHHHHHC---------CCEEEEC-- T ss_conf 89999999999--999899999602430--------46999878--7765306679999732---------4426642-- Q ss_pred HHHHHHCCCCHHHHCC-CCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 0011102770343122-334304566899999999999999998708996899999999887447866775446776545 Q gi|254780799|r 496 MLELSVYDGIPNLLTP-VVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKT 574 (806) Q Consensus 496 ~vEls~Y~~iPHLl~p-VvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~ 574 (806) --||+|+||--||+| -++++.-|.-+|- | -|-.-|.-=+-.||+-.+.|+++|-.+.+-+|++-||.+ T Consensus 291 -~GlSiyhGv~gLlhpePi~~l~~ay~il~-----g-----Sl~~eGasllvAi~evkr~Ak~~gmSi~dYV~~~~DPs~ 359 (503) T KOG2802 291 -CGLSIYHGVMGLLHPEPIESLLWAYCILA-----G-----SLVSEGASLLVAINEVKRNAKAKGMSIYDYVMESRDPST 359 (503) T ss_pred -CCCHHHHCCCCCCCCCCCCCHHHHHHHHH-----H-----HHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCC T ss_conf -66043330023468999950689999996-----6-----788543399999999998788749879898760689862 Q ss_pred CCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEE Q ss_conf 43222332232346986877634468888732100588999999866414237999965777535543554110251587 Q gi|254780799|r 575 GEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISF 654 (806) Q Consensus 575 ~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf 654 (806) +||.-| | +.-|-..+.- ||-|-|.=-|--|=-|-+--+.--|+=.-+ T Consensus 360 ------------------nvVl~E--D-----tAAVtGv~IA------aa~m~lss~tgnPIyD~~GSivvGaLLGmV-- 406 (503) T KOG2802 360 ------------------NVVLLE--D-----TAAVTGVIIA------AACMGLSSITGNPIYDSLGSIVVGALLGMV-- 406 (503) T ss_pred ------------------EEEEEC--C-----HHHHHHHHHH------HHHHHHHHHCCCCCCCCCCHHHHHHHHHHH-- T ss_conf ------------------389943--4-----1788889999------998888872189786542037898888777-- Q ss_pred EECCCCCCHHHCCCCCHHHHCCCCCEEEECC-CCCEEEEEECCCCHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 6458664212338864578658875477368-98325888334898899999999971289742110 Q gi|254780799|r 655 QVSSKIDSRTILGEQGAEQLLGQGDMLYMTG-GGRVQRIHGPFVSDIEVEKVVSHLKTQGEAKYIDI 720 (806) Q Consensus 655 ~v~s~~dSrtild~~gae~Llg~gdml~~~~-~~~~~r~~g~~v~~~ev~~v~~~~~~q~~~~y~~~ 720 (806) .|+ |-|..+- -+|.|-.-|..- .=.-+-+---|..-+|-.+...-+...+.|+-.+. T Consensus 407 -----e~d--iyDvK~~--diG~g~vRfKAE~DFdGr~vtrsYL~kqd~akml~e~~~~~~~e~le~ 464 (503) T KOG2802 407 -----END--IYDVKAT--DIGLGKVRFKAEVDFDGRVVTRSYLEKQDFAKMLQEIQEVKTPEELET 464 (503) T ss_pred -----HHH--HHHCCCE--EECCCEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf -----776--5413223--404641688878526722168978728899999999982099899999 No 326 >COG4618 ArpD ABC-type protease/lipase transport system, ATPase and permease components [General function prediction only] Probab=65.21 E-value=7.3 Score=18.62 Aligned_cols=146 Identities=25% Similarity=0.330 Sum_probs=77.1 Q ss_pred CCCEEEEEECCCCHHHHHHHHHHHHHH-------------HHCCHHHE-EEEEECCCHHHHH---HCCCCHHHHCCCCCC Q ss_conf 020235530477406799999999999-------------82995784-7888523100111---027703431223343 Q gi|254780799|r 453 RMPHLLIAGTTGSGKSVAINTMILSLL-------------YRMTPAQC-RLIMIDPKMLELS---VYDGIPNLLTPVVTN 515 (806) Q Consensus 453 kMPHLLIAGtTGSGKSV~iN~iI~SlL-------------yk~~P~ev-kliliDPK~vEls---~Y~~iPHLl~pVvTd 515 (806) +=--|=|=|--|||||--.-.+.=+.+ +.-.|+++ |.|=-=|-.|||- +=+.|-..= + =-| T Consensus 361 ~G~~lgIIGPSgSGKSTLaR~lvG~w~p~~G~VRLDga~l~qWd~e~lG~hiGYLPQdVeLF~GTIaeNIaRf~-~-~~d 438 (580) T COG4618 361 AGEALGIIGPSGSGKSTLARLLVGIWPPTSGSVRLDGADLRQWDREQLGRHIGYLPQDVELFDGTIAENIARFG-E-EAD 438 (580) T ss_pred CCCEEEEECCCCCCHHHHHHHHHCCCCCCCCCEEECCHHHHCCCHHHHCCCCCCCCCCCEECCCCHHHHHHHCC-C-CCC T ss_conf 88667887888765778999998113567873775626451279888511317276543005771999987316-6-688 Q ss_pred HHHHHHHHHHH-HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC-----CCCCC----C Q ss_conf 04566899999-99999999999870899689999999988744786677544677654543222-----33223----2 Q gi|254780799|r 516 PQKAVTVLKWL-VCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIY-----ETEHF----D 585 (806) Q Consensus 516 ~~kA~~aL~w~-V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~----~ 585 (806) ++|.+.|=+-+ |.||=- ++.+|||...|+... +.+.+ . T Consensus 439 ~~kIieAA~lAgvHelIl--------------------------------~lP~GYdT~iG~~G~~LSgGQRQRIaLARA 486 (580) T COG4618 439 PEKVIEAARLAGVHELIL--------------------------------RLPQGYDTRIGEGGATLSGGQRQRIALARA 486 (580) T ss_pred HHHHHHHHHHCCHHHHHH--------------------------------HCCCCCCCCCCCCCCCCCCHHHHHHHHHHH T ss_conf 799999998737589997--------------------------------176876676578988777237899999999 Q ss_pred CCCCCEEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 346986877634468888732100588999999866414237999965777 Q gi|254780799|r 586 FQHMPYIVVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 586 ~~~lp~ivviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) +=-.|++||. ||=.-=. ..+=|....+=-+-+|+.|+-.|+.|+||| T Consensus 487 lYG~P~lvVL-DEPNsNL---D~~GE~AL~~Ai~~~k~rG~~vvviaHRPs 533 (580) T COG4618 487 LYGDPFLVVL-DEPNSNL---DSEGEAALAAAILAAKARGGTVVVIAHRPS 533 (580) T ss_pred HCCCCCEEEE-CCCCCCC---CHHHHHHHHHHHHHHHHCCCEEEEEECCHH T ss_conf 7089708995-5898776---526799999999999976987999926777 No 327 >pfam00580 UvrD-helicase UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Bacillus subtilis addA and Escherichia coli exodeoxyribonuclease V beta have large insertions near to the carboxy-terminus relative to other members of the family. Probab=65.09 E-value=9.9 Score=17.62 Aligned_cols=20 Identities=15% Similarity=0.355 Sum_probs=9.3 Q ss_pred CCCHHHHHHHHHHHHHCCCC Q ss_conf 48988999999999712897 Q gi|254780799|r 696 FVSDIEVEKVVSHLKTQGEA 715 (806) Q Consensus 696 ~v~~~ev~~v~~~~~~q~~~ 715 (806) |-...||..+..++|-=..| T Consensus 384 f~~~~ev~dlla~Lr~i~np 403 (494) T pfam00580 384 FFDRKEIKDLLALLRLIDNP 403 (494) T ss_pred CCCCHHHHHHHHHHHHHCCC T ss_conf 00189999999999987698 No 328 >cd03277 ABC_SMC5_euk Eukaryotic SMC5 proteins; SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (for Probab=65.09 E-value=5.9 Score=19.29 Aligned_cols=27 Identities=41% Similarity=0.613 Sum_probs=21.2 Q ss_pred CEE-EEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 023-553047740679999999999982 Q gi|254780799|r 455 PHL-LIAGTTGSGKSVAINTMILSLLYR 481 (806) Q Consensus 455 PHL-LIAGtTGSGKSV~iN~iI~SlLyk 481 (806) ||| +|.|.-|||||--+.+|.+.|=.+ T Consensus 23 p~lN~IiGpNGSGKSsIv~AI~lgLGG~ 50 (213) T cd03277 23 PSLNMIIGPNGSGKSSIVCAICLGLGGK 50 (213) T ss_pred CCEEEEECCCCCCHHHHHHHHHHHHCCC T ss_conf 9757998899887899999999881898 No 329 >PRK09493 glnQ glutamine ABC transporter ATP-binding protein; Reviewed Probab=65.03 E-value=6.3 Score=19.06 Aligned_cols=45 Identities=24% Similarity=0.341 Sum_probs=28.5 Q ss_pred CEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECC Q ss_conf 66785410020235530477406799999999999829957847888523 Q gi|254780799|r 445 KPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDP 494 (806) Q Consensus 445 ~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDP 494 (806) +-+-.++.+==.+-|.|-.|||||..+++ |+.+ + +|+.=+ |++|- T Consensus 18 ~dvs~~i~~Ge~~~liG~nGsGKSTll~~-i~Gl-~--~~~~G~-i~~~G 62 (240) T PRK09493 18 HNIDLNIDQGEVVVIIGPSGSGKSTLLRC-INKL-E--EITSGD-LIVDG 62 (240) T ss_pred ECEEEEECCCCEEEEECCCCCHHHHHHHH-HHCC-C--CCCCCE-EEECC T ss_conf 13078987998999999999809999999-9638-9--999974-87899 No 330 >cd03252 ABCC_Hemolysin The ABC-transporter hemolysin B is a central component of the secretion machinery that translocates the toxin, hemolysin A, in a Sec-independent fashion across both membranes of E. coli. The hemolysin A (HlyA) transport machinery is composed of the ATP-binding cassette (ABC) transporter HlyB located in the inner membrane, hemolysin D (HlyD), also anchored in the inner membrane, and TolC, which resides in the outer membrane. HlyD apparently forms a continuous channel that bridges the entire periplasm, interacting with TolC and HlyB. This arrangement prevents the appearance of periplasmic intermediates of HlyA during substrate transport. Little is known about the molecular details of HlyA transport, but it is evident that ATP-hydrolysis by the ABC-transporter HlyB is a necessary source of energy. Probab=64.87 E-value=7.7 Score=18.44 Aligned_cols=176 Identities=20% Similarity=0.332 Sum_probs=80.5 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHC-CHHH---------------------EEEEEECCCHHHHHHCCCCH Q ss_conf 5410020235530477406799999999999829-9578---------------------47888523100111027703 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLYRM-TPAQ---------------------CRLIMIDPKMLELSVYDGIP 506 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~-~P~e---------------------vkliliDPK~vEls~Y~~iP 506 (806) .++.+==..-|-|.+|||||--++. |.+. .|.+ +-.+.=||....-++++.|- T Consensus 23 ~~i~~G~~vaivG~sGsGKSTll~l-----l~gl~~p~~G~I~i~g~di~~~~~~~~r~~i~~v~Q~~~lf~~Ti~eNi~ 97 (237) T cd03252 23 LRIKPGEVVGIVGRSGSGKSTLTKL-----IQRFYVPENGRVLVDGHDLALADPAWLRRQVGVVLQENVLFNRSIRDNIA 97 (237) T ss_pred EEECCCCEEEEECCCCCHHHHHHHH-----HHCCCCCCCCEEEECCEECCCCCHHHHHHHCEEEECCCCCCCCCHHHHHH T ss_conf 9987999999999999859999999-----96776579878999999955189999986018995877155782898872 Q ss_pred HHHCCCCCCHHHHHHH-----HHHHHHHHHHHHHHHHHCCCCCHHHH-HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 4312233430456689-----99999999999999987089968999-99999887447866775446776545432223 Q gi|254780799|r 507 NLLTPVVTNPQKAVTV-----LKWLVCEMEERYQKMSKIGVRNIDGF-NLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYE 580 (806) Q Consensus 507 HLl~pVvTd~~kA~~a-----L~w~V~EMe~RY~l~a~~~vRni~~y-N~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 580 (806) +-.|-+++. +...| |.-.+.++...|+.+-..+-+|+.|= -+|+.-|++- T Consensus 98 -~g~~~~~~~-~i~~a~~~~~l~~~i~~l~~g~~t~i~~~g~~LSgGqkQRlalARal---------------------- 153 (237) T cd03252 98 -LADPGMSME-RVIEAAKLAGAHDFISELPEGYDTIVGEQGAGLSGGQRQRIAIARAL---------------------- 153 (237) T ss_pred -CCCCCCCHH-HHHHHHHHHCCHHHHHHCCCCCCEEECCCCCCCCHHHHHHHHHHHHH---------------------- T ss_conf -389999999-99999999685444663666662055797884899999999999999---------------------- Q ss_pred CCCCCCCCCCEEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCC Q ss_conf 3223234698687763446-888873210058899999986641423799996577753554355411025158764586 Q gi|254780799|r 581 TEHFDFQHMPYIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSK 659 (806) Q Consensus 581 ~~~~~~~~lp~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~ 659 (806) =.=|- |++.||- +-|=....+.+...+.++ ..+.-.|+.|.|++. + ..--||.+ T Consensus 154 ------l~~~~-illlDEpts~LD~~te~~i~~~l~~~-----~~~~Tvi~itHrl~~------l--~~aD~Iiv----- 208 (237) T cd03252 154 ------IHNPR-ILIFDEATSALDYESEHAIMRNMHDI-----CAGRTVIIIAHRLST------V--KNADRIIV----- 208 (237) T ss_pred ------HCCCC-EEEEECCCCCCCHHHHHHHHHHHHHH-----HCCCEEEEEECCHHH------H--HHCCEEEE----- T ss_conf ------66999-99994864568989999999999998-----299999998078889------9--84999999----- Q ss_pred CCCHHHCCCCCHHHHCCCC Q ss_conf 6421233886457865887 Q gi|254780799|r 660 IDSRTILGEQGAEQLLGQG 678 (806) Q Consensus 660 ~dSrtild~~gae~Llg~g 678 (806) .|.=.|..++--+.|+-++ T Consensus 209 l~~G~iv~~G~~~eLl~~~ 227 (237) T cd03252 209 MEKGRIVEQGSHDELLAEN 227 (237) T ss_pred EECCEEEEECCHHHHHHCC T ss_conf 9899999989879998699 No 331 >PRK10513 sugar phosphatase; Provisional Probab=64.65 E-value=6.4 Score=19.05 Aligned_cols=34 Identities=35% Similarity=0.548 Sum_probs=22.9 Q ss_pred CCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCC Q ss_conf 2100588999999866414237999965777535 Q gi|254780799|r 606 ARKDIESAVQRLAQMARASGIHVIMATQRPSVDV 639 (806) Q Consensus 606 ~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdv 639 (806) ..+++-....+--+++|+.|||.++||=||--.+ T Consensus 17 ~~~~i~~~~~~ai~~l~~~Gi~~~iaTGR~~~~~ 50 (270) T PRK10513 17 PDHTISPAVKNAIAAARAKGVNVVLTTGRPYAGV 50 (270) T ss_pred CCCCCCHHHHHHHHHHHHCCCEEEEECCCCHHHH T ss_conf 9796899999999999978999999979986878 No 332 >COG4172 ABC-type uncharacterized transport system, duplicated ATPase component [General function prediction only] Probab=64.63 E-value=6.8 Score=18.82 Aligned_cols=89 Identities=24% Similarity=0.305 Sum_probs=48.5 Q ss_pred EEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHH------------------HCCHHHE--------EEEEECCCHHHHH Q ss_conf 7854100202355304774067999999999998------------------2995784--------7888523100111 Q gi|254780799|r 447 IIADLARMPHLLIAGTTGSGKSVAINTMILSLLY------------------RMTPAQC--------RLIMIDPKMLELS 500 (806) Q Consensus 447 vv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLy------------------k~~P~ev--------kliliDPK~vEls 500 (806) +-.|+..=--+-+-|-.||||||.-|+|.-=|=| ++++.++ -||.=.| |. T Consensus 29 isf~i~~GEtlAlVGESGSGKSvTa~sim~LLp~~~~~~~sg~i~f~G~dlL~~se~~lr~iRG~~I~MIFQEP-Mt--- 104 (534) T COG4172 29 ISFDIEAGETLALVGESGSGKSVTALSILGLLPSPAAAHPSGSILFDGEDLLAASERQLRGVRGNKIGMIFQEP-MT--- 104 (534) T ss_pred CEEEECCCCEEEEEECCCCCCCHHHHHHHHHCCCCCCCCCCCEEEECCHHHHCCCHHHHHHHCCCCEEEEECCC-CC--- T ss_conf 03664389779998348898228999999746898545766326786701321898898521266068895266-23--- Q ss_pred HCCCCHHHHCCCCCCHHHHHHHHHHHHH-----HHHHHHHHHHHCCCCCHH Q ss_conf 0277034312233430456689999999-----999999999870899689 Q gi|254780799|r 501 VYDGIPNLLTPVVTNPQKAVTVLKWLVC-----EMEERYQKMSKIGVRNID 546 (806) Q Consensus 501 ~Y~~iPHLl~pVvTd~~kA~~aL~w~V~-----EMe~RY~l~a~~~vRni~ 546 (806) =|.|+-|=-+.-+.+|+|--. -=+|=-+++-..|.|+=+ T Consensus 105 -------SLNPl~tIg~Qi~E~l~~Hrg~~~~~Ar~r~lelL~~VgI~~p~ 148 (534) T COG4172 105 -------SLNPLHTIGKQLAEVLRLHRGLSRAAARARALELLELVGIPEPE 148 (534) T ss_pred -------CCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHH T ss_conf -------36717679999999999972554778999999999971998623 No 333 >PRK02412 aroD 3-dehydroquinate dehydratase; Provisional Probab=64.52 E-value=3.9 Score=20.69 Aligned_cols=171 Identities=20% Similarity=0.354 Sum_probs=96.8 Q ss_pred CCCHHHHCCCCC-CHHHHHHHHHHH----HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC--CCCC---CCCCCCCCCC Q ss_conf 770343122334-304566899999----99999999999870899689999999988744--7866---7754467765 Q gi|254780799|r 503 DGIPNLLTPVVT-NPQKAVTVLKWL----VCEMEERYQKMSKIGVRNIDGFNLKVAQYHNT--GKKF---NRTVQTGFDR 572 (806) Q Consensus 503 ~~iPHLl~pVvT-d~~kA~~aL~w~----V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~--~~~~---~~~~~~~~~~ 572 (806) +|+|-.-.|++- +.++.....+-+ ++=.|=|.+.|.+.. |.+..-+..++.+.. +.|+ .|+.+.|..- T Consensus 13 ~g~pkIcvpi~~~t~~e~~~~~~~~~~~~aDlvE~RlD~l~~~~--~~~~~~~~~~~i~~~~~~~PlI~T~R~~~EGG~~ 90 (253) T PRK02412 13 EGAPKIVVPIMAKTIEEVAQEAEAIKYADADIIEWRADFLEKVS--DVEEVLAAAPAIREKFPGKPLLFTFRTAKEGGEI 90 (253) T ss_pred CCCCEEEEEECCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCC--CHHHHHHHHHHHHHHCCCCCEEEEECCHHHCCCC T ss_conf 99978999967899999999999984469998999603134569--9999999999999846899789992474438986 Q ss_pred CCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHC---C----CHHH-----HHHHHHHHHHHHCCEEEEEEE----CCCC Q ss_conf 4543222332232346986877634468888732---1----0058-----899999986641423799996----5777 Q gi|254780799|r 573 KTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVA---R----KDIE-----SAVQRLAQMARASGIHVIMAT----QRPS 636 (806) Q Consensus 573 ~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~---~----~~ve-----~~i~rlaq~ara~GiHli~aT----qrPs 636 (806) +-. -+|..++...+ | =||| ....+|.|.||..|+.+|+-- ..|| T Consensus 91 ~~~--------------------e~~~~~il~~~~~~~~~d~iDiEl~~~~~~~~~l~~~a~~~~~~vI~S~HdF~~TP~ 150 (253) T PRK02412 91 ELS--------------------DEEYLALYKAVIESGLPDYIDVELFSGKDVVKEMVAFAHEHGVKVVLSYHDFHKTPS 150 (253) T ss_pred CCC--------------------HHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCC T ss_conf 699--------------------999999999999759985797531377789999999888669989998548988979 Q ss_pred CCCCCHHHH------HCCCCEEEEEECCCCCCHHHCCCCCHHHHCC-CCCEEEEC-C-CCCEEEEEECC Q ss_conf 535543554------1102515876458664212338864578658-87547736-8-98325888334 Q gi|254780799|r 637 VDVITGTIK------ANFPTRISFQVSSKIDSRTILGEQGAEQLLG-QGDMLYMT-G-GGRVQRIHGPF 696 (806) Q Consensus 637 vdvitg~ik------an~p~riaf~v~s~~dSrtild~~gae~Llg-~gdml~~~-~-~~~~~r~~g~~ 696 (806) .+.+...++ |.+. .||....|.-|-=++|.-.-.-+..+ ..-..-++ | .+++.|+.|+. T Consensus 151 ~~~l~~~~~~m~~~gADIv-Kia~~~~~~~Dvl~ll~~t~~~~~~~~~~Pli~~~MG~~G~~sRi~~~~ 218 (253) T PRK02412 151 AEEIVERLRKMEELGADIV-KIAVMPQSEQDVLTLLNATREMKELYADQPIITMSMGKLGRISRLAGEV 218 (253) T ss_pred HHHHHHHHHHHHHCCCCEE-EEEEECCCHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHH T ss_conf 9999999999997099989-9998569888999999999999960799988999679864899998998 No 334 >PRK13663 hypothetical protein; Provisional Probab=64.52 E-value=4.9 Score=19.88 Aligned_cols=102 Identities=25% Similarity=0.349 Sum_probs=62.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCC---CCCCC Q ss_conf 556325665457899999974248632899841044244444321478639999978899998863001210---00388 Q gi|254780799|r 324 QMTFSPKVMQNNACTLKSVLSDFGIQGEIVNVRPGPVITLYELEPAPGIKSSRIIGLSDDIARSMSAISARV---AVIPR 400 (806) Q Consensus 324 ~~~~s~eeL~~nA~lLE~tL~dFGVe~~Vv~v~pGPVVTrYEi~PApGVKvSKI~nLadDIA~aLsa~svRI---apIPG 400 (806) ..+++++.|+ |.+.|.++|+.+.-+ |+|||+=+|++-. --.-|.-..|++ -|||| T Consensus 87 GItYd~dVlR-----LiD~fr~~gl~V~sV------VITqy~~qpaa~~-----------F~~~Le~~gIkvy~Hy~i~G 144 (493) T PRK13663 87 GISYDQDVLR-----LIDDFRELDIYVGSV------VITQYEGQPAADA-----------FRNKLERRGIKVYLHYPIKG 144 (493) T ss_pred CCCHHHHHHH-----HHHHHHHCCCEECEE------EEECCCCCHHHHH-----------HHHHHHHCCCEEEEECCCCC T ss_conf 8862599999-----999998649653629------9862589857999-----------99999877970698446688 Q ss_pred CCEEEEEECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHH-HHHHHHHHHHHH Q ss_conf 6125553024566238734230721343022100021045666666785410020235530477406-799999999999 Q gi|254780799|r 401 RNAIGIELPNDIRETVMLRDLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGK-SVAINTMILSLL 479 (806) Q Consensus 401 K~~VGIEiPN~~r~~V~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGK-SV~iN~iI~SlL 479 (806) = |+.-.-+|+ .|=..-.+|-++..+| |+ .-|=--|||| .+|+ |=| T Consensus 145 Y-------P~dv~~IVS-~eGyG~N~yIeTtrpl-------------VV--------VTaPGPGSGKlATCL-----SQl 190 (493) T PRK13663 145 Y-------PTDVDHIVS-DEGYGKNDYIETTRPL-------------VV--------VTAPGPGSGKLATCL-----SQL 190 (493) T ss_pred C-------CCCCCEEEC-CCCCCCCCCCCCCCCE-------------EE--------EECCCCCCCHHHHHH-----HHH T ss_conf 8-------876341555-4666878751346876-------------99--------958999954798999-----999 Q ss_pred HH Q ss_conf 82 Q gi|254780799|r 480 YR 481 (806) Q Consensus 480 yk 481 (806) |. T Consensus 191 Yh 192 (493) T PRK13663 191 YH 192 (493) T ss_pred HH T ss_conf 99 No 335 >PRK11022 dppD dipeptide transporter ATP-binding subunit; Provisional Probab=64.52 E-value=5.7 Score=19.42 Aligned_cols=183 Identities=16% Similarity=0.186 Sum_probs=80.8 Q ss_pred EEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCH--HHEEEEEE-----CCCHH------HHH-HCCCCHHHHCCC Q ss_conf 78541002023553047740679999999999982995--78478885-----23100------111-027703431223 Q gi|254780799|r 447 IIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTP--AQCRLIMI-----DPKML------ELS-VYDGIPNLLTPV 512 (806) Q Consensus 447 vv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P--~evklili-----DPK~v------Els-~Y~~iPHLl~pV 512 (806) |-.++.+==-+=|.|-.|||||+-.++|+--+-+..++ .++.|==. ++|.. +.+ +|.+--.-|.|+ T Consensus 26 Vsf~i~~GEilgivGeSGsGKSTl~~~ilgll~~~~~~~~g~i~~~g~dl~~~~~~~~~~~~g~~i~~vfQdp~~sLnP~ 105 (327) T PRK11022 26 ISYSVKQGEVVGIVGESGSGKSVSSLAIMGLIDYPGRVMAEKLEFNGQDLQRISEKERRNLVGAEVAMIFQDPMTSLNPC 105 (327) T ss_pred EEEEECCCCEEEEECCCCCHHHHHHHHHHCCCCCCCCEEEEEEEECCEECCCCCHHHHHHHHCCCEEEEECCCHHHCCHH T ss_conf 18798899999999999878999999997488989976542799999997749999999863776699960851320745 Q ss_pred CCCHHHHHHHHHH----HHHH-HHHHHHHHHHCCCCCHHHH------------HHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 3430456689999----9999-9999999987089968999------------999998874478667754467765454 Q gi|254780799|r 513 VTNPQKAVTVLKW----LVCE-MEERYQKMSKIGVRNIDGF------------NLKVAQYHNTGKKFNRTVQTGFDRKTG 575 (806) Q Consensus 513 vTd~~kA~~aL~w----~V~E-Me~RY~l~a~~~vRni~~y------------N~k~~~~~~~~~~~~~~~~~~~~~~~~ 575 (806) .|=-..-..+|+- --+| .+|-.++|...|..+-+.+ .+|+.-|++- T Consensus 106 ~tv~~~i~e~l~~~~~~~~~~~~~r~~elL~~vgl~~~~~~l~~yP~eLSGGq~QRV~IArAL----------------- 168 (327) T PRK11022 106 YTVGFQIMEAIKVHQGGNKKTRRQRAIDLLNQVGIPDPASRLDVYPHQLSGGMSQRVMIAMAI----------------- 168 (327) T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCHHHHHHCCHHHCCHHHHHHHHHHHHH----------------- T ss_conf 555576777888752788899999999999871585688897428554699999999999999----------------- Q ss_pred CCCCCCCCCCCCCCCEEEEEHHHHH-HHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEE Q ss_conf 3222332232346986877634468-888732100588999999866414237999965777535543554110251587 Q gi|254780799|r 576 EAIYETEHFDFQHMPYIVVVIDEMA-DLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISF 654 (806) Q Consensus 576 ~~~~~~~~~~~~~lp~ivviiDEla-Dlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf 654 (806) -.=|- ++|-||=- -|=.+...+|=+.+.+| .+--|+-+|+-|.--++ =+.+--||+- T Consensus 169 -----------~~~P~-lLIaDEPTsaLD~~~q~~Il~ll~~l---~~~~g~til~ITHDl~~-------v~~~aDri~V 226 (327) T PRK11022 169 -----------ACRPK-LLIADEPTTALDVTIQAQIIELLLEL---QQKENMALVLITHDLAL-------VAEAAHKIIV 226 (327) T ss_pred -----------HHCCC-EEEECCCCCCCCHHHHHHHHHHHHHH---HHHHCCCEEEEECCHHH-------HHHHCCEEEE T ss_conf -----------70999-99983887657999999999999999---99719948999288999-------9986998999 Q ss_pred EECCCC----CCHHHCCC Q ss_conf 645866----42123388 Q gi|254780799|r 655 QVSSKI----DSRTILGE 668 (806) Q Consensus 655 ~v~s~~----dSrtild~ 668 (806) -=.-++ +...|++. T Consensus 227 My~G~iVE~G~~~~i~~~ 244 (327) T PRK11022 227 MYAGQVVETGDAHEIFRA 244 (327) T ss_pred EECCEEEEECCHHHHHHC T ss_conf 989889997779999707 No 336 >PRK09334 30S ribosomal protein S25e; Provisional Probab=64.42 E-value=10 Score=17.54 Aligned_cols=63 Identities=17% Similarity=0.329 Sum_probs=54.3 Q ss_pred CCCCHHHHHHHHHHHHCCCEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHCCCCCEEEEC Q ss_conf 777703899999999659850001422200117789999999997798680227887267317 Q gi|254780799|r 737 SVADDLYKQAVDIVLRDNKASISYIQRRLGIGYNRAASIIENMEEKGVIGPASSTGKREILIS 799 (806) Q Consensus 737 ~~~d~l~~~a~~~v~~~~~~s~s~lqr~~~igy~raar~~~~~e~~giv~~~~g~~~r~vl~~ 799 (806) .-++++|++...=|-...-.-.+.|--+++|--.-|-|++..||.+|+|--..++..-+|.++ T Consensus 44 tvde~~~~ki~kEV~k~k~VTPy~La~K~~I~~SvArkvLr~L~~rG~l~lvsk~rR~~IYvp 106 (107) T PRK09334 44 TLDEELLKRIAKEVKKEKIVTPYTLASKYGIKISVAKKVLRELEKRGVLVLYSKGRRTPIYVP 106 (107) T ss_pred ECCHHHHHHHHHHHCCCCEECHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCEEEECC T ss_conf 547999999999850483675899998857108999999999997897899818974365357 No 337 >smart00534 MUTSac ATPase domain of DNA mismatch repair MUTS family. Probab=64.41 E-value=8.2 Score=18.21 Aligned_cols=118 Identities=20% Similarity=0.272 Sum_probs=68.9 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 35530477406799999999999829957847888523100111027703431223343045668999999999999999 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQK 536 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~l 536 (806) ++|-|.-.||||+.+-++-+..++-..= .-|==+..++++|+.| ++ -+-+.......+-.-..||.+=-++ T Consensus 2 ~iiTGpN~sGKSt~lk~i~l~~~laq~G-----~~vpa~~~~~~~~d~i---~~-~i~~~d~~~~~~S~F~~e~~~~~~i 72 (185) T smart00534 2 VIITGPNMGGKSTYLRQVGLIVIMAQIG-----SFVPAESAELPVFDRI---FT-RIGASDSLAQGLSTFMVEMKETANI 72 (185) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHHC-----CCEEECCCEEECCCEE---EE-EECCCCCCCCCCCHHHHHHHHHHHH T ss_conf 8997999884899999999999999978-----8861577199503100---23-3564100215641899999999999 Q ss_pred HHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCHHHH---- Q ss_conf 9870899689999999988744786677544677654543222332232346986877634468888732100588---- Q gi|254780799|r 537 MSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKDIES---- 612 (806) Q Consensus 537 ~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~ve~---- 612 (806) +..++ +.-.|++||+.- |-+.++ T Consensus 73 l~~~~------------------------------------------------~~sLvliDEl~~-----GT~~~eg~al 99 (185) T smart00534 73 LKNAT------------------------------------------------ENSLVLLDELGR-----GTSTYDGVAI 99 (185) T ss_pred HHHCC------------------------------------------------CCEEEEECCCCC-----CCCHHHHHHH T ss_conf 98389------------------------------------------------873763041136-----8881478999 Q ss_pred HHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 999999866414237999965777 Q gi|254780799|r 613 AVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 613 ~i~rlaq~ara~GiHli~aTqrPs 636 (806) +.+=|.+.++-.|-+.|+||--.. T Consensus 100 a~aile~l~~~~~~~~iitTH~~e 123 (185) T smart00534 100 AAAVLEYLLEKIGALTLFATHYHE 123 (185) T ss_pred HHHHHHHHHHCCCCEEEEEECHHH T ss_conf 999999999636975998604378 No 338 >KOG0057 consensus Probab=64.33 E-value=4.9 Score=19.92 Aligned_cols=42 Identities=31% Similarity=0.468 Sum_probs=25.8 Q ss_pred CEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEEC---CCHHHHHHC Q ss_conf 023553047740679999999999982995784788852---310011102 Q gi|254780799|r 455 PHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMID---PKMLELSVY 502 (806) Q Consensus 455 PHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliD---PK~vEls~Y 502 (806) --+=|.|.-|||||-.||++. |+-- +=-=|+|| =|.+++.-- T Consensus 379 ekVaIvG~nGsGKSTilr~Ll-----rF~d-~sG~I~IdG~dik~~~~~Sl 423 (591) T KOG0057 379 EKVAIVGSNGSGKSTILRLLL-----RFFD-YSGSILIDGQDIKEVSLESL 423 (591) T ss_pred CEEEEECCCCCCHHHHHHHHH-----HHHC-CCCCEEECCEEHHHHCHHHH T ss_conf 789897899987889999999-----9744-68859998733765075776 No 339 >PRK09206 pyruvate kinase; Provisional Probab=64.30 E-value=3.6 Score=20.90 Aligned_cols=236 Identities=22% Similarity=0.298 Sum_probs=124.9 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEEECCCCCHHHHHHHHHHHHHHHHH-CCCCCCCCCCCCEEEE Q ss_conf 256654578999999742486328998410442444443214786399999788999988630-0121000388612555 Q gi|254780799|r 328 SPKVMQNNACTLKSVLSDFGIQGEIVNVRPGPVITLYELEPAPGIKSSRIIGLSDDIARSMSA-ISARVAVIPRRNAIGI 406 (806) Q Consensus 328 s~eeL~~nA~lLE~tL~dFGVe~~Vv~v~pGPVVTrYEi~PApGVKvSKI~nLadDIA~aLsa-~svRIapIPGK~~VGI 406 (806) +.++.++.-+.+.+..+++|...-|--=.+||-+=.-+++...-++ +.. .-+++. -+++.+| T Consensus 39 ~~e~h~~~i~~iR~~~~~~~~~v~Il~Dl~GPkIR~g~l~~~~~i~--------------l~~G~~~~l~--~~~~~~g- 101 (470) T PRK09206 39 DYAEHGQRIQNLRNVMSKTGKKAAILLDTKGPEIRTMKLEGGNDVS--------------LKAGQTFTFT--TDKSVVG- 101 (470) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCEEEEECCCCCEEE--------------ECCCCEEEEE--CCCCCCC- T ss_conf 9999999999999999973998279997889804898537997358--------------6289889995--6876478- Q ss_pred EECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCC------CCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHH Q ss_conf 302456623873423072134302210002104------56666667854100202355304774067999999999998 Q gi|254780799|r 407 ELPNDIRETVMLRDLIVSRVFEKNQCDLAINLG------KSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLY 480 (806) Q Consensus 407 EiPN~~r~~V~lreil~s~~f~~s~~~L~iaLG------KdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLy 480 (806) |.++-.|...++++. . +....+-+.=| .++.++-+.+ -.+-+|.=+|.|.|-+ T Consensus 102 ---~~~~i~v~~~~l~~~--v-~~Gd~I~idDG~i~l~V~~v~~~~i~~------~V~~gG~L~s~Kgvnl--------- 160 (470) T PRK09206 102 ---NNEIVAVTYEGFTTD--L-SVGNTVLVDDGLIGMEVTAITGNEVIC------KVLNNGDLGENKGVNL--------- 160 (470) T ss_pred ---CCCEEEECCHHHHHH--C-CCCCEEEEECCEEEEEEEEEECCEEEE------EEEECCEECCCCCCCC--------- T ss_conf ---777897463565855--7-889989984896899999970997999------9974838548864233--------- Q ss_pred HCCHHHEEEEEECCCHHHHHHCCCCH--HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC Q ss_conf 29957847888523100111027703--4312233430456689999999999999999870899689999999988744 Q gi|254780799|r 481 RMTPAQCRLIMIDPKMLELSVYDGIP--NLLTPVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNT 558 (806) Q Consensus 481 k~~P~evkliliDPK~vEls~Y~~iP--HLl~pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~ 558 (806) | .+-.|..|+-.+ ..|+|++ +.--+..+-.=||+=+.-.+--+-.+.. T Consensus 161 -------------------------P~~~~~lp~ltekD~--~di~f~~---~~~vD~IalSFVrsa~DV~~lr~~l~~~ 210 (470) T PRK09206 161 -------------------------PGVSIALPALAEKDK--QDLIFGC---EQGVDFVAASFIRKRSDVLEIREHLKAH 210 (470) T ss_pred -------------------------CCCCCCCCCCCHHHH--HHHHHHH---HCCCCEEEECCCCCHHHHHHHHHHHHHH T ss_conf -------------------------444578678898899--9999999---8599999985879878899999999972 Q ss_pred -CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCH--HHHHH-------HHHHHHHHHCCEEE Q ss_conf -786677544677654543222332232346986877634468888732100--58899-------99998664142379 Q gi|254780799|r 559 -GKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKD--IESAV-------QRLAQMARASGIHV 628 (806) Q Consensus 559 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~--ve~~i-------~rlaq~ara~GiHl 628 (806) |+.. ++.. ..| ....++.+.-|+ |.+|=.|+|.-| ||-++ -++-++||.+|.-. T Consensus 211 ~~~~~--~IIA---------KIE-~~~av~NldeIi----~~sDgIMIARGDLgvei~~e~vp~~Qk~II~~c~~~gKPv 274 (470) T PRK09206 211 GGENI--QIIS---------KIE-NQEGLNNFDEIL----EASDGIMVARGDLGVEIPVEEVIFAQKMMIEKCNRARKVV 274 (470) T ss_pred CCCCC--EEEE---------EEC-CHHHHHHHHHHH----HHCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCE T ss_conf 69973--4999---------846-888887399999----8677999935865331688885899999999999739988 Q ss_pred EEEECCCCCCCCCHHHHHCCCCEE Q ss_conf 999657775355435541102515 Q gi|254780799|r 629 IMATQRPSVDVITGTIKANFPTRI 652 (806) Q Consensus 629 i~aTqrPsvdvitg~ikan~p~ri 652 (806) |+||| +.--+|....|||- T Consensus 275 ivATq-----mLeSMi~~p~PTRA 293 (470) T PRK09206 275 ITATQ-----MLDSMIKNPRPTRA 293 (470) T ss_pred EEEEC-----CHHHHCCCCCCCHH T ss_conf 99937-----77874489998657 No 340 >PRK10619 histidine/lysine/arginine/ornithine transporter subunit; Provisional Probab=64.20 E-value=6.4 Score=19.06 Aligned_cols=39 Identities=23% Similarity=0.393 Sum_probs=26.7 Q ss_pred EEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEE Q ss_conf 7854100202355304774067999999999998299578478 Q gi|254780799|r 447 IIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRL 489 (806) Q Consensus 447 vv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkl 489 (806) +-.++.+=-.+-|.|..|||||.-+++| +-| .+|++=+. T Consensus 24 isl~i~~Gei~~iiG~sGsGKSTLl~~i--~gl--~~p~~G~I 62 (257) T PRK10619 24 VSLQANAGDVISIIGSSGSGKSTFLRCI--NFL--EKPSEGSI 62 (257) T ss_pred CEEEECCCCEEEEECCCCCHHHHHHHHH--HCC--CCCCCCEE T ss_conf 0668879979999989998199999999--659--99998189 No 341 >pfam00910 RNA_helicase RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses. Probab=64.16 E-value=7.1 Score=18.68 Aligned_cols=25 Identities=28% Similarity=0.425 Sum_probs=21.7 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 3553047740679999999999982 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYR 481 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk 481 (806) +.+-|..|.|||...+.+..-|+-+ T Consensus 1 i~l~G~~G~GKS~~a~~la~~~~~~ 25 (105) T pfam00910 1 IWLYGPPGCGKSTLAKYLARALLDH 25 (105) T ss_pred CEEECCCCCCHHHHHHHHHHHHHHH T ss_conf 9897999898899999999999998 No 342 >PRK09580 sufC cysteine desulfurase ATPase component; Reviewed Probab=64.10 E-value=4.6 Score=20.09 Aligned_cols=53 Identities=26% Similarity=0.219 Sum_probs=31.6 Q ss_pred CCCCCCEEEEECC----CCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCC Q ss_conf 5666666785410----0202355304774067999999999998299578478885231 Q gi|254780799|r 440 KSIEGKPIIADLA----RMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPK 495 (806) Q Consensus 440 KdI~G~pvv~DLa----kMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK 495 (806) |...|++++-|+. +==.+-|.|..|||||.-+|+|.= +....|+.=. |.+|-+ T Consensus 9 ~~~~~~~vL~~isl~i~~Gei~~iiG~nGaGKSTLl~~i~G--~~~~~~~~G~-I~~~G~ 65 (248) T PRK09580 9 VSVEDKAILRGLNLEVRPGEVHAIMGPNGSGKSTLSATLAG--REDYEVTGGT-VTFKGK 65 (248) T ss_pred EEECCEEEEECCEEEECCCCEEEEECCCCCCHHHHHHHHHC--CCCCCCCCCE-EEECCE T ss_conf 99899999965188984997999999999999999999837--7556875235-999999 No 343 >PRK03573 transcriptional regulator SlyA; Provisional Probab=64.03 E-value=10 Score=17.58 Aligned_cols=31 Identities=26% Similarity=0.446 Sum_probs=14.6 Q ss_pred EEEEHHHHHHCCCHHHHHHHHHHHHHCCCCC Q ss_conf 5000142220011778999999999779868 Q gi|254780799|r 756 ASISYIQRRLGIGYNRAASIIENMEEKGVIG 786 (806) Q Consensus 756 ~s~s~lqr~~~igy~raar~~~~~e~~giv~ 786 (806) .|.+-|-+++.|--.-.+|++|.||++|.|- T Consensus 47 ~tq~eLa~~l~v~~~t~tr~ld~LE~~GlI~ 77 (144) T PRK03573 47 QSQIQLAKAIGIEQPSLVRTLDQLEEKGLIS 77 (144) T ss_pred CCHHHHHHHHCCCHHHHHHHHHHHHHCCCEE T ss_conf 8999999997987836999999999888978 No 344 >cd03296 ABC_CysA_sulfate_importer Part of the ABC transporter complex cysAWTP involved in sulfate import. Responsible for energy coupling to the transport system. The complex is composed of two ATP-binding proteins (cysA), two transmembrane proteins (cysT and cysW), and a solute-binding protein (cysP). ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=63.91 E-value=5.6 Score=19.47 Aligned_cols=169 Identities=15% Similarity=0.219 Sum_probs=80.0 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHH-HHHH--------CCCCHHHHCCCCCCHHHH Q ss_conf 5410020235530477406799999999999829957847888523100-1110--------277034312233430456 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKML-ELSV--------YDGIPNLLTPVVTNPQKA 519 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~v-Els~--------Y~~iPHLl~pVvTd~~kA 519 (806) .++.+==-+-|-|-+|||||.-+|+|- .| .+|+.=+. .+|=|-+ .++. |.+ +-|-|--|=-... T Consensus 23 ~~v~~Ge~~~iiGpSGsGKSTll~~i~-Gl---~~p~~G~I-~~~g~~i~~~~~~~r~ig~vfQ~--~~Lfp~ltV~eNi 95 (239) T cd03296 23 LDIPSGELVALLGPSGSGKTTLLRLIA-GL---ERPDSGTI-LFGGEDATDVPVQERNVGFVFQH--YALFRHMTVFDNV 95 (239) T ss_pred EEECCCCEEEEECCCCCHHHHHHHHHH-CC---CCCCCEEE-EECCEECCCCCHHHCCEEEECCC--CCCCCCCCHHHHH T ss_conf 698899899999999977999999997-69---99986399-99999999999656776798178--2106799699999 Q ss_pred HHHHHH-------HHHHHHHH-HHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 689999-------99999999-9999870899689999999988744786677544677654543222332232346986 Q gi|254780799|r 520 VTVLKW-------LVCEMEER-YQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPY 591 (806) Q Consensus 520 ~~aL~w-------~V~EMe~R-Y~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ 591 (806) +..|+- --.||++| .+++...|.. +|-.+....-..|+... + .-.-.+-.-| T Consensus 96 ~~~l~~~~~~~~~~~~e~~~rv~~~l~~v~l~---~~~~~~p~eLSGGq~QR--V--------------aiARAl~~~P- 155 (239) T cd03296 96 AFGLRVKPRSERPPEAEIRAKVHELLKLVQLD---WLADRYPAQLSGGQRQR--V--------------ALARALAVEP- 155 (239) T ss_pred HHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCH---HHHHCCHHHCCHHHHHH--H--------------HHHHHHHCCC- T ss_conf 87997335456998999999999998654997---67748966699989899--9--------------9999876499- Q ss_pred EEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEE Q ss_conf 8776344-68888732100588999999866414237999965777535543554110251587 Q gi|254780799|r 592 IVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISF 654 (806) Q Consensus 592 ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf 654 (806) =|++-|| ++-|=-....++...|.+| .+.-||-.|+.|.-+.. =..+--||++ T Consensus 156 ~vlllDEP~s~LD~~~~~~i~~~l~~l---~~e~~~T~i~vTHd~~~-------a~~laDri~v 209 (239) T cd03296 156 KVLLLDEPFGALDAKVRKELRRWLRRL---HDELHVTTVFVTHDQEE-------ALEVADRVVV 209 (239) T ss_pred CEEEECCCCCCCCHHHHHHHHHHHHHH---HHHHCCEEEEECCCHHH-------HHHHCCEEEE T ss_conf 989973886646999999999999999---99859989999889999-------9996999999 No 345 >cd03297 ABC_ModC_molybdenum_transporter ModC is an ABC-type transporter and the ATPase component of a molybdate transport system that also includes the periplasmic binding protein ModA and the membrane protein ModB. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=63.90 E-value=4.7 Score=20.05 Aligned_cols=28 Identities=29% Similarity=0.342 Sum_probs=18.7 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEE Q ss_conf 35530477406799999999999829957847 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQCR 488 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evk 488 (806) +=|-|..|||||.-+|.|- =|. +|+.=+ T Consensus 26 ~~iiGpSGsGKSTll~~i~--GL~--~p~sG~ 53 (214) T cd03297 26 TGIFGASGAGKSTLLRCIA--GLE--KPDGGT 53 (214) T ss_pred EEEECCCCCHHHHHHHHHH--CCC--CCCCEE T ss_conf 9999999735999999998--499--999649 No 346 >PRK10253 iron-enterobactin transporter ATP-binding protein; Provisional Probab=63.74 E-value=7 Score=18.74 Aligned_cols=57 Identities=26% Similarity=0.451 Sum_probs=36.0 Q ss_pred CCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECC Q ss_conf 00021045666666785410020235530477406799999999999829957847888523 Q gi|254780799|r 433 DLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDP 494 (806) Q Consensus 433 ~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDP 494 (806) .|++..|+-..=+-+-.++.+==.+-|-|..|||||-.+|+|- .++ +|..=+ |.+|= T Consensus 12 nLs~~yg~~~~l~~isl~I~~Ge~~~iiGpNGaGKSTLlk~i~-Gll---~p~~G~-I~l~g 68 (265) T PRK10253 12 QLTLGYGKYTVAENLTVEIPDGHFTAIIGPNGCGKSTLLRTLS-RLM---TPAHGH-VWLDG 68 (265) T ss_pred EEEEEECCEEEEEEEEEEECCCCEEEEECCCCCHHHHHHHHHH-CCC---CCCCEE-EEECC T ss_conf 3999999999984028898599799999998839999999997-498---888529-99999 No 347 >PRK13639 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=63.63 E-value=5.7 Score=19.42 Aligned_cols=94 Identities=17% Similarity=0.308 Sum_probs=46.0 Q ss_pred EEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCH Q ss_conf 7763446-888873210058899999986641423799996577753554355411025158764586642123388645 Q gi|254780799|r 593 VVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGA 671 (806) Q Consensus 593 vviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~ga 671 (806) |+|.||= |-|=....+++-..+.+|. + -|+-.|++|..... =+.+-.|++. --+-|.|.| +-. T Consensus 158 iliLDEPTagLDp~~~~~i~~ll~~l~---~-~G~Tii~iTHdm~~-------~~~~adrv~v----l~~G~iv~~-G~p 221 (275) T PRK13639 158 IMVLDEPTSGLDPMGASQIMKLLYDLN---K-EGITIIISTHDVDL-------VPIYANKIYV----LNDGKIIKG-GTP 221 (275) T ss_pred EEEECCCCCCCCHHHHHHHHHHHHHHH---H-CCCEEEEECCCHHH-------HHHHCCEEEE----EECCEEEEE-CCH T ss_conf 999779755489999999999999999---7-69999999389999-------9996999999----989989998-788 Q ss_pred HHHCCCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHCC Q ss_conf 786588754773689832588833489889999999997128 Q gi|254780799|r 672 EQLLGQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKTQG 713 (806) Q Consensus 672 e~Llg~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q~ 713 (806) +.++-+-|+|. .+-+.-..+-++...+++++ T Consensus 222 ~evf~~~~~l~-----------~~~l~~P~~~~l~~~L~~~~ 252 (275) T PRK13639 222 KEIFSDSETIR-----------SANLRLPRVAHLIELLNKED 252 (275) T ss_pred HHHHCCHHHHH-----------HCCCCCCHHHHHHHHHHHCC T ss_conf 99874999999-----------77999990999999997536 No 348 >cd03283 ABC_MutS-like MutS-like homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form slid Probab=63.45 E-value=11 Score=17.41 Aligned_cols=144 Identities=15% Similarity=0.161 Sum_probs=71.7 Q ss_pred HHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCC Q ss_conf 34302210002104566666678541002023553047740679999999999982995784788852310011102770 Q gi|254780799|r 426 VFEKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGI 505 (806) Q Consensus 426 ~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~i 505 (806) ++++..|||- -+++--.|.+..+=.+ -++|-|.-.||||+.+-++-+..++-..= .-|-=|-.++. |.+| T Consensus 1 E~k~~rHPll--~~~~~V~Ndi~l~~~~--~~iiTGpN~~GKSt~Lk~i~l~~ilaq~G-----~~vpa~~~~~~-~~~i 70 (199) T cd03283 1 EAKNLGHPLI--GREKRVANDIDMEKKN--GILITGSNMSGKSTFLRTIGVNVILAQAG-----APVCASSFELP-PVKI 70 (199) T ss_pred CCCCCCCCCC--CCCCCCCCCEEECCCC--EEEEECCCCCCHHHHHHHHHHHHHHHHHC-----CEEEHHHEEEC-CCEE T ss_conf 9443688834--7898327725767985--89998999986599999999999999968-----93862447823-3089 Q ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 34312233430456689999999999999999870899689999999988744786677544677654543222332232 Q gi|254780799|r 506 PNLLTPVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFD 585 (806) Q Consensus 506 PHLl~pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 585 (806) ++- +.+.......+.---.||.|==+++.. +.+ + T Consensus 71 ---~t~-i~~~d~~~~~~S~F~~E~~~~~~il~~---------------~~~-~-------------------------- 104 (199) T cd03283 71 ---FTS-IRVSDDLRDGISYFYAELRRLKEIVEK---------------AKK-G-------------------------- 104 (199) T ss_pred ---EEE-EECCCHHHCCCCHHHHHHHHHHHHHHH---------------HCC-C-------------------------- T ss_conf ---999-834751224535799999999999997---------------248-9-------------------------- Q ss_pred CCCCCEEEEEHHHHHHHHHHCCCHHHH----HHHHHHHHHHHCCEEEEEEECCC Q ss_conf 346986877634468888732100588----99999986641423799996577 Q gi|254780799|r 586 FQHMPYIVVVIDEMADLMMVARKDIES----AVQRLAQMARASGIHVIMATQRP 635 (806) Q Consensus 586 ~~~lp~ivviiDElaDlmm~~~~~ve~----~i~rlaq~ara~GiHli~aTqrP 635 (806) ..-.|++||+.- |-+.++ +.+=|-.+.+ -+.+-++||-=+ T Consensus 105 ----~~sLvliDEl~~-----GT~~~eg~a~a~a~le~l~~-~~~~~iitTH~~ 148 (199) T cd03283 105 ----EPVLFLLDEIFK-----GTNSRERQAASAAVLKFLKN-KNTIGIISTHDL 148 (199) T ss_pred ----CCEEEEECCCCC-----CCCHHHHHHHHHHHHHHHHH-CCCEEEEECCCH T ss_conf ----857996143247-----99867899999999999986-798799988728 No 349 >PRK04841 transcriptional regulator MalT; Provisional Probab=63.43 E-value=11 Score=17.41 Aligned_cols=26 Identities=4% Similarity=-0.031 Sum_probs=12.8 Q ss_pred HHHHHCCCCCCCCCHHHHHHHHHHHH Q ss_conf 24674034544311278999999999 Q gi|254780799|r 5 MSFIISNKNENFLLSDWSKKKMKIVA 30 (806) Q Consensus 5 ~~~~~~~~~~~~~l~~f~~rrl~Ei~ 30 (806) |++..-.....++.++=+.+++.+.. T Consensus 5 tK~~pP~~~~~~v~R~rL~~~L~~~~ 30 (903) T PRK04841 5 SKLSRPVRLDHTVVRERLLAKLSGAN 30 (903) T ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHC T ss_conf 25689999878255477999998515 No 350 >COG1123 ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only] Probab=63.20 E-value=8.6 Score=18.07 Aligned_cols=27 Identities=33% Similarity=0.397 Sum_probs=18.6 Q ss_pred EEEEECCCCCEEEEEECCCCHHHHHHH Q ss_conf 678541002023553047740679999 Q gi|254780799|r 446 PIIADLARMPHLLIAGTTGSGKSVAIN 472 (806) Q Consensus 446 pvv~DLakMPHLLIAGtTGSGKSV~iN 472 (806) -|-.|+.+==-|=+-|-.|||||.--. T Consensus 309 ~VSf~l~~GE~lglVGeSGsGKSTlar 335 (539) T COG1123 309 DVSFDLREGETLGLVGESGSGKSTLAR 335 (539) T ss_pred EEEEEECCCCEEEEECCCCCCHHHHHH T ss_conf 124576388789998899999899999 No 351 >TIGR03598 GTPase_YsxC ribosome biogenesis GTP-binding protein YsxC/EngB. Members of this protein family are a GTPase associated with ribosome biogenesis, typified by YsxC from Bacillus subutilis. The family is widely but not universally distributed among bacteria. Members commonly are called EngB based on homology to EngA, one of several other GTPases of ribosome biogenesis. Cutoffs as set find essentially all bacterial members, but also identify large numbers of eukaryotic (probably organellar) sequences. This protein is found in about 80 percent of bacterial genomes. Probab=62.70 E-value=6.8 Score=18.83 Aligned_cols=24 Identities=33% Similarity=0.472 Sum_probs=22.1 Q ss_pred CCCEEEEEECCCCHHHHHHHHHHH Q ss_conf 020235530477406799999999 Q gi|254780799|r 453 RMPHLLIAGTTGSGKSVAINTMIL 476 (806) Q Consensus 453 kMPHLLIAGtTGSGKSV~iN~iI~ 476 (806) .-||.-|.|...+|||--+|+++= T Consensus 17 ~~p~IaivGrpNvGKSTL~N~L~g 40 (179) T TIGR03598 17 DGPEIAFAGRSNVGKSSLINALTN 40 (179) T ss_pred CCCEEEEECCCCCCHHHHHHHHHC T ss_conf 897899986999888999999868 No 352 >pfam10643 Cytochrome-c551 Photosystem P840 reaction-centre cytochrome c-551. A photosynthetic reaction-centre complex is found in certain green sulphur bacteria such as Chlorobium vibrioforme which are anaerobic photo-auto-trophic organisms. The primary electron donor is P840, a probable B-Chl a dimer, and the primary electron acceptor is a B-Chl monomer. Also on the donor side c-type cytochromes are known to function as electron donors to photo-oxidized P840. This family is thus the secondary endogenous donor of the photosynthetic reaction-centre complex and is a membrane-bound cytochrome containing a single haem group. Probab=62.63 E-value=11 Score=17.31 Aligned_cols=73 Identities=15% Similarity=0.203 Sum_probs=42.6 Q ss_pred HCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-CC-CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 403454431127899999999999999999999999844-27-0689631368864500013467899999999999999 Q gi|254780799|r 9 ISNKNENFLLSDWSKKKMKIVAGLILLCTVFAITLALGT-WD-VYDPSFSYITLRSPKNFLGYGGAIFADVAIQFFGIAS 86 (806) Q Consensus 9 ~~~~~~~~~l~~f~~rrl~Ei~Gl~Li~~al~l~iSL~S-Ys-p~DPs~~~~s~~~v~N~~G~lGA~iAd~L~~lFG~~A 86 (806) ||+|+|-.+.. -.+|-..+.-++|..+|+.+ |. |.| .+++.+. -|..++||.. T Consensus 1 MDNkSngkLia--------lAiGGAvlmgaLFfgvSflTGyk~pAe---------niS~~lt--------PL~Sf~GWfl 55 (213) T pfam10643 1 MDNKSNGKLIA--------LAIGGAVLMGALFFGVSFLTGYKVPAE---------NISPVLT--------PLRSFMGWFL 55 (213) T ss_pred CCCCCCCEEEE--------EHHHHHHHHHHHHHHHHHHHCCCCCCC---------CCCHHHH--------HHHHHHHHHH T ss_conf 99766772662--------035159999999999999844878755---------4316664--------5888999999 Q ss_pred HHHHHHHHHHHHHHHHCCCH Q ss_conf 99999999999999827652 Q gi|254780799|r 87 VFFLPPPTMWALSLLFDKKI 106 (806) Q Consensus 87 yllpl~Ll~~g~~ll~~k~~ 106 (806) .++..-+++||+--+..+-. T Consensus 56 Lif~ASliimglGkMS~~IS 75 (213) T pfam10643 56 LIFCASLIIMGLGKMSSRIS 75 (213) T ss_pred HHHHHHHHHHHHHHHHHHHC T ss_conf 99999999998877653540 No 353 >PRK05599 hypothetical protein; Provisional Probab=62.59 E-value=11 Score=17.30 Aligned_cols=73 Identities=18% Similarity=0.206 Sum_probs=47.2 Q ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEE--EECCCCCHHHHHHHHHHHHHHHHHCCCCC-CCCCCC Q ss_conf 563256654578999999742486328998410442444443--21478639999978899998863001210-003886 Q gi|254780799|r 325 MTFSPKVMQNNACTLKSVLSDFGIQGEIVNVRPGPVITLYEL--EPAPGIKSSRIIGLSDDIARSMSAISARV-AVIPRR 401 (806) Q Consensus 325 ~~~s~eeL~~nA~lLE~tL~dFGVe~~Vv~v~pGPVVTrYEi--~PApGVKvSKI~nLadDIA~aLsa~svRI-apIPGK 401 (806) -..+|..+..-++-|-.-|..+||. |..|.||+|-|...= .|.|-. ++ -.-.|+.|-+++...--++ ..+||+ T Consensus 150 Y~ASKaal~~~~~~L~~el~~~gI~--V~~v~PG~V~T~mt~~~~~~p~~-~s-pe~~A~~i~~~i~~~k~~~~i~~P~~ 225 (246) T PRK05599 150 YGSTKAGLDAFCQGLADSLHGSHVR--LIIARPGFVIGSMTTGMKPAPMS-VY-PRDVAAAVVSAITSKKRSTTLWIPGR 225 (246) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCE--EEEEECCCCCCCCCCCCCCCCCC-CC-HHHHHHHHHHHHHHCCCCEEEEECHH T ss_conf 8699999999999999995377968--99984498836200799989875-89-99999999999981898669997879 No 354 >pfam10165 Ric8 Guanine nucleotide exchange factor synembryn. Ric8 is involved in the EGL-30 neurotransmitter signalling pathway. It is a guanine nucleotide exchange factor that regulates neurotransmitter secretion. Probab=62.58 E-value=11 Score=17.30 Aligned_cols=122 Identities=15% Similarity=0.218 Sum_probs=65.5 Q ss_pred CCCCCEEEEEHHHHHHHHHHCC-CHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHH Q ss_conf 3469868776344688887321-005889999998664142379999657775355435541102515876458664212 Q gi|254780799|r 586 FQHMPYIVVVIDEMADLMMVAR-KDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRT 664 (806) Q Consensus 586 ~~~lp~ivviiDElaDlmm~~~-~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrt 664 (806) +-..|+ .-+=|=-|+||.+-- ++|...|- -.=-+-|||. T Consensus 306 LmT~p~-~~lk~~~aelLFvLckena~r~vk-y~GYGnAAGl-------------------------------------- 345 (438) T pfam10165 306 LMTSPD-TDLKDAASELLFVLCKEDASRFIK-YIGYGNAAGL-------------------------------------- 345 (438) T ss_pred HHCCCC-HHHHHHHHHHHHHHHCCCHHHHHH-HCCCHHHHHH-------------------------------------- T ss_conf 745887-028999999999995267999999-7485178999-------------------------------------- Q ss_pred HCCCCCHHHHCCCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 33886457865887547736898325888334898899999999971289742110012455666778888877770389 Q gi|254780799|r 665 ILGEQGAEQLLGQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKTQGEAKYIDIKDKILLNEEMRFSENSSVADDLYK 744 (806) Q Consensus 665 ild~~gae~Llg~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q~~~~y~~~~~~~~~~~~~~~~~~~~~~d~l~~ 744 (806) |=+=.|| -|.|.+++|-||.|-..--+|- |.++.. +.++....+.+......+=.+++.=. T Consensus 346 ----------La~rgll------~~~~~~~~ySsd~edsdteey~--~~~~~I-NPVTG~~~~~e~~n~~~~MTeEeKE~ 406 (438) T pfam10165 346 ----------LANRGLL------GPGRAQEAYSSDSEDSDTEEYK--SAKRAI-NPITGQVEDAEKPNPMEGMTEEEKER 406 (438) T ss_pred ----------HHHCCCC------CCCCCCCCCCCCCCCCCHHHHH--HCCCCC-CCCCCCCCCCCCCCCCCCCCHHHHHH T ss_conf ----------9874887------8876542346787765304566--406777-98768767877899988789889999 Q ss_pred HHHHHHHHCCCEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCCHHH Q ss_conf 999999965985000142220011778999999999779868022 Q gi|254780799|r 745 QAVDIVLRDNKASISYIQRRLGIGYNRAASIIENMEEKGVIGPAS 789 (806) Q Consensus 745 ~a~~~v~~~~~~s~s~lqr~~~igy~raar~~~~~e~~giv~~~~ 789 (806) ||.++++ ++|+|.+.|||.|.| T Consensus 407 EAerL~~-----------------------LfdrL~~~GvIq~~n 428 (438) T pfam10165 407 EAERLFV-----------------------LFERLKKTGVVQVEN 428 (438) T ss_pred HHHHHHH-----------------------HHHHHHHCCCCCCCC T ss_conf 9999999-----------------------999998669767886 No 355 >PRK13796 GTP-binding protein YqeH; Provisional Probab=62.54 E-value=6.6 Score=18.95 Aligned_cols=82 Identities=26% Similarity=0.400 Sum_probs=50.8 Q ss_pred EEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCC Q ss_conf 24444432147863999997889999886300121000388612555302456623873423072134302210002104 Q gi|254780799|r 360 VITLYELEPAPGIKSSRIIGLSDDIARSMSAISARVAVIPRRNAIGIELPNDIRETVMLRDLIVSRVFEKNQCDLAINLG 439 (806) Q Consensus 360 VVTrYEi~PApGVKvSKI~nLadDIA~aLsa~svRIapIPGK~~VGIEiPN~~r~~V~lreil~s~~f~~s~~~L~iaLG 439 (806) |+|.-.+-|. -++.+||.+--...++.......-|.-|-+|.--||| ++++.= ++ ..-| T Consensus 104 V~NKvDLLP~-~~~~~~~~~wv~~~~~~~gl~~~dV~lvSak~g~gv~------------~L~~~i--~~------~~~~ 162 (367) T PRK13796 104 VGNKADLLPK-SVKKNKVKNWLRQEAKELGLRPVDVVLISAQKGQGID------------ELLDAI--EK------YREG 162 (367) T ss_pred EEECHHCCCC-CCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHH------------HHHHHH--HH------HCCC T ss_conf 9982333887-6787899999999999759985528999465788999------------999999--98------6379 Q ss_pred CCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHH Q ss_conf 56666667854100202355304774067999999999 Q gi|254780799|r 440 KSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILS 477 (806) Q Consensus 440 KdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~S 477 (806) +|| .+-|+|-+|||-.||+|+-+ T Consensus 163 ~dv---------------yvVG~tNvGKSTLiN~ll~~ 185 (367) T PRK13796 163 RDV---------------YVVGVTNVGKSTLINRIIKE 185 (367) T ss_pred CCE---------------EEEECCCCCHHHHHHHHHHH T ss_conf 858---------------99915757589999999987 No 356 >COG2261 Predicted membrane protein [Function unknown] Probab=62.41 E-value=11 Score=17.30 Aligned_cols=18 Identities=28% Similarity=0.420 Sum_probs=12.5 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 013467899999999999 Q gi|254780799|r 66 FLGYGGAIFADVAIQFFG 83 (806) Q Consensus 66 ~~G~lGA~iAd~L~~lFG 83 (806) .+|.+||+++.+++..|| T Consensus 34 ilGIVGA~vg~~l~~~~g 51 (82) T COG2261 34 ILGIVGAFVGGWLLGALG 51 (82) T ss_pred HHHHHHHHHHHHHHHHHC T ss_conf 999999999989999856 No 357 >cd02430 PTH2 Peptidyl-tRNA hydrolase, type 2 (PTH2). Peptidyl-tRNA hydrolase (PTH) activity releases tRNA from the premature translation termination product peptidyl-tRNA, therefore allowing the tRNA and peptide to be reused in protein synthesis. PTH2 is present in archaea and eukaryotes. Probab=62.34 E-value=11 Score=17.27 Aligned_cols=47 Identities=26% Similarity=0.523 Sum_probs=32.1 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCEEEE------EEEECCEEEEEEEEECCCCCHHHHHH Q ss_conf 25665457899999974248632899------84104424444432147863999997 Q gi|254780799|r 328 SPKVMQNNACTLKSVLSDFGIQGEIV------NVRPGPVITLYELEPAPGIKSSRIIG 379 (806) Q Consensus 328 s~eeL~~nA~lLE~tL~dFGVe~~Vv------~v~pGPVVTrYEi~PApGVKvSKI~n 379 (806) |+++|++ |.++.++.|+...++ ++-|| ++|.--|-|+|--++.+|++ T Consensus 58 ~e~~l~~----l~~~a~~~~l~~~~V~DAG~Tev~pg-t~TvlaigP~~~~~id~itg 110 (115) T cd02430 58 SEEELLE----LKKKAKSLGLPTSLIQDAGRTQIAPG-TITVLGIGPAPEELIDKVTG 110 (115) T ss_pred CHHHHHH----HHHHHHHCCCCEEEEECCCCCCCCCC-CEEEEEECCCCHHHHHHHHC T ss_conf 9999999----99999987997899987987402899-97899986788889888648 No 358 >PRK10584 putative ABC transporter ATP-binding protein YbbA; Provisional Probab=62.30 E-value=6.6 Score=18.91 Aligned_cols=28 Identities=25% Similarity=0.339 Sum_probs=21.2 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHH Q ss_conf 5410020235530477406799999999 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMIL 476 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~ 476 (806) .++.+==.+-|-|..|||||.-+|.|.- T Consensus 31 l~i~~GE~v~ivG~sGsGKSTLl~~i~G 58 (228) T PRK10584 31 LVVKRGETIALIGESGSGKSTLLAILAG 58 (228) T ss_pred EEECCCCEEEEECCCCCHHHHHHHHHHC T ss_conf 8999998999999998589999999966 No 359 >COG2850 Uncharacterized conserved protein [Function unknown] Probab=62.21 E-value=4.1 Score=20.44 Aligned_cols=22 Identities=0% Similarity=0.071 Sum_probs=9.8 Q ss_pred HHHHHHHHHHHHHHCCEEEEEE Q ss_conf 5889999998664142379999 Q gi|254780799|r 610 IESAVQRLAQMARASGIHVIMA 631 (806) Q Consensus 610 ve~~i~rlaq~ara~GiHli~a 631 (806) |-..+.+.-|.+|..+=|+++- T Consensus 103 v~~l~~~FrflP~wr~ddiMIS 124 (383) T COG2850 103 VAALMEPFRFLPDWRIDDIMIS 124 (383) T ss_pred HHHHHHHHCCCCCCCCCCEEEE T ss_conf 9999987236755534545899 No 360 >PRK06835 DNA replication protein DnaC; Validated Probab=62.20 E-value=9.9 Score=17.62 Aligned_cols=28 Identities=39% Similarity=0.521 Sum_probs=25.5 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCC Q ss_conf 2355304774067999999999998299 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMT 483 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~ 483 (806) -||+.|.||.||+---|+|---||-+.. T Consensus 185 nLlf~G~~G~GKTfLa~~IA~ell~~g~ 212 (330) T PRK06835 185 NLLFYGNTGTGKTFLSNCIAKELLDRGK 212 (330) T ss_pred CEEEECCCCCCHHHHHHHHHHHHHHCCC T ss_conf 6698899999889999999999998799 No 361 >cd03292 ABC_FtsE_transporter FtsE is a hydrophilic nucleotide-binding protein that binds FtsX to form a heterodimeric ATP-binding cassette (ABC)-type transporter that associates with the bacterial inner membrane. The FtsE/X transporter is thought to be involved in cell division and is important for assembly or stability of the septal ring. Probab=61.95 E-value=6.4 Score=19.05 Aligned_cols=150 Identities=17% Similarity=0.220 Sum_probs=70.6 Q ss_pred EEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHH---H-----------HHHCCCCHHHHCCC Q ss_conf 785410020235530477406799999999999829957847888523100---1-----------11027703431223 Q gi|254780799|r 447 IIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKML---E-----------LSVYDGIPNLLTPV 512 (806) Q Consensus 447 vv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~v---E-----------ls~Y~~iPHLl~pV 512 (806) +-.++.+==.+-|-|-.|||||..+|.| +-|.+.+-=++.+==.|-+.+ + +.-|+=+||| T Consensus 20 vsl~i~~Ge~v~i~GpSGsGKSTLl~~i--~gl~~p~sG~i~i~g~~~~~~~~~~~~~~Rr~iG~VfQ~~~L~~~l---- 93 (214) T cd03292 20 INISISAGEFVFLVGPSGAGKSTLLKLI--YKEELPTSGTIRVNGQDVSDLRGRAIPYLRRKIGVVFQDFRLLPDR---- 93 (214) T ss_pred EEEEECCCCEEEEECCCCCHHHHHHHHH--HCCCCCCCEEEEECCEECCCCCHHHHHHHHCCEEEEEECCCCCCCC---- T ss_conf 1779859989999979995399999999--6298988649999999989899778999866749990187647999---- Q ss_pred CCCHHHHHHHH---HHHHHHHHHHH-HHHHHCCCCCHHHH---------HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 34304566899---99999999999-99987089968999---------9999988744786677544677654543222 Q gi|254780799|r 513 VTNPQKAVTVL---KWLVCEMEERY-QKMSKIGVRNIDGF---------NLKVAQYHNTGKKFNRTVQTGFDRKTGEAIY 579 (806) Q Consensus 513 vTd~~kA~~aL---~w~V~EMe~RY-~l~a~~~vRni~~y---------N~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 579 (806) |=-...+-+| .+--+|+.+|- +++...|..+.... -+|+.-|++- T Consensus 94 -tV~eNv~~~l~~~~~~~~~~~~rv~~~L~~vgL~~~~~~~p~~LSGGqkQRvaIARAL--------------------- 151 (214) T cd03292 94 -NVYENVAFALEVTGVPPREIRKRVPAALELVGLSHKHRALPAELSGGEQQRVAIARAI--------------------- 151 (214) T ss_pred -CHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHCCCHHHCCHHHHHHHHHHHHH--------------------- T ss_conf -7999999999984999999999999999877996575499424888999999999999--------------------- Q ss_pred CCCCCCCCCCCEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 3322323469868776344-68888732100588999999866414237999965777 Q gi|254780799|r 580 ETEHFDFQHMPYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 580 ~~~~~~~~~lp~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) -.=|- |++-|| .+.|==...++|-+.+..| +..|+-.|++|.-+. T Consensus 152 -------v~~P~-ill~DEPT~~LD~~~~~~i~~ll~~l----~~~g~Tii~vTHd~~ 197 (214) T cd03292 152 -------VNSPT-ILIADEPTGNLDPDTTWEIMNLLKKI----NKAGTTVVVATHAKE 197 (214) T ss_pred -------HCCCC-EEEECCCCCCCCHHHHHHHHHHHHHH----HHCCCEEEEECCCHH T ss_conf -------72999-99983987877989999999999999----850999999898989 No 362 >PRK10419 nikE nickel transporter ATP-binding protein; Provisional Probab=61.84 E-value=9.4 Score=17.80 Aligned_cols=164 Identities=16% Similarity=0.226 Sum_probs=72.2 Q ss_pred EEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHH-------------HCCCCHHHHCCCC Q ss_conf 785410020235530477406799999999999829957847888523100111-------------0277034312233 Q gi|254780799|r 447 IIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELS-------------VYDGIPNLLTPVV 513 (806) Q Consensus 447 vv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls-------------~Y~~iPHLl~pVv 513 (806) |-.++.+==.+-|.|..|||||.-.++|.- | .+|+.=...+=+=....++ +|.+-..-+.|.. T Consensus 31 vs~~i~~GE~l~ivGeSGsGKSTL~r~i~g--l--~~p~sG~I~~~g~~l~~~~~~~~~~~rr~i~~VfQ~~~~slnP~~ 106 (266) T PRK10419 31 VSLTLKSGETVALLGRSGCGKSTLARLLVG--L--ESPSQGNISWRGEPLAKLNRAQRKAFRRDIQMVFQDSISAVNPRK 106 (266) T ss_pred CEEEECCCCEEEEECCCCCHHHHHHHHHHC--C--CCCCCEEEEECCEECCCCCHHHHHHHHCCCEEEEECCHHHCCHHH T ss_conf 175888998999999999779999999966--9--999962998899956758999999975473899739136368164 Q ss_pred CCHHHHHHHHHH----HHHHH-HHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 430456689999----99999-9999999870899689999999988744786677544677654543222332232346 Q gi|254780799|r 514 TNPQKAVTVLKW----LVCEM-EERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQH 588 (806) Q Consensus 514 Td~~kA~~aL~w----~V~EM-e~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 588 (806) |=-+.....|+. --.|. ++--++|...|... +-+|+...+ -..|+.. ++ .-.-.+-. T Consensus 107 tv~~~i~epl~~~~~~~~~~~~~~~~~~L~~vgL~~-~~~~~yP~e-LSGGq~Q--RV--------------aIArAL~~ 168 (266) T PRK10419 107 TVREILREPLRHLLSLDKAERLARASEMLKAVDLDD-SVLDKRPPQ-LSGGQLQ--RV--------------CLARALAV 168 (266) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCH-HHHHCCHHH-CCHHHHH--HH--------------HHHHHHCC T ss_conf 899999999998149999999999999998749988-987178433-7927877--78--------------98666406 Q ss_pred CCEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 9868776344-68888732100588999999866414237999965777 Q gi|254780799|r 589 MPYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 589 lp~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) =|-+ +|-|| .+-|=.+...+|-+.+.+|. +.-|+.+|+-|.--+ T Consensus 169 ~P~l-Li~DEPtsaLD~~~q~~il~ll~~l~---~~~g~t~i~ITHDl~ 213 (266) T PRK10419 169 EPKL-LILDEAVSNLDLVLQAGVIRLLKKLQ---QQFGTACLFITHDLR 213 (266) T ss_pred CCCE-EEEECCCCCCCHHHHHHHHHHHHHHH---HHHCCEEEEECCCHH T ss_conf 9878-99968865369999999999999999---975989999889999 No 363 >PRK11169 leucine-responsive transcriptional regulator; Provisional Probab=61.72 E-value=11 Score=17.19 Aligned_cols=31 Identities=16% Similarity=0.158 Sum_probs=12.0 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHHCCEE Q ss_conf 4468888732100588999999866414237 Q gi|254780799|r 597 DEMADLMMVARKDIESAVQRLAQMARASGIH 627 (806) Q Consensus 597 DElaDlmm~~~~~ve~~i~rlaq~ara~GiH 627 (806) -|+|+.+=++..-|-.=|.||-+.+==-|.+ T Consensus 32 ~eLA~~vglS~stv~~RikrLe~~GiI~gy~ 62 (164) T PRK11169 32 VELSKRVGLSPTPCLERVRRLERQGFIQGYT 62 (164) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCEEEEE T ss_conf 9999998929999999999999789886689 No 364 >TIGR00634 recN DNA repair protein RecN; InterPro: IPR004604 DNA repair protein RecN is thought to be DNA damage inducible and involved in recombinational processes. The N-terminal region of most of the bacterial RecN proteins sequenced to date contains an ATP/GTP binding domain within an SMC-like motif. SMC-like domains are involved in chromosomal scaffolding and segregation. It is possible that the function of RecN in homologous recombination is either structural or enzymatic or both. RecN may be involved in the proper positioning of the recombining segments of DNA, ensuring normal recombination. The observation that inactivation of this gene leads to a decreased transformation efficiency, as well as increased sensitivity to DNA-damaging agents, may be due to some defect in chromosomal partitioning or positioning during these recombination-dependent processes . The protein may function presynaptically to process double-stranded breaks to produce 3 single-stranded DNA intermediates during recombination. ; GO: 0005524 ATP binding, 0006281 DNA repair, 0006310 DNA recombination. Probab=61.72 E-value=4.5 Score=20.16 Aligned_cols=26 Identities=15% Similarity=0.248 Sum_probs=11.0 Q ss_pred HHHHHHHHHHCCCCEEEEEEEECCEE Q ss_conf 89999997424863289984104424 Q gi|254780799|r 336 ACTLKSVLSDFGIQGEIVNVRPGPVI 361 (806) Q Consensus 336 A~lLE~tL~dFGVe~~Vv~v~pGPVV 361 (806) |..+=+.|+.--..-+|+-||.=|-| T Consensus 507 A~~ia~~L~~LS~~~QV~cvTHlp~v 532 (605) T TIGR00634 507 AQAIAKKLAQLSERKQVLCVTHLPQV 532 (605) T ss_pred HHHHHHHHHHHHHHCCEEEECCCHHH T ss_conf 99999999998530958998183798 No 365 >TIGR02773 addB_Gpos ATP-dependent nuclease subunit B; InterPro: IPR014140 DNA repair is accomplished by several different systems in prokaryotes. Recombinational repair of double-stranded DNA breaks involves the RecBCD pathway in some lineages, and AddAB (also called RexAB) in others. AddA is conserved between the firmicutes and the alphaproteobacteria, while its partner protein (RexB) is not. This entry describes the ATP-dependent nuclease subunit B (AddB/RexB) protein as found Bacillus subtilis and related species. Although the RexB protein of Streptococcus and Lactococcus is considered to be orthologous and functionally equivalent, merely named differently, all members of this protein family have a P-loop nucleotide binding motif GxxGxGK[ST] at the N-terminus, unlike RexB proteins, and a CxxCxxxxxC motif at the C-terminus, both of which may be relevant to function.. Probab=61.66 E-value=5.2 Score=19.72 Aligned_cols=14 Identities=29% Similarity=0.458 Sum_probs=6.2 Q ss_pred HHHHHHHHHHHCCC Q ss_conf 78999999742486 Q gi|254780799|r 335 NACTLKSVLSDFGI 348 (806) Q Consensus 335 nA~lLE~tL~dFGV 348 (806) -..+++.+|.+|+| T Consensus 358 Y~~Lv~~vf~~YeI 371 (1192) T TIGR02773 358 YEKLVEAVFSDYEI 371 (1192) T ss_pred HHHHHHHHHCCCCC T ss_conf 68999998513778 No 366 >PRK13649 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=61.66 E-value=6.6 Score=18.95 Aligned_cols=205 Identities=21% Similarity=0.254 Sum_probs=98.3 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCH----------HHHHHCCCC----H--HHHCCC Q ss_conf 541002023553047740679999999999982995784788852310----------011102770----3--431223 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKM----------LELSVYDGI----P--NLLTPV 512 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~----------vEls~Y~~i----P--HLl~pV 512 (806) .++.+==-+.|-|.+|||||.-++.|-- | ..|++=+... |-+- -++.-+-|+ | .|.... T Consensus 28 l~i~~Ge~~aiiG~nGsGKSTLl~~l~G--l--~~p~~G~i~~-~g~~i~~~~~~~~~~~~~~~iG~vfQ~p~~ql~~~t 102 (280) T PRK13649 28 LDILDGSYTAFIGHTGSGKSTIMQLLNG--L--HVPTTGIVSV-DDTDITSHSKNKEIKSIRKKVGLVFQFPESQLFEET 102 (280) T ss_pred EEECCCCEEEEECCCCCHHHHHHHHHHC--C--CCCCCEEEEE-CCEECCCCCCHHHHHHHHHHEEEEEECCCCCCCCCH T ss_conf 8987998999995999869999999966--9--9988608999-999987778201399998764699746521236030 Q ss_pred CCCHHHHHHHHH---HHHHHHHHHH-HHHHHCCCC-CH--------HHH-HHHHHHHHHCCCCCCCCCCCCCCCCCCCCC Q ss_conf 343045668999---9999999999-999870899-68--------999-999998874478667754467765454322 Q gi|254780799|r 513 VTNPQKAVTVLK---WLVCEMEERY-QKMSKIGVR-NI--------DGF-NLKVAQYHNTGKKFNRTVQTGFDRKTGEAI 578 (806) Q Consensus 513 vTd~~kA~~aL~---w~V~EMe~RY-~l~a~~~vR-ni--------~~y-N~k~~~~~~~~~~~~~~~~~~~~~~~~~~~ 578 (806) |-| ..+..++ +.-.|+++|. +.+...|-. ++ .|= -+|+.-|.. T Consensus 103 V~e--ev~fg~~~~g~~~~e~~~~v~~~l~~~gL~e~~~~r~p~~LSGGqkqRvaiA~a--------------------- 159 (280) T PRK13649 103 VLK--DVAFGPQNFGVSPEEAEALAREKLALVGISENLFEKNPFELSGGQMRRVAIAGI--------------------- 159 (280) T ss_pred HHH--HHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCHHHHHHHHHHHH--------------------- T ss_conf 999--998689886999999999999999876997466542900099999999999999--------------------- Q ss_pred CCCCCCCCCCCCEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEEC Q ss_conf 23322323469868776344-68888732100588999999866414237999965777535543554110251587645 Q gi|254780799|r 579 YETEHFDFQHMPYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVS 657 (806) Q Consensus 579 ~~~~~~~~~~lp~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~ 657 (806) +-.=|-| +|-|| .+-|=..+.+++-+.|.+|.| .|+-.|+.|.+-. . =+++--||.+= T Consensus 160 -------L~~~P~i-LllDEPTsgLDp~~~~~i~~ll~~l~~----~G~Tii~vTHdl~--~-----v~~~aDrv~vl-- 218 (280) T PRK13649 160 -------LAMEPKI-LVLDEPTAGLDPKGRKELMTIFKKLHQ----SGMTIVLVTHLMD--D-----VANYADFVYVL-- 218 (280) T ss_pred -------HHCCCCE-EEEECCCCCCCHHHHHHHHHHHHHHHH----CCCEEEEECCCHH--H-----HHHHCCEEEEE-- T ss_conf -------9749999-998487554899999999999999986----3999999875899--9-----99979999999-- Q ss_pred CCCCCHHHCCCCCHHHHCCCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHCCCCC Q ss_conf 86642123388645786588754773689832588833489889999999997128974 Q gi|254780799|r 658 SKIDSRTILGEQGAEQLLGQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKTQGEAK 716 (806) Q Consensus 658 s~~dSrtild~~gae~Llg~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q~~~~ 716 (806) -+-|.|.|. -.+.++ +.+..++.+.+.-..+.++.+.+++.|-|. T Consensus 219 --~~G~iv~~G-~p~evf-----------~~~~~l~~~~l~~P~~~~~~~~L~~~g~~~ 263 (280) T PRK13649 219 --EKGKIVLSG-KPKDIF-----------QQVSFLEKKQLGVPKITKFAQRLVDRGIPF 263 (280) T ss_pred --ECCEEEEEC-CHHHHH-----------CCHHHHHHCCCCCCHHHHHHHHHHHCCCCC T ss_conf --899999987-899997-----------598999877999991999999999759999 No 367 >PRK11598 putative cell division protein; Provisional Probab=61.38 E-value=11 Score=17.15 Aligned_cols=24 Identities=17% Similarity=0.123 Sum_probs=15.5 Q ss_pred HHHHHHHHHHHCCCCEEEEEEEEC Q ss_conf 789999997424863289984104 Q gi|254780799|r 335 NACTLKSVLSDFGIQGEIVNVRPG 358 (806) Q Consensus 335 nA~lLE~tL~dFGVe~~Vv~v~pG 358 (806) +.+-|-+.|+.=|++.--.+-..| T Consensus 304 ~~~nlldil~~aG~~v~W~~N~~g 327 (545) T PRK11598 304 HQEGLLDIIQRAGINVLWNDNDGG 327 (545) T ss_pred CCCCHHHHHHHCCCEEEEEECCCC T ss_conf 356699999986986999727987 No 368 >PRK13538 cytochrome c biogenesis protein CcmA; Provisional Probab=61.30 E-value=9.4 Score=17.80 Aligned_cols=45 Identities=18% Similarity=0.300 Sum_probs=28.1 Q ss_pred CCCCCCEEEEECC----CCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEE Q ss_conf 5666666785410----020235530477406799999999999829957847 Q gi|254780799|r 440 KSIEGKPIIADLA----RMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCR 488 (806) Q Consensus 440 KdI~G~pvv~DLa----kMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evk 488 (806) |...+++++-|+. +==.+.|-|.-|||||--+|+| +.++ +|+.=+ T Consensus 9 ~~~g~~~vl~~vs~~i~~Ge~~~l~G~NGaGKSTLl~~l-~Gl~---~p~~G~ 57 (204) T PRK13538 9 CERDERILFEGLSFTLNAGELVQIEGPNGAGKTSLLRIL-AGLA---RPDAGE 57 (204) T ss_pred EEECCEEEEECEEEEECCCCEEEEECCCCCCHHHHHHHH-HCCC---CCCCCE T ss_conf 998999998051779879989999999998599999999-7688---888737 No 369 >cd00268 DEADc DEAD-box helicases. A diverse family of proteins involved in ATP-dependent RNA unwinding, needed in a variety of cellular processes including splicing, ribosome biogenesis and RNA degradation. The name derives from the sequence of the Walker B motif (motif II). This domain contains the ATP- binding region. Probab=61.29 E-value=11 Score=17.33 Aligned_cols=39 Identities=26% Similarity=0.307 Sum_probs=22.5 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHH-CCHHHEEEEEECC Q ss_conf 23553047740679999999999982-9957847888523 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYR-MTPAQCRLIMIDP 494 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk-~~P~evkliliDP 494 (806) ++++..-|||||..|-=-=++.-+.+ ...+.++-+-+=| T Consensus 38 dvi~~a~TGSGKTlay~lpil~~l~~~~~~~~~~alil~P 77 (203) T cd00268 38 DVIGQAQTGSGKTAAFLIPILEKLDPSPKKDGPQALILAP 77 (203) T ss_pred CEEEECCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEEEC T ss_conf 8899757997222888869999986166768966999968 No 370 >cd03231 ABC_CcmA_heme_exporter CcmA, the ATP-binding component of the bacterial CcmAB transporter. The CCM family is involved in bacterial cytochrome c biogenesis. Cytochrome c maturation in E. coli requires the ccm operon, which encodes eight membrane proteins (CcmABCDEFGH). CcmE is a periplasmic heme chaperone that binds heme covalently and transfers it onto apocytochrome c in the presence of CcmF, CcmG, and CcmH. The CcmAB proteins represent an ABC transporter and the CcmCD proteins participate in heme transfer to CcmE. Probab=61.21 E-value=9.4 Score=17.79 Aligned_cols=38 Identities=24% Similarity=0.454 Sum_probs=26.1 Q ss_pred CCCCCCCCEEEEECC----CCCEEEEEECCCCHHHHHHHHHH Q ss_conf 045666666785410----02023553047740679999999 Q gi|254780799|r 438 LGKSIEGKPIIADLA----RMPHLLIAGTTGSGKSVAINTMI 475 (806) Q Consensus 438 LGKdI~G~pvv~DLa----kMPHLLIAGtTGSGKSV~iN~iI 475 (806) |.+...|++++-|+. +==-+-|-|..|||||--+++|. T Consensus 6 ls~~~g~~~il~~vs~~i~~Ge~~~l~G~NGsGKSTLlk~i~ 47 (201) T cd03231 6 LTCERDGRALFSGLSFTLAAGEALQVTGPNGSGKTTLLRILA 47 (201) T ss_pred EEEEECCEEEEECCEEEECCCCEEEEECCCCCCHHHHHHHHH T ss_conf 999999999995307888799599999999999999999996 No 371 >cd03300 ABC_PotA_N PotA is an ABC-type transporter and the ATPase component of the spermidine/putrescine-preferential uptake system consisting of PotA, -B, -C, and -D. PotA has two domains with the N-terminal domain containing the ATPase activity and the residues required for homodimerization with PotA and heterdimerization with PotB. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=61.12 E-value=5.7 Score=19.39 Aligned_cols=158 Identities=22% Similarity=0.354 Sum_probs=75.7 Q ss_pred EEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHH-H-----------HHHCCCCHHHHCCCCCC Q ss_conf 85410020235530477406799999999999829957847888523100-1-----------11027703431223343 Q gi|254780799|r 448 IADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKML-E-----------LSVYDGIPNLLTPVVTN 515 (806) Q Consensus 448 v~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~v-E-----------ls~Y~~iPHLl~pVvTd 515 (806) -.++.+==-+-|-|-+|||||.-+|+|- -| .+|++=+ |++|-|-+ . +.-|.=-||| |= T Consensus 20 sl~v~~Ge~~~iiGpSGsGKSTllr~i~--Gl--~~p~~G~-I~~~g~~v~~~~~~~r~ig~VfQ~~~Lfp~l-----tV 89 (232) T cd03300 20 SLDIKEGEFFTLLGPSGCGKTTLLRLIA--GF--ETPTSGE-ILLDGKDITNLPPHKRPVNTVFQNYALFPHL-----TV 89 (232) T ss_pred EEEECCCCEEEEECCCCCHHHHHHHHHH--CC--CCCCCEE-EEECCEECCCCCHHHCCEEEEECCCCCCCCC-----CH T ss_conf 7488799899999999983999999997--79--9998539-9999999999995457756991488547789-----19 Q ss_pred HHHHHHHHH---HHHHHHHHHHH-HHHHCCCCCHHH---------HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 045668999---99999999999-998708996899---------99999988744786677544677654543222332 Q gi|254780799|r 516 PQKAVTVLK---WLVCEMEERYQ-KMSKIGVRNIDG---------FNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETE 582 (806) Q Consensus 516 ~~kA~~aL~---w~V~EMe~RY~-l~a~~~vRni~~---------yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 582 (806) -...+..|+ |--.|-++|-+ ++...|..+... --++|.-|++ T Consensus 90 ~~Nva~~l~~~~~~~~e~~~rv~e~l~~v~l~~~~~~~p~~LSGGqkQRVaiARA------------------------- 144 (232) T cd03300 90 FENIAFGLRLKKLPKAEIKERVAEALDLVQLEGYANRKPSQLSGGQQQRVAIARA------------------------- 144 (232) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHCCCHHHCCHHHHHHHHHHHH------------------------- T ss_conf 9998779987699999999999999875897787619966699899999999999------------------------- Q ss_pred CCCCCCCCEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEE Q ss_conf 2323469868776344-68888732100588999999866414237999965777535543554110251587 Q gi|254780799|r 583 HFDFQHMPYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISF 654 (806) Q Consensus 583 ~~~~~~lp~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf 654 (806) +-.=|- |++-|| ++-|=-....++.+.|.+| .+..|+..|+.|--+.. =..+--||++ T Consensus 145 ---l~~~P~-llllDEP~s~LD~~~~~~i~~~l~~l---~~~~~~T~i~VTHd~~e-------a~~ladri~v 203 (232) T cd03300 145 ---LVNEPK-VLLLDEPLGALDLKLRKDMQLELKRL---QKELGITFVFVTHDQEE-------ALTMSDRIAV 203 (232) T ss_pred ---HHCCCC-EEEECCCCCCCCHHHHHHHHHHHHHH---HHHHCCEEEEECCCHHH-------HHHHCCEEEE T ss_conf ---865999-99980887646999999999999999---99859999999999999-------9996999999 No 372 >PRK12288 ribosome-associated GTPase; Reviewed Probab=61.04 E-value=4.4 Score=20.24 Aligned_cols=90 Identities=21% Similarity=0.273 Sum_probs=49.1 Q ss_pred EEEECCCCHHHHHHHHHHHHHHH------------HCCHHHEEEEE-------EC-CCHHHHHHCCCCHHHHCCCCCCHH Q ss_conf 55304774067999999999998------------29957847888-------52-310011102770343122334304 Q gi|254780799|r 458 LIAGTTGSGKSVAINTMILSLLY------------RMTPAQCRLIM-------ID-PKMLELSVYDGIPNLLTPVVTNPQ 517 (806) Q Consensus 458 LIAGtTGSGKSV~iN~iI~SlLy------------k~~P~evklil-------iD-PK~vEls~Y~~iPHLl~pVvTd~~ 517 (806) ..+|..|.|||--||+++-.+-. |+|-.-.+|+- || |=+=||... |+ T Consensus 211 vf~GqSGVGKSSLiN~L~p~~~~~tgeIS~~~~~GrHTTt~~~L~~l~~gG~lIDTPG~ref~L~----~i--------- 277 (344) T PRK12288 211 IFVGQSGVGKSSLINALLPEAEILVGDVSENSGLGQHTTTTARLYHFPHGGDLIDSPGVREFGLW----HL--------- 277 (344) T ss_pred EEEECCCCCHHHHHHHCCCHHHHHHHHHHHHCCCCCCEEEEEEEEECCCCCEEEECCCCCCCCCC----CC--------- T ss_conf 99806876788887610753342232014333886640116899992899879738987655777----78--------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 56689999999999999999870899689999999988744786 Q gi|254780799|r 518 KAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKK 561 (806) Q Consensus 518 kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~ 561 (806) ....|.++-.|+..--....-..++++.+=+=.|++|-+.|+- T Consensus 278 -~~~~l~~~F~Ef~~~~~~CkF~dC~H~~EPgCaV~~Ave~G~I 320 (344) T PRK12288 278 -EPEQVTQGFIEFRDYLGTCKFRDCKHDDDPGCALREAVEEGKI 320 (344) T ss_pred -CHHHHHHHCHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHCCCC T ss_conf -9899998677899986789894898999988689999986998 No 373 >PRK13880 conjugal transfer coupling protein TraG; Provisional Probab=61.04 E-value=2.4 Score=22.23 Aligned_cols=62 Identities=16% Similarity=0.249 Sum_probs=42.2 Q ss_pred EEEEHHHHHHHHHHCCC--HHHHHHHHHHHHHHHCCEEEEEEECCCCC-CCC-C-----HHHHHCCCCEEEEEECCCCC Q ss_conf 87763446888873210--05889999998664142379999657775-355-4-----35541102515876458664 Q gi|254780799|r 592 IVVVIDEMADLMMVARK--DIESAVQRLAQMARASGIHVIMATQRPSV-DVI-T-----GTIKANFPTRISFQVSSKID 661 (806) Q Consensus 592 ivviiDElaDlmm~~~~--~ve~~i~rlaq~ara~GiHli~aTqrPsv-dvi-t-----g~ikan~p~riaf~v~s~~d 661 (806) ...+.|||+-| || -+|.+++ -| |.-||.+.+=+|-.+= +=+ | -.|-.|...||+|+-...-| T Consensus 445 ~LlLLDEF~aL----Grl~iie~Ala---~m-AGYglR~~lI~Qsl~QL~~~y~gYG~~~til~Nc~v~i~yapnd~~t 515 (636) T PRK13880 445 LLMMLDEFPSL----GKLEILQESLA---FV-AGYGIKCYLICQDINQLKSRETGYGHDESITSNCHVQNAYPPNRVET 515 (636) T ss_pred EEEEEECCHHC----CCCHHHHHHHH---HH-CCCCCEEEEEEECHHHHHHHHHCCCCCCHHHHCCCEEEEECCCCHHH T ss_conf 79885151314----88289998899---87-14587899999669998666514573313441560699867898778 No 374 >PRK13650 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=60.98 E-value=7.3 Score=18.60 Aligned_cols=98 Identities=17% Similarity=0.265 Sum_probs=55.3 Q ss_pred EEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCH Q ss_conf 7763446-888873210058899999986641423799996577753554355411025158764586642123388645 Q gi|254780799|r 593 VVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGA 671 (806) Q Consensus 593 vviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~ga 671 (806) |+|-||= +-|=-...+++-..+.+| .+.-|+-.|+.|.+.+ . + | +-.||.+ .+.--|+.++-. T Consensus 158 lLilDEPTs~LD~~~~~~i~~~l~~l---~~~~g~Tvi~iTHdl~--~----v-~-~aDrviv-----m~~G~Iv~~Gtp 221 (276) T PRK13650 158 IIILDEATSMLDPEGRLELIKTIKNI---RDDYQLTVISITHDLD--E----V-A-LSDRVLV-----MKDGQVESTSTP 221 (276) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHH---HHHCCCEEEEEEECHH--H----H-H-CCCEEEE-----EECCEEEEECCH T ss_conf 99983886658999999999999999---9842989999957789--9----9-6-0999999-----989999997689 Q ss_pred HHHCCCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHCCC-CCC Q ss_conf 7865887547736898325888334898899999999971289-742 Q gi|254780799|r 672 EQLLGQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKTQGE-AKY 717 (806) Q Consensus 672 e~Llg~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q~~-~~y 717 (806) +.++-+-|-|- -+-+.-..+.+++..++.+|- .+| T Consensus 222 ~evf~~p~~l~-----------~~~l~~P~~~~~~~~L~~~g~~~~~ 257 (276) T PRK13650 222 RELFSRGDELL-----------QLGLDIPFTTSLVQMLQEEGYDLDY 257 (276) T ss_pred HHHHCCHHHHH-----------HCCCCCCHHHHHHHHHHHCCCCCCC T ss_conf 99974989999-----------7799998699999999965999998 No 375 >PRK10762 D-ribose transporter ATP binding protein; Provisional Probab=60.94 E-value=7.3 Score=18.60 Aligned_cols=29 Identities=17% Similarity=0.326 Sum_probs=22.1 Q ss_pred EEEECCCCCEEEEEECCCCHHHHHHHHHH Q ss_conf 78541002023553047740679999999 Q gi|254780799|r 447 IIADLARMPHLLIAGTTGSGKSVAINTMI 475 (806) Q Consensus 447 vv~DLakMPHLLIAGtTGSGKSV~iN~iI 475 (806) +-.++.+==.+-|+|..|||||-..++|- T Consensus 271 vs~~v~~GEi~gi~G~nGsGKsTL~~~l~ 299 (501) T PRK10762 271 VSFTLRKGEILGVSGLMGAGRTELMKVLY 299 (501) T ss_pred EEEEECCCCEEEEECCCCCCHHHHHHHHH T ss_conf 44476688189966788876889999981 No 376 >PRK10976 putative sugar phosphatase; Provisional Probab=60.81 E-value=8.6 Score=18.06 Aligned_cols=81 Identities=21% Similarity=0.272 Sum_probs=40.4 Q ss_pred CHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCHHHHCCCCCEEEECCCC Q ss_conf 00588999999866414237999965777535543554110251587645866421233886457865887547736898 Q gi|254780799|r 608 KDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGAEQLLGQGDMLYMTGGG 687 (806) Q Consensus 608 ~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~gae~Llg~gdml~~~~~~ 687 (806) +.+-..-...-+++++.|||.++||=||-..+-. + ..++.- |. -++-.+|| .+|-. .+ T Consensus 18 ~~is~~~~~ai~~l~~~Gi~~viATGR~~~~~~~-~-~~~l~~----------~~-~~I~~NGa--------~i~d~-~~ 75 (266) T PRK10976 18 HTLSPYAKETLKLLTARGIHFVFATGRHHVDVGQ-I-RDNLEI----------KS-YMITSNGA--------RVHDT-DG 75 (266) T ss_pred CCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHH-H-HHHCCC----------CC-EEEECCCE--------EEEEC-CC T ss_conf 9719999999999997899999997999799999-9-997399----------74-18962877--------99917-98 Q ss_pred CEEEEEECCCCHHHHHHHHHHHHHC Q ss_conf 3258883348988999999999712 Q gi|254780799|r 688 RVQRIHGPFVSDIEVEKVVSHLKTQ 712 (806) Q Consensus 688 ~~~r~~g~~v~~~ev~~v~~~~~~q 712 (806) + .+.--.++.+.+.++.++++.. T Consensus 76 ~--~i~~~~l~~~~~~~i~~~~~~~ 98 (266) T PRK10976 76 N--LIFSHNLDRDIASDLFGVVNDN 98 (266) T ss_pred E--EEEEECCCHHHHHHHHHHHHHC T ss_conf 1--9997269999999999999768 No 377 >PRK13642 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=60.76 E-value=11 Score=17.40 Aligned_cols=196 Identities=17% Similarity=0.230 Sum_probs=99.3 Q ss_pred EEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHH-------------------HEEEEEECC--CHHHHHHCCCC Q ss_conf 785410020235530477406799999999999829957-------------------847888523--10011102770 Q gi|254780799|r 447 IIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPA-------------------QCRLIMIDP--KMLELSVYDGI 505 (806) Q Consensus 447 vv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~-------------------evkliliDP--K~vEls~Y~~i 505 (806) +-.++.+=-.+-|-|..|||||.....|. .|+ +.+-- ++-+|.=|| .....++++.| T Consensus 26 is~~i~~Ge~~aiiG~sGsGKSTL~~~l~-Gl~-~~~~G~I~~~G~~i~~~~~~~~r~~ig~VfQ~p~~~l~~~tV~e~i 103 (277) T PRK13642 26 VSFSITKGEWVSIIGQNGSGKSTTARLID-GLF-EEFEGIVKIDGERLTAENVWNLRRKIGMVFQNPDNQFVGATVEDDV 103 (277) T ss_pred EEEEECCCCEEEEECCCCCHHHHHHHHHH-CCC-CCCCCEEEECCEECCCCCHHHHHCCCEEEEECCCCCCCCCCHHHHH T ss_conf 07998899899999999968999999996-389-9888489999999985788888517689998976325755088889 Q ss_pred HHHHCCCCCCHHHHHHHHH---HHHHHHHHHHHH-HHHCCCCCHHHH---------HHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 3431223343045668999---999999999999-987089968999---------999998874478667754467765 Q gi|254780799|r 506 PNLLTPVVTNPQKAVTVLK---WLVCEMEERYQK-MSKIGVRNIDGF---------NLKVAQYHNTGKKFNRTVQTGFDR 572 (806) Q Consensus 506 PHLl~pVvTd~~kA~~aL~---w~V~EMe~RY~l-~a~~~vRni~~y---------N~k~~~~~~~~~~~~~~~~~~~~~ 572 (806) - ..++ ..-+||++|.+- +...|..++... -+|+.-|.+ T Consensus 104 ~--------------~g~~~~~~~~~e~~~~v~~~l~~~~l~~~~~~~P~~LSGGqrQRvaIA~a--------------- 154 (277) T PRK13642 104 A--------------FGMENQGIPREEMIKRVDEALLAVNMLDFKTREPARLSGGQKQRVAVAGI--------------- 154 (277) T ss_pred H--------------HHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCHHHCCHHHHHHHHHHHH--------------- T ss_conf 8--------------77766699999999999999987799656557912289999999999999--------------- Q ss_pred CCCCCCCCCCCCCCCCCCEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCE Q ss_conf 45432223322323469868776344-68888732100588999999866414237999965777535543554110251 Q gi|254780799|r 573 KTGEAIYETEHFDFQHMPYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTR 651 (806) Q Consensus 573 ~~~~~~~~~~~~~~~~lp~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~r 651 (806) +-.=|- ++|-|| .+-|=....++|-+.|.+|.+ .-|+-+|+.|..... . +. -.| T Consensus 155 -------------La~~P~-ililDEPTs~LD~~~~~~i~~ll~~L~~---~~~~Tii~iTHdl~~------~-~~-aDr 209 (277) T PRK13642 155 -------------IALRPE-IIILDESTSMLDPTGRSEIMRVIHEIKD---KYHLTVLSITHDLDE------A-AS-SDR 209 (277) T ss_pred -------------HHCCCC-EEEEECCCCCCCHHHHHHHHHHHHHHHH---HCCCEEEEEEECHHH------H-HH-CCE T ss_conf -------------966999-9999588765898999999999999998---169899999458899------9-71-998 Q ss_pred EEEEECCCCCCHHHCCCCCHHHHCCCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHCCC Q ss_conf 587645866421233886457865887547736898325888334898899999999971289 Q gi|254780799|r 652 ISFQVSSKIDSRTILGEQGAEQLLGQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKTQGE 714 (806) Q Consensus 652 iaf~v~s~~dSrtild~~gae~Llg~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q~~ 714 (806) |.+ .+-=.|..+.-.+.++ +.|..++-+.+.-..+-++...+|..|. T Consensus 210 v~v-----m~~G~Iv~~G~~~evf-----------~~p~~l~~~~l~~P~~~~l~~~L~~~g~ 256 (277) T PRK13642 210 ILV-----MRAGEIIKEAAPSELF-----------ATSEDMVEIGLDVPFSSNLMKDLRTNGF 256 (277) T ss_pred EEE-----EECCEEEEECCHHHHH-----------CCHHHHHHCCCCCCHHHHHHHHHHHCCC T ss_conf 999-----9899999976899987-----------6967798779999879999999997599 No 378 >COG3264 Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane] Probab=60.71 E-value=12 Score=17.07 Aligned_cols=11 Identities=18% Similarity=0.377 Sum_probs=5.1 Q ss_pred HHEEEEEECCC Q ss_conf 78478885231 Q gi|254780799|r 485 AQCRLIMIDPK 495 (806) Q Consensus 485 ~evkliliDPK 495 (806) .++..||=||. T Consensus 742 ~~~p~Vl~~P~ 752 (835) T COG3264 742 REHPRVLKDPA 752 (835) T ss_pred HHCCCCCCCCC T ss_conf 73987659999 No 379 >pfam07724 AAA_2 AAA domain (Cdc48 subfamily). This Pfam entry includes some of the AAA proteins not detected by the pfam00004 model. Probab=60.61 E-value=12 Score=17.13 Aligned_cols=100 Identities=24% Similarity=0.425 Sum_probs=53.8 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 23553047740679999999999982995784788852310011102770343122334304566899999999999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEMEERYQ 535 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EMe~RY~ 535 (806) -+|.+|.||.||+- +=-.+...||.. +-+|+-+| ++-|.. +|.+..-+-.|.- |. T Consensus 5 ~~l~~GPsGvGKT~-lAk~la~~l~~~---~~~~i~~d-----m~e~~~-~~~v~~l~g~~~g---------------yv 59 (168) T pfam07724 5 SFLFLGPTGVGKTE-LAKALAELLFGD---ERALIRID-----MSEYME-EHSVSRLIGAPPG---------------YV 59 (168) T ss_pred EEEEECCCCCCHHH-HHHHHHHHHCCC---CCCEEEEC-----CCCCCC-HHHHHHHCCCCCC---------------CC T ss_conf 99988989989999-999999996798---53448855-----756542-5699987058998---------------72 Q ss_pred HHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCHHHHHHH Q ss_conf 99870899689999999988744786677544677654543222332232346986877634468888732100588999 Q gi|254780799|r 536 KMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKDIESAVQ 615 (806) Q Consensus 536 l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~ve~~i~ 615 (806) -.++.|. ..+ .+.+-||=||.+||.-- |..+|-...- T Consensus 60 g~~~~G~-----l~~----------------------------------~v~~~p~~VillDEIeK----a~~~V~~~LL 96 (168) T pfam07724 60 GYEEGGQ-----LTE----------------------------------AVRRKPYSIVLIDEIEK----AHPGVQNDLL 96 (168) T ss_pred CCCCCCC-----HHH----------------------------------HHHHCCCCEEEEHHHHH----HCHHHHHHHH T ss_conf 6242650-----789----------------------------------99838984898657766----5899999999 Q ss_pred HHHHHHHH Q ss_conf 99986641 Q gi|254780799|r 616 RLAQMARA 623 (806) Q Consensus 616 rlaq~ara 623 (806) ++-+-++- T Consensus 97 ~ild~g~~ 104 (168) T pfam07724 97 QILEGGTL 104 (168) T ss_pred HHCCCCCE T ss_conf 87058706 No 380 >cd03262 ABC_HisP_GlnQ_permeases HisP and GlnQ are the ATP-binding components of the bacterial periplasmic histidine and glutamine permeases, repectively. Histidine permease is a multisubunit complex containing the HisQ and HisM integral membrane subunits and two copies of HisP. HisP has properties intermediate between those of integral and peripheral membrane proteins and is accessible from both sides of the membrane, presumably by its interaction with HisQ and HisM. The two HisP subunits form a homodimer within the complex. The domain structure of the amino acid uptake systems is typical for prokaryote extracellular solute binding protein-dependent uptake systems. All of the amino acid uptake systems also have at least one, and in a few cases, two extracellular solute binding proteins located in the periplasm of Gram-negative bacteria, or attached to the cell membrane of Gram-positive bacteria. The best-studied member of the PAAT (polar amino acid transport) family is the HisJQM Probab=60.57 E-value=6.9 Score=18.78 Aligned_cols=145 Identities=23% Similarity=0.326 Sum_probs=77.1 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHH----------------HHHCCCCHHHHCCC Q ss_conf 54100202355304774067999999999998299578478885231001----------------11027703431223 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLE----------------LSVYDGIPNLLTPV 512 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vE----------------ls~Y~~iPHLl~pV 512 (806) .++.+==-+-|=|..|||||.-+|+|-- | -+|++=+ |+||=+-+. +.-|+ |-|- T Consensus 21 l~i~~Ge~~~ivGpSGsGKSTLL~~i~g--L--~~p~~G~-i~i~g~~i~~~~~~~~~~rr~iG~VFQ~~~-----L~p~ 90 (213) T cd03262 21 LTVKKGEVVVIIGPSGSGKSTLLRCINL--L--EEPDSGT-IIIDGLKLTDDKKNINELRQKVGMVFQQFN-----LFPH 90 (213) T ss_pred EEECCCCEEEEECCCCCHHHHHHHHHHC--C--CCCCCEE-EEECCEECCCCHHHHHHHHCCCEEEECCCC-----CCCC T ss_conf 5988998999999998449999999981--9--9998649-999999999981569998678279967987-----5899 Q ss_pred CCCHHHHHHHHHHH----HHHHHHH-HHHHHHCCCCCHHHH---------HHHHHHHHHCCCCCCCCCCCCCCCCCCCCC Q ss_conf 34304566899999----9999999-999987089968999---------999998874478667754467765454322 Q gi|254780799|r 513 VTNPQKAVTVLKWL----VCEMEER-YQKMSKIGVRNIDGF---------NLKVAQYHNTGKKFNRTVQTGFDRKTGEAI 578 (806) Q Consensus 513 vTd~~kA~~aL~w~----V~EMe~R-Y~l~a~~~vRni~~y---------N~k~~~~~~~~~~~~~~~~~~~~~~~~~~~ 578 (806) -|=-...+.+|.+. -+|..+| .+++...|-.+.... -+||.-|++- T Consensus 91 ltv~eNV~~~l~~~~~~~~~e~~~~a~~~L~~vgL~~~~~~~P~~LSGGqqQRVAIARAL-------------------- 150 (213) T cd03262 91 LTVLENITLAPIKVKGMSKAEAEERALELLEKVGLADKADAYPAQLSGGQQQRVAIARAL-------------------- 150 (213) T ss_pred CCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHCCCHHHCCCHHHHHHHHHHHH-------------------- T ss_conf 919999999999976999999999999999868997887499444692999999999996-------------------- Q ss_pred CCCCCCCCCCCCEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 23322323469868776344-68888732100588999999866414237999965777 Q gi|254780799|r 579 YETEHFDFQHMPYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 579 ~~~~~~~~~~lp~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) -.=|- +++-|| .+-|=-...++|.+.+.+|++ - |+-.|+.|.-+. T Consensus 151 --------a~~P~-ilL~DEPts~LD~~~~~~i~~ll~~l~~---~-g~T~i~VTHD~~ 196 (213) T cd03262 151 --------AMNPK-VMLFDEPTSALDPELVGEVLDVMKDLAE---E-GMTMVVVTHEMG 196 (213) T ss_pred --------CCCCC-EEEECCCCCCCCHHHHHHHHHHHHHHHH---C-CCEEEEECCCHH T ss_conf --------37999-9997088887798999999999999986---2-999999998999 No 381 >PRK10982 galactose/methyl galaxtoside transporter ATP-binding protein; Provisional Probab=60.57 E-value=7.4 Score=18.56 Aligned_cols=125 Identities=22% Similarity=0.270 Sum_probs=61.1 Q ss_pred CCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEE---------------EEECCEEEEEEEEECCCC Q ss_conf 03661353320012455563256654578999999742486328998---------------410442444443214786 Q gi|254780799|r 308 VLPSKEILSTSQSPVNQMTFSPKVMQNNACTLKSVLSDFGIQGEIVN---------------VRPGPVITLYELEPAPGI 372 (806) Q Consensus 308 kLPsLdLL~~~~~~~~~~~~s~eeL~~nA~lLE~tL~dFGVe~~Vv~---------------v~pGPVVTrYEi~PApGV 372 (806) .=|.+=+||+|.. ..|..+.++.-++|. .|++-|+-+=++. ..-|=+| +-.|..++ T Consensus 151 ~~p~ililDEPt~-----~LD~~~~~~l~~~l~-~l~~~g~til~itH~l~~v~~~~Drv~vm~~G~iv---~~~~~~~~ 221 (491) T PRK10982 151 YNAKIVIMDEPTS-----SLTEKEVNHLFTIIR-KLKERGCGIVYISHKMEEIFQLCDEITILRDGQWI---ATEPLAGL 221 (491) T ss_pred CCCCEEEECCCCC-----CCCHHHHHHHHHHHH-HHHHCCCEEEEECCCHHHHHHCCCEEEECCCCEEE---EECCHHHC T ss_conf 3998898158734-----558788899988888-87742853678624367442158678975697498---54661118 Q ss_pred CHHHHHHHHHHHHHHHHHCCC--CC---CCCCCCCEEEEEECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEE Q ss_conf 399999788999988630012--10---0038861255530245662387342307213430221000210456666667 Q gi|254780799|r 373 KSSRIIGLSDDIARSMSAISA--RV---AVIPRRNAIGIELPNDIRETVMLRDLIVSRVFEKNQCDLAINLGKSIEGKPI 447 (806) Q Consensus 373 KvSKI~nLadDIA~aLsa~sv--RI---apIPGK~~VGIEiPN~~r~~V~lreil~s~~f~~s~~~L~iaLGKdI~G~pv 447 (806) ...++.. .|..... |. ...||... +|+-| + .....+ +=+.+ T Consensus 222 ~~~~~~~-------~~~g~~~~~~~~~~~~~~~~~~--l~~~~----------l------~~~~~~---------~l~~v 267 (491) T PRK10982 222 TMDQIIA-------MMVGRSLTQRFPDKENKPGEVI--LEVRN----------L------TSLRQP---------SIRDV 267 (491) T ss_pred CHHHHHH-------HHHCCCHHHHCCCCCCCCCCEE--EEEEC----------C------CCCCCC---------CEEEE T ss_conf 9999997-------6506222431786457987617--99952----------3------237887---------42026 Q ss_pred EEECCCCCEEEEEECCCCHHHHHHHHHH Q ss_conf 8541002023553047740679999999 Q gi|254780799|r 448 IADLARMPHLLIAGTTGSGKSVAINTMI 475 (806) Q Consensus 448 v~DLakMPHLLIAGtTGSGKSV~iN~iI 475 (806) -.++.+==.+-|+|.-|||||--.+.|- T Consensus 268 sf~v~~GEivgl~G~nGsGKsTL~~~l~ 295 (491) T PRK10982 268 SFDLHKGEILGIAGLVGAKRTDIVETLF 295 (491) T ss_pred EEEEECCCEEEEECCCCCCHHHHHHHHH T ss_conf 7999689689977899997889999981 No 382 >PRK12289 ribosome-associated GTPase; Reviewed Probab=60.51 E-value=5.1 Score=19.77 Aligned_cols=35 Identities=34% Similarity=0.527 Sum_probs=22.2 Q ss_pred EEEECCCCHHHHHHHHHHHHHHH------------HCCHHHEEEEEE Q ss_conf 55304774067999999999998------------299578478885 Q gi|254780799|r 458 LIAGTTGSGKSVAINTMILSLLY------------RMTPAQCRLIMI 492 (806) Q Consensus 458 LIAGtTGSGKSV~iN~iI~SlLy------------k~~P~evklili 492 (806) .++|..|-|||--||+++-.+-. |||-..+.|+-+ T Consensus 175 v~~G~SGVGKSSLIN~L~p~~~l~TgeVS~k~~rGrHTTr~~eL~~L 221 (351) T PRK12289 175 VVAGPSGVGKSSLINRLIPDVELRVGEVSGKLQRGRHTTRHVELFEL 221 (351) T ss_pred EEECCCCCCHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCEEEEEEC T ss_conf 99817988788988763741234566540648998783551689994 No 383 >pfam07088 GvpD GvpD gas vesicle protein. This family consists of several archaeal GvpD gas vesicle proteins. GvpD is thought to be involved in the regulation of gas vesicle formation. Probab=60.40 E-value=8.7 Score=18.03 Aligned_cols=18 Identities=22% Similarity=0.261 Sum_probs=12.8 Q ss_pred CCCCEEEEEHHHHHHHHH Q ss_conf 469868776344688887 Q gi|254780799|r 587 QHMPYIVVVIDEMADLMM 604 (806) Q Consensus 587 ~~lp~ivviiDElaDlmm 604 (806) ...|++.||==+-|-+|. T Consensus 387 s~gp~l~iis~D~l~~~f 404 (484) T pfam07088 387 SKGPLLHVISMDTAYLAF 404 (484) T ss_pred CCCCEEEEEEHHHHHHHH T ss_conf 589829998568999987 No 384 >cd03256 ABC_PhnC_transporter ABC-type phosphate/phosphonate transport system. Phosphonates are a class of organophosphorus compounds characterized by a chemically stable carbon-to-phosphorus (C-P) bond. Phosphonates are widespread among naturally occurring compounds in all kingdoms of wildlife, but only procaryotic microorganisms are able to cleave this bond. Certain bacteria such as E. coli can use alkylphosphonates as a phosphorus source. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=60.36 E-value=5.9 Score=19.28 Aligned_cols=66 Identities=24% Similarity=0.412 Sum_probs=36.5 Q ss_pred EEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCH Q ss_conf 776344-6888873210058899999986641423799996577753554355411025158764586642123388645 Q gi|254780799|r 593 VVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGA 671 (806) Q Consensus 593 vviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~ga 671 (806) +++.|| .+.|=-...++|-+.+.+|+ +.-|+-.|+.|.-++. =+.+-.||.+ --|-|.|-|..-. T Consensus 165 ill~DEPts~LD~~~~~~i~~ll~~l~---~~~g~Tii~vtHdl~~-------a~~~adri~~----l~~G~iv~~G~p~ 230 (241) T cd03256 165 LILADEPVASLDPASSRQVMDLLKRIN---REEGITVIVSLHQVDL-------AREYADRIVG----LKDGRIVFDGPPA 230 (241) T ss_pred EEEECCCCCCCCHHHHHHHHHHHHHHH---HHCCCEEEEECCCHHH-------HHHHCCEEEE----EECCEEEEECCHH T ss_conf 899628766589999999999999999---8519899999579899-------9996998999----9699999987869 Q ss_pred H Q ss_conf 7 Q gi|254780799|r 672 E 672 (806) Q Consensus 672 e 672 (806) | T Consensus 231 e 231 (241) T cd03256 231 E 231 (241) T ss_pred H T ss_conf 9 No 385 >PRK09473 oppD oligopeptide transporter ATP-binding component; Provisional Probab=60.24 E-value=6.4 Score=19.02 Aligned_cols=171 Identities=22% Similarity=0.271 Sum_probs=81.9 Q ss_pred EEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHC-----------------CH--------HHEEEEEECCCHHHHHHC Q ss_conf 85410020235530477406799999999999829-----------------95--------784788852310011102 Q gi|254780799|r 448 IADLARMPHLLIAGTTGSGKSVAINTMILSLLYRM-----------------TP--------AQCRLIMIDPKMLELSVY 502 (806) Q Consensus 448 v~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~-----------------~P--------~evkliliDPK~vEls~Y 502 (806) -.++.+==-+=|.|-.|||||.-.++|.- ||-.. ++ .++-+|.=|| +--| T Consensus 36 sf~i~~GEilgivGeSGsGKSTl~~~i~g-ll~~~~~~sG~I~~~G~~i~~~~~~~~~~~r~~~I~~vfQdp-~~sL--- 110 (330) T PRK09473 36 NFSLRAGETLGIVGESGSGKSQTAFALMG-LLAANGRIGGSATFNGREILNLPEKELNKLRAEQISMIFQDP-MTSL--- 110 (330) T ss_pred EEEECCCCEEEEECCCCCHHHHHHHHHHC-CCCCCCCEEEEEEECCEECCCCCHHHHHHHHCCCEEEEEECC-CHHC--- T ss_conf 76888998999986898779999999976-888888335899999998665899999986306679996075-0113--- Q ss_pred CCCHHHHCCCCCCHHHHHHHHHHH-----HHHHHHHHHHHHHCCCCCHH------------HHHHHHHHHHHCCCCCCCC Q ss_conf 770343122334304566899999-----99999999999870899689------------9999999887447866775 Q gi|254780799|r 503 DGIPNLLTPVVTNPQKAVTVLKWL-----VCEMEERYQKMSKIGVRNID------------GFNLKVAQYHNTGKKFNRT 565 (806) Q Consensus 503 ~~iPHLl~pVvTd~~kA~~aL~w~-----V~EMe~RY~l~a~~~vRni~------------~yN~k~~~~~~~~~~~~~~ 565 (806) .|+.|=-+.-...|+.- -.-.++..++|...|.-+.. |-.+||.-|++- T Consensus 111 -------nP~~~i~~~l~e~l~~~~~~~~~~~~~~~~~lL~~v~l~~~~~~l~~yP~eLSGGq~QRV~IArAL------- 176 (330) T PRK09473 111 -------NPYMRVGEQLMEVLMLHKGMSKAEAFEESVRMLDAVKMPEARKRMKMYPHEFSGGMRQRVMIAMAL------- 176 (330) T ss_pred -------CCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCHHHHHHHCCHHHCCHHHHHHHHHHHHH------- T ss_conf -------841045665557898853899899999999888876072178887348153398899999999999------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHH-HHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHH Q ss_conf 446776545432223322323469868776344688-8873210058899999986641423799996577753554355 Q gi|254780799|r 566 VQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMAD-LMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTI 644 (806) Q Consensus 566 ~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaD-lmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~i 644 (806) -.=|- ++|-||=-- |=.+..++|-+.|.+|. |--|+-+|+-|.--++ T Consensus 177 ---------------------~~~P~-lLI~DEPTsaLDv~~q~~Il~ll~~l~---~e~g~til~ITHDl~~------- 224 (330) T PRK09473 177 ---------------------LCRPK-LLIADEPTTALDVTVQAQIMTLLNELK---REFNTAIIMITHDLGV------- 224 (330) T ss_pred ---------------------HCCCC-EEEECCCCCCCCHHHHHHHHHHHHHHH---HHCCCCEEEECCCHHH------- T ss_conf ---------------------70999-999738755479999999999999999---9749947998288999------- Q ss_pred HHCCCCEEEEEECCCC----CCHHHCCCC Q ss_conf 4110251587645866----421233886 Q gi|254780799|r 645 KANFPTRISFQVSSKI----DSRTILGEQ 669 (806) Q Consensus 645 kan~p~riaf~v~s~~----dSrtild~~ 669 (806) =+.+--||+--=.-+| +...|++.+ T Consensus 225 v~~~~DrI~VMy~G~iVE~G~~~~i~~~P 253 (330) T PRK09473 225 VAGICDKVLVMYAGRTMEYGNARDVFYQP 253 (330) T ss_pred HHHHCCEEEEEECCEEEEECCHHHHHHCC T ss_conf 99869989999898899978899997379 No 386 >PRK05580 primosome assembly protein PriA; Validated Probab=60.20 E-value=12 Score=17.01 Aligned_cols=101 Identities=21% Similarity=0.223 Sum_probs=70.2 Q ss_pred EEEEEHHHHHHHHHHCC--CHHHHHHHHHHHHHHHCCE-----EEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCH Q ss_conf 68776344688887321--0058899999986641423-----7999965777535543554110251587645866421 Q gi|254780799|r 591 YIVVVIDEMADLMMVAR--KDIESAVQRLAQMARASGI-----HVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSR 663 (806) Q Consensus 591 ~ivviiDElaDlmm~~~--~~ve~~i~rlaq~ara~Gi-----Hli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSr 663 (806) -.|+|+| ||.++..+ +--|...++|.|-|.=||= .+|+-|--|.-.++.-+++-|..+ T Consensus 523 ~lv~vld--aD~~l~~pd~ra~E~~~qll~qvagragr~~~~g~viiQt~~p~~~~~~~l~~~d~~~------------- 587 (699) T PRK05580 523 TLVGVLD--ADTGLFSPDFRAAERTFQLLTQVAGRAGRAEKPGEVLIQTYEPEHPLLQALLRWDYDA------------- 587 (699) T ss_pred EEEEEEC--HHHHCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCHHHHHHHHCCHHH------------- T ss_conf 6998732--2653548883479999999999987655678898799993799989999999689999------------- Q ss_pred HHCCCCCHHHHCCCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHC Q ss_conf 2338864578658875477368983258883348988999999999712 Q gi|254780799|r 664 TILGEQGAEQLLGQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKTQ 712 (806) Q Consensus 664 tild~~gae~Llg~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q 712 (806) =+++-|..=-.+.+||-++..++.+.=-+.+.+.+..+.+.+. T Consensus 588 ------f~~~el~~R~~~~~PPf~~l~~i~~~~~~~~~~~~~~~~~~~~ 630 (699) T PRK05580 588 ------FAEAELAERREAGLPPFVRLALVRGSAKAVAKALKFAQQLALL 630 (699) T ss_pred ------HHHHHHHHHHHCCCCCHHHHHEEEEECCCHHHHHHHHHHHHHH T ss_conf ------9999999999709998687517899539999999999999998 No 387 >PRK13643 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=60.19 E-value=7.1 Score=18.70 Aligned_cols=98 Identities=18% Similarity=0.237 Sum_probs=57.3 Q ss_pred EEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCH Q ss_conf 7763446-888873210058899999986641423799996577753554355411025158764586642123388645 Q gi|254780799|r 593 VVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGA 671 (806) Q Consensus 593 vviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~ga 671 (806) |+|-||= |-|=-...+++-+.+.+|.+ .|.-.|+.|..... =|.+-.||.. --|-|++.|..-. T Consensus 165 vLlLDEPTs~LDp~~~~~i~~ll~~l~~----~G~TiI~vtHd~~~-------v~~~adrviv----l~~G~Iv~~Gtp~ 229 (288) T PRK13643 165 VLVLDEPTAGLDPKARIEMMQLFESIHQ----SGQTVVLVTHLMDD-------VADYADYVYL----LEKGHIISCGTPS 229 (288) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHHH----CCCEEEEECCCHHH-------HHHHCCEEEE----EECCEEEEECCHH T ss_conf 9999588555899999999999999995----39999998608999-------9997999999----9899999987889 Q ss_pred HHHCCCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHCCCCCC Q ss_conf 7865887547736898325888334898899999999971289742 Q gi|254780799|r 672 EQLLGQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKTQGEAKY 717 (806) Q Consensus 672 e~Llg~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q~~~~y 717 (806) .++ +.+..++.+.+.-..+-++..-++..|...+ T Consensus 230 -evf-----------~~~~~l~~~~l~~P~~~~l~~~L~~~g~~~~ 263 (288) T PRK13643 230 -DVF-----------QEVDFLKAHELGVPKATHFADQLQKTGAVTF 263 (288) T ss_pred -HHH-----------CCHHHHHHCCCCCCHHHHHHHHHHHCCCCCC T ss_conf -986-----------6999999769999849999999997699886 No 388 >KOG2670 consensus Probab=60.08 E-value=12 Score=17.08 Aligned_cols=60 Identities=33% Similarity=0.434 Sum_probs=38.5 Q ss_pred CCEEEEEHHHHHHHHHHCCCHHHHHHHH---------------------HHHHHHHCCEEEEEEECC----------CCC Q ss_conf 9868776344688887321005889999---------------------998664142379999657----------775 Q gi|254780799|r 589 MPYIVVVIDEMADLMMVARKDIESAVQR---------------------LAQMARASGIHVIMATQR----------PSV 637 (806) Q Consensus 589 lp~ivviiDElaDlmm~~~~~ve~~i~r---------------------laq~ara~GiHli~aTqr----------Psv 637 (806) .--|-||=| ||-+|-+|-|+.+|.+ -+++||++|--..+--.. =+| T Consensus 312 ~~~iqiVgD---DLtvTnpkri~~Ai~~k~cN~LLlKvNQIGtvtEsieA~~~a~~~GwgvMvSHRSGETeDtFIaDL~V 388 (433) T KOG2670 312 EVGIQIVGD---DLTVTNPKRIATAIEEKACNALLLKVNQIGTVTESIEAAKLARSAGWGVMVSHRSGETEDTFIADLVV 388 (433) T ss_pred CCCEEEECC---CCCCCCHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHH T ss_conf 564499637---53215889999999874114157530115304999999999987596078752578864302666332 Q ss_pred CCCCHHHHHCCCCE Q ss_conf 35543554110251 Q gi|254780799|r 638 DVITGTIKANFPTR 651 (806) Q Consensus 638 dvitg~ikan~p~r 651 (806) -+-||-||+--|.| T Consensus 389 Gl~tgqIKtGApcR 402 (433) T KOG2670 389 GLGTGQIKTGAPCR 402 (433) T ss_pred HHCCCCCCCCCCCH T ss_conf 10124012689750 No 389 >PRK11629 lolD lipoprotein transporter ATP-binding subunit; Provisional Probab=60.04 E-value=11 Score=17.30 Aligned_cols=38 Identities=26% Similarity=0.386 Sum_probs=25.7 Q ss_pred EEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEE Q ss_conf 785410020235530477406799999999999829957847 Q gi|254780799|r 447 IIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCR 488 (806) Q Consensus 447 vv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evk 488 (806) +-.++.+==.+-|.|..|||||..++. |.-+ .+|++=+ T Consensus 28 vs~~i~~GE~v~iiG~sGsGKSTLl~~-i~Gl---~~p~~G~ 65 (233) T PRK11629 28 VSFSIGEGEMMAIVGSSGSGKSTLLHL-LGGL---DTPTSGD 65 (233) T ss_pred EEEEECCCCEEEEECCCCCCHHHHHHH-HHCC---CCCCCEE T ss_conf 289988998999999999409999999-9669---9998639 No 390 >cd03299 ABC_ModC_like Archeal protein closely related to ModC. ModC is an ABC-type transporter and the ATPase component of a molybdate transport system that also includes the periplasmic binding protein ModA and the membrane protein ModB. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=60.03 E-value=7.6 Score=18.47 Aligned_cols=163 Identities=19% Similarity=0.294 Sum_probs=83.1 Q ss_pred EEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHC--------CCCHHHHCCCCCCHHH Q ss_conf 78541002023553047740679999999999982995784788852310011102--------7703431223343045 Q gi|254780799|r 447 IIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVY--------DGIPNLLTPVVTNPQK 518 (806) Q Consensus 447 vv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y--------~~iPHLl~pVvTd~~k 518 (806) +-.++.+==.+-|-|-.|||||..+++|.- | .+|+.=+..+=+-.+..++.. .+ +-|-|--|=-.. T Consensus 18 vs~~v~~Ge~~~iiGpSGsGKSTLlr~i~G--l--~~p~~G~I~~~G~di~~~~~~~r~ig~vfQ~--~~Lfp~~tV~eN 91 (235) T cd03299 18 VSLEVERGDYFVILGPTGSGKSVLLETIAG--F--IKPDSGKILLNGKDITNLPPEKRDISYVPQN--YALFPHMTVYKN 91 (235) T ss_pred CEEEECCCCEEEEECCCCCHHHHHHHHHHC--C--CCCCCEEEEECCEECCCCCHHHCCEEEECCC--CCCCCCCCHHHH T ss_conf 487988998999999996359999999974--9--9999659999999999999767897894579--866899909999 Q ss_pred HHHHHHH---HHHHHHHH-HHHHHHCCCCCHHH---------HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 6689999---99999999-99998708996899---------99999988744786677544677654543222332232 Q gi|254780799|r 519 AVTVLKW---LVCEMEER-YQKMSKIGVRNIDG---------FNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFD 585 (806) Q Consensus 519 A~~aL~w---~V~EMe~R-Y~l~a~~~vRni~~---------yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 585 (806) -+.+|+. --.|+++| .+++...|..++.. --+||.-|++ T Consensus 92 i~~~l~~~~~~~~e~~~rv~e~l~~~gl~~~~~~~p~~LSGGq~QRVaiARA---------------------------- 143 (235) T cd03299 92 IAYGLKKRKVDKKEIERKVLEIAEMLGIDHLLNRKPETLSGGEQQRVAIARA---------------------------- 143 (235) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCHHHCCHHHHHHHHHHHH---------------------------- T ss_conf 9999987699999999999999987799778748944589999999999999---------------------------- Q ss_pred CCCCCEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEE Q ss_conf 3469868776344-68888732100588999999866414237999965777535543554110251587 Q gi|254780799|r 586 FQHMPYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISF 654 (806) Q Consensus 586 ~~~lp~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf 654 (806) +-.=|-+ ++-|| ++-|=-....++-+.+.+ ..+.-|+-.|+.|.-++. =+.+--||++ T Consensus 144 l~~~P~l-lllDEP~s~LD~~~~~~i~~~l~~---l~~~~~~T~i~vTHd~~~-------a~~~aDri~v 202 (235) T cd03299 144 LVVNPKI-LLLDEPFSALDVRTKEKLREELKK---IRKEFGVTVLHVTHDFEE-------AWALADKVAI 202 (235) T ss_pred HHCCCCE-EEEECCCCCCCHHHHHHHHHHHHH---HHHHHCCEEEEECCCHHH-------HHHHCCEEEE T ss_conf 9738998-999288764699999999999999---999829999998789999-------9996999999 No 391 >CHL00131 ycf16 sulfate ABC transporter protein; Validated Probab=60.01 E-value=7.6 Score=18.47 Aligned_cols=28 Identities=25% Similarity=0.220 Sum_probs=21.5 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHH Q ss_conf 5410020235530477406799999999 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMIL 476 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~ 476 (806) .++.+==.+-|.|..|||||--+|+|.= T Consensus 27 l~i~~Gei~aiiG~nGsGKSTL~~~i~G 54 (252) T CHL00131 27 LSINAGEIHAIMGPNGSGKSTLSKVIAG 54 (252) T ss_pred EEECCCCEEEEECCCCCCHHHHHHHHCC T ss_conf 7887998999999999999999999727 No 392 >PRK11440 hypothetical protein; Provisional Probab=59.78 E-value=8.5 Score=18.10 Aligned_cols=42 Identities=29% Similarity=0.388 Sum_probs=28.7 Q ss_pred EEEEHHHHHHHHHH-----CCCHHHHHHHHHHHHHHHCCEEEEEEEC Q ss_conf 87763446888873-----2100588999999866414237999965 Q gi|254780799|r 592 IVVVIDEMADLMMV-----ARKDIESAVQRLAQMARASGIHVIMATQ 633 (806) Q Consensus 592 ivviiDElaDlmm~-----~~~~ve~~i~rlaq~ara~GiHli~aTq 633 (806) =++|||=-.|.+-- ...++-..+.||...||++|++.|..+. T Consensus 10 ALlvID~Qn~f~~~~~~~~~~~~~i~~~~~L~~~aR~~g~pVi~v~~ 56 (188) T PRK11440 10 ALVVIDLQEGILPFAGGPHTADEVVARAARLAAKFRASGSPVVLVRV 56 (188) T ss_pred EEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEEE T ss_conf 79999377885486778547999999999999999987990999986 No 393 >PRK13647 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=59.55 E-value=8 Score=18.31 Aligned_cols=151 Identities=19% Similarity=0.220 Sum_probs=73.5 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCC---HHHHHHCCCC----H--HHHCCCCCCHHHH Q ss_conf 54100202355304774067999999999998299578478885231---0011102770----3--4312233430456 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPK---MLELSVYDGI----P--NLLTPVVTNPQKA 519 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK---~vEls~Y~~i----P--HLl~pVvTd~~kA 519 (806) ..+.+==.+-|-|..|||||-.++.|-- ||+.+--++++-=.|.. .-|+.-.=|+ | ++....|- ... T Consensus 26 ~~i~~Ge~~aiiG~NGsGKSTLl~~l~G--l~~p~~G~I~i~G~~i~~~~~~~lr~~iG~vfQ~p~~ql~~~tV~--e~v 101 (273) T PRK13647 26 LVIPEGSKTAILGPNGAGKSTLLLHLNG--IYTAQRGRVKVMGKVVNEENIRWIRKKVGLVFQDPDDQLFSSTVW--DDV 101 (273) T ss_pred EEECCCCEEEEECCCCCHHHHHHHHHHC--CCCCCCEEEEECCEECCCCCHHHHHHHCEEEECCCHHHHCCCHHH--HHH T ss_conf 8988998999999999759999999966--988886199999999996898999875248810702430524199--999 Q ss_pred HHHH---HHHHHHHHHHH-HHHHHCCCCCHHHH--------H-HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 6899---99999999999-99987089968999--------9-9999887447866775446776545432223322323 Q gi|254780799|r 520 VTVL---KWLVCEMEERY-QKMSKIGVRNIDGF--------N-LKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDF 586 (806) Q Consensus 520 ~~aL---~w~V~EMe~RY-~l~a~~~vRni~~y--------N-~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 586 (806) +.++ ...-+||++|. +++...|..++..- . +|+.-|.. + T Consensus 102 ~fg~~~~g~~~~e~~~rv~~~L~~~~l~~~~~~~p~~LSGGqkqRvaiA~a----------------------------L 153 (273) T PRK13647 102 AFGPQNLELDKKEILRRVEEALKAVDMWDLREKAPYHLSYGQKKRVAIAGV----------------------------L 153 (273) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCHHHCCHHHHHHHHHHHH----------------------------H T ss_conf 999988599999999999999987795876647933399989999999999----------------------------9 Q ss_pred CCCCEEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 4698687763446-8888732100588999999866414237999965777 Q gi|254780799|r 587 QHMPYIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 587 ~~lp~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) -.=|- ++|-||= |-|=..+.+++-..+.+|.+. |+-.|++|.+.. T Consensus 154 ~~~P~-lliLDEPtagLDp~~~~~l~~~l~~L~~~----G~Tvi~vtHdl~ 199 (273) T PRK13647 154 AMEPD-IIVLDEPMAYLDPRGKEELTAILNRLNNE----GKTVIVATHDVD 199 (273) T ss_pred HHCCC-EEEEECCCCCCCHHHHHHHHHHHHHHHHC----CCEEEEEECCHH T ss_conf 81999-99997976579999999999999999848----999999941789 No 394 >cd03301 ABC_MalK_N The N-terminal ATPase domain of the maltose transporter, MalK. ATP binding cassette (ABC) proteins function from bacteria to human, mediating the translocation of substances into and out of cells or organelles. ABC transporters contain two transmembrane-spanning domains (TMDs) or subunits and two nucleotide binding domains (NBDs) or subunits that couple transport to the hydrolysis of ATP. In the maltose transport system, the periplasmic maltose binding protein (MBP) stimulates the ATPase activity of the membrane-associated transporter, which consists of two transmembrane subunits, MalF and MalG, and two copies of the ATP binding subunit, MalK, and becomes tightly bound to the transporter in the catalytic transition state, ensuring that maltose is passed to the transporter as ATP is hydrolyzed. Probab=59.47 E-value=11 Score=17.20 Aligned_cols=147 Identities=21% Similarity=0.287 Sum_probs=70.7 Q ss_pred EEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHH------------HHHCCCCHHHHCCCCCC Q ss_conf 854100202355304774067999999999998299578478885231001------------11027703431223343 Q gi|254780799|r 448 IADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLE------------LSVYDGIPNLLTPVVTN 515 (806) Q Consensus 448 v~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vE------------ls~Y~~iPHLl~pVvTd 515 (806) -.++.+==.+-|-|..|||||.-+|+|-- |+ +|+.=. |.+|=|.+. +.-|. |-|--|= T Consensus 20 sl~v~~Ge~~~i~GpSG~GKSTlLr~iaG--l~--~p~~G~-I~~~g~~i~~~~~~~R~ig~VfQ~~~-----LfP~ltV 89 (213) T cd03301 20 NLDIADGEFVVLLGPSGCGKTTTLRMIAG--LE--EPTSGR-IYIGGRDVTDLPPKDRDIAMVFQNYA-----LYPHMTV 89 (213) T ss_pred EEEECCCCEEEEECCCCCHHHHHHHHHHC--CC--CCCCEE-EEECCEECCCCCHHHCCEEEEECCCC-----CCCCCCH T ss_conf 77986998999999998809999999976--99--998639-99999999999976788789945876-----4654709 Q ss_pred HHHHHHHHH---HHHHHHHHHH-HHHHHCCCCCHHH---------HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 045668999---9999999999-9998708996899---------99999988744786677544677654543222332 Q gi|254780799|r 516 PQKAVTVLK---WLVCEMEERY-QKMSKIGVRNIDG---------FNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETE 582 (806) Q Consensus 516 ~~kA~~aL~---w~V~EMe~RY-~l~a~~~vRni~~---------yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 582 (806) -..-+..|+ +.-.|+++|- +++...|..++.. --+|+.-|++- T Consensus 90 ~eNI~~~l~~~~~~~~e~~~~v~~~l~~~gl~~~~~~~P~~LSGGqkQRVaiARAl------------------------ 145 (213) T cd03301 90 YDNIAFGLKLRKVPKDEIDERVREVAELLQIEHLLDRKPKQLSGGQRQRVALGRAI------------------------ 145 (213) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHCCCHHHCCHHHHHHHHHHHHH------------------------ T ss_conf 99999899985999899999999999875992465099556999999999999998------------------------ Q ss_pred CCCCCCCCEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 2323469868776344-68888732100588999999866414237999965777 Q gi|254780799|r 583 HFDFQHMPYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 583 ~~~~~~lp~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) -.=|-| ++-|| ++-|=-....++.+. |.+.-+.-|+-.|+.|--+. T Consensus 146 ----~~~P~l-LLlDEP~saLD~~~r~~i~~~---l~~~~~~~~~T~i~vTHd~~ 192 (213) T cd03301 146 ----VREPKV-FLMDEPLSNLDAKLRVQMRAE---LKRLQQRLGTTTIYVTHDQV 192 (213) T ss_pred ----HCCCCE-EEECCCCCCCCHHHHHHHHHH---HHHHHHHHCCEEEEECCCHH T ss_conf ----759998-998388764298999999999---99999974998999999989 No 395 >COG0572 Udk Uridine kinase [Nucleotide transport and metabolism] Probab=59.46 E-value=9.3 Score=17.83 Aligned_cols=103 Identities=21% Similarity=0.200 Sum_probs=46.0 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHC--CCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 2355304774067999999999998299578478885231001110277034312--23343045668999999999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLT--PVVTNPQKAVTVLKWLVCEMEER 533 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~--pVvTd~~kA~~aL~w~V~EMe~R 533 (806) =..|||..|||||-. +-.|...=+.+ +..+|- +=+-|.+.-|+-. ...+|- ....|+.| T Consensus 10 iIgIaG~SgSGKTTv-----~~~l~~~~~~~-~~~~I~----~D~YYk~~~~~~~~~~~~~n~-d~p~A~D~-------- 70 (218) T COG0572 10 IIGIAGGSGSGKTTV-----AKELSEQLGVE-KVVVIS----LDDYYKDQSHLPFEERNKINY-DHPEAFDL-------- 70 (218) T ss_pred EEEEECCCCCCHHHH-----HHHHHHHHCCC-CCEEEE----CCCCCCCHHHCCHHHCCCCCC-CCHHHHCH-------- T ss_conf 999867987788999-----99999982867-524765----223202530166755378574-48234368-------- Q ss_pred HHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHH Q ss_conf 9999870899689999999988744786677544677654543222332232346986877634 Q gi|254780799|r 534 YQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVID 597 (806) Q Consensus 534 Y~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiD 597 (806) +=+++.+. ..+.|++..-|+ +|-.++.... +...-.|.=||||+ T Consensus 71 ------------dLl~~~L~-~L~~g~~v~~P~---yd~~~~~r~~----~~i~~~p~~VIIvE 114 (218) T COG0572 71 ------------DLLIEHLK-DLKQGKPVDLPV---YDYKTHTREP----ETIKVEPNDVVIVE 114 (218) T ss_pred ------------HHHHHHHH-HHHCCCCCCCCC---CCHHCCCCCC----CCCCCCCCCEEEEE T ss_conf ------------99999999-997699224564---2031363257----73313897289994 No 396 >KOG1547 consensus Probab=59.42 E-value=10 Score=17.46 Aligned_cols=45 Identities=27% Similarity=0.376 Sum_probs=31.2 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHH Q ss_conf 2355304774067999999999998299578478885231001110 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSV 501 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~ 501 (806) ...|-|..|.|||-.|||+-.|-+.+.+-.+-.= .==||-+|+.. T Consensus 48 NIMVVgqSglgkstlinTlf~s~v~~~s~~~~~~-~p~pkT~eik~ 92 (336) T KOG1547 48 NIMVVGQSGLGKSTLINTLFKSHVSDSSSSDNSA-EPIPKTTEIKS 92 (336) T ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHCCCCCCCC-CCCCCEEEEEE T ss_conf 7999806877711567888888876125897656-75564278875 No 397 >COG4598 HisP ABC-type histidine transport system, ATPase component [Amino acid transport and metabolism] Probab=59.25 E-value=6 Score=19.23 Aligned_cols=82 Identities=26% Similarity=0.432 Sum_probs=47.5 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHH---------HEE--------EEEECCCHHH---------HHHCCCCHHH-- Q ss_conf 35530477406799999999999829957---------847--------8885231001---------1102770343-- Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPA---------QCR--------LIMIDPKMLE---------LSVYDGIPNL-- 508 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~---------evk--------liliDPK~vE---------ls~Y~~iPHL-- 508 (806) .-|-|+.|||||-++-+|= +| -+|. |+| ++-.|+|.++ +..||=-+|+ T Consensus 35 isIIGsSGSGKSTfLRCiN--~L--E~P~~G~I~v~geei~~k~~~~G~l~~ad~~q~~r~Rs~L~mVFQ~FNLWsHmtv 110 (256) T COG4598 35 ISIIGSSGSGKSTFLRCIN--FL--EKPSAGSIRVNGEEIRLKRDKDGQLKPADKRQLQRLRTRLGMVFQHFNLWSHMTV 110 (256) T ss_pred EEEECCCCCCHHHHHHHHH--HH--CCCCCCEEEECCEEEEEEECCCCCEEECCHHHHHHHHHHHHHHHHHCCHHHHHHH T ss_conf 9996589986268999998--63--4888754888786898640788976107899999999886676542324677899 Q ss_pred HCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 122334304566899999999999999998708996 Q gi|254780799|r 509 LTPVVTNPQKAVTVLKWLVCEMEERYQKMSKIGVRN 544 (806) Q Consensus 509 l~pVvTd~~kA~~aL~w~V~EMe~RY~l~a~~~vRn 544 (806) |.-||--|-.....=+--. -|+--+++++.|.-+ T Consensus 111 LeNViEaPvhVLg~~k~ea--~e~Ae~~L~kVGi~e 144 (256) T COG4598 111 LENVIEAPVHVLGVSKAEA--IERAEKYLAKVGIAE 144 (256) T ss_pred HHHHHHCCHHHHCCCHHHH--HHHHHHHHHHHCCHH T ss_conf 9888750367655778899--999999999717154 No 398 >KOG0760 consensus Probab=59.08 E-value=11 Score=17.17 Aligned_cols=51 Identities=25% Similarity=0.290 Sum_probs=20.6 Q ss_pred HHHHHHHHCCCEEE-EHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHCCCCCEEE Q ss_conf 99999996598500-014222001177899999999977986802278872673 Q gi|254780799|r 745 QAVDIVLRDNKASI-SYIQRRLGIGYNRAASIIENMEEKGVIGPASSTGKREIL 797 (806) Q Consensus 745 ~a~~~v~~~~~~s~-s~lqr~~~igy~raar~~~~~e~~giv~~~~g~~~r~vl 797 (806) +++.-+.++|..|+ |.+||+=--|..+|+|.|= +..|+-|=..|-+||-+. T Consensus 224 DvvKT~Lq~q~s~~~~~~~~~k~~gi~~a~R~Iy--~~~G~~gf~rG~~~Rm~~ 275 (302) T KOG0760 224 DVVKTLLQTQGSSALSILIRRKASGISDAFRTIY--QKHGVKGFFRGLKPRMVY 275 (302) T ss_pred HHHHHHHHHHCCHHHHHHHHHHHCCHHHHHHHHH--HHCCCHHHHHCCCCEEEE T ss_conf 9999999761002367888875401999999999--733721565456640430 No 399 >TIGR01073 pcrA ATP-dependent DNA helicase PcrA; InterPro: IPR005751 The DNA helicase PcrA is found mainly in Gram-positive bacteria and belongs to a superfamily of helicases, together with Rep and UvrD helicases from Escherichia coli. The action of the 3'5' bacterial DNA helicase is based upon two separate but coupled activities, ssDNA translocation and duplex destabilisation, and is driven by energy derived from the continuous ATP-binding and hydrolysis events that take place in the active-site cleft. The resulting conformational changes that accompany these events underpin the coupling process and allow the helicase to translocate along the DNA, destabilising the duplex and separating the two strands in an active process .; GO: 0004003 ATP-dependent DNA helicase activity, 0006268 DNA unwinding during replication, 0005737 cytoplasm. Probab=59.02 E-value=7.1 Score=18.70 Aligned_cols=20 Identities=20% Similarity=0.207 Sum_probs=11.1 Q ss_pred HHHHHHHHHHHCCCCEEEEE Q ss_conf 78999999742486328998 Q gi|254780799|r 335 NACTLKSVLSDFGIQGEIVN 354 (806) Q Consensus 335 nA~lLE~tL~dFGVe~~Vv~ 354 (806) .+|.+|++|-.=|+.=+||+ T Consensus 415 QSR~~EE~l~k~n~pY~iVG 434 (811) T TIGR01073 415 QSRVFEETLLKANIPYKIVG 434 (811) T ss_pred CCHHHHHHHHHCCCCEEEEC T ss_conf 00789999986489806634 No 400 >PRK12727 flagellar biosynthesis regulator FlhF; Provisional Probab=58.93 E-value=12 Score=16.86 Aligned_cols=64 Identities=19% Similarity=0.359 Sum_probs=52.6 Q ss_pred ECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHH----HHHHCCCCHHHHCCCCCCHHHHHHHHH Q ss_conf 0477406799999999999829957847888523100----111027703431223343045668999 Q gi|254780799|r 461 GTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKML----ELSVYDGIPNLLTPVVTNPQKAVTVLK 524 (806) Q Consensus 461 GtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~v----Els~Y~~iPHLl~pVvTd~~kA~~aL~ 524 (806) |-||.||.--|--|=.-...++.++.|=||=.|-=++ -|..|-.|=..-.-|+.|+..-..+|. T Consensus 355 GpTGvGKTTT~aKlAa~~~~~~g~~~valit~DtyRiga~eQL~~y~~ilgvpv~~~~~~~~l~~~l~ 422 (557) T PRK12727 355 GPTGAGKTTTIAKLAQRFAAQHAPRDVALVTTDTQRVGGREQLHSYGRQLGIAVHEADSAESLLDLLE 422 (557) T ss_pred CCCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHHHHHHCCEEEEECCHHHHHHHHH T ss_conf 37776731179999999999739981899972664087999999999983975798289999999999 No 401 >TIGR03597 GTPase_YqeH ribosome biogenesis GTPase YqeH. This family describes YqeH, a member of a larger family of GTPases involved in ribosome biogenesis. Like YqlF, it shows a cyclical permutation relative to GTPases EngA (in which the GTPase domain is duplicated), Era, and others. Members of this protein family are found in a relatively small number of bacterial species, including Bacillus subtilis but not Escherichia coli. Probab=58.83 E-value=8.1 Score=18.25 Aligned_cols=81 Identities=22% Similarity=0.340 Sum_probs=49.0 Q ss_pred EEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCCCEEEEHHHCCCHHHHHCCCCCHHHCC Q ss_conf 24444432147863999997889999886300121000388612555302456623873423072134302210002104 Q gi|254780799|r 360 VITLYELEPAPGIKSSRIIGLSDDIARSMSAISARVAVIPRRNAIGIELPNDIRETVMLRDLIVSRVFEKNQCDLAINLG 439 (806) Q Consensus 360 VVTrYEi~PApGVKvSKI~nLadDIA~aLsa~svRIapIPGK~~VGIEiPN~~r~~V~lreil~s~~f~~s~~~L~iaLG 439 (806) |+|.-.+-|. -++.++|.+--.+.++.......-|.-+-+|.--||| ++++.= ++. .-| T Consensus 96 V~NK~DLLP~-~~~~~~i~~wv~~~~~~~gl~~~~V~lvSa~~g~gi~------------~l~~~i--~~~------~~~ 154 (360) T TIGR03597 96 VGNKIDLLPK-SVNLSKIKEWMKKRAKELGLKPVDIILVSAKKGNGID------------ELLDKI--KKA------RNK 154 (360) T ss_pred EEEHHHCCCC-CCCHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCHH------------HHHHHH--HHH------HCC T ss_conf 9980542887-6787999999999999859983668999688898999------------999999--987------169 Q ss_pred CCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHH Q ss_conf 5666666785410020235530477406799999999 Q gi|254780799|r 440 KSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMIL 476 (806) Q Consensus 440 KdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~ 476 (806) +| ..+-|+|-+|||-.||+|+- T Consensus 155 ~d---------------vyvvG~tNvGKSTliN~Ll~ 176 (360) T TIGR03597 155 KD---------------VYVVGVTNVGKSSLINKLLK 176 (360) T ss_pred CC---------------EEEECCCCCCHHHHHHHHHH T ss_conf 95---------------89991686658999999987 No 402 >KOG3030 consensus Probab=58.82 E-value=13 Score=16.84 Aligned_cols=46 Identities=11% Similarity=-0.108 Sum_probs=20.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHC--CC-----------HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999827--65-----------23678999999999999999999 Q gi|254780799|r 83 GIASVFFLPPPTMWALSLLFD--KK-----------IYCFSKRATAWLINILVSATFFA 128 (806) Q Consensus 83 G~~Ayllpl~Ll~~g~~ll~~--k~-----------~~~~~~Rl~~~il~ll~~s~ll~ 128 (806) ...++++|+..++++..+... ++ ......|.++..++-++...++. T Consensus 68 ~~i~~~~P~~vI~v~e~~r~~~~~~~~~~~~~~~~~~~~~l~~~~~~~lfgl~~t~~~t 126 (317) T KOG3030 68 LAIAVLLPLLVILVVEFIRACLKSKSTESNICCLNPDVRRLYRFVGVFLFGLAATQLFT 126 (317) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999865999999999998752023334555420899999999999999999999999 No 403 >TIGR01821 5aminolev_synth 5-aminolevulinic acid synthase; InterPro: IPR010961 Tetrapyrroles are large macrocyclic compounds derived from a common biosynthetic pathway . The end-product, uroporphyrinogen III, is used to synthesise a number of important molecules, including vitamin B12, haem, sirohaem, chlorophyll, coenzyme F430 and phytochromobilin . The first stage in tetrapyrrole synthesis is the synthesis of 5-aminoaevulinic acid ALA via two possible routes: (1) condensation of succinyl CoA and glycine (C4 pathway) using ALA synthase (2.3.1.37 from EC), or (2) decarboxylation of glutamate (C5 pathway) via three different enzymes, glutamyl-tRNA synthetase (6.1.1.17 from EC) to charge a tRNA with glutamate, glutamyl-tRNA reductase (1.2.1.70 from EC) to reduce glutamyl-tRNA to glutamate-1-semialdehyde (GSA), and GSA aminotransferase (5.4.3.8 from EC) to catalyse a transamination reaction to produce ALA. The second stage is to convert ALA to uroporphyrinogen III, the first macrocyclic tetrapyrrolic structure in the pathway. This is achieved by the action of three enzymes in one common pathway: porphobilinogen (PBG) synthase (or ALA dehydratase, 4.2.1.24 from EC) to condense two ALA molecules to generate porphobilinogen; hydroxymethylbilane synthase (or PBG deaminase, 2.5.1.61 from EC) to polymerise four PBG molecules into preuroporphyrinogen (tetrapyrrole structure); and uroporphyrinogen III synthase (4.2.1.75 from EC) to link two pyrrole units together (rings A and D) to yield uroporphyrinogen III. Uroporphyrinogen III is the first branch point of the pathway. To synthesise cobalamin (vitamin B12), sirohaem, and coenzyme F430, uroporphyrinogen III needs to be converted into precorrin-2 by the action of uroporphyrinogen III methyltransferase (2.1.1.107 from EC). To synthesise haem and chlorophyll, uroporphyrinogen III needs to be decarboxylated into coproporphyrinogen III by the action of uroporphyrinogen III decarboxylase (4.1.1.37 from EC) . This entry represents 5-aminoaevulinic acid (ALA) synthase (2.3.1.37 from EC), which catalyses the first stage of tetrapyrrole biosynthesis by the C4 pathway, namely the condensation of succinyl CoA and glycine. ALA synthase is a pyridoxal-phosphate-dependent enzyme. During catalysis, glycine initially binds to the enzyme cofactor, and after condensation with succinyl CoA, CoA, carbon dioxide and 5-aminolevulinic acid are produced .; GO: 0003870 5-aminolevulinate synthase activity, 0030170 pyridoxal phosphate binding, 0033014 tetrapyrrole biosynthetic process. Probab=58.81 E-value=3.8 Score=20.71 Aligned_cols=44 Identities=20% Similarity=0.111 Sum_probs=25.9 Q ss_pred HHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCE Q ss_conf 1353320012455563256654578999999742486328998410442 Q gi|254780799|r 312 KEILSTSQSPVNQMTFSPKVMQNNACTLKSVLSDFGIQGEIVNVRPGPV 360 (806) Q Consensus 312 LdLL~~~~~~~~~~~~s~eeL~~nA~lLE~tL~dFGVe~~Vv~v~pGPV 360 (806) +-+|+..+. + .-.+.-|+++..|.+.|.++|+-+=...=+.=|| T Consensus 314 IRhLK~~eg----~-~lR~~hQ~~~~~lK~~L~~~GlPv~~~pSHIvPv 357 (427) T TIGR01821 314 IRHLKESEG----Q-DLRRAHQERVKRLKKLLEALGLPVIDNPSHIVPV 357 (427) T ss_pred HHHHHHCCC----H-HHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEE T ss_conf 999751376----7-7899998999999999995777754887738876 No 404 >pfam02233 PNTB NAD(P) transhydrogenase beta subunit. This family corresponds to the beta subunit of NADP transhydrogenase in prokaryotes, and either the protein N- or C terminal in eukaryotes. The domain is often found in conjunction with pfam01262. Pyridine nucleotide transhydrogenase catalyses the reduction of NAD+ to NADPH. A complete loss of activity occurs upon mutation of Gly314 in E. coli. Probab=58.78 E-value=13 Score=16.84 Aligned_cols=19 Identities=16% Similarity=0.333 Sum_probs=10.8 Q ss_pred HHHHHHHHHHHHHHHHHCCCCEE Q ss_conf 56654578999999742486328 Q gi|254780799|r 329 PKVMQNNACTLKSVLSDFGIQGE 351 (806) Q Consensus 329 ~eeL~~nA~lLE~tL~dFGVe~~ 351 (806) |..+++ |.+.|++-|++++ T Consensus 325 Q~~v~e----l~~~L~~~g~~V~ 343 (464) T pfam02233 325 QHPVAD----LAKLLEERGVNVR 343 (464) T ss_pred HHHHHH----HHHHHHHCCCEEE T ss_conf 899999----9999997898699 No 405 >cd00878 Arf_Arl Arf (ADP-ribosylation factor)/Arl (Arf-like) small GTPases. Arf proteins are activators of phospholipase D isoforms. Unlike Ras proteins they lack cysteine residues at their C-termini and therefore are unlikely to be prenylated. Arfs are N-terminally myristoylated. Members of the Arf family are regulators of vesicle formation in intracellular traffic that interact reversibly with membranes of the secretory and endocytic compartments in a GTP-dependent manner. They depart from other small GTP-binding proteins by a unique structural device, interswitch toggle, that implements front-back communication from N-terminus to the nucleotide binding site. Arf-like (Arl) proteins are close relatives of the Arf, but only Arl1 has been shown to function in membrane traffic like the Arf proteins. Arl2 has an unrelated function in the folding of native tubulin, and Arl4 may function in the nucleus. Most other Arf family proteins are so far relatively poorly characterized. Thu Probab=58.74 E-value=3.9 Score=20.65 Aligned_cols=21 Identities=33% Similarity=0.631 Sum_probs=16.8 Q ss_pred EEEEECCCCHHHHHHHHHHHH Q ss_conf 355304774067999999999 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILS 477 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~S 477 (806) +++.|..|||||--+|.+... T Consensus 2 i~ilG~~~vGKTsll~~l~~~ 22 (158) T cd00878 2 ILILGLDGAGKTTILYKLKLG 22 (158) T ss_pred EEEECCCCCCHHHHHHHHHCC T ss_conf 999999999889999999539 No 406 >PRK10919 ATP-dependent DNA helicase Rep; Provisional Probab=58.61 E-value=13 Score=16.82 Aligned_cols=21 Identities=14% Similarity=0.058 Sum_probs=12.8 Q ss_pred HHHHHHHHHHHCCCCEEEEEE Q ss_conf 789999997424863289984 Q gi|254780799|r 335 NACTLKSVLSDFGIQGEIVNV 355 (806) Q Consensus 335 nA~lLE~tL~dFGVe~~Vv~v 355 (806) .++.+|+.|...||...+.+- T Consensus 354 ~~~~le~~L~~~~IPy~i~g~ 374 (672) T PRK10919 354 QSRVFEKFLMQNRIPYKISGG 374 (672) T ss_pred HHHHHHHHHHHCCCCEEEECC T ss_conf 226899999977998799578 No 407 >cd03238 ABC_UvrA The excision repair protein UvrA; Nucleotide excision repair in eubacteria is a process that repairs DNA damage by the removal of a 12-13-mer oligonucleotide containing the lesion. Recognition and cleavage of the damaged DNA is a multistep ATP-dependent reaction that requires the UvrA, UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases, with UvrA having two ATP binding sites, which have the characteristic signature of the family of ABC proteins, and UvrB having one ATP binding site that is structurally related to that of helicases. Probab=58.57 E-value=8.9 Score=17.95 Aligned_cols=41 Identities=17% Similarity=0.257 Sum_probs=26.8 Q ss_pred EEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 87763446-8888732100588999999866414237999965777 Q gi|254780799|r 592 IVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 592 ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) -+++.||= +-|=....+++.+.|.+|.+ .|.-.|+.|.++. T Consensus 109 ~ililDEPtsgLD~~~~~~l~~~l~~l~~----~g~TvI~vtHd~~ 150 (176) T cd03238 109 TLFILDEPSTGLHQQDINQLLEVIKGLID----LGNTVILIEHNLD 150 (176) T ss_pred CEEEECCCCCCCCHHHHHHHHHHHHHHHH----CCCEEEEEECCHH T ss_conf 68997177445898799999999999998----7998999947879 No 408 >TIGR00655 PurU formyltetrahydrofolate deformylase; InterPro: IPR004810 An Escherichia coli gene designated purU has been identified and characterised. The gene codes for a 280-amino-acid protein, PurU (P37051 from SWISSPROT, 3.5.1.10 from EC). PurU is an enzyme that catalyses the hydrolysis of 10-formyltetrahydrofolate (formyl-FH4) to FH4 and formate ,.10-formyltetrahydrofolate + H(2)O = formate +tetrahydrofolate Formyl-FH4 hydrolase generates the formate that is used by purT-encoded 5'-phosphoribosylglycinamide transformylase for step three of de novo purine nucleotide synthesis. Formyl-FH4 hydrolase, a hexamer of 32kDa subunits, is activated by methionine and inhibited by glycine. Heterotropic cooperativity is observed for activation by methionine in the presence of glycine and for inhibition by glycine in the presence of methionine. These results suggest that formyl-FH4 hydrolase is a regulatory enzyme whose main function is to balance the pools of FH4 and C1-FH4 in response to changing growth conditions. The enzyme uses methionine and glycine to sense the pools of C1-FH4 and FH4, respectively. ; GO: 0008864 formyltetrahydrofolate deformylase activity, 0006189 'de novo' IMP biosynthetic process. Probab=58.42 E-value=3.6 Score=20.87 Aligned_cols=12 Identities=42% Similarity=0.648 Sum_probs=4.8 Q ss_pred HHHCCCHHHHHH Q ss_conf 873210058899 Q gi|254780799|r 603 MMVARKDIESAV 614 (806) Q Consensus 603 mm~~~~~ve~~i 614 (806) ||-+|||||-.+ T Consensus 260 l~r~GrDiEk~V 271 (294) T TIGR00655 260 LIRAGRDIEKVV 271 (294) T ss_pred HHHCCCCHHHHH T ss_conf 986067512678 No 409 >PRK05419 putative sulfite oxidase subunit YedZ; Reviewed Probab=58.30 E-value=13 Score=16.78 Aligned_cols=61 Identities=18% Similarity=0.130 Sum_probs=36.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 346789999999999999999999-9999999998276523678999999999999999999 Q gi|254780799|r 68 GYGGAIFADVAIQFFGIASVFFLP-PPTMWALSLLFDKKIYCFSKRATAWLINILVSATFFA 128 (806) Q Consensus 68 G~lGA~iAd~L~~lFG~~Ayllpl-~Ll~~g~~ll~~k~~~~~~~Rl~~~il~ll~~s~ll~ 128 (806) |.+|+-=...+...+|..|..+.+ .+.+.-++.+.+.+.....+|.+|...+.-.+.=++. T Consensus 33 ~~LG~dPv~~~~~~tG~~al~~Ll~tL~itPl~~~~~~~~l~~~RR~lGL~aFfYa~lH~~~ 94 (203) T PRK05419 33 GGLGADPVKDIEHFTGLWALVFLLATLAITPLRRLTGQPLLIRTRRLLGLWAFFYALLHLLA 94 (203) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 87899869999999989999999999988799988298699999999999999999999999 No 410 >cd04153 Arl5_Arl8 Arl5/Arl8 subfamily. Arl5 (Arf-like 5) and Arl8, like Arl4 and Arl7, are localized to the nucleus and nucleolus. Arl5 is developmentally regulated during embryogenesis in mice. Human Arl5 interacts with the heterochromatin protein 1-alpha (HP1alpha), a nonhistone chromosomal protein that is associated with heterochromatin and telomeres, and prevents telomere fusion. Arl5 may also play a role in embryonic nuclear dynamics and/or signaling cascades. Arl8 was identified from a fetal cartilage cDNA library. It is found in brain, heart, lung, cartilage, and kidney. No function has been assigned for Arl8 to date. Probab=58.28 E-value=7.1 Score=18.68 Aligned_cols=29 Identities=21% Similarity=0.504 Sum_probs=20.8 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCH Q ss_conf 23553047740679999999999982995 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTP 484 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P 484 (806) -++|.|-.|||||-.+|-+...-.-...| T Consensus 17 KililG~~~sGKTsil~~l~~~~~~~~~p 45 (174) T cd04153 17 KVIIVGLDNAGKTTILYQFLLGEVVHTSP 45 (174) T ss_pred EEEEECCCCCCHHHHHHHHHCCCCCCCCC T ss_conf 99999899998899999997399277167 No 411 >PRK06921 hypothetical protein; Provisional Probab=58.03 E-value=12 Score=17.05 Aligned_cols=25 Identities=24% Similarity=0.456 Sum_probs=22.7 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 3553047740679999999999982 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYR 481 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk 481 (806) |+..|.+|+||+=-.++|.-.||-+ T Consensus 119 l~f~G~~G~GKThLa~aIa~~Ll~~ 143 (265) T PRK06921 119 IALLGQPGSGKTHLLTAAANELMRK 143 (265) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHH T ss_conf 7997289898899999999999996 No 412 >cd03298 ABC_ThiQ_thiamine_transporter ABC-type thiamine tranport system; part of the binding-protein-dependent transport system tbpA-thiPQ for thiamine and TPP. Probably responsible for the translocation of thiamine across the membrane. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=57.83 E-value=7 Score=18.73 Aligned_cols=140 Identities=24% Similarity=0.311 Sum_probs=65.5 Q ss_pred CCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHH------------HHHCCCCHHHHCCCCCCHHHH Q ss_conf 00202355304774067999999999998299578478885231001------------110277034312233430456 Q gi|254780799|r 452 ARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLE------------LSVYDGIPNLLTPVVTNPQKA 519 (806) Q Consensus 452 akMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vE------------ls~Y~~iPHLl~pVvTd~~kA 519 (806) .+==.+-|=|..|||||..+|.| .-+ .+|+.=+ |+||=+-+- +.-|+=.||| |=-... T Consensus 22 ~~Ge~~~ilGpSGsGKSTLl~li-~Gl---~~p~sG~-I~i~G~di~~~~~~~r~ig~vfQ~~~L~p~l-----tV~eNi 91 (211) T cd03298 22 AQGEITAIVGPSGSGKSTLLNLI-AGF---ETPQSGR-VLINGVDVTAAPPADRPVSMLFQENNLFAHL-----TVEQNV 91 (211) T ss_pred CCCCEEEEECCCCCHHHHHHHHH-HCC---CCCCCEE-EEECCEECCCCCHHHCCEEEEECCCCCCCCC-----CHHHHH T ss_conf 89989999999995599999999-769---9988529-9999999999998898679995388668999-----499998 Q ss_pred HHHHH--HH-HHHHHHHH-HHHHHCCCCCHHHHH------------HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 68999--99-99999999-999870899689999------------9999887447866775446776545432223322 Q gi|254780799|r 520 VTVLK--WL-VCEMEERY-QKMSKIGVRNIDGFN------------LKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEH 583 (806) Q Consensus 520 ~~aL~--w~-V~EMe~RY-~l~a~~~vRni~~yN------------~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 583 (806) +-+|. +. -.|..+|- +++...|. .+|- +|+.-|++- T Consensus 92 ~~~l~~~~~~~~~~~~~v~~~l~~~gl---~~~~~~~p~~LSGGqkQRvaiARAL------------------------- 143 (211) T cd03298 92 GLGLSPGLKLTAEDRQAIEVALARVGL---AGLEKRLPGELSGGERQRVALARVL------------------------- 143 (211) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHHCCC---HHHHHCCHHHCCHHHHHHHHHHHHH------------------------- T ss_conf 758864688829999999999987699---8787289455898999999999998------------------------- Q ss_pred CCCCCCCEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 323469868776344-68888732100588999999866414237999965777 Q gi|254780799|r 584 FDFQHMPYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 584 ~~~~~lp~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) -.=|- |++-|| +|-|=....+++-..+.+| .+--|+-.|+.|..|. T Consensus 144 ---~~~P~-ilLlDEPts~LD~~~~~~l~~~l~~l---~~~~~~Tvi~vTHd~~ 190 (211) T cd03298 144 ---VRDKP-VLLLDEPFAALDPALRAEMLDLVLDL---HAETKMTVLMVTHQPE 190 (211) T ss_pred ---HCCCC-EEEECCCCCCCCHHHHHHHHHHHHHH---HHHHCCEEEEECCCHH T ss_conf ---65999-99971887655989999999999999---9974998999988999 No 413 >COG2607 Predicted ATPase (AAA+ superfamily) [General function prediction only] Probab=57.76 E-value=10 Score=17.48 Aligned_cols=71 Identities=21% Similarity=0.331 Sum_probs=42.0 Q ss_pred CCCCCCEEEEHHHCCCHHHHHC--CCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHH Q ss_conf 2456623873423072134302--21000210456666667854100202355304774067999999999998299578 Q gi|254780799|r 409 PNDIRETVMLRDLIVSRVFEKN--QCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQ 486 (806) Q Consensus 409 PN~~r~~V~lreil~s~~f~~s--~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~e 486 (806) |=.+...|.|.+++.-+.-++. +...-++=|. | .-|.|.=|+-|+|||.-+-+++... .-+- T Consensus 50 pv~~~~~i~L~~l~Gvd~qk~~L~~NT~~F~~G~-----p-------ANnVLLwGaRGtGKSSLVKA~~~e~----~~~g 113 (287) T COG2607 50 PVPDPDPIDLADLVGVDRQKEALVRNTEQFAEGL-----P-------ANNVLLWGARGTGKSSLVKALLNEY----ADEG 113 (287) T ss_pred CCCCCCCCCHHHHHCCHHHHHHHHHHHHHHHCCC-----C-------CCCEEEECCCCCCHHHHHHHHHHHH----HHCC T ss_conf 8898997678987273189999999899997288-----6-------5236776377777479999999998----7417 Q ss_pred EEEEEECCC Q ss_conf 478885231 Q gi|254780799|r 487 CRLIMIDPK 495 (806) Q Consensus 487 vkliliDPK 495 (806) +|||=||+- T Consensus 114 lrLVEV~k~ 122 (287) T COG2607 114 LRLVEVDKE 122 (287) T ss_pred CEEEEECHH T ss_conf 707997688 No 414 >KOG0346 consensus Probab=57.75 E-value=8.6 Score=18.07 Aligned_cols=19 Identities=11% Similarity=-0.118 Sum_probs=11.2 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999982 Q gi|254780799|r 84 IASVFFLPPPTMWALSLLF 102 (806) Q Consensus 84 ~~Ayllpl~Ll~~g~~ll~ 102 (806) -+||++|++=.++.....- T Consensus 70 T~AYliPllqkll~~k~t~ 88 (569) T KOG0346 70 TAAYLIPLLQKLLAEKKTN 88 (569) T ss_pred HHHHHHHHHHHHHHHHHCC T ss_conf 3788999999999764036 No 415 >PTZ00012 alpha-tubulin II; Provisional Probab=57.68 E-value=8.4 Score=18.16 Aligned_cols=81 Identities=23% Similarity=0.360 Sum_probs=39.2 Q ss_pred EECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHH-HHCCCCCCHHHHHHHHHHHHHH--------H Q ss_conf 304774067999999999998299578478885231001110277034-3122334304566899999999--------9 Q gi|254780799|r 460 AGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPN-LLTPVVTNPQKAVTVLKWLVCE--------M 530 (806) Q Consensus 460 AGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPH-Ll~pVvTd~~kA~~aL~w~V~E--------M 530 (806) +|=|||| +-+.|+..|.-.-|+-.++ ..++| |+ -..-||++|=.++-+|+|...- = T Consensus 142 ~GGTGSG----lgs~l~e~L~d~yp~~~~~--------~~~v~---Ps~~~~~~vv~pYN~~Lsl~~l~e~~d~~~~~dN 206 (450) T PTZ00012 142 GGGTGSG----LGCLLLERLAIDYGKKSKL--------NFCSW---PSPQVSTAVVEPYNSVLSTHSLLEHTDVAIMLDN 206 (450) T ss_pred CCCCCCH----HHHHHHHHHHHHCCCCEEE--------EEEEC---CCCCCCCCCCCCCCCHHHHHHHHHHCCEEEEECC T ss_conf 8877431----9999999988765745043--------26861---7876678523144021022346752856998200 Q ss_pred HHHHHHHHH-CCCC--CHHHHHHHHHHH Q ss_conf 999999987-0899--689999999988 Q gi|254780799|r 531 EERYQKMSK-IGVR--NIDGFNLKVAQY 555 (806) Q Consensus 531 e~RY~l~a~-~~vR--ni~~yN~k~~~~ 555 (806) |-=|++..+ +++. +....|+-|.+. T Consensus 207 ~aL~~ic~~~l~i~~psf~~~N~lIaq~ 234 (450) T PTZ00012 207 EAIYDICKKNLDIERPTYTNLNRLIAQV 234 (450) T ss_pred HHHHHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 9999999985699999978999999988 No 416 >pfam01078 Mg_chelatase Magnesium chelatase, subunit ChlI. Magnesium-chelatase is a three-component enzyme that catalyses the insertion of Mg2+ into protoporphyrin IX. This is the first unique step in the synthesis of (bacterio)chlorophyll. Due to this, it is thought that Mg-chelatase has an important role in channelling inter- mediates into the (bacterio)chlorophyll branch in response to conditions suitable for photosynthetic growth. ChlI and BchD have molecular weight between 38-42 kDa. Probab=57.66 E-value=7.4 Score=18.58 Aligned_cols=23 Identities=35% Similarity=0.546 Sum_probs=18.6 Q ss_pred CCCCEEEEEECCCCHHHHHHHHH Q ss_conf 00202355304774067999999 Q gi|254780799|r 452 ARMPHLLIAGTTGSGKSVAINTM 474 (806) Q Consensus 452 akMPHLLIAGtTGSGKSV~iN~i 474 (806) |-+=|||+-|.-|+|||.....| T Consensus 20 aG~H~lLl~GpPG~GKTmlA~rl 42 (207) T pfam01078 20 AGGHNLLMIGPPGSGKTMLAKRL 42 (207) T ss_pred CCCCCEEEECCCCCCHHHHHHHH T ss_conf 47875897889980299999763 No 417 >cd02576 PseudoU_synth_ScPUS7 PseudoU_synth_ScPUS7: Pseudouridine synthase, TruD family. This group consists of eukaryotic pseudouridine synthases similar to Saccharomyces cerevisiae Pus7. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). Saccharomyces cerevisiae Pus7 makes psi35 in U2 small nuclear RNA (U2 snRNA), psi13 in cytoplasmic tRNAs and psi35 in pre-tRNATyr. Psi35 in yeast U2 snRNA and psi13 in tRNAs are highly phylogenetically conserved. Psi34 is the mammalian U2 snRNA counterpart of yeast U2 snRNA psi35. Probab=57.55 E-value=10 Score=17.48 Aligned_cols=22 Identities=27% Similarity=0.306 Sum_probs=10.4 Q ss_pred HHHHHHHHHHHHHHHHHCCCCE Q ss_conf 5665457899999974248632 Q gi|254780799|r 329 PKVMQNNACTLKSVLSDFGIQG 350 (806) Q Consensus 329 ~eeL~~nA~lLE~tL~dFGVe~ 350 (806) .+.++..|+.+--..+++|.-| T Consensus 43 ~~a~~~lak~~gi~~~~igyAG 64 (371) T cd02576 43 MEAINKIAKLLRVKPSDFSYAG 64 (371) T ss_pred HHHHHHHHHHHCCCHHHEEECC T ss_conf 9999999998399778842763 No 418 >PRK00215 LexA repressor; Validated Probab=57.49 E-value=13 Score=16.69 Aligned_cols=38 Identities=21% Similarity=0.326 Sum_probs=20.1 Q ss_pred HHHHHHHHHH-CCCHHHHHHHHHHHHH-------HHCCEEEEEEEC Q ss_conf 3446888873-2100588999999866-------414237999965 Q gi|254780799|r 596 IDEMADLMMV-ARKDIESAVQRLAQMA-------RASGIHVIMATQ 633 (806) Q Consensus 596 iDElaDlmm~-~~~~ve~~i~rlaq~a-------ra~GiHli~aTq 633 (806) +.|+|+.|=. +..-|-..+.+|.+|+ +|-||.++-..+ T Consensus 26 ~rEI~~~~g~~S~~tV~~~l~~Le~kG~i~r~~~~~R~i~i~~~~~ 71 (204) T PRK00215 26 RREIADALGLRSPSAVHEHLKALERKGFIRRDPGRSRAIEVLPAEA 71 (204) T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHCCCEECCCCCCCEEEEECCCC T ss_conf 9999998099981899999999987978870699973389715766 No 419 >KOG0350 consensus Probab=57.49 E-value=6.6 Score=18.91 Aligned_cols=52 Identities=35% Similarity=0.492 Sum_probs=39.9 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCH-HHHHHCCCCHHH Q ss_conf 3553047740679999999999982995784788852310-011102770343 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKM-LELSVYDGIPNL 508 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~-vEls~Y~~iPHL 508 (806) .+|+-+|||||..|---=|.-+|.+..=+-+|-+.|=|-+ +-+.+|+-+-.| T Consensus 186 IcV~ApTGSGKTLaY~iPIVQ~L~~R~v~~LRavVivPtr~L~~QV~~~f~~~ 238 (620) T KOG0350 186 ICVNAPTGSGKTLAYVIPIVQLLSSRPVKRLRAVVIVPTRELALQVYDTFKRL 238 (620) T ss_pred EEEECCCCCCCEEEEHHHHHHHHCCCCCCCEEEEEEEEHHHHHHHHHHHHHHH T ss_conf 47755789884566513789997038734057999954799999999999985 No 420 >PRK09279 pyruvate phosphate dikinase; Provisional Probab=57.36 E-value=3.6 Score=20.88 Aligned_cols=40 Identities=33% Similarity=0.369 Sum_probs=21.2 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHCC Q ss_conf 864500013467899999999999999999999999--999998276 Q gi|254780799|r 60 LRSPKNFLGYGGAIFADVAIQFFGIASVFFLPPPTM--WALSLLFDK 104 (806) Q Consensus 60 ~~~v~N~~G~lGA~iAd~L~~lFG~~Ayllpl~Ll~--~g~~ll~~k 104 (806) +..-.|++|-=||.+..+.- .|+ -+|..|.+ -+++.|... T Consensus 17 ~~~~~~lvGGK~AsLgEM~~--~Gl---pVPpGF~iTt~Ay~~fl~~ 58 (875) T PRK09279 17 NASMKNLLGGKGANLAEMTN--LGL---PVPPGFTITTEACNYYYDN 58 (875) T ss_pred CHHHHHCCCHHHHHHHHHHH--CCC---CCCCCEEEHHHHHHHHHHC T ss_conf 76578606716660999997--789---9999887249999999971 No 421 >PRK01158 phosphoglycolate phosphatase; Provisional Probab=57.25 E-value=11 Score=17.40 Aligned_cols=29 Identities=17% Similarity=0.192 Sum_probs=18.0 Q ss_pred CHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 00588999999866414237999965777 Q gi|254780799|r 608 KDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 608 ~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) +.+-....+--+++|..|||.++||=||- T Consensus 19 ~~i~~~~~~ai~~l~~~Gi~v~iaTGR~~ 47 (226) T PRK01158 19 RRLHLKAVEAIRKAEKLGVPVILVTGNIL 47 (226) T ss_pred CCCCHHHHHHHHHHHHCCCEEEEECCCCH T ss_conf 97199999999999988998999899977 No 422 >pfam03205 MobB Molybdopterin guanine dinucleotide synthesis protein B. This protein contains a P-loop. Probab=57.21 E-value=13 Score=16.66 Aligned_cols=46 Identities=24% Similarity=0.286 Sum_probs=31.8 Q ss_pred CEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHH Q ss_conf 02355304774067999999999998299578478885231001110 Q gi|254780799|r 455 PHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSV 501 (806) Q Consensus 455 PHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~ 501 (806) |-++|.|..+||||..+-.++-.++-+ .-.-+=+.=.||-+-|++. T Consensus 1 p~v~i~G~~~sGKttl~~~L~~~~~~~-g~~~~~~~~~d~gq~~~~~ 46 (122) T pfam03205 1 PIVLVVGPKDSGKTTLIRKLLNYLKRR-GYRVAVVKHLDHGQGEIDK 46 (122) T ss_pred CEEEEECCCCCCHHHHHHHHHHHHHHC-CCCEEEEEECCCCCCCCCC T ss_conf 979999489998999999999999987-9944899989999877689 No 423 >pfam01080 Presenilin Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans. Probab=57.17 E-value=13 Score=16.65 Aligned_cols=17 Identities=18% Similarity=0.288 Sum_probs=7.5 Q ss_pred HHHHHHH-CCCCCCCCCC Q ss_conf 9999844-2706896313 Q gi|254780799|r 41 ITLALGT-WDVYDPSFSY 57 (806) Q Consensus 41 l~iSL~S-Ysp~DPs~~~ 57 (806) ..++.+| |+..|...-+ T Consensus 29 ~~i~~~s~~~~~~~~lvY 46 (403) T pfam01080 29 ATIKSVSFYEQNNGSLLY 46 (403) T ss_pred HHHHHCCCCCCCCCCEEE T ss_conf 998404544568860477 No 424 >cd03261 ABC_Org_Solvent_Resistant ABC (ATP-binding cassette) transport system involved in resistant to organic solvents; ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=57.12 E-value=9.2 Score=17.85 Aligned_cols=149 Identities=26% Similarity=0.340 Sum_probs=75.4 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHH------HHH--------HCCCCHHHHCCCCC Q ss_conf 5410020235530477406799999999999829957847888523100------111--------02770343122334 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKML------ELS--------VYDGIPNLLTPVVT 514 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~v------Els--------~Y~~iPHLl~pVvT 514 (806) .++.+==-+-|=|..|||||..+++|. -|+ +|+.=+.. +|-+-+ ++. +|.+ |+ |-|--| T Consensus 21 l~i~~Ge~~~iiG~SGsGKSTll~~i~--gL~--~p~~G~I~-~~g~~i~~~~~~~~~~~r~~ig~vfQ~-~~-Lf~~lT 93 (235) T cd03261 21 LDVRRGEILAIIGPSGSGKSTLLRLIV--GLL--RPDSGEVL-IDGEDISGLSEAELYRLRRRMGMLFQS-GA-LFDSLT 93 (235) T ss_pred EEECCCCEEEEECCCCCHHHHHHHHHH--CCC--CCCCCEEE-ECCEECCCCCHHHHHHHHCCEEEEECC-CC-CCCCCC T ss_conf 488799899999999972999999997--599--98985899-999999989988999975782997049-86-589996 Q ss_pred CHHHHHHHHHH----HHHHHHHHHH-HHHHCCCCCHHHH---------HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 30456689999----9999999999-9987089968999---------99999887447866775446776545432223 Q gi|254780799|r 515 NPQKAVTVLKW----LVCEMEERYQ-KMSKIGVRNIDGF---------NLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYE 580 (806) Q Consensus 515 d~~kA~~aL~w----~V~EMe~RY~-l~a~~~vRni~~y---------N~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 580 (806) =-...+..|++ -=+|+++|-+ ++...|-.+.... -+|+.-|++ T Consensus 94 v~eNv~~~l~~~~~~~~~~~~~r~~~~L~~vgL~~~~~~~p~~LSGGq~QRvaIARA----------------------- 150 (235) T cd03261 94 VFENVAFPLREHTRLSEEEIREIVLEKLEAVGLRGAEDLYPAELSGGMKKRVALARA----------------------- 150 (235) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHCCCCCCCHHHHHHHHHHHH----------------------- T ss_conf 999999999995799999999999999986799257647841069999999999999----------------------- Q ss_pred CCCCCCCCCCEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 322323469868776344-68888732100588999999866414237999965777 Q gi|254780799|r 581 TEHFDFQHMPYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 581 ~~~~~~~~lp~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) +-.=|- +++-|| .+-|=-...++|.+.|.+|. +.-|+-.|+.|---. T Consensus 151 -----Lv~~P~-illlDEPts~LDp~~~~~i~~li~~l~---~~~g~T~i~vTHd~~ 198 (235) T cd03261 151 -----LALDPE-LLLYDEPTAGLDPIASGVIDDLIRSLK---KELGLTSIMVTHDLD 198 (235) T ss_pred -----HHCCCC-EEEEECCCCCCCHHHHHHHHHHHHHHH---HHHCCEEEEECCCHH T ss_conf -----854899-899808866479899999999999999---972999999898989 No 425 >cd00876 Ras Ras family. The Ras family of the Ras superfamily includes classical N-Ras, H-Ras, and K-Ras, as well as R-Ras, Rap, Ral, Rheb, Rhes, ARHI, RERG, Rin/Rit, RSR1, RRP22, Ras2, Ras-dva, and RGK proteins. Ras proteins regulate cell growth, proliferation and differentiation. Ras is activated by guanine nucleotide exchange factors (GEFs) that release GDP and allow GTP binding. Many RasGEFs have been identified. These are sequestered in the cytosol until activation by growth factors triggers recruitment to the plasma membrane or Golgi, where the GEF colocalizes with Ras. Active GTP-bound Ras interacts with several effector proteins: among the best characterized are the Raf kinases, phosphatidylinositol 3-kinase (PI3K), RalGEFs and NORE/MST1. Most Ras proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of m Probab=57.12 E-value=8.5 Score=18.11 Aligned_cols=47 Identities=21% Similarity=0.406 Sum_probs=31.1 Q ss_pred EEEEECCCCHHHHHHHHHHHHHH-HHCCH----HHEEEEEECCCHHHHHHCC Q ss_conf 35530477406799999999999-82995----7847888523100111027 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLL-YRMTP----AQCRLIMIDPKMLELSVYD 503 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlL-yk~~P----~evkliliDPK~vEls~Y~ 503 (806) ++|-|..|+|||-.||.++..-- -...| .-.|.+.+|.+.+.+.+++ T Consensus 2 i~ivG~~~vGKTsli~r~~~~~f~~~~~pTi~~~~~~~~~~~~~~~~l~i~D 53 (160) T cd00876 2 VVVLGAGGVGKSAITIQFVKGTFVEEYDPTIEDSYRKTIVVDGETYTLDILD 53 (160) T ss_pred EEEECCCCCCHHHHHHHHHHCCCCCCCCCCEEEEEEEEEEECCEEEEEEEEE T ss_conf 9999969967999999996195998778830048999999766999999997 No 426 >KOG1292 consensus Probab=57.10 E-value=13 Score=16.64 Aligned_cols=25 Identities=16% Similarity=0.454 Sum_probs=16.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8999999999999999999999998 Q gi|254780799|r 21 WSKKKMKIVAGLILLCTVFAITLAL 45 (806) Q Consensus 21 f~~rrl~Ei~Gl~Li~~al~l~iSL 45 (806) .-++|+|++-|-++++..+-.++-+ T Consensus 109 ~~~~~mr~iqGAlivas~vqiilG~ 133 (510) T KOG1292 109 RFQHRMREIQGALIVASLVQIILGF 133 (510) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 8999999861213699889999745 No 427 >TIGR01858 tag_bisphos_ald class II aldolase, tagatose bisphosphate family; InterPro: IPR011288 This entry describes tagatose-1,6-bisphosphate aldolases, and perhaps other closely related class II aldolases. This tetrameric, Zn2+-dependent enzyme is related to the class II fructose bisphosphate aldolase; fructose 1,6-bisphosphate and tagatose 1,6 bisphosphate differ only in chirality at C4 , .; GO: 0008270 zinc ion binding, 0016832 aldehyde-lyase activity, 0019512 lactose catabolic process via tagatose-6-phosphate. Probab=57.10 E-value=11 Score=17.35 Aligned_cols=51 Identities=20% Similarity=0.145 Sum_probs=26.5 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCHHH---HHHHHHHHHCCCEE Q ss_conf 99999999712897421100124556-6677888887777038---99999999659850 Q gi|254780799|r 702 VEKVVSHLKTQGEAKYIDIKDKILLN-EEMRFSENSSVADDLY---KQAVDIVLRDNKAS 757 (806) Q Consensus 702 v~~v~~~~~~q~~~~y~~~~~~~~~~-~~~~~~~~~~~~d~l~---~~a~~~v~~~~~~s 757 (806) |..|||||-+|-- .++.+... ++.+++-.-++.|-+| ++|.+||-.||-=| T Consensus 115 vk~VVdFcHr~D~-----sVEAELG~LgG~EDDl~Vde~~AlyTdP~~A~~Fve~TGvDS 169 (282) T TIGR01858 115 VKEVVDFCHRYDA-----SVEAELGRLGGVEDDLSVDEEDALYTDPDEAKEFVEATGVDS 169 (282) T ss_pred HHHHHHHHCCCCC-----EEEEEECCCCCCCCCCEECCCCCCCCCHHHHHHHHHHCCCHH T ss_conf 6666644216786-----576660846553266425401023589689999987427547 No 428 >cd01128 rho_factor Transcription termination factor rho is a bacterial ATP-dependent RNA/DNA helicase. It is a homohexamer. Each monomer consists of an N-terminal domain of the OB fold, which is responsible for binding to cysteine rich nucleotides. This alignment is of the C-terminal ATP binding domain. Probab=57.07 E-value=13 Score=16.64 Aligned_cols=73 Identities=15% Similarity=0.157 Sum_probs=51.4 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHHHHHHHHHHHH Q ss_conf 235530477406799999999999829957847888523100111027703431223343045668999999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAVTVLKWLVCEM 530 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~~aL~w~V~EM 530 (806) -.+|=|--|+|||+-++.|+.++-.+++-..|=..+|+=+-=|..-+ +-.+.+-||..+-.....+++-+..| T Consensus 18 R~gI~g~~gvGKT~Ll~~i~~~~~~~~~~~~~v~~lIGER~rEv~e~--~~~~~~~vv~st~d~~p~~r~~~a~~ 90 (249) T cd01128 18 RGLIVAPPKAGKTTLLQSIANAITKNHPEVYLIVLLIDERPEEVTDM--QRSVKGEVIASTFDEPPERHVQVAEM 90 (249) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHCCCEEEEEEEEECCHHHHHHH--HHHHCEEEEEECCCCCHHHHHHHHHH T ss_conf 77887899988999999999999985898499999971657999999--99716169995699998999999999 No 429 >PRK10908 cell division protein FtsE; Provisional Probab=56.95 E-value=8.6 Score=18.08 Aligned_cols=42 Identities=26% Similarity=0.403 Sum_probs=26.4 Q ss_pred CEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 868776344-68888732100588999999866414237999965777 Q gi|254780799|r 590 PYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 590 p~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) |-| ++.|| .+-|=....++|-+.+.++. | .|+-.|++|...+ T Consensus 156 P~i-LllDEPt~~LD~~~~~~v~~~l~~l~---~-~g~tvl~vtHd~~ 198 (222) T PRK10908 156 PAV-LLADEPTGNLDDALSEGILRLFEEFN---R-VGVTVLMATHDIG 198 (222) T ss_pred CCE-EEEECCCCCCCHHHHHHHHHHHHHHH---H-CCCEEEEECCCHH T ss_conf 999-99909876679999999999999998---6-1999999947999 No 430 >cd01854 YjeQ_engC YjeQ/EngC. YjeQ (YloQ in Bacillus subtilis) represents a protein family whose members are broadly conserved in bacteria and have been shown to be essential to the growth of E. coli and B. subtilis. Proteins of the YjeQ family contain all sequence motifs typical of the vast class of P-loop-containing GTPases, but show a circular permutation, with a G4-G1-G3 pattern of motifs as opposed to the regular G1-G3-G4 pattern seen in most GTPases. All YjeQ family proteins display a unique domain architecture, which includes an N-terminal OB-fold RNA-binding domain, the central permuted GTPase domain, and a zinc knuckle-like C-terminal cysteine domain. This domain architecture suggests a role for YjeQ as a regulator of translation. Probab=56.94 E-value=6.8 Score=18.85 Aligned_cols=89 Identities=19% Similarity=0.220 Sum_probs=44.8 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHCC------------HHHEEEEEEC--------CCHHHHHHCCCCHHHHCCCCCCHH Q ss_conf 55304774067999999999998299------------5784788852--------310011102770343122334304 Q gi|254780799|r 458 LIAGTTGSGKSVAINTMILSLLYRMT------------PAQCRLIMID--------PKMLELSVYDGIPNLLTPVVTNPQ 517 (806) Q Consensus 458 LIAGtTGSGKSV~iN~iI~SlLyk~~------------P~evkliliD--------PK~vEls~Y~~iPHLl~pVvTd~~ 517 (806) ..+|..|.|||--||+++-..-.+.. --..+|+.++ |=+=||...+-=+ T Consensus 165 v~~G~SGvGKSSLiN~L~~~~~~~t~~vs~~~~rGrHTTt~~~L~~l~~gg~iiDTPG~r~~~l~~~~~----------- 233 (287) T cd01854 165 VLVGQSGVGKSTLINALLPDLDLATGEISEKLGRGRHTTTHRELFPLPGGGLLIDTPGFREFGLLHIDP----------- 233 (287) T ss_pred EEECCCCCCHHHHHHHHCCHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCEEEECCCCCCCCCCCCCH----------- T ss_conf 998899888899998746212125666677608985014157999928995898689876356777898----------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCC Q ss_conf 5668999999999999999987089968999999998874478 Q gi|254780799|r 518 KAVTVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGK 560 (806) Q Consensus 518 kA~~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~ 560 (806) ..|..+-.|++.--....-..++++.+=+=.|++|-+.|+ T Consensus 234 ---~~l~~~F~ei~~~~~~CkF~dC~H~~EPgCaV~~ave~g~ 273 (287) T cd01854 234 ---EELAHYFPEFRELAGQCKFRDCTHTNEPGCAVKAAVEAGE 273 (287) T ss_pred ---HHHHHHCHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHCCC T ss_conf ---9998868789998678989399889899978999998699 No 431 >pfam09848 DUF2075 Uncharacterized conserved protein (DUF2075). This domain, found in various prokaryotic proteins (including putative ATP/GTP binding proteins), has no known function. Probab=56.91 E-value=13 Score=16.62 Aligned_cols=21 Identities=29% Similarity=0.599 Sum_probs=16.1 Q ss_pred EEEEECCCCHHHHHHHHHHHH Q ss_conf 355304774067999999999 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILS 477 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~S 477 (806) .+|-|.-||||||....+..- T Consensus 4 ~~V~G~pGtGKTvv~l~l~~~ 24 (348) T pfam09848 4 FLVTGGPGTGKTVVALNLFAE 24 (348) T ss_pred EEEECCCCCCHHHHHHHHHHH T ss_conf 999777993899999999999 No 432 >cd03281 ABC_MSH5_euk MutS5 homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding c Probab=56.81 E-value=13 Score=16.61 Aligned_cols=94 Identities=14% Similarity=0.162 Sum_probs=50.8 Q ss_pred HHCCCCCHHHCCCCCCCCEEEEECCCCC--EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCC Q ss_conf 3022100021045666666785410020--23553047740679999999999982995784788852310011102770 Q gi|254780799|r 428 EKNQCDLAINLGKSIEGKPIIADLARMP--HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGI 505 (806) Q Consensus 428 ~~s~~~L~iaLGKdI~G~pvv~DLakMP--HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~i 505 (806) ++..|||--...+..--+.+. |.+=. =++|-|.-.||||+.+-++-+..++-..= .-|==+--++++|+.| T Consensus 3 ~~~RHPlle~~~~~~VpNdi~--l~~~~~~~~iiTGpN~gGKSt~lkti~l~~ilaq~G-----~~vpa~~~~~~~~~~i 75 (213) T cd03281 3 QGGRHPLLELFVDSFVPNDTE--IGGGGPSIMVITGPNSSGKSVYLKQVALIVFLAHIG-----SFVPADSATIGLVDKI 75 (213) T ss_pred CCCCCCEEECCCCCEECCEEE--ECCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHC-----CCEEECCEEEECCCCE T ss_conf 757586582567964676589--779982599998999876599999999999999858-----8576740399721223 Q ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 343122334304566899999999999 Q gi|254780799|r 506 PNLLTPVVTNPQKAVTVLKWLVCEMEE 532 (806) Q Consensus 506 PHLl~pVvTd~~kA~~aL~w~V~EMe~ 532 (806) ++ -+-+...-...+.--..||.+ T Consensus 76 ---~~-~i~~~d~~~~~~StF~~E~~~ 98 (213) T cd03281 76 ---FT-RMSSRESVSSGQSAFMIDLYQ 98 (213) T ss_pred ---EE-EECCHHHHHCCCCHHHHHHHH T ss_conf ---76-767643444132589999999 No 433 >cd03294 ABC_Pro_Gly_Bertaine This family comprises the glycine betaine/L-proline ATP binding subunit in bacteria and its equivalents in archaea. This transport system belong to the larger ATP-Binding Cassette (ABC) transporter superfamily. The characteristic feature of these transporters is the obligatory coupling of ATP hydrolysis to substrate translocation. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=56.72 E-value=8.8 Score=18.00 Aligned_cols=165 Identities=21% Similarity=0.323 Sum_probs=82.0 Q ss_pred CCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHH-HHH----------- Q ss_conf 00021045666666785410020235530477406799999999999829957847888523100-111----------- Q gi|254780799|r 433 DLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKML-ELS----------- 500 (806) Q Consensus 433 ~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~v-Els----------- 500 (806) .++=-.|...+=+-+-.++.+==-+-|-|..|||||.-++. |.-| .+|++=+.. +|=+-+ .++ T Consensus 29 ~v~K~fG~~~aL~~vsl~i~~GE~~~ivG~SGsGKSTLLr~-i~GL---~~p~~G~I~-~~G~~i~~~~~~~l~~~r~~~ 103 (269) T cd03294 29 EILKKTGQTVGVNDVSLDVREGEIFVIMGLSGSGKSTLLRC-INRL---IEPTSGKVL-IDGQDIAAMSRKELRELRRKK 103 (269) T ss_pred HEEEECCCEEEECCCEEEECCCCEEEEECCCCCHHHHHHHH-HHCC---CCCCCEEEE-ECCEECCCCCHHHHHHHHCCC T ss_conf 85885099279777475888999999998998489999999-9759---999975999-999999999989998852564 Q ss_pred ---HCCCCHHHHCCCCCCHHHHHHHHH---HHHHHHHHHH-HHHHHCCCCCHHHH---------HHHHHHHHHCCCCCCC Q ss_conf ---027703431223343045668999---9999999999-99987089968999---------9999988744786677 Q gi|254780799|r 501 ---VYDGIPNLLTPVVTNPQKAVTVLK---WLVCEMEERY-QKMSKIGVRNIDGF---------NLKVAQYHNTGKKFNR 564 (806) Q Consensus 501 ---~Y~~iPHLl~pVvTd~~kA~~aL~---w~V~EMe~RY-~l~a~~~vRni~~y---------N~k~~~~~~~~~~~~~ 564 (806) +|.+ +-|-|--|=-...+..|+ +--.|.++|- +++...|..+.... -+||.-|++ T Consensus 104 igmVFQ~--~aL~P~ltV~eNV~~~L~~~~~~~~e~~~rv~e~L~~vgL~~~~~~~P~qLSGGq~QRVaIARA------- 174 (269) T cd03294 104 ISMVFQS--FALLPHRTVLENVAFGLEVQGVPRAEREERAAEALELVGLEGWEHKYPDELSGGMQQRVGLARA------- 174 (269) T ss_pred EEEEEEC--CCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHCCHHHHCHHHHHHHHHHHH------- T ss_conf 6999615--7547678799998688885289978999999999986798677756967849488889999999------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 5446776545432223322323469868776344-68888732100588999999866414237999965777 Q gi|254780799|r 565 TVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 565 ~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) +-.=|-| ++-|| ++-|=-....++.+.+.+| -+.-|+-.|+.|--+. T Consensus 175 ---------------------La~~P~i-LLlDEPtsaLD~~~~~~i~~~l~~l---~~~~~~T~i~VTHD~~ 222 (269) T cd03294 175 ---------------------LAVDPDI-LLMDEAFSALDPLIRREMQDELLRL---QAELQKTIVFITHDLD 222 (269) T ss_pred ---------------------HHCCCCE-EEECCCCCCCCHHHHHHHHHHHHHH---HHHHCCEEEEECCCHH T ss_conf ---------------------8639989-9975875425999999999999999---9974999999999899 No 434 >cd01893 Miro1 Miro1 subfamily. Miro (mitochondrial Rho) proteins have tandem GTP-binding domains separated by a linker region containing putative calcium-binding EF hand motifs. Genes encoding Miro-like proteins were found in several eukaryotic organisms. This CD represents the N-terminal GTPase domain of Miro proteins. These atypical Rho GTPases have roles in mitochondrial homeostasis and apoptosis. Most Rho proteins contain a lipid modification site at the C-terminus; however, Miro is one of few Rho subfamilies that lack this feature. Probab=56.67 E-value=11 Score=17.37 Aligned_cols=43 Identities=21% Similarity=0.390 Sum_probs=29.3 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHHE--------EEEEECCCHHHHHHCC Q ss_conf 3553047740679999999999982995784--------7888523100111027 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQC--------RLIMIDPKMLELSVYD 503 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~ev--------kliliDPK~vEls~Y~ 503 (806) +++-|..|.|||--|+..+ ...-|++. +-+.+|.|.+.|.+++ T Consensus 3 iv~vGd~~VGKTsli~r~~----~~~F~~~~~~t~~~~~~~~~~~~~~v~l~i~D 53 (166) T cd01893 3 IVLIGDEGVGKSSLIMSLV----SEEFPENVPRVLPEITIPADVTPERVPTTIVD 53 (166) T ss_pred EEEECCCCCCHHHHHHHHH----HCCCCCCCCCCCCCEEEEEEECCEEEEEEEEE T ss_conf 9999999989999999998----49788877763456899999889099999998 No 435 >PRK11701 phnK phosphonates transport ATP-binding protein; Provisional Probab=56.44 E-value=8.9 Score=17.97 Aligned_cols=29 Identities=34% Similarity=0.444 Sum_probs=21.5 Q ss_pred EEEEECCCCCEEEEEECCCCHHHHHHHHH Q ss_conf 67854100202355304774067999999 Q gi|254780799|r 446 PIIADLARMPHLLIAGTTGSGKSVAINTM 474 (806) Q Consensus 446 pvv~DLakMPHLLIAGtTGSGKSV~iN~i 474 (806) -|-.++.+==-+-|-|..|||||.-+|+| T Consensus 24 ~Vs~~v~~GEi~~iiG~nGaGKSTLl~~i 52 (258) T PRK11701 24 DVSFDLYPGEVLGIVGESGSGKTTLLNAL 52 (258) T ss_pred CEEEEECCCCEEEEECCCCCCHHHHHHHH T ss_conf 22778879979999888998899999998 No 436 >cd03214 ABC_Iron-Siderophores_B12_Hemin ABC transporters, involved in the uptake of siderophores, heme, and vitamin B12, are widely conserved in bacteria and archaea. Only very few species lack representatives of the siderophore family transporters. The E. coli BtuCD protein is an ABC transporter mediating vitamin B12 uptake. The two ATP-binding cassettes (BtuD) are in close contact with each other, as are the two membrane-spanning subunits (BtuC); this arrangement is distinct from that observed for the E. coli lipid flippase MsbA. The BtuC subunits provide 20 transmembrane helices grouped around a translocation pathway that is closed to the cytoplasm by a gate region, whereas the dimer arrangement of the BtuD subunits resembles the ATP-bound form of the Rad50 DNA repair enzyme. A prominent cytoplasmic loop of BtuC forms the contact region with the ATP-binding cassette and represent a conserved motif among the ABC transporters. Probab=56.44 E-value=11 Score=17.30 Aligned_cols=42 Identities=24% Similarity=0.462 Sum_probs=26.9 Q ss_pred CCEEEE----ECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEE Q ss_conf 666785----4100202355304774067999999999998299578478 Q gi|254780799|r 444 GKPIIA----DLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRL 489 (806) Q Consensus 444 G~pvv~----DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkl 489 (806) +++++- ++.+==++-|-|..|||||.-++.|.= + .+|++=+. T Consensus 11 ~~~il~~is~~i~~Ge~~~liG~nGsGKTTLl~~i~G-~---~~~~~G~I 56 (180) T cd03214 11 GRTVLDDLSLSIEAGEIVGILGPNGAGKSTLLKTLAG-L---LKPSSGEI 56 (180) T ss_pred CEEEEECEEEEECCCCEEEEECCCCCCHHHHHHHHHC-C---CCCCCCEE T ss_conf 9998804377886997999998999889999999957-9---89987289 No 437 >PRK13644 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=56.27 E-value=14 Score=16.55 Aligned_cols=94 Identities=16% Similarity=0.245 Sum_probs=55.0 Q ss_pred EEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCCCCHHHCCCCCH Q ss_conf 776344-6888873210058899999986641423799996577753554355411025158764586642123388645 Q gi|254780799|r 593 VVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKIDSRTILGEQGA 671 (806) Q Consensus 593 vviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~dSrtild~~ga 671 (806) |+|-|| .|-|=....+++-+.+.+|.+ .|.-.|+.|.+-+ . ++ + --||..= -+-|.|.| +-. T Consensus 157 iLiLDEPTs~LD~~~~~~i~~~l~~L~~----~g~TvI~itHdl~--~----~~-~-aDrvivl----~~G~Iv~~-G~p 219 (274) T PRK13644 157 CLIFDEVTSMLDPDSGIAVLERIKKLHE----KGKTIVYITHNLE--E----LH-D-ADRIIVM----DRGKIVLE-GEP 219 (274) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHHH----CCCEEEEEEECHH--H----HH-C-CCEEEEE----ECCEEEEE-CCH T ss_conf 9999798667899999999999999986----8999999833789--9----97-1-9989999----89999998-798 Q ss_pred HHHCCCCCEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHCCCC Q ss_conf 78658875477368983258883348988999999999712897 Q gi|254780799|r 672 EQLLGQGDMLYMTGGGRVQRIHGPFVSDIEVEKVVSHLKTQGEA 715 (806) Q Consensus 672 e~Llg~gdml~~~~~~~~~r~~g~~v~~~ev~~v~~~~~~q~~~ 715 (806) |.++.+ +. ++.+-+.-.++-++...++.+|-+ T Consensus 220 ~eif~~-----------~~-l~~~~l~~P~~~~~~~~L~~~g~~ 251 (274) T PRK13644 220 ENVLSD-----------VS-LQTLGLTPPSLIELAENLKMHGVV 251 (274) T ss_pred HHHHCC-----------HH-HHHCCCCCCHHHHHHHHHHHCCCC T ss_conf 998489-----------62-887699998499999999976998 No 438 >PRK09544 znuC high-affinity zinc transporter ATPase; Reviewed Probab=56.23 E-value=11 Score=17.41 Aligned_cols=44 Identities=25% Similarity=0.363 Sum_probs=28.5 Q ss_pred EEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCC Q ss_conf 7854100202355304774067999999999998299578478885231 Q gi|254780799|r 447 IIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPK 495 (806) Q Consensus 447 vv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK 495 (806) +-.++.+==.+-+.|-.|||||-.++.|.- + .+|+.=. |..++| T Consensus 23 vs~~i~~Gei~~LiGpNGaGKSTLlk~I~G--l--~~p~~G~-I~~~~~ 66 (251) T PRK09544 23 VSLELKPGKILTLLGPNGAGKSTLVRVVLG--L--VAPDEGV-IKRNGK 66 (251) T ss_pred EEEEECCCCEEEEECCCCCCHHHHHHHHHC--C--CCCCCCE-EEECCC T ss_conf 078987997999998999889999999966--8--8898608-999994 No 439 >PRK10870 transcriptional repressor MprA; Provisional Probab=56.11 E-value=11 Score=17.34 Aligned_cols=45 Identities=22% Similarity=0.310 Sum_probs=27.2 Q ss_pred CEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHC-CCCCEEEEC Q ss_conf 850001422200117789999999997798680227-887267317 Q gi|254780799|r 755 KASISYIQRRLGIGYNRAASIIENMEEKGVIGPASS-TGKREILIS 799 (806) Q Consensus 755 ~~s~s~lqr~~~igy~raar~~~~~e~~giv~~~~g-~~~r~vl~~ 799 (806) ..|.|=|..++.+-=..+.|++|.||++|.|--... .-.|.+.|. T Consensus 71 ~l~PseLa~~l~~s~a~iTr~vD~LEkkGlV~R~~s~~DRR~~~V~ 116 (176) T PRK10870 71 SIQPSELSCALGSSRTNATRIADELEKRGWIERRESDNDRRCLHLQ 116 (176) T ss_pred CCCHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEECCCCCCCEEEEE T ss_conf 8598999999778710199999999988997872371235347889 No 440 >cd04146 RERG_RasL11_like RERG/RasL11-like subfamily. RERG (Ras-related and Estrogen- Regulated Growth inhibitor) and Ras-like 11 are members of a novel subfamily of Ras that were identified based on their behavior in breast and prostate tumors, respectively. RERG expression was decreased or lost in a significant fraction of primary human breast tumors that lack estrogen receptor and are correlated with poor clinical prognosis. Elevated RERG expression correlated with favorable patient outcome in a breast tumor subtype that is positive for estrogen receptor expression. In contrast to most Ras proteins, RERG overexpression inhibited the growth of breast tumor cells in vitro and in vivo. RasL11 was found to be ubiquitously expressed in human tissue, but down-regulated in prostate tumors. Both RERG and RasL11 lack the C-terminal CaaX prenylation motif, where a = an aliphatic amino acid and X = any amino acid, and are localized primarily in the cytoplasm. Both are believed to have tu Probab=56.05 E-value=13 Score=16.82 Aligned_cols=47 Identities=23% Similarity=0.434 Sum_probs=30.9 Q ss_pred EEEEECCCCHHHHHHHHHHHHHH---HHCCHH--HEEEEEECCCHHHHHHCC Q ss_conf 35530477406799999999999---829957--847888523100111027 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLL---YRMTPA--QCRLIMIDPKMLELSVYD 503 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlL---yk~~P~--evkliliDPK~vEls~Y~ 503 (806) +++-|..|.|||--++..+..-- |..+.. --|-+.+|.+.++|.+++ T Consensus 2 iv~vGd~~VGKTsli~rf~~~~f~~~y~~T~~~~~~~~~~v~~~~~~l~iwD 53 (165) T cd04146 2 IAVLGASGVGKSALVVRFLTKRFIGEYDPNLESLYSRQVTIDGEQVSLEILD 53 (165) T ss_pred EEEECCCCCCHHHHHHHHHHCCCCCCCCCCCCCEEEEEEEECCEEEEEEEEE T ss_conf 9999989977899999997498998759955630579999999999999992 No 441 >cd06927 RNAP_L L subunit of Archaeal RNA polymerase. The archaeal L subunit of RNA polymerase (RNAP) is involved in the assembly of RNAP subunits. RNAP is a large multi-subunit complex responsible for the synthesis of RNA. It is the principal enzyme of the transcription process, and is a final target in many regulatory pathways that control gene expression in all living cells. A single distinct RNAP complex is found in archaea, which may be responsible for the synthesis of all RNAs. The archaeal RNAP harbors homologues of all eukaryotic RNAP II subunits with two exceptions (RPB8 and RPB9). The 12 archaeal subunits are designated by letters and can be divided into three functional groups that are engaged in: (I) catalysis (A'/A", B'/B" or B); (II) assembly (L, N, D and P); and (III) auxiliary functions (F, E, H and K). The assembly of the two largest archaeal RNAP subunits that provide most of the enzyme's catalytic functions depends on the presence of the archaeal D/L heterodimer. Probab=56.02 E-value=6.9 Score=18.78 Aligned_cols=43 Identities=21% Similarity=0.322 Sum_probs=30.0 Q ss_pred CCCHHHHHHCCCCHHHHCC-----CCCCH-HHHHHHHHHHHHHHHHHHHHH Q ss_conf 2310011102770343122-----33430-456689999999999999999 Q gi|254780799|r 493 DPKMLELSVYDGIPNLLTP-----VVTNP-QKAVTVLKWLVCEMEERYQKM 537 (806) Q Consensus 493 DPK~vEls~Y~~iPHLl~p-----VvTd~-~kA~~aL~w~V~EMe~RY~l~ 537 (806) || .|+++.|. +||=+.+ |-|+. +++..||+++++.-....+-| T Consensus 33 ~~-~V~fa~Y~-i~HPl~~~~~l~I~T~~~~~p~~al~~a~~~l~~~~~~~ 81 (83) T cd06927 33 DP-GVKVASYD-IEHPLLSNPVLKIKTDGGVDPLEALKEAAKRLIDLCEEF 81 (83) T ss_pred CC-CEEEEEEE-CCCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 99-85899632-899988920899998999998999999999999999875 No 442 >PRK04837 ATP-dependent RNA helicase RhlB; Provisional Probab=55.96 E-value=14 Score=16.51 Aligned_cols=17 Identities=6% Similarity=0.075 Sum_probs=9.2 Q ss_pred CCCCHHHHH---HHHHHHHH Q ss_conf 348988999---99999971 Q gi|254780799|r 695 PFVSDIEVE---KVVSHLKT 711 (806) Q Consensus 695 ~~v~~~ev~---~v~~~~~~ 711 (806) .|++++|.. +|-+|++. T Consensus 358 tf~~~~~~~~l~~ie~~~~~ 377 (423) T PRK04837 358 SFACEEYALNLPAIETYIGH 377 (423) T ss_pred EEECHHHHHHHHHHHHHHCC T ss_conf 98739999999999999689 No 443 >cd00157 Rho Rho (Ras homology) family. Members of the Rho family include RhoA, Cdc42, Rac, Rnd, Wrch1, RhoBTB, and Rop. There are 22 human Rho family members identified currently. These proteins are all involved in the reorganization of the actin cytoskeleton in response to external stimuli. They also have roles in cell transformation by Ras in cytokinesis, in focal adhesion formation and in the stimulation of stress-activated kinase. These various functions are controlled through distinct effector proteins and mediated through a GTP-binding/GTPase cycle involving three classes of regulating proteins: GAPs (GTPase-activating proteins), GEFs (guanine nucleotide exchange factors), and GDIs (guanine nucleotide dissociation inhibitors). Most Rho proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Rho protein Probab=55.88 E-value=11 Score=17.19 Aligned_cols=43 Identities=19% Similarity=0.418 Sum_probs=30.5 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHH---------EEEEEECCCHHHHHHCC Q ss_conf 355304774067999999999998299578---------47888523100111027 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQ---------CRLIMIDPKMLELSVYD 503 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~e---------vkliliDPK~vEls~Y~ 503 (806) +++-|..|.|||-.++..+ +..-|++ .+-+.+|-+.+.|.+++ T Consensus 3 i~llGd~~VGKTsli~r~~----~~~f~~~y~~Ti~~~~~~~~~~~~~~~~l~iwD 54 (171) T cd00157 3 IVVVGDGAVGKTCLLISYT----TGKFPTEYVPTVFDNYSATVTVDGKQVNLGLWD 54 (171) T ss_pred EEEECCCCCCHHHHHHHHH----CCCCCCCCCCEEEEEEEEEEEECCEEEEEEEEE T ss_conf 9999999966999999996----299998758803466689999999999999998 No 444 >PRK10938 putative molybdenum transport ATP-binding protein ModF; Provisional Probab=55.83 E-value=14 Score=16.50 Aligned_cols=28 Identities=25% Similarity=0.357 Sum_probs=21.0 Q ss_pred EEECCCCCEEEEEECCCCHHHHHHHHHH Q ss_conf 8541002023553047740679999999 Q gi|254780799|r 448 IADLARMPHLLIAGTTGSGKSVAINTMI 475 (806) Q Consensus 448 v~DLakMPHLLIAGtTGSGKSV~iN~iI 475 (806) -.++.+==.+-|+|.-|||||-.++.|. T Consensus 280 s~~v~~GE~~~i~G~nGsGKSTLl~~l~ 307 (490) T PRK10938 280 SWQVNPGEHWQIVGPNGAGKSTLLSLIT 307 (490) T ss_pred EEEECCCCEEEEECCCCCCHHHHHHHHH T ss_conf 7898389889998678887999999980 No 445 >COG3840 ThiQ ABC-type thiamine transport system, ATPase component [Coenzyme metabolism] Probab=55.82 E-value=7.6 Score=18.48 Aligned_cols=185 Identities=17% Similarity=0.281 Sum_probs=97.7 Q ss_pred CCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCC-------- Q ss_conf 000210456666667854100202355304774067999999999998299578478885231001110277-------- Q gi|254780799|r 433 DLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDG-------- 504 (806) Q Consensus 433 ~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~-------- 504 (806) .+|+.+--.|...-+ .-|.|..|||||--+|-| ... ..|..=.+.+=+--+--..+|+. T Consensus 13 ~~~~~fdl~v~~ge~---------vAi~GpSGaGKSTLLnLI-AGF---~~P~~G~i~i~g~d~t~~~P~~RPVSmlFQE 79 (231) T COG3840 13 HLPMRFDLTVPAGEI---------VAILGPSGAGKSTLLNLI-AGF---ETPASGEILINGVDHTASPPAERPVSMLFQE 79 (231) T ss_pred CCEEEEEEEECCCCE---------EEEECCCCCCHHHHHHHH-HHC---CCCCCCEEEECCEECCCCCCCCCCHHHHHHC T ss_conf 500788876067857---------999778886578899998-742---4778745898572147689544873111100 Q ss_pred ---CHHHHCCCCCCHHHHH-HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf ---0343122334304566-899999999999999998708996899999999887447866775446776545432223 Q gi|254780799|r 505 ---IPNLLTPVVTNPQKAV-TVLKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYE 580 (806) Q Consensus 505 ---iPHLl~pVvTd~~kA~-~aL~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 580 (806) -+||-. -++.---. -.||---.+=++.-...+..| |++|-++.......|+...-.+-. .... T Consensus 80 nNLFaHLtV--~qNigLGl~P~LkL~a~~r~~v~~aa~~vG---l~~~~~RLP~~LSGGqRQRvALAR--------clvR 146 (231) T COG3840 80 NNLFAHLTV--AQNIGLGLSPGLKLNAEQREKVEAAAAQVG---LAGFLKRLPGELSGGQRQRVALAR--------CLVR 146 (231) T ss_pred CCCCHHHHH--HHHHCCCCCCCCCCCHHHHHHHHHHHHHHC---HHHHHHHCCCCCCCHHHHHHHHHH--------HHHC T ss_conf 564211026--553245678665248889999999999857---555754096224740778999999--------8802 Q ss_pred CCCCCCCCCCEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCC Q ss_conf 322323469868776344-6888873210058899999986641423799996577753554355411025158764586 Q gi|254780799|r 581 TEHFDFQHMPYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSK 659 (806) Q Consensus 581 ~~~~~~~~lp~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~ 659 (806) +.=|...|| ||-| -+.==++..+-++|..+--|.-+++-|..|+- -+-+-.|+.|-.--+ T Consensus 147 ---------~~PilLLDEPFsAL---dP~LR~eMl~Lv~~l~~E~~~TllmVTH~~~D-------a~~ia~~~~~l~~Gr 207 (231) T COG3840 147 ---------EQPILLLDEPFSAL---DPALRAEMLALVSQLCDERKMTLLMVTHHPED-------AARIADRVVFLDNGR 207 (231) T ss_pred ---------CCCEEEECCCHHHC---CHHHHHHHHHHHHHHHHHHCCEEEEEECCHHH-------HHHHHHCEEEEECCE T ss_conf ---------68757754811331---97889999999999988428779999578889-------997652069985777 Q ss_pred CCC Q ss_conf 642 Q gi|254780799|r 660 IDS 662 (806) Q Consensus 660 ~dS 662 (806) |.- T Consensus 208 i~~ 210 (231) T COG3840 208 IAA 210 (231) T ss_pred EEE T ss_conf 876 No 446 >PRK12338 hypothetical protein; Provisional Probab=55.77 E-value=6 Score=19.22 Aligned_cols=122 Identities=19% Similarity=0.256 Sum_probs=69.1 Q ss_pred CCCCCCEEEEEHHHHHHH---------HHHCCCHHHH-HHHHHHH-----HHHHCCEEEEEEECCC-CCCCCCHHHHHCC Q ss_conf 234698687763446888---------8732100588-9999998-----6641423799996577-7535543554110 Q gi|254780799|r 585 DFQHMPYIVVVIDEMADL---------MMVARKDIES-AVQRLAQ-----MARASGIHVIMATQRP-SVDVITGTIKANF 648 (806) Q Consensus 585 ~~~~lp~ivviiDElaDl---------mm~~~~~ve~-~i~rlaq-----~ara~GiHli~aTqrP-svdvitg~ikan~ 648 (806) .+-..|+.|++.||=.-+ |-..+|-|+. .-.|+-| .|+--||--|=-|-=- ||+-|-+.|+-+ T Consensus 129 ~~~~i~F~i~~~~E~kH~eRF~~Rak~m~r~~Kyvkyf~niR~Iqd~L~~~Adeh~iP~I~N~nid~sV~~i~~~I~~~- 207 (320) T PRK12338 129 EYANIHFFILYADEEEHKERFVKRAMEIKRGGKHLEYFRENRIIHDFLVSQADEHGIPVIKNDDIDRTVSKVLSVIREV- 207 (320) T ss_pred CCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHH- T ss_conf 6666369999817689999999999885026508999998899999998416761997304763556899999999987- Q ss_pred CCEEEEEECCCCCCHHHCCCCCHHHHCCCC----CEEEECCC-CCE-EEEEECCCCHHHHHHHHHHHHHCCC Q ss_conf 251587645866421233886457865887----54773689-832-5888334898899999999971289 Q gi|254780799|r 649 PTRISFQVSSKIDSRTILGEQGAEQLLGQG----DMLYMTGG-GRV-QRIHGPFVSDIEVEKVVSHLKTQGE 714 (806) Q Consensus 649 p~riaf~v~s~~dSrtild~~gae~Llg~g----dml~~~~~-~~~-~r~~g~~v~~~ev~~v~~~~~~q~~ 714 (806) ++.-+-+-|--|.+. -+|=..-+| |-=|--|| ..| .|--|-+ +.+|.++..+-++.+.+ T Consensus 208 -~~~~~~~hsVddl~~-----e~eII~~~~~~i~dI~y~ipgfk~pl~r~v~v~-d~~e~d~Fik~ln~~p~ 272 (320) T PRK12338 208 -SVVVKLVHSVEDLDV-----EDEIIKENNGRITDISYPIPGFKDPLKRSVNIY-DRDEADKFIKILNEEKK 272 (320) T ss_pred -HEEEECCCCHHHHHH-----HHHHHHHCCCEEEEEEECCCCCCCCCEEECCCC-CHHHHHHHHHHHHCCHH T ss_conf -346522676888888-----789999739758751002887574312344778-70789999999830855 No 447 >COG5635 Predicted NTPase (NACHT family) [Signal transduction mechanisms] Probab=55.73 E-value=14 Score=16.49 Aligned_cols=62 Identities=26% Similarity=0.280 Sum_probs=43.5 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHHHH--HHHHHHHHHHHH Q ss_conf 23553047740679999999999982995784788852310011102770343122334304566--899999999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQKAV--TVLKWLVCEMEE 532 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~kA~--~aL~w~V~EMe~ 532 (806) |++|=|.=|||||..++-+-+....+.-+ ..+-.||+++.+....-+-.- .-..|++.+|.. T Consensus 224 ~~~Ilg~pGsGKTtfl~~lA~~~~~~~~~---------------~~~vpi~~~l~~~~~~~~~~~q~~~~~~l~~~~~~ 287 (824) T COG5635 224 KLLILGAPGSGKTTFLQRLALWLAQRTLE---------------PEDVPIFLLLNAFALARKFEKQLSLIDYLAEELFS 287 (824) T ss_pred HEEEECCCCCCHHHHHHHHHHHHHHCCCC---------------CCCCCEEEEECCHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 33344688876267999999997415256---------------65677344423156544445216499999999851 No 448 >pfam04317 DUF463 YcjX-like family, DUF463. Some members of this family are thought to possess an ATP-binding domain towards their N-terminus. Probab=55.64 E-value=8.7 Score=18.03 Aligned_cols=13 Identities=23% Similarity=0.417 Sum_probs=8.5 Q ss_pred HHHHHHHHHHHCC Q ss_conf 3899999999659 Q gi|254780799|r 742 LYKQAVDIVLRDN 754 (806) Q Consensus 742 l~~~a~~~v~~~~ 754 (806) -.|+|.++.+.++ T Consensus 430 RLD~alqFLlGDk 442 (443) T pfam04317 430 RLDRALQFLLGDK 442 (443) T ss_pred CHHHHHHHHHCCC T ss_conf 6889999875077 No 449 >smart00763 AAA_PrkA PrkA AAA domain. This is a family of PrkA bacterial and archaeal serine kinases approximately 630 residues long. This is the N-terminal AAA domain. Probab=55.62 E-value=13 Score=16.83 Aligned_cols=99 Identities=22% Similarity=0.320 Sum_probs=56.5 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCC Q ss_conf 54578999999742486328998410442444443214786399999788999988630012100038861255530245 Q gi|254780799|r 332 MQNNACTLKSVLSDFGIQGEIVNVRPGPVITLYELEPAPGIKSSRIIGLSDDIARSMSAISARVAVIPRRNAIGIELPND 411 (806) Q Consensus 332 L~~nA~lLE~tL~dFGVe~~Vv~v~pGPVVTrYEi~PApGVKvSKI~nLadDIA~aLsa~svRIapIPGK~~VGIEiPN~ 411 (806) .+.-++.+-+-..+||++-.+ ..|=++++|.+= +| ...|.| T Consensus 17 ~~~A~qrl~dmI~~~G~~~~~---~~~~~i~ry~~F--------------~d------------------~~fG~e---- 57 (361) T smart00763 17 AANAHQRLLDMIGEAGVEDVV---ENNRGIKRYRFF--------------DH------------------DFFGME---- 57 (361) T ss_pred HHHHHHHHHHHHHCCCEEEEC---CCCCEEEECCCH--------------HH------------------HHCCCH---- T ss_conf 876999999998667918813---799607855011--------------01------------------311648---- Q ss_pred CCCEEEEHHHCCCHHHHHCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHH-HHCCHHHEEEE Q ss_conf 66238734230721343022100021045666666785410020235530477406799999999999-82995784788 Q gi|254780799|r 412 IRETVMLRDLIVSRVFEKNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLL-YRMTPAQCRLI 490 (806) Q Consensus 412 ~r~~V~lreil~s~~f~~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlL-yk~~P~evkli 490 (806) +.|++ |- ..--.-|.|-...-+ =||.-|-+|||||-.++.+...|= |-.+|+--+.- T Consensus 58 --------~~i~~--~V--~~~k~AA~g~~~~k~----------IllL~GPvGsGKStl~~~Lk~~lE~ys~t~eG~~Y~ 115 (361) T smart00763 58 --------EAIER--FV--NYFKSAAQGLEERKQ----------ILYLLGPVGGGKSSLVECLKRGLEEYSKTPEGRRYT 115 (361) T ss_pred --------HHHHH--HH--HHHHHHHHCCCCCCE----------EEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCEEE T ss_conf --------99999--99--999999844671256----------999988998877999999999999862677685599 Q ss_pred E Q ss_conf 8 Q gi|254780799|r 491 M 491 (806) Q Consensus 491 l 491 (806) + T Consensus 116 ~ 116 (361) T smart00763 116 F 116 (361) T ss_pred E T ss_conf 9 No 450 >PRK00098 ribosome-associated GTPase; Reviewed Probab=55.41 E-value=7.7 Score=18.42 Aligned_cols=94 Identities=18% Similarity=0.187 Sum_probs=51.6 Q ss_pred EEEECCCCHHHHHHHHHHHHHHH------------HCCHHHEEEEEECCCHHHHHHCCCCHHHHCCCCCCHHH---HHHH Q ss_conf 55304774067999999999998------------29957847888523100111027703431223343045---6689 Q gi|254780799|r 458 LIAGTTGSGKSVAINTMILSLLY------------RMTPAQCRLIMIDPKMLELSVYDGIPNLLTPVVTNPQK---AVTV 522 (806) Q Consensus 458 LIAGtTGSGKSV~iN~iI~SlLy------------k~~P~evkliliDPK~vEls~Y~~iPHLl~pVvTd~~k---A~~a 522 (806) ..+|..|.|||--||+++-..-. |||--..+|+..+ +|. =.=||=+.+..- -... T Consensus 168 v~~G~SGVGKSSLiN~L~~~~~~~t~~is~~~~rGrHTTt~~~l~~l~---------~G~-iiDTPG~r~~~l~~~~~~~ 237 (298) T PRK00098 168 VLAGQSGVGKSTLLNALVPELELKTGEISEALGRGKHTTTHVELYDLP---------GGL-LIDTPGFSSFGLEHLEAED 237 (298) T ss_pred EEECCCCCCHHHHHHHHCCHHHHHHHHHHHHCCCCCEEEEEEEEEECC---------CCE-EEECCCCCCCCCCCCCHHH T ss_conf 998789887888887607144456675566518972133046899869---------975-9818987755667799799 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 999999999999999870899689999999988744786 Q gi|254780799|r 523 LKWLVCEMEERYQKMSKIGVRNIDGFNLKVAQYHNTGKK 561 (806) Q Consensus 523 L~w~V~EMe~RY~l~a~~~vRni~~yN~k~~~~~~~~~~ 561 (806) |.++-.|++.--....-..++++.+=+=.|++|-++|+- T Consensus 238 l~~~F~e~~~~~~~CkF~dC~H~~EPgCaV~~Ave~g~I 276 (298) T PRK00098 238 LEHAFPEFRPLSGECKFRNCTHTHEPGCAVKAAVEEGEI 276 (298) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHCCCC T ss_conf 988678899986889893898898988588999985998 No 451 >PRK04322 peptidyl-tRNA hydrolase; Provisional Probab=55.31 E-value=14 Score=16.44 Aligned_cols=47 Identities=26% Similarity=0.494 Sum_probs=24.3 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCEEEE------EEEECCEEEEEEEEECCCCCHHHHHH Q ss_conf 25665457899999974248632899------84104424444432147863999997 Q gi|254780799|r 328 SPKVMQNNACTLKSVLSDFGIQGEIV------NVRPGPVITLYELEPAPGIKSSRIIG 379 (806) Q Consensus 328 s~eeL~~nA~lLE~tL~dFGVe~~Vv------~v~pGPVVTrYEi~PApGVKvSKI~n 379 (806) |+++|.+ |.++.++.|+...++ ++-|| ++|---|-|+|--++.+|++ T Consensus 59 ~e~~L~~----l~~~a~~~gl~~~~V~DAG~Tei~pg-t~TvlaiGP~p~~~id~itg 111 (116) T PRK04322 59 SEEELLE----LKKKAERLGLPTALIRDAGLTQIPPG-TVTALGIGPAPEELIDKITG 111 (116) T ss_pred CHHHHHH----HHHHHHHCCCCEEEEECCCCCCCCCC-CEEEEEECCCCHHHHHHHHC T ss_conf 9999999----99999987996899976887353899-85899988788889888738 No 452 >KOG3054 consensus Probab=55.25 E-value=8.5 Score=18.10 Aligned_cols=14 Identities=21% Similarity=0.373 Sum_probs=5.2 Q ss_pred EHHHCCCHHHHHCC Q ss_conf 34230721343022 Q gi|254780799|r 418 LRDLIVSRVFEKNQ 431 (806) Q Consensus 418 lreil~s~~f~~s~ 431 (806) ..|+..-.+|.+.. T Consensus 259 ~eEl~AVAkfIkqr 272 (299) T KOG3054 259 MEELAAVAKFIKQR 272 (299) T ss_pred HHHHHHHHHHHHHC T ss_conf 99999999999974 No 453 >cd03255 ABC_MJ0796_Lo1CDE_FtsE This family is comprised of MJ0796 ATP-binding cassette, macrolide-specific ABC-type efflux carrier (MacAB), and proteins involved in cell division (FtsE), and release of liporoteins from the cytoplasmic membrane (LolCDE). They are clustered together phylogenetically. MacAB is an exporter that confers resistance to macrolides, while the LolCDE system is not a transporter at all. An FtsE null mutants showed filamentous growth and appeared viable on high salt medium only, indicating a role for FtsE in cell division and/or salt transport. The LolCDE complex catalyses the release of lipoproteins from the cytoplasmic membrane prior to their targeting to the outer membrane. Probab=55.17 E-value=14 Score=16.42 Aligned_cols=156 Identities=19% Similarity=0.318 Sum_probs=76.1 Q ss_pred EEEECCCCCEEEEEECCCCHHHHHHHHHHHHHH--------------HHCCHH--------HEEEEEECCCHH-HHHHCC Q ss_conf 785410020235530477406799999999999--------------829957--------847888523100-111027 Q gi|254780799|r 447 IIADLARMPHLLIAGTTGSGKSVAINTMILSLL--------------YRMTPA--------QCRLIMIDPKML-ELSVYD 503 (806) Q Consensus 447 vv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlL--------------yk~~P~--------evkliliDPK~v-Els~Y~ 503 (806) +-.++.+==.+-|.|..|||||..++. |.-|+ ++.+|. .+-++.=||... .|++|+ T Consensus 23 isl~i~~Ge~~~iiG~sGsGKTTll~~-i~Gl~~p~~G~I~~~g~~i~~~~~~~~~~~rr~~Ig~v~Q~~~L~~~ltV~e 101 (218) T cd03255 23 VSLSIEKGEFVAIVGPSGSGKSTLLNI-LGGLDRPTSGEVRVDGTDISKLSEKELAAFRRRHIGFVFQSFNLLPDLTALE 101 (218) T ss_pred EEEEECCCCEEEEECCCCCHHHHHHHH-HHCCCCCCCEEEEECCEECCCCCHHHHHHHHHCCEEEECCCCCCCCCCCHHH T ss_conf 289986998999999999869999999-9669999964999999998879989999986504789866752155643999 Q ss_pred CCHHHHCCCCCCHHHHHHHHHHHHHHHHHH-HHHHH--------HCCCCCHHHHH-HHHHHHHHCCCCCCCCCCCCCCCC Q ss_conf 703431223343045668999999999999-99998--------70899689999-999988744786677544677654 Q gi|254780799|r 504 GIPNLLTPVVTNPQKAVTVLKWLVCEMEER-YQKMS--------KIGVRNIDGFN-LKVAQYHNTGKKFNRTVQTGFDRK 573 (806) Q Consensus 504 ~iPHLl~pVvTd~~kA~~aL~w~V~EMe~R-Y~l~a--------~~~vRni~~yN-~k~~~~~~~~~~~~~~~~~~~~~~ 573 (806) .|- +....+... -.|+++| ++++. ..-++++.|=+ +|+.-|++- T Consensus 102 ni~--~~~~~~~~~---------~~~~~~~v~~~l~~l~l~~~~~~~~~~LSGG~kQRv~iAraL--------------- 155 (218) T cd03255 102 NVE--LPLLLAGVP---------KKERRERAEELLERVGLGDRLNHYPSELSGGQQQRVAIARAL--------------- 155 (218) T ss_pred HHH--HHHHHCCCC---------HHHHHHHHHHHHHHCCCHHHHHCCCCCCCHHHHHHHHHHHHH--------------- T ss_conf 999--999984999---------899999999876767937887388763899999999999998--------------- Q ss_pred CCCCCCCCCCCCCCCCCEEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEE Q ss_conf 54322233223234698687763446-88887321005889999998664142379999657775355435541102515 Q gi|254780799|r 574 TGEAIYETEHFDFQHMPYIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRI 652 (806) Q Consensus 574 ~~~~~~~~~~~~~~~lp~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~ri 652 (806) -.=|- |++-||= +-|=-.+..+|-+.+.+|+ +--|+..|++|..+ |++ .+-.|| T Consensus 156 -------------~~~P~-llllDEPTs~LD~~~~~~i~~~l~~l~---~~~~~tii~itHd~--~~~------~~aDrv 210 (218) T cd03255 156 -------------ANDPK-IILADEPTGNLDSETGKEVMELLRELN---KEAGTTIVVVTHDP--ELA------EYADRI 210 (218) T ss_pred -------------HCCCC-EEEEECCCCCCCHHHHHHHHHHHHHHH---HHCCCEEEEECCCH--HHH------HHCCEE T ss_conf -------------55999-999818887689999999999999999---96298999989688--999------869989 Q ss_pred EE Q ss_conf 87 Q gi|254780799|r 653 SF 654 (806) Q Consensus 653 af 654 (806) .+ T Consensus 211 ~~ 212 (218) T cd03255 211 IE 212 (218) T ss_pred EE T ss_conf 99 No 454 >cd04154 Arl2 Arl2 subfamily. Arl2 (Arf-like 2) GTPases are members of the Arf family that bind GDP and GTP with very low affinity. Unlike most Arf family proteins, Arl2 is not myristoylated at its N-terminal helix. The protein PDE-delta, first identified in photoreceptor rod cells, binds specifically to Arl2 and is structurally very similar to RhoGDI. Despite the high structural similarity between Arl2 and Rho proteins and between PDE-delta and RhoGDI, the interactions between the GTPases and their effectors are very different. In its GTP bound form, Arl2 interacts with the protein Binder of Arl2 (BART), and the complex is believed to play a role in mitochondrial adenine nucleotide transport. In its GDP bound form, Arl2 interacts with tubulin- folding Cofactor D; this interaction is believed to play a role in regulation of microtubule dynamics that impact the cytoskeleton, cell division, and cytokinesis. Probab=55.10 E-value=10 Score=17.49 Aligned_cols=23 Identities=22% Similarity=0.407 Sum_probs=18.9 Q ss_pred CCEEEEEECCCCHHHHHHHHHHH Q ss_conf 20235530477406799999999 Q gi|254780799|r 454 MPHLLIAGTTGSGKSVAINTMIL 476 (806) Q Consensus 454 MPHLLIAGtTGSGKSV~iN~iI~ 476 (806) =..+++.|-.|||||-.+|.+.- T Consensus 14 ~~KililG~~~sGKTsll~~l~~ 36 (173) T cd04154 14 EMRILILGLDNAGKTTILKKLLG 36 (173) T ss_pred CEEEEEECCCCCCHHHHHHHHCC T ss_conf 31899998999788999999839 No 455 >PRK09598 lipid A phosphoethanolamine transferase; Reviewed Probab=54.80 E-value=14 Score=16.38 Aligned_cols=20 Identities=25% Similarity=0.252 Sum_probs=11.1 Q ss_pred HHHHHHHCCCCEEEEEEEEC Q ss_conf 99997424863289984104 Q gi|254780799|r 339 LKSVLSDFGIQGEIVNVRPG 358 (806) Q Consensus 339 LE~tL~dFGVe~~Vv~v~pG 358 (806) +-+.|+.=|+++-=.+-.-| T Consensus 296 l~dvl~~aG~~v~W~~N~~g 315 (522) T PRK09598 296 LPTYLTRAGIKVFWRSANDG 315 (522) T ss_pred HHHHHHHCCCEEEEEECCCC T ss_conf 99999977977999867889 No 456 >cd03233 ABC_PDR_domain1 The pleiotropic drug resistance (PDR) family of ATP-binding cassette (ABC) transporters. PDR is a well-described phenomenon occurring in fungi and shares several similarities with processes in bacteria and higher eukaryotes. This PDR subfamily represents domain I of its (ABC-IM)2 organization. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds including sugars, ions, peptides, and more complex organic molecules. The nucleotide-binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=54.47 E-value=10 Score=17.57 Aligned_cols=20 Identities=30% Similarity=0.546 Sum_probs=17.1 Q ss_pred EEEEEECCCCHHHHHHHHHH Q ss_conf 23553047740679999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMI 475 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI 475 (806) .+.+-|..|||||--+|+|- T Consensus 35 i~~llG~nGsGKSTLl~~l~ 54 (202) T cd03233 35 MVLVLGRPGSGCSTLLKALA 54 (202) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 99999899998899999983 No 457 >PRK12724 flagellar biosynthesis regulator FlhF; Provisional Probab=54.45 E-value=15 Score=16.34 Aligned_cols=66 Identities=23% Similarity=0.305 Sum_probs=50.0 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHH----HHHHCCCCHHHHCCCCCCHHHHHHHH Q ss_conf 35530477406799999999999829957847888523100----11102770343122334304566899 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKML----ELSVYDGIPNLLTPVVTNPQKAVTVL 523 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~v----Els~Y~~iPHLl~pVvTd~~kA~~aL 523 (806) ...-|-||.||---|==|=....++.. ..|-||=+|-=|+ -|-.|-+|=.+=.-||.||++...+| T Consensus 226 i~lVGPTGVGKTTTiAKLAA~~~l~~~-kkVaLIT~DTYRIgAvEQLktYa~Il~iPv~vv~~~~el~~al 295 (432) T PRK12724 226 VFFVGPTGSGKTTSIAKLAAKYFLHMG-KSVSLYTTDNYRIAAIEQLKRYADTMGMPFYPVKDIKKFKETL 295 (432) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHHCC-CEEEEEECCCCCHHHHHHHHHHHHHHCCCEEEEECHHHHHHHH T ss_conf 999899998889999999999999749-9279995266537799999999998599459951899999999 No 458 >PRK11147 ABC transporter ATPase component; Reviewed Probab=54.42 E-value=15 Score=16.34 Aligned_cols=62 Identities=26% Similarity=0.500 Sum_probs=35.0 Q ss_pred CCEEEEE----CCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCCCCHHHHCC Q ss_conf 6667854----1002023553047740679999999999982995784788852310011102770343122 Q gi|254780799|r 444 GKPIIAD----LARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYDGIPNLLTP 511 (806) Q Consensus 444 G~pvv~D----LakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~~iPHLl~p 511 (806) |++++-| +.+==++-|.|.-|||||-.++.|. .+ .+|+.=+. -+.++ |.++-|+.-...|.| T Consensus 331 ~~~~l~~vsl~i~~Ge~ialvG~NGsGKSTLlk~l~-G~---l~p~~G~i-~~g~~-~~i~y~~Q~~~~l~~ 396 (632) T PRK11147 331 GKQLVKDFSAQVQRGDKIALIGPNGCGKTTLLKLML-GQ---LQADSGRI-HCGTK-LEVAYFDQYRAELDP 396 (632) T ss_pred CEEEEEEECCCCCCCCEEEEECCCCCCHHHHHHHHH-CC---CCCCCCEE-EECCC-CEEEEEHHHHHHCCC T ss_conf 976776533335788779998898842779999860-66---68998779-98998-707755154764597 No 459 >PRK10875 recD exonuclease V subunit alpha; Provisional Probab=54.41 E-value=15 Score=16.34 Aligned_cols=58 Identities=22% Similarity=0.283 Sum_probs=32.7 Q ss_pred EEEEHHHCC----CHHHH-HCCCCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEE Q ss_conf 387342307----21343-0221000210456666667854100202355304774067999999999998299578478 Q gi|254780799|r 415 TVMLRDLIV----SRVFE-KNQCDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRL 489 (806) Q Consensus 415 ~V~lreil~----s~~f~-~s~~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkl 489 (806) ...+.-+|. +..|. +.+.||+ .+.+|+|=|.|= ..-++.=|.++-|..+|+ T Consensus 235 a~TiHRLLg~~p~~~~f~~~~~nPL~--------~DvlIVDEASMV----------------Dl~Lm~~LL~Alp~~aRL 290 (607) T PRK10875 235 ASTLHRLLGAQPGSQRLRYHAGNPLH--------LDVLVVDEASMI----------------DLPMMSRLIDALPDHARV 290 (607) T ss_pred CEEHHHHCCCCCCCCCCCCCCCCCCC--------CCEEEEECCHHH----------------HHHHHHHHHHHCCCCCEE T ss_conf 66589752967898765657799998--------898999073366----------------599999999828999889 Q ss_pred EEE-CCCH Q ss_conf 885-2310 Q gi|254780799|r 490 IMI-DPKM 496 (806) Q Consensus 490 ili-DPK~ 496 (806) ||+ ||.. T Consensus 291 ILvGD~dQ 298 (607) T PRK10875 291 IFLGDRDQ 298 (607) T ss_pred EEECCHHH T ss_conf 99656232 No 460 >PRK08533 flagellar accessory protein FlaH; Reviewed Probab=54.40 E-value=15 Score=16.34 Aligned_cols=27 Identities=26% Similarity=0.410 Sum_probs=20.1 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHC Q ss_conf 235530477406799999999999829 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMILSLLYRM 482 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI~SlLyk~ 482 (806) -.||.|..|+|||+.---++-..|-+- T Consensus 26 ~~li~G~~GtGKsi~~~~~~~~~l~~g 52 (230) T PRK08533 26 IILIEGDESTGKSILSQRLAYGFLQNG 52 (230) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHHCC T ss_conf 899986899878999999999998789 No 461 >pfam03348 Serinc Serine incorporator (Serinc). This is a family of eukaryotic membrane proteins which incorporate serine into membranes and facilitate the synthesis of the serine-derived lipids phosphatidylserine and sphingolipid. Members of this family contain 11 transmembrane domains and form intracellular complexes with key enzymes involved in serine and sphingolipid biosynthesis. Probab=54.31 E-value=15 Score=16.33 Aligned_cols=69 Identities=20% Similarity=0.273 Sum_probs=33.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCC---CC--CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999844270---68--96313688645000134678999999999999999999999999999 Q gi|254780799|r 25 KMKIVAGLILLCTVFAITLALGTWDV---YD--PSFSYITLRSPKNFLGYGGAIFADVAIQFFGIASVFFLPPPTMWALS 99 (806) Q Consensus 25 rl~Ei~Gl~Li~~al~l~iSL~SYsp---~D--Ps~~~~s~~~v~N~~G~lGA~iAd~L~~lFG~~Ayllpl~Ll~~g~~ 99 (806) ..|...++++++..+...+.|..+-. .+ |.+... +-+..+..|..+.| ...||.+-|.+.+.++..|.+ T Consensus 24 ~tRi~Ya~~ll~~~~~s~iml~~~~~~~l~~~~~~~~~~-~c~~~~c~G~~aVy-----Rv~f~~~~Ff~l~~l~~i~v~ 97 (426) T pfam03348 24 STRIAYALILLLNSIVSWIMLSPGVIKKLKKKIPGFCGV-DCPGSECVGYSAVY-----RVCFALALFFLILALLMIGVK 97 (426) T ss_pred HHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHCCCCCCC-CCCCCCCEEEEHHH-----HHHHHHHHHHHHHHHHHHCCC T ss_conf 999999999999999999981552899998727774157-88976403221036-----786899999999999993758 No 462 >pfam03215 Rad17 Rad17 cell cycle checkpoint protein. Probab=54.30 E-value=10 Score=17.55 Aligned_cols=13 Identities=23% Similarity=0.123 Sum_probs=7.6 Q ss_pred CCCCHHHHHHHHH Q ss_conf 3343045668999 Q gi|254780799|r 512 VVTNPQKAVTVLK 524 (806) Q Consensus 512 VvTd~~kA~~aL~ 524 (806) --+|-.+++.|+. T Consensus 362 F~sdIdd~~~a~~ 374 (490) T pfam03215 362 FCSNISDARYVAD 374 (490) T ss_pred HHHCHHHHHHHHH T ss_conf 5505999999997 No 463 >PRK11308 dppF dipeptide transporter ATP-binding subunit; Provisional Probab=54.24 E-value=10 Score=17.52 Aligned_cols=67 Identities=16% Similarity=0.239 Sum_probs=35.3 Q ss_pred EEEHHHHH-HHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEEEECCCC----CCHHHCC Q ss_conf 77634468-888732100588999999866414237999965777535543554110251587645866----4212338 Q gi|254780799|r 593 VVVIDEMA-DLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISFQVSSKI----DSRTILG 667 (806) Q Consensus 593 vviiDEla-Dlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf~v~s~~----dSrtild 667 (806) ++|-||=- -|=.+...+|-+.+.+|. +--|+-+|+-|..-++ =+.+--||+.--.-+| +...|+. T Consensus 175 lLi~DEPTsaLD~~~q~~Il~ll~~l~---~~~~~t~l~ITHDl~~-------v~~iaDri~VMy~G~IVE~G~~~~v~~ 244 (327) T PRK11308 175 VVVADEPVSALDVSVQAQVLNLMMDLQ---QELGLSYVFISHDLSV-------VEHIADDVMVMYLGRCVEKGTKEQIFN 244 (327) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHH---HHCCCEEEEECCCHHH-------HHHHCCEEEEEECCEEEEECCHHHHHH T ss_conf 999847865469999999999999999---7009769998698999-------998699899998988999788999972 Q ss_pred CC Q ss_conf 86 Q gi|254780799|r 668 EQ 669 (806) Q Consensus 668 ~~ 669 (806) .+ T Consensus 245 ~P 246 (327) T PRK11308 245 NP 246 (327) T ss_pred CC T ss_conf 79 No 464 >PRK11248 tauB taurine transporter ATP-binding subunit; Provisional Probab=54.20 E-value=10 Score=17.50 Aligned_cols=152 Identities=22% Similarity=0.359 Sum_probs=68.3 Q ss_pred CCEEEEEC----CCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHH-----HH-HCCCCHHHHCCCC Q ss_conf 66678541----00202355304774067999999999998299578478885231001-----11-0277034312233 Q gi|254780799|r 444 GKPIIADL----ARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLE-----LS-VYDGIPNLLTPVV 513 (806) Q Consensus 444 G~pvv~DL----akMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vE-----ls-~Y~~iPHLl~pVv 513 (806) |.+++-|+ .+==-+-|-|-.|||||.-+|. |+.| ++ |+.=. |.+|=+-++ -+ +|.+ ++| .|-- T Consensus 13 ~~~~L~dvsl~i~~Ge~~~lvGpnGaGKSTLl~~-i~Gl-~~--p~~G~-I~~~G~~i~~~~~~~g~vfQ~-~~l-~p~~ 85 (255) T PRK11248 13 GKPALEDINLTLESGELLVVLGPSGCGKTTLLNL-IAGF-VP--PQHGS-ITLDGKPVEGPGAERGVVFQN-EGL-LPWR 85 (255) T ss_pred CEEEEECEEEEECCCCEEEEECCCCCHHHHHHHH-HHCC-CC--CCCCE-EEECCCCCCCCCHHCEEEECC-CCC-CCCC T ss_conf 9988813177986998999999998469999999-9759-98--89971-857996478862110699455-754-7568 Q ss_pred CCHHHHHHHHHHH---HHHHHHHH-HHHHHCCCCCHHH---------HHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 4304566899999---99999999-9998708996899---------999999887447866775446776545432223 Q gi|254780799|r 514 TNPQKAVTVLKWL---VCEMEERY-QKMSKIGVRNIDG---------FNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYE 580 (806) Q Consensus 514 Td~~kA~~aL~w~---V~EMe~RY-~l~a~~~vRni~~---------yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 580 (806) |=-...+.+|+.. -.|++.|. +++...|..+... --+|+.-|++- T Consensus 86 tv~env~~~l~~~g~~~~~~~~~~~~~L~~vgL~~~~~~~p~~LSGGqkQRVaiArAL---------------------- 143 (255) T PRK11248 86 NVQDNVAFGLQLAGVEKMQRLEIAHQMLKKVGLEGAEKRYIWQLSGGQRQRVGIARAL---------------------- 143 (255) T ss_pred CHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHCCCHHHCCHHHHHHHHHHHHH---------------------- T ss_conf 7999999899874898789999999999976990244189334999999999999999---------------------- Q ss_pred CCCCCCCCCCEEEEEHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECC Q ss_conf 322323469868776344-688887321005889999998664142379999657 Q gi|254780799|r 581 TEHFDFQHMPYIVVVIDE-MADLMMVARKDIESAVQRLAQMARASGIHVIMATQR 634 (806) Q Consensus 581 ~~~~~~~~lp~ivviiDE-laDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqr 634 (806) -.=|- |++-|| ++-|=.....++.+.|.+|. +..|+-.|+.|.. T Consensus 144 ------~~~P~-iLllDEPt~~LD~~~r~~l~~ll~~l~---~~~g~Til~vTHd 188 (255) T PRK11248 144 ------AANPQ-LLLLDEPFGALDAFTREQMQELLLKLW---QETGKQVLLITHD 188 (255) T ss_pred ------HCCCC-EEEEECCCCCCCHHHHHHHHHHHHHHH---HHCCCEEEEECCC T ss_conf ------72999-999808877799899999999999999---9619999998868 No 465 >pfam03029 ATP_bind_1 Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesized ATP binding activity. Probab=54.13 E-value=15 Score=16.31 Aligned_cols=49 Identities=29% Similarity=0.460 Sum_probs=34.5 Q ss_pred EEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEEECCCHHHHHHCC---CCHHHHC Q ss_conf 530477406799999999999829957847888523100111027---7034312 Q gi|254780799|r 459 IAGTTGSGKSVAINTMILSLLYRMTPAQCRLIMIDPKMLELSVYD---GIPNLLT 510 (806) Q Consensus 459 IAGtTGSGKSV~iN~iI~SlLyk~~P~evkliliDPK~vEls~Y~---~iPHLl~ 510 (806) |.|.-|||||-..++|---+--. -..|-.|=-||-- |--.|+ ||-.+++ T Consensus 1 ViGpaGSGKTT~~~~l~~~l~~~--~r~~~vvNLDPA~-e~~pY~~~iDIrd~i~ 52 (234) T pfam03029 1 VVGGAGSGKTTFVGALSEILPLR--GRSVYVVNLDPAA-ENLPYEADIDIRELIT 52 (234) T ss_pred CCCCCCCCHHHHHHHHHHHHHHC--CCCEEEEECCCCC-CCCCCCCCCCHHHHCC T ss_conf 98989898899999999999977--9975999789866-5899987771787467 No 466 >COG0210 UvrD Superfamily I DNA and RNA helicases [DNA replication, recombination, and repair] Probab=54.00 E-value=15 Score=16.29 Aligned_cols=35 Identities=20% Similarity=0.292 Sum_probs=14.2 Q ss_pred CCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHH Q ss_conf 00202355304774067999999999998299578 Q gi|254780799|r 452 ARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQ 486 (806) Q Consensus 452 akMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~e 486 (806) ..+|...+.|.|.-=..--|+.++.-+..-..|++ T Consensus 372 ~~ipy~i~~gg~~f~~r~eikd~l~~l~~~~n~~d 406 (655) T COG0210 372 AGIPYRIVIGGTSFFERKEIKDLLAYLRLVLNPDD 406 (655) T ss_pred CCCCEEEECCCHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 28974775370766516999999999732347675 No 467 >PRK09765 2-O-a-mannosyl-D-glycerate specific PTS transporten components IIABC; Provisional Probab=53.94 E-value=15 Score=16.29 Aligned_cols=15 Identities=20% Similarity=0.040 Sum_probs=7.4 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 999999999999998 Q gi|254780799|r 87 VFFLPPPTMWALSLL 101 (806) Q Consensus 87 yllpl~Ll~~g~~ll 101 (806) +.+|++-.|.++.+- T Consensus 345 lmvPvla~yIAySIa 359 (638) T PRK09765 345 LMVPVLAAYTAYSLA 359 (638) T ss_pred HHHHHHHHHHHHHHC T ss_conf 999999999999862 No 468 >KOG3817 consensus Probab=53.92 E-value=15 Score=16.28 Aligned_cols=13 Identities=15% Similarity=0.559 Sum_probs=5.6 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9999999999999 Q gi|254780799|r 172 MILFLAMSWLLIY 184 (806) Q Consensus 172 iil~l~is~l~i~ 184 (806) ..++..+.|.... T Consensus 289 ~~l~~pIrw~~~~ 301 (452) T KOG3817 289 VALYFPIRWTNQI 301 (452) T ss_pred HHHHCCHHHHHHH T ss_conf 8882637899999 No 469 >cd01849 YlqF_related_GTPase YlqF-related GTPases. These proteins are found in bacteria, eukaryotes, and archaea. They all exhibit a circular permutation of the GTPase signature motifs so that the order of the conserved G box motifs is G4-G5-G1-G2-G3, with G4 and G5 being permuted from the C-terminal region of proteins in the Ras superfamily to the N-terminus of YlqF-related GTPases. Probab=53.90 E-value=10 Score=17.50 Aligned_cols=20 Identities=30% Similarity=0.479 Sum_probs=13.7 Q ss_pred EEEEEECCCCHHHHHHHHHH Q ss_conf 23553047740679999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTMI 475 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~iI 475 (806) -+.|-|-...|||-.||+++ T Consensus 102 ~i~ivG~PNVGKSsliN~L~ 121 (155) T cd01849 102 TVGVIGYPNVGKSSVINALL 121 (155) T ss_pred EEEEECCCCCCHHHHHHHHH T ss_conf 89998777744778999984 No 470 >pfam03297 Ribosomal_S25 S25 ribosomal protein. Probab=53.90 E-value=15 Score=16.35 Aligned_cols=58 Identities=24% Similarity=0.312 Sum_probs=46.2 Q ss_pred CHHHHHHHHHHHHCCCEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHCCCCCEEE Q ss_conf 7038999999996598500014222001177899999999977986802278872673 Q gi|254780799|r 740 DDLYKQAVDIVLRDNKASISYIQRRLGIGYNRAASIIENMEEKGVIGPASSTGKREIL 797 (806) Q Consensus 740 d~l~~~a~~~v~~~~~~s~s~lqr~~~igy~raar~~~~~e~~giv~~~~g~~~r~vl 797 (806) -+.|++-..=|-...-.+.|.+--||+|+=.=|-+.+..||++|+|-|-.....-.|+ T Consensus 43 k~tydKl~kEVPk~KlITpsvvseRlKI~gSLAR~aL~~L~~kGlIk~V~~h~~q~IY 100 (104) T pfam03297 43 KATYDKLLKEVPKYKLITPSVLSDRLKINGSLARRALRELEEKGLIKPVVKHSAQLIY 100 (104) T ss_pred HHHHHHHHHHCCCCEEECHHHHHHHHHCCHHHHHHHHHHHHHCCCEEEEECCCCEEEE T ss_conf 9999999986676637739997777602189999999999988998877115760788 No 471 >COG1201 Lhr Lhr-like helicases [General function prediction only] Probab=53.89 E-value=15 Score=16.28 Aligned_cols=66 Identities=18% Similarity=0.280 Sum_probs=27.7 Q ss_pred EEEEECCCHHHHHHCCC----CHHH---HCCCCCCHHHHHHHHHH--------HHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 78885231001110277----0343---12233430456689999--------999999999999870899689999999 Q gi|254780799|r 488 RLIMIDPKMLELSVYDG----IPNL---LTPVVTNPQKAVTVLKW--------LVCEMEERYQKMSKIGVRNIDGFNLKV 552 (806) Q Consensus 488 kliliDPK~vEls~Y~~----iPHL---l~pVvTd~~kA~~aL~w--------~V~EMe~RY~l~a~~~vRni~~yN~k~ 552 (806) |++=||...|-.+.-+| ||-- ..||=+|--+.+..++- +.+.--..|..- +....+|.+|+... T Consensus 511 rv~~i~~~~v~V~~a~~~~~~iPsW~Ge~ip~~~ela~~V~~~r~~~l~~g~~~~~~l~~~~~~d-~~~~~~i~~~~~eq 589 (814) T COG1201 511 RVLEIDGNKVIVKPAEGAKPTIPSWFGEMLPLSFELAQDVGRFRLELLLTGKAAAEELAEKYGID-ENAAKAIIQYLREQ 589 (814) T ss_pred EEEEEECCEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCC-HHHHHHHHHHHHHH T ss_conf 99998688799998889999897645888897899999999998876630336799887622543-24478999999999 Q ss_pred HH Q ss_conf 98 Q gi|254780799|r 553 AQ 554 (806) Q Consensus 553 ~~ 554 (806) .+ T Consensus 590 ~~ 591 (814) T COG1201 590 RE 591 (814) T ss_pred HH T ss_conf 87 No 472 >TIGR03415 ABC_choXWV_ATP choline ABC transporter, ATP-binding protein. Members of this protein family are the ATP-binding subunit of a three-protein transporter. This family belongs, more broadly, to the family of proline and glycine-betaine transporters, but members have been identified by direct characterization and by bioinformatic means as choline transporters. Many species have several closely-related members of this family, probably with variable abilities to act additionally on related quaternary amines. Probab=53.83 E-value=13 Score=16.73 Aligned_cols=107 Identities=20% Similarity=0.203 Sum_probs=51.7 Q ss_pred CCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCHHHEEEEE------ECCCHH---HH--------H Q ss_conf 045666666785410020235530477406799999999999829957847888------523100---11--------1 Q gi|254780799|r 438 LGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMTPAQCRLIM------IDPKML---EL--------S 500 (806) Q Consensus 438 LGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~P~evklil------iDPK~v---El--------s 500 (806) -|..++=+-+-.++.+==-+-|-|-.|||||..++.|. -| ..|+.=+... +|+... ++ + T Consensus 34 tg~~vAV~dvsl~I~~GEi~~lvGpSGsGKSTLLr~i~-GL---~~pt~G~I~i~~~~~~~d~~~~~~~~lr~~R~~~Ig 109 (382) T TIGR03415 34 TGLVVGVANASLDIEEGEICVLMGLSGSGKSSLLRAVN-GL---NPVSRGSVLVKDGDGSIDVANCDAATLRRLRTHRVS 109 (382) T ss_pred HCCEEEEECCEEEECCCCEEEEECCCCCHHHHHHHHHH-CC---CCCCCEEEEEECCCCCCCCCCCCHHHHHHHCCCCEE T ss_conf 39999896517488799899999999734999999997-59---998852999926864224565998998763057669 Q ss_pred -HCCCCHHHHCCCCCCHHHHHHHHHH-HHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf -0277034312233430456689999-9999999999998708996899999 Q gi|254780799|r 501 -VYDGIPNLLTPVVTNPQKAVTVLKW-LVCEMEERYQKMSKIGVRNIDGFNL 550 (806) Q Consensus 501 -~Y~~iPHLl~pVvTd~~kA~~aL~w-~V~EMe~RY~l~a~~~vRni~~yN~ 550 (806) +|.+ +-|-|.-|=-+..+.+|+- -+.+-|+|-+....+.-=++++|-+ T Consensus 110 mVFQ~--~aL~P~~TV~eNia~~L~~~g~~~~e~~~rv~e~L~~VgL~~~~~ 159 (382) T TIGR03415 110 MVFQK--FALMPWLTVEENVAFGLEMQGMPEAERRKRVDEQLELVGLAQWAD 159 (382) T ss_pred EEECC--CCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHH T ss_conf 99637--864656819999989998869999999999999987359846554 No 473 >cd03234 ABCG_White The White subfamily represents ABC transporters homologous to the Drosophila white gene, which acts as a dimeric importer for eye pigment precursors. The eye pigmentation of Drosophila is developed from the synthesis and deposition in the cells of red pigments, which are synthesized from guanine, and brown pigments, which are synthesized from tryptophan. The pigment precursors are encoded by the white, brown, and scarlet genes, respectively. Evidence from genetic and biochemical studies suggest that the White and Brown proteins function as heterodimers to import guanine, while the White and Scarlet proteins function to import tryptophan. However, a recent study also suggests that White may be involved in the transport of a metabolite, such as 3-hydroxykynurenine, across intracellular membranes. Mammalian ABC transporters belonging to the White subfamily (ABCG1, ABCG5, and ABCG8) have been shown to be involved in the regulation of lipid-trafficking mechanisms in Probab=53.80 E-value=11 Score=17.41 Aligned_cols=54 Identities=26% Similarity=0.392 Sum_probs=30.6 Q ss_pred CEEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEE Q ss_conf 8687763446-8888732100588999999866414237999965777535543554110251587 Q gi|254780799|r 590 PYIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISF 654 (806) Q Consensus 590 p~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf 654 (806) |- |++-||= +-|=..+..++-..|.+|++. |--.|+.|..|..++ ..+-.||.+ T Consensus 162 P~-illLDEPTsgLD~~~~~~i~~~l~~l~~~----~~~vi~~~H~~~~~~------~~~~Drv~v 216 (226) T cd03234 162 PK-VLILDEPTSGLDSFTALNLVSTLSQLARR----NRIVILTIHQPRSDL------FRLFDRILL 216 (226) T ss_pred CC-EEEEECCCCCCCHHHHHHHHHHHHHHHHC----CCEEEEEECCCHHHH------HHHCCEEEE T ss_conf 99-89995886568999999999999999978----999999988898999------997999999 No 474 >PRK11247 ssuB aliphatic sulfonates transport ATP-binding subunit; Provisional Probab=53.75 E-value=11 Score=17.15 Aligned_cols=54 Identities=19% Similarity=0.387 Sum_probs=31.6 Q ss_pred CEEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCHHHHHCCCCEEEE Q ss_conf 8687763446-8888732100588999999866414237999965777535543554110251587 Q gi|254780799|r 590 PYIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPSVDVITGTIKANFPTRISF 654 (806) Q Consensus 590 p~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPsvdvitg~ikan~p~riaf 654 (806) |- +++-||= +-|=....+++.+.| .+.++.-|+-.|+.|..... =..+-.||.+ T Consensus 152 P~-lLlLDEPtsgLD~~~~~~i~~ll---~~L~~e~g~TIi~vTHdl~e-------a~~laDRI~v 206 (257) T PRK11247 152 PR-LLLLDEPLGALDALTRIEMQDLI---ESLWQQHGFTVLLVTHDVSE-------AVAMADRVLL 206 (257) T ss_pred CC-EEEEECCCCCCCHHHHHHHHHHH---HHHHHHHCCEEEEECCCHHH-------HHHHCCEEEE T ss_conf 99-99980987657999999999999---99999609899998879999-------9996999999 No 475 >cd03219 ABC_Mj1267_LivG_branched The Mj1267/LivG ABC transporter subfamily is involved in the transport of the hydrophobic amino acids leucine, isoleucine and valine. MJ1267 is a branched-chain amino acid transporter with 29% similarity to both the LivF and LivG components of the E. coli branched-chain amino acid transporter. MJ1267 contains an insertion from residues 114 to 123 characteristic of LivG (Leucine-Isoleucine-Valine) homologs. The branched-chain amino acid transporter from E. coli comprises a heterodimer of ABCs (LivF and LivG), a heterodimer of six-helix TM domains (LivM and LivH), and one of two alternative soluble periplasmic substrate binding proteins (LivK or LivJ). Probab=53.66 E-value=9 Score=17.93 Aligned_cols=40 Identities=20% Similarity=0.385 Sum_probs=24.1 Q ss_pred EEEHHHHH-HHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 77634468-888732100588999999866414237999965777 Q gi|254780799|r 593 VVVIDEMA-DLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 593 vviiDEla-Dlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) |++.||=- -|=-.+.++|-+.|..|+ +.|+-.|+.|+... T Consensus 164 lliLDEPT~gLD~~~~~~i~~~l~~l~----~~G~tii~vsHdl~ 204 (236) T cd03219 164 LLLLDEPAAGLNPEETEELAELIRELR----ERGITVLLVEHDMD 204 (236) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHH----HCCCEEEEECCCHH T ss_conf 999948765899999999999999999----65999999917489 No 476 >cd03293 ABC_NrtD_SsuB_transporters NrtD and SsuB are the ATP-binding subunits of the bacterial ABC-type nitrate and sulfonate transport systems, respectively. ABC transporters are a large family of proteins involved in the transport of a wide variety of different compounds, like sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases that contain a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=53.66 E-value=11 Score=17.38 Aligned_cols=143 Identities=21% Similarity=0.294 Sum_probs=65.3 Q ss_pred EECCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCC--------------HHHEEEEEECCCHHH-HHHCCCCHHHHCCCC Q ss_conf 54100202355304774067999999999998299--------------578478885231001-110277034312233 Q gi|254780799|r 449 ADLARMPHLLIAGTTGSGKSVAINTMILSLLYRMT--------------PAQCRLIMIDPKMLE-LSVYDGIPNLLTPVV 513 (806) Q Consensus 449 ~DLakMPHLLIAGtTGSGKSV~iN~iI~SlLyk~~--------------P~evkliliDPK~vE-ls~Y~~iPHLl~pVv 513 (806) .++.+==-+-|-|..|||||..++.| +-|+ +.+ +.++-++.=||-... +++|+.|-.-+ T Consensus 25 l~i~~Ge~~~iiGpsGsGKSTLl~~i-~Gl~-~p~~G~I~~~G~~i~~~~~~ig~vfQ~~~L~p~~tv~eni~~~l---- 98 (220) T cd03293 25 LSVEEGEFVALVGPSGCGKSTLLRII-AGLE-RPTSGEVLVDGEPVTGPGPDRGYVFQQDALLPWLTVLDNVALGL---- 98 (220) T ss_pred EEECCCCEEEEECCCCCHHHHHHHHH-HCCC-CCCCCEEEECCEECCCCCCCEEEEECCCCCCCCCCHHHHHHHHH---- T ss_conf 89879989999999995799999999-7599-98873899999967888988799924885377887999998899---- Q ss_pred CCHHHHHHHHHHHHHHHHHH-HHHHHHCCCCCH--------HHH-HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 43045668999999999999-999987089968--------999-99999887447866775446776545432223322 Q gi|254780799|r 514 TNPQKAVTVLKWLVCEMEER-YQKMSKIGVRNI--------DGF-NLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEH 583 (806) Q Consensus 514 Td~~kA~~aL~w~V~EMe~R-Y~l~a~~~vRni--------~~y-N~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 583 (806) ....|--.|+++| .+++...|..+. .|= -+|+.-|++- T Consensus 99 -------~~~~~~~~~~~~~v~~~l~~~gL~~~~~~~p~~LSGGqkQRvaiARaL------------------------- 146 (220) T cd03293 99 -------ELQGVPKAEARERAEELLELVGLSGFENAYPHQLSGGMRQRVALARAL------------------------- 146 (220) T ss_pred -------HHCCCCHHHHHHHHHHHHHHCCCHHHHHCCHHHCCHHHHHHHHHHHHH------------------------- T ss_conf -------865999899999999999987895476189312999999999999998------------------------- Q ss_pred CCCCCCCEEEEEHHHH-HHHHHHCCCHHHHHHHHHHHHHHHCCEEEEEEECCCC Q ss_conf 3234698687763446-8888732100588999999866414237999965777 Q gi|254780799|r 584 FDFQHMPYIVVVIDEM-ADLMMVARKDIESAVQRLAQMARASGIHVIMATQRPS 636 (806) Q Consensus 584 ~~~~~lp~ivviiDEl-aDlmm~~~~~ve~~i~rlaq~ara~GiHli~aTqrPs 636 (806) -.=|- +++-||= +-|=....++|-+.+.+ ..+--|+-.|+.|..+. T Consensus 147 ---~~~P~-llllDEPts~LD~~~~~~i~~~l~~---l~~~~g~tii~vTHdl~ 193 (220) T cd03293 147 ---AVDPD-VLLLDEPFSALDALTREQLQEELLD---IWRETGKTVLLVTHDID 193 (220) T ss_pred ---HCCCC-EEEEECCCCCCCHHHHHHHHHHHHH---HHHHCCCEEEEECCCHH T ss_conf ---66999-9998088765699999999999999---99851999999888899 No 477 >cd04155 Arl3 Arl3 subfamily. Arl3 (Arf-like 3) is an Arf family protein that differs from most Arf family members in the N-terminal extension. In is inactive, GDP-bound form, the N-terminal extension forms an elongated loop that is hydrophobically anchored into the membrane surface; however, it has been proposed that this region might form a helix in the GTP-bound form. The delta subunit of the rod-specific cyclic GMP phosphodiesterase type 6 (PDEdelta) is an Arl3 effector. Arl3 binds microtubules in a regulated manner to alter specific aspects of cytokinesis via interactions with retinitis pigmentosa 2 (RP2). It has been proposed that RP2 functions in concert with Arl3 to link the cell membrane and the cytoskeleton in photoreceptors as part of the cell signaling or vesicular transport machinery. In mice, the absence of Arl3 is associated with abnormal epithelial cell proliferation and cyst formation. Probab=53.63 E-value=12 Score=17.01 Aligned_cols=25 Identities=24% Similarity=0.497 Sum_probs=20.8 Q ss_pred CCCEEEEEECCCCHHHHHHHHHHHH Q ss_conf 0202355304774067999999999 Q gi|254780799|r 453 RMPHLLIAGTTGSGKSVAINTMILS 477 (806) Q Consensus 453 kMPHLLIAGtTGSGKSV~iN~iI~S 477 (806) +=|-++|.|-.|||||--+|.+... T Consensus 13 ~~~Ki~ilG~~~sGKTsll~~l~~~ 37 (173) T cd04155 13 EEPRILILGLDNAGKTTILKQLASE 37 (173) T ss_pred CCCEEEEECCCCCCHHHHHHHHHCC T ss_conf 7758999979999889999998569 No 478 >cd02552 PseudoU_synth_TruD_like PseudoU_synth_TruD_like: Pseudouridine synthase, TruD family. This group consists of eukaryotic, bacterial and archeal pseudouridine synthases similar to Escherichia coli TruD and Saccharomyces cerevisiae Pus7. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). E. coli TruD and S. cerevisiae Pus7 make psi13 in cytoplasmic tRNAs. In addition S. cerevisiae Pus7 makes psi35 in U2 small nuclear RNA (U2 snRNA) and psi35 in pre-tRNATyr. Psi35 in U2 snRNA and psi13 in tRNAs are highly phylogenetically conserved. Psi34 is the mammalian U2 snRNA counterpart of yeast U2 snRNA psi35. Probab=53.59 E-value=15 Score=16.25 Aligned_cols=110 Identities=25% Similarity=0.349 Sum_probs=70.0 Q ss_pred CCEEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCEEEE------------EEECCCCCCCCCHHHHHCCCCEEEEEE Q ss_conf 98687763446888873210058899999986641423799------------996577753554355411025158764 Q gi|254780799|r 589 MPYIVVVIDEMADLMMVARKDIESAVQRLAQMARASGIHVI------------MATQRPSVDVITGTIKANFPTRISFQV 656 (806) Q Consensus 589 lp~ivviiDElaDlmm~~~~~ve~~i~rlaq~ara~GiHli------------~aTqrPsvdvitg~ikan~p~riaf~v 656 (806) =+|.++.+- -.+.+-.+.+ .+.||+.||+.- +.||+=||.-... .++.. - T Consensus 30 G~~~~~~l~-------K~~~~T~~a~---~~lA~~l~i~~~~igyAG~KDr~AvT~Q~~sv~~~~~-------~~~~~-~ 91 (232) T cd02552 30 GEYLHFTLY-------KENKDTMEAL---REIAKALGVPPRDIGYAGTKDKRAVTTQRVSVHKPGK-------ERLAL-L 91 (232) T ss_pred CCEEEEEEE-------ECCCCHHHHH---HHHHHHHCCCHHHEEECCCCCCCEEEEEEEEEECCCC-------CCCCH-H T ss_conf 788999999-------8789999999---9999981998666304434667610699999867860-------12132-2 Q ss_pred CCCCCCHHHCCCCCHHHHCCCCCEEEECCCCC-EEEEEECCCCHHH-HHHHHHHHHHCCCCCCCCC Q ss_conf 58664212338864578658875477368983-2588833489889-9999999971289742110 Q gi|254780799|r 657 SSKIDSRTILGEQGAEQLLGQGDMLYMTGGGR-VQRIHGPFVSDIE-VEKVVSHLKTQGEAKYIDI 720 (806) Q Consensus 657 ~s~~dSrtild~~gae~Llg~gdml~~~~~~~-~~r~~g~~v~~~e-v~~v~~~~~~q~~~~y~~~ 720 (806) ....+.=.|++.....+=|..||.. |++ -+++.+.=..+.+ +++..+.++.+|-|+|-.. T Consensus 92 ~~~~~~i~i~~~~~~~~~L~~G~l~----GN~F~I~lR~v~~~~~~~~~~~l~~i~~~G~pNYFG~ 153 (232) T cd02552 92 NLELKGIRILEFGRHDKKLRLGDLK----GNRFTITLRGVSPEEDELIERRLESLKEKGFPNYFGL 153 (232) T ss_pred HCCCCCEEEEEEEECCCCCCCCCCC----CCEEEEEEECCCCHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 2467985999988507987777666----8879999922895138999999999985880673357 No 479 >KOG3354 consensus Probab=53.48 E-value=6.7 Score=18.91 Aligned_cols=17 Identities=35% Similarity=0.577 Sum_probs=13.4 Q ss_pred EEEEEECCCCHHHHHHH Q ss_conf 23553047740679999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAIN 472 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN 472 (806) -++|+|..|||||--=- T Consensus 14 ~i~vmGvsGsGKSTigk 30 (191) T KOG3354 14 VIVVMGVSGSGKSTIGK 30 (191) T ss_pred EEEEEECCCCCHHHHHH T ss_conf 59998358877445999 No 480 >smart00173 RAS Ras subfamily of RAS small GTPases. Similar in fold and function to the bacterial EF-Tu GTPase. p21Ras couples receptor Tyr kinases and G protein receptors to protein kinase cascades Probab=53.48 E-value=14 Score=16.49 Aligned_cols=43 Identities=26% Similarity=0.353 Sum_probs=28.9 Q ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCHHH---------EEEEEECCCHHHHHHCC Q ss_conf 355304774067999999999998299578---------47888523100111027 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTMILSLLYRMTPAQ---------CRLIMIDPKMLELSVYD 503 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~iI~SlLyk~~P~e---------vkliliDPK~vEls~Y~ 503 (806) +++-|..|.|||-.|+..+.. .-+++ .|.+-+|-+.+.|.+++ T Consensus 3 iv~vGd~~vGKTsli~r~~~~----~f~~~y~~Ti~~~~~k~i~~~~~~~~l~iwD 54 (164) T smart00173 3 LVVLGSGGVGKSALTIQFVQG----HFVDDYDPTIEDSYRKQIEIDGEVCLLDILD 54 (164) T ss_pred EEEECCCCCCHHHHHHHHHHC----CCCCCCCCCEEEEEEEEEEECCEEEEEEEEE T ss_conf 999999997899999999729----7998778813678999999999999999998 No 481 >CHL00174 accD acetyl-CoA carboxylase beta subunit; Reviewed Probab=53.41 E-value=15 Score=16.23 Aligned_cols=84 Identities=15% Similarity=0.285 Sum_probs=49.4 Q ss_pred CHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHHH Q ss_conf 68999999998874478667754467765454322233223234698687763446888873210058899999986641 Q gi|254780799|r 544 NIDGFNLKVAQYHNTGKKFNRTVQTGFDRKTGEAIYETEHFDFQHMPYIVVVIDEMADLMMVARKDIESAVQRLAQMARA 623 (806) Q Consensus 544 ni~~yN~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lp~ivviiDElaDlmm~~~~~ve~~i~rlaq~ara 623 (806) +-+.|-+|+++++++-. +.+.+.+|.- ...-.|-+|.+.| |+=+|=.-|.-|-+-|+|.++.|.. T Consensus 112 D~K~Y~dRL~~arkkTg-~~dAvv~g~G-------------~I~g~~vvv~vmD-F~FmGGSMGsvvGEki~ra~e~A~~ 176 (305) T CHL00174 112 EEEPYKDRIDSYQKKTG-LTDAVQTGIG-------------QLNGIPVALGVMD-FQFMGGSMGSVVGEKITRLIEYATN 176 (305) T ss_pred CCCCHHHHHHHHHHHHC-CCEEEEEEEE-------------EECCEEEEEEEEE-CCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 54337899999999719-9807999999-------------9999779999972-4011565207889999999999997 Q ss_pred CCEEEEEEECCCCCCCCCH Q ss_conf 4237999965777535543 Q gi|254780799|r 624 SGIHVIMATQRPSVDVITG 642 (806) Q Consensus 624 ~GiHli~aTqrPsvdvitg 642 (806) -..-||+-|..=-...=.| T Consensus 177 ~k~PlIi~saSGGARMQEG 195 (305) T CHL00174 177 ESLPLIIVCASGGARMQEG 195 (305) T ss_pred CCCCEEEEECCCCCCCCCC T ss_conf 3997899907876343462 No 482 >cd01371 KISc_KIF3 Kinesin motor domain, kinesins II or KIF3_like proteins. Subgroup of kinesins, which form heterotrimers composed of 2 kinesins and one non-motor accessory subunit. Kinesins II play important roles in ciliary transport, and have been implicated in neuronal transport, melanosome transport, the secretory pathway, and mitosis. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Kinesins are microtubule-dependent molecular motors that play important roles in intracellular transport and in cell division. In this group the motor domain is found at the N-terminus (N-type). N-type kinesins are (+) end-directed motors, i.e. they transport cargo towards the (+) end of the microtubule. Kinesin motor domains hydrolyze ATP at a rate of about 80 per second, and move along the microtubule at a speed of about 6400 Angstroms per second. To achieve that, kinesin head groups work in pairs. Upon replacing ADP with ATP, a kinesin motor domain Probab=53.31 E-value=5.3 Score=19.63 Aligned_cols=18 Identities=28% Similarity=0.612 Sum_probs=14.0 Q ss_pred EEEEEECCCCHHHHHHHH Q ss_conf 235530477406799999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINT 473 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~ 473 (806) -++.-|.|||||+=-+.. T Consensus 84 ti~aYGqTGSGKTyTm~G 101 (333) T cd01371 84 TIFAYGQTGTGKTFTMEG 101 (333) T ss_pred EEEEECCCCCCCEEECCC T ss_conf 899743689987177468 No 483 >TIGR01884 cas_HTH CRISPR locus-related DNA-binding protein; InterPro: IPR010163 Most but not all examples of this family are associated with CRISPR loci, a combination of DNA repeats and characteristic proteins encoded near the repeat cluster. The C-terminal region of this protein is homologous to DNA-binding helix-turn-helix domains with predicted transcriptional regulatory activity.. Probab=53.28 E-value=15 Score=16.21 Aligned_cols=57 Identities=12% Similarity=0.317 Sum_probs=50.0 Q ss_pred HHHHHHHHHC-CCEEEEHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHCCCCCEEEECH Q ss_conf 9999999965-98500014222001177899999999977986802278872673171 Q gi|254780799|r 744 KQAVDIVLRD-NKASISYIQRRLGIGYNRAASIIENMEEKGVIGPASSTGKREILISS 800 (806) Q Consensus 744 ~~a~~~v~~~-~~~s~s~lqr~~~igy~raar~~~~~e~~giv~~~~g~~~r~vl~~~ 800 (806) -+.++.+..+ |+.|++-|=|+|.+-=.-.-|-+.+||..|+|..-.++||+.+-+++ T Consensus 172 ~~vL~~L~~~~Ge~~v~~iA~~lg~s~~T~~~~~~~Lek~gLv~~~~~~k~~~~~~t~ 229 (231) T TIGR01884 172 LKVLEALKAEKGEKSVKNIAKKLGKSLSTISRHLAELEKKGLVESKGRGKGKRYSVTK 229 (231) T ss_pred HHHHHHHHCCCCEECHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCEECC T ss_conf 9999998507880528779988578887999999988744754563046787200013 No 484 >cd01367 KISc_KIF2_like Kinesin motor domain, KIF2-like group. KIF2 is a protein expressed in neurons, which has been associated with axonal transport and neuron development; alternative splice forms have been implicated in lysosomal translocation. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Kinesins are microtubule-dependent molecular motors that play important roles in intracellular transport and in cell division. In this subgroup the motor domain is found in the middle (M-type) of the protein chain. M-type kinesins are (+) end-directed motors, i.e. they transport cargo towards the (+) end of the microtubule. Kinesin motor domains hydrolyze ATP at a rate of about 80 per second, and move along the microtubule at a speed of about 6400 Angstroms per second (KIF2 may be slower). To achieve that, kinesin head groups work in pairs. Upon replacing ADP with ATP, a kinesin motor domain increases its affinity for microtubule binding and lo Probab=53.22 E-value=5.6 Score=19.47 Aligned_cols=16 Identities=38% Similarity=0.592 Sum_probs=12.0 Q ss_pred EEEEEECCCCHHHHHH Q ss_conf 2355304774067999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAI 471 (806) Q Consensus 456 HLLIAGtTGSGKSV~i 471 (806) -++.-|.|||||+=-+ T Consensus 87 ti~aYGqTGSGKTyTm 102 (322) T cd01367 87 TCFAYGQTGSGKTYTM 102 (322) T ss_pred EEEEECCCCCCCCEEE T ss_conf 8999426888872684 No 485 >pfam11737 DUF3300 Protein of unknown function (DUF3300). This hypothetical bacterial gene product has a long hydrophobic segment and is thus likely to be a membrane protein. Probab=53.16 E-value=14 Score=16.60 Aligned_cols=29 Identities=34% Similarity=0.546 Sum_probs=23.4 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHHCCE Q ss_conf 46888873210058899999986641423 Q gi|254780799|r 598 EMADLMMVARKDIESAVQRLAQMARASGI 626 (806) Q Consensus 598 ElaDlmm~~~~~ve~~i~rlaq~ara~Gi 626 (806) .|.|=-..-.+||=++||||-|+|.++|. T Consensus 87 ~LG~Afl~q~~dVm~aIQ~LR~~A~~~G~ 115 (230) T pfam11737 87 KLGDAFLAQEADVMDAIQRLRQKAQAAGN 115 (230) T ss_pred HHHHHHHHCHHHHHHHHHHHHHHHHHCCC T ss_conf 99999985999999999999999997399 No 486 >TIGR02868 CydC ABC transporter, CydDC cysteine exporter (CydDC-E) family, permease/ATP-binding protein CydC; InterPro: IPR014223 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This entry represents CydC, a member of a heterodimeric ATP-binding cassette-type transporter (ABC transporter). It is involved in the export of glutathione from the cytoplasm to the periplasm and is required for the assembly of both cytochrome c and cytochrome bd , , .; GO: 0042626 ATPase activity coupled to transmembrane movement of substances, 0006810 transport, 0016021 integral to membrane. Probab=53.05 E-value=11 Score=17.39 Aligned_cols=21 Identities=14% Similarity=0.294 Sum_probs=8.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999998 Q gi|254780799|r 25 KMKIVAGLILLCTVFAITLAL 45 (806) Q Consensus 25 rl~Ei~Gl~Li~~al~l~iSL 45 (806) +.+...++++-.+++...++| T Consensus 12 ~~rl~~a~~lG~L~~~savaL 32 (566) T TIGR02868 12 RRRLALAVLLGVLALGSAVAL 32 (566) T ss_pred CCHHHHHHHHHHHHHHHHHHH T ss_conf 631048999999999999999 No 487 >KOG0245 consensus Probab=52.78 E-value=4.3 Score=20.32 Aligned_cols=23 Identities=17% Similarity=0.071 Sum_probs=14.5 Q ss_pred CCHHHCCCCCHHHHCCCCCEEEE Q ss_conf 42123388645786588754773 Q gi|254780799|r 661 DSRTILGEQGAEQLLGQGDMLYM 683 (806) Q Consensus 661 dSrtild~~gae~Llg~gdml~~ 683 (806) +.+.++|.+-+++.|++-||=|. T Consensus 875 ~~~~~~~~~~~~~fl~~~~~~~s 897 (1221) T KOG0245 875 IAMEDLDSNCQERFLHRKDEALS 897 (1221) T ss_pred CHHHHHHCCCCHHHCCCCCCCCC T ss_conf 32544312350210033333534 No 488 >COG1735 Php Predicted metal-dependent hydrolase with the TIM-barrel fold [General function prediction only] Probab=52.75 E-value=8.6 Score=18.08 Aligned_cols=38 Identities=29% Similarity=0.495 Sum_probs=19.7 Q ss_pred EHHHHHHHHHHCCCH--------HHHHHHHHHHHHHHCCEEEEEEE Q ss_conf 634468888732100--------58899999986641423799996 Q gi|254780799|r 595 VIDEMADLMMVARKD--------IESAVQRLAQMARASGIHVIMAT 632 (806) Q Consensus 595 iiDElaDlmm~~~~~--------ve~~i~rlaq~ara~GiHli~aT 632 (806) ++-|+.++|---|+- +---+.++++.|+|.|+|.|.|| T Consensus 50 ~~~e~~~~~a~Gg~TIVD~T~~~~GRdv~~m~~vs~atglnIV~~T 95 (316) T COG1735 50 AIAELKRLMARGGQTIVDATNIGIGRDVLKMRRVAEATGLNIVAAT 95 (316) T ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHHHHHHHCCCEEEEC T ss_conf 9999999997489768607764547488999999997198589704 No 489 >cd01376 KISc_KID_like Kinesin motor domain, KIF22/Kid-like subgroup. Members of this group might play a role in regulating chromosomal movement along microtubules in mitosis. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Kinesins are microtubule-dependent molecular motors that play important roles in intracellular transport and in cell division. In most kinesins, the motor domain is found at the N-terminus (N-type). N-type kinesins are (+) end-directed motors, i.e. they transport cargo towards the (+) end of the microtubule. Kinesin motor domains hydrolyze ATP at a rate of about 80 per second, and move along the microtubule at a speed of about 6400 Angstroms per second. To achieve that, kinesin head groups work in pairs. Upon replacing ADP with ATP, a kinesin motor domain increases its affinity for microtubule binding and locks in place. Also, the neck linker binds to the motor domain, which repositions the other head domain through Probab=52.74 E-value=5.8 Score=19.38 Aligned_cols=51 Identities=20% Similarity=0.240 Sum_probs=27.4 Q ss_pred HHCCCCCHHHHCCCCCEEEECCCCCEEEEEECCC--CHHHHHHHHHHHHHC-CCCCCCCCC Q ss_conf 2338864578658875477368983258883348--988999999999712-897421100 Q gi|254780799|r 664 TILGEQGAEQLLGQGDMLYMTGGGRVQRIHGPFV--SDIEVEKVVSHLKTQ-GEAKYIDIK 721 (806) Q Consensus 664 tild~~gae~Llg~gdml~~~~~~~~~r~~g~~v--~~~ev~~v~~~~~~q-~~~~y~~~~ 721 (806) .+.|-.|.|++--.| ......-.+.+| |=--..+|+.-++.. ...-|.+.. T Consensus 224 ~lVDLAGSEr~~~~~-------~~g~~l~E~~~IN~SL~~L~~vi~aL~~~~~~iPyR~Sk 277 (319) T cd01376 224 NLIDLAGSEDNRRTG-------NEGIRLKESAAINSSLFVLSKVVDALNKGLPRIPYRESK 277 (319) T ss_pred EEEECCCCCCCCCCC-------CCCCEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCCCH T ss_conf 986445554444347-------887301103212725778999999986699978874578 No 490 >PRK07471 DNA polymerase III subunit delta'; Validated Probab=52.62 E-value=13 Score=16.76 Aligned_cols=35 Identities=34% Similarity=0.422 Sum_probs=0.0 Q ss_pred CCCEE-EEEECCCCHHHHHHHHHHHHHHHHCCHHHE Q ss_conf 02023-553047740679999999999982995784 Q gi|254780799|r 453 RMPHL-LIAGTTGSGKSVAINTMILSLLYRMTPAQC 487 (806) Q Consensus 453 kMPHL-LIAGtTGSGKSV~iN~iI~SlLyk~~P~ev 487 (806) +|||- |..|..|.||.-.--.+.-.||-...++.+ T Consensus 37 rl~HA~Lf~Gp~GiGK~tlA~~~A~~ll~~~~~~~~ 72 (363) T PRK07471 37 RLHHAWLIGGPQGIGKATLAYRMARFLLATPPPGGD 72 (363) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCC T ss_conf 976458767999818899999999998579997777 No 491 >TIGR01162 purE phosphoribosylaminoimidazole carboxylase, catalytic subunit; InterPro: IPR000031 Phosphoribosylaminoimidazole carboxylase is a fusion protein in plants and fungi, but consists of two non-interacting proteins in bacteria, PurK and PurE. PurK, N5-carboxyaminoimidazole ribonucleotide (N5_CAIR) synthetase, catalyzes the conversion of 5-aminoimidazole ribonucleotide (AIR), ATP, and bicarbonate to N5-CAIR, ADP, and Pi. PurE converts N5-CAIR to CAIR, the sixth step of de novo purine biosynthesis. In the presence of high concentrations of bicarbonate, PurE is reported able to convert AIR to CAIR directly and without ATP. Some members of this family contain two copies of this domain . The crystal structure of PurE indicates a unique quaternary structure that confirms the octameric nature of the enzyme .; GO: 0004638 phosphoribosylaminoimidazole carboxylase activity, 0006189 'de novo' IMP biosynthetic process, 0009320 phosphoribosylaminoimidazole carboxylase complex. Probab=52.51 E-value=14 Score=16.56 Aligned_cols=26 Identities=23% Similarity=0.457 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHCCC--CEEEEEEEECC Q ss_conf 578999999742486--32899841044 Q gi|254780799|r 334 NNACTLKSVLSDFGI--QGEIVNVRPGP 359 (806) Q Consensus 334 ~nA~lLE~tL~dFGV--e~~Vv~v~pGP 359 (806) ...+..-++|++||| |.+|+++|..| T Consensus 12 ~~m~~a~~~L~~fgi~~e~~V~SAHRTP 39 (159) T TIGR01162 12 ETMKKAAEILEEFGIPYELRVVSAHRTP 39 (159) T ss_pred HHHHHHHHHHHHCCCCEEEEEECCCCCH T ss_conf 8999999999855996678986067780 No 492 >pfam04123 DUF373 Domain of unknown function (DUF373). Archaeal domain of unknown function. Predicted to be an integral membrane protein with six transmembrane regions. Probab=52.49 E-value=16 Score=16.12 Aligned_cols=133 Identities=13% Similarity=0.079 Sum_probs=0.0 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH---------CCCHHHH-------HHHH Q ss_conf 689631368864500013467899999999-9999999999999999999982---------7652367-------8999 Q gi|254780799|r 51 YDPSFSYITLRSPKNFLGYGGAIFADVAIQ-FFGIASVFFLPPPTMWALSLLF---------DKKIYCF-------SKRA 113 (806) Q Consensus 51 ~DPs~~~~s~~~v~N~~G~lGA~iAd~L~~-lFG~~Ayllpl~Ll~~g~~ll~---------~k~~~~~-------~~Rl 113 (806) +||-|...- +|+.|..+--+-+. ++|++++.+..+.++.|..++. .+-.... ...+ T Consensus 154 ~dp~~~~~~-------l~~pG~~lli~~l~~~~~~~~~~~~~i~~~lG~y~l~kG~gid~~~~~~~~~~r~~l~~g~is~ 226 (344) T pfam04123 154 SDPEYRRTF-------LGVPGLLLLIYGLLALANLPAYALGVILLILGLYLLYRGFGLDDFLRELLKRVRESLYEGRITF 226 (344) T ss_pred CCHHHHEEE-------EHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCEEEE T ss_conf 382041014-------3486799999999999577087999999999999999973799999999999999886473771 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH--------------HHHHHHHHH Q ss_conf 999999999999999850444322333432488999999999986468999999999--------------999999999 Q gi|254780799|r 114 TAWLINILVSATFFASFSPSQSWPIQNGFGGIIGDLIIRLPFLFFESYPRKLGILFF--------------QMILFLAMS 179 (806) Q Consensus 114 ~~~il~ll~~s~ll~~~~~~~~~~l~~g~GGiiG~~l~~~l~~~lg~~g~~l~lli~--------------~iil~l~is 179 (806) +.+++.+++.......-............+++.+.++......+....+....-.++ ...+++... T Consensus 227 it~vva~~l~iigi~~g~~~~~~~~~~~~~~~~~~fl~~~v~wl~~a~l~~~~G~~id~~l~~~~~~~~~i~~~f~~~a~ 306 (344) T pfam04123 227 ITYVVSIILILIGIIYGYLSLEKLSAISAGGVAATFLNGAVPWLALAALIAFLGKLIDEYLNGRFRLWRYINAPFFVLAI 306 (344) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH T ss_conf 79999999999999999999987325631499999999989999999999999999999974895459999999999999 Q ss_pred HHHHHHHHHHH Q ss_conf 99999998886 Q gi|254780799|r 180 WLLIYSSSAIF 190 (806) Q Consensus 180 ~l~i~~~~~~~ 190 (806) |+.++.....+ T Consensus 307 ~~vv~~~~~y~ 317 (344) T pfam04123 307 GLVVYGLSGYF 317 (344) T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 493 >cd03276 ABC_SMC6_euk Eukaryotic SMC6 proteins; SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (for Probab=52.48 E-value=12 Score=17.00 Aligned_cols=21 Identities=38% Similarity=0.748 Sum_probs=0.0 Q ss_pred EEEECCCCHHHHHHHHHHHHH Q ss_conf 553047740679999999999 Q gi|254780799|r 458 LIAGTTGSGKSVAINTMILSL 478 (806) Q Consensus 458 LIAGtTGSGKSV~iN~iI~Sl 478 (806) +|-|-.|||||--+.+|.+-| T Consensus 25 ~IiG~NGsGKSsIl~AI~lgL 45 (198) T cd03276 25 FIVGNNGSGKSAILTALTIGL 45 (198) T ss_pred EEECCCCCCHHHHHHHHHHHC T ss_conf 998899998899999999863 No 494 >PRK10895 putative ABC transporter ATP-binding protein YhbG; Provisional Probab=52.45 E-value=9.7 Score=17.69 Aligned_cols=16 Identities=25% Similarity=0.605 Sum_probs=0.0 Q ss_pred EEECCCCHHHHHHHHH Q ss_conf 5304774067999999 Q gi|254780799|r 459 IAGTTGSGKSVAINTM 474 (806) Q Consensus 459 IAGtTGSGKSV~iN~i 474 (806) +.|..|||||..+|+| T Consensus 34 liGpNGaGKSTLl~~i 49 (241) T PRK10895 34 LLGPNGAGKTTTFYMV 49 (241) T ss_pred EECCCCCCHHHHHHHH T ss_conf 9889998699999999 No 495 >COG2717 Predicted membrane protein [Function unknown] Probab=52.45 E-value=16 Score=16.12 Aligned_cols=91 Identities=13% Similarity=0.152 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 99999999999999999999844270689631368864500013467899999999999999999999999999998276 Q gi|254780799|r 25 KMKIVAGLILLCTVFAITLALGTWDVYDPSFSYITLRSPKNFLGYGGAIFADVAIQFFGIASVFFLPPPTMWALSLLFDK 104 (806) Q Consensus 25 rl~Ei~Gl~Li~~al~l~iSL~SYsp~DPs~~~~s~~~v~N~~G~lGA~iAd~L~~lFG~~Ayllpl~Ll~~g~~ll~~k 104 (806) |.|...|+ ...++.++=+.+|-..|-+++ .+..+.-+..--+-..|..||++.+.+++.+++..+.+ T Consensus 75 ~~Rr~LGl---~af~~~~lH~~~Y~~~~l~~~----------~~~~~~d~~~rpyitiG~iaflll~pLalTS~k~~~rr 141 (209) T COG2717 75 RIRRALGL---WAFFYALLHFTAYLVLDLGLD----------LALLGLDLLKRPYITIGMIAFLLLIPLALTSFKWVRRR 141 (209) T ss_pred HHHHHHHH---HHHHHHHHHHHHHHHHHHHCC----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999---999999999999999998124----------98756778870999999999999999999713999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 52367899999999999999999985 Q gi|254780799|r 105 KIYCFSKRATAWLINILVSATFFASF 130 (806) Q Consensus 105 ~~~~~~~Rl~~~il~ll~~s~ll~~~ 130 (806) -- ..|+.+--+..++.....++.+ T Consensus 142 lG--~rW~~LHrLvYl~~~L~~lH~~ 165 (209) T COG2717 142 LG--KRWKKLHRLVYLALILGALHYL 165 (209) T ss_pred HH--HHHHHHHHHHHHHHHHHHHHHH T ss_conf 87--7999999999999999999998 No 496 >PRK05313 hypothetical protein; Provisional Probab=52.44 E-value=16 Score=16.12 Aligned_cols=34 Identities=21% Similarity=0.482 Sum_probs=0.0 Q ss_pred CC-CCHHHHHH-HHHHHHHHH---HHCCCCCCCCCCCCE Q ss_conf 78-63999997-889999886---300121000388612 Q gi|254780799|r 370 PG-IKSSRIIG-LSDDIARSM---SAISARVAVIPRRNA 403 (806) Q Consensus 370 pG-VKvSKI~n-LadDIA~aL---sa~svRIapIPGK~~ 403 (806) || +-.++|.+ ++|..|..| +...||+-|+|||.. T Consensus 370 PGdt~aetisaiIaDeaAIG~in~KttaVRlIPvpGkk~ 408 (450) T PRK05313 370 PGDTPAETISAIIADEAAIGVINNKTTAVRIIPVPGKKV 408 (450) T ss_pred CCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCC T ss_conf 999989999999999999886617851399986799998 No 497 >PRK09700 D-allose transporter ATP-binding protein; Provisional Probab=52.39 E-value=11 Score=17.16 Aligned_cols=18 Identities=28% Similarity=0.558 Sum_probs=0.0 Q ss_pred EEEEECCCCHHHHHHHHH Q ss_conf 355304774067999999 Q gi|254780799|r 457 LLIAGTTGSGKSVAINTM 474 (806) Q Consensus 457 LLIAGtTGSGKSV~iN~i 474 (806) +.+.|..|||||..++.+ T Consensus 34 ~~liG~nGaGKSTL~~~l 51 (510) T PRK09700 34 HALLGENGAGKSTLMKVL 51 (510) T ss_pred EEEECCCCCHHHHHHHHH T ss_conf 999899997299999998 No 498 >PRK09862 putative ATP-dependent protease; Provisional Probab=52.30 E-value=10 Score=17.45 Aligned_cols=19 Identities=32% Similarity=0.705 Sum_probs=0.0 Q ss_pred EEEEEECCCCHHHHHHHHH Q ss_conf 2355304774067999999 Q gi|254780799|r 456 HLLIAGTTGSGKSVAINTM 474 (806) Q Consensus 456 HLLIAGtTGSGKSV~iN~i 474 (806) |||.-|.-|||||..-..+ T Consensus 212 nlLl~GpPG~GKTMlA~rl 230 (506) T PRK09862 212 NLLLIGPPGTGKTMLASRI 230 (506) T ss_pred CEEEECCCCCCHHHHHHHH T ss_conf 6598769994598999775 No 499 >PRK13547 hmuV hemin importer ATP-binding subunit; Provisional Probab=52.24 E-value=13 Score=16.67 Aligned_cols=43 Identities=14% Similarity=0.160 Sum_probs=0.0 Q ss_pred CCCHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHH Q ss_conf 1000210456666667854100202355304774067999999 Q gi|254780799|r 432 CDLAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTM 474 (806) Q Consensus 432 ~~L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~i 474 (806) ..|.+..|....=+-+-.++.+==-+-|-|..|||||--+|+| T Consensus 5 ~~L~~~~~~~~vL~~vsl~i~~Ge~~aliG~nGaGKSTLl~~i 47 (273) T PRK13547 5 DHLHVARRHRAILRDLSLRIEPGRVTALLGRNGAGKSTLLKVL 47 (273) T ss_pred EEEEEEECCEEEEECEEEEECCCCEEEEECCCCCHHHHHHHHH T ss_conf 6489999999999760889989989999999997699999999 No 500 >PRK11124 artP arginine transporter ATP-binding subunit; Provisional Probab=52.10 E-value=14 Score=16.39 Aligned_cols=41 Identities=22% Similarity=0.293 Sum_probs=0.0 Q ss_pred CHHHCCCCCCCCEEEEECCCCCEEEEEECCCCHHHHHHHHH Q ss_conf 00210456666667854100202355304774067999999 Q gi|254780799|r 434 LAINLGKSIEGKPIIADLARMPHLLIAGTTGSGKSVAINTM 474 (806) Q Consensus 434 L~iaLGKdI~G~pvv~DLakMPHLLIAGtTGSGKSV~iN~i 474 (806) |+..-|+.-.=+-+-.++.+==.+-|.|..|||||--++.| T Consensus 8 ls~~yg~~~aL~dvsl~i~~Ge~~~iiG~nGaGKSTLl~~l 48 (242) T PRK11124 8 INCFYGAHQALFDITLDCPQGETLVLLGPSGAGKSSLLRVL 48 (242) T ss_pred EEEEECCEEEEEEEEEEECCCCEEEEECCCCCCHHHHHHHH T ss_conf 99998999989420778879989999999997199999999 Done!