Query gi|254780671|ref|YP_003065084.1| putative cell division protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 105 No_of_seqs 100 out of 686 Neff 7.1 Searched_HMMs 39220 Date Sun May 29 23:49:52 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780671.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK00888 ftsB cell division pr 99.8 4.1E-18 1E-22 131.7 12.5 87 15-104 3-89 (105) 2 pfam04977 DivIC Septum formati 99.7 8.1E-17 2.1E-21 123.8 9.8 79 23-104 1-79 (80) 3 COG2919 Septum formation initi 99.7 3.8E-16 9.7E-21 119.7 12.8 101 1-104 12-112 (117) 4 TIGR02209 ftsL_broad cell divi 97.9 0.00026 6.5E-09 47.4 10.8 60 39-102 24-83 (85) 5 pfam04999 FtsL Cell division p 97.1 0.01 2.6E-07 37.6 9.9 56 39-98 35-90 (97) 6 PRK10772 cell division protein 95.9 0.16 4.2E-06 30.2 11.6 59 39-101 58-119 (121) 7 COG3116 FtsL Cell division pro 95.5 0.24 6.1E-06 29.2 10.5 53 40-96 44-96 (105) 8 COG4839 FtsL Protein required 94.2 0.51 1.3E-05 27.2 11.5 58 39-101 60-117 (120) 9 TIGR02894 DNA_bind_RsfA transc 91.2 1.4 3.6E-05 24.5 8.3 53 20-75 83-135 (163) 10 pfam05137 PilN Fimbrial assemb 89.1 2.1 5.3E-05 23.5 10.3 82 6-88 13-105 (162) 11 pfam07407 Seadorna_VP6 Seadorn 88.7 0.77 2E-05 26.1 4.3 41 46-86 32-72 (420) 12 pfam11853 DUF3373 Protein of u 87.3 1.7 4.3E-05 24.1 5.3 34 40-73 25-58 (485) 13 pfam08961 DUF1875 Domain of un 86.8 2.1 5.4E-05 23.4 5.6 57 39-97 129-191 (243) 14 pfam06005 DUF904 Protein of un 85.2 3.5 8.9E-05 22.1 6.3 53 39-94 18-70 (72) 15 PRK11677 cytochrome d ubiquino 83.2 4.2 0.00011 21.6 9.0 51 20-72 12-62 (134) 16 pfam03245 Phage_lysis Bacterio 82.0 4.7 0.00012 21.3 9.4 49 38-88 25-73 (151) 17 pfam11382 DUF3186 Protein of u 81.3 5 0.00013 21.2 11.6 57 41-103 34-90 (307) 18 pfam12097 DUF3573 Protein of u 79.9 5.5 0.00014 20.9 5.9 31 44-74 33-63 (375) 19 PRK09039 hypothetical protein; 77.8 6.4 0.00016 20.5 7.8 35 40-74 54-88 (343) 20 pfam08936 CsoSCA Carboxysome S 76.7 6.9 0.00018 20.3 5.2 53 43-98 7-62 (459) 21 pfam06305 DUF1049 Protein of u 76.2 7.1 0.00018 20.2 6.0 29 37-65 51-79 (80) 22 pfam02183 HALZ Homeobox associ 73.7 8.2 0.00021 19.8 4.9 35 40-74 6-40 (45) 23 pfam07423 DUF1510 Protein of u 73.5 3.8 9.8E-05 21.9 3.2 32 4-36 8-39 (214) 24 KOG4797 consensus 72.1 9 0.00023 19.6 6.3 45 41-86 62-106 (123) 25 pfam08537 NBP1 Fungal Nap bind 70.6 9.7 0.00025 19.4 5.9 42 41-85 175-223 (313) 26 KOG4571 consensus 69.2 10 0.00027 19.2 5.8 47 40-89 241-288 (294) 27 TIGR03007 pepcterm_ChnLen poly 68.9 10 0.00026 19.2 4.5 36 40-75 323-358 (510) 28 pfam01166 TSC22 TSC-22/dip/bun 67.6 11 0.00029 19.0 6.4 43 42-85 10-52 (57) 29 pfam07106 TBPIP Tat binding pr 66.5 12 0.0003 18.9 7.0 50 38-87 71-120 (169) 30 pfam06295 DUF1043 Protein of u 61.3 15 0.00038 18.3 8.6 49 22-72 10-58 (128) 31 pfam04111 APG6 Autophagy prote 60.3 15 0.00039 18.2 5.0 45 40-84 71-115 (356) 32 COG4942 Membrane-bound metallo 59.7 16 0.0004 18.1 7.1 36 39-74 38-73 (420) 33 pfam09889 DUF2116 Uncharacteri 58.9 15 0.00038 18.3 3.8 29 2-30 28-56 (59) 34 pfam10883 DUF2681 Protein of u 57.5 17 0.00044 17.9 8.2 48 17-71 6-53 (78) 35 KOG4603 consensus 57.4 17 0.00044 17.9 5.3 48 39-86 79-126 (201) 36 pfam10211 Ax_dynein_light Axon 54.8 19 0.00049 17.6 5.8 47 37-86 111-157 (189) 37 pfam07716 bZIP_2 Basic region 54.4 19 0.00049 17.6 5.3 34 41-74 20-53 (54) 38 PRK13922 rod shape-determining 52.1 21 0.00054 17.3 10.0 47 44-94 60-106 (264) 39 pfam05377 FlaC_arch Flagella a 52.1 18 0.00047 17.7 3.4 45 43-87 4-49 (55) 40 KOG0483 consensus 50.9 22 0.00056 17.2 4.3 49 27-75 92-141 (198) 41 TIGR03064 sortase_srtB sortase 50.4 13 0.00034 18.6 2.4 39 13-54 3-41 (232) 42 pfam00170 bZIP_1 bZIP transcri 50.2 23 0.00058 17.2 5.2 34 41-74 16-49 (59) 43 PRK03992 proteasome-activating 50.2 23 0.00058 17.2 4.9 33 41-73 11-43 (390) 44 pfam08172 CASP_C CASP C termin 50.0 23 0.00058 17.1 6.2 50 41-93 88-137 (245) 45 pfam03518 consensus 49.5 23 0.00059 17.1 5.8 45 41-87 9-53 (349) 46 KOG4196 consensus 49.5 23 0.00059 17.1 5.2 37 38-74 73-109 (135) 47 pfam11119 DUF2633 Protein of u 49.5 18 0.00047 17.7 3.0 29 4-32 2-30 (59) 48 TIGR02976 phageshock_pspB phag 48.3 24 0.00062 17.0 5.0 25 50-74 39-63 (75) 49 KOG3705 consensus 47.6 25 0.00063 16.9 7.0 81 13-95 13-95 (580) 50 pfam03980 Nnf1 Nnf1. NNF1 is a 47.0 25 0.00065 16.8 5.1 34 41-74 74-107 (108) 51 KOG0642 consensus 46.4 26 0.00066 16.8 3.5 55 42-96 37-91 (577) 52 KOG3335 consensus 46.4 22 0.00056 17.2 3.0 27 48-74 101-127 (181) 53 COG3074 Uncharacterized protei 45.7 27 0.00068 16.7 5.3 49 41-92 27-75 (79) 54 KOG0999 consensus 44.8 27 0.0007 16.6 7.2 53 38-90 106-158 (772) 55 KOG0288 consensus 44.5 28 0.00071 16.6 6.1 46 41-88 29-74 (459) 56 pfam03962 Mnd1 Mnd1 family. Th 44.1 28 0.00072 16.6 6.0 36 50-87 107-142 (188) 57 KOG0994 consensus 43.6 29 0.00073 16.5 7.4 50 47-96 1455-1508(1758) 58 pfam09851 DUF2078 Predicted me 43.5 18 0.00045 17.8 2.2 61 12-72 11-71 (73) 59 COG3879 Uncharacterized protei 43.2 29 0.00074 16.5 10.0 65 29-93 39-108 (247) 60 KOG1103 consensus 42.9 29 0.00075 16.5 5.6 62 35-99 234-298 (561) 61 TIGR01242 26Sp45 26S proteasom 42.6 30 0.00076 16.4 5.3 37 39-75 6-42 (364) 62 smart00338 BRLZ basic region l 42.0 30 0.00077 16.4 4.4 34 41-74 21-54 (65) 63 COG3883 Uncharacterized protei 41.0 31 0.0008 16.3 9.0 10 80-89 101-110 (265) 64 PRK11415 hypothetical protein; 40.9 32 0.0008 16.3 5.6 54 39-92 3-58 (74) 65 pfam06103 DUF948 Bacterial pro 40.9 32 0.0008 16.3 9.5 58 15-74 4-61 (90) 66 pfam12269 zf-CpG_bind_C CpG bi 38.1 35 0.00089 16.0 5.5 49 32-80 22-70 (236) 67 pfam07926 TPR_MLP1_2 TPR/MLP1/ 37.7 35 0.0009 16.0 4.9 29 44-72 103-131 (132) 68 PRK13169 DNA replication intia 37.0 36 0.00092 15.9 5.5 35 39-73 22-56 (107) 69 TIGR01000 bacteriocin_acc bact 37.0 30 0.00078 16.4 2.6 30 46-75 98-127 (476) 70 pfam06156 DUF972 Protein of un 36.6 37 0.00094 15.9 5.0 33 40-72 23-55 (106) 71 PRK03784 vtamin B12-transporte 36.3 37 0.00095 15.8 4.2 36 3-38 4-39 (330) 72 cd07429 Cby_like Chibby, a nuc 36.1 37 0.00096 15.8 5.0 34 46-85 72-105 (108) 73 PTZ00234 variable surface prot 35.6 6.3 0.00016 20.5 -1.1 63 6-69 358-429 (433) 74 pfam05103 DivIVA DivIVA protei 35.4 38 0.00098 15.7 4.2 35 40-74 19-53 (131) 75 pfam06447 consensus 35.4 39 0.00098 15.7 5.2 22 51-72 88-109 (407) 76 TIGR00393 kpsF sugar isomerase 34.6 24 0.0006 17.0 1.7 21 84-104 36-56 (272) 77 pfam11855 DUF3375 Protein of u 34.5 40 0.001 15.7 4.2 42 51-92 139-183 (471) 78 pfam06936 Selenoprotein_S Sele 33.9 41 0.001 15.6 5.8 18 16-33 38-55 (190) 79 KOG2895 consensus 32.8 34 0.00087 16.1 2.2 43 21-68 346-388 (408) 80 pfam05833 FbpA Fibronectin-bin 32.7 43 0.0011 15.5 8.1 45 40-85 380-424 (447) 81 pfam07047 OPA3 Optic atrophy 3 31.3 45 0.0011 15.3 8.2 27 47-73 106-132 (134) 82 COG5278 Predicted periplasmic 31.0 45 0.0012 15.3 8.1 23 61-83 85-108 (207) 83 TIGR01367 pyrE_Therm orotate p 30.3 27 0.00069 16.7 1.4 13 92-104 107-119 (205) 84 PRK10998 malG maltose transpor 30.2 47 0.0012 15.2 3.7 28 1-28 1-28 (296) 85 PRK12705 hypothetical protein; 29.6 48 0.0012 15.2 9.8 14 49-62 49-62 (485) 86 KOG4643 consensus 29.1 49 0.0013 15.1 5.0 34 41-74 525-558 (1195) 87 pfam03131 bZIP_Maf bZIP Maf tr 28.3 51 0.0013 15.0 5.5 37 38-74 52-88 (95) 88 PRK09458 pspB phage shock prot 27.3 53 0.0013 14.9 5.4 37 38-74 23-63 (75) 89 COG4499 Predicted membrane pro 27.1 53 0.0014 14.9 6.2 74 6-93 214-290 (434) 90 pfam05384 DegS Sensor protein 27.0 53 0.0014 14.9 5.9 39 45-90 26-64 (159) 91 pfam10224 DUF2205 Predicted co 26.8 54 0.0014 14.8 6.5 40 43-88 27-66 (81) 92 pfam06667 PspB Phage shock pro 26.5 54 0.0014 14.8 4.3 37 38-74 23-63 (75) 93 pfam11285 DUF3086 Protein of u 26.3 55 0.0014 14.8 6.3 37 39-75 4-40 (283) 94 pfam06810 Phage_GP20 Phage min 26.0 56 0.0014 14.8 6.6 35 41-75 29-66 (156) 95 PRK11091 aerobic respiration c 25.4 57 0.0015 14.7 8.4 13 62-74 105-117 (779) 96 pfam05812 Herpes_BLRF2 Herpesv 25.4 57 0.0015 14.7 3.8 27 47-73 4-30 (118) 97 TIGR02351 thiH thiazole biosyn 25.4 46 0.0012 15.3 1.9 40 49-90 22-62 (378) 98 pfam08285 DPM3 Dolichol-phosph 25.2 58 0.0015 14.7 7.2 48 9-56 34-85 (91) 99 pfam04415 DUF515 Protein of un 25.2 58 0.0015 14.7 9.7 67 4-74 26-99 (416) 100 PRK13729 conjugal transfer pil 24.8 59 0.0015 14.6 5.1 30 44-73 81-110 (474) 101 pfam08614 ATG16 Autophagy prot 24.5 59 0.0015 14.6 5.1 31 43-73 106-136 (194) 102 COG3394 Uncharacterized protei 24.3 54 0.0014 14.8 2.1 21 78-98 210-230 (257) 103 KOG3156 consensus 24.0 61 0.0015 14.5 6.0 40 44-88 106-146 (220) 104 PRK10549 signal transduction h 23.8 61 0.0016 14.5 8.1 15 31-45 32-46 (467) 105 PRK03147 thiol-disulfide oxido 23.7 51 0.0013 15.0 1.8 29 6-34 3-31 (176) 106 PRK08475 F0F1 ATP synthase sub 23.3 63 0.0016 14.4 9.9 61 12-74 29-90 (170) 107 PRK04335 cell division protein 23.3 63 0.0016 14.4 2.7 22 12-33 4-25 (319) 108 KOG3584 consensus 23.1 63 0.0016 14.4 3.2 27 47-73 313-339 (348) 109 cd01570 NAPRTase_A Nicotinate 22.8 64 0.0016 14.4 2.9 34 71-104 252-285 (327) 110 pfam10205 KLRAQ Predicted coil 22.7 65 0.0016 14.4 4.5 40 41-83 35-74 (102) 111 KOG3119 consensus 22.5 65 0.0017 14.3 4.7 25 49-73 218-242 (269) 112 TIGR02977 phageshock_pspA phag 22.0 67 0.0017 14.3 6.3 48 38-90 91-138 (222) 113 pfam03100 CcmE CcmE. CcmE is t 21.8 67 0.0017 14.3 2.7 25 8-32 5-29 (130) 114 pfam09340 NuA4 Histone acetylt 21.6 68 0.0017 14.2 4.0 32 47-83 3-34 (80) 115 pfam00952 Bunya_nucleocap Buny 21.5 41 0.001 15.6 1.0 29 69-97 159-187 (228) 116 PRK06835 DNA replication prote 21.2 69 0.0018 14.2 3.1 33 52-84 64-97 (330) 117 pfam08941 USP8_interact USP8 i 21.2 69 0.0018 14.2 5.0 36 40-75 12-47 (179) 118 PRK13726 conjugal transfer pil 21.1 63 0.0016 14.4 1.9 22 71-92 59-80 (187) 119 COG4467 Regulator of replicati 20.8 70 0.0018 14.1 4.5 48 41-88 17-64 (114) 120 TIGR01499 folC FolC bifunction 20.7 53 0.0014 14.9 1.4 24 80-103 161-184 (460) 121 pfam05565 Sipho_Gp157 Siphovir 20.5 72 0.0018 14.1 6.1 43 39-87 47-89 (162) 122 cd01401 PncB_like Nicotinate p 20.2 72 0.0018 14.1 2.9 34 71-104 276-311 (377) 123 PRK05431 seryl-tRNA synthetase 20.2 73 0.0019 14.1 7.7 39 40-78 29-67 (422) 124 pfam04065 Not3 Not1 N-terminal 20.2 73 0.0019 14.0 6.1 43 48-91 165-207 (233) 125 COG4768 Uncharacterized protei 20.1 73 0.0019 14.0 9.0 35 40-74 32-66 (139) No 1 >PRK00888 ftsB cell division protein FtsB; Reviewed Probab=99.78 E-value=4.1e-18 Score=131.73 Aligned_cols=87 Identities=14% Similarity=0.279 Sum_probs=81.2 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCC Q ss_conf 99999999999999999837203999999999999999999999999999999999502799898999999999809878 Q gi|254780671|r 15 IFLVIAFCCVVYFTNHAIVGDYGLKANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARYSLNLSR 94 (105) Q Consensus 15 ~~~~~~~~~~~~f~~~~~~g~~Gl~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~~Lg~~k 94 (105) .+.+++++++++++|.+|||++|+.++..++++++..+++++.++++|+.|..+|..|++ |++++|+.||++|||+| T Consensus 3 ~l~~~L~~ll~~LQY~lwfg~~g~~~~~~L~~~i~~~~~~n~~L~~~N~~L~~EI~~Lk~---~~~aIE~~AR~eLGmIK 79 (105) T PRK00888 3 LLTLVLLALLVWLQYPLWFGKNGWLDYHRLNQQVAAQQQTNAKLKARNDQLFAEIDDLKG---GQEAIEERARNELGMVK 79 (105) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC---CHHHHHHHHHHHCCCCC T ss_conf 699999999999999994048969999999999999999999999999999999999877---87999999999869866 Q ss_pred CCCEEEEECC Q ss_conf 9868767527 Q gi|254780671|r 95 SDEIILFYSD 104 (105) Q Consensus 95 p~E~iii~~d 104 (105) |||++|.+.| T Consensus 80 ~gE~fy~~v~ 89 (105) T PRK00888 80 PGETFYRIVP 89 (105) T ss_pred CCCEEEEECC T ss_conf 9987888579 No 2 >pfam04977 DivIC Septum formation initiator. DivIC from B. subtilis is necessary for both vegetative and sporulation septum formation. These proteins are mainly composed of an amino terminal coiled-coil. Probab=99.71 E-value=8.1e-17 Score=123.79 Aligned_cols=79 Identities=28% Similarity=0.382 Sum_probs=75.5 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCEEEEE Q ss_conf 99999999983720399999999999999999999999999999999950279989899999999980987898687675 Q gi|254780671|r 23 CVVYFTNHAIVGDYGLKANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARYSLNLSRSDEIILFY 102 (105) Q Consensus 23 ~~~~f~~~~~~g~~Gl~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~~Lg~~kp~E~iii~ 102 (105) ++++++|++|+|++|+.++.++++++..++++++.++++|+.|+.+|..|++ |||++|++||++|||++|||++|.+ T Consensus 1 l~~~l~~~~~~g~~g~~~~~~l~~~i~~l~~~~~~l~~~n~~L~~ei~~L~~---d~~~ie~~AR~~lg~~~~gE~v~~v 77 (80) T pfam04977 1 LLLLFQYLLIFGVGGLSAYYQLNQEIAALQAELAKLKAENEELEAEVKDLKS---DPDYIEERARSELGLVKPGETIYRV 77 (80) T ss_pred CCHHHHHHHHEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC---CHHHHHHHHHHHCCCCCCCCEEEEE T ss_conf 9044527773048879999999999999999999999999999999998508---9999999999985983899889971 Q ss_pred CC Q ss_conf 27 Q gi|254780671|r 103 SD 104 (105) Q Consensus 103 ~d 104 (105) +| T Consensus 78 ~~ 79 (80) T pfam04977 78 VE 79 (80) T ss_pred CC T ss_conf 68 No 3 >COG2919 Septum formation initiator [Cell division and chromosome partitioning] Probab=99.70 E-value=3.8e-16 Score=119.67 Aligned_cols=101 Identities=23% Similarity=0.349 Sum_probs=96.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH Q ss_conf 96249898677999999999999999999998372039999999999999999999999999999999995027998989 Q gi|254780671|r 1 MWTKYYKKNHFFRAIFLVIAFCCVVYFTNHAIVGDYGLKANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKD 80 (105) Q Consensus 1 m~tk~~r~~~~~~~~~~~~~~~~~~~f~~~~~~g~~Gl~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d 80 (105) ||++++.++...+.++..+++.+..+|+|++|+|..|+.++..+..+++..+++++.|.+++..|+.+|+.|+++ .| T Consensus 12 ~~~~~~~~~~~~~~~l~~~l~~~l~~f~~~~~~g~~~~~~~~~l~~qi~~~~~e~~~L~~~~~~l~~ei~~L~dg---~~ 88 (117) T COG2919 12 KATRQGERRVRRRRILTLVLLALLALFQYLAWFGKNGAADVLQLQRQIAAQQAELEKLSARNTALEAEIKDLKDG---RD 88 (117) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC---HH T ss_conf 999998872778999999999999999999999870599999999999999999999987899999999870353---78 Q ss_pred HHHHHHHHHHCCCCCCCEEEEECC Q ss_conf 999999998098789868767527 Q gi|254780671|r 81 LLDEKARYSLNLSRSDEIILFYSD 104 (105) Q Consensus 81 ~lEe~aR~~Lg~~kp~E~iii~~d 104 (105) |+|+.||..|||++|||++|.+++ T Consensus 89 ~i~e~AR~~l~~~~~gEi~f~i~~ 112 (117) T COG2919 89 YIEERARSELGMSKPGEIFFRLVK 112 (117) T ss_pred HHHHHHHHHHCCCCCCCEEEEECC T ss_conf 999999998288889987787153 No 4 >TIGR02209 ftsL_broad cell division protein FtsL; InterPro: IPR011922 This entry represents FtsL, both forms similar to that in Escherichia coli and similar to that in Bacillus subtilis. FtsL is one of the later proteins active in cell division septum formation. FtsL is small, low in complexity, and highly divergent , .; GO: 0007049 cell cycle, 0051301 cell division, 0016021 integral to membrane. Probab=97.90 E-value=0.00026 Score=47.39 Aligned_cols=60 Identities=17% Similarity=0.242 Sum_probs=54.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCEEEEE Q ss_conf 9999999999999999999999999999999950279989899999999980987898687675 Q gi|254780671|r 39 KANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARYSLNLSRSDEIILFY 102 (105) Q Consensus 39 ~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~~Lg~~kp~E~iii~ 102 (105) .+.+.+..++...+.+.+.+++++..|.-|+..|.+ +.=|+++|++.|||-.|+.-.-++ T Consensus 24 ~~~r~~~~e~~~~~~~~~~~~~e~~~L~lE~~~L~~----~~Ri~~iA~~~L~m~~~~~~~~~~ 83 (85) T TIGR02209 24 HQTRQLNNELQKLQLEIDKLQKEWRDLQLEVAELSR----HERIEKIAKKQLGMKLPDANIKVV 83 (85) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC----HHHHHHHHHHHCCCCCCCCCCEEC T ss_conf 999999999999999999999999989999986014----889999999865899876554010 No 5 >pfam04999 FtsL Cell division protein FtsL. In Escherichia coli, nine gene products are known to be essential for assembly of the division septum. One of these, FtsL, is a bitopic membrane protein whose precise function is not understood. It has been proposed that FtsL interacts with the DivIC protein pfam04977, however this interaction may be indirect. Probab=97.06 E-value=0.01 Score=37.60 Aligned_cols=56 Identities=16% Similarity=0.177 Sum_probs=48.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCE Q ss_conf 999999999999999999999999999999995027998989999999998098789868 Q gi|254780671|r 39 KANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARYSLNLSRSDEI 98 (105) Q Consensus 39 ~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~~Lg~~kp~E~ 98 (105) ..++.+-.+.+.++.+-..|+.+-..|.-|-..|.+ +.-+|.+||++|||..|+-. T Consensus 35 h~~R~l~~~l~~l~~e~~~L~~ew~~L~LEqstla~----~~RIe~~A~~~L~M~~P~~~ 90 (97) T pfam04999 35 HQTRQLTSELQKLLQERDRLEIEWGNLLLEQSTLAE----HSRVERIAREQLGMKRPDPE 90 (97) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC----HHHHHHHHHHCCCCCCCCCC T ss_conf 998999999999999999999999999999998614----76799999864799999987 No 6 >PRK10772 cell division protein FtsL; Provisional Probab=95.92 E-value=0.16 Score=30.24 Aligned_cols=59 Identities=15% Similarity=0.096 Sum_probs=43.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCC---CEEEE Q ss_conf 9999999999999999999999999999999950279989899999999980987898---68767 Q gi|254780671|r 39 KANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARYSLNLSRSD---EIILF 101 (105) Q Consensus 39 ~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~~Lg~~kp~---E~iii 101 (105) ...+.+-.+...+..+-+.|+.|=..|.-|-+.|.+ ..=+|..|+++|||.+|+ |++++ T Consensus 58 H~TR~l~~e~e~L~~erd~Ld~EW~nL~LEqstl~~----hsRIE~~A~~~L~M~~pdp~qEviVv 119 (121) T PRK10772 58 HHTRLLTAQREQLVLERDALDIEWRNLILEENALGD----HSRVERIATEKLQMQHVDPSQENIVV 119 (121) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHCCCCCCCCCCCEEEE T ss_conf 898999988999999999999999999999986221----65899999985598369941044653 No 7 >COG3116 FtsL Cell division protein [Cell division and chromosome partitioning] Probab=95.49 E-value=0.24 Score=29.24 Aligned_cols=53 Identities=15% Similarity=0.125 Sum_probs=39.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCC Q ss_conf 999999999999999999999999999999950279989899999999980987898 Q gi|254780671|r 40 ANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARYSLNLSRSD 96 (105) Q Consensus 40 ~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~~Lg~~kp~ 96 (105) ..+.+-.+...+..+-+.++.|=..|.-|-+.|. |..=+|++||+.|+|-.|+ T Consensus 44 ~tRqL~~e~~~~~~er~~L~~EwrnLilEe~tl~----~hsRVe~iAreqL~M~~pd 96 (105) T COG3116 44 HTRQLIAELEQLVLERDALNIEWRNLILEENTLG----DHSRVESIAREQLKMKHPD 96 (105) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHCCCCCC T ss_conf 9999998758999999998759998898887534----5778999999984677898 No 8 >COG4839 FtsL Protein required for the initiation of cell division [Cell division and chromosome partitioning] Probab=94.25 E-value=0.51 Score=27.20 Aligned_cols=58 Identities=19% Similarity=0.237 Sum_probs=50.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCEEEE Q ss_conf 999999999999999999999999999999995027998989999999998098789868767 Q gi|254780671|r 39 KANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARYSLNLSRSDEIILF 101 (105) Q Consensus 39 ~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~~Lg~~kp~E~iii 101 (105) .+..+.+.++..++..+++-+.++..++.+|..|.+ |+=+=++|- +.||.-.||-|=+ T Consensus 60 ~~~yqvq~ei~~Le~kIs~q~~e~~dlkqeV~dLss----~eRIldiAk-k~gLkl~~~NVkv 117 (120) T COG4839 60 TKAYQVQGEITDLESKISEQKTENDDLKQEVKDLSS----PERILDIAK-KAGLKLNNENVKV 117 (120) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC----HHHHHHHHH-HCCCCCCCCCCHH T ss_conf 999999769999999999998430069999998456----889999998-7566678764200 No 9 >TIGR02894 DNA_bind_RsfA transcription factor, RsfA family; InterPro: IPR014243 This entry represents a set of transcription factors found in some endospore-forming bacteria within the Firmicutes (low-GC Gram-positive bacteria). In some species these proteins are encoded multiple times. The best characterised protein in this entry is the prespore-specific transcription factor RsfA from Bacillus subtilis, previously known as YwfN. Expression of RsfA is controlled by sigma factor F, and it seems to improve the efficiency of sporulation by fine-tuning the expression of genes in the sigma F regulon, particularly the timing of their expression . It also negatively regulates spoIIR and its own synthesis. A paralog in B. subtilis is designated YlbO, though its function is not known. A highly variable linker region separates two very strongly conserved sequence regions within the protein. . Probab=91.17 E-value=1.4 Score=24.55 Aligned_cols=53 Identities=15% Similarity=0.232 Sum_probs=41.8 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 99999999999983720399999999999999999999999999999999950279 Q gi|254780671|r 20 AFCCVVYFTNHAIVGDYGLKANKSLEKSLIERERFLSELKENRSRLERKVKLMSDG 75 (105) Q Consensus 20 ~~~~~~~f~~~~~~g~~Gl~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~ 75 (105) +.=-++.|.=++ ..+..+...++.+...+..++..|..+|+.|+.++..|... T Consensus 83 Tl~~VI~FLq~~---~~~~~~~~~l~~E~~~L~~~~~~Lq~~ne~L~~el~~L~~~ 135 (163) T TIGR02894 83 TLQDVISFLQNL---KTTNPSDQALQKENERLKKELESLQKRNEELEKELEKLEKR 135 (163) T ss_pred CHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 878899999998---88779999999899999999999999999999999999998 No 10 >pfam05137 PilN Fimbrial assembly protein (PilN). Probab=89.08 E-value=2.1 Score=23.49 Aligned_cols=82 Identities=21% Similarity=0.362 Sum_probs=30.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHH----HHHHHHHH---HHHHHHHHHHHHHHHHHHHHHCC---C Q ss_conf 89867799999999999999999999837203999999----99999999---99999999999999999995027---9 Q gi|254780671|r 6 YKKNHFFRAIFLVIAFCCVVYFTNHAIVGDYGLKANKS----LEKSLIER---ERFLSELKENRSRLERKVKLMSD---G 75 (105) Q Consensus 6 ~r~~~~~~~~~~~~~~~~~~~f~~~~~~g~~Gl~~~~~----l~~~~~~~---~~~l~~l~~~~~~L~~~i~~L~~---~ 75 (105) .+++.+..+++.++++++++.+..+.+. ++.+..... +..++..+ -++++.++++.+.+..+++.+++ + T Consensus 13 ~~~~~f~~~~~~~~~~~~~~~~~~~~~~-~~~i~~q~~~~~~L~~ei~~L~~~i~ei~~l~~~~~~l~~r~~~i~~Lq~~ 91 (162) T pfam05137 13 QRKRRFLTLLLLVVVAGVLLAFAGWLYL-DNQIEAQQERNAFLRTQIQQLDGQLKEIQALKQERQALLERMKKIQELQQN 91 (162) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9999999999999999999999999999-999999999999999999999999999999999999999999999999971 Q ss_pred CCC-HHHHHHHHHH Q ss_conf 989-8999999999 Q gi|254780671|r 76 SLE-KDLLDEKARY 88 (105) Q Consensus 76 ~~D-~d~lEe~aR~ 88 (105) ... ...++++++- T Consensus 92 r~~~~~ll~~L~~~ 105 (162) T pfam05137 92 RPQTVRVFDELARL 105 (162) T ss_pred CCHHHHHHHHHHHH T ss_conf 82689999999875 No 11 >pfam07407 Seadorna_VP6 Seadornavirus VP6 protein. This family consists of several VP6 proteins from the Banna virus as well as a related protein VP5 from the Kadipiro virus. Members of this family are typically of around 420 residues in length. The function of this family is unknown. Probab=88.65 E-value=0.77 Score=26.12 Aligned_cols=41 Identities=15% Similarity=0.220 Sum_probs=35.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH Q ss_conf 99999999999999999999999995027998989999999 Q gi|254780671|r 46 KSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKA 86 (105) Q Consensus 46 ~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~a 86 (105) .+++.++.+.++|+++|+.|..+|.+|.+++...|.+|++- T Consensus 32 delaalreenaklkkenealktkihrlesdwttsdivekie 72 (420) T pfam07407 32 DELAALREENAKLKKENEALKTKIHRLESDWTTSDIVEKIE 72 (420) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 99898887645565452888766754414661677888888 No 12 >pfam11853 DUF3373 Protein of unknown function (DUF3373). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 472 to 574 amino acids in length. Probab=87.26 E-value=1.7 Score=24.06 Aligned_cols=34 Identities=24% Similarity=0.292 Sum_probs=24.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 9999999999999999999999999999999502 Q gi|254780671|r 40 ANKSLEKSLIERERFLSELKENRSRLERKVKLMS 73 (105) Q Consensus 40 ~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~ 73 (105) +-..+.++|+.+++++++|+++...+..+++.-. T Consensus 25 ~d~~~~qkI~~L~~ql~eLk~~~~~~~~~v~~~e 58 (485) T pfam11853 25 ADIDLLQKIEALKKELAELKAQLKDLNKRVDKTE 58 (485) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 4344999999999999999999998776655566 No 13 >pfam08961 DUF1875 Domain of unknown function (DUF1875). The MIT domain, found in Nuclear receptor-binding factor 2, has no known function. Probab=86.81 E-value=2.1 Score=23.44 Aligned_cols=57 Identities=12% Similarity=0.168 Sum_probs=41.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC------CCHHHHHHHHHHHHCCCCCCC Q ss_conf 99999999999999999999999999999999502799------898999999999809878986 Q gi|254780671|r 39 KANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGS------LEKDLLDEKARYSLNLSRSDE 97 (105) Q Consensus 39 ~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~------~D~d~lEe~aR~~Lg~~kp~E 97 (105) .....+++...-+-++++.|+.+|.+|..+..||..+. +|.|++|.- +--|+-.|+| T Consensus 129 T~IadLkr~Ve~LvaeNerLrkENkqL~AekaRL~k~p~eKeldvDaDfvekS--ELW~L~~~se 191 (243) T pfam08961 129 TKIADLKRHVEFLVAENERLRKENKQLKAEKARLLKGPIEKELDVDADFVETS--ELWVLPQHNE 191 (243) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHCCCCHHHHHHH--HHCCCCCCCC T ss_conf 77999999999998600999998888868998875185032115640145666--6506899864 No 14 >pfam06005 DUF904 Protein of unknown function (DUF904). This family consists of several bacterial and archaeal hypothetical proteins of unknown function. Probab=85.17 E-value=3.5 Score=22.12 Aligned_cols=53 Identities=19% Similarity=0.199 Sum_probs=42.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCC Q ss_conf 99999999999999999999999999999999502799898999999999809878 Q gi|254780671|r 39 KANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARYSLNLSR 94 (105) Q Consensus 39 ~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~~Lg~~k 94 (105) -...-++.++..++.+...+..+|..|..+...|+. +.+.-.+..|.-||-.. T Consensus 18 dtI~lLqmEieeLke~n~~l~~~~~~L~~EN~qLk~---e~~~wq~rLr~lLgk~~ 70 (72) T pfam06005 18 DTIALLQMEIEELKEENDQLSEENEELEEENKKLKE---ERNAWQSRLRALLGKLD 70 (72) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHC T ss_conf 999999999999997788899878999999999998---99999999999985131 No 15 >PRK11677 cytochrome d ubiquinol oxidase subunit III; Provisional Probab=83.17 E-value=4.2 Score=21.60 Aligned_cols=51 Identities=12% Similarity=0.068 Sum_probs=35.6 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999983720399999999999999999999999999999999950 Q gi|254780671|r 20 AFCCVVYFTNHAIVGDYGLKANKSLEKSLIERERFLSELKENRSRLERKVKLM 72 (105) Q Consensus 20 ~~~~~~~f~~~~~~g~~Gl~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L 72 (105) ..+++++|+. -++..+...-.++.+++++.+.+++..+++....=.+...| T Consensus 12 vGiiIG~~~~--R~~~~~~~~q~~Le~eLe~~k~el~~Yr~ev~~HF~~tA~L 62 (134) T PRK11677 12 VGIIIGAVAM--RFGNRKLRQQQALQYELEKNKAELEEYRQELVDHFARSAEL 62 (134) T ss_pred HHHHHHHHHH--HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999--98175116799999999999999999999999999999999 No 16 >pfam03245 Phage_lysis Bacteriophage lysis protein. This protein is involved in host lysis. This family is not considered to be a peptidase according to the MEROPs database. Probab=81.97 E-value=4.7 Score=21.32 Aligned_cols=49 Identities=10% Similarity=0.062 Sum_probs=38.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH Q ss_conf 999999999999999999999999999999999502799898999999999 Q gi|254780671|r 38 LKANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARY 88 (105) Q Consensus 38 l~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~ 88 (105) -......+.+..+...++......+..++.++..+.. +|.-|-+|.+.. T Consensus 25 ~~~a~~~k~q~~~~~~~l~~~~a~i~~~q~r~r~la~--LD~k~tkEL~~A 73 (151) T pfam03245 25 RDNADTAERQRDQATAELALANATIEDMQTRQRDLAA--LDAKHTKELADA 73 (151) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHH T ss_conf 8779999999999999999999799999999999999--978999999988 No 17 >pfam11382 DUF3186 Protein of unknown function (DUF3186). This bacterial family of proteins has no known function. Probab=81.34 E-value=5 Score=21.18 Aligned_cols=57 Identities=11% Similarity=0.104 Sum_probs=41.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCEEEEEC Q ss_conf 999999999999999999999999999999502799898999999999809878986876752 Q gi|254780671|r 41 NKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARYSLNLSRSDEIILFYS 103 (105) Q Consensus 41 ~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~~Lg~~kp~E~iii~~ 103 (105) ...+..+...++++.+.|+.+++.++.+++... +|++..+..-+.=.-.|..|-++. T Consensus 34 ~~~l~~~~~~Lr~e~~~l~~~~~~l~~~~~~~d------~f~~~~~p~lv~g~L~g~~Valv~ 90 (307) T pfam11382 34 LSGLEDEFSDLRTENDRLRAEREALNEQASAAD------QFIAALAPRLVAGRLTGKSVALVR 90 (307) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHCCCCCCEEEEEE T ss_conf 878888999999999999999999999999999------999998899973754897599997 No 18 >pfam12097 DUF3573 Protein of unknown function (DUF3573). This family of proteins is functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 372 to 530 amino acids in length. Probab=79.90 E-value=5.5 Score=20.88 Aligned_cols=31 Identities=13% Similarity=0.194 Sum_probs=14.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999999999999999995027 Q gi|254780671|r 44 LEKSLIERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 44 l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) ...+....++++.+|+++...|+.+|..|++ T Consensus 33 ~~~s~~~~~~~i~~Lq~QI~~Lq~eIn~l~~ 63 (375) T pfam12097 33 QTDSNQIDKKDISKLQKQIKSLQAQINHLEE 63 (375) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 2748775388999999999999999987775 No 19 >PRK09039 hypothetical protein; Validated Probab=77.76 E-value=6.4 Score=20.48 Aligned_cols=35 Identities=20% Similarity=0.233 Sum_probs=19.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 99999999999999999999999999999995027 Q gi|254780671|r 40 ANKSLEKSLIERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 40 ~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) ++.+++.+++++...+..-+.++..|+.++..|++ T Consensus 54 ~L~~L~~qia~L~~~L~Ler~~~~~L~~~l~~L~~ 88 (343) T PRK09039 54 ALDRLNAQIAELADLLSLERQGNQDLQDSVAQLRA 88 (343) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999987656554569999999999 No 20 >pfam08936 CsoSCA Carboxysome Shell Carbonic Anhydrase. Carboxysome Shell Carbonic Anhydrase is a bacterial carbonic anhydrase localized in the carboxysome, where it converts bicarbonate ions to carbon dioxide for use in carbon fixation. It contains three domains, these being: (1) an N-terminal domain composed primarily of four alpha-helices; (2) a catalytic domain containing a tightly bound zinc ion and (3) a C-terminal domain with weak structural similarity to the catalytic domain. Probab=76.66 E-value=6.9 Score=20.29 Aligned_cols=53 Identities=17% Similarity=0.266 Sum_probs=40.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHH---HHHHHHHCCCCCCCE Q ss_conf 99999999999999999999999999995027998989999---999998098789868 Q gi|254780671|r 43 SLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLD---EKARYSLNLSRSDEI 98 (105) Q Consensus 43 ~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lE---e~aR~~Lg~~kp~E~ 98 (105) ..++.+...+.+++.-=....-+-++|..++. +||++| ++||++||+--|..+ T Consensus 7 ~~N~~L~~yE~~vk~rFd~Ivp~Lk~isa~Qh---~~dF~~~AQ~la~~~LGf~LP~~i 62 (459) T pfam08936 7 EANERLQDYEREVKGRFDRIVPVLKEISALQH---EPDFVERAQQLARAELGFDLPEHI 62 (459) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC---CCCHHHHHHHHHHHHHCCCCCHHH T ss_conf 89999999999999887777899999874236---765899999999997589984889 No 21 >pfam06305 DUF1049 Protein of unknown function (DUF1049). This family consists of several hypothetical bacterial proteins of unknown function. Probab=76.21 E-value=7.1 Score=20.22 Aligned_cols=29 Identities=21% Similarity=0.253 Sum_probs=20.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 39999999999999999999999999999 Q gi|254780671|r 37 GLKANKSLEKSLIERERFLSELKENRSRL 65 (105) Q Consensus 37 Gl~~~~~l~~~~~~~~~~l~~l~~~~~~L 65 (105) ++..+.+.+.++..++++++.++++.+.+ T Consensus 51 ~~~~~l~~r~~~r~l~k~lk~l~~e~~~l 79 (80) T pfam06305 51 SLPFYLRLRRRLRRLKKQLKKLEKELEQL 79 (80) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 99999999999999999999999998846 No 22 >pfam02183 HALZ Homeobox associated leucine zipper. Probab=73.73 E-value=8.2 Score=19.84 Aligned_cols=35 Identities=11% Similarity=0.150 Sum_probs=30.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 99999999999999999999999999999995027 Q gi|254780671|r 40 ANKSLEKSLIERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 40 ~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) .|.-+++....+..+...|..+|..|..++..|++ T Consensus 6 Dy~~LK~~yd~Lk~~~~~L~~e~~~L~ae~~~L~~ 40 (45) T pfam02183 6 DYEVLKRCYDALKAENDSLQKENEKLRAEVLSLKE 40 (45) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999988999988899999999999999999998 No 23 >pfam07423 DUF1510 Protein of unknown function (DUF1510). This family consists of several hypothetical bacterial proteins of around 200 residues in length. The function of this family is unknown. Probab=73.54 E-value=3.8 Score=21.85 Aligned_cols=32 Identities=16% Similarity=0.101 Sum_probs=18.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 498986779999999999999999999983720 Q gi|254780671|r 4 KYYKKNHFFRAIFLVIAFCCVVYFTNHAIVGDY 36 (105) Q Consensus 4 k~~r~~~~~~~~~~~~~~~~~~~f~~~~~~g~~ 36 (105) |+||.+. ...+.+++.++++++.+|.++++.. T Consensus 8 k~Rk~n~-vLNiaI~iV~llIiiva~~lf~~~~ 39 (214) T pfam07423 8 KRRKINR-VLNIAIGIVVVLIIIVAYQLFFPSS 39 (214) T ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHEEECCC T ss_conf 7764534-5579999999999997664023588 No 24 >KOG4797 consensus Probab=72.12 E-value=9 Score=19.60 Aligned_cols=45 Identities=20% Similarity=0.222 Sum_probs=35.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH Q ss_conf 9999999999999999999999999999995027998989999999 Q gi|254780671|r 41 NKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKA 86 (105) Q Consensus 41 ~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~a 86 (105) ....+.+.+.++.++.+|.+.|..|+++-..|++.. -||-+.+.- T Consensus 62 mfAVREEVe~Lk~qI~eL~er~~~Le~EN~lLk~~~-spe~L~ql~ 106 (123) T KOG4797 62 MFAVREEVEVLKEQIRELEERNSALERENSLLKTLA-SPEQLAQLP 106 (123) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC-CHHHHHHHH T ss_conf 999999999999999999999999999878998508-999999988 No 25 >pfam08537 NBP1 Fungal Nap binding protein NBP1. NBP1 is a nuclear protein which has been shown in Saccharomyces cerevisiae to be essential for the G2/M transition of the cell cycle. Probab=70.63 E-value=9.7 Score=19.39 Aligned_cols=42 Identities=29% Similarity=0.471 Sum_probs=24.3 Q ss_pred HHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHCCCCCCHHHHHHH Q ss_conf 99999999999999999-------9999999999999502799898999999 Q gi|254780671|r 41 NKSLEKSLIERERFLSE-------LKENRSRLERKVKLMSDGSLEKDLLDEK 85 (105) Q Consensus 41 ~~~l~~~~~~~~~~l~~-------l~~~~~~L~~~i~~L~~~~~D~d~lEe~ 85 (105) +.++..++.+++++|.. .++.|..|+ ..|.+.++|++|+.-. T Consensus 175 i~~le~kl~e~~~EL~~tqkkL~f~qEKn~LLe---sLLDdaNId~~YvKSR 223 (313) T pfam08537 175 IDKLEEKLAELEQELQSTQKKLQFVKEKNHLLE---SLLDDANIDDEYVKSR 223 (313) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHCCCCCHHHHHHH T ss_conf 999999999999999999999998999999999---9730235778988777 No 26 >KOG4571 consensus Probab=69.22 E-value=10 Score=19.21 Aligned_cols=47 Identities=9% Similarity=-0.026 Sum_probs=26.4 Q ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 9999-9999999999999999999999999995027998989999999998 Q gi|254780671|r 40 ANKS-LEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARYS 89 (105) Q Consensus 40 ~~~~-l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~~ 89 (105) +|++ .+.+-+.+..+++.|+.+|.+|..+...|... -+|+-+..++. T Consensus 241 RYRqKkRae~E~l~ge~~~Le~rN~~LK~qa~~lerE---I~ylKqli~e~ 288 (294) T KOG4571 241 RYRQKKRAEKEALLGELEGLEKRNEELKDQASELERE---IRYLKQLILEV 288 (294) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHH T ss_conf 9999989999888878777888779999999999999---99999999999 No 27 >TIGR03007 pepcterm_ChnLen polysaccharide chain length determinant protein, PEP-CTERM locus subfamily; InterPro: IPR014345 Members of this entry belong to the family of polysaccharide chain length determinant proteins. They are found in species that encode the PEP-CTERM/exosortase system and are predicted to act in protein sorting in a number of Gram-negative bacteria, they are also found near the epsH homologue that is the putative exosortase gene.. Probab=68.95 E-value=10 Score=19.22 Aligned_cols=36 Identities=17% Similarity=0.241 Sum_probs=31.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 999999999999999999999999999999950279 Q gi|254780671|r 40 ANKSLEKSLIERERFLSELKENRSRLERKVKLMSDG 75 (105) Q Consensus 40 ~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~ 75 (105) -|.+++.++++++++++.|+...+.++.+|.+|++. T Consensus 323 vyQql~~~l~~~eA~~asl~~r~~~~~~~~~~l~~~ 358 (510) T TIGR03007 323 VYQQLQIELAEAEAEIASLEARVAELTARIERLESL 358 (510) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 137789999999999999999999999999999999 No 28 >pfam01166 TSC22 TSC-22/dip/bun family. Probab=67.55 E-value=11 Score=19.00 Aligned_cols=43 Identities=21% Similarity=0.305 Sum_probs=35.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH Q ss_conf 99999999999999999999999999999502799898999999 Q gi|254780671|r 42 KSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEK 85 (105) Q Consensus 42 ~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~ 85 (105) ...+.+.+.++..+.+|..++.+|+.+-..|++.. -||-+++. T Consensus 10 ~AVREEVevLk~~I~eL~~~~~~le~EN~~Lk~~~-spe~l~ql 52 (57) T pfam01166 10 YAVREEVEVLKEQIKELEEKNSQLERENTLLKSLA-SPEQLEQL 52 (57) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CHHHHHHH T ss_conf 99999999999999999999999999999998609-99999999 No 29 >pfam07106 TBPIP Tat binding protein 1(TBP-1)-interacting protein (TBPIP). This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation. Probab=66.54 E-value=12 Score=18.88 Aligned_cols=50 Identities=20% Similarity=0.205 Sum_probs=38.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH Q ss_conf 99999999999999999999999999999999950279989899999999 Q gi|254780671|r 38 LKANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKAR 87 (105) Q Consensus 38 l~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR 87 (105) --....+..++..++.++..++.+...++.+...|.+.....++.++++. T Consensus 71 ~eel~~~d~~i~~L~~~l~~l~~~~~~l~~el~~L~s~~t~eel~~~i~~ 120 (169) T pfam07106 71 DEELNKLDMEIEELREEVQLLKQDCSTLEIELKSLTSDLTTEELQEEIQE 120 (169) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 88998769999999999999999999999999998758999999999999 No 30 >pfam06295 DUF1043 Protein of unknown function (DUF1043). This family consists of several hypothetical bacterial proteins of unknown function. Probab=61.32 E-value=15 Score=18.28 Aligned_cols=49 Identities=14% Similarity=0.078 Sum_probs=34.7 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999983720399999999999999999999999999999999950 Q gi|254780671|r 22 CCVVYFTNHAIVGDYGLKANKSLEKSLIERERFLSELKENRSRLERKVKLM 72 (105) Q Consensus 22 ~~~~~f~~~~~~g~~Gl~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L 72 (105) +++++++. -++..+...-.++.+++++.+.+++..+++....=.+...| T Consensus 10 ~iiG~~~~--R~~~~~~~~q~~Le~eLe~~k~el~~Y~~ev~~HF~~tA~L 58 (128) T pfam06295 10 LIIGLIIM--RLTNPTYKNQKKIKKELEEAKEELDEQRQELKDHFAQSAEL 58 (128) T ss_pred HHHHHHHH--HHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999--97274076899999999999999999999999999999999 No 31 >pfam04111 APG6 Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway. Probab=60.34 E-value=15 Score=18.17 Aligned_cols=45 Identities=13% Similarity=0.075 Sum_probs=29.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHH Q ss_conf 999999999999999999999999999999950279989899999 Q gi|254780671|r 40 ANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDE 84 (105) Q Consensus 40 ~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe 84 (105) .+..+.++...+.+++..++++.+.+..++..|.......+.-|+ T Consensus 71 el~~l~~ee~~l~~eL~~le~e~~~l~~el~~le~e~~~l~~ee~ 115 (356) T pfam04111 71 ELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEEL 115 (356) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999999999999999999999999999 No 32 >COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning] Probab=59.70 E-value=16 Score=18.11 Aligned_cols=36 Identities=25% Similarity=0.462 Sum_probs=23.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 999999999999999999999999999999995027 Q gi|254780671|r 39 KANKSLEKSLIERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 39 ~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) ..+....++++..+.++.....+...|+.+++.+.. T Consensus 38 ~~l~q~q~ei~~~~~~i~~~~~~~~kL~~~lk~~e~ 73 (420) T COG4942 38 KQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLET 73 (420) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 889989999999999999999999999999999998 No 33 >pfam09889 DUF2116 Uncharacterized protein containing a Zn-ribbon (DUF2116). This domain, found in various hypothetical archaeal proteins, has no known function. Probab=58.86 E-value=15 Score=18.27 Aligned_cols=29 Identities=17% Similarity=0.125 Sum_probs=15.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 62498986779999999999999999999 Q gi|254780671|r 2 WTKYYKKNHFFRAIFLVIAFCCVVYFTNH 30 (105) Q Consensus 2 ~tk~~r~~~~~~~~~~~~~~~~~~~f~~~ 30 (105) |.+.+|+-...+.+++.+++++++++.+. T Consensus 28 ~~~~rkk~~~~q~i~f~i~il~i~v~~~~ 56 (59) T pfam09889 28 YRKKRKKMRKSQQILFGIFILFLAVYAVL 56 (59) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999999998 No 34 >pfam10883 DUF2681 Protein of unknown function (DUF2681). This family of proteins with unknown function appears to be restricted to Haemophilus spp. Probab=57.54 E-value=17 Score=17.88 Aligned_cols=48 Identities=15% Similarity=0.164 Sum_probs=23.1 Q ss_pred HHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999998372039999999999999999999999999999999995 Q gi|254780671|r 17 LVIAFCCVVYFTNHAIVGDYGLKANKSLEKSLIERERFLSELKENRSRLERKVKL 71 (105) Q Consensus 17 ~~~~~~~~~~f~~~~~~g~~Gl~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~ 71 (105) .+++.+++.+-+|..| ....-+++++.+.++.+.|+.+..--+.+++- T Consensus 6 ~a~a~~v~l~~gYv~~-------~l~ka~~~~~kL~~qNeQLqtEKaVAetQVKn 53 (78) T pfam10883 6 LAFAVAVVLLVGYVYY-------RLRQAQRRIEKLQKENEQLQTEKAVAETQVKH 53 (78) T ss_pred HHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999-------99999999999999899999999999999888 No 35 >KOG4603 consensus Probab=57.38 E-value=17 Score=17.86 Aligned_cols=48 Identities=15% Similarity=0.240 Sum_probs=37.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH Q ss_conf 999999999999999999999999999999995027998989999999 Q gi|254780671|r 39 KANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKA 86 (105) Q Consensus 39 ~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~a 86 (105) -....+.-.+..+++....+.++....+.+|..|++.-.-|++.|+++ T Consensus 79 eel~~ld~~i~~l~ek~q~l~~t~s~veaEik~L~s~Lt~eemQe~i~ 126 (201) T KOG4603 79 EELQVLDGKIVALTEKVQSLQQTCSYVEAEIKELSSALTTEEMQEEIQ 126 (201) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 899988679999999999999898889999999998637499999999 No 36 >pfam10211 Ax_dynein_light Axonemal dynein light chain. Axonemal dynein light chain proteins play a dynamic role in flagellar and cilia motility. Eukaryotic cilia and flagella are complex organelles consisting of a core structure, the axoneme, which is composed of nine microtubule doublets forming a cylinder that surrounds a pair of central singlet microtubules. This ultra-structural arrangement seems to be one of the most stable micro-tubular assemblies known and is responsible for the flagellar and ciliary movement of a large number of organisms ranging from protozoan to mammals. This light chain interacts directly with the N-terminal half of the heavy chains. Probab=54.76 E-value=19 Score=17.60 Aligned_cols=47 Identities=19% Similarity=0.356 Sum_probs=28.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH Q ss_conf 39999999999999999999999999999999995027998989999999 Q gi|254780671|r 37 GLKANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKA 86 (105) Q Consensus 37 Gl~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~a 86 (105) |+-+-..-.+....++++++.|+.++..|+.++.-|+.. -+.+|..+ T Consensus 111 g~RKal~~e~~k~~l~~~i~~Le~e~~~Le~~v~el~~k---~e~~ekr~ 157 (189) T pfam10211 111 GMRKALKAEQGKSELEQEIKKLEEEKEELEKEVAELEAK---LEEIEKRE 157 (189) T ss_pred HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHH T ss_conf 889999887276889999999999999999999999999---99999879 No 37 >pfam07716 bZIP_2 Basic region leucine zipper. Probab=54.42 E-value=19 Score=17.57 Aligned_cols=34 Identities=18% Similarity=0.285 Sum_probs=28.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999999999999999999995027 Q gi|254780671|r 41 NKSLEKSLIERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 41 ~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) ....+.....++.+...|..+|..|..+|..|.. T Consensus 20 R~rkk~~~~~le~~~~~L~~eN~~L~~~i~~L~~ 53 (54) T pfam07716 20 REKKKQREEELEERVKELERENAQLRQKIEQLEK 53 (54) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9999999999999999999999999999999754 No 38 >PRK13922 rod shape-determining protein MreC; Provisional Probab=52.08 E-value=21 Score=17.34 Aligned_cols=47 Identities=23% Similarity=0.167 Sum_probs=24.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCC Q ss_conf 999999999999999999999999999502799898999999999809878 Q gi|254780671|r 44 LEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARYSLNLSR 94 (105) Q Consensus 44 l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~~Lg~~k 94 (105) .-.+...+.++++.|++++..|+.+...+.. ...=-+..|+-|++.. T Consensus 60 ~~~~~~~l~~en~~L~~e~~~l~~~~~~~~~----l~~En~~Lr~lL~~~~ 106 (264) T PRK13922 60 NVSSLFDLREENEELKEELLLLESRLLELEQ----LKQENARLRELLGLKE 106 (264) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHH T ss_conf 9987999999999999999999999999999----9999999999872222 No 39 >pfam05377 FlaC_arch Flagella accessory protein C (FlaC). Although archaeal flagella appear superficially similar to those of bacteria, they are quite distinct. In several archaea, the flagellin genes are followed immediately by the flagellar accessory genes flaCDEFGHIJ. The gene products may have a role in translocation, secretion, or assembly of the flagellum. FlaC is a protein whose exact role is unknown but it has been shown to be membrane-associated (by immuno-blotting fractionated cells). Probab=52.07 E-value=18 Score=17.71 Aligned_cols=45 Identities=16% Similarity=0.289 Sum_probs=20.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC-HHHHHHHHH Q ss_conf 999999999999999999999999999950279989-899999999 Q gi|254780671|r 43 SLEKSLIERERFLSELKENRSRLERKVKLMSDGSLE-KDLLDEKAR 87 (105) Q Consensus 43 ~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D-~d~lEe~aR 87 (105) ++.+++..+...+..++++++.+...+..+.++-.| .+..|-+.| T Consensus 4 elEn~~~~l~~~i~t~~~en~~i~~~ie~i~envk~v~~lYEvVS~ 49 (55) T pfam05377 4 ELENKLPKISSMVSTVRKENEELSEEVEKIDENVKDVLSLYEVVSN 49 (55) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 8886758899999989988999999999999999999999999981 No 40 >KOG0483 consensus Probab=50.89 E-value=22 Score=17.22 Aligned_cols=49 Identities=20% Similarity=0.186 Sum_probs=39.4 Q ss_pred HHHHHHHCCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 9999983720-399999999999999999999999999999999950279 Q gi|254780671|r 27 FTNHAIVGDY-GLKANKSLEKSLIERERFLSELKENRSRLERKVKLMSDG 75 (105) Q Consensus 27 f~~~~~~g~~-Gl~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~ 75 (105) -+..+||-++ .-++..++..+...++..+..+..++..|+.++..|+.. T Consensus 92 RQVavWFQNRRARwK~kqlE~d~~~Lk~~~~~l~~~~~~Lq~e~~eL~~~ 141 (198) T KOG0483 92 RQVAVWFQNRRARWKTKQLEKDYESLKRQLESLRSENDRLQSEVQELVAE 141 (198) T ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 68999984013100104355568999999988733216788889999998 No 41 >TIGR03064 sortase_srtB sortase, SrtB family. Members of this transpeptidase family are, in most cases, designated sortase B, product of the srtB gene. This protein shows only distant similarity to the sortase A family, for which there may be several members in a single bacterial genome. Typical SrtB substrate motifs include NAKTN, NPKSS, etc, and otherwise resemble the LPXTG sorting signals recognized by sortase A proteins. Probab=50.40 E-value=13 Score=18.59 Aligned_cols=39 Identities=8% Similarity=0.001 Sum_probs=17.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999983720399999999999999999 Q gi|254780671|r 13 RAIFLVIAFCCVVYFTNHAIVGDYGLKANKSLEKSLIERERF 54 (105) Q Consensus 13 ~~~~~~~~~~~~~~f~~~~~~g~~Gl~~~~~l~~~~~~~~~~ 54 (105) +.++.+++++++++-++.++. .+..+.+-.+..+++++. T Consensus 3 ~~ii~iv~l~Vf~~s~~~l~~---~~~~y~~~~~~~~~~~~~ 41 (232) T TIGR03064 3 KKSITLLFLIVFFYSLYKLGQ---IFYDYYTNRQVLAEAQQV 41 (232) T ss_pred HHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHH T ss_conf 688999999999999999999---998876636999999988 No 42 >pfam00170 bZIP_1 bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region. Probab=50.21 E-value=23 Score=17.15 Aligned_cols=34 Identities=15% Similarity=0.182 Sum_probs=22.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999999999999999999995027 Q gi|254780671|r 41 NKSLEKSLIERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 41 ~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) ....+..+..++.+...|..+|..|..++..|+. T Consensus 16 R~RKk~~~~~Le~~v~~L~~eN~~L~~~~~~L~~ 49 (59) T pfam00170 16 RLRKKAYIEELEEKVKELEAENKTLRSELERLKK 49 (59) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999999999999999999 No 43 >PRK03992 proteasome-activating nucleotidase; Provisional Probab=50.20 E-value=23 Score=17.15 Aligned_cols=33 Identities=30% Similarity=0.424 Sum_probs=15.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 999999999999999999999999999999502 Q gi|254780671|r 41 NKSLEKSLIERERFLSELKENRSRLERKVKLMS 73 (105) Q Consensus 41 ~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~ 73 (105) +..+..+..+++.++..++.++..|+.++.+|+ T Consensus 11 ~~~l~~~~~~l~~~~~~~~~~~~~l~~~~~~l~ 43 (390) T PRK03992 11 EEQLESRNEELEEQLRDLEAENERLERELERLK 43 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999999999999999 No 44 >pfam08172 CASP_C CASP C terminal. This domain is the C-terminal region of the CASP family of proteins. It is a Golgi membrane protein which is thought to have a role in vesicle transport. Probab=50.04 E-value=23 Score=17.14 Aligned_cols=50 Identities=24% Similarity=0.250 Sum_probs=33.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCC Q ss_conf 99999999999999999999999999999950279989899999999980987 Q gi|254780671|r 41 NKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARYSLNLS 93 (105) Q Consensus 41 ~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~~Lg~~ 93 (105) +-+.++...++++++..+.++...|+.++..|+.+ ---|=|..|..=.|- T Consensus 88 RDRFr~Rn~ELE~elr~~~~~~~~l~~ev~~L~~D---N~kLYEK~RylqsY~ 137 (245) T pfam08172 88 RDRFRKRNTELEEELRKLNQTISSLRQELASLKAD---NIKLYEKIRYLQSYN 137 (245) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHCC T ss_conf 99999999999999999999999999999999988---899999999886056 No 45 >pfam03518 consensus Probab=49.54 E-value=23 Score=17.09 Aligned_cols=45 Identities=9% Similarity=0.120 Sum_probs=28.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH Q ss_conf 99999999999999999999999999999950279989899999999 Q gi|254780671|r 41 NKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKAR 87 (105) Q Consensus 41 ~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR 87 (105) |-..-+...........++++.+.+.++.+.|+++ ||.|-.-.+- T Consensus 9 ~~~~~~~~~~ad~k~~~~~kK~n~~q~~l~~l~P~--dP~y~k~~~~ 53 (349) T pfam03518 9 YEAQIKKLDNADSKIDAAAKKINQAQNKLQELDPA--DPEKAKAEAA 53 (349) T ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCC--CCCHHHHHHH T ss_conf 99998842016688999999999999887558999--9306899989 No 46 >KOG4196 consensus Probab=49.54 E-value=23 Score=17.09 Aligned_cols=37 Identities=30% Similarity=0.440 Sum_probs=25.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999999999999999999999995027 Q gi|254780671|r 38 LKANKSLEKSLIERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 38 l~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) +....++..+...+.++++.|..++..+..+.+.+++ T Consensus 73 v~Qk~eLE~~k~~L~qqv~~L~~e~s~~~~E~da~k~ 109 (135) T KOG4196 73 VQQKHELEKEKAELQQQVEKLKEENSRLRRELDAYKS 109 (135) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9889999999999999999999999999999999999 No 47 >pfam11119 DUF2633 Protein of unknown function (DUF2633). This family is conserved largely in the Bacillaceae family. Several members are named as YfgG. The function is not known. Probab=49.48 E-value=18 Score=17.69 Aligned_cols=29 Identities=14% Similarity=0.157 Sum_probs=18.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 49898677999999999999999999998 Q gi|254780671|r 4 KYYKKNHFFRAIFLVIAFCCVVYFTNHAI 32 (105) Q Consensus 4 k~~r~~~~~~~~~~~~~~~~~~~f~~~~~ 32 (105) |.++.....++++++.++++++=|+|.++ T Consensus 2 r~r~~~~mt~iVLLISFiilFgRl~Y~aI 30 (59) T pfam11119 2 RRKKNGRMTKIVLLISFIIFFGRLIYSAI 30 (59) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 97766612568999999999999999877 No 48 >TIGR02976 phageshock_pspB phage shock protein B; InterPro: IPR009554 This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages . Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one .. Probab=48.31 E-value=24 Score=16.97 Aligned_cols=25 Identities=24% Similarity=0.453 Sum_probs=16.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999999999995027 Q gi|254780671|r 50 ERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 50 ~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) .-.+.|++|.++-+.|+++|+-|.. T Consensus 39 ~d~~~L~~L~~~A~Rl~~Ri~tLE~ 63 (75) T TIGR02976 39 DDLQLLQELYAKADRLEERIDTLER 63 (75) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999899999999999 No 49 >KOG3705 consensus Probab=47.57 E-value=25 Score=16.90 Aligned_cols=81 Identities=10% Similarity=0.158 Sum_probs=35.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-CCH-HHHHHHHHHHH Q ss_conf 9999999999999999999837203999999999999999999999999999999999502799-898-99999999980 Q gi|254780671|r 13 RAIFLVIAFCCVVYFTNHAIVGDYGLKANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGS-LEK-DLLDEKARYSL 90 (105) Q Consensus 13 ~~~~~~~~~~~~~~f~~~~~~g~~Gl~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~-~D~-d~lEe~aR~~L 90 (105) .++.++++.+.+.||.+.-+..++|--. .-.+.+.+..+.++.++++|++|..-++.|+.+. +|. ..++..-+-+= T Consensus 13 ~li~~v~aW~tl~~~~~~~L~n~q~~~~--~s~R~~sq~l~~le~l~qqNEdLk~~~e~lr~~~~~d~~~am~~v~~Le~ 90 (580) T KOG3705 13 ALITVVLAWMTLFLFLYSQLSNNQSQGG--DSIRAWSQTLEALEKLQQQNEDLKSILEKLRQERNDDHKKAMEQVHQLEP 90 (580) T ss_pred HHHHHHHHHHHHHHHHHEEECCCCCCCC--CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCC T ss_conf 9999999999998763002103688876--31899999999999998751899999999725235413568888761888 Q ss_pred CCCCC Q ss_conf 98789 Q gi|254780671|r 91 NLSRS 95 (105) Q Consensus 91 g~~kp 95 (105) ...+| T Consensus 91 ~~~np 95 (580) T KOG3705 91 NPENP 95 (580) T ss_pred CCCCC T ss_conf 86786 No 50 >pfam03980 Nnf1 Nnf1. NNF1 is an essential yeast gene that is necessary for chromosome segregation. It is associated with the spindle poles and forms part of a kinetochore subcomplex called MIND. Probab=46.95 E-value=25 Score=16.84 Aligned_cols=34 Identities=18% Similarity=0.250 Sum_probs=27.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999999999999999999995027 Q gi|254780671|r 41 NKSLEKSLIERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 41 ~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) +..+..+...+...+..++.+|..|..+|..++. T Consensus 74 ~~~~~~~~~~L~~~L~~~~~eN~~L~~~v~~~r~ 107 (108) T pfam03980 74 VPYLLKQLDELDAKLQKIQAENAALADEIQALRE 107 (108) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 7789999999999999999999999999986534 No 51 >KOG0642 consensus Probab=46.40 E-value=26 Score=16.79 Aligned_cols=55 Identities=13% Similarity=-0.011 Sum_probs=26.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCC Q ss_conf 9999999999999999999999999999950279989899999999980987898 Q gi|254780671|r 42 KSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARYSLNLSRSD 96 (105) Q Consensus 42 ~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~~Lg~~kp~ 96 (105) ..+..+.++++++++.|+.|+..+++....|+....=++|--+.-|.+....++| T Consensus 37 ~~WElERaElqariAfLqgErk~qenlk~dl~rR~kmlE~~lkeerak~~~lq~g 91 (577) T KOG0642 37 ARWELERAELQARIAFLQGERKGQENLKMDLVRRIKMLEFALKEERAKYNKLQPG 91 (577) T ss_pred HHEEHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCC T ss_conf 5122125567788998740102068889999998752430367764343103645 No 52 >KOG3335 consensus Probab=46.39 E-value=22 Score=17.24 Aligned_cols=27 Identities=19% Similarity=0.273 Sum_probs=15.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 999999999999999999999995027 Q gi|254780671|r 48 LIERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 48 ~~~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) .+..+.++.+|+.+...|+++|..++. T Consensus 101 ee~~~~e~~elr~~~~~l~~~i~~~~~ 127 (181) T KOG3335 101 EEKRKQEIMELRLKVEKLENAIAELTK 127 (181) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 799999999999999999999999999 No 53 >COG3074 Uncharacterized protein conserved in bacteria [Function unknown] Probab=45.70 E-value=27 Score=16.72 Aligned_cols=49 Identities=16% Similarity=0.184 Sum_probs=26.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCC Q ss_conf 9999999999999999999999999999995027998989999999998098 Q gi|254780671|r 41 NKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARYSLNL 92 (105) Q Consensus 41 ~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~~Lg~ 92 (105) .-+++...+.+..+...+...++.|+.+-..|++. ...--+..|.-||- T Consensus 27 ieELKEknn~l~~e~q~~q~~reaL~~eneqlk~e---~~~WQerlrsLLGk 75 (79) T COG3074 27 IEELKEKNNSLSQEVQNAQHQREALERENEQLKEE---QNGWQERLRALLGK 75 (79) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHH T ss_conf 99999875177788888898799999888999989---97799999998710 No 54 >KOG0999 consensus Probab=44.81 E-value=27 Score=16.64 Aligned_cols=53 Identities=21% Similarity=0.249 Sum_probs=35.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999999950279989899999999980 Q gi|254780671|r 38 LKANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARYSL 90 (105) Q Consensus 38 l~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~~L 90 (105) +.+..++..++.+++.++...+.+|+.+......+.+++-+.+.--...|.++ T Consensus 106 l~kI~eleneLKq~r~el~~~q~E~erl~~~~sd~~~~~~~~E~qR~rlr~el 158 (772) T KOG0999 106 LQKILELENELKQLRQELTNVQEENERLEKVHSDLKESNAAVEDQRRRLRDEL 158 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 99999999999999999998877899999998776302104588998999999 No 55 >KOG0288 consensus Probab=44.48 E-value=28 Score=16.61 Aligned_cols=46 Identities=15% Similarity=0.186 Sum_probs=20.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH Q ss_conf 999999999999999999999999999999502799898999999999 Q gi|254780671|r 41 NKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARY 88 (105) Q Consensus 41 ~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~ 88 (105) +-.+..|...++++...+.+.-...|.++.+|+.++ .+..|+..|+ T Consensus 29 ~s~~~aq~~~~~a~~~ai~a~~~~~E~~l~~Lq~e~--~~l~e~~v~~ 74 (459) T KOG0288 29 QSRLSAQLVILRAESRAIKAKLQEKELELNRLQEEN--TQLNEERVRE 74 (459) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHH T ss_conf 999988899999989999999888899999999988--8888999988 No 56 >pfam03962 Mnd1 Mnd1 family. This family of proteins includes MND1 from S. cerevisiae. The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair. Probab=44.11 E-value=28 Score=16.57 Aligned_cols=36 Identities=11% Similarity=0.220 Sum_probs=19.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH Q ss_conf 99999999999999999999950279989899999999 Q gi|254780671|r 50 ERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKAR 87 (105) Q Consensus 50 ~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR 87 (105) .+-.+++.|+.++..|..+...++.. ||+.+++.-+ T Consensus 107 ~ll~~l~~l~~~~~~L~~el~~~~~~--Dp~~i~~~~~ 142 (188) T pfam03962 107 ELLEELKQLEKELKKLKAELEKYEKN--DPERIEKLKE 142 (188) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHC--CHHHHHHHHH T ss_conf 99999999999999999999998836--9999999999 No 57 >KOG0994 consensus Probab=43.64 E-value=29 Score=16.53 Aligned_cols=50 Identities=18% Similarity=0.217 Sum_probs=31.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH----CCCCCCHHHHHHHHHHHHCCCCCC Q ss_conf 99999999999999999999999950----279989899999999980987898 Q gi|254780671|r 47 SLIERERFLSELKENRSRLERKVKLM----SDGSLEKDLLDEKARYSLNLSRSD 96 (105) Q Consensus 47 ~~~~~~~~l~~l~~~~~~L~~~i~~L----~~~~~D~d~lEe~aR~~Lg~~kp~ 96 (105) -..+..+-...+++.|++|++-|..+ ....-|||-++++|-+.|++--|- T Consensus 1455 ~~~~a~as~~q~~~s~~el~~Li~~v~~Flt~~~adp~si~~vA~~vL~l~lp~ 1508 (1758) T KOG0994 1455 ALEQANASRSQMEESNRELRNLIQQVRDFLTQPDADPDSIEEVAEEVLALELPL 1508 (1758) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCC T ss_conf 999988889988889999999999999874088999799999999998532789 No 58 >pfam09851 DUF2078 Predicted membrane protein (DUF2078). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=43.48 E-value=18 Score=17.80 Aligned_cols=61 Identities=15% Similarity=0.163 Sum_probs=22.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999983720399999999999999999999999999999999950 Q gi|254780671|r 12 FRAIFLVIAFCCVVYFTNHAIVGDYGLKANKSLEKSLIERERFLSELKENRSRLERKVKLM 72 (105) Q Consensus 12 ~~~~~~~~~~~~~~~f~~~~~~g~~Gl~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L 72 (105) ...++.++.+++++++.+..+.+.++-.....-+.-.+.++++++.=+-+.++.+++.+.| T Consensus 11 ~~~i~~~lli~~vv~l~~~~~~~~~~~~~~~~~~~al~IL~eRyArGEI~eeEy~~rk~~L 71 (73) T pfam09851 11 LMLLFWLLLIAGVVYLLVRLFGGTRGRSDSPPDSDALEILKERYARGEIDEEEYEERKKDL 71 (73) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHH T ss_conf 9999999999999999999970789889998876899999999865898999999999886 No 59 >COG3879 Uncharacterized protein conserved in bacteria [Function unknown] Probab=43.18 E-value=29 Score=16.49 Aligned_cols=65 Identities=18% Similarity=0.163 Sum_probs=40.0 Q ss_pred HHHHHCCCHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CCCCHHHHHH---HHHHHHCCC Q ss_conf 999837203999-9999999999999999999999999999995027-9989899999---999980987 Q gi|254780671|r 29 NHAIVGDYGLKA-NKSLEKSLIERERFLSELKENRSRLERKVKLMSD-GSLEKDLLDE---KARYSLNLS 93 (105) Q Consensus 29 ~~~~~g~~Gl~~-~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~-~~~D~d~lEe---~aR~~Lg~~ 93 (105) +..|-++....+ -..+..++...+++...|..+.+.++++++..+. ..-|...+|. ..|...|++ T Consensus 39 ~q~~k~~~~~~~r~~~L~~e~~s~Q~~~~~L~~ev~~~~~~~~s~~~~~~t~~~~ie~~l~~l~~~aG~v 108 (247) T COG3879 39 FQTSKGESVRRARDLDLVKELRSLQKKVNTLAAEVEDLENKLDSVRRSVLTDDAALEDRLEKLRMLAGSV 108 (247) T ss_pred HHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 8512685032235668999989999999999999999999998877767769999999999998885557 No 60 >KOG1103 consensus Probab=42.92 E-value=29 Score=16.46 Aligned_cols=62 Identities=21% Similarity=0.270 Sum_probs=46.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH---HHCCCCCCCEE Q ss_conf 203999999999999999999999999999999999502799898999999999---80987898687 Q gi|254780671|r 35 DYGLKANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARY---SLNLSRSDEII 99 (105) Q Consensus 35 ~~Gl~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~---~Lg~~kp~E~i 99 (105) ++|+..-.++..++.+...+-+.|+.+.+..+++-+.|++. -+.+-+++|. .+-+.+|||.+ T Consensus 234 erglqteaqvek~i~EfdiEre~LRAel~ree~r~K~lKeE---meSLkeiVkdlEA~hQh~~pNeqL 298 (561) T KOG1103 234 ERGLQTEAQVEKLIEEFDIEREFLRAELEREEKRQKMLKEE---MESLKEIVKDLEADHQHLRPNEQL 298 (561) T ss_pred HHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 63230378899999999899999999988777788899999---999999986156666633843003 No 61 >TIGR01242 26Sp45 26S proteasome subunit P45 family; InterPro: IPR005937 Intracellular proteins, including short-lived proteins such as cyclin, Mos, Myc, p53, NF-kappaB, and IkappaB, are degraded by the ubiquitin-proteasome system. The 26S proteasome (a 2 MDa complex) is made up of two subcomplexes: the 20S proteasome and the regulatory complex. The former is a 700 kDa cylindrical protease complex consisting of four stacks of heptameric rings with 28 subunits (i.e., 7777) with molecular masses of about 20-35 kDa, whereas the latter is a 700-1000 kDa complex consisting of at least 18 subunits with molecular masses of 28-110 kDa, including 6 putative ATPases (Rpt1-Rpt6) and 12 non-ATPase subunits (Rpn1-12). Members of the 26S proteasome subunit P45 family: ATPase p45/Sug1/Rpt6 may be phosphorylated within the proteasome. This phosphorylation event may play a key role in ATP-dependent proteolysis because a good correlation exists between the inhibition pattern of protein kinase inhibitors against the phosphorylation of p45 and that against the ATP-dependent proteolytic activity , . More information about these protein can be found at Protein of the Month: AAA ATPases .; GO: 0016787 hydrolase activity, 0030163 protein catabolic process, 0005634 nucleus, 0005737 cytoplasm. Probab=42.55 E-value=30 Score=16.43 Aligned_cols=37 Identities=16% Similarity=0.153 Sum_probs=31.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 9999999999999999999999999999999950279 Q gi|254780671|r 39 KANKSLEKSLIERERFLSELKENRSRLERKVKLMSDG 75 (105) Q Consensus 39 ~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~ 75 (105) ..+..|+.+...++.+-..++.|...|..|+++|++. T Consensus 6 ~~ir~Le~~~~~le~e~~~l~~E~~~lr~E~~rlr~P 42 (364) T TIGR01242 6 ERIRKLEDEKKLLEKEKIRLERELERLRSEIERLRSP 42 (364) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 9999888888878877778999999999999984589 No 62 >smart00338 BRLZ basic region leucin zipper. Probab=42.03 E-value=30 Score=16.38 Aligned_cols=34 Identities=18% Similarity=0.347 Sum_probs=22.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999999999999999999995027 Q gi|254780671|r 41 NKSLEKSLIERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 41 ~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) ....+..+..++.++..|..+|..|..++..|+. T Consensus 21 R~rKk~~~~~Le~~~~~L~~en~~L~~~~~~L~~ 54 (65) T smart00338 21 RERKKAEIEELERKVEQLEAENERLKKEIERLRR 54 (65) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999999999999999999 No 63 >COG3883 Uncharacterized protein conserved in bacteria [Function unknown] Probab=41.00 E-value=31 Score=16.28 Aligned_cols=10 Identities=40% Similarity=0.621 Sum_probs=4.6 Q ss_pred HHHHHHHHHH Q ss_conf 9999999998 Q gi|254780671|r 80 DLLDEKARYS 89 (105) Q Consensus 80 d~lEe~aR~~ 89 (105) +.++++||.. T Consensus 101 ~~l~~raRAm 110 (265) T COG3883 101 ELLKKRARAM 110 (265) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 64 >PRK11415 hypothetical protein; Provisional Probab=40.87 E-value=32 Score=16.27 Aligned_cols=54 Identities=11% Similarity=0.115 Sum_probs=42.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC--CHHHHHHHHHHHHCC Q ss_conf 999999999999999999999999999999995027998--989999999998098 Q gi|254780671|r 39 KANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSL--EKDLLDEKARYSLNL 92 (105) Q Consensus 39 ~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~--D~d~lEe~aR~~Lg~ 92 (105) ..|+++-.++....+.-..|=.+-++|.++|..|..++. .++.+|..-+++|.+ T Consensus 3 pEyRdLIs~LK~~d~hF~~LF~kHneLD~eI~~lE~~~~~~~~~eve~LKkeKL~L 58 (74) T PRK11415 3 PEYRDLISRLKNENPRFMSLFDKHNKLDHEIARKEGSDGRGYNAEVVRMKKQKLQL 58 (74) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH T ss_conf 07899999985138689999999878889999985788899828999999999852 No 65 >pfam06103 DUF948 Bacterial protein of unknown function (DUF948). This family consists of bacterial sequences several of which are thought to be general stress proteins. Probab=40.87 E-value=32 Score=16.27 Aligned_cols=58 Identities=17% Similarity=0.266 Sum_probs=33.5 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 999999999999999998372039999999999999999999999999999999995027 Q gi|254780671|r 15 IFLVIAFCCVVYFTNHAIVGDYGLKANKSLEKSLIERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 15 ~~~~~~~~~~~~f~~~~~~g~~Gl~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) ...++++++++.|.....-. --....++++-+..++++.+.+..|-+.|-++.+.|-+ T Consensus 4 ~i~AiAf~vLvi~l~~~L~~--~~~tl~~~~~Ti~~l~~~vd~i~~et~~lL~ktN~L~~ 61 (90) T pfam06103 4 LIIAIAFLVLVIFLIKTLKS--LSKTLDEVNKTLAGLTKQVDGITTETEDLLAKTNVLLE 61 (90) T ss_pred HHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999--99999999999999998789899999999998888999 No 66 >pfam12269 zf-CpG_bind_C CpG binding protein zinc finger C terminal domain. This domain family is found in eukaryotes, and is approximately 240 amino acids in length. This domain is the zinc finger domain of a CpG binding DNA methyltransferase protein. It contains a CxxC motif which forms the zinc finger and binds to DNA. Probab=38.06 E-value=35 Score=16.00 Aligned_cols=49 Identities=18% Similarity=0.242 Sum_probs=42.6 Q ss_pred HHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH Q ss_conf 8372039999999999999999999999999999999995027998989 Q gi|254780671|r 32 IVGDYGLKANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKD 80 (105) Q Consensus 32 ~~g~~Gl~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d 80 (105) +.+++|...+-++.++....+..|++|...-..|+.-|.+.++-.+|++ T Consensus 22 ~A~e~~~~~Le~Ir~kq~~~~~~L~eLe~~~~eL~~iI~rak~~~~~~~ 70 (236) T pfam12269 22 VAEEIGKKALERIRKKQQVARTRLQELERRHQELETIISRAKRLTVDEN 70 (236) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 4244557999999999999999999999999999999998862245433 No 67 >pfam07926 TPR_MLP1_2 TPR/MLP1/MLP2-like protein. The sequences featured in this family are similar to a region of human TPR protein and to yeast myosin-like proteins 1 (MLP1) and 2 (MLP2). These proteins share a number of features; for example, they all have coiled-coil regions and all three are associated with nuclear pores. TPR is thought to be a component of nuclear pore complex- attached intra-nuclear filaments, and is implicated in nuclear protein import. Moreover, its N-terminal region is involved in the activation of oncogenic kinases, possibly by mediating the dimerization of kinase domains or by targeting these kinases to the nuclear pore complex. MLP1 and MLP2 are involved in the process of telomere length regulation, where they are thought to interact with proteins such as Tel1p and modulate their activity. Probab=37.71 E-value=35 Score=15.96 Aligned_cols=29 Identities=34% Similarity=0.525 Sum_probs=12.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999950 Q gi|254780671|r 44 LEKSLIERERFLSELKENRSRLERKVKLM 72 (105) Q Consensus 44 l~~~~~~~~~~l~~l~~~~~~L~~~i~~L 72 (105) +..++..++..+..|..+|..|...|..| T Consensus 103 le~E~~~l~~r~edL~~QN~lLh~Qie~l 131 (132) T pfam07926 103 LEDELSELEKRIEELNEQNKLLHDQIELL 131 (132) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 99999999988999999999999997645 No 68 >PRK13169 DNA replication intiation control protein YabA; Reviewed Probab=37.03 E-value=36 Score=15.90 Aligned_cols=35 Identities=17% Similarity=0.217 Sum_probs=22.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 99999999999999999999999999999999502 Q gi|254780671|r 39 KANKSLEKSLIERERFLSELKENRSRLERKVKLMS 73 (105) Q Consensus 39 ~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~ 73 (105) ....+++.++..+-.+++.|+-+|..|...+..+. T Consensus 22 ~e~~~LK~~~~~l~EEN~~L~iEN~~Lre~l~~~~ 56 (107) T PRK13169 22 KELGALKQQLAELLEENTALRLENEKLRERLEELE 56 (107) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999988999999999999999875 No 69 >TIGR01000 bacteriocin_acc bacteriocin secretion accessory protein; InterPro: IPR005696 Bacteriocins are antibacterial proteinaceous compounds produced by bacteria. In Gram-positive bacteria, they are divided into four classes. Within the class II, constituted by non-modified peptides produced mainly by lactic acid bacteria, bacteriocins of the subclass IIa, such as mesentericin Y105, are of particular interest. They are active against the foodborne pathogen Listeria monocytogenes and share a similar primary structure, with a conserved N-terminal motif (YGNGV). Subclass IIa bacteriocins induce membrane permeabilization of sensitive strains, but their target specificity and their molecular mode of action remain elusive. This family of proteins is involved in the secretion of such bacteriocins although the mechanism of this process is not well understood. ; GO: 0005215 transporter activity, 0006810 transport. Probab=36.96 E-value=30 Score=16.36 Aligned_cols=30 Identities=23% Similarity=0.366 Sum_probs=13.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 999999999999999999999999950279 Q gi|254780671|r 46 KSLIERERFLSELKENRSRLERKVKLMSDG 75 (105) Q Consensus 46 ~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~ 75 (105) .+......++..|..+...|+.-+..+.+| T Consensus 98 ~~~~~~~~q~~~L~~q~~~L~~l~~Si~~g 127 (476) T TIGR01000 98 NQKQELEQQLDNLKDQKKSLDTLKESIENG 127 (476) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 789999999999999999999999864348 No 70 >pfam06156 DUF972 Protein of unknown function (DUF972). This family consists of several hypothetical bacterial sequences. The function of this family is unknown. Probab=36.57 E-value=37 Score=15.85 Aligned_cols=33 Identities=18% Similarity=0.199 Sum_probs=19.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999999999950 Q gi|254780671|r 40 ANKSLEKSLIERERFLSELKENRSRLERKVKLM 72 (105) Q Consensus 40 ~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L 72 (105) ...+++.++..+-.+++.|+-+|..|..++..+ T Consensus 23 el~~LK~~~~~l~EEN~~L~iEN~~Lr~~l~~~ 55 (106) T pfam06156 23 ELGELKQQLAELLEENAELRIENEHLRERLEEL 55 (106) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999998899999999999999988 No 71 >PRK03784 vtamin B12-transporter permease; Provisional Probab=36.31 E-value=37 Score=15.83 Aligned_cols=36 Identities=11% Similarity=0.065 Sum_probs=18.1 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHH Q ss_conf 249898677999999999999999999998372039 Q gi|254780671|r 3 TKYYKKNHFFRAIFLVIAFCCVVYFTNHAIVGDYGL 38 (105) Q Consensus 3 tk~~r~~~~~~~~~~~~~~~~~~~f~~~~~~g~~Gl 38 (105) |-.++|++..+..+.++.+++++.+..++..|+..+ T Consensus 4 ~~~~~~~rr~~~~l~~L~~ll~~~~~~sL~~G~~~i 39 (330) T PRK03784 4 TLARQQQRRNRRWLLCLSVLLLLALLLSLCAGEQWI 39 (330) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 799998765889999999999999999996388137 No 72 >cd07429 Cby_like Chibby, a nuclear inhibitor of Wnt/beta-catenin mediated transcription, and similar proteins. Chibby(Cby) is a well-conserved nuclear protein that functions as part of the Wnt/beta-catenin signaling pathway. Specifically, Cby binds directly to beta-catenin by interacting with its central region, which harbors armadillo repeats. Cby-beta-catenin interactions may also involve 14-3-3 proteins. By competing with other binding partners of beta-catenin, the Tcf/Lef transcription factors, Cby inhibits transcriptional activation. Cby has been shown to play a role in adipocyte differentiation. The C-terminal region of Cby appears to contain an alpha-helical coiled-coil motif. Probab=36.09 E-value=37 Score=15.81 Aligned_cols=34 Identities=21% Similarity=0.411 Sum_probs=25.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH Q ss_conf 9999999999999999999999999502799898999999 Q gi|254780671|r 46 KSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEK 85 (105) Q Consensus 46 ~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~ 85 (105) .+...++.+...|++||+.|.-+|+.|- |++.+. T Consensus 72 ~e~~rlkkk~~~LeEENNlLklKievLl------DMLtet 105 (108) T cd07429 72 REVLRLKKKNQQLEEENNLLKLKIEVLL------DMLAET 105 (108) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHH T ss_conf 7888888898999998869999999999------998887 No 73 >PTZ00234 variable surface protein Vir12; Provisional Probab=35.64 E-value=6.3 Score=20.52 Aligned_cols=63 Identities=16% Similarity=0.173 Sum_probs=38.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89867799999999999999999999837203999999---------99999999999999999999999999 Q gi|254780671|r 6 YKKNHFFRAIFLVIAFCCVVYFTNHAIVGDYGLKANKS---------LEKSLIERERFLSELKENRSRLERKV 69 (105) Q Consensus 6 ~r~~~~~~~~~~~~~~~~~~~f~~~~~~g~~Gl~~~~~---------l~~~~~~~~~~l~~l~~~~~~L~~~i 69 (105) ...+.++|.++++++++-+++|.|..--+ .|+-.... -.+-.++.++||..-..|++.|+.+. T Consensus 358 ~LdSNffRNIIMaaAILGtIfFLFYYN~S-S~lkS~~~KrKrKKk~~ehNYYeEYEkEL~~Y~Se~~~ldSq~ 429 (433) T PTZ00234 358 ILKSEYFRHSIVGASIIGVLVFLFFFFKS-TPIRSQTNKGEKKKRKPQNNYYDEYEEELPRYESQQSLAESQM 429 (433) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHEECC-CCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCHHCCCCCCC T ss_conf 00018999999999999999998630036-6643457876666676543238999876035652000001100 No 74 >pfam05103 DivIVA DivIVA protein. The Bacillus subtilis divIVA1 mutation causes misplacement of the septum during cell division, resulting in the formation of small, circular, anucleate mini-cells. Inactivation of divIVA produces a mini-cell phenotype, whereas overproduction of DivIVA results in a filamentation phenotype. These proteins appear to contain coiled-coils. Probab=35.42 E-value=38 Score=15.74 Aligned_cols=35 Identities=9% Similarity=0.169 Sum_probs=16.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 99999999999999999999999999999995027 Q gi|254780671|r 40 ANKSLEKSLIERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 40 ~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) ...++..-+.....++..|..+|..|..++..|.. T Consensus 19 d~~EVD~FLd~va~e~e~l~~en~~L~~~~~~l~~ 53 (131) T pfam05103 19 DPDEVDEFLDQVIKDYEALYKENEELKEEIERLEE 53 (131) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999999999999999 No 75 >pfam06447 consensus Probab=35.37 E-value=39 Score=15.73 Aligned_cols=22 Identities=9% Similarity=0.246 Sum_probs=8.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999950 Q gi|254780671|r 51 RERFLSELKENRSRLERKVKLM 72 (105) Q Consensus 51 ~~~~l~~l~~~~~~L~~~i~~L 72 (105) .+++++.|.+++.+|.++++.+ T Consensus 88 ~q~~l~~l~qe~~~L~~q~~~~ 109 (407) T pfam06447 88 NQERIASLEQENQQLREQLAAG 109 (407) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 8999999999999999999988 No 76 >TIGR00393 kpsF sugar isomerase, KpsF/GutQ family; InterPro: IPR004800 This is a family of closely related proteins with the phosphosugar-binding domain SIS (Sugar ISomerase) followed by two copies of the CBS (named after Cystathionine Beta Synthase) domain. The group includes GutQ, a protein of the glucitol operon and KpsF, a virulence factor involved in capsular polysialic acid biosynthesis in some pathogenic strains of Escherichia coli.; GO: 0016853 isomerase activity, 0005975 carbohydrate metabolic process. Probab=34.59 E-value=24 Score=17.04 Aligned_cols=21 Identities=19% Similarity=0.309 Sum_probs=16.0 Q ss_pred HHHHHHHCCCCCCCEEEEECC Q ss_conf 999998098789868767527 Q gi|254780671|r 84 EKARYSLNLSRSDEIILFYSD 104 (105) Q Consensus 84 e~aR~~Lg~~kp~E~iii~~d 104 (105) |-+.-.|||++|.++++..+. T Consensus 36 EA~HGDLGMv~~~Dvvl~ISY 56 (272) T TIGR00393 36 EALHGDLGMVEPRDVVLLISY 56 (272) T ss_pred HHCCCCCCCCCCCCEEEEEEC T ss_conf 450364357586747999876 No 77 >pfam11855 DUF3375 Protein of unknown function (DUF3375). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 479 to 499 amino acids in length. Probab=34.53 E-value=40 Score=15.65 Aligned_cols=42 Identities=26% Similarity=0.427 Sum_probs=31.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCC---CCHHHHHHHHHHHHCC Q ss_conf 99999999999999999999502799---8989999999998098 Q gi|254780671|r 51 RERFLSELKENRSRLERKVKLMSDGS---LEKDLLDEKARYSLNL 92 (105) Q Consensus 51 ~~~~l~~l~~~~~~L~~~i~~L~~~~---~D~d~lEe~aR~~Lg~ 92 (105) -...++.|+.++.+++++|.++..|- +|++-+.+.+|.-+.+ T Consensus 139 p~~Ri~~L~~e~~~i~~EI~~l~aG~v~~ld~~q~~er~~~i~~l 183 (471) T pfam11855 139 PEARLAELEREIDELDAEIDRLEAGDVPLLDDTQVRERFRQILDL 183 (471) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHH T ss_conf 999999999999999999999877998889878999999999999 No 78 >pfam06936 Selenoprotein_S Selenoprotein S (SelS). This family consists of several mammalian selenoprotein S (SelS) sequences. SelS is a plasma membrane protein and is present in a variety of tissues and cell types. The function of this family is unknown. Probab=33.92 E-value=41 Score=15.59 Aligned_cols=18 Identities=11% Similarity=0.379 Sum_probs=10.8 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999983 Q gi|254780671|r 16 FLVIAFCCVVYFTNHAIV 33 (105) Q Consensus 16 ~~~~~~~~~~~f~~~~~~ 33 (105) .++++.++++|+++.-+. T Consensus 38 WyiL~~~v~ly~l~qkL~ 55 (190) T pfam06936 38 WYLLLGCVGIYLLIQKLR 55 (190) T ss_pred HHHHHHHHHHHHHHHHHH T ss_conf 799999999999999871 No 79 >KOG2895 consensus Probab=32.78 E-value=34 Score=16.06 Aligned_cols=43 Identities=12% Similarity=0.013 Sum_probs=21.8 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999998372039999999999999999999999999999999 Q gi|254780671|r 21 FCCVVYFTNHAIVGDYGLKANKSLEKSLIERERFLSELKENRSRLERK 68 (105) Q Consensus 21 ~~~~~~f~~~~~~g~~Gl~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~ 68 (105) +++++.|+++.|.|..=++.-... ..++|.+.++++-+..+.. T Consensus 346 ~Fl~l~f~wavwNGasyyidv~gr-----r~~~e~e~l~q~lqp~~~~ 388 (408) T KOG2895 346 IFLILKFSWAVWNGASYYIDVMGR-----RVIQEEEKLKQELQPIEEQ 388 (408) T ss_pred HHHHHHHHHHHCCCCEEEHHHHHH-----HHHHHHHHHHHHCCCCHHH T ss_conf 999999888511783665465348-----9999999998763897464 No 80 >pfam05833 FbpA Fibronectin-binding protein A N-terminus (FbpA). This family consists of the N-terminal region of the prokaryotic fibronectin-binding protein. Fibronectin binding is considered to be an important virulence factor in streptococcal infections. Fibronectin is a dimeric glycoprotein that is present in a soluble form in plasma and extracellular fluids; it is also present in a fibrillar form on cell surfaces. Both the soluble and cellular forms of fibronectin may be incorporated into the extracellular tissue matrix. While fibronectin has critical roles in eukaryotic cellular processes, such as adhesion, migration and differentiation, it is also a substrate for the attachment of bacteria. The binding of pathogenic Streptococcus pyogenes and Staphylococcus aureus to epithelial cells via fibronectin facilitates their internalisation and systemic spread within the host. Probab=32.67 E-value=43 Score=15.46 Aligned_cols=45 Identities=22% Similarity=0.212 Sum_probs=21.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH Q ss_conf 9999999999999999999999999999999502799898999999 Q gi|254780671|r 40 ANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEK 85 (105) Q Consensus 40 ~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~ 85 (105) +|.++++....++.+++..+++.+.|+.-...|.. ..|++-++++ T Consensus 380 kykK~K~~~~~~~~~l~~~~~el~yLe~~~~~l~~-a~~~~~l~eI 424 (447) T pfam05833 380 KYKKLKRAVEAVEEQIEETKEEIEYLESVEAQLEN-AESLEDLEEI 424 (447) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-CCCHHHHHHH T ss_conf 99999888999999999999999999999999985-6999999999 No 81 >pfam07047 OPA3 Optic atrophy 3 protein (OPA3). This family consists of several optic atrophy 3 (OPA3) proteins. OPA3 deficiency causes type III 3-methylglutaconic aciduria (MGA) in humans. This disease manifests with early bilateral optic atrophy, spasticity, extrapyramidal dysfunction, ataxia, and cognitive deficits, but normal longevity. Probab=31.34 E-value=45 Score=15.33 Aligned_cols=27 Identities=26% Similarity=0.212 Sum_probs=13.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 999999999999999999999999502 Q gi|254780671|r 47 SLIERERFLSELKENRSRLERKVKLMS 73 (105) Q Consensus 47 ~~~~~~~~l~~l~~~~~~L~~~i~~L~ 73 (105) ..+..+++++.|+.+...|+.+++.++ T Consensus 106 Kee~~~~~l~~L~~~i~~L~~~~e~~~ 132 (134) T pfam07047 106 KEEELQQELEELEQRVEELELEVERQK 132 (134) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 899999999999999999999999885 No 82 >COG5278 Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms] Probab=31.05 E-value=45 Score=15.30 Aligned_cols=23 Identities=17% Similarity=0.369 Sum_probs=8.4 Q ss_pred HHHHHHHHHHHHCCCCCC-HHHHH Q ss_conf 999999999950279989-89999 Q gi|254780671|r 61 NRSRLERKVKLMSDGSLE-KDLLD 83 (105) Q Consensus 61 ~~~~L~~~i~~L~~~~~D-~d~lE 83 (105) .++.+..+++.|+.-..| |++.+ T Consensus 85 a~~~~~~~~~~l~~l~~d~p~~~~ 108 (207) T COG5278 85 ATEELDQKLEELRALTADDPELLE 108 (207) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHH T ss_conf 999999999999876389889999 No 83 >TIGR01367 pyrE_Therm orotate phosphoribosyltransferase; InterPro: IPR006273 This group of sequences are a distinct clade of orotate phosphoribosyltransferases. Members include the experimentally determined example from Thermus aquaticus and additional examples from Caulobacter crescentus, Helicobacter pylori (Campylobacter pylori), Mesorhizobium loti, and related species. ; GO: 0004588 orotate phosphoribosyltransferase activity, 0019856 pyrimidine base biosynthetic process. Probab=30.33 E-value=27 Score=16.65 Aligned_cols=13 Identities=15% Similarity=0.250 Sum_probs=10.8 Q ss_pred CCCCCCEEEEECC Q ss_conf 8789868767527 Q gi|254780671|r 92 LSRSDEIILFYSD 104 (105) Q Consensus 92 ~~kp~E~iii~~d 104 (105) .++|||.++++.| T Consensus 107 ~v~pGek~v~vED 119 (205) T TIGR01367 107 EVKPGEKVVVVED 119 (205) T ss_pred EECCCCEEEEEEE T ss_conf 3369977999962 No 84 >PRK10998 malG maltose transporter permease; Provisional Probab=30.18 E-value=47 Score=15.21 Aligned_cols=28 Identities=14% Similarity=0.120 Sum_probs=14.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9624989867799999999999999999 Q gi|254780671|r 1 MWTKYYKKNHFFRAIFLVIAFCCVVYFT 28 (105) Q Consensus 1 m~tk~~r~~~~~~~~~~~~~~~~~~~f~ 28 (105) |+..++|++++.+++..++++++.+++. T Consensus 1 m~~~~~k~~~~~~~~~y~~li~~~~~~l 28 (296) T PRK10998 1 MAMVQPKSQKLRLFITHLLLLCFIAAIM 28 (296) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 9866642167999999999999999999 No 85 >PRK12705 hypothetical protein; Provisional Probab=29.63 E-value=48 Score=15.15 Aligned_cols=14 Identities=21% Similarity=0.230 Sum_probs=5.0 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 99999999999999 Q gi|254780671|r 49 IERERFLSELKENR 62 (105) Q Consensus 49 ~~~~~~l~~l~~~~ 62 (105) .+..++...+++++ T Consensus 49 ~~a~ke~~~~r~e~ 62 (485) T PRK12705 49 ANAYKKAKDLKEQN 62 (485) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999999999999 No 86 >KOG4643 consensus Probab=29.06 E-value=49 Score=15.09 Aligned_cols=34 Identities=24% Similarity=0.387 Sum_probs=14.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999999999999999999995027 Q gi|254780671|r 41 NKSLEKSLIERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 41 ~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) |-.+.++...+.+.+..++.+|+.|..+|..|+. T Consensus 525 ye~~~~k~eeLe~~l~~lE~ENa~LlkqI~~Lk~ 558 (1195) T KOG4643 525 YELLSNKLEELEELLGNLEEENAHLLKQIQSLKT 558 (1195) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9998779999999876478777999999999998 No 87 >pfam03131 bZIP_Maf bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170. Probab=28.25 E-value=51 Score=15.00 Aligned_cols=37 Identities=22% Similarity=0.373 Sum_probs=20.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999999999999999999999995027 Q gi|254780671|r 38 LKANKSLEKSLIERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 38 l~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) +.....+..+...+..+++.|..+++.+..+.+.++. T Consensus 52 ~~q~~~LE~e~~~L~~e~e~Lk~E~~~~~~E~d~~k~ 88 (95) T pfam03131 52 VQQRHELEKEKSQLQQQVEQLKQENARLRRERDALKA 88 (95) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999999999999999999999 No 88 >PRK09458 pspB phage shock protein B; Provisional Probab=27.30 E-value=53 Score=14.90 Aligned_cols=37 Identities=14% Similarity=0.259 Sum_probs=23.6 Q ss_pred HHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 99999999999----99999999999999999999995027 Q gi|254780671|r 38 LKANKSLEKSL----IERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 38 l~~~~~l~~~~----~~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) ++.|....+.- ++-++.+++|...-+.++++|.-|.+ T Consensus 23 iLHY~sk~k~~~~Ls~~d~~~L~~L~~~A~~m~eRI~tLE~ 63 (75) T PRK09458 23 WLHYRSKRQVSQGLSQEEQQRLAQLTEKAEKMRERIQTLEA 63 (75) T ss_pred HHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99866240246799999999999999999999999999999 No 89 >COG4499 Predicted membrane protein [Function unknown] Probab=27.12 E-value=53 Score=14.88 Aligned_cols=74 Identities=16% Similarity=0.273 Sum_probs=38.5 Q ss_pred HHHHHHHHHH---HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH Q ss_conf 8986779999---9999999999999999837203999999999999999999999999999999999502799898999 Q gi|254780671|r 6 YKKNHFFRAI---FLVIAFCCVVYFTNHAIVGDYGLKANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLL 82 (105) Q Consensus 6 ~r~~~~~~~~---~~~~~~~~~~~f~~~~~~g~~Gl~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~l 82 (105) +++..+++++ +.++++.+++||.|..++- ..++.+-..+.-+-| +.....-|.-|.+ .||.-+ T Consensus 214 K~k~~ifk~~giGliillvl~li~~~Y~~f~~---------~p~qeai~~a~~aFL---~~nY~qVittLe~--ydp~kl 279 (434) T COG4499 214 KKKYTIFKYFGIGLIILLVLLLIYFTYYYFSN---------QPKQEAIITANTAFL---KNNYDQVITTLEN--YDPEKL 279 (434) T ss_pred CCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHC---------CHHHHHHHHHHHHHH---HCCHHHHHHHCCC--CCHHHC T ss_conf 55511114688869999999999999999971---------726888998899998---5358988630234--891237 Q ss_pred HHHHHHHHCCC Q ss_conf 99999980987 Q gi|254780671|r 83 DEKARYSLNLS 93 (105) Q Consensus 83 Ee~aR~~Lg~~ 93 (105) .+-|+..|-.+ T Consensus 280 Pksv~Y~LA~S 290 (434) T COG4499 280 PKSVQYILAVS 290 (434) T ss_pred CHHHHHHHHHH T ss_conf 68889999887 No 90 >pfam05384 DegS Sensor protein DegS. This is small family of Bacillus DegS proteins. The DegS-DegU two-component regulatory system of Bacillus subtilis controls various processes that characterize the transition from the exponential to the stationary growth phase, including the induction of extracellular degradative enzymes, expression of late competence genes and down-regulation of the sigma D regulon. The family also contains one sequence from Thermoanaerobacter tengcongensis which is described as a sensory transduction histidine kinase. Probab=26.98 E-value=53 Score=14.87 Aligned_cols=39 Identities=26% Similarity=0.304 Sum_probs=18.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH Q ss_conf 9999999999999999999999999950279989899999999980 Q gi|254780671|r 45 EKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARYSL 90 (105) Q Consensus 45 ~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~~L 90 (105) .++...++.+|..++.+....-.+++.|. ..+..||.+| T Consensus 26 R~E~~~l~~EL~evk~~v~~vI~evD~Le-------~~~r~aR~RL 64 (159) T pfam05384 26 RQEYERLEQELEEVKEEVSEVIKEVDKLE-------KQERRARKRL 64 (159) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHH T ss_conf 99999999999999999999999999999-------9999999999 No 91 >pfam10224 DUF2205 Predicted coiled-coil protein (DUF2205). This entry represent a highly conserved 100 residue region which is likely to be a coiled-coil structure. The exact function is unknown. Probab=26.82 E-value=54 Score=14.85 Aligned_cols=40 Identities=18% Similarity=0.326 Sum_probs=19.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH Q ss_conf 9999999999999999999999999999502799898999999999 Q gi|254780671|r 43 SLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARY 88 (105) Q Consensus 43 ~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~ 88 (105) .++..+..+-.++..++.++..|+.+-.-|+ +||+...+. T Consensus 27 ~LQ~sL~~l~~Rid~VKeE~dKL~sEN~fLq------~YIgnLms~ 66 (81) T pfam10224 27 ELQDSLQDLSERVDAVKEENDKLESENQFLQ------QYIENLMSA 66 (81) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHH T ss_conf 9999999999999999999998884508899------999999874 No 92 >pfam06667 PspB Phage shock protein B. This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one. Probab=26.52 E-value=54 Score=14.81 Aligned_cols=37 Identities=24% Similarity=0.323 Sum_probs=23.5 Q ss_pred HHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 99999999999----99999999999999999999995027 Q gi|254780671|r 38 LKANKSLEKSL----IERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 38 l~~~~~l~~~~----~~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) ++.|....+.- ++-++.+++|.+.-+.++++|.-|.+ T Consensus 23 ~LHYrsk~k~~~~Ls~~d~~~L~~L~~~A~rm~~RI~tLE~ 63 (75) T pfam06667 23 ILHYRSKWKVSQGLSEEDEQLLEELLETAEKLQERIQTLER 63 (75) T ss_pred HHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99656023467789999999999999999999999999999 No 93 >pfam11285 DUF3086 Protein of unknown function (DUF3086). This family of proteins with unknown function appears to be restricted to Cyanobacteria. Probab=26.33 E-value=55 Score=14.79 Aligned_cols=37 Identities=22% Similarity=0.300 Sum_probs=29.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 9999999999999999999999999999999950279 Q gi|254780671|r 39 KANKSLEKSLIERERFLSELKENRSRLERKVKLMSDG 75 (105) Q Consensus 39 ~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~ 75 (105) .++.++.++...++.+++.|+..++.+++++..==.| T Consensus 4 ~aL~eL~qrr~aL~~eIe~LErRk~rie~EmrtsFaG 40 (283) T pfam11285 4 EALKDLEQRRQALEIEIEKLERRKEQIEQELRTSFAG 40 (283) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 8999999999999999999999999999999722465 No 94 >pfam06810 Phage_GP20 Phage minor structural protein GP20. This family consists of several phage minor structural protein GP20 sequences of around 180 residues in length. The function of this family is unknown. Probab=25.99 E-value=56 Score=14.75 Aligned_cols=35 Identities=26% Similarity=0.531 Sum_probs=21.2 Q ss_pred HHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHCCC Q ss_conf 99999999999999999999---999999999950279 Q gi|254780671|r 41 NKSLEKSLIERERFLSELKE---NRSRLERKVKLMSDG 75 (105) Q Consensus 41 ~~~l~~~~~~~~~~l~~l~~---~~~~L~~~i~~L~~~ 75 (105) ...++.|+......|..|.. .|+.|..+|..|+.. T Consensus 29 ~~~lk~Ql~~rd~ql~~Lk~~~~d~e~L~~~ie~lq~~ 66 (156) T pfam06810 29 RDSLKKQLKNRDKQIEDLKKKVKDNEELQKKIEKLKQQ 66 (156) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 99999999999988999887158889999999999999 No 95 >PRK11091 aerobic respiration control sensor protein ArcB; Provisional Probab=25.44 E-value=57 Score=14.69 Aligned_cols=13 Identities=8% Similarity=0.314 Sum_probs=7.0 Q ss_pred HHHHHHHHHHHCC Q ss_conf 9999999995027 Q gi|254780671|r 62 RSRLERKVKLMSD 74 (105) Q Consensus 62 ~~~L~~~i~~L~~ 74 (105) |..|+..|.+|+. T Consensus 105 n~~l~~ni~~ln~ 117 (779) T PRK11091 105 NVQLKDNIAQLNQ 117 (779) T ss_pred HHHHHHHHHHHHH T ss_conf 8888877888667 No 96 >pfam05812 Herpes_BLRF2 Herpesvirus BLRF2 protein. This family consists of several Herpesvirus BLRF2 proteins. The family also contains the C terminal region of hypothetical human and mouse sequences, which align with the N terminus of the viral sequences. Probab=25.44 E-value=57 Score=14.69 Aligned_cols=27 Identities=19% Similarity=0.233 Sum_probs=22.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 999999999999999999999999502 Q gi|254780671|r 47 SLIERERFLSELKENRSRLERKVKLMS 73 (105) Q Consensus 47 ~~~~~~~~l~~l~~~~~~L~~~i~~L~ 73 (105) ..+++.+++..|+-||..|..++..=. T Consensus 4 t~EeLaaeL~kL~mENk~LKkkl~~~~ 30 (118) T pfam05812 4 TVEELAAELQKLQMENKALKKKLRQSV 30 (118) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 599999999999987899999998258 No 97 >TIGR02351 thiH thiazole biosynthesis protein ThiH; InterPro: IPR012726 Members of this protein family are the ThiH proteins involved in thiamine biosynthesis. They are homologues of the BioB protein of biotin biosynthesis. Genes encoding these proteins are generally found in operons with other thiamin biosynthesis genes. ; GO: 0005506 iron ion binding, 0051539 4 iron 4 sulfur cluster binding, 0009228 thiamin biosynthetic process. Probab=25.35 E-value=46 Score=15.25 Aligned_cols=40 Identities=20% Similarity=0.228 Sum_probs=32.0 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHCCCCCCHHHHHHHHHHHH Q ss_conf 9999999999999-99999999950279989899999999980 Q gi|254780671|r 49 IERERFLSELKEN-RSRLERKVKLMSDGSLEKDLLDEKARYSL 90 (105) Q Consensus 49 ~~~~~~l~~l~~~-~~~L~~~i~~L~~~~~D~d~lEe~aR~~L 90 (105) ...+.....|.+. +..++.-+.+|++-. .+|||+.||+-- T Consensus 22 ~~~~DVeraL~~~e~~~~~D~~aLlSPaA--~~YLE~mA~~a~ 62 (378) T TIGR02351 22 FTAADVERALNKREHLSLEDFLALLSPAA--EPYLEEMAQKAK 62 (378) T ss_pred HHHHHHHHHHCCCCCCCHHHHHHHCCCCC--CHHHHHHHHHHH T ss_conf 42899999724576789888863114301--068999999999 No 98 >pfam08285 DPM3 Dolichol-phosphate mannosyltransferase subunit 3 (DPM3). This family corresponds to subunit 3 of dolichol-phosphate mannosyltransferase, an enzyme which generates mannosyl donors for glycosylphosphatidylinositols, N-glycan and protein O- and C-mannosylation. DPM3 is an integral membrane protein and plays a role in stabilising the dolichol-phosphate mannosyl transferase complex. Probab=25.24 E-value=58 Score=14.67 Aligned_cols=48 Identities=17% Similarity=0.127 Sum_probs=20.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHC----CCHHHHHHHHHHHHHHHHHHHH Q ss_conf 67799999999999999999999837----2039999999999999999999 Q gi|254780671|r 9 NHFFRAIFLVIAFCCVVYFTNHAIVG----DYGLKANKSLEKSLIERERFLS 56 (105) Q Consensus 9 ~~~~~~~~~~~~~~~~~~f~~~~~~g----~~Gl~~~~~l~~~~~~~~~~l~ 56 (105) ..+...+.+-..+.+.+|..+.+-+| +.-=.++.++++|+.++++.+. T Consensus 34 ~~ii~~lP~~~lv~fG~Y~l~~vg~~v~tFndcpea~~eL~~eI~eAk~dL~ 85 (91) T pfam08285 34 QEILPLLPFYLLVSFGSYSLGTVGYGVLTFNDCPEAAKELQKEIKEAKADLR 85 (91) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 9999981699999999999999987441168888999999999999999999 No 99 >pfam04415 DUF515 Protein of unknown function (DUF515). Family of hypothetical Archaeal proteins. Probab=25.19 E-value=58 Score=14.66 Aligned_cols=67 Identities=18% Similarity=0.204 Sum_probs=27.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHCC Q ss_conf 498986779999999999999999999983720399999999999999999999999-------99999999995027 Q gi|254780671|r 4 KYYKKNHFFRAIFLVIAFCCVVYFTNHAIVGDYGLKANKSLEKSLIERERFLSELKE-------NRSRLERKVKLMSD 74 (105) Q Consensus 4 k~~r~~~~~~~~~~~~~~~~~~~f~~~~~~g~~Gl~~~~~l~~~~~~~~~~l~~l~~-------~~~~L~~~i~~L~~ 74 (105) |.+|++.+...+++++.++++.+-.|..+ .. .+..++....+..-+++..+=. +...+.++|..=++ T Consensus 26 rerr~riiIga~il~iIIl~~~~~~Y~~~-~~---~~~qELenaK~~kiaevN~~F~ge~~Nd~~K~~~inkI~~AkS 99 (416) T pfam04415 26 KERRKRLIIGALVLSVIILLLVFAIYYYI-EN---KAYQELENAKAAKIAEVNALFAGELANDPQKQAYINKIQAANN 99 (416) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH---HHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCC T ss_conf 23344100689999999999999999998-72---7789998889988999999853667998588999999885269 No 100 >PRK13729 conjugal transfer pilus assembly protein TraB; Provisional Probab=24.78 E-value=59 Score=14.62 Aligned_cols=30 Identities=23% Similarity=0.359 Sum_probs=10.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 999999999999999999999999999502 Q gi|254780671|r 44 LEKSLIERERFLSELKENRSRLERKVKLMS 73 (105) Q Consensus 44 l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~ 73 (105) +++++..++.+++.+.++....++++..|. T Consensus 81 lek~l~~l~~~l~~~~~~~~~~q~~l~~l~ 110 (474) T PRK13729 81 MQKQYEEIRRELDVLNKQRGDDQRRIEKLG 110 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999876562899999999 No 101 >pfam08614 ATG16 Autophagy protein 16 (ATG16). Autophagy is a ubiquitous intracellular degradation system for eukaryotic cells. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. ATG16 (also known as Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate in the yeast autophagy pathway. Probab=24.49 E-value=59 Score=14.58 Aligned_cols=31 Identities=19% Similarity=0.423 Sum_probs=13.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 9999999999999999999999999999502 Q gi|254780671|r 43 SLEKSLIERERFLSELKENRSRLERKVKLMS 73 (105) Q Consensus 43 ~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~ 73 (105) ++...+......+..+..++..|+.++..+. T Consensus 106 ~l~~~l~~~~~~l~~l~~~~~~L~~~~~~l~ 136 (194) T pfam08614 106 QLRREIQQLEKTIAELRSEITSLETEIRDLR 136 (194) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999889999999999999999999999 No 102 >COG3394 Uncharacterized protein conserved in bacteria [Function unknown] Probab=24.30 E-value=54 Score=14.82 Aligned_cols=21 Identities=19% Similarity=0.172 Sum_probs=19.1 Q ss_pred CHHHHHHHHHHHHCCCCCCCE Q ss_conf 989999999998098789868 Q gi|254780671|r 78 EKDLLDEKARYSLNLSRSDEI 98 (105) Q Consensus 78 D~d~lEe~aR~~Lg~~kp~E~ 98 (105) +|.|+++..|+..+|..|.+. T Consensus 210 hPa~lD~~l~~~s~y~~pR~~ 230 (257) T COG3394 210 HPAFLDETLRDCSSYCEPRLT 230 (257) T ss_pred CHHHHHHHHHHCCCCCCCCCH T ss_conf 689988898851554465530 No 103 >KOG3156 consensus Probab=23.98 E-value=61 Score=14.52 Aligned_cols=40 Identities=13% Similarity=0.301 Sum_probs=27.6 Q ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH Q ss_conf 99999999-9999999999999999999502799898999999999 Q gi|254780671|r 44 LEKSLIER-ERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARY 88 (105) Q Consensus 44 l~~~~~~~-~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~ 88 (105) ++.++... +.+-..++++|+.|.++++++++ ++.+|+-+. T Consensus 106 iRsel~S~e~sEF~~lr~e~EklkndlEk~ks-----~lr~ei~~~ 146 (220) T KOG3156 106 IRSELVSIERSEFANLRAENEKLKNDLEKLKS-----SLRHEISKT 146 (220) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHC T ss_conf 99999988888999999889999989999999-----999998741 No 104 >PRK10549 signal transduction histidine-protein kinase BaeS; Provisional Probab=23.82 E-value=61 Score=14.50 Aligned_cols=15 Identities=13% Similarity=-0.037 Sum_probs=10.0 Q ss_pred HHHCCCHHHHHHHHH Q ss_conf 983720399999999 Q gi|254780671|r 31 AIVGDYGLKANKSLE 45 (105) Q Consensus 31 ~~~g~~Gl~~~~~l~ 45 (105) .|.-++|+..|.+.. T Consensus 32 ~~~f~~~f~~y~~~~ 46 (467) T PRK10549 32 RISFERGFIDYIKHG 46 (467) T ss_pred HHHHHHHHHHHHHHH T ss_conf 999999899999998 No 105 >PRK03147 thiol-disulfide oxidoreductase; Provisional Probab=23.74 E-value=51 Score=15.01 Aligned_cols=29 Identities=28% Similarity=0.423 Sum_probs=16.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 89867799999999999999999999837 Q gi|254780671|r 6 YKKNHFFRAIFLVIAFCCVVYFTNHAIVG 34 (105) Q Consensus 6 ~r~~~~~~~~~~~~~~~~~~~f~~~~~~g 34 (105) ||+..+++.+++++++++++|+.+..... T Consensus 3 kk~~~~~~~~i~~~~~~~~~~~~~~~~~~ 31 (176) T PRK03147 3 KKNRLLFRTIILLILLAAVGYTLYQNFFA 31 (176) T ss_pred CCEEEHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 62043999999999999999998616777 No 106 >PRK08475 F0F1 ATP synthase subunit B; Validated Probab=23.31 E-value=63 Score=14.44 Aligned_cols=61 Identities=26% Similarity=0.256 Sum_probs=32.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999999999983720399-99999999999999999999999999999995027 Q gi|254780671|r 12 FRAIFLVIAFCCVVYFTNHAIVGDYGLK-ANKSLEKSLIERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 12 ~~~~~~~~~~~~~~~f~~~~~~g~~Gl~-~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) -+.+=+++++.++.||+...+-+ .+. +.......+...+..+.+..+.-...+.++...+. T Consensus 29 ~r~iNf~I~~gIL~yf~~kpi~~--~l~~R~~~I~~~L~eae~~~~eA~~~l~~~~~kLe~Ak~ 90 (170) T PRK08475 29 ERTINFLIFVGILWYFAAKPIKN--FYKSRINSISKRLEEIQAKLKESKEKKKDALKKLEEAKE 90 (170) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999992878999--999899999999999999999999999999999999999 No 107 >PRK04335 cell division protein ZipA; Provisional Probab=23.30 E-value=63 Score=14.44 Aligned_cols=22 Identities=14% Similarity=0.324 Sum_probs=15.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999983 Q gi|254780671|r 12 FRAIFLVIAFCCVVYFTNHAIV 33 (105) Q Consensus 12 ~~~~~~~~~~~~~~~f~~~~~~ 33 (105) .+++++++.+++++.+.+|-++ T Consensus 4 LrlILIIvGaIAIiALL~HGlW 25 (319) T PRK04335 4 LRFVLIVVGALAIAALLFHGLW 25 (319) T ss_pred HHHHHHHHHHHHHHHHHHHCCC T ss_conf 4029999999999999984011 No 108 >KOG3584 consensus Probab=23.06 E-value=63 Score=14.41 Aligned_cols=27 Identities=15% Similarity=0.130 Sum_probs=13.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 999999999999999999999999502 Q gi|254780671|r 47 SLIERERFLSELKENRSRLERKVKLMS 73 (105) Q Consensus 47 ~~~~~~~~l~~l~~~~~~L~~~i~~L~ 73 (105) -+.-++.+.+.|+.+|..|-+|.+.|+ T Consensus 313 YVKCLENRVAVLENQNKaLIEELKtLK 339 (348) T KOG3584 313 YVKCLENRVAVLENQNKALIEELKTLK 339 (348) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 999887577887514589999999899 No 109 >cd01570 NAPRTase_A Nicotinate phosphoribosyltransferase (NAPRTase), subgroup A. Nicotinate phosphoribosyltransferase catalyses the formation of NAMN and PPi from 5-phosphoribosy -1-pyrophosphate (PRPP) and nicotinic acid, this is the first, and also rate limiting, reaction in the NAD salvage synthesis. This salvage pathway serves to recycle NAD degradation products. This subgroup is present in bacteria and eukaryota (except funghi). Probab=22.81 E-value=64 Score=14.38 Aligned_cols=34 Identities=18% Similarity=0.106 Sum_probs=20.6 Q ss_pred HHCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECC Q ss_conf 5027998989999999998098789868767527 Q gi|254780671|r 71 LMSDGSLEKDLLDEKARYSLNLSRSDEIILFYSD 104 (105) Q Consensus 71 ~L~~~~~D~d~lEe~aR~~Lg~~kp~E~iii~~d 104 (105) -++.+|=||..+.+.+|+.|--..-+.+-|+++| T Consensus 252 GVR~DSGD~~~~~~~~r~~ld~~G~~~~kIv~Sd 285 (327) T cd01570 252 GVRIDSGDLAYLSKEARKMLDEAGLTKVKIVASN 285 (327) T ss_pred EEEECCCCHHHHHHHHHHHHHHCCCCCCEEEEEC T ss_conf 7984798879999999999987799997899979 No 110 >pfam10205 KLRAQ Predicted coiled-coil domain-containing protein. This is the N-terminal 100 amino acid domain of a family of proteins conserved from nematodes to humans. It carries a characteristic KLRAQ sequence-motif. The function is not known. Probab=22.66 E-value=65 Score=14.36 Aligned_cols=40 Identities=33% Similarity=0.428 Sum_probs=28.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHH Q ss_conf 9999999999999999999999999999995027998989999 Q gi|254780671|r 41 NKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLD 83 (105) Q Consensus 41 ~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lE 83 (105) +....+.+...++|.+.|.=.|.+|..++..|++ ++|.-+ T Consensus 35 Lk~keq~lRk~eqE~DsL~FrNqQL~kRV~~LQ~---el~~~~ 74 (102) T pfam10205 35 LKQKEQSLRKTEQEVDSLGFRNKQLERRVELLQE---ELELSE 74 (102) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHH T ss_conf 9999999999999864452139999999999999---999862 No 111 >KOG3119 consensus Probab=22.51 E-value=65 Score=14.35 Aligned_cols=25 Identities=20% Similarity=0.279 Sum_probs=11.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 9999999999999999999999502 Q gi|254780671|r 49 IERERFLSELKENRSRLERKVKLMS 73 (105) Q Consensus 49 ~~~~~~l~~l~~~~~~L~~~i~~L~ 73 (105) .+...+...|+++|+.|..+|..|+ T Consensus 218 ~e~~~r~~~leken~~lr~~v~~l~ 242 (269) T KOG3119 218 DEMAHRVAELEKENEALRTQVEQLK 242 (269) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999999999 No 112 >TIGR02977 phageshock_pspA phage shock protein A; InterPro: IPR014319 Members of this protein are the phage shock protein PspA, from the phage shock operon. PspA appears to maintain the proton motive force under stress conditions that include over expression of certain phage secretins, heat shock, ethanol and protein export defects.. Probab=21.97 E-value=67 Score=14.28 Aligned_cols=48 Identities=17% Similarity=0.238 Sum_probs=37.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999999950279989899999999980 Q gi|254780671|r 38 LKANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARYSL 90 (105) Q Consensus 38 l~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~~L 90 (105) |.........+..+..++..+......+..+|..|+. -+-|-.||.+. T Consensus 91 L~EK~K~~~~~~~L~~E~~~~~e~l~~~~~di~~Lq~-----KL~eArarQka 138 (222) T TIGR02977 91 LIEKQKAAELAEALEQELAAVEETLAKLQEDIAKLQA-----KLAEARARQKA 138 (222) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHH T ss_conf 9999999999999999999999999998989999999-----98889999999 No 113 >pfam03100 CcmE CcmE. CcmE is the product of one of a cluster of Ccm genes that are necessary for cytochrome c biosynthesis in eubacteria. Expression of these proteins is induced when the organisms are grown under anaerobic conditions with nitrate or nitrite as the final electron acceptor. Probab=21.85 E-value=67 Score=14.26 Aligned_cols=25 Identities=16% Similarity=0.142 Sum_probs=10.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8677999999999999999999998 Q gi|254780671|r 8 KNHFFRAIFLVIAFCCVVYFTNHAI 32 (105) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~f~~~~~ 32 (105) ++.+...++.++++++.+++++.++ T Consensus 5 ~kRl~~i~~~~~~~~~~~~Lil~al 29 (130) T pfam03100 5 KRRLLLVLAALAALGLAVALVLYAL 29 (130) T ss_pred EEEHHHHHHHHHHHHHHHHHHHHHH T ss_conf 2454789999999999999999987 No 114 >pfam09340 NuA4 Histone acetyltransferase subunit NuA4. The NuA4 histone acetyltransferase (HAT) multisubunit complex is responsible for acetylation of histone H4 and H2A N-terminal tails in yeast. NuA4 complexes are highly conserved in eukaryotes and play primary roles in transcription, cellular response to DNA damage, and cell cycle control. Probab=21.58 E-value=68 Score=14.23 Aligned_cols=32 Identities=22% Similarity=0.237 Sum_probs=17.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHH Q ss_conf 9999999999999999999999995027998989999 Q gi|254780671|r 47 SLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLD 83 (105) Q Consensus 47 ~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lE 83 (105) ++.++-..-..|+.+-..||.+|..+.+ +|++ T Consensus 3 ~L~~ll~kK~~Le~~L~~lE~qIY~~Et-----~YL~ 34 (80) T pfam09340 3 KLKELLQKKKKLEKELAALERQIYDKET-----EYLE 34 (80) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHH T ss_conf 8999999999999999999999999999-----9983 No 115 >pfam00952 Bunya_nucleocap Bunyavirus nucleocapsid (N) protein. The bunyaviruses are enveloped viruses with a genome consisting of 3 ssRNA segments (called L, M and S). The nucleocapsid protein is encode on the small (S) genomic RNA. The N protein is the major component of the nucleocapsids. This protein is thought to interact with the L protein, virus RNA and/or other N proteins. Probab=21.48 E-value=41 Score=15.56 Aligned_cols=29 Identities=7% Similarity=0.176 Sum_probs=21.9 Q ss_pred HHHHCCCCCCHHHHHHHHHHHHCCCCCCC Q ss_conf 99502799898999999999809878986 Q gi|254780671|r 69 VKLMSDGSLEKDLLDEKARYSLNLSRSDE 97 (105) Q Consensus 69 i~~L~~~~~D~d~lEe~aR~~Lg~~kp~E 97 (105) |.+-+.+.-||.|+++..|.++|=..+++ T Consensus 159 iyRV~k~~Mdp~fl~K~lRQRYggl~A~~ 187 (228) T pfam00952 159 MYRVQKDGMDVNFLEKVLRQRYGGLTAEQ 187 (228) T ss_pred EEEEHHCCCCHHHHHHHHHHHHCCCCHHH T ss_conf 53202137597999999999847998899 No 116 >PRK06835 DNA replication protein DnaC; Validated Probab=21.19 E-value=69 Score=14.18 Aligned_cols=33 Identities=30% Similarity=0.359 Sum_probs=17.3 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHCCCCCCHHHHHH Q ss_conf 999999999999999999-950279989899999 Q gi|254780671|r 52 ERFLSELKENRSRLERKV-KLMSDGSLEKDLLDE 84 (105) Q Consensus 52 ~~~l~~l~~~~~~L~~~i-~~L~~~~~D~d~lEe 84 (105) +..++.|++++..|..+. .+|.....++||+|. T Consensus 64 ~~~l~~lk~~i~~L~~~k~~LL~~~Gyp~DYLe~ 97 (330) T PRK06835 64 EEELESLKEKITDLRVKKSELLVSNGYPPDYLEM 97 (330) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHCCC T ss_conf 8999999999999999999999985999565578 No 117 >pfam08941 USP8_interact USP8 interacting. This domain interacts with the UBP deubiquitinating enzyme USP8. Probab=21.17 E-value=69 Score=14.18 Aligned_cols=36 Identities=14% Similarity=0.202 Sum_probs=17.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 999999999999999999999999999999950279 Q gi|254780671|r 40 ANKSLEKSLIERERFLSELKENRSRLERKVKLMSDG 75 (105) Q Consensus 40 ~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~ 75 (105) +..+++++....+.++.+.+.|.+-++.=+..|+.. T Consensus 12 ~i~elk~~~~~q~~~ineqkREleLlq~YirAlR~t 47 (179) T pfam08941 12 EIAELKHTQVDHEIQINEQKRELELLKYYIRALRST 47 (179) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 999999999999999999999999999999999707 No 118 >PRK13726 conjugal transfer pilus assembly protein TraE; Provisional Probab=21.14 E-value=63 Score=14.43 Aligned_cols=22 Identities=23% Similarity=0.228 Sum_probs=14.7 Q ss_pred HHCCCCCCHHHHHHHHHHHHCC Q ss_conf 5027998989999999998098 Q gi|254780671|r 71 LMSDGSLEKDLLDEKARYSLNL 92 (105) Q Consensus 71 ~L~~~~~D~d~lEe~aR~~Lg~ 92 (105) -.++++.|++|+|+.+|.-+.+ T Consensus 59 ~vS~~~~d~~YLe~Ma~~~~~L 80 (187) T PRK13726 59 AVSQNSADASYLQQMALSFIAL 80 (187) T ss_pred EEECCCCCHHHHHHHHHHHHHH T ss_conf 9857867989999999999987 No 119 >COG4467 Regulator of replication initiation timing [Replication, recombination, and repair] Probab=20.81 E-value=70 Score=14.13 Aligned_cols=48 Identities=17% Similarity=0.076 Sum_probs=24.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH Q ss_conf 999999999999999999999999999999502799898999999999 Q gi|254780671|r 41 NKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARY 88 (105) Q Consensus 41 ~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~ 88 (105) ...+-+++..+++.+.++-.+|..|.-+-..|+.---+|...+..+-+ T Consensus 17 l~~l~~el~~lK~~l~~lvEEN~~L~lENe~LR~RL~~~~~e~~~~~k 64 (114) T COG4467 17 LGVLLAELGGLKQHLGSLVEENTALRLENEKLRERLGEPTLEKTAVKK 64 (114) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHC T ss_conf 999999999999999999876589886578999985773211124420 No 120 >TIGR01499 folC FolC bifunctional protein; InterPro: IPR001645 Folylpolyglutamate synthase 6.3.2.17 from EC (FPGS) is responsible for the addition of a polyglutamate tail to folate and folate derivatives, is an ATP-dependent enzyme isolated from eukaryotic and bacterial sources, where it plays a key role in the retention of the intracellular folate pool Its sequence is moderately conserved between prokaryotes (gene folC) and eukaryotes. FPGS belongs to a protein family that contains a number of related peptidoglycan synthetases (Mur) 6.3.2. from EC (see). A crystal structure of the MgATP complex of the enzyme from Lactobacillus casei reveals that folylpolyglutamate synthetase is a modular protein consisting of two domains, one with a typical mononucleotide-binding fold and the other strikingly similar to the folate-binding enzyme dihydrofolate reductase. The active site of the enzyme is located in a large interdomain cleft adjacent to an ATP-binding P-loop motif. Opposite this site, in the C domain, a cavity likely to be the folate binding site has been identified, and inspection of this cavity and the surrounding protein structure suggests that the glutamate tail of the substrate may project into the active site. A further feature of the structure is a well defined Omega loop, which contributes both to the active site and to interdomain interactions .; GO: 0004326 tetrahydrofolylpolyglutamate synthase activity, 0005524 ATP binding, 0009396 folic acid and derivative biosynthetic process. Probab=20.70 E-value=53 Score=14.86 Aligned_cols=24 Identities=21% Similarity=0.118 Sum_probs=20.4 Q ss_pred HHHHHHHHHHHCCCCCCCEEEEEC Q ss_conf 999999999809878986876752 Q gi|254780671|r 80 DLLDEKARYSLNLSRSDEIILFYS 103 (105) Q Consensus 80 d~lEe~aR~~Lg~~kp~E~iii~~ 103 (105) |-++++|+++.|-+|||.-+++-. T Consensus 161 ~t~~~IA~eKAGI~k~g~P~v~~~ 184 (460) T TIGR01499 161 DTLEEIAWEKAGIIKEGVPIVTGE 184 (460) T ss_pred CCHHHHHHHHHHHHCCCCCEEEEC T ss_conf 337999997457621676568714 No 121 >pfam05565 Sipho_Gp157 Siphovirus Gp157. This family contains both viral and bacterial proteins which are related to the Gp157 protein of the Streptococcus thermophilus SFi bacteriophages. It is thought that bacteria possessing the gene coding for this protein have an increased resistance to the bacteriophage. Probab=20.47 E-value=72 Score=14.09 Aligned_cols=43 Identities=30% Similarity=0.401 Sum_probs=34.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH Q ss_conf 9999999999999999999999999999999950279989899999999 Q gi|254780671|r 39 KANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKAR 87 (105) Q Consensus 39 ~~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR 87 (105) ....++..++....++...|.......++++.+|+ +|+...-- T Consensus 47 ~~iknlea~~~~~k~e~krl~erkk~~enk~~~LK------~yl~~~m~ 89 (162) T pfam05565 47 KVIKNLEADIEAIKAEIKRLAERKKSIENKVKRLK------DYLEEAME 89 (162) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHH T ss_conf 99983297899999999999999999998999999------99999999 No 122 >cd01401 PncB_like Nicotinate phosphoribosyltransferase (NAPRTase), related to PncB. Nicotinate phosphoribosyltransferase catalyses the formation of NAMN and PPi from 5-phosphoribosy -1-pyrophosphate (PRPP) and nicotinic acid, this is the first, and also rate limiting, reaction in the NAD salvage synthesis. This salvage pathway serves to recycle NAD degradation products. This subgroup is present in bacteria, archea and funghi. Probab=20.25 E-value=72 Score=14.06 Aligned_cols=34 Identities=21% Similarity=0.047 Sum_probs=21.9 Q ss_pred HHCCCCCCHHHHHHHHHHHHCCC--CCCCEEEEECC Q ss_conf 50279989899999999980987--89868767527 Q gi|254780671|r 71 LMSDGSLEKDLLDEKARYSLNLS--RSDEIILFYSD 104 (105) Q Consensus 71 ~L~~~~~D~d~lEe~aR~~Lg~~--kp~E~iii~~d 104 (105) -.+-+|=||-++-+.+|+.+.=. .|.-.+|+++| T Consensus 276 GVRlDSGDl~~~~~~~r~~~d~~G~~~~~k~iv~S~ 311 (377) T cd01401 276 GVRHDSGDPFEWGEKAIAHYEKLGIDPKTKTLVFSD 311 (377) T ss_pred EEECCCCCHHHHHHHHHHHHHHCCCCCCCCEEEEEC T ss_conf 885378678999999999999779895762899918 No 123 >PRK05431 seryl-tRNA synthetase; Provisional Probab=20.19 E-value=73 Score=14.05 Aligned_cols=39 Identities=15% Similarity=0.159 Sum_probs=20.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 999999999999999999999999999999950279989 Q gi|254780671|r 40 ANKSLEKSLIERERFLSELKENRSRLERKVKLMSDGSLE 78 (105) Q Consensus 40 ~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D 78 (105) ...++..+...++.+++.|+++.+.+..+|..++....| T Consensus 29 ~il~lD~~rr~l~~e~e~Lr~erN~iSK~Ig~~kk~g~~ 67 (422) T PRK05431 29 ELLELDEERRELQTETEELQAERNALSKEIGQAKAKGED 67 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 999999999999999999999999999999998748977 No 124 >pfam04065 Not3 Not1 N-terminal domain, CCR4-Not complex component. Probab=20.19 E-value=73 Score=14.05 Aligned_cols=43 Identities=26% Similarity=0.446 Sum_probs=26.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHC Q ss_conf 99999999999999999999999502799898999999999809 Q gi|254780671|r 48 LIERERFLSELKENRSRLERKVKLMSDGSLEKDLLDEKARYSLN 91 (105) Q Consensus 48 ~~~~~~~l~~l~~~~~~L~~~i~~L~~~~~D~d~lEe~aR~~Lg 91 (105) +..++..+...+-....|+.=...|.++.+||+-++++ ++.+- T Consensus 165 ~~~l~~~~erh~~Hi~kLE~llr~L~N~~l~~e~v~~i-kedie 207 (233) T pfam04065 165 IAELEELLERHKFHISKLELILRLLENGELDPEQVDDI-KEDIE 207 (233) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH-HHHHH T ss_conf 99999999999999999999999988599999999998-88899 No 125 >COG4768 Uncharacterized protein containing a divergent version of the methyl-accepting chemotaxis-like domain [General function prediction only] Probab=20.13 E-value=73 Score=14.04 Aligned_cols=35 Identities=11% Similarity=0.173 Sum_probs=25.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 99999999999999999999999999999995027 Q gi|254780671|r 40 ANKSLEKSLIERERFLSELKENRSRLERKVKLMSD 74 (105) Q Consensus 40 ~~~~l~~~~~~~~~~l~~l~~~~~~L~~~i~~L~~ 74 (105) ...+..+-++.++.+.+.+..+-+.|-++.+.|.+ T Consensus 32 tldevakt~~~l~~qv~gi~~eT~~Ll~K~N~L~e 66 (139) T COG4768 32 TLDEVAKTLKGLTSQVDGITHETEELLHKTNTLAE 66 (139) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999877778878999999989999 Done!