Query gi|254781104|ref|YP_003065517.1| hypothetical protein CLIBASIA_05030 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 125 No_of_seqs 143 out of 371 Neff 5.9 Searched_HMMs 39220 Date Mon May 30 05:29:21 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781104.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG5462 Predicted secreted (pe 99.9 2.7E-23 6.8E-28 162.5 10.4 113 1-114 1-113 (138) 2 TIGR02209 ftsL_broad cell divi 98.8 1.5E-07 3.8E-12 66.5 10.6 82 2-86 1-82 (85) 3 pfam04999 FtsL Cell division p 98.4 6.5E-06 1.7E-10 56.5 10.1 82 3-87 13-94 (97) 4 COG4839 FtsL Protein required 97.8 0.00028 7.2E-09 46.5 10.0 74 4-79 39-112 (120) 5 PRK10772 cell division protein 97.8 0.00056 1.4E-08 44.7 10.4 80 4-86 37-116 (121) 6 COG3116 FtsL Cell division pro 97.4 0.0038 9.6E-08 39.6 10.5 84 1-87 19-102 (105) 7 PRK00888 ftsB cell division pr 96.6 0.037 9.5E-07 33.6 9.8 74 2-78 1-78 (105) 8 pfam04977 DivIC Septum formati 96.3 0.021 5.3E-07 35.1 7.0 52 25-79 17-69 (80) 9 pfam12097 DUF3573 Protein of u 94.0 0.25 6.3E-06 28.5 6.5 60 1-60 1-63 (375) 10 PRK09039 hypothetical protein; 89.4 1.8 4.7E-05 23.3 7.0 53 5-57 26-78 (343) 11 pfam11853 DUF3373 Protein of u 87.1 1.5 3.7E-05 23.8 5.0 51 1-52 1-51 (485) 12 PRK09973 putative outer membra 84.4 3.2 8.1E-05 21.8 5.6 39 22-60 28-66 (85) 13 COG2919 Septum formation initi 82.9 4 0.0001 21.2 10.5 61 14-77 39-100 (117) 14 PRK11677 cytochrome d ubiquino 80.2 5 0.00013 20.6 8.2 70 1-73 1-72 (134) 15 pfam03245 Phage_lysis Bacterio 79.8 5.2 0.00013 20.5 9.6 62 1-75 1-70 (151) 16 TIGR00593 pola DNA polymerase 78.8 5.3 0.00014 20.4 5.1 58 23-83 618-675 (1005) 17 TIGR01710 typeII_sec_gspG gene 75.5 6.8 0.00017 19.8 5.5 88 2-96 3-90 (137) 18 PRK08928 consensus 74.8 5.2 0.00013 20.5 4.1 58 23-83 478-535 (861) 19 COG0749 PolA DNA polymerase I 72.9 5.5 0.00014 20.3 3.8 61 23-86 209-270 (593) 20 PRK08434 consensus 70.1 8.7 0.00022 19.1 4.3 58 23-83 510-567 (887) 21 TIGR01711 gspJ general secreti 69.5 9.5 0.00024 18.9 7.6 51 7-57 9-59 (202) 22 pfam10066 DUF2304 Uncharacteri 69.5 9.5 0.00024 18.9 7.2 45 3-47 64-108 (115) 23 PRK05929 consensus 66.9 4.8 0.00012 20.7 2.5 55 26-83 495-549 (870) 24 PRK07625 consensus 66.3 9.2 0.00023 19.0 3.8 57 23-82 539-595 (922) 25 pfam05137 PilN Fimbrial assemb 65.6 11 0.00029 18.4 8.7 76 16-98 33-109 (162) 26 pfam10828 DUF2570 Protein of u 65.3 12 0.00029 18.4 8.1 42 1-42 1-42 (110) 27 COG3883 Uncharacterized protei 64.9 12 0.0003 18.3 7.2 55 1-55 1-75 (265) 28 pfam00476 DNA_pol_A DNA polyme 63.6 12 0.00031 18.3 4.0 54 27-83 3-56 (383) 29 TIGR03064 sortase_srtB sortase 62.5 7.5 0.00019 19.5 2.8 37 3-39 4-40 (232) 30 PRK08786 consensus 61.3 9.4 0.00024 18.9 3.1 55 25-82 546-600 (927) 31 PRK05755 DNA polymerase I; Pro 61.3 11 0.00028 18.5 3.5 56 24-82 509-564 (889) 32 PRK08835 consensus 61.2 10 0.00026 18.7 3.3 55 25-82 549-603 (931) 33 PRK07997 consensus 59.2 8.1 0.00021 19.3 2.5 56 26-84 548-603 (928) 34 PRK06887 consensus 57.4 12 0.00031 18.2 3.2 54 26-82 575-628 (954) 35 PRK08076 consensus 55.9 11 0.00029 18.4 2.8 56 24-82 495-550 (877) 36 pfam06305 DUF1049 Protein of u 55.3 17 0.00044 17.3 7.1 21 23-43 58-78 (80) 37 PRK07300 consensus 55.2 14 0.00035 17.9 3.1 51 24-77 496-546 (880) 38 PRK07556 consensus 52.6 15 0.00037 17.8 2.9 55 26-83 592-646 (977) 39 pfam11336 DUF3138 Protein of u 50.4 18 0.00045 17.2 3.1 34 27-60 20-53 (514) 40 PRK05797 consensus 49.5 17 0.00044 17.3 2.9 57 24-83 488-544 (869) 41 PRK07456 consensus 49.1 22 0.00055 16.7 3.6 56 23-81 575-630 (975) 42 pfam11382 DUF3186 Protein of u 48.9 22 0.00056 16.7 7.3 36 24-59 31-66 (307) 43 PRK07898 consensus 47.3 23 0.00059 16.5 3.5 54 25-81 520-573 (902) 44 COG3917 NahD 2-hydroxychromene 45.9 10 0.00026 18.7 1.2 37 53-90 18-54 (203) 45 PRK00575 tatA twin arginine tr 45.4 25 0.00063 16.4 5.2 48 2-49 4-51 (88) 46 pfam06667 PspB Phage shock pro 44.9 25 0.00064 16.3 5.9 38 16-53 21-63 (75) 47 pfam10805 DUF2730 Protein of u 42.8 27 0.00069 16.1 8.8 58 12-69 14-72 (106) 48 pfam07665 consensus 42.7 27 0.0007 16.1 6.0 40 21-60 8-47 (57) 49 PRK08307 stage III sporulation 42.3 28 0.00071 16.1 7.6 60 1-69 1-62 (172) 50 pfam08172 CASP_C CASP C termin 41.1 29 0.00074 16.0 5.7 45 23-69 91-135 (245) 51 pfam07047 OPA3 Optic atrophy 3 41.0 29 0.00074 15.9 7.3 44 10-55 85-128 (134) 52 pfam01166 TSC22 TSC-22/dip/bun 40.7 29 0.00075 15.9 7.4 41 29-69 11-54 (57) 53 KOG0842 consensus 38.9 31 0.0008 15.7 3.5 35 38-81 162-196 (307) 54 PRK13455 F0F1 ATP synthase sub 37.7 33 0.00083 15.6 8.7 43 7-49 33-75 (184) 55 smart00338 BRLZ basic region l 36.7 34 0.00087 15.5 5.7 38 23-60 24-61 (65) 56 pfam05680 ATP-synt_E ATP synth 36.5 34 0.00087 15.5 7.4 53 11-68 17-70 (81) 57 KOG4797 consensus 36.5 34 0.00087 15.5 7.3 41 30-70 65-108 (123) 58 pfam07219 HemY_N HemY protein 35.7 33 0.00084 15.6 2.6 14 60-73 100-113 (134) 59 cd01279 HTH_HspR-like Helix-Tu 34.8 36 0.00093 15.3 4.2 36 28-63 61-96 (98) 60 COG2456 Uncharacterized conser 32.4 40 0.001 15.1 5.2 36 8-43 76-111 (121) 61 COG3167 PilO Tfp pilus assembl 32.2 40 0.001 15.1 7.6 19 5-23 27-45 (211) 62 TIGR01411 tatAE twin arginine- 32.2 40 0.001 15.1 3.5 46 1-46 1-47 (48) 63 pfam07106 TBPIP Tat binding pr 31.0 42 0.0011 15.0 5.4 38 25-62 72-109 (169) 64 KOG0488 consensus 29.6 44 0.0011 14.8 3.7 37 40-88 183-219 (309) 65 pfam06295 DUF1043 Protein of u 29.4 45 0.0011 14.8 8.0 64 7-73 3-68 (128) 66 COG3322 Predicted periplasmic 28.4 47 0.0012 14.7 7.9 34 25-58 42-75 (295) 67 TIGR02238 recomb_DMC1 meiotic 28.1 31 0.00078 15.8 1.4 12 60-71 143-154 (314) 68 PRK09458 pspB phage shock prot 27.7 48 0.0012 14.6 6.9 39 15-53 20-63 (75) 69 CHL00167 consensus 27.4 49 0.0012 14.6 8.4 53 5-57 37-89 (182) 70 pfam05545 FixQ Cbb3-type cytoc 27.1 47 0.0012 14.6 2.2 23 4-26 13-35 (49) 71 pfam10925 DUF2680 Protein of u 26.8 50 0.0013 14.5 3.9 34 27-60 2-35 (59) 72 COG5509 Uncharacterized small 26.3 51 0.0013 14.5 3.8 25 33-57 26-50 (65) 73 COG2143 Thioredoxin-related pr 25.5 53 0.0013 14.4 2.6 22 1-22 1-22 (182) 74 PRK00442 tatA twin arginine tr 23.5 58 0.0015 14.1 3.6 44 3-46 5-48 (90) 75 pfam06720 Phi-29_GP16_7 Bacter 23.4 58 0.0015 14.1 5.4 34 23-56 25-58 (130) 76 TIGR01061 parC_Gpos DNA topois 23.1 59 0.0015 14.1 5.6 68 20-91 429-499 (745) 77 COG3416 Uncharacterized protei 23.0 59 0.0015 14.1 5.1 49 19-69 42-90 (233) 78 TIGR01242 26Sp45 26S proteasom 22.8 60 0.0015 14.0 5.5 47 22-68 3-49 (364) 79 pfam03978 Borrelia_REV Borreli 21.5 63 0.0016 13.9 9.2 71 5-75 7-86 (160) 80 PRK09173 F0F1 ATP synthase sub 21.3 64 0.0016 13.9 8.3 43 7-49 8-50 (159) 81 pfam11488 Lge1 Transcriptional 21.1 64 0.0016 13.8 5.2 35 26-60 178-212 (234) 82 pfam09889 DUF2116 Uncharacteri 21.0 65 0.0017 13.8 2.5 21 2-22 36-56 (59) 83 PRK04561 tatA twin arginine tr 20.4 67 0.0017 13.7 3.4 44 2-45 4-47 (75) 84 smart00394 RIIa RIIalpha, Regu 20.1 50 0.0013 14.5 1.2 20 58-77 16-35 (38) No 1 >COG5462 Predicted secreted (periplasmic) protein [Function unknown] Probab=99.90 E-value=2.7e-23 Score=162.46 Aligned_cols=113 Identities=36% Similarity=0.614 Sum_probs=101.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEEC Q ss_conf 91189999999999999999999999999999999999999999999999999999718979999999987764185405 Q gi|254781104|r 1 MFKTLDFIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQAT 80 (125) Q Consensus 1 M~r~~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi 80 (125) |+|+|+++.+++++..++.+|.||+++.....+++++.++|+.|++.|.+||||||||++|.||++||..|..+|+|+|+ T Consensus 1 mlRtfdivli~imv~Aa~vtysIK~~ae~~l~~vrkl~~qI~sE~dtIdlLkAdwAlLtqP~RLekLa~aY~t~L~Lepv 80 (138) T COG5462 1 MLRTFDIVLIAIMVAAATVTYSIKHEAETQLAEVRKLHAQIKSEEDTIDLLKADWALLTQPNRLEKLAAAYKTELKLEPV 80 (138) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCC T ss_conf 95146779999999999998768888888999999999998412112789998599970816899999999977346746 Q ss_pred CHHHCCCHHHCCCCCCCCCCCCCCCCCHHHHHHC Q ss_conf 8888322888565543357864545401444421 Q gi|254781104|r 81 NPINLITYDDLARLKKHTLLPENRSNLPKRTVER 114 (125) Q Consensus 81 ~p~qi~~~~~l~~~~~~~~~~~~~~~~~~~~~~r 114 (125) .|.|++....+|++....+ +....++.+...+| T Consensus 81 ~s~Qlv~~~~~P~l~~~lp-~~~~~~i~k~~~~r 113 (138) T COG5462 81 KSTQLVGPPELPMLRSDLP-FNPNIDIQKILDGR 113 (138) T ss_pred CHHHHCCCCCCCCHHHHCC-CCCCHHHHHHHHHH T ss_conf 7787518833443665289-99971388888889 No 2 >TIGR02209 ftsL_broad cell division protein FtsL; InterPro: IPR011922 This entry represents FtsL, both forms similar to that in Escherichia coli and similar to that in Bacillus subtilis. FtsL is one of the later proteins active in cell division septum formation. FtsL is small, low in complexity, and highly divergent , .; GO: 0007049 cell cycle, 0051301 cell division, 0016021 integral to membrane. Probab=98.75 E-value=1.5e-07 Score=66.47 Aligned_cols=82 Identities=22% Similarity=0.233 Sum_probs=75.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECC Q ss_conf 11899999999999999999999999999999999999999999999999999997189799999999877641854058 Q gi|254781104|r 2 FKTLDFIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATN 81 (125) Q Consensus 2 ~r~~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~ 81 (125) +++++++.+++++++++.+-..+|.++..-.++.+++.++...+...+-|+.|-+-|++|+|++++|+. +|||.|.. T Consensus 1 ~~~l~~~l~~~~l~~a~~~v~~~~~~r~~~~e~~~~~~~~~~~~~e~~~L~lE~~~L~~~~Ri~~iA~~---~L~m~~~~ 77 (85) T TIGR02209 1 EKKLYVLLLLAILVSAISVVSVQHQTRQLNNELQKLQLEIDKLQKEWRDLQLEVAELSRHERIEKIAKK---QLGMKLPD 77 (85) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH---HCCCCCCC T ss_conf 917999999999999999999999999999999999999999999999899999860148899999998---65899876 Q ss_pred HHHCC Q ss_conf 88832 Q gi|254781104|r 82 PINLI 86 (125) Q Consensus 82 p~qi~ 86 (125) +++-. T Consensus 78 ~~~~~ 82 (85) T TIGR02209 78 ANIKV 82 (85) T ss_pred CCCEE T ss_conf 55401 No 3 >pfam04999 FtsL Cell division protein FtsL. In Escherichia coli, nine gene products are known to be essential for assembly of the division septum. One of these, FtsL, is a bitopic membrane protein whose precise function is not understood. It has been proposed that FtsL interacts with the DivIC protein pfam04977, however this interaction may be indirect. Probab=98.36 E-value=6.5e-06 Score=56.49 Aligned_cols=82 Identities=17% Similarity=0.171 Sum_probs=74.5 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECCH Q ss_conf 18999999999999999999999999999999999999999999999999999971897999999998776418540588 Q gi|254781104|r 3 KTLDFIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATNP 82 (125) Q Consensus 3 r~~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~p 82 (125) .++.++++++++++|+.+-...|.++..-.++.+++.+-...+.+-.-|..|-+.+++|.|++.+|. +.|+|.+..+ T Consensus 13 ~~l~i~L~~~v~~SAl~vV~~~h~~R~l~~~l~~l~~e~~~L~~ew~~L~LEqstla~~~RIe~~A~---~~L~M~~P~~ 89 (97) T pfam04999 13 GKLLLLLLLLVLVSAVGVVYSTHQTRQLTSELQKLLQERDRLEIEWGNLLLEQSTLAEHSRVERIAR---EQLGMKRPDP 89 (97) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHH---HCCCCCCCCC T ss_conf 8899999999999999999999998999999999999999999999999999998614767999998---6479999998 Q ss_pred HHCCC Q ss_conf 88322 Q gi|254781104|r 83 INLIT 87 (125) Q Consensus 83 ~qi~~ 87 (125) ++++- T Consensus 90 ~~~v~ 94 (97) T pfam04999 90 EQEVI 94 (97) T ss_pred CCEEE T ss_conf 76665 No 4 >COG4839 FtsL Protein required for the initiation of cell division [Cell division and chromosome partitioning] Probab=97.83 E-value=0.00028 Score=46.51 Aligned_cols=74 Identities=23% Similarity=0.222 Sum_probs=58.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEE Q ss_conf 8999999999999999999999999999999999999999999999999999971897999999998776418540 Q gi|254781104|r 4 TLDFIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQA 79 (125) Q Consensus 4 ~~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~P 79 (125) .|++..+.+++.++++.-..++.+-....+|..|+..|.+++.++..|+-|-.-|+||+|+-+.|+.. .|+|.+ T Consensus 39 vly~~~~va~L~vai~ii~~q~~~yqvq~ei~~Le~kIs~q~~e~~dlkqeV~dLss~eRIldiAkk~--gLkl~~ 112 (120) T COG4839 39 VLYTTLAVAALVVAISIISVQTKAYQVQGEITDLESKISEQKTENDDLKQEVKDLSSPERILDIAKKA--GLKLNN 112 (120) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHC--CCCCCC T ss_conf 99999999999999999999999999976999999999999843006999999845688999999875--666787 No 5 >PRK10772 cell division protein FtsL; Provisional Probab=97.75 E-value=0.00056 Score=44.70 Aligned_cols=80 Identities=14% Similarity=0.096 Sum_probs=71.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECCHH Q ss_conf 89999999999999999999999999999999999999999999999999999718979999999987764185405888 Q gi|254781104|r 4 TLDFIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATNPI 83 (125) Q Consensus 4 ~~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~p~ 83 (125) +..+++++++++.|+.+-.+-|.|+.+-.+..+|..+=.....+-.-|..|-+-|+...|++.+|. +.|+|.+..|+ T Consensus 37 Kl~llLli~Vl~SAl~VV~~tH~TR~l~~e~e~L~~erd~Ld~EW~nL~LEqstl~~hsRIE~~A~---~~L~M~~pdp~ 113 (121) T PRK10772 37 KLPLCLFICIILTAVTVVTTAHHTRLLTAQREQLVLERDALDIEWRNLILEENALGDHSRVERIAT---EKLQMQHVDPS 113 (121) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHCCCCCCCCC T ss_conf 099999999999989999998898999988999999999999999999999986221658999999---85598369941 Q ss_pred HCC Q ss_conf 832 Q gi|254781104|r 84 NLI 86 (125) Q Consensus 84 qi~ 86 (125) |-+ T Consensus 114 qEv 116 (121) T PRK10772 114 QEN 116 (121) T ss_pred CCE T ss_conf 044 No 6 >COG3116 FtsL Cell division protein [Cell division and chromosome partitioning] Probab=97.38 E-value=0.0038 Score=39.64 Aligned_cols=84 Identities=14% Similarity=0.108 Sum_probs=73.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEEC Q ss_conf 91189999999999999999999999999999999999999999999999999999718979999999987764185405 Q gi|254781104|r 1 MFKTLDFIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQAT 80 (125) Q Consensus 1 M~r~~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi 80 (125) |++++.++++++++..|..+-..-|.|+.+-.+..++..+=...+.+-.-|..|-+-+..-.|++.+|. +.|+|+-. T Consensus 19 ~~~kl~l~LLi~ivlsAi~vv~~tH~tRqL~~e~~~~~~er~~L~~EwrnLilEe~tl~~hsRVe~iAr---eqL~M~~p 95 (105) T COG3116 19 TSGKLPLLLLIAIVLSAIGVVYTTHHTRQLIAELEQLVLERDALNIEWRNLILEENTLGDHSRVESIAR---EQLKMKHP 95 (105) T ss_pred HCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHCCCCC T ss_conf 427199999999999999999999999999987589999999987599988988875345778999999---98467789 Q ss_pred CHHHCCC Q ss_conf 8888322 Q gi|254781104|r 81 NPINLIT 87 (125) Q Consensus 81 ~p~qi~~ 87 (125) .|.|.+- T Consensus 96 d~~qe~~ 102 (105) T COG3116 96 DPAQEVI 102 (105) T ss_pred CHHHHHH T ss_conf 8576443 No 7 >PRK00888 ftsB cell division protein FtsB; Reviewed Probab=96.59 E-value=0.037 Score=33.56 Aligned_cols=74 Identities=14% Similarity=0.091 Sum_probs=56.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-HHHHHHHHHHHHHHHCC Q ss_conf 11899999999999999999999---999999999999999999999999999999997189-79999999987764185 Q gi|254781104|r 2 FKTLDFIILGVVLASITITYSIK---HETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQ-PDRIKDLVSLYQKELQL 77 (125) Q Consensus 2 ~r~~~ii~~~~~i~l~~~~y~iK---~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~-P~RL~~La~~y~~~L~L 77 (125) ||.+.++++++++.+-+..+.-+ .+...+.+++..++.++...+++-..|++|-..|.+ ++-++..|+. +||| T Consensus 1 Mr~l~~~L~~ll~~LQY~lwfg~~g~~~~~~L~~~i~~~~~~n~~L~~~N~~L~~EI~~Lk~~~~aIE~~AR~---eLGm 77 (105) T PRK00888 1 MRLLTLVLLALLVWLQYPLWFGKNGWLDYHRLNQQVAAQQQTNAKLKARNDQLFAEIDDLKGGQEAIEERARN---ELGM 77 (105) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH---HCCC T ss_conf 9669999999999999999404896999999999999999999999999999999999987787999999999---8698 Q ss_pred E Q ss_conf 4 Q gi|254781104|r 78 Q 78 (125) Q Consensus 78 ~ 78 (125) - T Consensus 78 I 78 (105) T PRK00888 78 V 78 (105) T ss_pred C T ss_conf 6 No 8 >pfam04977 DivIC Septum formation initiator. DivIC from B. subtilis is necessary for both vegetative and sporulation septum formation. These proteins are mainly composed of an amino terminal coiled-coil. Probab=96.29 E-value=0.021 Score=35.14 Aligned_cols=52 Identities=19% Similarity=0.183 Sum_probs=43.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-CCHHHHHHHHHHHHHHHCCEE Q ss_conf 9999999999999999999999999999999971-897999999998776418540 Q gi|254781104|r 25 HETEGKKEKLRILENKITSEQNYIDLLKAQWALL-IQPDRIKDLVSLYQKELQLQA 79 (125) Q Consensus 25 ~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL-~~P~RL~~La~~y~~~L~L~P 79 (125) .....+.+++..++.++...+++...|++|-..| ++|+.+++.|+. +|||.- T Consensus 17 ~~~~~l~~~i~~l~~~~~~l~~~n~~L~~ei~~L~~d~~~ie~~AR~---~lg~~~ 69 (80) T pfam04977 17 SAYYQLNQEIAALQAELAKLKAENEELEAEVKDLKSDPDYIEERARS---ELGLVK 69 (80) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH---HCCCCC T ss_conf 99999999999999999999999999999999850899999999999---859838 No 9 >pfam12097 DUF3573 Protein of unknown function (DUF3573). This family of proteins is functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 372 to 530 amino acids in length. Probab=93.98 E-value=0.25 Score=28.54 Aligned_cols=60 Identities=20% Similarity=0.136 Sum_probs=27.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 91189999999999999999999999---9999999999999999999999999999997189 Q gi|254781104|r 1 MFKTLDFIILGVVLASITITYSIKHE---TEGKKEKLRILENKITSEQNYIDLLKAQWALLIQ 60 (125) Q Consensus 1 M~r~~~ii~~~~~i~l~~~~y~iK~~---t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~ 60 (125) |+++..++.++++++....-....|. +.+..++-.-.+++|.+++..|+.|++|-.+|.. T Consensus 1 ~f~K~~li~~~~l~~s~~~~~~~s~~~~~~~q~~~s~~~~~~~i~~Lq~QI~~Lq~eIn~l~~ 63 (375) T pfam12097 1 MFIKKLLILIFFLTISFLFFIGRSYAVVNAIQQTDSNQIDKKDISKLQKQIKSLQAQINHLEE 63 (375) T ss_pred CCHHHHHHHHHHHHHHHHHEEHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 916778899999999987500046533334422748775388999999999999999987775 No 10 >PRK09039 hypothetical protein; Validated Probab=89.38 E-value=1.8 Score=23.27 Aligned_cols=53 Identities=17% Similarity=0.246 Sum_probs=30.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999999999999999999999999997 Q gi|254781104|r 5 LDFIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWAL 57 (125) Q Consensus 5 ~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEway 57 (125) +.++.+|++++..+.-|...+.....+..+..|+++|....+.+..-+.+-+. T Consensus 26 LLmViIFvL~IFvv~Q~~Lsq~lsgqd~~L~~L~~qia~L~~~L~Ler~~~~~ 78 (343) T PRK09039 26 LLLVIMFLLTVFVVAQFFLSQEISGKDKALDRLNAQIAELADLLSLERQGNQD 78 (343) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999997255999999999999999998765655456 No 11 >pfam11853 DUF3373 Protein of unknown function (DUF3373). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are typically between 472 to 574 amino acids in length. Probab=87.13 E-value=1.5 Score=23.84 Aligned_cols=51 Identities=14% Similarity=0.139 Sum_probs=28.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9118999999999999999999999999999999999999999999999999 Q gi|254781104|r 1 MFKTLDFIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQNYIDLLK 52 (125) Q Consensus 1 M~r~~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~Lk 52 (125) |.+++.+++..+++++...++-. ..-..++++|..|++|+.+.++++..++ T Consensus 1 Mkk~~~l~~~aall~~s~~~~a~-~~d~~~~qkI~~L~~ql~eLk~~~~~~~ 51 (485) T pfam11853 1 MKKLVSLSLVAALLALSLPAAAA-AADIDLLQKIEALKKELAELKAQLKDLN 51 (485) T ss_pred CCHHHHHHHHHHHHHHCCCHHCC-CCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 91147899999999750410013-4434499999999999999999999877 No 12 >PRK09973 putative outer membrane lipoprotein; Provisional Probab=84.41 E-value=3.2 Score=21.79 Aligned_cols=39 Identities=13% Similarity=0.078 Sum_probs=25.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 999999999999999999999999999999999997189 Q gi|254781104|r 22 SIKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQ 60 (125) Q Consensus 22 ~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~ 60 (125) ....+++.+..+|.+|+.+|...+-.+...|.|-+.-|+ T Consensus 28 qLsSdVqtLN~KVdqLs~Dv~alr~~vqaAK~EAaRANq 66 (85) T PRK09973 28 QLASNVQTLNAKIARLEQDMKALRPQIYAAKSEANRANT 66 (85) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 998889999999999998899876999998889998874 No 13 >COG2919 Septum formation initiator [Cell division and chromosome partitioning] Probab=82.93 E-value=4 Score=21.19 Aligned_cols=61 Identities=11% Similarity=0.000 Sum_probs=48.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH-HHHHHHHHHHHHHHCC Q ss_conf 999999999999999999999999999999999999999999971897-9999999987764185 Q gi|254781104|r 14 LASITITYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQP-DRIKDLVSLYQKELQL 77 (125) Q Consensus 14 i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P-~RL~~La~~y~~~L~L 77 (125) ...+.+.-.-...+.....++...+++......+...|++|-.-|++. +-+++.|.. .|++ T Consensus 39 ~~~~~~g~~~~~~~~~l~~qi~~~~~e~~~L~~~~~~l~~ei~~L~dg~~~i~e~AR~---~l~~ 100 (117) T COG2919 39 QYLAWFGKNGAADVLQLQRQIAAQQAELEKLSARNTALEAEIKDLKDGRDYIEERARS---ELGM 100 (117) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH---HHCC T ss_conf 9999998705999999999999999999999878999999998703537899999999---8288 No 14 >PRK11677 cytochrome d ubiquinol oxidase subunit III; Provisional Probab=80.24 E-value=5 Score=20.60 Aligned_cols=70 Identities=10% Similarity=0.138 Sum_probs=42.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHCCHHHHHHHHHHHHH Q ss_conf 91189999999999999999999999999999999999999999999999999999--71897999999998776 Q gi|254781104|r 1 MFKTLDFIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWA--LLIQPDRIKDLVSLYQK 73 (125) Q Consensus 1 M~r~~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwa--yL~~P~RL~~La~~y~~ 73 (125) |...+-++++++-++.|++..+. +....++..+|++++...+.++...+.|.+ +-.+-+=+..|++-|.+ T Consensus 1 M~W~~ali~lvvGiiIG~~~~R~---~~~~~~~q~~Le~eLe~~k~el~~Yr~ev~~HF~~tA~Ll~~l~~~Y~~ 72 (134) T PRK11677 1 MTWEYALIGLVVGIIIGAVAMRF---GNRKLRQQQALQYELEKNKAELEEYRQELVDHFARSAELLDTMAKDYRQ 72 (134) T ss_pred CCHHHHHHHHHHHHHHHHHHHHH---CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 92899999999999999999998---1751167999999999999999999999999999999999999999999 No 15 >pfam03245 Phage_lysis Bacteriophage lysis protein. This protein is involved in host lysis. This family is not considered to be a peptidase according to the MEROPs database. Probab=79.81 E-value=5.2 Score=20.52 Aligned_cols=62 Identities=18% Similarity=0.237 Sum_probs=28.6 Q ss_pred CCCHHHHHHHHHHHHHHHHH---HHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 91189999999999999999---999999-----9999999999999999999999999999997189799999999877 Q gi|254781104|r 1 MFKTLDFIILGVVLASITIT---YSIKHE-----TEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQ 72 (125) Q Consensus 1 M~r~~~ii~~~~~i~l~~~~---y~iK~~-----t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~ 72 (125) |-|++.+++++++++++.+. |.-+.. ..+...++...+..|...+.++. .+..|-.+|. T Consensus 1 m~rl~~~iia~~i~l~~al~w~hy~~~a~~~k~q~~~~~~~l~~~~a~i~~~q~r~r-------------~la~LD~k~t 67 (151) T pfam03245 1 MSRLLAILIAVLIALLLALAWATYRDNADTAERQRDQATAELALANATIEDMQTRQR-------------DLAALDAKHT 67 (151) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHHHHH T ss_conf 933799999999999999999999877999999999999999999979999999999-------------9999978999 Q ss_pred HHH Q ss_conf 641 Q gi|254781104|r 73 KEL 75 (125) Q Consensus 73 ~~L 75 (125) ++| T Consensus 68 kEL 70 (151) T pfam03245 68 KEL 70 (151) T ss_pred HHH T ss_conf 999 No 16 >TIGR00593 pola DNA polymerase I; InterPro: IPR002298 DNA carries the biological information that instructs cells how to exist in an ordered fashion. Accurate replication is thus one of the most important events in the cell life cycle. This function is mediated by DNA-directed DNA polymerases, which add nucleotide triphosphate (dNTP) residues to the 5'-end of the growing DNA chain, using a complementary DNA as template. Small RNA molecules are generally used as primers for chain elongation, although terminal proteins may also be used. DNA-dependent DNA polymerases have been grouped into families, denoted A, B and X, on the basis of sequence similarities , . Members of family A, which includes bacterial and bacteriophage polymerases, share significant similarity to Escherichia coli polymerase I; hence family A is also known as the pol I family. The bacterial polymerases also contain an exonuclease activity, which is coded for in the N-terminal portion. Three motifs, A, B and C , are seen to be conserved across all DNA polymerases, with motifs A and C also seen in RNA polymerases. They are centred on invariant residues, and their structural significance was implied from the Klenow (E. coli) structure. Motif A contains a strictly-conserved aspartate at the junction of a beta-strand and an alpha-helix; motif B contains an alpha-helix with positive charges; and motif C has a doublet of negative charges, located in a beta-turn-beta secondary structure .; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=78.81 E-value=5.3 Score=20.43 Aligned_cols=58 Identities=14% Similarity=0.149 Sum_probs=46.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECCHH Q ss_conf 9999999999999999999999999999999999718979999999987764185405888 Q gi|254781104|r 23 IKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATNPI 83 (125) Q Consensus 23 iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~p~ 83 (125) |+-+...+..==.++..+|....++|..+=-+|-.||||+-|+..= |++|+|-+.+-. T Consensus 618 i~~D~~~L~~L~~e~~~~l~~lE~~Iy~~aG~~FNinSPKQL~~vL---F~KL~Lp~~kK~ 675 (1005) T TIGR00593 618 IKVDKDYLQELSKEFGEELLDLEEEIYELAGEEFNINSPKQLGEVL---FEKLGLPVGKKT 675 (1005) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHH---HHHHCCCCCCCC T ss_conf 7870899999999999999999999998749520688733555886---998478820034 No 17 >TIGR01710 typeII_sec_gspG general secretion pathway protein G; InterPro: IPR010054 This entry represents GspG, protein G of the main terminal branch of the general secretion pathway, also called type II secretion. It transports folded proteins across the bacterial outer membrane and is widely distributed in Gram-negative pathogens.; GO: 0008565 protein transporter activity, 0015628 protein secretion by the type II secretion system, 0015627 type II protein secretion system complex. Probab=75.54 E-value=6.8 Score=19.76 Aligned_cols=88 Identities=17% Similarity=0.240 Sum_probs=64.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECC Q ss_conf 11899999999999999999999999999999999999999999999999999997189799999999877641854058 Q gi|254781104|r 2 FKTLDFIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATN 81 (125) Q Consensus 2 ~r~~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~ 81 (125) |.++.++.+++++|+.+.+-.=|+-.+.=..+...-..+|...++.+...|.|--.. |..=|.|+. |--.|.. T Consensus 3 FTLlE~~VVlvIlGLLaalv~P~l~s~~d~A~~~~a~~qi~~l~~ALd~yrLDng~y--Pt~eQGL~A-----LV~kP~~ 75 (137) T TIGR01710 3 FTLLEIMVVLVILGLLAALVVPKLFSQADKAKAQVAKAQIKALKNALDMYRLDNGRY--PTEEQGLAA-----LVKKPSG 75 (137) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC--CCCHHHHHH-----HCCCCCC T ss_conf 326788999999999999986553487553578889999999999987665662589--881578998-----2779587 Q ss_pred HHHCCCHHHCCCCCC Q ss_conf 888322888565543 Q gi|254781104|r 82 PINLITYDDLARLKK 96 (125) Q Consensus 82 p~qi~~~~~l~~~~~ 96 (125) +.+=.++..=|+++. T Consensus 76 ~p~P~~W~~DgYl~k 90 (137) T TIGR01710 76 EPLPKNWKGDGYLRK 90 (137) T ss_pred CCCCCCCCCCCCEEC T ss_conf 668876188886425 No 18 >PRK08928 consensus Probab=74.79 E-value=5.2 Score=20.48 Aligned_cols=58 Identities=14% Similarity=0.115 Sum_probs=43.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECCHH Q ss_conf 9999999999999999999999999999999999718979999999987764185405888 Q gi|254781104|r 23 IKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATNPI 83 (125) Q Consensus 23 iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~p~ 83 (125) |+-++..+.+--..++.+|...+++|..+--+=--+|||..|..+ -|++|+|-+.+-. T Consensus 478 I~VD~~~L~~ls~el~~~l~~Le~~I~~lAG~eFNInSPKQLgeI---LFekL~Lp~~KKt 535 (861) T PRK08928 478 IKVDANYLKQLSAEFGTEILKLEEEIYKLSGTKFNIASPKQLGEI---LFEKMGLPKGKKS 535 (861) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHH---HHHHCCCCCCCCC T ss_conf 601699999999999999999999999974987475999999999---7997299867767 No 19 >COG0749 PolA DNA polymerase I - 3'-5' exonuclease and polymerase domains [DNA replication, recombination, and repair] Probab=72.85 E-value=5.5 Score=20.34 Aligned_cols=61 Identities=16% Similarity=0.128 Sum_probs=49.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEE-CCHHHCC Q ss_conf 999999999999999999999999999999999971897999999998776418540-5888832 Q gi|254781104|r 23 IKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQA-TNPINLI 86 (125) Q Consensus 23 iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~P-i~p~qi~ 86 (125) |+-++..+...-..+..++....+.+..|-.|--.++||+.|+.+- |+.|+|-| ....+.+ T Consensus 209 i~vD~~~L~~l~~el~~~l~~le~eiy~laG~~FNi~SPKQL~~IL---feKl~Lp~~~kKtktG 270 (593) T COG0749 209 IKVDVQYLKELSKELGCELAELEEEIYELAGEEFNINSPKQLGEIL---FEKLGLPPGLKKTKTG 270 (593) T ss_pred CEECHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHH---HHHCCCCCCCCCCCCC T ss_conf 5642999999999999999999999999826857899778999998---8851898555545789 No 20 >PRK08434 consensus Probab=70.07 E-value=8.7 Score=19.14 Aligned_cols=58 Identities=17% Similarity=0.112 Sum_probs=40.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECCHH Q ss_conf 9999999999999999999999999999999999718979999999987764185405888 Q gi|254781104|r 23 IKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATNPI 83 (125) Q Consensus 23 iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~p~ 83 (125) |+-++..+.+--..+.++|...+++|..+--+=--+|||..|..+ -|++|+|-+.+-. T Consensus 510 I~VD~~~L~~ls~el~~~i~~Le~~Iy~lAG~eFNInSPKQLgeI---LFekLgLp~~KKT 567 (887) T PRK08434 510 IKLDTNFLEELKKKFEEEIKELTEEIYELCGEEFNINSTKQLGAV---LFEKLKLPSGKKT 567 (887) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHH---HHHHCCCCCCCCC T ss_conf 502699999999999999999999999855766477999999999---8986499888888 No 21 >TIGR01711 gspJ general secretion pathway protein J; InterPro: IPR010055 This entry represents GspJ, one of two proteins highly conserved at their N-termini and described by Bacterial type II secretion system protein I/J but is easily separable phylogenetically. The other is the General secretion pathway protein I. Both GspI and GspJ are proteins of the type II secretion pathway, or main terminal branch of the general secretion pathway. This pathway carries proteins across the outer membrane. Note that proteins of type II secretion are cryptic in Escherichia coli K-12 - present but not yet demonstrated to act on any target.; GO: 0008565 protein transporter activity, 0015628 protein secretion by the type II secretion system, 0015627 type II protein secretion system complex. Probab=69.50 E-value=9.5 Score=18.90 Aligned_cols=51 Identities=10% Similarity=0.139 Sum_probs=42.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999999999999999999999999999997 Q gi|254781104|r 7 FIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWAL 57 (125) Q Consensus 7 ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEway 57 (125) +++|.+|..+++.+|++=+.+..-++=-..=++-+.+++-...++.+|..= T Consensus 9 LvAIAIFAsLs~gaYqvl~~V~~sde~~~~~eaRl~EL~rAm~~merD~tQ 59 (202) T TIGR01711 9 LVAIAIFASLSLGAYQVLDSVLRSDEVAKKKEARLAELQRAMVLMERDLTQ 59 (202) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999998875246789899887799999999998655563 No 22 >pfam10066 DUF2304 Uncharacterized conserved protein (DUF2304). Members of this family of hypothetical archaeal proteins have no known function. Probab=69.49 E-value=9.5 Score=18.90 Aligned_cols=45 Identities=9% Similarity=0.103 Sum_probs=35.7 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 189999999999999999999999999999999999999999999 Q gi|254781104|r 3 KTLDFIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQNY 47 (125) Q Consensus 3 r~~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~ 47 (125) |..+++.++.++++.+.+|..-.+..+.+.++.+|-++|+-.++. T Consensus 64 rg~n~lf~~~i~~l~~~~f~l~~~is~le~~it~L~~~iAi~e~~ 108 (115) T pfam10066 64 RGLDALFYLGIGLLFYLLFKLYLKIDKLEREITKLVREIAIRERE 108 (115) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 301599999999999999999999999999999999999987256 No 23 >PRK05929 consensus Probab=66.86 E-value=4.8 Score=20.69 Aligned_cols=55 Identities=18% Similarity=0.161 Sum_probs=37.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECCHH Q ss_conf 9999999999999999999999999999999718979999999987764185405888 Q gi|254781104|r 26 ETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATNPI 83 (125) Q Consensus 26 ~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~p~ 83 (125) ++..+.+--..+.++|...+++|..+--+---+|||..|.. --|++|+|.|++-. T Consensus 495 D~~~L~~ls~el~~~l~~Le~~I~~~AG~eFNinSPKQL~e---vLFekLgL~~~kKt 549 (870) T PRK05929 495 DVEGLAVLERDLSKELAVLTQEIYDLAGCPFNIKSPKQLSD---ILYNRLGLRPIDKA 549 (870) T ss_pred EHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHH---HHHHHCCCCCCCCC T ss_conf 19999989999999999999999997299988789899999---98986399988888 No 24 >PRK07625 consensus Probab=66.26 E-value=9.2 Score=18.98 Aligned_cols=57 Identities=12% Similarity=0.098 Sum_probs=40.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECCH Q ss_conf 999999999999999999999999999999999971897999999998776418540588 Q gi|254781104|r 23 IKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATNP 82 (125) Q Consensus 23 iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~p 82 (125) |+-++..+.+.-..+.+++...+++|..+--+---+|||..|..+- |++|+|-+++- T Consensus 539 I~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~eFNinSpkQL~~iL---Fe~L~Lp~~kK 595 (922) T PRK07625 539 VLIDADRLSRQSGEIATRLLALEQEAYELAGGEFNLGSPKQIGQIF---FERLQLPVVKK 595 (922) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHH---HHHCCCCCCCC T ss_conf 4026999999999999999999998777549977899999999999---98538987885 No 25 >pfam05137 PilN Fimbrial assembly protein (PilN). Probab=65.64 E-value=11 Score=18.42 Aligned_cols=76 Identities=20% Similarity=0.271 Sum_probs=39.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH-HHHHHHHHHHHHHHCCEECCHHHCCCHHHCCCC Q ss_conf 9999999999999999999999999999999999999999971897-999999998776418540588883228885655 Q gi|254781104|r 16 SITITYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQP-DRIKDLVSLYQKELQLQATNPINLITYDDLARL 94 (125) Q Consensus 16 l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P-~RL~~La~~y~~~L~L~Pi~p~qi~~~~~l~~~ 94 (125) ++.+.+............+..|+.+|......++..+.-..-...= .|++.+... +--++.-.--+++|+.. T Consensus 33 ~~~~~~~~~~~i~~q~~~~~~L~~ei~~L~~~i~ei~~l~~~~~~l~~r~~~i~~L-------q~~r~~~~~ll~~L~~~ 105 (162) T pfam05137 33 AFAGWLYLDNQIEAQQERNAFLRTQIQQLDGQLKEIQALKQERQALLERMKKIQEL-------QQNRPQTVRVFDELARL 105 (162) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHCCCHHHHHHHHHHHH T ss_conf 99999999999999999999999999999999999999999999999999999999-------97182689999999875 Q ss_pred CCCC Q ss_conf 4335 Q gi|254781104|r 95 KKHT 98 (125) Q Consensus 95 ~~~~ 98 (125) .|+. T Consensus 106 lP~g 109 (162) T pfam05137 106 LPDG 109 (162) T ss_pred CCCC T ss_conf 8997 No 26 >pfam10828 DUF2570 Protein of unknown function (DUF2570). This is a family of proteins with unknown function. Probab=65.30 E-value=12 Score=18.38 Aligned_cols=42 Identities=10% Similarity=0.128 Sum_probs=27.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 911899999999999999999999999999999999999999 Q gi|254781104|r 1 MFKTLDFIILGVVLASITITYSIKHETEGKKEKLRILENKIT 42 (125) Q Consensus 1 M~r~~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~ 42 (125) |++.++..+.++++|++.|+..-......+..+-..=-.-|. T Consensus 1 m~~~i~~~l~~lilgL~gwlw~QS~~Id~LrAen~~QaqtI~ 42 (110) T pfam10828 1 MSKYIKATLAALILGLGGWSWYQSQKIDSLRAENKAQAQTIE 42 (110) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 906899999999999999999998888999986788999999 No 27 >COG3883 Uncharacterized protein conserved in bacteria [Function unknown] Probab=64.88 E-value=12 Score=18.33 Aligned_cols=55 Identities=18% Similarity=0.211 Sum_probs=25.3 Q ss_pred CCCHHHHHHHHHHHHHH-HHHHHHHHH-------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 91189999999999999-999999999-------------------99999999999999999999999999999 Q gi|254781104|r 1 MFKTLDFIILGVVLASI-TITYSIKHE-------------------TEGKKEKLRILENKITSEQNYIDLLKAQW 55 (125) Q Consensus 1 M~r~~~ii~~~~~i~l~-~~~y~iK~~-------------------t~~~~~ei~~L~~~I~~~~~~i~~LkAEw 55 (125) |.|.+....+..++++. ++.+.+..+ .+..+.+|..|..+|......+.-++.|. T Consensus 1 M~kk~~~a~~~s~v~~s~~~~t~V~a~~~~~~i~~~ds~l~~~~~~~~~~q~ei~~L~~qi~~~~~k~~~~~~~i 75 (265) T COG3883 1 MKKKILLAVLLSLVIISTAFLTTVFAALLSDKIQNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEI 75 (265) T ss_pred CCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 914899999999999998110002133533477766889999999999899999999999999999888789888 No 28 >pfam00476 DNA_pol_A DNA polymerase family A. Probab=63.59 E-value=12 Score=18.26 Aligned_cols=54 Identities=24% Similarity=0.114 Sum_probs=36.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECCHH Q ss_conf 999999999999999999999999999999718979999999987764185405888 Q gi|254781104|r 27 TEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATNPI 83 (125) Q Consensus 27 t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~p~ 83 (125) +..+++--..+..++.....+|..+--+=--++||.- |++..|++|+|.|.... T Consensus 3 ~~~l~~~~~~l~~~~~~l~~~i~~~aG~~fN~~S~~q---l~~vLf~~l~l~~~~~t 56 (383) T pfam00476 3 VEYLEELSNELGAQLADLEFKIPELAGEEFNLGSPKQ---LGVLLFEELGLPKTKKT 56 (383) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHH---HHHHHHHHCCCCCCCCC T ss_conf 8999999999999999999999999599769999899---99984973599999977 No 29 >TIGR03064 sortase_srtB sortase, SrtB family. Members of this transpeptidase family are, in most cases, designated sortase B, product of the srtB gene. This protein shows only distant similarity to the sortase A family, for which there may be several members in a single bacterial genome. Typical SrtB substrate motifs include NAKTN, NPKSS, etc, and otherwise resemble the LPXTG sorting signals recognized by sortase A proteins. Probab=62.51 E-value=7.5 Score=19.51 Aligned_cols=37 Identities=16% Similarity=0.118 Sum_probs=24.1 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 1899999999999999999999999999999999999 Q gi|254781104|r 3 KTLDFIILGVVLASITITYSIKHETEGKKEKLRILEN 39 (125) Q Consensus 3 r~~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~ 39 (125) |++.++++++|+++++.+..+=++-....++-..++. T Consensus 4 ~ii~iv~l~Vf~~s~~~l~~~~~~y~~~~~~~~~~~~ 40 (232) T TIGR03064 4 KSITLLFLIVFFYSLYKLGQIFYDYYTNRQVLAEAQQ 40 (232) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8899999999999999999999887663699999998 No 30 >PRK08786 consensus Probab=61.28 E-value=9.4 Score=18.92 Aligned_cols=55 Identities=18% Similarity=0.180 Sum_probs=37.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECCH Q ss_conf 9999999999999999999999999999999971897999999998776418540588 Q gi|254781104|r 25 HETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATNP 82 (125) Q Consensus 25 ~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~p 82 (125) -++..+.+--..+.+++...+++|..+--+.--+|||..|+.+ -|++|+|-+.+- T Consensus 546 vD~~~L~~l~~~l~~~l~~le~~I~~~aG~eFNinSpKQL~~i---LFekL~Lp~~kK 600 (927) T PRK08786 546 VDAAELRRQSADLSKRMLAAQQKATELAGRTFNLDSPKQLQAL---LFDELKLPAVVK 600 (927) T ss_pred ECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHH---HHHHCCCCCCCC T ss_conf 8899999999999999999999999970855578999999999---975028987873 No 31 >PRK05755 DNA polymerase I; Provisional Probab=61.27 E-value=11 Score=18.53 Aligned_cols=56 Identities=20% Similarity=0.157 Sum_probs=35.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECCH Q ss_conf 99999999999999999999999999999999971897999999998776418540588 Q gi|254781104|r 24 KHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATNP 82 (125) Q Consensus 24 K~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~p 82 (125) +-+...+.+--..++++|...++.|..+--+---+|||..|. +--|++|+|-+.+- T Consensus 509 ~VD~~~L~~ls~el~~~l~~le~~I~~~aG~eFNinSPKQL~---~iLFekL~Lp~~KK 564 (889) T PRK05755 509 KVDREYLEELSAELAQRLAELEQEIYELAGEEFNLNSPKQLG---EILFEKLGLPVTKK 564 (889) T ss_pred EECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHH---HHHHHHCCCCCCCC T ss_conf 866999999999999999999999998749903779989999---99998539997887 No 32 >PRK08835 consensus Probab=61.19 E-value=10 Score=18.69 Aligned_cols=55 Identities=5% Similarity=0.049 Sum_probs=37.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECCH Q ss_conf 9999999999999999999999999999999971897999999998776418540588 Q gi|254781104|r 25 HETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATNP 82 (125) Q Consensus 25 ~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~p 82 (125) -++..+.+--..+.++|...+++|..+--+=-.+|||..|..+ -|+.|+|-+.+- T Consensus 549 vD~~~L~~ls~el~~~l~~le~~I~~~aG~eFNinSPKQL~~v---LF~~l~Lp~~kK 603 (931) T PRK08835 549 IDDMLLSAQSQEIAARLDELEQKAYEIAGQEFNMNSPKQLQTI---LFEKMGLPVIKK 603 (931) T ss_pred ECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHH---HHHHCCCCCCCC T ss_conf 7899999999999999999999998762785677999999999---998609987885 No 33 >PRK07997 consensus Probab=59.23 E-value=8.1 Score=19.31 Aligned_cols=56 Identities=4% Similarity=0.008 Sum_probs=36.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECCHHH Q ss_conf 99999999999999999999999999999997189799999999877641854058888 Q gi|254781104|r 26 ETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATNPIN 84 (125) Q Consensus 26 ~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~p~q 84 (125) ++..+.+--..+.+++...++.|..+--+=--+|||..|.++ -|++|+|-|.+-.. T Consensus 548 D~~~L~~ls~el~~~l~~le~~I~~~aG~eFNinSpKQLgeI---LFekLgLp~~KKTk 603 (928) T PRK07997 548 DPAVLHAHSEELTLRLAELEKKAHEIAGEPFNLSSTKQLQTI---LFEKQGIKPLKKTP 603 (928) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHH---HHHHCCCCCCCCCC T ss_conf 899999999999999999999999862998887899999999---89862999888588 No 34 >PRK06887 consensus Probab=57.40 E-value=12 Score=18.23 Aligned_cols=54 Identities=11% Similarity=0.143 Sum_probs=35.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECCH Q ss_conf 999999999999999999999999999999971897999999998776418540588 Q gi|254781104|r 26 ETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATNP 82 (125) Q Consensus 26 ~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~p 82 (125) ++..+.+--..+..++...++.|..+-.+---+|||..|..+- |++|+|-+.+- T Consensus 575 D~~~L~~ls~el~~~l~~le~~iy~~aG~eFNinSpKQL~eiL---FekLgLp~~KK 628 (954) T PRK06887 575 DSDALLAQSNEIAQRLTALEQQAYELAGQPFNLASTKQLQEIL---FDKLGLPVLKK 628 (954) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHH---HHHCCCCCCCC T ss_conf 7999999999999999999999988609954899989999998---98429998885 No 35 >PRK08076 consensus Probab=55.89 E-value=11 Score=18.40 Aligned_cols=56 Identities=16% Similarity=0.140 Sum_probs=38.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECCH Q ss_conf 99999999999999999999999999999999971897999999998776418540588 Q gi|254781104|r 24 KHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATNP 82 (125) Q Consensus 24 K~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~p 82 (125) +-++..+.+--..+.++|...+++|..+--+=--+|||..| ++-.|++|+|-+.+- T Consensus 495 ~VD~~~L~~ls~~l~~~l~~le~~I~~laG~eFNinSpKQL---~~ILFekL~Lp~~KK 550 (877) T PRK08076 495 KVDTERLENMGEELAGRLKELEQEIHELAGEEFNINSPKQL---GVILFEKLGLPVIKK 550 (877) T ss_pred EECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHH---HHHHHHHCCCCCCCC T ss_conf 89899999999999999999999999972898788999999---999998538988887 No 36 >pfam06305 DUF1049 Protein of unknown function (DUF1049). This family consists of several hypothetical bacterial proteins of unknown function. Probab=55.33 E-value=17 Score=17.31 Aligned_cols=21 Identities=19% Similarity=0.406 Sum_probs=9.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999 Q gi|254781104|r 23 IKHETEGKKEKLRILENKITS 43 (125) Q Consensus 23 iK~~t~~~~~ei~~L~~~I~~ 43 (125) .|.+.+.+.+++.++++++.. T Consensus 58 ~r~~~r~l~k~lk~l~~e~~~ 78 (80) T pfam06305 58 LRRRLRRLKKQLKKLEKELEQ 78 (80) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999884 No 37 >PRK07300 consensus Probab=55.22 E-value=14 Score=17.93 Aligned_cols=51 Identities=20% Similarity=0.066 Sum_probs=35.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCC Q ss_conf 999999999999999999999999999999999718979999999987764185 Q gi|254781104|r 24 KHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQL 77 (125) Q Consensus 24 K~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L 77 (125) +-++..+..--..++++|...+++|..+--+=--+|||..|..+ -|++|+| T Consensus 496 ~VD~~~L~~ls~el~~~l~~Le~~Iy~lAG~eFNInSPKQLgeI---LFekLgL 546 (880) T PRK07300 496 KVNRETLQEMAAENEVVIEALTQEIYDLAGQEFNINSPKQLGVI---LFEKMGL 546 (880) T ss_pred EECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHH---HHHHCCC T ss_conf 88899999999999999999999999865876577999999999---9984499 No 38 >PRK07556 consensus Probab=52.59 E-value=15 Score=17.75 Aligned_cols=55 Identities=11% Similarity=0.040 Sum_probs=34.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECCHH Q ss_conf 9999999999999999999999999999999718979999999987764185405888 Q gi|254781104|r 26 ETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATNPI 83 (125) Q Consensus 26 ~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~p~ 83 (125) ++..+.+--..+..++...+++|..+--+=--+|||..|.. .-|++|+|.+.+-. T Consensus 592 D~~~L~~l~~~l~~~l~~le~~I~~~aG~eFNi~SpkQL~~---iLF~~l~lp~~kkt 646 (977) T PRK07556 592 DRQVLSRLSGEFAQKAARLEAEIYELAGEPFNIGSPKQLGD---ILFGKMGLPGGSKT 646 (977) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHH---HHHHHCCCCCCCCC T ss_conf 69999999999999999999999987089878788999999---99986089888868 No 39 >pfam11336 DUF3138 Protein of unknown function (DUF3138). This family of proteins with unknown function appear to be restricted to Proteobacteria. Probab=50.42 E-value=18 Score=17.24 Aligned_cols=34 Identities=18% Similarity=0.263 Sum_probs=25.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999999999999999999997189 Q gi|254781104|r 27 TEGKKEKLRILENKITSEQNYIDLLKAQWALLIQ 60 (125) Q Consensus 27 t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~ 60 (125) ......+|..|+.++..++...+.|+++.+--.. T Consensus 20 a~s~a~~ikalqaq~~~lq~q~~~l~a~laak~~ 53 (514) T pfam11336 20 AASDASQIKALQAQLTALQQQVNELRAALAAKPA 53 (514) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 2589999999999999999999999998614876 No 40 >PRK05797 consensus Probab=49.53 E-value=17 Score=17.31 Aligned_cols=57 Identities=16% Similarity=0.161 Sum_probs=37.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECCHH Q ss_conf 999999999999999999999999999999999718979999999987764185405888 Q gi|254781104|r 24 KHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATNPI 83 (125) Q Consensus 24 K~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~p~ 83 (125) +-++..+.+--..+.+++...+++|..+--+=--+|||..|. +--|++|+|.+.+-. T Consensus 488 ~VD~~~L~~l~~~l~~~l~~le~~I~~~aG~eFNinSPKQL~---~iLFe~LgLp~~KKt 544 (869) T PRK05797 488 KVDKDILDELSKKFKEEIEKTQKEIYELAGEEFNINSPKQLG---KILFEKLDLPVIKKT 544 (869) T ss_pred EECHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHH---HHHHHHCCCCCCCCC T ss_conf 999999999999999999999999998628866789989999---998986499988878 No 41 >PRK07456 consensus Probab=49.15 E-value=22 Score=16.71 Aligned_cols=56 Identities=20% Similarity=0.167 Sum_probs=37.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECC Q ss_conf 99999999999999999999999999999999997189799999999877641854058 Q gi|254781104|r 23 IKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATN 81 (125) Q Consensus 23 iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~ 81 (125) |+-++..+..--..+.++|...++.|..+--+=--+|||..|.. --|++|+|.+.. T Consensus 575 I~vD~~~L~~l~~~l~~~l~~le~~i~~~aG~eFNinSpkQL~e---vLFekL~L~~~k 630 (975) T PRK07456 575 IRIDIPYLKELSQELESTLERLEQQAYELAGEDFNLSSPKQLGE---LLFEKLGLDRKK 630 (975) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHH---HHHHHCCCCCCC T ss_conf 50169999999999999999999999986489889899899999---989863999877 No 42 >pfam11382 DUF3186 Protein of unknown function (DUF3186). This bacterial family of proteins has no known function. Probab=48.88 E-value=22 Score=16.68 Aligned_cols=36 Identities=11% Similarity=0.028 Sum_probs=23.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 999999999999999999999999999999999718 Q gi|254781104|r 24 KHETEGKKEKLRILENKITSEQNYIDLLKAQWALLI 59 (125) Q Consensus 24 K~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~ 59 (125) ..-...++.+...|+.+....+.+...++++....+ T Consensus 31 ~~l~~~l~~~~~~Lr~e~~~l~~~~~~l~~~~~~~d 66 (307) T pfam11382 31 ENLLSGLEDEFSDLRTENDRLRAEREALNEQASAAD 66 (307) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 567878888999999999999999999999999999 No 43 >PRK07898 consensus Probab=47.34 E-value=23 Score=16.54 Aligned_cols=54 Identities=11% Similarity=0.079 Sum_probs=35.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECC Q ss_conf 999999999999999999999999999999997189799999999877641854058 Q gi|254781104|r 25 HETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATN 81 (125) Q Consensus 25 ~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~ 81 (125) -++..+.+--..+.++|...++.|..+--+=--+|||..|..+ -|++|+|-+.+ T Consensus 520 vD~~~L~~ls~el~~~l~~le~~I~~~aG~eFNinSpKQL~ei---LFdkL~Lp~~K 573 (902) T PRK07898 520 VDLAHLTELQSQFAGQVRDAAEAAYAVIGKQINLGSPKQLQVV---LFDELGMPKTK 573 (902) T ss_pred ECHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHH---HHHHCCCCCCC T ss_conf 8199999899999999999999999981895689998999999---89970999888 No 44 >COG3917 NahD 2-hydroxychromene-2-carboxylate isomerase [Secondary metabolites biosynthesis, transport, and catabolism] Probab=45.88 E-value=10 Score=18.73 Aligned_cols=37 Identities=19% Similarity=0.270 Sum_probs=29.2 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHHCCEECCHHHCCCHHH Q ss_conf 99997189799999999877641854058888322888 Q gi|254781104|r 53 AQWALLIQPDRIKDLVSLYQKELQLQATNPINLITYDD 90 (125) Q Consensus 53 AEwayL~~P~RL~~La~~y~~~L~L~Pi~p~qi~~~~~ 90 (125) .-|+||..| |+..||..|.-.+.+.|+...-+....- T Consensus 18 SP~ayL~~~-~~~~laq~~ga~v~~rP~llg~vfk~tG 54 (203) T COG3917 18 SPYAYLAWP-RLPALAQAYGAAVALRPILLGGVFKATG 54 (203) T ss_pred CCHHHHHHH-HHHHHHHHCCCCEEEEEEEECEEEEECC T ss_conf 815876024-4599999809852787530020586237 No 45 >PRK00575 tatA twin arginine translocase protein A; Provisional Probab=45.38 E-value=25 Score=16.35 Aligned_cols=48 Identities=17% Similarity=0.253 Sum_probs=38.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 118999999999999999999999999999999999999999999999 Q gi|254781104|r 2 FKTLDFIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQNYID 49 (125) Q Consensus 2 ~r~~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~ 49 (125) +..-+++.+++++.+.|..-.+-.-.+.+-+-++..+.++++.+++.+ T Consensus 4 l~~w~~~ii~~vv~llFG~kkLp~~arslG~s~r~fK~~~~e~~~e~~ 51 (88) T PRK00575 4 LSPWHWAILAVVVIVLFGAKKLPDAARSLGKSMRIFKSEVREMQNENK 51 (88) T ss_pred CCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 517999999999999967540168999999999999999987551000 No 46 >pfam06667 PspB Phage shock protein B. This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one. Probab=44.93 E-value=25 Score=16.31 Aligned_cols=38 Identities=18% Similarity=0.191 Sum_probs=26.2 Q ss_pred HHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999-----999999999999999999999999 Q gi|254781104|r 16 SITITYSIKHETEG-----KKEKLRILENKITSEQNYIDLLKA 53 (125) Q Consensus 16 l~~~~y~iK~~t~~-----~~~ei~~L~~~I~~~~~~i~~LkA 53 (125) =.+.+|.-|.++.. -.+.+..|-..-...+++|..|+. T Consensus 21 WL~LHYrsk~k~~~~Ls~~d~~~L~~L~~~A~rm~~RI~tLE~ 63 (75) T pfam06667 21 WLILHYRSKWKVSQGLSEEDEQLLEELLETAEKLQERIQTLER 63 (75) T ss_pred HHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999656023467789999999999999999999999999999 No 47 >pfam10805 DUF2730 Protein of unknown function (DUF2730). This family of proteins with unknown function appears to be restricted to Gammaproteobacteria. Probab=42.84 E-value=27 Score=16.11 Aligned_cols=58 Identities=16% Similarity=0.131 Sum_probs=38.5 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 999999999999999999-9999999999999999999999999997189799999999 Q gi|254781104|r 12 VVLASITITYSIKHETEG-KKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVS 69 (125) Q Consensus 12 ~~i~l~~~~y~iK~~t~~-~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~ 69 (125) .++++++.++..+.+.+. .-.++..+...+....+++..++.+..+|-+-+-+.+|-- T Consensus 14 ~~~~~~~~~~~~~l~ktyakkedv~~l~~~~~~~~~Rl~~lE~~v~~lPt~~dv~~L~~ 72 (106) T pfam10805 14 AVLSIAGGAFWALLSKTYAKREDVEKLIDIIASNDKRLTEMETKLESLPTAKDVHALKI 72 (106) T ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 99999999999999843213988999999988777689999999853996878999999 No 48 >pfam07665 consensus Probab=42.66 E-value=27 Score=16.09 Aligned_cols=40 Identities=13% Similarity=0.145 Sum_probs=31.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999999999999999999999999997189 Q gi|254781104|r 21 YSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQ 60 (125) Q Consensus 21 y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~ 60 (125) -..|..+..+...+..+...|...++..++|+++-|-+-+ T Consensus 8 ~~lk~Dv~~LK~Dmv~vK~Di~~LK~dV~~Lk~Dvs~iK~ 47 (57) T pfam07665 8 KQLKTDVAVLKTDMVEVKSDIKILKEDVAVLKKDVSKIKK 47 (57) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9999889999999999999999999999999998999860 No 49 >PRK08307 stage III sporulation protein SpoAB; Provisional Probab=42.34 E-value=28 Score=16.06 Aligned_cols=60 Identities=17% Similarity=0.279 Sum_probs=32.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH--HHHHHHHH Q ss_conf 9118999999999999999999999999999999999999999999999999999971897--99999999 Q gi|254781104|r 1 MFKTLDFIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQP--DRIKDLVS 69 (125) Q Consensus 1 M~r~~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P--~RL~~La~ 69 (125) |++.+-.+++++..++.-+.|.-++ +..-++++ ..+.-+..|++|-+|=..| +=+.+++. T Consensus 1 MlKliGa~LIi~s~~~~G~~~a~~~--~~R~~qL~-------~l~~~l~~L~~EI~Y~~tpL~eal~~i~~ 62 (172) T PRK08307 1 MMKLLGAVLIVAASTWIGFLYAERY--KERPRQLR-------ELKAALQSLEAEIMYGHTPLPEALENIAK 62 (172) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHH-------HHHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 9189999999999889999999999--99999999-------99999999999998808989999999987 No 50 >pfam08172 CASP_C CASP C terminal. This domain is the C-terminal region of the CASP family of proteins. It is a Golgi membrane protein which is thought to have a role in vesicle transport. Probab=41.14 E-value=29 Score=15.95 Aligned_cols=45 Identities=29% Similarity=0.347 Sum_probs=34.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 99999999999999999999999999999999997189799999999 Q gi|254781104|r 23 IKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVS 69 (125) Q Consensus 23 iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~ 69 (125) -|-....+|+++++++..+...+.++.-|++|-.-|= ++++=|.. T Consensus 91 Fr~Rn~ELE~elr~~~~~~~~l~~ev~~L~~DN~kLY--EK~Rylqs 135 (245) T pfam08172 91 FRKRNTELEEELRKLNQTISSLRQELASLKADNIKLY--EKIRYLQS 135 (245) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHH T ss_conf 9999999999999999999999999999998889999--99998860 No 51 >pfam07047 OPA3 Optic atrophy 3 protein (OPA3). This family consists of several optic atrophy 3 (OPA3) proteins. OPA3 deficiency causes type III 3-methylglutaconic aciduria (MGA) in humans. This disease manifests with early bilateral optic atrophy, spasticity, extrapyramidal dysfunction, ataxia, and cognitive deficits, but normal longevity. Probab=40.96 E-value=29 Score=15.93 Aligned_cols=44 Identities=11% Similarity=0.209 Sum_probs=21.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999999999999999999999999999 Q gi|254781104|r 10 LGVVLASITITYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQW 55 (125) Q Consensus 10 ~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEw 55 (125) |.+.+++.+|=| .-..+...++-..++.++...++++..|+.+- T Consensus 85 F~Va~~li~~E~--~Rs~~ke~~Kee~~~~~l~~L~~~i~~L~~~~ 128 (134) T pfam07047 85 FSVAGGLLVYEY--QRSSRKEAKKEEELQQELEELEQRVEELELEV 128 (134) T ss_pred HHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999--99842476589999999999999999999999 No 52 >pfam01166 TSC22 TSC-22/dip/bun family. Probab=40.71 E-value=29 Score=15.91 Aligned_cols=41 Identities=22% Similarity=0.403 Sum_probs=31.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---CCHHHHHHHHH Q ss_conf 999999999999999999999999999971---89799999999 Q gi|254781104|r 29 GKKEKLRILENKITSEQNYIDLLKAQWALL---IQPDRIKDLVS 69 (125) Q Consensus 29 ~~~~ei~~L~~~I~~~~~~i~~LkAEwayL---~~P~RL~~La~ 69 (125) ..-+++.-|...|....++++.|+.|-++| -+|+-|..+.. T Consensus 11 AVREEVevLk~~I~eL~~~~~~le~EN~~Lk~~~spe~l~ql~~ 54 (57) T pfam01166 11 AVREEVEVLKEQIKELEEKNSQLERENTLLKSLASPEQLEQLQS 54 (57) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 99999999999999999999999999999986099999999976 No 53 >KOG0842 consensus Probab=38.89 E-value=31 Score=15.74 Aligned_cols=35 Identities=23% Similarity=0.360 Sum_probs=24.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECC Q ss_conf 99999999999999999997189799999999877641854058 Q gi|254781104|r 38 ENKITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATN 81 (125) Q Consensus 38 ~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~ 81 (125) +.+.-+++.+++.-| ||+-||| +.||.. |+|.|+. T Consensus 162 qAQV~ELERRFrqQR----YLSAPER-E~LA~~----LrLT~TQ 196 (307) T KOG0842 162 QAQVYELERRFRQQR----YLSAPER-EHLASS----LRLTPTQ 196 (307) T ss_pred HHHHHHHHHHHHHHH----CCCCHHH-HHHHHH----CCCCCHH T ss_conf 668999999987510----3561739-999986----1787201 No 54 >PRK13455 F0F1 ATP synthase subunit B; Provisional Probab=37.72 E-value=33 Score=15.62 Aligned_cols=43 Identities=0% Similarity=-0.065 Sum_probs=25.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999999999999999999999999 Q gi|254781104|r 7 FIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQNYID 49 (125) Q Consensus 7 ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~ 49 (125) .++|++|+++.+|...-+.=+..++++...++.+|...+..-. T Consensus 33 ~IsFvif~~iL~~~~vp~~I~~~LD~R~~~I~~dLdeAe~lre 75 (184) T PRK13455 33 TLAFLLFIGILVYFKVPGMIGGMLDKRAEGIRSELEEARALRE 75 (184) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999955899999999999999999999999999 No 55 >smart00338 BRLZ basic region leucin zipper. Probab=36.72 E-value=34 Score=15.53 Aligned_cols=38 Identities=24% Similarity=0.203 Sum_probs=29.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 99999999999999999999999999999999997189 Q gi|254781104|r 23 IKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQ 60 (125) Q Consensus 23 iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~ 60 (125) -|-....++.++..|+.+....+..+..|+.|...|.. T Consensus 24 Kk~~~~~Le~~~~~L~~en~~L~~~~~~L~~e~~~Lk~ 61 (65) T smart00338 24 KKAEIEELERKVEQLEAENERLKKEIERLRRELEKLKS 61 (65) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999999999999999999 No 56 >pfam05680 ATP-synt_E ATP synthase E chain. This family consists of several ATP synthase E chain sequences which are components of the CF(0) subunit. Probab=36.52 E-value=34 Score=15.51 Aligned_cols=53 Identities=17% Similarity=0.117 Sum_probs=34.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHH-HHHHH Q ss_conf 99999999999999999999999999999999999999999999997189799-99999 Q gi|254781104|r 11 GVVLASITITYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDR-IKDLV 68 (125) Q Consensus 11 ~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~R-L~~La 68 (125) ++++|+.++.|.-.+- .+.....+++...+.-|...|+||+-...|.- ...|+ T Consensus 17 aL~~Gi~yGa~h~~~L-----~k~~~~~re~e~keklI~~AK~e~aKk~~~~~e~~~l~ 70 (81) T pfam05680 17 ALVAGVVYGAKHQRSL-----KKKEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAKALA 70 (81) T ss_pred HHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 9999999989999888-----88999999999999999999999987537754555663 No 57 >KOG4797 consensus Probab=36.49 E-value=34 Score=15.50 Aligned_cols=41 Identities=20% Similarity=0.369 Sum_probs=29.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH---CCHHHHHHHHHH Q ss_conf 99999999999999999999999999971---897999999998 Q gi|254781104|r 30 KKEKLRILENKITSEQNYIDLLKAQWALL---IQPDRIKDLVSL 70 (125) Q Consensus 30 ~~~ei~~L~~~I~~~~~~i~~LkAEwayL---~~P~RL~~La~~ 70 (125) .-.|+..|..+|.+..++.+.|+.|-+.| .+|+.|..|-.. T Consensus 65 VREEVe~Lk~qI~eL~er~~~Le~EN~lLk~~~spe~L~ql~~~ 108 (123) T KOG4797 65 VREEVEVLKEQIRELEERNSALERENSLLKTLASPEQLAQLPAQ 108 (123) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 99999999999999999999999987899850899999998873 No 58 >pfam07219 HemY_N HemY protein N-terminus. This family represents the N-terminus (approximately 150 residues) of bacterial HemY porphyrin biosynthesis proteins. This is a membrane protein involved in a late step of protoheme IX synthesis. Probab=35.69 E-value=33 Score=15.60 Aligned_cols=14 Identities=7% Similarity=-0.137 Sum_probs=5.4 Q ss_pred CHHHHHHHHHHHHH Q ss_conf 97999999998776 Q gi|254781104|r 60 QPDRIKDLVSLYQK 73 (125) Q Consensus 60 ~P~RL~~La~~y~~ 73 (125) +.++=++++.+-.+ T Consensus 100 d~~~A~k~~~ka~k 113 (134) T pfam07219 100 DWALAERLARKAAE 113 (134) T ss_pred CHHHHHHHHHHHHH T ss_conf 99999999999876 No 59 >cd01279 HTH_HspR-like Helix-Turn-Helix DNA binding domain of HspR-like transcription regulators. Helix-turn-helix (HTH) transcription regulator HspR and related proteins, N-terminal domain. Heat shock protein regulators (HspR) have been shown to regulate expression of specific regulons in response to high temperature or high osmolarity in Streptomyces and Helicobacter, respectively. These proteins share the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their conserved N-terminal domains contain predicted winged HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules. Probab=34.80 E-value=36 Score=15.34 Aligned_cols=36 Identities=11% Similarity=-0.103 Sum_probs=30.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHH Q ss_conf 999999999999999999999999999997189799 Q gi|254781104|r 28 EGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDR 63 (125) Q Consensus 28 ~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~R 63 (125) -.-.++|..++.++...+.+...|++|-.-.+.|.| T Consensus 61 l~gik~iL~l~~q~~~l~~~~~~l~~el~~~~~~~~ 96 (98) T cd01279 61 LAGIKRIIELYPQVLLLQCRSCEHATELIGCAKPCR 96 (98) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 999999999899999999999999999998766650 No 60 >COG2456 Uncharacterized conserved protein [Function unknown] Probab=32.44 E-value=40 Score=15.10 Aligned_cols=36 Identities=6% Similarity=0.122 Sum_probs=22.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999999999999999 Q gi|254781104|r 8 IILGVVLASITITYSIKHETEGKKEKLRILENKITS 43 (125) Q Consensus 8 i~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~ 43 (125) +.++.+.++-+.+|.+--...+.+++|..|.++|+- T Consensus 76 L~vitI~~ayyLlfrlYl~IdkiereITelvreIAi 111 (121) T COG2456 76 LFVITIGLAYYLLFRLYLDIDKIEREITELVREIAI 111 (121) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 467999999999999999987899999999999988 No 61 >COG3167 PilO Tfp pilus assembly protein PilO [Cell motility and secretion / Intracellular trafficking and secretion] Probab=32.24 E-value=40 Score=15.08 Aligned_cols=19 Identities=16% Similarity=0.099 Sum_probs=10.1 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999 Q gi|254781104|r 5 LDFIILGVVLASITITYSI 23 (125) Q Consensus 5 ~~ii~~~~~i~l~~~~y~i 23 (125) |.++..+++.++|+|-|.- T Consensus 27 ~~lL~~~~V~~lGy~f~~s 45 (211) T COG3167 27 FCLLAVAAVLGLGYAFYLS 45 (211) T ss_pred HHHHHHHHHHHHHHHHHHC T ss_conf 9999999999999999854 No 62 >TIGR01411 tatAE twin arginine-targeting protein translocase, TatA/E family; InterPro: IPR006312 This family represents TatA and TatE. The Tat (twin-arginine translocation) system is a Sec-independent exporter for folded proteins, often with a redox cofactor already bound, across the bacterial inner membrane. Functionally equivalent systems are found in the chloroplast and some in archaeal species. The signal peptide recognized by the Tat system is modeled by IPR006311 from INTERPRO. ; GO: 0015031 protein transport, 0016021 integral to membrane. Probab=32.23 E-value=40 Score=15.08 Aligned_cols=46 Identities=7% Similarity=0.143 Sum_probs=39.4 Q ss_pred CC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 91-18999999999999999999999999999999999999999999 Q gi|254781104|r 1 MF-KTLDFIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQN 46 (125) Q Consensus 1 M~-r~~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~ 46 (125) ++ ...+.+.+++++.+.|.+-.+..-.+++-+=|+.-++..+++++ T Consensus 1 ~~~s~~~~liiL~v~~LlFG~kKLp~~grdlG~aik~FKka~~~~~~ 47 (48) T TIGR01411 1 GLTSPWHLLIILVVILLLFGAKKLPELGRDLGKAIKEFKKALKEDED 47 (48) T ss_pred CCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 98779999999999886417753478999998988998863202246 No 63 >pfam07106 TBPIP Tat binding protein 1(TBP-1)-interacting protein (TBPIP). This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation. Probab=31.03 E-value=42 Score=14.96 Aligned_cols=38 Identities=11% Similarity=0.043 Sum_probs=28.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHH Q ss_conf 99999999999999999999999999999999718979 Q gi|254781104|r 25 HETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPD 62 (125) Q Consensus 25 ~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~ 62 (125) -+...++.+|..|+.++...+..++.|++|.+-|++-- T Consensus 72 eel~~~d~~i~~L~~~l~~l~~~~~~l~~el~~L~s~~ 109 (169) T pfam07106 72 EELNKLDMEIEELREEVQLLKQDCSTLEIELKSLTSDL 109 (169) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 89987699999999999999999999999999987589 No 64 >KOG0488 consensus Probab=29.64 E-value=44 Score=14.81 Aligned_cols=37 Identities=22% Similarity=0.275 Sum_probs=21.9 Q ss_pred HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCEECCHHHCCCH Q ss_conf 9999999999999999971897999999998776418540588883228 Q gi|254781104|r 40 KITSEQNYIDLLKAQWALLIQPDRIKDLVSLYQKELQLQATNPINLITY 88 (125) Q Consensus 40 ~I~~~~~~i~~LkAEwayL~~P~RL~~La~~y~~~L~L~Pi~p~qi~~~ 88 (125) ++..++.++...| ||+.||| .+||.. |+|.- .|+.++ T Consensus 183 Ql~~LEkrF~~QK----YLS~~DR-~~LA~~----LgLTd---aQVKtW 219 (309) T KOG0488 183 QLFELEKRFEKQK----YLSVADR-IELAAS----LGLTD---AQVKTW 219 (309) T ss_pred HHHHHHHHHHHHH----CCCHHHH-HHHHHH----CCCCH---HHHHHH T ss_conf 9999999887750----2567889-999998----29925---568887 No 65 >pfam06295 DUF1043 Protein of unknown function (DUF1043). This family consists of several hypothetical bacterial proteins of unknown function. Probab=29.42 E-value=45 Score=14.79 Aligned_cols=64 Identities=16% Similarity=0.158 Sum_probs=35.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHCCHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999999999999999999999999--71897999999998776 Q gi|254781104|r 7 FIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWA--LLIQPDRIKDLVSLYQK 73 (125) Q Consensus 7 ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwa--yL~~P~RL~~La~~y~~ 73 (125) ++++++-+++|+++.+. +....++..+|++++...+.++...+.|.+ +-.+-+=+..|++-|.+ T Consensus 3 li~lvvG~iiG~~~~R~---~~~~~~~q~~Le~eLe~~k~el~~Y~~ev~~HF~~tA~Ll~~l~~~Y~~ 68 (128) T pfam06295 3 IIGLIVGLIIGLIIMRL---TNPTYKNQKKIKKELEEAKEELDEQRQELKDHFAQSAELLETLGKDYHK 68 (128) T ss_pred HHHHHHHHHHHHHHHHH---CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 89999999999999997---2740768999999999999999999999999999999999999999999 No 66 >COG3322 Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms] Probab=28.37 E-value=47 Score=14.68 Aligned_cols=34 Identities=29% Similarity=0.528 Sum_probs=29.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999999999999971 Q gi|254781104|r 25 HETEGKKEKLRILENKITSEQNYIDLLKAQWALL 58 (125) Q Consensus 25 ~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL 58 (125) -+.+...++...+++.+..++..+.-.-.+||.= T Consensus 42 ~d~~~~~~~~~~Ve~~l~~e~~~i~~~~~d~a~W 75 (295) T COG3322 42 LDEQALLENVELVENALDSEQKSIDRFVSDWAFW 75 (295) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 6799999889999999999999998663014468 No 67 >TIGR02238 recomb_DMC1 meiotic recombinase Dmc1; InterPro: IPR011940 This entry describes DMC1, a subfamily of a larger family of DNA repair and recombination proteins. It is eukaryotic only and most closely related to eukaryotic RAD51. It also resembles archaeal RadA (IPR011938 from INTERPRO) and RadB (IPR011939 from INTERPRO) and bacterial RecA. It has been characterised for human as a recombinase active only in meiosis .; GO: 0003677 DNA binding, 0005524 ATP binding, 0008094 DNA-dependent ATPase activity, 0007131 meiotic recombination, 0005634 nucleus. Probab=28.09 E-value=31 Score=15.78 Aligned_cols=12 Identities=42% Similarity=0.872 Sum_probs=9.9 Q ss_pred CHHHHHHHHHHH Q ss_conf 979999999987 Q gi|254781104|r 60 QPDRIKDLVSLY 71 (125) Q Consensus 60 ~P~RL~~La~~y 71 (125) ||+|+..+|++| T Consensus 143 RPdri~~IAerf 154 (314) T TIGR02238 143 RPDRIKAIAERF 154 (314) T ss_pred CHHHHHHHHHHC T ss_conf 817899887540 No 68 >PRK09458 pspB phage shock protein B; Provisional Probab=27.75 E-value=48 Score=14.61 Aligned_cols=39 Identities=21% Similarity=0.292 Sum_probs=26.2 Q ss_pred HHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999-----9999999999999999999999999 Q gi|254781104|r 15 ASITITYSIKHETE-----GKKEKLRILENKITSEQNYIDLLKA 53 (125) Q Consensus 15 ~l~~~~y~iK~~t~-----~~~~ei~~L~~~I~~~~~~i~~LkA 53 (125) .=.+.+|.-|.++. .-.+.+..|-..-...+++|..|++ T Consensus 20 iWLiLHY~sk~k~~~~Ls~~d~~~L~~L~~~A~~m~eRI~tLE~ 63 (75) T PRK09458 20 IWLWLHYRSKRQVSQGLSQEEQQRLAQLTEKAEKMRERIQTLEA 63 (75) T ss_pred HHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999866240246799999999999999999999999999999 No 69 >CHL00167 consensus Probab=27.35 E-value=49 Score=14.57 Aligned_cols=53 Identities=13% Similarity=0.092 Sum_probs=32.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999999999999999999999999997 Q gi|254781104|r 5 LDFIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWAL 57 (125) Q Consensus 5 ~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEway 57 (125) ..++.+++.++-.+........-......|..-+.......+.+...+++|+- T Consensus 37 ~ivigiL~~fg~~~L~~~L~~Rke~I~~~I~eAE~r~~eA~~~L~eak~~L~q 89 (182) T CHL00167 37 LILLSGLIYLGKNFLGSSLEERQQKVLEAIQESEERLEQANSRLSESEKQLAQ 89 (182) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999984676898999999999999999999999999999999999999 No 70 >pfam05545 FixQ Cbb3-type cytochrome oxidase component FixQ. This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon. Probab=27.07 E-value=47 Score=14.65 Aligned_cols=23 Identities=9% Similarity=0.176 Sum_probs=16.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89999999999999999999999 Q gi|254781104|r 4 TLDFIILGVVLASITITYSIKHE 26 (125) Q Consensus 4 ~~~ii~~~~~i~l~~~~y~iK~~ 26 (125) ...++.++++++..+|+|.-+++ T Consensus 13 ~~~l~~~~~Figiv~wa~~p~~k 35 (49) T pfam05545 13 WGTLVLFVFFIGIVFWAYSPRRK 35 (49) T ss_pred HHHHHHHHHHHHHHHHHCCCCCH T ss_conf 99999999999999987287750 No 71 >pfam10925 DUF2680 Protein of unknown function (DUF2680). Members in this family of proteins are annotated as yckD however currently no function is known. Probab=26.78 E-value=50 Score=14.50 Aligned_cols=34 Identities=12% Similarity=0.151 Sum_probs=28.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999999999999999999997189 Q gi|254781104|r 27 TEGKKEKLRILENKITSEQNYIDLLKAQWALLIQ 60 (125) Q Consensus 27 t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~ 60 (125) |...-++|..|+++|.+.+..+-.=.+||..++. T Consensus 2 T~qQk~ei~~l~kqi~el~Kqii~KyVe~G~iT~ 35 (59) T pfam10925 2 TDQQKKEIEALYKQIAELRKQVIDKYVEAGVITK 35 (59) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCH T ss_conf 6888999999999999999999999998275649 No 72 >COG5509 Uncharacterized small protein containing a coiled-coil domain [Function unknown] Probab=26.31 E-value=51 Score=14.45 Aligned_cols=25 Identities=36% Similarity=0.329 Sum_probs=15.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999997 Q gi|254781104|r 33 KLRILENKITSEQNYIDLLKAQWAL 57 (125) Q Consensus 33 ei~~L~~~I~~~~~~i~~LkAEway 57 (125) -+..|+..|...++.|.-|+||.+- T Consensus 26 sV~El~eRIalLq~EIeRlkAe~~k 50 (65) T COG5509 26 SVAELEERIALLQAEIERLKAELAK 50 (65) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 1888999999999999999999986 No 73 >COG2143 Thioredoxin-related protein [Posttranslational modification, protein turnover, chaperones] Probab=25.52 E-value=53 Score=14.36 Aligned_cols=22 Identities=14% Similarity=0.166 Sum_probs=16.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHH Q ss_conf 9118999999999999999999 Q gi|254781104|r 1 MFKTLDFIILGVVLASITITYS 22 (125) Q Consensus 1 M~r~~~ii~~~~~i~l~~~~y~ 22 (125) |||.+.++++++++++++-+.. T Consensus 1 ~mRvl~i~Lliis~fl~a~~s~ 22 (182) T COG2143 1 VMRVLLIVLLIISLFLSACKSN 22 (182) T ss_pred CCCHHHHHHHHHHHHHHHHHCC T ss_conf 9001999999999999997178 No 74 >PRK00442 tatA twin arginine translocase protein A; Provisional Probab=23.50 E-value=58 Score=14.13 Aligned_cols=44 Identities=16% Similarity=0.206 Sum_probs=30.6 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 18999999999999999999999999999999999999999999 Q gi|254781104|r 3 KTLDFIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQN 46 (125) Q Consensus 3 r~~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~ 46 (125) ...+.+.+++++.+.|.+-.+++--..+-+-|...++..+.+.+ T Consensus 5 s~~hwlivl~iv~llFGtkkl~~lg~dlG~~iK~Fkk~m~d~~~ 48 (90) T PRK00442 5 DWKHWIVILVVVVLVFGTKKLKNLGSDVGESIKGFRKAMNEEED 48 (90) T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 69999999999999946403546788899999999987333345 No 75 >pfam06720 Phi-29_GP16_7 Bacteriophage phi-29 early protein GP16.7. This family consists of several bacteriophage phi-29 early protein GP16.7 sequences of around 130 residues in length. The function of this family is unknown. Probab=23.40 E-value=58 Score=14.12 Aligned_cols=34 Identities=15% Similarity=0.176 Sum_probs=23.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999999999999999 Q gi|254781104|r 23 IKHETEGKKEKLRILENKITSEQNYIDLLKAQWA 56 (125) Q Consensus 23 iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwa 56 (125) -+.+.+..+.++..|++.|+..+.-++.+.---+ T Consensus 25 ~~~e~~e~~~~~e~l~~~i~q~~qil~e~ne~i~ 58 (130) T pfam06720 25 KIQEARELEDYLEDLNKRIAQRTQILSELNEVIS 58 (130) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 1788988899999999999999999999999870 No 76 >TIGR01061 parC_Gpos DNA topoisomerase IV, A subunit; InterPro: IPR005741 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis , . DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA . Type II topoisomerases are ATP-dependent enzymes, and can be subdivided according to their structure and reaction mechanisms: type IIA (topoisomerase II or gyrase, and topoisomerase IV) and type IIB (topoisomerase VI). These enzymes are responsible for relaxing supercoiled DNA as well as for introducing both negative and positive supercoils . This entry represents subunit A (parC) of topoisomerase IV from Gram positive bacteria. Topoisomerase IV primarily decatenates DNA and relaxes positive supercoils, which is important in bacteria, where the circular chromosome becomes catenated, or linked, during replication . Topoisomerase IV consists of two polypeptide subunits, parC (subunit A), which is homologous to gyrA of topoisomerase II, and parE (subunit B), which is homologous to gyrB of topoisomerase II. More information about this protein can be found at Protein of the Month: DNA Topoisomerase .; GO: 0003677 DNA binding, 0003918 DNA topoisomerase (ATP-hydrolyzing) activity, 0005524 ATP binding, 0006265 DNA topological change, 0006268 DNA unwinding during replication, 0005694 chromosome. Probab=23.05 E-value=59 Score=14.07 Aligned_cols=68 Identities=31% Similarity=0.392 Sum_probs=44.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC---HHHHHHHHHHHHHHHCCEECCHHHCCCHHHC Q ss_conf 99999999999999999999999999999999999997189---7999999998776418540588883228885 Q gi|254781104|r 20 TYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQ---PDRIKDLVSLYQKELQLQATNPINLITYDDL 91 (125) Q Consensus 20 ~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~---P~RL~~La~~y~~~L~L~Pi~p~qi~~~~~l 91 (125) .+..|.|...+.+++..|+.-|++++.+.++|+-+.--+-- -.|...|-+ .+.=.-+.-++++.-+++ T Consensus 429 i~~L~~E~~eL~~~~~~l~~~i~~~k~~~~~lk~~l~~~KK~F~~~R~s~l~~----~~~~~~i~~~~li~~~~~ 499 (745) T TIGR01061 429 IFELKEEEEELEKKIKSLEEIIASEKARKKLLKKKLKELKKKFAQERRSKLED----EIEKIKINEDELIEKEDL 499 (745) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHH----HHHHEECCHHHHHHHCCC T ss_conf 77778789999999999999876578999998999998723105750456751----165301046785321464 No 77 >COG3416 Uncharacterized protein conserved in bacteria [Function unknown] Probab=23.01 E-value=59 Score=14.07 Aligned_cols=49 Identities=12% Similarity=0.122 Sum_probs=37.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 999999999999999999999999999999999999997189799999999 Q gi|254781104|r 19 ITYSIKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLVS 69 (125) Q Consensus 19 ~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La~ 69 (125) ..|..--.+-..|+-|.+...+|++++.+|+.|+++- =.++.=|..+.+ T Consensus 42 a~Y~laQ~vliqE~ALk~a~~~i~eLe~ri~~lq~~~--~~sgsFLs~~f~ 90 (233) T COG3416 42 AAYYLAQRVLIQEQALKKASTQIKELEKRIAILQAGE--AGSGSFLSNAFK 90 (233) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC--CCCCCHHHHHCC T ss_conf 4899999999999999999999999999999984146--788613321103 No 78 >TIGR01242 26Sp45 26S proteasome subunit P45 family; InterPro: IPR005937 Intracellular proteins, including short-lived proteins such as cyclin, Mos, Myc, p53, NF-kappaB, and IkappaB, are degraded by the ubiquitin-proteasome system. The 26S proteasome (a 2 MDa complex) is made up of two subcomplexes: the 20S proteasome and the regulatory complex. The former is a 700 kDa cylindrical protease complex consisting of four stacks of heptameric rings with 28 subunits (i.e., 7777) with molecular masses of about 20-35 kDa, whereas the latter is a 700-1000 kDa complex consisting of at least 18 subunits with molecular masses of 28-110 kDa, including 6 putative ATPases (Rpt1-Rpt6) and 12 non-ATPase subunits (Rpn1-12). Members of the 26S proteasome subunit P45 family: ATPase p45/Sug1/Rpt6 may be phosphorylated within the proteasome. This phosphorylation event may play a key role in ATP-dependent proteolysis because a good correlation exists between the inhibition pattern of protein kinase inhibitors against the phosphorylation of p45 and that against the ATP-dependent proteolytic activity , . More information about these protein can be found at Protein of the Month: AAA ATPases .; GO: 0016787 hydrolase activity, 0030163 protein catabolic process, 0005634 nucleus, 0005737 cytoplasm. Probab=22.76 E-value=60 Score=14.04 Aligned_cols=47 Identities=17% Similarity=0.251 Sum_probs=39.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 99999999999999999999999999999999999718979999999 Q gi|254781104|r 22 SIKHETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQPDRIKDLV 68 (125) Q Consensus 22 ~iK~~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~P~RL~~La 68 (125) ..+...+.++.+...|+++=...+.+...|+.|...|.+|=-+...+ T Consensus 3 ~l~~~ir~Le~~~~~le~e~~~l~~E~~~lr~E~~rlr~PP~~~~~V 49 (364) T TIGR01242 3 ELKERIRKLEDEKKLLEKEKIRLERELERLRSEIERLRSPPLIVGTV 49 (364) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 27899998888888788777789999999999999845898278875 No 79 >pfam03978 Borrelia_REV Borrelia burgdorferi REV protein. This family consists of several REV proteins from Borrelia burgdorferi (Lyme disease spirochete). The function of REV is unknown although it known that gene is induced during the ingesting of host blood suggesting a role in the metabolic activation of borreliae to adapt to physiological stimuli. Probab=21.52 E-value=63 Score=13.88 Aligned_cols=71 Identities=21% Similarity=0.305 Sum_probs=45.1 Q ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHCCHH---HHHHHHHHHHHHH Q ss_conf 9999999999999999999-99999999999999999999-----9999999999999718979---9999999877641 Q gi|254781104|r 5 LDFIILGVVLASITITYSI-KHETEGKKEKLRILENKITS-----EQNYIDLLKAQWALLIQPD---RIKDLVSLYQKEL 75 (125) Q Consensus 5 ~~ii~~~~~i~l~~~~y~i-K~~t~~~~~ei~~L~~~I~~-----~~~~i~~LkAEwayL~~P~---RL~~La~~y~~~L 75 (125) +.++.+.+++.+++-+|.+ |-+...+...+..|+..... -++.|..|+-..-|-..-+ .+-.|-..|++.| T Consensus 7 ~kLfF~smlfvmaCk~yv~ek~~Idsl~~~Vs~Lnn~~d~~~fknyk~kin~LKe~lKdv~NAEl~EK~L~Lq~lFQdKL 86 (160) T pfam03978 7 VKLFFVSMLFVMACKAYVEEKKEIDSLLEGVSKLNNKSDETKFKNYKEKINELKEDLKDVGNAELEEKLLNLQSLFQDKL 86 (160) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999999999888615733288998799999999999887331899999999999999999 No 80 >PRK09173 F0F1 ATP synthase subunit B; Validated Probab=21.31 E-value=64 Score=13.86 Aligned_cols=43 Identities=14% Similarity=0.137 Sum_probs=21.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999999999999999999999999 Q gi|254781104|r 7 FIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQNYID 49 (125) Q Consensus 7 ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~~~i~ 49 (125) +++|++|+++..|...=+.-+..++++-..++.+|...+..-. T Consensus 8 ~I~fvi~~~ll~~~~~p~~i~~~Ld~R~~~I~~~l~~Ae~~~~ 50 (159) T PRK09173 8 FVGLVLFLALVVYLKVPGMIGRSLDERADRIKNELDEARRLRE 50 (159) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999977899999999999999999999998899 No 81 >pfam11488 Lge1 Transcriptional regulatory protein LGE1. This family of proteins is conserved in yeasts where it is involved in the ubiquitination of histones H2A and H2B. This ubiquitination step is a vital one in the regulation of the transcriptional activity of RNA polymerase II. In S. cerevisiae, Rad6 and Bre1 are present in a complex, also containing Lge1, that is required for H2B ubiquitination. Bre1 is the H2B ubiquitin ligase that interacts with acidic activators, such as Gal4, and recruits Rad6 and its binding partner Lge1 to target promoters. In S. pombe the equivalent protein to Lge1 appears to be Shf1. Probab=21.11 E-value=64 Score=13.83 Aligned_cols=35 Identities=26% Similarity=0.194 Sum_probs=27.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 99999999999999999999999999999997189 Q gi|254781104|r 26 ETEGKKEKLRILENKITSEQNYIDLLKAQWALLIQ 60 (125) Q Consensus 26 ~t~~~~~ei~~L~~~I~~~~~~i~~LkAEwayL~~ 60 (125) +.+..-++...+..+|...+..+..|+.||+.|.. T Consensus 178 ~le~~~~e~~~~d~~L~~~~~~~~k~e~~~~~le~ 212 (234) T pfam11488 178 KLEERYKELEELDQKLEELQLKKLKLELELALLET 212 (234) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99998766888899999999999999999999988 No 82 >pfam09889 DUF2116 Uncharacterized protein containing a Zn-ribbon (DUF2116). This domain, found in various hypothetical archaeal proteins, has no known function. Probab=20.98 E-value=65 Score=13.82 Aligned_cols=21 Identities=10% Similarity=-0.070 Sum_probs=10.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHH Q ss_conf 118999999999999999999 Q gi|254781104|r 2 FKTLDFIILGVVLASITITYS 22 (125) Q Consensus 2 ~r~~~ii~~~~~i~l~~~~y~ 22 (125) +|.-.++..++++++++|+|. T Consensus 36 ~~~q~i~f~i~il~i~v~~~~ 56 (59) T pfam09889 36 RKSQQILFGIFILFLAVYAVL 56 (59) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999998 No 83 >PRK04561 tatA twin arginine translocase protein A; Provisional Probab=20.38 E-value=67 Score=13.74 Aligned_cols=44 Identities=7% Similarity=0.200 Sum_probs=33.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 11899999999999999999999999999999999999999999 Q gi|254781104|r 2 FKTLDFIILGVVLASITITYSIKHETEGKKEKLRILENKITSEQ 45 (125) Q Consensus 2 ~r~~~ii~~~~~i~l~~~~y~iK~~t~~~~~ei~~L~~~I~~~~ 45 (125) |.+-+.+.+++++.+.|.+-.+++--.++-+-|...++.+.++. T Consensus 4 ~Siwh~lIv~~ivvllFGtkkL~~lg~DlG~aik~FKk~m~ddd 47 (75) T PRK04561 4 FSIWHWLVVLVIVLLVFGTKRLTSGAKDLGSAVKEFKKGMHDDD 47 (75) T ss_pred CCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 53899999999999983653031478889999999998734345 No 84 >smart00394 RIIa RIIalpha, Regulatory subunit portion of type II PKA R-subunit. RIIalpha, Regulatory subunit portion of type II PKA R-subunit. Contains dimerisation interface and binding site for A-kinase-anchoring proteins (AKAPs). Probab=20.07 E-value=50 Score=14.51 Aligned_cols=20 Identities=20% Similarity=0.321 Sum_probs=16.6 Q ss_pred HCCHHHHHHHHHHHHHHHCC Q ss_conf 18979999999987764185 Q gi|254781104|r 58 LIQPDRIKDLVSLYQKELQL 77 (125) Q Consensus 58 L~~P~RL~~La~~y~~~L~L 77 (125) -++|+++-..+..||++|.- T Consensus 16 ~~~P~d~~~F~~~yF~kL~~ 35 (38) T smart00394 16 RAQPSDLVQFAADYFEKLEE 35 (38) T ss_pred HHCCCCHHHHHHHHHHHHHH T ss_conf 83987399999999999877 Done!