Query gi|254780730|ref|YP_003065143.1| hypothetical protein CLIBASIA_03085 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 120 No_of_seqs 110 out of 335 Neff 2.6 Searched_HMMs 39220 Date Sun May 29 19:34:09 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780730.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG3847 Flp Flp pilus assembly 99.7 1.6E-16 4.1E-21 112.8 7.3 56 3-58 2-57 (58) 2 pfam04964 Flp_Fap Flp/Fap pili 99.6 1.3E-15 3.2E-20 107.8 5.0 47 9-55 1-47 (47) 3 COG4961 TadG Flp pilus assembl 97.2 0.0016 4.1E-08 40.1 7.0 35 3-37 9-43 (185) 4 pfam05307 Bundlin Bundlin. Thi 86.6 0.67 1.7E-05 25.5 3.0 31 10-40 8-38 (97) 5 TIGR01129 secD protein-export 82.2 3.9 0.0001 21.2 5.3 39 14-52 364-402 (522) 6 COG0342 SecD Preprotein transl 81.5 4.5 0.00011 20.9 5.4 44 12-55 335-378 (506) 7 COG4537 ComGC Competence prote 80.6 2.9 7.5E-05 21.9 4.2 40 3-42 1-40 (107) 8 pfam04021 Class_IIIsignal Clas 80.4 1.4 3.5E-05 23.8 2.4 23 13-35 1-23 (28) 9 COG4966 PilW Tfp pilus assembl 74.1 3 7.6E-05 21.9 2.7 58 7-65 6-80 (318) 10 TIGR01707 gspI general secreti 70.8 8.2 0.00021 19.4 4.4 52 15-66 1-65 (103) 11 pfam07811 TadE TadE-like prote 59.7 13 0.00034 18.3 3.7 23 15-37 1-23 (43) 12 PRK13024 bifunctional preprote 58.1 16 0.00041 17.8 5.4 38 14-51 256-293 (741) 13 PRK05812 secD preprotein trans 53.7 19 0.00049 17.3 5.3 39 14-52 343-381 (513) 14 COG1681 FlaB Archaeal flagelli 52.0 15 0.00039 17.9 3.0 31 12-42 1-32 (209) 15 PRK12933 secD preprotein trans 49.8 22 0.00057 17.0 5.1 41 14-54 434-474 (604) 16 pfam12555 TPPK_C Thiamine pyro 48.9 23 0.00058 16.9 5.0 46 7-55 4-49 (53) 17 PRK10506 hypothetical protein; 43.9 28 0.0007 16.5 4.2 35 10-44 1-35 (155) 18 COG2165 PulG Type II secretory 43.6 28 0.00071 16.4 4.8 37 10-46 4-40 (149) 19 TIGR00872 gnd_rel 6-phosphoglu 41.7 12 0.0003 18.5 1.1 31 18-60 196-227 (341) 20 PRK08808 general secretion pat 41.4 30 0.00077 16.2 4.4 31 11-41 3-33 (209) 21 PRK00442 tatA twin arginine tr 41.2 30 0.00078 16.2 6.8 70 23-94 10-81 (90) 22 pfam04917 Shufflon_N Bacterial 37.4 35 0.0009 15.9 5.6 37 10-46 2-38 (356) 23 TIGR01710 typeII_sec_gspG gene 36.0 37 0.00095 15.7 3.8 39 14-54 1-42 (137) 24 pfam10731 Anophelin Thrombin i 33.7 41 0.001 15.5 4.3 41 23-83 6-46 (65) 25 TIGR02523 type_IV_pilV type IV 32.3 43 0.0011 15.4 5.6 42 13-54 1-46 (163) 26 PRK13464 F0F1 ATP synthase sub 31.0 45 0.0012 15.3 3.0 39 5-43 3-41 (101) 27 TIGR03322 alt_F1F0_F0_C altern 28.8 12 0.0003 18.5 -0.6 45 35-79 8-52 (86) 28 COG4967 PilV Tfp pilus assembl 26.8 54 0.0014 14.8 5.2 38 11-48 7-44 (162) 29 PRK10332 hypothetical protein; 26.7 54 0.0014 14.8 5.0 40 9-49 4-43 (107) 30 COG1991 Uncharacterized conser 26.4 8.7 0.00022 19.3 -1.7 33 4-36 3-35 (131) 31 PRK08541 flagellin; Validated 25.6 56 0.0014 14.7 2.3 30 12-41 1-31 (212) 32 PRK02958 tatA twin arginine tr 24.2 60 0.0015 14.6 4.2 43 22-64 9-53 (75) 33 TIGR02600 TIGR02600 Verrucomic 24.1 61 0.0015 14.5 2.7 28 16-43 2-29 (1697) 34 TIGR01708 typeII_sec_gspH gene 23.8 62 0.0016 14.5 3.2 48 11-58 1-53 (175) 35 PRK01026 tetrahydromethanopter 23.5 62 0.0016 14.5 3.4 26 17-42 48-73 (75) 36 PRK13892 conjugal transfer pro 23.1 63 0.0016 14.4 2.3 28 22-49 62-89 (141) 37 pfam12301 CD99L2 CD99 antigen 22.7 57 0.0015 14.7 1.9 28 14-41 103-130 (154) 38 PRK13470 consensus 22.7 20 0.0005 17.3 -0.5 33 47-79 21-53 (88) 39 TIGR01410 tatB twin arginine-t 22.3 66 0.0017 14.3 3.8 35 23-57 9-52 (82) 40 COG4970 FimT Tfp pilus assembl 22.0 67 0.0017 14.3 3.7 49 13-61 7-60 (181) 41 PRK13871 conjugal transfer pro 22.0 51 0.0013 14.9 1.6 28 22-49 52-79 (134) 42 COG2976 Uncharacterized protei 21.6 68 0.0017 14.3 3.6 45 2-56 10-66 (207) 43 COG3043 NapB Nitrate reductase 21.0 70 0.0018 14.2 3.7 43 49-91 33-79 (155) 44 PRK13471 F0F1 ATP synthase sub 20.5 22 0.00057 17.0 -0.5 52 28-79 7-58 (85) 45 pfam06295 DUF1043 Protein of u 20.4 72 0.0018 14.1 3.2 12 47-58 66-77 (128) 46 TIGR00411 redox_disulf_1 redox 20.2 26 0.00066 16.6 -0.3 15 12-26 40-54 (82) No 1 >COG3847 Flp Flp pilus assembly protein, pilin Flp [Intracellular trafficking and secretion] Probab=99.67 E-value=1.6e-16 Score=112.79 Aligned_cols=56 Identities=41% Similarity=0.746 Sum_probs=52.7 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 69999996122570299999999999999999998763589999999973101478 Q gi|254780730|r 3 MNIIKKILKNGSGATAIEYGLLASLVSVAIISAVSTLGDRMKGVYQTISTELDKGD 58 (120) Q Consensus 3 Mn~ikrFl~DEsGATAIEYgLIaALIaVAIIaavt~LGt~L~~~FttVst~LnkG~ 58 (120) .++++||+|||+|||+||||||++||++++|++++.+|+++++.|+.+++.|+.+. T Consensus 2 ~~~~~rF~rDE~GAtaiEYglia~lIav~ii~~~~~l~~~l~~~ft~i~~al~~a~ 57 (58) T COG3847 2 KKLLRRFLRDEDGATAIEYGLIAALIAVVIIAGGSTLGTALKGAFTAIGAALTGAA 57 (58) T ss_pred CHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 17899997745651899999999999999999999998999999999999870468 No 2 >pfam04964 Flp_Fap Flp/Fap pilin component. Probab=99.59 E-value=1.3e-15 Score=107.79 Aligned_cols=47 Identities=53% Similarity=0.813 Sum_probs=45.3 Q ss_pred HHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 96122570299999999999999999998763589999999973101 Q gi|254780730|r 9 ILKNGSGATAIEYGLLASLVSVAIISAVSTLGDRMKGVYQTISTELD 55 (120) Q Consensus 9 Fl~DEsGATAIEYgLIaALIaVAIIaavt~LGt~L~~~FttVst~Ln 55 (120) |++||+|+||||||||++||++++|++++.+|++++++|++|++.|. T Consensus 1 F~kde~GaTAIEYgLIaalIav~iI~~~~~~g~~l~~~f~~i~~~l~ 47 (47) T pfam04964 1 FLKDESGATAIEYGLIAALIAVVIIAYVTTLGTALKTKFTSIGTALT 47 (47) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 96565641599999999999999999999970129889999886239 No 3 >COG4961 TadG Flp pilus assembly protein TadG [Intracellular trafficking and secretion] Probab=97.21 E-value=0.0016 Score=40.15 Aligned_cols=35 Identities=20% Similarity=0.481 Sum_probs=28.9 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 69999996122570299999999999999999998 Q gi|254780730|r 3 MNIIKKILKNGSGATAIEYGLLASLVSVAIISAVS 37 (120) Q Consensus 3 Mn~ikrFl~DEsGATAIEYgLIaALIaVAIIaavt 37 (120) ..++++|++|++|+.|||.+||+-++.+++.+.+- T Consensus 9 ~~~~~rF~rdr~Ga~AVeFAlvap~ll~l~~g~ve 43 (185) T COG4961 9 RGLLRRFRRDRRGAAAVEFALVAPPLLLLVFGIVE 43 (185) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999887648768999999999999999999999 No 4 >pfam05307 Bundlin Bundlin. This family consists of several bundlin proteins from E. coli. Bundlin is a type IV pilin protein that is the only known structural component of enteropathogenic Escherichia coli bundle-forming pili (BFP). BFP play a role in virulence, antigenicity, autoaggregation, and localized adherence to epithelial cells. Probab=86.58 E-value=0.67 Score=25.46 Aligned_cols=31 Identities=16% Similarity=0.171 Sum_probs=26.6 Q ss_pred HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 6122570299999999999999999998763 Q gi|254780730|r 10 LKNGSGATAIEYGLLASLVSVAIISAVSTLG 40 (120) Q Consensus 10 l~DEsGATAIEYgLIaALIaVAIIaavt~LG 40 (120) .+.|.|.+.||-++.+|++++.+.+++--+- T Consensus 8 kK~~kGlsLiE~~mVLal~A~vIAGvf~YY~ 38 (97) T pfam05307 8 KKYEKGLSLIESAMVLALAATVTAGVMFYYQ 38 (97) T ss_pred HHHHCCCHHHHHHHHHHHHHHHHHHHHEEHH T ss_conf 6675252199999999999999977510307 No 5 >TIGR01129 secD protein-export membrane protein SecD; InterPro: IPR005791 Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component. . From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins. The translocase protein subunits are encoded on the bacterial chromosome. The translocase itself comprises 7 proteins, including a chaperone protein (SecB), an ATPase (SecA), an integral membrane complex (SecCY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) . The chaperone protein SecB is a highly acidic homotetrameric protein that exists as a "dimer of dimers" in the bacterial cytoplasm. SecB maintains preproteins in an unfolded state after translation, and targets these to the peripheral membrane protein ATPase SecA for secretion . Together with SecY and SecG, SecE forms a multimeric channel through which preproteins are translocated, using both proton motive forces and ATP-driven secretion. The latter is mediated by SecA. The structure of the Escherichia coli SecYEG assembly revealed a sandwich of two membranes interacting through the extensive cytoplasmic domains . Each membrane is composed of dimers of SecYEG. The monomeric complex contains 15 transmembrane helices. This entry describes the SecD family of transport proteins. Members of this family are highly variable in length immediately after the well-conserved motif LGLGLXGG at the amino-terminal end of this model. Archaeal homologs are not included in the seed. SecD from Mycobacterium tuberculosis has a long Pro-rich insert. ; GO: 0015450 P-P-bond-hydrolysis-driven protein transmembrane transporter activity, 0006886 intracellular protein transport, 0009276 1-2nm peptidoglycan-based cell wall. Probab=82.18 E-value=3.9 Score=21.19 Aligned_cols=39 Identities=26% Similarity=0.394 Sum_probs=35.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 570299999999999999999998763589999999973 Q gi|254780730|r 14 SGATAIEYGLLASLVSVAIISAVSTLGDRMKGVYQTIST 52 (120) Q Consensus 14 sGATAIEYgLIaALIaVAIIaavt~LGt~L~~~FttVst 52 (120) =|+-+||=|++|+++++++|..+..+==.+-++|..++= T Consensus 364 LG~d~i~~G~~A~~~Gl~lV~~FM~~yY~~~G~~A~~aL 402 (522) T TIGR01129 364 LGADSIEAGIKAGLIGLVLVLVFMIVYYRLFGLIAAIAL 402 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 438999999999999999999999999767889999999 No 6 >COG0342 SecD Preprotein translocase subunit SecD [Intracellular trafficking and secretion] Probab=81.47 E-value=4.5 Score=20.88 Aligned_cols=44 Identities=32% Similarity=0.490 Sum_probs=34.3 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 22570299999999999999999998763589999999973101 Q gi|254780730|r 12 NGSGATAIEYGLLASLVSVAIISAVSTLGDRMKGVYQTISTELD 55 (120) Q Consensus 12 DEsGATAIEYgLIaALIaVAIIaavt~LGt~L~~~FttVst~Ln 55 (120) -+=|+..++.|+++++++++++..+..+=-.+-+++..++.-++ T Consensus 335 psLG~~~i~~gi~Agl~g~~~V~vfm~~~Yr~~Gvia~ial~~n 378 (506) T COG0342 335 PTLGADSIKAGLIAGLIGLALVAVFMLLYYRLAGVIAAIALGLN 378 (506) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 45476789978999999999999999999987589999999999 No 7 >COG4537 ComGC Competence protein ComGC [Intracellular trafficking and secretion] Probab=80.63 E-value=2.9 Score=21.90 Aligned_cols=40 Identities=25% Similarity=0.413 Sum_probs=32.7 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 6999999612257029999999999999999999876358 Q gi|254780730|r 3 MNIIKKILKNGSGATAIEYGLLASLVSVAIISAVSTLGDR 42 (120) Q Consensus 3 Mn~ikrFl~DEsGATAIEYgLIaALIaVAIIaavt~LGt~ 42 (120) |..+..|++++.|-|.||--+.+..|++-++..+-.+-.+ T Consensus 1 m~~~~k~~~~~kgFTLvEMLiVLlIISiLlLl~iPNltKq 40 (107) T COG4537 1 MKKMKKFLKHKKGFTLVEMLIVLLIISILLLLFIPNLTKQ 40 (107) T ss_pred CHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCHHHH T ss_conf 9167777776326259999999999999999974625443 No 8 >pfam04021 Class_IIIsignal Class III signal peptide. This family of archaeal proteins contains. an amino terminal motif QXSXEXXXL that has been suggested to be part of a class III signal sequence. With the Q being the +1 residue of the signal peptidase cleavage site. Two members of this family are cleaved by a type IV pilin-like signal peptidase. Probab=80.39 E-value=1.4 Score=23.76 Aligned_cols=23 Identities=35% Similarity=0.371 Sum_probs=17.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 25702999999999999999999 Q gi|254780730|r 13 GSGATAIEYGLLASLVSVAIISA 35 (120) Q Consensus 13 EsGATAIEYgLIaALIaVAIIaa 35 (120) .+|+.++||.++.+.+.++.+.+ T Consensus 1 ~kGQ~SlE~~lLi~~vlv~~~i~ 23 (28) T pfam04021 1 KKGQISLEFLLLILAVLVVAIIA 23 (28) T ss_pred CCCEEEHHHHHHHHHHHHHHHHH T ss_conf 98654399999999999999771 No 9 >COG4966 PilW Tfp pilus assembly protein PilW [Cell motility and secretion / Intracellular trafficking and secretion] Probab=74.14 E-value=3 Score=21.86 Aligned_cols=58 Identities=17% Similarity=0.247 Sum_probs=35.2 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHH-----------------HHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 999612257029999999999999999999876-----------------35899999999731014789876788 Q gi|254780730|r 7 KKILKNGSGATAIEYGLLASLVSVAIISAVSTL-----------------GDRMKGVYQTISTELDKGDVPPTKPG 65 (120) Q Consensus 7 krFl~DEsGATAIEYgLIaALIaVAIIaavt~L-----------------Gt~L~~~FttVst~LnkG~~pp~~p~ 65 (120) +...+.++|.+.||. ||+.+|++.++.++..+ -+.....++-|+-+|..++-....+. T Consensus 6 ~~~~rrqrG~SLIEL-MIallIglivL~av~s~y~~sr~~y~~~~~~a~~qE~gR~Al~litrdlr~AG~~~~~n~ 80 (318) T COG4966 6 RARPRRQRGFSLIEL-MIALLIGLIVLLAVGSLYLSSRQLYTTLADRASLQENGRFALELITRDLRMAGFFGCSNN 80 (318) T ss_pred CCCCCCCCCCCHHHH-HHHHHHHHHHHHHHHHEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 355455577439999-999999999999776303435677887766653123157899999999997188667888 No 10 >TIGR01707 gspI general secretion pathway protein I; InterPro: IPR010052 This entry represents GspI, one of two proteins highly conserved at their N-termini but easily separable phylogenetically. The other is GspJ (IPR010055 from INTERPRO). Both GspI and GspJ are proteins of the type II secretion pathway, or main terminal branch of the general secretion pathway. This pathway carries proteins across the outer membrane. Note that proteins of type II secretion are cryptic in Escherichia coli K-12 - present but not yet demonstrated to act on any target.; GO: 0008565 protein transporter activity, 0015628 protein secretion by the type II secretion system, 0015627 type II protein secretion system complex. Probab=70.78 E-value=8.2 Score=19.39 Aligned_cols=52 Identities=23% Similarity=0.418 Sum_probs=41.2 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHHCCCCCCCCCCCC Q ss_conf 702999999999999999999987635899999-------------9997310147898767887 Q gi|254780730|r 15 GATAIEYGLLASLVSVAIISAVSTLGDRMKGVY-------------QTISTELDKGDVPPTKPGS 66 (120) Q Consensus 15 GATAIEYgLIaALIaVAIIaavt~LGt~L~~~F-------------ttVst~LnkG~~pp~~p~~ 66 (120) |-|.+|-=+-+|.+++|+++.+.+.|..+++.= +.+.+++.-+-+||..+++ T Consensus 1 GFTLlEVLVALAIFa~aa~~~~~~~~~~~~~~~rL~dk~LA~~~aDNrl~se~~l~v~P~~~~~~ 65 (103) T TIGR01707 1 GFTLLEVLVALAIFALAALAVLSSVGEQTNAVGRLRDKTLALWVADNRLASELSLGVTPPSLGRN 65 (103) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 91499999999999999999999998765443566899999999987776577606798634677 No 11 >pfam07811 TadE TadE-like protein. The members of this family are similar to a region of the protein product of the bacterial tadE locus. In various bacterial species, the tad locus is closely linked to flp-like genes, which encode proteins required for the production of pili involved in adherence to surfaces. It is thought that the tad loci encode proteins that act to assemble or export an Flp pilus in various bacteria. All tad loci but TadA have putative transmembrane regions, and in fact the region in question is this family has a high proportion of hydrophobic amino acid residues. Probab=59.71 E-value=13 Score=18.25 Aligned_cols=23 Identities=26% Similarity=0.557 Sum_probs=18.1 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 70299999999999999999998 Q gi|254780730|r 15 GATAIEYGLLASLVSVAIISAVS 37 (120) Q Consensus 15 GATAIEYgLIaALIaVAIIaavt 37 (120) |+++||.+|++-++.+.+.+.+. T Consensus 1 G~a~VEfalv~p~~l~l~~~~~~ 23 (43) T pfam07811 1 GAAAVEFALVLPVLLLLLFGIVE 23 (43) T ss_pred CHHHHHHHHHHHHHHHHHHHHHH T ss_conf 91699999999999999999999 No 12 >PRK13024 bifunctional preprotein translocase subunit SecD/SecF; Reviewed Probab=58.12 E-value=16 Score=17.76 Aligned_cols=38 Identities=26% Similarity=0.419 Sum_probs=18.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 57029999999999999999999876358999999997 Q gi|254780730|r 14 SGATAIEYGLLASLVSVAIISAVSTLGDRMKGVYQTIS 51 (120) Q Consensus 14 sGATAIEYgLIaALIaVAIIaavt~LGt~L~~~FttVs 51 (120) -|+.+++.|+++++|+++++..+..+-=.+-+....++ T Consensus 256 LG~~si~~g~~A~~ig~~lV~lfMi~~Yr~~G~iA~ia 293 (741) T PRK13024 256 LGQDAINAGIIAGIIGFALIALFMLLFYGLPGLIANIA 293 (741) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 10999999999999999999999999986769999999 No 13 >PRK05812 secD preprotein translocase subunit SecD; Reviewed Probab=53.68 E-value=19 Score=17.34 Aligned_cols=39 Identities=31% Similarity=0.439 Sum_probs=28.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 570299999999999999999998763589999999973 Q gi|254780730|r 14 SGATAIEYGLLASLVSVAIISAVSTLGDRMKGVYQTIST 52 (120) Q Consensus 14 sGATAIEYgLIaALIaVAIIaavt~LGt~L~~~FttVst 52 (120) =|+.+|+=|+++++|++++|..+..+--.+-+++..++= T Consensus 343 LG~~si~~g~~A~~ig~~lV~~fmi~~Yr~~G~~a~~aL 381 (513) T PRK05812 343 LGADSIRAGLIAGLIGLALVLLFMLLYYRLFGLIANIAL 381 (513) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 009999999999999999999999999857699999999 No 14 >COG1681 FlaB Archaeal flagellins [Cell motility and secretion] Probab=52.05 E-value=15 Score=17.87 Aligned_cols=31 Identities=26% Similarity=0.317 Sum_probs=20.8 Q ss_pred CCCCHHHHHHHHH-HHHHHHHHHHHHHHHHHH Q ss_conf 2257029999999-999999999999876358 Q gi|254780730|r 12 NGSGATAIEYGLL-ASLVSVAIISAVSTLGDR 42 (120) Q Consensus 12 DEsGATAIEYgLI-aALIaVAIIaavt~LGt~ 42 (120) |++|++.||=+++ .|+|-+|.+++...+++. T Consensus 1 ~rrG~~GIgtlIVfIAmVlVAAVaA~VlInt~ 32 (209) T COG1681 1 DRRGATGIGTLIVFIAMVLVAAVAAYVLINTG 32 (209) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 98411043289999999999999999986534 No 15 >PRK12933 secD preprotein translocase subunit SecD; Reviewed Probab=49.81 E-value=22 Score=16.98 Aligned_cols=41 Identities=24% Similarity=0.191 Sum_probs=29.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 57029999999999999999999876358999999997310 Q gi|254780730|r 14 SGATAIEYGLLASLVSVAIISAVSTLGDRMKGVYQTISTEL 54 (120) Q Consensus 14 sGATAIEYgLIaALIaVAIIaavt~LGt~L~~~FttVst~L 54 (120) =|+-+|+-|+++++|+++++..+..+=-.+-+++.+++=-+ T Consensus 434 LG~dsi~~g~~A~~iG~~lV~~fM~~~Yr~~G~iA~iaL~~ 474 (604) T PRK12933 434 LGAENIENGFAALALGMGITLLFMALWYRRLGWVANVALIA 474 (604) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 55999999999999999999999999985529999999999 No 16 >pfam12555 TPPK_C Thiamine pyrophosphokinase C terminal. This domain family is found in bacteria, and is approximately 50 amino acids in length. The proteins in this family catalyses the pyrophosphorylation of thiamine in yeast and synthesizes thiamine pyrophosphate (TPP), a thiamine coenzyme. Probab=48.94 E-value=23 Score=16.90 Aligned_cols=46 Identities=20% Similarity=0.314 Sum_probs=32.5 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 9996122570299999999999999999998763589999999973101 Q gi|254780730|r 7 KKILKNGSGATAIEYGLLASLVSVAIISAVSTLGDRMKGVYQTISTELD 55 (120) Q Consensus 7 krFl~DEsGATAIEYgLIaALIaVAIIaavt~LGt~L~~~FttVst~Ln 55 (120) .++-|......++=-.++++|++++++..++..|. ..|+.+.+.+| T Consensus 4 s~LYrsris~~~~~~lvla~Lva~~val~~s~~g~---~~~~~l~~~w~ 49 (53) T pfam12555 4 SRLYRSRISGGALLLLVLAALVALAVALLVSPAGQ---SFLDLLADRWN 49 (53) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCHH---HHHHHHHHHHH T ss_conf 88989227788999999999999999998084229---99999999987 No 17 >PRK10506 hypothetical protein; Provisional Probab=43.90 E-value=28 Score=16.45 Aligned_cols=35 Identities=20% Similarity=0.098 Sum_probs=27.4 Q ss_pred HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 61225702999999999999999999987635899 Q gi|254780730|r 10 LKNGSGATAIEYGLLASLVSVAIISAVSTLGDRMK 44 (120) Q Consensus 10 l~DEsGATAIEYgLIaALIaVAIIaavt~LGt~L~ 44 (120) ++.|+|-|.||--+..++|++....++-.+..... T Consensus 1 ~~~q~GFTLiEllvvi~ii~il~~~a~p~~~~~~q 35 (155) T PRK10506 1 MKKQRGYTLIETLVAMTLVVILSAWGLYGWQYWQQ 35 (155) T ss_pred CCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98556627999999999999999887777999999 No 18 >COG2165 PulG Type II secretory pathway, pseudopilin PulG [Cell motility and secretion / Intracellular trafficking and secretion] Probab=43.63 E-value=28 Score=16.43 Aligned_cols=37 Identities=19% Similarity=0.302 Sum_probs=28.6 Q ss_pred HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 6122570299999999999999999998763589999 Q gi|254780730|r 10 LKNGSGATAIEYGLLASLVSVAIISAVSTLGDRMKGV 46 (120) Q Consensus 10 l~DEsGATAIEYgLIaALIaVAIIaavt~LGt~L~~~ 46 (120) .+.++|=|.||--+.+++|++....++..+....... T Consensus 4 ~~~qrGFTLiElLVvl~Iigil~~~~~p~~~~~~~~~ 40 (149) T COG2165 4 MKKQRGFTLIELLVVLAIIGILAALALPSLQGSIDKA 40 (149) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 4145673899999999999999999999999988999 No 19 >TIGR00872 gnd_rel 6-phosphogluconate dehydrogenase (decarboxylating); InterPro: IPR004849 This family resembles a larger family of bacterial and eukaryotic 6-phosphogluconate dehydrogenases (Gnd) but differs from it by a deep split in a UPGMA similarity clustering tree and the lack of a central region of about 140 residues. Among complete genomes, it is found is found in Bacillus subtilis and Mycobacterium tuberculosis, both of which also contain Gnd, and in Aquifex aeolicus. . Probab=41.72 E-value=12 Score=18.49 Aligned_cols=31 Identities=26% Similarity=0.631 Sum_probs=20.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHCCCCCC Q ss_conf 99999999999999999998763589999999-97310147898 Q gi|254780730|r 18 AIEYGLLASLVSVAIISAVSTLGDRMKGVYQT-ISTELDKGDVP 60 (120) Q Consensus 18 AIEYgLIaALIaVAIIaavt~LGt~L~~~Ftt-Vst~LnkG~~p 60 (120) -||||+++|+ ++.++-+=++ |.+.+..|+.- T Consensus 196 GIEYG~Maai------------aEGfe~L~nsdvGtrvqhGdae 227 (341) T TIGR00872 196 GIEYGMMAAI------------AEGFEVLKNSDVGTRVQHGDAE 227 (341) T ss_pred CCHHHHHHHH------------HHHHHHHHHCCCCCCEECCCCC T ss_conf 5138889999------------8778987524655500037544 No 20 >PRK08808 general secretion pathway protein J; Validated Probab=41.41 E-value=30 Score=16.23 Aligned_cols=31 Identities=29% Similarity=0.306 Sum_probs=26.6 Q ss_pred HCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 1225702999999999999999999987635 Q gi|254780730|r 11 KNGSGATAIEYGLLASLVSVAIISAVSTLGD 41 (120) Q Consensus 11 ~DEsGATAIEYgLIaALIaVAIIaavt~LGt 41 (120) +-+.|=|.||--+-.+|+++.++..+++|-+ T Consensus 3 r~~~GFTLVEvLvalsLlslillll~saLrt 33 (209) T PRK08808 3 RRAAGFTLIEVLLATVLLAGGLALAFATLRS 33 (209) T ss_pred CCCCCCEEHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 4458837619999999999999999999998 No 21 >PRK00442 tatA twin arginine translocase protein A; Provisional Probab=41.24 E-value=30 Score=16.21 Aligned_cols=70 Identities=16% Similarity=0.341 Sum_probs=35.7 Q ss_pred HHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9999999999999--98763589999999973101478987678875578677877432458877653432123 Q gi|254780730|r 23 LLASLVSVAIISA--VSTLGDRMKGVYQTISTELDKGDVPPTKPGSVPMQPESSNPSTRLQPPAKPTSIPVKTK 94 (120) Q Consensus 23 LIaALIaVAIIaa--vt~LGt~L~~~FttVst~LnkG~~pp~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 94 (120) ||+.+|++.+.+. +..+|..|....-..-..|+..+..|+....-|-||. .+.....+-..++.|++..+ T Consensus 10 livl~iv~llFGtkkl~~lg~dlG~~iK~Fkk~m~d~~~k~~~~~~~~a~p~--~~~q~~a~~~~~~Tid~qA~ 81 (90) T PRK00442 10 IVILVVVVLVFGTKKLKNLGSDVGESIKGFRKAMNEEEDKPAEQTPPPAQPV--PPVQNTAQLNQPHTIDAQAQ 81 (90) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCCHHHHH T ss_conf 9999999999464035467888999999999873333457656899877888--86434544468975106776 No 22 >pfam04917 Shufflon_N Bacterial shufflon protein, N-terminal constant region. This family represents the high-similarity N-terminal 'constant region' shared by shufflon proteins. Probab=37.37 E-value=35 Score=15.86 Aligned_cols=37 Identities=22% Similarity=0.358 Sum_probs=30.3 Q ss_pred HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 6122570299999999999999999998763589999 Q gi|254780730|r 10 LKNGSGATAIEYGLLASLVSVAIISAVSTLGDRMKGV 46 (120) Q Consensus 10 l~DEsGATAIEYgLIaALIaVAIIaavt~LGt~L~~~ 46 (120) .+.++|-+.+|-.+.++.+++++...+.-+.+.+..+ T Consensus 2 r~~~kGf~LlE~~~~L~I~~~~~~~~~~~~~~~~~~~ 38 (356) T pfam04917 2 KKTDKGVSLLEVGAVLLIVVMVIPKVAENIEDYLNNV 38 (356) T ss_pred CEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 1203443089999999999999999999999899999 No 23 >TIGR01710 typeII_sec_gspG general secretion pathway protein G; InterPro: IPR010054 This entry represents GspG, protein G of the main terminal branch of the general secretion pathway, also called type II secretion. It transports folded proteins across the bacterial outer membrane and is widely distributed in Gram-negative pathogens.; GO: 0008565 protein transporter activity, 0015628 protein secretion by the type II secretion system, 0015627 type II protein secretion system complex. Probab=36.00 E-value=37 Score=15.74 Aligned_cols=39 Identities=21% Similarity=0.340 Sum_probs=24.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHH Q ss_conf 570299999999999999999998---76358999999997310 Q gi|254780730|r 14 SGATAIEYGLLASLVSVAIISAVS---TLGDRMKGVYQTISTEL 54 (120) Q Consensus 14 sGATAIEYgLIaALIaVAIIaavt---~LGt~L~~~FttVst~L 54 (120) +|=|.+| +|+.+|.+.+.+++. .++..=...-+...++| T Consensus 1 ~GFTLlE--~~VVlvIlGLLaalv~P~l~s~~d~A~~~~a~~qi 42 (137) T TIGR01710 1 RGFTLLE--IMVVLVILGLLAALVVPKLFSQADKAKAQVAKAQI 42 (137) T ss_pred CCCHHHH--HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH T ss_conf 9732678--89999999999999865534875535788899999 No 24 >pfam10731 Anophelin Thrombin inhibitor from mosquito. Members of this family are all inhibitors of thrombin, the peptidase that is at the end of the blood coagulation cascade and which creates the clot by cleaving fibrinogen. The interaction between thrombin and fibrinogen involves two different areas of contact - via the thrombin active site and via a second substrate-binding site known as an exosite. The inhibitor acts by blocking the exosite, rather than by interacting with the active site. The inhibitors are from mosquitoes that feed on human blood and which, by inhibiting thrombin, prevent the blood from clotting and keep it flowing. Probab=33.67 E-value=41 Score=15.52 Aligned_cols=41 Identities=27% Similarity=0.553 Sum_probs=28.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9999999999999987635899999999731014789876788755786778774324588 Q gi|254780730|r 23 LLASLVSVAIISAVSTLGDRMKGVYQTISTELDKGDVPPTKPGSVPMQPESSNPSTRLQPP 83 (120) Q Consensus 23 LIaALIaVAIIaavt~LGt~L~~~FttVst~LnkG~~pp~~p~~~~~~~~~~~~~~~~~~~ 83 (120) ++.+|+++++++.++ ..|--+||..|---|...-+..|+|- T Consensus 6 ~vialLC~aLvavvQ--------------------~APQYa~GeePtYDEdd~d~e~l~ph 46 (65) T pfam10731 6 IVIALLCVALVAVVQ--------------------SAPQYAPGDEPTYDEDDDDDEPLKPH 46 (65) T ss_pred HHHHHHHHHHHHHHH--------------------CCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 279999999999994--------------------07544789998767667765555668 No 25 >TIGR02523 type_IV_pilV type IV pilus modification protein PilV; InterPro: IPR013362 Pilus systems categorized as type IV pilins differ greatly from one another, with some showing greater similarity to type II or type III secretion systems than to each other. Members of this protein family represent the PilV protein of type IV pilus systems found in Pseudomonas aeruginosa PAO1, Pseudomonas syringae DC3000, Neisseria meningitidis MC58, Xylella fastidiosa 9a5c, and other species.. Probab=32.32 E-value=43 Score=15.39 Aligned_cols=42 Identities=14% Similarity=0.199 Sum_probs=29.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHH Q ss_conf 2570299999999999999999998763589999----99997310 Q gi|254780730|r 13 GSGATAIEYGLLASLVSVAIISAVSTLGDRMKGV----YQTISTEL 54 (120) Q Consensus 13 EsGATAIEYgLIaALIaVAIIaavt~LGt~L~~~----FttVst~L 54 (120) +.|.|+||-=+-+=+++|++.+........+.-. ++.++..| T Consensus 1 q~G~~miEVLvAlL~LaiGvLg~~alql~~~~~t~~A~~~~~A~~l 46 (163) T TIGR02523 1 QAGVSMIEVLVALLVLAIGVLGMAALQLKAVRYTKSASERTIASML 46 (163) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9751799999999999999999999998688876889999999999 No 26 >PRK13464 F0F1 ATP synthase subunit C; Provisional Probab=30.99 E-value=45 Score=15.26 Aligned_cols=39 Identities=28% Similarity=0.388 Sum_probs=29.2 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999996122570299999999999999999998763589 Q gi|254780730|r 5 IIKKILKNGSGATAIEYGLLASLVSVAIISAVSTLGDRM 43 (120) Q Consensus 5 ~ikrFl~DEsGATAIEYgLIaALIaVAIIaavt~LGt~L 43 (120) +.-+++.+-.|.|+|--||+.++=++.---++..+|+.. T Consensus 3 m~~q~la~iq~~taia~~i~igl~AlGtaiG~GllggKf 41 (101) T PRK13464 3 MSLQVLGNLNGLTAVAVALLISLPALGTAIGFGVLGGKY 41 (101) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 659999999989999999999999998788898873488 No 27 >TIGR03322 alt_F1F0_F0_C alternate F1F0 ATPase, F0 subunit C. A small number of taxonomically diverse prokaryotic species, including Methanosarcina barkeri, have what appears to be a second ATP synthase, in addition to the normal F1F0 ATPase in bacteria and A1A0 ATPase in archaea. These enzymes use ion gradients to synthesize ATP, and in principle may run in either direction. This model represents the F0 subunit C of this apparent second ATP synthase. Probab=28.82 E-value=12 Score=18.50 Aligned_cols=45 Identities=22% Similarity=0.285 Sum_probs=20.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 998763589999999973101478987678875578677877432 Q gi|254780730|r 35 AVSTLGDRMKGVYQTISTELDKGDVPPTKPGSVPMQPESSNPSTR 79 (120) Q Consensus 35 avt~LGt~L~~~FttVst~LnkG~~pp~~p~~~~~~~~~~~~~~~ 79 (120) +.+.+|-.+.--+-.+...+..|.+-..+-.++--|||.++--+| T Consensus 8 ~~Sii~AGla~gigaiGpgiGeG~aa~~A~e~iaRqPea~~~I~~ 52 (86) T TIGR03322 8 VASIVTAGLTIAIGSIGPALGEGRAVAQALTALAQQPDASNTITR 52 (86) T ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCCCCHHH T ss_conf 999999999999983053245879999999998749420020499 No 28 >COG4967 PilV Tfp pilus assembly protein PilV [Cell motility and secretion / Intracellular trafficking and secretion] Probab=26.80 E-value=54 Score=14.84 Aligned_cols=38 Identities=21% Similarity=0.273 Sum_probs=23.2 Q ss_pred HCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 12257029999999999999999999876358999999 Q gi|254780730|r 11 KNGSGATAIEYGLLASLVSVAIISAVSTLGDRMKGVYQ 48 (120) Q Consensus 11 ~DEsGATAIEYgLIaALIaVAIIaavt~LGt~L~~~Ft 48 (120) +.++|-++||--+-+-+|++++.+......-.+...|+ T Consensus 7 ~~q~GfSLIEVLIA~lll~vglLgla~lQ~~s~~~t~~ 44 (162) T COG4967 7 KKQSGFSLIEVLIAMLLLSVGLLGLAGLQARTLRNTQE 44 (162) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 34466519999999999999999999999999998899 No 29 >PRK10332 hypothetical protein; Provisional Probab=26.67 E-value=54 Score=14.82 Aligned_cols=40 Identities=30% Similarity=0.366 Sum_probs=28.3 Q ss_pred HHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 96122570299999999999999999998763589999999 Q gi|254780730|r 9 ILKNGSGATAIEYGLLASLVSVAIISAVSTLGDRMKGVYQT 49 (120) Q Consensus 9 Fl~DEsGATAIEYgLIaALIaVAIIaavt~LGt~L~~~Ftt 49 (120) -++.++|-...| .||++++.++++.++...-..+..-|.. T Consensus 4 ~l~~Q~GFSLpE-vlvA~lLf~isl~aL~~Yqq~L~~sf~~ 43 (107) T PRK10332 4 SLKNQRGFSLPE-VLLAMVLMVMIVTALSGYQRTLMNSFAS 43 (107) T ss_pred CCCCCCCCCHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 221367887299-9999999999999999999999999999 No 30 >COG1991 Uncharacterized conserved protein [Function unknown] Probab=26.39 E-value=8.7 Score=19.27 Aligned_cols=33 Identities=27% Similarity=0.370 Sum_probs=26.3 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999612257029999999999999999999 Q gi|254780730|r 4 NIIKKILKNGSGATAIEYGLLASLVSVAIISAV 36 (120) Q Consensus 4 n~ikrFl~DEsGATAIEYgLIaALIaVAIIaav 36 (120) .++..|..+.+|+-..||.|+...+.++.+.+. T Consensus 3 ~~i~~~~~~nkgQiSLEf~Ll~l~ivla~~i~~ 35 (131) T COG1991 3 TYITKIILSNKGQISLEFSLLLLAIVLAASIAG 35 (131) T ss_pred CEEEEEEECCCCCEEEEHHHHHHHHHHHHHHEE T ss_conf 166254224666256413899999999731211 No 31 >PRK08541 flagellin; Validated Probab=25.63 E-value=56 Score=14.74 Aligned_cols=30 Identities=23% Similarity=0.265 Sum_probs=16.3 Q ss_pred CCCCHHHHHHHHH-HHHHHHHHHHHHHHHHH Q ss_conf 2257029999999-99999999999987635 Q gi|254780730|r 12 NGSGATAIEYGLL-ASLVSVAIISAVSTLGD 41 (120) Q Consensus 12 DEsGATAIEYgLI-aALIaVAIIaavt~LGt 41 (120) +++|++.||-.++ .|||-||.+++...+++ T Consensus 1 ~kkG~~GigtlIVfIAmVlVAAvaa~Vli~t 31 (212) T PRK08541 1 MKKGAVGIGTLIVFIAMVLVAAVAASVLINT 31 (212) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHEECC T ss_conf 9753024208999999999999877523221 No 32 >PRK02958 tatA twin arginine translocase protein A; Provisional Probab=24.24 E-value=60 Score=14.56 Aligned_cols=43 Identities=12% Similarity=0.389 Sum_probs=29.1 Q ss_pred HHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 99999999999999--98763589999999973101478987678 Q gi|254780730|r 22 GLLASLVSVAIISA--VSTLGDRMKGVYQTISTELDKGDVPPTKP 64 (120) Q Consensus 22 gLIaALIaVAIIaa--vt~LGt~L~~~FttVst~LnkG~~pp~~p 64 (120) =||+.+|++.+.+. +..+|..+....-..-..++..+.++..+ T Consensus 9 ~livl~iv~llFG~kkL~~lg~dlG~aik~FK~~m~~~~~~~~~~ 53 (75) T PRK02958 9 WLIVLLIVALVFGTKKLRNIGNDLGSAVKGFKDGMKEGEAPAAEA 53 (75) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 999999999982753154688889999999998715555675643 No 33 >TIGR02600 TIGR02600 Verrucomicrobium spinosum paralogous protein TIGR02600. Probab=24.10 E-value=61 Score=14.55 Aligned_cols=28 Identities=11% Similarity=0.227 Sum_probs=12.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 0299999999999999999998763589 Q gi|254780730|r 16 ATAIEYGLLASLVSVAIISAVSTLGDRM 43 (120) Q Consensus 16 ATAIEYgLIaALIaVAIIaavt~LGt~L 43 (120) ...|==-+|+|||.|.+++.+...-++. T Consensus 2 ~ALi~vL~~lALiT~LVl~fls~v~~E~ 29 (1697) T TIGR02600 2 MALIMVLIVLALITILVLAFLSMVRTET 29 (1697) T ss_pred CHHHHHHHHHHHHHHHHHHHHHCHHHHH T ss_conf 0488889999999999999871216899 No 34 >TIGR01708 typeII_sec_gspH general secretion pathway protein H; InterPro: IPR010053 This entry represents GspH, protein H of the main terminal branch of the general secretion pathway, also called type II secretion. It transports folded proteins across the bacterial outer membrane and is widely distributed in Gram-negative pathogens.; GO: 0008565 protein transporter activity, 0015628 protein secretion by the type II secretion system, 0015627 type II protein secretion system complex. Probab=23.76 E-value=62 Score=14.51 Aligned_cols=48 Identities=21% Similarity=0.288 Sum_probs=30.9 Q ss_pred HCCCCHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHCCCC Q ss_conf 122570299999999999999999998763-----589999999973101478 Q gi|254780730|r 11 KNGSGATAIEYGLLASLVSVAIISAVSTLG-----DRMKGVYQTISTELDKGD 58 (120) Q Consensus 11 ~DEsGATAIEYgLIaALIaVAIIaavt~LG-----t~L~~~FttVst~LnkG~ 58 (120) +-++|=|.||-=+.+++++++--.+.-+++ ..|...-..++..|.... T Consensus 1 ~r~~GFtLiElLvVl~i~gL~aa~~~~~l~~~~~~~~l~~~a~rl~~~l~~~~ 53 (175) T TIGR01708 1 HRQSGFTLIELLVVLAIMGLVAAAAALSLDSASGTKELDQAAQRLAAELRLAQ 53 (175) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 98775108899999999999999996116778886217899999999999999 No 35 >PRK01026 tetrahydromethanopterin S-methyltransferase subunit G; Provisional Probab=23.51 E-value=62 Score=14.48 Aligned_cols=26 Identities=23% Similarity=0.249 Sum_probs=20.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 29999999999999999999876358 Q gi|254780730|r 17 TAIEYGLLASLVSVAIISAVSTLGDR 42 (120) Q Consensus 17 TAIEYgLIaALIaVAIIaavt~LGt~ 42 (120) -.|=||++++++...+...+..+... T Consensus 48 iGIlYG~viGlli~~~~~~~~~~~~~ 73 (75) T PRK01026 48 IGILYGLVIGLLIVLVYIKLLPIFMI 73 (75) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999987 No 36 >PRK13892 conjugal transfer protein TrbC; Provisional Probab=23.12 E-value=63 Score=14.44 Aligned_cols=28 Identities=14% Similarity=0.523 Sum_probs=19.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999998763589999999 Q gi|254780730|r 22 GLLASLVSVAIISAVSTLGDRMKGVYQT 49 (120) Q Consensus 22 gLIaALIaVAIIaavt~LGt~L~~~Ftt 49 (120) +.-+++|++++.++.-.+|+.+++-|.+ T Consensus 62 Af~isiigiv~AGa~LIFGGElngF~Rt 89 (141) T PRK13892 62 AFALSIIGIVVAGGILIFGGELNGFFRT 89 (141) T ss_pred HHHHHHHHHHHHCHHEEECCHHHHHHHH T ss_conf 9999999999836001555127789999 No 37 >pfam12301 CD99L2 CD99 antigen like protein 2. This family of proteins is found in eukaryotes. Proteins in this family are typically between 165 and 237 amino acids in length. CD99L2 and CD99 are involved in trans-endothelial migration of neutrophils in vitro and in the recruitment of neutrophils into inflamed peritoneum. Probab=22.72 E-value=57 Score=14.69 Aligned_cols=28 Identities=18% Similarity=0.315 Sum_probs=14.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5702999999999999999999987635 Q gi|254780730|r 14 SGATAIEYGLLASLVSVAIISAVSTLGD 41 (120) Q Consensus 14 sGATAIEYgLIaALIaVAIIaavt~LGt 41 (120) ..-+..|-++|+++|+.+.++.+.++.. T Consensus 103 ~~~~~~~~G~IaGIvsAv~vAl~GAvSS 130 (154) T pfam12301 103 GPEGGAETGTIAGIVSAVAVALLGAVSS 130 (154) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 7655657761467999999999988999 No 38 >PRK13470 consensus Probab=22.66 E-value=20 Score=17.29 Aligned_cols=33 Identities=27% Similarity=0.389 Sum_probs=14.8 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 999973101478987678875578677877432 Q gi|254780730|r 47 YQTISTELDKGDVPPTKPGSVPMQPESSNPSTR 79 (120) Q Consensus 47 FttVst~LnkG~~pp~~p~~~~~~~~~~~~~~~ 79 (120) +-.+...+..|.+-..+-.++--|||.++--+| T Consensus 21 igaiGp~iG~G~aa~~A~e~iaRqPea~g~i~~ 53 (88) T PRK13470 21 IGSIGPALGEGRAVATALSALAQQPDASATITR 53 (88) T ss_pred HHHHCCHHHHHHHHHHHHHHHHCCCCCCCHHHH T ss_conf 984144022769999999998739420030299 No 39 >TIGR01410 tatB twin arginine-targeting protein translocase TatB; InterPro: IPR003998 Translocation of proteins across the two membranes of Gram-negative bacteria can be carried out via a number of routes. Most proteins marked for export carry a secretion signal at their N-terminus, and are secreted by the general secretory pathway. The signal peptide is cleaved as they pass through the outer membrane. Other secretion systems include the type III system found in a select group of Gram-negative plant and animal pathogens, and the CagA system of Helicobacter pylori . In some bacterial species, however, there exists a system that operates independently of the Sec pathway . It selectively translocates periplasmic-bound molecules that are synthesised with, or are in close association with, "partner" proteins bearing an (S/T)RRXFLK twin arginine motif at the N-terminus. The pathway is therefore termed the Twin-Arginine Translocation or TAT system. Surprisingly, the four components that make up the TAT system are structurally and mechanistically related to a pH-dependent import system in plant chloroplast thylakoid membranes . The gene products responsible for the Sec-independent pathway are called TatA, TatB, TatC and TatE. TatB is essential for the secretion of these specific proteins, and forms a scaffold upon which TatC assembles in the membrane. Deletion studies have shown that it is crucial in the forming of the TAT complex, and is therefore necessary for the correct working of the system.; GO: 0015450 P-P-bond-hydrolysis-driven protein transmembrane transporter activity, 0009306 protein secretion, 0016020 membrane. Probab=22.30 E-value=66 Score=14.34 Aligned_cols=35 Identities=29% Similarity=0.569 Sum_probs=23.1 Q ss_pred HHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 99999999999---------999876358999999997310147 Q gi|254780730|r 23 LLASLVSVAII---------SAVSTLGDRMKGVYQTISTELDKG 57 (120) Q Consensus 23 LIaALIaVAII---------aavt~LGt~L~~~FttVst~LnkG 57 (120) |+.+.|+++++ -.+..+=.++.++++++-++|++- T Consensus 9 Lli~vVaLvVLGPerLP~a~R~v~~~~~~~r~~a~~~~~~l~~e 52 (82) T TIGR01410 9 LLIAVVALVVLGPERLPVAIRAVGKFVRRLRGMASDVKNELDEE 52 (82) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999872625327899999999999999898999999998 No 40 >COG4970 FimT Tfp pilus assembly protein FimT [Cell motility and secretion / Intracellular trafficking and secretion] Probab=22.02 E-value=67 Score=14.31 Aligned_cols=49 Identities=12% Similarity=0.289 Sum_probs=31.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHCCCCCCC Q ss_conf 2570299999999999999999998763-----589999999973101478987 Q gi|254780730|r 13 GSGATAIEYGLLASLVSVAIISAVSTLG-----DRMKGVYQTISTELDKGDVPP 61 (120) Q Consensus 13 EsGATAIEYgLIaALIaVAIIaavt~LG-----t~L~~~FttVst~LnkG~~pp 61 (120) ..|-|.+|--+.+|++++-..-++-.|- +.+.+.-+.+++.|.-+-+.+ T Consensus 7 ~rGfTL~ELliviAIlAIla~~A~P~fs~~i~~~rl~s~a~eL~a~l~lARteA 60 (181) T COG4970 7 TRGFTLLELLIVLAILAILAVIAAPNFSQWIRSQRLRSAADELAAALQLARTEA 60 (181) T ss_pred CCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 675049999999999999999851008888778789889999999999999999 No 41 >PRK13871 conjugal transfer protein TrbC; Provisional Probab=21.97 E-value=51 Score=14.95 Aligned_cols=28 Identities=21% Similarity=0.315 Sum_probs=17.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999998763589999999 Q gi|254780730|r 22 GLLASLVSVAIISAVSTLGDRMKGVYQT 49 (120) Q Consensus 22 gLIaALIaVAIIaavt~LGt~L~~~Ftt 49 (120) +.-++++++++.+++-.+|+.+++-... T Consensus 52 A~~ialiai~~AGa~LIFGGElN~F~Rr 79 (134) T PRK13871 52 AGFIALAAVAIAGAMLIFGGELNDFARR 79 (134) T ss_pred HHHHHHHHHHHHCCEEEECCCHHHHHHH T ss_conf 9999999999828235766208899999 No 42 >COG2976 Uncharacterized protein conserved in bacteria [Function unknown] Probab=21.63 E-value=68 Score=14.26 Aligned_cols=45 Identities=20% Similarity=0.291 Sum_probs=27.5 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHHHHCC Q ss_conf 2699999961225702999999999999999999987635------------899999999731014 Q gi|254780730|r 2 KMNIIKKILKNGSGATAIEYGLLASLVSVAIISAVSTLGD------------RMKGVYQTISTELDK 56 (120) Q Consensus 2 rMn~ikrFl~DEsGATAIEYgLIaALIaVAIIaavt~LGt------------~L~~~FttVst~Lnk 56 (120) ++.-|++|+++- | -+||+.+|++....||= .-+..|+.+...++. T Consensus 10 ql~~ik~wwkeN-G---------k~li~gviLg~~~lfGW~ywq~~q~~q~~~AS~~Y~~~i~~~~a 66 (207) T COG2976 10 QLEAIKDWWKEN-G---------KALIVGVILGLGGLFGWRYWQSHQVEQAQEASAQYQNAIKAVQA 66 (207) T ss_pred HHHHHHHHHHHC-C---------CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 999999999977-9---------07899999999999988999998788888999999999999855 No 43 >COG3043 NapB Nitrate reductase cytochrome c-type subunit [Energy production and conversion] Probab=21.00 E-value=70 Score=14.19 Aligned_cols=43 Identities=26% Similarity=0.333 Sum_probs=25.7 Q ss_pred HHHHHHCCCC-CCCCCCCCCCCCCC-CCCC--CCCCCCCCCCCCCCC Q ss_conf 9973101478-98767887557867-7877--432458877653432 Q gi|254780730|r 49 TISTELDKGD-VPPTKPGSVPMQPE-SSNP--STRLQPPAKPTSIPV 91 (120) Q Consensus 49 tVst~LnkG~-~pp~~p~~~~~~~~-~~~~--~~~~~~~~~~~~~~~ 91 (120) .+.-+|.+++ +--+.++-.||+++ +... +---|||--|.||-- T Consensus 33 ~~~~dl~~~~~~~~~~~~a~~~pk~~~~r~e~nYvnQPP~IPHsi~~ 79 (155) T COG3043 33 AVGPDLSQSPEVSGTEEAAHPMPKWEQTRMELNYVNQPPMIPHSIEG 79 (155) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCC T ss_conf 77754467864334566778788003220454268799977633467 No 44 >PRK13471 F0F1 ATP synthase subunit C; Provisional Probab=20.51 E-value=22 Score=16.98 Aligned_cols=52 Identities=17% Similarity=0.215 Sum_probs=30.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9999999998763589999999973101478987678875578677877432 Q gi|254780730|r 28 VSVAIISAVSTLGDRMKGVYQTISTELDKGDVPPTKPGSVPMQPESSNPSTR 79 (120) Q Consensus 28 IaVAIIaavt~LGt~L~~~FttVst~LnkG~~pp~~p~~~~~~~~~~~~~~~ 79 (120) ++-++|.+.+.+|-.+.--+-.|...+..|.+-..+-.++--|||.++.-+| T Consensus 7 ~~~~li~a~s~igAGlamgigaIGPgiGeG~aa~~A~eaiarqPe~~g~I~~ 58 (85) T PRK13471 7 LADALILMGKAIGAGLCMGIGAIGPGIGEGNIGAHAMDAMARQPEMVGTITT 58 (85) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHCCHHH T ss_conf 9999999999999999999830266400879999999998759055221899 No 45 >pfam06295 DUF1043 Protein of unknown function (DUF1043). This family consists of several hypothetical bacterial proteins of unknown function. Probab=20.40 E-value=72 Score=14.11 Aligned_cols=12 Identities=17% Similarity=0.473 Sum_probs=4.5 Q ss_pred HHHHHHHHCCCC Q ss_conf 999973101478 Q gi|254780730|r 47 YQTISTELDKGD 58 (120) Q Consensus 47 FttVst~LnkG~ 58 (120) |..|-..|.+|. T Consensus 66 Y~~ly~HlA~~a 77 (128) T pfam06295 66 YHKLYQHMAKSS 77 (128) T ss_pred HHHHHHHHHHHH T ss_conf 999999999899 No 46 >TIGR00411 redox_disulf_1 redox-active disulfide protein 1; InterPro: IPR004502 This group of proteins includes thioredoxins, glutaredoxins, protein-disulphide isomerases, and others, some of which have several such domains. The sequence of proteins in this group at the redox-active disufide site, CPYC, matches glutaredoxins rather than thioredoxins, although overall the sequence seems closer to thioredoxins. Proteins may be involved in a ribonucleotide-reducing system component distinct from thioredoxin or glutaredoxin.; GO: 0009055 electron carrier activity, 0015035 protein disulfide oxidoreductase activity, 0045454 cell redox homeostasis. Probab=20.24 E-value=26 Score=16.62 Aligned_cols=15 Identities=33% Similarity=0.512 Sum_probs=11.4 Q ss_pred CCCCHHHHHHHHHHH Q ss_conf 225702999999999 Q gi|254780730|r 12 NGSGATAIEYGLLAS 26 (120) Q Consensus 12 DEsGATAIEYgLIaA 26 (120) ||+-.-|+||||.+. T Consensus 40 ~e~~~kA~~yGi~aV 54 (82) T TIGR00411 40 MEDLKKALEYGIMAV 54 (82) T ss_pred CCCHHHHHHCCCCCC T ss_conf 548478875163526 Done!