Query gi|254780542|ref|YP_003064955.1| hypothetical protein CLIBASIA_02145 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 210 No_of_seqs 121 out of 549 Neff 4.0 Searched_HMMs 39220 Date Sun May 29 22:35:37 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780542.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 pfam09923 DUF2155 Uncharacteri 100.0 1.2E-38 3.1E-43 262.5 10.7 91 30-121 1-91 (91) 2 COG4765 Uncharacterized protei 100.0 6.5E-36 1.7E-40 245.4 2.7 108 18-125 49-156 (164) 3 PRK03641 hypothetical protein; 70.8 8.7 0.00022 18.9 4.6 53 1-54 1-59 (220) 4 pfam12504 DUF3711 Aluminium in 69.7 1.8 4.7E-05 23.2 0.8 27 93-119 13-39 (58) 5 KOG4509 consensus 68.2 9.4 0.00024 18.7 4.2 46 77-122 183-229 (247) 6 pfam04920 DUF656 Family of unk 56.9 9.8 0.00025 18.6 2.6 37 29-65 56-92 (126) 7 pfam05428 CRF-BP Corticotropin 50.0 20 0.00052 16.6 4.3 61 72-132 87-152 (311) 8 PRK03598 hypothetical protein; 39.8 29 0.00074 15.6 6.4 35 31-65 43-77 (331) 9 TIGR03519 Bac_Flav_fam_1 Bacte 38.0 25 0.00063 16.1 2.2 24 1-24 1-24 (292) 10 KOG0641 consensus 37.1 32 0.00082 15.4 5.4 62 56-129 161-224 (350) 11 cd01910 Wali7 This domain is p 36.6 13 0.00034 17.8 0.7 27 92-118 181-207 (224) 12 TIGR00654 PhzF_family phenazin 32.8 25 0.00065 16.0 1.6 21 92-113 221-241 (314) 13 PRK01904 hypothetical protein; 32.4 38 0.00097 14.9 4.8 52 1-53 1-59 (217) 14 pfam10915 DUF2709 Protein of u 31.4 7.3 0.00018 19.4 -1.4 30 94-127 91-120 (238) 15 PRK10877 thiol:disulfide inter 31.4 15 0.00037 17.5 0.2 31 1-31 1-32 (232) 16 pfam03968 OstA OstA-like prote 28.8 44 0.0011 14.5 5.4 20 4-23 3-22 (141) 17 PRK13414 flagellar biosynthesi 27.6 33 0.00085 15.3 1.5 25 1-25 1-25 (209) 18 TIGR03516 ppisom_GldI peptidyl 26.4 42 0.0011 14.6 1.8 18 1-18 1-18 (177) 19 pfam09151 DUF1936 Domain of un 25.2 41 0.001 14.7 1.6 17 111-127 13-33 (36) 20 PRK12789 flgI flagellar basal 24.2 53 0.0014 14.0 2.5 38 1-38 1-51 (367) 21 PRK10287 thiosulfate:cyanide s 22.0 52 0.0013 14.0 1.6 22 1-24 2-23 (104) 22 pfam05628 Borrelia_P13 Borreli 20.8 27 0.00069 15.8 -0.1 24 4-27 3-26 (165) No 1 >pfam09923 DUF2155 Uncharacterized protein conserved in bacteria (DUF2155). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=100.00 E-value=1.2e-38 Score=262.51 Aligned_cols=91 Identities=40% Similarity=0.763 Sum_probs=86.5 Q ss_pred EEEEEEECCCCEEEEEEEECCCEEEECCEEEEEEECCCCCCCCCCCCEEEEEEEEECCCCCCCEEECCCEEEECCCCCCC Q ss_conf 89999834345048999707985896327999984016898888761699999960357763144427575207578823 Q gi|254780542|r 30 AEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAI 109 (210) Q Consensus 30 A~L~~LDKITakvs~ieI~vGe~~~FG~L~I~v~~C~~~~p~e~pe~~Ayl~I~d~~~~~~~~~IF~GWMfASSPsLnal 109 (210) |+||+|||||||++++++++|++++||+|+|++++|++++|++.||++||++|.+ ...+..++||+||||||||+||+| T Consensus 1 A~l~~LDKiTak~~~i~i~vg~~~~fg~L~I~v~~C~~~~p~e~pe~~A~~~i~~-~~~~~~~~iF~GWMfassPsl~~~ 79 (91) T pfam09923 1 AVLRGLDKITGRTTDLELKVGETARFGALQVTLRECEKRYPEENPDGDAFAELTI-RERGDGEPIFSGWMFASSPALNAL 79 (91) T ss_pred CEEEEEECCCCCEEEEEEECCCEEEECCEEEEEHHCCCCCCCCCCCCCEEEEEEE-CCCCCCCEEECCCEEECCCCCCCC T ss_conf 9788771624740699994899889820899922027889988867318999996-458866502977588358776766 Q ss_pred CCCCEEEEEEEC Q ss_conf 385214898613 Q gi|254780542|r 110 DHSIYDIWLMQC 121 (210) Q Consensus 110 EHPvYDIWvl~C 121 (210) |||||||||++| T Consensus 80 eHPvYDvwv~~C 91 (91) T pfam09923 80 EHPRYDVWVLRC 91 (91) T ss_pred CCCCCEEEEECC T ss_conf 786504798639 No 2 >COG4765 Uncharacterized protein conserved in bacteria [Function unknown] Probab=100.00 E-value=6.5e-36 Score=245.39 Aligned_cols=108 Identities=55% Similarity=0.987 Sum_probs=102.0 Q ss_pred HHCCCHHCCCCEEEEEEEECCCCEEEEEEEECCCEEEECCEEEEEEECCCCCCCCCCCCEEEEEEEEECCCCCCCEEECC Q ss_conf 21010001376589999834345048999707985896327999984016898888761699999960357763144427 Q gi|254780542|r 18 KFANSARFANKVAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVCYSRDDREAQRIDAFVSISEIFTDRIVRSIFSG 97 (210) Q Consensus 18 ~~A~s~~~~~~~A~L~~LDKITakvs~ieI~vGe~~~FG~L~I~v~~C~~~~p~e~pe~~Ayl~I~d~~~~~~~~~IF~G 97 (210) ..+.+.++.|.+|+|.+||||||+++.|++++||+++||+|.|+++.|+++..+|+|+++||++|.+.+-+.+.+.||.| T Consensus 49 ~~a~a~r~~n~va~~~aldKIT~~~~~fdv~igE~v~fg~lqvt~r~C~s~~~~e~p~~~a~v~Vde~~ldr~~r~if~G 128 (164) T COG4765 49 DPAEAARFKNYVAKFSALDKITGRITEFDVYIGETVQFGALQVTPRVCYSRLDDEAPKTDAFVTVDEITLDRKIRRIFTG 128 (164) T ss_pred CHHHHHHHHHHHHHHHHHHHEEEEEEEEEEECCCEEEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEECCCHHHHHEEE T ss_conf 71567766415777622411003579987852887786028987523334678768764148998877616007655400 Q ss_pred CEEEECCCCCCCCCCCEEEEEEECCCCC Q ss_conf 5752075788233852148986134788 Q gi|254780542|r 98 WMFADSPAMNAIDHSIYDIWLMQCKDPI 125 (210) Q Consensus 98 WMfASSPsLnalEHPvYDIWvl~Ck~p~ 125 (210) |||||||+||++||||||||+++||... T Consensus 129 WM~asSpgLna~EHPIYdvwlk~Ckq~s 156 (164) T COG4765 129 WMFASSPGLNAVEHPIYDVWLKDCKQKS 156 (164) T ss_pred EEECCCCCCCCCCCHHHHHHHHHHHHCC T ss_conf 2650688766433257899998740437 No 3 >PRK03641 hypothetical protein; Provisional Probab=70.76 E-value=8.7 Score=18.93 Aligned_cols=53 Identities=15% Similarity=0.302 Sum_probs=39.6 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCHHCCCCEEEEEEEE--CCCC----EEEEEEEECCCEEE Q ss_conf 914789999999986442101000137658999983--4345----04899970798589 Q gi|254780542|r 1 MKYRVLLLILFFVFSHAKFANSARFANKVAEFAGMD--KITG----RVLTFDVEINQSAQ 54 (210) Q Consensus 1 MK~~ILlLilFfifs~~~~A~s~~~~~~~A~L~~LD--KITa----kvs~ieI~vGe~~~ 54 (210) ||+.|+.-++.+.++.+.+|..-.+... +++-.+| |+++ +..+++++-|+.-. T Consensus 1 m~~~~i~~llal~~S~~~~A~tL~lp~~-ielL~vnGk~v~~slf~~~~~l~L~~G~hQI 59 (220) T PRK03641 1 MKTGIVTTLLALCLPVSVFATTLRLSPE-VDLLVLDGKKVSSSLLRGADSIELDNGPHQL 59 (220) T ss_pred CCHHHHHHHHHHHHHHHHHEEEEECCCC-EEEEEECCEECCCCCCCCCCEEEECCCCEEE T ss_conf 9116999999998623645046647986-4699988988544345677526717994489 No 4 >pfam12504 DUF3711 Aluminium induced protein. This domain family is found in eukaryotes, and is approximately 60 amino acids in length. There are two completely conserved residues (L and P) that may be functionally important. Probab=69.70 E-value=1.8 Score=23.16 Aligned_cols=27 Identities=11% Similarity=0.308 Sum_probs=22.3 Q ss_pred EEECCCEEEECCCCCCCCCCCEEEEEE Q ss_conf 444275752075788233852148986 Q gi|254780542|r 93 SIFSGWMFADSPAMNAIDHSIYDIWLM 119 (210) Q Consensus 93 ~IF~GWMfASSPsLnalEHPvYDIWvl 119 (210) ..=.|-||.|+-.|.++|||..-+..+ T Consensus 13 pFP~GC~f~s~~GL~SfEhP~nk~kav 39 (58) T pfam12504 13 PFPKGCFFSSAGGLRSFEHPKNKVKAV 39 (58) T ss_pred CCCCCCEEECCCCCCCCCCCHHHCCCC T ss_conf 899970686486601031736235446 No 5 >KOG4509 consensus Probab=68.23 E-value=9.4 Score=18.70 Aligned_cols=46 Identities=15% Similarity=0.211 Sum_probs=29.8 Q ss_pred EEEEEEEEECCCCCCCEEE-CCCEEEECCCCCCCCCCCEEEEEEECC Q ss_conf 6999999603577631444-275752075788233852148986134 Q gi|254780542|r 77 DAFVSISEIFTDRIVRSIF-SGWMFADSPAMNAIDHSIYDIWLMQCK 122 (210) Q Consensus 77 ~Ayl~I~d~~~~~~~~~IF-~GWMfASSPsLnalEHPvYDIWvl~Ck 122 (210) .++++|.-.+.-.+.+-+| +|||+.+--+|.-|--|---.-+-.|- T Consensus 183 GVl~eVefSssIHDREI~FdNGWmiK~GRGLDYFK~p~~kfsLG~~D 229 (247) T KOG4509 183 GVLFEVEFSSSIHDREIIFDNGWMIKIGRGLDYFKAPDGKFSLGACD 229 (247) T ss_pred CEEEEEEECCCCCCEEEEECCCEEEEECCCCCCCCCCCCCEEECCCC T ss_conf 54899986042022047851854898547632010788735413356 No 6 >pfam04920 DUF656 Family of unknown function (DUF656). A family of hypothetical proteins from Beet necrotic yellow vein virus. Probab=56.94 E-value=9.8 Score=18.59 Aligned_cols=37 Identities=16% Similarity=0.235 Sum_probs=30.0 Q ss_pred EEEEEEEECCCCEEEEEEEECCCEEEECCEEEEEEEC Q ss_conf 5899998343450489997079858963279999840 Q gi|254780542|r 29 VAEFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVC 65 (210) Q Consensus 29 ~A~L~~LDKITakvs~ieI~vGe~~~FG~L~I~v~~C 65 (210) .-.+-+|+|||+-.-.+.+|+-....-|...+.+..| T Consensus 56 ~difpwlnki~sinvs~dvp~~s~~~~g~~~v~if~~ 92 (126) T pfam04920 56 IDIFPWLNKIMSINVSLDVPAFAGICGGRIMVLIFIT 92 (126) T ss_pred CCHHHHHHHCEEEEEEECCCCCEEECCEEEEEEEEEE T ss_conf 2303554430178998814520001102899999997 No 7 >pfam05428 CRF-BP Corticotropin-releasing factor binding protein (CRF-BP). This family consists of several eukaryotic corticotropin-releasing factor binding proteins (CRF-BP or CRH-BP). Corticotropin-releasing hormone (CRH) plays multiple roles in vertebrate species. In mammals, it is the major hypothalamic releasing factor for pituitary adrenocorticotropin secretion, and is a neurotransmitter or neuromodulator at other sites in the central nervous system. In non-mammalian vertebrates, CRH not only acts as a neurotransmitter and hypophysiotropin, it also acts as a potent thyrotropin-releasing factor, allowing CRH to regulate both the adrenal and thyroid axes, especially in development. CRH-BP is thought to play an inhibitory role in which it binds CRH and other CRH-like ligands and prevents the activation of CRH receptors. There is however evidence that CRH-BP may also exhibit diverse extra and intracellular roles in a cell specific fashion and at specific times in development. Probab=50.00 E-value=20 Score=16.62 Aligned_cols=61 Identities=15% Similarity=0.210 Sum_probs=35.8 Q ss_pred CCCCCEEEEEEE--EEC-CCCCCCEEECCCEEEECCCCCCCCCCC--EEEEEEECCCCCCCCCCHH Q ss_conf 887616999999--603-577631444275752075788233852--1489861347888320000 Q gi|254780542|r 72 EAQRIDAFVSIS--EIF-TDRIVRSIFSGWMFADSPAMNAIDHSI--YDIWLMQCKDPINDSISNS 132 (210) Q Consensus 72 e~pe~~Ayl~I~--d~~-~~~~~~~IF~GWMfASSPsLnalEHPv--YDIWvl~Ck~p~~~~~~~~ 132 (210) ..||...-++.+ |+. ..+.--.+|.||...-.-==++-|||. ++=..-=|.+-.+...-.| T Consensus 87 aePde~I~i~~d~vdidC~~Gd~i~vfDGW~LkGEkFPs~qDHplPl~eRy~dfC~s~~~k~~~rS 152 (311) T pfam05428 87 GEPDEVIEIEYKHVDIDCEGGDLLKVFDGWELKGEKFPSSQDHPLPLMERYTDFCNSGSNKRTFRS 152 (311) T ss_pred ECCCEEEEEEEEEEECCCCCCCEEEEEEEEEECCCCCCCCCCCCCCHHHHHHHHHCCCCCCCEEEC T ss_conf 278618999976670257999879998610884668848467877378889998617765334560 No 8 >PRK03598 hypothetical protein; Provisional Probab=39.75 E-value=29 Score=15.63 Aligned_cols=35 Identities=11% Similarity=0.151 Sum_probs=29.5 Q ss_pred EEEEEECCCCEEEEEEEECCCEEEECCEEEEEEEC Q ss_conf 99998343450489997079858963279999840 Q gi|254780542|r 31 EFAGMDKITGRVLTFDVEINQSAQFGSLIIKPMVC 65 (210) Q Consensus 31 ~L~~LDKITakvs~ieI~vGe~~~FG~L~I~v~~C 65 (210) ++.+==||+|++..+.+..|+.++=|.+...+..= T Consensus 43 ~V~vs~~v~G~V~~v~V~eGd~Vk~Gq~La~LD~~ 77 (331) T PRK03598 43 TVNLSFRVGGRLASLAVDEGDAVKAGQVLGELDHA 77 (331) T ss_pred EEEECCCCCEEEEEEECCCCCEECCCCEEEEECCH T ss_conf 99991327759999986897987799889998808 No 9 >TIGR03519 Bac_Flav_fam_1 Bacteroidetes-specific putative membrane protein. This model describes a protein family unique to, and greatly expanded in, the Bacteriodetes. Species in this lineage include several, such as Cytophaga hutchinsonii and Flavobacterium johnsoniae, that exhibit a poorly understood rapid gliding phenotype. Several members of this protein family are found in operons with other genes whose loss leads to a loss a this motility. Probab=38.03 E-value=25 Score=16.08 Aligned_cols=24 Identities=33% Similarity=0.548 Sum_probs=15.2 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCHH Q ss_conf 914789999999986442101000 Q gi|254780542|r 1 MKYRVLLLILFFVFSHAKFANSAR 24 (210) Q Consensus 1 MK~~ILlLilFfifs~~~~A~s~~ 24 (210) ||..++++++|+++.....+++.. T Consensus 1 Mkki~l~~~~~~~~~~~~~aqq~~ 24 (292) T TIGR03519 1 MKKILLLLLLLLLLTVQAFAQQDP 24 (292) T ss_pred CCCHHHHHHHHHHHHHHHHHHCCH T ss_conf 960578899999998778750374 No 10 >KOG0641 consensus Probab=37.08 E-value=32 Score=15.36 Aligned_cols=62 Identities=18% Similarity=0.313 Sum_probs=33.4 Q ss_pred CCEEEEEEECCCCCCCCCC--CCEEEEEEEEECCCCCCCEEECCCEEEECCCCCCCCCCCEEEEEEECCCCCCCCC Q ss_conf 3279999840168988887--6169999996035776314442757520757882338521489861347888320 Q gi|254780542|r 56 GSLIIKPMVCYSRDDREAQ--RIDAFVSISEIFTDRIVRSIFSGWMFADSPAMNAIDHSIYDIWLMQCKDPINDSI 129 (210) Q Consensus 56 G~L~I~v~~C~~~~p~e~p--e~~Ayl~I~d~~~~~~~~~IF~GWMfASSPsLnalEHPvYDIWvl~Ck~p~~~~~ 129 (210) |+-.|++..|..-.+-... .+.-.|. -+-.+||||+|-.-=... --+|+.|-.|-+-++.+- T Consensus 161 gdc~iy~tdc~~g~~~~a~sghtghila----------lyswn~~m~~sgsqdkti--rfwdlrv~~~v~~l~~~~ 224 (350) T KOG0641 161 GDCKIYITDCGRGQGFHALSGHTGHILA----------LYSWNGAMFASGSQDKTI--RFWDLRVNSCVNTLDNDF 224 (350) T ss_pred CCCEEEEEECCCCCCCEEECCCCCCEEE----------EEEECCCEEECCCCCCEE--EEEEEECCCEEEECCCCC T ss_conf 7635998526899752130477653899----------988547178725887348--988631252555136754 No 11 >cd01910 Wali7 This domain is present in Wali7, a protein of unknown function, expressed in wheat and induced by aluminum. Wali7 has a single domain similar to the glutamine amidotransferase domain of glucosamine-fructose 6-phosphate synthase (GLMS or GFAT), glutamine phosphoribosylpyrophosphate (Prpp) amidotransferase (GPATase), asparagine synthetase B (AsnB), beta lactam synthetase (beta-LS) and glutamate synthase (GltS). The Wali7 domain is also somewhat similar to the Ntn hydrolase fold of the proteasomal alph and beta subunits. Probab=36.58 E-value=13 Score=17.75 Aligned_cols=27 Identities=11% Similarity=0.299 Sum_probs=21.5 Q ss_pred CEEECCCEEEECCCCCCCCCCCEEEEE Q ss_conf 144427575207578823385214898 Q gi|254780542|r 92 RSIFSGWMFADSPAMNAIDHSIYDIWL 118 (210) Q Consensus 92 ~~IF~GWMfASSPsLnalEHPvYDIWv 118 (210) ...=.|-||+|+-.|.++|||..-+.. T Consensus 181 apFP~Gc~f~S~gGL~sfehP~nk~ka 207 (224) T cd01910 181 APFPKGCFFHSEGGLRSFEHPMNKLKA 207 (224) T ss_pred CCCCCCCEEECCCCEEECCCCHHHCCC T ss_conf 889998417668650204173621753 No 12 >TIGR00654 PhzF_family phenazine biosynthesis protein PhzF family; InterPro: IPR003719 Five genes, phzF, phzA, phzB, phzC and phzD, encode enzymes for phenazine biosynthesis in the biological control bacterium Pseudomonas chlororaphis (Pseudomonas aureofaciens). Protein PhzF is similar to 3-deoxy-D-arabino-heptulosonate-7-phosphate synthases of solanaceous plants. PhzC is responsible for the conversion of phenazine-I-carboxylic acid to 2-hydroxy-phenazine-I-carboxylic acid .; GO: 0003824 catalytic activity, 0009058 biosynthetic process. Probab=32.79 E-value=25 Score=16.00 Aligned_cols=21 Identities=29% Similarity=0.721 Sum_probs=14.9 Q ss_pred CEEECCCEEEECCCCCCCCCCC Q ss_conf 1444275752075788233852 Q gi|254780542|r 92 RSIFSGWMFADSPAMNAIDHSI 113 (210) Q Consensus 92 ~~IF~GWMfASSPsLnalEHPv 113 (210) ..+..|=||| ||+++-.|.|| T Consensus 221 nkl~~gR~Fa-sP~~gi~EDPv 241 (314) T TIGR00654 221 NKLLHGRMFA-SPVIGIVEDPV 241 (314) T ss_pred CHHHCCEEEC-CCCCCCEECCC T ss_conf 1010260002-78776255572 No 13 >PRK01904 hypothetical protein; Provisional Probab=32.36 E-value=38 Score=14.89 Aligned_cols=52 Identities=29% Similarity=0.376 Sum_probs=35.2 Q ss_pred CCHHHHHHHHHHH-HHHHHHCCCHHCCCCEEEEEEEE--CCCC----EEEEEEEECCCEE Q ss_conf 9147899999999-86442101000137658999983--4345----0489997079858 Q gi|254780542|r 1 MKYRVLLLILFFV-FSHAKFANSARFANKVAEFAGMD--KITG----RVLTFDVEINQSA 53 (210) Q Consensus 1 MK~~ILlLilFfi-fs~~~~A~s~~~~~~~A~L~~LD--KITa----kvs~ieI~vGe~~ 53 (210) ||++..++.++.+ .+.+.+|..-.+..+ .++-.+| |+.+ ....++++-|++. T Consensus 1 MK~~~~~~al~~l~~S~~a~A~tL~lp~~-ielL~vdG~k~~~slf~~~~sl~L~~G~~H 59 (217) T PRK01904 1 MKLRAAALAVATLLTSTASFAGMVTTSSN-VEFLAIDGQKASKSLLGKAKSFNVDDTQVH 59 (217) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHCCCCCC-CEEEEECCEECCCCCCCCCCCEECCCCCCE T ss_conf 97007999999998608988751407887-138997682777654567761563799704 No 14 >pfam10915 DUF2709 Protein of unknown function (DUF2709). This bacterial family of proteins has no known function. Probab=31.44 E-value=7.3 Score=19.41 Aligned_cols=30 Identities=40% Similarity=0.862 Sum_probs=23.7 Q ss_pred EECCCEEEECCCCCCCCCCCEEEEEEECCCCCCC Q ss_conf 4427575207578823385214898613478883 Q gi|254780542|r 94 IFSGWMFADSPAMNAIDHSIYDIWLMQCKDPIND 127 (210) Q Consensus 94 IF~GWMfASSPsLnalEHPvYDIWvl~Ck~p~~~ 127 (210) .|.|=.|+..-.-||- .++|| ||-+| |-|. T Consensus 91 PFTGKVF~DNt~~nPQ-DAIYD-WvSkC--PeN~ 120 (238) T pfam10915 91 PFTGKVFGDNTHPNPQ-DAIYD-WVSKC--PENK 120 (238) T ss_pred CCCCCCCCCCCCCCHH-HHHHH-HHHHC--CCCH T ss_conf 7767422678899858-88999-98618--6520 No 15 >PRK10877 thiol:disulfide interchange protein DsbC; Provisional Probab=31.39 E-value=15 Score=17.51 Aligned_cols=31 Identities=29% Similarity=0.382 Sum_probs=20.0 Q ss_pred CCHHHHHHHHHHH-HHHHHHCCCHHCCCCEEE Q ss_conf 9147899999999-864421010001376589 Q gi|254780542|r 1 MKYRVLLLILFFV-FSHAKFANSARFANKVAE 31 (210) Q Consensus 1 MK~~ILlLilFfi-fs~~~~A~s~~~~~~~A~ 31 (210) ||..++++.++++ |+...+|+...+..+.+. T Consensus 1 mkk~l~~~~~l~~~~s~~a~aDea~i~~~l~k 32 (232) T PRK10877 1 MKKGFMLFTLLAAAFSGFAHADDAAIQQTLAK 32 (232) T ss_pred CCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 92669999999999862524379999999998 No 16 >pfam03968 OstA OstA-like protein. This family of proteins are mostly uncharacterized. However the family does include E. coli OstA that has been characterized as an organic solvent tolerance protein. Probab=28.82 E-value=44 Score=14.51 Aligned_cols=20 Identities=35% Similarity=0.512 Sum_probs=9.6 Q ss_pred HHHHHHHHHHHHHHHHCCCH Q ss_conf 78999999998644210100 Q gi|254780542|r 4 RVLLLILFFVFSHAKFANSA 23 (210) Q Consensus 4 ~ILlLilFfifs~~~~A~s~ 23 (210) .+++++++++++...+|.+. T Consensus 3 ~~~~~~~~~l~~~~~~a~~~ 22 (141) T pfam03968 3 LLLLLLLLLLASASALAAQA 22 (141) T ss_pred HHHHHHHHHHHHHHHHHCCC T ss_conf 57999999987357641432 No 17 >PRK13414 flagellar biosynthesis protein FliZ; Provisional Probab=27.56 E-value=33 Score=15.25 Aligned_cols=25 Identities=28% Similarity=0.506 Sum_probs=17.9 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCHHC Q ss_conf 9147899999999864421010001 Q gi|254780542|r 1 MKYRVLLLILFFVFSHAKFANSARF 25 (210) Q Consensus 1 MK~~ILlLilFfifs~~~~A~s~~~ 25 (210) |..++++++++.++....+|++... T Consensus 1 mn~kflFlvf~~f~~~~~fAq~ne~ 25 (209) T PRK13414 1 MNNKFLFLVFLIFFNFFLFAQSNEQ 25 (209) T ss_pred CCCEEHHHHHHHHHHHHHHHCCHHH T ss_conf 9843315899999988987330255 No 18 >TIGR03516 ppisom_GldI peptidyl-prolyl isomerase, gliding motility-associated. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldI is a FKBP-type peptidyl-prolyl cis-trans isomerase (pfam00254) linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Knockout of this gene abolishes the gliding phenotype. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. This family is only found in Bacteroidetes containing the suite of genes proposed to confer the gliding motility phenotype. Probab=26.35 E-value=42 Score=14.64 Aligned_cols=18 Identities=22% Similarity=0.453 Sum_probs=12.3 Q ss_pred CCHHHHHHHHHHHHHHHH Q ss_conf 914789999999986442 Q gi|254780542|r 1 MKYRVLLLILFFVFSHAK 18 (210) Q Consensus 1 MK~~ILlLilFfifs~~~ 18 (210) ||+.+.+++++++|.++. T Consensus 1 m~~~i~~i~~~~~~~sC~ 18 (177) T TIGR03516 1 MKHLIAVILLLLLLLGCK 18 (177) T ss_pred CCCHHHHHHHHHHHHHCC T ss_conf 943047898989987269 No 19 >pfam09151 DUF1936 Domain of unknown function (DUF1936). This domain is found in a set of hypothetical Archaeal proteins. Its exact function has not, as yet, been defined. It possesses a zinc ribbon fold. Probab=25.23 E-value=41 Score=14.71 Aligned_cols=17 Identities=35% Similarity=0.888 Sum_probs=9.9 Q ss_pred CCCEE----EEEEECCCCCCC Q ss_conf 85214----898613478883 Q gi|254780542|r 111 HSIYD----IWLMQCKDPIND 127 (210) Q Consensus 111 HPvYD----IWvl~Ck~p~~~ 127 (210) .|+|| |.+.+|.||--| T Consensus 13 epvydekgeikvfrcsnpacd 33 (36) T pfam09151 13 EPVYDEKGEIKVFRCSNPACD 33 (36) T ss_pred EEECCCCCCEEEEECCCCCCC T ss_conf 320057773889972797666 No 20 >PRK12789 flgI flagellar basal body P-ring protein; Reviewed Probab=24.23 E-value=53 Score=13.99 Aligned_cols=38 Identities=21% Similarity=0.332 Sum_probs=21.0 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCHHCCC-------------CEEEEEEEECC Q ss_conf 914789999999986442101000137-------------65899998343 Q gi|254780542|r 1 MKYRVLLLILFFVFSHAKFANSARFAN-------------KVAEFAGMDKI 38 (210) Q Consensus 1 MK~~ILlLilFfifs~~~~A~s~~~~~-------------~~A~L~~LDKI 38 (210) |..+++++++|.++..+..+...++.. .|.-.-+||-. T Consensus 1 m~~~~~~~~~~~l~~~~~~~~a~RIkDi~~v~GvR~NqLiGYGLVVGL~GT 51 (367) T PRK12789 1 MIRRLLLAVLLALLAGPAAAAATRIKDIATLQGVRDNQLVGYGLVTGLQGT 51 (367) T ss_pred CHHHHHHHHHHHHHHCCCHHHHHHHHHHEEEEEEEECCEEEEEEEEECCCC T ss_conf 908999999999985530666665640200400244355677899813887 No 21 >PRK10287 thiosulfate:cyanide sulfurtransferase; Provisional Probab=22.05 E-value=52 Score=14.04 Aligned_cols=22 Identities=36% Similarity=0.401 Sum_probs=11.8 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCHH Q ss_conf 914789999999986442101000 Q gi|254780542|r 1 MKYRVLLLILFFVFSHAKFANSAR 24 (210) Q Consensus 1 MK~~ILlLilFfifs~~~~A~s~~ 24 (210) ||..++.++++ |+...+|...+ T Consensus 2 ~~~~~~~l~~~--f~~~~~a~~~~ 23 (104) T PRK10287 2 FKKGLLALALV--FSLPVFAAEHW 23 (104) T ss_pred CHHHHHHHHHH--HHCHHHHCCEE T ss_conf 24479999999--95655415837 No 22 >pfam05628 Borrelia_P13 Borrelia membrane protein P13. This family consists of P13 proteins from Borrelia species. P13 is a 13kDa integral membrane protein which is post-translationally processed at both ends and modified by an unknown mechanism. Probab=20.77 E-value=27 Score=15.81 Aligned_cols=24 Identities=25% Similarity=0.568 Sum_probs=17.1 Q ss_pred HHHHHHHHHHHHHHHHCCCHHCCC Q ss_conf 789999999986442101000137 Q gi|254780542|r 4 RVLLLILFFVFSHAKFANSARFAN 27 (210) Q Consensus 4 ~ILlLilFfifs~~~~A~s~~~~~ 27 (210) ++|+++++|.+....||++...+. T Consensus 3 Ki~~liLif~lt~qiFAq~d~~~~ 26 (165) T pfam05628 3 KIFTLILIFSLTMQIFAQEDKLEK 26 (165) T ss_pred EEEEHHHHHHHHHHHHCCCCHHHC T ss_conf 266205667557541200033321 Done!