Query gi|254780500|ref|YP_003064913.1| hypothetical protein CLIBASIA_01935 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 159 No_of_seqs 128 out of 302 Neff 5.3 Searched_HMMs 39220 Date Sun May 29 23:16:55 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780500.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG2913 OlmA Outer membrane li 99.9 2.3E-23 6E-28 161.0 13.9 133 17-154 7-146 (147) 2 PRK11548 hypothetical protein; 99.8 6.9E-18 1.8E-22 128.1 10.5 93 20-122 7-106 (113) 3 pfam04355 SmpA_OmlA SmpA / Oml 99.6 1.8E-15 4.7E-20 113.5 8.1 71 41-121 1-71 (71) 4 PRK11251 DNA-binding transcrip 97.5 0.00043 1.1E-08 45.1 6.8 96 19-126 4-99 (109) 5 pfam11399 DUF3192 Protein of u 93.7 0.17 4.4E-06 29.5 5.3 50 43-92 22-73 (102) 6 TIGR03516 ppisom_GldI peptidyl 67.8 7.9 0.0002 19.5 3.8 80 20-104 4-91 (177) 7 TIGR01781 Trep_dent_lipo Trepo 66.7 3.4 8.7E-05 21.7 1.7 28 14-41 2-29 (463) 8 pfam11006 DUF2845 Protein of u 65.2 5.7 0.00015 20.3 2.6 28 50-77 11-38 (85) 9 PRK10866 outer membrane protei 59.9 7.7 0.0002 19.6 2.5 21 18-38 5-25 (243) 10 COG2088 SpoVG Uncharacterized 49.7 22 0.00055 16.8 3.5 39 104-142 18-57 (95) 11 pfam07467 BLIP Beta-lactamase 48.6 14 0.00037 17.9 2.4 82 26-125 15-99 (183) 12 TIGR03524 GldJ gliding motilit 48.5 17 0.00044 17.4 2.8 24 16-39 5-28 (559) 13 TIGR03302 OM_YfiO outer membra 44.6 15 0.00038 17.8 1.9 23 19-41 3-25 (235) 14 PRK11023 hypothetical protein; 43.4 18 0.00045 17.4 2.2 25 15-39 1-25 (191) 15 PRK10802 peptidoglycan-associa 42.1 20 0.0005 17.1 2.2 22 18-39 7-28 (173) 16 TIGR01661 ELAV_HUD_SF ELAV/HuD 38.7 17 0.00044 17.4 1.5 14 143-156 365-378 (436) 17 TIGR03659 IsdE heme ABC transp 37.3 31 0.0008 15.9 2.7 51 20-70 6-57 (289) 18 TIGR02747 TraV type IV conjuga 37.2 26 0.00066 16.4 2.2 52 20-71 7-63 (174) 19 TIGR03114 cas_csf1 CRISPR-asso 34.1 38 0.00097 15.4 4.3 55 56-119 64-118 (202) 20 pfam06291 Lambda_Bor Bor prote 33.8 31 0.00079 15.9 2.2 27 19-45 3-29 (97) 21 PRK11063 metQ DL-methionine tr 33.4 37 0.00096 15.4 2.5 50 11-69 1-53 (271) 22 TIGR03511 GldH_lipo gliding mo 31.5 33 0.00085 15.7 2.0 24 19-42 6-30 (156) 23 PRK10215 hypothetical protein; 31.2 10 0.00026 18.8 -0.6 64 14-89 5-70 (219) 24 COG5633 Predicted periplasmic 30.6 16 0.00041 17.6 0.3 22 57-78 45-66 (123) 25 TIGR02722 lp_ uncharacterized 28.7 47 0.0012 14.8 2.5 94 19-125 5-108 (215) 26 TIGR02887 spore_ger_x_C germin 27.7 40 0.001 15.2 1.9 32 18-49 3-40 (400) 27 PRK09967 putative outer membra 27.2 46 0.0012 14.9 2.1 17 20-36 6-22 (160) 28 pfam02323 ELH Egg-laying hormo 27.2 23 0.00059 16.7 0.6 66 20-87 11-81 (255) 29 PRK04405 prsA peptidylprolyl i 26.8 51 0.0013 14.6 4.4 34 22-55 13-51 (298) 30 PRK09859 multidrug efflux syst 26.3 48 0.0012 14.8 2.1 16 20-35 8-23 (385) 31 pfam08006 DUF1700 Protein of u 25.8 49 0.0012 14.7 2.1 27 45-71 30-56 (181) 32 pfam04218 CENP-B_N CENP-B N-te 25.7 51 0.0013 14.6 2.1 25 47-71 14-38 (53) 33 PRK11671 mltC murein transglyc 25.2 46 0.0012 14.9 1.8 119 18-157 3-144 (360) 34 PRK11162 mltA murein transglyc 23.8 49 0.0012 14.7 1.7 29 18-46 3-31 (362) 35 COG4851 CamS Protein involved 23.8 58 0.0015 14.3 2.6 63 20-92 6-69 (382) 36 smart00242 MYSc Myosin. Large 23.5 59 0.0015 14.2 3.1 64 59-153 137-204 (677) 37 pfam10566 Glyco_hydro_97 Glyco 22.5 54 0.0014 14.5 1.7 18 22-39 9-26 (643) 38 PRK12407 flgH flagellar basal 22.3 51 0.0013 14.6 1.6 20 19-38 2-21 (220) 39 PRK11627 hypothetical protein; 21.8 64 0.0016 14.0 2.1 20 19-38 4-23 (192) 40 TIGR00752 slp outer membrane l 21.1 66 0.0017 13.9 2.0 26 18-43 3-29 (190) 41 TIGR03525 GldK gliding motilit 21.1 54 0.0014 14.4 1.5 54 19-74 3-71 (449) 42 PRK03002 prsA peptidylprolyl i 21.1 58 0.0015 14.3 1.7 30 20-49 8-43 (285) 43 pfam11359 gpUL132 Glycoprotein 20.2 69 0.0018 13.8 2.0 105 20-137 65-184 (235) No 1 >COG2913 OlmA Outer membrane lipoprotein OmlA (small protein A) [Cell envelope biogenesis, outer membrane] Probab=99.91 E-value=2.3e-23 Score=161.04 Aligned_cols=133 Identities=23% Similarity=0.330 Sum_probs=117.2 Q ss_pred HHHHHHHHHHHHHHCCCCCC-------CEECCCCCCHHHHHHHCCCCCHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEEE Q ss_conf 89999999999984355542-------10135003878886404899999999983785212244677530488872100 Q gi|254780500|r 17 FNKALFIIVIGLVVDYHVSY-------GSISGASLDMSAISLVSQGSSRSHVIESLGSPSFSILHNGNRSQSFYYVSQKK 89 (159) Q Consensus 17 ~~k~i~~~l~~~~~~~c~~~-------~~~~G~~id~~~i~~i~~g~tk~~V~~~LG~Ps~~~~f~~n~~~~~YYi~~k~ 89 (159) ...+++.+++.+.+++|... ..++||++|+++++++++||||+||..+||+|+++++|+.| +|||+++.. T Consensus 7 ~~~~~~a~l~~~als~Cst~~~~~~~~~i~qGn~~~~~~v~ql~~Gmtk~QV~~lLGtP~~~d~F~~~---~WyYv~~~~ 83 (147) T COG2913 7 AILAIAALLGAAALSGCSTLKVVYYNDDINQGNVLDQDAVDQLKVGMTKDQVLYLLGTPSLVDTFDTN---TWYYVFQQQ 83 (147) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCHHHHHHHHCCCEEECCCCCC---EEEEEEEEE T ss_conf 89999999999983148754222345432456515988998714798899999981997012577883---489998876 Q ss_pred ECCCHHCCCCEEEEEEEEEEEECCCCCEEEEEECCCCCCCEECCCCCCCCCCCCHHHHHHHHHCC Q ss_conf 00000101211113578999998998684321104345614034541588885224389998533 Q gi|254780500|r 90 KWFPVKFLSPKIMEYIVLKITFGEKGVVSSVSMERLPNRRFNPNPHTIPAPIQTADGFLRRLLNP 154 (159) Q Consensus 90 ~~~~~af~~pkvier~Vl~I~FD~~g~V~nI~~~~L~dg~~~~~~~~~~~~~~~~~~fl~~l~~~ 154 (159) .+. .+| .|++++|.+|+|+||++|+++++.-|...|++..+-.++.|.+++.+..+|+|++++ T Consensus 84 ~~~-~~~-~~~~~~q~~l~v~Fd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~l~~~~~~ 146 (147) T COG2913 84 RRG-VAF-GPEIVEQKTLTVYFDKDGVVTSIAVYKPADGKAFEAVNRTTPTPGKELTFLQQLVGN 146 (147) T ss_pred ECC-CCC-CCCCCCEEEEEEEECCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCC T ss_conf 235-444-777652047999987898278863026565554344457889987147899876148 No 2 >PRK11548 hypothetical protein; Provisional Probab=99.76 E-value=6.9e-18 Score=128.12 Aligned_cols=93 Identities=16% Similarity=0.188 Sum_probs=76.6 Q ss_pred HHHHHHHHHHHCCCCCC------C-EECCCCCCHHHHHHHCCCCCHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEEEECC Q ss_conf 99999999984355542------1-0135003878886404899999999983785212244677530488872100000 Q gi|254780500|r 20 ALFIIVIGLVVDYHVSY------G-SISGASLDMSAISLVSQGSSRSHVIESLGSPSFSILHNGNRSQSFYYVSQKKKWF 92 (159) Q Consensus 20 ~i~~~l~~~~~~~c~~~------~-~~~G~~id~~~i~~i~~g~tk~~V~~~LG~Ps~~~~f~~n~~~~~YYi~~k~~~~ 92 (159) ....+++++.+++|++- . ..+||+++++++++|++||||+||..+||+|.++++|++|+|+|+||.+.. T Consensus 7 ~~a~ll~~lllsgCS~~~~~vYk~dI~QGN~i~~~~v~~L~~GMTk~QV~~iLGtP~l~d~F~~~rWdYvy~~~~g---- 82 (113) T PRK11548 7 TAAAAVLLMLTAGCSTLERVVYRPDINQGNYLTPNDVAKIRVGMTQQQVAYALGTPMMSDPFGTNTWFYVFRQQPG---- 82 (113) T ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCCHHHHHHHCCCCCCCCCCCCCCEEEEEEEECC---- T ss_conf 9999999999801157877565765575676699999876189999999987189632388788746999999608---- Q ss_pred CHHCCCCEEEEEEEEEEEECCCCCEEEEEE Q ss_conf 001012111135789999989986843211 Q gi|254780500|r 93 PVKFLSPKIMEYIVLKITFGEKGVVSSVSM 122 (159) Q Consensus 93 ~~af~~pkvier~Vl~I~FD~~g~V~nI~~ 122 (159) -+..+++-+.+.||++|+|.+|+- T Consensus 83 ------~~~~~~~~ltl~Fd~~~ll~~Id~ 106 (113) T PRK11548 83 ------HEKVTQQTLTLTFNSSGVLTNIDN 106 (113) T ss_pred ------CCCEEEEEEEEEECCCCCEEECCC T ss_conf ------983588899999839997883489 No 3 >pfam04355 SmpA_OmlA SmpA / OmlA family. Lipoprotein Bacterial outer membrane lipoprotein, possibly involved in in maintaining the structural integrity of the cell envelope. Lipid attachment site is a conserved N terminal cysteine residue. Sometimes found adjacent to the OmpA domain (pfam00691). Probab=99.62 E-value=1.8e-15 Score=113.52 Aligned_cols=71 Identities=24% Similarity=0.408 Sum_probs=62.0 Q ss_pred CCCCCHHHHHHHCCCCCHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEEEECCCHHCCCCEEEEEEEEEEEECCCCCEEEE Q ss_conf 50038788864048999999999837852122446775304888721000000010121111357899999899868432 Q gi|254780500|r 41 GASLDMSAISLVSQGSSRSHVIESLGSPSFSILHNGNRSQSFYYVSQKKKWFPVKFLSPKIMEYIVLKITFGEKGVVSSV 120 (159) Q Consensus 41 G~~id~~~i~~i~~g~tk~~V~~~LG~Ps~~~~f~~n~~~~~YYi~~k~~~~~~af~~pkvier~Vl~I~FD~~g~V~nI 120 (159) ||+++++++++|++||||+||..+||+|+++++|++++| ||+.+..+ .|+..++++++|.||++++|+.+ T Consensus 1 Gn~~~~~~l~~l~~Gmtk~QV~~lLGtP~~~~~f~~~~W---~Yi~~~~~-------~~~~~~~~~l~v~Fd~~~~v~~~ 70 (71) T pfam04355 1 GNYVTQEDVDQLKPGMTKEQVRYLLGTPSLEDTFGPNTW---YYVYNFRR-------GGGAVEQQVLAVYFDDDGLVRNI 70 (71) T ss_pred CCCCCHHHHHHCCCCCCHHHHHHHCCCCCCCCCCCCCCE---EEEEEEEE-------CCCCEEEEEEEEEECCCCEEEEC T ss_conf 972899999751799999999987089621177678617---99999972-------89825999999998399788621 Q ss_pred E Q ss_conf 1 Q gi|254780500|r 121 S 121 (159) Q Consensus 121 ~ 121 (159) . T Consensus 71 ~ 71 (71) T pfam04355 71 G 71 (71) T ss_pred C T ss_conf 9 No 4 >PRK11251 DNA-binding transcriptional activator OsmE; Provisional Probab=97.48 E-value=0.00043 Score=45.10 Aligned_cols=96 Identities=16% Similarity=0.181 Sum_probs=65.3 Q ss_pred HHHHHHHHHHHHCCCCCCCEECCCCCCHHHHHHHCCCCCHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEEEECCCHHCCC Q ss_conf 99999999998435554210135003878886404899999999983785212244677530488872100000001012 Q gi|254780500|r 19 KALFIIVIGLVVDYHVSYGSISGASLDMSAISLVSQGSSRSHVIESLGSPSFSILHNGNRSQSFYYVSQKKKWFPVKFLS 98 (159) Q Consensus 19 k~i~~~l~~~~~~~c~~~~~~~G~~id~~~i~~i~~g~tk~~V~~~LG~Ps~~~~f~~n~~~~~YYi~~k~~~~~~af~~ 98 (159) .++..+.....+.+|..+...--++. +-.++.++.||||+||..+=|+||.+...-..+---.-|+ +. T Consensus 4 ~~l~aaa~~~~LaGCt~~d~~~~~~~-ePlVkdV~~GMT~qqV~~igG~Pss~~~~~~~~GtC~~Yi-----------L~ 71 (109) T PRK11251 4 GILSAAAVLTMLAGCTAYDRNPVQYV-EPVVKDVKKGMTRQQVLQIGGKPSSEVSMIHARGTCQTYI-----------LG 71 (109) T ss_pred HHHHHHHHHHHHHHCCCCCCCHHHHC-CHHHHHHHCCCCHHHHHHHCCCCCCEEEEECCCCCCCEEE-----------EE T ss_conf 58899999999971447767813431-5069888747579999985499864178860785220037-----------32 Q ss_pred CEEEEEEEEEEEECCCCCEEEEEECCCC Q ss_conf 1111357899999899868432110434 Q gi|254780500|r 99 PKIMEYIVLKITFGEKGVVSSVSMERLP 126 (159) Q Consensus 99 pkvier~Vl~I~FD~~g~V~nI~~~~L~ 126 (159) .+--+++-.-|.||+.|.|.+-..-+-. T Consensus 72 ~~dGk~q~YfVsfD~tG~V~~~GF~TC~ 99 (109) T PRK11251 72 NRDGKAQTYFVSFDDTGHVDNSGFQTCA 99 (109) T ss_pred CCCCCEEEEEEEECCCCCCCCCCHHHHH T ss_conf 4689615579997789858351220588 No 5 >pfam11399 DUF3192 Protein of unknown function (DUF3192). Some members in this family of proteins are annotated as lipoproteins however this cannot be confirmed. Probab=93.72 E-value=0.17 Score=29.47 Aligned_cols=50 Identities=26% Similarity=0.393 Sum_probs=36.8 Q ss_pred CCCHHHHHHHCCCCCHHHHHHHHCCCCCCCCC--CCCCCCEEEEEEEEEECC Q ss_conf 03878886404899999999983785212244--677530488872100000 Q gi|254780500|r 43 SLDMSAISLVSQGSSRSHVIESLGSPSFSILH--NGNRSQSFYYVSQKKKWF 92 (159) Q Consensus 43 ~id~~~i~~i~~g~tk~~V~~~LG~Ps~~~~f--~~n~~~~~YYi~~k~~~~ 92 (159) ......+++|..|+++++|..+||+|-++-.+ +++.....||=.++.+.- T Consensus 22 ~~Nr~~i~~L~lg~s~~~V~~~lG~aDfsEa~~~~~~~~qVLFYRTqh~~sD 73 (102) T pfam11399 22 AYNRQKIAKLSLGQSKDQVLTLLGTADFSEAKTKDDKQYQVLFYRTQHVKSD 73 (102) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHCCCCHHHHHHCCCCEEEEEEEEEEEECCC T ss_conf 9999988753568969999998389863888423896899999865431489 No 6 >TIGR03516 ppisom_GldI peptidyl-prolyl isomerase, gliding motility-associated. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldI is a FKBP-type peptidyl-prolyl cis-trans isomerase (pfam00254) linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Knockout of this gene abolishes the gliding phenotype. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. This family is only found in Bacteroidetes containing the suite of genes proposed to confer the gliding motility phenotype. Probab=67.85 E-value=7.9 Score=19.47 Aligned_cols=80 Identities=14% Similarity=0.147 Sum_probs=34.5 Q ss_pred HHHHHHHHHHHCCCCCCC------EECCCCCCHHHHHHHCCCCC-HHHHHHHHCC-CCCCCCCCCCCCCEEEEEEEEEEC Q ss_conf 999999999843555421------01350038788864048999-9999998378-521224467753048887210000 Q gi|254780500|r 20 ALFIIVIGLVVDYHVSYG------SISGASLDMSAISLVSQGSS-RSHVIESLGS-PSFSILHNGNRSQSFYYVSQKKKW 91 (159) Q Consensus 20 ~i~~~l~~~~~~~c~~~~------~~~G~~id~~~i~~i~~g~t-k~~V~~~LG~-Ps~~~~f~~n~~~~~YYi~~k~~~ 91 (159) .++++++.+++.+|.... ..+|.+++.+....-..-.. .+++...+-. +..+ +.....-.||++.++... T Consensus 4 ~i~~i~~~~~~~sC~~~e~RkPi~~~s~~f~~~S~~rNK~l~a~Ee~~I~~~i~~ds~~~--y~~S~sG~~Y~~~~k~~~ 81 (177) T TIGR03516 4 LIAVILLLLLLLGCKTPEARRPISRSSGTTIKLSAERNKKLIAAEEAAIKRIISADSIVK--YETSQNGFWYYYNQKDTG 81 (177) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC--EEECCCCCEEEEEECCCC T ss_conf 047898989987269964668866587514999999989999999999999998698553--568899628999623788 Q ss_pred CCHHCCCCEEEEE Q ss_conf 0001012111135 Q gi|254780500|r 92 FPVKFLSPKIMEY 104 (159) Q Consensus 92 ~~~af~~pkvier 104 (159) .+ ..|+.-|. T Consensus 82 ~~---~~P~~gd~ 91 (177) T TIGR03516 82 EG---TTPEFGDL 91 (177) T ss_pred CC---CCCCCCCE T ss_conf 88---89999998 No 7 >TIGR01781 Trep_dent_lipo Treponema denticola clustered lipoprotein; InterPro: IPR010103 This entry represents a family of six predicted lipoproteins from a region of about 20 tandemly arranged genes in the Treponema denticola genome. Two other neighbouring genes share the lipoprotein signal peptide region but do not show more extensive homology. The function of this locus is unknown.. Probab=66.71 E-value=3.4 Score=21.67 Aligned_cols=28 Identities=18% Similarity=0.090 Sum_probs=22.5 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCEECC Q ss_conf 7888999999999998435554210135 Q gi|254780500|r 14 KLSFNKALFIIVIGLVVDYHVSYGSISG 41 (159) Q Consensus 14 K~~~~k~i~~~l~~~~~~~c~~~~~~~G 41 (159) |....|+||+++|+..|-||.....-+| T Consensus 2 kKdKlKlIFIL~LAvLLFSC~ke~ke~~ 29 (463) T TIGR01781 2 KKDKLKLIFILMLAVLLFSCKKEVKESK 29 (463) T ss_pred CCCHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 6540468889999998604762113310 No 8 >pfam11006 DUF2845 Protein of unknown function (DUF2845). This bacterial family of proteins has no known function. Probab=65.18 E-value=5.7 Score=20.33 Aligned_cols=28 Identities=29% Similarity=0.438 Sum_probs=23.1 Q ss_pred HHHCCCCCHHHHHHHHCCCCCCCCCCCC Q ss_conf 6404899999999983785212244677 Q gi|254780500|r 50 SLVSQGSSRSHVIESLGSPSFSILHNGN 77 (159) Q Consensus 50 ~~i~~g~tk~~V~~~LG~Ps~~~~f~~n 77 (159) .++..|+++.+|+..-|+|..+...... T Consensus 11 ~lVs~GDs~~eVl~KCGeP~~k~~~~~~ 38 (85) T pfam11006 11 GLVSEGDSKAEVLAKCGEPASKDQRGEP 38 (85) T ss_pred CEECCCCCHHHHHHHCCCCCCCEEECEE T ss_conf 6305899779999761998753041103 No 9 >PRK10866 outer membrane protein assembly complex subunit YfiO; Provisional Probab=59.87 E-value=7.7 Score=19.55 Aligned_cols=21 Identities=5% Similarity=-0.069 Sum_probs=14.7 Q ss_pred HHHHHHHHHHHHHCCCCCCCE Q ss_conf 999999999998435554210 Q gi|254780500|r 18 NKALFIIVIGLVVDYHVSYGS 38 (159) Q Consensus 18 ~k~i~~~l~~~~~~~c~~~~~ 38 (159) .+++.++++++.++||.+.+. T Consensus 5 k~l~~~~~l~l~l~gCs~~~~ 25 (243) T PRK10866 5 KYLVAAATLSLFLAGCSGSKE 25 (243) T ss_pred HHHHHHHHHHHHHHHCCCCCC T ss_conf 999999999999987689988 No 10 >COG2088 SpoVG Uncharacterized protein, involved in the regulation of septum location [Cell envelope biogenesis, outer membrane] Probab=49.73 E-value=22 Score=16.84 Aligned_cols=39 Identities=18% Similarity=0.154 Sum_probs=32.4 Q ss_pred EEEEEEEECCCCCEEEEEECCCCCCCEECCC-CCCCCCCC Q ss_conf 5789999989986843211043456140345-41588885 Q gi|254780500|r 104 YIVLKITFGEKGVVSSVSMERLPNRRFNPNP-HTIPAPIQ 142 (159) Q Consensus 104 r~Vl~I~FD~~g~V~nI~~~~L~dg~~~~~~-~~~~~~~~ 142 (159) +-++.|+||+.=+|.++....=++|-++++. +|+|++.= T Consensus 18 kA~vsvT~D~efVvhdirVi~G~~GlfVAMPSrrt~dgEF 57 (95) T COG2088 18 KAYVSVTLDNEFVVHDIRVIEGNNGLFVAMPSRRTPDGEF 57 (95) T ss_pred EEEEEEEECCEEEEECCEEEECCCCEEEECCCCCCCCCCH T ss_conf 9999998436289822189837862599866765899631 No 11 >pfam07467 BLIP Beta-lactamase inhibitor (BLIP). The structure of BLIP reveals two structural domains, which form a polar, concave surface that docks onto a predominantly polar, convex protrusion on beta-lactamase. The ability of BLIP to adapt to a variety of class A beta-lactamases is thought to be due to flexibility between these two domains. Probab=48.63 E-value=14 Score=17.90 Aligned_cols=82 Identities=15% Similarity=0.086 Sum_probs=47.2 Q ss_pred HHHHHCCCCCCCEECCCCCCHHHHHHHCCCCCHHHHHHHHCCC---CCCCCCCCCCCCEEEEEEEEEECCCHHCCCCEEE Q ss_conf 9998435554210135003878886404899999999983785---2122446775304888721000000010121111 Q gi|254780500|r 26 IGLVVDYHVSYGSISGASLDMSAISLVSQGSSRSHVIESLGSP---SFSILHNGNRSQSFYYVSQKKKWFPVKFLSPKIM 102 (159) Q Consensus 26 ~~~~~~~c~~~~~~~G~~id~~~i~~i~~g~tk~~V~~~LG~P---s~~~~f~~n~~~~~YYi~~k~~~~~~af~~pkvi 102 (159) ...++.+-.+-.... -+..+...+|+-||++.||.++.|.- +..+.+.+. ..-|-+ .. +..| T Consensus 15 agavlata~sa~a~g--~~T~eKy~qIqFGMtr~qV~~i~Gae~~C~~~G~~g~~---i~C~~~-~~------dy~~--- 79 (183) T pfam07467 15 AGAVLATAGSANAAG--GFTGEKYTQIQFGMTRQQVWDIAGAEQNCETGGSFGDS---IHCRGH-AG------DYYP--- 79 (183) T ss_pred HHHHEECCCCCHHCC--CCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCE---EEEECC-CC------CCCC--- T ss_conf 533121354401115--75898976666155699999985766666688867754---788545-77------7454--- Q ss_pred EEEEEEEEECCCCCEEEEEECCC Q ss_conf 35789999989986843211043 Q gi|254780500|r 103 EYIVLKITFGEKGVVSSVSMERL 125 (159) Q Consensus 103 er~Vl~I~FD~~g~V~nI~~~~L 125 (159) -=-..|++.+.|.+.....| T Consensus 80 ---yg~F~Ft~adkL~~K~q~~l 99 (183) T pfam07467 80 ---YGTFGFTSAAKLDSKRQEKL 99 (183) T ss_pred ---CCEEEECCHHHHCCCCCCCC T ss_conf ---00167525366420021533 No 12 >TIGR03524 GldJ gliding motility-associated lipoprotein GldJ. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldJ is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldJ abolish the gliding phenotype. GldJ is homologous to GldK. There is a GldJ homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=48.47 E-value=17 Score=17.41 Aligned_cols=24 Identities=21% Similarity=0.191 Sum_probs=17.7 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCEE Q ss_conf 889999999999984355542101 Q gi|254780500|r 16 SFNKALFIIVIGLVVDYHVSYGSI 39 (159) Q Consensus 16 ~~~k~i~~~l~~~~~~~c~~~~~~ 39 (159) .+.|++|++++++.+.||...++. T Consensus 5 ~~~~~~~~~~~~~~~~sC~~~~~~ 28 (559) T TIGR03524 5 NVFKITFIVFVSLLFVSCKKKSSS 28 (559) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 889999999999997642588888 No 13 >TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795). Probab=44.61 E-value=15 Score=17.83 Aligned_cols=23 Identities=9% Similarity=0.065 Sum_probs=16.1 Q ss_pred HHHHHHHHHHHHCCCCCCCEECC Q ss_conf 99999999998435554210135 Q gi|254780500|r 19 KALFIIVIGLVVDYHVSYGSISG 41 (159) Q Consensus 19 k~i~~~l~~~~~~~c~~~~~~~G 41 (159) ++++++++++.++||.+.+.... T Consensus 3 ~~~~~~~~~~~l~sCs~~~~~~~ 25 (235) T TIGR03302 3 LLILLLALLLLLAGCSSKKKKEA 25 (235) T ss_pred HHHHHHHHHHHHHHCCCCCCCCC T ss_conf 99999999999987389976655 No 14 >PRK11023 hypothetical protein; Provisional Probab=43.39 E-value=18 Score=17.35 Aligned_cols=25 Identities=12% Similarity=0.208 Sum_probs=17.6 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCEE Q ss_conf 8889999999999984355542101 Q gi|254780500|r 15 LSFNKALFIIVIGLVVDYHVSYGSI 39 (159) Q Consensus 15 ~~~~k~i~~~l~~~~~~~c~~~~~~ 39 (159) |.+.+.+.++++++.+++|++.... T Consensus 1 mk~~~~l~~l~~~~~L~GC~~~~i~ 25 (191) T PRK11023 1 MKALSPLAVLISALLLQGCVAAAVV 25 (191) T ss_pred CCHHHHHHHHHHHHHHCCCHHHEEE T ss_conf 9448899999999997268233151 No 15 >PRK10802 peptidoglycan-associated outer membrane lipoprotein; Provisional Probab=42.10 E-value=20 Score=17.11 Aligned_cols=22 Identities=18% Similarity=0.099 Sum_probs=15.1 Q ss_pred HHHHHHHHHHHHHCCCCCCCEE Q ss_conf 9999999999984355542101 Q gi|254780500|r 18 NKALFIIVIGLVVDYHVSYGSI 39 (159) Q Consensus 18 ~k~i~~~l~~~~~~~c~~~~~~ 39 (159) .|.+++++.++.+.+|.+.+.. T Consensus 7 ~K~l~ia~~~l~LaACSS~~~~ 28 (173) T PRK10802 7 LKGLMIALPVMAIAACSSNKNA 28 (173) T ss_pred HHHHHHHHHHHHHHHCCCCCCC T ss_conf 9999999999999855799988 No 16 >TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor; InterPro: IPR006548 These sequences represent the ELAV/HuD subfamily of splicing factors found in metazoa. HuD stands for the human paraneoplastic encephalomyelitis antigen D of which there are 4 variants in human . ELAV stands for the Drosophila Embryonic lethal abnormalvisual protein . ELAV-like splicing factors are also known in human as HuB (ELAV-like protein 2), HuC (ELAV-like protein 3, Paraneoplastic cerebellar degeneration-associated antigen) and HuR (ELAV-like protein 1). These genes are most closely related to the sex-lethal subfamily of splicing factors found in Dipteran insects (IPR006546 from INTERPRO). ; GO: 0003723 RNA binding. Probab=38.74 E-value=17 Score=17.44 Aligned_cols=14 Identities=36% Similarity=0.641 Sum_probs=5.1 Q ss_pred CHHHHHHHHHCCCC Q ss_conf 22438999853346 Q gi|254780500|r 143 TADGFLRRLLNPNG 156 (159) Q Consensus 143 ~~~~fl~~l~~~~G 156 (159) +|+..|-||||.+| T Consensus 365 tde~~LWQLFgpFG 378 (436) T TIGR01661 365 TDEAVLWQLFGPFG 378 (436) T ss_pred CHHHHHHHHCCCCC T ss_conf 15679987427676 No 17 >TIGR03659 IsdE heme ABC transporter, heme-binding protein isdE. This family of ABC substrate-binding proteins is observed primarily in close proximity with proteins localized to the cell wall and bearing the NEAT (NEAr Transporter, pfam05031) heme-binding domain. IsdE has been shown to bind heme and is involved in the process of scavenging heme for the purpose of obtaining iron. Probab=37.32 E-value=31 Score=15.88 Aligned_cols=51 Identities=12% Similarity=0.066 Sum_probs=29.8 Q ss_pred HHHHHHHHHHHCCCCCCCEECCCCCCHHHHHHHCCC-CCHHHHHHHHCCCCC Q ss_conf 999999999843555421013500387888640489-999999998378521 Q gi|254780500|r 20 ALFIIVIGLVVDYHVSYGSISGASLDMSAISLVSQG-SSRSHVIESLGSPSF 70 (159) Q Consensus 20 ~i~~~l~~~~~~~c~~~~~~~G~~id~~~i~~i~~g-~tk~~V~~~LG~Ps~ 70 (159) +++++++.+++.+|.+...-++.--.+..-..|+.. .+-.+++..||-+.. T Consensus 6 ~~~~~~~~~~l~~C~~~~~~~~~~~~~~~p~RIV~l~~~~~eil~~Lg~~~V 57 (289) T TIGR03659 6 LILLALLSLGLTGCSSSKAESKVSNKKSKEERIVATSVAVTEILDKLDLDLV 57 (289) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCCHHHHHHHCCCCEE T ss_conf 9999999999843289987776677789996599868637999998699759 No 18 >TIGR02747 TraV type IV conjugative transfer system protein TraV; InterPro: IPR014118 This entry includes TraV, which is a component of conjugative type IV secretion system. TraV is an outer membrane lipoprotein that is believed to interact with the secretin TraK , , . The alignment contains three conserved cysteines in the N-terminal half.. Probab=37.18 E-value=26 Score=16.38 Aligned_cols=52 Identities=12% Similarity=0.165 Sum_probs=32.6 Q ss_pred HHHHHHHHHHHCCCCC-CCEECCCCCCHHHH-HHHCCCCCHHHHHH---HHCCCCCC Q ss_conf 9999999998435554-21013500387888-64048999999999---83785212 Q gi|254780500|r 20 ALFIIVIGLVVDYHVS-YGSISGASLDMSAI-SLVSQGSSRSHVIE---SLGSPSFS 71 (159) Q Consensus 20 ~i~~~l~~~~~~~c~~-~~~~~G~~id~~~i-~~i~~g~tk~~V~~---~LG~Ps~~ 71 (159) ++.+.++.+.|++|++ -+.+|+--++=+.. +.+..-.+.+||-+ +|+.|-.. T Consensus 7 L~~~~~~~f~LtGCsa~~g~i~~~~f~C~~~r~~~~~C~~md~V~~~~~~l~~q~~~ 63 (174) T TIGR02747 7 LLLLACVAFLLTGCSASLGLIHNSDFSCEGTRGGVGTCATMDQVNDDKRLLKAQKEA 63 (174) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCEECCCCCCCCEEEEHHHHHHHHHHHCCCHHH T ss_conf 999999987200257745652588975545338854200287767787642274111 No 19 >TIGR03114 cas_csf1 CRISPR-associated protein, Csf1 family. Members of this family show up near CRISPR repeats in Acidithiobacillus ferrooxidans ATCC 23270, Azoarcus sp. EbN1, and Rhodoferax ferrireducens DSM 15236. In the latter two species, the CRISPR/cas locus is found on a plasmid. This family is one of several characteristic of a type of CRISPR-associated (cas) gene cluster we designate Aferr after A. ferrooxidans, where it is both chromosomal and the only type of cas gene cluster found. The gene is designated csf1 (CRISPR/cas Subtype as in A. ferrooxidans protein 1), as it lies closest to the repeats. Probab=34.08 E-value=38 Score=15.36 Aligned_cols=55 Identities=15% Similarity=0.228 Sum_probs=39.2 Q ss_pred CCHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEEEECCCHHCCCCEEEEEEEEEEEECCCCCEEE Q ss_conf 9999999983785212244677530488872100000001012111135789999989986843 Q gi|254780500|r 56 SSRSHVIESLGSPSFSILHNGNRSQSFYYVSQKKKWFPVKFLSPKIMEYIVLKITFGEKGVVSS 119 (159) Q Consensus 56 ~tk~~V~~~LG~Ps~~~~f~~n~~~~~YYi~~k~~~~~~af~~pkvier~Vl~I~FD~~g~V~n 119 (159) +++++|..-|=.| .+.+| ...|-.++..+ +++.+|.-..|.++.+..|++.++-. T Consensus 64 ~sn~~V~~~llnp-pepPf------va~~aTrk~qH--m~WrapVnLSrd~m~vRvgd~~L~IR 118 (202) T TIGR03114 64 ASNVDVARGLLNP-PEPPF------VALYATRKSQH--MAWRAPVNLSRDLMYVRVGDRDLKIR 118 (202) T ss_pred CCHHHHHHHHHCC-CCCCE------EEEEECCHHHH--HHEECCCCCCCCEEEEEECCCEEEEE T ss_conf 4268999998289-99988------87553011110--11006655570117999767215773 No 20 >pfam06291 Lambda_Bor Bor protein. This family consists of several Bacteriophage lambda Bor and Escherichia coli Iss proteins. Expression of bor significantly increases the survival of the Escherichia coli host cell in animal serum. This property is a well known bacterial virulence determinant indeed, bor and its adjacent sequences are highly homologous to the iss serum resistance locus of the plasmid ColV2-K94, which confers virulence in animals. It has been suggested that lysogeny may generally have a role in bacterial survival in animal hosts, and perhaps in pathogenesis. Probab=33.82 E-value=31 Score=15.90 Aligned_cols=27 Identities=19% Similarity=0.201 Sum_probs=19.5 Q ss_pred HHHHHHHHHHHHCCCCCCCEECCCCCC Q ss_conf 999999999984355542101350038 Q gi|254780500|r 19 KALFIIVIGLVVDYHVSYGSISGASLD 45 (159) Q Consensus 19 k~i~~~l~~~~~~~c~~~~~~~G~~id 45 (159) |+++.+.+++.+++|..+...=|+-.+ T Consensus 3 k~~~~~~~allLtgCa~QT~~~~~~~~ 29 (97) T pfam06291 3 KMLFAAALALLITGCAQQTFTVGNKPT 29 (97) T ss_pred HHHHHHHHHHHHCCCCCEEEEECCCCC T ss_conf 259999999997213304999357877 No 21 >PRK11063 metQ DL-methionine transporter substrate-binding subunit; Provisional Probab=33.41 E-value=37 Score=15.40 Aligned_cols=50 Identities=12% Similarity=-0.060 Sum_probs=29.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCEECCCCCCHHHHHHHCCCCC---HHHHHHHHCCCC Q ss_conf 87778889999999999984355542101350038788864048999---999999837852 Q gi|254780500|r 11 FFGKLSFNKALFIIVIGLVVDYHVSYGSISGASLDMSAISLVSQGSS---RSHVIESLGSPS 69 (159) Q Consensus 11 ~~~K~~~~k~i~~~l~~~~~~~c~~~~~~~G~~id~~~i~~i~~g~t---k~~V~~~LG~Ps 69 (159) |-+||....++..+++.+.+++|-+... .-+.|++|.+ .+++.+..-.|. T Consensus 1 m~~~~k~~~~~~~l~~~l~l~gCg~~~~---------~~~~ikVG~~~gp~~ei~e~~~~~~ 53 (271) T PRK11063 1 MAFKFKTFAAVGALIGSLALVGCGQDEK---------DPNHIKVGVIVGAEQQVAEVAQKVA 53 (271) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCCCC---------CCCEEEEEECCCCHHHHHHHHHHHH T ss_conf 9602999999999999997500589856---------7981899846898799999988888 No 22 >TIGR03511 GldH_lipo gliding motility-associated lipoprotein GldH. Members of this protein family are predicted lipoproteins, exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). Members include GldH, a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Not all Bacteroidetes with members of this protein family may have gliding motility. Probab=31.50 E-value=33 Score=15.70 Aligned_cols=24 Identities=4% Similarity=-0.090 Sum_probs=15.0 Q ss_pred HHHHHHHHHHHHCCCCCCC-EECCC Q ss_conf 9999999999843555421-01350 Q gi|254780500|r 19 KALFIIVIGLVVDYHVSYG-SISGA 42 (159) Q Consensus 19 k~i~~~l~~~~~~~c~~~~-~~~G~ 42 (159) ..++++++++++.||...+ ..+-| T Consensus 6 ~~i~~~l~~~ll~SC~~~~~vy~~y 30 (156) T TIGR03511 6 NSISFFLGACVLVSCTENTDVYHSY 30 (156) T ss_pred HHHHHHHHHHHEEEECCCCCEEEEE T ss_conf 4499999999707638996335676 No 23 >PRK10215 hypothetical protein; Provisional Probab=31.21 E-value=10 Score=18.76 Aligned_cols=64 Identities=19% Similarity=0.278 Sum_probs=37.1 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCEECCCCCCHHHHHHHCCCC--CHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEEE Q ss_conf 7888999999999998435554210135003878886404899--999999983785212244677530488872100 Q gi|254780500|r 14 KLSFNKALFIIVIGLVVDYHVSYGSISGASLDMSAISLVSQGS--SRSHVIESLGSPSFSILHNGNRSQSFYYVSQKK 89 (159) Q Consensus 14 K~~~~k~i~~~l~~~~~~~c~~~~~~~G~~id~~~i~~i~~g~--tk~~V~~~LG~Ps~~~~f~~n~~~~~YYi~~k~ 89 (159) +.-+.|+..++.+...+++|.+..-+ +...-=.-|. -|.|.+--|+.. -+.|+...|-+|.+.. T Consensus 5 ~k~~~k~~~l~~~~f~LSgC~t~~L~-------~~~~~~~~~~t~wk~D~I~~ls~a-----~~~ng~~gwvFvGe~~ 70 (219) T PRK10215 5 FKGFFKAAGLLPLAFFLSGCISFGLV-------STTAGGEKGYSQWKSDTITGLSLA-----KDSNGTKGWVFVGEHF 70 (219) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHH-------HCCCCCCCCCEEECCCEEEEEEEE-----ECCCCCEEEEEEECCC T ss_conf 03544452476689999877665542-------302578766322047658996433-----1278833479993445 No 24 >COG5633 Predicted periplasmic lipoprotein [General function prediction only] Probab=30.62 E-value=16 Score=17.61 Aligned_cols=22 Identities=32% Similarity=0.371 Sum_probs=14.2 Q ss_pred CHHHHHHHHCCCCCCCCCCCCC Q ss_conf 9999999837852122446775 Q gi|254780500|r 57 SRSHVIESLGSPSFSILHNGNR 78 (159) Q Consensus 57 tk~~V~~~LG~Ps~~~~f~~n~ 78 (159) .+.+..+.=|+|+..+...+++ T Consensus 45 e~~~l~~sd~~~~~~s~l~N~~ 66 (123) T COG5633 45 EKPVLSESDGQPSASSVLKNKR 66 (123) T ss_pred CCCEEEEECCCCCEEEEEECCC T ss_conf 5870266315321158975155 No 25 >TIGR02722 lp_ uncharacterized lipoprotein; InterPro: IPR014094 Members of this protein family are restricted to the Proteobacteria, and all are predicted lipoproteins. In genomes that contain the thiK gene for the salvage enzyme thiamin kinase, the member of this family is encoded nearby.. Probab=28.70 E-value=47 Score=14.81 Aligned_cols=94 Identities=11% Similarity=0.148 Sum_probs=51.6 Q ss_pred HHHHHHHHHHHHCCCCCCC--EECCCCCC-HHHHHH---HCCCCCHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEEEECC Q ss_conf 9999999999843555421--01350038-788864---04899999999983785212244677530488872100000 Q gi|254780500|r 19 KALFIIVIGLVVDYHVSYG--SISGASLD-MSAISL---VSQGSSRSHVIESLGSPSFSILHNGNRSQSFYYVSQKKKWF 92 (159) Q Consensus 19 k~i~~~l~~~~~~~c~~~~--~~~G~~id-~~~i~~---i~~g~tk~~V~~~LG~Ps~~~~f~~n~~~~~YYi~~k~~~~ 92 (159) .+++++++.+++++|++.. ...++-.+ .+.+.. +........+....=.|..... | |--..+= T Consensus 5 ~l~~~~~~Al~L~GC~~~~~~y~~~~~~~~~~~~~~tl~~~~~d~~~~a~~~~~pPk~~~~-d------wsm~~~P---- 73 (215) T TIGR02722 5 ILIFVALLALLLSGCVSQRVAYTDATDPQPVEPVTPTLGLSSTDLEPIANKMIQPPKVRKI-D------WSMLSSP---- 73 (215) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCEEEECCCCCCCHHHHHHHCCCCCHHHH-H------HHHHCCC---- T ss_conf 7899999999985468872665688756677760577668886723347651388741256-4------6652688---- Q ss_pred CHHCCCC----EEEEEEEEEEEECCCCCEEEEEECCC Q ss_conf 0010121----11135789999989986843211043 Q gi|254780500|r 93 PVKFLSP----KIMEYIVLKITFGEKGVVSSVSMERL 125 (159) Q Consensus 93 ~~af~~p----kvier~Vl~I~FD~~g~V~nI~~~~L 125 (159) +|-.+ ++..+.||.|.==+|..=.+|....| T Consensus 74 --~v~~~~~~~gv~~~~il~vd~i~N~T~~~idt~~l 108 (215) T TIGR02722 74 --AVAQITPGDGVETRPILLVDSIKNDTSEHIDTESL 108 (215) T ss_pred --CCCCCCCCCCCCCCCEEEEEEEECCCCHHHHHHHH T ss_conf --40223677872126778985121688401338999 No 26 >TIGR02887 spore_ger_x_C germination protein, Ger(x)C family; InterPro: IPR008844 The GerAA, -AB, and -AC proteins of the Bacillus subtilis spore are required for the germination response to L-alanine as the sole germinant. Members of GerAC family are thought to be located in the inner spore membrane. Although the function of this family is unclear, they are likely to encode the components of the germination apparatus that respond directly to this germinant, mediating the spore's response .; GO: 0009847 spore germination, 0016020 membrane. Probab=27.67 E-value=40 Score=15.23 Aligned_cols=32 Identities=22% Similarity=0.202 Sum_probs=23.1 Q ss_pred HHHHHHHHHHHHHCCCCCCCEE------CCCCCCHHHH Q ss_conf 9999999999984355542101------3500387888 Q gi|254780500|r 18 NKALFIIVIGLVVDYHVSYGSI------SGASLDMSAI 49 (159) Q Consensus 18 ~k~i~~~l~~~~~~~c~~~~~~------~G~~id~~~i 49 (159) .++++++++++.+++|-....+ +|--+|...- T Consensus 3 ~~~~~~~~~~~lL~GCwd~~ei~~~~~v~~~g~D~~e~ 40 (400) T TIGR02887 3 LKILLLILALLLLTGCWDKREIEDLALVQGIGIDKGED 40 (400) T ss_pred HHHHHHHHHHHHHCCCCCCHHHHHEEEEEEEEEECCCC T ss_conf 66789999999832665620064211444789851357 No 27 >PRK09967 putative outer membrane lipoprotein; Provisional Probab=27.16 E-value=46 Score=14.88 Aligned_cols=17 Identities=18% Similarity=0.061 Sum_probs=12.4 Q ss_pred HHHHHHHHHHHCCCCCC Q ss_conf 99999999984355542 Q gi|254780500|r 20 ALFIIVIGLVVDYHVSY 36 (159) Q Consensus 20 ~i~~~l~~~~~~~c~~~ 36 (159) +..++++++++++|.+. T Consensus 6 ~~~~l~~~l~lsgC~s~ 22 (160) T PRK09967 6 VAPLIFTSLILTGCQSP 22 (160) T ss_pred HHHHHHHHHHHHHCCCC T ss_conf 99999999999845899 No 28 >pfam02323 ELH Egg-laying hormone precursor. This family consists of egg-laying hormone (ELH) precursor and atrial gland peptides form little and California sea hare. The family also includes ovulation prohormone precursor from great pond snail. This family thus represents a conserved gastropoda ovulation and egg production prohormone. Note that many of the proteins present are further cleaved to give individual peptides. Neuropeptidergic bag cells of the marine mollusk Aplysia californica synthesize an egg-laying hormone (ELH) precursor protein which is cleaved to generate several bioactive peptides including ELH, bag cell peptides (BCP) and acidic peptide (AP). Probab=27.15 E-value=23 Score=16.69 Aligned_cols=66 Identities=17% Similarity=0.085 Sum_probs=40.3 Q ss_pred HHHHHHHHHHHCCCCCCCEE--CCC-CCCHHHHHHHCC--CCCHHHHHHHHCCCCCCCCCCCCCCCEEEEEEE Q ss_conf 99999999984355542101--350-038788864048--999999999837852122446775304888721 Q gi|254780500|r 20 ALFIIVIGLVVDYHVSYGSI--SGA-SLDMSAISLVSQ--GSSRSHVIESLGSPSFSILHNGNRSQSFYYVSQ 87 (159) Q Consensus 20 ~i~~~l~~~~~~~c~~~~~~--~G~-~id~~~i~~i~~--g~tk~~V~~~LG~Ps~~~~f~~n~~~~~YYi~~ 87 (159) ++++++|+..-+.|++-.+. ||. +.++.++.--.+ -.+-++--+.-+.|.....|+... ..||... T Consensus 11 ~~~li~Cl~lSslc~ss~S~~vhG~~f~~~Ravk~ssp~vVlSp~D~g~n~~~~~~~eafdkss--~dydd~~ 81 (255) T pfam02323 11 TMFLILCLWLSSLCVSSQSASVHGKDFATNRAVKGSSPEVVLSPEDSGSNRPQPVYGEAFDKSS--ADYDDGS 81 (255) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCHHCCCCCCCCCCEEEECCCCCCCCCCCCCCCCHHCCCC--CCCCCCC T ss_conf 7899999998767653001234561000134646788637746121245577654220001353--1023675 No 29 >PRK04405 prsA peptidylprolyl isomerase; Provisional Probab=26.76 E-value=51 Score=14.60 Aligned_cols=34 Identities=15% Similarity=0.022 Sum_probs=17.5 Q ss_pred HHHHHHHHHCCCCCC----CEECCCCCCHHH-HHHHCCC Q ss_conf 999999984355542----101350038788-8640489 Q gi|254780500|r 22 FIIVIGLVVDYHVSY----GSISGASLDMSA-ISLVSQG 55 (159) Q Consensus 22 ~~~l~~~~~~~c~~~----~~~~G~~id~~~-i~~i~~g 55 (159) ..+++.+.+++|.+. .+++|.-+..+. .+.++.. T Consensus 13 ~~~~~~l~LaaCss~~~~Vat~kg~~IT~~e~y~~~k~~ 51 (298) T PRK04405 13 ASAGLLLSLAGCSSNNKTVATYKGGKITESQYYKEMKQS 51 (298) T ss_pred HHHHHHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHHCC T ss_conf 999999999871799976998639872599999998644 No 30 >PRK09859 multidrug efflux system protein MdtE; Provisional Probab=26.27 E-value=48 Score=14.79 Aligned_cols=16 Identities=6% Similarity=0.113 Sum_probs=7.4 Q ss_pred HHHHHHHHHHHCCCCC Q ss_conf 9999999998435554 Q gi|254780500|r 20 ALFIIVIGLVVDYHVS 35 (159) Q Consensus 20 ~i~~~l~~~~~~~c~~ 35 (159) ++.++++.+.+++|.. T Consensus 8 li~ll~~~~lL~gC~~ 23 (385) T PRK09859 8 LIPLLFCGAMLTACDD 23 (385) T ss_pred HHHHHHHHHHHHCCCC T ss_conf 9999999999953799 No 31 >pfam08006 DUF1700 Protein of unknown function (DUF1700). This family contains many hypothetical bacterial proteins and two putative membrane proteins. Probab=25.80 E-value=49 Score=14.72 Aligned_cols=27 Identities=22% Similarity=0.334 Sum_probs=20.9 Q ss_pred CHHHHHHHCCCCCHHHHHHHHCCCCCC Q ss_conf 878886404899999999983785212 Q gi|254780500|r 45 DMSAISLVSQGSSRSHVIESLGSPSFS 71 (159) Q Consensus 45 d~~~i~~i~~g~tk~~V~~~LG~Ps~~ 71 (159) ++--.+....|.|-+|+++.||+|.-- T Consensus 30 ~e~f~~~~~~G~se~ei~~~LG~P~~i 56 (181) T pfam08006 30 EEHFYEGEEEGKSEEEIIKELGNPKTI 56 (181) T ss_pred HHHHHHHHHCCCCHHHHHHHHCCHHHH T ss_conf 999998766899999999882999999 No 32 >pfam04218 CENP-B_N CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B dimers either bind two separate DNA molecules or alternatively, they may bind two CENP-B boxes on one DNA molecule, with the intervening stretch of DNA forming a loop structure. The CENP-B DNA-binding domain consists of two repeating domains, RP1 and RP2. This family corresponds to RP1 has been shown to consist of four helices in a helix-turn-helix structure. Probab=25.71 E-value=51 Score=14.61 Aligned_cols=25 Identities=16% Similarity=0.328 Sum_probs=21.2 Q ss_pred HHHHHHCCCCCHHHHHHHHCCCCCC Q ss_conf 8886404899999999983785212 Q gi|254780500|r 47 SAISLVSQGSSRSHVIESLGSPSFS 71 (159) Q Consensus 47 ~~i~~i~~g~tk~~V~~~LG~Ps~~ 71 (159) +.++.+..|.++++|...+|-|..+ T Consensus 14 ~iI~~~e~G~s~~~lar~~gi~~ST 38 (53) T pfam04218 14 EVIQRSEEGESKASLARRFNVPRST 38 (53) T ss_pred HHHHHHHCCCCHHHHHHHHCCCHHH T ss_conf 9999998589688999996987979 No 33 >PRK11671 mltC murein transglycosylase C; Provisional Probab=25.17 E-value=46 Score=14.87 Aligned_cols=119 Identities=12% Similarity=0.135 Sum_probs=55.5 Q ss_pred HHHHHHHHHHHHHCCCCCC-CEECCC---CCCHHHHHHHCCCCCHHHHHHHHCCC--CCCCCCCCCCCCEEEEEEEEEEC Q ss_conf 9999999999984355542-101350---03878886404899999999983785--21224467753048887210000 Q gi|254780500|r 18 NKALFIIVIGLVVDYHVSY-GSISGA---SLDMSAISLVSQGSSRSHVIESLGSP--SFSILHNGNRSQSFYYVSQKKKW 91 (159) Q Consensus 18 ~k~i~~~l~~~~~~~c~~~-~~~~G~---~id~~~i~~i~~g~tk~~V~~~LG~P--s~~~~f~~n~~~~~YYi~~k~~~ 91 (159) .|+++++++...+.+|.+. ++..-+ .=|...++.+. |.=...+..+-|.- ...+. .+|.=|...-..+ T Consensus 3 ~k~~~~~~~~~~l~~c~~~~~~~~~~~~~~kdt~~~~~l~-~~f~~ni~~~Wg~~e~~~a~~-----~~YVKYtd~Y~tR 76 (360) T PRK11671 3 KKYLALALIAPLLISCSSSKKGDTYNEAWVKDTNGFDILM-GQFAHNIENIWGFKEVLIAGP-----KDYVKYTDQYQTR 76 (360) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHCCCHHHHHH-HHHHHHHHHHCCCCCCCCCCC-----CCEEEECCCCCEE T ss_conf 4688999999999871046888763377761230799999-998765887438764313677-----5347714665301 Q ss_pred CCHHCCCCEEEEEEEEEEEECCCCC-EEEEE--------------ECCC-CCCC-EECCCCCCCCCCCCHHHHHHHHHCC Q ss_conf 0001012111135789999989986-84321--------------1043-4561-4034541588885224389998533 Q gi|254780500|r 92 FPVKFLSPKIMEYIVLKITFGEKGV-VSSVS--------------MERL-PNRR-FNPNPHTIPAPIQTADGFLRRLLNP 154 (159) Q Consensus 92 ~~~af~~pkvier~Vl~I~FD~~g~-V~nI~--------------~~~L-~dg~-~~~~~~~~~~~~~~~~~fl~~l~~~ 154 (159) . .|.||+.-+ |+.++ ..-. +|-+ ++-.+..-+.+...++.+..|.+++ T Consensus 77 a---------------~vdF~~G~I~vETl~~~~p~~~L~~AIv~tLL~~~Dp~~vDl~sd~~~~~~~~~PfL~gqV~D~ 141 (360) T PRK11671 77 S---------------HINFDDGTITIETIAGTNPAAHLRRAIIKTLLMGDDPSSIDLYSDVDDIPISKEPFLYGQVVDN 141 (360) T ss_pred E---------------EEECCCCEEEEEECCCCCHHHHHHHHHHHHHCCCCCCCHHCCCCCCCCCCCCCCCEEHHHEECC T ss_conf 2---------------6634788399996577685799999999997688995110343555557668985345420348 Q ss_pred CCC Q ss_conf 467 Q gi|254780500|r 155 NGR 157 (159) Q Consensus 155 ~Gr 157 (159) -|. T Consensus 142 ~g~ 144 (360) T PRK11671 142 TGQ 144 (360) T ss_pred CCC T ss_conf 998 No 34 >PRK11162 mltA murein transglycosylase A; Provisional Probab=23.85 E-value=49 Score=14.73 Aligned_cols=29 Identities=14% Similarity=0.041 Sum_probs=21.5 Q ss_pred HHHHHHHHHHHHHCCCCCCCEECCCCCCH Q ss_conf 99999999999843555421013500387 Q gi|254780500|r 18 NKALFIIVIGLVVDYHVSYGSISGASLDM 46 (159) Q Consensus 18 ~k~i~~~l~~~~~~~c~~~~~~~G~~id~ 46 (159) .|.+...+++..+.+|.+++..+|..-++ T Consensus 3 ~~~~~~~~~~~~l~~c~~~p~~~g~qy~~ 31 (362) T PRK11162 3 VKYLLMGIVVALLAACSSKPTDRGQQYKD 31 (362) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCC T ss_conf 89999999999997604798766755557 No 35 >COG4851 CamS Protein involved in sex pheromone biosynthesis [General function prediction only] Probab=23.83 E-value=58 Score=14.27 Aligned_cols=63 Identities=11% Similarity=0.032 Sum_probs=35.1 Q ss_pred HHHHHHHHHHHCCCCCCCEECCCCCCHHHHHH-HCCCCCHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEEEECC Q ss_conf 99999999984355542101350038788864-04899999999983785212244677530488872100000 Q gi|254780500|r 20 ALFIIVIGLVVDYHVSYGSISGASLDMSAISL-VSQGSSRSHVIESLGSPSFSILHNGNRSQSFYYVSQKKKWF 92 (159) Q Consensus 20 ~i~~~l~~~~~~~c~~~~~~~G~~id~~~i~~-i~~g~tk~~V~~~LG~Ps~~~~f~~n~~~~~YYi~~k~~~~ 92 (159) .|.++-+.+++++|. ..+|...-+. -+-+.+.+|+..+.-.-.+.+-+-.- ..+|...+.+.. T Consensus 6 ~i~~ta~vliLs~C~-------~~~dd~edkv~qk~~~sseq~k~Iv~k~nis~n~Ykt---vLpyk~~kaRGl 69 (382) T COG4851 6 GIAATASVLILSGCF-------PFVDDTEDKVVQKEGKSSEQEKGIVPKANISENYYKT---VLPYKAGKARGL 69 (382) T ss_pred HHHHHHHHHHHHHCC-------CCCCCCCCHHHHCCCCCHHHCCCCCCCCCCCCCCHHE---EEEECCCCCCCH T ss_conf 589999999986116-------7668730023430377033201121354444331322---422102444550 No 36 >smart00242 MYSc Myosin. Large ATPases. ATPase; molecular motor. Muscle contraction consists of a cyclical interaction between myosin and actin. The core of the myosin structure is similar in fold to that of kinesin. Probab=23.54 E-value=59 Score=14.23 Aligned_cols=64 Identities=22% Similarity=0.363 Sum_probs=38.8 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCEEEEEEEEEECCCHHCCCCEEEEEEEEEEEECCCCCEE--EEEECCCCCCCEECCCCC Q ss_conf 999998378521224467753048887210000000101211113578999998998684--321104345614034541 Q gi|254780500|r 59 SHVIESLGSPSFSILHNGNRSQSFYYVSQKKKWFPVKFLSPKIMEYIVLKITFGEKGVVS--SVSMERLPNRRFNPNPHT 136 (159) Q Consensus 59 ~~V~~~LG~Ps~~~~f~~n~~~~~YYi~~k~~~~~~af~~pkvier~Vl~I~FD~~g~V~--nI~~~~L~dg~~~~~~~~ 136 (159) +-|++.+|...|. -|+| ++| |. +.+.+.||+.|.+. .|+.|-||--|++. T Consensus 137 npiLEAFGNAkT~--~N~N--------SSR-------FG-------K~~~l~f~~~g~i~ga~i~~yLLEksRv~~---- 188 (677) T smart00242 137 NPILEAFGNAKTV--RNNN--------SSR-------FG-------KFIEIHFDAKGKIVGAKIETYLLEKSRVVS---- 188 (677) T ss_pred HHHHHHHCCCCCC--CCCC--------CCC-------CC-------EEEEEEECCCCCCCCEEEEEEECCCCEEEE---- T ss_conf 0499873056456--7987--------475-------00-------789999889995740488984032254443---- Q ss_pred CCCCCCCHH--HHHHHHHC Q ss_conf 588885224--38999853 Q gi|254780500|r 137 IPAPIQTAD--GFLRRLLN 153 (159) Q Consensus 137 ~~~~~~~~~--~fl~~l~~ 153 (159) .+..|. -+|-||+. T Consensus 189 ---~~~gErNfHIFYqLl~ 204 (677) T smart00242 189 ---QAKGERNYHIFYQLLA 204 (677) T ss_pred ---CCCCCCCEEEHHHHHC T ss_conf ---2798764022999983 No 37 >pfam10566 Glyco_hydro_97 Glycoside hydrolase 97. This is the 97th family of glycosidases, in this case bacterial. The central part of the GH97 family protein sequences represents a typical and complete (beta/alpha)8-barrel or catalytic TIM-barrel type domain. The N- and C-terminal parts of the sequences, mainly consisting of beta-strands, most probably form two additional non-catalytic domains with as yet unknown functions. The non-catalytic domains of glycosidases from the alpha-galactosidase and alpha-glucosidase superfamilies are also predominantly composed of beta-strands, and at least some of these domains are involved in oligomerisation and carbohydrate binding. In all known glycosidases with the (beta-alpha)8-barrel fold, the amino acid residues at the active site are located on the C-termini of the beta-strands. Probab=22.47 E-value=54 Score=14.47 Aligned_cols=18 Identities=0% Similarity=-0.175 Sum_probs=6.4 Q ss_pred HHHHHHHHHCCCCCCCEE Q ss_conf 999999984355542101 Q gi|254780500|r 22 FIIVIGLVVDYHVSYGSI 39 (159) Q Consensus 22 ~~~l~~~~~~~c~~~~~~ 39 (159) +.+++++++++++..++. T Consensus 9 ~~~~~~~~~~~~~~~k~~ 26 (643) T pfam10566 9 LAFLLLIGNLSLAARKNY 26 (643) T ss_pred HHHHHHHHHHHHHHCCEE T ss_conf 999999976567541623 No 38 >PRK12407 flgH flagellar basal body L-ring protein; Reviewed Probab=22.28 E-value=51 Score=14.62 Aligned_cols=20 Identities=15% Similarity=-0.031 Sum_probs=15.0 Q ss_pred HHHHHHHHHHHHCCCCCCCE Q ss_conf 99999999998435554210 Q gi|254780500|r 19 KALFIIVIGLVVDYHVSYGS 38 (159) Q Consensus 19 k~i~~~l~~~~~~~c~~~~~ 38 (159) |+++++++++.+++|+.... T Consensus 2 r~l~l~~~~l~LsGC~~~~~ 21 (220) T PRK12407 2 RFLILTPMVLALCGCESPAL 21 (220) T ss_pred CCHHHHHHHHHHHCCCCCCC T ss_conf 15799999999736588887 No 39 >PRK11627 hypothetical protein; Provisional Probab=21.81 E-value=64 Score=14.02 Aligned_cols=20 Identities=20% Similarity=0.168 Sum_probs=13.8 Q ss_pred HHHHHHHHHHHHCCCCCCCE Q ss_conf 99999999998435554210 Q gi|254780500|r 19 KALFIIVIGLVVDYHVSYGS 38 (159) Q Consensus 19 k~i~~~l~~~~~~~c~~~~~ 38 (159) |.++.++..+.+.+|.+.++ T Consensus 4 k~~~~l~a~~lL~gCas~p~ 23 (192) T PRK11627 4 KILFPLVALFMLAGCATPPT 23 (192) T ss_pred HHHHHHHHHHHHHHHCCCCC T ss_conf 79999999999986069976 No 40 >TIGR00752 slp outer membrane lipoprotein, Slp family; InterPro: IPR004658 Slp superfamily members are present in the Gram-negative gamma proteobacteria Escherichia coli (which also contains a close paralog), Haemophilus influenzae and Pasteurella multocida and Vibrio cholerae. The known members of the family to date share a motif LX[GA]C near the N-terminus, which is compatible with the possibility that the protein is modified into a lipoprotein with Cys as the new N-terminus. Slp from E. coli is known to be a lipoprotein of the outer membrane and to be expressed in response to carbon starvation.; GO: 0019867 outer membrane. Probab=21.15 E-value=66 Score=13.94 Aligned_cols=26 Identities=15% Similarity=0.321 Sum_probs=20.1 Q ss_pred HHHHHHHHHHHHHCCCCCCCEE-CCCC Q ss_conf 9999999999984355542101-3500 Q gi|254780500|r 18 NKALFIIVIGLVVDYHVSYGSI-SGAS 43 (159) Q Consensus 18 ~k~i~~~l~~~~~~~c~~~~~~-~G~~ 43 (159) .|-++++-+|+.|++|++.+.. +|+. T Consensus 3 ~~g~l~~~l~f~L~gC~~~P~~ikg~~ 29 (190) T TIGR00752 3 KKGLLITALAFLLTGCIAVPKAIKGER 29 (190) T ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 014568889988753146742345567 No 41 >TIGR03525 GldK gliding motility-associated lipoprotein GldK. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. There is a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=21.12 E-value=54 Score=14.44 Aligned_cols=54 Identities=19% Similarity=0.191 Sum_probs=27.2 Q ss_pred HHHHHHHHHHHHCCCCCCCEECCCCCCHHHH----------HHHCCC-----CCHHHHHHHHCCCCCCCCC Q ss_conf 9999999999843555421013500387888----------640489-----9999999983785212244 Q gi|254780500|r 19 KALFIIVIGLVVDYHVSYGSISGASLDMSAI----------SLVSQG-----SSRSHVIESLGSPSFSILH 74 (159) Q Consensus 19 k~i~~~l~~~~~~~c~~~~~~~G~~id~~~i----------~~i~~g-----~tk~~V~~~LG~Ps~~~~f 74 (159) |+++++++..++.||-+. -+|-++-.+.- -.|..| .+.+|+....=.|+-+-.. T Consensus 3 k~~~~~~~~~~~~SCg~~--~~GELvGv~~~~w~~~~PyGMV~IPgGsF~MG~sDdDv~~~~~ap~k~VtV 71 (449) T TIGR03525 3 KYLVFAALVVLVYSCGSG--DKGELVGVEGKKWYPEKPYGMVLVPGGSFIMGKSDEDIAGVMNAPTKTVTV 71 (449) T ss_pred HHHHHHHHHHHHHHCCCC--CCCEEEEECCCCCCCCCCCEEEEECCCEEECCCCCCCCCCCCCCCCCEEEE T ss_conf 248999999987210589--971387314875578999835896685676688753433545588705888 No 42 >PRK03002 prsA peptidylprolyl isomerase; Reviewed Probab=21.08 E-value=58 Score=14.26 Aligned_cols=30 Identities=7% Similarity=0.086 Sum_probs=15.5 Q ss_pred HHHHHHHHHHHCCCCCC------CEECCCCCCHHHH Q ss_conf 99999999984355542------1013500387888 Q gi|254780500|r 20 ALFIIVIGLVVDYHVSY------GSISGASLDMSAI 49 (159) Q Consensus 20 ~i~~~l~~~~~~~c~~~------~~~~G~~id~~~i 49 (159) ++..++..+.+++|.+. -+++|.-+....+ T Consensus 8 ~~~~~~svl~LaaC~~~~~~~~VAtv~g~~IT~~df 43 (285) T PRK03002 8 IITALISILMLSACGQKNSSATVATATDSTITKSDF 43 (285) T ss_pred HHHHHHHHHHHHHHCCCCCCCEEEEECCCEECHHHH T ss_conf 999999999999845799997799868978739999 No 43 >pfam11359 gpUL132 Glycoprotein UL132. Glycoprotein UL132 is a low-abundance structural component of Human cytomegalovirus (HCMV). The function of this protein is not fully understood. Probab=20.16 E-value=69 Score=13.81 Aligned_cols=105 Identities=14% Similarity=0.136 Sum_probs=48.1 Q ss_pred HHHHHHHHHHHCCCCCCCEECCCCCCHHHHHHHCC------------CCCHHHHHHHHCCCCCC-CCCCCCCCCE-EEEE Q ss_conf 99999999984355542101350038788864048------------99999999983785212-2446775304-8887 Q gi|254780500|r 20 ALFIIVIGLVVDYHVSYGSISGASLDMSAISLVSQ------------GSSRSHVIESLGSPSFS-ILHNGNRSQS-FYYV 85 (159) Q Consensus 20 ~i~~~l~~~~~~~c~~~~~~~G~~id~~~i~~i~~------------g~tk~~V~~~LG~Ps~~-~~f~~n~~~~-~YYi 85 (159) ..++.+.+.+.++|+-+.-.+--+-|++...+|.- |+-|.|+-+ |+||.. -.++..-|+- |--- T Consensus 65 s~llVl~a~ly~sc~~r~gr~~~fsd~E~a~Ll~~~dd~g~sspfgsgSRRgqipa--gssSs~yqrL~~~D~de~~~~~ 142 (235) T pfam11359 65 SFLLVLVAVLYSSCHTRPGRKTKFSDQEGAKLLDDTDDSGGSSPFGSGSRRGQIPA--GSSSSKYQRLNKGDYDELWQSQ 142 (235) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCHHHHHHHH T ss_conf 99999998777766417886556544777764225554566466777754565667--8886545677654466777765 Q ss_pred EEEEECCCHHCCCCEEEEEEEEEEEECCCCCEEE-EEECCCCCCCEECCCCCC Q ss_conf 2100000001012111135789999989986843-211043456140345415 Q gi|254780500|r 86 SQKKKWFPVKFLSPKIMEYIVLKITFGEKGVVSS-VSMERLPNRRFNPNPHTI 137 (159) Q Consensus 86 ~~k~~~~~~af~~pkvier~Vl~I~FD~~g~V~n-I~~~~L~dg~~~~~~~~~ 137 (159) +.. +..+. -.|+ =+|+|+++|.+.. .- +-.=|+=.|+...- T Consensus 143 saa--re~m~-~Dp~------NViYF~KdGnLdTSFV--NPnYG~gSpmtIES 184 (235) T pfam11359 143 SAA--RETMD-DDPD------NVIYFDKDGNLDTSFV--NPNYGHGSPMTIES 184 (235) T ss_pred HHH--HHHCC-CCCC------CEEEECCCCCCCCCEE--CCCCCCCCCEEEEC T ss_conf 554--54203-7966------3688815887322001--77778899606641 Done!