Query gi|254780199|ref|YP_003064612.1| outer membrane lipoprotein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 160 No_of_seqs 20 out of 22 Neff 2.5 Searched_HMMs 39220 Date Mon May 23 13:08:13 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780199.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR03066 Gem_osc_para_1 Gemma 92.0 1 2.7E-05 24.7 8.6 104 23-128 4-110 (111) 2 COG3056 Uncharacterized lipopr 87.5 0.39 9.8E-06 27.3 2.1 38 4-41 11-48 (204) 3 PRK10477 outer membrane lipopr 82.2 3.1 8E-05 21.8 4.8 48 9-60 4-56 (177) 4 KOG1357 consensus 66.8 4.1 0.0001 21.1 2.1 50 34-86 100-150 (519) 5 PRK12788 flgH flagellar basal 56.5 14 0.00035 17.8 3.3 31 9-39 1-32 (231) 6 TIGR02262 benz_CoA_lig benzoat 54.5 3.9 0.0001 21.2 0.3 43 40-86 375-417 (520) 7 PRK10397 hypothetical protein; 52.5 14 0.00035 17.8 2.8 39 16-55 8-49 (137) 8 PRK11627 hypothetical protein; 50.3 15 0.00039 17.6 2.7 128 10-138 4-158 (192) 9 TIGR02299 HpaE 5-carboxymethyl 48.6 4.9 0.00012 20.6 -0.0 27 43-77 5-31 (494) 10 TIGR00673 cynS cyanate hydrata 45.8 8.2 0.00021 19.2 0.8 21 129-151 74-94 (156) 11 pfam05055 DUF677 Protein of un 39.6 7.3 0.00019 19.5 -0.3 35 14-49 194-228 (336) 12 COG0033 Pgm Phosphoglucomutase 36.4 29 0.00075 15.8 2.4 33 40-74 459-491 (524) 13 PRK00022 lolB outer membrane l 34.0 37 0.00095 15.2 8.6 99 16-125 10-111 (203) 14 pfam06679 DUF1180 Protein of u 29.2 45 0.0011 14.7 3.7 44 26-69 39-86 (163) 15 TIGR00104 TIGR00104 conserved 28.9 21 0.00054 16.7 0.6 15 59-73 87-101 (152) 16 PRK12700 flgH flagellar basal 28.2 39 0.00098 15.1 1.9 34 3-38 3-38 (230) 17 cd02858 Esterase_N_term Estera 27.5 48 0.0012 14.5 3.4 42 36-82 13-56 (85) 18 PRK10802 peptidoglycan-associa 26.1 34 0.00087 15.4 1.3 27 1-29 1-27 (173) 19 COG3017 LolB Outer membrane li 25.2 53 0.0013 14.3 6.1 85 9-103 7-95 (206) 20 PRK00249 flgH flagellar basal 24.6 54 0.0014 14.2 3.2 27 9-35 4-30 (231) 21 COG3364 Zn-ribbon containing p 24.5 55 0.0014 14.2 2.7 26 79-104 63-88 (112) 22 pfam06085 Rz1 Lipoprotein Rz1 24.3 55 0.0014 14.2 3.0 19 12-30 8-26 (59) 23 TIGR01004 PulS_OutS lipoprotei 22.5 60 0.0015 14.0 2.5 32 4-36 3-35 (136) 24 pfam08194 DIM DIM protein. Dro 20.3 49 0.0013 14.5 1.2 24 5-28 2-25 (37) No 1 >TIGR03066 Gem_osc_para_1 Gemmata obscuriglobus paralogous family TIGR03066. This model represents an uncharacterized paralogous family in Gemmata obscuriglobus UQM 2246, a member of the Planctomycetes. This family shows sequence similarity to TIGR03067, which is also found in Gemmata obscuriglobus as well as in a few other species. Probab=91.98 E-value=1 Score=24.67 Aligned_cols=104 Identities=21% Similarity=0.330 Sum_probs=67.0 Q ss_pred HHCCCCCCCCCCCCCCCCCCCEEEEEC---CCCEEEEEEECCEEEEECCCCCCCEEEEEEEEEECCCEEEEEEEECCCCC Q ss_conf 405777656653337676562025726---89814468608789984257777466544688516857999997200388 Q gi|254780199|r 23 SCSALSVVVDSSELLPEPMDITGYWED---DNGILSYFQKDGKFKTISTDGSSSVLATGSYHVKINQDVEIKLTSLIRNT 99 (160) Q Consensus 23 ac~~~~~~~~~~~~~P~P~gVeG~W~D---~NGivSsf~~~G~F~TrstDGTn~~LA~GTY~~~~~~~VEIn~tSlvR~T 99 (160) +|.+....+...+-. -|.-+=|.|.. .-..+--|.++|++.-+-.++-....++|||...++. +-.++-+.-..- T Consensus 4 a~~~~~~a~~~~ek~-Da~kLvGKW~~~~~~~~vviEF~kdGkl~v~~~~~Gk~vk~~GTYkl~G~k-Lt~~l~~gg~e~ 81 (111) T TIGR03066 4 ALAGGGRADDKDEKI-DAKKLVGKWETSKTKDDVVIEFAKDGKLVVTIGEKGKEVKADGTYKLDGNK-LTLTLKAGGKEK 81 (111) T ss_pred EECCCCCCCCCCCCC-CHHHHEEEEEECCCCCEEEEEEECCCEEEEEECCCCEEEEECCEEEEECCE-EEEEEECCCEEE T ss_conf 981455368864235-877933541617877507999914987999975698389835479974578-789980587062 Q ss_pred CEEEEEEEECCCEEEEECCCCCEEEEEEE Q ss_conf 30589996063436676067987999973 Q gi|254780199|r 100 SGKIQCQFLDSNKLNCLAKDQKQFYLRRT 128 (160) Q Consensus 100 ~skVnC~L~spsQLnCTS~sG~QF~L~Rt 128 (160) ..-+.-....-.+|--....|..-.|.|- T Consensus 82 k~tlTv~kltd~elv~~d~~gk~~t~~Rv 110 (111) T TIGR03066 82 KETLTVKKLTDDELVGKDPDGKKDTLKRV 110 (111) T ss_pred EEEEEEEEECCCCEEEECCCCCHHHEEEC T ss_conf 24789997136425876899980340562 No 2 >COG3056 Uncharacterized lipoprotein [Cell envelope biogenesis, outer membrane] Probab=87.45 E-value=0.39 Score=27.30 Aligned_cols=38 Identities=24% Similarity=0.371 Sum_probs=31.7 Q ss_pred CEEEEHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 03551256899999998764057776566533376765 Q gi|254780199|r 4 RCFLSLSFLFPLVAAIAVVSCSALSVVVDSSELLPEPM 41 (160) Q Consensus 4 r~~lsl~~~~~Lv~alav~ac~~~~~~~~~~~~~P~P~ 41 (160) |+.+-|.++|+|++.+.++.|++++--....|-+|-|. T Consensus 11 ~k~~t~k~L~~laa~~lLagC~a~~~tl~~tpp~~~~s 48 (204) T COG3056 11 SKNMTKKILFPLAAIFLLAGCAAPPTTLIVTPPAPLPS 48 (204) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCC T ss_conf 17889999999999999975379975031369976644 No 3 >PRK10477 outer membrane lipoprotein Blc; Provisional Probab=82.17 E-value=3.1 Score=21.77 Aligned_cols=48 Identities=23% Similarity=0.354 Sum_probs=32.3 Q ss_pred HHHHHHHHHHHHHHHHCCCCCC-----CCCCCCCCCCCCCEEEEECCCCEEEEEEEC Q ss_conf 2568999999987640577765-----665333767656202572689814468608 Q gi|254780199|r 9 LSFLFPLVAAIAVVSCSALSVV-----VDSSELLPEPMDITGYWEDDNGILSYFQKD 60 (160) Q Consensus 9 l~~~~~Lv~alav~ac~~~~~~-----~~~~~~~P~P~gVeG~W~D~NGivSsf~~~ 60 (160) |.++.+|++++.+++|++..++ +.+.++ .---|+|-----+.+.|++| T Consensus 4 ~~~~~~~~~~~~~~~c~~~~~p~~~~~V~~~Dl----~RYmG~WYEIAr~P~~fe~g 56 (177) T PRK10477 4 LPVVAAVTAAFLVVACSSPTPPKGVTVVNNFDA----KRYLGTWYEIARFDHRFERG 56 (177) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCH----HHCCEEEEEEEECCCHHHCC T ss_conf 789999999999976469999998841673474----46488028997048668878 No 4 >KOG1357 consensus Probab=66.77 E-value=4.1 Score=21.08 Aligned_cols=50 Identities=22% Similarity=0.273 Sum_probs=37.6 Q ss_pred CCCCCCCC-CCEEEEECCCCEEEEEEECCEEEEECCCCCCCEEEEEEEEEECCC Q ss_conf 33376765-620257268981446860878998425777746654468851685 Q gi|254780199|r 34 SELLPEPM-DITGYWEDDNGILSYFQKDGKFKTISTDGSSSVLATGSYHVKINQ 86 (160) Q Consensus 34 ~~~~P~P~-gVeG~W~D~NGivSsf~~~G~F~TrstDGTn~~LA~GTY~~~~~~ 86 (160) -+.--||. ||-|+|+|---.+|.-. |-.| .++.+ +...|--|+|+|++=. T Consensus 100 ~Dc~~RPi~~VPG~~v~l~dR~S~D~-n~tf-k~tG~-~~~~lNlgSYNYLGFa 150 (519) T KOG1357 100 RDCFERPICSVPGAWVDLMDRVSDDY-NWTF-KYTGT-RSRCLNLGSYNYLGFA 150 (519) T ss_pred HHHHCCCCCCCCCCEEEEEEEECCCC-CCEE-EECCC-EEEEEEECCCCCCCCC T ss_conf 66533887689875688332120478-8558-84364-4665420332233423 No 5 >PRK12788 flgH flagellar basal body L-ring protein; Reviewed Probab=56.54 E-value=14 Score=17.83 Aligned_cols=31 Identities=16% Similarity=0.448 Sum_probs=20.8 Q ss_pred HHHHHHHHHHHHHHHHCCCCC-CCCCCCCCCC Q ss_conf 256899999998764057776-5665333767 Q gi|254780199|r 9 LSFLFPLVAAIAVVSCSALSV-VVDSSELLPE 39 (160) Q Consensus 9 l~~~~~Lv~alav~ac~~~~~-~~~~~~~~P~ 39 (160) |-++..+.++++++.|.+... +++.+++.|- T Consensus 1 mr~l~~~~a~l~LsGCa~~~~~~~~~P~~sPv 32 (231) T PRK12788 1 MRLLVAILACLALAGCANNLADALQAPRMSPI 32 (231) T ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC T ss_conf 90899999999976403874336789987888 No 6 >TIGR02262 benz_CoA_lig benzoate-CoA ligase family; InterPro: IPR011957 Characterised members of this protein family include benzoate-CoA ligase, 4-hydroxybenzoate-CoA ligase, 2-aminobenzoate-CoA ligase, etc. Members are related to fatty acid and acetate CoA ligases.. Probab=54.52 E-value=3.9 Score=21.15 Aligned_cols=43 Identities=26% Similarity=0.452 Sum_probs=35.6 Q ss_pred CCCCEEEEECCCCEEEEEEECCEEEEECCCCCCCEEEEEEEEEECCC Q ss_conf 65620257268981446860878998425777746654468851685 Q gi|254780199|r 40 PMDITGYWEDDNGILSYFQKDGKFKTISTDGSSSVLATGSYHVKINQ 86 (160) Q Consensus 40 P~gVeG~W~D~NGivSsf~~~G~F~TrstDGTn~~LA~GTY~~~~~~ 86 (160) |--=+|+|....-.-.+|. |-. ||+=| +=-+=++|.|+|-+=+ T Consensus 375 PSSA~~YWn~R~Ksr~TF~--G~W-TrsGD-KY~r~~dG~YtYaGR~ 417 (520) T TIGR02262 375 PSSAAGYWNNREKSRATFE--GEW-TRSGD-KYVRNDDGVYTYAGRT 417 (520) T ss_pred CCCCCCCCCCHHHHHCCCC--CCC-CCCCC-CEEECCCCEEEEECCC T ss_conf 7545567988146132378--895-22487-1036378506870244 No 7 >PRK10397 hypothetical protein; Provisional Probab=52.54 E-value=14 Score=17.82 Aligned_cols=39 Identities=21% Similarity=0.569 Sum_probs=25.3 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEECC---CCEEE Q ss_conf 99998764057776566533376765620257268---98144 Q gi|254780199|r 16 VAAIAVVSCSALSVVVDSSELLPEPMDITGYWEDD---NGILS 55 (160) Q Consensus 16 v~alav~ac~~~~~~~~~~~~~P~P~gVeG~W~D~---NGivS 55 (160) .++++++-|...+ --..+--.|.|+|.+|+|-.. .+.+| T Consensus 8 g~~l~LsGCa~v~-nY~~vVk~PaPagl~GyWqs~gPQs~lvS 49 (137) T PRK10397 8 GALMLLAGCAEVE-NYNNVVKTPAPAGLAGYWQSKGPQRALVS 49 (137) T ss_pred HHHHHHHCCCCCC-CHHHHHCCCCCCCCCEEEECCCCCCCCCC T ss_conf 8999973532441-06656448997332304452587233369 No 8 >PRK11627 hypothetical protein; Provisional Probab=50.31 E-value=15 Score=17.56 Aligned_cols=128 Identities=17% Similarity=0.333 Sum_probs=63.3 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC---CCEEEE-----EC---CCCEEEEEEECCEEEEECC-CCCC----- Q ss_conf 56899999998764057776566533376765---620257-----26---8981446860878998425-7777----- Q gi|254780199|r 10 SFLFPLVAAIAVVSCSALSVVVDSSELLPEPM---DITGYW-----ED---DNGILSYFQKDGKFKTIST-DGSS----- 72 (160) Q Consensus 10 ~~~~~Lv~alav~ac~~~~~~~~~~~~~P~P~---gVeG~W-----~D---~NGivSsf~~~G~F~Trst-DGTn----- 72 (160) .+++||.+.+.+++|+++|--..-.|.++-|+ .+.|.= +| .+- +..+.++|.-++-.+ ..-. T Consensus 4 k~~~~l~a~~lL~gCas~p~~l~l~P~~~~~~~~~~~~~~~vsv~s~D~R~~~~-va~~~~~g~~~~L~~s~~l~~~lq~ 82 (192) T PRK11627 4 KILFPLVALFMLAGCATPPTTLEVSPKITLPQQDPSLMGVTVSINGADQRQDQA-LAKVTRDNQLVTLTPSRDLRFLLQE 82 (192) T ss_pred HHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCCCCCCCEEEEEEEEECCCCCE-EEEECCCCCEEEECCCCCHHHHHHH T ss_conf 799999999999860699763786788766434554577059998775366532-6798069946770478688999999 Q ss_pred ---CEEEEEEEEEEC--CCEEEEEEEECCCC-CCEEEEEEEECCCEEEEE--CCCCCEEEE--EEEEEECCCCCCC Q ss_conf ---466544688516--85799999720038-830589996063436676--067987999--9733210777786 Q gi|254780199|r 73 ---SVLATGSYHVKI--NQDVEIKLTSLIRN-TSGKIQCQFLDSNKLNCL--AKDQKQFYL--RRTHLTELPSSKP 138 (160) Q Consensus 73 ---~~LA~GTY~~~~--~~~VEIn~tSlvR~-T~skVnC~L~spsQLnCT--S~sG~QF~L--~Rt~~~~~~~~~~ 138 (160) ..|.+--|.... +..+++.+--+.-+ +++.+.-.+.+..++.-+ +..|.+|+= +++--.|-|-+-- T Consensus 83 ~l~~ql~sqG~~i~~~a~~~l~v~i~~l~a~V~q~~~~y~i~t~v~i~I~v~~~ng~k~tK~y~~~~~~eG~lsa~ 158 (192) T PRK11627 83 VLEKQMTARGYMIGPNGAVNLQIIVNQLYADVSQGNVRYNIATKADISIIATAQNGNKMTKNYRASYNVEGAFQAS 158 (192) T ss_pred HHHHHHHHCCEEECCCCCCEEEEEHHHHCCCCCCCEEEEEECCEEEEEEEEECCCCCEEEEEEECCEECCCCCCCC T ss_conf 9999998588171588762599996893160255504777524799999998378977778872322115677888 No 9 >TIGR02299 HpaE 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase; InterPro: IPR011985 This entry represents the dehydrogenase responsible for the conversion of 5-carboxymethyl-2-hydroxymuconate semialdehyde to 5-carboxymethyl-2-hydroxymuconate (a tricarboxylic acid). This is the step in the degradation of 4-hydroxyphenylacetic acid via homoprotocatechuate following the oxidative opening of the aromatic ring .. Probab=48.56 E-value=4.9 Score=20.60 Aligned_cols=27 Identities=30% Similarity=0.485 Sum_probs=21.4 Q ss_pred CEEEEECCCCEEEEEEECCEEEEECCCCCCCEEEE Q ss_conf 20257268981446860878998425777746654 Q gi|254780199|r 43 ITGYWEDDNGILSYFQKDGKFKTISTDGSSSVLAT 77 (160) Q Consensus 43 VeG~W~D~NGivSsf~~~G~F~TrstDGTn~~LA~ 77 (160) |+|+|++.. ++++|++..+- +|++|++ T Consensus 5 I~Ge~v~~~-------~getF~~~~P~-~~e~L~~ 31 (494) T TIGR02299 5 IDGEFVPSE-------SGETFETLSPA-TNEVLAS 31 (494) T ss_pred CCCEEEECC-------CCCCCCCCCCC-CCCHHHH T ss_conf 056454267-------78848888876-6202465 No 10 >TIGR00673 cynS cyanate hydratase; InterPro: IPR008076 Cyanase, an enzyme found in bacteria and plants, catalyses the reaction of cyanate with bicarbonate to produce ammonia and carbon dioxide, allowing the host organisms to overcome the toxicity of environmental cyanate . The cyanate lyase monomer is composed of two domains: an N-terminal domain that shows structural similarity to the DNA-binding alpha-helix bundle motif, and a C-terminal domain that has an 'open fold' that shows no structural similarity to other proteins . The enzyme is active as a homodecamer of 17kDa subunits, and displays half-site binding of substrates or substrate analogues. The dimer structure reveals the C-terminal domains to be intertwined; the decamer is formed from a pentamer of these dimers. The active site of the enzyme is located between dimers and comprises residues from four adjacent subunits of the homodecamer. Synonym(s): cyanase lyase, cyanase hydrolase ; GO: 0016836 hydro-lyase activity, 0008152 metabolic process. Probab=45.83 E-value=8.2 Score=19.21 Aligned_cols=21 Identities=33% Similarity=0.608 Sum_probs=16.5 Q ss_pred EEECCCCCCCCHHHHHCCCCCCC Q ss_conf 32107777860015437778888 Q gi|254780199|r 129 HLTELPSSKPQDDVAEIPEDPTT 151 (160) Q Consensus 129 ~~~~~~~~~~~~~~~~~~~~~~~ 151 (160) .|+..|+.+|+++| +|-||+- T Consensus 74 eL~~aP~r~~~~pv--~PTDP~~ 94 (156) T TIGR00673 74 ELKDAPLREGLDPV--VPTDPLV 94 (156) T ss_pred HHHCCCCCCCCCCC--CCCCCHH T ss_conf 65406666888898--9677606 No 11 >pfam05055 DUF677 Protein of unknown function (DUF677). This family consists of AT14A like proteins from Arabidopsis thaliana. At14a has a small domain that has sequence similarities to integrins from fungi, insects and humans. Transcripts of At14a are found in all Arabidopsis tissues and localizes partly to the plasma membrane. Probab=39.65 E-value=7.3 Score=19.52 Aligned_cols=35 Identities=26% Similarity=0.418 Sum_probs=26.8 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEC Q ss_conf 999999876405777656653337676562025726 Q gi|254780199|r 14 PLVAAIAVVSCSALSVVVDSSELLPEPMDITGYWED 49 (160) Q Consensus 14 ~Lv~alav~ac~~~~~~~~~~~~~P~P~gVeG~W~D 49 (160) +|+.+ -+++|-++|++.+-+....-|-+.-|.|.+ T Consensus 194 Vli~s-vvaa~~a~p~~va~a~~~s~Pi~~~gkW~~ 228 (336) T pfam05055 194 VLVLS-VVAAAMGAPPVVGVAGALAVPLEAVGKWVG 228 (336) T ss_pred HHHHH-HHHHHCCCHHHHHHHHHHCCCCHHHHHHHH T ss_conf 99999-999973754887668876166001788999 No 12 >COG0033 Pgm Phosphoglucomutase [Carbohydrate transport and metabolism] Probab=36.42 E-value=29 Score=15.83 Aligned_cols=33 Identities=15% Similarity=0.189 Sum_probs=25.9 Q ss_pred CCCCEEEEECCCCEEEEEEECCEEEEECCCCCCCE Q ss_conf 65620257268981446860878998425777746 Q gi|254780199|r 40 PMDITGYWEDDNGILSYFQKDGKFKTISTDGSSSV 74 (160) Q Consensus 40 P~gVeG~W~D~NGivSsf~~~G~F~TrstDGTn~~ 74 (160) -.-++|+=..-+|+-..|. +|+|+.|.+ ||+.. T Consensus 459 ~~~~~Gn~s~~~GLkV~~~-ng~fa~R~S-GT~~t 491 (524) T COG0033 459 LTPAPGNGAAIGGLKVTTE-NGWFAARPS-GTEAT 491 (524) T ss_pred CCCCCCCHHHCCCEEEEEE-CCEEEEECC-CCCHH T ss_conf 2579986010176699961-758998337-76413 No 13 >PRK00022 lolB outer membrane lipoprotein LolB; Provisional Probab=33.98 E-value=37 Score=15.21 Aligned_cols=99 Identities=14% Similarity=0.240 Sum_probs=53.6 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEECCCC---EEEEEEECCEEEEECCCCCCCEEEEEEEEEECCCEEEEEE Q ss_conf 9999876405777656653337676562025726898---1446860878998425777746654468851685799999 Q gi|254780199|r 16 VAAIAVVSCSALSVVVDSSELLPEPMDITGYWEDDNG---ILSYFQKDGKFKTISTDGSSSVLATGSYHVKINQDVEIKL 92 (160) Q Consensus 16 v~alav~ac~~~~~~~~~~~~~P~P~gVeG~W~D~NG---ivSsf~~~G~F~TrstDGTn~~LA~GTY~~~~~~~VEIn~ 92 (160) .+++.+++|+++|+.-+.... ..+-.|--..- .+..|.-.|+++-+..+ ... =|.=.... .+...+|++ T Consensus 10 ~~~llLsgCat~p~~~~~~~~-----~~~~~w~~~q~~l~~l~~w~~~Gkiav~~~~-~~~-sa~f~W~q-~~~~~~i~L 81 (203) T PRK00022 10 LAALLLAGCASLPPPKGPGKS-----PTSPQWQQHQQQLQQIQQYQARGRFAYISPQ-QRV-SGRFDWQQ-QGDRYRLLL 81 (203) T ss_pred HHHHHHHHCCCCCCCCCCCCC-----CCCHHHHHHHHHHHHHHCEEEEEEEEEECCC-CEE-EEEEEEEE-CCCEEEEEE T ss_conf 999998661489998887777-----5667499999999734234884689998788-547-89999997-598169999 Q ss_pred EECCCCCCEEEEEEEECCCEEEEECCCCCEEEE Q ss_conf 720038830589996063436676067987999 Q gi|254780199|r 93 TSLIRNTSGKIQCQFLDSNKLNCLAKDQKQFYL 125 (160) Q Consensus 93 tSlvR~T~skVnC~L~spsQLnCTS~sG~QF~L 125 (160) ++-+-.+...++- ++....-...+|..++- T Consensus 82 ~~pLG~~~~~i~~---~~~~~~L~~~~g~~~~a 111 (203) T PRK00022 82 TSPLGSTELELTG---RPGGATLEDNNGQRYTA 111 (203) T ss_pred EECCCCEEEEEEE---CCCEEEEEECCCCEEEC T ss_conf 8026654999998---79979999799988867 No 14 >pfam06679 DUF1180 Protein of unknown function (DUF1180). This family consists of several hypothetical mammalian proteins of around 190 residues in length. The function of this family is unknown. Probab=29.25 E-value=45 Score=14.73 Aligned_cols=44 Identities=18% Similarity=0.101 Sum_probs=27.2 Q ss_pred CCCCCCCCCCCCCCCC-CC---EEEEECCCCEEEEEEECCEEEEECCC Q ss_conf 7776566533376765-62---02572689814468608789984257 Q gi|254780199|r 26 ALSVVVDSSELLPEPM-DI---TGYWEDDNGILSYFQKDGKFKTISTD 69 (160) Q Consensus 26 ~~~~~~~~~~~~P~P~-gV---eG~W~D~NGivSsf~~~G~F~TrstD 69 (160) ..|....+++..|-|. ++ -|.-.+.+|-.|+...+|..+|+.+- T Consensus 39 ~~P~p~~~~~~pPgPtpa~pt~~~~~~~~~G~~ss~S~gg~Lat~~s~ 86 (163) T pfam06679 39 SQPGPETQPPLPPGPTPANPTALGSGDAAGGGGSSTSDGGALATRPSP 86 (163) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCC T ss_conf 998877788998989998753305554567888756668601423688 No 15 >TIGR00104 TIGR00104 conserved hypothetical protein TIGR00104; InterPro: IPR001378 This domain had been observed is a number of proteins of archaea and bacterial origin. The function of this domain is unknown.. Probab=28.94 E-value=21 Score=16.72 Aligned_cols=15 Identities=27% Similarity=0.388 Sum_probs=10.6 Q ss_pred ECCEEEEECCCCCCC Q ss_conf 087899842577774 Q gi|254780199|r 59 KDGKFKTISTDGSSS 73 (160) Q Consensus 59 ~~G~F~TrstDGTn~ 73 (160) +-|.|+||++-==|= T Consensus 87 ~~GVFaTRsp~RPNP 101 (152) T TIGR00104 87 RRGVFATRSPHRPNP 101 (152) T ss_pred CCCEEECCCCCCCCC T ss_conf 143441358787787 No 16 >PRK12700 flgH flagellar basal body L-ring protein; Reviewed Probab=28.18 E-value=39 Score=15.11 Aligned_cols=34 Identities=18% Similarity=0.250 Sum_probs=14.4 Q ss_pred CCEEEEHHHHHHHHHHHHHHHHCCCC--CCCCCCCCCC Q ss_conf 50355125689999999876405777--6566533376 Q gi|254780199|r 3 ARCFLSLSFLFPLVAAIAVVSCSALS--VVVDSSELLP 38 (160) Q Consensus 3 ar~~lsl~~~~~Lv~alav~ac~~~~--~~~~~~~~~P 38 (160) -|.++-+.++.. ..+.++.|...+ |..+.+..+| T Consensus 3 ~~t~~Rl~~~~~--~~~ll~GCa~~~~~p~~~~P~~a~ 38 (230) T PRK12700 3 LKTVLRLPVCAA--LLALAAGCAMIPPEPVVTGPLTAP 38 (230) T ss_pred HHHHHHHHHHHH--HHHHHHCCCCCCCCCCCCCCCCCC T ss_conf 588999999999--999974446899998788988789 No 17 >cd02858 Esterase_N_term Esterase N-terminal domain. Esterases catalyze the hydrolysis of organic esters to release an alcohol or thiol and acid. The term can be applied to enzymes that hydrolyze carboxylate, phosphate and sulphate esters, but is more often restricted to the first class of substrate. The N-terminus of esterase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase. Probab=27.47 E-value=48 Score=14.54 Aligned_cols=42 Identities=21% Similarity=0.419 Sum_probs=23.0 Q ss_pred CCCCCC--CCEEEEECCCCEEEEEEECCEEEEECCCCCCCEEEEEEEEE Q ss_conf 376765--62025726898144686087899842577774665446885 Q gi|254780199|r 36 LLPEPM--DITGYWEDDNGILSYFQKDGKFKTISTDGSSSVLATGSYHV 82 (160) Q Consensus 36 ~~P~P~--gVeG~W~D~NGivSsf~~~G~F~TrstDGTn~~LA~GTY~~ 82 (160) .||.++ -|.|.|...+-..-.-..+|.++ .+++ -|+.|.|.| T Consensus 13 ~AP~A~~V~V~g~~~~~~~~~MtK~~~GvWs-~t~~----pl~pg~y~Y 56 (85) T cd02858 13 FAPKANEVQVRGSWGGAGSHPMTKDEAGVWS-VTTG----PLAPGIYTY 56 (85) T ss_pred ECCCCCEEEEEEECCCCCCCCCEECCCCEEE-EEEC----CCCCCEEEE T ss_conf 8999979999815488843054588997599-9868----879824889 No 18 >PRK10802 peptidoglycan-associated outer membrane lipoprotein; Provisional Probab=26.07 E-value=34 Score=15.45 Aligned_cols=27 Identities=26% Similarity=0.322 Sum_probs=15.9 Q ss_pred CCCCEEEEHHHHHHHHHHHHHHHHCCCCC Q ss_conf 97503551256899999998764057776 Q gi|254780199|r 1 MQARCFLSLSFLFPLVAAIAVVSCSALSV 29 (160) Q Consensus 1 ~~ar~~lsl~~~~~Lv~alav~ac~~~~~ 29 (160) |+..||+-. +...++.+++++|++.+. T Consensus 1 M~~nk~~K~--l~ia~~~l~LaACSS~~~ 27 (173) T PRK10802 1 MQLNKVLKG--LMIALPVMAIAACSSNKN 27 (173) T ss_pred CCHHHHHHH--HHHHHHHHHHHHCCCCCC T ss_conf 960789999--999999999985579998 No 19 >COG3017 LolB Outer membrane lipoprotein involved in outer membrane biogenesis [Cell envelope biogenesis, outer membrane] Probab=25.22 E-value=53 Score=14.28 Aligned_cols=85 Identities=19% Similarity=0.296 Sum_probs=49.3 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEECCC----CEEEEEEECCEEEEECCCCCCCEEEEEEEEEEC Q ss_conf 2568999999987640577765665333767656202572689----814468608789984257777466544688516 Q gi|254780199|r 9 LSFLFPLVAAIAVVSCSALSVVVDSSELLPEPMDITGYWEDDN----GILSYFQKDGKFKTISTDGSSSVLATGSYHVKI 84 (160) Q Consensus 9 l~~~~~Lv~alav~ac~~~~~~~~~~~~~P~P~gVeG~W~D~N----GivSsf~~~G~F~TrstDGTn~~LA~GTY~~~~ 84 (160) +.+..+.++.+.+++|..+....-.. .|. +-.|-... -|-+|-+ -|+|+-++.+ - ..-+.=-..+. T Consensus 7 ~~~~l~~~As~LL~aC~~~~~~~~~~----~~~--~~~Wq~hq~~l~~i~~yq~-~G~~~yis~~-q-~~s~rF~Wqq~- 76 (206) T COG3017 7 LLFLLLALASLLLTACTLTASRPPNN----SPD--SPQWQQHQQQLQEIQSYQA-RGRLAYISDQ-Q-RGSARFFWQQQ- 76 (206) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCC----CCC--CHHHHHHHHHHHHHHHHHC-CCEEEEECCC-C-CCEEEEEEEEC- T ss_conf 87899999999997565667778777----874--1667777889886465421-4237998677-6-53688999975- Q ss_pred CCEEEEEEEECCCCCCEEE Q ss_conf 8579999972003883058 Q gi|254780199|r 85 NQDVEIKLTSLIRNTSGKI 103 (160) Q Consensus 85 ~~~VEIn~tSlvR~T~skV 103 (160) +....|++++.+-.|--.+ T Consensus 77 p~~y~L~Ls~pLg~t~l~L 95 (206) T COG3017 77 PDRYRLLLSNPLGSTLLEL 95 (206) T ss_pred CCCEEEEEECCCCCCEEEE T ss_conf 8847999815567612799 No 20 >PRK00249 flgH flagellar basal body L-ring protein; Reviewed Probab=24.57 E-value=54 Score=14.21 Aligned_cols=27 Identities=7% Similarity=0.190 Sum_probs=18.6 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 256899999998764057776566533 Q gi|254780199|r 9 LSFLFPLVAAIAVVSCSALSVVVDSSE 35 (160) Q Consensus 9 l~~~~~Lv~alav~ac~~~~~~~~~~~ 35 (160) +-+...|++.+.+++|...++....+. T Consensus 4 ~~~~~~~~~~~~LsgCa~~~~~~~~p~ 30 (231) T PRK00249 4 MLIALALLLLLLLSGCASIPPKPLIPA 30 (231) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 899999999999875348999888877 No 21 >COG3364 Zn-ribbon containing protein [General function prediction only] Probab=24.50 E-value=55 Score=14.20 Aligned_cols=26 Identities=19% Similarity=0.160 Sum_probs=20.3 Q ss_pred EEEEECCCEEEEEEEECCCCCCEEEE Q ss_conf 68851685799999720038830589 Q gi|254780199|r 79 SYHVKINQDVEIKLTSLIRNTSGKIQ 104 (160) Q Consensus 79 TY~~~~~~~VEIn~tSlvR~T~skVn 104 (160) |-...+++..|||+.||.-+..-.|. T Consensus 63 tIrI~~pG~YeiNl~~Lld~~~iVva 88 (112) T COG3364 63 TIRILRPGVYEINLESLLDRDEIVVA 88 (112) T ss_pred EEEEECCCEEEEEHHHHCCCCCEEEE T ss_conf 89994496489744563067735999 No 22 >pfam06085 Rz1 Lipoprotein Rz1 precursor. This family consists of several bacteria and phage lipoprotein Rz1 precursors. Rz1 is a proline-rich lipoprotein from bacteriophage lambda which is known to have fusogenic properties. Rz1-induced liposome fusion is thought to be mediated primarily by the generation of local perturbation in the bilayer lipid membrane and to a lesser extent by electrostatic forces. Probab=24.26 E-value=55 Score=14.17 Aligned_cols=19 Identities=26% Similarity=0.457 Sum_probs=14.3 Q ss_pred HHHHHHHHHHHHHCCCCCC Q ss_conf 8999999987640577765 Q gi|254780199|r 12 LFPLVAAIAVVSCSALSVV 30 (160) Q Consensus 12 ~~~Lv~alav~ac~~~~~~ 30 (160) ++.|...|.|++|.+++++ T Consensus 8 l~~~m~~L~vsaC~S~p~v 26 (59) T pfam06085 8 LCVMMLPLVVSACSSKPPV 26 (59) T ss_pred HHHHHHHHHHHHHCCCCCC T ss_conf 9999989999982699986 No 23 >TIGR01004 PulS_OutS lipoprotein, PulS/OutS family; InterPro: IPR005699 This family comprises lipoproteins from four gamma proteobacterial species: PulS protein of Klebsiella pneumoniae (P20440 from SWISSPROT), the OutS protein of Erwinia chrysanthemi (Q01567 from SWISSPROT) and Pectobacterium chrysanthemi, and the functionally uncharacterised E. coli protein EtpO. PulS and OutS have been shown to interact with and facilitate insertion of secretins into the outer membrane, suggesting a chaperone-like, or piloting function for members of this family.; GO: 0008565 protein transporter activity, 0006886 intracellular protein transport. Probab=22.49 E-value=60 Score=13.96 Aligned_cols=32 Identities=19% Similarity=0.453 Sum_probs=23.3 Q ss_pred CEEE-EHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 0355-12568999999987640577765665333 Q gi|254780199|r 4 RCFL-SLSFLFPLVAAIAVVSCSALSVVVDSSEL 36 (160) Q Consensus 4 r~~l-sl~~~~~Lv~alav~ac~~~~~~~~~~~~ 36 (160) +-|| .|.++++|+.-+.+++|+++.|+ ++.+. T Consensus 3 ~n~lKkln~~~~~~~~v~lsgCqtPaPv-qn~~~ 35 (136) T TIGR01004 3 RNILKKLNVVLSLLVVVLLSGCQTPAPV-QNRAI 35 (136) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCC-CCCCC T ss_conf 1378999999999999986404789988-88755 No 24 >pfam08194 DIM DIM protein. Drosophila immune-induced molecules (DIMs) are short proteins induced during the immune response of Drosophila. This family includes DIMs 1 to 4 that have masses below 5 kDa. Probab=20.34 E-value=49 Score=14.48 Aligned_cols=24 Identities=33% Similarity=0.383 Sum_probs=15.8 Q ss_pred EEEEHHHHHHHHHHHHHHHHCCCC Q ss_conf 355125689999999876405777 Q gi|254780199|r 5 CFLSLSFLFPLVAAIAVVSCSALS 28 (160) Q Consensus 5 ~~lsl~~~~~Lv~alav~ac~~~~ 28 (160) |+||+.|+.-|.+.+|.+.--.+| T Consensus 2 K~lsl~~~l~LLa~La~a~pl~pG 25 (37) T pfam08194 2 KFLSLAFALGLLATLAFAVPLNPG 25 (37) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 089999999999999972579997 Done!