Query gi|254780677|ref|YP_003065090.1| hypothetical protein CLIBASIA_02820 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 152 No_of_seqs 107 out of 1165 Neff 8.3 Searched_HMMs 39220 Date Sun May 29 23:15:37 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780677.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK10724 hypothetical protein; 100.0 4.4E-41 1.4E-45 283.0 19.3 145 1-150 14-158 (158) 2 COG2867 Oligoketide cyclase/li 100.0 6.5E-40 1.7E-44 275.5 15.4 146 1-150 1-146 (146) 3 KOG3177 consensus 100.0 4.4E-32 1.1E-36 225.5 10.8 147 3-152 69-217 (227) 4 pfam03364 Polyketide_cyc Polyk 99.9 4.5E-24 1.1E-28 174.3 15.8 124 10-140 1-125 (125) 5 COG5637 Predicted integral mem 99.6 8.3E-14 2.1E-18 108.6 13.1 139 3-151 71-214 (217) 6 pfam10604 Polyketide_cyc2 Poly 99.6 8.7E-13 2.2E-17 102.2 17.1 136 2-144 1-139 (139) 7 pfam06240 COXG Carbon monoxide 98.8 2.1E-06 5.3E-11 61.4 17.5 130 7-145 2-137 (140) 8 COG3427 Carbon monoxide dehydr 98.6 1.2E-05 3.2E-10 56.4 16.3 133 3-144 2-140 (146) 9 cd00177 START START(STeroidoge 98.5 2.7E-05 7E-10 54.2 16.2 138 7-146 58-212 (212) 10 pfam01852 START START domain. 97.8 0.0022 5.7E-08 42.0 13.7 142 3-144 46-206 (207) 11 smart00234 START in StAR and p 97.7 0.0052 1.3E-07 39.6 14.3 140 3-144 46-205 (206) 12 pfam00407 Bet_v_1 Pathogenesis 97.6 0.0065 1.7E-07 39.0 17.2 114 3-118 4-124 (150) 13 COG4276 Uncharacterized conser 97.4 0.0053 1.3E-07 39.6 11.9 132 1-137 1-140 (153) 14 COG3832 Uncharacterized conser 96.3 0.1 2.6E-06 31.4 11.7 95 3-104 9-108 (149) 15 pfam10698 DUF2505 Protein of u 95.6 0.22 5.6E-06 29.3 15.2 132 4-138 1-150 (159) 16 pfam08327 AHSA1 Activator of H 94.3 0.48 1.2E-05 27.1 12.8 94 11-113 1-97 (124) 17 pfam11687 DUF3284 Domain of un 94.1 0.53 1.3E-05 26.8 11.8 106 5-127 2-117 (120) 18 KOG2761 consensus 63.7 12 0.00031 18.1 14.1 140 2-141 53-214 (219) 19 pfam08982 DUF1857 Domain of un 45.5 24 0.00061 16.2 15.8 87 13-112 19-106 (147) 20 pfam02713 DUF220 Domain of unk 34.6 18 0.00045 17.1 1.0 18 83-100 25-42 (74) 21 pfam04707 PRELI PRELI-like fam 31.4 40 0.001 14.8 13.8 124 22-145 6-153 (157) 22 KOG3849 consensus 25.7 32 0.00082 15.4 1.1 19 133-151 353-371 (386) 23 pfam02974 Inh Protease inhibit 24.9 52 0.0013 14.1 2.4 19 91-110 11-29 (98) 24 PRK05590 hypothetical protein; 22.3 34 0.00086 15.3 0.7 32 83-115 67-98 (165) 25 smart00136 LamNT Laminin N-ter 22.2 33 0.00084 15.3 0.6 17 134-150 127-143 (238) No 1 >PRK10724 hypothetical protein; Provisional Probab=100.00 E-value=4.4e-41 Score=283.02 Aligned_cols=145 Identities=24% Similarity=0.529 Sum_probs=138.8 Q ss_pred CCCEEEEEEECCCHHHHHHHHHHHHHCHHHCCCCCEEEEEECCCCCCCEEEEEEEEECCCEEEEEEEEEEECCCCCEEEE Q ss_conf 97248999835987999999875742310187520579997268888419999999623126899988861576620896 Q gi|254780677|r 1 MYHFTADRIVNHSSQQMLSLVSDIERYPEFVPLCKKVVIHERDNYGENEVLVASMTINYACMQREFMTQVRINQKEHYIA 80 (152) Q Consensus 1 M~~~~~~~~i~~~~e~i~~iV~Dve~YpeFlP~c~~~~v~~r~~~~~~~~~~a~l~v~~~~~~~~~~s~~~~~~~~~~i~ 80 (152) |++++++++|||||+|+||||+|||+||+|+|||.+++|+++++ ..+.|+++++++|+.++|+|++++.++. .|. T Consensus 14 M~~i~~s~~vpys~~qmf~LV~DVe~YpeFlPwC~~s~Il~~~~----~~~~A~L~v~~~g~~~sFtS~~~~~~~~-~I~ 88 (158) T PRK10724 14 MPQISRTALVPYSAEQMYQLVNDVQSYPQFLPGCTGSRVLESTP----GQMTAAVDVSKAGISKTFTTRNQLTSNQ-SIL 88 (158) T ss_pred CCCCEEEEECCCCHHHHHHHHHHHHHHCHHCCCCCCCEEEECCC----CEEEEEEEEEECCCEEEEEEEEEECCCC-EEE T ss_conf 77604899857899999999998732120041515278998579----8499999998135228999757843876-254 Q ss_pred EECCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 3124541021137999998799818999999985088789999999999999999999999999770899 Q gi|254780677|r 81 VKHIKNLFNFLENHWHFEEISESKCKVHFSIKYELKNRLFDMMLKAIFDPSFLSFAKAFEERAHKIYHLP 150 (152) Q Consensus 81 ~~~~~g~f~~~~g~W~f~~~~~~~t~V~~~l~fe~~~~~~~~~~~~~~~~~~~~~i~aF~~Ra~~~yg~~ 150 (152) ..+.+|||++|.|+|+|+|+++++|+|+|+++|||+|++++.+++++|+.++++||+||++||+++||+. T Consensus 89 ~~lv~GPFk~L~~~W~F~~~~~~~c~V~f~l~fef~n~ll~~~~~~~f~~v~~~mV~AF~~RA~~lY~~r 158 (158) T PRK10724 89 MQLVDGPFKKLIGGWKFTPLSQEACRIEFHLDFEFTNKLIELAFGRVFKELASNMVQAFTVRAKEVYSAR 158 (158) T ss_pred EEECCCCHHHEEEEEEEEECCCCCEEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 5554797022057789997799958999999999786999999999999999999999999999974889 No 2 >COG2867 Oligoketide cyclase/lipid transport protein [Lipid metabolism] Probab=100.00 E-value=6.5e-40 Score=275.54 Aligned_cols=146 Identities=32% Similarity=0.648 Sum_probs=141.6 Q ss_pred CCCEEEEEEECCCHHHHHHHHHHHHHCHHHCCCCCEEEEEECCCCCCCEEEEEEEEECCCEEEEEEEEEEECCCCCEEEE Q ss_conf 97248999835987999999875742310187520579997268888419999999623126899988861576620896 Q gi|254780677|r 1 MYHFTADRIVNHSSQQMLSLVSDIERYPEFVPLCKKVVIHERDNYGENEVLVASMTINYACMQREFMTQVRINQKEHYIA 80 (152) Q Consensus 1 M~~~~~~~~i~~~~e~i~~iV~Dve~YpeFlP~c~~~~v~~r~~~~~~~~~~a~l~v~~~~~~~~~~s~~~~~~~~~~i~ 80 (152) |++++.+++|+|||++||++|+||++||+|+|||.+++|+++++. .+.|+++|++++++++|+|+|++.++.+.|. T Consensus 1 m~~~~~s~lv~y~a~~mF~LV~dV~~YP~FlP~C~~s~v~~~~~~----~l~A~l~V~~k~i~e~F~Trv~~~~~~~~I~ 76 (146) T COG2867 1 MPQIERTALVPYSASQMFDLVNDVESYPEFLPWCSASRVLERNER----ELIAELDVGFKGIRETFTTRVTLKPTARSID 76 (146) T ss_pred CCEEEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCCCEEECCCCC----EEEEEEEEEHHHEEEEEEEEEEECCCHHHHH T ss_conf 970676230368989999999999857301365243267614852----1367778782011015663005357522000 Q ss_pred EECCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 3124541021137999998799818999999985088789999999999999999999999999770899 Q gi|254780677|r 81 VKHIKNLFNFLENHWHFEEISESKCKVHFSIKYELKNRLFDMMLKAIFDPSFLSFAKAFEERAHKIYHLP 150 (152) Q Consensus 81 ~~~~~g~f~~~~g~W~f~~~~~~~t~V~~~l~fe~~~~~~~~~~~~~~~~~~~~~i~aF~~Ra~~~yg~~ 150 (152) +...+|||+.|.|.|+|+|+++++|+|+|.|+|||+|++++++++++|+.++++|++||++||+++||.+ T Consensus 77 m~l~~GPFk~L~~~W~F~pl~~~~ckV~f~ldfeF~s~ll~~~~g~~f~~~a~~mv~aF~kRA~~~y~~~ 146 (146) T COG2867 77 MKLIDGPFKYLKGGWQFTPLSEDACKVEFFLDFEFKSRLLGALIGPVFKRLASKMVEAFEKRAKEVYGLA 146 (146) T ss_pred HHHHCCCHHHHCCCEEEEECCCCCEEEEEEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 2321476566357157887798846899999764516889999999999999999999999998861589 No 3 >KOG3177 consensus Probab=99.98 E-value=4.4e-32 Score=225.50 Aligned_cols=147 Identities=29% Similarity=0.501 Sum_probs=140.0 Q ss_pred CEEEEEEECCCHHHHHHHHHHHHHCHHHCCCCCEEEEEECCCCCCCEEEEEEEEECCCEEEEEEEEEEECCCCCEEEEEE Q ss_conf 24899983598799999987574231018752057999726888841999999962312689998886157662089631 Q gi|254780677|r 3 HFTADRIVNHSSQQMLSLVSDIERYPEFVPLCKKVVIHERDNYGENEVLVASMTINYACMQREFMTQVRINQKEHYIAVK 82 (152) Q Consensus 3 ~~~~~~~i~~~~e~i~~iV~Dve~YpeFlP~c~~~~v~~r~~~~~~~~~~a~l~v~~~~~~~~~~s~~~~~~~~~~i~~~ 82 (152) -|+++++|+|||++||++|+||+.|.+|+|||+++.|+++.|+ ...+|+|.|||+++.++|++.++++++..+.... T Consensus 69 ~Y~errligysp~~my~vVS~V~~Y~~FVPwC~kS~V~~~~P~---~~~kA~LeVGFk~l~E~y~S~Vt~~~p~l~kt~~ 145 (227) T KOG3177 69 AYSERRLIGYSPSEMYSVVSNVSEYHEFVPWCKKSDVTSRRPS---GPLKADLEVGFKPLDERYTSNVTCVKPHLTKTVC 145 (227) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCEECCCEEECCCC---CCCEEEEEECCCCCCHHHEEEEEEECCCCEEEEE T ss_conf 8898761088989999999868876144641332320232799---9820227843742043442013873654167763 Q ss_pred CCCCCEEEEEEEEEEEECC--CCCEEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 2454102113799999879--981899999998508878999999999999999999999999977089999 Q gi|254780677|r 83 HIKNLFNFLENHWHFEEIS--ESKCKVHFSIKYELKNRLFDMMLKAIFDPSFLSFAKAFEERAHKIYHLPSL 152 (152) Q Consensus 83 ~~~g~f~~~~g~W~f~~~~--~~~t~V~~~l~fe~~~~~~~~~~~~~~~~~~~~~i~aF~~Ra~~~yg~~~~ 152 (152) .+++.|++|.+.|+|+|.. ++.|.|+|.++|||++++++.++..+|+.+++.|++||++||+.+||+|+. T Consensus 146 ~d~rLF~~L~t~Wsf~pg~~~p~tc~ldf~v~FeF~S~lhs~va~~Ffd~V~~~~v~AF~~ra~~~~gp~~~ 217 (227) T KOG3177 146 ADGRLFNHLITIWSFKPGPNIPRTCTLDFSVSFEFKSLLHSQVATIFFDEVVKQMVNAFEQRARKLYGPESL 217 (227) T ss_pred CCCCHHHHHHHEEEECCCCCCCCEEEEEEEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 265077765422442358999973789899999861478888999999999999999999999986196013 No 4 >pfam03364 Polyketide_cyc Polyketide cyclase / dehydrase and lipid transport. This family contains polyketide cylcases/dehydrases which are enzymes involved in polyketide synthesis. The family also includes proteins which are involved in the binding/transport of lipids. Probab=99.92 E-value=4.5e-24 Score=174.28 Aligned_cols=124 Identities=30% Similarity=0.573 Sum_probs=112.6 Q ss_pred ECCCHHHHHHHHHHHHHCHHHCCCCCEEEEEECCCCCCCEEEEEEEEECCCEEEEEEEEEEECCCCCEEEEEECCCCCEE Q ss_conf 35987999999875742310187520579997268888419999999623126899988861576620896312454102 Q gi|254780677|r 10 VNHSSQQMLSLVSDIERYPEFVPLCKKVVIHERDNYGENEVLVASMTINYACMQREFMTQVRINQKEHYIAVKHIKNLFN 89 (152) Q Consensus 10 i~~~~e~i~~iV~Dve~YpeFlP~c~~~~v~~r~~~~~~~~~~a~l~v~~~~~~~~~~s~~~~~~~~~~i~~~~~~g~f~ 89 (152) |++||+++|++|+|+|+||+|+|||.+++|+++++++ ..+.++++++..+|++++...++.. +.+...+|+|+ T Consensus 1 i~ap~~~v~~~v~Dve~yp~f~P~~~~~~vl~~~~~~------~~~~v~~~~~~~~~~~~~~~~~~~~-i~~~~~~g~f~ 73 (125) T pfam03364 1 VPAPAEQVWALVTDVERYPEFLPWCKSVEVLDRDGSL------ADWRVAFGGLRRSFTARVTLQPPER-IEMVLVDGDFK 73 (125) T ss_pred CCCCHHHHHHHHHHHHHHHHHCHHHCEEEEEECCCCE------EEEEEEECEEEEEEEEEEEEEECCE-EEEEECCCCHH T ss_conf 9999899999999887776657334289999607982------3899972100489999999972427-99995689904 Q ss_pred EEEEEEEEEECCCC-CEEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 11379999987998-1899999998508878999999999999999999999 Q gi|254780677|r 90 FLENHWHFEEISES-KCKVHFSIKYELKNRLFDMMLKAIFDPSFLSFAKAFE 140 (152) Q Consensus 90 ~~~g~W~f~~~~~~-~t~V~~~l~fe~~~~~~~~~~~~~~~~~~~~~i~aF~ 140 (152) .+.|+|+|.+++++ .|+|+|+++|++++++++.+++.+++.+.++|++||+ T Consensus 74 ~l~g~W~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~af~ 125 (125) T pfam03364 74 RLEGSWRFEPGGPGTRVKVTLELDFEFASPLPGALLGFVFRRVLRTLLEAFR 125 (125) T ss_pred HEEEEEEEEECCCCCCEEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 3777899997489980899999999988899999999999999999998509 No 5 >COG5637 Predicted integral membrane protein [Function unknown] Probab=99.57 E-value=8.3e-14 Score=108.64 Aligned_cols=139 Identities=12% Similarity=0.186 Sum_probs=121.7 Q ss_pred CEEEEEEECCCHHHHHHHHHHHHHCHHHCCCCCEEEEEECCCCCCCEEEEEEEEECC-CEEEEEEEEEEECCCCCEEEEE Q ss_conf 248999835987999999875742310187520579997268888419999999623-1268999888615766208963 Q gi|254780677|r 3 HFTADRIVNHSSQQMLSLVSDIERYPEFVPLCKKVVIHERDNYGENEVLVASMTINY-ACMQREFMTQVRINQKEHYIAV 81 (152) Q Consensus 3 ~~~~~~~i~~~~e~i~~iV~Dve~YpeFlP~c~~~~v~~r~~~~~~~~~~a~l~v~~-~~~~~~~~s~~~~~~~~~~i~~ 81 (152) .++....|..|++++|+++.|.|+.|.|+.|..+++|+..++ .+|.++. .+...+|.++++-+.++.+|.| T Consensus 71 ~v~~~V~I~kPae~vy~~W~dLe~lP~~Mkhl~SVkVlddkr--------SrW~~~ap~g~~v~Wea~it~d~~~e~I~W 142 (217) T COG5637 71 EVEVQVTIDKPAEQVYAYWRDLENLPLWMKHLDSVKVLDDKR--------SRWKANAPLGLEVEWEAEITKDIPGERIQW 142 (217) T ss_pred EEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCEEECCCCCC--------CCEEECCCCCCEEEEEEHHHCCCCCCEEEE T ss_conf 888989958809999999972331158987631054467884--------302672798743887512100577867766 Q ss_pred ECCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHCCCCC Q ss_conf 124541021137999998799818999999985088789999999999999999999----9999997708999 Q gi|254780677|r 82 KHIKNLFNFLENHWHFEEISESKCKVHFSIKYELKNRLFDMMLKAIFDPSFLSFAKA----FEERAHKIYHLPS 151 (152) Q Consensus 82 ~~~~g~f~~~~g~W~f~~~~~~~t~V~~~l~fe~~~~~~~~~~~~~~~~~~~~~i~a----F~~Ra~~~yg~~~ 151 (152) .+.+|.--.+.|.-+|.+...++|.|...|.|.++..+.+.++..+|.+..+..|.+ |++..+. |.|| T Consensus 143 ~Sl~Ga~v~NsG~VrF~~~pg~~t~V~v~lsY~~Pgg~~~a~va~~fgeepeqqI~~DL~RFk~~~e~--~q~s 214 (217) T COG5637 143 ESLPGARVENSGAVRFYDAPGDSTEVKVTLSYRPPGGLLGAVVAKLFGEEPEQQIQDDLERFKEYQEN--GQPS 214 (217) T ss_pred ECCCCCCCCCCCCEEEEECCCCCEEEEEEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHC--CCCC T ss_conf 41788757888607866079981699999985499637899999986616578999999999999973--6843 No 6 >pfam10604 Polyketide_cyc2 Polyketide cyclase / dehydrase and lipid transport. This family contains polyketide cylcases/dehydrases which are enzymes involved in polyketide synthesis. It also includes other proteins of the START superfamily. Probab=99.56 E-value=8.7e-13 Score=102.15 Aligned_cols=136 Identities=10% Similarity=0.116 Sum_probs=98.0 Q ss_pred CCEEEEEEECCCHHHHHHHHHHHHHCHHHCCCCCEEEEEECCCCCCCEEEEEEEEECCCEEEEEEEEEEECCCCCEEEEE Q ss_conf 72489998359879999998757423101875205799972688884199999996231268999888615766208963 Q gi|254780677|r 2 YHFTADRIVNHSSQQMLSLVSDIERYPEFVPLCKKVVIHERDNYGENEVLVASMTINYACMQREFMTQVRINQKEHYIAV 81 (152) Q Consensus 2 ~~~~~~~~i~~~~e~i~~iV~Dve~YpeFlP~c~~~~v~~r~~~~~~~~~~a~l~v~~~~~~~~~~s~~~~~~~~~~i~~ 81 (152) .+++.+..|++||+++|++++|+++||+|.|++.++++..- +... ...+.........+... ...+++.+.+.+ T Consensus 1 ~~v~~si~I~ap~e~V~~~i~d~~~~~~W~~~~~~~~~~~~---g~~~--~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~ 74 (139) T pfam10604 1 AKVEVSITIAAPPEQVWALVSDFENWPRWHPGVLRCEGEGG---GPGV--VGTLRVGGRRGTVREEL-VEYDPPRRRLEY 74 (139) T ss_pred CEEEEEEEECCCHHHHHHHHHCCCHHHHHCCCCEEEEEECC---CCCE--EEEEEECCCCCEEEEEE-EEECCCCCEEEE T ss_conf 97999999779999999988684508661875168998649---9864--89999837774499999-998277829999 Q ss_pred ECCCCC-EEEEEEEEEEEECCCCCEEEEEEEEEEEC--CHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 124541-02113799999879981899999998508--8789999999999999999999999999 Q gi|254780677|r 82 KHIKNL-FNFLENHWHFEEISESKCKVHFSIKYELK--NRLFDMMLKAIFDPSFLSFAKAFEERAH 144 (152) Q Consensus 82 ~~~~g~-f~~~~g~W~f~~~~~~~t~V~~~l~fe~~--~~~~~~~~~~~~~~~~~~~i~aF~~Ra~ 144 (152) ...+++ +..+.+.|+|+|.++| |+|++..+|+.+ .++...++...+....+..++.++++++ T Consensus 75 ~~~~~~~~~~~~~~~~~~p~~~g-t~v~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~Lk~~~E 139 (139) T pfam10604 75 RIVGGPPVANYVGTWTVTPAGGG-TRVTWTGEFDPPLLGPLEGPAAAKAVKGDYEAGLANLKALLE 139 (139) T ss_pred EEECCCCCCCEEEEEEEEECCCC-EEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 98338971125899999987998-899999999649874205899999877789999999998619 No 7 >pfam06240 COXG Carbon monoxide dehydrogenase subunit G (CoxG). The CO dehydrogenase structural genes coxMSL are flanked by nine accessory genes arranged as the cox gene cluster. The cox genes are specifically and coordinately transcribed under chemolithoautotrophic conditions in the presence of CO as carbon and energy source. Probab=98.79 E-value=2.1e-06 Score=61.38 Aligned_cols=130 Identities=14% Similarity=0.166 Sum_probs=91.0 Q ss_pred EEEECCCHHHHHHHHHHHHHCHHHCCCCCEEEEEECCCCCCCEEEEEEEEECCCEEEEEEEEEEECC---CCCEE-EEEE Q ss_conf 9983598799999987574231018752057999726888841999999962312689998886157---66208-9631 Q gi|254780677|r 7 DRIVNHSSQQMLSLVSDIERYPEFVPLCKKVVIHERDNYGENEVLVASMTINYACMQREFMTQVRIN---QKEHY-IAVK 82 (152) Q Consensus 7 ~~~i~~~~e~i~~iV~Dve~YpeFlP~c~~~~v~~r~~~~~~~~~~a~l~v~~~~~~~~~~s~~~~~---~~~~~-i~~~ 82 (152) +-.|++||+++|+++.|.++...=+|+|.+.++ .. +.+.+.+.+..++++.+|..++... ++.+. +... T Consensus 2 sf~v~a~~~~vw~~L~Dp~~l~~CiPG~~~~e~---~~----~~~~~~v~v~vGpi~~~f~g~v~l~~~~~~~~~~i~~~ 74 (140) T pfam06240 2 SFRVPAPPEEVWEFLNDPEVLARCIPGVKSVEK---NG----DEYKAKVKLKVGPLKGSYTGRVEFSDLDPPGSYTISIK 74 (140) T ss_pred EEEECCCHHHHHHHHCCHHHHHHHCCCHHEEEE---CC----CEEEEEEEEEEEEEEEEEEEEEEEEECCCCCEEEEEEE T ss_conf 498039999999986699999853888200475---48----76999999997128899999999953489851899999 Q ss_pred CCCCC--EEEEEEEEEEEECCCCCEEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 24541--0211379999987998189999999850887899999999999999999999999997 Q gi|254780677|r 83 HIKNL--FNFLENHWHFEEISESKCKVHFSIKYELKNRLFDMMLKAIFDPSFLSFAKAFEERAHK 145 (152) Q Consensus 83 ~~~g~--f~~~~g~W~f~~~~~~~t~V~~~l~fe~~~~~~~~~~~~~~~~~~~~~i~aF~~Ra~~ 145 (152) -.++. +..-.....+. -++++|+|+|+.++++..++-+.- ..+++.+++++++.|.++-.+ T Consensus 75 G~g~~~~~~~~~~~~~~~-~~~~~T~v~~~~~~~~~G~la~~G-~~li~~~a~~li~~f~~~l~~ 137 (140) T pfam06240 75 GKGRGAGFAIGFAIVVLV-EPGGGTRLSWSAKADVGGKLAQLG-SRLIDSAARKLINRFFECIAS 137 (140) T ss_pred EECCCCCCCEEEEEEEEE-CCCCCEEEEEEEEEEECCCHHHHC-HHHHHHHHHHHHHHHHHHHHH T ss_conf 723766441026999996-599977999999999785288767-789999999999999999998 No 8 >COG3427 Carbon monoxide dehydrogenase subunit G, CoxG [Energy production and conversion] Probab=98.58 E-value=1.2e-05 Score=56.41 Aligned_cols=133 Identities=13% Similarity=0.182 Sum_probs=95.9 Q ss_pred CEEEEEEECCCHHHHHHHHHHHHHCHHHCCCCCEEEEEECCCCCCCEEEEEEEEECCCEEEEEEEEEEEC---CCCCEEE Q ss_conf 2489998359879999998757423101875205799972688884199999996231268999888615---7662089 Q gi|254780677|r 3 HFTADRIVNHSSQQMLSLVSDIERYPEFVPLCKKVVIHERDNYGENEVLVASMTINYACMQREFMTQVRI---NQKEHYI 79 (152) Q Consensus 3 ~~~~~~~i~~~~e~i~~iV~Dve~YpeFlP~c~~~~v~~r~~~~~~~~~~a~l~v~~~~~~~~~~s~~~~---~~~~~~i 79 (152) .|+.+-.|++||+++++.++|.|+...-+|+|.+.+.. +..+.+.+.+..+.++.++..++.+ +.+.+.+ T Consensus 2 ~~~G~f~V~~p~e~Vw~~L~dpe~~a~ciPG~qs~e~~-------g~e~~~~v~l~ig~l~~~~~g~~~~~~v~~~~~~~ 74 (146) T COG3427 2 DYEGTFRVAAPPEAVWEFLNDPEQVAACIPGVQSVETN-------GDEYTAKVKLKIGPLKGTFSGRVRFVNVDEPPRSI 74 (146) T ss_pred CCCCEEEECCCHHHHHHHHCCHHHHHHHCCCCCEEEEC-------CCEEEEEEEEEECCEEEEEEEEEEECCCCCCCCEE T ss_conf 41213881699999998843999987516774133414-------87499999984144058998889971247898569 Q ss_pred EEECCCC---CEEEEEEEEEEEECCCCCEEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 6312454---1021137999998799818999999985088789999999999999999999999999 Q gi|254780677|r 80 AVKHIKN---LFNFLENHWHFEEISESKCKVHFSIKYELKNRLFDMMLKAIFDPSFLSFAKAFEERAH 144 (152) Q Consensus 80 ~~~~~~g---~f~~~~g~W~f~~~~~~~t~V~~~l~fe~~~~~~~~~~~~~~~~~~~~~i~aF~~Ra~ 144 (152) ...-.+| -|-.....-.+.|-+++ |+|++..+-+....+ ..+-..+++.+++++++-|.+.-. T Consensus 75 ~i~g~G~~~~g~~~~~~~v~l~~~g~g-t~v~w~~~~~~gg~l-aqlGsr~i~~~~~kli~~~~~~l~ 140 (146) T COG3427 75 TINGSGGGAAGFADGTVDVQLEPSGEG-TRVNWFADANVGGKL-AQLGSRLIDSVARKLINRFFDCLS 140 (146) T ss_pred EEEEECCCCCCEEECEEEEEEEECCCC-CEEEEEEECCCCHHH-HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 997345653012101268999873898-489999972423779-988577778999999999999999 No 9 >cd00177 START START(STeroidogenic Acute Regulatory (STAR) related lipid Transfer) Domain. These domains are 200-210 amino acid in length and occur in proteins involved in lipid transport (phosphatidylcholine) and metabolism, signal transduction, and transcriptional regulation. The most striking feature of the START domain structure is a predominantly hydrophobic tunnel extending nearly the entire protein and used to binding a single molecule of large lipophilic compounds, like cholesterol. Probab=98.50 E-value=2.7e-05 Score=54.21 Aligned_cols=138 Identities=13% Similarity=0.218 Sum_probs=101.7 Q ss_pred EEEECCCHHHHHHHHHHHHHCHHHCCCCCEEEEEECCCCCCCEEEEEEEEECCCE--EEEEEEE-EEEC-CC-CCEEEEE Q ss_conf 9983598799999987574231018752057999726888841999999962312--6899988-8615-76-6208963 Q gi|254780677|r 7 DRIVNHSSQQMLSLVSDIERYPEFVPLCKKVVIHERDNYGENEVLVASMTINYAC--MQREFMT-QVRI-NQ-KEHYIAV 81 (152) Q Consensus 7 ~~~i~~~~e~i~~iV~Dve~YpeFlP~c~~~~v~~r~~~~~~~~~~a~l~v~~~~--~~~~~~s-~~~~-~~-~~~~i~~ 81 (152) +.++..+|+++++++.|.+.+++|-|.+...+|+++.+.+ ..+.+.+.....+ -...|+. |... .. ..-+|.. T Consensus 58 ~~~v~~~~~~l~d~l~d~~~r~~Wd~~~~~~~vi~~~~~~--~~~~~~~~~~~p~lv~~RDfv~lR~~~~~~~~~~~i~~ 135 (212) T cd00177 58 TGVVDVTAETLVDVLMDVEKRPEWDPNVKTLEVIETGDGG--DLMIYELQAPSPGLVPPRDFVFLRYCKRLEDGTYVIVD 135 (212) T ss_pred EEEECCCHHHHHHHHHCHHHHHHHHHCCCEEEEEEECCCC--CEEEEEEECCCCCCCCCEEEEEEEEEEECCCCCEEEEE T ss_conf 8763288999999996947776653134579999862799--58999997478764365289999999994999699999 Q ss_pred ECCCCC----------EEEEEEEEEEEECCCCCEEEEEEEEEEECCH--HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 124541----------0211379999987998189999999850887--8999999999999999999999999977 Q gi|254780677|r 82 KHIKNL----------FNFLENHWHFEEISESKCKVHFSIKYELKNR--LFDMMLKAIFDPSFLSFAKAFEERAHKI 146 (152) Q Consensus 82 ~~~~g~----------f~~~~g~W~f~~~~~~~t~V~~~l~fe~~~~--~~~~~~~~~~~~~~~~~i~aF~~Ra~~~ 146 (152) .+.+.+ -+.+-+.|.+.|.++|.|+|++-...+++.. +...+++..+...+.+.+.++.+.|+.+ T Consensus 136 ~Sv~~~~~p~~~~~vR~~~~~sg~~i~~~~~g~skvt~v~~~d~~g~~~lp~~lv~~~~~~~~~~~~~~L~~~c~~~ 212 (212) T cd00177 136 VSVDHPSVPPSPGYVRAENLPSGCLIQPLPPGYSKVTWVEHVDLKGWGSLPRPLVNSGLAFGAARWVATLRRQCERL 212 (212) T ss_pred EECCCCCCCCCCCCEEEEEECCEEEEEECCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 95668888876772445221526999988999719999998668885216899987687867999999999997439 No 10 >pfam01852 START START domain. Probab=97.77 E-value=0.0022 Score=41.98 Aligned_cols=142 Identities=13% Similarity=0.126 Sum_probs=92.5 Q ss_pred CEEEEEEECCCHHHHHHHHHHHHHCH-HHCCCCCEEEEEECCCCCC-CEEEEEEEEECCCEEEEEEEE-EEECC-CCCE- Q ss_conf 24899983598799999987574231-0187520579997268888-419999999623126899988-86157-6620- Q gi|254780677|r 3 HFTADRIVNHSSQQMLSLVSDIERYP-EFVPLCKKVVIHERDNYGE-NEVLVASMTINYACMQREFMT-QVRIN-QKEH- 77 (152) Q Consensus 3 ~~~~~~~i~~~~e~i~~iV~Dve~Yp-eFlP~c~~~~v~~r~~~~~-~~~~~a~l~v~~~~~~~~~~s-~~~~~-~~~~- 77 (152) .++.+.+|..+|..+++.+.|..+|. +|-+-|.+++++++-.++. ...+.+.+.+........|.. |.... ..+. T Consensus 46 a~R~~g~V~~~~~~lv~~l~~~~~~~~~Wd~~v~~~~~i~~i~~~~~i~~~~~~~~~~~pl~pRDfv~lR~~~~~~~g~~ 125 (207) T pfam01852 46 ASRASGVVPMVLALLVEELLDDMEYRAQWDKDVRSAETLEVISSGGALQYYVAELQAPTPLSPRDFVFVRYWRELRDGVY 125 (207) T ss_pred EEEEEEEECCCHHHHHHHHHCCHHHHHCCHHHHHHEEEEEEECCCCEEEEEEEHHHCCCCCCCCEEEEEEEEEECCCCCE T ss_conf 99999998178999999997480655205056211178899479978999997300689868867999999999289819 Q ss_pred EEEEECCCCC----------EEEEEEEEEEEECCCCCEEEEEEEEEEECCHHHHHHHHHHHH----HHHHHHHHHHHHHH Q ss_conf 8963124541----------021137999998799818999999985088789999999999----99999999999999 Q gi|254780677|r 78 YIAVKHIKNL----------FNFLENHWHFEEISESKCKVHFSIKYELKNRLFDMMLKAIFD----PSFLSFAKAFEERA 143 (152) Q Consensus 78 ~i~~~~~~g~----------f~~~~g~W~f~~~~~~~t~V~~~l~fe~~~~~~~~~~~~~~~----~~~~~~i~aF~~Ra 143 (152) .|.-.+++.+ -+.+-+.|-++|.++|.|+|++-...|++..+...++++++. ..+++.+.++.+-| T Consensus 126 ~i~~~Sv~~~~~p~~~~~vR~~~~psG~li~~~~~g~skvt~v~h~d~~g~~p~~l~~~~~~s~~a~ga~~w~a~Lq~~c 205 (207) T pfam01852 126 VIVDRSVDHPQFPPSSGYVRAERLPSGYLIEPCGNGPSKVTWVEHTDLKGWLPHWLIRSLLKSGMAFGAKTWVATLQRLC 205 (207) T ss_pred EEEEEECCCCCCCCCCCEEEEEEECCEEEEEECCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99998235887887777041135245069998799971999999866899975899976414146889999999999974 Q ss_pred H Q ss_conf 9 Q gi|254780677|r 144 H 144 (152) Q Consensus 144 ~ 144 (152) + T Consensus 206 e 206 (207) T pfam01852 206 E 206 (207) T ss_pred C T ss_conf 5 No 11 >smart00234 START in StAR and phosphatidylcholine transfer protein. putative lipid-binding domain in StAR and phosphatidylcholine transfer protein Probab=97.66 E-value=0.0052 Score=39.63 Aligned_cols=140 Identities=14% Similarity=0.139 Sum_probs=90.0 Q ss_pred CEEEEEEECCCHHHHHHHHHHHHHC-HHHCCCCCEEEEEECCCCCCC-EEEEEEEEECCCEE-EEEEEE-EEEC-CCCCE Q ss_conf 2489998359879999998757423-101875205799972688884-19999999623126-899988-8615-76620 Q gi|254780677|r 3 HFTADRIVNHSSQQMLSLVSDIERY-PEFVPLCKKVVIHERDNYGEN-EVLVASMTINYACM-QREFMT-QVRI-NQKEH 77 (152) Q Consensus 3 ~~~~~~~i~~~~e~i~~iV~Dve~Y-peFlP~c~~~~v~~r~~~~~~-~~~~a~l~v~~~~~-~~~~~s-~~~~-~~~~~ 77 (152) .++.+.+|..++..+++.+-|..+| ++|-+-|.+++++++.++++. ..+.+... .+.+ ...|+. |... ..++. T Consensus 46 a~R~~gvV~~~~~~l~~~l~d~~~~r~~Wd~~v~~~~~v~~i~~~~~i~~~~~~~~--~~p~~pRDfv~lR~~~~~~~g~ 123 (206) T smart00234 46 ASRAVGVVPMVCADLVEELMDDLRYRPEWDKNVAKAETLEVIDNGTVIYHYVSKFV--AGPVSPRDFVFVRYWRELVDGS 123 (206) T ss_pred EEEEEEEECCCHHHHHHHHHCCCHHHHHCCHHHHEEEEEEEECCCCEEEEEECCCC--CCCCCCCEEEEEEEEEECCCCC T ss_conf 99989998377999999997276104207245300799999579978999992237--8866886799999999908995 Q ss_pred -EEEEECCCCC----------EEEEEEEEEEEECCCCCEEEEEEEEEEECCHHHHHHHHHHHHH----HHHHHHHHHHHH Q ss_conf -8963124541----------0211379999987998189999999850887899999999999----999999999999 Q gi|254780677|r 78 -YIAVKHIKNL----------FNFLENHWHFEEISESKCKVHFSIKYELKNRLFDMMLKAIFDP----SFLSFAKAFEER 142 (152) Q Consensus 78 -~i~~~~~~g~----------f~~~~g~W~f~~~~~~~t~V~~~l~fe~~~~~~~~~~~~~~~~----~~~~~i~aF~~R 142 (152) .|.-.+++.+ -+.+-+.|.++|.++|.|+|++-...+++..+...++++++.. .+++.+.++.+- T Consensus 124 ~~i~~~Sv~~~~~p~~~~~vR~~~~psG~~i~p~~~g~skvt~v~~~d~~g~~p~~l~~~~~~s~~~~~a~~~~a~L~~~ 203 (206) T smart00234 124 YAVVDVSVTHPTSPPTSGYVRAENLPSGLLIEPLGNGPSKVTWVSHADLKGWLPHWLVRSLIKSGLAEFAKTWVATLQKH 203 (206) T ss_pred EEEEEEECCCCCCCCCCCCCCCEEECCEEEEEECCCCCEEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999936788788667762002766369999889997899999986589997389998775510677899999999998 Q ss_pred HH Q ss_conf 99 Q gi|254780677|r 143 AH 144 (152) Q Consensus 143 a~ 144 (152) |+ T Consensus 204 c~ 205 (206) T smart00234 204 CA 205 (206) T ss_pred HC T ss_conf 55 No 12 >pfam00407 Bet_v_1 Pathogenesis-related protein Bet v I family. This family is named after Bet v 1, the major birch pollen allergen. This protein belongs to family 10 of plant pathogenesis-related proteins (PR-10), cytoplasmic proteins of 15-17 kd that are wide-spread among dicotyledonous plants. In recent years, a number of diverse plant proteins with low sequence similarity to Bet v 1 was identified. A classification by sequence similarity yielded several subfamilies related to PR-10: - Pathogenesis-related proteins PR-10: These proteins were identified as major tree pollen allergens in birch and related species (hazel, alder), as plant food allergens expressed in high levels in fruits, vegetables and seeds (apple, celery, hazelnut), and as pathogenesis-related proteins whose expression is induced by pathogen infection, wounding, or abiotic stress. Hyp-1, an enzyme involved in the synthesis of the bioactive naphthodianthrone hypericin in St. John's wort (Hypericum perforatum) also bel Probab=97.64 E-value=0.0065 Score=39.02 Aligned_cols=114 Identities=12% Similarity=0.136 Sum_probs=79.3 Q ss_pred CEEEEEEECCCHHHHHHHHHH-HHHCHHHCCC-CCEEEEEECCCCCCCEEEEEEEEECCCEEEEEEEEEEE-CCCCCEEE Q ss_conf 248999835987999999875-7423101875-20579997268888419999999623126899988861-57662089 Q gi|254780677|r 3 HFTADRIVNHSSQQMLSLVSD-IERYPEFVPL-CKKVVIHERDNYGENEVLVASMTINYACMQREFMTQVR-INQKEHYI 79 (152) Q Consensus 3 ~~~~~~~i~~~~e~i~~iV~D-ve~YpeFlP~-c~~~~v~~r~~~~~~~~~~a~l~v~~~~~~~~~~s~~~-~~~~~~~i 79 (152) .++.+..+++||+++|.+..+ -..-|+-+|. ++++++++=+.+..+... .+...-++-......|+. .|..+..+ T Consensus 4 ~~e~Ev~~~~~a~k~~k~~~~~~~liPk~~P~~i~sve~~eGdgg~~GsIk--~~~f~~gg~~~~~Kerie~iD~~n~~~ 81 (150) T pfam00407 4 KLETEVEIKVSADKFYKAFKLEDHLIPKAIPHHIQGVEVHEGEWGTGGSIK--KWTYTEDGKPKVVKERIEADDEKNKTI 81 (150) T ss_pred EEEEEEECCCCHHHHHHHHHCCCCCCCCCCHHHCEEEEEECCCCCCCCEEE--EEEECCCCCEEEEEEEEEEECCCCCEE T ss_conf 799998716899999999833533154007445307999835889898799--999848995368999999883878599 Q ss_pred EEECCCCC----EEEEEEEEEEEECCCCCEEEEEEEEEEECCH Q ss_conf 63124541----0211379999987998189999999850887 Q gi|254780677|r 80 AVKHIKNL----FNFLENHWHFEEISESKCKVHFSIKYELKNR 118 (152) Q Consensus 80 ~~~~~~g~----f~~~~g~W~f~~~~~~~t~V~~~l~fe~~~~ 118 (152) .+..++|. ++.+.....+.|.++|+|.+..+++|+.++. T Consensus 82 ~y~viEGd~l~~~~~~~~~~~~~p~~~ggsi~k~t~eYe~~~~ 124 (150) T pfam00407 82 TYSVLEGDVLEDYKSFITTLQVIPKSNGGSIVKWTIEYEKIGD 124 (150) T ss_pred EEEEEECCCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCC T ss_conf 9999923356780899999999877999789999999999999 No 13 >COG4276 Uncharacterized conserved protein [Function unknown] Probab=97.44 E-value=0.0053 Score=39.61 Aligned_cols=132 Identities=13% Similarity=0.101 Sum_probs=85.8 Q ss_pred CCCEEEEEEECCCHHHHHHHHHHHHHCHHHCCCCCEEEEEE-CCCCCCCEEEEEEEEECCCEEEEEEEEEEEC--CCCCE Q ss_conf 97248999835987999999875742310187520579997-2688884199999996231268999888615--76620 Q gi|254780677|r 1 MYHFTADRIVNHSSQQMLSLVSDIERYPEFVPLCKKVVIHE-RDNYGENEVLVASMTINYACMQREFMTQVRI--NQKEH 77 (152) Q Consensus 1 M~~~~~~~~i~~~~e~i~~iV~Dve~YpeFlP~c~~~~v~~-r~~~~~~~~~~a~l~v~~~~~~~~~~s~~~~--~~~~~ 77 (152) |-++..+-+|.+|+|.+++.-+........-|.-. .|+. -++-..+.++.-.+.+-.-+-..+|+++.+. .-+.. T Consensus 1 m~tF~~~~~i~aP~E~VWafhsrpd~lq~LTppw~--VV~p~g~eitqgtri~m~l~pfglp~~~tW~Arhte~~~d~~~ 78 (153) T COG4276 1 MGTFVYRTTITAPHEMVWAFHSRPDALQRLTPPWI--VVLPLGSEITQGTRIAMGLTPFGLPAGLTWVARHTESGFDNGS 78 (153) T ss_pred CCCEEEEEEECCCHHHHHHHHCCCCHHHHCCCCCE--EECCCCCCCCCEEEEEECCEEECCCCCCEEEEEEEECCCCCCC T ss_conf 96258766725978887565248457885589857--8615888311012432052640478774689996502347864 Q ss_pred EEEEECCCCCEEEEEEEEE----EEECCCCCEEEEEEEEEEECCH-HHHHHHHHHHHHHHHHHHH Q ss_conf 8963124541021137999----9987998189999999850887-8999999999999999999 Q gi|254780677|r 78 YIAVKHIKNLFNFLENHWH----FEEISESKCKVHFSIKYELKNR-LFDMMLKAIFDPSFLSFAK 137 (152) Q Consensus 78 ~i~~~~~~g~f~~~~g~W~----f~~~~~~~t~V~~~l~fe~~~~-~~~~~~~~~~~~~~~~~i~ 137 (152) ...=..+.|||..+ .|+ |.+. +|+|.+.=++.||++.. +.+.+...+.+.+++.|-. T Consensus 79 ~FtDv~i~gPfp~~--~WrHtH~F~~e-gg~TvliD~Vsye~p~g~~~~~~g~~l~q~~l~~mFr 140 (153) T COG4276 79 RFTDVCITGPFPAL--NWRHTHNFVDE-GGGTVLIDSVSYELPAGTLTGMFGYRLTQLILDLMFR 140 (153) T ss_pred EEEEEEECCCCCCE--EEEEEEEEECC-CCCEEEEEEEEEECCCCCEECHHHHHHHHHHHHHHHH T ss_conf 04544312876540--14777401648-9967998667765367502100002328999999998 No 14 >COG3832 Uncharacterized conserved protein [Function unknown] Probab=96.32 E-value=0.1 Score=31.38 Aligned_cols=95 Identities=8% Similarity=0.012 Sum_probs=54.0 Q ss_pred CEEEEEEECCCHHHHHHHHHHHHHCHHHCC-CCCEEEEEECCCCCCCEEEEEEEEECCCEEEEEEEEEEECCCCCEEE-- Q ss_conf 248999835987999999875742310187-52057999726888841999999962312689998886157662089-- Q gi|254780677|r 3 HFTADRIVNHSSQQMLSLVSDIERYPEFVP-LCKKVVIHERDNYGENEVLVASMTINYACMQREFMTQVRINQKEHYI-- 79 (152) Q Consensus 3 ~~~~~~~i~~~~e~i~~iV~Dve~YpeFlP-~c~~~~v~~r~~~~~~~~~~a~l~v~~~~~~~~~~s~~~~~~~~~~i-- 79 (152) .++.+++|++||++||+.+.|=+..++|+. .+ .+..-|... .......... +.. +.+..++..-.+++.| T Consensus 9 ~~~~er~i~aP~e~Vf~A~Tdpe~l~~W~~~~~--~~~d~r~gg--~~~~~~~~~~--g~~-~~~~~~~~~v~p~~rIv~ 81 (149) T COG3832 9 TLEIERLIDAPPEKVFEALTDPELLARWFMPGG--AEFDARTGG--GERVRFRGPD--GPV-HSFEGEYLEVVPPERIVF 81 (149) T ss_pred EEEEEEEECCCHHHHHHHHHCHHHHHHHCCCCC--CCCCEECCC--CEEEEEECCC--CCC-CCCCEEEEEECCCCEEEE T ss_conf 199999968998999998549889987348998--732211277--3389874378--871-011259999558977999 Q ss_pred EEECCCC--CEEEEEEEEEEEECCCCC Q ss_conf 6312454--102113799999879981 Q gi|254780677|r 80 AVKHIKN--LFNFLENHWHFEEISESK 104 (152) Q Consensus 80 ~~~~~~g--~f~~~~g~W~f~~~~~~~ 104 (152) .+...++ ++..-...|+|.+..+|+ T Consensus 82 tw~~~~~~~~~~~~~v~~~l~~~~~g~ 108 (149) T COG3832 82 TWDFDEDGEPFLKSLVTITLTPEDDGG 108 (149) T ss_pred EECCCCCCCCCCCCEEEEEEEEECCCC T ss_conf 963677777666754899999942898 No 15 >pfam10698 DUF2505 Protein of unknown function (DUF2505). Members of this family are all Actinobacteria. The function is not known. Probab=95.57 E-value=0.22 Score=29.27 Aligned_cols=132 Identities=8% Similarity=0.068 Sum_probs=73.7 Q ss_pred EEEEEEECCCHHHHHHHHHHHHHCHHHCCCC--CEEEEEECCCCCCCEEEEEEEEECC------------CE----EEEE Q ss_conf 4899983598799999987574231018752--0579997268888419999999623------------12----6899 Q gi|254780677|r 4 FTADRIVNHSSQQMLSLVSDIERYPEFVPLC--KKVVIHERDNYGENEVLVASMTINY------------AC----MQRE 65 (152) Q Consensus 4 ~~~~~~i~~~~e~i~~iV~Dve~YpeFlP~c--~~~~v~~r~~~~~~~~~~a~l~v~~------------~~----~~~~ 65 (152) +..+.-.++|++++|....|=+=|..-+-.. ...++.+-..++++..+.-...+.. +. ..++ T Consensus 1 ~~~~~~y~~~ve~V~~a~~de~yw~~r~~~~g~~~~~v~~~~~~~~g~~v~~~~~lp~~~LP~~v~k~v~~~l~i~r~e~ 80 (159) T pfam10698 1 FEHSAEYPATVERVHAAFTDEDYWEARLEELGGDNAELESLTVDGDGIRVVTTQVLPADDLPGAVRKFVGGDLKIKREET 80 (159) T ss_pred CEEEEECCCCHHHHHHHHCCHHHHHHHHHHHCCCCCEEEEEEECCCCEEEEEEEECCHHHCCHHHHHHCCCCCEEEEEEE T ss_conf 93884659999999999769999999999709997269999856994699999954701283889975677716999999 Q ss_pred EEEEEECCCCCEEEEEECCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9888615766208963124541021137999998799818999999985088789999999999999999999 Q gi|254780677|r 66 FMTQVRINQKEHYIAVKHIKNLFNFLENHWHFEEISESKCKVHFSIKYELKNRLFDMMLKAIFDPSFLSFAKA 138 (152) Q Consensus 66 ~~s~~~~~~~~~~i~~~~~~g~f~~~~g~W~f~~~~~~~t~V~~~l~fe~~~~~~~~~~~~~~~~~~~~~i~a 138 (152) |. ..........+.....+.|. .+.|.-++.+.+ ++|+.++..+++.+.||++..+.++.-..+.+.+.+ T Consensus 81 W~-~~~dg~~~g~~~~~i~G~Pv-~~~Gt~~L~~~g-~gt~~~~~~~v~v~IPlvGgkiE~~v~~~l~~~~~~ 150 (159) T pfam10698 81 WT-PLDDGEATGEVQVSIPGAPV-SLTGTMRLSPDG-GGSRLEVTGTVEVKVPLVGGKIEKLVADQLVDLLRA 150 (159) T ss_pred EE-ECCCCEEEEEEEEEEECCEE-EEEEEEEEEECC-CCEEEEEEEEEEEEECCCCHHHHHHHHHHHHHHHHH T ss_conf 81-07999189999999706348-899999998389-966999998999996275389999999999999998 No 16 >pfam08327 AHSA1 Activator of Hsp90 ATPase homolog 1-like protein. This family includes eukaryotic, prokaryotic and archaeal proteins that bear similarity to a C-terminal region of human activator of 90 kDa heat shock protein ATPase homolog 1 (AHSA1/p38). This protein is known to interact with the middle domain of Hsp90, and stimulate its ATPase activity. It is probably a general upregulator of Hsp90 function, particularly contributing to its efficiency in conditions of increased stress. p38 is also known to interact with the cytoplasmic domain of the VSV G protein, and may thus be involved in protein transport. It has also been reported as being underexpressed in Down's syndrome. This region is found repeated in two members of this family. Probab=94.32 E-value=0.48 Score=27.10 Aligned_cols=94 Identities=7% Similarity=0.000 Sum_probs=50.4 Q ss_pred CCCHHHHHHHHHHHHHCHHHCCCCCEEEEEECCCCCCCEEEEEEEEECCCEEEEEEEE-EEECCCCCEEE-EEECCCC-C Q ss_conf 5987999999875742310187520579997268888419999999623126899988-86157662089-6312454-1 Q gi|254780677|r 11 NHSSQQMLSLVSDIERYPEFVPLCKKVVIHERDNYGENEVLVASMTINYACMQREFMT-QVRINQKEHYI-AVKHIKN-L 87 (152) Q Consensus 11 ~~~~e~i~~iV~Dve~YpeFlP~c~~~~v~~r~~~~~~~~~~a~l~v~~~~~~~~~~s-~~~~~~~~~~i-~~~~~~g-~ 87 (152) ++|+++||+.++|-+...+|... ..+++.-|. |+...+. . -.+......- -...+++.++. .+...+. + T Consensus 1 ~Ap~e~Vw~A~t~pe~l~~W~~~-~~~~~d~~~--GG~~~~~----~-~~g~~~~~~g~~~ev~p~~~l~~tw~~~~~~~ 72 (124) T pfam08327 1 DAPPERVWRALTDPELLARWFTT-TEAEMDLRP--GGRFRFR----G-PDGEEFGGNGTYLEVVPPERIVYTWRTFDWPE 72 (124) T ss_pred CCCHHHHHHHHCCHHHHHHHCCC-CCCCCCCCC--CCEEEEE----E-CCCCEEEEEEEEEEEECCEEEEEEECCCCCCC T ss_conf 94989999997298899750599-964113206--9679999----5-89967763699999932548999971478887 Q ss_pred EEEEEEEEEEEECCCCCEEEEEEEEE Q ss_conf 02113799999879981899999998 Q gi|254780677|r 88 FNFLENHWHFEEISESKCKVHFSIKY 113 (152) Q Consensus 88 f~~~~g~W~f~~~~~~~t~V~~~l~f 113 (152) .....-.|.|++.+ ++|+|++...- T Consensus 73 ~~~~~v~~~l~~~~-~gT~ltl~~~g 97 (124) T pfam08327 73 GGYSTVTVELEEEG-GGTRLTLTHTG 97 (124) T ss_pred CCCEEEEEEEEEEC-CCEEEEEEEEC T ss_conf 87579999999908-97999999977 No 17 >pfam11687 DUF3284 Domain of unknown function (DUF3284). This family of proteins with unknown function appears to be restricted to Firmicutes. Probab=94.10 E-value=0.53 Score=26.82 Aligned_cols=106 Identities=8% Similarity=0.100 Sum_probs=56.9 Q ss_pred EEEEEECCCHHHHHHHHHH-----HHHCHH-HCCCCCEEEEEECCCCCCCEEEEEEEEECCCEEEEEEEEEE-ECCCCCE Q ss_conf 8999835987999999875-----742310-18752057999726888841999999962312689998886-1576620 Q gi|254780677|r 5 TADRIVNHSSQQMLSLVSD-----IERYPE-FVPLCKKVVIHERDNYGENEVLVASMTINYACMQREFMTQV-RINQKEH 77 (152) Q Consensus 5 ~~~~~i~~~~e~i~~iV~D-----ve~Ype-FlP~c~~~~v~~r~~~~~~~~~~a~l~v~~~~~~~~~~s~~-~~~~~~~ 77 (152) .-++.+++|++++|+.+.+ +.+... =+|-+. -. +- ...-.++.+. ....++ ...+|.. T Consensus 2 kI~~~lnvsa~~fF~~i~~s~~~Di~~~tgk~~~~~~--------l~--G~--~Y~K~~~~~~---~~~vkIt~~~~n~~ 66 (120) T pfam11687 2 KITKTLNVPAKDFFDKLIKSALADIKQATGKELTRAS--------LQ--GY--EYQKQFGNGR---RATVKITEYEPNQK 66 (120) T ss_pred EEEEEECCCHHHHHHHHHHHHHHHHHHHHCCCCCHHH--------HC--CE--EEEEECCCCC---EEEEEEEEECCCCE T ss_conf 6999957799999999999999999997189876556--------25--83--8998628884---79999999618968 Q ss_pred EEEEECCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEECCHH---HHHHHHHH Q ss_conf 896312454102113799999879981899999998508878---99999999 Q gi|254780677|r 78 YIAVKHIKNLFNFLENHWHFEEISESKCKVHFSIKYELKNRL---FDMMLKAI 127 (152) Q Consensus 78 ~i~~~~~~g~f~~~~g~W~f~~~~~~~t~V~~~l~fe~~~~~---~~~~~~~~ 127 (152) .... ...+......+|.+.+++++.|+|+|+=+++-.+.+ .+++++.+ T Consensus 67 -Y~~~-~~s~~~~~~v~Y~l~~l~~~~~~v~y~E~~~~~~~~~~~n~~l~~~~ 117 (120) T pfam11687 67 -YAYT-TSTNRDTFTVRYQLKPLGEGTTKVTYSETMSTKNFFGKANNKLVGLL 117 (120) T ss_pred -EEEE-EEECCCCEEEEEEEEECCCCCEEEEEEEEEECCCHHHHHHHHHHHHH T ss_conf -9999-97069968999999876999589999998732149999989999883 No 18 >KOG2761 consensus Probab=63.74 E-value=12 Score=18.11 Aligned_cols=140 Identities=13% Similarity=0.193 Sum_probs=82.8 Q ss_pred CCEEEEEEE-CCCHHHHHHHHHHHHHCHHHCCCCCEEEEEECCCCCCCEEEEEEEEECCCEEEEEEEE-EEECCC--CCE Q ss_conf 724899983-5987999999875742310187520579997268888419999999623126899988-861576--620 Q gi|254780677|r 2 YHFTADRIV-NHSSQQMLSLVSDIERYPEFVPLCKKVVIHERDNYGENEVLVASMTINYACMQREFMT-QVRINQ--KEH 77 (152) Q Consensus 2 ~~~~~~~~i-~~~~e~i~~iV~Dve~YpeFlP~c~~~~v~~r~~~~~~~~~~a~l~v~~~~~~~~~~s-~~~~~~--~~~ 77 (152) ..|....+. .++|+.++|+-.|-|==++|=..+.+.+.+++++..+.........+.+......|+. |..++. ... T Consensus 53 ~~Yk~~~vfeDvtp~~~~Dv~~D~eYRkkWD~~vi~~e~ie~d~~tg~~vv~w~~kfP~p~~~RdYV~~Rr~~~~~~k~~ 132 (219) T KOG2761 53 YEYKSRTVFEDVTPEIVRDVQWDDEYRKKWDDMVIELETIEEDPVTGTEVVYWVKKFPFPMSNRDYVYVRRWWESDEKDY 132 (219) T ss_pred EEEEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHEEEEECCCCCCEEEEEEEECCCCCCCCCEEEEEEEEECCCCEE T ss_conf 79999999758999999887701677877888765332544327888669999884784367863799999973588448 Q ss_pred EEEEECC---CCC-------EEEEEEEEEEE----ECCCCCEEEEEEEEEEECCHHHHHHHHHHHHH----HHHHHHHHH Q ss_conf 8963124---541-------02113799999----87998189999999850887899999999999----999999999 Q gi|254780677|r 78 YIAVKHI---KNL-------FNFLENHWHFE----EISESKCKVHFSIKYELKNRLFDMMLKAIFDP----SFLSFAKAF 139 (152) Q Consensus 78 ~i~~~~~---~g~-------f~~~~g~W~f~----~~~~~~t~V~~~l~fe~~~~~~~~~~~~~~~~----~~~~~i~aF 139 (152) .+..... .-| .......|... ..+.++|.+.+.+--....++..-+++..++. .+++|-.|. T Consensus 133 ~i~s~~v~h~s~P~~~~~vRv~~~~s~~~I~~~~~~~~~~~~~~~~~~~~~p~~~iP~~~v~~~~~~gmp~~vkKm~~a~ 212 (219) T KOG2761 133 YIVSKSVQHPSYPPLKKKVRVTVYRSGWLIRVESRSGDEQGCACEYLYFHNPGGGIPKWVVKLAVRKGMPGAVKKMEKAL 212 (219) T ss_pred EEEEECCCCCCCCCCCCCEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEECCCCCCCHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 99981226887587677379999998889972322588986479999997899987399998898852738899999999 Q ss_pred HH Q ss_conf 99 Q gi|254780677|r 140 EE 141 (152) Q Consensus 140 ~~ 141 (152) .+ T Consensus 213 ~~ 214 (219) T KOG2761 213 LA 214 (219) T ss_pred HH T ss_conf 75 No 19 >pfam08982 DUF1857 Domain of unknown function (DUF1857). This domain has no known function. It is found in various hypothetical bacterial and fungal proteins. Probab=45.46 E-value=24 Score=16.22 Aligned_cols=87 Identities=14% Similarity=0.063 Sum_probs=53.2 Q ss_pred CHHHHHH-HHHHHHHCHHHCCCCCEEEEEECCCCCCCEEEEEEEEECCCEEEEEEEEEEECCCCCEEEEEECCCCCEEEE Q ss_conf 8799999-987574231018752057999726888841999999962312689998886157662089631245410211 Q gi|254780677|r 13 SSQQMLS-LVSDIERYPEFVPLCKKVVIHERDNYGENEVLVASMTINYACMQREFMTQVRINQKEHYIAVKHIKNLFNFL 91 (152) Q Consensus 13 ~~e~i~~-iV~Dve~YpeFlP~c~~~~v~~r~~~~~~~~~~a~l~v~~~~~~~~~~s~~~~~~~~~~i~~~~~~g~f~~~ 91 (152) ..+|+|. |+.-+++=..|+|++.+++|+++++ +.+.-++.++..- ...+++..++. .+.+....|..-.+ T Consensus 19 Tr~QlW~GL~lrar~p~~Fvp~i~~c~Vl~~~~----~~~~Rel~fg~~~----v~e~v~l~~~~-~v~f~~~~G~~lt~ 89 (147) T pfam08982 19 TRAQLWRGLVLRAREPQLFVPGLDSCEVLSESA----TELERELHFGGAK----IRDVVTLEPPS-RVHFAQADGGSLTN 89 (147) T ss_pred CHHHHHHHHHHHHCCHHHCCCCCCEEEEEECCC----CEEEEEEEECCCE----EEEEEEEECCC-EEEEEECCCCEEEE T ss_conf 899998778871388445435244278994379----8699999988964----68899970671-89998169977899 Q ss_pred EEEEEEEECCCCCEEEEEEEE Q ss_conf 379999987998189999999 Q gi|254780677|r 92 ENHWHFEEISESKCKVHFSIK 112 (152) Q Consensus 92 ~g~W~f~~~~~~~t~V~~~l~ 112 (152) .+++..+|.=-++|..+ T Consensus 90 ----~Ieep~~g~L~lrf~ye 106 (147) T pfam08982 90 ----TIEEPEPGALFLRFVYE 106 (147) T ss_pred ----EECCCCCCEEEEEEEEE T ss_conf ----98128997189999997 No 20 >pfam02713 DUF220 Domain of unknown function DUF220. This is family consists of a region in several Arabidopsis thaliana hypothetical proteins none of which have any known function. The aligned region contains two cysteine residues. Probab=34.63 E-value=18 Score=17.07 Aligned_cols=18 Identities=17% Similarity=0.479 Sum_probs=15.3 Q ss_pred CCCCCEEEEEEEEEEEEC Q ss_conf 245410211379999987 Q gi|254780677|r 83 HIKNLFNFLENHWHFEEI 100 (152) Q Consensus 83 ~~~g~f~~~~g~W~f~~~ 100 (152) ...+.++.++|+|..+|+ T Consensus 25 ~km~FMK~FEG~wKVEPl 42 (74) T pfam02713 25 EKMMFMKVFEGSWKVEPL 42 (74) T ss_pred HHHHHHHEECCEEEEEEE T ss_conf 652224203341556656 No 21 >pfam04707 PRELI PRELI-like family. This family includes a conserved region found in the PRELI protein and yeast YLR168C gene MSF1 product. The function of this protein is unknown, though it is thought to be involved in intra-mitochondrial protein sorting. This region is also found in a number of other eukaryotic proteins. Probab=31.36 E-value=40 Score=14.81 Aligned_cols=124 Identities=10% Similarity=0.130 Sum_probs=62.4 Q ss_pred HHHHHCHH-HCCCCCEEEEEECCCCCCCEEEEEEEEECC-----------CEEEE-EEEEEEECCCCCEEEEEECCCCCE Q ss_conf 75742310-187520579997268888419999999623-----------12689-998886157662089631245410 Q gi|254780677|r 22 SDIERYPE-FVPLCKKVVIHERDNYGENEVLVASMTINY-----------ACMQR-EFMTQVRINQKEHYIAVKHIKNLF 88 (152) Q Consensus 22 ~Dve~Ype-FlP~c~~~~v~~r~~~~~~~~~~a~l~v~~-----------~~~~~-~~~s~~~~~~~~~~i~~~~~~g~f 88 (152) +-..+||+ +.|++.++.+++|.-+.++....-++..-. ++... ....+...|+.++......-.-.| T Consensus 6 A~w~kYPNP~~~hVls~Dvl~R~vd~~G~L~t~Rli~~~~~~P~w~~kl~g~~~~~y~~E~S~vD~~~r~m~~~t~Nlt~ 85 (157) T pfam04707 6 AFWRRYPNPYSPHVISVDVLERKVDDDGKLHTERLLTKQGRLPRWLKKLIGVADTVYVLEVSVVDPKKRTLTLETRNLTF 85 (157) T ss_pred HHHHHCCCCCCCCEEEEEEEEEEECCCCCEEEEEEEEECCCCHHHHHHHCCCCCCEEEEEEEEEECCCCEEEEEEEECCC T ss_conf 66845899878837764578889889982999998873067838999865998719999999995888989999891686 Q ss_pred E---EEEEEEEEEECC--CCCEEEEEEEEEEECCHHHH------HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2---113799999879--98189999999850887899------999999999999999999999997 Q gi|254780677|r 89 N---FLENHWHFEEIS--ESKCKVHFSIKYELKNRLFD------MMLKAIFDPSFLSFAKAFEERAHK 145 (152) Q Consensus 89 ~---~~~g~W~f~~~~--~~~t~V~~~l~fe~~~~~~~------~~~~~~~~~~~~~~i~aF~~Ra~~ 145 (152) . ...=.-++.+.+ ++.|..+=+..+...+.+.+ ...-.-|...+.+-.++|+-=.++ T Consensus 86 ~~~~~V~E~~~Y~~~pen~~~T~~~q~a~i~v~~~~~g~~~~iE~~~~~rf~~N~~kgr~g~e~vl~~ 153 (157) T pfam04707 86 SSRLSVDERCTYTPHPENPNWTCFDQEASISVKGSFFGFSSKVEKWSLKRFSQNAKKGREGMEYVLQE 153 (157) T ss_pred CCEEEEEEEEEEEECCCCCCCEEEEEEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 68889999999997999999589999999999833264799999999999999999999999999999 No 22 >KOG3849 consensus Probab=25.75 E-value=32 Score=15.41 Aligned_cols=19 Identities=32% Similarity=0.270 Sum_probs=16.0 Q ss_pred HHHHHHHHHHHHHHCCCCC Q ss_conf 9999999999997708999 Q gi|254780677|r 133 LSFAKAFEERAHKIYHLPS 151 (152) Q Consensus 133 ~~~i~aF~~Ra~~~yg~~~ 151 (152) -+--.||+||-+++.|+|| T Consensus 353 vSsfsafvKRERD~~GrPS 371 (386) T KOG3849 353 VSSFSAFVKRERDHAGRPS 371 (386) T ss_pred HHHHHHHHHHHHCCCCCCC T ss_conf 7788998866420368950 No 23 >pfam02974 Inh Protease inhibitor Inh. The Inh inhibitor is secreted into the periplasm where its presumed physiological function is to protect periplasmic proteins against the action of secreted proteases. A range of proteases including A, B and C from E. chrysanthemi, alkaline protease from Pseudomonas aeruginosa and the 50 kDa protease from Serratia marcescens are inhibited. Probab=24.95 E-value=52 Score=14.09 Aligned_cols=19 Identities=16% Similarity=0.728 Sum_probs=7.3 Q ss_pred EEEEEEEEECCCCCEEEEEE Q ss_conf 13799999879981899999 Q gi|254780677|r 91 LENHWHFEEISESKCKVHFS 110 (152) Q Consensus 91 ~~g~W~f~~~~~~~t~V~~~ 110 (152) +.|.|++.. +++.|+|.+. T Consensus 11 LaG~W~ls~-~~~~C~i~L~ 29 (98) T pfam02974 11 LAGQWQLSK-GGQVCDIELT 29 (98) T ss_pred HCEEEEEEC-CCCCEEEEEE T ss_conf 220678704-9982488931 No 24 >PRK05590 hypothetical protein; Provisional Probab=22.25 E-value=34 Score=15.27 Aligned_cols=32 Identities=16% Similarity=0.398 Sum_probs=21.7 Q ss_pred CCCCCEEEEEEEEEEEECCCCCEEEEEEEEEEE Q ss_conf 245410211379999987998189999999850 Q gi|254780677|r 83 HIKNLFNFLENHWHFEEISESKCKVHFSIKYEL 115 (152) Q Consensus 83 ~~~g~f~~~~g~W~f~~~~~~~t~V~~~l~fe~ 115 (152) -.+|.=..|...-.+..++++ |.|.+.+|+|- T Consensus 67 FLDGINdSLk~~~~le~~eed-t~v~l~iD~EK 98 (165) T PRK05590 67 FLDGINDSLKEPLDLEKLEED-TNISLDIDFEK 98 (165) T ss_pred ECCCCHHHHCCCCCHHHCCCC-CEEEEEECHHH T ss_conf 102635651898753330567-64888627999 No 25 >smart00136 LamNT Laminin N-terminal domain (domain VI). N-terminal domain of laminins and laminin-related protein such as Unc-6/ netrins. Probab=22.25 E-value=33 Score=15.33 Aligned_cols=17 Identities=12% Similarity=0.227 Sum_probs=10.0 Q ss_pred HHHHHHHHHHHHHCCCC Q ss_conf 99999999999770899 Q gi|254780677|r 134 SFAKAFEERAHKIYHLP 150 (152) Q Consensus 134 ~~i~aF~~Ra~~~yg~~ 150 (152) +.-+=|-..|.+.||.| T Consensus 127 ~p~qyfa~~C~~~f~~~ 143 (238) T smart00136 127 QPWQYFSSDCRRTFGRP 143 (238) T ss_pred HHHHHHHHHHHHHCCCC T ss_conf 11878476677763999 Done!