Query         psy7158
Match_columns 402
No_of_seqs    181 out of 675
Neff          6.6 
Searched_HMMs 46136
Date          Fri Aug 16 19:39:16 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy7158.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7158hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1162|consensus              100.0 8.2E-69 1.8E-73  557.8  27.2  355   27-393   127-560 (617)
  2 PF03124 EXS:  EXS family;  Int 100.0 9.4E-44   2E-48  356.2  15.9  226  163-391     2-311 (345)
  3 COG5409 EXS domain-containing  100.0 4.1E-29 8.9E-34  241.8   8.6  246  131-386     4-328 (384)
  4 PF03105 SPX:  SPX domain;  Int  99.1 1.8E-10 3.8E-15  110.3   5.7   58   20-77    217-275 (275)
  5 KOG1161|consensus               99.0 4.5E-10 9.7E-15  109.4   6.9   75   33-108   102-177 (310)
  6 COG5408 SPX domain-containing   98.1 3.3E-06 7.1E-11   82.4   5.4   51   25-75    239-290 (296)
  7 COG5036 SPX domain-containing   98.1 4.3E-06 9.4E-11   84.0   5.6   75   24-107    93-168 (509)
  8 PF06570 DUF1129:  Protein of u  86.2      29 0.00063   32.3  17.9   16  237-252   175-190 (206)
  9 KOG4031|consensus               79.9      13 0.00029   34.8   8.9   66    2-67    106-176 (216)
 10 COG0598 CorA Mg2+ and Co2+ tra  66.9      78  0.0017   31.5  11.7   34   28-61    177-211 (322)
 11 PF10669 Phage_Gp23:  Protein g  50.6      37 0.00079   28.2   4.9   22    8-30     49-70  (121)
 12 COG5522 Predicted integral mem  38.7 3.5E+02  0.0077   25.7  11.3   75  114-197   110-185 (236)
 13 PF11143 DUF2919:  Protein of u  34.9 1.9E+02  0.0042   25.7   7.5   59  127-188    53-112 (149)
 14 PF15038 Jiraiya:  Jiraiya       33.3   1E+02  0.0022   28.3   5.5   58  131-191   110-167 (175)
 15 PF03904 DUF334:  Domain of unk  31.1 4.3E+02  0.0093   25.4   9.4   18   50-67     95-112 (230)
 16 TIGR00383 corA magnesium Mg(2+  29.7 5.4E+02   0.012   25.1  13.3   32   28-59    172-204 (318)
 17 PF08367 M16C_assoc:  Peptidase  27.8      86  0.0019   29.9   4.4   45    1-45      1-46  (248)
 18 KOG4326|consensus               26.6      74  0.0016   24.9   2.9   31    6-36     43-74  (81)
 19 PF04644 Motilin_ghrelin:  Moti  25.9      56  0.0012   20.8   1.8   16   10-26      8-23  (28)
 20 PF09925 DUF2157:  Predicted me  25.8 4.4E+02  0.0095   22.8  12.1    7  224-230   107-113 (145)
 21 COG4062 MtrB Tetrahydromethano  24.3 1.7E+02  0.0036   24.5   4.7   70   77-148    25-98  (108)
 22 PF02268 TFIIA_gamma_N:  Transc  24.2   1E+02  0.0022   22.3   3.1   31   33-65     15-45  (49)
 23 COG5110 RPN1 26S proteasome re  23.2      50  0.0011   35.7   1.9   61    9-69     28-112 (881)
 24 PF01086 Clathrin_lg_ch:  Clath  23.0 1.4E+02   0.003   28.4   4.7   62    2-63    112-179 (225)
 25 PF07716 bZIP_2:  Basic region   22.2   2E+02  0.0043   20.7   4.4   31   10-40     21-51  (54)
 26 TIGR02894 DNA_bind_RsfA transc  21.8 1.7E+02  0.0036   26.6   4.7   27   17-43    121-147 (161)
 27 PF06160 EzrA:  Septation ring   21.1 5.5E+02   0.012   27.8   9.4   91   16-117   103-198 (560)
 28 PF10669 Phage_Gp23:  Protein g  20.3   4E+02  0.0086   22.2   6.1   10   11-20     45-54  (121)
 29 PF04930 FUN14:  FUN14 family;   20.1      79  0.0017   26.0   2.2   34  104-138    66-99  (100)

No 1  
>KOG1162|consensus
Probab=100.00  E-value=8.2e-69  Score=557.82  Aligned_cols=355  Identities=28%  Similarity=0.425  Sum_probs=299.1

Q ss_pred             HHHHHHHHHHHHHHHhhhh-hhchHHHHHHHHHhhccccccccchHHHHhccccCCcccchhHHHHHHHHHHHHHHhhcc
Q psy7158          27 EEEEEISMRLVHSLINVSN-AAISLTDLGFFASHVKTLLNVDVGAKWRAEHVDVSHFYVNKDINHLISETEATVTSELEE  105 (402)
Q Consensus        27 ~~~~~~~~e~~~~L~~L~~-~~LN~tgF~KILKKyDK~~~~~~~~~~~~~~v~~s~F~~~~~l~~li~~ve~lYa~~f~~  105 (402)
                      +....++.|+|+.|.+||+ ++||.|||+||+|||||+++++. ..| .+.|++++|.++|++++++.+||++|+++|++
T Consensus       127 ~~lk~af~Efy~~L~llk~y~~lN~~~f~KI~KKyDK~~~~~~-~~~-~~~v~~s~f~~~~~i~~l~~~Ve~~f~~~fan  204 (617)
T KOG1162|consen  127 EKLKLAFSEFYLKLRLLKNYQFLNVTAFRKILKKYDKITSRDA-KRY-VKMVDKSYFTSSDEITRLMLEVEETFTKHFAN  204 (617)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccch-HHH-HHHHHHHhcccHHHHHHHHHHHHHHHHHHHhC
Confidence            3445689999999999997 99999999999999999999998 787 88999999999999999999999999999999


Q ss_pred             CcHHHHHhhcCCCCCCCCCCC--chhhHHHHHHHHHHHHHHHHHHHhhccCCchh-hhHHHhhhhhHHHHHHHHHHHHHH
Q psy7158         106 GDRQKAMKRLRVPPLGEQQSP--WTTFKVGLFLGCFAILFAIILLRVFFDTSQED-FKMAMRLYRGPLLIIQFIFLMGVN  182 (402)
Q Consensus       106 gdrk~A~~~LR~~~~~~~~~~--~~~F~~GlflG~s~~lli~~li~~~~~~~~~~-~~~~~~i~r~~~l~~l~~~l~g~n  182 (402)
                      |||++||+.||.+ .++++.+  ...|.+|.++|+++++.+.+-+..+..++++. ...++|+|+ +++++++++++|+|
T Consensus       205 ~nr~~~m~~lr~~-~~e~h~~~~~~~f~~g~~~~l~val~~~~~~~~~~~~~~~~~~~~~~~l~~-~~~v~l~~fl~~~n  282 (617)
T KOG1162|consen  205 GNRRKAMKVLRPK-LKEKHRPTFSTGFFVGCGIGLSVALVALIYLRNILQSEQRFYMETMFPLYG-FGLVVLHKFLYNVN  282 (617)
T ss_pred             CChhHhhhhcCCc-ccccCCCchhHHHHHHHHHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHH-HHHHHHHHHHhcCc
Confidence            9999999999998 4443322  44555555555555555444444444333222 455777777 99999999999999


Q ss_pred             HHHHhhcCCCceEEEeeCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccchhHHHHHHHHHHHHcCCcccch
Q psy7158         183 VYGWRSSGVNHVLIFELDPRNHLSEQHLMEISLIMGNVWALSLLCFFFSHVFSIPPYSQPVALLLIMLAFLLNPSRTLKY  262 (402)
Q Consensus       183 ~~iw~~~~INy~~Ife~d~r~~~~~~~~~~~a~~~~~~~~lsl~~~~~~~~~~~~~~~~Pl~~~~~~~~~l~~P~~i~~~  262 (402)
                      +++|+++||||++|||+|+++++.+++++++++..+++|+++++.++.....+.++++.|+++++++++++++|++++|+
T Consensus       283 iy~W~~~rVNy~fIf~~~~~~~l~~~~~l~i~~~~~~~~~l~~l~~l~~~~~~~~~~~~Pl~ll~~~~~~L~~Pf~~fY~  362 (617)
T KOG1162|consen  283 IYEWSRTRVNYKFIFEFDQRTELGYRDILLIHNTNGILEFLPVLKNLDMSMSGQTTELSPLILLLLFFFLLVCPFNTFYR  362 (617)
T ss_pred             hHHHHHhcCCceeeecCCccccccHHHHHHHHHHHHHHHHHHHHHHhccccCCCCcccchHHHHHHHHheeeccchhhhH
Confidence            99999999999999999999999999999999999999999999998876666678899999999999999999999999


Q ss_pred             hhhhhHhhhc------cc----------c--hhh---h------hhhhhcC-------------------------C---
Q psy7158         263 DARFWLLRVT------PV----------A--LLL---I------MLAFLLN-------------------------P---  287 (402)
Q Consensus       263 ~~R~~~lr~l------Pf----------g--L~s---s------~~C~y~~-------------------------P---  287 (402)
                      ++|+|+++++      |+          |  ++|   +      ++|+|..                         |   
T Consensus       363 sSRf~ll~~l~rvi~spl~~V~~~DFfl~Dql~S~v~a~~~l~~~~C~y~~~~~~~~~~~~~~~~~~~~~~iva~lP~~~  442 (617)
T KOG1162|consen  363 SSRFWLLKRLFRVLSSPLYKVLFVDFFLADQLTSLVLALRDLEFFICYYGTGDFQARRTCYCKDDYIEFQSIVAVLPYWF  442 (617)
T ss_pred             hhHHHHHHHHHHHHhccceeeccccccHHHHHHHHHHHHHhHHHhheeecccccccccccccccchHHHHHHHHHHHHHH
Confidence            9999999776      55          2  333   2      8888871                         1   


Q ss_pred             ----------------cc--chhhhhHHHHHHHH--hhccCCCCCCchhHHHHHHHHHhhhhhhhhhhhccccCCCCCCC
Q psy7158         288 ----------------SR--TLKYDARFWLLRVT--VKKDQSVMDNPFFYPWLAASVISSIYSYIWDIKMDWGLFDKNAP  347 (402)
Q Consensus       288 ----------------~l--a~KY~~~~~~l~~~--~~~~~~~~~~~~~~~wi~~~~i~S~Ys~~WDv~mDWgL~~~~~~  347 (402)
                                      ++  |+||++++++.++.  ++.+.+  .+.++++|++++.+||+|+++||++|||||++++  
T Consensus       443 RfLQClRR~~d~~~~~hL~NAlKY~~~i~~v~~~~~y~~~~~--~~~~~~l~~~~s~vaS~y~~~WDvv~DWgLl~~~--  518 (617)
T KOG1162|consen  443 RFLQCLRRYRDEKAFPHLLNALKYSTTILAVMLTTLYRILPG--SSLWFALWILSSKVASLYTTYWDVVMDWGLLNRK--  518 (617)
T ss_pred             HHHHHHHHHHhccchHhHHHHHHHHHHHHHHHHHHHHHHcCC--cchHHHHHHHHHHHHHHHHHHHHHheeccccccc--
Confidence                            23  99999999995444  333321  1589999999999999999999999999999988  


Q ss_pred             CCCCCCCccccccCCccchhhhhhhhhccccccchhhhhchHHHhh
Q psy7158         348 PDNPFLREETVYSSTEMKALLYQAMEEAPSLETTEPEVISRYFRMF  393 (402)
Q Consensus       348 ~~~~~LR~~l~y~~~~~~~~Yy~Aiv~n~ilR~~W~~~l~~~~~~f  393 (402)
                      ++ |+||++++||+   |++||+||+.|+++|++|++.+...++..
T Consensus       519 ~~-~~lRd~l~~p~---k~vYy~aiv~N~vLR~aW~~~~i~~~~~~  560 (617)
T KOG1162|consen  519 SK-PWLRDNLLYPQ---KWVYYSAIVLNFVLRFAWFFKTILVFHEE  560 (617)
T ss_pred             Cc-hhhHHHhcccc---hheehhHHHHHHHHHHHHHHHHHHHHHHh
Confidence            45 99999999999   89999999999999999988877666543


No 2  
>PF03124 EXS:  EXS family;  InterPro: IPR004342 The EXS domain is named after ERD1/XPR1/SYG1 and proteins containing this motif include the C-terminal of the SYG1 G-protein associated signal transduction protein from Saccharomyces cerevisiae, and sequences that are thought to be Murine leukemia virus (MLV) receptors (XPR1. The N-terminal of these proteins often have an SPX domain (IPR004331 from INTERPRO) []. While the N-terminal is thought to be involved in signal transduction, the role of the C-terminal is not known. This region of similarity contains several predicted transmembrane helices. This family also includes the ERD1 (ERD: ER retention defective) S. cerevisiae proteins. ERD1 proteins are involved in the localization of endogenous endoplasmic reticulum (ER) proteins. Erd1 null mutants secrete such proteins even though they possess the C-terminal HDEL ER lumen localization label sequence. In addition, null mutants also exhibit defects in the Golgi-dependent processing of several glycoproteins, which led to the suggestion that the sorting of luminal ER proteins actually occurs in the Golgi, with subsequent return of these proteins to the ER via `salvage' vesicles [].; GO: 0016021 integral to membrane
Probab=100.00  E-value=9.4e-44  Score=356.21  Aligned_cols=226  Identities=33%  Similarity=0.553  Sum_probs=192.2

Q ss_pred             HhhhhhHHHHHHHHHHHHHHHHHHhhcCCCceEEEeeCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCC--C---
Q psy7158         163 MRLYRGPLLIIQFIFLMGVNVYGWRSSGVNHVLIFELDPRNHLSEQHLMEISLIMGNVWALSLLCFFFSHVFSI--P---  237 (402)
Q Consensus       163 ~~i~r~~~l~~l~~~l~g~n~~iw~~~~INy~~Ife~d~r~~~~~~~~~~~a~~~~~~~~lsl~~~~~~~~~~~--~---  237 (402)
                      +|+||+++++++++|+||+|+++|+++||||++|||+||+++++++|+++++++++++|++++.+++.....+.  +   
T Consensus         2 ~~~~R~~~L~~l~~~l~~~nl~v~~~~~Iny~~If~~~~~~~~~~~~~~~~a~~~~~~~~l~~~~~l~~~~~~~~~~~~~   81 (345)
T PF03124_consen    2 PPPFRGLLLLILGLWLWGINLYVWSRYRINYVFIFELDPRNSLSYRQLFELASFLTIIWLLCFLIYLASVSPSIISFANW   81 (345)
T ss_pred             cchHHHHHHHHHHHHHHHHHHHHHHHcCcCHHHHhcCCcccCCcHHHHHhhhHHHHHHHHHHHHHHHHHHcCCcccccch
Confidence            57899999999999999999999999999999999999999999999999999999999999988886543322  1   


Q ss_pred             --ccchhHHHHHHHHHHHHcCCcccchhhhhhHhhhc------cc----------c--hhhh---------hhhhhc---
Q psy7158         238 --PYSQPVALLLIMLAFLLNPSRTLKYDARFWLLRVT------PV----------A--LLLI---------MLAFLL---  285 (402)
Q Consensus       238 --~~~~Pl~~~~~~~~~l~~P~~i~~~~~R~~~lr~l------Pf----------g--L~ss---------~~C~y~---  285 (402)
                        .+++|+++++++++++++|++++++++|+|+++++      |+          |  |||.         ++|+|.   
T Consensus        82 ~~~~~~Pll~~~~~~~~l~~P~~~~~~~~R~~~~~~l~ril~~~~~~v~f~d~~laD~LtS~~~~l~D~~~~~c~~~~~~  161 (345)
T PF03124_consen   82 YFVEYIPLLLLLILLLLLFFPFNIFYRSSRRWFLRTLKRILLAPFYPVRFRDFFLADQLTSLSKVLGDLEFTICYYFSGS  161 (345)
T ss_pred             hhHHHhhHHHHHHHHHHhhcccchhhhhHHHHHHHHHHHHHhcccccchHHHHHHHHHHHhHHHHHHHHHHHHHHHhccc
Confidence              13689999999999999999999999999999887      44          1  5552         788776   


Q ss_pred             --C-------------------C--------------------cc--chhhhhHHHHH-HHHhhcc--CCCCCCchhHHH
Q psy7158         286 --N-------------------P--------------------SR--TLKYDARFWLL-RVTVKKD--QSVMDNPFFYPW  319 (402)
Q Consensus       286 --~-------------------P--------------------~l--a~KY~~~~~~l-~~~~~~~--~~~~~~~~~~~w  319 (402)
                        +                   |                    |+  |+||++|++++ ..+....  ++.+....+.+|
T Consensus       162 ~~~~~~~c~~~~~~~~~~~~~lP~~iR~~QClrry~~~~~~~~hL~Na~KY~~~~~v~~~~~~~~~~~~~~~~~~~~~~w  241 (345)
T PF03124_consen  162 FTSPSNQCGSSSWYIDPIVASLPYWIRFLQCLRRYRDTGDRFPHLFNALKYSSSIPVIILSALYRFYPSSDSSIWLFILW  241 (345)
T ss_pred             ccCCCCcCccccHhHHhHHHHHHHHHHHHHHHHHHHhCCCCchhHHHHHHHHHHHHHHHHHHHHHhcCCccchhhHHHHH
Confidence              1                   1                    12  99999999994 3333222  111223578999


Q ss_pred             HHHHHHhhhhhhhhhhhccccCCCCCCCCCCCCCCccccccCCccchhhhhhhhhccccccchhhhhch-HHH
Q psy7158         320 LAASVISSIYSYIWDIKMDWGLFDKNAPPDNPFLREETVYSSTEMKALLYQAMEEAPSLETTEPEVISR-YFR  391 (402)
Q Consensus       320 i~~~~i~S~Ys~~WDv~mDWgL~~~~~~~~~~~LR~~l~y~~~~~~~~Yy~Aiv~n~ilR~~W~~~l~~-~~~  391 (402)
                      ++++++||+||++|||+|||||+++++++++++||++++||+   |++||.||+.|+++|++|++++++ +..
T Consensus       242 ~~~~~i~s~Ys~~WDv~~DWgL~~~~~~~~~~~LR~~l~~~~---~~~Yy~ai~~n~ilRf~W~~~~~~~~~~  311 (345)
T PF03124_consen  242 IIFALINSLYSFYWDVKMDWGLFQPKKKSKNWLLRRRLLYPR---KWFYYFAIILNFILRFAWILTLSPPHFS  311 (345)
T ss_pred             HHHHHHHHHHHHHHHHHhcccccCCccccCCCCCccccccCC---cchhhhHHHHHHHHHHHHHHHHHHhhcc
Confidence            999999999999999999999998875578999999999998   899999999999999999999988 443


No 3  
>COG5409 EXS domain-containing protein [Signal transduction mechanisms]
Probab=99.95  E-value=4.1e-29  Score=241.79  Aligned_cols=246  Identities=22%  Similarity=0.219  Sum_probs=176.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhccCCchh-hhHHHhhhhhHHHHHHHHHHHHHHHHHHhhcCCCceEEEeeCCCCC----C
Q psy7158         131 KVGLFLGCFAILFAIILLRVFFDTSQED-FKMAMRLYRGPLLIIQFIFLMGVNVYGWRSSGVNHVLIFELDPRNH----L  205 (402)
Q Consensus       131 ~~GlflG~s~~lli~~li~~~~~~~~~~-~~~~~~i~r~~~l~~l~~~l~g~n~~iw~~~~INy~~Ife~d~r~~----~  205 (402)
                      ++|+-.|.+..+.+.+.......+.+.. ...++++|.+..+......++++|+++|.+.+|||++||+++.++.    .
T Consensus         4 vv~~~~g~~l~l~~~i~~l~~~~~~~~~~~~ill~lw~~~~l~~~~~~lf~v~~~i~~r~~inyr~i~~~e~~~~~a~~~   83 (384)
T COG5409           4 VVGLEKGVSLSLGLYIQNLLNVGEPQSFIVLILLALWGGWILVFFLAFLFDVSCYILTRTPINYRFIFLFEQLSSTARNF   83 (384)
T ss_pred             eeeeeccccccHHHHHHHheeCCchHHHHHHHHHHHHhHHHHHHHHHHHhcCceeeEEeccccchhhhhHhHhhhhhccc
Confidence            3566677777766655544222211111 4568999999999999999999999999999999999998886543    2


Q ss_pred             CHHHHHHHHHHHHHHHHHHHHHHHhhcc---CCC-CccchhHHHHHHHHHHHHcCCcccchhhhhhHhhhc---------
Q psy7158         206 SEQHLMEISLIMGNVWALSLLCFFFSHV---FSI-PPYSQPVALLLIMLAFLLNPSRTLKYDARFWLLRVT---------  272 (402)
Q Consensus       206 ~~~~~~~~a~~~~~~~~lsl~~~~~~~~---~~~-~~~~~Pl~~~~~~~~~l~~P~~i~~~~~R~~~lr~l---------  272 (402)
                      +.++-.  ++.-...|.+++..|.-...   +.+ -...+|+++.+....++++|+++.++.+|+|++.++         
T Consensus        84 ~~dfh~--~~i~~~~~~~slfiFl~~v~g~~~~l~~~~~~P~l~v~~vf~~ll~Pf~ii~y~sRr~Li~sliRv~l~~~~  161 (384)
T COG5409          84 NLDFHR--IIIPFHFFTTSLFIFLNAVEGLKFILLFVYFLPLLQVGTVFWFLLKPFQIIYYWSRRYLIESLIRVFLFGYS  161 (384)
T ss_pred             chhhHH--HhhhHHHHHHHHHHHHHHhhccccceehhhhccHHHHHHHHHHhhcccceeeecchhhHHHHHHHHHHhccc
Confidence            222211  33334445566666653322   222 246789999999999999999999999999998665         


Q ss_pred             cc-------c-----hhhh------hhhhhc----CC-------------------------------------cc--ch
Q psy7158         273 PV-------A-----LLLI------MLAFLL----NP-------------------------------------SR--TL  291 (402)
Q Consensus       273 Pf-------g-----L~ss------~~C~y~----~P-------------------------------------~l--a~  291 (402)
                      |+       |     +|++      ++|+|+    +|                                     ++  |+
T Consensus       162 ~v~f~dF~f~di~~SlTya~gdi~~FfCv~~~~~~~pLc~sshs~~~g~~~~lP~ivR~lQCLRry~ds~~~fphLlNAL  241 (384)
T COG5409         162 LVRFTDFFFGDILISLTYALGDIYIFFCVYSLLFREPLCKSSHSDLSGLAALLPVIVRFLQCLRRYRDSLHEFPHLLNAL  241 (384)
T ss_pred             eeeeeeeehhhHHHHHHHhhhhhhhHHHHHhhhccccccccCCcchHhHHHhhHHHHHHHHHHHHHHhhhhhhHHHHHHH
Confidence            22       2     4543      788887    12                                     23  99


Q ss_pred             hhhhHHHHHHHHhhccCCCCCCchhHHHHHHHHHhhhhhhhhhhhccccCCCCCCCCCCCCCCccccccCCccchhhhhh
Q psy7158         292 KYDARFWLLRVTVKKDQSVMDNPFFYPWLAASVISSIYSYIWDIKMDWGLFDKNAPPDNPFLREETVYSSTEMKALLYQA  371 (402)
Q Consensus       292 KY~~~~~~l~~~~~~~~~~~~~~~~~~wi~~~~i~S~Ys~~WDv~mDWgL~~~~~~~~~~~LR~~l~y~~~~~~~~Yy~A  371 (402)
                      ||++++|+.+.....+..+.+...+++|+.++.+||+||+.|||.||||+..+.. +.++-.|++++++       |+.|
T Consensus       242 KYs~~i~v~~~~~~~r~~~~~~~l~~l~~~~a~lnS~yT~~WDV~mDW~l~~~~~-~~~~~kr~~~~l~-------y~~a  313 (384)
T COG5409         242 KYSLNIPVLFCLWLYRVYEGEERLFHLQIWFALLNSIYTSFWDVFMDWSLDSLTS-LRSWSKRAVTLLK-------YHIA  313 (384)
T ss_pred             HHHHhhHHHHHHHHHHhccCcceeehHHHHHHHHHHHHHHhHHhheeeeeccccc-chhccccchhHHH-------HHHH
Confidence            9999999944332211111135678999999999999999999999999976653 4556677776655       8999


Q ss_pred             hhhccccccchhhhh
Q psy7158         372 MEEAPSLETTEPEVI  386 (402)
Q Consensus       372 iv~n~ilR~~W~~~l  386 (402)
                      |+.|+++|++|++..
T Consensus       314 ~~inFllR~~Wi~~~  328 (384)
T COG5409         314 MIINFLLRFSWIVYY  328 (384)
T ss_pred             HHHHHHHHHHHHHhh
Confidence            999999999999984


No 4  
>PF03105 SPX:  SPX domain;  InterPro: IPR004331 The SPX domain is named after SYG1/Pho81/XPR1 proteins. This 180 residue length domain is found at the amino terminus of a variety of proteins. In the yeast protein SYG1, the N terminus directly binds to the G- protein beta subunit and inhibits transduction of the mating pheromone signal [] suggesting that all the members of this family are involved in G-protein associated signal transduction. The C-terminal of these proteins often have an EXS domain (IPR004342 from INTERPRO) []. The N-termini of several proteins involved in the regulation of phosphate transport, including the putative phosphate level sensors PHO81 from Saccharomyces cerevisiae and NUC-2 from Neurospora crassa, are also members of this family [, ]. NUC-2 contains several ankyrin repeats (IPR002110 from INTERPRO). Several members of this family are the XPR1 proteins: the xenotropic and polytropic retrovirus receptor confers susceptibility to infection with Murine leukemia virus (MLV) []. The similarity between SYG1, phosphate regulators and XPR1 sequences has been previously noted, as has the additional similarity to several predicted proteins, of unknown function, from Drosophila melanogaster, Arabidopsis thaliana, Caenorhabditis elegans, Schizosaccharomyces pombe, and Saccharomyces cerevisiae [, ]. In addition, given the similarities between XPR1 and SYG1 and phosphate regulatory proteins, it has been proposed that XPR1 might be involved in G-protein associated signal transduction [, , ] and may itself function as a phosphate sensor []. 
Probab=99.05  E-value=1.8e-10  Score=110.29  Aligned_cols=58  Identities=10%  Similarity=0.123  Sum_probs=49.7

Q ss_pred             HHhhhhhHHHHHHHHHHHHHHHhhhh-hhchHHHHHHHHHhhccccccccchHHHHhcc
Q psy7158          20 EKKNKKGEEEEEISMRLVHSLINVSN-AAISLTDLGFFASHVKTLLNVDVGAKWRAEHV   77 (402)
Q Consensus        20 ~~~~~~~~~~~~~~~e~~~~L~~L~~-~~LN~tgF~KILKKyDK~~~~~~~~~~~~~~v   77 (402)
                      ..+.....+..+++.|+|+.|.+|++ ++||+|||+||||||||+++++.++.||.+.|
T Consensus       217 ~~~~~~~~~l~~~~~e~y~~l~~Lk~f~~LN~taf~KIlKK~DK~~~~~~~~~y~~~v~  275 (275)
T PF03105_consen  217 KSLKKARKQLKKAFIELYRELELLKSFVELNRTAFRKILKKYDKVTGTSLSDDYMEEVV  275 (275)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchHhhhhccC
Confidence            34444555667889999999999996 99999999999999999999999999887654


No 5  
>KOG1161|consensus
Probab=99.02  E-value=4.5e-10  Score=109.42  Aligned_cols=75  Identities=5%  Similarity=0.051  Sum_probs=61.9

Q ss_pred             HHHHHHHHHhhh-hhhchHHHHHHHHHhhccccccccchHHHHhccccCCcccchhHHHHHHHHHHHHHHhhccCcH
Q psy7158          33 SMRLVHSLINVS-NAAISLTDLGFFASHVKTLLNVDVGAKWRAEHVDVSHFYVNKDINHLISETEATVTSELEEGDR  108 (402)
Q Consensus        33 ~~e~~~~L~~L~-~~~LN~tgF~KILKKyDK~~~~~~~~~~~~~~v~~s~F~~~~~l~~li~~ve~lYa~~f~~gdr  108 (402)
                      +.+.-..+.+|. |++||+|||+||||||||.++-... .++...|..+||++.+.++.++.+++.+|....+.++-
T Consensus       102 l~~~~~em~~L~~fs~LN~tGf~KILKK~DKrtg~~l~-~~f~~~l~~~Pf~~~e~~~~Lv~e~~~l~~~l~~~~~~  177 (310)
T KOG1161|consen  102 LVDFHGEMVLLENFSRLNYTGFAKILKKHDKRTGYRLR-PYFQVRLLHQPFFTTEQLFRLVYEISILLDLLRPSNRN  177 (310)
T ss_pred             HHHHHHHHHHHHHHHHHhHHHHHHHHHHHhcccccccc-cHHHHHHHhCCCchhhhHHHHHHHHHHHHHHhcccccc
Confidence            444445555566 4999999999999999999996554 55788899999999999999999999999988776544


No 6  
>COG5408 SPX domain-containing protein [Signal transduction mechanisms]
Probab=98.12  E-value=3.3e-06  Score=82.39  Aligned_cols=51  Identities=10%  Similarity=0.134  Sum_probs=44.7

Q ss_pred             hhHHHHHHHHHHHHHHHhhhh-hhchHHHHHHHHHhhccccccccchHHHHh
Q psy7158          25 KGEEEEEISMRLVHSLINVSN-AAISLTDLGFFASHVKTLLNVDVGAKWRAE   75 (402)
Q Consensus        25 ~~~~~~~~~~e~~~~L~~L~~-~~LN~tgF~KILKKyDK~~~~~~~~~~~~~   75 (402)
                      +.....+.++++|+.|..++. ++||+|||+||+|||||+++++....||..
T Consensus       239 ~R~~lkk~ii~~y~~l~~lksf~eLN~tGf~Ki~KK~DK~l~~~~~~~~~s~  290 (296)
T COG5408         239 KRSLLKKRIIELYIQLHQLKSFIELNYTGFSKITKKYDKTLHQNLRHEYMSR  290 (296)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHH
Confidence            445567789999999999995 999999999999999999999988887644


No 7  
>COG5036 SPX domain-containing protein involved in vacuolar polyphosphate accumulation [Inorganic ion transport and metabolism]
Probab=98.08  E-value=4.3e-06  Score=84.05  Aligned_cols=75  Identities=12%  Similarity=0.092  Sum_probs=58.5

Q ss_pred             hhhHHHHHHHHHHHHHHHhhh-hhhchHHHHHHHHHhhccccccccchHHHHhccccCCcccchhHHHHHHHHHHHHHHh
Q psy7158          24 KKGEEEEEISMRLVHSLINVS-NAAISLTDLGFFASHVKTLLNVDVGAKWRAEHVDVSHFYVNKDINHLISETEATVTSE  102 (402)
Q Consensus        24 ~~~~~~~~~~~e~~~~L~~L~-~~~LN~tgF~KILKKyDK~~~~~~~~~~~~~~v~~s~F~~~~~l~~li~~ve~lYa~~  102 (402)
                      +.+|||+.    -+|.   |. +..||+|||.||+|||||-++....+- +.-.++..|| .+.+.+.|+-++-.+|.-.
T Consensus        93 ~~LeE~L~----~v~~---l~kF~RLN~tGF~KIvKKHDK~~~y~lkpv-fqvrLk~~p~-~se~yd~Ll~kis~LY~~l  163 (509)
T COG5036          93 KILEEELD----TVHD---LAKFSRLNFTGFKKIVKKHDKHTGYSLKPV-FQVRLKAKPF-FSEQYDPLLYKISSLYNIL  163 (509)
T ss_pred             hHHHHHHH----HHHH---HHhhhhhhhHHHHHHHHhhcCCCCceechh-HHHHhccCCc-chhhhcHHHHHHHHHHHHH
Confidence            35566644    3344   44 499999999999999999999877766 4667788888 5678999999999999776


Q ss_pred             hccCc
Q psy7158         103 LEEGD  107 (402)
Q Consensus       103 f~~gd  107 (402)
                      -++++
T Consensus       164 R~~~~  168 (509)
T COG5036         164 RSSLS  168 (509)
T ss_pred             HhcCC
Confidence            66665


No 8  
>PF06570 DUF1129:  Protein of unknown function (DUF1129);  InterPro: IPR009214 There are currently no experimental data for members of this group or their homologues. However, these proteins contain predicted integral membrane proteins (with several transmembrane segments).
Probab=86.16  E-value=29  Score=32.32  Aligned_cols=16  Identities=19%  Similarity=0.276  Sum_probs=7.9

Q ss_pred             CccchhHHHHHHHHHH
Q psy7158         237 PPYSQPVALLLIMLAF  252 (402)
Q Consensus       237 ~~~~~Pl~~~~~~~~~  252 (402)
                      .|.+-|++.+++.++.
T Consensus       175 np~l~~~~~iiig~i~  190 (206)
T PF06570_consen  175 NPVLPPWVYIIIGVIA  190 (206)
T ss_pred             CcCCCHHHHHHHHHHH
Confidence            3445455555444444


No 9  
>KOG4031|consensus
Probab=79.87  E-value=13  Score=34.83  Aligned_cols=66  Identities=18%  Similarity=0.091  Sum_probs=38.0

Q ss_pred             ccccccHHHHhhhhHHHHHHhhhhhHHH----HHHHHHHHHHHHhhhh-hhchHHHHHHHHHhhccccccc
Q psy7158           2 EPLLKWKEEKKKMEEEEEEKKNKKGEEE----EEISMRLVHSLINVSN-AAISLTDLGFFASHVKTLLNVD   67 (402)
Q Consensus         2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~e~~~~L~~L~~-~~LN~tgF~KILKKyDK~~~~~   67 (402)
                      |++-|||||.++.-||+-++--++++|.    ...|-++|+..+.-.. ...|-.+=.--++|.++..+..
T Consensus       106 E~IRkWkeeQ~~rl~ekD~~sek~k~ElrekAkKelddwy~~~~ek~~k~~~~nk~eee~~~~e~~~~~~g  176 (216)
T KOG4031|consen  106 EKIRKWKEEQMKRLQEKDEASEKLKEELREKAKKELDDWYDQQNEKLEKTKANNKAEEEALVKENEEFSPG  176 (216)
T ss_pred             HHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHhhcccCCC
Confidence            6788999999887666544333222222    3456677776554322 3344334444567777766554


No 10 
>COG0598 CorA Mg2+ and Co2+ transporters [Inorganic ion transport and metabolism]
Probab=66.91  E-value=78  Score=31.54  Aligned_cols=34  Identities=26%  Similarity=0.017  Sum_probs=26.6

Q ss_pred             HHHHHHHHHHHHHHhhhh-hhchHHHHHHHHHhhc
Q psy7158          28 EEEEISMRLVHSLINVSN-AAISLTDLGFFASHVK   61 (402)
Q Consensus        28 ~~~~~~~e~~~~L~~L~~-~~LN~tgF~KILKKyD   61 (402)
                      ++++.+.++-+.+.-++. ..-+..++.+..+...
T Consensus       177 ~~l~~l~~l~~~l~~lr~~l~~~~~~l~~l~~~~~  211 (322)
T COG0598         177 EELERLGELRRSLVYLRRALAPLRDVLLRLARRPL  211 (322)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHhcCc
Confidence            777888888888888886 6666778888777764


No 11 
>PF10669 Phage_Gp23:  Protein gp23 (Bacteriophage A118);  InterPro: IPR018926  This entry is represented by the major tail subunit protein, Gp23 of Listeria phage A118 and prophage found in Bacilli. The function is currently unknown. 
Probab=50.57  E-value=37  Score=28.18  Aligned_cols=22  Identities=41%  Similarity=0.684  Sum_probs=10.8

Q ss_pred             HHHHhhhhHHHHHHhhhhhHHHH
Q psy7158           8 KEEKKKMEEEEEEKKNKKGEEEE   30 (402)
Q Consensus         8 ~~~~~~~~~~~~~~~~~~~~~~~   30 (402)
                      |||..|.|+|| +||+++-++|.
T Consensus        49 KEER~K~E~~~-q~r~rES~~Er   70 (121)
T PF10669_consen   49 KEERSKKEEKR-QKRNRESKRER   70 (121)
T ss_pred             HHHHHHHHHHH-HHHhhhhHHHH
Confidence            55555554443 34455555553


No 12 
>COG5522 Predicted integral membrane protein [Function unknown]
Probab=38.66  E-value=3.5e+02  Score=25.71  Aligned_cols=75  Identities=21%  Similarity=0.288  Sum_probs=49.1

Q ss_pred             hcCCCCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHhhccCCchhhhHHHhhhhhHHHHHHHHHHHHHHHHHHhh-cCCC
Q psy7158         114 RLRVPPLGEQQSPWTTFKVGLFLGCFAILFAIILLRVFFDTSQEDFKMAMRLYRGPLLIIQFIFLMGVNVYGWRS-SGVN  192 (402)
Q Consensus       114 ~LR~~~~~~~~~~~~~F~~GlflG~s~~lli~~li~~~~~~~~~~~~~~~~i~r~~~l~~l~~~l~g~n~~iw~~-~~IN  192 (402)
                      .+-.|..+..+.++..+..=+++-+++.+.+...+. ..+..+.        =||.-.-++....+|+|++.|++ .|-|
T Consensus       110 AlltPDl~~~~~p~l~~~lffitH~svfls~v~~~v-hfreRpg--------ksgl~~svl~~~~lg~~~lfinrrLGtN  180 (236)
T COG5522         110 ALLTPDLQYLQVPWLEFLLFFITHISVFLSAVILIV-HFRERPG--------KSGLVMSVLVAISLGIMCLFINRRLGTN  180 (236)
T ss_pred             HHHcCccccccchHHHHHHHHHHHHHHHHHHHHHHH-HhccCCC--------ccchhHHHHHHHHHHHHHHHHHHHhcCc
Confidence            344555556666777777777777777766544433 3332221        24455677788889999999987 8999


Q ss_pred             ceEEE
Q psy7158         193 HVLIF  197 (402)
Q Consensus       193 y~~If  197 (402)
                      |-++=
T Consensus       181 Ylyls  185 (236)
T COG5522         181 YLYLS  185 (236)
T ss_pred             eeEee
Confidence            99864


No 13 
>PF11143 DUF2919:  Protein of unknown function (DUF2919);  InterPro: IPR021318  This bacterial family of proteins has no known function. Some members are annotated as YfeZ however this cannot be confirmed. 
Probab=34.91  E-value=1.9e+02  Score=25.74  Aligned_cols=59  Identities=25%  Similarity=0.227  Sum_probs=30.4

Q ss_pred             chhhHHHHHHHHHHHHHHHHHHHhhccCCchhhhHHHhhhhhHHHHHHHHHHHHHHHH-HHhh
Q psy7158         127 WTTFKVGLFLGCFAILFAIILLRVFFDTSQEDFKMAMRLYRGPLLIIQFIFLMGVNVY-GWRS  188 (402)
Q Consensus       127 ~~~F~~GlflG~s~~lli~~li~~~~~~~~~~~~~~~~i~r~~~l~~l~~~l~g~n~~-iw~~  188 (402)
                      ...|..|+.+|+-.++..  .+....+..+.-...+.+--|.+.+..+ +.-.+..++ .|+.
T Consensus        53 ~~~f~lgL~~g~Pall~~--~l~~~R~~~~~~~~~lwr~~r~lLl~~~-~~~l~~~~~~l~~~  112 (149)
T PF11143_consen   53 HSDFYLGLAAGLPALLLM--LLSGRRHRSPRWLNRLWRAGRWLLLLSL-LVDLVWQLYHLWLQ  112 (149)
T ss_pred             chHHHHHHHHhHHHHHHH--HHHccCCCCchHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHc
Confidence            688999999998887765  2222222222223335555554433333 333344444 5443


No 14 
>PF15038 Jiraiya:  Jiraiya
Probab=33.30  E-value=1e+02  Score=28.33  Aligned_cols=58  Identities=19%  Similarity=0.173  Sum_probs=35.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHhhccCCchhhhHHHhhhhhHHHHHHHHHHHHHHHHHHhhcCC
Q psy7158         131 KVGLFLGCFAILFAIILLRVFFDTSQEDFKMAMRLYRGPLLIIQFIFLMGVNVYGWRSSGV  191 (402)
Q Consensus       131 ~~GlflG~s~~lli~~li~~~~~~~~~~~~~~~~i~r~~~l~~l~~~l~g~n~~iw~~~~I  191 (402)
                      .+++++|+.+.+....++..+.-...+  ...--+..|.+.+++..++ ..|++.|+|..=
T Consensus       110 ~~~ff~sI~vfl~~l~ly~~l~f~~~~--~~~~s~ilG~g~vfl~~~~-vh~l~~w~r~~~  167 (175)
T PF15038_consen  110 VGLFFCSISVFLAALILYMLLQFHSEP--GIATSIILGSGAVFLGAAM-VHNLYRWQRETR  167 (175)
T ss_pred             hhhHHhhhhHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcc
Confidence            467788888888777766543221111  1123345566666665555 899999998653


No 15 
>PF03904 DUF334:  Domain of unknown function (DUF334);  InterPro: IPR005602 This is a family of proteins found in Staphylococcus aureus plasmid with no characterised function.
Probab=31.11  E-value=4.3e+02  Score=25.35  Aligned_cols=18  Identities=11%  Similarity=0.021  Sum_probs=10.3

Q ss_pred             HHHHHHHHHhhccccccc
Q psy7158          50 LTDLGFFASHVKTLLNVD   67 (402)
Q Consensus        50 ~tgF~KILKKyDK~~~~~   67 (402)
                      ..+|.++||+.=|.++++
T Consensus        95 ~~dF~~~Lq~~Lk~V~td  112 (230)
T PF03904_consen   95 HNDFQDILQDELKDVDTD  112 (230)
T ss_pred             HHHHHHHHHHHHHhhchH
Confidence            456666666665555443


No 16 
>TIGR00383 corA magnesium Mg(2+) and cobalt Co(2+) transport protein (corA). The article in Microb Comp Genomics 1998;3(3):151-69 (Medline:98448512) discusses this family and suggests that some members may have functions other than Mg2+ transport.
Probab=29.65  E-value=5.4e+02  Score=25.09  Aligned_cols=32  Identities=13%  Similarity=0.046  Sum_probs=20.8

Q ss_pred             HHHHHHHHHHHHHHhhhh-hhchHHHHHHHHHh
Q psy7158          28 EEEEISMRLVHSLINVSN-AAISLTDLGFFASH   59 (402)
Q Consensus        28 ~~~~~~~e~~~~L~~L~~-~~LN~tgF~KILKK   59 (402)
                      +..+.+.+.-+.+..++. ..-|...+.++.+.
T Consensus       172 ~~l~~l~~l~~~l~~l~~~l~~~~~vl~~l~~~  204 (318)
T TIGR00383       172 TLMDEILSLRTELLALRRSLWPLRDVLNFLLRK  204 (318)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC
Confidence            344556666677777775 67777777776543


No 17 
>PF08367 M16C_assoc:  Peptidase M16C associated;  InterPro: IPR013578 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain appears in eukaryotes as well as bacteria and tends to be found near the C terminus of metalloproteases and related sequences belonging to MEROPS peptidase family M16 (subfamily M16C, clan ME). These include: eupitrilysin, falcilysin, PreP peptidase, CYM1 peptidase and subfamily M16C non-peptidase homologues.; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 2FGE_B 3S5I_A 3S5H_A 3S5M_A 3S5K_A.
Probab=27.77  E-value=86  Score=29.90  Aligned_cols=45  Identities=22%  Similarity=0.209  Sum_probs=33.9

Q ss_pred             CccccccHHHHhhhhHHHHHHhhhhh-HHHHHHHHHHHHHHHhhhh
Q psy7158           1 MEPLLKWKEEKKKMEEEEEEKKNKKG-EEEEEISMRLVHSLINVSN   45 (402)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~e~~~~L~~L~~   45 (402)
                      |.|--.+.+++++.|+++.+++.+++ ++|.+.+.+....|...|.
T Consensus         1 m~P~~~~~~~~~~~e~~~L~~~k~~Ls~~e~~~i~~~~~~L~~~Q~   46 (248)
T PF08367_consen    1 MKPDPGLSEKQEEEEKEKLAAYKASLSEEEKEKIIEQTKELKERQE   46 (248)
T ss_dssp             EEEETTHHHHHHHHHHHHHHHHHHCS-HHHHHHHHHHHHHHHHHHC
T ss_pred             CCCCchHHHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHhC
Confidence            56777788888888877777766665 7788888888888777665


No 18 
>KOG4326|consensus
Probab=26.62  E-value=74  Score=24.93  Aligned_cols=31  Identities=19%  Similarity=0.278  Sum_probs=17.3

Q ss_pred             ccH-HHHhhhhHHHHHHhhhhhHHHHHHHHHH
Q psy7158           6 KWK-EEKKKMEEEEEEKKNKKGEEEEEISMRL   36 (402)
Q Consensus         6 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~e~   36 (402)
                      .|. +||++...|++++|.+-++.|.+.+.|.
T Consensus        43 ei~a~eKav~da~~a~ekKr~a~~eaR~Lae~   74 (81)
T KOG4326|consen   43 EIDAHEKAVADAEEAAEKKRWAKDEARYLAEV   74 (81)
T ss_pred             HHHHHHHHHHhHHHHHHHHhhHHHHHHHHHHh
Confidence            354 5566655555555544466666666554


No 19 
>PF04644 Motilin_ghrelin:  Motilin/ghrelin;  InterPro: IPR006738 Motilin is a gastrointestinal regulatory polypeptide produced by motilin cells in the duodenal epithelium. It is released into the general circulation at about 100-min intervals during the inter-digestive state and is the most important factor in controlling the inter-digestive migrating contractions. Motilin also stimulates endogenous release of the endocrine pancreas [].  This domain is also found in ghrelin, a growth hormone secretagogue synthesised by endocrine cells in the stomach. Ghrelin stimulates growth hormone secretagogue receptors in the pituitary. These receptors are distinct from the growth hormone-releasing hormone receptors, and thus provide a means of controlling pituitary growth hormone release by the gastrointestinal system [].; GO: 0005179 hormone activity, 0005576 extracellular region; PDB: 1LBJ_A.
Probab=25.92  E-value=56  Score=20.85  Aligned_cols=16  Identities=38%  Similarity=0.650  Sum_probs=9.2

Q ss_pred             HHhhhhHHHHHHhhhhh
Q psy7158          10 EKKKMEEEEEEKKNKKG   26 (402)
Q Consensus        10 ~~~~~~~~~~~~~~~~~   26 (402)
                      |-.|| ||+|++|-.++
T Consensus         8 e~qr~-QekE~nk~~kK   23 (28)
T PF04644_consen    8 EHQRM-QEKERNKGQKK   23 (28)
T ss_dssp             HHHHH-HHHHHHHH---
T ss_pred             HHHHH-HHHHhccCccc
Confidence            55677 77777775543


No 20 
>PF09925 DUF2157:  Predicted membrane protein (DUF2157);  InterPro: IPR018677 This family of various hypothetical prokaryotic proteins has no known function.
Probab=25.76  E-value=4.4e+02  Score=22.82  Aligned_cols=7  Identities=14%  Similarity=0.449  Sum_probs=2.8

Q ss_pred             HHHHHHh
Q psy7158         224 SLLCFFF  230 (402)
Q Consensus       224 sl~~~~~  230 (402)
                      .+++-.+
T Consensus       107 aligQ~y  113 (145)
T PF09925_consen  107 ALIGQIY  113 (145)
T ss_pred             HHHHhHh
Confidence            3344433


No 21 
>COG4062 MtrB Tetrahydromethanopterin S-methyltransferase, subunit B [Coenzyme metabolism]
Probab=24.27  E-value=1.7e+02  Score=24.49  Aligned_cols=70  Identities=19%  Similarity=0.121  Sum_probs=37.6

Q ss_pred             cccCCcccchhHHHHHHHHHHHHHHhhccCcHHHHHhhcCCCCCCCCCCC----chhhHHHHHHHHHHHHHHHHHH
Q psy7158          77 VDVSHFYVNKDINHLISETEATVTSELEEGDRQKAMKRLRVPPLGEQQSP----WTTFKVGLFLGCFAILFAIILL  148 (402)
Q Consensus        77 v~~s~F~~~~~l~~li~~ve~lYa~~f~~gdrk~A~~~LR~~~~~~~~~~----~~~F~~GlflG~s~~lli~~li  148 (402)
                      -+...|.+-+.+++-++++|.+-.+....=|...  --|.+-|.++..+.    ...+.-|+.+|..++.++++++
T Consensus        25 ~edvi~ldv~pi~Eqi~kLe~~vddl~~sldPst--p~lnS~PgRegv~~~aG~~tna~yGfviGl~i~aLlAlil   98 (108)
T COG4062          25 FEDVIFLDVDPIEEQIKKLETLVDDLENSLDPST--PPLNSFPGREGVYATAGYLTNAFYGFVIGLGIMALLALIL   98 (108)
T ss_pred             cCceEEEeccHHHHHHHHHHHHHHHHHhccCCCC--CCcccCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3455677778888888888876554332222211  11111122233221    4556678888877766665554


No 22 
>PF02268 TFIIA_gamma_N:  Transcription initiation factor IIA, gamma subunit, helical domain;  InterPro: IPR015872 Transcription factor IIA (TFIIA) is one of several factors that form part of a transcription pre-initiation complex along with RNA polymerase II, the TATA-box-binding protein (TBP) and TBP-associated factors, on the TATA-box sequence upstream of the initiation start site. After initiation, some components of the pre-initiation complex (including TFIIA) remain attached and re-initiate a subsequent round of transcription. TFIIA binds to TBP to stabilise TBP binding to the TATA element. TFIIA also inhibits the cytokine HMGB1 (high mobility group 1 protein) binding to TBP [], and can dissociate HMGB1 already bound to TBP/TATA-box. Human and Drosophila TFIIA have three subunits: two large subunits, LN/alpha and LC/beta, derived from the same gene, and a small subunit, S/gamma. Yeast TFIIA has two subunits: a large TOA1 subunit that shows sequence similarity to the N-terminal of LN/alpha and the C-terminal of LC/beta, and a small subunit, TOA2 that is highly homologous with S/gamma. The conserved regions of the large and small subunits of TFIIA combine to form two domains: a four-helix bundle (helical domain) composed of two helices from each of the N-terminal regions of TOA1 and TOA2 in yeast; and a beta-barrel (beta-barrel domain) composed of beta-sheets from the C-terminal regions of TOA1 and TOA2 []. This entry represents the alpha-helical domain found at the N-terminal of the gamma subunit of transcription factor TFIIA.; GO: 0006367 transcription initiation from RNA polymerase II promoter, 0005672 transcription factor TFIIA complex; PDB: 1NVP_D 1RM1_B 1YTF_D 1NH2_D.
Probab=24.18  E-value=1e+02  Score=22.34  Aligned_cols=31  Identities=10%  Similarity=0.036  Sum_probs=21.3

Q ss_pred             HHHHHHHHHhhhhhhchHHHHHHHHHhhccccc
Q psy7158          33 SMRLVHSLINVSNAAISLTDLGFFASHVKTLLN   65 (402)
Q Consensus        33 ~~e~~~~L~~L~~~~LN~tgF~KILKKyDK~~~   65 (402)
                      +.+-+..+  ++...++-.=-.|||.+|||...
T Consensus        15 L~dtLDel--i~~~~I~p~La~kVL~~FDksi~   45 (49)
T PF02268_consen   15 LTDTLDEL--IQEGKITPQLAMKVLEQFDKSIN   45 (49)
T ss_dssp             HHHHHHHH--HHTTSS-HHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHH--HHcCCCCHHHHHHHHHHHHHHHH
Confidence            44444443  34478888888999999999864


No 23 
>COG5110 RPN1 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones]
Probab=23.22  E-value=50  Score=35.73  Aligned_cols=61  Identities=20%  Similarity=0.194  Sum_probs=37.8

Q ss_pred             HHHhhhhHHHHHHhhhhhHHHHHHHHHHH---------HHHHhhhh----hhchHHHHHH-----------HHHhhcccc
Q psy7158           9 EEKKKMEEEEEEKKNKKGEEEEEISMRLV---------HSLINVSN----AAISLTDLGF-----------FASHVKTLL   64 (402)
Q Consensus         9 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~---------~~L~~L~~----~~LN~tgF~K-----------ILKKyDK~~   64 (402)
                      ..|||.|+|..-..-++++-|+|.+-|-.         .+|++|+-    +.-..|+.-|           .++-|||.+
T Consensus        28 ~~k~kee~e~lseed~~lk~dLellVeriqd~d~~l~~~sLn~LkeviksStSsmtavpkplkfLrp~y~dl~~iydkw~  107 (881)
T COG5110          28 KDKKKEEEEQLSEEDAMLKGDLELLVERIQDPDIDLQNNSLNMLKEVIKSSTSSMTAVPKPLKFLRPNYLDLLEIYDKWL  107 (881)
T ss_pred             cchhhhhHhhhchhhhhhcccHHHHHHHhhCCChHHHHHHHHHHHHHHhccccccccCCchhhhcCCCcchHHHHHhhcc
Confidence            34566666665556666666777666544         36666652    3444566655           566789988


Q ss_pred             ccccc
Q psy7158          65 NVDVG   69 (402)
Q Consensus        65 ~~~~~   69 (402)
                      ..+..
T Consensus       108 ~~n~K  112 (881)
T COG5110         108 EGNKK  112 (881)
T ss_pred             Ccchh
Confidence            77644


No 24 
>PF01086 Clathrin_lg_ch:  Clathrin light chain;  InterPro: IPR000996 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ].  Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins [].  This entry represents clathrin light chains, which are more divergent in sequence than the heavy chains []. In higher eukaryotes, two genes encode distinct but related light chains, each of which can yield two separate forms via alternative splicing. In yeast there is a single light chain whose sequence is only distantly related to that of higher eukaryotes. Clathrin light chains have a conserved acidic N-terminal domain, a central coiled-coil domain and a conserved C-terminal domain.  More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030130 clathrin coat of trans-Golgi network vesicle, 0030132 clathrin coat of coated pit; PDB: 3LVG_E 3LVH_D.
Probab=23.00  E-value=1.4e+02  Score=28.43  Aligned_cols=62  Identities=23%  Similarity=0.303  Sum_probs=26.1

Q ss_pred             ccccccHHHHhhh----hHHHHHHhhhhhHHHHHHHHHHHHHHHhhh--hhhchHHHHHHHHHhhccc
Q psy7158           2 EPLLKWKEEKKKM----EEEEEEKKNKKGEEEEEISMRLVHSLINVS--NAAISLTDLGFFASHVKTL   63 (402)
Q Consensus         2 ~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~e~~~~L~~L~--~~~LN~tgF~KILKKyDK~   63 (402)
                      |++-+|+|+.++.    ++++++||-.-.++=.+.+.++|.......  +...|+..=-..+++-|+.
T Consensus       112 e~ireWre~~~~~i~ekD~~e~~kk~e~~~~A~k~lddfY~~~~~k~e~~k~~nr~~ee~fl~~~~~~  179 (225)
T PF01086_consen  112 EAIREWREERDKRIEEKDAEEEEKKEEIKEKAKKELDDFYENRNEKKEKNKKQNREEEEEFLAKREEF  179 (225)
T ss_dssp             THHHHHHHHHTTTSTHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTT--------------
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccc
Confidence            4677899986543    333444444444444445666665544333  3666665555555555554


No 25 
>PF07716 bZIP_2:  Basic region leucine zipper;  InterPro: IPR011700 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotes are proteins that contain a basic region mediating sequence-specific DNA-binding, followed by a leucine zipper region (see IPR002158 from INTERPRO), which is required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 1NWQ_A 1H89_B 1H88_A 1GTW_B 2E43_A 1IO4_A 1GU4_B 2E42_A 1H8A_B 1GU5_B ....
Probab=22.19  E-value=2e+02  Score=20.70  Aligned_cols=31  Identities=35%  Similarity=0.382  Sum_probs=21.5

Q ss_pred             HHhhhhHHHHHHhhhhhHHHHHHHHHHHHHH
Q psy7158          10 EKKKMEEEEEEKKNKKGEEEEEISMRLVHSL   40 (402)
Q Consensus        10 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~L   40 (402)
                      ++||+..+..+.+...+++++..|..-...|
T Consensus        21 ~rkk~~~~~le~~~~~L~~en~~L~~~i~~L   51 (54)
T PF07716_consen   21 QRKKQREEELEQEVQELEEENEQLRQEIAQL   51 (54)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            5667777777777777777777776555543


No 26 
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=21.79  E-value=1.7e+02  Score=26.58  Aligned_cols=27  Identities=26%  Similarity=0.202  Sum_probs=20.5

Q ss_pred             HHHHHhhhhhHHHHHHHHHHHHHHHhh
Q psy7158          17 EEEEKKNKKGEEEEEISMRLVHSLINV   43 (402)
Q Consensus        17 ~~~~~~~~~~~~~~~~~~e~~~~L~~L   43 (402)
                      |+.++++.+++++.+++.|.|++|...
T Consensus       121 e~Le~e~~~L~~~~~~~~eDY~~L~~I  147 (161)
T TIGR02894       121 EELEKELEKLRQRLSTIEEDYQTLIDI  147 (161)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            466777777888888888888887553


No 27 
>PF06160 EzrA:  Septation ring formation regulator, EzrA ;  InterPro: IPR010379 During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerises into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation [].; GO: 0000921 septin ring assembly, 0005940 septin ring, 0016021 integral to membrane
Probab=21.07  E-value=5.5e+02  Score=27.80  Aligned_cols=91  Identities=13%  Similarity=0.118  Sum_probs=57.7

Q ss_pred             HHHHHHhhhhhHHHHHHHHHHHHHHHhhhhhhchHHHHHHHHHhhccccccccchHHHHhcccc-CCccc-chhHHHHHH
Q psy7158          16 EEEEEKKNKKGEEEEEISMRLVHSLINVSNAAISLTDLGFFASHVKTLLNVDVGAKWRAEHVDV-SHFYV-NKDINHLIS   93 (402)
Q Consensus        16 ~~~~~~~~~~~~~~~~~~~e~~~~L~~L~~~~LN~tgF~KILKKyDK~~~~~~~~~~~~~~v~~-s~F~~-~~~l~~li~   93 (402)
                      ..+.++.....+++.+.+.+.+..|  ++.-+-|......+.++|...-..         ...+ ..|.. .+.+++.++
T Consensus       103 i~~~~~~l~~~e~~i~~i~~~l~~L--~~~e~~nr~~i~~l~~~y~~lrk~---------ll~~~~~~G~a~~~Le~~L~  171 (560)
T PF06160_consen  103 IKEIEEQLDEIEEDIKEILDELDEL--LESEEKNREEIEELKEKYRELRKE---------LLAHSFSYGPAIEELEKQLE  171 (560)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHH---------HHHhhhhhchhHHHHHHHHH
Confidence            3444555555666644444444443  234788899999999999654211         1112 22443 477888899


Q ss_pred             HHHHHHHHh---hccCcHHHHHhhcCC
Q psy7158          94 ETEATVTSE---LEEGDRQKAMKRLRV  117 (402)
Q Consensus        94 ~ve~lYa~~---f~~gdrk~A~~~LR~  117 (402)
                      ++|..|..+   -++||.-+|.+.|..
T Consensus       172 ~ie~~F~~f~~lt~~GD~~~A~eil~~  198 (560)
T PF06160_consen  172 NIEEEFSEFEELTENGDYLEAREILEK  198 (560)
T ss_pred             HHHHHHHHHHHHHHCCCHHHHHHHHHH
Confidence            999988864   357999888877764


No 28 
>PF10669 Phage_Gp23:  Protein gp23 (Bacteriophage A118);  InterPro: IPR018926  This entry is represented by the major tail subunit protein, Gp23 of Listeria phage A118 and prophage found in Bacilli. The function is currently unknown. 
Probab=20.26  E-value=4e+02  Score=22.22  Aligned_cols=10  Identities=30%  Similarity=0.594  Sum_probs=5.1

Q ss_pred             HhhhhHHHHH
Q psy7158          11 KKKMEEEEEE   20 (402)
Q Consensus        11 ~~~~~~~~~~   20 (402)
                      +-.|.|||+.
T Consensus        45 r~~MKEER~K   54 (121)
T PF10669_consen   45 RIRMKEERSK   54 (121)
T ss_pred             HHHHHHHHHH
Confidence            4456665443


No 29 
>PF04930 FUN14:  FUN14 family;  InterPro: IPR007014 This is a family of short proteins found in eukaryotes and some archaea. Although the function of these proteins is not known they may contain transmembrane helices.
Probab=20.13  E-value=79  Score=25.96  Aligned_cols=34  Identities=21%  Similarity=0.220  Sum_probs=17.8

Q ss_pred             ccCcHHHHHhhcCCCCCCCCCCCchhhHHHHHHHH
Q psy7158         104 EEGDRQKAMKRLRVPPLGEQQSPWTTFKVGLFLGC  138 (402)
Q Consensus       104 ~~gdrk~A~~~LR~~~~~~~~~~~~~F~~GlflG~  138 (402)
                      +..|.+...++++.- ..+..+....|.+|+++|+
T Consensus        66 ~~~~~~~~~~~~~~~-l~~~~~~~~gF~~Gf~lG~   99 (100)
T PF04930_consen   66 DAPDIKSLWNKLMDF-LKSNLPFSAGFLAGFLLGF   99 (100)
T ss_pred             cchhHHHHHHHHHHH-HHcCccHhHHHHHHHHHHc
Confidence            344555445454432 1122334677778877774


Done!