Query         psy17488
Match_columns 1446
No_of_seqs    1 out of 3
Neff          1.0 
Searched_HMMs 46136
Date          Fri Aug 16 23:46:30 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy17488.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17488hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF14228 MOR2-PAG1_mid:  Cell m  99.4 6.8E-13 1.5E-17  155.6   8.1  151   89-291   114-267 (1120)
  2 KOG1825|consensus               97.9 1.1E-05 2.5E-10   99.4   6.3  217   68-294   792-1024(2206)
  3 PF13513 HEAT_EZ:  HEAT-like re  92.6   0.051 1.1E-06   42.4   0.9   28  223-250    28-55  (55)
  4 PF14098 SSPI:  Small, acid-sol  81.4     1.5 3.3E-05   39.2   3.2   33  238-277     2-34  (65)
  5 TIGR03092 SASP_sspI small, aci  80.0     1.8   4E-05   38.9   3.2   33  238-277     1-33  (65)
  6 PF13646 HEAT_2:  HEAT repeats;  79.7     1.4   3E-05   35.8   2.2   36  224-259    32-67  (88)
  7 PRK02955 small acid-soluble sp  77.4     2.4 5.3E-05   38.4   3.2   34  237-277     3-36  (68)
  8 smart00185 ARM Armadillo/beta-  76.1     1.2 2.6E-05   31.9   0.8   29  223-251    12-40  (41)
  9 KOG4676|consensus               67.4     4.9 0.00011   45.6   3.4   24   77-102    11-34  (479)
 10 PF00514 Arm:  Armadillo/beta-c  66.4     2.4 5.3E-05   32.0   0.6   29  223-251    12-40  (41)
 11 PF02985 HEAT:  HEAT repeat;  I  63.0     2.8 6.1E-05   30.7   0.4   25  226-250     3-27  (31)
 12 PF03130 HEAT_PBS:  PBS lyase H  53.6      16 0.00034   26.7   2.9   24  239-262     1-25  (27)
 13 COG5064 SRP1 Karyopherin (impo  49.6      10 0.00023   42.9   2.1   37  223-259   157-195 (526)
 14 PF11236 DUF3037:  Protein of u  45.3     5.3 0.00011   37.2  -0.8   71  104-174    20-93  (118)
 15 PF02969 TAF:  TATA box binding  45.0      34 0.00074   30.3   4.1   42  242-283    11-54  (66)
 16 smart00567 EZ_HEAT E-Z type HE  42.0      29 0.00063   24.8   2.7   23  238-260     2-25  (30)
 17 TIGR01635 tail_comp_S phage vi  41.9      34 0.00073   32.5   3.9   40  234-273     1-40  (144)
 18 PF09415 CENP-X:  CENP-S associ  40.9      17 0.00038   32.3   1.7   30  249-278    19-48  (72)
 19 cd00020 ARM Armadillo/beta-cat  40.2      32 0.00069   28.1   3.1   29  223-251    49-77  (120)
 20 PF01934 DUF86:  Protein of unk  34.4      46 0.00099   29.4   3.4   36  235-270    81-116 (119)
 21 PF13646 HEAT_2:  HEAT repeats;  33.8      73  0.0016   25.7   4.2   51  228-278     4-59  (88)
 22 PF07112 DUF1368:  Protein of u  33.6      17 0.00038   40.9   0.8   21   82-110    28-48  (404)
 23 PTZ00033 60S ribosomal protein  33.3      21 0.00044   35.4   1.1   35   93-127    21-66  (125)
 24 PF12363 DUF3647:  Phage protei  33.0      69  0.0015   29.9   4.4   40  222-261    36-79  (113)
 25 PF13513 HEAT_EZ:  HEAT-like re  32.6 1.1E+02  0.0025   23.7   5.0   38  238-275     2-42  (55)
 26 PRK09687 putative lyase; Provi  32.3      42 0.00092   34.9   3.2   26  225-250   161-186 (280)
 27 KOG4843|consensus               31.6      30 0.00065   38.5   2.1  121   91-222   191-333 (355)
 28 cd01899 Ygr210 Ygr210 subfamil  29.3      57  0.0012   34.8   3.7   56  228-283   103-159 (318)
 29 PF15209 IL31:  Interleukin 31   27.9      59  0.0013   32.8   3.2   69  230-303    46-116 (137)
 30 PRK09687 putative lyase; Provi  26.2      58  0.0013   33.9   3.0   39  224-262   129-168 (280)
 31 PF12972 NAGLU_C:  Alpha-N-acet  26.1      74  0.0016   33.3   3.7   88  157-259    16-114 (267)
 32 PF13976 gag_pre-integrs:  GAG-  24.8      50  0.0011   26.7   1.8   20  241-260    24-44  (67)
 33 KOG0414|consensus               24.7      55  0.0012   41.2   2.9   31  226-256   315-345 (1251)
 34 KOG2549|consensus               24.5 1.1E+02  0.0024   36.2   5.1   48  246-293    25-80  (576)
 35 cd00020 ARM Armadillo/beta-cat  24.4      78  0.0017   25.8   2.9   31  222-252     6-36  (120)
 36 PF14123 DUF4290:  Domain of un  23.0 1.9E+02  0.0042   30.0   5.9   71  157-249    58-137 (176)
 37 PF04484 DUF566:  Family of unk  21.4      91   0.002   33.7   3.4   21  228-248   250-273 (311)
 38 PF08757 CotH:  CotH protein;    20.9 2.4E+02  0.0053   27.8   6.1   31  243-273   261-292 (323)
 39 PF12755 Vac14_Fab1_bd:  Vacuol  20.8      93   0.002   28.4   2.9   28  224-251    25-55  (97)
 40 PF05069 Phage_tail_S:  Phage v  20.4 1.4E+02   0.003   28.0   4.1   38  235-274     2-39  (148)
 41 TIGR00274 N-acetylmuramic acid  20.3      71  0.0015   33.8   2.3   34  238-271   258-291 (291)
 42 TIGR03354 VI_FHA type VI secre  20.2 2.4E+02  0.0051   31.3   6.3   47  233-280   204-254 (396)

No 1  
>PF14228 MOR2-PAG1_mid:  Cell morphogenesis central region
Probab=99.37  E-value=6.8e-13  Score=155.62  Aligned_cols=151  Identities=33%  Similarity=0.573  Sum_probs=116.4

Q ss_pred             CCchhhHHHhhhcCceeeechhhhhhcchhhhhhccccccccCCccCCccchhhccccCCCCCCCccchhhHHHHHHHHh
Q psy17488         89 DPWGFCLFQFLQKGRVLTYCPSAVAQAWPIVYSRLHTLFPTIDPTPVSDNRASLLRSSAPPRKPVSERDSFLSLWRYLAI  168 (1446)
Q Consensus        89 dpwgfclfqflqkgrvltycpsavaqawpivysrlhtlfptidptpvsdnrasllrssapprkpvserdsflslwrylai  168 (1446)
                      ..|.-||=.||.-  +...||.+||.+|-+|..||-.+-|. |.   .. ++.   .+++       -|..+..|+--.|
T Consensus       114 tlW~r~fp~lir~--~fe~CP~tval~R~~v~~RL~~i~~~-~l---~~-k~~---~s~~-------~e~~ieQWKlYLI  176 (1120)
T PF14228_consen  114 TLWFRCFPNLIRI--AFERCPFTVALCRSEVCQRLQQITPI-EL---GS-KSS---QSTD-------PEVLIEQWKLYLI  176 (1120)
T ss_pred             HHHHHHHHHHHHH--HHHhCcHHHHHHHHHHHHHhccCCch-hc---cC-ccC---CCCC-------ccHHHHHHHHHHH
Confidence            4798888888765  56679999999999999999766542 21   11 111   1222       2789999999999


Q ss_pred             hhccccCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCcCCCCCCCCCCCCccchhhhhhccccccccchhhHHHHHhc
Q psy17488        169 FSARVVPPVPYPTPRCASPDLSLSWSPENVGGERGGAGVENKPAPTASPPPSPTGLYKLLVPLLRCDTTDVRDAVVHALG  248 (1446)
Q Consensus       169 fsarvvppvpyptprcaspdlslswspenvggerggagvenkpaptaspppsptglykllvpllrcdttdvrdavvhalg  248 (1446)
                      |..             ++|--+    -.                  +....+..-||+++||+|+|+.++|+||+|.|||
T Consensus       177 fAC-------------ttp~~s----~s------------------~~~i~SaR~LFk~ivPlLks~~~~~r~AaVlaLG  221 (1120)
T PF14228_consen  177 FAC-------------TTPTDS----RS------------------QQKITSARELFKLIVPLLKSESSSFRDAAVLALG  221 (1120)
T ss_pred             HHh-------------cCCCcC----cc------------------ccccCCHHHHHHHHhhhhccCcHHHHHHHHHhcC
Confidence            975             222111    00                  1224467889999999999999999999999999


Q ss_pred             cCCHHHHHHHHHHHHHHHHHHh---hhhhHHHHHHhhhchhhhccc
Q psy17488        249 NINSEALKDLMEELSVYIREAV---DRKTENMRRRRRRDALRLQSY  291 (1446)
Q Consensus       249 ninsealkdlmeelsvyireav---drktenmrrrrrrdalrlqsy  291 (1446)
                      +||..+++.||++|+++|.++.   +.+..+|++++|+|.||++-.
T Consensus       222 ~~n~~v~~~LleeL~~~i~~~~~e~e~r~~~k~rr~Rrd~LR~ev~  267 (1120)
T PF14228_consen  222 SINLNVYRTLLEELQSYIEECNSEAESRPKWKRRRRRRDRLRTEVT  267 (1120)
T ss_pred             CCCHHHHHHHHHHHHHHHHHHHHHHhcccccccchhhhhhHHHHHH
Confidence            9999999999999999999874   577889999999999998753


No 2  
>KOG1825|consensus
Probab=97.93  E-value=1.1e-05  Score=99.43  Aligned_cols=217  Identities=13%  Similarity=0.071  Sum_probs=156.3

Q ss_pred             ccccCCCCCceeecCCCCCCCCCchhhHHHhhhcCceeeechhhhhhcchhhhhhccccccccCCc-cCCccchhhcccc
Q psy17488         68 IHEDSNRNGSVLNLSPSCSAPDPWGFCLFQFLQKGRVLTYCPSAVAQAWPIVYSRLHTLFPTIDPT-PVSDNRASLLRSS  146 (1446)
Q Consensus        68 ihedsnrngsvlnlspscsapdpwgfclfqflqkgrvltycpsavaqawpivysrlhtlfptidpt-pvsdnrasllrss  146 (1446)
                      +|.+..-.+.|.+.--+|-.-|+||+|||-+.+.--+++-|+.+++++|++...++-+.-++.|+. |-.|+...+++-.
T Consensus       792 ~~~~i~sd~~~~~~~f~~~~k~kwg~~L~~~~~~~~l~~~c~~~~s~s~~ll~~~~~a~~~~~~~s~~q~d~~s~~~~~~  871 (2206)
T KOG1825|consen  792 DSDTIPSDSTLWSIIFSRLDKNKWGLLLSELVKYAGLLKPCSVQESKSEILLRRAHSAPSEFGGKSNPQNDLDSKLLRWL  871 (2206)
T ss_pred             CCCCCCCcceehhhhhhhhccchhhhhhHHHHHHhcCCCCcchhhhhhhHHHHHHhccchhccCCCccccccchHHHhhh
Confidence            444555556666677778788999999999999999999999999999999999999999999997 5567787777765


Q ss_pred             CCCCCC-----CccchhhHHHHHHHHhhhccccCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCC-------cCCCCCC
Q psy17488        147 APPRKP-----VSERDSFLSLWRYLAIFSARVVPPVPYPTPRCASPDLSLSWSPENVGGERGGAGV-------ENKPAPT  214 (1446)
Q Consensus       147 apprkp-----vserdsflslwrylaifsarvvppvpyptprcaspdlslswspenvggerggagv-------enkpapt  214 (1446)
                      --+---     +---+.-+.+|+-.-||...|.++.      -.+++---+-.|.+.++.-+|.--       .++|+.+
T Consensus       872 ~~~~~v~fk~~~~~~~~~v~~w~~yl~~~~~~~~~~------~~s~~h~rs~~pt~~~~s~~~~fS~~~~~~s~~tpi~s  945 (2206)
T KOG1825|consen  872 LYAMFVCFKPPDGLDEGSVASWRDYLHLIFGVLRFG------SESGNHLRSNAPTISHTSCEGMFSEVAISFSDETPIKS  945 (2206)
T ss_pred             ccccceecccccccchhhhhhhhhhHHHhhhccccc------cccccchhhcCcccccCCccceehhhhhhccccCccch
Confidence            443322     3334556788997777776665432      223343445566766666444321       2233222


Q ss_pred             CCCCCCccchhhhhhccccccccchhhHHHHHhccCCHHHHHHHHHHHHH--HHHHHhhhhhH-HHHHHhhhchhhhccc
Q psy17488        215 ASPPPSPTGLYKLLVPLLRCDTTDVRDAVVHALGNINSEALKDLMEELSV--YIREAVDRKTE-NMRRRRRRDALRLQSY  291 (1446)
Q Consensus       215 aspppsptglykllvpllrcdttdvrdavvhalgninsealkdlmeelsv--yireavdrkte-nmrrrrrrdalrlqsy  291 (1446)
                          +.-.++-++.|++..-..+++.+-+|-..|.+|.-+..-+.++|.-  -.++++.|.-+ |..++-|.|.|++|-+
T Consensus       946 ----~~k~~i~~~~~~~~~~~~~~i~~~~v~~~~~~~~~vf~~~~~~l~~r~~~r~sl~R~e~~~~~~~~r~dLl~~~i~ 1021 (2206)
T KOG1825|consen  946 ----KIKKVIRRENVRVHVENIYRITEENVWPFMLARKPVFALLVLRLIERSTRRNSLARSESFHIMAPLRYDLLSVLIF 1021 (2206)
T ss_pred             ----HHHHHHhhhhHHHHHHhccccchhheeccccccccchhhhhhhhhhcccchhhhhcchhhhhhhhhhHHHHHHHhc
Confidence                2334566667777777777788888888888888777777777665  56789999887 9999999999999987


Q ss_pred             CCC
Q psy17488        292 RPE  294 (1446)
Q Consensus       292 rpe  294 (1446)
                      +-.
T Consensus      1022 ~~~ 1024 (2206)
T KOG1825|consen 1022 LDP 1024 (2206)
T ss_pred             ccH
Confidence            643


No 3  
>PF13513 HEAT_EZ:  HEAT-like repeat; PDB: 2Z5J_A 2OT8_B 2Z5O_A 2H4M_A 2QMR_A 1QBK_B 2Z5M_A 2Z5K_A 2Z5N_A 1GCJ_B ....
Probab=92.56  E-value=0.051  Score=42.42  Aligned_cols=28  Identities=46%  Similarity=0.676  Sum_probs=24.9

Q ss_pred             chhhhhhccccccccchhhHHHHHhccC
Q psy17488        223 GLYKLLVPLLRCDTTDVRDAVVHALGNI  250 (1446)
Q Consensus       223 glykllvpllrcdttdvrdavvhalgni  250 (1446)
                      .+..+|+++|.-|..+||.+.++|||||
T Consensus        28 ~~~~~L~~~L~d~~~~VR~~A~~aLg~l   55 (55)
T PF13513_consen   28 ELLPALIPLLQDDDDSVRAAAAWALGNL   55 (55)
T ss_dssp             HHHHHHHHHTTSSSHHHHHHHHHHHHCH
T ss_pred             HHHHHHHHHHcCCCHHHHHHHHHHHhcC
Confidence            4677888999888899999999999997


No 4  
>PF14098 SSPI:  Small, acid-soluble spore protein I
Probab=81.43  E-value=1.5  Score=39.22  Aligned_cols=33  Identities=33%  Similarity=0.573  Sum_probs=27.7

Q ss_pred             chhhHHHHHhccCCHHHHHHHHHHHHHHHHHHhhhhhHHH
Q psy17488        238 DVRDAVVHALGNINSEALKDLMEELSVYIREAVDRKTENM  277 (1446)
Q Consensus       238 dvrdavvhalgninsealkdlmeelsvyireavdrktenm  277 (1446)
                      |+|.||+|-|-|-+.+.|++.       |.+|||.+-|+|
T Consensus         2 dlR~AI~~nv~g~s~~el~~~-------I~daI~sgEE~~   34 (65)
T PF14098_consen    2 DLRQAIIHNVKGSSKEELKDT-------IEDAIQSGEEKA   34 (65)
T ss_pred             CHHHHHHHHccCCCHHHHHHH-------HHHHHhccchhc
Confidence            799999999999998887774       667888877765


No 5  
>TIGR03092 SASP_sspI small, acid-soluble spore protein I. This protein family is restricted to a subset of endospore-forming bacteria such as Bacillus subtilis, all of which are in the Firmicutes (low-GC Gram-positive) lineage. It is a minor SASP (small, acid-soluble spore protein) designated SspI. The gene in Bacillus subtilis previously was designated ysfA.
Probab=80.01  E-value=1.8  Score=38.89  Aligned_cols=33  Identities=27%  Similarity=0.454  Sum_probs=28.0

Q ss_pred             chhhHHHHHhccCCHHHHHHHHHHHHHHHHHHhhhhhHHH
Q psy17488        238 DVRDAVVHALGNINSEALKDLMEELSVYIREAVDRKTENM  277 (1446)
Q Consensus       238 dvrdavvhalgninsealkdlmeelsvyireavdrktenm  277 (1446)
                      |+|+||+|-|-|-+.+.||+.       |-.|||.+-|+|
T Consensus         1 nlR~Ai~~nv~~~s~~elk~~-------I~daI~sgEEk~   33 (65)
T TIGR03092         1 NLRRAILANISNNTKEQLEAT-------IVDAIQSGEEKM   33 (65)
T ss_pred             CHHHHHHHHhcCCCHHHHHHH-------HHHHHhccchhc
Confidence            789999999999998877775       667888888876


No 6  
>PF13646 HEAT_2:  HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=79.65  E-value=1.4  Score=35.75  Aligned_cols=36  Identities=28%  Similarity=0.339  Sum_probs=27.4

Q ss_pred             hhhhhhccccccccchhhHHHHHhccCCHHHHHHHH
Q psy17488        224 LYKLLVPLLRCDTTDVRDAVVHALGNINSEALKDLM  259 (1446)
Q Consensus       224 lykllvpllrcdttdvrdavvhalgninsealkdlm  259 (1446)
                      +...|+.+|..+..+||.+++.|||+|......+.|
T Consensus        32 ~~~~L~~~l~d~~~~vr~~a~~aL~~i~~~~~~~~L   67 (88)
T PF13646_consen   32 AIPALIELLKDEDPMVRRAAARALGRIGDPEAIPAL   67 (88)
T ss_dssp             HHHHHHHHHTSSSHHHHHHHHHHHHCCHHHHTHHHH
T ss_pred             HHHHHHHHHcCCCHHHHHHHHHHHHHhCCHHHHHHH
Confidence            456778888888888999999999999765544433


No 7  
>PRK02955 small acid-soluble spore protein SspI; Provisional
Probab=77.44  E-value=2.4  Score=38.39  Aligned_cols=34  Identities=26%  Similarity=0.387  Sum_probs=28.4

Q ss_pred             cchhhHHHHHhccCCHHHHHHHHHHHHHHHHHHhhhhhHHH
Q psy17488        237 TDVRDAVVHALGNINSEALKDLMEELSVYIREAVDRKTENM  277 (1446)
Q Consensus       237 tdvrdavvhalgninsealkdlmeelsvyireavdrktenm  277 (1446)
                      -|+|+||+|-|-|-+.+.|++       +|-+||+.+-|+|
T Consensus         3 ~nlR~Ai~~nv~g~s~eel~~-------~I~daIqsgEEk~   36 (68)
T PRK02955          3 FNLRGAVLANVSGNSKEELEG-------TIVDAIQSGEEKM   36 (68)
T ss_pred             ccHHHHHHHHccCCCHHHHHH-------HHHHHHhccchhc
Confidence            589999999999988877766       5677888888876


No 8  
>smart00185 ARM Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Probab=76.09  E-value=1.2  Score=31.86  Aligned_cols=29  Identities=31%  Similarity=0.449  Sum_probs=25.2

Q ss_pred             chhhhhhccccccccchhhHHHHHhccCC
Q psy17488        223 GLYKLLVPLLRCDTTDVRDAVVHALGNIN  251 (1446)
Q Consensus       223 glykllvpllrcdttdvrdavvhalgnin  251 (1446)
                      |....||.||.++..+|..+++.||.||.
T Consensus        12 g~i~~L~~ll~~~~~~i~~~a~~aL~nl~   40 (41)
T smart00185       12 GGLPALVELLKSEDEEVVKEAAWALSNLS   40 (41)
T ss_pred             CCHHHHHHHHcCCCHHHHHHHHHHHHHHc
Confidence            45678999999988999999999999984


No 9  
>KOG4676|consensus
Probab=67.37  E-value=4.9  Score=45.62  Aligned_cols=24  Identities=38%  Similarity=0.296  Sum_probs=15.8

Q ss_pred             ceeecCCCCCCCCCchhhHHHhhhcC
Q psy17488         77 SVLNLSPSCSAPDPWGFCLFQFLQKG  102 (1446)
Q Consensus        77 svlnlspscsapdpwgfclfqflqkg  102 (1446)
                      .|.||+|+...  --...||.||-|=
T Consensus        11 qvanispsat~--dqm~tlFg~lGkI   34 (479)
T KOG4676|consen   11 QVANISPSATK--DQMQTLFGNLGKI   34 (479)
T ss_pred             eecccCchhhH--HHHHHHHhhcccc
Confidence            47788887542  3345788888663


No 10 
>PF00514 Arm:  Armadillo/beta-catenin-like repeat;  InterPro: IPR000225 The armadillo (Arm) repeat is an approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila melanogaster segment polarity gene armadillo involved in signal transduction through wingless. Animal Arm-repeat proteins function in various processes, including intracellular signalling and cytoskeletal regulation, and include such proteins as beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumour suppressor protein, and the nuclear transport factor importin-alpha, amongst others []. A subset of these proteins is conserved across eukaryotic kingdoms. In higher plants, some Arm-repeat proteins function in intracellular signalling like their mammalian counterparts, while others have novel functions []. The 3-dimensional fold of an armadillo repeat is known from the crystal structure of beta-catenin, where the 12 repeats form a superhelix of alpha helices with three helices per unit []. The cylindrical structure features a positively charged grove, which presumably interacts with the acidic surfaces of the known interaction partners of beta-catenin.; GO: 0005515 protein binding; PDB: 2Z6G_A 1IQ1_C 3RZX_A 2C1M_A 3BTR_C 3OQS_A 3TPO_A 1IAL_A 1Q1S_C 1PJM_B ....
Probab=66.42  E-value=2.4  Score=31.95  Aligned_cols=29  Identities=34%  Similarity=0.518  Sum_probs=25.9

Q ss_pred             chhhhhhccccccccchhhHHHHHhccCC
Q psy17488        223 GLYKLLVPLLRCDTTDVRDAVVHALGNIN  251 (1446)
Q Consensus       223 glykllvpllrcdttdvrdavvhalgnin  251 (1446)
                      |....||.||.++..+|+..++-||+||-
T Consensus        12 g~i~~Lv~ll~~~~~~v~~~a~~al~nl~   40 (41)
T PF00514_consen   12 GGIPPLVQLLKSPDPEVQEEAAWALGNLA   40 (41)
T ss_dssp             THHHHHHHHTTSSSHHHHHHHHHHHHHHH
T ss_pred             ccHHHHHHHHcCCCHHHHHHHHHHHHHHh
Confidence            56778999999999999999999999973


No 11 
>PF02985 HEAT:  HEAT repeat;  InterPro: IPR000357 The HEAT repeat is a tandemly repeated, 37-47 amino acid long module occurring in a number of cytoplasmic proteins, including the four name-giving proteins huntingtin, elongation factor 3 (EF3), the 65 Kd alpha regulatory subunit of protein phosphatase 2A (PP2A) and the yeast PI3-kinase TOR1 []. Arrays of HEAT repeats consists of 3 to 36 units forming a rod-like helical structure and appear to function as protein-protein interaction surfaces. It has been noted that many HEAT repeat-containing proteins are involved in intracellular transport processes. In the crystal structure of PP2A PR65/A [], the HEAT repeats consist of pairs of antiparallel alpha helices [].; GO: 0005515 protein binding; PDB: 3FGA_A 2PF4_C 2IAE_A 2BKU_D 3EA5_B 3ND2_A 2BPT_A 2NYL_A 2NPP_D 2PKG_B ....
Probab=62.97  E-value=2.8  Score=30.66  Aligned_cols=25  Identities=36%  Similarity=0.578  Sum_probs=19.4

Q ss_pred             hhhhccccccccchhhHHHHHhccC
Q psy17488        226 KLLVPLLRCDTTDVRDAVVHALGNI  250 (1446)
Q Consensus       226 kllvpllrcdttdvrdavvhalgni  250 (1446)
                      -.|+-+|.=+..+||+|++.+||+|
T Consensus         3 p~l~~~l~D~~~~VR~~a~~~l~~i   27 (31)
T PF02985_consen    3 PILLQLLNDPSPEVRQAAAECLGAI   27 (31)
T ss_dssp             HHHHHHHT-SSHHHHHHHHHHHHHH
T ss_pred             HHHHHHcCCCCHHHHHHHHHHHHHH
Confidence            3556677767789999999999986


No 12 
>PF03130 HEAT_PBS:  PBS lyase HEAT-like repeat;  InterPro: IPR004155 These proteins contain a short bi-helical repeat that is related to HEAT. Cyanobacteria and red algae harvest light energy using macromolecular complexes known as phycobilisomes (PBS), peripherally attached to the photosynthetic membrane. The major components of PBS are the phycobiliproteins. These heterodimeric proteins are covalently attached to phycobilins: open-chain tetrapyrrole chromophores, which function as the photosynthetic light-harvesting pigments. Phycobiliproteins differ in sequence and in the nature and number of attached phycobilins to each of their subunits. These proteins include the lyase enzymes that specifically attach particular phycobilins to apophycobiliprotein subunits. The most comprehensively studied of these is the CpcE/Flyase P31967 from SWISSPROT, P31968 from SWISSPROT, which attaches phycocyanobilin (PCB) to the alpha subunit of apophycocyanin []. Similarly, MpeU/V attaches phycoerythrobilin to phycoerythrin II, while CpeY/Z is thought to be involved in phycoerythrobilin (PEB) attachment to phycoerythrin (PE) I (PEs I and II differ in sequence and in the number of attached molecules of PEB: PE I has five, PE II has six) []. All the reactions of the above lyases involve an apoprotein cysteine SH addition to a terminal delta 3,3'-double bond. Such a reaction is not possible in the case of phycoviolobilin (PVB), the phycobilin of alpha-phycoerythrocyanin (alpha-PEC). It is thought that in this case, PCB, not PVB, is first added to apo-alpha-PEC, and is then isomerized to PVB. The addition reaction has been shown to occur in the presence of either of the components of alpha-PEC-PVB lyase PecE or PecF (or both). The isomerisation reaction occurs only when both PecE and PecF components are present, i.e. the PecE/F phycobiliprotein lyase is also a phycobilin isomerase []. Another member of this family is the NblB protein, whose similarity to the phycobiliprotein lyases was previously noted []. This constitutively expressed protein is not known to have any lyase activity. It is thought to be involved in the coordination of PBS degradation with environmental nutrient limitation. It has been suggested that the similarity of NblB to the phycobiliprotein lyases is due to the ability to bind tetrapyrrole phycobilins via the common repeated motif [].; PDB: 1TE4_A.
Probab=53.61  E-value=16  Score=26.67  Aligned_cols=24  Identities=46%  Similarity=0.672  Sum_probs=18.0

Q ss_pred             hhhHHHHHhccCCH-HHHHHHHHHH
Q psy17488        239 VRDAVVHALGNINS-EALKDLMEEL  262 (1446)
Q Consensus       239 vrdavvhalgnins-ealkdlmeel  262 (1446)
                      ||-+.+-|||+|.. +|+..|++-|
T Consensus         1 VR~~Aa~aLg~igd~~ai~~L~~~L   25 (27)
T PF03130_consen    1 VRRAAARALGQIGDPRAIPALIEAL   25 (27)
T ss_dssp             HHHHHHHHHGGG-SHHHHHHHHHHH
T ss_pred             CHHHHHHHHHHcCCHHHHHHHHHHh
Confidence            68889999999988 6666666654


No 13 
>COG5064 SRP1 Karyopherin (importin) alpha [Intracellular trafficking and secretion]
Probab=49.61  E-value=10  Score=42.93  Aligned_cols=37  Identities=43%  Similarity=0.678  Sum_probs=29.8

Q ss_pred             chhhhhhccccccccchhhHHHHHhccC--CHHHHHHHH
Q psy17488        223 GLYKLLVPLLRCDTTDVRDAVVHALGNI--NSEALKDLM  259 (1446)
Q Consensus       223 glykllvpllrcdttdvrdavvhalgni--nsealkdlm  259 (1446)
                      |..-|++-||---..||++.+|.|||||  +|++..|.+
T Consensus       157 ~AVPlfiqlL~s~~~~V~eQavWALGNiAGDS~~~RD~v  195 (526)
T COG5064         157 GAVPLFIQLLSSTEDDVREQAVWALGNIAGDSEGCRDYV  195 (526)
T ss_pred             CchHHHHHHHcCchHHHHHHHHHHhccccCCchhHHHHH
Confidence            3344788888888889999999999999  577777754


No 14 
>PF11236 DUF3037:  Protein of unknown function (DUF3037);  InterPro: IPR021398  This bacterial family of proteins has no known function. 
Probab=45.25  E-value=5.3  Score=37.22  Aligned_cols=71  Identities=20%  Similarity=0.342  Sum_probs=44.4

Q ss_pred             eeeechhh--hhhcchhhhhhccccccccCCccCCccchhhccccCCCC-CCCccchhhHHHHHHHHhhhcccc
Q psy17488        104 VLTYCPSA--VAQAWPIVYSRLHTLFPTIDPTPVSDNRASLLRSSAPPR-KPVSERDSFLSLWRYLAIFSARVV  174 (1446)
Q Consensus       104 vltycpsa--vaqawpivysrlhtlfptidptpvsdnrasllrssappr-kpvserdsflslwrylaifsarvv  174 (1446)
                      ||.|||.+  +.-.|-.-..||+.|||.+|..-+.++-.++-..-+--. ......+...+.|++|.---.-||
T Consensus        20 Vvl~~~~~~~l~~r~~~~~~Rl~~f~~~~D~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~f~~lt~~rstvi   93 (118)
T PF11236_consen   20 VVLFCPEQGFLDFRFHLDRKRLRAFFPELDIDLVRAALEAFERECAGIKEAGPIAQLDIAERFRWLTAPRSTVI   93 (118)
T ss_pred             EEEEeCCCCeEEEEEeCCHHHHHHhCccCCHHHHHHHHHHHHHHHccCCCCCCcCcccHHHHHHHhhhcccCcE
Confidence            66788843  444555556999999999999988887766533222111 111445557788888764433333


No 15 
>PF02969 TAF:  TATA box binding protein associated factor (TAF);  InterPro: IPR004823 The TATA box binding protein associated factor (TAF) is part of the transcription initiation factor TFIID multimeric protein complex. TFIID plays a central role in mediating promoter responses to various activators and repressors. It binds tightly to TAFII-250 and directly interacts with TAFII-40. TFIID is composed of TATA binding protein (TBP)and a number of TBP-associated factors (TAFS). TAF proteins adopt a histone-like fold.; GO: 0006352 transcription initiation, DNA-dependent, 0005634 nucleus; PDB: 1TAF_B.
Probab=45.01  E-value=34  Score=30.32  Aligned_cols=42  Identities=29%  Similarity=0.484  Sum_probs=32.3

Q ss_pred             HHHHHh--ccCCHHHHHHHHHHHHHHHHHHhhhhhHHHHHHhhh
Q psy17488        242 AVVHAL--GNINSEALKDLMEELSVYIREAVDRKTENMRRRRRR  283 (1446)
Q Consensus       242 avvhal--gninsealkdlmeelsvyireavdrktenmrrrrrr  283 (1446)
                      +|.-.|  .||+.++++.|.+++..-|++.|..-..-|+.-+|.
T Consensus        11 ~iAes~Gi~~l~de~a~~La~dveyrlreiiq~a~kfm~hskR~   54 (66)
T PF02969_consen   11 DIAESLGISNLSDEAAKALAEDVEYRLREIIQEALKFMRHSKRT   54 (66)
T ss_dssp             HHHHHTT---B-HHHHHHHHHHHHHHHHHHHHHHHHHHHHTT-S
T ss_pred             HHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC
Confidence            344444  489999999999999999999999999999887765


No 16 
>smart00567 EZ_HEAT E-Z type HEAT repeats. Present in subunits of cyanobacterial phycocyanin lyase, and other proteins. Probable scaffolding role.
Probab=42.01  E-value=29  Score=24.84  Aligned_cols=23  Identities=35%  Similarity=0.531  Sum_probs=16.2

Q ss_pred             chhhHHHHHhccCCH-HHHHHHHH
Q psy17488        238 DVRDAVVHALGNINS-EALKDLME  260 (1446)
Q Consensus       238 dvrdavvhalgnins-ealkdlme  260 (1446)
                      -||-+++-+||||.. +|+..|++
T Consensus         2 ~vR~~aa~aLg~~~~~~a~~~L~~   25 (30)
T smart00567        2 LVRHEAAFALGQLGDEEAVPALIK   25 (30)
T ss_pred             HHHHHHHHHHHHcCCHhHHHHHHH
Confidence            378889999999954 45544444


No 17 
>TIGR01635 tail_comp_S phage virion morphogenesis (putative tail completion) protein. This model describes protein S of phage P2, suggested experimentally to act in tail completion and stable head joining, and related proteins from a number of phage.
Probab=41.88  E-value=34  Score=32.49  Aligned_cols=40  Identities=8%  Similarity=0.217  Sum_probs=33.2

Q ss_pred             ccccchhhHHHHHhccCCHHHHHHHHHHHHHHHHHHhhhh
Q psy17488        234 CDTTDVRDAVVHALGNINSEALKDLMEELSVYIREAVDRK  273 (1446)
Q Consensus       234 cdttdvrdavvhalgninsealkdlmeelsvyireavdrk  273 (1446)
                      .|+.+|.+++-++|.+++...++.||.+|.-+|+..++..
T Consensus         1 ~~~~~~~~~l~~l~~~l~~~~~~~l~~~Ig~~l~~~~~~r   40 (144)
T TIGR01635         1 EDRSYIQVALERLLQVLDPKTRDKMLRRIAAQLASSTRKN   40 (144)
T ss_pred             CcHHHHHHHHHHHHHhcChhhHHHHHHHHHHHHHHHHHHH
Confidence            3778899999999999988788899999999888776543


No 18 
>PF09415 CENP-X:  CENP-S associating Centromere protein X;  InterPro: IPR018552 Centromere protein X (CENP-X) is a component of the CENP-S complex. The CENP-S complex is composed of at least of CENP-S and CENP-X and is essential for the stable assembly of the outer kinetchore [].  CENP-X is also a DNA-binding component of the Fanconi anemia (FA) core complex involved in DNA damage repair and genome maintenance. The FA complex is composed of CENPS, FANCA, FANCB, FANCC, FANCE, FANCF, FANCG, FANCL/PHF9, FANCM, FAAP24 and CENPX. Interacts with CENPS, FANCM and FAAP24 [, ].; PDB: 4DRB_L 4DRA_H 3V9R_D.
Probab=40.88  E-value=17  Score=32.35  Aligned_cols=30  Identities=40%  Similarity=0.556  Sum_probs=25.3

Q ss_pred             cCCHHHHHHHHHHHHHHHHHHhhhhhHHHH
Q psy17488        249 NINSEALKDLMEELSVYIREAVDRKTENMR  278 (1446)
Q Consensus       249 ninsealkdlmeelsvyireavdrktenmr  278 (1446)
                      +|+.+||+-+-+=|-++|+|||-|-.++-+
T Consensus        19 kIs~dal~l~~eyl~iFV~EAv~Ra~~~a~   48 (72)
T PF09415_consen   19 KISKDALKLSAEYLRIFVREAVARAAEQAE   48 (72)
T ss_dssp             EE-CCCHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             CcCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            688999999999999999999999887543


No 19 
>cd00020 ARM Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Probab=40.24  E-value=32  Score=28.14  Aligned_cols=29  Identities=34%  Similarity=0.470  Sum_probs=26.0

Q ss_pred             chhhhhhccccccccchhhHHHHHhccCC
Q psy17488        223 GLYKLLVPLLRCDTTDVRDAVVHALGNIN  251 (1446)
Q Consensus       223 glykllvpllrcdttdvrdavvhalgnin  251 (1446)
                      |+...|+.+|..+..+|+.+.+-||+||-
T Consensus        49 ~~i~~l~~~l~~~~~~v~~~a~~~L~~l~   77 (120)
T cd00020          49 GGLPALVQLLKSEDEEVVKAALWALRNLA   77 (120)
T ss_pred             CChHHHHHHHhCCCHHHHHHHHHHHHHHc
Confidence            67889999999998999999999999983


No 20 
>PF01934 DUF86:  Protein of unknown function DUF86;  InterPro: IPR008201 This entry describes prokaryotic proteins of unknown function.; PDB: 1YLM_A.
Probab=34.36  E-value=46  Score=29.39  Aligned_cols=36  Identities=19%  Similarity=0.355  Sum_probs=24.5

Q ss_pred             cccchhhHHHHHhccCCHHHHHHHHHHHHHHHHHHh
Q psy17488        235 DTTDVRDAVVHALGNINSEALKDLMEELSVYIREAV  270 (1446)
Q Consensus       235 dttdvrdavvhalgninsealkdlmeelsvyireav  270 (1446)
                      ..+++|+.+||.-.+||.+.|-+++.+.--.+.++|
T Consensus        81 ~~~g~RN~lvH~Y~~id~~~i~~~i~~~l~~l~~~~  116 (119)
T PF01934_consen   81 KMVGFRNRLVHDYDSIDDEIIYEIIKEDLPDLEEFI  116 (119)
T ss_dssp             HHHTTHHHHHT-GGG--HHHHHHHHHHTHHHHHHHH
T ss_pred             HHHHHHHHHccccccCCHHHHHHHHHHHHHHHHHHH
Confidence            356899999999999999999888876444444443


No 21 
>PF13646 HEAT_2:  HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=33.76  E-value=73  Score=25.68  Aligned_cols=51  Identities=25%  Similarity=0.342  Sum_probs=30.9

Q ss_pred             hhccc-cccccchhhHHHHHhccCCHHHHHHHHHHH----HHHHHHHhhhhhHHHH
Q psy17488        228 LVPLL-RCDTTDVRDAVVHALGNINSEALKDLMEEL----SVYIREAVDRKTENMR  278 (1446)
Q Consensus       228 lvpll-rcdttdvrdavvhalgninsealkdlmeel----svyireavdrktenmr  278 (1446)
                      |+-+| .=...+||..++++||++..+.+-+++-++    ...||.++-.-..+|.
T Consensus         4 L~~~l~~~~~~~vr~~a~~~L~~~~~~~~~~~L~~~l~d~~~~vr~~a~~aL~~i~   59 (88)
T PF13646_consen    4 LLQLLQNDPDPQVRAEAARALGELGDPEAIPALIELLKDEDPMVRRAAARALGRIG   59 (88)
T ss_dssp             HHHHHHTSSSHHHHHHHHHHHHCCTHHHHHHHHHHHHTSSSHHHHHHHHHHHHCCH
T ss_pred             HHHHHhcCCCHHHHHHHHHHHHHcCCHhHHHHHHHHHcCCCHHHHHHHHHHHHHhC
Confidence            44455 444468999999999999977554443333    2445554444444443


No 22 
>PF07112 DUF1368:  Protein of unknown function (DUF1368);  InterPro: IPR010778 This family consists of several proteins with seem to be specific to red algae plasmids. Members of this family are typically around 415 residues in length. The function of this family is unknown.
Probab=33.62  E-value=17  Score=40.94  Aligned_cols=21  Identities=43%  Similarity=0.977  Sum_probs=17.7

Q ss_pred             CCCCCCCCCchhhHHHhhhcCceeeechh
Q psy17488         82 SPSCSAPDPWGFCLFQFLQKGRVLTYCPS  110 (1446)
Q Consensus        82 spscsapdpwgfclfqflqkgrvltycps  110 (1446)
                      ++|||. |-|.||||       ||.||+.
T Consensus        28 ~~S~SE-dDW~ycll-------ilkfi~p   48 (404)
T PF07112_consen   28 SPSPSE-DDWQYCLL-------ILKFINP   48 (404)
T ss_pred             CCCCCc-cchhheee-------eeeecCc
Confidence            788886 55999997       7899986


No 23 
>PTZ00033 60S ribosomal protein L24; Provisional
Probab=33.34  E-value=21  Score=35.44  Aligned_cols=35  Identities=26%  Similarity=0.492  Sum_probs=29.3

Q ss_pred             hhHHHhhhcCceeeechhhhh-----------hcchhhhhhccccc
Q psy17488         93 FCLFQFLQKGRVLTYCPSAVA-----------QAWPIVYSRLHTLF  127 (1446)
Q Consensus        93 fclfqflqkgrvltycpsava-----------qawpivysrlhtlf  127 (1446)
                      |.-|.||..|.|+++|-+-..           .+|-.+|-|+|.--
T Consensus        21 ~Vr~~~~~Dgkv~~F~~sKc~~~~~~krnPRkl~WT~~yRr~~kK~   66 (125)
T PTZ00033         21 YVPFAFLSTKPVLTFLRPKCFALYMRKKNPRFLPWTRTYRRINRKT   66 (125)
T ss_pred             eeecccCCCCCEEEEecHHHHHHHHCcCCCccchHHHHHHHHhCCc
Confidence            566889999999999987543           47999999999875


No 24 
>PF12363 DUF3647:  Phage protein ;  InterPro: IPR024410 Proteins in this entry are frequently annotated as phage proteins, however there is little accompanying literature to back this up or to describe the nature of these phage proteins.
Probab=32.96  E-value=69  Score=29.88  Aligned_cols=40  Identities=28%  Similarity=0.416  Sum_probs=31.7

Q ss_pred             cchhhhhhccccccccchhhHHHHHh----ccCCHHHHHHHHHH
Q psy17488        222 TGLYKLLVPLLRCDTTDVRDAVVHAL----GNINSEALKDLMEE  261 (1446)
Q Consensus       222 tglykllvpllrcdttdvrdavvhal----gninsealkdlmee  261 (1446)
                      .|+-.|++-|+.+|+.-+.|++-+|+    .++..++|.+.+++
T Consensus        36 ~G~~~l~~~l~~~d~~al~d~i~~a~~~~~~~~s~~eIe~~ie~   79 (113)
T PF12363_consen   36 MGLSQLVPGLLQGDPVALADIIYAATAHEKKRPSREEIEDYIED   79 (113)
T ss_pred             cCHHHHHHHHHcCCHHHHHHHHHHHhcccCCCCCHHHHHHHHHH
Confidence            47788899999999999999998888    45566666666664


No 25 
>PF13513 HEAT_EZ:  HEAT-like repeat; PDB: 2Z5J_A 2OT8_B 2Z5O_A 2H4M_A 2QMR_A 1QBK_B 2Z5M_A 2Z5K_A 2Z5N_A 1GCJ_B ....
Probab=32.60  E-value=1.1e+02  Score=23.67  Aligned_cols=38  Identities=26%  Similarity=0.359  Sum_probs=24.4

Q ss_pred             chhhHHHHHhccCC---HHHHHHHHHHHHHHHHHHhhhhhH
Q psy17488        238 DVRDAVVHALGNIN---SEALKDLMEELSVYIREAVDRKTE  275 (1446)
Q Consensus       238 dvrdavvhalgnin---sealkdlmeelsvyireavdrkte  275 (1446)
                      -||.+.+.|||+|.   .+.++.++.+|--.+...+.....
T Consensus         2 ~vR~~A~~aLg~l~~~~~~~~~~~~~~~~~~L~~~L~d~~~   42 (55)
T PF13513_consen    2 RVRRAAAWALGRLAEGCPELLQPYLPELLPALIPLLQDDDD   42 (55)
T ss_dssp             HHHHHHHHHHHCTTTTTHHHHHHHHHHHHHHHHHHTTSSSH
T ss_pred             HHHHHHHHHHhhHhcccHHHHHHHHHHHHHHHHHHHcCCCH
Confidence            37899999999764   556665555555555555543333


No 26 
>PRK09687 putative lyase; Provisional
Probab=32.31  E-value=42  Score=34.88  Aligned_cols=26  Identities=35%  Similarity=0.405  Sum_probs=15.5

Q ss_pred             hhhhhccccccccchhhHHHHHhccC
Q psy17488        225 YKLLVPLLRCDTTDVRDAVVHALGNI  250 (1446)
Q Consensus       225 ykllvpllrcdttdvrdavvhalgni  250 (1446)
                      ..+|+.+|.-+..+||.+.+.|||+|
T Consensus       161 i~~L~~~L~d~~~~VR~~A~~aLg~~  186 (280)
T PRK09687        161 IPLLINLLKDPNGDVRNWAAFALNSN  186 (280)
T ss_pred             HHHHHHHhcCCCHHHHHHHHHHHhcC
Confidence            45555666555556666666666665


No 27 
>KOG4843|consensus
Probab=31.59  E-value=30  Score=38.52  Aligned_cols=121  Identities=24%  Similarity=0.264  Sum_probs=78.1

Q ss_pred             chhhHHHhhhcC--cee--eechhhhhhcchhhhhhccccccccCCc--------------cCCccchhhccccCCCCCC
Q psy17488         91 WGFCLFQFLQKG--RVL--TYCPSAVAQAWPIVYSRLHTLFPTIDPT--------------PVSDNRASLLRSSAPPRKP  152 (1446)
Q Consensus        91 wgfclfqflqkg--rvl--tycpsavaqawpivysrlhtlfptidpt--------------pvsdnrasllrssapprkp  152 (1446)
                      -|-|+|..+.--  .-|  --|-+.+|..-| |--++|.+|-||.|.              .-+|+-+-|.+-.|++-+|
T Consensus       191 ~~e~~f~~ie~~~~~~le~p~~~~~~~~~~p-~~V~i~~~~~~~~pr~~~~~qlwgTdVytddsD~vt~~~~g~~sll~p  269 (355)
T KOG4843|consen  191 VGECMFELIELWKEYDLEHPDKSGDFANNGP-LEVRIPAEHVTATPRQVRGGQLWGTDVYTDDSDLVTGYCRGTASLLPP  269 (355)
T ss_pred             chhhhhhhhhccccccccCcccccccccCCc-eEEEeeccceeccccccccceeecCccccccchhhheecccccCCCCC
Confidence            456777654321  111  124455555555 344566666666553              4478888888888888888


Q ss_pred             CccchhhHHHHHHHHhhhccccCCCCCCCCCCCCCCCccccCCCC----CCCCCCCCCCcCCCCCCCCCCCCcc
Q psy17488        153 VSERDSFLSLWRYLAIFSARVVPPVPYPTPRCASPDLSLSWSPEN----VGGERGGAGVENKPAPTASPPPSPT  222 (1446)
Q Consensus       153 vserdsflslwrylaifsarvvppvpyptprcaspdlslswspen----vggerggagvenkpaptaspppspt  222 (1446)
                      |-          +.|....+|+|+.-|=|----.---|-+|+-+.    -|=--|+.-|+-||+-|.+---+|+
T Consensus       270 t~----------~~l~v~L~vLP~l~~Y~s~~rn~I~SR~WgS~~~~~~dgls~gi~~I~~kp~l~~~~~~~ph  333 (355)
T KOG4843|consen  270 TM----------QELRVTLRVLPSLDYYTSKLRNNIRSRAWGSGIGCSYDGLSKGIGTIELKPSLTHSSTVEPH  333 (355)
T ss_pred             cc----------ceeEEEEEecCchhhccchhccceeecccccCCceeecccccccceEEEecccccccccCcc
Confidence            73          456667788999877665545555677898776    5666677778888887766555554


No 28 
>cd01899 Ygr210 Ygr210 subfamily.  Ygr210 is a member of Obg-like family and present in archaea and fungi.  They are characterized by a distinct glycine-rich motif immediately following the Walker B motif.  The Ygr210 and YyaF/YchF subfamilies appear to form one major branch of the Obg-like family.  Among eukaryotes, the Ygr210 subfamily is represented only in fungi.  These fungal proteins form a tight cluster with their archaeal orthologs, which suggests the possibility of horizontal transfer from archaea to fungi.
Probab=29.27  E-value=57  Score=34.80  Aligned_cols=56  Identities=14%  Similarity=0.229  Sum_probs=43.0

Q ss_pred             hhccccccccchhhHHHHHhccCC-HHHHHHHHHHHHHHHHHHhhhhhHHHHHHhhh
Q psy17488        228 LVPLLRCDTTDVRDAVVHALGNIN-SEALKDLMEELSVYIREAVDRKTENMRRRRRR  283 (1446)
Q Consensus       228 lvpllrcdttdvrdavvhalgnin-sealkdlmeelsvyireavdrktenmrrrrrr  283 (1446)
                      |+=+++|..++-.+.|.|..|++| .+.|.-|.+||-.++-+.|.++.+.+.++...
T Consensus       103 ii~Vvd~~~~~d~~~~~~~~~~~dp~~d~~~i~~El~~~d~~~~~~~~~~~~~~~~~  159 (318)
T cd01899         103 LIHVVDASGGTDAEGNGVETGGHDPLEDIEFLENEIDMWIYGILEKNWEKIVRKADA  159 (318)
T ss_pred             EEEEEeCCCCcccccccccCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            555667754334467889999998 56788889999999999999888888866553


No 29 
>PF15209 IL31:  Interleukin 31
Probab=27.94  E-value=59  Score=32.84  Aligned_cols=69  Identities=26%  Similarity=0.298  Sum_probs=39.4

Q ss_pred             ccccccccchhhHHHHHhccCCHHHHHHHHHHHHHHHHH-HhhhhhHHHHHHhhhchhhhcccCCC-Cccccceec
Q psy17488        230 PLLRCDTTDVRDAVVHALGNINSEALKDLMEELSVYIRE-AVDRKTENMRRRRRRDALRLQSYRPE-SGERKSYRL  303 (1446)
Q Consensus       230 pllrcdttdvrdavvhalgninsealkdlmeelsvyire-avdrktenmrrrrrrdalrlqsyrpe-sgerksyrl  303 (1446)
                      .+|.|=|-|-     ||..||||-|+..-++.+...... +++.-++.+-.-+-.|++-.+-=.|. +-|+|+|-|
T Consensus        46 ~~LPCft~d~-----~~~~nI~sSaIl~ylk~Ik~l~~n~~i~~Ii~~L~kl~~q~~pEt~iS~P~d~fE~K~FiL  116 (137)
T PF15209_consen   46 YQLPCFTPDA-----QAPNNINSSAILPYLKAIKRLSNNTVIDEIIEQLDKLKFQNPPETNISVPTDSFECKSFIL  116 (137)
T ss_pred             ccCCCCCCCC-----CCCccccHHHHHHHHHHHHHhcccchHHHHHHHHHhcccCCCCCCCCCCCCCcchhhhhHH
Confidence            3567777773     899999999998888877764433 23333444333333444443333332 334555544


No 30 
>PRK09687 putative lyase; Provisional
Probab=26.16  E-value=58  Score=33.88  Aligned_cols=39  Identities=31%  Similarity=0.210  Sum_probs=26.8

Q ss_pred             hhhhhhccccccccchhhHHHHHhccCCH-HHHHHHHHHH
Q psy17488        224 LYKLLVPLLRCDTTDVRDAVVHALGNINS-EALKDLMEEL  262 (1446)
Q Consensus       224 lykllvpllrcdttdvrdavvhalgnins-ealkdlmeel  262 (1446)
                      +..+|..++.-+...||-++|+|||+|.. +++..|+.-|
T Consensus       129 a~~~l~~~~~D~~~~VR~~a~~aLg~~~~~~ai~~L~~~L  168 (280)
T PRK09687        129 IVEQSQITAFDKSTNVRFAVAFALSVINDEAAIPLLINLL  168 (280)
T ss_pred             HHHHHHHHhhCCCHHHHHHHHHHHhccCCHHHHHHHHHHh
Confidence            44556666666677899999999999964 4454444433


No 31 
>PF12972 NAGLU_C:  Alpha-N-acetylglucosaminidase (NAGLU) C-terminal domain;  InterPro: IPR024732 Alpha-N-acetylglucosaminidase is a lysosomal enzyme required for the stepwise degradation of heparan sulphate []. Mutations on the alpha-N-acetylglucosaminidase (NAGLU) gene can lead to Mucopolysaccharidosis type IIIB (MPS IIIB, or Sanfilippo syndrome type B) characterised by neurological dysfunction but relatively mild somatic manifestations []. The structure shows that the enzyme is composed of three domains. This C-terminal domain has an all alpha helical fold [].; PDB: 2VC9_A 2VCC_A 2VCB_A 2VCA_A 4A4A_A.
Probab=26.13  E-value=74  Score=33.33  Aligned_cols=88  Identities=19%  Similarity=0.178  Sum_probs=38.2

Q ss_pred             hhhHHHHHHHH--hhhccccCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCcCCCCCCCCCCCCccchhhhhhccccc
Q psy17488        157 DSFLSLWRYLA--IFSARVVPPVPYPTPRCASPDLSLSWSPENVGGERGGAGVENKPAPTASPPPSPTGLYKLLVPLLRC  234 (1446)
Q Consensus       157 dsflslwryla--ifsarvvppvpyptprcaspdlslswspenvggerggagvenkpaptaspppsptglykllvpllrc  234 (1446)
                      +.....|..|.  |++.......+....-|+-|.|........               ....-+=.|.-|.+.+--||.|
T Consensus        16 ~~~~~AW~iL~~tvY~~~~~~~~~~~si~~~rPsl~~~~~~~~---------------~~~~~~Yd~~~l~~A~~~ll~a   80 (267)
T PF12972_consen   16 PEAREAWQILLRTVYNNTGGQYGRGESIFCARPSLNGNSASTW---------------GPPGIWYDPADLEKAWRLLLKA   80 (267)
T ss_dssp             HHHHHHHHHHHHTTTS---S---SS--GGGS-S-SS---SSTT---------------------S-HHHHHHHHHHHHHC
T ss_pred             HHHHHHHHHHHhhhCCCCCCCCCCCcCceeeCCCCCcccccCC---------------CCCCCCCCHHHHHHHHHHHHHh
Confidence            34567788775  333332222222355689998874321111               1111223455566666666665


Q ss_pred             cc---------cchhhHHHHHhccCCHHHHHHHH
Q psy17488        235 DT---------TDVRDAVVHALGNINSEALKDLM  259 (1446)
Q Consensus       235 dt---------tdvrdavvhalgninsealkdlm  259 (1446)
                      ..         -||-|..-|+|+|.-...+.+|+
T Consensus        81 ~~~l~~~~~yryDlvDvtRQvL~n~~~~~~~~~~  114 (267)
T PF12972_consen   81 ADELKDSETYRYDLVDVTRQVLSNYADELYQQLV  114 (267)
T ss_dssp             HHHHTT-HHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHhcCCCchHHhHHHHHHHHHHHHHHHHHHHHHH
Confidence            44         25666666777665555444444


No 32 
>PF13976 gag_pre-integrs:  GAG-pre-integrase domain
Probab=24.84  E-value=50  Score=26.71  Aligned_cols=20  Identities=30%  Similarity=0.416  Sum_probs=16.1

Q ss_pred             hHHHHH-hccCCHHHHHHHHH
Q psy17488        241 DAVVHA-LGNINSEALKDLME  260 (1446)
Q Consensus       241 davvha-lgninsealkdlme  260 (1446)
                      -.+.|+ ||-+|..+|+.|+.
T Consensus        24 ~~lWH~RLGH~~~~~l~~l~~   44 (67)
T PF13976_consen   24 TDLWHRRLGHPSFQVLKKLSK   44 (67)
T ss_pred             HHHHHHhcCCCCHHHHHHHhh
Confidence            345564 99999999999887


No 33 
>KOG0414|consensus
Probab=24.72  E-value=55  Score=41.23  Aligned_cols=31  Identities=32%  Similarity=0.519  Sum_probs=26.8

Q ss_pred             hhhhccccccccchhhHHHHHhccCCHHHHH
Q psy17488        226 KLLVPLLRCDTTDVRDAVVHALGNINSEALK  256 (1446)
Q Consensus       226 kllvpllrcdttdvrdavvhalgninsealk  256 (1446)
                      -+||+||.|..-++|.|||.++||+--.+|.
T Consensus       315 ~~lv~lld~es~~lRnavlei~~n~V~~~l~  345 (1251)
T KOG0414|consen  315 TLLVDLLDSESYTLRNAVLEICANLVASELR  345 (1251)
T ss_pred             HHHHHhcCCchHHHHHHHHHHHHHHHHHHhc
Confidence            4688999999999999999999998666555


No 34 
>KOG2549|consensus
Probab=24.47  E-value=1.1e+02  Score=36.19  Aligned_cols=48  Identities=31%  Similarity=0.380  Sum_probs=39.0

Q ss_pred             HhccCCHHHHHHHHHHHHHHHHHHhhhhhHHHHHHhhh--------chhhhcccCC
Q psy17488        246 ALGNINSEALKDLMEELSVYIREAVDRKTENMRRRRRR--------DALRLQSYRP  293 (1446)
Q Consensus       246 algninsealkdlmeelsvyireavdrktenmrrrrrr--------dalrlqsyrp  293 (1446)
                      .++|||.||++-|-++|.--|+|+|..-++-|+--.|+        -|||+-.--|
T Consensus        25 Gi~nl~deaa~~La~dv~yrikEI~Q~aaKfm~hskR~kLtv~DV~~ALr~~nVep   80 (576)
T KOG2549|consen   25 GITNLNDEAALLLAEDVEYRIKEIVQDAAKFMVHSKRTKLTVDDVDYALRSLNVEP   80 (576)
T ss_pred             CccccCHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCcCcHHHHHHHHhhccccc
Confidence            46799999999999999999999999999999876654        2566655444


No 35 
>cd00020 ARM Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Probab=24.37  E-value=78  Score=25.84  Aligned_cols=31  Identities=29%  Similarity=0.417  Sum_probs=26.2

Q ss_pred             cchhhhhhccccccccchhhHHHHHhccCCH
Q psy17488        222 TGLYKLLVPLLRCDTTDVRDAVVHALGNINS  252 (1446)
Q Consensus       222 tglykllvpllrcdttdvrdavvhalgnins  252 (1446)
                      .|+...|+.+|.=+..+++.+.+.+|+||-.
T Consensus         6 ~~~i~~l~~~l~~~~~~~~~~a~~~l~~l~~   36 (120)
T cd00020           6 AGGLPALVSLLSSSDENVQREAAWALSNLSA   36 (120)
T ss_pred             cCChHHHHHHHHcCCHHHHHHHHHHHHHHhc
Confidence            4677888899987778999999999999864


No 36 
>PF14123 DUF4290:  Domain of unknown function (DUF4290)
Probab=23.01  E-value=1.9e+02  Score=30.03  Aligned_cols=71  Identities=28%  Similarity=0.436  Sum_probs=43.7

Q ss_pred             hhhHHHHHHHHhhhcc---ccCCCCCCCCCCCCCCCccccCCCCCCCCCCCCCCcCCCCCCCCCCCCc------cchhhh
Q psy17488        157 DSFLSLWRYLAIFSAR---VVPPVPYPTPRCASPDLSLSWSPENVGGERGGAGVENKPAPTASPPPSP------TGLYKL  227 (1446)
Q Consensus       157 dsflslwrylaifsar---vvppvpyptprcaspdlslswspenvggerggagvenkpaptaspppsp------tglykl  227 (1446)
                      |----||--|||.|.-   |-.|+|-|++             ++         +..||.+-+=|...+      ..+..|
T Consensus        58 Df~hKLWDhL~ImS~FkLDVD~Py~ip~~-------------e~---------l~~kP~~l~Yp~~~i~yRhYG~~i~~m  115 (176)
T PF14123_consen   58 DFKHKLWDHLFIMSDFKLDVDSPYPIPSK-------------EK---------LGKKPEPLPYPQNRIKYRHYGKNIEEM  115 (176)
T ss_pred             hHHHHHHHHHHHHhCCCccCCCCCCCCCH-------------HH---------hccCCCCCCCCCCCCCcccccHHHHHH
Confidence            3345799999998754   4455555443             11         123454444333322      134556


Q ss_pred             hhccccccccchhhHHHHHhcc
Q psy17488        228 LVPLLRCDTTDVRDAVVHALGN  249 (1446)
Q Consensus       228 lvpllrcdttdvrdavvhalgn  249 (1446)
                      +--++-+...+.+||+|-+++|
T Consensus       116 i~ka~~~e~geer~~lv~~iAn  137 (176)
T PF14123_consen  116 IDKAKEMEDGEERDALVRLIAN  137 (176)
T ss_pred             HHHHHhCCCHHHHHHHHHHHHH
Confidence            6667788889999999988875


No 37 
>PF04484 DUF566:  Family of unknown function (DUF566) ;  InterPro: IPR007573 This is a family of related proteins that is plant specific.
Probab=21.43  E-value=91  Score=33.69  Aligned_cols=21  Identities=19%  Similarity=0.487  Sum_probs=14.7

Q ss_pred             hhccc---cccccchhhHHHHHhc
Q psy17488        228 LVPLL---RCDTTDVRDAVVHALG  248 (1446)
Q Consensus       228 lvpll---rcdttdvrdavvhalg  248 (1446)
                      -|||.   +.|+.+|.+||.+|+.
T Consensus       250 rlP~~~ga~~d~~~l~~al~sAvd  273 (311)
T PF04484_consen  250 RLPVTGGAKADVQSLKEALSSAVD  273 (311)
T ss_pred             cccccCCceecHHHHHHHHHHHHH
Confidence            35554   5677788888888764


No 38 
>PF08757 CotH:  CotH protein;  InterPro: IPR014867 Members of this group include the Bacillus subtilis spore coat protein H (CotH). Assembly of CotH requires both CotE and GerE and is reuired for the correct assembly of both inner and outer layers of the coat. CotH appears to be a structural component of the coat being localised at the interface of the 2 coat layers [, , ]. 
Probab=20.95  E-value=2.4e+02  Score=27.77  Aligned_cols=31  Identities=23%  Similarity=0.370  Sum_probs=24.7

Q ss_pred             HHHHh-ccCCHHHHHHHHHHHHHHHHHHhhhh
Q psy17488        243 VVHAL-GNINSEALKDLMEELSVYIREAVDRK  273 (1446)
Q Consensus       243 vvhal-gninsealkdlmeelsvyireavdrk  273 (1446)
                      +...+ |.++.+.|..++..+...|..|+.+-
T Consensus       261 ~~~l~~~~~~~~~~~~~i~~~~~~i~~~~~~d  292 (323)
T PF08757_consen  261 WKELRNGVFSPERLEARIDDYAALIRPAMERD  292 (323)
T ss_pred             HHHHHHccCCHHHHHHHHHHHHHHHHHHHHhC
Confidence            34444 78899999999999999999887654


No 39 
>PF12755 Vac14_Fab1_bd:  Vacuolar 14 Fab1-binding region
Probab=20.81  E-value=93  Score=28.45  Aligned_cols=28  Identities=39%  Similarity=0.553  Sum_probs=21.7

Q ss_pred             hhhhhhccccccc---cchhhHHHHHhccCC
Q psy17488        224 LYKLLVPLLRCDT---TDVRDAVVHALGNIN  251 (1446)
Q Consensus       224 lykllvpllrcdt---tdvrdavvhalgnin  251 (1446)
                      +=++|-|+|.|=+   ..||-+..+||.||-
T Consensus        25 l~~Il~pVL~~~~D~d~rVRy~AcEaL~ni~   55 (97)
T PF12755_consen   25 LDEILPPVLKCFDDQDSRVRYYACEALYNIS   55 (97)
T ss_pred             HHHHHHHHHHHcCCCcHHHHHHHHHHHHHHH
Confidence            4467778888844   248999999999994


No 40 
>PF05069 Phage_tail_S:  Phage virion morphogenesis family ;  InterPro: IPR006522 This entry represents a number of known and suspected phage virion morphogenesis proteins. The S protein from phage P2 is thought to act in tail completion and stable head joining [].
Probab=20.42  E-value=1.4e+02  Score=27.99  Aligned_cols=38  Identities=16%  Similarity=0.372  Sum_probs=27.5

Q ss_pred             cccchhhHHHHHhccCCHHHHHHHHHHHHHHHHHHhhhhh
Q psy17488        235 DTTDVRDAVVHALGNINSEALKDLMEELSVYIREAVDRKT  274 (1446)
Q Consensus       235 dttdvrdavvhalgninsealkdlmeelsvyireavdrkt  274 (1446)
                      |+..|.+++-+++.+  ...+.+||..|..+|+..+++..
T Consensus         2 ~~~~l~~~L~~L~~~--~~~~~~l~~~ig~~l~~~~~~~f   39 (148)
T PF05069_consen    2 DLEQLQRALERLAAQ--PKDRRPLLRKIGEYLRKSTRRRF   39 (148)
T ss_pred             cHHHHHHHHHHHHhC--hHhHHHHHHHHHHHHHHHHHHHH
Confidence            455566666555555  78899999999999988775433


No 41 
>TIGR00274 N-acetylmuramic acid 6-phosphate etherase. This protein, MurQ, is involved in recycling components of the bacterial murein sacculus turned over during cell growth. The cell wall metabolite anhydro-N-acetylmuramic acid (anhMurNAc) is converted by a kinase, AnmK, to MurNAc-phosphate, then converted to N-acetylglucosamine-phosphate by this etherase, called MurQ. This family of proteins is similar to the C-terminal half of a number of vertebrate glucokinase regulator proteins and contains a Prosite pattern which is shared by this group of proteins in a region of local similarity.
Probab=20.33  E-value=71  Score=33.80  Aligned_cols=34  Identities=26%  Similarity=0.475  Sum_probs=25.4

Q ss_pred             chhhHHHHHhccCCHHHHHHHHHHHHHHHHHHhh
Q psy17488        238 DVRDAVVHALGNINSEALKDLMEELSVYIREAVD  271 (1446)
Q Consensus       238 dvrdavvhalgninsealkdlmeelsvyireavd  271 (1446)
                      +|.-|||-+|++++.+..+.++..-.-.||.|+|
T Consensus       258 ~vk~Ai~~~~~~~~~~~a~~~l~~~~g~~~~~l~  291 (291)
T TIGR00274       258 NVKLAIVMILSTLSASEAKVLLDRHGGFLRQALD  291 (291)
T ss_pred             CcHHHHHHHHhCCCHHHHHHHHHHcCCcHHHhhC
Confidence            6777777778888877777777777777777765


No 42 
>TIGR03354 VI_FHA type VI secretion system FHA domain protein. Members of this protein family are FHA (forkhead-associated) domain-containing proteins that are part of type VI secretion loci in a considerable number of bacteria, most of which are known pathogens. Species include Pseudomonas aeruginosa PAO1, Aeromonas hydrophila, Yersinia pestis, Burkholderia mallei, etc.
Probab=20.23  E-value=2.4e+02  Score=31.31  Aligned_cols=47  Identities=30%  Similarity=0.461  Sum_probs=35.2

Q ss_pred             cccccchhhHHHHHhc----cCCHHHHHHHHHHHHHHHHHHhhhhhHHHHHH
Q psy17488        233 RCDTTDVRDAVVHALG----NINSEALKDLMEELSVYIREAVDRKTENMRRR  280 (1446)
Q Consensus       233 rcdttdvrdavvhalg----ninsealkdlmeelsvyireavdrktenmrrr  280 (1446)
                      .+|.+ +..|+..+||    .++.++..+||++|...++.+|+--.+-++.|
T Consensus       204 ~~~~~-~~~af~~GlG~~~~~~~~~~~~~~~~~lG~~lr~~v~Gl~~lL~~R  254 (396)
T TIGR03354       204 RSDAV-DLTPFLRGLGLPLELLDSQAAEDVLEELGRLLRAMVEGLLQLLRAR  254 (396)
T ss_pred             chhhH-HHHHHHHhCCCCcccCChhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            55555 5557777776    36677778899999999999998877777654


Done!