Query         psy15563
Match_columns 614
No_of_seqs    291 out of 453
Neff          5.7 
Searched_HMMs 46136
Date          Sat Aug 17 00:27:52 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy15563.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15563hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 cd00257 Fascin Fascin-like dom  99.9 7.9E-22 1.7E-26  178.7  16.0  117  261-382     2-119 (119)
  2 PF06268 Fascin:  Fascin domain  99.8 1.1E-19 2.4E-24  163.7  12.2  110   37-176     1-111 (111)
  3 PF06268 Fascin:  Fascin domain  99.8 1.6E-19 3.6E-24  162.6  12.9  110  269-382     1-111 (111)
  4 cd00257 Fascin Fascin-like dom  99.8 3.2E-19 6.9E-24  161.6  14.3  117   29-176     2-119 (119)
  5 PF06229 FRG1:  FRG1-like famil  99.7 2.8E-16 6.2E-21  154.4  10.0   90   84-178    27-117 (191)
  6 PF06229 FRG1:  FRG1-like famil  99.5 3.4E-13 7.3E-18  132.8  12.5   97  286-385    21-118 (191)
  7 PF04601 DUF569:  Protein of un  99.0 1.3E-09 2.7E-14  102.5  10.9  109   99-224     7-133 (142)
  8 PF04601 DUF569:  Protein of un  98.7 3.7E-07 8.1E-12   86.0  14.2  101  262-365     9-118 (142)
  9 KOG3962|consensus               98.6 1.4E-07 3.1E-12   93.6   9.9   97  262-363    49-149 (246)
 10 KOG3962|consensus               98.5 1.6E-07 3.4E-12   93.3   5.1   79   89-172    86-165 (246)
 11 smart00791 Agglutinin Amaranth  98.0 0.00019   4E-09   67.0  13.3  112  261-376     6-131 (139)
 12 PF00167 FGF:  Fibroblast growt  97.9 0.00061 1.3E-08   62.5  15.2  101  307-408     2-105 (122)
 13 PF07468 Agglutinin:  Agglutini  97.7  0.0011 2.4E-08   63.2  13.7  120  262-382     7-151 (153)
 14 cd00058 FGF Acidic and basic f  97.5  0.0026 5.7E-08   58.9  13.5   97  310-407     3-102 (123)
 15 PF00167 FGF:  Fibroblast growt  97.4  0.0046 9.9E-08   56.7  14.2  101  263-368     3-107 (122)
 16 smart00442 FGF Acidic and basi  97.4  0.0067 1.4E-07   56.4  14.9  100  307-407     4-106 (126)
 17 PF14200 RicinB_lectin_2:  Rici  97.1  0.0075 1.6E-07   53.4  11.9   92  291-383     1-105 (105)
 18 smart00791 Agglutinin Amaranth  97.1  0.0057 1.2E-07   57.3  11.3   97  307-403     7-111 (139)
 19 cd00058 FGF Acidic and basic f  97.0  0.0044 9.5E-08   57.4   9.1   76  102-179     3-78  (123)
 20 PF14200 RicinB_lectin_2:  Rici  96.9   0.014 2.9E-07   51.7  11.2   88   88-176     4-104 (105)
 21 PF07468 Agglutinin:  Agglutini  96.9   0.012 2.5E-07   56.4  11.0  100  490-606     7-125 (153)
 22 smart00442 FGF Acidic and basi  96.4   0.026 5.7E-07   52.5   9.9   78  102-179     5-82  (126)
 23 KOG3885|consensus               95.1   0.079 1.7E-06   51.1   7.7   74  307-382    29-103 (155)
 24 PF05270 AbfB:  Alpha-L-arabino  94.2     1.4 3.1E-05   42.0  13.8  108  262-371     2-124 (142)
 25 KOG3885|consensus               90.4     1.1 2.5E-05   43.3   7.8   61  101-162    29-90  (155)
 26 PF05270 AbfB:  Alpha-L-arabino  81.4      12 0.00026   35.8   9.5  106   34-163     8-122 (142)
 27 cd00161 RICIN Ricin-type beta-  64.3      91   0.002   26.6  11.4   85  269-363     9-100 (124)
 28 PF03498 CDtoxinA:  Cytolethal   63.5      21 0.00046   34.4   6.5   78  101-180    53-145 (150)
 29 KOG1407|consensus               58.6 2.5E+02  0.0055   29.9  16.4   21  489-510   191-212 (313)
 30 PF03498 CDtoxinA:  Cytolethal   52.5      77  0.0017   30.6   8.3   57  305-363    51-111 (150)
 31 KOG0308|consensus               47.7      43 0.00094   39.2   6.6   63  101-163   206-281 (735)
 32 KOG2243|consensus               46.8      40 0.00087   42.6   6.3  204  145-361   122-377 (5019)
 33 PF00652 Ricin_B_lectin:  Ricin  43.9 1.5E+02  0.0033   25.6   8.4   93  262-363     4-103 (124)
 34 COG2706 3-carboxymuconate cycl  40.2 4.6E+02    0.01   28.8  12.6  196  106-351    97-329 (346)
 35 COG0211 RpmA Ribosomal protein  33.9 1.1E+02  0.0024   26.8   5.5   61  483-546    13-82  (87)
 36 cd00161 RICIN Ricin-type beta-  30.9 3.3E+02  0.0071   23.0  10.9   59   88-156    37-99  (124)
 37 KOG1446|consensus               30.3 6.7E+02   0.015   27.2  11.7    6  442-447    89-94  (311)
 38 PF00652 Ricin_B_lectin:  Ricin  27.9 3.9E+02  0.0084   22.9   8.8   61   88-157    39-103 (124)
 39 KOG0649|consensus               26.4 8.1E+02   0.017   26.1  11.8   52  262-320   161-213 (325)
 40 PF02751 TFIIA_gamma_C:  Transc  25.0 1.2E+02  0.0026   24.3   3.9   33   86-124    19-52  (52)
 41 smart00458 RICIN Ricin-type be  22.0   5E+02   0.011   22.1  10.8   63  290-363    28-93  (117)
 42 COG5402 Uncharacterized conser  21.9 1.8E+02  0.0038   29.0   5.1  105   80-187    35-153 (194)
 43 smart00472 MIR Domain in ryano  21.6 2.3E+02   0.005   21.8   5.0   38  100-137     8-56  (57)
 44 COG3211 PhoX Predicted phospha  21.2 4.4E+02  0.0096   31.0   8.9   88   88-178   287-381 (616)

No 1  
>cd00257 Fascin Fascin-like domain; members include actin-bundling/crosslinking proteins facsin, histoactophilin and singed;  identified in sea urchin, Drosophila, Xenopus, rodents, and humans; The fascin-like domain adopts a beta-trefoil topology and contains an internal threefold repeat; the fascin subgroup contains four copies of the domain; Structurally similar to fibroblast  growth factor (FGF)
Probab=99.88  E-value=7.9e-22  Score=178.66  Aligned_cols=117  Identities=33%  Similarity=0.471  Sum_probs=109.2

Q ss_pred             eEEEEEECCCceEEEecCCeEEEecCCCCCCCceEEEEEcCCCCccEEEEEecCCcEEEEccCCeEEEeccCCCCCceEE
Q psy15563        261 WTIGLINAQFKYLTAETFGFKINANGASLKKKQVWILEPANGGDTLSVYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQ  340 (614)
Q Consensus       261 ~~V~L~s~~GkYLtae~~g~~V~An~~~l~~~E~F~Le~~~~~~~~~v~LRS~~GkYLs~~~~G~V~a~a~~~g~~E~F~  340 (614)
                      .+|+|++.+||||++++.|..|.|+++.++++|+|+|++.   +.+.|+|||++||||+++.+|.|.++++ ++++|+|+
T Consensus         2 p~v~Lrs~~gkyl~~~~~g~~v~a~~~~~~~~e~F~l~~~---~~g~v~Lrs~~G~yls~~~~g~l~~~~~-~~~~e~F~   77 (119)
T cd00257           2 PQVVLRSVNGRYLSAEAGGDKVDANRDSLKGDETFTLEFD---NTGKYALRSHDGKYLSADSDGGVQLEGH-PNADCRFT   77 (119)
T ss_pred             cEEEEEEcCCCEEEEeccCCEEEEcCccCCCceEEEEEEC---CCCeEEEEECCCcEEEEECCCCEEecCC-CCCCcEEE
Confidence            3899999999999999887799999999999999999973   4679999999999999999999999999 99999999


Q ss_pred             EEEeCCCeEEEEECCCCeEEEee-CCeEEEeecCCCCCccEEE
Q psy15563        341 IHVSDEGKWAFKNKTRGYFLGAS-SDKLICTAKIPANPEYWLA  382 (614)
Q Consensus       341 le~~~dGk~aLrs~~~G~yL~a~-~G~L~a~~~s~g~~E~F~v  382 (614)
                      +|++++|+|+||+. +|+||+++ +|.|.|+++.++++|.|.+
T Consensus        78 ~e~~~~g~~al~~~-~G~yl~~~~~g~l~~~~~~~~~~e~f~~  119 (119)
T cd00257          78 LEFHGDGKWALRAE-NGRYLGGDGSGTLKASSETVGPDELFEL  119 (119)
T ss_pred             EEECCCCeEEEEcC-CCCEEeecCCCeEEEecCCCCccceecC
Confidence            99998899999999 99999996 7999999999999999974


No 2  
>PF06268 Fascin:  Fascin domain;  InterPro: IPR022768  This family consists of several eukaryotic fascin or singed proteins. The fascins are a structurally unique and evolutionarily conserved group of actin cross-linking proteins. Fascins function in the organisation of two major forms of actin-based structures: dynamic, cortical cell protrusions and cytoplasmic microfilament bundles. The cortical structures, which include filopodia, spikes, lamellipodial ribs, oocyte microvilli and the dendrites of dendritic cells, have roles in cell-matrix adhesion, cell interactions and cell migration, whereas the cytoplasmic actin bundles appear to participate in cell architecture []. Dictyostelium hisactophilin, another actin-binding protein, is a submembranous pH sensor that signals slight changes of the H+ concentration to actin by inducing actin polymerisation and binding to microfilaments only at pH values below seven []. Members of this family are histidine rich, typically contain the repeated motif of HHXH []. ; GO: 0030674 protein binding, bridging, 0051015 actin filament binding; PDB: 1DFC_B 3P53_B 3LLP_A 1HCE_A 1HCD_A.
Probab=99.82  E-value=1.1e-19  Score=163.74  Aligned_cols=110  Identities=35%  Similarity=0.516  Sum_probs=101.2

Q ss_pred             CCceeeeeecCeeEEeccccccccccccccceecceeecccccccccccceeEEEEecCCCCceEEEEeecCCceEEecC
Q psy15563         37 QFKYLTAETFGFKINANGASLKKKQFKYLTAETFGFKINANGASLKKKQDGVWILEPANGGDTLSVYLRSHLGKYLAVDS  116 (614)
Q Consensus        37 ~~~YLtaE~fgfkvna~~~slkkkQ~~~~~~~~~~~~~~~~~~~~~~~q~~vW~l~~~~~~~~~~v~lkSh~GrYL~~d~  116 (614)
                      +++|+++|.+|++|+|++.+|++.|                          +|+|+..++  +..|+||||+||||++|.
T Consensus         1 ~~~~~~~~k~~~~l~an~~~~~~~e--------------------------~f~le~~~~--~~~v~lrs~~GkYls~~~   52 (111)
T PF06268_consen    1 ANGYLVSEKFGAHLNANRASLSDWE--------------------------TFQLEFDDG--SYKVALRSHNGKYLSVDS   52 (111)
T ss_dssp             TTEEEEETTCTCBEEEEESSSSCGG--------------------------SEEEEEETT--EEEEEEECTTSEEEEEET
T ss_pred             CCcEEEEEEcCCEEECChhcCcccE--------------------------EEEEEEECC--CCEEEEEcCCCCEEEEcC
Confidence            5899999999999999999999999                          999998765  567999999999999999


Q ss_pred             CeeEEEecccCCCCceEEEEEcCCCeEEEEeeccCceee-ecCCeEEEEecCCCCCcceEE
Q psy15563        117 FGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLG-ASSDKLICTAKIPANPEYWLA  176 (614)
Q Consensus       117 dG~v~~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~-~~~d~~~c~~~~~~~~~~W~~  176 (614)
                      ||.|.|+++.+++++.|+|+.+  |+|.++...+|+||+ ...+.|.|.++.+++.|+|++
T Consensus        53 ~G~v~~~~~~~~~~~~F~i~~~--~~~~~~~~~nGkYl~~~~~g~l~a~~~~~~~~elf~~  111 (111)
T PF06268_consen   53 DGSVVADSETPGPDEFFEIEWH--GGKVALRASNGKYLSAGPNGQLKANATSPGKDELFEY  111 (111)
T ss_dssp             TSEEEEEESSSSGGGCBEEEEE--TTEEEEECTTSCEEEEETTTEEEEEESSSSGGGEEEE
T ss_pred             CCeEEecCCCCCCCcEEEEEEC--CCEEEEECCCCCEEeeCCCCeEEEcCCCCCcceEEeC
Confidence            9999999999999999999999  666666669999999 688999999999999999975


No 3  
>PF06268 Fascin:  Fascin domain;  InterPro: IPR022768  This family consists of several eukaryotic fascin or singed proteins. The fascins are a structurally unique and evolutionarily conserved group of actin cross-linking proteins. Fascins function in the organisation of two major forms of actin-based structures: dynamic, cortical cell protrusions and cytoplasmic microfilament bundles. The cortical structures, which include filopodia, spikes, lamellipodial ribs, oocyte microvilli and the dendrites of dendritic cells, have roles in cell-matrix adhesion, cell interactions and cell migration, whereas the cytoplasmic actin bundles appear to participate in cell architecture []. Dictyostelium hisactophilin, another actin-binding protein, is a submembranous pH sensor that signals slight changes of the H+ concentration to actin by inducing actin polymerisation and binding to microfilaments only at pH values below seven []. Members of this family are histidine rich, typically contain the repeated motif of HHXH []. ; GO: 0030674 protein binding, bridging, 0051015 actin filament binding; PDB: 1DFC_B 3P53_B 3LLP_A 1HCE_A 1HCD_A.
Probab=99.81  E-value=1.6e-19  Score=162.60  Aligned_cols=110  Identities=37%  Similarity=0.564  Sum_probs=100.8

Q ss_pred             CCceEEEecCCeEEEecCCCCCCCceEEEEEcCCCCccEEEEEecCCcEEEEccCCeEEEeccCCCCCceEEEEEeCCCe
Q psy15563        269 QFKYLTAETFGFKINANGASLKKKQVWILEPANGGDTLSVYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGK  348 (614)
Q Consensus       269 ~GkYLtae~~g~~V~An~~~l~~~E~F~Le~~~~~~~~~v~LRS~~GkYLs~~~~G~V~a~a~~~g~~E~F~le~~~dGk  348 (614)
                      .++|++++.++++|.||+..++++|+|+||...+  ...|+|||++||||+++.+|.|+|+++.+++++.|+|+++ +++
T Consensus         1 ~~~~~~~~k~~~~l~an~~~~~~~e~f~le~~~~--~~~v~lrs~~GkYls~~~~G~v~~~~~~~~~~~~F~i~~~-~~~   77 (111)
T PF06268_consen    1 ANGYLVSEKFGAHLNANRASLSDWETFQLEFDDG--SYKVALRSHNGKYLSVDSDGSVVADSETPGPDEFFEIEWH-GGK   77 (111)
T ss_dssp             TTEEEEETTCTCBEEEEESSSSCGGSEEEEEETT--EEEEEEECTTSEEEEEETTSEEEEEESSSSGGGCBEEEEE-TTE
T ss_pred             CCcEEEEEEcCCEEECChhcCcccEEEEEEEECC--CCEEEEEcCCCCEEEEcCCCeEEecCCCCCCCcEEEEEEC-CCE
Confidence            3689999999999999999999999999997653  7788999999999999999999999999999999999999 677


Q ss_pred             EEEEECCCCeEEEee-CCeEEEeecCCCCCccEEE
Q psy15563        349 WAFKNKTRGYFLGAS-SDKLICTAKIPANPEYWLA  382 (614)
Q Consensus       349 ~aLrs~~~G~yL~a~-~G~L~a~~~s~g~~E~F~v  382 (614)
                      ++|++. +|+||++. +|.|.|+++.|+++|+|.+
T Consensus        78 ~~~~~~-nGkYl~~~~~g~l~a~~~~~~~~elf~~  111 (111)
T PF06268_consen   78 VALRAS-NGKYLSAGPNGQLKANATSPGKDELFEY  111 (111)
T ss_dssp             EEEECT-TSCEEEEETTTEEEEEESSSSGGGEEEE
T ss_pred             EEEECC-CCCEEeeCCCCeEEEcCCCCCcceEEeC
Confidence            777777 99999975 7999999999999999975


No 4  
>cd00257 Fascin Fascin-like domain; members include actin-bundling/crosslinking proteins facsin, histoactophilin and singed;  identified in sea urchin, Drosophila, Xenopus, rodents, and humans; The fascin-like domain adopts a beta-trefoil topology and contains an internal threefold repeat; the fascin subgroup contains four copies of the domain; Structurally similar to fibroblast  growth factor (FGF)
Probab=99.81  E-value=3.2e-19  Score=161.64  Aligned_cols=117  Identities=33%  Similarity=0.462  Sum_probs=108.1

Q ss_pred             EEEEEEecCCceeeeeecCeeEEeccccccccccccccceecceeecccccccccccceeEEEEecCCCCceEEEEeecC
Q psy15563         29 WTIGLINAQFKYLTAETFGFKINANGASLKKKQFKYLTAETFGFKINANGASLKKKQDGVWILEPANGGDTLSVYLRSHL  108 (614)
Q Consensus        29 ~~~Glin~~~~YLtaE~fgfkvna~~~slkkkQ~~~~~~~~~~~~~~~~~~~~~~~q~~vW~l~~~~~~~~~~v~lkSh~  108 (614)
                      +++.|.+.+||||++|..|+.|.|++..+..+|                          +|.+++.+.   ..|.|||+.
T Consensus         2 p~v~Lrs~~gkyl~~~~~g~~v~a~~~~~~~~e--------------------------~F~l~~~~~---g~v~Lrs~~   52 (119)
T cd00257           2 PQVVLRSVNGRYLSAEAGGDKVDANRDSLKGDE--------------------------TFTLEFDNT---GKYALRSHD   52 (119)
T ss_pred             cEEEEEEcCCCEEEEeccCCEEEEcCccCCCce--------------------------EEEEEECCC---CeEEEEECC
Confidence            689999999999999999999999999999988                          999998643   369999999


Q ss_pred             CceEEecCCeeEEEecccCCCCceEEEEEcCCCeEEEEeeccCceeeec-CCeEEEEecCCCCCcceEE
Q psy15563        109 GKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGAS-SDKLICTAKIPANPEYWLA  176 (614)
Q Consensus       109 GrYL~~d~dG~v~~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~-~d~~~c~~~~~~~~~~W~~  176 (614)
                      ||||+++++|.|+++++ +++.|+|.++.++||+|+||+ .+|+||+++ .+.|.|.++++++.|+|.+
T Consensus        53 G~yls~~~~g~l~~~~~-~~~~e~F~~e~~~~g~~al~~-~~G~yl~~~~~g~l~~~~~~~~~~e~f~~  119 (119)
T cd00257          53 GKYLSADSDGGVQLEGH-PNADCRFTLEFHGDGKWALRA-ENGRYLGGDGSGTLKASSETVGPDELFEL  119 (119)
T ss_pred             CcEEEEECCCCEEecCC-CCCCcEEEEEECCCCeEEEEc-CCCCEEeecCCCeEEEecCCCCccceecC
Confidence            99999999999999999 999999999999999999999 699999996 6789999999999999974


No 5  
>PF06229 FRG1:  FRG1-like family;  InterPro: IPR010414 This entry represents Frg1 (FSHD region gene 1), a protein that is considered to be a candidate for facioscapulohumeral muscular dystrophy (FSHD). FSHD is a dominant neuromuscular disorder caused by deletions in a number of tandem repeat units (called D4Z4) located on chromosome 4q35. D4Z4 contains a transcriptional silencer whose deletion causes the over-expression in skeletal muscle of 4q35 genes, including Frg1 [, ]. Frg1 is localised to nucleoli and appears to be a component of the human spliceosome, but its exact function is unknown [].; PDB: 2YUG_A.
Probab=99.66  E-value=2.8e-16  Score=154.45  Aligned_cols=90  Identities=24%  Similarity=0.378  Sum_probs=59.3

Q ss_pred             ccceeEEEEecCCCCceEEEEeecCCceEEecCCeeEEEecccCCCCceEEEEEcCCCeEEEEeeccCceeeecC-CeEE
Q psy15563         84 KQDGVWILEPANGGDTLSVYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASS-DKLI  162 (614)
Q Consensus        84 ~q~~vW~l~~~~~~~~~~v~lkSh~GrYL~~d~dG~v~~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~~-d~~~  162 (614)
                      +|  |||+.++++  +..|.||||+||||+||++|.|+|.+||+||.|+|+++++ +|++++++..++.||+.++ +.|.
T Consensus        27 ~q--V~va~~v~~--~~~iafKs~~GkYLs~Dk~G~v~a~sdAiGp~E~f~~V~~-~~~~a~~~~~~~~FLs~~~~~~i~  101 (191)
T PF06229_consen   27 RQ--VWVATRVPG--DEKIAFKSGHGKYLSCDKDGIVSARSDAIGPQEQFEPVFQ-DGKPALFSSSNNKFLSVDEEGDIR  101 (191)
T ss_dssp             TT---EEEEE--S--SS-EEEEETTS-BEEE-SSSBEEE--SS--TTTBEEEE-S-TT--EEEE-TTS-BEEE-SSS-EE
T ss_pred             hH--eEEEEEecC--CCceEeeccCccEEEEcCCCcEEEEeecCCCceEEEEEEC-CCCeEEEecCCCeEEEEecccCee
Confidence            56  999999975  4579999999999999999999999999999999999998 6788888779999999965 5688


Q ss_pred             EEecCCCCCcceEEEE
Q psy15563        163 CTAKIPANPEYWLAHL  178 (614)
Q Consensus       163 c~~~~~~~~~~W~~hl  178 (614)
                      |.++++++.|++.|+-
T Consensus       102 a~s~~a~~~e~~~iR~  117 (191)
T PF06229_consen  102 ADSKTAGENEMIKIRS  117 (191)
T ss_dssp             E--S---TTT--EEEE
T ss_pred             eccccCCCCceEEEEE
Confidence            9999999999988854


No 6  
>PF06229 FRG1:  FRG1-like family;  InterPro: IPR010414 This entry represents Frg1 (FSHD region gene 1), a protein that is considered to be a candidate for facioscapulohumeral muscular dystrophy (FSHD). FSHD is a dominant neuromuscular disorder caused by deletions in a number of tandem repeat units (called D4Z4) located on chromosome 4q35. D4Z4 contains a transcriptional silencer whose deletion causes the over-expression in skeletal muscle of 4q35 genes, including Frg1 [, ]. Frg1 is localised to nucleoli and appears to be a component of the human spliceosome, but its exact function is unknown [].; PDB: 2YUG_A.
Probab=99.47  E-value=3.4e-13  Score=132.82  Aligned_cols=97  Identities=24%  Similarity=0.366  Sum_probs=61.2

Q ss_pred             CCCCCCCceEEEEEcCCCCccEEEEEecCCcEEEEccCCeEEEeccCCCCCceEEEEEeCCCeEEEEECCCCeEEEeeC-
Q psy15563        286 GASLKKKQVWILEPANGGDTLSVYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASS-  364 (614)
Q Consensus       286 ~~~l~~~E~F~Le~~~~~~~~~v~LRS~~GkYLs~~~~G~V~a~a~~~g~~E~F~le~~~dGk~aLrs~~~G~yL~a~~-  364 (614)
                      +..+.+.|+|++...++  +.+|+|||++||||+++++|.|+|+++++|++|+|++++. +|++++.+..+++||++++ 
T Consensus        21 ~~~p~p~qV~va~~v~~--~~~iafKs~~GkYLs~Dk~G~v~a~sdAiGp~E~f~~V~~-~~~~a~~~~~~~~FLs~~~~   97 (191)
T PF06229_consen   21 GEGPDPRQVWVATRVPG--DEKIAFKSGHGKYLSCDKDGIVSARSDAIGPQEQFEPVFQ-DGKPALFSSSNNKFLSVDEE   97 (191)
T ss_dssp             S----TTT-EEEEE--S--SS-EEEEETTS-BEEE-SSSBEEE--SS--TTTBEEEE-S-TT--EEEE-TTS-BEEE-SS
T ss_pred             CCCCChhHeEEEEEecC--CCceEeeccCccEEEEcCCCcEEEEeecCCCceEEEEEEC-CCCeEEEecCCCeEEEEecc
Confidence            33456899999998753  6799999999999999999999999999999999999998 5678888834999999974 


Q ss_pred             CeEEEeecCCCCCccEEEEec
Q psy15563        365 DKLICTAKIPANPEYWLAHLA  385 (614)
Q Consensus       365 G~L~a~~~s~g~~E~F~v~La  385 (614)
                      +.|.|+++++++.|.+.|+..
T Consensus        98 ~~i~a~s~~a~~~e~~~iR~~  118 (191)
T PF06229_consen   98 GDIRADSKTAGENEMIKIRSD  118 (191)
T ss_dssp             S-EEE--S---TTT--EEEE-
T ss_pred             cCeeeccccCCCCceEEEEEe
Confidence            679999999999999998744


No 7  
>PF04601 DUF569:  Protein of unknown function (DUF569);  InterPro: IPR007679 This is a family of hypothetical proteins. Some family members contain two copies of the region.
Probab=99.05  E-value=1.3e-09  Score=102.53  Aligned_cols=109  Identities=21%  Similarity=0.278  Sum_probs=84.5

Q ss_pred             ceEEEEeecCCceEEecCCee-EEEecccCCCCceEEEEEcCC--CeEEEEeeccCceeeecCCeEE----------EEe
Q psy15563         99 TLSVYLRSHLGKYLAVDSFGN-VTCEADEKEPGSLFQIHVSDE--GKWAFKNKTRGYFLGASSDKLI----------CTA  165 (614)
Q Consensus        99 ~~~v~lkSh~GrYL~~d~dG~-v~~~~e~~~~~~~F~i~~~~d--G~waLqs~~~g~yl~~~~d~~~----------c~~  165 (614)
                      ...|+||||+|+||.||.||. |+.+....+....|.++.-++  .-.-||| ++||||+++++.+.          +..
T Consensus         7 ~~~VRLRS~~~kYL~ADeDg~~Vs~~~~~~s~na~W~Ve~v~~~~~~v~L~s-aYGrYL~as~~~~~lG~~G~~v~Q~~~   85 (142)
T PF04601_consen    7 GKHVRLRSHHGKYLHADEDGEGVSQDRRGASLNAAWTVERVPGSPNYVRLRS-AYGRYLAASDEPALLGHTGRRVVQTDP   85 (142)
T ss_pred             CCEEEEEecCCCEEEEcCCCCeEEECCCCCCCcceEEEEEecCCCCEEEEee-ccCceEeccCCcCCCCCCCCEEEecCC
Confidence            458999999999999999996 999999999999999987544  7899999 99999999888653          122


Q ss_pred             cCCCCCcceEE-EEcccceeeeccCCCCcccCCCCCCCCceeeeeeCCCC----CcccCCCCCc
Q psy15563        166 KIPANPEYWLA-HLAARPQLRSCGSPPCASIGLPSSRGLPCRYISSSPST----KAVHKPPPLL  224 (614)
Q Consensus       166 ~~~~~~~~W~~-hlA~Hpqv~~~~~~~~~~~~~~~~~~~~~ry~~l~~~~----~~v~~~~pwg  224 (614)
                      ..+...-+|.| .-+.++.++.+++                |||+.++.-    +.|+.|+|-.
T Consensus        86 ~~~d~~~~Wepvr~g~~V~Lr~~~g----------------r~LRANG~~~~Wrn~VT~D~~~~  133 (142)
T PF04601_consen   86 DRLDSSVEWEPVRDGFYVKLRHRSG----------------RYLRANGGYPPWRNSVTVDVPHR  133 (142)
T ss_pred             ccCCCCceEEEecCCCEEEEEecCC----------------ceEEcCCCCCCCcceEEecCCCC
Confidence            22566789998 5555555555543                699887765    3777777754


No 8  
>PF04601 DUF569:  Protein of unknown function (DUF569);  InterPro: IPR007679 This is a family of hypothetical proteins. Some family members contain two copies of the region.
Probab=98.69  E-value=3.7e-07  Score=86.03  Aligned_cols=101  Identities=23%  Similarity=0.329  Sum_probs=78.9

Q ss_pred             EEEEEECCCceEEEecCCeEEEecCCCCCCCceEEEEEcCCCCccEEEEEecCCcEEEEccC-------C-eE-EEeccC
Q psy15563        262 TIGLINAQFKYLTAETFGFKINANGASLKKKQVWILEPANGGDTLSVYLRSHLGKYLAVDSF-------G-NV-TCEADE  332 (614)
Q Consensus       262 ~V~L~s~~GkYLtae~~g~~V~An~~~l~~~E~F~Le~~~~~~~~~v~LRS~~GkYLs~~~~-------G-~V-~a~a~~  332 (614)
                      -|.|+++.||||.|++.|..|.-.+........|.+|...+ +...|.|||+.||||++.+.       | .| +.+.+.
T Consensus         9 ~VRLRS~~~kYL~ADeDg~~Vs~~~~~~s~na~W~Ve~v~~-~~~~v~L~saYGrYL~as~~~~~lG~~G~~v~Q~~~~~   87 (142)
T PF04601_consen    9 HVRLRSHHGKYLHADEDGEGVSQDRRGASLNAAWTVERVPG-SPNYVRLRSAYGRYLAASDEPALLGHTGRRVVQTDPDR   87 (142)
T ss_pred             EEEEEecCCCEEEEcCCCCeEEECCCCCCCcceEEEEEecC-CCCEEEEeeccCceEeccCCcCCCCCCCCEEEecCCcc
Confidence            58999999999999999999988888778889999998764 56799999999999999752       2 23 333334


Q ss_pred             CCCCceEEEEEeCCCeEEEEECCCCeEEEeeCC
Q psy15563        333 KEPGSLFQIHVSDEGKWAFKNKTRGYFLGASSD  365 (614)
Q Consensus       333 ~g~~E~F~le~~~dGk~aLrs~~~G~yL~a~~G  365 (614)
                      ....-.|+.+.. +..+.||+. +|+||.+.++
T Consensus        88 ~d~~~~Wepvr~-g~~V~Lr~~-~gr~LRANG~  118 (142)
T PF04601_consen   88 LDSSVEWEPVRD-GFYVKLRHR-SGRYLRANGG  118 (142)
T ss_pred             CCCCceEEEecC-CCEEEEEec-CCceEEcCCC
Confidence            444455555432 358999998 8999999875


No 9  
>KOG3962|consensus
Probab=98.63  E-value=1.4e-07  Score=93.64  Aligned_cols=97  Identities=21%  Similarity=0.300  Sum_probs=78.3

Q ss_pred             EEEEEECCCceEEEecCC-eEEEecC---CCCCCCceEEEEEcCCCCccEEEEEecCCcEEEEccCCeEEEeccCCCCCc
Q psy15563        262 TIGLINAQFKYLTAETFG-FKINANG---ASLKKKQVWILEPANGGDTLSVYLRSHLGKYLAVDSFGNVTCEADEKEPGS  337 (614)
Q Consensus       262 ~V~L~s~~GkYLtae~~g-~~V~An~---~~l~~~E~F~Le~~~~~~~~~v~LRS~~GkYLs~~~~G~V~a~a~~~g~~E  337 (614)
                      .|++.-..|.||-+.+.| +.+.+..   ++..+.|+|.+...+   +.+|+|+|..||||+++++|.|++.++++++.|
T Consensus        49 ~v~ie~~~~~yl~a~dng~ft~g~ph~~~~gp~p~e~f~avki~---dsrIaLKsGyGKYlsinsdglvvg~qeAvG~~E  125 (246)
T KOG3962|consen   49 TVAIEIDDGTYLGAMDNGLFTLGAPHDEVDGPEPEEQFMAVKIS---DSRIALKSGYGKYLSINSDGLVVGRQEAVGSRE  125 (246)
T ss_pred             EEEEEecCceEEEEEecCceeeccCCccccCCCchhhEEEEEcc---CceEEecccccceeeecCCccEEEehhhcCcHh
Confidence            355555678898877655 7776663   344577888887665   379999999999999999999999999999999


Q ss_pred             eEEEEEeCCCeEEEEECCCCeEEEee
Q psy15563        338 LFQIHVSDEGKWAFKNKTRGYFLGAS  363 (614)
Q Consensus       338 ~F~le~~~dGk~aLrs~~~G~yL~a~  363 (614)
                      +|..+++ .|+.+|.+. ++.|....
T Consensus       126 Qw~~vFq-~~r~a~~as-~s~~~~~~  149 (246)
T KOG3962|consen  126 QWEPVFQ-EGRMALLAS-NSCFIRCN  149 (246)
T ss_pred             hchhhhh-ccceEEeec-cceeEEec
Confidence            9999999 577888877 68877664


No 10 
>KOG3962|consensus
Probab=98.46  E-value=1.6e-07  Score=93.28  Aligned_cols=79  Identities=13%  Similarity=0.120  Sum_probs=56.9

Q ss_pred             EEEEecCCCCceEEEEeecCCceEEecCCeeEEEecccCCCCceEEEEEcCCCeEEEEeeccCceeeecCC-eEEEEecC
Q psy15563         89 WILEPANGGDTLSVYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASSD-KLICTAKI  167 (614)
Q Consensus        89 W~l~~~~~~~~~~v~lkSh~GrYL~~d~dG~v~~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~~d-~~~c~~~~  167 (614)
                      +.|-+.+.   +.|.|||.+||||++++||.|+|.+|++|++|+|+++|+..+.|.|.|.++-+|+  .+. ++..-+++
T Consensus        86 f~avki~d---srIaLKsGyGKYlsinsdglvvg~qeAvG~~EQw~~vFq~~r~a~~as~s~~~~~--~e~~d~~va~kt  160 (246)
T KOG3962|consen   86 FMAVKISD---SRIALKSGYGKYLSINSDGLVVGRQEAVGSREQWEPVFQEGRMALLASNSCFIRC--NEAGDAEVASKT  160 (246)
T ss_pred             EEEEEccC---ceEEecccccceeeecCCccEEEehhhcCcHhhchhhhhccceEEeeccceeEEe--chhhchhhhccc
Confidence            56666653   5899999999999999999999999999999999999995555555554444444  333 33333445


Q ss_pred             CCCCc
Q psy15563        168 PANPE  172 (614)
Q Consensus       168 ~~~~~  172 (614)
                      ++..+
T Consensus       161 ~t~~~  165 (246)
T KOG3962|consen  161 ATEEP  165 (246)
T ss_pred             CCCCc
Confidence            55443


No 11 
>smart00791 Agglutinin Amaranthus caudatus agglutinin or amaranthin is a lectin from the ancient South American crop, amaranth grain. Although its biological function is unknown, it has a high binding specificity for the methyl-glycoside of the T-antigen, found linked to serine or threonine residues of cell surface glycoproteins PUBMED:2271665. The protein is comprised of a homodimer, with each homodimer consisting of two beta-trefoil domains PUBMED:9334739.
Probab=97.95  E-value=0.00019  Score=66.97  Aligned_cols=112  Identities=14%  Similarity=0.096  Sum_probs=80.0

Q ss_pred             eEEEEEECCCceEEEecCC--eEEEecCC-CCCCCceEEEEEcCCCCccEEEEEe-cCCcEEEEccCCeEEEeccCC---
Q psy15563        261 WTIGLINAQFKYLTAETFG--FKINANGA-SLKKKQVWILEPANGGDTLSVYLRS-HLGKYLAVDSFGNVTCEADEK---  333 (614)
Q Consensus       261 ~~V~L~s~~GkYLtae~~g--~~V~An~~-~l~~~E~F~Le~~~~~~~~~v~LRS-~~GkYLs~~~~G~V~a~a~~~---  333 (614)
                      ..|+|++.|||||.+..-+  .=+.-.++ -..+.-.|.++..+   .+.|.+|+ ++||||... +-=|.|+++++   
T Consensus         6 ~~V~FKg~n~kYLry~~~~~~~~lqf~~ddI~dp~v~~ev~~~~---dg~V~ik~~~~nKfWr~s-~~WI~a~s~d~~e~   81 (139)
T smart00791        6 IYVLFKGNNQKYLRYQSIQQYGLLQFSADKILDPLVQFEVFPTY---NGLVHIKSNYTNKFWRLS-HYWITADANDPDEN   81 (139)
T ss_pred             EEEEEEcCCCceEEEEeecccceeEecccccCCcceeEEEEEcC---CCcEEEEecCCCceEccC-CCEEEecCCCCccC
Confidence            4699999999999977422  11222333 33466778888743   56999999 899999765 44577887777   


Q ss_pred             -CCCceEEEEEeCCCeEEEEECCCCeEEEee--CC----eEEEeecCCCC
Q psy15563        334 -EPGSLFQIHVSDEGKWAFKNKTRGYFLGAS--SD----KLICTAKIPAN  376 (614)
Q Consensus       334 -g~~E~F~le~~~dGk~aLrs~~~G~yL~a~--~G----~L~a~~~s~g~  376 (614)
                       ++.+.|..+..+++.++||+.++|+|+.--  +|    -|.|..+++..
T Consensus        82 ~sscTLF~Pv~~d~~~i~lr~vq~~~~~~r~t~~~~~~~CL~A~~~~i~~  131 (139)
T smart00791       82 KSACTLFRPLYVEMKKIRLLNVQLGHYTKRYTVGKSFVSCLNAASSQIDT  131 (139)
T ss_pred             CCcccEEeEEeccCceEEEEEecCCceEEeecCCCCcccceeeccccccc
Confidence             346889999987789999999999998552  23    45555555544


No 12 
>PF00167 FGF:  Fibroblast growth factor;  InterPro: IPR002348 The interleukin-1 (IL1) and heparin-binding growth factor (HBGF) families share low sequence similarity (about 25% []) but have very similar structures. Coupled with the Kunitz-type soybean trypsin inhibitors (STI), they form a structural superfamily. Despite their structural correspondence, however, they show no sequence similarity to the STI family. The crystal structures of interleukin-1 beta and HBGF1 have been solved, showing both families to have the same 12-stranded beta-sheet structure []; the beta-sheets are arranged in 3 similar lobes around a central axis, 6 strands forming an anti-parallel beta-barrel [, ]. The beta-sheets are generally well preserved and the crystal structures superimpose in these areas. The intervening loops are less well conserved - the loop between beta-strands 6 and 7 is slightly longer in interleukin-1 beta.; GO: 0008083 growth factor activity; PDB: 1AFC_F 1BAR_A 2P39_A 1EV2_D 2BFH_A 4FGF_A 1BAS_A 1BFG_A 1FQ9_B 1CVS_A ....
Probab=97.87  E-value=0.00061  Score=62.51  Aligned_cols=101  Identities=19%  Similarity=0.242  Sum_probs=84.8

Q ss_pred             EEEEEecCCcEEEEccCCeEEEeccCCCCCceEEEEEeCCCeEEEEECCCCeEEEeeC-CeEEEeecCCCCCccEEEEec
Q psy15563        307 SVYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASS-DKLICTAKIPANPEYWLAHLA  385 (614)
Q Consensus       307 ~v~LRS~~GkYLs~~~~G~V~a~a~~~g~~E~F~le~~~dGk~aLrs~~~G~yL~a~~-G~L~a~~~s~g~~E~F~v~La  385 (614)
                      .++|-+.+|+||.+.++|.|.++.+..++.+.|+++....|.+.||+...++||+.+. |.|.+.. .+.++..|.-++.
T Consensus         2 ~~~Ly~~~~~~L~i~~~g~V~gt~~~~~~~s~~~i~~~~~g~V~i~~~~s~~YLcmn~~G~ly~~~-~~~~~C~F~e~~~   80 (122)
T PF00167_consen    2 HVQLYCRTGYFLQINPNGTVDGTGDDNSPYSVFEIHSVGFGVVRIRGVKSCRYLCMNKCGRLYGSK-NFNKDCVFREELL   80 (122)
T ss_dssp             EEEEEETTSEEEEEETTSBEEEESSTTSTTGEEEEEEEETTEEEEEETTTTEEEEEBTTSBEEEES-SBTGGGEEEEEEE
T ss_pred             CEEEEECCCeEEEECCCCeEeCCCCcCcceeEEEEEeccceEEEEEEecceEEEEECCCCeEcccc-ccCCCceEEEEEc
Confidence            4677787799999999999999999999999999999977999999999999999975 9999854 4556899998876


Q ss_pred             cCceeEEeee--cceeeeeecCCCc
Q psy15563        386 ARPQVNLRSV--GRKRFAHLSENLD  408 (614)
Q Consensus       386 nrP~v~Lr~v--~~~~yv~ls~~~~  408 (614)
                      ..-...+.++  ++..|+.++.+|.
T Consensus        81 ~n~y~~~~s~~~~~~~yla~~~~G~  105 (122)
T PF00167_consen   81 ENGYNTYESAKYGRGWYLAFNRRGK  105 (122)
T ss_dssp             TTSEEEEEESTTGTTEBCEBCTTSB
T ss_pred             cCCEEEEEeccCCccEEEEECCCCC
Confidence            5566667775  4677888877764


No 13 
>PF07468 Agglutinin:  Agglutinin;  InterPro: IPR008998 Agglutinins are sugar-specific lectins that can agglutinate erythrocytes and other cell types. Lectins occur widely in plants, as well as some microorganisms and animal []. Agglutinin from Amaranthus caudatus (amaranthin) is a lectin from the ancient South American crop, amaranth grain. Although its biological function is unknown, it can agglutinate A, B and O red blood cells, and has a carbohydrate-binding site that is specific for the methyl-glycoside of the T-antigen found linked to serine or threonine residues of cell surface glycoproteins []. The protein is comprised of a homodimer, with each homodimer consisting of two beta-trefoil domains []. Lectin B chains from ricin and related toxins also contain beta-trefoil domain, however they are not related to agglutinin, showing little sequence similarity [].; PDB: 1JLY_B 1JLX_B.
Probab=97.66  E-value=0.0011  Score=63.24  Aligned_cols=120  Identities=18%  Similarity=0.124  Sum_probs=78.0

Q ss_pred             EEEEEECCCceEEEecCC----eEEEecC-CCCCCCceEEEEEcCCCCccEEEEEe-cCCcEEEEccCC--eEEEeccCC
Q psy15563        262 TIGLINAQFKYLTAETFG----FKINANG-ASLKKKQVWILEPANGGDTLSVYLRS-HLGKYLAVDSFG--NVTCEADEK  333 (614)
Q Consensus       262 ~V~L~s~~GkYLtae~~g----~~V~An~-~~l~~~E~F~Le~~~~~~~~~v~LRS-~~GkYLs~~~~G--~V~a~a~~~  333 (614)
                      -|.++..|||||.+...+    .-+.-++ +-..+.-.|++|.+.. .++.|.+|+ ++||||.....+  =|.|+++++
T Consensus         7 ~V~fkg~N~kYLry~~e~~~~~~~LqF~~edi~dP~v~fev~~~~~-~dG~V~Ir~~y~nKfWrr~s~n~~WI~ada~~p   85 (153)
T PF07468_consen    7 YVAFKGDNGKYLRYRTEDIQQYGYLQFSGEDIGDPYVKFEVEPSKT-HDGLVHIRCCYNNKFWRRSSPNDYWIWADADDP   85 (153)
T ss_dssp             CEEEETTTS-EEEEEESSCTTCCEEEEEESSTT-CCG-EEEEE-SS-TTT-EEEEETTTTEEEEESCCC--BEEEEESSH
T ss_pred             EEEEEcCCCcEEEEEecccccceeEEecCCcCCCCceeEEEEEccc-CCCeEEEEeccCCceeEeCCCCCcEEEecCCCc
Confidence            356666899999987622    2243333 3345888999998433 367999999 899999975555  688987765


Q ss_pred             CC------CceEEEEEeCC---CeEEEEECCCCeEEEe--eC----CeEEEeecCCCCCc--cEEE
Q psy15563        334 EP------GSLFQIHVSDE---GKWAFKNKTRGYFLGA--SS----DKLICTAKIPANPE--YWLA  382 (614)
Q Consensus       334 g~------~E~F~le~~~d---Gk~aLrs~~~G~yL~a--~~----G~L~a~~~s~g~~E--~F~v  382 (614)
                      ..      .+.|+++.-++   ..++||+..+++|+.-  .+    +-|.|.+++|....  .|+|
T Consensus        86 ~ed~s~~~cTLF~Pv~vd~~~~~~i~l~~~~n~~~~~r~t~~~~~~sCL~A~~~~i~~~~~~~~~~  151 (153)
T PF07468_consen   86 DEDQSKPSCTLFEPVKVDVKDFNVIALRNMQNGHFCKRLTYGGKFVSCLNAASSSIDREAKDVFTI  151 (153)
T ss_dssp             HH-TCSTCGG-EEEEESCCCETTEEEEEETTTTEEEEEE--STTBSSEEEEEESS--CCGTT-EEE
T ss_pred             ccccCCCCceEEEEEEecCCCccEEEEEecCCceEEEEEccCCcceeeEeeccCcccccccceEEE
Confidence            52      45599987532   5799999999999844  22    45888888886654  5554


No 14 
>cd00058 FGF Acidic and basic fibroblast growth factor family; FGFs are mitogens, which stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family plays essential roles in patterning and differentiation during vertebrate embryogenesis, and has neurotrophic activities. FGFs have a high affinity for heparan sulfate proteoglycans and require heparan sulfate to activate one of four cell surface FGF receptors. Upon binding to FGF, the receptors dimerize and their intracellular tyrosine kinase domains become active. FGFs have internal pseudo-threefold symmetry (beta-trefoil topology).
Probab=97.49  E-value=0.0026  Score=58.88  Aligned_cols=97  Identities=15%  Similarity=0.112  Sum_probs=78.0

Q ss_pred             EEecCCcEEEEccCCeEEEeccCCCCCceEEEEEeCCCeEEEEECCCCeEEEee-CCeEEEeecCCCCCccEEEEeccCc
Q psy15563        310 LRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGAS-SDKLICTAKIPANPEYWLAHLAARP  388 (614)
Q Consensus       310 LRS~~GkYLs~~~~G~V~a~a~~~g~~E~F~le~~~dGk~aLrs~~~G~yL~a~-~G~L~a~~~s~g~~E~F~v~LanrP  388 (614)
                      |-+.+|.||.+..+|.|.++.+..++...|+++....|.++||+...++||+.+ .|.|.+.. .+.++..|.-++...-
T Consensus         3 Ly~~~~~~L~I~~dG~V~Gt~~~~~~~s~l~~~s~~~g~v~i~~v~s~~YLCmn~~G~ly~s~-~~~~dC~F~E~~~~n~   81 (123)
T cd00058           3 LYCRTGFHLQILPDGTVDGTRDDSSSYTILERIAVAVGVVSIKGVASCRYLCMNKCGKLYGSK-GFTEECLFREELLENN   81 (123)
T ss_pred             EEEcCCeEEEEcCCCcEecccCCCCCCceEEEEECCCCEEEEEEcccceEEEECCCCCEEECC-CCCCCCEEEEEEccCC
Confidence            334449999999999999999988889999999987799999999999999997 48898877 7888899987766544


Q ss_pred             eeEEeeec--ceeeeeecCCC
Q psy15563        389 QVNLRSVG--RKRFAHLSENL  407 (614)
Q Consensus       389 ~v~Lr~v~--~~~yv~ls~~~  407 (614)
                      .-..++.-  +.-|+.++.+|
T Consensus        82 Y~~y~S~~~~~~~ylal~~~G  102 (123)
T cd00058          82 YNTYASAKYRRRWYLALNKKG  102 (123)
T ss_pred             cEEEEEcccCCCcEEEECCCC
Confidence            55555542  34677776554


No 15 
>PF00167 FGF:  Fibroblast growth factor;  InterPro: IPR002348 The interleukin-1 (IL1) and heparin-binding growth factor (HBGF) families share low sequence similarity (about 25% []) but have very similar structures. Coupled with the Kunitz-type soybean trypsin inhibitors (STI), they form a structural superfamily. Despite their structural correspondence, however, they show no sequence similarity to the STI family. The crystal structures of interleukin-1 beta and HBGF1 have been solved, showing both families to have the same 12-stranded beta-sheet structure []; the beta-sheets are arranged in 3 similar lobes around a central axis, 6 strands forming an anti-parallel beta-barrel [, ]. The beta-sheets are generally well preserved and the crystal structures superimpose in these areas. The intervening loops are less well conserved - the loop between beta-strands 6 and 7 is slightly longer in interleukin-1 beta.; GO: 0008083 growth factor activity; PDB: 1AFC_F 1BAR_A 2P39_A 1EV2_D 2BFH_A 4FGF_A 1BAS_A 1BFG_A 1FQ9_B 1CVS_A ....
Probab=97.42  E-value=0.0046  Score=56.74  Aligned_cols=101  Identities=14%  Similarity=0.238  Sum_probs=81.2

Q ss_pred             EEEEECCCceEEEecCCeEEEecCCCCCCCceEEEEEcCCCCccEEEEEe-cCCcEEEEccCCeEEEeccCCCCCceEEE
Q psy15563        263 IGLINAQFKYLTAETFGFKINANGASLKKKQVWILEPANGGDTLSVYLRS-HLGKYLAVDSFGNVTCEADEKEPGSLFQI  341 (614)
Q Consensus       263 V~L~s~~GkYLtae~~g~~V~An~~~l~~~E~F~Le~~~~~~~~~v~LRS-~~GkYLs~~~~G~V~a~a~~~g~~E~F~l  341 (614)
                      +.|-+.+|+||.+.. ...|.+.++.-.+...|.++...   .+.|.||+ ..++||+.++.|.|.+. ..+.+++.|.-
T Consensus         3 ~~Ly~~~~~~L~i~~-~g~V~gt~~~~~~~s~~~i~~~~---~g~V~i~~~~s~~YLcmn~~G~ly~~-~~~~~~C~F~e   77 (122)
T PF00167_consen    3 VQLYCRTGYFLQINP-NGTVDGTGDDNSPYSVFEIHSVG---FGVVRIRGVKSCRYLCMNKCGRLYGS-KNFNKDCVFRE   77 (122)
T ss_dssp             EEEEETTSEEEEEET-TSBEEEESSTTSTTGEEEEEEEE---TTEEEEEETTTTEEEEEBTTSBEEEE-SSBTGGGEEEE
T ss_pred             EEEEECCCeEEEECC-CCeEeCCCCcCcceeEEEEEecc---ceEEEEEEecceEEEEECCCCeEccc-cccCCCceEEE
Confidence            557777799999875 34787777777888999999764   36999999 68999999999999996 44666999999


Q ss_pred             EEeCCCeEEEEECCC--CeEEEee-CCeEE
Q psy15563        342 HVSDEGKWAFKNKTR--GYFLGAS-SDKLI  368 (614)
Q Consensus       342 e~~~dGk~aLrs~~~--G~yL~a~-~G~L~  368 (614)
                      ++.+++-..+.+..+  +.||+.+ .|+..
T Consensus        78 ~~~~n~y~~~~s~~~~~~~yla~~~~G~~~  107 (122)
T PF00167_consen   78 ELLENGYNTYESAKYGRGWYLAFNRRGKPR  107 (122)
T ss_dssp             EEETTSEEEEEESTTGTTEBCEBCTTSBBE
T ss_pred             EEccCCEEEEEeccCCccEEEEECCCCCCC
Confidence            888778888888854  8888886 46543


No 16 
>smart00442 FGF Acidic and basic fibroblast growth factor family. Mitogens that stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family play essential roles in patterning and differentiation during vertebrate embryogenesis, and have neurotrophic activities.
Probab=97.39  E-value=0.0067  Score=56.44  Aligned_cols=100  Identities=16%  Similarity=0.118  Sum_probs=78.3

Q ss_pred             EEEEEecCCcEEEEccCCeEEEeccCCCCCceEEEEEeCCCeEEEEECCCCeEEEee-CCeEEEeecCCCCCccEEEEec
Q psy15563        307 SVYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGAS-SDKLICTAKIPANPEYWLAHLA  385 (614)
Q Consensus       307 ~v~LRS~~GkYLs~~~~G~V~a~a~~~g~~E~F~le~~~dGk~aLrs~~~G~yL~a~-~G~L~a~~~s~g~~E~F~v~La  385 (614)
                      .++|-+.+|.||.+.++|.|.++.++..+...|++...+.|.+.||+...++||+.+ .|.|.+... +.++..|.=++.
T Consensus         4 ~~~Ly~~~~~~L~I~~~G~V~Gt~~~~~~~~ile~~s~~~g~V~ik~~~s~~YLCmn~~G~ly~s~~-~~~dC~F~E~~~   82 (126)
T smart00442        4 LRQLYCRNGQHLQILPDGTVDGTRDESSSFTILEIIAVAVGVVAIKGVASCRYLCMNKCGKLYGSKN-FTEDCVFREEME   82 (126)
T ss_pred             EEEEEeCCCeEEEEcCCceEecccCCCCcceEEEEEeccCCEEEEEEcccceEEEECCCCCEEEccc-CCCCcEEEEEec
Confidence            567777788999999999999999988999999999987799999999999999997 489988665 678888886655


Q ss_pred             cCceeEEeeecce--eeeeecCCC
Q psy15563        386 ARPQVNLRSVGRK--RFAHLSENL  407 (614)
Q Consensus       386 nrP~v~Lr~v~~~--~yv~ls~~~  407 (614)
                      ..-.-..++.-.+  -|+.++.+|
T Consensus        83 ~n~y~~y~S~~~~~~~ylal~~~G  106 (126)
T smart00442       83 ENGYNTYASAKYRKRWYVALNKKG  106 (126)
T ss_pred             cCCeEEEEEcccCCceEEEECCCC
Confidence            4344444443222  466665554


No 17 
>PF14200 RicinB_lectin_2:  Ricin-type beta-trefoil lectin domain-like; PDB: 2X2S_C 2X2T_A 2VSE_B 2VSA_A 3EF2_A 2IHO_A 3HZB_H 1YBI_B 3PHZ_A 3NBE_A ....
Probab=97.13  E-value=0.0075  Score=53.36  Aligned_cols=92  Identities=20%  Similarity=0.301  Sum_probs=67.8

Q ss_pred             CCceEEEEEcCCCCccEEEEEe-cCCcEEEEccCC-----eEEEeccCCCCCceEEEEEeCCCeEEEEECCCCeEEEeeC
Q psy15563        291 KKQVWILEPANGGDTLSVYLRS-HLGKYLAVDSFG-----NVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASS  364 (614)
Q Consensus       291 ~~E~F~Le~~~~~~~~~v~LRS-~~GkYLs~~~~G-----~V~a~a~~~g~~E~F~le~~~dGk~aLrs~~~G~yL~a~~  364 (614)
                      +-|.|++....+ ..+.+.|++ ..|+||.+..+.     .|+.-...-.+.++|.|+...+|.+.|++...|++|.+.+
T Consensus         1 ~nQ~W~~~~~~~-~~g~Y~i~n~~sg~~L~v~~~~~~~g~~v~~~~~~~~~~Q~W~i~~~~~g~y~I~n~~s~~~Ldv~~   79 (105)
T PF14200_consen    1 DNQQWTFTPVGD-SDGYYKIRNVNSGKYLDVAGGSTANGTNVQQWTCNGNDNQQWKIEPVGDGYYRIRNKNSGKVLDVAG   79 (105)
T ss_dssp             CGGEEEEEEEET-TTTEEEEEETTTTEEEEEGCTTCSTTEBEEEEESSSSGGGEEEEEESTTSEEEEEETSTTEEEEEGG
T ss_pred             CCCEEEEEEecC-CCCEEEEEECCCCCEEEeCCCCcCCCcEEEEecCCCCcCcEEEEEEecCCeEEEEECCCCcEEEECC
Confidence            358899998642 357899999 799999998643     4555444447899999999988999999999999998752


Q ss_pred             -----C-eEEEeec-CCCCCccEEEE
Q psy15563        365 -----D-KLICTAK-IPANPEYWLAH  383 (614)
Q Consensus       365 -----G-~L~a~~~-s~g~~E~F~v~  383 (614)
                           | .|..-.. .-++..+|.|+
T Consensus        80 ~~~~~g~~v~~~~~~~~~~~Q~W~l~  105 (105)
T PF14200_consen   80 GSTANGTNVQQWEYDNGSDNQQWKLE  105 (105)
T ss_dssp             GSSSTTEBEEEEE-STSSGGGEEEEE
T ss_pred             CCCCCCCEEEEEeCCCCCccCEEEeC
Confidence                 3 3433332 45667777764


No 18 
>smart00791 Agglutinin Amaranthus caudatus agglutinin or amaranthin is a lectin from the ancient South American crop, amaranth grain. Although its biological function is unknown, it has a high binding specificity for the methyl-glycoside of the T-antigen, found linked to serine or threonine residues of cell surface glycoproteins PUBMED:2271665. The protein is comprised of a homodimer, with each homodimer consisting of two beta-trefoil domains PUBMED:9334739.
Probab=97.13  E-value=0.0057  Score=57.26  Aligned_cols=97  Identities=18%  Similarity=0.185  Sum_probs=75.9

Q ss_pred             EEEEEecCCcEEEEcc---CCeEEEeccCCCC-CceEEEEEeCCCeEEEEECCCCeEEEeeCCeEEEeecCCC----CCc
Q psy15563        307 SVYLRSHLGKYLAVDS---FGNVTCEADEKEP-GSLFQIHVSDEGKWAFKNKTRGYFLGASSDKLICTAKIPA----NPE  378 (614)
Q Consensus       307 ~v~LRS~~GkYLs~~~---~G~V~a~a~~~g~-~E~F~le~~~dGk~aLrs~~~G~yL~a~~G~L~a~~~s~g----~~E  378 (614)
                      -|.||+-||+||....   .+-|.+.++.+.+ -..|+++...||.+.+|+.-+|+|=..+..=+.|.++.+.    ...
T Consensus         7 ~V~FKg~n~kYLry~~~~~~~~lqf~~ddI~dp~v~~ev~~~~dg~V~ik~~~~nKfWr~s~~WI~a~s~d~~e~~sscT   86 (139)
T smart00791        7 YVLFKGNNQKYLRYQSIQQYGLLQFSADKILDPLVQFEVFPTYNGLVHIKSNYTNKFWRLSHYWITADANDPDENKSACT   86 (139)
T ss_pred             EEEEEcCCCceEEEEeecccceeEecccccCCcceeEEEEEcCCCcEEEEecCCCceEccCCCEEEecCCCCccCCCccc
Confidence            5899999999999875   4569999998766 7899999998999999999899998777777888887772    345


Q ss_pred             cEEEEeccCceeEEeeecceeeeee
Q psy15563        379 YWLAHLAARPQVNLRSVGRKRFAHL  403 (614)
Q Consensus       379 ~F~v~LanrP~v~Lr~v~~~~yv~l  403 (614)
                      +|..-....-.+.||+++...|++.
T Consensus        87 LF~Pv~~d~~~i~lr~vq~~~~~~r  111 (139)
T smart00791       87 LFRPLYVEMKKIRLLNVQLGHYTKR  111 (139)
T ss_pred             EEeEEeccCceEEEEEecCCceEEe
Confidence            6665444566677777766666653


No 19 
>cd00058 FGF Acidic and basic fibroblast growth factor family; FGFs are mitogens, which stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family plays essential roles in patterning and differentiation during vertebrate embryogenesis, and has neurotrophic activities. FGFs have a high affinity for heparan sulfate proteoglycans and require heparan sulfate to activate one of four cell surface FGF receptors. Upon binding to FGF, the receptors dimerize and their intracellular tyrosine kinase domains become active. FGFs have internal pseudo-threefold symmetry (beta-trefoil topology).
Probab=96.97  E-value=0.0044  Score=57.42  Aligned_cols=76  Identities=16%  Similarity=0.134  Sum_probs=66.5

Q ss_pred             EEEeecCCceEEecCCeeEEEecccCCCCceEEEEEcCCCeEEEEeeccCceeeecCCeEEEEecCCCCCcceEEEEc
Q psy15563        102 VYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASSDKLICTAKIPANPEYWLAHLA  179 (614)
Q Consensus       102 v~lkSh~GrYL~~d~dG~v~~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~~d~~~c~~~~~~~~~~W~~hlA  179 (614)
                      ++.|+  |.||..++||+|.++.+..++...|++.....|..+||++.+++||..+..-..-.++..++...|..++.
T Consensus         3 Ly~~~--~~~L~I~~dG~V~Gt~~~~~~~s~l~~~s~~~g~v~i~~v~s~~YLCmn~~G~ly~s~~~~~dC~F~E~~~   78 (123)
T cd00058           3 LYCRT--GFHLQILPDGTVDGTRDDSSSYTILERIAVAVGVVSIKGVASCRYLCMNKCGKLYGSKGFTEECLFREELL   78 (123)
T ss_pred             EEEcC--CeEEEEcCCCcEecccCCCCCCceEEEEECCCCEEEEEEcccceEEEECCCCCEEECCCCCCCCEEEEEEc
Confidence            44555  99999999999999999999999999999989999999999999999977765555558888899988775


No 20 
>PF14200 RicinB_lectin_2:  Ricin-type beta-trefoil lectin domain-like; PDB: 2X2S_C 2X2T_A 2VSE_B 2VSA_A 3EF2_A 2IHO_A 3HZB_H 1YBI_B 3PHZ_A 3NBE_A ....
Probab=96.88  E-value=0.014  Score=51.72  Aligned_cols=88  Identities=20%  Similarity=0.320  Sum_probs=66.3

Q ss_pred             eEEEEecCCCCceEEEEee-cCCceEEecCC----e-eEEEecccCCCCceEEEEEcCCCeEEEEeeccCceeeecC---
Q psy15563         88 VWILEPANGGDTLSVYLRS-HLGKYLAVDSF----G-NVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASS---  158 (614)
Q Consensus        88 vW~l~~~~~~~~~~v~lkS-h~GrYL~~d~d----G-~v~~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~~---  158 (614)
                      .|++.+.++. ...+.|++ +-|+||.++..    | +|......-.+..+|.|+...||.|.|++...+++|...+   
T Consensus         4 ~W~~~~~~~~-~g~Y~i~n~~sg~~L~v~~~~~~~g~~v~~~~~~~~~~Q~W~i~~~~~g~y~I~n~~s~~~Ldv~~~~~   82 (105)
T PF14200_consen    4 QWTFTPVGDS-DGYYKIRNVNSGKYLDVAGGSTANGTNVQQWTCNGNDNQQWKIEPVGDGYYRIRNKNSGKVLDVAGGST   82 (105)
T ss_dssp             EEEEEEEETT-TTEEEEEETTTTEEEEEGCTTCSTTEBEEEEESSSSGGGEEEEEESTTSEEEEEETSTTEEEEEGGGSS
T ss_pred             EEEEEEecCC-CCEEEEEECCCCCEEEeCCCCcCCCcEEEEecCCCCcCcEEEEEEecCCeEEEEECCCCcEEEECCCCC
Confidence            8999997632 23699998 59999999864    3 4666666568999999999999999999988899998742   


Q ss_pred             ---CeEEEEec-CCCCCcceEE
Q psy15563        159 ---DKLICTAK-IPANPEYWLA  176 (614)
Q Consensus       159 ---d~~~c~~~-~~~~~~~W~~  176 (614)
                         ..|..... .-++..+|.+
T Consensus        83 ~~g~~v~~~~~~~~~~~Q~W~l  104 (105)
T PF14200_consen   83 ANGTNVQQWEYDNGSDNQQWKL  104 (105)
T ss_dssp             STTEBEEEEE-STSSGGGEEEE
T ss_pred             CCCCEEEEEeCCCCCccCEEEe
Confidence               24555444 5556666754


No 21 
>PF07468 Agglutinin:  Agglutinin;  InterPro: IPR008998 Agglutinins are sugar-specific lectins that can agglutinate erythrocytes and other cell types. Lectins occur widely in plants, as well as some microorganisms and animal []. Agglutinin from Amaranthus caudatus (amaranthin) is a lectin from the ancient South American crop, amaranth grain. Although its biological function is unknown, it can agglutinate A, B and O red blood cells, and has a carbohydrate-binding site that is specific for the methyl-glycoside of the T-antigen found linked to serine or threonine residues of cell surface glycoproteins []. The protein is comprised of a homodimer, with each homodimer consisting of two beta-trefoil domains []. Lectin B chains from ricin and related toxins also contain beta-trefoil domain, however they are not related to agglutinin, showing little sequence similarity [].; PDB: 1JLY_B 1JLX_B.
Probab=96.86  E-value=0.012  Score=56.39  Aligned_cols=100  Identities=14%  Similarity=0.149  Sum_probs=64.6

Q ss_pred             eEEEEcCCCCeEeeeC----CCceEEeccc-CCCCceeEEEEecCCcEEEEEeeCCceEEEEeceEEEEe-CCCCeEEec
Q psy15563        490 QIALRDNSGAYLSPIG----SKAILKTRSN-TVTKDELFTLEDSLPQASFIAALNSKYVSVKQGQIALRD-NSGAYLSPI  563 (614)
Q Consensus       490 ~valr~~nGkyL~~~g----~~g~L~a~~~-~~~~~elF~le~~~pqv~l~s~~n~kyVs~~~G~~~~R~-~~gkyw~~~  563 (614)
                      .|++|..|||||+...    .-+-|+..++ -.+.-..|+++.+.         +      ..|..++|. .+||||...
T Consensus         7 ~V~fkg~N~kYLry~~e~~~~~~~LqF~~edi~dP~v~fev~~~~---------~------~dG~V~Ir~~y~nKfWrr~   71 (153)
T PF07468_consen    7 YVAFKGDNGKYLRYRTEDIQQYGYLQFSGEDIGDPYVKFEVEPSK---------T------HDGLVHIRCCYNNKFWRRS   71 (153)
T ss_dssp             CEEEETTTS-EEEEEESSCTTCCEEEEEESSTT-CCG-EEEEE-S---------S------TTT-EEEEETTTTEEEEES
T ss_pred             EEEEEcCCCcEEEEEecccccceeEEecCCcCCCCceeEEEEEcc---------c------CCCeEEEEeccCCceeEeC
Confidence            6888999999999864    3466887554 44566778887643         1      124456776 479999865


Q ss_pred             cCC--ceEeecCCCCC------CCccEEEEEcC----CeEEEEE-cCCCCeEEEee
Q psy15563        564 GSK--AILKTRSNTVT------KDELFTLEDSL----PQASFIA-ALNSKYVSVKQ  606 (614)
Q Consensus       564 ~~g--~v~a~~~~~~~------~~~~F~ie~~~----~~v~lik-a~NGkYl~~k~  606 (614)
                      +.+  -|.| .++.++      +.++|.....+    ..|+| | ..||.|.+--.
T Consensus        72 s~n~~WI~a-da~~p~ed~s~~~cTLF~Pv~vd~~~~~~i~l-~~~~n~~~~~r~t  125 (153)
T PF07468_consen   72 SPNDYWIWA-DADDPDEDQSKPSCTLFEPVKVDVKDFNVIAL-RNMQNGHFCKRLT  125 (153)
T ss_dssp             CCC--BEEE-EESSHHH-TCSTCGG-EEEEESCCCETTEEEE-EETTTTEEEEEE-
T ss_pred             CCCCcEEEe-cCCCcccccCCCCceEEEEEEecCCCccEEEE-EecCCceEEEEEc
Confidence            555  4778 544333      56679998764    67888 6 68999987543


No 22 
>smart00442 FGF Acidic and basic fibroblast growth factor family. Mitogens that stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family play essential roles in patterning and differentiation during vertebrate embryogenesis, and have neurotrophic activities.
Probab=96.39  E-value=0.026  Score=52.50  Aligned_cols=78  Identities=17%  Similarity=0.107  Sum_probs=65.6

Q ss_pred             EEEeecCCceEEecCCeeEEEecccCCCCceEEEEEcCCCeEEEEeeccCceeeecCCeEEEEecCCCCCcceEEEEc
Q psy15563        102 VYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASSDKLICTAKIPANPEYWLAHLA  179 (614)
Q Consensus       102 v~lkSh~GrYL~~d~dG~v~~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~~d~~~c~~~~~~~~~~W~~hlA  179 (614)
                      +.|=+..|.||..++||.|.++.+..++..-|++...+.|..+||++.+++||..+..--.-.+...+++..|..++.
T Consensus         5 ~~Ly~~~~~~L~I~~~G~V~Gt~~~~~~~~ile~~s~~~g~V~ik~~~s~~YLCmn~~G~ly~s~~~~~dC~F~E~~~   82 (126)
T smart00442        5 RQLYCRNGQHLQILPDGTVDGTRDESSSFTILEIIAVAVGVVAIKGVASCRYLCMNKCGKLYGSKNFTEDCVFREEME   82 (126)
T ss_pred             EEEEeCCCeEEEEcCCceEecccCCCCcceEEEEEeccCCEEEEEEcccceEEEECCCCCEEEcccCCCCcEEEEEec
Confidence            445556679999999999999999999999999999988999999999999999977655555566777777777654


No 23 
>KOG3885|consensus
Probab=95.12  E-value=0.079  Score=51.10  Aligned_cols=74  Identities=22%  Similarity=0.298  Sum_probs=62.6

Q ss_pred             EEEEEecCCcEEEEccCCeEEEeccCCCCCceEEEEEeCCCeEEEEECCCCeEEEee-CCeEEEeecCCCCCccEEE
Q psy15563        307 SVYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGAS-SDKLICTAKIPANPEYWLA  382 (614)
Q Consensus       307 ~v~LRS~~GkYLs~~~~G~V~a~a~~~g~~E~F~le~~~dGk~aLrs~~~G~yL~a~-~G~L~a~~~s~g~~E~F~v  382 (614)
                      +.+.|+. |.+|.+.++|.|..++++-++...++|+...=|.|+||....++||+.+ .|+|.+. ....++..|.=
T Consensus        29 ~LY~~t~-g~hLqi~p~g~V~Gt~~~~s~~siLei~sv~~GvV~IkGV~s~~YL~Mn~~G~LygS-~~~t~eC~F~E  103 (155)
T KOG3885|consen   29 LLYCRNG-GHFLRILPDGTVDGTRDRSDQHTIFEIITVAVGVVAIKGVESELYLAMNKEGKLYAS-KEFTEECKFKE  103 (155)
T ss_pred             EEEEcCC-CEEEEEcCCCccccccccCCCceeEEEEEeeecEEEEEEeeceeEEEECCCCcEecC-CCCCccceeEE
Confidence            3444445 9999999999999999999999999998875599999999999999997 6999998 66667777763


No 24 
>PF05270 AbfB:  Alpha-L-arabinofuranosidase B (ABFB);  InterPro: IPR007934 This family consists of several fungal alpha-L-arabinofuranosidase B proteins. L-Arabinose is a constituent of plant cell wall polysaccharides. It is found in a polymeric form in L-arabinan, in which the backbone is formed by 1,5-a- linked l-arabinose residues that can be branched via 1,2-a- and 1,3-a-linked l-arabinofuranose side chains. AbfB hydrolyses 1,5-a, 1,3-a and 1,2-a linkages in both oligosaccharides and polysaccharides, which contain terminal non-reducing l-arabinofuranoses in side chains [].; GO: 0046556 alpha-N-arabinofuranosidase activity, 0046373 L-arabinose metabolic process; PDB: 3AKI_A 3AKF_A 3AKH_A 3AKG_A 2D44_A 1WD3_A 2D43_A 1WD4_A 3KMV_E.
Probab=94.24  E-value=1.4  Score=41.96  Aligned_cols=108  Identities=18%  Similarity=0.206  Sum_probs=70.7

Q ss_pred             EEEEEE--CCCceEEEecCCeEEEecCCCCC----CCceEEEEEcCCCCccEEEEEecC--CcEEEEccCCeEEEeccCC
Q psy15563        262 TIGLIN--AQFKYLTAETFGFKINANGASLK----KKQVWILEPANGGDTLSVYLRSHL--GKYLAVDSFGNVTCEADEK  333 (614)
Q Consensus       262 ~V~L~s--~~GkYLtae~~g~~V~An~~~l~----~~E~F~Le~~~~~~~~~v~LRS~~--GkYLs~~~~G~V~a~a~~~  333 (614)
                      .+.|.+  ..++||.-..+...+.+......    ...+|.+..--. +.+.|.|.|.+  |.||.. .++.|.-.+...
T Consensus         2 ~~~~~s~~~~~ryirh~~~~~~~~~v~~~s~~~~r~da~f~vvpGLa-~~~~vSfES~~~PG~yLrh-~~~~v~l~~~d~   79 (142)
T PF05270_consen    2 SLRLTSPNYPDRYIRHRGSLVRLDPVSSSSSALDRADATFRVVPGLA-DSSCVSFESVNYPGYYLRH-SNFRVRLEKNDG   79 (142)
T ss_dssp             EEEEEESSSTTEEEEEETTEEEEEES-SSGGHHHHHGG-EEEEE-SS--TTCEEEEESSSTTEEEEE-ETTEEEEEE--S
T ss_pred             eEEEECCCCCCeEEEEcCceEEEeeccCCcchhhccCceEEEEEccC-CCCEEEEEECCCCCcEEEE-ECCEEEEeecCC
Confidence            456666  47999998765566644444332    467899986322 35689999977  999998 566766665544


Q ss_pred             C----CCceEEEEEe--CCCeEEEEECC-CCeEEEeeCCeEEEee
Q psy15563        334 E----PGSLFQIHVS--DEGKWAFKNKT-RGYFLGASSDKLICTA  371 (614)
Q Consensus       334 g----~~E~F~le~~--~dGk~aLrs~~-~G~yL~a~~G~L~a~~  371 (614)
                      +    ...+|.++.-  ..|.++|++.+ -|+||.-.++.|....
T Consensus        80 s~~F~~dATF~~~~Gl~~~g~~sfeS~n~Pg~ylrh~~~~l~l~~  124 (142)
T PF05270_consen   80 SALFREDATFCPRPGLAGPGYVSFESYNYPGRYLRHYNGELYLAP  124 (142)
T ss_dssp             SHHHHHHT-EEEEE-SSSTTEEEEEESSSTTEEEEEETTEEEEEE
T ss_pred             CccccCCceEEEecCCCCCCcceEEEecCCCeEEEEECCEEEEec
Confidence            4    3788998885  23778999885 4788988877665543


No 25 
>KOG3885|consensus
Probab=90.42  E-value=1.1  Score=43.27  Aligned_cols=61  Identities=25%  Similarity=0.396  Sum_probs=53.4

Q ss_pred             EEEEeecCCceEEecCCeeEEEecccCCCCceEEEEEcCCCeEEEEeeccCceeeecCC-eEE
Q psy15563        101 SVYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASSD-KLI  162 (614)
Q Consensus       101 ~v~lkSh~GrYL~~d~dG~v~~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~~d-~~~  162 (614)
                      ..+.++. |.+|-+.+||.|..++|.-++..-++|++-.=|..+||.+..++||..+.+ .|.
T Consensus        29 ~LY~~t~-g~hLqi~p~g~V~Gt~~~~s~~siLei~sv~~GvV~IkGV~s~~YL~Mn~~G~Ly   90 (155)
T KOG3885|consen   29 LLYCRNG-GHFLRILPDGTVDGTRDRSDQHTIFEIITVAVGVVAIKGVESELYLAMNKEGKLY   90 (155)
T ss_pred             EEEEcCC-CEEEEEcCCCccccccccCCCceeEEEEEeeecEEEEEEeeceeEEEECCCCcEe
Confidence            3566666 999999999999999999999999999988679999999999999998655 444


No 26 
>PF05270 AbfB:  Alpha-L-arabinofuranosidase B (ABFB);  InterPro: IPR007934 This family consists of several fungal alpha-L-arabinofuranosidase B proteins. L-Arabinose is a constituent of plant cell wall polysaccharides. It is found in a polymeric form in L-arabinan, in which the backbone is formed by 1,5-a- linked l-arabinose residues that can be branched via 1,2-a- and 1,3-a-linked l-arabinofuranose side chains. AbfB hydrolyses 1,5-a, 1,3-a and 1,2-a linkages in both oligosaccharides and polysaccharides, which contain terminal non-reducing l-arabinofuranoses in side chains [].; GO: 0046556 alpha-N-arabinofuranosidase activity, 0046373 L-arabinose metabolic process; PDB: 3AKI_A 3AKF_A 3AKH_A 3AKG_A 2D44_A 1WD3_A 2D43_A 1WD4_A 3KMV_E.
Probab=81.41  E-value=12  Score=35.79  Aligned_cols=106  Identities=19%  Similarity=0.203  Sum_probs=66.5

Q ss_pred             EecCCceeeeeecCeeEEeccccccccccccccceecceeecccccccccccceeEEEEecCCCCceEEEEeecC--Cce
Q psy15563         34 INAQFKYLTAETFGFKINANGASLKKKQFKYLTAETFGFKINANGASLKKKQDGVWILEPANGGDTLSVYLRSHL--GKY  111 (614)
Q Consensus        34 in~~~~YLtaE~fgfkvna~~~slkkkQ~~~~~~~~~~~~~~~~~~~~~~~q~~vW~l~~~~~~~~~~v~lkSh~--GrY  111 (614)
                      .|..++||..-.+--.+-+..++..                      -.-|||..|++.+--. +...|+|.|..  |.|
T Consensus         8 ~~~~~ryirh~~~~~~~~~v~~~s~----------------------~~~r~da~f~vvpGLa-~~~~vSfES~~~PG~y   64 (142)
T PF05270_consen    8 PNYPDRYIRHRGSLVRLDPVSSSSS----------------------ALDRADATFRVVPGLA-DSSCVSFESVNYPGYY   64 (142)
T ss_dssp             SSSTTEEEEEETTEEEEEES-SSGG----------------------HHHHHGG-EEEEE-SS--TTCEEEEESSSTTEE
T ss_pred             CCCCCeEEEEcCceEEEeeccCCcc----------------------hhhccCceEEEEEccC-CCCEEEEEECCCCCcE
Confidence            4667788877655555444433332                      1223345899987433 34579999986  999


Q ss_pred             EEecCCeeEEEecccCC----CCceEEEEEc--CCCeEEEEeec-cCceeeecCCeEEE
Q psy15563        112 LAVDSFGNVTCEADEKE----PGSLFQIHVS--DEGKWAFKNKT-RGYFLGASSDKLIC  163 (614)
Q Consensus       112 L~~d~dG~v~~~~e~~~----~~~~F~i~~~--~dG~waLqs~~-~g~yl~~~~d~~~c  163 (614)
                      |-. .++.|.-+....+    .+..|.++.-  ..|-.+|+|.. .|+||--.+..|..
T Consensus        65 Lrh-~~~~v~l~~~d~s~~F~~dATF~~~~Gl~~~g~~sfeS~n~Pg~ylrh~~~~l~l  122 (142)
T PF05270_consen   65 LRH-SNFRVRLEKNDGSALFREDATFCPRPGLAGPGYVSFESYNYPGRYLRHYNGELYL  122 (142)
T ss_dssp             EEE-ETTEEEEEE--SSHHHHHHT-EEEEE-SSSTTEEEEEESSSTTEEEEEETTEEEE
T ss_pred             EEE-ECCEEEEeecCCCccccCCceEEEecCCCCCCcceEEEecCCCeEEEEECCEEEE
Confidence            988 7777777666544    6789999875  44777999965 57888776665543


No 27 
>cd00161 RICIN Ricin-type beta-trefoil; Carbohydrate-binding domain formed from presumed gene triplication. The domain is found in a variety of molecules serving diverse functions such as enzymatic activity, inhibitory toxicity and signal transduction. Highly specific ligand binding occurs on exposed surfaces of the compact domain sturcture.
Probab=64.32  E-value=91  Score=26.59  Aligned_cols=85  Identities=15%  Similarity=0.133  Sum_probs=46.4

Q ss_pred             CCceEEEecC--CeEEEec-CCCCCCCceEEEEEcCCCCccEEEEEecCCcEEEEccC---CeEEE-eccCCCCCceEEE
Q psy15563        269 QFKYLTAETF--GFKINAN-GASLKKKQVWILEPANGGDTLSVYLRSHLGKYLAVDSF---GNVTC-EADEKEPGSLFQI  341 (614)
Q Consensus       269 ~GkYLtae~~--g~~V~An-~~~l~~~E~F~Le~~~~~~~~~v~LRS~~GkYLs~~~~---G~V~a-~a~~~g~~E~F~l  341 (614)
                      .++.|.+...  +..|... -....+.|.|.+..     .+.++++. .+++|.+...   ..|+. .-....+..+|.+
T Consensus         9 ~~~cL~~~~~~~~~~v~~~~c~~~~~~Q~W~~~~-----~g~~~~~~-~~~Cl~~~~~~~~~~~~~~~c~~~~~~Q~W~~   82 (124)
T cd00161           9 TGLCLDVNGGSDGGPVQLYPCHGNGNNQKWTLTS-----DGTIRIKS-SNLCLDVGGDAPGSKVRLYTCSGGSDNQRWTF   82 (124)
T ss_pred             CCeEEECCCCCCCCEEEEEECCCCCccCCEEEeC-----CCeEEEcC-CCeEEcccCCCCCCEEEEEECCCCCcCCEEEE
Confidence            4666665532  2333211 11114688999875     34677776 7888887654   33433 2222356788887


Q ss_pred             EEeCCCeEEEEECCCCeEEEee
Q psy15563        342 HVSDEGKWAFKNKTRGYFLGAS  363 (614)
Q Consensus       342 e~~~dGk~aLrs~~~G~yL~a~  363 (614)
                      +..  +++  ++...+.-|.+.
T Consensus        83 ~~~--~~i--~~~~~~~cl~~~  100 (124)
T cd00161          83 NKD--GTI--RNLKSGKCLDVK  100 (124)
T ss_pred             CCC--cEE--EECCCCeEEeCC
Confidence            643  444  333346666554


No 28 
>PF03498 CDtoxinA:  Cytolethal distending toxin A/C family;  InterPro: IPR003558 Escherichia coli, Haemophilus spp and Campylobacter spp. all produce a toxin that is seen to cause distension in certain cell lines [, ], which eventually disintegrate and die. This novel toxin, termed cytolethal distending toxin (cdt), has three subunits: A, B and C. Their sizes are approx. 27.7, 29.5 and 19.9kDa respectively [], and they appear to be entirely novel [].  Further research on the complete toxin has revealed that it blocks the cell cycle at stage G2, through inactivation of the cyclin-dependent kinase Cdk1, and without induction of DNA breaks. This leads to multipolar abortive mitosis and micronucleation, associated with centrosomal amplification []. The roles of each subunit are unclear, but it is believed that they have separate roles in pathogenicity. This entry represents the A and C subunits.; GO: 0009405 pathogenesis; PDB: 2F2F_A 1SR4_C.
Probab=63.51  E-value=21  Score=34.40  Aligned_cols=78  Identities=21%  Similarity=0.309  Sum_probs=53.0

Q ss_pred             EEEEeec-CCceEEecCCeeEE---EecccCCCCceEEEEEcCCCeEEEEeeccCceeeecCCe-------E---EE-Ee
Q psy15563        101 SVYLRSH-LGKYLAVDSFGNVT---CEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASSDK-------L---IC-TA  165 (614)
Q Consensus       101 ~v~lkSh-~GrYL~~d~dG~v~---~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~~d~-------~---~c-~~  165 (614)
                      .|.||.. -|.=|.++++|.+.   |+.+  .....|.|++...|.+.|||...+.=|......       +   .| .+
T Consensus        53 ~Vqf~n~~~~~CL~~~~~G~~~~~~C~~~--~~~q~F~iiPtttgAVQIks~~~~~Cl~~~~~~~~~~~~~i~l~~Cd~~  130 (150)
T PF03498_consen   53 YVQFVNPKTGTCLAAYGNGVFHYKSCDQD--NLEQVFSIIPTTTGAVQIKSLSTGECLQTFNNSRTPIYYSIGLTPCDKS  130 (150)
T ss_dssp             EEEEEETTTSEEEEEETTCEEEE--TTTC--HGHH-EEEEEBTTS-EEEEETTT--EEEE-STTSS-SSEEEEEE---S-
T ss_pred             EEEEEcCCCCcceeecCCCeEeecccCCC--ChhceEEEEEcCCCcEEEEecCCCceEEecCCCceeEEeeEEeeeCCCc
Confidence            7999998 78899999999977   5555  667899999999999999998888666663322       1   25 34


Q ss_pred             cCCCCCcceEEEEcc
Q psy15563        166 KIPANPEYWLAHLAA  180 (614)
Q Consensus       166 ~~~~~~~~W~~hlA~  180 (614)
                      ..+...++|.+-.++
T Consensus       131 ~~~~l~~lw~itPP~  145 (150)
T PF03498_consen  131 KEINLDQLWFITPPL  145 (150)
T ss_dssp             EEETGGGEEEEE--S
T ss_pred             CCCCHHHcEEEcCCc
Confidence            455667888876653


No 29 
>KOG1407|consensus
Probab=58.61  E-value=2.5e+02  Score=29.88  Aligned_cols=21  Identities=33%  Similarity=0.608  Sum_probs=16.8

Q ss_pred             ceEEEE-cCCCCeEeeeCCCceE
Q psy15563        489 GQIALR-DNSGAYLSPIGSKAIL  510 (614)
Q Consensus       489 g~valr-~~nGkyL~~~g~~g~L  510 (614)
                      +.|+|+ +..|||++. |++-.|
T Consensus       191 nCicI~f~p~GryfA~-GsADAl  212 (313)
T KOG1407|consen  191 NCICIEFDPDGRYFAT-GSADAL  212 (313)
T ss_pred             ceEEEEECCCCceEee-ccccce
Confidence            789999 899999985 765444


No 30 
>PF03498 CDtoxinA:  Cytolethal distending toxin A/C family;  InterPro: IPR003558 Escherichia coli, Haemophilus spp and Campylobacter spp. all produce a toxin that is seen to cause distension in certain cell lines [, ], which eventually disintegrate and die. This novel toxin, termed cytolethal distending toxin (cdt), has three subunits: A, B and C. Their sizes are approx. 27.7, 29.5 and 19.9kDa respectively [], and they appear to be entirely novel [].  Further research on the complete toxin has revealed that it blocks the cell cycle at stage G2, through inactivation of the cyclin-dependent kinase Cdk1, and without induction of DNA breaks. This leads to multipolar abortive mitosis and micronucleation, associated with centrosomal amplification []. The roles of each subunit are unclear, but it is believed that they have separate roles in pathogenicity. This entry represents the A and C subunits.; GO: 0009405 pathogenesis; PDB: 2F2F_A 1SR4_C.
Probab=52.51  E-value=77  Score=30.61  Aligned_cols=57  Identities=23%  Similarity=0.256  Sum_probs=42.4

Q ss_pred             ccEEEEEe-cCCcEEEEccCCeEE---EeccCCCCCceEEEEEeCCCeEEEEECCCCeEEEee
Q psy15563        305 TLSVYLRS-HLGKYLAVDSFGNVT---CEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGAS  363 (614)
Q Consensus       305 ~~~v~LRS-~~GkYLs~~~~G~V~---a~a~~~g~~E~F~le~~~dGk~aLrs~~~G~yL~a~  363 (614)
                      .+.|+|+. ..|..|.+..+|.+.   |+.+  .....|+|.+-..|-+-||+..+|.=|+..
T Consensus        51 ~g~Vqf~n~~~~~CL~~~~~G~~~~~~C~~~--~~~q~F~iiPtttgAVQIks~~~~~Cl~~~  111 (150)
T PF03498_consen   51 FGYVQFVNPKTGTCLAAYGNGVFHYKSCDQD--NLEQVFSIIPTTTGAVQIKSLSTGECLQTF  111 (150)
T ss_dssp             TCEEEEEETTTSEEEEEETTCEEEE--TTTC--HGHH-EEEEEBTTS-EEEEETTT--EEEE-
T ss_pred             CCEEEEEcCCCCcceeecCCCeEeecccCCC--ChhceEEEEEcCCCcEEEEecCCCceEEec
Confidence            47999999 688899998888776   5433  457899999997799999999888766664


No 31 
>KOG0308|consensus
Probab=47.66  E-value=43  Score=39.19  Aligned_cols=63  Identities=22%  Similarity=0.372  Sum_probs=48.3

Q ss_pred             EEEEeec--CCceEEecCCee--EEEecccC-------CCCceEEEEEcCCCeEEEEeecc--CceeeecCCeEEE
Q psy15563        101 SVYLRSH--LGKYLAVDSFGN--VTCEADEK-------EPGSLFQIHVSDEGKWAFKNKTR--GYFLGASSDKLIC  163 (614)
Q Consensus       101 ~v~lkSh--~GrYL~~d~dG~--v~~~~e~~-------~~~~~F~i~~~~dG~waLqs~~~--g~yl~~~~d~~~c  163 (614)
                      ...||.|  +=|=|..+.||.  |++++++.       -.+|.+.+++|.+|-|||++.+.  --|.|+....|.+
T Consensus       206 imkLrGHTdNVr~ll~~dDGt~~ls~sSDgtIrlWdLgqQrCl~T~~vH~e~VWaL~~~~sf~~vYsG~rd~~i~~  281 (735)
T KOG0308|consen  206 IMKLRGHTDNVRVLLVNDDGTRLLSASSDGTIRLWDLGQQRCLATYIVHKEGVWALQSSPSFTHVYSGGRDGNIYR  281 (735)
T ss_pred             eeeeeccccceEEEEEcCCCCeEeecCCCceEEeeeccccceeeeEEeccCceEEEeeCCCcceEEecCCCCcEEe
Confidence            5788888  668899999995  77777753       36788999999999999999653  2466665555555


No 32 
>KOG2243|consensus
Probab=46.80  E-value=40  Score=42.56  Aligned_cols=204  Identities=15%  Similarity=0.097  Sum_probs=0.0

Q ss_pred             EEeeccCceeee-----cCCeEEEE-----ecCCCCCcceEEEEcccce-----eeeccCCCCcccCCCCCCCCceeeee
Q psy15563        145 FKNKTRGYFLGA-----SSDKLICT-----AKIPANPEYWLAHLAARPQ-----LRSCGSPPCASIGLPSSRGLPCRYIS  209 (614)
Q Consensus       145 Lqs~~~g~yl~~-----~~d~~~c~-----~~~~~~~~~W~~hlA~Hpq-----v~~~~~~~~~~~~~~~~~~~~~ry~~  209 (614)
                      |+-.-.|-||..     +..+-.+|     ..+.+++.+|++|+|-.-.     |++=.       |+-.....-.||||
T Consensus       122 lrhs~sgmyl~clstsrs~tdklafdvgl~e~~tgeacwwtihpaskqrsegekvrigd-------dlilvsvsserylh  194 (5019)
T KOG2243|consen  122 LRHSFSGMYLCCLSTSRSQTDKLAFDVGLQEDATGEACWWTIHPASKQRSEGEKVRIGD-------DLILVSVSSERYLH  194 (5019)
T ss_pred             hhccccceEEEEecccccccccceeeccccccCcCceeeEEecccccccccCceeeecC-------ceEEEEecccceEE


Q ss_pred             e--CCCCCcccCCCCCcccceEEEEEEeecCccceeecCCCccccCCCCccceeEEEEEE-CCCceEEEecCCeEEEecC
Q psy15563        210 S--SPSTKAVHKPPPLLKRSVFPLIMNIMNGTNGHHENGNGELNGVGFSSKGVWTIGLIN-AQFKYLTAETFGFKINANG  286 (614)
Q Consensus       210 l--~~~~~~v~~~~pwg~d~l~tl~~~~~~~~~~~~~~~~~~L~~~G~~~~~~~~V~L~s-~~GkYLtae~~g~~V~An~  286 (614)
                      |  +...++||...-++-+.|-|.---..-.++.++-|+--+|-..-..+|.++--+=.. ...|-+--|.++..+.|..
T Consensus       195 ls~~ng~i~vdaaf~qtlw~v~p~~sgs~~~egyligg~v~rl~hghmdecltips~d~~d~q~r~i~ye~g~~~~~a~s  274 (5019)
T KOG2243|consen  195 LSVGNGNIHVDAAFMQTLWNVAPICSGSEAAEGYLIGGDVLRLFHGHMDECLTIPSGDHGDEQHRRIFYEAGAAGTHARS  274 (5019)
T ss_pred             EEecCCceeecHHHHHHHhccCCCcCCchhhhceeecchHHHHhccCccceeecCCCCccchhheeeeeeccccccchhh


Q ss_pred             CCCCCCceEEEEEcCCCC------ccEEEEEe-cCCcEEEEccCCeEEEeccCCCC--CceEEEEEe-------------
Q psy15563        287 ASLKKKQVWILEPANGGD------TLSVYLRS-HLGKYLAVDSFGNVTCEADEKEP--GSLFQIHVS-------------  344 (614)
Q Consensus       287 ~~l~~~E~F~Le~~~~~~------~~~v~LRS-~~GkYLs~~~~G~V~a~a~~~g~--~E~F~le~~-------------  344 (614)
                      -    |.+=.|-..  |.      +..++||- ..|+||++..|-.++--..+..+  .+.|.....             
T Consensus       275 l----wr~e~lria--wsgs~irwgq~frlrh~ttg~ylal~edk~lil~dk~kad~kstafsfraskelkekld~~~kk  348 (5019)
T KOG2243|consen  275 L----WRLEPLRIA--WSGSHIRWGQAFRLRHLTTGHYLALMEDKGLILMDKAKADTKSTAFSFRASKELKEKLDSGHKK  348 (5019)
T ss_pred             h----hcccceeEE--eccCccccCcceeeeeccccceeeeeccCCceeechhhcccccceeEeeccHHHHHhhcccccc


Q ss_pred             ------------CCCeEEEEECCCCeEEE
Q psy15563        345 ------------DEGKWAFKNKTRGYFLG  361 (614)
Q Consensus       345 ------------~dGk~aLrs~~~G~yL~  361 (614)
                                  +|.-+-+|+...|-||+
T Consensus       349 didgmg~~eikygdsvc~iqhi~sglw~t  377 (5019)
T KOG2243|consen  349 DIDGMGTPEIKYGDSVCFIQHIASGLWLT  377 (5019)
T ss_pred             ccccCCCcceecCceeeeeeeeccceeEE


No 33 
>PF00652 Ricin_B_lectin:  Ricin-type beta-trefoil lectin domain;  InterPro: IPR000772 Ricin is a legume lectin from the seeds of the castor bean plant, Ricinus communis. The seeds are poisonous to people, animals and insects and just one milligram of ricin can kill an adult. Primary structure analysis has shown the presence of a similar domain in many carbohydrate-recognition proteins like plant and bacterial AB-toxins, glycosidases or proteases [, , ]. This domain, known as the ricin B lectin domain, can be present in one or more copies and has been shown in some instance to bind simple sugars, such as galactose or lactose. The ricin B lectin domain is composed of three homologous subdomains of 40 amino acids (alpha, beta and gamma) and a linker peptide of around 15 residues (lambda). It has been proposed that the ricin B lectin domain arose by gene triplication from a primitive 40 residue galactoside-binding peptide [, ]. The most characteristic, though not completely conserved, sequence feature is the presence of a Q-W pattern. Consequently, the ricin B lectin domain as also been refered as the (QxW)3 domain and the three homologous regions as the QxW repeats [, ]. A disulphide bond is also conserved in some of the QxW repeats []. The 3D structure of the ricin B chain has shown that the three QxW repeats pack around a pseudo threefold axis that is stabilised by the lambda linker []. The ricin B lectin domain has no major segments of a helix or beta sheet but each of the QxW repeats contains an omega loop []. An idealized omega-loop is a compact, contiguous segment of polypeptide that traces a 'loop-shaped' path in three-dimensional space; the main chain resembles a Greek omega.; PDB: 2VLC_B 3A22_B 3A21_B 3A23_B 1GGP_B 1VCL_A 2Z48_B 2Z49_A 2D7R_A 2D7I_A ....
Probab=43.94  E-value=1.5e+02  Score=25.59  Aligned_cols=93  Identities=16%  Similarity=0.216  Sum_probs=50.5

Q ss_pred             EEEEEECCCceEEEe---cCCeEEEecCCCCCCCceEEEEEcCCCCccEEEEEecCCcEEEEccCC---eEEE-eccCCC
Q psy15563        262 TIGLINAQFKYLTAE---TFGFKINANGASLKKKQVWILEPANGGDTLSVYLRSHLGKYLAVDSFG---NVTC-EADEKE  334 (614)
Q Consensus       262 ~V~L~s~~GkYLtae---~~g~~V~An~~~l~~~E~F~Le~~~~~~~~~v~LRS~~GkYLs~~~~G---~V~a-~a~~~g  334 (614)
                      +|.+++.+++.|++.   ..+..|....-.-.+.|.|.+...     +.+.........|.+..+.   .|.- .-+.-.
T Consensus         4 ~i~~~~~~~~cl~~~~~~~~~~~v~l~~c~~~~~Q~w~~~~~-----~~i~~~~~~~~CL~~~~~~~~~~i~l~~C~~~~   78 (124)
T PF00652_consen    4 YIRNVNKSGLCLDVQGSTKNGSPVVLYPCDGSDNQLWRFDPD-----GQIRSNNNPNLCLDVDGSSPGTKIVLWPCDSNS   78 (124)
T ss_dssp             EEEEEEGGGEEEEEGGSSSTTTBEEEEE--SSGGGEEEEETT-----SBEEETTETTEEEEESSSSTTEBEEEEETTTTG
T ss_pred             EEEEeeCCCCeEEEcCCCCCCCEEEEEECCCCCceeEEEcCC-----CceeeccCcceEEEeeccCCCceEEEeeccCCc
Confidence            345555577888877   334444222111156799988743     2344433444578777665   3333 333333


Q ss_pred             CCceEEEEEeCCCeEEEEECCCCeEEEee
Q psy15563        335 PGSLFQIHVSDEGKWAFKNKTRGYFLGAS  363 (614)
Q Consensus       335 ~~E~F~le~~~dGk~aLrs~~~G~yL~a~  363 (614)
                      ..+.|.+..  +  ..|++...++.|...
T Consensus        79 ~~Q~W~~~~--~--~~i~n~~s~~cL~~~  103 (124)
T PF00652_consen   79 SNQRWKFDP--D--GRIRNKNSGLCLDVK  103 (124)
T ss_dssp             GGGBEEEET--T--SBEEETTTTEEEEEG
T ss_pred             cCCeEEEcC--C--eeEEeCCCCEEEEec
Confidence            347888765  3  345555467888775


No 34 
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=40.15  E-value=4.6e+02  Score=28.81  Aligned_cols=196  Identities=15%  Similarity=0.211  Sum_probs=0.0

Q ss_pred             ecCCceEEecCCee----EEEecccCCCCceEEEEEcCCC-----------eEEEEeeccCceeee---cCCeEE-----
Q psy15563        106 SHLGKYLAVDSFGN----VTCEADEKEPGSLFQIHVSDEG-----------KWAFKNKTRGYFLGA---SSDKLI-----  162 (614)
Q Consensus       106 Sh~GrYL~~d~dG~----v~~~~e~~~~~~~F~i~~~~dG-----------~waLqs~~~g~yl~~---~~d~~~-----  162 (614)
                      ++.||||.+-.++.    |.=-.+.-+...-=.++.+..+           ..+..+ +.|+||-.   -.|++.     
T Consensus        97 d~~g~~vf~AnY~~g~v~v~p~~~dG~l~~~v~~~~h~g~~p~~rQ~~~h~H~a~~t-P~~~~l~v~DLG~Dri~~y~~~  175 (346)
T COG2706          97 DEDGRFVFVANYHSGSVSVYPLQADGSLQPVVQVVKHTGSGPHERQESPHVHSANFT-PDGRYLVVPDLGTDRIFLYDLD  175 (346)
T ss_pred             CCCCCEEEEEEccCceEEEEEcccCCccccceeeeecCCCCCCccccCCccceeeeC-CCCCEEEEeecCCceEEEEEcc


Q ss_pred             ----EEecCCCCCcceEE-EEcccceeeeccCCCCcccCCCCCCCCceeee--eeCCCCCcccCCCCCcccceEEEEEEe
Q psy15563        163 ----CTAKIPANPEYWLA-HLAARPQLRSCGSPPCASIGLPSSRGLPCRYI--SSSPSTKAVHKPPPLLKRSVFPLIMNI  235 (614)
Q Consensus       163 ----c~~~~~~~~~~W~~-hlA~Hpqv~~~~~~~~~~~~~~~~~~~~~ry~--~l~~~~~~v~~~~pwg~d~l~tl~~~~  235 (614)
                          +.+....-..---| ||++||.-...                   |+  +|+..-.....+..-|.-..+.-+-.+
T Consensus       176 dg~L~~~~~~~v~~G~GPRHi~FHpn~k~a-------------------Y~v~EL~stV~v~~y~~~~g~~~~lQ~i~tl  236 (346)
T COG2706         176 DGKLTPADPAEVKPGAGPRHIVFHPNGKYA-------------------YLVNELNSTVDVLEYNPAVGKFEELQTIDTL  236 (346)
T ss_pred             cCccccccccccCCCCCcceEEEcCCCcEE-------------------EEEeccCCEEEEEEEcCCCceEEEeeeeccC


Q ss_pred             ecCccceeecCCCccccCCCCccceeEEEEEECCCceEEEecCC------eEEEecCCCCCCCceEEEEEcCCCCccEEE
Q psy15563        236 MNGTNGHHENGNGELNGVGFSSKGVWTIGLINAQFKYLTAETFG------FKINANGASLKKKQVWILEPANGGDTLSVY  309 (614)
Q Consensus       236 ~~~~~~~~~~~~~~L~~~G~~~~~~~~V~L~s~~GkYLtae~~g------~~V~An~~~l~~~E~F~Le~~~~~~~~~v~  309 (614)
                      .++.-|--.+..--++.||                |||=|.+-|      ++|..++..+.--+.+..|-..+       
T Consensus       237 P~dF~g~~~~aaIhis~dG----------------rFLYasNRg~dsI~~f~V~~~~g~L~~~~~~~teg~~P-------  293 (346)
T COG2706         237 PEDFTGTNWAAAIHISPDG----------------RFLYASNRGHDSIAVFSVDPDGGKLELVGITPTEGQFP-------  293 (346)
T ss_pred             ccccCCCCceeEEEECCCC----------------CEEEEecCCCCeEEEEEEcCCCCEEEEEEEeccCCcCC-------


Q ss_pred             EEecCCcEEEEccCCeEEEeccCCCCC-ceEEEEEeCCCeEEE
Q psy15563        310 LRSHLGKYLAVDSFGNVTCEADEKEPG-SLFQIHVSDEGKWAF  351 (614)
Q Consensus       310 LRS~~GkYLs~~~~G~V~a~a~~~g~~-E~F~le~~~dGk~aL  351 (614)
                            |-...+++|.+..-+.+.+++ ..|.+... -|++.+
T Consensus       294 ------R~F~i~~~g~~Liaa~q~sd~i~vf~~d~~-TG~L~~  329 (346)
T COG2706         294 ------RDFNINPSGRFLIAANQKSDNITVFERDKE-TGRLTL  329 (346)
T ss_pred             ------ccceeCCCCCEEEEEccCCCcEEEEEEcCC-CceEEe


No 35 
>COG0211 RpmA Ribosomal protein L27 [Translation, ribosomal structure and biogenesis]
Probab=33.94  E-value=1.1e+02  Score=26.79  Aligned_cols=61  Identities=23%  Similarity=0.401  Sum_probs=35.7

Q ss_pred             CCCCCcceEEEEcCCCCeEeeeCCCceEEecccCC---------CCceeEEEEecCCcEEEEEeeCCceEEEE
Q psy15563        483 IRSPATGQIALRDNSGAYLSPIGSKAILKTRSNTV---------TKDELFTLEDSLPQASFIAALNSKYVSVK  546 (614)
Q Consensus       483 ~~~~~~g~valr~~nGkyL~~~g~~g~L~a~~~~~---------~~~elF~le~~~pqv~l~s~~n~kyVs~~  546 (614)
                      .+|+++-+|-.|-..|.++.+  .+=+++.+.+.+         .+++||.|.+-.-.+..++. +.|||||.
T Consensus        13 gRDS~~krLGvK~~~Gq~v~a--G~IivRQRGTk~hpG~NVG~GkDhTlFAl~dG~Vkf~~k~~-~rk~vsV~   82 (87)
T COG0211          13 GRDSESKRLGVKKFGGQFVKA--GSIIVRQRGTKFHPGVNVGRGKDHTLFALVDGVVKFETKGK-NRKYVSVV   82 (87)
T ss_pred             CccccccccceeeeCCeEEec--ccEEEEeccccccCCcccccCCCceEEEeeccEEEEEEccC-CccEEEEE
Confidence            455666777777777777765  222344444432         26789999774423333443 66777764


No 36 
>cd00161 RICIN Ricin-type beta-trefoil; Carbohydrate-binding domain formed from presumed gene triplication. The domain is found in a variety of molecules serving diverse functions such as enzymatic activity, inhibitory toxicity and signal transduction. Highly specific ligand binding occurs on exposed surfaces of the compact domain sturcture.
Probab=30.93  E-value=3.3e+02  Score=23.01  Aligned_cols=59  Identities=19%  Similarity=0.329  Sum_probs=33.6

Q ss_pred             eEEEEecCCCCceEEEEeecCCceEEecCC---eeEEE-ecccCCCCceEEEEEcCCCeEEEEeeccCceeee
Q psy15563         88 VWILEPANGGDTLSVYLRSHLGKYLAVDSF---GNVTC-EADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGA  156 (614)
Q Consensus        88 vW~l~~~~~~~~~~v~lkSh~GrYL~~d~d---G~v~~-~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~  156 (614)
                      .|++.. .+    .+++++ .++.|.++..   ..|.. .....++.-+|.+..  +|++  .+...+.-|..
T Consensus        37 ~W~~~~-~g----~~~~~~-~~~Cl~~~~~~~~~~~~~~~c~~~~~~Q~W~~~~--~~~i--~~~~~~~cl~~   99 (124)
T cd00161          37 KWTLTS-DG----TIRIKS-SNLCLDVGGDAPGSKVRLYTCSGGSDNQRWTFNK--DGTI--RNLKSGKCLDV   99 (124)
T ss_pred             CEEEeC-CC----eEEEcC-CCeEEcccCCCCCCEEEEEECCCCCcCCEEEECC--CcEE--EECCCCeEEeC
Confidence            899987 32    477776 7888888764   23332 222235667777763  3443  33234444444


No 37 
>KOG1446|consensus
Probab=30.35  E-value=6.7e+02  Score=27.16  Aligned_cols=6  Identities=67%  Similarity=1.154  Sum_probs=3.0

Q ss_pred             CCcEEe
Q psy15563        442 NNKYLS  447 (614)
Q Consensus       442 nGkYL~  447 (614)
                      ++|||+
T Consensus        89 dNkylR   94 (311)
T KOG1446|consen   89 DNKYLR   94 (311)
T ss_pred             cCceEE
Confidence            455554


No 38 
>PF00652 Ricin_B_lectin:  Ricin-type beta-trefoil lectin domain;  InterPro: IPR000772 Ricin is a legume lectin from the seeds of the castor bean plant, Ricinus communis. The seeds are poisonous to people, animals and insects and just one milligram of ricin can kill an adult. Primary structure analysis has shown the presence of a similar domain in many carbohydrate-recognition proteins like plant and bacterial AB-toxins, glycosidases or proteases [, , ]. This domain, known as the ricin B lectin domain, can be present in one or more copies and has been shown in some instance to bind simple sugars, such as galactose or lactose. The ricin B lectin domain is composed of three homologous subdomains of 40 amino acids (alpha, beta and gamma) and a linker peptide of around 15 residues (lambda). It has been proposed that the ricin B lectin domain arose by gene triplication from a primitive 40 residue galactoside-binding peptide [, ]. The most characteristic, though not completely conserved, sequence feature is the presence of a Q-W pattern. Consequently, the ricin B lectin domain as also been refered as the (QxW)3 domain and the three homologous regions as the QxW repeats [, ]. A disulphide bond is also conserved in some of the QxW repeats []. The 3D structure of the ricin B chain has shown that the three QxW repeats pack around a pseudo threefold axis that is stabilised by the lambda linker []. The ricin B lectin domain has no major segments of a helix or beta sheet but each of the QxW repeats contains an omega loop []. An idealized omega-loop is a compact, contiguous segment of polypeptide that traces a 'loop-shaped' path in three-dimensional space; the main chain resembles a Greek omega.; PDB: 2VLC_B 3A22_B 3A21_B 3A23_B 1GGP_B 1VCL_A 2Z48_B 2Z49_A 2D7R_A 2D7I_A ....
Probab=27.93  E-value=3.9e+02  Score=22.92  Aligned_cols=61  Identities=15%  Similarity=0.288  Sum_probs=38.2

Q ss_pred             eEEEEecCCCCceEEEEeecCCceEEecCCe---eEEEeccc-CCCCceEEEEEcCCCeEEEEeeccCceeeec
Q psy15563         88 VWILEPANGGDTLSVYLRSHLGKYLAVDSFG---NVTCEADE-KEPGSLFQIHVSDEGKWAFKNKTRGYFLGAS  157 (614)
Q Consensus        88 vW~l~~~~~~~~~~v~lkSh~GrYL~~d~dG---~v~~~~e~-~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~  157 (614)
                      .|.+....     .++.......-|.++..+   .|.-..-. .....+|.+..    ...|++...++.|...
T Consensus        39 ~w~~~~~~-----~i~~~~~~~~CL~~~~~~~~~~i~l~~C~~~~~~Q~W~~~~----~~~i~n~~s~~cL~~~  103 (124)
T PF00652_consen   39 LWRFDPDG-----QIRSNNNPNLCLDVDGSSPGTKIVLWPCDSNSSNQRWKFDP----DGRIRNKNSGLCLDVK  103 (124)
T ss_dssp             EEEEETTS-----BEEETTETTEEEEESSSSTTEBEEEEETTTTGGGGBEEEET----TSBEEETTTTEEEEEG
T ss_pred             eEEEcCCC-----ceeeccCcceEEEeeccCCCceEEEeeccCCccCCeEEEcC----CeeEEeCCCCEEEEec
Confidence            99888752     356555566688888777   33333333 33337888875    5556665567777774


No 39 
>KOG0649|consensus
Probab=26.37  E-value=8.1e+02  Score=26.07  Aligned_cols=52  Identities=25%  Similarity=0.296  Sum_probs=29.7

Q ss_pred             EEEEEECCCceEE-EecCCeEEEecCCCCCCCceEEEEEcCCCCccEEEEEecCCcEEEE
Q psy15563        262 TIGLINAQFKYLT-AETFGFKINANGASLKKKQVWILEPANGGDTLSVYLRSHLGKYLAV  320 (614)
Q Consensus       262 ~V~L~s~~GkYLt-ae~~g~~V~An~~~l~~~E~F~Le~~~~~~~~~v~LRS~~GkYLs~  320 (614)
                      .|.+|++++.-|| +|++.++|=..++   .+++=.+|.-    ...-.||-++||++.+
T Consensus       161 ~vv~R~~~~qilsG~EDGtvRvWd~kt---~k~v~~ie~y----k~~~~lRp~~g~wiga  213 (325)
T KOG0649|consen  161 SVVGRNANGQILSGAEDGTVRVWDTKT---QKHVSMIEPY----KNPNLLRPDWGKWIGA  213 (325)
T ss_pred             eeeecccCcceeecCCCccEEEEeccc---cceeEEeccc----cChhhcCcccCceeEE
Confidence            6888999999998 4443355432222   2233334321    2245677778887665


No 40 
>PF02751 TFIIA_gamma_C:  Transcription initiation factor IIA, gamma subunit;  InterPro: IPR015871 Transcription factor IIA (TFIIA) is one of several factors that form part of a transcription pre-initiation complex along with RNA polymerase II, the TATA-box-binding protein (TBP) and TBP-associated factors, on the TATA-box sequence upstream of the initiation start site. After initiation, some components of the pre-initiation complex (including TFIIA) remain attached and re-initiate a subsequent round of transcription. TFIIA binds to TBP to stabilise TBP binding to the TATA element. TFIIA also inhibits the cytokine HMGB1 (high mobility group 1 protein) binding to TBP [], and can dissociate HMGB1 already bound to TBP/TATA-box. Human and Drosophila TFIIA have three subunits: two large subunits, LN/alpha and LC/beta, derived from the same gene, and a small subunit, S/gamma. Yeast TFIIA has two subunits: a large TOA1 subunit that shows sequence similarity to the N-terminal of LN/alpha and the C-terminal of LC/beta, and a small subunit, TOA2 that is highly homologous with S/gamma. The conserved regions of the large and small subunits of TFIIA combine to form two domains: a four-helix bundle (helical domain) composed of two helices from each of the N-terminal regions of TOA1 and TOA2 in yeast; and a beta-barrel (beta-barrel domain) composed of beta-sheets from the C-terminal regions of TOA1 and TOA2 []. This entry represents the beta-barrel domain found at the C-terminal of the gamma subunit of transcription factor TFIIA. ; GO: 0006367 transcription initiation from RNA polymerase II promoter, 0005672 transcription factor TFIIA complex; PDB: 1NVP_D 1RM1_B 1YTF_D 1NH2_D.
Probab=24.96  E-value=1.2e+02  Score=24.28  Aligned_cols=33  Identities=24%  Similarity=0.464  Sum_probs=25.6

Q ss_pred             ceeEEEEecCCCCceEEEEee-cCCceEEecCCeeEEEec
Q psy15563         86 DGVWILEPANGGDTLSVYLRS-HLGKYLAVDSFGNVTCEA  124 (614)
Q Consensus        86 ~~vW~l~~~~~~~~~~v~lkS-h~GrYL~~d~dG~v~~~~  124 (614)
                      |.|||....      .+.||- -.+.++.+|+--.|.|++
T Consensus        19 DnVWTFi~k------n~~fk~~~~~~~~~~dkvKIVACd~   52 (52)
T PF02751_consen   19 DNVWTFILK------NVEFKMEDNNETVKVDKVKIVACDS   52 (52)
T ss_dssp             TTEEEEEEE------EEEEEEE-SSEEEEEEEEEEEEEES
T ss_pred             CcEEEEEEc------CEEEEEecCCcEEEcCeEEEEEcCC
Confidence            569999874      377876 688899999888888874


No 41 
>smart00458 RICIN Ricin-type beta-trefoil. Carbohydrate-binding domain formed from presumed gene triplication.
Probab=22.01  E-value=5e+02  Score=22.09  Aligned_cols=63  Identities=17%  Similarity=0.326  Sum_probs=34.9

Q ss_pred             CCCceEEEEEcCCCCccEEEEEecCCcEEEEccCC--eEEE-eccCCCCCceEEEEEeCCCeEEEEECCCCeEEEee
Q psy15563        290 KKKQVWILEPANGGDTLSVYLRSHLGKYLAVDSFG--NVTC-EADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGAS  363 (614)
Q Consensus       290 ~~~E~F~Le~~~~~~~~~v~LRS~~GkYLs~~~~G--~V~a-~a~~~g~~E~F~le~~~dGk~aLrs~~~G~yL~a~  363 (614)
                      +..|.|.+..     .+.+.++  .++.|.+....  .|.. .-+...+..+|.+..  +|.  |++...+..|...
T Consensus        28 ~~~Q~w~~~~-----~g~~~~~--~~~Cl~~~~~~~~~v~l~~c~~~~~~Q~W~~~~--~~~--i~~~~~~~cl~~~   93 (117)
T smart00458       28 GGNQLWKLTS-----DGAIRIA--TDLCLTANGNTGSTVTLYSCDGDADNQYWTVNK--DGT--IRNPDSGLCLDVK   93 (117)
T ss_pred             CccceEEEeC-----CCeEEec--CCccCccCCCCCCEEEEEECCCCCcCCEEEECC--Cee--EEeCCCCEEEecC
Confidence            4568998873     3456665  67777765432  3322 222225667777654  343  4444467777654


No 42 
>COG5402 Uncharacterized conserved protein [Function unknown]
Probab=21.85  E-value=1.8e+02  Score=28.98  Aligned_cols=105  Identities=13%  Similarity=0.209  Sum_probs=72.9

Q ss_pred             ccccccceeEEEEecCCCCceEEEEeec---CCceEEecCCee-EEEecccCC----CCceEEEE--EcCCCeEEEEeec
Q psy15563         80 SLKKKQDGVWILEPANGGDTLSVYLRSH---LGKYLAVDSFGN-VTCEADEKE----PGSLFQIH--VSDEGKWAFKNKT  149 (614)
Q Consensus        80 ~~~~~q~~vW~l~~~~~~~~~~v~lkSh---~GrYL~~d~dG~-v~~~~e~~~----~~~~F~i~--~~~dG~waLqs~~  149 (614)
                      |+|--|   |++.+..+...+..+=|+|   +|.++.--..|. .++..++.+    ..|++.|+  +-|---|+|-...
T Consensus        35 siriG~---W~afp~agT~~ADpYskA~~Ar~g~l~lg~aEgl~F~A~~Ds~g~~Lr~~C~Yri~G~~ppARfWTly~a~  111 (194)
T COG5402          35 SIRIGP---WEAFPEAGTPKADPYSKARRARAGLLALGRAEGLSFTATTDSAGQRLRRECSYRIEGGTPPARFWTLYAAD  111 (194)
T ss_pred             ceeecc---cccccccCCccCChhHHHHHhhhheeecccccceeEEEeecCCCchhhccCcEEecCCCCCceeEEEEEec
Confidence            444444   9999887654456888887   788887777884 777777654    46788887  2234667876633


Q ss_pred             -cCceeeecCCeEEEE-ecCC--CCCcceEEEEcccceeeec
Q psy15563        150 -RGYFLGASSDKLICT-AKIP--ANPEYWLAHLAARPQLRSC  187 (614)
Q Consensus       150 -~g~yl~~~~d~~~c~-~~~~--~~~~~W~~hlA~Hpqv~~~  187 (614)
                       .+.+|.-.+...++. ++++  ++...|+++++-|+|=..|
T Consensus       112 ~d~~~l~p~~~~~~Al~s~~Vlrs~Dg~~~I~vsp~a~pgnW  153 (194)
T COG5402         112 QDLHVLAPGEGRPSALNSRTVLRSADGSWVITVSPDAQPGNW  153 (194)
T ss_pred             CCCceecCCCCCchheeceeeeeecCCcEEEEECCCCCCCce
Confidence             447777767766653 3443  4578999999999997666


No 43 
>smart00472 MIR Domain in ryanodine and inositol trisphosphate receptors and protein O-mannosyltransferases.
Probab=21.55  E-value=2.3e+02  Score=21.77  Aligned_cols=38  Identities=26%  Similarity=0.308  Sum_probs=24.4

Q ss_pred             eEEEEeec-CCceEEecCCe---------eEEEecccC-CCCceEEEEE
Q psy15563        100 LSVYLRSH-LGKYLAVDSFG---------NVTCEADEK-EPGSLFQIHV  137 (614)
Q Consensus       100 ~~v~lkSh-~GrYL~~d~dG---------~v~~~~e~~-~~~~~F~i~~  137 (614)
                      +.|+||-- .|+||.+...-         .|+|..+.. .....|.|+.
T Consensus         8 ~~vrL~H~~tg~yL~s~~~~~~~~~~~q~eVt~~~~~~~~~~~~W~ie~   56 (57)
T smart00472        8 DVVRLRHVTTGRYLHSHENKLPPWGDGQQEVTGYGNPAGDANTLWLIEP   56 (57)
T ss_pred             CEEEEEEhhhCcEeecCCCCCCCCCCCcceEEEECCCCCCCCCcEEEEe
Confidence            47999853 79999874322         477766554 4555666653


No 44 
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=21.24  E-value=4.4e+02  Score=30.96  Aligned_cols=88  Identities=16%  Similarity=0.260  Sum_probs=44.8

Q ss_pred             eEEEEecCCCCceEEEEe-ecCCce------EEecCCeeEEEecccCCCCceEEEEEcCCCeEEEEeeccCceeeecCCe
Q psy15563         88 VWILEPANGGDTLSVYLR-SHLGKY------LAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASSDK  160 (614)
Q Consensus        88 vW~l~~~~~~~~~~v~lk-Sh~GrY------L~~d~dG~v~~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~~d~  160 (614)
                      =|+.|.++- +....=+| +.+|||      +.+++||+|++=.-.- ....|+-.|-+.+++..-....++=|- ++..
T Consensus       287 gwvVEidp~-~p~stPvKrTalGRFkHE~a~v~v~~~~~vVvYlGDD-er~~~~YKFVsn~~y~~~~~~sn~~Ll-deGt  363 (616)
T COG3211         287 GWVVEIDPN-DPNSTPVKRTALGRFKHEAAAVDVNKDGRVVVYLGDD-ERGEYFYKFVSNGRYNPGASASNRDLL-DEGT  363 (616)
T ss_pred             cceeecCCC-CCCccceeeeccccccccceeEEecCCCeEEEEeccC-CCCceEEEeccCCCCCCCccchhhhhh-hcce
Confidence            599999875 33333333 789998      7788999876654321 122233344444544443321111110 1223


Q ss_pred             EEEEecCCCCCcceEEEE
Q psy15563        161 LICTAKIPANPEYWLAHL  178 (614)
Q Consensus       161 ~~c~~~~~~~~~~W~~hl  178 (614)
                      |..-.--..-+..|.+..
T Consensus       364 LyVAkfn~dGtg~W~pl~  381 (616)
T COG3211         364 LYVAKFNEDGTGNWLPLV  381 (616)
T ss_pred             EEEEEeCCCCCcceeeee
Confidence            333232334567887744


Done!