Query psy15563
Match_columns 614
No_of_seqs 291 out of 453
Neff 5.7
Searched_HMMs 46136
Date Sat Aug 17 00:27:52 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy15563.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15563hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd00257 Fascin Fascin-like dom 99.9 7.9E-22 1.7E-26 178.7 16.0 117 261-382 2-119 (119)
2 PF06268 Fascin: Fascin domain 99.8 1.1E-19 2.4E-24 163.7 12.2 110 37-176 1-111 (111)
3 PF06268 Fascin: Fascin domain 99.8 1.6E-19 3.6E-24 162.6 12.9 110 269-382 1-111 (111)
4 cd00257 Fascin Fascin-like dom 99.8 3.2E-19 6.9E-24 161.6 14.3 117 29-176 2-119 (119)
5 PF06229 FRG1: FRG1-like famil 99.7 2.8E-16 6.2E-21 154.4 10.0 90 84-178 27-117 (191)
6 PF06229 FRG1: FRG1-like famil 99.5 3.4E-13 7.3E-18 132.8 12.5 97 286-385 21-118 (191)
7 PF04601 DUF569: Protein of un 99.0 1.3E-09 2.7E-14 102.5 10.9 109 99-224 7-133 (142)
8 PF04601 DUF569: Protein of un 98.7 3.7E-07 8.1E-12 86.0 14.2 101 262-365 9-118 (142)
9 KOG3962|consensus 98.6 1.4E-07 3.1E-12 93.6 9.9 97 262-363 49-149 (246)
10 KOG3962|consensus 98.5 1.6E-07 3.4E-12 93.3 5.1 79 89-172 86-165 (246)
11 smart00791 Agglutinin Amaranth 98.0 0.00019 4E-09 67.0 13.3 112 261-376 6-131 (139)
12 PF00167 FGF: Fibroblast growt 97.9 0.00061 1.3E-08 62.5 15.2 101 307-408 2-105 (122)
13 PF07468 Agglutinin: Agglutini 97.7 0.0011 2.4E-08 63.2 13.7 120 262-382 7-151 (153)
14 cd00058 FGF Acidic and basic f 97.5 0.0026 5.7E-08 58.9 13.5 97 310-407 3-102 (123)
15 PF00167 FGF: Fibroblast growt 97.4 0.0046 9.9E-08 56.7 14.2 101 263-368 3-107 (122)
16 smart00442 FGF Acidic and basi 97.4 0.0067 1.4E-07 56.4 14.9 100 307-407 4-106 (126)
17 PF14200 RicinB_lectin_2: Rici 97.1 0.0075 1.6E-07 53.4 11.9 92 291-383 1-105 (105)
18 smart00791 Agglutinin Amaranth 97.1 0.0057 1.2E-07 57.3 11.3 97 307-403 7-111 (139)
19 cd00058 FGF Acidic and basic f 97.0 0.0044 9.5E-08 57.4 9.1 76 102-179 3-78 (123)
20 PF14200 RicinB_lectin_2: Rici 96.9 0.014 2.9E-07 51.7 11.2 88 88-176 4-104 (105)
21 PF07468 Agglutinin: Agglutini 96.9 0.012 2.5E-07 56.4 11.0 100 490-606 7-125 (153)
22 smart00442 FGF Acidic and basi 96.4 0.026 5.7E-07 52.5 9.9 78 102-179 5-82 (126)
23 KOG3885|consensus 95.1 0.079 1.7E-06 51.1 7.7 74 307-382 29-103 (155)
24 PF05270 AbfB: Alpha-L-arabino 94.2 1.4 3.1E-05 42.0 13.8 108 262-371 2-124 (142)
25 KOG3885|consensus 90.4 1.1 2.5E-05 43.3 7.8 61 101-162 29-90 (155)
26 PF05270 AbfB: Alpha-L-arabino 81.4 12 0.00026 35.8 9.5 106 34-163 8-122 (142)
27 cd00161 RICIN Ricin-type beta- 64.3 91 0.002 26.6 11.4 85 269-363 9-100 (124)
28 PF03498 CDtoxinA: Cytolethal 63.5 21 0.00046 34.4 6.5 78 101-180 53-145 (150)
29 KOG1407|consensus 58.6 2.5E+02 0.0055 29.9 16.4 21 489-510 191-212 (313)
30 PF03498 CDtoxinA: Cytolethal 52.5 77 0.0017 30.6 8.3 57 305-363 51-111 (150)
31 KOG0308|consensus 47.7 43 0.00094 39.2 6.6 63 101-163 206-281 (735)
32 KOG2243|consensus 46.8 40 0.00087 42.6 6.3 204 145-361 122-377 (5019)
33 PF00652 Ricin_B_lectin: Ricin 43.9 1.5E+02 0.0033 25.6 8.4 93 262-363 4-103 (124)
34 COG2706 3-carboxymuconate cycl 40.2 4.6E+02 0.01 28.8 12.6 196 106-351 97-329 (346)
35 COG0211 RpmA Ribosomal protein 33.9 1.1E+02 0.0024 26.8 5.5 61 483-546 13-82 (87)
36 cd00161 RICIN Ricin-type beta- 30.9 3.3E+02 0.0071 23.0 10.9 59 88-156 37-99 (124)
37 KOG1446|consensus 30.3 6.7E+02 0.015 27.2 11.7 6 442-447 89-94 (311)
38 PF00652 Ricin_B_lectin: Ricin 27.9 3.9E+02 0.0084 22.9 8.8 61 88-157 39-103 (124)
39 KOG0649|consensus 26.4 8.1E+02 0.017 26.1 11.8 52 262-320 161-213 (325)
40 PF02751 TFIIA_gamma_C: Transc 25.0 1.2E+02 0.0026 24.3 3.9 33 86-124 19-52 (52)
41 smart00458 RICIN Ricin-type be 22.0 5E+02 0.011 22.1 10.8 63 290-363 28-93 (117)
42 COG5402 Uncharacterized conser 21.9 1.8E+02 0.0038 29.0 5.1 105 80-187 35-153 (194)
43 smart00472 MIR Domain in ryano 21.6 2.3E+02 0.005 21.8 5.0 38 100-137 8-56 (57)
44 COG3211 PhoX Predicted phospha 21.2 4.4E+02 0.0096 31.0 8.9 88 88-178 287-381 (616)
No 1
>cd00257 Fascin Fascin-like domain; members include actin-bundling/crosslinking proteins facsin, histoactophilin and singed; identified in sea urchin, Drosophila, Xenopus, rodents, and humans; The fascin-like domain adopts a beta-trefoil topology and contains an internal threefold repeat; the fascin subgroup contains four copies of the domain; Structurally similar to fibroblast growth factor (FGF)
Probab=99.88 E-value=7.9e-22 Score=178.66 Aligned_cols=117 Identities=33% Similarity=0.471 Sum_probs=109.2
Q ss_pred eEEEEEECCCceEEEecCCeEEEecCCCCCCCceEEEEEcCCCCccEEEEEecCCcEEEEccCCeEEEeccCCCCCceEE
Q psy15563 261 WTIGLINAQFKYLTAETFGFKINANGASLKKKQVWILEPANGGDTLSVYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQ 340 (614)
Q Consensus 261 ~~V~L~s~~GkYLtae~~g~~V~An~~~l~~~E~F~Le~~~~~~~~~v~LRS~~GkYLs~~~~G~V~a~a~~~g~~E~F~ 340 (614)
.+|+|++.+||||++++.|..|.|+++.++++|+|+|++. +.+.|+|||++||||+++.+|.|.++++ ++++|+|+
T Consensus 2 p~v~Lrs~~gkyl~~~~~g~~v~a~~~~~~~~e~F~l~~~---~~g~v~Lrs~~G~yls~~~~g~l~~~~~-~~~~e~F~ 77 (119)
T cd00257 2 PQVVLRSVNGRYLSAEAGGDKVDANRDSLKGDETFTLEFD---NTGKYALRSHDGKYLSADSDGGVQLEGH-PNADCRFT 77 (119)
T ss_pred cEEEEEEcCCCEEEEeccCCEEEEcCccCCCceEEEEEEC---CCCeEEEEECCCcEEEEECCCCEEecCC-CCCCcEEE
Confidence 3899999999999999887799999999999999999973 4679999999999999999999999999 99999999
Q ss_pred EEEeCCCeEEEEECCCCeEEEee-CCeEEEeecCCCCCccEEE
Q psy15563 341 IHVSDEGKWAFKNKTRGYFLGAS-SDKLICTAKIPANPEYWLA 382 (614)
Q Consensus 341 le~~~dGk~aLrs~~~G~yL~a~-~G~L~a~~~s~g~~E~F~v 382 (614)
+|++++|+|+||+. +|+||+++ +|.|.|+++.++++|.|.+
T Consensus 78 ~e~~~~g~~al~~~-~G~yl~~~~~g~l~~~~~~~~~~e~f~~ 119 (119)
T cd00257 78 LEFHGDGKWALRAE-NGRYLGGDGSGTLKASSETVGPDELFEL 119 (119)
T ss_pred EEECCCCeEEEEcC-CCCEEeecCCCeEEEecCCCCccceecC
Confidence 99998899999999 99999996 7999999999999999974
No 2
>PF06268 Fascin: Fascin domain; InterPro: IPR022768 This family consists of several eukaryotic fascin or singed proteins. The fascins are a structurally unique and evolutionarily conserved group of actin cross-linking proteins. Fascins function in the organisation of two major forms of actin-based structures: dynamic, cortical cell protrusions and cytoplasmic microfilament bundles. The cortical structures, which include filopodia, spikes, lamellipodial ribs, oocyte microvilli and the dendrites of dendritic cells, have roles in cell-matrix adhesion, cell interactions and cell migration, whereas the cytoplasmic actin bundles appear to participate in cell architecture []. Dictyostelium hisactophilin, another actin-binding protein, is a submembranous pH sensor that signals slight changes of the H+ concentration to actin by inducing actin polymerisation and binding to microfilaments only at pH values below seven []. Members of this family are histidine rich, typically contain the repeated motif of HHXH []. ; GO: 0030674 protein binding, bridging, 0051015 actin filament binding; PDB: 1DFC_B 3P53_B 3LLP_A 1HCE_A 1HCD_A.
Probab=99.82 E-value=1.1e-19 Score=163.74 Aligned_cols=110 Identities=35% Similarity=0.516 Sum_probs=101.2
Q ss_pred CCceeeeeecCeeEEeccccccccccccccceecceeecccccccccccceeEEEEecCCCCceEEEEeecCCceEEecC
Q psy15563 37 QFKYLTAETFGFKINANGASLKKKQFKYLTAETFGFKINANGASLKKKQDGVWILEPANGGDTLSVYLRSHLGKYLAVDS 116 (614)
Q Consensus 37 ~~~YLtaE~fgfkvna~~~slkkkQ~~~~~~~~~~~~~~~~~~~~~~~q~~vW~l~~~~~~~~~~v~lkSh~GrYL~~d~ 116 (614)
+++|+++|.+|++|+|++.+|++.| +|+|+..++ +..|+||||+||||++|.
T Consensus 1 ~~~~~~~~k~~~~l~an~~~~~~~e--------------------------~f~le~~~~--~~~v~lrs~~GkYls~~~ 52 (111)
T PF06268_consen 1 ANGYLVSEKFGAHLNANRASLSDWE--------------------------TFQLEFDDG--SYKVALRSHNGKYLSVDS 52 (111)
T ss_dssp TTEEEEETTCTCBEEEEESSSSCGG--------------------------SEEEEEETT--EEEEEEECTTSEEEEEET
T ss_pred CCcEEEEEEcCCEEECChhcCcccE--------------------------EEEEEEECC--CCEEEEEcCCCCEEEEcC
Confidence 5899999999999999999999999 999998765 567999999999999999
Q ss_pred CeeEEEecccCCCCceEEEEEcCCCeEEEEeeccCceee-ecCCeEEEEecCCCCCcceEE
Q psy15563 117 FGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLG-ASSDKLICTAKIPANPEYWLA 176 (614)
Q Consensus 117 dG~v~~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~-~~~d~~~c~~~~~~~~~~W~~ 176 (614)
||.|.|+++.+++++.|+|+.+ |+|.++...+|+||+ ...+.|.|.++.+++.|+|++
T Consensus 53 ~G~v~~~~~~~~~~~~F~i~~~--~~~~~~~~~nGkYl~~~~~g~l~a~~~~~~~~elf~~ 111 (111)
T PF06268_consen 53 DGSVVADSETPGPDEFFEIEWH--GGKVALRASNGKYLSAGPNGQLKANATSPGKDELFEY 111 (111)
T ss_dssp TSEEEEEESSSSGGGCBEEEEE--TTEEEEECTTSCEEEEETTTEEEEEESSSSGGGEEEE
T ss_pred CCeEEecCCCCCCCcEEEEEEC--CCEEEEECCCCCEEeeCCCCeEEEcCCCCCcceEEeC
Confidence 9999999999999999999999 666666669999999 688999999999999999975
No 3
>PF06268 Fascin: Fascin domain; InterPro: IPR022768 This family consists of several eukaryotic fascin or singed proteins. The fascins are a structurally unique and evolutionarily conserved group of actin cross-linking proteins. Fascins function in the organisation of two major forms of actin-based structures: dynamic, cortical cell protrusions and cytoplasmic microfilament bundles. The cortical structures, which include filopodia, spikes, lamellipodial ribs, oocyte microvilli and the dendrites of dendritic cells, have roles in cell-matrix adhesion, cell interactions and cell migration, whereas the cytoplasmic actin bundles appear to participate in cell architecture []. Dictyostelium hisactophilin, another actin-binding protein, is a submembranous pH sensor that signals slight changes of the H+ concentration to actin by inducing actin polymerisation and binding to microfilaments only at pH values below seven []. Members of this family are histidine rich, typically contain the repeated motif of HHXH []. ; GO: 0030674 protein binding, bridging, 0051015 actin filament binding; PDB: 1DFC_B 3P53_B 3LLP_A 1HCE_A 1HCD_A.
Probab=99.81 E-value=1.6e-19 Score=162.60 Aligned_cols=110 Identities=37% Similarity=0.564 Sum_probs=100.8
Q ss_pred CCceEEEecCCeEEEecCCCCCCCceEEEEEcCCCCccEEEEEecCCcEEEEccCCeEEEeccCCCCCceEEEEEeCCCe
Q psy15563 269 QFKYLTAETFGFKINANGASLKKKQVWILEPANGGDTLSVYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGK 348 (614)
Q Consensus 269 ~GkYLtae~~g~~V~An~~~l~~~E~F~Le~~~~~~~~~v~LRS~~GkYLs~~~~G~V~a~a~~~g~~E~F~le~~~dGk 348 (614)
.++|++++.++++|.||+..++++|+|+||...+ ...|+|||++||||+++.+|.|+|+++.+++++.|+|+++ +++
T Consensus 1 ~~~~~~~~k~~~~l~an~~~~~~~e~f~le~~~~--~~~v~lrs~~GkYls~~~~G~v~~~~~~~~~~~~F~i~~~-~~~ 77 (111)
T PF06268_consen 1 ANGYLVSEKFGAHLNANRASLSDWETFQLEFDDG--SYKVALRSHNGKYLSVDSDGSVVADSETPGPDEFFEIEWH-GGK 77 (111)
T ss_dssp TTEEEEETTCTCBEEEEESSSSCGGSEEEEEETT--EEEEEEECTTSEEEEEETTSEEEEEESSSSGGGCBEEEEE-TTE
T ss_pred CCcEEEEEEcCCEEECChhcCcccEEEEEEEECC--CCEEEEEcCCCCEEEEcCCCeEEecCCCCCCCcEEEEEEC-CCE
Confidence 3689999999999999999999999999997653 7788999999999999999999999999999999999999 677
Q ss_pred EEEEECCCCeEEEee-CCeEEEeecCCCCCccEEE
Q psy15563 349 WAFKNKTRGYFLGAS-SDKLICTAKIPANPEYWLA 382 (614)
Q Consensus 349 ~aLrs~~~G~yL~a~-~G~L~a~~~s~g~~E~F~v 382 (614)
++|++. +|+||++. +|.|.|+++.|+++|+|.+
T Consensus 78 ~~~~~~-nGkYl~~~~~g~l~a~~~~~~~~elf~~ 111 (111)
T PF06268_consen 78 VALRAS-NGKYLSAGPNGQLKANATSPGKDELFEY 111 (111)
T ss_dssp EEEECT-TSCEEEEETTTEEEEEESSSSGGGEEEE
T ss_pred EEEECC-CCCEEeeCCCCeEEEcCCCCCcceEEeC
Confidence 777777 99999975 7999999999999999975
No 4
>cd00257 Fascin Fascin-like domain; members include actin-bundling/crosslinking proteins facsin, histoactophilin and singed; identified in sea urchin, Drosophila, Xenopus, rodents, and humans; The fascin-like domain adopts a beta-trefoil topology and contains an internal threefold repeat; the fascin subgroup contains four copies of the domain; Structurally similar to fibroblast growth factor (FGF)
Probab=99.81 E-value=3.2e-19 Score=161.64 Aligned_cols=117 Identities=33% Similarity=0.462 Sum_probs=108.1
Q ss_pred EEEEEEecCCceeeeeecCeeEEeccccccccccccccceecceeecccccccccccceeEEEEecCCCCceEEEEeecC
Q psy15563 29 WTIGLINAQFKYLTAETFGFKINANGASLKKKQFKYLTAETFGFKINANGASLKKKQDGVWILEPANGGDTLSVYLRSHL 108 (614)
Q Consensus 29 ~~~Glin~~~~YLtaE~fgfkvna~~~slkkkQ~~~~~~~~~~~~~~~~~~~~~~~q~~vW~l~~~~~~~~~~v~lkSh~ 108 (614)
+++.|.+.+||||++|..|+.|.|++..+..+| +|.+++.+. ..|.|||+.
T Consensus 2 p~v~Lrs~~gkyl~~~~~g~~v~a~~~~~~~~e--------------------------~F~l~~~~~---g~v~Lrs~~ 52 (119)
T cd00257 2 PQVVLRSVNGRYLSAEAGGDKVDANRDSLKGDE--------------------------TFTLEFDNT---GKYALRSHD 52 (119)
T ss_pred cEEEEEEcCCCEEEEeccCCEEEEcCccCCCce--------------------------EEEEEECCC---CeEEEEECC
Confidence 689999999999999999999999999999988 999998643 369999999
Q ss_pred CceEEecCCeeEEEecccCCCCceEEEEEcCCCeEEEEeeccCceeeec-CCeEEEEecCCCCCcceEE
Q psy15563 109 GKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGAS-SDKLICTAKIPANPEYWLA 176 (614)
Q Consensus 109 GrYL~~d~dG~v~~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~-~d~~~c~~~~~~~~~~W~~ 176 (614)
||||+++++|.|+++++ +++.|+|.++.++||+|+||+ .+|+||+++ .+.|.|.++++++.|+|.+
T Consensus 53 G~yls~~~~g~l~~~~~-~~~~e~F~~e~~~~g~~al~~-~~G~yl~~~~~g~l~~~~~~~~~~e~f~~ 119 (119)
T cd00257 53 GKYLSADSDGGVQLEGH-PNADCRFTLEFHGDGKWALRA-ENGRYLGGDGSGTLKASSETVGPDELFEL 119 (119)
T ss_pred CcEEEEECCCCEEecCC-CCCCcEEEEEECCCCeEEEEc-CCCCEEeecCCCeEEEecCCCCccceecC
Confidence 99999999999999999 999999999999999999999 699999996 6789999999999999974
No 5
>PF06229 FRG1: FRG1-like family; InterPro: IPR010414 This entry represents Frg1 (FSHD region gene 1), a protein that is considered to be a candidate for facioscapulohumeral muscular dystrophy (FSHD). FSHD is a dominant neuromuscular disorder caused by deletions in a number of tandem repeat units (called D4Z4) located on chromosome 4q35. D4Z4 contains a transcriptional silencer whose deletion causes the over-expression in skeletal muscle of 4q35 genes, including Frg1 [, ]. Frg1 is localised to nucleoli and appears to be a component of the human spliceosome, but its exact function is unknown [].; PDB: 2YUG_A.
Probab=99.66 E-value=2.8e-16 Score=154.45 Aligned_cols=90 Identities=24% Similarity=0.378 Sum_probs=59.3
Q ss_pred ccceeEEEEecCCCCceEEEEeecCCceEEecCCeeEEEecccCCCCceEEEEEcCCCeEEEEeeccCceeeecC-CeEE
Q psy15563 84 KQDGVWILEPANGGDTLSVYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASS-DKLI 162 (614)
Q Consensus 84 ~q~~vW~l~~~~~~~~~~v~lkSh~GrYL~~d~dG~v~~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~~-d~~~ 162 (614)
+| |||+.++++ +..|.||||+||||+||++|.|+|.+||+||.|+|+++++ +|++++++..++.||+.++ +.|.
T Consensus 27 ~q--V~va~~v~~--~~~iafKs~~GkYLs~Dk~G~v~a~sdAiGp~E~f~~V~~-~~~~a~~~~~~~~FLs~~~~~~i~ 101 (191)
T PF06229_consen 27 RQ--VWVATRVPG--DEKIAFKSGHGKYLSCDKDGIVSARSDAIGPQEQFEPVFQ-DGKPALFSSSNNKFLSVDEEGDIR 101 (191)
T ss_dssp TT---EEEEE--S--SS-EEEEETTS-BEEE-SSSBEEE--SS--TTTBEEEE-S-TT--EEEE-TTS-BEEE-SSS-EE
T ss_pred hH--eEEEEEecC--CCceEeeccCccEEEEcCCCcEEEEeecCCCceEEEEEEC-CCCeEEEecCCCeEEEEecccCee
Confidence 56 999999975 4579999999999999999999999999999999999998 6788888779999999965 5688
Q ss_pred EEecCCCCCcceEEEE
Q psy15563 163 CTAKIPANPEYWLAHL 178 (614)
Q Consensus 163 c~~~~~~~~~~W~~hl 178 (614)
|.++++++.|++.|+-
T Consensus 102 a~s~~a~~~e~~~iR~ 117 (191)
T PF06229_consen 102 ADSKTAGENEMIKIRS 117 (191)
T ss_dssp E--S---TTT--EEEE
T ss_pred eccccCCCCceEEEEE
Confidence 9999999999988854
No 6
>PF06229 FRG1: FRG1-like family; InterPro: IPR010414 This entry represents Frg1 (FSHD region gene 1), a protein that is considered to be a candidate for facioscapulohumeral muscular dystrophy (FSHD). FSHD is a dominant neuromuscular disorder caused by deletions in a number of tandem repeat units (called D4Z4) located on chromosome 4q35. D4Z4 contains a transcriptional silencer whose deletion causes the over-expression in skeletal muscle of 4q35 genes, including Frg1 [, ]. Frg1 is localised to nucleoli and appears to be a component of the human spliceosome, but its exact function is unknown [].; PDB: 2YUG_A.
Probab=99.47 E-value=3.4e-13 Score=132.82 Aligned_cols=97 Identities=24% Similarity=0.366 Sum_probs=61.2
Q ss_pred CCCCCCCceEEEEEcCCCCccEEEEEecCCcEEEEccCCeEEEeccCCCCCceEEEEEeCCCeEEEEECCCCeEEEeeC-
Q psy15563 286 GASLKKKQVWILEPANGGDTLSVYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASS- 364 (614)
Q Consensus 286 ~~~l~~~E~F~Le~~~~~~~~~v~LRS~~GkYLs~~~~G~V~a~a~~~g~~E~F~le~~~dGk~aLrs~~~G~yL~a~~- 364 (614)
+..+.+.|+|++...++ +.+|+|||++||||+++++|.|+|+++++|++|+|++++. +|++++.+..+++||++++
T Consensus 21 ~~~p~p~qV~va~~v~~--~~~iafKs~~GkYLs~Dk~G~v~a~sdAiGp~E~f~~V~~-~~~~a~~~~~~~~FLs~~~~ 97 (191)
T PF06229_consen 21 GEGPDPRQVWVATRVPG--DEKIAFKSGHGKYLSCDKDGIVSARSDAIGPQEQFEPVFQ-DGKPALFSSSNNKFLSVDEE 97 (191)
T ss_dssp S----TTT-EEEEE--S--SS-EEEEETTS-BEEE-SSSBEEE--SS--TTTBEEEE-S-TT--EEEE-TTS-BEEE-SS
T ss_pred CCCCChhHeEEEEEecC--CCceEeeccCccEEEEcCCCcEEEEeecCCCceEEEEEEC-CCCeEEEecCCCeEEEEecc
Confidence 33456899999998753 6799999999999999999999999999999999999998 5678888834999999974
Q ss_pred CeEEEeecCCCCCccEEEEec
Q psy15563 365 DKLICTAKIPANPEYWLAHLA 385 (614)
Q Consensus 365 G~L~a~~~s~g~~E~F~v~La 385 (614)
+.|.|+++++++.|.+.|+..
T Consensus 98 ~~i~a~s~~a~~~e~~~iR~~ 118 (191)
T PF06229_consen 98 GDIRADSKTAGENEMIKIRSD 118 (191)
T ss_dssp S-EEE--S---TTT--EEEE-
T ss_pred cCeeeccccCCCCceEEEEEe
Confidence 679999999999999998744
No 7
>PF04601 DUF569: Protein of unknown function (DUF569); InterPro: IPR007679 This is a family of hypothetical proteins. Some family members contain two copies of the region.
Probab=99.05 E-value=1.3e-09 Score=102.53 Aligned_cols=109 Identities=21% Similarity=0.278 Sum_probs=84.5
Q ss_pred ceEEEEeecCCceEEecCCee-EEEecccCCCCceEEEEEcCC--CeEEEEeeccCceeeecCCeEE----------EEe
Q psy15563 99 TLSVYLRSHLGKYLAVDSFGN-VTCEADEKEPGSLFQIHVSDE--GKWAFKNKTRGYFLGASSDKLI----------CTA 165 (614)
Q Consensus 99 ~~~v~lkSh~GrYL~~d~dG~-v~~~~e~~~~~~~F~i~~~~d--G~waLqs~~~g~yl~~~~d~~~----------c~~ 165 (614)
...|+||||+|+||.||.||. |+.+....+....|.++.-++ .-.-||| ++||||+++++.+. +..
T Consensus 7 ~~~VRLRS~~~kYL~ADeDg~~Vs~~~~~~s~na~W~Ve~v~~~~~~v~L~s-aYGrYL~as~~~~~lG~~G~~v~Q~~~ 85 (142)
T PF04601_consen 7 GKHVRLRSHHGKYLHADEDGEGVSQDRRGASLNAAWTVERVPGSPNYVRLRS-AYGRYLAASDEPALLGHTGRRVVQTDP 85 (142)
T ss_pred CCEEEEEecCCCEEEEcCCCCeEEECCCCCCCcceEEEEEecCCCCEEEEee-ccCceEeccCCcCCCCCCCCEEEecCC
Confidence 458999999999999999996 999999999999999987544 7899999 99999999888653 122
Q ss_pred cCCCCCcceEE-EEcccceeeeccCCCCcccCCCCCCCCceeeeeeCCCC----CcccCCCCCc
Q psy15563 166 KIPANPEYWLA-HLAARPQLRSCGSPPCASIGLPSSRGLPCRYISSSPST----KAVHKPPPLL 224 (614)
Q Consensus 166 ~~~~~~~~W~~-hlA~Hpqv~~~~~~~~~~~~~~~~~~~~~ry~~l~~~~----~~v~~~~pwg 224 (614)
..+...-+|.| .-+.++.++.+++ |||+.++.- +.|+.|+|-.
T Consensus 86 ~~~d~~~~Wepvr~g~~V~Lr~~~g----------------r~LRANG~~~~Wrn~VT~D~~~~ 133 (142)
T PF04601_consen 86 DRLDSSVEWEPVRDGFYVKLRHRSG----------------RYLRANGGYPPWRNSVTVDVPHR 133 (142)
T ss_pred ccCCCCceEEEecCCCEEEEEecCC----------------ceEEcCCCCCCCcceEEecCCCC
Confidence 22566789998 5555555555543 699887765 3777777754
No 8
>PF04601 DUF569: Protein of unknown function (DUF569); InterPro: IPR007679 This is a family of hypothetical proteins. Some family members contain two copies of the region.
Probab=98.69 E-value=3.7e-07 Score=86.03 Aligned_cols=101 Identities=23% Similarity=0.329 Sum_probs=78.9
Q ss_pred EEEEEECCCceEEEecCCeEEEecCCCCCCCceEEEEEcCCCCccEEEEEecCCcEEEEccC-------C-eE-EEeccC
Q psy15563 262 TIGLINAQFKYLTAETFGFKINANGASLKKKQVWILEPANGGDTLSVYLRSHLGKYLAVDSF-------G-NV-TCEADE 332 (614)
Q Consensus 262 ~V~L~s~~GkYLtae~~g~~V~An~~~l~~~E~F~Le~~~~~~~~~v~LRS~~GkYLs~~~~-------G-~V-~a~a~~ 332 (614)
-|.|+++.||||.|++.|..|.-.+........|.+|...+ +...|.|||+.||||++.+. | .| +.+.+.
T Consensus 9 ~VRLRS~~~kYL~ADeDg~~Vs~~~~~~s~na~W~Ve~v~~-~~~~v~L~saYGrYL~as~~~~~lG~~G~~v~Q~~~~~ 87 (142)
T PF04601_consen 9 HVRLRSHHGKYLHADEDGEGVSQDRRGASLNAAWTVERVPG-SPNYVRLRSAYGRYLAASDEPALLGHTGRRVVQTDPDR 87 (142)
T ss_pred EEEEEecCCCEEEEcCCCCeEEECCCCCCCcceEEEEEecC-CCCEEEEeeccCceEeccCCcCCCCCCCCEEEecCCcc
Confidence 58999999999999999999988888778889999998764 56799999999999999752 2 23 333334
Q ss_pred CCCCceEEEEEeCCCeEEEEECCCCeEEEeeCC
Q psy15563 333 KEPGSLFQIHVSDEGKWAFKNKTRGYFLGASSD 365 (614)
Q Consensus 333 ~g~~E~F~le~~~dGk~aLrs~~~G~yL~a~~G 365 (614)
....-.|+.+.. +..+.||+. +|+||.+.++
T Consensus 88 ~d~~~~Wepvr~-g~~V~Lr~~-~gr~LRANG~ 118 (142)
T PF04601_consen 88 LDSSVEWEPVRD-GFYVKLRHR-SGRYLRANGG 118 (142)
T ss_pred CCCCceEEEecC-CCEEEEEec-CCceEEcCCC
Confidence 444455555432 358999998 8999999875
No 9
>KOG3962|consensus
Probab=98.63 E-value=1.4e-07 Score=93.64 Aligned_cols=97 Identities=21% Similarity=0.300 Sum_probs=78.3
Q ss_pred EEEEEECCCceEEEecCC-eEEEecC---CCCCCCceEEEEEcCCCCccEEEEEecCCcEEEEccCCeEEEeccCCCCCc
Q psy15563 262 TIGLINAQFKYLTAETFG-FKINANG---ASLKKKQVWILEPANGGDTLSVYLRSHLGKYLAVDSFGNVTCEADEKEPGS 337 (614)
Q Consensus 262 ~V~L~s~~GkYLtae~~g-~~V~An~---~~l~~~E~F~Le~~~~~~~~~v~LRS~~GkYLs~~~~G~V~a~a~~~g~~E 337 (614)
.|++.-..|.||-+.+.| +.+.+.. ++..+.|+|.+...+ +.+|+|+|..||||+++++|.|++.++++++.|
T Consensus 49 ~v~ie~~~~~yl~a~dng~ft~g~ph~~~~gp~p~e~f~avki~---dsrIaLKsGyGKYlsinsdglvvg~qeAvG~~E 125 (246)
T KOG3962|consen 49 TVAIEIDDGTYLGAMDNGLFTLGAPHDEVDGPEPEEQFMAVKIS---DSRIALKSGYGKYLSINSDGLVVGRQEAVGSRE 125 (246)
T ss_pred EEEEEecCceEEEEEecCceeeccCCccccCCCchhhEEEEEcc---CceEEecccccceeeecCCccEEEehhhcCcHh
Confidence 355555678898877655 7776663 344577888887665 379999999999999999999999999999999
Q ss_pred eEEEEEeCCCeEEEEECCCCeEEEee
Q psy15563 338 LFQIHVSDEGKWAFKNKTRGYFLGAS 363 (614)
Q Consensus 338 ~F~le~~~dGk~aLrs~~~G~yL~a~ 363 (614)
+|..+++ .|+.+|.+. ++.|....
T Consensus 126 Qw~~vFq-~~r~a~~as-~s~~~~~~ 149 (246)
T KOG3962|consen 126 QWEPVFQ-EGRMALLAS-NSCFIRCN 149 (246)
T ss_pred hchhhhh-ccceEEeec-cceeEEec
Confidence 9999999 577888877 68877664
No 10
>KOG3962|consensus
Probab=98.46 E-value=1.6e-07 Score=93.28 Aligned_cols=79 Identities=13% Similarity=0.120 Sum_probs=56.9
Q ss_pred EEEEecCCCCceEEEEeecCCceEEecCCeeEEEecccCCCCceEEEEEcCCCeEEEEeeccCceeeecCC-eEEEEecC
Q psy15563 89 WILEPANGGDTLSVYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASSD-KLICTAKI 167 (614)
Q Consensus 89 W~l~~~~~~~~~~v~lkSh~GrYL~~d~dG~v~~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~~d-~~~c~~~~ 167 (614)
+.|-+.+. +.|.|||.+||||++++||.|+|.+|++|++|+|+++|+..+.|.|.|.++-+|+ .+. ++..-+++
T Consensus 86 f~avki~d---srIaLKsGyGKYlsinsdglvvg~qeAvG~~EQw~~vFq~~r~a~~as~s~~~~~--~e~~d~~va~kt 160 (246)
T KOG3962|consen 86 FMAVKISD---SRIALKSGYGKYLSINSDGLVVGRQEAVGSREQWEPVFQEGRMALLASNSCFIRC--NEAGDAEVASKT 160 (246)
T ss_pred EEEEEccC---ceEEecccccceeeecCCccEEEehhhcCcHhhchhhhhccceEEeeccceeEEe--chhhchhhhccc
Confidence 56666653 5899999999999999999999999999999999999995555555554444444 333 33333445
Q ss_pred CCCCc
Q psy15563 168 PANPE 172 (614)
Q Consensus 168 ~~~~~ 172 (614)
++..+
T Consensus 161 ~t~~~ 165 (246)
T KOG3962|consen 161 ATEEP 165 (246)
T ss_pred CCCCc
Confidence 55443
No 11
>smart00791 Agglutinin Amaranthus caudatus agglutinin or amaranthin is a lectin from the ancient South American crop, amaranth grain. Although its biological function is unknown, it has a high binding specificity for the methyl-glycoside of the T-antigen, found linked to serine or threonine residues of cell surface glycoproteins PUBMED:2271665. The protein is comprised of a homodimer, with each homodimer consisting of two beta-trefoil domains PUBMED:9334739.
Probab=97.95 E-value=0.00019 Score=66.97 Aligned_cols=112 Identities=14% Similarity=0.096 Sum_probs=80.0
Q ss_pred eEEEEEECCCceEEEecCC--eEEEecCC-CCCCCceEEEEEcCCCCccEEEEEe-cCCcEEEEccCCeEEEeccCC---
Q psy15563 261 WTIGLINAQFKYLTAETFG--FKINANGA-SLKKKQVWILEPANGGDTLSVYLRS-HLGKYLAVDSFGNVTCEADEK--- 333 (614)
Q Consensus 261 ~~V~L~s~~GkYLtae~~g--~~V~An~~-~l~~~E~F~Le~~~~~~~~~v~LRS-~~GkYLs~~~~G~V~a~a~~~--- 333 (614)
..|+|++.|||||.+..-+ .=+.-.++ -..+.-.|.++..+ .+.|.+|+ ++||||... +-=|.|+++++
T Consensus 6 ~~V~FKg~n~kYLry~~~~~~~~lqf~~ddI~dp~v~~ev~~~~---dg~V~ik~~~~nKfWr~s-~~WI~a~s~d~~e~ 81 (139)
T smart00791 6 IYVLFKGNNQKYLRYQSIQQYGLLQFSADKILDPLVQFEVFPTY---NGLVHIKSNYTNKFWRLS-HYWITADANDPDEN 81 (139)
T ss_pred EEEEEEcCCCceEEEEeecccceeEecccccCCcceeEEEEEcC---CCcEEEEecCCCceEccC-CCEEEecCCCCccC
Confidence 4699999999999977422 11222333 33466778888743 56999999 899999765 44577887777
Q ss_pred -CCCceEEEEEeCCCeEEEEECCCCeEEEee--CC----eEEEeecCCCC
Q psy15563 334 -EPGSLFQIHVSDEGKWAFKNKTRGYFLGAS--SD----KLICTAKIPAN 376 (614)
Q Consensus 334 -g~~E~F~le~~~dGk~aLrs~~~G~yL~a~--~G----~L~a~~~s~g~ 376 (614)
++.+.|..+..+++.++||+.++|+|+.-- +| -|.|..+++..
T Consensus 82 ~sscTLF~Pv~~d~~~i~lr~vq~~~~~~r~t~~~~~~~CL~A~~~~i~~ 131 (139)
T smart00791 82 KSACTLFRPLYVEMKKIRLLNVQLGHYTKRYTVGKSFVSCLNAASSQIDT 131 (139)
T ss_pred CCcccEEeEEeccCceEEEEEecCCceEEeecCCCCcccceeeccccccc
Confidence 346889999987789999999999998552 23 45555555544
No 12
>PF00167 FGF: Fibroblast growth factor; InterPro: IPR002348 The interleukin-1 (IL1) and heparin-binding growth factor (HBGF) families share low sequence similarity (about 25% []) but have very similar structures. Coupled with the Kunitz-type soybean trypsin inhibitors (STI), they form a structural superfamily. Despite their structural correspondence, however, they show no sequence similarity to the STI family. The crystal structures of interleukin-1 beta and HBGF1 have been solved, showing both families to have the same 12-stranded beta-sheet structure []; the beta-sheets are arranged in 3 similar lobes around a central axis, 6 strands forming an anti-parallel beta-barrel [, ]. The beta-sheets are generally well preserved and the crystal structures superimpose in these areas. The intervening loops are less well conserved - the loop between beta-strands 6 and 7 is slightly longer in interleukin-1 beta.; GO: 0008083 growth factor activity; PDB: 1AFC_F 1BAR_A 2P39_A 1EV2_D 2BFH_A 4FGF_A 1BAS_A 1BFG_A 1FQ9_B 1CVS_A ....
Probab=97.87 E-value=0.00061 Score=62.51 Aligned_cols=101 Identities=19% Similarity=0.242 Sum_probs=84.8
Q ss_pred EEEEEecCCcEEEEccCCeEEEeccCCCCCceEEEEEeCCCeEEEEECCCCeEEEeeC-CeEEEeecCCCCCccEEEEec
Q psy15563 307 SVYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASS-DKLICTAKIPANPEYWLAHLA 385 (614)
Q Consensus 307 ~v~LRS~~GkYLs~~~~G~V~a~a~~~g~~E~F~le~~~dGk~aLrs~~~G~yL~a~~-G~L~a~~~s~g~~E~F~v~La 385 (614)
.++|-+.+|+||.+.++|.|.++.+..++.+.|+++....|.+.||+...++||+.+. |.|.+.. .+.++..|.-++.
T Consensus 2 ~~~Ly~~~~~~L~i~~~g~V~gt~~~~~~~s~~~i~~~~~g~V~i~~~~s~~YLcmn~~G~ly~~~-~~~~~C~F~e~~~ 80 (122)
T PF00167_consen 2 HVQLYCRTGYFLQINPNGTVDGTGDDNSPYSVFEIHSVGFGVVRIRGVKSCRYLCMNKCGRLYGSK-NFNKDCVFREELL 80 (122)
T ss_dssp EEEEEETTSEEEEEETTSBEEEESSTTSTTGEEEEEEEETTEEEEEETTTTEEEEEBTTSBEEEES-SBTGGGEEEEEEE
T ss_pred CEEEEECCCeEEEECCCCeEeCCCCcCcceeEEEEEeccceEEEEEEecceEEEEECCCCeEcccc-ccCCCceEEEEEc
Confidence 4677787799999999999999999999999999999977999999999999999975 9999854 4556899998876
Q ss_pred cCceeEEeee--cceeeeeecCCCc
Q psy15563 386 ARPQVNLRSV--GRKRFAHLSENLD 408 (614)
Q Consensus 386 nrP~v~Lr~v--~~~~yv~ls~~~~ 408 (614)
..-...+.++ ++..|+.++.+|.
T Consensus 81 ~n~y~~~~s~~~~~~~yla~~~~G~ 105 (122)
T PF00167_consen 81 ENGYNTYESAKYGRGWYLAFNRRGK 105 (122)
T ss_dssp TTSEEEEEESTTGTTEBCEBCTTSB
T ss_pred cCCEEEEEeccCCccEEEEECCCCC
Confidence 5566667775 4677888877764
No 13
>PF07468 Agglutinin: Agglutinin; InterPro: IPR008998 Agglutinins are sugar-specific lectins that can agglutinate erythrocytes and other cell types. Lectins occur widely in plants, as well as some microorganisms and animal []. Agglutinin from Amaranthus caudatus (amaranthin) is a lectin from the ancient South American crop, amaranth grain. Although its biological function is unknown, it can agglutinate A, B and O red blood cells, and has a carbohydrate-binding site that is specific for the methyl-glycoside of the T-antigen found linked to serine or threonine residues of cell surface glycoproteins []. The protein is comprised of a homodimer, with each homodimer consisting of two beta-trefoil domains []. Lectin B chains from ricin and related toxins also contain beta-trefoil domain, however they are not related to agglutinin, showing little sequence similarity [].; PDB: 1JLY_B 1JLX_B.
Probab=97.66 E-value=0.0011 Score=63.24 Aligned_cols=120 Identities=18% Similarity=0.124 Sum_probs=78.0
Q ss_pred EEEEEECCCceEEEecCC----eEEEecC-CCCCCCceEEEEEcCCCCccEEEEEe-cCCcEEEEccCC--eEEEeccCC
Q psy15563 262 TIGLINAQFKYLTAETFG----FKINANG-ASLKKKQVWILEPANGGDTLSVYLRS-HLGKYLAVDSFG--NVTCEADEK 333 (614)
Q Consensus 262 ~V~L~s~~GkYLtae~~g----~~V~An~-~~l~~~E~F~Le~~~~~~~~~v~LRS-~~GkYLs~~~~G--~V~a~a~~~ 333 (614)
-|.++..|||||.+...+ .-+.-++ +-..+.-.|++|.+.. .++.|.+|+ ++||||.....+ =|.|+++++
T Consensus 7 ~V~fkg~N~kYLry~~e~~~~~~~LqF~~edi~dP~v~fev~~~~~-~dG~V~Ir~~y~nKfWrr~s~n~~WI~ada~~p 85 (153)
T PF07468_consen 7 YVAFKGDNGKYLRYRTEDIQQYGYLQFSGEDIGDPYVKFEVEPSKT-HDGLVHIRCCYNNKFWRRSSPNDYWIWADADDP 85 (153)
T ss_dssp CEEEETTTS-EEEEEESSCTTCCEEEEEESSTT-CCG-EEEEE-SS-TTT-EEEEETTTTEEEEESCCC--BEEEEESSH
T ss_pred EEEEEcCCCcEEEEEecccccceeEEecCCcCCCCceeEEEEEccc-CCCeEEEEeccCCceeEeCCCCCcEEEecCCCc
Confidence 356666899999987622 2243333 3345888999998433 367999999 899999975555 688987765
Q ss_pred CC------CceEEEEEeCC---CeEEEEECCCCeEEEe--eC----CeEEEeecCCCCCc--cEEE
Q psy15563 334 EP------GSLFQIHVSDE---GKWAFKNKTRGYFLGA--SS----DKLICTAKIPANPE--YWLA 382 (614)
Q Consensus 334 g~------~E~F~le~~~d---Gk~aLrs~~~G~yL~a--~~----G~L~a~~~s~g~~E--~F~v 382 (614)
.. .+.|+++.-++ ..++||+..+++|+.- .+ +-|.|.+++|.... .|+|
T Consensus 86 ~ed~s~~~cTLF~Pv~vd~~~~~~i~l~~~~n~~~~~r~t~~~~~~sCL~A~~~~i~~~~~~~~~~ 151 (153)
T PF07468_consen 86 DEDQSKPSCTLFEPVKVDVKDFNVIALRNMQNGHFCKRLTYGGKFVSCLNAASSSIDREAKDVFTI 151 (153)
T ss_dssp HH-TCSTCGG-EEEEESCCCETTEEEEEETTTTEEEEEE--STTBSSEEEEEESS--CCGTT-EEE
T ss_pred ccccCCCCceEEEEEEecCCCccEEEEEecCCceEEEEEccCCcceeeEeeccCcccccccceEEE
Confidence 52 45599987532 5799999999999844 22 45888888886654 5554
No 14
>cd00058 FGF Acidic and basic fibroblast growth factor family; FGFs are mitogens, which stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family plays essential roles in patterning and differentiation during vertebrate embryogenesis, and has neurotrophic activities. FGFs have a high affinity for heparan sulfate proteoglycans and require heparan sulfate to activate one of four cell surface FGF receptors. Upon binding to FGF, the receptors dimerize and their intracellular tyrosine kinase domains become active. FGFs have internal pseudo-threefold symmetry (beta-trefoil topology).
Probab=97.49 E-value=0.0026 Score=58.88 Aligned_cols=97 Identities=15% Similarity=0.112 Sum_probs=78.0
Q ss_pred EEecCCcEEEEccCCeEEEeccCCCCCceEEEEEeCCCeEEEEECCCCeEEEee-CCeEEEeecCCCCCccEEEEeccCc
Q psy15563 310 LRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGAS-SDKLICTAKIPANPEYWLAHLAARP 388 (614)
Q Consensus 310 LRS~~GkYLs~~~~G~V~a~a~~~g~~E~F~le~~~dGk~aLrs~~~G~yL~a~-~G~L~a~~~s~g~~E~F~v~LanrP 388 (614)
|-+.+|.||.+..+|.|.++.+..++...|+++....|.++||+...++||+.+ .|.|.+.. .+.++..|.-++...-
T Consensus 3 Ly~~~~~~L~I~~dG~V~Gt~~~~~~~s~l~~~s~~~g~v~i~~v~s~~YLCmn~~G~ly~s~-~~~~dC~F~E~~~~n~ 81 (123)
T cd00058 3 LYCRTGFHLQILPDGTVDGTRDDSSSYTILERIAVAVGVVSIKGVASCRYLCMNKCGKLYGSK-GFTEECLFREELLENN 81 (123)
T ss_pred EEEcCCeEEEEcCCCcEecccCCCCCCceEEEEECCCCEEEEEEcccceEEEECCCCCEEECC-CCCCCCEEEEEEccCC
Confidence 334449999999999999999988889999999987799999999999999997 48898877 7888899987766544
Q ss_pred eeEEeeec--ceeeeeecCCC
Q psy15563 389 QVNLRSVG--RKRFAHLSENL 407 (614)
Q Consensus 389 ~v~Lr~v~--~~~yv~ls~~~ 407 (614)
.-..++.- +.-|+.++.+|
T Consensus 82 Y~~y~S~~~~~~~ylal~~~G 102 (123)
T cd00058 82 YNTYASAKYRRRWYLALNKKG 102 (123)
T ss_pred cEEEEEcccCCCcEEEECCCC
Confidence 55555542 34677776554
No 15
>PF00167 FGF: Fibroblast growth factor; InterPro: IPR002348 The interleukin-1 (IL1) and heparin-binding growth factor (HBGF) families share low sequence similarity (about 25% []) but have very similar structures. Coupled with the Kunitz-type soybean trypsin inhibitors (STI), they form a structural superfamily. Despite their structural correspondence, however, they show no sequence similarity to the STI family. The crystal structures of interleukin-1 beta and HBGF1 have been solved, showing both families to have the same 12-stranded beta-sheet structure []; the beta-sheets are arranged in 3 similar lobes around a central axis, 6 strands forming an anti-parallel beta-barrel [, ]. The beta-sheets are generally well preserved and the crystal structures superimpose in these areas. The intervening loops are less well conserved - the loop between beta-strands 6 and 7 is slightly longer in interleukin-1 beta.; GO: 0008083 growth factor activity; PDB: 1AFC_F 1BAR_A 2P39_A 1EV2_D 2BFH_A 4FGF_A 1BAS_A 1BFG_A 1FQ9_B 1CVS_A ....
Probab=97.42 E-value=0.0046 Score=56.74 Aligned_cols=101 Identities=14% Similarity=0.238 Sum_probs=81.2
Q ss_pred EEEEECCCceEEEecCCeEEEecCCCCCCCceEEEEEcCCCCccEEEEEe-cCCcEEEEccCCeEEEeccCCCCCceEEE
Q psy15563 263 IGLINAQFKYLTAETFGFKINANGASLKKKQVWILEPANGGDTLSVYLRS-HLGKYLAVDSFGNVTCEADEKEPGSLFQI 341 (614)
Q Consensus 263 V~L~s~~GkYLtae~~g~~V~An~~~l~~~E~F~Le~~~~~~~~~v~LRS-~~GkYLs~~~~G~V~a~a~~~g~~E~F~l 341 (614)
+.|-+.+|+||.+.. ...|.+.++.-.+...|.++... .+.|.||+ ..++||+.++.|.|.+. ..+.+++.|.-
T Consensus 3 ~~Ly~~~~~~L~i~~-~g~V~gt~~~~~~~s~~~i~~~~---~g~V~i~~~~s~~YLcmn~~G~ly~~-~~~~~~C~F~e 77 (122)
T PF00167_consen 3 VQLYCRTGYFLQINP-NGTVDGTGDDNSPYSVFEIHSVG---FGVVRIRGVKSCRYLCMNKCGRLYGS-KNFNKDCVFRE 77 (122)
T ss_dssp EEEEETTSEEEEEET-TSBEEEESSTTSTTGEEEEEEEE---TTEEEEEETTTTEEEEEBTTSBEEEE-SSBTGGGEEEE
T ss_pred EEEEECCCeEEEECC-CCeEeCCCCcCcceeEEEEEecc---ceEEEEEEecceEEEEECCCCeEccc-cccCCCceEEE
Confidence 557777799999875 34787777777888999999764 36999999 68999999999999996 44666999999
Q ss_pred EEeCCCeEEEEECCC--CeEEEee-CCeEE
Q psy15563 342 HVSDEGKWAFKNKTR--GYFLGAS-SDKLI 368 (614)
Q Consensus 342 e~~~dGk~aLrs~~~--G~yL~a~-~G~L~ 368 (614)
++.+++-..+.+..+ +.||+.+ .|+..
T Consensus 78 ~~~~n~y~~~~s~~~~~~~yla~~~~G~~~ 107 (122)
T PF00167_consen 78 ELLENGYNTYESAKYGRGWYLAFNRRGKPR 107 (122)
T ss_dssp EEETTSEEEEEESTTGTTEBCEBCTTSBBE
T ss_pred EEccCCEEEEEeccCCccEEEEECCCCCCC
Confidence 888778888888854 8888886 46543
No 16
>smart00442 FGF Acidic and basic fibroblast growth factor family. Mitogens that stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family play essential roles in patterning and differentiation during vertebrate embryogenesis, and have neurotrophic activities.
Probab=97.39 E-value=0.0067 Score=56.44 Aligned_cols=100 Identities=16% Similarity=0.118 Sum_probs=78.3
Q ss_pred EEEEEecCCcEEEEccCCeEEEeccCCCCCceEEEEEeCCCeEEEEECCCCeEEEee-CCeEEEeecCCCCCccEEEEec
Q psy15563 307 SVYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGAS-SDKLICTAKIPANPEYWLAHLA 385 (614)
Q Consensus 307 ~v~LRS~~GkYLs~~~~G~V~a~a~~~g~~E~F~le~~~dGk~aLrs~~~G~yL~a~-~G~L~a~~~s~g~~E~F~v~La 385 (614)
.++|-+.+|.||.+.++|.|.++.++..+...|++...+.|.+.||+...++||+.+ .|.|.+... +.++..|.=++.
T Consensus 4 ~~~Ly~~~~~~L~I~~~G~V~Gt~~~~~~~~ile~~s~~~g~V~ik~~~s~~YLCmn~~G~ly~s~~-~~~dC~F~E~~~ 82 (126)
T smart00442 4 LRQLYCRNGQHLQILPDGTVDGTRDESSSFTILEIIAVAVGVVAIKGVASCRYLCMNKCGKLYGSKN-FTEDCVFREEME 82 (126)
T ss_pred EEEEEeCCCeEEEEcCCceEecccCCCCcceEEEEEeccCCEEEEEEcccceEEEECCCCCEEEccc-CCCCcEEEEEec
Confidence 567777788999999999999999988999999999987799999999999999997 489988665 678888886655
Q ss_pred cCceeEEeeecce--eeeeecCCC
Q psy15563 386 ARPQVNLRSVGRK--RFAHLSENL 407 (614)
Q Consensus 386 nrP~v~Lr~v~~~--~yv~ls~~~ 407 (614)
..-.-..++.-.+ -|+.++.+|
T Consensus 83 ~n~y~~y~S~~~~~~~ylal~~~G 106 (126)
T smart00442 83 ENGYNTYASAKYRKRWYVALNKKG 106 (126)
T ss_pred cCCeEEEEEcccCCceEEEECCCC
Confidence 4344444443222 466665554
No 17
>PF14200 RicinB_lectin_2: Ricin-type beta-trefoil lectin domain-like; PDB: 2X2S_C 2X2T_A 2VSE_B 2VSA_A 3EF2_A 2IHO_A 3HZB_H 1YBI_B 3PHZ_A 3NBE_A ....
Probab=97.13 E-value=0.0075 Score=53.36 Aligned_cols=92 Identities=20% Similarity=0.301 Sum_probs=67.8
Q ss_pred CCceEEEEEcCCCCccEEEEEe-cCCcEEEEccCC-----eEEEeccCCCCCceEEEEEeCCCeEEEEECCCCeEEEeeC
Q psy15563 291 KKQVWILEPANGGDTLSVYLRS-HLGKYLAVDSFG-----NVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASS 364 (614)
Q Consensus 291 ~~E~F~Le~~~~~~~~~v~LRS-~~GkYLs~~~~G-----~V~a~a~~~g~~E~F~le~~~dGk~aLrs~~~G~yL~a~~ 364 (614)
+-|.|++....+ ..+.+.|++ ..|+||.+..+. .|+.-...-.+.++|.|+...+|.+.|++...|++|.+.+
T Consensus 1 ~nQ~W~~~~~~~-~~g~Y~i~n~~sg~~L~v~~~~~~~g~~v~~~~~~~~~~Q~W~i~~~~~g~y~I~n~~s~~~Ldv~~ 79 (105)
T PF14200_consen 1 DNQQWTFTPVGD-SDGYYKIRNVNSGKYLDVAGGSTANGTNVQQWTCNGNDNQQWKIEPVGDGYYRIRNKNSGKVLDVAG 79 (105)
T ss_dssp CGGEEEEEEEET-TTTEEEEEETTTTEEEEEGCTTCSTTEBEEEEESSSSGGGEEEEEESTTSEEEEEETSTTEEEEEGG
T ss_pred CCCEEEEEEecC-CCCEEEEEECCCCCEEEeCCCCcCCCcEEEEecCCCCcCcEEEEEEecCCeEEEEECCCCcEEEECC
Confidence 358899998642 357899999 799999998643 4555444447899999999988999999999999998752
Q ss_pred -----C-eEEEeec-CCCCCccEEEE
Q psy15563 365 -----D-KLICTAK-IPANPEYWLAH 383 (614)
Q Consensus 365 -----G-~L~a~~~-s~g~~E~F~v~ 383 (614)
| .|..-.. .-++..+|.|+
T Consensus 80 ~~~~~g~~v~~~~~~~~~~~Q~W~l~ 105 (105)
T PF14200_consen 80 GSTANGTNVQQWEYDNGSDNQQWKLE 105 (105)
T ss_dssp GSSSTTEBEEEEE-STSSGGGEEEEE
T ss_pred CCCCCCCEEEEEeCCCCCccCEEEeC
Confidence 3 3433332 45667777764
No 18
>smart00791 Agglutinin Amaranthus caudatus agglutinin or amaranthin is a lectin from the ancient South American crop, amaranth grain. Although its biological function is unknown, it has a high binding specificity for the methyl-glycoside of the T-antigen, found linked to serine or threonine residues of cell surface glycoproteins PUBMED:2271665. The protein is comprised of a homodimer, with each homodimer consisting of two beta-trefoil domains PUBMED:9334739.
Probab=97.13 E-value=0.0057 Score=57.26 Aligned_cols=97 Identities=18% Similarity=0.185 Sum_probs=75.9
Q ss_pred EEEEEecCCcEEEEcc---CCeEEEeccCCCC-CceEEEEEeCCCeEEEEECCCCeEEEeeCCeEEEeecCCC----CCc
Q psy15563 307 SVYLRSHLGKYLAVDS---FGNVTCEADEKEP-GSLFQIHVSDEGKWAFKNKTRGYFLGASSDKLICTAKIPA----NPE 378 (614)
Q Consensus 307 ~v~LRS~~GkYLs~~~---~G~V~a~a~~~g~-~E~F~le~~~dGk~aLrs~~~G~yL~a~~G~L~a~~~s~g----~~E 378 (614)
-|.||+-||+||.... .+-|.+.++.+.+ -..|+++...||.+.+|+.-+|+|=..+..=+.|.++.+. ...
T Consensus 7 ~V~FKg~n~kYLry~~~~~~~~lqf~~ddI~dp~v~~ev~~~~dg~V~ik~~~~nKfWr~s~~WI~a~s~d~~e~~sscT 86 (139)
T smart00791 7 YVLFKGNNQKYLRYQSIQQYGLLQFSADKILDPLVQFEVFPTYNGLVHIKSNYTNKFWRLSHYWITADANDPDENKSACT 86 (139)
T ss_pred EEEEEcCCCceEEEEeecccceeEecccccCCcceeEEEEEcCCCcEEEEecCCCceEccCCCEEEecCCCCccCCCccc
Confidence 5899999999999875 4569999998766 7899999998999999999899998777777888887772 345
Q ss_pred cEEEEeccCceeEEeeecceeeeee
Q psy15563 379 YWLAHLAARPQVNLRSVGRKRFAHL 403 (614)
Q Consensus 379 ~F~v~LanrP~v~Lr~v~~~~yv~l 403 (614)
+|..-....-.+.||+++...|++.
T Consensus 87 LF~Pv~~d~~~i~lr~vq~~~~~~r 111 (139)
T smart00791 87 LFRPLYVEMKKIRLLNVQLGHYTKR 111 (139)
T ss_pred EEeEEeccCceEEEEEecCCceEEe
Confidence 6665444566677777766666653
No 19
>cd00058 FGF Acidic and basic fibroblast growth factor family; FGFs are mitogens, which stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family plays essential roles in patterning and differentiation during vertebrate embryogenesis, and has neurotrophic activities. FGFs have a high affinity for heparan sulfate proteoglycans and require heparan sulfate to activate one of four cell surface FGF receptors. Upon binding to FGF, the receptors dimerize and their intracellular tyrosine kinase domains become active. FGFs have internal pseudo-threefold symmetry (beta-trefoil topology).
Probab=96.97 E-value=0.0044 Score=57.42 Aligned_cols=76 Identities=16% Similarity=0.134 Sum_probs=66.5
Q ss_pred EEEeecCCceEEecCCeeEEEecccCCCCceEEEEEcCCCeEEEEeeccCceeeecCCeEEEEecCCCCCcceEEEEc
Q psy15563 102 VYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASSDKLICTAKIPANPEYWLAHLA 179 (614)
Q Consensus 102 v~lkSh~GrYL~~d~dG~v~~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~~d~~~c~~~~~~~~~~W~~hlA 179 (614)
++.|+ |.||..++||+|.++.+..++...|++.....|..+||++.+++||..+..-..-.++..++...|..++.
T Consensus 3 Ly~~~--~~~L~I~~dG~V~Gt~~~~~~~s~l~~~s~~~g~v~i~~v~s~~YLCmn~~G~ly~s~~~~~dC~F~E~~~ 78 (123)
T cd00058 3 LYCRT--GFHLQILPDGTVDGTRDDSSSYTILERIAVAVGVVSIKGVASCRYLCMNKCGKLYGSKGFTEECLFREELL 78 (123)
T ss_pred EEEcC--CeEEEEcCCCcEecccCCCCCCceEEEEECCCCEEEEEEcccceEEEECCCCCEEECCCCCCCCEEEEEEc
Confidence 44555 99999999999999999999999999999989999999999999999977765555558888899988775
No 20
>PF14200 RicinB_lectin_2: Ricin-type beta-trefoil lectin domain-like; PDB: 2X2S_C 2X2T_A 2VSE_B 2VSA_A 3EF2_A 2IHO_A 3HZB_H 1YBI_B 3PHZ_A 3NBE_A ....
Probab=96.88 E-value=0.014 Score=51.72 Aligned_cols=88 Identities=20% Similarity=0.320 Sum_probs=66.3
Q ss_pred eEEEEecCCCCceEEEEee-cCCceEEecCC----e-eEEEecccCCCCceEEEEEcCCCeEEEEeeccCceeeecC---
Q psy15563 88 VWILEPANGGDTLSVYLRS-HLGKYLAVDSF----G-NVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASS--- 158 (614)
Q Consensus 88 vW~l~~~~~~~~~~v~lkS-h~GrYL~~d~d----G-~v~~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~~--- 158 (614)
.|++.+.++. ...+.|++ +-|+||.++.. | +|......-.+..+|.|+...||.|.|++...+++|...+
T Consensus 4 ~W~~~~~~~~-~g~Y~i~n~~sg~~L~v~~~~~~~g~~v~~~~~~~~~~Q~W~i~~~~~g~y~I~n~~s~~~Ldv~~~~~ 82 (105)
T PF14200_consen 4 QWTFTPVGDS-DGYYKIRNVNSGKYLDVAGGSTANGTNVQQWTCNGNDNQQWKIEPVGDGYYRIRNKNSGKVLDVAGGST 82 (105)
T ss_dssp EEEEEEEETT-TTEEEEEETTTTEEEEEGCTTCSTTEBEEEEESSSSGGGEEEEEESTTSEEEEEETSTTEEEEEGGGSS
T ss_pred EEEEEEecCC-CCEEEEEECCCCCEEEeCCCCcCCCcEEEEecCCCCcCcEEEEEEecCCeEEEEECCCCcEEEECCCCC
Confidence 8999997632 23699998 59999999864 3 4666666568999999999999999999988899998742
Q ss_pred ---CeEEEEec-CCCCCcceEE
Q psy15563 159 ---DKLICTAK-IPANPEYWLA 176 (614)
Q Consensus 159 ---d~~~c~~~-~~~~~~~W~~ 176 (614)
..|..... .-++..+|.+
T Consensus 83 ~~g~~v~~~~~~~~~~~Q~W~l 104 (105)
T PF14200_consen 83 ANGTNVQQWEYDNGSDNQQWKL 104 (105)
T ss_dssp STTEBEEEEE-STSSGGGEEEE
T ss_pred CCCCEEEEEeCCCCCccCEEEe
Confidence 24555444 5556666754
No 21
>PF07468 Agglutinin: Agglutinin; InterPro: IPR008998 Agglutinins are sugar-specific lectins that can agglutinate erythrocytes and other cell types. Lectins occur widely in plants, as well as some microorganisms and animal []. Agglutinin from Amaranthus caudatus (amaranthin) is a lectin from the ancient South American crop, amaranth grain. Although its biological function is unknown, it can agglutinate A, B and O red blood cells, and has a carbohydrate-binding site that is specific for the methyl-glycoside of the T-antigen found linked to serine or threonine residues of cell surface glycoproteins []. The protein is comprised of a homodimer, with each homodimer consisting of two beta-trefoil domains []. Lectin B chains from ricin and related toxins also contain beta-trefoil domain, however they are not related to agglutinin, showing little sequence similarity [].; PDB: 1JLY_B 1JLX_B.
Probab=96.86 E-value=0.012 Score=56.39 Aligned_cols=100 Identities=14% Similarity=0.149 Sum_probs=64.6
Q ss_pred eEEEEcCCCCeEeeeC----CCceEEeccc-CCCCceeEEEEecCCcEEEEEeeCCceEEEEeceEEEEe-CCCCeEEec
Q psy15563 490 QIALRDNSGAYLSPIG----SKAILKTRSN-TVTKDELFTLEDSLPQASFIAALNSKYVSVKQGQIALRD-NSGAYLSPI 563 (614)
Q Consensus 490 ~valr~~nGkyL~~~g----~~g~L~a~~~-~~~~~elF~le~~~pqv~l~s~~n~kyVs~~~G~~~~R~-~~gkyw~~~ 563 (614)
.|++|..|||||+... .-+-|+..++ -.+.-..|+++.+. + ..|..++|. .+||||...
T Consensus 7 ~V~fkg~N~kYLry~~e~~~~~~~LqF~~edi~dP~v~fev~~~~---------~------~dG~V~Ir~~y~nKfWrr~ 71 (153)
T PF07468_consen 7 YVAFKGDNGKYLRYRTEDIQQYGYLQFSGEDIGDPYVKFEVEPSK---------T------HDGLVHIRCCYNNKFWRRS 71 (153)
T ss_dssp CEEEETTTS-EEEEEESSCTTCCEEEEEESSTT-CCG-EEEEE-S---------S------TTT-EEEEETTTTEEEEES
T ss_pred EEEEEcCCCcEEEEEecccccceeEEecCCcCCCCceeEEEEEcc---------c------CCCeEEEEeccCCceeEeC
Confidence 6888999999999864 3466887554 44566778887643 1 124456776 479999865
Q ss_pred cCC--ceEeecCCCCC------CCccEEEEEcC----CeEEEEE-cCCCCeEEEee
Q psy15563 564 GSK--AILKTRSNTVT------KDELFTLEDSL----PQASFIA-ALNSKYVSVKQ 606 (614)
Q Consensus 564 ~~g--~v~a~~~~~~~------~~~~F~ie~~~----~~v~lik-a~NGkYl~~k~ 606 (614)
+.+ -|.| .++.++ +.++|.....+ ..|+| | ..||.|.+--.
T Consensus 72 s~n~~WI~a-da~~p~ed~s~~~cTLF~Pv~vd~~~~~~i~l-~~~~n~~~~~r~t 125 (153)
T PF07468_consen 72 SPNDYWIWA-DADDPDEDQSKPSCTLFEPVKVDVKDFNVIAL-RNMQNGHFCKRLT 125 (153)
T ss_dssp CCC--BEEE-EESSHHH-TCSTCGG-EEEEESCCCETTEEEE-EETTTTEEEEEE-
T ss_pred CCCCcEEEe-cCCCcccccCCCCceEEEEEEecCCCccEEEE-EecCCceEEEEEc
Confidence 555 4778 544333 56679998764 67888 6 68999987543
No 22
>smart00442 FGF Acidic and basic fibroblast growth factor family. Mitogens that stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family play essential roles in patterning and differentiation during vertebrate embryogenesis, and have neurotrophic activities.
Probab=96.39 E-value=0.026 Score=52.50 Aligned_cols=78 Identities=17% Similarity=0.107 Sum_probs=65.6
Q ss_pred EEEeecCCceEEecCCeeEEEecccCCCCceEEEEEcCCCeEEEEeeccCceeeecCCeEEEEecCCCCCcceEEEEc
Q psy15563 102 VYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASSDKLICTAKIPANPEYWLAHLA 179 (614)
Q Consensus 102 v~lkSh~GrYL~~d~dG~v~~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~~d~~~c~~~~~~~~~~W~~hlA 179 (614)
+.|=+..|.||..++||.|.++.+..++..-|++...+.|..+||++.+++||..+..--.-.+...+++..|..++.
T Consensus 5 ~~Ly~~~~~~L~I~~~G~V~Gt~~~~~~~~ile~~s~~~g~V~ik~~~s~~YLCmn~~G~ly~s~~~~~dC~F~E~~~ 82 (126)
T smart00442 5 RQLYCRNGQHLQILPDGTVDGTRDESSSFTILEIIAVAVGVVAIKGVASCRYLCMNKCGKLYGSKNFTEDCVFREEME 82 (126)
T ss_pred EEEEeCCCeEEEEcCCceEecccCCCCcceEEEEEeccCCEEEEEEcccceEEEECCCCCEEEcccCCCCcEEEEEec
Confidence 445556679999999999999999999999999999988999999999999999977655555566777777777654
No 23
>KOG3885|consensus
Probab=95.12 E-value=0.079 Score=51.10 Aligned_cols=74 Identities=22% Similarity=0.298 Sum_probs=62.6
Q ss_pred EEEEEecCCcEEEEccCCeEEEeccCCCCCceEEEEEeCCCeEEEEECCCCeEEEee-CCeEEEeecCCCCCccEEE
Q psy15563 307 SVYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGAS-SDKLICTAKIPANPEYWLA 382 (614)
Q Consensus 307 ~v~LRS~~GkYLs~~~~G~V~a~a~~~g~~E~F~le~~~dGk~aLrs~~~G~yL~a~-~G~L~a~~~s~g~~E~F~v 382 (614)
+.+.|+. |.+|.+.++|.|..++++-++...++|+...=|.|+||....++||+.+ .|+|.+. ....++..|.=
T Consensus 29 ~LY~~t~-g~hLqi~p~g~V~Gt~~~~s~~siLei~sv~~GvV~IkGV~s~~YL~Mn~~G~LygS-~~~t~eC~F~E 103 (155)
T KOG3885|consen 29 LLYCRNG-GHFLRILPDGTVDGTRDRSDQHTIFEIITVAVGVVAIKGVESELYLAMNKEGKLYAS-KEFTEECKFKE 103 (155)
T ss_pred EEEEcCC-CEEEEEcCCCccccccccCCCceeEEEEEeeecEEEEEEeeceeEEEECCCCcEecC-CCCCccceeEE
Confidence 3444445 9999999999999999999999999998875599999999999999997 6999998 66667777763
No 24
>PF05270 AbfB: Alpha-L-arabinofuranosidase B (ABFB); InterPro: IPR007934 This family consists of several fungal alpha-L-arabinofuranosidase B proteins. L-Arabinose is a constituent of plant cell wall polysaccharides. It is found in a polymeric form in L-arabinan, in which the backbone is formed by 1,5-a- linked l-arabinose residues that can be branched via 1,2-a- and 1,3-a-linked l-arabinofuranose side chains. AbfB hydrolyses 1,5-a, 1,3-a and 1,2-a linkages in both oligosaccharides and polysaccharides, which contain terminal non-reducing l-arabinofuranoses in side chains [].; GO: 0046556 alpha-N-arabinofuranosidase activity, 0046373 L-arabinose metabolic process; PDB: 3AKI_A 3AKF_A 3AKH_A 3AKG_A 2D44_A 1WD3_A 2D43_A 1WD4_A 3KMV_E.
Probab=94.24 E-value=1.4 Score=41.96 Aligned_cols=108 Identities=18% Similarity=0.206 Sum_probs=70.7
Q ss_pred EEEEEE--CCCceEEEecCCeEEEecCCCCC----CCceEEEEEcCCCCccEEEEEecC--CcEEEEccCCeEEEeccCC
Q psy15563 262 TIGLIN--AQFKYLTAETFGFKINANGASLK----KKQVWILEPANGGDTLSVYLRSHL--GKYLAVDSFGNVTCEADEK 333 (614)
Q Consensus 262 ~V~L~s--~~GkYLtae~~g~~V~An~~~l~----~~E~F~Le~~~~~~~~~v~LRS~~--GkYLs~~~~G~V~a~a~~~ 333 (614)
.+.|.+ ..++||.-..+...+.+...... ...+|.+..--. +.+.|.|.|.+ |.||.. .++.|.-.+...
T Consensus 2 ~~~~~s~~~~~ryirh~~~~~~~~~v~~~s~~~~r~da~f~vvpGLa-~~~~vSfES~~~PG~yLrh-~~~~v~l~~~d~ 79 (142)
T PF05270_consen 2 SLRLTSPNYPDRYIRHRGSLVRLDPVSSSSSALDRADATFRVVPGLA-DSSCVSFESVNYPGYYLRH-SNFRVRLEKNDG 79 (142)
T ss_dssp EEEEEESSSTTEEEEEETTEEEEEES-SSGGHHHHHGG-EEEEE-SS--TTCEEEEESSSTTEEEEE-ETTEEEEEE--S
T ss_pred eEEEECCCCCCeEEEEcCceEEEeeccCCcchhhccCceEEEEEccC-CCCEEEEEECCCCCcEEEE-ECCEEEEeecCC
Confidence 456666 47999998765566644444332 467899986322 35689999977 999998 566766665544
Q ss_pred C----CCceEEEEEe--CCCeEEEEECC-CCeEEEeeCCeEEEee
Q psy15563 334 E----PGSLFQIHVS--DEGKWAFKNKT-RGYFLGASSDKLICTA 371 (614)
Q Consensus 334 g----~~E~F~le~~--~dGk~aLrs~~-~G~yL~a~~G~L~a~~ 371 (614)
+ ...+|.++.- ..|.++|++.+ -|+||.-.++.|....
T Consensus 80 s~~F~~dATF~~~~Gl~~~g~~sfeS~n~Pg~ylrh~~~~l~l~~ 124 (142)
T PF05270_consen 80 SALFREDATFCPRPGLAGPGYVSFESYNYPGRYLRHYNGELYLAP 124 (142)
T ss_dssp SHHHHHHT-EEEEE-SSSTTEEEEEESSSTTEEEEEETTEEEEEE
T ss_pred CccccCCceEEEecCCCCCCcceEEEecCCCeEEEEECCEEEEec
Confidence 4 3788998885 23778999885 4788988877665543
No 25
>KOG3885|consensus
Probab=90.42 E-value=1.1 Score=43.27 Aligned_cols=61 Identities=25% Similarity=0.396 Sum_probs=53.4
Q ss_pred EEEEeecCCceEEecCCeeEEEecccCCCCceEEEEEcCCCeEEEEeeccCceeeecCC-eEE
Q psy15563 101 SVYLRSHLGKYLAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASSD-KLI 162 (614)
Q Consensus 101 ~v~lkSh~GrYL~~d~dG~v~~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~~d-~~~ 162 (614)
..+.++. |.+|-+.+||.|..++|.-++..-++|++-.=|..+||.+..++||..+.+ .|.
T Consensus 29 ~LY~~t~-g~hLqi~p~g~V~Gt~~~~s~~siLei~sv~~GvV~IkGV~s~~YL~Mn~~G~Ly 90 (155)
T KOG3885|consen 29 LLYCRNG-GHFLRILPDGTVDGTRDRSDQHTIFEIITVAVGVVAIKGVESELYLAMNKEGKLY 90 (155)
T ss_pred EEEEcCC-CEEEEEcCCCccccccccCCCceeEEEEEeeecEEEEEEeeceeEEEECCCCcEe
Confidence 3566666 999999999999999999999999999988679999999999999998655 444
No 26
>PF05270 AbfB: Alpha-L-arabinofuranosidase B (ABFB); InterPro: IPR007934 This family consists of several fungal alpha-L-arabinofuranosidase B proteins. L-Arabinose is a constituent of plant cell wall polysaccharides. It is found in a polymeric form in L-arabinan, in which the backbone is formed by 1,5-a- linked l-arabinose residues that can be branched via 1,2-a- and 1,3-a-linked l-arabinofuranose side chains. AbfB hydrolyses 1,5-a, 1,3-a and 1,2-a linkages in both oligosaccharides and polysaccharides, which contain terminal non-reducing l-arabinofuranoses in side chains [].; GO: 0046556 alpha-N-arabinofuranosidase activity, 0046373 L-arabinose metabolic process; PDB: 3AKI_A 3AKF_A 3AKH_A 3AKG_A 2D44_A 1WD3_A 2D43_A 1WD4_A 3KMV_E.
Probab=81.41 E-value=12 Score=35.79 Aligned_cols=106 Identities=19% Similarity=0.203 Sum_probs=66.5
Q ss_pred EecCCceeeeeecCeeEEeccccccccccccccceecceeecccccccccccceeEEEEecCCCCceEEEEeecC--Cce
Q psy15563 34 INAQFKYLTAETFGFKINANGASLKKKQFKYLTAETFGFKINANGASLKKKQDGVWILEPANGGDTLSVYLRSHL--GKY 111 (614)
Q Consensus 34 in~~~~YLtaE~fgfkvna~~~slkkkQ~~~~~~~~~~~~~~~~~~~~~~~q~~vW~l~~~~~~~~~~v~lkSh~--GrY 111 (614)
.|..++||..-.+--.+-+..++.. -.-|||..|++.+--. +...|+|.|.. |.|
T Consensus 8 ~~~~~ryirh~~~~~~~~~v~~~s~----------------------~~~r~da~f~vvpGLa-~~~~vSfES~~~PG~y 64 (142)
T PF05270_consen 8 PNYPDRYIRHRGSLVRLDPVSSSSS----------------------ALDRADATFRVVPGLA-DSSCVSFESVNYPGYY 64 (142)
T ss_dssp SSSTTEEEEEETTEEEEEES-SSGG----------------------HHHHHGG-EEEEE-SS--TTCEEEEESSSTTEE
T ss_pred CCCCCeEEEEcCceEEEeeccCCcc----------------------hhhccCceEEEEEccC-CCCEEEEEECCCCCcE
Confidence 4667788877655555444433332 1223345899987433 34579999986 999
Q ss_pred EEecCCeeEEEecccCC----CCceEEEEEc--CCCeEEEEeec-cCceeeecCCeEEE
Q psy15563 112 LAVDSFGNVTCEADEKE----PGSLFQIHVS--DEGKWAFKNKT-RGYFLGASSDKLIC 163 (614)
Q Consensus 112 L~~d~dG~v~~~~e~~~----~~~~F~i~~~--~dG~waLqs~~-~g~yl~~~~d~~~c 163 (614)
|-. .++.|.-+....+ .+..|.++.- ..|-.+|+|.. .|+||--.+..|..
T Consensus 65 Lrh-~~~~v~l~~~d~s~~F~~dATF~~~~Gl~~~g~~sfeS~n~Pg~ylrh~~~~l~l 122 (142)
T PF05270_consen 65 LRH-SNFRVRLEKNDGSALFREDATFCPRPGLAGPGYVSFESYNYPGRYLRHYNGELYL 122 (142)
T ss_dssp EEE-ETTEEEEEE--SSHHHHHHT-EEEEE-SSSTTEEEEEESSSTTEEEEEETTEEEE
T ss_pred EEE-ECCEEEEeecCCCccccCCceEEEecCCCCCCcceEEEecCCCeEEEEECCEEEE
Confidence 988 7777777666544 6789999875 44777999965 57888776665543
No 27
>cd00161 RICIN Ricin-type beta-trefoil; Carbohydrate-binding domain formed from presumed gene triplication. The domain is found in a variety of molecules serving diverse functions such as enzymatic activity, inhibitory toxicity and signal transduction. Highly specific ligand binding occurs on exposed surfaces of the compact domain sturcture.
Probab=64.32 E-value=91 Score=26.59 Aligned_cols=85 Identities=15% Similarity=0.133 Sum_probs=46.4
Q ss_pred CCceEEEecC--CeEEEec-CCCCCCCceEEEEEcCCCCccEEEEEecCCcEEEEccC---CeEEE-eccCCCCCceEEE
Q psy15563 269 QFKYLTAETF--GFKINAN-GASLKKKQVWILEPANGGDTLSVYLRSHLGKYLAVDSF---GNVTC-EADEKEPGSLFQI 341 (614)
Q Consensus 269 ~GkYLtae~~--g~~V~An-~~~l~~~E~F~Le~~~~~~~~~v~LRS~~GkYLs~~~~---G~V~a-~a~~~g~~E~F~l 341 (614)
.++.|.+... +..|... -....+.|.|.+.. .+.++++. .+++|.+... ..|+. .-....+..+|.+
T Consensus 9 ~~~cL~~~~~~~~~~v~~~~c~~~~~~Q~W~~~~-----~g~~~~~~-~~~Cl~~~~~~~~~~~~~~~c~~~~~~Q~W~~ 82 (124)
T cd00161 9 TGLCLDVNGGSDGGPVQLYPCHGNGNNQKWTLTS-----DGTIRIKS-SNLCLDVGGDAPGSKVRLYTCSGGSDNQRWTF 82 (124)
T ss_pred CCeEEECCCCCCCCEEEEEECCCCCccCCEEEeC-----CCeEEEcC-CCeEEcccCCCCCCEEEEEECCCCCcCCEEEE
Confidence 4666665532 2333211 11114688999875 34677776 7888887654 33433 2222356788887
Q ss_pred EEeCCCeEEEEECCCCeEEEee
Q psy15563 342 HVSDEGKWAFKNKTRGYFLGAS 363 (614)
Q Consensus 342 e~~~dGk~aLrs~~~G~yL~a~ 363 (614)
+.. +++ ++...+.-|.+.
T Consensus 83 ~~~--~~i--~~~~~~~cl~~~ 100 (124)
T cd00161 83 NKD--GTI--RNLKSGKCLDVK 100 (124)
T ss_pred CCC--cEE--EECCCCeEEeCC
Confidence 643 444 333346666554
No 28
>PF03498 CDtoxinA: Cytolethal distending toxin A/C family; InterPro: IPR003558 Escherichia coli, Haemophilus spp and Campylobacter spp. all produce a toxin that is seen to cause distension in certain cell lines [, ], which eventually disintegrate and die. This novel toxin, termed cytolethal distending toxin (cdt), has three subunits: A, B and C. Their sizes are approx. 27.7, 29.5 and 19.9kDa respectively [], and they appear to be entirely novel []. Further research on the complete toxin has revealed that it blocks the cell cycle at stage G2, through inactivation of the cyclin-dependent kinase Cdk1, and without induction of DNA breaks. This leads to multipolar abortive mitosis and micronucleation, associated with centrosomal amplification []. The roles of each subunit are unclear, but it is believed that they have separate roles in pathogenicity. This entry represents the A and C subunits.; GO: 0009405 pathogenesis; PDB: 2F2F_A 1SR4_C.
Probab=63.51 E-value=21 Score=34.40 Aligned_cols=78 Identities=21% Similarity=0.309 Sum_probs=53.0
Q ss_pred EEEEeec-CCceEEecCCeeEE---EecccCCCCceEEEEEcCCCeEEEEeeccCceeeecCCe-------E---EE-Ee
Q psy15563 101 SVYLRSH-LGKYLAVDSFGNVT---CEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASSDK-------L---IC-TA 165 (614)
Q Consensus 101 ~v~lkSh-~GrYL~~d~dG~v~---~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~~d~-------~---~c-~~ 165 (614)
.|.||.. -|.=|.++++|.+. |+.+ .....|.|++...|.+.|||...+.=|...... + .| .+
T Consensus 53 ~Vqf~n~~~~~CL~~~~~G~~~~~~C~~~--~~~q~F~iiPtttgAVQIks~~~~~Cl~~~~~~~~~~~~~i~l~~Cd~~ 130 (150)
T PF03498_consen 53 YVQFVNPKTGTCLAAYGNGVFHYKSCDQD--NLEQVFSIIPTTTGAVQIKSLSTGECLQTFNNSRTPIYYSIGLTPCDKS 130 (150)
T ss_dssp EEEEEETTTSEEEEEETTCEEEE--TTTC--HGHH-EEEEEBTTS-EEEEETTT--EEEE-STTSS-SSEEEEEE---S-
T ss_pred EEEEEcCCCCcceeecCCCeEeecccCCC--ChhceEEEEEcCCCcEEEEecCCCceEEecCCCceeEEeeEEeeeCCCc
Confidence 7999998 78899999999977 5555 667899999999999999998888666663322 1 25 34
Q ss_pred cCCCCCcceEEEEcc
Q psy15563 166 KIPANPEYWLAHLAA 180 (614)
Q Consensus 166 ~~~~~~~~W~~hlA~ 180 (614)
..+...++|.+-.++
T Consensus 131 ~~~~l~~lw~itPP~ 145 (150)
T PF03498_consen 131 KEINLDQLWFITPPL 145 (150)
T ss_dssp EEETGGGEEEEE--S
T ss_pred CCCCHHHcEEEcCCc
Confidence 455667888876653
No 29
>KOG1407|consensus
Probab=58.61 E-value=2.5e+02 Score=29.88 Aligned_cols=21 Identities=33% Similarity=0.608 Sum_probs=16.8
Q ss_pred ceEEEE-cCCCCeEeeeCCCceE
Q psy15563 489 GQIALR-DNSGAYLSPIGSKAIL 510 (614)
Q Consensus 489 g~valr-~~nGkyL~~~g~~g~L 510 (614)
+.|+|+ +..|||++. |++-.|
T Consensus 191 nCicI~f~p~GryfA~-GsADAl 212 (313)
T KOG1407|consen 191 NCICIEFDPDGRYFAT-GSADAL 212 (313)
T ss_pred ceEEEEECCCCceEee-ccccce
Confidence 789999 899999985 765444
No 30
>PF03498 CDtoxinA: Cytolethal distending toxin A/C family; InterPro: IPR003558 Escherichia coli, Haemophilus spp and Campylobacter spp. all produce a toxin that is seen to cause distension in certain cell lines [, ], which eventually disintegrate and die. This novel toxin, termed cytolethal distending toxin (cdt), has three subunits: A, B and C. Their sizes are approx. 27.7, 29.5 and 19.9kDa respectively [], and they appear to be entirely novel []. Further research on the complete toxin has revealed that it blocks the cell cycle at stage G2, through inactivation of the cyclin-dependent kinase Cdk1, and without induction of DNA breaks. This leads to multipolar abortive mitosis and micronucleation, associated with centrosomal amplification []. The roles of each subunit are unclear, but it is believed that they have separate roles in pathogenicity. This entry represents the A and C subunits.; GO: 0009405 pathogenesis; PDB: 2F2F_A 1SR4_C.
Probab=52.51 E-value=77 Score=30.61 Aligned_cols=57 Identities=23% Similarity=0.256 Sum_probs=42.4
Q ss_pred ccEEEEEe-cCCcEEEEccCCeEE---EeccCCCCCceEEEEEeCCCeEEEEECCCCeEEEee
Q psy15563 305 TLSVYLRS-HLGKYLAVDSFGNVT---CEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGAS 363 (614)
Q Consensus 305 ~~~v~LRS-~~GkYLs~~~~G~V~---a~a~~~g~~E~F~le~~~dGk~aLrs~~~G~yL~a~ 363 (614)
.+.|+|+. ..|..|.+..+|.+. |+.+ .....|+|.+-..|-+-||+..+|.=|+..
T Consensus 51 ~g~Vqf~n~~~~~CL~~~~~G~~~~~~C~~~--~~~q~F~iiPtttgAVQIks~~~~~Cl~~~ 111 (150)
T PF03498_consen 51 FGYVQFVNPKTGTCLAAYGNGVFHYKSCDQD--NLEQVFSIIPTTTGAVQIKSLSTGECLQTF 111 (150)
T ss_dssp TCEEEEEETTTSEEEEEETTCEEEE--TTTC--HGHH-EEEEEBTTS-EEEEETTT--EEEE-
T ss_pred CCEEEEEcCCCCcceeecCCCeEeecccCCC--ChhceEEEEEcCCCcEEEEecCCCceEEec
Confidence 47999999 688899998888776 5433 457899999997799999999888766664
No 31
>KOG0308|consensus
Probab=47.66 E-value=43 Score=39.19 Aligned_cols=63 Identities=22% Similarity=0.372 Sum_probs=48.3
Q ss_pred EEEEeec--CCceEEecCCee--EEEecccC-------CCCceEEEEEcCCCeEEEEeecc--CceeeecCCeEEE
Q psy15563 101 SVYLRSH--LGKYLAVDSFGN--VTCEADEK-------EPGSLFQIHVSDEGKWAFKNKTR--GYFLGASSDKLIC 163 (614)
Q Consensus 101 ~v~lkSh--~GrYL~~d~dG~--v~~~~e~~-------~~~~~F~i~~~~dG~waLqs~~~--g~yl~~~~d~~~c 163 (614)
...||.| +=|=|..+.||. |++++++. -.+|.+.+++|.+|-|||++.+. --|.|+....|.+
T Consensus 206 imkLrGHTdNVr~ll~~dDGt~~ls~sSDgtIrlWdLgqQrCl~T~~vH~e~VWaL~~~~sf~~vYsG~rd~~i~~ 281 (735)
T KOG0308|consen 206 IMKLRGHTDNVRVLLVNDDGTRLLSASSDGTIRLWDLGQQRCLATYIVHKEGVWALQSSPSFTHVYSGGRDGNIYR 281 (735)
T ss_pred eeeeeccccceEEEEEcCCCCeEeecCCCceEEeeeccccceeeeEEeccCceEEEeeCCCcceEEecCCCCcEEe
Confidence 5788888 668899999995 77777753 36788999999999999999653 2466665555555
No 32
>KOG2243|consensus
Probab=46.80 E-value=40 Score=42.56 Aligned_cols=204 Identities=15% Similarity=0.097 Sum_probs=0.0
Q ss_pred EEeeccCceeee-----cCCeEEEE-----ecCCCCCcceEEEEcccce-----eeeccCCCCcccCCCCCCCCceeeee
Q psy15563 145 FKNKTRGYFLGA-----SSDKLICT-----AKIPANPEYWLAHLAARPQ-----LRSCGSPPCASIGLPSSRGLPCRYIS 209 (614)
Q Consensus 145 Lqs~~~g~yl~~-----~~d~~~c~-----~~~~~~~~~W~~hlA~Hpq-----v~~~~~~~~~~~~~~~~~~~~~ry~~ 209 (614)
|+-.-.|-||.. +..+-.+| ..+.+++.+|++|+|-.-. |++=. |+-.....-.||||
T Consensus 122 lrhs~sgmyl~clstsrs~tdklafdvgl~e~~tgeacwwtihpaskqrsegekvrigd-------dlilvsvsserylh 194 (5019)
T KOG2243|consen 122 LRHSFSGMYLCCLSTSRSQTDKLAFDVGLQEDATGEACWWTIHPASKQRSEGEKVRIGD-------DLILVSVSSERYLH 194 (5019)
T ss_pred hhccccceEEEEecccccccccceeeccccccCcCceeeEEecccccccccCceeeecC-------ceEEEEecccceEE
Q ss_pred e--CCCCCcccCCCCCcccceEEEEEEeecCccceeecCCCccccCCCCccceeEEEEEE-CCCceEEEecCCeEEEecC
Q psy15563 210 S--SPSTKAVHKPPPLLKRSVFPLIMNIMNGTNGHHENGNGELNGVGFSSKGVWTIGLIN-AQFKYLTAETFGFKINANG 286 (614)
Q Consensus 210 l--~~~~~~v~~~~pwg~d~l~tl~~~~~~~~~~~~~~~~~~L~~~G~~~~~~~~V~L~s-~~GkYLtae~~g~~V~An~ 286 (614)
| +...++||...-++-+.|-|.---..-.++.++-|+--+|-..-..+|.++--+=.. ...|-+--|.++..+.|..
T Consensus 195 ls~~ng~i~vdaaf~qtlw~v~p~~sgs~~~egyligg~v~rl~hghmdecltips~d~~d~q~r~i~ye~g~~~~~a~s 274 (5019)
T KOG2243|consen 195 LSVGNGNIHVDAAFMQTLWNVAPICSGSEAAEGYLIGGDVLRLFHGHMDECLTIPSGDHGDEQHRRIFYEAGAAGTHARS 274 (5019)
T ss_pred EEecCCceeecHHHHHHHhccCCCcCCchhhhceeecchHHHHhccCccceeecCCCCccchhheeeeeeccccccchhh
Q ss_pred CCCCCCceEEEEEcCCCC------ccEEEEEe-cCCcEEEEccCCeEEEeccCCCC--CceEEEEEe-------------
Q psy15563 287 ASLKKKQVWILEPANGGD------TLSVYLRS-HLGKYLAVDSFGNVTCEADEKEP--GSLFQIHVS------------- 344 (614)
Q Consensus 287 ~~l~~~E~F~Le~~~~~~------~~~v~LRS-~~GkYLs~~~~G~V~a~a~~~g~--~E~F~le~~------------- 344 (614)
- |.+=.|-.. |. +..++||- ..|+||++..|-.++--..+..+ .+.|.....
T Consensus 275 l----wr~e~lria--wsgs~irwgq~frlrh~ttg~ylal~edk~lil~dk~kad~kstafsfraskelkekld~~~kk 348 (5019)
T KOG2243|consen 275 L----WRLEPLRIA--WSGSHIRWGQAFRLRHLTTGHYLALMEDKGLILMDKAKADTKSTAFSFRASKELKEKLDSGHKK 348 (5019)
T ss_pred h----hcccceeEE--eccCccccCcceeeeeccccceeeeeccCCceeechhhcccccceeEeeccHHHHHhhcccccc
Q ss_pred ------------CCCeEEEEECCCCeEEE
Q psy15563 345 ------------DEGKWAFKNKTRGYFLG 361 (614)
Q Consensus 345 ------------~dGk~aLrs~~~G~yL~ 361 (614)
+|.-+-+|+...|-||+
T Consensus 349 didgmg~~eikygdsvc~iqhi~sglw~t 377 (5019)
T KOG2243|consen 349 DIDGMGTPEIKYGDSVCFIQHIASGLWLT 377 (5019)
T ss_pred ccccCCCcceecCceeeeeeeeccceeEE
No 33
>PF00652 Ricin_B_lectin: Ricin-type beta-trefoil lectin domain; InterPro: IPR000772 Ricin is a legume lectin from the seeds of the castor bean plant, Ricinus communis. The seeds are poisonous to people, animals and insects and just one milligram of ricin can kill an adult. Primary structure analysis has shown the presence of a similar domain in many carbohydrate-recognition proteins like plant and bacterial AB-toxins, glycosidases or proteases [, , ]. This domain, known as the ricin B lectin domain, can be present in one or more copies and has been shown in some instance to bind simple sugars, such as galactose or lactose. The ricin B lectin domain is composed of three homologous subdomains of 40 amino acids (alpha, beta and gamma) and a linker peptide of around 15 residues (lambda). It has been proposed that the ricin B lectin domain arose by gene triplication from a primitive 40 residue galactoside-binding peptide [, ]. The most characteristic, though not completely conserved, sequence feature is the presence of a Q-W pattern. Consequently, the ricin B lectin domain as also been refered as the (QxW)3 domain and the three homologous regions as the QxW repeats [, ]. A disulphide bond is also conserved in some of the QxW repeats []. The 3D structure of the ricin B chain has shown that the three QxW repeats pack around a pseudo threefold axis that is stabilised by the lambda linker []. The ricin B lectin domain has no major segments of a helix or beta sheet but each of the QxW repeats contains an omega loop []. An idealized omega-loop is a compact, contiguous segment of polypeptide that traces a 'loop-shaped' path in three-dimensional space; the main chain resembles a Greek omega.; PDB: 2VLC_B 3A22_B 3A21_B 3A23_B 1GGP_B 1VCL_A 2Z48_B 2Z49_A 2D7R_A 2D7I_A ....
Probab=43.94 E-value=1.5e+02 Score=25.59 Aligned_cols=93 Identities=16% Similarity=0.216 Sum_probs=50.5
Q ss_pred EEEEEECCCceEEEe---cCCeEEEecCCCCCCCceEEEEEcCCCCccEEEEEecCCcEEEEccCC---eEEE-eccCCC
Q psy15563 262 TIGLINAQFKYLTAE---TFGFKINANGASLKKKQVWILEPANGGDTLSVYLRSHLGKYLAVDSFG---NVTC-EADEKE 334 (614)
Q Consensus 262 ~V~L~s~~GkYLtae---~~g~~V~An~~~l~~~E~F~Le~~~~~~~~~v~LRS~~GkYLs~~~~G---~V~a-~a~~~g 334 (614)
+|.+++.+++.|++. ..+..|....-.-.+.|.|.+... +.+.........|.+..+. .|.- .-+.-.
T Consensus 4 ~i~~~~~~~~cl~~~~~~~~~~~v~l~~c~~~~~Q~w~~~~~-----~~i~~~~~~~~CL~~~~~~~~~~i~l~~C~~~~ 78 (124)
T PF00652_consen 4 YIRNVNKSGLCLDVQGSTKNGSPVVLYPCDGSDNQLWRFDPD-----GQIRSNNNPNLCLDVDGSSPGTKIVLWPCDSNS 78 (124)
T ss_dssp EEEEEEGGGEEEEEGGSSSTTTBEEEEE--SSGGGEEEEETT-----SBEEETTETTEEEEESSSSTTEBEEEEETTTTG
T ss_pred EEEEeeCCCCeEEEcCCCCCCCEEEEEECCCCCceeEEEcCC-----CceeeccCcceEEEeeccCCCceEEEeeccCCc
Confidence 345555577888877 334444222111156799988743 2344433444578777665 3333 333333
Q ss_pred CCceEEEEEeCCCeEEEEECCCCeEEEee
Q psy15563 335 PGSLFQIHVSDEGKWAFKNKTRGYFLGAS 363 (614)
Q Consensus 335 ~~E~F~le~~~dGk~aLrs~~~G~yL~a~ 363 (614)
..+.|.+.. + ..|++...++.|...
T Consensus 79 ~~Q~W~~~~--~--~~i~n~~s~~cL~~~ 103 (124)
T PF00652_consen 79 SNQRWKFDP--D--GRIRNKNSGLCLDVK 103 (124)
T ss_dssp GGGBEEEET--T--SBEEETTTTEEEEEG
T ss_pred cCCeEEEcC--C--eeEEeCCCCEEEEec
Confidence 347888765 3 345555467888775
No 34
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=40.15 E-value=4.6e+02 Score=28.81 Aligned_cols=196 Identities=15% Similarity=0.211 Sum_probs=0.0
Q ss_pred ecCCceEEecCCee----EEEecccCCCCceEEEEEcCCC-----------eEEEEeeccCceeee---cCCeEE-----
Q psy15563 106 SHLGKYLAVDSFGN----VTCEADEKEPGSLFQIHVSDEG-----------KWAFKNKTRGYFLGA---SSDKLI----- 162 (614)
Q Consensus 106 Sh~GrYL~~d~dG~----v~~~~e~~~~~~~F~i~~~~dG-----------~waLqs~~~g~yl~~---~~d~~~----- 162 (614)
++.||||.+-.++. |.=-.+.-+...-=.++.+..+ ..+..+ +.|+||-. -.|++.
T Consensus 97 d~~g~~vf~AnY~~g~v~v~p~~~dG~l~~~v~~~~h~g~~p~~rQ~~~h~H~a~~t-P~~~~l~v~DLG~Dri~~y~~~ 175 (346)
T COG2706 97 DEDGRFVFVANYHSGSVSVYPLQADGSLQPVVQVVKHTGSGPHERQESPHVHSANFT-PDGRYLVVPDLGTDRIFLYDLD 175 (346)
T ss_pred CCCCCEEEEEEccCceEEEEEcccCCccccceeeeecCCCCCCccccCCccceeeeC-CCCCEEEEeecCCceEEEEEcc
Q ss_pred ----EEecCCCCCcceEE-EEcccceeeeccCCCCcccCCCCCCCCceeee--eeCCCCCcccCCCCCcccceEEEEEEe
Q psy15563 163 ----CTAKIPANPEYWLA-HLAARPQLRSCGSPPCASIGLPSSRGLPCRYI--SSSPSTKAVHKPPPLLKRSVFPLIMNI 235 (614)
Q Consensus 163 ----c~~~~~~~~~~W~~-hlA~Hpqv~~~~~~~~~~~~~~~~~~~~~ry~--~l~~~~~~v~~~~pwg~d~l~tl~~~~ 235 (614)
+.+....-..---| ||++||.-... |+ +|+..-.....+..-|.-..+.-+-.+
T Consensus 176 dg~L~~~~~~~v~~G~GPRHi~FHpn~k~a-------------------Y~v~EL~stV~v~~y~~~~g~~~~lQ~i~tl 236 (346)
T COG2706 176 DGKLTPADPAEVKPGAGPRHIVFHPNGKYA-------------------YLVNELNSTVDVLEYNPAVGKFEELQTIDTL 236 (346)
T ss_pred cCccccccccccCCCCCcceEEEcCCCcEE-------------------EEEeccCCEEEEEEEcCCCceEEEeeeeccC
Q ss_pred ecCccceeecCCCccccCCCCccceeEEEEEECCCceEEEecCC------eEEEecCCCCCCCceEEEEEcCCCCccEEE
Q psy15563 236 MNGTNGHHENGNGELNGVGFSSKGVWTIGLINAQFKYLTAETFG------FKINANGASLKKKQVWILEPANGGDTLSVY 309 (614)
Q Consensus 236 ~~~~~~~~~~~~~~L~~~G~~~~~~~~V~L~s~~GkYLtae~~g------~~V~An~~~l~~~E~F~Le~~~~~~~~~v~ 309 (614)
.++.-|--.+..--++.|| |||=|.+-| ++|..++..+.--+.+..|-..+
T Consensus 237 P~dF~g~~~~aaIhis~dG----------------rFLYasNRg~dsI~~f~V~~~~g~L~~~~~~~teg~~P------- 293 (346)
T COG2706 237 PEDFTGTNWAAAIHISPDG----------------RFLYASNRGHDSIAVFSVDPDGGKLELVGITPTEGQFP------- 293 (346)
T ss_pred ccccCCCCceeEEEECCCC----------------CEEEEecCCCCeEEEEEEcCCCCEEEEEEEeccCCcCC-------
Q ss_pred EEecCCcEEEEccCCeEEEeccCCCCC-ceEEEEEeCCCeEEE
Q psy15563 310 LRSHLGKYLAVDSFGNVTCEADEKEPG-SLFQIHVSDEGKWAF 351 (614)
Q Consensus 310 LRS~~GkYLs~~~~G~V~a~a~~~g~~-E~F~le~~~dGk~aL 351 (614)
|-...+++|.+..-+.+.+++ ..|.+... -|++.+
T Consensus 294 ------R~F~i~~~g~~Liaa~q~sd~i~vf~~d~~-TG~L~~ 329 (346)
T COG2706 294 ------RDFNINPSGRFLIAANQKSDNITVFERDKE-TGRLTL 329 (346)
T ss_pred ------ccceeCCCCCEEEEEccCCCcEEEEEEcCC-CceEEe
No 35
>COG0211 RpmA Ribosomal protein L27 [Translation, ribosomal structure and biogenesis]
Probab=33.94 E-value=1.1e+02 Score=26.79 Aligned_cols=61 Identities=23% Similarity=0.401 Sum_probs=35.7
Q ss_pred CCCCCcceEEEEcCCCCeEeeeCCCceEEecccCC---------CCceeEEEEecCCcEEEEEeeCCceEEEE
Q psy15563 483 IRSPATGQIALRDNSGAYLSPIGSKAILKTRSNTV---------TKDELFTLEDSLPQASFIAALNSKYVSVK 546 (614)
Q Consensus 483 ~~~~~~g~valr~~nGkyL~~~g~~g~L~a~~~~~---------~~~elF~le~~~pqv~l~s~~n~kyVs~~ 546 (614)
.+|+++-+|-.|-..|.++.+ .+=+++.+.+.+ .+++||.|.+-.-.+..++. +.|||||.
T Consensus 13 gRDS~~krLGvK~~~Gq~v~a--G~IivRQRGTk~hpG~NVG~GkDhTlFAl~dG~Vkf~~k~~-~rk~vsV~ 82 (87)
T COG0211 13 GRDSESKRLGVKKFGGQFVKA--GSIIVRQRGTKFHPGVNVGRGKDHTLFALVDGVVKFETKGK-NRKYVSVV 82 (87)
T ss_pred CccccccccceeeeCCeEEec--ccEEEEeccccccCCcccccCCCceEEEeeccEEEEEEccC-CccEEEEE
Confidence 455666777777777777765 222344444432 26789999774423333443 66777764
No 36
>cd00161 RICIN Ricin-type beta-trefoil; Carbohydrate-binding domain formed from presumed gene triplication. The domain is found in a variety of molecules serving diverse functions such as enzymatic activity, inhibitory toxicity and signal transduction. Highly specific ligand binding occurs on exposed surfaces of the compact domain sturcture.
Probab=30.93 E-value=3.3e+02 Score=23.01 Aligned_cols=59 Identities=19% Similarity=0.329 Sum_probs=33.6
Q ss_pred eEEEEecCCCCceEEEEeecCCceEEecCC---eeEEE-ecccCCCCceEEEEEcCCCeEEEEeeccCceeee
Q psy15563 88 VWILEPANGGDTLSVYLRSHLGKYLAVDSF---GNVTC-EADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGA 156 (614)
Q Consensus 88 vW~l~~~~~~~~~~v~lkSh~GrYL~~d~d---G~v~~-~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~ 156 (614)
.|++.. .+ .+++++ .++.|.++.. ..|.. .....++.-+|.+.. +|++ .+...+.-|..
T Consensus 37 ~W~~~~-~g----~~~~~~-~~~Cl~~~~~~~~~~~~~~~c~~~~~~Q~W~~~~--~~~i--~~~~~~~cl~~ 99 (124)
T cd00161 37 KWTLTS-DG----TIRIKS-SNLCLDVGGDAPGSKVRLYTCSGGSDNQRWTFNK--DGTI--RNLKSGKCLDV 99 (124)
T ss_pred CEEEeC-CC----eEEEcC-CCeEEcccCCCCCCEEEEEECCCCCcCCEEEECC--CcEE--EECCCCeEEeC
Confidence 899987 32 477776 7888888764 23332 222235667777763 3443 33234444444
No 37
>KOG1446|consensus
Probab=30.35 E-value=6.7e+02 Score=27.16 Aligned_cols=6 Identities=67% Similarity=1.154 Sum_probs=3.0
Q ss_pred CCcEEe
Q psy15563 442 NNKYLS 447 (614)
Q Consensus 442 nGkYL~ 447 (614)
++|||+
T Consensus 89 dNkylR 94 (311)
T KOG1446|consen 89 DNKYLR 94 (311)
T ss_pred cCceEE
Confidence 455554
No 38
>PF00652 Ricin_B_lectin: Ricin-type beta-trefoil lectin domain; InterPro: IPR000772 Ricin is a legume lectin from the seeds of the castor bean plant, Ricinus communis. The seeds are poisonous to people, animals and insects and just one milligram of ricin can kill an adult. Primary structure analysis has shown the presence of a similar domain in many carbohydrate-recognition proteins like plant and bacterial AB-toxins, glycosidases or proteases [, , ]. This domain, known as the ricin B lectin domain, can be present in one or more copies and has been shown in some instance to bind simple sugars, such as galactose or lactose. The ricin B lectin domain is composed of three homologous subdomains of 40 amino acids (alpha, beta and gamma) and a linker peptide of around 15 residues (lambda). It has been proposed that the ricin B lectin domain arose by gene triplication from a primitive 40 residue galactoside-binding peptide [, ]. The most characteristic, though not completely conserved, sequence feature is the presence of a Q-W pattern. Consequently, the ricin B lectin domain as also been refered as the (QxW)3 domain and the three homologous regions as the QxW repeats [, ]. A disulphide bond is also conserved in some of the QxW repeats []. The 3D structure of the ricin B chain has shown that the three QxW repeats pack around a pseudo threefold axis that is stabilised by the lambda linker []. The ricin B lectin domain has no major segments of a helix or beta sheet but each of the QxW repeats contains an omega loop []. An idealized omega-loop is a compact, contiguous segment of polypeptide that traces a 'loop-shaped' path in three-dimensional space; the main chain resembles a Greek omega.; PDB: 2VLC_B 3A22_B 3A21_B 3A23_B 1GGP_B 1VCL_A 2Z48_B 2Z49_A 2D7R_A 2D7I_A ....
Probab=27.93 E-value=3.9e+02 Score=22.92 Aligned_cols=61 Identities=15% Similarity=0.288 Sum_probs=38.2
Q ss_pred eEEEEecCCCCceEEEEeecCCceEEecCCe---eEEEeccc-CCCCceEEEEEcCCCeEEEEeeccCceeeec
Q psy15563 88 VWILEPANGGDTLSVYLRSHLGKYLAVDSFG---NVTCEADE-KEPGSLFQIHVSDEGKWAFKNKTRGYFLGAS 157 (614)
Q Consensus 88 vW~l~~~~~~~~~~v~lkSh~GrYL~~d~dG---~v~~~~e~-~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~ 157 (614)
.|.+.... .++.......-|.++..+ .|.-..-. .....+|.+.. ...|++...++.|...
T Consensus 39 ~w~~~~~~-----~i~~~~~~~~CL~~~~~~~~~~i~l~~C~~~~~~Q~W~~~~----~~~i~n~~s~~cL~~~ 103 (124)
T PF00652_consen 39 LWRFDPDG-----QIRSNNNPNLCLDVDGSSPGTKIVLWPCDSNSSNQRWKFDP----DGRIRNKNSGLCLDVK 103 (124)
T ss_dssp EEEEETTS-----BEEETTETTEEEEESSSSTTEBEEEEETTTTGGGGBEEEET----TSBEEETTTTEEEEEG
T ss_pred eEEEcCCC-----ceeeccCcceEEEeeccCCCceEEEeeccCCccCCeEEEcC----CeeEEeCCCCEEEEec
Confidence 99888752 356555566688888777 33333333 33337888875 5556665567777774
No 39
>KOG0649|consensus
Probab=26.37 E-value=8.1e+02 Score=26.07 Aligned_cols=52 Identities=25% Similarity=0.296 Sum_probs=29.7
Q ss_pred EEEEEECCCceEE-EecCCeEEEecCCCCCCCceEEEEEcCCCCccEEEEEecCCcEEEE
Q psy15563 262 TIGLINAQFKYLT-AETFGFKINANGASLKKKQVWILEPANGGDTLSVYLRSHLGKYLAV 320 (614)
Q Consensus 262 ~V~L~s~~GkYLt-ae~~g~~V~An~~~l~~~E~F~Le~~~~~~~~~v~LRS~~GkYLs~ 320 (614)
.|.+|++++.-|| +|++.++|=..++ .+++=.+|.- ...-.||-++||++.+
T Consensus 161 ~vv~R~~~~qilsG~EDGtvRvWd~kt---~k~v~~ie~y----k~~~~lRp~~g~wiga 213 (325)
T KOG0649|consen 161 SVVGRNANGQILSGAEDGTVRVWDTKT---QKHVSMIEPY----KNPNLLRPDWGKWIGA 213 (325)
T ss_pred eeeecccCcceeecCCCccEEEEeccc---cceeEEeccc----cChhhcCcccCceeEE
Confidence 6888999999998 4443355432222 2233334321 2245677778887665
No 40
>PF02751 TFIIA_gamma_C: Transcription initiation factor IIA, gamma subunit; InterPro: IPR015871 Transcription factor IIA (TFIIA) is one of several factors that form part of a transcription pre-initiation complex along with RNA polymerase II, the TATA-box-binding protein (TBP) and TBP-associated factors, on the TATA-box sequence upstream of the initiation start site. After initiation, some components of the pre-initiation complex (including TFIIA) remain attached and re-initiate a subsequent round of transcription. TFIIA binds to TBP to stabilise TBP binding to the TATA element. TFIIA also inhibits the cytokine HMGB1 (high mobility group 1 protein) binding to TBP [], and can dissociate HMGB1 already bound to TBP/TATA-box. Human and Drosophila TFIIA have three subunits: two large subunits, LN/alpha and LC/beta, derived from the same gene, and a small subunit, S/gamma. Yeast TFIIA has two subunits: a large TOA1 subunit that shows sequence similarity to the N-terminal of LN/alpha and the C-terminal of LC/beta, and a small subunit, TOA2 that is highly homologous with S/gamma. The conserved regions of the large and small subunits of TFIIA combine to form two domains: a four-helix bundle (helical domain) composed of two helices from each of the N-terminal regions of TOA1 and TOA2 in yeast; and a beta-barrel (beta-barrel domain) composed of beta-sheets from the C-terminal regions of TOA1 and TOA2 []. This entry represents the beta-barrel domain found at the C-terminal of the gamma subunit of transcription factor TFIIA. ; GO: 0006367 transcription initiation from RNA polymerase II promoter, 0005672 transcription factor TFIIA complex; PDB: 1NVP_D 1RM1_B 1YTF_D 1NH2_D.
Probab=24.96 E-value=1.2e+02 Score=24.28 Aligned_cols=33 Identities=24% Similarity=0.464 Sum_probs=25.6
Q ss_pred ceeEEEEecCCCCceEEEEee-cCCceEEecCCeeEEEec
Q psy15563 86 DGVWILEPANGGDTLSVYLRS-HLGKYLAVDSFGNVTCEA 124 (614)
Q Consensus 86 ~~vW~l~~~~~~~~~~v~lkS-h~GrYL~~d~dG~v~~~~ 124 (614)
|.|||.... .+.||- -.+.++.+|+--.|.|++
T Consensus 19 DnVWTFi~k------n~~fk~~~~~~~~~~dkvKIVACd~ 52 (52)
T PF02751_consen 19 DNVWTFILK------NVEFKMEDNNETVKVDKVKIVACDS 52 (52)
T ss_dssp TTEEEEEEE------EEEEEEE-SSEEEEEEEEEEEEEES
T ss_pred CcEEEEEEc------CEEEEEecCCcEEEcCeEEEEEcCC
Confidence 569999874 377876 688899999888888874
No 41
>smart00458 RICIN Ricin-type beta-trefoil. Carbohydrate-binding domain formed from presumed gene triplication.
Probab=22.01 E-value=5e+02 Score=22.09 Aligned_cols=63 Identities=17% Similarity=0.326 Sum_probs=34.9
Q ss_pred CCCceEEEEEcCCCCccEEEEEecCCcEEEEccCC--eEEE-eccCCCCCceEEEEEeCCCeEEEEECCCCeEEEee
Q psy15563 290 KKKQVWILEPANGGDTLSVYLRSHLGKYLAVDSFG--NVTC-EADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGAS 363 (614)
Q Consensus 290 ~~~E~F~Le~~~~~~~~~v~LRS~~GkYLs~~~~G--~V~a-~a~~~g~~E~F~le~~~dGk~aLrs~~~G~yL~a~ 363 (614)
+..|.|.+.. .+.+.++ .++.|.+.... .|.. .-+...+..+|.+.. +|. |++...+..|...
T Consensus 28 ~~~Q~w~~~~-----~g~~~~~--~~~Cl~~~~~~~~~v~l~~c~~~~~~Q~W~~~~--~~~--i~~~~~~~cl~~~ 93 (117)
T smart00458 28 GGNQLWKLTS-----DGAIRIA--TDLCLTANGNTGSTVTLYSCDGDADNQYWTVNK--DGT--IRNPDSGLCLDVK 93 (117)
T ss_pred CccceEEEeC-----CCeEEec--CCccCccCCCCCCEEEEEECCCCCcCCEEEECC--Cee--EEeCCCCEEEecC
Confidence 4568998873 3456665 67777765432 3322 222225667777654 343 4444467777654
No 42
>COG5402 Uncharacterized conserved protein [Function unknown]
Probab=21.85 E-value=1.8e+02 Score=28.98 Aligned_cols=105 Identities=13% Similarity=0.209 Sum_probs=72.9
Q ss_pred ccccccceeEEEEecCCCCceEEEEeec---CCceEEecCCee-EEEecccCC----CCceEEEE--EcCCCeEEEEeec
Q psy15563 80 SLKKKQDGVWILEPANGGDTLSVYLRSH---LGKYLAVDSFGN-VTCEADEKE----PGSLFQIH--VSDEGKWAFKNKT 149 (614)
Q Consensus 80 ~~~~~q~~vW~l~~~~~~~~~~v~lkSh---~GrYL~~d~dG~-v~~~~e~~~----~~~~F~i~--~~~dG~waLqs~~ 149 (614)
|+|--| |++.+..+...+..+=|+| +|.++.--..|. .++..++.+ ..|++.|+ +-|---|+|-...
T Consensus 35 siriG~---W~afp~agT~~ADpYskA~~Ar~g~l~lg~aEgl~F~A~~Ds~g~~Lr~~C~Yri~G~~ppARfWTly~a~ 111 (194)
T COG5402 35 SIRIGP---WEAFPEAGTPKADPYSKARRARAGLLALGRAEGLSFTATTDSAGQRLRRECSYRIEGGTPPARFWTLYAAD 111 (194)
T ss_pred ceeecc---cccccccCCccCChhHHHHHhhhheeecccccceeEEEeecCCCchhhccCcEEecCCCCCceeEEEEEec
Confidence 444444 9999887654456888887 788887777884 777777654 46788887 2234667876633
Q ss_pred -cCceeeecCCeEEEE-ecCC--CCCcceEEEEcccceeeec
Q psy15563 150 -RGYFLGASSDKLICT-AKIP--ANPEYWLAHLAARPQLRSC 187 (614)
Q Consensus 150 -~g~yl~~~~d~~~c~-~~~~--~~~~~W~~hlA~Hpqv~~~ 187 (614)
.+.+|.-.+...++. ++++ ++...|+++++-|+|=..|
T Consensus 112 ~d~~~l~p~~~~~~Al~s~~Vlrs~Dg~~~I~vsp~a~pgnW 153 (194)
T COG5402 112 QDLHVLAPGEGRPSALNSRTVLRSADGSWVITVSPDAQPGNW 153 (194)
T ss_pred CCCceecCCCCCchheeceeeeeecCCcEEEEECCCCCCCce
Confidence 447777767766653 3443 4578999999999997666
No 43
>smart00472 MIR Domain in ryanodine and inositol trisphosphate receptors and protein O-mannosyltransferases.
Probab=21.55 E-value=2.3e+02 Score=21.77 Aligned_cols=38 Identities=26% Similarity=0.308 Sum_probs=24.4
Q ss_pred eEEEEeec-CCceEEecCCe---------eEEEecccC-CCCceEEEEE
Q psy15563 100 LSVYLRSH-LGKYLAVDSFG---------NVTCEADEK-EPGSLFQIHV 137 (614)
Q Consensus 100 ~~v~lkSh-~GrYL~~d~dG---------~v~~~~e~~-~~~~~F~i~~ 137 (614)
+.|+||-- .|+||.+...- .|+|..+.. .....|.|+.
T Consensus 8 ~~vrL~H~~tg~yL~s~~~~~~~~~~~q~eVt~~~~~~~~~~~~W~ie~ 56 (57)
T smart00472 8 DVVRLRHVTTGRYLHSHENKLPPWGDGQQEVTGYGNPAGDANTLWLIEP 56 (57)
T ss_pred CEEEEEEhhhCcEeecCCCCCCCCCCCcceEEEECCCCCCCCCcEEEEe
Confidence 47999853 79999874322 477766554 4555666653
No 44
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=21.24 E-value=4.4e+02 Score=30.96 Aligned_cols=88 Identities=16% Similarity=0.260 Sum_probs=44.8
Q ss_pred eEEEEecCCCCceEEEEe-ecCCce------EEecCCeeEEEecccCCCCceEEEEEcCCCeEEEEeeccCceeeecCCe
Q psy15563 88 VWILEPANGGDTLSVYLR-SHLGKY------LAVDSFGNVTCEADEKEPGSLFQIHVSDEGKWAFKNKTRGYFLGASSDK 160 (614)
Q Consensus 88 vW~l~~~~~~~~~~v~lk-Sh~GrY------L~~d~dG~v~~~~e~~~~~~~F~i~~~~dG~waLqs~~~g~yl~~~~d~ 160 (614)
=|+.|.++- +....=+| +.+||| +.+++||+|++=.-.- ....|+-.|-+.+++..-....++=|- ++..
T Consensus 287 gwvVEidp~-~p~stPvKrTalGRFkHE~a~v~v~~~~~vVvYlGDD-er~~~~YKFVsn~~y~~~~~~sn~~Ll-deGt 363 (616)
T COG3211 287 GWVVEIDPN-DPNSTPVKRTALGRFKHEAAAVDVNKDGRVVVYLGDD-ERGEYFYKFVSNGRYNPGASASNRDLL-DEGT 363 (616)
T ss_pred cceeecCCC-CCCccceeeeccccccccceeEEecCCCeEEEEeccC-CCCceEEEeccCCCCCCCccchhhhhh-hcce
Confidence 599999875 33333333 789998 7788999876654321 122233344444544443321111110 1223
Q ss_pred EEEEecCCCCCcceEEEE
Q psy15563 161 LICTAKIPANPEYWLAHL 178 (614)
Q Consensus 161 ~~c~~~~~~~~~~W~~hl 178 (614)
|..-.--..-+..|.+..
T Consensus 364 LyVAkfn~dGtg~W~pl~ 381 (616)
T COG3211 364 LYVAKFNEDGTGNWLPLV 381 (616)
T ss_pred EEEEEeCCCCCcceeeee
Confidence 333232334567887744
Done!